TOMORROW'S APE
State of AI in the Real World: We are Deep in Demo Mode ⏯

Includes: Our Final Part Deep Dive into the AI Art Revolution

September 21, 2023

Images in this Newsletter made with ideogram.ai

It’s often been said this year, but it has truly been a big week for AI! So here’s what we’ve got:

Fallen Fruit - A Summary of Announcements, Releases and Latest Findings.
The Conclusion to Our Deep Dive into the AI Art Revolution
Swinging from the High Branches - What are the Smart People Saying this Week?
100 years of Star Wars
Retro Rewind Movie featuring AI

Fallen Fruit

Announcements, Releases & Latest Findings

📦Windows AI Copilot

The new Windows platform announced at the Microsoft Surface and AI event features Copilot, which essentially adds AI integration throughout.

It works like a personal assistant across all the Windows Apps.

🧰YouTube

It looks like you’ll be able to create content using new AI tools on YouTube, such as:

AI-generated backgrounds

Plus editing, dubbing and AI-assisted music search

📢Neuralink

Neuralink announces its first in-human clinical trials.

They have asked people with quadriplegia due to cervical spinal cord injury or amyotrophic lateral sclerosis (ALS) to register.

It’s part of Neuralink’s first-phase mission of helping people with various brain and nervous system injuries and conditions.

📦Google Bard Extensions

Connect to Google Apps and Services.

Here’s a how-to article on the new features.

Bard Extensions enables Bard to assist you with things like trip planning, Docs and drafting a marketplace listing.

Bard can find and show you relevant information from the Google tools you use every day — like Gmail, Docs, Drive, Google Maps, YouTube, and Google Flights and hotels — even when the information you need is across multiple apps and services.

📦DALL-E 3

New release from OpenAI that puts the DALL-E image maker right inside ChatGPT.

It’s image generation with a brain!

This integration allows for more nuanced prompting with less ‘prompt engineering’, and it even understands how to keep a character consistent through various scenes and styles.

Essentially, it sets a new standard in generative art above other standalone diffusion models and marks the beginning of the age of multimodal AI.

Multimodal refers to an AI that combines multiple types, or modes, of data to create more accurate determinations, draw insightful conclusions or make more precise predictions about real-world problems.

Looking forward to seeing its impact on the fledgling AI art community.

📊Harvard Business School & Boston Consulting Group Study

Productivity in the workplace with ChatGPT

Study subjects who used ChatGPT reportedly completed their work faster than those who didn’t.

☢️ Nuclear Fusion Marching Forward

Keep an eye on Nuclear Fusion.

There has been a lot of movement around the world since scientists repeated the magic trick. Many countries are making moves to put scientific teams and infrastructure in place so they can be at the cutting edge of this now potentially viable form of energy.

The Imitation Game Part 3 :🙈 👀 :

OpenAI have just ushered in the new age of multimodal AI

With this week’s announcement that DALL-E 3 will now sit inside ChatGPT, we are going to see a new level of nuance and accuracy that will likely remove the need for hefty ‘prompt engineering’ and the clunkiness of throwing words at the model and hoping it gets what you mean. ChatGPT can perhaps contribute some reasoning to assist with the interpretive process.

This multimodal approach marks yet another paradigm shift in AI-human collaboration and in the depth of creative work that can be achieved.

It will be interesting to see these capabilities when they are fully rolled out in the next couple of weeks.

Artists without a worldwide reputation and tonnes of sales behind them can now employ assistants.

It is clear that artists and creatives can benefit greatly from these instant, iterative capabilities. Everyone now effectively has a studio full of assistants to help them not only produce their work at an exponential rate, but also develop it further with their own personal data input.

Won’t all art just be created by AIs?

Maybe. We are in the infancy of this new tech and are already seeing evolutionary steps that further enhance the collaborative process, so there is no doubt it will have widespread appeal and usage.

So what’s next?

The way we create imagery is clearly evolving.

The speed, iterations and interactivity we get from a skilful, image-forming assistant at our command will surely provide a richer experience of the creative process and open up amazing new possibilities.

Artists who utilise and maybe even train several models will have superpowers unseen by previous generations. From hand painting to paintbrush to camera to computer: you get it!

As far as the scorecard goes, well, it is already changing with the advent of multimodal AI, and who knows how it will shape up.

Maybe we will essentially have equal partners in the application of creative ideas, with only the perspective, authenticity and direction given by us, the human part of the equation.

And when AIs do this too and it’s no longer a collaboration? Well, you can still draw a squiggle on some note paper and amuse yourself...

C’mon, the world will be yearning for that squiggle by then! What have you got to say?

Swinging from the High Branches

What are the Smart People Saying this Week?

🎙This Week in Startups: Episode with Modular CEO Chris Lattner

Summary: Deploying AI into the real world is really hard and overly complicated.

Even AI’s current full capacity won’t be realised until this is solved.

Much of the AI model architecture has been built on specific hardware and/or by teams of researchers.

It’s not production-ready, and therefore not easy for companies to integrate and deploy into their own products.

🎤All-In Summit: Bill Gurley, 2,851 Miles

If you missed this last week, it’s a must-watch.

Gurley beautifully illustrates the industry capture dynamics of Washington.

It is a cautionary tale in the face of imminent AI regulation.

𝕏: Jim Fan

“I think DALL·E 3 is not just a stance against MidJourney. It's actually a sneak peek of the upcoming, epic battle of massively multimodal LLMs, against DeepMind Gemini.

Quote: "DALL·E 3 is built natively on ChatGPT". This is the key phrase.

DALL·E 3's extraordinary language alignment is built on a solid textual GPT foundation. MidJourney doesn't really have much "reasoning brain", which is why so much prompt hacking is needed.”

Brain first, pixel second -> that's the way to build strong multimodal AI.

🎙Upstream with Erik Torenberg: Episode with Jon Askonas

Summary: Medieval people were more prepared for AI than we are?!🧛🏻‍♂️🧚🏻🧜‍♀️🧞‍♂️🧙🏻‍♀️

Since the Enlightenment we have been ‘reason-centric’ in our thinking and have seen ourselves as being at the top of the intelligence totem pole.

Whereas in medieval times people were used to the idea of vampires, demons, witches and angels, all of which were craftier than humans.

Technology Review: Mustafa Suleyman

Summary:

The Three Waves of AI Evolution

1. First Wave - Classification

2. Second Wave - Generative Phase (Now)

3. Third Wave - Interactive Wave

Explained:

“The first wave of AI was about classification. Deep learning showed that we can train a computer to classify various types of input data: images, video, audio, language. Now we’re in the generative wave, where you take that input data and produce new data.

The third wave will be the interactive phase. That’s why I’ve bet for a long time that conversation is the future interface. You know, instead of just clicking on buttons and typing, you’re going to talk to your AI.

And these AIs will be able to take actions. You will just give it a general, high-level goal and it will use all the tools it has to act on that. They’ll talk to other people, talk to other AIs. This is what we’re going to do with Pi.

That’s a huge shift in what technology can do. It’s a very, very profound moment in the history of technology that I think many people underestimate. Technology today is static. It does, roughly speaking, what you tell it to do.”

100 Years of Star Wars

Pika Labs & @douggypledger

RETRO MOVIE REWIND: Virtuosity (1995) Russell Crowe

When a virtual reality simulation created using the personalities of multiple serial killers manages to escape into the real world, an ex-cop is tasked with stopping its reign of terror.

Link to IMDB here

Here’s a little taste of Next Week’s Issue:

Runaway RunwayML?

We’ll take a look at the AI Motion Landscape, who’s crushing it and what’s on the horizon.

The future of Prompt Engineering and AI Wrapping?

Are multimodal AI and enterprise integration killing off the first order of human AI resources?