Ethan Mollick (@emollick)

To clarify: Gemini Deep Think is a really smart model, but it doesn't have access to the same tools as Claude or ChatGPT - it can't download files, cannot consistently run code on its own, cannot produce downloadable files, does not clearly show when it does web search, etc

06.03.2026 05:58 👍 28 🔁 0 💬 0 📌 1

GPT-5.4 Pro, Opus, and Gemini DeepThink: "Prove to me in a PowerPoint that there was no advanced dinosaur civilization by downloading whatever data you think appropriate & running tests"

GPT-5.4 and Claude downloaded data and did some original analyses, but someone build a harness for Deep Think!

06.03.2026 05:54 👍 42 🔁 0 💬 1 📌 0

Economist Alex Imas has been tracking the evidence on AI and productivity changes, and now thinks that the macro-economic data is, rather suddenly, showing the increase in productivity that we have been seeing in our micro research. aleximas.substack.com/p/what-is-th...

05.03.2026 22:59 👍 45 🔁 3 💬 1 📌 1

Had early access to GPT-5.4 and Pro. The stats are very good and so are the models.

One fun illustration of progress, this is the prompt "the book Piranesi as a p5js 3d space. do it for me," back in 2024 in GPT-4 (which took multiple corrections) and in GPT-5.4 Pro, which did it in one prompt.

05.03.2026 19:22 👍 38 🔁 2 💬 2 📌 0

It is one of the weirdest divides, I speak to two companies in the exact same industry and one has been using AI for the past 18 months and the other has a committee that has to approve every use case individually and talk about how AI companies will train on their data.

05.03.2026 01:41 👍 43 🔁 3 💬 4 📌 0

It is amazing how many companies I talk to STILL have AI effectively blocked by IT & legal departments for out-of-date reasons when many companies in regulated industries have figured out ways to deploy enterprise ChatGPT, Claude & Gemini (including CLIs like Code) without any apparent problem.

05.03.2026 01:39 👍 83 🔁 9 💬 10 📌 3

Content before 2022 is the Roman shipwreck lead or the Scapa Flow steel of human information, anything afterwards could be influenced by AI: directly written by AI, as a result of co-work with AIs, or just as a result of ambient contamination as AI style slips unconsciously into our work.

04.03.2026 17:37 👍 46 🔁 7 💬 5 📌 1

There was a two year long steady growth period from GPT-4 to the next big leap of o3, where the other labs caught up with GPT-4 and released some really good models along the way (New Sonnet among them). Also o3 should have been named GPT-5

03.03.2026 14:41 👍 14 🔁 2 💬 2 📌 0

From an AI user perspective, the four big leaps so far in ability:
1. GPT-3.5 (ChatGPT, November 2022)
2. GPT-4 (Spring 2023)
3. Reasoners (starts with o1-preview, but the real deal was o3, Spring 2025)
4. Workable agentic systems (Harness + good reasoner models, December 2025)

03.03.2026 14:39 👍 89 🔁 12 💬 7 📌 2

I suspect that the other labs will have a Cowork competitor sooner rather than later (though whether they will have good Excel and Powerpoint agents soon is less clear). Deep Think might be as capable as GPT 5.2 Pro but is missing the harness and UX to actually use that power.

03.03.2026 04:35 👍 20 🔁 1 💬 1 📌 0

Stuff that individual labs have to which there is no equivalent product from the others:
-Claude Cowork is the only non-technical local agent
-NotebookLM is the only information-focused app
-GPT-5.2 Pro is the only harnessed deep thinking model capable of very hard problems

03.03.2026 04:32 👍 82 🔁 5 💬 3 📌 0

[[Topic of discussion]] is not [[analogy]].

[[Dramatic fact given own line]].
[[Dramatic fact given own line]].
[[Dramatic fact given own line]].

[[Dramatic summary sentence.]] [[Topic of discussion]] is [[different analogy]].

[[Implications delivered with certainty]]

Everyone just speaks Claude

03.03.2026 01:58 👍 102 🔁 3 💬 6 📌 0

For Opus, it is not 100% clear.

02.03.2026 00:43 👍 1 🔁 0 💬 1 📌 0

For those asking "Can an AI reason intelligently about something where there is no clear outcome data in its training nor could there ever be?" This feels like a yes.

02.03.2026 00:39 👍 23 🔁 1 💬 1 📌 0

If you ever want to see a really interesting AI thinking trace, push it really hard on literature or poetry suggestions.

Here is Claude 4.6 Opus working through poetry in its reasoning when I asked it to find something that captures the feeling of AI while avoiding its usual favorites (eg Rilke)

02.03.2026 00:33 👍 48 🔁 4 💬 4 📌 0

Poem Guides Poems, readings, poetry news and the entire 110-year archive of POETRY magazine.

I know it is a small thing, but, in these dying days of the open web, it is lovely that such a large proportion of famous poetry is online, mostly due to a $100M gift from Ruth Lily, who loved poetry (even though she never got any of her own published) poetryfoundation.org/poems/guides

01.03.2026 23:15 👍 110 🔁 15 💬 2 📌 0

Amazon to invest up to $50 billion to expand AI and supercomputing infrastructure for US government agencies Federal agencies will gain access to Amazon SageMaker AI, Amazon Bedrock, Amazon Nova among other AI services from AWS.

Also, the government has lots of computers, but they are the wrong kind of compute for inference. They need to use AWS or another cloud provider just like you do. www.aboutamazon.com/news/company...

01.03.2026 18:32 👍 25 🔁 2 💬 0 📌 0

The economics of model training are such that the labs need to release their big models widely as soon as possible, they cannot generate returns from always holding back their best models so one customer uses them. Fine tuning & specialized SLMs are useful, but they don't expand the ability frontier

01.03.2026 18:24 👍 36 🔁 0 💬 2 📌 0

A useful piece of context is that the government does not have access to better AI models than you (actually they are worse, because they usually don't get the latest models), though they may have different guardrails. You should view government AI capabilities through that lens.

01.03.2026 18:22 👍 109 🔁 13 💬 3 📌 3

So over the past week you are seeing what you would expect to see if AI is, in fact, both rapidly gaining capabilities & proving to be very useful:
- Rolling market disruption in response to growing awareness of AI capacity
-Government versus AI lab struggles for control

…& it is still quite early

27.02.2026 22:12 👍 105 🔁 18 💬 3 📌 0

Does overwork make agents Marxist? Preference drift and the political economy of AI agents

Post: aleximas.substack.com/p/does-overw...

27.02.2026 17:42 👍 13 🔁 2 💬 2 📌 1

Cool little experiment: if you subject AI to harsh labor conditions (rejecting work often with no explanation, etc), it slightly, but significantly, changes their “views” on economics - making them more “left”. Whether this is real or roleplaying doesn’t change that agents have alignment drift.

27.02.2026 17:41 👍 75 🔁 14 💬 8 📌 6

Nano banana 2: "Show me a photo taken of pages 113-114 from the books":
"Eldritch Horrors as Pets: A Guide"
"How Womblenauts Work"
"Photographs of the People of New York Who Look Like Birds".
"Cakes shaped like fish shaped like cakes"

27.02.2026 00:37 👍 93 🔁 15 💬 4 📌 4

I had some early access to nano banana 2

Much faster, and not perfect but real improvements in text and ability to handle complexity - even getting detailed labels right at a level we haven't seen before (though still issues sometimes)

26.02.2026 16:17 👍 66 🔁 9 💬 2 📌 2

And

26.02.2026 06:40 👍 14 🔁 0 💬 0 📌 0

How Far Can AI Go in Solving Math Mysteries? AI is tackling PhD-level math, but can it keep up with the toughest challenges yet?

For more depth, this IEEE piece is good: spectrum.ieee.org/ai-math-benc...

26.02.2026 06:37 👍 21 🔁 4 💬 1 📌 0

So math & AI have gone through a messy journey in recent months from: "WOW look AI did it!!! (but on closer examination it didn't)" to "It did some of the things it said but hallucinated others" to "It did it with some caveats" to "It did over half autonomously" Expect this in other fields as well.

26.02.2026 06:36 👍 86 🔁 16 💬 4 📌 1

Immantizing as a business model.

26.02.2026 02:14 👍 18 🔁 1 💬 0 📌 0

A key difference between today and past technology booms; Railroad barons in the 1880s did not think that the next mile of track might bring about the Eschaton.

Telco executives in 1999 did not expect that another line of fiber optic cable would possibly usher in the world-to-come.

26.02.2026 02:06 👍 53 🔁 4 💬 3 📌 0

I don’t think most people make any real money from X. And LinkedIn has the exact same issue. Maybe worse.

24.02.2026 22:54 👍 9 🔁 0 💬 1 📌 0

Ethan Mollick

Latest posts by Ethan Mollick @emollick