Alex Chen

@chenalexander

Creative Director at Google Creative Lab. Exploring multimodal AI. Opinions are my own. https://chenalexander.com/

13
Followers
7
Following
29
Posts
09.01.2025
Joined

Latest posts by Alex Chen @chenalexander

Post image

How it works: I prompt Gemini with the first frame of the video to give me (x, y) coordinates with descriptions. Then I use optical flow to track those positions over time. Of course optical flow has limitations, but it was a fast, lightweight way to prototype the idea.

30.01.2026 21:47 👍 1 🔁 0 💬 0 📌 0
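
A minimal sketch of the tracking step described above, under stated assumptions. The post uses optical flow from opencv.js; to stay self-contained, this sketch swaps in plain block matching (sum of absolute differences over a small patch), which is a simpler stand-in and not the author's implementation. The names `GrayFrame`, `LabeledPoint`, `patchCost`, and `trackPoint` are hypothetical, and the labeled point is assumed to have already come back from the model for the first frame.

```typescript
// A grayscale frame stored row by row, one brightness value per pixel.
interface GrayFrame {
  width: number;
  height: number;
  pixels: number[]; // length = width * height
}

// A point as described in the post: coordinates plus a description.
interface LabeledPoint {
  x: number;
  y: number;
  label: string;
}

// Read a pixel, clamping coordinates so patches near the border stay valid.
function pixelAt(frame: GrayFrame, x: number, y: number): number {
  const cx = Math.min(frame.width - 1, Math.max(0, x));
  const cy = Math.min(frame.height - 1, Math.max(0, y));
  return frame.pixels[cy * frame.width + cx] ?? 0;
}

// Sum of absolute differences between the patch around (x, y) in `prev`
// and the patch around (x + dx, y + dy) in `next`. Lower means more similar.
function patchCost(
  prev: GrayFrame, next: GrayFrame,
  x: number, y: number, dx: number, dy: number, radius: number
): number {
  let cost = 0;
  for (let oy = -radius; oy <= radius; oy++) {
    for (let ox = -radius; ox <= radius; ox++) {
      const before = pixelAt(prev, x + ox, y + oy);
      const after = pixelAt(next, x + dx + ox, y + dy + oy);
      cost += Math.abs(before - after);
    }
  }
  return cost;
}

// Move one labeled point from `prev` to `next` by trying every shift within
// `search` pixels and keeping the shift whose patch matches best.
function trackPoint(
  prev: GrayFrame, next: GrayFrame, point: LabeledPoint,
  radius: number = 2, search: number = 4
): LabeledPoint {
  let best = { dx: 0, dy: 0, cost: patchCost(prev, next, point.x, point.y, 0, 0, radius) };
  for (let dy = -search; dy <= search; dy++) {
    for (let dx = -search; dx <= search; dx++) {
      const cost = patchCost(prev, next, point.x, point.y, dx, dy, radius);
      if (cost < best.cost) {
        best = { dx, dy, cost };
      }
    }
  }
  // The description travels with the point; only the position changes.
  return { label: point.label, x: point.x + best.dx, y: point.y + best.dy };
}
```

Calling `trackPoint` once per labeled point for each consecutive pair of frames keeps every description attached to its moving position. Like optical flow, this approach loses track when a point is occluded or moves farther than the search window.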
Post image

I made it with Gemini spatial intelligence + opencv.js in Google AI Studio. Link here: ai.studio/apps/drive/1... It's been fun testing on old videos from my photo library as Gemini is able to uncover new details.

30.01.2026 21:47 👍 0 🔁 0 💬 1 📌 0
Video thumbnail

Visualizing Gemini intelligence with things in my life 🔊🧵

30.01.2026 21:47 👍 0 🔁 0 💬 1 📌 0
Video thumbnail

Gemini spatial understanding + opencv.js 🔊

29.01.2026 16:34 👍 1 🔁 0 💬 0 📌 0

Try it here WIP link ai.studio/apps/drive/1...

22.01.2026 22:09 👍 0 🔁 0 💬 0 📌 0

I really like this labeled format. Feels like a napkin sketch stream of consciousness where I can follow what Gemini's thinking as it draws.

22.01.2026 22:00 👍 2 🔁 0 💬 1 📌 0
Video thumbnail

Testing streaming vector shapes with Gemini 3 Flash. Really fast. Realtime screencap ⚡🔊

22.01.2026 22:00 👍 0 🔁 0 💬 1 📌 0

Yes, the right metaphor is hard because a single bit of text can represent so much. For my own prototypes, I am using text to represent the feature (I want it to do [x]), design (exact Tone.js sounds, hex colors, etc.), and documentation (how it was implemented).

21.01.2026 15:30 👍 1 🔁 0 💬 0 📌 0
Post image

I always ask Gemini to discuss ideas with me before coding, so I can craft the details I really care about - the sine wave, wobbly-ness, the amplitude ... 🥁

20.01.2026 22:24 👍 0 🔁 0 💬 0 📌 0
Video thumbnail

Video has sound 🔊

20.01.2026 22:24 👍 1 🔁 0 💬 1 📌 0
Post image Post image

Sketch ➡️ animation ✏️ Multimodal prompting has been really powerful for this drum machine prototype ... 🧵

20.01.2026 22:24 👍 0 🔁 0 💬 1 📌 0

As a result I have a growing collection of docs that represent my favorite sounds, colors, and other random interaction design choices. It feels like documentation (or source code?) for my own universe of personal apps that can be built at any time on the fly.

20.01.2026 18:22 👍 0 🔁 0 💬 1 📌 0

I provide it without the original source code, as I think it helps the model flexibly adapt it into my new code base with "fresh eyes," unbiased by the original code.

20.01.2026 18:22 👍 0 🔁 0 💬 1 📌 0

I've been saying things like "Write a concise spec for [x] ..." to port features between apps. Gemini gives me a readable spec (which I enjoy reading for my own understanding of the code), which I use to prompt a re-implementation in my new app.

20.01.2026 18:22 👍 1 🔁 0 💬 1 📌 0
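
A small sketch of the two-prompt porting flow from this thread, under stated assumptions. Only the opening words "Write a concise spec for" come from the post; the rest of each prompt's wording, and the helper names `buildSpecPrompt` and `buildPortPrompt`, are hypothetical. The second prompt carries only the spec text, never the original source, to match the "fresh eyes" idea.

```typescript
// Step 1 (hypothetical helper): ask the model, inside the ORIGINAL app,
// for a readable spec of a single feature.
function buildSpecPrompt(feature: string): string {
  return [
    `Write a concise spec for ${feature}.`,
    // Assumed wording: the three things the thread says text can capture.
    "Cover what the feature does, the exact design details (sounds, colors),",
    "and how it was implemented.",
  ].join(" ");
}

// Step 2 (hypothetical helper): ask the model, inside the NEW app, to
// re-implement the feature. Only the spec is passed along, not the old code.
function buildPortPrompt(spec: string, targetApp: string): string {
  return [
    `Re-implement the feature described in this spec in ${targetApp}.`,
    "Adapt it to the existing code base.",
    "",
    "Spec:",
    spec.trim(),
  ].join("\n");
}
```

The spec text returned for the first prompt becomes the `spec` argument of the second, so the only thing that crosses between apps is plain language.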
Post image

Porting features with language ... 🧵

20.01.2026 18:22 👍 0 🔁 0 💬 1 📌 0
Post image

Vibe coding an interface inspired by this fun 1960s drum machine with Gemini.

20.01.2026 15:07 👍 6 🔁 1 💬 0 📌 0

If you make something, feel free to share the song link here. Code is open-source, built with Tone.js. Forkable here on Google AI Studio ➡️ ai.studio/apps/drive/1...

16.01.2026 21:22 👍 0 🔁 0 💬 0 📌 0
Video thumbnail

Polyrhythms are especially easy and fun to make. Song link: alexanderchen.github.io/typeloop/?so...

16.01.2026 21:22 👍 5 🔁 2 💬 1 📌 0
Post image Post image

Every song is represented as a string of text, so you can share it by just copy-pasting or making a link. Here are all the sounds you can try. 🥁

16.01.2026 21:22 👍 1 🔁 0 💬 1 📌 0
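
A minimal sketch of the "song as a string, shareable as a link" idea, under stated assumptions. The base URL, the query parameter name `pattern`, and the helper names are all hypothetical placeholders; the real app's link format is truncated in the posts and is not reproduced here. The song text in the usage note is also made up for illustration.

```typescript
// Hypothetical values, chosen only for illustration.
const BASE_URL = "https://example.com/player/";
const SONG_PARAM = "pattern";

// Turn the song text into a link by percent-encoding it into the query string.
function songToLink(song: string): string {
  return `${BASE_URL}?${SONG_PARAM}=${encodeURIComponent(song)}`;
}

// Recover the song text from a link, or null if the parameter is absent.
function linkToSong(link: string): string | null {
  const queryStart = link.indexOf("?");
  if (queryStart === -1) {
    return null;
  }
  const pairs = link.slice(queryStart + 1).split("&");
  for (const pair of pairs) {
    const eq = pair.indexOf("=");
    if (eq === -1) {
      continue;
    }
    const key = pair.slice(0, eq);
    const value = pair.slice(eq + 1);
    if (key === SONG_PARAM) {
      return decodeURIComponent(value);
    }
  }
  return null;
}
```

For example, `linkToSong(songToLink("kick . snare ."))` gives back the original text, so a pasted link and a pasted string carry the same song.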
Video thumbnail

Type Loop 🎵⌨️ Create and share music by typing! Play here: alexanderchen.github.io/typeloop/ 🔊 Open-source. Built w/ Gemini. 🧵 #genuary #genuary10

16.01.2026 21:22 👍 5 🔁 1 💬 1 📌 2

Drawing + code. ✏️ Love these experiments our Creative Lab collaborator @szymonkaliski.com is doing with Gemini 3.

13.01.2026 15:09 👍 3 🔁 0 💬 0 📌 0

You can learn so much by just moving a shape around and bringing it to life. I wanted to make it really easy to discover the joy of animation. Easing, speed, scale, squash, sound. I also found the curve editor became a really neat way to simply visualize time. ⏱️

07.01.2026 22:05 👍 0 🔁 0 💬 0 📌 0
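
A short sketch of what easing means in code, under stated assumptions. It uses the standard ease-in-out cubic curve as one example; the post does not say which curves the app offers, and the function names here are hypothetical. `sampleCurve` produces the kind of time-versus-progress points a curve editor could plot.

```typescript
// Ease-in-out cubic: slow start, fast middle, slow end.
// Input is normalized time in [0, 1]; output is progress in [0, 1].
function easeInOutCubic(t: number): number {
  const c = Math.min(1, Math.max(0, t)); // clamp out-of-range times
  return c < 0.5 ? 4 * c * c * c : 1 - Math.pow(-2 * c + 2, 3) / 2;
}

// Position of a dot moving from `from` to `to` over `durationMs`,
// with the easing curve shaping how progress depends on elapsed time.
function dotPosition(from: number, to: number, elapsedMs: number, durationMs: number): number {
  const progress = easeInOutCubic(elapsedMs / durationMs);
  return from + (to - from) * progress;
}

// Sample the curve at evenly spaced times (at least two samples).
function sampleCurve(samples: number): Array<{ t: number; value: number }> {
  const count = Math.max(2, Math.floor(samples));
  const points: Array<{ t: number; value: number }> = [];
  for (let i = 0; i < count; i++) {
    const t = i / (count - 1);
    points.push({ t, value: easeInOutCubic(t) });
  }
  return points;
}
```

Swapping `easeInOutCubic` for a different curve changes the feel of the motion without touching the start, end, or duration.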

Link + code here: ai.studio/apps/drive/1...

07.01.2026 22:05 👍 0 🔁 0 💬 1 📌 0
Video thumbnail

Dot Motion 🔴🎵 A simple way to explore motion, time, and sound. Code is open-source. Video has 🔊 Link in 🧵 #genuary #genuary2

07.01.2026 22:05 👍 2 🔁 0 💬 1 📌 0
Post image

You can click "+" to add more agents. (Fun to watch the parallel conversations all unfold)

07.01.2026 16:52 👍 0 🔁 0 💬 0 📌 0

Not a new idea, lots of great research in the space of cooperative agents in past years. But the speed and low cost of Gemini 3 Flash and vibe coding environments like AI Studio make experimenting in this space so much more accessible.

07.01.2026 16:50 👍 0 🔁 0 💬 1 📌 0
Post image

Each agent remembers past conversations and uses that memory to decide on a goal.

07.01.2026 16:50 👍 0 🔁 0 💬 1 📌 0
Post image

Information is passed through back-and-forth conversations like this one.

07.01.2026 16:50 👍 0 🔁 0 💬 1 📌 0
Video thumbnail

TinyTown 🙂🍄 Lightweight multi-agent social simulation built w/ Gemini 3 Flash. Agents look for 🍄. When one finds it, it tells coordinates to others. Not sure where I'm taking this prototype next (open to ideas!) Code is open-source ➡️ ai.studio/apps/drive/1...

07.01.2026 16:50 👍 4 🔁 0 💬 1 📌 0
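
A minimal sketch of the search-and-tell loop described in this thread, under stated assumptions. In the prototype the agents talk through Gemini conversations; here that is replaced by a plain data message (a "tip" holding coordinates), so this shows only the information flow, not the author's implementation. All names (`TownAgent`, `Tip`, `runTown`, and so on) are hypothetical, and the seed is assumed to be a non-negative integer.

```typescript
// A remembered message: who said it and which coordinates they shared.
interface Tip { from: string; x: number; y: number; }

interface TownAgent {
  name: string;
  x: number;
  y: number;
  memory: Tip[]; // what this agent has been told so far
}

// Small deterministic pseudo-random generator so runs are repeatable.
function makeRandom(seed: number): () => number {
  let state = seed % 4294967296;
  return () => {
    state = (state * 1664525 + 1013904223) % 4294967296;
    return state / 4294967296;
  };
}

function stepToward(from: number, to: number): number {
  if (to > from) return from + 1;
  if (to < from) return from - 1;
  return from;
}

function clampToTown(value: number, size: number): number {
  return Math.min(size - 1, Math.max(0, value));
}

// Memory decides the goal: follow the latest tip if there is one,
// otherwise wander one cell in a random direction.
function stepAgent(agent: TownAgent, size: number, random: () => number): void {
  const tip = agent.memory[agent.memory.length - 1];
  if (tip) {
    agent.x = stepToward(agent.x, tip.x);
    agent.y = stepToward(agent.y, tip.y);
  } else {
    agent.x = clampToTown(agent.x + Math.floor(random() * 3) - 1, size);
    agent.y = clampToTown(agent.y + Math.floor(random() * 3) - 1, size);
  }
}

// Run until every agent stands on the target. Returns the number of steps
// taken, or -1 if `maxSteps` passes first.
function runTown(
  agents: TownAgent[], target: { x: number; y: number },
  size: number, maxSteps: number, seed: number
): number {
  const random = makeRandom(seed);
  for (let step = 1; step <= maxSteps; step++) {
    for (const agent of agents) {
      stepAgent(agent, size, random);
      if (agent.x === target.x && agent.y === target.y) {
        // The finder shares the coordinates with everyone who has no tip yet
        // (itself included, so it remembers the spot and stays there).
        for (const listener of agents) {
          if (listener.memory.length === 0) {
            listener.memory.push({ from: agent.name, x: target.x, y: target.y });
          }
        }
      }
    }
    if (agents.every((a) => a.x === target.x && a.y === target.y)) {
      return step;
    }
  }
  return -1;
}
```

Before anyone finds the target, agents wander independently; after the first find, one broadcast is enough for the whole town to converge. Replacing the `Tip` push with a model-generated conversation is where a language model would slot in.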