How it works: I prompt Gemini with the first frame of video to give me (x, y) coordinates with descriptions. Then I use optical flow to track those positions over time. Of course optical flow has limitations, but it was a fast lightweight way to prototype the idea quickly.
30.01.2026 21:47
π 1
π 0
π¬ 0
π 0
I made it with Gemini spatial intelligence + opencv.js in Google AI Studio. Link here: ai.studio/apps/drive/1... It's been fun testing on old videos from my photo library as Gemini is able to uncover new details.
30.01.2026 21:47
π 0
π 0
π¬ 1
π 0
Visualizing Gemini intelligence with things in my life ππ§΅
30.01.2026 21:47
π 0
π 0
π¬ 1
π 0
Gemini spatial understanding + opencv.js π
29.01.2026 16:34
π 1
π 0
π¬ 0
π 0
Try it here WIP link ai.studio/apps/drive/1...
22.01.2026 22:09
π 0
π 0
π¬ 0
π 0
I really like this labeled format. Feels like a napkin sketch stream of consciousness where I can follow what Gemini's thinking as it draws.
22.01.2026 22:00
π 2
π 0
π¬ 1
π 0
Testing streaming vector shapes with Gemini 3 Flash. Really fast. Realtime screencapβ‘π
22.01.2026 22:00
π 0
π 0
π¬ 1
π 0
Yes, right metaphor is hard because a single bit of text can represent so much. For my own prototypes, I am using text to represent the feature (I want it to do [x]), design (exact tone.js sounds, hex colors, etc), documentation (how it was implemented).
21.01.2026 15:30
π 1
π 0
π¬ 0
π 0
I always ask Gemini to discuss ideas with me before coding, so I can craft the details I really care about - the sine wave, wobbly-ness, the amplitude ... π₯
20.01.2026 22:24
π 0
π 0
π¬ 0
π 0
Video has sound π
20.01.2026 22:24
π 1
π 0
π¬ 1
π 0
Sketch β‘οΈ animation βοΈ Multimodal prompting has been really powerful for this drum machine prototype ... π§΅
20.01.2026 22:24
π 0
π 0
π¬ 1
π 0
As a result I have a growing collection of docs that represent my favorite sounds, colors, and other random interaction design choices. It feels like documentation (or source code?) for my own universe of personal apps that can be built at any time on the fly.
20.01.2026 18:22
π 0
π 0
π¬ 1
π 0
I provide it without the original source code, as I think it helps model flexibly adapt it into my new code base with "fresh eyes" unbiased by the original code.
20.01.2026 18:22
π 0
π 0
π¬ 1
π 0
I've been saying things like"Write a concise spec for [x] ..." to port features between apps. Gemini gives me a readable spec (which I enjoy reading for my own understanding of the code), which I use to prompt a re-implementation in my new app.
20.01.2026 18:22
π 1
π 0
π¬ 1
π 0
Porting features with language ... π§΅
20.01.2026 18:22
π 0
π 0
π¬ 1
π 0
Vibe coding an interface inspired by this fun 1960s drum machine with Gemini.
20.01.2026 15:07
π 6
π 1
π¬ 0
π 0
If you make something, feel free to share the song link here. Code is open-source, built with Tone.js. Forkable here on Google AI Studio β‘οΈ ai.studio/apps/drive/1...
16.01.2026 21:22
π 0
π 0
π¬ 0
π 0
Polyrhythms are especially easy and fun to make. Song link: alexanderchen.github.io/typeloop/?so...
16.01.2026 21:22
π 5
π 2
π¬ 1
π 0
Every song is represented as a string of text, so you can share it by just copy-pasting or making a link. Here are all the sounds you can try. π₯
16.01.2026 21:22
π 1
π 0
π¬ 1
π 0
Type Loop π΅β¨οΈ Create and share music by typing! Play here: alexanderchen.github.io/typeloop/ π Open-source. Built w/ Gemini. π§΅ #genuary #genuary10
16.01.2026 21:22
π 5
π 1
π¬ 1
π 2
Drawing + code. βοΈ Love these experiments our Creative Lab collaborator @szymonkaliski.com is doing with Gemini 3.
13.01.2026 15:09
π 3
π 0
π¬ 0
π 0
You can learn so much by just moving a shape around and bringing it to life. I wanted to make it really easy to discover the joy of animation. Easing, speed, scale, squash, sound. I also found the curve editor became a really neat way to simply visualize time. β±οΈ
07.01.2026 22:05
π 0
π 0
π¬ 0
π 0
Link + code here: ai.studio/apps/drive/1...
07.01.2026 22:05
π 0
π 0
π¬ 1
π 0
Dot Motion π΄π΅ A simple way to explore motion, time, and sound. Code is open-source. Video has π Link in π§΅ #genuary #genuary2
07.01.2026 22:05
π 2
π 0
π¬ 1
π 0
You can click "+" to add more agents. (Fun to watch the parallel conversations all unfold)
07.01.2026 16:52
π 0
π 0
π¬ 0
π 0
Not a new idea, lots of great research in the space of cooperative agents in past years. But the speed and low cost of Gemini 3 Flash and vibe coding environments like AI Studio make experimenting in this space so much more accessible.
07.01.2026 16:50
π 0
π 0
π¬ 1
π 0
Each agent remembers past conversations and uses memory to decide on goal.
07.01.2026 16:50
π 0
π 0
π¬ 1
π 0
Information is passed through back-and-forth conversations like this one.
07.01.2026 16:50
π 0
π 0
π¬ 1
π 0
TinyTown ππ Lightweight multi-agent social simulation built w/ Gemini 3 Flash. Agents look for π. When one finds it, it tells coordinates to others. Not sure where I'm taking this prototype next (open to ideas!) Code is open-source β‘οΈ ai.studio/apps/drive/1...
07.01.2026 16:50
π 4
π 0
π¬ 1
π 0