Jun-Yan Zhu (@junyanz) — bluesky.baby

The AI for Content Creation workshop is kicking off today at #CVPR2025 - Grand Ballroom A1 - @magrawala.bsky.social Kai Zhang (Adobe), Charles Herrmann (Google), Mark Boss (Stability AI), Yutong Bai (UC Berkeley), Cherry Zhao (Adobe), Ishan Misra (Meta) and @jonbarron.bsky.social ! See you soon!

12.06.2025 13:45 👍 1 🔁 2 💬 0 📌 0

AI for Content Creation workshop @ #CVPR2025 - Grand Ballroom A1 - 4pm - panel on "Open Source in AI and the Creative Industry" - with @magrawala.bsky.social (Stanford), Cherry Zhao (Adobe), Ishan Misra (Meta) and @jonbarron.bsky.social (Google) - go go!

12.06.2025 18:56 👍 2 🔁 1 💬 0 📌 0

[2/2] Work led by @avalovelace.bsky.social, @kangledeng.bsky.social, Ruixuan Liu, and CMU faculty Changliu Liu and Deva Ramanan. LegoGPT is a small first step towards generative manufacturing of physical objects. Current version is limited to 20x20x20, 21 object categories, and simple brick types.

10.05.2025 03:06 👍 4 🔁 0 💬 0 📌 0

[1/2] We've released the code for LegoGPT. Our autoregressive model generates physically stable and buildable designs from text prompts by integrating physics laws and assembly constraints into LLM training and inference.

Code: github.com/AvaLovelace1...
Website: avalovelace1.github.io/LegoGPT/

10.05.2025 03:06 👍 70 🔁 23 💬 4 📌 2

Reve Image is our first step towards world-class image generation — and you can experience it for free today 🌜
(🔊)

26.03.2025 23:31 👍 6 🔁 4 💬 1 📌 0

AI4CC 2025

The AI for Content Creation workshop #CVPR2025 is accepting paper submissions. ai4cc.net Deadline March 21st 2025 midnight PST. 4 page extended abstracts, 8 pagers, and previously published work (ECCV, NeurIPS, even CVPR)! Many topics 📷📹🎬🎲✒️📃🖼️👗👔🏢 - come spend the day with us!

14.03.2025 16:02 👍 9 🔁 5 💬 1 📌 0

SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization

Can we generate a training dataset of the same object in different contexts for customization? Check out our work SynCD, which uses Objaverse assets and shared attention in text-to-image models for the same.
cs.cmu.edu/~syncd-proje...
w/ Xi Yin, @junyanz.bsky.social, Ishan Misra, and Samaneh Azadi

11.02.2025 18:12 👍 4 🔁 1 💬 0 📌 0

The Illusion of Awareness: Why We See Much Less Than We Think We Do A few years ago, while walking home, I noticed a dry cleaners across the street from my house. “Was that always there?” I thought, surprised. I’d walked by that spot many, many times over the years, b...

One day walking home, I noticed a dry cleaners across the street. “Was that always there?” I thought. A little Googling revealed that it was on my street longer than I have.
Here's a blog post on why we often miss what's right in front of us. #visionscience
aaronhertzmann.com/2024/05/09/i...

30.01.2025 19:21 👍 17 🔁 4 💬 0 📌 0

Excited to bring the 5th CV4Animals Workshop to #CVPR2025

We welcome submissions in 2 tracks:
1) unpublished work up to 4 pages
2) papers published within last 2 years

Submit by Mar 28 & join us with amazing speakers in Nashville:
www.cv4animals.com
🦒🪼🐬🐿️🦩🐢🦘🦜🦥🦋

@cvprconference.bsky.social

01.02.2025 04:24 👍 10 🔁 4 💬 0 📌 3

3D content creation with touch!

We exploit tactile sensing to enhance geometric details for text- and image-to-3D generation.

Check out our #NeurIPS2024 work on Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation: ruihangao.github.io/TactileDream...
1/3

11.12.2024 09:08 👍 13 🔁 6 💬 1 📌 0

Robotics Institute Ph.D. Awarded 2024 Google Fellowship - Robotics Institute Carnegie Mellon University Sheng-Yu Wang, fifth-year Ph.D. at the Carnegie Mellon University Robotics Institute. Sheng-Yu Wang, fifth-year Ph.D. student at the Carnegie Mellon University Robotics Institute, has received a Goo...

Huge congratulations to RI Ph.D. Sheng-Yu Wang for receiving a 2024 Google Fellowship! 🙌

The two year fellowship supports Wang’s work in data attribution for text-to-image models.

Read about his achievement in our news site! www.ri.cmu.edu/robotics-ins...

09.12.2024 14:43 👍 11 🔁 1 💬 0 📌 0

I created a huggingface space for my current work PairCustomization - You can choose from a set of pretrained LoRAs trained with our method, and run inference with our novel style guidance:
huggingface.co/spaces/pairc...

I demo'ed this at #SIGRAPHASIA2024 and it went great! :)
3/3

04.12.2024 22:55 👍 3 🔁 1 💬 0 📌 0

Check out Maxwell et al.'s recent SIGGRAPH Asia paper on model customization with a single image pair. The code is available at github.com/PairCustomiz...

05.12.2024 01:40 👍 9 🔁 1 💬 0 📌 0

Employment - School of Art | Carnegie Mellon University The School of Art seeks professional artists, educators and administrators with an interest in interdisciplinary practice and expanding what it means to be a school in the 21st Century.

#JobAlert! Come join me at Carnegie Mellon's School of Art— we're hiring an open-rank tenure-track professor in "Experimental Animation and Emerging Media Practices"! Deadline is Jan 5: art.cmu.edu/employment/#...

02.12.2024 04:01 👍 61 🔁 26 💬 1 📌 1

Introducing Generative Omnimatte:

A method for decomposing a video into complete layers, including objects and their associated effects (e.g., shadows, reflections).

It enables a wide range of cool applications, such as video stylization, compositions, moment retiming, and object removal.

26.11.2024 15:55 👍 134 🔁 20 💬 3 📌 8

TTIC building. Photo credit, TTIC.

I am recruiting exceptional PhD students & postdocs with an adventurous soul for my 💫new TTIC AI lab💫! We aim to understand intelligence, one pixel at a time, inspired by psychology, neuroscience, language, robotics, and the arts. Apply: www.ttic.edu/studentappli...

sites.google.com/ttic.edu/ope...

12.11.2024 19:28 👍 30 🔁 7 💬 0 📌 0

Jun-Yan Zhu

Latest posts by Jun-Yan Zhu @junyanz