β
End to end generation of expressive performance *audio* from score *images*!
An important step towards seamless interaction with computer music systems and a fun collaboration between Dasaemβs group at Sogang University and my group at CMU
@chrisdonahue.com
Research in generative AI for **human** creativity in music + more. Assistant professor at CMU CSD, leading the πΌ G-CLef lab. Part time research scientist at Google DeepMind on the Magenta team (views my own)
β
End to end generation of expressive performance *audio* from score *images*!
An important step towards seamless interaction with computer music systems and a fun collaboration between Dasaemβs group at Sogang University and my group at CMU
At #CHI2025 in Yokohama this week πΈ. My first CHI, excited to finally get to attend! Happy to chat with anyone about human AI interaction for music or programming
Congrats Kaitlyn and Cornell!!
Also βrelative inefficiency of input-space models starts to be economically preferable over the increased engineering complexity of latent-space modelsβ
I wonder about this! If latents shift the scaling laws for generative modeling by an order of magnitude or more, hard to imagine this going away
Incredible post. I still donβt have a clear mental model for the need for *both* perceptual and adversarial losses. Seems like they both encourage preservation of certain higher frequency material. Is using both just a hack that works or is there some more fundamental explanation?
Remarkably thorough and crisp as usual. Probably the single best resource for understanding the latents behind generative modeling that power modern gen AI
Sander shh π€« youβre giving away all of the good research ideas!!
I have acquired a Disklavier and Piano Genie has been resurrected :)
@pcastr.bsky.social Disklavier jam session over the internet soon?
Thrilled to share that my *incoming* PhD student Yewon Kimβs work on multimodal inspiration in music AI has been recognized with a Best Paper Award at #CHI2025 π
Yewon really knocked it out of the park here. Can't wait to see what she does for her PhD!
arxiv.org/abs/2412.18940
Inaugurating new acct to share work from my PhD student!
Wayne et al have been running a live eval platform Copilot Arena - a VSCode extension serving code completions from AI systems to real developers. See π§΅ for findings and preprint
Excited to be evaluating human-AI *workflows* holistically!