AI may be reshaping not just the economy, but the political assumptions built around labor. In this blog post, I explore what this could mean for the future landscape of political economy: mengyeren.substack.com/p/politics-a...
AI may be reshaping not just the economy, but the political assumptions built around labor. In this blog post, I explore what this could mean for the future landscape of political economy: mengyeren.substack.com/p/politics-a...
Sharing my thoughts on Moltbook in a recent interview by The Independent.
I have updated my tutorial on making Vision Language Action models. This tutorial starts with a basic Transformer and walks people through the steps to transform it into a full VLA that uses PaliGemma as the pretrained VLM. Links below.
Corporate PRs are becoming a disservice to science. We see amazing things with no idea how they were done. It's just a way to grab smart people and pump equity, while discouraging junior students by making them think there's nothing left to be done in research.
Check out our Midway Network paper on learning hierarchical latent motion tokens from watching videos. Recently got accepted to ICLR 2026!
Verifiers are increasingly being used today in RL to provide rewards. We did a systematic study on when it is the best to use LLMs to verify solutions.
Our latest research Midway Networks learn recognition and motion representations from scratch by letting the network learn from watching videos. The latent motion vectors are refined in a top-down hierarchy. Interesting tracking results using our forward perturbation viz.
Very soon we will see a rekindled interest in an AI that learns from an individual experience, defining a subjective sense of what is truly new and creative. We will start wondering: what if an AI has only learned handwritten digits in its lifetime? 2/2
We are so comfortable with the concept of pretraining in foundation models today that we assume an AI is supposed to have seen everything humanity has created. 1/2
Excited to share our new research on local RL without backprop!
Lab gathering at #NeurIPS2025. Proud of this yearβs work and excited about the ideas weβre building toward next!
I will be at NeurIPS next week. Let's connect if you are also interested in continual learning AI!
Be part of #NeurIPS2025 in Mexico City! Submit your proposals:
π
Deadlines
Tutorials: Sept 26 π neurips.cc/Conferences/...
Workshops: Sept 30 π neurips.cc/Conferences/...
Socials: Oct 1 π neurips.cc/Conferences/...
Startup Pitch: Oct 15 π neurips.cc/Conferences/...
Pitch your AI startup at NeurIPS 2025 in Mexico City! The application deadline is just a month away on October 15th.
Learn more and apply now: neurips.cc/Conferences/...
Watch this new video of work by CDS Asst. Prof. Mengye Ren (@mengyer.bsky.social) and colleagues, who built PooDLe, an algorithm that helps AI systems learn from complex environments like busy streets.
The method mimics how humans process cluttered scenes.
www.nyu.edu/about/news-p...
We're excited to announce a second physical location for NeurIPS 2025, in Mexico City, which we hope will address concerns around skyrocketing attendance and difficulties in travel visas that some attendees have experienced in previous years.
Read more in our blog:
blog.neurips.cc/2025/07/16/n...
NeurIPS is endorsing EurIPS, an independently-organized meeting which will offer researchers an opportunity to additionally present NeurIPS work in Europe concurrently with NeurIPS.
Read more in our blog post and on the EurIPS website:
blog.neurips.cc/2025/07/16/n...
eurips.cc
NeurIPS is seeking additional ethics reviewers this year. If you are able and willing to participate in the review process, please sign up at the form in the link:
neurips.cc/Conferences/...
Please share this call with your colleagues!
CDS Asst. Prof. @mengyer.bsky.social, Courant PhD students Alex N. Wang and Christopher Hoang, and @ylecun.bsky.social introduce PooDLe: a self-supervised learning method enhancing AI vision in real-world videos by improving small object detection.
nyudatascience.medium.com/learning-to-...
The NeurIPS Position Track is seeking additional reviewers this year. If you'd like to serve or nominate a colleague, please complete our nomination form! docs.google.com/forms/d/e/1F...
The proposed 5% remittance tax on non-citizens is another blatant attack on foreign workers, who've contributed tremendously to the U.S. economy, have little path to citizenship, no birthright for their kids, and will face double taxation on their hard-earned dollars.
10/ This @agentic-ai-lab.bsky.social project was led by
Alex Wang @alexnwang.bsky.social and Chris Hoang @choang.bsky.social , together with Yuwen Xiong, @yann-lecun.bsky.social and @mengyer.bsky.social.
9/ For more details, please check out our paper and website, or stop by our poster (Fri 10 AM, Hall 3 + Hall 2B #336) at ICLR!
Paper: arxiv.org/abs/2408.11208
Website: agenticlearning.ai/poodle/
8/ We also study how data augmentation choices like crop scale, input resolution, and time between sampled frames can have a large impact on video pretraining.
7/ These performance differences manifest visually too! IN1K has noisy segmentations and FlowE misses small objects, while PooDLe avoids both problems.
6/ Interestingly, we find that dense SSL performance is driven by large classes whereas ImageNet pretraining does well on small, foreground classes.
PooDLe is able to perform well on both small and large classes!
5/ PooDLe, pretrained on BDD100K and Walking Tours, outperforms prior iconic and dense SSL methods on semantic segmentation and object detection!
We also release WT-Sem, an in-distribution semantic segmentation task for Walking Tours.