Fei Liu (@feiliu-nlp) — bluesky.baby

Overall, these spot talks were a gem. There don't seem to be recordings, and I hope the slides could be released. Already looking forward to next year! #NeurIPS2025 #NeurIPSanDiego

07.12.2025 20:11 👍 1 🔁 0 💬 0 📌 0

Boson AI Demo - Voice Chat

Dec 4: Alex Smola's "Boson.AI Talk to me - Engineering Conversational Intelligence" was a great presentation. The talk covers data collection, model design, and alignment for high-quality voice AI, which're key ingredients to train models that sound realistic. Try the demo: www.boson.ai/demo/shop

07.12.2025 20:11 👍 4 🔁 0 💬 1 📌 0

T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning Large Language Models (LLMs) have demonstrated impressive capabilities as intelligent agents capable of solving complex problems. However, effective planning in scenarios involving dependencies betwee...

Dec 3: Shixiong Zhang and Genta Winata introduced Capital One's T1 dataset. This is a tool-augmented, multi-domain, multi-turn conversational dataset designed for agent planning. Loved seeing how T1-Agent handles complex, dependency-heavy workflows. Paper link: arxiv.org/abs/2505.16986

07.12.2025 20:11 👍 0 🔁 0 💬 1 📌 0

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation A world model enables an intelligent agent to imagine, predict, and reason about how the world evolves in response to its actions, and accordingly to plan and strategize. While recent video generation...

Dec 3: Zhengzhong Liu and Jiannan Xiang shared “Towards a Blueprint for Open Science of Foundation Models.” The talk presents an interactive, long-horizon world model that predicts future states via high-quality video simulation. Work done at IFM@MBZUAI. Read the paper: arxiv.org/abs/2511.09057

07.12.2025 20:11 👍 0 🔁 0 💬 1 📌 0

From agent soup to proper software design: Mellea puts developers back in control - David Cox YouTube video by All Things Open

Dec 2: David Cox's talk, "From Agent Soup to Proper Software Design" offered a refreshing take on building reliable LLM systems. Loved the pitch behind Mellea, a generative AI library that gives developers more control through software design principles. YouTube link: www.youtube.com/watch?v=j2ou...

07.12.2025 20:11 👍 0 🔁 0 💬 1 📌 0

Really enjoyed the Exhibitor Spot Talks at #NeurIPS this year! These are 12-minute short talks packed with interesting ideas. Some of the most fun talks I attended are:

07.12.2025 20:11 👍 1 🔁 0 💬 1 📌 0

PlanGenLLMs: A Modern Survey of LLM Planning Capabilities LLMs have immense potential for generating plans, transforming an initial world state into a desired goal state. A large body of research has explored the use of LLMs for various planning tasks, from ...

That makes it difficult to compare systems across domains, or figure out which one's best for a new planning problem.

That's where our paper comes in: We offer a comprehensive overview of LLM planning agents, highlighting gaps, challenges, and what's next.

Check it out 👉 arxiv.org/abs/2502.11221

30.07.2025 16:42 👍 2 🔁 0 💬 0 📌 0

🧠 Planning is a core aspect of both human and artificial intelligence.

LLMs/agents have been used in various planning tasks, from navigating websites and planning trips to querying databases, but most benchmarks are narrow and task-specific.

30.07.2025 16:42 👍 0 🔁 0 💬 1 📌 0

🏆 Thrilled that our paper #PlanGenLLMs (arxiv.org/abs/2502.11221) won the SAC Award at #ACL2025!!

Couldn't have done it without the amazing team: Hui Wei, Zihao Zhang, Shenghua He, Tian Xia, and Shijia Pan. So thankful and beyond proud! 💖 #ACL2025NLP #NLProc

30.07.2025 16:42 👍 2 🔁 0 💬 1 📌 0

Happy to share our paper got selected as an Oral Presentation at #ACL2025!

Out of 8,000+ submissions and 3,000+ accepted papers, only 245 were chosen for oral (<3%)!

📄 Paper: arxiv.org/abs/2502.11221
💻 Resource: github.com/wll199566/Aw...

05.07.2025 23:41 👍 0 🔁 0 💬 0 📌 0

Autonomous agents are powerful, but without guardrails, they drift into inefficiency.

We view 'cost' as a form of guardrail and use Monte Carlo Tree Search with explicit cost-awareness to guide LLM-based planning.

Link: arxiv.org/pdf/2505.14656

29.06.2025 13:31 👍 1 🔁 0 💬 0 📌 0

Fei Liu

Latest posts by Fei Liu @feiliu-nlp