Albert Villanova del Moral's Avatar

Albert Villanova del Moral

@albertvillanova

Machine Learning Engineer @ Hugging Face

402
Followers
1,332
Following
52
Posts
15.11.2024
Joined
Posts Following

Latest posts by Albert Villanova del Moral @albertvillanova

Preview
Release v0.29.0 · huggingface/trl Features Add environment_factory to GRPOTrainer GRPOTrainer now accepts an environment_factory argument, allowing users to specify a custom environment class for training. This enables more flexibl...

If you’re building AI agents, RLHF pipelines, or training infra: give trl-training a spin in v0.29.0 and let us know what you think. We are working at improving it! 🤗

The era of agent-native ML tooling is here 🚀

🔗 github.com/huggingface/...

26.02.2026 14:21 👍 1 🔁 1 💬 0 📌 0

Under the hood:
- A dedicated trl-training skill definition
- CLI-first design
- Clean integration with existing TRL commands
- Structured documentation so LLM agents can reason over it

26.02.2026 14:21 👍 0 🔁 0 💬 1 📌 0
Preview
Overview - Agent Skills A simple, open format for giving agents new capabilities and expertise.

What does this mean?

Instead of “figuring out” how to train with TRL…

Agents now have a clearly defined interface to:
✅ Run SFT
✅ Run DPO
✅ Run GRPO
✅ Launch training jobs
✅ Work with the TRL CLI safely

All through a structured SKILL.md.

26.02.2026 14:21 👍 0 🔁 0 💬 1 📌 0
Post image

🚀 Big news for TRL users with AI agents:

We just released the trl-training SKILL in TRL v0.29.0 🤗

Agents can now use the TRL CLI as a structured capability:

26.02.2026 14:21 👍 2 🔁 2 💬 1 📌 0
Post image

5 years already working in democratizing AI 🤗
Grateful to be part of such an awesome team making it happen every day.

12.02.2026 08:33 👍 2 🔁 0 💬 0 📌 0
Post image

Latest smolagents release supports GPT-5: build agents that think, plan, and act.
⚡ Upgrade now and put GPT-5 to work!

12.08.2025 10:46 👍 1 🔁 0 💬 0 📌 0
Preview
Release v1.21.0 · huggingface/smolagents Enhancements ✨ Tool & Model Improvements Add Tool to_code_prompt and to_tool_calling_prompt by @albertvillanova in #1599 Improve function body extraction in tools by @HairlessVillager in #1627 S...

🚀 smolagents v1.21.0 is here!
Now with improved safety in the local Python executor: dunder calls are blocked!
⚠️ Still, not fully isolated: for untrusted code, use a remote executor instead: Docker, E2B, Wasm.
✨ Many bug fixes: more reliable code.
👉 github.com/huggingface/...

11.08.2025 09:19 👍 4 🔁 1 💬 1 📌 0
Preview
Release v1.20.0 · huggingface/smolagents New Features ✨ Remote Execution: Implement remote Python WasmExecutor by @albertvillanova in #1261 Callbacks: Support callbacks after planning step by @albertvillanova in #1445 Support multi...

🔥 This feature is live in smolagents v1.20.0!

Try it out.
Break things. Extend it. Give us feedback.

Let's build safer, smarter agents; together 🧠⚙️

👉 github.com/huggingface/...

#smolagents #opensource #AIagents #WebAssembly #Deno #Pyodide #HuggingFace #AgenticAI

11.07.2025 13:53 👍 3 🔁 1 💬 0 📌 0
Preview
Implement remote Python WebAssemblyExecutor by albertvillanova · Pull Request #1261 · huggingface/smolagents Implement remote Python WebAssemblyExecutor.

Check out the PR 👉 github.com/huggingface/...

Let's reimagine what agent-driven code execution can look like: remote, safe, and Wasm-powered.

11.07.2025 13:53 👍 0 🔁 0 💬 1 📌 0

👷 Want to help?

This is a foundation we're inviting the community to build on:
• Add feature support
• Expand compatibility to new use cases
• Help make agents even more secure + capable

Contribute, test, break things, and PR back 🚀

11.07.2025 13:53 👍 0 🔁 0 💬 1 📌 0

🙅‍♂️ Why no network calls (for now)?

Because Pyodide in Node.js needs experimental flags for stack switching, and some APIs (like urllib3) aren’t yet compatible out of the box.

But this is just the beginning.

11.07.2025 13:53 👍 0 🔁 0 💬 1 📌 0

🚧 What’s supported right now?

✅ Basic Python logic
✅ Standard control flow
✅ Limited stdlib support
❌ No HTTP requests (yet)
❌ No advanced I/O or external packages

It's early: a solid MVP for secure use cases. 💡

11.07.2025 13:53 👍 0 🔁 0 💬 1 📌 0

🔐 Why WebAssembly?

WebAssembly = portable, fast, and secure.
It enables:
• Safe remote execution of Python code
• Zero access to the host machine
• Consistent, deterministic environments
• Serverless + edge-compatible agent workflows

11.07.2025 13:53 👍 0 🔁 0 💬 1 📌 0

💥 Big milestone just dropped in smolagents:

The CodeAgent can now execute Python code remotely in WebAssembly, powered by Pyodide + Deno!

Your agent-generated Python runs in a secure, sandboxed Wasm runtime: no local execution required.

Here's why this changes the game 🧵👇

11.07.2025 13:53 👍 0 🔁 0 💬 1 📌 0
Preview
Release v1.19.0 · huggingface/smolagents Enhancements 🛠️ Agent Upgrades: Support managed agents in ToolCallingAgent by @albertvillanova in #1456 Support context managers for agent cleanup by @tobiasofsn in #1422 Change code tags to xml...

7/
🧵 That’s a wrap for smolagents v1.19.0!

Grab the latest, test the improvements, and tell us what you build.
👇👇👇
github.com/huggingface/...
#smolagents #AIagents #OSS #HuggingFace

24.06.2025 14:31 👍 0 🔁 0 💬 0 📌 0

6/
📚 Docs Game: Leveled Up

* ✨ Quickstart got tool usage examples
* 💡 Added open in Colab buttons!
* 🧠 Reference docs now include AgentMemory, better docstrings

24.06.2025 14:31 👍 0 🔁 0 💬 1 📌 0

5/
🐛 Bug Fixes Blitz
From final answer logic to Docker logs, this release patched up:

* Planning logic bugs 🧠
* Type-safe wrappers 🛡️
* Executor output matching 🧵
* Prompt fixes for additional_args 📝

24.06.2025 14:31 👍 0 🔁 0 💬 1 📌 0

4/
📋 Agent Output Tracking

* CodeAgent outputs are now stored in ActionStep
📂 More structured traceability of actions & results!

24.06.2025 14:31 👍 0 🔁 0 💬 1 📌 0

3/
📡 Streaming Just Got Smarter

* Streaming event aggregation was moved off the Model class
➕ Cleaner structure, better separation of concerns

24.06.2025 14:31 👍 0 🔁 0 💬 1 📌 0

2/
🎛️ UI Improvements

* New reset_agent_memory feature in GradioUI 💾
- Handy for debugging or starting fresh mid-run!

24.06.2025 14:31 👍 0 🔁 0 💬 1 📌 0

1/
🧠 Agent Upgrades

* You can now use managed agents with ToolCallingAgent 🔧
* Agent cleanup supports context managers: easier resource handling!
* Switched code output tags to XML for clarity

24.06.2025 14:31 👍 0 🔁 0 💬 1 📌 0

🚀New Release: smolagents v1.19.0 is out!

This one's packed with agent upgrades, smoother streaming, better docs, and bug squashing galore. 🧵👇

24.06.2025 14:31 👍 0 🔁 0 💬 1 📌 0
Preview
Release v1.17.0 · huggingface/smolagents New Features ✨ Structured Generation in CodeAgent: Add optional support for structured outputs in CodeAgent, enabling more reliable and consistent generation patterns Adding optional structured g...

New in smolagents v1.17.0:
- Structured generation in CodeAgent 🧱
- Streamable HTTP MCP support 🌐
- Agent.run() returns rich RunResult 📦

Smarter agents, smoother workflows.
Try it now: github.com/huggingface/...

27.05.2025 12:07 👍 1 🔁 0 💬 0 📌 0
Preview
TinyAgents: A Minimal Experiment with Code Agents and MCP Tools A Blog post by Albert Villanova del Moral on Hugging Face

Read the blog post 👉 huggingface.co/blog/albertv...

16.05.2025 12:54 👍 0 🔁 0 💬 0 📌 0
Preview
TinyAgents: A Minimal Experiment with Code Agents and MCP Tools A Blog post by Albert Villanova del Moral on Hugging Face

🚀 Experiment with Tiny Agents: minimalist LLM agents powered by async MCP tools, inspired by @julien_c

🔍 A side-by-side comparison of tool-calling vs. code agent

💡 See how code agents handle complex workflows in one step, reducing model calls and boosting efficiency!

16.05.2025 12:54 👍 0 🔁 0 💬 1 📌 0
GitHub - huggingface/smolagents: 🤗 smolagents: a barebones library for agents that think in python code. 🤗 smolagents: a barebones library for agents that think in python code. - huggingface/smolagents

New in smolagents v1.16.0:
🔍 Bing support in WebSearchTool
🐍 Custom functions & executor_kwargs in LocalPythonExecutor
🔧 Streaming GradioUI fixes
🌐 Local web agents via api_base & api_key
📚 Better docs

👉 github.com/huggingface/smolagents

16.05.2025 06:56 👍 1 🔁 0 💬 0 📌 0
Preview
Building Code Agents with Hugging Face smolagents Build agents that write and execute code to perform complex tasks, using Hugging Face’s smolagents.

Code Agents generate full code blocks and execute them in one go, instead of calling tools one-by-one. Learn how to build smarter, faster agents in a new @deeplearningai.bsky.social course by @hf.co.

Build with smolagents → sandboxed, reliable, powerful.

🔗 www.deeplearning.ai/short-course...

23.04.2025 17:23 👍 1 🔁 0 💬 0 📌 0
Preview
Release v1.14.0 · huggingface/smolagents New features 🚀 SmolAgents MCP Client by @grll in #1200 Introduce a new MCPClient class that allows users to manage connections to one or more MCP servers, making integration with SmolAgents more ...

Full release 👉 github.com/huggingface/...
#smolagents #LLM #AgenticAI

22.04.2025 07:27 👍 1 🔁 0 💬 0 📌 0
Preview
Release v1.14.0 · huggingface/smolagents New features 🚀 SmolAgents MCP Client by @grll in #1200 Introduce a new MCPClient class that allows users to manage connections to one or more MCP servers, making integration with SmolAgents more ...

smolagents v1.14.0 is out! 🚀
🔌 MCPClient: A sleek new client for connecting to remote MCP servers, making integrations more flexible and scalable.
🪨 Amazon Bedrock: Native support for Bedrock-hosted models.
SmolAgents is now more powerful, flexible, and enterprise-ready. 💼

22.04.2025 07:27 👍 1 🔁 1 💬 1 📌 0

International Women's Day (IWD) 2025: www.internationalwomensday.com

08.03.2025 17:41 👍 0 🔁 0 💬 0 📌 0