Very nice touch, Gmail π
Part 2 of my journey building a smart home! π
In this part:
> ESPHome & custom component
> RF433 receiver & transmitter
> Hassio custom addon
Just published a new article on my blog πββοΈ
Building My Smart Home - Part 1: Plan, Idea & Home Assistant
Check it out!
Kudos to Google and the llama.cpp team! π€
GGUF support for Gemma 270M right from day-0
Watch it here: www.youtube.com/watch?v=Qtzz...
Richy Mini and SmolLM3 are featured in GitHub's weekly news! π π
Gemma 3n has arrived in llama.cpp π¨βπ³ π°
Comes in 2 flavors: E2B and E4B (E means "effective/active parameters")
See you this Sunday at AI Plumbers conference: 2nd edition!
π Where: GLS Event Campus Berlin, Kastanienallee 82 | 10435 Berlin
π Register here: lu.ma/vqx423ct
β¨β¨ AIFoundry is bringing you the AI Plumbers Conference: 2nd edition β an open source meetup for low-level AI builders to dive deep into "the plumbing" of modern AI
π Where: GLS Event Campus Berlin, Kastanienallee 82 | 10435 Berlin
π When: June 15, 2025
π Register now: lu.ma/vqx423ct
Hugging Face Inference Endpoints now officially support deploying **vision** models via llama.cpp π π
Try it now: endpoints.huggingface.co/catalog
Real-time webcam demo with @huggingface.bsky.social SmolVLM and llama.cpp server.
All running locally on a MacBook M3
We have A100, H200, M3 Ultra, etc.
Still can't match the power of that Casio FX π
llama.cpp vision support just got much better! π
Traditionally, models with complicated chat templates like MiniCPM-V or Gemma 3 required a dedicated binary to run.
Now, you can run all supported models with a single binary, "llama-mtmd-cli" π₯
(Only Qwen2VL is not yet supported)
Learn more: blog.ngxson.com/introducing-...
Finally have time to write a blog post about ggml-easy! π
ggml-easy is a header-only wrapper for GGML that simplifies development with a cleaner API, easy debugging utilities, and native safetensors loading β¨ Great for rapid prototyping!
Someone at Google definitely had a lot of fun making this π
And if you don't know, it's available in "Starter apps" section on AI Studio. The app is called "Gemini 95"
Telling LLM memory requirement WITHOUT a calculator?
Just use your good old human brain π§ π
Check out my 3-step estimation π
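For anyone who wants the arithmetic spelled out, here is a rough back-of-envelope sketch of the idea (my own simplification of the usual method, not necessarily the exact 3 steps in the graphic above): weights take roughly params × bits-per-weight ÷ 8 bytes, then you pad for KV cache and runtime buffers.

```python
def estimate_memory_gb(n_params_b: float,
                       bits_per_weight: float,
                       overhead: float = 1.2) -> float:
    """Back-of-envelope VRAM/RAM estimate for running an LLM.

    n_params_b      -- model size in billions of parameters
    bits_per_weight -- ~16 for F16, ~8 for Q8, ~4.5 for typical Q4 quants
    overhead        -- fudge factor (~20%) for KV cache, activations, buffers
    """
    # Step 1: bytes for the weights themselves
    weights_gb = n_params_b * bits_per_weight / 8
    # Steps 2-3: add headroom for KV cache and runtime buffers
    return weights_gb * overhead
```

Example: a 7B model at ~4.5 bits/weight lands around 4 GB of weights, so with overhead you should budget roughly 5 GB. You can do that in your head: 7 × 4.5 ≈ 32, divide by 8, add ~20%.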
Google having a quite good sense of humor π
Joke aside, a 1B model quantized to Q4 without performance degradation is sweet π€
Cooking a fun thing today: I can now load safetensors files directly into GGML without having to convert them to GGUF!
Why? Because this allows me to run experiments faster, especially with models outside of llama.cpp π
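Part of what makes this feasible is how simple the safetensors container is: an 8-byte little-endian header length, then a JSON header mapping tensor names to dtype/shape/byte-offsets, then the raw tensor data. Here's a minimal Python sketch of reading that header (for illustration only, this is not the actual GGML-side loader, which is C/C++):

```python
import json
import struct

def read_safetensors_header(path: str) -> dict:
    """Parse the JSON header of a .safetensors file.

    On-disk layout: an 8-byte little-endian u64 giving the header size,
    then that many bytes of JSON mapping
    tensor name -> {"dtype", "shape", "data_offsets"},
    followed by the raw tensor bytes.
    """
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    header.pop("__metadata__", None)  # drop the optional metadata entry
    return header
```

Once you have the header, each tensor's bytes sit at `8 + header_len + data_offsets[0]`, so the file can be memory-mapped and the weights fed straight into ggml tensors, no GGUF conversion step needed.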
No vibe coding. Just code it β
Visit my website --> ngxson.com
π
The Live Webinar will happen at
π 11 AM SF β 2 PM NYC β 6 PM London β 7 PM Paris
πππ Register here: app.getcontrast.io/register/sot... πππ
On Monday, the 24th, I'm proud to give a talk at sota's webinar.
My main talk will be an hour-long deep dive into the current state of on-device LLMs, exploring their advantages, trade-offs, and limitations.
The session will end with a Q&A, where you can ask me anything about this subject.
Had a fantastic chat today with Georgi Gerganov, the brilliant mind behind ggml, llama.cpp, and whisper.cpp! We discussed:
π The integration of vision models into llama.cpp
π The challenges of maintaining a smooth UX/DX
π The exciting future of llama.cpp
Big things ahead - stay tuned!
OK now you are the best, Gememe 2.0
Yes, while waiting for the proper support, I made this temporary playground so that people can get an idea of what llama.cpp will become in the near future :)