Alvaro Bartolome's Avatar

Alvaro Bartolome

@alvarobartt.com

machine learning @hf.co (inference + cloud) opinions, code and mistakes are my own http://github.com/alvarobartt

712
Followers
39
Following
38
Posts
19.11.2024
Joined
Posts Following

Latest posts by Alvaro Bartolome @alvarobartt.com

Preview
Release v1.9.0 · huggingface/text-embeddings-inference What's changed? 🚨 Breaking changes Default HiddenAct::Gelu to GeLU + tanh in favour of GeLU erf by @vrdn-23 in #753 Default GeLU implementation is now GeLU + tanh approximation instead of exact...

And much more additions, improvements and fixes, that couldn't have been possible without the community support and contributions πŸ™πŸ»

github.com/huggingface/...

17.02.2026 16:05 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

- πŸ’š NVIDIA Blackwell support, ready for next-gen GPUs as B200, GB200, or RTX 50-series.

17.02.2026 16:05 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

- πŸ”„ Add bidirectional attention support for 3, enabling newer embedding models as Voyage AI by MongoDB.

17.02.2026 16:05 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

- πŸ¦™ Add support for Meta Llama 2 and 3 architectures with Flash Attention support, enabling embedding models as NVIDIA Llama Embed Nemotron.

17.02.2026 16:05 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

- πŸŽ‰ Add support for Microsoft Deberta V2 and V3, for both feature-extraction (and sentence-similarity) and text-classification, enabling models as Meta Llama Prompt Guard.

17.02.2026 16:05 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

More embedding models and an even more reliable inference engine is what you get with @hf.co Text Embeddings Inference v1.9.0 πŸ’₯

More in the thread 🧡

17.02.2026 16:05 πŸ‘ 3 πŸ” 3 πŸ’¬ 1 πŸ“Œ 0

github.com/alvarobartt/...

05.01.2026 16:30 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

`hf-mem` is all you need to estimate the required VRAM for inference of any model on @huggingface based on Safetensors metadata.

- Written in Python
- Lightweight, only depends on `httpx`
- Runs w/ `uvx` as `uvx hf-mem ...`
- Works with any Safetensors repository
- Output inspired by usgraphics.com

05.01.2026 16:30 πŸ‘ 0 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

🧨 I built something with #Zig!

`tokeni.zig` is a std-only implementation of the Byte Pair Encoding (BPE) algorithm in Zig for tokenizing sequences of text, used by OpenAI (among many others) to tokenize the text when pretraining their large language models!

github.com/alvarobartt/...

13.03.2025 15:50 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
How to read and parse JSON with Zig 0.13 How to read and parse JSON with Zig 0.13 by alvarobartt

https://alvarobartt.me/how-to-read-and-parse-json-with-zig-0-13

10.02.2025 16:15 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

For anyone interested in Zig I wrote a small post titled "How to read and parse JSON with Zig 0.13" that explains how to read JSON from a file with keys with different value types and how to access those values.

10.02.2025 16:15 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

love this quote "working smarter helps, but the real superpower is resting smarter"

a highly recommended read!

03.02.2025 09:05 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Right, the point is that on Rust you end up "refactoring" a lot (at least I do), but seems easier to handle, whilst on Zig I don't feel is as easy, not especially complex either, just more cumbersome

31.01.2025 16:05 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

πŸ€— Here's a simple script that calculates the required VRAM for serving DeepSeek R1 from @huggingface Hub safetensor's metadata!

P.S. The result of the script above is: "model_id='deepseek-ai/DeepSeek-R1' requires memory=756.716GB"

31.01.2025 16:04 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

hmm refactoring in zig is not as easy as it's in rust, even though seems fairly common too, right? or is it just me? πŸ€”

31.01.2025 08:31 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

stuff that matters takes time

29.01.2025 12:28 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1 Fully open reproduction of DeepSeek-R1. Contribute to huggingface/open-r1 development by creating an account on GitHub.

Last moments of closed-source AI πŸͺ¦ :
Hugging Face is openly reproducing the pipeline of 🐳 DeepSeek-R1. Open data, open training. open models, open collaboration.

🫡 Let's go!
github.com/huggingface/...

25.01.2025 14:36 πŸ‘ 33 πŸ” 7 πŸ’¬ 0 πŸ“Œ 1
Preview
DeepSeek-R1 - a deepseek-ai Collection We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Check DeepSeek-R1 collection on the Hugging Face Hub, with not just DeepSeek-R1 and DeepSeek-R1-Zero, but also distilled their reasoning patterns to fine-tune smaller models!

huggingface.co/collections/...

23.01.2025 13:49 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

🐐 DeepSeek is not on the @hf.co Hub to take part, they are there to take over!

Amazing stuff from the DeepSeek team, ICYMI they recently released some reasoning models (DeepSeek-R1 and DeepSeek-R1-Zero), fully open-source, their performance is on par with OpenAI-o1 and it's MIT licensed!

23.01.2025 13:45 πŸ‘ 10 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

you can find so much gold in github gists wow, i was not a big fan because the discoverability doesn't seem great, but been exploring gists lately and so much gold stuff in there!

23.01.2025 08:10 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Hugging Face Hugging Face Email Forms

in case anyone missed it, we're running a certified course on ai agents at hugging face starting on feb 2nd; the course is on how to build you own ai agents for different cool use cases built on top of open source!

πŸ‘‡ you can sign up in the link below, don't miss it!

bit.ly/hf-learn-age...

22.01.2025 12:23 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

ok, here we go again πŸ˜…

22.01.2025 08:23 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

because it's my native language, anyway it was just an idea, not sure I'll do it anyway πŸ€—

28.11.2024 08:42 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Not quite sure yet about how's following me here, but I may consider not just x-posting but also eventually post more random thoughts + content in Spanish, is that something you'd be interested in?

27.11.2024 07:43 πŸ‘ 3 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

awesome πŸ€—

20.11.2024 09:40 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

how do I get in there? πŸ€—

20.11.2024 09:39 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

here we go again!

i work at hugging face and here you can expect posts about machine learning (llms mainly), some rust, some nvim nerdy stuff and anything related to hugging face πŸ€—

posting is not easy for me, but i’ll try to do better from now on, support is highly appreciated!

20.11.2024 09:30 πŸ‘ 14 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0
Post image

Read more about the Serverless Inference API in the documentation!

https://huggingface.co/docs/api-inference

19.11.2024 16:15 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

πŸ”₯ Finally, if you are willing to get started quickly and experiment with LLMs feel free to give the recently released Inference Playground a try!

https://huggingface.co/playground

19.11.2024 16:15 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0