
Jonathon Belotti

@jonathonbelotti

Peeling back the layers @modal_labs. Previously data & ML platform @canva. dms open.

62 Followers · 131 Following · 27 Posts · Joined 08.11.2024

Latest posts by Jonathon Belotti @jonathonbelotti

Are there counterexamples of companies that scaled their system this fast and had better reliability?

02.03.2026 23:36 👍 1 🔁 0 💬 0 📌 0

10 months later, we're hiring for a reliability focused engineer :)

jobs.ashbyhq.com/modal/84467a...

If you know someone great that'd want to be a founding reliability eng for Modal's GPU-focused cloud platform, lmk!

02.02.2026 04:08 👍 2 🔁 0 💬 0 📌 0

Any book recommendations for stuff about aviation safety improvements? I’m currently reading Engineering a Safer World and I’m keen to read more about the successes of aviation and other engineering industries.

08.01.2026 21:41 👍 1 🔁 0 💬 0 📌 0

Damn ok that’s surprising.

08.01.2026 21:37 👍 0 🔁 0 💬 1 📌 0

How come @bcantrill.bsky.social didn’t like Drift into Failure by Dekker?

08.01.2026 15:22 👍 2 🔁 0 💬 1 📌 0

Modal was originally built with CPU batch in mind, but the GenAI wave certainly swept us up :)

We love working with computational bio/chemistry customers on their mostly CPU-intensive batch workloads. modal.com/use-cases/co...

If you want a demo let me know :)

23.05.2025 16:09 👍 3 🔁 0 💬 0 📌 0
Introducing Modal Batch: Process 1 Million Jobs with 1 Line of Code
Modal Batch is a new interface backed by a new durable queue system built specifically to make job processing easy, scalable, and fault-tolerant.

When I was at Canva I had to get a batch job to run demographics analysis vision models on over 90 million images. It took a couple weeks, but felt like with the right infra it could be done in a couple hours.

The infra now exists :)

23.05.2025 15:20 👍 1 🔁 0 💬 0 📌 1
Hypervisor as a Library

Hypervisor as a Library #rustlang seiya.me/blog/hypervi...

20.05.2025 16:42 👍 31 🔁 6 💬 1 📌 0
Linear Programming for Fun and Profit
How we use an eighty-year-old algorithm to find arbitrages in the cloud market.

We won't tell you where the deep and cheap GPU capacity is, but we will teach you how to fish.

07.05.2025 18:28 👍 1 🔁 0 💬 0 📌 0

Is The Soul of a New Machine the best book about the computer industry because Kidder is...

- an outsider
- a full-time writer
- simply a first-rate writer

Other industries have had ‘homegrown’ Pulitzers (medicine, law, finance, biology) but not computing.

05.04.2025 19:53 👍 0 🔁 0 💬 0 📌 0

LLMs have shifted my blog post drafting away from Notion and back to my static-site repository. I can play with HTML and JS so easily during iteration with Cursor.

Markdown+Blocks is hamstringing.

03.04.2025 23:51 👍 1 🔁 0 💬 0 📌 0

Is there an SRE book but for startups? Google’s book is great but it doesn’t contend with the tradeoffs and constraints of a young, growing startup.

31.03.2025 13:41 👍 5 🔁 1 💬 1 📌 1
The First LLM
A tracing of the history of GPT-1 and its predecessors.

History-posting once again: thundergolfer.com/blog/the-fir...

24.03.2025 13:52 👍 1 🔁 1 💬 0 📌 0
'I paid for the whole GPU, I am going to use the whole GPU': A high-level guide to GPU utilization
A guide to maximizing the utilization of GPUs, from cloud allocations to FLOP/s.

My brilliant @modal-labs.bsky.social colleague Charles won't stop until we're all pushing our GPUs to their limits: modal.com/blog/gpu-uti...

25.02.2025 23:01 👍 0 🔁 0 💬 0 📌 0

Oh was “it’s an OOM larger” referring to the training cluster size?

18.02.2025 14:38 👍 1 🔁 0 💬 1 📌 0

What’s the param count?

18.02.2025 14:31 👍 0 🔁 0 💬 1 📌 0
Deep Dive into LLMs like ChatGPT
YouTube video by Andrej Karpathy

The Tom Brady of YouTube educational content is at it again. One of those rare lectures that deserves to be called enlightening. I hope @karpathy.bsky.social is steering clear of helicopters, submersibles, and smoking. We need him.

16.02.2025 16:47 👍 0 🔁 0 💬 0 📌 0

What didn’t you like about Modal’s exp? We’re actively working on it, since we’re dissatisfied with certain areas.

06.02.2025 13:43 👍 0 🔁 0 💬 0 📌 0

At @modal-labs.bsky.social your customer questions sometimes get a whole blog post :)

Why does an NVIDIA H100 SXM 80GB card offer 85.52 GB?

thundergolfer.com/blog/nvidia-...

02.02.2025 22:06 👍 0 🔁 0 💬 0 📌 0

Found out you get 220 MiB more H100 HBM3 VRAM on Oracle Cloud compared to GCP. GCP's instances have more `reserved` memory and I can't figure out if this is configurable from the guest.

01.02.2025 18:33 👍 0 🔁 0 💬 0 📌 0
GitHub - thundergolfer/telefork: Like fork() but teleports the forked process to a different computer!

To learn about container snapshotting I highly recommend Tristan Hume's telefork repository. I forked it and started hacking on file descriptor restore: github.com/thundergolfe....

Once PID restore works there's a good chance a simple NVIDIA GPU checkpoint/restore would pass!

29.01.2025 21:20 👍 1 🔁 0 💬 0 📌 0
Memory Snapshots: Checkpoint/Restore for Sub-second Startup
Serializing container state to disk for aggressive cold start optimization.

Wrote about how a warmed-up container can be saved to disk and later restored for a 2.5x cold start performance boost. Saving live container processes to disk turns out to be pretty wacky and interesting!

modal.com/blog/mem-sna...

29.01.2025 21:18 👍 10 🔁 2 💬 2 📌 2
Living in the future, by the numbers
Instead of making the traditional New Year predictions, let’s talk instead about the beautiful technological future we live in: the one that exists right now but we don’t always notice.

This contains the shortest and clearest explanation of S3’s $0.02/GiB/month cloud economics that I’ve seen, and lots more good stuff.

tailscale.com/blog/living-...

22.01.2025 15:40 👍 2 🔁 1 💬 0 📌 0

Ha all too true. I wouldn’t even get 100% and I just reviewed the post. Still, we aspire...

20.01.2025 01:36 👍 1 🔁 0 💬 0 📌 0
Beyond ‘latency numbers every programmer should know’
Took 10 years, but there's finally a better list.

post url: thundergolfer.com/latency-numb...

19.01.2025 17:32 👍 1 🔁 0 💬 0 📌 0

Quick post about those 'latency numbers every programmer must know'. It became mostly an affirmation of @sirupsen.bsky.social's napkin-math repo (bookmark it).

19.01.2025 17:31 👍 14 🔁 3 💬 2 📌 0
An ‘All Souls Examination’ of the machine
Inspiration towards a better software essay.

New goal in 2025 is to write one software essay with the zany energy of an All Souls College essay prompt: Does the moral character of an orgy change when the participants wear Nazi uniforms?

thundergolfer.com/all-souls-so...

07.01.2025 02:51 👍 0 🔁 0 💬 0 📌 0
README | GPU Glossary

GPUs can be understood: modal.com/gpu-glossary...

12.12.2024 23:06 👍 2 🔁 0 💬 0 📌 0