Are there counterexamples of companies that scaled their systems this fast and had better reliability?
10 months later, we're hiring for a reliability-focused engineer :)
jobs.ashbyhq.com/modal/84467a...
If you know someone great that'd want to be a founding reliability eng for Modal's GPU-focused cloud platform, lmk!
Any book recommendations for stuff about aviation safety improvements? I'm currently reading Engineering a Safer World and I'm keen to read more about aviation and other engineering industry's successes.
Damn ok that's surprising.
How come @bcantrill.bsky.social didn't like Drift into Failure by Dekker?
Modal was originally built with CPU batch in mind, but the GenAI wave certainly swept us up :)
We love working with computational bio/chemistry customers on their mostly CPU-intensive batch workloads. modal.com/use-cases/co...
If you want a demo let me know :)
When I was at Canva I had to get a batch job to run demographics analysis vision models on over 90 million images. It took a couple weeks, but felt like with the right infra it could be done in a couple hours.
The infra now exists :)
Hypervisor as a Library #rustlang seiya.me/blog/hypervi...
We won't tell you where the deep and cheap GPU capacity is, but we will teach you how to fish.
Is The Soul of a New Machine the best book about the computer industry because Kidder is...
- an outsider
- a full-time writer
- simply a first-rate writer
Other industries have had "homegrown" Pulitzers (medicine, law, finance, biology), but not computing.
LLMs have shifted my blog post drafting back to my static-site repository away from Notion. I can play with HTML and JS so easily during iteration with Cursor.
Markdown+Blocks is hamstringing.
Is there an SRE book but for startups? Google's book is great but it doesn't contend with the tradeoffs and constraints of a young, growing startup.
History-posting once again: thundergolfer.com/blog/the-fir...
My brilliant @modal-labs.bsky.social colleague Charles won't stop until we're all pushing our GPUs to their limits: modal.com/blog/gpu-uti...
Oh was "it's an OOM larger" referring to the training cluster size?
What's the param count?
The Tom Brady of YouTube educational content is at it again. One of those rare lectures which deserves to be called enlightening. I hope @karpathy.bsky.social is steering clear of helicopters, submersibles, and smoking. We need him.
What didn't you like about Modal's exp? We're actively working on it, as we're dissatisfied with certain areas ourselves.
At @modal-labs.bsky.social your customer questions sometimes get a whole blog post :)
Why does an NVIDIA H100 SXM 80GB card offer 85.52 GB?
thundergolfer.com/blog/nvidia-...
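A rough sketch of the unit arithmetic that sits behind a question like this, in Python. It only converts between decimal gigabytes (10^9 bytes) and binary gibibytes (2^30 bytes); the helper names are made up for illustration, and this is not a summary of the linked post's explanation.

```python
# Decimal vs binary memory units. Helper names are hypothetical.
GB = 10**9    # gigabyte, decimal
GIB = 2**30   # gibibyte, binary


def gib_to_gb(gib: float) -> float:
    """Convert gibibytes to decimal gigabytes."""
    return gib * GIB / GB


def gb_to_gib(gb: float) -> float:
    """Convert decimal gigabytes to gibibytes."""
    return gb * GB / GIB


if __name__ == "__main__":
    # An "80GB" label read as 80 GiB, expressed in decimal GB.
    print(f"80 GiB   = {gib_to_gb(80):.2f} GB")
    # The 85.52 GB figure from the post, expressed in GiB.
    print(f"85.52 GB = {gb_to_gib(85.52):.2f} GiB")
```

Which unit a tool reports in, and how much memory is held back as reserved, are separate questions this sketch doesn't try to answer.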
Found out you get 220MiB more H100 HBM3e VRAM on Oracle Cloud compared to GCP. GCP's instances have more `reserved` memory and I can't figure out if this is configurable from the guest.
To learn about container snapshotting I highly recommend Tristan Hume's telefork repository. I forked it and started hacking on file descriptor restore: github.com/thundergolfe....
Once PID restore works there's a good chance a simple NVIDIA GPU checkpoint/restore would pass!
Wrote about how a warmed-up container can be saved to disk and later restored for a 2.5x cold start performance boost. Saving live container processes to disk turns out to be pretty wacky and interesting!
modal.com/blog/mem-sna...
This contains the shortest and clearest explanation of S3's $0.02/GiB/month cloud economics that I've seen, and lots more good stuff.
tailscale.com/blog/living-...
Ha all too true. I wouldn't even get 100% and I just reviewed the post. Still, we aspire…
Quick post about those 'latency numbers every programmer must know'. It became mostly an affirmation of @sirupsen.bsky.social's napkin-math repo (bookmark it).
New goal in 2025 is to write one software essay with the zany energy of an All Souls College essay prompt: Does the moral character of an orgy change when the participants wear Nazi uniforms?
thundergolfer.com/all-souls-so...