Snowflake's Arctic Long Sequence Training: How to Train LLMs on 15 Million Tokens Without Selling a Kidney
techlife.blog/posts/snowfl...
#ALST #Snowflake #LongContextTraining #DeepSpeed #HuggingFace #SequenceParallelism #LLMTraining #H100 #Llama8B #Qwen3 #GPUMemoryOptimization
#DeepSpeed
Posts tagged #DeepSpeed on Bluesky
0
0
0
0
New episode: #TypeChat, #DeepSpeed, #Entra & #Purview updates—and why we love a bit of improv in tech.
Start your week with practical insights and real-world commentary.
youtu.be/y2MeLt-o-D4
#CloudyWithAChance #MicrosoftCloud #Podcast #AI #Cybersecurity
3
0
0
0
DeepNVMe just got faster and more flexible:
✅ Gen5 NVMe support
✅ 20X faster model checkpointing
✅ Cost-efficient SGLang inference via ZeRO-Inference
✅ CPU-only pinned memory support
📘 pytorch.org/blog/deepnvm...
#PyTorch #DeepSpeed #AIInfrastructure
4
1
0
0
The PyTorch Foundation Transitions to a New Umbrella Structure for AI Innovation #United_States #Paris #PyTorch_Foundation #vLLM #DeepSpeed
1
0
0
0