Home New Trending Search
About Privacy Terms
#
#DeepSpeed
Posts tagged #DeepSpeed on Bluesky
Preview
Snowflake's Arctic Long Sequence Training: How to Train LLMs on 15 Million Tokens Without Selling a Kidney Snowflake AI Research just open-sourced Arctic Long Sequence Training (ALST), a framework that pushes LLM training from a measly 32K tokens to over 15 million — a 469x improvement — using standard Hug...

Snowflake's Arctic Long Sequence Training: How to Train LLMs on 15 Million Tokens Without Selling a Kidney

techlife.blog/posts/snowfl...

#ALST #Snowflake #LongContextTraining #DeepSpeed #HuggingFace #SequenceParallelism #LLMTraining #H100 #Llama8B #Qwen3 #GPUMemoryOptimization

0 0 0 0
TypeChat, DeepSpeed, Entra & Purview Updates and the Joys of Not Preparing | EP22
TypeChat, DeepSpeed, Entra & Purview Updates and the Joys of Not Preparing | EP22 YouTube video by Cloudy with a Chance of Insights, The MSFT Podcast

New episode: #TypeChat, #DeepSpeed, #Entra & #Purview updates—and why we love a bit of improv in tech.
Start your week with practical insights and real-world commentary.

youtu.be/y2MeLt-o-D4

#CloudyWithAChance #MicrosoftCloud #Podcast #AI #Cybersecurity

3 0 0 0
Post image

DeepNVMe just got faster and more flexible:
✅ Gen5 NVMe support
✅ 20X faster model checkpointing
✅ Cost-efficient SGLang inference via ZeRO-Inference
✅ CPU-only pinned memory support

📘 pytorch.org/blog/deepnvm...
#PyTorch #DeepSpeed #AIInfrastructure

4 1 0 0
Preview
The PyTorch Foundation Transitions to a New Umbrella Structure for AI Innovation The PyTorch Foundation has broadened its scope by becoming an umbrella foundation, welcoming innovative projects like vLLM and DeepSpeed to foster open-source AI development.

The PyTorch Foundation Transitions to a New Umbrella Structure for AI Innovation #United_States #Paris #PyTorch_Foundation #vLLM #DeepSpeed

1 0 0 0