JobSet scaling to 130,000 Pods across 130,000 Nodes, sustaining 1,000 Pods/sec.
Incredible to see #Kubernetes pushed to this level of performance.
cloud.google.com/blog/product...
Kubeflow Trainer v2.1.0 is released!
Stream in-memory tabular data to GPUs from distributed cache with zero-copy transfer
Fine-tune LLMs on #Kubernetes with MLX on CUDA - now easier than ever!
Topology Aware Scheduling with #Kueue or #Volcano - essential for GB200
bit.ly/4qO8iM1
Kubeflow Trainer 2.0 is here!
Built in collaboration with the #Kubernetes & #Kubeflow communities to make scalable AI model training easier than ever - with a Python SDK, resilient @pytorch.org support, LLM fine-tuning, gang scheduling, MPI runtimes & more.
blog.kubeflow.org/trainer/intro/
Want to see how we've made it super easy to perform distributed training for ML frameworks like MLX and DeepSpeed on #Kubernetes? Check out our talk tomorrow at 2pm: From High Performance Computing To AI Workloads on Kubernetes: MPI Runtime in Kubeflow TrainJob
sched.co/1tx9k
This #KubeCon + #CloudNativeCon 2025 in London promises to be an inspiring and insightful event.
Don't miss these sessions to discover how we're pushing the boundaries of innovation in Cloud Native AI/ML and #GenAI.
sched.co/1tx9k
sched.co/1tcz0
sched.co/1u5fl
sched.co/1u5ii
New Kubeflow Python SDK Proposal!
We're working on a Kubeflow Python SDK to improve the user experience for data scientists & ML engineers.
More information can be found at: groups.google.com/g/kubeflow-d...
We need your feedback!
#Kubeflow #AI #ML #PythonSDK
Truly inspiring to see a student I mentored during GSoC 2024 presenting at #KubeCon + #CloudNativeCon.
Kubeflow is a fantastic opportunity for anyone looking to shape the future of AI and cloud native LLMOps - don't miss out!
youtu.be/4myE0DPp6Ko
Excited to introduce the new MPI Runtime in Kubeflow Trainer V2 at #KubeCon + #CloudNativeCon Europe.
We will showcase how it empowers ML frameworks like MLX, DeepSpeed, and NVIDIA NeMo to streamline distributed AI model development on #Kubernetes.
sched.co/1tx9k
#AIML #HPC @cncf.bsky.social
Exciting news from the Kubeflow community!
Welcome Francisco Javier Arceo & Julius von Kohout to the Kubeflow Steering Committee! Huge thanks to Mathew Wicks, Josh Bottum, & James Wu for their leadership & dedication. More information: groups.google.com/g/kubeflow-d...
#Kubeflow #OpenSource #AI
It is incredible to see what our team has accomplished externally over the past year. I am super proud to be part of this journey. More things to come in 2025!
Public talks: lnkd.in/ebbEUU8X
Leadership: lnkd.in/ew6GXcex
OSS Contributions: lnkd.in/ei2j2wrk
I am excited to join the Program Committee for Kubeflow Summit, co-located with #KubeCon + #CloudNativeCon in London 2025, alongside @akgraner.bsky.social.
Have a story to share about the #Kubeflow ecosystem and Cloud Native AI/ML? The CFP is open until December 4th.
bit.ly/4ib9S6a
#KubeCon #CloudNativeCon London's CFP hit a record with 2800+ submissions! This compares to 2541 for Paris, a 10%+ increase... may the odds be in everyone's favor for what looks to be a record-breaking event! (book hotels early!) events.linuxfoundation.org/kubecon-clou...
I shared the #Kubeflow 2024 highlights and future roadmap at the latest #CNCF WG AI meeting. The Kubeflow community has made incredible progress this year, driving the future of cloud native AI/ML on Kubernetes.
Watch the recording: youtu.be/u4Mf3Jh8v2E?...
View the slides: bit.ly/4fEql19
This marks a significant milestone for the Kubeflow Community. Stay tuned for details on our upcoming Kubeflow Steering Committee Election 2024!
If you missed our session announcing Kubeflow Training V2 at #KubeCon + #CloudNativeCon NA, check out the recording. We showcased how we've made it effortless to fine-tune and train LLMs on Kubernetes.
More to come soon!
youtu.be/Lgy4ir1AhYw
Amazing article from the @cncf.bsky.social!
They asked professional developers about the usefulness of Batch and AI/ML compute technologies. It's great to see that more teams are adopting projects from the #Kubeflow ecosystem.
www.cncf.io/reports/cncf...
I am super excited to present our latest updates on Kubeflow Training V2 at #KubeCon + #CloudNativeCon NA on November 14th.
This milestone is the result of an incredible collaboration between the #Kubernetes Batch and #Kubeflow Training working groups. Don't miss it!
sched.co/1i7nV
Hi there, let's go back to 2010.