Github has a recommendation on doing dotfiles:
dotfiles.github.io
I just published a new guide on configuring #OpenClaw 🦀 to run with #Azure AI Foundry models. You control data control, so more privacy, talk to it from #Telegram or using the console!
Check it out here: suraj.io/post/2026/op...
Apple has a new native container CLI for macOS! Run Linux containers without Docker Desktop—with sub-second startup times. 🚀
My guide covers setup, resource limits, and fixing macOS firewall blocks:
🔗 suraj.io/post/2026/us...
#macOS #Containers
Try it:
/goodreads tell me about project hail mary by andy weir
/goodreads add the midnight library to my want to read shelf
Install with one command:
clawhub install goodreads
🔗 clawhub.ai/surajssd/goodreads
🐙 github.com/surajssd/openclaw-goodreads-skill
2/n: Since Goodreads deprecated their API in 2020, this skill uses browser automation under the hood. No API keys (but you'd need to login once) — just the browser tool doing what you'd do manually!
1/n 📚 Made something for fellow book nerds using Openclaw:
A Goodreads skill that lets your AI agent search for books, pull up details & reviews, get personalized recommendations, and manage your reading lists — all through natural language.
Deploying #Kimi K2.5 on #Azure: A Complete Guide to Running MoonshotAI's Model suraj.io/post/2026/de...
Running Pydantic’s Monty Rust sandboxed Python subset in WebAssembly
simonwillison.net/2026/Feb/6/p...
Thanks to @scott.hanselman.com for showing me Handy (handy.computer) — a free, open-source speech-to-text tool that runs locally on your machine. Push-to-talk, privacy-focused, and just works. Check it out!
Running Docker Commands on a Remote Machine via SSH suraj.io/post/2026/re...
#docker #ssh #remote #containers #cli #development #devops
Using Claude Code with GitHub-Hosted Anthropic Models suraj.io/post/2026/us... #claude #github-models #ai #litellm #anthropic
Meta’s Kubernetes-based Portable AI Research Environment youtu.be/ts7bI51gRCo?...
Our talk (me & Yuhan Liu) on improving LLM serving efficienty is on YouTube now!
youtu.be/2YCDvZokqnk?...
#vllm #kubernetes #kubecon
Infinite scale: The architecture behind the Azure AI superfactory
blogs.microsoft.com/blog/2025/11...
Gemini 3, Open AI kv cache and much more
open.substack.com/pub/simonw/p...
and also allow you to do kv cache offload to local storage for 24hrs! Also they cache only when the query is greater than 1024 tokens!
Open AI gave some of the details from the user POV as to what kv cache features are available platform.openai.com/docs/guides/... It is interesting to see that they cache for 10 min and if no request is found they remove hot caches from GPU
From Wisconsin to Atlanta: Microsoft connects datacenters to build its first AI superfactory
news.microsoft.com/source/featu...
Satya Nadella – How Microsoft thinks about AGI
youtu.be/8-boBsWcr5A?...
How One Line of Code Freed 30,000 CPU Cores: Deep-Diving Fluent Bit at Petabyte Scale www.youtube.com/watch?v=pbOv...
Come see us (me & Yuhan Liu) tomorrow for our talk.
Specifically, Wednesday November 12, 2025 5:30pm - 6:00pm EST at Building B | Level 5 | Thomas Murphy Ballroom 1.
More info: sched.co/27FcQ #kubecon #vllm
Announcing Ray Direct Transport: RDMA Support in Ray Core
www.anyscale.com/blog/ray-dir...
This has become whackamole now, source: www.youtube.com/watch?v=AXN-...
I ran the following command in Mac's terminal to get Chrome working with uBlock Origin:
```
open -a /Applications/Google\ Chrome.app --args --disable-features=ExtensionManifestV2Unsupported,ExtensionManifestV2Disabled
```
Building a tool to copy-paste share terminal sessions using Claude Code for web
open.substack.com/pub/simonw/p...
LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference
arxiv.org/abs/2510.09665
Understanding Memory Management on Hardware-Coherent Platforms | NVIDIA Technical Blog developer.nvidia.com/blog/underst...
Join me and Yuhan Liu for our talk at the upcoming #Kubecon NA 2025 in Atlanta: sched.co/27FcQ we will talk about increasing efficency while serving #LLMs using #vLLM & #LMCache!
Using Claude Code but with Github Copilot hosted Claude models:
github.com/surajssd/dot...
TFS @nilekh.bsky.social
NVIDIA Blackwell Leads on SemiAnalysis InferenceMAX v1 Benchmarks | NVIDIA Technical Blog developer.nvidia.com/blog/nvidia-...