NNsight is part of a growing open-source ecosystem. We're building the infrastructure so you can focus on the science.
Upgrade to NNsight 0.6 today: pip install nnsight --upgrade
nnsight.net
github.com/ndif-team/nnsight
discuss.ndif.us
@ndif-team
The National Deep Inference Fabric, an NSF-funded computational infrastructure to enable research on large-scale Artificial Intelligence. π NDIF: https://ndif.us π§° NNsight API: https://nnsight.net πΈ GitHub: https://github.com/ndif-team/nnsight
NNsight is part of a growing open-source ecosystem. We're building the infrastructure so you can focus on the science.
Upgrade to NNsight 0.6 today: pip install nnsight --upgrade
nnsight.net
github.com/ndif-team/nnsight
discuss.ndif.us
Read our blog post to learn more about the design of this release and its features: nnsight.net/blog/2026/02/26/introducing-nnsight-06/
We also ship first-class support for AI coding agents, including skills for Claude Code and Codex, Context7 MCP server for live docs, and comprehensive guides in the repo.
github.com/ndif-team/skills
Other additions include:
- Clean error tracebacks that point to YOUR code, not NNsight internals
- Check NDIF before submitting jobs with ndif.status()
- Standard for step-in tracer.iter[:] generation loops (faster than with blocks!)
0.6 also comes with 2.4β3.9x performance improvements.
- Empty trace: 1196ΞΌs β 308ΞΌs
- 12 .save() calls: 1697ΞΌs β 716ΞΌs.
The big wins: always-on trace caching, persistent pymount, and batched variable sync. Setup cost dropped from ~1,100ΞΌs to ~210ΞΌs
NNsight 0.6 also introduces first-class support for VisionLanguageModel (e.g., LLaVA, Qwen-VL) and DiffusionModel (e.g., Stable Diffusion, Flux)! Available remote on NDIF soon π
vLLM integration got a major upgrade, now supporting single-GPU, multi-GPU tensor parallelism, Ray distributed execution, and even multi-node experiments, all using the same tracing API. NNsight handles the tensor gather/scatter, allowing you to intervene on unsharded tensors.
Also new is async mode with real-time token streaming. Build interactive apps like chat interfaces, or live visualizations with interventions running on every forward pass:
This is huge for the ecosystem. Libraries like NNterp can ship new features without waiting for NDIF to update. You always run whatever version you have locally.
Our #1 request: "I want to run my own analysis code on NDIF, not just inline interventions." Now, with NNsight source code serialization, you can! Your local packages work, even if NDIF doesn't have them installed.
NNsight 0.6 is out now! We directly address your feedback in our biggest release yet. Pain points included cryptic errors, slow traces, no remote execution of custom code, and limited vLLM support. We tackle all of these and more in this new release.
π§΅ Here's what changed:
For more details, read the paper: arxiv.org/abs/2503.10965
Red teams trained a model with a secret objective by exploiting RLHF reward models. Blue teams then audited the model, using techniques such as interpretability with sparse autoencoders, behavioral attacks, and training data analysis to successfully uncover the hidden objective.
Watch Sam Marks present his work "Auditing Language Models for Hidden Objectives" in our new YouTube video! Sam's team ran a blind auditing game to assess efficacy of black box and white box techniques for LLM alignment auditing.
π youtu.be/jZiOJTHqB6M
Big thanks to the whole organizing team, especially @neelnanda.bsky.social and
Andy Arditi, for hosting such a great workshop and inviting us to speak!
Adam Belfki discusses NDIF and Workbench (youtu.be/zmHyaHiw8XU)
- workbench.ndif.us/
- ndif.us/
- nnsight.net/
@sfeucht.bsky.social presents their work on concept induction heads (youtu.be/Jc-sTXW31W0)
- dualroute.baulab.info/
@davidbau.bsky.social shares his thoughts on pragmatic interpretability (youtu.be/iMIsg32mVHM)
- davidbau.com/archives/20...
- In response to: www.alignmentforum.org/posts/StENz...
Check out the NDIF & Bau Lab lightning talks at the NeurIPS 2025 Mechanistic Interpretability Workshop (mechinterpworkshop.com/): youtube.com/playlist?li...
New YouTube video posted! @benjaminfeuer.bsky.social discusses LLM's annus mirabilis, presenting his work on open questions surrounding LLM judges, benchmark trustworthiness, and maximizing the potential of synthetic data.
Watch here: www.youtube.com/watch?v=pehc...
π₯I am super excited for the official release of an open-source library we've been working on for about a year!
πͺinterpreto is an interpretability toolbox for HF language modelsπ€. In both generation and classification!
Why do you need it, and for what?
1/8 (links at the end)
π Paper: arxiv.org/abs/2411.16725
π» Code & Visualizations: github.com/revelio-dif......
π Deepti's Website: deeptigp.github.io/
New year, new YouTube videos! We are resuming our regular interpretability seminar posts, with a fantastic talk by Deepti Ghadiyaram on interpreting diffusion models.
Watch the video: youtu.be/4eqvABPX5rA
So excited to have you on the team, @gsarti.com!
Report issues or contribute to the open-source project: github.com/ndif-team/n...
Add support for new models (or custom ones): ndif-team.github.io/nnterp/addi...
Try out built-in interventions like logit lens and patchscope: ndif-team.github.io/nnterp/inte...
nnterp by @butanium.bsky.social is now part of the NDIF ecosystem! nnterp standardizes transformer naming conventions, includes built-in best practices for common interventions, and is perfectly compatible with original HF model implementations.
Learn more: ndif-team.github.io/nnterp/