Spent years chasing bigger modelsโฆ turns out smarter engineering wins. ๐
I joined the @odsc.bsky.social AI X Podcast to talk about real-world production AI: RAG failures, quantization, SLMs, and building efficient systems that actually ship. ๐ง
Listen here: bit.ly/4ssrdMm
#AI
09.03.2026 13:20
๐ 2
๐ 1
๐ฌ 0
๐ 0
. @socallinuxexpo.bsky.social starts tomorrow!
I'll be presenting:
โข A Practical Guide to Training a Small Language Model: bit.ly/3LkKjo0
โข The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend: bit.ly/4rlTiEP
Sneak peek below ๐
04.03.2026 14:01
๐ 0
๐ 0
๐ฌ 0
๐ 0
Bigger models used to win headlines. Now they win power bills โก
On the @odsc.bsky.social blog, I break down how quantization and specialized SLMs are reshaping agentic AI around efficiency, not ego. It's about value per watt, not parameter count ๐
Read here: bit.ly/4s6iKye
02.03.2026 14:08
๐ 1
๐ 0
๐ฌ 0
๐ 0
Excited to join @WDI_conference 2026 ๐ My VoD session, โRethinking RAG: How MCP and Agent2Agent Will Transform the Future of Intelligent Searchโ, dives into governance, grounding & multi-agent design ๐
Register: bit.ly/474WJrs
Code: WID26SP20 ๐๏ธ
#GenAI
28.02.2026 15:50
๐ 0
๐ 0
๐ฌ 0
๐ 0
1 week until @socallinuxexpo.bsky.social
Session: The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend
bit.ly/4rlTiEP
Session: A Practical Guide to Training a Small Language Model: Tokenizers, Training, and Real-World Pitfalls
bit.ly/3LkKjo0
27.02.2026 14:13
๐ 0
๐ 0
๐ฌ 0
๐ 0
Excited to share that my talk was accepted forย @devoxx.fr.ย I'll be presenting "The Sound of Your Secrets: Teaching Your Model to Spy So You Can Learn to Defend," all about acoustic side-channel attacks and defenses.
More details: www.devoxx.fr/
25.02.2026 14:01
๐ 0
๐ 0
๐ฌ 0
๐ 0
Chorouk Malmoum, thank you for sharing this! It's deeply validating to see this coming out of NVIDIA's own research.
I've been saying this for almost a year now. The first time I put it on recordโฆ | David vonThenen
Chorouk Malmoum, thank you for sharing this! It's deeply validating to see this coming out of NVIDIA's own research.
I've been saying this for almost a year now. The first time I put it on record was at Devoxx UK 2025. At the time, it was based on hands-on experience building and validating multi-agent systems. But in this field, experience alone doesn't move the needle. You need data. You need papers. You need proof.
And now we have it.
The idea that Small Language Models are better suited for agentic workflows makes sense from a software engineering perspective.
Separation of concerns.
Encapsulation.
Instead of one giant "do everything" model, you create focused SLMs that act as subject matter experts. Each one handles a narrow domain.
There's another benefit people don't talk about enough: control.
A tightly scoped SLM is far more likely to say "I don't know" when it's outside its boundary. This is a good thing. A massive general model? It tends to guess. And it guesses confidently. In production systems, that's not intelligence. That's liability.
The second paper, introducing NVIDIA's Orchestrator-8B, is the piece I've been waiting for. A router that decides when to escalate a problem/question versus when to call a cheap tool or a smaller model. I'm very interested in experimenting with Orchestrator-8B. If the benchmarks hold up, this could materially change how we design cost-efficient agent systems at scale.
This news also just so happens to coincide with a workshop/tutorial I will be giving at Open Data Science Conference (ODSC) East (end of April) titled: ๐๐๐ฌ๐ฌ ๐๐จ๐ฆ๐ฉ๐ฎ๐ญ๐, ๐๐จ๐ซ๐ ๐๐ฆ๐ฉ๐๐๐ญ: ๐๐จ๐ฐ ๐๐จ๐๐๐ฅ ๐๐ฎ๐๐ง๐ญ๐ข๐ณ๐๐ญ๐ข๐จ๐ง ๐
๐ฎ๐๐ฅ๐ฌ ๐ญ๐ก๐ ๐๐๐ฑ๐ญ ๐๐๐ฏ๐ ๐จ๐ ๐๐ ๐๐ง๐ญ๐ข๐ ๐๐ . I will do my best to include the findings/learnings from Orchestrator-8B in that session.
(Tentative) Session Date: Tuesday, April 28
Session Info: https://bit.ly/4rxXeTg
Read Chorouk's full breakdown below for more information and links to the research.
.
NVIDIA's research backs it: Small Language Models > giant LLMs for agentic workflows ๐ฅ Focused SLMs = better control, lower cost, fewer "confident guesses."
Orchestrator-8B as a smart router? Game changer. I'll cover this at @odsc.bsky.social East! ๐
More info: bit.ly/4aCsYPG
23.02.2026 15:10
๐ 0
๐ 0
๐ฌ 0
๐ 0
Really looking forward to SCaLE this year. Going to be a lot of fun (with learning some cool stuff)!
22.02.2026 22:56
๐ 0
๐ 0
๐ฌ 0
๐ 0
Automate or Die Trying | David vonThenen
Recorded a great conversation last week with Wil Ramos (https://lnkd.in/gV9jQBUV) on the ๐๐ฎ๐ญ๐จ๐ฆ๐๐ญ๐ ๐จ๐ซ ๐๐ข๐ ๐๐ซ๐ฒ๐ข๐ง๐ podcast.
Wil, thanks for having me on. I appreciate the space to go deep on topics that don't always fit into a conference talk.
The episode should be out in a week or two. I'll share the link here once it's live.
Here's some of what we covered:
๐ How to harden RAG and agent workflows so they act only on verifiable evidence
- Grounded data
- Clear audit trails
- Preventing agents from drifting into hallucinated "actions" or "decisions"
๐ What it takes to make agentic automation safe enough to run unattended
- Guardrails
- Checkpoints
- Human-in-the-loop
๐ And the broader state of AI right now. Where it's moving. Where it's messy. And where we need to be more disciplined.
I've been listening to several episodes of ๐๐ฎ๐ญ๐จ๐ฆ๐๐ญ๐ ๐จ๐ซ ๐๐ข๐ ๐๐ซ๐ฒ๐ข๐ง๐ , and they're worth your time. What I like most is the range of perspectives. Different guests, different takes, real-world lessons. It's people building and securing real systems.
If you're into automation, security, or AI systems, subscribe to the podcast here:
YouTube: https://lnkd.in/gRKCEAJw
Spotify: https://lnkd.in/ggDUeAyR
More soon, once the episode drops.
Recorded an episode of "Automate or Die Trying" Podcast with Wil Ramos! ๐ We went deep on hardening RAG + agent workflows... grounded data, audit trails, guardrails, human-in-the-loop.
Teaser post on LinkedIn: bit.ly/4qQiyCo
22.02.2026 19:15
๐ 0
๐ 0
๐ฌ 0
๐ 0
OpenClaw: The Viral AI Agent that Broke the Internet - Peter Steinberger | Lex Fridman Podcast #491 | David vonThenen
I listened to the latest Lex Fridman Podcast episode:
๐๐ฉ๐๐ง๐๐ฅ๐๐ฐ: ๐๐ก๐ ๐๐ข๐ซ๐๐ฅ ๐๐ ๐๐ ๐๐ง๐ญ ๐ญ๐ก๐๐ญ ๐๐ซ๐จ๐ค๐ ๐ญ๐ก๐ ๐๐ง๐ญ๐๐ซ๐ง๐๐ญ ๐ฐ๐ข๐ญ๐ก ๐๐๐ญ๐๐ซ ๐๐ญ๐๐ข๐ง๐๐๐ซ๐ ๐๐ซ.
If you're building agents, it's worth a listen.
OpenClaw has exploded on GitHub. But what stood out to me was this (time: 23:26):
๐๐ง๐ ๐ญ๐ก๐๐ง ๐ญ๐ก๐ ๐๐ ๐๐ง๐ญ ๐ฐ๐จ๐ฎ๐ฅ๐ ๐ฃ๐ฎ๐ฌ๐ญ ๐ฆ๐จ๐๐ข๐๐ฒ ๐ข๐ญ๐ฌ ๐จ๐ฐ๐ง ๐ฌ๐จ๐๐ญ๐ฐ๐๐ซ๐โฆ ๐ ๐ฃ๐ฎ๐ฌ๐ญ ๐๐ฎ๐ข๐ฅ๐ญ ๐ข๐ญโฆ ๐ข๐ญ ๐ฃ๐ฎ๐ฌ๐ญ ๐ก๐๐ฉ๐ฉ๐๐ง๐๐.
We're not talking about scripted automation anymore. We're talking about systems that change themselves. That's impressive. It's also a different level of power.
Midway through, Lex raises the obvious issue (time: 52:50):
๐๐ซ๐จ๐ฆ๐ฉ๐ญ ๐ข๐ง๐ฃ๐๐๐ญ๐ข๐จ๐ง ๐ข๐ฌ ๐ฌ๐ญ๐ข๐ฅ๐ฅ ๐๐ง ๐จ๐ฉ๐๐ง ๐ฉ๐ซ๐จ๐๐ฅ๐๐ฆโฆ ๐ญ๐ก๐๐ซ๐'๐ฌ ๐ฌ๐จ ๐ฆ๐๐ง๐ฒ ๐ฉ๐จ๐ฌ๐ฌ๐ข๐๐ข๐ฅ๐ข๐ญ๐ข๐๐ฌโฆ ๐ง๐ฎ๐๐ง๐๐๐ ๐๐ญ๐ญ๐๐๐ค ๐ฏ๐๐๐ญ๐จ๐ซ๐ฌ.
Peter talks about progress, like scanning skills with VirusTotal. That's good. But the bigger point remains.
OpenClaw is a privileged automation runtime.
Your risk is dominated by:
1๏ธโฃ Credential exposure
2๏ธโฃ Network exposure
3๏ธโฃ Tool/skill supply chain
4๏ธโฃ Prompt injection and social engineering
You're basically giving a script sudo on your machine. Except now it improvises. Please see: https://bit.ly/4aFyXDn
And hopefully, you read the docs and you aren't running this on your actual machine, but some isolated cloud instance, VM, etc.
To run OpenClaw safely, Peter's advice is clear (time: 1:00:45):
๐๐ ๐ฒ๐จ๐ฎ ๐ฆ๐๐ค๐ ๐ฌ๐ฎ๐ซ๐ ๐ญ๐ก๐๐ญ ๐ฒ๐จ๐ฎ ๐๐ซ๐ ๐ญ๐ก๐ ๐จ๐ง๐ฅ๐ฒ ๐ฉ๐๐ซ๐ฌ๐จ๐ง ๐ฐ๐ก๐จ ๐ญ๐๐ฅ๐ค๐ฌ ๐ญ๐จ ๐ข๐ญโฆ ๐ข๐ง ๐ ๐ฉ๐ซ๐ข๐ฏ๐๐ญ๐ ๐ง๐๐ญ๐ฐ๐จ๐ซ๐คโฆ ๐ญ๐ก๐ ๐ซ๐ข๐ฌ๐ค ๐ฉ๐ซ๐จ๐๐ข๐ฅ๐ ๐๐๐ฅ๐ฅ๐ฌ ๐๐ฐ๐๐ฒ.
Isolation is the safe move.
And here's my 2 cents...
If you isolate OpenClaw or any AI assistant completely... no personal data, no real integrations, no privileged API keys... how useful is it really?
An AI assistant only becomes valuable when it knows who you are and can act on your behalf. That requires access. And access creates risk.
Without that, you have a cool demo. Not a production system. Not a ๐ฌ๐๐๐ AI assistant. I think it could be useful to perform long running tasks based on the knowledge contained within the LLM... those in the AI space with some know how, probably already have some equivalent of that. BUT, I have a feeling that with some of that OpenAI resource, a safe and production version might become a reality sooner rather than later.
The episode is fascinating and very honest about both the power and the risks... and also some really really amazing piece of tech that is taking the internet by storm.
Give it a listen (it's 3+ hours, but worth it):
https://bit.ly/4c0iFra
Just listened to Lex Fridman w/OpenClaw's creator ๐คฏ๐ค Self-modifying AI agents are hereโฆ and they're powerful.
But let's be real: security is a real concern ๐โ ๏ธ Privileged automation + access + personal info = serious risk.
I break it down here: bit.ly/4tTsmhJ ๐
20.02.2026 14:25
๐ 0
๐ 0
๐ฌ 0
๐ 0
Excited to share that my talk was accepted for NDC Copenhagen! I'll be presenting The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend.
๐๏ธ Session Time/Date: Thurs, Jun 4 at 15:00
๐ Location: Room 4
๐ Session Link: bit.ly/4qvPWxT
16.02.2026 14:07
๐ 0
๐ 0
๐ฌ 0
๐ 0
Three weeks out and I can't wait to bring this one to @socallinuxexpo.bsky.social 23x in Pasadena, CA ๐ง๐
Check out my session: The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend
๐๏ธ Sat, March 7 @ 14:30
๐ Room 101
๐ bit.ly/4rlTiEP
#SCaLE23x
11.02.2026 14:08
๐ 0
๐ 0
๐ฌ 0
๐ 0
Three weeks to go and I'm fired up to be speaking at @socallinuxexpo 23x in Pasadena ๐๐ง
Session: A Practical Guide to Training a Small Language Model: Tokenizers, Training, and Real-World Pitfalls
๐๏ธ Fri, Mar 6 @ 15:45
๐ Ballroom DE
๐ bit.ly/3LkKjo0
#SCaLE23x
09.02.2026 14:16
๐ 0
๐ 0
๐ฌ 0
๐ 0
Happy Friday! Just obtained my latest certificate.
Just a reminder... if you deploy anything, deploy to a VM, cloud instance, etc in complete isolation.
06.02.2026 16:20
๐ 0
๐ 0
๐ฌ 0
๐ 0
๐จ Exposed DBs, leaked prompts, 91% injection success. Moltbook wasn't hacked by geniuses, it failed basic engineering ๐ฌ
Handing control and data to unproven AI agents isn't innovation, it's risk.
Read the full breakdown ๐ bit.ly/4qXmqlI
๐ฅ #AISecurity
04.02.2026 14:58
๐ 0
๐ 0
๐ฌ 0
๐ 0
Home ODSC AI East 2026 - Boston
Learn, grow, and connect with 3.5K+ data practitioners in the heart of the AI boom. Expert-led sessions on LLMs, ML, Generative AI and more.
Excited to share that my talk was accepted for @odsc.bsky.social East! I'll be presenting: "Less Compute, More Impact: How Model Quantization Fuels the Next Wave of Agentic AI."
Details here: odsc.ai/east/
02.02.2026 14:13
๐ 0
๐ 0
๐ฌ 0
๐ 0
The internet is biting back. Creators are deploying "tarpits" to trap AI scrapers in infinite mazes of gibberish data. This "data poisoning" spikes training costs and breaks the uncompensated data buffet. Be indigestible. Grow spikes.
Blog: bit.ly/4jvNTId
28.01.2026 14:18
๐ 0
๐ 0
๐ฌ 0
๐ 0
Teams are rethinking RAG ๐๐ BM25-based Hybrid RAG hits the sweet spot with clear observability, solid answer quality, and one OpenSearch stack to run ๐คโจ An alternative to Graph-based variations!
This is why it sticks... learn more ๐ bit.ly/4pz0D3b
26.01.2026 14:07
๐ 0
๐ 0
๐ฌ 0
๐ 0
We Had 400 People Shop For Groceries. What We Found Will Shock You.
EXCLUSIVE: We uncovered a secret corporate scheme to raise grocery prices. We found that Instacart is using AI algorithms to charge customers different price...
AI isn't only optimizing search & ads, it's reshaping prices. ๐ธ
This podcast exposes techflation: algorithms charging different people different prices to extract max value. Groceries cost more because AI knows what you'll tolerate. Dark precedent.
๐ฅ bit.ly/4jKdDkl
22.01.2026 14:12
๐ 1
๐ 0
๐ฌ 0
๐ 0
I'll be presenting at @devoxxgreece.bsky.social ๐
Session 1: "The Sound of Your Secrets" will discuss acoustic side-channel attacks/defenses.
Session 2: "How Model Quantization Fuels the Next Wave of Agentic AI" will discuss efficient AI systems and models.
Info: devoxx.gr/
20.01.2026 14:08
๐ 0
๐ 0
๐ฌ 0
๐ 0
I guess you can say I was pretty active with speaking sessions at conferences in 2025 ๐คฃ
16.01.2026 14:13
๐ 0
๐ 0
๐ฌ 0
๐ 0
MIT study finds AI can already replace 11.7% of U.S. workforce
Artificial intelligence can already replace 11.7% of the U.S. labor market, across finance, health care and professional services, according to MIT's study.
๐ Reading some eye-opening stuff over the break...
MIT just dropped a study showing AI can already replace 11.7% of U.S. jobs. Not hype. Real data. Worth a read if you care about work, skills, and what's next.
๐ bit.ly/3Y76fWg
14.01.2026 14:18
๐ 0
๐ 0
๐ฌ 0
๐ 0
More powerful search: Rethinking RAG agents
The future of intelligent search starts with rethinking RAG, retrieval augmented generation. By harnessing the power of MCP (Model context protocol) and Agen...
New video is live! ๐๐ I break down how rethinking RAG with MCP + Agent2Agent unlocks more powerful, secure, agentic search. This is the same approach I shared at 3 conferences in late 2025.
Watch it now on the @Instaclustr YouTube! ๐ฅโจ bit.ly/3NiViPe
12.01.2026 15:17
๐ 1
๐ 0
๐ฌ 0
๐ 0
Hybrid RAG breaks the black box ๐๐ Combining Graphs or BM25, and vectors turns retrieval into something explainable, governable, and production-ready for real enterprise AI ๐คโจ Want agents you can trust?
Read the full breakdown here ๐ bit.ly/4pz0D3b
#HybridRAG
07.01.2026 15:29
๐ 0
๐ 0
๐ฌ 0
๐ 0
Got two talks accepted at SCaLE 23x! ๐ง
1. A Practical Guide to Training a Small Language Model
2. The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend
One of the best conferences of the year! See you there!
Register: bit.ly/4aMlprJ ๐
05.01.2026 14:05
๐ 0
๐ 0
๐ฌ 0
๐ 0
2026 AI Outlook: How Agents, Context, and Governance Will Shape Real-World AI
As artificial intelligence continues its rapid evolution, one thing has become increasingly clear: progress isnโt being driven by a single breakthrough, but by a convergence of architectural shifts, cultural changes, and hard-earned lessons from production deployments. ODSC AI is built on our community โ and that involves our speakers, attendees,...
What did 2025 really change about AI, and what must 2026 fix to make it work in the real world? ๐คโจ Top builders and thinkers share bold predictions (including mine) on agents, context, and governance.
Read it here on the @odsc.bsky.social blog ๐ bit.ly/3YdViSW
#AI #Governance
02.01.2026 14:11
๐ 1
๐ 0
๐ฌ 0
๐ 0
Got my ChatGPT year in review... I know ChatGPT is the ultimate hype man, but it's good to see it thinks I am chasing difficult problems and trying to bring that information/material to others. See you in 2026!
31.12.2025 14:13
๐ 0
๐ 0
๐ฌ 0
๐ 0
IBM CEO says there is 'no way' spending trillions on AI data centers will pay off at today's infrastructure costs
IBM CEO Arvind Krishna walked through some napkin math on Big Tech's AI data center spending โ and raised some doubts on if it'll prove profitable.
Some reading for the break. I was thinking about this a lot...
IBM's CEO walks through the data center math and basically says the returns don't add up ๐ธ
The capex numbers are wild, the AGI bets feel shaky. This math is math-ing for me.
Link: bit.ly/4qkwCE8 ๐
29.12.2025 14:12
๐ 1
๐ 0
๐ฌ 0
๐ 0
Linux Foundation Announces the Formation of the Agentic AI Foundation (AAIF), Anchored by New Project Contributions Including Model Context Protocol (MCP), goose and AGENTS.md โ Agentic AI Foundation (AAIF)
Been catching up on AI stuff, and this is a definite must-read ๐ค
The @linuxfoundation.org just launched the Agentic AI Foundation. Open governance, real standards, and projects like MCP, goose, etc, under one roof. ๐
Link: bit.ly/4s4cGHv
24.12.2025 14:04
๐ 1
๐ 0
๐ฌ 0
๐ 0
AI Expert: We Have 2 Years Before Everything Changes! We Need To Start Protesting! - Tristan Harris
Ex-Google Insider and AI Expert TRISTAN HARRIS reveals how ChatGPT, China, and Elon Musk are racing to build uncontrollable AI, and warns it will blackmail h...
Iโve been binge listening to Diary of a CEO and these two AI interviews are wild. Super fascinating from a social angle. Perfect holiday break listens ๐ง๐
Check them out:
1. bit.ly/3Y7405i
2. bit.ly/3NcAetx
22.12.2025 14:14
๐ 1
๐ 0
๐ฌ 0
๐ 0