When to call it quits in LLM reasoning? π
βͺMartina's internship project suggests trace monitoring metrics and classifiers that can detect when an LLM reasoning trace is going to fail in mid way. The approach saves up to 70% of token usage, and it even helps with increasing accuracy by 2%-3%.
22.10.2025 22:39
π 2
π 1
π¬ 0
π 0
Working on this project was a great experience during my internship at @msftresearch.bsky.social π
Learned so much from this amazing team! Huge thanks to my coauthors: @vidhishab.bsky.social, Safoora Yousefi, @besmiranushi.bsky.social, @erichorvitz.bsky.social
22.10.2025 15:38
π 0
π 0
π¬ 0
π 0
We also found that these signals emerge EARLY in reasoning! At just 4k tokens, we can predict solution quality with ROC-AUC > 0.6.
This enables early path selection during parallel generation and ~60% token savings with +2.1% accuracy gains π
22.10.2025 15:38
π 0
π 0
π¬ 1
π 0
Using LT signals for answer selection in multi-sample inference leads to:
β‘ 48% average token reduction (up to 70%!)
π +2.6% accuracy improvement over majority voting
π― Works by identifying correct paths even when the majority is wrong
22.10.2025 15:38
π 0
π 0
π¬ 1
π 0
Hidden states have distinctive temporal patterns for correct paths. They show:
β΄οΈ Larger overall representational change (Net β)
β΄οΈ Less wandering in latent space (Cumulative β)
β΄οΈ More direct progress toward final state (Aligned β)
22.10.2025 15:38
π 0
π 0
π¬ 1
π 0
Across 3 reasoning models (DeepSeek-R1, Phi-4-Reasoning-Plus, Qwen3) and diverse domains (GPQA, AIME, TSP), LT signals:
β
Significantly predict correctness
β
Outperform output-based confidence measures and cross-layer signals
22.10.2025 15:38
π 0
π 0
π¬ 1
π 0
We track how representations evolve through the trace and extract 3 complementary signals:
π Net Change: Overall shift (start β end)
π Cumulative Change: Total movement
π― Aligned Change: Progress toward final state
22.10.2025 15:38
π 0
π 0
π¬ 1
π 0
Identifying trace quality is critical: it enables more reliable predictions, improves efficiency by avoiding wasted compute, and can be used to guide models toward productive reasoning strategies.
Our solution: Look inside the temporal evolution of the model's latent space! π
22.10.2025 15:38
π 0
π 0
π¬ 1
π 0
But not all reasoning traces are equal βοΈ β some contain productive steps that lead to correct solutions β
, while others deviate into overthinking, fail to converge, or exhibit inconsistent reasoning patterns β
22.10.2025 15:38
π 0
π 0
π¬ 1
π 0
Modern LLMs use chain-of-thought reasoning to solve complex problems, generating step-by-step solutions that can span thousands of tokens.
πScaling this inference-time compute (longer traces, multiple samples) significantly improves performance across reasoning tasks.
22.10.2025 15:38
π 0
π 0
π¬ 1
π 0
Looking forward to presenting this work next week at #ICLR2025! DM me if you are attending and want to grab a coffee to discuss these topics π«
18.04.2025 18:55
π 21
π 4
π¬ 0
π 0
December 5th our ML theory group at Cohere For AI is hosting @mathildepapillon.bsky.social to discuss their recent review arxiv.org/abs/2407.09468 on geometric/topological/algebraic ML.
Join us online π«
02.12.2024 13:14
π 14
π 1
π¬ 0
π 2
Iβm putting together a starter pack for researchers working on human-centered AI evaluation. Reply or DM me if youβd like to be added, or if you have suggestions! Thank you!
(It looks NLP-centric at the moment, but thatβs due to the current limits of my own knowledge π)
go.bsky.app/G3w9LpE
21.11.2024 15:56
π 36
π 10
π¬ 15
π 1
I tried to find everyone who works in the area but I certainly missed some folks so please lmk...
go.bsky.app/BYkRryU
23.11.2024 05:11
π 53
π 18
π¬ 32
π 0
Does anyone know of any feeds (or similar) for student internship opportunities in ML/CV/NLP?
22.11.2024 07:19
π 45
π 11
π¬ 2
π 1
I've found starter packs on NLP, vision, graphics, etc. But personally, I would love to know and hear from researchers working on vision-language. So, let me know if you'd like to join this starter pack, would be happy to add!
go.bsky.app/TENRRBb
19.11.2024 19:52
π 55
π 13
π¬ 42
π 2
How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledgeπ¦? In our new preprint, we look at the pretraining data and find evidence against this:
Procedural knowledge in pretraining drives LLM reasoning βοΈπ’
π§΅β¬οΈ
20.11.2024 16:31
π 854
π 138
π¬ 36
π 24
LLMs tend to match problem-solving strategies based on textual similarity rather than truly understanding the underlying principles of mathematical problems.
Paper: Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration From Cognitive Psychology
18.11.2024 21:29
π 47
π 7
π¬ 0
π 1
A starter pack of people working on interpretability / explainability of all kinds, using theoretical and/or empirical approaches.
Reply or DM if you want to be added, and help me reach others!
go.bsky.app/DZv6TSS
14.11.2024 17:00
π 80
π 26
π¬ 34
π 0
If youβre interested in mechanistic interpretability, I just found this starter pack and wanted to boost it (thanks for creating it @butanium.bsky.social !). Excited to have a mech interp community on bluesky π
go.bsky.app/LisK3CP
19.11.2024 00:28
π 36
π 8
π¬ 3
π 2
π I also work on the field (examples on my profile). Would love to be added!
19.11.2024 09:42
π 1
π 0
π¬ 0
π 0
Bluesky Network Analyzer
Find accounts that you don't follow (yet) but are followed by lots of accounts that you do follow.
I forgot from whom in my feed I got this from, but anyway, this network analyzer is crazy efficient. It gives you ideas for accounts to follow based on your own followees. I just added 50 accounts or so.
bsky-follow-finder.theo.io
18.11.2024 21:32
π 81
π 24
π¬ 9
π 6
there are many smart speakers and thinkers around AI/ML and/or NLP. but i find almost everything to be kinda predictable by now, minor stylistic variations on the same story. who are some *interesting* speakers i should listen/read? i want things that may surprise or inspire me.
16.11.2024 20:41
π 95
π 12
π¬ 12
π 0
Any Latin Americans here working in Cognitive Science, very broadly construed? (Neuroscience, Psychology, Artificial Intelligence, Anthropology, Linguistics, Economics, Ethics, Philosophy, and moreβ¦)
I thought Iβd create a starter pack but I could only find a handful of us. Say hi?
17.11.2024 13:37
π 1
π 5
π¬ 2
π 0
It is intuitive to observe some complex-looking model behavior (e.g., the classification of images of different animals using an abstract category) and infer an interesting capacity of the model (e.g., the ability to build rich representations that abstract away from particular animals).
17.11.2024 14:34
π 0
π 1
π¬ 1
π 0
We found that the mechanisms behind the emergence of these representations are similar to those of LLMs, and can be found across a variety of vision transformers and layer types.
17.11.2024 14:06
π 1
π 0
π¬ 0
π 0