π’ PhD position in the NeuroAI of Language
Why can LLMs predict brain activity so well? We're hiring a PhD student to find out -- AI interpretability meets neuroimaging
Deadline March 20
Please RT π
π
mpi.nl/career-education/vacancies/vacancy/fully-funded-4-year-phd-position-neuroai-language
05.03.2026 13:34
π 45
π 35
π¬ 2
π 1
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics
Do large language models (LLMs) solve reasoning tasks by learning robust generalizable algorithms, or do they memorize training data? To investigate this question, we use arithmetic reasoning as a rep...
Great test for anyone learning mech interp is reading Nikankin et al's "Arithmetic Without Algorithms" which uses activation patching / circuits, probing, logit lens, describing max activating examples.. If you you follow along while reading, you'll realize you know a lot!
arxiv.org/abs/2410.21272
04.03.2026 06:07
π 15
π 4
π¬ 1
π 0
We were thrilled to host @mtutek.bsky.social at our lab last week.
His talk "From Internals to Integrity: How Insights into Transformer LMs Improve Safety, Interpretability, and Explanation Faithfulness" led to great discussions! π
#Transformers #AISafety #ExplainableAI #MLResearch #NLProc
24.02.2026 14:05
π 18
π 3
π¬ 0
π 0
Can LLMs figure out who you are from your anonymous posts?
From a handful of comments, LLMs can infer where you live, what you do, and your interests; then search for you on the web.
New π w/ @SimonLermenAI, @joshua_swans, @AerniMichael, Nicholas Carlini, @florian_tramer π§΅
20.02.2026 17:03
π 122
π 44
π¬ 8
π 14
Code examples for the Python package elfen:
# initializing extractor
extractor = elfen.Extractor(
data = df,
language = "en",
text_column = "text")
# extracting a single feature: ttr
extractor.extract("ttr")
# extracting a feature area/group: readability
extractor.extract_feature_group("readability")
# extracting all available features
extractor.extract_features()
Extracting structural/linguistic properties for large text datasets can be annoying. Existing tools either are not maintained, do not scale, or do not cover extensive sets of linguistic features.
For this reason, I implemented π§elfen, a Python package for efficient linguistic feature extraction
18.02.2026 15:55
π 10
π 5
π¬ 1
π 0
π¨ Emergency reviewer needed for ARR Resources and Evaluation track! Please ping me if you could review one paper by Friday. Topic is AI hallucinations, broadly speaking.
11.02.2026 11:10
π 1
π 3
π¬ 1
π 0
Hello #NLProc #ACL2026NLP people. I am looking for **two emergency reviewers** in the Safety and Alignment in LLMs track for ACL/ARR.
Reviews are due Feb 15th. Please DM if interested and available.
Happy to offer drinks/food if you live in/pass by Lisbon βοΈ
10.02.2026 14:59
π 6
π 10
π¬ 0
π 0
Hello #NLProc #ACL2026NLP community, I'm looking for an emergency reviewer for an ARR submission on LLM interpretability.
If you're available to complete a review before Feb 15, please reply or DM π
10.02.2026 14:41
π 2
π 6
π¬ 0
π 0
I need one too!
10.02.2026 14:35
π 2
π 2
π¬ 0
π 0
I could use an emergency reviewer for an ACL submission involving interpretability and syntax. Please DM me if you might be able to provide an emergency review before February 15!
10.02.2026 07:38
π 4
π 4
π¬ 1
π 0
I am looking for 2 emergency reviewers for the ARR Ethics, Bias & Fairness track. Please DM me if you are available π
10.02.2026 09:27
π 6
π 6
π¬ 0
π 0
I'm looking for two emergency reviewers π§βππ©βπ for the ARR January Generalizability and Transfer track.
Please reach out if you have time & qualify for review or RT for visibilityππ
10.02.2026 11:43
π 2
π 6
π¬ 0
π 0
Seems to be a common situation for ACs this round, but I'm also looking for two emergency reviewers for the January #ARR Evaluation and Resources track. I'd appreciate any help (reposts, encouragement, black magic...)
10.02.2026 11:15
π 3
π 6
π¬ 0
π 0
Looking for emergency reviewers for ARR Special Track "Explainability of NLP Models". Topics: Faithfulness, mechanistic interpretability, surveys and position papers. Deadline Feb 14 AoE. #ACL2026NLP
09.02.2026 17:33
π 8
π 7
π¬ 1
π 1
I'm sorry the guy changing the rules on the fly is named what
07.02.2026 14:34
π 17870
π 4468
π¬ 250
π 110
We already know prompt repetition is a handy hack to improve a decoder-only LMβs performance as it allows the model to βseeβ bidirectionally, an ability otherwise suppressed by the causal mask.
But what happens if we increase the number of repetitions? π€π§΅ @eaclmeeting.bsky.social #EACL2026
02.02.2026 12:04
π 5
π 4
π¬ 1
π 1
ManagerBench was accepted to ICLR! @iclr-conf.bsky.social #ICLR2026
LLMs are still either unsafe, or completely harm avoidant - even when the harm affects furniture ποΈ
Check out our benchmark, online or in Rio π§π·
04.02.2026 19:45
π 3
π 1
π¬ 0
π 1
Matija Luka Kuki\'c, Marko \v{C}uljak, David Duki\'c, Martin Tutek, Jan \v{S}najder
Sequence Repetition Enhances Token Embeddings and Improves Sequence Labeling with Decoder-only Language Models
https://arxiv.org/abs/2601.17585
28.01.2026 01:15
π 2
π 1
π¬ 0
π 0
simplified overview of our aligned probing setup, where we join the behavioral and internal evaluation of LMs' toxicity
LMs that "know more" about toxicity are less toxic!
Our #TACL π connects behavior and internals:
π LMs amplify toxicity beyond humans
π Information about toxicity peaks in lower layers
π Bypassing these layers increases toxicity
More detailsπ #NLProc #interpretability (1/π§΅)
27.01.2026 13:01
π 11
π 5
π¬ 1
π 0
Can you solve this algebra puzzle? π§©
cb=c, ac=b, ab=?
A small transformer can learn to solve problems like this!
And since the letters don't have inherent meaning, this lets us study how context alone imparts meaning. Here's what we found:π§΅β¬οΈ
22.01.2026 16:09
π 48
π 10
π¬ 2
π 2
β³ Deadline approaching! Weβre hiring 2 fully funded postdocs in #NLP.
Join the MilaNLP team and contribute to our upcoming research projects (SALMON & TOLD)
π Details + how to apply: milanlproc.github.io/open_positio...
β° Deadline: Jan 31, 2026
19.01.2026 17:24
π 11
π 10
π¬ 0
π 1
I'd like a link as well!
15.01.2026 09:20
π 1
π 0
π¬ 0
π 0
Nathan Stringham, Fateme Hashemi Chaleshtori, Xinyuan Yan, Zhichao Xu, Bei Wang, Ana Marasovi\'c
Teaching People LLM's Errors and Getting it Right
https://arxiv.org/abs/2512.21422
29.12.2025 07:45
π 4
π 1
π¬ 0
π 0
π Weβre opening 2 fully funded postdoc positions in #NLP!
Join the MilaNLP team and contribute to our upcoming research projects.
π More details: milanlproc.github.io/open_positio...
β° Deadline: Jan 31, 2026
18.12.2025 15:29
π 19
π 13
π¬ 0
π 2
Llama enjoying a mug of hot cocoa in an office with Tuesday, March 31 circled on a calendar behind them
COLM 2026 is just around the corner! Mark your calendars for:
π‘ Abstract deadline: Thursday, March 26, 2026
π Full paper submission deadline: Tuesday, March 31, 2026
Call for papers (website coming soon):
docs.google.com/document/d/1...
16.12.2025 15:31
π 9
π 4
π¬ 1
π 1
The Doge of Venice visits a Murano glassworks in the 17th century. I will talk about why glassmaking in this era has some similarities to AI research today.
At the #Neurips2025 mechanistic interpretability workshop I gave a brief talk about Venetian glassmaking, since I think we face a similar moment in AI research today.
Here is a blog post summarizing the talk:
davidbau.com/archives/202...
11.12.2025 15:02
π 17
π 3
π¬ 2
π 2
Iβm recruiting a postdoc to work on algorithms for cancer genome reconstruction. We have access to a rich set of tumour samples sequenced across multiple technologies. If interested, feel free to DM. Please share.
11.12.2025 03:04
π 13
π 12
π¬ 0
π 1
π§βπ¬Iβm recruiting PhD students in Natural Language Processing @unileipzig.bsky.social Computer Science, together with @scadsai.bsky.social!
Topics include, but arenβt limited to:
πLinguistic Interpretability
πMultilingual Evaluation
πComputational Typology
Please share!
#NLProc #NLP
11.12.2025 13:36
π 41
π 25
π¬ 1
π 3
I will be @euripsconf.bsky.social this week to present our paper as non-archival at the PAIG workshop (Beyong Regulation:
Private Governance & Oversight Mechanisms for AI). Very much looking forward to the discussions!
If you are at #EurIPS and want to chat about LLM's training data. Reach out!
02.12.2025 21:47
π 9
π 4
π¬ 0
π 0
π’ Postdoc position π’
Iβm recruiting a postdoc for my lab at NYU! Topics include LM reasoning, creativity, limitations of scaling, AI for science, & more! Apply by Feb 1.
(Different from NYU Faculty Fellows, which are also great but less connected to my lab.)
Link in π§΅
02.12.2025 16:04
π 21
π 12
π¬ 2
π 1