LLMs are nothing more than models of the distribution of the word forms in their training data, with weights modified by post-training to produce somewhat different distributions.
The AI discourse sometimes seems to center on "Is AI good or is it bad?"
I find this framing unproductive. AI is not a fixed thing.
I would prefer to ask "How might we use this technology for good, and mitigate the bad?"
What a shame if the best use we can come up with is no use at all.
To kick off the PhD journey with @pseudomanifold.topology.rocks:
What are the limitations of the WL metric, and what is an informative metric?
We answer these questions with our Graph Homomorphism Distortion
arxiv.org/abs/2511.03068
@olgatticus.bsky.social, Kavir and @erikjbekkers.bsky.social
"He is from [MASK] [MASK]" โ "San York"? dLLMs fail because they ignore token dependencies. This Factorization Barrier arises from a structural misspecification: models are restricted to fully factorized outputs. We break this barrier with CoDD, enabling coherent parallel generation. ๐
Sam is a snake
time traveler from 12 months from now just sent me this
In light of the current funding situation (worldwide), a modest proposal: instead of pouring billions of dollars into GenAI claiming "it *could* accelerate science and research," consider putting 1% of that amount in what *will* accelerate science and research. Namely, funding science and research.
why do science? it won't make the model Bigger
I spent way too long trying to understand stop gradients lol arxiv.org/abs/2104.00428 (see the first appendix).
I'd argue it isn't about the loss, but rather that you're defining a surrogate loss that should optimise the true loss you're interested in.
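A minimal sketch of that surrogate-loss trick, assuming a toy Gaussian model (the names `f`, `sample`, and `log_prob` are made up, not code from the paper): the surrogate's value is not the loss we care about, but its gradient is an unbiased estimate of the gradient of the true expected loss.

```python
# Stop-gradient surrogate (score-function / REINFORCE style), illustrative only.
import jax
import jax.numpy as jnp
from jax.lax import stop_gradient

def f(x):                      # the quantity whose expectation we want to minimise
    return (x - 2.0) ** 2

def sample(theta, key):        # x ~ Normal(theta, 1)
    return theta + jax.random.normal(key)

def log_prob(theta, x):        # log N(x; theta, 1) up to an additive constant
    return -0.5 * (x - theta) ** 2

def surrogate(theta, key):
    # Treat the sample as a constant w.r.t. theta, so the only gradient path
    # is through log_prob; the result is f(x) * d/dtheta log p_theta(x).
    x = stop_gradient(sample(theta, key))
    return stop_gradient(f(x)) * log_prob(theta, x)

key = jax.random.PRNGKey(0)
grad_est = jax.grad(surrogate)(0.0, key)   # stochastic gradient of E[f(x)]
print(grad_est)
```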
X is hiring a creative writing specialist at $40 an hour to make Grok better at writing, and the qualifications are a true LOL.
New open source: cuthbert
State space models with all the hotness: (temporally) parallelisable, JAX, Kalman, SMC
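For readers unfamiliar with the model class, here is a minimal linear-Gaussian filtering sketch in JAX. This is not the cuthbert API; all matrices and names are made up, and the sequential `lax.scan` below is exactly the step that temporally parallelisable libraries replace with a parallel scan.

```python
# Kalman filter for x_t = A x_{t-1} + w_t, y_t = H x_t + v_t (illustrative only).
import jax
import jax.numpy as jnp

A = jnp.array([[1.0, 1.0], [0.0, 1.0]])   # constant-velocity dynamics
H = jnp.array([[1.0, 0.0]])               # we only observe position
Q = 0.01 * jnp.eye(2)                     # process noise covariance
R = jnp.array([[0.1]])                    # observation noise covariance

def kalman_step(carry, y):
    m, P = carry
    # predict
    m_pred = A @ m
    P_pred = A @ P @ A.T + Q
    # update
    S = H @ P_pred @ H.T + R
    K = P_pred @ H.T @ jnp.linalg.inv(S)
    m_new = m_pred + K @ (y - H @ m_pred)
    P_new = (jnp.eye(2) - K @ H) @ P_pred
    return (m_new, P_new), m_new

ys = jnp.sin(jnp.arange(50.0))[:, None]            # toy observations
init = (jnp.zeros(2), jnp.eye(2))
(_, _), means = jax.lax.scan(kalman_step, init, ys)  # sequential here; such
                                                     # libraries parallelise this in time
print(means.shape)  # (50, 2)
```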
Best conference with the best people and in the best place!
Also the submission deadline is conveniently one month later than #ICML2026, just in case you needed it.
The 20th conference on Neurosymbolic AI will be in Lisbon, Portugal, September 1-4, 2026!
The CFP is out: 2026.nesyconf.org/call-for-pap... with two phases:
Deadline 1: Feb 24 (abstract), Mar 3 (full)
Deadline 2: Jun 9 (abstract), Jun 16 (full)
#neurosymbolic #NeSy2026
We introduce epiplexity, a new measure of information that provides a foundation for how to select, generate, or transform data for learning systems. We have been working on this for almost 2 years, and I cannot contain my excitement! arxiv.org/abs/2601.03220 1/7
Good call! I maintain a list of Neurosymbolic folks on Bsky, see here:
go.bsky.app/RMJ8q3i
I am recruiting 1 PhD student (4-year position) and 2 postdocs (3-year positions) to work on logic and machine learning at the University of Helsinki:
- PhD 1: jobs.helsinki.fi/job/Helsinki...
- Postdoc 1: jobs.helsinki.fi/job/Helsinki...
- Postdoc 2: jobs.helsinki.fi/job/Helsinki...
#XAI, #neurosymbolic methods #nesy and #causal #representation #learning #CRL all care about learning #interpretable #concepts, but in different ways.
We are organizing this #ICLR2026 workshop to bring these three communities together and learn from each other!
Submission deadline: 30 Jan 2026
Thanks for the fantastic talk, and totally agree! (Writing this on the train from Copenhagen :-))
Emile will present our work on Knowledge Graph Embeddings at Eurips' Salon des Refusés on Friday!
We show how linearity prevents KGEs from scaling to larger graphs, and propose a simple solution using a Mixture of Softmaxes (see the LLM literature) to break this limitation at a low parameter cost.
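For context, a minimal sketch of the Mixture-of-Softmaxes idea from the LLM literature applied to scoring entities; shapes, parameter names, and the per-component maps are illustrative assumptions, not the paper's model.

```python
# Mixture of Softmaxes (MoS) output layer sketch. A single softmax over entity
# scores is rank-limited by the embedding dimension; mixing K softmaxes breaks
# that bottleneck at a small parameter cost.
import jax
import jax.numpy as jnp

def mos_scores(query, entity_emb, mix_proj, comp_proj):
    """query: (d,), entity_emb: (num_entities, d),
    mix_proj: (d, K) mixture-weight projection, comp_proj: (K, d, d) component maps."""
    pi = jax.nn.softmax(query @ mix_proj)                      # (K,) mixture weights
    comp_queries = jnp.einsum("kde,d->ke", comp_proj, query)   # (K, d) per-component queries
    logits = comp_queries @ entity_emb.T                       # (K, num_entities)
    probs = jax.nn.softmax(logits, axis=-1)                    # one softmax per component
    return pi @ probs                                          # (num_entities,) mixture distribution

d, n, K = 16, 1000, 4
k1, k2, k3, k4 = jax.random.split(jax.random.PRNGKey(0), 4)
q = jax.random.normal(k1, (d,))
E = jax.random.normal(k2, (n, d))
W_pi = jax.random.normal(k3, (d, K))
W_c = jax.random.normal(k4, (K, d, d))
p = mos_scores(q, E, W_pi, W_c)
print(p.shape, float(p.sum()))   # (1000,) sums to ~1.0
```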
Recordings of the NeSy 2025 keynotes are now available!
Check out insightful talks from @guyvdb.bsky.social, @tkipf.bsky.social and D McGuinness on our new YouTube channel www.youtube.com/@NeSyconfere...
Topics include using symbolic reasoning for LLMs, and object-centric representations!
New paper alert!
We introduce Vision-Language Programs (VLP), a neuro-symbolic framework that combines the perceptual power of VLMs with program synthesis for robust visual reasoning.
Interested in meeting up in Copenhagen? Do shoot a message!
And finally #3
Rank bottlenecks in KGEs:
At Friday's "Salon des Refusés" I will present @sbadredd.bsky.social's new work on how rank bottlenecks limit knowledge graph embeddings
arxiv.org/abs/2506.22271
#2
GRAPES: At Tuesday's ELLIS Unconference poster session.
We study adaptive graph sampling for scaling GNNs!
Work with Taraneh Younesian, Daniel Daza, @thiviyan.bsky.social, @pbloem.sigmoid.social.ap.brid.gy
arxiv.org/abs/2310.03399
Almost off to @euripsconf.bsky.social in Copenhagen! I'll present 3 posters:
Neurosymbolic Diffusion Models: Thursday's poster session.
Going to NeurIPS? @edoardo-ponti.bsky.social and @nolovedeeplearning.bsky.social will present the paper in San Diego Thu 13:00
arxiv.org/abs/2505.13138
The simplex algorithm is super efficient. 80 years of experience says it takes a roughly linear number of pivot steps in practice. Nobody can explain _why_ it is so fast.
We invented a new algorithm analysis framework to find out.
Exactly the same here...
Want to use your favourite #NeSy model but afraid of the reasoning shortcuts?
Fear not! In our #NeurIPS2025 paper we show that you just need to equip your favourite NeSy model with prototypical networks and the reasoning shortcuts will be a problem of the past!
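For readers unfamiliar with prototypical networks, here is a minimal sketch of a prototypical concept layer; names and shapes are illustrative assumptions, not the paper's architecture.

```python
# Prototypical classification sketch: concepts are predicted by distance to
# learned class prototypes rather than by an arbitrary linear head.
import jax
import jax.numpy as jnp

def proto_log_probs(embedding, prototypes):
    """embedding: (d,) encoder output; prototypes: (num_concepts, d)."""
    sq_dists = jnp.sum((prototypes - embedding) ** 2, axis=-1)   # (num_concepts,)
    return jax.nn.log_softmax(-sq_dists)                         # closer prototype => higher prob

d, c = 8, 4
k1, k2 = jax.random.split(jax.random.PRNGKey(0))
z = jax.random.normal(k1, (d,))            # e.g. the perception module's output
protos = jax.random.normal(k2, (c, d))     # one prototype per symbolic concept
print(jnp.exp(proto_log_probs(z, protos))) # concept distribution fed to the reasoner
```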
I'm in Suzhou to present our work on MultiBLiMP, Friday @ 11:45 in the Multilinguality session (A301)!
Come check it out if you're interested in multilingual linguistic evaluation of LLMs (there will be parse trees on the slides! There's still use for syntactic structure!)
arxiv.org/abs/2504.02768
Introducing BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data!
LLMs learn from vastly more data than humans ever experience. BabyLM challenges this paradigm by focusing on developmentally plausible data.
We extend this effort to 45 new languages!