Memorization vs. generalization in deep learning: implicit biases, benign overfitting, and more
Or: how I learned to stop worrying and love the memorization
What is the relationship between memorization and generalization in AI? Is there a fundamental tradeoff? In infinitefaculty.substack.com/p/memorizati... Iโve reviewed some of the evolving perspectives on memorization & generalization in machine learning, from classic perspectives through LLMs.
18.02.2026 15:54
๐ 133
๐ 27
๐ฌ 4
๐ 5
Top: Modality shift in block instruction: R1: Take the green block and put it on the left side of the grid. A hand is holding an imaginary piece toward the left column of a 2ร2 grid; label reads Redundant position and orientation. R4: the green block pointing this way. A hand is pointing near the bottom left cell with an arrow showing movement toward the top left cell; label reads Complementary position and orientation. Target tower: a 3 block green and red C-shape tower on a 2x2 grid. Bottom: Modality shift in tower instruction: R1: they are going to form a C-shape. A c-shape hand pose with the index and thumb is shown far from the grid; the label reads No information about position or orientation. R4: Put the C on the left side, facing away from you. Right hand shows the C shape facing away, and left hand with the palm open indicates placement on the left side; labels read Redundant position and orientation.
People form ad hoc conventions, establishing linguistic & gestural abstractions, and shift information across speech and gesture to communicate more efficiently over time.
We study this in our #CHI2026 paper, led by Kiyosu Maeda with @judithfan.bsky.social @rdhawkins.bsky.social and team
๐งต๐โจ1/4
19.02.2026 20:43
๐ 21
๐ 3
๐ฌ 1
๐ 0
How do diverse context structures reshape representations in LLMs?
In our new work, we explore this via representational straightening. We found LLMs are like a Swiss Army knife: they select different computational mechanisms reflected in different representational structures. 1/
04.02.2026 02:54
๐ 38
๐ 11
๐ฌ 1
๐ 1
1/7 Can infants recognise the world around them? ๐ถ๐ง As part of the FOUNDCOG project, we scanned 134 awake infants using fMRI. Published today in Nature Neuroscience, our research reveals 2-month-old infants already possess complex visual representations in VVC that align with DNNs.
02.02.2026 16:00
๐ 155
๐ 70
๐ฌ 4
๐ 8
Our paper (w/ @bodowinter.bsky.social and @mperlman.bsky.social) is finally out, officially ๐ฅณ. In it, we set ourselves the lofty goal of defining iconicity, focusing on its subjectivity, context-dependence, and gradability. Let us know if you agree with our definition? ๐ค
doi.org/10.1093/oxfo...
02.02.2026 17:44
๐ 26
๐ 8
๐ฌ 3
๐ 0
๐ญ How do LLMs (mis)represent culture?
๐งฎ How often?
๐ง Misrepresentations = missing knowledge? spoiler: NO!
At #CHI2026 we are bringing โจTALESโจ a participatory evaluation of cultural (mis)reps & knowledge in multilingual LLM-stories for India
๐ arxiv.org/abs/2511.21322
1/10
02.02.2026 21:38
๐ 45
๐ 21
๐ฌ 1
๐ 2
With some trepidation, I'm putting this out into the world:
gershmanlab.com/textbook.html
It's a textbook called Computational Foundations of Cognitive Neuroscience, which I wrote for my class.
My hope is that this will be a living document, continuously improved as I get feedback.
09.01.2026 01:27
๐ 585
๐ 237
๐ฌ 16
๐ 10
๐จ New preprint ๐จ
Excited to share new work led by @tikhomirova.bsky.social, presenting a large-scale evaluation of the accessibility of psycholinguistic information in transformer language models.
Preprint: arxiv.org/abs/2601.03798
08.01.2026 10:48
๐ 8
๐ 5
๐ฌ 1
๐ 0
Unnsteinsson Elmar & Harris Daniel W., Genre and Conversation - PhilPapers
Conversations can belong to different types, or genres. We consider four dimensions of variation as case studies: Some conversations are about sharing information, others about making decisions; some ...
Elmar Unnsteinsson (@eunnsteins.bsky.social) and I have a new paper forthcoming in Noรปs.
"Genre and Conversation"
We show how to generalize the classic pragmatic theories to conversational genres that aren't factual, cooperative, committal information exchanges.
philpapers.org/rec/ELMGAC-2
07.01.2026 20:01
๐ 31
๐ 4
๐ฌ 1
๐ 4
New paper from the IMC lab! I am very excited about this one. For years, I have been arguing that one of the main claims of the so-called "simulation heuristic" is likely not true for episodic counterfactual thinking, namely that the harder it is to mentally simulate it, the less plausible (1/n)
07.01.2026 23:11
๐ 24
๐ 12
๐ฌ 1
๐ 1
New paper w/ @ryskin.bsky.social in Open Mind!
Words change: โbroadcastโ once meant scattering seeds; โtweetโ was just a bird sound. Do older adults keep earlier meanings, or update as language evolves?
Our new paper investigates how semantic representations differ across age groups. ๐งต๐
02.01.2026 18:48
๐ 17
๐ 4
๐ฌ 2
๐ 1
Will US science survive Trump 2.0?
President Donald Trump and his administration have gutted science agencies, terminated research programmes and cancelled billions of dollars in grants to universities. What are the long-term impacts f...
The question is an open one and it depends on us. The would-be destroyers of science #Trump #Vought #RFKJr #Bhattacharya #Memoli are just men. Zealots, fools, charlatans, opportunists. There are more of us than there are of them. www.nature.com/articles/d41...
29.12.2025 11:07
๐ 115
๐ 52
๐ฌ 4
๐ 4
Redirecting
New study out in Neuron: doi.org/10.1016/j.ne.... This work led by Zaid Zada uses fMRI hyperscanning of real dyads to show that speaking and listening rely on shared neural systems; and that conversation recruits unique brain processes that aren't observed in passive comprehension.
18.12.2025 22:08
๐ 15
๐ 5
๐ฌ 0
๐ 0
Why Do Humans Have Linguistic Intuition?
| Cadernos de Linguรญstica
Thom Scott-Phillips presents a novel analysis of people's spontaneous intuitions about sentence acceptability "grounded in theoretical and empirical knowledge from cognitive linguistics, cognitive psychology and evolutionary approaches to the mind." cadernos.abralin.org/index.php/ca...
18.12.2025 19:11
๐ 14
๐ 9
๐ฌ 1
๐ 0
Now out in TopiCS as "Simulating Symbolic Evolution in the Lab: Potentials and Implications of Using Transmission Chains to Study Early Symbolic Behavior at the Emergence of Homo sapiens" doi.org/10.1111/tops... Thread below! w @felixthehauskat.bsky.social & many others.
18.12.2025 14:03
๐ 15
๐ 1
๐ฌ 0
๐ 1
Now in press at Topics in Cognitive Science! We review children's understanding of different kinds of visual media across contexts, and argue that this work has important implications for childhood learning and assessment tools @cogscisociety.bsky.social
onlinelibrary.wiley.com/doi/10.1111/...
13.12.2025 01:58
๐ 32
๐ 9
๐ฌ 1
๐ 0
From sensory to perceptual manifolds: The twist of neural geometry
The brain uses geometric twists to expand neural dimensionality, thus untangling perception from sensation.
www.science.org/doi/10.1126/...
damn this is so very clever!
"... we investigated how the brain categorizes stimuli that are not linearly separable in the physical world ... The sensory manifold was ... expanded into a seven-dimensional perceptual manifold..."
13.12.2025 09:07
๐ 70
๐ 9
๐ฌ 1
๐ 1
The MIT Press and Open Mind partner with Lyrasis to support diamond open access publishing through the Open Access Community Investment Program
The Open Access Community Investment Program (OACIP), an innovative model for community action, will seek support for MIT Press journal Open Mind through July 2026
The Press and @openmindjournal.bsky.social are pleased to announce a partnership with Lyrasis through the Open Access Community Investment Program (OACIP).
Learn how your institution can support this initiative to continue providing the latest #cogsci researchโfree of chargeโhere: bit.ly/452nMma
11.12.2025 14:30
๐ 22
๐ 9
๐ฌ 1
๐ 0
Sign on bathroom wall:
(In red) IF TOILET IS STUCK, DO NOT FLUSH AGAIN OR TOILET WILL OVERFLOW. (Picture of overflowing toilet)
(In green) GET A STAFF:
TO HELP OR USE THE PLUNGER. (Two pictures of plungers)
This is a nice in-the-wild example of a rather complex dynamic semantic speech act: a conditional with a factual antecedent, an imperative consequent which itself embeds a factual consequent, producing a warning, and is itself conjoined to another disjunctive imperative consequent.
๐ฆ๐ฆ #philsky
08.12.2025 01:32
๐ 22
๐ 3
๐ฌ 6
๐ 1
Approaching lexical variation in Swedish Sign Language
Languages exhibit variation, which may reflect ethnic, geographic, social or age- or gender-based differences between language users. Many sign languages are known to exhibit lexicalย variation, with m...
New paper on lexical variation in Swedish Sign Language with Swedish colleagues.
Using a combo of elicitation (in-person), survey (online) & corpus data, we look at some changes in lexical choices over time & discuss methods for measuring variation of variation
#linguistics
doi.org/10.16995/glo...
10.12.2025 10:57
๐ 23
๐ 5
๐ฌ 1
๐ 0
If you speak whale, you may know that sperm whales communicate with click vocalizations that they group into units called codas. A new study in @openmindjournal.bsky.social shows that within these codas appear to be vowels & diphthongs, used similarly to humans: direct.mit.edu/opmi/article...
10.12.2025 13:16
๐ 21
๐ 7
๐ฌ 1
๐ 0