and the Polifonia project (lnkd.in/eaUyUBk), it examines how Generative AI can be used to augment the curation of structured data while addressing problems where standard approaches struggle with highly specialised humanities knowledge.
and the Polifonia project (lnkd.in/eaUyUBk), it examines how Generative AI can be used to augment the curation of structured data while addressing problems where standard approaches struggle with highly specialised humanities knowledge.
๐ฌDescription: The talk explores the intersection of semantic technologies and historical research within the context of initiatives like the Listening Experience Database (LED) (lnkd.in/efCzWwgQ)
๐Join us for the CDH Guest Lecture - Enrico Daga: Humanities Knowledge Graphs: opportunities and challenges with gen-AI
๐
Date: March 4th, 2026
โฐTime: 11:00-12:30
๐ซLocation: Turft06 (Turftorenstraat, room 06)
The call is open!
All researchers in linguistics and related fields and research projects are welcome to join and submit an abstract.
Abstracts can be submitted through this link until March 16th 2026. For more information, check out the submission instructions on the TABU Dag website tabudag.nl
Are you based in Groningen and want to help us evaluate the Generative AI puzzle? ๐ซ
We are looking for participants of every age between 16 and 60 years old. ๐
Contact us and we will deliver the puzzle and cards to you in person :)
My latest on Substack -- a write-up of the talk I gave at NeurIPS in December.
aiguide.substack.com/p/on-evaluat...
Thrilled to announce the 1st Workshop on Computational Developmental Linguistics (CDL) at ACL 2026 ๐ A new venue at the intersection of development linguistics ร modern NLP, spearheaded by @fredashi.bsky.social @marstin.bsky.social, and and outstanding team of colleagues!
A thread ๐งต
We canโt wait to welcome you, over the coming months weโll be gradually revealing more information about the amazing keynote speakers who will be presenting at the Academiegebouw next summer ๐คซ. Stay tuned! ๐ค
โฃ
This year Iโm part of the organizing team, and Iโm incredibly excited to work on and plan this historic event for the linguistics community in Groningen and across Northern Europe.โฃ
We are thrilled to officially announce the dates of the 46th edition of ๐ง๐ฎ๐ฏ๐๐๐๐! Save the date for ๐ง๐ต๐๐ฟ๐๐ฑ๐ฎ๐ ๐ฎ๐ป๐ฑ ๐๐ฟ๐ถ๐ฑ๐ฎ๐, ๐๐๐ป๐ฒ ๐ญ๐ญโ๐ญ๐ฎ ๐ซโฃ
๐ฃ I'm hiring ๐ฃ
Post-doc position at @facultyofartsug.bsky.social as part of @rug.nl CIT Innovation Fun "A3i โ Artificial Intelligence Innovation Identifier"
Full call is available here: lnkd.in/esdQ7YGD
๐
Application deadline: Jan. 15 2026
๐ Look what ๐
has broght just before Christmas ๐: a brand new Research Master in Natural Language Processing at @facultyofartsug.bsky.social @rug.nl
Program: www.rug.nl/masters/natu...
Applications (2026/2027) are open! Come and study with us (you will also learn why we have a ๐ฎ in our logo)
The Multilingual Minds & Machines Meetings call for abstracts is now open! Everything you need to know is here -> mmmm2026.github.io
It was wonderful to continue our personal and professional exchange outside the walls of the department and with other colleagues! Thanks for stopping by :)
Then at our weekly @gronlp.bsky.social Reading Group she talked about โ๐๐ก๐๐ญ ๐๐จ ๐ฌ๐๐ฅ๐-๐ฌ๐ฎ๐ฉ๐๐ซ๐ฏ๐ข๐ฌ๐๐ ๐ฌ๐ฉ๐๐๐๐ก ๐ฆ๐จ๐๐๐ฅ๐ฌ ๐ฅ๐๐๐ซ๐ง ๐๐๐จ๐ฎ๐ญ ๐๐ฎ๐ญ๐๐ก?โ, which is also part of an ongoing research project. ๐
Marianne presented her work on โ๐๐ฎ๐ฆ๐๐ง-๐ฅ๐ข๐ค๐ ๐ฅ๐ข๐ง๐ ๐ฎ๐ข๐ฌ๐ญ๐ข๐ ๐๐ข๐๐ฌ๐๐ฌ ๐ข๐ง ๐ง๐๐ฎ๐ซ๐๐ฅ ๐ฌ๐ฉ๐๐๐๐ก ๐ฆ๐จ๐๐๐ฅ๐ฌโ at the monthly edition of the Linguistic Lunch I organize in our CLCG department.
Last week I had the pleasure of hosting a fantastic friend and researcher, @mdhk.net , who came to visit us in Groningen for a couple of days from Amsterdam! ๐
Cognitive Modeling and Computational Linguistics (CMCL) workshop will be co-located with LREC 2026 in Palma, Mallorca!๐ดStay tuned for more details!โจ
@byungdoh.bsky.social Tatsuki Kuribayashi @grambelli.bsky.social Philipp Wicke, Jixing Li, Ryo Yoshida @cmclworkshop.bsky.social
๐ฃ Save the date ๐๏ธ to present your exciting statistical learning research at the 6th Interdisciplinary Advances in Statistical Learning Conference June 10-12 2026 in San Sebastiรกn ๐ช๐ธ
Keynotes by
@jennysaffran.bsky.social
@noranewcombe.bsky.social
@pyoudeyer.bsky.social
More info to follow #IASL26
Breaking: we release a fully synthetic generalist dataset for pretraining, SYNTH and two new SOTA reasoning models exclusively trained on it. Despite having seen only 200 billion tokens, Baguettotron is currently best-in-class in its size range. pleias.fr/blog/blogsyn...
Interested in developmentally plausible LMs, and the role of child-directed language data?
Come to our poster TODAY (Fr 7 Nov, 10.30-12.00) #EMNLP!
Thrilled to be heading to Suzhou with a big team of GroNLP'ers ๐ฎ
Interested in Interpretable, Cognitively inspired, Low-resource LMs? Don't miss our posters & talks #EMNLP2025!
...ii) a direct quality reward from a teacher model, and iii) a reward based on the log probabilities of a teacher model (and its dialogue continuations). While these rewards did not improve our models performance, two different DPO approaches did!
Dialogue Is Not Enough to Make a Communicative BabyLM (But Neither Is Developmentally Inspired Reinforcement Learning) Francesca Padovani1โ Bastian Bunzeck2โ Manar Ali2 Omar Momen2 Arianna Bisazza1 Hendrik Buschmeier2 Sina Zarrieร2 1Center for Language and Cognition (CLCG), University of Groningen 2CRC 1646 โ Linguistic Creativity in Communication, Bielefeld University f.padovani@rug.nl bastian.bunzeck@uni-bielefeld.de
As part of this year's BabyLM challenge, we (researchers from @gronlp.bsky.social and @clausebielefeld.bsky.social diverged from established pretraining paradigm by training only on dialogue data from CHILDES.
Donโt hesitate to reach out with any questions or doubts :)
Every contribution is more than welcome! ๐๐
These are living datasets that researchers around the world can enrich as new resources become available
On the website of the project babylm.github.io/babybabellm/
you can find information about the dataset creation pipeline, training, and evaluation.
This was made possible thanks to the dedication of each language-specific expert and the coordination of senior researchers, @jumelet.bsky.social and @lchoshen.bsky.social among others.
๐๐จ ๐ฒ๐จ๐ฎ ๐ซ๐๐๐ฅ๐ฅ๐ฒ ๐ฐ๐๐ง๐ญ ๐ญ๐จ ๐ฌ๐๐ ๐ฐ๐ก๐๐ญ ๐ฆ๐ฎ๐ฅ๐ญ๐ข๐ฅ๐ข๐ง๐ ๐ฎ๐๐ฅ ๐๐๐๐จ๐ซ๐ญ ๐ฅ๐จ๐จ๐ค๐ฌ ๐ฅ๐ข๐ค๐? ๐จ๐ณ๐ฎ๐ฉ๐ธ๐ช
Hereโs the proof! ๐๐๐๐ฒ๐๐๐๐๐ฅ๐๐ is the first Multilingual Benchmark of Developmentally Plausible Training Data available for 45 languages to the NLP community ๐
arxiv.org/abs/2510.10159
This was made possible thanks to the dedication of each language-specific expert and the coordination of senior researchers, in particular @jumelet.bsky.social ! ๐คฉ
Computational Psycholinguistics Meeting 2025
cpl2025.sites.uu.nl
When: December 18โ19, 2025
Where: Utrecht, the Netherlands
Abstract submission deadline: June 15, 2025
Organizers: Jakub Dotlaฤil, Lena Jรคger, Bruno Nicenboim, Ece Takmaz