CLAUSE - Computational Linguistics @ Bielefeld University's posts
Stefan Hartmann presenting a slide


Happening right now — @stefanhartmann.bsky.social presenting an extremely interesting case study on snowclones like »x is the new y«. 🗣️

1 month ago 13 1 0 1

Tomorrow!

1 month ago 7 1 0 0

I have just returned from a week-long visit to Bielefeld University! Thank you very much for hosting me Sina Zarrieß and @ozgealacam.bsky.social 😊 @clausebielefeld.bsky.social

1 month ago 8 2 1 0

This week we’re having @ecekt.bsky.social as our guest in Bielefeld. She gave a highly timely talk on language+vision models, how they process images under noise conditions, and about how to train a highly effective multimodal BabyLM with model merging. 🗣️👀💻

2 months ago 12 1 0 1

For years since the GPT-2 paper, emergent in-context learning (ICL) from 'next-token' training has been treated as something deeply tied to 𝐡𝐮𝐦𝐚𝐧 𝐥𝐚𝐧𝐠𝐮𝐚𝐠𝐞. But … is it?

3 months ago 2 2 1 1
AI generated image


Am I evil? Am I likeable?

Need a 10-minute break? Like fantasy? Loathe it? Take part in our study and help us by rating images of fictional characters here:
bixprag.lili.uni-bielefeld.de/publix/0aSWK...

3 months ago 2 5 0 0

For this week’s group colloquium, we invited Loulou Kosmala from Paris-Est Créteil University. She gave a talk on multimodal feedback during all types of conversation, from real life to virtual, from learners to adults, from L1 to L2, and more! 🤩

4 months ago 3 0 0 0
Dialogue Is Not Enough to Make a Communicative BabyLM
(But Neither Is Developmentally Inspired Reinforcement Learning)
Francesca Padovani1∗ Bastian Bunzeck2∗ Manar Ali2 Omar Momen2
Arianna Bisazza1 Hendrik Buschmeier2 Sina Zarrieß2
1Center for Language and Cognition (CLCG), University of Groningen
2CRC 1646 – Linguistic Creativity in Communication, Bielefeld University
f.padovani@rug.nl bastian.bunzeck@uni-bielefeld.de


As part of this year's BabyLM challenge, we (researchers from @gronlp.bsky.social and @clausebielefeld.bsky.social) diverged from the established pretraining paradigm by training only on dialogue data from CHILDES.

4 months ago 16 3 1 0

Preprint alert! We release BabyBabelLM, a multilingual benchmark of developmentally plausible training data. I was responsible for German and Polish data as well as various child-directed wikis. Immensely rewarding project with exceptionally cool co-authors. 🥳🚀

5 months ago 11 3 0 1

𝐃𝐨 𝐲𝐨𝐮 𝐫𝐞𝐚𝐥𝐥𝐲 𝐰𝐚𝐧𝐭 𝐭𝐨 𝐬𝐞𝐞 𝐰𝐡𝐚𝐭 𝐦𝐮𝐥𝐭𝐢𝐥𝐢𝐧𝐠𝐮𝐚𝐥 𝐞𝐟𝐟𝐨𝐫𝐭 𝐥𝐨𝐨𝐤𝐬 𝐥𝐢𝐤𝐞? 🇨🇳🇮🇩🇸🇪

Here’s the proof! 𝐁𝐚𝐛𝐲𝐁𝐚𝐛𝐞𝐥𝐋𝐌 is the first Multilingual Benchmark of Developmentally Plausible Training Data available for 45 languages to the NLP community 🎉

arxiv.org/abs/2510.10159

5 months ago 42 16 2 1

Happening in an hour! 🥳

5 months ago 1 0 0 0

If you are at #IWCS, then you should not miss Sanne’s talk “Not Just Who or What: Modeling the Interaction of Linguistic and Annotator Variation in Hateful Word Interpretation” (Sanne Hoeken, Özge Alacam, Dong Nguyen, Massimo Poesio, Sina Zarrieß), tomorrow at 16:30! 🕟
@sannehoeken.bsky.social

5 months ago 4 1 0 1
Sina in front of a slide with different size circles


Sina Zarrieß is giving the KONVENS keynote on training BabyLMs #nlproc
The slide shows the number of words a 12yo human has seen in their lifetime compared to the number of words typical language models have seen in training #llm

6 months ago 6 3 0 0

Happening now: Sina’s keynote on our BabyLM work. 🥳

6 months ago 5 0 0 1

Great first day at #KONVENS2015 today. Looking forward to another engaging day with a keynote by Sina Zarrieß tomorrow 🤓
@clausebielefeld.bsky.social

6 months ago 2 1 1 0

Don’t miss Sina’s keynote on BabyLMs at #konvens tomorrow!

6 months ago 3 0 0 0

Final keynote of #semdial by David Schlangen on “Meaningful Interaction with Unreal Speakers?” 😇💬

6 months ago 2 0 1 0

Final day at #semdial2025 #bialogue: four more presentations, one keynote, and hopefully many engaging discussions. Let's go!

6 months ago 0 1 0 0

Second #semdial keynote by Robert Hawkins on “Foraging for common ground”

6 months ago 3 0 0 0

Day 2 of #semdial starts with a session on LMs and dialogue systems 🤩

6 months ago 3 0 0 0

Actually yes! Dialogue differs distinctly from monologues in terms of phonetic features and in the production of novel phonetic forms!

6 months ago 2 0 0 0

Leonie Schade asks whether it takes two to do an articulatory tango 😁

6 months ago 6 1 1 0

And the second talk features contributions by our PI Sina Zarrieß. 🤩

6 months ago 6 0 1 0

#semdial has begun 💬

6 months ago 1 0 0 0

#semdial is about to begin 🥳

6 months ago 2 2 1 0

Program: semdial2025.github.io/program/
Proceedings: purl.org/semdial/2025...

6 months ago 0 0 0 0

#semdial2025, the long-awaited #bialogue conference, starts tomorrow! We are looking forward to three wonderful conference days featuring three exciting keynotes and many oral and poster presentations on the semantics and pragmatics of dialogue. 👄💬
Check out the program and proceedings below. 👇

6 months ago 3 0 1 1

Let’s go!

7 months ago 3 0 0 0

Is simpler child-directed language easier to learn?

Check out our CoNLL paper "Do Construction Distributions Shape Formal Language Learning in German BabyLMs?"

@conll-conf.bsky.social

7 months ago 2 2 1 0
Components of Creativity: Language Model-based Predictors for Clustering and Switching in Verbal Fluency Sina Zarrieß, Simeon Junker, Judith Sieker, Özge Alacam. Proceedings of the 29th Conference on Computational Natural Language Learning. 2025.

Find the paper here: aclanthology.org/2025.conll-1...

7 months ago 3 0 1 0
CLAUSE - Computational Linguistics @ Bielefeld University
@clausebielefeld
413 Followers 372 Following 40 Posts