michiel van der ree

@mhvdr.nl

doing applied (ai ∪ ml ∪ nlp ∪ llms) to accelerate research @ university of groningen (nl)

108 Followers · 165 Following · 3 Posts · Joined 26.10.2024

Latest posts by michiel van der ree @mhvdr.nl

Parliamentary votes

With Dutch parliamentary elections next week, explore this interactive AI-powered analysis of every vote since December 2023. See parties’ stances by topic, impact and beneficiaries (NB in Dutch).
datascience.web.rug.nl/parliamentar...

24.10.2025 09:49 👍 0 🔁 0 💬 0 📌 0
The Model is the Product | Vintage Data Old data, new models

The model is the product.

New blog post on what the latest research trends mean for the next commercial cycle: specialized models behaving like integrated systems, model providers moving up to the application layer, training or being trained on. vintagedata.org/blog/posts/m...

02.03.2025 13:57 👍 25 🔁 5 💬 3 📌 3

I'll get straight to the point.

We trained 2 new models. Like BERT, but modern. ModernBERT.

Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff.

It's much faster, more accurate, longer context, and more useful. 🧵

19.12.2024 16:45 👍 620 🔁 147 💬 19 📌 34

“They said it could not be done.” We’re releasing Pleias 1.0, the first suite of models trained on open data (either permissively licensed or uncopyrighted): Pleias-3b, Pleias-1b and Pleias-350m, all based on the two-trillion-token set from Common Corpus.

05.12.2024 16:39 👍 248 🔁 85 💬 11 📌 19

🪼 Welcome JUST-OS!

JUST-OS is an exciting initiative by researchers at the University of Groningen and FORRT (@forrt.bsky.social; forrt.org). We’re developing an AI-based chatbot to simplify navigating Open Science resources.

21.11.2024 15:40 👍 14 🔁 13 💬 1 📌 1
Ai2 OpenScholar: Scientific literature synthesis with retrieval-augmented language models | Ai2
Ai2’s & UW’s OpenScholar, a retrieval-augmented LM, helps scientists navigate and synthesize scientific literature.

Try it out: openscholar.allen.ai
Read more: allenai.org/blog/opensch...
Paper: openscholar.allen.ai/paper

19.11.2024 15:38 👍 5 🔁 2 💬 1 📌 0

Thanks for the heads up! Not sure if I'm the intended audience but @mark-l-thompson.bsky.social can fill me in if he sees any way I can contribute.

18.11.2024 08:39 👍 2 🔁 0 💬 1 📌 0

Releasing two trillion tokens in the open. huggingface.co/blog/Pclangl...

13.11.2024 17:59 👍 118 🔁 42 💬 5 📌 2

Introducing Early American HistoriChat, a chatbot trained on the EvansTCP corpus, ~5,000 American printed texts from 1640 to 1800: eahc.mhvdr.nl

Inspired by the work of @dorialexander.bsky.social; designed & built by @michielree.bsky.social in collaboration with MLT & the H-GEAR project

30.10.2024 13:36 👍 7 🔁 5 💬 1 📌 1

liking 10 @vickiboykis.com posts to bootstrap this bsky algorithm

26.10.2024 15:48 👍 10 🔁 0 💬 1 📌 1