Marten Risius (@risius)

We're trying to shed some light on the debate around age verification technologies and how they can be implemented while protecting user privacy.

25.02.2026 07:18 👍 0 🔁 0 💬 0 📌 0

In this line of research we consider trust & safety issues from a broader information warfare (IW) perspective. Turns out, the term IW is used quite loosely ... and now we start talking about other types of warfare. In this paper @afrenzel.bsky.social gives an overview of common IW practices.

16.02.2026 11:16 👍 2 🔁 0 💬 0 📌 0

Anthropic safety researcher quits, warning ‘world is in peril’ Other AI safety researchers have also left leading firms, citing concerns about potentially catastrophic risks.

I am becoming a "safety by design" advocate

"Two key members of OpenAI’s “Superalignment” team [...] quit in 2024, saying the company emphasized financial gain over minimizing the dangers [...]"

www.semafor.com/article/02/1...

11.02.2026 21:47 👍 1 🔁 0 💬 0 📌 0

Ever wondered how Incels and RWEs gatekeep new members online? We did! We find two fundamentally different ways in which members socialize in these extremist communities. Intriguing work by @cvdavid.bsky.social!

02.02.2026 18:43 👍 2 🔁 1 💬 0 📌 0

The Disastrous Rollout of the Trump-Approved TikTok Serves as a Stark Warning for Us All Big Tech platforms wield far too much power over our information landscape to continue operating without transparency.

I know there is a lively debate around how to operationalize hate speech. Not sure I agree with TT-US' understanding of what constitutes hateful content:
zeteo.com/p/trump-tikt...

01.02.2026 20:32 👍 0 🔁 0 💬 0 📌 0

AI For Good? An EOOH Webinar

If you're working on or researching hate speech issues, consider joining this free 1-hour Violence Prevention Network webinar, where we present our tool maxplain.com, which supports the legal assessment of potentially hateful images:

www.surveymonkey.com/r/eoohwebinar

14.01.2026 16:19 👍 1 🔁 0 💬 0 📌 0

Fractures on the (Storm-)Front: Contesting the Role of Women in White Supremacy - GNET

As part of GNET’s series aligning with the UN’s 16 Days of Activism Against Gender-Based Violence, Christopher David & @risius.bsky.social examine how Stormfront’s ideological unity is eroding as members increasingly diverge in their conceptualisations of white women.

@dsrc.bsky.social

01.12.2025 18:58 👍 4 🔁 3 💬 0 📌 1

X wants to call you out for using a VPN (and maybe catch a few trolls, too) The situation began in October when Mikita Bier, X's Head of Product, announced that new information would be shown on user profiles, such as the date they...

What is the gossip on why X is releasing new content moderation features? Seems off-brand to me. What am I missing?

www.techspot.com/news/110332-...

25.11.2025 12:32 👍 2 🔁 0 💬 0 📌 0

@riekers.bsky.social is doing awesome work using multi-agent systems, in this case to detect hate speech in memes. Quite a challenging task but a promising approach and much more to come.

Reach out if you're interested or struggling to find explainable AI solutions for multimodal applications

04.11.2025 13:59 👍 3 🔁 0 💬 0 📌 0

We have a new preprint: osf.io/preprints/so...

What have we learned about social media - the constantly moving target of empirical research - over the past decade?

30.10.2025 10:53 👍 84 🔁 39 💬 2 📌 4

While prebunking has mostly been used in the disinformation context, @marcoduerr.bsky.social explores its potential for dealing with other kinds of harm - in this case authoritarian attitudes. These preliminary results are promising, and we are working to refine these interventions further.

23.10.2025 10:05 👍 3 🔁 0 💬 0 📌 0

Soooooo ... are we now embracing research?... 👁️

20.10.2025 19:51 👍 0 🔁 0 💬 0 📌 0

I am so thankful you're tracing this for us

20.10.2025 19:40 👍 1 🔁 0 💬 0 📌 0

Such great work, Alexios. Here in Germany, the foreign affairs office is tracking this influence operation "Doppelgaenger":

www.auswaertiges-amt.de/resource/blo...

20.10.2025 10:53 👍 0 🔁 0 💬 1 📌 0

Published 2025 but maybe already outdated? Given the rapid advance of AI and the fundamental shifts in our (increasingly hyperpersonalized) information environment, we might have to revisit this sooner than expected.

14.10.2025 12:33 👍 1 🔁 0 💬 0 📌 0

“The world will be Tlön.” - Jorge Luis Borges, Tlön, Uqbar, Orbis Tertius (1940)

08.10.2025 13:33 👍 2 🔁 0 💬 0 📌 0

This is where we will share most of our work, for anyone interested in #TrustandSafety research

08.10.2025 13:05 👍 2 🔁 0 💬 0 📌 0

It's a correlation, but it might indicate the value of proper content moderation

03.10.2025 13:39 👍 0 🔁 0 💬 0 📌 0

ALT: a picture of a smiling man with the name tom on top

No welcome from Tom Anderson?... I don't know about this

29.09.2025 06:07 👍 6 🔁 1 💬 0 📌 0
