We're trying to shed some light into the debate around age verification technologies and how it can be implemented while protecting user privacy.
We're trying to shed some light into the debate around age verification technologies and how it can be implemented while protecting user privacy.
In this line of research we consider trust & safety issues from a broader information warfare (IW) perspective. Turns out, the term IW is used quite loosly ... and now we start talking about other types of warfare. In this paper @afrenzel.bsky.social gives an overview over common IW practices
I am becoming a "safety by design" advocate
Two key members of OpenAIβs βSuperalignmentβ team [...] quit in 2024, saying the company emphasized financial gain over minimizing the dangers [...]"
www.semafor.com/article/02/1...
Ever wondered how Incels and RWEs gatekeep new members online? We did! We find two fundamentally different ways in which members socialize in these extremist communities. Intriguing work by @cvdavid.bsky.social!
I know there is a vivid debate around how to operationalize hate speech. Not sure I agree with TT-US' understanding of what constitutes hateful content:
zeteo.com/p/trump-tikt...
If you're working or researching hate speech issues, consider joining this free 1-hour Violence Prevention Network webinar where we present our tool maxplain.com that supports the legal assessment of potentially hateful images:
www.surveymonkey.com/r/eoohwebinar
As part of GNETβs series aligning with the UNβs 16 Days of Activism Against Gender-Based Violence, Christopher David & @risius.bsky.social examine how Stormfrontβs ideological unity is eroding as members increasingly diverge in their conceptualisations of white women.
@dsrc.bsky.social
What is the gossip on why X is releasing new content moderation features? Seems off-brand to me. What am I missing?
www.techspot.com/news/110332-...
@riekers.bsky.social is doing awesome work using multi-agent systems, in this case to detect hate speech in memes. Quite a challenging task but a promising approach and much more to come.
Reach out if you're interested or struggling to find explainable AI solutions for multimodal applications
We have a new preprint: osf.io/preprints/so...
What have we learned about social media - the constantly moving target of empirical research - over the past decade?
While prebunking has mostly been used in the disinformation context, @marcoduerr.bsky.social explores its potential for dealing with other kinds of harm - in this case authoritarian attitudes. While these preliminary results are promising, we work to further refine these interventions.
Soooooo ... are we now embracing research?... ποΈ
I am so thankful you're tracing this for us
Such great work, Alexios. Here in Germany, the foreign affairs office is tracking this influence operation "Doppelgaenger":
www.auswaertiges-amt.de/resource/blo...
Published 2025 but maybe already outdated? The rapid advance of AI and the fundamental shifts to our (increasingly hyperpersonalized) information environment, we might have to revisit this sooner than expected.
βThe world will be TlΓΆn.β - Jorge Luis Borges, TlΓΆn, Uqbar, Orbis Tertius (1940)
This is where we will share most of our work, for anyone interested in #TrustandSafety reserach
It's a correlation, but it might indicate the value of proper content moderation