This is super helpful feedback! And obviously not good. Although we are using a third party classifier for the weather report, I’ll chat with my collaborators about ways for us to adjust this. Thanks Tea!
@fayjohnson
Depolarization Expert. Trust & Safety Product Executive at LinkedIn. Former T&S @ Meta, Twitter, Nextdoor; Fellow, Harvard’s Berkman Klein Center; Founder, Deliberate Discourse. Speaker Creator of CLR:SKY a civility project for Bluesky. www.fayjohnson.com
This is super helpful feedback! And obviously not good. Although we are using a third party classifier for the weather report, I’ll chat with my collaborators about ways for us to adjust this. Thanks Tea!
Thanks Ian. The goal here isn’t to automate anything. And unchecked hate speech has often hurt marginalized ppl more than the insulated majority. The hope here is to reduce harm.
And the tool doesn’t alter your post (unless you use the rewrite tool as an assist); it doesn’t stop you from posting.
The goal is to provide a visible indicator of how content is often scored. It’s not an enforcement tool. It’s making visible classification that often happens on the backend. There’s a lot of critique of downranking that happens after posting; this is a pre-post indicator that you can take or leave
We/I don’t own this model but all classification models have errors and fall short. It’s one of the reasons who chose a broad range of weather scores that are directional and not a binary, to allow the user to take it or leave it.
Hey Karl. The score is not just word based — it’s a tone classifier more broadly.
That’s a really valid critique. I appreciate you bringing it up.
Hey Tea. The tone classifier it not analyzing the political position or accuracy of the words but their overall tone, so if a word that is negative is included the weather score will change. Thanks for playing around with it! Appreciate the feedback.
Having worked on moderation @ large platforms I’m painfully aware that moderation at scale gets things wrong. And AI by nature is biased. The idea here was to make a common scoring framework visible as a signal that authors can use (or not!), a response to the critique of the invisible “hand” of mod
Hey All. Thanks for your input. @clrskyai.bsky.social is a project I am testing as part of a research project so your input is welcomed. The weather scores model is based on the # of ppl out of 10 who might find something offensive. The highest score = 9/10. A lot of moderation is hidden. This isn’t
Hey Erica! Great question and one we should always ask of both AI and moderation systems. In this case, the classification is defined by Perspective API, which is scored on what X/10 ppl surveyed believed is toxic in nature. I don’t know who was included in that survey pool, so representation is key
Looking forward to this!
There are 6K of you using our CLR:SKY (www.clrsky.ai) overlay for @bsky.app. If you are finding the weather reports and rewrite assistance useful, I'd love to hear about! We'll share findings next wk at the Berkman Klein Center for Internet & Society @harvard.edu
#research #harvard #socialmedia
It is current 'written over', although we are working on a 'undo' button so you can revert to the original if you prefer it. No storage of anything you type.
Officially launching 🚀 today CLR:SKY @clrskyai.bsky.social - a civility tool built into @bsky.app's composer window. No new login credentials needed - simply access your Bluesky account from our login page (www.clrsky.co) -- We would love you try it out and send feedback!
I thought the hand was supposed to be invisible
@christiannaj.bsky.social I think you might find this project interesting - making a common invisible content moderation tool visible to folks on Bluesky through an overlay called CLR:SKY. www.clrsky.ai
So excited to be bringing this to you all -- I would love you to check it out, and share your feedback! We will be sending out brief surveys to those who want to provide more detailed feedback. You can sign up on our website: www.clrsky.ai/feedback
Hi Bluesky! We are CLR:SKY, a new AI-powered interface enhancement for the Bluesky aimed at helping people craft more civil and constructive conversations. With CLR:SKY you get instantaneous feedback on the tone of your post through a “Weather Report”. You can rewrite with AI if you want.
I stand with the penguins of Heard and McDonald Islands!
This morning, the New York Times ran a guest editorial I wrote about the challenges, but importance of, content moderation on large social platforms. Check it out in the opinion section
www.nytimes.com/2025/02/19/o... @fayjohnson.bsky.social is a voice of reason on the issue of safety online
visiting savannah and being reminded of the anti-literacy laws pre-civil war as a tool of oppression is an unsettling backdrop for news about the govt destroying the department of education
I have worked on T&S at Meta, Twitter and Nextdoor and advise startups about their product strategy. I summarized the changes Meta announced in January. Link as the video is longer than 60 sec.
Okay, so just a few* thoughts (*this got longer as I wrote 😅….long thread)-
Will Meta's retreat from proactive/automated enforcement of its community standards lead to Instagram and Facebook becoming deeper cess pools? Well, it's complicated and the devil will be in the details. Let's look at the data and see what questions we should be asking... 🧵 1/12
My former colleague @samidh.bsky.social has a good hot take on the recent announcement from Meta. The aspiration of 3rd party fact checking is noble, but challenging to implement well at scale.
New @acm-cscw.bsky.social paper, new content moderation paradigm.
Post Guidance lets moderators prevent rule-breaking by triggering interventions as users write posts!
We implemented PG on Reddit and tested it in a massive field experiment (n=97k). It became a feature!
arxiv.org/abs/2411.16814
Considering how to support civil dialogue online, particularly in public forums where debate & disagreement are common. I'd love to see more perspective taking but the character limits of a 'tweet' (what we calling these?) are limiting.
If there was a perspective taking tool, would you use it?
At Harvard to speak at the BKC on the future of social media, 20 years after FB was invented here. What good has emerged? What behaviors or ways of interacting do we want to facilitate for in the future? Will there be emergent platforms or will Meta/YouTube/TikTok continue to own social moving fwd?
As one of the former Directors of Product at Twitter, where I worked alongside amazing people in the Health (Trust & Safety Product) team, it's a pleasure to 'see' you all here. Thanks @jay.bsky.team & team for continuing to invest in this platform.