We had a blast hosting Sarah @craicexeter.bsky.social . We should be having more of these much needed discussions as scholars - Sarahβs talk featured a great mix of critical edge and openness to dialogue
@gildersleve.uk
Lecturer (Asst Prof) in Communications and AI at the University of Exeter Prev. LSE Methodology, PhD Oxford Internet Institute Wikipedia, News, Attention he/him ππ³ linktr.ee/gildersleve π https://gildersleve.uk
We had a blast hosting Sarah @craicexeter.bsky.social . We should be having more of these much needed discussions as scholars - Sarahβs talk featured a great mix of critical edge and openness to dialogue
A paper screenshot: Refractive datasets as a sensemaking methodology in closed data ecosystems Anna Beers, Viviane Ito, Agustin Orozco, Patrick Gildersleve, Pablo AragΓ³n, and Francesca Tripodi Abstract As digital platforms restrict their APIs, researchers face diminishing options for studying social phenomena in digital environments. During what has been called the post-API era, researchers have found themselves looking for reliable data sources in an unreliable and frequently changing platform data ecosystem. In this context, we propose analyzing refractive datasets as a methodology for researchers to understand the dynamics of closed data platforms. Refractive datasets come from platforms with relatively more open data policies, and their analysis sheds light on platforms with more restrictive data policies. Like a prism, refractive datasets reflect but also transform data-based phenomena unfolding on closed platforms. Using refractive datasets from Wikipedia and Google Trends, we present three studies to demonstrate our methodology. We first show how refractive data from Wikipedia's multiple language editions can be used to understand a fractured global platform ecosystem in a case study of hydroxychloroquine, a purported COVID-19 medicine. Second, we use Google Trends to show how similar refractive analyses can be used to understand information lost to platform deletion, in a profile of an online panic over the drug brand Galaxy Gas. Finally, we show how Wikipedia data can be used as a grounding point for a refractive analysis of how new generative algorithms reproduce and distort data across the social web. We discuss how refractive datasets can be a way for researchers to βsensemakeβ in increasingly opaque big data environments, enabling interpretivist analyses which aim to generate new hypotheses rather than verify existing claims.
Happy 25th birthday to Wikipedia! π₯³
A fitting moment to share
1. Their great site to mark the occasion: wikipedia25.org
2. A paper in Big Data & Society, published over the winter break, where we develop Wikipedia as a βRefractive Datasetβ, led by @beeeeeers.bsky.social: doi.org/10.1177/2053...
Thanks Felix!
Thank you Nicole!
Iβm chuffed to share that Iβve been awarded this grant with @ftripodi.bsky.social and Brett Zehner π₯³
Weβll be studying how AI systems may reproduce or reinforce biases in Wikipedia, whether by extracting knowledge from the platform or by contributing content back to it. Excited to get started!
πICYMI: "It matters less that Grokipedia succeeds than whether it helps to delegitimise Wikipedia."
@gildersleve.uk #Grok #Wikipedia #Knowledge
Launched as a competitor to Wikipedia, Elon Musk's #Grokipedia is one of the first LLM-based attempts to create an encyclopaedia.
@gildersleve.uk finds it compares poorly to Wikipedia, but the latter faces challenges as AI tools cannibalise its content and audiences.
@lseimpactblog.bsky.social
I enjoyed writing this piece for the LSE Impact Blog on Musk's latest swing at Wikipedia, and its future in a hostile AI-skewed political landscape π
It was a great experience organising this workshop with colleagues - looking forward to more good CrAIC!
Super excited for the launch workshop for our research centre. Sign up below for the public roundtable!
A really great read overall on how Wikipedia has survived the Internet, and the challenges it still faces. Going straight on my students' reading lists!
Iβm hardly a fan of much of the UKβs speech laws (especially regarding protest, Palestine recently), and cannot imagine what Esther Ghey has gone through. But what on earth are we doing here???
A screenshot of the BBC news app showing two stories. The first βMet chief calls for law change after Graham Linehan arrestβ, with a photo of Graham Linehan. The second βWatch: Mother of Brianna Ghey calls for smartphone ban in schoolsβ, with an image showing Brianna and Esther Ghey.
A depressing juxtaposition of news stories, there is such a disconnect in the national conversation.
A public figure directly calling for violence against trans people is met with police sympathy and instead we get phone ban campaigns to tackle online harms π.
A glimmer of hope - the judge issued a warning shot towards Ofcom and the govt on protecting Wikipedia's operation in the UK. Wikimedia have focused on this in their response. We expect to find out more on Wikipedia's categorisation and obligations later this summer. This will rumble on...
Troubling news as Wikimedia's claim against the OSA is dismissed. This opens the door for onerous new requirements such as ID verification and stricter content moderation. These measures run antithetical to Wikipedia's practical operation and core principles as the encyclopaedia anyone can edit.
For even more WikiResearch at #IC2S2, I'm presenting our work on WikiReddit on Wednesday 14:30 in 'Social Media II' π§βπ»
I'll explain how these complementary platforms are powered by the magic of β¨π©ππππ§ππ«π²β¨, and how our dataset can be used to study cross-platform flows of information and attention π
Great session on Wikipedia at #IC2S2 with some very cool research. Strong themes of multilingual analysis and coordination dynamics
@smfsamir.bsky.social
@feloe.bsky.social
I am pleased to announce the launch of the Manifesto for Wikimedia Research manifesto.wiki. As my co-authored Big Data & Society commentary explains, the manifesto is dedicated to a humanist and critical tradition of taking Wikipedia's importance seriously. journals.sagepub.com/doi/10.1177/...
Presenter (Patrick Gildersleve) in front of a screen summarising the WikiReddit Dataset project. The slide describes it as "Every Wikipedia mention and link on Reddit, 2020-2023", includes some example usage, describes the scale of the dataset, and offers suggested use cases.
Had a great time meeting everyone and seeing all the interesting work @icwsm.bsky.social. I presented our study on the Wikireddit dataset - exploring Wikipediaβs role in fact-checking, discussion, and cross-platform attention on the web. Thank you to the organisers!
π: ojs.aaai.org/index.php/IC...
Important starter pack research in action! I received lots of follow notifications during this talk - hopefully this work prompts more ICWSM activity here!
Presenter in front of screen at ICWSM. Slide content: Summary First study of how original Wikipedia content has been forked and manipulated to meet the requirements of a national regulation. β’ RWFork edits target top visited and controversial pages β’ Editors follow office-hour activity patterns β’ Changes are systematic, mostly focused on the Russo-Ukrainian War RWFork dataset is available for further study: https://zenodo.org/records/15073728
Very cool work at #ICWSM tracking information manipulation in response to national regulation on the Russian Wikipedia fork
From Mykola Trokhymovych, @elaragon.bsky.social, @e-migrante.bsky.social & others
I'll be at #ICWSM 2025 next week to present our paper about Bluesky Starter Packs.
For the occasion, I've created a Starter Pack with all the organizers, speakers, and authors of this year I could find on Bluesky!
Link: go.bsky.app/GDkQ3y7
Let me know if I missed anyone!
Big congrats both!!!
You can still sign up for our My580 Methods Short Course π Modern Text Analysis and NLP with Python, by @gildersleve.uk
π Tuesday 27 May, from 10am - 3pm
π±οΈ www.eventbrite.co.uk/e/modern-tex...
Next up on our My580 Methods Short Courses π Modern Text Analysis and NLP with Python, by @gildersleve.uk
π Tuesday 27 May, from 10am - 3pm
π±οΈ www.eventbrite.co.uk/e/modern-tex...
The Royal Courts of Justice main gate. Text says "Wikimedia Foundation brings legal challenge to new UK Online Safety Act requirements".
We are challenging the lawfulness of the UKβs Online Safety Act Categorisation Regulations, which determine our obligations under UK laws.
These rules create unacceptable risks for Wikipediaβs volunteer users, such as disempowering those who wish to keep their identity private.
Read β‘οΈ w.wiki/E3XD
I have two postdoc positions for a computer scientist and a social scientist to work on my ERC-funded project HUMANET Human-Machine Social Systems at LSE. Please read more and apply at jobs.lse.ac.uk/Vacancies/W/.... More information about the project is available at humanet.science.
AI generated photograph featuring a group of people sat around a table and code overlayed
π£Call for papers!
1οΈβ£ week left to send your 250-word abstract for @audreyalejandro.bsky.social and βͺ@dandekadt.bsky.socialβ¬'s two-day workshop on "Computational Social Science meets Qualitative Research"
Learn moreβ‘οΈ www.lse.ac.uk/Methodology/...
@lsedatascience.bsky.social