A feature in Nature reports how research shows that βexercise snacksβ and other forms of everyday movement can greatly reduce the risk of heart disease and death. #medsky π§ͺ
@savcisens.com
Postdoc @nunetsi.bsky.social (Northeastern Uni) π Computational Social Science πΎ β¨ work on stability of belief in LLMs & Human-AI Collaboration πΏ he/him π±π» πΊπ¦ | www.savcisens.com
A feature in Nature reports how research shows that βexercise snacksβ and other forms of everyday movement can greatly reduce the risk of heart disease and death. #medsky π§ͺ
Itβs been two years since we published the #life2vec paper, and itβs still circulating widely.
People keep discovering it but much of what circulates online is misleading.
Agter a long time, I finally wrote a short explainer to clear up a few things:
converges.medium.com/is-life2vec-...
Attending @neuripsconf.bsky.social this week!
If you want to chat about LLMs for behaviour / health / labour modeling... or about beliefs and opinions of LLMs, hit me up.
Iβll also be presenting a poster on truth tracking at the Mechanistic Interpretability workshop later. Come say hi!
My 2 cents: If you exploited the #openreview bug or are actively searching for the leaked data, you should seriously reconsider your place in research.
If you cannot uphold the basic principle of double-blind review, how can we trust anything you publish?
Truthfulness isnβt always binary. Sometimes itβsβ¦ neither π€ Our Trilemma of Truth paper is headed to the @neuripsconf.bsky.social Mechanistic Interpretability workshop π Letβs connect in San Diego! π΄
Preprint: arxiv.org/abs/2506.23921
Code and data: github.com/carlomarxdk/...
Had the pleasure of presenting our work on Three-valued veracity probes for LLMs at #NEMI Workshop! Mechanistic Interpretability has such a great and welcoming community.
If we crossed paths - letβs connect! π
Poster: zenodo.org/records/1690...
Preprint: arxiv.org/abs/2506.23921
@nunetsi.bsky.social had a great week at @ic2s2.bsky.social! Looking forward to the next #IC2S2 in Vermont π¬β°οΈ
π₯ All keynote talks from ICΒ²SΒ² β25 are now available on our YouTube channel!
#ic2s2 #css
Colleagues are making sure that I stay focused at #IC2S2 π€
Presented our work on veracity-tracking in LLMs at #IC2S2 today!
Now looking forward to the next few days of great talks and conversations β¨οΈπ
Little wins: our "Trilemma of Truth" dataset just hit 150 downloads. It contains true, false, and neither-valued statements (inspired by the three-valued logic) used to stress-test LLMs for fact-checking, veracity tracking, and uncertainty handling.
Datasetπ: huggingface.co/datasets/car...
Perfect weather, charming streets, and a poster so big it almost needed its own boarding pass π§³β¨
Excited to attend #IC2S2 in NorrkΓΆping πΈπͺ Find me at the Poster Session on Tuesday: "Improving Probes that Track Veracity in Large Language Models" (Poster ID: 39) π§ͺ
Iβm presenting a poster on my latest project: βThe Trilemma of Truth.β
Drop by to see how LLMs leverage threeβvalued logic to model truth π’π€
And hey, if you fancy grabbing a coffee β, DM me!
π Poster: zenodo.org/records/1605...
π Preprint: arxiv.org/abs/2506.23921
Dataset with statements related to City Locations, Medical Indication, and Word Definitions is available on π€ huggingface.co/datasets/car...
Germans Savcisens, Tina Eliassi-Rad: The Trilemma of Truth in Large Language Models https://arxiv.org/abs/2506.23921 https://arxiv.org/pdf/2506.23921 https://arxiv.org/html/2506.23921
π¨ New preprint!
Do LLMs really know whatβs true?
In our paper, @eliassi.bsky.social and I introduce sAwMIL: a probing method that distinguishes between true, false, and neitherβcapturing what LLMs actually βretain.β
We evaluated 16 open models across 3 new datasets.
π arxiv.org/abs/2506.23921
That happens way too often to me π₯²
Thanks π
What's the coolest guide/source on "Complex Data Visualization"? I am looking for some inspiration to visualize graphs and high dimensional data.
Great read from 2019 about abandoning the use of p-values in a dichotomous way and what we can do instead. More thinking, and less relying on significance tests to decide things for us!
www.nature.com/articles/d41...
Sometimes we need a reality check π @serge.belongie.com
Amazing time with the folks from @tint-philosophy.bsky.social at the retreat on Predictability of Human Lives. Great people & discussions, and so much to reflect onβespecially around integrative modeling and how neural networks can help us get there. Plus, a relaxing sauna to top it off!
Visiting @mpidr.bsky.social this weekβsuper excited to see whatβs happening in Demographic Studies (donβt miss my talk!).
Also, Iβll be in Berlin on Feb 2, Helsinki from Feb 3-6, and Copenhagen from Feb 10-12. Let me know if youβre around and up for a coffee π§ͺπ¬βοΈ
For those interdisciplinary students/scholars who are having identity crisis, this is for you (from 2018):
"How to survive as an interdisciplinary being"
www.slideshare.net/slideshow/ho...
#NetSciX2025
New tool to estimate the level of participation in collective action expressed in natural language.
Applied to social media, it can produce large-scale and granular estimates of behavior change wrt collective action.
github.com/ariannap13/e...
@nerdsitu.bsky.social @itu.dk @carlsbergfondet.dk
I once asked ChatGPT how it thinks my life would look like in 20 years. And "Visionary Multidimensional Social Scientist" sounds like a great job title π I guess it captured my love for the "His Dark Materials" trilogy.
π’ @savcisens.com discusses a recent study that shows that LLMs exhibit social identity biases similar to humans. www.nature.com/articles/s43...
πhttps://rdcu.be/d5owe
Happy to write this News & Views piece on the recent audit showing LLMs picking up "us versus them" biases: www.nature.com/articles/s43... (Read-only version: rdcu.be/d5ovo)
Check out the amazing (original) paper here: www.nature.com/articles/s43...
Could you add me as well π¦π¦
I don't like the way many CS papers are written, even the supposedly good ones, but these tips are very generically applicable and useful. Just ignore the bit about the acronyms...