Franzi Weeber's Avatar

Franzi Weeber

@franziweeber

PhD Student @ IMS Stuttgart

37
Followers
59
Following
4
Posts
22.01.2025
Joined
Posts Following

Latest posts by Franzi Weeber @franziweeber

Post image

Alignment with demographic subgroups can look good for single survey questions, yet miss the correlation structure of cultural values.

Tristan Williams, Sebastian Padรณ, Alan Akbik and I propose a 2-level eval framework and apply it to demographically aligned LLMs.

arxiv.org/abs/2601.15755

30.01.2026 13:49 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Paper accepted to #EACL2026 main conference ๐ŸŽ‰
@taniseceron.bsky.social, Sebastian Padรณ and I test multilingual LLMs before and after English-only fine-tuning and find strong cross-lingual political opinion transfer across five Western languages.

www.arxiv.org/abs/2508.05553

29.01.2026 09:09 ๐Ÿ‘ 9 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1

While the responses for different cues are highly correlated, we find stronger biases for more explicit but less externally valid cues. We advise future work to focus on externally valid cues such as human written conversation histories or to base their findings on a mixture of cues (2/2)

28.01.2026 16:22 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Different sociodemographic cues for gender (male, female, and non-binary personas) result in different LLM answers to a medical advice request.

Different sociodemographic cues for gender (male, female, and non-binary personas) result in different LLM answers to a medical advice request.

Even with identical sociodemographic info, how LLMs are given it changes downstream bias results. Our new preprint (w/ @veraneplenbroek.bsky.social, Jan Batzner & Sebastian Padรณ) tests cues with varying external validity across 10 personas, 4 tasks & 7 LLMs: arxiv.org/abs/2601.18572

28.01.2026 16:22 ๐Ÿ‘ 10 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1