Reunion in Singapore! 🇸🇬 @barbaraplank.bsky.social, @xinpeng.bsky.social, who's currently on a research stay at NYU, and Chengzhi are presenting their work at @iclr-conf.bsky.social
Upcoming ICLR 2025 paper: Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation
We propose a surgical & flexible approach to mitigate false refusal in LLMs with minimal effect on performance and inference cost
led by @xinpeng.bsky.social (1/2)
The hand-drawn sign from three years ago.
MaiNLP is turning 3 today! 🥳 We’ve grown a lot since @barbaraplank.bsky.social started this group with nothing but three aspiring researchers and a hand-drawn sign on the door. Huge thanks to all the amazing people who have joined or visited us since. Here’s to many more years of exciting research!
I’m thrilled to share that our paper on mitigating false refusal in language models has been accepted to ICLR 2025 @iclr-conf.bsky.social!
arxiv.org/abs/2410.03415
Joint work with Chengzhi, @paul-rottger.bsky.social, and @barbaraplank.bsky.social.