Excited to be joining #ACL2025NLP in Vienna π¦πΉ!
DM me if you would like to meet up and chat π
Excited to be joining #ACL2025NLP in Vienna π¦πΉ!
DM me if you would like to meet up and chat π
A powerful step for linguistic tech: ETH Zurich student Hanna Yukhymenko developed MamayLM β a Ukrainian #LLM fluent in πΊπ¦ & π¬π§, capturing language, culture & history. Supervised by Prof. Vechev & alumnus A. Alexandrov, in collab with INSAIT. bit.ly/3ED4o5k
@ayukh.bsky.social @ethzurich.bsky.social
It's just a game to these people
Π¨ΠΎ?
1/1 complete
π’ Our benchmark on self-supervised learning for single-cell data𧬠is accepted at the #NeurIPS2024 SSL workshop. We take a first step towards establishing best practices for SSL methods for single-cell data, and benchmark 8 SSL methods on 3 downstream tasks across 8 datasets.
Watch me stalk Kaggle in Vancouver to get stickers
Is MMLU Western-centric? π€
As part of a massive cross-institutional collaboration:
π½Find MMLU is heavily overfit to western culture
π Professional annotation of cultural sensitivity data
π Release improved Global-MMLU 42 languages
π Paper: arxiv.org/pdf/2412.03304
π Data: hf.co/datasets/Coh...
Going to #neurips2024 next week
Yes, I should have specified it before maybe :)
This means basically more publicly available materials, yes
For example, more ready-to-use data (e.g. web scraped texts) for LLM fine-tuning, more Ukrainian-native benchmarks for evals etc. This screenshot is from INCLUDE paper by Cohere which has Ukrainian exams in it, thus a new resource for evalπ
Before I have seen many papers claiming Ukrainian language to be low-resource, even though there are ~40 mil UA speakers worldwide, so there should be a lot of proof to that
Since 2022 Ukrainian NLP effort has dramatically increased and the number of Ukrainian texts available online has increased
Π¦Π΅ΠΉ Π΄Π΅Π½Ρ Π½Π°ΡΡΠ°Π² - Ukrainian is finally recognized as a mid-resource language πΊπ¦π¦ π¦ π¦
Exciting end of the year!
- Won the GraySwanAI jailbreaking challenge for harmful code generation
- Proud Ukrainian ambassador for Cohere4AI new Aya Expanse models
- Started my master thesisπ©βπ³πΊπ¦π
- Going to Vancouverπ¨π¦ for @neuripsconf.bsky.social to chat about LLM privacy and SynthPAI
#neurips
Seems to be a hot take here: why are people getting mad about stuff they post *themselves* online? Your posts online are getting scraped all the time and you chose an open-source movement leader as a scapegoat.
Just initiate a discussion with HF about the "right to be forgotten" from GDPR
π
Zaporizhzhia, a big Ukrainian city of almost one million, is under a massive drone attack. terrorism is russian culture.
One year of ChatGPT has shown incredible capabilities of LLMs. However, they still have lots of problems! The LVE project aims at addressing this - with LVEs we track LLM vulnerabilities and exposures in an open-source community-first approach.
Contribute and more info: lve-project.org
#NLP #LLM
Π’ΡΠ΅Π΄ Π· ΠΊΠΎΡΠΈΡΠ½ΠΈΠΌΠΈ ΠΏΠΎΡΠ°Π΄Π°ΠΌΠΈ Π΄Π»Ρ ΡΠΈΡ , Ρ ΡΠΎ ΡΠΎΠΉΠ½ΠΎ Π΄ΠΎΠ»ΡΡΠΈΠ²ΡΡ. π§΅
ΠΡΠΏΠΈΠ»Π° ΡΠΊΡΠ°ΡΠ½ΡΡΠΊΡ Π²Π°ΡΠ΅Π½ΠΈΠΊΠΈ Π² Π¨Π²Π΅ΠΉΡΠ°ΡΡΡ
π€π«‘
Π―ΠΊ ΠΊΠ°ΠΆΡΡΡ
ΠΡΠΊΡΡ Π·Π° ΡΠ½Π²Π°ΠΉΡπ