This is indeed delightful, thanks for posting! Their channel seems to have whole albums' worth of similar songs but not sure if any of the others have subtitles or dancing π
This is indeed delightful, thanks for posting! Their channel seems to have whole albums' worth of similar songs but not sure if any of the others have subtitles or dancing π
@aclanthology.org not sure where to report, but in the last few months I've often had issues with long loading times/timeouts on aclanthology.org. It's particularly bad today---maybe related to the upcoming ARR deadline?
Idk about "primarily" mate
You mean the most popular *US* politicians on this list
Personally, sleeping more and vitamin D in the winter.
...sorry, not much of a baker
@aclrollingreview.bsky.social Why is the reviewing window (still) so short this cycle? Wasn't the cycle extended to ten weeks specifically to make the process more manageable? Wasn't it three weeks in past cycles? Instead reviewers don't even get two full weeks to handle 4+ submissions.
TokShop @ #ICML2025 got way more submissions than expected! π We could really use a few more reviewers to help out. If you have the capacity to review a #tokenization paper by Saturday, please fill out this form: forms.gle/32A6sQHQrMSb... π
Beyond text: Modern AI tokenizes images too! Vision models split photos into patches, treating each 16x16 pixel square as a "token." πΌοΈβ‘οΈπ€ #VisualTokenization
Interested in tokenization? Join our workshop tokenization-workshop.github.io
The submission deadline is already May 30!
I'll be presenting this paper in Gather Town (Session 1) in a few hours π Come along!
This is a fantastic oral history of the last 10 years of NLP and AI. www.quantamagazine.org/when-chatgpt...
As a second language English speaker this also confused me for so long. Eventually I decided it must be from the phrase "having cake" which also means eating the cake
Me posing with my poster
The tour guide standing next to a statue of Professor Lichtenberg.
A slide of the vocabulary learning algorithm "SaGe"
Just spent two days in GΓΆttingen at #HumanCLAIM workshop! Re-presented my poster on surveying methods for cross-lingual representation alignment, got a city tour, heard cool talks and had interesting conversations π¬π
Oh very nice to see a paper for this intuition, and the data could be very useful! Adding to the reading list π
Figure 1: Eflomal score (bottom), a measure of token alignability, predicts downstream transfer performance better than the previous metric of distributional token overlap (top). The difference is especially stark for language pairs with different scripts (β’), compared to language pairs with the same script (Γ). The orange line shows the linear fit across all included pairs.
Alignability is more predictive of cross-lingual transfer than divergence of literal token distributions, particularly for language pairs with disparate scripts.
Basically we argue that token overlap measures for predicting multilingual performance are too literal, and introduce the notion of **token alignability**, which can be measured via the scores of a statistical aligner over a corpus tokenised with a given tokenised.
Happy to say that our paper "Beyond Literal Token Overlap: Token Alignability for Multilinguality" will be presented at #NAACL2025!
This is work with @tomlim.bsky.social, @jlibovicky.bsky.social, and Alex Fraser.
arxiv.org/abs/2502.06468
#newpaper #NLP #NLProc
Following the MT Marathon, we're hosting a hackathon in Prague. Researchers and students from five institutions (+1 online) are working together to assess how robust #LLMs are to grammar errors in machine translation and related tasks. Thanks to EAMT for their support.
@queerinai.com Hi, I was invited to review for the workshop the other day but the email is not clear on when reviews will be due. This info will be important to decide if I'm able to serve; can you share the deadlines? Thanks!
π
Gotta say I'm not sure what pronunciation "luh-BOEV" is referring to but in my head it sounds like French beef
Germany. a) ground floor b) first floor. This matches how we count in German but the German terms basically treat the "upper floors" separately from the "ground floor"
Bill Labov died this morning. I'm not coherent enough to talk about how important and influential and brilliant he was. I am very sad.
I was so lucky to know him, and I am grateful every day that he (and Gillian, and Walt, etc) built an academic field where kindness is expected.
To add to the reviewing complaints π Why do authors so often respond with an absolute wall of text? (Biggest response I got this time was four comments long.) As a reviewer, I find this very tough to engage with in the short discussion period, and as an author, I try to be concise in my responses.
5k is a small town, honestly π
Just wanted to say a quick thank you for organising a lovely social! ππ
Right now the app is being very laggy though?
Today I finally deactivated my Twitter account (not that I'd been super active there but hey) and decided to check out Bluesky. Looks like there's already a LOT of people here!