Belated Happy New Year 2025 & Happy Lunar New Year!
We're kicking off the year with SheffieldNLP's latest research, accepted at NAACL, ICLR & ECIR!
Stay tuned for a thread summarizing our published work!
Joking aside, DeepSeek is really impressive, showing that scaling is **not** all you need
[Prompt:] Write an epic rap battle between Donald Trump and Xi Jinping on East versus West. [Prompt:] Good job. Add 8 Mile vibes. [Prompt:] Cool. Now make Donald rap in the style of Snoop Dogg and Xi in the style of Eminem.
ChatGPT - DeepSeek
Help! Looking for two emergency reviewers for an ARR December 2024 paper on topic modelling. Please msg me if you can provide a review by tomorrow
Synthetic calibration data (for pruning and quantization) generated by the LLM itself is a better approximation of the pre-training data distribution than "external" data.
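A toy sketch of that intuition, using smoothed unigram counts and KL divergence as a stand-in for the real distributional comparison (all strings, the smoothing scheme, and the unigram proxy are invented here for illustration, not from the paper):

```python
from collections import Counter
import math

def kl_divergence(p_counts, q_counts):
    """KL(P || Q) over the shared vocabulary, with add-one smoothing."""
    vocab = set(p_counts) | set(q_counts)
    p_total = sum(p_counts.values()) + len(vocab)
    q_total = sum(q_counts.values()) + len(vocab)
    kl = 0.0
    for w in vocab:
        p = (p_counts.get(w, 0) + 1) / p_total
        q = (q_counts.get(w, 0) + 1) / q_total
        kl += p * math.log(p / q)
    return kl

# Toy "pre-training" distribution and two candidate calibration sets.
pretrain = Counter("the model learns language from large text corpora".split())
self_generated = Counter("the model generates text about language and corpora".split())
external = Counter("completely unrelated shopping list milk eggs bread".split())

# Self-generated data sits closer (lower KL) to the pre-training counts.
print(kl_divergence(pretrain, self_generated) < kl_divergence(pretrain, external))
```

The point of the sketch: text the model generates itself shares the model's own lexical statistics, so it diverges less from the pre-training distribution than arbitrary external text.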
Really cool work by Miles (@mileswil.bsky.social) and George (@soon1otis.bsky.social) to be presented at #NAACL2025
arxiv.org/abs/2410.17170
We're hiring! Looking for a Lecturer (~Assistant Prof.) at the intersection of #NLProc and computational social media analysis (i.e. computational social science)
Info and how to apply:
www.jobs.ac.uk/job/DLI664/l...
Wrote up my first piece of PhD work last week!
Summarization via LMs is great at extracting info from documents, but how does summarization look in sensitive settings where privacy-preservation is essential?
Short answer: LMs are poor privacy preservers.
arXiv: arxiv.org/abs/2412.12040
Findings:
- Privacy preservation at inference time is really underexplored!
- LMs struggle to prevent PII leakage in their summaries.
- Human evaluations reveal privacy risks that metrics may overlook.
Paper w/ @naletras.bsky.social and Ning Ma
Cc. @sltcdt.bsky.social
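A minimal sketch of the kind of PII screen such an evaluation might run over generated summaries (the pattern set, category names, and the example summary are all invented for illustration, not taken from the paper):

```python
import re

# Hypothetical regex screens for two common PII categories.
PII_PATTERNS = {
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b"),
    "phone": re.compile(r"\b(?:\+?\d[\d\s-]{7,}\d)\b"),
}

def leaked_pii(summary: str) -> dict:
    """Return the PII spans a summary leaks, keyed by category."""
    return {name: pat.findall(summary)
            for name, pat in PII_PATTERNS.items()
            if pat.findall(summary)}

summary = "Patient J. Doe (jdoe@example.com) was discharged; call 0114 222 0000."
print(leaked_pii(summary))
```

Regex screens like this are exactly the kind of automatic metric the thread warns about: they catch surface forms (emails, numbers) but miss the contextual leaks that human evaluation surfaces.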
We invite nominations to join the ACL 2025 PC as a reviewer or area chair (AC). The review process runs through the ARR February cycle. Tentative timeline: reviews 1-20 Mar 2025, rebuttal 26-31 Mar 2025. ACs must be available throughout the Feb cycle. Nominations by 20 Dec 2024:
shorturl.at/TaUh9 #NLProc #ACL2025NLP
Participated in an #EU "Survey on Simplifying Applications in EU Grants". I clicked to receive my response by email, which obviously required a CAPTCHA. After a few failed attempts, I managed to receive it. Glad that it didn't ask me to provide detailed KTPs, TRLs and a 27B-6.
Great to have Joe Stacey today for his talk on atomic inference for interpretable NLI! Breaking tasks into atoms for transparency + outperforming baselines was inspiring. Loved his insights on robustness, and a fun trip to the Christmas Market after!
Cass and I are looking for a #PhD student to work on multimodal LLMs @sheffieldnlp.bsky.social.
This is a fully-funded scholarship (including stipend), open to home and international candidates.
Deadline: 29/1/2025
Please spread the word!
#nlproc
www.findaphd.com/phds/project...
Having a large number of short ARR cycles a year doesn't make sense to me. Since there is no arXiv anonymity period anymore, we can move to less rushed cycles, more engagement during the discussion period and better review/metareview quality given the extra time
The author response period of @ReviewAcl is way too short (this time during a weekend). We can definitely do better by extending it for more meaningful discussions.
This perhaps would mean a smaller number (e.g. 3 or 4) of longer ARR cycles, but it's still worth it #nlproc #naacl2025
Fun fact: if I remember correctly, we got desk-rejected by PLOS ONE because they couldn't find reviewers with appropriate expertise. I don't think we even tried *ACL because we didn't have a "novel" END-TO-END model. PeerJ CS got some extremely high-quality reviewers though!
The paper was published eight years ago:
peerj.com/articles/cs-...
and apart from inspiring further research in NLP and #legaltech, it also resulted in the creation of the NLLP Workshop and its amazing community 5/5
The initial idea was conceived at a coffee shop in Sheffield (the Couch, still operating) in 2014, trying to explain NLP and text classification to Dimitris 4/n
Looking back, it still amazes me that this work was just a side project, not specifically funded by a grant and published in a less prestigious outlet (although a UCL press release helped enormously) 3/n
For the first time, we showed that it is possible to use just the fact descriptions of legal cases to train classifiers - SVMs (ehm, what?!) acting as judges - for predicting judicial decisions. This sparked huge interest (and debates) in the use of AI in the legal domain 2/n
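A minimal sketch of that kind of setup, assuming scikit-learn; the toy fact descriptions and labels below are invented, standing in for the case texts and n-gram features the original work used:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Invented toy fact descriptions and outcomes (1 = violation, 0 = none).
facts = [
    "the applicant was detained without judicial review for months",
    "the applicant's trial was concluded within a reasonable time",
    "police held the applicant incommunicado and denied access to a lawyer",
    "the domestic courts examined the complaint promptly and fairly",
]
labels = [1, 0, 1, 0]

# N-gram features + linear SVM: facts in, predicted outcome out.
clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LinearSVC())
clf.fit(facts, labels)
print(clf.predict(["the applicant was held without access to a court"]))
```

The design choice is the point of the post: no end-to-end neural model, just lexical features of the fact descriptions feeding a linear classifier.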
Our paper (w/ Bill, Dimitris and Daniel) crossed the 1000 citations mark. While citation count as a metric only partially captures the true impact of a paper, it still indicates how influential this work was at the intersection of law and NLP 1/n
A huge thank you to Xiting Wang (Renmin University of China) for an insightful talk!
Her talk on explaining large & small language models and uncovering safety risks through Concept Activation Vectors was very interesting and truly inspiring.
scholar.google.com/citations?us...
Hello #nlproc world