
Alysson

@leastsquared

nlp. ai/ml. optimization. researcher.

242 Followers · 1,078 Following · 74 Posts · Joined 16.07.2023

Latest posts by Alysson @leastsquared

Additionally, extractive models were considered, including TextRank (Nenkova and McKeown, 2011), LexRank (Erkan and Radev, 2004), LSA (Steinberger and Ježek, 2004), KLSum (Haghighi and Vanderwende, 2009), and SumBasic (Woodsend and Lapata, 2011).

07.02.2026 15:34 👍 0 🔁 0 💬 0 📌 0

The following models were evaluated:

BART (Lewis et al., 2019), Gemma (Gemma Team et al., 2024), Sabiá (Pires et al., 2023), Llama (Team, 2024a), TeenyTinyLlama (Corrêa et al., 2024), Hermes (Teknium et al., 2024), Qwen (Team, 2024b), and Tucano (Corrêa et al., 2025).

07.02.2026 15:34 👍 0 🔁 0 💬 1 📌 0

I experimentally evaluated small language models on the task of summarizing news text, in the context of auditing Brazilian public health.

07.02.2026 15:33 👍 0 🔁 0 💬 1 📌 0
Frontiers | Small language models applied in text summarization task of health-related news to improve public health audit: an experimental case study. Context: Fraud and corruption are among the main crimes affecting public institutions, with the healthcare sector being particularly vulnerable due to its stru...

My article, titled "Small language models applied in text summarization task of health-related news to improve public health audit: an experimental case study", has just been published in Frontiers in Artificial Intelligence.

doi.org/10.3389/frai...

#nlp #nlproc #llm #slm

07.02.2026 15:32 👍 0 🔁 0 💬 1 📌 0
Breaking the jar: Why NeuroAI needs embodiment Brain function is inexorably shaped by the body. Embracing this will benefit computational models of real brain function and the design of ANNs.

Embodiment is the concept that the function of the brain is inexorably shaped by the body, a lens that is often neglected when neuroscientists study specific brain subsystems, write @bingbrunton.bsky.social and @tuthill.bsky.social.

#neuroskyence #neuroai

www.thetransmitter.org/neuroai/brea...

21.07.2025 14:28 👍 59 🔁 30 💬 1 📌 1

Nice

23.07.2025 13:54 👍 0 🔁 0 💬 0 📌 0

My deep learning course at the University of Geneva is available on-line. 1000+ slides, ~20h of screen-casts. Full of examples in PyTorch.

fleuret.org/dlc/

And my "Little Book of Deep Learning" is available as a phone-formatted pdf (nearing 700k downloads!)

fleuret.org/lbdl/

26.11.2024 06:15 👍 1252 🔁 248 💬 46 📌 17

Hi, this is Luigi Mangione, I need a Pix to pay my lawyer

23.12.2024 22:35 👍 675 🔁 159 💬 14 📌 8

Waking up early without an alarm clock

24.12.2024 10:15 👍 0 🔁 0 💬 0 📌 0
Post image
15.12.2024 01:34 👍 50 🔁 5 💬 0 📌 2

Pre-training as we know it will end - Dr. Ilya Sutskever at NeurIPS 2024

13.12.2024 23:17 👍 45 🔁 8 💬 4 📌 5

No. Words the model has not seen are still "understood". At the very least, their numerical representation will be approximated, simply because the models are trained contextually. If you give a model a task whose sentences contain some invented words, it will still be able to solve the task.

14.12.2024 21:52 👍 0 🔁 0 💬 0 📌 0

Reading ≃ 450 abstracts of studies on text summarization applied in the context of health, medicine and biomedicine to carry out a systematic mapping of the literature, and I thought there would be many more studies on text summarization in this area.

14.12.2024 21:44 👍 0 🔁 0 💬 0 📌 0

#EMNLP has a nice set of tokenization/subword modeling papers this year.

It's a good mix of tokenization algorithms, tokenization evaluation, tokenization-free methods, and subword embedding probing. Lmk if I missed some!

Here is a list with links + presentation time (in chronological order).

11.11.2024 22:38 👍 47 🔁 16 💬 5 📌 2

the data not seen by the model follows the structure of the language of the data seen. So, in a way, they can indeed "reason" (with many quotation marks) about unseen data.

28.11.2024 02:08 👍 0 🔁 0 💬 1 📌 0

Working on my dissertation qualification. Already receiving several criticisms about points of improvement from my advisor, but I ended up having other insights

27.11.2024 00:21 👍 0 🔁 0 💬 0 📌 0

A great book TBR

26.11.2024 02:41 👍 1 🔁 0 💬 0 📌 0

A starter pack of starter packs:

Robotics and AI go.bsky.app/DfAoaJ1
Computer Vision go.bsky.app/PkAKJu5
Computer Graphics Research go.bsky.app/ckQ1u9
Grumpy Machine Learners go.bsky.app/6ddpivr
Reinforcement Learning go.bsky.app/3WPHcHg

19.11.2024 04:36 👍 95 🔁 29 💬 7 📌 3

Wow that is an impressive image of neurons and their beautiful connections

From:
Super-resolution imaging of fast morphological dynamics of neurons in behaving animals
www.nature.com/articles/s41...

25.11.2024 21:55 👍 89 🔁 20 💬 0 📌 3

🤖 ML/AI Mega Starter Pack

1. Open-source LLMs
go.bsky.app/FELkyDr

🧡

22.11.2024 09:28 👍 24 🔁 9 💬 3 📌 2
VeraAI text analysis tools for fact-checking A webinar series on innovative AI-based fact-checking tools from the veraAI project

Webinar reminder: on 26 November 2024 from 11-12 am CET the @sheffieldnlp.bsky.social team (led by K Bontcheva) will showcase veraAI work on text mining and analysis, and how this can support #factchecking. Register here for access (it's organized by @ebu.bsky.social).
tech.ebu.ch/events/2024/...

22.11.2024 14:25 👍 5 🔁 3 💬 0 📌 2
NLP com Classificação e Espaços Vetoriais por Alysson Guimarães ("NLP with Classification and Vector Spaces", by Alysson Guimarães). YouTube video by RDSE - Recife Data Science & Engineering

Talk on NLP, vector spaces, and text classification at RDSE

youtu.be/R1m7T59R-T0?...

cc @samsantosb.bsky.social #bolhadev #datascience #datasciencebr

01.09.2024 14:11 👍 6 🔁 2 💬 0 📌 0

🚨🚨The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead?🧑🤖

This is a massive question that is both important and timely.

📜 https://aclanthology.org/2024.emnlp-main.1230/

w/ Sabrina Akter, JP Singh, and @antonisa.bsky.social

Accepted to #EMNLP2024 Main,

1/3

19.11.2024 04:01 👍 22 🔁 4 💬 3 📌 0

Stop oversampling! Changing the cutoff in probabilistic classifiers is enough for imbalanced data.
In our new paper, Gabriel O. Assunção, Marcos O. Prates, and I explore this in depth. jds-online.org/journal/JDS/...
#DataScience #MachineLearning #ImbalancedData #AI #Oversampling

16.10.2024 16:25 👍 8 🔁 2 💬 1 📌 0
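
As a rough illustration of the idea in the post above (moving the decision cutoff of a probabilistic classifier instead of resampling the data), here is a minimal sketch. It is not the method from the linked paper: the function names `f1_at_cutoff` and `best_cutoff`, the choice of F1 as the selection criterion, and the toy probabilities and labels are all assumptions made for this example.

```python
def f1_at_cutoff(probs, labels, cutoff):
    """F1 for the positive class when predicting 1 iff prob >= cutoff."""
    pairs = list(zip(probs, labels))
    tp = sum(1 for p, y in pairs if p >= cutoff and y == 1)
    fp = sum(1 for p, y in pairs if p >= cutoff and y == 0)
    fn = sum(1 for p, y in pairs if p < cutoff and y == 1)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def best_cutoff(probs, labels, candidates=None):
    """Return the candidate cutoff with the highest F1 (first one on ties)."""
    if candidates is None:
        candidates = [i / 100 for i in range(1, 100)]
    return max(candidates, key=lambda c: f1_at_cutoff(probs, labels, c))


# Hypothetical predicted probabilities from a classifier trained on
# imbalanced data: positives are rare, so scores cluster well below 0.5.
probs = [0.05, 0.10, 0.12, 0.15, 0.20, 0.22, 0.25, 0.30, 0.35, 0.40]
labels = [0, 0, 0, 0, 0, 0, 0, 1, 0, 1]

default_f1 = f1_at_cutoff(probs, labels, 0.5)  # no positives are predicted
cutoff = best_cutoff(probs, labels)
tuned_f1 = f1_at_cutoff(probs, labels, cutoff)
print(f"F1 at 0.5: {default_f1:.2f}; F1 at {cutoff:.2f}: {tuned_f1:.2f}")
```

In practice the cutoff would be chosen on held-out validation data rather than on the same examples used to report the score, and the criterion (F1 here) would depend on the costs of each error type.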

Thanks for sharing

24.11.2024 19:34 👍 1 🔁 0 💬 0 📌 0

🧠🤖 I just put together a starter pack for CogSci & human-centered AI researchers. Looking to add more folks here - let me know!

go.bsky.app/NTjjUwG

#cogsci #ai #hci

22.11.2024 18:30 👍 13 🔁 3 💬 7 📌 0
Cognitive Science 🧠 This article presents introductory concepts in Cognitive Science.

medium.com/data-hackers...

#cogsci #cogscibr

22.11.2024 01:34 👍 1 🔁 0 💬 0 📌 0

New here? Interested in AI/ML? Check out these great starter packs!

AI: go.bsky.app/SipA7it
RL: go.bsky.app/3WPHcHg
Women in AI: go.bsky.app/LaGDpqg
NLP: go.bsky.app/SngwGeS
AI and news: go.bsky.app/5sFqVNS

You can also search all starter packs here: blueskydirectory.com/starter-pack...

09.11.2024 09:13 👍 552 🔁 212 💬 67 📌 55

Thanks for sharing

21.11.2024 02:16 👍 0 🔁 0 💬 0 📌 0

I've seen some interest emerging in neurosymbolic AI, which would be great for explainability and for eliminating the subjectivity that exists around LLM vector spaces.

21.11.2024 02:07 👍 1 🔁 0 💬 0 📌 0