Javier Cancela

@javiercancela.com

Interested in data and AI (mostly LLMs). Mainly in English, but sometimes in Spanish or Galician. Director of Data Analytics @ Joor

98 Followers · 681 Following · 55 Posts · Joined 03.11.2023
Latest posts by Javier Cancela @javiercancela.com

If by chance anyone needs someone with extensive experience in systems, cloud, Kubernetes... and a few other things, I'm right here :)

11.01.2025 08:41 👍 1 🔁 1 💬 0 📌 0
nick331642's comment on "Felix Hill has died {DM}" Explore this conversation and more from the reinforcementlearning community

Via Reddit: www.reddit.com/r/reinforcem...

04.01.2025 11:13 👍 0 🔁 0 💬 0 📌 0
On mental health, psychedelics and life This is a story about mental health, psychedelics, psychology and the mind. It is a story about the joy of family, the joy of friends, the joy of being in love...

This is a hard read. On depression, drugs, and suicide from Felix Hill, a DeepMind researcher, before taking his own life.
docs.google.com/document/d/1...

04.01.2025 11:10 👍 1 🔁 0 💬 1 📌 0
Still from the movie "Upstream Color". A woman and a man looking at each other, clear dusk sky in the background, silhouettes of birds in the sky.

I love sci-fi, and I read and watch a lot of it.

Made a list of some good, lesser-known science fiction movies → rakhim.exotext.com/lesser-known...

Shorter version in the thread below 🧵

15.12.2024 14:21 👍 11 🔁 2 💬 3 📌 1
Improving Team Communication: Why I Replaced Weekly 1:1s with Status Emails and Monthly Check-ins Discover why I transitioned from weekly 1:1 meetings to a more effective communication strategy involving weekly status emails and monthly performance check-ins. Learn how this change improved team dy...

I liked this post: acalustra.com/less-11-meet... (I found the blog via @antonmry.bsky.social)

14.12.2024 12:54 👍 2 🔁 0 💬 0 📌 0

Oops

13.12.2024 18:56 👍 1 🔁 0 💬 0 📌 0

I did. I agree with Cambridge's warning.

10.12.2024 19:43 👍 1 🔁 0 💬 0 📌 0

I knew my bug-fixing deployments were "magical thinking"...

10.12.2024 17:03 👍 2 🔁 0 💬 1 📌 0
CMU Database Group Carnegie Mellon University Database Group

I don't think young people understand how mind-blowing things like this would look to college-me (back in the 90s): www.youtube.com/@CMUDatabase...

Top quality education for free.

10.12.2024 15:28 👍 1 🔁 0 💬 0 📌 0

Uh oh...

02.12.2023 19:14 👍 1391 🔁 237 💬 6 📌 7

Companies die from a lack of cash, which in your average SaaS company means a lack of product market fit.

04.12.2024 14:03 👍 2 🔁 0 💬 0 📌 0

I need an emoji for “skeptical interest”

04.12.2024 13:46 👍 0 🔁 0 💬 0 📌 0

First time I've heard the name

03.12.2024 18:54 👍 0 🔁 0 💬 0 📌 0

For anyone interested in fine-tuning or aligning LLMs, I’m running this free and open course called smol course. It’s not a big deal, it’s just smol.

🧵>>

03.12.2024 09:21 👍 326 🔁 64 💬 9 📌 4

you: why'd it take so long to remake your website
me: needed a physics engine for my text
you: ???
me:

02.12.2024 17:46 👍 440 🔁 51 💬 15 📌 1
From the ChatGPT community on Reddit: Unfolding ChatGPT's mysterious censorship and David Mayer Explore this post and more from the ChatGPT community

More or less: www.reddit.com/r/ChatGPT/co...

02.12.2024 17:13 👍 3 🔁 0 💬 1 📌 0
Casa Sol anti-tuberculosis dispensary on Rúa do Orzán in A Coruña

On a day like today, 2 December 1906, the Dispensario Antituberculoso opened its doors in A Coruña, located in the emblematic Casa do Sol, facing the Paseo do Orzán. A small building that hides a great history! Keep reading 👇

#ACoruña #PedroMariño #HistoriadaCoruña

02.12.2024 10:19 👍 6 🔁 7 💬 2 📌 1

7 databases in 7 weeks

cool idea :)

matt.blwt.io/post/7-datab...

01.12.2024 15:15 👍 104 🔁 12 💬 3 📌 1

It’s an interesting ranking, but most of Spain’s score apparently comes from having a great Internet speed and some regulations.

01.12.2024 20:20 👍 1 🔁 0 💬 0 📌 0

Speaking of this, the original MapReduce paper contains what I imagine are the first versions of Borg, Jupiter, and Colossus.

01.12.2024 17:13 👍 0 🔁 0 💬 0 📌 0
What we don't know costs us too much

We all pay the bill for opacity together. 💸

Every hidden contract, every wasted euro, every action in the shadows swells a bill that we accept without complaint. But it doesn't have to be this way.

We can investigate more. Inform ourselves better. Demand accountability. Even join forces to change this.

30.11.2024 20:00 👍 22 🔁 18 💬 1 📌 2

Using it for the first time.

30.11.2024 21:37 👍 0 🔁 0 💬 0 📌 0

I just found out that the + button to the left of the language selector allows you to create threads 🤦.

30.11.2024 21:37 👍 1 🔁 0 💬 2 📌 0
Microservices YouTube video by KRAZAM

Reviewing BigQuery technical info, I see that Borg orchestrates the communication between Dremel, Jupiter, and Colossus.

But what about Galactus???
www.youtube.com/watch?v=y8On...

30.11.2024 21:10 👍 0 🔁 0 💬 0 📌 1

Damn this is good

30.11.2024 17:57 👍 0 🔁 0 💬 0 📌 0
Question from Leo on Twitter:
It doesn’t interpolate, does it?

If I ask “What color is a Gropy?”, and we had 100 labellers say it’s blue and 100 labellers say it’s yellow, it’s going to randomly say blue or yellow - but never “It’s a debated question, some say blue, some say yellow”. Right?

Answer from Andrej:
Excellent question and yes exactly, it responds with blue or yellow with 50% probability. Saying “It’s a debated question, some say blue, some say yellow” is just a sequence of tokens that would be super unlikely, it doesn't match the statistics of the training data at all.

I especially liked this answer.

29.11.2024 20:55 👍 2 🔁 0 💬 1 📌 0
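
The blue/yellow answer quoted in the post above can be illustrated with a toy sampler. This is only a minimal sketch under stated assumptions, not how any real model works internally: `label_counts` and `sample_answer` are hypothetical names, and a real LLM samples token by token from a learned distribution rather than from a table of label counts.

```python
import random
from collections import Counter

# Hypothetical training labels: 100 labellers said "blue", 100 said "yellow".
label_counts = {"blue": 100, "yellow": 100}


def sample_answer(counts, rng):
    """Draw one answer with probability proportional to its label count."""
    answers = list(counts)
    weights = [counts[a] for a in answers]
    return rng.choices(answers, weights=weights, k=1)[0]


rng = random.Random(0)  # fixed seed so repeated runs give the same draws
draws = Counter(sample_answer(label_counts, rng) for _ in range(10_000))
print(draws)
```

Each draw is either "blue" or "yellow", split roughly evenly. A hedged "some say blue, some say yellow" reply never appears, because that string is not among the labelled answers being sampled.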
Text from Andrej Karpathy in Twitter: People have too inflated sense of what it means to "ask an AI" about something. The AI are language models trained basically by imitation on data from human labelers. Instead of the mysticism of "asking an AI", think of it more as "asking the average data labeler" on the internet.

Few caveats apply because e.g. in many domains (e.g. code, math, creative writing) the companies hire skilled data labelers (so think of it as asking them instead), and this is not 100% true when reinforcement learning is involved, though I have an earlier rant on how RLHF is just barely RL, and "actual RL" is still too early and/or constrained to domains that offer easy reward functions (math etc.).

But roughly speaking (and today), you're not asking some magical AI. You're asking a human data labeler. Whose average essence was lossily distilled into statistical token tumblers that are LLMs. This can still be super useful ofc ourse. Post triggered by someone suggesting we ask an AI how to run the government etc. TLDR you're not asking an AI, you're asking some mashup spirit of its average data labeler.

Andrej is one of the reasons why I still check Twitter.

29.11.2024 20:54 👍 2 🔁 0 💬 1 📌 0

This is really nice!

29.11.2024 17:06 👍 2 🔁 1 💬 0 📌 0
Simpson's paradox - Wikipedia

P.S. About the first message: en.wikipedia.org/wiki/Simpson...

29.11.2024 12:05 👍 1 🔁 0 💬 0 📌 0