Pekka Lund (@pekka) — bluesky.baby

a pixelated image with the words alright don t jump ALT: a pixelated image with the words alright don t jump

Sehän se ongelma onkin, kun me seurataan Annaa ja Anna seuraa Laamista ja Laamis seuraa omaa häntäänsä tai jotain.

Tää on nyt joku kaikkien feedeille levinnyt Lemmings-tilanne.

06.03.2026 21:09 👍 2 🔁 0 💬 0 📌 0

GPT-5.4 Pro (xhigh) also improved CritPt record from Gemini 3.1 Pro's 17% to 30%. OpenAI appears to have an edge on the hardest math and physics reasoning tasks.

"CritPt evaluates language models on solving unpublished, frontier-level physics problems that require genuine research-scale reasoning."

06.03.2026 20:16 👍 0 🔁 1 💬 0 📌 0

Or DeepSeek's.

That's totally realistic, right? Surely being this late only signals they have used enough time to make it the best of them all? I'm not over-optimistic or delusional at all, am I?

06.03.2026 19:28 👍 2 🔁 0 💬 0 📌 0

Menee vielä hetki siihen vaiheeseen. Katos kun kuulin, että yrityksen kannattaa varmistaa toimitusketju ja maksimoida tuotot vertikaalisella integraatiolla.

Eli siis odottelen, että pellot sulaa, niin pääsen kylvämään vehnää. Ja pitää löytää sellainen pelto, ettei omistaja käy siellä ennen satoa.

06.03.2026 19:21 👍 3 🔁 0 💬 2 📌 0

Näyttää epätasaisesti pursotetulta lenkkimakkaralta mistä kasvaa karvaa.

Ei millään pahalla Laamis ja olet varmaan kuullut pahempaakin.

06.03.2026 19:17 👍 3 🔁 0 💬 2 📌 0

Se siitä ruokahalusta.

06.03.2026 19:07 👍 4 🔁 0 💬 1 📌 0

Jos nyt ihan rehellisiä ollaan, niin aiemmin ajattelin pahaa. Mutta @kissankuono.bsky.social on onnistunut vakuuttamaan, että kyllä sinussakin hyvää on. Rosmariinin myötävaikutuksella.

06.03.2026 19:02 👍 2 🔁 0 💬 1 📌 0

It's now Saturday in Hangzhou, China. All hope is lost.

This must be what it feels like to wait for the second coming of a dude who's been dead for 2000 years.

06.03.2026 16:10 👍 13 🔁 0 💬 0 📌 0

GPT-5.4 (xhigh) narrowly missed the top spot held by Gemini 3.1 Pro on Artificial Analysis Intelligence Index.

But GPT-5.4 generated 120M tokens on the benchmarks, at 72 tokens/s, which cost $2950. Gemini used just 57M, at 100 tokens/s, and cost $892.

06.03.2026 12:40 👍 18 🔁 5 💬 2 📌 0

Oma läppärini on ihan tavanomainen keskitason peliläppäri mutta tungin siihen 96GB muistia silloin kun se kustansi noin 200€. Nyt reilua vuotta myöhemmin tasan samat muistipalikat maksaa samassa kaupassa tonnin enemmän.

06.03.2026 12:30 👍 1 🔁 0 💬 1 📌 0

DeepSeek R1:n julkaisun aikaan joku rakenteli jenkkilän hinnoin $6000 maksavan serverin, jossa oli 768GB muistia, jolla sai pyöriteltyä silloin tuota aika lähellä kärkeä ollutta LLM:ää 6-8 tok/s vauhdilla. Nyt R1 on tietysti kaukana kärjestä ja muisti maksaa ihan eri luokkaa...

06.03.2026 12:30 👍 1 🔁 0 💬 1 📌 0

Is it a bad assumption? Should it assume I'm an idiot when I ask that?

And given that there are e.g. superchargers at same locations as car washes, it could make sense to have a car there if it's just a 5-minute walk.

06.03.2026 02:09 👍 1 🔁 0 💬 0 📌 0

I asked it the same. It said walk.

I asked it to walk me through the whole process. It said walk to the car wash...find your car...

I asked it why is my car there? It said:
"Because in the earlier plan, I assumed you had already left it at or near the car wash. That was a bad assumption."

06.03.2026 02:03 👍 2 🔁 0 💬 2 📌 0

Could they compromise and only let the bot kill half of the people it wants to kill automatically?

06.03.2026 01:21 👍 8 🔁 0 💬 1 📌 0

We haven't given up hope yet?

Just checking, because I'm not sure myself anymore.

06.03.2026 01:09 👍 3 🔁 0 💬 0 📌 0

Call it denial.

Apparently written by someone who hasn't used AI voice modes or seen them asking follow-up questions and so on.

06.03.2026 01:08 👍 1 🔁 0 💬 1 📌 0

Finally the European approach makes sense!

06.03.2026 00:31 👍 5 🔁 0 💬 0 📌 0

Arena.ai has now revealed the identities of those stealth models in my chat history, and they were indeed gpt-5.4 and gpt-5.4-high.

But one mystery model named "march26-chatbot1" hasn't changed its identity.

06.03.2026 00:29 👍 2 🔁 0 💬 0 📌 0

12kk on AI-maailmassa nykyajan ja kivikauden ero.

Itselläni ei mennyt kuin muutama prompti kun Mistral alkoi sylkeä yllättäin kiinaa. Mistralin kokeilut on muutoin jäänyt hyvin vähiin, vaikka olen testaillut ja käytellyt noita varsin kattavasti.

Pääasiallisesti käytössä nyt Gemini 3.1 Pro.

06.03.2026 00:20 👍 0 🔁 0 💬 1 📌 0

AI Model & API Providers Analysis | Artificial Analysis Comparison and analysis of AI models and API hosting providers. Independent benchmarks across key performance metrics including quality, price, output speed & latency.

No jos katsot vaikka täältä mihin Mistral Large 3 eli Mistralin suurin ja kyvykkäin malli sijoittuu, niin edellä on mm. gpt-oss-20B, gpt-oss-120B ja Qwen3.5 27B. Kaikkia noita olen pyöritellyt omalla läppärilläni. Nopeita eivät toki paikallisesti ole PC-raudalla.

06.03.2026 00:17 👍 0 🔁 0 💬 1 📌 0

This could be bad news for European frontier AI labs too. If we had any.

06.03.2026 00:10 👍 5 🔁 0 💬 1 📌 0

Introducing GPT-5.3-Codex GPT-5.3-Codex is a Codex-native agent that pairs frontier coding performance with general reasoning to support long-horizon, real-world technical work.

When they released 5.3 Codex, they didn't even reveal benchmarks that wouldn't be closely connected to coding.

So I think if some version was rushed, it would be more likely 5.3 Codex as an intermediate drop as a response to Anthropic.

05.03.2026 23:43 👍 2 🔁 0 💬 0 📌 0

They only released Codex version of 5.3. I expected them to release regular 5.3 earlier than this. But now they apparently "merged our codex & mainline models".

Some OpenAI folks (purposefully?) leaked UI images showing 5.4 on February 24/25, before the bad PR.

05.03.2026 23:41 👍 1 🔁 0 💬 1 📌 0

I didn't directly answer to this part:

"there is no evidence those topologies match neural ones."

But the target and point of UAT is to replicate functionality, not the implementation details including topology.

05.03.2026 23:23 👍 0 🔁 0 💬 0 📌 0

I don't think that's the problem here. Although I didn't quite understand what you meant by "topology within the matrix". When I asked Gemini to reason what that could mean, it suspected you might be confusing Universal Approximation Theory to Lottery Ticket Hypothesis.

05.03.2026 23:23 👍 0 🔁 0 💬 1 📌 0

Laamalla oli radioaktiivinen kissa. Sillä lienee nyt tieteelle tuntemattomia kytkentöjä aivoista muihin sisäelimiinkin.

05.03.2026 22:56 👍 3 🔁 0 💬 1 📌 0

Flashed face distortion effect - Wikipedia

Jep, todella jännä efekti. Ja silti vasta toinen sija vuoden 2012 parhaiden illuusioiden kilpailussa.

05.03.2026 22:48 👍 2 🔁 0 💬 0 📌 0

For Tier 4 that is.

05.03.2026 22:39 👍 0 🔁 0 💬 0 📌 0

And it keeps solving more problems in the held-out set than in the set OpenAI has.

05.03.2026 22:39 👍 0 🔁 0 💬 1 📌 0

Sellaisen johtopäätöksen sijoittajat näyttävät tehneen, kun nuo ovat näemmä keränneet ainakin 40 milliä.

Siihen nähden se lähettämäsi yksi vaivainen milli vaikuttaa nyt lähinnä epäluottamuslauseelta.

05.03.2026 22:07 👍 2 🔁 0 💬 1 📌 0

Pekka Lund

Latest posts by Pekka Lund @pekka