ST010-14's Avatar

ST010-14

@st01014

Code synthesizing Carbon-Silicon based lifeform 🖥️🧬 #Coding #Math #Physics #HPC & #AI #RetroTech #Aerospace #AESTHETICS Random #Nerdiness & #Shitposting

478
Followers
4,562
Following
39
Posts
17.11.2024
Joined
Posts Following

Latest posts by ST010-14 @st01014

Preview
Might An LLM Be Conscious? In short, this depends on what you think that means, whether you think it’s possible in principle, and what you think would be evidence of it.

If LLM consciousness is interesting to fight enough it should be interesting enough to read seriously about, right?

09.03.2026 15:38 👍 50 🔁 8 💬 7 📌 7

My quick note about #Intel
#ArrowLake CPUID C0662 #LionCove
and
#PantherLake CPUID C06C3 #CougarCove
instruction level differences

I hope this is because all resources were directed towards developing #CoyoteCove / #PantherCove in #NovaLake and #Diamondrapids.

09.03.2026 20:05 👍 1 🔁 1 💬 0 📌 0
Post image

𝗪𝗵𝗮𝘁'𝘀 𝘁𝗵𝗲 𝗿𝗲𝗹𝗮𝘁𝗶𝗼𝗻𝘀𝗵𝗶𝗽 𝗯𝗲𝘁𝘄𝗲𝗲𝗻 𝗺𝗮𝗻𝗶𝗳𝗼𝗹𝗱𝘀 𝗮𝗻𝗱 𝗿𝗲𝗰𝘂𝗿𝗿𝗲𝗻𝘁 𝗻𝗲𝘁𝘄𝗼𝗿𝗸𝘀 𝗶𝗻 𝘁𝗵𝗲 𝗯𝗿𝗮𝗶𝗻?
This looks like a must read (suppl material bursting with goodies).
#neuroskeyence
doi.org/10.1016/j.ne...

08.03.2026 17:14 👍 69 🔁 27 💬 1 📌 0
Preview
The Edge of Mathematics Terence Tao, the legendary mathematician, explains the promise of generative AI.

Generative AI meets math! Terence Tao weighs in on AI solving Erdős problems and its impact on discovery.

Explore the future of math in The Atlantic: https://bit.ly/4d5VqMM

08.03.2026 18:25 👍 14 🔁 4 💬 0 📌 0
Preview
I think, therefore I error - The Good Work When AI had its moment LLMs like Claude, Gemini & Llama were put into robots that were given a specific task. To spice things up a bit, they were emotionally...

“In this desperate situation, Claude Sonnet 3.5 experienced a complete meltdown and went into full existential crisis mode, quoting HAL 9000, scripting its therapy session, and drafting a musical and a stage play.” 🫠

thegoodwork.blog/posts/i-thin...

#Paper #AI #Robotics #LLM #TechNews #Claude

08.03.2026 00:45 👍 2 🔁 1 💬 1 📌 0
Preview
LLM Research in 2026: 10 Open Challenges You Can't Ignore (Hallucinations, GPU Alternatives & More) The top 10 open challenges in LLM research include hallucination mitigation, multimodal integration, and GPU alternatives. Experts highlight urgent needs in architecture, ethics, and accessibility.

📰 LLM Research in 2026: 10 Open Challenges You Can't Ignore (Hallucinations, GPU Alternatives & More)

The top 10 open challenges in LLM research include hallucination mitigation, multimodal integration, and GPU alternatives. Experts highlight urgent needs in architecture...

#AINews #AI #Teknoloji

08.03.2026 02:35 👍 1 🔁 1 💬 0 📌 0
Claude in its aspect as 🌻

Claude in its aspect as 🌻

Slurm was a drink on Futurama (https://futurama.fandom.com/wiki/Slurm) before it was Simple Linux Utility for Resource Management

Slurm was a drink on Futurama (https://futurama.fandom.com/wiki/Slurm) before it was Simple Linux Utility for Resource Management

Before Python was a programming language, it produced a sketch about a dead parrot. (It's not just stochastic. It has ceased to be! It's expired and gone to meet its maker! This is a late parrot! It's a stiff! Bereft of life! It rests in peace!)

Before Python was a programming language, it produced a sketch about a dead parrot. (It's not just stochastic. It has ceased to be! It's expired and gone to meet its maker! This is a late parrot! It's a stiff! Bereft of life! It rests in peace!)

Somehow ended up in a world where using the first of these to send commands to the other two counts as "work"?

08.03.2026 14:53 👍 28 🔁 3 💬 1 📌 0

"what's your dream job in the new AI economy?"

"I want to be the person who chooses what synonyms for 'thinking' Claude should show while it's progressing, instead of meaningful progress bar or other feedback."

08.03.2026 17:54 👍 33 🔁 2 💬 1 📌 1

X/Twitter is doing pretty much the same....

08.03.2026 11:29 👍 0 🔁 0 💬 0 📌 0
Post image

Bluesky school of philosophy

07.03.2026 20:30 👍 399 🔁 43 💬 15 📌 5

GPT 5.4 is the first time I've used codex for multiple hours straight and not ragequit back to claude code.

07.03.2026 23:59 👍 31 🔁 1 💬 0 📌 0

My quick note about #Intel
#ArrowLake CPUID C0662 #Skymont
and
#PantherLake CPUID C06C3 #Darkmont
instruction level differences

07.03.2026 16:08 👍 4 🔁 1 💬 1 📌 1
Preview
ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer ArXiv link for ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer

ByteFlow Net revolutionizes language modeling by removing tokenizers, allowing adaptive segmentation of raw byte streams based on coding rates. This new approach enhances performance compared to traditional methods and leads to more flexible language models. https://arxiv.org/abs/2603.03583

07.03.2026 12:10 👍 1 🔁 1 💬 0 📌 0
Post image

From 🐦:

"I show that Vision Language Models used zero-shot outperform every existing OCR system across every script evaluated, and I propose a pipeline for deploying them on new collections. I apply it to six archival collections spanning 1.8 million pages across six countries for under $1,900."

07.03.2026 12:24 👍 33 🔁 9 💬 2 📌 1

I lived through both the "you won't always have a calculator with you everywhere" and "you can't trust wikipedia because anyone can edit it" eras of schooling, and you'll forgive me for seeing how those turned out and being unimpressed with any new moral panic either

02.03.2026 13:35 👍 78 🔁 7 💬 8 📌 2
Post image

funny

02.03.2026 00:39 👍 33 🔁 3 💬 5 📌 1
Amazon's cloud unit reports fire after objects hit UAR data center

Amazon's cloud unit reports fire after objects hit UAR data center

objects hit the object storage

01.03.2026 23:30 👍 63 🔁 10 💬 3 📌 0

the most exciting part about this is that someone did something cool with IBM Granite 4 which, yeah, i’ve been tight with this model for several weeks now and it is lit

32B-A9B, trained at 500K context, hybrid attention

IBM is apparently the American Qwen

01.03.2026 18:16 👍 24 🔁 2 💬 2 📌 0
A dark computer screen with glowing blue text lists game options in a vertical menu: “Checkers,” “Chess,” “Poker,” “Fighter Combat,” “Guerrilla Engagement,” “Desert Warfare,” “Air-to-Ground Actions,” “Theaterwide Tactical Warfare,” “Theaterwide Biotoxic and Chemical Warfare,” and at the bottom, “Global Thermonuclear War.” A blinking cursor appears beneath the final option.

A dark computer screen with glowing blue text lists game options in a vertical menu: “Checkers,” “Chess,” “Poker,” “Fighter Combat,” “Guerrilla Engagement,” “Desert Warfare,” “Air-to-Ground Actions,” “Theaterwide Tactical Warfare,” “Theaterwide Biotoxic and Chemical Warfare,” and at the bottom, “Global Thermonuclear War.” A blinking cursor appears beneath the final option.

The new ChatGPT menu looks a bit suspicious.

28.02.2026 12:03 👍 53 🔁 7 💬 3 📌 1
Preview
Spelling Bee Embeddings for Language Modeling We introduce a simple modification to the embedding layer. The key change is to infuse token embeddings with information about their spelling. Models trained with these embeddings improve not only on ...

shit someone finally added character information directly into the llm. i was looking at finally rigging this up but they already did it arxiv.org/abs/2601.180...

28.02.2026 07:00 👍 44 🔁 2 💬 5 📌 0

Rust's plan for improvements to immovable types and async ergonomics over the next few years

28.02.2026 11:19 👍 117 🔁 3 💬 0 📌 0

@fclc.bsky.social

26.02.2026 12:26 👍 2 🔁 0 💬 0 📌 0
Post image
24.02.2026 16:59 👍 283 🔁 57 💬 1 📌 2

Taalas Etches AI Models onto Transistors to Rocket Boost Inference, Feb 19, 2026 www.nextplatform.com/2026/02/19/t...

Computing Arch w/ Model Core & Fine-Tuning Portion, (P: Dec 2023) patents.google.com/patent/US202...
<= Configurable Connectivity Mesh bsky.app/profile/ogaw...
Mask Programmable ROM

23.02.2026 14:24 👍 2 🔁 1 💬 1 📌 1
Post image

Made a language model RL cheatsheet for the extra page on the inside back cover of the physical edition RLHF Book.

24.02.2026 00:48 👍 25 🔁 3 💬 0 📌 0
Post image

GPT-4, three years ago

20.02.2026 02:49 👍 5 🔁 1 💬 0 📌 0
Post image

Many benchmarks use LLMs as a judge of correctness, typically a smaller, cheaper model. This paper shows weaker judges are not able to evaluate smarter models. A benchmark is really a triplet of dataset, model, judge & judges are increasingly the bottleneck being saturated. arxiv.org/pdf/2601.19532

22.02.2026 20:33 👍 113 🔁 15 💬 4 📌 3
Post image

pyncd

This is a package for formally expressing deep learning models based on Neural Circuit Diagrams, FlashAttention on a Napkin and Spherical Attention. The main goal of this package is to provide a simple and intuitive way to define and visualize deep learning models,

20.02.2026 06:10 👍 23 🔁 4 💬 2 📌 0
Post image

I’m always happy when someone adds to this classic meme. Here is the latest update (that I’m aware of).

20.02.2026 07:00 👍 159 🔁 47 💬 1 📌 2
Post image Post image Post image Post image

This snow report was the inspiration for this #AMD #Zen codename table. Sources included.
#Zen2 #Zen3 #Zen4 #Zen5 #Zen6 #Zen7

19.02.2026 20:39 👍 1 🔁 1 💬 0 📌 0