Max Spero (@max.pangram.com)

No problem!

28.02.2026 02:08 👍 2 🔁 0 💬 0 📌 0

ESPERANTO: Evaluating Synthesized Phrases to Enhance Robustness in AI Detection for Text Origination
While large language models (LLMs) exhibit significant utility across various domains, they simultaneously are susceptible to exploitation for unethical purposes, including academic misconduct and dis...

Ah, that's fair. I think the authors consider adversarial robustness to be out of scope; there are other papers that measure this better:

- arxiv.org/abs/2409.14285
- arxiv.org/html/2501.03... (ours)

28.02.2026 02:07 👍 2 🔁 0 💬 0 📌 0

EditLens: Quantifying the Extent of AI Editing in Text
A significant proportion of queries to large language models ask them to edit user-provided text, rather than generate new text from scratch. While previous work focuses on detecting fully AI-generate...

Oh nooo unfortunately the AI answer is confidently wrong. We don't use any perplexity or burstiness metrics. It's a specialized ML algorithm trained to detect AI/Human/AI-assisted with a setup that measures the magnitude of AI edits
arxiv.org/abs/2510.03154

28.02.2026 01:55 👍 4 🔁 0 💬 2 📌 0

I'm glad you cited this study, curious how you think it's flawed?

28.02.2026 01:52 👍 1 🔁 0 💬 2 📌 0

Founder of Pangram here to comment on a couple of points:
- Human writing flagged as AI? That's a big problem: we benchmark at a 1 in 10,000 false positive rate and stand by it. Was this real-world writing?
- We are 12 people: 2 in sales, 1 in marketing, and the rest of the team is technical

28.02.2026 01:49 👍 1 🔁 0 💬 0 📌 0
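To put the 1 in 10,000 figure in perspective, here is a minimal sketch of what that rate would imply at volume. The rate is the one quoted in the post; the 50,000-document count is a made-up illustration, and the calculation assumes every document in the batch is human-written.

```python
from fractions import Fraction

# False positive rate quoted in the post: 1 in 10,000 human-written documents.
FALSE_POSITIVE_RATE = Fraction(1, 10_000)


def expected_false_flags(human_docs: int, rate: Fraction = FALSE_POSITIVE_RATE) -> Fraction:
    """Expected number of human-written documents wrongly flagged as AI."""
    return human_docs * rate


# Hypothetical volume, chosen only for illustration.
example_docs = 50_000
example_flags = expected_false_flags(example_docs)
print(f"{example_docs} human documents -> about {float(example_flags):g} expected false flags")
```

This is an expectation, not a guarantee: any single batch could contain more or fewer false flags.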

Pangram is releasing a new model today!

Pangram 3.2 is a significant improvement over 3.1 in several aspects
- Improved performance on 'humanized' text
- Improved performance on adversarial prompts
- Minimum word count reduced from 75 to 50
- Overall improvement on Claude 4.6 recall

27.02.2026 21:46 👍 4 🔁 1 💬 1 📌 0

Wow this is so cool

09.02.2026 22:40 👍 9 🔁 0 💬 0 📌 0

Would 100% be possible to do a labeler based on reports. Probably too pricey for all content.

Bsky bot is still on my side project to-do list

08.02.2026 23:49 👍 8 🔁 0 💬 1 📌 0

ai dot com purchased for $70m.
METR evals going exponential.

former crypto grifters signalling we're in a bubble

big labs signalling recursive self improvement

I wonder, if the bubble pops, will it even matter?

07.02.2026 07:00 👍 2 🔁 0 💬 2 📌 0

I'm not seeing the self-evident part of your argument

05.02.2026 04:41 👍 1 🔁 0 💬 1 📌 0

Me too. I really hope it's the former

05.02.2026 00:29 👍 2 🔁 0 💬 0 📌 0

I get why Google and Meta don't always have the data to do this, but it's inexcusable when Amazon shows me ads for things that I just bought

05.02.2026 00:06 👍 1 🔁 0 💬 1 📌 0

I was going to write a whole effortpost about model evaluations but instead here, just take this chart

04.02.2026 20:32 👍 188 🔁 8 💬 32 📌 5

But overall, I believe that an open Internet and free technologies like ChatGPT will bring so much good into the world, and ads are the only way to make tech like this universally free.

04.02.2026 23:39 👍 0 🔁 0 💬 0 📌 0

Yes, there are lazy and bad ways to do ads. One of the worst things companies can do is to show ads to already-paying customers. This happens most often when consumers have no alternative to a monopoly.

04.02.2026 23:39 👍 0 🔁 0 💬 1 📌 0

Digital ads have better targeting and attribution. Well-targeted ads cost more, meaning that more marketing dollars are exhausted on fewer ad impressions. A world where you see half as many ads because they are twice as relevant is a world in which everybody is happier.

04.02.2026 23:39 👍 0 🔁 0 💬 2 📌 0

Ads enable a powerful technology to become a global utility. They are the great equalizer: even the poorest user can pay for the service with their attention.

04.02.2026 23:39 👍 0 🔁 0 💬 1 📌 0

Digital ads are one of the greatest positive-sum technologies of the 21st century.

Google Search has 4 billion users, and they make on average $40 per user per year. This type of business model couldn't exist without ads; Google cannot find 4 billion people who would pay $40 for ad-free search.

04.02.2026 23:39 👍 3 🔁 0 💬 2 📌 0
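The arithmetic behind that claim can be sketched as follows. The user count and per-user revenue are the figures quoted in the post, not independently verified; the 5% paying share is a purely hypothetical assumption used to show how quickly a subscription price would have to rise.

```python
# Figures as quoted in the post (not independently verified).
USERS = 4_000_000_000
REVENUE_PER_USER_USD = 40


def implied_annual_revenue(users: int, revenue_per_user: int) -> int:
    """Total yearly revenue implied by a per-user average."""
    return users * revenue_per_user


def break_even_price(revenue_per_user: int, paying_percent: int) -> float:
    """Yearly price each subscriber would need to pay to match the same
    total revenue if only paying_percent of users subscribed."""
    return revenue_per_user * 100 / paying_percent


total = implied_annual_revenue(USERS, REVENUE_PER_USER_USD)
print(f"Implied revenue: ${total:,} per year")

# Hypothetical: only 5% of users are willing to pay.
print(f"Break-even price at 5% paying: ${break_even_price(REVENUE_PER_USER_USD, 5):,.0f} per year")
```

Under those assumptions the implied total is $160 billion per year, and a 5% paying share would need $800 per subscriber per year to match it.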

Allow me to introduce cases where human intelligence is not fixed

04.02.2026 23:37 👍 6 🔁 0 💬 1 📌 0

Hurts my heart... Poor llm

31.01.2026 21:24 👍 2 🔁 0 💬 0 📌 0

Lmao

31.01.2026 16:13 👍 1 🔁 0 💬 0 📌 0

www.reddit.com/r/LocalLLaMA...

31.01.2026 14:43 👍 19 🔁 0 💬 0 📌 0

Pretty cool project on /r/localllama - they take human written text and sloppify it 10x with 4o-mini, then train the model to de-slop by reversing the transformation

31.01.2026 14:43 👍 160 🔁 14 💬 2 📌 4
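A rough sketch of how that training data could be built, reading "10x" as ten successive rewrite passes (an assumption; the post does not say). The function names and the `toy_rewrite` stand-in are hypothetical: a real pipeline would call an LLM such as 4o-mini where the placeholder is, and this is not the project's actual code.

```python
from typing import Callable, List, Tuple


def sloppify(text: str, rewrite: Callable[[str], str], rounds: int = 10) -> str:
    """Apply a rewriting function repeatedly to degrade the text."""
    for _ in range(rounds):
        text = rewrite(text)
    return text


def build_deslop_pairs(
    human_texts: List[str],
    rewrite: Callable[[str], str],
    rounds: int = 10,
) -> List[Tuple[str, str]]:
    """Return (input, target) pairs with the transformation reversed:
    the sloppified text is the model input, the human original is the target."""
    return [(sloppify(t, rewrite, rounds), t) for t in human_texts]


def toy_rewrite(text: str) -> str:
    # Placeholder for an LLM call; appends filler so the effect is visible.
    return text + " Moreover, it is worth noting."


pairs = build_deslop_pairs(["The cat sat."], toy_rewrite, rounds=2)
for slop, original in pairs:
    print("input :", slop)
    print("target:", original)
```

The appeal of this setup is that the targets are known-good human text, so no hand-labeled "de-slopped" examples are needed.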

extremely prescient

30.01.2026 21:29 👍 1 🔁 0 💬 0 📌 0

Thank you!

30.01.2026 19:17 👍 1 🔁 0 💬 0 📌 0

There are only two hard problems in computer science: building artificial superintelligence, and naming things

30.01.2026 16:10 👍 67 🔁 6 💬 2 📌 1

🚨 New Study 🚨

@arxiv.bsky.social has recently decided to prohibit any 'position' paper from being submitted to its CS servers.
Why? Because of the "AI slop", and allegedly higher ratios of LLM-generated content in review papers, compared to non-review papers.

29.01.2026 14:00 👍 29 🔁 9 💬 2 📌 2

Thanks I appreciate that!

28.01.2026 00:51 👍 1 🔁 0 💬 0 📌 0

Omg I didn't even clock that

28.01.2026 00:48 👍 7 🔁 0 💬 1 📌 1

I will let you know when I try atproto! Surely it's better documented than the X API

28.01.2026 00:42 👍 4 🔁 0 💬 1 📌 0
