Stuart Gray's Avatar

Stuart Gray

@sgray

He/Him. AI Wrangler. Web Geek. F1 Fan. All views my own. ๐Ÿค– AI, LLMs, GenAI, NLP ๐Ÿ Python Dev ๐Ÿš€ Indie Hacker ๐ŸŽฎ Game Dev, ProcGen, Unity, C# ๐ŸŽ๏ธ F1 Fan ๐Ÿ‡ฌ๐Ÿ‡ง UK Based ๐Ÿฆฃ mastodonapp.uk/@StuartGray โœ–๏ธ x.com/StuartGray (inactive)

595
Followers
1,423
Following
1,068
Posts
06.02.2024
Joined
Posts Following

Latest posts by Stuart Gray @sgray

Preview
New York considers bill that would ban chatbots from giving legal, medical advice | StateScoop A bill under consideration in New York would provide a private right of action, allowing people to file lawsuits against chatbot owners who violate the law.

The latest NY chatbot bill would bar chatbots from conveying information that could fall within the scope of a licensed profession.

Itโ€™s basically a censorship bill disguised as licensure protection.

statescoop.com/new-york-bil...

06.03.2026 05:03 ๐Ÿ‘ 74 ๐Ÿ” 14 ๐Ÿ’ฌ 28 ๐Ÿ“Œ 17

Iโ€™m not discounting any of that, Iโ€™m simply focused on the lack of prompt injection in the wild.

Donโ€™t you think itโ€™s slightly strange we havenโ€™t heard it mentioned in post-incident reviews?

05.03.2026 22:10 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

people hawking โ€œsecureโ€ email are not to be trusted, exhibit 9000

05.03.2026 20:43 ๐Ÿ‘ 41 ๐Ÿ” 10 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0

The interesting part of all this to me is the prompt injection.

Itโ€™s a well known LLM issue, and thereโ€™s been a lot of speculation about why we havenโ€™t seen prominent examples of it deployed in anger in the wild, not just a PoC.

This is the first Iโ€™ve seen.

Seen any others? @simonwillison.net

05.03.2026 21:22 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0
Preview
ICO writes to Meta over 'concerning' AI smart glasses report Videos, including of glasses-wearers using the toilet or having sex, are sometimes reviewed by a Kenya-based subcontractor.

Last year when I was checking into a hotel, the desk person was wearing Meta glasses. I kindly asked them to take them off. They were annoyed. I said, โ€œI do not consent to you looking at my credit card and ID with Meta glasses on.โ€ My instincts were correct: www.bbc.com/news/article...

05.03.2026 15:27 ๐Ÿ‘ 5779 ๐Ÿ” 2314 ๐Ÿ’ฌ 89 ๐Ÿ“Œ 178
Can coding agents relicense open source through a โ€œclean roomโ€ implementation of code? Over the past few months itโ€™s become clear that coding agents are extraordinarily good at building a weird version of a โ€œclean roomโ€ implementation of code. The most famous version โ€ฆ

As usual, @simonwillison.net to the rescue simonwillison.net/2026/Mar/5/c...

05.03.2026 17:40 ๐Ÿ‘ 7 ๐Ÿ” 4 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1

Interesting discussion on HN. If I see a painting of a sunset, and I paint a sunset, โ‰  copyright violation. If I study a codebase (or a closed-source end product) and go off and rewrite it on my own, โ‰  license violation. Does this change if I use a coding agent to help me?

05.03.2026 13:57 ๐Ÿ‘ 11 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

I worked in retail while I was at college ~35 years ago.

I canโ€™t say that organised theft rings was a thing back then, but we had some very brazen & prolific shoplifters - quiet spot, large bag, slide everything of a clothing rail into it & away.

I assume it was sold at car boot sales back then.

05.03.2026 15:13 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

instructive to compare the default outputs in diff. languages across a range of dev tasks.

That shows you where the quality floor is, and what you get by default if you donโ€™t have strict guidance or prompts covering it.

05.03.2026 13:38 ๐Ÿ‘ 3 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Language choice def matters, and varies somewhat between models.

Iโ€™ve not tested go so Iโ€™m not sure where it tends to sit support wise, but generally speaking Python is nearly always best supported & Rust tends to sit in the middle of the pack.

They both improve with guidance, but itโ€™s especially

05.03.2026 13:38 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Iโ€™m not sure about YouTube but I assume itโ€™s a combination of the format, the types of video that gain most views, huge volume making switching cheap & easy, and conveying that in a single image.

Closest analogy I can think of is those cheap weekly soap/gossip-focused magazine covers in newsagents.

05.03.2026 09:20 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

1. A short thread on a Bluesky phenomenon that might be described as "They are a dead-eyed cultist who must be cast out lest the heresy take root!" OP has blocked me for mocking them - I'd usually obscure their name but since they themselves were quote-dunking to demand someone else be blocked ...

04.03.2026 13:57 ๐Ÿ‘ 659 ๐Ÿ” 145 ๐Ÿ’ฌ 53 ๐Ÿ“Œ 79

This is conflating two related but separate things.

Yes, the questions have been around for a while across all models.

The question as posed was about an increase in their number, not claiming they were new.

04.03.2026 18:56 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Interestingโ€ฆ I wonder if this is a direct result of OpenAI introducing advertising to ChatGPT?

Pretty much every website or app that relies on Ad revenue introduces UI patterns designed to increase use & retention, with a goal of serving more ads in the process.

Itโ€™s hard to conclude otherwise.

04.03.2026 18:16 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

โ€œNow we have a faster horseโ€, to shred that infamous Ford quote.

03.03.2026 15:38 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
How Claude remembers your project - Claude Code Docs Give Claude persistent instructions with CLAUDE.md files, and let Claude accumulate learnings automatically with auto memory.

The docs do describe nested directory support, and also multiple files outside your project (for cross project content):

โ€œCLAUDE.md files in subdirectories load on demand when Claude reads files in those directories.โ€

code.claude.com/docs/en/memo...

03.03.2026 15:24 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Video thumbnail

Secretary Rubio's remarks indicate that Israel put U.S. forces in harm's way by insisting on attacking Iran. And the administration was complicitโ€”joining their war instead of talking them down.

This is unacceptable of the President, and unacceptable of a country that calls itself our ally.

02.03.2026 22:10 ๐Ÿ‘ 2135 ๐Ÿ” 782 ๐Ÿ’ฌ 206 ๐Ÿ“Œ 127

The end result.

github.com/dollspace-ga...

01.03.2026 19:05 ๐Ÿ‘ 77 ๐Ÿ” 4 ๐Ÿ’ฌ 7 ๐Ÿ“Œ 2

It makes it look like the majority of Sonnet 4.6s improvement, and to a lesser degree, Opus 4.6s, rides of the back of increased reasoning.

A factor thatโ€™s recently been gaining attention because itโ€™s not captured by benchmarks, leaving a โ€˜loopholeโ€™ that makes direct comparisons harder :/

01.03.2026 18:28 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Not sure what Anthropicโ€™s intention is, but the initial benchmark testing showed that Sonnet 4.6 used double the number of tokens Opus 4.6 did to reach their respective scores

Sonnet 4.6 vs. 4.5 was worse at nearly 5x more tokens.

Obv. RW results will depend on your prompts/use cases.

01.03.2026 18:25 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

I see Trump is continuing to follow the tried and tested mafia-style playbook.

One of his favourites:

โ€œNice [country/state/institution/corporation/job] youโ€™ve got there, itโ€™d be a real shame if anything happened to it.โ€

01.03.2026 16:56 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Woozle effect

The Woozle effect, also known as evidence by citation, occurs when a source is widely cited for a claim that the source does not adequately support, giving said claim undeserved credibility

Woozle effect The Woozle effect, also known as evidence by citation, occurs when a source is widely cited for a claim that the source does not adequately support, giving said claim undeserved credibility

I had to look this up

24.02.2026 11:34 ๐Ÿ‘ 49 ๐Ÿ” 7 ๐Ÿ’ฌ 4 ๐Ÿ“Œ 3
Interactive explanations - Agentic Engineering Patterns - Simon Willison's Weblog

New chapter of my Agentic Engineering Patterns guide. This one is about having coding agents build custom interactive and animated explanations to help fight back against cognitive debt simonwillison.net/guides/agent...

28.02.2026 23:14 ๐Ÿ‘ 165 ๐Ÿ” 11 ๐Ÿ’ฌ 13 ๐Ÿ“Œ 1

This shouldnโ€™t be a surprise for anyone paying even a little bit of attention to recent US & UK politics.

However, it seems clear Reform are going to cry foul over every local & national election loss going forward, and use it to further whip up racial division through a compliant right wing media.

01.03.2026 09:28 ๐Ÿ‘ 5 ๐Ÿ” 5 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
Statement on the comments from Secretary of War Pete Hegseth Anthropic's response to the Secretary of War and advice for customers

Incredibly proud of our team at @anthropic.com and the work weโ€™re doing. In many ways, this shows how fortunate I am to be working alongside some of the smartest folks in the industry that donโ€™t shy away from doing the right thing. Itโ€™s a one-of-a-kind place.

www.anthropic.com/news/stateme...

28.02.2026 02:48 ๐Ÿ‘ 104 ๐Ÿ” 10 ๐Ÿ’ฌ 3 ๐Ÿ“Œ 2

Doll would really love it if everyone got chainlink to the required levels so doll could quallify to get this. github.com/dollspace-ga...

27.02.2026 16:45 ๐Ÿ‘ 59 ๐Ÿ” 16 ๐Ÿ’ฌ 10 ๐Ÿ“Œ 6

canโ€™t compete on technical excellence, but equally a very telling one once you know the driver.

Itโ€™s the same one lawyers use when they have a weak legal case and appeal to emotion & public reaction instead.

27.02.2026 20:28 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Itโ€™s also a source of constant amusement that, despite his billions & self proclaimed brilliance, he continues to lag behind all frontier model providers.

And that the inherent nature of the product makes it impossible to hide.

Instead he resorts to shock & awe, which is a valid tactic when you

27.02.2026 20:28 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0

Have you/Central seen this?

Might be a useful alternative, with code available on their GitHub:

bsky.app/profile/isol...

27.02.2026 14:15 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
Text-to-LoRA: Instant Transformer Adaption While Foundation Models provide a general tool for rapid content creation, they regularly require task-specific adaptation. Traditionally, this exercise involves careful curation of datasets and repea...

Sakana has developed a way to, if I understand correctly, instantly generate LORAs on demand from long texts or documents

arxiv.org/abs/2506.06105
arxiv.org/abs/2602.15902

27.02.2026 05:51 ๐Ÿ‘ 54 ๐Ÿ” 6 ๐Ÿ’ฌ 3 ๐Ÿ“Œ 4