Amit Chaudhary (@amitness.com)

For pre-training data, this thread has good paper recommendations
bsky.app/profile/mari...

04.06.2025 07:27 👍 4 🔁 0 💬 0 📌 0

Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge) Use cases, techniques, alignment, finetuning, and critiques against LLM-evaluators.

Not academic work, but for evals and data, these survey articles are quite in-depth with links to papers.

LLM judge survey: eugeneyan.com/writing/llm-...
Synthetic pre-training/post-training survey: eugeneyan.com/writing/synt...

04.06.2025 07:26 👍 5 🔁 0 💬 2 📌 0

A Python package & command-line tool to gather text on the Web — Trafilatura 2.0.0 documentation Trafilatura is a Python package and command-line tool designed to gather text on the Web. Its main applications are web crawling, downloads, scraping, and extraction of main texts, comments and metada...

I had watched a talk from @thomwolf.bsky.social of @hf.co and they use trafilatura.readthedocs.io for the HTML to text conversion in their library datatrove (github.com/huggingface/...).

The talk focuses more on filtering, but here it is:
www.youtube.com/watch?v=2-SP...

10.05.2025 08:02 👍 3 🔁 0 💬 1 📌 0

👋

19.02.2025 15:08 👍 1 🔁 0 💬 0 📌 0

Wrote a package for the gpu-poor/mac-poor to run ollama via remote servers (paid, colab, kaggle etc.)

Just 2 lines and your local ollama can run all models on server-side GPU:

> pip install ollama-remote
> ollama-remote

github.com/amitness/oll...

16.02.2025 10:24 👍 5 🔁 0 💬 0 📌 0

Introspection for Function Calling Learn runtime introspection in Python to convert functions into JSON schema for function calling with LLMs

Wrote up a deep-dive on how @tool decorators in various agent frameworks leverage Python runtime introspection for function-to-JSON-schema conversion.

amitness.com/posts/functi...
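
A minimal sketch of the idea, not the code from the linked post: it uses only the standard library (`inspect`, `typing.get_type_hints`) to turn a function signature into a function-calling style schema. The helper `function_to_schema`, the small type mapping, and the example function `get_weather` are all hypothetical names made up for illustration.

```python
import inspect
from typing import get_type_hints

# Mapping from Python annotations to JSON Schema type names (deliberately small).
_JSON_TYPES = {str: "string", int: "integer", float: "number", bool: "boolean"}


def function_to_schema(func):
    """Build a function-calling style schema from a plain Python function."""
    signature = inspect.signature(func)
    hints = get_type_hints(func)
    properties = {}
    required = []
    for name, param in signature.parameters.items():
        # Fall back to "string" when an annotation is missing or unsupported.
        properties[name] = {"type": _JSON_TYPES.get(hints.get(name), "string")}
        # Parameters without a default value are marked as required.
        if param.default is inspect.Parameter.empty:
            required.append(name)
    return {
        "name": func.__name__,
        "description": inspect.getdoc(func) or "",
        "parameters": {
            "type": "object",
            "properties": properties,
            "required": required,
        },
    }


# Hypothetical example function, used only to exercise the helper.
def get_weather(city: str, days: int = 1) -> str:
    """Return a weather forecast for a city."""
    return f"Forecast for {city} over {days} day(s)"


schema = function_to_schema(get_weather)
```

Real frameworks handle many more cases (nested types, optional values, per-parameter descriptions); this only shows the introspection step.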

15.02.2025 17:45 👍 3 🔁 0 💬 0 📌 0

Sure, SFT is simulating annotators from those countries

But you see this repeatedly on Reddit/LinkedIn, where people downvote a comment and call it out as "sounds like ChatGPT" because it contains a phrase or syntax from the antislop list

Not accurate, as you pointed out, but that's what a layman uses as a proxy

14.02.2025 13:21 👍 0 🔁 0 💬 0 📌 0

Picking a few keywords from this antislop list:

github.com/sam-paech/an...

14.02.2025 12:57 👍 0 🔁 0 💬 1 📌 0

You actually don't need multiple --with. A comma separated list of packages also works (though looks a bit uglier)

uvx --with llm,sqlite-utils ipython

14.02.2025 12:52 👍 6 🔁 0 💬 0 📌 0

Evals for Diversity in Synthetic Data An overview of evaluation metrics for measuring linguistic diversity in LLM-generated synthetic data

Wrote a literature review on various automated evals for measuring linguistic diversity in LLM-generated synthetic data.

Useful for systematically testing the impact of various techniques on diversity

amitness.com/posts/divers...
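
As one illustration of what an automated diversity eval can look like, here is a sketch of distinct-n, a commonly used lexical diversity metric (unique n-grams divided by total n-grams). Whether the linked review covers this exact metric is an assumption, and the whitespace tokenization is a simplification.

```python
def distinct_n(texts, n=2):
    """Ratio of unique n-grams to total n-grams across all texts."""
    total = 0
    unique = set()
    for text in texts:
        tokens = text.lower().split()  # naive whitespace tokenization
        ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        total += len(ngrams)
        unique.update(ngrams)
    # An empty corpus (or texts shorter than n tokens) yields 0.0.
    return len(unique) / total if total else 0.0


repetitive = distinct_n(["the cat sat", "the cat sat"], n=2)
varied = distinct_n(["a b c", "d e f"], n=2)
```

A lower score means more repeated phrasing across the generated samples, so comparing scores before and after a prompt or sampling change gives a rough signal of its effect on diversity.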

09.02.2025 13:58 👍 8 🔁 0 💬 0 📌 0

You can do it with skyfeed + running your custom logic on github actions

bsky.app/profile/amit...

06.01.2025 13:48 👍 8 🔁 0 💬 0 📌 0

Same energy (h/t @hamel.bsky.social )

x.com/HamelHusain/...

30.12.2024 15:04 👍 22 🔁 0 💬 0 📌 0

Text Embeddings Reveal (Almost) As Much As Text How much private information do text embeddings reveal about the original text? We investigate the problem of embedding inversion, reconstructing the full text represented in dense text embed...

Reminded me of this paper: arxiv.org/abs/2310.06816

25.12.2024 12:49 👍 2 🔁 0 💬 1 📌 0

I just rely on these:
- alphasignal for daily updates
- email subs to blogs (eugeneyan, simonw, hamel, jasonliu)
- read O'Reilly for bird's-eye surveys (chip huyen's ai eng, jay's hands-on llm etc.)
- deeplearning.ai "short" courses to know what's out there (topics I don't touch at work e.g. agents)

19.12.2024 10:10 👍 1 🔁 0 💬 0 📌 0

how are you tackling the last 2 points?

19.12.2024 09:48 👍 1 🔁 0 💬 1 📌 0

That's super cool, I'll give it a try and thank you for building Skyfeed!

03.12.2024 08:03 👍 1 🔁 0 💬 0 📌 0

cc: @pfrazee.com
@simonwillison.net (another git scraping avenue)

01.12.2024 14:43 👍 6 🔁 0 💬 0 📌 0

Zero-Cost Custom Feeds on Bluesky A simple stack for generating custom feeds for Bluesky programmatically without a backend server

Wrote down the process to build your own custom feeds for Bluesky programmatically in Python and run it 100% free

Uses @skyfeed.app + @github.com actions to do periodic filtering and re-ranking and @cloudflare.social static pages to provide data to @bsky.app

01.12.2024 14:42 👍 136 🔁 25 💬 10 📌 2

Papers with Code - Stance Detection Stance detection is the extraction of a subject's reaction to a claim made by a primary actor. It is a core part of a set of approaches to fake news assessment. Example: * Source: "Apples are the mo...

Would this be stance detection? A controversial post would have a high entropy of stance distribution in replies/quotes aka the "1M posts" drama.

paperswithcode.com/task/stance-...

Mutes on a post might also be a good proxy for downvotes, but those are private and can't be accessed via the API.
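
The entropy idea above can be sketched as follows, assuming each reply or quote has already been assigned a stance label by some classifier. The labels and the helper name `stance_entropy` are hypothetical.

```python
import math
from collections import Counter


def stance_entropy(stances):
    """Shannon entropy (in bits) of the stance labels on a post's replies."""
    counts = Counter(stances)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    entropy = 0.0
    for count in counts.values():
        p = count / total
        entropy -= p * math.log2(p)
    return entropy


# Replies that all agree carry no disagreement signal.
consensus = stance_entropy(["agree", "agree", "agree", "agree"])
# An even split between two stances is the maximum for two labels.
split = stance_entropy(["agree", "agree", "disagree", "disagree"])
```

Under this sketch, a post would count as controversial when its entropy is close to the maximum for the number of stance labels in use.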

29.11.2024 03:38 👍 2 🔁 0 💬 0 📌 0

It's also why the feed loads super fast. When you open the feed, Bluesky simply makes a request to this static endpoint on cloudflare, fetches the JSON with the post ids, and loads that into its UI.

bluesky-1tj.pages.dev/xrpc/app.bsk...

28.11.2024 09:41 👍 1 🔁 0 💬 1 📌 0

Thanks; the trick is in how the bluesky protocol operates. It makes GET requests to 3 endpoints and expects JSON

So, instead of running a server 24/7, you can offload indexing to @skyfeed.app, periodically filter the feed via github actions and just dump that into cloudflare pages with correct paths
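
A rough sketch of the "dump that into cloudflare pages with correct paths" step, covering only the feed file. The path `xrpc/app.bsky.feed.getFeedSkeleton` and the `{"feed": [{"post": ...}]}` shape are my assumptions based on the AT Protocol feed generator lexicon, not something confirmed by this thread; the helper name and the example post URI are made up.

```python
import json
import tempfile
from pathlib import Path


def write_static_feed(out_dir, post_uris):
    """Write a feed skeleton as a static JSON file and return its path."""
    # Assumed path: the XRPC method name used as a file name under xrpc/.
    target = Path(out_dir) / "xrpc" / "app.bsky.feed.getFeedSkeleton"
    target.parent.mkdir(parents=True, exist_ok=True)
    # Assumed response shape: a "feed" list of {"post": <at-uri>} objects.
    payload = {"feed": [{"post": uri} for uri in post_uris]}
    target.write_text(json.dumps(payload), encoding="utf-8")
    return target


# Write into a throwaway directory and read the file back,
# standing in for the static site folder that gets deployed.
with tempfile.TemporaryDirectory() as tmp:
    path = write_static_feed(tmp, ["at://did:example:alice/app.bsky.feed.post/1"])
    served = json.loads(path.read_text(encoding="utf-8"))
```

Because the output is just files, a scheduled job can regenerate them and redeploy, with no process left running between updates.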

28.11.2024 09:37 👍 1 🔁 0 💬 1 📌 0

I fetch the feed created by skyfeed using the bluesky sdk and, for posts with arxiv links, use the pyarxiv library to fetch the category and keep only items in these categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.MA

Here is the relevant code

The filtering runs every 30m for free via github actions
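
A minimal sketch of the category filter described above, not the actual feed code. The category lookup is passed in as a plain function (`category_of`, hypothetical) so no arXiv client API is assumed here, and the demo mapping is made-up data.

```python
import re

ALLOWED_CATEGORIES = {"cs.AI", "cs.CL", "cs.CV", "cs.LG", "cs.MA"}
# Matches new-style arXiv identifiers in links such as arxiv.org/abs/2310.06816.
ARXIV_ID = re.compile(r"arxiv\.org/(?:abs|pdf)/(\d{4}\.\d{4,5})")


def keep_post(text, category_of):
    """Return True if the post links to an arXiv paper in an allowed category."""
    match = ARXIV_ID.search(text)
    if match is None:
        # Simplification: posts without an arXiv link are dropped here,
        # which may differ from how the real feed treats them.
        return False
    # category_of is a hypothetical lookup (arXiv ID -> primary category);
    # in practice it would be backed by an arXiv metadata query.
    return category_of(match.group(1)) in ALLOWED_CATEGORIES


# Made-up demo data standing in for real category lookups.
demo_categories = {"2310.06816": "cs.CL", "2401.00001": "math.CO"}
posts = [
    "Great read: arxiv.org/abs/2310.06816",
    "New combinatorics result arxiv.org/abs/2401.00001",
    "No paper link here",
]
kept = [p for p in posts if keep_post(p, demo_categories.get)]
```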

27.11.2024 08:45 👍 3 🔁 0 💬 1 📌 0

Hey @mariaa.bsky.social, I got it working. Here you go

bsky.app/profile/amit...

27.11.2024 08:30 👍 1 🔁 0 💬 1 📌 0

The most interesting part is the filtering and ranking; you can do a bunch of stuff. I went with Hacker News ranking as a start to balance recency vs popularity.

You could even train your own classifiers to make it more personalized; bluesky seems super hackable, love it!
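
For reference, a sketch of a Hacker News style score applied to posts. It is a simplified variant of the commonly cited formula (points divided by age raised to a "gravity" exponent); the gravity value of 1.8, the use of raw like counts, and the field names are assumptions, not the feed's actual settings.

```python
def hn_score(likes, age_hours, gravity=1.8):
    """Popularity decayed by age, in the style of Hacker News ranking."""
    # The +2 keeps very new posts from getting an unbounded score.
    return likes / (age_hours + 2) ** gravity


# Hypothetical posts: one old and popular, one new with fewer likes.
posts = [
    {"id": "old-popular", "likes": 50, "age_hours": 48},
    {"id": "new-modest", "likes": 10, "age_hours": 1},
]
ranked = sorted(
    posts,
    key=lambda p: hn_score(p["likes"], p["age_hours"]),
    reverse=True,
)
```

A higher gravity pushes older posts down faster, so it is the main knob for trading recency against popularity.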

27.11.2024 08:28 👍 2 🔁 0 💬 0 📌 0

Built a custom feed that shows latest arxiv+acl papers that belong to AI/ML/NLP/Computer vision categories. No bots/random papers belonging to other fields now.

bsky.app/profile/amit...

Generated in python but runs 100% free without a server; I'll do a write-up soon
github.com/amitness/blu...

27.11.2024 08:28 👍 12 🔁 3 💬 3 📌 1

Bookmark labeler/feed! Unlike the pin feed where you have to comment 📌 to save a post, this alternative is completely private!

Subscribe to both the labeler and feed here. Report a post to the labeler to add it to the feed! Report again to remove it!

bsky.app/profile/book...

17.11.2024 19:29 👍 4777 🔁 2484 💬 205 📌 516

I am also planning to hack around on an ML-powered feed this weekend. Goal is to see if it can be hosted for free (thinking cloudflare workers + kv cache free tier)

Also saw this earlier, could be useful to join
bsky.app/profile/dani...

26.11.2024 02:12 👍 4 🔁 0 💬 0 📌 0

Want to use bluesky replies as your blog’s comment section?

`npm install bluesky-comments`

Built by @coryzue.com

25.11.2024 22:32 👍 3195 🔁 568 💬 156 📌 71

If using OpenAI, you need to make sure the "reasoning" key is before the "answer" fields (i.e. ResponseFormatA if using pydantic)

@dylancastillo.co has done a nice analysis on this and the order matters

dylancastillo.co/posts/llm-py...
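
A small sketch of what "reasoning before answer" means in terms of field order. It uses standard-library dataclasses as a stand-in for pydantic models; `ResponseFormatB` is a hypothetical counterpart added for contrast, and `key_order` is a made-up helper.

```python
from dataclasses import dataclass, fields


@dataclass
class ResponseFormatA:
    reasoning: str  # declared first, so it comes first in the output keys
    answer: str


@dataclass
class ResponseFormatB:  # hypothetical counterpart with the order flipped
    answer: str
    reasoning: str


def key_order(cls):
    """Field names in declaration order."""
    return [f.name for f in fields(cls)]


order_a = key_order(ResponseFormatA)
order_b = key_order(ResponseFormatB)
```

The point is only that declaration order carries through to key order, so the class definition is where the "reasoning first" choice gets made.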

25.11.2024 16:55 👍 2 🔁 0 💬 0 📌 0
