Evgeny Ivanov (@e10v.me)

I don’t need to keep as much context in my head’s “cache” as I did when I implemented features manually, and that makes context switching less costly.

26.02.2026 17:55 👍 0 🔁 0 💬 0 📌 0

And no, it hasn’t hurt code quality. Quite the opposite. Codex is good at implementing features that work as intended. I’m still picky about design details; I just handle those in review.

26.02.2026 17:55 👍 0 🔁 0 💬 1 📌 0

I work on tea-tasting (github.com/e10v/tea-tas...) in my spare time. I used to save bigger changes for holidays, when I had long uninterrupted coding time. Now I can do it in the evening or even while I’m eating 🙈

26.02.2026 17:54 👍 0 🔁 0 💬 1 📌 0

Codex doesn’t just save time when writing code. The bigger thing is that it lowers the cost per context switch.

26.02.2026 17:54 👍 0 🔁 0 💬 1 📌 0

With solid tests, linting, type checking, and an AGENTS md, one prompt is often enough to build a feature, update docstrings, and add tests.

06.02.2026 21:25 👍 0 🔁 0 💬 0 📌 0

Codex 5.3 built the last four tea-tasting (github.com/e10v/tea-tas...) features—and it’s been great! I shipped two package versions in one evening. I reviewed the code thoroughly—my comments were minor.

06.02.2026 21:25 👍 0 🔁 0 💬 1 📌 0

Cloudflare outage on November 18, 2025 Cloudflare suffered a service outage on November 18, 2025. The outage was triggered by a bug in generation logic for a Bot Management feature file causing many Cloudflare services to be affected.

blog.cloudflare.com/18-november-...

19.11.2025 07:47 👍 1 🔁 0 💬 0 📌 0

ClickHouse permission tweak + a missing SQL filter → ML feature pipeline down → the internet breaks 🤔

Another case supporting Taleb’s thesis that the world has become more fragile

Other examples that come to mind: the Amazon outage last month and the CrowdStrike-related outage last year

19.11.2025 07:39 👍 0 🔁 0 💬 1 📌 0

P.S. Paying to apply may sound provocative and require thoughtful consideration and careful testing. But here, I focus on why price signals may address this problem better than AI-based screening.

24.10.2025 09:21 👍 0 🔁 0 💬 0 📌 0

Prices as model weights: learning from dispersed data | Evgeny Ivanov Market design to reduce labor market congestion.

Read more: e10v.me/prices-as-mo...

24.10.2025 06:34 👍 0 🔁 0 💬 1 📌 0

Then I asked myself what this solution has that AI doesn't. That’s how I arrived at the analogy that prices act as model weights: they encode market information. An important difference: prices incorporate signals from dispersed, hard-to-observe data that an AI/ML model may not access.

24.10.2025 06:34 👍 0 🔁 0 💬 1 📌 0

Prices as model weights: learning from dispersed data | Evgeny Ivanov Market design to reduce labor market congestion.

I was thinking about the labor market congestion problem and came up with a solution that is often used in service marketplaces: pay to apply... 🧵

24.10.2025 06:33 👍 0 🔁 0 💬 1 📌 0

Another perspective on marshmallow test: market risk and diminishing marginal utility

10.10.2025 07:31 👍 2 🔁 0 💬 0 📌 0

In NotebookLM, the output language is global. A per-notebook setting would be much more convenient

31.08.2025 07:02 👍 0 🔁 0 💬 0 📌 0

As a non-native English speaker, I don’t want YouTube auto-translating titles, chapters, and descriptions; I want the originals

31.08.2025 07:02 👍 0 🔁 0 💬 1 📌 0

It’s astonishing that Google employs so many multilingual people yet designs products as if users speak only one language

31.08.2025 07:01 👍 2 🔁 0 💬 1 📌 0

Some mentions of the quote "all models are wrong, but some are useful" should be followed by "the map is not the territory"

17.07.2025 16:03 👍 0 🔁 0 💬 0 📌 0

We wanted flying cars, instead we got scooter riders honking at our backs and knocking us off our feet

02.07.2025 07:49 👍 0 🔁 0 💬 0 📌 0

Pre-LLM era take-home assignments: focus on evaluating *answers*; LLM era: focus on evaluating *questions*.

"Here is the problem-solving case: […]. Use an LLM to solve it. Provide a summary of your conversation with the LLM, including the questions you asked and the final solution you obtained."

26.05.2025 10:03 👍 1 🔁 0 💬 0 📌 0

Every relevant cost can ultimately be framed as an opportunity cost. So, in essence, opportunity cost is the only cost that matters.

13.05.2025 13:01 👍 0 🔁 0 💬 0 📌 0

In statistics, answering a wrong question is sometimes called a Type III error. I've already mentioned it in a blog post: e10v.me/ranking-two-...

12.05.2025 12:37 👍 0 🔁 0 💬 0 📌 0

From my personal experience, people often skip steps 1 and 3, which can lead to bad decisions or mediocre solutions.

You probably read this in a statistical or data-analysis context. But I believe the framework applies more broadly.

12.05.2025 12:36 👍 0 🔁 0 💬 1 📌 0

...
4. What data do we need to apply the model?
5. What conclusions can we draw after applying the model?

12.05.2025 12:36 👍 0 🔁 0 💬 1 📌 0

A ladder of questions in analysis:

1. Which question should we ask to address a decision or a problem?
2. Which model should we choose to answer the question?
3. Which assumptions must hold for the model to be valid, and do they hold?
... 🧵

#DecisionIntelligence #ModelThinking #Logic

12.05.2025 12:35 👍 0 🔁 0 💬 1 📌 0

Demo of Narwhals dataframe-agnostic function which supports PySpark

Plot of PySpark dataframe after converting it to PyArrow

SQL generation from Polars syntax

✨ Narwhals now supports PySpark

🎇 If you have a dataframe-agnostic function, you can pass in `pyspark.sql.DataFrame`

📈 Here's a demo, made with @marimo.io

🎁 BONUS feature: combine with SQLFrame, to generate SQL from @pola.rs syntax 🪄

09.05.2025 13:41 👍 1 🔁 1 💬 0 📌 0

Release tea-tasting 1.0.0 · e10v/tea-tasting Breaking changes feat!: make repr() equivalent to str() for all result classes by @e10v in #153 Highlights feat: add simulated experiments (incl. a/a tests) by @e10v in #147 docs: add examples/g...

🔗🔗

- What’s new in tea-tasting: github.com/e10v/tea-tas...
- Guide on simulated experiments and A/A tests: tea-tasting.e10v.me/simulated-ex...
- Examples/guides as marimo notebooks: github.com/e10v/tea-tas...

05.05.2025 13:43 👍 1 🔁 0 💬 0 📌 0

A/A tests are useful for identifying potential issues before conducting the actual A/B test. Treatment simulations are great for power analysis—especially when you need a specific uplift distribution or when an analytical formula doesn’t exist.

05.05.2025 13:43 👍 0 🔁 0 💬 1 📌 0

@marimo.io is not only a great tool for reproducible and interactive research—it's also perfect for interactive documentation where users can play with examples. You can run them as WASM notebooks entirely in your browser—no local setup needed. I personally love marimo's attention to detail.

05.05.2025 13:43 👍 0 🔁 0 💬 1 📌 0

The version 1.0 of tea-tasting, a Python package for the statistical analysis of A/B tests, is now available. Notable improvements:

- Interactive user guides built with @marimo_io notebooks.
- Simulated experiments, including A/A tests.

#abtesting #statistics #datascience

05.05.2025 13:42 👍 0 🔁 0 💬 1 📌 0

Wow, ChatGPT is really smart now 😜

But better check for yourself: github.com/e10v/tea-tas...

#abtesting #statistics #chatgpt #ai

31.03.2025 09:14 👍 0 🔁 0 💬 0 📌 0

Evgeny Ivanov

Latest posts by Evgeny Ivanov @e10v.me