Michael Cizmar (@michaelcizmar)

A Concrete Evaluation Framework for LLM-Powered Pipelines | Michael Cizmar How to stay confident in your model choices when the landscape changes every 90 days

Most #AI teams don't have an evaluation strategy. They have an evaluation event.

Tested before launch. Shipped. Never looked back.
Meanwhile 3 new models dropped this month, 2 cheaper, 1 probably better for the task.

Here's a 5-step framework 2 fix that: michaelcizmar.com/blog/2026/03...

#MLOps

06.03.2026 16:00 👍 0 🔁 0 💬 0 📌 0

The Epstein Files Are a Risk Management Case Study Nobody Asked For | Michael Cizmar The Epstein Files exposed a KYC blind spot that no sanctions list can fix — here's what modern market intelligence infrastructure needs to do differently.

The #EpsteinFiles aren't just a political story.

They're a #KYC audit nobody asked for - and most PE and hedge fund risk frameworks failed it.

#AI solutions do not scale easily and then the problem becomes the data...
michaelcizmar.com/blog/2026/02...

28.02.2026 00:20 👍 0 🔁 0 💬 0 📌 0

LinkedIn Login, Sign in | LinkedIn Login to LinkedIn to keep in touch with people you know, share ideas, and build your career.

I'm already tired of hearing about the #hussleculture. It seems to project laziness and apathy. While running an award winning small business for 20 years, I tried to instill 1 thought:

"If you are not exceptional, you are obsolete".

#leadership #oneteam #culture

www.linkedin.com/analytics/po...

04.02.2026 16:13 👍 1 🔁 0 💬 0 📌 0

GitHub - mcplusa/opensearch-docker-compose: A repository to develop and refine an OpenSearch Docker Compose A repository to develop and refine an OpenSearch Docker Compose - mcplusa/opensearch-docker-compose

Who doesn't like a multi-node OpenSearch docker compose? github.com/mcplusa/open...

30.12.2025 02:06 👍 0 🔁 0 💬 0 📌 0

Purrview: The Tiny AI Project That Worked—And Why Most Don’t Most AI projects fail. Not because of missing technology.

AI projects fail bcause they’r chartered poorly. I built #Purrview 2 prove the opposite...a tiny tool that detects when a cat enters frame & records it.
1) No POC.
2) No roadmap.
3) No “AI transformation.”
4) Use what works
& it works.#AI isn’t failing.AI projects are.
linkedin.com/pulse/purrvi...

07.12.2025 22:07 👍 0 🔁 0 💬 0 📌 0

Microsoft Virtual Events Powered by Teams Microsoft Virtual Events Powered by Teams

“Hey Michael, this event page is pure AI slop.”

Fair enough, but dont judge a book by its cover.

U, on the other hand, still have 1 hour to join 2 of the most experienced AI practitioners as they share notes from the field bfore it turns in2 actual slop:
events.teams.microsoft.com/event/cc5517...

12.11.2025 17:04 👍 0 🔁 0 💬 0 📌 0

Auto-Generating Related Articles with QDrant and Local Embeddings We are working on a new website at MC+A, and we’re always looking for ways to incorporate our tradecraft externally and we thought of a way…

Simple is generally better than complex, I just published Auto-Generating Related Articles with @qdrant.bsky.social and Local Embeddings medium.com/p/auto-gener... #KNN

09.10.2025 15:43 👍 0 🔁 0 💬 0 📌 0

Hello @microsoft.com #ai #tour to #chicago

25.09.2025 16:31 👍 0 🔁 0 💬 0 📌 0

Why PostgreSQL Search Isn’t Enough: A Case for Purpose-Built Retrieval Systems Postgres is a great database — but it’s not a search engine.Instacart’s recent move to consolidate search on Postgres highlights the risks: poor autocomplete, weak ranking, and frustrated users.If rel...

Why #PostgreSQL Search Isn’t Enough: A Case for Purpose-Built Retrieval Systems #VectorSearch #LTR #Elasticsearch

@mcplusa mcplusa.com/why-postgres... #Insights

11.09.2025 02:57 👍 0 🔁 0 💬 0 📌 0

Checkout this image I made with #chatgpt5. The prompt was ‘generate me an image of me getting after it’

14.08.2025 10:40 👍 0 🔁 0 💬 0 📌 0

At least it was strangers doing it to you versus your loved ones.

21.07.2025 19:43 👍 0 🔁 0 💬 0 📌 0

#Skype is a dead product except for when you....

21.07.2025 19:42 👍 0 🔁 0 💬 0 📌 0

It’s a plant! Don’t be fooled.

21.07.2025 14:16 👍 1 🔁 0 💬 1 📌 0

Why Federated Search is Still Relevant Today Glean’s recent blog post claims that federated search is on its way out. However, federated search is more relevant today than ever, and protocols like MCP are enhancing its effectiveness

#MCP does not kill the univeral index, it really just extends it and like all things...it depends on the use case if universal versus #federatedsearch makes sense.

mcplusa.com/why-federate...

01.07.2025 14:10 👍 0 🔁 0 💬 0 📌 0

Drive Requirements to Testing with BDD to Deliver AI Successful AI Projects are Focused on Outcomes, Not Simply Outputs

Projects get stuck in the POC Production often because we do not describe the behavior or the work steams we aim to proof. "Is this an image of a cat?" - Asked no one. "Is this claim something we should further review" - Priceless.
#LLM #BDD #TDD #Agents

michaelcizmar.com/drive-requir...

10.12.2024 16:40 👍 0 🔁 0 💬 0 📌 0

Judging LLM Performance By Synthetic Data Is A Failing Approach — Part 1 Knock-offs are never as good as the real thing

Having the Title be : "Plastic Foodservice Film" and your LLM creating the synthentic queries is not the best method to judge your LLM's performance at finding Seran Rap or saran wrap.
#LLM #AI #Relevancy
michaelcizmar.com/judging-llm-...

05.12.2024 15:51 👍 1 🔁 0 💬 0 📌 0

Michael Cizmar

Latest posts by Michael Cizmar @michaelcizmar