Site Performance: Loading 30,000 Showings in a Browser
At least twice a day, the pipeline scrapes 250+ London cinemas and produces a dataset of 1,500+ films...
Getting 30k cinema showings into a browser fast 🍿
Highlights:
• Deleted compress-json → saved 333 KB (it was fighting gzip)
• Content-hashed chunks + Cloudflare → warm cache transfers 20 KB, not 5.5 MB
• Lighthouse warm cache: 92/100 (Next battle: Improve cold cache 74/100)
dev.to/alistairjcbr...
11.03.2026 10:06
👍 0
🔁 0
💬 0
📌 0
Three Things I Learned Using LLMs in a Data Pipeline
This is a submission for the Built with Google Gemini: Writing Challenge What I Built with...
Using Gemini in Clusterflick — not as a coding assistant, but to identify films from messy cinema listings.
What I learned:
1. A `reason` key made the model more honest
2. Gemini rewrites my prompts better than I do
3. Every response is untrusted input
dev.to/alistairjcbr...
04.03.2026 15:03
👍 0
🔁 0
💬 0
📌 0
Getting the Data Model Right: Movie -> Showings -> Performances
When I started building cinema aggregation tooling — pulling listings from multiple independent...
I Started building Clusterflick with a flat Movie → Performances schema. This works fine until you look at how venues actually programme screenings.
Movie → Showings → Performances
Each layer earns its place when dealing with 250+ venues.
Full write-up 🎬 dev.to/alistairjcbr...
25.02.2026 10:28
👍 0
🔁 0
💬 0
📌 0
Scaling From 3 Cinemas to 240+ Venues: What Broke and What Evolved
When I started scraping London cinema listings, I had three venues and a simple script. Fetch a page,...
Week 3 of the Clusterflick series 🍿
Scaling from 3 cinemas to 240+ venues: things break, patterns emerge (& I end up with 6 Raspberry Pis in my living room)
dev.to/alistairjcbr...
Also, the retrieve/transform split, why the BFI site is a nightmare, and the joy of common cinema platforms!
18.02.2026 12:40
👍 0
🔁 0
💬 0
📌 0
Calendar Feeds: Where It All Started
When I lived in Belfast, I had one problem: I wanted to know what was showing at the Strand Cinema...
🦹 My not so grand villain origin story: wanting to know what was on at my local cinema without dealing with their clunky website
One ICS file in 2023 became 240 venues worth of cinema data - my first post in the series, where Clusterflick all started 🍿
dev.to/alistairjcbr...
11.02.2026 20:45
👍 1
🔁 0
💬 0
📌 0
I got tired of checking cinema websites to see what's on in London, so I built an aggregator that does it for me 🍿
Clusterflick currently covers 240 venues, logging over 1200 movies, and mainly runs on Github for infrastrucure (+ the 6 Raspberry Pis in my living room 😁)
08.02.2026 00:51
👍 1
🔁 0
💬 0
📌 0