Didn't need incremental loads for that project but would likely attack that with some dagster wizardry should the need arise
Hoping you don't stumble into a new gif-like debate
Problem is we're in the small window before all of AI's answers are guided by advertisers and the cycle repeats itself
Is using 67 AI agents better than 12 or 4?
My LinkedIn feed is full of these humble brags.
What are your thoughts about the inverse direction?
Can a developer use AI to become a CFO? You've done it the hard way, but are these new tools letting anyone with a thirst for knowledge do it a lot faster?
It's gonna be a fun ride for everyone who isn't afraid to learn new things
Not 4pm deploy to production and leave for the weekend?
I'm hiring four technical leaders for a new AI company in the multi-unit space.
The team
• Two cofounders, both exited CEOs
• Backgrounds in data and analytics and multi-unit operations
• Backed by top-tier NYC VCs
Open roles
• Head of Data Engineering: pipelines, modeling, and foundations
• Head of Applied ML Engineering: predictive models and interpretable signals
• Head of AI Agent Engineering: agent workflows
• Founding Product Engineer (Full-Stack): product surfaces and system integration
What we're working with
• LLMs
• AI agents
• Machine learning models
• Unstructured data (video, voice, text)
• Modern data engineering and cloud tooling
Location
• Gatineau / Ottawa, NYC or remote
DM me.
Open to trying it out!
Vanna is good for loading the schema plus docs into a vector DB for the RAG part; it's just the charts part that's weaker.
I want the graph part too, and that's where Vanna w/ plotly is failing. It's not rendering when I use booleans or timestamps, and it does scatter plots at inappropriate times.
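Roughly the dtype-aware fallback I'd want the chart recommender to use instead. This is a sketch with a hypothetical `pick_chart` helper, not Vanna's actual API:

```python
from datetime import datetime

def pick_chart(x_sample, y_sample):
    """Pick a chart type from sample values of the two result columns.

    Hypothetical helper -- just the kind of dtype-based heuristic that
    would avoid scatter plots on booleans and timestamps.
    """
    def kind(v):
        if isinstance(v, bool):        # check bool before int/float:
            return "bool"              # bool is a subclass of int in Python
        if isinstance(v, datetime):
            return "time"
        if isinstance(v, (int, float)):
            return "number"
        return "category"

    kx, ky = kind(x_sample), kind(y_sample)
    if kx == "time" and ky == "number":
        return "line"       # time series -> line, never scatter
    if kx in ("category", "bool") and ky == "number":
        return "bar"        # discrete x-axis -> bar chart
    if kx == "number" and ky == "number":
        return "scatter"    # two continuous axes is the only scatter case
    return "table"          # when in doubt, show a plain table
```

From there, mapping each label to the matching plotly express call (`px.line`, `px.bar`, ...) is straightforward.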
Anyone got open source alternatives to vanna.ai for text to SQL?
The SQL part is pretty good but the plotly charts it recommends are wonky.
I've also been playing with dbt - and considered sqlmesh instead.
Chose to go deeper with dbt as I feel like sqlmesh's real value shines when you want to avoid transforming the data both in dev and prod.
… and I'm just prototyping stuff locally to play with different open source BI frontends.
Last week I did some experimentation with unstructured.io to extract content out of some pdfs.
Also played with github.com/getomni-ai/z... as a more lightweight option.
Overall, vision models do a much better job than classic OCR (ex: tesseract) on tables in docs.
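Part of why classic OCR falls down on tables: it hands you word boxes and leaves the layout reconstruction to you. A minimal sketch of that reconstruction step, assuming `(text, x, y)` tuples roughly like tesseract's word-level TSV output (the exact shape is an assumption):

```python
def rows_from_boxes(words, y_tol=5):
    """Group OCR word boxes back into table rows.

    `words` is a list of (text, x, y) tuples -- hypothetical shape,
    adapt to whatever your OCR actually emits.
    """
    rows = {}
    # 1. bucket words into rows by y coordinate, within a tolerance
    for text, x, y in sorted(words, key=lambda w: (w[2], w[1])):
        for ry in rows:
            if abs(ry - y) <= y_tol:
                rows[ry].append((x, text))
                break
        else:
            rows[y] = [(x, text)]
    # 2. within each row, sort by x to recover the column order
    return [[t for _, t in sorted(cells)] for _, cells in sorted(rows.items())]
```

Even this breaks on merged cells and multi-line values, which is exactly where the vision models pull ahead.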
Here's a decent overview of the data pipeline orchestration tools on the market.
dataengineeringcentral.substack.com/p/review-of-...
Does this API exist in both the hosted and self-hosted versions?
When reading the docs last week I sometimes got mixed up about which features needed a subscription.
Nice! Hello!
Is the api giving you just the metadata of the metric, or also translating to the SQL you'd run to build charts like you do in lightdash itself?
Lightdash may be a good UI option for me, but then I'm defining metrics at the presentation layer and they're tightly coupled to it.
I'm looking for something to be able to express KPIs centrally and cleanly, and have the BI layer autogenerated from it.
The dbt semantic layer could be that, but it doesn't seem like many open source BI layers support it.
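The shape I'm after, as a toy sketch: metric definitions living in one place as data, with the SQL a BI frontend would run generated from them. The spec format here is made up (loosely semantic-layer-flavored, not the actual dbt/MetricFlow schema):

```python
# Hypothetical metric specs -- one central definition per KPI.
METRICS = {
    "revenue": {"table": "orders", "expr": "SUM(amount)", "time_col": "ordered_at"},
    "order_count": {"table": "orders", "expr": "COUNT(*)", "time_col": "ordered_at"},
}

def metric_sql(name, grain="day"):
    """Generate the query a BI layer would run for one metric at a grain."""
    m = METRICS[name]
    return (
        f"SELECT DATE_TRUNC('{grain}', {m['time_col']}) AS period, "
        f"{m['expr']} AS {name} "
        f"FROM {m['table']} GROUP BY 1 ORDER BY 1"
    )
```

The point being: the presentation layer only ever sees generated SQL, so swapping BI frontends doesn't mean redefining the KPIs.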
I spent some time this week playing with some tools to set up a data pipeline for simple BI and data science.
Played with airbyte, dbt, duckdb and metabase.
Planning on trying lightdash next week.
Trying to avoid hosted data warehouses and use open source for the full chain.
Keep it up!
I've never felt motivated by hyping whatever I'm building to peers who'd never be clients/users.
I sold my solution by ignoring the sales rules and just flat out asking "how do you do <process>?" and replying we had an app for that when they outlined their manual process.
I felt dirty not outlining benefits at first, but I later realized this opener was better aligned with my ICP (operations).
My past experiment with a tax form was bad because the LLM used basic OCR instead of something fancier for tables.
And here Iβm trying to do something more generic without knowing the form format ahead of time.
Imagine a government form with some weird tabular layout to shove as many fields into a condensed space as possible.
The generalized use case is read/write to forms. Basically reverse engineering a domain model from a form.
Azure DocIntel lets you do it well for a known form via training.
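A toy sketch of the "reverse engineer a domain model" step: take the raw key/value strings an extraction service hands back and infer field names and types. All the labels, values, and the input shape here are made up for illustration:

```python
import re

def infer_schema(extracted):
    """Infer a crude domain model from extracted form key/value strings.

    `extracted` maps raw labels to raw string values -- roughly the shape
    a document-extraction service returns (an assumption, not a real API).
    """
    schema = {}
    for label, value in extracted.items():
        # normalise the raw label into a field identifier
        field = re.sub(r"\W+", "_", label.strip().lower()).strip("_")
        v = value.strip()
        # guess a type from the value's shape
        if re.fullmatch(r"-?\d+", v):
            ftype = "int"
        elif re.fullmatch(r"-?\d+\.\d+", v):
            ftype = "float"
        elif re.fullmatch(r"\d{4}-\d{2}-\d{2}", v):
            ftype = "date"
        else:
            ftype = "str"
        schema[field] = ftype
    return schema
```

With a known form you'd train the types in (like DocIntel does); the hard part is getting anywhere near this without knowing the layout ahead of time.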
If making zip files named _final-website(2)-final-tuesday-3.zip is too complicated, maybe something like Dropbox with revision history baked in could work - at least for one file at a time.
That's ironic, but I guess totally expected since it's their main revenue source hah
Interesting compensation model.
#chordle sounded kinda sus