Fabric Spark's Native Execution Engine: What Speeds Up, What Falls Back, and What to Watch
Fabric Spark's Native Execution Engine: What Speeds Up, What Falls Back, and What to Watch
Open Mirroring + OneLake: Spark patterns that keep latency from eating your weekends
Open Mirroring is deceptively simple to set up and tricky to run well with Spark at scale. Here are the architecture choices, anti-patterns, and validation checks that keep your pipelines from falling apart in…
What “Execute Power Query Programmatically” Means for Fabric Spark Teams
What “Execute Power Query Programmatically” Means for Fabric Spark Teams Somewhere in a Fabric workspace right now, two teams are maintaining the same transformation twice. The BI team owns it in Power Query. The Spark team…
What the February 2026 Fabric Influencers Spotlight means for your Spark team
What the February 2026 Fabric Influencers Spotlight means for your Spark team Microsoft published its February 2026 Fabric Influencers Spotlight last week. Twelve community posts. MVPs and Super Users. Most people skim…
Fabric Spark failure playbook: OneLake and mirroring under real production pressure
A field-tested runbook for the failures that hide between Spark, OneLake, and mirroring in Microsoft Fabric: detection signals, triage sequences, and remediation tradeoffs from real production incidents.
The most boring technology announcement might be the most important one for your Fabric Spark team
Microsoft's new ODBC Driver for Fabric Data Engineering looks like a checkbox feature. It isn't. Here's what it means for production Spark teams, the migration risks nobody's talking about, and a…
fabric-cicd Is Now Officially Supported — Here's Your Production Deployment Checklist
The Spark-to-Warehouse Connector in Fabric: What It Does, How It Breaks, and When to Use It
Fabric Spark billing just got clearer. Here's how to make the most of it.
From Demo to Production: ML-Enriched Power BI in Microsoft Fabric
Microsoft's new end-to-end pattern for enriching Power BI reports with ML in Fabric looks clean in the demo. Here's the production migration checklist for Spark teams crossing the gap from notebook to ops.
Microsoft Fabric Warehouse + Spark: Interoperability Patterns That Actually Work
What SQL database in Fabric actually means for your Spark pipelines
There is a particular kind of excitement that sweeps through data engineering teams when Microsoft announces a new database option. It is the same mixture of curiosity and low-grade dread you might feel upon learning that your…
Microsoft Fabric Table Maintenance Optimization: A Cross-Workload Survival Guide
Your Delta tables are drowning. Thousands of tiny Parquet files pile up after every streaming microbatch. Power BI dashboards stall on cold-cache queries. SQL analytics endpoints grind through fragmented row groups.…
Optimizing Spark Performance with the Native Execution Engine (NEE) in Microsoft Fabric
The Best Thing That Ever Happened to Your Spark Pipeline Is a SQL Database
Monitoring Spark Jobs in Real Time in Microsoft Fabric
If Spark performance work is surgery, monitoring is your live telemetry. Microsoft Fabric gives you multiple monitoring entry points for Spark workloads: Monitor hub for cross-item visibility, item Recent runs for focused context, and…
Running OpenClaw in Production: Reliability, Alerts, and Runbooks That Actually Work
Lakehouse Table Optimization: VACUUM, OPTIMIZE, and Z-ORDER
If your Lakehouse tables are getting slower (or more expensive) over time, it’s often not "Spark is slow." It’s usually table layout drift: too many small files, suboptimal clustering, and old files piling up. In Fabric Lakehouse, the…
OneLake catalog in Microsoft Fabric: Explore, Govern, and Secure
If your Fabric tenant has grown past "a handful of workspaces," the problem isn’t just storage or compute—it’s finding the right items, understanding what they are, and making governance actionable. That’s the motivation behind the…
Understanding Spark Execution in Microsoft Fabric
Spark performance work is mostly execution work: understanding where the DAG splits into stages, where shuffles happen, and why a handful of tasks can dominate runtime. This post is a quick, practical refresher on the Spark execution model — with…
Fabric Spark Shuffle Tuning: AQE + partitions for Faster Joins
Shuffles are where Spark jobs go to get expensive: a wide join or aggregation forces data to move across the network, materialize shuffle files, and often spill when memory pressure spikes. In Microsoft Fabric Spark workloads, the…
OneLake Shortcuts + Spark: Practical Patterns for a Single Virtual Lakehouse
If you’ve adopted Microsoft Fabric, there’s a good chance you’re trying to reduce the number of ‘copies’ of data that exist just so different teams and engines can access it. OneLake shortcuts are one of the core…
When ‘Native Execution Engine’ Doesn’t Stick: Debugging Fabric Environment Deployments with fabric-cicd
If you’re treating Microsoft Fabric workspaces as source-controlled assets, you’ve probably started leaning on code-first deployment tooling (either Fabric’s built-in Git integration or…
New OSS drop: Sparkwise (PyPI: sparkwise). Built by Santhosh Kumar Ravindran to help teams improve Fabric Spark price/perf with automated diagnostics + profiling. If you run Spark in Fabric, this will save you time and vCores.
Gil Gerard, Buck Rogers, and the Kind of Grief That Shows Up in December
Build Your Own Spark Job Doctor in Microsoft Fabric
Microsoft Fabric simplifies Spark workload management but diagnosing performance issues remains challenging. This post introduces the "Job Doctor," an AI tool that analyzes Spark telemetry to identify problems like skew or excessive shuffles,…
Time to Automate: Why Sports Card Grading Needs an AI Revolution
As I head to the National for the first time, this is a topic I have been thinking about for quite some time, and a recent video inspired me to put this together with help from ChatGPT’s o3 model doing deep research. Enjoy!…
Humans + Machines: From Co-Pilots to Convergence — A Friendly Response to Josh Caplan’s “Interview with AI”
1. Setting the Table Josh, I loved how you framed your conversation with ChatGPT-4o around three crisp horizons — 5, 25 and 100 years. It’s a structure that forces us to check our near-term…
The Rise and Heartbreak of Antonio McDyess: A Superstar’s Path Cut Short