Migrating from a legacy system should be done incrementally, but you should get buy-in.
The worst thing you can do is buy all your tools without a plan.
Look at the guide we wrote for #sqlServer
tinyurl.com/bddvtku5
Hadoop, Spark and Iceberg are not alternatives. They are the same thing evolving.
#apachespark #apacheiceberg #opentableformat
Setting up #dbtcore in #apacheairflow? Stop. Another life is possible
www.youtube.com/watch?v=-XDu...
#dataengineering #analyticsengineering
You are welcome.
ELT / Data Pipeline architecture for Fabric + Databricks
medium.com/@hugolu87/el...
#fabric #msfabric #databricks #elt
4. Automatically adding tests
This bit is really good as it means you don't need to spend any time thinking about how to write custom #dbtmacros to see what works
Check out the video on YouTube
www.youtube.com/watch?v=s-Xx...
#dbt #databuildtool #dbtpoweruser
2. Auto generation of .yml files.
If like me you find writing yml really tedious, then you can automatically generate an entire schema using this extension
3. Automatically generating docs
If you also #hate writing documentation then you can rinse someone else's #OpenAI credits
(1/many) If you're not using dbt power user and you use #dbtcore you should be. WHY?
1. COLUMN-LEVEL LINEAGE (free)
You can visualise column-level lineage, which makes exposition and data re-#architecture in the platform easy, no problem
#msfabric data architecture for 2025 in the link below. Must read for anyone building data pipelines in the Azure stack
www.getorchestra.io/whitepaper/m...
Straight out of the playbook mate
Great work
At ThoughtSpot Embed - these guys are absolute pros. Driving revenue for businesses through data and embedded #analytics. This is not a fad
It is concerning that orchestration often *doesn't* come up when data teams speak to my friends that run smaller ELT companies.
Do you really think you can get away with roguing it out? It's only a matter of time before you get found out as an amateur. #dataorchestration #datang
The phrase data observability is meaningless and kinda hard to pronounce.
Whatever happened to just all round decent architecture?
medium.com/@hugolu87/le...
#dataquality #dataengineering
What aren't you sure about?
Many people that use #dbt don't realise you can prevent having any missing periods in your datasets if you would but bother to write this single test
www.youtube.com/watch?v=e09U...
#dbt #dataquality #orchestra #datajesus #cometojesus
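For anyone wondering what a "missing periods" check actually does, here is a minimal sketch in plain Python. It is not the dbt test from the video; the function name `missing_periods` and the sample dates are made up for illustration, and it assumes one expected row per day.

```python
from datetime import date, timedelta


def missing_periods(days, step=timedelta(days=1)):
    """Return every expected period between the first and last one that is absent."""
    present = set(days)
    if not present:
        return []
    current, last = min(present), max(present)
    gaps = []
    # Walk the full expected range and collect anything that never showed up.
    while current <= last:
        if current not in present:
            gaps.append(current)
        current += step
    return gaps


# Hypothetical daily load dates with 3 January missing.
loaded = [date(2025, 1, 1), date(2025, 1, 2), date(2025, 1, 4)]
gaps = missing_periods(loaded)
```

A non-empty `gaps` list is the signal to fail the pipeline run. In dbt the same idea would be written as a SQL test that returns the missing rows.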
Data Job Market is carnage now. Why?
- excess supply: "data engineering is so hot right now"
- excess supply: "people switching during ZIRP as it was easy and well paid"
- not enough demand: "Data team is a cost centre"
- massive investment in SaaS
What did I miss? #dataengineering #jobmarket
There is massive conflation of role titles, problems people have, and tooling. For example you would expect governance practitioners to solve governance problems with governance tools - alas
Welcome to the party
Interesting article. The table of how to evaluate storage needs can be helpful. I wonder how many data engineers could explain raw hardware
People gonna be watching this
Check out how #orchestra is changing the game for data teams. Our most powerful integration yet
medium.com/@hugolu87/ou...
#python #orchestra #dataengineering
#cometojesus
What's up it's me data jesus