Hugo Lu

@datajesus

CEO of Orchestra - Unified Control Plane for Data Pipelines

21 Followers
39 Following
23 Posts
Joined 25.11.2024
Latest posts by Hugo Lu @datajesus

Top Considerations when considering a migration from SQL Server to Cloud
How to avoid the most common enterprise architecture pitfalls

Migrating from a legacy system should be done incrementally, but you should get buy-in.

The worst thing you can do is buy all your tools without a plan.

Look at the guide we wrote for #sqlServer

tinyurl.com/bddvtku5

04.01.2025 07:33 👍 1 🔁 0 💬 0 📌 0

Hadoop, Spark and Iceberg are not alternatives. They are the same thing evolving.
#apachespark #apacheiceberg #opentableformat

31.12.2024 10:07 👍 1 🔁 0 💬 0 📌 0
Easiest way to run dbt Core! How to run dbt Core in Production with Orchestra #dbt #analytics
YouTube video by Orchestra

Setting up #dbtcore in #apacheairflow? Stop. Another life is possible

www.youtube.com/watch?v=-XDu...

#dataengineering #analyticsengineering

27.12.2024 10:29 👍 0 🔁 0 💬 0 📌 0

You are welcome.

27.12.2024 10:29 👍 0 🔁 0 💬 0 📌 0
ELT with Fabric, Azure and Databricks
Data Pipeline Patterns for 2025 and beyond

ELT / Data Pipeline architecture for Fabric + Databricks
medium.com/@hugolu87/el...

#fabric #msfabric #databricks #elt

23.12.2024 22:05 👍 1 🔁 0 💬 1 📌 0

4. Automatically adding tests

This bit is really good as it means you don't need to spend any time thinking about how to write custom #dbtmacros to see what works

Check out the video on YouTube

www.youtube.com/watch?v=s-Xx...


#dbt #databuildtool #dbtpoweruser

18.12.2024 09:32 👍 0 🔁 0 💬 0 📌 0
Introduction to dbt Power User. Using dbt-core and not using this, you're missing out. #dbt
A lot of the time, writing dbt / data build tool code can be really arduous, tedious and boring. Nobody wants to be defining custom macro after custom macro and writing terse .yml files just to get by. Fortunately the folks at Altimate.ai have you covered. In this tutorial Hugo Lu shows you how to install dbt power user and how to get started with dbt power user. Specifically, there are a few things we really like:
1. It is free (for now)
2. You can leverage the extension to automatically generate documentation
3. You can leverage it to autopopulate schema
4. You can leverage it to autopopulate tests
5. You can use the API Key method to explore column-level lineage
All for free. Quite frankly we aren't sure what Altimate's game is here, especially because the extension is free and clearly includes GPT credits under-the-hood. So leverage it while...

2. Auto generation of .yml files.

If, like me, you find writing yml really tedious, you can automatically generate an entire schema using this extension

3. Automatically generating docs

If you also #hate writing documentation then you can rinse someone else's #OpenAI credits

18.12.2024 09:30 👍 0 🔁 0 💬 0 📌 0

(1/many) If you're not using dbt power user and you use #dbtcore you should be. WHY?

1. COLUMN-LEVEL LINEAGE (free)

You can visualise column-level lineage in the platform for easy exposition and data re-architecture. Easy, no problem. #architecture

18.12.2024 09:29 👍 0 🔁 0 💬 0 📌 0
Microsoft Fabric Reference Architecture: MS Fabric in 2025 | Orchestra
ELT using Microsoft Fabric has never been simpler with this standard reference architecture.

#msfabric data architecture for 2025 in the link below. Must read for anyone building data pipelines in the Azure stack

www.getorchestra.io/whitepaper/m...

14.12.2024 10:57 👍 0 🔁 0 💬 0 📌 0

Straight out of the playbook mate

11.12.2024 13:42 👍 1 🔁 0 💬 0 📌 0

Great work

11.12.2024 13:40 👍 1 🔁 0 💬 0 📌 0
At ThoughtSpot Embed - these guys are absolute pros. Driving revenue for businesses through data and embedded #analytics. This is not a fad

11.12.2024 13:39 👍 0 🔁 0 💬 0 📌 0

It is concerning that orchestration often *doesn't* come up when data teams speak to my friends that run smaller ELT companies.

Do you really think you can get away with roguing it out? It's only a matter of time before you get found out as an amateur. #dataorchestration #datang

09.12.2024 13:53 👍 0 🔁 0 💬 0 📌 0
Let's never use the phrase Data Observability Ever Again
No-one even knows what it is, let alone pronounce it

The phrase data observability is meaningless and kinda hard to pronounce.

Whatever happened to just all round decent architecture?

medium.com/@hugolu87/le...

#dataquality #dataengineering

08.12.2024 20:47 👍 0 🔁 0 💬 0 📌 0

What aren't you sure about?

08.12.2024 13:14 👍 0 🔁 0 💬 1 📌 0
How to PREVENT incomplete data using dbt tests and Orchestra | dbt test tutorial #dataquality
YouTube video by Orchestra

Many people that use #dbt don't realise you can prevent having any missing periods in your datasets if you would but bother to write this single test
www.youtube.com/watch?v=e09U...

#dbt #dataquality #orchestra #datajesus #cometojesus

08.12.2024 13:14 👍 2 🔁 0 💬 0 📌 0
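
The post above does not say which test the video uses, so the following is only a rough sketch of the underlying idea: checking that a daily dataset has no missing periods. It is plain Python rather than dbt, and the function name `find_missing_days` and the sample dates are hypothetical, made up for illustration.

```python
from datetime import date, timedelta


def find_missing_days(observed, start, end):
    """Return every calendar day in [start, end] that is absent from `observed`."""
    seen = set(observed)
    total_days = (end - start).days + 1
    expected = (start + timedelta(days=offset) for offset in range(total_days))
    return [day for day in expected if day not in seen]


# Hypothetical load dates for a daily table; 3 January was never loaded.
loaded = [date(2024, 1, 1), date(2024, 1, 2), date(2024, 1, 4)]
gaps = find_missing_days(loaded, date(2024, 1, 1), date(2024, 1, 4))
print(gaps)
```

A completeness test would fail the pipeline run whenever `gaps` is non-empty, so incomplete data never reaches downstream models.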

Data Job Market is carnage now. Why?
- excess supply “data engineering is so hot right now”
- excess supply “people switching during ZIRP as it was easy and well paid”
- not enough demand “Data team is a cost centre”
- massive investment in SaaS
What did I miss? #dataengineering #jobmarket

27.11.2024 19:32 👍 0 🔁 0 💬 0 📌 0

There is massive conflation of role titles, problems people have, and tooling. For example you would expect governance practitioners to solve governance problems with governance tools - alas

27.11.2024 19:23 👍 1 🔁 0 💬 0 📌 0

Welcome to the party

27.11.2024 19:21 👍 1 🔁 0 💬 0 📌 0

Interesting article. The table of how to evaluate storage needs can be helpful. I wonder how many data engineers could explain raw hardware

27.11.2024 19:20 👍 2 🔁 0 💬 0 📌 0

People gonna be watching this

27.11.2024 19:13 👍 1 🔁 0 💬 0 📌 0
Our most powerful integration yet: native python support
Code based utility and python execution from within orchestra

Check out how #orchestra is changing the game for data teams. Our most powerful integration yet

medium.com/@hugolu87/ou...

#python #orchestra #dataengineering
#cometojesus

27.11.2024 19:10 👍 2 🔁 0 💬 1 📌 0

What's up it's me data jesus

26.11.2024 13:24 👍 1 🔁 0 💬 0 📌 0