Week 7 #DataEngineering Zoomcamp 🏎️
Streamed 4.4M records via #Redpanda & #PySpark on my James-T-850. Speed is nothing without logic!
Results: 📍
📏 Dist: 9506
🏙️ Zone: 74
⏳ Session: 31m
💰 Peak Tip: 10-16 18:00
Progress: github.com/CodingJhames...
#Streaming #Python #BigData #DataTalksClub
Module 6 complete: Batch Processing with Spark by
@datatalks.bsky.social
Spark correctness depends on details like partitioning and timestamp handling, not just on writing transformations
#DataEngineering #Spark #PySpark #BatchProcessing
Data Engineering Week 5: Done! 🏁
Pivoted to #AWS from GCP. ☁️
Ran #PySpark on a t3.micro (1GB RAM) using a 4GB Swapfile. Processed NYC Taxi data smoothly without crashes. 🧠
Adaptability > Tools. 🦾
Code: github.com/CodingJhames...
#DataEngineering #Spark #AWS #OpenSource
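For reference, a swapfile like the one described can be set up in a few commands — a sketch with hypothetical size and path; adjust to your instance (and note swap is far slower than RAM, so this is a survival tactic, not a tuning one):

```shell
# Create a 4 GB swapfile so Spark survives memory spikes on a 1 GB t3.micro.
sudo fallocate -l 4G /swapfile
sudo chmod 600 /swapfile     # swap files must not be world-readable
sudo mkswap /swapfile        # format it as swap space
sudo swapon /swapfile        # enable it for the running system
```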
How to Write PySpark Code That Prevents Broadcast Joins from Blowing Up Executors: defensive patterns to control broadcast size, memory pressure, and executor stability in PySpark jobs. Continue read...
#data-science #big-data #machine-learning #python #pyspark
Last week, we nailed an epic internal meetup: "From Pandas to PySpark: Thinking Spark-native" 🐼✨
More than code tweaks: a full mindset shift toward scalable PySpark workflows, with pro tips for distributed speed-ups!
These sessions level up our data game. 🚀
#DataEngineering #PySpark
Hello data scientists,
Here is a published article: LinkedIn Data Scientist PySpark (Hard Level) Interview Problem, solved in detailed steps.
#DataScientists #PySpark #DataEngineers #Data #Medium #Articles #DataEngineering
medium.com/meanlifestud...
Hello Data Engineers,
Here is a published article on an Amazon Data Engineer PySpark (Medium Level) interview question, with a full solution.
#dataengineers #pyspark #dataengineering #dataanalytics #bigdata #medium #articles
medium.com/meanlifestud...
📌 3 examples of how PySpark UDF performance improves significantly with #Arrow
⚙️ Arrow integration in #Spark 3.5 removes the costly serialization round-trip between the JVM and #Python, optimizing UDFs
➡️ blog.damavis.com/como-optimiz...
#BigData #PySpark
⚙️ UDFs let you customize the operations performed on your data
🐍 They are defined in #Python and applied to the columns of a DataFrame
⚠️ Using them can cause a performance hit if they are not properly optimized
➡️ blog.damavis.com/como-optimiz...
#PySpark #Arrow
🚀 New Lab Replay: Using Delta Tables in Apache Spark (Microsoft Fabric)
🎥 Watch the full session:
👉 www.youtube.com/live/gT21FS8...
#MicrosoftFabric #DeltaTables #ApacheSpark #DeltaLake #DP600 #DP700 #Lakehouse #DataEngineering #BigData #ACID #TimeTravel #SparkSQL #PySpark #MicrosoftLearn
🔥 New Lab Replay: Analyze Data with Apache Spark in Microsoft Fabric
🎥 Watch the full lab session:
👉 www.youtube.com/live/lsv2Oi8...
#MicrosoftFabric #ApacheSpark #SparkAnalytics #DP600 #DP700 #Lakehouse #PySpark #DeltaTables #BigData #DataEngineering #Analytics #FabricCommunity
La Experimental #14 is now available
🌐 #web trends
💻 Managing #Git hooks
🧑🏻‍💻 #TUI design with #GoLang
🐍 #Python without the GIL
💾 #PySpark SQL guide
🤖 Local #AI agent
🐧 #Linux security guide
🌩️ #SelfHosted monitoring
💼 #Tech job-market report from #manfred
Link: open.substack.com/pub/laexperi...
Building a Modern Data Platform to Track Kenya's Food Prices: A Data Engineering Case Study. Food price volatility has always been a sensitive issue across Kenya. From urban households in Nairo...
#spark #pyspark #grafana #dataengineering
Why isn't #Rust replacing #scala and #pyspark as the main functional language in #spark? Is there an alternative to #spark that is built in #rust?
What is the default engine used in Fabric Notebooks?
The default is PySpark, the Python API running on top of the Apache Spark engine.
#MicrosoftFabric #FabricNotebooks #PySpark #ApacheSpark #BigData #DataEngineering #PowerBI #DataPlatform #OneLake #FabricCommunity #DP700 #SparkEngine #DataProcessing
What languages can be used in Fabric Notebooks?
Microsoft Fabric Notebooks support:
🔹 PySpark
🔹 Spark (Scala)
🔹 SparkSQL
🔹 SparkR (R)
🔹 HTML
#MicrosoftFabric #FabricNotebooks #PySpark #SparkSQL #SparkR #Scala #BigData #DataEngineering #DataScience #OneLake #FabricCommunity #DataPlatform #DP700
📣 Missed the community meetup from July 17th, with Jared Kuehn and Ronen Ariely?
🚀 Dive into #PySpark in #Microsoft #Fabric with Jared Kuehn - a powerhouse speaker and veteran data engineer - as he demystifies how to work with PySpark in #MicrosoftFabric.
youtu.be/Y4Uxnj0CAeA?...
What are Fabric Notebooks best suited for?
They’re ideal for:
🔹 Handling large external datasets
🔹 Performing complex data transformations
🔹 Running custom code in languages like PySpark, SQL, or Scala
#MicrosoftFabric #FabricNotebooks #PySpark #BigData #DataTransformation #DataEngineering #PowerBI
📈 Monitor your metrics with #Spark and #Prometheus
1️⃣ Prerequisites
2️⃣ #Pyspark
3️⃣ JMX Exporter: what it is and how to configure it
4️⃣ Running Spark
5️⃣ Configuring Prometheus
➡️ blog.damavis.com/integracion-...
#ApacheSpark #BigData #DataEngineering
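The wiring behind steps 3️⃣–5️⃣ can be sketched roughly like this — paths, port, and file names are all hypothetical placeholders, not the blog's actual config:

```shell
# Attach the JMX Prometheus exporter agent to the Spark driver JVM so its
# JMX metrics are exposed as an HTTP endpoint (here on port 9091).
spark-submit \
  --conf "spark.driver.extraJavaOptions=-javaagent:/opt/jmx_prometheus_javaagent.jar=9091:/opt/spark-jmx-config.yaml" \
  job.py

# Then point Prometheus at that endpoint (snippet for prometheus.yml):
#   scrape_configs:
#     - job_name: spark
#       static_configs:
#         - targets: ["localhost:9091"]
```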
Unlock the power of #PySpark in #Microsoft #Fabric with Jared Kuehn!
Learn #Spark management, #Python tips, and boost #performance in this live event 🚀
🗓 July 17, 12 PM EDT
🎤 Hosted by Ronen Ariely @pitoach.bsky.social
👉 www.meetup.com/cloud-data-d...
#MicrosoftFabric #DataEngineering
🚀 Starting a new series: #PySpark + #AI
What happens when distributed computing meets intelligent automation?
I'm documenting hands-on work integrating PySpark with ML & LLMs (LangChain, Azure, etc).
Let's bridge Big Data + Smart Logic.
#DataScience #MLOps #LLM #BigData
PySpark: Read CSV like a pro
df = spark.read.csv("data.csv", header=True, inferSchema=True)
df.show(3)
✅ Auto schema
✅ Header as columns
✅ Ready to transform
Small win, big impact.
#PySpark #DataEngineer #BigData #xavierdatatech
🚀 Working with #PySpark in the cloud — juggling multiple #DataFrames in parallel.
🔍 Combining filter(), select(), and join() efficiently is teaching me how to optimize both loading and exploration on large datasets.
#BigData #Databricks #DataEngineering #ApacheSpark
🚀 Unlocking Big Data Potential with PySpark!
Key Features:
🔹 Spark SQL
🔹 Spark MLlib
🔹 Spark Streaming
🔹 DataFrame API
#PySpark #BigData #DataScience #ApacheSpark #MachineLearning #DataEngineering #XavierDataTech
Top 8 Data Visualization Libraries
#Python
#PySpark #SQL #BigData #Databricks #BusinessIntelligence #DataEngineering #PowerBI #DataAnalytics #SparkSQL #XavierDataTech
🚀 Working with PySpark SQL? Here's a quick and powerful example!
You can query DataFrames using SQL syntax in Spark — great for teams coming from SQL backgrounds.
#PySpark #BigData #SparkSQL #DataEngineering #ETL #ApacheSpark #SQL #DataScience #XavierDataTech
Supported chart types: scatter, line, bar, area, pie, histogram, box, and KDE — optimized for Spark performance with smart sampling.
#PySpark #BigData #AI #DataVisualization #Spark40 #DataScience #MLOps #XavierDataTech #Databricks
databricks.com/blog/pyspark-n…
🚀 PySpark in the Cloud:
💾 DataFrames · Delta Lake · Databricks
📊 Power BI Export · Semantic Layer via LangChain
🔁 Real-world pipelines, hands-on.
🔗 linkedin.com/in/xavier-mareca
#PySpark #BigData #DataEngineering #PowerBI #LangChain #Azure #AI