Edgar

@theotheredgar

#rstats #visualizations #datascience

1,721 Followers
90 Following
9 Posts
Joined 18.02.2024

Latest posts by Edgar @theotheredgar

Screenshot of both sides of the printable version of the cheatsheet

Screenshot of the web version of the recipes cheatsheet

#tidymodels now has its very first cheatsheet! "Preprocessing data with {recipes}" is now available in Web and PDF versions here: rstudio.github.io/cheatsheets/... #rstats #posit #rstudio

02.03.2026 17:23 👍 48 🔁 14 💬 0 📌 1

Here I am! 😆

26.01.2026 18:58 👍 0 🔁 0 💬 0 📌 0
Data Science Lab

Have you checked out the {mall} package? mlverse.github.io/mall/ I think it may do what you are looking for!

@theotheredgar.bsky.social will be featuring {mall} at a Data Science Lab on Jan 27 if you'd like to see it in action! pos.it/dslab

16.01.2026 19:07 👍 4 🔁 2 💬 1 📌 0
Posit AI Blog: mall 0.2.0 The mall 0.2.0 update for R and Python introduces support for external LLM providers like OpenAI and Gemini. This version also features parallel processing for R users, the ability to run NLP on str...

New 📦 release alert! {mall} 0.2.0 is out now for #rstats and #pydata. Now you can use external #llm like OpenAI & Gemini, and a brand new cheatsheet! New blog post here: blogs.rstudio.com/ai/posts/202...

20.08.2025 16:23 👍 13 🔁 4 💬 0 📌 0
A graphic illustrating data science and big data technologies. On the left, stacked vertically, are the logos for R and Python. In the center, also stacked vertically, are the Orbital logo (featuring a satellite) and the Scala logo (a blue serpent). On the right, stacked vertically, are the Databricks logo and a generic database cylinder icon. The background is a light blue with a subtle, dark blue dot pattern at the bottom.

Announcing streamlined MLOps with Orbital on Databricks 🛰️🧱

Orbital translates #ScikitLearn #Python or #tidymodels #RStats to native #SQL for direct database model execution.

@theotheredgar.bsky.social's post uses Databricks as an integrated environment.

Learn more: posit.co/blog/databri...

21.07.2025 13:26 👍 10 🔁 1 💬 0 📌 0
mall

Hi! That is currently in the dev version of the package; we're going to use ellmer/chatlas as the way to get an external integration with LLMs: mlverse.github.io/mall/

10.07.2025 02:36 👍 0 🔁 0 💬 0 📌 0
APRIL 30th: Easier data and asset sharing across projects and teams with {pins} and Databricks YouTube video by Posit PBC

the pins 📌 package gets a lot of love in the chat at community events - and we're excited to share a workflow today!

this one was asked about specifically at a Data Science Hangout!

....we're talking about pins + Databricks with @theotheredgar.bsky.social at 11am ET!

youtu.be/ab4CIlafsbo?...

30.04.2025 14:03 👍 2 🔁 1 💬 0 📌 0

Shoutout to @ivelasq3.bsky.social and @posit.co for the opportunity to write a blog post about how I'm using `library(mall)` and integrating large language models into our energy security research! #textdata #LLM #energy #energysecurity #socialscience #datascience #NLProc

25.03.2025 14:09 👍 17 🔁 3 💬 2 📌 0
Illustration demonstrating how an AI-powered system gathers data from various documents and compiles it into a single, organized format, using the mall hex to represent the AI processing.

Discover how the mall package simplifies LLM integration in R!

In @camlivio.bsky.social's guest post, she walks through how she uses mall to summarize dense PDF reports, extract key entities, and visualize the frequency of relevant terms, all with #RStats.

Read it here: posit.co/blog/mall-ai...

25.03.2025 13:58 👍 35 🔁 5 💬 1 📌 2
the odbc hex logo

We're excited to announce a new release of odbc!

This release includes a new hex logo (thanks, @theotheredgar.bsky.social!), viewer-based credentials on Posit Connect for `databricks()` and `snowflake()`, and more.

Read more in the release notes: odbc.r-dbi.org/news/index.h...

#RStats

10.03.2025 14:04 👍 28 🔁 4 💬 2 📌 1
A hexagonal sticker with an all black background and "odbc" written in layered shades of blue.

odbc 1.6.0 is now on #rstats CRAN! Includes a new helper for Redshift, a hex sticker (finally!), and many QOL improvements for Databricks, Snowflake, MSSQL, etc.

Read more: odbc.r-dbi.org/news/index.h...

04.03.2025 15:24 👍 39 🔁 12 💬 0 📌 0

The 2nd wave of uv is here: developers of other systems building on top of uv to make new and effortless workflows that use Python virtual environments behind the scenes 🤩

03.03.2025 15:08 👍 23 🔁 7 💬 1 📌 1
GitHub - mlverse/lang: Uses LLMs to translate R help docs on the fly. Contribute to mlverse/lang development by creating an account on GitHub.

(1/3) Every week, I review an open-source project in my newsletter. This week, the focus is on the lang project.

The lang library enables the translation of any function's documentation to a different language by using LLM on the fly.

github.com/mlverse/lang

#RStats #LLM #AI

14.02.2025 17:03 👍 12 🔁 1 💬 1 📌 0
mlverse package with an RStudio help window in Spanish

Introducing the {lang} package by @theotheredgar.bsky.social for translating R help using your local LLM! Read #RStats help in your own language!

lang helps you translate your documentation and include it as part of your package.

Check it out here! github.com/mlverse/lang

06.02.2025 21:10 👍 29 🔁 7 💬 0 📌 1
Pins in Databricks - Posit The pins R package now has support for the `board_databricks()` function, which allows you to access and store pins in Databricks Volumes from your R script.

Pin your data and model objects to Databricks Volumes in #RStats!

With pins, you can store an object on a board, like Dropbox, Posit Connect, or Amazon S3.

We have merged support for pinning objects to Databricks Volumes with the `board_databricks()` function!

Read more: posit.co/blog/pins-in...

18.12.2024 15:15 👍 12 🔁 1 💬 0 📌 0
Pins in Databricks - Posit The pins R package now has support for the `board_databricks()` function, which allows you to access and store pins in Databricks Volumes from your R script.

The recent work that @theotheredgar.bsky.social did in the pins #rstats package to support Databricks Volumes is QUITE NICE for folks who need more flexibility in how they store objects/files there! 📌

Read a bit about it here:
posit.co/blog/pins-in...

16.12.2024 23:02 👍 28 🔁 7 💬 0 📌 0
GitHub - mlverse/lang: Uses LLMs to translate R help docs on the fly. Contribute to mlverse/lang development by creating an account on GitHub.

📦 Read R help in your own language! It's a pleasure to introduce {lang}, which translates help 'live' using Ollama and shows it in the same 'help' window of your development environment github.com/mlverse/lang #rstats #ollama #llm

11.12.2024 17:47 👍 16 🔁 6 💬 0 📌 0
GitHub - mlverse/lang: Uses LLMs to translate R help docs on the fly. Contribute to mlverse/lang development by creating an account on GitHub.

Dev 📦 alert! {lang} translates R help on-the-fly using your local LLM! It also overrides the `?` so you can easily access the translated docs and have them displayed on your IDE's help pane github.com/mlverse/lang #rstats #llm #ollama

11.12.2024 17:38 👍 18 🔁 6 💬 0 📌 1

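Hi, as long as 'mall' is imported, the new Polars DF should automatically have the `llm` namespace

import polars as pl
import mall  # importing mall is what adds the `llm` namespace to Polars data frames

# Write a one-row CSV to test with
with open("test.csv", "w") as f:
    f.write('text\n"I am happy"\n')

df = pl.read_csv("test.csv")
df.llm.sentiment("text")  # needs an LLM backend (such as Ollama) available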

26.11.2024 00:18 👍 0 🔁 0 💬 1 📌 0
Posit AI Blog: Introducing mall for R...and Python We are proud to introduce the {mall}. With {mall}, you can use a local LLM to run NLP operations across a data frame. (sentiment, summarization, translation, etc). {mall} has been simultaneously rele...

Introducing the mall package for running multiple LLM predictions against a data frame in #RStats or #Python!

mall is inspired by the SQL AI functions offered by vendors such as Databricks and Snowflake.

Learn more in this blog post by @theotheredgar.bsky.social: blogs.rstudio.com/ai/posts/202...

21.11.2024 15:06 👍 51 🔁 8 💬 0 📌 1
A screenshot of two examples, one from R and the other from Python. It shows how you can set the 1 and 0 to represent positive and negative.

📦 In today's cool things {mall} can do: you can set the values returned per sentiment, which saves you an extra step and makes the code more concise #rstats #pydata #polars #ollama

15.11.2024 14:39 👍 3 🔁 1 💬 0 📌 0
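
To make "setting the values returned per sentiment" concrete, here is a minimal plain-Python sketch of the idea. It does not use {mall} or an LLM: `classify` is a hypothetical keyword matcher standing in for the model, and `sentiment` and its `options` argument are illustrative names chosen for this sketch, not the package's confirmed API.

```python
# Toy stand-in for an LLM sentiment call: a hypothetical keyword matcher,
# used only so the example runs without a model. This is NOT mall's code.
def classify(text: str) -> str:
    lowered = text.lower()
    if any(word in lowered for word in ("happy", "great", "love")):
        return "positive"
    if any(word in lowered for word in ("sad", "bad", "hate")):
        return "negative"
    return "neutral"


def sentiment(texts, options):
    """Label each text; if `options` is a dict, return its mapped values.

    `options` is either a list of allowed labels or a dict of label -> value.
    Labels that are not in `options` come back as None.
    """
    results = []
    for text in texts:
        label = classify(text)
        if isinstance(options, dict):
            results.append(options.get(label))
        else:
            results.append(label if label in options else None)
    return results


reviews = ["I am happy", "This is bad"]

# Two steps: get the labels, then recode them yourself.
labels = sentiment(reviews, ["positive", "negative"])
recoded = [{"positive": 1, "negative": 0}[label] for label in labels]

# One step: pass the mapping and get the values back directly.
direct = sentiment(reviews, {"positive": 1, "negative": 0})
print(recoded, direct)
```

Both routes give the same values here; passing the mapping up front just removes the separate recoding step, which is the saving the post describes.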
mall's homepage screenshot

Screenshot of the results from using mall with Polars

Results of using mall with R

New 📦 alert! {mall} is out now for both #rstats...and #python! The package uses #llm's to run NLP operations recursively over a data frame (sentiment, summarization, translation, etc). For Python it's a #polars extension. Both use #ollama to interact with the LLM. mlverse.github.io/mall/

29.10.2024 13:40 👍 42 🔁 14 💬 0 📌 1
Parallelize R code using user-defined functions (UDFs) in sparklyr - Posit The sparklyr package enables writing user-defined functions (UDFs) in R, which allow you to leverage Spark for efficient big data processing.

Are you a Spark user who prefers writing in R? User-defined functions with sparklyr might be what you need ✨

With `spark_apply()`, you can write functions in #RStats and use them in #Spark queries.

Learn more in the blog post: posit.co/blog/databri...

01.08.2024 13:55 👍 7 🔁 2 💬 0 📌 0

A new version of the #rstats probably package is on CRAN. A minor update with a bug fix and under-the-hood changes for the upcoming tune version.

But there's finally a hex logo (thanks to Edgar Ruiz) so we have that going for us. Which is nice.

probably.tidymodels.org

23.02.2024 12:07 👍 17 🔁 4 💬 1 📌 0