SMT

@pp0196

Sequences and consequences. Picture credit: Cellular landscape cross-section through a eukaryotic cell, by Evan Ingersoll

451 Followers, 711 Following, 446 Posts
Joined 24.08.2023

Latest posts by SMT @pp0196

They are likely using a cheap model. You get similar behavior in vscode copilot with cheaper models

07.03.2026 22:41 👍 1 🔁 0 💬 1 📌 0

Using semantic search to find publicly available gene-expression datasets academic.oup.com/bioinformati... 🧬💻🧪 github.com/srp33/GEO_NLP

05.03.2026 19:00 👍 6 🔁 2 💬 0 📌 0
Practical Power Analysis in R

today's #rstats package of the day 🏆 is pwrss which made it very easy to compute the power of a difference between correlations

cran.r-project.org/web/packages...

03.03.2026 18:59 👍 19 🔁 5 💬 0 📌 0
HTTP streaming and Server-Sent Events in R with nanonext - sse.R

HTTP streaming and Server-Sent Events in R with nanonext. Here's an example: gist.github.com/jrosell/178e...

02.03.2026 20:43 👍 8 🔁 4 💬 1 📌 0

#ridiculousbutcool
Embedding #RStats in #Duckdb via #TinyCC and Duckdb's C extensions API

27.02.2026 14:05 👍 0 🔁 0 💬 0 📌 0

#Duckdb #TinyCC Ridiculous, dangerous but cool !
(very roundabout way to test out ideas of a potential api for a #RStats UDFs extension)
Link for segfault lovers : github.com/sounkou-bioi...

26.02.2026 11:39 👍 2 🔁 0 💬 0 📌 1

Very happy to see Wasm gaining traction as a low barrier to entry alternative for programming thanks to the massive efforts by @quantstack.bsky.social 🎉 We're collecting Wasm resources for bioinformatics and beyond at wasmodic.github.io

25.02.2026 22:52 👍 12 🔁 4 💬 0 📌 0

A small generative animation made in #rstats using #ggplot2 and my experimental packages, playing with polar lines, hue rotation, and shader-style deformations.

25.02.2026 13:52 👍 7 🔁 1 💬 0 📌 0

Excited to share my first preprint on federated conditional analysis of rare single variant and aggregate association tests across six genetically-inferred ancestry groups in All of Us and UK Biobank doi.org/10.64898/202...

06.02.2026 09:32 👍 21 🔁 11 💬 1 📌 4

#rstats 4.5.3 "Reassured Reassurer" scheduled for March 11. Full schedule on developer.r-project.org (or the svn if you're impatient.) This should be the wrap-up release for the 4.5 series.

23.02.2026 14:12 👍 28 🔁 17 💬 0 📌 0
Photo of a feminine nonbinary person in a purple floral shirt in front of a purple poster. The poster is displaying a structural equation model analysing the influence of parental education on children's school grades. The person is smiling cheekily to the side.

Photo of a child with four fluffy buns and blue sunglasses. The child has lowered the sunglasses below their nose and is gazing over them.

Hi, I'm this week's curator, Josi! I'm a PhD student in statistical genetics 🧬 and use #RStats to do #SEM. I dabble in #DataViz and recently wrote my first #Rpackage. My hobbies are reading #Fantasy books 📚 and #maximalism. You may catch me at a conference matching my outfit to my poster 🤓🧜🏽‍♀️ #rladies

23.02.2026 17:51 👍 59 🔁 9 💬 2 📌 1

#RStats #tinyverse gurus: is there a mutation testing package out there that is compatible with {tinytest}?

23.02.2026 13:27 👍 1 🔁 1 💬 0 📌 0

tinyverse vs ....

21.02.2026 23:54 👍 6 🔁 1 💬 0 📌 0
SMT (@bioinfhotep@genomic.social) Attached: 4 images #Genomics #Bioinformatics Release of duckhts: #htslib based #Duckdb Extension for High Throughput Sequencing File Formats https://duckdb.org/community_extensions/extensions/duckh...

#Duckdb #htslib #Genomics #Bioinformatics #RStats

duckhts: Read HTS (VCF/BCF/BAM/CRAM/FASTA/FASTQ/GTF/GFF) files in DuckDB via htslib

Rduckhts: 'DuckDB' High Throughput Sequencing File Formats Reader Extension
genomic.social/@bioinfhotep...

21.02.2026 23:04 👍 2 🔁 1 💬 0 📌 0

base R packages

21.02.2026 14:09 👍 1 🔁 0 💬 0 📌 0
dtplyr: Data Table Back-End for 'dplyr' Provides a data.table backend for 'dplyr'. The goal of 'dtplyr' is to allow you to write 'dplyr' code that is automatically translated to the equivalent, but usually much faster, data.table code.

i guess you can always use this package cran.r-project.org/web/packages...

20.02.2026 18:44 👍 1 🔁 0 💬 1 📌 0

link is broken

20.02.2026 16:59 👍 1 🔁 0 💬 1 📌 0
GitHub - r-tooling/r4r: Monorepo for everything linked to the r4r project

R4R is a tool for creating a reproducible environment from a dynamic program trace, from our lab in Prague.

From an R script/notebook, it generates a Docker image that contains everything needed to run the R code and reproduce the results.

Try it out!

Also with an article at ACM REP 2025

14.08.2025 21:26 👍 2 🔁 1 💬 1 📌 0
GitHub - PRL-PRG/rcp

#RStats Interesting work to follow from people in the Vitek group

crbcc: R bytecode compiler implemented in C

github.com/PRL-PRG/crbcc

rcp : Copy-and-Patch JIT Compiler for R

github.com/PRL-PRG/rcp

20.02.2026 16:53 👍 0 🔁 0 💬 0 📌 0
Nallo: a Nextflow pipeline for comprehensive human long-read genome analysis
Abstract. Motivation: Long-read sequencing (LRS) is increasingly used for human medical research and clinical diagnostics, due to its capacity to generate co

Say hello to Nallo - our Nextflow pipeline for long-read WGS analysis! 👋 It handles both ONT and PacBio data and we're using this for rare disease and population projects in Sweden. A big team effort by Felix Lenner, Anders Jemt et al. 🧬💻 academic.oup.com/bioinformati...

20.02.2026 09:04 👍 11 🔁 4 💬 0 📌 0

#RStats When I was looking at some of the perf numbers in the quickr README, I was thinking: oh, these tails are R's garbage collection pauses. The R-to-C (LLM-enabled) rabbit hole I went down makes the pauses worse, because it is really R to R's C API, sometimes calling back into R via Rf_eval :D

19.02.2026 21:22 👍 0 🔁 2 💬 0 📌 0
GitHub - dickoa/samplyr: A Tidy Grammar for Survey Sampling

I love R for everything statistics, but I've always been a little jealous of statisticians using SPSS Complex Samples or SAS SURVEYSELECT to design their samples.

So I decided to build samplyr, a tidy and pipe-friendly grammar for survey sampling design in #rstats.

dickoa.gitlab.io/samplyr/inde...

18.02.2026 14:47 👍 109 🔁 23 💬 5 📌 3

Feb. update to the LLM+R guide 💪

8 new packages including:
code review, predictive modeling, speech-to-text, text-to-speech, HuggingFace integration, Gemini CLI companion, a CLI coding agent written in R 🤯, and more!

available in English 🇱🇷 and Spanish 🇲🇽
luisdva.github.io/llmsr-book/
#rstats

16.02.2026 17:30 👍 34 🔁 10 💬 0 📌 2

#tinycc based #RStats #C #Transpiler, only 4x slower than #{quickr} for the convolution benchmark! 2-3x faster than naive #C + marshaling. #tinycc transpilation will have an obvious advantage since it can always fall back when transpiling and use `Rf_lang3`, since it has access to R's C runtime!

17.02.2026 00:20 👍 1 🔁 4 💬 0 📌 1
GitHub - r-xla/tengen

#Rstats #HPC gurus
Is there a package for mixed-precision arithmetic that covers most of the exotic new floating-point formats, or does one need a specialized ML package from the r-xla organization, like tengen? github.com/r-xla/tengen

16.02.2026 19:39 👍 1 🔁 3 💬 0 📌 0
GitHub - eddelbuettel/r2u: CRAN as Ubuntu Binaries

First thing to do when you open an #Rstats project in the #codex webapp: copy-paste the r2u installation docs github.com/eddelbuettel... and tell it to write #Rstats scripts for R package dev instead of snake crap (they RL perl out this model for some unknown reason)

16.02.2026 14:44 👍 3 🔁 1 💬 0 📌 0

#RStats #Statsky
Any references or packages out there on statistical inference for binary outcomes where one outcome has no measurement error but the other does? Should I just use linear models and get over it?

14.02.2026 22:54 👍 0 🔁 3 💬 0 📌 0
GitHub - comp-med/r-ofhelper: Utility R package for the Our Future Health DNAnexus Trusted Research Environment

For those of us R-users who are working with/on #dnanexus (specifically the Our Future Health TRE), I've written a small #rstats package to interact with their `dx` utility and submit jobs (among other things): github.com/comp-med/r-o...

12.02.2026 15:02 👍 1 🔁 1 💬 0 📌 0

#Rstats To avoid if/else and switch death in this project, and to get a functional, centralized (reviewable) definition of the rules for future improvements, I thought the pattern matching in the #{lambda.r} package (by Brian Lee Yung Rowe) would be great

github.com/zatonovo/lam...

12.02.2026 02:16 👍 1 🔁 1 💬 0 📌 1