Sarah • Machine Learning Engineer (@galaxyinferno)

🚀 Ready to Finally Upgrade from messy ML notebooks? Setup to Shipping Challenge
Tired of hard-to-maintain ML code? Join me on the 4-week ML Project Blueprint Challenge and turn a blank repo into a clean, configurable, and shippable ML project.

August ML coding challenge is alive and kicking - join for free and learn something with me 👀

www.sarahglasmacher.com/ml-repo-stru...

09.08.2025 02:13 👍 1 🔁 0 💬 0 📌 0

Sarah Glasmacher: Read about what I'm learning as an ML engineer, what I observe in my field, and useful links and resources I found (incl. courses and books), and get updates on new content and tutorials I'm releasing

now i’ll go and finish the setup for my august coding challenge and send out the newsletter and maybe i’ll get back to my optuna code after running or tomorrow 🤞🏻

galaxy-inferno-codes.kit.com

01.08.2025 19:34 👍 1 🔁 0 💬 0 📌 0

this week was a bit full and nothing felt like enough with the upcoming deadline, but we made great progress in the end: we merged all our feature branches together and are still getting a tracked and registered machine learning model out of it

01.08.2025 19:34 👍 0 🔁 0 💬 1 📌 0

happy friday, coders ⌨️🎊 technically the work week is over - however I have an important project presentation next tuesday, so I think i’ll tweak my gridsearch some more 🫣

01.08.2025 19:34 👍 2 🔁 0 💬 1 📌 0

for me it’s also “it’s more and more embarrassing to be wrong or to admit I don’t know” 🫤
personally, I believe we should admit to not knowing a lot more than we do, but the longer I’m in corporate, the more I think it damages my image if I do admit this publicly.

29.04.2025 18:18 👍 8 🔁 0 💬 1 📌 0

Being an adult truly involves a lot of cleaning - not only your apartment but for some reason also your computer? And your phone camera roll (don't tell me I'm the only one with 2k screenshots 🫣) - and then there is work documentation to do as well. Anyways, I did some cleaning this weekend

13.04.2025 21:35 👍 3 🔁 0 💬 0 📌 0

this one felt okay though & I think it’s important for data scientists to move more towards tools like this to ever be able to deploy projects. That’s my large goal for the rest of the year: how can we enable data science teams to develop “better” code in small achievable steps 👣

08.04.2025 23:01 👍 1 🔁 0 💬 0 📌 0

published my 4th article of the year on my blog 🥳 and my first ever post about mlflow, where i’m trying to overcome my imposter syndrome: i’m learning a lot, but am far from an expert and am sooo afraid of giving bad advice 🫠

buff.ly/pwLyjbL

08.04.2025 23:01 👍 0 🔁 0 💬 1 📌 0

my linkedin post finally published *with* its image attached 🥳

(also someone pls tell me why writing 400 word social posts feels easy but writing an 800 word blog post feels impossible?)

06.04.2025 22:11 👍 0 🔁 0 💬 0 📌 0

“Rust” always trips me up because I can never tell on first glance if we’re talking about the game or the coding language - somehow my brain always defaults to the game and ends up confused 🤷🏻‍♀️

05.04.2025 07:25 👍 1 🔁 0 💬 0 📌 0

it’s not just the pay - the mindset is so extremely risk-averse and stubborn in my experience. And I’m in Germany, we’re not big risk takers to begin with, but public companies are an extreme even here. It’s really hard to change things or create new ideas.. at some point you get tired of it

03.04.2025 22:10 👍 1 🔁 0 💬 0 📌 0

No thank you, chatgpt, i would *not* like a printable tracker, I haven’t used my printer in 5 years 😅🫠 Is this a new thing or did I somehow tell it I love printing things?

29.03.2025 18:15 👍 0 🔁 0 💬 0 📌 0

what are the more appealing options? i’m out of the loop

25.03.2025 16:13 👍 0 🔁 0 💬 0 📌 0

Another day of triple checking all data and… finding out two temperature columns are not the same?! 🫠😤 I will never take sorted and orderly data for granted again.

24.03.2025 23:54 👍 0 🔁 0 💬 0 📌 0
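A check like this can be scripted instead of eyeballed. Below is a minimal sketch in plain Python, assuming the two columns are already loaded as equal-length lists with `None` for missing values; the function name `find_mismatches`, the tolerance, and the sample numbers are made up for illustration and are not from the actual project.

```python
import math

def find_mismatches(col_a, col_b, tolerance=1e-6):
    """Return (row_index, a, b) for every row where the two columns disagree."""
    if len(col_a) != len(col_b):
        raise ValueError("columns have different lengths")
    mismatches = []
    for i, (a, b) in enumerate(zip(col_a, col_b)):
        a_missing = a is None
        b_missing = b is None
        if a_missing and b_missing:
            continue  # both missing: counts as "equal" for this check
        # One side missing, or values differing by more than the tolerance
        if a_missing != b_missing or not math.isclose(a, b, abs_tol=tolerance):
            mismatches.append((i, a, b))
    return mismatches

# Hypothetical sample data: two temperature columns that "should" be identical
temp_sensor = [21.5, 22.0, None, 19.8, 20.1]
temp_export = [21.5, 22.4, None, 19.8, None]

mismatches = find_mismatches(temp_sensor, temp_export)
for row, a, b in mismatches:
    print(f"row {row}: {a} vs {b}")
```

Treating "both missing" as a match is a design choice; a stricter audit might report those rows separately.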

We almost deployed a model trained on missing data 🫣

New newsletter: my team and I were deep into a data audit - the forecasting model had already been trained - so the data should be very clean and complete, right? After all, someone has already worked with it... Spoiler alert: our data wasn’t nearly as complete as we thought. 😅

17.03.2025 22:34 👍 1 🔁 0 💬 0 📌 0

Is there any good source on comparing which LLMs/providers people actually use day to day in terms of percentages? I feel like we always hear from vocal minorities when new models launch, but how much has, for example, Gemini actually grown on the market in total? 👀

15.03.2025 12:34 👍 0 🔁 0 💬 0 📌 0

not sure, i just tried to google it but most results just describe “normal” packages. I do the “pip install -e .” in the folder of my local library and then add it as a dev dependency to my pyproject.toml too. Maybe not quite the *correct* uv way, but at least it works

14.03.2025 01:14 👍 0 🔁 0 💬 0 📌 0

not sure if i understand your set up correctly, but i think you need to build the library first before you can install it? Or run something like “pip install -e .” which keeps updating the build on changes 🤔 not sure if “uv add” handles all of that

14.03.2025 01:08 👍 0 🔁 0 💬 1 📌 0

building the prototype was never the real challenge?! so I struggle to see the “life changing” advantage here

11.03.2025 11:11 👍 0 🔁 0 💬 0 📌 0

building a fast prototype is nice and all, but if it’s fast and dirty, it tells you nothing about the upcoming challenges in the deployment process and you will need to start completely from scratch to build it into a fully deployable and maintainable product 🤷🏻‍♀️

11.03.2025 11:11 👍 1 🔁 0 💬 1 📌 0

vibe coding?
see you in 2 years, when you start discussing “vibe debugging”, “vibe tech debt”, “vibe monitoring”, “vibe deployment”, “vibe security” and “vibe cost analytics”

11.03.2025 11:11 👍 0 🔁 0 💬 1 📌 0

However, it also seems like many embedding models are *trained* using cosine similarity for the loss function, so in a way it makes sense to use it for retrieval via embeddings too - and many of the vector metrics share properties, so it's not like another metric computes smth completely different

03.03.2025 21:56 👍 0 🔁 0 💬 0 📌 0
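The "metrics share properties" point can be made concrete. For unit-length vectors, squared Euclidean distance equals 2 - 2 * cosine similarity, and the dot product equals the cosine similarity, so all three rank results the same way. Here is a small sketch of that in plain Python; the vectors are toy numbers, not output from any real embedding model, and it assumes the embeddings are normalized to length 1.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]

def cosine_similarity(a, b):
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))

def euclidean_distance(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Toy "embeddings" (made-up values), normalized to unit length
query = normalize([0.2, 0.9, 0.1])
docs = {
    "doc_a": normalize([0.1, 0.8, 0.3]),
    "doc_b": normalize([0.9, 0.1, 0.2]),
    "doc_c": normalize([0.3, 0.7, 0.0]),
}

# Higher is better for cosine and dot product, lower is better for distance
by_cosine = sorted(docs, key=lambda k: cosine_similarity(query, docs[k]), reverse=True)
by_dot = sorted(docs, key=lambda k: dot(query, docs[k]), reverse=True)
by_euclid = sorted(docs, key=lambda k: euclidean_distance(query, docs[k]))

print(by_cosine, by_dot, by_euclid)
```

If the vectors are not normalized, the rankings can differ, which is where the choice of metric would start to matter.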

FYI even OpenAI say in their documentation of text-embedding-3-large the following: "We recommend cosine similarity. The choice of distance function typically doesn't matter much." 😅 🤷‍♀️ platform.openai.com/docs/guides/...

03.03.2025 21:56 👍 0 🔁 0 💬 1 📌 0

I just wanted to write a quick tutorial on how I've used cosine similarity in pgvector to search for RAG sources in my side project... and now I'm doing a whole deep dive into "why tf are we using cosine similarity at all?" 🤔 and that's why I'm a very inconsistent content creator

03.03.2025 21:56 👍 1 🔁 0 💬 1 📌 0

today I added all my Obsidian/second brain markdown files into the local RAG vector store and made some small improvements to the retrieval to get a flexible number of results back

01.03.2025 23:33 👍 1 🔁 0 💬 0 📌 0

👩🏻‍💻 RAG side project update: Somehow I thought that fine-tuning a similarity threshold for RAG would be super difficult, but it turns out for my use case the “default” of 0.5 works just fine 🤷🏻‍♀️😅

01.03.2025 23:33 👍 2 🔁 0 💬 1 📌 0
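A rough sketch of what threshold-based retrieval with a flexible number of results could look like, in plain Python. This is not the project's actual code: the in-memory `store`, the `retrieve` function, the optional `max_results` cap, and the note names are all hypothetical, and only the 0.5 default comes from the post.

```python
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def retrieve(query_vec, store, threshold=0.5, max_results=None):
    """Return (doc_id, score) pairs scoring at least `threshold`, best first.

    The number of results is flexible: everything above the threshold comes
    back, optionally capped at `max_results`.
    """
    scored = [(doc_id, cosine_similarity(query_vec, vec)) for doc_id, vec in store.items()]
    hits = [(doc_id, score) for doc_id, score in scored if score >= threshold]
    hits.sort(key=lambda pair: pair[1], reverse=True)
    return hits if max_results is None else hits[:max_results]

# Hypothetical in-memory stand-in for a vector store (made-up vectors)
store = {
    "note_rag": [0.9, 0.1, 0.0],
    "note_timeseries": [0.1, 0.9, 0.1],
    "note_mlflow": [0.6, 0.5, 0.2],
}
query = [1.0, 0.2, 0.0]

results = retrieve(query, store)
for doc_id, score in results:
    print(f"{doc_id}: {score:.3f}")
```

When the search runs in pgvector instead, note that its `<=>` operator returns cosine *distance* (1 minus cosine similarity), so a similarity threshold of 0.5 would correspond to a distance of at most 0.5.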

this will be mainly an exploration task, i don’t mean to solve the whole problem in 4 days, but 4 days is enough to try out 1-2 simple techniques in a notebook or quick script and write a short post about the problem itself, so end of month sprint, let’s go 🧨

25.02.2025 08:25 👍 0 🔁 0 💬 0 📌 0

I promised I would figure out what to do with my side project - here we are: i quickly threw together a short post on my website about a veery quick February project 😬

I said I would do a coding project each month, and I don’t intend to break that promise even if it means doing it in 4 days 😂

25.02.2025 08:25 👍 1 🔁 0 💬 1 📌 0

the irony of starting a RAG project & then immediately being thrown into a time series forecasting project at work😭 so now i’m spending all my free time learning about time series. It’s not even just about the available time - i find it incredibly hard to go “sponge-mode” on two topics at once. 🧽

24.02.2025 08:25 👍 1 🔁 0 💬 0 📌 0

i’ve been in the office quite a bit more lately, the parking situation is horrendous but the smaller building is cozy - it’s pretty much exclusively IT of the company so I know most of the people I meet in the hallway and we have our own bigger kitchen to hang out in 😊

23.02.2025 08:25 👍 1 🔁 0 💬 0 📌 0
