LimiX
A new player enters the arena of Foundation Models on Tabular Data: www.limix.ai - novel methods for pre-training and data generation that look highly relevant. Their evaluation on selected datasets is showing strong performance. Exciting times, looking forward to further in depth comparisons!
15.09.2025 11:28
๐ 0
๐ 0
๐ฌ 0
๐ 0
VLDB 2025: AI Meets Enterprise Data Management โ The Tabular FM Moment โ Johannes Hoffart
At #VLDB2025 London I joined a panel on Neural Relational Data. My Take: LLMs solve some data management tasks, but the next wave is Foundation Models on Relational Data and Semantically Linked Tables. More on this and further trends in #AI and #DataManagement - www.hoffart.ai/vldb-2025-ai...
11.09.2025 11:55
๐ 4
๐ 0
๐ฌ 0
๐ 0
Senior/Principal Applied Research Scientist (f/m/d): Foundation Models on Linked Business Data
Senior/Principal Applied Research Scientist (f/m/d): Foundation Models on Linked Business Data
Our team developing Foundation Models on Tables & Linked Business Data is looking for a new Senior Applied Research Scientist! Excited about pushing the frontier in foundation models on tabular data? Want to have business impact and academic visibility?
Look no further: jobs.sap.com/job/Walldorf...
01.08.2025 07:47
๐ 3
๐ 0
๐ฌ 0
๐ 0
5 Minute Papers on AI for the Planet
AI is more than just chatbots! Learn about how AI can be used to protect biodiversity, fight climate change, and just better understand our planet through 5-minute explainers covering academic papers ...
For the past 3 years, I've taught a course on Machine Learning for Climate Change to undergrads. At times, people have asked if the course lectures could be made available online. While I can't offer that, I have decided to start making "5 Minute Papers on AI for the Planet" videos. Hope its useful!
20.06.2025 01:55
๐ 203
๐ 54
๐ฌ 10
๐ 2
Can you train a performant language model using only openly licensed text?
We are thrilled to announce the Common Pile v0.1, an 8TB dataset of openly licensed and public domain text. We train 7B models for 1T and 2T tokens and match the performance similar models like LLaMA 1 & 2
06.06.2025 19:18
๐ 147
๐ 59
๐ฌ 2
๐ 2
I asked "on the other platform" what were the most important improvements to the original 2017 transformer.
That was quite popular and here is a synthesis of the responses:
28.04.2025 06:47
๐ 204
๐ 43
๐ฌ 4
๐ 3
This was helpful.
Also worth noting that Bluesky remains a very fraught place for AI discussions for a variety of reasons, good & bad, but with the impact of keeping a lot of the most relevant AI news, paper discussions & biggest names on X
That might change, but it hasnโt yet. Still posting, tho.
26.04.2025 02:55
๐ 241
๐ 17
๐ฌ 12
๐ 1
Very long time ago, ACM SIGMOD 2012. Submitted my thesis outline and got very good feedback both during the session and afterwards during an individual lunch with a very senior colleague. Would recommend!
31.03.2025 11:16
๐ 0
๐ 0
๐ฌ 0
๐ 0
The State of LLM Reasoning Models
Part 1: Inference-Time Compute Scaling Methods
I just shared a new article, "The State of Reasoning Models", where I am exploring 12 new research articles on improving the reasoning capabilities of LLMs (all published after the release of DeepSeek R1): magazine.sebastianraschka.com/p/state-of-l...
Happy reading!
08.03.2025 14:37
๐ 61
๐ 14
๐ฌ 1
๐ 1
I shared a controversial take the other day at an event and I decided to write it down in a longer format: Iโm afraid AI won't give us a "compressed 21st century"
Here: thomwolf.io/blog/scienti...
It's an extension of this interview discussion from the AI summit: youtu.be/AxBd3G0lFLs?...
06.03.2025 13:03
๐ 132
๐ 34
๐ฌ 11
๐ 12
When using LLM-as-a-judge, practitioners often use greedy decoding to get the most likely judgment. But we found that deriving a score from the judgment distribution (like taking the mean) works better!
โLLM-as-a-judge with greedy decoding
๐Using the distribution of the judgeโs labels
06.03.2025 22:04
๐ 27
๐ 4
๐ฌ 1
๐ 0
Discover European cities โ๏ธ while building your career! Check out the ELLIS PhD/Postdoc Program's 2025 Winter & Summer School Schedule! Dive deep into cutting-edge #AI research, learn from top researchers & connect with peers across Europe. Learn more: bit.ly/42iow66 #PhD #machinelearning
13.01.2025 12:45
๐ 15
๐ 6
๐ฌ 1
๐ 0
Our first release of 2025: ๐จ๐ข๐ค๐ก๐๐๐๐ฃ๐ฉ๐จ, ๐๐ต๐ฒ ๐๐ถ๐บ๐ฝ๐น๐ฒ๐๐ ๐น๐ถ๐ฏ๐ฟ๐ฎ๐ฟ๐ ๐๐ผ ๐ฏ๐๐ถ๐น๐ฑ ๐ฎ๐ด๐ฒ๐ป๐๐ถ๐ฐ ๐๐๐๐๐ฒ๐บ๐!
๐ฅ Main logic in ~1000 LoC
๐งโ๐ป Agent writes its actions in code! LLMs are much better at writing code than current standard of writing JSON => higher perf
๐ Any LLM support (h/t LiteLLM)
๐ก๏ธ Secure code exec (h/t E2B)
01.01.2025 15:21
๐ 123
๐ 18
๐ฌ 4
๐ 3
Could not be more proud of our crew! Kudos to @marcospinaci.bsky.social Marek Polewczyk Markus Kohler Sam Thelin @tj-klein.bsky.social Clemens Biehl @margaridacosta15.bsky.social Margarida Costa Andrรฉ Sreลก Jonas Kolk - tremendous achievement!
14.12.2024 17:56
๐ 1
๐ 0
๐ฌ 0
๐ 0
SALT: Sales Autocompletion Linked Business Tables Dataset
Foundation models, particularly those that incorporate Transformer architectures, have demonstrated exceptional performance in domains such as natural language processing and image processing....
โถ๏ธ Open source, multi-table data set containing millions of sales orders sourced from real, production linked business data:
Paper ๐ "SALT: Sales Autocompletion Linked Business Tables Dataset" openreview.net/forum?id=UZb...
Data ๐ป github.com/SAP-samples/...
14.12.2024 17:56
๐ 5
๐ 0
๐ฌ 1
๐ 0
Have a look at our work on foundation models on tabular data, published today at #TRL @ #NeurIPS2024:
๐ PORTAL, an open weight and code foundation model trained on tabular data, and
๐ SALT, a real business data set containing millions of sales orders across multiple tables.
Further details ๐
14.12.2024 17:56
๐ 9
๐ 1
๐ฌ 1
๐ 0
Table Representation Learning Workshop
TRL Workshop ---
The 3rd Table Representation Learning (TRL) workshop at NeurIPS 2024 is approaching soon โจ
Join us Saturday 14 Dec from 8:30AM for an amazing program and discussions about all things neural models + tabular data (table-representation-learning.github.io ).
Not in Vancouver? Join online neurips.cc ๐
09.12.2024 18:18
๐ 9
๐ 3
๐ฌ 1
๐ 0
SAP Knowledge Graph - SAP Jobs
Find SAP Knowledge Graph at SAP
We are growing the team building the SAP Knowledge Graph and are #hiring AI & Data Scientists, Data Engineers, Knowledge Engineers and Applied Research Scientists in Germany (Berlin, Walldorf) and India (Bangalore): jobs.sap.com/search/?crea...
Let's take GenAI to the next level with #KG!
04.12.2024 10:36
๐ 6
๐ 1
๐ฌ 0
๐ 0
Tired of saturated benchmarks? Want scope for a significant leap in capabilities?
๐ฅ Introducing BALROG: a Benchmark for Agentic LLM and VLM Reasoning On Games!
BALROG is a challenging benchmark for LLM agentic capabilities, designed to stay relevant for years to come.
1/๐งต
21.11.2024 16:24
๐ 95
๐ 20
๐ฌ 4
๐ 7
What are data spaces and what do they do?
Learn more about data spaces: what they are, what they do and whatโs next.
Great blog post from @odihq.bsky.social @esimperl.bsky.social on the current development state of #dataspaces in Europe.
theodi.org/news-and-eve...
30.11.2024 12:29
๐ 3
๐ 1
๐ฌ 0
๐ 0
How AutoML Creates New Opportunities for Europe - Frank Hutter // CyberValley Podcast #5
YouTube video by Cyber Valley
Tabular DL and AutoML podcast just dropped. For sure watching this
youtu.be/3qpQ-sMRafE
26.11.2024 18:42
๐ 11
๐ 2
๐ฌ 1
๐ 0
Let me surface this again now that this place is more lively: Come join us at SAP in the US or Germany for a PhD Summer Internship in 2025 in Foundation Models on Structured Data, Table Representation Learning, LLMs and Knowledge Graphs! #MLInternships
26.11.2024 20:51
๐ 9
๐ 3
๐ฌ 0
๐ 0
Added some more folks to the Open Source AI Starter Pack:
go.bsky.app/N8yVZdW
24.11.2024 18:43
๐ 79
๐ 22
๐ฌ 22
๐ 1
AI@HPI Conference logo
I am chairing the
AI@HPI Conference: Responsible AI
December 3-4 in Potsdam (Berlin metropolitan area)
Discussing AI with regard to bias, elections/society, trustworthiness, copyright, the EU AI Act, and best practices.
Registration:
hpi.de/en/ai-hpi-co...
Please spread the word!
21.11.2024 17:36
๐ 7
๐ 2
๐ฌ 1
๐ 2
Hi ๐ We're glad to be here on @bsky.app and looking forward to engaging in this community. But first, learn a little more about us...
#ELLISforEurope #AI #ML #CrossBorderCollab #PhD
21.11.2024 10:37
๐ 121
๐ 18
๐ฌ 3
๐ 1