Great to be part of this project led by the amazing @hamishivi.bsky.social. The most fun (in retrospect) thing is to observe how the results start to shift as we scale up the candidate pool, evaluation suite, and selection size :) And eventually we find a simple method does the best!
04.03.2025 21:14
๐ 2
๐ 1
๐ฌ 0
๐ 0
How well do data-selection methods work for instruction-tuning at scale?
Turns out, when you look at large, varied data pools, lots of recent methods lag behind simple baselines, and a simple embedding-based method (RDS) does best!
More below โฌ๏ธ (1/8)
04.03.2025 17:10
๐ 13
๐ 4
๐ฌ 1
๐ 2
This is a great effort for the migration, thanks for putting it together! Can I be added to the list?
12.11.2024 22:23
๐ 1
๐ 0
๐ฌ 1
๐ 0