Characterizing Datasets and Building Better Models with Continued Pre-Training
What’s the most effective way to add new domain knowledge to an open LLM? A new blog post from my team covers experiments we did at the beginning of the year to start answering this question. It starts, unsurprisingly, with sweeping your learning rate… www.databricks.com/blog/charact...
25.11.2024 23:28
PLDI 2025 Artifact Evaluation Committee Self Nomination
This form allows any member of the community to nominate *yourself* to be part of the Artifact Evaluation Committee for PLDI 2025. While we cannot select all qualified candidates, we will do our best ...
Attention🚨 We are looking for motivated students and researchers to be members of the PLDI 2025 Artifact Evaluation Committee. This year, we are accepting self-nominations (the form is here: forms.gle/2TPmixasDmqM...). Deadline: Dec 23rd, 2024.
For more info: pldi25.sigplan.org/track/pldi-2...
22.11.2024 23:37
Mat: are rerankers supposed to do this?
Team: 👀
<2 months later>
Paper!
This has been incredibly fun work to be a part of; my favorite kind of science is finding holes in commonly held assumptions.
21.11.2024 17:28