New Paper Day! For ACL 2025 Findings:
You should **drop dropout** when you are training your LMs AND MLMs!
Paper: arxiv.org/abs/2405.16039
Code: github.com/robertcsorda...
Come visit our poster "MoEUT: Mixture-of-Experts Universal Transformers" on Friday at 4:30 in East Exhibit Hall A-C #1907 on #NeurIPS2024. With Kazuki Irie, Jürgen Schmidhuber, Christopher Potts and @chrmanning.bsky.social.
Excited for #neurips2024 and our "System 2 Reasoning at Scale" workshop. We have an exciting lineup of speakers who will answer your most burning questions about AI and reasoning.
🔥 Got spicy questions? Submit & vote here:
app.sli.do/event/dJNU63...