Felipe

@ffffelipe

Pre-training @ Cohere

9 Followers · 71 Following · 3 Posts · Joined 12.11.2023
Latest posts by Felipe @ffffelipe

Post image

Fun sonnet 4 hallucination on muP

The Yang-Lecun correspondence

30.05.2025 07:59
Command A: An Enterprise-Ready Large Language Model

In this report we describe the development of Command A, a powerful large language model purpose-built to excel at real-world enterprise use cases. Command A is an agent-optimised and multilingual-cap...

Very happy to share the Command A tech report! I believe this is the largest published model with muP+fp8 :)

Lots of interesting post-training details as well. And great performance ofc!

arxiv.org/abs/2504.00698

02.04.2025 19:45

> spend some time porting critical code from python to c++

> c++ code is slower than python

> After a while optimizing it, figure out you forgot to add -O3

> Runs much faster obviously

> At the end the python bindings eat up half of the runtime benefits

🥲🎢

25.11.2024 21:32