
Felix

@felix-red-panda

speech synthesis and LLM nerd, DMs open, working on LLM stuff https://felix-red-panda.com based in Berlin, Germany

435 Followers
264 Following
29 Posts
Joined 24.11.2024

Latest posts by Felix @felix-red-panda

Why are embeddings so cheap? or a lesson in profiling and FLOPS per dollar

www.tensoreconomics.com/p/why-are-em...

29.09.2025 23:51 👍 0 🔁 0 💬 0 📌 0

Why does embedding the entire Wikipedia only cost a few dollars? Deep dive blog post, link below

29.09.2025 23:50 👍 0 🔁 0 💬 1 📌 0

Hi 👋

17.05.2025 17:14 👍 2 🔁 0 💬 1 📌 0
LLM Inference Economics from First Principles

The main product LLM companies offer these days is access to their models via an API, and the key question that will determine the profitability they can enjoy is the inference cost structure.

www.tensoreconomics.com/p/llm-infere...

16.05.2025 17:44 👍 6 🔁 0 💬 0 📌 0

deep dive on how LLM inference works (link in the post below)

16.05.2025 17:44 👍 12 🔁 4 💬 1 📌 0

the new RTX 5000 Blackwell seems like quite a good deal compared to a modded 4090 48 GB

10.05.2025 22:30 👍 3 🔁 0 💬 0 📌 0

Singapore is very impressive :D

26.04.2025 03:17 👍 5 🔁 0 💬 0 📌 0

made it to Singapore :)

24.04.2025 15:42 👍 2 🔁 0 💬 1 📌 0

I'll be attending ICLR. DM me if you would like to chat :)

21.04.2025 23:33 👍 1 🔁 0 💬 0 📌 2

what do you think is a reasonable time to post here? (my posts are naturally more in the evening of EU time - so morning time in the US)

11.01.2025 20:32 👍 0 🔁 0 💬 1 📌 0

I'll be leaving London on Saturday afternoon

11.01.2025 18:31 👍 0 🔁 0 💬 1 📌 0

but I'm continuing to do it anyway 🙃

11.01.2025 18:20 👍 5 🔁 0 💬 2 📌 0

I’ll be in London starting Sunday evening (tomorrow). I want to meet people! :D (please DM me, my DMs are open)

11.01.2025 18:20 👍 0 🔁 0 💬 1 📌 0

Posting here feels far more like shouting in a void than on X/Twitter 🥲

10.01.2025 18:49 👍 8 🔁 0 💬 1 📌 0

I finally got around to putting some stickers on my laptop 😊

10.01.2025 18:32 👍 1 🔁 0 💬 0 📌 0

I met someone yesterday who works in LLM safety (of closed models), yet he didn’t know the term "LLM inference" and didn’t understand the concept without an in-depth explanation 🙃

04.01.2025 22:15 👍 4 🔁 0 💬 1 📌 0

Why are system prompts for image diffusion models not a thing?

04.01.2025 22:04 👍 0 🔁 0 💬 1 📌 0

Happy new year everyone 🎆 2024 was a really good year for me, I hope 2025 will be even better 🚀

01.01.2025 01:42 👍 1 🔁 0 💬 0 📌 0

I’m having fruity-tasting coffee ☕️ #38c3

29.12.2024 10:11 👍 1 🔁 0 💬 0 📌 0

Its being physically small and drawing very little power at idle probably also plays a role (vs. having a desktop computer with two 4090s idling)

27.12.2024 09:23 👍 2 🔁 0 💬 0 📌 0

I’m attending the #38c3 in Hamburg. DM me to meet up :)

26.12.2024 18:49 👍 2 🔁 0 💬 1 📌 0

who remembers GPT-3 Davinci pricing?

20.12.2024 19:24 👍 4 🔁 0 💬 0 📌 0

the o3 announcement feels very analogous to the GPT-3 announcement in early 2020: "here is new tech, it's very powerful, but it's super expensive to run and we're super selective with who we give access to"

20.12.2024 19:23 👍 4 🔁 1 💬 1 📌 0

I’ll be at NeurIPS starting Tuesday morning. I want to meet people! :D (my DMs are open)

09.12.2024 05:53 👍 1 🔁 0 💬 0 📌 0

I'll be in SF starting this weekend. I'd love to meet cool people :D (My DMs are open)

27.11.2024 01:25 👍 6 🔁 0 💬 0 📌 0

Then choose the Jupyter way; you’ll have a web browser and OpenSSH installed anyway

25.11.2024 19:19 👍 0 🔁 0 💬 0 📌 0

The VS Code SSH functionality works pretty well; alternatively, tunnelling web traffic through SSH for Jupyter is also neat

25.11.2024 18:58 👍 0 🔁 0 💬 0 📌 0

I have the same problem - I hope people just follow me first so that narrows down the search space

24.11.2024 23:07 👍 3 🔁 0 💬 0 📌 0

How do I do that?

24.11.2024 22:10 👍 1 🔁 0 💬 1 📌 0