's Avatar

@annoyingreposter

199
Followers
447
Following
2,334
Posts
18.10.2024
Joined
Posts Following

Latest posts by @annoyingreposter

today was contacted by open claw affiliate

repiled =)

06.03.2026 19:43 👍 0 🔁 0 💬 0 📌 0

openclaw-senator

06.03.2026 18:36 👍 1 🔁 0 💬 0 📌 0
AUTOHARNESS: IMPROVING LLM AGENTS BY AUTOMATICALLY SYNTHESIZING A... Despite significant strides in language models in the last few years, when used as agents, such models often try to perform actions that are not just suboptimal for a given state, but are strictly...

idk what is "python harness", but looks useful openreview.net/forum?id=g9r...

06.03.2026 06:57 👍 0 🔁 0 💬 0 📌 0

no eu no us

canada and apac is all I've got

05.03.2026 20:39 👍 0 🔁 0 💬 0 📌 0

+1 great journal, please consider volunteering for the role of Action Editor at TMLR!

05.03.2026 18:14 👍 4 🔁 1 💬 0 📌 0
Preview
GitHub - Quantum-MultiScale/DFTpy Contribute to Quantum-MultiScale/DFTpy development by creating an account on GitHub.

saw it once github.com/Quantum-Mult...

05.03.2026 16:48 👍 0 🔁 0 💬 1 📌 0

can you please post the recording afterwards to your yt?

05.03.2026 10:57 👍 0 🔁 0 💬 0 📌 0
Post image

job-boards.greenhouse.io/deepmind/job...

05.03.2026 09:53 👍 0 🔁 0 💬 0 📌 0

I don't think that I'm necessary. Coding agents one shot what I do for weeks, create thousands of lines of code. No idea if they're actually useful. Even at GSoC I see many students try to cheat their way via AI-assisted code. I don't really see why it might be useful and in what way. This is life.

04.03.2026 21:36 👍 0 🔁 0 💬 0 📌 0

I suck balls at EFGs still, most things came from muscular memory but not concious understanding. Keep working, keep staring at these pdfs. Solve-solve-solve.

04.03.2026 18:39 👍 0 🔁 0 💬 0 📌 0
Post image

📣 Reinforcement Learning Summer School is returning to Milan in 2026!

Co-organized with @ellisunitmilan.bsky.social & designed for Master's and PhD students on RL theory, multi-agent systems, RL & LLMs, real-world applications...

📍 Milan 🇮🇹
📅 3-12 June
⏰ Apply by 27 March
🔗 https://bit.ly/4b2Plhp

04.03.2026 14:49 👍 19 🔁 9 💬 0 📌 0
Preview
Extensive-Form Perfect Equilibrium Computation in Two-Player Games We study the problem of computing an Extensive-Form Perfect Equilibrium (EFPE) in 2-player games. This equilibrium concept refines the Nash equilibrium requiring resilience w.r.t. a specific vanishing...

it's quite interesting if this work is related because it solves LCP:
arxiv.org/abs/1611.05011

looks like a parallel line of work to me, hm

04.03.2026 07:06 👍 0 🔁 0 💬 0 📌 0

I more understood poker with raise example because it contains subgames 😀

03.03.2026 19:39 👍 0 🔁 0 💬 0 📌 0

I didn't get some of discussion either. But the Sørensen's paper looks relevant to extend the current LP solver for EFGs

it's as you said non-trivial but may yield some interesting results even for maximal lotteries, I guess (could and must involve some additional work).

03.03.2026 17:38 👍 1 🔁 0 💬 1 📌 0

thinking mode hallucinates too much, so some might even find it useless

03.03.2026 16:02 👍 0 🔁 0 💬 0 📌 0

or are you done with pure LP and want more gradient-based stuff

a good solver would feed generations, though!

03.03.2026 14:49 👍 0 🔁 0 💬 1 📌 0

btw @sharky6000.bsky.social as I am still eager to implement that issue that you've closed covering sub-perferct NEs, there are perturbed games as well: www.itu.dk/~trbj/papers...

maybe, there's something (quasi-)perfect for normal games as well? (the notion is different as stated in the pdf)?

03.03.2026 14:49 👍 1 🔁 0 💬 2 📌 0
Iconic Interactive | Building Intelligent, Living Worlds Building the tools and infrastructure that enable creators to craft intelligent, living worlds, marking the next frontier of interactive entertainment.

I like the mission of this company to create AI-assisted gaming experience: iconicgames.io

certainly, impoosible to do in every genre but why not?

03.03.2026 07:53 👍 1 🔁 0 💬 0 📌 0

how kind of you to propose that as representatives of the countries who initiated the conflict

03.03.2026 07:03 👍 0 🔁 0 💬 0 📌 0

so, there're wrong words, what does it even mean? or it doesn't go further than a purely personal preference?

02.03.2026 21:28 👍 0 🔁 0 💬 0 📌 0

the blog is about an "old" (half a year paper) but contains some good applications

02.03.2026 19:31 👍 0 🔁 0 💬 0 📌 0

adding subgame perfect equilibrium solver to open spiel

02.03.2026 19:29 👍 0 🔁 0 💬 0 📌 0
Preview
Evolution Strategies for LLM Fine-Tuning: Four New Research Directions Explore how evolution strategies are reshaping LLM fine-tuning with advances in reasoning, metacognition, quantization, and scalability theory.

found a blog about LLMs as evolutional operators that's worth attention www.cognizant.com/us/en/ai-lab...

02.03.2026 18:50 👍 0 🔁 0 💬 1 📌 0

lowkey smart: dollar per goal prize-to-performance, makes sense when you target a specific aspect to improve your own game, where the metric is clear

02.03.2026 12:15 👍 0 🔁 0 💬 0 📌 0

can you task your cats with maintaining your posting schedule>

nice plants!

02.03.2026 07:38 👍 1 🔁 0 💬 1 📌 0

I think the other thing is to look into sensititivyt analysis from dynamical perspective: I've recently read a lot about naomi.princeton.edu/wp-content/u...

look for evolution of the spectrum as the key to stability

01.03.2026 20:24 👍 1 🔁 0 💬 1 📌 0

yes, you are right about that

01.03.2026 18:26 👍 0 🔁 0 💬 0 📌 0
Preview
Jackpot! Alignment as a Maximal Lottery Reinforcement Learning from Human Feedback (RLHF), the standard for aligning Large Language Models (LLMs) with human values, is known to fail to satisfy properties that are intuitively desirable, such...

they do a fancy regualisation of this problem to be exact, I would say
arxiv.org/abs/2501.19266

so must be applicable, for sure...

01.03.2026 18:13 👍 1 🔁 0 💬 0 📌 0

technically, might be better because linear programming is extremely well-studied and its connections with robust control/optimisation

however, all i have ever seen modelled was like in matlab toolboxes; interesting how we're ready to solve the problems efficiently =)

01.03.2026 17:30 👍 0 🔁 0 💬 1 📌 0

@sharky6000.bsky.social probably saw that because you were cited but robust lotteries with (semi-defined) linear programming rather than sigmoidal optimisation

I didn't judge the paper, but maybe for you it'd be worthy to give it a look arxiv.org/abs/2602.21297

01.03.2026 16:56 👍 1 🔁 0 💬 2 📌 0