It did quite well! I appended the transcript to the previously shared Google Doc. The Flash model "cheats" a bit by relying more on Google Search results, but the Pro model did a very good job of breaking down the solution into considering primordial helium and ongoing fusion-produced helium.
25.04.2025 23:03
👍 1
🔁 0
💬 1
📌 0
It's worth retrying with the "reasoning" models. I haven't paid for the advanced OpenAI models, but I tried with Gemini 2.5 Flash and 2.5 Pro and both got it right on the first time and on two subsequent prompts to reevaluate. These latest reasoning models are much better at resisting hallucination!
25.04.2025 21:50
👍 1
🔁 0
💬 1
📌 0