Interesting
www.axios.com/2026/03/11/a...
I am glad oil prices are going up, at least for now, because it breaks the myth that Trump improved the economy. But nope, mining the strait would be terrible, hurting Iran's foes and friends alike.
What's up with all those news outlets now using Polymarket as a substitute for proper analysis or as a supporting argument?
The point isn't so much this question as the "Oh wait" when it already had the right answer. Models with good reasoning don't do that, but doing so will boost benchmark scores for the 5% of cases where the model would have made a mistake.
A bit off topic, I agree. But I am not sure the Qwen model is actually intelligent.
Those benchmarks miss the mark sometimes. I asked the model how many R's are in strawberry, and it had to triple-check its answer and doubted itself when there was no need. Not sure it would even have finished.
JUST IN: Iran indicated it had chosen Ayatollah Ali Khamenei's son Mojtaba as his successor reut.rs/4cB2AIQ
Israel/Netanyahu have clearly acted in ways contrary to US interests, like attacking Qatar, or even supporting/tolerating the killing of an American citizen in the West Bank.
Also, Marco Rubio said the US attacked because Israel was going to attack anyway, and the US would have been hit.
MYTH: As commander-in-chief, Trump has the authority to take military action. FACT: It is unconstitutional for a U.S. president to declare war without the approval of the Knesset.
Trump's War On Iran: Myth Vs. Fact https://theonion.com/trumps-war-on-iran-myth-vs-fact/
(Not a quote) Any conservative still supporting ICE tactics isn't conservative, just a MAGA fanatic or unable to see the truth. The Trump administration is transforming ICE into an unofficial military that intimidates Americans. Good luck to everyone in the US.
Reid said her life, too, has grown smaller in the months since her arrest, detention and the government tweet of her name, photo and alleged crime. She laments curtailing her protests out of fear. "That is our right as U.S. citizens," she said in an interview. "Now, that's being stifled."
Others say the time spent on flimsy cases takes them away from prosecuting drug cases, public corruption and gun-related crimes.
"Federal prosecutors in cities with high-profile immigration operations said they have been pressured by Justice Department leaders to aggressively pursue assault charges, even in cases undermined by contradictory evidence or ones that fail to appear worthy of prosecution."
www.euractiv.com/news/intelli...
Intelligence assessment: Casualties and confusion grip Iran's internal security forces
=========================================================================================
Browser  | Passed    | Total     | Current (%) | 5y ago (%) (Abs) | Delta
Info: 'Absolute' is an approximation. Historical browsers weren't tested on newer tests they might have supported.
Baseline: Today's total tests (2,160,994).
-----------------------------------------------------------------------------------------
Note: Ladybird, Flow, Servo comparison date clipped to project start baseline.
Chrome   | 2,109,653 | 2,160,994 | 97.62%      | 79.30%           | +18.32%
  Trend range: 79.3% - 97.6%
Firefox  | 2,055,767 | 2,137,161 | 96.19%      | 77.19%           | +19.00%
  Trend range: 77.2% - 95.1%
Safari   | 2,045,539 | 2,145,709 | 95.33%      | 75.51%           | +19.82%
  Trend range: 75.5% - 94.7%
Ladybird | 1,978,533 | 2,126,395 | 93.05%      | 42.30%*          | +50.75% (Since Aug 2024)
  Trend range: 42.3% - 91.6%
Flow     | 1,936,347 | 2,121,243 | 91.28%      | 70.92%*          | +20.36% (Since Dec 2022)
  Trend range: 70.9% - 89.6%
Servo    | 1,864,740 | 2,094,152 | 89.05%      | 58.89%*          | +30.16% (Since Jan 2022)
If you want to try it out, just copy the code and paste it into a text editor. Save it with a filename ending in .py. Then use it as a CLI program.
Or if you just want to see what it looks like.
Commentary: Anyone Else Have Those Weird Dreams Where Sobbing Future Generations Beg You To Change Course?
It's really nice to have this iterative process: you start small and iterate again as possibilities expand. OpenCode works well, and the cost tracker is nice.
I just vibecoded a data visualiser for wpt.fyi with OpenCode and Gemini 3 Flash. It worked quite well. People imagine you prompt it once and you are done, but it's more like being a manager telling an intern what the project will look like.
Does the EU have a plan to fund Ukraine? Politico now says Ukraine will run out of funds by the very end of March. www.politico.eu/article/eu-t...
From this source, a new government might not be formed until May
helpers.hu/hungarian-ci...
What's the plan here ?
How did it generate a benchmark on reviewing academic papers, though?
If it generated it itself, I am a bit lost on why the same model couldn't then pass the test.
Looks nice overall, but hopefully this goes open or gets copied in the open, because Claude models are way too expensive.
The single biggest thing holding back small Qwen models (including 3.5) is the self-doubt, the "wait, ..." and the triple-checking (or more).
It might increase scores in benchmarks, but it's very inconvenient/debilitating.
*proxied
Maybe contact opensource.google ?
It's sad honestly. This seems bad.
Is it because it's proxies ?
*Cornyn
Makes sense. Do we know yet why Coryn performed so well (relative to expectations)?
It only has 8GB of RAM. Definitely fine for education, but for serious work it's a tough pill to swallow. Many phones have more RAM nowadays.
the state department is now using GPT-4.1 due to the SCR designation
imo that's what Anthropic's value is: they produce GPT-5 class models that you can prompt like GPT-4
sure, GPT-5.3 can beat Opus, if you're up for a fight
www.reuters.com/business/us-...
This illustrates that in some cases, the EU actually increases state sovereignty. Spain would never dare if its trade weren't managed by the EU.