Benchmarks have never been less useful for telling us which models are best.
They are good for giving a general sense of the landscape. They definitely paint a picture. But if you’re comparing top models, like GPT-5.4 against Opus 4.6 against Gemini 3.1 Pro, you have to use the models, talk to the models, get reports from those who have, and form a gestalt. The reports will contradict each other, and you have to work through that. There’s no other way.
Thus, I try to gather and sort a reasonably comprehensive set of reactions, so you can browse whichever sections make you most curious.