Bip America News

collapse
Home / Daily News Analysis / The US government’s Anthropic models ban was never about an AI jailbreak

The US government’s Anthropic models ban was never about an AI jailbreak

Jun 29, 2026  Twila Rosenbaum 29 views
The US government’s Anthropic models ban was never about an AI jailbreak

The US government’s enforcement letter to Anthropic, which effectively forced the company to pull its latest AI models offline just before the weekend, should be a wake-up call for any US tech company — AI lab or otherwise.

On Friday afternoon, the US Commerce Department sent Anthropic a letter invoking an obscure export control directive that banned non-Americans, including Anthropic’s employees, from accessing the company’s latest models, Fable 5 and Mythos 5, citing an unspecified national security concern. Anthropic said it believes the letter is related to a bypass of the model’s guardrails, but is not sure because the letter lacks specific details. The letter has not been made public.

In response, Anthropic shut down both of its top models to all customers to ensure compliance with the directive. The result was that the US government successfully forced a tech company to pull its models offline with a swift and unilateral action that did not appear to require court approval. This intervention by the Trump administration demonstrates that the AI industry is not immune to government interference. It also serves as a warning to the wider tech industry: comply, or we can shut you and your products down.

The background of the directive

Export control directives are typically used to prevent sensitive technology from falling into the hands of foreign adversaries. The obscure provision invoked here appears to have been designed for physical goods or software that could directly contribute to weapons development. Applying it to a language model that can assist with cybersecurity tasks is a significant departure from past practice. Past administrations have made sweeping decisions on knowledge gaps. For instance, language used by the US government during the 2010s to fix export law covering cybersecurity tools that could also be used for cyberattacks was so broad that inadvertently, it nearly outlawed legitimate security and vulnerability research.

However, the Trump administration’s directive appears retaliatory. Citing sources, Axios described a tense situation over the weekend between the two major players, saying that the “personality differences” between Anthropic and the Trump administration led to the export directive, rather than a technical issue with the AI products. New details about the issue that emerged over the weekend cast further doubt on the government’s reasoning.

What really happened with the guardrails?

Katie Moussouris, a cybersecurity veteran and researcher who founded Luta Security, said in a blog post that Anthropic recently shared with her a private copy of a paper written by security researchers describing an alleged guardrail bypass in Fable 5. (The Wall Street Journal reports that the paper’s authors are security researchers at Amazon.) Moussouris said that Anthropic reached out to ask for her take on the paper.

Moussouris’ blog post described how the researchers triggered the guardrail bypass, but said that the bypass itself “should never have triggered an export control.” The difference is largely between asking an AI model to “review code for security issues” versus asking it to “fix this code.” The end result is largely the same, even if the questions are posed slightly differently. “The behavior described in the paper cannot meaningfully be fixed, and any attempt would only weaken the model for defense,” said Moussouris, who criticized the export control directive as hasty, heavy-handed, and misguided. Moussouris and dozens of other top security researchers and experts have since called on the Trump administration to revoke the export control order, calling the move to pull advanced cybersecurity capabilities from network defenders in the US as “dangerous.”

The core issue hinges on how AI models can be used for both offensive and defensive cybersecurity. The paper’s alleged bypass involves prompting the model to produce code that could be harmful, but the model’s guardrails are designed to prevent exactly that. The researchers demonstrated a way to circumvent those guardrails, but Moussouris argues that this is not a vulnerability that can be fixed by blocking access. Instead, it highlights a fundamental challenge in AI safety: models that can understand code can also be misused. The solution lies in better prompting and usage policies, not export controls that cripple legitimate research and defense.

The political dimension

Justin Hendrix, the editor of Tech Policy Press, said the Trump administration’s move is “likely to raise alarms in foreign capitals about the reliability of American AI for critical applications.” The message is that AI companies in the United States cannot be trusted to operate without interference from the US government. The Trump administration has not confirmed why it invoked its export control directive. Did officials misread the report and overreact? Did Amazon CEO Andy Jassy say something to senior government officials that prompted the reaction, out of caution or spite? Was something lost in translation, or was this a way to pressure Anthropic, with whom the administration already has a fractious relationship? It is possible that the White House was unaware of the far-reaching consequences of the letter’s demand and officials are scrambling to undo the damage of their own making.

To quote Hendrix, “the climate is one of a cloud of suspicion that senior officials are picking favorites based on personal and political factors.” The aftermath is that the government has set a dangerous precedent about how much control it intends to wield over the release of American-made software. This time the government took issue with Anthropic; tomorrow it could be with anyone else.

Implications for the AI industry

This incident represents a landmark moment in the relationship between AI developers and the US government. It demonstrates that the government is willing to use obscure regulatory tools to intervene in the release of AI models, potentially chilling innovation and driving companies to self-censor or delay launches out of fear of similar actions. For Anthropic, which has positioned itself as a responsible AI company focused on safety, the irony is stark: a company that voluntarily restricts its own models to prevent misuse is now being punished for a perceived security flaw that experts say is not a real vulnerability.

The move also has international repercussions. Other countries may view this as evidence that US AI products cannot be relied upon because they can be arbitrarily shut down by Washington. This could accelerate efforts to develop domestic AI alternatives in Europe, China, and elsewhere, fragmenting the global AI market. Furthermore, the lack of transparency in the government’s actions undermines trust. If companies cannot understand why they are being targeted, they cannot take corrective action. The situation calls for clear guidelines on when export controls apply to AI models, and for a process that involves technical experts rather than political considerations.

The broader lesson is that the AI industry must engage more actively with policymakers to develop sensible regulations that protect national security without stifling innovation. The current ad-hoc approach, driven by personality conflicts and opaque directives, is harmful to all parties. The Anthropic case should serve as a catalyst for a more structured dialogue. Until then, every AI company operates under the shadow of a government that can, at any moment, pull the plug on their products based on a misunderstanding or political vendetta.


Source:TechCrunch News


Share:

Your experience on this site will be improved by allowing cookies Cookie Policy