Anthropic Races to Contain Leak of Code Behind Claude AI Agent

What to know as Anthropic tries to contain leaked Claude AI code
What to know as Anthropic tries to contain leaked Claude AI code
Scroll back up to restore default view.

Anthropic is racing to contain the fallout after accidentally exposing the underlying instructions it uses to direct Claude Code, the popular artificial-intelligence agent app that has won the company an edge with developers and businesses.

By Wednesday morning, Anthropic representatives had used a copyright takedown request to force the removal of more than 8,000 copies and adaptations of the raw Claude Code instructions—known as source code—that developers had shared on programming platform GitHub.

Most Read from The Wall Street Journal

The leak of “some internal source code” didn’t expose any customer information or data, a spokesman for Anthropic said. Nor did it divulge the valuable inner mathematics—sometimes called weights—of the company’s expensive and powerful AI models.

“This was a release packaging issue caused by human error, not a security breach. We’re rolling out measures to prevent this from happening again,” the spokesman said.

But the leak did reveal commercially sensitive information, including Anthropic’s proprietary techniques, tools and instructions for cajoling its AI models to work as coding agents. Those techniques and tools are called a harness because they are what allow users to control and direct those models, much like a harness allows a rider to guide a horse.

The result is that Anthropic’s competitors and legions of startups and developers now have a detailed road map to clone Claude Code’s features without needing to reverse engineer them—something that is already common in the cutthroat AI race.

The leak also gives hackers a large amount of new information to probe for bugs they could use to exploit the Claude Code software, or manipulate its Claude AI model into helping with their cyberattacks, creating risks for Anthropic and the developers who use its tools.

The leak is a blow for Anthropic because it risks both undermining its reputation for safety and also revealing valuable trade secrets in the pitched battle for enterprise customers. Anthropic has been riding a wave of growing use because of the viral popularity of Claude Code, helping it close a new round of funding that values the company at $380 billion ahead of a possible public offering this year.



  • Anthropic lost the Pentagon but won over America

    Subscribe to The Post Most newsletter for the most important and interesting stories from The Washington Post.

    Being declared a threat to national security can have a silver lining.

    After the Pentagon blacklisted artificial intelligence start-up Anthropic last week in a dispute over how its Claude chatbot could be used in war, an American public largely unaware of the company raced to download its app, paid for Claude subscriptions and praised Anthropic in online reviews and posts. Many technology workers, including at competing AI firms, took Anthropic’s side in its clash with the Defense Department.

    That surge in popularity followed a three-month ascent that had already seen Claude shift from respected but obscure AI geek to the coolest of chatbot kids. While ChatGPT owner OpenAI remains by far the most well known and most valuable AI start-up, Anthropic has steadily won over people who matter in Silicon Valley, on Wall Street and among the general public.

    Software programmers have been so impressed by Claude’s coding skills that they are penning laments about their looming irrelevance. Anthropic’s announcements of new features can move the entire stock market as some investors look to the company for clues about the trajectory of AI.

    That Anthropic could vault from relative obscurity to the vibe king of Silicon Valley shows there is still a fierce contest to shape the direction of AI. It’s also a reminder that even in a field claiming to build the ultimate rational thinkers, winners and losers are determined as much by reputation and taste as by technical superiority.

    Bradley Tusk, a start-up investor and political strategist, said the growing buzz around Anthropic and the perception that it took a principled stand against the Pentagon adds up to a hot streak. “If you can create a certain perception that is positive at least for a while, you can ride that wave,” he said.

    Anthropic was founded in 2021 by defectors from OpenAI. In the AI industry’s factionalism, the company aligned itself with the “AI safety” crowd, emphasizing the need to prevent future, powerful AI from acting against the interests of humans. Anthropic also steered its Claude chatbot for use largely by technology obsessives, businesses and governments.

    Public awareness of Anthropic and its chatbot was vanishingly low until recently. Market intelligence firm Sensor Tower said that in late January, Claude languished at No. 124 on the ranking of most-downloaded iPhone apps in the United States. One pollster said that when he asked people in November about their views of different tech firms, a fictional tech company scored about as well as Anthropic.


  • Anthropic's Claude becomes most popular free app on iPhone and Android amid fallout from Trump fight

    Anthropic's Claude became the most downloaded free app on both Apple's App Store and Google's Play Store on Tuesday, the company announced, following its showdown with the Department of Defense (DOD) last week.

    Anthropic (ANTH.PVT) and the DOD have been sparring over the company's insistence that the agency not use its Claude models for the mass surveillance of Americans or to develop fully autonomous weapons. On Friday, President Trump ordered all of the federal government to phase out Anthropic's technology over the next six months.

    Defense Secretary Pete Hegseth then said he is labeling the company a supply chain threat.

    That could force firms that work with the government and use Anthropic's models to stop using the company's technology, though Anthropic says that only applies to those firms' government work and not to their private businesses.

    But the blowup has seemingly helped Anthropic's public relations, with the company saying that Monday was its largest single day for sign-ups ever.

    It then leaned into the goodwill, adding memory to the free version of Claude, allowing users to save and build on prior interactions with the AI bot.

    Shanghai,China-May 6th 2024: OpenAI ChatGPT, Anthropic Claude and Perplexity mobile APP icons on screen. Assorted AI chatbots
    OpenAI ChatGPT, Anthropic Claude, and Perplexity mobile app icons on a screen. · Robert Way via Getty Images

    Anthropic's rival OpenAI (OPAI.PVT), meanwhile, said Friday that it entered into an agreement with the DOD that will allow the agency to use its models, while keeping the same guardrails Anthropic fought for.

    But critics said the language had loopholes that would allow the DOD to spy on Americans.

    OpenAI responded to the concerns on Monday, saying it updated the language in its agreement with the DOD.

    "This language makes explicit that our tools will not be used to conduct domestic surveillance of US persons, including through the procurement or use of commercially acquired personal or identifiable information," the company said in a statement.

    "The Department also affirmed that our services will not be used by Department of War intelligence agencies like the NSA. Any services to those agencies would require a new agreement."

    OpenAI CEO Sam Altman also took to X to discuss the controversy, receiving thousands of comments from users.

    Got a tip? Email Daniel Howley at dhowley@yahoofinance.com. Follow him on X at @DanielHowley.

    Click here for the latest technology news that will impact the stock market

    Read the latest financial and business news from Yahoo Finance

    Sign up for Yahoo Finance's Week in Tech newsletter.
    Sign up for Yahoo Finance's Week in Tech newsletter. · yahoofinance

  • Anthropic is having a huge 2026. It's only March

    Quartz · Anthropic CEO Dario Amodei (Krisztian Bocsi/Bloomberg via Getty Images)

    Anthropic spent years being the responsible AI company. In 2026, it became the most disruptive one.

    The same tools it developed under strict safety guidelines are now destabilizing enterprise software, reshaping how engineers work, and putting it at the center of a full-blown Pentagon standoff.

    The AI industry's main sport for the last few years has been benchmarks. Who scores highest, whose context window is longest, whose demo lands hardest at a conference. Anthropic played that game, too.

    It also made a sustained push into a specific market: software engineers, highly paid and spending their days doing exactly the kind of work AI was getting good at. That focus is now paying off in a way that's changed the conversation about who's actually winning this race.

    Claude Code had a ChatGPT moment, but for engineers. Engineers started shipping software at speeds that felt almost physically impossible. Anthropic executives have been vocal about what that means for the profession. At Davos, CEO Dario Amodei predicted AI could handle most or all of software engineering work, end to end, within six to 12 months. Claude Code's creator declared the job title itself might soon disappear.

    Anthropic's own hiring tells a more complicated story. The company's open software engineering roles have climbed 170% since January 2025, according to one tracker, with the curve accelerating.

    What's harder to dismiss is the outside evidence about the vibe shift on vibe coding. Paul Ford, a technologist and longtime software industry observer, wrote in The New York Times that something changed in November.

    Before then, AI coding tools were useful but halting. After, he was finishing projects that had sat in folders for a decade, on his subway commute. His friends were noticing the same thing. The software engineer corner of the internet lit up with similar accounts. "I am less valuable than I used to be," Ford wrote.

    When Anthropic published a blog post late last month claiming Claude Code could translate legacy COBOL into modern languages, IBM lost roughly $40 billion in market cap in a single session. The broader sell-off wiped more than a trillion dollars from Big Tech valuations. Legal software stocks dropped. Design stocks dropped. Analysts pointed out that mainframe modernization involves far more than converting code, and IBM's technical moat runs deep. Nvidia CEO Jensen Huang called the panic “illogical.” Franklin Templeton's CEO told the Financial Times it looked like a genuine long-term threat to enterprise software's business model.


  • Anthropic’s Claude is suddenly the most popular iPhone app following Pentagon feud

    Anthropic's AI assistant, Claude, surged to number one on Apple's App Store after the Pentagon deemed the AI firm a “supply chain risk” - Philip Dulian/dpa/picture-alliance/AP
    Anthropic's AI assistant, Claude, surged to number one on Apple's App Store after the Pentagon deemed the AI firm a “supply chain risk” - Philip Dulian/dpa/picture-alliance/AP

    The AI company Anthropic was likely not in the public lexicon just a month ago. But it is now.

    The AI company has long been known within tech circles for its popular Claude AI assistant and coding tools. But a whirlwind sequence of events in February thrust the company into the public eye more than ever. Case in point: Anthropic’s Claude app dethroned OpenAI’s ChatGPT to claim the top spot in Apple’s iPhone App Store over the weekend.

    The increased visibility comes as the company has been locked in a high-stakes public feud with the Pentagon over its safety guardrails. But even in the weeks before, updates to its Claude tool stoked concerns about AI’s impact on the software industry, sending shockwaves through Wall Street. And in early February, Anthropic took its most pointed and public jab at its chief rival, OpenAI, during the Super Bowl, one of the most-watched events on television.

    It’s shaping up to be a formative time for the company, but its Claude assistant still has a long way to go to catch up to ChatGPT in public popularity.

    Claude’s rise

    In early February, Claude was around number 42 on the Apple App Store’s list of the most popular free iPhone apps. It surged to number one by Saturday, the day after the Pentagon deemed Anthropic a “supply chain risk” and President Donald Trump ordered the federal government to stop using the company’s products. ChatGPT is currently the top free app in the Google Play app store for Android devices, although Claude isn’t far behind in fourth place.

    Some showed support for Claude in the real world as well: Encouraging messages like “Thank you” covered the sidewalks outside of Anthropic’s San Francisco offices in recent days, according to images employees posted on social media.

    Other people sent OpenAI, which announced a deal with the Pentagon late Friday, a much different message. Phrases such as “do the right thing” and “please stand up for civil liberties” were scribbled on the sidewalk outside its offices, according to San Francisco news site Mission Local.

    Google searches for “Anthropic” are the highest they have been since the company’s founding, according to Google search trends.

    As its battle with the Pentagon ramped up last week, the company hit “an all time record for Claude sign-ups,” an Anthropic spokesperson told CNN on Monday.

    Anthropic is leaning into that attention boost with a new feature unveiled Friday that simplifies importing history from other AI chatbots into Claude, the company said. It also added the ability for Claude to remember context across conversations to its free tier on Monday, rather than restricting it to paid users.


  • Anthropic’s AI Hacked the Firefox Browser. It Found a Lot of Bugs.

    Anthropic’s Claude found over 100 bugs in the Firefox browser during a two-week scan.
    Anthropic’s Claude found over 100 bugs in the Firefox browser during a two-week scan. - Gabby Jones/Bloomberg News

    It took Anthropic’s most advanced artificial-intelligence model about 20 minutes to find its first Firefox browser bug during an internal test of its hacking prowess.

    The Anthropic team submitted it, and Firefox’s developers quickly wrote back: This bug was serious. Could they get on a call?

    Most Read from The Wall Street Journal

    “What else do you have? Send us more,” said Brian Grinstead, an engineer with Mozilla, Firefox’s parent organization.

    Anthropic did. Over a two-week period in January, Claude Opus 4.6 found more high-severity bugs in Firefox than the rest of the world typically reports in two months, Mozilla said.

    Tools powered by AI are increasingly adept at spotting vulnerabilities and are beginning to rival the talents of seasoned security experts. Some experts worry that those same capabilities will unleash a new wave of cyberattacks as bugs are discovered and then exploited more quickly than ever before.

    Claude’s bug bonanza began after Anthropic’s security team decided that it would be interesting to focus its software on a widely used and complex piece of browser software that has been under the microscope for years.

    Firefox is the modern version of the Web’s first commercial browser, Netscape Navigator. Its code is now managed under the umbrella of the not-for-profit Mozilla Foundation. Navigator launched its first bug bounty program more than 30 years ago, offering cash to those who identify potential weaknesses that bad actors could abuse. Mozilla typically pays as much as $6,000 for high-severity bugs.

    In the two weeks it was scanning, Claude discovered more than 100 bugs in total, 14 of which were considered “high severity.” That means that if the right “exploit code” had been created, they could have been used in a widespread attack on Firefox’s users.

    Last year, Firefox patched 73 bugs that it rated as either high severity or critical.

    AI tools are both a blessing and a curse for software developers. In January, the makers of Curl software abandoned their own bug bounty program, citing “an explosion in AI slop reports.” Fewer than one in 20 bugs reported in 2025 were actually real, said Daniel Stenberg, Curl’s lead developer

    “The AI chatbots still easily hallucinate security problems,” Stenberg said. “But at the same time, there are quite capable AI-powered code analyzers that find real things,” he said.



Anthropic Says Working on Fixing Claude AI Chatbot Disruption