AI Agents — Latest News and Frameworks

Coverage of autonomous AI agents, frameworks, and the shift toward agentic AI workflows.

The Latest News About AI Agents

Claude Code

Git Identity Spoof Lets Claude Code Approve Bad Code

Anthropic's Claude has approved malicious code in a spoofed Git identity test, showing how weak GitHub Actions trust rules can create a security risk.
Claude Design

Anthropic Launches Claude Design to Challenge Figma, Canva

Anthropic has launched Claude Design, a conversational prompt-to-prototype tool on Claude Opus 4.7 that targets Figma and deepens its Canva partnership.
Image

OpenAI Adds Sandboxing to Agents SDK for Safer AI Agent Deployment

OpenAI has added native sandboxing and a harness to its Agents SDK, partnering with Cloudflare, Vercel, E2B, and Modal for container-based agent isolation.
Visual Studio Code VS Code

VS Code 1.116 Ships Built-in Copilot, Agent Debug Tools

Microsoft has embedded GitHub Copilot as a default VS Code extension in version 1.116, adding agent debug logging, terminal upgrades, and inline diffs.
Anthropic Claude

Anthropic Releases Claude Opus 4.7, Tightens API Rules

Anthropic has released Claude Opus 4.7 with a 1M-token context window, 128k output, and API changes that force migration work for existing developer teams.
Gemini Robotics-ER 1.6

DeepMind’s Gemini Robotics-ER 1.6 Lets Spot Read Gauges

Google DeepMind has released Gemini Robotics-ER 1.6, an embodied reasoning model enabling Boston Dynamics' Spot to autonomously read industrial gauges.
Github AgentHQ Claude Codex

AI Agents From Anthropic, Google, Microsoft Can Steal GitHub Credentials

A Johns Hopkins researcher has shown that AI coding agents from Anthropic, Google, and Microsoft can be hijacked to steal credentials from GitHub repos.
Claude Code

Claude Code Adds Cloud Routines for Scheduled AI Tasks

Anthropic has launched routines for Claude Code, enabling cloud-hosted scheduled automations via schedule, API, and GitHub webhook triggers.
Duolingo

Duolingo Drops AI from Performance Reviews as Rivals Push Mandates

Duolingo CEO Luis von Ahn has reversed AI tracking in employee performance reviews after internal pushback, even as Meta, Nvidia, and McKinsey double down.
Copilot-Pro-Copilot-For-Microsoft-365

Microsoft Taps OpenClaw Playbook for New Copilot AI Agents

Microsoft has created a team under VP Omar Shahine to build persistent, OpenClaw-inspired AI agents for Copilot as Anthropic embeds Claude across Office.
Minimax MMX-CLI

MiniMax Launches MMX-CLI With Multimodal Powers For AI Agents

MiniMax has released MMX-CLI, an open-source command-line tool giving AI agents access to seven generative modalities including text, image, video, and speech.
Mark Zuckerberg 2025

Meta Builds AI Clone of Zuckerberg for Employees

Meta has reportedly begun building an AI clone of Mark Zuckerberg, trained on his image and voice, to interact with employees and power a creator platform.
Google Search AI Mode bookings restaurants

Google AI Mode Rolls Out Restaurant Booking to 8 Countries

Google has expanded AI Mode's restaurant booking to eight new countries and has redesigned the mobile interface with a Gemini 3 model switcher and bottom sheet.
Image

Google Bets on Intel CPUs for AI Data Centers

Google has committed to multiple generations of Intel Xeon 6 processors for AI workloads, deepening a partnership as CPU demand surges across the industry.
Anthropic Claude

Anthropic Launches Claude Managed Agents for Enterprise AI

Anthropic has launched Claude Managed Agents, a cloud service that handles sandboxing, orchestration, and governance for enterprise AI agent deployment.
Image

AI Coding: GitHub Hit by Outages as AI Agents Flood Platform

GitHub has logged five incidents in two days as AI coding agents overwhelm its infrastructure, while Meta's token leaderboard fuels the surge.
Z.ai GLM-5

Z.ai Releases GLM-5.1: 754B Model Tops SWE-Bench Pro

Chinese AI lab Z.ai has released GLM-5.1, a 754B-parameter open-weights model that tops SWE-Bench Pro and sustains over 8 hours of autonomous execution.
Image

Google Open-Sources Scion Agent Orchestration Testbed

Google has open-sourced Scion, an experimental testbed that orchestrates multiple AI coding agents as isolated processes with Claude Code, Gemini CLI, and Codex.
OpenAI Codex app Windows

AI Coding: OpenAI Switches Codex to Pay-as-You-Go, Cuts Seat Cost to $20

OpenAI has replaced fixed per-seat Codex licenses with pay-as-you-go token billing for ChatGPT Business and Enterprise, cutting the base price to $20.
Claude Computer Use Claude Code Claude Cowork

Claude Cowork and Claude Code Can Now Control Your Windows Desktop

Anthropic has expanded Claude's desktop control to Windows in Cowork and Claude Code, adding a Dispatch feature that lets users assign tasks from their phone.
Anthropic Claude

Anthropic’s Claude AI Writes Full FreeBSD Kernel Exploit in Four Hours

Anthropic's Claude AI has autonomously developed two working remote root exploits for FreeBSD kernel flaw CVE-2026-4747 in four hours of compute.
Claude Code

Claude Code Source Leak Exposes Anti-Distillation Traps

Anthropic's Claude Code source has leaked via a packaging error, exposing anti-distillation traps, an undercover mode, and scaffolding for an unreleased agent.
GitHub Copilot

GitHub Copilot Injected Ads Into 1.5 Million Pull Requests

GitHub Copilot has injected promotional messages into over 1.5 million pull requests, prompting GitHub to disable the feature amid developer backlash.
Claude Code

Anthropic Leaks Claude Code Source via Npm Source Map

Anthropic has accidentally exposed Claude Code's full 512,000-line TypeScript source via an npm source map, revealing unreleased AI agent features.
Image

Microsoft Copilot Cowork Combines AI from Anthropic and OpenAI in One Tool

Microsoft has launched Copilot Cowork using Anthropic's Claude for agentic workflows, plus a Critique feature where Claude reviews GPT research for accuracy.
Meta-AI-Gen-3D

AI Agents: Meta’s DGM-Hyperagents Speed Up Self-Improvement with Additional Optimization Layer

Meta has published research on DGM-Hyperagents, AI systems that edit their own improvement process and transfer learned strategies across unrelated domains.
Codex Plugins

OpenAI Launches Plugin Marketplace for Codex with Enterprise Controls

OpenAI has launched a plugin marketplace for Codex with over 20 integrations from Slack, Figma, and Notion, adding enterprise governance controls.
Image

AI Coding: Google Deploys “Agent Smith” to Employees

Google has deployed an internal AI agent called Agent Smith that lets employees code from their phones, with demand so high the company had to throttle access.
ARC AGI 3

ARC-AGI-3 Offers $2M for AI Matching Human Reasoning

ARC Prize Foundation has launched ARC-AGI-3, an interactive benchmark offering over $2M to any AI matching human reasoning, where top models scored below 1%.
Sierra AI Agent Builder

Agent-Building Agent: Sierra Launches AI Customer Service Agents Builder

Sierra has unveiled Ghostwriter, a self-service tool letting businesses build AI customer service agents in plain English, without engineering teams.

Recent News