The Linux Foundation launched Akrites, a coordinated effort to harden critical open source software as AI-assisted vulnerability discovery accelerates. Backed by AWS, Anthropic, Google, IBM, Microsoft and GitHub, NVIDIA, OpenAI, Red Hat and others, Akrites creates a shared Security Incident Response Team and a standardized Coordinated Vulnerability Disclosure process so maintainers can receive tested fixes upstream before flaws are exploited.
Your Daily AI Briefing
AI News Today
Looking for today's AI news? This is a fast, no-noise feed of the most important developments in artificial intelligence — covering new AI models, AI agents, research, chips and hardware, security, and how enterprises are putting AI to work. Filter by topic, search the headlines, and click through to the original source for the full story.
The latest dispatches, as of Jun 26, 2026
OpenAI says Codex agents are transforming long-horizon knowledge work
OpenAI published an Economic Research paper on how agentic AI is shifting work from short chatbot interactions to delegated, long-horizon tasks. The company said Codex is now the primary AI tool across every OpenAI department, accounts for more than 85% of output tokens for the average worker, and is seeing rapid adoption from non-developers using agents for automation, analysis, debugging and cross-functional execution.
Mistral adds governed connectors for enterprise AI agents and Vibe Code
Mistral AI introduced new connector controls for production AI agents, including workspace and tool-level admin permissions, connector-scoped API keys, multi-account connectors, a public-preview Connectors Debugger, governed connectors in Vibe Code, and workflow connectors for long-running jobs. The connector directory now covers more than 60 integrations across data, communication, developer, automation and research tools.
OpenAI and Broadcom unveil Jalapeno LLM inference chip
OpenAI and Broadcom unveiled Jalapeno, OpenAI's first Intelligence Processor and the first accelerator in a multi-generation LLM inference platform. OpenAI said engineering samples are running ML workloads in the lab, early testing shows substantially better performance per watt than current state-of-the-art systems, and deployment is planned at gigawatt scale with data center partners beginning by the end of 2026.
Anthropic launches Claude Tag beta to bring team AI agents into Slack
Anthropic introduced Claude Tag, a beta for Claude Enterprise and Team customers that lets teams tag @Claude in Slack channels and delegate tasks across connected tools, data and codebases. Claude Tag builds shared channel context, can work asynchronously, supports scoped permissions and spend controls, and replaces the existing Claude in Slack app for organizations that opt in.
Mistral OCR 4 upgrades document intelligence for enterprise RAG
Mistral AI released OCR 4, a document intelligence model that extracts text alongside bounding boxes, block classifications and inline confidence scores. The model supports 170 languages across 10 language groups, accepts enterprise formats such as PDF, DOC, PPT and OpenDocument, can run self-hosted in a single container, and feeds structured content into RAG, enterprise search and agentic document workflows.
NVIDIA BioNeMo Agent Toolkit gives life-science agents scientific tools
NVIDIA announced the BioNeMo Agent Toolkit, an agent-ready stack for life sciences workflows spanning biology, chemistry, genomics and drug discovery. The toolkit combines BioNeMo, NIM microservices, Parabricks, NeMo, Nemotron, NemoClaw and OpenShell so agents can call scientific tools, run computational experiments and support tasks such as virtual screening, protein binder design and genomic analysis.
NVIDIA Halos for Robotics brings full-stack safety to physical AI
NVIDIA announced Halos for Robotics, a full-stack safety system for robots and physical AI that spans IGX Thor compute, Holoscan Sensor Bridge sensor connectivity, the Halos OS software stack, and an ANAB-accredited AI Systems Inspection Lab. Agility is the first partner using Halos elements for industrial humanoids working in factories, warehouses, and logistics operations.
NVIDIA Vera Rubin targets exascale AI supercomputing for science
NVIDIA said its Vera Rubin platform is coming to scientific supercomputing with systems that combine Rubin GPUs, Vera CPUs, NVLink-C2C, ConnectX-9, and BlueField-4 in direct liquid-cooled racks. The company says a Vera Rubin supercomputing system can deliver more than 7 exaflops of AI for science, 5 petaflops of native FP64 performance, and extreme memory bandwidth with up to 144 GPUs.
Samsung Electronics rolls out ChatGPT Enterprise and Codex to global employees
OpenAI said Samsung Electronics is deploying ChatGPT Enterprise and Codex to all employees in Korea and all Device eXperience division employees worldwide, one of OpenAI's largest enterprise launches to date. Samsung plans to use the tools across software development, product development, manufacturing, marketing, corporate functions, and other operations, while OpenAI works with Samsung on secure adoption and employee enablement.
Google DeepMind publishes AI Control Roadmap for safer agents
Google DeepMind published its AI Control Roadmap and a policy framework called Three Layers of Agent Security, arguing that advanced internal agents need system-level safeguards in addition to model alignment. The roadmap treats agents as potential insider threats, maps mitigations to capabilities, and uses monitoring, prevention and response metrics after analyzing one million coding-agent trajectories.
OpenAI adds ChatGPT Enterprise analytics and spend controls for AI adoption
OpenAI introduced credit usage analytics and updated spend controls for ChatGPT Enterprise, giving admins a Global Admin Console view of ChatGPT and Codex credit consumption across users, products and models. Admins can now track usage trends, set workspace defaults, configure group limits, create individual overrides, and expose credit usage to employees so enterprise AI programs can scale with clearer cost governance.
OpenAI o3 Deep Research helps diagnose rare childhood diseases
OpenAI said researchers from Boston Children's Hospital, Harvard University, and OpenAI used o3 Deep Research to analyze de-identified clinical and genomic information from 376 previously unsolved pediatric rare-disease cases. After expert review, additional testing, and clinical confirmation, physicians established 18 diagnoses, a diagnostic yield of about 4.8% among those cases.
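As a quick check on that figure, diagnostic yield is confirmed diagnoses divided by cases analyzed. A minimal sketch using only the two counts in the summary; the function name `diagnostic_yield` is illustrative and not taken from the study:

```python
def diagnostic_yield(diagnoses: int, cases: int) -> float:
    """Return confirmed diagnoses as a percentage of cases analyzed."""
    if cases <= 0:
        raise ValueError("cases must be positive")
    return 100.0 * diagnoses / cases


# Counts reported in the summary: 18 diagnoses from 376 unsolved cases.
yield_pct = diagnostic_yield(18, 376)
print(f"{yield_pct:.1f}%")  # prints "4.8%"
```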
OpenAI says GPT-5.5 Instant improves ChatGPT health intelligence for free users
OpenAI said GPT-5.5 Instant now brings stronger health intelligence to free ChatGPT users, with gains in recognizing urgent-care situations, asking for missing context, explaining uncertainty and simplifying complex medical information. The company cited physician-led evaluations, more than 700,000 reviewed model responses, and a 71% drop over two months in production health responses flagged for factuality issues.
Anthropic opens Seoul office and signs Korea AI safety partnerships
Anthropic opened its Seoul office and announced Korean AI ecosystem partnerships, including an MOU with Korea's Ministry of Science and ICT on AI safety and cybersecurity. The company cited Claude deployments at NAVER, LG CNS, Hanwha Solutions, Samsung SDS, and Channel Corp, and said it will provide Claude access to up to 60 National AI Research Lab-affiliated researchers.
OpenAI launches LifeSciBench for real-world life-science AI evaluation
OpenAI introduced LifeSciBench, an expert-written and expert-reviewed benchmark for measuring whether AI systems can support realistic life-science research tasks rather than isolated biology questions. The benchmark includes 750 tasks across seven workflows and seven biological domains, with 79% requiring multiple reasoning steps and more than half requiring models to interpret or synthesize artifacts.
ACE Robotics' open Kairos world model tops embodied-AI benchmarks
ACE Robotics said its open-source Kairos world model ranked first among evaluated world models and vision-language-action systems across four global embodied-intelligence benchmarks — RoboTwin 2.0, LIBERO-Plus, WorldModelBench Robot and DreamGen — as of June 12. The company says Kairos leads on complex robotic manipulation, scene-level generalization, physical-world modeling and zero-shot transfer, and is openly available on GitHub, Hugging Face and ModelScope.
Kimchi Coding becomes first agent to offer MiniMax M3 open-weight model
Cast AI said its autonomous Kimchi Coding agent is the first to offer MiniMax M3, making it the default builder model in Kimchi's orchestration layer. Cast AI cited M3's 59% score on SWE-bench Pro and its MiniMax Sparse Attention architecture, which it says cuts per-token compute at one-million-token context to 1/20th of prior levels with 15x faster decoding. Access is rolling out via an Early Access program.
US government directive forces Anthropic to suspend Claude Fable 5 and Mythos 5
Anthropic launched Claude Fable 5 (a generally available, safety-tuned model) and the restricted Claude Mythos 5 on June 9, but said on June 12 it was suspending access to both after the US government issued an export control directive. Anthropic apologized for the disruption and said it was working to restore access; other Claude models such as Opus 4.8 remain available.
OpenAI backs EU code on AI-content transparency and provenance
OpenAI announced support for the European Commission's Code of Practice on Transparency of AI-Generated Content, an early step in implementing the EU AI Act. OpenAI pointed to its provenance work since 2024, including C2PA metadata in image tools and SynthID-style marking and detection, and said it will comply with the transparency requirements that apply to its products.
OpenAI to acquire Ona to run Codex agents in customer clouds
OpenAI said it will acquire Ona to bring secure cloud execution and orchestration into its Codex ecosystem, letting long-running agents operate inside an organization's own cloud while OpenAI provides the intelligence. The company says the deal expands Codex beyond a single device or session and is subject to customary closing conditions and regulatory approvals.
Google DeepMind releases open DiffusionGemma for 4x faster text generation
Google DeepMind released DiffusionGemma, an experimental open 26B-total / 3.8B-active mixture-of-experts model under Apache 2.0. The model uses text diffusion to generate 256-token blocks in parallel instead of one token at a time, which DeepMind says enables up to 4x faster generation on dedicated GPUs for local, speed-critical workflows.
Cohere open-sources North Mini Code, its first agentic coding model
Cohere launched North Mini Code under an Apache 2.0 license — a 30B-total / 3B-active mixture-of-experts model with a 256K context window aimed at code generation, agentic software engineering, and terminal tasks. Cohere says it is the first of a new generation of models and is available on Hugging Face, the Cohere API, Model Vault and OpenRouter, running on a single H100 at FP8.
Google ships Gemini 3.5 Live Translate for real-time speech in 70+ languages
Google launched Gemini 3.5 Live Translate, an audio model that delivers near real-time speech-to-speech translation across more than 70 languages while preserving the speaker's intonation, pacing and pitch. It is rolling out to developers via the Gemini Live API and AI Studio, to enterprises in Google Meet, and to everyone through the Google Translate app on Android and iOS.
NVIDIA releases open 550B Nemotron 3 Ultra for long-running agents
NVIDIA released Nemotron 3 Ultra, a fully open 550B-parameter mixture-of-experts model with 55B active parameters, built to orchestrate complex, long-running agent workflows. It uses hybrid Mamba-Transformer layers and NVFP4 quantization that NVIDIA says delivers up to 5x higher throughput, with a single checkpoint that runs across Hopper, Blackwell and Ampere GPUs. Weights, data and recipes are open.
Google's Gemma 4 12B brings encoder-free multimodal AI to laptops
Google introduced Gemma 4 12B, a unified, encoder-free multimodal model that feeds vision and audio directly into the LLM backbone, with native audio inputs and a 256K context. Google says it nears the performance of its 26B MoE model at less than half the memory footprint and runs locally on laptops with 16GB of RAM, released under an Apache 2.0 license.
Anthropic confidentially files draft S-1 for an IPO
Anthropic said it confidentially submitted a draft S-1 registration statement to the US SEC for a proposed initial public offering, giving it the option to go public after the SEC completes its review. The number of shares and price have not been set, and the company said any offering will depend on market conditions.
NVIDIA launches Cosmos 3, an open foundation model for physical AI
NVIDIA launched Cosmos 3, an open world foundation model for physical AI built on a mixture-of-transformers architecture that combines vision reasoning, world generation and action prediction in one system. NVIDIA describes it as the first fully open omnimodel spanning text, image, video, ambient sound and action, available now as Cosmos 3 Super and Nano, with an Edge variant coming soon.
Cloud Security Alliance details two-wave AI developer supply-chain attack
The Cloud Security Alliance published a May 22 analysis of TeamPCP's Shai-Hulud/Megalodon campaign against AI developer infrastructure. CSA says Mini Shai-Hulud compromised 172 npm packages and 2 PyPI packages across 404 malicious versions, then Megalodon pushed 5,718 malicious commits to 5,561 GitHub repositories in under six hours, with persistence hooks targeting tools including Claude Code and Visual Studio Code.
OpenAI Codex named a Leader in enterprise AI coding agents
OpenAI said Codex was recognized as a Leader in Gartner's 2026 Magic Quadrant for Enterprise AI Coding Agents. The company says Codex is used by more than 4 million people each week, and highlighted enterprise controls including approval gates, RBAC, customizable policies, OS-level sandboxing, auditable workspace governance, IDE and CLI surfaces, SDKs, and cloud orchestration.
Virgin Atlantic says Codex speeds refactors and app testing
OpenAI published a Virgin Atlantic case study saying the airline used Codex to ship a revamped mobile app with near-complete unit test coverage and zero P1 defects at launch. Virgin Atlantic also reported 78% to 80% codebase size reductions on some legacy refactors and said work that once took two weeks can now take about 30 minutes to an hour.
AdventHealth deploys ChatGPT for Healthcare across clinical workflows
OpenAI detailed AdventHealth's deployment of ChatGPT Enterprise and ChatGPT for Healthcare across a hospital system operating in nine states. AdventHealth says the rollout targets administrative burden, utilization-management summaries, structured rationales, and operational workflows, with an 80% reduction in time spent on some administrative tasks and an emphasis on governance and measured adoption.
Hark raises $700M for a universal AI interface and hardware
TechCrunch reported that Hark, the AI lab founded by Brett Adcock, who also founded Figure AI and Archer, raised a $700 million Series A at a $6 billion post-money valuation. Hark says it is building an agentic AI system as a universal interface for the digital world, expects to release multimodal models this summer, and plans custom hardware after that.
Microsoft Foundry Labs ships new open agentic stack and benchmarks
Microsoft Foundry Labs released a May roundup with SocialReasoning-Bench for measuring whether agents act in a user's best interest, plus an open end-to-end agentic stack made up of MagenticLite, MagenticBrain, and Fara 1.5. The stack emphasizes visible reasoning, browser and local-file workflows, sandboxed code execution, human approvals for critical actions, and small computer-use models built on Qwen 3.5.
NVIDIA Vera Rubin NVL72 and Jetson Thor win COMPUTEX AI awards
NVIDIA said its Vera Rubin NVL72 rack-scale AI supercomputer, Jetson Thor edge AI and robotics platform, and Alpamayo autonomous-vehicle platform won COMPUTEX 2026 Best Choice Awards. NVIDIA says Vera Rubin NVL72 is designed for agentic AI, reasoning, and long-context workloads, while Jetson Thor delivers up to 2,070 FP4 teraflops for physical AI and autonomous robots.
OpenAI model disproves long-standing discrete geometry conjecture
OpenAI reported that an internal general-purpose reasoning model disproved a central conjecture in the planar unit distance problem, producing an infinite family of constructions with polynomial improvement over the long-believed square-grid bound. OpenAI says external mathematicians checked the proof and wrote companion remarks, calling the result a milestone for AI-assisted mathematics.
Anthropic hires Andrej Karpathy for Claude pretraining research
OpenAI cofounder and former Tesla AI director Andrej Karpathy said he is joining Anthropic. CNBC reports Karpathy will be part of Anthropic's pretraining team, building a group focused on using Claude to accelerate the research that gives the company's models their core knowledge and capabilities.
Google brings AI agents and generative UI into Search
Google said AI Mode in Search now uses Gemini 3.5 Flash globally and introduced a redesigned AI-powered Search box. New Search agents will monitor the web in the background, send synthesized updates, help with booking tasks, and eventually generate custom interactive layouts, simulations, dashboards, and trackers with Antigravity-powered coding.
Google expands SynthID and Content Credentials verification
Google expanded AI-content verification across Search, Gemini, Chrome, Pixel, and Google Cloud, saying SynthID has watermarked more than 100 billion images and videos and 60,000 years of audio. OpenAI, Kakao, and ElevenLabs are adopting SynthID for more AI-generated content, while a new Google Cloud AI Content Detection API is launching with trusted partners.
Google launches Gemini Omni Flash for multimodal video generation
Google introduced Gemini Omni, a new model family that combines Gemini reasoning with generative media, beginning with video output. The first release, Gemini Omni Flash, can use text, images, video, and audio references to generate or conversationally edit videos, is rolling out to Google AI Plus, Pro, and Ultra subscribers through Gemini and Flow, and will come to developer and enterprise APIs in the coming weeks.
Google previews Gemini Spark as a 24/7 personal AI agent
Google announced Gemini Spark, a cloud-based personal agent powered by Gemini 3.5 and the Antigravity harness. Spark is designed to keep working after a laptop closes, integrate with Gmail, Docs, Slides, and other connected apps, ask before high-stakes actions, and roll out first to trusted testers before a U.S. beta for Google AI Ultra subscribers.
Google releases Gemini 3.5 Flash for agents and coding
At Google I/O 2026, Google introduced Gemini 3.5 as a model family focused on complex agentic workflows, starting with Gemini 3.5 Flash. Google says Flash is now available globally in the Gemini app, AI Mode in Search, Antigravity, the Gemini API, AI Studio, Android Studio, and Gemini Enterprise, with claimed gains on coding and agentic benchmarks plus 4x faster output than other frontier models.
Ocean emerges from stealth with $28M to fight AI phishing
Ocean, an agentic email-security startup founded by former Israeli cybersecurity researcher Shay Shwartz, emerged from stealth with $28 million in total funding led by Lightspeed Venture Partners. The company says AI has automated spear-phishing at much larger scale and that its small language model analyzes billions of emails each month for customers including Kayak, Kingston Technology, and Headspace.
Anthropic acquires Stainless to strengthen agent connectivity
Anthropic acquired Stainless, the SDK and MCP server tooling company that has generated official Anthropic SDKs since the API's early days. Stainless creates SDKs, CLIs, and MCP servers from API specs across TypeScript, Python, Go, Java, and more, and Anthropic says the deal will help Claude agents connect more reliably to external systems.
NVIDIA ships first Vera CPUs to top AI labs
NVIDIA delivered its first standalone Vera CPU systems to Anthropic, OpenAI, SpaceXAI, and Oracle Cloud Infrastructure, moving the agentic-AI processor from announcement to customer evaluation. Vera packs 88 NVIDIA-designed Olympus cores, 1.2TB/s of memory bandwidth, and 50% faster per-core performance for agent sandboxes, tool calls, orchestration, and long-context retrieval workloads.
Anthropic and Gates Foundation commit $200M to beneficial AI programs
Anthropic announced a four-year, $200 million partnership with the Gates Foundation spanning Claude usage credits, technical support, and grant funding. The work targets global health, life sciences, education, and economic mobility, including public health datasets, healthcare AI benchmarks, disease-modeling support, AI tools for neglected diseases, K-12 tutoring, and agricultural productivity applications.
Khosla backs Synthetic with $10M for autonomous AI bookkeeping
Synthetic, founded by former Bench Accounting CEO Ian Crosby, raised a $10 million seed round led by Khosla Ventures to pursue a fully autonomous AI bookkeeper for accrual-based financials. The startup plans to focus on AI and software companies first, while acknowledging that current foundation models still make bookkeeping mistakes and the product remains in the design phase.
Lovable backs Atech to bring vibe coding to hardware prototypes
Danish startup Atech raised an $800,000 pre-seed round with backing from Lovable, a16z scout fund, Sequoia Scout Fund, and Nordic Makers. Atech pairs hardware starter kits with an AI chatbot that turns natural-language prototype ideas into code for working hardware builds, aiming to reduce the engineering barrier for physical products.
OpenAI brings Codex to the ChatGPT mobile app
OpenAI rolled out Codex in preview on iOS and Android so users can follow active coding threads, review diffs and terminal output, approve actions, and redirect long-running agent work from a phone. The update also makes Remote SSH generally available, adds generally available Codex hooks, introduces programmatic access tokens for Business and Enterprise workspaces, and supports eligible HIPAA-compliant local Codex deployments.
OpenAI says two employee devices were hit by TanStack supply-chain attack
After malicious TanStack package versions spread through npm, OpenAI confirmed two employee devices were affected and that a limited subset of internal source-code repositories saw unauthorized credential access. The company said it found no evidence that user data, production systems, intellectual property, or software releases were compromised and began rotating signing certificates as a precaution.
OpenAI updates ChatGPT to better track risk in sensitive conversations
OpenAI detailed new safety updates that help ChatGPT recognize when self-harm, suicide, or harm-to-others risk emerges over time. The system uses short-lived, narrowly scoped safety summaries for rare high-risk cases and improved safe-response performance by 50% in long suicide and self-harm evaluations, 16% in harm-to-others scenarios, and 39% to 52% across multi-conversation GPT-5.5 Instant tests.
Twin Prime raises $10M to build frontier AI for defense and security
London-based Twin Prime landed a $10 million pre-seed round led by Expeditions to develop multimodal AI models for defense and security. The startup is building systems that reason across sensor modalities and compress perception-to-decision workflows for real-time threat response, with plans for a joint venture with European defense prime Theon.
Anthropic launches Claude for Small Business
Announced May 13, Claude for Small Business plugs directly into QuickBooks, PayPal, HubSpot, Canva, DocuSign, Google Workspace, and Microsoft 365. It ships with 15 ready-to-run agentic workflows spanning finance, ops, sales, marketing, HR, and customer service — including automated payroll planning, month-end reconciliation, campaign management, and invoice tracking.
Meta unveils four new MTIA chips for its AI data centers
Meta announced a new MTIA (Meta Training and Inference Accelerator) lineup. MTIA 300 is already deployed for training smaller ranking and recommendation models; MTIA 400, 450, and 500 are in development for generative AI inference and will launch by 2027.
NVIDIA launches Nemotron 3 Nano Omni multimodal model
Nemotron 3 Nano Omni is an open multimodal model unifying vision, audio, and language. NVIDIA reports up to 9× higher throughput than competing open models, targeting more efficient AI agents on commodity hardware.
NVIDIA partners with David Silver's Ineffable Intelligence
NVIDIA announced a collaboration with British AI startup Ineffable Intelligence, founded by former DeepMind RL lead David Silver, to develop systems that learn through reinforcement learning rather than human data. The work will run on NVIDIA's Grace Blackwell and Vera Rubin platforms.
NVIDIA releases Star Elastic: one checkpoint, three reasoning models
NVIDIA Research introduced Star Elastic, a post-training method that embeds nested 30B, 23B, and 12B reasoning submodels inside a single checkpoint with zero-shot slicing. Operators can pick a model size at inference time without retraining.
OpenAI releases GPT-5.5 ("Spud"), its most agentic model yet
Rolled out to paid ChatGPT and Codex users on May 13, GPT-5.5 is tuned for long-running agentic tasks with minimal prompting. API access will follow once additional security guardrails are in place. OpenAI did not publish a SWE-bench Verified score; Anthropic's Claude Mythos Preview currently leads that benchmark at 93.9%.
Thinking Machines Lab loses key talent to Meta, OpenAI, and xAI
After founding employees crossed the one-year cliff and unlocked equity, Thinking Machines Lab saw a wave of departures. Meta reportedly recruited seven founding team members plus a star researcher with compensation packages worth hundreds of millions.
Google DeepMind reimagines the mouse pointer with Gemini
DeepMind unveiled an AI-enabled pointer powered by Gemini that understands on-screen visual context. Users can issue shorthand commands like "Fix this" or "Show me directions" without switching windows or writing long prompts.
Google publishes patterns for long-running enterprise agents
Google's Developers Blog detailed how to build pause-and-resume agents with the Agent Development Kit (ADK). The approach uses durable memory schemas and event-driven dormancy gates — instead of stateless chatbot patterns — to support multi-week workflows like HR onboarding without losing context.
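The pause-and-resume idea can be illustrated without any agent framework. The sketch below is an assumption-laden toy and does not use the ADK API: `WorkflowState`, `STEPS`, `start`, `advance`, and `on_event` are hypothetical names, the "durable memory schema" is a small dataclass serialized to JSON, and the "dormancy gate" is a named event the workflow waits on before continuing.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Optional


@dataclass
class WorkflowState:
    """Hypothetical durable schema: everything needed to resume later."""
    workflow_id: str
    step: int = 0
    waiting_for: Optional[str] = None  # event the workflow is dormant on
    notes: list = field(default_factory=list)


# Illustrative onboarding steps: (action, event that must arrive afterwards).
STEPS = [
    ("send_offer_letter", "offer_signed"),
    ("provision_laptop", "laptop_delivered"),
    ("schedule_orientation", None),
]


def advance(state: WorkflowState) -> WorkflowState:
    """Run steps until one needs an external event, then go dormant."""
    while state.step < len(STEPS) and state.waiting_for is None:
        action, gate = STEPS[state.step]
        state.notes.append(f"did {action}")
        state.step += 1
        state.waiting_for = gate
    return state


def start(workflow_id: str) -> str:
    """Begin a workflow and return its persisted (JSON) state."""
    return json.dumps(asdict(advance(WorkflowState(workflow_id))))


def on_event(saved: str, event: str) -> str:
    """Reload persisted state, open the gate if the event matches, persist."""
    state = WorkflowState(**json.loads(saved))
    if state.waiting_for == event:
        state.waiting_for = None
        advance(state)
    return json.dumps(asdict(state))


saved = start("onboard-1")                   # dormant, waiting on offer_signed
saved = on_event(saved, "laptop_delivered")  # wrong event: state unchanged
saved = on_event(saved, "offer_signed")      # resumes, then waits again
saved = on_event(saved, "laptop_delivered")  # resumes and finishes
print(json.loads(saved)["notes"])
```

Because the full state round-trips through JSON between events, nothing has to stay in memory while the workflow is dormant, which is the property that makes multi-week pauses workable. A real system would write the JSON to a database rather than pass it around as a string.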
IBM debuts Red Hat AI Inference and OpenShift Virtualization on IBM Cloud
IBM announced two managed offerings on May 12: Red Hat AI Inference Service and Red Hat OpenShift Virtualization Service on IBM Cloud. Both are aimed at helping enterprises operationalize AI and run virtualized workloads at scale with built-in governance controls.
Microsoft's MDASH agentic security system tops CyberGym
Microsoft's new multi-model security system (codename MDASH) orchestrates 100+ specialized agents and posted an industry-leading 88.45% on the CyberGym benchmark. In the announcement, Microsoft says the system has already discovered 16 new vulnerabilities in Windows, including four critical RCE flaws.
OpenAI introduces "Daybreak" cyber platform
Announced May 12, Daybreak combines OpenAI's language models with Codex's agentic capabilities to automate vulnerability detection, patch validation, and secure software development inside enterprise security workflows. The launch puts OpenAI head-to-head with Anthropic's Mythos in enterprise cyber.
Power Apps MCP server adds closed-loop learning for agents
Microsoft introduced closed-loop learning on the Power Apps MCP server: user corrections automatically improve enterprise agent performance using memory-based optimization and a genetic-Pareto optimization step.
SAP and Anthropic bring Claude to SAP Business AI Platform
At SAP Sapphire, SAP and Anthropic announced plans to embed Claude across the Business AI Platform to advance the "Autonomous Enterprise." Claude will power agentic capabilities such as financial closing, employee leave questions, and supplier order management directly inside SAP systems.
SAP and NVIDIA co-define enterprise-grade agent execution
SAP and NVIDIA detailed a joint framework for secure, auditable, and governable AI agents built on NVIDIA OpenShell. The work focuses on the runtime controls enterprises need before pushing autonomous agents into production.
SAP unveils the Autonomous Enterprise with 50+ Joule Assistants
SAP introduced a unified Business AI Platform and Autonomous Suite, deploying more than 50 domain-specific Joule Assistants across finance, supply chain, and HR. Partnerships span Anthropic, AWS, Google Cloud, Microsoft, NVIDIA, and Palantir. SAP says its Autonomous Close Assistant can compress financial closing from weeks to days.
Microsoft research: AI agents still struggle with long workflows
A Microsoft study using the new DELEGATE-52 benchmark tested frontier models (Gemini 3.1 Pro, Claude 4.6 Opus, GPT-5.4) across 52 professional workflows. The team found models lose ~25% of document content over 20 interactions on average, with severe corruption in 80% of conditions. Only Python programming hit "ready" status at 98%+ accuracy.
OpenAI launches the "OpenAI Deployment Company"
OpenAI launched a new entity dedicated to helping organizations build and deploy AI for mission-critical work. The Deployment Company starts with $4B in initial backing from 19 global investment firms and consultancies, and absorbs Tomoro to bring on roughly 150 Forward Deployed Engineers and Deployment Specialists.
Reader Questions
AI News — Frequently Asked Questions
How often is the AI news updated?
This page is curated and refreshed regularly with the latest artificial intelligence announcements, model releases, research, and enterprise rollouts. The "last updated" date always reflects our most recent refresh.
What kind of AI news do you cover?
We cover new AI model releases, AI agents, research breakthroughs, AI hardware and chips, AI security, enterprise AI adoption, and notable talent moves — focusing on the stories that matter most to founders, engineers, and technology leaders.
Where do these AI news stories come from?
Every story is compiled from public reporting and primary sources. Each card links directly to the original source so you can verify the details and read more.
How can I get AI news in my inbox?
Subscribe to the weekly DonvitoCodes AI newsletter to get the most important AI news, releases, and analysis delivered to your inbox every week.