AI News

OpenAI CAISI UK AISI Partnership Strengthens AI Security Through Joint Red-Teaming

OpenAI CAISI UK AISI partnerships deliver security improvements through collaborative red-teaming of ChatGPT Agent and biosecurity safeguards, revealing vulnerabilities and rapid fixes.

LLMBase Editorial March 10, 2026 Updated September 12, 2025 2 min read

ai llm industry security red-teaming governance

These voluntary agreements represent a shift toward collaborative security evaluation between AI companies and government research institutions. For European AI teams building enterprise systems, the approach offers a template for structured security partnerships that could inform regulatory compliance frameworks under the EU AI Act.

ChatGPT Agent Security Testing Reveals Novel Vulnerabilities

The CAISI partnership expanded beyond capability evaluation to include product security testing of OpenAI's agentic systems. In July, CAISI teams conducted red-teaming exercises on ChatGPT Agent that combined traditional cybersecurity methods with AI-specific attack vectors.

CAISI identified two vulnerabilities that could allow sophisticated attackers to bypass security protections and remotely control computer systems accessed by the agent. The attack chain achieved approximately 50% success rate by combining conventional software exploits with AI agent hijacking techniques. OpenAI resolved these issues within one business day of reporting.

The collaboration required CAISI to develop new evaluation methodologies that bridge cybersecurity and machine learning domains. This multidisciplinary approach proved necessary as AI agents introduce attack surfaces that traditional security testing may not capture.

UK AISI Biosecurity Red-Teaming Delivers Systematic Improvements

The UK AISI partnership focused on testing safeguards against biological misuse across ChatGPT Agent and GPT-5. Unlike single-deployment evaluations, this represents an ongoing collaboration to continuously strengthen OpenAI's biosecurity protections.

UK AISI received extensive system access, including non-public safeguard prototypes, model variants with certain guardrails removed, and visibility into OpenAI's internal safety monitoring systems. This deep integration enabled more effective vulnerability discovery than external testing alone.

Throughout the May-August testing period, UK AISI identified more than a dozen vulnerability reports that resulted in engineering fixes, policy enforcement improvements, and targeted classifier training. The rapid feedback loop between teams allowed iterative testing and strengthening of safeguards before public deployment.

Implications for Enterprise AI Security

These collaborations establish precedents for structured AI system evaluation that extend beyond pre-deployment testing. The ongoing nature of both partnerships suggests that effective AI security requires continuous monitoring and improvement rather than point-in-time assessments.

For European enterprises deploying AI systems, the approach highlights the value of external security evaluation capabilities that combine domain expertise with AI-specific testing methods. Organizations building AI agents or handling sensitive data may need similar multidisciplinary security partnerships to identify vulnerabilities that internal teams might miss.

The rapid remediation timelines demonstrated in both partnerships also indicate the operational requirements for maintaining AI system security. One-day turnaround for critical fixes requires engineering processes and organizational capabilities that many enterprises are still developing.

The OpenAI CAISI UK AISI collaborations demonstrate how government research institutions can contribute specialized evaluation capabilities that strengthen commercial AI deployments while informing broader industry security practices.

Original source: OpenAI published details of the CAISI and UK AISI partnerships at https://openai.com/index/us-caisi-uk-aisi-ai-update

AI News Updates

Subscribe to our AI news digest

Weekly summaries of the latest AI news. Unsubscribe anytime.

More News

Grammarly Expert Review Class Action Lawsuit Filed Over Unauthorized AI Feature

Superhuman faces a federal class action lawsuit over Grammarly's Expert Review feature, which used writer names without consent before being discontinued Wednesday.

March 11, 2026 · Wired

OpenAI Prompt Injection Defenses Use Social Engineering Framework

OpenAI reveals how ChatGPT defends against prompt injection attacks by treating them as social engineering threats rather than simple input filtering problems.

March 11, 2026 · OpenAI

OpenAI Responses API Computer Environment: From Model to Agent Architecture

OpenAI built an agent runtime using the Responses API with shell tools and hosted containers to run secure, scalable agents with files, tools, and persistent state.

March 11, 2026 · OpenAI

Wayfair Integrates OpenAI Models for Catalog Accuracy and Support Automation

Wayfair uses OpenAI models to automate supplier support ticket routing and improve product catalog attributes across 30 million items, achieving 70% automation in some workflows.

March 11, 2026 · OpenAI

Browse all news →

Made in Europe

Chat with 100+ AI Models in one App.

Use Claude, ChatGPT, Gemini alongside with EU-Hosted Models like Deepseek, GLM-5, Kimi K2.5 and many more.

Start for free View pricing