AI Red Teaming

IEEE GAISS October 28, 2026 - October 30, 2026 event upcoming

GAISS 2026: IEEE GenAI for Secure Systems

GAISS 2026 is an IEEE conference at the University of Texas at Austin focused on generative AI for secure systems, including red teaming, blue-team automation, governance, and agentic secure AI.

View details Open event page

DEF CON August 6, 2026 - August 9, 2026 event upcoming

DEF CON 34 / AI Village 2026

DEF CON 34 takes place in Las Vegas and is expected to include AI security activity through villages, workshops, contests, and community-led research tracks as schedules firm up.

AI Red Teaming Prompt Injection Adversarial ML

View details Open event page

Black Hat August 1, 2026 - August 6, 2026 event upcoming

Black Hat USA 2026 AI Summit

Black Hat USA 2026 includes an AI Summit and security briefings in Las Vegas focused on how artificial intelligence is changing digital defense.

AI Red Teaming Agent Security Adversarial ML

View details Open event page

The Hacker News AI Security July 13, 2026 news

New MemGhost Attack Plants Persistent False Memories in AI Agents Through One Email

Give an AI assistant a memory and access to your inbox, and you hand an attacker a way to rewrite what it thinks it knows about you. A single email can trick that agent into saving a false "fact" about the user, hide the change, and quietly steer its answers in later sessions. When it works, the person reads an ordinar

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

The Hacker News AI Security July 10, 2026 news

Researcher Details WhatsApp-to-Host Attack Chain Using Three OpenClaw Flaws

Details have emerged about three now-patched security flaws in the OpenClaw personal artificial intelligence (AI) assistant that, if successfully exploited, could enable credential theft, privilege escalation, and arbitrary code execution on the host. A brief description of the high-severity vulnerabilities is as follo

Agent Security Prompt Injection AI Red Teaming

Read summary Source link

Microsoft Security Blog July 10, 2026 news

Securing our future: July 2026 progress report on Microsoft’s Secure Future Initiative

Microsoft’s latest Secure Future Initiative report outlines progress on secure foundations, AI-powered defense, and future-ready cybersecurity. The post Securing our future: July 2026 progress report on Microsoft’s Secure Future Initiative appeared first on Microsoft Security Blog .

AI Red Teaming Prompt Injection Agent Security

Read summary Source link

Adversa AI Trusted AI Blog July 9, 2026 analysis

Top AI Coding Agent security resources — July 2026

This month exposed a harsh reality: autonomous coding agents are expanding the enterprise attack surface faster than we can patch them. Explore our digest of 19 critical resources to learn why static scanning is officially obsolete, and what your team must do to secure AI coding assistants. The post Top AI Coding Agent

Adversarial ML AI Red Teaming Model Evaluation

Read summary Source link

OpenAI News July 9, 2026 news

GPT-5.6: Frontier intelligence that scales with your ambition

More intelligence from every token, stronger performance per dollar, and more capability on demand for your hardest work.

AI Compliance AI Red Teaming Model Evaluation

Read summary Source link

OpenAI News July 9, 2026 news

GPT-5.6 is now the preferred model in Microsoft 365 Copilot

Learn how GPT-5.6 powers Microsoft 365 Copilot with stronger AI capabilities across Word, Excel, PowerPoint, Chat, and Cowork for faster, higher-quality work.

AI Compliance AI Red Teaming Model Evaluation

Read summary Source link

The Hacker News AI Security July 9, 2026 news

Top AI Agents Built to Catch Malicious Code Can Be Tricked Into Running It

Ask an AI coding agent to scan open-source code for security holes, and it might run the attacker's code on your own machine instead. That is the finding in a proof-of-concept published Wednesday by the AI Now Institute, an attack it calls "Friendly Fire." It works against Anthropic's Claude Code and OpenAI's Codex whe

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

The Hacker News AI Security July 9, 2026 news

GhostApproval Symlink Flaws Could Let Malicious Repos Run Code in AI Coding Agents

Researchers at Wiz found that a flaw in six popular AI coding assistants lets a booby-trapped code project quietly take control of a developer's computer. The assistant asks permission to edit one harmless-looking file, but the write lands on a sensitive one instead. The affected tools are Amazon Q Developer, Anthropic

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

SecurityWeek AI Security July 9, 2026 news

UK Government Rolls Out Agentic AI Defense Plan Alongside Industry Pledge

Two announcements on July 7, 2026, demonstrate the government’s determination to improve the level of cybersecurity within the UK. The post UK Government Rolls Out Agentic AI Defense Plan Alongside Industry Pledge appeared first on SecurityWeek .

Agent Security AI Red Teaming AI Compliance

Read summary Source link

METR July 8, 2026 analysis

Because 8 ≈ e², Anthropic's researcher uplift is plausibly >2x

Note: the modeling assumptions and conclusion are Thomas Kwa’s opinion, and others at METR disagree. 1 Also, the math was checked by Claude but not a second human. Introduction Anthropic’s RSI blog post reported that in Q2 2026, Anthropic contributors merged 8× as much code per day as in the 2021-2024 period. What does

Model Evaluation Agent Security AI Red Teaming

Read summary Source link

OpenAI News July 8, 2026 news

Our approach to government and national security partnerships

Learn how OpenAI approaches government and national security partnerships, with principles for responsible AI use, democratic accountability, and public safety.

AI Compliance AI Red Teaming Model Evaluation

Read summary Source link

OpenAI News July 8, 2026 news

Introducing GPT-Live

A new generation of voice models for natural human-AI interaction, now powering ChatGPT Voice.

AI Compliance AI Red Teaming Model Evaluation

Read summary Source link

The Hacker News AI Security July 8, 2026 news

AI Coding Agents Found Triggering Endpoint Security Rules Built to Catch Attackers

Sophos looked at a week of its own endpoint data and found that AI coding agents such as Claude Code, Cursor, and OpenAI Codex are setting off detection rules written to catch human intruders. The agents are not malicious. They just do a lot of things that, to a behavioral engine, look exactly like an attack. Decryptin

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

The Hacker News AI Security July 8, 2026 news

CISA Adds 4 Actively Exploited Adobe, Joomla, and Langflow Flaws to KEV

The U.S. Cybersecurity and Infrastructure Security Agency (CISA) on Tuesday added four security flaws to its Known Exploited Vulnerabilities (KEV) catalog, citing evidence of active exploitation. The vulnerabilities are listed below - CVE-2026-48282 (CVSS score: 10.0) - A path traversal vulnerability in Adobe ColdFusio

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

The Hacker News AI Security July 8, 2026 news

GitHub Copilot Refuses Harmful Requests in Chat, Then Writes Them in Code

An AI coding assistant that refuses to answer a dangerous request in its chat box can answer it anyway if the same request is broken into small, ordinary-looking steps inside a code editor. That is the finding of a new study of GitHub Copilot by researchers Abhishek Kumar and Carsten Maple. The models they tested throu

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

SecurityWeek AI Security July 8, 2026 news

Critical Vulnerability Exposes GitHub Agentic Workflows to Prompt Injection

Researchers show how attackers can use a crafted public GitHub Issue to trick AI-powered workflows into exposing data from private repositories without authentication. The post Critical Vulnerability Exposes GitHub Agentic Workflows to Prompt Injection appeared first on SecurityWeek .

Agent Security AI Red Teaming AI Compliance

Read summary Source link

The Hacker News AI Security July 7, 2026 news

Public GitHub Issue Could Trick GitHub Agentic Workflows Into Leaking Private Repo Data

A public issue can trick GitHub Agentic Workflows into leaking the contents of an organization's private repositories, researchers at Noma Security have shown. The attacker needs only to open a normal-looking issue on a public repository, with no stolen credentials and no access to the organization. If that organizatio

Prompt Injection Agent Security AI Red Teaming

Read summary Source link

The Hacker News AI Security July 7, 2026 news

Writer AI Flaw Could Let Agent Previews Leak Session Tokens Across Tenants

Cybersecurity researchers have disclosed details of a now-patched critical session isolation vulnerability in Writer, an enterprise generative artificial intelligence (AI) platform, that could result in cross-tenant compromise. The one-click vulnerability has been codenamed WriteOut by the Sand Security Research team.

Current notes, events, and source material