Model Evaluation

NeurIPS December 6, 2026 - December 12, 2026 event upcoming

NeurIPS 2026

NeurIPS 2026 is the fortieth annual Conference on Neural Information Processing Systems, with the primary dates listed for Sydney, Australia, and additional satellite locations in Atlanta and Paris.

OpenAI September 29, 2026 event upcoming

OpenAI DevDay 2026

OpenAI DevDay 2026 is scheduled for September 29 in San Francisco and is OpenAI’s primary developer event for platform updates.

AI Engineering Agent Security Model Evaluation

AI Engineer YouTube July 14, 2026 video

Don't Ship Skills Without Evals — Philipp Schmid, Google DeepMind

There are thousands of agent skills. Almost none of them are tested. They get vibe-checked with two manual runs, maybe a thumbs-up from a colleague, then shipped. You wouldn't merge code without tests — so why are we shipping skills without evals? This talk covers the full lifecycle of building reliable agent skills: w

AI Engineering Model Evaluation Prompt Engineering

AI Engineer YouTube July 14, 2026 video

WTF Is the Context Layer? The Missing Infrastructure for Production Agents — Prukalpa Sankar

In the last two years, models have gotten exponentially smarter. Two years ago they couldn't pass the bar. Today, top 1% of test scorers. And yet most agents still can't answer a simple business question correctly. You ship a demo that works. You deploy it. The business abandons it in a month. The missing variable is c

AI Engineering Model Evaluation Prompt Engineering

AI Engineer YouTube July 14, 2026 video

Forward Deployed Engineering at Cursor — Pauline Brunet

How Forward Deployed Engineering is done at Cursor. Location: Forward Deployed Engineering / Room 2020. When: Day 2 - June 30, 2026 · 11:10am-11:30am. Speaker: Pauline Brunet, VP, Forward Deployed Engineering · Cursor (LinkedIn: https://www.linkedin.com/in/pauline-brunet/). VP of Forward Deployed Engineerin

AI Engineering Model Evaluation Prompt Engineering

AI Engineer YouTube July 14, 2026 video

"The engineer of the future is the person who is able to choose what is worth doing." — Addy Osmani

For his closing keynote, Addy Osmani explores the evolving role of software engineers in the age of AI agents. He argues that as coding tasks become increasingly automated, the true value of an engineer shifts from mere code production to accountability, judgment, and system ownership. https://addyosmani.com/ https://x

AI Engineering Model Evaluation Prompt Engineering

Google Cloud Security Blog July 13, 2026 news

Securing the AI supply chain on GKE: Introducing k8s-aibom for automated AI BOMs

Top AI Coding Agent security resources — July 2026

This month exposed a harsh reality: autonomous coding agents are expanding the enterprise attack surface faster than we can patch them. Explore our digest of 19 critical resources to learn why static scanning is officially obsolete, and what your team must do to secure AI coding assistants.

Adversarial ML AI Red Teaming Model Evaluation

OpenAI News July 9, 2026 news

GPT-5.6: Frontier intelligence that scales with your ambition

More intelligence from every token, stronger performance per dollar, and more capability on demand for your hardest work.

AI Compliance AI Red Teaming Model Evaluation

OpenAI News July 9, 2026 news

GPT-5.6 is now the preferred model in Microsoft 365 Copilot

Learn how GPT-5.6 powers Microsoft 365 Copilot with stronger AI capabilities across Word, Excel, PowerPoint, Chat, and Cowork for faster, higher-quality work.

AI Compliance AI Red Teaming Model Evaluation

The Hacker News AI Security July 9, 2026 news

Meta's New AI Image Tool Lets Others Use Your Public Instagram Photos in AI Images

Meta has announced that its new artificial intelligence (AI) model Muse Image lets people use public Instagram posts and reels to generate AI content, and it's enabled by default. "You can also @-mention Instagram accounts in the Meta AI app to bring specific Instagram profiles right into your images," the social media

AI Compliance Model Evaluation

AI Engineer YouTube July 9, 2026 video

The Golden Age of AI Engineering — Alexander Embiricos & Romain Huet & Peter Steinberger, OpenAI

OpenAI's Dev Day 2024 demo ran on an o1 preview model that could not run or check its own code, so Romain Huet had to cross his fingers live on stage. A year later, the same kind of demo ran a full camera and lighting rig, because the model could now test its own work. Alexander Embiricos and Huet use that jump to show

AI Engineering Model Evaluation Prompt Engineering

Google Cloud Security Blog July 8, 2026 news

Meet the 33 cybersecurity startups joining the Gemini Startup Forum

Our flagship Google for Startups program, Gemini Startup Forum: Cybersecurity, has selected its first 33 trailblazing startups.

Agent Security Prompt Injection Model Evaluation

METR July 8, 2026 analysis

Because 8 ≈ e², Anthropic's researcher uplift is plausibly >2x

Note: the modeling assumptions and conclusion are Thomas Kwa’s opinion, and others at METR disagree. Also, the math was checked by Claude but not a second human. Introduction: Anthropic’s RSI blog post reported that in Q2 2026, Anthropic contributors merged 8× as much code per day as in the 2021-2024 period. What does

Model Evaluation Agent Security AI Red Teaming

OpenAI News July 8, 2026 news

Our approach to government and national security partnerships

AWS Security Blog July 7, 2026 analysis

Enforce zero data retention on Amazon Bedrock with Bedrock Projects and service control policies

With the introduction of models that require data sharing with third-party providers—such as Claude Fable 5—organizations need a way to centrally enforce data retention policies. Amazon Bedrock gives you control over whether your prompts and model outputs are retained after an inference request completes. You might nee

Agent Security Model Evaluation AI Compliance

Google Cloud Security Blog July 7, 2026 news

Drive proactive security, prioritize risks with Google Threat Intelligence and Wiz ASM

To help you match your real-world exposures with real-time adversary activity, we’ve begun integrating Google Threat Intelligence with Wiz Attack Surface Management.

Agent Security Prompt Injection Model Evaluation

AI Engineer YouTube July 7, 2026 video

How we taught agents to use good retrieval - Hanna Lichtenberg, Mixedbread AI

RAG is dead. Again. Vector search is useless. All you need is BM25. Not even BM25, all you need is grep. Or maybe even just cat+ls. If you care at all about agents, you probably read a variation of this as part of your daily routine. In a way, isn't it true that semantic search is full of failure cases? And yet, in all

AI Engineering Model Evaluation Prompt Engineering

ICML July 6, 2026 - July 11, 2026 event event archive

ICML 2026

ICML 2026 takes place at COEX in Seoul, South Korea, with tutorials, main conference sessions, and workshops covering core machine learning research.

Model Evaluation Adversarial ML AI Engineering

Adversa AI Trusted AI Blog July 2, 2026 analysis

Top Agentic AI security resources — July 2026

July 2026's agentic AI security roundup: agentic zero trust whitepapers, AutoJack & other new exploits, and the newest agent defenses.

Adversarial ML AI Red Teaming Model Evaluation

AI Explained YouTube July 2, 2026 video

Fable 5 vs GPT 5.6 Sol: The Early Results

Fable 5 (newly re-released) vs GPT 5.6 Sol, what comparisons can we unearth? Plus, Sonnet 5, a 5% equity seizure by US Govt, the ‘largest heist’, Beetlejuice and more… Chapters: 00:00 - Introduction 01:06 - Fable Timeline 02:59 - Sol Release? 06:0

Model Evaluation AI Compliance Agent Security

Adversa AI Trusted AI Blog June 30, 2026 analysis

GuardFall: a universal shell injection vulnerability in open-source AI agents

Adversa AI analysis of GuardFall, a shell-injection pattern affecting open-source AI coding agents. The key issue is that agents often run shell commands with developer privileges, so old command-injection tricks can bypass modern AI safety filters if execution is not isolated and constrained.

Adversarial ML AI Red Teaming Model Evaluation

METR June 26, 2026 analysis

Summary of METR's predeployment evaluation of GPT-5.6 Sol

METR pre-deployment evaluation summary of a frontier model, emphasizing independent assessment, capability evidence, and launch-risk considerations. Relevant to model evaluation and safety gating.

Model Evaluation Agent Security AI Red Teaming

AI Engineer YouTube June 25, 2026 video

AI Engineer World's Fair 2026 Day 2 Livestream

Live from San Francisco, AI Engineer World’s Fair 2026 continues with Day 2 of session programming from the main stage. Watch live for keynote sessions, main-stage programming, and more from World’s Fair 2026 as AI Engineer brings another full day of AI engineering content to viewers online. Event: AI Engineer World’s

AI Engineering Model Evaluation Prompt Engineering

AI Engineer YouTube June 25, 2026 video

AI Engineer World's Fair 2026 Day 3 Livestream

Live from San Francisco, AI Engineer World’s Fair 2026 wraps with the final day of main-stage programming. Watch live for keynote sessions, featured talks, and closing-day highlights from World’s Fair 2026 as AI Engineer streams the final day of the event online. Event: AI Engineer World’s Fair 2026 Date: Thursday, Jul

AI Engineering Model Evaluation Prompt Engineering

OpenAI News June 22, 2026 news

Daybreak: Tools for securing every organization in the world

OpenAI introduces new Daybreak tools, including Codex Security and GPT-5.5-Cyber, to help organizations find, validate, and patch vulnerabilities at scale.

AI Compliance AI Red Teaming Model Evaluation

Google DeepMind Blog June 16, 2026 news

Securing the future of AI agents

Google DeepMind post on security requirements for AI agents. Relevant to matching agent autonomy with permissions, sandboxing, evaluation, and operational controls.

Model Evaluation Agent Security AI Engineering

Databricks June 15, 2026 - June 18, 2026 event event archive

Data + AI Summit 2026

Data + AI Summit 2026 is Databricks’ global data and AI conference in San Francisco and online, with 800+ sessions across data engineering, analytics, ML, governance, and agent applications.

AI Engineering AI Compliance Model Evaluation

METR report summarizing frontier model risk observations across February and March 2026. Relevant to external evaluation, risk monitoring, and pre-release assurance practices.

Model Evaluation Agent Security AI Red Teaming

Google DeepMind Blog May 15, 2026 news

Gemini 3.5: frontier intelligence with action

Gemini 3.5 is built to help you execute complex, agentic workflows.

Model Evaluation Agent Security AI Engineering

NDC Conferences YouTube May 7, 2026 video

Breaking the Black Box: Why Testing Generative AI Is Full Spectrum - Jason Ross - NDC Security 2026

NDC Security 2026 talk on testing generative AI systems, arguing that AI red teaming needs to evaluate variable behavior, safety boundaries, and AI-assisted testing methods rather than a single exploit result.

AI Red Teaming Model Evaluation

garak Releases May 1, 2026 tool

v0.15.0

Anthropic Frontier Red Team April 7, 2026 news

Assessing Claude Mythos Preview’s cybersecurity capabilities

Claude Mythos Preview is a new general-purpose language model that is strikingly capable at computer security tasks. This post provides technical details for researchers and practitioners who want to understand exactly how we have been testing this model, and what we have found over the past month. We hope this will sh

AI Red Teaming Agent Security Model Evaluation

NIST April 7, 2026 framework

NIST AI RMF and Critical Infrastructure Profile

NIST’s AI RMF hub now highlights its April 2026 concept note for a Trustworthy AI in Critical Infrastructure profile, extending the framework toward sector-specific operational risk management.

AI Compliance Model Evaluation

garak Releases April 3, 2026 tool

v0.14.1

garak release with new generators, probe metadata, and evaluation workflow improvements. Relevant to maintaining repeatable LLM security testing coverage.

AI Red Teaming Adversarial ML Model Evaluation

AI Explained March 26, 2026 video

Two AI Models Set to “stir government urgency”, But Will This Challenge Undo Them?

This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation AI Compliance AI Red Teaming Adversarial ML

Anthropic Frontier Red Team March 6, 2026 news

Reverse engineering Claude's CVE-2026-2796 exploit

This post dives deep into how Claude wrote an exploit for one of the vulnerabilities it found in Firefox.

AI Red Teaming Agent Security Model Evaluation

Anthropic Frontier Red Team March 6, 2026 news

Partnering with Mozilla to improve Firefox’s security

In a collaboration with researchers at Mozilla, Claude Opus 4.6 discovered 22 Firefox vulnerabilities over the course of two weeks.

AI Red Teaming Agent Security Model Evaluation

AI Explained March 6, 2026 video

What the New ChatGPT 5.4 Means for the World

This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation AI Compliance AI Red Teaming

AI Explained February 27, 2026 video

Deadline Day for Autonomous AI Weapons & Mass Surveillance

This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation AI Compliance Agent Security AI Red Teaming

AI Explained February 20, 2026 video

Gemini 3.1 Pro and the Downfall of Benchmarks: Welcome to the Vibe Era of AI

This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation Agent Security Adversarial ML

AI Explained February 6, 2026 video

The Two Best AI Models/Enemies Just Got Released Simultaneously

This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation Agent Security Adversarial ML

Anthropic Frontier Red Team February 5, 2026 news

LLM-discovered 0-days

AI models can now find high-severity vulnerabilities at scale. This is a moment to empower defenders. We're now using Claude to find and help fix vulnerabilities in open source software.

AI Red Teaming Agent Security Model Evaluation

AI Explained January 28, 2026 video

Claude AI Co-founder Publishes 4 Big Claims about Near Future: Breakdown

This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation AI Compliance Adversarial ML

NDC Conferences YouTube January 28, 2026 video

Keynote: Can you trust your (large language) model? - Jodie Burchell - NDC AI 2025

NDC AI 2025 keynote on whether machine-learning and LLM outputs can be trusted, and why black-box model behavior makes performance assessment and validation difficult.

Model Evaluation AI Compliance

AI Engineer January 24, 2026 video

Identity for AI Agents - Patrick Riley & Carlos Galan, Auth0

AI Engineer session by Patrick Riley and Carlos Galan of Auth0 on identity for AI agents. It adds practical context for how teams build and operate AI systems in production.

AI Engineering Agent Security Model Evaluation

AI Engineer January 24, 2026 video

Welcome to AIE CODE - Jed Borovik, Google DeepMind

AI Engineer welcome session for AIE CODE by Jed Borovik of Google DeepMind. It adds practical context for how teams build and operate AI systems in production.

AI Engineering Model Evaluation

Anthropic Frontier Red Team January 16, 2026 news

AI Models on Realistic Cyber Ranges

In a recent evaluation of AI models’ cyber capabilities, current Claude models can now succeed at multistage attacks on networks with dozens of hosts using only standard, open-source tools, instead of the custom tools needed by previous generations.

AI Red Teaming Agent Security Model Evaluation

Anthropic Frontier Red Team January 14, 2026 news

Finding Bugs with Claude and Property-based Testing

Ensuring that programs are bug-free is one of the most challenging aspects of software engineering. We developed an agent that can efficiently identify bugs in large software projects. Our agent infers general properties of code that should be true, and then applies property-based testing. After extensive manual valida

AI Red Teaming Agent Security Model Evaluation

AI Explained January 14, 2026 video

Anthropic: Our AI just created a tool that can ‘automate all white collar work’, Me:

AI Explained December 19, 2025 video

Gemini Exponential, Demis Hassabis' ‘Proto-AGI’ coming, but …

AI Engineer August 24, 2025 video

Real-time Experiments with an AI Co-Scientist - Stefania Druga, fmr. Google Deepmind

AI Engineer session by Stefania Druga, formerly of Google DeepMind, on real-time experiments with an AI co-scientist. It adds practical context for how teams build and operate AI systems in production.

AI Engineering Model Evaluation

Protect AI Blog August 15, 2025 analysis

Automated Red Teaming Scans of Dataiku Agents Using Protect AI Recon

AI Explained July 21, 2025 video

How Not to Read a Headline on AI (ft. new Olympiad Gold, GPT-5 …)

AI Explained March 25, 2025 video

Did AI Just Get Commoditized? Gemini 2.5, New DeepSeek V3, & Microsoft vs OpenAI

This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation AI Compliance Adversarial ML

NIST March 24, 2025 framework

Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations

NIST finalizes AI 100-2 E2025, providing a taxonomy and terminology for adversarial machine learning across predictive and generative AI systems.

Adversarial ML Model Evaluation AI Compliance

Anthropic March 19, 2025 analysis

Progress from our Frontier Red Team

Anthropic shares lessons from frontier red teaming and discusses where models are showing early-warning signs of higher-risk cyber and biology capabilities.

AI Red Teaming Model Evaluation

AI Explained March 13, 2025 video

Manus AI - The Calm Before the Hypestorm … (vs Deep Research + Grok 3)

This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation Agent Security AI Red Teaming Adversarial ML

AI Explained February 28, 2025 video

GPT 4.5 - not so much wow

AI Explained January 20, 2023 video

GPT 4 - hype vs reality

This AI Explained video reviews a major AI development through the lens of scaling and compute economics. It is useful context for AI engineering, evaluation, governance, and operational risk.

Model Evaluation

Current notes, events, and source material