Agent security is about what happens when AI systems can browse, use tools, remember state, and take actions across multiple steps. The security boundary moves from a chat response to a longer workflow with identity, permissions, memory, and operational consequences.
Agent Security
Controls and attack paths for browsing, tool use, memory, identity, and action-taking agents.
- What tools the agent can reach and under which identity
- How memory, plans, and previous steps influence later actions
- What approvals or reversibility exist when the agent gets it wrong
- Unsafe tool use and hidden privilege expansion
- Prompt injection flowing into planning and execution
- Long-running workflows accumulating risky state or momentum
- Teams shipping assistant-to-agent product transitions
- Practitioners studying autonomy and tool use
- Operators responsible for controls around high-impact actions
Current notes, events, and source material
These items are included because they add useful evidence, framing, implementation detail, or upcoming context for teams working in this area.
GAISS 2026: IEEE GenAI for Secure Systems
GAISS 2026 is an IEEE conference at the University of Texas at Austin focused on generative AI for secure systems, including red teaming, blue-team automation, governance, and agentic secure AI.
IAPP P.S.R. + AI Governance Global 2026
IAPP Privacy. Security. Risk. + AI Governance Global 2026 brings privacy, cybersecurity law, technology, and AI governance professionals together in Seattle.
OpenAI DevDay 2026
OpenAI DevDay 2026 is scheduled for September 29 in San Francisco and is OpenAI’s primary developer event for platform updates.
AGNTCon + MCPCon Europe 2026
AGNTCon + MCPCon Europe 2026 brings agent and MCP builders to Amsterdam to cover agent architectures, protocols, infrastructure, security, observability, and interoperability.
Black Hat USA 2026 AI Summit
Black Hat USA 2026 includes an AI Summit and security briefings in Las Vegas focused on how artificial intelligence is changing digital defense.
Gartner Security & Risk Management Summit 2026
Gartner Security & Risk Management Summit 2026 brings CISOs and security leaders together in National Harbor, Maryland, with tracks covering AI, cyber risk, application security, data security, operations, privacy, and governance.
Malicious npm packages abuse dependency confusion to profile developer environments
A dependency confusion campaign leveraged 33 malicious npm packages to collect reconnaissance data from developer and build environments. This report details the attack chain, observed tradecraft, and detection opportunities to help organizations identify and disrupt related activity. The post Malicious npm packages ab
How Braintrust turns customer requests into code with Codex
How Braintrust engineers use Codex with GPT-5.5 to run experiments and code faster.
A shared playbook for trustworthy third party evaluations
OpenAI shares guidance on third-party AI evaluations, covering how to assess model capabilities, safeguards, and validity for frontier systems.
Boston Children’s uses AI to unlock new diagnoses
Boston Children’s Hospital uses OpenAI technology to improve patient care, reduce operational burden, and help diagnose more than 40 rare disease cases.
Strengthening societal resilience with Rosalind Biodefense
OpenAI launches Rosalind Biodefense, expanding trusted access to GPT-Rosalind for vetted developers and U.S. government partners advancing biodefense, public health, and pandemic preparedness through frontier AI.
Microsoft is named a Leader in the 2026 Gartner® Magic Quadrant™ for Endpoint Protection
Microsoft is named a Leader in the 2026 Gartner® Magic Quadrant™ for Endpoint Protection. The post Microsoft is named a Leader in the 2026 Gartner® Magic Quadrant™ for Endpoint Protection appeared first on Microsoft Security Blog .
Typosquatted npm packages used to steal cloud and CI/CD secrets
The Mini Shai-Hulud campaign used malicious npm packages to target cloud and CI/CD credentials across developer environments. This report details the attack chain, detection opportunities, and mitigation guidance to help organizations identify and disrupt related activity. The post Typosquatted npm packages used to ste
Cloud CISO Perspectives: How to build an AI-ready security program for the public sector
From industrial control systems to decades-old municipal databases, here’s our CISO guidance to prep AI-ready security programs for the public sector.
Play video
New Claude Opus 4.8: 15 Things You May’ve Missed
The ‘best’ generally available AI model just dropped, but there is plenty I bet you missed about what it is, how it performs, and what the release tells us. 15 highlights from the 244 page system card, plus private testing, leader interview and more. AI Insiders ($9!): https://www.patreon.com/AIExplained Chapters: 00:0
How Endava builds an agentic organization with Codex
Learn how Endava uses Codex to build an agentic organization, accelerating software delivery and reducing requirements analysis from weeks to hours.
MUFG aims to become AI-native with OpenAI
MUFG uses ChatGPT Enterprise to build an AI-native organization, improve workflows, and deliver new AI-powered financial services at scale.
OpenAI’s Frontier Governance Framework
Explore OpenAI’s Frontier Governance Framework and how our AI safety, security, and risk practices align with emerging EU and California regulations.
The Gentlemen ransomware: Dissecting a self-propagating Go encryptor
Microsoft Threat Intelligence presents a comprehensive analysis of The Gentlemen, a Go-based ransomware deployed by affiliates of Storm-2697 that combines per-file ephemeral key encryption with an aggressive self-propagation module to deploy itself across an entire network using series of simultaneous lateral movement
Cisco and OpenAI redefine enterprise engineering with Codex
Cisco and OpenAI are redefining enterprise engineering with Codex, helping Cisco scale AI-native development, accelerate AI Defense work, and automate defect remediation.
Election information and safeguards in 2026
Ahead of global elections, we’re helping people access information, supporting cyber defenders, and increasing AI transparency
Warp’s big bet on building open source with GPT-5.5
Warp uses GPT-5.5 and OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows.
Building self-improving tax agents with Codex
See how OpenAI, Thrive, and Crete built a self-improving tax agent with Codex, automating filings, improving accuracy, and accelerating workflows.
Introducing Google AI Threat Defense to help you outpace the adversary
AI Threat Defense is a comprehensive AI-powered cybersecurity solution, an always-on security platform to outpace AI-driven attacks.
ACM CAIS 2026
ACM CAIS 2026 is a research-focused conference on compound AI architectures, optimization, deployment, and agentic AI systems in San Jose, California.
From poisoned search results to GPU mining: A cryptojacking campaign abusing ScreenConnect and Microsoft .NET utilities
Microsoft exposes a cryptojacking campaign using SEO poisoning and ScreenConnect to target high-performance PCs, with malicious sites also surfaced through AI chatbots. The post From poisoned search results to GPU mining: A cryptojacking campaign abusing ScreenConnect and Microsoft .NET utilities appeared first on Micr
OpenAI, Grupo Folha and Grupo UOL announce strategic content partnership
OpenAI partners with Grupo Folha and Grupo UOL to bring trusted Brazilian journalism to ChatGPT, expanding access to news with attribution and transparency.
Project Glasswing: An initial update
Anthropic reports early Project Glasswing results using Mythos Preview with infrastructure partners and external testers, including large-scale vulnerability discovery and a cautious disclosure posture.
How Virgin Atlantic ships faster with Codex
How Virgin Atlantic used Codex to ship its revamped mobile app on a fixed holiday travel deadline, reaching near-total unit test coverage and zero P1 defects.
Microsoft recognized as a Leader in The Forrester Wave™ for Workforce Identity Security Platforms
Microsoft has been recognized as a Leader in The Forrester Wave™: Workforce Identity Security Platforms, Q2 2026, receiving the highest scores in both the current offering and strategy categories. The post Microsoft recognized as a Leader in The Forrester Wave™ for Workforce Identity Security Platforms appeared first o
From edge appliance to enterprise compromise: Multi-stage Linux intrusion via F5 and Confluence
A multi-stage attack on Linux devices began with an exposed F5 BIG-IP edge appliance and pivoted to an internal Confluence server for credential theft and identity compromise. Learn how the threat actor attempted Kerberos relay and lateral movement, and how Microsoft Defender detected, blocked, and unraveled the attack
Microsoft Security success stories: How St. Luke’s and ManpowerGroup are securing AI foundations
How Frontier firms secure AI at scale: read how Microsoft customers embed governance, identity, and cloud security to make protection an enabler of AI growth. The post Microsoft Security success stories: How St. Luke’s and ManpowerGroup are securing AI foundations appeared first on Microsoft Security Blog .
Measuring LLMs' Ability to Develop Exploits
Anthropic evaluates Mythos Preview against ExploitBench, ExploitGym, and an updated smart-contract exploitation benchmark, showing a step change in models that can turn vulnerabilities into working exploit chains.
OpenAI named a Leader in enterprise coding agents by Gartner
OpenAI is named a leader in the 2026 Gartner Magic Quadrant for Enterprise AI Coding Agents, with Codex recognized for innovation and enterprise-scale deployment.
ChatGPT Enterprise & Edu Codex release notes: May 2026
OpenAI’s Enterprise and Edu release notes describe Codex updates including goal mode, browser improvements, locked computer use, app-window context, admin analytics, and plugin sharing status.
AdventHealth advances whole-person care with OpenAI
AdventHealth is using ChatGPT for Healthcare to streamline workflows, reduce administrative burden, and return more time to patient care.
What’s new in Microsoft Security: May 2026
Microsoft Security’s latest updates extend visibility, control, and protection across expanding ecosystems as organizations accelerate AI adoption. The post What’s new in Microsoft Security: May 2026 appeared first on Microsoft Security Blog .
An OpenAI model has disproved a central conjecture in discrete geometry
An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in AI-driven mathematics.
How Ramp engineers accelerate code review with Codex
How Ramp engineers use Codex with GPT-5.5 to review code and ship improvements, allowing them to get substantive feedback in minutes instead of hours.
Mini Shai Hulud: Compromised @antv npm packages enable CI/CD credential theft
Compromised @antv npm packages deploy the Mini Shai-Hulud payload to steal CI/CD secrets from Linux-based automation environments. The malware executes during npm install and targets credentials across GitHub, AWS, Kubernetes, Vault, npm, and 1Password platforms. The post Mini Shai Hulud: Compromised @antv npm packages
Securing the gaming culture of cultures
Read about the unique challenges and rewards of securing gaming platforms and how to better protect gaming communities. The post Securing the gaming culture of cultures appeared first on Microsoft Security Blog .
Play video
Two Rival Bets on AGI: Google I/O Highlights
The biggest Google AI push of the year, but what is the bigger story? Why is Google pursuing a different fork in the road than OpenAI or Anthropic? https://assemblyai.com/aiexplained What does Gemini 3.5 Flash mean for the near-term future of AI? Plus the highlights from a provocative new paper on AI, 8 key moments you
The next phase of OpenAI’s Education for Countries
OpenAI advances Education for Countries, expanding AI adoption in schools with new partnerships, teacher training, and tools to improve global learning outcomes.
Introducing RAMPART and Clarity: Open source tools to bring safety into Agent development workflow
The AI systems shipping inside enterprises today are fundamentally different from the ones we were building even two years ago, because they have moved well past answering questions and into accessing your email, retrieving records from your CRM, writing and executing code, and taking actions on your behalf across doze
Introducing OpenAI for Singapore
OpenAI for Singapore launches a multi-year AI partnership to expand deployment, build local talent, and support businesses and public services with AI.
Advancing content provenance for a safer, more transparent AI ecosystem
OpenAI advances AI content provenance with Content Credentials, SynthID, and a verification tool to help people identify and trust AI-generated media.
Exposing Fox Tempest: A malware-signing service operation
Fox Tempest is a financially motivated threat actor operating a malware‑signing‑as‑a‑service (MSaaS) used by other cybercriminals, including Vanilla Tempest and Storm groups, to more effectively distribute malicious code, including ransomware. The post Exposing Fox Tempest: A malware-signing service operation appeared
OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments
OpenAI and Dell partner to bring Codex to hybrid and on-premise environments, helping enterprises deploy AI coding agents securely across data and workflows.
How Storm-2949 turned a compromised identity into a cloud-wide breach
Storm-2949 turned stolen credentials into a cloud-wide breach, moving from identity compromise to large-scale data theft without using malware. This incident shows how threat actors can exploit trusted systems to operate undetected. The post How Storm-2949 turned a compromised identity into a cloud-wide breach appeared
How to better protect your growing business in an AI-powered world
See how built-in security helps keep your growing business running, protect customer trust, and support growth. The post How to better protect your growing business in an AI-powered world appeared first on Microsoft Security Blog .
OpenAI and Malta partner to bring ChatGPT Plus to all citizens
OpenAI and Malta partner to expand AI access, offering ChatGPT Plus and training to help citizens build practical AI skills and use AI responsibly.
How business operations teams use Codex
See how business operations teams can use Codex to create initiative briefs, strategy updates, leadership decision packets, progress updates, and more from real work inputs.
Databricks brings GPT-5.5 to enterprise agent workflows
Databricks uses GPT-5.5 for enterprise agent workflows after the model set a new state of the art on the OfficeQA Pro benchmark.
How data science teams use Codex
See how data science teams can use Codex to build root-cause briefs, impact readouts, KPI memos, scoped analyses, and dashboard specs from real work inputs.
A new personal finance experience in ChatGPT
Preview a new personal finance experience in ChatGPT for Pro users in the U.S. Securely connect your financial accounts and get AI-powered insights and guidance grounded in your financial context, goals, and priorities.
How sales teams use Codex
See how sales teams can use Codex to create pipeline briefs, meeting prep packets, forecast reviews, account plans, and stalled-deal diagnoses from real work inputs.
Sea's View on the Future of Agentic Software Development with Codex
Sea Limited's CPO explains why the company is deploying Codex across engineering teams to accelerate AI-native software development in Asia.
Work with Codex from anywhere
Use Codex anywhere with the ChatGPT mobile app. Monitor, steer, and approve coding tasks in real time across devices and remote environments.
Helping ChatGPT better recognize context in sensitive conversations
Learn how new ChatGPT safety updates improve context awareness in sensitive conversations, helping detect risk over time and respond more safely.
Defense in depth for autonomous AI agents
Microsoft lays out a defense-in-depth model for autonomous agents, covering new threat classes such as agent hijacking, intent breaking, sensitive data leakage, supply-chain compromise, and inappropriate reliance.
Kazuar: Anatomy of a nation-state botnet
Kazuar, a sophisticated malware family attributed to the Russian state actor Secret Blizzard, has been under constant development for years and continues to evolve in support of espionage-focused operations. Over time, Kazuar has expanded from a relatively traditional backdoor into a highly modular peer-to-peer (P2P) b
Cloud CISO Perspectives: How Google + Wiz changes multicloud strategy for CISOs
By centering developers and shifting security left, Wiz has seen a significant increase in security resolution. Here’s why this strategy matters for CISOs.
When configuration becomes a vulnerability: Exploitable misconfigurations in AI apps
Exposed UIs, weak authentication, and risky defaults could turn cloud-native AI apps on Kubernetes into potential targets by threat actors. Learn how exploitable misconfigurations lead to RCE and data leaks. The post When configuration becomes a vulnerability: Exploitable misconfigurations in AI apps appeared first on
Building a safe, effective sandbox to enable Codex on Windows
Learn how OpenAI built a secure sandbox for Codex on Windows, enabling safe, efficient coding agents with controlled file access and network restrictions.
Our response to the TanStack npm supply chain attack
OpenAI describes its response to the TanStack npm supply-chain attack, including certificate rotation for macOS apps and guidance to update ChatGPT, Codex, and related desktop tooling from official channels.
The new era of SaMD: Why cloud infrastructure is the foundation for digital health in 2026
As SaMD moves from reactive diagnostics to proactive learning systems, cloud has become a superior foundation for regulated medical software.
Beyond source code: The files AI coding agents trust — and attackers exploit
As AI coding agents become embedded in developer workflows, defenders must rethink how to protect against malicious files. Here’s what you need to know.
How finance teams use Codex
See how finance teams can use Codex to build MBRs, reporting packs, variance bridges, model checks, and planning scenarios from real work inputs.
How NVIDIA engineers and researchers build with Codex
Teams use Codex with GPT-5.5 to ship production systems and turn research ideas into runnable experiments.
What Parameter Golf taught us about AI-assisted research
Parameter Golf brought together 1,000+ participants and 2,000+ submissions to explore AI-assisted machine learning research, coding agents, quantization, and novel model design under strict constraints.
Play video
AgentCraft: Putting the Orc in Orchestration — Ido Salomon
AI Engineer session on AgentCraft: Putting the Orc in Orchestration, presented by Ido Salomon. It adds practical context for how teams are building and operating AI systems in production.
Play video
GPT 5.5 Arrives, DeepSeek V4 Drops, and the Compute War Intensifies
GPT 5.5 full analysis, plus DeepSeek V4 paper highlights, comparisons with Mythos, a vibe-coded game w/ GPT Image 2, and 50 data-points you wouldn’t get from just reading the headlines. https://80000hours.org/aiexplained Check out my fast-growing (!) app, free to use, and code INSIDER15 for paid tiers: https://lmcounci
Play video
Agents need more than a chat - Jacob Lauritzen, CTO Legora
AI Engineer session on Agents need more than a chat - Jacob Lauritzen, CTO Legora. It adds practical context for how teams are building and operating AI systems in production.
Play video
Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi
AI Engineer session on Full Workshop: Build Your Own Deep Research Agents - Louis-François Bouchard, Paul Iusztin, Samridhi. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Future of MCP — David Soria Parra, Anthropic
AI Engineer session on The Future of MCP, presented by David Soria Parra, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Harness Engineering: How to Build Software When Humans Steer, Agents Execute — Ryan Lopopolo, OpenAI
AI Engineer session on Harness Engineering: How to Build Software When Humans Steer, Agents Execute, presented by Ryan Lopopolo, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Paperclip: Open Source Human Control Plane for AI Labor — Dotta Bippa
AI Engineer session on Paperclip: Open Source Human Control Plane for AI Labor, presented by Dotta Bippa. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic Engineering: Working With AI, Not Just Using It — Brendan O'Leary
AI Engineer session on Agentic Engineering: Working With AI, Not Just Using It, presented by Brendan O'Leary. It adds practical context for how teams are building and operating AI systems in production.
Play video
Bending a Public MCP Server Without Breaking It — Nimrod Hauser, Baz
AI Engineer session on Bending a Public MCP Server Without Breaking It, presented by Nimrod Hauser, Baz. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Chaos to Choreography: Multi-Agent Orchestration Patterns That Actually Work — Sandipan Bhaumik
AI Engineer session on From Chaos to Choreography: Multi-Agent Orchestration Patterns That Actually Work, presented by Sandipan Bhaumik. It adds practical context for how teams are building and operating AI systems in production.
Play video
Judge the Judge: Building LLM Evaluators That Actually Work with GEPA — Mahmoud Mabrouk, Agenta AI
AI Engineer session on Judge the Judge: Building LLM Evaluators That Actually Work with GEPA, presented by Mahmoud Mabrouk, Agenta AI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Platforms for Humans and Machines: Engineering for the Age of Agents — Juan Herreros Elorza
AI Engineer session on Platforms for Humans and Machines: Engineering for the Age of Agents, presented by Juan Herreros Elorza. It adds practical context for how teams are building and operating AI systems in production.
Play video
Your Insecure MCP Server Won't Survive Production — Tun Shwe, Lenses
AI Engineer session on Your Insecure MCP Server Won't Survive Production, presented by Tun Shwe, Lenses. It adds practical context for how teams are building and operating AI systems in production.
Play video
Claude Mythos: Highlights from 244-page Release
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Assessing Claude Mythos Preview’s cybersecurity capabilities
Claude Mythos Preview is a new general-purpose language model that is strikingly capable at computer security tasks. This post provides technical details for researchers and practitioners who want to understand exactly how we have been testing this model, and what we have found over the past month. We hope this will sh
Detecting and analyzing prompt abuse in AI tools
Microsoft Incident Response explains how to detect prompt abuse using logging, telemetry, and incident response workflows.
Designing AI agents to resist prompt injection
OpenAI frames prompt injection as an agent-security problem that increasingly resembles social engineering rather than simple string matching.
Reverse engineering Claude's CVE-2026-2796 exploit
This post dives deep into how Claude wrote an exploit for one of the vulnerabilities it found in Firefox.
Play video
Deadline Day for Autonomous AI Weapons & Mass Surveillance
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Gemini 3.1 Pro and the Downfall of Benchmarks: Welcome to the Vibe Era of AI
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
MITRE ATLAS OpenClaw Investigation Discovers New and Likeliest Techniques
MITRE maps incidents in an open-source agentic ecosystem to ATLAS techniques, showing how AI-first systems create distinct attacker paths.
Play video
The Two Best AI Models/Enemies Just Got Released Simultaneously
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
LLM-discovered 0-days
AI models can now find high-severity vulnerabilities at scale. This is a moment to empower defenders. We're now using Claude to find and help fix vulnerabilities in open source software.
Play video
Automating Large Scale Refactors with Parallel Agents - Robert Brennan, OpenHands
AI Engineer session on Automating Large Scale Refactors with Parallel Agents - Robert Brennan, OpenHands. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building durable Agents with Workflow DevKit & AI SDK - Peter Wielander, Vercel
AI Engineer session on Building durable Agents with Workflow DevKit & AI SDK - Peter Wielander, Vercel. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Intelligent Research Agents with Manus - Ivan Leo, Manus AI (now Meta Superintelligence)
AI Engineer session on Building Intelligent Research Agents with Manus - Ivan Leo, Manus AI (now Meta Superintelligence). It adds practical context for how teams are building and operating AI systems in production.
Play video
Claude Agent SDK [Full Workshop] — Thariq Shihipar, Anthropic
AI Engineer session on Claude Agent SDK [Full Workshop], presented by Thariq Shihipar, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Identity for AI Agents - Patrick Riley & Carlos Galan, Auth0
AI Engineer session on Identity for AI Agents - Patrick Riley & Carlos Galan, Auth0. It adds practical context for how teams are building and operating AI systems in production.
Play video
OpenAI + @Temporalio : Building Durable, Production Ready Agents - Cornelia Davis, Temporal
AI Engineer session on OpenAI + @Temporalio : Building Durable, Production Ready Agents - Cornelia Davis, Temporal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Spec-Driven Development: Agentic Coding at FAANG Scale and Quality — Al Harris, Amazon Kiro
AI Engineer session on Spec-Driven Development: Agentic Coding at FAANG Scale and Quality, presented by Al Harris, Amazon Kiro. It adds practical context for how teams are building and operating AI systems in production.
Play video
Why Agent Hype can fall short of reality — Joel Becker, METR
AI Engineer session on Why Agent Hype can fall short of reality, presented by Joel Becker, METR. It adds practical context for how teams are building and operating AI systems in production.
Play video
Your MCP Server is Bad (and you should feel bad) - Jeremiah Lowin, Prefect
AI Engineer session on Your MCP Server is Bad (and you should feel bad) - Jeremiah Lowin, Prefect. It adds practical context for how teams are building and operating AI systems in production.
AI Models on Realistic Cyber Ranges
In a recent evaluation of AI models’ cyber capabilities, current Claude models can now succeed at multistage attacks on networks with dozens of hosts using only standard, open-source tools, instead of the custom tools needed by previous generations.
Play video
Anthropic: Our AI just created a tool that can ‘automate all white collar work’, Me:
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Finding Bugs with Claude and Property-based Testing
Ensuring that programs are bug-free is one of the most challenging aspects of software engineering. We developed an agent that can efficiently identify bugs in large software projects. Our agent infers general properties of code that should be true, and then applies property-based testing. After extensive manual valida
Experimenting with AI to Defend Critical Infrastructure
AI could help defenders of critical infrastructure identify the vulnerabilities that attackers might exploit—and close them before they are exploited. Anthropic has partnered with Pacific Northwest National Laboratory (PNNL) to explore this defensive application of AI, demonstrating both the potential of AI-accelerated
Play video
Agent Reinforcement Fine Tuning — Will Hang & Cathy Zhou, OpenAI
AI Engineer session on Agent Reinforcement Fine Tuning, presented by Will Hang & Cathy Zhou, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents are Robots Too: What Self-Driving Taught Me About Building Agents — Jesse Hu, Abundant
AI Engineer session on Agents are Robots Too: What Self-Driving Taught Me About Building Agents, presented by Jesse Hu, Abundant. It adds practical context for how teams are building and operating AI systems in production.
Play video
Backlog.md: Terminal Kanban Board for Managing Tasks with AI Agents — Alex Gavrilescu, Funstage
AI Engineer session on Backlog.md: Terminal Kanban Board for Managing Tasks with AI Agents, presented by Alex Gavrilescu, Funstage. It adds practical context for how teams are building and operating AI systems in production.
Play video
Developer Experience in the Age of AI Coding Agents — Max Kanat-Alexander, Capital One
AI Engineer session on Developer Experience in the Age of AI Coding Agents, presented by Max Kanat-Alexander, Capital One. It adds practical context for how teams are building and operating AI systems in production.
Play video
Developing Taste in Coding Agents: Applied Meta Neuro-Symbolic RL — Ahmad Awais, CommandCode
AI Engineer session on Developing Taste in Coding Agents: Applied Meta Neuro-Symbolic RL, presented by Ahmad Awais, CommandCode. It adds practical context for how teams are building and operating AI systems in production.
Play video
Don't Build Agents, Build Skills Instead — Barry Zhang & Mahesh Murag, Anthropic
AI Engineer session on Don't Build Agents, Build Skills Instead, presented by Barry Zhang & Mahesh Murag, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Enterprise Deep Research: The Next Killer App for Enterprise AI — Ofer Mendelevitch, Vectara
AI Engineer session on Enterprise Deep Research: The Next Killer App for Enterprise AI, presented by Ofer Mendelevitch, Vectara. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Stateless Nightmares to Durable Agents — Samuel Colvin, Pydantic
AI Engineer session on From Stateless Nightmares to Durable Agents, presented by Samuel Colvin, Pydantic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Future-Proof Coding Agents — Bill Chen & Brian Fioca, OpenAI
AI Engineer session on Future-Proof Coding Agents, presented by Bill Chen & Brian Fioca, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Government Agents: AI Agents Meet Tough Regulations — Mark Myshatyn, Los Alamos National Lab
AI Engineer session on Government Agents: AI Agents Meet Tough Regulations, presented by Mark Myshatyn, Los Alamos National Lab. It adds practical context for how teams are building and operating AI systems in production.
Play video
Hacking Subagents Into Codex CLI — Brian John, Betterup
AI Engineer session on Hacking Subagents Into Codex CLI, presented by Brian John, Betterup. It adds practical context for how teams are building and operating AI systems in production.
Play video
Hard Won Lessons from Building Effective AI Coding Agents — Nik Pash, Cline
AI Engineer session on Hard Won Lessons from Building Effective AI Coding Agents, presented by Nik Pash, Cline. It adds practical context for how teams are building and operating AI systems in production.
Play video
Infra that fixes itself, thanks to coding agents — Mahmoud Abdelwahab, Railway
AI Engineer session on Infra that fixes itself, thanks to coding agents, presented by Mahmoud Abdelwahab, Railway. It adds practical context for how teams are building and operating AI systems in production.
Play video
Katelyn Lesse — Evolving Claude APIs for Agents, Anthropic
AI Engineer session on Katelyn Lesse, presented by Evolving Claude APIs for Agents, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Making Codebases Agent Ready — Eno Reyes, Factory AI
AI Engineer session on Making Codebases Agent Ready, presented by Eno Reyes, Factory AI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Proactive Agents — Kath Korevec, Google Labs
AI Engineer session on Proactive Agents, presented by Kath Korevec, Google Labs. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Unbearable Lightness of Agent Optimization — Alberto Romero, Jointly
AI Engineer session on The Unbearable Lightness of Agent Optimization, presented by Alberto Romero, Jointly. It adds practical context for how teams are building and operating AI systems in production.
Play video
What the Freakiness of 2025 in AI Tells Us About 2026
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Continuously hardening ChatGPT Atlas against prompt injection attacks
OpenAI describes using automated red teaming and reinforcement learning to discover agent prompt injection attacks before they appear in the wild.
Play video
Gemini Exponential, Demis Hassabis' ‘Proto-AGI’ coming, but …
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Building a Production-Ready AI Security Foundation
Google Cloud outlines a defense-in-depth view of AI security spanning application controls, data protections, and infrastructure isolation.
Play video
Is GPT-5.1 Really an Upgrade? But Models Can Auto-Hack Govts, so … there’s that
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Did you miss these 2 AI stories? A *Real* LLM-crafted Breakthrough + Continual Learning Blocked?
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Sora 2 - It will only get more realistic from here
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Building an Agentic Platform — Ben Kus, CTO Box
AI Engineer session on Building an Agentic Platform, presented by Ben Kus, CTO Box. It adds practical context for how teams are building and operating AI systems in production.
Play video
An ‘AI Bubble’? What Altman Actually said, the Facts and Nano Banana
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs
AI Engineer session on [Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs. It adds practical context for how teams are building and operating AI systems in production.
Play video
A2A & MCP Workshop: Automating Business Processes with LLMs — Damien Murphy, Bench
AI Engineer session on A2A & MCP Workshop: Automating Business Processes with LLMs, presented by Damien Murphy, Bench. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai
AI Engineer session on Agents vs Workflows: Why Not Both?, presented by Sam Bhagwat, Mastra.ai. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai
AI Engineer session on Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents at Cloud Scale — Antje Barth, AWS
AI Engineer session on Building Agents at Cloud Scale, presented by Antje Barth, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Applications with AI Agents — Michael Albada, Microsoft
AI Engineer session on Building Applications with AI Agents, presented by Michael Albada, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building the platform for agent coordination — Tom Moor, Linear
AI Engineer session on Building the platform for agent coordination, presented by Tom Moor, Linear. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Self-driving to Autonomous Voice Agents — Brooke Hopkins, Coval
AI Engineer session on From Self-driving to Autonomous Voice Agents, presented by Brooke Hopkins, Coval. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Secure Agents using OAuth — Jared Hanson (Keycard, Passport.js)
AI Engineer session on How to Secure Agents using OAuth, presented by Jared Hanson (Keycard, Passport.js). It adds practical context for how teams are building and operating AI systems in production.
Play video
How we hacked YC Spring 2025 batch’s AI agents — Rene Brandel, Casco
AI Engineer session on How we hacked YC Spring 2025 batch’s AI agents, presented by Rene Brandel, Casco. It adds practical context for how teams are building and operating AI systems in production.
Play video
Multi Agent AI and Network Knowledge Graphs for Change — Ola Mabadeje, Cisco
AI Engineer session on Multi Agent AI and Network Knowledge Graphs for Change, presented by Ola Mabadeje, Cisco. It adds practical context for how teams are building and operating AI systems in production.
Play video
OpenAI on Securing Code-Executing AI Agents — Fouad Matin (Codex, Agent Robustness)
AI Engineer session on OpenAI on Securing Code-Executing AI Agents, presented by Fouad Matin (Codex, Agent Robustness). It adds practical context for how teams are building and operating AI systems in production.
Play video
Piloting agents in GitHub Copilot - Christopher Harrison, Microsoft
AI Engineer session on Piloting agents in GitHub Copilot - Christopher Harrison, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily
AI Engineer session on Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily. It adds practical context for how teams are building and operating AI systems in production.
Play video
Scaling AI Agents Without Breaking Reliability — Preeti Somal, Temporal
AI Engineer session on Scaling AI Agents Without Breaking Reliability, presented by Preeti Somal, Temporal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Ship Agents that Ship: A Hands-On Workshop - Kyle Penfound, Jeremy Adams, Dagger
AI Engineer session on Ship Agents that Ship: A Hands-On Workshop - Kyle Penfound, Jeremy Adams, Dagger. It adds practical context for how teams are building and operating AI systems in production.
Play video
Software Development Agents: What Works and What Doesn't - Robert Brennan, OpenHands
AI Engineer session on Software Development Agents: What Works and What Doesn't - Robert Brennan, OpenHands. It adds practical context for how teams are building and operating AI systems in production.
Play video
Your Coding Agent Just Got Cloned And Your Brain Isn't Ready - Rustin Banks, Google Jules
AI Engineer session on Your Coding Agent Just Got Cloned And Your Brain Isn't Ready - Rustin Banks, Google Jules. It adds practical context for how teams are building and operating AI systems in production.
Play video
Genie 3: The World Becomes Playable (DeepMind)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
"Data readiness" is a Myth: Reliable AI with an Agentic Semantic Layer — Anushrut Gupta, PromptQL
AI Engineer session on "Data readiness" is a Myth: Reliable AI with an Agentic Semantic Layer, presented by Anushrut Gupta, PromptQL. It adds practical context for how teams are building and operating AI systems in production.
Play video
(possible dupe but better sound) What does Enterprise Ready MCP mean? — Tobin South, WorkOS
AI Engineer session on (possible dupe but better sound) What does Enterprise Ready MCP mean?, presented by Tobin South, WorkOS. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han
AI Engineer session on [Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents, presented by Daniel Han. It adds practical context for how teams are building and operating AI systems in production.
Play video
[Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai — Nick Nisi, Zack Proser
AI Engineer session on [Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai, presented by Nick Nisi, Zack Proser. It adds practical context for how teams are building and operating AI systems in production.
Play video
12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer
AI Engineer session on 12-Factor Agents: Patterns of reliable LLM applications, presented by Dex Horthy, HumanLayer. It adds practical context for how teams are building and operating AI systems in production.
Play video
3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph
AI Engineer session on 3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic Excellence: Mastering AI Agent Evals w/ Azure AI Evaluation SDK — Cedric Vidal, Microsoft
AI Engineer session on Agentic Excellence: Mastering AI Agent Evals w/ Azure AI Evaluation SDK, presented by Cedric Vidal, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic GraphRAG: AI’s Logical Edge — Stephen Chin, Neo4j
AI Engineer session on Agentic GraphRAG: AI’s Logical Edge, presented by Stephen Chin, Neo4j. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic GraphRAG: Simplifying Retrieval Across Structured & Unstructured Data — Zach Blumenfeld
AI Engineer session on Agentic GraphRAG: Simplifying Retrieval Across Structured & Unstructured Data, presented by Zach Blumenfeld. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents, Access, and the Future of Machine Identity — Nick Nisi (WorkOS) + Lizzie Siegle (Cloudflare)
AI Engineer session on Agents, Access, and the Future of Machine Identity, presented by Nick Nisi (WorkOS) + Lizzie Siegle (Cloudflare). It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Red Teaming Agent: Azure AI Foundry — Nagkumar Arkalgud & Keiji Kanazawa, Microsoft
AI Engineer session on AI Red Teaming Agent: Azure AI Foundry, presented by Nagkumar Arkalgud & Keiji Kanazawa, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Architecting Agent Memory: Principles, Patterns, and Best Practices — Richmond Alake, MongoDB
AI Engineer session on Architecting Agent Memory: Principles, Patterns, and Best Practices, presented by Richmond Alake, MongoDB. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building agent fleet architectures your CISO doesn't hate — Lou Bichard, Gitpod
AI Engineer session on Building agent fleet architectures your CISO doesn't hate, presented by Lou Bichard, Gitpod. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agentic Applications w/ Heroku Managed Inference and Agents — Julián Duque & Anush Dsouza
AI Engineer session on Building Agentic Applications w/ Heroku Managed Inference and Agents, presented by Julián Duque & Anush Dsouza. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents (the hard parts!) - Rita Kozlov, Cloudflare
AI Engineer session on Building Agents (the hard parts!) - Rita Kozlov, Cloudflare. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Code First AI Agents with Azure AI Agent Service — Cedric Vidal, Microsoft
AI Engineer session on Building Code First AI Agents with Azure AI Agent Service, presented by Cedric Vidal, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Effective Voice Agents — Toki Sherbakov + Anoop Kotha, OpenAI
AI Engineer session on Building Effective Voice Agents, presented by Toki Sherbakov + Anoop Kotha, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Multimodal AI Agents From Scratch — Apoorva Joshi, MongoDB
AI Engineer session on Building Multimodal AI Agents From Scratch, presented by Apoorva Joshi, MongoDB. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building voice agents with OpenAI — Dominik Kundel, OpenAI
AI Engineer session on Building voice agents with OpenAI, presented by Dominik Kundel, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
CIAM for AI: Authn/Authz for Agents — Michael Grinich, CEO of WorkOS
AI Engineer session on CIAM for AI: Authn/Authz for Agents, presented by Michael Grinich, CEO of WorkOS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Claude Code & the evolution of agentic coding — Boris Cherny, Anthropic
AI Engineer session on Claude Code & the evolution of agentic coding, presented by Boris Cherny, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Collaborating with Agents in your Software Dev Workflow - Jon Peck & Christopher Harrison, Microsoft
AI Engineer session on Collaborating with Agents in your Software Dev Workflow - Jon Peck & Christopher Harrison, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Conquering Agent Chaos — Rick Blalock, Agentuity
AI Engineer session on Conquering Agent Chaos, presented by Rick Blalock, Agentuity. It adds practical context for how teams are building and operating AI systems in production.
Play video
Containing Agent Chaos — Solomon Hykes, Dagger
AI Engineer session on Containing Agent Chaos, presented by Solomon Hykes, Dagger. It adds practical context for how teams are building and operating AI systems in production.
Play video
Effective agent design patterns in production — Laurie Voss, LlamaIndex
AI Engineer session on Effective agent design patterns in production, presented by Laurie Voss, LlamaIndex. It adds practical context for how teams are building and operating AI systems in production.
Play video
Events are the Wrong Abstraction for Your AI Agents - Mason Egger, Temporal.io
AI Engineer session on Events are the Wrong Abstraction for Your AI Agents - Mason Egger, Temporal.io. It adds practical context for how teams are building and operating AI systems in production.
Play video
Forget RAG Pipelines — Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual
AI Engineer session on Forget RAG Pipelines, presented by Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Copilot to Colleague: Trustworthy Agents for High-Stakes - Joel Hron, CTO Thomson Reuters
AI Engineer session on From Copilot to Colleague: Trustworthy Agents for High-Stakes - Joel Hron, CTO Thomson Reuters. It adds practical context for how teams are building and operating AI systems in production.
Play video
From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva
AI Engineer session on From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva. It adds practical context for how teams are building and operating AI systems in production.
Play video
Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode
AI Engineer session on Full Spec MCP: Hidden Capabilities of the MCP spec, presented by Harald Kirschner, Microsoft/VSCode. It adds practical context for how teams are building and operating AI systems in production.
Play video
How agents will unlock the $500B promise of AI - Donald Hruska, Retool
AI Engineer session on How agents will unlock the $500B promise of AI - Donald Hruska, Retool. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to build Enterprise Aware Agents - Chau Tran, Glean
AI Engineer session on How to build Enterprise Aware Agents - Chau Tran, Glean. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Build Planning Agents without losing control - Yogendra Miraje, Factset
AI Engineer session on How to Build Planning Agents without losing control - Yogendra Miraje, Factset. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe
AI Engineer session on How to Train Your Agent: Building Reliable Agents with RL, presented by Kyle Corbitt, OpenPipe. It adds practical context for how teams are building and operating AI systems in production.
Play video
Introducing Strands Agents, an Open Source AI Agents SDK — Suman Debnath, AWS
AI Engineer session on Introducing Strands Agents, an Open Source AI Agents SDK, presented by Suman Debnath, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Knowledge Graphs in Litigation Agents — Tom Smoker, WhyHow
AI Engineer session on Knowledge Graphs in Litigation Agents, presented by Tom Smoker, WhyHow. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCP is all you need — Samuel Colvin, Pydantic
AI Engineer session on MCP is all you need, presented by Samuel Colvin, Pydantic. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCP Is Not Good Yet — David Cramer, Sentry
AI Engineer session on MCP Is Not Good Yet, presented by David Cramer, Sentry. It adds practical context for how teams are building and operating AI systems in production.
Play video
Memory Masterclass: Make Your AI Agents Remember What They Do! — Mark Bain, AIUS
AI Engineer session on Memory Masterclass: Make Your AI Agents Remember What They Do!, presented by Mark Bain, AIUS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat
AI Engineer session on Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat. It adds practical context for how teams are building and operating AI systems in production.
Play video
Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft
AI Engineer session on Real world MCPs in GitHub Copilot Agent Mode, presented by Jon Peck, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Securing Agents with Open Standards — Bobby Tiernay and Kam Sween, Auth0
AI Engineer session on Securing Agents with Open Standards, presented by Bobby Tiernay and Kam Sween, Auth0. It adds practical context for how teams are building and operating AI systems in production.
Play video
Ship it! Building Production Ready Agents — Mike Chambers, AWS
AI Engineer session on Ship it! Building Production Ready Agents, presented by Mike Chambers, AWS. It adds practical context for how teams are building and operating AI systems in production.
Play video
Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin
AI Engineer session on Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin. It adds practical context for how teams are building and operating AI systems in production.
Play video
Stateful environments for vertical agents — Josh Purtell, Synth Labs
AI Engineer session on Stateful environments for vertical agents, presented by Josh Purtell, Synth Labs. It adds practical context for how teams are building and operating AI systems in production.
Play video
Taming Rogue AI Agents with Observability-Driven Evaluation — Jim Bennett, Galileo
AI Engineer session on Taming Rogue AI Agents with Observability-Driven Evaluation, presented by Jim Bennett, Galileo. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Agent Awakens: Collaborative Development with Copilot - Christopher Harrison, GitHub
AI Engineer session on The Agent Awakens: Collaborative Development with Copilot - Christopher Harrison, GitHub. It adds practical context for how teams are building and operating AI systems in production.
Play video
The emerging skillset of wielding coding agents — Beyang Liu, Sourcegraph / Amp
AI Engineer session on The emerging skillset of wielding coding agents, presented by Beyang Liu, Sourcegraph / Amp. It adds practical context for how teams are building and operating AI systems in production.
Play video
The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify
AI Engineer session on The rise of the agentic economy on the shoulders of MCP, presented by Jan Curn, Apify. It adds practical context for how teams are building and operating AI systems in production.
Play video
To the moon! Navigating deep context in legacy code with Augment Agent — Forrest Brazeal, Matt Ball
AI Engineer session on To the moon! Navigating deep context in legacy code with Augment Agent, presented by Forrest Brazeal, Matt Ball. It adds practical context for how teams are building and operating AI systems in production.
Play video
Training Agentic Reasoners — Will Brown, Prime Intellect
AI Engineer session on Training Agentic Reasoners, presented by Will Brown, Prime Intellect. It adds practical context for how teams are building and operating AI systems in production.
Play video
UX Design Principles for Semi Autonomous Multi Agent Systems — Victor Dibia, Microsoft
AI Engineer session on UX Design Principles for Semi Autonomous Multi Agent Systems, presented by Victor Dibia, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Why Your Agent’s Brain Needs a Playbook: Practical Wins from Using Ontologies - Jesús Barrasa, Neo4j
AI Engineer session on Why Your Agent’s Brain Needs a Playbook: Practical Wins from Using Ontologies - Jesús Barrasa, Neo4j. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Not to Read a Headline on AI (ft. new Olympiad Gold, GPT-5 …)
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Agentic Enterprise - What your CEO must know about AI - Hubert Misztela
AI Engineer session on Agentic Enterprise - What your CEO must know about AI - Hubert Misztela. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agents reported thousands of bugs, how many were real? - Ian Butler and Nick Gregory
AI Engineer session on Agents reported thousands of bugs, how many were real? - Ian Butler and Nick Gregory. It adds practical context for how teams are building and operating AI systems in production.
Play video
Are MCPs Overhyped? A Rant about MCPs — Henry Mao, Smithery
AI Engineer session on Are MCPs Overhyped? A Rant about MCPs, presented by Henry Mao, Smithery. It adds practical context for how teams are building and operating AI systems in production.
Play video
Blender MCP and The Future Of Creative Tools - Siddharth Ahuja
AI Engineer session on Blender MCP and The Future Of Creative Tools - Siddharth Ahuja. It adds practical context for how teams are building and operating AI systems in production.
Play video
Break It 'Til You Make It: Building the Self-Improving Stack for AI Agents - Aparna Dhinakaran
AI Engineer session on Break It 'Til You Make It: Building the Self-Improving Stack for AI Agents - Aparna Dhinakaran. It adds practical context for how teams are building and operating AI systems in production.
Play video
Breaking the Chain: Agent Continuations for Resumable AI Workflows - Greg Benson
AI Engineer session on Breaking the Chain: Agent Continuations for Resumable AI Workflows - Greg Benson. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents with Amazon Nova Act and MCP - Du'An Lightfoot, Amazon (Full Workshop)
AI Engineer session on Building Agents with Amazon Nova Act and MCP - Du'An Lightfoot, Amazon (Full Workshop). It adds practical context for how teams are building and operating AI systems in production.
Play video
Building AI Agents that actually automate Knowledge Work - Jerry Liu, LlamaIndex
AI Engineer session on Building AI Agents that actually automate Knowledge Work - Jerry Liu, LlamaIndex. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Protected MCP Servers — Den Delimarsky and Julia Kasper, MCP Steering Committee & Microsoft
AI Engineer session on Building Protected MCP Servers, presented by Den Delimarsky and Julia Kasper, MCP Steering Committee & Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Reliable Support Agents Using the Effect Typescript Library - Michael Fester
AI Engineer session on Building Reliable Support Agents Using the Effect Typescript Library - Michael Fester. It adds practical context for how teams are building and operating AI systems in production.
Play video
Case Study + Deep Dive: Telemedicine Support Agents with LangGraph/MCP - Dan Mason
AI Engineer session on Case Study + Deep Dive: Telemedicine Support Agents with LangGraph/MCP - Dan Mason. It adds practical context for how teams are building and operating AI systems in production.
Play video
Effective AI Agents Need Data Flywheels, Not The Next Biggest LLM — Sylendran Arunagiri, NVIDIA
AI Engineer session on Effective AI Agents Need Data Flywheels, Not The Next Biggest LLM, presented by Sylendran Arunagiri, NVIDIA. It adds practical context for how teams are building and operating AI systems in production.
Play video
Exposing Agents as MCP servers with mcp-agent: Sarmad Qadri
AI Engineer session on Exposing Agents as MCP servers with mcp-agent: Sarmad Qadri. It adds practical context for how teams are building and operating AI systems in production.
Play video
How agents broke app-level infrastructure - Evan Boyle
AI Engineer session on How agents broke app-level infrastructure - Evan Boyle. It adds practical context for how teams are building and operating AI systems in production.
Play video
Letting AI Interface with your App with MCP — Kent C Dodds
AI Engineer session on Letting AI Interface with your App with MCP, presented by Kent C Dodds. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCP Agent Fine tuning Workshop - Ronan McGovern
AI Engineer session on MCP Agent Fine tuning Workshop - Ronan McGovern. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCP: Origins and Requests For Startups — Theodora Chu, Model Context Protocol PM, Anthropic
AI Engineer session on MCP: Origins and Requests For Startups, presented by Theodora Chu, Model Context Protocol PM, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
MCPs are Boring (or: Why we are losing the Sparkle of LLMs) - Manuel Odendahl
AI Engineer session on MCPs are Boring (or: Why we are losing the Sparkle of LLMs) - Manuel Odendahl. It adds practical context for how teams are building and operating AI systems in production.
Play video
Real AI Agents Need Planning, Not Just Prompting - Yuval Belfer
AI Engineer session on Real AI Agents Need Planning, Not Just Prompting - Yuval Belfer. It adds practical context for how teams are building and operating AI systems in production.
Play video
Remote MCPs: What we learned from shipping — John Welsh, Anthropic
AI Engineer session on Remote MCPs: What we learned from shipping, presented by John Welsh, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Supercharging developer workflow with Amazon Q Developer - Vikash Agrawal
AI Engineer session on Supercharging developer workflow with Amazon Q Developer - Vikash Agrawal. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Agent Native Company — Rick Blalock, Agentuity
AI Engineer session on The Agent Native Company, presented by Rick Blalock, Agentuity. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Current State of Browser Agents - Jerry Wu and Wyatt Marshall
AI Engineer session on The Current State of Browser Agents - Jerry Wu and Wyatt Marshall. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Demo I Wish I'd Had: OpenAI's Agents SDK... serverless! - Brook Riggio
AI Engineer session on The Demo I Wish I'd Had: OpenAI's Agents SDK... serverless! - Brook Riggio. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Future of Qwen: A Generalist Agent Model — Junyang Lin, Alibaba Qwen
AI Engineer session on The Future of Qwen: A Generalist Agent Model, presented by Junyang Lin, Alibaba Qwen. It adds practical context for how teams are building and operating AI systems in production.
Play video
The State of MCP observability: Observable.tools — Alex Volkov and Benjamin Eckel, W&B and Dylibso
AI Engineer session on The State of MCP observability: Observable.tools, presented by Alex Volkov and Benjamin Eckel, W&B and Dylibso. It adds practical context for how teams are building and operating AI systems in production.
Play video
When Will AI Models Blackmail You, and Why?
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Why the Best AI Agents Are Built Without Frameworks (Primitives over Frameworks) — Ahmad Awais, CHAI
AI Engineer session on Why the Best AI Agents Are Built Without Frameworks (Primitives over Frameworks), presented by Ahmad Awais, CHAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
Will Agent evaluation via MCP Stabilize Agent Networks? - Ari Heljakka
AI Engineer session on Will Agent evaluation via MCP Stabilize Agent Networks? - Ari Heljakka. It adds practical context for how teams are building and operating AI systems in production.
Cloud CISO Perspectives: How Google secures AI Agents
Google’s CISO perspective on why agents need a new security paradigm and what changes when models can observe, plan, and act.
Play video
Creating Agents that Co-Create — Karina Nguyen, OpenAI
AI Engineer session on Creating Agents that Co-Create, presented by Karina Nguyen, OpenAI. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Improves at Self-improving
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
o3 breaks (some) records, but AI becomes pay-to-win
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Agent Evals: Finally, With The Map
AI Engineer session on Agent Evals: Finally, With The Map. It adds practical context for how teams are building and operating AI systems in production.
Play video
Agentic Workflows on Vertex AI: Rukma Sen
AI Engineer session on Agentic Workflows on Vertex AI: Rukma Sen. It adds practical context for how teams are building and operating AI systems in production.
Play video
AI Agents, Meet Test Driven Development
AI Engineer session on AI Agents, Meet Test Driven Development. It adds practical context for how teams are building and operating AI systems in production.
Play video
Architecting and Testing Controllable Agents: Lance Martin
AI Engineer session on Architecting and Testing Controllable Agents: Lance Martin. It adds practical context for how teams are building and operating AI systems in production.
Play video
Beyond APIs: How AI Web Agents Are Automating the "Long Tail" of Knowledge Work
AI Engineer session on Beyond APIs: How AI Web Agents Are Automating the "Long Tail" of Knowledge Work. It adds practical context for how teams are building and operating AI systems in production.
Play video
Build an AI Research Agent: Apoorva Joshi
AI Engineer session on Build an AI Research Agent: Apoorva Joshi. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Agents with Model Context Protocol - Full Workshop with Mahesh Murag of Anthropic
AI Engineer session on Building Agents with Model Context Protocol - Full Workshop with Mahesh Murag of Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building AI Agents with Real ROI in the Enterprise SDLC: Bruno (Booking.com) & Beyang (Sourcegraph)
AI Engineer session on Building AI Agents with Real ROI in the Enterprise SDLC: Bruno (Booking.com) & Beyang (Sourcegraph). It adds practical context for how teams are building and operating AI systems in production.
Play video
Building and evaluating AI Agents — Sayash Kapoor, AI Snake Oil
AI Engineer session on Building and evaluating AI Agents, presented by Sayash Kapoor, AI Snake Oil. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building and Scaling an AI Agent Swarm of low latency real time voice bots: Damien Murphy
AI Engineer session on Building and Scaling an AI Agent Swarm of low latency real time voice bots: Damien Murphy. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Multi agent Systems with Finite State Machines
AI Engineer session on Building Multi agent Systems with Finite State Machines. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building Reliable Agentic Systems: Eno Reyes
AI Engineer session on Building Reliable Agentic Systems: Eno Reyes. It adds practical context for how teams are building and operating AI systems in production.
Play video
Building State of the Art Open Weights Tool Use: The Command R Family: Sandra Kublik
AI Engineer session on Building State of the Art Open Weights Tool Use: The Command R Family: Sandra Kublik. It adds practical context for how teams are building and operating AI systems in production.
Play video
Cohere: Building enterprise LLM agents that work (Shaan Desai)
AI Engineer session on Cohere: Building enterprise LLM agents that work (Shaan Desai). It adds practical context for how teams are building and operating AI systems in production.
Play video
Disrupting the $15 Trillion Construction Industry with Autonomous Agents: Dr. Sarah Buchner
AI Engineer session on Disrupting the $15 Trillion Construction Industry with Autonomous Agents: Dr. Sarah Buchner. It adds practical context for how teams are building and operating AI systems in production.
Play video
Emergence Launch: AI Agents and the future enterprise: Dr. Satya Nitta
AI Engineer session on Emergence Launch: AI Agents and the future enterprise: Dr. Satya Nitta. It adds practical context for how teams are building and operating AI systems in production.
Play video
Ensure AI Agents Work: Evaluation Frameworks for Scaling Success — Aparna Dhinkaran, CEO Arize
AI Engineer session on Ensure AI Agents Work: Evaluation Frameworks for Scaling Success, presented by Aparna Dhinkaran, CEO Arize. It adds practical context for how teams are building and operating AI systems in production.
Play video
Finetuning: 500m AI agents in production with 2 engineers — Mustafa Ali & Kyle Corbitt
AI Engineer session on Finetuning: 500m AI agents in production with 2 engineers, presented by Mustafa Ali & Kyle Corbitt. It adds practical context for how teams are building and operating AI systems in production.
Play video
Giving a Voice to AI Agents: Scott Stephenson, CEO, Deepgram
AI Engineer session on Giving a Voice to AI Agents: Scott Stephenson, CEO, Deepgram. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Coding Agents change Software Development Forever - Hailong Zhang
AI Engineer session on How Coding Agents change Software Development Forever - Hailong Zhang. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Deep Research Works - Mukund Sridhar & Aarush Selvan, Google DeepMind
AI Engineer session on How Deep Research Works - Mukund Sridhar & Aarush Selvan, Google DeepMind. It adds practical context for how teams are building and operating AI systems in production.
Play video
How to Improve Your Agents: Academic Lit Review
AI Engineer session on How to Improve Your Agents: Academic Lit Review. It adds practical context for how teams are building and operating AI systems in production.
Play video
How We Build Effective Agents: Barry Zhang, Anthropic
AI Engineer session on How We Build Effective Agents: Barry Zhang, Anthropic. It adds practical context for how teams are building and operating AI systems in production.
Play video
How Windsurf writes 90% of your code with an Agentic IDE - Kevin Hou, Windsurf
AI Engineer session on How Windsurf writes 90% of your code with an Agentic IDE - Kevin Hou, Windsurf. It adds practical context for how teams are building and operating AI systems in production.
Play video
Ionic Launch: Opening the economy to AI agents
AI Engineer session on Ionic Launch: Opening the economy to AI agents. It adds practical context for how teams are building and operating AI systems in production.
Play video
Keynote: Why people think "agent" is a buzzword but it isn't
AI Engineer session on Keynote: Why people think "agent" is a buzzword but it isn't. It adds practical context for how teams are building and operating AI systems in production.
Play video
Lets Build An Agent from Scratch
AI Engineer session on Lets Build An Agent from Scratch. It adds practical context for how teams are building and operating AI systems in production.
Play video
Multi model multimodal and multi agent innovations in Azure AI: Cedric Vidal
AI Engineer session on Multi model multimodal and multi agent innovations in Azure AI: Cedric Vidal. It adds practical context for how teams are building and operating AI systems in production.
Play video
OpenAI for VP's of AI + Advice for Building Agents
AI Engineer session on OpenAI for VP's of AI + Advice for Building Agents. It adds practical context for how teams are building and operating AI systems in production.
Play video
Patrick Dougherty: How to Build AI Agents that Actually Work
AI Engineer session on Patrick Dougherty: How to Build AI Agents that Actually Work. It adds practical context for how teams are building and operating AI systems in production.
Play video
Personal, Local, Private AI Agents: Soumith Chintala
AI Engineer session on Personal, Local, Private AI Agents: Soumith Chintala. It adds practical context for how teams are building and operating AI systems in production.
Play video
Personality Driven Development: Exploring the Frontier of Agents with Attitude
AI Engineer session on Personality Driven Development: Exploring the Frontier of Agents with Attitude. It adds practical context for how teams are building and operating AI systems in production.
Play video
Privacy First Enterprise AI: Building AI Agents that Never Leave Your Security Boundary
AI Engineer session on Privacy First Enterprise AI: Building AI Agents that Never Leave Your Security Boundary. It adds practical context for how teams are building and operating AI systems in production.
Play video
RAG Agents in Prod: 10 Lessons We Learned — Douwe Kiela, creator of RAG
AI Engineer session on RAG Agents in Prod: 10 Lessons We Learned, presented by Douwe Kiela, creator of RAG. It adds practical context for how teams are building and operating AI systems in production.
Play video
Reinforcement Learning for Agents - Will Brown, ML Researcher at Morgan Stanley
AI Engineer session on Reinforcement Learning for Agents - Will Brown, ML Researcher at Morgan Stanley. It adds practical context for how teams are building and operating AI systems in production.
Play video
Rethinking how we Scaffold AI Agents - Rahul Sengottuvelu, Ramp
AI Engineer session on Rethinking how we Scaffold AI Agents - Rahul Sengottuvelu, Ramp. It adds practical context for how teams are building and operating AI systems in production.
Play video
Reverse Conway's law and GenAI: How agents will take over the organisation - Patrick Debois
AI Engineer session on Reverse Conway's law and GenAI: How agents will take over the organisation - Patrick Debois. It adds practical context for how teams are building and operating AI systems in production.
Play video
Scaling Agents for Gen AI Products - Anju Kambadur, Bloomberg Head of AI Engineering
AI Engineer session on Scaling Agents for Gen AI Products - Anju Kambadur, Bloomberg Head of AI Engineering. It adds practical context for how teams are building and operating AI systems in production.
Play video
Self Coding Agents — Colin Flaherty, Augment Code
AI Engineer session on Self Coding Agents, presented by Colin Flaherty, Augment Code. It adds practical context for how teams are building and operating AI systems in production.
Play video
Stateful Agents — Full Workshop with Charles Packer of Letta and MemGPT
AI Engineer session on Stateful Agents, presented by Full Workshop with Charles Packer of Letta and MemGPT. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Agent Development Life Cycle — Zack Reneau-Wedeen, Sierra
AI Engineer session on The Agent Development Life Cycle, presented by Zack Reneau-Wedeen, Sierra. It adds practical context for how teams are building and operating AI systems in production.
Play video
The missing pieces of workflow automation — Shirsha Chaudhuri, Thomson Reuters Labs
AI Engineer session on The missing pieces of workflow automation, presented by Shirsha Chaudhuri, Thomson Reuters Labs. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Price of Intelligence - AI Agent Pricing in 2025
AI Engineer session on The Price of Intelligence - AI Agent Pricing in 2025. It adds practical context for how teams are building and operating AI systems in production.
Play video
This video was edited with AI agent. But how?
AI Engineer session on This video was edited with AI agent. But how?. It adds practical context for how teams are building and operating AI systems in production.
Play video
Tool Calling Is Not Just Plumbing for AI Agents — Roy Derks
AI Engineer session on Tool Calling Is Not Just Plumbing for AI Agents, presented by Roy Derks. It adds practical context for how teams are building and operating AI systems in production.
Play video
Trust, but Verify: Knowledge Agents for Finance Workflows - Mike Conover
AI Engineer session on Trust, but Verify: Knowledge Agents for Finance Workflows - Mike Conover. It adds practical context for how teams are building and operating AI systems in production.
Play video
Using agents to build an agent company: Joao Moura
AI Engineer session on Using agents to build an agent company: Joao Moura. It adds practical context for how teams are building and operating AI systems in production.
Play video
Vercel AI SDK Masterclass: From Fundamentals to Deep Research
AI Engineer session on Vercel AI SDK Masterclass: From Fundamentals to Deep Research. It adds practical context for how teams are building and operating AI systems in production.
Play video
Voice Agent Engineering — Nik Caryotakis, SuperDial
AI Engineer session on Voice Agent Engineering, presented by Nik Caryotakis, SuperDial. It adds practical context for how teams are building and operating AI systems in production.
Play video
Voice Agents: the good, the bad, and the ugly
AI Engineer session on Voice Agents: the good, the bad, and the ugly. It adds practical context for how teams are building and operating AI systems in production.
Play video
Why Agent Engineering — swyx
AI Engineer session on Why Agent Engineering, presented by swyx. It adds practical context for how teams are building and operating AI systems in production.
Play video
Your AI Agent Isn't an Engineer: The Art of Thoughtful Anthropomorphism
AI Engineer session on Your AI Agent Isn't an Engineer: The Art of Thoughtful Anthropomorphism. It adds practical context for how teams are building and operating AI systems in production.
Play video
Manus AI - The Calm Before the Hypestorm … (vs Deep Research + Grok 3)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Announcing AI Protection: Security for the AI era
Google introduced AI Protection and Model Armor to address prompt injection, jailbreaks, data loss, and multicloud AI workload security.
Play video
Claude 3.7 is More Significant than its Name Implies (ft DeepSeek R2 + GPT 4.5 coming soon)
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Nothing Much Happens in AI, Then Everything Does All At Once
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Operator System Card
The Operator system card documents red teaming and mitigation choices for a computer-using agent, with prompt injections listed as a central risk area.
Play video
Altman Expects a ‘Fast Take-off’, ‘Super-Agent’ Debuting Soon and DeepSeek R1 Out
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Enhancing AI safety: Insights and lessons from red teaming
Microsoft summarizes lessons from red teaming more than one hundred generative AI products, emphasizing system-level testing, human expertise, and automation.
Play video
OpenAI Backtracks, Gunning for Superintelligence: Altman Brings His AGI Timeline Closer - '25 to '29
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
OWASP Top 10 for Large Language Model Applications
OWASP’s GenAI security project remains a practical baseline for teams building or assessing LLM applications and agentic systems.
Play video
Never Browse Alone? Gemini 2 Live and ChatGPT Vision
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Breaks Its Silence: OpenAI’s ‘Next 12 Days’, Genie 2, and a Word of Caution
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
New Google Model Ranked ‘No. 1 LLM’, But There’s a Problem
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
The New Claude 3.5 Sonnet: Better, Yes, But Not Just in the Way You Might Think
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
OpenAI: ‘We Just Reached Human-level Reasoning’.
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
New OpenAI Model 'Imminent' and AI Stakes Get Raised (plus Med Gemini, GPT 2 Chatbot and Scale AI)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
The Age of the Agent: Flo Crivello
AI Engineer session on The Age of the Agent: Flo Crivello. It adds practical context for how teams are building and operating AI systems in production.
Play video
‘Her’ AI, Almost Here? Llama 3, Vasa-1, and Altman ‘Plugging Into Everything You Want To Do’
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Agents Take the Wheel: Devin, SIMA, Figure 01 and The Future of Jobs
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
State of AI 2023: Highlights of 163 Page Report + Eureka Self-Improvement, MEG, Suno AI and GPT F
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
An Actually Big Week in AI: AutoGen, The A-Phone, Mistral 7B, GPT-Fathom and Meta Hunts CharacterAI
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
9 AI Developments: HeyGen 2.0 to AjaxGPT, Open Interpreter to NExT-GPT and Roblox AI
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AGI Will Not Be A Chatbot - Autonomy, Acceleration, and Arguments Behind the Scenes
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Google Gemini: AlphaGo-GPT?
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
12 New Code Interpreter Uses (Image to 3D, Book Scans, Multiple Datasets, Error Analysis ... )
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 4 Got Upgraded - Code Interpreter (ft. Image Editing, MP4s, 3D Plots, Data Analytics and more!)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 4 is Smarter than You Think: Introducing SmartGPT
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
8 Signs It's The Future: Thought-to-Text, Nvidia Text-to-Video, Character AI, and P(Doom) @Ted
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
‘We Must Slow Down the Race’ – X AI, GPT 4 Can Now Do Science and Altman GPT 5 Statement
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Can GPT 4 Prompt Itself? MemoryGPT, AutoGPT, Jarvis, Claude-Next [10x GPT 4!] and more...
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.