Adversarial ML gives useful language for thinking about evasion, poisoning, privacy attacks, and model misuse. It becomes especially valuable when teams need to distinguish classic model risk from the broader application-layer risks found in modern LLM systems.
Adversarial ML
Adversarial machine learning attacks, taxonomies, and mitigations across the ML lifecycle.
- Model-centric attacks versus system-centric AI failures
- Poisoning, evasion, privacy, and misuse in practical terms
- Where adversarial ML taxonomies still help in generative AI discussions
- It creates cleaner language for policy, governance, and technical review
- It helps teams choose the right controls for the right threat model
- It prevents application-layer AI risk from being described too vaguely
- Security teams linking GenAI issues to broader model risk
- Researchers working across predictive and generative AI
- Practitioners who need disciplined taxonomy and terminology
Current notes, events, and source material
These items are included because they add useful evidence, framing, implementation detail, or upcoming context for teams working in this area.
NeurIPS 2026
NeurIPS 2026 is the fortieth annual Conference on Neural Information Processing Systems, with the primary dates listed for Sydney, Australia, and additional satellite locations in Atlanta and Paris.
GAISS 2026: IEEE GenAI for Secure Systems
GAISS 2026 is an IEEE conference at the University of Texas at Austin focused on generative AI for secure systems, including red teaming, blue-team automation, governance, and agentic secure AI.
DEF CON 34 / AI Village 2026
DEF CON 34 takes place in Las Vegas and is expected to include AI security activity through villages, workshops, contests, and community-led research tracks as schedules firm up.
Black Hat USA 2026 AI Summit
Black Hat USA 2026 includes an AI Summit and security briefings in Las Vegas focused on how artificial intelligence is changing digital defense.
ICML 2026
ICML 2026 takes place at COEX in Seoul, South Korea, with tutorials, main conference sessions, and workshops covering core machine learning research.
Play video
Claude Opus 4.7 - A New Frontier, in Performance … and Drama
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Bending a Public MCP Server Without Breaking It — Nimrod Hauser, Baz
AI Engineer session on Bending a Public MCP Server Without Breaking It, presented by Nimrod Hauser, Baz. It adds practical context for how teams are building and operating AI systems in production.
Play video
Two AI Models Set to “stir government urgency”, But Will This Challenge Undo Them?
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Gemini 3.1 Pro and the Downfall of Benchmarks: Welcome to the Vibe Era of AI
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
The Two Best AI Models/Enemies Just Got Released Simultaneously
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Claude AI Co-founder Publishes 4 Big Claims about Near Future: Breakdown
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
What the Freakiness of 2025 in AI Tells Us About 2026
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Gemini Exponential, Demis Hassabis' ‘Proto-AGI’ coming, but …
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 5.2: OpenAI Strikes Back
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Building a Production-Ready AI Security Foundation
Google Cloud outlines a defense-in-depth view of AI security spanning application controls, data protections, and infrastructure isolation.
Play video
Nano Banana Pro: But Did You Catch These 10 Details?
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Is GPT-5.1 Really an Upgrade? But Models Can Auto-Hack Govts, so … there’s that
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
An ‘AI Bubble’? What Altman Actually said, the Facts and Nano Banana
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Scaling AI Agents Without Breaking Reliability — Preeti Somal, Temporal
AI Engineer session on Scaling AI Agents Without Breaking Reliability, presented by Preeti Somal, Temporal. It adds practical context for how teams are building and operating AI systems in production.
Play video
GPT-5 has Arrived
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Red Teaming Agent: Azure AI Foundry — Nagkumar Arkalgud & Keiji Kanazawa, Microsoft
AI Engineer session on AI Red Teaming Agent: Azure AI Foundry, presented by Nagkumar Arkalgud & Keiji Kanazawa, Microsoft. It adds practical context for how teams are building and operating AI systems in production.
Play video
Production software keeps breaking and it will only get worse — Anish Agarwal, Traversal.ai
AI Engineer session on Production software keeps breaking and it will only get worse, presented by Anish Agarwal, Traversal.ai. It adds practical context for how teams are building and operating AI systems in production.
Play video
Prompt Engineering and AI Red Teaming — Sander Schulhoff, HackAPrompt/LearnPrompting
AI Engineer session on Prompt Engineering and AI Red Teaming, presented by Sander Schulhoff, HackAPrompt/LearnPrompting. It adds practical context for how teams are building and operating AI systems in production.
Play video
When Vectors Break Down: Graph-Based RAG for Dense Enterprise Knowledge - Sam Julien, Writer
AI Engineer session on When Vectors Break Down: Graph-Based RAG for Dense Enterprise Knowledge - Sam Julien, Writer. It adds practical context for how teams are building and operating AI systems in production.
Play video
Grok 4 - 10 New Things to Know
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Breaking the Chain: Agent Continuations for Resumable AI Workflows - Greg Benson
AI Engineer session on Breaking the Chain: Agent Continuations for Resumable AI Workflows - Greg Benson. It adds practical context for how teams are building and operating AI systems in production.
Play video
Apple’s ‘AI Can’t Reason’ Claim Seen By 13M+, What You Need to Know
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Accelerates: New Gemini Model + AI Unemployment Stories Analysed
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Claude 4: Full 120 Page Breakdown … Is it the Best New Model?
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Google Takes No Prisoners Amid Torrent of AI Announcements
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
o3 breaks (some) records, but AI becomes pay-to-win
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Breaking AI's 1-GHz Barrier: Sunny Madra (Groq)
AI Engineer session on Breaking AI's 1-GHz Barrier: Sunny Madra (Groq). It adds practical context for how teams are building and operating AI systems in production.
Play video
How Codeium Breaks Through the Ceiling for Retrieval: Kevin Hou
AI Engineer session on How Codeium Breaks Through the Ceiling for Retrieval: Kevin Hou. It adds practical context for how teams are building and operating AI systems in production.
Play video
The Adversarial Path to the Personal Assistant: Sumit Agarwal
AI Engineer session on The Adversarial Path to the Personal Assistant: Sumit Agarwal. It adds practical context for how teams are building and operating AI systems in production.
Play video
Understanding AI Stakes to Break Production Code: Philip Rathle
AI Engineer session on Understanding AI Stakes to Break Production Code: Philip Rathle. It adds practical context for how teams are building and operating AI systems in production.
Play video
o3 and o4-mini - they’re great, but easy to over-hype
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Gemini 2.5 Pro - It’s a Darn Smart Chatbot … (New Simple High Score)
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Did AI Just Get Commoditized? Gemini 2.5, New DeepSeek V3, & Microsoft vs OpenAI
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations
NIST finalizes AI 100-2e2025, providing a terminology and taxonomy for adversarial machine learning across predictive and generative AI systems.
Play video
Manus AI - The Calm Before the Hypestorm … (vs Deep Research + Grok 3)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 4.5 - not so much wow
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Deep Research by OpenAI - The Ups and Downs vs DeepSeek R1 Search + Gemini Deep Research
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Nothing Much Happens in AI, Then Everything Does All At Once
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Altman Expects a ‘Fast Take-off’, ‘Super-Agent’ Debuting Soon and DeepSeek R1 Out
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
OpenAI Backtracks, Gunning for Superintelligence: Altman Brings His AGI Timeline Closer - '25 to '29
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
OWASP Top 10 for Large Language Model Applications
OWASP’s GenAI security project remains a practical baseline for teams building or assessing LLM applications and agentic systems.
Play video
o3 - wow
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Never Browse Alone? Gemini 2 Live and ChatGPT Vision
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
o1 Pro Mode – ChatGPT Pro Full Analysis (plus o1 paper highlights)
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Breaks Its Silence: OpenAI’s ‘Next 12 Days’, Genie 2, and a Word of Caution
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
New Google Model Ranked ‘No. 1 LLM’, But There’s a Problem
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
The New Claude 3.5 Sonnet: Better, Yes, But Not Just in the Way You Might Think
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
OpenAI: ‘We Just Reached Human-level Reasoning’.
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
ChatGPT o1 - In-Depth Analysis and Reaction (o1-preview)
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Llama 405b: Full 92 page Analysis, and Uncontaminated SIMPLE Benchmark Results
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT-4o - Full Breakdown + Bonus Details
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
New OpenAI Model 'Imminent' and AI Stakes Get Raised (plus Med Gemini, GPT 2 Chatbot and Scale AI)
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Move Fast Break Nothing: Dedy Kredo
AI Engineer session on Move Fast Break Nothing: Dedy Kredo. It adds practical context for how teams are building and operating AI systems in production.
Play video
‘Her’ AI, Almost Here? Llama 3, Vasa-1, and Altman ‘Plugging Into Everything You Want To Do’
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Udio, the Mysterious GPT Update, and Infinite Attention
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
AI Agents Take the Wheel: Devin, SIMA, Figure 01 and The Future of Jobs
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
The New, Smartest AI: Claude 3 – Tested vs Gemini 1.5 + GPT-4
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Gemini 1.5 and The Biggest Night in AI
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
SmartGPT: Major Benchmark Broken - 89.0% on MMLU + Exam's Many Errors
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Llama 2: Full Breakdown
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Orca: The Model Few Saw Coming
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
Enter PaLM 2 (New Bard): Full Breakdown - 92 Pages Read and Gemini Before GPT 5? Google I/O
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 4 is Smarter than You Think: Introducing SmartGPT
This AI Explained video reviews a major AI development through the lens of agentic workflows and tool-use risk. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 4: 9 Revelations (not covered elsewhere)
This AI Explained video reviews a major AI development through the lens of governance and responsible deployment. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
GPT 5 is All About Data
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.
Play video
8 Ways ChatGPT 4 [Is] Better Than ChatGPT
This AI Explained video reviews a major AI development through the lens of benchmarks and evaluation evidence. It is useful context for AI engineering, evaluation, governance, and operational risk.