AIEWF 2025 Complete Playlist

12:43

Rishabh Garg, Tesla Optimus — Challenges in High Performance Robotics Systems

AI Engineer

19:06

Building an Agentic Platform — Ben Kus, CTO Box

AI Engineer

19:46

Five hard earned lessons about Evals — Ankur Goyal, Braintrust

AI Engineer

16:28

Perceptual Evaluations: Evals for Aesthetics — Diego Rodriguez, Krea.ai

AI Engineer

18:47

How BlackRock Builds Custom Knowledge Apps at Scale — Vaibhav Page & Infant Vasanth, BlackRock

AI Engineer

15:35

Form factors for your new AI coworkers — Craig Wattrus, Flatfile

AI Engineer

19:12

Fuzzing in the GenAI Era — Leonard Tang, Haize Labs

AI Engineer

18:49

Multi Agent AI and Network Knowledge Graphs for Change — Ola Mabadeje, Cisco

AI Engineer

18:43

Wisdom-Driven Knowledge Augmented Generation at Scale - Chin Keong Lam, Patho AI

AI Engineer

22:16

The Next Unicorns: 7 Top AI startups from the HF0 Residency

AI Engineer

41:05

#define AI Engineer - Greg Brockman, OpenAI (ft. Jensen Huang)

AI Engineer

5:14

The Future of Evals - Ankur Goyal, Braintrust

AI Engineer

13:02

Designing AI-Intensive Applications - swyx

AI Engineer

19:23

How to look at your data — Jeff Huber (Chroma) + Jason Liu (567)

AI Engineer

19:12

On Engineering AI Systems that Endure The Bitter Lesson - Omar Khattab, DSPy & Databricks

AI Engineer

15:22

Evals Are Not Unit Tests — Ido Pesok, Vercel v0

AI Engineer

19:14

2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI

AI Engineer

20:55

Vibe Coding with Confidence — Itamar Friedman, Qodo

AI Engineer

17:49

AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL

AI Engineer

1:09:41

Full Workshop: Realtime Voice AI — Mark Backman, Daily

AI Engineer

17:24

Vision AI in 2025 — Peter Robicheaux, Roboflow

AI Engineer

14:55

Practical tactics to build reliable AI apps — Dmitry Kuchin, Multinear

AI Engineer

7:30

How to Improve your Vibe Coding — Ian Butler

AI Engineer

15:34

Vibes won't cut it — Chris Kelly, Augment Code

AI Engineer

1:19:33

Real World Development with GitHub Copilot and VS Code — Harald Kirschner, Christopher Harrison

AI Engineer

19:00

Building Agents at Cloud Scale — Antje Barth, AWS

AI Engineer

23:52

State of Startups and AI 2025 - Sarah Guo, Conviction

AI Engineer

19:58

Useful General Intelligence — Danielle Perszyk, Amazon AGI

AI Engineer

12:33

The 2025 AI Engineering Report — Barr Yaron, Amplify

AI Engineer

15:37

Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai

AI Engineer

14:18

Why We Don’t Need More Data Centers - Dr. Jasper Zhang, Hyperbolic

AI Engineer

19:31

Infrastructure for the Singularity — Jesse Han, Morph

AI Engineer

20:25

Hacking the Inference Pareto Frontier - Kyle Kranen, NVIDIA

AI Engineer

26:46

Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily

AI Engineer

1:01:42

[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs

AI Engineer

19:32

From Self-driving to Autonomous Voice Agents — Brooke Hopkins, Coval

AI Engineer

27:03

Why ChatGPT Keeps Interrupting You — Dr. Tom Shapland, LiveKit

AI Engineer

16:30

Your realtime AI is ngmi — Sean DuBois (OpenAI), Kwindla Kramer (Daily)

AI Engineer

16:09

Serving Voice AI at $1/hr: Open-source, LoRAs, Latency, Load Balancing - Neil Dwyer, Gabber

AI Engineer

20:12

How to defend your sites from AI bots — David Mytton, Arcjet

AI Engineer

20:36

The Unofficial Guide to Apple’s Private Cloud Compute - Jmo, CONFSEC

AI Engineer

18:59

How to Secure Agents using OAuth — Jared Hanson (Keycard, Passport.js)

AI Engineer

17:33

How we hacked YC Spring 2025 batch’s AI agents — Rene Brandel, Casco

AI Engineer

14:00

OpenAI on Securing Code-Executing AI Agents — Fouad Matin (Codex, Agent Robustness)

AI Engineer

20:33

Evaluating AI Search: A Practical Framework for Augmented AI Systems — Quotient AI + Tavily

AI Engineer

16:40

Scaling Enterprise-Grade RAG: Lessons from Legal Frontier - Calvin Qi (Harvey), Chang She (Lance)

AI Engineer

22:18

Building Alice’s Brain: an AI Sales Rep that Learns Like a Human - Sherwood & Satwik, 11x

AI Engineer

20:22

Layering every technique in RAG, one query at a time - David Karam, Pi Labs (fmr. Google Search)

AI Engineer

18:42

Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai

AI Engineer

40:28

[Full Workshop] Building Metrics that actually work — David Karam, Pi Labs (fmr Google Search)

AI Engineer

19:18

Make your LLM app a Domain Expert: How to Build an Expert System — Christopher Lovejoy, Anterior

AI Engineer

19:34

Shipping Products When You Don't Know What they Can Do — Ben Stein, Teammates

AI Engineer

16:17

Shipping something to someone always wins — Kenneth Auchenberg (ex. Stripe, VSCode)

AI Engineer

18:37

Why your product needs an AI product manager, and why it should be you — James Lowe, i.AI

AI Engineer

25:15

Everything is ugly, so go build something that isn't — Raiza Martin, Huxe (ex NotebookLM)

AI Engineer

19:43

Building the platform for agent coordination — Tom Moor, Linear

AI Engineer

17:47

What Is a Humanoid Foundation Model? An Introduction to GR00T N1 - Annika & Aastha

AI Engineer

18:42

Real-time Experiments with an AI Co-Scientist - Stefania Druga, fmr. Google DeepMind

AI Engineer

15:01

Scaling AI Agents Without Breaking Reliability — Preeti Somal, Temporal

AI Engineer

16:33

Government Agents: AI Agents vs Tough Regulations — Mark Myshatyn, Los Alamos National Laboratory

AI Engineer

1:21:00

Ship Agents that Ship: A Hands-On Workshop - Kyle Penfound, Jeremy Adams, Dagger

AI Engineer

34:17

The AI Engineer’s Guide to Raising VC — Dani Grant (Jam), Chelcie Taylor (Notable)

AI Engineer

32:28

Strategies for LLM Evals (GuideLLM, lm-eval-harness, OpenAI Evals Workshop) — Taylor Jordan Smith

AI Engineer

21:11

Why you should care about AI interpretability - Mark Bissell, Goodfire AI

AI Engineer

1:48:07

Information Retrieval from the Ground Up - Philipp Krenn, Elastic

AI Engineer

43:42

Introduction to LLM serving with SGLang - Philip Kiely and Yineng Zhang, Baseten

AI Engineer

18:07

Robotics: why now? - Quan Vuong and Jost Tobias Springenberg, Physical Intelligence

AI Engineer

17:28

Waymo's EMMA: Teaching Cars to Think - Jyh Jing Hwang, Waymo

AI Engineer

1:23:14

A2A & MCP Workshop: Automating Business Processes with LLMs — Damien Murphy, Bench

AI Engineer

16:06

Ship Production Software in Minutes, Not Months — Eno Reyes, Factory

AI Engineer

17:59

Beyond the Prototype: Using AI to Write High-Quality Code - Josh Albrecht, Imbue

AI Engineer

16:46

Software Development Agents: What Works and What Doesn't - Robert Brennan, AllHands/OpenHands

AI Engineer

16:13

Devin 2.0 and the Future of SWE - Scott Wu, Cognition

AI Engineer

13:40

Your Coding Agent Just Got Cloned And Your Brain Isn't Ready - Rustin Banks, Google Jules

AI Engineer

53:54

Latent Space Paper Club: AIEWF Special Edition (Test of Time, DeepSeek R1/V3) — Vibhu Sapra

AI Engineer

12:02

Human seeded Evals — Samuel Colvin, Pydantic

AI Engineer

18:42

Building AI Products That Actually Work — Ben Hylak (Raindrop), Sid Bendre (Oleve)

AI Engineer

18:55

Rise of the AI Architect — Clay Bavor, Cofounder, Sierra w/ Alessio Fanelli

AI Engineer

18:19

AI That Pays: Lessons from Revenue Cycle — Nathan Wan, Ensemble Health

AI Engineer

17:40

Structuring a modern AI team — Denys Linkov, Wisedocs

AI Engineer

16:50

The Rise of Open Models in the Enterprise — Amir Haghighat, Baseten

AI Engineer

19:06

Mentoring the Machine — Eric Hou, Augment Code

AI Engineer

15:50

Building Applications with AI Agents — Michael Albada, Microsoft

AI Engineer

15:25

AX is the only Experience that Matters - Ivan Burazin, Daytona

AI Engineer

19:53

How to build Enterprise Aware Agents - Chau Tran, Glean

AI Engineer

18:18

Monetizing AI — Alvaro Morales, Orb

AI Engineer

18:12

Does AI Actually Boost Developer Productivity? (100k Devs Study) - Yegor Denisov-Blanch, Stanford

AI Engineer

16:22

How agents will unlock the $500B promise of AI - Donald Hruska, Retool

AI Engineer

18:59

How Intuit uses LLMs to explain taxes to millions of taxpayers - Jaspreet Singh, Intuit

AI Engineer

20:55

3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph

AI Engineer

19:29

From Hype to Habit: How We’re Building an AI-First SaaS Company—While Still Shipping the Roadmap

AI Engineer

19:37

Machines of Buying and Selling Grace - Adam Behrens, New Generation

AI Engineer

15:58

How to Build Planning Agents without losing control - Yogendra Miraje, Factset

AI Engineer

21:12

Building Agents (the hard parts!) - Rita Kozlov, Cloudflare

AI Engineer

19:16

POC to PROD: Hard Lessons from 200+ Enterprise GenAI Deployments - Randall Hunt, Caylent

AI Engineer

18:10

Build Dynamic Products, and Stop the AI Sideshow — Eliza Cabrera (Workday) + Jeremy Silva (Freeplay)

AI Engineer

17:04

The Billable Hour is Dead; Long Live the Billable Hour — Kevin Madura + Mo Bhasin, Alix Partners

AI Engineer

19:45

From Copilot to Colleague: Trustworthy Agents for High-Stakes - Joel Hron, CTO Thomson Reuters

AI Engineer

6:45

How to Hire AI Engineers when EVERYONE is cheating with AI — Beth Glenfield, DevDay

AI Engineer

6:51

Stateful environments for vertical agents — Josh Purtell, Synth Labs

AI Engineer

9:44

Books reimagined: AI to create new experiences for things you know — Lukasz Gandecki, TheBrain.pro

AI Engineer

10:21

AI powered entomology: Lessons from millions of AI code reviews — Tomas Reimers, Graphite

AI Engineer

19:04

Critical AI Inference your CIO can Trust — Sahil Yadav, Hariharan Ganesan, Telemetrak

AI Engineer

9:24

How to run Evals at Scale: Thinking beyond Accuracy or Similarity — Muktesh Mishra, Adobe

AI Engineer

11:31

Continuous Profiling for GPUs — Matthias Loibl, Polar Signals

AI Engineer

4:23

Top Ten Challenges to Reach AGI — Stephen Chin, Andreas Kollegger

AI Engineer

19:46

Practical GraphRAG: Making LLMs smarter with Knowledge Graphs — Michael, Jesus, and Stephen, Neo4j

AI Engineer

19:13

Knowledge Graphs in Litigation Agents — Tom Smoker, WhyHow

AI Engineer

15:47

When Vectors Break Down: Graph-Based RAG for Dense Enterprise Knowledge - Sam Julien, Writer

AI Engineer

7:02

Stop Using RAG as Memory — Daniel Chalef, Zep

AI Engineer

20:24

HybridRAG: A Fusion of Graph and Vector Retrieval - Mitesh Patel, NVIDIA

AI Engineer

18:45

tldraw.computer - Steve Ruiz, tldraw

AI Engineer

16:59

Excalidraw: AI and Human Whiteboarding Partnership - Christopher Chedeau

AI Engineer

14:24

The Bitter Layout or: How I Learned to Love the Model Picker — Maximillian Piras, Yutori

AI Engineer

20:13

CIAM for AI: Authn/Authz for Agents — Michael Grinich, CEO of WorkOS

AI Engineer

20:25

Good design hasn’t changed with AI — John Pham, SF Compute

AI Engineer

17:17

Building Effective Voice Agents — Toki Sherbakov + Anoop Kotha, OpenAI

AI Engineer

18:58

Robots as professional Chefs - Nikhil Abraham, CloudChef

AI Engineer

2:42:28

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

AI Engineer

20:28

Google Photos Magic Editor: GenAI Under the Hood of a Billion-User App - Kelvin Ma, Google Photos

AI Engineer

19:03

Dream Machine: Scaling to 1m users in 4 days — Keegan McCallum, Luma AI

AI Engineer

51:25

ComfyUI Full Workshop — first workshop from ComfyAnonymous himself!

AI Engineer

19:26

Design like Karpathy is watching — Zeke Sikelianos, Replicate

AI Engineer

18:35

On Curiosity — Sharif Shameem, Lexica

AI Engineer

14:27

Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft

AI Engineer

18:08

The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify

AI Engineer

14:53

Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode

AI Engineer

17:10

Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin

AI Engineer

17:14

The State of Generative Media - Gorkem Yurtseven, FAL

AI Engineer

22:51

Teaching Gemini to Speak YouTube: Adapting LLMs for Video Recommendations to 2B+DAU - Devansh Tandon

AI Engineer

21:10

Transforming search and discovery using LLMs — Tejaswi & Vinesh, Instacart

AI Engineer

22:28

Netflix's Big Bet: One model to rule recommendations - Yesu Feng, Netflix

AI Engineer

21:59

360Brew: LLM-based Personalized Ranking and Recommendation - Hamed and Maziar, LinkedIn AI

AI Engineer

18:13

What We Learned from Using LLMs in Pinterest — Mukuntha Narayanan, Han Wang, Pinterest

AI Engineer

18:28

Measuring AGI: Interactive Reasoning Benchmarks for ARC-AGI-3 — Greg Kamradt, ARC Prize Foundation

AI Engineer

19:27

RL for Autonomous Coding — Aakanksha Chowdhery, Reflection.ai

AI Engineer

20:54

Recsys Keynote: Improving Recommendation Systems & Search in the Age of LLMs - Eugene Yan, Amazon

AI Engineer

15:44

Benchmarks Are Memes: How What We Measure Shapes AI—and Us - Alex Duffy, Every.to

AI Engineer

17:36

Small AI Teams with Huge Impact — Vik Paruchuri, Datalab

AI Engineer

18:06

Rethinking Team Building: how a 30-person Startup serves 50 Million Users — Grant Lee, Gamma

AI Engineer

12:03

Building a 10 person unicorn - Max Brodeur-Urbas, Gumloop

AI Engineer

18:47

Using OSS models to build AI apps with millions of users — Hassan El Mghari

AI Engineer

17:33

Bolt.new: How we scaled $0-20m ARR in 60 days, with 15 people — Eric Simons, Bolt

AI Engineer

2:01:05

Prompt Engineering and AI Red Teaming — Sander Schulhoff, HackAPrompt/LearnPrompting

AI Engineer

14:10

Survive the AI Knife Fight: Building Products That Win — Brian Balfour, Reforge

AI Engineer

58:18

Automating Escrow with USDC and AI - Corey Cooper, Circle

AI Engineer

1:41:34

How LLMs work for Web Devs: GPT in 600 lines of Vanilla JS - Ishan Anand

AI Engineer

1:51:14

[Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai — Nick Nisi, Zack Proser

AI Engineer

1:44:51

AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid, Google DeepMind

AI Engineer

21:36

The New Code — Sean Grove, OpenAI

AI Engineer

18:13

Production software keeps breaking and it will only get worse — Anish Agarwal, Traversal.ai

AI Engineer

18:13

Thinking Deeper in Gemini — Jack Rae, Google DeepMind

AI Engineer

11:58

A year of Gemini progress + what comes next — Logan Kilpatrick, Google DeepMind

AI Engineer

18:30

2025 in LLMs so far, illustrated by Pelicans on Bicycles — Simon Willison

AI Engineer

17:52

Trends Across the AI Frontier — George Cameron, ArtificialAnalysis.ai

AI Engineer

19:17

Training Agentic Reasoners — Will Brown, Prime Intellect

AI Engineer

18:31

New York Times' Connections: A Case Study on NLP in Word Games — Shafik Quoraishee, NYT Games

AI Engineer

18:12

Claude Code & the evolution of agentic coding — Boris Cherny, Anthropic

AI Engineer

17:06

12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer

AI Engineer

16:41

MCP Is Not Good Yet — David Cramer, Sentry

AI Engineer

19:26

Your Personal Open-Source Humanoid Robot for $8,999 — JX Mo, K-Scale Labs

AI Engineer

12:50

The Build-Operate Divide: Bridging Product Vision and AI Operational Reality

AI Engineer

13:26

The New Lean Startup — Sid Bendre, Oleve

AI Engineer

15:13

Optimizing inference for voice models in production - Philip Kiely, Baseten

AI Engineer

14:40

Conquering Agent Chaos — Rick Blalock, Agentuity

AI Engineer

1:25:08

[Evals Workshop] Mastering AI Evaluation: From Playground to Production

AI Engineer

1:18:35

Intro to GraphRAG — Zach Blumenfeld

AI Engineer

18:41

Securing Agents with Open Standards — Bobby Tiernay and Kam Sween, Auth0

AI Engineer

35:06

The emerging skillset of wielding coding agents — Beyang Liu, Sourcegraph / Amp

AI Engineer

16:15

Turning Fails into Features: Zapier’s Hard-Won Eval Lessons — Rafal Willinski, Vitor Balocco, Zapier

AI Engineer

1:25:35

Building voice agents with OpenAI — Dominik Kundel, OpenAI

AI Engineer

23:48

Containing Agent Chaos — Solomon Hykes, Dagger

AI Engineer

48:31

Evals 101 — Doug Guthrie, Braintrust

AI Engineer

5:41

Why should anyone care about Evals? — Manu Goyal, Braintrust

AI Engineer

24:46

Engineering Better Evals: Scalable LLM Evaluation Pipelines That Work — Dat Ngo, Aman Khan, Arize

AI Engineer

15:24

To the moon! Navigating deep context in legacy code with Augment Agent — Forrest Brazeal, Matt Ball

AI Engineer

17:05

Serving Voice AI at Scale — Arjun Desai (Cartesia) & Rohit Talluri (AWS)

AI Engineer

19:37

Ship it! Building Production Ready Agents — Mike Chambers, AWS

AI Engineer

14:26

Introducing Strands Agents, an Open Source AI Agents SDK — Suman Debnath, AWS

AI Engineer

20:10

Data is Your Differentiator: Building Secure and Tailored AI Systems — Mani Khanuja, AWS

AI Engineer

1:43:46

How to build world-class AI products — Sarah Sachs (AI lead @ Notion) & Carlos Esteban (Braintrust)

AI Engineer

53:15

From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva

AI Engineer

1:15:43

Forget RAG Pipelines—Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual

AI Engineer

21:43

Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat

AI Engineer

18:46

Realtime Conversational Video with Pipecat and Tavus — Chad Bailey and Brian Johnson, Daily & Tavus

AI Engineer

14:10

Vector Search Benchmark[eting] - Philipp Krenn, Elastic

AI Engineer

16:14

Taming Rogue AI Agents with Observability-Driven Evaluation — Jim Bennett, Galileo

AI Engineer

13:52

Building agent fleet architectures your CISO doesn't hate — Lou Bichard, Gitpod

AI Engineer

5:45

Don’t get one-shotted: Use AI to test, review, merge, and deploy code — Tomas Reimers, Graphite

AI Engineer

15:38

Effective agent design patterns in production — Laurie Voss, LlamaIndex

AI Engineer