12:43
Rishabh Garg, Tesla Optimus — Challenges in High Performance Robotics Systems
AI Engineer
19:06
Building an Agentic Platform — Ben Kus, CTO Box
19:46
Five hard earned lessons about Evals — Ankur Goyal, Braintrust
16:28
Perceptual Evaluations: Evals for Aesthetics — Diego Rodriguez, Krea.ai
18:47
How BlackRock Builds Custom Knowledge Apps at Scale — Vaibhav Page & Infant Vasanth, BlackRock
15:35
Form factors for your new AI coworkers — Craig Wattrus, Flatfile
19:12
Fuzzing in the GenAI Era — Leonard Tang, Haize Labs
18:49
Multi Agent AI and Network Knowledge Graphs for Change — Ola Mabadeje, Cisco
18:43
Wisdom-Driven Knowledge Augmented Generation at Scale - Chin Keong Lam, Patho AI
22:16
The Next Unicorns: 7 Top AI startups from the HF0 Residency
41:05
#define AI Engineer - Greg Brockman, OpenAI (ft. Jensen Huang)
5:14
The Future of Evals - Ankur Goyal, Braintrust
13:02
Designing AI-Intensive Applications - swyx
19:23
How to look at your data — Jeff Huber (Choma) + Jason Liu (567)
On Engineering AI Systems that Endure The Bitter Lesson - Omar Khattab, DSPy & Databricks
15:22
Evals Are Not Unit Tests — Ido Pesok, Vercel v0
19:14
2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI
20:55
Vibe Coding with Confidence — Itamar Friedman, Qodo
17:49
AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL
1:09:41
Full Workshop: Realtime Voice AI — Mark Backman, Daily
17:24
Vision AI in 2025 — Peter Robicheaux, Roboflow
14:55
Practical tactics to build reliable AI apps — Dmitry Kuchin, Multinear
7:30
How to Improve your Vibe Coding — Ian Butler
15:34
Vibes won't cut it — Chris Kelly, Augment Code
1:19:33
Real World Development with GitHub Copilot and VS Code — Harald Kirschner, Christopher Harrison
19:00
Building Agents at Cloud Scale — Antje Barth, AWS
23:52
State of Startups and AI 2025 - Sarah Guo, Conviction
19:58
Useful General Intelligence — Danielle Perszyk, Amazon AGI
12:33
The 2025 AI Engineering Report — Barr Yaron, Amplify
15:37
Agents vs Workflows: Why Not Both? — Sam Bhagwat, Mastra.ai
14:18
Why We Don’t Need More Data Centers - Dr. Jasper Zhang, Hyperbolic
19:31
Infrastructure for the Singularity — Jesse Han, Morph
20:25
Hacking the Inference Pareto Frontier - Kyle Kranen, NVIDIA
26:46
Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily
1:01:42
[Full Workshop] Building Conversational AI Agents - Thor Schaeff, ElevenLabs
19:32
From Self-driving to Autonomous Voice Agents — Brooke Hopkins, Coval
27:03
Why ChatGPT Keeps Interrupting You — Dr. Tom Shapland, LiveKit
16:30
Your realtime AI is ngmi — Sean DuBois (OpenAI), Kwindla Kramer (Daily)
16:09
Serving Voice AI at $1/hr: Open-source, LoRAs, Latency, Load Balancing - Neil Dwyer, Gabber
20:12
How to defend your sites from AI bots — David Mytton, Arcjet
20:36
The Unofficial Guide to Apple’s Private Cloud Compute - Jmo, CONFSEC
18:59
How to Secure Agents using OAuth — Jared Hanson (Keycard, Passport.js)
17:33
How we hacked YC Spring 2025 batch’s AI agents — Rene Brandel, Casco
14:00
OpenAI on Securing Code-Executing AI Agents — Fouad Matin (Codex, Agent Robustness)
20:33
Evaluating AI Search: A Practical Framework for Augmented AI Systems — Quotient AI + Tavily
16:40
Scaling Enterprise-Grade RAG: Lessons from Legal Frontier - Calvin Qi (Harvey), Chang She (Lance)
22:18
Building Alice’s Brain: an AI Sales Rep that Learns Like a Human - Sherwood & Satwik, 11x
20:22
Layering every technique in RAG, one query at a time - David Karam, Pi Labs (fmr. Google Search)
18:42
Building a Smarter AI Agent with Neural RAG - Will Bryk, Exa.ai
40:28
[Full Workshop] Building Metrics that actually work — David Karam, Pi Labs (fmr Google Search)
19:18
Make your LLM app a Domain Expert: How to Build an Expert System — Christopher Lovejoy, Anterior
19:34
Shipping Products When You Don't Know What they Can Do — Ben Stein, Teammates
16:17
Shipping something to someone always wins — Kenneth Auchenberg (ex. Stripe, VSCode)
18:37
Why your product needs an AI product manager, and why it should be you — James Lowe, i.AI
25:15
Everything is ugly, so go build something that isn't — Raiza Martin, Huxe (ex NotebookLM)
19:43
Building the platform for agent coordination — Tom Moor, Linear
17:47
What Is a Humanoid Foundation Model? An Introduction to GR00T N1 - Annika & Aastha
Real-time Experiments with an AI Co-Scientist - Stefania Druga, fmr. Google Deepmind
15:01
Scaling AI Agents Without Breaking Reliability — Preeti Somal, Temporal
16:33
Government Agents: AI Agents vs Tough Regulations — Mark Myshatyn, Los Alamos National Laboratory
1:21:00
Ship Agents that Ship: A Hands-On Workshop - Kyle Penfound, Jeremy Adams, Dagger
34:17
The AI Engineer’s Guide to Raising VC — Dani Grant (Jam), Chelcie Taylor (Notable)
32:28
Strategies for LLM Evals (GuideLLM, lm-eval-harness, OpenAI Evals Workshop) — Taylor Jordan Smith
21:11
Why you should care about AI interpretability - Mark Bissell, Goodfire AI
1:48:07
Information Retrieval from the Ground Up - Philipp Krenn, Elastic
43:42
Introduction to LLM serving with SGLang - Philip Kiely and Yineng Zhang, Baseten
18:07
Robotics: why now? - Quan Vuong and Jost Tobias Springberg, Physical Intelligence
17:28
Waymo's EMMA: Teaching Cars to Think - Jyh Jing Hwang, Waymo
1:23:14
A2A & MCP Workshop: Automating Business Processes with LLMs — Damien Murphy, Bench
16:06
Ship Production Software in Minutes, Not Months — Eno Reyes, Factory
17:59
Beyond the Prototype: Using AI to Write High-Quality Code - Josh Albrecht, Imbue
16:46
Software Development Agents: What Works and What Doesn't - Robert Brennan, AllHands/OpenHands
16:13
Devin 2.0 and the Future of SWE - Scott Wu, Cognition
13:40
Your Coding Agent Just Got Cloned And Your Brain Isn't Ready - Rustin Banks, Google Jules
53:54
Latent Space Paper Club: AIEWF Special Edition (Test of Time, DeepSeek R1/V3) — VIbhu Sapra
12:02
Human seeded Evals — Samuel Colvin, Pydantic
Building AI Products That Actually Work — Ben Hylak (Raindrop), Sid Bendre (Oleve)
18:55
Rise of the AI Architect — Clay Bavor, Cofounder, Sierra w/ Alessio Fanelli
18:19
AI That Pays: Lessons from Revenue Cycle — Nathan Wan, Ensemble Health
17:40
Structuring a modern AI team — Denys Linkov, Wisedocs
16:50
The Rise of Open Models in the Enterprise — Amir Haghighat, Baseten
Mentoring the Machine — Eric Hou, Augment Code
15:50
Building Applications with AI Agents — Michael Albada, Microsoft
15:25
AX is the only Experience that Matters - Ivan Burazin, Daytona
19:53
How to build Enterprise Aware Agents - Chau Tran, Glean
18:18
Monetizing AI — Alvaro Morales, Orb
18:12
Does AI Actually Boost Developer Productivity? (100k Devs Study) - Yegor Denisov-Blanch, Stanford
16:22
How agents will unlock the $500B promise of AI - Donald Hruska, Retool
How Intuit uses LLMs to explain taxes to millions of taxpayers - Jaspreet Singh, Intuit
3 ingredients for building reliable enterprise agents - Harrison Chase, LangChain/LangGraph
19:29
From Hype to Habit: How We’re Building an AI-First SaaS Company—While Still Shipping the Roadmap
19:37
Machines of Buying and Selling Grace - Adam Behrens, New Generation
15:58
How to Build Planning Agents without losing control - Yogendra Miraje, Factset
21:12
Building Agents (the hard parts!) - Rita Kozlov, Cloudflare
19:16
POC to PROD: Hard Lessons from 200+ Enterprise GenAI Deployments - Randall Hunt, Caylent
18:10
Build Dynamic Products, and Stop the AI Sideshow — Eliza Cabrera (Workday) + Jeremy Silva (Freeplay)
17:04
The Billable Hour is Dead; Long Live the Billable Hour — Kevin Madura + Mo Bhasin, Alix Partners
19:45
From Copilot to Colleague: Trustworthy Agents for High-Stakes - Joel Hron, CTO Thomson Reuters
6:45
How to Hire AI Engineers when EVERYONE is cheating with AI — Beth Glenfield, DevDay
6:51
Stateful environments for vertical agents — Josh Purtell, Synth Labs
9:44
Books reimagined: AI to create new experiences for things you know — Lukasz Gandecki, TheBrain.pro
10:21
AI powered entomology: Lessons from millions of AI code reviews — Tomas Reimers, Graphite
19:04
Critical AI Inference your CIO can Trust — Sahil Yadav, Hariharan Ganesan, Telemetrak
9:24
How to run Evals at Scale: Thinking beyond Accuracy or Similarity — Muktesh Mishra, Adobe
11:31
Continuous Profiling for GPUs — Matthias Loibl, Polar Signals
4:23
Top Ten Challenges to Reach AGI — Stephen Chin, Andreas Kollegger
Practical GraphRAG: Making LLMs smarter with Knowledge Graphs — Michael, Jesus, and Stephen, Neo4j
19:13
Knowledge Graphs in Litigation Agents — Tom Smoker, WhyHow
15:47
When Vectors Break Down: Graph-Based RAG for Dense Enterprise Knowledge - Sam Julien, Writer
7:02
Stop Using RAG as Memory — Daniel Chalef, Zep
20:24
HybridRAG: A Fusion of Graph and Vector Retrieval - Mitesh Patel, NVIDIA
18:45
tldraw.computer - Steve Ruiz, tldraw
16:59
Excalidraw: AI and Human Whiteboarding Partnership - Christopher Chedeau
14:24
The Bitter Layout or: How I Learned to Love the Model Picker — Maximillian Piras, Yutori
20:13
CIAM for AI: Authn/Authz for Agents — Michael Grinich, CEO of WorkOS
Good design hasn’t changed with AI — John Pham, SF Compute
17:17
Building Effective Voice Agents — Toki Sherbakov + Anoop Kotha, OpenAI
18:58
Robots as professional Chefs - Nikhil Abraham, CloudChef
2:42:28
[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han
20:28
Google Photos Magic Editor: GenAI Under the Hood of a Billion-User App - Kelvin Ma, Google Photos
19:03
Dream Machine: Scaling to 1m users in 4 days — Keegan McCallum, Luma AI
51:25
ComfyUI Full Workshop — first workshop from ComfyAnonymous himself!
19:26
Design like Karpathy is watching — Zeke Sikelianos, Replicate
18:35
On Curiosity — Sharif Shameem, Lexica
14:27
Real world MCPs in GitHub Copilot Agent Mode — Jon Peck, Microsoft
18:08
The rise of the agentic economy on the shoulders of MCP — Jan Curn, Apify
14:53
Full Spec MCP: Hidden Capabilities of the MCP spec — Harald Kirschner, Microsoft/VSCode
17:10
Shipping an Enterprise Voice AI Agent in 100 Days - Peter Bar, Intercom Fin
17:14
The State of Generative Media - Gorkem Yurtseven, FAL
22:51
Teaching Gemini to Speak YouTube: Adapting LLMs for Video Recommendations to 2B+DAU - Devansh Tandon
21:10
Transforming search and discovery using LLMs — Tejaswi & Vinesh, Instacart
22:28
Netflix's Big Bet: One model to rule recommendations: Yesu Feng, Netflix
21:59
360Brew: LLM-based Personalized Ranking and Recommendation - Hamed and Maziar, LinkedIn AI
18:13
What We Learned from Using LLMs in Pinterest — Mukuntha Narayanan, Han Wang, Pinterest
18:28
Measuring AGI: Interactive Reasoning Benchmarks for ARC-AGI-3 — Greg Kamradt, ARC Prize Foundation
19:27
RL for Autonomous Coding — Aakanksha Chowdhery, Reflection.ai
20:54
Recsys Keynote: Improving Recommendation Systems & Search in the Age of LLMs - Eugene Yan, Amazon
15:44
Benchmarks Are Memes: How What We Measure Shapes AI—and Us - Alex Duffy, Every.to
17:36
Small AI Teams with Huge Impact — Vik Paruchuri, Datalab
18:06
Rethinking Team Building: how a 30-person Startup serves 50 Million Users — Grant Lee, Gamma
12:03
Building a 10 person unicorn - Max Brodeur-Urbas, Gumloop
Using OSS models to build AI apps with millions of users — Hassan El Mghari
Bolt.new: How we scaled $0-20m ARR in 60 days, with 15 people — Eric Simons, Bolt
2:01:05
Prompt Engineering and AI Red Teaming — Sander Schulhoff, HackAPrompt/LearnPrompting
14:10
Survive the AI Knife Fight: Building Products That Win — Brian Balfour, Reforge
58:18
Automating Escrow with USDC and AI - Corey Cooper, Circle
1:41:34
How LLMs work for Web Devs: GPT in 600 lines of Vanilla JS - Ishan Anand
1:51:14
[Workshop] AI Pipelines and Agents in Pure TypeScript with Mastra.ai — Nick Nisi, Zack Proser
1:44:51
AI Engineering with the Google Gemini 2.5 Model Family - Philipp Schmid, Google DeepMind
21:36
The New Code — Sean Grove, OpenAI
Production software keeps breaking and it will only get worse — Anish Agarwal, Traversal.ai
Thinking Deeper in Gemini — Jack Rae, Google DeepMind
11:58
A year of Gemini progress + what comes next — Logan Kilpatrick, Google DeepMind
18:30
2025 in LLMs so far, illustrated by Pelicans on Bicycles — Simon Willison
17:52
Trends Across the AI Frontier — George Cameron, ArtificialAnalysis.ai
19:17
Training Agentic Reasoners — Will Brown, Prime Intellect
18:31
New York Times' Connections: A Case Study on NLP in Word Games — Shafik Quoraishee, NYT Games
Claude Code & the evolution of agentic coding — Boris Cherny, Anthropic
17:06
12-Factor Agents: Patterns of reliable LLM applications — Dex Horthy, HumanLayer
16:41
MCP Is Not Good Yet — David Cramer, Sentry
Your Personal Open-Source Humanoid Robot for $8,999 — JX Mo, K-Scale Labs
12:50
The Build-Operate Divide: Bridging Product Vision and AI Operational Reality
13:26
The New Lean Startup — Sid Bendre, Oleve
15:13
Optimizing inference for voice models in production - Philip Kiely, Baseten
14:40
Conquering Agent Chaos — Rick Blalock, Agentuity
1:25:08
[Evals Workshop] Mastering AI Evaluation: From Playground to Production
1:18:35
Intro to GraphRAG — Zach Blumenfeld
18:41
Securing Agents with Open Standards — Bobby Tiernay and Kam Sween, Auth0
35:06
The emerging skillset of wielding coding agents — Beyang Liu, Sourcegraph / Amp
16:15
Turning Fails into Features: Zapier’s Hard-Won Eval Lessons — Rafal Willinski, Vitor Balocco, Zapier
1:25:35
Building voice agents with OpenAI — Dominik Kundel, OpenAI
23:48
Containing Agent Chaos — Solomon Hykes, Dagger
48:31
Evals 101 — Doug Guthrie, Braintrust
5:41
Why should anyone care about Evals? — Manu Goyal, Braintrust
24:46
Engineering Better Evals: Scalable LLM Evaluation Pipelines That Work — Dat Ngo, Aman Khan, Arize
15:24
To the moon! Navigating deep context in legacy code with Augment Agent — Forrest Brazeal, Matt Ball
17:05
Serving Voice AI at Scale — Arjun Desai (Cartesia) & Rohit Talluri (AWS)
Ship it! Building Production Ready Agents — Mike Chambers, AWS
14:26
Introducing Strands Agents, an Open Source AI Agents SDK — Suman Debnath, AWS
20:10
Data is Your Differentiator: Building Secure and Tailored AI Systems — Mani Khanuja, AWS
1:43:46
How to build world-class AI products — Sarah Sachs (AI lead @ Notion) & Carlos Esteban (Braintrust)
53:15
From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva
1:15:43
Forget RAG Pipelines—Build Production Ready Agents in 15 Mins: Nina Lopatina, Rajiv Shah, Contextual
21:43
Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat
18:46
Realtime Conversational Video with Pipecat and Tavus — Chad Bailey and Brian Johnson, Daily & Tavus
Vector Search Benchmark[eting] - Philipp Krenn, Elastic
16:14
Taming Rogue AI Agents with Observability-Driven Evaluation — Jim Bennett, Galileo
13:52
Building agent fleet architectures your CISO doesn't hate — Lou Bichard, Gitpod
5:45
Don’t get one-shotted: Use AI to test, review, merge, and deploy code — Tomas Reimers, Graphite
15:38
Effective agent design patterns in production — Laurie Voss, LlamaIndex