-
12:30–12:48Evaluation Precedes Evolution: Rubrics as the Load-Bearing Infrastructure of Self-Improving AgentsTanya Dixit
-
12:50–13:08Beyond Forgetful Bots: Architectural Patterns for Persistent, Proactive Claw-Style AI AgentsNavan Tirupathi
-
13:10–13:28Shipping Sandboxed Workers for Notion AgentsAdam Hudson
-
13:30–13:48Close your agentic loopMoss Ebeling
-
13:50–14:08How Many Agents Are Too Many? The Hidden Cost of Multi-Agent SystemsAnannya Roy Chowdhury
-
14:10–14:28Kill the God AgentAdesh Gairola
-
15:30–15:48Agent Observability: Monitoring and Understanding Agents at Internet ScaleDaniel Nadasi
-
15:50–16:08Our AI Hallucinated in Production: How We Fixed It With EvalsYicheng Guo
-
16:10–16:28The Application Layer Is the New Research LabAbdul Karim
-
16:30–16:48Orbital Lasers vs For Loops: Economically Matching Models to TasksStephen Sennett
-
16:50–17:08Your AI Can’t Engineer (Yet)Theodoros Galanos
-
17:10–17:28Flue: The Agent Harness Framework, at CloudflareMichael Hart