ออฟไลน์ด้วยแอป Player FM !
Building an AI Mathematician with Carina Hong - #754
Manage episode 517748376 series 2355587
In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI Mathematician." Carina explains why this is a pivotal moment for AI in mathematics, citing a convergence of three key areas: the advanced reasoning capabilities of modern LLMs, the rise of formal proof languages like Lean, and breakthroughs in code generation. We explore the core technical challenges, including the massive data gap between general-purpose code and formal math code, and the difficult problem of "autoformalization," or translating natural language proofs into a machine-verifiable format. Carina also shares Axiom's vision for a self-improving system that uses a self-play loop of conjecturing and proving to discover new mathematical knowledge. Finally, we discuss the broader applications of this technology in areas like formal verification for high-stakes software and hardware.
The complete show notes for this episode can be found at https://twimlai.com/go/754.
774 ตอน
Building an AI Mathematician with Carina Hong - #754
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Manage episode 517748376 series 2355587
In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI Mathematician." Carina explains why this is a pivotal moment for AI in mathematics, citing a convergence of three key areas: the advanced reasoning capabilities of modern LLMs, the rise of formal proof languages like Lean, and breakthroughs in code generation. We explore the core technical challenges, including the massive data gap between general-purpose code and formal math code, and the difficult problem of "autoformalization," or translating natural language proofs into a machine-verifiable format. Carina also shares Axiom's vision for a self-improving system that uses a self-play loop of conjecturing and proving to discover new mathematical knowledge. Finally, we discuss the broader applications of this technology in areas like formal verification for high-stakes software and hardware.
The complete show notes for this episode can be found at https://twimlai.com/go/754.
774 ตอน
Semua episod
×ขอต้อนรับสู่ Player FM!
Player FM กำลังหาเว็บ