
ออฟไลน์ด้วยแอป Player FM !
Panic or Progress? Reading Between the Lines of AI Safety Tests
Manage episode 490872625 series 3572101
In Ep 2 we ask: "Panic or Progress? Reading Between the Lines of AI Safety Tests." We unpack the recent Claude Opus 4 "blackmail" test result, OpenAI's new transparency pledge, and why safety evaluations sometimes sound scarier than they are. Listeners will leave with a clear framework for interpreting headline-grabbing safety reports—and practical advice on when to worry, when to wait, and how to separate red flags from red herrings.
33 ตอน
Manage episode 490872625 series 3572101
In Ep 2 we ask: "Panic or Progress? Reading Between the Lines of AI Safety Tests." We unpack the recent Claude Opus 4 "blackmail" test result, OpenAI's new transparency pledge, and why safety evaluations sometimes sound scarier than they are. Listeners will leave with a clear framework for interpreting headline-grabbing safety reports—and practical advice on when to worry, when to wait, and how to separate red flags from red herrings.
33 ตอน
ทุกตอน
×ขอต้อนรับสู่ Player FM!
Player FM กำลังหาเว็บ