Building a mobile app
React Native, Kolya.
Notes → Roam → Reflect → Notes → Obsidian
“By some measurements, DeepSeek is over ~45x more efficient than other leading-edge models.”
“With R1, DeepSeek essentially cracked one of the holy grails of AI: getting models to reason step-by-step without relying on massive supervised datasets.”
“It directly addresses the single biggest weakness of the otherwise phenomenally successful Transformer model, which is its propensity to "hallucinate".”
“DeepSeek figured out how to predict multiple tokens while maintaining the quality you'd get from single-token prediction.”
This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit newsletter.maxua.com
132 episodes