ออฟไลน์ด้วยแอป Player FM !
[QA] Towards a Theoretical Understanding of the `Reversal Curse' via Training Dynamics
Manage episode 417327954 series 3524393
The paper analyzes the "reversal curse" in large language models, explaining why they struggle with logical reasoning tasks like inverse search and chain-of-thought.
https://arxiv.org/abs//2405.04669
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1137 ตอน
Manage episode 417327954 series 3524393
The paper analyzes the "reversal curse" in large language models, explaining why they struggle with logical reasoning tasks like inverse search and chain-of-thought.
https://arxiv.org/abs//2405.04669
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1137 ตอน
Minden epizód
×ขอต้อนรับสู่ Player FM!
Player FM กำลังหาเว็บ