Player FM - Internet Radio Done Right
Content provided by Robin Ranjit Singh Chauhan. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Robin Ranjit Singh Chauhan or their podcast platform partner. If you believe someone is using your copyrighted work without permission, you can follow the process outlined here: https://th.player.fm/legal
RLC 2024 - Posters and Hallways 1
Manage episode 439204034 series 2536330
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
- 0:01 Ann Huang from Harvard on Learning Dynamics and the Geometry of Neural Dynamics in Recurrent Neural Controllers
- 1:37 Jannis Blüml from TU Darmstadt on HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning
- 3:13 Benjamin Fuhrer from NVIDIA on Gradient Boosting Reinforcement Learning
- 3:54 Paul Festor from Imperial College London on Evaluating the impact of explainable RL on physician decision-making in high-fidelity simulations: insights from eye-tracking metrics
62 episodes
All episodes
TalkRL: The Reinforcement Learning Podcast
What do RL researchers complain about after hours at the bar? In this "Hot takes" episode, we find out! Recorded at The Pearl in downtown Vancouver, during the RL meetup after a day of NeurIPS 2024. Special thanks to "David Beckham" for the inspiration :)
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring:
- 0:01 David Radke of the Chicago Blackhawks (NHL) on RL for professional sports
- 0:56 Abhishek Naik from the National Research Council on Continuing RL and Average Reward
- 2:42 Daphne Cornelisse from NYU on Autonomous Driving and Multi-Agent RL
- 8:58 Shray Bansal from Georgia Tech on Cognitive Bias for Human-AI Ad Hoc Teamwork
- 10:21 Claas Voelcker from University of Toronto on Can we hop in general?
- 11:23 Brent Venable from The Institute for Human & Machine Cognition on Cooperative information dissemination…
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring:
- 0:01 David Abel from DeepMind on 3 Dogmas of RL
- 0:55 Kevin Wang from Brown on learning variable depth search for MCTS
- 2:17 Ashwin Kumar from Washington University in St Louis on fairness in resource allocation
- 3:36 Prabhat Nagarajan from UAlberta on Value overestimation…
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring:
- 0:01 Kris De Asis from Openmind on Time Discretization
- 2:23 Anna Hakhverdyan from U of Alberta on Online Hyperparameters
- 3:59 Dilip Arumugam from Princeton on Information Theory and Exploration
- 5:04 Micah Carroll from UC Berkeley on Changing preferences and AI alignment…
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring:
- 0:01 Hector Kohler from Centre Inria de l'Université de Lille with "Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning"
- 2:29 Quentin Delfosse from TU Darmstadt on "Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents"
- 4:15 Sonja Johnson-Yu from Harvard on "Understanding biological active sensing behaviors by interpreting learned artificial agent policies"
- 6:42 Jannis Blüml from TU Darmstadt on "OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments"
- 8:20 Cameron Allen from UC Berkeley on "Resolving Partial Observability in Decision Processes via the Lambda Discrepancy"
- 9:48 James Staley from Tufts on "Agent-Centric Human Demonstrations Train World Models"
- 14:54 Jonathan Li from Rensselaer Polytechnic Institute…
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring:
- 0:01 Ann Huang from Harvard on Learning Dynamics and the Geometry of Neural Dynamics in Recurrent Neural Controllers
- 1:37 Jannis Blüml from TU Darmstadt on HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning
- 3:13 Benjamin Fuhrer from NVIDIA on Gradient Boosting Reinforcement Learning
- 3:54 Paul Festor from Imperial College London on Evaluating the impact of explainable RL on physician decision-making in high-fidelity simulations: insights from eye-tracking metrics…
Finale Doshi-Velez is a Professor at the Harvard Paulson School of Engineering and Applied Sciences. This off-the-cuff interview was recorded at UMass Amherst during the workshop day of RL Conference on August 9th 2024. Host notes: I've been a fan of some of Prof. Doshi-Velez's past work on clinical RL and have hoped to feature her for some time, so I jumped at the chance to get a few minutes of her thoughts -- even though you can tell I was not prepared and a bit flustered, tbh. Thanks to Prof. Doshi-Velez for taking a moment for this, and I hope to cross paths in future for a more in-depth interview. References: Finale Doshi-Velez Homepage @ Harvard; Finale Doshi-Velez on Google Scholar…
Thanks to Professor Silver for permission to record this discussion after his RLC 2024 keynote lecture. Recorded at UMass Amherst during RLC 2024. Due to the live recording environment, audio quality varies. We publish this audio in its raw form to preserve the authenticity and immediacy of the discussion. References: AlphaProof announcement on DeepMind's blog; Discovering Reinforcement Learning Algorithms, Oh et al -- his keynote at RLC 2024 referred to a more recent, yet-to-be-published update to this work; Reinforcement Learning Conference 2024; David Silver on Google Scholar…
David Silver is a principal research scientist at DeepMind and a professor at University College London. This interview was recorded at UMass Amherst during RLC 2024. References: Discovering Reinforcement Learning Algorithms, Oh et al -- his keynote at RLC 2024 referred to a more recent, yet-to-be-published update to this work; Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm, Silver et al 2017 -- the AlphaZero algorithm was used in his recent work on AlphaProof; AlphaProof on the DeepMind blog; AlphaFold on the DeepMind blog; Reinforcement Learning Conference 2024; David Silver on Google Scholar…
Dr. Vincent Moens is an Applied Machine Learning Research Scientist at Meta, and an author of TorchRL and TensorDict in PyTorch. Featured Reference: TorchRL: A data-driven decision-making library for PyTorch, Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni De Fabritiis, Vincent Moens. Additional References: TorchRL on GitHub; TensorDict Documentation…
Arash Ahmadian is a Researcher at Cohere and Cohere For AI focused on preference training of large language models. He's also a researcher at the Vector Institute. Featured Reference: Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs, Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker. Additional References: Self-Rewarding Language Models, Yuan et al 2024; Reinforcement Learning: An Introduction, Sutton and Barto 1998; Learning from Delayed Rewards, Chris Watkins 1989; Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, Williams 1992…
Glen Berseth is an assistant professor at the Université de Montréal, a core academic member of the Mila - Quebec AI Institute, a Canada CIFAR AI Chair, a member of the Institut Courtois, and co-director of the Robotics and Embodied AI Lab (REAL). Featured Links: Reinforcement Learning Conference; Closing the Gap between TD Learning and Supervised Learning--A Generalisation Point of View, Raj Ghugare, Matthieu Geist, Glen Berseth, Benjamin Eysenbach…
Ian Osband is a Research Scientist at OpenAI (ex DeepMind, Stanford) working on decision making under uncertainty. We spoke about: information theory and RL; exploration, epistemic uncertainty, and joint predictions; Epistemic Neural Networks and scaling to LLMs. Featured References: Reinforcement Learning, Bit by Bit, Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen; From Predictions to Decisions: The Importance of Joint Predictive Distributions, Zheng Wen, Ian Osband, Chao Qin, Xiuyuan Lu, Morteza Ibrahimi, Vikranth Dwaracherla, Mohammad Asghari, Benjamin Van Roy; Epistemic Neural Networks, Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy; Approximate Thompson Sampling via Epistemic Neural Networks, Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy. Additional References: Thesis defence, Ian Osband; Homepage, Ian Osband; Epistemic Neural Networks at Stanford RL Forum; Behaviour Suite for Reinforcement Learning, Osband et al 2019; Efficient Exploration for LLMs, Dwaracherla et al 2024…
Sharath Chandra Raparthy on In-Context Learning for Sequential Decision Tasks, GFlowNets, and more! Sharath Chandra Raparthy is an AI Resident at FAIR at Meta, and did his Master's at Mila. Featured Reference: Generalization to New Sequential Decision Making Tasks with In-Context Learning, Sharath Chandra Raparthy, Eric Hambro, Robert Kirk, Mikael Henaff, Roberta Raileanu. Additional References: Sharath Chandra Raparthy Homepage; Human-Timescale Adaptation in an Open-Ended Task Space, Adaptive Agent Team 2023; Data Distributional Properties Drive Emergent In-Context Learning in Transformers, Chan et al 2022; Decision Transformer: Reinforcement Learning via Sequence Modeling, Chen et al 2021…
Pierluca D'Oro and Martin Klissarov on Motif and RLAIF, Noisy Neighborhoods and Return Landscapes, and more! Pierluca D'Oro is a PhD student at Mila and a visiting researcher at Meta. Martin Klissarov is a PhD student at Mila and McGill and a research scientist intern at Meta. Featured References: Motif: Intrinsic Motivation from Artificial Intelligence Feedback, Martin Klissarov*, Pierluca D'Oro*, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff; Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control, Nate Rahn*, Pierluca D'Oro*, Harley Wiltzer, Pierre-Luc Bacon, Marc G. Bellemare; To keep doing RL research, stop calling yourself an RL researcher, Pierluca D'Oro…