พอดคาสท์ NLP Highlights ชั้นนำ (2025)

1
Are LLMs safe? 42:15

10M ago42:15

42:15

Curious about the safety of LLMs? 🤔 Join us for an insightful new episode featuring Suchin Gururangan, Young Investigator at Allen Institute for Artificial Intelligence and Data Science Engineer at Appuri. 🚀 Don't miss out on expert insights into the world of LLMs!โดย Allen Institute for Artificial Intelligence

1
"Imaginative AI" with Mohamed Elhoseiny 23:19

1y ago23:19

23:19

This podcast episode features Dr. Mohamed Elhoseiny, a true luminary in the realm of computer vision with over a decade of groundbreaking research. As an Assistant Professor at KAUST, Dr. Elhoseiny's work delves into the intersections of Computer Vision, Language & Vision, and Computational Creativity in Art, Fashion, and AI. Notably, he co-organiz…

1
142 - Science Of Science, with Kyle Lo 48:57

1y ago48:57

48:57

Our first guest with this new format is Kyle Lo, the most senior lead scientist in the Semantic Scholar team at Allen Institute for AI (AI2), who kindly agreed to share his perspective on #Science of #Science (#scisci) on our podcast. SciSci is concerned with studying how people do science, and includes developing methods and tools to help people c…

1
141 - Building an open source LM, with Iz Beltagy and Dirk Groeneveld 29:36

1+ y ago29:36

29:36

In this special episode of NLP Highlights, we discussed building and open sourcing language models. What is the usual recipe for building large language models? What does it mean to open source them? What new research questions can we answer by open sourcing them? We particularly focused on the ongoing Open Language Model (OLMo) project at AI2, and…

1
140 - Generative AI and Copyright, with Chris Callison-Burch 51:28

1+ y ago51:28

51:28

In this special episode, we chatted with Chris Callison-Burch about his testimony in the recent U.S. Congress Hearing on the Interoperability of AI and Copyright Law. We started by asking Chris about the purpose and the structure of this hearing. Then we talked about the ongoing discussion on how the copyright law is applicable to content generated…

1
139 - Coherent Long Story Generation, with Kevin Yang 45:18

2y ago45:18

45:18

How can we generate coherent long stories from language models? Ensuring that the generated story has long range consistency and that it conforms to a high level plan is typically challenging. In this episode, Kevin Yang describes their system that prompts language models to first generate an outline, and iteratively generate the story while follow…

1
138 - Compositional Generalization in Neural Networks, with Najoung Kim 48:22

2y ago48:22

48:22

Compositional generalization refers to the capability of models to generalize to out-of-distribution instances by composing information obtained from the training data. In this episode we chatted with Najoung Kim, on how to explicitly evaluate specific kinds of compositional generalization in neural network models of language. Najoung described COG…

1
137 - Nearest Neighbor Language Modeling and Machine Translation, with Urvashi Khandelwal 35:56

2y ago35:56

35:56

We invited Urvashi Khandelwal, a research scientist at Google Brain to talk about nearest neighbor language and machine translation models. These models interpolate parametric (conditional) language models with non-parametric distributions over the closest values in some data stores built from relevant data. Not only are these models shown to outpe…

1
136 - Including Signed Languages in NLP, with Kayo Yin and Malihe Alikhani 1:02:15

2+ y ago1:02:15

1:02:15

In this episode, we talk with Kayo Yin, an incoming PhD at Berkeley, and Malihe Alikhani, an assistant professor at the University of Pittsburgh, about opportunities for the NLP community to contribute to Sign Language Processing (SLP). We talked about history and misconceptions about sign languages, high-level similarities and differences between …

1
135 - PhD Application Series: After Submitting Applications 36:53

3y ago36:53

36:53

This episode is the third in our current series on PhD applications.We talk about what the PhD application process looks like after applications are submitted. We start with a general overview of the timeline, then talk about how to approach interviews and conversations with faculty, and finish by discussing the different factors to consider in dec…

1
134 - PhD Application Series: PhDs in Europe versus the US, with Barbara Plank and Gonçalo Correia 38:29

3y ago38:29

38:29

This episode is the second in our current series on PhD applications.How do PhD programs in Europe differ from PhD programs in the US, and how should people decide between them? In this episode, we invite Barbara Plank (Professor at ITU, IT University of Copenhagen) and Gonçalo Correia (ELLIS PhD student at University of Lisbon and University of Am…

1
133 - PhD Application Series: Preparing Application Materials, with Nathan Schneider and Roma Patel 43:54

3+ y ago43:54

43:54

This episode is the first in our current series on PhD applications.How should people prepare their applications to PhD programs in NLP? In this episode, we invite Nathan Schneider (Professor of Linguistics and Computer Science at Georgetown University) and Roma Patel (PhD student in Computer Science at Brown University) to share their perspectives…

1
132 - Alexa Prize Socialbot Grand Challenge and Alquist 4.0, with Petr Marek 41:43

3+ y ago41:43

41:43

In this episode, we discussed the Alexa Prize Socialbot Grand Challenge and this year's winning submission, Alquist 4.0, with Petr Marek, a member of the winning team. Petr gave us an overview of their submission, the design choices that led to them winning the competition, including combining a hardcoded dialog tree and a neural generator model an…

1
131 - Opportunities and Barriers between HCI and NLP, with Nanna Inie and Leon Derczynski 46:54

3+ y ago46:54

46:54

What can NLP researchers learn from Human Computer Interaction (HCI) research? We chatted with Nanna Inie and Leon Derczynski to find out. We discussed HCI's research processes including methods of inquiry, the data annotation processes used in HCI, and how they are different from NLP, and the cognitive methods used in HCI for qualitative error ana…

1
130 - Linking human cognitive patterns to NLP Models, with Lisa Beinborn 44:02

3+ y ago44:02

44:02

In this episode, we talk with Lisa Beinborn, an assistant professor at Vrije Universiteit Amsterdam, about how to use human cognitive signals to improve and analyze NLP models. We start by discussing different kinds of cognitive signals—eye-tracking, EEG, MEG, and fMRI—and challenges associated with using them. We then turn to Lisa’s recent work co…

1
129 - Transformers and Hierarchical Structure, with Shunyu Yao 35:43

3+ y ago35:43

35:43

In this episode, we talk to Shunyu Yao about recent insights into how transformers can represent hierarchical structure in language. Bounded-depth hierarchical structure is thought to be a key feature of natural languages, motivating Shunyu and his coauthors to show that transformers can efficiently represent bounded-depth Dyck languages, which can…

1
128 - Dynamic Benchmarking, with Douwe Kiela 47:00

3+ y ago47:00

47:00

We discussed adversarial dataset construction and dynamic benchmarking in this episode with Douwe Kiela, a research scientist at Facebook AI Research who has been working on a dynamic benchmarking platform called Dynabench. Dynamic benchmarking tries to address the issue of many recent datasets getting solved with little progress being made towards…

1
127 - Masakhane and Participatory Research for African Languages, with Tosin Adewumi and Perez Ogayo 47:17

3+ y ago47:17

47:17

We invited members of Masakhane, Tosin Adewumi and Perez Ogayo, to talk about their EMNLP Findings paper that discusses why typical research is limited for low-resourced NLP and how participatory research can help. As a result of participatory research, Masakhane has many, many success stories: first datasets and benchmarks in African languages, fi…

1
126 - Optimizing Continuous Prompts for Generation, with Lisa Li 47:38

3+ y ago47:38

47:38

We invited Lisa Li to talk about her recent work, Prefix-Tuning: Optimizing Continuous Prompts for Generation. Prefix tuning is a lightweight alternative to finetuning, and the idea is to tune only a fixed-length task-specific continuous vector, and to keep the pretrained transformer parameters frozen. We discussed how prefix tuning compares with f…

1
125 - VQA for Real Users, with Danna Gurari 42:10

3+ y ago42:10

42:10

How can we build Visual Question Answering systems for real users? For this episode, we chatted with Danna Gurari, about her work in building datasets and models towards VQA for people who are blind. We talked about the differences between the existing datasets, and Vizwiz, a dataset built by Gurari et al., and the resulting algorithmic changes. We…

1
124 - Semantic Machines and Task-Oriented Dialog, with Jayant Krishnamurthy and Hao Fang 45:37

3+ y ago45:37

45:37

We invited Jayant Krishnamurthy and Hao Fang, researchers at Microsoft Semantic Machines to discuss their platform for building task-oriented dialog systems, and their recent TACL paper on the topic. The paper introduces a new formalism for task-oriented dialog to effectively handle references and revisions in complex dialog, and a large realistic …

1
123 - Robust NLP, with Robin Jia 47:59

4y ago47:59

47:59

In this episode, Robin Jia talks about how to build robust NLP systems. We discuss the different senses in which a system can be robust, reasons to care about system robustness, and the challenges involved in evaluating robustness of NLP models. We talk about how to build certifiably robust models through interval bound propagation and discrete enc…

1
122 - Statutory Reasoning in Tax Law, with Nils Holzenberger 46:18

4y ago46:18

46:18

We invited Nils Holzenberger, a PhD student at JHU to talk about a dataset involving statutory reasoning in tax law Holzenberger et al. released recently. This dataset includes difficult textual entailment and question answering problems that involve reasoning about how sections in tax law are applicable to specific cases. They also released a Prol…

1
121 - Language and the Brain, with Alona Fyshe 42:38

4y ago42:38

42:38

We invited Alona Fyshe to talk about the link between NLP and the human brain. We began by talking about what we currently know about the connection between representations used in NLP and representations recorded in the brain. We also discussed how different brain imaging techniques compare to each other. We then dove into experiments investigatin…

1
120 - Evaluation of Text Generation, with Asli Celikyilmaz 55:13

4+ y ago55:13

55:13

We invited Asli Celikyilmaz for this episode to talk about evaluation of text generation systems. We discussed the challenges in evaluating generated text, and covered human and automated metrics, with a discussion of recent developments in learning metrics. We also talked about some open research questions, including the difficulties in evaluating…

1
119 - Social NLP, with Diyi Yang 53:32

4+ y ago53:32

53:32

In this episode, Diyi Yang gives us an overview of using NLP models for social applications, including understanding social relationships, processes, roles, and power. As NLP systems are getting used more and more in the real world, they additionally have increasing social impacts that must be studied. We talk about how to get started in this field…

1
118 - Coreference Resolution, with Marta Recasens 47:30

4+ y ago47:30

47:30

In this episode, we talked about Coreference Resolution with Marta Recasens, a Research Scientist at Google. We discussed the complexity involved in resolving references in language, the simplification of the problem that the NLP community has focused on by talking about specific datasets, and the complex coreference phenomena that are not yet capt…

1
117 - Interpreting NLP Model Predictions, with Sameer Singh 56:56

4+ y ago56:56

56:56

We interviewed Sameer Singh for this episode, and discussed an overview of recent work in interpreting NLP model predictions, particularly instance-level interpretations. We started out by talking about why it is important to interpret model outputs and why it is a hard problem. We then dove into the details of three kinds of interpretation techniq…

1
116 - Grounded Language Understanding, with Yonatan Bisk 59:28

4+ y ago59:28

59:28

We invited Yonatan Bisk to talk about grounded language understanding. We started off by discussing an overview of the topic, its research goals, and the the challenges involved. In the latter half of the conversation, we talked about ALFRED (Shridhar et al., 2019), a grounded instruction following benchmark that simulates training a robot butler. …

1
115 - AllenNLP, interviewing Matt Gardner 33:25

4+ y ago33:25

33:25

In this special episode, Carissa Schoenick, a program manager and communications director at AI2 interviewed Matt Gardner about AllenNLP. We chatted about the origins of AllenNLP, the early challenges in building it, and the design decisions behind the library. Given the release of AllenNLP 1.0 this week, we asked Matt what users can expect from th…

1
114 - Behavioral Testing of NLP Models, with Marco Tulio Ribeiro 43:32

4+ y ago43:32

43:32

We invited Marco Tulio Ribeiro, a Senior Researcher at Microsoft, to talk about evaluating NLP models using behavioral testing, a framework borrowed from Software Engineering. Marco describes three kinds of black-box tests the check whether NLP models satisfy certain necessary conditions. While breaking the standard IID assumption, this framework p…

1
113 - Managing Industry Research Teams, with Fernando Pereira 42:22

4+ y ago42:22

42:22

We invited Fernando Pereira, a VP and Distinguished Engineer at Google, where he leads NLU and ML research, to talk about managing NLP research teams in industry. Topics we discussed include prioritizing research against product development and effective collaboration with product teams, dealing with potential research interest mismatch between ind…

1
112 - Alignment of Multilingual Contextual Representations, with Steven Cao 33:15

4+ y ago33:15

33:15

We invited Steven Cao to talk about his paper on multilingual alignment of contextual word embeddings. We started by discussing how multilingual transformers work in general, and then focus on Steven’s work on aligning word representations. The core idea is to start from a list of words automatically aligned from parallel corpora and to ensure the …

1
111 - Typologically diverse, multi-lingual, information-seeking questions, with Jon Clark 38:29

4+ y ago38:29

38:29

We invited Jon Clark from Google to talk about TyDi QA, a new question answering dataset, for this episode. The dataset contains information seeking questions in 11 languages that are typologically diverse, i.e., they differ from each other in terms of key structural and functional features. The questions in TyDiQA are information-seeking, like tho…

1
110 - Natural Questions, with Tom Kwiatkowski and Michael Collins 43:30

5y ago43:30

43:30

In this episode, Tom Kwiatkowski and Michael Collins talk about Natural Questions, a benchmark for question answering research. We discuss how the dataset was collected to reflect naturally-occurring questions, the criteria used for identifying short and long answers, how this dataset differs from other QA datasets, and how easy it might be to game…

1
109 - What Does Your Model Know About Language, with Ellie Pavlick 46:58

5y ago46:58

46:58

How do we know, in a concrete quantitative sense, what a deep learning model knows about language? In this episode, Ellie Pavlick talks about two broad directions to address this question: structural and behavioral analysis of models. In structural analysis, we often train a linear classifier for some linguistic phenomenon we'd like to probe (e.g.,…

1
108 - Data-To-Text Generation, with Verena Rieser and Ondřej Dušek 49:30

5y ago49:30

49:30

In this episode we invite Verena Rieser and Ondřej Dušek on to talk to us about the complexities of generating natural language when you have some kind of structured meaning representation as input. We talk about when you might want to do this, which is often is some kind of a dialog system, but also generating game summaries, and even some langu…

1
107 - Multi-Modal Transformers, with Hao Tan and Mohit Bansal 37:34

5y ago37:34

37:34

In this episode, we invite Hao Tan and Mohit Bansal to talk about multi-modal training of transformers, focusing in particular on their EMNLP 2019 paper that introduced LXMERT, a vision+language transformer. We spend the first third of the episode talking about why you might want to have multi-modal representations. We then move to the specifics of…

1
106 - Ethical Considerations In NLP Research, with Emily Bender 39:18

5y ago39:18

39:18

In this episode, we talked to Emily Bender about the ethical considerations in developing NLP models and putting them in production. Emily cited specific examples of ethical issues, and talked about the kinds of potential concerns to keep in mind, both when releasing NLP models that will be used by real people, and also while conducting NLP researc…

1
105 - Question Generation, with Sudha Rao 42:59

5y ago42:59

42:59

In this episode we invite Sudha Rao to talk about question generation. We talk about different settings where you might want to generate questions: for human testing scenarios (rare), for data augmentation (has been done a bunch for SQuAD-like tasks), for detecting missing information / asking clarification questions, for dialog uses, and others. A…

1
104 - Model Distillation, with Victor Sanh and Thomas Wolf 31:22

5y ago31:22

31:22

In this episode we talked with Victor Sanh and Thomas Wolf from HuggingFace about model distillation, and DistilBERT as one example of distillation. The idea behind model distillation is compressing a large model by building a smaller model, with much fewer parameters, that approximates the output distribution of the original model, typically for i…

1
103 - Processing Language in Social Media, with Brendan O'Connor 43:12

5y ago43:12

43:12

We talked to Brendan O’Connor for this episode about processing language in social media. Brendan started off by telling us about his projects that studied the linguistic and geographical patterns of African American English (AAE), and how obtaining data from Twitter made these projects possible. We then talked about how many tools built for standa…

1
102 - Biomedical NLP research at the National Institute of Health with Dina Demner-Fushman 36:52

5y ago36:52

36:52

What exciting NLP research problems are involved in processing biomedical and clinical data? In this episode, we spoke with Dina Demner-Fushman, who leads NLP and IR research at the Lister Hill National Center for Biomedical Communications, part of the National Library of Medicine. We talked about processing biomedical scientific literature, unders…

1
101 - The lottery ticket hypothesis, with Jonathan Frankle 41:16

5y ago41:16

41:16

In this episode, Jonathan Frankle describes the lottery ticket hypothesis, a popular explanation of how over-parameterization helps in training neural networks. We discuss pruning methods used to uncover subnetworks (winning tickets) which were initialized in a particularly effective way. We also discuss patterns observed in pruned networks, stabil…

1
100 - NLP Startups, with Oren Etzioni 30:55

5y ago30:55

30:55

For our 100th episode, we invite AI2 CEO Oren Etzioni to talk to us about NLP startups. Oren has founded several successful startups, is himself an investor in startups, and helps with AI2's startup incubator.Some of our discussion topics include: What's the similarity between being a researcher and an entrepreneur? How do you transition from being…

1
99 - Evaluating Protein Transfer Learning, With Roshan Rao And Neil Thomas 44:49

5y ago44:49

44:49

For this episode, we chatted with Neil Thomas and Roshan Rao about modeling protein sequences and evaluating transfer learning methods for a set of five protein modeling tasks. Learning representations using self-supervised pretaining objectives has shown promising results in transferring to downstream tasks in protein sequence modeling, just like …

1
98 - Analyzing Information Flow In Transformers, With Elena Voita 37:05

5y ago37:05

37:05

What function do the different attention heads serve in multi-headed attention models? In this episode, Lena describes how to use attribution methods to assess the importance and contribution of different heads in several tasks, and describes a gating mechanism to prune the number of effective heads used when combined with an auxiliary loss. Then, …

1
97 - Automated Analysis Of Historical Printed Documents, With Taylor Berg-Kirkpatrick 44:11

5y ago44:11

44:11

In this episode, we talk to Taylor Berg-Kirkpatrick about optical character recognition (OCR) on historical documents. Taylor starts off by describing some practical issues related to old scanning processes of documents that make performing OCR on them a difficult problem. Then he explains how one can build latent variable models for this data usin…

1
96 - Question Answering as an Annotation Format, with Luke Zettlemoyer 29:54

5y ago29:54

29:54

In this episode, we chat with Luke Zettlemoyer about Question Answering as a format for crowdsourcing annotations of various semantic phenomena in text. We start by talking about QA-SRL and QAMR, two datasets that use QA pairs to annotate predicate-argument relations at the sentence level. Luke describes how this annotation scheme makes it possible…

1
95 - Common sense reasoning, with Yejin Choi 35:29

5+ y ago35:29

35:29

In this episode, we invite Yejin Choi to talk about common sense knowledge and reasoning, a growing area in NLP. We start by discussing a working definition of “common sense” and the practical utility of studying it. We then talk about some of the datasets and resources focused on studying different aspects of common sense (e.g., ReCoRD, Commonsens…

พอดคาสต์ที่ควรค่าแก่การฟัง

NLP Highlights พอดคาสต์

พอดคาสต์ที่ควรค่าแก่การฟัง

คู่มืออ้างอิงด่วน