Artwork

เนื้อหาจัดทำโดย Real Python เนื้อหาพอดแคสต์ทั้งหมด รวมถึงตอน กราฟิก และคำอธิบายพอดแคสต์ได้รับการอัปโหลดและจัดหาให้โดยตรงจาก Real Python หรือพันธมิตรแพลตฟอร์มพอดแคสต์ของพวกเขา หากคุณเชื่อว่ามีบุคคลอื่นใช้งานที่มีลิขสิทธิ์ของคุณโดยไม่ได้รับอนุญาต คุณสามารถปฏิบัติตามขั้นตอนที่แสดงไว้ที่นี่ https://th.player.fm/legal
Player FM - แอป Podcast
ออฟไลน์ด้วยแอป Player FM !

Improving Classification Models With XGBoost

1:05:00
 
แบ่งปัน
 

Manage episode 375236016 series 2637014
เนื้อหาจัดทำโดย Real Python เนื้อหาพอดแคสต์ทั้งหมด รวมถึงตอน กราฟิก และคำอธิบายพอดแคสต์ได้รับการอัปโหลดและจัดหาให้โดยตรงจาก Real Python หรือพันธมิตรแพลตฟอร์มพอดแคสต์ของพวกเขา หากคุณเชื่อว่ามีบุคคลอื่นใช้งานที่มีลิขสิทธิ์ของคุณโดยไม่ได้รับอนุญาต คุณสามารถปฏิบัติตามขั้นตอนที่แสดงไว้ที่นี่ https://th.player.fm/legal

How can you improve a classification model while avoiding overfitting? Once you have a model, what tools can you use to explain it to others? This week on the show, we talk with author and Python trainer Matt Harrison about his new book Effective XGBoost: Tuning, Understanding, and Deploying Classification Models.

Matt talks about the process of developing the book and how he wanted it to be an interactive experience for the reader. He explains the concept of gradient boosting and provides metaphors for developing a model. He shares his appreciation for exploratory data analysis as a crucial step in understanding your data.

He also shares additional libraries to help you explain your model. We discuss how difficult it is to develop the story of how the model works to share it with stakeholders.

He illustrates why covering the complete process is essential, from exploring data and building a model to finally deploying it. He shares many of the tools he found along the way.

This week’s episode is brought to you by Scout APM.

Course Spotlight: Starting With Linear Regression in Python

In this video course, you’ll get started with linear regression in Python. Linear regression is one of the fundamental statistical and machine learning techniques, and Python is a popular choice for machine learning.

Topics:

  • 00:00:00 – Introduction
  • 00:02:16 – Starting on the book
  • 00:04:36 – What is tabular prediction?
  • 00:06:50 – Who could leverage XGBoost?
  • 00:09:46 – Background to get started
  • 00:11:50 – Using XGBoost to explore data
  • 00:21:06 – Sponsor: ScoutAPM
  • 00:21:54 – Focusing on using the tool
  • 00:26:37 – Not being a developer
  • 00:30:53 – Contrasting XGBoost and logistic regression
  • 00:41:57 – Video Course Spotlight
  • 00:43:21 – Using SHAP to explain the model
  • 00:48:06 – Working with hyperparameters
  • 00:51:40 – Deploying your model
  • 00:53:09 – XGBoost Feature Interactions Reshaped (XGBFIR)
  • 00:55:47 – Communicating the story of a model
  • 00:57:57 – How to find the book
  • 00:59:07 – What are you excited about in the world of Python?
  • 01:02:46 – What do you want to learn next?
  • 01:03:12 – How can people follow what you do online?
  • 01:03:59 – Thanks and goodbye

Show Links:

Level up your Python skills with our expert-led courses:

Support the podcast & join our community of Pythonistas

  continue reading

203 ตอน

Artwork
iconแบ่งปัน
 
Manage episode 375236016 series 2637014
เนื้อหาจัดทำโดย Real Python เนื้อหาพอดแคสต์ทั้งหมด รวมถึงตอน กราฟิก และคำอธิบายพอดแคสต์ได้รับการอัปโหลดและจัดหาให้โดยตรงจาก Real Python หรือพันธมิตรแพลตฟอร์มพอดแคสต์ของพวกเขา หากคุณเชื่อว่ามีบุคคลอื่นใช้งานที่มีลิขสิทธิ์ของคุณโดยไม่ได้รับอนุญาต คุณสามารถปฏิบัติตามขั้นตอนที่แสดงไว้ที่นี่ https://th.player.fm/legal

How can you improve a classification model while avoiding overfitting? Once you have a model, what tools can you use to explain it to others? This week on the show, we talk with author and Python trainer Matt Harrison about his new book Effective XGBoost: Tuning, Understanding, and Deploying Classification Models.

Matt talks about the process of developing the book and how he wanted it to be an interactive experience for the reader. He explains the concept of gradient boosting and provides metaphors for developing a model. He shares his appreciation for exploratory data analysis as a crucial step in understanding your data.

He also shares additional libraries to help you explain your model. We discuss how difficult it is to develop the story of how the model works to share it with stakeholders.

He illustrates why covering the complete process is essential, from exploring data and building a model to finally deploying it. He shares many of the tools he found along the way.

This week’s episode is brought to you by Scout APM.

Course Spotlight: Starting With Linear Regression in Python

In this video course, you’ll get started with linear regression in Python. Linear regression is one of the fundamental statistical and machine learning techniques, and Python is a popular choice for machine learning.

Topics:

  • 00:00:00 – Introduction
  • 00:02:16 – Starting on the book
  • 00:04:36 – What is tabular prediction?
  • 00:06:50 – Who could leverage XGBoost?
  • 00:09:46 – Background to get started
  • 00:11:50 – Using XGBoost to explore data
  • 00:21:06 – Sponsor: ScoutAPM
  • 00:21:54 – Focusing on using the tool
  • 00:26:37 – Not being a developer
  • 00:30:53 – Contrasting XGBoost and logistic regression
  • 00:41:57 – Video Course Spotlight
  • 00:43:21 – Using SHAP to explain the model
  • 00:48:06 – Working with hyperparameters
  • 00:51:40 – Deploying your model
  • 00:53:09 – XGBoost Feature Interactions Reshaped (XGBFIR)
  • 00:55:47 – Communicating the story of a model
  • 00:57:57 – How to find the book
  • 00:59:07 – What are you excited about in the world of Python?
  • 01:02:46 – What do you want to learn next?
  • 01:03:12 – How can people follow what you do online?
  • 01:03:59 – Thanks and goodbye

Show Links:

Level up your Python skills with our expert-led courses:

Support the podcast & join our community of Pythonistas

  continue reading

203 ตอน

ทุกตอน

×
 
Loading …

ขอต้อนรับสู่ Player FM!

Player FM กำลังหาเว็บ

 

คู่มืออ้างอิงด่วน