Skip to main content

Track overview

Data Analysis With Statcast

5 units • 45 lessons

Construct reproducible data workflows from event-level baseball data.

What to expect
Confirm listed prerequisites for this track before starting.
Time commitment
Self-paced. Phase length scales with lesson count; backend catalog stores per-lesson minute estimates for dashboard scale.
Prerequisites
Python or notebook workflow
Learner outcomes
Engineer and validate features without leakage.

Before you open a unit

  • Confirm listed prerequisites for this track before starting.
  • This phase: open and complete all 10 linked lessons; pass each lesson's Final Mastery Checkpoint when present before marking complete.
  • Data QA audits

Unit 1: Statcast Schema And Data Cleaning

10 lessons

Confirm listed prerequisites for this track before starting.

Self-paced. Phase length scales with lesson count; backend catalog stores per-lesson minute estimates for dashboard scale.

Unit 2: Feature Engineering For Ball Flight And HR Outcomes

10 lessons

Prior phase: all 10 lesson(s) completed with checkpoints passed where the lesson includes them.

Self-paced. Phase length scales with lesson count; backend catalog stores per-lesson minute estimates for dashboard scale.

Unit 3: Visualization Design For Baseball Questions

8 lessons

Prior phase: all 10 lesson(s) completed with checkpoints passed where the lesson includes them.

Self-paced. Phase length scales with lesson count; backend catalog stores per-lesson minute estimates for dashboard scale.

Unit 4: Reproducible Pipelines And Notebooks

9 lessons

Prior phase: all 8 lesson(s) completed with checkpoints passed where the lesson includes them.

Self-paced. Phase length scales with lesson count; backend catalog stores per-lesson minute estimates for dashboard scale.

Unit 5: Applied Projects — Recreating Deadball Tracker Figures

8 lessons

Prior phase: all 9 lesson(s) completed with checkpoints passed where the lesson includes them.

Self-paced. Phase length scales with lesson count; backend catalog stores per-lesson minute estimates for dashboard scale.