HOME > Development > Baseball Data Wrangling with Vagrant, R, and Retrosheet

Baseball Data Wrangling with Vagrant, R, and Retrosheet

  • Development
  • Mar 20, 2025
SynopsisBaseball Data Wrangling with Vagrant, R, and Retrosheet, avai...
Baseball Data Wrangling with Vagrant, R, and Retrosheet  No.1

Baseball Data Wrangling with Vagrant, R, and Retrosheet, available at Free, has an average rating of 4.35, with 28 lectures, based on 167 reviews, and has 12998 subscribers.

You will learn about install VirtualBox and Vagrant run a virtual Linux machine install the Chadwick software tools extract game and play-by-play baseball data from Retrosheet files produce graphs with ggplot This course is ideal for individuals who are This course is for those interested in doing baseball analytics with Retrosheet files. or No background is needed for the first part of the course. A background in the R package dplyr is necessary to follow the second part of the course. It is particularly useful for This course is for those interested in doing baseball analytics with Retrosheet files. or No background is needed for the first part of the course. A background in the R package dplyr is necessary to follow the second part of the course.

Enroll now: Baseball Data Wrangling with Vagrant, R, and Retrosheet

Summary

Title: Baseball Data Wrangling with Vagrant, R, and Retrosheet

Price: Free

Average Rating: 4.35

Number of Lectures: 28

Number of Published Lectures: 28

Number of Curriculum Items: 28

Number of Published Curriculum Objects: 28

Original Price: Free

Quality Status: approved

Status: Live

What You Will Learn

  • install VirtualBox and Vagrant
  • run a virtual Linux machine
  • install the Chadwick software tools
  • extract game and play-by-play baseball data from Retrosheet files
  • produce graphs with ggplot
  • Who Should Attend

  • This course is for those interested in doing baseball analytics with Retrosheet files.
  • No background is needed for the first part of the course. A background in the R package dplyr is necessary to follow the second part of the course.
  • Target Audiences

  • This course is for those interested in doing baseball analytics with Retrosheet files.
  • No background is needed for the first part of the course. A background in the R package dplyr is necessary to follow the second part of the course.
  • This course is for those interested in doing baseball analytics with the Retrosheet game-by-game and play-by-play data. The main tools for working with such data are in the Chadwick software. We install a virtual Linux machine, on which we will install the Chadwick software. We will then learn how to extract baseball data with the Chadwick software, how to further filter the data with dplyr in R, and how to plot our results with ggplot.

    For the first part of the course, in which we install the virtual Linux machine and learn how to work with the Chadwick software, there are no prerequisites. To follow the second part of the course, knowledge of dplyr is necessary. This can be obtained through my course Baseball Database Queries with SQL and dplyr.

    At a relaxed pace, the course should take two to three weeks to complete.

    Course Curriculum

    Chapter 1: Setting up Vagrant

    Lecture 1: Introduction

    Lecture 2: Installing VirtualBox

    Lecture 3: Installing Vagrant

    Lecture 4: Creating a Project Folder

    Lecture 5: Vagrant Up

    Lecture 6: Directory Structure

    Chapter 2: Installing and Working with the Chadwick Software

    Lecture 1: Downloading the Chadwick Software

    Lecture 2: Installing the Chadwick Software

    Lecture 3: The Retrosheet Files

    Lecture 4: cwevent and cwgame

    Chapter 3: Project #1: Mike Schmidt and Greg Luzinski

    Lecture 1: Data Extraction

    Lecture 2: Reading our data into R

    Lecture 3: The Result Column

    Lecture 4: The Date Column

    Lecture 5: The Date Column Part II

    Lecture 6: The Player Data Frames

    Lecture 7: ggplot Crash Course

    Lecture 8: Cumulative Home Run Plots

    Lecture 9: Colors and Legend

    Chapter 4: Project #2: Dykstra, Murray, and Brett

    Lecture 1: Project Description

    Lecture 2: Data Extraction

    Lecture 3: Reading the data into R

    Lecture 4: The Date Column

    Lecture 5: The Result and AB Columns

    Lecture 6: The Player Data Frames

    Lecture 7: The Plots

    Lecture 8: The Four-Hundred Line

    Lecture 9: The Marchi/Albert Book and Course Wrap-Up

    Instructors

  • Baseball Data Wrangling with Vagrant, R, and Retrosheet  No.2
    Charles Redmond
    Professor at Mercyhurst University
  • Rating Distribution

  • 1 stars: 1 votes
  • 2 stars: 2 votes
  • 3 stars: 12 votes
  • 4 stars: 45 votes
  • 5 stars: 107 votes
  • Frequently Asked Questions

    How long do I have access to the course materials?

    You can view and review the lecture materials indefinitely, like an on-demand channel.

    Can I take my courses with me wherever I go?

    Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!