Date of Award

Spring 5-2020

Document Type


Degree Name

Doctor of Philosophy (PhD)


Computational and Data Sciences

First Advisor

Dr. Cyril Rakovski

Second Advisor

Dr. Vincent Berardi

Third Advisor

Dr. Adrian Vajiac


Baseball has become one of the most heavily analyzed sports, with significant growth in analytics over the last 20 years [1]. The enormous amount of data collected every game requires professional teams to maintain a state-of-the-art analytics staff in order to compete. Statcast, introduced in 2015, "allows for the collection and analysis of a massive amount of baseball data, in ways that were never possible in the past" [2]. Using Statcast data, which is updated every pitch, a novel metric, Pitcher Effectiveness, was developed that updates dynamically throughout a game. In combination with its rate of change, the metric was shown to be predictive of runs, and it was effective in evaluating a starting pitcher both at the game level and overall. Baseball can be modeled as a Markov chain with 24 states, defined by the combination of outs and baserunners; throughout a game, teams transition from one base/out state to another when events such as hits, outs, and walks occur [3]. Using this idea, pitch sequencing was explored at the micro level of each individual state. Looking at the last three pitches in a sequence, certain sequences in particular states were shown to have some power in predicting outs, hits, and strikeouts. In addition, proportion tests showed significant differences in the proportions of outs and strikeouts produced by sequences, depending on the base/out state. From fantasy baseball to Major League Baseball (MLB) front offices, projections of players’ future performance are important and frequently studied. Several machine learning methods were explored for projecting future weighted on-base average (wOBA) [3]. These methods were evaluated, and the best were compared to 2020 projections from the reputable Steamer projection system [4].
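The 24 base/out states mentioned in the abstract arise from 3 possible out counts (0, 1, or 2) combined with 2^3 = 8 ways that first, second, and third base can be occupied. The sketch below enumerates them and illustrates a single transition. It is only an illustration under stated assumptions: the names `base_out_states` and `apply_walk`, the tuple encoding of a state, and the choice of a walk as the example event are hypothetical and are not taken from the dissertation.

```python
from itertools import product

BASES = ("1B", "2B", "3B")


def base_out_states():
    """Enumerate (outs, runners) states: 3 out counts x 2^3 base occupancies."""
    states = []
    for outs in range(3):
        for occupied in product((False, True), repeat=3):
            runners = tuple(b for b, on in zip(BASES, occupied) if on)
            states.append((outs, runners))
    return states


STATES = base_out_states()


def apply_walk(state):
    """Hypothetical transition for a walk (illustration only).

    The batter takes first base and only forced runners advance.
    Returns (new_state, runs_scored).
    """
    outs, runners = state
    on = set(runners)
    runs = 0
    if "1B" in on:
        if "2B" in on:
            if "3B" in on:
                runs = 1  # bases loaded: the runner on third is forced home
            on.add("3B")  # runner on second is forced to third
        on.add("2B")  # runner on first is forced to second
    on.add("1B")  # batter takes first
    return (outs, tuple(b for b in BASES if b in on)), runs


if __name__ == "__main__":
    print(len(STATES))                    # number of base/out states
    print(apply_walk((1, ("1B", "3B"))))  # walk with runners on the corners
```

A full Markov chain model would attach an estimated probability to every such transition, including the absorbing third-out state that ends the half-inning; this sketch covers only the state space and one deterministic event.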

Creative Commons License

This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.


