Date of Award

Spring 5-2022

Document Type


Degree Name

Master of Science (MS)


Computational and Data Sciences

First Advisor

Erik Linstead, Ph.D.

Second Advisor

Cyril Rakovski, Ph.D.

Third Advisor

Elizabeth Stevens, Ph.D.


Free and open source software for statistical modeling and machine learning have advanced productivity in data science significantly. Packages such as SciPy in Python and caret in R provide fundamental tools for statistical modeling and machine learning in the two most popular programming languages used by data scientists. Unfortunately, robust tools similar to these are limited in terms of causal inference. The tools in R that exist lack consistent and standardized methodologies and inputs. R lacks a comprehensive package that offers traditional causal inference methods such as standardization, IP weighting, G-estimation, outcome regression, and propensity matching in one common package. CausalModels is meant to fill the gap in open source software concerning causal inference. It offers tools for these methods while accounting for biases in observational data without requiring extensive statistical knowledge from the user. For the purposes of this thesis, CausalModels creates a foundation by implementing popular fundamental methods and excludes more advanced methods that may be added over time.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.

Included in

Data Science Commons



To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.