Document Type


Publication Date



Investigating drivers’ injury level and detecting contributing factors that aggravate the damage level imposed on drivers and vehicles is a critical subject in the field of crash analysis. In this study, a comprehensive vehicle-by-vehicle crash data set is developed by integrating 5 years of data from California crash, vehicles involved, and road databases. The data set is used to model the severity of rear-end crashes for comparing three analytic techniques: multinomial logit, mixed multinomial logit, and support vector machine (SVM). The results of the crash severity models and the role of contributing factors to the severity outcome of rear-end crashes are extensively discussed. In terms of prediction performance, all three models yielded comparable results; although, the SVM performed slightly better than the other two methods. The results from this study will inform aspects of our driver safety education and design, either vehicle or roadway design, required to be improved to alleviate the probability of severe injuries.


This is an Accepted Manuscript of an article published in Journal of Transportation Safety & Security, volume 12, issue 4, in 2020, available online at It may differ slightly from the final version of record.

The Creative Commons license below applies only to this version of the article.


Taylor & Francis

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial 4.0 License



To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.