Due to the popularity of the internet and video capture, sports data are increasingly easy to obtain, and the volume of data is growing explosively. These data typically relate to team, individual, or league competitions and may include a wide variety of features. Collecting and cleaning this type of data is challenging, and the analysis involves sophisticated statistical and machine learning tools. Students are required to be proficient in R and to know how to crawl data from the internet. An interest in sports is preferable but not necessary.
The applicant is mainly responsible for cleaning and analyzing data with statistical software.
The student will gain an understanding of statistical modeling and learn to apply various estimation methodologies.