The course provides students a hands-on introduction to data science, with applied examples of data collection, processing, transformation, management and analysis. Students will explore key concepts related to data science, including applied statistics, information visualization, text mining and machine learning. R, the open source statistical analysis and visualization system, will be used throughout the course. R is reckoned by many to be the most popular choice among data analysts worldwide; having knowledge and skill with using it is considered a valuable and marketable job skill for most data scientists. Students will also learn how to use supervised and unsupervised machine learning techniques. They will focus on structured data, using R (e.g., support vector machines, association rules mining) in conjunction with learning the full life cycle of data science.
Current Semester Syllabus: Winter 2020