Introduction to Data Science
Term
Format
Online
Subject Area
Course Number
PSCI 107 900
Course Code
PSCI107900
Course Key
71710
Instructor
Primary Program
Secondary Program
Course Description
Understanding and interpreting large, quantitative data sets is increasingly central in political and social science. Whether one seeks to understand political communication, international trade, inter-group conflict, or other issues, the availability of large quantities of digital data has revolutionized the study of politics. Nonetheless, most data-related courses focus on statistical estimation, rather than on the related but distinctive problems of data acquisition, management and visualization--in a term, data science. This course addresses that imbalance by focusing squarely on data science. Leaving this course, students will be able to acquire, format, analyze, and visualize various types of political data using the statistical programming language R. This course is not a statistics class, but it will increase the capacity of students to thrive in future statistics classes. While no background in statistics or political science is required, students are expected to be generally familiar with contemporary computing environments (e.g. know how to use a computer) and have a willingness to learn a variety of data science tools. You are encouraged (but certainly not required) to register for both this course and PSCI 338 at the same time, as the courses cover distinct, but complimentary material.
Syllabus
Subject Area Vocab