Detailed course offerings (Time Schedule) are available for
DATA 501 Data Science Visualization Lab (1)
The Data Science Visualization Lab class will provide students additional opportunities to practice and discuss data visualization concepts, with additional emphasis on user-centered design approaches and software development. Students will work in small groups on structured data visualization exercises and UCD methods, and implement simple visualizations.
View course details in MyPlan: DATA 501
DATA 511 Data Visualization for Data Scientists (4)
Introduction to the visual tools and techniques used in modern data science to develop and deploy data driven insights. Provides a foundation for visualization to support exploratory analysis, statistical modeling, machine learning, and presentation of results on structured and unstructured data. Students develop and present deep analyses for wider audiences. Course overlaps with: CSE 512; HCDE 511; and IMT 562. Recommended: basic familiarity with computer programming concepts as well as some prior experience working with data using programming languages like R or Python. Offered: A.
View course details in MyPlan: DATA 511
DATA 512 Human-Centered Data Science (5)
Fundamental principles of data science and its human implications. Data ethics, data privacy, differential privacy, algorithmic bias, legal frameworks and intellectual property, provenance and reproducibility, data curation and preservation, user experience design and usability testing for big data, ethics of crowdwork, data communication and societal impacts of data science. Offered: A.
View course details in MyPlan: DATA 512
DATA 514 Data Management for Data Science (5)
Introduces database management systems and techniques that use such systems; data models, query languages, database tuning and optimization, data warehousing, and parallel processing. Intended for professional students and non-CSE-majors. Offered: jointly with CSE D 514.
View course details in MyPlan: DATA 514
DATA 515 Software Design for Data Science (5)
Introduces software design and engineering practices and concepts, including version control, testing, and automatic build management. Intended for professional students and non-CSE-majors. Offered: jointly with CSE D 515.
View course details in MyPlan: DATA 515
DATA 516 Scalable Data Systems and Algorithms (5)
Principles and algorithms for data management and analysis at scale. Designs of traditional and modern big data systems and how to use those systems. Basics of cloud computing. Prerequisite: either CSE D 514/DATA 514 and CSE D 515/DATA 515, or permission of instructor.
Offered: jointly with CSE D 516; A.
View course details in MyPlan: DATA 516
DATA 556 Introduction to Statistics and Probability (5)
Overview of probability; conditional probability and independence; Bayes Theorem; discrete and continuous random variables including jointly distributed; key distributions including the normal and its spin offs; properties of expectation and variance; conditional expectation; covariance and correlation; Central Limit Theorem; law of large numbers; Parameter Estimation. Offered: jointly with BIOST 556/STAT 556; A.
View course details in MyPlan: DATA 556
DATA 557 Applied Statistics and Experimental Design (5)
Inferential statistical methods for discrete and continuous random variables including tests for difference in means and proportions; linear and logistic regression; causation versus correlation; confounding; resampling methods; study design. Prerequisite: either STAT 556/BIOST 556/DATA 556 or permission of instructor. Offered: jointly with BIOST 557/STAT 557; W.
View course details in MyPlan: DATA 557
DATA 558 Statistical Machine Learning for Data Scientists (5)
Bias-variance trade-off; training versus test error; overfitting; cross-validation; subset selection methods; regularized approaches for linear/logistic regression: ridge and lasso; non-parametric regression: trees, bagging, random forests; local regression and splines; generalized additive models; support vector machines; k-means and hierarchical clustering; principal components analysis. Prerequisite: STAT/BIOST/DATA 557, or permission of instructor. Offered: jointly with BIOST 558/STAT 558; Sp.
View course details in MyPlan: DATA 558
DATA 590 Data Science Capstone I- Project Preparation (2)
Part one of a two-course capstone sequence where students organize project teams, select project topics, write a project proposal and begin preparing project data sets. Prerequisite: either permission of instructor, or DATA 511; CSE D 514/DATA 514; CSE D 515/DATA 515; STAT 556/BIOST 556/DATA 556; BIOST 557/STAT 557/DATA 557; and STAT 558/BIOST 558/DATA 558.
View course details in MyPlan: DATA 590
DATA 591 Data Science Capstone II- Project Implementation (3)
Part two of a two-course capstone sequence designed to build upon the student driven project from DATA 590. Students will synthesize and apply knowledge and techniques acquired throughout the MSDS program in working with large data sets, deriving insights from data and sharing insights with other people. Prerequisite: either permission of instructor, or DATA 511; DATA 512; CSE D 514/DATA 514; CSE D 515/DATA 515; CSE D 516/DATA 516; STAT 556/BIOST 556/DATA 556; BIOST 557/STAT 557/DATA 557; STAT 558/BIOST 558/DATA 558; and DATA 590.
View course details in MyPlan: DATA 591
DATA 598 Special Topics in Data Science (1-5, max. 15)
Introduction to innovative and specialized topics relating to the study of data science in any context.
View course details in MyPlan: DATA 598