An introduction to techniques for the automated and human-assisted analysis of data sets. These "big data" techniques are applied to data sets from multiple disciplines and include cluster, network, and other analytical methods paired with appropriate visualizations.
The course will use selections from the following textbooks (all available for free online or on reserve in the science library):
Mining of Massive Datasets by Anand Rajaraman and Jeffrey D. Ullman.
Visualization Design and Analysis: Abstractions, Principles, and Methods by Tamara Munzner. Available in the science library.
The Elements of Statistical Learning by Trevor Hastie, Robert Tibshirani, and Jerome Friedman.
Interactive Data Visualization for the Web by Scott Murray.