GSO Spring Speaker 2014


On the Computational and Statistical Interface and "Big Data"

Michael I. Jordan
Department of Statistics, University of California, Berkeley

Start Date and Time: Fri, 4 Apr 2014, 10:30 AM

End Date and Time: Fri, 4 Apr 2014, 12:00 PM

Venue: SC239


The rapid growth in the size and scope of datasets in science and technology has created a need for novel foundational perspectives on data analysis that blend the statistical and computational sciences. That classical perspectives from these fields are not adequate to address emerging problems in "Big Data" is apparent from their sharply divergent nature at an elementary level: in computer science, the growth of the number of data points is a source of "complexity" that must be tamed via algorithms or hardware, whereas in statistics, the growth of the number of data points is a source of "simplicity" in that inferences are generally stronger and asymptotic results can be invoked. We wish to blend these perspectives. In this talk we show how statistical decision theory provides a mathematical point of departure for achieving such a blending. We develop theoretical tradeoffs between statistical risk, amount of data and "externalities" such as computation, communication and privacy. We develop procedures that allow one to choose desired operating points along such tradeoff curves. [Joint work with Venkat Chandrasekaran, John Duchi, Martin Wainwright and Yuchen Zhang.]

Last Updated: Sep 19, 2017 8:27 AM

Purdue Department of Statistics, 250 N. University St, West Lafayette, IN 47907

Phone: (765) 494-6030, Fax: (765) 494-0558