Name: 2019 Joint Statistical Meetings
Start: 2019-07-27T07:00:00+00:00
End: 2019-08-01
Location: Colorado Convention Center

Activity Number:	392 - Large-Scale Data Analysis via Spectral Methods
Type:	Topic Contributed
Date/Time:	Tuesday, July 30, 2019 : 2:00 PM to 3:50 PM
Sponsor:	IMS
Abstract #302984
Title:	Unsupervised Ensemble Learning: a Spectral Approach
Author(s):	Boaz Nadler*
Companies:	Weizmann Institute of Science
Keywords:	ensemble learning; unsupervised learning; low rank matrices; latent variable models
Abstract:	In various applications, one is given the advice or predictions of several classifiers of unknown reliability over multiple questions or queries. This scenario differs from standard supervised learning, where classifier accuracy can be assessed from labeled training or validation data, and it raises several questions. Given only the predictions of several classifiers of unknown accuracy over a large set of unlabeled test data, is it possible to (a) reliably rank them, and (b) construct a meta-classifier more accurate than any individual classifier in the ensemble? In this talk we show that, under various independence assumptions between classifier errors, this high-dimensional data hides simple low-dimensional structures. Exploiting these structures, we present simple spectral methods that address the above questions and derive new unsupervised spectral meta-learners. We prove that these methods are asymptotically consistent when the model assumptions hold, and we present their empirical success on a variety of unsupervised learning problems.
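
To make the idea of a low-dimensional structure concrete, here is a minimal sketch of one simple setting. It is an illustration under stated assumptions and not necessarily the method presented in the talk. The assumptions are: binary labels in {-1, +1}, and classifiers whose errors are conditionally independent given the true label. In that setting the off-diagonal entries of the covariance matrix of the predictions agree with those of a rank-one matrix, and the vector generating it is proportional to (2 x balanced accuracy - 1) for each classifier. The function names, the diagonal-refitting heuristic, and the simulated data are all hypothetical choices made for this example.

```python
import numpy as np


def spectral_meta_learner(preds, n_iter=50):
    """Rank classifiers and combine them without labels (illustrative sketch).

    preds: (m, n) array of +/-1 predictions from m classifiers on n items.
    Returns (weights, ranking, labels).
    """
    cov = np.cov(preds)  # m x m sample covariance; each row is one classifier
    r = cov.copy()
    v = np.zeros(cov.shape[0])
    # Only the off-diagonal entries are assumed to be rank-one, so the
    # diagonal is refitted repeatedly from the current rank-one estimate
    # (a simple heuristic chosen for this sketch).
    for _ in range(n_iter):
        eigvals, eigvecs = np.linalg.eigh(r)  # eigenvalues in ascending order
        v = eigvecs[:, -1] * np.sqrt(max(eigvals[-1], 0.0))
        np.fill_diagonal(r, v ** 2)
    # Resolve the sign ambiguity of the eigenvector by assuming that most
    # classifiers are better than random guessing.
    if v.sum() < 0:
        v = -v
    ranking = np.argsort(-v)  # classifier indices, estimated best first
    labels = np.where(v @ preds >= 0, 1, -1)  # weighted vote
    return v, ranking, labels


def simulate_predictions(accuracies, n, seed=0):
    """Simulate conditionally independent classifiers on balanced +/-1 labels."""
    rng = np.random.default_rng(seed)
    y = rng.choice([-1, 1], size=n)
    acc = np.asarray(accuracies, dtype=float)[:, None]
    correct = rng.random((acc.shape[0], n)) < acc  # True where a prediction is right
    return y, np.where(correct, y, -y)


if __name__ == "__main__":
    y, preds = simulate_predictions([0.80, 0.75, 0.70, 0.65, 0.60, 0.55], n=20000)
    weights, ranking, labels = spectral_meta_learner(preds)
    print("estimated ranking (best first):", ranking)
    print("best single accuracy:", (preds == y).mean(axis=1).max())
    print("meta-classifier accuracy:", (labels == y).mean())
```

The true labels are used here only to score the result on simulated data; the ranking and the weighted vote are computed from the unlabeled predictions alone.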

Authors who are presenting talks have a * after their name.
