Passer au contenu

/ Département d'informatique et de recherche opérationnelle

Je donne

Rechercher

Navigation secondaire

Better understanding of modern paradigms in probabilistic models

Better understanding of modern paradigms in probabilistic models

par

Andrej Risteski

Norbert Wiener Fellow at the Institute for Data Science and Statistics (IDSS) and an Instructor of Applied Mathematics at MIT

Jeudi 7 mars, 10:30-12:00Salle 3195, Pavillon André-Aisenstadt

    Université de Montréal, 2920 Chemin de la Tour

Résumé:

In recent years, one of the areas of machine learning that has seen the most exciting progress is unsupervised learning, namely learning in the absence of labels or annotation. An integral part of these advances have been complex probabilistic models for high-dimensional data, capturing different types of intricate latent structure. As a consequence, a lot of statistical and algorithmic issues have emerged, stemming from all major aspects of probabilistic models: representation (expressivity and interpretability of the model), learning (fitting a model from raw data) and inference (probabilistic queries and sampling from a known model).

A common theme is that the models that are used in practice are often intractable in the worst-case (either computationally or statistically), yet even simple algorithms are, to borrow from Wigner, unreasonably effective in practice. It thus behooves us to ask why this happens.

I will showcase some of my research addressing this question, in the context of (i) computationally efficient inference using Langevin dynamics in the presence of multimodality; (ii) statistical guarantees for Learning distributions using GANs (Generative Adversarial etworks); iii) explaining surprising properties of vector representations of words (word embeddings).

 

Biographie :

Andrej Risteski holds a joint position as the Norbert Wiener Fellow at the Institute for Data Science and Statistics (IDSS) and an Instructor of Applied Mathematics at MIT.

Before MIT, he was a PhD student in the Computer Science Department at Princeton University,
working under the advisement of Sanjeev Arora. Prior to that he received his B.S.E. degree at Princeton University as well.

His work lies in the intersection of machine learning and theoretical computer science. The broad goal of his research is theoretically understanding statistical and algorithmic phenomena and problems arising in modern machine learning.

Website: math.mit.edu/~risteski.