Twenty Years of Mixture of Experts

doi:10.1109/TNNLS.2012.2200299

Journal Article

Seniha Esen Yuksel, +2 more

11 Jun 2012, IEEE Transactions on Neural Networks, Vol. 23, Iss. 8, pp. 1177-1193

Abstract:

In this paper, we provide a comprehensive survey of the mixture of experts (ME). We discuss the fundamental models for regression and classification and also their training with the expectation-maximization algorithm. We follow the discussion with improvements to the ME model and focus particularly on the mixtures of Gaussian process experts. We provide a review of the literature for other training methods, such as the alternative localized ME training, and cover the variational learning of ME in detail. In addition, we describe the model selection literature, which encompasses finding the optimum number of experts as well as the depth of the tree. We present the advances in ME in the classification area and discuss some issues concerning the classification model. We list the statistical properties of ME, discuss how the model has been modified over the years, compare ME to some popular algorithms, and list several applications. We conclude our survey with future directions and provide a list of publicly available datasets and a list of publicly available software that implements ME. Finally, we provide examples for regression and classification. We believe that the study described in this paper will provide quick access to the relevant literature for researchers and practitioners who would like to improve or use ME, and that it will stimulate further studies in ME.
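The abstract gives no formulas, so the following is only a minimal sketch of what an ME regression model and the E-step of its expectation-maximization training might look like. It assumes a common textbook formulation: a softmax gate over linear scores and linear experts with Gaussian noise of fixed variance. All function names, the scalar input, and the parameter values are hypothetical and chosen for illustration; they are not taken from the survey.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def gaussian_pdf(y, mean, var):
    """Density of a univariate Gaussian N(y | mean, var)."""
    return math.exp(-0.5 * (y - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)

def gate(x, gate_params):
    """Softmax gate: g_i(x) computed from linear scores v_i * x + c_i."""
    return softmax([v * x + c for v, c in gate_params])

def predict(x, gate_params, expert_params):
    """Mixture mean: sum_i g_i(x) * (w_i * x + b_i)."""
    g = gate(x, gate_params)
    return sum(gi * (w * x + b) for gi, (w, b) in zip(g, expert_params))

def responsibilities(x, y, gate_params, expert_params, var=1.0):
    """E-step posterior h_i, proportional to g_i(x) * N(y | w_i * x + b_i, var)."""
    g = gate(x, gate_params)
    joint = [gi * gaussian_pdf(y, w * x + b, var)
             for gi, (w, b) in zip(g, expert_params)]
    total = sum(joint)
    return [j / total for j in joint]

# Hypothetical parameters for two experts (illustrative values only).
gate_params = [(2.0, 0.0), (-2.0, 0.0)]    # (v_i, c_i) per expert
expert_params = [(1.0, 0.0), (-1.0, 0.0)]  # (w_i, b_i) per expert

g = gate(1.5, gate_params)                                   # gate weights at x = 1.5
y_hat = predict(1.5, gate_params, expert_params)             # mixture prediction
h = responsibilities(1.5, 1.4, gate_params, expert_params)   # posterior given y = 1.4
```

In this sketch the gate weights depend only on the input, while the responsibilities also use the observed target, so an expert that explains the target well receives more posterior weight than the gate alone assigns it. An M-step would then refit each expert and the gate using these responsibilities as weights; that step is omitted here.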
