
Dataset Shift in Machine Learning
Published on 12. December 2008
Book
Hardback
248 pages
978-0-262-17005-5 (ISBN)
Description
An overview of recent efforts in the machine learning community to deal with dataset and covariate shift, which occurs when test and training inputs and outputs have different distributions.Dataset shift is a common problem in predictive modeling that occurs when the joint distribution of inputs and outputs differs between training and test stages. Covariate shift, a particular case of dataset shift, occurs when only the input distribution changes. Dataset shift is present in most practical applications, for reasons ranging from the bias introduced by experimental design to the irreproducibility of the testing conditions at training time. (An example is -email spam filtering, which may fail to recognize spam that differs in form from the spam the automatic filter has been built on.) Despite this, and despite the attention given to the apparently similar problems of semi-supervised learning and active learning, dataset shift has received relatively little attention in the machine learning community until recently. This volume offers an overview of current efforts to deal with dataset and covariate shift. The chapters offer a mathematical and philosophical introduction to the problem, place dataset shift in relationship to transfer learning, transduction, local learning, active learning, and semi-supervised learning, provide theoretical views of dataset and covariate shift (including decision theoretic and Bayesian perspectives), and present algorithms for covariate shift.Contributors
Shai Ben-David, Steffen Bickel, Karsten Borgwardt, Michael Brückner, David Corfield, Amir Globerson, Arthur Gretton, Lars Kai Hansen, Matthias Hein, Jiayuan Huang, Choon Hui Teo, Takafumi Kanamori, Klaus-Robert Müller, Sam Roweis, Neil Rubens, Tobias Scheffer, Marcel Schmittfull, Bernhard Schölkopf Hidetoshi Shimodaira, Alex Smola, Amos Storkey, Masashi Sugiyama
Shai Ben-David, Steffen Bickel, Karsten Borgwardt, Michael Brückner, David Corfield, Amir Globerson, Arthur Gretton, Lars Kai Hansen, Matthias Hein, Jiayuan Huang, Choon Hui Teo, Takafumi Kanamori, Klaus-Robert Müller, Sam Roweis, Neil Rubens, Tobias Scheffer, Marcel Schmittfull, Bernhard Schölkopf Hidetoshi Shimodaira, Alex Smola, Amos Storkey, Masashi Sugiyama
More details
Series
Language
English
Place of publication
Cambridge
United States
Publishing group
MIT Press Ltd
Target group
Professional and scholarly
US School Grade: From College Freshman to College Graduate Student
Product notice
Cloth over boards
Dimensions
Height: 254 mm
Width: 203 mm
Thickness: 17 mm
Weight
703 gr
ISBN-13
978-0-262-17005-5 (9780262170055)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Persons
Anton Schwaighofer is an Applied Researcherin the Online Services and Advertising Group at Microsoft Research, Cambridge, U.K.
Joaquin Quiñonero-Candela is a Researcher in the Online Services and Advertising Group at Microsoft Research Cambridge, U.K. Masashi Sugiyama is Associate Professor in the Department of Computer Science at the Tokyo Institute of Technology. Anton Schwaighofer is an Applied Researcherin the Online Services and Advertising Group at Microsoft Research, Cambridge, U.K. Neil D. Lawrence is Senior Research Fellow and Member of the Machine Learning and Optimisation Research Group in the School of Computer Science at the University of Manchester.
Joaquin Quiñonero-Candela is a Researcher in the Online Services and Advertising Group at Microsoft Research Cambridge, U.K. Masashi Sugiyama is Associate Professor in the Department of Computer Science at the Tokyo Institute of Technology. Anton Schwaighofer is an Applied Researcherin the Online Services and Advertising Group at Microsoft Research, Cambridge, U.K. Neil D. Lawrence is Senior Research Fellow and Member of the Machine Learning and Optimisation Research Group in the School of Computer Science at the University of Manchester.
Editor
Microsoft Research Ltd.
Associate ProfessorTokyo Institute of Technology
Microsoft Research Ltd.
The University of Sheffield
Contributions
University of Edinburgh
University of Kent
Universitaet des Saarlandes
Technical University of Denmark
University of Waterloo
Nagoya University