Distance Metric Learning for Large Margin
Nearest Neighbor Classification
 
 
K. Q. Weinberger, J. Blitzer, and L. K. Saul (2006).
In Y. Weiss, B. Schoelkopf, and J. Platt (eds.), Advances in Neural Information Processing Systems 18. MIT Press: Cambridge, MA.
 
Distance Metric Learning for Large Margin
Nearest Neighbor Classification
 
Jan 10, 2006
We show how to learn a Mahanalobis distance metric for k-nearest neighbor (kNN) classification by semidefinite programming. The metric is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin. On seven data sets of varying size and difficulty,we find that metrics trained in this way lead to significant improvements in kNN classification---for example, achieving a test error rate of 1.3% on the MNIST handwritten digits. Our approach has many parallels to support vector machines, including a convex objective function based on the hinge loss, but does not require modifications for problems with large numbers of classes.