A Generalized Kernel Approach to Dissimilarity-based Classification

EM Pekalska; P Paclik; RPW Duin

A Generalized Kernel Approach to Dissimilarity-based Classification

EM Pekalska, P Paclik, RPW Duin

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Usually, objects to be classified are represented by features. In this paper, we discuss an alternative object representation based on dissimilarity values. If such distances separate the classes well, the nearest neighbor method offers a good solution. However, dissimilarities used in practice are usually far from ideal and the performance of the nearest neighbor rule suffers from its sensitivity to noisy examples. We show that other, more global classification techniques are preferable to the nearest neighbor rule, in such cases. For classification purposes, two different ways of using generalized dissimilarity kernels are considered. In the first one, distances are isometrically embedded in a pseudo-Euclidean space and the classification task is performed there. In the second approach, classifiers are built directly on distance kernels. Both approaches are described theoretically and then compared using experiments with different dissimilarity measures and datasets including degraded data simulating the problem of missing values. Keywords: dissimilarity, embedding, pseudo-Euclidean space, nearest mean classifier, support vector classifier, Fisher linear discriminant

Original language	Undefined/Unknown
Pages (from-to)	175-211
Number of pages	37
Journal	Journal of Machine Learning Research
Volume	2
Issue number	2
Publication status	Published - 2002

Bibliographical note

Special Issue on Kernel Methods, phpub 4

Keywords

academic journal papers
ZX CWTS JFIS < 1.00

Cite this

@article{3bdd485fc090414d96a8ba3602124de4,

title = "A Generalized Kernel Approach to Dissimilarity-based Classification",

abstract = "Usually, objects to be classified are represented by features. In this paper, we discuss an alternative object representation based on dissimilarity values. If such distances separate the classes well, the nearest neighbor method offers a good solution. However, dissimilarities used in practice are usually far from ideal and the performance of the nearest neighbor rule suffers from its sensitivity to noisy examples. We show that other, more global classification techniques are preferable to the nearest neighbor rule, in such cases. For classification purposes, two different ways of using generalized dissimilarity kernels are considered. In the first one, distances are isometrically embedded in a pseudo-Euclidean space and the classification task is performed there. In the second approach, classifiers are built directly on distance kernels. Both approaches are described theoretically and then compared using experiments with different dissimilarity measures and datasets including degraded data simulating the problem of missing values. Keywords: dissimilarity, embedding, pseudo-Euclidean space, nearest mean classifier, support vector classifier, Fisher linear discriminant",

keywords = "academic journal papers, ZX CWTS JFIS < 1.00",

author = "EM Pekalska and P Paclik and RPW Duin",

note = "Special Issue on Kernel Methods, phpub 4",

year = "2002",

language = "Undefined/Unknown",

volume = "2",

pages = "175--211",

journal = "Journal of Machine Learning Research ",

issn = "1532-4435",

publisher = "Microtome Publishing",

number = "2",

}

TY - JOUR

T1 - A Generalized Kernel Approach to Dissimilarity-based Classification

AU - Pekalska, EM

AU - Paclik, P

AU - Duin, RPW

N1 - Special Issue on Kernel Methods, phpub 4

PY - 2002

Y1 - 2002

N2 - Usually, objects to be classified are represented by features. In this paper, we discuss an alternative object representation based on dissimilarity values. If such distances separate the classes well, the nearest neighbor method offers a good solution. However, dissimilarities used in practice are usually far from ideal and the performance of the nearest neighbor rule suffers from its sensitivity to noisy examples. We show that other, more global classification techniques are preferable to the nearest neighbor rule, in such cases. For classification purposes, two different ways of using generalized dissimilarity kernels are considered. In the first one, distances are isometrically embedded in a pseudo-Euclidean space and the classification task is performed there. In the second approach, classifiers are built directly on distance kernels. Both approaches are described theoretically and then compared using experiments with different dissimilarity measures and datasets including degraded data simulating the problem of missing values. Keywords: dissimilarity, embedding, pseudo-Euclidean space, nearest mean classifier, support vector classifier, Fisher linear discriminant

AB - Usually, objects to be classified are represented by features. In this paper, we discuss an alternative object representation based on dissimilarity values. If such distances separate the classes well, the nearest neighbor method offers a good solution. However, dissimilarities used in practice are usually far from ideal and the performance of the nearest neighbor rule suffers from its sensitivity to noisy examples. We show that other, more global classification techniques are preferable to the nearest neighbor rule, in such cases. For classification purposes, two different ways of using generalized dissimilarity kernels are considered. In the first one, distances are isometrically embedded in a pseudo-Euclidean space and the classification task is performed there. In the second approach, classifiers are built directly on distance kernels. Both approaches are described theoretically and then compared using experiments with different dissimilarity measures and datasets including degraded data simulating the problem of missing values. Keywords: dissimilarity, embedding, pseudo-Euclidean space, nearest mean classifier, support vector classifier, Fisher linear discriminant

KW - academic journal papers

KW - ZX CWTS JFIS < 1.00

UR - http://www.ai.mit.edu/projects/jmlr/papers/volume2/pekalska01a/rev1/pekalska01ar.pdf

M3 - Article

SN - 1532-4435

VL - 2

SP - 175

EP - 211

JO - Journal of Machine Learning Research

JF - Journal of Machine Learning Research

IS - 2

ER -

A Generalized Kernel Approach to Dissimilarity-based Classification

Abstract

Bibliographical note

Keywords

Other files and links

Cite this