The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier

CJ Veenman, MJT Reinders

Research output: Contribution to journalArticleScientificpeer-review

125 Citations (Scopus)

Abstract

We present the Nearest Subclass Classifier (NSC), which is a classification algorithm that unifies the flexibility of the nearest neighbor classifier with the robustness of the nearest mean classifier. The algorithm is based on the Maximum Variance Cluster algorithm and, as such, it belongs to the class of prototype-based classifiers. The variance constraint parameter of the cluster algorithm serves to regularize the classifier, that is, to prevent overfitting. With a low variance constraint value, the classifier turns into the nearest neighbor classifier and, with a high variance parameter, it becomes the nearest mean classifier with the respective properties. In other words, the number of prototypes ranges from the whole training set to only one per class. In the experiments, we compared the NSC with regard to its performance and data set compression ratio to several other prototype-based methods. On several data sets, the NSC performed similarly to the k-nearest neighbor classifier, which is a well-established classifier in many domains. Also concerning storage requirements and classification speed, the NSC has favorable properties, so it gives a good compromise between classification performance and efficiency.
Original languageUndefined/Unknown
Pages (from-to)1417-1429
Number of pages13
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume27
Issue number9
DOIs
Publication statusPublished - 2005

Keywords

  • academic journal papers
  • ZX CWTS 1.00 <= JFIS < 3.00

Cite this