Advanced search
1 file | 1.10 MB

Bayesian clustering of fuzzy feature vectors using a quasi-likelihood approach

Author
Organization
Abstract
Bayesian model-based classifiers, both unsupervised and supervised, have been studied extensively, and their value and versatility have been demonstrated on a wide spectrum of applications within science and engineering. A majority of the classifiers are built on the assumption of intrinsic discreteness of the considered data features or on their discretization prior to the modeling. On the other hand, Gaussian mixture classifiers have also been utilized to a large extent for continuous features in the Bayesian framework. Often, the primary reason for discretization in the classification context is the simplification of the analytical and numerical properties of the Bayesian models. However, the discretization can be problematic due to its ad hoc nature and the decreased statistical power to detect the correct classes ( or clusters) in the resulting procedure. Here, we introduce an unsupervised classification approach for fuzzy feature vectors that utilizes a discrete model structure while preserving the continuous characteristics of data. This goal is achieved by replacing the ordinary likelihood by a binomial quasi-likelihood to yield an analytical expression for the posterior probability of a given clustering solution. The resulting model can also be justified from an information-theoretic perspective. Our method is shown to yield highly accurate clusterings for challenging synthetic and empirical data sets and to perform favorably compared to some alternative approaches.
Keywords
fuzzy modeling, continuous data, quasi-likelihood, Bayesian clustering

Downloads

  • (...).pdf
    • full text
    • |
    • UGent only
    • |
    • PDF
    • |
    • 1.10 MB

Citation

Please use this url to cite or link to this publication:

Chicago
Marttinen, Pekka, Jing Tang, Bernard De Baets, Peter Dawyndt, and Jukka Corander. 2009. “Bayesian Clustering of Fuzzy Feature Vectors Using a Quasi-likelihood Approach.” IEEE Transactions on Pattern Analysis and Machine Intelligence 31 (1): 74–85.
APA
Marttinen, P., Tang, J., De Baets, B., Dawyndt, P., & Corander, J. (2009). Bayesian clustering of fuzzy feature vectors using a quasi-likelihood approach. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(1), 74–85.
Vancouver
1.
Marttinen P, Tang J, De Baets B, Dawyndt P, Corander J. Bayesian clustering of fuzzy feature vectors using a quasi-likelihood approach. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2009;31(1):74–85.
MLA
Marttinen, Pekka et al. “Bayesian Clustering of Fuzzy Feature Vectors Using a Quasi-likelihood Approach.” IEEE Transactions on Pattern Analysis and Machine Intelligence 31.1 (2009): 74–85. Print.
@article{693700,
  abstract     = {Bayesian model-based classifiers, both unsupervised and supervised, have been studied extensively, and their value and versatility have been demonstrated on a wide spectrum of applications within science and engineering. A majority of the classifiers are built on the assumption of intrinsic discreteness of the considered data features or on their discretization prior to the modeling. On the other hand, Gaussian mixture classifiers have also been utilized to a large extent for continuous features in the Bayesian framework. Often, the primary reason for discretization in the classification context is the simplification of the analytical and numerical properties of the Bayesian models. However, the discretization can be problematic due to its ad hoc nature and the decreased statistical power to detect the correct classes ( or clusters) in the resulting procedure. Here, we introduce an unsupervised classification approach for fuzzy feature vectors that utilizes a discrete model structure while preserving the continuous characteristics of data. This goal is achieved by replacing the ordinary likelihood by a binomial quasi-likelihood to yield an analytical expression for the posterior probability of a given clustering solution. The resulting model can also be justified from an information-theoretic perspective. Our method is shown to yield highly accurate clusterings for challenging synthetic and empirical data sets and to perform favorably compared to some alternative approaches.},
  author       = {Marttinen, Pekka and Tang, Jing and De Baets, Bernard and Dawyndt, Peter and Corander, Jukka},
  issn         = {0162-8828},
  journal      = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
  language     = {eng},
  number       = {1},
  pages        = {74--85},
  title        = {Bayesian clustering of fuzzy feature vectors using a quasi-likelihood approach},
  url          = {http://dx.doi.org/10.1109/TPAMI.2008.53},
  volume       = {31},
  year         = {2009},
}

Altmetric
View in Altmetric
Web of Science
Times cited: