
A scalable approach to fuzzy rough nearest neighbour classification with ordered weighted averaging operators

Oliver Urs Lenz (UGent), Daniel Peralta (UGent) and Chris Cornelis (UGent)
Abstract
Fuzzy rough sets have been successfully applied in classification tasks, in particular in combination with OWA operators. There has been a lot of research into adapting algorithms for use with Big Data through parallelisation, but no concrete strategy exists to design a Big Data fuzzy rough sets based classifier. Existing Big Data approaches use fuzzy rough sets for feature and prototype selection, and have often not involved very large datasets. We fill this gap by presenting the first Big Data extension of an algorithm that uses fuzzy rough sets directly to classify test instances, a distributed implementation of FRNN-OWA in Apache Spark. Through a series of systematic tests involving generated datasets, we demonstrate that it can achieve a speedup effectively equal to the number of computing cores used, meaning that it can scale to arbitrarily large datasets.
Keywords
fuzzy rough sets, OWA operators, Big Data, Apache Spark

Downloads

  • lenz et al 2019 a scalable approach.pdf
    • full text (Accepted manuscript) | open access | PDF | 248.20 KB

Citation

MLA
Lenz, Oliver Urs, et al. “A Scalable Approach to Fuzzy Rough Nearest Neighbour Classification with Ordered Weighted Averaging Operators.” Rough Sets. International Joint Conference, IJCRS 2019, edited by Tamás Mihálydeák et al., vol. 11499, Springer Nature, 2019, pp. 197–209.
APA
Lenz, O. U., Peralta, D., & Cornelis, C. (2019). A scalable approach to fuzzy rough nearest neighbour classification with ordered weighted averaging operators. In T. Mihálydeák, F. Min, G. Wang, M. Banerjee, I. Düntsch, Z. Suraj, & D. Ciucci (Eds.), Rough sets. International joint conference, IJCRS 2019 (Vol. 11499, pp. 197–209). Cham, Switzerland: Springer Nature.
Chicago author-date
Lenz, Oliver Urs, Daniel Peralta, and Chris Cornelis. 2019. “A Scalable Approach to Fuzzy Rough Nearest Neighbour Classification with Ordered Weighted Averaging Operators.” In Rough Sets. International Joint Conference, IJCRS 2019, edited by Tamás Mihálydeák, Fan Min, Guoyin Wang, Mohua Banerjee, Ivo Düntsch, Zbigniew Suraj, and Davide Ciucci, 11499:197–209. Cham, Switzerland: Springer Nature.
Chicago author-date (all authors)
Lenz, Oliver Urs, Daniel Peralta, and Chris Cornelis. 2019. “A Scalable Approach to Fuzzy Rough Nearest Neighbour Classification with Ordered Weighted Averaging Operators.” In Rough Sets. International Joint Conference, IJCRS 2019, ed. by Tamás Mihálydeák, Fan Min, Guoyin Wang, Mohua Banerjee, Ivo Düntsch, Zbigniew Suraj, and Davide Ciucci, 11499:197–209. Cham, Switzerland: Springer Nature.
Vancouver
1. Lenz OU, Peralta D, Cornelis C. A scalable approach to fuzzy rough nearest neighbour classification with ordered weighted averaging operators. In: Mihálydeák T, Min F, Wang G, Banerjee M, Düntsch I, Suraj Z, et al., editors. Rough sets. International joint conference, IJCRS 2019. Cham, Switzerland: Springer Nature; 2019. p. 197–209.
IEEE
[1] O. U. Lenz, D. Peralta, and C. Cornelis, “A scalable approach to fuzzy rough nearest neighbour classification with ordered weighted averaging operators,” in Rough sets. International joint conference, IJCRS 2019, Debrecen, Hungary, 2019, vol. 11499, pp. 197–209.
BibTeX
@inproceedings{8612581,
  abstract     = {Fuzzy rough sets have been successfully applied in classification tasks, in particular in combination with OWA operators. There has been a lot of research into adapting algorithms for use with Big Data through parallelisation, but no concrete strategy exists to design a Big Data fuzzy rough sets based classifier. Existing Big Data approaches use fuzzy rough sets for feature and prototype selection, and have often not involved very large datasets. We fill this gap by presenting the first Big Data extension of an algorithm that uses fuzzy rough sets directly to classify test instances, a distributed implementation of FRNN-OWA in Apache Spark. Through a series of systematic tests involving generated datasets, we demonstrate that it can achieve a speedup effectively equal to the number of computing cores used, meaning that it can scale to arbitrarily large datasets.},
  author       = {Lenz, Oliver Urs and Peralta, Daniel and Cornelis, Chris},
  booktitle    = {Rough sets. International joint conference, IJCRS 2019},
  editor       = {Mihálydeák, Tamás and Min, Fan and Wang, Guoyin and Banerjee, Mohua and Düntsch, Ivo and Suraj, Zbigniew and Ciucci, Davide},
  isbn         = {9783030228149},
  issn         = {0302-9743},
  keywords     = {fuzzy rough sets,OWA operators,Big Data,Apache Spark},
  language     = {eng},
  location     = {Debrecen, Hungary},
  pages        = {197--209},
  publisher    = {Springer Nature},
  title        = {A scalable approach to fuzzy rough nearest neighbour classification with ordered weighted averaging operators},
  url          = {http://dx.doi.org/10.1007/978-3-030-22815-6_16},
  volume       = {11499},
  year         = {2019},
}
