Advanced search

Accurate statistical approaches for generating representative workload compositions

Author
Organization
Abstract
Composing a representative workload is a crucial step during the design process of a microprocessor. The workload should be composed in such a way that it is representative for the target domain of application and yet, the amount of redundancy in the workload should be minimized as much as possible in order not to overly increase the total simulation time. As a result, there is an important trade-off that needs to be made between workload representativeness and simulation accuracy versus simulation speed. Previous work used statistical data analysis techniques to identify representative benchmarks and corresponding inputs, also called a subset, from a large set of potential benchmarks and inputs. These methodologies measure a number of program characteristics on which Principal Components Analysis (PCA) is applied before identifying distinct program behaviors among the benchmarks using cluster analysis. In this paper we propose Independent Components Analysis (ICA) as a better alternative to PCA as it does not assume that the original data set has a Gaussian distribution, which allows ICA to better find the important axes in the workload space. Our experimental results using SPEC CPU2000 benchmarks show that ICA significantly outperforms PCA in that ICA achieves smaller benchmark subsets that are more accurate than those found by PCA.
Keywords
ALGORITHMS, INDEPENDENT COMPONENT ANALYSIS

Citation

Please use this url to cite or link to this publication:

Chicago
Eeckhout, Lieven, R Sundareswara, J Yi, D Lilja, and P Schrater. 2005. “Accurate Statistical Approaches for Generating Representative Workload Compositions.” In International Symposium on Workload Characterization Proceedings, 56–66. Los Alamitos, CA, USA: IEEE Computer Society.
APA
Eeckhout, L., Sundareswara, R., Yi, J., Lilja, D., & Schrater, P. (2005). Accurate statistical approaches for generating representative workload compositions. International Symposium on Workload Characterization Proceedings (pp. 56–66). Presented at the IEEE International Symposium on Workload Characterization, Los Alamitos, CA, USA: IEEE Computer Society.
Vancouver
1.
Eeckhout L, Sundareswara R, Yi J, Lilja D, Schrater P. Accurate statistical approaches for generating representative workload compositions. International Symposium on Workload Characterization Proceedings. Los Alamitos, CA, USA: IEEE Computer Society; 2005. p. 56–66.
MLA
Eeckhout, Lieven, R Sundareswara, J Yi, et al. “Accurate Statistical Approaches for Generating Representative Workload Compositions.” International Symposium on Workload Characterization Proceedings. Los Alamitos, CA, USA: IEEE Computer Society, 2005. 56–66. Print.
@inproceedings{405689,
  abstract     = {Composing a representative workload is a crucial step during the design process of a microprocessor. The workload should be composed in such a way that it is representative for the target domain of application and yet, the amount of redundancy in the workload should be minimized as much as possible in order not to overly increase the total simulation time. As a result, there is an important trade-off that needs to be made between workload representativeness and simulation accuracy versus simulation speed.
Previous work used statistical data analysis techniques to identify representative benchmarks and corresponding inputs, also called a subset, from a large set of potential benchmarks and inputs. These methodologies measure a number of program characteristics on which Principal Components Analysis (PCA) is applied before identifying distinct program behaviors among the benchmarks using cluster analysis. In this paper we propose Independent Components Analysis (ICA) as a better alternative to PCA as it does not assume that the original data set has a Gaussian distribution, which allows ICA to better find the important axes in the workload space. Our experimental results using SPEC CPU2000 benchmarks show that ICA significantly outperforms PCA in that ICA achieves smaller benchmark subsets that are more accurate than those found by PCA.},
  author       = {Eeckhout, Lieven and Sundareswara, R and Yi, J and Lilja, D and Schrater, P},
  booktitle    = {International Symposium on Workload Characterization Proceedings},
  isbn         = {0-7803-9461-5},
  keywords     = {ALGORITHMS,INDEPENDENT COMPONENT ANALYSIS},
  language     = {eng},
  location     = {Austin, TX, USA},
  pages        = {56--66},
  publisher    = {IEEE Computer Society},
  title        = {Accurate statistical approaches for generating representative workload compositions},
  year         = {2005},
}

Web of Science
Times cited: