Advanced search
1 file | 2.21 MB Add to list

CytoNorm : a normalization algorithm for cytometry data

(2020) CYTOMETRY PART A. 97(3). p.268-278
Author
Organization
Abstract
High-dimensional flow cytometry has matured to a level that enables deep phenotyping of cellular systems at a clinical scale. The resulting high-content data sets allow characterizing the human immune system at unprecedented single cell resolution. However, the results are highly dependent on sample preparation and measurements might drift over time. While various controls exist for assessment and improvement of data quality in a single sample, the challenges of cross-sample normalization attempts have been limited to aligning marker distributions across subjects. These approaches, inspired by bulk genomics and proteomics assays, ignore the single-cell nature of the data and risk the removal of biologically relevant signals. This work proposes CytoNorm, a normalization algorithm to ensure internal consistency between clinical samples based on shared controls across various study batches. Data from the shared controls is used to learn the appropriate transformations for each batch (e.g., each analysis day). Importantly, some sources of technical variation are strongly influenced by the amount of protein expressed on specific cell types, requiring several population-specific transformations to normalize cells from a heterogeneous sample. To address this, our approach first identifies the overall cellular distribution using a clustering step, and calculates subset-specific transformations on the control samples by computing their quantile distributions and aligning them with splines. These transformations are then applied to all other clinical samples in the batch to remove the batch-specific variations. We evaluated the algorithm on a customized data set with two shared controls across batches. One control sample was used for calculation of the normalization transformations and the second control was used as a blinded test set and evaluated with Earth Mover's distance. Additional results are provided using two real-world clinical data sets. Overall, our method compared favorably to standard normalization procedures.
Keywords
FLOW-CYTOMETRY, MASS CYTOMETRY, normalization, mass cytometry, computational cytometry, barcoding

Downloads

  • 4248 19VanGassen.pdf
    • full text (Published version)
    • |
    • open access
    • |
    • PDF
    • |
    • 2.21 MB

Citation

Please use this url to cite or link to this publication:

MLA
Van Gassen, Sofie, et al. “CytoNorm : A Normalization Algorithm for Cytometry Data.” CYTOMETRY PART A, vol. 97, no. 3, 2020, pp. 268–78, doi:10.1002/cyto.a.23904.
APA
Van Gassen, S., Gaudilliere, B., Angst, M. S., Saeys, Y., & Aghaeepour, N. (2020). CytoNorm : a normalization algorithm for cytometry data. CYTOMETRY PART A, 97(3), 268–278. https://doi.org/10.1002/cyto.a.23904
Chicago author-date
Van Gassen, Sofie, Brice Gaudilliere, Martin S Angst, Yvan Saeys, and Nima Aghaeepour. 2020. “CytoNorm : A Normalization Algorithm for Cytometry Data.” CYTOMETRY PART A 97 (3): 268–78. https://doi.org/10.1002/cyto.a.23904.
Chicago author-date (all authors)
Van Gassen, Sofie, Brice Gaudilliere, Martin S Angst, Yvan Saeys, and Nima Aghaeepour. 2020. “CytoNorm : A Normalization Algorithm for Cytometry Data.” CYTOMETRY PART A 97 (3): 268–278. doi:10.1002/cyto.a.23904.
Vancouver
1.
Van Gassen S, Gaudilliere B, Angst MS, Saeys Y, Aghaeepour N. CytoNorm : a normalization algorithm for cytometry data. CYTOMETRY PART A. 2020;97(3):268–78.
IEEE
[1]
S. Van Gassen, B. Gaudilliere, M. S. Angst, Y. Saeys, and N. Aghaeepour, “CytoNorm : a normalization algorithm for cytometry data,” CYTOMETRY PART A, vol. 97, no. 3, pp. 268–278, 2020.
@article{8639582,
  abstract     = {{High-dimensional flow cytometry has matured to a level that enables deep phenotyping of cellular systems at a clinical scale. The resulting high-content data sets allow characterizing the human immune system at unprecedented single cell resolution. However, the results are highly dependent on sample preparation and measurements might drift over time. While various controls exist for assessment and improvement of data quality in a single sample, the challenges of cross-sample normalization attempts have been limited to aligning marker distributions across subjects. These approaches, inspired by bulk genomics and proteomics assays, ignore the single-cell nature of the data and risk the removal of biologically relevant signals. This work proposes CytoNorm, a normalization algorithm to ensure internal consistency between clinical samples based on shared controls across various study batches. Data from the shared controls is used to learn the appropriate transformations for each batch (e.g., each analysis day). Importantly, some sources of technical variation are strongly influenced by the amount of protein expressed on specific cell types, requiring several population-specific transformations to normalize cells from a heterogeneous sample. To address this, our approach first identifies the overall cellular distribution using a clustering step, and calculates subset-specific transformations on the control samples by computing their quantile distributions and aligning them with splines. These transformations are then applied to all other clinical samples in the batch to remove the batch-specific variations. We evaluated the algorithm on a customized data set with two shared controls across batches. One control sample was used for calculation of the normalization transformations and the second control was used as a blinded test set and evaluated with Earth Mover's distance. Additional results are provided using two real-world clinical data sets. Overall, our method compared favorably to standard normalization procedures.}},
  author       = {{Van Gassen, Sofie and Gaudilliere, Brice and Angst, Martin S and Saeys, Yvan and Aghaeepour, Nima}},
  issn         = {{1552-4922}},
  journal      = {{CYTOMETRY PART A}},
  keywords     = {{FLOW-CYTOMETRY,MASS CYTOMETRY,normalization,mass cytometry,computational cytometry,barcoding}},
  language     = {{eng}},
  number       = {{3}},
  pages        = {{268--278}},
  title        = {{CytoNorm : a normalization algorithm for cytometry data}},
  url          = {{http://doi.org/10.1002/cyto.a.23904}},
  volume       = {{97}},
  year         = {{2020}},
}

Altmetric
View in Altmetric
Web of Science
Times cited: