Advanced search
Add to list

Kernel-based data fusion for gene prioritization

(2007) BIOINFORMATICS. 23(13). p.I125-I132
Author
Organization
Abstract
Motivation: Hunting disease genes is a problem of primary importance in biomedical research. Biologists usually approach this problem in two steps: first a set of candidate genes is identified using traditional positional cloning or high- throughput genomics techniques; second, these genes are further investigated and validated in the wet lab, one by one. To speed up discovery and limit the number of costly wet lab experiments, biologists must test the candidate genes starting with the most probable candidates. So far, biologists have relied on literature studies, extensive queries to multiple databases and hunches about expected properties of the disease gene to determine such an ordering. Recently, we have introduced the data mining tool ENDEAVOUR (Aerts et al., 2006), which performs this task automatically by relying on different genome-wide data sources, such as Gene Ontology, literature, microarray, sequence and more. Results: In this article, we present a novel kernel method that operates in the same setting: based on a number of different views on a set of training genes, a prioritization of test genes is obtained. We furthermore provide a thorough learning theoretical analysis of the method's guaranteed performance. Finally, we apply the method to the disease data sets on which ENDEAVOUR (Aerts et al., 2006) has been benchmarked, and report a considerable improvement in empirical performance.
Keywords
SUPPORT, GENOMIC DATA FUSION

Citation

Please use this url to cite or link to this publication:

MLA
De Bie, Tijl, et al. “Kernel-Based Data Fusion for Gene Prioritization.” BIOINFORMATICS, vol. 23, no. 13, 2007, pp. I125–32, doi:10.1093/bioinformatics/btm187.
APA
De Bie, T., Tranchevent, L.-C., Van Oeffelen, L. M., & Moreau, Y. (2007). Kernel-based data fusion for gene prioritization. BIOINFORMATICS, 23(13), I125–I132. https://doi.org/10.1093/bioinformatics/btm187
Chicago author-date
De Bie, Tijl, Leon-Charles Tranchevent, Liesbeth MM Van Oeffelen, and Yves Moreau. 2007. “Kernel-Based Data Fusion for Gene Prioritization.” BIOINFORMATICS 23 (13): I125–32. https://doi.org/10.1093/bioinformatics/btm187.
Chicago author-date (all authors)
De Bie, Tijl, Leon-Charles Tranchevent, Liesbeth MM Van Oeffelen, and Yves Moreau. 2007. “Kernel-Based Data Fusion for Gene Prioritization.” BIOINFORMATICS 23 (13): I125–I132. doi:10.1093/bioinformatics/btm187.
Vancouver
1.
De Bie T, Tranchevent L-C, Van Oeffelen LM, Moreau Y. Kernel-based data fusion for gene prioritization. BIOINFORMATICS. 2007;23(13):I125–32.
IEEE
[1]
T. De Bie, L.-C. Tranchevent, L. M. Van Oeffelen, and Y. Moreau, “Kernel-based data fusion for gene prioritization,” BIOINFORMATICS, vol. 23, no. 13, pp. I125–I132, 2007.
@article{6936504,
  abstract     = {{Motivation: Hunting disease genes is a problem of primary importance in biomedical research. Biologists usually approach this problem in two steps: first a set of candidate genes is identified using traditional positional cloning or high- throughput genomics techniques; second, these genes are further investigated and validated in the wet lab, one by one. To speed up discovery and limit the number of costly wet lab experiments, biologists must test the candidate genes starting with the most probable candidates. So far, biologists have relied on literature studies, extensive queries to multiple databases and hunches about expected properties of the disease gene to determine such an ordering. Recently, we have introduced the data mining tool ENDEAVOUR (Aerts et al., 2006), which performs this task automatically by relying on different genome-wide data sources, such as Gene Ontology, literature, microarray, sequence and more.
 
Results: In this article, we present a novel kernel method that operates in the same setting: based on a number of different views on a set of training genes, a prioritization of test genes is obtained. We furthermore provide a thorough learning theoretical analysis of the method's guaranteed performance. Finally, we apply the method to the disease data sets on which ENDEAVOUR (Aerts et al., 2006) has been benchmarked, and report a considerable improvement in empirical performance.}},
  author       = {{De Bie, Tijl and Tranchevent, Leon-Charles and Van Oeffelen, Liesbeth MM and Moreau, Yves}},
  issn         = {{1367-4803}},
  journal      = {{BIOINFORMATICS}},
  keywords     = {{SUPPORT,GENOMIC DATA FUSION}},
  language     = {{eng}},
  location     = {{Vienna, Austria}},
  number       = {{13}},
  pages        = {{I125--I132}},
  title        = {{Kernel-based data fusion for gene prioritization}},
  url          = {{http://doi.org/10.1093/bioinformatics/btm187}},
  volume       = {{23}},
  year         = {{2007}},
}

Altmetric
View in Altmetric
Web of Science
Times cited: