TY - JOUR
T1 - FABIA
T2 - Factor analysis for bicluster acquisition
AU - Hochreiter, Sepp
AU - Bodenhofer, Ulrich
AU - Heusel, Martin
AU - Mayr, Andreas
AU - Mitterecker, Andreas
AU - Kasim, Adetayo
AU - Khamiakova, Tatsiana
AU - van Sanden, Suzy
AU - Lin, Dan
AU - Talloen, Willem
AU - Bijnens, Luc
AU - Göhlmann, Hinrich W.H.
AU - Shkedy, Ziv
AU - Clevert, Djork Arné
PY - 2010/4/23
Y1 - 2010/4/23
N2 - Motivation: Biclustering of transcriptomic data groups genes and samples simultaneously. It is emerging as a standard tool for extracting knowledge from gene expression measurements. We propose a novel generative approach for biclustering called 'FABIA: Factor Analysis for Bicluster Acquisition'. FABIA is based on a multiplicative model, which accounts for linear dependencies between gene expression and conditions, and also captures heavy-tailed distributions as observed in real-world transcriptomic data. The generative framework allows to utilize well-founded model selection methods and to apply Bayesian techniques. Results: On 100 simulated datasets with known true, artificially implanted biclusters, FABIA clearly outperformed all 11 competitors. On these datasets, FABIA was able to separate spurious biclusters from true biclusters by ranking biclusters according to their information content. FABIA was tested on three microarray datasets with known subclusters, where it was two times the best and once the second best method among the compared biclustering approaches. Availability: FABIA is available as an R package on Bioconductor (http://www.bioconductor.org). All datasets, results and software are available at http://www.bioinf.jku.at/software/fabia/fabia.html. Contact: [email protected]. Supplementary information: Supplementary data are available at Bioinformatics online.
AB - Motivation: Biclustering of transcriptomic data groups genes and samples simultaneously. It is emerging as a standard tool for extracting knowledge from gene expression measurements. We propose a novel generative approach for biclustering called 'FABIA: Factor Analysis for Bicluster Acquisition'. FABIA is based on a multiplicative model, which accounts for linear dependencies between gene expression and conditions, and also captures heavy-tailed distributions as observed in real-world transcriptomic data. The generative framework allows to utilize well-founded model selection methods and to apply Bayesian techniques. Results: On 100 simulated datasets with known true, artificially implanted biclusters, FABIA clearly outperformed all 11 competitors. On these datasets, FABIA was able to separate spurious biclusters from true biclusters by ranking biclusters according to their information content. FABIA was tested on three microarray datasets with known subclusters, where it was two times the best and once the second best method among the compared biclustering approaches. Availability: FABIA is available as an R package on Bioconductor (http://www.bioconductor.org). All datasets, results and software are available at http://www.bioinf.jku.at/software/fabia/fabia.html. Contact: [email protected]. Supplementary information: Supplementary data are available at Bioinformatics online.
KW - Algorithms
KW - Factor Analysis, Statistical
KW - Gene Expression
KW - Gene Expression Profiling/methods
KW - Oligonucleotide Array Sequence Analysis/methods
KW - Pattern Recognition, Automated
KW - Software
UR - http://www.scopus.com/inward/record.url?scp=77954206273&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btq227
DO - 10.1093/bioinformatics/btq227
M3 - Article
C2 - 20418340
AN - SCOPUS:77954206273
SN - 1367-4803
VL - 26
SP - 1520
EP - 1527
JO - Bioinformatics
JF - Bioinformatics
IS - 12
M1 - btq227
ER -