Academic Journals Database
Disseminating quality controlled scientific knowledge

FDR-FET: an optimizing gene set enrichment analysis method

ADD TO MY LIST
 
Author(s): Rui-Ru Ji | Karl-Heinz Ott | Roumyana Yordanova | et al

Journal: Advances and Applications in Bioinformatics and Chemistry
ISSN 1178-6949

Volume: 2011;
Issue: default;
Start page: 37;
Date: 2011;
Original page

ABSTRACT
Rui-Ru Ji1, Karl-Heinz Ott1, Roumyana Yordanova1, Robert E Bruccoleri21Applied Genomics, Research and Development, Bristol-Myers Squibb, Pennington, NJ, USA; 2Congenomics, Glastonbury, CT, USAAbstract: Gene set enrichment analysis for analyzing large profiling and screening experiments can reveal unifying biological schemes based on previously accumulated knowledge represented as “gene sets”. Most of the existing implementations use a fixed fold-change or P value cutoff to generate regulated gene lists. However, the threshold selection in most cases is arbitrary, and has a significant effect on the test outcome and interpretation of the experiment. We developed a new gene set enrichment analysis method, ie, FDR-FET, which dynamically optimizes the threshold choice and improves the sensitivity and selectivity of gene set enrichment analysis. The procedure translates experimental results into a series of regulated gene lists at multiple false discovery rate (FDR) cutoffs, and computes the P value of the overrepresentation of a gene set using a Fisher’s exact test (FET) in each of these gene lists. The lowest P value is retained to represent the significance of the gene set. We also implemented improved methods to define a more relevant global reference set for the FET. We demonstrate the validity of the method using a published microarray study of three protease inhibitors of the human immunodeficiency virus and compare the results with those from other popular gene set enrichment analysis algorithms. Our results show that combining FDR with multiple cutoffs allows us to control the error while retaining genes that increase information content. We conclude that FDR-FET can selectively identify significant affected biological processes. Our method can be used for any user-generated gene list in the area of transcriptome, proteome, and other biological and scientific applications.Keywords: gene set enrichment analysis, false discovery rate, Fisher’s exact test, microarray profiling, protease inhibitors

Tango Rapperswil
Tango Rapperswil

    
RPA Switzerland

RPA Switzerland

Robotic process automation