Academic Journals Database
Disseminating quality controlled scientific knowledge

Identification of homologs in insignificant blast hits by exploiting extrinsic gene properties

Author(s): Boekhorst Jos | Snel Berend

Journal: BMC Bioinformatics
ISSN 1471-2105

Volume: 8
Issue: 1
Start page: 356
Date: 2007

Abstract

Background: Homology is a key concept in both evolutionary biology and genomics. Detecting homology is crucial in fields such as the functional annotation of protein sequences and the identification of taxon-specific genes. Basic homology searches are still frequently performed with pairwise search methods such as BLAST. More advanced methods that use sequence profiles have vastly improved the identification of homologous proteins. However, additional improvement could be made by exploiting sources of genomic information other than the primary sequence or tertiary structure.

Results: We test the hypothesis that two extrinsic gene properties, gene length and gene order, can help to differentiate spurious sequence similarity from homology in the gray zone. Shared gene order and similarity in size dramatically increase the chance that a query-hit pair is homologous: gray zone query-hit pairs of similar size and with conserved gene order are homologous in 99% of all cases, whereas query-hit pairs of different sizes and without gene order conservation are homologous in only 55% of cases.

Conclusion: We have shown that using gene length and gene order drastically improves the detection of homologs within the BLAST gray zone. Our findings suggest that the use of such extrinsic gene properties can also improve the performance of homology detection by more advanced methods, and our study thereby underscores the importance of true data integration for fully exploiting genomic information.
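The two extrinsic properties named in the Results can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Gene` record, the `max_ratio` cutoff of 1.25, and the use of shared neighbor family labels as a stand-in for conserved gene order are all assumptions made for illustration, since the abstract does not state how either property was measured.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    """Hypothetical record for one protein-coding gene."""
    name: str
    length: int            # protein length in residues
    neighbors: frozenset   # family labels of flanking genes (assumed annotation)


def similar_size(a: Gene, b: Gene, max_ratio: float = 1.25) -> bool:
    """True if the longer protein is at most max_ratio times the shorter.

    The 1.25 cutoff is an arbitrary illustrative value, not taken from the paper.
    """
    longer, shorter = max(a.length, b.length), min(a.length, b.length)
    return longer <= max_ratio * shorter


def conserved_order(a: Gene, b: Gene) -> bool:
    """True if the two genes share at least one flanking gene family.

    This is a simplified proxy for gene order conservation.
    """
    return bool(a.neighbors & b.neighbors)


def extrinsic_support(query: Gene, hit: Gene) -> tuple:
    """Return (similar_size, conserved_order) for a gray zone query-hit pair."""
    return similar_size(query, hit), conserved_order(query, hit)


query = Gene("q1", 300, frozenset({"famA", "famB"}))
close_hit = Gene("h1", 320, frozenset({"famB", "famC"}))
far_hit = Gene("h2", 900, frozenset({"famX"}))

print(extrinsic_support(query, close_hit))
print(extrinsic_support(query, far_hit))
```

Under these assumptions, a pair that satisfies both tests would fall into the category the abstract reports as homologous in 99% of cases, and a pair that satisfies neither into the 55% category.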
