Academic Journals Database
Disseminating quality controlled scientific knowledge

ESMP: A high-throughput computational pipeline for mining SSR markers from ESTs

Author(s): Ranjan Sarmah | Jagajjit Sahu | Budheswar Dehury | Kishore Sarma | Smita Sahoo | Mousumi Sahu | Madhumita Barooah | Priyabrata Sen | Mahendra Kumar Modi

Journal: Bioinformation
ISSN 0973-2063

Volume: 8;
Issue: 4;
Start page: 206;
Date: 2012;
Original page

Keywords: Expressed Sequence Tag | Simple Sequence Repeats | ESMP | Single Nucleotide Polymorphism

With the advent of high-throughput sequencing technology, sequences from many genomes are being deposited to public databases at a brisk rate. Open access to large amount of expressed sequence tag (EST) data in the public databases has provided a powerful platform for simple sequence repeat (SSR) development in species where sequence information is not available. SSRs are markers of choice for their high reproducibility, abundant polymorphism and high inter-specific transferability. The mining of SSRs from ESTs requires different high-throughput computational tools that need to be executed individually which are computationally intensive and time consuming. To reduce the time lag and to streamline the cumbersome process of SSR mining from ESTs, we have developed a user-friendly, web-based EST-SSR pipeline “EST-SSR-MARKER PIPELINE (ESMP)”. This pipeline integrates EST pre-processing, clustering, assembly and subsequently mining of SSRs from assembled EST sequences. The mining of SSRs from ESTs provides valuable information on the abundance of SSRs in ESTs and will facilitate the development of markers for genetic analysis and related applications such as marker-assisted breeding.
Why do you need a reservation system?      Save time & money - Smart Internet Solutions