USDA Logo
ARS Logo

  Bovine Functional Genomics
Printer FriendlyPrintable version     Email this pageEmail this page
 
Search
 
 
This site only
  Advanced Search
 
Research
  Programs and Projects
 
 
  Display category headings
Research
Research >
Research Project: Development of Bioinformatics Tools for Livestock

Location: Bovine Functional Genomics Laboratory

Title: Development of Algorithms for Prediction and Validation of Polymorphisms in Polyploids (Soybean) Using Est Data

Authors
item Matukumalli, Lakshmi - GEORGE MASON UNIVERSITY
item Grefenstette, John - GEORGE MASON UNIVERSITY
item Van Tassell, Curtis - curt
item Choii, Ik-Young
item Cregan, Perry

Submitted to: International Genome Sequencing And Analysis Conference
Publication Acceptance Date: August 3, 2004
Publication Date: October 25, 2004
Citation: Matukumalli, L.K., Grefenstette, J.J., Van Tassell, C.P., Choii, I., Cregan, P.B. 2004. Development Of Algorithms For Prediction And Validation Of Polymorphisms In Polyploids (Soybean) Using Est Data. International Genome Sequencing And Analysis Conference.

Technical Abstract: Identification of polymorphisms in polyploid species with complex genomes is difficult, but they can help to characterize variations that confer disease resistance, improved quality, increased tolerance and increased productivity. Only Expressed sequence tags (EST) sequencing has been performed in these species to derive more information about the genes. In this study we used soybean as a model for studying polyploid genomes.Existing EST assemblies (e.g., TIGR gene index) were found to be not suitable for polymorphisms detection because they have not used sequence quality information and also they were built using a diploid gene model. We developed a gene model for computational analysis of polyploids that considered paralogs i.e., duplicate gene copies and alternate splice sites in analyzing EST data. We have developed two algorithmic methods for distinguishing the paralogs and predict polymorphisms. These predictions were experimentally tested using a high throughput software pipeline (SNP-PHAGE: SNP ' Pipeline for Haplotype analysis and GEnbank [dbSNP] submissions) that was developed as a part of this project. Approximately 6,000 expert verified polymorphisms were discovered using this pipeline. The bioinformatics tools developed in this project were generalized to be applicable to polyploid species like Wheat, Cotton, Canola, Corn, Potatoes, Alfalfa and are made available open source.

 
Project Team
Van Tassell, Curtis - Curt

Publications

Related National Programs
  Food Animal Production (101)

Related Projects
   Application of Bioinformatics to Livestock Genomes

 
ARS Home |  USDA |  Home | About Us | Research | Products & Services | People & Places  | News & Events | Partnering | Careers | Contact Us | Help |
Site Map |  Freedom of Information Act |  Statements & Disclaimers |  Employee Resources |  FirstGov |  White House