NCBI
Entrez PubMed Nucleotide Protein Genome Structure Map Viewer LocusLink UniGene OMIM
 Search for
  Limits  Preview/Index  History  Clipboard  Details     
About Entrez


HomoloGene
Home
Query Tips
Build Procedure
FTP Site

Genome Resources
Homo sapiens
Mus musculus
Rattus norvegicus
Danio rerio

HomoloGene is a system for automated detection of homologs among the annotated genes of several completely sequenced eukaryotic geneomes.
HomoloGene Release Statistics

Initial numbers of genes from complete genomes, numbers of genes placed in a homology group, and the numbers of groups for each species

HomoloGene Build 37.2
Species Number of Genes HomoloGene
  input grouped   Groups
H.sapiens 23,144 17,654   16,998
M.musculus 24,954 19,545   18,100
R.norvegicus 20,913 17,440   16,437
D.melanogaster 12,918 7,445   7,356
A.gambiae 12,012 7,580   7,251
C.elegans 19,109 3,171   2,768
S.pombe 4,947 2,863   2,824
S.cerevisiae 5,863 4,414   4,289
E.gossypii 4,726 3,924   3,917
N.crassa 10,079 5,446   5,439
M.grisea 11,109 5,743   5,449
A.thaliana 26,281 5,670   5,411
P.falciparum 5,222 1,168   1,153

Last updated on: 10/25/2004



We have recently adopted a new build procedure which makes use of amino acid sequence searching (blastp) to find more distant relationships, but still refers to the DNA sequence for computation of some of the statistics. The matching strategy is guided by the taxonomic tree, such that more closely related organisms are compared first. Moreover, HomoloGene entries now include paralogs in addition to orthologs.


Sources of Additional Information

HomoloGene entries have been augumented with homology and phenotype information drawn from the following sources.
Online Mendelian Inheritance in Man (OMIM)
Mouse Genome Informatics (MGI)
Zebrafish Information Network (ZFIN)
Saccharomyces Genome Database (SGD)
Clusters of Orthologous Groups (COG)
FlyBase
 
Related Resources

Entrez Genome

A collection of complete genome sequences that includes more than 1000 viruses and over hundred microbes
  Archaea
  Bacteria
  Eukaryota
  Viruses

Tax Plot
Three-way view of genome similarities



Compare

to

and


  COGs
Phylogenetic classification of proteins encoded in complete genomes.