You are viewing a Web site, archived on 19:52:18 Nov 05, 2004. It is now a Federal record managed by the National Archives and Records Administration.
External links, forms, and search boxes may not function within this collection. Note that this document was downloaded, and not saved because it was a duplicate of a previously captured version (07:01:49 Nov 05, 2004). HTTP headers presented here are from the original capture.
NCBI Structure logo
PubMed Entrez BLAST OMIM Books TaxBrowser Entrez Structure
  Search Entrez  for


Short summary


Detailed explanations

Linking to MMDB

direct WWW access to the MMDB server

Read about MMDB

Papers about MMDB


Taxonomy in MMDB

Cn3D v4.1

3D-structure viewer


Conserved Domain Database


Structure comparisons

VAST Search

Submit structure database searches


Research topics and staff

MMDB - Entrez's Structure Database

NCBI's Entrez includes a database of experimentally determined three-dimensional biomolecular structures. Most 3D structure data are obtained from X-ray crystallography and NMR spectroscopy; they provide a wealth of information on biological function, on mechanisms linked to the function, and on the evolutionary history of and relationships between macromolecules. Our goals in adding structure data to Entrez are to make this information easily accessible to biologists and to facilitate comparative analysis involving 3D structure.

NCBI's structure database is called MMDB (Molecular Modeling DataBase), and it is a subset of three-dimensional structures obtained from the Protein Data Bank (PDB), excluding theoretical models. MMDB is a database of ASN.1-formatted records. It was designed for flexibility, and as such, is capable of archiving conventional structural data as well as future descriptions of biomolecules, such as those generated by electron microscopy (surface models).

The structure database is considerably smaller than Entrez's protein or nucleotide databases, but a large fraction of all known protein sequences have homologs in this set, and one may often learn more about a protein by examining 3D structures of its homologs. Protein sequences from MMDB are extracted and available in the Entrez protein sequence database. They are linked to the 3D structures; therefore, it is possible to determine whether a protein sequence in Entrez has homologs among known structures by examining its "Related Sequences" or "Protein Neighbors" and checking whether this set has any "Structure Links".

How and when is MMDB updated?

The Protein Data Bank (PDB) is updated weekly, and major releases (also available on CD-Rom) are issued several times a year. The NCBI mirrors PDB for the purpose of making MMDB. Once a month the new entries are parsed and loaded into the PKB (Protein Knowledge Base) database.

The software that builds the PKB database has a comprehensive PDB parser. This parser is able to extract the sequences and citations from PDB records, in addition to the structural information. All these data elements are translated into ASN.1. An explicit chemical graph representing the connectivity of the structure based on its chemistry is created in PDB and written out into ASN.1 records. PDB data are checked for consistency in the purported chemistry, the sequence, and the 3D coordinates.

Entrez users may occasionally notice that the sequence of a PDB-derived entry differs slightly from the PDB file, because all non-standard or chemically modified residues, as judged by their 3D structure, are explicitly identified as such in MMDB. These changes to PDB data are intended to support computational applications, such as homology modeling and structure comparison.

The original models (coordinates) provided by PDB are preserved and translated into ASN.1. In addition, new models representing single-coordinate per-atom molecules (i.e., any multiple conformations for single atoms are removed) and alpha-carbon backbones are created. PKB also generates explicit bonding and secondary structure information in addition to any provided by PDB, and this information is used, for example, to create vector models for the purposes of structural comparison and alignment.

With each update the taxonomy is assigned to each chain of the new entries by a semiautomated method. When new structure data are added to the Entrez system, they get cross-indexed to nodes in the taxonomy hierarchy, to citations in PubMed, and to sequences derived from the structures.

With each update, structure neighbors for all the protein chains and their individual domains are calculated by comparing the 3D architectures to all the other previously determined structures. This set of relationships can be used to search for "Structure Neighbors" from the Structure Summary pages of each entry as well as from within Entrez. All of this makes MMDB a "value-added" structure database. In Entrez, powerful search and selection mechanisms are available. Structure data can be retrieved by organism (or higher node in the taxonomical hierarchy), by publication date, author names, or substance names, just to name a few.

The Structure Neighbors data are an invaluable tool in studying remote homologs and distant evolutionary relationships. Further software development will focus on easy accessibility to this wealth of biological data.

Searching MMDB:

The structure database may be queried directly, using specific fields such as author names, or text terms occurring anywhere in the structure description. Entry points for queries are the Search Bar at the top of all Structure Group WWW pages or the WWW-Entrez interface to the 3-D structure database.

Alternatively you can use a PDB 4-character code or a numerical MMDB-Id to retrieve structure summary pages directly:
PDB/MMDB Code(s):

Updated 07/06/04

Privacy statement


Help Desk NCBI NLM NIH Credits