What databases are available for Fasta searches?

The welcome notice when you start up GCG tells you what databases are available and when each one was last updated.

The databases can be accessed in groups and sub-groups. For example, bacterial sequences can be searched on their own as ba:* or gb_ba:*, or as part of Genbank (Genbank:*), GenbankPlus (GBPlus:*), or GenEMBLPlus (GEPlus:*). The database section of the GCG manual lists the short names for each protein and nucleic acid database. (On helix, you can also type 'genmanual' and go to 'Databases'.)

Genbank and EMBL exchange data nightly, so at any given time Genbank contains almost all of EMBL. It is therefore only necessary to keep one of the two databases current. On the helix systems we keep Genbank up to date, so the EMBL database on our system is vanishingly small. As a result, GenEMBLPlus is essentially the same as GenbankPlus, GenEMBL is essentially the same as Genbank, and so on.

To use any of these databases in a Fasta search, remember to type ':*' after the database name! For example, you can search the Tags library using 'Tags:*' or 'Gb_Tags:*'. The database specifications are case-insensitive.
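
If you build database specifications in a script before handing them to a search, a small helper can guard against forgetting the ':*' suffix. The following is only a sketch in Python: the function names fasta_db_spec and same_database are hypothetical and are not part of GCG. It only formats and compares strings; it does not run a search, and it does not know that two different short names (such as 'Tags' and 'Gb_Tags') refer to the same library.

```python
def fasta_db_spec(name: str) -> str:
    """Return a database specification of the form 'Name:*'.

    Hypothetical helper, not part of GCG: it only appends the ':*'
    suffix when the caller left it off.
    """
    name = name.strip()
    if not name:
        raise ValueError("database name is empty")
    if name.endswith(":*"):
        return name
    # Drop a lone trailing colon so 'Tags:' becomes 'Tags:*', not 'Tags::*'.
    return name.rstrip(":") + ":*"


def same_database(a: str, b: str) -> bool:
    """Compare two specifications, ignoring case and a missing ':*'.

    This does not resolve aliases: 'Tags' and 'Gb_Tags' compare as different.
    """
    return fasta_db_spec(a).lower() == fasta_db_spec(b).lower()


print(fasta_db_spec("Tags"))              # Tags:*
print(same_database("GB_BA", "gb_ba:*"))  # True
```

The helper leaves the name's capitalization untouched, since the specifications are case-insensitive and either form should be accepted.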