Display COMPLETE DOCUMENT Scroll Up Scroll DOWN MORE! TOP

How do I translate the coding regions of a Genbank entry?

  • Search for your protein by accession number or name in Lookup.
    In the 'Features:' field, type 'cds.
    In the 'Form of output list' field, select 'Fragments'. (Use the space bar on your keyboard to toggle options in this field).
  • Save the result in a file, which will contain the coding regions.
  • Then use 'translate' to translate them.

    Sample session:
    helix% lookup
    LookUp identifies sequences by name, accession number, author, organism,
    keyword, title, reference, feature, definition, length, or date.  The output
    is a list of sequences. 
    
    The LookUp program is experimental in this release--please look carefully at
    your results. 
    
     LOOKUP in what sequence libraries:
    
       a) swissprot
       b) pir
       c) embl
       d) genbank
       e) em_tags
       f) gb_tags
       g) gb_new
       h) genpept
       i) All libraries
     
       q) quit
    
     Please choose one or more (* i *): d
    
     Complete the query form below:
    
                     All text:  tryptophan synthase
                   Definition:
                       Author:
                      Keyword:
                Sequence name:
             Accession number:
                     Organism:
                    Reference:
                        Title:
                      Feature:  cds
      On or after (dd-mmm-yy):               On or before (dd-mmm-yy):
     Shortest sequence length:                Longest sequence length:
    
         Inter-field operator:  AND             Form of output list:  Fragments
    
     Press D to continue.
    
     Searching genbank
    
     32 features were found.
    
     Do you wish to:
    
       1) write out this list to a file
       2) preview the results
       3) refine the query
       4) choose different libraries
     
       q) quit
    
     Please choose one (* 1 *): 1
     What should I call the output file (* lookup.list *) ?  test.list
    
     ....
     32 features were written to "test.list"
    
     helix% translate @test.list
    Translate translates nucleotide sequences into peptide sequences.  
    
                 acctrpf          from    506 to   1147
                 acctrpf          from   1149 to   1466
                 bactrpab         from    179 to   1393
                 bactrpab         from   1374 to   2183
                 bactrsyb         from     31 to   1233
                 brltrp2          from      1 to    366
                 buhtrpb          from      1 to    676
                 buhtrpba         from      1 to    676
                 buhtrpbb         from      1 to    679
                 buhtrpbc         from      1 to    676
                 ccrtrpfba        from    181 to    450
                 ccrtrpfba        from    500 to   1159
                 ccrtrpfba        from   1182 to   2402
                 ccrtrpfba        from   2415 to   3242
                 mvotrpba         from      1 to    206
                 mvotrpba         from    304 to   1533
                 mvotrpba         from   1571 to   2425
                 mvotrpba         from   2460 to   2600
    (reverse of) mvotrpba         from   2666 to   2874
    (reverse of) pptrpab          from      1 to     18
                 pptrpab          from    129 to   1346
                 pptrpab          from   1343 to   2152
    (reverse of) psetrpbai        from     99 to    908
    (reverse of) psetrpbai        from    908 to   2134
                 psetrpbai        from   2245 to   3141
                 syctrpb          from    518 to   1756
                 ab003491         from      1 to   1413
                 athtrpb          from   1517 to   1858
                 athtrpb          from   2109 to   2465
                 athtrpb          from   2549 to   2991
                 athtrpb          from   3084 to   3178
                 athtrpb          from   3275 to   3450
                 athtrpsb         from    442 to    798
                 athtrpsb         from   1174 to   1530
                 athtrpsb         from   1606 to   2048
                 athtrpsb         from   2136 to   2230
                 athtrpsb         from   2322 to   2497
                 cynpltrnk        from    148 to    876
                 mzeorp1a         from      1 to   1170
                 mzeorp2a         from      3 to   1334
    
       Output files called "*.pep"