Display COMPLETE DOCUMENT Scroll Up Scroll DOWN MORE! TOP

How can I search for promoter regions?

The GCG program Findpatterns can be used to find any pattern, and the GCG file tfsites.dat contains patterns from the Transcription Factor database. Sample session:
helix 75% findpatterns -data=genmoredata:tfsites.dat

FindPatterns identifies sequences that contain short patterns like
GAATTC or YRYRYRYR.  You can define the patterns ambiguously and allow
mismatches. You can provide the patterns in a file or simply type them
in from the terminal

 FINDPATTERNS in what sequence(s) ? aw778747.gb_est46

  What should I call the output file (* aw778747.find *) ?

     aw778747.gb_est46 len:        410     ....................................
...............................................................................
...............................................................................
..................................................

 FINDPATTERNS in what sequence(s) ?
      
     Total finds:             45
    Total length:            410
 Total sequences:              1
        CPU time:          01.63

     Output file: aw778747.find
The file tfsites.dat is in the GCG directory genmoredata. You can copy this to your own directory using the 'fetch' program. e.g.
helix% fetch tfsites.dat

Fetch copies GCG sequences or data files from the GCG database
into your directory or displays them on your terminal screen.

 tfsites.dat

You can then modify this file to retain only the sites of your interest.