RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6088
         (289 letters)



>gnl|CDD|197673 smart00352, POU, Found in Pit-Oct-Unc transcription factors. 
          Length = 75

 Score =  144 bits (365), Expect = 3e-44
 Identities = 59/75 (78%), Positives = 63/75 (84%)

Query: 148 DETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNM 207
           D+ TD  ELE FAK FKQRRIKLGFTQ DVGLA+G LYG  FSQTTI RFEAL LSFKNM
Sbjct: 1   DDDTDPRELEAFAKQFKQRRIKLGFTQADVGLALGALYGPAFSQTTICRFEALQLSFKNM 60

Query: 208 CKLKPLLQKWLSDAD 222
           CKLKPLLQKWL +A+
Sbjct: 61  CKLKPLLQKWLEEAE 75


>gnl|CDD|189427 pfam00157, Pou, Pou domain - N-terminal to homeobox domain. 
          Length = 75

 Score =  140 bits (356), Expect = 9e-43
 Identities = 60/75 (80%), Positives = 67/75 (89%)

Query: 148 DETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNM 207
           D+TTDL+ELEQFAK FKQRRIKLG+TQ DVGLA+G LYG  FSQTTI RFEAL LSFKNM
Sbjct: 1   DDTTDLDELEQFAKEFKQRRIKLGYTQADVGLALGALYGPVFSQTTICRFEALQLSFKNM 60

Query: 208 CKLKPLLQKWLSDAD 222
           CKLKPLL+KWL +A+
Sbjct: 61  CKLKPLLEKWLEEAE 75


>gnl|CDD|200956 pfam00046, Homeobox, Homeobox domain. 
          Length = 57

 Score = 41.7 bits (99), Expect = 1e-05
 Identities = 15/39 (38%), Positives = 21/39 (53%)

Query: 245 RKKRTSIETSVRVALEKAFLQNPKPTSEEISVLADNLCM 283
           R+KRT+        LEK F +N  P++EE   LA  L +
Sbjct: 1   RRKRTTFTPEQLEELEKEFEKNRYPSAEEREELAKKLGL 39


>gnl|CDD|197696 smart00389, HOX, Homeodomain.  DNA-binding factors that are
           involved in the transcriptional regulation of key
           developmental processes.
          Length = 57

 Score = 37.6 bits (88), Expect = 3e-04
 Identities = 17/38 (44%), Positives = 21/38 (55%)

Query: 244 RRKKRTSIETSVRVALEKAFLQNPKPTSEEISVLADNL 281
           +R+KRTS        LEK F +NP P+ EE   LA  L
Sbjct: 1   KRRKRTSFTPEQLEELEKEFQKNPYPSREEREELAKKL 38


>gnl|CDD|238039 cd00086, homeodomain, Homeodomain;  DNA binding domains involved in
           the transcriptional regulation of key eukaryotic
           developmental processes; may bind to DNA as monomers or
           as homo- and/or heterodimers, in a sequence-specific
           manner.
          Length = 59

 Score = 36.1 bits (84), Expect = 0.001
 Identities = 16/37 (43%), Positives = 19/37 (51%)

Query: 245 RKKRTSIETSVRVALEKAFLQNPKPTSEEISVLADNL 281
           R+KRT         LEK F +NP P+ EE   LA  L
Sbjct: 1   RRKRTRFTPEQLEELEKEFEKNPYPSREEREELAKEL 37


>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
          Length = 438

 Score = 37.5 bits (88), Expect = 0.007
 Identities = 22/108 (20%), Positives = 42/108 (38%), Gaps = 8/108 (7%)

Query: 14  NHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSS-AANS 72
           +     L   +   L    L+  QQ L + Q  LQ  L++ +       +  S      +
Sbjct: 292 DQLARRLKFQSPERL----LAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQQQN 347

Query: 73  PTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKL---SPQS 117
           P     +AQ  L+   Q+      +QL++ +++     Q+L   SP +
Sbjct: 348 PQRRIERAQQRLEQLEQRLRRAMRRQLKRKRQRLEALAQQLEALSPLA 395



 Score = 31.3 bits (72), Expect = 0.56
 Identities = 23/114 (20%), Positives = 43/114 (37%), Gaps = 17/114 (14%)

Query: 36  LQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQ----VQQA 91
           L Q LQQ Q  L   +++ +      ++      A       P+     Q Q    +QQ 
Sbjct: 265 LLQRLQQLQQRLARAMRRRL----EQKRQRLDQLARRLKFQSPERLLAQQQQRLDRLQQR 320

Query: 92  VAQAAQ-QLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSNIRVPAHHIKQTPARL 144
           + +A + +L+  +++     Q+L  Q+        P   I      ++Q   RL
Sbjct: 321 LQRALERRLRLAKQRLERLSQRLQQQN--------PQRRIERAQQRLEQLEQRL 366



 Score = 30.9 bits (71), Expect = 0.71
 Identities = 18/79 (22%), Positives = 32/79 (40%), Gaps = 2/79 (2%)

Query: 32  ELSTLQQTL--QQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQ 89
            L  L + L  Q  +  L  Q Q+    Q   Q+        +   L   +Q   Q   Q
Sbjct: 290 RLDQLARRLKFQSPERLLAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQ 349

Query: 90  QAVAQAAQQLQQLQKQSHH 108
           + + +A Q+L+QL+++   
Sbjct: 350 RRIERAQQRLEQLEQRLRR 368


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 36.9 bits (86), Expect = 0.010
 Identities = 22/98 (22%), Positives = 39/98 (39%), Gaps = 10/98 (10%)

Query: 30  PSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQ 89
           P +   L   LQQ   +L+ QL+     ++ SQ  A +            A   L  +++
Sbjct: 137 PEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQEL------VALEGLAAELE 190

Query: 90  QAVAQAAQQLQQLQKQ----SHHHHQKLSPQSQQSGNT 123
           +   +   QL+QLQ++    S    QK    + Q+   
Sbjct: 191 EKQQELEAQLEQLQEKAAETSQERKQKRKEITDQAAKR 228


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 36.7 bits (85), Expect = 0.014
 Identities = 20/130 (15%), Positives = 26/130 (20%), Gaps = 11/130 (8%)

Query: 12  LQNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAAN 71
                  +   P L    P +L      LQQ     Q       +     Q         
Sbjct: 219 FLPAPSQAPAQPPL----PPQLPQQPPPLQQP----QFPGLSQQMPPPPPQPPQQQQQPP 270

Query: 72  SPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSNIR 131
            P   PP      Q      + Q         +Q         PQ QQ G          
Sbjct: 271 QPQAQPPPQN---QPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQL 327

Query: 132 VPAHHIKQTP 141
                   + 
Sbjct: 328 SQQQREALSQ 337



 Score = 36.3 bits (84), Expect = 0.016
 Identities = 19/104 (18%), Positives = 27/104 (25%), Gaps = 8/104 (7%)

Query: 41  QQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQ 100
             +    Q   +Q    Q     + + +    P  LP Q     Q Q      Q      
Sbjct: 201 PGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPP 260

Query: 101 QLQKQSHHHHQKLSPQSQQSGNTTPPSSNIRVPAHHIKQTPARL 144
           Q         Q+  PQ Q      PP +         +   A L
Sbjct: 261 QPP-----QQQQQPPQPQAQP---PPQNQPTPHPGLPQGQNAPL 296



 Score = 35.5 bits (82), Expect = 0.030
 Identities = 26/117 (22%), Positives = 33/117 (28%), Gaps = 6/117 (5%)

Query: 8   SQLLLQNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASS 67
           SQ   Q      LP        P      QQ         Q Q Q          QN  +
Sbjct: 224 SQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPT 283

Query: 68  S------AANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQ 118
                    N+P   P Q Q     Q  Q   +  Q  +QL + S    + LS +  
Sbjct: 284 PHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQEEA 340



 Score = 34.4 bits (79), Expect = 0.078
 Identities = 20/111 (18%), Positives = 32/111 (28%), Gaps = 3/111 (2%)

Query: 32  ELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQA 91
           ++ +L++   Q Q   QA        Q   Q      AA    G P Q   + Q      
Sbjct: 152 KMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHP 211

Query: 92  VAQAAQQLQQLQKQS---HHHHQKLSPQSQQSGNTTPPSSNIRVPAHHIKQ 139
                QQ      Q+        +L  Q         P  + ++P    + 
Sbjct: 212 EQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQP 262



 Score = 32.4 bits (74), Expect = 0.31
 Identities = 21/115 (18%), Positives = 24/115 (20%), Gaps = 9/115 (7%)

Query: 37  QQTLQQHQHSLQAQLQQFVLFQSASQQ---NASSSAANSPTGLPPQAQFFLQNQVQQAVA 93
           Q   QQ   +      Q  L     QQ               +PP      Q Q Q    
Sbjct: 213 QVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQP 272

Query: 94  QAAQQLQQLQKQSHHHHQ------KLSPQSQQSGNTTPPSSNIRVPAHHIKQTPA 142
           QA    Q          Q          Q Q       P    R P    +    
Sbjct: 273 QAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQL 327



 Score = 28.2 bits (63), Expect = 6.8
 Identities = 16/100 (16%), Positives = 29/100 (29%)

Query: 29  SPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQV 88
           + S   +  +     ++    Q    +   +   +     A  S       AQ  L  + 
Sbjct: 99  TFSPCQSGYEASTDPEYIPDLQPDPSLWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEE 158

Query: 89  QQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSS 128
            +A  Q  QQ  QL +       +  P  Q +     P  
Sbjct: 159 VEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPE 198


>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3.  Aip3p/Bud6p is a
           regulator of cell and cytoskeletal polarity in
           Saccharomyces cerevisiae that was previously identified
           as an actin-interacting protein. Actin-interacting
           protein 3 (Aip3p) localizes at the cell cortex where
           cytoskeleton assembly must be achieved to execute
           polarized cell growth, and deletion of AIP3 causes gross
           defects in cell and cytoskeletal polarity. Aip3p
           localization is mediated by the secretory pathway,
           mutations in early- or late-acting components of the
           secretory apparatus lead to Aip3p mislocalization.
          Length = 426

 Score = 36.2 bits (84), Expect = 0.018
 Identities = 23/107 (21%), Positives = 43/107 (40%), Gaps = 10/107 (9%)

Query: 21  PLPNLSSLSPSELSTLQQTLQQHQHSLQAQ---LQQFVLFQSASQQNASSSAANSPTGL- 76
            L  +      E+ TLQ  L + + +L++Q   +Q+    Q  S  N +  AA+    L 
Sbjct: 75  ELDEVKKHIDDEIDTLQNELDEVKQALESQREAIQRLKERQQNSAANIARPAASPSPVLA 134

Query: 77  PPQAQFFLQNQVQQAVAQAAQQLQQLQ------KQSHHHHQKLSPQS 117
              +   L N   +   +   +L+ LQ      +Q+H+       +S
Sbjct: 135 SSSSAISLANNPDKLNKEQRAELKSLQRELAVLRQTHNSFFTEIKES 181


>gnl|CDD|222221 pfam13560, HTH_31, Helix-turn-helix domain.  This domain is a
           helix-turn-helix domain that probably binds to DNA.
          Length = 63

 Score = 31.7 bits (73), Expect = 0.042
 Identities = 10/41 (24%), Positives = 20/41 (48%), Gaps = 6/41 (14%)

Query: 158 QFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFE 198
           +     ++ R + G TQ ++   +G       S++T+SR E
Sbjct: 1   ELGARLRRLRERAGLTQAELAERLG------VSRSTLSRIE 35


>gnl|CDD|238045 cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins.
           Prokaryotic DNA binding proteins belonging to the
           xenobiotic response element family of transcriptional
           regulators.
          Length = 58

 Score = 31.0 bits (71), Expect = 0.076
 Identities = 16/53 (30%), Positives = 26/53 (49%), Gaps = 8/53 (15%)

Query: 160 AKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEA--LNLSFKNMCKL 210
            +  K+ R + G TQ ++   +G       S++TISR E    N S + + KL
Sbjct: 1   GERLKELRKEKGLTQEELAEKLG------VSRSTISRIENGKRNPSLETLEKL 47


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 34.3 bits (78), Expect = 0.090
 Identities = 34/146 (23%), Positives = 54/146 (36%), Gaps = 15/146 (10%)

Query: 29  SPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQV 88
            P +    QQ  QQ Q  +  Q Q     Q  + Q         P    PQ Q   Q   
Sbjct: 754 QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQP-QYQQPQQPVAPQPQYQQPQQPVA 812

Query: 89  QQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQS---------GNTTP---PSSNIRVPAHH 136
            Q   Q  QQ    Q Q     Q ++PQ Q +         G++ P   P++ +  P+  
Sbjct: 813 PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPL--PSLD 870

Query: 137 IKQTPARLLDPDETTDLEELEQFAKT 162
           +   P   ++P +T  LE++ +  + 
Sbjct: 871 LLTPPPSEVEPVDTFALEQMARLVEA 896



 Score = 32.4 bits (73), Expect = 0.33
 Identities = 26/105 (24%), Positives = 31/105 (29%), Gaps = 6/105 (5%)

Query: 23  PNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQF 82
           P+    +P      Q            Q QQ V  Q   QQ         P    PQ Q 
Sbjct: 740 PHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQ------PQQPVAPQPQYQQ 793

Query: 83  FLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPS 127
             Q    Q   Q  QQ    Q Q     Q ++PQ Q      P +
Sbjct: 794 PQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA 838


>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
          Length = 1047

 Score = 34.0 bits (78), Expect = 0.097
 Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 3/64 (4%)

Query: 38  QTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQ 97
             L+Q + +L+ Q QQ     + + Q  +    + P GL          Q+QQ +AQ AQ
Sbjct: 780 TQLEQLKQNLENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTVT---VEQIQQELAQLAQ 836

Query: 98  QLQQ 101
           QL++
Sbjct: 837 QLRE 840



 Score = 32.5 bits (74), Expect = 0.29
 Identities = 28/93 (30%), Positives = 33/93 (35%), Gaps = 7/93 (7%)

Query: 31  SELSTLQQTLQQHQHSLQAQLQQF--VLFQSASQQNASSSAA----NSPTGLPPQAQFFL 84
           S+L TLQQ        LQ    QF   L  S      +  AA     + T L    Q  L
Sbjct: 731 SQLQTLQQQDVLEAQRLQKAQAQFDTALQASVFDDQQAFLAALLDEETLTQLEQLKQN-L 789

Query: 85  QNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQS 117
           +NQ QQA     Q  Q L +   H    L    
Sbjct: 790 ENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTV 822


>gnl|CDD|205103 pfam12844, HTH_19, Helix-turn-helix domain.  Members of this family
           contains a DNA-binding helix-turn-helix domain.
          Length = 64

 Score = 30.2 bits (69), Expect = 0.14
 Identities = 8/39 (20%), Positives = 16/39 (41%), Gaps = 6/39 (15%)

Query: 160 AKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFE 198
            +  K+ R   G +Q D+            S++ +S+ E
Sbjct: 1   GERLKELREAKGLSQEDLA------EKLGISKSYLSKIE 33


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 32.7 bits (74), Expect = 0.27
 Identities = 24/120 (20%), Positives = 32/120 (26%), Gaps = 5/120 (4%)

Query: 2   GASPNASQLLLQNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSAS 61
           G  P+    +             +   S    S     +   Q   Q Q        +  
Sbjct: 129 GMGPHQMSRVGTMQPGGQAG-GMMQQSSGQPQSQQPNQMGPQQGQAQGQAGG----MNQG 183

Query: 62  QQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSG 121
           QQ           G P       Q Q+QQ      QQ Q  Q Q    +Q+     QQ G
Sbjct: 184 QQGPVGQQQPPQMGQPGMPGGGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQG 243



 Score = 31.9 bits (72), Expect = 0.40
 Identities = 23/72 (31%), Positives = 25/72 (34%), Gaps = 2/72 (2%)

Query: 37  QQTLQQHQHSLQAQLQQFVLFQS--ASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQ 94
           QQ   Q Q  LQ Q QQ +  Q   A  Q           G+ PQ     Q QV      
Sbjct: 220 QQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQMQGGQMQVPMQQQP 279

Query: 95  AAQQLQQLQKQS 106
             QQ QQ Q   
Sbjct: 280 PQQQPQQSQLGM 291



 Score = 29.6 bits (66), Expect = 2.2
 Identities = 18/101 (17%), Positives = 27/101 (26%), Gaps = 4/101 (3%)

Query: 30  PSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQ----FFLQ 85
            S+L  L   +QQ     Q    Q +                   G+    Q        
Sbjct: 286 QSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPGAVPQGGQAVQQGVMSAGQQQLKQMKL 345

Query: 86  NQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPP 126
             ++       QQ QQ       H Q+++ Q  Q G     
Sbjct: 346 RNMRGQQQTQQQQQQQGGNHPAAHQQQMNQQVGQGGQMVAL 386



 Score = 27.7 bits (61), Expect = 8.1
 Identities = 22/84 (26%), Positives = 24/84 (28%)

Query: 37  QQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAA 96
                Q Q   Q   QQ    Q   Q              P  AQ  +  Q Q       
Sbjct: 204 GGGQGQMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQP 263

Query: 97  QQLQQLQKQSHHHHQKLSPQSQQS 120
           QQ+Q  Q Q     Q    Q QQS
Sbjct: 264 QQMQGGQMQVPMQQQPPQQQPQQS 287


>gnl|CDD|197775 smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins. 
          Length = 56

 Score = 29.4 bits (67), Expect = 0.27
 Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 6/35 (17%)

Query: 164 KQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFE 198
           K+ R + G TQ ++   +G       S++T+SR E
Sbjct: 3   KELREEKGLTQEELAEKLGV------SRSTLSRIE 31


>gnl|CDD|182828 PRK10907, PRK10907, intramembrane serine protease GlpG;
          Provisional.
          Length = 276

 Score = 31.9 bits (73), Expect = 0.28
 Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 23/71 (32%)

Query: 39 TLQQHQHS------------LQAQLQQFVL------FQSASQQNASSSAANSPTGLPPQA 80
          T+QQH  S            ++A+L +F+       + +AS Q+  +++     GL  + 
Sbjct: 28 TIQQHNQSDIWLADESQAERVRAELARFLENPADPRYLAASWQSGHTNS-----GLRYRR 82

Query: 81 QFFLQNQVQQA 91
            FL    ++A
Sbjct: 83 FPFLATLRERA 93


>gnl|CDD|216879 pfam02112, PDEase_II, cAMP phosphodiesterases class-II. 
          Length = 323

 Score = 32.1 bits (73), Expect = 0.33
 Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 119 QSGNTTPPSSNIRVPAHHIKQTPARLLDPDETTDLEELEQFA 160
            + +T+ P   + V   H+K  PA   +P   T LEEL Q  
Sbjct: 268 LTTSTSQPLLGLNVIVTHVKSPPADPNNP-RDTILEELNQLN 308


>gnl|CDD|227863 COG5576, COG5576, Homeodomain-containing transcription factor
           [Transcription].
          Length = 156

 Score = 31.3 bits (71), Expect = 0.34
 Identities = 18/93 (19%), Positives = 32/93 (34%), Gaps = 8/93 (8%)

Query: 191 QTTISRFEALNLSFKNMCKLKPLLQKWLSDADSTTANPSCLSNPLTTPEAIGRRRKKRTS 250
              +S  E+  +      K      K   + ++  +          +     +RR+  T 
Sbjct: 6   MQPLSALESQRMPQIRTIKT----TKNKREVEAADSEMKLERKQDGSSPPKSKRRRT-TD 60

Query: 251 IETSVRVALEKAFLQNPKPTSEEISVLADNLCM 283
            +  V   LE+ F  NP P+S     L+  L M
Sbjct: 61  EQLMV---LEREFEINPYPSSITRIKLSLLLNM 90


>gnl|CDD|201759 pfam01381, HTH_3, Helix-turn-helix.  This large family of DNA
           binding helix-turn helix proteins includes Cro and CI.
           Within the Neisseria gonorrhoeae phage associated
           protein NGO0477, the full protein fold incorporates a
           helix-turn-helix motif, but the function of this member
           is unlikely to be that of a DNA-binding regulator, the
           function of most other members, so is not necessarily
           characteristic of the whole family.
          Length = 55

 Score = 28.3 bits (64), Expect = 0.59
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 8/49 (16%)

Query: 164 KQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEA--LNLSFKNMCKL 210
           K+ R +LG +Q ++   +G       S++TIS+ E      S + + KL
Sbjct: 2   KELREELGLSQEELAEKLGV------SRSTISKIENGKREPSLETLKKL 44


>gnl|CDD|234937 PRK01294, PRK01294, lipase chaperone; Provisional.
          Length = 336

 Score = 30.8 bits (70), Expect = 0.82
 Identities = 29/96 (30%), Positives = 42/96 (43%), Gaps = 9/96 (9%)

Query: 12  LQNHQLASLPLPNLSSLSPSELSTLQQTL-QQHQHSLQAQLQQFVLFQSASQQNASSSAA 70
            Q + L  L +    SLS ++ +     L  Q    L+A LQ+    Q+  QQ A   A+
Sbjct: 187 YQRYALERLRIAQDPSLSDAQKAARLAALEAQLPEDLRAALQESQRQQALLQQLAQLQAS 246

Query: 71  NSPTGLPPQAQFFLQNQVQQAV-AQAAQQLQQLQKQ 105
               G  PQ          Q V  +AAQ+L+QL +Q
Sbjct: 247 ----GASPQELR---LMRAQLVGPEAAQRLEQLDQQ 275


>gnl|CDD|132242 TIGR03198, pucE, xanthine dehydrogenase E subunit.  This gene has
           been characterized in B. subtilis as the Iron-sulfur
           cluster binding-subunit of xanthine dehydrogenase
           (pucE), acting in conjunction with pucC, the FAD-binding
           subunit and pucD, the molybdopterin binding subunit. The
           more common XDH complex (GenProp0640) includes the xdhA
           gene as the Fe-S cluster binding component.
          Length = 151

 Score = 29.8 bits (67), Expect = 0.88
 Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 1/27 (3%)

Query: 257 VALEKAFLQNPKPTSEEIS-VLADNLC 282
           VAL+  F + P+P+ E++   L+ NLC
Sbjct: 107 VALKALFRETPQPSDEDMEEGLSGNLC 133


>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
           SFB2/subunit SFB3 [Intracellular trafficking and
           secretion].
          Length = 861

 Score = 30.9 bits (70), Expect = 0.97
 Identities = 28/129 (21%), Positives = 44/129 (34%), Gaps = 17/129 (13%)

Query: 40  LQQHQHSLQAQLQQFV------LFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVA 93
           + QH+  +  Q Q  V        +SA    A ++ +    G+PP     LQ Q ++ + 
Sbjct: 1   MSQHKKGVYPQAQSQVHTGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQID 60

Query: 94  QAAQQLQQLQKQS---------HHHHQKLSPQSQQSG--NTTPPSSNIRVPAHHIKQTPA 142
           QAA  +      +             QK S     S    T P  +N  VP    +  P 
Sbjct: 61  QAATAMHNTGANNPAPSVMSPAFQSQQKFSSPYGGSMADGTAPKPTNPLVPVDLFEDQPP 120

Query: 143 RLLDPDETT 151
            + D     
Sbjct: 121 PISDLFLPP 129


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
           gram-negative Gammaproteobacteria such as Pseudomonas
           syringae of tomato and the fire blight plant pathogen
           Erwinia amylovora, amongst others. It is an essential
           pathogenicity factor of approximately 198 kDa. Its
           injection into the host-plant is dependent upon the
           bacterial type III or Hrp secretion system. The family
           is long and carries a number of predicted functional
           regions, including an ERMS or endoplasmic reticulum
           membrane retention signal at both the C- and the
           N-termini, a leucine-zipper motif from residues 539-560,
           and a nuclear localisation signal at 1358-1361. this
           conserved AvrE-family of effectors is among the few that
           are required for full virulence of many phytopathogenic
           pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 30.9 bits (70), Expect = 0.98
 Identities = 19/114 (16%), Positives = 33/114 (28%), Gaps = 12/114 (10%)

Query: 58  QSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHH----HQKL 113
              + Q  ++ +A +PTGL        Q   +    +A+  L    K++        QK 
Sbjct: 10  TKTAVQPEATPSAGAPTGL--------QQSSESPTQRASHSLASEGKKNRKKMPKVFQKS 61

Query: 114 SPQSQQSGNTTPPSSNIRVPAHHIKQTPARLLDPDETTDLEELEQFAKTFKQRR 167
           S   Q         +         +    R L      D E     +    +R 
Sbjct: 62  SAPRQIQAAPPQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPESSPSARRL 115


>gnl|CDD|213767 TIGR03070, couple_hipB, transcriptional regulator, y4mF family.
           Members of this family belong to a clade of
           helix-turn-helix DNA-binding proteins, among the larger
           family pfam01381 (HTH_3; Helix-turn-helix). Members are
           similar in sequence to the HipB protein of E. coli.
           Genes for members of the seed alignment for this protein
           family were found to be closely linked to genes encoding
           proteins related to HipA. The HibBA operon appears to
           have some features in common with toxin-antitoxin
           post-segregational killing systems.
          Length = 58

 Score = 28.0 bits (63), Expect = 1.0
 Identities = 9/26 (34%), Positives = 12/26 (46%)

Query: 157 EQFAKTFKQRRIKLGFTQGDVGLAMG 182
           +Q     + RR  LG TQ D+    G
Sbjct: 1   KQIGILVRARRKALGLTQADLADLAG 26


>gnl|CDD|233592 TIGR01834, PHA_synth_III_E, poly(R)-hydroxyalkanoic acid synthase,
           class III, PhaE subunit.  This model represents the PhaE
           subunit of the heterodimeric class (class III) of
           polymerase for poly(R)-hydroxyalkanoic acids (PHAs),
           carbon and energy storage polymers of many bacteria. The
           most common PHA is polyhydroxybutyrate but about 150
           different constituent hydroxyalkanoic acids (HAs) have
           been identified in various species. This model must be
           designated subfamily to indicate the heterogeneity of
           PHAs [Cellular processes, Adaptations to atypical
           conditions, Fatty acid and phospholipid metabolism,
           Biosynthesis].
          Length = 320

 Score = 30.3 bits (68), Expect = 1.0
 Identities = 22/90 (24%), Positives = 28/90 (31%), Gaps = 1/90 (1%)

Query: 18  ASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSA-ANSPTGL 76
           A  P P + S      S  QQ LQQ Q   +    QF  + S   Q A+  A       L
Sbjct: 27  AQAPPPWMGSADQWFESLPQQALQQQQFYGELLKLQFEAWFSLWPQFANGFAPGAVQGYL 86

Query: 77  PPQAQFFLQNQVQQAVAQAAQQLQQLQKQS 106
             Q          QA+        Q   + 
Sbjct: 87  QLQTWIEQYEATTQALQGDMDGQGQCYIKE 116


>gnl|CDD|102369 PRK06424, PRK06424, transcription factor; Provisional.
          Length = 144

 Score = 29.5 bits (66), Expect = 1.1
 Identities = 15/69 (21%), Positives = 27/69 (39%), Gaps = 5/69 (7%)

Query: 110 HQKLSPQSQQSGNTTPPSSNIRVPAHHI-KQTPARLLDPDETTDLEELEQFAKTFKQRRI 168
           H K     ++      P   I VP +    +   +        DL+ +E +A+  K  R 
Sbjct: 39  HNKFKE-VKEDIKVKLPEKKIIVPTYKKAYKKYKK---KASDEDLDIVEDYAELVKNARE 94

Query: 169 KLGFTQGDV 177
           +L  +Q D+
Sbjct: 95  RLSMSQADL 103


>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein. 
          Length = 122

 Score = 29.1 bits (66), Expect = 1.3
 Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 6/82 (7%)

Query: 38  QTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQN------QVQQA 91
              QQ     + +L++   ++   +Q  S S            Q F+        Q QQ 
Sbjct: 8   AEAQQELEQAEEKLEELQQYRLEYRQQLSGSGQGISAAELRNYQAFISALDEAIAQQQQE 67

Query: 92  VAQAAQQLQQLQKQSHHHHQKL 113
           + QA +Q++Q ++Q    +Q+ 
Sbjct: 68  LEQAEKQVEQAREQWQEANQER 89


>gnl|CDD|225251 COG2376, DAK1, Dihydroxyacetone kinase [Carbohydrate transport and
           metabolism].
          Length = 323

 Score = 29.9 bits (68), Expect = 1.4
 Identities = 9/16 (56%), Positives = 9/16 (56%)

Query: 174 QGDVGLAMGKLYGNDF 189
            G  GLA G  YG DF
Sbjct: 94  LGKGGLASGPNYGGDF 109


>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631).  The
           members of this family are sequences derived from a
           group of hypothetical proteins expressed by certain
           bacterial species. The region concerned is approximately
           440 amino acid residues in length.
          Length = 729

 Score = 30.3 bits (69), Expect = 1.4
 Identities = 22/115 (19%), Positives = 34/115 (29%), Gaps = 12/115 (10%)

Query: 18  ASLPLPNLSSLSPSELSTLQQTLQQHQHSLQA--QLQQFVLFQSASQQNASSSAANSPTG 75
            S   P  S  S  + ++ Q   +  +    A   L       SA+    +  A  S   
Sbjct: 233 RSQAPPAASGPSTDDAASGQGGQESAEALFAALQGLLAQARSSSAAPAAPAGQAPGSAQP 292

Query: 76  LPPQAQFFLQNQVQQAVAQAA----------QQLQQLQKQSHHHHQKLSPQSQQS 120
           L  +      + +QQ     A          + LQQL  +      K     QQ 
Sbjct: 293 LSTRELLAALSSLQQQAPALALGEPADFDLRELLQQLLSRLSSEDGKDRALGQQD 347



 Score = 27.6 bits (62), Expect = 8.2
 Identities = 23/102 (22%), Positives = 37/102 (36%), Gaps = 8/102 (7%)

Query: 59  SASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQ 118
             SQ   ++S  ++      Q       +  +A+  A Q L    + S       +P  Q
Sbjct: 232 PRSQAPPAASGPSTDDAASGQGG----QESAEALFAALQGLLAQARSS--SAAPAAPAGQ 285

Query: 119 QSGNTTPPSSN--IRVPAHHIKQTPARLLDPDETTDLEELEQ 158
             G+  P S+   +   +   +Q PA  L      DL EL Q
Sbjct: 286 APGSAQPLSTRELLAALSSLQQQAPALALGEPADFDLRELLQ 327


>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
           recombination, and repair].
          Length = 440

 Score = 29.5 bits (67), Expect = 1.9
 Identities = 15/88 (17%), Positives = 34/88 (38%), Gaps = 7/88 (7%)

Query: 33  LSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAV 92
           LS  QQ L +    L+  L+  +  +    +  +            + Q  LQ   ++  
Sbjct: 308 LSEQQQRLDELAIRLRRALENQLALKKQRLERLTQRLNPQIQ----RQQQRLQQLERRLD 363

Query: 93  AQAAQQLQQLQKQSHHHHQKL---SPQS 117
               +QL++ +++     ++L   SP +
Sbjct: 364 KALRRQLKRKRERLEALVEQLESLSPLA 391


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
           family of proteins are found in large numbers in the
           Trichomonas vaginalis proteome. The function of this
           protein is unknown.
          Length = 422

 Score = 29.6 bits (66), Expect = 1.9
 Identities = 19/79 (24%), Positives = 29/79 (36%), Gaps = 8/79 (10%)

Query: 41  QQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQ 100
            +H+H+ +   Q  V  Q+ +QQ    + A          Q   Q  VQ A     Q   
Sbjct: 195 HRHRHAPKPTQQPTV--QNPAQQPTVQNPA------QQPQQQPQQQPVQPAQQPTPQNPA 246

Query: 101 QLQKQSHHHHQKLSPQSQQ 119
           Q   Q+   H++   Q  Q
Sbjct: 247 QQPPQTEQGHKRSREQGNQ 265


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins
           in this family for which functions are known are flap
           endonucleases that generate the 3' incision next to DNA
           damage as part of nucleotide excision repair. This
           family is related to many other flap endonuclease
           families including the fen1 family. This family is based
           on the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1034

 Score = 29.5 bits (66), Expect = 2.3
 Identities = 27/140 (19%), Positives = 48/140 (34%), Gaps = 19/140 (13%)

Query: 38  QTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSP-------TGLPPQAQFFLQNQVQQ 90
            T   H+   +A     +   + S   +  +  +         T  P  ++F L +Q + 
Sbjct: 476 VTSSGHEAVPKAVQSLLLGATNDSPIPSEFTILDRKSELSIERTVKPVSSEFGLPSQRED 535

Query: 91  AVAQAAQQLQQLQKQSHH------------HHQKLSPQSQQSGNTTPPSSNIRVPAHHIK 138
            +A   +  Q LQ  S H               K +  +  S   T  S N  +P+    
Sbjct: 536 KLAIPTEGTQNLQGISDHPEQFEFQNELSPLETKNNESNLSSDAETEGSPNPEMPSWSSV 595

Query: 139 QTPARLLDPDETTDLEELEQ 158
             P+  LD  ETT+    ++
Sbjct: 596 TVPSEALDNYETTNPSNAKE 615


>gnl|CDD|99778 cd00322, FNR_like, Ferredoxin reductase (FNR), an FAD and NAD(P)
           binding protein, was intially identified as a
           chloroplast reductase activity, catalyzing the electron
           transfer from reduced iron-sulfur protein ferredoxin to
           NADP+ as the final step in the electron transport
           mechanism of photosystem I. FNR transfers electrons from
           reduced ferredoxin to FAD (forming FADH2 via a
           semiquinone intermediate) and then transfers a hydride
           ion to convert NADP+ to NADPH. FNR has since been shown
           to utilize a variety of electron acceptors and donors
           and has a variety of physiological functions including
           nitrogen assimilation, dinitrogen fixation, steroid
           hydroxylation, fatty acid metabolism, oxygenase
           activity, and methane assimilation in many organisms.
           FNR has an NAD(P)-binding sub-domain of the alpha/beta
           class and a discrete (usually N-terminal) flavin
           sub-domain which vary in orientation with respect to the
           NAD(P) binding domain. The N-terminal moeity may contain
           a flavin prosthetic group (as in flavoenzymes) or use
           flavin as a substrate. Because flavins such as FAD can
           exist in oxidized, semiquinone (one- electron reduced),
           or fully reduced hydroquinone forms, FNR can interact
           with one and 2 electron carriers. FNR has a strong
           preference for NADP(H) vs NAD(H).
          Length = 223

 Score = 29.0 bits (65), Expect = 2.4
 Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 4/57 (7%)

Query: 124 TPPSSNIRVPAHHIKQTPARLL----DPDETTDLEELEQFAKTFKQRRIKLGFTQGD 176
           TP  S +R  A         LL     P +   L+ELE+ AK     R+ L  ++  
Sbjct: 110 TPFRSMLRHLAADKPGGEITLLYGARTPADLLFLDELEELAKEGPNFRLVLALSRES 166


>gnl|CDD|239723 cd03754, proteasome_alpha_type_6, proteasome_alpha_type_6. The 20S
           proteasome, multisubunit proteolytic complex, is the
           central enzyme of nonlysosomal protein degradation in
           both the cytosol and nucleus. It is composed of 28
           subunits arranged as four homoheptameric rings that
           stack on top of one another forming an elongated
           alpha-beta-beta-alpha cylinder with a central cavity.
           The proteasome alpha and beta subunits are members of
           the N-terminal nucleophile (Ntn)-hydrolase superfamily.
           Their N-terminal threonine residues are exposed as a
           nucleophile in peptide bond hydrolysis. Mammals have 7
           alpha and 7 beta proteasome subunits while archaea have
           one of each.
          Length = 215

 Score = 28.7 bits (65), Expect = 2.6
 Identities = 18/84 (21%), Positives = 34/84 (40%), Gaps = 18/84 (21%)

Query: 138 KQTPARLLDPDETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRF 197
           K+ P +L+DP   T L  +                   ++G  M  +  +  SQ   +R+
Sbjct: 47  KKVPDKLIDPSTVTHLFRITD-----------------EIGCVMTGMIADSRSQVQRARY 89

Query: 198 EALNLSFKNMCKLKP-LLQKWLSD 220
           EA    +K   ++   +L K ++D
Sbjct: 90  EAAEFKYKYGYEMPVDVLAKRIAD 113


>gnl|CDD|182947 PRK11073, glnL, nitrogen regulation protein NR(II); Provisional.
          Length = 348

 Score = 29.3 bits (66), Expect = 2.7
 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 3  ASPNASQLLLQNHQ-LASLPLPNLSSLSPSELSTLQQTLQQHQ 44
          A+P A QLL Q+ + L   PLP L S     +  ++++LQ  Q
Sbjct: 32 ANPAAQQLLAQSSRKLFGTPLPELLSYFSLNIELMRESLQAGQ 74


>gnl|CDD|217789 pfam03915, AIP3, Actin interacting protein 3. 
          Length = 424

 Score = 29.2 bits (66), Expect = 2.8
 Identities = 18/92 (19%), Positives = 34/92 (36%), Gaps = 13/92 (14%)

Query: 32  ELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQA 91
           ++ +L+Q L   Q ++  +L       S  QQN +   A      PP +     +     
Sbjct: 93  KIKSLKQALDDQQSTILQRL-------SELQQNTAKEVARPAAAPPPSSLPNSSSSPDSI 145

Query: 92  VAQAAQQLQQLQ------KQSHHHHQKLSPQS 117
                ++LQ L+      +Q+H   +     S
Sbjct: 146 NPAGLKELQSLRRELAVLRQTHSDFKSEIQAS 177


>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
           dihydrolipoamide succinyltransferase (E2 component).
           This model describes the TCA cycle 2-oxoglutarate system
           E2 component, dihydrolipoamide succinyltransferase. It
           is closely related to the pyruvate dehydrogenase E2
           component, dihydrolipoamide acetyltransferase. The seed
           for this model includes mitochondrial and Gram-negative
           bacterial forms. Mycobacterial candidates are highly
           derived, differ in having and extra copy of the
           lipoyl-binding domain at the N-terminus. They score
           below the trusted cutoff, but above the noise cutoff and
           above all examples of dihydrolipoamide acetyltransferase
           [Energy metabolism, TCA cycle].
          Length = 403

 Score = 28.9 bits (65), Expect = 3.4
 Identities = 26/139 (18%), Positives = 46/139 (33%), Gaps = 29/139 (20%)

Query: 60  ASQQNASSSAANSPTGLPPQAQFFL-QNQVQQAVAQAAQQLQQLQKQS--HHHHQKLSPQ 116
           A+   A+ +AA +   L P A+    ++ +  +         ++ K+          S Q
Sbjct: 97  AASAAAAPTAAANRPSLSPAARRLAKEHGIDLSAVPGTGVTGRVTKEDIIKKTEAPASAQ 156

Query: 117 SQQSGNTTP-----PSSNIRVPAHHIKQTPA-RLLDPDETT------------DLEEL-- 156
                              RV    ++Q  A RL +   +T             + EL  
Sbjct: 157 QPAPAAAAKAPANFTRPEERVKMTRLRQRIAERLKEAQNSTAMLTTFNEVDMSAVMELRK 216

Query: 157 ---EQFAKTFKQRRIKLGF 172
              E+F    K+  +KLGF
Sbjct: 217 RYKEEF---EKKHGVKLGF 232


>gnl|CDD|132237 TIGR03193, 4hydroxCoAred, 4-hydroxybenzoyl-CoA reductase, gamma
           subunit.  4-hydroxybenzoyl-CoA reductase converts
           4-hydroxybenzoyl-CoA to benzoyl-CoA, a common
           intermediate in the degradation of aromatic compounds.
           This protein family represents the gamma chain of this
           three-subunit enzyme.
          Length = 148

 Score = 27.9 bits (62), Expect = 3.7
 Identities = 12/27 (44%), Positives = 17/27 (62%), Gaps = 1/27 (3%)

Query: 257 VALEKAFLQNPKPTSEEI-SVLADNLC 282
           +A E    +NP P+ +EI + LA NLC
Sbjct: 106 MAAEALLRRNPSPSRDEIRAALAGNLC 132


>gnl|CDD|168934 PRK07394, PRK07394, hypothetical protein; Provisional.
          Length = 342

 Score = 28.3 bits (64), Expect = 4.2
 Identities = 15/76 (19%), Positives = 34/76 (44%), Gaps = 10/76 (13%)

Query: 32  ELSTLQQTLQQHQHSLQAQ---LQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQV 88
              + ++ L+Q Q +L  +   L Q +++        +  +++   G+  +A+  L +  
Sbjct: 274 PWESTEEWLEQAQAALNGEPGPLTQALIWNGGFYLWRAGISSSLEEGIE-KAEELLNSG- 331

Query: 89  QQAVAQAAQQLQQLQK 104
                +A Q+LQQL  
Sbjct: 332 -----KALQKLQQLIA 342


>gnl|CDD|216112 pfam00773, RNB, RNB domain.  This domain is the catalytic domain of
           ribonuclease II.
          Length = 322

 Score = 28.4 bits (64), Expect = 4.8
 Identities = 9/38 (23%), Positives = 17/38 (44%)

Query: 134 AHHIKQTPARLLDPDETTDLEELEQFAKTFKQRRIKLG 171
              + +      +     DLE L + AK  +++R+K G
Sbjct: 127 VDELLEGKPSEREAPVADDLELLYELAKALRKKRLKRG 164


>gnl|CDD|221084 pfam11336, DUF3138, Protein of unknown function (DUF3138).  This
          family of proteins with unknown function appear to be
          restricted to Proteobacteria.
          Length = 514

 Score = 28.3 bits (63), Expect = 4.9
 Identities = 23/87 (26%), Positives = 38/87 (43%), Gaps = 7/87 (8%)

Query: 17 LASLPLPNLSSLSPSELSTLQ---QTLQQHQHSLQAQL--QQFVLFQSASQQNASSSAAN 71
            +LP    ++   S++  LQ     LQQ  + L+A L  +       A  Q+A+++AA 
Sbjct: 11 AGALPGAAAAASDASQIKALQAQLTALQQQVNELRAALAAKPAAAGGGAKIQSAAAAAAA 70

Query: 72 SPTGLPPQAQFFLQNQVQQAVAQAAQQ 98
          +P+     A     + V Q   Q A Q
Sbjct: 71 APSSDAAAA--LTNDDVTQMREQIANQ 95


>gnl|CDD|147759 pfam05782, ECM1, Extracellular matrix protein 1 (ECM1).  This
           family consists of several eukaryotic extracellular
           matrix protein 1 (ECM1) sequences. ECM1 has been shown
           to regulate endochondral bone formation, stimulate the
           proliferation of endothelial cells and induce
           angiogenesis. Mutations in the ECM1 gene can cause
           lipoid proteinosis, a disorder which causes generalised
           thickening of skin, mucosae and certain viscera.
           Classical features include beaded eyelid papules and
           laryngeal infiltration leading to hoarseness.
          Length = 419

 Score = 28.4 bits (63), Expect = 5.4
 Identities = 18/86 (20%), Positives = 28/86 (32%)

Query: 58  QSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQS 117
           QS  Q   S           P       ++   AV Q A  LQ+         ++   + 
Sbjct: 76  QSEVQPPPSPEDIPVYEEDWPTFLNPNVDKAGPAVPQEAIPLQKELPPPQVPIEQKEVKP 135

Query: 118 QQSGNTTPPSSNIRVPAHHIKQTPAR 143
               + +PP      PA H +Q  +R
Sbjct: 136 APLADQSPPEPESWNPAQHCQQGRSR 161


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 28.4 bits (64), Expect = 6.2
 Identities = 16/91 (17%), Positives = 27/91 (29%), Gaps = 5/91 (5%)

Query: 31  SELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQ 90
             LS L+Q L+Q Q   +  L +F   +   +                     L   V +
Sbjct: 520 MRLSELEQRLRQQQ-RAERLLAEFC--KRLGKNLDDEDELEQLQEELEARLESLSESVSE 576

Query: 91  AVAQAAQQLQQLQ--KQSHHHHQKLSPQSQQ 119
           A  +     QQL+  +         +P    
Sbjct: 577 ARERRMALRQQLEQLQARIQRLAARAPAWLA 607



 Score = 27.6 bits (62), Expect = 9.7
 Identities = 20/89 (22%), Positives = 39/89 (43%), Gaps = 2/89 (2%)

Query: 30   PSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQ 89
            P +   L+Q  QQ Q + +   QQ        Q+ A  S  ++   L   +   L  +++
Sbjct: 934  PEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRAHFSYEDAAEMLAKNSD--LNEKLR 991

Query: 90   QAVAQAAQQLQQLQKQSHHHHQKLSPQSQ 118
            Q + QA Q+  + ++Q      +L+  +Q
Sbjct: 992  QRLEQAEQERTRAREQLRQAQAQLAQYNQ 1020


>gnl|CDD|234630 PRK00095, mutL, DNA mismatch repair protein; Reviewed.
          Length = 617

 Score = 27.9 bits (63), Expect = 6.7
 Identities = 11/103 (10%), Positives = 30/103 (29%), Gaps = 2/103 (1%)

Query: 34  STLQQTLQQHQHSLQAQLQQFVL--FQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQA 91
             +Q+ L Q      A     VL   +        +    S +  P  +      +  + 
Sbjct: 319 QAIQEALAQSGLIPAAAGANQVLEPAEPEPLPLQQTPLYASGSSPPASSPSSAPPEQSEE 378

Query: 92  VAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSNIRVPA 134
             + +   +   + +    +  +  S ++    P ++     A
Sbjct: 379 SQEESSAEKNPLQPNASQSEAAAAASAEAAAAAPAAAPEPAEA 421


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
           This model describes a conserved domain found in surface
           proteins of a number of Firmutes. Many members have
           LPXTG C-terminal anchoring motifs and a substantial
           number have the KxYKxGKxW putative sorting signal at the
           N-terminus. The tetracycline resistance plasmid pCF10 in
           Enterococcus faecalis promotes conjugal plasmid transfer
           in response to sex pheromones, but PgrA/Sec10 encoded by
           that plasmid, a member of this family, specifically
           inhibits the ability of cells to receive homologous
           plasmids. The phenomenon is called surface exclusion.
          Length = 356

 Score = 27.8 bits (62), Expect = 7.1
 Identities = 18/88 (20%), Positives = 28/88 (31%), Gaps = 7/88 (7%)

Query: 24  NLSSLSPSELSTLQQTLQQHQHSLQA------QLQQFVLFQSASQQNASSSAANSPTGLP 77
                +P+ L  LQ  L   Q  L A        Q  +     +   A ++ A +   L 
Sbjct: 247 TPIPNTPNSLKALQAALATAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKELA 306

Query: 78  PQAQFFLQNQVQQAVAQAAQQLQQLQKQ 105
             AQ       Q  +A A   L   + +
Sbjct: 307 -NAQAQALQTAQNNLATAQAALANAEAR 333


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 27.8 bits (62), Expect = 7.6
 Identities = 25/172 (14%), Positives = 35/172 (20%)

Query: 13  QNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANS 72
               +A LP P         L  L    QQ  ++             A     +     +
Sbjct: 55  APPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQT 114

Query: 73  PTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSNIRV 132
             G     Q  +      A +   Q  QQ   Q            Q          N   
Sbjct: 115 EPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDES 174

Query: 133 PAHHIKQTPARLLDPDETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKL 184
                          D   D E             ++      + G AM  L
Sbjct: 175 QLQQQPNGETPPQQTDGAGDDESEALVRLREADGTLEQRIKGAEGGGAMKVL 226


>gnl|CDD|204415 pfam10212, TTKRSYEDQ, Predicted coiled-coil domain-containing
           protein.  This is the C-terminal 500 amino acids of a
           family of proteins with a predicted coiled-coil domain
           conserved from nematodes to humans. It carries a
           characteristic TTKRSYEDQ sequence-motif. The function is
           not known.
          Length = 518

 Score = 27.6 bits (61), Expect = 8.4
 Identities = 20/101 (19%), Positives = 39/101 (38%), Gaps = 5/101 (4%)

Query: 20  LPLPNLSSLSPSELSTLQQTLQQHQHSLQAQLQQF--VLFQSASQQNASSSAANSPTGLP 77
           LP    S     E S     L+     L   +++F  V  +  +  +  +  +  P GL 
Sbjct: 103 LPYQLKSLEEECESSLCTAALRARNQELHNDMKKFTAVFEKLQTYVSLLALPSTKPDGLL 162

Query: 78  PQAQFFLQNQVQQAVAQAAQQLQQLQKQSHHHHQKLSPQSQ 118
                 +  Q+  A+      +++L K   H++QK S + +
Sbjct: 163 RTNYSTVFTQLAAALHGLHDAVKELSK---HYNQKASLEQE 200


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 27.5 bits (62), Expect = 8.6
 Identities = 13/59 (22%), Positives = 20/59 (33%)

Query: 48  QAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQKQS 106
            A  Q      +A     + +    P   P QA      +    +  A QQLQ+ Q  +
Sbjct: 378 AASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGAT 436


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 27.6 bits (61), Expect = 9.0
 Identities = 18/96 (18%), Positives = 38/96 (39%), Gaps = 4/96 (4%)

Query: 10  LLLQNHQLASLPLPNLSSLSPSELSTLQQTLQQHQHSLQA--QLQQFVLFQSASQQNASS 67
            +  N  + +   P     +PS +  L+Q +++ Q  LQ   Q +  +  Q +   +   
Sbjct: 386 GMSANGTVENCI-PENKISTPSAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLER 444

Query: 68  SAANSPTGLPPQAQFFLQNQVQQAVAQAAQQLQQLQ 103
           S  +    L  +    LQ ++   V+   +  Q +Q
Sbjct: 445 SLKSDLGQL-KKENDMLQTKLNSMVSAKQKDKQSMQ 479


>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory
           protein.  This family represents an accessory protein
           that works with the bacteriocin maturation and ABC
           transport secretion protein described by TIGR01193
           [Transport and binding proteins, Other].
          Length = 457

 Score = 27.4 bits (61), Expect = 9.1
 Identities = 25/110 (22%), Positives = 39/110 (35%), Gaps = 25/110 (22%)

Query: 34  STLQQTLQQHQHSLQAQLQQFVLFQSASQQNASSSAANSPTGLPPQAQF----------- 82
              +  L Q       +LQ +   ++A       +  N     P Q+ +           
Sbjct: 171 EKTKAQLDQQISKTDQKLQDYQALKNAISNGTKVANFN-----PYQSLYENYQAQLKSAS 225

Query: 83  --FLQNQV-QQAVAQAAQQLQQLQKQSHHHHQKLSPQSQQSGNTTPPSSN 129
               +NQV    +A   QQ+ QLQK         S Q Q++G T   +SN
Sbjct: 226 DKDQKNQVKSTILATIQQQIDQLQKSIA------SYQVQKAGLTKSTASN 269


>gnl|CDD|234137 TIGR03176, AllC, allantoate amidohydrolase.  This enzyme catalyzes
           the breakdown of allantoate , first to ureidoglycine by
           hydrolysis and then decarboxylation of one of the two
           equivalent ureido groups. Ureidoglycine then
           spontaneously exchanges ammonia for water resulting in
           ureidoglycolate. This enzyme is an alternative to
           allantoicase (3.5.3.4) which releases urea [Central
           intermediary metabolism, Nitrogen metabolism].
          Length = 406

 Score = 27.5 bits (61), Expect = 9.5
 Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 4/52 (7%)

Query: 143 RLLDPDETTDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTI 194
           RLL   E   L   +QF K   +  ++  F   DVG   G+L G +F + TI
Sbjct: 25  RLLYSPEW--LAAQQQFKKRMAESGLETRFD--DVGNLYGRLVGTEFPEETI 72


>gnl|CDD|224314 COG1396, HipB, Predicted transcriptional regulators
           [Transcription].
          Length = 120

 Score = 26.5 bits (57), Expect = 9.9
 Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 6/41 (14%)

Query: 158 QFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFE 198
              +  K+ R K G +Q ++   +G       S++TISR E
Sbjct: 1   SIGERLKELRKKKGLSQEELAERLG------VSRSTISRIE 35


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.124    0.341 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,951,064
Number of extensions: 1275146
Number of successful extensions: 2572
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2254
Number of HSP's successfully gapped: 275
Length of query: 289
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 193
Effective length of database: 6,679,618
Effective search space: 1289166274
Effective search space used: 1289166274
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.0 bits)