RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13443
         (188 letters)



>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 39.2 bits (91), Expect = 9e-04
 Identities = 15/21 (71%), Positives = 18/21 (85%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
           GG+ EEEEEEEEEEE+  E+E
Sbjct: 860 GGDSEEEEEEEEEEEEEEEEE 880



 Score = 38.4 bits (89), Expect = 0.002
 Identities = 19/31 (61%), Positives = 21/31 (67%), Gaps = 6/31 (19%)

Query: 160 VDKKGG------EEEEEEEEEEEEEDNIEQE 184
           VD  GG      EEEEEEEEEEEEE+  E+E
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEEEEE 882



 Score = 37.7 bits (87), Expect = 0.002
 Identities = 15/19 (78%), Positives = 17/19 (89%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           EEEEEEEEEEEEE+  E+E
Sbjct: 868 EEEEEEEEEEEEEEEEEEE 886



 Score = 37.3 bits (86), Expect = 0.003
 Identities = 16/24 (66%), Positives = 19/24 (79%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           D +  EEEEEEEEEEEEE+  E+E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEE 885



 Score = 37.3 bits (86), Expect = 0.003
 Identities = 15/17 (88%), Positives = 16/17 (94%)

Query: 166 EEEEEEEEEEEEEDNIE 182
           EEEEEEEEEEEEE+N E
Sbjct: 876 EEEEEEEEEEEEEENEE 892



 Score = 37.3 bits (86), Expect = 0.004
 Identities = 15/19 (78%), Positives = 17/19 (89%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           EEEEEEEEEEEEE+  E+E
Sbjct: 871 EEEEEEEEEEEEEEEEEEE 889



 Score = 37.3 bits (86), Expect = 0.004
 Identities = 15/24 (62%), Positives = 20/24 (83%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           +++  EEEEEEEEEEEEE+  E+E
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEE 888



 Score = 36.9 bits (85), Expect = 0.005
 Identities = 15/19 (78%), Positives = 16/19 (84%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           EEEEEEEEEEEEE+  E E
Sbjct: 873 EEEEEEEEEEEEEEEEENE 891



 Score = 36.5 bits (84), Expect = 0.006
 Identities = 15/23 (65%), Positives = 17/23 (73%)

Query: 166 EEEEEEEEEEEEEDNIEQEFLFE 188
           EEEEEEEEEEEEE+  E+    E
Sbjct: 875 EEEEEEEEEEEEEEENEEPLSLE 897



 Score = 36.5 bits (84), Expect = 0.007
 Identities = 15/21 (71%), Positives = 17/21 (80%)

Query: 166 EEEEEEEEEEEEEDNIEQEFL 186
           EEEEEEEEEEEEE+   +E L
Sbjct: 874 EEEEEEEEEEEEEEEENEEPL 894



 Score = 35.7 bits (82), Expect = 0.010
 Identities = 14/19 (73%), Positives = 16/19 (84%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           EEEEEEEEEEEEE+  E+ 
Sbjct: 872 EEEEEEEEEEEEEEEEEEN 890



 Score = 33.0 bits (75), Expect = 0.083
 Identities = 13/20 (65%), Positives = 16/20 (80%)

Query: 166 EEEEEEEEEEEEEDNIEQEF 185
           EEEEEEEEEEE E+ +  E+
Sbjct: 879 EEEEEEEEEEENEEPLSLEW 898


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 33.1 bits (76), Expect = 0.004
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 121 HLDDHMRRHTGEKPHACGMCGYECAQ 146
           +L  HMR HTGEKP+ C +CG   + 
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 27.4 bits (61), Expect = 0.57
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 93  YMRTHIDTHNGEKPFRCALCAYSARR 118
            +R H+ TH GEKP++C +C  S   
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 25.4 bits (56), Expect = 2.7
 Identities = 7/25 (28%), Positives = 13/25 (52%)

Query: 1  MLRHVRTHLAKQQFECDFCPYGAKQ 25
          + RH+RTH  ++ ++C  C      
Sbjct: 2  LRRHMRTHTGEKPYKCPVCGKSFSS 26


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 34.5 bits (80), Expect = 0.004
 Identities = 12/14 (85%), Positives = 14/14 (100%)

Query: 166 EEEEEEEEEEEEED 179
           EEE++EEEEEEEED
Sbjct: 67  EEEKKEEEEEEEED 80



 Score = 34.1 bits (79), Expect = 0.006
 Identities = 10/14 (71%), Positives = 14/14 (100%)

Query: 166 EEEEEEEEEEEEED 179
           E++EEEEEEEE++D
Sbjct: 69  EKKEEEEEEEEDDD 82



 Score = 34.1 bits (79), Expect = 0.007
 Identities = 11/14 (78%), Positives = 14/14 (100%)

Query: 166 EEEEEEEEEEEEED 179
           EE++EEEEEEEE+D
Sbjct: 68  EEKKEEEEEEEEDD 81



 Score = 31.1 bits (71), Expect = 0.080
 Identities = 10/14 (71%), Positives = 13/14 (92%)

Query: 166 EEEEEEEEEEEEED 179
            EEE++EEEEEEE+
Sbjct: 66  AEEEKKEEEEEEEE 79



 Score = 29.5 bits (67), Expect = 0.34
 Identities = 9/14 (64%), Positives = 12/14 (85%)

Query: 166 EEEEEEEEEEEEED 179
             EEE++EEEEEE+
Sbjct: 65  AAEEEKKEEEEEEE 78



 Score = 28.7 bits (65), Expect = 0.52
 Identities = 7/16 (43%), Positives = 9/16 (56%)

Query: 163 KGGEEEEEEEEEEEEE 178
                  EEE++EEEE
Sbjct: 60  AAAAAAAEEEKKEEEE 75



 Score = 28.7 bits (65), Expect = 0.58
 Identities = 8/18 (44%), Positives = 11/18 (61%)

Query: 162 KKGGEEEEEEEEEEEEED 179
                  EEE++EEEEE+
Sbjct: 60  AAAAAAAEEEKKEEEEEE 77



 Score = 27.6 bits (62), Expect = 1.5
 Identities = 7/21 (33%), Positives = 12/21 (57%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
                  EEE++EEE+  E++
Sbjct: 60  AAAAAAAEEEKKEEEEEEEED 80


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 34.2 bits (79), Expect = 0.009
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIE 182
             ++K  EEEEEEE+EE EE+   
Sbjct: 76  AAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 32.6 bits (75), Expect = 0.036
 Identities = 12/22 (54%), Positives = 17/22 (77%)

Query: 163 KGGEEEEEEEEEEEEEDNIEQE 184
              EE++EEEEEEEE++  E+E
Sbjct: 75  AAAEEKKEEEEEEEEKEESEEE 96



 Score = 31.5 bits (72), Expect = 0.088
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIE 182
                E++EEEEEEEE+E++ E
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEE 95


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 35.2 bits (81), Expect = 0.013
 Identities = 17/37 (45%), Positives = 24/37 (64%), Gaps = 1/37 (2%)

Query: 149 QLMQHLRKV-HKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           +L + L++  H V ++  EEEEEEEEEEEEE    + 
Sbjct: 385 ELREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEA 421


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 34.7 bits (80), Expect = 0.020
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 166 EEEEEEEEEEEEEDNIEQEFLFE 188
           EEE+EEEEEEE+ED    +   +
Sbjct: 392 EEEDEEEEEEEDEDEGPSKEHSD 414



 Score = 30.5 bits (69), Expect = 0.52
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 161 DKKGGEEEEEEEEEEEEED 179
           D +  E+EEEEEEE+E+E 
Sbjct: 389 DDEEEEDEEEEEEEDEDEG 407



 Score = 29.3 bits (66), Expect = 1.3
 Identities = 10/21 (47%), Positives = 17/21 (80%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
             ++EEEE+EEEEEE++ ++ 
Sbjct: 387 ERDDEEEEDEEEEEEEDEDEG 407



 Score = 28.6 bits (64), Expect = 2.4
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           D     ++EEEE+EEEEE+  E E
Sbjct: 383 DANTERDDEEEEDEEEEEEEDEDE 406



 Score = 27.8 bits (62), Expect = 4.2
 Identities = 9/26 (34%), Positives = 19/26 (73%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           +++    E ++EEEE+EEEE+  +++
Sbjct: 380 EIEDANTERDDEEEEDEEEEEEEDED 405


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 33.5 bits (77), Expect = 0.024
 Identities = 14/33 (42%), Positives = 21/33 (63%)

Query: 152 QHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           QHL    + +    EE+EEEE++EE++D  E E
Sbjct: 104 QHLVASEEDESDDDEEDEEEEDDEEDDDEDESE 136



 Score = 30.8 bits (70), Expect = 0.22
 Identities = 8/19 (42%), Positives = 17/19 (89%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           E+EEEE++EE+++++  +E
Sbjct: 119 EDEEEEDDEEDDDEDESEE 137



 Score = 29.6 bits (67), Expect = 0.48
 Identities = 9/19 (47%), Positives = 18/19 (94%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           EEEE++EE+++E+++ E+E
Sbjct: 121 EEEEDDEEDDDEDESEEEE 139



 Score = 28.4 bits (64), Expect = 1.4
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 153 HLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           H+   H V  +  E +++EE+EEEE+D  + +
Sbjct: 100 HISGQHLVASEEDESDDDEEDEEEEDDEEDDD 131


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 34.3 bits (79), Expect = 0.033
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           R      ++  EEEE+E+EEEEEE+   +E
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEEEPEE 367



 Score = 33.5 bits (77), Expect = 0.045
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 156 KVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
                ++K  EEE+E+EEEEEEE+  E+ 
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPEEP 368



 Score = 32.7 bits (75), Expect = 0.085
 Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           D++  +E+EEEEEEEEE +  E E
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPE 371



 Score = 31.2 bits (71), Expect = 0.27
 Identities = 14/24 (58%), Positives = 17/24 (70%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
            K   EE+E+EEEEEEEE+  E E
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEPE 369



 Score = 30.8 bits (70), Expect = 0.39
 Identities = 11/24 (45%), Positives = 18/24 (75%)

Query: 165 GEEEEEEEEEEEEEDNIEQEFLFE 188
            E+++EEEE+E+EE+  E+E   E
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEEPEE 367



 Score = 30.0 bits (68), Expect = 0.72
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           +   +  E+E+EEEEEEEEE    + 
Sbjct: 345 EQKDEEEEQEDEEEEEEEEEPEEPEP 370



 Score = 29.3 bits (66), Expect = 1.1
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQEFL 186
           + +++  EEEEEEEE EE E       L
Sbjct: 350 EEEQEDEEEEEEEEEPEEPEPEEGPPLL 377



 Score = 26.6 bits (59), Expect = 10.0
 Identities = 14/26 (53%), Positives = 19/26 (73%)

Query: 163 KGGEEEEEEEEEEEEEDNIEQEFLFE 188
           +G  + EEEEEEEEEE+  E++ L E
Sbjct: 282 RGLYQLEEEEEEEEEEEPAERDELEE 307


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 34.1 bits (79), Expect = 0.033
 Identities = 11/18 (61%), Positives = 14/18 (77%)

Query: 162 KKGGEEEEEEEEEEEEED 179
           ++  EEEEEEEEE  EE+
Sbjct: 303 EEEEEEEEEEEEEPSEEE 320



 Score = 33.3 bits (77), Expect = 0.048
 Identities = 15/19 (78%), Positives = 17/19 (89%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           EEEEEEEEEEEEE+  E+E
Sbjct: 302 EEEEEEEEEEEEEEPSEEE 320



 Score = 32.5 bits (75), Expect = 0.11
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNI 181
              ++  EEEEEEEEEE  E+  
Sbjct: 299 AAAEEEEEEEEEEEEEEPSEEEA 321



 Score = 31.4 bits (72), Expect = 0.26
 Identities = 13/19 (68%), Positives = 15/19 (78%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
            EEEEEEEEEEEE+   +E
Sbjct: 301 AEEEEEEEEEEEEEEPSEE 319


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 33.2 bits (76), Expect = 0.043
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query: 151 MQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           ++ L      ++   +EEEEEEEEEE+ED  + +
Sbjct: 160 LKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193



 Score = 31.6 bits (72), Expect = 0.17
 Identities = 14/33 (42%), Positives = 23/33 (69%)

Query: 148 SQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDN 180
           S L + L+++   D    +E++EEEEEEEEE++
Sbjct: 154 SMLEKKLKELEAEDVDEEDEKDEEEEEEEEEED 186



 Score = 31.3 bits (71), Expect = 0.22
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           D+K  EEEEEEEEE+E+ D+ + +
Sbjct: 172 DEKDEEEEEEEEEEDEDFDDDDDD 195



 Score = 29.7 bits (67), Expect = 0.73
 Identities = 8/29 (27%), Positives = 19/29 (65%), Gaps = 4/29 (13%)

Query: 161 DKKGGEEEEEEEEEEEEED----NIEQEF 185
           +++  EEE+E+ ++++++D    N E  F
Sbjct: 178 EEEEEEEEDEDFDDDDDDDDDDYNAENYF 206



 Score = 29.3 bits (66), Expect = 0.90
 Identities = 11/24 (45%), Positives = 18/24 (75%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           + +  EEEEEEEEEE+E+ + + +
Sbjct: 171 EDEKDEEEEEEEEEEDEDFDDDDD 194



 Score = 27.8 bits (62), Expect = 2.8
 Identities = 11/34 (32%), Positives = 21/34 (61%)

Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           +      +   EE+E++EEEEEEE+  +++F  +
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDD 192


>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6.  This region covers
           TM regions 4-6 of the ryanodine receptor 1 family.
          Length = 217

 Score = 33.1 bits (76), Expect = 0.051
 Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 6/37 (16%)

Query: 158 HKVDKKGGEEEEEEE------EEEEEEDNIEQEFLFE 188
           +KV     EEEEEE        EE+EE+ I    L E
Sbjct: 145 YKVSTSPAEEEEEEGSGSGDDGEEDEEEGIVYFVLQE 181


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 33.5 bits (77), Expect = 0.054
 Identities = 16/32 (50%), Positives = 22/32 (68%), Gaps = 1/32 (3%)

Query: 148 SQLMQHLRKV-HKVDKKGGEEEEEEEEEEEEE 178
            +L + L++  H V ++   EEEEEEEEEEEE
Sbjct: 385 EELKKFLKEKEHPVVERWAAEEEEEEEEEEEE 416



 Score = 30.0 bits (68), Expect = 0.70
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 166 EEEEEEEEEEEEEDNIEQEFLFE 188
           EEEEEEEEEEEEE+ + +  +  
Sbjct: 406 EEEEEEEEEEEEEEPVAEVMMMP 428



 Score = 27.0 bits (60), Expect = 6.7
 Identities = 11/17 (64%), Positives = 13/17 (76%)

Query: 168 EEEEEEEEEEEDNIEQE 184
             EEEEEEEEE+  E+E
Sbjct: 403 AAEEEEEEEEEEEEEEE 419



 Score = 27.0 bits (60), Expect = 7.2
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 167 EEEEEEEEEEEEDNIEQE 184
             EEEEEEEEEE+  E+ 
Sbjct: 403 AAEEEEEEEEEEEEEEEP 420


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 33.1 bits (76), Expect = 0.076
 Identities = 14/25 (56%), Positives = 16/25 (64%)

Query: 164 GGEEEEEEEEEEEEEDNIEQEFLFE 188
             EEEE  EEEEEEE+  E+ F  E
Sbjct: 309 EEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 28.5 bits (64), Expect = 2.5
 Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 166 EEEEEEEEEEEEEDNIEQEF 185
            E EEEEEEEEE    E+E 
Sbjct: 303 PEPEEEEEEEEEVPEEEEEE 322


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 31.6 bits (72), Expect = 0.078
 Identities = 12/22 (54%), Positives = 17/22 (77%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIE 182
           +++  EEEEEEEEEE EE+ + 
Sbjct: 77  EEEEEEEEEEEEEEESEEEAMA 98



 Score = 30.8 bits (70), Expect = 0.13
 Identities = 15/19 (78%), Positives = 17/19 (89%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           EEEEEEEEEEEEE+  E+E
Sbjct: 77  EEEEEEEEEEEEEEESEEE 95



 Score = 30.8 bits (70), Expect = 0.14
 Identities = 14/19 (73%), Positives = 16/19 (84%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           EEEEEEEEEEEEE+   +E
Sbjct: 76  EEEEEEEEEEEEEEEESEE 94



 Score = 30.0 bits (68), Expect = 0.24
 Identities = 12/23 (52%), Positives = 16/23 (69%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNI 181
             +++  EEEEEEEEEEE E+  
Sbjct: 74  AAEEEEEEEEEEEEEEEESEEEA 96



 Score = 29.6 bits (67), Expect = 0.40
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
               EEEEEEEEEEE+  E+ 
Sbjct: 72  AAAAEEEEEEEEEEEEEEEES 92



 Score = 29.6 bits (67), Expect = 0.40
 Identities = 14/19 (73%), Positives = 15/19 (78%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
            EEEEEEEEEEEE+  E E
Sbjct: 75  AEEEEEEEEEEEEEEEESE 93



 Score = 28.5 bits (64), Expect = 0.79
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
                EEEEEEEEEE+  E+E
Sbjct: 71  AAAAAEEEEEEEEEEEEEEEE 91



 Score = 26.5 bits (59), Expect = 4.4
 Identities = 11/21 (52%), Positives = 13/21 (61%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
                 EEEEEEEEE+  E+E
Sbjct: 70  AAAAAAEEEEEEEEEEEEEEE 90


>gnl|CDD|215121 PLN02189, PLN02189, cellulose synthase.
          Length = 1040

 Score = 33.1 bits (75), Expect = 0.092
 Identities = 28/106 (26%), Positives = 41/106 (38%), Gaps = 26/106 (24%)

Query: 98  IDTHNGEKPFR------CALCAYSARRKTHLDDHMRRHTGEKPHACGMCG---------Y 142
           I  H   KP R      C +C          D+      G+   AC  CG         Y
Sbjct: 20  IHGHEEPKPLRNLDGQVCEICG---------DEIGLTVDGDLFVACNECGFPVCRPCYEY 70

Query: 143 ECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           E  +G+Q     +  +K  K  G    E +++EE+ D+IE EF  +
Sbjct: 71  ERREGTQNCPQCKTRYKRLK--GSPRVEGDDDEEDIDDIEHEFNID 114


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 31.2 bits (71), Expect = 0.098
 Identities = 11/14 (78%), Positives = 12/14 (85%)

Query: 166 EEEEEEEEEEEEED 179
           EEEE+EEE EEE D
Sbjct: 87  EEEEKEEEAEEESD 100



 Score = 31.2 bits (71), Expect = 0.10
 Identities = 10/21 (47%), Positives = 15/21 (71%)

Query: 162 KKGGEEEEEEEEEEEEEDNIE 182
            +  E EEEE+EEE EE++ +
Sbjct: 81  AEADEAEEEEKEEEAEEESDD 101



 Score = 30.8 bits (70), Expect = 0.14
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEFLF 187
                +E EEEE+EEE E+  + + LF
Sbjct: 79  AAAEADEAEEEEKEEEAEEESDDDMLF 105



 Score = 28.5 bits (64), Expect = 0.83
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEF 185
           +     +E EEEE+EEE +    + 
Sbjct: 78  EAAAEADEAEEEEKEEEAEEESDDD 102



 Score = 27.4 bits (61), Expect = 2.4
 Identities = 10/28 (35%), Positives = 14/28 (50%), Gaps = 2/28 (7%)

Query: 159 KVDKKGGEEEEEEEE--EEEEEDNIEQE 184
                G E   E +E  EEE+E+  E+E
Sbjct: 71  AAAAAGAEAAAEADEAEEEEKEEEAEEE 98


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 30.9 bits (70), Expect = 0.13
 Identities = 10/22 (45%), Positives = 18/22 (81%)

Query: 162 KKGGEEEEEEEEEEEEEDNIEQ 183
           +K  E+EEE+++EEE+E+  E+
Sbjct: 76  EKAEEKEEEKKKEEEKEEEEEE 97



 Score = 29.8 bits (67), Expect = 0.28
 Identities = 11/24 (45%), Positives = 19/24 (79%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIE 182
           K ++K  E+++EEE+EEEEE+ + 
Sbjct: 77  KAEEKEEEKKKEEEKEEEEEEALA 100



 Score = 29.8 bits (67), Expect = 0.31
 Identities = 10/23 (43%), Positives = 19/23 (82%)

Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
           ++  EE+EEE+++EEE++  E+E
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEEEE 97



 Score = 28.2 bits (63), Expect = 1.1
 Identities = 9/18 (50%), Positives = 15/18 (83%)

Query: 161 DKKGGEEEEEEEEEEEEE 178
           +K   +EEE+++EEE+EE
Sbjct: 76  EKAEEKEEEKKKEEEKEE 93



 Score = 26.7 bits (59), Expect = 4.3
 Identities = 9/21 (42%), Positives = 16/21 (76%)

Query: 159 KVDKKGGEEEEEEEEEEEEED 179
           +   +  EEE+++EEE+EEE+
Sbjct: 75  EEKAEEKEEEKKKEEEKEEEE 95


>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 32.0 bits (72), Expect = 0.15
 Identities = 25/77 (32%), Positives = 32/77 (41%), Gaps = 2/77 (2%)

Query: 85  NYHTPTRKYMRTHIDTHNGEKPFRCALCAYSARRKTHLDDHMRRHTGEKPHACGMCGYEC 144
           N    +     T     N  +P  C  C  S  R  HL  H+R HTGEKP  C   G + 
Sbjct: 12  NNSVLSSTPKSTLKSLSNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDK 71

Query: 145 AQG--SQLMQHLRKVHK 159
           +     +L +HLR  H 
Sbjct: 72  SFSRPLELSRHLRTHHN 88



 Score = 27.4 bits (60), Expect = 6.3
 Identities = 18/65 (27%), Positives = 26/65 (40%), Gaps = 2/65 (3%)

Query: 73  PDASYTYMCYLCNYHTPTRKYMRTHIDTHNGEKPFRCAL--CAYSARRKTHLDDHMRRHT 130
            +A     C  C       +++  HI +H GEKP +C+   C  S  R   L  H+R H 
Sbjct: 28  SNAPRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHH 87

Query: 131 GEKPH 135
                
Sbjct: 88  NNPSD 92


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 32.0 bits (72), Expect = 0.15
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
            +  E+EEEEEEEEEEE  +E+ 
Sbjct: 133 DESDEDEEEEEEEEEEEAEVEEN 155



 Score = 32.0 bits (72), Expect = 0.15
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           +K     KK  +E+E +E+EEEEE+  E+E
Sbjct: 120 KKAGNAGKKATKEDESDEDEEEEEEEEEEE 149



 Score = 31.2 bits (70), Expect = 0.26
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           +K  K D+   +EEEEEEEEEEE +  E E
Sbjct: 127 KKATKEDESDEDEEEEEEEEEEEAEVEENE 156



 Score = 29.3 bits (65), Expect = 1.2
 Identities = 13/18 (72%), Positives = 14/18 (77%)

Query: 166 EEEEEEEEEEEEEDNIEQ 183
           EEEEEEEEEE E +  EQ
Sbjct: 140 EEEEEEEEEEAEVEENEQ 157



 Score = 26.6 bits (58), Expect = 8.6
 Identities = 9/19 (47%), Positives = 10/19 (52%)

Query: 164 GGEEEEEEEEEEEEEDNIE 182
           GG+  EE EEE   E   E
Sbjct: 177 GGDNGEEGEEESVTEAEAE 195


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 30.4 bits (69), Expect = 0.15
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 148 SQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           + L      +   DK   ++EE EE++EE  + I++E
Sbjct: 37  TDLYYRTDYLEVTDKVKEKKEEREEDKEELIERIKKE 73


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 30.8 bits (70), Expect = 0.16
 Identities = 9/20 (45%), Positives = 16/20 (80%)

Query: 162 KKGGEEEEEEEEEEEEEDNI 181
            +  +EE++EEEEEE +D++
Sbjct: 79  AEAKKEEKKEEEEEESDDDM 98



 Score = 30.0 bits (68), Expect = 0.28
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEFLF 187
                ++EE++EEEEEE D+     LF
Sbjct: 77  AAAEAKKEEKKEEEEEESDDDMGFGLF 103


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 31.1 bits (71), Expect = 0.19
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           K +K+  EEEE E EE +EE+ I++ 
Sbjct: 98  KKEKEEEEEEEVEVEELDEEEQIDEL 123



 Score = 28.4 bits (64), Expect = 1.5
 Identities = 18/32 (56%), Positives = 21/32 (65%)

Query: 153 HLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
            +RK+  +DKK  EEEEEEE E EE D  EQ 
Sbjct: 89  KVRKLLGLDKKEKEEEEEEEVEVEELDEEEQI 120



 Score = 28.0 bits (63), Expect = 1.9
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
              +K  EEEEE E EE +E+    E
Sbjct: 97  DKKEKEEEEEEEVEVEELDEEEQIDE 122



 Score = 27.3 bits (61), Expect = 3.2
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           + +++  EE E EE +EEE+ +   E
Sbjct: 100 EKEEEEEEEVEVEELDEEEQIDELLE 125



 Score = 26.9 bits (60), Expect = 4.9
 Identities = 8/26 (30%), Positives = 18/26 (69%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           + +++   E EE +EEE+ ++ +E+E
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKE 127



 Score = 26.5 bits (59), Expect = 6.1
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 162 KKGGEEEEEEEEEEEEEDNIEQEF 185
            K  +EEEEEEE E EE + E++ 
Sbjct: 97  DKKEKEEEEEEEVEVEELDEEEQI 120


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 30.6 bits (69), Expect = 0.20
 Identities = 11/20 (55%), Positives = 17/20 (85%)

Query: 162 KKGGEEEEEEEEEEEEEDNI 181
           K   ++EE++EEEEEEED++
Sbjct: 87  KAEAKKEEKKEEEEEEEDDL 106



 Score = 29.1 bits (65), Expect = 0.55
 Identities = 12/23 (52%), Positives = 15/23 (65%), Gaps = 3/23 (13%)

Query: 157 VHKVDKKGGEEEEEEEEEEEEED 179
                KK   EE++EEEEEEE+D
Sbjct: 86  AKAEAKK---EEKKEEEEEEEDD 105



 Score = 27.2 bits (60), Expect = 3.1
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 161 DKKGGEEEEEEEEEEEEEDN 180
             K   ++EE++EEEEEE++
Sbjct: 85  GAKAEAKKEEKKEEEEEEED 104


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 31.4 bits (71), Expect = 0.24
 Identities = 13/40 (32%), Positives = 23/40 (57%)

Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           +L+  L ++ K D     E  E +EE+E+ +N E ++L E
Sbjct: 81  RLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLRE 120


>gnl|CDD|238704 cd01413, SIR2_Af2, SIR2_Af2: Archaeal and prokaryotic group which
           includes Archaeoglobus fulgidus Sir2-Af2, Sulfolobus
           solfataricus ssSir2, and several bacterial homologs; and
           are members of the SIR2 family of proteins, silent
           information regulator 2 (Sir2) enzymes which catalyze
           NAD+-dependent protein/histone deacetylation. Sir2
           proteins have been shown to regulate gene silencing, DNA
           repair, metabolic enzymes, and life span. The Sir2
           homolog from the archaea Sulfolobus solftaricus
           deacetylates the non-specific DNA protein Alba to
           mediate transcription repression.
          Length = 222

 Score = 30.8 bits (70), Expect = 0.35
 Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 9/56 (16%)

Query: 22  GAKQAADVHNHVQQIHMGVNFVCVHCKQ---FEVVPTRKTQTLEHCATCVDMVRPD 74
           G+K   ++H  +Q  +      CV+C      E V   K   +  C  C  ++RPD
Sbjct: 100 GSKNVIELHGTLQTAY------CVNCGSKYDLEEVKYAKKHEVPRCPKCGGIIRPD 149


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 29.8 bits (67), Expect = 0.38
 Identities = 10/15 (66%), Positives = 11/15 (73%)

Query: 164 GGEEEEEEEEEEEEE 178
           G   +EEEEEEEE E
Sbjct: 92  GHTRQEEEEEEEENE 106



 Score = 28.6 bits (64), Expect = 1.1
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 165 GEEEEEEEEEEEEEDNIEQE 184
           G     +EEEEEEE+N +Q+
Sbjct: 90  GTGHTRQEEEEEEEENEKQQ 109



 Score = 27.9 bits (62), Expect = 2.2
 Identities = 10/19 (52%), Positives = 12/19 (63%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
            +EEEEEEEE E+     E
Sbjct: 95  RQEEEEEEEENEKQQQSDE 113



 Score = 27.5 bits (61), Expect = 2.5
 Identities = 10/21 (47%), Positives = 14/21 (66%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
           G     +EEEEEEEE+  +Q+
Sbjct: 90  GTGHTRQEEEEEEEENEKQQQ 110



 Score = 27.1 bits (60), Expect = 3.2
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 163 KGGEEEEEEEEEEEEEDNIE 182
              EEEEEEEE E+++ + E
Sbjct: 94  TRQEEEEEEEENEKQQQSDE 113


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 30.8 bits (70), Expect = 0.38
 Identities = 8/18 (44%), Positives = 9/18 (50%)

Query: 162 KKGGEEEEEEEEEEEEED 179
                    EEEEEEE+D
Sbjct: 286 AAAAAAAPAEEEEEEEDD 303



 Score = 29.6 bits (67), Expect = 0.86
 Identities = 8/21 (38%), Positives = 8/21 (38%)

Query: 159 KVDKKGGEEEEEEEEEEEEED 179
                        EEEEEEED
Sbjct: 282 AAAAAAAAAAAPAEEEEEEED 302



 Score = 29.2 bits (66), Expect = 1.2
 Identities = 7/14 (50%), Positives = 9/14 (64%)

Query: 166 EEEEEEEEEEEEED 179
               EEEEEEE++ 
Sbjct: 291 AAPAEEEEEEEDDM 304


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 31.1 bits (70), Expect = 0.39
 Identities = 8/26 (30%), Positives = 15/26 (57%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
              ++  + EE EEE+ EEE  + ++
Sbjct: 312 PEIEQDEDSEESEEEKNEEEGGLSKK 337


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 28.1 bits (63), Expect = 0.40
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           E+ E+EEE E+EE+  EQ 
Sbjct: 24  EKREDEEENEDEEEGEEQS 42



 Score = 27.3 bits (61), Expect = 0.80
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query: 167 EEEEEEEEEEEEDNIEQEF 185
           EEE+ E+EEE ED  E E 
Sbjct: 22  EEEKREDEEENEDEEEGEE 40



 Score = 25.8 bits (57), Expect = 2.5
 Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 167 EEEEEEEEEEEEDNIEQE 184
            E+EEE E+EEE   + E
Sbjct: 26  REDEEENEDEEEGEEQSE 43



 Score = 25.0 bits (55), Expect = 5.0
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 161 DKKGGEEEEEEEEEEEEED 179
           ++K  +EEE E+EEE EE 
Sbjct: 23  EEKREDEEENEDEEEGEEQ 41



 Score = 25.0 bits (55), Expect = 6.0
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           E+EEE E+EEE E+  E +
Sbjct: 27  EDEEENEDEEEGEEQSEVK 45


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 30.7 bits (69), Expect = 0.46
 Identities = 10/31 (32%), Positives = 20/31 (64%)

Query: 154 LRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           ++K + + K G E EEE  EEE++++  + +
Sbjct: 32  MKKENAIRKLGKEAEEEAMEEEDDDEEDDDD 62


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 30.8 bits (69), Expect = 0.54
 Identities = 8/20 (40%), Positives = 18/20 (90%)

Query: 165 GEEEEEEEEEEEEEDNIEQE 184
            ++E+E+E++++EED+ E+E
Sbjct: 155 DDDEDEDEDDDDEEDDEEEE 174



 Score = 30.0 bits (67), Expect = 0.80
 Identities = 9/24 (37%), Positives = 19/24 (79%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           D +  +E++++EE++EEE+  E+E
Sbjct: 156 DDEDEDEDDDDEEDDEEEEEEEEE 179



 Score = 28.9 bits (64), Expect = 1.8
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
             +E+E+E++++EE+D  E+E
Sbjct: 155 DDDEDEDEDDDDEEDDEEEEE 175



 Score = 27.7 bits (61), Expect = 4.2
 Identities = 9/30 (30%), Positives = 20/30 (66%)

Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           R    +D    +E+E++++EE++E+  E+E
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEEEEEEE 177


>gnl|CDD|215224 PLN02400, PLN02400, cellulose synthase.
          Length = 1085

 Score = 30.7 bits (69), Expect = 0.55
 Identities = 26/109 (23%), Positives = 43/109 (39%), Gaps = 26/109 (23%)

Query: 92  KYMRTHIDTHNGEKPFR------CALCAYSARRKTHLDDHMRRHTGEKPHACGMCG---- 141
           + +R   D+ +G KP +      C +C          DD     TG+   AC  C     
Sbjct: 16  ELVRIRHDSDSGPKPLKNLNGQICQICG---------DDVGVTETGDVFVACNECAFPVC 66

Query: 142 -----YECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEF 185
                YE   G+Q     +   +  +  G    E +E+E++ D++E EF
Sbjct: 67  RPCYEYERKDGTQCCPQCKT--RYRRHKGSPRVEGDEDEDDVDDLENEF 113


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 30.5 bits (69), Expect = 0.56
 Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 2/37 (5%)

Query: 148 SQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           S++    R   ++D    EE +E+E+EEEE+ + E E
Sbjct: 352 SKMRDKRRA--RLDPIDFEEVDEDEDEEEEQRSDEHE 386



 Score = 27.8 bits (62), Expect = 3.4
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 165 GEEEEEEEEEEEEEDNIEQE 184
            E +E+E+EEEE+  +  +E
Sbjct: 368 EEVDEDEDEEEEQRSDEHEE 387



 Score = 27.4 bits (61), Expect = 4.4
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           E+E+EEEE+  +E   E+ 
Sbjct: 372 EDEDEEEEQRSDEHEEEEG 390


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
           Ribosomal proteins P1 and P2 are the eukaryotic proteins
           that are functionally equivalent to bacterial L7/L12.
           L12p is the archaeal homolog. Unlike other ribosomal
           proteins, the archaeal L12p and eukaryotic P1 and P2 do
           not share sequence similarity with their bacterial
           counterparts. They are part of the ribosomal stalk
           (called the L7/L12 stalk in bacteria), along with 28S
           rRNA and the proteins L11 and P0 in eukaryotes (23S
           rRNA, L11, and L10e in archaea). In bacterial ribosomes,
           L7/L12 homodimers bind the extended C-terminal helix of
           L10 to anchor the L7/L12 molecules to the ribosome.
           Eukaryotic P1/P2 heterodimers and archaeal L12p
           homodimers are believed to bind the L10 equivalent
           proteins, eukaryotic P0 and archaeal L10e, in a similar
           fashion. P1 and P2 (L12p, L7/L12) are the only proteins
           in the ribosome to occur as multimers, always appearing
           as sets of dimers. Recent data indicate that most
           archaeal species contain six copies of L12p (three
           homodimers), while eukaryotes have two copies each of P1
           and P2 (two heterodimers). Bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. As in bacteria, the stalk is crucial for
           binding of initiation, elongation, and release factors
           in eukaryotes and archaea.
          Length = 105

 Score = 29.2 bits (65), Expect = 0.56
 Identities = 11/14 (78%), Positives = 13/14 (92%)

Query: 166 EEEEEEEEEEEEED 179
           EE +EEEEEEE+ED
Sbjct: 86  EEAKEEEEEEEDED 99



 Score = 28.0 bits (62), Expect = 1.4
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query: 167 EEEEEEEEEEEEDNIEQEFLF 187
            EE +EEEEEEED      LF
Sbjct: 85  AEEAKEEEEEEEDEDFGFGLF 105



 Score = 26.8 bits (59), Expect = 3.3
 Identities = 9/15 (60%), Positives = 13/15 (86%)

Query: 166 EEEEEEEEEEEEEDN 180
            EE +EEEEEEE+++
Sbjct: 85  AEEAKEEEEEEEDED 99


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 30.2 bits (68), Expect = 0.62
 Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 149 QLMQHLRKV-HKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           +L + L++  H V K+   E +EEE EEEEE    +E
Sbjct: 383 ELRKFLKEKGHPVVKRVVREVDEEEIEEEEEAMQPEE 419


>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
           (CLPTM1).  This family consists of several eukaryotic
           cleft lip and palate transmembrane protein 1 sequences.
           Cleft lip with or without cleft palate is a common birth
           defect that is genetically complex. The nonsyndromic
           forms have been studied genetically using linkage and
           candidate-gene association studies with only partial
           success in defining the loci responsible for orofacial
           clefting. CLPTM1 encodes a transmembrane protein and has
           strong homology to two Caenorhabditis elegans genes,
           suggesting that CLPTM1 may belong to a new gene family.
           This family also contains the human cisplatin resistance
           related protein CRR9p which is associated with
           CDDP-induced apoptosis.
          Length = 437

 Score = 30.0 bits (68), Expect = 0.76
 Identities = 14/41 (34%), Positives = 19/41 (46%), Gaps = 6/41 (14%)

Query: 146 QGSQLMQHLRKVHKVDKK----GGEEEEEEEEEEEEEDNIE 182
               L  +L K  K  KK    GG+ E+EE EEE+     +
Sbjct: 140 FVFPLTTYLPK--KKVKKKNLLGGKSEKEEPEEEKTPAPDK 178


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The CAF-1
           or chromatin assembly factor-1 consists of three
           subunits, and this is the first, or A. The A domain is
           uniquely required for the progression of S phase in
           mouse cells, independent of its ability to promote
           histone deposition but dependent on its ability to
           interact with HP1 - heterochromatin protein 1-rich
           heterochromatin domains next to centromeres that are
           crucial for chromosome segregation during mitosis. This
           HP1-CAF-1 interaction module functions as a built-in
           replication control for heterochromatin, which, like a
           control barrier, has an impact on S-phase progression in
           addition to DNA-based checkpoints.
          Length = 76

 Score = 28.0 bits (63), Expect = 0.80
 Identities = 6/16 (37%), Positives = 13/16 (81%)

Query: 165 GEEEEEEEEEEEEEDN 180
            +EE+EEE+++++ D 
Sbjct: 60  EDEEDEEEDDDDDMDG 75



 Score = 26.8 bits (60), Expect = 2.5
 Identities = 10/25 (40%), Positives = 16/25 (64%), Gaps = 5/25 (20%)

Query: 166 EEEEE-----EEEEEEEEDNIEQEF 185
           EEEEE      E+EE+EE++ + + 
Sbjct: 49  EEEEEGEDLESEDEEDEEEDDDDDM 73



 Score = 25.6 bits (57), Expect = 6.3
 Identities = 5/16 (31%), Positives = 12/16 (75%)

Query: 164 GGEEEEEEEEEEEEED 179
             EE+EEE+++++ + 
Sbjct: 60  EDEEDEEEDDDDDMDG 75



 Score = 25.3 bits (56), Expect = 8.7
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query: 165 GEEEEEEEEEEEEED 179
           GE+ E E+EE+EEED
Sbjct: 54  GEDLESEDEEDEEED 68


>gnl|CDD|227003 COG4656, RnfC, Predicted NADH:ubiquinone oxidoreductase, subunit
           RnfC [Energy production and conversion].
          Length = 529

 Score = 30.0 bits (68), Expect = 0.82
 Identities = 16/60 (26%), Positives = 24/60 (40%), Gaps = 11/60 (18%)

Query: 107 FRCALCAYSARR-----KTHLDDHMRRHTGEKPHA------CGMCGYECAQGSQLMQHLR 155
            RC+LCA +        + +      +H  E+ H       CG C Y C     L+Q+ R
Sbjct: 368 IRCSLCADACPVNLLPQQLYWFAKGEQHDEEEEHNLLDCIECGACAYVCPSNIPLVQYFR 427


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 29.5 bits (66), Expect = 0.87
 Identities = 13/17 (76%), Positives = 16/17 (94%)

Query: 166 EEEEEEEEEEEEEDNIE 182
           EE+EEEEEEEEEE+ I+
Sbjct: 229 EEDEEEEEEEEEEEEID 245



 Score = 29.1 bits (65), Expect = 1.2
 Identities = 13/22 (59%), Positives = 16/22 (72%)

Query: 158 HKVDKKGGEEEEEEEEEEEEED 179
              D +  E+EEEEEEEEEEE+
Sbjct: 222 SGSDSESEEDEEEEEEEEEEEE 243



 Score = 27.2 bits (60), Expect = 6.1
 Identities = 12/17 (70%), Positives = 14/17 (82%)

Query: 166 EEEEEEEEEEEEEDNIE 182
           E EE+EEEEEEEE+  E
Sbjct: 227 ESEEDEEEEEEEEEEEE 243



 Score = 26.8 bits (59), Expect = 6.7
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 165 GEEEEEEEEEEEEEDNIEQE 184
           G + E EE+EEEEE+  E+E
Sbjct: 223 GSDSESEEDEEEEEEEEEEE 242



 Score = 26.8 bits (59), Expect = 7.1
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 164 GGEEEEEEEEEEEEEDNIEQE 184
           G + E EE+EEEEEE+  E+E
Sbjct: 223 GSDSESEEDEEEEEEEEEEEE 243


>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
           Provisional.
          Length = 130

 Score = 28.6 bits (64), Expect = 0.96
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 167 EEEEEEEEEEEEDNIEQEFLFE 188
           ++EEE+EEE  E   E+E L E
Sbjct: 96  KKEEEKEEEIPEPTKEEELLGE 117



 Score = 27.9 bits (62), Expect = 1.8
 Identities = 10/31 (32%), Positives = 18/31 (58%)

Query: 154 LRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           ++  +K+  K  EE+EEE  E  +E+ +  E
Sbjct: 87  VKVFNKLTSKKEEEKEEEIPEPTKEEELLGE 117


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score = 29.3 bits (67), Expect = 1.1
 Identities = 11/43 (25%), Positives = 21/43 (48%), Gaps = 5/43 (11%)

Query: 148 SQLMQHLR-----KVHKVDKKGGEEEEEEEEEEEEEDNIEQEF 185
              + +L          V +   +  +E+EEE+EE++  E+EF
Sbjct: 452 EFWISYLEEFDGKPFKSVARGDLDLGKEDEEEKEEKEEAEEEF 494



 Score = 28.2 bits (64), Expect = 2.8
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 154 LRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFL 186
            + V + D   G+E+EEE+EE+EE +   +  L
Sbjct: 466 FKSVARGDLDLGKEDEEEKEEKEEAEEEFKPLL 498


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 28.4 bits (64), Expect = 1.1
 Identities = 8/14 (57%), Positives = 12/14 (85%)

Query: 166 EEEEEEEEEEEEED 179
           EE++EE EEE ++D
Sbjct: 89  EEKKEESEEESDDD 102



 Score = 27.6 bits (62), Expect = 1.9
 Identities = 8/14 (57%), Positives = 12/14 (85%)

Query: 166 EEEEEEEEEEEEED 179
           +EE++EE EEE +D
Sbjct: 88  KEEKKEESEEESDD 101



 Score = 27.2 bits (61), Expect = 2.2
 Identities = 8/14 (57%), Positives = 12/14 (85%)

Query: 166 EEEEEEEEEEEEED 179
           ++EE++EE EEE D
Sbjct: 87  KKEEKKEESEEESD 100


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 29.4 bits (66), Expect = 1.2
 Identities = 8/29 (27%), Positives = 15/29 (51%), Gaps = 5/29 (17%)

Query: 165 GEEEEEEEEEEEEEDNIEQ-----EFLFE 188
            EEE +E ++ + +D  E      E +F+
Sbjct: 285 PEEEPDEPDQTDPDDGEETDQIPEELMFD 313


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 29.0 bits (65), Expect = 1.2
 Identities = 14/24 (58%), Positives = 18/24 (75%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           +K+  E+EEEEE EEEEE+  E E
Sbjct: 262 EKELKEDEEEEETEEEEEEEDEDE 285



 Score = 27.5 bits (61), Expect = 4.3
 Identities = 12/20 (60%), Positives = 17/20 (85%)

Query: 160 VDKKGGEEEEEEEEEEEEED 179
           V++K  +E+EEEEE EEEE+
Sbjct: 260 VEEKELKEDEEEEETEEEEE 279



 Score = 26.7 bits (59), Expect = 7.7
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 152 QHLRKVHKVDKKGGEEEEEEEEEEEEE 178
               K  K D++  E EEEEEEE+E+E
Sbjct: 259 YVEEKELKEDEEEEETEEEEEEEDEDE 285


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 29.3 bits (65), Expect = 1.3
 Identities = 11/36 (30%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 145 AQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDN 180
           +Q S++ Q   + ++  K    + EE++++ EEEDN
Sbjct: 94  SQESEV-QDNTEQNQDTKGSKTDSEEDDDDSEEEDN 128


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 28.7 bits (64), Expect = 1.3
 Identities = 11/17 (64%), Positives = 16/17 (94%)

Query: 168 EEEEEEEEEEEDNIEQE 184
           E+EEEEEE+EEDN++ +
Sbjct: 76  EDEEEEEEDEEDNVDLK 92



 Score = 27.9 bits (62), Expect = 2.3
 Identities = 9/19 (47%), Positives = 15/19 (78%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           E+EEEEEE+EE+  +++  
Sbjct: 76  EDEEEEEEDEEDNVDLKDI 94



 Score = 27.9 bits (62), Expect = 2.3
 Identities = 12/28 (42%), Positives = 18/28 (64%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           D +  EEE+EEE EE E+   E+E + +
Sbjct: 50  DDEEEEEEDEEEIEEPEDIEDEEEIVED 77



 Score = 27.9 bits (62), Expect = 2.5
 Identities = 12/23 (52%), Positives = 19/23 (82%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQ 183
           D K  ++E++EEEEEE+E+ IE+
Sbjct: 42  DVKDEKQEDDEEEEEEDEEEIEE 64


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 29.0 bits (65), Expect = 1.4
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 164 GGEEEEEEEEEEEEEDNIEQEFLFE 188
           G EEE EE EE+EEE+  E+   FE
Sbjct: 79  GEEEEVEEGEEDEEEEGEEESEEFE 103


>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain.  This
           family represents the C-terminus (approximately 300
           residues) of proteins that are involved as binding
           partners for Prp19 as part of the nuclear pore complex.
           The family in Drosophila is necessary for pre-mRNA
           splicing, and the human protein has been found in
           purifications of the spliceosome. In the past this
           family was thought, erroneously, to be associated with
           microfibrillin.
          Length = 277

 Score = 28.7 bits (64), Expect = 1.5
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           ++    EEEEEE EEEEE + E +
Sbjct: 9   EEDESGEEEEEESEEEEETDSEDD 32


>gnl|CDD|129294 TIGR00190, thiC, thiamine biosynthesis protein ThiC.  The thiC
           ortholog is designated thiA in Bacillus subtilis
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Thiamine].
          Length = 423

 Score = 28.9 bits (65), Expect = 1.5
 Identities = 13/45 (28%), Positives = 15/45 (33%), Gaps = 6/45 (13%)

Query: 101 HNGEKPFRCALCAYSARRKTHLDDHMRRHTGEKPHACGMCGYECA 145
            + E+ F  AL    AR              E    C MCG  CA
Sbjct: 376 FDWERQFELALDPEKAREY-----RDETPP-EDEDFCSMCGKYCA 414


>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
          Length = 1627

 Score = 29.0 bits (65), Expect = 1.7
 Identities = 11/43 (25%), Positives = 23/43 (53%)

Query: 146 QGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           +G   ++ L++  +   +   EE + EE +EE +  E+ F +E
Sbjct: 271 EGWDWLKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYE 313


>gnl|CDD|237359 PRK13352, PRK13352, thiamine biosynthesis protein ThiC;
           Provisional.
          Length = 431

 Score = 28.9 bits (66), Expect = 1.7
 Identities = 15/42 (35%), Positives = 16/42 (38%), Gaps = 7/42 (16%)

Query: 104 EKPFRCALCAYSARRKTHLDDHMRRHTGEKPHACGMCGYECA 145
           EK F  AL    AR         R    +   AC MCG  CA
Sbjct: 382 EKQFELALDPEKARE-------YRDERLKDEEACSMCGKFCA 416


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 28.7 bits (65), Expect = 1.7
 Identities = 11/34 (32%), Positives = 20/34 (58%)

Query: 151 MQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
            +  R+  K +KK      +++EEEEEE+  ++E
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452



 Score = 27.2 bits (61), Expect = 6.5
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           K      +EEEEEEE+E++E+  E+E
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEE 458


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 28.9 bits (65), Expect = 2.1
 Identities = 13/38 (34%), Positives = 21/38 (55%), Gaps = 3/38 (7%)

Query: 145 AQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIE 182
            QG +L    RK+        +E+E+ + EEE+ED+ E
Sbjct: 309 RQGEELR---RKIEGKSVSEEDEDEDSDSEEEDEDDDE 343


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 28.1 bits (62), Expect = 2.2
 Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 4/40 (10%)

Query: 151 MQHLRKVHKVDKKGGEEEEEEEEE----EEEEDNIEQEFL 186
           M+  R+  K+D++  +EE+E E E    EE+ D+ E E +
Sbjct: 142 MEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEII 181


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 28.2 bits (63), Expect = 2.4
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           K + KG EE+ E EE E+EE   + E
Sbjct: 24  KEEDKGKEEDLEFEEIEKEEIIEDSE 49


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 28.5 bits (63), Expect = 2.5
 Identities = 5/25 (20%), Positives = 11/25 (44%)

Query: 160 VDKKGGEEEEEEEEEEEEEDNIEQE 184
             +   EE + +  +E  E+  E +
Sbjct: 303 NAEAHKEEVQSDRPDEIGEEKEEDD 327


>gnl|CDD|145910 pfam03012, PP_M1, Phosphoprotein.  This family includes the M1
           phosphoprotein non-structural RNA polymerase alpha
           subunit, which is thought to be a component of the
           active polymerase, and may be involved in template
           binding.
          Length = 298

 Score = 28.2 bits (63), Expect = 2.7
 Identities = 8/30 (26%), Positives = 16/30 (53%)

Query: 150 LMQHLRKVHKVDKKGGEEEEEEEEEEEEED 179
           L + + ++   D K  +  EEE+E+E   +
Sbjct: 49  LPEDMSRLQISDAKPSQFTEEEDEDEGSSE 78


>gnl|CDD|220661 pfam10263, SprT-like, SprT-like family.  This family represents a
           domain found in eukaryotes and prokaryotes. The domain
           contains a characteristic motif of the zinc
           metallopeptidases. This family includes the bacterial
           SprT protein.
          Length = 153

 Score = 27.7 bits (62), Expect = 2.7
 Identities = 11/46 (23%), Positives = 18/46 (39%), Gaps = 8/46 (17%)

Query: 67  CVDMVRPDASYTYMCYLCNYHTPTRKYMRTHIDTHNGEKPFRCALC 112
            +++V     Y Y C  C    P ++ +R H         +RC  C
Sbjct: 108 DIEVVSGRKRYIYRCGSCGQLYPRKRRIRRHK--------YRCGRC 145


>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
          Length = 268

 Score = 28.1 bits (63), Expect = 2.8
 Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 3/50 (6%)

Query: 137 CGMCGYECAQGSQLMQHLRKVH---KVDKKGGEEEEEEEEEEEEEDNIEQ 183
           C +CG    +  +L+   R V    + +++  E +E EEE  E++  IE+
Sbjct: 19  CEVCGKRIEKVVELLGKNRIVPIMCECEREAEEAKEREEENREKQRRIER 68


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 28.1 bits (63), Expect = 2.9
 Identities = 8/24 (33%), Positives = 16/24 (66%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           +    E E+E+EE+E+EE+  + +
Sbjct: 292 EDDYSESEDEDEEDEDEEEEEDDD 315



 Score = 26.5 bits (59), Expect = 8.2
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           K +++  E++  E E+E+EED  E+E
Sbjct: 285 KAEEEEEEDDYSESEDEDEEDEDEEE 310


>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
           Prothymosin alpha and parathymosin are two ubiquitous
           small acidic nuclear proteins that are thought to be
           involved in cell cycle progression, proliferation, and
           cell differentiation.
          Length = 106

 Score = 26.8 bits (59), Expect = 3.0
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 161 DKKGGEEEEEEEEEEEEE 178
           ++  GEEEE EEEEE E 
Sbjct: 64  EEGEGEEEEGEEEEETEG 81



 Score = 26.1 bits (57), Expect = 5.8
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           D +  EEEE +E++EEEE   E+E
Sbjct: 48  DDEMEEEEEVDEDDEEEEGEGEEE 71



 Score = 25.7 bits (56), Expect = 8.0
 Identities = 12/24 (50%), Positives = 14/24 (58%)

Query: 165 GEEEEEEEEEEEEEDNIEQEFLFE 188
           GE EEEE EEEEE +    +   E
Sbjct: 66  GEGEEEEGEEEEETEGATGKRAAE 89


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 28.0 bits (62), Expect = 3.0
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 160 VDKKGGEEEEEEEEEEEEED 179
           VD +  E  EE +  EEEED
Sbjct: 174 VDNEEEERLEESDGREEEED 193



 Score = 26.5 bits (58), Expect = 9.3
 Identities = 14/28 (50%), Positives = 16/28 (57%), Gaps = 3/28 (10%)

Query: 159 KVDKKGGEEEEEEEE---EEEEEDNIEQ 183
           ++DKK GEEEE EEE     E E   E 
Sbjct: 232 RIDKKQGEEEEMEEEVINLFEIEWEEES 259


>gnl|CDD|172627 PRK14138, PRK14138, NAD-dependent deacetylase; Provisional.
          Length = 244

 Score = 27.9 bits (62), Expect = 3.0
 Identities = 13/57 (22%), Positives = 29/57 (50%), Gaps = 10/57 (17%)

Query: 22  GAKQAADVHNHVQQIHMGVNFVCVHC-KQF---EVVPTRKTQTLEHCATCVDMVRPD 74
           G+K+  ++H +V+       + CV C K++   +V+   +   +  C  C  ++RP+
Sbjct: 106 GSKKVIELHGNVE------EYYCVRCGKRYTVEDVIEKLEKSDVPRCDDCSGLIRPN 156


>gnl|CDD|129414 TIGR00314, cdhA, CO dehydrogenase/acetyl-CoA synthase complex,
           epsilon subunit.  Acetyl-CoA decarbonylase/synthase
           (ACDS) is a multienzyme complex. Carbon monoxide
           dehydrogenase is a synonym. The ACDS complex carries out
           an unusual reaction involving the reversible cleavage
           and synthesis of acetyl-CoA in methanogens. The model
           contains the prosite signature for 4Fe-4S ferredoxins
           [C-x(2)-C-x(2)-C-x(3)-C-[PEG]] between residues 448-462
           of the model [Energy metabolism, Chemoautotrophy].
          Length = 784

 Score = 27.9 bits (62), Expect = 3.4
 Identities = 14/54 (25%), Positives = 19/54 (35%)

Query: 117 RRKTHLDDHMRRHTGEKPHACGMCGYECAQGSQLMQHLRKVHKVDKKGGEEEEE 170
            RK   DD        K   CG C   C    ++ + +    K D    E+ EE
Sbjct: 384 ERKFLPDDEELMELANKCTQCGNCVRTCPNSLRVDEAMAHAQKGDLSKLEQLEE 437


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
           function of this eukaryotic protein family is unknown.
           The yeast orthologues have been implicated in cell cycle
           progression and biogenesis of 60S ribosomal subunits.
           The Schistosoma mansoni Mak16 has been shown to target
           protein transport to the nucleolus.
          Length = 97

 Score = 26.8 bits (59), Expect = 3.5
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 160 VDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           ++ +  EE +EEEEEEEEE++  +     
Sbjct: 55  LEAEESEENDEEEEEEEEEEDEGEIEYVS 83


>gnl|CDD|153040 pfam12606, RELT, Tumour necrosis factor receptor superfamily member
           19.  This family of proteins is found in bacteria and
           eukaryotes. Proteins in this family are typically
           between 49 and 288 amino acids in length. There are two
           completely conserved residues (K and Y) that may be
           functionally important. The members of tumor necrosis
           factor receptor (TNFR) superfamily have been designated
           as the "guardians of the immune system" due to their
           roles in immune cell proliferation, differentiation,
           activation, and death (apoptosis). The messenger RNA of
           RELT is especially abundant in hematologic tissues such
           as spleen, lymph node, and peripheral blood leukocytes
           as well as in leukemias and lymphomas. RELT is able to
           activate the NF-kappaB pathway and selectively binds
           tumor necrosis factor receptor-associated factor 1.
          Length = 50

 Score = 25.7 bits (57), Expect = 3.5
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 158 HKVDKKGGEEEEEEEEEEEE 177
             VD    E EE++EEE+EE
Sbjct: 31  CTVDPLEDEPEEKKEEEKEE 50


>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal.  This
           domain family is found in eukaryotes, and is typically
           between 99 and 114 amino acids in length. The family is
           found in association with pfam08697, pfam01585. There
           are two completely conserved residues (G and F) that may
           be functionally important. TIP is involved in enamel
           assembly by interacting with one of the major proteins
           responsible for biomineralisation of enamel - tuftelin.
          Length = 106

 Score = 26.9 bits (60), Expect = 3.6
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
           K+  +E EE+++E+E ED+ E +
Sbjct: 77  KQAAKEPEEDDKEDESEDDDESD 99


>gnl|CDD|234013 TIGR02785, addA_Gpos, helicase-exonuclease AddAB, AddA subunit,
           Firmicutes type.  AddAB, also called RexAB, substitutes
           for RecBCD in several bacterial lineages. These DNA
           recombination proteins act before synapse and are
           particularly important for DNA repair of double-stranded
           breaks by homologous recombination. The term AddAB is
           used broadly, with AddA homologous between the
           Firmicutes (as modeled here) and the
           alphaproteobacteria, while the partner AddB proteins
           show no strong homology across the two groups of species
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 1230

 Score = 28.1 bits (63), Expect = 3.8
 Identities = 10/28 (35%), Positives = 15/28 (53%)

Query: 157 VHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           +++       EEEE +EE E  D  +QE
Sbjct: 518 LYEKLLIEEAEEEEIDEEAEILDKAQQE 545


>gnl|CDD|216813 pfam01964, ThiC, ThiC family.  ThiC is found within the thiamine
           biosynthesis operon. ThiC is involved in pyrimidine
           biosynthesis. The precise catalytic function of ThiC is
           still not known. ThiC participates in the formation of
           4-Amino-5-hydroxymethyl-2-methylpyrimidine from AIR, an
           intermediate in the de novo pyrimidine biosynthesis.
          Length = 421

 Score = 27.8 bits (63), Expect = 3.9
 Identities = 15/43 (34%), Positives = 16/43 (37%), Gaps = 8/43 (18%)

Query: 104 EKPFRCALCAYSARRKTHLDDHMRRHTG-EKPHACGMCGYECA 145
           EK F  AL    AR        +R     E    C MCG  CA
Sbjct: 378 EKQFELALDPEKARE-------IRDERLPEDADFCSMCGKFCA 413


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 27.9 bits (62), Expect = 3.9
 Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 11/64 (17%)

Query: 131 GEKPHACGMCG---------YECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNI 181
           GE   AC  C          YE  +G+Q     +  +K  K  G    E +EEE++ D++
Sbjct: 52  GEPFVACNECAFPVCRPCYEYERREGNQACPQCKTRYKRIK--GSPRVEGDEEEDDIDDL 109

Query: 182 EQEF 185
           E EF
Sbjct: 110 ENEF 113


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 27.0 bits (60), Expect = 4.1
 Identities = 6/26 (23%), Positives = 21/26 (80%)

Query: 159 KVDKKGGEEEEEEEEEEEEEDNIEQE 184
           ++D+ GG+ ++++E++E++E+   ++
Sbjct: 46  RIDEDGGDIDDDDEDDEDDEEADAED 71


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 27.6 bits (61), Expect = 4.2
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 157 VHKVDKKGGEEEEEEEEEEEEEDNIEQ 183
           V  V+K+ G+   EE  +EE ED +E 
Sbjct: 332 VEVVEKQEGDVVTEESTDEESEDEVEI 358



 Score = 27.2 bits (60), Expect = 6.3
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 160 VDKKGGEEEEEEEEEEEEEDNIEQ 183
           V  +   +EE E+E E +E  IE+
Sbjct: 342 VVTEESTDEESEDEVEIDESVIEE 365


>gnl|CDD|236631 PRK09885, PRK09885, putative toxin YafO; Provisional.
          Length = 132

 Score = 27.1 bits (60), Expect = 4.2
 Identities = 10/40 (25%), Positives = 14/40 (35%), Gaps = 4/40 (10%)

Query: 87  HTPTRKYMR----THIDTHNGEKPFRCALCAYSARRKTHL 122
            + T   ++     HI   N   PF   L  +S     HL
Sbjct: 47  DSFTWPLLKFERVAHIHLANVNNPFPLQLRQFSRTSGYHL 86


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 27.6 bits (61), Expect = 4.2
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
            +  E E+EEEEEE E DN  +E
Sbjct: 462 SEEEEGEDEEEEEEVEADNGSEE 484



 Score = 26.8 bits (59), Expect = 7.9
 Identities = 14/24 (58%), Positives = 17/24 (70%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           + +  E  EEEEEEEEEE+  EQE
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQE 461



 Score = 26.8 bits (59), Expect = 9.9
 Identities = 14/31 (45%), Positives = 18/31 (58%)

Query: 158 HKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
               ++  EEE  EEEEEEEE+  E+E   E
Sbjct: 433 SMASQESEEEESVEEEEEEEEEEEEEEQESE 463


>gnl|CDD|132613 TIGR03574, selen_PSTK, L-seryl-tRNA(Sec) kinase, archaeal.  Members
           of this protein are L-seryl-tRNA(Sec) kinase. This
           enzyme is part of a two-step pathway in Eukaryota and
           Archaea for performing selenocysteine biosynthesis by
           changing serine misacylated on selenocysteine-tRNA to
           selenocysteine. This enzyme performs the first step,
           phosphorylation of the OH group of the serine side
           chain. This family represents archaeal proteins with
           this activity [Protein synthesis, tRNA aminoacylation].
          Length = 249

 Score = 27.5 bits (61), Expect = 4.3
 Identities = 12/49 (24%), Positives = 23/49 (46%), Gaps = 11/49 (22%)

Query: 149 QLMQHLRKVHKVDKKGGEEEEE-----------EEEEEEEEDNIEQEFL 186
           Q++  L K  K+DK+  ++  E            + E+E+   +E+EF 
Sbjct: 197 QIVGELIKTGKLDKELIKKLSELRKEFLKSIKDIKAEKEDFRRVEKEFK 245


>gnl|CDD|145556 pfam02478, Pneumo_phosprot, Pneumovirus phosphoprotein.  This
           family represents the phosphoprotein of Paramyxoviridae,
           a putative RNA polymerase alpha subunit that may
           function in template binding.
          Length = 266

 Score = 27.5 bits (61), Expect = 4.4
 Identities = 12/17 (70%), Positives = 15/17 (88%)

Query: 165 GEEEEEEEEEEEEEDNI 181
           GE EEEEE+E+ EED+I
Sbjct: 250 GESEEEEEDEDLEEDDI 266


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 27.7 bits (61), Expect = 4.4
 Identities = 6/18 (33%), Positives = 13/18 (72%)

Query: 167  EEEEEEEEEEEEDNIEQE 184
            + +E+E+EEE  D++  +
Sbjct: 3949 DRQEKEDEEEMSDDVGID 3966


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 27.4 bits (62), Expect = 4.5
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 152 QHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           +  R     +++  E EEEEEEEEEEE    + 
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257


>gnl|CDD|180707 PRK06811, PRK06811, RNA polymerase factor sigma-70; Validated.
          Length = 189

 Score = 27.2 bits (61), Expect = 4.6
 Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 4/34 (11%)

Query: 155 RKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           RK+ K +    E +  +E     E++IE E + +
Sbjct: 90  RKLTKNN----EIDSIDEFILISEESIENEIILK 119


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 27.3 bits (61), Expect = 5.0
 Identities = 11/29 (37%), Positives = 15/29 (51%)

Query: 156 KVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           K  K D      EE+EEE  EEE+   ++
Sbjct: 143 KKAKEDSDEELSEEDEEEAAEEEEAEAEK 171


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 27.3 bits (60), Expect = 5.2
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 146 QGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
           Q    +Q   + HK ++K  +EE+EEEE EEE    E +
Sbjct: 201 QLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQ 239


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 27.0 bits (60), Expect = 5.2
 Identities = 13/20 (65%), Positives = 14/20 (70%)

Query: 159 KVDKKGGEEEEEEEEEEEEE 178
            VD +  EEEEEE  EEEEE
Sbjct: 156 TVDLEPPEEEEEEIAEEEEE 175



 Score = 26.2 bits (58), Expect = 9.7
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
            EEEEEE  EEEE+  E E
Sbjct: 162 PEEEEEEIAEEEEEVKEPE 180


>gnl|CDD|237609 PRK14096, pgi, glucose-6-phosphate isomerase; Provisional.
          Length = 528

 Score = 27.5 bits (62), Expect = 5.2
 Identities = 12/25 (48%), Positives = 15/25 (60%), Gaps = 1/25 (4%)

Query: 21  YGAKQAADVHNHVQQIHMGV-NFVC 44
           YG K + D H +VQQ+  GV NF  
Sbjct: 342 YGNKGSTDQHAYVQQLRDGVDNFFV 366


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 27.3 bits (60), Expect = 5.3
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
            EEEEE E EE+E+  E+E
Sbjct: 140 LEEEEEVEMEEDEEYYEKE 158


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 27.0 bits (60), Expect = 5.4
 Identities = 15/39 (38%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLF 187
            L++ L K+ K ++   +E EEEE+  EEE   E+E L 
Sbjct: 146 ALLRELEKI-KKERAEEKEREEEEKAAEEEKAREEEILT 183


>gnl|CDD|178484 PLN02896, PLN02896, cinnamyl-alcohol dehydrogenase.
          Length = 353

 Score = 27.1 bits (60), Expect = 5.5
 Identities = 18/79 (22%), Positives = 32/79 (40%), Gaps = 8/79 (10%)

Query: 103 GEKPFRCALCAYSARRKT----HLDDHMRRHT--GEKPHACG--MCGYECAQGSQLMQHL 154
           G+      L A ++R  +    H++D    H    E+  A G  +C  +    S+L+ HL
Sbjct: 228 GDSKLFSILSAVNSRMGSIALVHIEDICDAHIFLMEQTKAEGRYICCVDSYDMSELINHL 287

Query: 155 RKVHKVDKKGGEEEEEEEE 173
            K +         +EE+  
Sbjct: 288 SKEYPCSNIQVRLDEEKRG 306


>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
          Length = 232

 Score = 27.1 bits (60), Expect = 5.6
 Identities = 10/18 (55%), Positives = 14/18 (77%)

Query: 167 EEEEEEEEEEEEDNIEQE 184
           E+E  EEEEEE+ ++E E
Sbjct: 70  EDEGGEEEEEEDADVETE 87


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 25.9 bits (57), Expect = 6.0
 Identities = 13/23 (56%), Positives = 18/23 (78%)

Query: 166 EEEEEEEEEEEEEDNIEQEFLFE 188
           EEEEEEEE++ E+ + E EF+ E
Sbjct: 12  EEEEEEEEDDLEDLSDEDEFIDE 34



 Score = 25.5 bits (56), Expect = 9.8
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 165 GEEEEEEEEEEEEEDNIEQE 184
            +EEEEEEEEE++ +++  E
Sbjct: 9   DDEEEEEEEEEDDLEDLSDE 28


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 27.2 bits (60), Expect = 6.2
 Identities = 10/15 (66%), Positives = 14/15 (93%)

Query: 166 EEEEEEEEEEEEEDN 180
           +E++EEEE EEEE+N
Sbjct: 336 QEQQEEEENEEEEEN 350



 Score = 26.5 bits (58), Expect = 9.2
 Identities = 10/15 (66%), Positives = 14/15 (93%)

Query: 166 EEEEEEEEEEEEEDN 180
           E++EEEE EEEEE++
Sbjct: 337 EQQEEEENEEEEEND 351


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 27.1 bits (61), Expect = 6.3
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query: 160 VDKKGGEEEEEEEEEEEEEDNIEQE 184
               G E EE +++E+EEE+  E +
Sbjct: 182 PAHVGSELEELDDDEDEEEEEDEND 206


>gnl|CDD|238703 cd01412, SIRT5_Af1_CobB, SIRT5_Af1_CobB: Eukaryotic, archaeal and
           prokaryotic group (class3) which includes human sirtuin
           SIRT5, Archaeoglobus fulgidus Sir2-Af1, and E. coli
           CobB; and are members of the SIR2 family of proteins,
           silent information regulator 2 (Sir2) enzymes which
           catalyze NAD+-dependent protein/histone deacetylation.
           Sir2 proteins have been shown to regulate gene
           silencing, DNA repair, metabolic enzymes, and life span.
           CobB is a bacterial sirtuin that deacetylates acetyl-CoA
           synthetase at an active site lysine to stimulate its
           enzymatic activity. .
          Length = 224

 Score = 26.8 bits (60), Expect = 6.3
 Identities = 8/32 (25%), Positives = 11/32 (34%), Gaps = 1/32 (3%)

Query: 44  CVHCKQ-FEVVPTRKTQTLEHCATCVDMVRPD 74
           C  C    E       + L  C  C  ++RP 
Sbjct: 112 CSSCGYVGENNEEIPEEELPRCPKCGGLLRPG 143


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 26.6 bits (59), Expect = 6.4
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 156 KVHKVDKKGGEEEEEEEEEEEEEDNIEQE 184
              K  KK  E+  EEE++ E +D+  +E
Sbjct: 64  TTKKSKKKDKEKLTEEEKKPESDDDKTEE 92


>gnl|CDD|237078 PRK12366, PRK12366, replication factor A; Reviewed.
          Length = 637

 Score = 27.3 bits (61), Expect = 6.4
 Identities = 9/22 (40%), Positives = 14/22 (63%)

Query: 160 VDKKGGEEEEEEEEEEEEEDNI 181
           V ++ G + E  +EEEEE+  I
Sbjct: 45  VAQELGIDIEYGQEEEEEDFKI 66


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 27.3 bits (61), Expect = 6.5
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQEF 185
           D  G  + EE+EEEEE+  + E E 
Sbjct: 312 DNFGLGQGEEDEEEEEDGVDDEDEE 336


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 27.1 bits (60), Expect = 6.7
 Identities = 11/47 (23%), Positives = 17/47 (36%)

Query: 142 YECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           Y+       +  L    + ++    E EE  E E+ E   E   L E
Sbjct: 376 YDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRE 422


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 26.4 bits (58), Expect = 6.9
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 162 KKGGEEEEEEEEEEEEEDNIEQE 184
            +G E + EEE+EE +E   E E
Sbjct: 25  DEGDESDAEEEDEEIQEALREAE 47


>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890).  This
           family is conserved in dsDNA adenoviruses of
           vertebrates. The function is not known.
          Length = 172

 Score = 26.4 bits (58), Expect = 7.0
 Identities = 9/18 (50%), Positives = 11/18 (61%)

Query: 166 EEEEEEEEEEEEEDNIEQ 183
           +EE+EE EE EEE     
Sbjct: 46  DEEDEEAEEVEEETAASS 63



 Score = 26.4 bits (58), Expect = 8.1
 Identities = 11/24 (45%), Positives = 18/24 (75%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQE 184
           D +  E EE+EEE E+ ED++++E
Sbjct: 25  DSQAEEVEEDEEEMEDWEDSLDEE 48


>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
           inhibitor of apoptosis (IAP)-associated factor (VIAF)
           subfamily; VIAF is a Phd-like protein that functions in
           caspase activation during apoptosis. It was identified
           as an IAP binding protein through a screen of a human
           B-cell library using a prototype IAP. VIAF lacks a
           consensus IAP binding motif and while it does not
           function as an IAP antagonist, it still plays a
           regulatory role in the complete activation of caspases.
           VIAF itself is a substrate for IAP-mediated
           ubiquitination, suggesting that it may be a target of
           IAPs in the prevention of cell death. The similarity of
           VIAF to Phd points to a potential role distinct from
           apoptosis regulation. Phd functions as a cytosolic
           regulator of G protein by specifically binding to G
           protein betagamma (Gbg)-subunits. The C-terminal domain
           of Phd adopts a thioredoxin fold, but it does not
           contain a CXXC motif. Phd interacts with G protein beta
           mostly through the N-terminal helical domain.
          Length = 192

 Score = 26.5 bits (59), Expect = 7.0
 Identities = 8/30 (26%), Positives = 13/30 (43%)

Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEEE 178
           Q         K+  +  EE +EEE++   E
Sbjct: 33  QEAHENALEKKLLDELDEELDEEEDDRFLE 62


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 26.8 bits (60), Expect = 7.2
 Identities = 7/20 (35%), Positives = 14/20 (70%)

Query: 161 DKKGGEEEEEEEEEEEEEDN 180
           +    ++E+E+E++EEE D 
Sbjct: 54  EDDDDDDEDEDEDDEEEADL 73



 Score = 26.4 bits (59), Expect = 9.5
 Identities = 6/23 (26%), Positives = 16/23 (69%)

Query: 161 DKKGGEEEEEEEEEEEEEDNIEQ 183
            +   +++E+E+E++EEE ++  
Sbjct: 53  LEDDDDDDEDEDEDDEEEADLGP 75


>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
          Length = 177

 Score = 26.4 bits (59), Expect = 7.3
 Identities = 9/20 (45%), Positives = 16/20 (80%)

Query: 166 EEEEEEEEEEEEEDNIEQEF 185
           +++EEE EEEE +D++ + F
Sbjct: 149 DDDEEESEEEESDDDLLRLF 168


>gnl|CDD|237731 PRK14494, PRK14494, putative molybdopterin-guanine dinucleotide
           biosynthesis protein MobB/FeS domain-containing protein
           protein; Provisional.
          Length = 229

 Score = 26.5 bits (59), Expect = 7.9
 Identities = 5/10 (50%), Positives = 6/10 (60%)

Query: 137 CGMCGYECAQ 146
           CG CG+ C  
Sbjct: 153 CGHCGFNCKG 162


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 26.9 bits (60), Expect = 8.0
 Identities = 7/31 (22%), Positives = 16/31 (51%)

Query: 158 HKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
              D    ++E+EE++E +E + +  +  F 
Sbjct: 158 EDDDDDDVDDEDEEKKEAKELEKLSDDDDFV 188


>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
           factor RRN3.  This family consists of several eukaryotic
           proteins which are homologous to the yeast RRN3 protein.
           RRN3 is one of the RRN genes specifically required for
           the transcription of rDNA by RNA polymerase I (Pol I) in
           Saccharomyces cerevisiae.
          Length = 554

 Score = 26.9 bits (60), Expect = 8.2
 Identities = 14/74 (18%), Positives = 31/74 (41%), Gaps = 6/74 (8%)

Query: 121 HLDDHMRRHTGEKPHACGMCGYECAQGSQLMQ----HLRK--VHKVDKKGGEEEEEEEEE 174
           H     +   G   +   +  Y    G  +++     L K  V   ++    ++EEEE  
Sbjct: 174 HKTKSKKVLVGYVRNLLRLTEYCPELGFDILELIIERLLKLDVEIQNELDDIDDEEEERV 233

Query: 175 EEEEDNIEQEFLFE 188
             +ED+ +++ +F+
Sbjct: 234 LADEDDDDEDDMFD 247


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 26.6 bits (59), Expect = 8.5
 Identities = 10/17 (58%), Positives = 12/17 (70%)

Query: 166 EEEEEEEEEEEEEDNIE 182
             EEEEEE EE E+ +E
Sbjct: 149 LPEEEEEEPEEMEEELE 165


>gnl|CDD|178497 PLN02909, PLN02909, Endoglucanase.
          Length = 486

 Score = 26.8 bits (59), Expect = 8.7
 Identities = 14/46 (30%), Positives = 21/46 (45%)

Query: 70  MVRPDASYTYMCYLCNYHTPTRKYMRTHIDTHNGEKPFRCALCAYS 115
           MV     + Y   L N      K+ + H  T++GE PF C+   Y+
Sbjct: 194 MVFRHVDHKYSRRLLNKAKLLFKFAKAHKGTYDGECPFYCSYSGYN 239


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 26.9 bits (59), Expect = 8.7
 Identities = 9/14 (64%), Positives = 10/14 (71%)

Query: 166 EEEEEEEEEEEEED 179
           EE E EEE E+ ED
Sbjct: 381 EESEHEEEVEDYED 394



 Score = 26.9 bits (59), Expect = 9.4
 Identities = 8/33 (24%), Positives = 13/33 (39%)

Query: 156 KVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
            V K+ K   E      E++ E    ++   FE
Sbjct: 47  SVRKIGKFDYELAVYRVEDDSEGGTGDEAENFE 79


>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
          Length = 586

 Score = 26.7 bits (59), Expect = 8.7
 Identities = 9/29 (31%), Positives = 18/29 (62%)

Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEE 177
           QL++   +  +   + G+E+EEE+ E+E 
Sbjct: 50  QLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 26.2 bits (58), Expect = 9.0
 Identities = 9/43 (20%), Positives = 27/43 (62%)

Query: 141 GYECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQ 183
           GY+ ++  ++++   K+ ++ KK   ++E++ ++ + ED  E+
Sbjct: 13  GYDPSEYKEVVEEYEKLEELRKKLKAKKEDDSDDTDYEDEDEE 55


>gnl|CDD|224459 COG1542, COG1542, Uncharacterized conserved protein [Function
           unknown].
          Length = 593

 Score = 26.8 bits (59), Expect = 9.7
 Identities = 10/33 (30%), Positives = 18/33 (54%)

Query: 154 LRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFL 186
           L KV +++++G   E +   EE +E  +E E  
Sbjct: 11  LLKVKELEERGEAIEVKALTEEFKERLLELELQ 43


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 24.0 bits (52), Expect = 9.7
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 107 FRCALCAYSARRKTHLDDHMRRH 129
           +RC  C    + K+ L +HMR H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23


>gnl|CDD|234635 PRK00104, scpA, segregation and condensation protein A; Reviewed.
          Length = 242

 Score = 26.3 bits (59), Expect = 9.8
 Identities = 8/17 (47%), Positives = 13/17 (76%)

Query: 167 EEEEEEEEEEEEDNIEQ 183
           EE EEEEE+  ++ +E+
Sbjct: 78  EEFEEEEEDPRQELLEK 94


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
           kDa subunit.  This family contains several DNA-directed
           RNA polymerase 19 kDa polypeptides. The Poxvirus
           DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
           DNA-template-directed extension of the 3'-end of an RNA
           strand by one nucleotide at a time.
          Length = 167

 Score = 26.2 bits (58), Expect = 9.8
 Identities = 9/19 (47%), Positives = 14/19 (73%)

Query: 166 EEEEEEEEEEEEEDNIEQE 184
           ++ EE EEEEE+E++ E  
Sbjct: 15  DDSEEYEEEEEDEEDAESL 33


>gnl|CDD|240275 PTZ00113, PTZ00113, proliferating cell nuclear antigen;
           Provisional.
          Length = 275

 Score = 26.3 bits (58), Expect = 9.9
 Identities = 18/56 (32%), Positives = 26/56 (46%), Gaps = 2/56 (3%)

Query: 133 KPHACGMCGYECAQGSQLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           K H  G   Y C +   L  ++  V KV K     +    + EE++DNI   F+FE
Sbjct: 51  KLHDSGFSHYRCDRERALGINIASVTKVFKLCSNNDSVLIQSEEDKDNI--NFVFE 104


>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
           modification].
          Length = 470

 Score = 26.5 bits (58), Expect = 9.9
 Identities = 20/100 (20%), Positives = 34/100 (34%), Gaps = 18/100 (18%)

Query: 101 HNGEKPFRCALCA---YSARRKTHLDDHMRRHTGEKPHACGMCGYECAQGS--------- 148
           H  +  F C +C    Y  R +        RH  E  H  G+                  
Sbjct: 369 HGLDIEFECEICGNYVYYGRDR------FDRHFEEDRHIYGLECLGIKPSRVFKGITRIG 422

Query: 149 QLMQHLRKVHKVDKKGGEEEEEEEEEEEEEDNIEQEFLFE 188
           + M+   ++ +         E  EE E+EE N+  + ++E
Sbjct: 423 EAMKLWNRMEESSSSLKVPTEYSEEFEDEEGNVMSKKVYE 462


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.134    0.434 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,278,775
Number of extensions: 795552
Number of successful extensions: 4955
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4136
Number of HSP's successfully gapped: 390
Length of query: 188
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 97
Effective length of database: 6,901,388
Effective search space: 669434636
Effective search space used: 669434636
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 56 (25.2 bits)