Query         047037
Match_columns 85
No_of_seqs    113 out of 654
Neff          6.8 
Searched_HMMs 46136
Date          Fri Mar 29 07:02:03 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/047037.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/047037hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF13975 gag-asp_proteas:  gag-  99.8 3.2E-18   7E-23  101.4   8.2   65   20-84      3-68  (72)
  2 cd05484 retropepsin_like_LTR_2  99.6 1.6E-15 3.5E-20   92.5   8.3   59   26-84      1-59  (91)
  3 PF08284 RVP_2:  Retroviral asp  99.6 1.2E-15 2.5E-20  100.2   7.6   65   20-85     16-81  (135)
  4 cd05479 RP_DDI RP_DDI; retrope  99.6 4.1E-15 8.8E-20   95.8   8.4   63   22-84     13-75  (124)
  5 TIGR02281 clan_AA_DTGA clan AA  99.5   4E-13 8.7E-18   86.6   9.8   61   21-81      7-68  (121)
  6 COG3577 Predicted aspartyl pro  99.4 3.7E-12   8E-17   89.0   8.6   65   11-79     95-160 (215)
  7 PF13650 Asp_protease_2:  Aspar  99.3 1.2E-11 2.6E-16   73.5   8.2   56   28-83      1-57  (90)
  8 cd05483 retropepsin_like_bacte  99.3 1.7E-11 3.8E-16   73.5   8.5   60   24-83      1-60  (96)
  9 cd06095 RP_RTVL_H_like Retrope  99.0 1.6E-09 3.6E-14   65.6   6.2   47   28-77      1-47  (86)
 10 TIGR03698 clan_AA_DTGF clan AA  98.9 1.2E-08 2.7E-13   64.4   6.7   44   34-79     14-58  (107)
 11 cd05481 retropepsin_like_LTR_1  98.8 4.5E-08 9.8E-13   60.5   6.7   55   30-84      3-61  (93)
 12 PF00077 RVP:  Retroviral aspar  98.7 2.7E-08 5.8E-13   60.9   4.9   54   27-83      7-60  (100)
 13 cd00303 retropepsin_like Retro  98.6 3.1E-07 6.6E-12   51.5   6.8   54   29-82      2-56  (92)
 14 PF09668 Asp_protease:  Asparty  98.6 1.4E-07   3E-12   61.5   5.6   40   22-61     21-60  (124)
 15 PF12384 Peptidase_A2B:  Ty3 tr  98.5 6.7E-07 1.5E-11   61.0   8.0   54   22-75     31-84  (177)
 16 cd05480 NRIP_C NRIP_C; putativ  98.2 2.1E-06 4.6E-11   54.3   4.5   33   29-61      2-34  (103)
 17 cd06094 RP_Saci_like RP_Saci_l  98.0 8.9E-06 1.9E-10   50.3   4.4   45   36-84      9-53  (89)
 18 PF05585 DUF1758:  Putative pep  98.0 7.7E-06 1.7E-10   54.4   4.4   39   35-73     11-49  (164)
 19 KOG0012 DNA damage inducible p  97.4 0.00017 3.6E-09   54.4   3.8   42   21-62    231-272 (380)
 20 COG5550 Predicted aspartyl pro  97.2   0.002 4.4E-08   42.1   6.4   55   24-80     11-69  (125)
 21 PF02160 Peptidase_A3:  Caulifl  97.1  0.0015 3.3E-08   45.8   5.5   57   27-85      8-68  (201)
 22 cd05482 HIV_retropepsin_like R  96.3   0.006 1.3E-07   37.4   3.3   27   29-55      2-28  (87)
 23 PF12382 Peptidase_A2E:  Retrot  91.1     0.2 4.4E-06   32.1   2.3   30   36-65     47-76  (137)
 24 PF00026 Asp:  Eukaryotic aspar  90.4    0.65 1.4E-05   32.7   4.6   27   26-52      2-30  (317)
 25 cd05470 pepsin_retropepsin_lik  89.5    0.52 1.1E-05   28.4   3.1   26   31-56      4-31  (109)
 26 cd05474 SAP_like SAPs, pepsin-  89.5     1.8 3.8E-05   30.5   6.2   26   26-51      3-30  (295)
 27 cd06096 Plasmepsin_5 Plasmepsi  86.6     1.2 2.7E-05   32.2   4.1   41   36-79    231-271 (326)
 28 cd05476 pepsin_A_like_plant Ch  86.6     1.7 3.6E-05   30.5   4.7   18   38-55    178-195 (265)
 29 cd06097 Aspergillopepsin_like   86.4    0.98 2.1E-05   31.8   3.4   27   27-53      2-30  (278)
 30 cd06096 Plasmepsin_5 Plasmepsi  86.0     1.7 3.7E-05   31.5   4.5   30   24-53      2-33  (326)
 31 cd05475 nucellin_like Nucellin  85.7     1.5 3.2E-05   31.0   4.0   27   25-51      2-30  (273)
 32 cd05476 pepsin_A_like_plant Ch  85.2     2.7 5.9E-05   29.4   5.2   27   26-52      2-30  (265)
 33 cd05487 renin_like Renin stimu  85.0       2 4.3E-05   31.1   4.5   30   23-52      6-37  (326)
 34 cd06098 phytepsin Phytepsin, a  84.6     2.3   5E-05   30.6   4.7   31   23-53      8-40  (317)
 35 COG0282 ackA Acetate kinase [E  84.4    0.51 1.1E-05   36.3   1.2   37   44-80    179-215 (396)
 36 cd05478 pepsin_A Pepsin A, asp  83.2     2.7 5.8E-05   30.2   4.5   31   23-53      8-40  (317)
 37 cd06097 Aspergillopepsin_like   82.4    0.99 2.1E-05   31.8   2.0   23   36-58    198-220 (278)
 38 PTZ00147 plasmepsin-1; Provisi  82.0     2.9 6.2E-05   32.4   4.5   31   24-54    138-170 (453)
 39 cd06396 PB1_NBR1 The PB1 domai  81.9     5.9 0.00013   24.0   5.0   47   26-77      1-49  (81)
 40 cd05490 Cathepsin_D2 Cathepsin  81.8       3 6.6E-05   30.0   4.3   29   24-52      5-35  (325)
 41 PTZ00013 plasmepsin 4 (PM4); P  81.6     3.1 6.7E-05   32.2   4.5   32   24-55    137-170 (450)
 42 cd05472 cnd41_like Chloroplast  81.4     1.9 4.1E-05   30.6   3.1   21   38-58    173-193 (299)
 43 cd05477 gastricsin Gastricsins  80.8     3.5 7.6E-05   29.6   4.4   29   25-53      3-33  (318)
 44 cd05477 gastricsin Gastricsins  80.6     2.6 5.6E-05   30.3   3.6   28   31-58    189-223 (318)
 45 cd05473 beta_secretase_like Be  80.6     2.7 5.8E-05   30.9   3.8   28   26-53      4-33  (364)
 46 cd06098 phytepsin Phytepsin, a  79.4     3.1 6.8E-05   30.0   3.7   30   30-59    197-233 (317)
 47 cd05486 Cathespin_E Cathepsin   79.3       2 4.3E-05   30.9   2.7   28   31-58    187-220 (316)
 48 cd05485 Cathepsin_D_like Cathe  78.9     2.8   6E-05   30.5   3.4   21   38-58    212-232 (329)
 49 cd05485 Cathepsin_D_like Cathe  78.9     4.5 9.7E-05   29.4   4.4   31   23-53      9-41  (329)
 50 cd05478 pepsin_A Pepsin A, asp  78.8     3.2   7E-05   29.8   3.7   29   30-58    194-228 (317)
 51 cd05486 Cathespin_E Cathepsin   78.8     2.8 6.2E-05   30.1   3.4   26   28-53      3-30  (316)
 52 cd05488 Proteinase_A_fungi Fun  78.4       4 8.7E-05   29.4   4.1   30   24-53      9-40  (320)
 53 cd05472 cnd41_like Chloroplast  78.2     2.6 5.6E-05   30.0   3.0   26   26-51      2-29  (299)
 54 cd05473 beta_secretase_like Be  74.5     4.1 8.8E-05   30.0   3.2   20   38-57    213-232 (364)
 55 PF14543 TAXi_N:  Xylanase inhi  74.0     6.5 0.00014   25.9   3.9   25   27-51      2-28  (164)
 56 cd05471 pepsin_like Pepsin-lik  73.5     5.1 0.00011   27.5   3.4   22   35-56     12-33  (283)
 57 cd05471 pepsin_like Pepsin-lik  72.9     3.7 8.1E-05   28.2   2.6   27   33-59    199-225 (283)
 58 cd05490 Cathepsin_D2 Cathepsin  70.3     3.4 7.3E-05   29.7   2.0   22   37-58    207-228 (325)
 59 cd05488 Proteinase_A_fungi Fun  70.2     3.4 7.3E-05   29.8   2.0   22   37-58    206-227 (320)
 60 PLN03146 aspartyl protease fam  69.8     8.2 0.00018   29.5   4.0   30   23-52     82-113 (431)
 61 PTZ00165 aspartyl protease; Pr  69.5     9.4  0.0002   29.8   4.4   34   23-56    118-153 (482)
 62 cd05474 SAP_like SAPs, pepsin-  66.8       5 0.00011   28.1   2.2   24   35-58    177-200 (295)
 63 cd05487 renin_like Renin stimu  62.3     6.2 0.00013   28.5   2.0   21   38-58    209-229 (326)
 64 PF13793 Pribosyltran_N:  N-ter  60.8     8.8 0.00019   24.3   2.3   36   40-77      2-37  (116)
 65 PF00026 Asp:  Eukaryotic aspar  58.8      11 0.00023   26.5   2.7   23   36-58    199-221 (317)
 66 PTZ00147 plasmepsin-1; Provisi  58.2      19 0.00041   28.0   4.1   23   36-58    332-354 (453)
 67 PF14541 TAXi_C:  Xylanase inhi  57.1     9.6 0.00021   24.8   2.1   19   37-55     30-48  (161)
 68 COG2861 Uncharacterized protei  56.6     9.9 0.00021   27.7   2.2   29   37-65    151-179 (250)
 69 PF01568 Molydop_binding:  Moly  55.6      14 0.00029   22.2   2.5   28   47-76     31-58  (110)
 70 PTZ00013 plasmepsin 4 (PM4); P  54.6     9.5  0.0002   29.6   2.0   22   37-58    332-353 (450)
 71 COG2336 MazE Growth regulator   54.1      33 0.00073   20.9   3.9   31   47-79     13-43  (82)
 72 cd05489 xylanase_inhibitor_I_l  53.0      19 0.00041   26.8   3.3   20   38-57    231-250 (362)
 73 cd02778 MopB_CT_Thiosulfate-R-  52.6      37  0.0008   20.7   4.2   32   45-78     29-60  (123)
 74 PF07629 DUF1590:  Protein of u  52.6      12 0.00026   18.5   1.5   21    1-21      1-21  (32)
 75 PTZ00165 aspartyl protease; Pr  52.5      21 0.00045   27.9   3.6   22   37-58    328-349 (482)
 76 PF08922 DUF1905:  Domain of un  52.1      48   0.001   19.5   4.5   46   24-71     31-79  (80)
 77 PF07894 DUF1669:  Protein of u  51.6      24 0.00052   26.1   3.6   47   32-80    172-218 (284)
 78 COG2093 DNA-directed RNA polym  50.9      20 0.00044   20.8   2.5   28   37-70     36-63  (64)
 79 PF03419 Peptidase_U4:  Sporula  50.4      20 0.00044   25.8   3.1   25   22-46    154-180 (293)
 80 cd02790 MopB_CT_Formate-Dh_H F  50.4      24 0.00052   21.2   3.0   30   46-77     35-64  (116)
 81 cd02788 MopB_CT_NDH-1_NuoG2-N7  50.0      28 0.00062   20.7   3.3   29   47-77     30-58  (96)
 82 PLN03146 aspartyl protease fam  49.9      14  0.0003   28.3   2.2   18   38-55    309-326 (431)
 83 cd02792 MopB_CT_Formate-Dh-Na-  49.2      30 0.00064   21.1   3.3   30   46-77     35-64  (122)
 84 cd02779 MopB_CT_Arsenite-Ox Th  48.8      57  0.0012   19.9   4.6   30   45-76     32-61  (115)
 85 cd01813 UBP_N UBP ubiquitin pr  48.5      29 0.00063   20.0   3.0   39   26-64      1-39  (74)
 86 cd02787 MopB_CT_ydeP The MopB_  48.2      30 0.00066   21.0   3.2   31   46-78     31-61  (112)
 87 cd02785 MopB_CT_4 The MopB_CT_  48.1      33 0.00072   21.2   3.5   29   47-77     33-61  (124)
 88 PRK12440 acetate kinase; Revie  48.1      11 0.00024   29.1   1.4   38   44-81    180-217 (397)
 89 KOG1339 Aspartyl protease [Pos  46.4      18 0.00039   27.1   2.3   32   24-55     45-78  (398)
 90 cd02794 MopB_CT_DmsA-EC The Mo  46.3      37  0.0008   20.9   3.4   29   47-77     31-59  (121)
 91 KOG3048 Molecular chaperone Pr  45.9      17 0.00037   24.5   1.9   28   22-51     69-96  (153)
 92 cd02781 MopB_CT_Acetylene-hydr  45.8      35 0.00076   21.1   3.3   30   46-77     33-62  (130)
 93 cd02775 MopB_CT Molybdopterin-  45.3      38 0.00082   19.6   3.3   30   46-77     23-52  (101)
 94 cd06407 PB1_NLP A PB1 domain i  45.0      66  0.0014   19.1   5.1   47   26-74      1-47  (82)
 95 cd02789 MopB_CT_FmdC-FwdD The   44.8      41  0.0009   20.5   3.5   31   44-76     29-59  (106)
 96 TIGR02854 spore_II_GA sigma-E   44.6      45 0.00097   24.3   4.1   25   22-46    155-181 (288)
 97 cd00508 MopB_CT_Fdh-Nap-like T  43.8      38 0.00082   20.4   3.2   30   46-77     35-64  (120)
 98 cd02786 MopB_CT_3 The MopB_CT_  42.9      40 0.00086   20.4   3.2   30   46-77     31-60  (116)
 99 PF12812 PDZ_1:  PDZ-like domai  42.3      63  0.0014   19.0   3.8   35   38-75      6-40  (78)
100 PF00455 DeoRC:  DeoR C termina  41.9     7.7 0.00017   25.6  -0.2   42   38-79     22-74  (161)
101 PF03539 Spuma_A9PTase:  Spumav  41.9      33 0.00072   23.4   2.8   42   32-77      1-42  (163)
102 KOG3101 Esterase D [General fu  41.8      12 0.00025   27.4   0.6   25   31-55    210-234 (283)
103 cd02791 MopB_CT_Nitrate-R-NapA  41.4      42 0.00091   20.4   3.1   30   46-77     35-64  (122)
104 cd02784 MopB_CT_PHLH The MopB_  40.6      89  0.0019   20.3   4.7   33   46-81     38-70  (137)
105 KOG3838 Mannose lectin ERGIC-5  40.5   1E+02  0.0023   24.3   5.6   71    5-76    147-240 (497)
106 PF09379 FERM_N:  FERM N-termin  40.2      41 0.00089   18.9   2.8   45   31-75      3-48  (80)
107 TIGR00016 ackA acetate kinase.  40.1      15 0.00034   28.3   1.2   37   44-80    185-221 (404)
108 cd02780 MopB_CT_Tetrathionate_  40.1      53  0.0011   20.9   3.5   30   46-77     30-59  (143)
109 PRK05654 acetyl-CoA carboxylas  39.8      86  0.0019   23.1   5.0   54   22-75    108-166 (292)
110 cd02777 MopB_CT_DMSOR-like The  38.4      48   0.001   20.5   3.1   30   46-77     34-63  (127)
111 PRK12379 propionate/acetate ki  37.9      13 0.00029   28.6   0.5   37   44-80    176-212 (396)
112 TIGR00515 accD acetyl-CoA carb  37.6   1E+02  0.0022   22.7   5.0   54   22-75    107-165 (285)
113 PF02645 DegV:  Uncharacterised  37.4      36 0.00078   24.3   2.6   32   37-74      2-33  (280)
114 cd02782 MopB_CT_1 The MopB_CT_  37.3      59  0.0013   20.1   3.4   31   45-77     32-62  (129)
115 cd04459 Rho_CSD Rho_CSD: Rho p  36.9      36 0.00079   19.8   2.2   21   43-63     24-44  (68)
116 cd01812 BAG1_N Ubiquitin-like   36.6      60  0.0013   17.7   3.0   37   27-63      2-38  (71)
117 PF04083 Abhydro_lipase:  Parti  35.9      50  0.0011   18.7   2.6   26   52-80      2-27  (63)
118 PF13188 PAS_8:  PAS domain; PD  35.7      57  0.0012   17.2   2.8   24   37-61     13-36  (64)
119 PF05618 Zn_protease:  Putative  35.3      39 0.00085   22.1   2.4   22   35-56     15-36  (138)
120 PF04748 Polysacc_deac_2:  Dive  35.0      24 0.00053   24.6   1.4   27   38-64    120-146 (213)
121 PRK04923 ribose-phosphate pyro  34.8      40 0.00086   25.0   2.6   38   39-78      7-44  (319)
122 PRK12397 propionate kinase; Re  34.7      15 0.00032   28.5   0.3   37   44-80    180-216 (404)
123 cd02793 MopB_CT_DMSOR-BSOR-TMA  34.6      58  0.0013   20.3   3.0   31   45-77     32-62  (129)
124 KOG1339 Aspartyl protease [Pos  34.6      34 0.00073   25.6   2.2   21   37-57    269-289 (398)
125 CHL00174 accD acetyl-CoA carbo  34.6 1.1E+02  0.0024   22.7   4.9   54   22-75    120-178 (296)
126 PRK00934 ribose-phosphate pyro  34.3      30 0.00064   25.1   1.8   33   43-77      4-36  (285)
127 PRK08351 DNA-directed RNA poly  33.7      51  0.0011   18.9   2.4   28   37-70     33-60  (61)
128 cd00248 Mth938-like Mth938-lik  33.4      43 0.00094   20.8   2.3   28   36-63     53-83  (109)
129 PF00240 ubiquitin:  Ubiquitin   32.4      45 0.00098   18.2   2.0   32   32-63      3-34  (69)
130 PF00564 PB1:  PB1 domain;  Int  30.6 1.1E+02  0.0023   17.3   5.5   51   25-78      1-54  (84)
131 PRK00180 acetate kinase A/prop  30.5      28  0.0006   26.9   1.2   37   44-80    181-217 (402)
132 cd02783 MopB_CT_2 The MopB_CT_  30.2      83  0.0018   20.5   3.3   30   46-77     32-61  (156)
133 PF07285 DUF1444:  Protein of u  30.0      78  0.0017   22.8   3.4   39   25-63    105-143 (265)
134 COG3865 Uncharacterized protei  29.7 1.4E+02  0.0031   20.2   4.3   25   24-49     49-73  (151)
135 PRK10681 DNA-binding transcrip  29.1      15 0.00032   26.0  -0.4   40   38-77     95-145 (252)
136 PRK02812 ribose-phosphate pyro  29.1      57  0.0012   24.3   2.6   37   40-78     23-59  (330)
137 cd02511 Beta4Glucosyltransfera  29.1      56  0.0012   22.1   2.4   24   37-62     28-51  (229)
138 PRK07199 phosphoribosylpyropho  28.9      49  0.0011   24.3   2.2   37   40-78      4-40  (301)
139 cd05992 PB1 The PB1 domain is   28.6 1.1E+02  0.0025   17.0   4.4   36   27-62      2-38  (81)
140 COG0777 AccD Acetyl-CoA carbox  28.4 1.1E+02  0.0024   22.8   4.0   54   23-76    110-168 (294)
141 cd01398 RPI_A RPI_A: Ribose 5-  27.9      18 0.00038   25.2  -0.2   22   37-58     17-38  (213)
142 PRK12608 transcription termina  27.8      58  0.0013   25.0   2.5   26   37-62    279-319 (380)
143 cd05560 Xcc1710_like Xcc1710_l  27.4      66  0.0014   20.0   2.3   23   36-58     53-75  (109)
144 smart00295 B41 Band 4.1 homolo  27.0   1E+02  0.0022   20.1   3.3   43   30-72      9-51  (207)
145 PRK00702 ribose-5-phosphate is  26.9      23  0.0005   25.0   0.2   21   38-58     23-43  (220)
146 COG3188 FimD P pilus assembly   26.6 2.1E+02  0.0045   24.3   5.6   55   10-64     46-107 (835)
147 cd02776 MopB_CT_Nitrate-R-NarG  26.4 1.1E+02  0.0023   19.8   3.3   32   45-78     30-61  (141)
148 PF00871 Acetate_kinase:  Aceto  26.2      26 0.00055   26.8   0.3   36   45-80    179-214 (388)
149 KOG0679 Actin-related protein   26.1      36 0.00078   26.6   1.1   19   31-49    149-167 (426)
150 PRK02458 ribose-phosphate pyro  25.9      71  0.0015   23.7   2.6   38   39-78     10-47  (323)
151 COG0462 PrsA Phosphoribosylpyr  25.3      57  0.0012   24.5   2.0   36   42-79      8-43  (314)
152 COG0231 Efp Translation elonga  25.3 1.2E+02  0.0026   19.6   3.4   51   29-79     64-124 (131)
153 PRK01259 ribose-phosphate pyro  24.8      61  0.0013   23.8   2.1   36   41-78      3-38  (309)
154 PF00567 TUDOR:  Tudor domain;   24.6      39 0.00084   19.8   0.9   41   33-74     79-121 (121)
155 PF11976 Rad60-SLD:  Ubiquitin-  24.5 1.3E+02  0.0029   16.4   4.9   49   27-79      4-52  (72)
156 KOG3238 Chloride ion current i  24.4      57  0.0012   23.2   1.8   28    8-35     59-89  (216)
157 PRK11347 antitoxin ChpS; Provi  24.0 1.7E+02  0.0036   17.5   3.6   31   47-79     13-43  (83)
158 PRK13846 putative glycerol-3-p  23.8      81  0.0017   23.7   2.6   40   16-57    120-159 (316)
159 TIGR01699 XAPA xanthosine phos  23.6 2.1E+02  0.0045   20.5   4.6   38   10-49     21-63  (248)
160 PF06820 Phage_fiber_C:  Putati  23.5      51  0.0011   19.0   1.2   28   48-78     31-58  (64)
161 COG1107 Archaea-specific RecJ-  23.2      53  0.0011   27.1   1.6   29   34-62    417-446 (715)
162 PRK00553 ribose-phosphate pyro  23.1      84  0.0018   23.4   2.6   36   41-78     12-47  (332)
163 PTZ00145 phosphoribosylpyropho  23.0      83  0.0018   24.7   2.6   38   39-78    120-157 (439)
164 PRK09798 antitoxin MazE; Provi  23.0 1.8E+02  0.0038   17.3   3.6   30   48-79     15-44  (82)
165 COG1237 Metal-dependent hydrol  22.9 1.4E+02   0.003   21.9   3.5   34   29-64     24-58  (259)
166 KOG0318 WD40 repeat stress pro  22.5 1.6E+02  0.0036   24.0   4.1   39   37-75    130-168 (603)
167 PRK10434 srlR DNA-bindng trans  22.4      20 0.00044   25.4  -0.8   23   38-60     94-116 (256)
168 PRK10906 DNA-binding transcrip  22.3      22 0.00049   25.2  -0.6   25   38-62     94-118 (252)
169 PF11925 DUF3443:  Protein of u  22.0 1.2E+02  0.0026   23.4   3.2   27   27-53    245-272 (370)
170 cd01742 GATase1_GMP_Synthase T  21.8      47   0.001   21.7   0.9   26   39-64      2-27  (181)
171 TIGR01251 ribP_PPkin ribose-ph  21.7      74  0.0016   23.2   2.0   33   43-77      5-37  (308)
172 PF04430 DUF498:  Protein of un  21.7      92   0.002   19.1   2.2   24   35-58     53-76  (110)
173 PF13856 Gifsy-2:  ATP-binding   21.4 1.3E+02  0.0029   17.9   2.8   24   26-49     20-43  (95)
174 PF01150 GDA1_CD39:  GDA1/CD39   21.4      60  0.0013   24.6   1.5   14   35-48      7-20  (434)
175 PF07076 DUF1344:  Protein of u  21.3 1.8E+02  0.0039   16.7   3.3   45    7-51     14-58  (61)
176 PRK02269 ribose-phosphate pyro  21.2      97  0.0021   22.9   2.6   37   40-78      7-43  (320)
177 TIGR02707 butyr_kinase butyrat  21.0      78  0.0017   23.7   2.1   35   44-78    153-187 (351)
178 PF07532 Big_4:  Bacterial Ig-l  21.0      98  0.0021   16.7   2.0   14   21-34     45-58  (59)
179 PRK06393 rpoE DNA-directed RNA  20.9 1.7E+02  0.0037   16.9   3.0   29   37-71     35-63  (64)
180 PRK07058 acetate kinase; Provi  20.8      59  0.0013   25.2   1.4   36   44-80    181-216 (396)
181 COG4067 Uncharacterized protei  20.7      72  0.0016   21.8   1.6   24   36-59     39-62  (162)
182 cd06404 PB1_aPKC PB1 domain is  20.6 2.2E+02  0.0047   17.3   3.8   47   26-74      1-47  (83)
183 PRK13668 hypothetical protein;  20.3 1.5E+02  0.0032   21.8   3.3   38   26-63    106-143 (267)
184 smart00666 PB1 PB1 domain. Pho  20.2 1.8E+02  0.0039   16.3   5.4   37   26-62      2-38  (81)
185 TIGR03778 VPDSG_CTERM VPDSG-CT  20.0      32 0.00069   16.5  -0.1   15   42-56      3-17  (26)

No 1  
>PF13975 gag-asp_proteas:  gag-polyprotein putative aspartyl protease
Probab=99.77  E-value=3.2e-18  Score=101.41  Aligned_cols=65  Identities=28%  Similarity=0.356  Sum_probs=61.0

Q ss_pred             cCcCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCC-cEEEEeCCCCEEEEeeEe
Q 047037           20 DQVLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANN-RLEIMVAFGEKLMSSGKC   84 (85)
Q Consensus        20 ~~~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~-~~~V~~anG~~~~~~g~c   84 (85)
                      ...++.|++.+.|+|+.+.+|||||||||||+.++|+|||++..+.. +.+|++|||....+.|..
T Consensus         3 ~~~~g~~~v~~~I~g~~~~alvDtGat~~fis~~~a~rLgl~~~~~~~~~~v~~a~g~~~~~~g~~   68 (72)
T PF13975_consen    3 TPDPGLMYVPVSIGGVQVKALVDTGATHNFISESLAKRLGLPLEKPPSPIRVKLANGSVIEIRGVA   68 (72)
T ss_pred             cccCCEEEEEEEECCEEEEEEEeCCCcceecCHHHHHHhCCCcccCCCCEEEEECCCCccccceEE
Confidence            45689999999999999999999999999999999999999999975 999999999999998875


No 2  
>cd05484 retropepsin_like_LTR_2 Retropepsins_like_LTR, pepsin-like aspartate proteases. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classif
Probab=99.64  E-value=1.6e-15  Score=92.52  Aligned_cols=59  Identities=20%  Similarity=0.305  Sum_probs=55.0

Q ss_pred             EEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEEeeEe
Q 047037           26 MRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMSSGKC   84 (85)
Q Consensus        26 ~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~~g~c   84 (85)
                      +++++.|||+++.+||||||+++||+++++++++++.......++++|||+.+.+.|+|
T Consensus         1 ~~~~~~Ing~~i~~lvDTGA~~svis~~~~~~lg~~~~~~~~~~v~~a~G~~~~~~G~~   59 (91)
T cd05484           1 KTVTLLVNGKPLKFQLDTGSAITVISEKTWRKLGSPPLKPTKKRLRTATGTKLSVLGQI   59 (91)
T ss_pred             CEEEEEECCEEEEEEEcCCcceEEeCHHHHHHhCCCccccccEEEEecCCCEeeEeEEE
Confidence            36889999999999999999999999999999999985568899999999999999986


No 3  
>PF08284 RVP_2:  Retroviral aspartyl protease;  InterPro: IPR013242 This region defines single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein; usually pol, more rarely gag. Retroviral proteases appear to be homologous to a single domain of the two-domain eukaryotic aspartyl proteases. 
Probab=99.63  E-value=1.2e-15  Score=100.20  Aligned_cols=65  Identities=22%  Similarity=0.341  Sum_probs=57.6

Q ss_pred             cCcCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCC-CcEEEEeCCCCEEEEeeEeC
Q 047037           20 DQVLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNAN-NRLEIMVAFGEKLMSSGKCT   85 (85)
Q Consensus        20 ~~~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~-~~~~V~~anG~~~~~~g~c~   85 (85)
                      ...+..|.....|+++++.+|||||||||||++++|+++++++++. .++.|.. +|..+.|.+.|+
T Consensus        16 ~~~~~vi~g~~~I~~~~~~vLiDSGAThsFIs~~~a~~~~l~~~~l~~~~~V~~-~g~~~~~~~~~~   81 (135)
T PF08284_consen   16 EESPDVITGTFLINSIPASVLIDSGATHSFISSSFAKKLGLPLEPLPRPIVVSA-PGGSINCEGVCP   81 (135)
T ss_pred             cCCCCeEEEEEEeccEEEEEEEecCCCcEEccHHHHHhcCCEEEEccCeeEEec-ccccccccceee
Confidence            3578999999999999999999999999999999999999999995 7777775 566788888774


No 4  
>cd05479 RP_DDI RP_DDI; retropepsin-like domain of DNA damage inducible protein. The family represents the retropepsin-like domain of DNA damage inducible protein. DNA damage inducible protein has a retropepsin-like domain and an amino-terminal ubiquitin-like domain and/or a UBA (ubiquitin-associated) domain. This CD represents the retropepsin-like domain of DDI.
Probab=99.61  E-value=4.1e-15  Score=95.84  Aligned_cols=63  Identities=17%  Similarity=0.208  Sum_probs=55.0

Q ss_pred             cCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEEeeEe
Q 047037           22 VLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMSSGKC   84 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~~g~c   84 (85)
                      ...+|++.++|||+++.+|||||||||||++++|+|+|++.....++.++++++....+.++|
T Consensus        13 ~~~~~~v~~~Ing~~~~~LvDTGAs~s~Is~~~a~~lgl~~~~~~~~~~~~~g~g~~~~~g~~   75 (124)
T cd05479          13 KVPMLYINVEINGVPVKAFVDSGAQMTIMSKACAEKCGLMRLIDKRFQGIAKGVGTQKILGRI   75 (124)
T ss_pred             eeeEEEEEEEECCEEEEEEEeCCCceEEeCHHHHHHcCCccccCcceEEEEecCCCcEEEeEE
Confidence            357899999999999999999999999999999999999987667788777775556777776


No 5  
>TIGR02281 clan_AA_DTGA clan AA aspartic protease, TIGR02281 family. This family consists of predicted aspartic proteases, typically from 180 to 230 amino acids in length, in MEROPS clan AA. This model describes the well-conserved 121-residue C-terminal region. The poorly conserved, variable length N-terminal region usually contains a predicted transmembrane helix. Sequences in the seed alignment and those scoring above the trusted cutoff are Proteobacterial; homologs scroing between trusted and noise are found in Pyrobaculum aerophilum str. IM2 (archaeal), Pirellula sp. (Planctomycetes), and Nostoc sp. PCC 7120 (Cyanobacteria).
Probab=99.49  E-value=4e-13  Score=86.55  Aligned_cols=61  Identities=26%  Similarity=0.368  Sum_probs=54.8

Q ss_pred             CcCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccC-CCcEEEEeCCCCEEEEe
Q 047037           21 QVLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNA-NNRLEIMVAFGEKLMSS   81 (85)
Q Consensus        21 ~~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~-~~~~~V~~anG~~~~~~   81 (85)
                      ...++|++.++|||+++.||||||||+++|++++|+++|++... ..+..+.+|||......
T Consensus         7 ~~~g~~~v~~~InG~~~~flVDTGAs~t~is~~~A~~Lgl~~~~~~~~~~~~ta~G~~~~~~   68 (121)
T TIGR02281         7 DGDGHFYATGRVNGRNVRFLVDTGATSVALNEEDAQRLGLDLNRLGYTVTVSTANGQIKAAR   68 (121)
T ss_pred             cCCCeEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCcccCCceEEEEeCCCcEEEEE
Confidence            46799999999999999999999999999999999999998866 47899999999876543


No 6  
>COG3577 Predicted aspartyl protease [General function prediction only]
Probab=99.37  E-value=3.7e-12  Score=89.01  Aligned_cols=65  Identities=26%  Similarity=0.390  Sum_probs=58.3

Q ss_pred             eeEEEeccccCcCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCC-CcEEEEeCCCCEEE
Q 047037           11 KISLHDVARDQVLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNAN-NRLEIMVAFGEKLM   79 (85)
Q Consensus        11 ~iSl~A~~g~~~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~-~~~~V~~anG~~~~   79 (85)
                      ++++++    +..+||.+.|.|||+.|.+|||||||-..++++.|+|+|+.+... .++.|.+|||+...
T Consensus        95 ~v~Lak----~~~GHF~a~~~VNGk~v~fLVDTGATsVal~~~dA~RlGid~~~l~y~~~v~TANG~~~A  160 (215)
T COG3577          95 EVSLAK----SRDGHFEANGRVNGKKVDFLVDTGATSVALNEEDARRLGIDLNSLDYTITVSTANGRARA  160 (215)
T ss_pred             EEEEEe----cCCCcEEEEEEECCEEEEEEEecCcceeecCHHHHHHhCCCccccCCceEEEccCCcccc
Confidence            455544    457999999999999999999999999999999999999999985 99999999998654


No 7  
>PF13650 Asp_protease_2:  Aspartyl protease
Probab=99.34  E-value=1.2e-11  Score=73.55  Aligned_cols=56  Identities=25%  Similarity=0.439  Sum_probs=48.9

Q ss_pred             EEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCC-CcEEEEeCCCCEEEEeeE
Q 047037           28 IQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNAN-NRLEIMVAFGEKLMSSGK   83 (85)
Q Consensus        28 ~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~-~~~~V~~anG~~~~~~g~   83 (85)
                      +.+.|||+++.+||||||++++|++++|++++++..+. .+..+..+||+.....+.
T Consensus         1 V~v~vng~~~~~liDTGa~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~   57 (90)
T PF13650_consen    1 VPVKVNGKPVRFLIDTGASISVISRSLAKKLGLKPRPKSVPISVSGAGGSVTVYRGR   57 (90)
T ss_pred             CEEEECCEEEEEEEcCCCCcEEECHHHHHHcCCCCcCCceeEEEEeCCCCEEEEEEE
Confidence            46889999999999999999999999999999998876 378999999996554443


No 8  
>cd05483 retropepsin_like_bacteria Bacterial aspartate proteases, retropepsin-like protease family. This family of bacteria aspartate proteases is a subfamily of retropepsin-like protease family, which includes enzymes from retrovirus and retrotransposons. While fungal and mammalian pepsin-like aspartate proteases are bilobal proteins with structurally related N- and C-termini, this family of bacteria aspartate proteases is half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate proteases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=99.32  E-value=1.7e-11  Score=73.49  Aligned_cols=60  Identities=22%  Similarity=0.383  Sum_probs=53.0

Q ss_pred             CeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEEeeE
Q 047037           24 ETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMSSGK   83 (85)
Q Consensus        24 ~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~~g~   83 (85)
                      +++.+.+.||++++.+||||||++++|+.++|+++++......+.+++.++|........
T Consensus         1 ~~~~v~v~i~~~~~~~llDTGa~~s~i~~~~~~~l~~~~~~~~~~~~~~~~G~~~~~~~~   60 (96)
T cd05483           1 GHFVVPVTINGQPVRFLLDTGASTTVISEELAERLGLPLTLGGKVTVQTANGRVRAARVR   60 (96)
T ss_pred             CcEEEEEEECCEEEEEEEECCCCcEEcCHHHHHHcCCCccCCCcEEEEecCCCccceEEE
Confidence            478999999999999999999999999999999999855556889999999988776553


No 9  
>cd06095 RP_RTVL_H_like Retropepsin of the RTVL_H family of human endogenous retrovirus-like elements. This family includes aspartate proteases from retroelements with LTR (long terminal repeats) including the RTVL_H family of human endogenous retrovirus-like elements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where 
Probab=98.99  E-value=1.6e-09  Score=65.58  Aligned_cols=47  Identities=19%  Similarity=0.422  Sum_probs=41.6

Q ss_pred             EEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           28 IQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        28 ~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      +...|||+++.+||||||+++.|++++|+++   .....+..++.++|..
T Consensus         1 ~~v~InG~~~~fLvDTGA~~tii~~~~a~~~---~~~~~~~~v~gagG~~   47 (86)
T cd06095           1 VTITVEGVPIVFLVDTGATHSVLKSDLGPKQ---ELSTTSVLIRGVSGQS   47 (86)
T ss_pred             CEEEECCEEEEEEEECCCCeEEECHHHhhhc---cCCCCcEEEEeCCCcc
Confidence            3578999999999999999999999999998   2334899999999985


No 10 
>TIGR03698 clan_AA_DTGF clan AA aspartic protease, AF_0612 family. Members of this protein family are clan AA aspartic proteases, related to family TIGR02281. These proteins resemble retropepsins, pepsin-like proteases of retroviruses such as HIV. Members of this family are found in archaea and bacteria.
Probab=98.86  E-value=1.2e-08  Score=64.40  Aligned_cols=44  Identities=16%  Similarity=0.292  Sum_probs=38.6

Q ss_pred             CEEEEEEEeCCCCcee-eCHHHHHhcCCcccCCCcEEEEeCCCCEEE
Q 047037           34 GKSVVILIDSGSTHNS-INAEVAQRVNLSPNANNRLEIMVAFGEKLM   79 (85)
Q Consensus        34 g~~v~~LiDSGat~~F-i~~~~a~~l~l~~~~~~~~~V~~anG~~~~   79 (85)
                      ..++.+|||||||+.+ |++++|+++|++..  ...++++|||....
T Consensus        14 ~~~v~~LVDTGat~~~~l~~~~a~~lgl~~~--~~~~~~tA~G~~~~   58 (107)
T TIGR03698        14 FMEVRALVDTGFSGFLLVPPDIVNKLGLPEL--DQRRVYLADGREVL   58 (107)
T ss_pred             ceEEEEEEECCCCeEEecCHHHHHHcCCCcc--cCcEEEecCCcEEE
Confidence            4589999999999998 99999999999875  57799999997544


No 11 
>cd05481 retropepsin_like_LTR_1 Retropepsins_like_LTR; pepsin-like aspartate protease from retrotransposons with long terminal repeats. Retropepsin of retrotransposons with long terminal repeats are pepsin-like aspartate proteases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identifi
Probab=98.75  E-value=4.5e-08  Score=60.48  Aligned_cols=55  Identities=18%  Similarity=0.213  Sum_probs=48.3

Q ss_pred             EEECC-EEEEEEEeCCCCceeeCHHHHHhcC---CcccCCCcEEEEeCCCCEEEEeeEe
Q 047037           30 GSIKG-KSVVILIDSGSTHNSINAEVAQRVN---LSPNANNRLEIMVAFGEKLMSSGKC   84 (85)
Q Consensus        30 g~i~g-~~v~~LiDSGat~~Fi~~~~a~~l~---l~~~~~~~~~V~~anG~~~~~~g~c   84 (85)
                      -.|+| +++.+++||||+.|.|+.++.++++   .+.-...+++++.+||..+...|.+
T Consensus         3 ~~i~g~~~v~~~vDtGA~vnllp~~~~~~l~~~~~~~L~~t~~~L~~~~g~~~~~~G~~   61 (93)
T cd05481           3 MKINGKQSVKFQLDTGATCNVLPLRWLKSLTPDKDPELRPSPVRLTAYGGSTIPVEGGV   61 (93)
T ss_pred             eEeCCceeEEEEEecCCEEEeccHHHHhhhccCCCCcCccCCeEEEeeCCCEeeeeEEE
Confidence            35788 9999999999999999999999998   4555567899999999999999854


No 12 
>PF00077 RVP:  Retroviral aspartyl protease The Prosite entry also includes Pfam:PF00026;  InterPro: IPR018061 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to the MEROPS peptidase family A2 (retropepsin family, clan AA), subfamily A2A. The family includes the single domain aspartic proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). Retroviral aspartyl protease is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins.; PDB: 3D3T_B 3SQF_A 1NSO_A 2HB3_A 2HS2_A 2HS1_B 3K4V_A 3GGV_C 1HTG_B 2FDE_A ....
Probab=98.72  E-value=2.7e-08  Score=60.90  Aligned_cols=54  Identities=20%  Similarity=0.273  Sum_probs=46.4

Q ss_pred             EEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEEeeE
Q 047037           27 RIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMSSGK   83 (85)
Q Consensus        27 ~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~~g~   83 (85)
                      ++...++|+++.+||||||+.++|+++.+.....+  ......+..++|.. ...+.
T Consensus         7 ~i~v~i~g~~i~~LlDTGA~vsiI~~~~~~~~~~~--~~~~~~v~~~~g~~-~~~~~   60 (100)
T PF00077_consen    7 YITVKINGKKIKALLDTGADVSIISEKDWKKLGPP--PKTSITVRGAGGSS-SILGS   60 (100)
T ss_dssp             EEEEEETTEEEEEEEETTBSSEEESSGGSSSTSSE--EEEEEEEEETTEEE-EEEEE
T ss_pred             eEEEeECCEEEEEEEecCCCcceeccccccccccc--ccCCceeccCCCcc-eeeeE
Confidence            56789999999999999999999999999888765  45788999999987 66554


No 13 
>cd00303 retropepsin_like Retropepsins; pepsin-like aspartate proteases. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements, as well as eukaryotic dna-damage-inducible proteins (DDIs), and bacterial aspartate peptidases. While fungal and mammalian pepsins are bilobal proteins with structurally related N and C-terminals, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples
Probab=98.61  E-value=3.1e-07  Score=51.51  Aligned_cols=54  Identities=30%  Similarity=0.439  Sum_probs=44.8

Q ss_pred             EEEECCEEEEEEEeCCCCceeeCHHHHHhcCC-cccCCCcEEEEeCCCCEEEEee
Q 047037           29 QGSIKGKSVVILIDSGSTHNSINAEVAQRVNL-SPNANNRLEIMVAFGEKLMSSG   82 (85)
Q Consensus        29 ~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l-~~~~~~~~~V~~anG~~~~~~g   82 (85)
                      ...++|+++.+|+|+||++++++..+++++++ +.....+..+..++|......+
T Consensus         2 ~~~~~~~~~~~liDtgs~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   56 (92)
T cd00303           2 KGKINGVPVRALVDSGASVNFISESLAKKLGLPPRLLPTPLKVKGANGSSVKTLG   56 (92)
T ss_pred             EEEECCEEEEEEEcCCCcccccCHHHHHHcCCCcccCCCceEEEecCCCEeccCc
Confidence            46788999999999999999999999999988 4444688888998887655443


No 14 
>PF09668 Asp_protease:  Aspartyl protease;  InterPro: IPR019103 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure.  This family of eukaryotic aspartyl proteases have a fold similar to retroviral proteases which implies they function proteolytically during regulated protein turnover []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 3S8I_A 2I1A_B.
Probab=98.59  E-value=1.4e-07  Score=61.53  Aligned_cols=40  Identities=23%  Similarity=0.395  Sum_probs=32.4

Q ss_pred             cCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCc
Q 047037           22 VLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLS   61 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~   61 (85)
                      ....+++...|||+++.++|||||-+|.|+.++|+|+||.
T Consensus        21 ~v~mLyI~~~ing~~vkA~VDtGAQ~tims~~~a~r~gL~   60 (124)
T PF09668_consen   21 QVSMLYINCKINGVPVKAFVDTGAQSTIMSKSCAERCGLM   60 (124)
T ss_dssp             -----EEEEEETTEEEEEEEETT-SS-EEEHHHHHHTTGG
T ss_pred             CcceEEEEEEECCEEEEEEEeCCCCccccCHHHHHHcCCh
Confidence            4578999999999999999999999999999999999986


No 15 
>PF12384 Peptidase_A2B:  Ty3 transposon peptidase;  InterPro: IPR024650 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Ty3 is a gypsy-type, retrovirus-like, element found in the budding yeast. The Ty3 aspartyl protease is required for processing of the viral polyprotein into its mature species [].
Probab=98.54  E-value=6.7e-07  Score=61.02  Aligned_cols=54  Identities=22%  Similarity=0.326  Sum_probs=49.9

Q ss_pred             cCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCC
Q 047037           22 VLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFG   75 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG   75 (85)
                      -.++..+.-.++|.++.+|+||||-.+||+..++++|+|+.-..+|++++.+-+
T Consensus        31 vg~T~~v~l~~~~t~i~vLfDSGSPTSfIr~di~~kL~L~~~~app~~fRG~vs   84 (177)
T PF12384_consen   31 VGKTAIVQLNCKGTPIKVLFDSGSPTSFIRSDIVEKLELPTHDAPPFRFRGFVS   84 (177)
T ss_pred             cCcEEEEEEeecCcEEEEEEeCCCccceeehhhHHhhCCccccCCCEEEeeecc
Confidence            358999999999999999999999999999999999999999999999887653


No 16 
>cd05480 NRIP_C NRIP_C; putative nuclear receptor interacting protein. Proteins in this family have been described as probable nuclear receptor interacting proteins. The  C-terminal domain of this family is homologous to the retroviral aspartyl protease domain. The domain is structurally related to one lobe of the pepsin molecule. The conserved active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This group of aspartate peptidases is classified by MEROPS as the peptidase family A2 (retropepsin family, clan AA), subfamily A2A.
Probab=98.24  E-value=2.1e-06  Score=54.26  Aligned_cols=33  Identities=30%  Similarity=0.439  Sum_probs=31.2

Q ss_pred             EEEECCEEEEEEEeCCCCceeeCHHHHHhcCCc
Q 047037           29 QGSIKGKSVVILIDSGSTHNSINAEVAQRVNLS   61 (85)
Q Consensus        29 ~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~   61 (85)
                      ...+||+++.++|||||-+|.||+..|+|+||.
T Consensus         2 nCk~nG~~vkAfVDsGaQ~timS~~caercgL~   34 (103)
T cd05480           2 SCQCAGKELRALVDTGCQYNLISAACLDRLGLK   34 (103)
T ss_pred             ceeECCEEEEEEEecCCchhhcCHHHHHHcChH
Confidence            457999999999999999999999999999987


No 17 
>cd06094 RP_Saci_like RP_Saci_like, retropepsin family. Retropepsin on retrotransposons with long terminal repeats (LTR) including Saci-1, -2 and -3 of Schistosoma mansoni. Retropepsins are related to fungal and mammalian pepsins. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified
Probab=98.05  E-value=8.9e-06  Score=50.35  Aligned_cols=45  Identities=18%  Similarity=0.304  Sum_probs=38.1

Q ss_pred             EEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEEeeEe
Q 047037           36 SVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMSSGKC   84 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~~g~c   84 (85)
                      .+.+||||||.++.|+....++.    ....++.++.|||..+..-|..
T Consensus         9 ~~~fLVDTGA~vSviP~~~~~~~----~~~~~~~l~AANgt~I~tyG~~   53 (89)
T cd06094           9 GLRFLVDTGAAVSVLPASSTKKS----LKPSPLTLQAANGTPIATYGTR   53 (89)
T ss_pred             CcEEEEeCCCceEeecccccccc----ccCCceEEEeCCCCeEeeeeeE
Confidence            47899999999999999988864    3346889999999999987753


No 18 
>PF05585 DUF1758:  Putative peptidase (DUF1758);  InterPro: IPR008737  This is a family of nematode proteins of unknown function []. However, it seems likely that these proteins act as aspartic peptidases. 
Probab=98.04  E-value=7.7e-06  Score=54.44  Aligned_cols=39  Identities=28%  Similarity=0.329  Sum_probs=32.7

Q ss_pred             EEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeC
Q 047037           35 KSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVA   73 (85)
Q Consensus        35 ~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~a   73 (85)
                      +.+.+|+||||..+||.+++|++|+|+......+.+.+.
T Consensus        11 ~~~~~LlDsGSq~SfIt~~la~~L~L~~~~~~~~~~~~~   49 (164)
T PF05585_consen   11 VEARALLDSGSQRSFITESLANKLNLPGTGEKILVIGTF   49 (164)
T ss_pred             EEEEEEEecCCchhHHhHHHHHHhCCCCCCceEEEEecc
Confidence            468889999999999999999999999887665555544


No 19 
>KOG0012 consensus DNA damage inducible protein [Replication, recombination and repair]
Probab=97.43  E-value=0.00017  Score=54.36  Aligned_cols=42  Identities=26%  Similarity=0.411  Sum_probs=38.0

Q ss_pred             CcCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcc
Q 047037           21 QVLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSP   62 (85)
Q Consensus        21 ~~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~   62 (85)
                      ..-..+++...|||++|.++|||||-.+-||...|+|+||.-
T Consensus       231 ~~v~ML~iN~~ing~~VKAfVDsGaq~timS~~Caer~gL~r  272 (380)
T KOG0012|consen  231 TQVTMLYINCEINGVPVKAFVDSGAQTTIMSAACAERCGLNR  272 (380)
T ss_pred             ccceEEEEEEEECCEEEEEEEcccchhhhhhHHHHHHhChHH
Confidence            345678999999999999999999999999999999999753


No 20 
>COG5550 Predicted aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=97.19  E-value=0.002  Score=42.10  Aligned_cols=55  Identities=22%  Similarity=0.211  Sum_probs=39.5

Q ss_pred             CeEEEEEEE---CCEEEEEEEeCCCC-ceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           24 ETMRIQGSI---KGKSVVILIDSGST-HNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        24 ~~~~~~g~i---~g~~v~~LiDSGat-~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      .++.++...   +.+....|||||+| ...|++++|++++++..  ...++..|+|..+..
T Consensus        11 ~~v~~~f~~~~~Gd~~~~~LiDTGFtg~lvlp~~vaek~~~~~~--~~~~~~~a~~~~v~t   69 (125)
T COG5550          11 VTVPVTFRLPGQGDFVYDELIDTGFTGYLVLPPQVAEKLGLPLF--STIRIVLADGGVVKT   69 (125)
T ss_pred             eeEEEEEEecCCCcEEeeeEEecCCceeEEeCHHHHHhcCCCcc--CChhhhhhcCCEEEE
Confidence            445555554   33445559999999 88999999999998887  445566667666554


No 21 
>PF02160 Peptidase_A3:  Cauliflower mosaic virus peptidase (A3);  InterPro: IPR000588 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of sequences contain an aspartic peptidase signature that belongs to MEROPS peptidase family A3, subfamily A3A (cauliflower mosaic virus-type endopeptidase, clan AA). Cauliflower mosaic virus belongs to the Retro-transcribing viruses, which have a double-stranded DNA genome. The genome includes an open reading frame (ORF V) that shows similarities to the pol gene of retroviruses. This ORF codes for a polyprotein that includes a reverse transcriptase, which, on the basis of a DTG triplet near the N terminus, was suggested to include an aspartic protease. The presence of an aspartic protease has been confirmed by mutational studies, implicating Asp-45 in catalysis. The protease releases itself from the polyprotein and is involved in reactions required to process the ORF IV polyprotein, which includes the viral coat protein []. The viral aspartic peptidase signature has also been found associated with a polyprotein encoded by integrated pararetrovirus-like sequences in the genome of Nicotiana tabacum (Common tobacco) []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis
Probab=97.09  E-value=0.0015  Score=45.77  Aligned_cols=57  Identities=14%  Similarity=0.162  Sum_probs=39.4

Q ss_pred             EEEEEECC---EEEEEEEeCCCCceeeCHHHHHhcCCcccC-CCcEEEEeCCCCEEEEeeEeC
Q 047037           27 RIQGSIKG---KSVVILIDSGSTHNSINAEVAQRVNLSPNA-NNRLEIMVAFGEKLMSSGKCT   85 (85)
Q Consensus        27 ~~~g~i~g---~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~-~~~~~V~~anG~~~~~~g~c~   85 (85)
                      +++-.+.|   ..+..+||||||.+.++....-.--  .+. ..++.|+.|||....-.-+|+
T Consensus         8 ~~~i~~~gy~~~~~~~~vDTGAt~C~~~~~iiP~e~--we~~~~~i~v~~an~~~~~i~~~~~   68 (201)
T PF02160_consen    8 KVKISFPGYKKFNYHCYVDTGATICCASKKIIPEEY--WEKSKKPIKVKGANGSIIQINKKAK   68 (201)
T ss_pred             EEEEEEcCceeEEEEEEEeCCCceEEecCCcCCHHH--HHhCCCcEEEEEecCCceEEEEEec
Confidence            33444444   4577899999999998887662221  122 378999999999877766663


No 22 
>cd05482 HIV_retropepsin_like Retropepsins, pepsin-like aspartate proteases. This is a subfamily of retropepsins. The family includes pepsin-like aspartate proteases from retroviruses, retrotransposons and retroelements. While fungal and mammalian pepsins are bilobal proteins with structurally related N- and C-termini, retropepsins are half as long as their fungal and mammalian counterparts. The monomers are structurally related to one lobe of the pepsin molecule and retropepsins function as homodimers. The active site aspartate occurs within a motif (Asp-Thr/Ser-Gly), as it does in pepsin. Retroviral aspartyl protease is synthesized as part of the POL polyprotein that contains an aspartyl protease, a reverse transcriptase, RNase H, and an integrase. The POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. In aspartate peptidases, Asp residues are ligands of an activated water molecule in all examples where catalytic residues have been identified. This gro
Probab=96.26  E-value=0.006  Score=37.41  Aligned_cols=27  Identities=26%  Similarity=0.197  Sum_probs=23.5

Q ss_pred             EEEECCEEEEEEEeCCCCceeeCHHHH
Q 047037           29 QGSIKGKSVVILIDSGSTHNSINAEVA   55 (85)
Q Consensus        29 ~g~i~g~~v~~LiDSGat~~Fi~~~~a   55 (85)
                      +-.|+|+.+.+|+||||-++.|++..-
T Consensus         2 ~~~i~g~~~~~llDTGAd~Tvi~~~~~   28 (87)
T cd05482           2 TLYINGKLFEGLLDTGADVSIIAENDW   28 (87)
T ss_pred             EEEECCEEEEEEEccCCCCeEEccccc
Confidence            457899999999999999999997543


No 23 
>PF12382 Peptidase_A2E:  Retrotransposon peptidase;  InterPro: IPR024648 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  This entry represents a small family of fungal retroviral aspartyl peptidases.
Probab=91.10  E-value=0.2  Score=32.14  Aligned_cols=30  Identities=30%  Similarity=0.376  Sum_probs=27.0

Q ss_pred             EEEEEEeCCCCceeeCHHHHHhcCCcccCC
Q 047037           36 SVVILIDSGSTHNSINAEVAQRVNLSPNAN   65 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~   65 (85)
                      .+.-|||+||-.|.|.++.++...||..+.
T Consensus        47 sipclidtgaq~niiteetvrahklptrpw   76 (137)
T PF12382_consen   47 SIPCLIDTGAQVNIITEETVRAHKLPTRPW   76 (137)
T ss_pred             cceeEEccCceeeeeehhhhhhccCCCCcc
Confidence            367799999999999999999999998874


No 24 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=90.37  E-value=0.65  Score=32.67  Aligned_cols=27  Identities=26%  Similarity=0.394  Sum_probs=23.6

Q ss_pred             EEEEEEEC--CEEEEEEEeCCCCceeeCH
Q 047037           26 MRIQGSIK--GKSVVILIDSGSTHNSINA   52 (85)
Q Consensus        26 ~~~~g~i~--g~~v~~LiDSGat~~Fi~~   52 (85)
                      |.+...|+  ++++.++|||||+..|+..
T Consensus         2 Y~~~v~iGtp~q~~~~~iDTGS~~~wv~~   30 (317)
T PF00026_consen    2 YYINVTIGTPPQTFRVLIDTGSSDTWVPS   30 (317)
T ss_dssp             EEEEEEETTTTEEEEEEEETTBSSEEEEB
T ss_pred             eEEEEEECCCCeEEEEEEecccceeeece
Confidence            56677887  8999999999999999984


No 25 
>cd05470 pepsin_retropepsin_like Cellular and retroviral pepsin-like aspartate proteases. This family includes both cellular and retroviral pepsin-like aspartate proteases. The cellular pepsin and pepsin-like enzymes are twice as long as their retroviral counterparts. The cellular pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, rennin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (rennin, cathepsin D and E, pepsin) or commercially (chymosin) important. The eukaryotic pepsin-like proteases contain two domains possessing similar topological features. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except in the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The eukaryotic pepsin-like proteases have two active site 
Probab=89.50  E-value=0.52  Score=28.43  Aligned_cols=26  Identities=23%  Similarity=0.336  Sum_probs=21.0

Q ss_pred             EECC--EEEEEEEeCCCCceeeCHHHHH
Q 047037           31 SIKG--KSVVILIDSGSTHNSINAEVAQ   56 (85)
Q Consensus        31 ~i~g--~~v~~LiDSGat~~Fi~~~~a~   56 (85)
                      .|+.  +++.+++||||+..++...-..
T Consensus         4 ~vGtP~q~~~~~~DTGSs~~Wv~~~~c~   31 (109)
T cd05470           4 GIGTPPQTFNVLLDTGSSNLWVPSVDCQ   31 (109)
T ss_pred             EeCCCCceEEEEEeCCCCCEEEeCCCCC
Confidence            4554  8899999999999999876543


No 26 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=89.45  E-value=1.8  Score=30.46  Aligned_cols=26  Identities=23%  Similarity=0.346  Sum_probs=22.7

Q ss_pred             EEEEEEECC--EEEEEEEeCCCCceeeC
Q 047037           26 MRIQGSIKG--KSVVILIDSGSTHNSIN   51 (85)
Q Consensus        26 ~~~~g~i~g--~~v~~LiDSGat~~Fi~   51 (85)
                      |.+...|+.  +++.+++||||+..+|.
T Consensus         3 Y~~~i~iGtp~q~~~v~~DTgS~~~wv~   30 (295)
T cd05474           3 YSAELSVGTPPQKVTVLLDTGSSDLWVP   30 (295)
T ss_pred             EEEEEEECCCCcEEEEEEeCCCCcceee
Confidence            556677777  89999999999999999


No 27 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=86.64  E-value=1.2  Score=32.18  Aligned_cols=41  Identities=22%  Similarity=0.280  Sum_probs=31.2

Q ss_pred             EEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEE
Q 047037           36 SVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLM   79 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~   79 (85)
                      ...++||||.|..+++.++.+++.-.+   +.+.+...+|..+.
T Consensus       231 ~~~aivDSGTs~~~lp~~~~~~l~~~~---P~i~~~f~~g~~~~  271 (326)
T cd06096         231 GLGMLVDSGSTLSHFPEDLYNKINNFF---PTITIIFENNLKID  271 (326)
T ss_pred             CCCEEEeCCCCcccCCHHHHHHHHhhc---CcEEEEEcCCcEEE
Confidence            455899999999999999998874333   66777777665554


No 28 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=86.60  E-value=1.7  Score=30.47  Aligned_cols=18  Identities=28%  Similarity=0.300  Sum_probs=16.2

Q ss_pred             EEEEeCCCCceeeCHHHH
Q 047037           38 VILIDSGSTHNSINAEVA   55 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a   55 (85)
                      .++||||+|..++++.+.
T Consensus       178 ~ai~DTGTs~~~lp~~~~  195 (265)
T cd05476         178 GTIIDSGTTLTYLPDPAY  195 (265)
T ss_pred             cEEEeCCCcceEcCcccc
Confidence            389999999999998887


No 29 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=86.36  E-value=0.98  Score=31.84  Aligned_cols=27  Identities=15%  Similarity=0.146  Sum_probs=21.9

Q ss_pred             EEEEEECC--EEEEEEEeCCCCceeeCHH
Q 047037           27 RIQGSIKG--KSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        27 ~~~g~i~g--~~v~~LiDSGat~~Fi~~~   53 (85)
                      .+.-.|+.  +++.+++||||+..|+...
T Consensus         2 ~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~   30 (278)
T cd06097           2 LTPVKIGTPPQTLNLDLDTGSSDLWVFSS   30 (278)
T ss_pred             eeeEEECCCCcEEEEEEeCCCCceeEeeC
Confidence            34556666  8999999999999999765


No 30 
>cd06096 Plasmepsin_5 Plasmepsins are a class of aspartic proteinases produced by the plasmodium parasite. The family contains a group of aspartic proteinases homologous to plasmepsin 5.  Plasmepsins are a class of at least 10 enzymes produced by the plasmodium parasite. Through their haemoglobin-degrading activity, they are an important cause of symptoms in malaria sufferers. This family of enzymes is a potential target for anti-malarial drugs. Plasmepsins are aspartic acid proteases, which means their active site contains two aspartic acid residues. These two aspartic acid residue act respectively as proton donor and proton acceptor, catalyzing the hydrolysis of peptide bond in proteins. Aspartic proteinases are composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal l
Probab=85.98  E-value=1.7  Score=31.46  Aligned_cols=30  Identities=20%  Similarity=0.195  Sum_probs=24.4

Q ss_pred             CeEEEEEEEC--CEEEEEEEeCCCCceeeCHH
Q 047037           24 ETMRIQGSIK--GKSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        24 ~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~~   53 (85)
                      +-|.+.-.|+  .+++.+++||||+..+++-.
T Consensus         2 ~~Y~~~i~vGtP~Q~~~v~~DTGS~~~wv~~~   33 (326)
T cd06096           2 AYYFIDIFIGNPPQKQSLILDTGSSSLSFPCS   33 (326)
T ss_pred             ceEEEEEEecCCCeEEEEEEeCCCCceEEecC
Confidence            3466677777  68999999999999999764


No 31 
>cd05475 nucellin_like Nucellins, plant aspartic proteases specifically expressed in nucellar cells during degradation. Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. This degradation is a characteristic of programmed cell death. Nucellins are plant aspartic proteases specifically expressed in nucellar cells during degradation. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region, and two other regions nearly identical to two regions of plant aspartic proteases. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. Although the three-dimensional structures of the two lobes are very similar, the amino acid sequences are more d
Probab=85.69  E-value=1.5  Score=31.02  Aligned_cols=27  Identities=22%  Similarity=0.201  Sum_probs=22.4

Q ss_pred             eEEEEEEEC--CEEEEEEEeCCCCceeeC
Q 047037           25 TMRIQGSIK--GKSVVILIDSGSTHNSIN   51 (85)
Q Consensus        25 ~~~~~g~i~--g~~v~~LiDSGat~~Fi~   51 (85)
                      .|.+.-.|+  .+.+.+++||||++.+|.
T Consensus         2 ~Y~~~i~iGtP~q~~~v~~DTGS~~~Wv~   30 (273)
T cd05475           2 YYYVTINIGNPPKPYFLDIDTGSDLTWLQ   30 (273)
T ss_pred             ceEEEEEcCCCCeeEEEEEccCCCceEEe
Confidence            356666776  788999999999999995


No 32 
>cd05476 pepsin_A_like_plant Chroloplast Nucleoids DNA-binding Protease and Nucellin, pepsin-like aspartic proteases from plants. This family contains pepsin like aspartic proteases from plants including Chloroplast Nucleoids DNA-binding Protease and Nucellin. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco and Nucellins are important regulators of nucellar cell's progressive degradation after ovule fertilization. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  The enzymes specifically cleave bonds in peptides which 
Probab=85.17  E-value=2.7  Score=29.39  Aligned_cols=27  Identities=19%  Similarity=0.253  Sum_probs=21.3

Q ss_pred             EEEEEEEC--CEEEEEEEeCCCCceeeCH
Q 047037           26 MRIQGSIK--GKSVVILIDSGSTHNSINA   52 (85)
Q Consensus        26 ~~~~g~i~--g~~v~~LiDSGat~~Fi~~   52 (85)
                      |.+.-.|+  .+++.+++||||++.++..
T Consensus         2 Y~~~i~iGtP~q~~~v~~DTGSs~~wv~~   30 (265)
T cd05476           2 YLVTLSIGTPPQPFSLIVDTGSDLTWTQC   30 (265)
T ss_pred             eEEEEecCCCCcceEEEecCCCCCEEEcC
Confidence            34455566  6789999999999999974


No 33 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=85.01  E-value=2  Score=31.06  Aligned_cols=30  Identities=13%  Similarity=0.117  Sum_probs=25.3

Q ss_pred             CCeEEEEEEEC--CEEEEEEEeCCCCceeeCH
Q 047037           23 LETMRIQGSIK--GKSVVILIDSGSTHNSINA   52 (85)
Q Consensus        23 ~~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~   52 (85)
                      ...+.+.-.|+  .+.+.+++||||+..+|..
T Consensus         6 ~~~y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~   37 (326)
T cd05487           6 DTQYYGEIGIGTPPQTFKVVFDTGSSNLWVPS   37 (326)
T ss_pred             CCeEEEEEEECCCCcEEEEEEeCCccceEEcc
Confidence            35667777787  7999999999999999975


No 34 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=84.58  E-value=2.3  Score=30.65  Aligned_cols=31  Identities=13%  Similarity=0.115  Sum_probs=26.0

Q ss_pred             CCeEEEEEEEC--CEEEEEEEeCCCCceeeCHH
Q 047037           23 LETMRIQGSIK--GKSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        23 ~~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~~   53 (85)
                      ...|.+.-.|+  .+++.+++||||+..++...
T Consensus         8 ~~~Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~   40 (317)
T cd06098           8 DAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSS   40 (317)
T ss_pred             CCEEEEEEEECCCCeEEEEEECCCccceEEecC
Confidence            35677788887  78999999999999999764


No 35 
>COG0282 ackA Acetate kinase [Energy production and conversion]
Probab=84.38  E-value=0.51  Score=36.25  Aligned_cols=37  Identities=14%  Similarity=0.246  Sum_probs=34.1

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      |-||.|++.+.|+.||-|++..+-+...++||.+++.
T Consensus       179 GtSh~YVs~~aa~~L~k~~~~l~~I~~HLGNGASicA  215 (396)
T COG0282         179 GTSHKYVSQRAAEILGKPLEDLNLITCHLGNGASICA  215 (396)
T ss_pred             ccchHHHHHHHHHHhCCCccccCEEEEEecCchhhhh
Confidence            7799999999999999999999999999999987754


No 36 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=83.18  E-value=2.7  Score=30.23  Aligned_cols=31  Identities=16%  Similarity=0.090  Sum_probs=25.8

Q ss_pred             CCeEEEEEEEC--CEEEEEEEeCCCCceeeCHH
Q 047037           23 LETMRIQGSIK--GKSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        23 ~~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~~   53 (85)
                      ...|.+.-.|+  .+++.++|||||+..++...
T Consensus         8 ~~~Y~~~i~vGtp~q~~~v~~DTGS~~~wv~~~   40 (317)
T cd05478           8 DMEYYGTISIGTPPQDFTVIFDTGSSNLWVPSV   40 (317)
T ss_pred             CCEEEEEEEeCCCCcEEEEEEeCCCccEEEecC
Confidence            45677777887  68899999999999999754


No 37 
>cd06097 Aspergillopepsin_like Aspergillopepsin_like, aspartic proteases of fungal origin. The members of this family are aspartic proteases of fungal origin, including aspergillopepsin, rhizopuspepsin, endothiapepsin, and rodosporapepsin. The various fungal species in this family may be the most economically important genus of fungi. They may serve as virulence factors or as industrial aids. For example, Aspergillopepsin from A. fumigatus is involved in invasive aspergillosis owing to its elastolytic activity and Aspergillopepsins from the mold A. saitoi are used in fermentation industry. Aspartic proteinases are a group of proteolytic enzymes in which the scissile peptide bond is attacked by a nucleophilic water molecule activated by two aspartic residues in a DT(S)G motif at the active site. They have a similar fold composed of two beta-barrel domains. Between the N-terminal and C-terminal domains, each of which contributes one catalytic aspartic residue, there is an extended active-
Probab=82.44  E-value=0.99  Score=31.82  Aligned_cols=23  Identities=13%  Similarity=0.226  Sum_probs=19.5

Q ss_pred             EEEEEEeCCCCceeeCHHHHHhc
Q 047037           36 SVVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ...++||||++..+++..+++++
T Consensus       198 ~~~~iiDSGTs~~~lP~~~~~~l  220 (278)
T cd06097         198 GFSAIADTGTTLILLPDAIVEAY  220 (278)
T ss_pred             CceEEeecCCchhcCCHHHHHHH
Confidence            45689999999999998877665


No 38 
>PTZ00147 plasmepsin-1; Provisional
Probab=81.98  E-value=2.9  Score=32.40  Aligned_cols=31  Identities=13%  Similarity=0.154  Sum_probs=26.5

Q ss_pred             CeEEEEEEEC--CEEEEEEEeCCCCceeeCHHH
Q 047037           24 ETMRIQGSIK--GKSVVILIDSGSTHNSINAEV   54 (85)
Q Consensus        24 ~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~~~   54 (85)
                      ..+.....|+  .+++.+++||||+..++...-
T Consensus       138 ~~Y~~~I~IGTP~Q~f~Vi~DTGSsdlWVps~~  170 (453)
T PTZ00147        138 VMSYGEAKLGDNGQKFNFIFDTGSANLWVPSIK  170 (453)
T ss_pred             CEEEEEEEECCCCeEEEEEEeCCCCcEEEeecC
Confidence            5677788888  899999999999999997543


No 39 
>cd06396 PB1_NBR1 The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
Probab=81.94  E-value=5.9  Score=24.01  Aligned_cols=47  Identities=21%  Similarity=0.294  Sum_probs=37.5

Q ss_pred             EEEEEEECCEEEEEEEeCCC--CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           26 MRIQGSIKGKSVVILIDSGS--THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        26 ~~~~g~i~g~~v~~LiDSGa--t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ++++...+|..+.+.++..+  +..=+...++++++++     ++.++-.|-+-
T Consensus         1 V~vKaty~~d~~rf~~~~~~~~~~~~L~~ev~~rf~l~-----~f~lKYlDde~   49 (81)
T cd06396           1 VNLKVTYNGESQSFLVSDSENTTWASVEAMVKVSFGLN-----DIQIKYVDEEN   49 (81)
T ss_pred             CEEEEEECCeEEEEEecCCCCCCHHHHHHHHHHHhCCC-----cceeEEEcCCC
Confidence            36889999999999999955  6666788999999988     66666666443


No 40 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=81.78  E-value=3  Score=29.99  Aligned_cols=29  Identities=14%  Similarity=0.187  Sum_probs=24.0

Q ss_pred             CeEEEEEEEC--CEEEEEEEeCCCCceeeCH
Q 047037           24 ETMRIQGSIK--GKSVVILIDSGSTHNSINA   52 (85)
Q Consensus        24 ~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~   52 (85)
                      ..|.+.-.|+  .+++.+++||||+..++..
T Consensus         5 ~~Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~~   35 (325)
T cd05490           5 AQYYGEIGIGTPPQTFTVVFDTGSSNLWVPS   35 (325)
T ss_pred             CEEEEEEEECCCCcEEEEEEeCCCccEEEEc
Confidence            4567777777  4889999999999999964


No 41 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=81.60  E-value=3.1  Score=32.23  Aligned_cols=32  Identities=13%  Similarity=0.260  Sum_probs=26.0

Q ss_pred             CeEEEEEEEC--CEEEEEEEeCCCCceeeCHHHH
Q 047037           24 ETMRIQGSIK--GKSVVILIDSGSTHNSINAEVA   55 (85)
Q Consensus        24 ~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~~~a   55 (85)
                      ..+.....|+  ++++.+++||||+..++...-.
T Consensus       137 ~~Yy~~i~IGTP~Q~f~vi~DTGSsdlWV~s~~C  170 (450)
T PTZ00013        137 IMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKC  170 (450)
T ss_pred             CEEEEEEEECCCCeEEEEEEeCCCCceEEecccC
Confidence            4566677887  8999999999999999976543


No 42 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=81.41  E-value=1.9  Score=30.65  Aligned_cols=21  Identities=19%  Similarity=0.335  Sum_probs=17.7

Q ss_pred             EEEEeCCCCceeeCHHHHHhc
Q 047037           38 VILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      .++||||+|..++++.+.+++
T Consensus       173 ~~ivDSGTt~~~lp~~~~~~l  193 (299)
T cd05472         173 GVIIDSGTVITRLPPSAYAAL  193 (299)
T ss_pred             CeEEeCCCcceecCHHHHHHH
Confidence            479999999999998777654


No 43 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=80.81  E-value=3.5  Score=29.57  Aligned_cols=29  Identities=21%  Similarity=0.242  Sum_probs=23.2

Q ss_pred             eEEEEEEEC--CEEEEEEEeCCCCceeeCHH
Q 047037           25 TMRIQGSIK--GKSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        25 ~~~~~g~i~--g~~v~~LiDSGat~~Fi~~~   53 (85)
                      .|...-.|+  .+++.++|||||+..++...
T Consensus         3 ~y~~~i~iGtP~q~~~v~~DTGS~~~wv~~~   33 (318)
T cd05477           3 SYYGEISIGTPPQNFLVLFDTGSSNLWVPSV   33 (318)
T ss_pred             EEEEEEEECCCCcEEEEEEeCCCccEEEccC
Confidence            355566666  48999999999999999853


No 44 
>cd05477 gastricsin Gastricsins, asparate proteases produced in gastric mucosa. Gastricsin is also called pepsinogen C. Gastricsins are produced in gastric mucosa of mammals. It is synthesized by the chief cells in the stomach as an inactive zymogen. It is self-converted to a mature enzyme under acidic conditions. Human gastricsin is distributed throughout all parts of the stomach. Gastricsin is synthesized as an inactive progastricsin that has an approximately 40 residue prosequence. It is self-converting to a mature enzyme being triggered by a drop in pH from neutrality to acidic conditions. Like other aspartic proteases, gastricsin are characterized by two catalytic aspartic residues at the active site, and display optimal activity at acidic pH. Mature enzyme has a pseudo-2-fold symmetry that passes through the active site between the catalytic aspartate residues. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic aspartate residue, with an exten
Probab=80.61  E-value=2.6  Score=30.26  Aligned_cols=28  Identities=18%  Similarity=0.412  Sum_probs=22.2

Q ss_pred             EECCEEE-------EEEEeCCCCceeeCHHHHHhc
Q 047037           31 SIKGKSV-------VILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        31 ~i~g~~v-------~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      .++++.+       .++||||.|..++++.+.+++
T Consensus       189 ~v~g~~~~~~~~~~~~iiDSGtt~~~lP~~~~~~l  223 (318)
T cd05477         189 QINGQATGWCSQGCQAIVDTGTSLLTAPQQVMSTL  223 (318)
T ss_pred             EECCEEecccCCCceeeECCCCccEECCHHHHHHH
Confidence            5666543       589999999999999888764


No 45 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=80.59  E-value=2.7  Score=30.90  Aligned_cols=28  Identities=29%  Similarity=0.389  Sum_probs=22.7

Q ss_pred             EEEEEEEC--CEEEEEEEeCCCCceeeCHH
Q 047037           26 MRIQGSIK--GKSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        26 ~~~~g~i~--g~~v~~LiDSGat~~Fi~~~   53 (85)
                      |.+.-.|+  .+++.+++||||+..++...
T Consensus         4 Y~~~i~iGtP~Q~~~v~~DTGSs~lWv~~~   33 (364)
T cd05473           4 YYIEMLIGTPPQKLNILVDTGSSNFAVAAA   33 (364)
T ss_pred             eEEEEEecCCCceEEEEEecCCcceEEEcC
Confidence            55666676  58999999999999999753


No 46 
>cd06098 phytepsin Phytepsin, a plant homolog of mammalian lysosomal pepsins. Phytepsin, a plant homolog of mammalian lysosomal pepsins, resides in grains, roots, stems, leaves and flowers. Phytepsin may participate in metabolic turnover and in protein processing events. In addition, it highly expressed in several plant tissues undergoing apoptosis. Phytepsin contains an internal region consisting of about 100 residues not present in animal or microbial pepsins. This region is thus called a plant specific insert. The insert is highly similar to saponins, which are lysosomal sphingolipid-activating proteins in mammalian cells. The saponin-like domain may have a role in the vacuolar targeting of phytepsin. Phytepsin, as its animal counterparts, possesses a topology typical of all aspartic proteases.  They are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe has probably evolved fro
Probab=79.37  E-value=3.1  Score=29.97  Aligned_cols=30  Identities=27%  Similarity=0.337  Sum_probs=24.2

Q ss_pred             EEECCEEE-------EEEEeCCCCceeeCHHHHHhcC
Q 047037           30 GSIKGKSV-------VILIDSGSTHNSINAEVAQRVN   59 (85)
Q Consensus        30 g~i~g~~v-------~~LiDSGat~~Fi~~~~a~~l~   59 (85)
                      ..|+++.+       .++||||.|..++++++++++.
T Consensus       197 i~v~g~~~~~~~~~~~aivDTGTs~~~lP~~~~~~i~  233 (317)
T cd06098         197 VLIGGKSTGFCAGGCAAIADSGTSLLAGPTTIVTQIN  233 (317)
T ss_pred             EEECCEEeeecCCCcEEEEecCCcceeCCHHHHHhhh
Confidence            35666653       5899999999999999998764


No 47 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=79.30  E-value=2  Score=30.86  Aligned_cols=28  Identities=11%  Similarity=0.278  Sum_probs=21.7

Q ss_pred             EECCEEE------EEEEeCCCCceeeCHHHHHhc
Q 047037           31 SIKGKSV------VILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        31 ~i~g~~v------~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      .++|+.+      .++||||++..+++.+..+++
T Consensus       187 ~v~g~~~~~~~~~~aiiDTGTs~~~lP~~~~~~l  220 (316)
T cd05486         187 QVGGTVIFCSDGCQAIVDTGTSLITGPSGDIKQL  220 (316)
T ss_pred             EEecceEecCCCCEEEECCCcchhhcCHHHHHHH
Confidence            4555543      589999999999999877654


No 48 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=78.92  E-value=2.8  Score=30.45  Aligned_cols=21  Identities=10%  Similarity=0.205  Sum_probs=18.5

Q ss_pred             EEEEeCCCCceeeCHHHHHhc
Q 047037           38 VILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      .++||||++..+++.++.+.+
T Consensus       212 ~~iiDSGtt~~~lP~~~~~~l  232 (329)
T cd05485         212 QAIADTGTSLIAGPVDEIEKL  232 (329)
T ss_pred             EEEEccCCcceeCCHHHHHHH
Confidence            699999999999999877754


No 49 
>cd05485 Cathepsin_D_like Cathepsin_D_like, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets an
Probab=78.87  E-value=4.5  Score=29.37  Aligned_cols=31  Identities=16%  Similarity=0.112  Sum_probs=26.6

Q ss_pred             CCeEEEEEEEC--CEEEEEEEeCCCCceeeCHH
Q 047037           23 LETMRIQGSIK--GKSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        23 ~~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~~   53 (85)
                      ...|.+.-.|+  .+++.+++||||+..|+...
T Consensus         9 ~~~Y~~~i~vGtP~q~~~v~~DTGSs~~Wv~~~   41 (329)
T cd05485           9 DAQYYGVITIGTPPQSFKVVFDTGSSNLWVPSK   41 (329)
T ss_pred             CCeEEEEEEECCCCcEEEEEEcCCCccEEEecC
Confidence            36778888888  58999999999999999864


No 50 
>cd05478 pepsin_A Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which 
Probab=78.82  E-value=3.2  Score=29.81  Aligned_cols=29  Identities=17%  Similarity=0.259  Sum_probs=23.6

Q ss_pred             EEECCEEE------EEEEeCCCCceeeCHHHHHhc
Q 047037           30 GSIKGKSV------VILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        30 g~i~g~~v------~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ..++|+.+      .++||||++..+++....+++
T Consensus       194 v~v~g~~~~~~~~~~~iiDTGts~~~lp~~~~~~l  228 (317)
T cd05478         194 VTINGQVVACSGGCQAIVDTGTSLLVGPSSDIANI  228 (317)
T ss_pred             EEECCEEEccCCCCEEEECCCchhhhCCHHHHHHH
Confidence            46777765      589999999999999887653


No 51 
>cd05486 Cathespin_E Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalyt
Probab=78.78  E-value=2.8  Score=30.07  Aligned_cols=26  Identities=23%  Similarity=0.240  Sum_probs=20.2

Q ss_pred             EEEEEC--CEEEEEEEeCCCCceeeCHH
Q 047037           28 IQGSIK--GKSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        28 ~~g~i~--g~~v~~LiDSGat~~Fi~~~   53 (85)
                      +.-.|+  .+++.+++||||+..++...
T Consensus         3 ~~i~iGtP~Q~~~v~~DTGSs~~Wv~s~   30 (316)
T cd05486           3 GQISIGTPPQNFTVIFDTGSSNLWVPSI   30 (316)
T ss_pred             EEEEECCCCcEEEEEEcCCCccEEEecC
Confidence            334454  67899999999999999753


No 52 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=78.41  E-value=4  Score=29.39  Aligned_cols=30  Identities=10%  Similarity=0.174  Sum_probs=25.4

Q ss_pred             CeEEEEEEEC--CEEEEEEEeCCCCceeeCHH
Q 047037           24 ETMRIQGSIK--GKSVVILIDSGSTHNSINAE   53 (85)
Q Consensus        24 ~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~~   53 (85)
                      ..|.+.-.|+  .+++.+++||||+..+|...
T Consensus         9 ~~Y~~~i~iGtp~q~~~v~~DTGSs~~wv~~~   40 (320)
T cd05488           9 AQYFTDITLGTPPQKFKVILDTGSSNLWVPSV   40 (320)
T ss_pred             CEEEEEEEECCCCcEEEEEEecCCcceEEEcC
Confidence            5577788887  48999999999999999764


No 53 
>cd05472 cnd41_like Chloroplast Nucleoids DNA-binding Protease, catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase. Chloroplast Nucleoids DNA-binding Protease catalyzes the degradation of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) in senescent leaves of tobacco. Antisense tobacco with reduced amount of CND41 maintained green leaves and constant protein levels, especially Rubisco.  CND41 has DNA-binding as well as aspartic protease activities. The pepsin-like aspartic protease domain is located at the C-terminus of the protein. The enzyme is characterized by having two aspartic protease catalytic site motifs, the Asp-Thr-Gly-Ser in the N-terminal and Asp-Ser-Gly-Ser in the C-terminal region. Aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. This fami
Probab=78.21  E-value=2.6  Score=29.96  Aligned_cols=26  Identities=12%  Similarity=0.322  Sum_probs=20.6

Q ss_pred             EEEEEEEC--CEEEEEEEeCCCCceeeC
Q 047037           26 MRIQGSIK--GKSVVILIDSGSTHNSIN   51 (85)
Q Consensus        26 ~~~~g~i~--g~~v~~LiDSGat~~Fi~   51 (85)
                      |.+.-.|+  .+++.+++||||+..++.
T Consensus         2 Y~~~i~iGtP~q~~~v~~DTGSs~~Wv~   29 (299)
T cd05472           2 YVVTVGLGTPARDQTVIVDTGSDLTWVQ   29 (299)
T ss_pred             eEEEEecCCCCcceEEEecCCCCccccc
Confidence            34455565  578999999999999994


No 54 
>cd05473 beta_secretase_like Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two 
Probab=74.54  E-value=4.1  Score=29.96  Aligned_cols=20  Identities=25%  Similarity=0.390  Sum_probs=17.4

Q ss_pred             EEEEeCCCCceeeCHHHHHh
Q 047037           38 VILIDSGSTHNSINAEVAQR   57 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~   57 (85)
                      .++||||+|..+++....+.
T Consensus       213 ~~ivDSGTs~~~lp~~~~~~  232 (364)
T cd05473         213 KAIVDSGTTNLRLPVKVFNA  232 (364)
T ss_pred             cEEEeCCCcceeCCHHHHHH
Confidence            48999999999999887764


No 55 
>PF14543 TAXi_N:  Xylanase inhibitor N-terminal; PDB: 3HD8_A 3VLB_A 3VLA_A 3AUP_D 1T6G_A 1T6E_X 2B42_A.
Probab=73.98  E-value=6.5  Score=25.91  Aligned_cols=25  Identities=20%  Similarity=0.335  Sum_probs=17.8

Q ss_pred             EEEEEECC--EEEEEEEeCCCCceeeC
Q 047037           27 RIQGSIKG--KSVVILIDSGSTHNSIN   51 (85)
Q Consensus        27 ~~~g~i~g--~~v~~LiDSGat~~Fi~   51 (85)
                      .+...|+.  +++.++||+|+...++.
T Consensus         2 ~~~~~iGtP~~~~~lvvDtgs~l~W~~   28 (164)
T PF14543_consen    2 YVSVSIGTPPQPFSLVVDTGSDLTWVQ   28 (164)
T ss_dssp             EEEEECTCTTEEEEEEEETT-SSEEEE
T ss_pred             EEEEEeCCCCceEEEEEECCCCceEEc
Confidence            34444544  68999999999999973


No 56 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=73.47  E-value=5.1  Score=27.50  Aligned_cols=22  Identities=14%  Similarity=0.270  Sum_probs=18.7

Q ss_pred             EEEEEEEeCCCCceeeCHHHHH
Q 047037           35 KSVVILIDSGSTHNSINAEVAQ   56 (85)
Q Consensus        35 ~~v~~LiDSGat~~Fi~~~~a~   56 (85)
                      +++.+++||||+..++......
T Consensus        12 q~~~l~~DTGS~~~wv~~~~c~   33 (283)
T cd05471          12 QKFSVIFDTGSSLLWVPSSNCT   33 (283)
T ss_pred             cEEEEEEeCCCCCEEEecCCCC
Confidence            5899999999999999776544


No 57 
>cd05471 pepsin_like Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH. Pepsin-like aspartic proteases are found in mammals, plants, fungi and bacteria. These well known and extensively characterized enzymes include pepsins, chymosin, renin, cathepsins, and fungal aspartic proteases. Several have long been known to be medically (renin, cathepsin D and E, pepsin) or commercially (chymosin) important. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event.  Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residu
Probab=72.90  E-value=3.7  Score=28.19  Aligned_cols=27  Identities=15%  Similarity=0.163  Sum_probs=22.5

Q ss_pred             CCEEEEEEEeCCCCceeeCHHHHHhcC
Q 047037           33 KGKSVVILIDSGSTHNSINAEVAQRVN   59 (85)
Q Consensus        33 ~g~~v~~LiDSGat~~Fi~~~~a~~l~   59 (85)
                      ......++||||++..+++..+.+.+-
T Consensus       199 ~~~~~~~iiDsGt~~~~lp~~~~~~l~  225 (283)
T cd05471         199 SSGGGGAIVDSGTSLIYLPSSVYDAIL  225 (283)
T ss_pred             cCCCcEEEEecCCCCEeCCHHHHHHHH
Confidence            335688999999999999998887763


No 58 
>cd05490 Cathepsin_D2 Cathepsin_D2, pepsin family of proteinases. Cathepsin D is the major aspartic proteinase of the lysosomal compartment where it functions in protein catabolism. It is a member of the pepsin family of proteinases. This enzyme is distinguished from other members of the pepsin family by two features that are characteristic of lysosomal hydrolases. First, mature Cathepsin D is found predominantly in a two-chain form due to a posttranslational cleavage event. Second, it contains phosphorylated, N-linked oligosaccharides that target the enzyme to lysosomes via mannose-6-phosphate receptors. Cathepsin D preferentially attacks peptide bonds flanked by bulky hydrophobic amino acids and its pH optimum is between pH 2.8 and 4.0. Two active site aspartic acid residues are essential for the catalytic activity of aspartic proteinases. Like other aspartic proteinases, Cathepsin D is a bilobed molecule; the two evolutionary related lobes are mostly made up of beta-sheets and flank 
Probab=70.26  E-value=3.4  Score=29.75  Aligned_cols=22  Identities=14%  Similarity=0.229  Sum_probs=19.3

Q ss_pred             EEEEEeCCCCceeeCHHHHHhc
Q 047037           37 VVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ..++||||+|..+++...++.+
T Consensus       207 ~~aiiDSGTt~~~~p~~~~~~l  228 (325)
T cd05490         207 CEAIVDTGTSLITGPVEEVRAL  228 (325)
T ss_pred             CEEEECCCCccccCCHHHHHHH
Confidence            4689999999999999888765


No 59 
>cd05488 Proteinase_A_fungi Fungal Proteinase A , aspartic proteinase superfamily. Fungal Proteinase A, a proteolytic enzyme distributed among a variety of organisms, is a member of the aspartic proteinase superfamily. In Saccharomyces cerevisiae, targeted to the vacuole as a zymogen, activation of proteinases A at acidic pH can occur by two different pathways: a one-step process to release mature proteinase A, involving the intervention of proteinase B, or a step-wise pathway via the auto-activation product known as pseudo-proteinase A. Once active, S. cerevisiae proteinase A is essential to the activities of other yeast vacuolar hydrolases, including proteinase B and carboxypeptidase Y. The mature enzyme is bilobal, with each lobe providing one of the two catalytically essential aspartic acid residues in the active site. The crystal structure of free proteinase A shows that flap loop is atypically pointing directly into the S(1) pocket of the enzyme.  Proteinase A preferentially hydro
Probab=70.22  E-value=3.4  Score=29.78  Aligned_cols=22  Identities=18%  Similarity=0.344  Sum_probs=19.1

Q ss_pred             EEEEEeCCCCceeeCHHHHHhc
Q 047037           37 VVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ..++||||+|..++++++++.+
T Consensus       206 ~~~ivDSGtt~~~lp~~~~~~l  227 (320)
T cd05488         206 TGAAIDTGTSLIALPSDLAEML  227 (320)
T ss_pred             CeEEEcCCcccccCCHHHHHHH
Confidence            3579999999999999998764


No 60 
>PLN03146 aspartyl protease family protein; Provisional
Probab=69.80  E-value=8.2  Score=29.46  Aligned_cols=30  Identities=17%  Similarity=0.079  Sum_probs=25.4

Q ss_pred             CCeEEEEEEEC--CEEEEEEEeCCCCceeeCH
Q 047037           23 LETMRIQGSIK--GKSVVILIDSGSTHNSINA   52 (85)
Q Consensus        23 ~~~~~~~g~i~--g~~v~~LiDSGat~~Fi~~   52 (85)
                      .+.|.+...|+  .+++.+++||||...|+.-
T Consensus        82 ~~~Y~v~i~iGTPpq~~~vi~DTGS~l~Wv~C  113 (431)
T PLN03146         82 GGEYLMNISIGTPPVPILAIADTGSDLIWTQC  113 (431)
T ss_pred             CccEEEEEEcCCCCceEEEEECCCCCcceEcC
Confidence            45788888888  5789999999999999953


No 61 
>PTZ00165 aspartyl protease; Provisional
Probab=69.48  E-value=9.4  Score=29.83  Aligned_cols=34  Identities=21%  Similarity=0.191  Sum_probs=28.1

Q ss_pred             CCeEEEEEEECC--EEEEEEEeCCCCceeeCHHHHH
Q 047037           23 LETMRIQGSIKG--KSVVILIDSGSTHNSINAEVAQ   56 (85)
Q Consensus        23 ~~~~~~~g~i~g--~~v~~LiDSGat~~Fi~~~~a~   56 (85)
                      ...|...-.|+.  +++.+++||||+..++...-..
T Consensus       118 d~~Y~~~I~IGTPpQ~f~Vv~DTGSS~lWVps~~C~  153 (482)
T PTZ00165        118 NSQYFGEIQVGTPPKSFVVVFDTGSSNLWIPSKECK  153 (482)
T ss_pred             CCeEEEEEEeCCCCceEEEEEeCCCCCEEEEchhcC
Confidence            356788888886  9999999999999999875443


No 62 
>cd05474 SAP_like SAPs, pepsin-like proteinases secreted from pathogens to degrade host proteins. SAPs (Secreted aspartic proteinases) are secreted from a group of pathogenic fungi, predominantly Candida species. They are secreted from the pathogen to degrade host proteins. SAP is one of the most significant extracellular hydrolytic enzymes produced by C. albicans. SAP proteins, encoded by a family of 10 SAP genes. All 10 SAP genes of C. albicans encode preproenzymes, approximately 60 amino acid longer than the mature enzyme, which are processed when transported via the secretory pathway. The mature enzymes contain sequence motifs typical for all aspartyl proteinases, including the two conserved aspartate residues other active site and conserved cysteine residues implicated in the maintenance of the three-dimensional structure. Most Sap proteins contain putative N-glycosylation sites, but it remains to be determined which Sap proteins are glycosylated. This family of aspartate proteases
Probab=66.78  E-value=5  Score=28.12  Aligned_cols=24  Identities=25%  Similarity=0.509  Sum_probs=20.4

Q ss_pred             EEEEEEEeCCCCceeeCHHHHHhc
Q 047037           35 KSVVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        35 ~~v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ....++||||++..+++..+.+++
T Consensus       177 ~~~~~iiDSGt~~~~lP~~~~~~l  200 (295)
T cd05474         177 KNLPALLDSGTTLTYLPSDIVDAI  200 (295)
T ss_pred             CCccEEECCCCccEeCCHHHHHHH
Confidence            345789999999999999988765


No 63 
>cd05487 renin_like Renin stimulates production of angiotensin and thus affects blood pressure. Renin, also known as angiotensinogenase, is a circulating enzyme that participates in the renin-angiotensin system that mediates extracellular volume, arterial vasoconstriction, and consequently mean arterial blood pressure. The enzyme is secreted by the kidneys from specialized juxtaglomerular cells in response to decreases in glomerular filtration rate (a consequence of low blood volume), diminished filtered sodium chloride and sympathetic nervous system innervation. The enzyme circulates in the blood stream and hydrolyzes angiotensinogen secreted from the liver into the peptide angiotensin I. Angiotensin I is further cleaved in the lungs by endothelial bound angiotensin converting enzyme (ACE) into angiotensin II, the final active peptide. Renin is a member of the aspartic protease family. Structurally, aspartic proteases are bilobal enzymes, each lobe contributing a catalytic Aspartate  r
Probab=62.27  E-value=6.2  Score=28.51  Aligned_cols=21  Identities=14%  Similarity=0.273  Sum_probs=17.9

Q ss_pred             EEEEeCCCCceeeCHHHHHhc
Q 047037           38 VILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      .++||||.+..+++.+..+++
T Consensus       209 ~aiiDSGts~~~lP~~~~~~l  229 (326)
T cd05487         209 TAVVDTGASFISGPTSSISKL  229 (326)
T ss_pred             EEEECCCccchhCcHHHHHHH
Confidence            589999999999999866554


No 64 
>PF13793 Pribosyltran_N:  N-terminal domain of ribose phosphate pyrophosphokinase; PDB: 2JI4_A 1DKU_B 1IBS_B 1DKR_B 3MBI_C 3LRT_B 3LPN_B 3NAG_B 2H07_B 2H06_B ....
Probab=60.78  E-value=8.8  Score=24.33  Aligned_cols=36  Identities=19%  Similarity=0.232  Sum_probs=25.6

Q ss_pred             EEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           40 LIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        40 LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      .|=+|+++..+..++|++||+++..  .-.-+-+||+.
T Consensus         2 ~I~~g~~~~~La~~ia~~L~~~~~~--~~~~~F~dGE~   37 (116)
T PF13793_consen    2 VIFSGSSSQDLAERIAEALGIPLGK--VETKRFPDGET   37 (116)
T ss_dssp             EEEESSSGHHHHHHHHHHTTS-EE---EEEEE-TTS-E
T ss_pred             EEEECCCCHHHHHHHHHHhCCceee--eEEEEcCCCCE
Confidence            4568999999999999999998874  44445666654


No 65 
>PF00026 Asp:  Eukaryotic aspartyl protease The Prosite entry also includes Pfam:PF00077.;  InterPro: IPR001461 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A1 (pepsin family, clan AA). The type example is pepsin A from Homo sapiens (Human) .  More than 70 aspartic peptidases, from all from eukaryotic organisms, have been identified. These include pepsins, cathepsins, and renins. The enzymes are synthesised with signal peptides, and the proenzymes are secreted or passed into the lysosomal/endosomal system, where acidification leads to autocatalytic activation. Most members of the pepsin family specifically cleave bonds in peptides that are at least six residues in length, with hydrophobic residues in both the P1 and P1' positions []. Crystallography has shown the active site to form a groove across the junction of the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors within the active site []. Specificity is determined by several hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. Cysteine residues are well conserved within the pepsin family, pepsin itself containing three disulphide loops. The first loop is found in all but the fungal enzymes, and is usually around five residues in length, but is longer in barrierpepsin and candidapepsin; the second loop is also small and found only in the animal enzymes; and the third loop is the largest, found in all members of the family, except for the cysteine-free polyporopepsin. The loops are spread unequally throughout the two lobes, suggesting that they formed after the initial gene duplication and fusion event []. This family does not include the retroviral nor retrotransposon aspartic proteases which are much smaller and appear to be homologous to the single domain aspartic proteases.; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 1CZI_E 3CMS_A 1CMS_A 4CMS_A 1YG9_A 2NR6_A 3LIZ_A 1FLH_A 3UTL_A 1QRP_E ....
Probab=58.84  E-value=11  Score=26.46  Aligned_cols=23  Identities=9%  Similarity=0.245  Sum_probs=19.9

Q ss_pred             EEEEEEeCCCCceeeCHHHHHhc
Q 047037           36 SVVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ...++||||++..+++..+...+
T Consensus       199 ~~~~~~Dtgt~~i~lp~~~~~~i  221 (317)
T PF00026_consen  199 GQQAILDTGTSYIYLPRSIFDAI  221 (317)
T ss_dssp             EEEEEEETTBSSEEEEHHHHHHH
T ss_pred             ceeeecccccccccccchhhHHH
Confidence            46799999999999999887754


No 66 
>PTZ00147 plasmepsin-1; Provisional
Probab=58.17  E-value=19  Score=27.96  Aligned_cols=23  Identities=17%  Similarity=0.421  Sum_probs=19.5

Q ss_pred             EEEEEEeCCCCceeeCHHHHHhc
Q 047037           36 SVVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ...++||||.|..+++...++++
T Consensus       332 ~~~aIiDSGTsli~lP~~~~~ai  354 (453)
T PTZ00147        332 KANVIVDSGTSVITVPTEFLNKF  354 (453)
T ss_pred             ceeEEECCCCchhcCCHHHHHHH
Confidence            35689999999999999988653


No 67 
>PF14541 TAXi_C:  Xylanase inhibitor C-terminal; PDB: 3AUP_D 3HD8_A 1T6G_A 1T6E_X 2B42_A 3VLB_A 3VLA_A.
Probab=57.11  E-value=9.6  Score=24.81  Aligned_cols=19  Identities=32%  Similarity=0.289  Sum_probs=14.3

Q ss_pred             EEEEEeCCCCceeeCHHHH
Q 047037           37 VVILIDSGSTHNSINAEVA   55 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a   55 (85)
                      --+.||||.+.+++.+.+=
T Consensus        30 g~~iiDSGT~~T~L~~~~y   48 (161)
T PF14541_consen   30 GGTIIDSGTTYTYLPPPVY   48 (161)
T ss_dssp             CSEEE-SSSSSEEEEHHHH
T ss_pred             CCEEEECCCCccCCcHHHH
Confidence            3467999999999987653


No 68 
>COG2861 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=56.65  E-value=9.9  Score=27.66  Aligned_cols=29  Identities=21%  Similarity=0.401  Sum_probs=25.5

Q ss_pred             EEEEEeCCCCceeeCHHHHHhcCCcccCC
Q 047037           37 VVILIDSGSTHNSINAEVAQRVNLSPNAN   65 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l~l~~~~~   65 (85)
                      =++.||||.+-+-+..++|++.++|....
T Consensus       151 ~l~flDs~T~a~S~a~~iAk~~gVp~~~r  179 (250)
T COG2861         151 GLYFLDSGTIANSLAGKIAKEIGVPVIKR  179 (250)
T ss_pred             CeEEEcccccccchhhhhHhhcCCceeee
Confidence            46789999999999999999999988753


No 69 
>PF01568 Molydop_binding:  Molydopterin dinucleotide binding domain;  InterPro: IPR006657 A domain in this entry corresponds to the C-terminal domain IV in dimethyl sulphoxide (DMSO)reductase which interacts with the 2-amino pyrimidone ring of both molybdopterin guanine dinucleotide molecules [].; GO: 0016491 oxidoreductase activity, 0030151 molybdenum ion binding, 0055114 oxidation-reduction process; PDB: 2IVF_A 1OGY_G 3ML1_A 3O5A_A 1TI2_G 1VLE_M 1VLD_U 1VLF_O 1TI4_I 1TI6_E ....
Probab=55.60  E-value=14  Score=22.20  Aligned_cols=28  Identities=18%  Similarity=0.285  Sum_probs=22.0

Q ss_pred             ceeeCHHHHHhcCCcccCCCcEEEEeCCCC
Q 047037           47 HNSINAEVAQRVNLSPNANNRLEIMVAFGE   76 (85)
Q Consensus        47 ~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~   76 (85)
                      ..+|+++.|+++||.--  ..++|....|.
T Consensus        31 ~v~inp~dA~~~Gi~~G--d~V~v~s~~G~   58 (110)
T PF01568_consen   31 FVEINPEDAAKLGIKDG--DWVRVSSPRGS   58 (110)
T ss_dssp             EEEEEHHHHHHCT--TT--CEEEEEETTEE
T ss_pred             EEEEcHHHHHHhcCcCC--CEEEEEeccce
Confidence            45899999999996554  88999988884


No 70 
>PTZ00013 plasmepsin 4 (PM4); Provisional
Probab=54.55  E-value=9.5  Score=29.59  Aligned_cols=22  Identities=23%  Similarity=0.428  Sum_probs=18.4

Q ss_pred             EEEEEeCCCCceeeCHHHHHhc
Q 047037           37 VVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ..++||||.|..++++..++++
T Consensus       332 ~~aIlDSGTSli~lP~~~~~~i  353 (450)
T PTZ00013        332 ANVIVDSGTTTITAPSEFLNKF  353 (450)
T ss_pred             cceEECCCCccccCCHHHHHHH
Confidence            4589999999999998877654


No 71 
>COG2336 MazE Growth regulator [Signal transduction mechanisms]
Probab=54.09  E-value=33  Score=20.86  Aligned_cols=31  Identities=19%  Similarity=0.348  Sum_probs=24.7

Q ss_pred             ceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEE
Q 047037           47 HNSINAEVAQRVNLSPNANNRLEIMVAFGEKLM   79 (85)
Q Consensus        47 ~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~   79 (85)
                      ..-|+..+++.+++...  ..+.|.+.|++.+-
T Consensus        13 avrIP~~l~kql~l~~g--~~v~v~v~n~~~i~   43 (82)
T COG2336          13 AVRIPAALLKQLNLTIG--DEVEVEVGNDQSIL   43 (82)
T ss_pred             eeeccHHHHHHhCCCcC--ceEEEEEcCCcEEE
Confidence            34589999999997666  89999999885543


No 72 
>cd05489 xylanase_inhibitor_I_like TAXI-I inhibits degradation of xylan in the cell wall. Xylanase inhibitor-I (TAXI-I) is a member of potent TAXI-type inhibitors of fungal and bacterial family 11 xylanases. Plants developed a diverse battery of defense mechanisms in response to continual challenges by a broad spectrum of pathogenic microorganisms. Their defense arsenal includes inhibitors of cell wall-degrading enzymes, which hinder a possible invasion and colonization by antagonists. Xylanases of fungal and bacterial pathogens are the key enzymes in the degradation of xylan in the cell wall. Plants secrete proteins that inhibit these degradation glycosidases, including xylanase. Surprisingly, TAXI-I displays structural homology with the pepsin-like family of aspartic proteases but is proteolytically nonfunctional, because one or more residues of the essential catalytic triad are absent. The structure of the TAXI-inhibitor, Aspergillus niger xylanase I complex, illustrates the ability 
Probab=52.95  E-value=19  Score=26.83  Aligned_cols=20  Identities=0%  Similarity=0.074  Sum_probs=16.6

Q ss_pred             EEEEeCCCCceeeCHHHHHh
Q 047037           38 VILIDSGSTHNSINAEVAQR   57 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~   57 (85)
                      -++||||.+..+++..+.+.
T Consensus       231 g~iiDSGTs~t~lp~~~y~~  250 (362)
T cd05489         231 GVKLSTVVPYTVLRSDIYRA  250 (362)
T ss_pred             cEEEecCCceEEECHHHHHH
Confidence            38999999999988876554


No 73 
>cd02778 MopB_CT_Thiosulfate-R-like The MopB_CT_Thiosulfate-R-like CD contains thiosulfate-, sulfur-, and polysulfide-reductases, and other related proteins. Thiosulfate reductase catalyzes the cleavage of sulfur-sulfur bonds in thiosulfate. Polysulfide reductase is a membrane-bound enzyme that catalyzes the reduction of polysulfide using either hydrogen or formate as the electron donor. Also included in this CD is the phenylacetyl-CoA:acceptor oxidoreductase, large subunit (PadB2), which has been characterized as a membrane-bound molybdenum-iron-sulfur enzyme involved in anaerobic metabolism of phenylalanine in the denitrifying bacterium Thauera aromatica. The MopB_CT_Thiosulfate-R-like CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=52.59  E-value=37  Score=20.72  Aligned_cols=32  Identities=28%  Similarity=0.361  Sum_probs=25.0

Q ss_pred             CCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           45 STHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        45 at~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      ....+|+++.|+++|+.--  ..++|....|+..
T Consensus        29 ~~~v~i~p~dA~~~gi~~G--d~V~v~s~~G~i~   60 (123)
T cd02778          29 ENTLWINPETAARLGIKDG--DRVEVSSARGKVT   60 (123)
T ss_pred             CCeEEECHHHHHHcCCCCC--CEEEEEeCCCcEE
Confidence            3567999999999997544  7888888888643


No 74 
>PF07629 DUF1590:  Protein of unknown function (DUF1590);  InterPro: IPR011481 These hypothetical proteins in Rhodopirellula baltica have a conserved C-terminal region.
Probab=52.56  E-value=12  Score=18.52  Aligned_cols=21  Identities=24%  Similarity=0.354  Sum_probs=14.2

Q ss_pred             CCcccccCCceeEEEeccccC
Q 047037            1 MDDVCENATPKISLHDVARDQ   21 (85)
Q Consensus         1 ~~~~~e~~~~~iSl~A~~g~~   21 (85)
                      |+.-++--.|+|||+|..-++
T Consensus         1 m~~ga~~pppeislna~fptp   21 (32)
T PF07629_consen    1 MENGADCPPPEISLNARFPTP   21 (32)
T ss_pred             CCCCCCCCCCcceeccccCCC
Confidence            444445456899999986543


No 75 
>PTZ00165 aspartyl protease; Provisional
Probab=52.52  E-value=21  Score=27.95  Aligned_cols=22  Identities=23%  Similarity=0.209  Sum_probs=18.5

Q ss_pred             EEEEEeCCCCceeeCHHHHHhc
Q 047037           37 VVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ..++||||.+...++..+.+.+
T Consensus       328 ~~aIiDTGTSli~lP~~~~~~i  349 (482)
T PTZ00165        328 CKAAIDTGSSLITGPSSVINPL  349 (482)
T ss_pred             eEEEEcCCCccEeCCHHHHHHH
Confidence            4589999999999999886544


No 76 
>PF08922 DUF1905:  Domain of unknown function (DUF1905);  InterPro: IPR015018 This family consist of hypothetical bacterial proteins. ; PDB: 2D9R_A.
Probab=52.08  E-value=48  Score=19.51  Aligned_cols=46  Identities=17%  Similarity=0.320  Sum_probs=25.5

Q ss_pred             CeEEEEEEECCEEEEEEEeC-CCCceee--CHHHHHhcCCcccCCCcEEEE
Q 047037           24 ETMRIQGSIKGKSVVILIDS-GSTHNSI--NAEVAQRVNLSPNANNRLEIM   71 (85)
Q Consensus        24 ~~~~~~g~i~g~~v~~LiDS-Gat~~Fi--~~~~a~~l~l~~~~~~~~~V~   71 (85)
                      +.+++.++|+|.+..-=+=. |.-..+|  +..+=+.++  ......+.|.
T Consensus        31 g~v~V~~tI~g~~~~~sl~p~g~G~~~Lpv~~~vRk~~g--~~~Gd~V~v~   79 (80)
T PF08922_consen   31 GRVPVRGTIDGHPWRTSLFPMGNGGYILPVKAAVRKAIG--KEAGDTVEVT   79 (80)
T ss_dssp             S-EEEEEEETTEEEEEEEEESSTT-EEEEE-HHHHHHHT----TTSEEEEE
T ss_pred             CceEEEEEECCEEEEEEEEECCCCCEEEEEcHHHHHHcC--CCCCCEEEEE
Confidence            89999999999986543333 5444444  444445555  4444555554


No 77 
>PF07894 DUF1669:  Protein of unknown function (DUF1669);  InterPro: IPR012461 This family is composed of sequences derived from hypothetical eukaryotic proteins of unknown function. Some members of this family are annotated as being potential phospholipases but no literature was found to support this. 
Probab=51.61  E-value=24  Score=26.11  Aligned_cols=47  Identities=17%  Similarity=0.289  Sum_probs=41.2

Q ss_pred             ECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           32 IKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        32 i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      -.+.+|.+|+|=-.-..|+  +.++++++.....+.++|++..|..-.|
T Consensus       172 kR~VpVYiLLD~~~~~~Fl--~Mc~~~~v~~~~~~nmrVRsv~G~~y~~  218 (284)
T PF07894_consen  172 KRGVPVYILLDEQNLPHFL--EMCEKLGVNLQHLKNMRVRSVTGCTYYS  218 (284)
T ss_pred             hcCCcEEEEechhcChHHH--HHHHHCCCChhhcCCeEEEEecCCeeec
Confidence            4577999999999999998  5899999999999999999999987655


No 78 
>COG2093 DNA-directed RNA polymerase, subunit E'' [Transcription]
Probab=50.85  E-value=20  Score=20.82  Aligned_cols=28  Identities=25%  Similarity=0.528  Sum_probs=20.5

Q ss_pred             EEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEE
Q 047037           37 VVILIDSGSTHNSINAEVAQRVNLSPNANNRLEI   70 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V   70 (85)
                      ++++||+-      ..+.|+|+|++.--...++|
T Consensus        36 ~~iIidpe------~SeIAkrlgi~~Pg~yAl~V   63 (64)
T COG2093          36 LLIIIDPE------KSEIAKRLGIKIPGKYALRV   63 (64)
T ss_pred             EEEEEcCc------HHHHHHHhCCCCCceEEEEe
Confidence            56777776      45899999988765566655


No 79 
>PF03419 Peptidase_U4:  Sporulation factor SpoIIGA  This family belongs to family U4 of the peptidase classification.;  InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-).  Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=50.41  E-value=20  Score=25.84  Aligned_cols=25  Identities=24%  Similarity=0.276  Sum_probs=19.7

Q ss_pred             cCCeEEEEEEECCEE--EEEEEeCCCC
Q 047037           22 VLETMRIQGSIKGKS--VVILIDSGST   46 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~--v~~LiDSGat   46 (85)
                      ....+.++-.++|++  +..|+|||..
T Consensus       154 ~~~~~~v~i~~~~~~~~~~allDTGN~  180 (293)
T PF03419_consen  154 KDYLYPVTIEIGGKKIELKALLDTGNQ  180 (293)
T ss_pred             CcEEEEEEEEECCEEEEEEEEEECCCc
Confidence            346688888888887  7889999954


No 80 
>cd02790 MopB_CT_Formate-Dh_H Formate dehydrogenase H (Formate-Dh-H) catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. It is a component of the anaerobic formate hydrogen lyase complex. The E. coli formate dehydrogenase H (Fdh-H) is a monomer composed of a single polypeptide chain with a  Mo active site region and a [4Fe-4S] center. This CD (MopB_CT_Formate-Dh_H) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=50.38  E-value=24  Score=21.25  Aligned_cols=30  Identities=20%  Similarity=0.293  Sum_probs=23.7

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++|+.  ....+.|..+.|+.
T Consensus        35 ~~v~in~~dA~~lgi~--~Gd~V~v~~~~G~~   64 (116)
T cd02790          35 EYVEINPEDAKRLGIE--DGEKVRVSSRRGSV   64 (116)
T ss_pred             cEEEECHHHHHHcCCC--CCCEEEEEcCCEEE
Confidence            4579999999999964  44788888888753


No 81 
>cd02788 MopB_CT_NDH-1_NuoG2-N7 MopB_CT_NDH-1_NuoG2-N7: C-terminal region of the NuoG-like subunit (of the variant with a [4Fe-4S] cluster, N7) of the NADH-quinone oxidoreductase/NADH dehydrogenase-1 (NDH-1) found in various bacteria. The NDH-1 is the first energy-transducting complex in the respiratory chain and functions as a redox pump that uses the redox energy to translocate H+ ions across the membrane, resulting in a significant contribution to energy production. In Escherichia coli NDH-1, the largest subunit is encoded by the nuoG gene, and is part of the 14 distinct subunits constituting the functional enzyme. The NuoG subunit is made of two domains: the first contains three binding sites for FeS clusters (the fer2 domain), the second domain, is of unknown function or, as postulated, has lost an ancestral formate dehydrogenase activity that became redundant during the evolution of the complex I enzyme. Unique to this group, compared to the other prokaryotic and eukaryotic groups
Probab=50.04  E-value=28  Score=20.74  Aligned_cols=29  Identities=17%  Similarity=0.226  Sum_probs=23.1

Q ss_pred             ceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           47 HNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        47 ~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ..+|+++.|+++|+.--  ..++|....|+.
T Consensus        30 ~v~inp~dA~~lGi~~G--d~V~v~s~~G~i   58 (96)
T cd02788          30 YARLSPADAARLGLADG--DLVEFSLGDGTL   58 (96)
T ss_pred             EEEECHHHHHHcCCCCC--CEEEEEECCeEE
Confidence            46899999999997544  788888888764


No 82 
>PLN03146 aspartyl protease family protein; Provisional
Probab=49.88  E-value=14  Score=28.25  Aligned_cols=18  Identities=33%  Similarity=0.486  Sum_probs=15.6

Q ss_pred             EEEEeCCCCceeeCHHHH
Q 047037           38 VILIDSGSTHNSINAEVA   55 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a   55 (85)
                      -++||||.|.+++.+..-
T Consensus       309 ~~iiDSGTt~t~Lp~~~y  326 (431)
T PLN03146        309 NIIIDSGTTLTLLPSDFY  326 (431)
T ss_pred             cEEEeCCccceecCHHHH
Confidence            479999999999999853


No 83 
>cd02792 MopB_CT_Formate-Dh-Na-like Formate dehydrogenase N, alpha subunit (Formate-Dh-Na) is a major component of nitrate respiration in bacteria such as in the E. coli formate dehydrogenase N (Fdh-N). Fdh-N is a membrane protein that is a complex of three different subunits and is the major electron donor to the nitrate respiratory chain. Also included in this CD is the Desulfovibrio gigas tungsten formate dehydrogenase, DgW-FDH. In contrast to Fdh-N, which is a  functional heterotrimer, DgW-FDH is a heterodimer. The DgW-FDH complex is composed of a large subunit carrying the W active site and one [4Fe-4S] center, and a small subunit that harbors a series of three [4Fe-4S] clusters as well as a putative vacant binding site for a fourth cluster. The smaller subunit is not included in this alignment. This CD (MopB_CT_Formate-Dh-Na-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=49.22  E-value=30  Score=21.07  Aligned_cols=30  Identities=13%  Similarity=0.144  Sum_probs=23.3

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|++..|+++||.-  ...++|....|+.
T Consensus        35 ~~v~i~p~dA~~lgi~~--Gd~V~v~s~~G~~   64 (122)
T cd02792          35 MFVEISPELAAERGIKN--GDMVWVSSPRGKI   64 (122)
T ss_pred             cEEEECHHHHHHcCCCC--CCEEEEEcCCceE
Confidence            34789999999999654  4788888887753


No 84 
>cd02779 MopB_CT_Arsenite-Ox This CD contains the molybdopterin_binding C-terminal (MopB_CT) region of Arsenite oxidase (Arsenite-Ox) and related proteins. Arsenite oxidase oxidizes arsenite to the less toxic arsenate; it transfers the electrons obtained from the oxidation of arsenite towards the soluble periplasmic electron carriers cytochrome c and/or amicyanin.
Probab=48.85  E-value=57  Score=19.90  Aligned_cols=30  Identities=23%  Similarity=0.329  Sum_probs=23.7

Q ss_pred             CCceeeCHHHHHhcCCcccCCCcEEEEeCCCC
Q 047037           45 STHNSINAEVAQRVNLSPNANNRLEIMVAFGE   76 (85)
Q Consensus        45 at~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~   76 (85)
                      ....+|+++.|+++|+.--  ..+.|....|+
T Consensus        32 ~~~v~in~~dA~~lgi~~G--d~V~v~s~~G~   61 (115)
T cd02779          32 LPYIEVNPEDAKREGLKNG--DLVEVYNDYGS   61 (115)
T ss_pred             CCEEEECHHHHHHcCCCCC--CEEEEEeCCEE
Confidence            3457999999999996554  78888888874


No 85 
>cd01813 UBP_N UBP ubiquitin processing protease. The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates.  This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP).   This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
Probab=48.55  E-value=29  Score=20.00  Aligned_cols=39  Identities=10%  Similarity=0.063  Sum_probs=34.5

Q ss_pred             EEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccC
Q 047037           26 MRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNA   64 (85)
Q Consensus        26 ~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~   64 (85)
                      |++....+|+...+=|+..+|..-+-+.+.++.|+|.+.
T Consensus         1 ~~i~vk~~g~~~~v~v~~~~Tv~~lK~~i~~~tgvp~~~   39 (74)
T cd01813           1 VPVIVKWGGQEYSVTTLSEDTVLDLKQFIKTLTGVLPER   39 (74)
T ss_pred             CEEEEEECCEEEEEEECCCCCHHHHHHHHHHHHCCCHHH
Confidence            467788899999999999999999999999999988774


No 86 
>cd02787 MopB_CT_ydeP The MopB_CT_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=48.20  E-value=30  Score=20.97  Aligned_cols=31  Identities=23%  Similarity=0.402  Sum_probs=24.8

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      ...+|+++.|+++|+.--  ..++|....|...
T Consensus        31 ~~v~i~p~dA~~lgI~dG--d~V~v~s~~G~i~   61 (112)
T cd02787          31 DVVFMNPDDIARLGLKAG--DRVDLESAFGDGQ   61 (112)
T ss_pred             cEEEECHHHHHHhCCCCC--CEEEEEecCCCCe
Confidence            347999999999996654  7888888888643


No 87 
>cd02785 MopB_CT_4 The MopB_CT_4 CD includes a group of related uncharacterized bacterial and archaeal molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=48.15  E-value=33  Score=21.17  Aligned_cols=29  Identities=17%  Similarity=0.139  Sum_probs=23.1

Q ss_pred             ceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           47 HNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        47 ~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ..+|+++.|+++|+.-  ...++|....|+.
T Consensus        33 ~v~i~p~dA~~~gi~~--Gd~V~v~s~~G~i   61 (124)
T cd02785          33 RVKINPIDAAARGIAH--GDLVEVYNDRGSV   61 (124)
T ss_pred             eEEECHHHHHHcCCCC--CCEEEEEeCCCEE
Confidence            4699999999999754  4788888888754


No 88 
>PRK12440 acetate kinase; Reviewed
Probab=48.15  E-value=11  Score=29.10  Aligned_cols=38  Identities=16%  Similarity=0.286  Sum_probs=33.8

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEEe
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMSS   81 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~~   81 (85)
                      |-+|-|+.+++++.+|-+....+-+...++||-++...
T Consensus       180 GlS~~~v~~~~a~~lg~~~~~~~~Iv~HLG~G~Si~Ai  217 (397)
T PRK12440        180 GTSHYFVSREAAKMLNKPIEESSFISVHLGNGASVCAI  217 (397)
T ss_pred             HHhHHHHHHHHHHHhCCChHHcCEEEEEeCCCcEeeee
Confidence            67899999999999998888889999999999888653


No 89 
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=46.39  E-value=18  Score=27.12  Aligned_cols=32  Identities=19%  Similarity=0.132  Sum_probs=23.7

Q ss_pred             CeEEEEEEECC--EEEEEEEeCCCCceeeCHHHH
Q 047037           24 ETMRIQGSIKG--KSVVILIDSGSTHNSINAEVA   55 (85)
Q Consensus        24 ~~~~~~g~i~g--~~v~~LiDSGat~~Fi~~~~a   55 (85)
                      +.|.+.-.|+-  +.+.+++||||...+|+-.-.
T Consensus        45 ~~Y~~~i~IGTPpq~f~v~~DTGS~~lWV~c~~c   78 (398)
T KOG1339|consen   45 GEYYGNISIGTPPQSFTVVLDTGSDLLWVPCAPC   78 (398)
T ss_pred             cccEEEEecCCCCeeeEEEEeCCCCceeeccccc
Confidence            45555555553  569999999999999988444


No 90 
>cd02794 MopB_CT_DmsA-EC The MopB_CT_DmsA-EC CD includes the DmsA enzyme of the dmsABC operon encoding the anaerobic dimethylsulfoxide reductase (DMSOR) of Escherichia coli and other related DMSOR-like enzymes. Unlike other DMSOR-like enzymes, this group has a  predicted N-terminal iron-sulfur [4Fe-4S] cluster binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=46.27  E-value=37  Score=20.88  Aligned_cols=29  Identities=17%  Similarity=0.127  Sum_probs=23.1

Q ss_pred             ceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           47 HNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        47 ~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ..+|+++.|+++|+.--  ..++|....|+.
T Consensus        31 ~v~i~p~~A~~~gi~~G--d~V~v~s~~g~i   59 (121)
T cd02794          31 EVWINPLDAAARGIKDG--DRVLVFNDRGKV   59 (121)
T ss_pred             CEEECHHHHHHcCCCCC--CEEEEEcCCceE
Confidence            36999999999996654  778888888754


No 91 
>KOG3048 consensus Molecular chaperone Prefoldin, subunit 5 [Posttranslational modification, protein turnover, chaperones]
Probab=45.87  E-value=17  Score=24.55  Aligned_cols=28  Identities=18%  Similarity=0.399  Sum_probs=22.8

Q ss_pred             cCCeEEEEEEECCEEEEEEEeCCCCceeeC
Q 047037           22 VLETMRIQGSIKGKSVVILIDSGSTHNSIN   51 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~   51 (85)
                      -...++++|+++. .=.+|||-| |..||-
T Consensus        69 LTsSlYVPGkl~d-~~k~lVDIG-TGYyVE   96 (153)
T KOG3048|consen   69 LTSSLYVPGKLSD-NSKFLVDIG-TGYYVE   96 (153)
T ss_pred             cccceeccceecc-ccceeEecc-CceEEe
Confidence            3467999999998 778999999 667764


No 92 
>cd02781 MopB_CT_Acetylene-hydratase The MopB_CT_Acetylene-hydratase CD contains acetylene hydratase (Ahy) and other related proteins. The acetylene hydratase of Pelobacter acetylenicus is a tungsten iron-sulfur protein involved in the fermentation of acetylene to ethanol and acetate. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=45.82  E-value=35  Score=21.09  Aligned_cols=30  Identities=17%  Similarity=0.261  Sum_probs=23.3

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++||.--  ..++|....|+.
T Consensus        33 ~~v~inp~dA~~~gi~~G--d~V~v~s~~G~~   62 (130)
T cd02781          33 PVAEINPETAAKLGIADG--DWVWVETPRGRA   62 (130)
T ss_pred             CEEEECHHHHHHcCCCCC--CEEEEECCCCEE
Confidence            347999999999997543  777888877754


No 93 
>cd02775 MopB_CT Molybdopterin-Binding, C-terminal (MopB_CT) domain of the MopB superfamily of proteins, a  large, diverse, heterogeneous superfamily of enzymes that, in general, bind molybdopterin as a cofactor. The MopB domain is found in a wide variety of molybdenum- and tungsten-containing enzymes, including formate dehydrogenase-H (Fdh-H) and -N (Fdh-N), several forms of nitrate reductase (Nap, Nas, NarG), dimethylsulfoxide reductase (DMSOR), thiosulfate reductase, formylmethanofuran dehydrogenase, and arsenite oxidase. Molybdenum is present in most of these enzymes in the form of molybdopterin, a modified pterin ring with a dithiolene side chain, which is responsible for ligating the Mo. In many bacterial and archaeal species, molybdopterin is in the form of a dinucleotide, with two molybdopterin dinucleotide units per molybdenum. These proteins can function as monomers, heterodimers, or heterotrimers, depending on the protein and organism. Also included in the MopB superfamily is
Probab=45.30  E-value=38  Score=19.65  Aligned_cols=30  Identities=17%  Similarity=0.168  Sum_probs=22.6

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++|+.--  ..+.|....|..
T Consensus        23 ~~v~~~~~da~~lgl~~G--d~v~v~~~~g~~   52 (101)
T cd02775          23 PVVEINPEDAAALGIKDG--DLVRVESRRGSV   52 (101)
T ss_pred             CEEEECHHHHHHcCCCCC--CEEEEEcCCcEE
Confidence            356999999999997654  677777766653


No 94 
>cd06407 PB1_NLP A PB1 domain is present in NIN like proteins (NLP), a key enzyme in a process of establishment of symbiosis betweeen legumes and nitrogen fixing bacteria (Rhizobium). The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes like osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster.  Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-inte
Probab=44.99  E-value=66  Score=19.12  Aligned_cols=47  Identities=11%  Similarity=0.224  Sum_probs=38.2

Q ss_pred             EEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCC
Q 047037           26 MRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAF   74 (85)
Q Consensus        26 ~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~an   74 (85)
                      |+++...|+-.+.+-+.+..+..=+-.++++++++.-.  .++.++--+
T Consensus         1 ~~vK~~~~~d~~r~~l~~~~~~~~L~~~i~~r~~~~~~--~~f~LkY~D   47 (82)
T cd06407           1 VRVKATYGEEKIRFRLPPSWGFTELKQEIAKRFKLDDM--SAFDLKYLD   47 (82)
T ss_pred             CEEEEEeCCeEEEEEcCCCCCHHHHHHHHHHHhCCCCC--CeeEEEEEC
Confidence            57888999999999999999999999999999997532  456555544


No 95 
>cd02789 MopB_CT_FmdC-FwdD The MopB_FmdC-FwdD CD includes the  C-terminus of subunit C of molybdenum formylmethanofuran dehydrogenase (FmdC) and subunit D of tungsten formylmethanofuran dehydrogenase (FwdD), and other related proteins. Formylmethanofuran dehydrogenase catalyzes the first step in methane formation from CO2 in methanogenic archaea and some eubacteria. Members of this CD belong to the molybdopterin_binding superfamily of proteins. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=44.76  E-value=41  Score=20.51  Aligned_cols=31  Identities=19%  Similarity=0.327  Sum_probs=24.4

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCC
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGE   76 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~   76 (85)
                      +....+|+++.|+++|+.--  ..+.|....|+
T Consensus        29 ~~~~v~i~p~dA~~lgi~~G--d~V~v~~~~G~   59 (106)
T cd02789          29 ACAYCEINPEDYKLLGKPEG--DKVKVTSEFGE   59 (106)
T ss_pred             CCcEEEECHHHHHHcCCCCC--CEEEEEcCCEE
Confidence            45578999999999996655  67777777774


No 96 
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=44.61  E-value=45  Score=24.27  Aligned_cols=25  Identities=16%  Similarity=0.335  Sum_probs=20.6

Q ss_pred             cCCeEEEEEEECCEE--EEEEEeCCCC
Q 047037           22 VLETMRIQGSIKGKS--VVILIDSGST   46 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~--v~~LiDSGat   46 (85)
                      ....+.+.-.++|+.  +..|+|||..
T Consensus       155 ~~~~~~v~i~~~g~~~~~~alvDTGN~  181 (288)
T TIGR02854       155 DKQIYELEICLDGKKVTIKGFLDTGNQ  181 (288)
T ss_pred             hceEEEEEEEECCEEEEEEEEEecCCc
Confidence            457788888899987  6889999964


No 97 
>cd00508 MopB_CT_Fdh-Nap-like This CD includes formate dehydrogenases (Fdh) H and N; nitrate reductases, Nap and Nas; and other related proteins. Formate dehydrogenase H is a component of the anaerobic formate hydrogen lyase complex  and catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. Formate dehydrogenase N (alpha subunit) is the major electron donor to the bacterial nitrate respiratory chain and nitrate reductases, Nap and Nas, catalyze the reduction of nitrate to nitrite. This CD (MopB_CT_Fdh-Nap-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=43.84  E-value=38  Score=20.37  Aligned_cols=30  Identities=17%  Similarity=0.225  Sum_probs=23.0

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++||.-  ...+.|....|+.
T Consensus        35 ~~v~inp~dA~~lgi~~--Gd~V~v~~~~G~~   64 (120)
T cd00508          35 PFVEIHPEDAARLGIKD--GDLVRVSSRRGSV   64 (120)
T ss_pred             CEEEECHHHHHHcCCCC--CCEEEEEeCCEEE
Confidence            35799999999999654  3777887777753


No 98 
>cd02786 MopB_CT_3 The MopB_CT_3 CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=42.86  E-value=40  Score=20.39  Aligned_cols=30  Identities=10%  Similarity=0.057  Sum_probs=23.4

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++||.--  ..++|....|+.
T Consensus        31 ~~v~i~p~dA~~lgi~~G--d~V~v~s~~G~~   60 (116)
T cd02786          31 PTLLIHPADAAARGIADG--DLVVVFNDRGSV   60 (116)
T ss_pred             CEEEECHHHHHHcCCCCC--CEEEEEcCCeEE
Confidence            357999999999997654  677777777754


No 99 
>PF12812 PDZ_1:  PDZ-like domain
Probab=42.30  E-value=63  Score=19.01  Aligned_cols=35  Identities=11%  Similarity=0.185  Sum_probs=26.8

Q ss_pred             EEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCC
Q 047037           38 VILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFG   75 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG   75 (85)
                      +++--.||+..=|+.+.|+.+++++.   -+-+..++|
T Consensus         6 r~v~~~Ga~f~~Ls~q~aR~~~~~~~---gv~v~~~~g   40 (78)
T PF12812_consen    6 RFVEVCGAVFHDLSYQQARQYGIPVG---GVYVAVSGG   40 (78)
T ss_pred             EEEEEcCeecccCCHHHHHHhCCCCC---EEEEEecCC
Confidence            56777899999999999999998776   444444444


No 100
>PF00455 DeoRC:  DeoR C terminal sensor domain;  InterPro: IPR014036 The deoR-type HTH domain is a DNA-binding, helix-turn-helix (HTH) domain of about 50-60 amino acids present in transcription regulators of the deoR family, involved in sugar catabolism. This family of prokaryotic regulators is named after Escherichia coli deoR, a repressor of the deo operon, which encodes nucleotide and deoxyribonucleotide catabolic enzymes. DeoR also negatively regulates the expression of nupG and tsx, a nucleoside-specific transport protein and a channel-forming protein, respectively. DeoR-like transcription repressors occur in diverse bacteria as regulators of sugar and nucleoside metabolic systems. The effector molecules for deoR-like regulators are generally phosphorylated intermediates of the relevant metabolic pathway. The DNA-binding deoR-type HTH domain occurs usually in the N-terminal part. The C-terminal part can contain an effector-binding domain and/or an oligomerization domain. DeoR occurs as an octamer, whilst glpR and agaR are tetramers. Several operators may be bound simultaneously, which could facilitate DNA looping [, ].
Probab=41.93  E-value=7.7  Score=25.64  Aligned_cols=42  Identities=26%  Similarity=0.368  Sum_probs=30.7

Q ss_pred             EEEEeCCCCceeeCHHHHHhcCCcccC-----------CCcEEEEeCCCCEEE
Q 047037           38 VILIDSGSTHNSINAEVAQRVNLSPNA-----------NNRLEIMVAFGEKLM   79 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l~l~~~~-----------~~~~~V~~anG~~~~   79 (85)
                      .+.+|+|+|...+-+.+..+.++.+.+           .+..+|.+.+|+.-.
T Consensus        22 ~Ifld~GtT~~~la~~L~~~~~ltVvTnsl~ia~~l~~~~~~~vi~~GG~~~~   74 (161)
T PF00455_consen   22 TIFLDSGTTTLELAKYLPDKKNLTVVTNSLPIANELSENPNIEVILLGGEVNP   74 (161)
T ss_pred             EEEEECchHHHHHHHHhhcCCceEEEECCHHHHHHHHhcCceEEEEeCCEEEc
Confidence            467899999999999999986665543           235677777775443


No 101
>PF03539 Spuma_A9PTase:  Spumavirus aspartic protease (A9);  InterPro: IPR001641 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Aspartic endopeptidases 3.4.23. from EC of vertebrate, fungal and retroviral origin have been characterised []. More recently, aspartic endopeptidases associated with the processing of bacterial type 4 prepilin [] and archaean preflagellin have been described [, ]. Structurally, aspartic endopeptidases are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localised between the two lobes of the molecule. One lobe has probably evolved from the other through a gene duplication event in the distant past. In modern-day enzymes, although the three-dimensional structures are very similar, the amino acid sequences are more divergent, except for the catalytic site motif, which is very conserved. The presence and position of disulphide bridges are other conserved features of aspartic peptidases. All or most aspartate peptidases are endopeptidases. These enzymes have been assigned into clans (proteins which are evolutionary related), and further sub-divided into families, largely on the basis of their tertiary structure. This group of aspartic peptidases belong to MEROPS peptidase family A9 (spumapepsin family, clan AA). Foamy viruses are single-stranded enveloped retroviruses that have been noted to infect monkeys, cats and humans. In the human virus, the aspartic protease is encoded by the retroviral gag gene [], and in monkeys by the pol gene []. At present, the virus has not been proven to cause any particular disease. However, studies have shown Human foamy virus causes neurological disorders in infected mice []. It is not clear whether the Foamy virus/spumavirus proteases share a common evolutionary origin with other aspartic proteases. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis; PDB: 2JYS_A.
Probab=41.91  E-value=33  Score=23.35  Aligned_cols=42  Identities=26%  Similarity=0.268  Sum_probs=25.0

Q ss_pred             ECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           32 IKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        32 i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      |.|..+..--||||+.+.|++.+-..-    .+.....+++-.|+.
T Consensus         1 ikg~~l~~~wDsga~ITCiP~~fl~~E----~Pi~~~~i~Tihg~~   42 (163)
T PF03539_consen    1 IKGTKLKGHWDSGAQITCIPESFLEEE----QPIGKTLIKTIHGEK   42 (163)
T ss_dssp             ETTEEEEEEE-TT-SSEEEEGGGTTT-------SEEEEEE-SS-EE
T ss_pred             CCCceeeEEecCCCeEEEccHHHhCcc----ccccceEEEEecCce
Confidence            578889999999999999998875431    122344555555543


No 102
>KOG3101 consensus Esterase D [General function prediction only]
Probab=41.80  E-value=12  Score=27.37  Aligned_cols=25  Identities=28%  Similarity=0.452  Sum_probs=20.3

Q ss_pred             EECCEEEEEEEeCCCCceeeCHHHH
Q 047037           31 SIKGKSVVILIDSGSTHNSINAEVA   55 (85)
Q Consensus        31 ~i~g~~v~~LiDSGat~~Fi~~~~a   55 (85)
                      .+++....+|||-|+..+|+..+|-
T Consensus       210 ~y~~~~~~ilIdqG~~D~Fl~~qLl  234 (283)
T KOG3101|consen  210 NYRGVGDDILIDQGAADNFLAEQLL  234 (283)
T ss_pred             hcCCCCccEEEecCccchhhhhhcC
Confidence            4566777899999999999986554


No 103
>cd02791 MopB_CT_Nitrate-R-NapA-like Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite. Monomeric Nas is located in the cytoplasm and participates in nitrogen assimilation. Dimeric Nap is located in the periplasm and is coupled to quinol oxidation via a membrane-anchored tetraheme cytochrome. This CD (MopB_CT_Nitrate-R-Nap) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs
Probab=41.37  E-value=42  Score=20.37  Aligned_cols=30  Identities=23%  Similarity=0.264  Sum_probs=23.1

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++|+.--  ..+.|....|+.
T Consensus        35 ~~v~in~~dA~~lgi~~G--d~V~v~~~~G~~   64 (122)
T cd02791          35 PYVEIHPEDAARLGLKEG--DLVRVTSRRGEV   64 (122)
T ss_pred             CEEEECHHHHHHcCCCCC--CEEEEEcCCEEE
Confidence            347999999999997443  777777777754


No 104
>cd02784 MopB_CT_PHLH The MopB_CT_PHLH CD includes a group of related uncharacterized putative hydrogenase-like homologs (PHLH) of molybdopterin binding proteins. This CD is of the PHLH region homologous to the conserved molybdopterin-binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=40.59  E-value=89  Score=20.25  Aligned_cols=33  Identities=9%  Similarity=0.061  Sum_probs=25.0

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEEe
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMSS   81 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~~   81 (85)
                      ...+|+++.|+++||.-  ...++|....|. +...
T Consensus        38 ~~v~InP~dA~~lGI~d--GD~V~V~s~~G~-i~~~   70 (137)
T cd02784          38 NAALVSPRTAEALGLLQ--GDVVRIRRGGRT-IELP   70 (137)
T ss_pred             ceEEECHHHHHHcCCCC--CCEEEEEeCCeE-EEEE
Confidence            34699999999999754  478888888884 4433


No 105
>KOG3838 consensus Mannose lectin ERGIC-53, involved in glycoprotein traffic [Intracellular trafficking, secretion, and vesicular transport]
Probab=40.54  E-value=1e+02  Score=24.30  Aligned_cols=71  Identities=23%  Similarity=0.244  Sum_probs=48.5

Q ss_pred             cccCCceeEEEecccc----------------------CcCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhc-CCc
Q 047037            5 CENATPKISLHDVARD----------------------QVLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRV-NLS   61 (85)
Q Consensus         5 ~e~~~~~iSl~A~~g~----------------------~~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l-~l~   61 (85)
                      -.-..|.||+-+--|+                      ..|-.+|.+-..-+..+.++||.|-|-+ =+.++.-|. ++-
T Consensus       147 ~qknnP~Is~~lndGt~~ydh~~DGasQ~LssCqrDFRNkPyPvRarItY~~nvLtv~innGmtp~-d~yE~C~rve~~~  225 (497)
T KOG3838|consen  147 GQKNNPAISVLLNDGTIPYDHPGDGASQGLSSCQRDFRNKPYPVRARITYYGNVLTVMINNGMTPS-DDYEFCVRVENLL  225 (497)
T ss_pred             CCcCCccEEEEecCCcccccCCCccHHHHHHHhhHHhccCCCCceEEEEEeccEEEEEEcCCCCCC-CCcceeEecccee
Confidence            3345799998766665                      2234567777778999999999999988 677776655 333


Q ss_pred             ccCCCcEEEEeCCCC
Q 047037           62 PNANNRLEIMVAFGE   76 (85)
Q Consensus        62 ~~~~~~~~V~~anG~   76 (85)
                      +.+.--+-|..|-|.
T Consensus       226 lp~nGyFGvSAATGg  240 (497)
T KOG3838|consen  226 LPPNGYFGVSAATGG  240 (497)
T ss_pred             ccCCCeeeeeecccc
Confidence            333455667666654


No 106
>PF09379 FERM_N:  FERM N-terminal domain ;  InterPro: IPR018979  This domain is the N-terminal ubiquitin-like structural domain of the FERM domain.  The FERM domain (F for 4.1 protein, E for ezrin, R for radixin and M for moesin) is a widespread protein module involved in localising proteins to the plasma membrane []. FERM domains are found in a number of cytoskeletal-associated proteins that associate with various proteins at the interface between the plasma membrane and the cytoskeleton. The FERM domain is located at the N terminus of the majority of FERM-containing proteins [, ], which includes:    Band 4.1, which links the spectrin-actin cytoskeleton of erythrocytes to the plasma membrane. Ezrin, a component of the undercoat of the microvilli plasma membrane. Moesin, which is probably involved in binding major cytoskeletal structures to the plasma membrane. Radixin, which is involved in the binding of the barbed end of actin filaments to the plasma membrane in the undercoat of the cell- to-cell adherens junction. Talin, a cytoskeletal protein concentrated in regions of cell-substratum contact and, in lymphocytes, of cell-cell contacts. Filopodin, a slime mold protein that binds actin and which is involved in the control of cell motility and chemotaxis. Merlin (or schwannomin). Protein NBL4. Unconventional myosins X, VIIa and XV, which are mutated in congenital deafness. Focal-adhesion kinases (FAKs), cytoplasmic protein tyrosine kinases involved in signalling through integrins. Janus tyrosine kinases (JAKs), cytoplasmic tyrosine kinases that are non-covalently associated with the cytoplasmic tails of receptors for cytokines or polypeptidic hormones. Non-receptor tyrosine-protein kinase TYK2. Protein-tyrosine phosphatases PTPN3 and PTPN4, enzyme that appear to act at junctions between the membrane and the cytoskeleton. Protein-tyrosine phosphatases PTPN14 and PTP-D1, PTP-RL10 and PTP2E.  Caenorhabditis elegans protein phosphatase ptp-1.   Ezrin, moesin, and radixin are highly related proteins (ERM protein family), but the other proteins in which the FERM domain is found do not share any region of similarity outside of this domain. ERM proteins are made of three domains, the FERM domain, a central helical domain and a C-terminal tail domain, which binds F-actin. The amino-acid sequence of the FERM domain is highly conserved among ERM proteins and is responsible for membrane association by direct binding to the cytoplasmic domain or tail of integral membrane proteins. ERM proteins are regulated by an intramolecular association of the FERM and C-terminal tail domains that masks their binding sites for other molecules. For cytoskeleton-membrane cross-linking, the dormant molecules becomes activated and the FERM domain attaches to the membrane by binding specific membrane proteins, while the last 34 residues of the tail bind actin filaments. Aside from binding to membranes, the activated FERM domain of ERM proteins can also bind the guanine nucleotide dissociation inhibitor of Rho GTPase (RhoDGI), which suggests that in addition to functioning as a cross-linker, ERM proteins may influence Rho signalling pathways. The crystal structure of the FERM domain reveals that it is composed of three structural modules (F1, F2, and F3) that together form a compact clover-shaped structure []. The FERM domain has also been called the amino-terminal domain, the 30kDa domain, 4.1N30, the membrane-cytoskeletal-linking domain, the ERM-like domain, the ezrin-like domain of the band 4.1 superfamily, the conserved N-terminal region, and the membrane attachment domain [].; PDB: 1EF1_B 1SGH_A 1E5W_A 2KC2_A 2KMA_A 3IVF_A 1H4R_B 3U8Z_A 1ISN_A 3BIN_A ....
Probab=40.18  E-value=41  Score=18.95  Aligned_cols=45  Identities=16%  Similarity=0.241  Sum_probs=34.8

Q ss_pred             EECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEE-EeCCC
Q 047037           31 SIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEI-MVAFG   75 (85)
Q Consensus        31 ~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V-~~anG   75 (85)
                      ..+|....+-||..+|.--+=..++++++|.-..-..+.+ ...+|
T Consensus         3 llD~~~~~~~v~~~~t~~~l~~~v~~~l~l~e~~~FgL~~~~~~~~   48 (80)
T PF09379_consen    3 LLDGTTKTFEVDPKTTGQDLLEQVCDKLGLKEKEYFGLQYQVDKDG   48 (80)
T ss_dssp             ESSEEEEEEEEETTSBHHHHHHHHHHHHTTSSGGGEEEEE-EBTTS
T ss_pred             CcCCCcEEEEEcCCCcHHHHHHHHHHHcCCCCccEEEEEEeecCCC
Confidence            3577889999999999999999999999987544455565 33344


No 107
>TIGR00016 ackA acetate kinase. Acetate kinase is involved in the activation of acetate to acetyl CoA and in the secretion of acetate. It catalyzes the reaction ATP + acetate = ADP + acetyl phosphate. Some members of this family have been shown to act on propionate as well as acetate. An example of a propionate/acetate kinase is TdcD of E. coli, an enzyme of an anaerobic pathway of threonine catabolism. It is not known how many members of this family act on additional substrates besides acetate.
Probab=40.13  E-value=15  Score=28.35  Aligned_cols=37  Identities=16%  Similarity=0.257  Sum_probs=32.9

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      |-+|-|+..++|+.+|-+....+-+...++||-++..
T Consensus       185 GlS~~~va~~~a~~lg~~~~~~~~Iv~HLG~G~Si~A  221 (404)
T TIGR00016       185 GTSHKYVTQRAAKILNKPLDDLNLIVCHLGNGASVCA  221 (404)
T ss_pred             HHHHHHHHHHHHHHhCCChhHcCEEEEEeCCCceeee
Confidence            5689999999999999888888999999999988765


No 108
>cd02780 MopB_CT_Tetrathionate_Arsenate-R This CD contains the molybdopterin_binding C-terminal (MopB_CT) region of tetrathionate reductase, subunit A, (TtrA); respiratory arsenate As(V) reductase, catalytic subunit (ArrA); and other related proteins.
Probab=40.12  E-value=53  Score=20.88  Aligned_cols=30  Identities=20%  Similarity=0.304  Sum_probs=24.0

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++|+.--  ..+.|...+|+.
T Consensus        30 ~~v~inp~dA~~lgI~~G--d~V~v~s~~G~i   59 (143)
T cd02780          30 NPVWINPEDAAKLGIKTG--DRVRVVTPGGSV   59 (143)
T ss_pred             CEEEECHHHHHHcCCCCC--CEEEEEeCCceE
Confidence            457999999999997544  788888888854


No 109
>PRK05654 acetyl-CoA carboxylase subunit beta; Validated
Probab=39.82  E-value=86  Score=23.10  Aligned_cols=54  Identities=22%  Similarity=0.301  Sum_probs=39.6

Q ss_pred             cCCeEEEEEEECCEEEEEEEe-----CCCCceeeCHHHHHhcCCcccCCCcEEEEeCCC
Q 047037           22 VLETMRIQGSIKGKSVVILID-----SGSTHNSINAEVAQRVNLSPNANNRLEIMVAFG   75 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~v~~LiD-----SGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG   75 (85)
                      ..+.+...|+|+|+++.+...     .||-.....+++++-..+-....-|+..-++.|
T Consensus       108 ~d~vVtG~g~I~G~~V~v~a~D~~f~gGS~g~~~~eKi~r~~e~A~~~~lPlV~l~dsg  166 (292)
T PRK05654        108 KDAVVTGKGTIEGMPVVLAVMDFSFMGGSMGSVVGEKIVRAVERAIEEKCPLVIFSASG  166 (292)
T ss_pred             CCcEEEEEEEECCEEEEEEEEecccccCCccHHHHHHHHHHHHHHHHcCCCEEEEEcCC
Confidence            467888899999999998887     788888888887776666555555554444433


No 110
>cd02777 MopB_CT_DMSOR-like The MopB_CT_DMSOR-like CD contains dimethylsulfoxide reductase (DMSOR), biotin sulfoxide reductase (BSOR),  trimethylamine N-oxide reductase (TMAOR) and other related proteins. DMSOR always catalyzes the reduction of DMSO to dimethylsulfide, but its cellular location and oligomerization state are organism-dependent. For example, in Rhodobacter sphaeriodes and Rhodobacter capsulatus, it is an 82-kDa monomeric soluble protein found in the periplasmic space; in E. coli, it is membrane-bound and exists as a heterotrimer. BSOR catalyzes the reduction of biotin sulfixode to biotin, and is unique among Mo enzymes because no additional auxiliary proteins or cofactors are required. TMAOR is similar to DMSOR, but its only natural substrate is TMAO. Also included in this group is the pyrogallol-phloroglucinol transhydroxylase from Pelobacter acidigallici. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB hom
Probab=38.38  E-value=48  Score=20.55  Aligned_cols=30  Identities=13%  Similarity=0.049  Sum_probs=23.0

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++|+.--  ..++|....|..
T Consensus        34 ~~v~i~p~dA~~lgi~~G--d~V~v~s~~g~i   63 (127)
T cd02777          34 EPVWINPLDAAARGIKDG--DIVRVFNDRGAV   63 (127)
T ss_pred             CeEEECHHHHHHcCCCCC--CEEEEEcCCeEE
Confidence            347999999999997644  677777777743


No 111
>PRK12379 propionate/acetate kinase; Provisional
Probab=37.86  E-value=13  Score=28.64  Aligned_cols=37  Identities=14%  Similarity=0.297  Sum_probs=32.3

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      |-+|-|+.+++|+.+|-+....+-+...++||-++..
T Consensus       176 GlS~~~va~~~a~~lg~~~~~~~lIv~HLG~G~Si~A  212 (396)
T PRK12379        176 GTSHRYVSQRAHSLLNLDEDDSGLVVAHLGNGASICA  212 (396)
T ss_pred             HHHHHHHHHHHHHHhCCChhHCCEEEEEeCCCcchhe
Confidence            5688999999999999888888999999999987754


No 112
>TIGR00515 accD acetyl-CoA carboxylase, carboxyl transferase, beta subunit. The enzyme acetyl-CoA carboxylase contains a biotin carboxyl carrier protein or domain, a biotin carboxylase, and a carboxyl transferase. This model represents the beta chain of the carboxyl transferase for cases in which the architecture of the protein is as in E. coli, in which the carboxyltransferase portion consists of two non-identical subnits, alpha and beta.
Probab=37.57  E-value=1e+02  Score=22.70  Aligned_cols=54  Identities=19%  Similarity=0.258  Sum_probs=40.5

Q ss_pred             cCCeEEEEEEECCEEEEEEEe-----CCCCceeeCHHHHHhcCCcccCCCcEEEEeCCC
Q 047037           22 VLETMRIQGSIKGKSVVILID-----SGSTHNSINAEVAQRVNLSPNANNRLEIMVAFG   75 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~v~~LiD-----SGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG   75 (85)
                      ..+.+...|+|+|+++.+...     -||-.....+++++-..+-....-|+..-++.|
T Consensus       107 ~d~vVtG~g~I~G~~V~v~a~D~~f~gGSmg~~~geKi~r~~e~A~~~~lPlV~l~dSg  165 (285)
T TIGR00515       107 KDAVVTGKGTLYGMPIVVAVFDFAFMGGSMGSVVGEKFVRAIEKALEDNCPLIIFSASG  165 (285)
T ss_pred             CCcEEEEEEEECCEEEEEEEEeccccCCCccHHHHHHHHHHHHHHHHcCCCEEEEEcCC
Confidence            458889999999999999888     588888888887776666666555655555443


No 113
>PF02645 DegV:  Uncharacterised protein, DegV family COG1307;  InterPro: IPR003797 This family of proteins is related to DegV of Bacillus subtilis and includes paralogous sets in several species (B. subtilis, Deinococcus radiodurans, Mycoplasma pneumoniae) that are closer in percent identity to each other than to most homologs from other species. This suggests both recent paralogy and diversity of function.; PDB: 2DT8_A 3LUP_A 3NYI_B 3PL5_A 1PZX_B 1MGP_A 1VPV_B 3FYS_A 3EGL_C 3JR7_A ....
Probab=37.40  E-value=36  Score=24.29  Aligned_cols=32  Identities=25%  Similarity=0.353  Sum_probs=22.6

Q ss_pred             EEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCC
Q 047037           37 VVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAF   74 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~an   74 (85)
                      +-+++||++.   |++++++++++.+.   |+.|...+
T Consensus         2 i~IitDS~~d---l~~~~~~~~~i~vv---Pl~i~~~~   33 (280)
T PF02645_consen    2 IAIITDSTSD---LPPELAEEYGIYVV---PLNIIIDG   33 (280)
T ss_dssp             EEEEEEGGG------HHHHHHTTEEEE-----EEEETT
T ss_pred             EEEEECCCCC---CCHHHHHhCCeEEE---eEEEecCC
Confidence            5688999976   68999999998888   77777654


No 114
>cd02782 MopB_CT_1 The MopB_CT_1 CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=37.33  E-value=59  Score=20.10  Aligned_cols=31  Identities=16%  Similarity=0.254  Sum_probs=23.8

Q ss_pred             CCceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           45 STHNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        45 at~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ....+|+++.|+++||.-  ...++|....|+.
T Consensus        32 ~~~v~i~p~dA~~~gi~~--Gd~V~v~s~~g~~   62 (129)
T cd02782          32 RCTLRIHPDDAAALGLAD--GDKVRVTSAAGSV   62 (129)
T ss_pred             CceEEECHHHHHHcCCCC--CCEEEEEcCCCeE
Confidence            345799999999999754  3777888777754


No 115
>cd04459 Rho_CSD Rho_CSD: Rho protein cold-shock domain (CSD). Rho protein is a transcription termination factor in most bacteria. In bacteria, there are two distinct mechanisms for mRNA transcription termination. In intrinsic termination, RNA polymerase and nascent mRNA are released from DNA template by an mRNA stem loop structure, which resembles the transcription termination mechanism used by eukaryotic pol III. The second mechanism is mediated by Rho factor. Rho factor terminates transcription by using energy from ATP hydrolysis to forcibly dissociate the transcripts from RNA polymerase. Rho protein contains an N-terminal S1-like domain, which binds single-stranded RNA. Rho has a C-terminal ATPase domain which hydrolyzes ATP to provide energy to strip RNA polymerase and mRNA from the DNA template. Rho functions as a homohexamer.
Probab=36.95  E-value=36  Score=19.76  Aligned_cols=21  Identities=19%  Similarity=0.314  Sum_probs=18.3

Q ss_pred             CCCCceeeCHHHHHhcCCccc
Q 047037           43 SGSTHNSINAEVAQRVNLSPN   63 (85)
Q Consensus        43 SGat~~Fi~~~~a~~l~l~~~   63 (85)
                      .|....|++..+.+|++|..-
T Consensus        24 ~~~~DvyVs~~~Irr~~LR~G   44 (68)
T cd04459          24 PGPDDIYVSPSQIRRFNLRTG   44 (68)
T ss_pred             CCCCCEEECHHHHHHhCCCCC
Confidence            588899999999999997654


No 116
>cd01812 BAG1_N Ubiquitin-like domain of BAG1. BAG1_N  N-terminal ubiquitin-like (Ubl) domain of the BAG1 protein.  This domain occurs together with the BAG domain and is closely related to the Ubl domain of a family of deubiquitinases that includes Rpn11, UBP6 (USP14), USP7 (HAUSP).
Probab=36.56  E-value=60  Score=17.74  Aligned_cols=37  Identities=22%  Similarity=0.292  Sum_probs=30.8

Q ss_pred             EEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCccc
Q 047037           27 RIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPN   63 (85)
Q Consensus        27 ~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~   63 (85)
                      ++..+-+|+...+=++..+|..-+-.+++++.|++.+
T Consensus         2 ~i~vk~~g~~~~i~v~~~~tv~~lK~~i~~~~gi~~~   38 (71)
T cd01812           2 RVRVKHGGESHDLSISSQATFGDLKKMLAPVTGVEPR   38 (71)
T ss_pred             EEEEEECCEEEEEEECCCCcHHHHHHHHHHhhCCChH
Confidence            4556667888888899999999999999999998775


No 117
>PF04083 Abhydro_lipase:  Partial alpha/beta-hydrolase lipase region;  InterPro: IPR006693 The alpha/beta hydrolase fold is common to several hydrolytic enzymes of widely differing phylogenetic origin and catalytic function. The core of each enzyme is similar: an alpha/beta sheet, not barrel, of eight beta-sheets connected by alpha-helices []. This entry represents the N-terminal part of an alpha/beta hydrolase domain found in a number of lipases.; GO: 0006629 lipid metabolic process; PDB: 1K8Q_B 1HLG_B.
Probab=35.85  E-value=50  Score=18.68  Aligned_cols=26  Identities=15%  Similarity=0.119  Sum_probs=17.7

Q ss_pred             HHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           52 AEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        52 ~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      .+++++.|.+.+   ...|.+.||-.+.-
T Consensus         2 ~~~i~~~GY~~E---~h~V~T~DGYiL~l   27 (63)
T PF04083_consen    2 PELIEKHGYPCE---EHEVTTEDGYILTL   27 (63)
T ss_dssp             HHHHHHTT---E---EEEEE-TTSEEEEE
T ss_pred             HHHHHHcCCCcE---EEEEEeCCCcEEEE
Confidence            467889998888   78999999977754


No 118
>PF13188 PAS_8:  PAS domain; PDB: 2JHE_D 3VOL_A.
Probab=35.66  E-value=57  Score=17.24  Aligned_cols=24  Identities=21%  Similarity=0.328  Sum_probs=18.8

Q ss_pred             EEEEEeCCCCceeeCHHHHHhcCCc
Q 047037           37 VVILIDSGSTHNSINAEVAQRVNLS   61 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l~l~   61 (85)
                      -++++| |....++++.+.+-+|.+
T Consensus        13 ~i~i~d-~~~i~~~N~~~~~l~g~~   36 (64)
T PF13188_consen   13 GILIID-GGRIIYVNPAFEELFGYS   36 (64)
T ss_dssp             EEEEEE-TSBEEEE-HHHHHHHCS-
T ss_pred             ceEEEE-CCChHHhhHHHHHHhCCC
Confidence            356689 778999999999999976


No 119
>PF05618 Zn_protease:  Putative ATP-dependant zinc protease;  InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=35.33  E-value=39  Score=22.11  Aligned_cols=22  Identities=23%  Similarity=0.350  Sum_probs=14.3

Q ss_pred             EEEEEEEeCCCCceeeCHHHHH
Q 047037           35 KSVVILIDSGSTHNSINAEVAQ   56 (85)
Q Consensus        35 ~~v~~LiDSGat~~Fi~~~~a~   56 (85)
                      ..+.+=|||||..+-|+..-.+
T Consensus        15 ~~~~aKiDTGA~tSSLhA~~I~   36 (138)
T PF05618_consen   15 LTIKAKIDTGAKTSSLHATDIE   36 (138)
T ss_dssp             EEEEEEE-TT-SSEEEE-EEEE
T ss_pred             CEEEEEEcCCCcccceeecceE
Confidence            3466677999999999876555


No 120
>PF04748 Polysacc_deac_2:  Divergent polysaccharide deacetylase;  InterPro: IPR006837 This is a family of uncharacterised proteins that includes YibQ.; PDB: 2QV5_A 2NLY_A.
Probab=35.02  E-value=24  Score=24.59  Aligned_cols=27  Identities=22%  Similarity=0.439  Sum_probs=17.8

Q ss_pred             EEEEeCCCCceeeCHHHHHhcCCcccC
Q 047037           38 VILIDSGSTHNSINAEVAQRVNLSPNA   64 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l~l~~~~   64 (85)
                      ++.|||..+-.-+-.+.|+++|+|.-.
T Consensus       120 l~FvDS~T~~~s~a~~~A~~~gvp~~~  146 (213)
T PF04748_consen  120 LFFVDSRTTPRSVAPQVAKELGVPAAR  146 (213)
T ss_dssp             -EEEE-S--TT-SHHHHHHHCT--EEE
T ss_pred             CEEEeCCCCcccHHHHHHHHcCCCEEe
Confidence            578899999999999999999998654


No 121
>PRK04923 ribose-phosphate pyrophosphokinase; Provisional
Probab=34.80  E-value=40  Score=25.03  Aligned_cols=38  Identities=16%  Similarity=0.278  Sum_probs=29.7

Q ss_pred             EEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           39 ILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        39 ~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      ++|=+|+++-.+.+++|++||+++.  ..-.-+-+||+..
T Consensus         7 ~~i~~g~~~~~La~~ia~~lg~~l~--~~~~~~FpdGE~~   44 (319)
T PRK04923          7 LLVFSGNANKPLAQSICKELGVRMG--KALVTRFSDGEVQ   44 (319)
T ss_pred             eEEEECCCCHHHHHHHHHHhCCcee--eeEEEECCCCCEE
Confidence            3556889999999999999998877  5556667788753


No 122
>PRK12397 propionate kinase; Reviewed
Probab=34.72  E-value=15  Score=28.48  Aligned_cols=37  Identities=14%  Similarity=0.298  Sum_probs=32.5

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      |-+|-|+.+++|+.+|-+....+-+...++||-++..
T Consensus       180 GlS~~yva~~~a~~lg~~~~~~~lIv~HLG~GaSi~A  216 (404)
T PRK12397        180 GTSHKYVSGVLAEKLGVPLSALRVICCHLGNGSSICA  216 (404)
T ss_pred             HHHHHHHHHHHHHHhCCChhHCCEEEEEeCCCcchhe
Confidence            6689999999999999888888999999999987754


No 123
>cd02793 MopB_CT_DMSOR-BSOR-TMAOR The MopB_DMSOR-BSOR-TMAOR CD contains dimethylsulfoxide reductase (DMSOR), biotin sulfoxide reductase (BSOR),  trimethylamine N-oxide reductase (TMAOR) and other related proteins. DMSOR always catalyzes the reduction of DMSO to dimethylsulfide, but its cellular location and oligomerization state are organism-dependent. For example, in Rhodobacter sphaeriodes and Rhodobacter capsulatus, it is an 82-kDa monomeric soluble protein found in the periplasmic space; in E. coli, it is membrane-bound and exists as a heterotrimer. BSOR catalyzes the reduction of biotin sulfixode to biotin, and is unique among Mo enzymes because no additional auxiliary proteins or cofactors are required. TMAOR is similar to DMSOR, but its only natural substrate is TMAO.This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=34.64  E-value=58  Score=20.35  Aligned_cols=31  Identities=13%  Similarity=0.101  Sum_probs=23.9

Q ss_pred             CCceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           45 STHNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        45 at~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      -...+|+++.|+++|+.--  ..++|....|+.
T Consensus        32 ~~~v~i~p~dA~~~gi~~G--d~V~v~s~~G~~   62 (129)
T cd02793          32 REPIRINPADAAARGIADG--DIVRVFNDRGAC   62 (129)
T ss_pred             CCEEEECHHHHHHcCCCCC--CEEEEEcCCEEE
Confidence            3457999999999996654  677787777754


No 124
>KOG1339 consensus Aspartyl protease [Posttranslational modification, protein turnover, chaperones]
Probab=34.60  E-value=34  Score=25.64  Aligned_cols=21  Identities=19%  Similarity=0.213  Sum_probs=17.2

Q ss_pred             EEEEEeCCCCceeeCHHHHHh
Q 047037           37 VVILIDSGSTHNSINAEVAQR   57 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~   57 (85)
                      .-+++|||.+..+++.+..+.
T Consensus       269 ~~~iiDSGTs~t~lp~~~y~~  289 (398)
T KOG1339|consen  269 GGAIIDSGTSLTYLPTSAYNA  289 (398)
T ss_pred             CCEEEECCcceeeccHHHHHH
Confidence            457999999999999887443


No 125
>CHL00174 accD acetyl-CoA carboxylase beta subunit; Reviewed
Probab=34.59  E-value=1.1e+02  Score=22.71  Aligned_cols=54  Identities=19%  Similarity=0.246  Sum_probs=40.7

Q ss_pred             cCCeEEEEEEECCEEEEEEEe-----CCCCceeeCHHHHHhcCCcccCCCcEEEEeCCC
Q 047037           22 VLETMRIQGSIKGKSVVILID-----SGSTHNSINAEVAQRVNLSPNANNRLEIMVAFG   75 (85)
Q Consensus        22 ~~~~~~~~g~i~g~~v~~LiD-----SGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG   75 (85)
                      ..+.+...|+|+|+++.+..+     -||-....-+++++-..+-++..-|+..-++.|
T Consensus       120 ~dgVVtG~G~I~Gr~v~v~a~Dftf~gGSmG~v~geKi~ra~e~A~~~rlPlV~l~~SG  178 (296)
T CHL00174        120 TDAVQTGIGQLNGIPVALGVMDFQFMGGSMGSVVGEKITRLIEYATNESLPLIIVCASG  178 (296)
T ss_pred             CccEEEEEEEECCEEEEEEEECCcccccCcCHHHHHHHHHHHHHHHHcCCCEEEEECCC
Confidence            457888899999999888776     588888888888877776666665665555543


No 126
>PRK00934 ribose-phosphate pyrophosphokinase; Provisional
Probab=34.27  E-value=30  Score=25.08  Aligned_cols=33  Identities=24%  Similarity=0.151  Sum_probs=25.9

Q ss_pred             CCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           43 SGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        43 SGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      +|+++-.+..++|++||+++.  ..-.-+-+||+.
T Consensus         4 ~~~~~~~la~~ia~~l~~~~~--~~~~~~FpdGE~   36 (285)
T PRK00934          4 GGSASQLLASEVARLLNTELA--LVETKRFPDGEL   36 (285)
T ss_pred             eCCCCHHHHHHHHHHHCCceE--eeEEEECCCCCE
Confidence            488888999999999998877  455556677764


No 127
>PRK08351 DNA-directed RNA polymerase subunit E''; Validated
Probab=33.73  E-value=51  Score=18.88  Aligned_cols=28  Identities=21%  Similarity=0.452  Sum_probs=19.8

Q ss_pred             EEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEE
Q 047037           37 VVILIDSGSTHNSINAEVAQRVNLSPNANNRLEI   70 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V   70 (85)
                      +++++|.=      ...+|+++|+...-...++|
T Consensus        33 ~viI~dPe------~S~IAk~l~i~~pG~YAlkV   60 (61)
T PRK08351         33 LVIIIDVE------NSRIAKKLGAKVPGKYAIRV   60 (61)
T ss_pred             EEEEeCCc------HhHHHHHhCCCCCCeEEEEe
Confidence            56677776      34899999986665566655


No 128
>cd00248 Mth938-like Mth938-like domain. The members of this family include: Mth938, 2P1, Xcr35, Rpa2829, and several uncharacterized sequences. Mth938 is a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. This protein crystallizes as a dimer, although it is monomeric in solution, with one disulfide bond in each monomer.  2P1 is a partially characterized nuclear protein which is homologous to E3-3 from rat and known to be alternately spliced. Xcr35 and Rpa2829 are hypothetical proteins of unknown function from the Xanthomonas campestris and Rhodopseudomonas palustris genomes, respectively, for which the crystal structures have been determined.
Probab=33.42  E-value=43  Score=20.78  Aligned_cols=28  Identities=11%  Similarity=0.238  Sum_probs=21.9

Q ss_pred             EEEEEEeCCCCceeeCHHHHHh---cCCccc
Q 047037           36 SVVILIDSGSTHNSINAEVAQR---VNLSPN   63 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~---l~l~~~   63 (85)
                      +=.++|-||.++-|+++++.+.   .|+.++
T Consensus        53 peiliiGTG~~~~~~~~~~~~~l~~~gI~vE   83 (109)
T cd00248          53 PDILLIGTGAEIAFLPRALRAALRAAGIGVE   83 (109)
T ss_pred             CCEEEEcCCCCCCcCCHHHHHHHHHcCCeEE
Confidence            4578999999999999999854   455544


No 129
>PF00240 ubiquitin:  Ubiquitin family;  InterPro: IPR000626 Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2, IPR000608 from INTERPRO), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade. There are many different E3 ligases, which are responsible for the type of ubiquitin chain formed, the specificity of the target protein, and the regulation of the ubiquitinylation process []. Ubiquitinylation is an important regulatory tool that controls the concentration of key signalling proteins, such as those involved in cell cycle control, as well as removing misfolded, damaged or mutant proteins that could be harmful to the cell. Several ubiquitin-like molecules have been discovered, such as Ufm1 (IPR005375 from INTERPRO), SUMO1 (IPR003653 from INTERPRO), NEDD8, Rad23 (IPR004806 from INTERPRO), Elongin B and Parkin (IPR003977 from INTERPRO), the latter being involved in Parkinson's disease []. Ubiquitin is a protein of 76 amino acid residues, found in all eukaryotic cells and whose sequence is extremely well conserved from protozoan to vertebrates. Ubiquitin acts through its post-translational attachment (ubiquitinylation) to other proteins, where these modifications alter the function, location or trafficking of the protein, or targets it for destruction by the 26S proteasome []. The terminal glycine in the C-terminal 4-residue tail of ubiquitin can form an isopeptide bond with a lysine residue in the target protein, or with a lysine in another ubiquitin molecule to form a ubiquitin chain that attaches itself to a target protein. Ubiquitin has seven lysine residues, any one of which can be used to link ubiquitin molecules together, resulting in different structures that alter the target protein in different ways. It appears that Lys(11)-, Lys(29) and Lys(48)-linked poly-ubiquitin chains target the protein to the proteasome for degradation, while mono-ubiquitinylated and Lys(6)- or Lys(63)-linked poly-ubiquitin chains signal reversible modifications in protein activity, location or trafficking []. For example, Lys(63)-linked poly-ubiquitinylation is known to be involved in DNA damage tolerance, inflammatory response, protein trafficking and signal transduction through kinase activation []. In addition, the length of the ubiquitin chain alters the fate of the target protein. Regulatory proteins such as transcription factors and histones are frequent targets of ubquitinylation [].; GO: 0005515 protein binding; PDB: 2DZI_A 2XEW_E 3NOB_E 2KWU_B 2Y5B_F 3PHD_G 2KWV_B 2KOX_A 2XK5_B 3NHE_B ....
Probab=32.42  E-value=45  Score=18.20  Aligned_cols=32  Identities=19%  Similarity=0.502  Sum_probs=29.2

Q ss_pred             ECCEEEEEEEeCCCCceeeCHHHHHhcCCccc
Q 047037           32 IKGKSVVILIDSGSTHNSINAEVAQRVNLSPN   63 (85)
Q Consensus        32 i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~   63 (85)
                      ++|+.+.+=|+.-.|..-+-..++++.++|..
T Consensus         3 ~~g~~~~~~v~~~~tV~~lK~~i~~~~~~~~~   34 (69)
T PF00240_consen    3 LSGKTFTLEVDPDDTVADLKQKIAEETGIPPE   34 (69)
T ss_dssp             TTSEEEEEEEETTSBHHHHHHHHHHHHTSTGG
T ss_pred             CCCcEEEEEECCCCCHHHhhhhcccccccccc
Confidence            57889999999999999999999999998776


No 130
>PF00564 PB1:  PB1 domain;  InterPro: IPR000270 The Phox and Bem1p domain, is present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.; GO: 0005515 protein binding; PDB: 1IPG_A 1IP9_A 2KFK_A 1WMH_A 1VD2_A 1WI0_A 1OEY_C 1PQS_A 1Q1O_A 1TZ1_A ....
Probab=30.57  E-value=1.1e+02  Score=17.28  Aligned_cols=51  Identities=14%  Similarity=0.294  Sum_probs=39.0

Q ss_pred             eEEEEEEECCEEEE-EEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEe--CCCCEE
Q 047037           25 TMRIQGSIKGKSVV-ILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMV--AFGEKL   78 (85)
Q Consensus        25 ~~~~~g~i~g~~v~-~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~--anG~~~   78 (85)
                      |++++...++.... +-+..+.+..-+-..++++++++   ...+.++-  .+|..+
T Consensus         1 t~~vK~~~~~~~~~~~~~~~~~s~~~L~~~i~~~~~~~---~~~~~l~Y~D~dgD~V   54 (84)
T PF00564_consen    1 TVRVKVRYGGDIRRIISLPSDVSFDDLRSKIREKFGLL---DEDFQLKYKDEDGDLV   54 (84)
T ss_dssp             SEEEEEEETTEEEEEEEECSTSHHHHHHHHHHHHHTTS---TSSEEEEEEETTSSEE
T ss_pred             CEEEEEEECCeeEEEEEcCCCCCHHHHHHHHHHHhCCC---CccEEEEeeCCCCCEE
Confidence            57888899998888 88889999999999999999987   23444443  345444


No 131
>PRK00180 acetate kinase A/propionate kinase 2; Reviewed
Probab=30.49  E-value=28  Score=26.89  Aligned_cols=37  Identities=14%  Similarity=0.248  Sum_probs=31.5

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      |-+|-||..++++.+|-+....+-+...++||.++..
T Consensus       181 gLS~~~va~~~a~~lg~~~~~~~lIvaHLG~GaSi~A  217 (402)
T PRK00180        181 GTSHRYVSQRAAELLGKPLEELNLITCHLGNGASIAA  217 (402)
T ss_pred             HHHHHHHHHHHHHHhCCChhHcCEEEEEeCCCceeee
Confidence            5578899999999999888877889999999988764


No 132
>cd02783 MopB_CT_2 The MopB_CT_2 CD includes a group of related uncharacterized bacterial and archaeal molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=30.17  E-value=83  Score=20.52  Aligned_cols=30  Identities=7%  Similarity=0.094  Sum_probs=23.7

Q ss_pred             CceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           46 THNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        46 t~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      ...+|+++.|+++||.--  ..+.|....|..
T Consensus        32 ~~v~inp~dA~~~GI~dG--d~V~v~s~~G~~   61 (156)
T cd02783          32 NYLYMHPKTAKELGIKDG--DWVWVESVNGRV   61 (156)
T ss_pred             CEEEECHHHHHHcCCCCC--CEEEEEcCCeeE
Confidence            346899999999996554  788888888754


No 133
>PF07285 DUF1444:  Protein of unknown function (DUF1444);  InterPro: IPR010838 This family contains several hypothetical bacterial proteins of unknown function that are approximately 250 residues long.
Probab=29.97  E-value=78  Score=22.79  Aligned_cols=39  Identities=10%  Similarity=0.168  Sum_probs=32.9

Q ss_pred             eEEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCccc
Q 047037           25 TMRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPN   63 (85)
Q Consensus        25 ~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~   63 (85)
                      .+...-......+...+|.|-+.-||+.+.-+++|+..+
T Consensus       105 ~~v~~~~~~dl~I~ya~D~~~s~r~i~~~~L~~~g~t~e  143 (265)
T PF07285_consen  105 PLVYRPHTADLRIYYALDLGESYRFIDEKMLEEWGLTEE  143 (265)
T ss_pred             CcccccCCCCeEEEEEEECCCeeEecCHHHHhHcCCCHH
Confidence            455555667788999999999999999999999998865


No 134
>COG3865 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.69  E-value=1.4e+02  Score=20.15  Aligned_cols=25  Identities=20%  Similarity=0.380  Sum_probs=21.3

Q ss_pred             CeEEEEEEECCEEEEEEEeCCCCcee
Q 047037           24 ETMRIQGSIKGKSVVILIDSGSTHNS   49 (85)
Q Consensus        24 ~~~~~~g~i~g~~v~~LiDSGat~~F   49 (85)
                      ..+.+..+++|+++..| |+|..++|
T Consensus        49 ~Vl~a~F~l~g~~f~~l-d~g~~~~f   73 (151)
T COG3865          49 KVLVAEFTLNGQSFMAL-DGGPNTSF   73 (151)
T ss_pred             cEEEEEEEECCeEEEEE-cCCCCcCC
Confidence            56888999999998876 99988766


No 135
>PRK10681 DNA-binding transcriptional repressor DeoR; Provisional
Probab=29.14  E-value=15  Score=25.99  Aligned_cols=40  Identities=15%  Similarity=0.251  Sum_probs=25.7

Q ss_pred             EEEEeCCCCceeeCHHHHHhcCCcccC-----------CCcEEEEeCCCCE
Q 047037           38 VILIDSGSTHNSINAEVAQRVNLSPNA-----------NNRLEIMVAFGEK   77 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l~l~~~~-----------~~~~~V~~anG~~   77 (85)
                      .+.+|+|+|..++.+.+....++.+.+           .+.+.|.+.+|..
T Consensus        95 tIflD~GtT~~~la~~L~~~~~ltvvTnsl~i~~~l~~~~~~~villGG~~  145 (252)
T PRK10681         95 TLFFDCGTTTPWIIEAIDNELPFTAVCYSLNTFLALQEKPHCRAILCGGEF  145 (252)
T ss_pred             EEEEECCccHHHHHHhcCCCCCeEEEECCHHHHHHHhhCCCCEEEEECcEE
Confidence            467899999988888776543333332           2345666666654


No 136
>PRK02812 ribose-phosphate pyrophosphokinase; Provisional
Probab=29.08  E-value=57  Score=24.34  Aligned_cols=37  Identities=24%  Similarity=0.264  Sum_probs=28.4

Q ss_pred             EEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           40 LIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        40 LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      +|=+|+++..+..++|++||+++..  .-.-+-+||+..
T Consensus        23 ~i~~g~~~~~la~~ia~~lg~~l~~--~~~~~FpDGE~~   59 (330)
T PRK02812         23 RLFSGSSNPALAQEVARYLGMDLGP--MIRKRFADGELY   59 (330)
T ss_pred             EEEECCCCHHHHHHHHHHhCCCcee--eEEEECCCCCEE
Confidence            3446999999999999999988763  445567788643


No 137
>cd02511 Beta4Glucosyltransferase UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide. UDP-glucose: lipooligosaccharide (LOS)  beta-1-4-glucosyltransferase catalyzes the addition of the first residue, glucose, of the lacto-N-neotetrase structure to HepI of the LOS inner core.  LOS is the major constituent of the outer leaflet of the outer membrane of gram-positive bacteria. It consists of a short oligosaccharide chain of variable composition (alpha chain) attached to a branched inner core which is lined in turn to lipid A. Beta 1,4 glucosyltransferase is required to attach the alpha chain to the inner core.
Probab=29.07  E-value=56  Score=22.05  Aligned_cols=24  Identities=33%  Similarity=0.515  Sum_probs=16.5

Q ss_pred             EEEEEeCCCCceeeCHHHHHhcCCcc
Q 047037           37 VVILIDSGSTHNSINAEVAQRVNLSP   62 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l~l~~   62 (85)
                      -+++||+|||..  ..+++++.+..+
T Consensus        28 eiivvD~gStD~--t~~i~~~~~~~v   51 (229)
T cd02511          28 EIIVVDSGSTDR--TVEIAKEYGAKV   51 (229)
T ss_pred             EEEEEeCCCCcc--HHHHHHHcCCEE
Confidence            367899999976  356676665443


No 138
>PRK07199 phosphoribosylpyrophosphate synthetase; Provisional
Probab=28.94  E-value=49  Score=24.28  Aligned_cols=37  Identities=11%  Similarity=0.026  Sum_probs=28.5

Q ss_pred             EEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           40 LIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        40 LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      +|=+|+.+..+..++|++||+++.  ..-.-+-+||+..
T Consensus         4 ~i~~~~~~~~la~~ia~~lg~~~~--~~~~~~F~dGE~~   40 (301)
T PRK07199          4 LLLALPGNEAAAGRLAAALGVEVG--RIELHRFPDGESY   40 (301)
T ss_pred             EEEECCCCHHHHHHHHHHhCCcee--eeEEEECCCCCEE
Confidence            455788889999999999998876  4555566777654


No 139
>cd05992 PB1 The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as a noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants.
Probab=28.62  E-value=1.1e+02  Score=17.02  Aligned_cols=36  Identities=11%  Similarity=0.290  Sum_probs=29.9

Q ss_pred             EEEEEECCEEEEEEEe-CCCCceeeCHHHHHhcCCcc
Q 047037           27 RIQGSIKGKSVVILID-SGSTHNSINAEVAQRVNLSP   62 (85)
Q Consensus        27 ~~~g~i~g~~v~~LiD-SGat~~Fi~~~~a~~l~l~~   62 (85)
                      +++...+|....+.+. .+.|..=+-..++++++++.
T Consensus         2 ~vK~~~~~~~~~~~~~~~~~s~~~L~~~i~~~~~~~~   38 (81)
T cd05992           2 RVKVKYGGEIRRFVVVSRSISFEDLRSKIAEKFGLDA   38 (81)
T ss_pred             cEEEEecCCCEEEEEecCCCCHHHHHHHHHHHhCCCC
Confidence            4566677777788888 99999999999999999875


No 140
>COG0777 AccD Acetyl-CoA carboxylase beta subunit [Lipid metabolism]
Probab=28.38  E-value=1.1e+02  Score=22.78  Aligned_cols=54  Identities=20%  Similarity=0.306  Sum_probs=42.3

Q ss_pred             CCeEEEEEEECCEEEE-EEEeC----CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCC
Q 047037           23 LETMRIQGSIKGKSVV-ILIDS----GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGE   76 (85)
Q Consensus        23 ~~~~~~~g~i~g~~v~-~LiDS----Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~   76 (85)
                      ...+...|.|+|.++. +..|.    ||-.+.+.+++++.+..-++..-|+.+-.|.|.
T Consensus       110 davvtg~g~i~G~pvv~av~df~FmgGSmGsVvGeki~ra~E~A~e~k~P~v~f~aSGG  168 (294)
T COG0777         110 DAVVTGEGTINGLPVVLAVMDFAFMGGSMGSVVGEKITRAIERAIEDKLPLVLFSASGG  168 (294)
T ss_pred             cceEEEeeEECCeEEEEEEEeccccccchhHHHHHHHHHHHHHHHHhCCCEEEEecCcc
Confidence            4678889999999954 55564    788888889999888878888777777777663


No 141
>cd01398 RPI_A RPI_A: Ribose 5-phosphate isomerase type A (RPI_A) subfamily; RPI catalyzes the reversible conversion of ribose-5-phosphate to ribulose 5-phosphate, the first step of the non-oxidative branch of the pentose phosphate pathway. This reaction leads to the conversion of phosphosugars into glycolysis intermediates, which are precursors for the synthesis of amino acids, vitamins, nucleotides, and cell wall components. In plants, RPI is part of the Calvin cycle as ribulose 5-phosphate is the carbon dioxide receptor in the first dark reaction of photosynthesis. There are two unrelated types of RPIs (A and B), which catalyze the same reaction, at least one type of RPI is present in an organism. RPI_A is more widely distributed than RPI_B in bacteria, eukaryotes, and archaea.
Probab=27.92  E-value=18  Score=25.18  Aligned_cols=22  Identities=32%  Similarity=0.401  Sum_probs=18.3

Q ss_pred             EEEEEeCCCCceeeCHHHHHhc
Q 047037           37 VVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      ..+.+|||+|..++-+.++++.
T Consensus        17 ~~I~ldsGST~~~l~~~L~~~~   38 (213)
T cd01398          17 MVIGLGTGSTVAYFIEALGERV   38 (213)
T ss_pred             CEEEECchHHHHHHHHHHHHhh
Confidence            3577899999999999998764


No 142
>PRK12608 transcription termination factor Rho; Provisional
Probab=27.81  E-value=58  Score=25.04  Aligned_cols=26  Identities=23%  Similarity=0.413  Sum_probs=20.0

Q ss_pred             EEEEEeCCCC---------------ceeeCHHHHHhcCCcc
Q 047037           37 VVILIDSGST---------------HNSINAEVAQRVNLSP   62 (85)
Q Consensus        37 v~~LiDSGat---------------~~Fi~~~~a~~l~l~~   62 (85)
                      ..+|||||+-               |..+|+++|++--.|-
T Consensus       279 ~TvLvetg~~mdd~I~ee~kg~~dg~ivLsR~lA~~~~fPA  319 (380)
T PRK12608        279 ATALVDTGSRMDEVIFEEFKGTGNMEIVLDRELADKRVFPA  319 (380)
T ss_pred             EEEEEecCCCCCcchHHHhcccCCCeEEECHHHHhCCCCCc
Confidence            4579998876               6899999998765543


No 143
>cd05560 Xcc1710_like Xcc1710_like family, specific to proteobacteria. Xcc1710 is a hypothetical protein from Xanthomonas campestris pv. campestris str. ATCC 33913, similar to Mth938, a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. Their three-dimensional structures have been determined, but their functions are unknown.
Probab=27.43  E-value=66  Score=20.00  Aligned_cols=23  Identities=17%  Similarity=0.258  Sum_probs=19.3

Q ss_pred             EEEEEEeCCCCceeeCHHHHHhc
Q 047037           36 SVVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      +=.++|=||.++.|+++++.+.+
T Consensus        53 peiliiGTG~~~~~~~~~~~~~l   75 (109)
T cd05560          53 PEVILLGTGERQRFPPPALLAPL   75 (109)
T ss_pred             CCEEEEecCCCCCcCCHHHHHHH
Confidence            34789999999999999987654


No 144
>smart00295 B41 Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
Probab=26.96  E-value=1e+02  Score=20.13  Aligned_cols=43  Identities=14%  Similarity=0.215  Sum_probs=32.7

Q ss_pred             EEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEe
Q 047037           30 GSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMV   72 (85)
Q Consensus        30 g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~   72 (85)
                      ...+|....+-||+-.|.--|-..+|+++|++......+....
T Consensus         9 ~l~dg~~~~~~~~~~~t~~ev~~~v~~~~~l~~~~~F~L~~~~   51 (207)
T smart00295        9 YLLDGTTLEFEVDSSTTAEELLETVCRKLGIRESEYFGLQFED   51 (207)
T ss_pred             EecCCCEEEEEECCCCCHHHHHHHHHHHhCCCccceeEEEEEc
Confidence            3456778899999999999999999999999554334444433


No 145
>PRK00702 ribose-5-phosphate isomerase A; Provisional
Probab=26.86  E-value=23  Score=24.95  Aligned_cols=21  Identities=24%  Similarity=0.390  Sum_probs=17.5

Q ss_pred             EEEEeCCCCceeeCHHHHHhc
Q 047037           38 VILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      .+.+|||+|..++-+.++++.
T Consensus        23 ~IgLgsGST~~~l~~~L~~~~   43 (220)
T PRK00702         23 IVGLGTGSTAAYFIDALGERV   43 (220)
T ss_pred             EEEECCcHHHHHHHHHHHhhh
Confidence            468899999999888888754


No 146
>COG3188 FimD P pilus assembly protein, porin PapC [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=26.55  E-value=2.1e+02  Score=24.28  Aligned_cols=55  Identities=20%  Similarity=0.361  Sum_probs=39.6

Q ss_pred             ceeEEEeccccCcCCeEEEEEEECCEEEE---EEE----eCCCCceeeCHHHHHhcCCcccC
Q 047037           10 PKISLHDVARDQVLETMRIQGSIKGKSVV---ILI----DSGSTHNSINAEVAQRVNLSPNA   64 (85)
Q Consensus        10 ~~iSl~A~~g~~~~~~~~~~g~i~g~~v~---~Li----DSGat~~Fi~~~~a~~l~l~~~~   64 (85)
                      +.+|...-.+...|+.+++...||++.+-   +.+    +.++..-++++++-+++|++.+.
T Consensus        46 ~dls~f~~~~~~~pG~Y~vdv~vN~~~~~~~~~~f~~~~~~~~~~~Cls~~~L~~~Gv~~~~  107 (835)
T COG3188          46 PDLSRFESGNYVLPGTYRVDVYVNGQKVGSQDVIFKAAKGEGSLVPCLTPELLKQLGLKTDA  107 (835)
T ss_pred             cChhhhccCCCCCCceEEEEEEECCEEccceeEEEEccCCccCccEECCHHHHHHcCCCccc
Confidence            44444333334568999999999998742   233    24589999999999999988444


No 147
>cd02776 MopB_CT_Nitrate-R-NarG-like Respiratory nitrate reductase A (NarGHI), alpha chain (NarG) and related proteins. Under anaerobic conditions in the presence of nitrate, E. coli synthesizes the cytoplasmic membrane-bound quinol-nitrate oxidoreductase (NarGHI), which reduces nitrate to nitrite and forms part of a redox loop generating a proton-motive force. Found in prokaryotes and some archaea, NarGHI usually functions as a heterotrimer. The alpha chain contains the molybdenum cofactor-containing Mo-bisMGD catalytic subunit. This CD (MopB_CT_Nitrate-R-NarG-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=26.36  E-value=1.1e+02  Score=19.79  Aligned_cols=32  Identities=16%  Similarity=0.196  Sum_probs=23.9

Q ss_pred             CCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           45 STHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        45 at~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      ....+|+++.|+++|+.--  ..++|....|+..
T Consensus        30 ~~~v~inp~dA~~lgI~dG--d~V~v~~~~G~v~   61 (141)
T cd02776          30 GPVVWMNPKDAAELGIKDN--DWVEVFNDNGVVV   61 (141)
T ss_pred             CCEEEECHHHHHHcCCCCC--CEEEEEeCCeEEE
Confidence            3457899999999996654  6777777777543


No 148
>PF00871 Acetate_kinase:  Acetokinase family;  InterPro: IPR000890 Acetate kinase, which is predominantly found in micro-organisms, facilitates the production of acetyl-CoA by phosphorylating acetate in the presence of ATP and a divalent cation [, ]. The enzyme is important in the process of glycolysis, enzyme levels being increased in the presence of excess glucose. The growth of a bacterial mutant lacking acetate kinase has been shown to be inhibited by glucose, suggesting that the enzyme is involved in excretion of excess carbohydrate []. A related enzyme, butyrate kinase, facilitates the formation of butyryl-CoA by phosphorylating butyrate in the presence of ATP to form butyryl phosphate [].; GO: 0016301 kinase activity, 0016774 phosphotransferase activity, carboxyl group as acceptor, 0008152 metabolic process, 0016310 phosphorylation, 0005622 intracellular; PDB: 3P4I_B 3R9P_B 2IIR_J 1SAZ_A 1X9J_D 4DQ8_B 1TUU_A 1TUY_B 1G99_A 1X3N_A ....
Probab=26.21  E-value=26  Score=26.78  Aligned_cols=36  Identities=8%  Similarity=0.038  Sum_probs=30.1

Q ss_pred             CCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           45 STHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        45 at~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      -+|-||.+++|+.+|-+....+-+...++||.++..
T Consensus       179 LS~~~va~~~a~~lgk~~~~~~lIvaHLG~G~Sv~A  214 (388)
T PF00871_consen  179 LSYRYVARRAAELLGKDYEDLNLIVAHLGSGASVCA  214 (388)
T ss_dssp             HHHHHHHHHHHHHTTSCGGG-EEEEEEESSSEEEEE
T ss_pred             HhHHHHHHHHHHHcCCcccccCEEEEEeCCCcEEEE
Confidence            467788999999999999888899999999987765


No 149
>KOG0679 consensus Actin-related protein - Arp4p/Act3p [Cytoskeleton]
Probab=26.11  E-value=36  Score=26.57  Aligned_cols=19  Identities=37%  Similarity=0.672  Sum_probs=15.2

Q ss_pred             EECCEEEEEEEeCCCCcee
Q 047037           31 SIKGKSVVILIDSGSTHNS   49 (85)
Q Consensus        31 ~i~g~~v~~LiDSGat~~F   49 (85)
                      .-+|+.--++||+||+|+-
T Consensus       149 FA~GrstalVvDiGa~~~s  167 (426)
T KOG0679|consen  149 FANGRSTALVVDIGATHTS  167 (426)
T ss_pred             HhcCCCceEEEEecCCCce
Confidence            3467788889999999863


No 150
>PRK02458 ribose-phosphate pyrophosphokinase; Provisional
Probab=25.90  E-value=71  Score=23.73  Aligned_cols=38  Identities=18%  Similarity=0.116  Sum_probs=29.2

Q ss_pred             EEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           39 ILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        39 ~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      ++|=+|+++-.+..++|++||+++.  ..-.-+-+||+..
T Consensus        10 ~~i~~~~~~~~la~~ia~~lg~~l~--~~~~~~FpdGE~~   47 (323)
T PRK02458         10 IKLFSLNSNLEIAEKIAQAAGVPLG--KLSSRQFSDGEIM   47 (323)
T ss_pred             eEEEECCCCHHHHHHHHHHhCCcee--eeEEEECCCCCEE
Confidence            4556788889999999999998876  4555566788753


No 151
>COG0462 PrsA Phosphoribosylpyrophosphate synthetase [Nucleotide transport and metabolism / Amino acid transport and metabolism]
Probab=25.35  E-value=57  Score=24.50  Aligned_cols=36  Identities=19%  Similarity=0.218  Sum_probs=28.4

Q ss_pred             eCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEE
Q 047037           42 DSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLM   79 (85)
Q Consensus        42 DSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~   79 (85)
                      =+|+++-=+.+++|++|++++.  +.-.-+-+||+..-
T Consensus         8 f~g~s~~~La~~ia~~l~~~l~--~~~~~rF~DGE~~V   43 (314)
T COG0462           8 FSGSSNPELAEKIAKRLGIPLG--KVEVKRFPDGEIYV   43 (314)
T ss_pred             EECCCCHHHHHHHHHHhCCCcc--cceeEEcCCCcEEE
Confidence            3678899999999999998888  45566678887643


No 152
>COG0231 Efp Translation elongation factor P (EF-P)/translation initiation factor 5A (eIF-5A) [Translation, ribosomal structure and biogenesis]
Probab=25.27  E-value=1.2e+02  Score=19.63  Aligned_cols=51  Identities=16%  Similarity=0.216  Sum_probs=37.8

Q ss_pred             EEEECCEEEEEEEeCCCCceeeCHHHHHhcCCccc----------CCCcEEEEeCCCCEEE
Q 047037           29 QGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPN----------ANNRLEIMVAFGEKLM   79 (85)
Q Consensus        29 ~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~----------~~~~~~V~~anG~~~~   79 (85)
                      ...|..+...+|-..|....|+|.+.=+.+-++..          ....+.|...||+.+.
T Consensus        64 ~a~ie~~~~q~lY~dg~~~~FMD~etyeq~~v~~~~~~d~~~~l~eg~~v~v~~~~g~~i~  124 (131)
T COG0231          64 VAIVERKTAQYLYIDGDFYVFMDLETYEQYELPKDQIGDAAKFLKEGMEVEVLLYNGEPIA  124 (131)
T ss_pred             EeEEeeeeEEEEEcCCCeEEEccCCCceEEEecchhhhhHHHhcCCCCEEEEEEECCEEEE
Confidence            56788899999999999999999665544444433          2344788888887765


No 153
>PRK01259 ribose-phosphate pyrophosphokinase; Provisional
Probab=24.76  E-value=61  Score=23.82  Aligned_cols=36  Identities=11%  Similarity=0.123  Sum_probs=27.8

Q ss_pred             EeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           41 IDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        41 iDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      |=+|+++..+..++|++||+++.  ..-.-+-+||+..
T Consensus         3 i~~~~~~~~la~~ia~~lg~~~~--~~~~~~FpdGE~~   38 (309)
T PRK01259          3 LFAGNANPELAEKIAKYLGIPLG--KASVGRFSDGEIS   38 (309)
T ss_pred             EEECCCCHHHHHHHHHHhCCcee--eeEEEECCCCCEE
Confidence            34688999999999999998876  4555667788753


No 154
>PF00567 TUDOR:  Tudor domain;  InterPro: IPR008191 There are multiple copies of this domain in the Drosophila melanogaster tudor protein and it has been identified in several RNA-binding proteins []. Although the function of this domain is unknown, in Drosophila melanogaster the tudor protein is required during oogenesis for the formation of primordial germ cells and for normal abdominal segmentation [].; PDB: 3NTI_A 3NTK_B 3NTH_A 2DIQ_A 3FDR_A 3PNW_O 3S6W_A 3PMT_A 2WAC_A 2O4X_A ....
Probab=24.57  E-value=39  Score=19.81  Aligned_cols=41  Identities=20%  Similarity=0.201  Sum_probs=23.3

Q ss_pred             CCEEEEEEEeCCCCceeeCHHHHHhcCCcccC--CCcEEEEeCC
Q 047037           33 KGKSVVILIDSGSTHNSINAEVAQRVNLSPNA--NNRLEIMVAF   74 (85)
Q Consensus        33 ~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~--~~~~~V~~an   74 (85)
                      ++.-.+.+||.|.+.. ++..-.+++......  ...+++.++|
T Consensus        79 ~~~~~V~~iD~G~~~~-v~~~~l~~l~~~~~~~P~~a~~~~L~g  121 (121)
T PF00567_consen   79 ENQYKVFLIDYGNTEK-VSASDLRPLPPEFASLPPQAIKCKLAG  121 (121)
T ss_dssp             TTEEEEEETTTTEEEE-EEGGGEEE--HHHCSSSSSCEEEEET-
T ss_pred             cceeEEEEEecCceEE-EcHHHhhhhCHHHhhCChhhEEEEEcC
Confidence            6666889999999988 555444444322222  2445555543


No 155
>PF11976 Rad60-SLD:  Ubiquitin-2 like Rad60 SUMO-like;  InterPro: IPR022617 This entry includes small ubiquitin-related modifier (SUMO) proteins. SUMOs are small proteins that are covalently attached to lysines as post-translational modifications and are used to control multiple cellular process including signal transduction, nuclear transport and DNA replication and repair []. Unlike ubiquitin, they are not involved in protein degradation.  This entry also contains the C-terminal Rad60 DNA repair protein SUMO-like domain.; PDB: 3RD2_A 2JXX_A 3RCZ_A 3GOE_A 3A4S_D 3A4R_B 2IO1_D 1U4A_A 2K1F_A 1WZ0_A ....
Probab=24.45  E-value=1.3e+02  Score=16.42  Aligned_cols=49  Identities=12%  Similarity=0.384  Sum_probs=35.3

Q ss_pred             EEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEE
Q 047037           27 RIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLM   79 (85)
Q Consensus        27 ~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~   79 (85)
                      .+++. +|+.+.+-|..-.+..-|=..++++.+++..  ..++... +|+.+.
T Consensus         4 ~v~~~-~~~~~~~~v~~~~~~~~l~~~~~~~~~i~~~--~~~~l~f-dG~~L~   52 (72)
T PF11976_consen    4 KVRSQ-DGKEIKFKVKPTTTVSKLIEKYCEKKGIPPE--ESIRLIF-DGKRLD   52 (72)
T ss_dssp             EEEET-TSEEEEEEEETTSCCHHHHHHHHHHHTTTT---TTEEEEE-TTEEE-
T ss_pred             EEEeC-CCCEEEEEECCCCcHHHHHHHHHHhhCCCcc--ceEEEEE-CCEEcC
Confidence            34443 6778999999999999999999999998884  3455544 554443


No 156
>KOG3238 consensus Chloride ion current inducer protein [Inorganic ion transport and metabolism]
Probab=24.38  E-value=57  Score=23.19  Aligned_cols=28  Identities=32%  Similarity=0.390  Sum_probs=19.0

Q ss_pred             CCceeEEEeccccCc---CCeEEEEEEECCE
Q 047037            8 ATPKISLHDVARDQV---LETMRIQGSIKGK   35 (85)
Q Consensus         8 ~~~~iSl~A~~g~~~---~~~~~~~g~i~g~   35 (85)
                      +-|+|||||++..+.   +.+++++...++.
T Consensus        59 ey~~IsLHaiSrdps~c~~~~ly~~Ve~~~~   89 (216)
T KOG3238|consen   59 EYPTISLHAISRDPSDCLSEHLYVMVEWKFE   89 (216)
T ss_pred             ecceeEEEeeeCChhhcchhhheeeeeeccc
Confidence            469999999988653   3556665555443


No 157
>PRK11347 antitoxin ChpS; Provisional
Probab=24.01  E-value=1.7e+02  Score=17.49  Aligned_cols=31  Identities=23%  Similarity=0.313  Sum_probs=23.9

Q ss_pred             ceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEE
Q 047037           47 HNSINAEVAQRVNLSPNANNRLEIMVAFGEKLM   79 (85)
Q Consensus        47 ~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~   79 (85)
                      ..-|+..+++++++..-  ..+.+.+.+|..+-
T Consensus        13 ~vriPk~il~~l~l~~G--~~v~i~v~~~~iii   43 (83)
T PRK11347         13 GMVIPNIVMKELNLQPG--QSVEAQVSNNQLIL   43 (83)
T ss_pred             eEEeCHHHHHHcCCCCC--CEEEEEEECCEEEE
Confidence            34579999999997766  67888888886553


No 158
>PRK13846 putative glycerol-3-phosphate acyltransferase PlsX; Provisional
Probab=23.80  E-value=81  Score=23.69  Aligned_cols=40  Identities=10%  Similarity=0.086  Sum_probs=30.2

Q ss_pred             eccccCcCCeEEEEEEECCEEEEEEEeCCCCceeeCHHHHHh
Q 047037           16 DVARDQVLETMRIQGSIKGKSVVILIDSGSTHNSINAEVAQR   57 (85)
Q Consensus        16 A~~g~~~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~   57 (85)
                      -+.|..+|.-+.+.-+.+|.  .+|+|.||....=++.|++.
T Consensus       120 ri~Gi~RPala~~lPt~~g~--~~lLD~GAN~d~~pe~L~qF  159 (316)
T PRK13846        120 MFPAVPRPALLVSVPTMRGF--AVILDVGANVSVNPEEMVGF  159 (316)
T ss_pred             cCCCCCcceeeeeccCCCCC--EEEEECCccCCCCHHHHHHH
Confidence            34566777777777777777  69999999998877777653


No 159
>TIGR01699 XAPA xanthosine phosphorylase. (TIGR01698, TIGR01700).
Probab=23.63  E-value=2.1e+02  Score=20.50  Aligned_cols=38  Identities=24%  Similarity=0.275  Sum_probs=26.7

Q ss_pred             ceeEEEeccccCc---CCeE--EEEEEECCEEEEEEEeCCCCcee
Q 047037           10 PKISLHDVARDQV---LETM--RIQGSIKGKSVVILIDSGSTHNS   49 (85)
Q Consensus        10 ~~iSl~A~~g~~~---~~~~--~~~g~i~g~~v~~LiDSGat~~F   49 (85)
                      ..|++..+.+.+.   ++|-  .+.|+++|++|.++  +|-.|.+
T Consensus        21 ~~i~y~~ip~~p~~~v~gh~g~l~~G~l~g~~Vv~~--~Gr~h~y   63 (248)
T TIGR01699        21 VAISYEKLPGFPVSTVHGHAGELVLGHLQGVPVVCM--KGRGHFY   63 (248)
T ss_pred             EEEECCCCCCCCCCcccCCcceEEEEEECCEEEEEE--eCCCccc
Confidence            4666666666543   2332  55799999999996  8999954


No 160
>PF06820 Phage_fiber_C:  Putative prophage tail fibre C-terminus;  InterPro: IPR009640 This entry represents the C terminus of phage 933W tail fibre protein. This domain is found together with conserved RLGP motif. The characteristics of the protein distribution suggest prophage matches.
Probab=23.49  E-value=51  Score=18.97  Aligned_cols=28  Identities=18%  Similarity=0.102  Sum_probs=22.2

Q ss_pred             eeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           48 NSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        48 ~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      .|++++.+.+..+-..   +++|.-++|-..
T Consensus        31 tfL~pkd~~~vq~~f~---~LQv~fgDGpWq   58 (64)
T PF06820_consen   31 TFLDPKDATRVQGVFR---HLQVRFGDGPWQ   58 (64)
T ss_pred             EEecccCchhheeeee---eeEEEeccCChh
Confidence            4889999998876554   899999998653


No 161
>COG1107 Archaea-specific RecJ-like exonuclease, contains DnaJ-type Zn finger domain [DNA replication, recombination, and repair]
Probab=23.21  E-value=53  Score=27.15  Aligned_cols=29  Identities=24%  Similarity=0.399  Sum_probs=19.3

Q ss_pred             CEEEEEEEeCCCCceeeCH-HHHHhcCCcc
Q 047037           34 GKSVVILIDSGSTHNSINA-EVAQRVNLSP   62 (85)
Q Consensus        34 g~~v~~LiDSGat~~Fi~~-~~a~~l~l~~   62 (85)
                      ..|+++|||.|||--=++. +.++-.|+++
T Consensus       417 KlPL~VlvDnGsTeEDipA~~~~k~Ygi~i  446 (715)
T COG1107         417 KLPLLVLVDNGSTEEDIPAIKQLKAYGIDI  446 (715)
T ss_pred             ccceEEEEcCCCcccccHHHHHHHhcCCCE
Confidence            4589999999999765543 3444445443


No 162
>PRK00553 ribose-phosphate pyrophosphokinase; Provisional
Probab=23.10  E-value=84  Score=23.44  Aligned_cols=36  Identities=19%  Similarity=0.226  Sum_probs=28.1

Q ss_pred             EeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           41 IDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        41 iDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      |=+|+++-.+..++|++||+++.  ..-.-+-+||+.-
T Consensus        12 i~~~~~~~~La~~ia~~lg~~l~--~~~~~~FpdGE~~   47 (332)
T PRK00553         12 IFSLSKAKKLVDSICRKLSMKPG--EIVIQKFADGETY   47 (332)
T ss_pred             EEECCCCHHHHHHHHHHhCCcee--eeEEEECCCCCEE
Confidence            44788889999999999998776  4556677888753


No 163
>PTZ00145 phosphoribosylpyrophosphate synthetase; Provisional
Probab=23.02  E-value=83  Score=24.69  Aligned_cols=38  Identities=18%  Similarity=0.121  Sum_probs=29.0

Q ss_pred             EEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           39 ILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        39 ~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      ++|=+|+++-.+..++|+.||+++.  ..-.-+-+||+..
T Consensus       120 m~I~sgs~~~~LA~~IA~~Lg~~l~--~~~~~rFpDGE~~  157 (439)
T PTZ00145        120 AILFSGSSNPLLSKNIADHLGTILG--RVHLKRFADGEVS  157 (439)
T ss_pred             eEEEECCCCHHHHHHHHHHhCCCce--eeEEEECCCCCEE
Confidence            3344899999999999999998776  4555566777753


No 164
>PRK09798 antitoxin MazE; Provisional
Probab=23.02  E-value=1.8e+02  Score=17.31  Aligned_cols=30  Identities=23%  Similarity=0.349  Sum_probs=23.3

Q ss_pred             eeeCHHHHHhcCCcccCCCcEEEEeCCCCEEE
Q 047037           48 NSINAEVAQRVNLSPNANNRLEIMVAFGEKLM   79 (85)
Q Consensus        48 ~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~   79 (85)
                      .=|+..+++.+||...  ..+.+.+-+|+.+-
T Consensus        15 vRIPk~~l~~l~l~~g--~~vei~v~~~~iiI   44 (82)
T PRK09798         15 VRIPATLMQALNLNID--DEVKIDLVDGKLII   44 (82)
T ss_pred             EEcCHHHHHHcCCCCC--CEEEEEEECCEEEE
Confidence            3579999999997765  67788887877653


No 165
>COG1237 Metal-dependent hydrolases of the beta-lactamase superfamily II [General function prediction only]
Probab=22.86  E-value=1.4e+02  Score=21.95  Aligned_cols=34  Identities=15%  Similarity=0.297  Sum_probs=22.4

Q ss_pred             EEEECCEEEEEEEeCC-CCceeeCHHHHHhcCCcccC
Q 047037           29 QGSIKGKSVVILIDSG-STHNSINAEVAQRVNLSPNA   64 (85)
Q Consensus        29 ~g~i~g~~v~~LiDSG-at~~Fi~~~~a~~l~l~~~~   64 (85)
                      ...|....-.+|+||| .+..|+..  |+.||+.+..
T Consensus        24 S~LVE~~~~riLFDtG~~~~~ll~N--a~~lgvd~~d   58 (259)
T COG1237          24 SALVEDEGTRILFDTGTDSDVLLHN--ARLLGVDLRD   58 (259)
T ss_pred             EEEEEcCCeEEEEeCCCCcHHHHHH--HHHcCCCccc
Confidence            3445555578999999 66666643  5677765543


No 166
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=22.46  E-value=1.6e+02  Score=23.98  Aligned_cols=39  Identities=18%  Similarity=0.208  Sum_probs=32.3

Q ss_pred             EEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCC
Q 047037           37 VVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFG   75 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG   75 (85)
                      ..||-|||.+.==|+-+--+-+.+.+.+.+|+++.++.-
T Consensus       130 ~~F~~DSG~SvGei~GhSr~ins~~~KpsRPfRi~T~sd  168 (603)
T KOG0318|consen  130 HVFLWDSGNSVGEITGHSRRINSVDFKPSRPFRIATGSD  168 (603)
T ss_pred             EEEEecCCCccceeeccceeEeeeeccCCCceEEEeccC
Confidence            678999999988887776666778888899999988753


No 167
>PRK10434 srlR DNA-bindng transcriptional repressor SrlR; Provisional
Probab=22.36  E-value=20  Score=25.43  Aligned_cols=23  Identities=26%  Similarity=0.353  Sum_probs=17.1

Q ss_pred             EEEEeCCCCceeeCHHHHHhcCC
Q 047037           38 VILIDSGSTHNSINAEVAQRVNL   60 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l~l   60 (85)
                      .+.+|+|+|...+.+.+....++
T Consensus        94 tIfld~GtT~~~la~~L~~~~~l  116 (256)
T PRK10434         94 SIILDAGSTVLQMVPLLSRFNNI  116 (256)
T ss_pred             EEEEcCcHHHHHHHHHhccCCCe
Confidence            46789999998887777654333


No 168
>PRK10906 DNA-binding transcriptional repressor GlpR; Provisional
Probab=22.30  E-value=22  Score=25.20  Aligned_cols=25  Identities=24%  Similarity=0.416  Sum_probs=18.1

Q ss_pred             EEEEeCCCCceeeCHHHHHhcCCcc
Q 047037           38 VILIDSGSTHNSINAEVAQRVNLSP   62 (85)
Q Consensus        38 ~~LiDSGat~~Fi~~~~a~~l~l~~   62 (85)
                      .+.+|+|+|...+.+.+..+-++.+
T Consensus        94 tIflD~GtT~~~la~~L~~~~~ltV  118 (252)
T PRK10906         94 TLFIDIGTTPEAVAHALLNHSNLRI  118 (252)
T ss_pred             EEEEcCcHHHHHHHHHhcCCCCcEE
Confidence            4678999999888888765434443


No 169
>PF11925 DUF3443:  Protein of unknown function (DUF3443);  InterPro: IPR021847  This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 400 to 434 amino acids in length. This protein has two conserved sequence motifs: NPV and DNNG. 
Probab=22.02  E-value=1.2e+02  Score=23.42  Aligned_cols=27  Identities=19%  Similarity=0.288  Sum_probs=23.3

Q ss_pred             EEEEEECCEEEEE-EEeCCCCceeeCHH
Q 047037           27 RIQGSIKGKSVVI-LIDSGSTHNSINAE   53 (85)
Q Consensus        27 ~~~g~i~g~~v~~-LiDSGat~~Fi~~~   53 (85)
                      .+....+|+.+.. .||||+...|.+..
T Consensus       245 ~~tt~~~G~t~~~sf~DSGSNg~fF~d~  272 (370)
T PF11925_consen  245 DFTTTFNGQTYSASFFDSGSNGYFFPDS  272 (370)
T ss_pred             eEEEEecCceeeeeeEecCCceeeccCC
Confidence            3777889999888 99999999999854


No 170
>cd01742 GATase1_GMP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in GMP synthetase. Type 1 glutamine amidotransferase (GATase1) domain found in GMP synthetase. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. Glutamine amidotransferase (GATase) activity catalyse the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate.  GMP synthetase catalyses the amination of the nucleotide precursor xanthosine 5'-monophospahte to form GMP.  GMP synthetase belongs to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site.
Probab=21.84  E-value=47  Score=21.70  Aligned_cols=26  Identities=15%  Similarity=0.291  Sum_probs=20.4

Q ss_pred             EEEeCCCCceeeCHHHHHhcCCcccC
Q 047037           39 ILIDSGSTHNSINAEVAQRVNLSPNA   64 (85)
Q Consensus        39 ~LiDSGat~~Fi~~~~a~~l~l~~~~   64 (85)
                      .+||.|++.++.=.+..+++|..+..
T Consensus         2 ~~iD~g~~~~~~~~~~l~~~G~~~~~   27 (181)
T cd01742           2 LILDFGSQYTHLIARRVRELGVYSEI   27 (181)
T ss_pred             EEEECCCchHHHHHHHHHhcCceEEE
Confidence            58999999888778888888875543


No 171
>TIGR01251 ribP_PPkin ribose-phosphate pyrophosphokinase. In some systems, close homologs lacking enzymatic activity exist and perform regulatory functions. The model is designated subfamily rather than equivalog for this reason.
Probab=21.71  E-value=74  Score=23.25  Aligned_cols=33  Identities=24%  Similarity=0.240  Sum_probs=26.4

Q ss_pred             CCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCE
Q 047037           43 SGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEK   77 (85)
Q Consensus        43 SGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~   77 (85)
                      +|+++..+..++|+.||+++.+  .-.-+-+||+.
T Consensus         5 ~~~~~~~la~~ia~~lg~~~~~--~~~~~FpdGE~   37 (308)
T TIGR01251         5 SGSSNQELAQKVAKNLGLPLGD--VEVKRFPDGEL   37 (308)
T ss_pred             ECCCCHHHHHHHHHHhCCeeee--eEEEECCCCCE
Confidence            6899999999999999988873  45556677764


No 172
>PF04430 DUF498:  Protein of unknown function (DUF498/DUF598);  InterPro: IPR007523  NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This is entry represents an essential factor for the assembly of mitochondrial NADH:ubiquinone oxidoreductase complex (complex I) []. The crystal structure of this protein revealed a 3-layer beta+alpha/beta/alpha topology [].; PDB: 2K2E_A 2Q4Q_B 2AB1_A 2FVT_A 2CYJ_A 1IHN_B 2GM2_A 3CPK_A 2FI9_A.
Probab=21.66  E-value=92  Score=19.13  Aligned_cols=24  Identities=21%  Similarity=0.375  Sum_probs=19.1

Q ss_pred             EEEEEEEeCCCCceeeCHHHHHhc
Q 047037           35 KSVVILIDSGSTHNSINAEVAQRV   58 (85)
Q Consensus        35 ~~v~~LiDSGat~~Fi~~~~a~~l   58 (85)
                      .+=.++|=||.++-|+++++.+.+
T Consensus        53 ~pe~liiGtG~~~~~~~~~~~~~l   76 (110)
T PF04430_consen   53 KPEVLIIGTGKRQLFLPPELREYL   76 (110)
T ss_dssp             S-SEEEEEETTS-SECTHHHHHHH
T ss_pred             CCcEEEEccCCccccCCHHHHHHH
Confidence            455788999999999999998866


No 173
>PF13856 Gifsy-2:  ATP-binding sugar transporter from pro-phage; PDB: 2PP6_A.
Probab=21.39  E-value=1.3e+02  Score=17.90  Aligned_cols=24  Identities=17%  Similarity=0.237  Sum_probs=16.1

Q ss_pred             EEEEEEECCEEEEEEEeCCCCcee
Q 047037           26 MRIQGSIKGKSVVILIDSGSTHNS   49 (85)
Q Consensus        26 ~~~~g~i~g~~v~~LiDSGat~~F   49 (85)
                      |--...|||+++.+.+|..--.-.
T Consensus        20 fg~~~~InG~~~~~v~d~~~~~~~   43 (95)
T PF13856_consen   20 FGEEHTINGKPYRAVFDEPELEER   43 (95)
T ss_dssp             H-EEEEETTEEEEEEEGGGT----
T ss_pred             hCCeEEECCEEEEEEECCchhhhc
Confidence            445678999999999998744443


No 174
>PF01150 GDA1_CD39:  GDA1/CD39 (nucleoside phosphatase) family;  InterPro: IPR000407  A number of nucleoside diphosphate and triphosphate hydrolases as well as some yet uncharacterised proteins have been found to belong to the same family [, ]. The uncharacterised proteins all seem to be membrane-bound. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0016787 hydrolase activity; PDB: 3AAP_A 3AAR_A 3AAQ_A 3AGR_A 4A5B_B 4A57_D 4A59_A 4A5A_B 3CJA_A 3CJ1_A ....
Probab=21.38  E-value=60  Score=24.56  Aligned_cols=14  Identities=29%  Similarity=0.546  Sum_probs=11.4

Q ss_pred             EEEEEEEeCCCCce
Q 047037           35 KSVVILIDSGSTHN   48 (85)
Q Consensus        35 ~~v~~LiDSGat~~   48 (85)
                      ....++||.|||++
T Consensus         7 ~~y~vviDAGSsgs   20 (434)
T PF01150_consen    7 RKYGVVIDAGSSGS   20 (434)
T ss_dssp             EEEEEEEEEESSEE
T ss_pred             ccEEEEEEcCCCCc
Confidence            35678999999986


No 175
>PF07076 DUF1344:  Protein of unknown function (DUF1344);  InterPro: IPR009780 This family consists of several short, hypothetical bacterial proteins of around 80 residues in length. Members of this family are found in Rhizobium, Agrobacterium and Brucella species. The function of this family is unknown.
Probab=21.33  E-value=1.8e+02  Score=16.69  Aligned_cols=45  Identities=24%  Similarity=0.287  Sum_probs=28.6

Q ss_pred             cCCceeEEEeccccCcCCeEEEEEEECCEEEEEEEeCCCCceeeC
Q 047037            7 NATPKISLHDVARDQVLETMRIQGSIKGKSVVILIDSGSTHNSIN   51 (85)
Q Consensus         7 ~~~~~iSl~A~~g~~~~~~~~~~g~i~g~~v~~LiDSGat~~Fi~   51 (85)
                      .....|+|..=....-|..+.+.|.--|.+|.+..|-..-.++|.
T Consensus        14 ~~~~titLdDGksy~lp~ef~~~~L~~G~kV~V~yd~~~gk~vit   58 (61)
T PF07076_consen   14 PETMTITLDDGKSYKLPEEFDFDGLKPGMKVVVFYDEVDGKRVIT   58 (61)
T ss_pred             CCceEEEecCCCEEECCCcccccccCCCCEEEEEEEccCCcEEee
Confidence            344566665544445566777777777778887777766555543


No 176
>PRK02269 ribose-phosphate pyrophosphokinase; Provisional
Probab=21.21  E-value=97  Score=22.92  Aligned_cols=37  Identities=16%  Similarity=0.114  Sum_probs=27.8

Q ss_pred             EEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           40 LIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        40 LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      .|=+|+++..+.+++|++||+++.  ..-.-+-+||+..
T Consensus         7 ~i~~~~~~~~la~~ia~~lg~~l~--~~~~~~FpdGE~~   43 (320)
T PRK02269          7 KLFALSSNKELAEKVAQEIGIELG--KSSVRQFSDGEIQ   43 (320)
T ss_pred             EEEECCCCHHHHHHHHHHhCCcee--eeEEEECCCCCEE
Confidence            344688899999999999998776  4455566777643


No 177
>TIGR02707 butyr_kinase butyrate kinase. This model represents an enzyme family in which members are designated either butryate kinase or branched-chain carboxylic acid kinase. The EC designation 2.7.2.7 describes an enzyme with relatively broad specificity; gene products whose context suggests a role in metabolism of aliphatic amino acids are likely to act as branched-chain carboxylic acid kinase. The gene typically found adjacent, ptb (phosphate butyryltransferase), likewise encodes an enzyme that may have a broad specificity that includes a role in aliphatic amino acid cabolism.
Probab=21.00  E-value=78  Score=23.74  Aligned_cols=35  Identities=9%  Similarity=0.103  Sum_probs=29.7

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEE
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKL   78 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~   78 (85)
                      |-+|-|+..++++.+|.+.....-+.+.+++|-..
T Consensus       153 gls~~~v~~~~~~~~g~~~~~~~~I~~hLGtGig~  187 (351)
T TIGR02707       153 ALNQKAVARRIAKELGKRYEEMNLIVAHMGGGISV  187 (351)
T ss_pred             hhhHHHHHHHHHHHcCCCcccCCEEEEEeCCCcee
Confidence            77899999999999998877778888888888654


No 178
>PF07532 Big_4:  Bacterial Ig-like domain (group 4);  InterPro: IPR011081 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.
Probab=20.99  E-value=98  Score=16.73  Aligned_cols=14  Identities=21%  Similarity=0.480  Sum_probs=12.0

Q ss_pred             CcCCeEEEEEEECC
Q 047037           21 QVLETMRIQGSIKG   34 (85)
Q Consensus        21 ~~~~~~~~~g~i~g   34 (85)
                      ..+++|.+.|.+.|
T Consensus        45 ~~~G~y~v~G~v~G   58 (59)
T PF07532_consen   45 NKPGTYTVTGTVEG   58 (59)
T ss_pred             cCCEEEEEEEEEec
Confidence            56899999999976


No 179
>PRK06393 rpoE DNA-directed RNA polymerase subunit E''; Validated
Probab=20.87  E-value=1.7e+02  Score=16.88  Aligned_cols=29  Identities=10%  Similarity=0.331  Sum_probs=19.7

Q ss_pred             EEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEE
Q 047037           37 VVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIM   71 (85)
Q Consensus        37 v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~   71 (85)
                      +++++|.-      +..+|+.+++...-.+.++|+
T Consensus        35 ~v~i~dPe------~S~vAk~~~i~~pG~YAlkVr   63 (64)
T PRK06393         35 FLIITEPE------GSAIAKRAGITEPGMYAIKVR   63 (64)
T ss_pred             EEEEECCc------hhHHHHHhCCCCCCeEEEEee
Confidence            45666765      568999999884334666654


No 180
>PRK07058 acetate kinase; Provisional
Probab=20.83  E-value=59  Score=25.19  Aligned_cols=36  Identities=14%  Similarity=0.189  Sum_probs=29.0

Q ss_pred             CCCceeeCHHHHHhcCCcccCCCcEEEEeCCCCEEEE
Q 047037           44 GSTHNSINAEVAQRVNLSPNANNRLEIMVAFGEKLMS   80 (85)
Q Consensus        44 Gat~~Fi~~~~a~~l~l~~~~~~~~~V~~anG~~~~~   80 (85)
                      |-+|-|+..++|+.+ -+....+-+...++||-++..
T Consensus       181 GlS~~~va~~~a~~l-~~~~~~~~Iv~HLG~G~Si~A  216 (396)
T PRK07058        181 GLSYKFVAGELRRRA-PELARGKVVAAHLGSGASLCA  216 (396)
T ss_pred             HHHHHHHHHHHHHhc-CCcccCCEEEEEeCCCceeee
Confidence            457889999999998 455556888999999988765


No 181
>COG4067 Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]
Probab=20.66  E-value=72  Score=21.79  Aligned_cols=24  Identities=25%  Similarity=0.390  Sum_probs=18.8

Q ss_pred             EEEEEEeCCCCceeeCHHHHHhcC
Q 047037           36 SVVILIDSGSTHNSINAEVAQRVN   59 (85)
Q Consensus        36 ~v~~LiDSGat~~Fi~~~~a~~l~   59 (85)
                      .+.+=|||||..+-++..-.+++.
T Consensus        39 ~~kAkiDTGA~TSsL~A~dI~~fk   62 (162)
T COG4067          39 QLKAKIDTGAVTSSLSASDIERFK   62 (162)
T ss_pred             eeeeeecccceeeeEEeecceeee
Confidence            366778999999999987766554


No 182
>cd06404 PB1_aPKC PB1 domain is an essential modular domain of the atypical protein kinase C (aPKC) which in complex with Par6 and Par3  proteins is crucial for establishment of apical-basal polarity of animal cells. PB1 domain is a modular domain mediating specific protein-protein interaction which play roles in many critical cell processes. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster.  Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi,
Probab=20.58  E-value=2.2e+02  Score=17.34  Aligned_cols=47  Identities=13%  Similarity=0.237  Sum_probs=38.1

Q ss_pred             EEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcccCCCcEEEEeCC
Q 047037           26 MRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPNANNRLEIMVAF   74 (85)
Q Consensus        26 ~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~~~~~~~V~~an   74 (85)
                      ++++...+|.-+..-+|-..|.-.+.+++.+-.++...  .+++++--|
T Consensus         1 i~~K~~y~gdi~it~~d~~~s~e~L~~~v~~~c~~~~~--q~ft~kw~D   47 (83)
T cd06404           1 VRVKAAYNGDIMITSIDPSISLEELCNEVRDMCRFHND--QPFTLKWID   47 (83)
T ss_pred             CeEEEEecCcEEEEEcCCCcCHHHHHHHHHHHhCCCCC--CcEEEEEEC
Confidence            36788889999999999999988899999888876554  677776655


No 183
>PRK13668 hypothetical protein; Provisional
Probab=20.25  E-value=1.5e+02  Score=21.80  Aligned_cols=38  Identities=13%  Similarity=0.246  Sum_probs=31.6

Q ss_pred             EEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCccc
Q 047037           26 MRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSPN   63 (85)
Q Consensus        26 ~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~~   63 (85)
                      +...-......+...+|.|-+.-||+++..+++|+..+
T Consensus       106 lv~~~htae~~I~YalDlg~syr~I~e~~L~~~git~e  143 (267)
T PRK13668        106 LIYDEHTAETRIYYALDLGKTYRLIDERMLEKLGLTEE  143 (267)
T ss_pred             ccccccccceEEEEEEcCCcceEEeCHHHHHHcCCCHH
Confidence            44455566778899999999999999999999998754


No 184
>smart00666 PB1 PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
Probab=20.22  E-value=1.8e+02  Score=16.30  Aligned_cols=37  Identities=14%  Similarity=0.275  Sum_probs=31.6

Q ss_pred             EEEEEEECCEEEEEEEeCCCCceeeCHHHHHhcCCcc
Q 047037           26 MRIQGSIKGKSVVILIDSGSTHNSINAEVAQRVNLSP   62 (85)
Q Consensus        26 ~~~~g~i~g~~v~~LiDSGat~~Fi~~~~a~~l~l~~   62 (85)
                      ++++...++....+.++.+.|..=+-.++.++++++.
T Consensus         2 ~~vK~~~~~~~~~~~~~~~~s~~dL~~~i~~~~~~~~   38 (81)
T smart00666        2 VDVKLRYGGETRRLSVPRDISFEDLRSKVAKRFGLDN   38 (81)
T ss_pred             ccEEEEECCEEEEEEECCCCCHHHHHHHHHHHhCCCC
Confidence            4566677889999999999999999999999999764


No 185
>TIGR03778 VPDSG_CTERM VPDSG-CTERM exosortase interaction domain. Through in silico analysis, we previously described the PEP-CTERM/exosortase system (PubMed:16930487). This model describes a PEP-CTERM-like variant C-terminal protein sorting signal, as found at the C-terminus of twenty otherwise unrelated proteins in Verrucomicrobiae bacterium DG1235. The variant motif, VPDSG, seems an intermediate between the VPEP motif (TIGR02595) of typical exosortase systems and the classical LPXTG of sortase in Gram-positive bacteria.
Probab=20.02  E-value=32  Score=16.52  Aligned_cols=15  Identities=33%  Similarity=0.328  Sum_probs=10.2

Q ss_pred             eCCCCceeeCHHHHH
Q 047037           42 DSGSTHNSINAEVAQ   56 (85)
Q Consensus        42 DSGat~~Fi~~~~a~   56 (85)
                      |||+|...+.-.++-
T Consensus         3 DsGST~~Ll~~~l~~   17 (26)
T TIGR03778         3 DSGSTLALLGLGLLG   17 (26)
T ss_pred             CchhHHHHHHHHHHH
Confidence            899887766555443


Done!