Query         psy5907
Match_columns 56
No_of_seqs    126 out of 1035
Neff          8.7 
Searched_HMMs 46136
Date          Fri Aug 16 21:19:53 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5907.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5907hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd01389 MATA_HMG-box MATA_HMG-  99.8 1.8E-20 3.9E-25   92.8   7.5   54    1-54      7-60  (77)
  2 cd01388 SOX-TCF_HMG-box SOX-TC  99.8 2.8E-20 6.1E-25   91.2   7.5   54    1-54      7-60  (72)
  3 PTZ00199 high mobility group p  99.8 7.9E-20 1.7E-24   93.7   7.4   54    1-54     28-83  (94)
  4 cd01390 HMGB-UBF_HMG-box HMGB-  99.8 1.4E-19 3.1E-24   86.7   6.9   54    1-54      6-59  (66)
  5 PF00505 HMG_box:  HMG (high mo  99.8 1.7E-18 3.7E-23   83.6   7.0   54    1-54      6-59  (69)
  6 smart00398 HMG high mobility g  99.8   4E-18 8.6E-23   82.1   7.5   54    1-54      7-60  (70)
  7 cd00084 HMG-box High Mobility   99.8 1.1E-17 2.4E-22   79.8   7.4   54    1-54      6-59  (66)
  8 KOG0527|consensus               99.7 4.1E-18 8.9E-23  102.4   5.4   54    1-54     68-121 (331)
  9 PF09011 HMG_box_2:  HMG-box do  99.7 3.8E-16 8.3E-21   76.6   6.7   54    1-54      9-63  (73)
 10 KOG0381|consensus               99.6 1.3E-15 2.8E-20   77.6   7.2   54    1-54     28-81  (96)
 11 COG5648 NHP6B Chromatin-associ  99.6 1.9E-15 4.1E-20   86.0   6.2   54    1-54     76-129 (211)
 12 KOG4715|consensus               99.4 6.2E-13 1.3E-17   79.9   4.6   54    1-54     70-123 (410)
 13 KOG3248|consensus               99.2 1.9E-11 4.2E-16   74.0   5.6   54    1-54    197-250 (421)
 14 KOG0528|consensus               99.1 3.8E-11 8.2E-16   75.0   0.6   52    1-52    331-382 (511)
 15 KOG0526|consensus               99.0 2.9E-10 6.3E-15   72.0   4.1   50    1-54    541-590 (615)
 16 KOG2746|consensus               98.6 2.6E-08 5.6E-13   64.4   3.2   54    1-54    187-242 (683)
 17 PF06382 DUF1074:  Protein of u  98.4 1.4E-06 2.9E-11   49.1   5.7   47    1-51     84-130 (183)
 18 PF08073 CHDNT:  CHDNT (NUC034)  97.8 0.00013 2.8E-09   34.2   4.7   37    2-38     15-51  (55)
 19 PF04690 YABBY:  YABBY protein;  97.6 0.00019 4.1E-09   40.4   4.7   38    1-38    127-164 (170)
 20 PF14887 HMG_box_5:  HMG (high   97.5 0.00091   2E-08   33.4   5.5   50    4-54     12-61  (85)
 21 COG5648 NHP6B Chromatin-associ  97.4 0.00012 2.7E-09   42.2   2.2   52    3-54    151-202 (211)
 22 PF06244 DUF1014:  Protein of u  97.0  0.0029 6.2E-08   34.0   4.5   39    2-40     79-117 (122)
 23 KOG3223|consensus               95.6   0.035 7.5E-07   32.1   4.1   45    2-49    171-215 (221)
 24 PRK15117 ABC transporter perip  95.4   0.044 9.4E-07   31.7   4.2   33   19-51     66-99  (211)
 25 TIGR03481 HpnM hopanoid biosyn  95.1   0.049 1.1E-06   31.2   3.9   33   19-51     62-95  (198)
 26 PF05494 Tol_Tol_Ttg2:  Toluene  94.8   0.035 7.7E-07   30.7   2.6   33   19-51     36-69  (170)
 27 PF06945 DUF1289:  Protein of u  92.9    0.46 9.9E-06   21.6   4.0   29   20-53     21-49  (51)
 28 PF13875 DUF4202:  Domain of un  92.4    0.63 1.4E-05   26.7   4.9   40    2-44    131-170 (185)
 29 PF09164 VitD-bind_III:  Vitami  90.1     1.3 2.8E-05   21.5   4.6   32    2-33     10-41  (68)
 30 PF01352 KRAB:  KRAB box;  Inte  88.0    0.65 1.4E-05   20.3   2.1   29   23-51      3-32  (41)
 31 PF12650 DUF3784:  Domain of un  87.0    0.55 1.2E-05   23.8   1.7   16   33-48     25-40  (97)
 32 PF11304 DUF3106:  Protein of u  86.1     3.4 7.3E-05   21.5   6.1   40   12-51     32-73  (107)
 33 PRK10236 hypothetical protein;  84.9     1.5 3.3E-05   26.1   3.0   31   19-49    106-141 (237)
 34 PF04769 MAT_Alpha1:  Mating-ty  83.5     3.6 7.9E-05   23.9   4.1   42    2-49     50-91  (201)
 35 PF11304 DUF3106:  Protein of u  83.1       4 8.7E-05   21.3   3.9   17   30-46     34-50  (107)
 36 PF13945 NST1:  Salt tolerance   82.7     3.3 7.1E-05   23.9   3.7   30   22-51     99-128 (190)
 37 PRK12751 cpxP periplasmic stre  81.8     4.3 9.3E-05   22.8   3.9   27   25-51    118-144 (162)
 38 PF12881 NUT_N:  NUT protein N   81.5     9.8 0.00021   23.8   5.6   49    3-51    232-280 (328)
 39 KOG1610|consensus               79.7     7.4 0.00016   24.3   4.7   30   23-52    218-247 (322)
 40 PF13412 HTH_24:  Winged helix-  77.7     3.5 7.5E-05   17.8   2.3   22   15-36     12-33  (48)
 41 PRK10363 cpxP periplasmic repr  74.8     9.3  0.0002   21.7   3.9   29   23-51    110-138 (166)
 42 PF15581 Imm35:  Immunity prote  73.3      11 0.00023   19.4   3.9   28   21-48     30-57  (93)
 43 PF12290 DUF3802:  Protein of u  73.0      12 0.00027   19.9   4.7   28   23-50     73-100 (113)
 44 cd08317 Death_ank Death domain  72.9     1.9   4E-05   21.3   0.8   22   19-40      3-24  (84)
 45 PF05388 Carbpep_Y_N:  Carboxyp  72.4      12 0.00025   19.9   3.7   28   23-50     45-72  (113)
 46 COG2854 Ttg2D ABC-type transpo  71.0       5 0.00011   23.4   2.3   33   19-51     68-101 (202)
 47 PF08367 M16C_assoc:  Peptidase  67.6      18 0.00039   21.1   4.2   33   23-55     12-44  (248)
 48 COG3313 Predicted Fe-S protein  66.4     8.6 0.00019   19.0   2.3   22   20-46     26-47  (74)
 49 PF06628 Catalase-rel:  Catalas  66.0     6.4 0.00014   18.7   1.8   19   30-48     13-31  (68)
 50 PRK10455 periplasmic protein;   64.7      23  0.0005   19.8   4.1   27   25-51    118-144 (161)
 51 PF07813 LTXXQ:  LTXXQ motif fa  63.5      16 0.00036   17.8   3.9   25   24-48     75-99  (100)
 52 PRK00523 hypothetical protein;  62.6      18 0.00038   17.8   5.0   36    6-41     25-60  (72)
 53 PF03672 UPF0154:  Uncharacteri  61.4      18 0.00038   17.4   5.1   34    8-41     19-52  (64)
 54 PRK14291 chaperone protein Dna  61.1      33 0.00072   21.6   4.7   37    8-44     22-62  (382)
 55 PRK12750 cpxP periplasmic repr  60.5      29 0.00062   19.6   4.1   24   28-51    128-151 (170)
 56 PRK10266 curved DNA-binding pr  59.4      38 0.00082   20.6   5.2   40    8-47     23-67  (306)
 57 PRK14299 chaperone protein Dna  58.6      39 0.00084   20.4   4.8   39    8-46     23-66  (291)
 58 PRK14281 chaperone protein Dna  58.5      28  0.0006   22.0   4.1   40    7-46     21-66  (397)
 59 PHA03102 Small T antigen; Revi  58.2      26 0.00057   19.5   3.6   37    7-43     25-62  (153)
 60 PF02026 RyR:  RyR domain;  Int  58.0      14  0.0003   18.8   2.3   19   33-51     60-78  (94)
 61 PRK14279 chaperone protein Dna  57.7      27 0.00059   22.1   3.9   40    8-47     28-73  (392)
 62 PF11521 TFIIE-A_C-term:  C-ter  57.5      14 0.00031   18.7   2.2   30   22-52     51-80  (86)
 63 PF02376 CUT:  CUT domain;  Int  57.4      14 0.00031   18.5   2.2   32   19-50     37-70  (87)
 64 PRK14283 chaperone protein Dna  57.3      46 0.00099   20.9   4.9   40    7-46     23-67  (378)
 65 PRK14280 chaperone protein Dna  55.7      49  0.0011   20.8   4.9   40    8-47     23-67  (376)
 66 smart00271 DnaJ DnaJ molecular  55.6      11 0.00023   16.8   1.5   33    8-40     20-58  (60)
 67 PRK14296 chaperone protein Dna  55.2      46   0.001   20.9   4.6   39    8-46     23-66  (372)
 68 PRK01844 hypothetical protein;  54.3      26 0.00057   17.2   5.1   36    6-41     24-59  (72)
 69 KOG1827|consensus               53.4    0.35 7.5E-06   32.3  -5.0   41    2-42    555-595 (629)
 70 PF02650 HTH_WhiA:  WhiA C-term  52.9      12 0.00025   18.9   1.5   20   13-32     47-66  (85)
 71 TIGR00787 dctP tripartite ATP-  51.7      24 0.00051   20.6   2.9   21   31-51    213-233 (257)
 72 COG1638 DctP TRAP-type C4-dica  50.1      27 0.00058   21.7   3.0   24   31-54    244-267 (332)
 73 PTZ00037 DnaJ_C chaperone prot  49.9      60  0.0013   20.9   4.6   40    7-46     46-87  (421)
 74 COG3763 Uncharacterized protei  48.8      34 0.00073   16.8   4.9   34    7-40     25-58  (71)
 75 PRK14284 chaperone protein Dna  48.6      53  0.0012   20.7   4.2   39    8-46     20-64  (391)
 76 PF11588 DUF3243:  Protein of u  48.1      22 0.00047   17.9   2.0   19   27-45     53-71  (81)
 77 PRK14285 chaperone protein Dna  47.4      53  0.0011   20.6   4.0   41    7-47     21-67  (365)
 78 PLN02150 terpene synthase/cycl  47.1      11 0.00025   19.2   1.0   19   22-40     24-42  (96)
 79 PF01630 Glyco_hydro_56:  Hyalu  46.1      75  0.0016   20.1   4.5   32    4-35    123-154 (337)
 80 PRK14287 chaperone protein Dna  45.5      76  0.0016   19.9   4.9   39    8-46     23-66  (371)
 81 cd08805 Death_ank1 Death domai  45.4     6.2 0.00013   19.8  -0.2   21   20-40      4-24  (84)
 82 PRK14276 chaperone protein Dna  45.1      78  0.0017   19.9   4.6   39    8-46     23-66  (380)
 83 KOG0713|consensus               45.1      79  0.0017   20.0   4.9   40    8-47     35-79  (336)
 84 PRK09897 hypothetical protein;  44.5      53  0.0012   21.8   3.8   29   23-51    353-381 (534)
 85 PRK14294 chaperone protein Dna  43.7      80  0.0017   19.7   4.9   39    8-46     23-67  (366)
 86 PF00226 DnaJ:  DnaJ domain;  I  43.0      33 0.00071   15.4   2.1   38    8-45     19-63  (64)
 87 cd01670 Death Death Domain: a   42.7      14  0.0003   17.4   0.8   17   24-40      3-19  (79)
 88 PF08373 RAP:  RAP domain;  Int  42.6      22 0.00047   15.8   1.5   15   33-47     41-56  (58)
 89 PRK14298 chaperone protein Dna  42.5      76  0.0017   20.0   4.2   37    8-44     24-64  (377)
 90 PF03480 SBP_bac_7:  Bacterial   42.4      28  0.0006   20.5   2.2   20   32-51    214-233 (286)
 91 PF02936 COX4:  Cytochrome c ox  41.9      15 0.00032   20.2   0.9   22   23-44     29-54  (142)
 92 PRK05439 pantothenate kinase;   41.2      20 0.00043   22.2   1.5   49    1-50      8-58  (311)
 93 PF05914 RIB43A:  RIB43A;  Inte  40.8     4.9 0.00011   25.4  -1.2   34   14-47    252-287 (379)
 94 PRK14292 chaperone protein Dna  40.7      89  0.0019   19.6   4.3   39    8-46     21-64  (371)
 95 PF09829 DUF2057:  Uncharacteri  40.0      39 0.00084   19.0   2.5   24   23-46    164-187 (189)
 96 smart00351 PAX Paired Box doma  39.4      58  0.0013   17.1   3.0   20   13-32     86-105 (125)
 97 cd08318 Death_NMPP84 Death dom  39.1      17 0.00037   18.0   0.9   16   23-38     10-25  (86)
 98 PRK14295 chaperone protein Dna  38.6   1E+02  0.0022   19.5   4.4   40    8-47     28-73  (389)
 99 PRK14290 chaperone protein Dna  38.5      83  0.0018   19.7   3.9   36    8-43     22-63  (365)
100 cd08779 Death_PIDD Death Domai  38.4      15 0.00032   18.3   0.5   17   23-39      5-21  (86)
101 PF14098 SSPI:  Small, acid-sol  38.3      49  0.0011   15.9   2.3   17   32-48     44-60  (65)
102 COG4735 Uncharacterized protei  38.0      81  0.0018   18.7   3.5   33   19-51     35-67  (211)
103 PF01710 HTH_Tnp_IS630:  Transp  38.0      48   0.001   17.3   2.5   20   12-31     63-82  (119)
104 PRK14278 chaperone protein Dna  37.2 1.1E+02  0.0023   19.3   4.3   39    8-46     22-65  (378)
105 PF11616 EZH2_WD-Binding:  WD r  37.2      32 0.00069   14.0   1.4   14   24-37     12-25  (30)
106 PF06242 DUF1013:  Protein of u  36.6      33 0.00071   19.0   1.7   18   15-32     78-95  (140)
107 PF11334 DUF3136:  Protein of u  36.5      55  0.0012   15.7   3.3   30    2-36      9-38  (64)
108 cd00922 Cyt_c_Oxidase_IV Cytoc  36.5      19 0.00041   19.6   0.8   14   33-46     43-56  (136)
109 PRK14286 chaperone protein Dna  36.4 1.1E+02  0.0024   19.2   5.0   40    8-47     23-68  (372)
110 COG2938 Uncharacterized conser  36.3      66  0.0014   16.6   2.8   24   28-51     35-58  (94)
111 PF03810 IBN_N:  Importin-beta   35.3      52  0.0011   15.1   2.4   23   25-47     39-69  (77)
112 PF11903 DUF3423:  Protein of u  35.2      60  0.0013   15.8   2.8   20   13-32     37-56  (72)
113 PRK14288 chaperone protein Dna  34.9 1.2E+02  0.0026   19.1   5.2   39    8-46     22-66  (369)
114 PF09556 RE_HaeIII:  HaeIII res  34.9      16 0.00035   22.6   0.4   28   24-51    150-177 (300)
115 cd07911 RNRR2_Rv0233_like Ribo  34.8      41 0.00089   20.1   2.1   16   33-48     35-50  (280)
116 PRK09731 putative general secr  34.2      95  0.0021   17.8   3.6   34    3-44      6-39  (178)
117 KOG0715|consensus               33.9 1.1E+02  0.0025   18.6   4.6   37    9-45     63-103 (288)
118 PRK14297 chaperone protein Dna  33.5   1E+02  0.0022   19.5   3.7   39    8-46     23-67  (380)
119 PRK14289 chaperone protein Dna  33.5 1.3E+02  0.0027   19.0   4.7   40    8-47     24-69  (386)
120 PF13542 HTH_Tnp_ISL3:  Helix-t  33.3      37  0.0008   14.6   1.4   32    6-37     12-44  (52)
121 smart00413 ETS erythroblast tr  33.3      26 0.00056   17.8   1.0   24   30-53     39-62  (87)
122 smart00206 NTR Tissue inhibito  33.1      26 0.00056   20.0   1.0   18   33-50    101-118 (172)
123 PF12376 DUF3654:  Protein of u  33.1      63  0.0014   17.9   2.5   21   35-55      6-26  (137)
124 TIGR02349 DnaJ_bact chaperone   32.8 1.2E+02  0.0027   18.7   4.3   39    8-46     19-62  (354)
125 PHA02947 S-S bond formation pa  32.7      49  0.0011   19.6   2.1   28   18-45     58-85  (215)
126 PF02216 B:  B domain;  InterPr  32.6      60  0.0013   15.1   2.4   17   35-51     20-36  (54)
127 PHA02955 hypothetical protein;  32.4      49  0.0011   19.6   2.1   28   18-45     58-85  (213)
128 PF00356 LacI:  Bacterial regul  32.4      53  0.0012   14.4   2.9   32   19-51      9-40  (46)
129 cd08804 Death_ank2 Death domai  32.1      20 0.00042   17.8   0.4   20   21-40      5-24  (84)
130 cd06257 DnaJ DnaJ domain or J-  31.6      52  0.0011   14.0   3.5   31    8-38     19-54  (55)
131 cd09071 FAR_C C-terminal domai  31.4      33 0.00071   16.7   1.2   15   31-45     53-67  (92)
132 PF04363 DUF496:  Protein of un  31.3      71  0.0015   16.5   2.3   17   19-35     47-63  (95)
133 cd03585 NTR_TIMP NTR domain, T  31.2      28  0.0006   20.1   1.0   18   33-50    106-123 (183)
134 PRK05423 hypothetical protein;  31.1      87  0.0019   16.4   3.0   18   19-36     54-71  (104)
135 PRK14277 chaperone protein Dna  31.0 1.4E+02  0.0031   18.8   4.8   39    8-46     24-68  (386)
136 PF06424 PRP1_N:  PRP1 splicing  31.0      48   0.001   18.1   1.8   26   14-39     95-126 (133)
137 PHA02662 ORF131 putative membr  30.9      55  0.0012   19.6   2.1   28   18-45     71-98  (226)
138 PF10675 DUF2489:  Protein of u  30.5      76  0.0017   17.1   2.6   19   30-48     94-112 (131)
139 cd08319 Death_RAIDD Death doma  30.1      24 0.00053   17.6   0.5   16   23-38      5-20  (83)
140 cd08306 Death_FADD Fas-associa  30.1      27 0.00059   17.3   0.7   17   22-38      4-20  (86)
141 COG4282 SMI1 Protein involved   29.2 1.2E+02  0.0027   17.5   3.3   29   18-46     30-58  (191)
142 PF12926 MOZART2:  Mitotic-spin  28.8      91   0.002   15.9   3.7   28   19-46      5-32  (88)
143 PF07757 AdoMet_MTase:  Predict  28.6      70  0.0015   17.1   2.1   25   27-51     19-43  (112)
144 PF11460 DUF3007:  Protein of u  28.3   1E+02  0.0022   16.3   4.1   20   29-48     82-101 (104)
145 PRK13965 ribonucleotide-diphos  28.2      61  0.0013   20.1   2.1   17   33-49     60-76  (335)
146 PRK14293 chaperone protein Dna  27.9 1.6E+02  0.0035   18.5   4.4   41    7-47     21-66  (374)
147 PF13404 HTH_AsnC-type:  AsnC-t  27.9      63  0.0014   13.8   2.2   16   16-31     13-28  (42)
148 PF04255 DUF433:  Protein of un  27.9      71  0.0015   14.4   2.9   19   12-30     35-53  (56)
149 PF14647 FAM91_N:  FAM91 N-term  27.8 1.2E+02  0.0027   19.0   3.3   31    1-31     87-122 (308)
150 TIGR00647 MG103 conserved hypo  27.7      74  0.0016   19.5   2.4   20   12-31    236-255 (279)
151 TIGR03092 SASP_sspI small, aci  27.6      85  0.0018   15.2   2.2   18   31-48     42-59  (65)
152 PF11625 DUF3253:  Protein of u  27.5      94   0.002   15.7   3.2   38   13-50     15-55  (83)
153 PLN02492 ribonucleoside-diphos  27.4      63  0.0014   19.8   2.1   17   32-48     45-61  (324)
154 PF07800 DUF1644:  Protein of u  27.2      30 0.00066   19.6   0.7   30    8-37    122-151 (162)
155 PF00887 ACBP:  Acyl CoA bindin  27.0      89  0.0019   15.2   3.7   49    3-52     30-82  (87)
156 PRK06946 lipid A biosynthesis   27.0 1.3E+02  0.0028   18.0   3.3   26   13-38     49-74  (293)
157 PRK07920 lipid A biosynthesis   27.0      85  0.0018   18.8   2.6   26   13-38     44-69  (298)
158 TIGR02208 lipid_A_msbB lipid A  26.9 1.3E+02  0.0028   18.1   3.3   26   13-38     60-85  (305)
159 PF14374 Ribos_L4_asso_C:  60S   26.8      33 0.00072   17.1   0.7   14   19-32      8-21  (80)
160 PF11961 DUF3475:  Domain of un  26.8      82  0.0018   14.7   2.1   21   26-46      9-29  (57)
161 PRK13967 nrdF1 ribonucleotide-  26.7      67  0.0014   19.8   2.1   17   33-49     47-63  (322)
162 PRK05645 lipid A biosynthesis   26.7      95  0.0021   18.5   2.7   26   13-38     49-74  (295)
163 PRK08734 lipid A biosynthesis   26.5      96  0.0021   18.7   2.7   26   13-38     50-75  (305)
164 PF12550 GCR1_C:  Transcription  26.4      81  0.0017   15.3   2.1   22   19-40     24-45  (81)
165 PF13867 SAP30_Sin3_bdg:  Sin3   26.4      77  0.0017   14.3   2.8   26   19-44     19-44  (53)
166 PRK10767 chaperone protein Dna  26.4 1.7E+02  0.0037   18.3   4.7   39    8-46     23-67  (371)
167 smart00509 TFS2N Domain in the  26.2      47   0.001   16.0   1.2   21   17-37     52-72  (75)
168 PF01527 HTH_Tnp_1:  Transposas  26.2      35 0.00076   15.8   0.7   25    6-30      7-33  (76)
169 PF01047 MarR:  MarR family;  I  26.1      37  0.0008   14.9   0.8   16   16-31     13-28  (59)
170 cd01776 Rin1_RA Ubiquitin doma  25.8 1.1E+02  0.0023   15.7   2.8   23   18-40     21-43  (87)
171 PRK09614 nrdF ribonucleotide-d  25.5      73  0.0016   19.4   2.1   16   33-48     47-62  (324)
172 PF05047 L51_S25_CI-B8:  Mitoch  25.0      76  0.0017   13.8   2.0   17    6-22      4-20  (52)
173 COG2926 Uncharacterized protei  24.9 1.2E+02  0.0026   15.9   3.0   18   19-36     54-71  (109)
174 TIGR00534 OpcA opcA protein. T  24.9      83  0.0018   19.6   2.3   19   18-36      5-23  (311)
175 PRK13966 nrdF2 ribonucleotide-  24.9      75  0.0016   19.6   2.1   16   33-48     49-64  (324)
176 PRK08943 lipid A biosynthesis   24.8 1.5E+02  0.0032   18.0   3.3   26   13-38     69-94  (314)
177 COG0484 DnaJ DnaJ-class molecu  24.8   2E+02  0.0043   18.5   4.5   40    8-47     23-67  (371)
178 PRK14282 chaperone protein Dna  24.7 1.9E+02  0.0041   18.2   4.0   39    8-46     23-68  (369)
179 cd08780 Death_TRADD Death Doma  24.6      44 0.00096   17.1   0.9   17   23-39      5-21  (90)
180 PRK14301 chaperone protein Dna  24.2 1.9E+02  0.0042   18.2   4.1   39    8-46     23-67  (373)
181 PF12339 DNAJ_related:  DNA-J r  24.1      86  0.0019   17.1   2.0   18   20-37    114-131 (132)
182 PTZ00211 ribonucleoside-diphos  24.1      79  0.0017   19.5   2.1   17   32-48     56-72  (330)
183 PF10407 Cytokin_check_N:  Cdc1  24.1      87  0.0019   15.3   1.9   23   19-41     11-33  (73)
184 TIGR02269 Myxococcus xanthus p  23.6 1.7E+02  0.0037   17.3   3.8   24    9-32    174-197 (211)
185 PF07055 Eno-Rase_FAD_bd:  Enoy  23.4      62  0.0013   15.6   1.3   19   23-41     16-34  (65)
186 PF10077 DUF2314:  Uncharacteri  23.3 1.4E+02   0.003   16.1   2.8   20   29-48    109-128 (133)
187 cd08803 Death_ank3 Death domai  22.8      31 0.00067   17.2   0.2   24   22-45      6-29  (84)
188 cd07765 KRAB_A-box KRAB (Krupp  22.6      51  0.0011   11.0   3.0   18   32-49     13-30  (40)
189 PRK08733 lipid A biosynthesis   22.1 1.8E+02  0.0038   17.6   3.3   26   13-38     64-89  (306)
190 PRK08326 ribonucleotide-diphos  21.9      95  0.0021   19.0   2.1   17   32-48     51-67  (311)
191 PF09349 OHCU_decarbox:  OHCU d  21.7 1.6E+02  0.0034   16.2   3.1   28   18-45     31-58  (159)
192 PF11953 DUF3470:  Domain of un  21.5      37 0.00081   14.9   0.3   19   22-40      3-21  (43)
193 cd00183 TFIIS_I N-terminal dom  21.2      65  0.0014   15.5   1.1   21   17-37     54-74  (76)
194 PRK06860 lipid A biosynthesis   21.1   2E+02  0.0042   17.4   3.3   27   12-38     63-89  (309)
195 PF04031 Las1:  Las1-like ;  In  21.1      98  0.0021   17.1   1.9   29    2-30     83-111 (154)
196 PRK09101 nrdB ribonucleotide-d  20.9   1E+02  0.0022   19.5   2.1   16   33-48     62-77  (376)
197 cd08777 Death_RIP1 Death Domai  20.9      47   0.001   16.5   0.6   17   23-39      5-21  (86)
198 PRK02955 small acid-soluble sp  20.8 1.2E+02  0.0027   14.7   2.2   17   32-48     46-62  (68)
199 PF10562 CaM_bdg_C0:  Calmoduli  20.6      85  0.0018   12.7   1.4   14   23-36     13-26  (29)
200 cd03577 NTR_TIMP_like NTR doma  20.6      41 0.00089   17.2   0.3   17   33-49     97-113 (116)
201 cd00155 RasGEF Guanine nucleot  20.6 1.4E+02  0.0031   17.0   2.6   22   29-50    111-132 (237)
202 TIGR02147 Fsuc_second hypothet  20.5 1.9E+02  0.0041   17.6   3.2   27    5-31     12-38  (271)
203 PF04545 Sigma70_r4:  Sigma-70,  20.4      91   0.002   13.3   1.5   14   18-31     18-31  (50)

No 1  
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.84  E-value=1.8e-20  Score=92.81  Aligned_cols=54  Identities=35%  Similarity=0.508  Sum_probs=52.4

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +|||+|+++.+..++.+||++++++|+++||++|+.||+++|.+|.++|++++.
T Consensus         7 naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~   60 (77)
T cd01389           7 NAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKE   60 (77)
T ss_pred             cHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHH
Confidence            589999999999999999999999999999999999999999999999999876


No 2  
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.84  E-value=2.8e-20  Score=91.17  Aligned_cols=54  Identities=24%  Similarity=0.406  Sum_probs=52.4

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +|||+|+++.+..++.+||++++.+|+++||++|+.||+++|.+|.+++..++.
T Consensus         7 naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~   60 (72)
T cd01388           7 NAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKE   60 (72)
T ss_pred             cHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence            589999999999999999999999999999999999999999999999999886


No 3  
>PTZ00199 high mobility group protein; Provisional
Probab=99.82  E-value=7.9e-20  Score=93.67  Aligned_cols=54  Identities=26%  Similarity=0.628  Sum_probs=51.8

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCC--HHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLK--LWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~--~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      |||++|++++|..|..+||+++  +++|+++||++|+.||+++|.+|.+.|+.++.
T Consensus        28 sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~   83 (94)
T PTZ00199         28 SAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKV   83 (94)
T ss_pred             cHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence            6899999999999999999986  89999999999999999999999999999976


No 4  
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.81  E-value=1.4e-19  Score=86.69  Aligned_cols=54  Identities=31%  Similarity=0.609  Sum_probs=52.2

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +||++|+++.+..++.+||++++.+|++.||++|++||+++|.+|.+.+++++.
T Consensus         6 saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~   59 (66)
T cd01390           6 SAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKE   59 (66)
T ss_pred             cHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence            589999999999999999999999999999999999999999999999999876


No 5  
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.78  E-value=1.7e-18  Score=83.61  Aligned_cols=54  Identities=37%  Similarity=0.634  Sum_probs=51.6

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +||++|+.+.+..++.+||+++..+|++++|++|++||+++|.+|.+.+.+++.
T Consensus         6 ~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~   59 (69)
T PF00505_consen    6 NAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKE   59 (69)
T ss_dssp             -HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHH
T ss_pred             CHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence            589999999999999999999999999999999999999999999999999875


No 6  
>smart00398 HMG high mobility group.
Probab=99.77  E-value=4e-18  Score=82.12  Aligned_cols=54  Identities=33%  Similarity=0.546  Sum_probs=52.0

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +||++|+++.+..+..+||++++.+|++.||.+|+.||+++|.+|.+.+++++.
T Consensus         7 ~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~   60 (70)
T smart00398        7 SAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKE   60 (70)
T ss_pred             cHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence            589999999999999999999999999999999999999999999999999865


No 7  
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.76  E-value=1.1e-17  Score=79.80  Aligned_cols=54  Identities=39%  Similarity=0.674  Sum_probs=52.1

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +||++|+++.+..++.+||+++..+|++.+|.+|+.||+++|.+|.+.+++++.
T Consensus         6 ~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~   59 (66)
T cd00084           6 SAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKE   59 (66)
T ss_pred             cHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence            589999999999999999999999999999999999999999999999999875


No 8  
>KOG0527|consensus
Probab=99.74  E-value=4.1e-18  Score=102.43  Aligned_cols=54  Identities=26%  Similarity=0.450  Sum_probs=52.2

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      .|||+|++..|.++..+||+++++||||.||..|+.|+++||.+|+++|++.|+
T Consensus        68 NAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~  121 (331)
T KOG0527|consen   68 NAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRA  121 (331)
T ss_pred             chhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHH
Confidence            489999999999999999999999999999999999999999999999999875


No 9  
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.67  E-value=3.8e-16  Score=76.61  Aligned_cols=54  Identities=24%  Similarity=0.447  Sum_probs=48.7

Q ss_pred             ChHHhHHHHHHHHHHHh-CCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQ-NLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~-~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +||++|+.+++..++.. .+.....|+++.||..|++||+++|.+|++.|++++.
T Consensus         9 say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~   63 (73)
T PF09011_consen    9 SAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKE   63 (73)
T ss_dssp             SHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHH
T ss_pred             CHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHH
Confidence            68999999999999988 7888999999999999999999999999999999886


No 10 
>KOG0381|consensus
Probab=99.65  E-value=1.3e-15  Score=77.64  Aligned_cols=54  Identities=30%  Similarity=0.561  Sum_probs=51.5

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +||++|+.+.+..++.+||+++..+|++++|++|++|++++|.+|...+..++.
T Consensus        28 sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~   81 (96)
T KOG0381|consen   28 SAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKE   81 (96)
T ss_pred             cHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence            589999999999999999999999999999999999999999999999988775


No 11 
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.62  E-value=1.9e-15  Score=86.00  Aligned_cols=54  Identities=24%  Similarity=0.611  Sum_probs=52.0

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      ||||+|+..+|+.++..+|++++.+|++.+|++|++|++++|++|..++..++.
T Consensus        76 sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~e  129 (211)
T COG5648          76 SAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRE  129 (211)
T ss_pred             hHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHH
Confidence            689999999999999999999999999999999999999999999999988775


No 12 
>KOG4715|consensus
Probab=99.39  E-value=6.2e-13  Score=79.92  Aligned_cols=54  Identities=76%  Similarity=1.375  Sum_probs=52.4

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      ++||.|++.+|++|+.+||++..-||.++||.+|..|++++|..|..+++.+|+
T Consensus        70 ~pymrySrkvWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKi  123 (410)
T KOG4715|consen   70 MPYMRYSRKVWDQVKASNPELKLWEIGKIIGGMWLDLPDEEKQEYLNEYEAEKI  123 (410)
T ss_pred             chhhHHhhhhhhhhhccCcchHHHHHHHHHHHHHhhCcchHHHHHHHHHHHHHH
Confidence            579999999999999999999999999999999999999999999999999887


No 13 
>KOG3248|consensus
Probab=99.25  E-value=1.9e-11  Score=74.04  Aligned_cols=54  Identities=22%  Similarity=0.405  Sum_probs=51.3

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      .|||+|.+++|.+|..++--...++|.++||.+|..||.+|..+|.++|.++++
T Consensus       197 NAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerq  250 (421)
T KOG3248|consen  197 NAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQ  250 (421)
T ss_pred             HHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHH
Confidence            389999999999999999878899999999999999999999999999999886


No 14 
>KOG0528|consensus
Probab=99.05  E-value=3.8e-11  Score=75.03  Aligned_cols=52  Identities=19%  Similarity=0.412  Sum_probs=48.4

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQE   52 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~   52 (56)
                      .|||+|.++.|.++...+|++++.+|+++||.+|+.|+..||++|.++-...
T Consensus       331 NAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRL  382 (511)
T KOG0528|consen  331 NAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARL  382 (511)
T ss_pred             chhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHH
Confidence            3899999999999999999999999999999999999999999999876653


No 15 
>KOG0526|consensus
Probab=99.04  E-value=2.9e-10  Score=72.01  Aligned_cols=50  Identities=18%  Similarity=0.440  Sum_probs=47.7

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      +|||+|.+..|..++.+  +.++++|++..|++|+.|+.  |.+|++.|+.+|+
T Consensus       541 sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~  590 (615)
T KOG0526|consen  541 SAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQ  590 (615)
T ss_pred             hHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHH
Confidence            58999999999999987  99999999999999999999  9999999999987


No 16 
>KOG2746|consensus
Probab=98.64  E-value=2.6e-08  Score=64.43  Aligned_cols=54  Identities=22%  Similarity=0.340  Sum_probs=51.0

Q ss_pred             ChHHhHHHHHH--HHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           1 MPYMRYSRKVW--DQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         1 ~ay~lf~~~~~--~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      ++|++|++.++  ..+...||+..+.-|+++||+.|-.|.+.||+.|.+++.+.|.
T Consensus       187 naf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~  242 (683)
T KOG2746|consen  187 NAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKE  242 (683)
T ss_pred             HHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHH
Confidence            47999999999  8999999999999999999999999999999999999998764


No 17 
>PF06382 DUF1074:  Protein of unknown function (DUF1074);  InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=98.41  E-value=1.4e-06  Score=49.14  Aligned_cols=47  Identities=17%  Similarity=0.306  Sum_probs=41.6

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      .||+-|..+++..    |.+++..|+....+..|..|++.+|..|..++..
T Consensus        84 naYLNFLReFRrk----h~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~~  130 (183)
T PF06382_consen   84 NAYLNFLREFRRK----HCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAPS  130 (183)
T ss_pred             hHHHHHHHHHHHH----ccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcch
Confidence            3799999887775    7899999999999999999999999999986543


No 18 
>PF08073 CHDNT:  CHDNT (NUC034) domain;  InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=97.77  E-value=0.00013  Score=34.16  Aligned_cols=37  Identities=16%  Similarity=0.360  Sum_probs=33.4

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      .|-+|++..|+.+.+.||++..+.+..+++.+|+.-+
T Consensus        15 ~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~   51 (55)
T PF08073_consen   15 NYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQ   51 (55)
T ss_pred             HHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHH
Confidence            3668999999999999999999999999999998543


No 19 
>PF04690 YABBY:  YABBY protein;  InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=97.62  E-value=0.00019  Score=40.41  Aligned_cols=38  Identities=24%  Similarity=0.432  Sum_probs=36.0

Q ss_pred             ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907           1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      +||..|.++.-+.++..+|+++..|.-...+..|...|
T Consensus       127 saYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~p  164 (170)
T PF04690_consen  127 SAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFP  164 (170)
T ss_pred             hhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCc
Confidence            68999999999999999999999999999999998765


No 20 
>PF14887 HMG_box_5:  HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=97.47  E-value=0.00091  Score=33.41  Aligned_cols=50  Identities=10%  Similarity=0.233  Sum_probs=42.1

Q ss_pred             HhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           4 MRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         4 ~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      -+|.++....+...+++-...+. +.+...|++|++.+|-+|+..|.++..
T Consensus        12 e~Wqq~vi~dYla~~~~dr~K~~-kam~~~W~~me~Kekl~WIkKA~EdqK   61 (85)
T PF14887_consen   12 EIWQQSVIGDYLAKFRNDRKKAL-KAMEAQWSQMEKKEKLKWIKKAAEDQK   61 (85)
T ss_dssp             HHHHHHHHHHHHHHTTSTHHHHH-HHHHHHHHTTGGGHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHhhHhHHHHH-HHHHHHHHHhhhhhhhHHHHHHHHHHH
Confidence            46778888888888888777774 599999999999999999999988754


No 21 
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=97.41  E-value=0.00012  Score=42.23  Aligned_cols=52  Identities=29%  Similarity=0.533  Sum_probs=45.3

Q ss_pred             HHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907           3 YMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus         3 y~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      |+-+....+..+...+|+-+..+..+++|..|+.|+++-|.+|.+.+..++.
T Consensus       151 ~~e~~~~~r~~~~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~  202 (211)
T COG5648         151 FIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKE  202 (211)
T ss_pred             hhhccHHhccccCCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHH
Confidence            4556667777777888888999999999999999999999999999998764


No 22 
>PF06244 DUF1014:  Protein of unknown function (DUF1014);  InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=96.95  E-value=0.0029  Score=33.99  Aligned_cols=39  Identities=21%  Similarity=0.498  Sum_probs=36.2

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHH
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~   40 (56)
                      ||.-|....-+.++.++|++-.+.+-.+|...|...|+.
T Consensus        79 Ay~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPeN  117 (122)
T PF06244_consen   79 AYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPEN  117 (122)
T ss_pred             HHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCCC
Confidence            689999999999999999999999999999999887753


No 23 
>KOG3223|consensus
Probab=95.58  E-value=0.035  Score=32.13  Aligned_cols=45  Identities=13%  Similarity=0.414  Sum_probs=39.3

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHH
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDY   49 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a   49 (56)
                      ||.-|-...-+.++.+||++..+..-.+|-..|...|+.   ||.+.+
T Consensus       171 A~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN---P~Nq~~  215 (221)
T KOG3223|consen  171 AFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN---PFNQAA  215 (221)
T ss_pred             HHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCC---hhhHHh
Confidence            688899999999999999999999999999999988875   665543


No 24 
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=95.43  E-value=0.044  Score=31.67  Aligned_cols=33  Identities=15%  Similarity=0.432  Sum_probs=29.3

Q ss_pred             CCCCHHHHHH-HHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          19 LDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        19 p~~~~~ei~k-~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      |...+..+++ .+|.-|+.+|+++|..|.+....
T Consensus        66 p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~   99 (211)
T PRK15117         66 PYVQVKYAGALVLGRYYKDATPAQREAYFAAFRE   99 (211)
T ss_pred             ccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHH
Confidence            7788999986 68999999999999999988765


No 25 
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=95.14  E-value=0.049  Score=31.16  Aligned_cols=33  Identities=15%  Similarity=0.533  Sum_probs=28.7

Q ss_pred             CCCCHHHHHH-HHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          19 LDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        19 p~~~~~ei~k-~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      |...+..+++ .+|..|+.+|+++|+.|.+....
T Consensus        62 ~~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~   95 (198)
T TIGR03481        62 EAFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRE   95 (198)
T ss_pred             HhCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHH
Confidence            5678888887 68999999999999999988765


No 26 
>PF05494 Tol_Tol_Ttg2:  Toluene tolerance, Ttg2 ;  InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=94.77  E-value=0.035  Score=30.66  Aligned_cols=33  Identities=21%  Similarity=0.545  Sum_probs=26.9

Q ss_pred             CCCCHHHHHHH-HHHHhcCCCHHHHHHHHHHHHH
Q psy5907          19 LDLKLWEIGKI-IGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        19 p~~~~~ei~k~-l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      |...+..+++. ||..|+.+|++++..|.+....
T Consensus        36 ~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~   69 (170)
T PF05494_consen   36 PYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQ   69 (170)
T ss_dssp             GGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHH
T ss_pred             HhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHH
Confidence            77888888865 8889999999999999988765


No 27 
>PF06945 DUF1289:  Protein of unknown function (DUF1289);  InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=92.86  E-value=0.46  Score=21.65  Aligned_cols=29  Identities=17%  Similarity=0.463  Sum_probs=22.6

Q ss_pred             CCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhh
Q psy5907          20 DLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEK   53 (56)
Q Consensus        20 ~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k   53 (56)
                      ..+..||..     |+.|++++|..........+
T Consensus        21 ~RT~dEI~~-----W~~~s~~er~~i~~~l~~R~   49 (51)
T PF06945_consen   21 GRTLDEIRD-----WKSMSDDERRAILARLRARR   49 (51)
T ss_pred             CCcHHHHHH-----HhhCCHHHHHHHHHHHHHHh
Confidence            457888877     99999999988776665544


No 28 
>PF13875 DUF4202:  Domain of unknown function (DUF4202)
Probab=92.45  E-value=0.63  Score=26.74  Aligned_cols=40  Identities=15%  Similarity=0.341  Sum_probs=33.4

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHH
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTE   44 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~   44 (56)
                      +.++|...+...+...|   ....+..+|...|+.||+.-++-
T Consensus       131 acLVFL~~~f~~F~~~~---deeK~v~Il~KTw~KMS~~g~~~  170 (185)
T PF13875_consen  131 ACLVFLEYYFEDFAAKH---DEEKIVDILRKTWRKMSERGHEA  170 (185)
T ss_pred             HHHHhHHHHHHHHHhcC---CHHHHHHHHHHHHHHCCHHHHHH
Confidence            46788888888888766   77889999999999999987753


No 29 
>PF09164 VitD-bind_III:  Vitamin D binding protein, domain III;  InterPro: IPR015247 This domain is predominantly found in Vitamin D binding proteins, and adopts a multihelical structure. It is required for formation of an actin 'clamp', allowing the protein to bind to actin []. ; PDB: 1MA9_A 1KW2_A 1KXP_D 1J7E_A 1J78_A 1LOT_A.
Probab=90.08  E-value=1.3  Score=21.47  Aligned_cols=32  Identities=13%  Similarity=0.345  Sum_probs=24.9

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHH
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQM   33 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~   33 (56)
                      .|.-|-+...+.++...|+.+..++..++.++
T Consensus        10 tFtEyKKrL~e~l~~k~P~at~~~l~~lve~R   41 (68)
T PF09164_consen   10 TFTEYKKRLAERLRAKLPDATPTELKELVEKR   41 (68)
T ss_dssp             -HHHHHHHHHHHHHHH-TTS-HHHHHHHHHHH
T ss_pred             cHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH
Confidence            36778888889999999999999999887654


No 30 
>PF01352 KRAB:  KRAB box;  InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=88.05  E-value=0.65  Score=20.27  Aligned_cols=29  Identities=17%  Similarity=0.274  Sum_probs=16.4

Q ss_pred             HHHHHHHHH-HHhcCCCHHHHHHHHHHHHH
Q psy5907          23 LWEIGKIIG-QMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        23 ~~ei~k~l~-~~W~~l~~~eK~~y~~~a~~   51 (56)
                      +.+|+--.+ +.|..|.+.+|..|.+.-.+
T Consensus         3 f~Dvav~fs~eEW~~L~~~Qk~ly~dvm~E   32 (41)
T PF01352_consen    3 FEDVAVYFSQEEWELLDPAQKNLYRDVMLE   32 (41)
T ss_dssp             ----TT---HHHHHTS-HHHHHHHHHHHHH
T ss_pred             EEEEEEEcChhhcccccceecccchhHHHH
Confidence            444544444 56999999999999876543


No 31 
>PF12650 DUF3784:  Domain of unknown function (DUF3784);  InterPro: IPR017259 This group represents an uncharacterised conserved protein.
Probab=86.99  E-value=0.55  Score=23.76  Aligned_cols=16  Identities=13%  Similarity=0.430  Sum_probs=13.8

Q ss_pred             HhcCCCHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVED   48 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~   48 (56)
                      -+++||++||+.|++.
T Consensus        25 Gyntms~eEk~~~D~~   40 (97)
T PF12650_consen   25 GYNTMSKEEKEKYDKK   40 (97)
T ss_pred             hcccCCHHHHHHhhHH
Confidence            4899999999999854


No 32 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=86.14  E-value=3.4  Score=21.54  Aligned_cols=40  Identities=15%  Similarity=0.288  Sum_probs=25.4

Q ss_pred             HHHHHhCCCCCHHHHHHHHH--HHhcCCCHHHHHHHHHHHHH
Q psy5907          12 DQVKAQNLDLKLWEIGKIIG--QMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        12 ~~~~~~~p~~~~~ei~k~l~--~~W~~l~~~eK~~y~~~a~~   51 (56)
                      -.+....+.+++.+-.++..  ..|..||++++..-...+..
T Consensus        32 l~~a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R~~~~~   73 (107)
T PF11304_consen   32 LQIAERWPSMSPEQQQRLRERMRRWAALSPEQRQQARENYQR   73 (107)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence            34555567777776665544  46788888877766555543


No 33 
>PRK10236 hypothetical protein; Provisional
Probab=84.93  E-value=1.5  Score=26.11  Aligned_cols=31  Identities=19%  Similarity=0.610  Sum_probs=23.6

Q ss_pred             CCCCHHHH-----HHHHHHHhcCCCHHHHHHHHHHH
Q psy5907          19 LDLKLWEI-----GKIIGQMWRDLPEDQKTEYVEDY   49 (56)
Q Consensus        19 p~~~~~ei-----~k~l~~~W~~l~~~eK~~y~~~a   49 (56)
                      +..+..+|     .+.+++.|+.||++|++.+.+.-
T Consensus       106 ~~~st~~iE~~il~kll~~a~~kms~eE~~~L~~~l  141 (237)
T PRK10236        106 KEMSTFEIEQQLLEQFLRNTWKKMDEEHKQEFLHAV  141 (237)
T ss_pred             CCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH
Confidence            34555554     47789999999999998877543


No 34 
>PF04769 MAT_Alpha1:  Mating-type protein MAT alpha 1;  InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=83.50  E-value=3.6  Score=23.88  Aligned_cols=42  Identities=17%  Similarity=0.212  Sum_probs=29.4

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHH
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDY   49 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a   49 (56)
                      +||.|+.=.-    ...|+....++|..|+..|..=+-  |..|.-.+
T Consensus        50 ~Fm~FRsyy~----~~~~~~~Qk~~S~~l~~lW~~dp~--k~~W~l~a   91 (201)
T PF04769_consen   50 GFMAFRSYYS----PIFPPLPQKELSGILTKLWEKDPF--KNKWSLMA   91 (201)
T ss_pred             HHHHHHHHHH----hhcCCcCHHHHHHHHHHHHhCCcc--HhHHHHHh
Confidence            5777766444    446899999999999999986433  34444333


No 35 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=83.06  E-value=4  Score=21.26  Aligned_cols=17  Identities=18%  Similarity=0.517  Sum_probs=8.3

Q ss_pred             HHHHhcCCCHHHHHHHH
Q psy5907          30 IGQMWRDLPEDQKTEYV   46 (56)
Q Consensus        30 l~~~W~~l~~~eK~~y~   46 (56)
                      ++..|..||++++..+.
T Consensus        34 ~a~r~~~mspeqq~r~~   50 (107)
T PF11304_consen   34 IAERWPSMSPEQQQRLR   50 (107)
T ss_pred             HHHHHhcCCHHHHHHHH
Confidence            34455555555554443


No 36 
>PF13945 NST1:  Salt tolerance down-regulator
Probab=82.71  E-value=3.3  Score=23.94  Aligned_cols=30  Identities=17%  Similarity=0.340  Sum_probs=23.6

Q ss_pred             CHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          22 KLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        22 ~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      +..+-...|-+.|-+|+++||...+..-++
T Consensus        99 Ss~eEre~LkeFW~SL~eeERr~LVkIEKe  128 (190)
T PF13945_consen   99 SSQEEREKLKEFWESLSEEERRSLVKIEKE  128 (190)
T ss_pred             hhHHHHHHHHHHHHccCHHHHHHHHHhhHH
Confidence            355667789999999999999987755443


No 37 
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=81.84  E-value=4.3  Score=22.81  Aligned_cols=27  Identities=26%  Similarity=0.385  Sum_probs=22.3

Q ss_pred             HHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          25 EIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        25 ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      +..+...+++..|++++|..|.+..++
T Consensus       118 ~~~~~~~qmy~lLTPEQra~l~~~~e~  144 (162)
T PRK12751        118 EMAKVRNQMYNLLTPEQKEALNKKHQE  144 (162)
T ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence            445666788899999999999988775


No 38 
>PF12881 NUT_N:  NUT protein N terminus;  InterPro: IPR024309 This domain is found in the N-terminal region of Nuclear Testis (NUT) proteins. It is also found in FAM22, which are a family of uncharacterised mammalian proteins.
Probab=81.47  E-value=9.8  Score=23.77  Aligned_cols=49  Identities=14%  Similarity=0.030  Sum_probs=39.9

Q ss_pred             HHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907           3 YMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus         3 y~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      +--|..-.-..+....|.++..|-....-..|.-.|.-+|..|.++|++
T Consensus       232 lSCFLIpvLrsLar~kPtMtlEeGl~ra~qEW~~~SnfdRmifyemaek  280 (328)
T PF12881_consen  232 LSCFLIPVLRSLARLKPTMTLEEGLWRAVQEWQHTSNFDRMIFYEMAEK  280 (328)
T ss_pred             hhhhHHHHHHHHHhcCCCccHHHHHHHHHHHhhccccccHHHHHHHHHH
Confidence            4445555555566677999999998888899999999999999999876


No 39 
>KOG1610|consensus
Probab=79.65  E-value=7.4  Score=24.30  Aligned_cols=30  Identities=33%  Similarity=0.680  Sum_probs=26.5

Q ss_pred             HHHHHHHHHHHhcCCCHHHHHHHHHHHHHh
Q psy5907          23 LWEIGKIIGQMWRDLPEDQKTEYVEDYEQE   52 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~   52 (56)
                      ...+.+.+.+.|..+|++.|+.|-+.+-.+
T Consensus       218 ~~~~~~~~~~~w~~l~~e~k~~YGedy~~~  247 (322)
T KOG1610|consen  218 PEKLEKRMKEIWERLPQETKDEYGEDYFED  247 (322)
T ss_pred             hHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Confidence            578889999999999999999999887664


No 40 
>PF13412 HTH_24:  Winged helix-turn-helix DNA-binding; PDB: 1I1G_B 2IA0_B 3I4P_A 2GQQ_A 2L4A_A 2CFX_B 2DBB_B 2EFO_A 2EFQ_A 2PN6_A ....
Probab=77.67  E-value=3.5  Score=17.85  Aligned_cols=22  Identities=14%  Similarity=0.128  Sum_probs=15.3

Q ss_pred             HHhCCCCCHHHHHHHHHHHhcC
Q psy5907          15 KAQNLDLKLWEIGKIIGQMWRD   36 (56)
Q Consensus        15 ~~~~p~~~~~ei~k~l~~~W~~   36 (56)
                      ..++|+++..++++.+|=.+..
T Consensus        12 l~~~~~~t~~ela~~~~is~~t   33 (48)
T PF13412_consen   12 LRENPRITQKELAEKLGISRST   33 (48)
T ss_dssp             HHHCTTS-HHHHHHHHTS-HHH
T ss_pred             HHHcCCCCHHHHHHHhCCCHHH
Confidence            3568999999999988754443


No 41 
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=74.78  E-value=9.3  Score=21.69  Aligned_cols=29  Identities=31%  Similarity=0.367  Sum_probs=23.8

Q ss_pred             HHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          23 LWEIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      ..+..++-.++.+-|+|++|..|.+..++
T Consensus       110 ~Vem~k~~nqmy~lLTPEQKaq~~~~~~~  138 (166)
T PRK10363        110 QVEMAKVRNQMYRLLTPEQQAVLNEKHQQ  138 (166)
T ss_pred             HHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence            34566777789999999999999988765


No 42 
>PF15581 Imm35:  Immunity protein 35
Probab=73.29  E-value=11  Score=19.37  Aligned_cols=28  Identities=29%  Similarity=0.527  Sum_probs=21.2

Q ss_pred             CCHHHHHHHHHHHhcCCCHHHHHHHHHH
Q psy5907          21 LKLWEIGKIIGQMWRDLPEDQKTEYVED   48 (56)
Q Consensus        21 ~~~~ei~k~l~~~W~~l~~~eK~~y~~~   48 (56)
                      .+...+...|.+.|+.||+++=..-.+.
T Consensus        30 ~~i~~l~~lIe~eWRGl~~~qV~~kl~a   57 (93)
T PF15581_consen   30 RTIRNLESLIEHEWRGLPEEQVLYKLEA   57 (93)
T ss_pred             HHHHHHHHHHHHHHcCCCHHHHHHHHHH
Confidence            4567788899999999999875544433


No 43 
>PF12290 DUF3802:  Protein of unknown function (DUF3802);  InterPro: IPR020979  This family of proteins is found in bacteria and are typically between 114 and 143 amino acids in length. There is a conserved KNLFD sequence motif. The annotation with this family suggests that it may be the B subunit of bacterial type IIA DNA topoisomerase but there is no evidence to support this annotation. 
Probab=73.01  E-value=12  Score=19.93  Aligned_cols=28  Identities=7%  Similarity=0.657  Sum_probs=22.1

Q ss_pred             HHHHHHHHHHHhcCCCHHHHHHHHHHHH
Q psy5907          23 LWEIGKIIGQMWRDLPEDQKTEYVEDYE   50 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~   50 (56)
                      ..++...|+.-|...+-.+..-|.+++-
T Consensus        73 v~DLeEVLa~V~~~~aT~eQ~~Fi~Ef~  100 (113)
T PF12290_consen   73 VYDLEEVLASVWNQKATNEQIAFIEEFI  100 (113)
T ss_pred             HHHHHHHHHHHHcCCCCHHHHHHHHHHH
Confidence            3567788999999988888888877653


No 44 
>cd08317 Death_ank Death domain associated with Ankyrins. Death Domain (DD) associated with Ankyrins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. Ankyrins function as adaptor proteins and they interact, through ANK repeats, with structurally diverse membrane proteins, including ion channels/pumps, calcium release channels, and cell adhesion molecules. They play critical roles in the proper expression and membrane localization of these proteins. In mammals, this family includes ankyrin-R for restricted (or ANK1), ankyrin-B for broadly expressed (or ANK2) and ankyrin-G for general or giant (or ANK3). They are expressed in different combinations in many tissues and play non-overlapping functions. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-associati
Probab=72.88  E-value=1.9  Score=21.28  Aligned_cols=22  Identities=27%  Similarity=0.732  Sum_probs=18.1

Q ss_pred             CCCCHHHHHHHHHHHhcCCCHH
Q psy5907          19 LDLKLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~l~~~   40 (56)
                      |+.....|++.||.-|+.|..+
T Consensus         3 ~~~~l~~ia~~lG~dW~~LAr~   24 (84)
T cd08317           3 ADIRLADISNLLGSDWPQLARE   24 (84)
T ss_pred             ccchHHHHHHHHhhHHHHHHHH
Confidence            5778899999999999877543


No 45 
>PF05388 Carbpep_Y_N:  Carboxypeptidase Y pro-peptide;  InterPro: IPR008442 This signature is found at the N terminus of carboxypeptidase Y, which belong to MEROPS peptidase family S10. This region contains the signal peptide and pro-peptide regions [,].; GO: 0004185 serine-type carboxypeptidase activity, 0005773 vacuole
Probab=72.35  E-value=12  Score=19.87  Aligned_cols=28  Identities=21%  Similarity=0.233  Sum_probs=24.0

Q ss_pred             HHHHHHHHHHHhcCCCHHHHHHHHHHHH
Q psy5907          23 LWEIGKIIGQMWRDLPEDQKTEYVEDYE   50 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~   50 (56)
                      ...+++.+++..+.|+.+-|..|.+...
T Consensus        45 ~~~~~~~l~e~l~~Lt~e~k~~W~E~~~   72 (113)
T PF05388_consen   45 LEKISKYLNEPLKSLTSEAKALWDEMML   72 (113)
T ss_pred             HHHHHHHHHHHHhhccHHHHHHHHHHHH
Confidence            5566778999999999999999998765


No 46 
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=70.98  E-value=5  Score=23.45  Aligned_cols=33  Identities=9%  Similarity=0.354  Sum_probs=25.0

Q ss_pred             CCCCHHHHHH-HHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          19 LDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        19 p~~~~~ei~k-~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      |-..+.=+++ .||.-|+.+|+++++.|.+...+
T Consensus        68 p~vd~~~~a~~vLGk~~k~aspeQ~~~F~~aF~~  101 (202)
T COG2854          68 PYVDFKYAAKLVLGKYYKTASPEQRQAFFKAFRT  101 (202)
T ss_pred             hhhcHHHHHHHHhccccccCCHHHHHHHHHHHHH
Confidence            4445444443 48999999999999999987765


No 47 
>PF08367 M16C_assoc:  Peptidase M16C associated;  InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=67.56  E-value=18  Score=21.14  Aligned_cols=33  Identities=18%  Similarity=0.120  Sum_probs=26.6

Q ss_pred             HHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhhc
Q psy5907          23 LWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKIY   55 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~~   55 (56)
                      ..+..+.|.+...+|++++++...+..++.+.+
T Consensus        12 ~~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~~~   44 (248)
T PF08367_consen   12 EEEEKEKLAAYKASLSEEEKEKIIEQTKELKER   44 (248)
T ss_dssp             HHHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHH
Confidence            356778899999999999999999888876653


No 48 
>COG3313 Predicted Fe-S protein [General function prediction only]
Probab=66.38  E-value=8.6  Score=19.04  Aligned_cols=22  Identities=18%  Similarity=0.362  Sum_probs=16.9

Q ss_pred             CCCHHHHHHHHHHHhcCCCHHHHHHHH
Q psy5907          20 DLKLWEIGKIIGQMWRDLPEDQKTEYV   46 (56)
Q Consensus        20 ~~~~~ei~k~l~~~W~~l~~~eK~~y~   46 (56)
                      ..+..||..     |..||++||.---
T Consensus        26 ~Rt~~Ei~~-----W~~msd~Er~aVl   47 (74)
T COG3313          26 GRTRDEIFN-----WSSMSDDERRAVL   47 (74)
T ss_pred             cccHHHHHH-----HhhCCHHHHHHHH
Confidence            346778877     9999999987544


No 49 
>PF06628 Catalase-rel:  Catalase-related immune-responsive;  InterPro: IPR010582 Catalases (1.11.1.6 from EC) are antioxidant enzymes that catalyse the conversion of hydrogen peroxide to water and molecular oxygen, serving to protect cells from its toxic effects []. Hydrogen peroxide is produced as a consequence of oxidative cellular metabolism and can be converted to the highly reactive hydroxyl radical via transition metals, this radical being able to damage a wide variety of molecules within a cell, leading to oxidative stress and cell death. Catalases act to neutralise hydrogen peroxide toxicity, and are produced by all aerobic organisms ranging from bacteria to man. Most catalases are mono-functional, haem-containing enzymes, although there are also bifunctional haem-containing peroxidase/catalases (IPR000763 from INTERPRO) that are closely related to plant peroxidases, and non-haem, manganese-containing catalases (IPR007760 from INTERPRO) that are found in bacteria []. This entry represents a small conserved region within catalase enzymes that carries the immune-responsive amphipathic octa-peptide that is recognised by T cells [].; PDB: 2CAH_A 1NM0_A 1H7K_A 1E93_A 1H6N_A 3HB6_A 2CAG_A 1M85_A 1MQF_A 1A4E_C ....
Probab=66.04  E-value=6.4  Score=18.71  Aligned_cols=19  Identities=21%  Similarity=0.553  Sum_probs=15.5

Q ss_pred             HHHHhcCCCHHHHHHHHHH
Q psy5907          30 IGQMWRDLPEDQKTEYVED   48 (56)
Q Consensus        30 l~~~W~~l~~~eK~~y~~~   48 (56)
                      -+..|+.|++++|..+...
T Consensus        13 a~~ly~~l~~~er~~lv~n   31 (68)
T PF06628_consen   13 ARDLYRVLSDEERERLVEN   31 (68)
T ss_dssp             HHHHHHHSSHHHHHHHHHH
T ss_pred             HHHHHHHCCHHHHHHHHHH
Confidence            4567999999999988754


No 50 
>PRK10455 periplasmic protein; Reviewed
Probab=64.65  E-value=23  Score=19.81  Aligned_cols=27  Identities=15%  Similarity=0.349  Sum_probs=21.3

Q ss_pred             HHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          25 EIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        25 ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      +..+.-...+..|++++|..|.+..++
T Consensus       118 ~~~~~~~qiy~vLTPEQr~q~~~~~ek  144 (161)
T PRK10455        118 AHMETQNKIYNVLTPEQKKQFNANFEK  144 (161)
T ss_pred             HHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence            344555677899999999999988765


No 51 
>PF07813 LTXXQ:  LTXXQ motif family protein;  InterPro: IPR012899 This five residue motif is found in a number of bacterial proteins bearing similarity to the protein CpxP (P32158 from SWISSPROT). This is a periplasmic protein that aids in combating extracytoplasmic protein-mediated toxicity, and may also be involved in the response to alkaline pH []. Another member of this family, Spy (P77754 from SWISSPROT) is also a periplasmic protein that may be involved in the response to stress []. The homology between CpxP and Spy may indicate that these two proteins are functionally related []. The motif is found repeated twice in many members of this entry. ; GO: 0042597 periplasmic space; PDB: 3ITF_B 3QZC_B 3OEO_D 3O39_A.
Probab=63.52  E-value=16  Score=17.76  Aligned_cols=25  Identities=20%  Similarity=0.174  Sum_probs=18.6

Q ss_pred             HHHHHHHHHHhcCCCHHHHHHHHHH
Q psy5907          24 WEIGKIIGQMWRDLPEDQKTEYVED   48 (56)
Q Consensus        24 ~ei~k~l~~~W~~l~~~eK~~y~~~   48 (56)
                      ..+.......+..|++++|..|..+
T Consensus        75 ~~~~~~~~~~~~vLt~eQk~~~~~l   99 (100)
T PF07813_consen   75 EERAKAQHALYAVLTPEQKEKFDQL   99 (100)
T ss_dssp             HHHHHHHHHHHTTS-HHHHHHHHHH
T ss_pred             HHHHHHHHHHHhcCCHHHHHHHHHh
Confidence            4455667788999999999988754


No 52 
>PRK00523 hypothetical protein; Provisional
Probab=62.56  E-value=18  Score=17.83  Aligned_cols=36  Identities=25%  Similarity=0.196  Sum_probs=28.0

Q ss_pred             HHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHH
Q psy5907           6 YSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQ   41 (56)
Q Consensus         6 f~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~e   41 (56)
                      +.+..-.+.-.+||.++..-|..+...+-+..|+.+
T Consensus        25 iark~~~k~l~~NPpine~mir~M~~QMGqKPSekk   60 (72)
T PRK00523         25 VSKKMFKKQIRENPPITENMIRAMYMQMGRKPSESQ   60 (72)
T ss_pred             HHHHHHHHHHHHCcCCCHHHHHHHHHHhCCCccHHH
Confidence            334455566678999999999999999998887653


No 53 
>PF03672 UPF0154:  Uncharacterised protein family (UPF0154);  InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=61.42  E-value=18  Score=17.42  Aligned_cols=34  Identities=26%  Similarity=0.208  Sum_probs=26.5

Q ss_pred             HHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHH
Q psy5907           8 RKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQ   41 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~e   41 (56)
                      .....+.-.+||.++..-|..+...+-+..|+.+
T Consensus        19 r~~~~k~l~~NPpine~mir~M~~QMG~kpSekq   52 (64)
T PF03672_consen   19 RKYMEKQLKENPPINEKMIRAMMMQMGRKPSEKQ   52 (64)
T ss_pred             HHHHHHHHHHCCCCCHHHHHHHHHHhCCCccHHH
Confidence            3334445578999999999999999999888764


No 54 
>PRK14291 chaperone protein DnaJ; Provisional
Probab=61.08  E-value=33  Score=21.56  Aligned_cols=37  Identities=16%  Similarity=0.151  Sum_probs=27.5

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKTE   44 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~~   44 (56)
                      +.+|...+.-||+.+    ..+.-+.|.+.|..|++..|..
T Consensus        22 ~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vLsd~~kR~   62 (382)
T PRK14291         22 KAYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRK   62 (382)
T ss_pred             HHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhcCHHHHH
Confidence            456677777889875    3466678999999998877553


No 55 
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=60.46  E-value=29  Score=19.56  Aligned_cols=24  Identities=25%  Similarity=0.231  Sum_probs=19.4

Q ss_pred             HHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          28 KIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        28 k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      +...+....|++++|..|.+..++
T Consensus       128 ~~~~~~~~vLTpEQRak~~e~~~~  151 (170)
T PRK12750        128 EKRHQMLSILTPEQKAKFQELQQE  151 (170)
T ss_pred             HHHHHHHHhCCHHHHHHHHHHHHH
Confidence            345567999999999999987655


No 56 
>PRK10266 curved DNA-binding protein CbpA; Provisional
Probab=59.41  E-value=38  Score=20.60  Aligned_cols=40  Identities=18%  Similarity=0.251  Sum_probs=28.8

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYVE   47 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~~   47 (56)
                      +.+|...+.-||+.+    ..+.-+.+.+.|..|+...|. .|..
T Consensus        23 ~ayr~la~k~HPD~~~~~~~~~~f~~i~~Ay~~L~~~~kr~~yD~   67 (306)
T PRK10266         23 TAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQ   67 (306)
T ss_pred             HHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence            456666777789865    667778999999999865544 5543


No 57 
>PRK14299 chaperone protein DnaJ; Provisional
Probab=58.59  E-value=39  Score=20.43  Aligned_cols=39  Identities=18%  Similarity=0.136  Sum_probs=27.6

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +..|...+.-||+.+    ..+.-+.|.+.|..|++.+|. .|.
T Consensus        23 ~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~~L~d~~kr~~yD   66 (291)
T PRK14299         23 KAFKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRIYD   66 (291)
T ss_pred             HHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhcCHHHHHHHH
Confidence            455666677788865    456667899999999876654 444


No 58 
>PRK14281 chaperone protein DnaJ; Provisional
Probab=58.52  E-value=28  Score=22.05  Aligned_cols=40  Identities=18%  Similarity=0.104  Sum_probs=29.1

Q ss_pred             HHHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           7 SRKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         7 ~~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      .+.+|...+.-||+.+.     .+.-+.|.+.|..|++.+|. .|.
T Consensus        21 kkayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD   66 (397)
T PRK14281         21 KKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRRRYD   66 (397)
T ss_pred             HHHHHHHHHHHCCCcCCCchHHHHHHHHHHHHHHHhhhhhhhhhhh
Confidence            35567777788898753     46778999999999977654 444


No 59 
>PHA03102 Small T antigen; Reviewed
Probab=58.18  E-value=26  Score=19.51  Aligned_cols=37  Identities=14%  Similarity=0.084  Sum_probs=27.4

Q ss_pred             HHHHHHHHHHhCCCCC-HHHHHHHHHHHhcCCCHHHHH
Q psy5907           7 SRKVWDQVKAQNLDLK-LWEIGKIIGQMWRDLPEDQKT   43 (56)
Q Consensus         7 ~~~~~~~~~~~~p~~~-~~ei~k~l~~~W~~l~~~eK~   43 (56)
                      .+..|...+.-||+.. ..+..+.|++.|..|++..+.
T Consensus        25 KkAYr~la~~~HPDkgg~~e~~k~in~Ay~~L~d~~~r   62 (153)
T PHA03102         25 RKAYLRKCLEFHPDKGGDEEKMKELNTLYKKFRESVKS   62 (153)
T ss_pred             HHHHHHHHHHHCcCCCchhHHHHHHHHHHHHHhhHHHh
Confidence            3556777788899875 456678889999888876654


No 60 
>PF02026 RyR:  RyR domain;  InterPro: IPR003032 This domain is called RyR for Ryanodine receptor []. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown.; PDB: 4ETV_A 3RQR_A 4ETT_A 4ERT_A 4ESU_A 4ETU_A 4ERV_A 3NRT_E.
Probab=57.96  E-value=14  Score=18.82  Aligned_cols=19  Identities=16%  Similarity=0.295  Sum_probs=14.6

Q ss_pred             HhcCCCHHHHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~a~~   51 (56)
                      -|..||+++|..+...+.+
T Consensus        60 py~~L~e~eK~~dr~~~~e   78 (94)
T PF02026_consen   60 PYDELSEEEKEKDRDMVRE   78 (94)
T ss_dssp             -GGGS-HHHHHHHHHHHHH
T ss_pred             ChhhCCHHHHHHhHHHHHH
Confidence            4899999999998877765


No 61 
>PRK14279 chaperone protein DnaJ; Provisional
Probab=57.69  E-value=27  Score=22.06  Aligned_cols=40  Identities=20%  Similarity=0.140  Sum_probs=28.9

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHHH
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYVE   47 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~~   47 (56)
                      +.+|...+.-||+.+.     .+.-+.|.+.+..|++.+|. .|..
T Consensus        28 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vLsD~~KR~~YD~   73 (392)
T PRK14279         28 KAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRKEYDE   73 (392)
T ss_pred             HHHHHHHHHHCcCCCCCChHHHHHHHHHHHHHHHhcchhhhhHHHH
Confidence            4567777788898753     46667899999999987765 4443


No 62 
>PF11521 TFIIE-A_C-term:  C-terminal general transcription factor TFIIE alpha;  InterPro: IPR021600  TFIIE is compiled of two subunits, alpha and beta. This family of proteins are the C-terminal domain of the alpha subunit of the protein which is the largest subunit and contains several functional domains which are important for basal transcription and cell growth. The C-terminal end of the protein binds directly to the amino-terminal PH domain of p62/Tfb1 (of IIH) which is involved in the recruitment of the general transcription factor IIH to the transcription preinitiation complex. P53 competes for the same binding site as TFIIE alpha which shows their structural similarity. Like p53, TFIIE alpha 336-439 can activate transcription in vivo []. ; PDB: 2RNR_A 2RNQ_A 2JTX_A.
Probab=57.47  E-value=14  Score=18.68  Aligned_cols=30  Identities=10%  Similarity=0.206  Sum_probs=19.0

Q ss_pred             CHHHHHHHHHHHhcCCCHHHHHHHHHHHHHh
Q psy5907          22 KLWEIGKIIGQMWRDLPEDQKTEYVEDYEQE   52 (56)
Q Consensus        22 ~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~   52 (56)
                      ...+|+. --+.=..|+++||.-|++..++.
T Consensus        51 ~~~eVtq-~p~LV~qMT~~EKEaYi~v~Q~~   80 (86)
T PF11521_consen   51 PYSEVTQ-RPELVAQMTPEEKEAYIQVGQEM   80 (86)
T ss_dssp             EHHHHHH--HHHHHHS-HHHHHHHHHHHHHH
T ss_pred             ehhhcCc-chHHHHHcCHHHHHHHHHHHHHH
Confidence            3566660 01222479999999999988764


No 63 
>PF02376 CUT:  CUT domain;  InterPro: IPR003350 A class, also called ONECUT, of homeodomain proteins. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain (IPR001356 from INTERPRO), often found downstream of the CUT domain. Proteins display two modes of DNA binding, which hinge on the homeodomain and on the linker that separates it from the cut domain, and two modes of transcriptional stimulation, which hinge on the homeodomain [].; GO: 0003677 DNA binding; PDB: 1WH6_A 2D5V_A 2CSF_A 1X2L_A 2O49_A 2O4A_A 1YSE_A 1S7E_A 1WIZ_A 1WH8_A.
Probab=57.36  E-value=14  Score=18.55  Aligned_cols=32  Identities=16%  Similarity=0.442  Sum_probs=25.3

Q ss_pred             CCCCHHHHHHHHH--HHhcCCCHHHHHHHHHHHH
Q psy5907          19 LDLKLWEIGKIIG--QMWRDLPEDQKTEYVEDYE   50 (56)
Q Consensus        19 p~~~~~ei~k~l~--~~W~~l~~~eK~~y~~~a~   50 (56)
                      -+.+.+-++.+|.  ..|..|.+..|..|..++.
T Consensus        37 L~rsqgtlsdlL~~PKpW~~L~~~gre~y~RM~n   70 (87)
T PF02376_consen   37 LNRSQGTLSDLLRKPKPWNKLKPKGREPYIRMYN   70 (87)
T ss_dssp             TSS-HHHHHHHHHSCCTTCHHHHHHHHHHHHHHH
T ss_pred             hccChhHHHHHhCCCCCHHHcCccccHHHHHHHH
Confidence            4677888888888  6799999999999988763


No 64 
>PRK14283 chaperone protein DnaJ; Provisional
Probab=57.31  E-value=46  Score=20.90  Aligned_cols=40  Identities=20%  Similarity=0.201  Sum_probs=29.2

Q ss_pred             HHHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           7 SRKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         7 ~~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      .+.++...++-||+.+    ..+.-+.|.+.|..|++.+|. .|.
T Consensus        23 k~ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~~Lsd~~kR~~YD   67 (378)
T PRK14283         23 KKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQRYD   67 (378)
T ss_pred             HHHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhchhHHHHHHh
Confidence            3456667777889875    456778899999999888754 443


No 65 
>PRK14280 chaperone protein DnaJ; Provisional
Probab=55.67  E-value=49  Score=20.77  Aligned_cols=40  Identities=25%  Similarity=0.250  Sum_probs=28.3

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHH-HHHHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQK-TEYVE   47 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK-~~y~~   47 (56)
                      +.+|...+.-||+.+    ..+.-+.|.+.|..|++.+| ..|..
T Consensus        23 ~ayr~la~~~HpD~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD~   67 (376)
T PRK14280         23 KAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDQKRAQYDQ   67 (376)
T ss_pred             HHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhccHhHHHHHHh
Confidence            455666667788764    55777889999999986664 45543


No 66 
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=55.60  E-value=11  Score=16.76  Aligned_cols=33  Identities=15%  Similarity=0.006  Sum_probs=20.5

Q ss_pred             HHHHHHHHHhCCCCCH------HHHHHHHHHHhcCCCHH
Q psy5907           8 RKVWDQVKAQNLDLKL------WEIGKIIGQMWRDLPED   40 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~------~ei~k~l~~~W~~l~~~   40 (56)
                      +..+..++.-||+...      .+....|.+.|..|...
T Consensus        20 ~ay~~l~~~~HPD~~~~~~~~~~~~~~~l~~Ay~~L~~~   58 (60)
T smart00271       20 KAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDP   58 (60)
T ss_pred             HHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHHHcCC
Confidence            4455566666887654      35666677777666543


No 67 
>PRK14296 chaperone protein DnaJ; Provisional
Probab=55.16  E-value=46  Score=20.92  Aligned_cols=39  Identities=18%  Similarity=0.103  Sum_probs=27.6

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...++-||+.+    ..+-=+.|++.+..|++.+|. .|.
T Consensus        23 ~ayrkla~~~HPD~n~~~~a~~~F~~i~~AyevLsD~~KR~~YD   66 (372)
T PRK14296         23 QAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQYD   66 (372)
T ss_pred             HHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHhcCHHHhhhhh
Confidence            455666677788865    445557799999999977765 444


No 68 
>PRK01844 hypothetical protein; Provisional
Probab=54.32  E-value=26  Score=17.23  Aligned_cols=36  Identities=14%  Similarity=0.072  Sum_probs=27.9

Q ss_pred             HHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHH
Q psy5907           6 YSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQ   41 (56)
Q Consensus         6 f~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~e   41 (56)
                      +.+..-.+.-++||.++..-|..+...+-+..|+..
T Consensus        24 ~ark~~~k~lk~NPpine~mir~Mm~QMGqkPSekk   59 (72)
T PRK01844         24 IARKYMMNYLQKNPPINEQMLKMMMMQMGQKPSQKK   59 (72)
T ss_pred             HHHHHHHHHHHHCCCCCHHHHHHHHHHhCCCccHHH
Confidence            334455566778999999999999999988877653


No 69 
>KOG1827|consensus
Probab=53.39  E-value=0.35  Score=32.34  Aligned_cols=41  Identities=27%  Similarity=0.414  Sum_probs=35.7

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHH
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQK   42 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK   42 (56)
                      +|++|+.+.+..+-.++|+..+++++.++|..|..++..-|
T Consensus       555 ~~~~~s~~~~~~~~~~np~v~~~~~~~~vg~~~~~lp~~~k  595 (629)
T KOG1827|consen  555 PYILDSIENRTIIWFENPTVGFGEVSIIVGNDWDKLPNINK  595 (629)
T ss_pred             cccccccccCceeeeeCCCcccceeEEeecCCcccCccccc
Confidence            57778888888888899999999999999999999995544


No 70 
>PF02650 HTH_WhiA:  WhiA C-terminal HTH domain;  InterPro: IPR023054  This domain is found at the C terminus of the sporulation regulator WhiA. It is predicted to form a DNA binding helix-turn-helix structure []. ; PDB: 3HYI_A.
Probab=52.88  E-value=12  Score=18.88  Aligned_cols=20  Identities=25%  Similarity=0.519  Sum_probs=13.8

Q ss_pred             HHHHhCCCCCHHHHHHHHHH
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQ   32 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~   32 (56)
                      .++.+||+++-.|++..+..
T Consensus        47 ~lRl~~Pd~SL~EL~~~~~~   66 (85)
T PF02650_consen   47 ELRLENPDASLKELGELLEP   66 (85)
T ss_dssp             HHHHH-TTS-HHHHHHTT--
T ss_pred             HHHHHCccccHHHHHHHHcC
Confidence            57788999999999887654


No 71 
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=51.67  E-value=24  Score=20.55  Aligned_cols=21  Identities=24%  Similarity=0.529  Sum_probs=17.1

Q ss_pred             HHHhcCCCHHHHHHHHHHHHH
Q psy5907          31 GQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        31 ~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      .+.|..||++.|.-..+.+.+
T Consensus       213 ~~~~~~L~~e~q~~i~~a~~~  233 (257)
T TIGR00787       213 KAFWKSLPPDLQAVVKEAAKE  233 (257)
T ss_pred             HHHHhcCCHHHHHHHHHHHHH
Confidence            467999999999988776655


No 72 
>COG1638 DctP TRAP-type C4-dicarboxylate transport system, periplasmic component [Carbohydrate transport and metabolism]
Probab=50.14  E-value=27  Score=21.66  Aligned_cols=24  Identities=25%  Similarity=0.522  Sum_probs=19.6

Q ss_pred             HHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907          31 GQMWRDLPEDQKTEYVEDYEQEKI   54 (56)
Q Consensus        31 ~~~W~~l~~~eK~~y~~~a~~~k~   54 (56)
                      ...|..||++.|.-..+.+.+.-.
T Consensus       244 ~~~w~~L~~e~q~il~~aa~e~~~  267 (332)
T COG1638         244 KAFWDSLPEEDQTILLEAAKEAAE  267 (332)
T ss_pred             HHHHhcCCHHHHHHHHHHHHHHHH
Confidence            467999999999999888877543


No 73 
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=49.87  E-value=60  Score=20.91  Aligned_cols=40  Identities=18%  Similarity=0.047  Sum_probs=29.0

Q ss_pred             HHHHHHHHHHhCCCCCH-HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           7 SRKVWDQVKAQNLDLKL-WEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         7 ~~~~~~~~~~~~p~~~~-~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      .+.+|...++-||+.+. .+.-+.|++.+..|++.+|. .|.
T Consensus        46 KkAYrkla~k~HPDk~~~~e~F~~i~~AYevLsD~~kR~~YD   87 (421)
T PTZ00037         46 KKAYRKLAIKHHPDKGGDPEKFKEISRAYEVLSDPEKRKIYD   87 (421)
T ss_pred             HHHHHHHHHHHCCCCCchHHHHHHHHHHHHHhccHHHHHHHh
Confidence            35667777788998763 46667899999999877655 444


No 74 
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=48.81  E-value=34  Score=16.81  Aligned_cols=34  Identities=21%  Similarity=0.149  Sum_probs=26.5

Q ss_pred             HHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHH
Q psy5907           7 SRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus         7 ~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~   40 (56)
                      ++..-.+.-.+||.++..-|.-+.+.+-..-|+.
T Consensus        25 ark~~~k~lk~NPpine~~iR~M~~qmGqKpSe~   58 (71)
T COG3763          25 ARKQMKKQLKDNPPINEEMIRMMMAQMGQKPSEK   58 (71)
T ss_pred             HHHHHHHHHhhCCCCCHHHHHHHHHHhCCCchHH
Confidence            3444455667899999999999999998887765


No 75 
>PRK14284 chaperone protein DnaJ; Provisional
Probab=48.56  E-value=53  Score=20.74  Aligned_cols=39  Identities=15%  Similarity=0.073  Sum_probs=27.9

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+.     .+.-+.|.+.+..|++.+|. .|.
T Consensus        20 kayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD   64 (391)
T PRK14284         20 KAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKRESYD   64 (391)
T ss_pred             HHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHhcCHHHHHHHH
Confidence            4566777777898753     45678899999998866654 444


No 76 
>PF11588 DUF3243:  Protein of unknown function (DUF3243);  InterPro: IPR021637 This family of proteins with unknown function includes uncharacterised proteins ymfJ and yflH. The family appears to be restricted to Firmicutes.; PDB: 3D0W_B.
Probab=48.11  E-value=22  Score=17.90  Aligned_cols=19  Identities=11%  Similarity=0.513  Sum_probs=14.2

Q ss_pred             HHHHHHHhcCCCHHHHHHH
Q psy5907          27 GKIIGQMWRDLPEDQKTEY   45 (56)
Q Consensus        27 ~k~l~~~W~~l~~~eK~~y   45 (56)
                      .+.|.+.|..-+++||..-
T Consensus        53 erlLkELW~va~e~Eq~~L   71 (81)
T PF11588_consen   53 ERLLKELWDVADEEEQHAL   71 (81)
T ss_dssp             HHHHHHHHHC--HHHHHHH
T ss_pred             HHHHHHHHHhCCHHHHHHH
Confidence            3789999999999999764


No 77 
>PRK14285 chaperone protein DnaJ; Provisional
Probab=47.44  E-value=53  Score=20.59  Aligned_cols=41  Identities=15%  Similarity=0.092  Sum_probs=28.7

Q ss_pred             HHHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHH-HHHHHH
Q psy5907           7 SRKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQ-KTEYVE   47 (56)
Q Consensus         7 ~~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~e-K~~y~~   47 (56)
                      .+.+|...+.-||+.+     ..+.-+.|.+.|..|++.+ |..|..
T Consensus        21 k~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yd~   67 (365)
T PRK14285         21 KKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRAQYDR   67 (365)
T ss_pred             HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHcCcchhHHHHh
Confidence            3566777778899875     2355678999999998765 445543


No 78 
>PLN02150 terpene synthase/cyclase family protein
Probab=47.13  E-value=11  Score=19.16  Aligned_cols=19  Identities=21%  Similarity=0.343  Sum_probs=15.2

Q ss_pred             CHHHHHHHHHHHhcCCCHH
Q psy5907          22 KLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus        22 ~~~ei~k~l~~~W~~l~~~   40 (56)
                      ...++..++.+.|+.|.++
T Consensus        24 A~~~i~~li~~~WK~iN~e   42 (96)
T PLN02150         24 AVSELKKMIRDNYKIVMEE   42 (96)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4678889999999986654


No 79 
>PF01630 Glyco_hydro_56:  Hyaluronidase;  InterPro: IPR018155 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 56 GH56 from CAZY comprises enzymes with only one known activity; hyaluronidase 3.2.1.35 from EC. The venom of Apis mellifera (Honeybee) contains several biologically-active peptides and two enzymes, one of which is a hyaluronidase []. The amino acid sequence of bee venom hyaluronidase contains 349 amino acids, and includes four cysteines and a number of potential glycosylation sites []. The sequence shows a high degree of similarity to PH-20, a membrane protein of mammalian sperm involved in sperm-egg adhesion, supporting the view that hyaluronidases play a role in fertilisation []. PH-20 is required for sperm adhesion to the egg zona pellucida; it is located on both the sperm plasma membrane and acrosomal membrane []. The amino acid sequence of the mature protein contains 468 amino acids, and includes six potential N-linked glycosylation sites and twelve cysteines, eight of which are tightly clustered near the C terminus [].; GO: 0004415 hyalurononglucosaminidase activity, 0005975 carbohydrate metabolic process; PDB: 1FCQ_A 1FCV_A 1FCU_A 2J88_A 2PE4_A 2ATM_A.
Probab=46.07  E-value=75  Score=20.07  Aligned_cols=32  Identities=19%  Similarity=0.162  Sum_probs=24.2

Q ss_pred             HhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhc
Q psy5907           4 MRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWR   35 (56)
Q Consensus         4 ~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~   35 (56)
                      .+|...-...++.+||+++..++.+.....+.
T Consensus       123 ~iYr~~S~~lv~~~hp~ws~~~v~~~A~~~FE  154 (337)
T PF01630_consen  123 DIYRNESIELVRQQHPDWSEKEVEKEAKKEFE  154 (337)
T ss_dssp             HHHHHHHHHHHHHHSTTS-HHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHH
Confidence            35677778889999999999999886665554


No 80 
>PRK14287 chaperone protein DnaJ; Provisional
Probab=45.50  E-value=76  Score=19.95  Aligned_cols=39  Identities=15%  Similarity=0.176  Sum_probs=27.1

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+    ..+.-+.|.+.+..|++.+|. .|.
T Consensus        23 ~ayr~la~~~HpD~~~~~~~~~~f~~i~~Ay~~L~d~~kR~~YD   66 (371)
T PRK14287         23 KAYRKLARKYHPDVNKAPDAEDKFKEVKEAYDTLSDPQKKAHYD   66 (371)
T ss_pred             HHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhCcHhHHHHHH
Confidence            455666667788865    345667899999999877654 444


No 81 
>cd08805 Death_ank1 Death domain of Ankyrin-1. Death Domain (DD) of the human protein ankyrin-1 (ANK-1) and related proteins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. ANK-1, also called ankyrin-R (for restricted), is found in brain, muscle, and erythrocytes and is thought to function in linking integral membrane proteins to the underlying cytoskeleton. It plays a critical nonredundant role in erythroid development and is associated with hereditary spherocytosis (HS), a common disorder of the red cell membrane. The small alternatively-spliced variant, sANK-1, found in striated muscle and concentrated in the sarcoplasmic reticulum (SR) binds obscurin and titin, which facilitates the anchoring of the network SR to the contractile apparatus. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common featur
Probab=45.38  E-value=6.2  Score=19.78  Aligned_cols=21  Identities=24%  Similarity=0.605  Sum_probs=15.7

Q ss_pred             CCCHHHHHHHHHHHhcCCCHH
Q psy5907          20 DLKLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus        20 ~~~~~ei~k~l~~~W~~l~~~   40 (56)
                      ++....|+..||..|..|..+
T Consensus         4 ~~~l~~Ia~~LG~dW~~Lar~   24 (84)
T cd08805           4 EMKMAVIREHLGLSWAELARE   24 (84)
T ss_pred             hhHHHHHHHHhcchHHHHHHH
Confidence            456678888899999876544


No 82 
>PRK14276 chaperone protein DnaJ; Provisional
Probab=45.10  E-value=78  Score=19.95  Aligned_cols=39  Identities=21%  Similarity=0.198  Sum_probs=27.7

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+    ..+.-+.|.+.|..|++.+|. .|.
T Consensus        23 ~ayr~la~~~HpD~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD   66 (380)
T PRK14276         23 KAYRKLSKKYHPDINKEPGAEEKYKEVQEAYETLSDPQKRAAYD   66 (380)
T ss_pred             HHHHHHHHHHCcCCCCCcCHHHHHHHHHHHHHHhcCHhhhhhHh
Confidence            455666667788754    457778899999988877754 444


No 83 
>KOG0713|consensus
Probab=45.07  E-value=79  Score=20.05  Aligned_cols=40  Identities=15%  Similarity=0.048  Sum_probs=27.7

Q ss_pred             HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHHHHHHH
Q psy5907           8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQKTEYVE   47 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK~~y~~   47 (56)
                      +..|....+-||+.+     ..+.=+.|+..|.-|++.+|..-.+
T Consensus        35 kAYRKLALk~HPDkNpddp~A~e~F~~in~AYEVLsDpekRk~YD   79 (336)
T KOG0713|consen   35 KAYRKLALKYHPDKNPDDPNANEKFKEINAAYEVLSDPEKRKHYD   79 (336)
T ss_pred             HHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHHHHhcCHHHHHHHH
Confidence            445666667788654     5566678999999998777655443


No 84 
>PRK09897 hypothetical protein; Provisional
Probab=44.46  E-value=53  Score=21.78  Aligned_cols=29  Identities=7%  Similarity=0.272  Sum_probs=22.7

Q ss_pred             HHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          23 LWEIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      ...+...+..-|..||.++|+.|......
T Consensus       353 i~~l~~~~~~iw~~l~~~d~~rf~~~l~~  381 (534)
T PRK09897        353 ILRLHEAVQEIVPHLNEHDHKRFSKGLAR  381 (534)
T ss_pred             HHHHHHHHHHHHHhCCHHHHHHHHHhhhH
Confidence            34455677899999999999999876543


No 85 
>PRK14294 chaperone protein DnaJ; Provisional
Probab=43.73  E-value=80  Score=19.75  Aligned_cols=39  Identities=15%  Similarity=0.073  Sum_probs=27.4

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+.     .+.-+.+.+.|..|++.++. .|.
T Consensus        23 ~ayr~la~~~HPD~~~~~~~~~~~f~~~~~Ay~vL~d~~~r~~yD   67 (366)
T PRK14294         23 KSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRGIYD   67 (366)
T ss_pred             HHHHHHHHHHCCCCCCCchHHHHHHHHHHHHHHHhccHHHHHHHH
Confidence            4566666777888753     46667899999999876654 444


No 86 
>PF00226 DnaJ:  DnaJ domain;  InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation:  +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+   It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=42.98  E-value=33  Score=15.41  Aligned_cols=38  Identities=16%  Similarity=0.094  Sum_probs=25.7

Q ss_pred             HHHHHHHHHhCCCCC--H----HHHHHHHHHHhcCCCHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK--L----WEIGKIIGQMWRDLPEDQK-TEY   45 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~--~----~ei~k~l~~~W~~l~~~eK-~~y   45 (56)
                      +..+..++.-||+..  .    .+....|.+.|..|+...+ ..|
T Consensus        19 ~~y~~l~~~~HPD~~~~~~~~~~~~~~~i~~Ay~~L~~~~~R~~Y   63 (64)
T PF00226_consen   19 KAYRRLSKQYHPDKNSGDEAEAEEKFARINEAYEILSDPERRRRY   63 (64)
T ss_dssp             HHHHHHHHHTSTTTGTSTHHHHHHHHHHHHHHHHHHHSHHHHHHH
T ss_pred             HHHHhhhhccccccchhhhhhhhHHHHHHHHHHHHhCCHHHHHhc
Confidence            455666777788762  2    3778889999998765554 444


No 87 
>cd01670 Death Death Domain: a protein-protein interaction domain. Death Domains (DDs) are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. Structural analysis of DD-DD complexes show that the domains interact with each other in many different ways. DD-containing proteins serve as adaptors in signaling pathways and they can recruit other proteins into signaling complexes. In mammals, they are prominent components of the programmed cell death (apoptosis) pathway and are found in a number of other signaling pathways. In invertebrates, they are involved in transcriptional regulation of zygotic patterning genes in insect embryogenesis, and are components of the ToII/NF-kappaB pathway, a conserved innate immune pathway in a
Probab=42.69  E-value=14  Score=17.39  Aligned_cols=17  Identities=24%  Similarity=0.607  Sum_probs=12.0

Q ss_pred             HHHHHHHHHHhcCCCHH
Q psy5907          24 WEIGKIIGQMWRDLPED   40 (56)
Q Consensus        24 ~ei~k~l~~~W~~l~~~   40 (56)
                      ..|+..+|..|+.|...
T Consensus         3 ~~ia~~lg~~W~~la~~   19 (79)
T cd01670           3 DKLAKKLGKDWKKLARK   19 (79)
T ss_pred             HHHHHHHhhHHHHHHHH
Confidence            46778888888766543


No 88 
>PF08373 RAP:  RAP domain;  InterPro: IPR013584 The ~60-residue RAP (an acronym for RNA-binding domain abundant in Apicomplexans) domain is found in various proteins in eukaryotes. It is particularly abundant in apicomplexans and might mediate a range of cellular functions through its potential interactions with RNA []. The RAP domain consists of multiple blocks of charged and aromatics residues and is predicted to be composed of alpha helical and beta strand structures. Two predicted loop regions that are dominated by glycine and tryptophan residues are found before and after the central beta sheet []. Some proteins known to contain a RAP domain are listed below:   Human hypothetical protein MGC5297,  Mammalian FAST kinase domain-containing proteins (FASTKDs),   Chlamydomonas reinhardtii chloroplastic trans-splicing factor Raa3. 
Probab=42.64  E-value=22  Score=15.82  Aligned_cols=15  Identities=33%  Similarity=0.871  Sum_probs=11.8

Q ss_pred             HhcCC-CHHHHHHHHH
Q psy5907          33 MWRDL-PEDQKTEYVE   47 (56)
Q Consensus        33 ~W~~l-~~~eK~~y~~   47 (56)
                      .|..| +.++|..|..
T Consensus        41 eW~~l~~~~~k~~YL~   56 (58)
T PF08373_consen   41 EWNKLKSREEKIEYLK   56 (58)
T ss_pred             HHHhcCCHHHHHHHHh
Confidence            48888 8888888764


No 89 
>PRK14298 chaperone protein DnaJ; Provisional
Probab=42.46  E-value=76  Score=20.00  Aligned_cols=37  Identities=16%  Similarity=0.058  Sum_probs=25.6

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKTE   44 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~~   44 (56)
                      +.+|...+.-||+.+    ..+.-+.|.+.|..|++.+|..
T Consensus        24 ~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vL~d~~kR~   64 (377)
T PRK14298         24 KAYRKLAMKYHPDKNKEPDAEEKFKEISEAYAVLSDAEKRA   64 (377)
T ss_pred             HHHHHHHHHhCccccCChhHHHHHHHHHHHHHHhcchHhhh
Confidence            445666667788875    3345577999999888877543


No 90 
>PF03480 SBP_bac_7:  Bacterial extracellular solute-binding protein, family 7;  InterPro: IPR018389 This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein DctP [, ] and the sialic acid-binding protein SiaP. The structure of the SiaP receptor has revealed an overall topology similar to ATP binding cassette ESR (extracytoplasmic solute receptors) proteins []. Upon binding of sialic acid, SiaP undergoes domain closure about a hinge region and kinking of an alpha-helix hinge component [].; GO: 0006810 transport, 0030288 outer membrane-bounded periplasmic space; PDB: 2HZK_C 2HZL_B 2HPG_C 2XWI_A 2XWK_A 2WX9_A 2CEY_A 2WYP_A 3B50_A 2CEX_B ....
Probab=42.43  E-value=28  Score=20.53  Aligned_cols=20  Identities=15%  Similarity=0.589  Sum_probs=15.3

Q ss_pred             HHhcCCCHHHHHHHHHHHHH
Q psy5907          32 QMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        32 ~~W~~l~~~eK~~y~~~a~~   51 (56)
                      +.|..||++.|+-..+.+.+
T Consensus       214 ~~w~~L~~e~q~~l~~~~~~  233 (286)
T PF03480_consen  214 DWWDSLPDEDQEALDDAADE  233 (286)
T ss_dssp             HHHHHS-HHHHHHHHHHHHH
T ss_pred             HHHhcCCHHHHHHHHHHHHH
Confidence            56999999999888776655


No 91 
>PF02936 COX4:  Cytochrome c oxidase subunit IV;  InterPro: IPR004203 Cytochrome c oxidase, a 13 sub-unit complex (1.9.3.1 from EC) is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium discoideum (Slime mould) member of this family is called COX VI. The Saccharomyces cerevisiae protein YGX6_YEAST appears to be the yeast COX IV subunit.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3ABK_Q 3AG1_Q 3ASN_Q 1OCZ_D 2EIN_Q 2OCC_D 2YBB_O 3AG3_D 1OCO_Q 1V55_Q ....
Probab=41.89  E-value=15  Score=20.18  Aligned_cols=22  Identities=23%  Similarity=0.347  Sum_probs=12.7

Q ss_pred             HHHHHHHHH----HHhcCCCHHHHHH
Q psy5907          23 LWEIGKIIG----QMWRDLPEDQKTE   44 (56)
Q Consensus        23 ~~ei~k~l~----~~W~~l~~~eK~~   44 (56)
                      .+++.+.|.    .-|+.||.+||.-
T Consensus        29 ~~~~~~~LkeKekg~Wk~LS~eEKka   54 (142)
T PF02936_consen   29 QTEIIEALKEKEKGDWKKLSLEEKKA   54 (142)
T ss_dssp             --HHHHHHHHHTTS-GGGS-HHHHHH
T ss_pred             hHHHHHHHHHHHhCCHhhCCHHHHHH
Confidence            345555555    4599999998864


No 92 
>PRK05439 pantothenate kinase; Provisional
Probab=41.18  E-value=20  Score=22.16  Aligned_cols=49  Identities=20%  Similarity=0.437  Sum_probs=30.3

Q ss_pred             ChHHhHHHHHHHHHHHhCC-CCCHHHHHHHHHHHhcCCC-HHHHHHHHHHHH
Q psy5907           1 MPYMRYSRKVWDQVKAQNL-DLKLWEIGKIIGQMWRDLP-EDQKTEYVEDYE   50 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~p-~~~~~ei~k~l~~~W~~l~-~~eK~~y~~~a~   50 (56)
                      ++|..|..+.|..+....| .++..|+.++-| .-..++ ++-...|..++.
T Consensus         8 ~~~~~~~r~~w~~l~~~~~~~l~~~~~~~l~~-~~~~~~~~~v~~iy~plar   58 (311)
T PRK05439          8 SPYLEFSREQWAALRDSTPLTLTEEELERLRG-LNDPISLEEVAEIYLPLSR   58 (311)
T ss_pred             CCceeECHHHHHHHHhcCCCCCCHHHHHHhhc-CCCCCCHHHHHHHHHHHHH
Confidence            4788899999999987776 567777766433 222222 333445544443


No 93 
>PF05914 RIB43A:  RIB43A;  InterPro: IPR008805 This family consists of several RIB43A-like eukaryotic proteins. Ciliary and flagellar microtubules contain a specialised set of protofilaments, termed ribbons, that are composed of tubulin and several associated proteins. RIB43A was first characterised in the unicellular biflagellate, Chlamydomonas reinhardtii although highly related sequences are present in several higher eukaryotes including humans. The function of this protein is unknown although the structure of RIB43A and its association with the specialised protofilament ribbons and with basal bodies is relevant to the proposed role of ribbons in forming and stabilising doublet and triplet microtubules and in organising their three-dimensional structure. Human RIB43A homologues could represent a structural requirement in centriole replication in dividing cells [].
Probab=40.84  E-value=4.9  Score=25.41  Aligned_cols=34  Identities=15%  Similarity=0.344  Sum_probs=23.9

Q ss_pred             HHHhCCCCC--HHHHHHHHHHHhcCCCHHHHHHHHH
Q psy5907          14 VKAQNLDLK--LWEIGKIIGQMWRDLPEDQKTEYVE   47 (56)
Q Consensus        14 ~~~~~p~~~--~~ei~k~l~~~W~~l~~~eK~~y~~   47 (56)
                      +..++|...  ...-.+.+...|+.||+++...+..
T Consensus       252 lLtEnp~~a~s~~gp~Rv~~d~wKGMs~eQl~~i~~  287 (379)
T PF05914_consen  252 LLTENPQVAQSSFGPHRVIPDRWKGMSPEQLEEIRK  287 (379)
T ss_pred             cccCCchhccccCCCCCCCCcccCCCCHHHHHHHHH
Confidence            335667653  2233567889999999999988764


No 94 
>PRK14292 chaperone protein DnaJ; Provisional
Probab=40.74  E-value=89  Score=19.55  Aligned_cols=39  Identities=15%  Similarity=0.058  Sum_probs=27.3

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHH-HHHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQK-TEYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK-~~y~   46 (56)
                      +.++...+.-||+..    ..+.-+.|.+....|++.++ ..|.
T Consensus        21 ~ayr~l~~~~hpD~~~~~~a~~~~~~i~~Ay~vL~d~~~r~~yd   64 (371)
T PRK14292         21 SAYRKLALKYHPDRNKEKGAAEKFAQINEAYAVLSDAEKRAHYD   64 (371)
T ss_pred             HHHHHHHHHHCCCCCCChhHHHHHHHHHHHHHHhcchhhhhhHh
Confidence            456667777788865    45666788899998886654 4444


No 95 
>PF09829 DUF2057:  Uncharacterized protein conserved in bacteria (DUF2057);  InterPro: IPR018635 The proteins in this entry are functionally uncharacterised.
Probab=40.01  E-value=39  Score=18.97  Aligned_cols=24  Identities=4%  Similarity=0.178  Sum_probs=20.4

Q ss_pred             HHHHHHHHHHHhcCCCHHHHHHHH
Q psy5907          23 LWEIGKIIGQMWRDLPEDQKTEYV   46 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~~eK~~y~   46 (56)
                      ..+...+|-..|...++++|+.|.
T Consensus       164 ~~~~~~~Lq~wy~qAs~~eRk~F~  187 (189)
T PF09829_consen  164 ESEALQMLQYWYLQASKEERKAFL  187 (189)
T ss_pred             cccHHHHHHHHHHhCCHHHHHHHh
Confidence            456678899999999999999875


No 96 
>smart00351 PAX Paired Box domain.
Probab=39.40  E-value=58  Score=17.15  Aligned_cols=20  Identities=25%  Similarity=0.366  Sum_probs=16.6

Q ss_pred             HHHHhCCCCCHHHHHHHHHH
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQ   32 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~   32 (56)
                      .+..+||+++..||...|.+
T Consensus        86 ~~~~~~p~~t~~el~~~L~~  105 (125)
T smart00351       86 DYKQENPGIFAWEIRDRLLS  105 (125)
T ss_pred             HHHHHCCCCCHHHHHHHHHH
Confidence            35678999999999988875


No 97 
>cd08318 Death_NMPP84 Death domain of Nuclear Matrix Protein P84. Death domain (DD) found in the Nuclear Matrix Protein P84 (also known as HPR1 or THOC1). HPR1/p84 resides in the nuclear matrix and is part of the THO complex, also called TREX (transcription/export) complex, which functions in mRNP biogenesis at the interface between transcription and export of mRNA from the nucleus. Mice lacking THOC1 have abnormal testis development and are sterile. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.
Probab=39.06  E-value=17  Score=17.99  Aligned_cols=16  Identities=19%  Similarity=0.561  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHhcCCC
Q psy5907          23 LWEIGKIIGQMWRDLP   38 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~   38 (56)
                      ...|+..||..|+.|.
T Consensus        10 l~~ia~~iG~~Wk~La   25 (86)
T cd08318          10 ITVFANKLGEDWKTLA   25 (86)
T ss_pred             HHHHHHHHhhhHHHHH
Confidence            5668888999998664


No 98 
>PRK14295 chaperone protein DnaJ; Provisional
Probab=38.58  E-value=1e+02  Score=19.52  Aligned_cols=40  Identities=23%  Similarity=0.223  Sum_probs=28.1

Q ss_pred             HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHHH-HHHH
Q psy5907           8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQKT-EYVE   47 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK~-~y~~   47 (56)
                      +.+|...+.-||+.+     ..+.-+.|.+.|..|++.+|. .|..
T Consensus        28 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD~   73 (389)
T PRK14295         28 KAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRKEYDE   73 (389)
T ss_pred             HHHHHHHHHHCCCcCCCchhHHHHHHHHHHHHHHHCchhhHHHHHH
Confidence            455666677788864     346677899999988877654 4543


No 99 
>PRK14290 chaperone protein DnaJ; Provisional
Probab=38.50  E-value=83  Score=19.69  Aligned_cols=36  Identities=25%  Similarity=0.184  Sum_probs=25.9

Q ss_pred             HHHHHHHHHhCCCCCH------HHHHHHHHHHhcCCCHHHHH
Q psy5907           8 RKVWDQVKAQNLDLKL------WEIGKIIGQMWRDLPEDQKT   43 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~------~ei~k~l~~~W~~l~~~eK~   43 (56)
                      +.+|...++-||+.+.      .+.-+.|.+.|..|++..+.
T Consensus        22 ~ayr~la~~~HPD~~~~~~~~a~~~f~~i~~Ay~~L~d~~~r   63 (365)
T PRK14290         22 KAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKR   63 (365)
T ss_pred             HHHHHHHHHHCcCCCCCchhHHHHHHHHHHHHHHHhcChhhh
Confidence            4566677777888753      35557799999998887754


No 100
>cd08779 Death_PIDD Death Domain of p53-induced protein with a death domain. Death domain (DD) found in PIDD (p53-induced protein with a death domain) and similar proteins. PIDD is a component of the PIDDosome complex, which is an oligomeric caspase-activating complex involved in caspase-2 activation and plays a role in mediating stress-induced apoptosis. The PIDDosome complex is composed of three components, PIDD, RAIDD and caspase-2, which interact through their DDs and DD-like domains. The DD of PIDD interacts with the DD of RAIDD, which also contains a Caspase Activation and Recruitment Domain (CARD) that interacts with the caspase-2 CARD. Autoproteolysis of PIDD determines the downstream signaling event, between pro-survival NF-kB or pro-death caspase-2 activation. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members 
Probab=38.42  E-value=15  Score=18.29  Aligned_cols=17  Identities=24%  Similarity=0.454  Sum_probs=13.0

Q ss_pred             HHHHHHHHHHHhcCCCH
Q psy5907          23 LWEIGKIIGQMWRDLPE   39 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~   39 (56)
                      ...|+..||..|+.+..
T Consensus         5 l~~ia~~LG~~Wk~lar   21 (86)
T cd08779           5 LLSIAGRLGLDWQAIGL   21 (86)
T ss_pred             HHHHHHHHhHHHHHHHH
Confidence            45788899999987653


No 101
>PF14098 SSPI:  Small, acid-soluble spore protein I
Probab=38.35  E-value=49  Score=15.94  Aligned_cols=17  Identities=29%  Similarity=0.667  Sum_probs=13.9

Q ss_pred             HHhcCCCHHHHHHHHHH
Q psy5907          32 QMWRDLPEDQKTEYVED   48 (56)
Q Consensus        32 ~~W~~l~~~eK~~y~~~   48 (56)
                      -.|+.+++++|....+.
T Consensus        44 ~~W~~~~~~ek~~m~~~   60 (65)
T PF14098_consen   44 VIWKNSDESEKQEMVNT   60 (65)
T ss_pred             HHHHhCCHHHHHHHHHH
Confidence            46999999999887654


No 102
>COG4735 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.03  E-value=81  Score=18.67  Aligned_cols=33  Identities=15%  Similarity=0.213  Sum_probs=26.5

Q ss_pred             CCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          19 LDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      .+.++...-+..|..|+.++.+++....-.+.+
T Consensus        35 gg~s~anflr~~G~lyk~il~d~~~~Lk~k~~k   67 (211)
T COG4735          35 GGLSAANFLRDLGQLYKKILCDEKDKLKVKVQK   67 (211)
T ss_pred             ccHHHHHHHHHHHHHHHHcCHHHHHHHHHHHHH
Confidence            467788888889999999999998877665554


No 103
>PF01710 HTH_Tnp_IS630:  Transposase;  InterPro: IPR002622 Transposase proteins are necessary for efficient DNA transposition. This entry includes insertion sequences from Synechocystis sp. (strain PCC 6803) three of which are characterised as homologous to bacterial IS5- and IS4- and to several members of the IS630-Tc1-mariner superfamily []. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=37.98  E-value=48  Score=17.26  Aligned_cols=20  Identities=25%  Similarity=0.413  Sum_probs=14.7

Q ss_pred             HHHHHhCCCCCHHHHHHHHH
Q psy5907          12 DQVKAQNLDLKLWEIGKIIG   31 (56)
Q Consensus        12 ~~~~~~~p~~~~~ei~k~l~   31 (56)
                      ..+...+|+++..|+++.+|
T Consensus        63 ~~~v~~~pd~tl~Ela~~l~   82 (119)
T PF01710_consen   63 KALVEENPDATLRELAERLG   82 (119)
T ss_pred             HHHHHHCCCcCHHHHHHHcC
Confidence            44556789999999887643


No 104
>PRK14278 chaperone protein DnaJ; Provisional
Probab=37.22  E-value=1.1e+02  Score=19.34  Aligned_cols=39  Identities=18%  Similarity=0.079  Sum_probs=26.9

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+    ..+.-+.|++.|..|++.+|. .|.
T Consensus        22 ~ayr~la~~~hpD~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~YD   65 (378)
T PRK14278         22 RAYRKLARELHPDVNPDEEAQEKFKEISVAYEVLSDPEKRRIVD   65 (378)
T ss_pred             HHHHHHHHHHCCCCCCcHHHHHHHHHHHHHHHHhchhhhhhhhh
Confidence            456667777788875    335556799999999766544 444


No 105
>PF11616 EZH2_WD-Binding:  WD repeat binding protein EZH2;  InterPro: IPR021654  This family of proteins represents Enhancer of zest homologue 2, (EZH2) a 30 residue peptide which binds to a WD-repeat domain of EED by residues 39-68. EED is a component of PRC2 complex which is involved in gene expression []. This interaction is required for the HMTase activity of PCR2 []. ; GO: 0018024 histone-lysine N-methyltransferase activity; PDB: 2QXV_B.
Probab=37.21  E-value=32  Score=14.05  Aligned_cols=14  Identities=29%  Similarity=0.624  Sum_probs=10.0

Q ss_pred             HHHHHHHHHHhcCC
Q psy5907          24 WEIGKIIGQMWRDL   37 (56)
Q Consensus        24 ~ei~k~l~~~W~~l   37 (56)
                      .|-+.++.+.|+.+
T Consensus        12 ~e~t~iLN~eWk~l   25 (30)
T PF11616_consen   12 QERTDILNEEWKKL   25 (30)
T ss_dssp             HHHHHHHHHHHHH-
T ss_pred             HHHHHHHHHHHHHh
Confidence            45677888999865


No 106
>PF06242 DUF1013:  Protein of unknown function (DUF1013);  InterPro: IPR010421 This is a family of uncharacterised proteins found in Proteobacteria.
Probab=36.61  E-value=33  Score=19.01  Aligned_cols=18  Identities=28%  Similarity=0.362  Sum_probs=15.2

Q ss_pred             HHhCCCCCHHHHHHHHHH
Q psy5907          15 KAQNLDLKLWEIGKIIGQ   32 (56)
Q Consensus        15 ~~~~p~~~~~ei~k~l~~   32 (56)
                      ...||.++-+.|++.+|.
T Consensus        78 vk~hPel~DaQI~kLiGT   95 (140)
T PF06242_consen   78 VKNHPELSDAQIAKLIGT   95 (140)
T ss_pred             HhcCCcccHHHHHHHhcC
Confidence            356899999999999884


No 107
>PF11334 DUF3136:  Protein of unknown function (DUF3136);  InterPro: IPR021483  This family of proteins with unknown function appear to be restricted to Cyanobacteria. 
Probab=36.47  E-value=55  Score=15.74  Aligned_cols=30  Identities=20%  Similarity=0.274  Sum_probs=22.2

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcC
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRD   36 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~   36 (56)
                      .|.+||+..+-.+.   -+.+..+|.+.+  .|.-
T Consensus         9 ~y~~YckALr~Lv~---~G~~~~~i~rTv--CW~r   38 (64)
T PF11334_consen    9 GYPLYCKALRRLVA---DGRSEEEIRRTV--CWDR   38 (64)
T ss_pred             CchHHHHHHHHHHH---cCCCHHHHHHHH--HHHH
Confidence            37899999999887   477888887654  4643


No 108
>cd00922 Cyt_c_Oxidase_IV Cytochrome c oxidase subunit IV. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit IV is the largest of the nuclear-encoded subunits. It binds ATP at the matrix side, leading to an allosteric inhibition of enzyme activity at high intramitochondrial ATP/ADP ratios. In mammals, subunit IV has a lung-specific isoform and a ubiquitously expressed isoform.
Probab=36.47  E-value=19  Score=19.60  Aligned_cols=14  Identities=21%  Similarity=0.541  Sum_probs=11.3

Q ss_pred             HhcCCCHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYV   46 (56)
Q Consensus        33 ~W~~l~~~eK~~y~   46 (56)
                      -|+.||.+||.-..
T Consensus        43 dW~~LT~~EKkAlY   56 (136)
T cd00922          43 DWKQLTLEEKKALY   56 (136)
T ss_pred             CHhhCCHHHHhhHh
Confidence            49999999997644


No 109
>PRK14286 chaperone protein DnaJ; Provisional
Probab=36.37  E-value=1.1e+02  Score=19.23  Aligned_cols=40  Identities=15%  Similarity=0.006  Sum_probs=28.3

Q ss_pred             HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHH-HHHHH
Q psy5907           8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQK-TEYVE   47 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK-~~y~~   47 (56)
                      +.+|...+.-||+.+     ..+.-+.|.+.+..|++.+| ..|..
T Consensus        23 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~   68 (372)
T PRK14286         23 SAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDPKKRQAYDQ   68 (372)
T ss_pred             HHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHhccHHHHHHHHH
Confidence            456677777889875     24566789999999987664 45553


No 110
>COG2938 Uncharacterized conserved protein [Function unknown]
Probab=36.27  E-value=66  Score=16.59  Aligned_cols=24  Identities=13%  Similarity=0.223  Sum_probs=18.6

Q ss_pred             HHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          28 KIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        28 k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      .-....|..||++++..|..+-+.
T Consensus        35 ~Fae~~~~~lsd~el~~f~~LLe~   58 (94)
T COG2938          35 PFAEKEFDSLSDEELDEFERLLEC   58 (94)
T ss_pred             HHHHHHHhhCCHHHHHHHHHHHcC
Confidence            334578899999999999877543


No 111
>PF03810 IBN_N:  Importin-beta N-terminal domain;  InterPro: IPR001494 Karyopherins are a group of proteins involved in transporting molecules through the pores of the nuclear envelope. Karyopherins, which may act as importins or exportins, are part of the Importin-beta super-family, which all share a similar three-dimensional structure. Members of the importin-beta (karyopherin-beta) family can bind and transport cargo by themselves, or can form heterodimers with importin-alpha. As part of a heterodimer, importin-beta mediates interactions with the pore complex, while importin-alpha acts as an adaptor protein to bind the nuclear localisation signal (NLS) on the cargo through the classical NLS import of proteins. Importin-beta is a helicoidal molecule constructed from 19 HEAT repeats. Many nuclear pore proteins contain FG sequence repeats that can bind to HEAT repeats within importins [, ], which is important for importin-beta mediated transport. Ran GTPase helps to control the unidirectional transfer of cargo. The cytoplasm contains primarily RanGDP and the nucleus RanGTP through the actions of RanGAP and RanGEF, respectively. In the nucleus, RanGTP binds to importin-beta within the importin/cargo complex, causing a conformational change in importin-beta that releases it from importin-alpha-bound cargo. As a result, the N-terminal auto-inhibitory region on importin-alpha is free to loop back and bind to the major NLS-binding site, causing the cargo to be released []. There are additional release factors as well. This entry represents the N-terminal domain of karyopherins that is important for the binding of the Ran protein []. More information about these proteins can be found at Protein of the Month: Importins [].; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport; PDB: 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D 1IBR_D 1QGR_A 3LWW_A 1F59_A 2Q5D_A ....
Probab=35.34  E-value=52  Score=15.11  Aligned_cols=23  Identities=26%  Similarity=0.606  Sum_probs=16.5

Q ss_pred             HHHHHHHHHhc--------CCCHHHHHHHHH
Q psy5907          25 EIGKIIGQMWR--------DLPEDQKTEYVE   47 (56)
Q Consensus        25 ei~k~l~~~W~--------~l~~~eK~~y~~   47 (56)
                      -+-..|...|.        .+|+++|..-..
T Consensus        39 ~LKn~I~~~W~~~~~~~~~~~~~~~k~~Ik~   69 (77)
T PF03810_consen   39 LLKNLIKKNWSPSKQKGWSQLPEEEKEQIKS   69 (77)
T ss_dssp             HHHHHHHHSGGHHHHHHHHGSSHHHHHHHHH
T ss_pred             HHHHHHHHcCchhhccCCCCCCHHHHHHHHH
Confidence            34455777788        999999976543


No 112
>PF11903 DUF3423:  Protein of unknown function (DUF3423);  InterPro: IPR021831  This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 73 to 118 amino acids in length. 
Probab=35.20  E-value=60  Score=15.83  Aligned_cols=20  Identities=20%  Similarity=0.174  Sum_probs=15.4

Q ss_pred             HHHHhCCCCCHHHHHHHHHH
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQ   32 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~   32 (56)
                      .+...||+++...|...+-.
T Consensus        37 r~~E~np~l~~~~I~~~l~~   56 (72)
T PF11903_consen   37 RAAEDNPDLSYSFIEEMLLA   56 (72)
T ss_pred             HHHHHCCCCCHHHHHHHHHH
Confidence            44578999999999887643


No 113
>PRK14288 chaperone protein DnaJ; Provisional
Probab=34.90  E-value=1.2e+02  Score=19.10  Aligned_cols=39  Identities=18%  Similarity=0.138  Sum_probs=27.7

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+.     .+.-+.|.+.+..|++.+|. .|.
T Consensus        22 kayrkla~k~HPD~~~~~~~a~~~f~~i~~AYevLsd~~kR~~YD   66 (369)
T PRK14288         22 KSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEKKRALYD   66 (369)
T ss_pred             HHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHHhccHHHHHHHH
Confidence            4567777788898753     34556799999999876654 454


No 114
>PF09556 RE_HaeIII:  HaeIII restriction endonuclease;  InterPro: IPR019059 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below:   Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA.   Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone [].   This entry represents the restriction endonuclease HaeIII, which recognises and cleaves the double-stranded sequence GG^CC. 
Probab=34.89  E-value=16  Score=22.59  Aligned_cols=28  Identities=29%  Similarity=0.548  Sum_probs=21.0

Q ss_pred             HHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          24 WEIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        24 ~ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      -+.-+.-+..|+.++..+-..|..+-..
T Consensus       150 L~~~k~~~~~W~~l~nKe~~vYiPLL~A  177 (300)
T PF09556_consen  150 LEEEKKEGTKWRELPNKEEDVYIPLLKA  177 (300)
T ss_pred             HHHHHHcCcchhhcccccccchHHHHHH
Confidence            3444556679999999999999876543


No 115
>cd07911 RNRR2_Rv0233_like Ribonucleotide Reductase R2-like protein, Mn/Fe-binding domain. Rv0233 is a Mycobacterium tuberculosis ribonucleotide reductase R2 protein with a  heterodinuclear manganese/iron-carboxylate cofactor located in its metal center. The Rv0233-like family may represent a structural/functional counterpart of the evolutionary ancestor of the RNRR2's (Ribonucleotide Reductase, R2/beta subunit) and the bacterial multicomponent monooxygenases.  RNRR2s belong to a broad superfamily of ferritin-like diiron-carboxylate proteins. The RNR protein catalyzes the conversion of ribonucleotides to deoxyribonucleotides and is found in prokaryotes and archaea. The catalytically active form of RNR is a proposed alpha2-beta2 tetramer. The homodimeric alpha subunit (R1) contains the active site and redox active cysteines as well as the allosteric binding sites.
Probab=34.79  E-value=41  Score=20.06  Aligned_cols=16  Identities=19%  Similarity=0.435  Sum_probs=13.7

Q ss_pred             HhcCCCHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVED   48 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~   48 (56)
                      .|+.|++.||..|...
T Consensus        35 ~w~~L~~~Er~~~~~~   50 (280)
T cd07911          35 DWEQLSEEERDLALRL   50 (280)
T ss_pred             HHHhCCHHHHHHHHHH
Confidence            5999999999988754


No 116
>PRK09731 putative general secretion pathway protein YghD; Provisional
Probab=34.17  E-value=95  Score=17.80  Aligned_cols=34  Identities=15%  Similarity=0.199  Sum_probs=23.3

Q ss_pred             HHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHH
Q psy5907           3 YMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTE   44 (56)
Q Consensus         3 y~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~   44 (56)
                      |+.|++..|..      .++.+|  .-++..|..+++-|+..
T Consensus         6 ~~~~~~~~~~~------~~~~~~--~~~~~~W~~ls~REq~l   39 (178)
T PRK09731          6 FIHYFQQWRER------QLSRGE--HWLAQHLAGRSPREKGM   39 (178)
T ss_pred             HHHHHHHHHHH------Hhcchh--hHHHHHHccCCHHHHHH
Confidence            67777766654      233333  36888999999998864


No 117
>KOG0715|consensus
Probab=33.87  E-value=1.1e+02  Score=18.62  Aligned_cols=37  Identities=16%  Similarity=0.127  Sum_probs=26.8

Q ss_pred             HHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHHHH
Q psy5907           9 KVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKTEY   45 (56)
Q Consensus         9 ~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~~y   45 (56)
                      ..+...+.-||+.+    ..+.-..|.+.+..|++++|..=
T Consensus        63 Af~~LaKkyHPD~n~~~~a~~kF~eI~~AYEiLsd~eKR~~  103 (288)
T KOG0715|consen   63 AFRKLAKKYHPDVNKDKEASKKFKEISEAYEILSDEEKRQE  103 (288)
T ss_pred             HHHHHHHhhCCCCCCCcchhhHHHHHHHHHHHhcCHHHHHH
Confidence            45566677788764    66677788999998887777543


No 118
>PRK14297 chaperone protein DnaJ; Provisional
Probab=33.49  E-value=1e+02  Score=19.45  Aligned_cols=39  Identities=21%  Similarity=0.141  Sum_probs=27.4

Q ss_pred             HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+     ..+.-+.|.+.|..|++.+|. .|.
T Consensus        23 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD   67 (380)
T PRK14297         23 KAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKAQYD   67 (380)
T ss_pred             HHHHHHHHHHCcCCCCCcHHHHHHHHHHHHHHHHhcCHhhhCchh
Confidence            455666677788875     346678899999998886644 444


No 119
>PRK14289 chaperone protein DnaJ; Provisional
Probab=33.47  E-value=1.3e+02  Score=19.02  Aligned_cols=40  Identities=15%  Similarity=0.086  Sum_probs=27.5

Q ss_pred             HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQKTE-YVE   47 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK~~-y~~   47 (56)
                      +..+...+.-||+.+     ..+.-+.+.+.|..|++.++.. |..
T Consensus        24 ~ayr~la~~~HpD~~~~~~~a~~~f~~i~~Ay~~L~d~~~R~~yD~   69 (386)
T PRK14289         24 KAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRSRYDQ   69 (386)
T ss_pred             HHHHHHHHHHCCCCCCCChHHHHHHHHHHHHHHHhcCHHHHHHHHH
Confidence            345556666788875     3455677889999998887554 664


No 120
>PF13542 HTH_Tnp_ISL3:  Helix-turn-helix domain of transposase family ISL3
Probab=33.29  E-value=37  Score=14.58  Aligned_cols=32  Identities=6%  Similarity=0.257  Sum_probs=20.2

Q ss_pred             HHHHHHHHHHHhCCCC-CHHHHHHHHHHHhcCC
Q psy5907           6 YSRKVWDQVKAQNLDL-KLWEIGKIIGQMWRDL   37 (56)
Q Consensus         6 f~~~~~~~~~~~~p~~-~~~ei~k~l~~~W~~l   37 (56)
                      +.......+....-+. +..+|++.+|=.|...
T Consensus        12 ~T~~~~~~i~~~~~~~~s~~~vA~~~~vs~~TV   44 (52)
T PF13542_consen   12 ITKRLEQYILKLLRESRSFKDVARELGVSWSTV   44 (52)
T ss_pred             HHHHHHHHHHHHHhhcCCHHHHHHHHCCCHHHH
Confidence            3344444455444444 8999999888777643


No 121
>smart00413 ETS erythroblast transformation specific domain. variation of the helix-turn-helix motif
Probab=33.25  E-value=26  Score=17.84  Aligned_cols=24  Identities=8%  Similarity=0.318  Sum_probs=10.8

Q ss_pred             HHHHhcCCCHHHHHHHHHHHHHhh
Q psy5907          30 IGQMWRDLPEDQKTEYVEDYEQEK   53 (56)
Q Consensus        30 l~~~W~~l~~~eK~~y~~~a~~~k   53 (56)
                      +++.|..-......-|+.++...+
T Consensus        39 vA~lWG~~Knk~~M~YeklSRaLR   62 (87)
T smart00413       39 VARLWGQRKNKPNMNYEKLSRALR   62 (87)
T ss_pred             HHHHHhhhcCCCCCCHHHHHHHHH
Confidence            444554433333344555554443


No 122
>smart00206 NTR Tissue inhibitor of metalloproteinase family. Form complexes with metalloproteinases, such as collagenases, and irreversibly inactivate them.
Probab=33.14  E-value=26  Score=20.02  Aligned_cols=18  Identities=22%  Similarity=0.462  Sum_probs=15.5

Q ss_pred             HhcCCCHHHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVEDYE   50 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~a~   50 (56)
                      .|.+|+..+|.-+...+.
T Consensus       101 pW~~lT~~Qr~Gl~~~Y~  118 (172)
T smart00206      101 PWDSLSLAQRKGLNKRYH  118 (172)
T ss_pred             ehHhCCHHHHhhhhHhhc
Confidence            599999999999887764


No 123
>PF12376 DUF3654:  Protein of unknown function (DUF3654) ;  InterPro: IPR022115  This family of proteins is found in eukaryotes. Proteins in this family are typically between 193 and 612 amino acids in length. 
Probab=33.08  E-value=63  Score=17.86  Aligned_cols=21  Identities=29%  Similarity=0.373  Sum_probs=15.3

Q ss_pred             cCCCHHHHHHHHHHHHHhhhc
Q psy5907          35 RDLPEDQKTEYVEDYEQEKIY   55 (56)
Q Consensus        35 ~~l~~~eK~~y~~~a~~~k~~   55 (56)
                      ..||+++|..+...-..-|.|
T Consensus         6 ~~~s~eEk~k~~e~l~~ikey   26 (137)
T PF12376_consen    6 SELSEEEKKKEEEMLNKIKEY   26 (137)
T ss_pred             cccCHHHHHHHHHHHHHHHHH
Confidence            467888888888777666654


No 124
>TIGR02349 DnaJ_bact chaperone protein DnaJ. This model represents bacterial forms of DnaJ, part of the DnaK-DnaJ-GrpE chaperone system. The three components typically are encoded by consecutive genes. DnaJ homologs occur in many genomes, typically not near DnaK and GrpE-like genes; most such genes are not included by this family. Eukaryotic (mitochondrial and chloroplast) forms are not included in the scope of this family.
Probab=32.78  E-value=1.2e+02  Score=18.73  Aligned_cols=39  Identities=21%  Similarity=0.155  Sum_probs=27.5

Q ss_pred             HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.++...+.-||+..    ..+.-+.|.+.|..|++..+. .|.
T Consensus        19 ~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vL~d~~~R~~yd   62 (354)
T TIGR02349        19 KAYRKLAKKYHPDRNKDKEAEEKFKEINEAYEVLSDPEKRAQYD   62 (354)
T ss_pred             HHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhhChHHHHhhh
Confidence            455666677788875    345667899999988877744 444


No 125
>PHA02947 S-S bond formation pathway protein; Provisional
Probab=32.73  E-value=49  Score=19.62  Aligned_cols=28  Identities=25%  Similarity=0.321  Sum_probs=23.6

Q ss_pred             CCCCCHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907          18 NLDLKLWEIGKIIGQMWRDLPEDQKTEY   45 (56)
Q Consensus        18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y   45 (56)
                      ++..++.-+.+.+.+--..||+++|..-
T Consensus        58 na~~sf~lli~a~~Et~~~Lp~~qk~~i   85 (215)
T PHA02947         58 NKRLSFTLLIKTFKEVISTLPEKERREL   85 (215)
T ss_pred             CHHHHHHHHHHHHHHHHHhCCHHHHHHH
Confidence            4567788889999999999999999754


No 126
>PF02216 B:  B domain;  InterPro: IPR003132 This entry represents the immunoglobulin-binding domain found in the Staphylococcus aureus virulence factor protein A (SpA). Protein A contains five highly homologous Ig-binding domains in tandem (designated domains E, D, A, B and C), which share a common structure consisting of three helices in a closed left-handed twist. Protein A can exist in both secreted and membrane-bound forms, and has two distinct Ig-binding activities: each domain can bind Fc-gamma (the constant region of IgG involved in effector functions) and Fab (the Ig fragment responsible for antigen recognition) [].; GO: 0019865 immunoglobulin binding, 0009405 pathogenesis; PDB: 1EDL_A 1EDI_A 1EDJ_A 1EDK_A 2B88_A 2B87_A 2B89_A 1FC2_C 1DEE_H 1ZXG_A ....
Probab=32.64  E-value=60  Score=15.06  Aligned_cols=17  Identities=24%  Similarity=0.628  Sum_probs=11.8

Q ss_pred             cCCCHHHHHHHHHHHHH
Q psy5907          35 RDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        35 ~~l~~~eK~~y~~~a~~   51 (56)
                      ..|++++|.-|++.-+.
T Consensus        20 ~nLteeQrn~yI~~lKd   36 (54)
T PF02216_consen   20 PNLTEEQRNGYIQSLKD   36 (54)
T ss_dssp             TTS-HHHHHHHHHHHHH
T ss_pred             CCcCHHHHHhHHHHHhh
Confidence            46788888888876554


No 127
>PHA02955 hypothetical protein; Provisional
Probab=32.43  E-value=49  Score=19.61  Aligned_cols=28  Identities=11%  Similarity=0.194  Sum_probs=23.9

Q ss_pred             CCCCCHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907          18 NLDLKLWEIGKIIGQMWRDLPEDQKTEY   45 (56)
Q Consensus        18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y   45 (56)
                      ++..++.-+.+.+.+--..||+++|..-
T Consensus        58 na~~sf~lli~a~~Et~~~Lp~~qk~~i   85 (213)
T PHA02955         58 DEEKNFQLLIEALIETIENFPEKEQKEI   85 (213)
T ss_pred             CHHHHHHHHHHHHHHHHHhCCHHHHHHH
Confidence            4567788899999999999999999765


No 128
>PF00356 LacI:  Bacterial regulatory proteins, lacI family;  InterPro: IPR000843 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. These proteins are very diverse, but for convenience may be grouped into subfamilies on the basis of sequence similarity. One such family groups together a range of proteins, including ascG, ccpA, cytR, ebgR, fruR, galR, galS, lacI, malI, opnR, purF, rafR, rbtR and scrR [, ]. Within this family, the HTH motif is situated towards the N terminus.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 3KJX_C 1ZAY_A 1VPW_A 2PUA_A 1QQA_A 1PNR_A 1JFT_A 1QP4_A 2PUD_A 1JH9_A ....
Probab=32.42  E-value=53  Score=14.39  Aligned_cols=32  Identities=9%  Similarity=0.233  Sum_probs=25.1

Q ss_pred             CCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          19 LDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      -+.+.+-|++.|... ..++++.++.-.+.+++
T Consensus         9 agvS~~TVSr~ln~~-~~vs~~tr~rI~~~a~~   40 (46)
T PF00356_consen    9 AGVSKSTVSRVLNGP-PRVSEETRERILEAAEE   40 (46)
T ss_dssp             HTSSHHHHHHHHTTC-SSSTHHHHHHHHHHHHH
T ss_pred             HCcCHHHHHHHHhCC-CCCCHHHHHHHHHHHHH
Confidence            478889999999887 78899988876655544


No 129
>cd08804 Death_ank2 Death domain of Ankyrin-2. Death Domain (DD) of Ankyrin-2 (ANK-2) and related proteins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. ANK-2, also called ankyrin-B (for broadly expressed), is required for proper function of the Na/Ca ion exchanger-1 in cardiomyocytes, and is thought to function in linking integral membrane proteins to the underlying cytoskeleton. Human ANK-2 is associated with "Ankyrin-B syndrome", an atypical arrythmia disorder with risk of sudden cardiac death. It also plays key roles in the brain and striated muscle. Loss of ANK-2 is associated with significant nervous system defects and sarcomere disorganization. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other
Probab=32.05  E-value=20  Score=17.78  Aligned_cols=20  Identities=25%  Similarity=0.513  Sum_probs=14.9

Q ss_pred             CCHHHHHHHHHHHhcCCCHH
Q psy5907          21 LKLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus        21 ~~~~ei~k~l~~~W~~l~~~   40 (56)
                      .....|+..||..|+.+..+
T Consensus         5 ~~l~~ia~~LG~dWk~LAr~   24 (84)
T cd08804           5 ERLAVIADHLGFSWTELARE   24 (84)
T ss_pred             hHHHHHHHHHhhhHHHHHHH
Confidence            44577888999999877544


No 130
>cd06257 DnaJ DnaJ domain or J-domain.  DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=31.65  E-value=52  Score=14.04  Aligned_cols=31  Identities=16%  Similarity=-0.016  Sum_probs=18.7

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCC
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLP   38 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~   38 (56)
                      +..+..++.-||+...     .+....|.+.|..|+
T Consensus        19 ~~y~~l~~~~HPD~~~~~~~~~~~~~~l~~Ay~~L~   54 (55)
T cd06257          19 KAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLS   54 (55)
T ss_pred             HHHHHHHHHHCcCCCCCcHHHHHHHHHHHHHHHHhc
Confidence            4456666777887653     345556666666554


No 131
>cd09071 FAR_C C-terminal domain of fatty acyl CoA reductases. C-terminal domain of fatty acyl CoA reductases, a family of SDR-like proteins. SDRs or short-chain dehydrogenases/reductases are Rossmann-fold NAD(P)H-binding proteins. Many proteins in this FAR_C family may function as fatty acyl-CoA reductases (FARs), acting on medium and long chain fatty acids, and have been reported to be involved in diverse processes such as the biosynthesis of insect pheromones, plant cuticular wax production, and mammalian wax biosynthesis. In Arabidopsis thaliana, proteins with this particular architecture have also been identified as the MALE STERILITY 2 (MS2) gene product, which is implicated in male gametogenesis. Mutations in MS2 inhibit the synthesis of exine (sporopollenin), rendering plants unable to reduce pollen wall fatty acids to corresponding alcohols. The function of this C-terminal domain is unclear.
Probab=31.43  E-value=33  Score=16.66  Aligned_cols=15  Identities=20%  Similarity=0.720  Sum_probs=10.8

Q ss_pred             HHHhcCCCHHHHHHH
Q psy5907          31 GQMWRDLPEDQKTEY   45 (56)
Q Consensus        31 ~~~W~~l~~~eK~~y   45 (56)
                      -..|+.|++++|+.|
T Consensus        53 ~~L~~~l~~~Dr~~F   67 (92)
T cd09071          53 RALWERLSEEDRELF   67 (92)
T ss_pred             HHHHHHCCHHHHHhC
Confidence            345788888888755


No 132
>PF04363 DUF496:  Protein of unknown function (DUF496);  InterPro: IPR007458 Members of this family are uncharacterised proteins.
Probab=31.26  E-value=71  Score=16.47  Aligned_cols=17  Identities=29%  Similarity=0.384  Sum_probs=15.0

Q ss_pred             CCCCHHHHHHHHHHHhc
Q psy5907          19 LDLKLWEIGKIIGQMWR   35 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~   35 (56)
                      |+++..+|..+|..|-.
T Consensus        47 ~~Ms~edi~~II~nMr~   63 (95)
T PF04363_consen   47 PDMSIEDIRAIIENMRS   63 (95)
T ss_pred             CCCCHHHHHHHHHHHHh
Confidence            99999999999988754


No 133
>cd03585 NTR_TIMP NTR domain, TIMP subfamily; TIMPs, or tissue inibitors of metalloproteases, are essential regulators of extracellular matrix turnover and remodeling. They form complexes with matrix metalloproteases (MMPs) and inactivate them irreversibly by non-covalently binding their active zinc-binding sites. The levels of activated membrane-type MMPs, MMPs, and free TIMPs determine the balance between matrix degradation and matrix formation or stabilization. Consequently, TIMPs play roles in processes that require the remodeling and degradation of connective tissue, such as development, morphogenesis, wound healing, as well as in various diseases and pathological states such as tumor cell metastasis, arthritis, and artherosclerosis. Most TIMPs bind to a variety of MMPs. TIMP-1 and TIMP-2 appear to be multifunctional proteins with diverse biological action. They may exhibit growth factor-like activity and can inhibit angiogenesis. TIMP-3 has been implicated in apoptosis.
Probab=31.20  E-value=28  Score=20.08  Aligned_cols=18  Identities=28%  Similarity=0.486  Sum_probs=15.4

Q ss_pred             HhcCCCHHHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVEDYE   50 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~a~   50 (56)
                      .|.+|+..+|.-+...+.
T Consensus       106 pW~~lT~~Qr~gl~~~Y~  123 (183)
T cd03585         106 PWDSLSLTQKKGLNHRYQ  123 (183)
T ss_pred             ehHhCCHHHHhhhhHhhc
Confidence            599999999998887764


No 134
>PRK05423 hypothetical protein; Provisional
Probab=31.05  E-value=87  Score=16.38  Aligned_cols=18  Identities=33%  Similarity=0.357  Sum_probs=15.4

Q ss_pred             CCCCHHHHHHHHHHHhcC
Q psy5907          19 LDLKLWEIGKIIGQMWRD   36 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~   36 (56)
                      |+++..+|..+|..|-..
T Consensus        54 ~~Ms~e~i~~II~nMr~D   71 (104)
T PRK05423         54 PGMSIEEIQGIIANMKSD   71 (104)
T ss_pred             CCCCHHHHHHHHHHHHhh
Confidence            899999999999888543


No 135
>PRK14277 chaperone protein DnaJ; Provisional
Probab=31.01  E-value=1.4e+02  Score=18.85  Aligned_cols=39  Identities=23%  Similarity=0.176  Sum_probs=27.3

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+.     .+.-+.|.+.+..|++.+|. .|.
T Consensus        24 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD   68 (386)
T PRK14277         24 KAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEILSDPQKRAQYD   68 (386)
T ss_pred             HHHHHHHHHHCCCcCCCchHHHHHHHHHHHHHHHhCCHHHHHHHH
Confidence            4566666777888753     45677899999999866654 444


No 136
>PF06424 PRP1_N:  PRP1 splicing factor, N-terminal;  InterPro: IPR010491 This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1 []. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression.; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005634 nucleus
Probab=30.96  E-value=48  Score=18.15  Aligned_cols=26  Identities=15%  Similarity=0.599  Sum_probs=16.1

Q ss_pred             HHHhCCCCC--HHHHHHHHH----HHhcCCCH
Q psy5907          14 VKAQNLDLK--LWEIGKIIG----QMWRDLPE   39 (56)
Q Consensus        14 ~~~~~p~~~--~~ei~k~l~----~~W~~l~~   39 (56)
                      ...++|.++  |+++-+.|+    +.|-+||+
T Consensus        95 ~~~~~pkI~~QFaDLKR~La~VS~eeW~~IPE  126 (133)
T PF06424_consen   95 YRKENPKIQQQFADLKRSLATVSEEEWENIPE  126 (133)
T ss_pred             hhccCchHHHHHHHHHHHHccCCHHHHhcCCc
Confidence            334456554  666666654    67888876


No 137
>PHA02662 ORF131 putative membrane protein; Provisional
Probab=30.87  E-value=55  Score=19.59  Aligned_cols=28  Identities=11%  Similarity=0.133  Sum_probs=23.7

Q ss_pred             CCCCCHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907          18 NLDLKLWEIGKIIGQMWRDLPEDQKTEY   45 (56)
Q Consensus        18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y   45 (56)
                      ++..++.-+.+.+.+--..||+++|..-
T Consensus        71 na~~sf~lll~Al~Et~~~Lp~~qK~~i   98 (226)
T PHA02662         71 DAADALALASAALAETLAELPRADRLAV   98 (226)
T ss_pred             CHHHHHHHHHHHHHHHHHhCCHHHHHHH
Confidence            4566788899999999999999999763


No 138
>PF10675 DUF2489:  Protein of unknown function (DUF2489);  InterPro: IPR019617  This entry represents bacterial uncharacterised proteins. 
Probab=30.51  E-value=76  Score=17.14  Aligned_cols=19  Identities=11%  Similarity=0.267  Sum_probs=15.4

Q ss_pred             HHHHhcCCCHHHHHHHHHH
Q psy5907          30 IGQMWRDLPEDQKTEYVED   48 (56)
Q Consensus        30 l~~~W~~l~~~eK~~y~~~   48 (56)
                      +++.|+.|+..++..++..
T Consensus        94 theaRk~L~k~ermk~D~e  112 (131)
T PF10675_consen   94 THEARKALPKKERMKLDLE  112 (131)
T ss_pred             cHHHHHcCCHHHHHHHhHH
Confidence            4677999999999887744


No 139
>cd08319 Death_RAIDD Death domain of RIP-associated ICH-1 homologous protein with a death domain. Death domain (DD) of RAIDD (RIP-associated ICH-1 homologous protein with a death domain), also known as CRADD (Caspase and RIP adaptor). RAIDD is an adaptor protein that together with the p53-inducible protein PIDD and caspase-2, forms the PIDDosome complex, which is required for caspase-2 activation and plays a role in mediating stress-induced apoptosis. RAIDD contains an N-terminal Caspase Activation and Recruitment Domain (CARD), which interacts with the caspase-2 CARD, and a C-terminal DD, which interacts with the DD of PIDD. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD, DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other pr
Probab=30.12  E-value=24  Score=17.56  Aligned_cols=16  Identities=19%  Similarity=0.557  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHhcCCC
Q psy5907          23 LWEIGKIIGQMWRDLP   38 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~   38 (56)
                      ...|+..||..|+.|.
T Consensus         5 L~~la~~LG~~W~~La   20 (83)
T cd08319           5 LNQLAQRLGPEWEQVL   20 (83)
T ss_pred             HHHHHHHHhhhHHHHH
Confidence            4568888999998764


No 140
>cd08306 Death_FADD Fas-associated Death Domain protein-protein interaction domain. Death domain (DD) found in FAS-associated via death domain (FADD). FADD is a component of the death-inducing signaling complex (DISC) and serves as an adaptor in the signaling pathway of death receptor proteins. It modulates apoptosis as well as non-apoptotic processes such as cell cycle progression, survival, innate immune signaling, and hematopoiesis. FADD contains an N-terminal DED and a C-terminal DD. Its DD interacts with the DD of the activated death receptor, FAS, and its DED recruits the initiator caspases, caspase-8 and -10, to the DISC complex via a homotypic interaction with the N-terminal DED of the caspase. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain),
Probab=30.11  E-value=27  Score=17.28  Aligned_cols=17  Identities=29%  Similarity=0.526  Sum_probs=12.0

Q ss_pred             CHHHHHHHHHHHhcCCC
Q psy5907          22 KLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        22 ~~~ei~k~l~~~W~~l~   38 (56)
                      .+.-|+..+|..|+.+.
T Consensus         4 ~f~~i~~~lG~~Wk~la   20 (86)
T cd08306           4 AFDVICENVGRDWRKLA   20 (86)
T ss_pred             HHHHHHHHHhhhHHHHH
Confidence            35667788888887543


No 141
>COG4282 SMI1 Protein involved in beta-1,3-glucan synthesis [Carbohydrate transport and metabolism]
Probab=29.21  E-value=1.2e+02  Score=17.55  Aligned_cols=29  Identities=17%  Similarity=0.214  Sum_probs=25.6

Q ss_pred             CCCCCHHHHHHHHHHHhcCCCHHHHHHHH
Q psy5907          18 NLDLKLWEIGKIIGQMWRDLPEDQKTEYV   46 (56)
Q Consensus        18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y~   46 (56)
                      +|+++..+|.+.=.++-..+|++-|+.|.
T Consensus        30 ~pgat~~di~~aE~dlg~tlPpdvResl~   58 (191)
T COG4282          30 RPGATCGDIQRAEADLGRTLPPDVRESLA   58 (191)
T ss_pred             CCCccHHHHHHHHHHhcCcCChHHHHHHH
Confidence            37888999999999999999999998764


No 142
>PF12926 MOZART2:  Mitotic-spindle organizing gamma-tubulin ring associated;  InterPro: IPR024332 The MOZART2 family of proteins (also known as FAM128 and Mitotic-spindle organizing protein 2) operate as part of the gamma-tubulin ring complex, gamma-TuRC, one of the complexes necessary for chromosome segregation. This complex is located at centrosomes and mediates the formation of bipolar spindles in mitosis; it consists of six subunits. However, unlike the other four known subunits, the MOZART proteins, both 1 and 2, do not carry the conserved 'Spc97-Spc98' GCP domain, so the TUBGCP nomenclature cannot be used for it. The exact function of MOZART2 is not clear [].
Probab=28.81  E-value=91  Score=15.94  Aligned_cols=28  Identities=11%  Similarity=-0.074  Sum_probs=22.2

Q ss_pred             CCCCHHHHHHHHHHHhcCCCHHHHHHHH
Q psy5907          19 LDLKLWEIGKIIGQMWRDLPEDQKTEYV   46 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~l~~~eK~~y~   46 (56)
                      |+....++.+.--..-+.|+++|.+-|+
T Consensus         5 ~~~~~~~~~k~~~~rk~~Ls~eE~EL~E   32 (88)
T PF12926_consen    5 PGSPTAQVYKYSLRRKKVLSAEEVELYE   32 (88)
T ss_pred             CCCChHHHHHHHHHHHhccCHHHHHHHH
Confidence            5556677777777888999999998775


No 143
>PF07757 AdoMet_MTase:  Predicted AdoMet-dependent methyltransferase;  InterPro: IPR011671 tRNA (uracil-O(2)-)-methyltransferase catalyses the formation of O(2)-methyl-uracil at position 44 (m2U44) in tRNA(Ser) [].; GO: 0008168 methyltransferase activity
Probab=28.62  E-value=70  Score=17.09  Aligned_cols=25  Identities=12%  Similarity=0.217  Sum_probs=19.5

Q ss_pred             HHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907          27 GKIIGQMWRDLPEDQKTEYVEDYEQ   51 (56)
Q Consensus        27 ~k~l~~~W~~l~~~eK~~y~~~a~~   51 (56)
                      ++.|-+.|..-++-.|..|++.+=+
T Consensus        19 a~~lv~~W~E~TdP~K~VfEDlaIA   43 (112)
T PF07757_consen   19 ARWLVDNWPESTDPQKHVFEDLAIA   43 (112)
T ss_pred             HHHHHHhCcccCCchhhHHHHHHHH
Confidence            4566778998888889999987643


No 144
>PF11460 DUF3007:  Protein of unknown function (DUF3007);  InterPro: IPR021562  This is a family of uncharacterised proteins found in bacteria and eukaryotes. 
Probab=28.30  E-value=1e+02  Score=16.27  Aligned_cols=20  Identities=5%  Similarity=0.146  Sum_probs=14.1

Q ss_pred             HHHHHhcCCCHHHHHHHHHH
Q psy5907          29 IIGQMWRDLPEDQKTEYVED   48 (56)
Q Consensus        29 ~l~~~W~~l~~~eK~~y~~~   48 (56)
                      .+.++..+||++|.+.-...
T Consensus        82 ~lqkRle~l~~eE~~~L~~e  101 (104)
T PF11460_consen   82 ELQKRLEELSPEELEALQAE  101 (104)
T ss_pred             HHHHHHHhCCHHHHHHHHHH
Confidence            34557788999988765543


No 145
>PRK13965 ribonucleotide-diphosphate reductase subunit beta; Provisional
Probab=28.16  E-value=61  Score=20.12  Aligned_cols=17  Identities=29%  Similarity=0.579  Sum_probs=14.0

Q ss_pred             HhcCCCHHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVEDY   49 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~a   49 (56)
                      .|+.||++||..|...-
T Consensus        60 dw~~Lt~~Er~~~~~~l   76 (335)
T PRK13965         60 SWRSLGEDWQQLITRTF   76 (335)
T ss_pred             HHHhCCHHHHHHHHHHH
Confidence            38999999999987643


No 146
>PRK14293 chaperone protein DnaJ; Provisional
Probab=27.91  E-value=1.6e+02  Score=18.50  Aligned_cols=41  Identities=15%  Similarity=0.181  Sum_probs=28.3

Q ss_pred             HHHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHH-HHHHHH
Q psy5907           7 SRKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQ-KTEYVE   47 (56)
Q Consensus         7 ~~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~e-K~~y~~   47 (56)
                      .+.++...+.-||+.+    ..+.-+.|.+.|..|++.. |..|..
T Consensus        21 k~ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~vL~~~~~R~~yd~   66 (374)
T PRK14293         21 KRAYRRLARKYHPDVNKEPGAEDRFKEINRAYEVLSDPETRARYDQ   66 (374)
T ss_pred             HHHHHHHHHHHCCCCCCCcCHHHHHHHHHHHHHHHhchHHHHHHhh
Confidence            3455666667788754    5566778999999888666 555553


No 147
>PF13404 HTH_AsnC-type:  AsnC-type helix-turn-helix domain; PDB: 2ZNY_E 2ZNZ_G 1RI7_A 2CYY_A 2E1C_A 2VC1_B 2QZ8_A 2W29_C 2IVM_B 2VBX_B ....
Probab=27.87  E-value=63  Score=13.78  Aligned_cols=16  Identities=13%  Similarity=0.171  Sum_probs=10.1

Q ss_pred             HhCCCCCHHHHHHHHH
Q psy5907          16 AQNLDLKLWEIGKIIG   31 (56)
Q Consensus        16 ~~~p~~~~~ei~k~l~   31 (56)
                      ..++..++.+|++.+|
T Consensus        13 q~d~r~s~~~la~~lg   28 (42)
T PF13404_consen   13 QEDGRRSYAELAEELG   28 (42)
T ss_dssp             HH-TTS-HHHHHHHHT
T ss_pred             HHcCCccHHHHHHHHC
Confidence            4457778888887765


No 148
>PF04255 DUF433:  Protein of unknown function (DUF433);  InterPro: IPR007367 This is a family of uncharacterised proteins.; PDB: 2GA1_B.
Probab=27.86  E-value=71  Score=14.38  Aligned_cols=19  Identities=16%  Similarity=0.281  Sum_probs=11.3

Q ss_pred             HHHHHhCCCCCHHHHHHHH
Q psy5907          12 DQVKAQNLDLKLWEIGKII   30 (56)
Q Consensus        12 ~~~~~~~p~~~~~ei~k~l   30 (56)
                      +.+...+|+++..+|...+
T Consensus        35 eeI~~~yp~Lt~~~i~aAl   53 (56)
T PF04255_consen   35 EEIAEDYPSLTLEDIRAAL   53 (56)
T ss_dssp             HHHHHHSTT--HHHHHHHH
T ss_pred             HHHHHHCCCCCHHHHHHHH
Confidence            4566677878777776554


No 149
>PF14647 FAM91_N:  FAM91 N-terminus
Probab=27.84  E-value=1.2e+02  Score=19.00  Aligned_cols=31  Identities=13%  Similarity=0.326  Sum_probs=23.3

Q ss_pred             ChHHhHHHHHHHHHHHhC-----CCCCHHHHHHHHH
Q psy5907           1 MPYMRYSRKVWDQVKAQN-----LDLKLWEIGKIIG   31 (56)
Q Consensus         1 ~ay~lf~~~~~~~~~~~~-----p~~~~~ei~k~l~   31 (56)
                      +||-.|+.-....++.+.     |+-+.+|.-+.+|
T Consensus        87 TPF~YY~~ml~~lm~~ekSYDsLPNFTAaD~LRllG  122 (308)
T PF14647_consen   87 TPFSYYIDMLAELMKSEKSYDSLPNFTAADCLRLLG  122 (308)
T ss_pred             CcHHHHHHHHHHHHHcCCCcccCCCCcHHHHHHHhc
Confidence            467777777777777554     8889999888776


No 150
>TIGR00647 MG103 conserved hypothetical protein.
Probab=27.73  E-value=74  Score=19.53  Aligned_cols=20  Identities=20%  Similarity=0.217  Sum_probs=16.8

Q ss_pred             HHHHHhCCCCCHHHHHHHHH
Q psy5907          12 DQVKAQNLDLKLWEIGKIIG   31 (56)
Q Consensus        12 ~~~~~~~p~~~~~ei~k~l~   31 (56)
                      ..++.+||+++-.|+...+.
T Consensus       236 a~lRl~~Pd~SL~ELgell~  255 (279)
T TIGR00647       236 CLLKIDHPDWSLEQIAEFFA  255 (279)
T ss_pred             HHHHHhCcccCHHHHHHHhc
Confidence            35788999999999988874


No 151
>TIGR03092 SASP_sspI small, acid-soluble spore protein I. This protein family is restricted to a subset of endospore-forming bacteria such as Bacillus subtilis, all of which are in the Firmicutes (low-GC Gram-positive) lineage. It is a minor SASP (small, acid-soluble spore protein) designated SspI. The gene in Bacillus subtilis previously was designated ysfA.
Probab=27.57  E-value=85  Score=15.15  Aligned_cols=18  Identities=28%  Similarity=0.647  Sum_probs=13.9

Q ss_pred             HHHhcCCCHHHHHHHHHH
Q psy5907          31 GQMWRDLPEDQKTEYVED   48 (56)
Q Consensus        31 ~~~W~~l~~~eK~~y~~~   48 (56)
                      -..|+.+++.+|....+.
T Consensus        42 E~~W~~~~~~ek~~m~~~   59 (65)
T TIGR03092        42 EAIWKHANEQEKDEMLET   59 (65)
T ss_pred             HHHHHhcCHHHHHHHHHH
Confidence            346999999999876543


No 152
>PF11625 DUF3253:  Protein of unknown function (DUF3253);  InterPro: IPR021660  This bacterial family of proteins has no known function. ; PDB: 2NS0_A.
Probab=27.51  E-value=94  Score=15.66  Aligned_cols=38  Identities=18%  Similarity=0.246  Sum_probs=23.8

Q ss_pred             HHHHhCCC--CCHHHHHHHHHH-HhcCCCHHHHHHHHHHHH
Q psy5907          13 QVKAQNLD--LKLWEIGKIIGQ-MWRDLPEDQKTEYVEDYE   50 (56)
Q Consensus        13 ~~~~~~p~--~~~~ei~k~l~~-~W~~l~~~eK~~y~~~a~   50 (56)
                      .+....|+  +.++|+++.++- .|+.+-+..+.-=..++.
T Consensus        15 ll~~R~~~ktiCPSevARal~~~~WR~lm~~vR~~A~~L~~   55 (83)
T PF11625_consen   15 LLAARGPGKTICPSEVARALGPDDWRDLMPPVRAAARRLAR   55 (83)
T ss_dssp             HHHHS-TT--B-HHHHHHHH-TTS-GGGHHHHHHHHHHHHH
T ss_pred             HHHhcCCCCccCHHHHHHHHCchhhHHHHHHHHHHHHHHHH
Confidence            34444455  458999999984 499998888876555554


No 153
>PLN02492 ribonucleoside-diphosphate reductase
Probab=27.36  E-value=63  Score=19.81  Aligned_cols=17  Identities=18%  Similarity=0.415  Sum_probs=14.1

Q ss_pred             HHhcCCCHHHHHHHHHH
Q psy5907          32 QMWRDLPEDQKTEYVED   48 (56)
Q Consensus        32 ~~W~~l~~~eK~~y~~~   48 (56)
                      ..|+.||++||..|...
T Consensus        45 ~dw~~Lt~~Er~~~~~i   61 (324)
T PLN02492         45 KDWEKLTDDERHFISHV   61 (324)
T ss_pred             HHHHhCCHHHHHHHHHH
Confidence            34999999999988754


No 154
>PF07800 DUF1644:  Protein of unknown function (DUF1644);  InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain. 
Probab=27.21  E-value=30  Score=19.60  Aligned_cols=30  Identities=10%  Similarity=0.241  Sum_probs=24.1

Q ss_pred             HHHHHHHHHhCCCCCHHHHHHHHHHHhcCC
Q psy5907           8 RKVWDQVKAQNLDLKLWEIGKIIGQMWRDL   37 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l   37 (56)
                      .+.+.-++.+||...+.+|--.....|+.|
T Consensus       122 ~eLrKHar~~HP~~rP~~vDP~rq~~W~~l  151 (162)
T PF07800_consen  122 SELRKHARSEHPSARPSEVDPQRQRDWERL  151 (162)
T ss_pred             HHHHHHHHhhCCCCCCccCCHHHHHHHHHH
Confidence            456677888999999999888888888765


No 155
>PF00887 ACBP:  Acyl CoA binding protein;  InterPro: IPR000582 Acyl-CoA-binding protein (ACBP) is a small (10 Kd) protein that binds medium- and long-chain acyl-CoA esters with very high affinity and may function as an intracellular carrier of acyl-CoA esters []. ACBP is also known as diazepam binding inhibitor (DBI) or endozepine (EP) because of its ability to displace diazepam from the benzodiazepine (BZD) recognition site located on the GABA type A receptor. It is therefore possible that this protein also acts as a neuropeptide to modulate the action of the GABA receptor []. ACBP is a highly conserved protein of about 90 residues that is found in all four eukaryotic kingdoms, Animalia, Plantae, Fungi and Protista, and in some eubacterial species []. Although ACBP occurs as a completely independent protein, intact ACB domains have been identified in a number of large, multifunctional proteins in a variety of eukaryotic species. These include large membrane-associated proteins with N-terminal ACB domains, multifunctional enzymes with both ACB and peroxisomal enoyl-CoA Delta(3), Delta(2)-enoyl-CoA isomerase domains, and proteins with both an ACB domain and ankyrin repeats (IPR002110 from INTERPRO) []. The ACB domain consists of four alpha-helices arranged in a bowl shape with a highly exposed acyl-CoA-binding site. The ligand is bound through specific interactions with residues on the protein, most notably several conserved positive charges that interact with the phosphate group on the adenosine-3'phosphate moiety, and the acyl chain is sandwiched between the hydrophobic surfaces of CoA and the protein []. Other proteins containing an ACB domain include:   Endozepine-like peptide (ELP) (gene DBIL5) from mouse []. ELP is a testis-specific ACBP homologue that may be involved in the energy metabolism of the mature sperm. MA-DBI, a transmembrane protein of unknown function which has been found in mammals. MA-DBI contains a N-terminal ACB domain. DRS-1 [], a human protein of unknown function that contains a N-terminal ACB domain and a C-terminal enoyl-CoA isomerase/hydratase domain.  ; GO: 0000062 fatty-acyl-CoA binding; PDB: 2CB8_A 2FJ9_A 2LBB_A 1ST7_A 3EPY_B 2FDQ_C 1NTI_A 1HB8_A 1ACA_A 1NVL_A ....
Probab=27.02  E-value=89  Score=15.24  Aligned_cols=49  Identities=20%  Similarity=0.300  Sum_probs=27.9

Q ss_pred             HHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCC----HHHHHHHHHHHHHh
Q psy5907           3 YMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP----EDQKTEYVEDYEQE   52 (56)
Q Consensus         3 y~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~----~~eK~~y~~~a~~~   52 (56)
                      |-+|.+...+.+....|+.. .-+.+..-+.|+.+.    ++-+..|++...+.
T Consensus        30 YalyKQAt~Gd~~~~~P~~~-d~~~~~K~~AW~~l~gms~~eA~~~Yi~~v~~~   82 (87)
T PF00887_consen   30 YALYKQATHGDCDTPRPGFF-DIEGRAKWDAWKALKGMSKEEAMREYIELVEEL   82 (87)
T ss_dssp             HHHHHHHHTSS--S-CTTTT-CHHHHHHHHHHHTTTTTHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHhCCCcCCCCcch-hHHHHHHHHHHHHccCCCHHHHHHHHHHHHHHH
Confidence            45666666655555556533 334455567897654    55677788777653


No 156
>PRK06946 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=26.99  E-value=1.3e+02  Score=18.05  Aligned_cols=26  Identities=12%  Similarity=0.012  Sum_probs=20.4

Q ss_pred             HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      .+..-.|+++..|+.+++.+.|.++.
T Consensus        49 NL~~~fPe~s~~e~~~i~r~~~~~~~   74 (293)
T PRK06946         49 NLKLCFPDWSDARREELARRHFRHVI   74 (293)
T ss_pred             HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence            34444699999999999988887665


No 157
>PRK07920 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=26.97  E-value=85  Score=18.82  Aligned_cols=26  Identities=8%  Similarity=0.037  Sum_probs=20.0

Q ss_pred             HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      .+..-.|+++..|+.+++.+.|.++.
T Consensus        44 NL~~~fPe~s~~e~~~i~~~~~~~~~   69 (298)
T PRK07920         44 NLARVVPDAPAAVLDALVRAAMRSYA   69 (298)
T ss_pred             HHHHhcCCCCHHHHHHHHHHHHHHHH
Confidence            44445699999999999888887654


No 158
>TIGR02208 lipid_A_msbB lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase. This family consists of MsbB in E. coli and closely related proteins in other species. MsbB is homologous to HtrB (TIGR02207) and acts immediately after it in the biosynthesis of KDO-2 lipid A (also called Re LPS and Re endotoxin). These two enzymes act after creation of KDO-2 lipid IV-A by addition of the KDO sugars.
Probab=26.94  E-value=1.3e+02  Score=18.13  Aligned_cols=26  Identities=15%  Similarity=0.026  Sum_probs=20.4

Q ss_pred             HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      .+..-.|+++..|+.+++.+.|.++.
T Consensus        60 NL~~~fP~~s~~e~~~i~~~~~~~~g   85 (305)
T TIGR02208        60 NLSACFPEKSEAERETIIDNNFATFV   85 (305)
T ss_pred             HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence            34445699999999999999887664


No 159
>PF14374 Ribos_L4_asso_C:  60S ribosomal protein L4 C-terminal domain; PDB: 3O5H_D 3IZS_D 3O58_D 3IZR_D 4A17_C 4A1C_C 4A1E_C 4A1A_C.
Probab=26.83  E-value=33  Score=17.12  Aligned_cols=14  Identities=14%  Similarity=0.280  Sum_probs=7.3

Q ss_pred             CCCCHHHHHHHHHH
Q psy5907          19 LDLKLWEIGKIIGQ   32 (56)
Q Consensus        19 p~~~~~ei~k~l~~   32 (56)
                      |-++++|++++|.+
T Consensus         8 ~~MtN~Dl~RiInS   21 (80)
T PF14374_consen    8 PKMTNADLSRIINS   21 (80)
T ss_dssp             -S-S---HHHHHHS
T ss_pred             chhccccHHHHhcc
Confidence            67889999999863


No 160
>PF11961 DUF3475:  Domain of unknown function (DUF3475);  InterPro: IPR021864  This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 60 amino acids in length. This domain is found associated with PF05003 from PFAM. 
Probab=26.79  E-value=82  Score=14.75  Aligned_cols=21  Identities=10%  Similarity=0.343  Sum_probs=13.7

Q ss_pred             HHHHHHHHhcCCCHHHHHHHH
Q psy5907          26 IGKIIGQMWRDLPEDQKTEYV   46 (56)
Q Consensus        26 i~k~l~~~W~~l~~~eK~~y~   46 (56)
                      ..-.+...|++|++++=....
T Consensus         9 ~msk~~~L~~SLsd~~i~~Lr   29 (57)
T PF11961_consen    9 TMSKLVNLWQSLSDEEIARLR   29 (57)
T ss_pred             HHHHHHHHHHHcChHHHHHHH
Confidence            334456789999988654443


No 161
>PRK13967 nrdF1 ribonucleotide-diphosphate reductase subunit beta; Provisional
Probab=26.72  E-value=67  Score=19.80  Aligned_cols=17  Identities=12%  Similarity=0.507  Sum_probs=13.7

Q ss_pred             HhcCCCHHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVEDY   49 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~a   49 (56)
                      .|+.|+++||..|....
T Consensus        47 dw~~Lt~~Er~~i~~~l   63 (322)
T PRK13967         47 SWQTLSSTEQQTTIRVF   63 (322)
T ss_pred             HHHhCCHHHHHHHHHHH
Confidence            49999999999877543


No 162
>PRK05645 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=26.67  E-value=95  Score=18.53  Aligned_cols=26  Identities=23%  Similarity=0.276  Sum_probs=20.2

Q ss_pred             HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      .+..-.|+++..|+.+++.+.|.++.
T Consensus        49 NL~~~fP~~s~~e~~~i~~~~~~~~~   74 (295)
T PRK05645         49 NLSKCFPELSPAELEKLVGQSLMDIG   74 (295)
T ss_pred             HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence            34445699999999999888887654


No 163
>PRK08734 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=26.47  E-value=96  Score=18.72  Aligned_cols=26  Identities=12%  Similarity=0.150  Sum_probs=20.2

Q ss_pred             HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      .+..-.|+++..++.+++.+.|.++.
T Consensus        50 NL~~~fP~~s~~e~~~i~~~~~~~~g   75 (305)
T PRK08734         50 NLELAYPELSPQQRAQLHAQILRSTA   75 (305)
T ss_pred             HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence            44455699999999999888887654


No 164
>PF12550 GCR1_C:  Transcriptional activator of glycolytic enzymes;  InterPro: IPR022210  This domain family is found in eukaryotes, and is approximately 80 amino acids in length. This family is activates the transcription of glycolytic enzymes. 
Probab=26.44  E-value=81  Score=15.31  Aligned_cols=22  Identities=18%  Similarity=0.341  Sum_probs=17.0

Q ss_pred             CCCCHHHHHHHHHHHhcCCCHH
Q psy5907          19 LDLKLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~l~~~   40 (56)
                      .+.+..++.+.-|..|+.-+.+
T Consensus        24 g~psI~~le~~yG~~WR~~~~~   45 (81)
T PF12550_consen   24 GQPSIRSLEKKYGSKWRRDSKE   45 (81)
T ss_pred             CCCCHHHHHHHhChhhccCccc
Confidence            3567899999999999974443


No 165
>PF13867 SAP30_Sin3_bdg:  Sin3 binding region of histone deacetylase complex subunit SAP30; PDB: 2LD7_A.
Probab=26.44  E-value=77  Score=14.28  Aligned_cols=26  Identities=8%  Similarity=0.130  Sum_probs=17.6

Q ss_pred             CCCCHHHHHHHHHHHhcCCCHHHHHH
Q psy5907          19 LDLKLWEIGKIIGQMWRDLPEDQKTE   44 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~l~~~eK~~   44 (56)
                      |+.+-.+++..+..-+.+++-+|..-
T Consensus        19 ~~~sK~qLa~~V~kHF~s~~v~E~ev   44 (53)
T PF13867_consen   19 PRSSKEQLANAVRKHFNSQPVDENEV   44 (53)
T ss_dssp             SS--HHHHHHHHHHHHTT----HHHH
T ss_pred             CCCCHHHHHHHHHHHHhcCCCCHHHH
Confidence            68899999999999999999887754


No 166
>PRK10767 chaperone protein DnaJ; Provisional
Probab=26.38  E-value=1.7e+02  Score=18.30  Aligned_cols=39  Identities=21%  Similarity=0.095  Sum_probs=27.0

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHH-HHHH
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQK-TEYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK-~~y~   46 (56)
                      +.+|...+.-||+...     .+.-+.|.+.|..|++.++ ..|.
T Consensus        23 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~~L~d~~~r~~yd   67 (371)
T PRK10767         23 KAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRAAYD   67 (371)
T ss_pred             HHHHHHHHHHCCCCCCCcHHHHHHHHHHHHHHHHhcchhhhhHhh
Confidence            4566666777898752     3555789999999987665 4444


No 167
>smart00509 TFS2N Domain in the N-terminus of transcription elongation factor S-II (and elsewhere).
Probab=26.21  E-value=47  Score=16.01  Aligned_cols=21  Identities=14%  Similarity=0.442  Sum_probs=15.4

Q ss_pred             hCCCCCHHHHHHHHHHHhcCC
Q psy5907          17 QNLDLKLWEIGKIIGQMWRDL   37 (56)
Q Consensus        17 ~~p~~~~~ei~k~l~~~W~~l   37 (56)
                      .|++.....+++.|-+.|+.+
T Consensus        52 kh~~~~I~~~A~~Li~~WK~~   72 (75)
T smart00509       52 KHKNEEIRKLAKKLIKSWKKL   72 (75)
T ss_pred             cCCcHHHHHHHHHHHHHHHHH
Confidence            356666778888888888754


No 168
>PF01527 HTH_Tnp_1:  Transposase;  InterPro: IPR002514 Transposase proteins are necessary for efficient DNA transposition. This family consists of various Escherichia coli insertion elements and other bacterial transposases some of which are members of the IS3 family. This region includes a helix-turn-helix motif (HTH) at the N terminus followed by a leucine zipper (LZ) motif. The LZ motif has been shown to mediate oligomerisation of the transposase components in IS911 []. More information about these proteins can be found at Protein of the Month: Transposase [].; GO: 0003677 DNA binding, 0004803 transposase activity, 0006313 transposition, DNA-mediated; PDB: 2JN6_A 2RN7_A.
Probab=26.16  E-value=35  Score=15.80  Aligned_cols=25  Identities=16%  Similarity=0.056  Sum_probs=13.2

Q ss_pred             HHHHHHHHHHHhC--CCCCHHHHHHHH
Q psy5907           6 YSRKVWDQVKAQN--LDLKLWEIGKII   30 (56)
Q Consensus         6 f~~~~~~~~~~~~--p~~~~~ei~k~l   30 (56)
                      |+.+.+..+....  ++.+..+|++..
T Consensus         7 ys~e~K~~~v~~~~~~g~sv~~va~~~   33 (76)
T PF01527_consen    7 YSPEFKLQAVREYLESGESVSEVAREY   33 (76)
T ss_dssp             --HHHHHHHHHHHHHHHCHHHHHHHHH
T ss_pred             CCHHHHHHHHHHHHHCCCceEeeeccc
Confidence            4555555554444  566667766654


No 169
>PF01047 MarR:  MarR family;  InterPro: IPR000835 The MarR-type HTH domain is a DNA-binding, winged helix-turn-helix (wHTH) domain of about 135 amino acids present in transcription regulators of the MarR/SlyA family, involved in the development of antibiotic resistance. This family of transcription regulators is named after Escherichia coli MarR, a repressor of genes which activate the multiple antibiotic resistance and oxidative stress regulons, and after slyA from Salmonella typhimurium and E. coli, a transcription regulator that is required for virulence and survival in the macrophage environment. Regulators with the MarR-type HTH domain are present in bacteria and archaea and control a variety of biological functions, including resistance to multiple antibiotics, household disinfectants, organic solvents, oxidative stress agents and regulation of the virulence factor synthesis in pathogens of humans and plants. Many of the MarR-like regulators respond to aromatic compounds [, , ]. The crystal structures of MarR, MexR and SlyA have been determined and show a winged HTH DNA-binding core flanked by helices involved in dimerisation. The DNA-binding domains are ascribed to the superfamily of winged helix proteins, containing a three (four)-helix (H) bundle and a three-stranded antiparallel beta-sheet (B) in the topology: H1-(H1')-H2-B1-H3-H4-B2-B3-H5-H6. Helices 3 and 4 comprise the helix-turn-helix motif and the beta-sheet is called the wing. Helix 4 is termed the recognition helix, like in other HTHs where it binds the DNA major groove. The helices 1, 5 and 6 are involved in dimerisation, as most MarR-like transcription regulators form dimers [, ]. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1JGS_A 2NYX_D 2PEX_B 2PFB_A 3BPX_A 3BPV_A 2BV6_A 3BJA_A 3E6M_B 2ETH_A ....
Probab=26.09  E-value=37  Score=14.91  Aligned_cols=16  Identities=19%  Similarity=0.260  Sum_probs=11.3

Q ss_pred             HhCCCCCHHHHHHHHH
Q psy5907          16 AQNLDLKLWEIGKIIG   31 (56)
Q Consensus        16 ~~~p~~~~~ei~k~l~   31 (56)
                      .++|+++..++++.++
T Consensus        13 ~~~~~~~~~~la~~~~   28 (59)
T PF01047_consen   13 YENGGITQSELAEKLG   28 (59)
T ss_dssp             HHHSSEEHHHHHHHHT
T ss_pred             HHcCCCCHHHHHHHHC
Confidence            3467788888877654


No 170
>cd01776 Rin1_RA Ubiquitin domain of RIN1 RAS effector. Rin1_RA   RIN1 is a RAS effector that binds with specificity and high affinity to activated RAS via its carboxy-terminal RA (RAS-associated) domain. RIN1 competes directly with RAF1 for RAS binding and is thought to divert signaling away from RAF and the MAPK pathway while also shunting RAS signals through alternate pathways. In addition, Rin1 and Rin2 are Rab5-binding proteins, binding preferentially to the GTP-bound form,  that enhance the GDP-GTP exchange reaction on Rab5 that regulate the docking and fusion processes of endocytic vesicles.  In addition to the RA domain, RIN1 and RIN2 have an SH2 (Src homology 2) domain, a proline-rich SH3 domain, and a Vps9 domain.
Probab=25.76  E-value=1.1e+02  Score=15.65  Aligned_cols=23  Identities=0%  Similarity=0.066  Sum_probs=18.3

Q ss_pred             CCCCCHHHHHHHHHHHhcCCCHH
Q psy5907          18 NLDLKLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus        18 ~p~~~~~ei~k~l~~~W~~l~~~   40 (56)
                      -|+.+..++..+++++-+--.++
T Consensus        21 ~P~~tt~~vc~lcA~Kf~V~qPe   43 (87)
T cd01776          21 RPYITTEDVCQLCAEKFKVTQPE   43 (87)
T ss_pred             CCCCcHHHHHHHHHHHhccCChh
Confidence            38899999999999887655444


No 171
>PRK09614 nrdF ribonucleotide-diphosphate reductase subunit beta; Reviewed
Probab=25.45  E-value=73  Score=19.44  Aligned_cols=16  Identities=25%  Similarity=0.669  Sum_probs=13.5

Q ss_pred             HhcCCCHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVED   48 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~   48 (56)
                      .|+.||++||..|...
T Consensus        47 dw~~Lt~~Er~~~~~~   62 (324)
T PRK09614         47 DWKKLSDEEKNLYTRV   62 (324)
T ss_pred             HHHhCCHHHHHHHHHH
Confidence            4899999999988754


No 172
>PF05047 L51_S25_CI-B8:  Mitochondrial ribosomal protein L51 / S25 / CI-B8 domain ;  InterPro: IPR007741 Proteins containing this domain are located in the mitochondrion and include ribosomal protein L51, and S25. This domain is also found in mitochondrial NADH-ubiquinone oxidoreductase B8 subunit (CI-B8) 1.6.5.3 from EC. It is not known whether all members of this family form part of the NADH-ubiquinone oxidoreductase and whether they are also all ribosomal proteins.; PDB: 1S3A_A.
Probab=25.02  E-value=76  Score=13.79  Aligned_cols=17  Identities=12%  Similarity=0.249  Sum_probs=12.0

Q ss_pred             HHHHHHHHHHHhCCCCC
Q psy5907           6 YSRKVWDQVKAQNLDLK   22 (56)
Q Consensus         6 f~~~~~~~~~~~~p~~~   22 (56)
                      |.....+.++..||++.
T Consensus         4 F~~~~lp~l~~~NP~v~   20 (52)
T PF05047_consen    4 FLKNNLPTLKYHNPQVQ   20 (52)
T ss_dssp             HHHHTHHHHHHHSTT--
T ss_pred             HHHHhHHHHHHHCCCcE
Confidence            55777788888899864


No 173
>COG2926 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.94  E-value=1.2e+02  Score=15.92  Aligned_cols=18  Identities=39%  Similarity=0.409  Sum_probs=14.8

Q ss_pred             CCCCHHHHHHHHHHHhcC
Q psy5907          19 LDLKLWEIGKIIGQMWRD   36 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~   36 (56)
                      |+++..||..+|..|-..
T Consensus        54 ~~Ms~eei~~II~~MksD   71 (109)
T COG2926          54 PDMSIEEIQGIIESMKSD   71 (109)
T ss_pred             CCCCHHHHHHHHHHHHHH
Confidence            899999999998887543


No 174
>TIGR00534 OpcA opcA protein. The opcA gene is found immediately downstream of zwf, the glucose-6-phosphate dehydrogenase (G6PDH) gene, in a number of species, including Mycobacterium tuberculosis, Streptomyces coelicolor, Nostoc punctiforme, and Synechococcus sp. PCC 7942. In the latter, disruption of opcA was shown to block assembly of G6PDH into active oligomeric forms.
Probab=24.92  E-value=83  Score=19.60  Aligned_cols=19  Identities=37%  Similarity=0.688  Sum_probs=16.7

Q ss_pred             CCCCCHHHHHHHHHHHhcC
Q psy5907          18 NLDLKLWEIGKIIGQMWRD   36 (56)
Q Consensus        18 ~p~~~~~ei~k~l~~~W~~   36 (56)
                      -|+.+..+|++.|.+.|.+
T Consensus         5 l~dtt~~~I~~~L~~l~~~   23 (311)
T TIGR00534         5 PTDITLNEINKELNQLWQS   23 (311)
T ss_pred             CCCCCHHHHHHHHHHHHHh
Confidence            3688999999999999975


No 175
>PRK13966 nrdF2 ribonucleotide-diphosphate reductase subunit beta; Provisional
Probab=24.90  E-value=75  Score=19.62  Aligned_cols=16  Identities=19%  Similarity=0.320  Sum_probs=13.5

Q ss_pred             HhcCCCHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVED   48 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~   48 (56)
                      .|+.||+.+|..|...
T Consensus        49 dw~~Lt~~Ek~~~~~~   64 (324)
T PRK13966         49 SWGTLTAGEKQLTMRV   64 (324)
T ss_pred             HHHhCCHHHHHHHHHH
Confidence            4899999999988754


No 176
>PRK08943 lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase; Validated
Probab=24.78  E-value=1.5e+02  Score=18.01  Aligned_cols=26  Identities=19%  Similarity=0.148  Sum_probs=20.5

Q ss_pred             HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      .+..-.|+++..++.+++.+.|.++.
T Consensus        69 NL~~afPe~s~~e~~~i~~~~~~n~~   94 (314)
T PRK08943         69 NLSLCFPEKSEAEREAIIDEMFATAP   94 (314)
T ss_pred             HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence            44445699999999999999887664


No 177
>COG0484 DnaJ DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]
Probab=24.75  E-value=2e+02  Score=18.53  Aligned_cols=40  Identities=18%  Similarity=0.082  Sum_probs=28.0

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHHHHHH
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKTEYVE   47 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~~y~~   47 (56)
                      +.+|...++-||+.+.     .|-=+.|.+....|++.+|..-.+
T Consensus        23 kAYRkLA~kyHPD~n~g~~~AeeKFKEI~eAYEVLsD~eKRa~YD   67 (371)
T COG0484          23 KAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRAAYD   67 (371)
T ss_pred             HHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHHHHhCCHHHHHHhh
Confidence            4556667777887764     445577889999998887765333


No 178
>PRK14282 chaperone protein DnaJ; Provisional
Probab=24.66  E-value=1.9e+02  Score=18.17  Aligned_cols=39  Identities=26%  Similarity=0.110  Sum_probs=26.6

Q ss_pred             HHHHHHHHHhCCCCCH------HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907           8 RKVWDQVKAQNLDLKL------WEIGKIIGQMWRDLPEDQKT-EYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~------~ei~k~l~~~W~~l~~~eK~-~y~   46 (56)
                      +.+|...+.-||+.+.      .+.-+.|.+.+..|++.+|. .|.
T Consensus        23 ~ayr~la~~~HPD~~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD   68 (369)
T PRK14282         23 RAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKRAMYD   68 (369)
T ss_pred             HHHHHHHHHHCCCCCccchhHHHHHHHHHHHHHHHhcChhhHHHHh
Confidence            4566667777898753      34556788889888877654 444


No 179
>cd08780 Death_TRADD Death Domain of Tumor Necrosis Factor Receptor 1-Associated Death Domain protein. Death domain (DD) of TRADD (TNF Receptor 1-Associated Death Domain or TNFRSF1A-associated via death domain) protein. TRADD is a central signaling adaptor for TNF-receptor 1 (TNFR1), mediating activation of Nuclear Factor -kappaB (NF-kB) and c-Jun N-terminal kinase (JNK), as well as caspase-dependent apoptosis. It also carries important immunological roles including germinal center formation, DR3-mediated T-cell stimulation, and TNFalpha-mediated inflammatory responses. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into s
Probab=24.60  E-value=44  Score=17.15  Aligned_cols=17  Identities=18%  Similarity=0.503  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHHhcCCCH
Q psy5907          23 LWEIGKIIGQMWRDLPE   39 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~   39 (56)
                      +..++.-+|..|+.+..
T Consensus         5 ~q~~~~nvGr~WK~laR   21 (90)
T cd08780           5 QQHFAKSVGKKWKPVGR   21 (90)
T ss_pred             HHHHHHHHhHHHHHHHH
Confidence            45566777777765443


No 180
>PRK14301 chaperone protein DnaJ; Provisional
Probab=24.18  E-value=1.9e+02  Score=18.18  Aligned_cols=39  Identities=13%  Similarity=0.034  Sum_probs=26.9

Q ss_pred             HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHH-HHHH
Q psy5907           8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQK-TEYV   46 (56)
Q Consensus         8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK-~~y~   46 (56)
                      +.+|...+.-||+.+.     .+.-+.|.+.|..|++..| ..|.
T Consensus        23 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD   67 (373)
T PRK14301         23 KAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDAEKRARYD   67 (373)
T ss_pred             HHHHHHHHHhCCCcCCCChHHHHHHHHHHHHHHHhcchhhhhhhh
Confidence            4566677777888753     3455688899998887664 4444


No 181
>PF12339 DNAJ_related:  DNA-J related protein ;  InterPro: IPR021059  This domain family is approximately 130 amino acids in length and contains a conserved YYLD sequence motif. The proteins have a C-terminal DNA-J domain PF00226 from PFAM and most of the sequences are annotated as DNA-J related proteins, other annotations include: DnaJ-class molecular chaperon and formate dehydrogenase; but there is currently no publications to support these annotations. 
Probab=24.15  E-value=86  Score=17.05  Aligned_cols=18  Identities=11%  Similarity=0.499  Sum_probs=15.3

Q ss_pred             CCCHHHHHHHHHHHhcCC
Q psy5907          20 DLKLWEIGKIIGQMWRDL   37 (56)
Q Consensus        20 ~~~~~ei~k~l~~~W~~l   37 (56)
                      +.+..+|.+.+.+.|+.+
T Consensus       114 ~t~~~~V~~LL~~FW~r~  131 (132)
T PF12339_consen  114 ETSEAEVERLLNSFWQRY  131 (132)
T ss_pred             hcCHHHHHHHHHHHHHhc
Confidence            567899999999999754


No 182
>PTZ00211 ribonucleoside-diphosphate reductase small subunit; Provisional
Probab=24.13  E-value=79  Score=19.47  Aligned_cols=17  Identities=12%  Similarity=0.292  Sum_probs=14.1

Q ss_pred             HHhcCCCHHHHHHHHHH
Q psy5907          32 QMWRDLPEDQKTEYVED   48 (56)
Q Consensus        32 ~~W~~l~~~eK~~y~~~   48 (56)
                      ..|+.||+++|..|...
T Consensus        56 ~dw~~Lt~~Er~~~~~~   72 (330)
T PTZ00211         56 KDWEKLNDGERHFIKHV   72 (330)
T ss_pred             HHHHhCCHHHHHHHHHH
Confidence            34999999999988754


No 183
>PF10407 Cytokin_check_N:  Cdc14 phosphatase binding protein N-terminus   ;  InterPro: IPR018844  Cytokinesis in yeasts involves a family of proteins whose essential function is to bind Cdc14-family phosphatase and prevent this from being sequestered and inhibited in the nucleolus. This is the highly conserved N terminus of a family of proteins which act as cytokinesis checkpoint controls by allowing cells to cope with cytokinesis defects. These proteins are required for rDNA silencing and mini-chromosome maintenance []. 
Probab=24.13  E-value=87  Score=15.33  Aligned_cols=23  Identities=13%  Similarity=0.203  Sum_probs=18.7

Q ss_pred             CCCCHHHHHHHHHHHhcCCCHHH
Q psy5907          19 LDLKLWEIGKIIGQMWRDLPEDQ   41 (56)
Q Consensus        19 p~~~~~ei~k~l~~~W~~l~~~e   41 (56)
                      |+.+..++...|.++|+.|=+.+
T Consensus        11 ~~~tl~~L~~eI~~~f~kLYP~~   33 (73)
T PF10407_consen   11 PNNTLSQLKEEIEERFKKLYPNE   33 (73)
T ss_pred             CCCcHHHHHHHHHHHHHHHCCCC
Confidence            78889999999999998765544


No 184
>TIGR02269 Myxococcus xanthus paralogous lipoprotein family TIGR02269. This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown.
Probab=23.55  E-value=1.7e+02  Score=17.29  Aligned_cols=24  Identities=17%  Similarity=0.242  Sum_probs=18.3

Q ss_pred             HHHHHHHHhCCCCCHHHHHHHHHH
Q psy5907           9 KVWDQVKAQNLDLKLWEIGKIIGQ   32 (56)
Q Consensus         9 ~~~~~~~~~~p~~~~~ei~k~l~~   32 (56)
                      +.+..+...||+.+..+|-+-.++
T Consensus       174 ~aWr~f~~~~~~as~e~i~~hage  197 (211)
T TIGR02269       174 QAWRDFARRYPGASPEEIWRHAGE  197 (211)
T ss_pred             HHHHHHHHhCCCCCHHHHHHHHHH
Confidence            345567778999999999877665


No 185
>PF07055 Eno-Rase_FAD_bd:  Enoyl reductase FAD binding domain;  InterPro: IPR010758 This family contains a number of bacterial putative reductases. ; GO: 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 3S8M_A 3ZU5_A 3ZU3_A 3ZU4_A 3ZU2_A.
Probab=23.44  E-value=62  Score=15.55  Aligned_cols=19  Identities=5%  Similarity=0.427  Sum_probs=14.0

Q ss_pred             HHHHHHHHHHHhcCCCHHH
Q psy5907          23 LWEIGKIIGQMWRDLPEDQ   41 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~~e   41 (56)
                      ..+|...+.+.|..++.+-
T Consensus        16 ~~dvQ~~V~~lw~~it~en   34 (65)
T PF07055_consen   16 RPDVQAEVAELWEQITTEN   34 (65)
T ss_dssp             SHHHHHHHHHHHCCT-CCC
T ss_pred             CHHHHHHHHHHHHHhcccc
Confidence            3577888999999888653


No 186
>PF10077 DUF2314:  Uncharacterized protein conserved in bacteria (DUF2314);  InterPro: IPR018756  This domain of unkown function is found in various bacterial hypothetical proteins, as well as putative ankyrin repeat proteins. 
Probab=23.31  E-value=1.4e+02  Score=16.09  Aligned_cols=20  Identities=25%  Similarity=0.337  Sum_probs=15.1

Q ss_pred             HHHHHhcCCCHHHHHHHHHH
Q psy5907          29 IIGQMWRDLPEDQKTEYVED   48 (56)
Q Consensus        29 ~l~~~W~~l~~~eK~~y~~~   48 (56)
                      .+--+.+.||++|+..|.+.
T Consensus       109 Ti~~~~~~m~~~e~~~~d~~  128 (133)
T PF10077_consen  109 TIRAMRSRMSEEERDEHDEA  128 (133)
T ss_pred             HHHHHHhhCCHHHHHHHHHH
Confidence            34556788999999888764


No 187
>cd08803 Death_ank3 Death domain of Ankyrin-3. Death Domain (DD) of the human protein ankyrin-3 (ANK-3) and related proteins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. ANK-3, also called anykyrin-G (for general or giant), is found in neurons and at least one splice variant has been shown to be essential for propagation of action potentials as a binding partner to neurofascin and voltage-gated sodium channels. It is required for maintaining axo-dendritic polarity, and may be a genetic risk factor associated with bipolar disorder. ANK-3 may also play roles in other cell types. Mutations affecting ANK-3 pathways for Na channel localization are associated with Brugada syndrome, a potentially fata arrythmia. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by se
Probab=22.81  E-value=31  Score=17.17  Aligned_cols=24  Identities=13%  Similarity=0.386  Sum_probs=14.4

Q ss_pred             CHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907          22 KLWEIGKIIGQMWRDLPEDQKTEY   45 (56)
Q Consensus        22 ~~~ei~k~l~~~W~~l~~~eK~~y   45 (56)
                      ....|+..||.-|..|..+-.-++
T Consensus         6 ~l~~ia~~LG~dW~~LA~eLg~s~   29 (84)
T cd08803           6 RMAIVADHLGLSWTELARELNFSV   29 (84)
T ss_pred             HHHHHHHHhhccHHHHHHHcCCCH
Confidence            345677777777776655443333


No 188
>cd07765 KRAB_A-box KRAB (Kruppel-associated box) domain -A box. The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through 
Probab=22.62  E-value=51  Score=10.95  Aligned_cols=18  Identities=22%  Similarity=0.387  Sum_probs=13.4

Q ss_pred             HHhcCCCHHHHHHHHHHH
Q psy5907          32 QMWRDLPEDQKTEYVEDY   49 (56)
Q Consensus        32 ~~W~~l~~~eK~~y~~~a   49 (56)
                      +.|..+...++..|.+..
T Consensus        13 ~~~~~~~~~~~~~~~~~~   30 (40)
T cd07765          13 EEWELLDPAQRDLYRDVM   30 (40)
T ss_pred             HHHhcCCHHHHHHHHHHH
Confidence            578888888887776543


No 189
>PRK08733 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=22.07  E-value=1.8e+02  Score=17.57  Aligned_cols=26  Identities=8%  Similarity=-0.039  Sum_probs=19.6

Q ss_pred             HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907          13 QVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        13 ~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      .+..-.|+++..|+.+++.+.|.++.
T Consensus        64 NL~~~fP~~s~~e~~~i~~~~~~~~~   89 (306)
T PRK08733         64 NLKLCFPEQDDAWRARLLRDSFDALG   89 (306)
T ss_pred             HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence            34444699999999998888887544


No 190
>PRK08326 ribonucleotide-diphosphate reductase subunit beta; Validated
Probab=21.93  E-value=95  Score=19.02  Aligned_cols=17  Identities=12%  Similarity=0.384  Sum_probs=14.0

Q ss_pred             HHhcCCCHHHHHHHHHH
Q psy5907          32 QMWRDLPEDQKTEYVED   48 (56)
Q Consensus        32 ~~W~~l~~~eK~~y~~~   48 (56)
                      ..|+.||+.||..|...
T Consensus        51 ~dw~~Lt~~Er~~~~~i   67 (311)
T PRK08326         51 EDWEKLSDEERDYATRL   67 (311)
T ss_pred             HHHHhCCHHHHHHHHHH
Confidence            45999999999988754


No 191
>PF09349 OHCU_decarbox:  OHCU decarboxylase;  InterPro: IPR018020  The proteins in this entry are OHCU decarboxylase, an enzyme of the purine catabolism that catalyses the conversion of OHCU into S(+)-allantoin []; it is the third step of the conversion of uric acid (a purine derivative) to allantoin. Step one is catalysed by urate oxidase (IPR002042 from INTERPRO) and step two is catalysed by hydroxyisourate hydrolase (IPR000895 from INTERPRO). ; PDB: 3O7I_B 3O7H_B 3O7J_A 3O7K_A 2Q37_A 2O70_B 2O73_C 2O74_C 2O8I_A.
Probab=21.75  E-value=1.6e+02  Score=16.22  Aligned_cols=28  Identities=18%  Similarity=0.049  Sum_probs=22.9

Q ss_pred             CCCCCHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907          18 NLDLKLWEIGKIIGQMWRDLPEDQKTEY   45 (56)
Q Consensus        18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y   45 (56)
                      .|-.+..++...+...|..++++++.-.
T Consensus        31 rPf~s~~~L~~a~~~~~~~~~~~~~~~~   58 (159)
T PF09349_consen   31 RPFASVDALIAAADEAVRSLSEEDKLEA   58 (159)
T ss_dssp             GS-SSHHHHHHHHHHHHHCS-HHHHHHH
T ss_pred             CCCCCHHHHHHHHHHHHHhCCHHHHHHH
Confidence            5889999999999999999999887653


No 192
>PF11953 DUF3470:  Domain of unknown function (DUF3470);  InterPro: IPR022569  This functionally uncharacterised domain is found in bacteria and is about 50 amino acids in length. It is found C-terminal to PF00037 from PFAM. It contains a single completely conserved residue N that may be functionally important. ; PDB: 1FRX_A 1GAO_A 1FRL_A 1AXQ_A 1FTC_B 1FDA_A 1A6L_A 1FDD_A 1F5B_A 1F5C_A ....
Probab=21.50  E-value=37  Score=14.87  Aligned_cols=19  Identities=16%  Similarity=0.534  Sum_probs=11.6

Q ss_pred             CHHHHHHHHHHHhcCCCHH
Q psy5907          22 KLWEIGKIIGQMWRDLPED   40 (56)
Q Consensus        22 ~~~ei~k~l~~~W~~l~~~   40 (56)
                      .+.+|...++..|-+++..
T Consensus         3 ~f~elN~ela~~WP~It~k   21 (43)
T PF11953_consen    3 HFIELNAELAKKWPNITEK   21 (43)
T ss_dssp             HHHHHHHHHHTTS-BE-S-
T ss_pred             HHHHHHHHHHHhCCCcCcc
Confidence            3567778888888776654


No 193
>cd00183 TFIIS_I N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a domain found in elongin A and CRSP70; likely to be involved in transcription; domain I from TFIIS interacts with RNA polymerase II holoenzyme
Probab=21.19  E-value=65  Score=15.45  Aligned_cols=21  Identities=19%  Similarity=0.495  Sum_probs=14.4

Q ss_pred             hCCCCCHHHHHHHHHHHhcCC
Q psy5907          17 QNLDLKLWEIGKIIGQMWRDL   37 (56)
Q Consensus        17 ~~p~~~~~ei~k~l~~~W~~l   37 (56)
                      .|++-....+++.|-..|+.+
T Consensus        54 kh~~~~i~~~A~~Lv~~Wk~~   74 (76)
T cd00183          54 KHSNEKIRKLAKALIKSWKKL   74 (76)
T ss_pred             cCCcHHHHHHHHHHHHHHHHh
Confidence            355555677778888888753


No 194
>PRK06860 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=21.10  E-value=2e+02  Score=17.39  Aligned_cols=27  Identities=7%  Similarity=0.031  Sum_probs=20.6

Q ss_pred             HHHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907          12 DQVKAQNLDLKLWEIGKIIGQMWRDLP   38 (56)
Q Consensus        12 ~~~~~~~p~~~~~ei~k~l~~~W~~l~   38 (56)
                      ..+..-.|+++..|+.+++.+.|.++.
T Consensus        63 ~NL~~~fPe~s~~e~~~i~~~~~~~~g   89 (309)
T PRK06860         63 RNLELCFPEMSEQEREAIVVKNFESVG   89 (309)
T ss_pred             HHHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence            344455699999999998888887654


No 195
>PF04031 Las1:  Las1-like ;  InterPro: IPR007174 Las1 is an essential nuclear protein involved in cell morphogenesis and cell surface growth [].
Probab=21.10  E-value=98  Score=17.09  Aligned_cols=29  Identities=14%  Similarity=0.206  Sum_probs=13.2

Q ss_pred             hHHhHHHHHHHHHHHhCCCCCHHHHHHHH
Q psy5907           2 PYMRYSRKVWDQVKAQNLDLKLWEIGKII   30 (56)
Q Consensus         2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l   30 (56)
                      |++.|....-+.........+...+++.+
T Consensus        83 aivRfVNgl~D~~Q~~~~a~si~~~A~~i  111 (154)
T PF04031_consen   83 AIVRFVNGLVDPSQQGKYARSIASLAKEI  111 (154)
T ss_pred             hHHHHHHHhhhHhhccchhhhHHHHHHHc
Confidence            45556555555443323333444444443


No 196
>PRK09101 nrdB ribonucleotide-diphosphate reductase subunit beta; Reviewed
Probab=20.93  E-value=1e+02  Score=19.49  Aligned_cols=16  Identities=25%  Similarity=0.634  Sum_probs=13.2

Q ss_pred             HhcCCCHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVED   48 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~   48 (56)
                      .|..||+++|..|...
T Consensus        62 dw~~Lt~~Er~~~~~~   77 (376)
T PRK09101         62 DYQALPEHEKHIFISN   77 (376)
T ss_pred             HHHhCCHHHHHHHHHH
Confidence            4999999999988643


No 197
>cd08777 Death_RIP1 Death Domain of Receptor-Interacting Protein 1. Death domain (DD) found in Receptor-Interacting Protein 1 (RIP1) and related proteins. RIP kinases serve as essential sensors of cellular stress. Vertebrates contain several types containing a homologous N-terminal kinase domain and varying C-terminal domains. RIP1 harbors a C-terminal DD, which binds death receptors (DRs) including TNF receptor 1, Fas, TNF-related apoptosis-inducing ligand receptor 1 (TRAILR1), and TRAILR2. It also interacts with other DD-containing adaptor proteins such as TRADD and FADD. RIP1 plays a crucial role in determining a cell's fate, between survival or death, following exposure to stress signals. It is important in the signaling of NF-kappaB and MAPKs, and it links DR-associated signaling to reactive oxygen species (ROS) production. Abnormal RIP1 function may result in ROS accumulation affecting inflammatory responses, innate immunity, stress responses, and cell survival. In general, DDs ar
Probab=20.89  E-value=47  Score=16.52  Aligned_cols=17  Identities=18%  Similarity=0.423  Sum_probs=12.2

Q ss_pred             HHHHHHHHHHHhcCCCH
Q psy5907          23 LWEIGKIIGQMWRDLPE   39 (56)
Q Consensus        23 ~~ei~k~l~~~W~~l~~   39 (56)
                      ..-|+..+|..|+.+..
T Consensus         5 l~~l~~~lG~~Wk~lar   21 (86)
T cd08777           5 LDLLRENLGKKWKRCAR   21 (86)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            45677888899986653


No 198
>PRK02955 small acid-soluble spore protein SspI; Provisional
Probab=20.83  E-value=1.2e+02  Score=14.71  Aligned_cols=17  Identities=29%  Similarity=0.739  Sum_probs=13.3

Q ss_pred             HHhcCCCHHHHHHHHHH
Q psy5907          32 QMWRDLPEDQKTEYVED   48 (56)
Q Consensus        32 ~~W~~l~~~eK~~y~~~   48 (56)
                      ..|+.+++.+|....+.
T Consensus        46 ~~W~~~~~~ek~~m~~~   62 (68)
T PRK02955         46 VIWKNADENEKDEMLET   62 (68)
T ss_pred             HHHHhcCHHHHHHHHHH
Confidence            35999999999876543


No 199
>PF10562 CaM_bdg_C0:  Calmodulin-binding domain C0 of NMDA receptor NR1 subunit;  InterPro: IPR018882  This is a very short highly conserved domain that is C-terminal to the cytosolic transmembrane region IV of the NMDA-receptor 1. It has been shown to bind Calmodulin-Calcium with high affinity. The ionotropic N-methyl-D-aspartate receptor (NMDAR) is a major source of calcium flux into neurons in the brain and plays a critical role in learning, memory, neural development, and synaptic plasticity. Calmodulin (CaM) regulates NMDARs by binding tightly to the C0 and C1 regions of their NR1 subunit. The conserved tryptophan is considered to be the anchor residue []. 
Probab=20.63  E-value=85  Score=12.69  Aligned_cols=14  Identities=21%  Similarity=0.556  Sum_probs=10.8

Q ss_pred             HHHHHHHHHHHhcC
Q psy5907          23 LWEIGKIIGQMWRD   36 (56)
Q Consensus        23 ~~ei~k~l~~~W~~   36 (56)
                      --|+++...+.|+.
T Consensus        13 ~~elAr~a~dkWR~   26 (29)
T PF10562_consen   13 QLELARHAADKWRG   26 (29)
T ss_pred             HHHHHHHHHHHHHh
Confidence            35788888889974


No 200
>cd03577 NTR_TIMP_like NTR domain, TIMP-like subfamily; TIMPs, or tissue inibitors of metalloproteases, are essential regulators of extracellular matrix turnover and remodeling. They form complexes with matrix metalloproteases (MMPs) and inactivate them irreversibly by non-covalently binding their active zinc-binding sites. This group contains domains similar to the TIMP NTR domain, which binds MMPs. Members of this group may or may not function as MMP inhibitors.
Probab=20.57  E-value=41  Score=17.17  Aligned_cols=17  Identities=35%  Similarity=0.597  Sum_probs=12.2

Q ss_pred             HhcCCCHHHHHHHHHHH
Q psy5907          33 MWRDLPEDQKTEYVEDY   49 (56)
Q Consensus        33 ~W~~l~~~eK~~y~~~a   49 (56)
                      .|+.|+.++|.-....+
T Consensus        97 ~W~~ls~~qkr~l~~~y  113 (116)
T cd03577          97 PWDDLTKEQKRGLKGLY  113 (116)
T ss_pred             EhHHCCHHHHhhhhhcc
Confidence            58888888887665544


No 201
>cd00155 RasGEF Guanine nucleotide exchange factor for Ras-like small GTPases. Small GTP-binding proteins of the Ras superfamily function as molecular switches in fundamental events such as signal transduction, cytoskeleton dynamics and intracellular trafficking. Guanine-nucleotide-exchange factors (GEFs) positively regulate these GTP-binding proteins in response to a variety of signals. GEFs catalyze the dissociation of GDP from the inactive GTP-binding proteins. GTP can then bind and induce structural changes that allow interaction with effectors.
Probab=20.57  E-value=1.4e+02  Score=17.00  Aligned_cols=22  Identities=23%  Similarity=0.299  Sum_probs=17.4

Q ss_pred             HHHHHhcCCCHHHHHHHHHHHH
Q psy5907          29 IIGQMWRDLPEDQKTEYVEDYE   50 (56)
Q Consensus        29 ~l~~~W~~l~~~eK~~y~~~a~   50 (56)
                      .+...|..++++.+..|.++..
T Consensus       111 RL~~tw~~l~~~~~~~~~~l~~  132 (237)
T cd00155         111 RLKKTWEVLSSKLKKLFEELEE  132 (237)
T ss_pred             hHHHHHHHCCHHHHHHHHHHHH
Confidence            3577899999999988876654


No 202
>TIGR02147 Fsuc_second hypothetical protein, TIGR02147. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
Probab=20.51  E-value=1.9e+02  Score=17.61  Aligned_cols=27  Identities=15%  Similarity=0.294  Sum_probs=15.8

Q ss_pred             hHHHHHHHHHHHhCCCCCHHHHHHHHH
Q psy5907           5 RYSRKVWDQVKAQNLDLKLWEIGKIIG   31 (56)
Q Consensus         5 lf~~~~~~~~~~~~p~~~~~ei~k~l~   31 (56)
                      .|.++....-+..+|..+..++++.+|
T Consensus        12 ~fl~d~ye~rk~~~p~fS~R~fa~~~G   38 (271)
T TIGR02147        12 KYLRDYYEERKKTDPAFSWRFFAEKAG   38 (271)
T ss_pred             HHHHHHHHHHhccCcCcCHHHHHHHhC
Confidence            455565555555566666666655555


No 203
>PF04545 Sigma70_r4:  Sigma-70, region 4;  InterPro: IPR007630 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes.  With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ].  Region 4 of sigma-70 like sigma-factors is involved in binding to the -35 promoter element via a helix-turn-helix motif []. Due to the way Pfam works, the threshold has been set artificially high to prevent overlaps with other helix-turn-helix families. Therefore there are many false negatives.; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 2P7V_B 3IYD_F 1TLH_B 1KU7_A 1RIO_H 3N97_A 1KU3_A 1RP3_C 1SC5_A 1NR3_A ....
Probab=20.45  E-value=91  Score=13.27  Aligned_cols=14  Identities=36%  Similarity=0.342  Sum_probs=10.1

Q ss_pred             CCCCCHHHHHHHHH
Q psy5907          18 NLDLKLWEIGKIIG   31 (56)
Q Consensus        18 ~p~~~~~ei~k~l~   31 (56)
                      ..+++..||+..+|
T Consensus        18 ~~~~t~~eIa~~lg   31 (50)
T PF04545_consen   18 FEGLTLEEIAERLG   31 (50)
T ss_dssp             TST-SHHHHHHHHT
T ss_pred             cCCCCHHHHHHHHC
Confidence            36788888888776


Done!