Query         037321
Match_columns 75
No_of_seqs    106 out of 621
Neff          7.3 
Searched_HMMs 46136
Date          Fri Mar 29 11:02:34 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/037321.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/037321hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PLN02150 terpene synthase/cycl 100.0 4.5E-33 9.8E-38  166.3   8.5   73    1-75     22-95  (96)
  2 cd00684 Terpene_cyclase_plant_  99.9   5E-22 1.1E-26  146.0   8.6   71    1-73    471-542 (542)
  3 PLN02279 ent-kaur-16-ene synth  99.8 4.2E-21 9.2E-26  146.0   7.7   71    1-75    705-777 (784)
  4 PLN02592 ent-copalyl diphospha  99.1 5.7E-10 1.2E-14   85.9   7.5   67    1-75    732-800 (800)
  5 cd00868 Terpene_cyclase_C1 Ter  96.8  0.0052 1.1E-07   41.2   5.7   47    1-48    237-284 (284)
  6 cd00385 Isoprenoid_Biosyn_C1 I  89.9    0.33 7.2E-06   30.6   2.6   41    1-42    203-243 (243)
  7 PF14805 THDPS_N_2:  Tetrahydro  73.2      14  0.0003   20.7   4.6   35    5-42      1-35  (70)
  8 PF10540 Membr_traf_MHD:  Munc1  63.4      33 0.00072   21.2   5.4   51    5-55     25-106 (137)
  9 PF09504 RE_Bsp6I:  Bsp6I restr  61.5      23  0.0005   23.3   4.3   41   13-56     24-68  (180)
 10 PF06086 Pox_A30L_A26L:  Orthop  59.3      21 0.00045   24.4   3.9   46    4-53     23-68  (220)
 11 COG4046 Uncharacterized protei  58.5      34 0.00074   24.9   5.1   53    4-57    101-154 (368)
 12 PF07431 DUF1512:  Protein of u  55.7      40 0.00087   24.6   5.1   53    4-57     88-141 (355)
 13 PF08285 DPM3:  Dolichol-phosph  54.4      16 0.00035   21.4   2.5   20    1-20     67-86  (91)
 14 PF07637 PSD5:  Protein of unkn  52.8      36 0.00078   18.3   5.3   56    1-73      1-56  (64)
 15 PF01484 Col_cuticle_N:  Nemato  47.7      15 0.00033   18.5   1.5   14    5-18     40-53  (53)
 16 cd05026 S-100Z S-100Z: S-100Z   46.4      49  0.0011   18.8   3.7   37   32-69      4-41  (93)
 17 PF11707 Npa1:  Ribosome 60S bi  42.9      82  0.0018   22.1   5.1   58    5-71    173-233 (330)
 18 PF11989 Dsl1_C:  Retrograde tr  40.7      25 0.00054   24.8   2.2   31   40-72    241-271 (291)
 19 cd04470 S1_EF-P_repeat_1 S1_EF  40.6     3.4 7.3E-05   22.3  -1.8   15   42-56      3-17  (61)
 20 PF11613 UCN2:  Agonist of cort  40.2      18 0.00038   18.0   1.0   17   29-45      7-23  (38)
 21 PF02269 TFIID-18kDa:  Transcri  38.6      26 0.00057   20.4   1.8   26   44-71      8-33  (93)
 22 PRK10626 hypothetical protein;  38.1      25 0.00054   24.3   1.8   57    4-60     88-148 (239)
 23 PF04659 Arch_fla_DE:  Archaeal  38.0      46   0.001   19.8   2.8   42   29-71     50-91  (99)
 24 PF11802 CENP-K:  Centromere-as  37.9      46   0.001   23.4   3.1   20    2-21     16-35  (268)
 25 PHA02676 A-type inclusion prot  37.3      35 0.00076   26.1   2.6   48    2-53      6-53  (520)
 26 cd07018 S49_SppA_67K_type Sign  36.3 1.1E+02  0.0023   20.2   4.6   38    1-39    155-192 (222)
 27 PF13798 PCYCGC:  Protein of un  36.0      29 0.00064   22.5   1.8   23    4-30    134-156 (158)
 28 PF10835 DUF2573:  Protein of u  35.5      66  0.0014   18.6   3.0   21    2-23     57-77  (82)
 29 cd01389 MATA_HMG-box MATA_HMG-  35.5      27 0.00058   19.1   1.4   17    5-21     30-46  (77)
 30 PF12357 PLD_C:  Phospholipase   34.4      52  0.0011   18.7   2.4   22    2-23     12-33  (74)
 31 PF05341 DUF708:  Protein of un  34.2      55  0.0012   19.9   2.7   35   12-49     20-54  (105)
 32 PF12910 RelB_N:  Antitoxin of   33.9      37  0.0008   17.1   1.7   16    2-17      8-23  (46)
 33 PF01132 EFP:  Elongation facto  32.6     2.7 5.9E-05   22.2  -2.9   16   42-57      2-17  (55)
 34 PF10305 Fmp27_SW:  RNA pol II   30.9      57  0.0012   19.3   2.4   19    2-20      1-19  (103)
 35 PF14394 DUF4423:  Domain of un  30.8      63  0.0014   20.8   2.7   24    1-24    131-154 (171)
 36 PF01023 S_100:  S-100/ICaBP ty  30.5      76  0.0016   15.9   2.5   18   50-68     19-36  (44)
 37 PF13864 Enkurin:  Calmodulin-b  30.4      83  0.0018   18.2   3.0   23    2-24     40-62  (98)
 38 TIGR00965 dapD 2,3,4,5-tetrahy  30.1 1.3E+02  0.0028   21.1   4.3   31    6-39      1-31  (269)
 39 TIGR00706 SppA_dom signal pept  30.1 1.5E+02  0.0033   19.2   4.5   37    1-38    137-173 (207)
 40 cd07978 TAF13 The TATA Binding  30.0      61  0.0013   18.9   2.3   26   44-71      9-34  (92)
 41 PF14799 FAM195:  FAM195 family  29.6      86  0.0019   18.6   3.0   21    3-23     33-53  (99)
 42 PF12807 eIF3_p135:  Translatio  29.4 1.4E+02   0.003   18.9   4.1   19    7-25     46-64  (169)
 43 cd01388 SOX-TCF_HMG-box SOX-TC  29.3      40 0.00087   18.1   1.4   17    5-21     30-46  (72)
 44 cd07019 S49_SppA_1 Signal pept  29.0 1.2E+02  0.0025   19.8   3.8   35    2-37    146-180 (211)
 45 PF07553 Lipoprotein_Ltp:  Host  28.8      32  0.0007   17.8   0.9   12   45-56     27-38  (48)
 46 PF08494 DEAD_assoc:  DEAD/H as  28.8 1.7E+02  0.0036   18.9   5.7   42   12-57     80-121 (187)
 47 PF09011 HMG_box_2:  HMG-box do  27.8      68  0.0015   17.2   2.2   17    5-21     33-49  (73)
 48 cd01390 HMGB-UBF_HMG-box HMGB-  27.8      52  0.0011   16.8   1.7   18    4-21     28-45  (66)
 49 PF01713 Smr:  Smr domain;  Int  27.6      59  0.0013   17.9   1.9   22    1-22      9-30  (83)
 50 PRK11830 dapD 2,3,4,5-tetrahyd  26.9   2E+02  0.0044   20.1   4.8   31    6-39      3-33  (272)
 51 PF13904 DUF4207:  Domain of un  26.9      17 0.00038   25.0  -0.4   14   62-75    250-263 (264)
 52 cd00687 Terpene_cyclase_nonpla  26.8      75  0.0016   21.5   2.7   22    1-22    244-265 (303)
 53 PHA02718 hypothetical protein;  26.7      21 0.00045   19.4  -0.0   28   41-72     28-55  (69)
 54 KOG0027 Calmodulin and related  26.7 1.2E+02  0.0026   18.5   3.4   35   37-73      7-41  (151)
 55 smart00463 SMR Small MutS-rela  26.5      66  0.0014   17.5   2.0   19    1-19     12-30  (80)
 56 PF14555 UBA_4:  UBA-like domai  26.3      44 0.00095   16.4   1.1   18   36-53     25-42  (43)
 57 COG3351 FlaD Putative archaeal  26.1      63  0.0014   22.0   2.1   41   29-71    140-180 (214)
 58 PHA02896 A-type inclusion like  25.8      81  0.0018   24.5   2.9   39    2-45     10-49  (616)
 59 PF15581 Imm35:  Immunity prote  25.2      37 0.00081   20.1   0.8   17    5-21     33-49  (93)
 60 PRK01371 sec-independent trans  24.9 1.2E+02  0.0027   19.1   3.2   20    3-22     35-54  (137)
 61 KOG1766 Enhancer of rudimentar  24.8 1.7E+02  0.0036   17.6   4.1   54    2-57     27-81  (104)
 62 smart00398 HMG high mobility g  24.6      56  0.0012   16.7   1.4   17    5-21     30-46  (70)
 63 smart00872 Alpha-mann_mid Alph  24.5 1.1E+02  0.0024   16.7   2.7   23   29-55     42-64  (79)
 64 PF09184 PPP4R2:  PPP4R2;  Inte  24.5 1.4E+02  0.0031   20.9   3.8   19    5-23     42-60  (288)
 65 KOG2204 Mannosyl-oligosacchari  24.4      69  0.0015   25.1   2.3   22    2-23    174-195 (625)
 66 PF05002 SGS:  SGS domain ;  In  23.9 1.2E+02  0.0025   17.5   2.7   25   40-71     26-50  (82)
 67 PF02512 UK:  Virulence determi  23.7      44 0.00096   19.5   0.9   21    9-34     50-70  (96)
 68 PF03564 DUF1759:  Protein of u  23.5 1.2E+02  0.0025   18.3   2.9   14   12-25     57-70  (145)
 69 PHA02935 Hypothetical protein;  23.5 1.1E+02  0.0024   21.2   3.0   49    8-57    256-307 (349)
 70 cd00084 HMG-box High Mobility   23.4      72  0.0016   16.1   1.7   16    5-20     29-44  (66)
 71 PF14684 Tricorn_C1:  Tricorn p  23.4 1.2E+02  0.0026   16.4   2.7   20    5-24      2-21  (70)
 72 PF00505 HMG_box:  HMG (high mo  23.4      68  0.0015   16.6   1.6   18    4-21     28-45  (69)
 73 PTZ00195 60S ribosomal protein  21.8      82  0.0018   21.2   2.0   37   14-57     44-80  (198)
 74 PF08153 NGP1NT:  NGP1NT (NUC09  21.7      38 0.00083   21.3   0.4   35    2-37     47-81  (130)
 75 cd07023 S49_Sppa_N_C Signal pe  21.6   2E+02  0.0044   18.5   3.9   38    1-39    142-179 (208)
 76 PF12763 EF-hand_4:  Cytoskelet  21.5      89  0.0019   18.5   2.0   20   21-41     33-52  (104)
 77 cd05027 S-100B S-100B: S-100B   21.1 1.7E+02  0.0038   16.5   3.4   32   36-68      6-38  (88)
 78 PF10859 DUF2660:  Protein of u  20.7      99  0.0021   17.9   2.0   13    7-19     40-52  (86)
 79 PRK14857 tatA twin arginine tr  20.5 1.9E+02  0.0042   16.9   3.2   22    3-24     37-58  (90)
 80 PF00036 EF-hand_1:  EF hand;    20.4 1.1E+02  0.0023   13.7   1.9   26   42-69      4-29  (29)
 81 cd07022 S49_Sppa_36K_type Sign  20.4 2.6E+02  0.0056   18.2   4.3   31    1-32    149-179 (214)
 82 PHA03369 capsid maturational p  20.3   2E+02  0.0043   22.8   4.0   23   31-53    299-321 (663)
 83 COG2900 SlyX Uncharacterized p  20.3 1.6E+02  0.0034   16.7   2.7   21    1-21     21-41  (72)

No 1  
>PLN02150 terpene synthase/cyclase family protein
Probab=100.00  E-value=4.5e-33  Score=166.29  Aligned_cols=73  Identities=21%  Similarity=0.305  Sum_probs=70.3

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHh-hccCCCCCCCchhhHHHHHHhhccccCC
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCM-YQFGEGHGHGVQEITKARVLSLIVDPIA   75 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~-Y~~~Dg~t~~~~~~~k~~I~sll~~Pi~   75 (75)
                      |||+++|++||+++||+||+|++++++ +|++++++++|+||+++++ |++|||||.+++. +|++|.+||++||+
T Consensus        22 eeA~~~i~~li~~~WK~iN~e~l~~~~-~p~~~~~~~~NlaR~~~~~~Y~~~Dg~t~~~~~-~K~~I~sLlv~pi~   95 (96)
T PLN02150         22 EEAVSELKKMIRDNYKIVMEEFLTIKD-VPRPVLVRCLNLARLIDVYCYNEGDGFTYPHGK-LKDLITSLFFHPLP   95 (96)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHcCCCC-CCHHHHHHHHHHHHHHHhheecCCCCCCCCcHH-HHHHHHHHhccCCC
Confidence            689999999999999999999999888 9999999999999999999 9999999988777 99999999999986


No 2  
>cd00684 Terpene_cyclase_plant_C1 Plant Terpene Cyclases, Class 1. This CD includes a diverse group of monomeric plant terpene cyclases (Tspa-Tspf) that convert the acyclic isoprenoid diphosphates, geranyl diphosphate (GPP), farnesyl diphosphate (FPP), or geranylgeranyl diphosphate (GGPP) into cyclic monoterpenes, diterpenes, or sesquiterpenes, respectively; a few form acyclic species. Terpnoid cyclases are soluble enzymes localized to the cytosol (sesquiterpene synthases) or plastids (mono- and diterpene synthases). All monoterpene and diterpene synthases have restrict substrate specificity, however, some sesquiterpene synthases can accept both FPP and GPP. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl diphosphates, via bridging Mg2+ ions (K+ preferred by gymnosperm cyclases), inducing conformational changes such that an N-terminal regi
Probab=99.87  E-value=5e-22  Score=146.02  Aligned_cols=71  Identities=31%  Similarity=0.535  Sum_probs=68.1

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcC-CCCCcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcccc
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVN-PHFVSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDP   73 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~-~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~P   73 (75)
                      |||+++++++|+++||+||++++++ ++ +|++|+++++|+||+++++|+++||||.|++. +|++|++||++|
T Consensus       471 eeA~~~i~~~ie~~wk~ln~e~l~~~~~-~p~~~~~~~~n~~r~~~~~Y~~~D~~t~~~~~-~~~~i~~ll~~p  542 (542)
T cd00684         471 EEAREEIKKMIEDAWKELNEEFLKPSSD-VPRPIKQRFLNLARVIDVFYKEGDGFTHPEGE-IKDHITSLLFEP  542 (542)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCHHHHHHHHHHHHHHHHHhcCCCCCCCccHH-HHHHHHHHhcCC
Confidence            6899999999999999999999987 77 99999999999999999999999999999777 999999999998


No 3  
>PLN02279 ent-kaur-16-ene synthase
Probab=99.84  E-value=4.2e-21  Score=146.01  Aligned_cols=71  Identities=18%  Similarity=0.292  Sum_probs=66.5

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcC--CCCCcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhccccCC
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVN--PHFVSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDPIA   75 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~--~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~Pi~   75 (75)
                      |||+++++++|+++||+||++++++  +. +|++|+++++|++|++++||++|||||.+  . +|++|++||++||+
T Consensus       705 EEAi~~i~~~Ie~~wKeLn~~~l~~~~~~-vp~~~~~~~ln~aR~~~~~Y~~~Dgyt~~--~-~k~~i~~ll~ePi~  777 (784)
T PLN02279        705 EEAIESMKGLIESQRRELLRLVLQEKGSN-VPRECKDLFWKMSKVLHLFYRKDDGFTSN--D-MMSLVKSVIYEPVS  777 (784)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhccCCCC-CCHHHHHHHHHHHHhhhhheeCCCCCChH--H-HHHHHHHHhccCCc
Confidence            6899999999999999999999963  46 99999999999999999999999999963  6 99999999999986


No 4  
>PLN02592 ent-copalyl diphosphate synthase
Probab=99.06  E-value=5.7e-10  Score=85.86  Aligned_cols=67  Identities=9%  Similarity=0.124  Sum_probs=59.5

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhc--CCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhccccCC
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFV--NPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDPIA   75 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~--~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~Pi~   75 (75)
                      +||.+++++.|+..+++|.+.+++  ++. +|++|++.+++++|   +||..  ||+.| ++ ++++|..++++|++
T Consensus       732 ~~a~~~~~~~ie~~~~eL~~lvl~~~~~~-vp~~cK~~f~~~~k---~fy~~--~~~~~-~~-~~~~i~~vl~epv~  800 (800)
T PLN02592        732 PEEKSKTTPSIESDMQELVQLVLQNSSDD-IDPVIKQTFLMVAK---SFYYA--AYCDP-GT-INYHIAKVLFERVA  800 (800)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhcCCCC-CCHHHHHHHHHHHH---HHHHh--hcCCH-HH-HHHHHHHHhCCCCC
Confidence            379999999999999999999996  245 99999999999999   55655  99998 56 99999999999985


No 5  
>cd00868 Terpene_cyclase_C1 Terpene cyclases, Class 1. Terpene cyclases, Class 1 (C1) of the class 1 family of isoprenoid biosynthesis enzymes, which share the 'isoprenoid synthase fold' and convert linear, all-trans, isoprenoids, geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate into numerous cyclic forms of monoterpenes, diterpenes, and sesquiterpenes. Also included in this CD are the cis-trans terpene cyclases such as trichodiene synthase. The class I terpene cyclization reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl phosphates via bridging Mg2+ ions, inducing proposed conformational ch
Probab=96.76  E-value=0.0052  Score=41.15  Aligned_cols=47  Identities=26%  Similarity=0.437  Sum_probs=41.3

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcC-CCCCcHHHHHHHHHHHHHHHHhh
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVN-PHFVSDVFVGIAMNLARMSQCMY   48 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~-~~~~p~~~~~~~~NlaR~~~~~Y   48 (75)
                      +||++++.+++++.|+++++..... ++ .++.+++.+.+..|+....|
T Consensus       237 ~eA~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~g~~~w~  284 (284)
T cd00868         237 EEALEELRKMIEEAWKELNEEVLKLSSD-VPRAVLETLLNLARGIYVWY  284 (284)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCHHHHHHHHHHHHhhhhcC
Confidence            5899999999999999999998853 35 78999999999999987765


No 6  
>cd00385 Isoprenoid_Biosyn_C1 Isoprenoid Biosynthesis enzymes, Class 1. Superfamily of trans-isoprenyl diphosphate synthases (IPPS) and class I terpene cyclases which either synthesis geranyl/farnesyl diphosphates (GPP/FPP) or longer chained products from isoprene precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), or use geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate as substrate. These enzymes produce a myriad of precursors for such end products as steroids, cholesterol, sesquiterpenes, heme, carotenoids, retinoids, and diterpenes; and are widely distributed among archaea, bacteria, and eukaryota.The enzymes in this superfamily share the same 'isoprenoid synthase fold' and include several subgroups. The head-to-tail (HT) IPPS catalyze the successive 1'-4 condensation of the 5-carbon IPP to the growing isoprene chain to form linear, all-trans, C10-, C15-, C20- C25-, C30-, C35-, C40-, C45-, or C50-isoprenoid diphosphates. Cyclic monoter
Probab=89.93  E-value=0.33  Score=30.55  Aligned_cols=41  Identities=27%  Similarity=0.392  Sum_probs=31.3

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHH
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLAR   42 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR   42 (75)
                      ++|.+++..++++.|+++++.....+. .++.+++.+.++.|
T Consensus       203 ~~a~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~  243 (243)
T cd00385         203 EEALEELAKLAEEALKELNELILSLPD-VPRALLALALNLYR  243 (243)
T ss_pred             HHHHHHHHHHHHHHHHHHhcCCCCcHH-HHHHHHHHHHHHhC
Confidence            478999999999999999987664323 56777777777653


No 7  
>PF14805 THDPS_N_2:  Tetrahydrodipicolinate N-succinyltransferase N-terminal; PDB: 3EG4_A 3TDT_A 2TDT_A 1KGT_A 1TDT_A 1KGQ_A 3BXY_A 3GOS_A 3TK8_A.
Probab=73.24  E-value=14  Score=20.70  Aligned_cols=35  Identities=11%  Similarity=0.347  Sum_probs=22.9

Q ss_pred             HHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHH
Q 037321            5 EHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLAR   42 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR   42 (75)
                      +.++..|+++|..  ++-++++. .+...+..+..+-.
T Consensus         1 ~~l~~~Ie~aw~~--r~~l~~~~-~~~~~~~av~~~i~   35 (70)
T PF14805_consen    1 SQLQKIIEAAWEN--RDELTPSN-ADPELRDAVEEVIE   35 (70)
T ss_dssp             HHHHHHHHHHHHG--GGG-BTTT---HHHHHHHHHHHH
T ss_pred             ChHHHHHHHHHHh--HhhCCCcc-CCHHHHHHHHHHHH
Confidence            4688999999987  77776655 56666666655543


No 8  
>PF10540 Membr_traf_MHD:  Munc13 (mammalian uncoordinated) homology domain;  InterPro: IPR019558  Mammalian uncoordinated homology 13 (Munc13) proteins constitute a family of three highly homologous molecules (Munc13-1, Munc13-2 and Munc13-3) with homology to Caenorhabditis elegans unc-13p. Munc13 proteins contain a phorbol ester-binding C1 domain and two C2 domains, which are Ca2+/phospholipid binding domains. Sequence analyses have uncovered two regions called Munc13 homology domains 1 (MHD1) and 2 (MHD2) that are arranged between two flanking C2 domains. MHD1 and MHD2 domains are present in a wide variety of proteins from Arabidopsis thaliana (Mouse-ear cress), C. elegans, Drosophila melanogaster (Fruit fly), Mus musculus (Mouse), Rattus norvegicus (Rat) and Homo sapiens (Human), some of which may function in a Munc13-like manner to regulate membrane trafficking. The MHD1 and MHD2 domains are predicted to be alpha-helical. ; PDB: 3SWH_A.
Probab=63.38  E-value=33  Score=21.24  Aligned_cols=51  Identities=10%  Similarity=0.160  Sum_probs=28.7

Q ss_pred             HHHHHHHHHHHHHHHHHh---h-cC---------------------------CCCCcHHHHHHHHHHHHHHHHhhccCCC
Q 037321            5 EHIHDLIAQTWMKMNRDR---F-VN---------------------------PHFVSDVFVGIAMNLARMSQCMYQFGEG   53 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e~---l-~~---------------------------~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg   53 (75)
                      +-+++++.+.|+.+-...   + -|                           .+.-|+.|.....=+--+.+|||.+|+|
T Consensus        25 ~~f~~vl~~lW~~vl~~l~~llvlP~ls~~~~~~~~~~~~~~~~~~~~~~~~~~Lt~~q~~~l~~~L~~L~~FFhA~G~G  104 (137)
T PF10540_consen   25 ENFKRVLKELWKVVLETLEELLVLPPLSDKPMLGLLQSAVSSLSSHGIGGSQRPLTPKQCDRLFKWLDTLKDFFHAEGNG  104 (137)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHTTS-G------------GG-TTS-------------TCHHHHHHHHHHHHHHHHCCCTS
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHcCCCcchhHHHHHHHHHHHHHHhhcccccCCCCCHHHHHHHHHHHHHHHHHHhCCCCC
Confidence            346788889999887553   4 11                           1101333444444444457999998888


Q ss_pred             CC
Q 037321           54 HG   55 (75)
Q Consensus        54 ~t   55 (75)
                      -.
T Consensus       105 l~  106 (137)
T PF10540_consen  105 LP  106 (137)
T ss_dssp             --
T ss_pred             CC
Confidence            64


No 9  
>PF09504 RE_Bsp6I:  Bsp6I restriction endonuclease;  InterPro: IPR019037 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below:   Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA.   Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone [].   This entry includes the restriction endonuclease Bsp6I, which recognises and cleaves the double-stranded sequence GC^NGC. 
Probab=61.54  E-value=23  Score=23.33  Aligned_cols=41  Identities=20%  Similarity=0.229  Sum_probs=26.3

Q ss_pred             HHHHHHHHHhhcC----CCCCcHHHHHHHHHHHHHHHHhhccCCCCCC
Q 037321           13 QTWMKMNRDRFVN----PHFVSDVFVGIAMNLARMSQCMYQFGEGHGH   56 (75)
Q Consensus        13 ~~wK~ln~e~l~~----~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~   56 (75)
                      -.||+||......    -. +|.++-+..+=.+  ..+.-..|+|++.
T Consensus        24 ~~WK~LN~~IKn~~~RGiN-~pd~fSE~l~Cy~--~~y~~~~G~~~~D   68 (180)
T PF09504_consen   24 FKWKDLNNYIKNIYSRGIN-FPDVFSEPLVCYC--NGYYRNRGTGSGD   68 (180)
T ss_pred             HHHHHHHHHhhhccCCCcc-chHHHHHHHHHHH--HhhhhhcCCcccc
Confidence            4699999986642    24 8888777766554  4444456666543


No 10 
>PF06086 Pox_A30L_A26L:  Orthopoxvirus A26L/A30L protein;  InterPro: IPR009285 This family consists of several Orthopoxvirus A26L and A30L proteins. The Vaccinia A30L gene is regulated by a late promoter and encodes a protein of approximately 9 kDa. It is thought that the A30L protein is needed for vaccinia virus morphogenesis, specifically the association of the dense viroplasm with viral membranes [].
Probab=59.33  E-value=21  Score=24.42  Aligned_cols=46  Identities=17%  Similarity=0.125  Sum_probs=35.2

Q ss_pred             HHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCC
Q 037321            4 REHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEG   53 (75)
Q Consensus         4 ~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg   53 (75)
                      +..++.||++.|..-..+   .+. +|+.-.+++=|+-|--..-+..-||
T Consensus        23 I~~F~slV~~~W~~~L~~---~sC-I~R~~RKiIRnViR~ym~s~p~~d~   68 (220)
T PF06086_consen   23 IREFKSLVSETWNKPLNE---NSC-IPREDRKIIRNVIREYMKSAPKMDE   68 (220)
T ss_pred             HHHHHHHHHHhcCCcccc---ccc-cCHHHHHHHHHHHHHHHHhCCCCCc
Confidence            678999999999753332   567 9999999999999976665654444


No 11 
>COG4046 Uncharacterized protein conserved in archaea [Function unknown]
Probab=58.52  E-value=34  Score=24.90  Aligned_cols=53  Identities=13%  Similarity=0.124  Sum_probs=40.7

Q ss_pred             HHHHHHHHHHHHHHHHHHhhc-CCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321            4 REHIHDLIAQTWMKMNRDRFV-NPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG   57 (75)
Q Consensus         4 ~~~i~~~i~~~wK~ln~e~l~-~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~   57 (75)
                      .+|+-++=++..+++-+-+.. -+. +.++=+..++..++.++++||+-+-|-..
T Consensus       101 ~~~Ll~~~~dr~~~~v~r~~p~~D~-v~r~~l~~a~ev~~~m~~I~K~vrH~l~~  154 (368)
T COG4046         101 LRHLLEMGEDRFRKLVRRLVPAADG-VRRSNLEMALEVAASMNFIYKYVRHYLLL  154 (368)
T ss_pred             HHHHHHhhhHHHHHHHHHcCCcccc-eehhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            356666667777777776663 255 88999999999999999999988776543


No 12 
>PF07431 DUF1512:  Protein of unknown function (DUF1512);  InterPro: IPR009995 This family consists of several archaeal proteins of around 370 residues in length. The function of this family is unknown.
Probab=55.74  E-value=40  Score=24.64  Aligned_cols=53  Identities=13%  Similarity=0.038  Sum_probs=40.8

Q ss_pred             HHHHHHHHHHHHHHHHHHhhc-CCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321            4 REHIHDLIAQTWMKMNRDRFV-NPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG   57 (75)
Q Consensus         4 ~~~i~~~i~~~wK~ln~e~l~-~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~   57 (75)
                      .+|+-+.-++.+|++-+..+. .+. +-++-...++..+|.++++||--+-|...
T Consensus        88 ~~hll~t~e~~~~~~v~~~~p~~d~-~~~s~~~~~lei~~~ln~IyKvvrHy~~l  141 (355)
T PF07431_consen   88 LKHLLRTYEDRFEKEVKRLAPNADE-VEVSNIETALEIARSLNMIYKVVRHYLLL  141 (355)
T ss_pred             HHHHHHhhHHHHHHHHHHhCCCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            356667777888888887663 244 67788899999999999999887776644


No 13 
>PF08285 DPM3:  Dolichol-phosphate mannosyltransferase subunit 3 (DPM3);  InterPro: IPR013174 This family corresponds to subunit 3 of dolichol-phosphate mannosyltransferase, an enzyme which generates mannosyl donors for glycosylphosphatidylinositols, N-glycan and protein O- and C-mannosylation. DPM3 is an integral membrane protein and plays a role in stabilising the dolichol-phosphate mannosyl transferase complex [].
Probab=54.35  E-value=16  Score=21.41  Aligned_cols=20  Identities=15%  Similarity=0.265  Sum_probs=17.6

Q ss_pred             ChHHHHHHHHHHHHHHHHHH
Q 037321            1 REAREHIHDLIAQTWMKMNR   20 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~   20 (75)
                      +||.+++++.|++|-++|-+
T Consensus        67 peA~~eL~~eI~eAK~dLr~   86 (91)
T PF08285_consen   67 PEAAKELQKEIKEAKADLRK   86 (91)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            58999999999999988765


No 14 
>PF07637 PSD5:  Protein of unknown function (DUF1595);  InterPro: IPR013043  A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013039 from INTERPRO and IPR013042 from INTERPRO.
Probab=52.82  E-value=36  Score=18.30  Aligned_cols=56  Identities=18%  Similarity=0.240  Sum_probs=37.2

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcccc
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDP   73 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~P   73 (75)
                      +.|++-|+.....+|+.         + ++..=++..+.+..-   -+..|++|.   .. ++.-+..+|..|
T Consensus         1 ~~a~~~l~~Fa~rAfRR---------p-~~~~e~~~~~~~~~~---~~~~g~~~~---~a-~~~~l~aiL~SP   56 (64)
T PF07637_consen    1 ACAREILRRFARRAFRR---------P-LTDEEVDRYLALYDS---ARAQGEDFE---EA-LKEALQAILCSP   56 (64)
T ss_pred             ChHHHHHHHHHHHHhCC---------C-CCHHHHHHHHHHHHH---HHHcCCCHH---HH-HHHHHHHHHcCc
Confidence            35778888888888854         3 555546666665432   355566654   34 899999998877


No 15 
>PF01484 Col_cuticle_N:  Nematode cuticle collagen N-terminal domain;  InterPro: IPR002486 The function of this domain is unknown. It is found in the N-terminal region of nematode cuticle collagens (see IPR008160 from INTERPRO). Cuticle is a tough elastic structure secreted by hypodermal cells and is primarily composed of collagen proteins [, ].; GO: 0042302 structural constituent of cuticle
Probab=47.68  E-value=15  Score=18.52  Aligned_cols=14  Identities=21%  Similarity=0.667  Sum_probs=10.1

Q ss_pred             HHHHHHHHHHHHHH
Q 037321            5 EHIHDLIAQTWMKM   18 (75)
Q Consensus         5 ~~i~~~i~~~wK~l   18 (75)
                      +..|...+++|++|
T Consensus        40 ~~fk~~s~d~W~~m   53 (53)
T PF01484_consen   40 EEFKEISDDAWNEM   53 (53)
T ss_pred             HHHHHHHHHHHhhC
Confidence            46677778888764


No 16 
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z,  the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=46.38  E-value=49  Score=18.83  Aligned_cols=37  Identities=16%  Similarity=0.047  Sum_probs=21.6

Q ss_pred             HHHHHHHHHHHHHHHhh-ccCCCCCCCchhhHHHHHHhh
Q 037321           32 VFVGIAMNLARMSQCMY-QFGEGHGHGVQEITKARVLSL   69 (75)
Q Consensus        32 ~~~~~~~NlaR~~~~~Y-~~~Dg~t~~~~~~~k~~I~sl   69 (75)
                      .+=..+-.+.++.+.+= ++|||++.+..+ ++..+.+.
T Consensus         4 ~le~a~~~~~~~F~~~dd~dgdg~~Is~~E-L~~ll~~~   41 (93)
T cd05026           4 QLEGAMDTLIRIFHNYSGKEGDRYKLSKGE-LKELLQRE   41 (93)
T ss_pred             HHHHHHHHHHHHHHHHHccCCCCCEECHHH-HHHHHHHH
Confidence            34344455555555532 357777666556 88888664


No 17 
>PF11707 Npa1:  Ribosome 60S biogenesis N-terminal;  InterPro: IPR021714  Npa1p is required for ribosome biogenesis and operates in the same functional environment as Rsa3p and Dbp6p during early maturation of 60S ribosomal subunits []. The protein partners of Npa1p include eight putative helicases as well as the novel Npa2p factor. Npa1p can also associate with a subset of H/ACA and C/D small nucleolar RNPs (snoRNPs) involved in the chemical modification of residues in the vicinity of the peptidyl transferase centre []. The protein has also been referred to as Urb1, and this domain at the N-terminal is one of several conserved regions along the length. 
Probab=42.88  E-value=82  Score=22.14  Aligned_cols=58  Identities=14%  Similarity=0.154  Sum_probs=35.0

Q ss_pred             HHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH---HHHHHHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321            5 EHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN---LARMSQCMYQFGEGHGHGVQEITKARVLSLIV   71 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N---laR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~   71 (75)
                      +-|..+++--|+.+    +.+.. +|+..+-.+||   +.+++. +|...++  ..... +.+.+..+|.
T Consensus       173 ~~v~~iL~~l~~~V----l~~~~-v~r~~K~~~fn~~~L~~l~~-Ly~~~~~--~~~~~-~~~~vh~fL~  233 (330)
T PF11707_consen  173 ETVILILETLKDKV----LKDSS-VSRSTKCKLFNEWTLSQLAS-LYSRDGE--DEKSS-VADLVHEFLL  233 (330)
T ss_pred             HHHHHHHHHHHHHh----ccCCC-CChhhhhhhcCHHHHHHHHH-HhcccCC--cccch-HHHHHHHHHH
Confidence            34445555555333    34556 99999999998   677777 5665555  22234 6666665543


No 18 
>PF11989 Dsl1_C:  Retrograde transport protein Dsl1 C terminal;  InterPro: IPR021876  Dsl1 is a peripheral membrane protein required for transport between the Golgi and the endoplasmic reticulum []. It is localised to the ER membrane, and in vitro it specifically binds to coatomer, the major component of the protein coat of COPI vesicles []. Binding sites for coatomer are found on a disorganised region between the C and N termini of Dsl1 []. The C-terminal domain is involved in binding to the Sec39 subunit of the Dsl1p complex []. The N-terminal complexes with another subunit of the Dsl1p complex called Tip20 which forms heterodimers by pairing the N termini of each protein []. ; PDB: 3K8P_C.
Probab=40.65  E-value=25  Score=24.84  Aligned_cols=31  Identities=29%  Similarity=0.356  Sum_probs=20.8

Q ss_pred             HHHHHHHhhccCCCCCCCchhhHHHHHHhhccc
Q 037321           40 LARMSQCMYQFGEGHGHGVQEITKARVLSLIVD   72 (75)
Q Consensus        40 laR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~   72 (75)
                      ++=++..||+ ||=|-.+..+ +-+.|++||.+
T Consensus       241 LkDIm~~Fy~-Gel~~fsTdE-lI~lIkslFad  271 (291)
T PF11989_consen  241 LKDIMEMFYQ-GELYDFSTDE-LIQLIKSLFAD  271 (291)
T ss_dssp             HHHHHHHHHT-TGGGGS-HHH-HHHHHHHHS--
T ss_pred             HHHHHHHHhc-CchhcccHHH-HHHHHHHHhcC
Confidence            6677888887 4544444446 99999999986


No 19 
>cd04470 S1_EF-P_repeat_1 S1_EF-P_repeat_1: Translation elongation factor P (EF-P), S1-like RNA-binding domain, repeat 1. EF-P stimulates the peptidyltransferase activity in the prokaryotic 70S ribosome. EF-P enhances the synthesis of certain dipeptides with N-formylmethionyl-tRNA and puromycine in vitro. EF-P binds to both the 30S and 50S ribosomal subunits. EF-P binds near the streptomycine binding site of the 16S rRNA in the 30S subunit. EF-P interacts with domains 2 and 5 of the 23S rRNA. The L16 ribosomal protein of the 50S or its N-terminal fragment are required for EF-P mediated peptide bond synthesis, whereas L11, L15, and L7/L12 are not required in this reaction, suggesting that EF-P may function at a different ribosomal site than most other translation factors. EF-P is essential for cell viability and is required for protein synthesis. EF-P is mainly present in bacteria. The EF-P homologs in archaea and eukaryotes are the initiation factors aIF5A and eIF5A, respectively. EF-P 
Probab=40.56  E-value=3.4  Score=22.31  Aligned_cols=15  Identities=27%  Similarity=0.339  Sum_probs=11.3

Q ss_pred             HHHHHhhccCCCCCC
Q 037321           42 RMSQCMYQFGEGHGH   56 (75)
Q Consensus        42 R~~~~~Y~~~Dg~t~   56 (75)
                      |-++++|++||+|..
T Consensus         3 ~~~qylY~dg~~~~F   17 (61)
T cd04470           3 REMQYLYKDGDNYVF   17 (61)
T ss_pred             ceEEEEEeCCCEEEE
Confidence            456888999988743


No 20 
>PF11613 UCN2:  Agonist of corticotropin releasing factor R2, Urocortin-2;  InterPro: IPR024270 This entry represents urocortin II and III, which are members of the corticotropin-releasing factor (CRF) family of neuropeptides. Urocortin II and III are secreted peptides and bind with high affinity to CRF receptors 2-alpha and 2-beta. They are involved in the suppression of food intake, the delay of gastric emptying, and may represent endogenous ligands for maintaining homeostasis after stress. Their backbone is mainly alpha-helical, but contains a helix-loop-helix motif [].; GO: 0051431 corticotropin-releasing hormone receptor 2 binding, 0006950 response to stress, 0007586 digestion, 0005615 extracellular space; PDB: 2RMH_A 3N93_C 3N95_F 2RMG_A.
Probab=40.18  E-value=18  Score=17.97  Aligned_cols=17  Identities=24%  Similarity=0.409  Sum_probs=14.8

Q ss_pred             CcHHHHHHHHHHHHHHH
Q 037321           29 VSDVFVGIAMNLARMSQ   45 (75)
Q Consensus        29 ~p~~~~~~~~NlaR~~~   45 (75)
                      +|..+.+..+|.||.=.
T Consensus         7 VPt~im~iL~~~Akak~   23 (38)
T PF11613_consen    7 VPTNIMNILFNIAKAKN   23 (38)
T ss_dssp             TTHHHHHHHHHCCHHHH
T ss_pred             CchhHHHHHHHHHHHHH
Confidence            99999999999998643


No 21 
>PF02269 TFIID-18kDa:  Transcription initiation factor IID, 18kD subunit;  InterPro: IPR003195 This family includes the Spt3 yeast transcription factors and the 18 kDa subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone fold [].; GO: 0006366 transcription from RNA polymerase II promoter; PDB: 1BH9_A 1BH8_A.
Probab=38.61  E-value=26  Score=20.37  Aligned_cols=26  Identities=31%  Similarity=0.414  Sum_probs=14.1

Q ss_pred             HHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321           44 SQCMYQFGEGHGHGVQEITKARVLSLIV   71 (75)
Q Consensus        44 ~~~~Y~~~Dg~t~~~~~~~k~~I~sll~   71 (75)
                      .++||..||. ..|..+ +-+.|..++.
T Consensus         8 ~~mMy~fGD~-~~P~~e-Tv~lvE~iv~   33 (93)
T PF02269_consen    8 RQMMYGFGDV-EEPLPE-TVDLVEDIVR   33 (93)
T ss_dssp             HHHHHCTTS--SS--HH-HHHHHHHHHH
T ss_pred             HHHHHHcCCC-CCCcHH-HHHHHHHHHH
Confidence            4678999986 334444 5555555543


No 22 
>PRK10626 hypothetical protein; Provisional
Probab=38.08  E-value=25  Score=24.28  Aligned_cols=57  Identities=11%  Similarity=0.119  Sum_probs=31.0

Q ss_pred             HHHHHHHHHHHHHHHHHHhhc-CCC-C-CcHHHHHHHHHH-HHHHHHhhccCCCCCCCchh
Q 037321            4 REHIHDLIAQTWMKMNRDRFV-NPH-F-VSDVFVGIAMNL-ARMSQCMYQFGEGHGHGVQE   60 (75)
Q Consensus         4 ~~~i~~~i~~~wK~ln~e~l~-~~~-~-~p~~~~~~~~Nl-aR~~~~~Y~~~Dg~t~~~~~   60 (75)
                      .+...+=++.++.-|.+-... ... + +-..+-...-++ .|+-.++|..+|||+.++..
T Consensus        88 ~~~a~~~l~~A~~alD~Vi~~~~G~~snvr~rl~~l~~~l~~q~~~i~e~r~d~~~f~~~~  148 (239)
T PRK10626         88 DEGAKSRLEKARVALDKVIVQELGESSNVRNRLTKLDAQLKQQMNRIIEHRSDGLTFHHQA  148 (239)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhccCccchHHHHHHHHHHHHHHHHHHHHHccCCceeehHHH
Confidence            334444555666666655542 111 0 211222332222 46777889999999988653


No 23 
>PF04659 Arch_fla_DE:  Archaeal flagella protein ;  InterPro: IPR006752  Archaeal flagella are unique motility structures, and the absence of bacterial structural motility genes in the complete genome sequences of flagellated archaeal species has always suggested that archaeal flagellar biogenesis is likely mediated by novel components. FlaD and FlaE, are present in the cell as membrane-associated proteins but are not major components of isolated flagellar filaments. Interestingly, flaD was found to encode two proteins, each translated from a separate ribosome binding site. This group of sequences contain the archaeal flaD and flaE proteins. The conserved region that defines these sequences is found in the N-teminal region of flaE but towards the C-terminal region of flaD [].; GO: 0001539 ciliary or flagellar motility
Probab=37.96  E-value=46  Score=19.81  Aligned_cols=42  Identities=10%  Similarity=-0.107  Sum_probs=23.1

Q ss_pred             CcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321           29 VSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIV   71 (75)
Q Consensus        29 ~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~   71 (75)
                      ++..+..-....+|.+...-.+.+.-+...-. +.||+.||++
T Consensus        50 ISe~V~~~L~~~~~G~~~~~~~~~~~~~~~Lt-~~DH~~SL~y   91 (99)
T PF04659_consen   50 ISEEVREQLLKYLKGFKYDHDDFEWKPSWKLT-IDDHIKSLLY   91 (99)
T ss_pred             cCHHHHHHHHHHHhCCCCCccccccCCCCCCC-HHHHHHHHHH
Confidence            55566666667776665322111111222224 7899999975


No 24 
>PF11802 CENP-K:  Centromere-associated protein K;  InterPro: IPR020993 Cenp-K is one of seven new Cenp-A-nucleosome distal (CAD) centromere components (the others being Cenp-L, Cenp-O, Cenp-P, Cenp-Q, Cenp-R and Cenp-S) that are identified as assembling on the Cenp-A nucleosome associated complex, NAC []. The Cenp-A NAC is essential, as disruption of the complex causes errors of chromosome alignment and segregation that preclude cell survival despite continued centromere-derived mitotic checkpoint signalling. Cenp-K is centromere-associated through its interaction with one or more components of the Cenp-A NAC.; GO: 0005634 nucleus
Probab=37.86  E-value=46  Score=23.39  Aligned_cols=20  Identities=20%  Similarity=0.420  Sum_probs=17.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHH
Q 037321            2 EAREHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e   21 (75)
                      +|.+.+.+-.++.||+|++.
T Consensus        16 ~~~~~l~~eCEe~wk~me~~   35 (268)
T PF11802_consen   16 DAKEELIKECEELWKDMEEC   35 (268)
T ss_pred             hHHHHHHHHHHHHHHHHHHH
Confidence            57888999999999999864


No 25 
>PHA02676 A-type inclusion protein; Provisional
Probab=37.34  E-value=35  Score=26.08  Aligned_cols=48  Identities=8%  Similarity=0.067  Sum_probs=34.3

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCC
Q 037321            2 EAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEG   53 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg   53 (75)
                      .-++.++.||++.|..-..+   .+. +|+.-.+++=|+-|--..-+---||
T Consensus         6 klV~~FkslV~~tW~~~Ln~---~sC-IpR~~RKiIRnVlR~YI~~aP~i~g   53 (520)
T PHA02676          6 SLVAEFKSLVQRHWHHPLNS---VSC-IPREDRKLIRDILREYIRLAPPIDG   53 (520)
T ss_pred             HHHHHHHHHHHHhcCCcccc---ccc-ccHHHHHHHHHHHHHHHHhCCCccc
Confidence            45678999999999653332   566 9999999999999965444433344


No 26 
>cd07018 S49_SppA_67K_type Signal peptide peptidase A (SppA) 67K type, a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV) 67K type: SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Members in this subfamily contain an amino-terminal domain in addition to the carboxyl-terminal protease domain that is conserved in all the S49 family members (sometimes referred to as 67K type), similar to E. coli and Arabidopsis thaliana SppA peptidases. Unlike the eukaryotic functional homologs that are proposed to be aspartic proteases, site-directed mutagenesis and sequence analysis have shown that members in this subfamily, mostly bacterial, are serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys
Probab=36.35  E-value=1.1e+02  Score=20.22  Aligned_cols=38  Identities=21%  Similarity=0.266  Sum_probs=28.9

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN   39 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N   39 (75)
                      +++++.++.++++.|...-+..-+... ++...++...+
T Consensus       155 ~~~r~~~~~~l~~~~~~f~~~Va~~R~-~~~~~~~~~~~  192 (222)
T cd07018         155 PEAREQTQALLDSLWDQYLADVAASRG-LSPDALEALID  192 (222)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHcC-CCHHHHHHHHH
Confidence            478899999999999998888876555 66665554443


No 27 
>PF13798 PCYCGC:  Protein of unknown function with PCYCGC motif
Probab=36.02  E-value=29  Score=22.52  Aligned_cols=23  Identities=9%  Similarity=0.124  Sum_probs=16.8

Q ss_pred             HHHHHHHHHHHHHHHHHHhhcCCCCCc
Q 037321            4 REHIHDLIAQTWMKMNRDRFVNPHFVS   30 (75)
Q Consensus         4 ~~~i~~~i~~~wK~ln~e~l~~~~~~p   30 (75)
                      .++||..|++.||+   .+.+|++ -|
T Consensus       134 ~~eIR~~ID~kYk~---g~~~pTp-Tp  156 (158)
T PF13798_consen  134 PKEIRQYIDEKYKE---GYAKPTP-TP  156 (158)
T ss_pred             HHHHHHHHHHHHHh---CCCCCCC-CC
Confidence            46899999999985   3666665 44


No 28 
>PF10835 DUF2573:  Protein of unknown function (DUF2573);  InterPro: IPR020393 This entry contains proteins with no known function.
Probab=35.53  E-value=66  Score=18.64  Aligned_cols=21  Identities=33%  Similarity=0.529  Sum_probs=17.4

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhh
Q 037321            2 EAREHIHDLIAQTWMKMNRDRF   23 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e~l   23 (75)
                      ||.+.|+.+|++- |+||++.-
T Consensus        57 eaK~~ik~li~~I-k~lNe~~r   77 (82)
T PF10835_consen   57 EAKEEIKELIEEI-KQLNEAHR   77 (82)
T ss_pred             hHHHHHHHHHHHH-HHHHHHHH
Confidence            7899999999875 69998754


No 29 
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=35.46  E-value=27  Score=19.08  Aligned_cols=17  Identities=18%  Similarity=0.517  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 037321            5 EHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e   21 (75)
                      .+|..+|.+.|+.|..+
T Consensus        30 ~eisk~~g~~Wk~ls~e   46 (77)
T cd01389          30 NEISRIIGRMWRSESPE   46 (77)
T ss_pred             HHHHHHHHHHHhhCCHH
Confidence            46788888999876543


No 30 
>PF12357 PLD_C:  Phospholipase D C terminal ;  InterPro: IPR024632 Phospholipase D (PLD) catalyses the hydrolysis of the phosphodiester bond of glycerophospholipids to generate phosphatidic acid and a free head group. Phospholipase D activities have been detected in simple to complex organisms from viruses and bacteria to yeast, plants, and mammals []. In higher organisms, PLD specifically catalyzes the hydrolysis of phosphatidylcholine (PC) to phosphatidic acid (PA) and choline and is activated in response to stimulators of vesicle transport, endocytosis, exocytosis, cell migration, and mitosis. This entry represents the C-terminal domain of eukaryotic phospholipase D. The domain is approximately 70 amino acids in length and contains a conserved FPD sequence motif.
Probab=34.37  E-value=52  Score=18.73  Aligned_cols=22  Identities=9%  Similarity=0.472  Sum_probs=17.5

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhh
Q 037321            2 EAREHIHDLIAQTWMKMNRDRF   23 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e~l   23 (75)
                      |.++.++++.++.|+..-.+-+
T Consensus        12 eCVr~Vn~iae~nW~~y~~ee~   33 (74)
T PF12357_consen   12 ECVRRVNEIAEENWKQYASEEV   33 (74)
T ss_pred             HHHHHHHHHHHHHHHHhhcccc
Confidence            5678999999999987665544


No 31 
>PF05341 DUF708:  Protein of unknown function (DUF708);  InterPro: IPR008005 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf68; it is a family of uncharacterised viral proteins.
Probab=34.25  E-value=55  Score=19.87  Aligned_cols=35  Identities=11%  Similarity=0.136  Sum_probs=23.2

Q ss_pred             HHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhc
Q 037321           12 AQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQ   49 (75)
Q Consensus        12 ~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~   49 (75)
                      |.|||+|-=..+..++   ..--+..++-|-.-+|=|+
T Consensus        20 e~AWkdLii~~l~~tP---~~tyRt~l~ka~~EnFDYk   54 (105)
T PF05341_consen   20 EHAWKDLIILLLRNTP---RSTYRTMLRKANFENFDYK   54 (105)
T ss_pred             HHHHHHHHHHHHhcCc---HHHHHhhhhhhhhhccCCC
Confidence            6899999998887654   4333555666655555554


No 32 
>PF12910 RelB_N:  Antitoxin of toxin-antitoxin stability system N-terminal; PDB: 3K6Q_C.
Probab=33.87  E-value=37  Score=17.10  Aligned_cols=16  Identities=31%  Similarity=0.401  Sum_probs=13.2

Q ss_pred             hHHHHHHHHHHHHHHH
Q 037321            2 EAREHIHDLIAQTWMK   17 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~   17 (75)
                      ||+.++.++++.+..+
T Consensus         8 eAR~~ls~l~d~v~~~   23 (46)
T PF12910_consen    8 EARNNLSKLIDRVVNG   23 (46)
T ss_dssp             HHHHTHHHHHHHHHHT
T ss_pred             HHHHhHHHHHHHHHcC
Confidence            7888999998888754


No 33 
>PF01132 EFP:  Elongation factor P (EF-P) OB domain;  InterPro: IPR001059 Elongation factor P (EF-P) is a prokaryotic protein translation factor required for efficient peptide bond synthesis on 70S ribosomes from fMet-tRNAfMet []. Probably functions indirectly by altering the affinity of the ribosome for aminoacyl-tRNA, thus increasing their reactivity as acceptors for peptidyl transferase. This entry reresents the central domain of elongation factor P and its homologues. It forms an oligonucleotide-binding (OB) fold, though it is not clear if this region is involved in binding nucleic acids [].; GO: 0003746 translation elongation factor activity, 0006414 translational elongation; PDB: 1YBY_A 3A5Z_H 3TRE_A 1UEB_B 3HUW_V 3HUY_V 3OYY_B.
Probab=32.57  E-value=2.7  Score=22.18  Aligned_cols=16  Identities=25%  Similarity=0.245  Sum_probs=11.4

Q ss_pred             HHHHHhhccCCCCCCC
Q 037321           42 RMSQCMYQFGEGHGHG   57 (75)
Q Consensus        42 R~~~~~Y~~~Dg~t~~   57 (75)
                      |-++++|++||+|..-
T Consensus         2 r~~qylY~dgd~~~FM   17 (55)
T PF01132_consen    2 REMQYLYKDGDNYVFM   17 (55)
T ss_dssp             EEEEEEEEESSEEEEE
T ss_pred             ceEEEEEeCCCEEEEe
Confidence            3457889998887654


No 34 
>PF10305 Fmp27_SW:  RNA pol II promoter Fmp27 protein domain;  InterPro: IPR019415 The function of the FMP27 protein is not known. FMP27 is the product of a nuclear encoded gene but it is detected in highly purified mitochondria in high-throughput studies []. This entry represents a conserved region within FMP27 that contains characteristic SW and GKG sequence motifs. 
Probab=30.94  E-value=57  Score=19.29  Aligned_cols=19  Identities=32%  Similarity=0.667  Sum_probs=14.7

Q ss_pred             hHHHHHHHHHHHHHHHHHH
Q 037321            2 EAREHIHDLIAQTWMKMNR   20 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~   20 (75)
                      ||+..+.+-...+|+.=-+
T Consensus         1 eA~~rL~~~~S~SWi~ri~   19 (103)
T PF10305_consen    1 EARERLQENNSTSWIRRIR   19 (103)
T ss_pred             CHHHHHHHHHHHHHHHHHH
Confidence            6888888888888876433


No 35 
>PF14394 DUF4423:  Domain of unknown function (DUF4423)
Probab=30.80  E-value=63  Score=20.79  Aligned_cols=24  Identities=25%  Similarity=0.068  Sum_probs=20.0

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhc
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFV   24 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~   24 (75)
                      +++.++|+++|++.+|+++...=+
T Consensus       131 ~~~~~ki~~~i~~fRk~i~~i~~~  154 (171)
T PF14394_consen  131 REDYEKIKKEIREFRKKIIAIAEE  154 (171)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhc
Confidence            368899999999999999976543


No 36 
>PF01023 S_100:  S-100/ICaBP type calcium binding domain;  InterPro: IPR013787 The calcium-binding domain found in S100 and CaBP-9k proteins is a subfamily of the EF-hand calcium-binding domain []. S100s are small dimeric acidic calcium and zinc-binding proteins abundant in the brain, with S100B playing an important role in modulating the proliferation and differentiation of neurons and glia cells []. S100 proteins have two different types of calcium-binding sites: a low affinity one with a special structure, and a 'normal' EF-hand type high-affinity site. Calbindin-D9k (CaBP-9k) also belong to this family of proteins, but it does not form dimers. CaBP-9k is a cytosolic protein expressed in a variety of tissues. Although its precise function is unknown, it appears to be under the control of the steroid hormones oestrogen and progesterone in the female reproductive system []. In the intestine, CaBP-9k may be involved in calcium absorption by mediating intracellular diffusion []. This entry represents a subdomain of the calcium-binding domain found in S100, CaBP-9k, and related proteins.; PDB: 2RGI_A 4DUQ_B 2KAY_B 2KAX_A 2CNP_A 1CNP_A 1A03_A 1JWD_B 2JTT_A 1XK4_B ....
Probab=30.45  E-value=76  Score=15.86  Aligned_cols=18  Identities=17%  Similarity=0.082  Sum_probs=10.5

Q ss_pred             cCCCCCCCchhhHHHHHHh
Q 037321           50 FGEGHGHGVQEITKARVLS   68 (75)
Q Consensus        50 ~~Dg~t~~~~~~~k~~I~s   68 (75)
                      +||..+.+..+ +|..+..
T Consensus        19 ~Gd~~~Lsk~E-lk~Ll~~   36 (44)
T PF01023_consen   19 EGDKDTLSKKE-LKELLEK   36 (44)
T ss_dssp             SSSTTSEEHHH-HHHHHHH
T ss_pred             CCCCCeEcHHH-HHHHHHH
Confidence            45666665545 7766653


No 37 
>PF13864 Enkurin:  Calmodulin-binding
Probab=30.36  E-value=83  Score=18.21  Aligned_cols=23  Identities=17%  Similarity=0.469  Sum_probs=18.1

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhc
Q 037321            2 EAREHIHDLIAQTWMKMNRDRFV   24 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e~l~   24 (75)
                      +-+..+-.-+...|.++|.++-.
T Consensus        40 eER~~lL~~Lk~~~~el~~ey~~   62 (98)
T PF13864_consen   40 EERQELLEGLKKNWDELNKEYQK   62 (98)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHh
Confidence            34566777788999999999874


No 38 
>TIGR00965 dapD 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. The closely related TabB protein of Pseudomonas syringae (pv. tabaci) appears to act in the biosynthesis of tabtoxin rather than lysine. The trusted cutoff is set high enough to exclude this gene. Sequences below trusted also include a version of this enzyme which apparently utilize acetate rather than succinate (EC: 2.3.1.89).
Probab=30.12  E-value=1.3e+02  Score=21.09  Aligned_cols=31  Identities=10%  Similarity=0.247  Sum_probs=19.4

Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH
Q 037321            6 HIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN   39 (75)
Q Consensus         6 ~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N   39 (75)
                      +++..|+++|.+  ++-+.+.. .+...++.+..
T Consensus         1 ~~~~~i~~~~~~--~~~~~~~~-~~~~~~~~~~~   31 (269)
T TIGR00965         1 QLQNIIETAFER--RAEITPAN-ADTVTKEAVNE   31 (269)
T ss_pred             CHHHHHHHHHHH--HHhCCCcC-CCHHHHHHHHH
Confidence            368899999987  44465444 44445444433


No 39 
>TIGR00706 SppA_dom signal peptide peptidase SppA, 36K type. The member of this family from Bacillus subtilis was shown to have properties consistent with a role in degrading signal peptides after cleavage from precursor proteins, although it was not demonstrated conclusively.
Probab=30.12  E-value=1.5e+02  Score=19.23  Aligned_cols=37  Identities=11%  Similarity=0.097  Sum_probs=27.3

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHH
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAM   38 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~   38 (75)
                      ++.++.++.++++.|...-...-+... ++..-++.++
T Consensus       137 ~~~~e~~~~~l~~~~~~f~~~va~~R~-~~~~~~~~~~  173 (207)
T TIGR00706       137 PEERDILQNLVNESYEQFVQVVAKGRN-LPVEDVKKFA  173 (207)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHHh
Confidence            367899999999999998888776555 6655444433


No 40 
>cd07978 TAF13 The TATA Binding Protein (TBP) Associated Factor 13 (TAF13) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 13 (TAF13) is one of several TAFs that bind TBP and is  involved  in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAFs orthologs and paralogs. Several hy
Probab=29.96  E-value=61  Score=18.90  Aligned_cols=26  Identities=31%  Similarity=0.498  Sum_probs=14.7

Q ss_pred             HHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321           44 SQCMYQFGEGHGHGVQEITKARVLSLIV   71 (75)
Q Consensus        44 ~~~~Y~~~Dg~t~~~~~~~k~~I~sll~   71 (75)
                      .+++|..||.- .|..+ +-+.|..++.
T Consensus         9 ~~mmy~~GD~~-~P~~e-Tv~llE~iv~   34 (92)
T cd07978           9 RQMMYGFGDVQ-NPLPE-TVDLLEDIVV   34 (92)
T ss_pred             HHHHHHcCCCC-CCCHH-HHHHHHHHHH
Confidence            46778888862 24334 5555554443


No 41 
>PF14799 FAM195:  FAM195 family
Probab=29.64  E-value=86  Score=18.59  Aligned_cols=21  Identities=24%  Similarity=0.491  Sum_probs=17.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHhh
Q 037321            3 AREHIHDLIAQTWMKMNRDRF   23 (75)
Q Consensus         3 A~~~i~~~i~~~wK~ln~e~l   23 (75)
                      |-+++-+.|-++|..+.++.=
T Consensus        33 qHEElVrfI~esW~~V~~~~~   53 (99)
T PF14799_consen   33 QHEELVRFIYESWNKVERELE   53 (99)
T ss_pred             hHHHHHHHHHHHHHHHHHHcc
Confidence            457788899999999998853


No 42 
>PF12807 eIF3_p135:  Translation initiation factor eIF3 subunit 135
Probab=29.41  E-value=1.4e+02  Score=18.85  Aligned_cols=19  Identities=16%  Similarity=0.345  Sum_probs=16.8

Q ss_pred             HHHHHHHHHHHHHHHhhcC
Q 037321            7 IHDLIAQTWMKMNRDRFVN   25 (75)
Q Consensus         7 i~~~i~~~wK~ln~e~l~~   25 (75)
                      +.+||..+.|.+-++.+..
T Consensus        46 ~~EmiaR~~K~i~r~~l~~   64 (169)
T PF12807_consen   46 LQEMIARAAKHILREYLRS   64 (169)
T ss_pred             HHHHHHHHHHHHHHHHHHc
Confidence            4789999999999999974


No 43 
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=29.26  E-value=40  Score=18.14  Aligned_cols=17  Identities=12%  Similarity=0.585  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 037321            5 EHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e   21 (75)
                      ..|.+++.+.|+.|..+
T Consensus        30 ~eisk~l~~~Wk~ls~~   46 (72)
T cd01388          30 RAISKILGDRWKALSNE   46 (72)
T ss_pred             HHHHHHHHHHHHcCCHH
Confidence            46777888889876543


No 44 
>cd07019 S49_SppA_1 Signal peptide peptidase A (SppA), a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV): SppAs in this subfamily are found in all three domains of life and are involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Site-directed mutagenesis and sequence analysis have shown these bacterial, archaeal and thylakoid SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad (both residues absolutely conserved within bacteria, chloroplast and mitochondrial signal peptidase family members) and not the usual Ser-His-Asp catalytic triad found in the majority of serine proteases. In addition to the carboxyl-terminal protease domain that is conserved in all the S49 family members, the E. coli SppA contains an amino-te
Probab=28.97  E-value=1.2e+02  Score=19.84  Aligned_cols=35  Identities=11%  Similarity=-0.016  Sum_probs=26.1

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHH
Q 037321            2 EAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIA   37 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~   37 (75)
                      |+++.++.++++.|....+..-+... +|..-++..
T Consensus       146 e~r~~~~~~ld~~~~~f~~~Va~~R~-~~~~~l~~~  180 (211)
T cd07019         146 EAQLGLQLSIENGYKRFITLVADARH-STPEQIDKI  180 (211)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhCC-CCHHHHHHh
Confidence            67889999999999999887776555 665543333


No 45 
>PF07553 Lipoprotein_Ltp:  Host cell surface-exposed lipoprotein;  InterPro: IPR011434 This domain is found as 1-3 copies in a small family of proteins of unknown function.
Probab=28.78  E-value=32  Score=17.80  Aligned_cols=12  Identities=33%  Similarity=0.598  Sum_probs=8.8

Q ss_pred             HHhhccCCCCCC
Q 037321           45 QCMYQFGEGHGH   56 (75)
Q Consensus        45 ~~~Y~~~Dg~t~   56 (75)
                      ++.+.+|||||.
T Consensus        27 QL~se~ge~Ft~   38 (48)
T PF07553_consen   27 QLTSEYGEGFTE   38 (48)
T ss_pred             HHHhhcccCCCH
Confidence            445678999985


No 46 
>PF08494 DEAD_assoc:  DEAD/H associated;  InterPro: IPR013701 This domain is found in ATP-dependent helicases as well as a number of hypothetical proteins together with the helicase conserved C-terminal domain (IPR011545 from INTERPRO) and the IPR001650 from INTERPRO domain. ; GO: 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
Probab=28.75  E-value=1.7e+02  Score=18.92  Aligned_cols=42  Identities=12%  Similarity=0.048  Sum_probs=31.9

Q ss_pred             HHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321           12 AQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG   57 (75)
Q Consensus        12 ~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~   57 (75)
                      .+...++-++.+..++    -++..+.+.|+.+-.+-++.-|...|
T Consensus        80 ~~~l~~~l~~al~~s~----lfk~rfr~vA~r~gli~r~~~g~~~~  121 (187)
T PF08494_consen   80 PDNLEEDLREALKRSE----LFKRRFRHVARRFGLIPRNYPGKRKS  121 (187)
T ss_pred             HHHHHHHHHHHHhcCH----HHHHHHHHHHHHhCCCcCCCCCCcCc
Confidence            5777788888886554    78899999999998887765555544


No 47 
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=27.84  E-value=68  Score=17.22  Aligned_cols=17  Identities=12%  Similarity=0.522  Sum_probs=11.8

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 037321            5 EHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e   21 (75)
                      ..+...|.+.|+.|..+
T Consensus        33 ~e~~k~~~~~Wk~Ls~~   49 (73)
T PF09011_consen   33 REVMKEISERWKSLSEE   49 (73)
T ss_dssp             HHHHHHHHHHHHHS-HH
T ss_pred             HHHHHHHHHHHHhcCHH
Confidence            45667888889887754


No 48 
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=27.76  E-value=52  Score=16.83  Aligned_cols=18  Identities=6%  Similarity=0.562  Sum_probs=12.2

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q 037321            4 REHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         4 ~~~i~~~i~~~wK~ln~e   21 (75)
                      ...|...+.+.|+.|..+
T Consensus        28 ~~~i~~~~~~~W~~ls~~   45 (66)
T cd01390          28 VTEVTKILGEKWKELSEE   45 (66)
T ss_pred             HHHHHHHHHHHHHhCCHH
Confidence            346777788888776543


No 49 
>PF01713 Smr:  Smr domain;  InterPro: IPR002625 This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 (P23909 from SWISSPROT) protein in the case of Smr proteins and with the N-terminal MutS related region of MutS2, P94545 from SWISSPROT [].; PDB: 3QD7_X 2D9I_A 3FAU_A 2VKC_A 2ZQE_A.
Probab=27.57  E-value=59  Score=17.86  Aligned_cols=22  Identities=14%  Similarity=0.237  Sum_probs=17.3

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHh
Q 037321            1 REAREHIHDLIAQTWMKMNRDR   22 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~   22 (75)
                      +||+..+...++++|+.-.+.+
T Consensus         9 ~eA~~~l~~~l~~~~~~~~~~~   30 (83)
T PF01713_consen    9 EEALRALEEFLDEARQRGIREL   30 (83)
T ss_dssp             HHHHHHHHHHHHHHHHTTHSEE
T ss_pred             HHHHHHHHHHHHHHHHcCCCEE
Confidence            4799999999999987655554


No 50 
>PRK11830 dapD 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional
Probab=26.95  E-value=2e+02  Score=20.06  Aligned_cols=31  Identities=10%  Similarity=0.322  Sum_probs=19.9

Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH
Q 037321            6 HIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN   39 (75)
Q Consensus         6 ~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N   39 (75)
                      .++..||++|..  ++-+.+.. .+...++.+..
T Consensus         3 ~~~~~i~~~~~~--~~~~~~~~-~~~~~~~~~~~   33 (272)
T PRK11830          3 QLQKIIEEAWER--RAELTPAT-ADTEVREAVEE   33 (272)
T ss_pred             HHHHHHHHHHHh--HhhCCccC-CCHHHHHHHHH
Confidence            588999999987  44465544 45555444443


No 51 
>PF13904 DUF4207:  Domain of unknown function (DUF4207)
Probab=26.87  E-value=17  Score=24.98  Aligned_cols=14  Identities=14%  Similarity=0.434  Sum_probs=11.0

Q ss_pred             HHHHHHhhccccCC
Q 037321           62 TKARVLSLIVDPIA   75 (75)
Q Consensus        62 ~k~~I~sll~~Pi~   75 (75)
                      +.++|.-+|++|+|
T Consensus       250 Lrgt~s~~y~NP~p  263 (264)
T PF13904_consen  250 LRGTYSPIYINPIP  263 (264)
T ss_pred             ccCcCCccccCCCC
Confidence            66667788999986


No 52 
>cd00687 Terpene_cyclase_nonplant_C1 Non-plant Terpene Cyclases, Class 1. This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 with either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene. As with other enzymes with the 'terpenoid synthase fold', they have two conserved metal binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzymes. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function in the monomeric form and are found in
Probab=26.85  E-value=75  Score=21.47  Aligned_cols=22  Identities=14%  Similarity=0.166  Sum_probs=18.8

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHh
Q 037321            1 REAREHIHDLIAQTWMKMNRDR   22 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~   22 (75)
                      |+|++++.+++.+.+++..+..
T Consensus       244 ~eA~~~~~~~~~~~~~~f~~~~  265 (303)
T cd00687         244 EEAISVVRDMHNERITQFEELE  265 (303)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            5899999999999998877553


No 53 
>PHA02718 hypothetical protein; Provisional
Probab=26.70  E-value=21  Score=19.43  Aligned_cols=28  Identities=29%  Similarity=0.266  Sum_probs=19.9

Q ss_pred             HHHHHHhhccCCCCCCCchhhHHHHHHhhccc
Q 037321           41 ARMSQCMYQFGEGHGHGVQEITKARVLSLIVD   72 (75)
Q Consensus        41 aR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~   72 (75)
                      -|..+|+|-.+..||.-    +|+.+.|-+++
T Consensus        28 hrssdflylsdrdytsi----tketlvseive   55 (69)
T PHA02718         28 HRSSDFLYLSDRDYTSI----TKETLVSEIVE   55 (69)
T ss_pred             eeccceEEEcccccchh----hHHHHHHHHHH
Confidence            48889999988888743    56666655554


No 54 
>KOG0027 consensus Calmodulin and related proteins (EF-Hand superfamily) [Signal transduction mechanisms]
Probab=26.69  E-value=1.2e+02  Score=18.50  Aligned_cols=35  Identities=26%  Similarity=0.209  Sum_probs=26.4

Q ss_pred             HHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcccc
Q 037321           37 AMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIVDP   73 (75)
Q Consensus        37 ~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~~P   73 (75)
                      ...+.++.+.+-+++||+-.. .+ +...+++|-..|
T Consensus         7 ~~el~~~F~~fD~d~~G~i~~-~e-l~~~lr~lg~~~   41 (151)
T KOG0027|consen    7 ILELKEAFQLFDKDGDGKISV-EE-LGAVLRSLGQNP   41 (151)
T ss_pred             HHHHHHHHHHHCCCCCCcccH-HH-HHHHHHHcCCCC
Confidence            455677777878889987765 46 888888887765


No 55 
>smart00463 SMR Small MutS-related domain.
Probab=26.51  E-value=66  Score=17.50  Aligned_cols=19  Identities=16%  Similarity=0.190  Sum_probs=15.6

Q ss_pred             ChHHHHHHHHHHHHHHHHH
Q 037321            1 REAREHIHDLIAQTWMKMN   19 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln   19 (75)
                      +||+..+...+++.|+.-.
T Consensus        12 ~eA~~~l~~~l~~~~~~~~   30 (80)
T smart00463       12 EEALTALDKFLNNARLKGL   30 (80)
T ss_pred             HHHHHHHHHHHHHHHHcCC
Confidence            5899999999998887643


No 56 
>PF14555 UBA_4:  UBA-like domain; PDB: 2DAL_A 3BQ3_A 2L4E_A 2L4F_A 2DZL_A 2L2D_A 2DAM_A 1V92_A 3E21_A.
Probab=26.28  E-value=44  Score=16.38  Aligned_cols=18  Identities=17%  Similarity=0.242  Sum_probs=12.6

Q ss_pred             HHHHHHHHHHHhhccCCC
Q 037321           36 IAMNLARMSQCMYQFGEG   53 (75)
Q Consensus        36 ~~~NlaR~~~~~Y~~~Dg   53 (75)
                      .-+|+-+.++.+|..+++
T Consensus        25 ~~wdle~Av~~y~~~~~~   42 (43)
T PF14555_consen   25 NNWDLEAAVNAYFDDGEA   42 (43)
T ss_dssp             TTT-HHHHHHHHHHSS-S
T ss_pred             cCCCHHHHHHHHHhCCCC
Confidence            346899999999987765


No 57 
>COG3351 FlaD Putative archaeal flagellar protein D/E [Cell motility and secretion]
Probab=26.09  E-value=63  Score=21.96  Aligned_cols=41  Identities=12%  Similarity=0.219  Sum_probs=24.7

Q ss_pred             CcHHHHHHHHHHHHHHHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321           29 VSDVFVGIAMNLARMSQCMYQFGEGHGHGVQEITKARVLSLIV   71 (75)
Q Consensus        29 ~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~   71 (75)
                      ++..++...+.++|.+-.... .+.+..-... +.|||-||+|
T Consensus       140 Is~~v~~qLl~f~kg~~~~~~-~~~~~s~kLt-~~DHiiSLly  180 (214)
T COG3351         140 ISEKVLEQLLKFAKGTKVMND-DQEEKSIKLT-MQDHIISLLF  180 (214)
T ss_pred             ccHHHHHHHHHHHhcCCCCCh-hhhhhccCCC-HHHHHHHHHH
Confidence            455667777777777655321 1233322234 8999999976


No 58 
>PHA02896 A-type inclusion like protein; Provisional
Probab=25.78  E-value=81  Score=24.53  Aligned_cols=39  Identities=21%  Similarity=0.265  Sum_probs=30.5

Q ss_pred             hHHHHHHHHHHHHHHH-HHHHhhcCCCCCcHHHHHHHHHHHHHHH
Q 037321            2 EAREHIHDLIAQTWMK-MNRDRFVNPHFVSDVFVGIAMNLARMSQ   45 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~-ln~e~l~~~~~~p~~~~~~~~NlaR~~~   45 (75)
                      .-+..++.+|++.|.+ ||+    .+. +|+.-.+++=|+-|-.-
T Consensus        10 KcV~eFkSlVertWnk~Lns----~SC-IpRk~RKiIRNILR~YI   49 (616)
T PHA02896         10 KCIRKFESLIIRTWDHDLNE----RSF-LNRKDRKIIRNIFRCFI   49 (616)
T ss_pred             HHHHHHHHHHHHhhCCcccc----ccC-cCHHHHHHHHHHHHHHH
Confidence            4567899999999953 333    567 99999999999999543


No 59 
>PF15581 Imm35:  Immunity protein 35
Probab=25.25  E-value=37  Score=20.07  Aligned_cols=17  Identities=18%  Similarity=0.604  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 037321            5 EHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e   21 (75)
                      .-+..+|++.||-|-++
T Consensus        33 ~~l~~lIe~eWRGl~~~   49 (93)
T PF15581_consen   33 RNLESLIEHEWRGLPEE   49 (93)
T ss_pred             HHHHHHHHHHHcCCCHH
Confidence            34567899999875543


No 60 
>PRK01371 sec-independent translocase; Provisional
Probab=24.94  E-value=1.2e+02  Score=19.14  Aligned_cols=20  Identities=0%  Similarity=0.156  Sum_probs=17.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHh
Q 037321            3 AREHIHDLIAQTWMKMNRDR   22 (75)
Q Consensus         3 A~~~i~~~i~~~wK~ln~e~   22 (75)
                      ++..+|+++.+++.++.+|+
T Consensus        35 ~ir~~R~~~~~ak~~i~~El   54 (137)
T PRK01371         35 TLRQLREMANNARNDLRSEL   54 (137)
T ss_pred             HHHHHHHHHHHHHHHHHHHh
Confidence            67889999999999999887


No 61 
>KOG1766 consensus Enhancer of rudimentary [General function prediction only]
Probab=24.77  E-value=1.7e+02  Score=17.64  Aligned_cols=54  Identities=20%  Similarity=0.148  Sum_probs=36.0

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhc-CCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321            2 EAREHIHDLIAQTWMKMNRDRFV-NPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG   57 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e~l~-~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~   57 (75)
                      ||.+-|.+|-|+.-|+.|-..-. ... ++ .+-+++=.++-++..+|+..-|-..|
T Consensus        27 e~megiCk~yEe~Lkk~nPs~~~ITYD-Is-qlfeFiD~L~DlS~lVy~~~t~tY~p   81 (104)
T KOG1766|consen   27 ECMEGICKMYEEHLKKKNPSAPPITYD-IS-QLFEFIDDLADLSMLVYNRETGTYIP   81 (104)
T ss_pred             HHHHHHHHHHHHHHHhcCCCCCCccee-HH-HHHHHHHHHhhhhhhheecccccccC
Confidence            67788888888888887643221 112 33 34466667888888899887766666


No 62 
>smart00398 HMG high mobility group.
Probab=24.58  E-value=56  Score=16.74  Aligned_cols=17  Identities=12%  Similarity=0.489  Sum_probs=11.7

Q ss_pred             HHHHHHHHHHHHHHHHH
Q 037321            5 EHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e   21 (75)
                      ..+..++...|+.|..+
T Consensus        30 ~~i~~~~~~~W~~l~~~   46 (70)
T smart00398       30 AEISKKLGERWKLLSEE   46 (70)
T ss_pred             HHHHHHHHHHHHcCCHH
Confidence            45677788888776543


No 63 
>smart00872 Alpha-mann_mid Alpha mannosidase, middle domain. Members of this entry belong to the glycosyl hydrolase family 38, This domain, which is found in the central region adopts a structure consisting of three alpha helices, in an immunoglobulin/albumin-binding domain-like fold. The domain is predominantly found in the enzyme alpha-mannosidase PUBMED:12634058.
Probab=24.51  E-value=1.1e+02  Score=16.72  Aligned_cols=23  Identities=9%  Similarity=-0.028  Sum_probs=10.3

Q ss_pred             CcHHHHHHHHHHHHHHHHhhccCCCCC
Q 037321           29 VSDVFVGIAMNLARMSQCMYQFGEGHG   55 (75)
Q Consensus        29 ~p~~~~~~~~NlaR~~~~~Y~~~Dg~t   55 (75)
                      .|...++.++...    ++-+.-|..+
T Consensus        42 ~~~~~l~~~wk~l----l~~q~HD~i~   64 (79)
T smart00872       42 YPSEQLEELWKAL----LLNQFHDAIT   64 (79)
T ss_pred             CcHHHHHHHHHHH----HHhcCcccCC
Confidence            4555444444322    3344456554


No 64 
>PF09184 PPP4R2:  PPP4R2;  InterPro: IPR015267 PPP4R2 (protein phosphatase 4 core regulatory subunit R2) is the regulatory subunit of the histone H2A phosphatase complex. It has been shown to confer resistance to the anticancer drug cisplatin in yeast [], and may confer resistance in higher eukaryotes. 
Probab=24.48  E-value=1.4e+02  Score=20.92  Aligned_cols=19  Identities=16%  Similarity=0.237  Sum_probs=14.2

Q ss_pred             HHHHHHHHHHHHHHHHHhh
Q 037321            5 EHIHDLIAQTWMKMNRDRF   23 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e~l   23 (75)
                      ..++.++...+..+-.+++
T Consensus        42 ~~lk~l~~~kl~~v~~e~~   60 (288)
T PF09184_consen   42 SLLKSLFRHKLEKVIDEFF   60 (288)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4677777777777777777


No 65 
>KOG2204 consensus Mannosyl-oligosaccharide alpha-1,2-mannosidase and related glycosyl hydrolases [Carbohydrate transport and metabolism]
Probab=24.44  E-value=69  Score=25.07  Aligned_cols=22  Identities=18%  Similarity=0.626  Sum_probs=19.0

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhh
Q 037321            2 EAREHIHDLIAQTWMKMNRDRF   23 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e~l   23 (75)
                      ++|++|++||..+|-.--...+
T Consensus       174 ~rrekVkemm~hAw~~Yk~yaw  195 (625)
T KOG2204|consen  174 ERREKVKEMMKHAWNNYKKYAW  195 (625)
T ss_pred             HHHHHHHHHHHHHHHhhhhhcc
Confidence            6899999999999988776665


No 66 
>PF05002 SGS:  SGS domain ;  InterPro: IPR007699 This domain was thought to be unique to the SGT1-like proteins, but is also found in calcyclin binding proteins. Sgt1p is a highly conserved eukaryotic protein that is required for both SCF (Skp1p/Cdc53p-Cullin-F-box)-mediated ubiquitination and kinetochore function in yeast and also plays a role in the cAMP pathway. Calcyclin (S100A6) is a member of the S100A family of calcium binding proteins and appears to play a role in cell proliferation [].; PDB: 1X5M_A 2JTT_D.
Probab=23.94  E-value=1.2e+02  Score=17.54  Aligned_cols=25  Identities=20%  Similarity=0.377  Sum_probs=14.9

Q ss_pred             HHHHHHHhhccCCCCCCCchhhHHHHHHhhcc
Q 037321           40 LARMSQCMYQFGEGHGHGVQEITKARVLSLIV   71 (75)
Q Consensus        40 laR~~~~~Y~~~Dg~t~~~~~~~k~~I~sll~   71 (75)
                      +-.+++-||.+||.      + +|..|..-|.
T Consensus        26 lm~lfkkiY~~gDD------e-~KRam~KSf~   50 (82)
T PF05002_consen   26 LMNLFKKIYDNGDD------E-MKRAMMKSFT   50 (82)
T ss_dssp             HHHHHHHHHTTS-S------C-HHHHHHHHHH
T ss_pred             HHHHHHHHHccCCH------H-HHHHHHHHHH
Confidence            44567788998876      3 5555554443


No 67 
>PF02512 UK:  Virulence determinant
Probab=23.71  E-value=44  Score=19.52  Aligned_cols=21  Identities=24%  Similarity=0.411  Sum_probs=13.8

Q ss_pred             HHHHHHHHHHHHHhhcCCCCCcHHHH
Q 037321            9 DLIAQTWMKMNRDRFVNPHFVSDVFV   34 (75)
Q Consensus         9 ~~i~~~wK~ln~e~l~~~~~~p~~~~   34 (75)
                      +-|.+-||.+|+    ... +|-.|-
T Consensus        50 ewiaeywkgi~r----gnd-vpcccp   70 (96)
T PF02512_consen   50 EWIAEYWKGIKR----GND-VPCCCP   70 (96)
T ss_pred             HHHHHHHHhccc----CCC-CCccCC
Confidence            457778888887    444 665553


No 68 
>PF03564 DUF1759:  Protein of unknown function (DUF1759);  InterPro: IPR005312 This is a small family of proteins of unknown function. 
Probab=23.54  E-value=1.2e+02  Score=18.26  Aligned_cols=14  Identities=7%  Similarity=0.147  Sum_probs=8.6

Q ss_pred             HHHHHHHHHHhhcC
Q 037321           12 AQTWMKMNRDRFVN   25 (75)
Q Consensus        12 ~~~wK~ln~e~l~~   25 (75)
                      +.+|+.|.+.|=++
T Consensus        57 ~~a~~~L~~~yg~~   70 (145)
T PF03564_consen   57 EEAWELLEERYGNP   70 (145)
T ss_pred             HHHHHHHHHHhCCc
Confidence            56677766665443


No 69 
>PHA02935 Hypothetical protein; Provisional
Probab=23.52  E-value=1.1e+02  Score=21.16  Aligned_cols=49  Identities=10%  Similarity=0.167  Sum_probs=34.5

Q ss_pred             HHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCC---CCCCC
Q 037321            8 HDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGE---GHGHG   57 (75)
Q Consensus         8 ~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~D---g~t~~   57 (75)
                      +.+.|..|.-||..-=..+. .|.+--..+-+++-|+.-+-.+.|   +|+.|
T Consensus       256 rhicesewealnnnndnsss-mpashnnlandlssmmsqlqndnddnndysap  307 (349)
T PHA02935        256 RHICESEWEALNNNNDNSSS-MPASHNNLANDLSSMMSQLQNDNDDNNDYSAP  307 (349)
T ss_pred             hhhhHHHHHHhhcCCCcccc-CcchhcchhhHHHHHHHHhhccCCccccccCC
Confidence            45778999999876554555 888877888888877766554443   56655


No 70 
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=23.45  E-value=72  Score=16.09  Aligned_cols=16  Identities=13%  Similarity=0.586  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHHHHHHH
Q 037321            5 EHIHDLIAQTWMKMNR   20 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~   20 (75)
                      ..|..++...|+.|..
T Consensus        29 ~~i~~~~~~~W~~l~~   44 (66)
T cd00084          29 GEISKILGEMWKSLSE   44 (66)
T ss_pred             HHHHHHHHHHHHhCCH
Confidence            4567777778876543


No 71 
>PF14684 Tricorn_C1:  Tricorn protease C1 domain; PDB: 1N6F_D 1N6D_C 1N6E_C 1K32_A.
Probab=23.42  E-value=1.2e+02  Score=16.36  Aligned_cols=20  Identities=5%  Similarity=0.332  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHHHHHHHHhhc
Q 037321            5 EHIHDLIAQTWMKMNRDRFV   24 (75)
Q Consensus         5 ~~i~~~i~~~wK~ln~e~l~   24 (75)
                      ++=+.+.++.|+.+++.++.
T Consensus         2 ~E~~~~F~~~W~~~~~~f~d   21 (70)
T PF14684_consen    2 AEWRQMFDEAWRLVRENFYD   21 (70)
T ss_dssp             HHHHHHHHHHHHHHHHHSS-
T ss_pred             HHHHHHHHHHHHHHHHhcCC
Confidence            34567888899888887663


No 72 
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=23.35  E-value=68  Score=16.58  Aligned_cols=18  Identities=22%  Similarity=0.680  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q 037321            4 REHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         4 ~~~i~~~i~~~wK~ln~e   21 (75)
                      ...|..++...|+.|-.+
T Consensus        28 ~~~i~~~~~~~W~~l~~~   45 (69)
T PF00505_consen   28 NKEISKILAQMWKNLSEE   45 (69)
T ss_dssp             HHHHHHHHHHHHHCSHHH
T ss_pred             cccchhhHHHHHhcCCHH
Confidence            356778888889876543


No 73 
>PTZ00195 60S ribosomal protein L18; Provisional
Probab=21.82  E-value=82  Score=21.22  Aligned_cols=37  Identities=8%  Similarity=-0.032  Sum_probs=24.2

Q ss_pred             HHHHHHHHhhcCCCCCcHHHHHHHHHHHHHHHHhhccCCCCCCC
Q 037321           14 TWMKMNRDRFVNPHFVSDVFVGIAMNLARMSQCMYQFGEGHGHG   57 (75)
Q Consensus        14 ~wK~ln~e~l~~~~~~p~~~~~~~~NlaR~~~~~Y~~~Dg~t~~   57 (75)
                      -||.+-+..+.+..      ....+|++|+..++ +.+||.+.+
T Consensus        44 fnk~VakRL~~sr~------nRppVsLSrI~r~~-k~~d~~~~~   80 (198)
T PTZ00195         44 FNKVVYQRLIKSRS------NRAPISLSRIAVVM-KRKAVFTAK   80 (198)
T ss_pred             HHHHHHHHHhcccc------cCCceeHHHHHHHH-hcccccccc
Confidence            38888887775432      13457899998875 666665544


No 74 
>PF08153 NGP1NT:  NGP1NT (NUC091) domain;  InterPro: IPR012971 This N-terminal domain is found in a subfamily of hypothetical nucleolar GTP-binding proteins similar to human NGP1 [].
Probab=21.69  E-value=38  Score=21.28  Aligned_cols=35  Identities=9%  Similarity=0.018  Sum_probs=26.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHH
Q 037321            2 EAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIA   37 (75)
Q Consensus         2 eA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~   37 (75)
                      +|.+++++-+.+.-++=-.-.|+.+. +|.++++=.
T Consensus        47 ~~L~~FReem~~~~~DPy~VlLk~~K-LPmsLL~d~   81 (130)
T PF08153_consen   47 EALEKFREEMGEKVKDPYSVLLKRSK-LPMSLLQDS   81 (130)
T ss_pred             HHHHHHHHHHHHhhcCCceEeeeCCc-CCHHHhccc
Confidence            67788888888777776666677777 999987655


No 75 
>cd07023 S49_Sppa_N_C Signal peptide peptidase A (SppA), a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV): SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. This subfamily contains members with either a single domain (sometimes referred to as 36K type), such as sohB peptidase, protein C and archaeal signal peptide peptidase, or an amino-terminal domain in addition to the carboxyl-terminal protease domain that is conserved in all the S49 family members (sometimes referred to as 67K type), similar to E. coli and Arabidopsis thaliana SppA peptidases. Site-directed mutagenesis and sequence analysis have shown these SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad 
Probab=21.57  E-value=2e+02  Score=18.49  Aligned_cols=38  Identities=11%  Similarity=0.063  Sum_probs=27.1

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHHHHH
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDVFVGIAMN   39 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~~~~~~~N   39 (75)
                      ++.++.++.++++.|..+-+..-+... ++..-++..++
T Consensus       142 ~~~~e~~~~~l~~~~~~f~~~Va~~R~-~~~~~~~~~~~  179 (208)
T cd07023         142 EEERAILQALVDDIYDQFVDVVAEGRG-MSGERLDKLAD  179 (208)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhcCC-CCHHHHHHhcC
Confidence            357889999999999888887776555 65554444333


No 76 
>PF12763 EF-hand_4:  Cytoskeletal-regulatory complex EF hand; PDB: 2QPT_A 2KSP_A 2KFG_A 2JQ6_A 2KFH_A 2KFF_A 1IQ3_A 3FIA_A 2KHN_A 2KGR_A ....
Probab=21.49  E-value=89  Score=18.53  Aligned_cols=20  Identities=20%  Similarity=0.265  Sum_probs=15.5

Q ss_pred             HhhcCCCCCcHHHHHHHHHHH
Q 037321           21 DRFVNPHFVSDVFVGIAMNLA   41 (75)
Q Consensus        21 e~l~~~~~~p~~~~~~~~Nla   41 (75)
                      ++|..+. +|...+..+|++|
T Consensus        33 ~~f~~S~-L~~~~L~~IW~La   52 (104)
T PF12763_consen   33 EFFMKSG-LPRDVLAQIWNLA   52 (104)
T ss_dssp             HHHHHTT-SSHHHHHHHHHHH
T ss_pred             HHHHHcC-CCHHHHHHHHhhh
Confidence            3454567 9999999999986


No 77 
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target  proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=21.10  E-value=1.7e+02  Score=16.46  Aligned_cols=32  Identities=16%  Similarity=0.093  Sum_probs=18.2

Q ss_pred             HHHHHHHHHHHhh-ccCCCCCCCchhhHHHHHHh
Q 037321           36 IAMNLARMSQCMY-QFGEGHGHGVQEITKARVLS   68 (75)
Q Consensus        36 ~~~NlaR~~~~~Y-~~~Dg~t~~~~~~~k~~I~s   68 (75)
                      .+..+.++.+.+= ++|||+..+..+ ++..+.+
T Consensus         6 ~~~~l~~aF~~fD~~dgdG~~I~~~e-L~~ll~~   38 (88)
T cd05027           6 AMVALIDVFHQYSGREGDKHKLKKSE-LKELINN   38 (88)
T ss_pred             HHHHHHHHHHHhcccCCCcCEECHHH-HHHHHHH
Confidence            3444555554432 467886444445 8887776


No 78 
>PF10859 DUF2660:  Protein of unknown function (DUF2660);  InterPro: IPR022589 This is a bacterial group of proteins with unknown function found in Rickettsiales.
Probab=20.74  E-value=99  Score=17.87  Aligned_cols=13  Identities=15%  Similarity=0.496  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHHH
Q 037321            7 IHDLIAQTWMKMN   19 (75)
Q Consensus         7 i~~~i~~~wK~ln   19 (75)
                      +++.||-+||-|=
T Consensus        40 Lqe~iElSWkFLY   52 (86)
T PF10859_consen   40 LQERIELSWKFLY   52 (86)
T ss_pred             HHHHHHHHHHHHH
Confidence            6788999998654


No 79 
>PRK14857 tatA twin arginine translocase protein A; Provisional
Probab=20.50  E-value=1.9e+02  Score=16.91  Aligned_cols=22  Identities=5%  Similarity=-0.017  Sum_probs=18.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhc
Q 037321            3 AREHIHDLIAQTWMKMNRDRFV   24 (75)
Q Consensus         3 A~~~i~~~i~~~wK~ln~e~l~   24 (75)
                      ++..+++.+++.+.+++++.-+
T Consensus        37 ~i~~fkk~~~~~~~e~~~~~~~   58 (90)
T PRK14857         37 TLKGFQEASKEFENEIKREMAE   58 (90)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHh
Confidence            6788999999999999988753


No 80 
>PF00036 EF-hand_1:  EF hand;  InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=20.41  E-value=1.1e+02  Score=13.74  Aligned_cols=26  Identities=19%  Similarity=0.087  Sum_probs=16.5

Q ss_pred             HHHHHhhccCCCCCCCchhhHHHHHHhh
Q 037321           42 RMSQCMYQFGEGHGHGVQEITKARVLSL   69 (75)
Q Consensus        42 R~~~~~Y~~~Dg~t~~~~~~~k~~I~sl   69 (75)
                      ++...+=+++||+=.. .+ ++..+++|
T Consensus         4 ~~F~~~D~d~dG~I~~-~E-f~~~~~~L   29 (29)
T PF00036_consen    4 EAFREFDKDGDGKIDF-EE-FKEMMKKL   29 (29)
T ss_dssp             HHHHHHSTTSSSEEEH-HH-HHHHHHHT
T ss_pred             HHHHHHCCCCCCcCCH-HH-HHHHHHhC
Confidence            4455555788887655 35 77776654


No 81 
>cd07022 S49_Sppa_36K_type Signal peptide peptidase A (SppA) 36K type, a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV) 36K type: SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Members in this subfamily are all bacterial and include sohB peptidase and protein C. These are sometimes referred to as 36K type since they contain only one domain, unlike E. coli SppA that also contains an amino-terminal domain. Site-directed mutagenesis and sequence analysis have shown these SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad and not the usual Ser-His-Asp catalytic triad found in the majority of serine proteases.
Probab=20.37  E-value=2.6e+02  Score=18.19  Aligned_cols=31  Identities=16%  Similarity=0.139  Sum_probs=24.8

Q ss_pred             ChHHHHHHHHHHHHHHHHHHHhhcCCCCCcHH
Q 037321            1 REAREHIHDLIAQTWMKMNRDRFVNPHFVSDV   32 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e~l~~~~~~p~~   32 (75)
                      ++.++.++.++++.|...-+..-+... .+..
T Consensus       149 ~~~re~~~~~l~~~~~~f~~~V~~~R~-~~~~  179 (214)
T cd07022         149 DEARARLQAEVDALYAMFVAAVARNRG-LSAA  179 (214)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhCC-CCHH
Confidence            468899999999999999888876555 5544


No 82 
>PHA03369 capsid maturational protease; Provisional
Probab=20.28  E-value=2e+02  Score=22.85  Aligned_cols=23  Identities=13%  Similarity=0.323  Sum_probs=18.5

Q ss_pred             HHHHHHHHHHHHHHHHhhccCCC
Q 037321           31 DVFVGIAMNLARMSQCMYQFGEG   53 (75)
Q Consensus        31 ~~~~~~~~NlaR~~~~~Y~~~Dg   53 (75)
                      ..+.+.+.|+-|++..+|++.|+
T Consensus       299 e~~~~ql~~~~k~l~~~~~~kde  321 (663)
T PHA03369        299 EALADQLNNLYKLLRTIYKHKDE  321 (663)
T ss_pred             HHHHHHHHHHHHHHHHhccCccc
Confidence            34566778888999999999887


No 83 
>COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.27  E-value=1.6e+02  Score=16.69  Aligned_cols=21  Identities=14%  Similarity=0.327  Sum_probs=16.4

Q ss_pred             ChHHHHHHHHHHHHHHHHHHH
Q 037321            1 REAREHIHDLIAQTWMKMNRD   21 (75)
Q Consensus         1 EeA~~~i~~~i~~~wK~ln~e   21 (75)
                      |.+++++...+.+.|+.+-+-
T Consensus        21 E~tieeLn~~laEq~~~i~k~   41 (72)
T COG2900          21 EQTIEELNDALAEQQLVIDKL   41 (72)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            457788889999999887653


Done!