Query         psy4903
Match_columns 73
No_of_seqs    112 out of 1019
Neff          7.4 
Searched_HMMs 46136
Date          Fri Aug 16 20:48:35 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4903.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4903hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1214|consensus               99.6 3.4E-16 7.3E-21  118.3   6.3   66    8-73   1113-1182(1289)
  2 PF00058 Ldl_recept_b:  Low-den  99.6 8.9E-15 1.9E-19   74.7   5.9   39   18-56      1-42  (42)
  3 KOG1214|consensus               99.6 2.1E-15 4.7E-20  114.1   4.6   68    6-73   1025-1094(1289)
  4 KOG1215|consensus               99.2 1.5E-11 3.2E-16   93.4   4.8   66    7-72    524-593 (877)
  5 smart00135 LY Low-density lipo  98.9 2.4E-09 5.2E-14   53.0   3.7   34    5-38      8-43  (43)
  6 smart00135 LY Low-density lipo  98.8 1.2E-08 2.7E-13   50.4   4.1   34   40-73      2-35  (43)
  7 KOG1215|consensus               98.7 2.5E-08 5.3E-13   76.0   4.8   62    8-73      1-63  (877)
  8 PLN02919 haloacid dehalogenase  98.1   2E-05 4.3E-10   62.0   8.2   67    5-71    567-648 (1057)
  9 PF08450 SGL:  SMP-30/Gluconola  98.1 3.6E-05 7.8E-10   50.5   8.2   60    7-69      1-62  (246)
 10 PF03088 Str_synth:  Strictosid  98.0 3.8E-05 8.2E-10   44.8   6.4   62   10-72      2-82  (89)
 11 PF08450 SGL:  SMP-30/Gluconola  97.9 0.00014   3E-09   47.7   8.1   68    4-72    132-208 (246)
 12 PLN02919 haloacid dehalogenase  97.6  0.0004 8.7E-09   54.8   8.6   67    5-71    623-707 (1057)
 13 PF06977 SdiA-regulated:  SdiA-  97.2  0.0018 3.8E-08   43.9   6.2   63    3-66    168-240 (248)
 14 KOG1520|consensus               97.1  0.0012 2.5E-08   47.3   5.0   67    4-71    159-243 (376)
 15 PF03022 MRJP:  Major royal jel  97.0  0.0051 1.1E-07   42.2   7.5   60    7-67    187-255 (287)
 16 COG3386 Gluconolactonase [Carb  97.0  0.0032 6.9E-08   43.8   6.3   58    5-64    162-228 (307)
 17 COG3386 Gluconolactonase [Carb  96.9  0.0064 1.4E-07   42.3   7.1   60   11-72    116-188 (307)
 18 TIGR02604 Piru_Ver_Nterm putat  96.6  0.0074 1.6E-07   42.3   6.0   57    8-67    126-203 (367)
 19 KOG4659|consensus               96.6  0.0056 1.2E-07   49.9   5.5   63    4-67    473-553 (1899)
 20 PF01436 NHL:  NHL repeat;  Int  96.5  0.0087 1.9E-07   27.4   4.0   25   46-71      1-25  (28)
 21 PF07995 GSDH:  Glucose / Sorbo  96.5  0.0078 1.7E-07   41.8   5.4   62    8-70    116-204 (331)
 22 KOG4659|consensus               96.1   0.016 3.5E-07   47.4   5.8   61    7-68    408-495 (1899)
 23 PF00058 Ldl_recept_b:  Low-den  95.6   0.011 2.4E-07   29.6   2.1   15   59-73      1-16  (42)
 24 PF10282 Lactonase:  Lactonase,  95.5    0.18 3.9E-06   34.9   8.3   66    7-72    246-317 (345)
 25 PF01731 Arylesterase:  Arylest  95.3    0.07 1.5E-06   30.8   4.8   38   33-71     41-78  (86)
 26 TIGR03606 non_repeat_PQQ dehyd  95.1    0.11 2.4E-06   38.2   6.6   42   27-69    200-251 (454)
 27 KOG1520|consensus               94.8   0.097 2.1E-06   37.7   5.5   60    8-68    117-181 (376)
 28 PF10282 Lactonase:  Lactonase,  94.6    0.41 8.9E-06   33.1   8.1   64    8-71    194-269 (345)
 29 PRK11028 6-phosphogluconolacto  93.4     1.4   3E-05   29.8   8.8   62    8-69     37-102 (330)
 30 PF13449 Phytase-like:  Esteras  92.9    0.42   9E-06   33.1   5.7   57    9-66     88-166 (326)
 31 PF06977 SdiA-regulated:  SdiA-  92.9     1.6 3.5E-05   29.6   8.4   47    8-54     24-72  (248)
 32 COG3204 Uncharacterized protei  92.0    0.77 1.7E-05   32.4   6.1   62    3-65    230-301 (316)
 33 COG3391 Uncharacterized conser  91.7     2.3   5E-05   30.0   8.3   63    7-71    117-184 (381)
 34 PRK11028 6-phosphogluconolacto  90.5     2.4 5.1E-05   28.7   7.3   63    9-71    231-298 (330)
 35 PF03088 Str_synth:  Strictosid  90.2    0.46   1E-05   27.6   3.1   32    5-36     56-89  (89)
 36 PF03022 MRJP:  Major royal jel  88.5     1.3 2.8E-05   30.4   4.8   62   10-72    132-210 (287)
 37 KOG4499|consensus               88.2     2.7 5.9E-05   29.2   6.1   66    4-70    156-234 (310)
 38 PF07995 GSDH:  Glucose / Sorbo  88.2     3.4 7.3E-05   28.7   6.8   60    6-66      2-71  (331)
 39 PF01731 Arylesterase:  Arylest  87.8       1 2.2E-05   25.9   3.4   25    5-29     53-79  (86)
 40 PF05345 He_PIG:  Putative Ig d  86.4    0.88 1.9E-05   23.4   2.4   19    6-24     11-29  (49)
 41 COG3391 Uncharacterized conser  85.5     9.3  0.0002   27.0   7.9   60    7-67     75-136 (381)
 42 COG2133 Glucose/sorbosone dehy  85.1       4 8.7E-05   29.7   5.9   59    9-69    180-261 (399)
 43 TIGR02604 Piru_Ver_Nterm putat  84.9     4.1 8.9E-05   28.5   5.8   60    5-67     71-143 (367)
 44 COG2706 3-carboxymuconate cycl  83.9      14  0.0003   26.6   8.5   65    8-72    193-269 (346)
 45 KOG4499|consensus               81.6     4.4 9.6E-05   28.2   4.8   62    3-65    209-296 (310)
 46 PF13449 Phytase-like:  Esteras  75.7      16 0.00034   25.3   6.2   58    9-68     23-105 (326)
 47 KOG4649|consensus               74.2     3.6 7.7E-05   29.1   2.7   24   11-34     36-61  (354)
 48 PF05787 DUF839:  Bacterial pro  70.4      36 0.00078   25.6   7.3   56    5-60    349-449 (524)
 49 COG2706 3-carboxymuconate cycl  70.0      38 0.00083   24.4   8.2   64    8-71     42-113 (346)
 50 COG3823 Glutamine cyclotransfe  67.5      11 0.00024   25.8   3.8   21   45-65    227-247 (262)
 51 PF02333 Phytase:  Phytase;  In  62.4      37  0.0008   24.7   5.9   48    7-54    209-264 (381)
 52 TIGR03606 non_repeat_PQQ dehyd  62.3      62  0.0013   24.0   7.3   60    5-65     29-103 (454)
 53 PF06739 SBBP:  Beta-propeller   61.2      16 0.00034   17.6   2.8   19   46-65     12-30  (38)
 54 COG3204 Uncharacterized protei  60.5      22 0.00049   25.2   4.4   47    8-54     88-136 (316)
 55 TIGR03075 PQQ_enz_alc_DH PQQ-d  60.2      26 0.00057   26.1   5.0   48   11-65    239-287 (527)
 56 TIGR02658 TTQ_MADH_Hv methylam  59.8      62  0.0013   23.2   7.9   57   11-69    253-322 (352)
 57 PF12814 Mcp5_PH:  Meiotic cell  57.7      18 0.00039   21.7   3.2   17   11-27     35-52  (123)
 58 TIGR02276 beta_rpt_yvtn 40-res  57.3      19 0.00042   16.6   5.5   40   15-55      1-42  (42)
 59 PF00930 DPPIV_N:  Dipeptidyl p  54.7      44 0.00095   23.2   5.1   48   11-59    286-340 (353)
 60 PF14759 Reductase_C:  Reductas  53.7      29 0.00062   19.3   3.4   26   21-46      2-29  (85)
 61 PLN03216 actin depolymerizing   53.1     6.2 0.00013   24.4   0.6   15    1-16      1-15  (141)
 62 PF06433 Me-amine-dh_H:  Methyl  51.0      68  0.0015   23.1   5.6   49   16-64    194-255 (342)
 63 KOG4649|consensus               49.7      17 0.00038   25.8   2.4   26   48-73     32-57  (354)
 64 PF03664 Glyco_hydro_62:  Glyco  46.9      49  0.0011   23.0   4.2   30   14-43    141-179 (271)
 65 TIGR03866 PQQ_ABC_repeats PQQ-  46.5      75  0.0016   20.2   8.4   58    8-66     33-92  (300)
 66 COG3823 Glutamine cyclotransfe  45.0      44 0.00096   23.0   3.7   29    7-35    230-260 (262)
 67 PF10349 WWbp:  WW-domain ligan  42.2      14 0.00031   22.2   1.0   15    2-16     10-24  (116)
 68 PRK04043 tolB translocation pr  41.4 1.3E+02  0.0029   21.6   6.4   51   13-66    284-338 (419)
 69 TIGR03866 PQQ_ABC_repeats PQQ-  40.7      95   0.002   19.7   8.8   59    8-67    209-269 (300)
 70 PF05096 Glu_cyclase_2:  Glutam  40.4      29 0.00063   24.0   2.4   29    6-34    231-261 (264)
 71 PF14251 DUF4346:  Domain of un  40.2      20 0.00042   22.1   1.4   17    8-24     42-58  (119)
 72 TIGR03032 conserved hypothetic  38.0      98  0.0021   22.3   4.7   46   17-66    212-260 (335)
 73 TIGR02658 TTQ_MADH_Hv methylam  34.9 1.7E+02  0.0036   21.0   5.6   52   14-65    202-266 (352)
 74 cd01213 tensin Tensin Phosphot  33.3      84  0.0018   19.6   3.5   38    3-41     48-88  (138)
 75 PF01011 PQQ:  PQQ enzyme repea  32.3      48   0.001   15.4   1.9   12   12-23     14-25  (38)
 76 PF08310 LGFP:  LGFP repeat;  I  30.6      58  0.0013   16.6   2.1   15   10-24     16-30  (54)
 77 PF02239 Cytochrom_D1:  Cytochr  30.4   2E+02  0.0044   20.4   7.9   57   12-69     43-100 (369)
 78 PF11314 DUF3117:  Protein of u  30.2      73  0.0016   16.6   2.4   19   25-43     17-35  (51)
 79 PF02333 Phytase:  Phytase;  In  30.1 2.2E+02  0.0048   20.8   7.3   55   14-68    166-229 (381)
 80 KOG3567|consensus               29.0      61  0.0013   24.5   2.6   56    8-67    404-485 (501)
 81 PF01267 F-actin_cap_A:  F-acti  28.7 1.4E+02   0.003   20.6   4.2   43   29-71     52-94  (271)
 82 PF00930 DPPIV_N:  Dipeptidyl p  25.6 2.3E+02   0.005   19.5   5.4   53   15-67    245-301 (353)
 83 PRK04922 tolB translocation pr  25.0 2.6E+02  0.0056   19.9   7.5   55   11-66    297-355 (433)
 84 PLN02553 inositol-phosphate ph  24.5      56  0.0012   21.8   1.7   15   10-24    121-135 (270)
 85 cd01643 Bacterial_IMPase_like_  24.4      59  0.0013   21.4   1.8   16    9-24    106-121 (242)
 86 TIGR02067 his_9_proposed histi  24.2      62  0.0013   21.4   1.9   16    9-24    109-124 (251)
 87 PF13804 HERV-K_env_2:  Retro-t  23.3 2.1E+02  0.0046   18.6   4.1   18   51-68    147-164 (170)
 88 PF02820 MBT:  mbt repeat;  Int  23.1 1.4E+02  0.0029   16.1   2.8   14   11-24      4-17  (73)
 89 TIGR02647 DNA conserved hypoth  23.0      27 0.00059   19.8  -0.0   17    6-24     43-59  (77)
 90 PF14884 EFF-AFF:  Type I membr  22.9 1.5E+02  0.0032   23.1   3.7   31   25-55    371-401 (589)
 91 smart00564 PQQ beta-propeller   22.7      85  0.0018   13.5   1.9   10   13-22     21-30  (33)
 92 cd01248 PH_PLC Phospholipase C  22.6 1.7E+02  0.0037   16.9   4.2   23   45-67     19-41  (115)
 93 cd00472 Ribosomal_L24e_L24 Rib  22.3 1.4E+02  0.0029   15.7   3.6   28    2-29      2-29  (54)
 94 TIGR01331 bisphos_cysQ 3'(2'),  22.1      73  0.0016   21.0   1.9   16    9-24    111-126 (249)
 95 PRK09439 PTS system glucose-sp  21.7      84  0.0018   20.2   2.0   14    8-21     48-61  (169)
 96 cd01638 CysQ CysQ, a 3'-Phosph  21.5      72  0.0016   20.9   1.7   15   10-24    110-124 (242)
 97 KOG0646|consensus               21.4 3.7E+02   0.008   20.4   5.8   21   46-66    217-237 (476)
 98 PF09917 DUF2147:  Uncharacteri  21.2      80  0.0017   18.4   1.8   15    8-22     70-84  (114)
 99 COG5649 Uncharacterized conser  21.1      84  0.0018   19.4   1.8   15    3-17     75-90  (132)
100 PF14583 Pectate_lyase22:  Olig  21.0 2.9E+02  0.0063   20.3   4.8   43   25-67     59-101 (386)
101 PRK10757 inositol monophosphat  21.0      75  0.0016   21.3   1.8   16    9-24    113-128 (267)

No 1  
>KOG1214|consensus
Probab=99.64  E-value=3.4e-16  Score=118.32  Aligned_cols=66  Identities=18%  Similarity=0.241  Sum_probs=63.5

Q ss_pred             ccceEEeCCCCeEEEeC----CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903           8 SSGENRTREQALMFVAD----SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYIET   73 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd----~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie~   73 (73)
                      |+++|+||..|.|||||    +++|+++.|||.++++++++++..|+||++|+.++.|+|+|.+..+.||
T Consensus      1113 PR~iv~D~~rgnLYwtDWnRenPkIets~mDG~NrRilin~DigLPNGLtfdpfs~~LCWvDAGt~rleC 1182 (1289)
T KOG1214|consen 1113 PRAIVVDPIRGNLYWTDWNRENPKIETSSMDGENRRILINTDIGLPNGLTFDPFSKLLCWVDAGTKRLEC 1182 (1289)
T ss_pred             cceEEeecccCceeeccccccCCcceeeccCCccceEEeecccCCCCCceeCcccceeeEEecCCcceeE
Confidence            78889999999999999    6999999999999999999999999999999999999999999998885


No 2  
>PF00058 Ldl_recept_b:  Low-density lipoprotein receptor repeat class B;  InterPro: IPR000033  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=99.58  E-value=8.9e-15  Score=74.66  Aligned_cols=39  Identities=26%  Similarity=0.514  Sum_probs=36.8

Q ss_pred             CeEEEeC--CC-cEEEEEcCCCCeEEEEeCCCceeeEEEEcC
Q psy4903          18 ALMFVAD--SN-QILRTNMDGTMAMSIVSEAAYKASGVALDI   56 (73)
Q Consensus        18 g~lywtd--~~-~I~ra~ldGs~~~~l~~~~~~~p~~lavD~   56 (73)
                      |+|||+|  .. +|++++|||+++++++.+++..|.||||||
T Consensus         1 ~~iYWtD~~~~~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~   42 (42)
T PF00058_consen    1 GKIYWTDWSQDPSIERANLDGSNRRTVISDDLQHPEGIAVDW   42 (42)
T ss_dssp             TEEEEEETTTTEEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred             CEEEEEECCCCcEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence            6899999  45 999999999999999999999999999996


No 3  
>KOG1214|consensus
Probab=99.57  E-value=2.1e-15  Score=114.06  Aligned_cols=68  Identities=24%  Similarity=0.487  Sum_probs=64.6

Q ss_pred             eeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903           6 KFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYIET   73 (73)
Q Consensus         6 ~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie~   73 (73)
                      +++.||..|.+..++||||  .+.|.||+|+|+..++++++++.+|+||||||..+++||+|+.++.||.
T Consensus      1025 ~IiVGidfDC~e~mvyWtDv~g~SI~rasL~G~Ep~ti~n~~L~SPEGiAVDh~~Rn~ywtDS~lD~Iev 1094 (1289)
T KOG1214|consen 1025 SIIVGIDFDCRERMVYWTDVAGRSISRASLEGAEPETIVNSGLISPEGIAVDHIRRNMYWTDSVLDKIEV 1094 (1289)
T ss_pred             ceeeeeecccccceEEEeecCCCccccccccCCCCceeecccCCCccceeeeeccceeeeeccccchhhe
Confidence            4578999999999999999  7999999999999999999999999999999999999999999999973


No 4  
>KOG1215|consensus
Probab=99.21  E-value=1.5e-11  Score=93.36  Aligned_cols=66  Identities=30%  Similarity=0.551  Sum_probs=61.1

Q ss_pred             eccceEEeCCCCeEEEeC---CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCC-eee
Q psy4903           7 FSSGENRTREQALMFVAD---SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLD-YIE   72 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd---~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~-~ie   72 (73)
                      -++++++||..|++||+|   .++|+|+.|||+.+++++..++.+|+||++|...+++||+|.... .|+
T Consensus       524 ~~r~~~v~p~~g~~~wtd~~~~~~i~ra~~dg~~~~~l~~~~~~~p~glt~d~~~~~~yw~d~~~~~~i~  593 (877)
T KOG1215|consen  524 LPRSIAVDPEKGLMFWTDWGQPPRIERASLDGSERAVLVTNGILWPNGLTIDYETDRLYWADAKLDYTIE  593 (877)
T ss_pred             CccceeeccccCeeEEecCCCCchhhhhcCCCCCceEEEeCCccCCCcceEEeecceeEEEcccCCccee
Confidence            567889999999999999   368999999999999999999999999999999999999999988 554


No 5  
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.90  E-value=2.4e-09  Score=53.03  Aligned_cols=34  Identities=21%  Similarity=0.264  Sum_probs=30.7

Q ss_pred             eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCe
Q psy4903           5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGTMA   38 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~   38 (73)
                      +..|.|+++||..++|||+|  ...|+|+.+||+++
T Consensus         8 ~~~~~~la~d~~~~~lYw~D~~~~~I~~~~~~g~~~   43 (43)
T smart00135        8 LGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR   43 (43)
T ss_pred             CCCcCEEEEeecCCEEEEEeCCCCEEEEEeCCCCCC
Confidence            45789999999999999999  69999999999863


No 6  
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.79  E-value=1.2e-08  Score=50.41  Aligned_cols=34  Identities=38%  Similarity=0.589  Sum_probs=30.1

Q ss_pred             EEEeCCCceeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903          40 SIVSEAAYKASGVALDINAKRLFWCDNLLDYIET   73 (73)
Q Consensus        40 ~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie~   73 (73)
                      +++...+..|+|||+|+.+++|||+|.....|++
T Consensus         2 ~~~~~~~~~~~~la~d~~~~~lYw~D~~~~~I~~   35 (43)
T smart00135        2 TLLSEGLGHPNGLAVDWIEGRLYWTDWGLDVIEV   35 (43)
T ss_pred             EEEECCCCCcCEEEEeecCCEEEEEeCCCCEEEE
Confidence            4566789999999999999999999999988863


No 7  
>KOG1215|consensus
Probab=98.68  E-value=2.5e-08  Score=76.03  Aligned_cols=62  Identities=24%  Similarity=0.379  Sum_probs=55.8

Q ss_pred             ccceEEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903           8 SSGENRTREQALMFVAD-SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYIET   73 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie~   73 (73)
                      |+.++++|..+.|||+| ...|+    ||+.+..+....+.+|+|++||+..+++||+|.+.+.||+
T Consensus         1 pra~~~~p~~~~l~wtd~~~~i~----dg~~~~~~~~~~~~~~ng~~id~~~~~~y~~d~~~~~i~~   63 (877)
T KOG1215|consen    1 PRAIALLPLEGLLFWTDWGANIE----DGGERKILEKEEFEWPNGLTIDLAWQRIYWADAKNDLIES   63 (877)
T ss_pred             CceEEeccccceEEEecCCcccc----cCcceEEeeccceeCCCcceecchhheeeeccccCCceEE
Confidence            56889999999999999 33466    8888999999999999999999999999999999998874


No 8  
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.10  E-value=2e-05  Score=61.96  Aligned_cols=67  Identities=22%  Similarity=0.357  Sum_probs=54.4

Q ss_pred             eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeC-------------CCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903           5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSE-------------AAYKASGVALDINAKRLFWCDNLLD   69 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~-------------~~~~p~~lavD~~~~~LYW~D~~~~   69 (73)
                      +.||.|+++|+.+|.+|.+|  +.+|.+.+.+|.....+-..             .+..|.|+++|..++.||++|...+
T Consensus       567 l~~P~gvavd~~~g~lyVaDs~n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaDt~n~  646 (1057)
T PLN02919        567 LKFPGKLAIDLLNNRLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVADTENH  646 (1057)
T ss_pred             CCCCceEEEECCCCeEEEEECCCCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEeCCCc
Confidence            57899999999999999999  79999999998644333321             2567999999998889999998765


Q ss_pred             ee
Q psy4903          70 YI   71 (73)
Q Consensus        70 ~i   71 (73)
                      +|
T Consensus       647 ~I  648 (1057)
T PLN02919        647 AL  648 (1057)
T ss_pred             eE
Confidence            54


No 9  
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=98.08  E-value=3.6e-05  Score=50.46  Aligned_cols=60  Identities=20%  Similarity=0.222  Sum_probs=49.0

Q ss_pred             eccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903           7 FSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLD   69 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~   69 (73)
                      |+-|.+.||.+|.|||+|  +.+|.|...++...+++....   |.|++++...++||.++....
T Consensus         1 l~Egp~~d~~~g~l~~~D~~~~~i~~~~~~~~~~~~~~~~~---~~G~~~~~~~g~l~v~~~~~~   62 (246)
T PF08450_consen    1 LGEGPVWDPRDGRLYWVDIPGGRIYRVDPDTGEVEVIDLPG---PNGMAFDRPDGRLYVADSGGI   62 (246)
T ss_dssp             CEEEEEEETTTTEEEEEETTTTEEEEEETTTTEEEEEESSS---EEEEEEECTTSEEEEEETTCE
T ss_pred             CCcceEEECCCCEEEEEEcCCCEEEEEECCCCeEEEEecCC---CceEEEEccCCEEEEEEcCce
Confidence            467899999999999999  799999999886555433323   999999977799999987643


No 10 
>PF03088 Str_synth:  Strictosidine synthase;  InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=98.01  E-value=3.8e-05  Score=44.83  Aligned_cols=62  Identities=15%  Similarity=0.244  Sum_probs=45.2

Q ss_pred             ceEEeCCCCeEEEeC-------------------CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCe
Q psy4903          10 GENRTREQALMFVAD-------------------SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDY   70 (73)
Q Consensus        10 g~~~dp~~g~lywtd-------------------~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~   70 (73)
                      ++.+++..|.+||||                   ++++.+-+.... +..++-.++..|+|+++...++.|..++....+
T Consensus         2 dldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~-~~~vl~~~L~fpNGVals~d~~~vlv~Et~~~R   80 (89)
T PF03088_consen    2 DLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTK-ETTVLLDGLYFPNGVALSPDESFVLVAETGRYR   80 (89)
T ss_dssp             EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTT-EEEEEEEEESSEEEEEE-TTSSEEEEEEGGGTE
T ss_pred             ceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCC-eEEEehhCCCccCeEEEcCCCCEEEEEeccCce
Confidence            688999999999998                   257777666654 444555689999999999999999999887776


Q ss_pred             ee
Q psy4903          71 IE   72 (73)
Q Consensus        71 ie   72 (73)
                      |-
T Consensus        81 i~   82 (89)
T PF03088_consen   81 IL   82 (89)
T ss_dssp             EE
T ss_pred             EE
Confidence            63


No 11 
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=97.88  E-value=0.00014  Score=47.70  Aligned_cols=68  Identities=21%  Similarity=0.242  Sum_probs=50.7

Q ss_pred             eeeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCC-----eEEEEeC--CCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903           4 KMKFSSGENRTREQALMFVAD--SNQILRTNMDGTM-----AMSIVSE--AAYKASGVALDINAKRLFWCDNLLDYIE   72 (73)
Q Consensus         4 ~~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~-----~~~l~~~--~~~~p~~lavD~~~~~LYW~D~~~~~ie   72 (73)
                      .+.+|.||+++|..+.||+++  ..+|.+..++...     ++++..-  ....|.|+++|. ..+||-++...+.|.
T Consensus       132 ~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~-~G~l~va~~~~~~I~  208 (246)
T PF08450_consen  132 GLGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDS-DGNLWVADWGGGRIV  208 (246)
T ss_dssp             EESSEEEEEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBT-TS-EEEEEETTTEEE
T ss_pred             CcccccceEECCcchheeecccccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcC-CCCEEEEEcCCCEEE
Confidence            466799999999999999999  6899999997432     3444432  223699999996 557898877666653


No 12 
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.65  E-value=0.0004  Score=54.80  Aligned_cols=67  Identities=10%  Similarity=0.173  Sum_probs=52.5

Q ss_pred             eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeC----------------CCceeeEEEEcCCCCeEEEEeC
Q psy4903           5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSE----------------AAYKASGVALDINAKRLFWCDN   66 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~----------------~~~~p~~lavD~~~~~LYW~D~   66 (73)
                      +..|.|+++||..+.||++|  +.+|.+.++.+....++...                .+..|.++++|..++.||.+|.
T Consensus       623 f~~P~GIavd~~gn~LYVaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~  702 (1057)
T PLN02919        623 FNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMA  702 (1057)
T ss_pred             cCCCcEEEEeCCCCEEEEEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEEC
Confidence            34589999999999999999  68888888766544444321                1567999999998999999998


Q ss_pred             CCCee
Q psy4903          67 LLDYI   71 (73)
Q Consensus        67 ~~~~i   71 (73)
                      ..++|
T Consensus       703 ~~~~I  707 (1057)
T PLN02919        703 GQHQI  707 (1057)
T ss_pred             CCCeE
Confidence            76655


No 13 
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=97.18  E-value=0.0018  Score=43.91  Aligned_cols=63  Identities=16%  Similarity=0.247  Sum_probs=38.1

Q ss_pred             eeeeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEE--Ee------CCCceeeEEEEcCCCCeEEEEeC
Q psy4903           3 FKMKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSI--VS------EAAYKASGVALDINAKRLFWCDN   66 (73)
Q Consensus         3 ~~~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l--~~------~~~~~p~~lavD~~~~~LYW~D~   66 (73)
                      +.+.=+||+.+||.+|.+|--+  ++.|.....+|.-...+  ..      ..+.+|+|+|+|. ..+||-+..
T Consensus       168 ~~~~d~S~l~~~p~t~~lliLS~es~~l~~~d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~-~G~LYIvsE  240 (248)
T PF06977_consen  168 LFVRDLSGLSYDPRTGHLLILSDESRLLLELDRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDP-DGNLYIVSE  240 (248)
T ss_dssp             --SS---EEEEETTTTEEEEEETTTTEEEEE-TT--EEEEEE-STTGGG-SS---SEEEEEE-T-T--EEEEET
T ss_pred             ceeccccceEEcCCCCeEEEEECCCCeEEEECCCCCEEEEEEeCCcccCcccccCCccEEEECC-CCCEEEEcC
Confidence            4455689999999999998887  68899988888744433  22      3567899999997 558887654


No 14 
>KOG1520|consensus
Probab=97.10  E-value=0.0012  Score=47.29  Aligned_cols=67  Identities=13%  Similarity=0.292  Sum_probs=50.7

Q ss_pred             eeeeccceEEeCCCCeEEEeC------CCcEEEEEcCCC--C----------eEEEEeCCCceeeEEEEcCCCCeEEEEe
Q psy4903           4 KMKFSSGENRTREQALMFVAD------SNQILRTNMDGT--M----------AMSIVSEAAYKASGVALDINAKRLFWCD   65 (73)
Q Consensus         4 ~~~~~~g~~~dp~~g~lywtd------~~~I~ra~ldGs--~----------~~~l~~~~~~~p~~lavD~~~~~LYW~D   65 (73)
                      +++|..++.+|| +|-+||||      .....-|.+.|.  +          ...++-.+++-|+|+|+-+.+..+-.|+
T Consensus       159 ~~kf~N~ldI~~-~g~vyFTDSSsk~~~rd~~~a~l~g~~~GRl~~YD~~tK~~~VLld~L~F~NGlaLS~d~sfvl~~E  237 (376)
T KOG1520|consen  159 PFKFLNDLDIDP-EGVVYFTDSSSKYDRRDFVFAALEGDPTGRLFRYDPSTKVTKVLLDGLYFPNGLALSPDGSFVLVAE  237 (376)
T ss_pred             eeeecCceeEcC-CCeEEEeccccccchhheEEeeecCCCccceEEecCcccchhhhhhcccccccccCCCCCCEEEEEe
Confidence            678999999999 99999999      145666666662  1          2224446899999999998898887777


Q ss_pred             CCCCee
Q psy4903          66 NLLDYI   71 (73)
Q Consensus        66 ~~~~~i   71 (73)
                      ....+|
T Consensus       238 t~~~ri  243 (376)
T KOG1520|consen  238 TTTARI  243 (376)
T ss_pred             ecccee
Confidence            765554


No 15 
>PF03022 MRJP:  Major royal jelly protein;  InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=97.03  E-value=0.0051  Score=42.20  Aligned_cols=60  Identities=18%  Similarity=0.142  Sum_probs=44.6

Q ss_pred             eccceEEeCCCCeEEEeC--CCcEEEEEcCC----CCeEEEEe-CC-CceeeEEEEcC-CCCeEEEEeCC
Q psy4903           7 FSSGENRTREQALMFVAD--SNQILRTNMDG----TMAMSIVS-EA-AYKASGVALDI-NAKRLFWCDNL   67 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd--~~~I~ra~ldG----s~~~~l~~-~~-~~~p~~lavD~-~~~~LYW~D~~   67 (73)
                      -+.|++.|+ +|.||+++  +..|.+.+.++    .+..+|.. .. +.+|.+++++. ..+.||..-..
T Consensus       187 ~s~g~~~D~-~G~ly~~~~~~~aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~snr  255 (287)
T PF03022_consen  187 QSDGMAIDP-NGNLYFTDVEQNAIGCWDPDGPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSNR  255 (287)
T ss_dssp             SECEEEEET-TTEEEEEECCCTEEEEEETTTSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-S
T ss_pred             CCceEEECC-CCcEEEecCCCCeEEEEeCCCCcCccchheeEEcCceeeccceeeeccccCceEEEEECc
Confidence            568999999 99999999  68999999998    34444543 44 89999999996 35677776544


No 16 
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=97.00  E-value=0.0032  Score=43.80  Aligned_cols=58  Identities=24%  Similarity=0.440  Sum_probs=44.0

Q ss_pred             eeeccceEEeCCCCeEEEeC--CCcEEEEEcC---CC--Ce--EEEEeCCCceeeEEEEcCCCCeEEEE
Q psy4903           5 MKFSSGENRTREQALMFVAD--SNQILRTNMD---GT--MA--MSIVSEAAYKASGVALDINAKRLFWC   64 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ld---Gs--~~--~~l~~~~~~~p~~lavD~~~~~LYW~   64 (73)
                      +.++.||+.+|....||++|  ..+|.|..+|   |.  ++  .+.....-..|.|+++|-.+  .||+
T Consensus       162 ~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG--~lw~  228 (307)
T COG3386         162 LTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDADG--NLWV  228 (307)
T ss_pred             EEecCceEECCCCCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCCC--CEEE
Confidence            67899999999999999999  6999999998   32  22  23333345789999999444  4563


No 17 
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=96.90  E-value=0.0064  Score=42.31  Aligned_cols=60  Identities=17%  Similarity=0.321  Sum_probs=44.6

Q ss_pred             eEEeCCCCeEEEeC-C------------CcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903          11 ENRTREQALMFVAD-S------------NQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYIE   72 (73)
Q Consensus        11 ~~~dp~~g~lywtd-~------------~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie   72 (73)
                      .++||. |.+|+++ +            ..++|..-+| ....++...+..|+|||+++.++.+|++|...++|.
T Consensus       116 ~~v~pd-G~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g-~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~  188 (307)
T COG3386         116 GVVDPD-GRIWFGDMGYFDLGKSEERPTGSLYRVDPDG-GVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIH  188 (307)
T ss_pred             eeEcCC-CCEEEeCCCccccCccccCCcceEEEEcCCC-CEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEE
Confidence            345565 6667665 3            3577777644 455566667999999999999999999999887764


No 18 
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=96.65  E-value=0.0074  Score=42.27  Aligned_cols=57  Identities=19%  Similarity=0.314  Sum_probs=45.3

Q ss_pred             ccceEEeCCCCeEEEeCC---------------------CcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903           8 SSGENRTREQALMFVADS---------------------NQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDN   66 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd~---------------------~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~   66 (73)
                      +.|++++| .|+||+++.                     ..|.|.+-||+..+ ++...+..|.|+++|. ...||.+|.
T Consensus       126 ~~~l~~gp-DG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e-~~a~G~rnp~Gl~~d~-~G~l~~tdn  202 (367)
T TIGR02604       126 LNSLAWGP-DGWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLR-VVAHGFQNPYGHSVDS-WGDVFFCDN  202 (367)
T ss_pred             ccCceECC-CCCEEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEE-EEecCcCCCccceECC-CCCEEEEcc
Confidence            56888888 689999761                     35899999987766 4567899999999997 567888887


Q ss_pred             C
Q psy4903          67 L   67 (73)
Q Consensus        67 ~   67 (73)
                      .
T Consensus       203 ~  203 (367)
T TIGR02604       203 D  203 (367)
T ss_pred             C
Confidence            4


No 19 
>KOG4659|consensus
Probab=96.59  E-value=0.0056  Score=49.87  Aligned_cols=63  Identities=17%  Similarity=0.242  Sum_probs=45.3

Q ss_pred             eeeeccceEEeCCCCeEEEeCCCcEEEEEcCCCCe------------------EEEEeCCCceeeEEEEcCCCCeEEEEe
Q psy4903           4 KMKFSSGENRTREQALMFVADSNQILRTNMDGTMA------------------MSIVSEAAYKASGVALDINAKRLFWCD   65 (73)
Q Consensus         4 ~~~~~~g~~~dp~~g~lywtd~~~I~ra~ldGs~~------------------~~l~~~~~~~p~~lavD~~~~~LYW~D   65 (73)
                      +.-+|+|+++|. .|.||+.|...|-...-+|--.                  +.|..-.++||..||||+..+.||-.|
T Consensus       473 ~L~~PkGIa~dk-~g~lYfaD~t~IR~iD~~giIstlig~~~~~~~p~~C~~~~kl~~~~leWPT~LaV~Pmdnsl~Vld  551 (1899)
T KOG4659|consen  473 QLIFPKGIAFDK-MGNLYFADGTRIRVIDTTGIISTLIGTTPDQHPPRTCAQITKLVDLQLEWPTSLAVDPMDNSLLVLD  551 (1899)
T ss_pred             eeccCCceeEcc-CCcEEEecccEEEEeccCceEEEeccCCCCccCccccccccchhheeeecccceeecCCCCeEEEee
Confidence            456899999996 6799999955555544443211                  112222788999999999999999888


Q ss_pred             CC
Q psy4903          66 NL   67 (73)
Q Consensus        66 ~~   67 (73)
                      ..
T Consensus       552 ~n  553 (1899)
T KOG4659|consen  552 TN  553 (1899)
T ss_pred             cc
Confidence            63


No 20 
>PF01436 NHL:  NHL repeat;  InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ].  The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=96.53  E-value=0.0087  Score=27.44  Aligned_cols=25  Identities=20%  Similarity=0.251  Sum_probs=21.8

Q ss_pred             CceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903          46 AYKASGVALDINAKRLFWCDNLLDYI   71 (73)
Q Consensus        46 ~~~p~~lavD~~~~~LYW~D~~~~~i   71 (73)
                      +..|.|+++| ..++||-+|....+|
T Consensus         1 f~~P~gvav~-~~g~i~VaD~~n~rV   25 (28)
T PF01436_consen    1 FNYPHGVAVD-SDGNIYVADSGNHRV   25 (28)
T ss_dssp             BSSEEEEEEE-TTSEEEEEECCCTEE
T ss_pred             CcCCcEEEEe-CCCCEEEEECCCCEE
Confidence            3579999999 888999999988776


No 21 
>PF07995 GSDH:  Glucose / Sorbosone dehydrogenase;  InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=96.51  E-value=0.0078  Score=41.81  Aligned_cols=62  Identities=27%  Similarity=0.336  Sum_probs=47.0

Q ss_pred             ccceEEeCCCCeEEEeC---------------CCcEEEEEcCCCC------------eEEEEeCCCceeeEEEEcCCCCe
Q psy4903           8 SSGENRTREQALMFVAD---------------SNQILRTNMDGTM------------AMSIVSEAAYKASGVALDINAKR   60 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd---------------~~~I~ra~ldGs~------------~~~l~~~~~~~p~~lavD~~~~~   60 (73)
                      ..+|+++| .|+||++-               ..+|.|.+.||+-            ...++..++.+|.++++|+.+..
T Consensus       116 g~~l~fgp-DG~LYvs~G~~~~~~~~~~~~~~~G~ilri~~dG~~p~dnP~~~~~~~~~~i~A~GlRN~~~~~~d~~tg~  194 (331)
T PF07995_consen  116 GGGLAFGP-DGKLYVSVGDGGNDDNAQDPNSLRGKILRIDPDGSIPADNPFVGDDGADSEIYAYGLRNPFGLAFDPNTGR  194 (331)
T ss_dssp             EEEEEE-T-TSEEEEEEB-TTTGGGGCSTTSSTTEEEEEETTSSB-TTSTTTTSTTSTTTEEEE--SEEEEEEEETTTTE
T ss_pred             CccccCCC-CCcEEEEeCCCCCcccccccccccceEEEecccCcCCCCCccccCCCceEEEEEeCCCccccEEEECCCCc
Confidence            35688888 66999983               3799999999971            34567889999999999999999


Q ss_pred             EEEEeCCCCe
Q psy4903          61 LFWCDNLLDY   70 (73)
Q Consensus        61 LYW~D~~~~~   70 (73)
                      ||-+|.+.+.
T Consensus       195 l~~~d~G~~~  204 (331)
T PF07995_consen  195 LWAADNGPDG  204 (331)
T ss_dssp             EEEEEE-SSS
T ss_pred             EEEEccCCCC
Confidence            9999976553


No 22 
>KOG4659|consensus
Probab=96.14  E-value=0.016  Score=47.37  Aligned_cols=61  Identities=18%  Similarity=0.283  Sum_probs=44.4

Q ss_pred             eccceEEeCCCCeEEEeC--CCcEEEEE-cCCC----CeEEEEeC--------------------CCceeeEEEEcCCCC
Q psy4903           7 FSSGENRTREQALMFVAD--SNQILRTN-MDGT----MAMSIVSE--------------------AAYKASGVALDINAK   59 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd--~~~I~ra~-ldGs----~~~~l~~~--------------------~~~~p~~lavD~~~~   59 (73)
                      ++==+|++|+.|.||-+|  +.+|.|.. +.+.    +.+++..+                    .+..|.|||+| ..+
T Consensus       408 h~Yy~AvsPvdgtlyvSdp~s~qv~rv~sl~~~d~~~N~evvaG~Ge~Clp~desCGDGalA~dA~L~~PkGIa~d-k~g  486 (1899)
T KOG4659|consen  408 HSYYIAVSPVDGTLYVSDPLSKQVWRVSSLEPQDSRNNYEVVAGDGEVCLPADESCGDGALAQDAQLIFPKGIAFD-KMG  486 (1899)
T ss_pred             ceeEEEecCcCceEEecCCCcceEEEeccCCccccccCeeEEeccCcCccccccccCcchhcccceeccCCceeEc-cCC
Confidence            334478999999999999  68888854 3332    23444321                    57789999999 566


Q ss_pred             eEEEEeCCC
Q psy4903          60 RLFWCDNLL   68 (73)
Q Consensus        60 ~LYW~D~~~   68 (73)
                      .||++|...
T Consensus       487 ~lYfaD~t~  495 (1899)
T KOG4659|consen  487 NLYFADGTR  495 (1899)
T ss_pred             cEEEecccE
Confidence            899999864


No 23 
>PF00058 Ldl_recept_b:  Low-density lipoprotein receptor repeat class B;  InterPro: IPR000033  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=95.65  E-value=0.011  Score=29.55  Aligned_cols=15  Identities=33%  Similarity=0.685  Sum_probs=12.7

Q ss_pred             CeEEEEeCCCC-eeeC
Q psy4903          59 KRLFWCDNLLD-YIET   73 (73)
Q Consensus        59 ~~LYW~D~~~~-~ie~   73 (73)
                      ++|||+|...+ .|++
T Consensus         1 ~~iYWtD~~~~~~I~~   16 (42)
T PF00058_consen    1 GKIYWTDWSQDPSIER   16 (42)
T ss_dssp             TEEEEEETTTTEEEEE
T ss_pred             CEEEEEECCCCcEEEE
Confidence            58999999999 8763


No 24 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=95.45  E-value=0.18  Score=34.91  Aligned_cols=66  Identities=12%  Similarity=0.201  Sum_probs=49.2

Q ss_pred             eccceEEeCCCCeEEEeC--CCcEEEEEcCCC-CeEEEE---eCCCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903           7 FSSGENRTREQALMFVAD--SNQILRTNMDGT-MAMSIV---SEAAYKASGVALDINAKRLFWCDNLLDYIE   72 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs-~~~~l~---~~~~~~p~~lavD~~~~~LYW~D~~~~~ie   72 (73)
                      .+.+++++|...+||-++  ...|..-.+|+. +....+   ...-..|.++++|+.++.||-+....+.|.
T Consensus       246 ~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~  317 (345)
T PF10282_consen  246 APAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVS  317 (345)
T ss_dssp             SEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEE
T ss_pred             CceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEE
Confidence            577899999999999998  578888888653 233332   233456999999999999999998877664


No 25 
>PF01731 Arylesterase:  Arylesterase;  InterPro: IPR002640  The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity [].   Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity.   Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL.   Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo [].  This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=95.25  E-value=0.07  Score=30.85  Aligned_cols=38  Identities=26%  Similarity=0.367  Sum_probs=29.7

Q ss_pred             cCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903          33 MDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYI   71 (73)
Q Consensus        33 ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~i   71 (73)
                      .||+... .+.+++..|+||++|+..+.||-++.....|
T Consensus        41 yd~~~~~-~va~g~~~aNGI~~s~~~k~lyVa~~~~~~I   78 (86)
T PF01731_consen   41 YDGKEVK-VVASGFSFANGIAISPDKKYLYVASSLAHSI   78 (86)
T ss_pred             EeCCEeE-EeeccCCCCceEEEcCCCCEEEEEeccCCeE
Confidence            3454443 3456899999999999999999999877665


No 26 
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=95.13  E-value=0.11  Score=38.18  Aligned_cols=42  Identities=21%  Similarity=0.379  Sum_probs=35.6

Q ss_pred             cEEEEEcCCCC----------eEEEEeCCCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903          27 QILRTNMDGTM----------AMSIVSEAAYKASGVALDINAKRLFWCDNLLD   69 (73)
Q Consensus        27 ~I~ra~ldGs~----------~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~   69 (73)
                      +|.|.+.||+-          +..|...++.+|-|+++|+ ...||.+|.+.+
T Consensus       200 kILRin~DGsiP~dNPf~~g~~~eIyA~G~RNp~Gla~dp-~G~Lw~~e~Gp~  251 (454)
T TIGR03606       200 KVLRLNLDGSIPKDNPSINGVVSHIFTYGHRNPQGLAFTP-DGTLYASEQGPN  251 (454)
T ss_pred             EEEEEcCCCCCCCCCCccCCCcceEEEEeccccceeEECC-CCCEEEEecCCC
Confidence            69999999972          4568888999999999999 789999998753


No 27 
>KOG1520|consensus
Probab=94.82  E-value=0.097  Score=37.69  Aligned_cols=60  Identities=17%  Similarity=0.236  Sum_probs=47.4

Q ss_pred             ccceEEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEeC----CCceeeEEEEcCCCCeEEEEeCCC
Q psy4903           8 SSGENRTREQALMFVAD-SNQILRTNMDGTMAMSIVSE----AAYKASGVALDINAKRLFWCDNLL   68 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~~----~~~~p~~lavD~~~~~LYW~D~~~   68 (73)
                      |=||+.+.+.|-+|-+| ---+....-+|...+.+..+    .+...+++.||. +..+||+|...
T Consensus       117 PLGl~f~~~ggdL~VaDAYlGL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSs  181 (376)
T KOG1520|consen  117 PLGIRFDKKGGDLYVADAYLGLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSS  181 (376)
T ss_pred             cceEEeccCCCeEEEEecceeeEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEecccc
Confidence            56999999999999999 45677777777665555544    455578999998 99999999864


No 28 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=94.60  E-value=0.41  Score=33.13  Aligned_cols=64  Identities=11%  Similarity=0.166  Sum_probs=45.9

Q ss_pred             ccceEEeCCCCeEEEeC--CCcEEEEEcC-CCCeEEEEeC---------CCceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903           8 SSGENRTREQALMFVAD--SNQILRTNMD-GTMAMSIVSE---------AAYKASGVALDINAKRLFWCDNLLDYI   71 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd--~~~I~ra~ld-Gs~~~~l~~~---------~~~~p~~lavD~~~~~LYW~D~~~~~i   71 (73)
                      |+.++++|...++|-.+  ...|....++ .......+..         .-..|.+|++++.++.||-+....+.|
T Consensus       194 PRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI  269 (345)
T PF10282_consen  194 PRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSI  269 (345)
T ss_dssp             EEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEE
T ss_pred             CcEEEEcCCcCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEE
Confidence            45689999888999998  6888888888 3332222221         112699999999999999999988776


No 29 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=93.38  E-value=1.4  Score=29.82  Aligned_cols=62  Identities=11%  Similarity=0.133  Sum_probs=43.4

Q ss_pred             ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeC--CCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903           8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSE--AAYKASGVALDINAKRLFWCDNLLD   69 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~--~~~~p~~lavD~~~~~LYW~D~~~~   69 (73)
                      +++++++|....||-+.  ...|..-.+++.....++..  ....|.++++++.++.||=+....+
T Consensus        37 ~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~~g~~l~v~~~~~~  102 (330)
T PRK11028         37 VQPMVISPDKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPGSPTHISTDHQGRFLFSASYNAN  102 (330)
T ss_pred             CccEEECCCCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCCCceEEEECCCCCEEEEEEcCCC
Confidence            56788999888999886  57776666664333333322  2347999999999999998865433


No 30 
>PF13449 Phytase-like:  Esterase-like activity of phytase
Probab=92.90  E-value=0.42  Score=33.12  Aligned_cols=57  Identities=19%  Similarity=0.302  Sum_probs=43.3

Q ss_pred             cceEEeCCCCeEEEeC--C------CcEEEEEcCCCCeEEE-EeCCC-------------ceeeEEEEcCCCCeEEEEeC
Q psy4903           9 SGENRTREQALMFVAD--S------NQILRTNMDGTMAMSI-VSEAA-------------YKASGVALDINAKRLFWCDN   66 (73)
Q Consensus         9 ~g~~~dp~~g~lywtd--~------~~I~ra~ldGs~~~~l-~~~~~-------------~~p~~lavD~~~~~LYW~D~   66 (73)
                      -||++ +..|.+||++  .      ++|.+...+|...+.+ +...+             .-.+|||+...++.||-+-.
T Consensus        88 Egi~~-~~~g~~~is~E~~~~~~~~p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~~E  166 (326)
T PF13449_consen   88 EGIAV-PPDGSFWISSEGGRTGGIPPRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAAME  166 (326)
T ss_pred             hHeEE-ecCCCEEEEeCCccCCCCCCEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEEEEEEC
Confidence            37888 8899999998  6      9999999999875555 32221             23799999988887876544


No 31 
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=92.89  E-value=1.6  Score=29.56  Aligned_cols=47  Identities=17%  Similarity=0.285  Sum_probs=35.5

Q ss_pred             ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEE
Q psy4903           8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVAL   54 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lav   54 (73)
                      .|||+.+|.++.||=..  .+.|..-+++|.-.+.+--.+...|+||+.
T Consensus        24 ~SGLTy~pd~~tLfaV~d~~~~i~els~~G~vlr~i~l~g~~D~EgI~y   72 (248)
T PF06977_consen   24 LSGLTYNPDTGTLFAVQDEPGEIYELSLDGKVLRRIPLDGFGDYEGITY   72 (248)
T ss_dssp             EEEEEEETTTTEEEEEETTTTEEEEEETT--EEEEEE-SS-SSEEEEEE
T ss_pred             ccccEEcCCCCeEEEEECCCCEEEEEcCCCCEEEEEeCCCCCCceeEEE
Confidence            48999999999988665  689999999987666655567888999988


No 32 
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=92.01  E-value=0.77  Score=32.39  Aligned_cols=62  Identities=19%  Similarity=0.267  Sum_probs=44.7

Q ss_pred             eeeeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEE--------eCCCceeeEEEEcCCCCeEEEEe
Q psy4903           3 FKMKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIV--------SEAAYKASGVALDINAKRLFWCD   65 (73)
Q Consensus         3 ~~~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~--------~~~~~~p~~lavD~~~~~LYW~D   65 (73)
                      +-++=-||+..|+.++.|+--.  ...+.....+|--+..+.        ++++.+|+|+|.|..+ .||-+.
T Consensus       230 ~f~~DvSgl~~~~~~~~LLVLS~ESr~l~Evd~~G~~~~~lsL~~g~~gL~~dipqaEGiamDd~g-~lYIvS  301 (316)
T COG3204         230 LFVLDVSGLEFNAITNSLLVLSDESRRLLEVDLSGEVIELLSLTKGNHGLSSDIPQAEGIAMDDDG-NLYIVS  301 (316)
T ss_pred             eEeeccccceecCCCCcEEEEecCCceEEEEecCCCeeeeEEeccCCCCCcccCCCcceeEECCCC-CEEEEe
Confidence            3455568999999988776554  688888888886544443        2488899999999544 566543


No 33 
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=91.69  E-value=2.3  Score=30.04  Aligned_cols=63  Identities=13%  Similarity=0.175  Sum_probs=43.4

Q ss_pred             eccceEEeCCCCeEEEeC-C---CcEEEEEcCCCCeEEEEeCCCc-eeeEEEEcCCCCeEEEEeCCCCee
Q psy4903           7 FSSGENRTREQALMFVAD-S---NQILRTNMDGTMAMSIVSEAAY-KASGVALDINAKRLFWCDNLLDYI   71 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd-~---~~I~ra~ldGs~~~~l~~~~~~-~p~~lavD~~~~~LYW~D~~~~~i   71 (73)
                      .|.|++++|..+.+|-++ +   .++....-.  ...++...... .|.++++|+.++++|=++...+.+
T Consensus       117 ~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~--t~~~~~~~~vG~~P~~~a~~p~g~~vyv~~~~~~~v  184 (381)
T COG3391         117 GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAA--TNKVTATIPVGNTPTGVAVDPDGNKVYVTNSDDNTV  184 (381)
T ss_pred             CCceEEECCCCCEEEEEecccCCceEEEEeCC--CCeEEEEEecCCCcceEEECCCCCeEEEEecCCCeE
Confidence            578999999999999998 2   444443333  22222222222 689999999999999999665544


No 34 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=90.52  E-value=2.4  Score=28.71  Aligned_cols=63  Identities=14%  Similarity=0.153  Sum_probs=42.5

Q ss_pred             cceEEeCCCCeEEEeC--CCcEEEEEcCCCC-eEEEEeC--CCceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903           9 SGENRTREQALMFVAD--SNQILRTNMDGTM-AMSIVSE--AAYKASGVALDINAKRLFWCDNLLDYI   71 (73)
Q Consensus         9 ~g~~~dp~~g~lywtd--~~~I~ra~ldGs~-~~~l~~~--~~~~p~~lavD~~~~~LYW~D~~~~~i   71 (73)
                      .+++++|..+++|=++  ...|..-.++-.. ...++..  .-..|.++++++.++.||=+....+.|
T Consensus       231 ~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~~~~~~~~~~~~p~~~~~~~dg~~l~va~~~~~~v  298 (330)
T PRK11028        231 ADIHITPDGRHLYACDRTASLISVFSVSEDGSVLSFEGHQPTETQPRGFNIDHSGKYLIAAGQKSHHI  298 (330)
T ss_pred             eeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeEEEeEEEeccccCCceEECCCCCEEEEEEccCCcE
Confidence            3578899888899887  4667666654221 2222222  224799999999999999888755544


No 35 
>PF03088 Str_synth:  Strictosidine synthase;  InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=90.17  E-value=0.46  Score=27.57  Aligned_cols=32  Identities=25%  Similarity=0.506  Sum_probs=27.6

Q ss_pred             eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCC
Q psy4903           5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGT   36 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs   36 (73)
                      .-||.|+++.|....++.++  ..+|.|-++.|.
T Consensus        56 L~fpNGVals~d~~~vlv~Et~~~Ri~rywl~Gp   89 (89)
T PF03088_consen   56 LYFPNGVALSPDESFVLVAETGRYRILRYWLKGP   89 (89)
T ss_dssp             ESSEEEEEE-TTSSEEEEEEGGGTEEEEEESSST
T ss_pred             CCccCeEEEcCCCCEEEEEeccCceEEEEEEeCC
Confidence            45899999999999999999  699999999883


No 36 
>PF03022 MRJP:  Major royal jelly protein;  InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=88.54  E-value=1.3  Score=30.41  Aligned_cols=62  Identities=16%  Similarity=0.277  Sum_probs=38.7

Q ss_pred             ceEEeCC---CCeEEEeC--CCcEEEEEcC----CCC--e------EEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903          10 GENRTRE---QALMFVAD--SNQILRTNMD----GTM--A------MSIVSEAAYKASGVALDINAKRLFWCDNLLDYIE   72 (73)
Q Consensus        10 g~~~dp~---~g~lywtd--~~~I~ra~ld----Gs~--~------~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie   72 (73)
                      |+++.|.   .+.|||..  +.++++...+    .+.  .      ...+.+......|+++|. ...||.++...+.|-
T Consensus       132 gial~~~~~d~r~LYf~~lss~~ly~v~T~~L~~~~~~~~~~~~~~v~~lG~k~~~s~g~~~D~-~G~ly~~~~~~~aI~  210 (287)
T PF03022_consen  132 GIALSPISPDGRWLYFHPLSSRKLYRVPTSVLRDPSLSDAQALASQVQDLGDKGSQSDGMAIDP-NGNLYFTDVEQNAIG  210 (287)
T ss_dssp             EEEE-TTSTTS-EEEEEETT-SEEEEEEHHHHCSTT--HHH-HHHT-EEEEE---SECEEEEET-TTEEEEEECCCTEEE
T ss_pred             ccccCCCCCCccEEEEEeCCCCcEEEEEHHHhhCccccccccccccceeccccCCCCceEEECC-CCcEEEecCCCCeEE
Confidence            4667663   46899998  5678887764    221  1      122333335679999997 889999999877664


No 37 
>KOG4499|consensus
Probab=88.22  E-value=2.7  Score=29.25  Aligned_cols=66  Identities=14%  Similarity=0.178  Sum_probs=46.5

Q ss_pred             eeeeccceEEeCCCCeEEEeC--CCcEEEEEcC---C--CCeEEEEeC------CCceeeEEEEcCCCCeEEEEeCCCCe
Q psy4903           4 KMKFSSGENRTREQALMFVAD--SNQILRTNMD---G--TMAMSIVSE------AAYKASGVALDINAKRLFWCDNLLDY   70 (73)
Q Consensus         4 ~~~~~~g~~~dp~~g~lywtd--~~~I~ra~ld---G--s~~~~l~~~------~~~~p~~lavD~~~~~LYW~D~~~~~   70 (73)
                      .+.+|.||+-|-....+|++|  +-.|..-..|   |  +++.+++.-      +-..|.|++||- +.+||-+-...++
T Consensus       156 ~v~IsNgl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~-eG~L~Va~~ng~~  234 (310)
T KOG4499|consen  156 CVGISNGLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDT-EGNLYVATFNGGT  234 (310)
T ss_pred             hccCCccccccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEcc-CCcEEEEEecCcE
Confidence            467899999999999999999  5556444433   2  568888752      334599999995 6677766544433


No 38 
>PF07995 GSDH:  Glucose / Sorbosone dehydrogenase;  InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=88.22  E-value=3.4  Score=28.71  Aligned_cols=60  Identities=15%  Similarity=0.232  Sum_probs=42.2

Q ss_pred             eeccceEEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEe------CCCceeeEEEEcCC---CCeEEEEeC
Q psy4903           6 KFSSGENRTREQALMFVAD-SNQILRTNMDGTMAMSIVS------EAAYKASGVALDIN---AKRLFWCDN   66 (73)
Q Consensus         6 ~~~~g~~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~------~~~~~p~~lavD~~---~~~LYW~D~   66 (73)
                      .-|.+|+..|. |.||.++ ..+|.+..-+|+....+..      .....+-|||+|+.   ++.||.+-.
T Consensus         2 ~~P~~~a~~pd-G~l~v~e~~G~i~~~~~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f~~n~~lYv~~t   71 (331)
T PF07995_consen    2 NNPRSMAFLPD-GRLLVAERSGRIWVVDKDGSLKTPVADLPEVFADGERGLLGIAFHPDFASNGYLYVYYT   71 (331)
T ss_dssp             SSEEEEEEETT-SCEEEEETTTEEEEEETTTEECEEEEE-TTTBTSTTBSEEEEEE-TTCCCC-EEEEEEE
T ss_pred             CCceEEEEeCC-CcEEEEeCCceEEEEeCCCcCcceecccccccccccCCcccceeccccCCCCEEEEEEE
Confidence            35789999997 8899998 7999999988876333322      24456899999983   467766544


No 39 
>PF01731 Arylesterase:  Arylesterase;  InterPro: IPR002640  The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity [].   Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity.   Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL.   Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo [].  This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=87.82  E-value=1  Score=25.94  Aligned_cols=25  Identities=20%  Similarity=0.469  Sum_probs=20.5

Q ss_pred             eeeccceEEeCCCCeEEEeC--CCcEE
Q psy4903           5 MKFSSGENRTREQALMFVAD--SNQIL   29 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd--~~~I~   29 (73)
                      +.+|.||+++|...+||-++  ...|.
T Consensus        53 ~~~aNGI~~s~~~k~lyVa~~~~~~I~   79 (86)
T PF01731_consen   53 FSFANGIAISPDKKYLYVASSLAHSIH   79 (86)
T ss_pred             CCCCceEEEcCCCCEEEEEeccCCeEE
Confidence            57899999999999999887  34444


No 40 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=86.45  E-value=0.88  Score=23.41  Aligned_cols=19  Identities=11%  Similarity=0.017  Sum_probs=17.6

Q ss_pred             eeccceEEeCCCCeEEEeC
Q psy4903           6 KFSSGENRTREQALMFVAD   24 (73)
Q Consensus         6 ~~~~g~~~dp~~g~lywtd   24 (73)
                      .+|+||.+||.+|.|.|+-
T Consensus        11 ~LP~gLs~d~~tG~isGtp   29 (49)
T PF05345_consen   11 GLPSGLSLDPSTGTISGTP   29 (49)
T ss_pred             CCCCcEEEeCCCCEEEeec
Confidence            5899999999999999985


No 41 
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=85.46  E-value=9.3  Score=27.01  Aligned_cols=60  Identities=13%  Similarity=0.236  Sum_probs=43.3

Q ss_pred             eccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCC
Q psy4903           7 FSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNL   67 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~   67 (73)
                      .+.|+++.|...++|=++  ...|.-..++......-+.... .|.++++|+..+.+|-++..
T Consensus        75 ~p~~i~v~~~~~~vyv~~~~~~~v~vid~~~~~~~~~~~vG~-~P~~~~~~~~~~~vYV~n~~  136 (381)
T COG3391          75 YPAGVAVNPAGNKVYVTTGDSNTVSVIDTATNTVLGSIPVGL-GPVGLAVDPDGKYVYVANAG  136 (381)
T ss_pred             cccceeeCCCCCeEEEecCCCCeEEEEcCcccceeeEeeecc-CCceEEECCCCCEEEEEecc
Confidence            467899999999999887  4666665543322222222232 89999999999999999984


No 42 
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=85.15  E-value=4  Score=29.73  Aligned_cols=59  Identities=15%  Similarity=0.097  Sum_probs=42.4

Q ss_pred             cceEEeCCCCeEEEeC---------------CCcEEEEEcC--------CCCeEEEEeCCCceeeEEEEcCCCCeEEEEe
Q psy4903           9 SGENRTREQALMFVAD---------------SNQILRTNMD--------GTMAMSIVSEAAYKASGVALDINAKRLFWCD   65 (73)
Q Consensus         9 ~g~~~dp~~g~lywtd---------------~~~I~ra~ld--------Gs~~~~l~~~~~~~p~~lavD~~~~~LYW~D   65 (73)
                      +-|+.+|.. +||-+-               ..+|.|..-+        +.+ ..++..++.+|.|++.|+.+..||-+|
T Consensus       180 ~~l~f~pDG-~Lyvs~G~~~~~~~aq~~~~~~Gk~~r~~~a~~~~~d~p~~~-~~i~s~G~RN~qGl~w~P~tg~Lw~~e  257 (399)
T COG2133         180 GRLVFGPDG-KLYVTTGSNGDPALAQDNVSLAGKVLRIDRAGIIPADNPFPN-SEIWSYGHRNPQGLAWHPVTGALWTTE  257 (399)
T ss_pred             ccEEECCCC-cEEEEeCCCCCcccccCccccccceeeeccCcccccCCCCCC-cceEEeccCCccceeecCCCCcEEEEe
Confidence            446777766 888773               1334443333        333 456777999999999999999999999


Q ss_pred             CCCC
Q psy4903          66 NLLD   69 (73)
Q Consensus        66 ~~~~   69 (73)
                      ...+
T Consensus       258 ~g~d  261 (399)
T COG2133         258 HGPD  261 (399)
T ss_pred             cCCC
Confidence            8764


No 43 
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=84.86  E-value=4.1  Score=28.55  Aligned_cols=60  Identities=17%  Similarity=0.215  Sum_probs=40.9

Q ss_pred             eeeccceEEeCCCCeEEEeCCCcEEEE-EcCCC-----CeEEEEeC---C----CceeeEEEEcCCCCeEEEEeCC
Q psy4903           5 MKFSSGENRTREQALMFVADSNQILRT-NMDGT-----MAMSIVSE---A----AYKASGVALDINAKRLFWCDNL   67 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd~~~I~ra-~ldGs-----~~~~l~~~---~----~~~p~~lavD~~~~~LYW~D~~   67 (73)
                      +.+|.|+++.+..  ||-++.++|.|. ..+|.     .+++++..   .    ...|+++++++. .+||+++..
T Consensus        71 l~~p~Gi~~~~~G--lyV~~~~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gpD-G~LYv~~G~  143 (367)
T TIGR02604        71 LSMVTGLAVAVGG--VYVATPPDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGPD-GWLYFNHGN  143 (367)
T ss_pred             CCCccceeEecCC--EEEeCCCeEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECCC-CCEEEeccc
Confidence            3467888888754  888887788887 44442     34555542   1    345889999874 589998873


No 44 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=83.91  E-value=14  Score=26.64  Aligned_cols=65  Identities=6%  Similarity=0.044  Sum_probs=49.5

Q ss_pred             ccceEEeCCCCeEEEeC--CCcEEEEEcCCC-CeEEEEe---------CCCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903           8 SSGENRTREQALMFVAD--SNQILRTNMDGT-MAMSIVS---------EAAYKASGVALDINAKRLFWCDNLLDYIE   72 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs-~~~~l~~---------~~~~~p~~lavD~~~~~LYW~D~~~~~ie   72 (73)
                      |+=|+.+|...+.|-..  +.+|.....++. .+-..++         .+-.+..+|.|.+.++.||-++.+.+.|-
T Consensus       193 PRHi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~  269 (346)
T COG2706         193 PRHIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIA  269 (346)
T ss_pred             cceEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEE
Confidence            45588999999999887  899999998874 2222221         14556788999999999999999988763


No 45 
>KOG4499|consensus
Probab=81.63  E-value=4.4  Score=28.24  Aligned_cols=62  Identities=16%  Similarity=0.129  Sum_probs=40.8

Q ss_pred             eeeeeccceEEeCCCCeEEEeC-----------------------CCcEEEEEcCCCCeEEEEe---CCCceeeEEEEcC
Q psy4903           3 FKMKFSSGENRTREQALMFVAD-----------------------SNQILRTNMDGTMAMSIVS---EAAYKASGVALDI   56 (73)
Q Consensus         3 ~~~~~~~g~~~dp~~g~lywtd-----------------------~~~I~ra~ldGs~~~~l~~---~~~~~p~~lavD~   56 (73)
                      |.-++|.|+++|- .|.||-+-                       .++|.+|-+.|.+...+..   ..+..|.++--|+
T Consensus       209 ~e~~~PDGm~ID~-eG~L~Va~~ng~~V~~~dp~tGK~L~eiklPt~qitsccFgGkn~d~~yvT~aa~~~dp~~~~~~p  287 (310)
T KOG4499|consen  209 FESLEPDGMTIDT-EGNLYVATFNGGTVQKVDPTTGKILLEIKLPTPQITSCCFGGKNLDILYVTTAAKFDDPVRTNTDP  287 (310)
T ss_pred             cCCCCCCcceEcc-CCcEEEEEecCcEEEEECCCCCcEEEEEEcCCCceEEEEecCCCccEEEEEehhcccCchhcccCC
Confidence            4556788888775 55555442                       3677788888876555543   2666788777777


Q ss_pred             CCCeEEEEe
Q psy4903          57 NAKRLFWCD   65 (73)
Q Consensus        57 ~~~~LYW~D   65 (73)
                      ....+|=+.
T Consensus       288 ~aG~iykit  296 (310)
T KOG4499|consen  288 NAGTIYKIT  296 (310)
T ss_pred             CCccEEEec
Confidence            677776543


No 46 
>PF13449 Phytase-like:  Esterase-like activity of phytase
Probab=75.70  E-value=16  Score=25.33  Aligned_cols=58  Identities=16%  Similarity=0.192  Sum_probs=37.2

Q ss_pred             cceEEeCCCCeEEEe--CC------CcEEEEEcCCC----CeEEEE------eC---CC----ceeeEEEEcCCCCeEEE
Q psy4903           9 SGENRTREQALMFVA--DS------NQILRTNMDGT----MAMSIV------SE---AA----YKASGVALDINAKRLFW   63 (73)
Q Consensus         9 ~g~~~dp~~g~lywt--d~------~~I~ra~ldGs----~~~~l~------~~---~~----~~p~~lavD~~~~~LYW   63 (73)
                      |||+++|..|. ||+  |.      +++++..++..    ....+.      ..   .+    ..++||++ .....+||
T Consensus        23 Sgl~~~~~~~~-~~avSD~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~G~~~~~~~~D~Egi~~-~~~g~~~i  100 (326)
T PF13449_consen   23 SGLDYDPDDGR-FYAVSDRGPNKGPPRFYTFRIDYDQGGIGGVTILDMIPLRDPDGQPFPKNGLDPEGIAV-PPDGSFWI  100 (326)
T ss_pred             eeEEEeCCCCE-EEEEECCCCCCCCCcEEEEEeeccCCCccceEeccceeccCCCCCcCCcCCCChhHeEE-ecCCCEEE
Confidence            89999986654 555  33      34888777751    111111      11   11    16789999 67889999


Q ss_pred             EeCCC
Q psy4903          64 CDNLL   68 (73)
Q Consensus        64 ~D~~~   68 (73)
                      ++.+.
T Consensus       101 s~E~~  105 (326)
T PF13449_consen  101 SSEGG  105 (326)
T ss_pred             EeCCc
Confidence            99877


No 47 
>KOG4649|consensus
Probab=74.25  E-value=3.6  Score=29.10  Aligned_cols=24  Identities=4%  Similarity=-0.082  Sum_probs=20.1

Q ss_pred             eEEeCCCCeEEEeC--CCcEEEEEcC
Q psy4903          11 ENRTREQALMFVAD--SNQILRTNMD   34 (73)
Q Consensus        11 ~~~dp~~g~lywtd--~~~I~ra~ld   34 (73)
                      +++||++|.|+|-.  ..+|+...+-
T Consensus        36 ~avd~~sG~~~We~ilg~RiE~sa~v   61 (354)
T KOG4649|consen   36 IAVDPQSGNLIWEAILGVRIECSAIV   61 (354)
T ss_pred             EEecCCCCcEEeehhhCceeeeeeEE
Confidence            46899999999998  7888887654


No 48 
>PF05787 DUF839:  Bacterial protein of unknown function (DUF839);  InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=70.39  E-value=36  Score=25.57  Aligned_cols=56  Identities=16%  Similarity=0.191  Sum_probs=39.3

Q ss_pred             eeeccceEEeCCCCeEEEeC--C-------------------CcEEEEEcCCC-------CeEEEEe-------------
Q psy4903           5 MKFSSGENRTREQALMFVAD--S-------------------NQILRTNMDGT-------MAMSIVS-------------   43 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd--~-------------------~~I~ra~ldGs-------~~~~l~~-------------   43 (73)
                      |.=+-|+.++|.+|.+|.+.  +                   .+|.|-..++.       ..+.++.             
T Consensus       349 f~RpEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~~  428 (524)
T PF05787_consen  349 FDRPEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGSN  428 (524)
T ss_pred             ccCccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCcccccccccC
Confidence            44467889999999999985  2                   37888777654       2333222             


Q ss_pred             ----CCCceeeEEEEcCCCCe
Q psy4903          44 ----EAAYKASGVALDINAKR   60 (73)
Q Consensus        44 ----~~~~~p~~lavD~~~~~   60 (73)
                          ..+..|..|++|..++.
T Consensus       429 ~~~~~~f~sPDNL~~d~~G~L  449 (524)
T PF05787_consen  429 KCDDNGFASPDNLAFDPDGNL  449 (524)
T ss_pred             cccCCCcCCCCceEECCCCCE
Confidence                24778999999986653


No 49 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=70.04  E-value=38  Score=24.40  Aligned_cols=64  Identities=9%  Similarity=0.146  Sum_probs=49.6

Q ss_pred             ccceEEeCCCCeEEEeC----CCcEEEEEcCCC-CeEEEEeCCC---ceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903           8 SSGENRTREQALMFVAD----SNQILRTNMDGT-MAMSIVSEAA---YKASGVALDINAKRLFWCDNLLDYI   71 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd----~~~I~ra~ldGs-~~~~l~~~~~---~~p~~lavD~~~~~LYW~D~~~~~i   71 (73)
                      |+=|+++|....||=..    ...|..=..|+- ++.++++...   ..|..+++|..++.||=+....+.|
T Consensus        42 ptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~~g~~vf~AnY~~g~v  113 (346)
T COG2706          42 PTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHSGSV  113 (346)
T ss_pred             CceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEECCCCCEEEEEEccCceE
Confidence            44578899999999776    478888888874 7888877532   2359999999999999988877665


No 50 
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=67.53  E-value=11  Score=25.81  Aligned_cols=21  Identities=19%  Similarity=0.253  Sum_probs=17.2

Q ss_pred             CCceeeEEEEcCCCCeEEEEe
Q psy4903          45 AAYKASGVALDINAKRLFWCD   65 (73)
Q Consensus        45 ~~~~p~~lavD~~~~~LYW~D   65 (73)
                      +..-++|||-|+..+++|-+-
T Consensus       227 ~~nvlNGIA~~~~~~r~~iTG  247 (262)
T COG3823         227 NDNVLNGIAHDPQQDRFLITG  247 (262)
T ss_pred             ccccccceeecCcCCeEEEec
Confidence            455799999999999998654


No 51 
>PF02333 Phytase:  Phytase;  InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=62.42  E-value=37  Score=24.69  Aligned_cols=48  Identities=21%  Similarity=0.298  Sum_probs=32.4

Q ss_pred             eccceEEeCCCCeEEEeC-CCcEEEEEcC---CCCeEEEEe---CCC-ceeeEEEE
Q psy4903           7 FSSGENRTREQALMFVAD-SNQILRTNMD---GTMAMSIVS---EAA-YKASGVAL   54 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd-~~~I~ra~ld---Gs~~~~l~~---~~~-~~p~~lav   54 (73)
                      =+.|.|+|...|.||..+ +.-|.+-..+   |..++.+..   ..+ ...+||+|
T Consensus       209 Q~EGCVVDDe~g~LYvgEE~~GIW~y~Aep~~~~~~~~v~~~~g~~l~aDvEGlal  264 (381)
T PF02333_consen  209 QPEGCVVDDETGRLYVGEEDVGIWRYDAEPEGGNDRTLVASADGDGLVADVEGLAL  264 (381)
T ss_dssp             -EEEEEEETTTTEEEEEETTTEEEEEESSCCC-S--EEEEEBSSSSB-S-EEEEEE
T ss_pred             cceEEEEecccCCEEEecCccEEEEEecCCCCCCcceeeecccccccccCccceEE
Confidence            357999999999999999 7888887776   334444322   123 45889988


No 52 
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=62.32  E-value=62  Score=24.03  Aligned_cols=60  Identities=13%  Similarity=0.157  Sum_probs=39.7

Q ss_pred             eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEE------e-CCCceeeEEEEcCC------CCeEEEEe
Q psy4903           5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIV------S-EAAYKASGVALDIN------AKRLFWCD   65 (73)
Q Consensus         5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~------~-~~~~~p~~lavD~~------~~~LYW~D   65 (73)
                      +..|.+++..|. |.||-++  ..+|.+..-++.....+.      . ....-+-|||+|+.      ++.||.+-
T Consensus        29 L~~Pw~maflPD-G~llVtER~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n~~lYvsy  103 (454)
T TIGR03606        29 LNKPWALLWGPD-NQLWVTERATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGNPYVYISY  103 (454)
T ss_pred             CCCceEEEEcCC-CeEEEEEecCCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCCcEEEEEE
Confidence            356788888884 6888888  488998875543333221      1 13455799999854      45788873


No 53 
>PF06739 SBBP:  Beta-propeller repeat;  InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=61.17  E-value=16  Score=17.57  Aligned_cols=19  Identities=21%  Similarity=0.398  Sum_probs=13.9

Q ss_pred             CceeeEEEEcCCCCeEEEEe
Q psy4903          46 AYKASGVALDINAKRLFWCD   65 (73)
Q Consensus        46 ~~~p~~lavD~~~~~LYW~D   65 (73)
                      ...|.+||+|..+ ++|-+=
T Consensus        12 ~~~~~~IavD~~G-NiYv~G   30 (38)
T PF06739_consen   12 QDYGNGIAVDSNG-NIYVTG   30 (38)
T ss_pred             ceeEEEEEECCCC-CEEEEE
Confidence            3579999999665 588653


No 54 
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=60.55  E-value=22  Score=25.24  Aligned_cols=47  Identities=11%  Similarity=0.159  Sum_probs=37.8

Q ss_pred             ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEE
Q psy4903           8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVAL   54 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lav   54 (73)
                      -|+|+.+|.++.||=+-  .+.|.....+|--..++--..+..|++|+.
T Consensus        88 vS~LTynp~~rtLFav~n~p~~iVElt~~GdlirtiPL~g~~DpE~Iey  136 (316)
T COG3204          88 VSSLTYNPDTRTLFAVTNKPAAIVELTKEGDLIRTIPLTGFSDPETIEY  136 (316)
T ss_pred             ccceeeCCCcceEEEecCCCceEEEEecCCceEEEecccccCChhHeEE
Confidence            47999999999999775  588888888887766666667888888753


No 55 
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=60.20  E-value=26  Score=26.09  Aligned_cols=48  Identities=17%  Similarity=0.281  Sum_probs=35.3

Q ss_pred             eEEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEe
Q psy4903          11 ENRTREQALMFVAD-SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCD   65 (73)
Q Consensus        11 ~~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D   65 (73)
                      .++||.+|.+||-- ++.-    .+|+.|   -..++..-+=+|||..+.++-|.=
T Consensus       239 ~s~D~~~~lvy~~tGnp~p----~~~~~r---~gdnl~~~s~vAld~~TG~~~W~~  287 (527)
T TIGR03075       239 GSYDPETNLIYFGTGNPSP----WNSHLR---PGDNLYTSSIVARDPDTGKIKWHY  287 (527)
T ss_pred             eeEcCCCCeEEEeCCCCCC----CCCCCC---CCCCccceeEEEEccccCCEEEee
Confidence            48999999999987 5433    455555   234666777889998888888863


No 56 
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=59.83  E-value=62  Score=23.17  Aligned_cols=57  Identities=7%  Similarity=0.070  Sum_probs=35.4

Q ss_pred             eEEeCCCCeEEEeC-C----------CcEEEEEcCCCCeEEEEeCC-CceeeEEEEcCCCC-eEEEEeCCCC
Q psy4903          11 ENRTREQALMFVAD-S----------NQILRTNMDGTMAMSIVSEA-AYKASGVALDINAK-RLFWCDNLLD   69 (73)
Q Consensus        11 ~~~dp~~g~lywtd-~----------~~I~ra~ldGs~~~~l~~~~-~~~p~~lavD~~~~-~LYW~D~~~~   69 (73)
                      ++++|..+.+|-.. +          ..|.....  ..++++-.-. =..|.++++...++ .||=++...+
T Consensus       253 ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~--~t~kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~~s~  322 (352)
T TIGR02658       253 VAYHRARDRIYLLADQRAKWTHKTASRFLFVVDA--KTGKRLRKIELGHEIDSINVSQDAKPLLYALSTGDK  322 (352)
T ss_pred             EEEcCCCCEEEEEecCCccccccCCCCEEEEEEC--CCCeEEEEEeCCCceeeEEECCCCCeEEEEeCCCCC
Confidence            89999999999843 1          23333332  2222222111 23689999998888 8887776544


No 57 
>PF12814 Mcp5_PH:  Meiotic cell cortex C-terminal pleckstrin homology;  InterPro: IPR024774 This pleckstrin homology domain is found in eukaryotic proteins, including Mcp5, a fungal protein that anchors dynein at the cell cortex during the horsetail phase (prophase I) of meiosis. During prophase I of fission yeast all the telomeres become bundled at the spindle pole body and subsequently the nucleus undergoes a dynamic oscillation, resulting in elongated nuclear morphology known as "horsetail" nucleus. The pleckstrin homology domain is necessary for the cortical localisation of the Mcp5 protein during meiosis [].; GO: 0005515 protein binding, 0032065 cortical protein anchoring, 0005938 cell cortex
Probab=57.70  E-value=18  Score=21.70  Aligned_cols=17  Identities=0%  Similarity=-0.126  Sum_probs=13.6

Q ss_pred             eEEeCCCCeEEEeC-CCc
Q psy4903          11 ENRTREQALMFVAD-SNQ   27 (73)
Q Consensus        11 ~~~dp~~g~lywtd-~~~   27 (73)
                      .-+||..+.|||++ ++.
T Consensus        35 fwv~~~~~~L~Ws~~~p~   52 (123)
T PF12814_consen   35 FWVDPYTRTLYWSSSNPK   52 (123)
T ss_pred             EEEeCCCCEEEecCCCCC
Confidence            44899999999998 443


No 58 
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=57.32  E-value=19  Score=16.56  Aligned_cols=40  Identities=15%  Similarity=0.238  Sum_probs=21.4

Q ss_pred             CCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEc
Q psy4903          15 REQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALD   55 (73)
Q Consensus        15 p~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD   55 (73)
                      |..++||-++  ..+|...+........-+.. -..|.+++++
T Consensus         1 pd~~~lyv~~~~~~~v~~id~~~~~~~~~i~v-g~~P~~i~~~   42 (42)
T TIGR02276         1 PDGTKLYVTNSGSNTVSVIDTATNKVIATIPV-GGYPFGVAVS   42 (42)
T ss_pred             CCCCEEEEEeCCCCEEEEEECCCCeEEEEEEC-CCCCceEEeC
Confidence            4566788777  46676655432222122222 2468887764


No 59 
>PF00930 DPPIV_N:  Dipeptidyl peptidase IV (DPP IV) N-terminal region;  InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis.  Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide  It is a type II membrane protein that forms a homodimer.  CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=54.72  E-value=44  Score=23.15  Aligned_cols=48  Identities=10%  Similarity=0.248  Sum_probs=31.1

Q ss_pred             eEEeCCCCeEEEeC------CCcEEEEEcC-CCCeEEEEeCCCceeeEEEEcCCCC
Q psy4903          11 ENRTREQALMFVAD------SNQILRTNMD-GTMAMSIVSEAAYKASGVALDINAK   59 (73)
Q Consensus        11 ~~~dp~~g~lywtd------~~~I~ra~ld-Gs~~~~l~~~~~~~p~~lavD~~~~   59 (73)
                      +.+|+.++.+|++.      ...+++++++ |...+.|....-.. ...++.+.++
T Consensus       286 ~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~~~~~~~LT~~~~~~-~~~~~Spdg~  340 (353)
T PF00930_consen  286 LGWDEDNNRIYFTANGDNPGERHLYRVSLDSGGEPKCLTCEDGDH-YSASFSPDGK  340 (353)
T ss_dssp             EEEECTSSEEEEEESSGGTTSBEEEEEETTETTEEEESSTTSSTT-EEEEE-TTSS
T ss_pred             ceEcCCCCEEEEEecCCCCCceEEEEEEeCCCCCeEeccCCCCCc-eEEEECCCCC
Confidence            56799999999997      4799999999 65555444322111 3555555444


No 60 
>PF14759 Reductase_C:  Reductase C-terminal; PDB: 3FG2_P 3LXD_A 2YVG_A 2GR1_A 2GQW_A 2GR3_A 2YVF_A 1F3P_A 2GR0_A 2GR2_A ....
Probab=53.68  E-value=29  Score=19.34  Aligned_cols=26  Identities=19%  Similarity=0.258  Sum_probs=17.5

Q ss_pred             EEeC--CCcEEEEEcCCCCeEEEEeCCC
Q psy4903          21 FVAD--SNQILRTNMDGTMAMSIVSEAA   46 (73)
Q Consensus        21 ywtd--~~~I~ra~ldGs~~~~l~~~~~   46 (73)
                      ||||  +.+|+-+..-+..-++++....
T Consensus         2 FWSdQ~~~~iq~~G~~~~~~~~v~rg~~   29 (85)
T PF14759_consen    2 FWSDQYGVRIQIAGLPGGADEVVVRGDP   29 (85)
T ss_dssp             EEEEETTEEEEEEE-STTSSEEEEEEET
T ss_pred             eecccCCCeEEEEECCCCCCEEEEEccC
Confidence            8999  6888888876665555555443


No 61 
>PLN03216 actin depolymerizing factor; Provisional
Probab=53.08  E-value=6.2  Score=24.40  Aligned_cols=15  Identities=47%  Similarity=0.574  Sum_probs=13.1

Q ss_pred             CceeeeeccceEEeCC
Q psy4903           1 MKFKMKFSSGENRTRE   16 (73)
Q Consensus         1 ~~~~~~~~~g~~~dp~   16 (73)
                      |-|||. .||+.++|.
T Consensus         1 ~~~~m~-~SGi~v~~~   15 (141)
T PLN03216          1 MAFKMA-TTGMWVTDE   15 (141)
T ss_pred             Ccceec-CCCCeeCHH
Confidence            789999 999999875


No 62 
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=51.01  E-value=68  Score=23.07  Aligned_cols=49  Identities=12%  Similarity=0.224  Sum_probs=34.2

Q ss_pred             CCCeEEEeC-CCcEEEEEcCCCCeEEEEeCCC--------c-eeeE---EEEcCCCCeEEEE
Q psy4903          16 EQALMFVAD-SNQILRTNMDGTMAMSIVSEAA--------Y-KASG---VALDINAKRLFWC   64 (73)
Q Consensus        16 ~~g~lywtd-~~~I~ra~ldGs~~~~l~~~~~--------~-~p~~---lavD~~~~~LYW~   64 (73)
                      ..+.+||.. ++.|+.+.+.|......-.-.+        . .|-|   +|++...++||-.
T Consensus       194 ~~~~~~F~Sy~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyvL  255 (342)
T PF06433_consen  194 DGGRLYFVSYEGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYVL  255 (342)
T ss_dssp             TTTEEEEEBTTSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEEE
T ss_pred             CCCeEEEEecCCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEEE
Confidence            456888888 9999999999987554432111        2 2433   7889899999953


No 63 
>KOG4649|consensus
Probab=49.66  E-value=17  Score=25.75  Aligned_cols=26  Identities=27%  Similarity=0.271  Sum_probs=22.4

Q ss_pred             eeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903          48 KASGVALDINAKRLFWCDNLLDYIET   73 (73)
Q Consensus        48 ~p~~lavD~~~~~LYW~D~~~~~ie~   73 (73)
                      .-.-+|||+.+.++||-.....+||+
T Consensus        32 s~~~~avd~~sG~~~We~ilg~RiE~   57 (354)
T KOG4649|consen   32 SGIVIAVDPQSGNLIWEAILGVRIEC   57 (354)
T ss_pred             CceEEEecCCCCcEEeehhhCceeee
Confidence            34557899999999999999999986


No 64 
>PF03664 Glyco_hydro_62:  Glycosyl hydrolase family 62 ;  InterPro: IPR005193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is a family of alpha -L-arabinofuranosidases (3.2.1.55 from EC) which are all members of glycoside hydrolase family 62 (GH62 from CAZY). This enzyme hydrolyzed aryl alpha-L-arabinofuranosides and cleaves arabinosyl side chains from arabinoxylan and arabinan.; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process
Probab=46.89  E-value=49  Score=23.04  Aligned_cols=30  Identities=23%  Similarity=0.438  Sum_probs=22.1

Q ss_pred             eCCCCeEEEeC-CCcEEEEEcC--------CCCeEEEEe
Q psy4903          14 TREQALMFVAD-SNQILRTNMD--------GTMAMSIVS   43 (73)
Q Consensus        14 dp~~g~lywtd-~~~I~ra~ld--------Gs~~~~l~~   43 (73)
                      |-.+-+||+++ +++|+|+.|-        |+.-+++.+
T Consensus       141 D~~n~yLFfa~DnGkiYRs~~~i~nFP~~fgs~~~vvms  179 (271)
T PF03664_consen  141 DDTNMYLFFAGDNGKIYRSSMPIGNFPGGFGSSYTVVMS  179 (271)
T ss_pred             cCCceEEEEcCCCCcEEEeccchhhCCCCCCCceEEEEe
Confidence            45566899998 8999999985        545555554


No 65 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=46.47  E-value=75  Score=20.16  Aligned_cols=58  Identities=17%  Similarity=0.273  Sum_probs=34.9

Q ss_pred             ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903           8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDN   66 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~   66 (73)
                      +.+++++|..+.+|-+.  ...|...+.++......+.. -..|..+++++.++.+|-+..
T Consensus        33 ~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~-~~~~~~~~~~~~g~~l~~~~~   92 (300)
T TIGR03866        33 PRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPS-GPDPELFALHPNGKILYIANE   92 (300)
T ss_pred             CCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccC-CCCccEEEECCCCCEEEEEcC
Confidence            45677888777777665  46677666654332222222 234677788777777765543


No 66 
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=44.99  E-value=44  Score=22.99  Aligned_cols=29  Identities=10%  Similarity=0.227  Sum_probs=20.3

Q ss_pred             eccceEEeCCCCeEEEeC-C-CcEEEEEcCC
Q psy4903           7 FSSGENRTREQALMFVAD-S-NQILRTNMDG   35 (73)
Q Consensus         7 ~~~g~~~dp~~g~lywtd-~-~~I~ra~ldG   35 (73)
                      ...|+|.||..+++|-|- . |.+..+.+++
T Consensus       230 vlNGIA~~~~~~r~~iTGK~wp~lfEVk~~~  260 (262)
T COG3823         230 VLNGIAHDPQQDRFLITGKLWPLLFEVKLDE  260 (262)
T ss_pred             cccceeecCcCCeEEEecCcCceeEEEEecC
Confidence            567888888888888775 2 6666665554


No 67 
>PF10349 WWbp:  WW-domain ligand protein;  InterPro: IPR018826  This entry represents a sequence domain found in WW domain-binding protein that is characterised by several short PY and PT-like motifs of the PPPPY form. These appear to bind directly to the WW domains of WWP1 and WWP2 and other such diverse proteins as dystrophin and YAP (Yes-associated protein). The presence of a phosphotyrosine residue in the pWBP-1 peptide abolishes WW domain binding which suggests a potential regulatory role for tyrosine phosphorylation in modulating WW domain-ligand interactions. Given the likelihood that WWP1 and WWP2 function as E3 ubiquitin-protein ligases, it is possible that initial substrate-specific recognition occurs via WW domain-substrate protein interaction followed by ubiquitin transfer and subsequent proteolysis []. This domain lies just downstream of IPR004182 from INTERPRO in many sequences. 
Probab=42.22  E-value=14  Score=22.20  Aligned_cols=15  Identities=33%  Similarity=0.503  Sum_probs=12.3

Q ss_pred             ceeeeeccceEEeCC
Q psy4903           2 KFKMKFSSGENRTRE   16 (73)
Q Consensus         2 ~~~~~~~~g~~~dp~   16 (73)
                      +|||+|++|=++|-.
T Consensus        10 ~fKltFk~GGAieFg   24 (116)
T PF10349_consen   10 EFKLTFKSGGAIEFG   24 (116)
T ss_pred             EEEEEeCCCChHHHH
Confidence            699999999887743


No 68 
>PRK04043 tolB translocation protein TolB; Provisional
Probab=41.43  E-value=1.3e+02  Score=21.64  Aligned_cols=51  Identities=4%  Similarity=0.122  Sum_probs=32.9

Q ss_pred             EeCCCCeEEEeC----CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903          13 RTREQALMFVAD----SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDN   66 (73)
Q Consensus        13 ~dp~~g~lywtd----~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~   66 (73)
                      ..|....|+++.    .++|++..++|...+.+.......+   ++.+.++.|.++..
T Consensus       284 ~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~~~~---~~SPDG~~Ia~~~~  338 (419)
T PRK04043        284 FVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGKNNS---SVSTYKNYIVYSSR  338 (419)
T ss_pred             ECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccCCCcCc---eECCCCCEEEEEEc
Confidence            456666888886    4799999999877655554332222   45556666665543


No 69 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=40.69  E-value=95  Score=19.68  Aligned_cols=59  Identities=15%  Similarity=0.182  Sum_probs=36.2

Q ss_pred             ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCC
Q psy4903           8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNL   67 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~   67 (73)
                      +.+++++|....+|.+.  ..+|..-.++.......+. .-..|.++++.+.++.||=+...
T Consensus       209 ~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~-~~~~~~~~~~~~~g~~l~~~~~~  269 (300)
T TIGR03866       209 PVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVLDYLL-VGQRVWQLAFTPDEKYLLTTNGV  269 (300)
T ss_pred             ccceEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEE-eCCCcceEEECCCCCEEEEEcCC
Confidence            34677788888888765  4556655554322222222 12357889998888888876543


No 70 
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=40.41  E-value=29  Score=24.00  Aligned_cols=29  Identities=14%  Similarity=0.205  Sum_probs=21.6

Q ss_pred             eeccceEEeCCCCeEEEeC-C-CcEEEEEcC
Q psy4903           6 KFSSGENRTREQALMFVAD-S-NQILRTNMD   34 (73)
Q Consensus         6 ~~~~g~~~dp~~g~lywtd-~-~~I~ra~ld   34 (73)
                      -+..|||.||..+.+|-|= . +++++..+.
T Consensus       231 dVLNGIAyd~~~~~l~vTGK~Wp~lyeV~l~  261 (264)
T PF05096_consen  231 DVLNGIAYDPETDRLFVTGKLWPKLYEVKLV  261 (264)
T ss_dssp             S-EEEEEEETTTTEEEEEETT-SEEEEEEEE
T ss_pred             CeeEeEeEeCCCCEEEEEeCCCCceEEEEEE
Confidence            3568999999999998875 3 777776543


No 71 
>PF14251 DUF4346:  Domain of unknown function (DUF4346)
Probab=40.15  E-value=20  Score=22.07  Aligned_cols=17  Identities=12%  Similarity=0.071  Sum_probs=13.6

Q ss_pred             ccceEEeCCCCeEEEeC
Q psy4903           8 SSGENRTREQALMFVAD   24 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd   24 (73)
                      -.|+++||.+|.+.=+.
T Consensus        42 ~~Gla~Dpetge~i~~~   58 (119)
T PF14251_consen   42 DKGLAVDPETGEVIPCR   58 (119)
T ss_pred             cccceeCCCCCCEEEEe
Confidence            46999999999876554


No 72 
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=37.96  E-value=98  Score=22.29  Aligned_cols=46  Identities=11%  Similarity=0.089  Sum_probs=31.8

Q ss_pred             CCeEEEeC--CCcEEEEEcC-CCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903          17 QALMFVAD--SNQILRTNMD-GTMAMSIVSEAAYKASGVALDINAKRLFWCDN   66 (73)
Q Consensus        17 ~g~lywtd--~~~I~ra~ld-Gs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~   66 (73)
                      .|.||..|  ...+.+...+ |....+  ...-..|.||+..  ++.++-.-+
T Consensus       212 dgrLwvldsgtGev~~vD~~~G~~e~V--a~vpG~~rGL~f~--G~llvVgmS  260 (335)
T TIGR03032       212 QGKLWLLNSGRGELGYVDPQAGKFQPV--AFLPGFTRGLAFA--GDFAFVGLS  260 (335)
T ss_pred             CCeEEEEECCCCEEEEEcCCCCcEEEE--EECCCCCccccee--CCEEEEEec
Confidence            68889998  6889998887 544433  3344689998776  666665443


No 73 
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=34.86  E-value=1.7e+02  Score=21.00  Aligned_cols=52  Identities=13%  Similarity=0.316  Sum_probs=34.5

Q ss_pred             eCCCCeEEEeC-CCcEEEEEcCCCCeEEE--EeC---C----CceeeE---EEEcCCCCeEEEEe
Q psy4903          14 TREQALMFVAD-SNQILRTNMDGTMAMSI--VSE---A----AYKASG---VALDINAKRLFWCD   65 (73)
Q Consensus        14 dp~~g~lywtd-~~~I~ra~ldGs~~~~l--~~~---~----~~~p~~---lavD~~~~~LYW~D   65 (73)
                      .|..|..+|.. ..+|.-+.+.|......  +..   .    --.|.|   ++++..++++|-..
T Consensus       202 ~~~dg~~~~vs~eG~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~  266 (352)
T TIGR02658       202 SNKSGRLVWPTYTGKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLA  266 (352)
T ss_pred             EcCCCcEEEEecCCeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCCCEEEEEe
Confidence            34467778887 78999998876543332  211   1    114555   99999999999843


No 74 
>cd01213 tensin Tensin Phosphotyrosine-binding (PTB) domain. Tensin Phosphotyrosine-binding (PTB) domain. Tensin is a a focal adhesion protein, which contains a C-terminal SH2 domain followed by a PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
Probab=33.29  E-value=84  Score=19.61  Aligned_cols=38  Identities=21%  Similarity=0.232  Sum_probs=28.1

Q ss_pred             eeeeeccceEEeCCCCeEEEeC---CCcEEEEEcCCCCeEEE
Q psy4903           3 FKMKFSSGENRTREQALMFVAD---SNQILRTNMDGTMAMSI   41 (73)
Q Consensus         3 ~~~~~~~g~~~dp~~g~lywtd---~~~I~ra~ldGs~~~~l   41 (73)
                      ||++ ..|+.+--..+++|+-.   ...|.-|.+|+.+++.+
T Consensus        48 ~kVS-~qGItLtDn~rk~ffrrhypl~~Vs~ca~dp~n~~~~   88 (138)
T cd01213          48 FKVS-SQGITLTDNTRKKFFRRHYKVDSVIFCAIDPEERMWE   88 (138)
T ss_pred             EEEE-cCCeeeeccccceeehhhCCcCeEEEEeeCCcccccc
Confidence            5666 57888755566788765   47899999999887554


No 75 
>PF01011 PQQ:  PQQ enzyme repeat family.;  InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=32.31  E-value=48  Score=15.41  Aligned_cols=12  Identities=0%  Similarity=-0.078  Sum_probs=8.8

Q ss_pred             EEeCCCCeEEEe
Q psy4903          12 NRTREQALMFVA   23 (73)
Q Consensus        12 ~~dp~~g~lywt   23 (73)
                      ++|..+|.+-|.
T Consensus        14 AlD~~TG~~~W~   25 (38)
T PF01011_consen   14 ALDAKTGKVLWK   25 (38)
T ss_dssp             EEETTTTSEEEE
T ss_pred             EEECCCCCEEEe
Confidence            567777777776


No 76 
>PF08310 LGFP:  LGFP repeat;  InterPro: IPR013207 This 54 amino acid repeat is found in many hypothetical proteins. Several hypothetical proteins from Corynebacterium glutamicum (Brevibacterium flavum) and Corynebacterium efficiens along with PS1 protein contain this repeat region. The N-terminal region of PS1 contains an esterase domain which transfers corynomycolic acid. The C-terminal region consists of 4 tandem LGFP repeats. It is hypothesised that the PS1 proteins in Corynebacterium, when associated with the cell wall, may be anchored via the LGFP tandem repeats that may be important for maintaining cell wall integrity. Deletion of Q01377 from SWISSPROT protein results in a 10-fold increase in the cell volume of the organism and infers the corresponding involvement of the protein in the cell shape formation []. The secondary structure of each repeat is predicted to comprise two beta-strands and one alpha-helix.
Probab=30.63  E-value=58  Score=16.56  Aligned_cols=15  Identities=7%  Similarity=0.244  Sum_probs=11.8

Q ss_pred             ceEEeCCCCeEEEeC
Q psy4903          10 GENRTREQALMFVAD   24 (73)
Q Consensus        10 g~~~dp~~g~lywtd   24 (73)
                      |-.-+-.+|.|||+.
T Consensus        16 G~~q~F~~G~Iywsp   30 (54)
T PF08310_consen   16 GRYQDFQNGTIYWSP   30 (54)
T ss_pred             EEEEEcCCeEEEEeC
Confidence            366677889999987


No 77 
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=30.36  E-value=2e+02  Score=20.39  Aligned_cols=57  Identities=9%  Similarity=0.195  Sum_probs=32.1

Q ss_pred             EEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903          12 NRTREQALMFVAD-SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLD   69 (73)
Q Consensus        12 ~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~   69 (73)
                      +..|-.+++|-++ +..|....+.-.....-+..+ ..|.|+++...++.+|=+....+
T Consensus        43 ~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G-~~~~~i~~s~DG~~~~v~n~~~~  100 (369)
T PF02239_consen   43 KFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVG-GNPRGIAVSPDGKYVYVANYEPG  100 (369)
T ss_dssp             E-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-S-SEEEEEEE--TTTEEEEEEEETT
T ss_pred             EecCCCCEEEEEcCCCeEEEEECCcccEEEEEecC-CCcceEEEcCCCCEEEEEecCCC
Confidence            3445556677666 566666665543322223333 36999999988999987765443


No 78 
>PF11314 DUF3117:  Protein of unknown function (DUF3117);  InterPro: IPR021465  This family of proteins with unknown function appears to be restricted to Actinobacteria. 
Probab=30.20  E-value=73  Score=16.61  Aligned_cols=19  Identities=11%  Similarity=0.279  Sum_probs=16.0

Q ss_pred             CCcEEEEEcCCCCeEEEEe
Q psy4903          25 SNQILRTNMDGTMAMSIVS   43 (73)
Q Consensus        25 ~~~I~ra~ldGs~~~~l~~   43 (73)
                      ...|+|..++|..|.++--
T Consensus        17 R~ivmRvPleGGGRLVvEl   35 (51)
T PF11314_consen   17 RGIVMRVPLEGGGRLVVEL   35 (51)
T ss_pred             ceEEEEEecCCCcEEEEEe
Confidence            4679999999999988753


No 79 
>PF02333 Phytase:  Phytase;  InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=30.13  E-value=2.2e+02  Score=20.78  Aligned_cols=55  Identities=15%  Similarity=0.300  Sum_probs=30.6

Q ss_pred             eCCCCeEEEeC---CCcEEEEEc--CCCC--eEEEEeC--CCceeeEEEEcCCCCeEEEEeCCC
Q psy4903          14 TREQALMFVAD---SNQILRTNM--DGTM--AMSIVSE--AAYKASGVALDINAKRLFWCDNLL   68 (73)
Q Consensus        14 dp~~g~lywtd---~~~I~ra~l--dGs~--~~~l~~~--~~~~p~~lavD~~~~~LYW~D~~~   68 (73)
                      +|.+|.+|-.-   +..++.-.|  +|..  ...++.+  .-.+|+|+++|-....||-++...
T Consensus       166 ~~~~g~~ya~v~~k~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGCVVDDe~g~LYvgEE~~  229 (381)
T PF02333_consen  166 SPSTGALYAFVNGKDGRVEQYELTDDGDGKVSATLVREFKVGSQPEGCVVDDETGRLYVGEEDV  229 (381)
T ss_dssp             -TTT--EEEEEEETTSEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEEEEETTTTEEEEEETTT
T ss_pred             cCCCCcEEEEEecCCceEEEEEEEeCCCCcEeeEEEEEecCCCcceEEEEecccCCEEEecCcc
Confidence            45666555432   344444333  2332  3334433  335899999999999999998754


No 80 
>KOG3567|consensus
Probab=29.04  E-value=61  Score=24.50  Aligned_cols=56  Identities=18%  Similarity=0.367  Sum_probs=34.2

Q ss_pred             ccceEEeCCCCeEEEeC-----------------------CCcEEEEEcCCCCeEEEE---eCCCceeeEEEEcCCCCeE
Q psy4903           8 SSGENRTREQALMFVAD-----------------------SNQILRTNMDGTMAMSIV---SEAAYKASGVALDINAKRL   61 (73)
Q Consensus         8 ~~g~~~dp~~g~lywtd-----------------------~~~I~ra~ldGs~~~~l~---~~~~~~p~~lavD~~~~~L   61 (73)
                      .+|++.||+...+-+--                       .++|.+....  ++..+-   ...+..|.||.+|.  +--
T Consensus       404 v~gva~d~~~~l~vfhr~g~vwd~~sfd~~f~~~~~~pi~~~~ilvi~~~--n~~~l~~~g~~~fylphgl~~dk--dgf  479 (501)
T KOG3567|consen  404 VSGVALDPKNNLVVFHRGGHVWDGNSFDSKFVYQQIGPIEEDTILVIDPN--NAAVLQSSGKNLFYLPHGLSIDK--DGF  479 (501)
T ss_pred             eeeeccCCccceEEEccCCceEeechhhhhhhhhcccccccceEEEEcCc--chhhhhhccCCceecCCcceecC--CCc
Confidence            46777788766554432                       2556665555  322222   23777899999994  446


Q ss_pred             EEEeCC
Q psy4903          62 FWCDNL   67 (73)
Q Consensus        62 YW~D~~   67 (73)
                      ||++..
T Consensus       480 ~~~tdv  485 (501)
T KOG3567|consen  480 YWVTDV  485 (501)
T ss_pred             EEeecc
Confidence            787754


No 81 
>PF01267 F-actin_cap_A:  F-actin capping protein alpha subunit;  InterPro: IPR002189 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. The actin filament system, a prominent part of the cytoskeleton in eukaryotic cells, is both a static structure and a dynamic network that can undergo rearrangements: it is thought to be involved in processes such as cell movement and phagocytosis [], as well as muscle contraction. The F-actin capping protein binds in a calcium-independent manner to the fast growing ends of actin filaments (barbed end) thereby blocking the exchange of subunits at these ends. Unlike gelsolin and severin this protein does not sever actin filaments. The F-actin capping protein is a heterodimer composed of two unrelated subunits: alpha and beta (see IPR001698 from INTERPRO). Neither of the subunits shows sequence similarity to other filament-capping proteins []. The alpha subunit is a protein of about 268 to 286 amino acid residues whose sequence is well conserved in eukaryotic species [].; GO: 0003779 actin binding, 0030036 actin cytoskeleton organization, 0008290 F-actin capping protein complex; PDB: 1MQ1_D 1MWN_X 2KBM_Y 2KZ7_A 3AAE_C 3AA6_A 3LK4_G 1IZN_A 3AA7_A 3AAA_A ....
Probab=28.68  E-value=1.4e+02  Score=20.56  Aligned_cols=43  Identities=19%  Similarity=0.206  Sum_probs=31.4

Q ss_pred             EEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903          29 LRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYI   71 (73)
Q Consensus        29 ~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~i   71 (73)
                      .-+.++|.+..+|++..-....+.-+|+.++..|-.|+....+
T Consensus        52 ~~v~~~~~~~~viis~~n~~~~~ry~Dp~~~~~F~~DHl~~~~   94 (271)
T PF01267_consen   52 IPVKLPGSDHKVIISKYNKLGGNRYFDPRSKKSFSVDHLKQKA   94 (271)
T ss_dssp             EEE--TTSSS-EEE-GGGBESTTEEEETTTTEEEEEETTTTEE
T ss_pred             EEEEeCCCCcceEEecccccCCCeEEEeccCcEEEEeeeccEE
Confidence            4466888888888887555578899999999999999976654


No 82 
>PF00930 DPPIV_N:  Dipeptidyl peptidase IV (DPP IV) N-terminal region;  InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis.  Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide  It is a type II membrane protein that forms a homodimer.  CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=25.61  E-value=2.3e+02  Score=19.54  Aligned_cols=53  Identities=13%  Similarity=0.157  Sum_probs=33.4

Q ss_pred             CCCCeEEEeC----CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCC
Q psy4903          15 REQALMFVAD----SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNL   67 (73)
Q Consensus        15 p~~g~lywtd----~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~   67 (73)
                      |..+.++|..    -..|+....+|.....|......--.-+.+|..++.||+.-..
T Consensus       245 ~~~~~~l~~s~~~G~~hly~~~~~~~~~~~lT~G~~~V~~i~~~d~~~~~iyf~a~~  301 (353)
T PF00930_consen  245 PDGNEFLWISERDGYRHLYLYDLDGGKPRQLTSGDWEVTSILGWDEDNNRIYFTANG  301 (353)
T ss_dssp             TTSSEEEEEEETTSSEEEEEEETTSSEEEESS-SSS-EEEEEEEECTSSEEEEEESS
T ss_pred             CCCCEEEEEEEcCCCcEEEEEcccccceeccccCceeecccceEcCCCCEEEEEecC
Confidence            4444455554    2788888888877554443333222457889999999988764


No 83 
>PRK04922 tolB translocation protein TolB; Provisional
Probab=25.03  E-value=2.6e+02  Score=19.89  Aligned_cols=55  Identities=11%  Similarity=0.175  Sum_probs=34.7

Q ss_pred             eEEeCCCCeEEEeC----CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903          11 ENRTREQALMFVAD----SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDN   66 (73)
Q Consensus        11 ~~~dp~~g~lywtd----~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~   66 (73)
                      ....|...+|+++.    .++|+..++++...+.+.... ......++.+.++.|+++..
T Consensus       297 ~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~~~  355 (433)
T PRK04922        297 PTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQG-NYNARASVSPDGKKIAMVHG  355 (433)
T ss_pred             eEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEEC
Confidence            34566666777764    367999988876555444322 23345667777888887654


No 84 
>PLN02553 inositol-phosphate phosphatase
Probab=24.52  E-value=56  Score=21.80  Aligned_cols=15  Identities=20%  Similarity=0.051  Sum_probs=13.8

Q ss_pred             ceEEeCCCCeEEEeC
Q psy4903          10 GENRTREQALMFVAD   24 (73)
Q Consensus        10 g~~~dp~~g~lywtd   24 (73)
                      |++.+|..+.+||+.
T Consensus       121 GvV~~P~~~e~~~A~  135 (270)
T PLN02553        121 GVVYNPILDELFTAV  135 (270)
T ss_pred             EEEecCCCCCeEEEE
Confidence            788999999999987


No 85 
>cd01643 Bacterial_IMPase_like_2 Bacterial family of Mg++ dependent phosphatases, related to inositol monophosphatases. These enzymes may dephosphorylate inositol monophosphate or similar substrates.
Probab=24.36  E-value=59  Score=21.44  Aligned_cols=16  Identities=25%  Similarity=0.013  Sum_probs=14.0

Q ss_pred             cceEEeCCCCeEEEeC
Q psy4903           9 SGENRTREQALMFVAD   24 (73)
Q Consensus         9 ~g~~~dp~~g~lywtd   24 (73)
                      -|++.+|..+.+||+.
T Consensus       106 ~GvV~~P~~~~~~~A~  121 (242)
T cd01643         106 FGVIALPALNQTFVAF  121 (242)
T ss_pred             EEEEecCCCCCEEEEE
Confidence            3788999999999986


No 86 
>TIGR02067 his_9_proposed histidinol-phosphate phosphatase HisN, inositol monophosphatase family. This subfamily belongs to the inositol monophosphatase family (pfam00459). The members of this family consist of no more than one per species and are found only in species in which histidine is synthesized de novo but no histidinol phosphatase can be found in either of the two described families (TIGR01261, TIGR01856). In at least one species, the member of this family is found near known histidine biosynthesis genes.
Probab=24.21  E-value=62  Score=21.36  Aligned_cols=16  Identities=13%  Similarity=-0.105  Sum_probs=14.0

Q ss_pred             cceEEeCCCCeEEEeC
Q psy4903           9 SGENRTREQALMFVAD   24 (73)
Q Consensus         9 ~g~~~dp~~g~lywtd   24 (73)
                      -|++.+|..+.+||+.
T Consensus       109 ~gvv~~P~~~~~~~A~  124 (251)
T TIGR02067       109 LGVIFQPATGERWWAA  124 (251)
T ss_pred             EEEEEEcCCCCEEEEe
Confidence            3778999999999987


No 87 
>PF13804 HERV-K_env_2:  Retro-transcribing viruses envelope glycoprotein
Probab=23.29  E-value=2.1e+02  Score=18.58  Aligned_cols=18  Identities=28%  Similarity=0.543  Sum_probs=15.3

Q ss_pred             EEEEcCCCCeEEEEeCCC
Q psy4903          51 GVALDINAKRLFWCDNLL   68 (73)
Q Consensus        51 ~lavD~~~~~LYW~D~~~   68 (73)
                      ++.+||.=+.+||.+=..
T Consensus       147 ~~iIDWaPkg~f~~nCss  164 (170)
T PF13804_consen  147 GNIIDWAPKGMFWQNCSS  164 (170)
T ss_pred             eEEEEecCCceeeeccCc
Confidence            899999999999987543


No 88 
>PF02820 MBT:  mbt repeat;  InterPro: IPR004092 The function of the malignant brain tumor (MBT) repeat is unknown, but is found in a number of nuclear proteins involved in transcriptional repression. The repeat contains a completely conserved glutamate at its amino terminus that may be important for function.  The crystal structure of the two MBT repeats of human SCM-like 2 protein has been reported. Each repeat consists of an extended "arm" and a globular core. The arm of the first repeat packs against the core of the second repeat and vice versa. The structure of the core-interacting part of each arm consists of an N-terminal alpha-helix and a turn of 310 helix connected by a short beta-strand. The core consists of an Src homology 3-like five-stranded beta-barrel followed by a C-terminal alpha-helix and another short beta-strand. Each arm interacts with its partner core in a similar way, with the orientation of the N-terminal helix relative to the barrel varying slightly. There are also extensive interactions between the two barrels [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2P0K_A 3F70_B 3CEY_A 2VYT_A 2BIV_A 1OI1_A 3OQ5_A 2RJE_B 2RJD_A 2RI3_A ....
Probab=23.10  E-value=1.4e+02  Score=16.05  Aligned_cols=14  Identities=7%  Similarity=0.081  Sum_probs=11.0

Q ss_pred             eEEeCCCCeEEEeC
Q psy4903          11 ENRTREQALMFVAD   24 (73)
Q Consensus        11 ~~~dp~~g~lywtd   24 (73)
                      +++||.+...+|..
T Consensus         4 Ea~d~~~~~~~~vA   17 (73)
T PF02820_consen    4 EAVDPRNPSLICVA   17 (73)
T ss_dssp             EEEETTECCEEEEE
T ss_pred             EEECCCCCCeEEEE
Confidence            57899888788864


No 89 
>TIGR02647 DNA conserved hypothetical protein TIGR02647. Members of this family are found, so far, only in the Gammaproteobacteria. The function is unknown. The location on the chromosome usually is not far from housekeeping genes rather than in what is clearly, say, a prophage region. Some members have been annotated in public databases as DNA-binding protein inhibitor Id-2-related protein, putative transcriptional regulator, or hypothetical DNA binding protein.
Probab=22.96  E-value=27  Score=19.85  Aligned_cols=17  Identities=6%  Similarity=0.227  Sum_probs=14.2

Q ss_pred             eeccceEEeCCCCeEEEeC
Q psy4903           6 KFSSGENRTREQALMFVAD   24 (73)
Q Consensus         6 ~~~~g~~~dp~~g~lywtd   24 (73)
                      +|.+|+.-+|..|  |.|+
T Consensus        43 LheKGLI~~pdGg--yLT~   59 (77)
T TIGR02647        43 LHEKGLTTQPDGG--YLTS   59 (77)
T ss_pred             HHHcCCccCCCCC--EecH
Confidence            6788999999998  6776


No 90 
>PF14884 EFF-AFF:  Type I membrane glycoproteins cell-cell fusogen
Probab=22.90  E-value=1.5e+02  Score=23.05  Aligned_cols=31  Identities=19%  Similarity=0.124  Sum_probs=24.1

Q ss_pred             CCcEEEEEcCCCCeEEEEeCCCceeeEEEEc
Q psy4903          25 SNQILRTNMDGTMAMSIVSEAAYKASGVALD   55 (73)
Q Consensus        25 ~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD   55 (73)
                      .+=|.||..|++.|++.+...-..+..+.|.
T Consensus       371 ~~WIrsv~id~s~R~V~V~H~egt~v~Vsl~  401 (589)
T PF14884_consen  371 YPWIRSVRIDSSSRQVTVTHAEGTNVQVSLH  401 (589)
T ss_pred             ccceeEEEEcCCCceEEEEecCCCcEEEEEE
Confidence            3668999999999999988766666666664


No 91 
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=22.68  E-value=85  Score=13.50  Aligned_cols=10  Identities=0%  Similarity=-0.064  Sum_probs=4.9

Q ss_pred             EeCCCCeEEE
Q psy4903          13 RTREQALMFV   22 (73)
Q Consensus        13 ~dp~~g~lyw   22 (73)
                      +|+.+|.+-|
T Consensus        21 ~d~~~G~~~W   30 (33)
T smart00564       21 LDAKTGEILW   30 (33)
T ss_pred             EEcccCcEEE
Confidence            4444555544


No 92 
>cd01248 PH_PLC Phospholipase C (PLC) pleckstrin homology (PH) domain. Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them.  PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding.  Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Probab=22.58  E-value=1.7e+02  Score=16.92  Aligned_cols=23  Identities=22%  Similarity=0.252  Sum_probs=17.8

Q ss_pred             CCceeeEEEEcCCCCeEEEEeCC
Q psy4903          45 AAYKASGVALDINAKRLFWCDNL   67 (73)
Q Consensus        45 ~~~~p~~lavD~~~~~LYW~D~~   67 (73)
                      .-..+.-+.||..+..|||.+..
T Consensus        19 ~~~~~~~f~ld~~~~~l~W~~~~   41 (115)
T cd01248          19 SRERRRLFRLDEKGFFLYWKDEG   41 (115)
T ss_pred             CceeeEEEEEcCCCcEEEEeCCC
Confidence            34456778889999999998765


No 93 
>cd00472 Ribosomal_L24e_L24 Ribosomal protein L24e/L24 is a ribosomal protein found in eukaryotes (L24) and in archaea (L24e, distinct from archaeal L24). L24e/L24 is located on the surface of the large subunit, adjacent to proteins L14 and L3, and near the translation factor binding site.  L24e/L24 appears to play a role in the kinetics of peptide synthesis, and may be involved in interactions between the large and small subunits, either directly or through other factors. In mouse, a deletion mutation in L24 has been identified as the cause for the belly spot and tail (Bst) mutation that results in disrupted pigmentation, somitogenesis and retinal cell fate determination.  L24 may be an important protein in eukaryotic reproduction:  in shrimp, L24 expression is elevated in the ovary, suggesting a role in oogenesis, and in Arabidopsis, L24 has been proposed to have a specific function in gynoecium development. No protein with sequence or structural homology to L24e/L24 has been identifi
Probab=22.27  E-value=1.4e+02  Score=15.67  Aligned_cols=28  Identities=14%  Similarity=0.153  Sum_probs=19.0

Q ss_pred             ceeeeeccceEEeCCCCeEEEeCCCcEE
Q psy4903           2 KFKMKFSSGENRTREQALMFVADSNQIL   29 (73)
Q Consensus         2 ~~~~~~~~g~~~dp~~g~lywtd~~~I~   29 (73)
                      +-..--.+|.-+-|-+|.+|.-.+++|.
T Consensus         2 ~~~~C~f~g~~I~PG~G~~~Vr~Dgkv~   29 (54)
T cd00472           2 KTEKCSFCGYKIYPGHGKMYVRNDGKVF   29 (54)
T ss_pred             cEEEecCcCCeecCCCccEEEecCCCEE
Confidence            3444456788899999999976544443


No 94 
>TIGR01331 bisphos_cysQ 3'(2'),5'-bisphosphate nucleotidase, bacterial. Sulfate is incorporated into 3-phosphoadenylylsulfate, PAPS, for utilization in pathways such as methionine biosynthesis. Transfer of sulfate from PAPS to an acceptor leaves adenosine 3'-5'-bisphosphate, APS. This model describes a form found in bacteria of the enzyme 3'(2'),5'-bisphosphate nucleotidase, which removes the 3'-phosphate from APS to regenerate AMP and help drive the cycle.
Probab=22.15  E-value=73  Score=21.02  Aligned_cols=16  Identities=13%  Similarity=0.160  Sum_probs=14.0

Q ss_pred             cceEEeCCCCeEEEeC
Q psy4903           9 SGENRTREQALMFVAD   24 (73)
Q Consensus         9 ~g~~~dp~~g~lywtd   24 (73)
                      -|.+.+|..|.+||+.
T Consensus       111 ~gvI~~P~~~~~~~A~  126 (249)
T TIGR01331       111 LGVVYAPATGVTYFAT  126 (249)
T ss_pred             EEEEEecCCCCEEEEE
Confidence            3778999999999986


No 95 
>PRK09439 PTS system glucose-specific transporter subunit; Provisional
Probab=21.73  E-value=84  Score=20.23  Aligned_cols=14  Identities=14%  Similarity=0.055  Sum_probs=9.6

Q ss_pred             ccceEEeCCCCeEE
Q psy4903           8 SSGENRTREQALMF   21 (73)
Q Consensus         8 ~~g~~~dp~~g~ly   21 (73)
                      ..|+++.|..|.+|
T Consensus        48 GdGvAI~P~~~~v~   61 (169)
T PRK09439         48 GDGIAIKPTGNKMV   61 (169)
T ss_pred             cceEEEEccCCEEE
Confidence            45777777777666


No 96 
>cd01638 CysQ CysQ, a 3'-Phosphoadenosine-5'-phosphosulfate (PAPS) 3'-phosphatase, is a bacterial member of the inositol monophosphatase family. It has been proposed that CysQ helps control intracellular levels of PAPS, which is an intermediate in cysteine biosynthesis (a principal route of sulfur assimilation).
Probab=21.50  E-value=72  Score=20.90  Aligned_cols=15  Identities=13%  Similarity=0.040  Sum_probs=13.7

Q ss_pred             ceEEeCCCCeEEEeC
Q psy4903          10 GENRTREQALMFVAD   24 (73)
Q Consensus        10 g~~~dp~~g~lywtd   24 (73)
                      |++.+|..+.+||+.
T Consensus       110 gvi~~P~~~~~~~A~  124 (242)
T cd01638         110 GVVYAPALGELYYAL  124 (242)
T ss_pred             EEEecCCCCCEEEEE
Confidence            788899999999987


No 97 
>KOG0646|consensus
Probab=21.41  E-value=3.7e+02  Score=20.38  Aligned_cols=21  Identities=19%  Similarity=0.234  Sum_probs=16.5

Q ss_pred             CceeeEEEEcCCCCeEEEEeC
Q psy4903          46 AYKASGVALDINAKRLFWCDN   66 (73)
Q Consensus        46 ~~~p~~lavD~~~~~LYW~D~   66 (73)
                      -..+.++++|+.+.++|-...
T Consensus       217 p~si~av~lDpae~~~yiGt~  237 (476)
T KOG0646|consen  217 PSSIKAVALDPAERVVYIGTE  237 (476)
T ss_pred             CCcceeEEEcccccEEEecCC
Confidence            345789999999999886544


No 98 
>PF09917 DUF2147:  Uncharacterized protein conserved in bacteria (DUF2147);  InterPro: IPR019223  This domain, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=21.22  E-value=80  Score=18.42  Aligned_cols=15  Identities=13%  Similarity=0.206  Sum_probs=11.1

Q ss_pred             ccceEEeCCCCeEEE
Q psy4903           8 SSGENRTREQALMFV   22 (73)
Q Consensus         8 ~~g~~~dp~~g~lyw   22 (73)
                      ..|-++||.+|+.|=
T Consensus        70 ~~G~iydP~~GktY~   84 (114)
T PF09917_consen   70 KGGKIYDPEDGKTYS   84 (114)
T ss_pred             cCcEEEeCCCCCEEE
Confidence            347788888888774


No 99 
>COG5649 Uncharacterized conserved protein [Function unknown]
Probab=21.13  E-value=84  Score=19.42  Aligned_cols=15  Identities=20%  Similarity=0.388  Sum_probs=11.7

Q ss_pred             eeeeeccceEE-eCCC
Q psy4903           3 FKMKFSSGENR-TREQ   17 (73)
Q Consensus         3 ~~~~~~~g~~~-dp~~   17 (73)
                      .|+.|.+|..+ ||..
T Consensus        75 vK~tFakGAsL~dPa~   90 (132)
T COG5649          75 VKVTFAKGASLADPAP   90 (132)
T ss_pred             EEEEEecccccCCCCc
Confidence            47888889888 7754


No 100
>PF14583 Pectate_lyase22:  Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=21.02  E-value=2.9e+02  Score=20.28  Aligned_cols=43  Identities=9%  Similarity=0.154  Sum_probs=20.5

Q ss_pred             CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCC
Q psy4903          25 SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNL   67 (73)
Q Consensus        25 ~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~   67 (73)
                      +..++..+++....+.|....-...-|..+-...+.+|++...
T Consensus        59 ~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~  101 (386)
T PF14583_consen   59 NRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKNG  101 (386)
T ss_dssp             S-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEETT
T ss_pred             CcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEECC
Confidence            4667777777655555554322222245555667888777543


No 101
>PRK10757 inositol monophosphatase; Provisional
Probab=21.02  E-value=75  Score=21.25  Aligned_cols=16  Identities=13%  Similarity=0.030  Sum_probs=14.1

Q ss_pred             cceEEeCCCCeEEEeC
Q psy4903           9 SGENRTREQALMFVAD   24 (73)
Q Consensus         9 ~g~~~dp~~g~lywtd   24 (73)
                      -|.+.+|..+.+||+.
T Consensus       113 ~GvV~~P~~~~~~~A~  128 (267)
T PRK10757        113 VAVVYDPMRNELFTAT  128 (267)
T ss_pred             EEEEEcCCCCCEEEEE
Confidence            3788999999999986


Done!