Query         psy4773
Match_columns 160
No_of_seqs    129 out of 569
Neff          6.9 
Searched_HMMs 46136
Date          Fri Aug 16 17:20:14 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4773.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4773hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1520|consensus              100.0 5.3E-40 1.2E-44  277.4  13.0  141    3-159   235-376 (376)
  2 COG3386 Gluconolactonase [Carb  99.1 8.1E-10 1.8E-14   92.8  11.2   93    2-150   178-277 (307)
  3 PF08450 SGL:  SMP-30/Gluconola  99.0 2.8E-09   6E-14   85.3   9.5   88    3-148   150-245 (246)
  4 KOG4499|consensus               98.0 4.2E-05 9.2E-10   62.5   9.1   88    2-147   173-273 (310)
  5 COG4257 Vgb Streptogramin lyas  96.3   0.027 5.8E-07   47.3   8.3   92    3-157   119-216 (353)
  6 PF03088 Str_synth:  Strictosid  95.8  0.0094   2E-07   41.5   3.0   43    7-53     35-77  (89)
  7 TIGR03032 conserved hypothetic  95.1    0.09   2E-06   44.7   7.1   48    3-56    217-265 (335)
  8 PF08450 SGL:  SMP-30/Gluconola  93.7     1.2 2.7E-05   35.2  10.6   88   20-158    68-163 (246)
  9 PLN02919 haloacid dehalogenase  91.9     2.6 5.7E-05   41.2  11.9   30  130-159   858-888 (1057)
 10 COG3386 Gluconolactonase [Carb  89.9     4.6 9.9E-05   34.1  10.2   95   21-159    94-193 (307)
 11 TIGR02604 Piru_Ver_Nterm putat  87.9      10 0.00022   32.3  11.1   46    8-53     89-144 (367)
 12 PLN02919 haloacid dehalogenase  85.4      22 0.00047   35.0  13.1   44    3-50    584-642 (1057)
 13 PF07494 Reg_prop:  Two compone  84.2    0.98 2.1E-05   23.5   1.9   13   38-50     10-22  (24)
 14 PF05787 DUF839:  Bacterial pro  82.8     1.9   4E-05   39.1   4.4   45    9-53    391-456 (524)
 15 PF07995 GSDH:  Glucose / Sorbo  82.6     1.8   4E-05   36.5   4.0   46    8-53     81-134 (331)
 16 PF06977 SdiA-regulated:  SdiA-  82.2     3.4 7.3E-05   33.9   5.3   49  100-148   185-240 (248)
 17 PF01436 NHL:  NHL repeat;  Int  80.1    0.91   2E-05   24.4   0.9   19   34-52      3-21  (28)
 18 COG4257 Vgb Streptogramin lyas  79.1      17 0.00037   30.9   8.4   29   19-52    217-252 (353)
 19 PRK11028 6-phosphogluconolacto  79.0      34 0.00074   28.0  10.6   39    8-48     56-95  (330)
 20 PF07995 GSDH:  Glucose / Sorbo  77.6     1.7 3.7E-05   36.6   2.3   49    3-51    275-324 (331)
 21 COG3204 Uncharacterized protei  72.6      11 0.00023   32.1   5.6   52   99-150   246-304 (316)
 22 TIGR03606 non_repeat_PQQ dehyd  68.8      26 0.00055   31.4   7.5   46    8-53    115-166 (454)
 23 KOG0639|consensus               66.7      33 0.00072   31.4   7.7   63    6-75    436-505 (705)
 24 KOG1520|consensus               66.4     6.1 0.00013   34.4   3.1   45    5-53    195-239 (376)
 25 COG3211 PhoX Predicted phospha  64.6     5.8 0.00012   36.5   2.7   45    9-53    455-520 (616)
 26 KOG0404|consensus               63.8     3.5 7.6E-05   34.1   1.1   16   24-39     53-68  (322)
 27 cd08033 LARP_6 La RNA-binding   63.3       3 6.6E-05   28.2   0.5   14  108-121    63-76  (77)
 28 PRK11028 6-phosphogluconolacto  62.1      86  0.0019   25.6  11.7   40    8-49    196-245 (330)
 29 COG2088 SpoVG Uncharacterized   60.8      26 0.00056   24.5   4.7   69    7-77      4-74  (95)
 30 PF03022 MRJP:  Major royal jel  55.8      65  0.0014   26.7   7.4   64   31-149   184-255 (287)
 31 PHA02865 MHC-like TNF binding   53.7      12 0.00025   32.3   2.6   23   31-53    264-289 (338)
 32 PF06977 SdiA-regulated:  SdiA-  52.8      44 0.00095   27.3   5.8   54  101-156    37-91  (248)
 33 cd08030 LA_like_plant La-motif  50.7      19 0.00042   25.0   2.9   14  108-121    76-89  (90)
 34 cd08031 LARP_4_5_like La RNA-b  48.8      18 0.00039   24.3   2.4   14  108-121    61-74  (75)
 35 TIGR02604 Piru_Ver_Nterm putat  46.3      32  0.0007   29.2   4.2   50    3-52    286-340 (367)
 36 cd08035 LARP_4 La RNA-binding   44.5     8.1 0.00018   26.0   0.2   14  108-121    61-74  (75)
 37 COG3292 Predicted periplasmic   43.0      58  0.0013   30.3   5.4   48  110-157   227-274 (671)
 38 COG3292 Predicted periplasmic   41.6      87  0.0019   29.2   6.3   33  115-149   362-395 (671)
 39 cd08036 LARP_5 La RNA-binding   40.9      11 0.00023   25.5   0.3   14  108-121    61-74  (75)
 40 COG2133 Glucose/sorbosone dehy  40.0      84  0.0018   27.7   5.8   44   10-53    148-197 (399)
 41 COG3204 Uncharacterized protei  39.6      68  0.0015   27.3   5.0   51  101-151   101-160 (316)
 42 PF09154 DUF1939:  Domain of un  37.0      69  0.0015   20.2   3.7   38   12-53     12-49  (57)
 43 PF00058 Ldl_recept_b:  Low-den  35.5      53  0.0011   19.0   2.8   36    3-41      5-41  (42)
 44 COG4294 Uve UV damage repair e  32.7      10 0.00022   32.3  -0.9   36    1-39    167-205 (347)
 45 cd08032 LARP_7 La RNA-binding   32.6      20 0.00044   24.5   0.7   14  108-121    68-81  (82)
 46 PF13570 PQQ_3:  PQQ-like domai  32.4      95  0.0021   17.3   4.2   20  133-152    14-33  (40)
 47 PF06739 SBBP:  Beta-propeller   32.3      42 0.00091   19.2   1.9   20   34-53     14-33  (38)
 48 cd06481 ACD_HspB9_like Alpha c  32.0      30 0.00066   23.4   1.5   28   22-50      7-36  (87)
 49 cd08029 LA_like_fungal La-moti  31.9      42 0.00091   22.5   2.1   14  108-121    62-75  (76)
 50 PF11208 DUF2992:  Protein of u  31.8      27 0.00059   26.0   1.3   12   43-54      6-17  (132)
 51 PF06236 MelC1:  Tyrosinase co-  31.4      43 0.00092   24.8   2.3   47    2-52     44-95  (125)
 52 TIGR03032 conserved hypothetic  31.1 2.9E+02  0.0064   23.8   7.5   30  130-159   202-231 (335)
 53 PF06079 Apyrase:  Apyrase;  In  30.2 2.6E+02  0.0056   23.7   6.9   53  100-153    66-123 (291)
 54 PTZ00486 apyrase Superfamily;   29.3 3.4E+02  0.0073   23.6   7.6   53  100-152   127-185 (352)
 55 PF13449 Phytase-like:  Esteras  29.2      92   0.002   26.0   4.3   18   36-53     88-105 (326)
 56 COG2706 3-carboxymuconate cycl  28.9 3.9E+02  0.0084   23.2  11.3   52  108-159    66-119 (346)
 57 PHA02598 denA endonuclease II;  28.6      71  0.0015   24.0   3.1   39  110-148    13-51  (138)
 58 PF13600 DUF4140:  N-terminal d  27.9 1.1E+02  0.0024   20.9   3.9   26   20-45     20-47  (104)
 59 PF03022 MRJP:  Major royal jel  27.7 3.5E+02  0.0076   22.3  10.2   52  108-159    34-96  (287)
 60 KOG0639|consensus               26.6      72  0.0016   29.3   3.3   44    3-52    525-572 (705)
 61 TIGR03606 non_repeat_PQQ dehyd  26.5 4.7E+02    0.01   23.4   9.2   29   22-51     20-48  (454)
 62 PRK11138 outer membrane biogen  25.4 2.5E+02  0.0054   23.7   6.3   43  107-151   344-387 (394)
 63 TIGR02276 beta_rpt_yvtn 40-res  24.4 1.3E+02  0.0029   16.3   3.5   19  140-158     3-21  (42)
 64 PF14583 Pectate_lyase22:  Olig  24.3 2.5E+02  0.0055   24.7   6.1   83    9-127   102-188 (386)
 65 PF03088 Str_synth:  Strictosid  24.0 2.5E+02  0.0053   19.3   8.1   51  105-158    34-86  (89)
 66 PF01731 Arylesterase:  Arylest  23.6 1.1E+02  0.0024   20.9   3.1   30   22-52     44-74  (86)
 67 PF06089 Asparaginase_II:  L-as  23.3      86  0.0019   26.9   3.0   24  105-128    13-36  (324)
 68 cd07323 LAM LA motif RNA-bindi  22.3      41 0.00089   22.3   0.7   15   63-77     36-50  (75)
 69 PF07463 NUMOD4:  NUMOD4 motif;  22.3      77  0.0017   19.2   1.9   23   25-48      1-23  (51)
 70 TIGR03300 assembly_YfgL outer   22.2 2.5E+02  0.0054   23.4   5.7   42  106-150    73-115 (377)
 71 PF05411 Peptidase_C32:  Equine  21.7      44 0.00095   24.7   0.8   42  107-148    58-102 (127)
 72 TIGR03300 assembly_YfgL outer   21.0 3.2E+02   0.007   22.7   6.1   44  106-151   328-372 (377)
 73 PF01502 PRA-CH:  Phosphoribosy  20.8 2.1E+02  0.0046   19.2   3.9   33   13-53     28-60  (75)
 74 PF04398 DUF538:  Protein of un  20.7      86  0.0019   22.4   2.2   24   30-53     10-39  (110)
 75 PHA02122 hypothetical protein   20.7 2.4E+02  0.0053   18.0   4.0   33  114-147    28-60  (65)
 76 PF13360 PQQ_2:  PQQ-like domai  20.1 3.9E+02  0.0084   20.1   7.1   47  108-158    46-93  (238)

No 1  
>KOG1520|consensus
Probab=100.00  E-value=5.3e-40  Score=277.38  Aligned_cols=141  Identities=33%  Similarity=0.603  Sum_probs=123.5

Q ss_pred             cccCCCcEEEEEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhh
Q psy4773           3 QTDYCNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLL   82 (160)
Q Consensus         3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~   82 (160)
                      +|||+++||+||||+|+|+|++|+|++|||||||||+++++|+||||+.+.|+    .+.+++.+|||+||++.++|..+
T Consensus       235 ~~Et~~~ri~rywi~g~k~gt~EvFa~~LPG~PDNIR~~~~G~fWVal~~~~~----~~~~~~~~~p~vr~~~~~~~~~~  310 (376)
T KOG1520|consen  235 VAETTTARIKRYWIKGPKAGTSEVFAEGLPGYPDNIRRDSTGHFWVALHSKRS----TLWRLLMKYPWVRKFIAKLPKYM  310 (376)
T ss_pred             EEeeccceeeeeEecCCccCchhhHhhcCCCCCcceeECCCCCEEEEEecccc----hHHHhhhcChHHHHHHHhhccch
Confidence            79999999999999999999999999999999999999999999999999854    48999999999999999998765


Q ss_pred             ccCCccccceeccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCC-CcCceEEEEECCEEEEecCCCCeEEEEeCC
Q psy4773          83 EKIPSEDVKHVVGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGS-LKGSSDVEEYNGAYYFGSPISKHLARVPLA  159 (160)
Q Consensus        83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~-~~~iS~v~e~~g~LylGS~~~~~i~~~~l~  159 (160)
                      ...      .++.      .....++.|.++|++|+|+++|||++++ +..+|+|.|++|+||+||+.++||+|++|+
T Consensus       311 ~~~------~~~~------~~~~p~~~V~~~d~~G~il~~lhD~~g~~~~~~sev~E~dg~LyiGS~~~p~i~~lkl~  376 (376)
T KOG1520|consen  311 ELL------YFLN------NGGKPHSAVKLSDETGKILESLHDKEGKVITLVSEVGEHDGHLYIGSLFNPYIARLKLP  376 (376)
T ss_pred             hhh------hhhh------ccCCCceEEEEecCCCcEEEEEecCCCCceEEEEEEeecCCeEEEcccCcceeEEEecC
Confidence            431      1110      0112237788888999999999999974 799999999999999999999999999985


No 2  
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=99.11  E-value=8.1e-10  Score=92.78  Aligned_cols=93  Identities=26%  Similarity=0.330  Sum_probs=76.0

Q ss_pred             CcccCCCcEEEEEEec---CCCCCcc-eEecccCCCCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHH
Q psy4773           2 GQTDYCNFSFWRYYLK---GPKQGKS-EVFIDGLPGLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVAR   77 (160)
Q Consensus         2 ~~~Et~~~Ri~Ry~l~---G~k~G~~-evfid~LPG~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~   77 (160)
                      =++||..+||+||++.   |+..+.. .++.+.-||+||+++.|++|++|++....                        
T Consensus       178 y~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG~lw~~a~~~------------------------  233 (307)
T COG3386         178 YVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDADGNLWVAAVWG------------------------  233 (307)
T ss_pred             EEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCCCCEEEecccC------------------------
Confidence            3789999999999998   7765542 45565678999999999999999733221                        


Q ss_pred             hhhhhccCCccccceeccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEEC---CEEEEecCCC
Q psy4773          78 FLHLLEKIPSEDVKHVVGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEYN---GAYYFGSPIS  150 (160)
Q Consensus        78 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~---g~LylGS~~~  150 (160)
                                                   -+.|.+++++|+.+..+..|.   ..+|.+...|   ++||+.|...
T Consensus       234 -----------------------------g~~v~~~~pdG~l~~~i~lP~---~~~t~~~FgG~~~~~L~iTs~~~  277 (307)
T COG3386         234 -----------------------------GGRVVRFNPDGKLLGEIKLPV---KRPTNPAFGGPDLNTLYITSARS  277 (307)
T ss_pred             -----------------------------CceEEEECCCCcEEEEEECCC---CCCccceEeCCCcCEEEEEecCC
Confidence                                         046889999999999999997   5688888887   9999999888


No 3  
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=99.00  E-value=2.8e-09  Score=85.28  Aligned_cols=88  Identities=24%  Similarity=0.403  Sum_probs=70.4

Q ss_pred             cccCCCcEEEEEEecCC--CCCcceEecccCC---CCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHH
Q psy4773           3 QTDYCNFSFWRYYLKGP--KQGKSEVFIDGLP---GLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVAR   77 (160)
Q Consensus         3 ~~Et~~~Ri~Ry~l~G~--k~G~~evfid~LP---G~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~   77 (160)
                      ++||.++||+||.++.+  ..+..++|++ ++   |+||++..|++|++|||....                        
T Consensus       150 v~ds~~~~i~~~~~~~~~~~~~~~~~~~~-~~~~~g~pDG~~vD~~G~l~va~~~~------------------------  204 (246)
T PF08450_consen  150 VADSFNGRIWRFDLDADGGELSNRRVFID-FPGGPGYPDGLAVDSDGNLWVADWGG------------------------  204 (246)
T ss_dssp             EEETTTTEEEEEEEETTTCCEEEEEEEEE--SSSSCEEEEEEEBTTS-EEEEEETT------------------------
T ss_pred             ecccccceeEEEeccccccceeeeeeEEE-cCCCCcCCCcceEcCCCCEEEEEcCC------------------------
Confidence            48999999999999754  3466788876 54   469999999999999998763                        


Q ss_pred             hhhhhccCCccccceeccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEEC---CEEEEecC
Q psy4773          78 FLHLLEKIPSEDVKHVVGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEYN---GAYYFGSP  148 (160)
Q Consensus        78 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~---g~LylGS~  148 (160)
                                                    +.|.++|++|+++..++.|.   +.+|.+...+   ++||+.|-
T Consensus       205 ------------------------------~~I~~~~p~G~~~~~i~~p~---~~~t~~~fgg~~~~~L~vTta  245 (246)
T PF08450_consen  205 ------------------------------GRIVVFDPDGKLLREIELPV---PRPTNCAFGGPDGKTLYVTTA  245 (246)
T ss_dssp             ------------------------------TEEEEEETTSCEEEEEE-SS---SSEEEEEEESTTSSEEEEEEB
T ss_pred             ------------------------------CEEEEECCCccEEEEEcCCC---CCEEEEEEECCCCCEEEEEeC
Confidence                                          45789999999999999884   4789999853   88999874


No 4  
>KOG4499|consensus
Probab=98.02  E-value=4.2e-05  Score=62.53  Aligned_cols=88  Identities=18%  Similarity=0.207  Sum_probs=70.4

Q ss_pred             CcccCCCcEE--EEEEecCCCCCcceEecccC-------CCCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHH
Q psy4773           2 GQTDYCNFSF--WRYYLKGPKQGKSEVFIDGL-------PGLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIR   72 (160)
Q Consensus         2 ~~~Et~~~Ri--~Ry~l~G~k~G~~evfid~L-------PG~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lR   72 (160)
                      -|.||..+.|  ..|+..+-.....++.++ |       +=.||++..|.+|.+|||.++.                   
T Consensus       173 Y~iDsln~~V~a~dyd~~tG~~snr~~i~d-lrk~~~~e~~~PDGm~ID~eG~L~Va~~ng-------------------  232 (310)
T KOG4499|consen  173 YYIDSLNYEVDAYDYDCPTGDLSNRKVIFD-LRKSQPFESLEPDGMTIDTEGNLYVATFNG-------------------  232 (310)
T ss_pred             EEEccCceEEeeeecCCCcccccCcceeEE-eccCCCcCCCCCCcceEccCCcEEEEEecC-------------------
Confidence            4789999999  555577666666666665 3       2389999999999999999885                   


Q ss_pred             HHHHHhhhhhccCCccccceeccceeeeeeecCCeEEEEEEeC-CCeEEEEEEcCCCCCcCceEEEEEC---CEEEEec
Q psy4773          73 KFVARFLHLLEKIPSEDVKHVVGHFESILFLQGTRYTLLVISS-QGEIVDALHSVDGSLKGSSDVEEYN---GAYYFGS  147 (160)
Q Consensus        73 k~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~-~G~il~~l~d~~g~~~~iS~v~e~~---g~LylGS  147 (160)
                                                         +.|.++|+ +||++..+.-|.   +.||++.+.|   +-||...
T Consensus       233 -----------------------------------~~V~~~dp~tGK~L~eiklPt---~qitsccFgGkn~d~~yvT~  273 (310)
T KOG4499|consen  233 -----------------------------------GTVQKVDPTTGKILLEIKLPT---PQITSCCFGGKNLDILYVTT  273 (310)
T ss_pred             -----------------------------------cEEEEECCCCCcEEEEEEcCC---CceEEEEecCCCccEEEEEe
Confidence                                               66899995 699999999996   6799999887   5677654


No 5  
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=96.30  E-value=0.027  Score=47.34  Aligned_cols=92  Identities=20%  Similarity=0.251  Sum_probs=70.4

Q ss_pred             cccCCCcEEEEEEecCCCCCcceEecccCCC--CCCcee---ecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHH
Q psy4773           3 QTDYCNFSFWRYYLKGPKQGKSEVFIDGLPG--LPDNVK---RDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVAR   77 (160)
Q Consensus         3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~LPG--~PDNI~---~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~   77 (160)
                      +|||.. -|.|+   ++|.++.+.|-  ||+  -++|+.   .|++|++|-.-...                        
T Consensus       119 itd~~~-aI~R~---dpkt~evt~f~--lp~~~a~~nlet~vfD~~G~lWFt~q~G------------------------  168 (353)
T COG4257         119 ITDTGL-AIGRL---DPKTLEVTRFP--LPLEHADANLETAVFDPWGNLWFTGQIG------------------------  168 (353)
T ss_pred             EecCcc-eeEEe---cCcccceEEee--cccccCCCcccceeeCCCccEEEeeccc------------------------
Confidence            578877 89998   56778888886  675  688885   68999999754322                        


Q ss_pred             hhhhhccCCccccceeccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEE-ECCEEEEecCCCCeEEEE
Q psy4773          78 FLHLLEKIPSEDVKHVVGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEE-YNGAYYFGSPISKHLARV  156 (160)
Q Consensus        78 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e-~~g~LylGS~~~~~i~~~  156 (160)
                                                  -++   |+|+.-..++.+..|.|.  ....++. -+|.+|..|+..+||+++
T Consensus       169 ----------------------------~yG---rLdPa~~~i~vfpaPqG~--gpyGi~atpdGsvwyaslagnaiari  215 (353)
T COG4257         169 ----------------------------AYG---RLDPARNVISVFPAPQGG--GPYGICATPDGSVWYASLAGNAIARI  215 (353)
T ss_pred             ----------------------------cce---ecCcccCceeeeccCCCC--CCcceEECCCCcEEEEeccccceEEc
Confidence                                        123   789888999999998864  2345554 479999999999999998


Q ss_pred             e
Q psy4773         157 P  157 (160)
Q Consensus       157 ~  157 (160)
                      +
T Consensus       216 d  216 (353)
T COG4257         216 D  216 (353)
T ss_pred             c
Confidence            7


No 6  
>PF03088 Str_synth:  Strictosidine synthase;  InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=95.81  E-value=0.0094  Score=41.52  Aligned_cols=43  Identities=21%  Similarity=0.324  Sum_probs=34.1

Q ss_pred             CCcEEEEEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecC
Q psy4773           7 CNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus         7 ~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~   53 (160)
                      ...|++||..+   .++.++++++|. ||.+|..++||.|.+-.-+.
T Consensus        35 ~~GRll~ydp~---t~~~~vl~~~L~-fpNGVals~d~~~vlv~Et~   77 (89)
T PF03088_consen   35 PTGRLLRYDPS---TKETTVLLDGLY-FPNGVALSPDESFVLVAETG   77 (89)
T ss_dssp             --EEEEEEETT---TTEEEEEEEEES-SEEEEEE-TTSSEEEEEEGG
T ss_pred             CCcCEEEEECC---CCeEEEehhCCC-ccCeEEEcCCCCEEEEEecc
Confidence            46899999755   588999999999 89999999999987755443


No 7  
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=95.12  E-value=0.09  Score=44.74  Aligned_cols=48  Identities=17%  Similarity=0.093  Sum_probs=40.1

Q ss_pred             cccCCCcEEEEEEecCCCCCcceEecccCCCCCCceeecCCCC-eEEEeecCCCC
Q psy4773           3 QTDYCNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGN-FLVSLVCPVDE   56 (160)
Q Consensus         3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~-fWval~~~r~~   56 (160)
                      ++|+.+.++.+++.+   .|+.|+.++ +||+|-+|-..  |. ..|+++.+|.+
T Consensus       217 vldsgtGev~~vD~~---~G~~e~Va~-vpG~~rGL~f~--G~llvVgmSk~R~~  265 (335)
T TIGR03032       217 LLNSGRGELGYVDPQ---AGKFQPVAF-LPGFTRGLAFA--GDFAFVGLSKLRES  265 (335)
T ss_pred             EEECCCCEEEEEcCC---CCcEEEEEE-CCCCCccccee--CCEEEEEeccccCC
Confidence            678899999999754   478999997 99999999988  55 56999999753


No 8  
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=93.71  E-value=1.2  Score=35.16  Aligned_cols=88  Identities=19%  Similarity=0.265  Sum_probs=59.2

Q ss_pred             CCCcceEecccCC------CCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhhccCCcccccee
Q psy4773          20 KQGKSEVFIDGLP------GLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLLEKIPSEDVKHV   93 (160)
Q Consensus        20 k~G~~evfid~LP------G~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~~~~~~~~~~~~   93 (160)
                      +.|+.+.+++ ++      -.|..+..|++|++|++-.....   ..                                 
T Consensus        68 ~~g~~~~~~~-~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~---~~---------------------------------  110 (246)
T PF08450_consen   68 DTGKVTVLAD-LPDGGVPFNRPNDVAVDPDGNLYVTDSGGGG---AS---------------------------------  110 (246)
T ss_dssp             TTTEEEEEEE-EETTCSCTEEEEEEEE-TTS-EEEEEECCBC---TT---------------------------------
T ss_pred             CCCcEEEEee-ccCCCcccCCCceEEEcCCCCEEEEecCCCc---cc---------------------------------
Confidence            4578888887 42      36777889999999999876521   00                                 


Q ss_pred             ccceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEE--CCEEEEecCCCCeEEEEeC
Q psy4773          94 VGHFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEY--NGAYYFGSPISKHLARVPL  158 (160)
Q Consensus        94 ~~~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~--~g~LylGS~~~~~i~~~~l  158 (160)
                                ....+.+.+++.+|++.....+-    ...-.+...  +..||+.......|.++++
T Consensus       111 ----------~~~~g~v~~~~~~~~~~~~~~~~----~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~  163 (246)
T PF08450_consen  111 ----------GIDPGSVYRIDPDGKVTVVADGL----GFPNGIAFSPDGKTLYVADSFNGRIWRFDL  163 (246)
T ss_dssp             ----------CGGSEEEEEEETTSEEEEEEEEE----SSEEEEEEETTSSEEEEEETTTTEEEEEEE
T ss_pred             ----------cccccceEEECCCCeEEEEecCc----ccccceEECCcchheeecccccceeEEEec
Confidence                      00117889999998877766542    233444444  4579999999999998886


No 9  
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=91.91  E-value=2.6  Score=41.20  Aligned_cols=30  Identities=13%  Similarity=0.196  Sum_probs=22.6

Q ss_pred             CcCceEEEEE-CCEEEEecCCCCeEEEEeCC
Q psy4773         130 LKGSSDVEEY-NGAYYFGSPISKHLARVPLA  159 (160)
Q Consensus       130 ~~~iS~v~e~-~g~LylGS~~~~~i~~~~l~  159 (160)
                      +...+++... +|+||+....++-|-++++.
T Consensus       858 l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~  888 (1057)
T PLN02919        858 LSEPAGLALGENGRLFVADTNNSLIRYLDLN  888 (1057)
T ss_pred             cCCceEEEEeCCCCEEEEECCCCEEEEEECC
Confidence            4456666654 68899999999999888764


No 10 
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=89.94  E-value=4.6  Score=34.08  Aligned_cols=95  Identities=15%  Similarity=0.157  Sum_probs=58.3

Q ss_pred             CCcc-eEecccCCCCCCc----eeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhhccCCccccceecc
Q psy4773          21 QGKS-EVFIDGLPGLPDN----VKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLLEKIPSEDVKHVVG   95 (160)
Q Consensus        21 ~G~~-evfid~LPG~PDN----I~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~~~~~~~~~~~~~~   95 (160)
                      .|.. +.+++.-+|.|+|    .+.+++|.||++-... .++..                                    
T Consensus        94 ~~~~~t~~~~~~~~~~~~r~ND~~v~pdG~~wfgt~~~-~~~~~------------------------------------  136 (307)
T COG3386          94 TGGKITLLAEPEDGLPLNRPNDGVVDPDGRIWFGDMGY-FDLGK------------------------------------  136 (307)
T ss_pred             CCceeEEeccccCCCCcCCCCceeEcCCCCEEEeCCCc-cccCc------------------------------------
Confidence            3444 6777756665554    5688999999988652 00000                                    


Q ss_pred             ceeeeeeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCCeEEEEeCC
Q psy4773          96 HFESILFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISKHLARVPLA  159 (160)
Q Consensus        96 ~~~~~~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~~i~~~~l~  159 (160)
                            .-.++.+.+.++|++|.+++.+.+.- .++.--.....+..||+.--..+.|-|++++
T Consensus       137 ------~~~~~~G~lyr~~p~g~~~~l~~~~~-~~~NGla~SpDg~tly~aDT~~~~i~r~~~d  193 (307)
T COG3386         137 ------SEERPTGSLYRVDPDGGVVRLLDDDL-TIPNGLAFSPDGKTLYVADTPANRIHRYDLD  193 (307)
T ss_pred             ------cccCCcceEEEEcCCCCEEEeecCcE-EecCceEECCCCCEEEEEeCCCCeEEEEecC
Confidence                  01233478999999899988887621 1111111222345899988888888888764


No 11 
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=87.87  E-value=10  Score=32.27  Aligned_cols=46  Identities=15%  Similarity=0.195  Sum_probs=33.7

Q ss_pred             CcEEEEEEe-cCC--CCCcceEecccCCCC-------CCceeecCCCCeEEEeecC
Q psy4773           8 NFSFWRYYL-KGP--KQGKSEVFIDGLPGL-------PDNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus         8 ~~Ri~Ry~l-~G~--k~G~~evfid~LPG~-------PDNI~~~~~G~fWval~~~   53 (160)
                      ..+|+||.- .|.  ..|+.|++++.+++-       |.++..+++|.+|++....
T Consensus        89 ~~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gpDG~LYv~~G~~  144 (367)
T TIGR02604        89 PPDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGPDGWLYFNHGNT  144 (367)
T ss_pred             CCeEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECCCCCEEEecccC
Confidence            457888843 343  234788999877652       6789999999999988753


No 12 
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=85.40  E-value=22  Score=35.00  Aligned_cols=44  Identities=20%  Similarity=0.105  Sum_probs=29.1

Q ss_pred             cccCCCcEEEEEEecCCCCCcceEeccc--------------CCCCCCceeecCCCC-eEEEe
Q psy4773           3 QTDYCNFSFWRYYLKGPKQGKSEVFIDG--------------LPGLPDNVKRDSKGN-FLVSL   50 (160)
Q Consensus         3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~--------------LPG~PDNI~~~~~G~-fWval   50 (160)
                      ++|+..+||+++++.|.    ...++..              .-..|-+|..|++|+ +||+=
T Consensus       584 VaDs~n~rI~v~d~~G~----~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaD  642 (1057)
T PLN02919        584 ISDSNHNRIVVTDLDGN----FIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVAD  642 (1057)
T ss_pred             EEECCCCeEEEEeCCCC----EEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEe
Confidence            57899999999998874    2222221              112477888887765 66754


No 13 
>PF07494 Reg_prop:  Two component regulator propeller;  InterPro: IPR011110 A large group of two component regulator proteins appear to have the same N-terminal structure of 14 tandem repeats. These repeats show homology to members of IPR002372 from INTERPRO and IPR001680 from INTERPRO indicating that they are likely to form a beta-propeller. This family has been built with artificially high cut-offs in order to avoid overlaps with other beta-propeller families. The fourteen repeats are likely to form two propellers; it is not clear if these structures are likely to recruit other proteins or interact with DNA.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=84.19  E-value=0.98  Score=23.46  Aligned_cols=13  Identities=31%  Similarity=0.583  Sum_probs=10.0

Q ss_pred             eeecCCCCeEEEe
Q psy4773          38 VKRDSKGNFLVSL   50 (160)
Q Consensus        38 I~~~~~G~fWval   50 (160)
                      |..|++|.+|||-
T Consensus        10 i~~D~~G~lWigT   22 (24)
T PF07494_consen   10 IYEDSDGNLWIGT   22 (24)
T ss_dssp             EEE-TTSCEEEEE
T ss_pred             EEEcCCcCEEEEe
Confidence            4578999999985


No 14 
>PF05787 DUF839:  Bacterial protein of unknown function (DUF839);  InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=82.84  E-value=1.9  Score=39.08  Aligned_cols=45  Identities=27%  Similarity=0.301  Sum_probs=31.6

Q ss_pred             cEEEEEEecCCC----CCcceEecccC-----------------CCCCCceeecCCCCeEEEeecC
Q psy4773           9 FSFWRYYLKGPK----QGKSEVFIDGL-----------------PGLPDNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus         9 ~Ri~Ry~l~G~k----~G~~evfid~L-----------------PG~PDNI~~~~~G~fWval~~~   53 (160)
                      ..|+||..++..    ..+.++|+..-                 -..||||..|++|++||+-=..
T Consensus       391 G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~~~~~~~~f~sPDNL~~d~~G~LwI~eD~~  456 (524)
T PF05787_consen  391 GQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGSNKCDDNGFASPDNLAFDPDGNLWIQEDGG  456 (524)
T ss_pred             cEEEEecccCCccccceeEEEEEEEecCcccccccccCcccCCCcCCCCceEECCCCCEEEEeCCC
Confidence            478898887641    23556666521                 1279999999999999986443


No 15 
>PF07995 GSDH:  Glucose / Sorbosone dehydrogenase;  InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=82.62  E-value=1.8  Score=36.46  Aligned_cols=46  Identities=22%  Similarity=0.326  Sum_probs=33.1

Q ss_pred             CcEEEEEEecCC--CCCcceEecccCCCCCCc------eeecCCCCeEEEeecC
Q psy4773           8 NFSFWRYYLKGP--KQGKSEVFIDGLPGLPDN------VKRDSKGNFLVSLVCP   53 (160)
Q Consensus         8 ~~Ri~Ry~l~G~--k~G~~evfid~LPG~PDN------I~~~~~G~fWval~~~   53 (160)
                      ..||.||-++..  ..++.+++++.+|..+.+      |..+++|.+||++-..
T Consensus        81 ~~~v~r~~~~~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgpDG~LYvs~G~~  134 (331)
T PF07995_consen   81 DNRVVRFTLSDGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGPDGKLYVSVGDG  134 (331)
T ss_dssp             EEEEEEEEEETTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-TTSEEEEEEB-T
T ss_pred             ceeeEEEeccCCccccccceEEEEEeCCCCCCCCCCccccCCCCCcEEEEeCCC
Confidence            468999999765  456677777768864444      6788999999999876


No 16 
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=82.17  E-value=3.4  Score=33.85  Aligned_cols=49  Identities=24%  Similarity=0.377  Sum_probs=30.5

Q ss_pred             eeeecCCeEEEEEEeCCCeEEEEEEcCCC------CCcCceEEEEE-CCEEEEecC
Q psy4773         100 ILFLQGTRYTLLVISSQGEIVDALHSVDG------SLKGSSDVEEY-NGAYYFGSP  148 (160)
Q Consensus       100 ~~~~~~~~~~vv~vd~~G~il~~l~d~~g------~~~~iS~v~e~-~g~LylGS~  148 (160)
                      ++.++.+..+++++|.+|+++..+.-..+      .++..-.+... +|.||+-|=
T Consensus       185 lliLS~es~~l~~~d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE  240 (248)
T PF06977_consen  185 LLILSDESRLLLELDRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE  240 (248)
T ss_dssp             EEEEETTTTEEEEE-TT--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEET
T ss_pred             EEEEECCCCeEEEECCCCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcC
Confidence            45567777899999999999999987653      35666666664 599999983


No 17 
>PF01436 NHL:  NHL repeat;  InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ].  The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=80.09  E-value=0.91  Score=24.39  Aligned_cols=19  Identities=37%  Similarity=0.452  Sum_probs=16.3

Q ss_pred             CCCceeecCCCCeEEEeec
Q psy4773          34 LPDNVKRDSKGNFLVSLVC   52 (160)
Q Consensus        34 ~PDNI~~~~~G~fWval~~   52 (160)
                      .|-+|..+++|.+||+=..
T Consensus         3 ~P~gvav~~~g~i~VaD~~   21 (28)
T PF01436_consen    3 YPHGVAVDSDGNIYVADSG   21 (28)
T ss_dssp             SEEEEEEETTSEEEEEECC
T ss_pred             CCcEEEEeCCCCEEEEECC
Confidence            5889999999999998744


No 18 
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=79.07  E-value=17  Score=30.93  Aligned_cols=29  Identities=24%  Similarity=0.373  Sum_probs=18.2

Q ss_pred             CCCCcceEecccCCCCCCc-------eeecCCCCeEEEeec
Q psy4773          19 PKQGKSEVFIDGLPGLPDN-------VKRDSKGNFLVSLVC   52 (160)
Q Consensus        19 ~k~G~~evfid~LPG~PDN-------I~~~~~G~fWval~~   52 (160)
                      |++|..|+|..     |+.       |-.|+-|+.|+.-..
T Consensus       217 p~~~~aev~p~-----P~~~~~gsRriwsdpig~~wittwg  252 (353)
T COG4257         217 PFAGHAEVVPQ-----PNALKAGSRRIWSDPIGRAWITTWG  252 (353)
T ss_pred             cccCCcceecC-----CCcccccccccccCccCcEEEeccC
Confidence            45666676654     555       345677888887544


No 19 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=79.00  E-value=34  Score=27.99  Aligned_cols=39  Identities=28%  Similarity=0.601  Sum_probs=20.7

Q ss_pred             CcEEEEEEecCCCCCcceEecc-cCCCCCCceeecCCCCeEE
Q psy4773           8 NFSFWRYYLKGPKQGKSEVFID-GLPGLPDNVKRDSKGNFLV   48 (160)
Q Consensus         8 ~~Ri~Ry~l~G~k~G~~evfid-~LPG~PDNI~~~~~G~fWv   48 (160)
                      ..+|.-|.+++  .|+.+..-. .++|-|..|..+++|++..
T Consensus        56 ~~~i~~~~~~~--~g~l~~~~~~~~~~~p~~i~~~~~g~~l~   95 (330)
T PRK11028         56 EFRVLSYRIAD--DGALTFAAESPLPGSPTHISTDHQGRFLF   95 (330)
T ss_pred             CCcEEEEEECC--CCceEEeeeecCCCCceEEEECCCCCEEE
Confidence            35566666652  233221111 2556778888888876433


No 20 
>PF07995 GSDH:  Glucose / Sorbosone dehydrogenase;  InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=77.56  E-value=1.7  Score=36.61  Aligned_cols=49  Identities=29%  Similarity=0.468  Sum_probs=39.0

Q ss_pred             cccCCCcEEEEEEec-CCCCCcceEecccCCCCCCceeecCCCCeEEEee
Q psy4773           3 QTDYCNFSFWRYYLK-GPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLV   51 (160)
Q Consensus         3 ~~Et~~~Ri~Ry~l~-G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~   51 (160)
                      +++-..+||+++.++ +.+..+.+.|+.+..+.|-.|..++||.+||+-.
T Consensus       275 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~~~v~~~pDG~Lyv~~d  324 (331)
T PF07995_consen  275 VADYGGGRIWRLDLDEDGSVTEEEEFLGGFGGRPRDVAQGPDGALYVSDD  324 (331)
T ss_dssp             EEETTTTEEEEEEEETTEEEEEEEEECTTSSS-EEEEEEETTSEEEEEE-
T ss_pred             EecCCCCEEEEEeeecCCCccceEEccccCCCCceEEEEcCCCeEEEEEC
Confidence            466778999999997 3345678888887888899999999999999865


No 21 
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=72.56  E-value=11  Score=32.12  Aligned_cols=52  Identities=23%  Similarity=0.392  Sum_probs=37.3

Q ss_pred             eeeeecCCeEEEEEEeCCCeEEEEEEcCCC------CCcCceEEEE-ECCEEEEecCCC
Q psy4773          99 SILFLQGTRYTLLVISSQGEIVDALHSVDG------SLKGSSDVEE-YNGAYYFGSPIS  150 (160)
Q Consensus        99 ~~~~~~~~~~~vv~vd~~G~il~~l~d~~g------~~~~iS~v~e-~~g~LylGS~~~  150 (160)
                      +++.++.+..+++++|.+|++++.+....|      .++..-.|.- .+|.||+-|=-+
T Consensus       246 ~LLVLS~ESr~l~Evd~~G~~~~~lsL~~g~~gL~~dipqaEGiamDd~g~lYIvSEPn  304 (316)
T COG3204         246 SLLVLSDESRRLLEVDLSGEVIELLSLTKGNHGLSSDIPQAEGIAMDDDGNLYIVSEPN  304 (316)
T ss_pred             cEEEEecCCceEEEEecCCCeeeeEEeccCCCCCcccCCCcceeEECCCCCEEEEecCC
Confidence            456677888999999999999998876544      2344444544 359999987433


No 22 
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=68.82  E-value=26  Score=31.36  Aligned_cols=46  Identities=15%  Similarity=0.184  Sum_probs=31.8

Q ss_pred             CcEEEEEEecC--CCCCcceEecccCCCCC----CceeecCCCCeEEEeecC
Q psy4773           8 NFSFWRYYLKG--PKQGKSEVFIDGLPGLP----DNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus         8 ~~Ri~Ry~l~G--~k~G~~evfid~LPG~P----DNI~~~~~G~fWval~~~   53 (160)
                      ..||.||-+..  .+....++.++.+|...    -.|..++||.+||++-..
T Consensus       115 ~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~GgrI~FgPDG~LYVs~GD~  166 (454)
T TIGR03606       115 HTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHNGGRLVFGPDGKIYYTIGEQ  166 (454)
T ss_pred             CcEEEEEEecCCCCccccceEEEecCCCCCCcCCceEEECCCCcEEEEECCC
Confidence            57999999963  23333444455588532    367789999999998764


No 23 
>KOG0639|consensus
Probab=66.70  E-value=33  Score=31.42  Aligned_cols=63  Identities=21%  Similarity=0.281  Sum_probs=42.3

Q ss_pred             CCCcEEEEEE-ecCCCCCcceEecccCCCC-CCceee----cCCCC-eEEEeecCCCCcChhhhhhcCCChHHHHHH
Q psy4773           6 YCNFSFWRYY-LKGPKQGKSEVFIDGLPGL-PDNVKR----DSKGN-FLVSLVCPVDEYTPQLLHIIGPFPNIRKFV   75 (160)
Q Consensus         6 t~~~Ri~Ry~-l~G~k~G~~evfid~LPG~-PDNI~~----~~~G~-fWval~~~r~~~~~~~~~~l~~~P~lRk~i   75 (160)
                      |+..++.++| |+++  |..-... .|+.+ +||--+    -+||+ +.||--..    +-+++|+.++.|.++--+
T Consensus       436 TgGkgcVKVWdis~p--g~k~Pvs-qLdcl~rdnyiRSckL~pdgrtLivGGeas----tlsiWDLAapTprikael  505 (705)
T KOG0639|consen  436 TGGKGCVKVWDISQP--GNKSPVS-QLDCLNRDNYIRSCKLLPDGRTLIVGGEAS----TLSIWDLAAPTPRIKAEL  505 (705)
T ss_pred             ecCCCeEEEeeccCC--CCCCccc-cccccCcccceeeeEecCCCceEEeccccc----eeeeeeccCCCcchhhhc
Confidence            6777888887 4555  4444444 48887 999754    47776 55555432    446899999999887543


No 24 
>KOG1520|consensus
Probab=66.35  E-value=6.1  Score=34.42  Aligned_cols=45  Identities=20%  Similarity=0.365  Sum_probs=35.8

Q ss_pred             cCCCcEEEEEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecC
Q psy4773           5 DYCNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus         5 Et~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~   53 (160)
                      -....|++||+-+   +-++++++|+|- ||.++..++++.|-+-+-+.
T Consensus       195 g~~~GRl~~YD~~---tK~~~VLld~L~-F~NGlaLS~d~sfvl~~Et~  239 (376)
T KOG1520|consen  195 GDPTGRLFRYDPS---TKVTKVLLDGLY-FPNGLALSPDGSFVLVAETT  239 (376)
T ss_pred             CCCccceEEecCc---ccchhhhhhccc-ccccccCCCCCCEEEEEeec
Confidence            3467899999754   467889999998 79999999999987655443


No 25 
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=64.65  E-value=5.8  Score=36.47  Aligned_cols=45  Identities=18%  Similarity=0.203  Sum_probs=30.0

Q ss_pred             cEEEEEEecCC----CCCcceEeccc-----C------------CCCCCceeecCCCCeEEEeecC
Q psy4773           9 FSFWRYYLKGP----KQGKSEVFIDG-----L------------PGLPDNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus         9 ~Ri~Ry~l~G~----k~G~~evfid~-----L------------PG~PDNI~~~~~G~fWval~~~   53 (160)
                      .+|.||--.+.    ..++-|+|+.-     +            =+.||||..|+.|++||+--..
T Consensus       455 G~I~r~~p~~~d~t~~~ftWdlF~~aG~~~~~~~~~~~~~~~~~f~~PDnl~fD~~GrLWi~TDg~  520 (616)
T COG3211         455 GQIVRWIPATGDHTDTKFTWDLFVEAGNPSVLEGGASANINANWFNSPDNLAFDPWGRLWIQTDGS  520 (616)
T ss_pred             cceEEEecCCCCccCccceeeeeeecCCccccccccccCcccccccCCCceEECCCCCEEEEecCC
Confidence            46777655432    23467777751     1            1239999999999999987543


No 26 
>KOG0404|consensus
Probab=63.84  E-value=3.5  Score=34.13  Aligned_cols=16  Identities=25%  Similarity=0.775  Sum_probs=13.0

Q ss_pred             ceEecccCCCCCCcee
Q psy4773          24 SEVFIDGLPGLPDNVK   39 (160)
Q Consensus        24 ~evfid~LPG~PDNI~   39 (160)
                      +.+=++|.|||||+|.
T Consensus        53 TTT~veNfPGFPdgi~   68 (322)
T KOG0404|consen   53 TTTDVENFPGFPDGIT   68 (322)
T ss_pred             eeeccccCCCCCcccc
Confidence            4455789999999996


No 27 
>cd08033 LARP_6 La RNA-binding domain of La-related protein 6. This domain is found in animal and plant proteins related to the La autoantigen. A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=63.25  E-value=3  Score=28.17  Aligned_cols=14  Identities=36%  Similarity=0.479  Sum_probs=10.7

Q ss_pred             EEEEEEeCCCeEEE
Q psy4773         108 YTLLVISSQGEIVD  121 (160)
Q Consensus       108 ~~vv~vd~~G~il~  121 (160)
                      +-+++++++|+-|+
T Consensus        63 S~~lev~~d~~~VR   76 (77)
T cd08033          63 SSKLVVSEDGKKVR   76 (77)
T ss_pred             CCeEEEcCCCCccC
Confidence            44789999998664


No 28 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=62.07  E-value=86  Score=25.57  Aligned_cols=40  Identities=23%  Similarity=0.297  Sum_probs=21.7

Q ss_pred             CcEEEEEEecCCCCCcceEecccCCCCCC---------ceeecCCCCe-EEE
Q psy4773           8 NFSFWRYYLKGPKQGKSEVFIDGLPGLPD---------NVKRDSKGNF-LVS   49 (160)
Q Consensus         8 ~~Ri~Ry~l~G~k~G~~evfid~LPG~PD---------NI~~~~~G~f-Wva   49 (160)
                      ...|..|.++. ..|+.+.... +...|+         .|..+++|++ +++
T Consensus       196 ~~~v~v~~~~~-~~~~~~~~~~-~~~~p~~~~~~~~~~~i~~~pdg~~lyv~  245 (330)
T PRK11028        196 NSSVDVWQLKD-PHGEIECVQT-LDMMPADFSDTRWAADIHITPDGRHLYAC  245 (330)
T ss_pred             CCEEEEEEEeC-CCCCEEEEEE-EecCCCcCCCCccceeEEECCCCCEEEEe
Confidence            56777777763 2344444332 333333         4777888874 443


No 29 
>COG2088 SpoVG Uncharacterized protein, involved in the regulation of septum location [Cell envelope biogenesis, outer membrane]
Probab=60.78  E-value=26  Score=24.47  Aligned_cols=69  Identities=10%  Similarity=0.161  Sum_probs=39.9

Q ss_pred             CCcEEEEEEecCCCCCcceEecccCCCCCCcee-ecCCCCeEEEeecCCCCcChhhhhhcCC-ChHHHHHHHH
Q psy4773           7 CNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVK-RDSKGNFLVSLVCPVDEYTPQLLHIIGP-FPNIRKFVAR   77 (160)
Q Consensus         7 ~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~-~~~~G~fWval~~~r~~~~~~~~~~l~~-~P~lRk~i~~   77 (160)
                      +.-||.|..=+|.-.--..+=+||-= .--+|+ .+.+-++|||+++.|. ++..+-|..|+ ++..|+-|..
T Consensus         4 TdVRirkv~~dgrmkA~vsvT~D~ef-VvhdirVi~G~~GlfVAMPSrrt-~dgEFrDI~HPI~~~~R~kIq~   74 (95)
T COG2088           4 TDVRIRKVDTDGRMKAYVSVTLDNEF-VVHDIRVIEGNNGLFVAMPSRRT-PDGEFRDIAHPINSDTREKIQD   74 (95)
T ss_pred             eeEEEEEecCCCcEEEEEEEEecceE-EEeccEEEeCCcceEEEccCccC-CCcchhhccCcCCHHHHHHHHH
Confidence            34577777655431111122222111 123444 4566679999999876 55578888887 6788876554


No 30 
>PF03022 MRJP:  Major royal jelly protein;  InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=55.79  E-value=65  Score=26.66  Aligned_cols=64  Identities=14%  Similarity=0.090  Sum_probs=40.6

Q ss_pred             CCCCCCceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhhccCCccccceeccceeeeeeecCCeEEE
Q psy4773          31 LPGLPDNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLLEKIPSEDVKHVVGHFESILFLQGTRYTL  110 (160)
Q Consensus        31 LPG~PDNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v  110 (160)
                      -++--|++..|++|.++.+....                                                      ..|
T Consensus       184 k~~~s~g~~~D~~G~ly~~~~~~------------------------------------------------------~aI  209 (287)
T PF03022_consen  184 KGSQSDGMAIDPNGNLYFTDVEQ------------------------------------------------------NAI  209 (287)
T ss_dssp             ---SECEEEEETTTEEEEEECCC------------------------------------------------------TEE
T ss_pred             cCCCCceEEECCCCcEEEecCCC------------------------------------------------------CeE
Confidence            44567999999999999888663                                                      236


Q ss_pred             EEEeCCC-----eEEEEEEcCCCCCcCceEEEEEC---CEEEEecCC
Q psy4773         111 LVISSQG-----EIVDALHSVDGSLKGSSDVEEYN---GAYYFGSPI  149 (160)
Q Consensus       111 v~vd~~G-----~il~~l~d~~g~~~~iS~v~e~~---g~LylGS~~  149 (160)
                      .+.|+++     +.--..+|+. .+..+++.....   |+||+-|-.
T Consensus       210 ~~w~~~~~~~~~~~~~l~~d~~-~l~~pd~~~i~~~~~g~L~v~snr  255 (287)
T PF03022_consen  210 GCWDPDGPYTPENFEILAQDPR-TLQWPDGLKIDPEGDGYLWVLSNR  255 (287)
T ss_dssp             EEEETTTSB-GCCEEEEEE-CC--GSSEEEEEE-T--TS-EEEEE-S
T ss_pred             EEEeCCCCcCccchheeEEcCc-eeeccceeeeccccCceEEEEECc
Confidence            7788888     5545556654 355667776655   999997643


No 31 
>PHA02865 MHC-like TNF binding protein; Provisional
Probab=53.73  E-value=12  Score=32.30  Aligned_cols=23  Identities=39%  Similarity=0.720  Sum_probs=19.5

Q ss_pred             CCCCCCceeecCCC-Ce--EEEeecC
Q psy4773          31 LPGLPDNVKRDSKG-NF--LVSLVCP   53 (160)
Q Consensus        31 LPG~PDNI~~~~~G-~f--Wval~~~   53 (160)
                      -||||-|++.|.|| +|  |+++..+
T Consensus       264 e~~~~~~~~pn~DggTfQ~~~~v~v~  289 (338)
T PHA02865        264 EPGFPTNTKKDNDKNTFSSTPSVRVP  289 (338)
T ss_pred             ccccccceeeCCCCCeeEEEEEEEeC
Confidence            58999999999995 75  9988665


No 32 
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=52.78  E-value=44  Score=27.32  Aligned_cols=54  Identities=13%  Similarity=0.190  Sum_probs=30.4

Q ss_pred             eeecCCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEE-CCEEEEecCCCCeEEEE
Q psy4773         101 LFLQGTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEY-NGAYYFGSPISKHLARV  156 (160)
Q Consensus       101 ~~~~~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~-~g~LylGS~~~~~i~~~  156 (160)
                      .....+.+.++++|.+|++++.+.-..  +...-++... +|.+.+.+=...-+-++
T Consensus        37 faV~d~~~~i~els~~G~vlr~i~l~g--~~D~EgI~y~g~~~~vl~~Er~~~L~~~   91 (248)
T PF06977_consen   37 FAVQDEPGEIYELSLDGKVLRRIPLDG--FGDYEGITYLGNGRYVLSEERDQRLYIF   91 (248)
T ss_dssp             EEEETTTTEEEEEETT--EEEEEE-SS---SSEEEEEE-STTEEEEEETTTTEEEEE
T ss_pred             EEEECCCCEEEEEcCCCCEEEEEeCCC--CCCceeEEEECCCEEEEEEcCCCcEEEE
Confidence            345566788999999999999997543  2233333333 46666666334444433


No 33 
>cd08030 LA_like_plant La-motif domain of plant proteins similar to the La autoantigen. This domain is found in plant proteins related to the La autoantigen. A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=50.73  E-value=19  Score=25.03  Aligned_cols=14  Identities=43%  Similarity=0.529  Sum_probs=10.6

Q ss_pred             EEEEEEeCCCeEEE
Q psy4773         108 YTLLVISSQGEIVD  121 (160)
Q Consensus       108 ~~vv~vd~~G~il~  121 (160)
                      +-++++++||+-|+
T Consensus        76 S~~levseD~~~VR   89 (90)
T cd08030          76 STLLKVSEDGKRVG   89 (90)
T ss_pred             CCEEEEcCCCCccC
Confidence            44789999997653


No 34 
>cd08031 LARP_4_5_like La RNA-binding domain of proteins similar to La-related proteins 4 and 5. This domain is found in proteins similar to La-related proteins 4 and 5 (LARP4, LARP5). A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=48.84  E-value=18  Score=24.28  Aligned_cols=14  Identities=21%  Similarity=0.287  Sum_probs=10.5

Q ss_pred             EEEEEEeCCCeEEE
Q psy4773         108 YTLLVISSQGEIVD  121 (160)
Q Consensus       108 ~~vv~vd~~G~il~  121 (160)
                      +.+++++++|+-|+
T Consensus        61 S~~lev~ed~~~VR   74 (75)
T cd08031          61 SPNVQVDEKGEKVR   74 (75)
T ss_pred             CCeEEEcCCCCccC
Confidence            44788999998653


No 35 
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=46.35  E-value=32  Score=29.17  Aligned_cols=50  Identities=24%  Similarity=0.197  Sum_probs=35.4

Q ss_pred             cccCCCcEEEEEEecCCC---CCcceEecccC-C-CCCCceeecCCCCeEEEeec
Q psy4773           3 QTDYCNFSFWRYYLKGPK---QGKSEVFIDGL-P-GLPDNVKRDSKGNFLVSLVC   52 (160)
Q Consensus         3 ~~Et~~~Ri~Ry~l~G~k---~G~~evfid~L-P-G~PDNI~~~~~G~fWval~~   52 (160)
                      ++|-...+|.|+.|+...   +++.+.|+... . +-|=.+..++||.+||+=..
T Consensus       286 v~~~~~~~v~~~~l~~~g~~~~~~~~~~l~~~~~~~rp~dv~~~pDG~Lyv~d~~  340 (367)
T TIGR02604       286 VGDAHGQLIVRYSLEPKGAGFKGERPEFLRSNDTWFRPVNVTVGPDGALYVSDWY  340 (367)
T ss_pred             eeeccCCEEEEEEeecCCCccEeecCceEecCCCcccccceeECCCCCEEEEEec
Confidence            467788999999997332   23334555532 2 67888999999999998644


No 36 
>cd08035 LARP_4 La RNA-binding domain of La-related protein 4. This domain is found in vertebrate La-related protein 4 (LARP4), also known as c-MPL binding protein. La-type domains often co-occur with RNA-recognition motifs (RRMs). A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=44.47  E-value=8.1  Score=26.04  Aligned_cols=14  Identities=21%  Similarity=0.389  Sum_probs=10.5

Q ss_pred             EEEEEEeCCCeEEE
Q psy4773         108 YTLLVISSQGEIVD  121 (160)
Q Consensus       108 ~~vv~vd~~G~il~  121 (160)
                      +..+++|++|+-|+
T Consensus        61 S~~levsedg~kVR   74 (75)
T cd08035          61 SPMVQVDETGEKVR   74 (75)
T ss_pred             CCeEEEcCCCCccC
Confidence            34689999998653


No 37 
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=43.05  E-value=58  Score=30.26  Aligned_cols=48  Identities=15%  Similarity=0.127  Sum_probs=26.4

Q ss_pred             EEEEeCCCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCCeEEEEe
Q psy4773         110 LLVISSQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISKHLARVP  157 (160)
Q Consensus       110 vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~~i~~~~  157 (160)
                      |...++.|.-...+-++-..-.-...+....|++|+||.+.=.+.|.|
T Consensus       227 v~~~e~~G~~~sn~~~~lp~~~I~ll~qD~qG~lWiGTenGl~r~~l~  274 (671)
T COG3292         227 VYLQEAEGWRASNWGPMLPSGNILLLVQDAQGELWIGTENGLWRTRLP  274 (671)
T ss_pred             eEEEchhhccccccCCCCcchheeeeecccCCCEEEeecccceeEecC
Confidence            445556664444443322111112233344699999998876666665


No 38 
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=41.64  E-value=87  Score=29.19  Aligned_cols=33  Identities=30%  Similarity=0.497  Sum_probs=23.2

Q ss_pred             CCCeEEEEEEcCCCCCcCceEEEEE-CCEEEEecCC
Q psy4773         115 SQGEIVDALHSVDGSLKGSSDVEEY-NGAYYFGSPI  149 (160)
Q Consensus       115 ~~G~il~~l~d~~g~~~~iS~v~e~-~g~LylGS~~  149 (160)
                      ..|.++.+.+--.+  ..||.+.+. +|.||+||..
T Consensus       362 stG~~v~sv~q~Rg--~nit~~~~d~~g~lWlgs~q  395 (671)
T COG3292         362 STGELVRSVHQLRG--MNITTTLEDSRGRLWLGSMQ  395 (671)
T ss_pred             CCCcEEEEeeeccc--cccchhhhccCCcEEEEecc
Confidence            56888888765444  345555543 7999999976


No 39 
>cd08036 LARP_5 La RNA-binding domain of La-related protein 5. This domain is found in vertebrate La-related protein 5 (LARP5). A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=40.86  E-value=11  Score=25.51  Aligned_cols=14  Identities=29%  Similarity=0.498  Sum_probs=10.7

Q ss_pred             EEEEEEeCCCeEEE
Q psy4773         108 YTLLVISSQGEIVD  121 (160)
Q Consensus       108 ~~vv~vd~~G~il~  121 (160)
                      +..+++|++|+-++
T Consensus        61 S~~vevse~g~kVR   74 (75)
T cd08036          61 LPLVQVDEKGEKVR   74 (75)
T ss_pred             CCeEEECCCCCccC
Confidence            44789999998653


No 40 
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=40.04  E-value=84  Score=27.71  Aligned_cols=44  Identities=18%  Similarity=0.187  Sum_probs=28.5

Q ss_pred             EEEEEEec-CC-CCCcceEecccCCCCC----CceeecCCCCeEEEeecC
Q psy4773          10 SFWRYYLK-GP-KQGKSEVFIDGLPGLP----DNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus        10 Ri~Ry~l~-G~-k~G~~evfid~LPG~P----DNI~~~~~G~fWval~~~   53 (160)
                      |+-+.-+. |. |..+..+++++||+.+    -.|..++||.++|++-..
T Consensus       148 ~~~~~~~~~g~~~l~~~~~i~~~lP~~~~H~g~~l~f~pDG~Lyvs~G~~  197 (399)
T COG2133         148 RVAIGRLPGGDTKLSEPKVIFRGIPKGGHHFGGRLVFGPDGKLYVTTGSN  197 (399)
T ss_pred             EEEEEEcCCCccccccccEEeecCCCCCCcCcccEEECCCCcEEEEeCCC
Confidence            44444443 42 4444455555699754    678899999999998775


No 41 
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.63  E-value=68  Score=27.34  Aligned_cols=51  Identities=10%  Similarity=0.216  Sum_probs=34.8

Q ss_pred             eeecCCeEEEEEEeCCCeEEEEE-----EcCCC-CC---cCceEEEEECCEEEEecCCCC
Q psy4773         101 LFLQGTRYTLLVISSQGEIVDAL-----HSVDG-SL---KGSSDVEEYNGAYYFGSPISK  151 (160)
Q Consensus       101 ~~~~~~~~~vv~vd~~G~il~~l-----~d~~g-~~---~~iS~v~e~~g~LylGS~~~~  151 (160)
                      -....+.+.+++++.+|++++.+     +||++ .+   ...--+.|.+..||+-++...
T Consensus       101 Fav~n~p~~iVElt~~GdlirtiPL~g~~DpE~Ieyig~n~fvi~dER~~~l~~~~vd~~  160 (316)
T COG3204         101 FAVTNKPAAIVELTKEGDLIRTIPLTGFSDPETIEYIGGNQFVIVDERDRALYLFTVDAD  160 (316)
T ss_pred             EEecCCCceEEEEecCCceEEEecccccCChhHeEEecCCEEEEEehhcceEEEEEEcCC
Confidence            34556778999999999999887     45543 11   222334467788888887765


No 42 
>PF09154 DUF1939:  Domain of unknown function (DUF1939);  InterPro: IPR015237 This entry represents a C-terminal domain associated with prokaryotic alpha-amylases. It adopts a secondary structure consisting of an eight-stranded antiparallel beta-sheet containing a Greek key motif. Its exact function has not, as yet, been determined []. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 1MXD_A 1MWO_A 1MXG_A 1W9X_A 2DIE_A 1VJS_A 1BPL_B 1BLI_A 1OB0_A 1E3Z_A ....
Probab=36.98  E-value=69  Score=20.18  Aligned_cols=38  Identities=21%  Similarity=0.274  Sum_probs=25.8

Q ss_pred             EEEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecC
Q psy4773          12 WRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus        12 ~Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~   53 (160)
                      .+.|+....+|  ++|.|-+--..+.+..|++|  |+-+..+
T Consensus        12 k~~~Vgt~~ag--~~~~D~tGn~~~~vtid~dG--~~~f~v~   49 (57)
T PF09154_consen   12 KRMWVGTNWAG--KTFYDYTGNSSETVTIDEDG--WGEFPVP   49 (57)
T ss_dssp             EEEEEEGGGTT--EEEEETTSSSSSEEEE-TTS--EEEEEE-
T ss_pred             EEEEEccccCC--CEEEEccCCCCCeEEECCCe--EEEEEEC
Confidence            35566555565  59999555578899999999  7766554


No 43 
>PF00058 Ldl_recept_b:  Low-density lipoprotein receptor repeat class B;  InterPro: IPR000033  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=35.47  E-value=53  Score=18.99  Aligned_cols=36  Identities=25%  Similarity=0.270  Sum_probs=26.0

Q ss_pred             cccCCCc-EEEEEEecCCCCCcceEecccCCCCCCceeec
Q psy4773           3 QTDYCNF-SFWRYYLKGPKQGKSEVFIDGLPGLPDNVKRD   41 (160)
Q Consensus         3 ~~Et~~~-Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~~~   41 (160)
                      ++|.... +|.+-.+.|..   .++++.+--.-|.+|..|
T Consensus         5 WtD~~~~~~I~~a~~dGs~---~~~vi~~~l~~P~giaVD   41 (42)
T PF00058_consen    5 WTDWSQDPSIERANLDGSN---RRTVISDDLQHPEGIAVD   41 (42)
T ss_dssp             EEETTTTEEEEEEETTSTS---EEEEEESSTSSEEEEEEE
T ss_pred             EEECCCCcEEEEEECCCCC---eEEEEECCCCCcCEEEEC
Confidence            5677888 99999999853   455555334478888776


No 44 
>COG4294 Uve UV damage repair endonuclease [DNA replication, recombination, and repair]
Probab=32.68  E-value=10  Score=32.28  Aligned_cols=36  Identities=42%  Similarity=0.750  Sum_probs=26.1

Q ss_pred             CCcccCCCcEEEEEEe---cCCCCCcceEecccCCCCCCcee
Q psy4773           1 MGQTDYCNFSFWRYYL---KGPKQGKSEVFIDGLPGLPDNVK   39 (160)
Q Consensus         1 ~~~~Et~~~Ri~Ry~l---~G~k~G~~evfid~LPG~PDNI~   39 (160)
                      ||++|  + -++-..+   .|-|.++.|-|+.|.+-+||+|+
T Consensus       167 mgl~~--R-s~~~lhlgg~~gGK~~~~eqf~kni~~LP~~vk  205 (347)
T COG4294         167 MGLAE--R-SVWNLHLGGTHGGKKERLEQFIKNIQRLPDSVK  205 (347)
T ss_pred             cCCCc--C-CceEEEeccccCCchhHHHHHHHHHhhcCHHHH
Confidence            67777  2 2333444   45567778999999999999996


No 45 
>cd08032 LARP_7 La RNA-binding domain of La-related protein 7. LARP7 is a component of the 7SK snRNP, a key factor in the regulation of RNA polymerase II transcription. 7SK functionality is dependent on the presence of LARP7, which is thought to stabilize the 7SK RNA by interacting with its 3' end. The release of 7SK RNA from P-TEFb/HEXIM/7SK complexes activates the cyclin-dependent kinase P-TEFb, which in turn phosphorylates the C-terminal domain of RNA pol II and mediates a transition into productive transcription elongation.
Probab=32.57  E-value=20  Score=24.47  Aligned_cols=14  Identities=7%  Similarity=0.275  Sum_probs=10.6

Q ss_pred             EEEEEEeCCCeEEE
Q psy4773         108 YTLLVISSQGEIVD  121 (160)
Q Consensus       108 ~~vv~vd~~G~il~  121 (160)
                      +-++++|++|+-|+
T Consensus        68 S~~lev~ed~~~VR   81 (82)
T cd08032          68 SSVVELNLEGTRIR   81 (82)
T ss_pred             CCEEEEcCCCCccC
Confidence            45789999998653


No 46 
>PF13570 PQQ_3:  PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=32.37  E-value=95  Score=17.26  Aligned_cols=20  Identities=20%  Similarity=0.147  Sum_probs=12.3

Q ss_pred             ceEEEEECCEEEEecCCCCe
Q psy4773         133 SSDVEEYNGAYYFGSPISKH  152 (160)
Q Consensus       133 iS~v~e~~g~LylGS~~~~~  152 (160)
                      .++....+|+||+|+.....
T Consensus        14 ~~~~~v~~g~vyv~~~dg~l   33 (40)
T PF13570_consen   14 WSSPAVAGGRVYVGTGDGNL   33 (40)
T ss_dssp             -S--EECTSEEEEE-TTSEE
T ss_pred             CcCCEEECCEEEEEcCCCEE
Confidence            35557779999999985543


No 47 
>PF06739 SBBP:  Beta-propeller repeat;  InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=32.27  E-value=42  Score=19.17  Aligned_cols=20  Identities=25%  Similarity=0.315  Sum_probs=16.6

Q ss_pred             CCCceeecCCCCeEEEeecC
Q psy4773          34 LPDNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus        34 ~PDNI~~~~~G~fWval~~~   53 (160)
                      .|-+|..|++|+++|+-.+-
T Consensus        14 ~~~~IavD~~GNiYv~G~T~   33 (38)
T PF06739_consen   14 YGNGIAVDSNGNIYVTGYTN   33 (38)
T ss_pred             eEEEEEECCCCCEEEEEeec
Confidence            46789999999999987663


No 48 
>cd06481 ACD_HspB9_like Alpha crystallin domain (ACD) found in mammalian small heat shock protein (sHsp) HspB9 and similar proteins. sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Human (h) HspB9 is expressed exclusively in the normal testis and in various tumor samples and is a cancer/testis antigen. hHspB9  interacts with TCTEL1 (T-complex testis expressed protein -1), a subunit of dynein. hHspB9 and TCTEL1 are co-expressed in similar cells within the testis and in tumor cells. Included in this group is Xenopus Hsp30, a developmentally-regulated heat-inducible molecular chaperone.
Probab=32.04  E-value=30  Score=23.39  Aligned_cols=28  Identities=18%  Similarity=0.404  Sum_probs=19.7

Q ss_pred             CcceEecccCCCC-CCceeecCCCC-eEEEe
Q psy4773          22 GKSEVFIDGLPGL-PDNVKRDSKGN-FLVSL   50 (160)
Q Consensus        22 G~~evfid~LPG~-PDNI~~~~~G~-fWval   50 (160)
                      ...++-++ |||+ ||+|...-+++ +.|..
T Consensus         7 d~~~v~~d-lpG~~~edI~V~v~~~~L~I~g   36 (87)
T cd06481           7 EGFSLKLD-VRGFSPEDLSVRVDGRKLVVTG   36 (87)
T ss_pred             ceEEEEEE-CCCCChHHeEEEEECCEEEEEE
Confidence            34567776 9997 99999876554 55543


No 49 
>cd08029 LA_like_fungal La-motif domain of fungal proteins similar to the La autoantigen. This domain is found in fungal proteins related to the La autoantigen. A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=31.91  E-value=42  Score=22.46  Aligned_cols=14  Identities=43%  Similarity=0.501  Sum_probs=10.6

Q ss_pred             EEEEEEeCCCeEEE
Q psy4773         108 YTLLVISSQGEIVD  121 (160)
Q Consensus       108 ~~vv~vd~~G~il~  121 (160)
                      +-++++|++|+-|+
T Consensus        62 S~~lev~~d~~~VR   75 (76)
T cd08029          62 SELLEVSEDGENVR   75 (76)
T ss_pred             CCeEEEeCCCCccc
Confidence            34789999998664


No 50 
>PF11208 DUF2992:  Protein of unknown function (DUF2992);  InterPro: IPR016787 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=31.80  E-value=27  Score=25.97  Aligned_cols=12  Identities=25%  Similarity=0.343  Sum_probs=9.9

Q ss_pred             CCCeEEEeecCC
Q psy4773          43 KGNFLVSLVCPV   54 (160)
Q Consensus        43 ~G~fWval~~~r   54 (160)
                      ||.||||++.-.
T Consensus         6 dg~FWvGv~E~~   17 (132)
T PF11208_consen    6 DGPFWVGVFERH   17 (132)
T ss_pred             cCCcEEEEEEEE
Confidence            688999998853


No 51 
>PF06236 MelC1:  Tyrosinase co-factor MelC1;  InterPro: IPR010928 This family consists of several tyrosinase co-factor MELC1 proteins from a number of Streptomyces species. The melanin operon (melC) of Streptomyces antibioticus contains two genes, melC1 and melC2 (apotyrosinase). It is thought that MelC1 forms a transient binary complex with the downstream apotyrosinase MelC2 to facilitate the incorporation of copper ion and the secretion of tyrosinase indicating that MelC1 is a chaperone for the apotyrosinase MelC2 [].; GO: 0005507 copper ion binding, 0042438 melanin biosynthetic process; PDB: 1WX4_B 2ZWD_B 2ZMZ_B 1WX2_B 1WX5_D 3AWX_B 3AWS_B 2ZMY_B 3AX0_B 2ZWG_B ....
Probab=31.43  E-value=43  Score=24.80  Aligned_cols=47  Identities=26%  Similarity=0.193  Sum_probs=29.4

Q ss_pred             CcccCCCcEEEEEEecCC-----CCCcceEecccCCCCCCceeecCCCCeEEEeec
Q psy4773           2 GQTDYCNFSFWRYYLKGP-----KQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVC   52 (160)
Q Consensus         2 ~~~Et~~~Ri~Ry~l~G~-----k~G~~evfid~LPG~PDNI~~~~~G~fWval~~   52 (160)
                      .|.|+.+.|-.+-+....     ..+..+|+||   |-|=-+.+..|| =|+..+.
T Consensus        44 ~FdE~YrGRrI~g~~~~~~~~~~~~~~~~V~ID---Gr~LhvMr~ADG-swlS~V~   95 (125)
T PF06236_consen   44 SFDEVYRGRRIQGTPAAAGGHAAHGGGYEVTID---GRPLHVMRRADG-SWLSVVN   95 (125)
T ss_dssp             -EEEEETTEEEEEEE-----------SEEEEET---TEEE-EEE-TTS--EEETTE
T ss_pred             ccceeecceeEEeeeCCCCccccCCCceEEEEC---CeEeeeEEcCCC-CEEeeee
Confidence            488999999888777532     2345689999   666678888888 5776644


No 52 
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=31.12  E-value=2.9e+02  Score=23.80  Aligned_cols=30  Identities=17%  Similarity=0.122  Sum_probs=24.7

Q ss_pred             CcCceEEEEECCEEEEecCCCCeEEEEeCC
Q psy4773         130 LKGSSDVEEYNGAYYFGSPISKHLARVPLA  159 (160)
Q Consensus       130 ~~~iS~v~e~~g~LylGS~~~~~i~~~~l~  159 (160)
                      +.+.-+-..++|+||+-......+.+++++
T Consensus       202 LsmPhSPRWhdgrLwvldsgtGev~~vD~~  231 (335)
T TIGR03032       202 LSMPHSPRWYQGKLWLLNSGRGELGYVDPQ  231 (335)
T ss_pred             ccCCcCCcEeCCeEEEEECCCCEEEEEcCC
Confidence            455566778999999999999999988864


No 53 
>PF06079 Apyrase:  Apyrase;  InterPro: IPR009283 This family consists of several eukaryotic apyrase (or adenosine diphosphatase) proteins (3.6.1.5 from EC), and related nucleoside diphosphatases (3.6.1.6 from EC). The salivary apyrases of blood-feeding arthropods are nucleotide hydrolysing enzymes implicated in the inhibition of host platelet aggregation through the hydrolysis of extracellular adenosine diphosphate [].; GO: 0005509 calcium ion binding, 0016462 pyrophosphatase activity; PDB: 2H2N_A 1S18_A 2H2U_A 1S1D_B.
Probab=30.22  E-value=2.6e+02  Score=23.69  Aligned_cols=53  Identities=15%  Similarity=0.122  Sum_probs=29.0

Q ss_pred             eeeecCCeEEEEEEeCCCeEEEE--EEcCCCCC--cCceE-EEEECCEEEEecCCCCeE
Q psy4773         100 ILFLQGTRYTLLVISSQGEIVDA--LHSVDGSL--KGSSD-VEEYNGAYYFGSPISKHL  153 (160)
Q Consensus       100 ~~~~~~~~~~vv~vd~~G~il~~--l~d~~g~~--~~iS~-v~e~~g~LylGS~~~~~i  153 (160)
                      ++.+--+.|+|.++..+ +.+--  |.|-+|..  .+=++ +...++.||+||....+.
T Consensus        66 Lys~DDrTGiVyeI~~~-~~vPwviL~dGdG~~~kGfK~EWaTVKd~~LyvGs~Gkewt  123 (291)
T PF06079_consen   66 LYSFDDRTGIVYEIKGD-KAVPWVILSDGDGNTSKGFKAEWATVKDDKLYVGSIGKEWT  123 (291)
T ss_dssp             EEEEETTT-EEEEEETT-EEEEEEE-BSTTTTESSB----EEEEETTEEEEE--SS-EE
T ss_pred             EeeeeCCCceEEEEeCC-ceeceEEEeCCCCCccccccceeeEEeCCeeeeccCCCceE
Confidence            34456677999999865 66554  44444431  11122 557799999999886554


No 54 
>PTZ00486 apyrase Superfamily; Provisional
Probab=29.27  E-value=3.4e+02  Score=23.62  Aligned_cols=53  Identities=13%  Similarity=0.131  Sum_probs=32.6

Q ss_pred             eeeecCCeEEEEEEeCCCe-EE--EEEEcCCCCC--cCceE-EEEECCEEEEecCCCCe
Q psy4773         100 ILFLQGTRYTLLVISSQGE-IV--DALHSVDGSL--KGSSD-VEEYNGAYYFGSPISKH  152 (160)
Q Consensus       100 ~~~~~~~~~~vv~vd~~G~-il--~~l~d~~g~~--~~iS~-v~e~~g~LylGS~~~~~  152 (160)
                      +...--+.|.|.+++.+++ .+  ..|.|-+|..  .+=.+ +...+++||+||....+
T Consensus       127 Lys~DDrTGiVy~i~~~~~~~~PwvIL~dGdG~~~kGfK~EWaTVKd~~LyVGs~Gkew  185 (352)
T PTZ00486        127 LYGFDDRTGIVYEIDIDKKKAYPRHILSDGNGNSDKGMKIEWATVYDDKLYVGSIGKEF  185 (352)
T ss_pred             EEEEeCCceEEEEEEcCCCcEeeEEEEecCCCCCCCCcceeeEEEECCEEEEeccccee
Confidence            3455667799999985554 33  3445544421  22223 44589999999998443


No 55 
>PF13449 Phytase-like:  Esterase-like activity of phytase
Probab=29.22  E-value=92  Score=26.03  Aligned_cols=18  Identities=17%  Similarity=0.289  Sum_probs=14.4

Q ss_pred             CceeecCCCCeEEEeecC
Q psy4773          36 DNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus        36 DNI~~~~~G~fWval~~~   53 (160)
                      -.|...++|.|||+.-..
T Consensus        88 Egi~~~~~g~~~is~E~~  105 (326)
T PF13449_consen   88 EGIAVPPDGSFWISSEGG  105 (326)
T ss_pred             hHeEEecCCCEEEEeCCc
Confidence            378888899999988654


No 56 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=28.94  E-value=3.9e+02  Score=23.21  Aligned_cols=52  Identities=17%  Similarity=0.116  Sum_probs=33.9

Q ss_pred             EEEEEEeCC-CeEEEEEEcCC-CCCcCceEEEEECCEEEEecCCCCeEEEEeCC
Q psy4773         108 YTLLVISSQ-GEIVDALHSVD-GSLKGSSDVEEYNGAYYFGSPISKHLARVPLA  159 (160)
Q Consensus       108 ~~vv~vd~~-G~il~~l~d~~-g~~~~iS~v~e~~g~LylGS~~~~~i~~~~l~  159 (160)
                      ..++++|.+ |+.-..=+... |.-+-=.++.+.+.+||..+.....|.++|+.
T Consensus        66 vaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~~g~~vf~AnY~~g~v~v~p~~  119 (346)
T COG2706          66 VAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHSGSVSVYPLQ  119 (346)
T ss_pred             EEEEEEcCCCCeEEEeeccccCCCCCeEEEECCCCCEEEEEEccCceEEEEEcc
Confidence            456788865 77544333333 33222234556677999999999999999974


No 57 
>PHA02598 denA endonuclease II; Provisional
Probab=28.65  E-value=71  Score=24.02  Aligned_cols=39  Identities=13%  Similarity=0.143  Sum_probs=26.7

Q ss_pred             EEEEeCCCeEEEEEEcCCCCCcCceEEEEECCEEEEecC
Q psy4773         110 LLVISSQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSP  148 (160)
Q Consensus       110 vv~vd~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~  148 (160)
                      =+.+|.+|.|.+++-........|=.....|..||+|-.
T Consensus        13 ~l~l~~~~~i~~~f~~~~~~~n~VY~~~~~~~viYVGKA   51 (138)
T PHA02598         13 ELELDKNGRIDRSFIKCPNKKNVIYAIAVDDELVYIGKT   51 (138)
T ss_pred             EEEecCCCcCcccccCCcccceEEEEEEeCCeEEEEeeh
Confidence            367899999999987543333344444456788999864


No 58 
>PF13600 DUF4140:  N-terminal domain of unknown function (DUF4140)
Probab=27.91  E-value=1.1e+02  Score=20.93  Aligned_cols=26  Identities=31%  Similarity=0.727  Sum_probs=21.5

Q ss_pred             CCCcceEecccCCC--CCCceeecCCCC
Q psy4773          20 KQGKSEVFIDGLPG--LPDNVKRDSKGN   45 (160)
Q Consensus        20 k~G~~evfid~LPG--~PDNI~~~~~G~   45 (160)
                      ++|..++.+.+||.  -||-|+....|+
T Consensus        20 ~~G~~~i~~~~Lp~~~d~~Sl~V~~~g~   47 (104)
T PF13600_consen   20 PAGENEIIFEGLPPSLDPDSLRVSGEGG   47 (104)
T ss_pred             CCCceEEEEeCCCcccCCCcEEEEecCC
Confidence            46899999999996  677888888776


No 59 
>PF03022 MRJP:  Major royal jelly protein;  InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=27.67  E-value=3.5e+02  Score=22.29  Aligned_cols=52  Identities=13%  Similarity=0.011  Sum_probs=37.8

Q ss_pred             EEEEEEe-CCCeEEEEEEcCCCC---CcCceEEEEEC-------CEEEEecCCCCeEEEEeCC
Q psy4773         108 YTLLVIS-SQGEIVDALHSVDGS---LKGSSDVEEYN-------GAYYFGSPISKHLARVPLA  159 (160)
Q Consensus       108 ~~vv~vd-~~G~il~~l~d~~g~---~~~iS~v~e~~-------g~LylGS~~~~~i~~~~l~  159 (160)
                      ..++.+| .++++++.++-|...   -+.+...+...       ++.|+.-...+.|-++++.
T Consensus        34 pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~   96 (287)
T PF03022_consen   34 PKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLA   96 (287)
T ss_dssp             -EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETT
T ss_pred             cEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeCCCcCcEEEEEcc
Confidence            5678899 578999999987643   35666666544       6999999999988888775


No 60 
>KOG0639|consensus
Probab=26.61  E-value=72  Score=29.33  Aligned_cols=44  Identities=9%  Similarity=0.110  Sum_probs=32.2

Q ss_pred             cccCCCcEEEEEEecCCCCCcceEecccCCCCCCcee---ecCCC-CeEEEeec
Q psy4773           3 QTDYCNFSFWRYYLKGPKQGKSEVFIDGLPGLPDNVK---RDSKG-NFLVSLVC   52 (160)
Q Consensus         3 ~~Et~~~Ri~Ry~l~G~k~G~~evfid~LPG~PDNI~---~~~~G-~fWval~~   52 (160)
                      |+...-.-|.-++|..      ++.+..++|++|+.+   ...+| ++|-|-..
T Consensus       525 FsccsdGnI~vwDLhn------q~~VrqfqGhtDGascIdis~dGtklWTGGlD  572 (705)
T KOG0639|consen  525 FSCCSDGNIAVWDLHN------QTLVRQFQGHTDGASCIDISKDGTKLWTGGLD  572 (705)
T ss_pred             eeeccCCcEEEEEccc------ceeeecccCCCCCceeEEecCCCceeecCCCc
Confidence            5555666677777774      577888999999976   45667 49987644


No 61 
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=26.48  E-value=4.7e+02  Score=23.42  Aligned_cols=29  Identities=17%  Similarity=0.193  Sum_probs=23.3

Q ss_pred             CcceEecccCCCCCCceeecCCCCeEEEee
Q psy4773          22 GKSEVFIDGLPGLPDNVKRDSKGNFLVSLV   51 (160)
Q Consensus        22 G~~evfid~LPG~PDNI~~~~~G~fWval~   51 (160)
                      =+.++++++|. .|=.|...++|++||+--
T Consensus        20 f~~~~va~GL~-~Pw~maflPDG~llVtER   48 (454)
T TIGR03606        20 FDKKVLLSGLN-KPWALLWGPDNQLWVTER   48 (454)
T ss_pred             cEEEEEECCCC-CceEEEEcCCCeEEEEEe
Confidence            35788888887 588898888898888764


No 62 
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=25.44  E-value=2.5e+02  Score=23.73  Aligned_cols=43  Identities=12%  Similarity=-0.029  Sum_probs=28.3

Q ss_pred             eEEEEEEe-CCCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCC
Q psy4773         107 RYTLLVIS-SQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISK  151 (160)
Q Consensus       107 ~~~vv~vd-~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~  151 (160)
                      .+.+..+| .+|+++...+-..+.  ..++-...+++||+++-...
T Consensus       344 ~G~l~~ld~~tG~~~~~~~~~~~~--~~s~P~~~~~~l~v~t~~G~  387 (394)
T PRK11138        344 EGYLHWINREDGRFVAQQKVDSSG--FLSEPVVADDKLLIQARDGT  387 (394)
T ss_pred             CCEEEEEECCCCCEEEEEEcCCCc--ceeCCEEECCEEEEEeCCce
Confidence            35566777 478888877643221  23444567999999987654


No 63 
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=24.37  E-value=1.3e+02  Score=16.29  Aligned_cols=19  Identities=11%  Similarity=0.128  Sum_probs=11.0

Q ss_pred             CCEEEEecCCCCeEEEEeC
Q psy4773         140 NGAYYFGSPISKHLARVPL  158 (160)
Q Consensus       140 ~g~LylGS~~~~~i~~~~l  158 (160)
                      +++||+.....+.|..+++
T Consensus         3 ~~~lyv~~~~~~~v~~id~   21 (42)
T TIGR02276         3 GTKLYVTNSGSNTVSVIDT   21 (42)
T ss_pred             CCEEEEEeCCCCEEEEEEC
Confidence            3456666666666665553


No 64 
>PF14583 Pectate_lyase22:  Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=24.31  E-value=2.5e+02  Score=24.66  Aligned_cols=83  Identities=17%  Similarity=0.205  Sum_probs=41.9

Q ss_pred             cEEEEEEecCCCCCcceEecccCC-CCC--CceeecCCCCeEEEeecCCCCcChhhhhhcCCChHHHHHHHHhhhhhccC
Q psy4773           9 FSFWRYYLKGPKQGKSEVFIDGLP-GLP--DNVKRDSKGNFLVSLVCPVDEYTPQLLHIIGPFPNIRKFVARFLHLLEKI   85 (160)
Q Consensus         9 ~Ri~Ry~l~G~k~G~~evfid~LP-G~P--DNI~~~~~G~fWval~~~r~~~~~~~~~~l~~~P~lRk~i~~l~~~~~~~   85 (160)
                      .++.++.|++-   +.++..+ .| ++-  .+...+++++..+++-..+.     ...-+.++..+++++..        
T Consensus       102 ~~l~~vdL~T~---e~~~vy~-~p~~~~g~gt~v~n~d~t~~~g~e~~~~-----d~~~l~~~~~f~e~~~a--------  164 (386)
T PF14583_consen  102 RSLRRVDLDTL---EERVVYE-VPDDWKGYGTWVANSDCTKLVGIEISRE-----DWKPLTKWKGFREFYEA--------  164 (386)
T ss_dssp             TEEEEEETTT-----EEEEEE---TTEEEEEEEEE-TTSSEEEEEEEEGG-----G-----SHHHHHHHHHC--------
T ss_pred             CeEEEEECCcC---cEEEEEE-CCcccccccceeeCCCccEEEEEEEeeh-----hccCccccHHHHHHHhh--------
Confidence            47888888863   3444444 44 222  23446888999888866432     22223456667766543        


Q ss_pred             CccccceeccceeeeeeecCCeEEEEEEe-CCCeEEEEEEcCC
Q psy4773          86 PSEDVKHVVGHFESILFLQGTRYTLLVIS-SQGEIVDALHSVD  127 (160)
Q Consensus        86 ~~~~~~~~~~~~~~~~~~~~~~~~vv~vd-~~G~il~~l~d~~  127 (160)
                                         .++..++.+| ..|++-..+++..
T Consensus       165 -------------------~p~~~i~~idl~tG~~~~v~~~~~  188 (386)
T PF14583_consen  165 -------------------RPHCRIFTIDLKTGERKVVFEDTD  188 (386)
T ss_dssp             ----------------------EEEEEEETTT--EEEEEEESS
T ss_pred             -------------------CCCceEEEEECCCCceeEEEecCc
Confidence                               2346778888 4566655555544


No 65 
>PF03088 Str_synth:  Strictosidine synthase;  InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=23.95  E-value=2.5e+02  Score=19.26  Aligned_cols=51  Identities=14%  Similarity=-0.053  Sum_probs=33.4

Q ss_pred             CCeEEEEEEeCCCeEEEEEEcCCCCCcCceEEEEE--CCEEEEecCCCCeEEEEeC
Q psy4773         105 GTRYTLLVISSQGEIVDALHSVDGSLKGSSDVEEY--NGAYYFGSPISKHLARVPL  158 (160)
Q Consensus       105 ~~~~~vv~vd~~G~il~~l~d~~g~~~~iS~v~e~--~g~LylGS~~~~~i~~~~l  158 (160)
                      ...|.+++.|+..+.+..|-+.   +.....|...  +..|.+.--...-|-|+=|
T Consensus        34 ~~~GRll~ydp~t~~~~vl~~~---L~fpNGVals~d~~~vlv~Et~~~Ri~rywl   86 (89)
T PF03088_consen   34 RPTGRLLRYDPSTKETTVLLDG---LYFPNGVALSPDESFVLVAETGRYRILRYWL   86 (89)
T ss_dssp             ---EEEEEEETTTTEEEEEEEE---ESSEEEEEE-TTSSEEEEEEGGGTEEEEEES
T ss_pred             CCCcCEEEEECCCCeEEEehhC---CCccCeEEEcCCCCEEEEEeccCceEEEEEE
Confidence            3459999999988877777642   3456666665  3568887777766666544


No 66 
>PF01731 Arylesterase:  Arylesterase;  InterPro: IPR002640  The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity [].   Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity.   Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL.   Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo [].  This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=23.65  E-value=1.1e+02  Score=20.89  Aligned_cols=30  Identities=10%  Similarity=0.232  Sum_probs=23.2

Q ss_pred             CcceEecccCCCCCCceeecCCCC-eEEEeec
Q psy4773          22 GKSEVFIDGLPGLPDNVKRDSKGN-FLVSLVC   52 (160)
Q Consensus        22 G~~evfid~LPG~PDNI~~~~~G~-fWval~~   52 (160)
                      ++..+.+++++ +|.+|..++++. ++||-..
T Consensus        44 ~~~~~va~g~~-~aNGI~~s~~~k~lyVa~~~   74 (86)
T PF01731_consen   44 KEVKVVASGFS-FANGIAISPDKKYLYVASSL   74 (86)
T ss_pred             CEeEEeeccCC-CCceEEEcCCCCEEEEEecc
Confidence            34778888887 799999998875 6776644


No 67 
>PF06089 Asparaginase_II:  L-asparaginase II;  InterPro: IPR010349 This family consists of several bacterial L-asparaginase II proteins. L-asparaginase (3.5.1.1 from EC) catalyses the hydrolysis of L-asparagine to L-aspartate and ammonium. Rhizobium etli possesses two asparaginases: asparaginase I, which is thermostable and constitutive, and asparaginase II, which is thermolabile, induced by asparagine and repressed by the carbon source [].
Probab=23.27  E-value=86  Score=26.86  Aligned_cols=24  Identities=17%  Similarity=0.264  Sum_probs=21.1

Q ss_pred             CCeEEEEEEeCCCeEEEEEEcCCC
Q psy4773         105 GTRYTLLVISSQGEIVDALHSVDG  128 (160)
Q Consensus       105 ~~~~~vv~vd~~G~il~~l~d~~g  128 (160)
                      ..++.++.+|.+|+++.++-|++.
T Consensus        13 ~H~G~~vVvd~~G~v~~~~Gd~~~   36 (324)
T PF06089_consen   13 VHRGHAVVVDADGRVLASAGDPDR   36 (324)
T ss_pred             eEEEEEEEECCCCCEEEEecCCCC
Confidence            356889999999999999999984


No 68 
>cd07323 LAM LA motif RNA-binding domain. This domain is found at the N-terminus of La RNA-binding proteins as well as in other related proteins. Typically, the domain co-occurs with an RNA-recognition motif (RRM), and together these domains function to bind primary transcripts of RNA polymerase III in the La autoantigen (Lupus La protein, LARP3, or Sjoegren syndrome type B antigen, SS-B). A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=22.28  E-value=41  Score=22.34  Aligned_cols=15  Identities=13%  Similarity=0.534  Sum_probs=10.1

Q ss_pred             hhcCCChHHHHHHHH
Q psy4773          63 HIIGPFPNIRKFVAR   77 (160)
Q Consensus        63 ~~l~~~P~lRk~i~~   77 (160)
                      +.+.+++.+|++...
T Consensus        36 ~~i~~F~r~k~l~~~   50 (75)
T cd07323          36 SLLASFNRVKKLTTD   50 (75)
T ss_pred             HHHhCchHHHHHcCC
Confidence            456677888877643


No 69 
>PF07463 NUMOD4:  NUMOD4 motif;  InterPro: IPR010902 NUMOD4 is a putative DNA-binding motif found in homing endonucleases and related proteins [].; GO: 0016788 hydrolase activity, acting on ester bonds; PDB: 1U3E_M.
Probab=22.26  E-value=77  Score=19.20  Aligned_cols=23  Identities=17%  Similarity=0.380  Sum_probs=14.9

Q ss_pred             eEecccCCCCCCceeecCCCCeEE
Q psy4773          25 EVFIDGLPGLPDNVKRDSKGNFLV   48 (160)
Q Consensus        25 evfid~LPG~PDNI~~~~~G~fWv   48 (160)
                      |++.+ .||||.--..+..|++.-
T Consensus         1 E~Wk~-I~g~~~~Y~VSn~GrVrs   23 (51)
T PF07463_consen    1 EIWKP-IPGYEGKYEVSNLGRVRS   23 (51)
T ss_dssp             --EEE--TTSTTTEEEETTS-EEE
T ss_pred             CcceE-cCCCCCcEEEcCCceEEE
Confidence            56666 889998788888887654


No 70 
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=22.20  E-value=2.5e+02  Score=23.36  Aligned_cols=42  Identities=14%  Similarity=0.162  Sum_probs=29.1

Q ss_pred             CeEEEEEEe-CCCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCC
Q psy4773         106 TRYTLLVIS-SQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPIS  150 (160)
Q Consensus       106 ~~~~vv~vd-~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~  150 (160)
                      ..+.+..+| .+|+++=..+.+..   ..+.....++.+|+|+...
T Consensus        73 ~~g~v~a~d~~tG~~~W~~~~~~~---~~~~p~v~~~~v~v~~~~g  115 (377)
T TIGR03300        73 ADGTVVALDAETGKRLWRVDLDER---LSGGVGADGGLVFVGTEKG  115 (377)
T ss_pred             CCCeEEEEEccCCcEeeeecCCCC---cccceEEcCCEEEEEcCCC
Confidence            346788999 68999877765442   1234455689999998654


No 71 
>PF05411 Peptidase_C32:  Equine arteritis virus putative proteinase;  InterPro: IPR008742 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases corresponds to MEROPS peptidase family C32 (clan CA). The type example is equine arteritis virus-type cysteine proteinase (porcine reproductive and respiratory syndrome virus), which is involved in viral polyprotein processing [].; GO: 0003968 RNA-directed RNA polymerase activity, 0004197 cysteine-type endopeptidase activity, 0004252 serine-type endopeptidase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides; PDB: 3MTV_A.
Probab=21.73  E-value=44  Score=24.74  Aligned_cols=42  Identities=21%  Similarity=0.170  Sum_probs=25.2

Q ss_pred             eEEEEEEeCCCeE-EEEEEcCCCCCcCceEEEE--ECCEEEEecC
Q psy4773         107 RYTLLVISSQGEI-VDALHSVDGSLKGSSDVEE--YNGAYYFGSP  148 (160)
Q Consensus       107 ~~~vv~vd~~G~i-l~~l~d~~g~~~~iS~v~e--~~g~LylGS~  148 (160)
                      +|+-..+|++|++ ++.++.++.-+.+|+-+-|  ..|-..|+++
T Consensus        58 ~GLraV~d~~G~~~v~af~~~~SwirHv~l~~ep~~pgfv~l~r~  102 (127)
T PF05411_consen   58 NGLRAVVDPYGPIHVYAFSCPESWIRHVSLADEPVPPGFVRLGRI  102 (127)
T ss_dssp             TTEEEEE-TT-SEEEEEESSTT-S-EEEEETTS---TTEEEEEEE
T ss_pred             cCeeEEECCCCCEEEEEecCCccceeeeEecCCCCCCCcEEEEEE
Confidence            4677788999995 5677767766677776654  4466666653


No 72 
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=20.96  E-value=3.2e+02  Score=22.67  Aligned_cols=44  Identities=14%  Similarity=-0.000  Sum_probs=29.5

Q ss_pred             CeEEEEEEeC-CCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCC
Q psy4773         106 TRYTLLVISS-QGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISK  151 (160)
Q Consensus       106 ~~~~vv~vd~-~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~  151 (160)
                      ..+.+..+|. +|+++..+.-..+  ...++-...+++||+++....
T Consensus       328 ~~G~l~~~d~~tG~~~~~~~~~~~--~~~~sp~~~~~~l~v~~~dG~  372 (377)
T TIGR03300       328 FEGYLHWLSREDGSFVARLKTDGS--GIASPPVVVGDGLLVQTRDGD  372 (377)
T ss_pred             CCCEEEEEECCCCCEEEEEEcCCC--ccccCCEEECCEEEEEeCCce
Confidence            3466777774 5888887764332  124455677889999998754


No 73 
>PF01502 PRA-CH:  Phosphoribosyl-AMP cyclohydrolase;  InterPro: IPR002496 Phosphoribosyl-AMP cyclohydrolase 3.5.4.19 from EC catalyses the third step in the histidine biosynthetic pathway:  5-phosphoribosyl-AMP + H2O = 5-(5-phospho-D-ribosylaminoformimino)-1-(5-phospho-ribosyl) imidazole-4-carboxamide  It requires Zn2+ ions for activity [].; GO: 0004635 phosphoribosyl-AMP cyclohydrolase activity, 0000105 histidine biosynthetic process; PDB: 1ZPS_B.
Probab=20.83  E-value=2.1e+02  Score=19.18  Aligned_cols=33  Identities=24%  Similarity=0.481  Sum_probs=21.9

Q ss_pred             EEEecCCCCCcceEecccCCCCCCceeecCCCCeEEEeecC
Q psy4773          13 RYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCP   53 (160)
Q Consensus        13 Ry~l~G~k~G~~evfid~LPG~PDNI~~~~~G~fWval~~~   53 (160)
                      +.|.||+..|....+++        |+.|=|+.-.+..+.+
T Consensus        28 ~lW~KGetSG~~q~v~~--------i~~DCD~D~ll~~V~~   60 (75)
T PF01502_consen   28 RLWRKGETSGNTQKVVE--------IRLDCDGDALLFKVEQ   60 (75)
T ss_dssp             EEEETTTTTS--EEEEE--------EEE-TTSSEEEEEEEE
T ss_pred             cEeeEECCCCCEEEEEE--------EEecCCCCeEEEEEEe
Confidence            78999999998888776        6677666655544443


No 74 
>PF04398 DUF538:  Protein of unknown function, DUF538;  InterPro: IPR007493 This family consists of several plant proteins of unknown function.; PDB: 1YDU_A.
Probab=20.71  E-value=86  Score=22.41  Aligned_cols=24  Identities=46%  Similarity=0.726  Sum_probs=13.7

Q ss_pred             cCCC--CCCcee---ecCC-CCeEEEeecC
Q psy4773          30 GLPG--LPDNVK---RDSK-GNFLVSLVCP   53 (160)
Q Consensus        30 ~LPG--~PDNI~---~~~~-G~fWval~~~   53 (160)
                      |||.  +|.+..   .+++ |.|||-+..+
T Consensus        10 glP~GLLP~~v~~y~l~~~tG~f~v~l~~~   39 (110)
T PF04398_consen   10 GLPRGLLPLGVTEYGLNRDTGFFWVKLKSP   39 (110)
T ss_dssp             S-TT-TTTSSS-EEEE-TTT-SEEEE-SS-
T ss_pred             CCCCCcCCCCceEEEEecCCcEEEEEecCC
Confidence            6774  788774   4565 9999999775


No 75 
>PHA02122 hypothetical protein
Probab=20.66  E-value=2.4e+02  Score=18.00  Aligned_cols=33  Identities=9%  Similarity=0.053  Sum_probs=21.6

Q ss_pred             eCCCeEEEEEEcCCCCCcCceEEEEECCEEEEec
Q psy4773         114 SSQGEIVDALHSVDGSLKGSSDVEEYNGAYYFGS  147 (160)
Q Consensus       114 d~~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS  147 (160)
                      |-+|-|+.++.|....+ -+---..++|.|++|.
T Consensus        28 ~~~~iiihs~~~~gd~v-~vn~e~~~ng~l~i~q   60 (65)
T PHA02122         28 GCENIIIHSFKDDGDEV-IVNFELVVNGKLIINQ   60 (65)
T ss_pred             CCCcEEEEeeccCCCEE-EEEEEEEECCEEEEee
Confidence            56899999998754322 1222234789999985


No 76 
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=20.14  E-value=3.9e+02  Score=20.09  Aligned_cols=47  Identities=15%  Similarity=0.231  Sum_probs=33.5

Q ss_pred             EEEEEEeC-CCeEEEEEEcCCCCCcCceEEEEECCEEEEecCCCCeEEEEeC
Q psy4773         108 YTLLVISS-QGEIVDALHSVDGSLKGSSDVEEYNGAYYFGSPISKHLARVPL  158 (160)
Q Consensus       108 ~~vv~vd~-~G~il~~l~d~~g~~~~iS~v~e~~g~LylGS~~~~~i~~~~l  158 (160)
                      +.++.+|. +|+++-....+. .+...  ....++.+|+++..+ .|-.++.
T Consensus        46 ~~l~~~d~~tG~~~W~~~~~~-~~~~~--~~~~~~~v~v~~~~~-~l~~~d~   93 (238)
T PF13360_consen   46 GNLYALDAKTGKVLWRFDLPG-PISGA--PVVDGGRVYVGTSDG-SLYALDA   93 (238)
T ss_dssp             SEEEEEETTTSEEEEEEECSS-CGGSG--EEEETTEEEEEETTS-EEEEEET
T ss_pred             CEEEEEECCCCCEEEEeeccc-cccce--eeeccccccccccee-eeEeccc
Confidence            67889995 999999998743 22222  477889999998444 6666653


Done!