Query         psy16052
Match_columns 476
No_of_seqs    409 out of 2690
Neff          7.8 
Searched_HMMs 46136
Date          Fri Aug 16 19:38:38 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy16052.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16052hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 TIGR03212 uraD_N-term-dom puta 100.0 2.6E-33 5.6E-38  277.2  20.1  224  181-455    66-295 (297)
  2 PRK15394 4-deoxy-4-formamido-L 100.0   3E-27 6.4E-32  233.0  22.6  228  181-462     4-280 (296)
  3 TIGR03006 pepcterm_polyde poly  99.9 6.4E-24 1.4E-28  207.2  19.3  210  193-448    29-264 (265)
  4 TIGR02764 spore_ybaN_pdaB poly  99.9   9E-23 1.9E-27  190.8  19.1  184  178-448     5-191 (191)
  5 TIGR02884 spore_pdaA delta-lac  99.9 5.2E-22 1.1E-26  190.1  20.1  117  178-309    36-155 (224)
  6 TIGR02873 spore_ylxY probable   99.9 7.7E-22 1.7E-26  193.0  18.8  182  179-449    85-268 (268)
  7 PF01522 Polysacc_deac_1:  Poly  99.8 2.2E-21 4.7E-26  167.8   8.9  118  177-310     4-123 (123)
  8 COG0726 CDA1 Predicted xylanas  99.8 2.6E-17 5.7E-22  160.2  19.4  115  179-308    65-181 (267)
  9 PRK14581 hmsF outer membrane N  99.5 1.4E-13 3.1E-18  148.5  13.6  125  178-314   106-275 (672)
 10 PRK14582 pgaB outer membrane N  99.5 2.4E-13 5.2E-18  147.0  13.5  125  178-314   106-275 (671)
 11 PF00057 Ldl_recept_a:  Low-den  99.0 3.9E-10 8.4E-15   75.9   2.6   36  102-137     2-37  (37)
 12 cd00112 LDLa Low Density Lipop  98.9 1.4E-09 2.9E-14   72.5   2.4   35  103-137     1-35  (35)
 13 smart00192 LDLa Low-density li  98.5 5.9E-08 1.3E-12   63.7   2.7   32  103-134     2-33  (33)
 14 PF09960 DUF2194:  Uncharacteri  98.1 0.00016 3.4E-09   78.0  18.4  206  193-452   288-531 (585)
 15 PF10096 DUF2334:  Uncharacteri  98.0 0.00029 6.3E-09   68.4  15.9   99  194-304    18-143 (243)
 16 PF01607 CBM_14:  Chitin bindin  97.9 4.8E-06   1E-10   60.4   1.0   53   26-83      1-53  (53)
 17 COG3233 Predicted deacetylase   97.7 0.00054 1.2E-08   63.9  11.7  123  181-315     6-143 (233)
 18 smart00494 ChtBD2 Chitin-bindi  97.1 0.00044 9.5E-09   50.4   3.3   49   25-78      2-50  (56)
 19 PF11959 DUF3473:  Domain of un  97.0  0.0034 7.3E-08   55.0   8.2   82  348-449    29-131 (133)
 20 PF12999 PRKCSH-like:  Glucosid  96.5  0.0029 6.3E-08   57.6   3.8   49   78-134    55-110 (176)
 21 PF00057 Ldl_recept_a:  Low-den  95.5  0.0062 1.3E-07   40.9   1.2   33   56-92      5-37  (37)
 22 cd00112 LDLa Low Density Lipop  95.5  0.0066 1.4E-07   40.2   1.3   32   57-92      4-35  (35)
 23 KOG1215|consensus               94.9    0.02 4.4E-07   65.9   3.7   74   56-139   220-296 (877)
 24 KOG1215|consensus               94.9   0.067 1.5E-06   61.6   7.8  115   56-175   140-292 (877)
 25 PF03065 Glyco_hydro_57:  Glyco  94.8   0.071 1.5E-06   54.8   7.0  196  227-453   114-322 (360)
 26 COG1449 Alpha-amylase/alpha-ma  94.6     0.6 1.3E-05   51.3  13.7   79  227-309   183-262 (615)
 27 PF04748 Polysacc_deac_2:  Dive  93.8     2.8 6.1E-05   39.8  15.1  182  211-449    19-213 (213)
 28 PF15421 Polysacc_deac_3:  Puta  92.5     5.1 0.00011   40.9  15.0  167  226-451   132-303 (425)
 29 smart00192 LDLa Low-density li  92.1    0.08 1.7E-06   34.5   1.3   28   58-89      6-33  (33)
 30 COG2861 Uncharacterized protei  81.4      53  0.0012   31.7  15.9  204  178-449    28-245 (250)
 31 PF12999 PRKCSH-like:  Glucosid  77.9     1.6 3.5E-05   39.9   2.3   31  105-135    35-67  (176)
 32 KOG2397|consensus               75.3     2.1 4.5E-05   44.8   2.6   49   82-138    65-117 (480)
 33 COG4878 Uncharacterized protei  68.4 1.1E+02  0.0024   30.1  12.2   74  230-306    52-130 (309)
 34 KOG3509|consensus               63.5      13 0.00029   42.6   5.9   79   55-139    33-112 (964)
 35 PF07315 DUF1462:  Protein of u  62.0      27 0.00058   28.3   5.7   53  376-434    10-63  (93)
 36 COG3603 Uncharacterized conser  46.5      58  0.0012   27.8   5.5   67  167-240    52-127 (128)
 37 cd02995 PDI_a_PDI_a'_C PDIa fa  43.4      48   0.001   26.4   4.8   51  394-445     9-59  (104)
 38 PF01074 Glyco_hydro_38:  Glyco  42.9 1.2E+02  0.0026   29.5   8.3   80  227-314    64-145 (275)
 39 COG4837 Uncharacterized protei  36.9      69  0.0015   26.1   4.4   53  376-434    17-70  (106)
 40 COG3536 Uncharacterized protei  36.5      22 0.00048   29.7   1.6   81  166-248     2-92  (120)
 41 COG1543 Uncharacterized conser  36.4      89  0.0019   33.1   6.3   76  228-310   128-204 (504)
 42 PRK05406 LamB/YcsF family prot  35.6 2.9E+02  0.0064   26.8   9.3   82  194-287    22-110 (246)
 43 PTZ00062 glutaredoxin; Provisi  33.3 1.1E+02  0.0024   28.7   6.1   41  401-445    15-55  (204)
 44 PRK12569 hypothetical protein;  33.1 3.2E+02  0.0069   26.5   9.1   83  194-288    25-114 (245)
 45 cd02992 PDI_a_QSOX PDIa family  31.9   1E+02  0.0022   25.7   5.1   50  394-444    10-60  (114)
 46 COG1540 Uncharacterized protei  31.6 4.9E+02   0.011   25.2  10.5   83  194-288    22-111 (252)
 47 KOG1958|consensus               31.0 9.1E+02    0.02   28.2  15.0   44  227-274   229-273 (1129)
 48 cd07286 PX_SNX18 The phosphoin  28.3 1.1E+02  0.0023   26.6   4.6   74  389-468    42-122 (127)
 49 PF12340 DUF3638:  Protein of u  28.1 3.1E+02  0.0067   26.4   8.1   61  385-449    83-143 (229)
 50 KOG1225|consensus               27.0 1.6E+02  0.0035   31.9   6.5   15  149-164   345-360 (525)
 51 TIGR01068 thioredoxin thioredo  26.0 1.4E+02  0.0029   23.3   4.7   48  394-444     5-52  (101)
 52 KOG3509|consensus               25.6      78  0.0017   36.7   4.1   59   78-140    11-69  (964)
 53 PTZ00413 lipoate synthase; Pro  25.0 3.1E+02  0.0068   28.5   7.9   70  383-460   276-345 (398)
 54 TIGR01126 pdi_dom protein disu  24.8 1.1E+02  0.0023   24.1   3.9   49  394-444     5-53  (102)
 55 cd02956 ybbN ybbN protein fami  24.5 1.2E+02  0.0025   23.9   4.0   47  395-444     3-50  (96)
 56 PF07579 DUF1548:  Domain of Un  24.1      33 0.00071   29.8   0.7   48  385-432    47-94  (135)
 57 COG4637 Predicted ATPase [Gene  23.5 1.9E+02  0.0042   29.3   5.9   61  385-448   273-333 (373)
 58 cd02957 Phd_like Phosducin (Ph  23.4 2.1E+02  0.0046   23.5   5.6   37  403-443    24-60  (113)
 59 cd02996 PDI_a_ERp44 PDIa famil  22.0 1.8E+02  0.0039   23.5   4.8   40  394-434    10-49  (108)
 60 cd02954 DIM1 Dim1 family; Dim1  22.0 2.8E+02   0.006   23.5   5.9   46  393-442     3-50  (114)
 61 PLN02701 alpha-mannosidase      21.6 2.7E+02  0.0057   33.1   7.5   75  227-309   106-182 (1050)
 62 cd02987 Phd_like_Phd Phosducin  20.6 2.3E+02   0.005   25.8   5.6   37  404-444    84-120 (175)
 63 TIGR01358 DAHP_synth_II 3-deox  20.2   6E+02   0.013   26.9   8.9   77  195-274   293-379 (443)

No 1  
>TIGR03212 uraD_N-term-dom putative urate catabolism protein. This model represents a protein that is predominantly found just upstream of the UraD protein (OHCU decarboxylase) and in a number of instances as a N-terminal fusion with it. UraD itself catalyzes the last step in the catabolism of urate to allantoate. The function of this protein is presently unknown. It shows homology with the pfam01522 polysaccharide deacetylase domain family.
Probab=100.00  E-value=2.6e-33  Score=277.22  Aligned_cols=224  Identities=18%  Similarity=0.257  Sum_probs=181.8

Q ss_pred             EEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccC--CChHHHHHHHHcCCeeeeCCCCCCCCccccCCCCH
Q psy16052        181 ITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKY--TNYSAVQELHRKGHEVAVHSITHNDDENFWSNATV  258 (476)
Q Consensus       181 V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~--~~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~  258 (476)
                      ..+++-+..+..+.++++++|++       ++||+|||+++..  .+++++++|.++|||||+||++|...    ..++.
T Consensus        66 ~~~s~~~YG~rvG~~RiLdlL~~-------~gv~aTffv~g~~~e~~P~~v~~i~~~GHEIg~Hg~~H~~~----~~ls~  134 (297)
T TIGR03212        66 SMESLYEYGSRAGFWRLLRLFTE-------RGIPVTVFGVAMALARNPEAVAAMKEAGWEIASHGLRWIDY----QDMDE  134 (297)
T ss_pred             cchhhhhhcchhCHHHHHHHHHH-------cCCCEEEEeEHHHHHHCHHHHHHHHHcCCEEeeccccCccc----ccCCH
Confidence            35556566667778999999999       9999999999864  34599999999999999999999875    46899


Q ss_pred             HHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHH-cCCeEeeecccCCCCCCCCCccccccCCCCcCC
Q psy16052        259 DDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEE-QAFLYDSTITAPLSNPPLWPYTMYFRMPHRCHG  337 (476)
Q Consensus       259 e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~-~G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~~  337 (476)
                      ++.+++|.++.++|+   .++|..|+|||++  ..+.+ ++++|++ .||+|+||+..+  +.|+|..+           
T Consensus       135 ~~e~~~i~~s~~~i~---~~tG~~P~G~~~~--~~s~~-T~~LL~e~~Gf~Y~sd~~~d--D~Py~~~~-----------  195 (297)
T TIGR03212       135 AQEREHIAEAIRLHT---EVTGERPLGWYTG--RTSPN-TRRLVAEEGGFLYDADSYAD--DLPYWDEV-----------  195 (297)
T ss_pred             HHHHHHHHHHHHHHH---HHhCCCCceEECC--CCChh-HHHHHHHhcCceEeCchhhc--CCCeEeec-----------
Confidence            999999999999999   5889999999975  55665 8999998 999999999976  34555332           


Q ss_pred             CCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCC---cceEEeecchh
Q psy16052        338 NLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNR---APLGLYFHAAW  414 (476)
Q Consensus       338 ~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r---~p~~i~~H~~w  414 (476)
                             ...+++.||+++..++  ..|.. .         ....++++++++|+++|+.+|+..+   ..|.|.+| ++
T Consensus       196 -------~~~~~l~lP~~~~~nD--~~~~~-~---------~~~~~~~~~~~~~~d~fd~l~~eg~~~~~~~~i~lH-p~  255 (297)
T TIGR03212       196 -------AGRPQLIVPYTLDAND--MRFAT-P---------QGFNTGEQFFTYLRDAFDVLYAEGEGAPKMMSIGLH-CR  255 (297)
T ss_pred             -------CCCCeEEEecccccCc--HHHHh-c---------cCCCCHHHHHHHHHHHHHHHHHhCCCCCceEEEecC-cc
Confidence                   1124677999875432  11211 0         1125789999999999999997543   35789999 77


Q ss_pred             hcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhhCCc
Q psy16052        415 LKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQNPR  455 (476)
Q Consensus       415 ~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~~p~  455 (476)
                      ++|+|.|+.+|++||+||++ +++|||+|.+||++|++...
T Consensus       256 i~G~p~R~~~L~~~l~~i~~-~~~VW~at~~eIA~~~~~~~  295 (297)
T TIGR03212       256 LVGRPGRIAALQRFLDYVQS-HDKVWVARRIDIARHWHETH  295 (297)
T ss_pred             ccCCHHHHHHHHHHHHHHHh-CCCEEEEcHHHHHHHHHHhC
Confidence            99999999999999999998 99999999999999986543


No 2  
>PRK15394 4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol deformylase ArnD; Provisional
Probab=99.96  E-value=3e-27  Score=233.05  Aligned_cols=228  Identities=16%  Similarity=0.191  Sum_probs=176.8

Q ss_pred             EEEeecC---CCCCccHHHHHHHHhccCCCCCCCCceEEEEecccC----------------------------------
Q psy16052        181 ITITFDD---AINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKY----------------------------------  223 (476)
Q Consensus       181 V~lTFDd---g~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~----------------------------------  223 (476)
                      |+|-.|-   -.+..+.++++++|++       ++++||||.+..-                                  
T Consensus         4 ~~l~idvdt~~~~~~g~~~~~~~~~~-------~~~~a~f~~~~gpd~~g~~~~r~~~~~fl~k~~r~~~~~~yg~~~l~   76 (296)
T PRK15394          4 VGLRIDVDTFRGTREGVPRLLEILSK-------HGIQASFFFSVGPDNMGRHLWRLLKPRFLWKMLRSNAASLYGWDILL   76 (296)
T ss_pred             eEEEEEeecccccccCHHHHHHHHHH-------cCCCEEEEeccCCCchhHHHHHHhhhHHHHHHHhcCchhhcCHHHHh
Confidence            5555552   1345678999999999       9999999997310                                  


Q ss_pred             -----C-------ChHHHHHHHHcCCeeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCc
Q psy16052        224 -----T-------NYSAVQELHRKGHEVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYL  291 (476)
Q Consensus       224 -----~-------~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l  291 (476)
                           +       ++++++++.++ ||||+|||+|.........++.++++++|.++.++|+   .++|..++|||+|++
T Consensus        77 ~g~~~PG~~I~~~~P~lvr~i~~~-HEIG~Hg~~H~~wq~~~~~ls~~e~~~eI~ra~~~Le---~itG~~p~g~raPgw  152 (296)
T PRK15394         77 AGTAWPGKEIGKALADIIREAAKA-HEVGLHAWDHHAWQAWSGVWSRQQLIEQIARGVDALE---EIIGQPVTCSAAAGW  152 (296)
T ss_pred             hcccCCCchHHHHCHHHHHHHHhc-CEehhcccCccchhcccccCCHHHHHHHHHHHHHHHH---HHhCCCCCEEeCCCc
Confidence                 0       23789999999 9999999999953222346899999999999999999   589999999999999


Q ss_pred             ccCchHHHHHHHHcCCeEeeecccCCCCCCCCCccccccCCCCcCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCC
Q psy16052        292 RVGGNNQFTMMEEQAFLYDSTITAPLSNPPLWPYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPG  371 (476)
Q Consensus       292 ~~~~~~~~~~l~~~G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g  371 (476)
                      +.+.+ +++++++.||.|+||+..+   .|++|.+.+                ...+++|||+++..+      |+.. |
T Consensus       153 ~~~~~-tl~ll~e~Gf~Y~Ss~~~d---~Py~~~~~~----------------g~~~~veIP~tl~t~------D~~~-g  205 (296)
T PRK15394        153 RADQR-VVEAKEAFGFRYNSDCRGT---HPFRPLLPD----------------GSLGTVQIPVTLPTW------DEVI-G  205 (296)
T ss_pred             cCCHH-HHHHHHHcCCeeecCCCCC---CceeeecCC----------------CcCCeEEEccccccc------hhhh-c
Confidence            99876 8999999999999999753   566654421                223589999997432      2211 1


Q ss_pred             CcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHh
Q psy16052        372 CAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWI  451 (476)
Q Consensus       372 ~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~  451 (476)
                              ...+.+++.+++++.|++   ..+. .++++| ++++|.| ++.+|+++|+++++  .+|||+|++||+.-.
T Consensus       206 --------~~~~~~~f~~~~~d~~~~---~~g~-~V~tiH-P~v~G~~-~~~~fe~lL~~~~~--~gv~F~tl~el~~~~  269 (296)
T PRK15394        206 --------RDVKAEDFNDFILDRILR---DKGT-PVYTIH-AEVEGIA-YAHNFEDLLKRAAQ--EGITFCPLSELLPDD  269 (296)
T ss_pred             --------CCCCHHHHHHHHHHHhcc---ccCc-eEEEEC-CCcCCcc-hHHHHHHHHHHHhc--CCcEEeeHHHHHhhc
Confidence                    125678899999777754   2233 489999 5688888 99999999999985  679999999999766


Q ss_pred             hCCcchhcccc
Q psy16052        452 QNPRTITEVKN  462 (476)
Q Consensus       452 ~~p~~~~~~~~  462 (476)
                      ++..|+.++..
T Consensus       270 ~~~~p~~~~~~  280 (296)
T PRK15394        270 LETLPLGKVVR  280 (296)
T ss_pred             hhhCCccceee
Confidence            68888777654


No 3  
>TIGR03006 pepcterm_polyde polysaccharide deactylase family protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide deacetylases (pfam01522). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene. The highest scoring homologs below the trusted cutoff for this model are found in several species of Methanosarcina, an archaeal genus.
Probab=99.92  E-value=6.4e-24  Score=207.16  Aligned_cols=210  Identities=22%  Similarity=0.361  Sum_probs=145.4

Q ss_pred             cHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCCCCHHHHHHHHHHHHH
Q psy16052        193 NIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSNATVDDWAKEMAGMRI  270 (476)
Q Consensus       193 ~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~  270 (476)
                      ++.+++++|++       ++++|||||++...  ..++|++|.++|||||+||++|...    ..++.+++++||.++++
T Consensus        29 nt~riL~lL~~-------~gikATFFv~g~~~e~~p~lir~i~~~GhEIgsHg~sH~~l----~~ls~ee~~~eI~~s~~   97 (265)
T TIGR03006        29 NTDRILDLLDR-------HGVKATFFTLGWVAERYPELVRRIVAAGHELASHGYGHERV----TTQTPEAFRADIRRSKA   97 (265)
T ss_pred             hHHHHHHHHHH-------cCCcEEEEEeccchhhCHHHHHHHHHcCCEeeeccccCcCc----hhCCHHHHHHHHHHHHH
Confidence            67899999999       99999999998643  3489999999999999999999975    56899999999999999


Q ss_pred             HHHHhhcccCCcccEEecCCcccCchH--HHHHHHHcCCeEeeecc---cCCCCCC---CCCccccccCCCCcCCCCcCC
Q psy16052        271 IAEKYANLSDNSVVGVRAPYLRVGGNN--QFTMMEEQAFLYDSTIT---APLSNPP---LWPYTMYFRMPHRCHGNLQHC  342 (476)
Q Consensus       271 ~l~~~~~i~g~~~~gfR~P~l~~~~~~--~~~~l~~~G~~Ydss~~---~~~~~~p---~~pytl~~~~~~~c~~~~~~c  342 (476)
                      +|+   .++|..++|||+|++..+..+  .+++|+++||.||||+.   .+..+.|   .+||.                
T Consensus        98 ~Le---~itG~~~~gfRaP~~s~~~~t~~a~~iL~e~Gy~YdsS~~p~~~d~~g~~~~~~~p~~----------------  158 (265)
T TIGR03006        98 LLE---DLSGQPVRGYRAPSFSIGKKNLWALDVLAEAGYRYSSSIYPVRHDHYGMPDAPRFPFR----------------  158 (265)
T ss_pred             HHH---HHhCCCceEEECCCCCCCCCcHHHHHHHHHCCCEEEEeeccCcCCcCCCCCCCCCcee----------------
Confidence            999   588999999999999887652  37999999999999995   2211122   12221                


Q ss_pred             CCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchhhcC--ChH
Q psy16052        343 PTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKN--NPE  420 (476)
Q Consensus       343 p~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~--~p~  420 (476)
                       ....+|+|||++.+.     .++..+| +. ..+-.. +-|-.+.+++   +.+.....+.|+++||| +|...  +|.
T Consensus       159 -~~~g~l~e~P~s~~~-----~~~~~~p-~~-gg~~~r-~~P~~~~~~~---~~~~~~~~~~~~~~y~H-pwe~dp~~~~  225 (265)
T TIGR03006       159 -PPNGRLLEVPVTTVR-----LGGRNLP-AA-GGGYFR-LLPYALSRWA---LRRVNGREGRPAIFYFH-PWEIDPDQPR  225 (265)
T ss_pred             -cCCCcEEEEccchhh-----hcccccc-cC-CCcchh-cChHHHHHHH---HHHHHhcCCceEEEEEc-hhhcCCCCCC
Confidence             112349999998753     1222233 00 000000 1223333333   33333333689999999 77763  221


Q ss_pred             --------------HHHHHHHHHHHHHhhCCCEEEEeHHHHH
Q psy16052        421 --------------FLDAFLYWVDEIIERHKDVYFVTMTQVI  448 (476)
Q Consensus       421 --------------~~~al~~fl~~i~~~~~dV~fvT~~qi~  448 (476)
                                    .+...+.=|+.++   .|.-|+|++|++
T Consensus       226 ~~~~~~~~~fr~y~~~~~~~~~l~~ll---~~~~~~~~~~~~  264 (265)
T TIGR03006       226 IAGISMKSRFRHYLNLDRMEGRLRRLL---GDFRWGRMDRVF  264 (265)
T ss_pred             CCCCCccceeeeecCHHHHHHHHHHHH---hcCceeeHHHhh
Confidence                          1234444456665   488999999986


No 4  
>TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB. This model describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reducted sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterized polysaccharide deacetylase; the exact function this protein family is unknown.
Probab=99.90  E-value=9e-23  Score=190.76  Aligned_cols=184  Identities=20%  Similarity=0.287  Sum_probs=143.2

Q ss_pred             CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCC
Q psy16052        178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSN  255 (476)
Q Consensus       178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~  255 (476)
                      .+.|+||||||+....+.+++++|++       ++++|||||++.+.  .++++++|.++|||||+|||+|...    ..
T Consensus         5 ~k~V~LTFDDgp~~~~t~~~l~~L~~-------~~ikaTfFv~g~~~~~~~~~~~~i~~~Gheig~Ht~~H~~~----~~   73 (191)
T TIGR02764         5 DKKIALTFDISWGNDYTEPILDTLKE-------YDVKATFFLSGSWAERHPELVKEIVKDGHEIGSHGYRHKNY----TT   73 (191)
T ss_pred             CCEEEEEEECCCCcccHHHHHHHHHH-------cCCCEEEEeccHHHHHCHHHHHHHHhCCCEEEECCcCCCCc----cc
Confidence            46899999999998777899999999       99999999999753  3489999999999999999999975    56


Q ss_pred             CCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCeEee-ecccCCCCCCCCCccccccCCCC
Q psy16052        256 ATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFLYDS-TITAPLSNPPLWPYTMYFRMPHR  334 (476)
Q Consensus       256 ~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~Yds-s~~~~~~~~p~~pytl~~~~~~~  334 (476)
                      ++.+++++||.++.+.|+   .+.|..+++||+||+..+.. ++++++++||.+-. ++-     ...|-          
T Consensus        74 ~~~~~~~~ei~~~~~~l~---~~~g~~~~~fr~P~G~~~~~-~~~~l~~~G~~~v~w~~~-----~~D~~----------  134 (191)
T TIGR02764        74 LEDEKIKKDILRAQEIIE---KLTGKKPTLFRPPSGAFNKA-VLKAAESLGYTVVHWSVD-----SRDWK----------  134 (191)
T ss_pred             CCHHHHHHHHHHHHHHHH---HHhCCCCCEEECCCcCCCHH-HHHHHHHcCCeEEEecCC-----CCccC----------
Confidence            899999999999999998   57788999999999999876 89999999995211 100     00110          


Q ss_pred             cCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchh
Q psy16052        335 CHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAW  414 (476)
Q Consensus       335 c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w  414 (476)
                                                                    ..+++++++.+..+.   ..  +  -+|.+|.. 
T Consensus       135 ----------------------------------------------~~~~~~i~~~~~~~~---~~--g--~Iil~Hd~-  160 (191)
T TIGR02764       135 ----------------------------------------------NPGVESIVDRVVKNT---KP--G--DIILLHAS-  160 (191)
T ss_pred             ----------------------------------------------CCCHHHHHHHHHhcC---CC--C--CEEEEeCC-
Confidence                                                          013445554443322   12  2  37889942 


Q ss_pred             hcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHH
Q psy16052        415 LKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVI  448 (476)
Q Consensus       415 ~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~  448 (476)
                       ...+.-.++|.++|+++++  .+..|||.+|++
T Consensus       161 -~~~~~t~~~l~~~i~~l~~--~Gy~~vtl~~l~  191 (191)
T TIGR02764       161 -DSAKQTVKALPTIIKKLKE--KGYEFVTISELI  191 (191)
T ss_pred             -CCcHhHHHHHHHHHHHHHH--CCCEEEEHHHhC
Confidence             1245678999999999986  799999999874


No 5  
>TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase. Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores in the peptidoglycan wall, or spore cortex. The proteins in this family are PdaA (yfjS), a member of a larger family of polysaccharide deacetylases, and are specificially involved in delta-lactam biosynthesis. PdaA acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the following transpeptidation for lactam ring formation, as heterologous expression in E. coli of CwlD and PdaA together is sufficient for delta-lactam production.
Probab=99.89  E-value=5.2e-22  Score=190.07  Aligned_cols=117  Identities=18%  Similarity=0.327  Sum_probs=105.2

Q ss_pred             CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCC
Q psy16052        178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSN  255 (476)
Q Consensus       178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~  255 (476)
                      .+.|+||||||+.....++++++|++       ++++|||||++.+.  +.+++++|+++|||||+|||+|...    ..
T Consensus        36 ~k~VaLTFDDGp~~~~t~~lL~~L~~-------~~vkATFFv~G~~~~~~p~~ir~i~~~GheIgnHt~~H~~~----~~  104 (224)
T TIGR02884        36 KKVIYLTFDNGYENGYTPKILDVLKE-------KKVPAAFFVTGHYIKTQPDLIKRMVDEGHIVGNHSVHHPSL----TA  104 (224)
T ss_pred             CCEEEEEEECCCCccchHHHHHHHHH-------cCCCeEEEeechhhHHCHHHHHHHHHcCCEeeecCccCcCc----cc
Confidence            57899999999987777899999999       99999999999764  4589999999999999999999875    56


Q ss_pred             CCHHHHHHHHHHHHHHHHHhhcccCCc-ccEEecCCcccCchHHHHHHHHcCCeE
Q psy16052        256 ATVDDWAKEMAGMRIIAEKYANLSDNS-VVGVRAPYLRVGGNNQFTMMEEQAFLY  309 (476)
Q Consensus       256 ~s~e~~~~ei~~~~~~l~~~~~i~g~~-~~gfR~P~l~~~~~~~~~~l~~~G~~Y  309 (476)
                      ++.+++++||.++.++|+   .++|.. +..||+||+..+.. ++++++++||.+
T Consensus       105 ls~~~~~~ei~~~~~~i~---~~~G~~~~~~fR~P~G~~~~~-~~~~l~~~Gy~~  155 (224)
T TIGR02884       105 VNDEKFKEELTGVEEEFK---KVTGQKEMKYFRPPRGVFSER-TLAYTKELGYYT  155 (224)
T ss_pred             CCHHHHHHHHHHHHHHHH---HHhCCCCCCEEeCCCCCcCHH-HHHHHHHcCCcE
Confidence            899999999999999999   477877 89999999999876 899999999963


No 6  
>TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family. Members of this protein family are most closely related to TIGR02764, a subset of polysaccharide deacetylase family proteins found in a species if and only if the species forms endospores like those of Bacillus subtilis or Clostridium tetani. This family is likewise restricted to spore-formers, but is not universal among them in having sequences with full-length matches to the model.
Probab=99.88  E-value=7.7e-22  Score=193.01  Aligned_cols=182  Identities=17%  Similarity=0.204  Sum_probs=141.6

Q ss_pred             CeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCCC
Q psy16052        179 QMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSNA  256 (476)
Q Consensus       179 q~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~  256 (476)
                      .+|+||||||+....+++++++|++       ++++||||++|.+.  +.+++++++++|||||+|||+|...    ..+
T Consensus        85 k~VaLTFDdg~~~~~t~~iL~iLkk-------~~vkATFFv~G~~i~~~p~l~k~i~~~GheIGnHT~sH~~l----~~l  153 (268)
T TIGR02873        85 PMVALLINVAWGNEYLPEILQILKK-------HDVKATFFLEGKWVKENSQLAKMIVEQGHEIGNHAYNHPDM----ATL  153 (268)
T ss_pred             CEEEEEEeCCCCcchHHHHHHHHHH-------CCCCEEEEeehHhhhHCHHHHHHHHHCCCEEEecCCcCCCc----ccC
Confidence            6899999999988888999999999       99999999999753  4589999999999999999999885    568


Q ss_pred             CHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCeEeeecccCCCCCCCCCccccccCCCCcC
Q psy16052        257 TVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFLYDSTITAPLSNPPLWPYTMYFRMPHRCH  336 (476)
Q Consensus       257 s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~  336 (476)
                      +.+++++||.++.++|+   .++|..++.||+||+..+.. ++++++++||..-           .|  +.+.       
T Consensus       154 s~~~~~~Ei~~~~~~i~---~~~G~~p~~fRpP~G~~n~~-~~~~l~~~G~~~v-----------~W--svd~-------  209 (268)
T TIGR02873       154 SKEEIYDQINQTNEIIE---ATIGVTPKWFAPPSGSFNDN-VVQIAADLQMGTI-----------MW--TVDT-------  209 (268)
T ss_pred             CHHHHHHHHHHHHHHHH---HHhCCCCCEEECCCCCCCHH-HHHHHHHCCCeEE-----------Ee--ccCC-------
Confidence            99999999999999999   57888999999999998876 8999999998421           11  1110       


Q ss_pred             CCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchhhc
Q psy16052        337 GNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLK  416 (476)
Q Consensus       337 ~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~  416 (476)
                                                  .|..            ..+++.+.+.+...    .. ++  -+|.+|..   
T Consensus       210 ----------------------------~Dw~------------~~~~~~i~~~v~~~----~~-~G--~IILmHd~---  239 (268)
T TIGR02873       210 ----------------------------IDWK------------NPSPSVMVNRVLSK----IH-PG--AMVLMHPT---  239 (268)
T ss_pred             ----------------------------CCCC------------CCCHHHHHHHHHhc----CC-CC--cEEEEcCC---
Confidence                                        0000            01344444444221    11 22  36788943   


Q ss_pred             CChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHH
Q psy16052        417 NNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQ  449 (476)
Q Consensus       417 ~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~  449 (476)
                        ....++|.++|.++++  .+.-|||.+|+++
T Consensus       240 --~~T~~aL~~iI~~Lk~--kGy~fvtl~ell~  268 (268)
T TIGR02873       240 --ASSTEGLEEMITIIKE--KGYKIGTITELLD  268 (268)
T ss_pred             --ccHHHHHHHHHHHHHH--CCCEEEeHHHhhC
Confidence              2357899999999985  8999999999863


No 7  
>PF01522 Polysacc_deac_1:  Polysaccharide deacetylase;  InterPro: IPR002509 This domain is found in polysaccharide deacetylase. This family of polysaccharide deacetylases includes NodB (nodulation protein B from Rhizobium) which is a chitooligosaccharide deacetylase []. It also includes chitin deacetylase from yeast [], and endoxylanases which hydrolyses glucosidic bonds in xylan [].; GO: 0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, 0005975 carbohydrate metabolic process; PDB: 2IW0_A 2CC0_B 2VYO_A 2J13_A 2C71_A 2C79_A 1W1A_1 1W1B_1 1W17_A 1NY1_B ....
Probab=99.85  E-value=2.2e-21  Score=167.79  Aligned_cols=118  Identities=25%  Similarity=0.439  Sum_probs=98.3

Q ss_pred             CCCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCCC--hHHHHHHHHcCCeeeeCCCCCCCCccccC
Q psy16052        177 VPQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYTN--YSAVQELHRKGHEVAVHSITHNDDENFWS  254 (476)
Q Consensus       177 ~Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~--~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~  254 (476)
                      .++.|+||||||+. ....+++++|++       +++++||||++.+..  ++++++|.++|||||+||++|...    .
T Consensus         4 ~~~~v~ltfDdg~~-~~~~~~~~~l~~-------~~i~at~fv~~~~~~~~~~~l~~l~~~G~ei~~H~~~H~~~----~   71 (123)
T PF01522_consen    4 PKKSVALTFDDGYR-DNYDRLLPLLKK-------YGIPATFFVIGSWVERYPDQLRELAAAGHEIGNHGWSHPNL----S   71 (123)
T ss_dssp             TSSEEEEEEESHCH-THHHHHHHHHHH-------TT--EEEEE-HHHHHHHHHHHHHHHHTT-EEEEE-SSSSCG----G
T ss_pred             CCCEEEEEEecCch-hhHHHHHHHHHh-------cccceeeeecccccccccccchhHHHHHHHHHhcCCccccc----c
Confidence            46899999999994 445789999999       999999999987542  389999999999999999999985    5


Q ss_pred             CCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCeEe
Q psy16052        255 NATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFLYD  310 (476)
Q Consensus       255 ~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~Yd  310 (476)
                      .++.+++++||.++++.|+   .+.|..+.+||+||+..+.+ ++++|+++||.||
T Consensus        72 ~~~~~~~~~ei~~~~~~l~---~~~g~~~~~f~~P~g~~~~~-~~~~l~~~G~~y~  123 (123)
T PF01522_consen   72 TLSPEELRREIERSREILE---EITGRPPKGFRYPFGSYDDN-TLQALREAGYKYD  123 (123)
T ss_dssp             GS-HHHHHHHHHHHHHHHH---HHHSSEESEEE-GGGEECHH-HHHHHHHTT-EEE
T ss_pred             cCCHHHHHHHHHHHHHHHH---HHhCCCCcEEECCCCCCCHH-HHHHHHHcCCCcC
Confidence            6899999999999999999   47788999999999999987 9999999999997


No 8  
>COG0726 CDA1 Predicted xylanase/chitin deacetylase [Carbohydrate transport and metabolism]
Probab=99.76  E-value=2.6e-17  Score=160.15  Aligned_cols=115  Identities=23%  Similarity=0.409  Sum_probs=103.0

Q ss_pred             CeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCCC
Q psy16052        179 QMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSNA  256 (476)
Q Consensus       179 q~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~  256 (476)
                      ..|+||||||+...++++++++|++       ++++|||||++.+.  ..++++++.+.|||||+|+++|...    ..+
T Consensus        65 k~v~lTFDDg~~~~~~~~il~iL~k-------~~i~ATfFv~g~~i~~~p~~~~~~~~~Gheig~H~~~h~~~----~~~  133 (267)
T COG0726          65 KAVALTFDDGPLDGNTPRILPLLKK-------YGIKATFFVVGSWIERNPDLVKRIAEAGHEIGNHGYDHPDL----QDL  133 (267)
T ss_pred             CeEEEEeecCCCCCCcHHHHHHHHH-------cCCceEEEEehHhhHHCHHHHHHHHhccCeehhCcccCCCc----ccC
Confidence            7899999999998677899999999       99999999999653  2489999999999999999999985    568


Q ss_pred             CHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCe
Q psy16052        257 TVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFL  308 (476)
Q Consensus       257 s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~  308 (476)
                      +.++.+++|.++...|.   .++|..+++||+|++..+.. +..++++.|+.
T Consensus       134 ~~~~~~~~i~~~~~~l~---~~~g~~~~~~r~p~g~~~~~-~~~~~~~~g~~  181 (267)
T COG0726         134 SLEELGAEIARAHDILK---KITGGRPRGFRPPYGSYNES-SLALARRLGYR  181 (267)
T ss_pred             CHHHHHHHHHHHHHHHH---HHhCCCCceEECCCCccCHH-HHHHHHhCCCc
Confidence            89999999999999998   58888899999999999865 88899999984


No 9  
>PRK14581 hmsF outer membrane N-deacetylase; Provisional
Probab=99.50  E-value=1.4e-13  Score=148.52  Aligned_cols=125  Identities=13%  Similarity=0.193  Sum_probs=99.6

Q ss_pred             CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccC---------------------CChHHHHHHHHcC
Q psy16052        178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKY---------------------TNYSAVQELHRKG  236 (476)
Q Consensus       178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~---------------------~~~~~v~~l~~~G  236 (476)
                      ++.|+||||||+.. +...++.||++       ++++||||+++.+                     .+|++|++|.+.|
T Consensus       106 ~KaV~LTFDDGy~s-ny~~AlPILKk-------yg~pATfFvVg~wid~~~~~~v~~~~~~~~r~~~LTW~qIrEM~~sG  177 (672)
T PRK14581        106 DKAVLLTFDDGYSS-FYRRVYPLLKA-------YKWSAVLAPVGTWIDTATDKKVDFGGLSTDRDRFATWKQITEMSKSG  177 (672)
T ss_pred             CCeEEEEEEcCCcc-hHHHHHHHHHH-------cCCCEEEEEechhhccCcccccccccccccccccCCHHHHHHHHHCC
Confidence            58999999999976 45788999999       9999999997643                     2457999999999


Q ss_pred             -CeeeeCCCCCCCCcc---------------ccC----CCC----HHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcc
Q psy16052        237 -HEVAVHSITHNDDEN---------------FWS----NAT----VDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLR  292 (476)
Q Consensus       237 -HEIg~Hs~~H~~~~~---------------~~~----~~s----~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~  292 (476)
                       ||||+||++|.....               +..    ..+    .+++++||.++++.|+   ..+|..++.|++||+.
T Consensus       178 LvEIGSHT~sHh~~~~~np~g~~~pa~~~~~y~~~~~~yEs~~~~~~rl~~Di~~s~~~Ie---~~lG~~p~~FayPyG~  254 (672)
T PRK14581        178 LVEIGAHTYASHYGVIANPQGNTEPAAANLQYDPKTKQYETVEAFKQRMEKDVALITQRIV---QATGKQPRVWVWPYGA  254 (672)
T ss_pred             CcEEEcCCcccccccccCccccccchhhhccccccccccccHHHHHHHHHHHHHHHHHHHH---HHhCCCCCEEEcCCCC
Confidence             699999999873110               000    012    2346678888999998   5888999999999999


Q ss_pred             cCchHHHHHHHHcCCeEeeecc
Q psy16052        293 VGGNNQFTMMEEQAFLYDSTIT  314 (476)
Q Consensus       293 ~~~~~~~~~l~~~G~~Ydss~~  314 (476)
                      ++.. +.+++++.||.+.-+..
T Consensus       255 yn~~-~~~iak~aGy~~afTt~  275 (672)
T PRK14581        255 PNGT-VLNILRQHGYQLAMTLD  275 (672)
T ss_pred             cCHH-HHHHHHHCCCcEEEECC
Confidence            9886 89999999998776654


No 10 
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=99.48  E-value=2.4e-13  Score=147.02  Aligned_cols=125  Identities=14%  Similarity=0.195  Sum_probs=99.7

Q ss_pred             CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC---------------------ChHHHHHHHHcC
Q psy16052        178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT---------------------NYSAVQELHRKG  236 (476)
Q Consensus       178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~---------------------~~~~v~~l~~~G  236 (476)
                      ++.|+||||||+.... ..++.||++       ++++||||+++.+.                     +|++|++|.+.|
T Consensus       106 ~K~VaLTFDDGy~s~y-t~A~PILkk-------ygvpATfFlvg~wi~~~~~~~~~~~~~~~~r~~~LtWeqIreM~~sG  177 (671)
T PRK14582        106 EKAVLLTFDDGYSSFY-TRVFPILQA-------FQWPAVWAPVGSWVDTPADQPVKFGGEMVPREYFATWQQVREVARSR  177 (671)
T ss_pred             CCeEEEEEEcCCCchH-HHHHHHHHH-------cCCCEEEEEechhhccCccccccccccccccccCCCHHHHHHHHhCC
Confidence            5799999999997655 578999999       99999999986432                     357999999999


Q ss_pred             -CeeeeCCC-CCCCCcc--------------cc----CCCCH----HHHHHHHHHHHHHHHHhhcccCCcccEEecCCcc
Q psy16052        237 -HEVAVHSI-THNDDEN--------------FW----SNATV----DDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLR  292 (476)
Q Consensus       237 -HEIg~Hs~-~H~~~~~--------------~~----~~~s~----e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~  292 (476)
                       ||||+||+ +|.....              |.    .-.+.    +.+++||.++++.|+   ..+|..++.|++||+.
T Consensus       178 lvEIGSHT~~sH~~l~anp~g~~~pa~~~r~~~~~~~~yEs~~e~~~ri~~DL~~s~~~Ie---~~tG~~p~~FayPyG~  254 (671)
T PRK14582        178 LVEIASHTWNSHYGIQANPQGSLLPAAVNRAYFTDHARYETAAEYRERIRLDAVKMTEYIR---TKAGKNPRVWVWPYGE  254 (671)
T ss_pred             CeEEEcCCchhccccccccccccccchhcccccccccccccHHHHHHHHHHHHHHHHHHHH---HHhCCCCcEEecCCCC
Confidence             69999999 5874311              00    00122    336789999999999   5889999999999999


Q ss_pred             cCchHHHHHHHHcCCeEeeecc
Q psy16052        293 VGGNNQFTMMEEQAFLYDSTIT  314 (476)
Q Consensus       293 ~~~~~~~~~l~~~G~~Ydss~~  314 (476)
                      ++.. ..+++++.||.+.-+..
T Consensus       255 ~n~~-~~~iakeaGY~~afT~~  275 (671)
T PRK14582        255 ANGI-ALEELKKLGYDMAFTLE  275 (671)
T ss_pred             CCHH-HHHHHHHCCCeEEEEeC
Confidence            9886 89999999998766643


No 11 
>PF00057 Ldl_recept_a:  Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor;  InterPro: IPR002172  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.95  E-value=3.9e-10  Score=75.92  Aligned_cols=36  Identities=47%  Similarity=1.150  Sum_probs=34.1

Q ss_pred             CCCCCceecCCCCccCCCcccCCCCCCCCCCCCCCC
Q psy16052        102 LCQDGFLACGDGACIERGLFCNGEKDCNDGSDENTC  137 (476)
Q Consensus       102 ~C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~sDe~~c  137 (476)
                      .|+..+|+|.+|.||+..++|||..||.|+|||.+|
T Consensus         2 ~C~~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~C   37 (37)
T PF00057_consen    2 TCPPGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQNC   37 (37)
T ss_dssp             SSSTTEEEETTSSEEEGGGTTSSSCSSSSSTTTSSH
T ss_pred             cCcCCeeEcCCCCEEChHHcCCCCCCCCCCcccccC
Confidence            488999999999999999999999999999999876


No 12 
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.85  E-value=1.4e-09  Score=72.48  Aligned_cols=35  Identities=46%  Similarity=1.132  Sum_probs=32.7

Q ss_pred             CCCCceecCCCCccCCCcccCCCCCCCCCCCCCCC
Q psy16052        103 CQDGFLACGDGACIERGLFCNGEKDCNDGSDENTC  137 (476)
Q Consensus       103 C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~sDe~~c  137 (476)
                      |++.+|+|.+|.||+..++|||..||.|+|||..|
T Consensus         1 C~~~~f~C~~~~Ci~~~~~CDg~~DC~dgsDE~~C   35 (35)
T cd00112           1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC   35 (35)
T ss_pred             CCCCeEEcCCCCeeCHHHcCCCccCCCCCcccccC
Confidence            56789999999999999999999999999999876


No 13 
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. The N-terminal  type A repeats in LDL receptor bind the lipoproteins. Other homologous  domains occur in related receptors, including the very low-density  lipoprotein receptor and the LDL receptor-related protein/alpha  2-macroglobulin receptor, and in proteins which are functionally  unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.53  E-value=5.9e-08  Score=63.72  Aligned_cols=32  Identities=47%  Similarity=1.096  Sum_probs=30.1

Q ss_pred             CCCCceecCCCCccCCCcccCCCCCCCCCCCC
Q psy16052        103 CQDGFLACGDGACIERGLFCNGEKDCNDGSDE  134 (476)
Q Consensus       103 C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~sDe  134 (476)
                      |...+|+|.+|.||+..++|||..||.|+|||
T Consensus         2 C~~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE   33 (33)
T smart00192        2 CPPGEFQCDNGRCIPLSWVCDGVDDCSDGSDE   33 (33)
T ss_pred             CCCCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence            66679999999999999999999999999998


No 14 
>PF09960 DUF2194:  Uncharacterized protein conserved in bacteria (DUF2194);  InterPro: IPR018695 This family of prokaryotic proteins has no known function; however it may be a membrane protein.
Probab=98.10  E-value=0.00016  Score=78.04  Aligned_cols=206  Identities=13%  Similarity=0.100  Sum_probs=125.2

Q ss_pred             cHHHHHHHHhccCCCCCCCCceEEEEecccCCCh-----------------HHHHHHHHcCCeeeeCCCCCCCCccccCC
Q psy16052        193 NIALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY-----------------SAVQELHRKGHEVAVHSITHNDDENFWSN  255 (476)
Q Consensus       193 ~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~-----------------~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~  255 (476)
                      =|+.++++-++       +++|.|.++...|.+.                 ...++|.+.|||||.|||.|+.+..  ..
T Consensus       288 WWpDM~~la~~-------y~ikyTg~~I~~Y~~~~~~pf~~~~~~~~~~~~~~g~~Ll~~ggElG~HGYNHqpL~~--~~  358 (585)
T PF09960_consen  288 WWPDMKKLAKK-------YGIKYTGVMIENYNDQTNPPFEFLEQEDTDRFIYFGRELLKSGGELGLHGYNHQPLTL--EG  358 (585)
T ss_pred             ccHHHHHHHHH-------cCCeEEEEEccccCCCCCCCcccccccchhhHHHHHHHHHhcCCceEEecccCCCCcC--CC
Confidence            44566677777       9999999999876421                 3568899999999999999998632  12


Q ss_pred             ----------C-CHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHc-C-CeEeeecccCCCCCCC
Q psy16052        256 ----------A-TVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQ-A-FLYDSTITAPLSNPPL  322 (476)
Q Consensus       256 ----------~-s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~-G-~~Ydss~~~~~~~~p~  322 (476)
                                + +.+++.+-+...++.+++  -.+...+..+-+|---++.. -.++|.+. . +..-+|......  -.
T Consensus       359 ~~~~~~~Y~~W~~~~~m~~sl~~l~~f~~~--l~p~~~~~~YVPPSNils~e-G~e~L~~~~P~ik~IaS~Y~~~~--~~  433 (585)
T PF09960_consen  359 DYGDEYGYKPWPSKEDMAESLKELKRFVKS--LFPNYEPSSYVPPSNILSEE-GREALKKAFPEIKTIASLYFGDD--EE  433 (585)
T ss_pred             cccccccCcCCCCHHHHHHHHHHHHHHHHH--hCCCcceEEecCCccccCHH-HHHHHHHhCCCeEEEEEeeecCC--cC
Confidence                      2 556666666655555554  34567888899998888776 56666653 1 334444432110  02


Q ss_pred             CCccccccCCCCcCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccC
Q psy16052        323 WPYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQN  402 (476)
Q Consensus       323 ~pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~  402 (476)
                      -+|...|+..             ..+++|+|=..             .|.          -+++..  +...+...-.- 
T Consensus       434 ~~y~QEF~~~-------------~dgi~e~PRis-------------SG~----------~~~~~~--~~~~~s~l~~~-  474 (585)
T PF09960_consen  434 GEYVQEFEIA-------------PDGIVEFPRIS-------------SGY----------YPDDYM--LWAIVSALNNH-  474 (585)
T ss_pred             CcceEEeeEC-------------CCCeEEeCccc-------------cCC----------ccChHH--HHHHHHHHHhh-
Confidence            2455555431             23588888432             121          011211  11223222110 


Q ss_pred             CcceEEeecchhhcC--C------hHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhh
Q psy16052        403 RAPLGLYFHAAWLKN--N------PEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQ  452 (476)
Q Consensus       403 r~p~~i~~H~~w~~~--~------p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~  452 (476)
                       ..+.=.+||.=+..  |      .+....|++||.++.++.|-....|.+|.+.-++
T Consensus       475 -Gv~sHFiHPDDvld~dR~~~~~W~~l~~~l~~~~~~i~~~~P~Lr~~T~Se~~~~~~  531 (585)
T PF09960_consen  475 -GVFSHFIHPDDVLDEDRNAGKGWEELYKSLDKFMSWIKESYPWLRNLTASEAAGALQ  531 (585)
T ss_pred             -eeeecccCCccccccccCCCCCHHHHHHHHHHHHHHHHHhCCCcccCcHHHHHHHHH
Confidence             11222346531211  1      2367899999999998899999999999986553


No 15 
>PF10096 DUF2334:  Uncharacterized protein conserved in bacteria (DUF2334);  InterPro: IPR018763 This group of proteins has no known function.
Probab=97.97  E-value=0.00029  Score=68.37  Aligned_cols=99  Identities=17%  Similarity=0.290  Sum_probs=73.9

Q ss_pred             HHHHHHHHhccCCCCCCCCceEEEEecccCCC---------------hHHHHHHHHcCCeeeeCCCCCCCCc-------c
Q psy16052        194 IALYKEMFNGKRKNPNGCDIKATFFVSHKYTN---------------YSAVQELHRKGHEVAVHSITHNDDE-------N  251 (476)
Q Consensus       194 ~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~---------------~~~v~~l~~~GHEIg~Hs~~H~~~~-------~  251 (476)
                      ...+.++|.+       .+|+.+.+|+..+.+               ...++.+.++|++|+.||++|....       .
T Consensus        18 l~~i~d~l~~-------~~ipf~v~vIP~~~d~~~~~~~~l~~~~~f~~~L~~~~~~Gg~I~lHGYtHq~~~~~sg~~~e   90 (243)
T PF10096_consen   18 LKEIADYLYK-------YGIPFSVAVIPVYVDPNGGITVNLSDNPEFVEYLRYLQARGGEIVLHGYTHQYGNSVSGDGFE   90 (243)
T ss_pred             HHHHHHHHHH-------CCCCEEEEEEecccCCCCcccccchhhHHHHHHHHHHHhcCCEEEEEecceecCCCcccccce
Confidence            3445677877       799988888754321               1578888999999999999999832       2


Q ss_pred             ccC-----CCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHH
Q psy16052        252 FWS-----NATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEE  304 (476)
Q Consensus       252 ~~~-----~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~  304 (476)
                      +|.     ..+.+..++-|.++.+++++    .|-.+.||-+|....+.. +++++++
T Consensus        91 f~~~~~~~~~~~~~a~~ri~~gl~~l~~----~gi~p~~f~aP~y~~s~~-~~~~~~~  143 (243)
T PF10096_consen   91 FWDSEFDAGDSEEEAKERIEKGLEILAE----LGIYPVGFEAPHYWASPE-TYKVLEK  143 (243)
T ss_pred             eccccccccCCHHHHHHHHHHHHHHHHH----CCCcccEEECCccccCHH-HHHHHHh
Confidence            333     25677888888888888874    255899999999998886 6766643


No 16 
>PF01607 CBM_14:  Chitin binding Peritrophin-A domain;  InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=97.86  E-value=4.8e-06  Score=60.43  Aligned_cols=53  Identities=34%  Similarity=0.767  Sum_probs=34.8

Q ss_pred             cCCCCCCceeeccCCCCCCCCCccccCCCCcccccCCCCcccccccccccCCcccccc
Q psy16052         26 CKDKDAGEWFRLVAGEGDNCRDVIQCTSSGLQAIRCPAGLYFDIEKQTCDWKGAVKNC   83 (476)
Q Consensus        26 C~~~~~~~~~~~~~~~~~~C~~~~~C~~g~~~~~~C~~g~~~~~~~~~C~~~~~~~~C   83 (476)
                      |.++.++ +|+++.    +|+.|+.|.+|.....+|+.|+.||...+.|+++..+..|
T Consensus         1 C~~~~~~-~~~~~~----~C~~Y~~C~~g~~~~~~C~~g~~fd~~~~~C~~~~~~~~C   53 (53)
T PF01607_consen    1 CPGRGDG-FYPHPD----DCRKYYQCVNGQAVEQRCPEGLYFDPSSQRCVPPSNVVQC   53 (53)
T ss_dssp             ----SSE-EE--SS-----SSEEEEEETTEEEEEE-TTS-EE-TTTSSEE-TTT-TT-
T ss_pred             CCccCCe-eEeCCC----CCCEEEEeeCCcEECCCCcCCCEECcCcCEEcCCccCCCC
Confidence            5666544 777777    8999999999999999999999999999999987664444


No 17 
>COG3233 Predicted deacetylase [General function prediction only]
Probab=97.68  E-value=0.00054  Score=63.88  Aligned_cols=123  Identities=14%  Similarity=0.139  Sum_probs=85.7

Q ss_pred             EEEeecCCC-CCccHHHHHHHHhccCCCCCCCCceEEE-EecccCCC----------hHHHHHHHHcCCeeeeCCCCCCC
Q psy16052        181 ITITFDDAI-NNNNIALYKEMFNGKRKNPNGCDIKATF-FVSHKYTN----------YSAVQELHRKGHEVAVHSITHND  248 (476)
Q Consensus       181 V~lTFDdg~-~~~~~~~~l~ll~~~~~np~~c~i~aTF-Fv~~~~~~----------~~~v~~l~~~GHEIg~Hs~~H~~  248 (476)
                      |.|-.|-.+ +..+...+.+++++ .     +....|+ +|..++-+          .+.+.+..++|+|++.|||.|.+
T Consensus         6 iillhdVSpv~~~~~~~i~~~ide-~-----~~~~~t~lLViPn~~~~~~l~~d~rf~~~l~~r~e~Gdel~lHGy~h~d   79 (233)
T COG3233           6 IILLHDVSPVYWPTLSNIDAAIDE-Y-----GAQNSTVLLVIPNHANDYPLSKDPRFVDLLTEREEEGDELVLHGYDHID   79 (233)
T ss_pred             eEEEEecCcccchhHHHHHHHHHH-h-----CCCCceEEEEeeccCCCCCcccChHHHHHHHHHHhcCCEEEEechhhcc
Confidence            444444433 33344455666665 1     3455666 77654321          27788889999999999999987


Q ss_pred             Cc---cccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCeEeeeccc
Q psy16052        249 DE---NFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFLYDSTITA  315 (476)
Q Consensus       249 ~~---~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~Ydss~~~  315 (476)
                      ..   .| .-+...+.+.-+..+.+.+++    .|-..+||-+|.+.++.. ++++|-+++|+|-++..+
T Consensus        80 ~~~~gEF-~~l~~~eA~~RL~~a~~~l~~----~G~~~~~FvpPrwlmS~g-t~~aL~~~~frv~a~l~~  143 (233)
T COG3233          80 TKRRGEF-ACLRAHEARLRLMAAIEELEA----LGFPLRGFVPPRWLMSEG-TRQALLENNFRVTADLRG  143 (233)
T ss_pred             ccCcccc-ccchHHHHHHHHHHHHHHHHH----cCCcceeecCcceecCHh-HHHHhccCCcEEEecccc
Confidence            21   11 334456666677777777873    355699999999999987 999999999999988863


No 18 
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=97.12  E-value=0.00044  Score=50.36  Aligned_cols=49  Identities=33%  Similarity=0.717  Sum_probs=42.7

Q ss_pred             ccCCCCCCceeeccCCCCCCCCCccccCCCCcccccCCCCcccccccccccCCc
Q psy16052         25 LCKDKDAGEWFRLVAGEGDNCRDVIQCTSSGLQAIRCPAGLYFDIEKQTCDWKG   78 (476)
Q Consensus        25 ~C~~~~~~~~~~~~~~~~~~C~~~~~C~~g~~~~~~C~~g~~~~~~~~~C~~~~   78 (476)
                      .|..+. +.+|+++.    +|..|+.|..+....+.|+.|+.|+...+.|++..
T Consensus         2 ~C~~~~-~g~~~~~~----~C~~y~~C~~~~~~~~~C~~g~~f~~~~~~C~~~~   50 (56)
T smart00494        2 ECPGRG-DGLYPHPT----DCSKYYQCSNGRPIVGSCPAGLVFDPATQTCDWPD   50 (56)
T ss_pred             cCCCCC-CccccCcc----cCCeeEEcCCCcEEeccCcCCCeECCCCCCcCCcc
Confidence            477754 55998888    89999999999889999999999999999998754


No 19 
>PF11959 DUF3473:  Domain of unknown function (DUF3473);  InterPro: IPR022560  This domain, found in bacteria and archaea, is functionally uncharacterised. It is about 130 amino acids in length and is found C-terminal to PF01522 from PFAM. It contains two completely conserved residues (P and H) that may be functionally important. 
Probab=97.01  E-value=0.0034  Score=55.03  Aligned_cols=82  Identities=26%  Similarity=0.489  Sum_probs=48.7

Q ss_pred             CeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHH-----HHHHHHHhccCCcceEEeecchhhcC--ChH
Q psy16052        348 AVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFL-----NHNFDRHYDQNRAPLGLYFHAAWLKN--NPE  420 (476)
Q Consensus       348 ~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l-----~~~f~~~y~~~r~p~~i~~H~~w~~~--~p~  420 (476)
                      .|||||++.+.     .+...+|           .++..+++++     +..+.+.....+.|+++||| +|...  +|.
T Consensus        29 ~l~E~P~st~~-----~~g~~lP-----------~~GGgyfRl~P~~l~~~~~~~~~~~~~~~~~~YfH-PwE~dp~qpr   91 (133)
T PF11959_consen   29 SLVEFPISTSP-----LFGKRLP-----------LGGGGYFRLFPYWLYRWLIRRINRRGGQPAVFYFH-PWEFDPDQPR   91 (133)
T ss_pred             eEEEECCcccc-----cCCeeee-----------ecCChhHHHccHHHHHHHHHHHHhCCCcceEEEEe-ceecCCCCcc
Confidence            39999998753     2222333           2233344444     33333333334489999999 87762  221


Q ss_pred             --------------HHHHHHHHHHHHHhhCCCEEEEeHHHHHH
Q psy16052        421 --------------FLDAFLYWVDEIIERHKDVYFVTMTQVIQ  449 (476)
Q Consensus       421 --------------~~~al~~fl~~i~~~~~dV~fvT~~qi~~  449 (476)
                                    .+..+++=|+.++   .+.-|+|+.|+++
T Consensus        92 ~~~~~~~~rf~~y~~l~~~~~rl~~Ll---~~f~f~t~~~~~~  131 (133)
T PF11959_consen   92 VPGAPLKSRFRHYNNLDRMEKRLDRLL---SDFRFGTMREVLA  131 (133)
T ss_pred             cCCCCcceeEEEEcCHHHHHHHHHHHH---hhCceEEHHHHhh
Confidence                          1355555566654   4889999999975


No 20 
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=96.45  E-value=0.0029  Score=57.56  Aligned_cols=49  Identities=29%  Similarity=0.498  Sum_probs=40.0

Q ss_pred             cccccccccccccccCCCCCCCCCCCCCCceecCCC----CccCCCcccCCCCC---CCCCCCC
Q psy16052         78 GAVKNCKLKNKERKVKPLLYTDEPLCQDGFLACGDG----ACIERGLFCNGEKD---CNDGSDE  134 (476)
Q Consensus        78 ~~~~~C~~~~de~~c~~~~~~~~~~C~~~~~~C~~g----~Ci~~~~~C~g~~d---C~d~sDe  134 (476)
                      |+.=||.|+|||..=        ..|..+.|.|.|.    .-|+.+.+=||.=|   |-|||||
T Consensus        55 DdyCDC~DGSDEPGT--------sAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CCDGSDE  110 (176)
T PF12999_consen   55 DDYCDCPDGSDEPGT--------SACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICCDGSDE  110 (176)
T ss_pred             CcceeCCCCCCcccc--------ccCcCceEeeccCCCCCceeehhhhcCCcCcccccCCCCCC
Confidence            455579999999732        3488889999764    77888999999999   9999999


No 21 
>PF00057 Ldl_recept_a:  Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor;  InterPro: IPR002172  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=95.54  E-value=0.0062  Score=40.92  Aligned_cols=33  Identities=18%  Similarity=0.425  Sum_probs=27.0

Q ss_pred             cccccCCCCcccccccccccCCccccccccccccccc
Q psy16052         56 LQAIRCPAGLYFDIEKQTCDWKGAVKNCKLKNKERKV   92 (476)
Q Consensus        56 ~~~~~C~~g~~~~~~~~~C~~~~~~~~C~~~~de~~c   92 (476)
                      ..+|+|.+|.++.. .++|   +++.||.+++||.+|
T Consensus         5 ~~~f~C~~~~CI~~-~~~C---Dg~~DC~dgsDE~~C   37 (37)
T PF00057_consen    5 PGEFRCGNGQCIPK-SWVC---DGIPDCPDGSDEQNC   37 (37)
T ss_dssp             TTEEEETTSSEEEG-GGTT---SSSCSSSSSTTTSSH
T ss_pred             CCeeEcCCCCEECh-HHcC---CCCCCCCCCcccccC
Confidence            56899999886654 5677   899999999999876


No 22 
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=95.52  E-value=0.0066  Score=40.22  Aligned_cols=32  Identities=19%  Similarity=0.440  Sum_probs=26.2

Q ss_pred             ccccCCCCcccccccccccCCccccccccccccccc
Q psy16052         57 QAIRCPAGLYFDIEKQTCDWKGAVKNCKLKNKERKV   92 (476)
Q Consensus        57 ~~~~C~~g~~~~~~~~~C~~~~~~~~C~~~~de~~c   92 (476)
                      .+|+|.+|.++.. .+.|   +++.||.++|||..|
T Consensus         4 ~~f~C~~~~Ci~~-~~~C---Dg~~DC~dgsDE~~C   35 (35)
T cd00112           4 NEFRCANGRCIPS-SWVC---DGEDDCGDGSDEENC   35 (35)
T ss_pred             CeEEcCCCCeeCH-HHcC---CCccCCCCCcccccC
Confidence            5899999877654 4677   889999999999865


No 23 
>KOG1215|consensus
Probab=94.90  E-value=0.02  Score=65.85  Aligned_cols=74  Identities=27%  Similarity=0.678  Sum_probs=57.2

Q ss_pred             cccccCCCC-cccccccccccCCccccccccccccc--ccCCCCCCCCCCCCCCceecCCCCccCCCcccCCCCCCCCCC
Q psy16052         56 LQAIRCPAG-LYFDIEKQTCDWKGAVKNCKLKNKER--KVKPLLYTDEPLCQDGFLACGDGACIERGLFCNGEKDCNDGS  132 (476)
Q Consensus        56 ~~~~~C~~g-~~~~~~~~~C~~~~~~~~C~~~~de~--~c~~~~~~~~~~C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~s  132 (476)
                      ...++|..+ .+.. ..+.|   ++..+|.+++||.  .|.-      ..|...++.|.++.|+...+.|+|..||.||+
T Consensus       220 ~~~~~c~g~~~~i~-~~~~~---Dg~~dc~~~~de~~~~~~~------~~~~~~e~~~~~~~~~~~~~~~~g~~d~pdg~  289 (877)
T KOG1215|consen  220 IEVTRCDGSSRCIL-ISEVC---DGPRDCVDGPDEGVMNCSD------ATCEAPEIECADGDCSDRQKLCDGDLDCPDGL  289 (877)
T ss_pred             eeEEEecCCCcEEe-ehhcc---CCCcccccCCcCceeEeec------cccCCcceeecCCCCccceEEecCccCCCCcc
Confidence            356677764 3322 23445   8899999999995  3422      24777899999999999999999999999999


Q ss_pred             CCCCCCC
Q psy16052        133 DENTCDS  139 (476)
Q Consensus       133 De~~c~~  139 (476)
                      ||..|..
T Consensus       290 de~~~~~  296 (877)
T KOG1215|consen  290 DEDYCKK  296 (877)
T ss_pred             ccccccc
Confidence            9998874


No 24 
>KOG1215|consensus
Probab=94.87  E-value=0.067  Score=61.58  Aligned_cols=115  Identities=22%  Similarity=0.389  Sum_probs=73.0

Q ss_pred             cccccCC--CCcccccccccccCCcccccccccccccccCCCC---C----------------------CCCCCC-----
Q psy16052         56 LQAIRCP--AGLYFDIEKQTCDWKGAVKNCKLKNKERKVKPLL---Y----------------------TDEPLC-----  103 (476)
Q Consensus        56 ~~~~~C~--~g~~~~~~~~~C~~~~~~~~C~~~~de~~c~~~~---~----------------------~~~~~C-----  103 (476)
                      ...|+|.  ++.|.. ..+.|   ++..+|.+++||..|....   .                      .++..+     
T Consensus       140 ~~~~~c~~~~~~Cip-~~~~c---d~~~~C~dg~de~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~d~~~~~~~~~  215 (877)
T KOG1215|consen  140 LDKFSCRTGSCKCIP-GDWLC---DGEADCPDGSDELNCAVRRCEPRGASLDCIVAIKVCDIQHDCADDYDESEGRIYWT  215 (877)
T ss_pred             CCCCCCcCccccCCC-CceeC---CCCCccccchhhhcccccccCccccccccceeeeecCcccccccccccccCccccc
Confidence            4556666  333322 23445   7777899999998885210   0                      001111     


Q ss_pred             ---CCCceecCC-CCccCCCcccCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCcccCCccccCCCCCCCCCCCCCC
Q psy16052        104 ---QDGFLACGD-GACIERGLFCNGEKDCNDGSDE--NTCDSENDPNRAPPCDPAVCVLPDCFCSEDGTYIPGDLPSK  175 (476)
Q Consensus       104 ---~~~~~~C~~-g~Ci~~~~~C~g~~dC~d~sDe--~~c~~~~~~~~~~~C~~~~C~~p~C~C~~~~~~iP~~l~~~  175 (476)
                         ....++|.. ++||...+.|||+.||.|++||  ..|....+......|..+.|....-.|..... .|.++...
T Consensus       216 ~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc~~~~de~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~g~~d-~pdg~de~  292 (877)
T KOG1215|consen  216 DDSRIEVTRCDGSSRCILISEVCDGPRDCVDGPDEGVMNCSDATCEAPEIECADGDCSDRQKLCDGDLD-CPDGLDED  292 (877)
T ss_pred             CCcceeEEEecCCCcEEeehhccCCCcccccCCcCceeEeeccccCCcceeecCCCCccceEEecCccC-CCCccccc
Confidence               135788866 5999999999999999999999  46666555555677777777765555665322 35555443


No 25 
>PF03065 Glyco_hydro_57:  Glycosyl hydrolase family 57;  InterPro: IPR004300 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 57 GH57 from CAZY comprises enzymes with two known activities; alpha-amylase (3.2.1.1 from EC) and 4-alpha-glucanotransferase (2.4.1 from EC).; GO: 0003824 catalytic activity, 0005975 carbohydrate metabolic process; PDB: 1K1X_B 1K1W_A 1K1Y_A 3N98_A 3N8T_A 3N92_A 1UFA_A 3P0B_A.
Probab=94.77  E-value=0.071  Score=54.81  Aligned_cols=196  Identities=15%  Similarity=0.165  Sum_probs=106.9

Q ss_pred             HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCH-HHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHH
Q psy16052        227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATV-DDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEE  304 (476)
Q Consensus       227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~-e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~  304 (476)
                      +.++++.++|+ ||-.-+++|...+.    ++. ++.+.+|+...+.++   .+.|..|+||..|-..++.. ..++|++
T Consensus       114 ~~~~~l~~~G~iEll~~~~~h~ilpl----~~~~~d~~~Qi~~~~~~~~---~~FG~~P~G~W~pE~a~~~~-l~~~l~~  185 (360)
T PF03065_consen  114 EAFRELAERGQIELLTSPYYHPILPL----LPDPEDFRAQIEMGREYFK---KHFGRRPRGFWLPECAYSPG-LEEILAE  185 (360)
T ss_dssp             HHHHHHHHTTSEEEEEE-TT-B-GGG----SSHHHHHHHHHHHHHHHHH---HHHSS--SBEE-GGG-B-TT-HHHHHHH
T ss_pred             HHHHHHHHCCCEEEEeCchHHHHHHH----hCCHHHHHHHHHHHHHHHH---HHhCCCCCceECcccccCHH-HHHHHHH
Confidence            67889999998 99999999998753    566 999999999999988   58899999999999999987 8999999


Q ss_pred             cCCeEee---ecccCCCCCCCCCccccccCCCCcCCCCcCCCCCC-CCeEEEeccccccCCCCCCCC-CCCCCcccCCCC
Q psy16052        305 QAFLYDS---TITAPLSNPPLWPYTMYFRMPHRCHGNLQHCPTRS-HAVWEMVMNELDRREDPNFDE-YLPGCAMVDSCS  379 (476)
Q Consensus       305 ~G~~Yds---s~~~~~~~~p~~pytl~~~~~~~c~~~~~~cp~~~-~~lweiP~~~~~~~~~p~~~~-~~~g~~~~d~~~  379 (476)
                      .|+.|--   +........|..+|..+.            .+... .+|..++.+.-... .-.|.. ..||        
T Consensus       186 ~Gi~~~i~d~~~~~~~~~~~~~~y~py~------------~~~~~~~~l~vf~rd~~ls~-~I~f~~~g~~~--------  244 (360)
T PF03065_consen  186 AGIRYTILDGHQLLGAGSPPNNPYRPYR------------VEDQGGSGLAVFFRDQELSD-DIWFRYSGYPG--------  244 (360)
T ss_dssp             TT--EEEEECHHHHHHT--CGGCSSEEE------------ECE-TTCSEEEEEB-HHHHH-HHHTTTTSGGG--------
T ss_pred             cCCEEEEECcHHhCCCCCccCCCCceeE------------eecCCCceEEEEecCHHHHH-hcCCccccccc--------
Confidence            9998732   222111123344444331            11111 23777765531110 000111 0111        


Q ss_pred             CCCChHHHHHHHHHHHHHHhccCCcce-EEeecchhh-----cCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhhC
Q psy16052        380 NILTGDQFYNFLNHNFDRHYDQNRAPL-GLYFHAAWL-----KNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQN  453 (476)
Q Consensus       380 ~~~~~~~~~~~l~~~f~~~y~~~r~p~-~i~~H~~w~-----~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~~  453 (476)
                       ..+.+++++.+...+..........+ ++.+-..-+     ......++.|.+.+..+.+ .+.|-++|.+|+++-+..
T Consensus       245 -~~~~~~~~~~l~~~~~~~~~~~~~~~v~i~~DgE~~~~~~~~~~~~~l~~l~~~~~~~~~-~~~i~~~t~se~l~~~~~  322 (360)
T PF03065_consen  245 -DPAYREFYRYLLYEYASWISYDPGPVVVIALDGENFWGHYWDEGEGFLEFLYQLLEELHS-RGFIETVTPSEYLERHPP  322 (360)
T ss_dssp             -STTSBHSSCHHC----HC-BS-TT-EEEECCEGGGCCCCCCCCTCEHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHST
T ss_pred             -ccCHHHHHHhhhhhhhhhhccCCCCeEEEeECCcccCCcCCcccCCHHHHHHHHHHHhhc-CCeEEecCHHHHHhhCCC
Confidence             12345566555411333333333333 343322111     1234577778877777765 445899999999986633


No 26 
>COG1449 Alpha-amylase/alpha-mannosidase [Carbohydrate transport and metabolism]
Probab=94.56  E-value=0.6  Score=51.34  Aligned_cols=79  Identities=18%  Similarity=0.197  Sum_probs=65.9

Q ss_pred             HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHc
Q psy16052        227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQ  305 (476)
Q Consensus       227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~  305 (476)
                      ...|++.+.|+ |+-.+++.|.-.+-+..+--.+++.++|...+++..   .+.|..++|||+|.+.++.+ ..+++++.
T Consensus       183 ~~~r~l~e~g~~e~~~~pyyHsl~pl~~d~gw~~d~~~qv~~~~~~~~---elfG~~p~~~~~tEl~y~~~-i~~~~~e~  258 (615)
T COG1449         183 EAFRELAESGKVELTASPYYHSLIPLLADDGWYEDFKEQVMMSRELYK---ELFGVWPSGFWNTELAYNDQ-ILEYFEES  258 (615)
T ss_pred             HHHHHHHhcCceEEEecccccccchhcccCCchHHHHHHHHHHHHHHH---HHhCCCCccccChhhhccHH-HHHHHHHc
Confidence            56788888887 999999999976544332235688899999988888   47888999999999999987 99999999


Q ss_pred             CCeE
Q psy16052        306 AFLY  309 (476)
Q Consensus       306 G~~Y  309 (476)
                      ||.|
T Consensus       259 G~~~  262 (615)
T COG1449         259 GFSW  262 (615)
T ss_pred             CCcc
Confidence            9986


No 27 
>PF04748 Polysacc_deac_2:  Divergent polysaccharide deacetylase;  InterPro: IPR006837 This is a family of uncharacterised proteins that includes YibQ.; PDB: 2QV5_A 2NLY_A.
Probab=93.84  E-value=2.8  Score=39.78  Aligned_cols=182  Identities=15%  Similarity=0.191  Sum_probs=88.5

Q ss_pred             CCceEEEEecccCCCh-HHHHHHHHcCCeeeeCCC------CCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcc
Q psy16052        211 CDIKATFFVSHKYTNY-SAVQELHRKGHEVAVHSI------THNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSV  283 (476)
Q Consensus       211 c~i~aTFFv~~~~~~~-~~v~~l~~~GHEIg~Hs~------~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~  283 (476)
                      -+.+.||-|....+.- +.++..+++||||-.|--      .+.........++.++.+.-+..+   |.   .+++  .
T Consensus        19 Lp~pvT~ai~P~~~~~~~~a~~a~~~G~EvllhlPMep~~~~~~gp~~L~~~~~~~~i~~~l~~a---l~---~vp~--a   90 (213)
T PF04748_consen   19 LPFPVTFAILPYAPYSREWAERARAAGHEVLLHLPMEPKGYKDPGPGALLTGMSEEEIRKRLEAA---LA---RVPG--A   90 (213)
T ss_dssp             SSTTCEEEEETTSTTHHHHHHHHHHCT-EEEEEEEE--TTTT---TT-B-TTS-HHHHHHHHHHH---HC---CSTT---
T ss_pred             CCCCeEEEECCCCCChHHHHHHHHHcCCEEEEeCCCCCCCCCCcccccccCCCCHHHHHHHHHHH---HH---HCCC--c
Confidence            5788899888765543 566777899999999852      111111223356777666665543   22   3443  3


Q ss_pred             cEEecCCc-ccCch-----HHHHHHHHcCCeEeeecccCCCCCCCCCccccccCCCCcCCCCcCCCCCCCCeEEEecccc
Q psy16052        284 VGVRAPYL-RVGGN-----NQFTMMEEQAFLYDSTITAPLSNPPLWPYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNEL  357 (476)
Q Consensus       284 ~gfR~P~l-~~~~~-----~~~~~l~~~G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~  357 (476)
                      +|....-+ ....+     ..++.|++.|+.|=-|...+        .+.-....            +..++--.=-..+
T Consensus        91 ~GvnNhmGS~~T~~~~~m~~vl~~l~~~gl~FvDS~T~~--------~s~a~~~A------------~~~gvp~~~rdvf  150 (213)
T PF04748_consen   91 VGVNNHMGSRFTSDREAMRWVLEVLKERGLFFVDSRTTP--------RSVAPQVA------------KELGVPAARRDVF  150 (213)
T ss_dssp             SEEEEEE-CCHHC-HHHHHHHHHHHHHTT-EEEE-S--T--------T-SHHHHH------------HHCT--EEE-SEE
T ss_pred             EEEecCCCccccCCHHHHHHHHHHHHHcCCEEEeCCCCc--------ccHHHHHH------------HHcCCCEEeecee
Confidence            44433222 11111     26778889998664443321        11110000            0111111101111


Q ss_pred             ccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCC
Q psy16052        358 DRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHK  437 (476)
Q Consensus       358 ~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~  437 (476)
                      .                 |   +..+.+.+.+.|.......... + ..+.--| +    +|.-+++|++++..+.+  .
T Consensus       151 L-----------------D---~~~~~~~I~~ql~~~~~~A~~~-G-~aI~Igh-~----~p~Tl~~L~~~~~~l~~--~  201 (213)
T PF04748_consen  151 L-----------------D---NDQDEAAIRRQLDQAARIARKQ-G-SAIAIGH-P----RPETLEALEEWLPELEA--Q  201 (213)
T ss_dssp             T-----------------T---ST-SHHHHHHHHHHHHHHHHCC-S-EEEEEEE------SCCHHHHHHHHHHHHHH--C
T ss_pred             c-----------------C---CCCCHHHHHHHHHHHHHhhhhc-C-cEEEEEc-C----CHHHHHHHHHHHhHHhh--C
Confidence            1                 1   1134566666665555444332 2 2333446 4    26789999999999975  8


Q ss_pred             CEEEEeHHHHHH
Q psy16052        438 DVYFVTMTQVIQ  449 (476)
Q Consensus       438 dV~fvT~~qi~~  449 (476)
                      +|-+|+.+++++
T Consensus       202 gi~lVpvS~llr  213 (213)
T PF04748_consen  202 GIELVPVSALLR  213 (213)
T ss_dssp             TEEE--HHHH--
T ss_pred             CEEEEEhHHhhC
Confidence            999999999874


No 28 
>PF15421 Polysacc_deac_3:  Putative polysaccharide deacetylase; PDB: 4DWE_A.
Probab=92.47  E-value=5.1  Score=40.91  Aligned_cols=167  Identities=16%  Similarity=0.213  Sum_probs=73.9

Q ss_pred             hHHHHHHHHcCCeeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHc
Q psy16052        226 YSAVQELHRKGHEVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQ  305 (476)
Q Consensus       226 ~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~  305 (476)
                      |+.+++|+.-|..|+.|...+...     ..+.++.+.+..-+..+|.+  .+.|...+-+..|-+    +..+- .+.+
T Consensus       132 W~evkem~nyG~~ia~Hdv~~~~~-----~~t~~~~~~~~~i~~~~i~~--~l~GR~~K~l~ePnG----n~~Yi-~aa~  199 (425)
T PF15421_consen  132 WDEVKEMLNYGTGIAFHDVNTYDV-----DTTDDQILAHFCIAQNSIFK--NLSGRGCKMLAEPNG----NKKYI-TAAL  199 (425)
T ss_dssp             HHHHHHHHTTT-EEEB---SS-GG-----G-SHHHHHHHHHHHHHHHHH--HTTT----EEE--TT-----HHHH-HHHT
T ss_pred             HHHHHHHHhcCcceEecccccccc-----cccchhhhhhhhhHHHHHHH--HhcCcccceeeCCCC----CHHHH-HHHH
Confidence            589999999999999999988774     34556666665555555654  677877776666643    32222 1222


Q ss_pred             CCeEeeecccCCCCCCCCCccccccCCCCcCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChH
Q psy16052        306 AFLYDSTITAPLSNPPLWPYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGD  385 (476)
Q Consensus       306 G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~  385 (476)
                      ++.---++..-.+..+.+|+.....+.               . ..++-.         |.         +    ....+
T Consensus       200 ~~d~Iqtitaq~G~~kv~pf~~~~dL~---------------k-~~~~R~---------F~---------~----~~~~~  241 (425)
T PF15421_consen  200 VYDPIQTITAQSGAIKVYPFKVNGDLE---------------K-VVIERG---------FY---------D----SPDND  241 (425)
T ss_dssp             TSTT--EEEESSS-B---TTT--S--T---------------T-EEEE-B---------------------------THH
T ss_pred             cCCchhhhHhhcCCcceeccccCCchh---------------h-heeeee---------ec---------C----CcchH
Confidence            221111233222344677776443220               0 111111         10         0    11234


Q ss_pred             HHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhh-----CCCEEEEeHHHHHHHh
Q psy16052        386 QFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIER-----HKDVYFVTMTQVIQWI  451 (476)
Q Consensus       386 ~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~-----~~dV~fvT~~qi~~w~  451 (476)
                      ++.+.+.......- .+|.-..|..|..     ...+..   ||.||...     .+.|||.+.+|+-++.
T Consensus       242 ~~k~~I~~~l~~p~-e~r~ai~iG~h~t-----d~~W~~---fl~wLn~~YG~~G~D~lWfp~~eEyyEY~  303 (425)
T PF15421_consen  242 QYKQDIEDQLKNPV-EEREAINIGVHGT-----DSNWED---FLLWLNDTYGKDGDDSLWFPSQEEYYEYN  303 (425)
T ss_dssp             HHHHHHHHHHTS-G-GG---EEEEESS-------HHHHH---HHHHHHHHTSTTTT--EEEE-HHHHHHHH
T ss_pred             HHHHHHHHHhccch-hchhheeecccCC-----CCcHHH---HHHHHHHHhCcCCCcccccCCHHHHHHHH
Confidence            56555555444322 3466678999942     233434   66666433     4669999999999843


No 29 
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. The N-terminal  type A repeats in LDL receptor bind the lipoproteins. Other homologous  domains occur in related receptors, including the very low-density  lipoprotein receptor and the LDL receptor-related protein/alpha  2-macroglobulin receptor, and in proteins which are functionally  unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=92.13  E-value=0.08  Score=34.46  Aligned_cols=28  Identities=21%  Similarity=0.495  Sum_probs=22.8

Q ss_pred             cccCCCCcccccccccccCCcccccccccccc
Q psy16052         58 AIRCPAGLYFDIEKQTCDWKGAVKNCKLKNKE   89 (476)
Q Consensus        58 ~~~C~~g~~~~~~~~~C~~~~~~~~C~~~~de   89 (476)
                      +|+|.+|.++. ..++|   +++.+|.+++||
T Consensus         6 ~f~C~~~~Ci~-~~~~C---dg~~dC~dgsDE   33 (33)
T smart00192        6 EFQCDNGRCIP-LSWVC---DGVDDCSDGSDE   33 (33)
T ss_pred             eEECCCCCEEC-chhhC---CCcCcCcCCCCC
Confidence            89999887554 55677   888999999987


No 30 
>COG2861 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=81.38  E-value=53  Score=31.67  Aligned_cols=204  Identities=15%  Similarity=0.183  Sum_probs=103.6

Q ss_pred             CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEe--cccCCChHHHHHHHHcCCeeeeCCCC----CCCCc-
Q psy16052        178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFV--SHKYTNYSAVQELHRKGHEVAVHSIT----HNDDE-  250 (476)
Q Consensus       178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv--~~~~~~~~~v~~l~~~GHEIg~Hs~~----H~~~~-  250 (476)
                      +-.++|-.||-........  ++++-        +...|.-+  .+.+. ....+++++.||||-.|.--    +.... 
T Consensus        28 ~~klaIVIddlG~~~~~~~--~i~~L--------p~eVTlA~~P~~~~~-~~~~~~A~~~G~evlih~PmeP~~~~~~e~   96 (250)
T COG2861          28 GPKLAIVIDDLGLSQTGTQ--AILAL--------PPEVTLAFAPFAPHA-REWAQKARNAGHEVLIHMPMEPFSYPKIEP   96 (250)
T ss_pred             CceEEEEECCccccHHHHH--HHHhC--------CccceEEecCCCchh-HHHHHHHHhcCCEEEEeccCCcccCCCCCC
Confidence            4468888888665555443  44442        44444444  44432 26788999999999999732    22111 


Q ss_pred             -cccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCC-cccCch--H---HHHHHHHcCCeEeeecccCCCCCCCC
Q psy16052        251 -NFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPY-LRVGGN--N---QFTMMEEQAFLYDSTITAPLSNPPLW  323 (476)
Q Consensus       251 -~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~-l~~~~~--~---~~~~l~~~G~~Ydss~~~~~~~~p~~  323 (476)
                       .....++.++..+.|.++...+.      .  .+|+.--- .+...+  .   ..+.|++.|.-|--|-...       
T Consensus        97 gtL~~~~s~~e~~~rl~~a~~~v~------~--~~GlnNhmGs~~tsn~~aM~~~m~~Lk~r~l~flDs~T~a-------  161 (250)
T COG2861          97 GTLRPGMSAEEILRRLRKAMNKVP------D--AVGLNNHMGSRFTSNEDAMEKLMEALKERGLYFLDSGTIA-------  161 (250)
T ss_pred             CCcccCCCHHHHHHHHHHHHhhCc------c--ceeehhhhhhhhcCcHHHHHHHHHHHHHCCeEEEcccccc-------
Confidence             11124667777777766554443      1  12221110 111112  2   4556667665332222110       


Q ss_pred             CccccccCCCCcCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCC
Q psy16052        324 PYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNR  403 (476)
Q Consensus       324 pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r  403 (476)
                       .++--.+               ..=.+||+-.  ++   .|         +|   +..+...+.+.|.....- -+.|+
T Consensus       162 -~S~a~~i---------------Ak~~gVp~~~--rd---vf---------LD---~e~~~~~V~kql~~~~~~-Ark~G  207 (250)
T COG2861         162 -NSLAGKI---------------AKEIGVPVIK--RD---VF---------LD---DEDTEAAVLKQLDAAEKL-ARKNG  207 (250)
T ss_pred             -cchhhhh---------------HhhcCCceee--ee---ee---------ec---CcCCHHHHHHHHHHHHHH-HHhcC
Confidence             0000000               0011222211  10   01         11   123556666666543332 23345


Q ss_pred             cceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHH
Q psy16052        404 APLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQ  449 (476)
Q Consensus       404 ~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~  449 (476)
                      ...+|. | +    .+.-+.+++++++.+.+  +++-+|.++.|+.
T Consensus       208 ~ai~IG-h-~----~~~Tv~vl~~~~~~l~~--~gIelV~~s~L~~  245 (250)
T COG2861         208 SAIGIG-H-P----HKNTVAVLQQWLDELPA--RGIELVPVSALLG  245 (250)
T ss_pred             ceEEec-C-C----chhHHHHHHHHHHhCCC--CCeEEecHHHhhc
Confidence            444443 4 3    36789999999999863  8999999999975


No 31 
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=77.92  E-value=1.6  Score=39.90  Aligned_cols=31  Identities=39%  Similarity=0.622  Sum_probs=26.1

Q ss_pred             CCceecCCCC-c-cCCCcccCCCCCCCCCCCCC
Q psy16052        105 DGFLACGDGA-C-IERGLFCNGEKDCNDGSDEN  135 (476)
Q Consensus       105 ~~~~~C~~g~-C-i~~~~~C~g~~dC~d~sDe~  135 (476)
                      .+.|.|-+|. = |+...+.|+.=||.|||||-
T Consensus        35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEP   67 (176)
T PF12999_consen   35 NGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEP   67 (176)
T ss_pred             CCceEecCCCCceecHHHccCcceeCCCCCCcc
Confidence            4579997763 3 79999999999999999993


No 32 
>KOG2397|consensus
Probab=75.32  E-value=2.1  Score=44.81  Aligned_cols=49  Identities=29%  Similarity=0.477  Sum_probs=39.1

Q ss_pred             cccccccccccCCCCCCCCCCCCCCceecCC----CCccCCCcccCCCCCCCCCCCCCCCC
Q psy16052         82 NCKLKNKERKVKPLLYTDEPLCQDGFLACGD----GACIERGLFCNGEKDCNDGSDENTCD  138 (476)
Q Consensus        82 ~C~~~~de~~c~~~~~~~~~~C~~~~~~C~~----g~Ci~~~~~C~g~~dC~d~sDe~~c~  138 (476)
                      ||.|++||.  +      ...|+.+.|.|.|    ..=|+..-+=||.-||-|+|||..-.
T Consensus        65 DC~DGsDEP--G------tsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~~Sg  117 (480)
T KOG2397|consen   65 DCLDGSDEP--G------TSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEYLSG  117 (480)
T ss_pred             cCCCCCCCC--c------cccCCCCceeeeecCCCceeeechhccCcccccccCCCCccCC
Confidence            689999984  2      1359999999965    36778888999999999999995443


No 33 
>COG4878 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=68.44  E-value=1.1e+02  Score=30.09  Aligned_cols=74  Identities=16%  Similarity=0.320  Sum_probs=46.7

Q ss_pred             HHHHHcCCeeeeCCCCCCCCccccCCCCHHHHHHH--HHHHHHHHHHhh---cccCCcccEEecCCcccCchHHHHHHHH
Q psy16052        230 QELHRKGHEVAVHSITHNDDENFWSNATVDDWAKE--MAGMRIIAEKYA---NLSDNSVVGVRAPYLRVGGNNQFTMMEE  304 (476)
Q Consensus       230 ~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~e~~~~e--i~~~~~~l~~~~---~i~g~~~~gfR~P~l~~~~~~~~~~l~~  304 (476)
                      +++...|-|+|.|||.|..++.   ...++-|+.|  |.++...|..+.   -++|....-|-+|-........+.+++.
T Consensus        52 ~el~~~gge~gIhgYNh~sl~p---k~~~knw~see~m~ka~~~L~~y~~~ls~~~y~~~vfVppSnil~q~gk~alvk~  128 (309)
T COG4878          52 TELTFDGGEKGIHGYNHKSLQP---KDILKNWESEEIMNKANYTLADYGDILSITGYDTFVFVPPSNILLQKGKFALVKQ  128 (309)
T ss_pred             cEEEecCCceeEeeccCCCCCc---cccccchhhHHHHHHHHHHHHHHHHHHhccccceEEEeCcccccchhHHHHHHHh
Confidence            3456688999999999998531   2233334332  455555555442   2456678888999887777656666665


Q ss_pred             cC
Q psy16052        305 QA  306 (476)
Q Consensus       305 ~G  306 (476)
                      ..
T Consensus       129 ~p  130 (309)
T COG4878         129 AP  130 (309)
T ss_pred             CC
Confidence            43


No 34 
>KOG3509|consensus
Probab=63.46  E-value=13  Score=42.64  Aligned_cols=79  Identities=23%  Similarity=0.435  Sum_probs=56.6

Q ss_pred             CcccccCCCCcccccccccccCCcccccccccccccccCCCCCCCCCCCCCCceecCCC-CccCCCcccCCCCCCCCCCC
Q psy16052         55 GLQAIRCPAGLYFDIEKQTCDWKGAVKNCKLKNKERKVKPLLYTDEPLCQDGFLACGDG-ACIERGLFCNGEKDCNDGSD  133 (476)
Q Consensus        55 ~~~~~~C~~g~~~~~~~~~C~~~~~~~~C~~~~de~~c~~~~~~~~~~C~~~~~~C~~g-~Ci~~~~~C~g~~dC~d~sD  133 (476)
                      ++++++|.++.+ -.....|   +..-.|...+.+..|....  ....|.+.++.|-+- +|-.++..|+|.+||.|+++
T Consensus        33 ~p~~~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~--~~s~~~~~~~~c~~~~~~~~~~~~~~g~~~~~~~~~  106 (964)
T KOG3509|consen   33 SPNEFKCNNPRC-VQPEALL---DADSTCGPNSTPSGCNAKP--SASDCKPTETQCRDRLRCNPQSFQCDGTNDCKDGSD  106 (964)
T ss_pred             CcchhccCCccc-cCchhhh---ccccccCCCCCcCCccccc--cccccCCcccccccchhcCCccccccCCCCCCccch
Confidence            378899988743 2233445   3334567777777665332  234577889999776 89999999999999999999


Q ss_pred             CCCCCC
Q psy16052        134 ENTCDS  139 (476)
Q Consensus       134 e~~c~~  139 (476)
                      |..+..
T Consensus       107 ~~~~~~  112 (964)
T KOG3509|consen  107 EVGCKS  112 (964)
T ss_pred             hccccc
Confidence            987754


No 35 
>PF07315 DUF1462:  Protein of unknown function (DUF1462);  InterPro: IPR009190 There are currently no experimental data for members of this group of bacterial proteins or their homologues. A crystal structure of Q7A6J8 from SWISSPROT revealed a thioredoxin-like fold, its core consisting of three layers alpha/beta/alpha.; PDB: 1XG8_A.
Probab=62.02  E-value=27  Score=28.26  Aligned_cols=53  Identities=26%  Similarity=0.554  Sum_probs=34.5

Q ss_pred             CCCCCCCChHHHHHHHHHHHHHHhccCCcceEE-eecchhhcCChHHHHHHHHHHHHHHh
Q psy16052        376 DSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGL-YFHAAWLKNNPEFLDAFLYWVDEIIE  434 (476)
Q Consensus       376 d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i-~~H~~w~~~~p~~~~al~~fl~~i~~  434 (476)
                      .||.+..|..+++++|..++.|.|-.  .||.+ |+.   +. .|..-+.-++|++.|..
T Consensus        10 ASCVn~PsSkeTyeWL~aal~RKyp~--~~f~~~YiD---i~-~p~~~~~~~~~a~~I~e   63 (93)
T PF07315_consen   10 ASCVNAPSSKETYEWLEAALKRKYPD--QPFEFTYID---IE-NPPENDHDQQFAERILE   63 (93)
T ss_dssp             GGGSSS--HHHHHHHHHHHHHHH-TT--S-EEEEEEE---TT-T----HHHHHHHHHHHT
T ss_pred             hhhcCCCCchhHHHHHHHHHhCcCCC--CceEEEEEe---cC-CCCccHHHHHHHHHHHh
Confidence            57888899999999999999999874  57755 333   22 23333677889998874


No 36 
>COG3603 Uncharacterized conserved protein [Function unknown]
Probab=46.50  E-value=58  Score=27.84  Aligned_cols=67  Identities=22%  Similarity=0.285  Sum_probs=38.2

Q ss_pred             CCCCCCCCCCCCCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCCCh---------HHHHHHHHcCC
Q psy16052        167 YIPGDLPSKEVPQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY---------SAVQELHRKGH  237 (476)
Q Consensus       167 ~iP~~l~~~~~Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~---------~~v~~l~~~GH  237 (476)
                      ++|......  +....|+|+.-..-.-...+..+++..+.|    +| .-|-++.--++|         ..++.|..+||
T Consensus        52 ~vp~~V~~~--~GW~~lk~~gpf~FgltGilasV~~pLsd~----gi-gIFavStydtDhiLVr~~dLekAv~~L~eaGh  124 (128)
T COG3603          52 RVPDVVQIE--KGWSCLKFEGPFDFGLTGILASVSQPLSDN----GI-GIFAVSTYDTDHILVREEDLEKAVKALEEAGH  124 (128)
T ss_pred             cCCcceEec--CCeEEEEEeccccCCcchhhhhhhhhHhhC----Cc-cEEEEEeccCceEEEehhhHHHHHHHHHHcCC
Confidence            455544333  578889998766544444444444432222    22 234444433343         67899999999


Q ss_pred             eee
Q psy16052        238 EVA  240 (476)
Q Consensus       238 EIg  240 (476)
                      ||.
T Consensus       125 ev~  127 (128)
T COG3603         125 EVL  127 (128)
T ss_pred             ccc
Confidence            985


No 37 
>cd02995 PDI_a_PDI_a'_C PDIa family, C-terminal TRX domain (a') subfamily; composed of the C-terminal redox active a' domains of PDI, ERp72, ERp57 (or ERp60) and EFP1. PDI, ERp72 and ERp57 are endoplasmic reticulum (ER)-resident eukaryotic proteins involved in oxidative protein folding. They are oxidases, catalyzing the formation of disulfide bonds of newly synthesized polypeptides in the ER. They also exhibit reductase activity in acting as isomerases to correct any non-native disulfide bonds, as well as chaperone activity to prevent protein aggregation and facilitate the folding of newly synthesized proteins. PDI and ERp57 have the abb'a' domain structure (where a and a' are redox active TRX domains while b and b' are redox inactive TRX-like domains). PDI also contains an acidic region (c domain) after the a' domain that is absent in ERp57. ERp72 has an additional a domain at the N-terminus (a"abb'a' domain structure). ERp57 interacts with the lectin chaperones, calnexin and calreticu
Probab=43.38  E-value=48  Score=26.41  Aligned_cols=51  Identities=16%  Similarity=0.241  Sum_probs=32.7

Q ss_pred             HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHH
Q psy16052        394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMT  445 (476)
Q Consensus       394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~  445 (476)
                      +|+......+.+++|.|+++|-..-......|+++.+.+.. ...+-|+...
T Consensus         9 ~f~~~i~~~~~~~~v~f~~~~C~~C~~~~~~~~~~~~~~~~-~~~~~~~~id   59 (104)
T cd02995           9 NFDEVVLDSDKDVLVEFYAPWCGHCKALAPIYEELAEKLKG-DDNVVIAKMD   59 (104)
T ss_pred             hhHHHHhCCCCcEEEEEECCCCHHHHHHhhHHHHHHHHhcC-CCCEEEEEEe
Confidence            45555555557899999999865333456677777666543 4567776543


No 38 
>PF01074 Glyco_hydro_38:  Glycosyl hydrolases family 38 N-terminal domain;  InterPro: IPR000602 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 38 GH38 from CAZY comprises enzymes with only one known activity; alpha-mannosidase (3.2.1.24 from EC) (3.2.1.114 from EC). Lysosomal alpha-mannosidase is necessary for the catabolism of N-linked carbohydrates released during glycoprotein turnover. The enzyme catalyses the hydrolysis of terminal, non-reducing alpha-D-mannose residues in alpha-D-mannosides, and can cleave all known types of alpha-mannosidic linkages. Defects in the gene cause lysosomal alpha-mannosidosis (AM), a lysosomal storage disease characterised by the accumulation of unbranched oligo-saccharide chains.; GO: 0004559 alpha-mannosidase activity, 0005975 carbohydrate metabolic process; PDB: 2WYI_A 2WYH_A 1O7D_A 3CZN_A 2FYV_A 3D50_A 3EJU_A 3EJS_A 3DX3_A 3BVX_A ....
Probab=42.86  E-value=1.2e+02  Score=29.55  Aligned_cols=80  Identities=9%  Similarity=0.098  Sum_probs=50.1

Q ss_pred             HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEec-CCcccCchHHHHHHHH
Q psy16052        227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRA-PYLRVGGNNQFTMMEE  304 (476)
Q Consensus       227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~-P~l~~~~~~~~~~l~~  304 (476)
                      ..+++++++|- ||..=+|.-.+.    .-.+.|.+.+.|...++.+++  .......++|-+ +|+....  +-++|+.
T Consensus        64 ~~~~~lv~~Gri~~vgg~~~~~D~----~l~~~Eslirql~~G~~~~~~--~fg~~~~~~~~~D~FG~~~~--lP~il~~  135 (275)
T PF01074_consen   64 KRIKKLVKEGRIEIVGGWYVQPDE----NLPSGESLIRQLLYGHKYLRK--EFGVRPKVAWQPDSFGHSAQ--LPQILKQ  135 (275)
T ss_dssp             HHHHHHHHTTSEEESSSBSS-B-S----SSS-HHHHHHHHHHHHHHHHH--HHTG--SEEEESSSSSB-TC--HHHHHHT
T ss_pred             HHHHHHHHhceeEEeCceeeeccc----cCCCHHHHHHHHhhhHHHHHH--hcCCCCCeEEeCCCCCCchh--hHHHHhc
Confidence            56688999995 543333433332    236789999999999997764  222334466666 7777543  7899999


Q ss_pred             cCCeEeeecc
Q psy16052        305 QAFLYDSTIT  314 (476)
Q Consensus       305 ~G~~Ydss~~  314 (476)
                      .|+.|-....
T Consensus       136 ~Gi~~~v~~r  145 (275)
T PF01074_consen  136 FGIKYFVIWR  145 (275)
T ss_dssp             TT-SEEEESS
T ss_pred             cCcceEEEec
Confidence            9998766654


No 39 
>COG4837 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=36.92  E-value=69  Score=26.14  Aligned_cols=53  Identities=28%  Similarity=0.562  Sum_probs=39.6

Q ss_pred             CCCCCCCChHHHHHHHHHHHHHHhccCCcceEE-eecchhhcCChHHHHHHHHHHHHHHh
Q psy16052        376 DSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGL-YFHAAWLKNNPEFLDAFLYWVDEIIE  434 (476)
Q Consensus       376 d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i-~~H~~w~~~~p~~~~al~~fl~~i~~  434 (476)
                      .||.+..|..+++++|...+.|.|-  ..||.+ |+.   +. +|-.-+...+|++.|.+
T Consensus        17 ASCV~aPtsKdt~eWLeaalkRKyp--~~~F~~~YiD---I~-n~~~e~~~~~~aekI~~   70 (106)
T COG4837          17 ASCVNAPTSKDTYEWLEAALKRKYP--NQPFKYTYID---IT-NPPLEDHDLQFAEKIEQ   70 (106)
T ss_pred             HHhcCCCcchhHHHHHHHHHhccCC--CCCcEEEEEE---cC-CCccHHHHHHHHHHHhc
Confidence            5788888999999999999998886  367754 443   22 34466777888888874


No 40 
>COG3536 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=36.52  E-value=22  Score=29.75  Aligned_cols=81  Identities=20%  Similarity=0.229  Sum_probs=45.3

Q ss_pred             CCCCCCCCCCCCCCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccC----------CChHHHHHHHHc
Q psy16052        166 TYIPGDLPSKEVPQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKY----------TNYSAVQELHRK  235 (476)
Q Consensus       166 ~~iP~~l~~~~~Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~----------~~~~~v~~l~~~  235 (476)
                      +.+|..+....--...++||+||-.......++.++-....- -|++..--.-|.|+-          -| -.|+-++..
T Consensus         2 ~~~P~~lrvsk~r~~L~l~f~dg~~~~L~ae~LRv~sPsAEv-qghgp~q~v~v~gkr~V~i~~i~p~Gn-Yavri~FdD   79 (120)
T COG3536           2 TQWPTELRVSKDRKVLSLTFADGAPFRLPAEFLRVLSPSAEV-QGHGPGQRVLVPGKRNVQIRDIEPVGN-YAVRILFDD   79 (120)
T ss_pred             CCCCceeEEeccCcEEEEEeCCCCcccCCHHHHHhcCchHHh-cccCCCCceeeeccccceEEEeEecCc-eEEEEEecC
Confidence            455666666555678999999976544444566665431110 012221111122211          12 246778899


Q ss_pred             CCeeeeCCCCCCC
Q psy16052        236 GHEVAVHSITHND  248 (476)
Q Consensus       236 GHEIg~Hs~~H~~  248 (476)
                      ||+-|.++|+...
T Consensus        80 gHDsGiy~W~YL~   92 (120)
T COG3536          80 GHDSGIYTWAYLH   92 (120)
T ss_pred             CcccCcccHHHHH
Confidence            9999999987543


No 41 
>COG1543 Uncharacterized conserved protein [Function unknown]
Probab=36.39  E-value=89  Score=33.13  Aligned_cols=76  Identities=16%  Similarity=0.130  Sum_probs=59.6

Q ss_pred             HHHHHHHcC-CeeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcC
Q psy16052        228 AVQELHRKG-HEVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQA  306 (476)
Q Consensus       228 ~v~~l~~~G-HEIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G  306 (476)
                      ..+.+.+.| -||-.-+.+|-.++-+  +. .+.+.++|..+.+..++   ..|..|.||-.|-..+... .-.+|++.|
T Consensus       128 ~f~~~~~~g~ieilts~athg~lPll--~~-peAi~AQi~~g~~~ye~---~fg~~P~GiWlPEcay~pg-ie~~l~~~G  200 (504)
T COG1543         128 AFKELQRSGGIEILTSAATHGYLPLL--GG-PEAIEAQILTGIELYEE---HFGLAPKGIWLPECAYAPG-IERILKDAG  200 (504)
T ss_pred             HHHHHHHcCCceeeeehhhheehhhc--CC-chhhHHHHHHHHHHHHH---HhCCCCCceechhhccccc-hHHHHHhcC
Confidence            456666666 5998888899877642  23 78888999988888874   5667799999998888875 778999999


Q ss_pred             CeEe
Q psy16052        307 FLYD  310 (476)
Q Consensus       307 ~~Yd  310 (476)
                      +.|-
T Consensus       201 i~yf  204 (504)
T COG1543         201 IEYF  204 (504)
T ss_pred             ceEE
Confidence            9883


No 42 
>PRK05406 LamB/YcsF family protein; Provisional
Probab=35.60  E-value=2.9e+02  Score=26.81  Aligned_cols=82  Identities=15%  Similarity=0.116  Sum_probs=53.6

Q ss_pred             HHHHHHHHhccCCCCCCCCceEEEEecccCCCh----HHHHHHHHcCCeeeeCCCCCCCCcccc---CCCCHHHHHHHHH
Q psy16052        194 IALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY----SAVQELHRKGHEVAVHSITHNDDENFW---SNATVDDWAKEMA  266 (476)
Q Consensus       194 ~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~----~~v~~l~~~GHEIg~Hs~~H~~~~~~~---~~~s~e~~~~ei~  266 (476)
                      ...++.++...  | --|+.+|        .++    ..|+...+.|-.||.|- +-.+...|-   -.++.+++++.+.
T Consensus        22 D~~lmp~IssA--N-IACG~HA--------GDp~~M~~tv~lA~~~gV~IGAHP-gypD~~gFGRR~m~~s~~el~~~v~   89 (246)
T PRK05406         22 DEALLPLVTSA--N-IACGFHA--------GDPAVMRRTVRLAKENGVAIGAHP-GYPDLEGFGRRNMDLSPEELYALVL   89 (246)
T ss_pred             HHHHHHHhhhH--H-HhccccC--------CCHHHHHHHHHHHHHcCCeEccCC-CCCccCCCCCCCCCCCHHHHHHHHH
Confidence            45677777660  0 0144444        444    56677788999999985 222222221   1589999999999


Q ss_pred             HHHHHHHHhhcccCCcccEEe
Q psy16052        267 GMRIIAEKYANLSDNSVVGVR  287 (476)
Q Consensus       267 ~~~~~l~~~~~i~g~~~~gfR  287 (476)
                      .+...|+.++...|.++....
T Consensus        90 yQigAL~~~a~~~g~~l~hVK  110 (246)
T PRK05406         90 YQIGALQAIARAAGGRVSHVK  110 (246)
T ss_pred             HHHHHHHHHHHHcCCeeEEeC
Confidence            999999988777776555554


No 43 
>PTZ00062 glutaredoxin; Provisional
Probab=33.30  E-value=1.1e+02  Score=28.74  Aligned_cols=41  Identities=15%  Similarity=0.261  Sum_probs=28.1

Q ss_pred             cCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHH
Q psy16052        401 QNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMT  445 (476)
Q Consensus       401 ~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~  445 (476)
                      ++....+++||++|-.  +  -+.+..+|..+.+.+++|-|+...
T Consensus        15 ~~~g~~vl~f~a~w~~--~--C~~m~~vl~~l~~~~~~~~F~~V~   55 (204)
T PTZ00062         15 SNTGKLVLYVKSSKEP--E--YEQLMDVCNALVEDFPSLEFYVVN   55 (204)
T ss_pred             cCCCcEEEEEeCCCCc--c--hHHHHHHHHHHHHHCCCcEEEEEc
Confidence            3446689999999964  2  335555666665557888888764


No 44 
>PRK12569 hypothetical protein; Provisional
Probab=33.14  E-value=3.2e+02  Score=26.55  Aligned_cols=83  Identities=13%  Similarity=0.109  Sum_probs=54.1

Q ss_pred             HHHHHHHHhccCCCCCCCCceEEEEecccCCCh----HHHHHHHHcCCeeeeCCCCCCCCcccc---CCCCHHHHHHHHH
Q psy16052        194 IALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY----SAVQELHRKGHEVAVHSITHNDDENFW---SNATVDDWAKEMA  266 (476)
Q Consensus       194 ~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~----~~v~~l~~~GHEIg~Hs~~H~~~~~~~---~~~s~e~~~~ei~  266 (476)
                      .+.++.++...  | --|+.+|        .++    ..|+...+.|-.||.|- +-.+...|-   -.++.++.++.+.
T Consensus        25 D~~lmp~Itsa--N-IACG~HA--------GDp~~M~~tv~lA~~~~V~IGAHP-syPD~~gFGRr~m~~s~~el~~~v~   92 (245)
T PRK12569         25 DEALMPLISSA--N-IATGFHA--------GDPNIMRRTVELAKAHGVGIGAHP-GFRDLVGFGRRHINASPQELVNDVL   92 (245)
T ss_pred             HHHHHHHhhhH--H-HhccccC--------CCHHHHHHHHHHHHHcCCEeccCC-CCCcCCCCCCCCCCCCHHHHHHHHH
Confidence            35677777660  0 0144444        444    56677788999999986 223322221   1589999999999


Q ss_pred             HHHHHHHHhhcccCCcccEEec
Q psy16052        267 GMRIIAEKYANLSDNSVVGVRA  288 (476)
Q Consensus       267 ~~~~~l~~~~~i~g~~~~gfR~  288 (476)
                      .+...|+.++...|.++....+
T Consensus        93 yQigaL~~~~~~~g~~l~hVKP  114 (245)
T PRK12569         93 YQLGALREFARAHGVRLQHVKP  114 (245)
T ss_pred             HHHHHHHHHHHHcCCeeEEecC
Confidence            9999999887777766555543


No 45 
>cd02992 PDI_a_QSOX PDIa family, Quiescin-sulfhydryl oxidase (QSOX) subfamily; QSOX is a eukaryotic protein containing an N-terminal redox active TRX domain, similar to that of PDI, and a small C-terminal flavin adenine dinucleotide (FAD)-binding domain homologous to the yeast ERV1p protein. QSOX oxidizes thiol groups to disulfides like PDI, however, unlike PDI, this oxidation is accompanied by the reduction of oxygen to hydrogen peroxide. QSOX is localized in high concentrations in cells with heavy secretory load and prefers peptides and proteins as substrates, not monothiols like glutathione. Inside the cell, QSOX is found in the endoplasmic reticulum and Golgi. The flow of reducing equivalents in a QSOX-catalyzed reaction goes from the dithiol substrate - dithiol of the QSOX TRX domain - dithiols of the QSOX ERV1p domain - FAD - oxygen.
Probab=31.94  E-value=1e+02  Score=25.68  Aligned_cols=50  Identities=12%  Similarity=0.213  Sum_probs=30.6

Q ss_pred             HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCC-EEEEeH
Q psy16052        394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKD-VYFVTM  444 (476)
Q Consensus       394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~d-V~fvT~  444 (476)
                      +|......++.|+++.||++|-..-......|+++...+.. .++ |.|+..
T Consensus        10 ~f~~~i~~~~~~vvV~f~a~wC~~C~~~~~~~~~la~~~~~-~~~~v~~~~v   60 (114)
T cd02992          10 SFNSALLGSPSAWLVEFYASWCGHCRAFAPTWKKLARDLRK-WRPVVRVAAV   60 (114)
T ss_pred             hHHHHHhcCCCeEEEEEECCCCHHHHHHhHHHHHHHHHHHh-cCCceEEEEE
Confidence            34444455567899999999854223355667776666544 333 666543


No 46 
>COG1540 Uncharacterized proteins, homologs of lactam utilization protein B [General function prediction only]
Probab=31.59  E-value=4.9e+02  Score=25.19  Aligned_cols=83  Identities=13%  Similarity=0.117  Sum_probs=55.6

Q ss_pred             HHHHHHHHhccCCCCCCCCceEEEEecccCCCh----HHHHHHHHcCCeeeeCCCCCCCCcccc---CCCCHHHHHHHHH
Q psy16052        194 IALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY----SAVQELHRKGHEVAVHSITHNDDENFW---SNATVDDWAKEMA  266 (476)
Q Consensus       194 ~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~----~~v~~l~~~GHEIg~Hs~~H~~~~~~~---~~~s~e~~~~ei~  266 (476)
                      .+.++.++...  | --|+.+|        .++    ..|+...+.|--||.|- .-.+...|-   -.++.++....+.
T Consensus        22 De~~l~lvsSA--N-IACGfHA--------GDp~~M~rtV~lA~e~gV~IGAHP-gyPDl~gFGRr~m~~~~~e~~a~~l   89 (252)
T COG1540          22 DEALLPLVSSA--N-IACGFHA--------GDPLTMRRTVRLAKENGVAIGAHP-GYPDLVGFGRREMALSPEELYAQVL   89 (252)
T ss_pred             cHHHHHHHhhh--h-HhhcccC--------CCHHHHHHHHHHHHHcCCeeccCC-CCccccccCccccCCCHHHHHHHHH
Confidence            35677777651  1 1255555        333    56777888999999885 222222221   1578999999999


Q ss_pred             HHHHHHHHhhcccCCcccEEec
Q psy16052        267 GMRIIAEKYANLSDNSVVGVRA  288 (476)
Q Consensus       267 ~~~~~l~~~~~i~g~~~~gfR~  288 (476)
                      -+.-.|..|+...|..+..+.+
T Consensus        90 YQiGAL~a~~~a~G~~~~hVKp  111 (252)
T COG1540          90 YQIGALQAFARAQGGVVQHVKP  111 (252)
T ss_pred             HHHHHHHHHHHhcCCeEEEecc
Confidence            9999999888888877766654


No 47 
>KOG1958|consensus
Probab=31.01  E-value=9.1e+02  Score=28.19  Aligned_cols=44  Identities=16%  Similarity=0.204  Sum_probs=35.0

Q ss_pred             HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHH
Q psy16052        227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEK  274 (476)
Q Consensus       227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~  274 (476)
                      ..++.+..+|- ||..-||.-.+.    .+..+-...+++.....+|+.
T Consensus       229 ~a~k~lv~~GqlEIvtGGWVMpDE----An~Hy~~~i~qliEGh~Wl~~  273 (1129)
T KOG1958|consen  229 NAVKRLVKNGQLEIVTGGWVMPDE----ANSHYFAMIDQLIEGHQWLKN  273 (1129)
T ss_pred             HHHHHHHhcCcEEEEeCccccCcc----cchhHHHHHHHHhhhhhHHhc
Confidence            78999999995 999999986653    345566677888888899985


No 48 
>cd07286 PX_SNX18 The phosphoinositide binding Phox Homology domain of Sorting Nexin 18. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX18, like SNX9, contains an N-terminal Src Homology 3 (SH3) domain, a PX domain, and a C-terminal Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature. The PX-BAR structural unit helps determine specific membrane localization. SNX18 is localized to peripheral endosomal structures, and acts in a trafficki
Probab=28.31  E-value=1.1e+02  Score=26.58  Aligned_cols=74  Identities=12%  Similarity=0.255  Sum_probs=42.0

Q ss_pred             HHHHHHHHHHhccCCcc------eEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhhCC-cchhccc
Q psy16052        389 NFLNHNFDRHYDQNRAP------LGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQNP-RTITEVK  461 (476)
Q Consensus       389 ~~l~~~f~~~y~~~r~p------~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~~p-~~~~~~~  461 (476)
                      .+|...+.+.|-+--.|      .+-.+.+.|+.   .|..+|++||..|+. ++-.+  ...++-.++..+ +...+.+
T Consensus        42 ~~L~~~L~~~~p~~~IPpLP~K~~~g~f~~~FIe---~Rr~~Lq~FL~ria~-hp~L~--~s~~~~~FL~~~~~~~~~~~  115 (127)
T cd07286          42 DWLYARLAEKFPVISVPHIPEKQATGRFEEDFIS---KRRKGLIWWMDHMCS-HPVLA--RCDAFQHFLTCPSTDEKAWK  115 (127)
T ss_pred             HHHHHHHHHHCCCcEeCCCcCCCcCCCCCHHHHH---HHHHHHHHHHHHHHc-Ccccc--cChHHHHHhcCCcchhHHHh
Confidence            45555565555432111      12235555543   589999999999997 77666  335555666655 3444444


Q ss_pred             cCCCccc
Q psy16052        462 NFDPWRE  468 (476)
Q Consensus       462 ~~~~~~~  468 (476)
                      .+..|.+
T Consensus       116 ~~~~~~e  122 (127)
T cd07286         116 QGKRKAE  122 (127)
T ss_pred             hcchHhh
Confidence            4444433


No 49 
>PF12340 DUF3638:  Protein of unknown function (DUF3638);  InterPro: IPR022099  This domain family is found in eukaryotes, and is approximately 230 amino acids in length. There are two conserved sequence motifs: LLE and NMG. 
Probab=28.09  E-value=3.1e+02  Score=26.36  Aligned_cols=61  Identities=18%  Similarity=0.260  Sum_probs=44.7

Q ss_pred             HHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHH
Q psy16052        385 DQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQ  449 (476)
Q Consensus       385 ~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~  449 (476)
                      ++.++.|...|-..+  +|.-+.+.|+ --..-.+..++.+++.++.++. ..+|.++|.+.+..
T Consensus        83 ~q~~~~L~~~lg~l~--~r~i~~lpFs-R~~~~~~~~~~~~~~l~~~~~~-~~gill~~PEhilS  143 (229)
T PF12340_consen   83 EQMRQMLRSRLGGLL--NRRIYHLPFS-RSTPLTPETLEKIRQLLEECMR-SGGILLATPEHILS  143 (229)
T ss_pred             HHHHHHHHHHHHHHh--CCeeEEeccc-CCCCCCHHHHHHHHHHHHHHHH-cCCEEEeChHHHHH
Confidence            566777777776544  4555666666 2233356778899999999987 89999999999964


No 50 
>KOG1225|consensus
Probab=26.96  E-value=1.6e+02  Score=31.93  Aligned_cols=15  Identities=33%  Similarity=0.906  Sum_probs=9.2

Q ss_pred             CCCC-cccCCccccCCC
Q psy16052        149 CDPA-VCVLPDCFCSED  164 (476)
Q Consensus       149 C~~~-~C~~p~C~C~~~  164 (476)
                      |..+ +|+.. |.|.+.
T Consensus       345 C~~~g~cv~g-C~C~~G  360 (525)
T KOG1225|consen  345 CSGGGQCVNG-CKCKKG  360 (525)
T ss_pred             cCCCceeccC-ceeccC
Confidence            5544 56666 777763


No 51 
>TIGR01068 thioredoxin thioredoxin. Several proteins, such as protein disulfide isomerase, have two or more copies of a domain closely related to thioredoxin. This model is designed to recognize authentic thioredoxin, a small protein that should be hit exactly once by this model.
Probab=26.05  E-value=1.4e+02  Score=23.33  Aligned_cols=48  Identities=23%  Similarity=0.301  Sum_probs=26.5

Q ss_pred             HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeH
Q psy16052        394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTM  444 (476)
Q Consensus       394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~  444 (476)
                      +|......++.+++|+||++|-..-......|+++.+.+.   +++.|+..
T Consensus         5 ~~~~~~~~~~~~vvi~f~~~~C~~C~~~~~~l~~~~~~~~---~~~~~~~v   52 (101)
T TIGR01068         5 NFDETIASSDKPVLVDFWAPWCGPCKMIAPILEELAKEYE---GKVKFVKL   52 (101)
T ss_pred             HHHHHHhhcCCcEEEEEECCCCHHHHHhCHHHHHHHHHhc---CCeEEEEE
Confidence            3444444456789999999985422233444555543332   24666554


No 52 
>KOG3509|consensus
Probab=25.61  E-value=78  Score=36.72  Aligned_cols=59  Identities=12%  Similarity=0.451  Sum_probs=46.6

Q ss_pred             cccccccccccccccCCCCCCCCCCCCCCceecCCCCccCCCcccCCCCCCCCCCCCCCCCCC
Q psy16052         78 GAVKNCKLKNKERKVKPLLYTDEPLCQDGFLACGDGACIERGLFCNGEKDCNDGSDENTCDSE  140 (476)
Q Consensus        78 ~~~~~C~~~~de~~c~~~~~~~~~~C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~sDe~~c~~~  140 (476)
                      +...||-+.+|..++...    ...|++.+++|.+++|.-..+.||.+.+|..+|+..+|...
T Consensus        11 ~~~~~~~~~~~~~~~~~~----~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   69 (964)
T KOG3509|consen   11 DRQPDCRDRSDVANDPAI----GSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAK   69 (964)
T ss_pred             ccchhhHhhcccCCCccc----cccCCcchhccCCccccCchhhhccccccCCCCCcCCcccc
Confidence            445566777777665332    24578899999999999999999999999999988888653


No 53 
>PTZ00413 lipoate synthase; Provisional
Probab=24.97  E-value=3.1e+02  Score=28.54  Aligned_cols=70  Identities=13%  Similarity=0.013  Sum_probs=40.6

Q ss_pred             ChHHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhhCCcchhcc
Q psy16052        383 TGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQNPRTITEV  460 (476)
Q Consensus       383 ~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~~p~~~~~~  460 (476)
                      +.++.++.|+.. ...|.. +  +.+--|  .+.|-++..+.+.+.|+.+.+  -+|-++|+.|.++=-+.-.|+.++
T Consensus       276 tYe~sLe~Lr~A-Ke~f~~-g--i~tcSG--iIVGLGET~eEvie~m~dLre--lGVDivtIGQYL~Ps~~h~~V~~y  345 (398)
T PTZ00413        276 SYRQSLKVLEHV-KEFTNG-A--MLTKSS--IMLGLGETEEEVRQTLRDLRT--AGVSAVTLGQYLQPTKTRLKVSRY  345 (398)
T ss_pred             CHHHHHHHHHHH-HHHhcC-C--ceEeee--eEecCCCCHHHHHHHHHHHHH--cCCcEEeeccccCCCcccCCceec
Confidence            556667777543 233321 1  111112  133445667778888888875  699999999987544444455443


No 54 
>TIGR01126 pdi_dom protein disulfide-isomerase domain. This model describes a domain of eukaryotic protein disulfide isomerases, generally found in two copies. The high cutoff for total score reflects the expectation of finding both copies. The domain is similar to thioredoxin but the redox-active disulfide region motif is APWCGHCK.
Probab=24.84  E-value=1.1e+02  Score=24.12  Aligned_cols=49  Identities=16%  Similarity=0.255  Sum_probs=28.2

Q ss_pred             HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeH
Q psy16052        394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTM  444 (476)
Q Consensus       394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~  444 (476)
                      +|+.... .+.+++|.|+++|-..-......|+++.+.+.. .+.|-|+..
T Consensus         5 ~~~~~~~-~~~~~~i~f~~~~C~~c~~~~~~~~~~~~~~~~-~~~~~~~~~   53 (102)
T TIGR01126         5 NFDDIVL-SNKDVLVEFYAPWCGHCKNLAPEYEKLAKELKG-DPDIVLAKV   53 (102)
T ss_pred             hHHHHhc-cCCcEEEEEECCCCHHHHhhChHHHHHHHHhcc-CCceEEEEE
Confidence            4444444 456789999999864322234556666555433 225777643


No 55 
>cd02956 ybbN ybbN protein family; ybbN is a hypothetical protein containing a redox-inactive TRX-like domain. Its gene has been sequenced from several gammaproteobacteria and actinobacteria.
Probab=24.49  E-value=1.2e+02  Score=23.92  Aligned_cols=47  Identities=15%  Similarity=0.218  Sum_probs=27.6

Q ss_pred             HHHHhc-cCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeH
Q psy16052        395 FDRHYD-QNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTM  444 (476)
Q Consensus       395 f~~~y~-~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~  444 (476)
                      |++... ..+.|++|.|+++|-..-......|+++.+...   +.+.|+..
T Consensus         3 f~~~i~~~~~~~vlv~f~a~wC~~C~~~~~~~~~~~~~~~---~~~~~~~v   50 (96)
T cd02956           3 FQQVLQESTQVPVVVDFWAPRSPPSKELLPLLERLAEEYQ---GQFVLAKV   50 (96)
T ss_pred             hHHHHHhcCCCeEEEEEECCCChHHHHHHHHHHHHHHHhC---CcEEEEEE
Confidence            455553 336789999999996533334555666655542   23544443


No 56 
>PF07579 DUF1548:  Domain of Unknown Function (DUF1548);  InterPro: IPR013044 This domain is found in a small number of Chlamydia proteins of unknown function. It occurs together with IPR011436 from INTERPRO.
Probab=24.11  E-value=33  Score=29.85  Aligned_cols=48  Identities=21%  Similarity=0.231  Sum_probs=38.8

Q ss_pred             HHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHH
Q psy16052        385 DQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEI  432 (476)
Q Consensus       385 ~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i  432 (476)
                      ..-|+.+...|...|...+..++-++|-..+.+.|+...++..||=..
T Consensus        47 ~~~Y~~l~~~F~~~Y~~S~~~LI~~i~~q~~~sspe~~~~i~~yll~~   94 (135)
T PF07579_consen   47 HEKYDRLFNQFLNAYRNSGSNLINYILNQILTSSPEQKAAIRNYLLDD   94 (135)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Confidence            445777888999999887888999999888888899888888876443


No 57 
>COG4637 Predicted ATPase [General function prediction only]
Probab=23.50  E-value=1.9e+02  Score=29.30  Aligned_cols=61  Identities=18%  Similarity=0.177  Sum_probs=43.3

Q ss_pred             HHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHH
Q psy16052        385 DQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVI  448 (476)
Q Consensus       385 ~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~  448 (476)
                      +..+++|..  ...+-.-+.|-.|.+..|-.+=.|..+.+|.+++.-... +.-||++|++.=+
T Consensus       273 DGTlRfl~l--~t~Llsp~~p~ll~ldEPE~sLHP~lL~~La~~~~sAak-~sQv~VsTHS~rL  333 (373)
T COG4637         273 DGTLRFLAL--ATLLLSPRPPPLLLLDEPETSLHPDLLPALAELMRSAAK-RSQVIVSTHSPRL  333 (373)
T ss_pred             ccHHHHHHH--HHHHcCCCCCceeEecCcccccCHhHHHHHHHHHHHhhc-cceEEEEeCCHHH
Confidence            456677743  233444566666666656455568899999999999876 8899999998643


No 58 
>cd02957 Phd_like Phosducin (Phd)-like family; composed of Phd and Phd-like proteins (PhLP), characterized as cytosolic regulators of G protein functions. Phd and PhLPs specifically bind G protein betagamma (Gbg)-subunits with high affinity, resulting in the solubilization of Gbg from the plasma membrane and impeding G protein-mediated signal transduction by inhibiting the formation of a functional G protein trimer (G protein alphabetagamma). Phd also inhibits the GTPase activity of G protein alpha. Phd can be phosphorylated by protein kinase A and G protein-coupled receptor kinase 2, leading to its inactivation. Phd was originally isolated from the retina, where it is highly expressed and has been implicated to play an important role in light adaptation. It is also found in the pineal gland, liver, spleen, striated muscle and the brain. The C-terminal domain of Phd adopts a thioredoxin fold, but it does not contain a CXXC motif. Phd interacts with G protein beta mostly through the N-te
Probab=23.43  E-value=2.1e+02  Score=23.50  Aligned_cols=37  Identities=11%  Similarity=0.119  Sum_probs=22.0

Q ss_pred             CcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEe
Q psy16052        403 RAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVT  443 (476)
Q Consensus       403 r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT  443 (476)
                      +.|++++|+++|-.  +  -.++...|+.+...++++.|+.
T Consensus        24 ~~~vvv~F~a~~c~--~--C~~l~~~l~~la~~~~~v~f~~   60 (113)
T cd02957          24 GTRVVVHFYEPGFP--R--CKILDSHLEELAAKYPETKFVK   60 (113)
T ss_pred             CCEEEEEEeCCCCC--c--HHHHHHHHHHHHHHCCCcEEEE
Confidence            36889999999853  3  2234444444443366775554


No 59 
>cd02996 PDI_a_ERp44 PDIa family, endoplasmic reticulum protein 44 (ERp44) subfamily; ERp44 is an ER-resident protein, induced during stress, involved in thiol-mediated ER retention. It contains an N-terminal TRX domain, similar to that of PDIa, with a CXFS motif followed by two redox inactive TRX-like domains, homologous to the b and b' domains of PDI. The CXFS motif in the N-terminal domain allows ERp44 to form stable reversible mixed disulfides with its substrates. Through this activity, ERp44 mediates the ER localization of Ero1alpha, a protein that oxidizes protein disulfide isomerases into their active form. ERp44 also prevents the secretion of unassembled cargo protein with unpaired cysteines. It also modulates the activity of inositol 1,4,5-triphosphate type I receptor (IP3R1), an intracellular channel protein that mediates calcium release from the ER to the cytosol.
Probab=22.05  E-value=1.8e+02  Score=23.54  Aligned_cols=40  Identities=20%  Similarity=0.201  Sum_probs=24.0

Q ss_pred             HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHh
Q psy16052        394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIE  434 (476)
Q Consensus       394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~  434 (476)
                      +|+.+.+. ..+.+|.|+++|-..-......++++.+.+.+
T Consensus        10 ~f~~~i~~-~~~vlv~F~a~wC~~C~~~~p~~~~~a~~~~~   49 (108)
T cd02996          10 NIDDILQS-AELVLVNFYADWCRFSQMLHPIFEEAAAKIKE   49 (108)
T ss_pred             hHHHHHhc-CCEEEEEEECCCCHHHHhhHHHHHHHHHHHhh
Confidence            44444433 34688999999864323345666666666543


No 60 
>cd02954 DIM1 Dim1 family; Dim1 is also referred to as U5 small nuclear ribonucleoprotein particle (snRNP)-specific 15kD protein. It is a component of U5 snRNP, which pre-assembles with U4/U6 snRNPs to form a [U4/U6:U5] tri-snRNP complex required for pre-mRNA splicing. Dim1 interacts with multiple splicing-associated proteins, suggesting that it functions at multiple control points in the splicing of pre-mRNA as part of a large spliceosomal complex involving many protein-protein interactions. U5 snRNP contains seven core proteins (common to all snRNPs) and nine U5-specific proteins, one of which is Dim1. Dim1 adopts a thioredoxin fold but does not contain the redox active CXXC motif. It is essential for G2/M phase transition, as a consequence to its role in pre-mRNA splicing.
Probab=22.03  E-value=2.8e+02  Score=23.52  Aligned_cols=46  Identities=11%  Similarity=0.051  Sum_probs=27.3

Q ss_pred             HHHHHHhcc-CCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCC-EEEE
Q psy16052        393 HNFDRHYDQ-NRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKD-VYFV  442 (476)
Q Consensus       393 ~~f~~~y~~-~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~d-V~fv  442 (476)
                      ++|+..... +..|+++.|+++|-  .|  -.+|..+|+.+....++ |.|+
T Consensus         3 ~~~~~~i~~~~~~~vVV~F~A~WC--gp--Ck~m~P~le~la~~~~~~v~f~   50 (114)
T cd02954           3 WAVDQAILSEEEKVVVIRFGRDWD--PV--CMQMDEVLAKIAEDVSNFAVIY   50 (114)
T ss_pred             HHHHHHHhccCCCEEEEEEECCCC--hh--HHHHHHHHHHHHHHccCceEEE
Confidence            344444432 46689999999995  34  44555566665543454 4444


No 61 
>PLN02701 alpha-mannosidase
Probab=21.58  E-value=2.7e+02  Score=33.14  Aligned_cols=75  Identities=12%  Similarity=0.141  Sum_probs=54.8

Q ss_pred             HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEec-CCcccCchHHHHHHHH
Q psy16052        227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRA-PYLRVGGNNQFTMMEE  304 (476)
Q Consensus       227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~-P~l~~~~~~~~~~l~~  304 (476)
                      +.|++++++|. ||..=+|.-.+.    ....++.+.++|......|..  .+.....+||.. ||+... . +-.+|+.
T Consensus       106 ~~vk~LV~~GrLE~vgGgwvm~DE----a~~~~esiI~Ql~~G~~~l~~--~fG~~P~~~W~iDpFGhs~-~-~P~Ll~~  177 (1050)
T PLN02701        106 EAFTKLVKNGQLEIVGGGWVMNDE----ANSHYFAIIEQITEGNMWLND--TIGVAPKNSWAIDPFGYSS-T-MAYLLRR  177 (1050)
T ss_pred             HHHHHHHHcCCEEEECCceecccc----cccCHHHHHHHHHhhhHHHHh--hcCCCCCcCccCCCCCCCH-H-HHHHHHh
Confidence            78999999995 788888875553    346789999999988888864  222223456665 888743 3 7788999


Q ss_pred             cCCeE
Q psy16052        305 QAFLY  309 (476)
Q Consensus       305 ~G~~Y  309 (476)
                      .||.+
T Consensus       178 ~G~~~  182 (1050)
T PLN02701        178 MGFEN  182 (1050)
T ss_pred             CCCce
Confidence            99854


No 62 
>cd02987 Phd_like_Phd Phosducin (Phd)-like family, Phd subfamily; Phd is a cytosolic regulator of G protein functions. It specifically binds G protein betagamma (Gbg)-subunits with high affinity, resulting in the solubilization of Gbg from the plasma membrane. This impedes the formation of a functional G protein trimer (G protein alphabetagamma), thereby inhibiting G protein-mediated signal transduction. Phd also inhibits the GTPase activity of G protein alpha. Phd can be phosphorylated by protein kinase A and G protein-coupled receptor kinase 2, leading to its inactivation. Phd was originally isolated from the retina, where it is highly expressed and has been implicated to play an important role in light adaptation. It is also found in the pineal gland, liver, spleen, striated muscle and the brain. The C-terminal domain of Phd adopts a thioredoxin fold, but it does not contain a CXXC motif. Phd interacts with G protein beta mostly through the N-terminal helical domain.
Probab=20.56  E-value=2.3e+02  Score=25.77  Aligned_cols=37  Identities=8%  Similarity=-0.052  Sum_probs=27.5

Q ss_pred             cceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeH
Q psy16052        404 APLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTM  444 (476)
Q Consensus       404 ~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~  444 (476)
                      .+++++|+++|..  |  -.+|...|+.|+..+++|-|+..
T Consensus        84 ~~VVV~Fya~wc~--~--Ck~m~~~l~~LA~~~~~vkF~kV  120 (175)
T cd02987          84 TTVVVHIYEPGIP--G--CAALNSSLLCLAAEYPAVKFCKI  120 (175)
T ss_pred             cEEEEEEECCCCc--h--HHHHHHHHHHHHHHCCCeEEEEE
Confidence            4789999999863  3  55677777777665888888765


No 63 
>TIGR01358 DAHP_synth_II 3-deoxy-7-phosphoheptulonate synthase, class II. Homologs scoring between trusted and noise cutoff include proteins involved in antibiotic biosynthesis; one example is active as this enzyme, while another acts on an amino analog.
Probab=20.25  E-value=6e+02  Score=26.88  Aligned_cols=77  Identities=14%  Similarity=0.175  Sum_probs=52.6

Q ss_pred             HHHHHHHhccCCCCCCCCceEEEEec-cc--C--CChHHHHHHHHcCCeeee-----CCCCCCCCccccCCCCHHHHHHH
Q psy16052        195 ALYKEMFNGKRKNPNGCDIKATFFVS-HK--Y--TNYSAVQELHRKGHEVAV-----HSITHNDDENFWSNATVDDWAKE  264 (476)
Q Consensus       195 ~~~l~ll~~~~~np~~c~i~aTFFv~-~~--~--~~~~~v~~l~~~GHEIg~-----Hs~~H~~~~~~~~~~s~e~~~~e  264 (476)
                      +.++++++.  .||..-+=+.|+-+= |.  .  .-+.+|++..+.||.|.-     ||-+......+ .+-..+.+.+|
T Consensus       293 ~~l~~L~~~--LnP~~epGRlTLI~RmGa~kV~~~LP~li~aV~~~G~~VvW~cDPMHGNT~~t~~G~-KTR~f~~Il~E  369 (443)
T TIGR01358       293 DELLRLIER--LNPENEPGRLTLISRMGADKIADKLPPLLRAVKAAGRRVVWVCDPMHGNTEEAASGY-KTRRFDDIRSE  369 (443)
T ss_pred             HHHHHHHHH--hCCCCCCceEEEEeccCchHHHHhHHHHHHHHHHcCCceEEeecCCCCCceeCCCCc-cCCcHHHHHHH
Confidence            456666653  567655556665552 21  1  123899999999999975     88777654433 45678899999


Q ss_pred             HHHHHHHHHH
Q psy16052        265 MAGMRIIAEK  274 (476)
Q Consensus       265 i~~~~~~l~~  274 (476)
                      +.+..++.++
T Consensus       370 v~~ff~vh~~  379 (443)
T TIGR01358       370 VKGFFEVHRA  379 (443)
T ss_pred             HHHHHHHHHH
Confidence            9988888774


Done!