Query         psy16054
Match_columns 563
No_of_seqs    512 out of 3123
Neff          7.3 
Searched_HMMs 46136
Date          Fri Aug 16 19:40:53 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy16054.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16054hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 TIGR03212 uraD_N-term-dom puta 100.0 1.3E-35 2.9E-40  303.5  20.7  223  221-493    66-294 (297)
  2 PRK15394 4-deoxy-4-formamido-L 100.0 2.9E-27 6.3E-32  241.2  21.7  216  231-499    17-278 (296)
  3 TIGR03006 pepcterm_polyde poly  99.9 2.7E-24 5.9E-29  217.0  21.0  211  233-487    29-264 (265)
  4 TIGR02764 spore_ybaN_pdaB poly  99.9 3.6E-23 7.7E-28  199.8  20.3  186  217-487     4-191 (191)
  5 TIGR02884 spore_pdaA delta-lac  99.9 1.7E-22 3.7E-27  200.0  19.8  119  217-349    35-155 (224)
  6 TIGR02873 spore_ylxY probable   99.9 3.8E-22 8.3E-27  201.9  20.2  118  217-349    83-202 (268)
  7 PF01522 Polysacc_deac_1:  Poly  99.9 6.1E-23 1.3E-27  183.1   9.2  118  217-350     4-123 (123)
  8 COG0726 CDA1 Predicted xylanas  99.8 2.6E-18 5.7E-23  172.1  20.4  115  219-348    65-181 (267)
  9 PRK14581 hmsF outer membrane N  99.6 3.8E-15 8.3E-20  166.1  13.7  126  217-354   105-275 (672)
 10 PRK14582 pgaB outer membrane N  99.6 6.1E-15 1.3E-19  164.7  13.8  126  217-354   105-275 (671)
 11 PF00057 Ldl_recept_a:  Low-den  99.0 1.5E-10 3.3E-15   81.1   2.6   36  142-177     2-37  (37)
 12 cd00112 LDLa Low Density Lipop  99.0 3.5E-10 7.7E-15   78.4   2.4   35  143-177     1-35  (35)
 13 smart00192 LDLa Low-density li  98.6 2.1E-08 4.6E-13   68.5   2.6   32  143-174     2-33  (33)
 14 PF10096 DUF2334:  Uncharacteri  98.2 6.3E-05 1.4E-09   75.6  16.1  117  222-348     3-146 (243)
 15 PF01607 CBM_14:  Chitin bindin  97.9 3.7E-06   8E-11   63.2   1.1   53   29-90      1-53  (53)
 16 COG3233 Predicted deacetylase   97.7 0.00028 6.2E-09   68.1  11.2  125  220-355     5-143 (233)
 17 PF09960 DUF2194:  Uncharacteri  97.7  0.0025 5.4E-08   71.3  19.1  196  249-491   297-531 (585)
 18 cd00112 LDLa Low Density Lipop  97.5 4.9E-05 1.1E-09   52.6   1.7   33   96-132     3-35  (35)
 19 PF12999 PRKCSH-like:  Glucosid  97.5   9E-05   2E-09   69.8   3.8   67   97-174    36-110 (176)
 20 PF00057 Ldl_recept_a:  Low-den  97.5   5E-05 1.1E-09   53.2   1.5   33   96-132     5-37  (37)
 21 smart00494 ChtBD2 Chitin-bindi  97.2 0.00037 8.1E-09   52.6   4.1   50   28-86      2-51  (56)
 22 KOG1215|consensus               97.1  0.0015 3.2E-08   77.6   8.7   75   96-180   220-297 (877)
 23 KOG1215|consensus               97.0  0.0012 2.6E-08   78.3   7.5  118   96-219   140-296 (877)
 24 smart00192 LDLa Low-density li  96.8 0.00066 1.4E-08   46.3   1.8   29   97-129     5-33  (33)
 25 PF04748 Polysacc_deac_2:  Dive  96.4   0.024 5.1E-07   55.9  10.1  182  251-488    19-213 (213)
 26 PF03065 Glyco_hydro_57:  Glyco  95.7   0.023   5E-07   60.4   7.2  195  266-492   113-322 (360)
 27 PF11959 DUF3473:  Domain of un  94.8    0.13 2.8E-06   46.8   7.9   55  432-488    62-131 (133)
 28 COG1449 Alpha-amylase/alpha-ma  94.2    0.83 1.8E-05   52.0  14.4   80  267-351   183-264 (615)
 29 PF15421 Polysacc_deac_3:  Puta  92.5     3.3   7E-05   43.8  14.1  173  265-497   131-312 (425)
 30 COG2861 Uncharacterized protei  92.4     4.4 9.6E-05   40.3  14.3  212  212-488    23-245 (250)
 31 KOG2397|consensus               92.3   0.099 2.1E-06   56.3   3.0   70   98-178    44-117 (480)
 32 PF12999 PRKCSH-like:  Glucosid  86.1    0.57 1.2E-05   44.5   2.7   33  144-176    34-68  (176)
 33 PF01607 CBM_14:  Chitin bindin  76.9     1.3 2.7E-05   32.6   1.1   28   96-123    26-53  (53)
 34 PF01074 Glyco_hydro_38:  Glyco  72.1      22 0.00048   36.0   9.3   80  267-354    64-145 (275)
 35 KOG1958|consensus               67.0 1.2E+02  0.0026   36.0  14.1   74  267-348   229-304 (1129)
 36 KOG3509|consensus               62.4      14 0.00031   43.8   6.1   89   83-177    18-110 (964)
 37 PF07315 DUF1462:  Protein of u  60.0      38 0.00082   28.6   6.5   53  417-474    11-63  (93)
 38 smart00494 ChtBD2 Chitin-bindi  53.8      11 0.00023   27.6   2.2   24   96-119    28-51  (56)
 39 COG4878 Uncharacterized protei  53.8      22 0.00048   36.1   5.0   98  254-354    34-139 (309)
 40 COG1543 Uncharacterized conser  51.6      36 0.00079   37.2   6.4   78  269-353   129-207 (504)
 41 PF12340 DUF3638:  Protein of u  48.5      85  0.0018   31.3   8.1   63  424-489    82-144 (229)
 42 PF04794 YdjC:  YdjC-like prote  43.1      63  0.0014   32.6   6.5  119  220-349     2-152 (261)
 43 COG4837 Uncharacterized protei  42.5      45 0.00098   28.3   4.3   54  417-475    18-71  (106)
 44 cd07286 PX_SNX18 The phosphoin  41.4      46   0.001   30.0   4.6   73  428-505    41-120 (127)
 45 PLN02701 alpha-mannosidase      40.8      63  0.0014   39.5   7.0   79  267-353   106-186 (1050)
 46 COG3590 PepO Predicted metallo  40.6     8.9 0.00019   42.7  -0.1   84  216-319   446-539 (654)
 47 COG3536 Uncharacterized protei  40.3      10 0.00022   33.0   0.3   78  207-286     3-90  (120)
 48 TIGR01358 DAHP_synth_II 3-deox  40.3 1.5E+02  0.0033   32.3   9.0   84  228-315   287-380 (443)
 49 PRK09819 alpha-mannosidase; Pr  39.7 1.1E+02  0.0024   36.7   8.8   79  266-354    69-149 (875)
 50 cd02995 PDI_a_PDI_a'_C PDIa fa  38.6      72  0.0016   26.3   5.3   51  433-483     8-58  (104)
 51 PLN02291 phospho-2-dehydro-3-d  38.0 1.7E+02  0.0036   32.2   8.9   84  228-315   307-400 (474)
 52 PF11071 DUF2872:  Protein of u  33.6      52  0.0011   29.8   3.6   52  428-490    84-136 (141)
 53 KOG3509|consensus               33.0      52  0.0011   39.4   4.5   61  113-180     9-69  (964)
 54 TIGR03473 HpnK hopanoid biosyn  31.8 2.9E+02  0.0063   28.3   9.4  117  221-349     4-154 (283)
 55 COG3603 Uncharacterized conser  31.7      73  0.0016   28.4   4.1   14  267-280   114-127 (128)
 56 PRK02134 hypothetical protein;  31.4   4E+02  0.0087   26.8  10.1  117  221-348     5-149 (249)
 57 KOG1959|consensus               30.4 1.4E+02   0.003   35.3   7.1   76  267-348   111-188 (996)
 58 COG3200 AroG 3-deoxy-D-arabino  27.8 3.2E+02  0.0069   29.1   8.6   97  199-314   277-383 (445)
 59 cd07295 PX_Grd19 The phosphoin  25.9      84  0.0018   27.7   3.6   62  428-493    47-114 (116)
 60 COG3531 Predicted protein-disu  25.9 1.6E+02  0.0034   28.8   5.6   72  267-354    20-92  (212)
 61 cd02992 PDI_a_QSOX PDIa family  25.7 1.6E+02  0.0034   25.5   5.4   50  433-482     9-59  (114)
 62 PF13788 DUF4180:  Domain of un  24.0      88  0.0019   27.7   3.3   30  461-490    83-113 (113)
 63 cd02334 ZZ_dystrophin Zinc fin  23.7      94   0.002   23.1   3.0   36  253-288    11-46  (49)
 64 TIGR02026 BchE magnesium-proto  23.1 1.9E+02  0.0041   32.2   6.6   62  421-487   410-471 (497)
 65 PF13592 HTH_33:  Winged helix-  22.6 1.4E+02  0.0029   22.9   3.8   18  336-353    25-42  (60)
 66 cd06860 PX_SNX7_30_like The ph  22.2 1.2E+02  0.0025   26.7   3.8   49  428-479    46-104 (116)
 67 PF12979 DUF3863:  Domain of Un  21.0 2.5E+02  0.0054   28.4   6.0  101  237-350    46-156 (351)
 68 PF02456 Adeno_IVa2:  Adenoviru  20.6      55  0.0012   34.2   1.6   31  203-235   151-181 (369)
 69 TIGR03646 YtoQ_fam YtoQ family  20.4      95  0.0021   28.2   2.8   52  428-490    87-139 (144)
 70 COG4637 Predicted ATPase [Gene  20.1 2.4E+02  0.0052   29.8   6.0   60  425-486   273-332 (373)

No 1  
>TIGR03212 uraD_N-term-dom putative urate catabolism protein. This model represents a protein that is predominantly found just upstream of the UraD protein (OHCU decarboxylase) and in a number of instances as a N-terminal fusion with it. UraD itself catalyzes the last step in the catabolism of urate to allantoate. The function of this protein is presently unknown. It shows homology with the pfam01522 polysaccharide deacetylase domain family.
Probab=100.00  E-value=1.3e-35  Score=303.54  Aligned_cols=223  Identities=17%  Similarity=0.210  Sum_probs=181.4

Q ss_pred             eEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCCCCCH
Q psy16054        221 ITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWSGGSY  298 (563)
Q Consensus       221 V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~  298 (563)
                      -.+|+=+..+.+....+++||++       ++|||||||+|...  ++++|++|.++|||||+||++|..+.    .++.
T Consensus        66 ~~~s~~~YG~rvG~~RiLdlL~~-------~gv~aTffv~g~~~e~~P~~v~~i~~~GHEIg~Hg~~H~~~~----~ls~  134 (297)
T TIGR03212        66 SMESLYEYGSRAGFWRLLRLFTE-------RGIPVTVFGVAMALARNPEAVAAMKEAGWEIASHGLRWIDYQ----DMDE  134 (297)
T ss_pred             cchhhhhhcchhCHHHHHHHHHH-------cCCCEEEEeEHHHHHHCHHHHHHHHHcCCEEeeccccCcccc----cCCH
Confidence            35667776666778889999974       79999999999985  67999999999999999999999864    6899


Q ss_pred             HHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHH-cCCeEEeecCCCCCCCCccceeccCCCCCCcCC
Q psy16054        299 DDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTD-QFFVYDASITASLGRVPIWPYTLYFRMPHKCNG  377 (563)
Q Consensus       299 ~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e-~Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~  377 (563)
                      ++++++|.++.++|++++|+   .|+|||++  ..+. +|+++|++ .||+|+||++++  +.|||--+           
T Consensus       135 ~~e~~~i~~s~~~i~~~tG~---~P~G~~~~--~~s~-~T~~LL~e~~Gf~Y~sd~~~d--D~Py~~~~-----------  195 (297)
T TIGR03212       135 AQEREHIAEAIRLHTEVTGE---RPLGWYTG--RTSP-NTRRLVAEEGGFLYDADSYAD--DLPYWDEV-----------  195 (297)
T ss_pred             HHHHHHHHHHHHHHHHHhCC---CCceEECC--CCCh-hHHHHHHHhcCceEeCchhhc--CCCeEeec-----------
Confidence            99999999999999998885   48999974  6674 69999999 999999999875  44444332           


Q ss_pred             CCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCC---cceEEeecccc
Q psy16054        378 NAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNR---APLGLHFHASW  454 (563)
Q Consensus       378 ~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nr---ap~~l~lH~~~  454 (563)
                             ...++++||++ |+.+|. .|..          .....++++++++|+++|+.+|++.+   ..|+|.|| ++
T Consensus       196 -------~~~~~l~lP~~-~~~nD~-~~~~----------~~~~~~~~~~~~~~~d~fd~l~~eg~~~~~~~~i~lH-p~  255 (297)
T TIGR03212       196 -------AGRPQLIVPYT-LDANDM-RFAT----------PQGFNTGEQFFTYLRDAFDVLYAEGEGAPKMMSIGLH-CR  255 (297)
T ss_pred             -------CCCCeEEEecc-cccCcH-HHHh----------ccCCCCHHHHHHHHHHHHHHHHHhCCCCCceEEEecC-cc
Confidence                   11236779987 444442 2210          11235789999999999999998643   36799999 88


Q ss_pred             ccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHhhCC
Q psy16054        455 LKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP  493 (563)
Q Consensus       455 l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~~~p  493 (563)
                      ++|+|+|+.+|++||+||++++||||+|.+||++|++.-
T Consensus       256 i~G~p~R~~~L~~~l~~i~~~~~VW~at~~eIA~~~~~~  294 (297)
T TIGR03212       256 LVGRPGRIAALQRFLDYVQSHDKVWVARRIDIARHWHET  294 (297)
T ss_pred             ccCCHHHHHHHHHHHHHHHhCCCEEEEcHHHHHHHHHHh
Confidence            999999999999999999999999999999999998543


No 2  
>PRK15394 4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol deformylase ArnD; Provisional
Probab=99.95  E-value=2.9e-27  Score=241.20  Aligned_cols=216  Identities=16%  Similarity=0.190  Sum_probs=165.8

Q ss_pred             CCCchHHHHHhccCCCCCCCCCCceEEEEecc----------------------------C------------------C
Q psy16054        231 VDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHK----------------------------Y------------------T  264 (563)
Q Consensus       231 ~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~----------------------------~------------------~  264 (563)
                      ....+.+++||++       ++|+||||.+..                            |                  .
T Consensus        17 ~~g~~~~~~~~~~-------~~~~a~f~~~~gpd~~g~~~~r~~~~~fl~k~~r~~~~~~yg~~~l~~g~~~PG~~I~~~   89 (296)
T PRK15394         17 REGVPRLLEILSK-------HGIQASFFFSVGPDNMGRHLWRLLKPRFLWKMLRSNAASLYGWDILLAGTAWPGKEIGKA   89 (296)
T ss_pred             ccCHHHHHHHHHH-------cCCCEEEEeccCCCchhHHHHHHhhhHHHHHHHhcCchhhcCHHHHhhcccCCCchHHHH
Confidence            3445778899974       789999999853                            0                  0


Q ss_pred             CHHHHHHHHHcCCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHH
Q psy16054        265 NYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTD  344 (563)
Q Consensus       265 ~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e  344 (563)
                      ++++|++|.++ ||||+|||+|..+......++.+++++||.++.++|++.+|.   .++|||+|++..+. +++++|++
T Consensus        90 ~P~lvr~i~~~-HEIG~Hg~~H~~wq~~~~~ls~~e~~~eI~ra~~~Le~itG~---~p~g~raPgw~~~~-~tl~ll~e  164 (296)
T PRK15394         90 LADIIREAAKA-HEVGLHAWDHHAWQAWSGVWSRQQLIEQIARGVDALEEIIGQ---PVTCSAAAGWRADQ-RVVEAKEA  164 (296)
T ss_pred             CHHHHHHHHhc-CEehhcccCccchhcccccCCHHHHHHHHHHHHHHHHHHhCC---CCCEEeCCCccCCH-HHHHHHHH
Confidence            34699999999 999999999996432223689999999999999999998884   58999999999995 69999999


Q ss_pred             cCCeEEeecCCCCCCCCccceeccCCCCCCcCCCCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCCh
Q psy16054        345 QFFVYDASITASLGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTG  424 (563)
Q Consensus       345 ~Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~  424 (563)
                      .||.||||+..   +.||||.+-+                ...+++|||++. ..+|     +. .|        ...+.
T Consensus       165 ~Gf~Y~Ss~~~---d~Py~~~~~~----------------g~~~~veIP~tl-~t~D-----~~-~g--------~~~~~  210 (296)
T PRK15394        165 FGFRYNSDCRG---THPFRPLLPD----------------GSLGTVQIPVTL-PTWD-----EV-IG--------RDVKA  210 (296)
T ss_pred             cCCeeecCCCC---CCceeeecCC----------------CcCCeEEEcccc-ccch-----hh-hc--------CCCCH
Confidence            99999999965   4566665422                123689999973 3222     21 11        12567


Q ss_pred             HHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHhhCCchhhhh
Q psy16054        425 EQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNPTELTSL  499 (563)
Q Consensus       425 ~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~~~p~~~~~~  499 (563)
                      +++.+++++.|++   ..+.+ +++|| ++++|.| ++..|++||+++++ .+|||+|++||+.-.++..|+.++
T Consensus       211 ~~f~~~~~d~~~~---~~g~~-V~tiH-P~v~G~~-~~~~fe~lL~~~~~-~gv~F~tl~el~~~~~~~~p~~~~  278 (296)
T PRK15394        211 EDFNDFILDRILR---DKGTP-VYTIH-AEVEGIA-YAHNFEDLLKRAAQ-EGITFCPLSELLPDDLETLPLGKV  278 (296)
T ss_pred             HHHHHHHHHHhcc---ccCce-EEEEC-CCcCCcc-hHHHHHHHHHHHhc-CCcEEeeHHHHHhhchhhCCccce
Confidence            8899999888865   22334 69999 6799998 99999999999997 669999999998844455555444


No 3  
>TIGR03006 pepcterm_polyde polysaccharide deactylase family protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide deacetylases (pfam01522). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene. The highest scoring homologs below the trusted cutoff for this model are found in several species of Methanosarcina, an archaeal genus.
Probab=99.92  E-value=2.7e-24  Score=217.01  Aligned_cols=211  Identities=19%  Similarity=0.293  Sum_probs=144.4

Q ss_pred             CchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHH
Q psy16054        233 NSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRL  310 (563)
Q Consensus       233 ~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~  310 (563)
                      ++..+++||++       +++||||||+|...  .+++|++|.++|||||+||++|..+.    .++.+++++||.++++
T Consensus        29 nt~riL~lL~~-------~gikATFFv~g~~~e~~p~lir~i~~~GhEIgsHg~sH~~l~----~ls~ee~~~eI~~s~~   97 (265)
T TIGR03006        29 NTDRILDLLDR-------HGVKATFFTLGWVAERYPELVRRIVAAGHELASHGYGHERVT----TQTPEAFRADIRRSKA   97 (265)
T ss_pred             hHHHHHHHHHH-------cCCcEEEEEeccchhhCHHHHHHHHHcCCEeeeccccCcCch----hCCHHHHHHHHHHHHH
Confidence            46778899974       79999999999875  45999999999999999999999763    6899999999999999


Q ss_pred             HHHHHhCCCCCcceEEecCccccChhh--HHHHHHHcCCeEEeecCCCCCCCCccceecc-CCCCCCcCCCCCCCCCC-C
Q psy16054        311 IIERFANISDGSVIGVRAPYLRVGGNK--QFSMMTDQFFVYDASITASLGRVPIWPYTLY-FRMPHKCNGNAHNCPSK-S  386 (563)
Q Consensus       311 ~l~~~~gi~~~~~~G~R~P~l~~~~~~--~~~~L~e~Gf~YDsS~~~~~~~~p~wPytld-~~~p~~c~~~~~pcp~~-~  386 (563)
                      +|++++|.   .++|||+|++.++..+  .+++|+++||.||||+.         |-..| ||+|..     ++.|-+ .
T Consensus        98 ~Le~itG~---~~~gfRaP~~s~~~~t~~a~~iL~e~Gy~YdsS~~---------p~~~d~~g~~~~-----~~~p~~~~  160 (265)
T TIGR03006        98 LLEDLSGQ---PVRGYRAPSFSIGKKNLWALDVLAEAGYRYSSSIY---------PVRHDHYGMPDA-----PRFPFRPP  160 (265)
T ss_pred             HHHHHhCC---CceEEECCCCCCCCCcHHHHHHHHHCCCEEEEeec---------cCcCCcCCCCCC-----CCCceecC
Confidence            99998874   5899999999887532  37999999999999994         11122 454422     233422 2


Q ss_pred             C-CcEEEeccccc--cCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCCcceEEeecccccc-CchhHH
Q psy16054        387 H-PTWEMVMNELD--RRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLK-SKKEFK  462 (563)
Q Consensus       387 ~-~lweiP~~~l~--~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~-g~p~r~  462 (563)
                      . +|||||++.+.  ....|....   |  ++     ..-+-.+++++.....+   ..+.|+.+|||+..|. ++| |+
T Consensus       161 ~g~l~e~P~s~~~~~~~~~p~~gg---~--~~-----r~~P~~~~~~~~~~~~~---~~~~~~~~y~Hpwe~dp~~~-~~  226 (265)
T TIGR03006       161 NGRLLEVPVTTVRLGGRNLPAAGG---G--YF-----RLLPYALSRWALRRVNG---REGRPAIFYFHPWEIDPDQP-RI  226 (265)
T ss_pred             CCcEEEEccchhhhcccccccCCC---c--ch-----hcChHHHHHHHHHHHHh---cCCceEEEEEchhhcCCCCC-CC
Confidence            2 39999998643  223333211   0  11     12344444444443332   2357999999966664 222 22


Q ss_pred             ---------------HHHHHHHHHHHcCCCeEEEeHHHHH
Q psy16054        463 ---------------DELIKFIEEMLDRNDVYFVTMLQVI  487 (563)
Q Consensus       463 ---------------~al~~fl~~i~~~~dVwfvT~~ei~  487 (563)
                                     ...+.=|+.|+  .|.-|+|+.|++
T Consensus       227 ~~~~~~~~fr~y~~~~~~~~~l~~ll--~~~~~~~~~~~~  264 (265)
T TIGR03006       227 AGISMKSRFRHYLNLDRMEGRLRRLL--GDFRWGRMDRVF  264 (265)
T ss_pred             CCCCccceeeeecCHHHHHHHHHHHH--hcCceeeHHHhh
Confidence                           11111134443  578899999885


No 4  
>TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB. This model describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reducted sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterized polysaccharide deacetylase; the exact function this protein family is unknown.
Probab=99.91  E-value=3.6e-23  Score=199.80  Aligned_cols=186  Identities=19%  Similarity=0.196  Sum_probs=142.2

Q ss_pred             CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCC
Q psy16054        217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWS  294 (563)
Q Consensus       217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~  294 (563)
                      ..+.|+||||||+.......++++|++       +++||||||+|.+.  +++++++|.++|||||+|||+|..+.    
T Consensus         4 ~~k~V~LTFDDgp~~~~t~~~l~~L~~-------~~ikaTfFv~g~~~~~~~~~~~~i~~~Gheig~Ht~~H~~~~----   72 (191)
T TIGR02764         4 SDKKIALTFDISWGNDYTEPILDTLKE-------YDVKATFFLSGSWAERHPELVKEIVKDGHEIGSHGYRHKNYT----   72 (191)
T ss_pred             CCCEEEEEEECCCCcccHHHHHHHHHH-------cCCCEEEEeccHHHHHCHHHHHHHHhCCCEEEECCcCCCCcc----
Confidence            346899999999987766778899974       78999999999986  56999999999999999999999763    


Q ss_pred             CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEEeecCCCCCCCCccceeccCCCCCC
Q psy16054        295 GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYDASITASLGRVPIWPYTLYFRMPHK  374 (563)
Q Consensus       295 ~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YDsS~~~~~~~~p~wPytld~~~p~~  374 (563)
                      .++.+++++||.++.++|++.+|.   .+++||+||+.++. .++++|+++||.+- .|..+         +.|      
T Consensus        73 ~~~~~~~~~ei~~~~~~l~~~~g~---~~~~fr~P~G~~~~-~~~~~l~~~G~~~v-~w~~~---------~~D------  132 (191)
T TIGR02764        73 TLEDEKIKKDILRAQEIIEKLTGK---KPTLFRPPSGAFNK-AVLKAAESLGYTVV-HWSVD---------SRD------  132 (191)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHhCC---CCCEEECCCcCCCH-HHHHHHHHcCCeEE-EecCC---------CCc------
Confidence            689999999999999999998885   47899999999995 79999999999842 11110         000      


Q ss_pred             cCCCCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCCcceEEeecccc
Q psy16054        375 CNGNAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASW  454 (563)
Q Consensus       375 c~~~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~  454 (563)
                                            |.                      ..+++++++.++.+.   .  ++  -.+.||.. 
T Consensus       133 ----------------------~~----------------------~~~~~~i~~~~~~~~---~--~g--~Iil~Hd~-  160 (191)
T TIGR02764       133 ----------------------WK----------------------NPGVESIVDRVVKNT---K--PG--DIILLHAS-  160 (191)
T ss_pred             ----------------------cC----------------------CCCHHHHHHHHHhcC---C--CC--CEEEEeCC-
Confidence                                  00                      023445555444332   1  12  25778832 


Q ss_pred             ccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHH
Q psy16054        455 LKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVI  487 (563)
Q Consensus       455 l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~  487 (563)
                       ...+.-.++|.++|++++++ +.-|||++|++
T Consensus       161 -~~~~~t~~~l~~~i~~l~~~-Gy~~vtl~~l~  191 (191)
T TIGR02764       161 -DSAKQTVKALPTIIKKLKEK-GYEFVTISELI  191 (191)
T ss_pred             -CCcHhHHHHHHHHHHHHHHC-CCEEEEHHHhC
Confidence             12455689999999999965 88999999874


No 5  
>TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase. Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores in the peptidoglycan wall, or spore cortex. The proteins in this family are PdaA (yfjS), a member of a larger family of polysaccharide deacetylases, and are specificially involved in delta-lactam biosynthesis. PdaA acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the following transpeptidation for lactam ring formation, as heterologous expression in E. coli of CwlD and PdaA together is sufficient for delta-lactam production.
Probab=99.90  E-value=1.7e-22  Score=199.97  Aligned_cols=119  Identities=13%  Similarity=0.214  Sum_probs=105.4

Q ss_pred             CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCC
Q psy16054        217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWS  294 (563)
Q Consensus       217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~  294 (563)
                      .++.|+||||||+.......++++|++       +++||||||+|.+.  +.+++++|+++|||||+|||+|..+.    
T Consensus        35 ~~k~VaLTFDDGp~~~~t~~lL~~L~~-------~~vkATFFv~G~~~~~~p~~ir~i~~~GheIgnHt~~H~~~~----  103 (224)
T TIGR02884        35 SKKVIYLTFDNGYENGYTPKILDVLKE-------KKVPAAFFVTGHYIKTQPDLIKRMVDEGHIVGNHSVHHPSLT----  103 (224)
T ss_pred             CCCEEEEEEECCCCccchHHHHHHHHH-------cCCCeEEEeechhhHHCHHHHHHHHHcCCEeeecCccCcCcc----
Confidence            457899999999987666778899974       78999999999986  56999999999999999999999764    


Q ss_pred             CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeE
Q psy16054        295 GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVY  349 (563)
Q Consensus       295 ~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~Y  349 (563)
                      .++.+++++||.++.++|++.+|..  .+..||+||+.++. .++++|+++||.+
T Consensus       104 ~ls~~~~~~ei~~~~~~i~~~~G~~--~~~~fR~P~G~~~~-~~~~~l~~~Gy~~  155 (224)
T TIGR02884       104 AVNDEKFKEELTGVEEEFKKVTGQK--EMKYFRPPRGVFSE-RTLAYTKELGYYT  155 (224)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHhCCC--CCCEEeCCCCCcCH-HHHHHHHHcCCcE
Confidence            6899999999999999999988853  27899999999995 6999999999986


No 6  
>TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family. Members of this protein family are most closely related to TIGR02764, a subset of polysaccharide deacetylase family proteins found in a species if and only if the species forms endospores like those of Bacillus subtilis or Clostridium tetani. This family is likewise restricted to spore-formers, but is not universal among them in having sequences with full-length matches to the model.
Probab=99.89  E-value=3.8e-22  Score=201.87  Aligned_cols=118  Identities=19%  Similarity=0.192  Sum_probs=105.6

Q ss_pred             CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCC
Q psy16054        217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWS  294 (563)
Q Consensus       217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~  294 (563)
                      ..++|+||||||+....+..++++|++       +++||||||+|.+.  +++++++|+++|||||+|||+|..+.    
T Consensus        83 ~~k~VaLTFDdg~~~~~t~~iL~iLkk-------~~vkATFFv~G~~i~~~p~l~k~i~~~GheIGnHT~sH~~l~----  151 (268)
T TIGR02873        83 EKPMVALLINVAWGNEYLPEILQILKK-------HDVKATFFLEGKWVKENSQLAKMIVEQGHEIGNHAYNHPDMA----  151 (268)
T ss_pred             CCCEEEEEEeCCCCcchHHHHHHHHHH-------CCCCEEEEeehHhhhHCHHHHHHHHHCCCEEEecCCcCCCcc----
Confidence            447899999999988777788999974       78999999999985  56999999999999999999999874    


Q ss_pred             CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeE
Q psy16054        295 GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVY  349 (563)
Q Consensus       295 ~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~Y  349 (563)
                      .++.+++++||.++.++|++.+|.   .++.||+|||.++. .++++++++||..
T Consensus       152 ~ls~~~~~~Ei~~~~~~i~~~~G~---~p~~fRpP~G~~n~-~~~~~l~~~G~~~  202 (268)
T TIGR02873       152 TLSKEEIYDQINQTNEIIEATIGV---TPKWFAPPSGSFND-NVVQIAADLQMGT  202 (268)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHhCC---CCCEEECCCCCCCH-HHHHHHHHCCCeE
Confidence            589999999999999999998885   57899999999994 7999999999964


No 7  
>PF01522 Polysacc_deac_1:  Polysaccharide deacetylase;  InterPro: IPR002509 This domain is found in polysaccharide deacetylase. This family of polysaccharide deacetylases includes NodB (nodulation protein B from Rhizobium) which is a chitooligosaccharide deacetylase []. It also includes chitin deacetylase from yeast [], and endoxylanases which hydrolyses glucosidic bonds in xylan [].; GO: 0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, 0005975 carbohydrate metabolic process; PDB: 2IW0_A 2CC0_B 2VYO_A 2J13_A 2C71_A 2C79_A 1W1A_1 1W1B_1 1W17_A 1NY1_B ....
Probab=99.88  E-value=6.1e-23  Score=183.13  Aligned_cols=118  Identities=22%  Similarity=0.321  Sum_probs=97.5

Q ss_pred             CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCC
Q psy16054        217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWS  294 (563)
Q Consensus       217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~  294 (563)
                      .+++|+||||||+... +..++++|++       ++++|||||++.++  .++++++|.++|||||+||++|..+    .
T Consensus         4 ~~~~v~ltfDdg~~~~-~~~~~~~l~~-------~~i~at~fv~~~~~~~~~~~l~~l~~~G~ei~~H~~~H~~~----~   71 (123)
T PF01522_consen    4 PKKSVALTFDDGYRDN-YDRLLPLLKK-------YGIPATFFVIGSWVERYPDQLRELAAAGHEIGNHGWSHPNL----S   71 (123)
T ss_dssp             TSSEEEEEEESHCHTH-HHHHHHHHHH-------TT--EEEEE-HHHHHHHHHHHHHHHHTT-EEEEE-SSSSCG----G
T ss_pred             CCCEEEEEEecCchhh-HHHHHHHHHh-------cccceeeeecccccccccccchhHHHHHHHHHhcCCccccc----c
Confidence            5689999999999554 5667789874       78999999999986  3489999999999999999999986    3


Q ss_pred             CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEE
Q psy16054        295 GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYD  350 (563)
Q Consensus       295 ~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YD  350 (563)
                      .++.+++++||.++++.|++.+|.   .+.+||+|++..+. +++++|+++||.||
T Consensus        72 ~~~~~~~~~ei~~~~~~l~~~~g~---~~~~f~~P~g~~~~-~~~~~l~~~G~~y~  123 (123)
T PF01522_consen   72 TLSPEELRREIERSREILEEITGR---PPKGFRYPFGSYDD-NTLQALREAGYKYD  123 (123)
T ss_dssp             GS-HHHHHHHHHHHHHHHHHHHSS---EESEEE-GGGEECH-HHHHHHHHTT-EEE
T ss_pred             cCCHHHHHHHHHHHHHHHHHHhCC---CCcEEECCCCCCCH-HHHHHHHHcCCCcC
Confidence            689999999999999999999885   58999999999994 79999999999997


No 8  
>COG0726 CDA1 Predicted xylanase/chitin deacetylase [Carbohydrate transport and metabolism]
Probab=99.80  E-value=2.6e-18  Score=172.13  Aligned_cols=115  Identities=23%  Similarity=0.304  Sum_probs=103.3

Q ss_pred             CceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCCCC
Q psy16054        219 QMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWSGG  296 (563)
Q Consensus       219 q~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~  296 (563)
                      +.|+||||||+...++..+++||++       +++||||||.|.+.  ..++++++.++|||||+||++|..+.    .+
T Consensus        65 k~v~lTFDDg~~~~~~~~il~iL~k-------~~i~ATfFv~g~~i~~~p~~~~~~~~~Gheig~H~~~h~~~~----~~  133 (267)
T COG0726          65 KAVALTFDDGPLDGNTPRILPLLKK-------YGIKATFFVVGSWIERNPDLVKRIAEAGHEIGNHGYDHPDLQ----DL  133 (267)
T ss_pred             CeEEEEeecCCCCCCcHHHHHHHHH-------cCCceEEEEehHhhHHCHHHHHHHHhccCeehhCcccCCCcc----cC
Confidence            7899999999998678889999974       78999999999886  45999999999999999999999874    58


Q ss_pred             CHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCe
Q psy16054        297 SYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFV  348 (563)
Q Consensus       297 s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~  348 (563)
                      +.++.++||.++...|.+.+|..   +++||+|++.++ ..+..++++.|+.
T Consensus       134 ~~~~~~~~i~~~~~~l~~~~g~~---~~~~r~p~g~~~-~~~~~~~~~~g~~  181 (267)
T COG0726         134 SLEELGAEIARAHDILKKITGGR---PRGFRPPYGSYN-ESSLALARRLGYR  181 (267)
T ss_pred             CHHHHHHHHHHHHHHHHHHhCCC---CceEECCCCccC-HHHHHHHHhCCCc
Confidence            89999999999999999987754   899999999998 4689999999985


No 9  
>PRK14581 hmsF outer membrane N-deacetylase; Provisional
Probab=99.61  E-value=3.8e-15  Score=166.13  Aligned_cols=126  Identities=10%  Similarity=0.093  Sum_probs=100.3

Q ss_pred             CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC---------------------CCHHHHHHHHHc
Q psy16054        217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY---------------------TNYASVQELHRK  275 (563)
Q Consensus       217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~---------------------~~y~~v~~i~~~  275 (563)
                      +++.|+||||||+.+ ++..+++||++       +++||||||++.+                     .+|++|++|.+.
T Consensus       105 P~KaV~LTFDDGy~s-ny~~AlPILKk-------yg~pATfFvVg~wid~~~~~~v~~~~~~~~r~~~LTW~qIrEM~~s  176 (672)
T PRK14581        105 PDKAVLLTFDDGYSS-FYRRVYPLLKA-------YKWSAVLAPVGTWIDTATDKKVDFGGLSTDRDRFATWKQITEMSKS  176 (672)
T ss_pred             CCCeEEEEEEcCCcc-hHHHHHHHHHH-------cCCCEEEEEechhhccCcccccccccccccccccCCHHHHHHHHHC
Confidence            468999999999986 57777899974       7999999998754                     246799999999


Q ss_pred             C-CeecccCCCCCCCcC---------------CCCC----CC----HHHHHHHHHHHHHHHHHHhCCCCCcceEEecCcc
Q psy16054        276 G-HEISVFSLTHKDDPK---------------YWSG----GS----YDDWLAEMAGGRLIIERFANISDGSVIGVRAPYL  331 (563)
Q Consensus       276 G-HEIa~H~~~H~~~~~---------------~~~~----~s----~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l  331 (563)
                      | ||||+||++|.....               |...    .+    .+++++||.++++.|++.+|.   .++.|++|||
T Consensus       177 GLvEIGSHT~sHh~~~~~np~g~~~pa~~~~~y~~~~~~yEs~~~~~~rl~~Di~~s~~~Ie~~lG~---~p~~FayPyG  253 (672)
T PRK14581        177 GLVEIGAHTYASHYGVIANPQGNTEPAAANLQYDPKTKQYETVEAFKQRMEKDVALITQRIVQATGK---QPRVWVWPYG  253 (672)
T ss_pred             CCcEEEcCCcccccccccCccccccchhhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHhCC---CCCEEEcCCC
Confidence            9 699999999863210               0000    11    244677899999999998884   5788999999


Q ss_pred             ccChhhHHHHHHHcCCeEEeecC
Q psy16054        332 RVGGNKQFSMMTDQFFVYDASIT  354 (563)
Q Consensus       332 ~~~~~~~~~~L~e~Gf~YDsS~~  354 (563)
                      .++. .+.++++++||.+..+..
T Consensus       254 ~yn~-~~~~iak~aGy~~afTt~  275 (672)
T PRK14581        254 APNG-TVLNILRQHGYQLAMTLD  275 (672)
T ss_pred             CcCH-HHHHHHHHCCCcEEEECC
Confidence            9995 799999999999888764


No 10 
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=99.60  E-value=6.1e-15  Score=164.73  Aligned_cols=126  Identities=13%  Similarity=0.095  Sum_probs=99.9

Q ss_pred             CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC---------------------CHHHHHHHHHc
Q psy16054        217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT---------------------NYASVQELHRK  275 (563)
Q Consensus       217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~---------------------~y~~v~~i~~~  275 (563)
                      +++.|+||||||+.+. +..+++||++       +++|||||+++.+.                     ++++|++|.+.
T Consensus       105 P~K~VaLTFDDGy~s~-yt~A~PILkk-------ygvpATfFlvg~wi~~~~~~~~~~~~~~~~r~~~LtWeqIreM~~s  176 (671)
T PRK14582        105 PEKAVLLTFDDGYSSF-YTRVFPILQA-------FQWPAVWAPVGSWVDTPADQPVKFGGEMVPREYFATWQQVREVARS  176 (671)
T ss_pred             CCCeEEEEEEcCCCch-HHHHHHHHHH-------cCCCEEEEEechhhccCccccccccccccccccCCCHHHHHHHHhC
Confidence            4678999999999875 5667899974       79999999987653                     34799999999


Q ss_pred             C-CeecccCC-CCCCCcC--------------CCC----CCCH----HHHHHHHHHHHHHHHHHhCCCCCcceEEecCcc
Q psy16054        276 G-HEISVFSL-THKDDPK--------------YWS----GGSY----DDWLAEMAGGRLIIERFANISDGSVIGVRAPYL  331 (563)
Q Consensus       276 G-HEIa~H~~-~H~~~~~--------------~~~----~~s~----~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l  331 (563)
                      | ||||+||+ +|..+..              |..    -.+.    +.+++||.++++.|++.+|.   .++.||+|||
T Consensus       177 GlvEIGSHT~~sH~~l~anp~g~~~pa~~~r~~~~~~~~yEs~~e~~~ri~~DL~~s~~~Ie~~tG~---~p~~FayPyG  253 (671)
T PRK14582        177 RLVEIASHTWNSHYGIQANPQGSLLPAAVNRAYFTDHARYETAAEYRERIRLDAVKMTEYIRTKAGK---NPRVWVWPYG  253 (671)
T ss_pred             CCeEEEcCCchhccccccccccccccchhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHhCC---CCcEEecCCC
Confidence            9 69999998 5974311              000    0122    34678999999999998885   4788999999


Q ss_pred             ccChhhHHHHHHHcCCeEEeecC
Q psy16054        332 RVGGNKQFSMMTDQFFVYDASIT  354 (563)
Q Consensus       332 ~~~~~~~~~~L~e~Gf~YDsS~~  354 (563)
                      .++. .++++++++||.+.-+..
T Consensus       254 ~~n~-~~~~iakeaGY~~afT~~  275 (671)
T PRK14582        254 EANG-IALEELKKLGYDMAFTLE  275 (671)
T ss_pred             CCCH-HHHHHHHHCCCeEEEEeC
Confidence            9995 799999999999887753


No 11 
>PF00057 Ldl_recept_a:  Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor;  InterPro: IPR002172  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=99.02  E-value=1.5e-10  Score=81.07  Aligned_cols=36  Identities=56%  Similarity=1.304  Sum_probs=34.3

Q ss_pred             CCCCCCcccCCCceecCccccCCCCCCCCCCccccc
Q psy16054        142 VCPEGKLSCGNGECIDKTLFCDDKPDCKDESDENAC  177 (563)
Q Consensus       142 ~C~~~~f~C~~G~Ci~~~~~Cdg~~DC~DgsDE~~C  177 (563)
                      .|...+|+|.+|.||+..|+|||..||.|||||.+|
T Consensus         2 ~C~~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~C   37 (37)
T PF00057_consen    2 TCPPGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQNC   37 (37)
T ss_dssp             SSSTTEEEETTSSEEEGGGTTSSSCSSSSSTTTSSH
T ss_pred             cCcCCeeEcCCCCEEChHHcCCCCCCCCCCcccccC
Confidence            588999999999999999999999999999999876


No 12 
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.95  E-value=3.5e-10  Score=78.35  Aligned_cols=35  Identities=46%  Similarity=1.132  Sum_probs=32.9

Q ss_pred             CCCCCcccCCCceecCccccCCCCCCCCCCccccc
Q psy16054        143 CPEGKLSCGNGECIDKTLFCDDKPDCKDESDENAC  177 (563)
Q Consensus       143 C~~~~f~C~~G~Ci~~~~~Cdg~~DC~DgsDE~~C  177 (563)
                      |.+.+|+|.+|.||+..++|||..||.|||||..|
T Consensus         1 C~~~~f~C~~~~Ci~~~~~CDg~~DC~dgsDE~~C   35 (35)
T cd00112           1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC   35 (35)
T ss_pred             CCCCeEEcCCCCeeCHHHcCCCccCCCCCcccccC
Confidence            56789999999999999999999999999999876


No 13 
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. The N-terminal  type A repeats in LDL receptor bind the lipoproteins. Other homologous  domains occur in related receptors, including the very low-density  lipoprotein receptor and the LDL receptor-related protein/alpha  2-macroglobulin receptor, and in proteins which are functionally  unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.62  E-value=2.1e-08  Score=68.51  Aligned_cols=32  Identities=50%  Similarity=1.136  Sum_probs=30.2

Q ss_pred             CCCCCcccCCCceecCccccCCCCCCCCCCcc
Q psy16054        143 CPEGKLSCGNGECIDKTLFCDDKPDCKDESDE  174 (563)
Q Consensus       143 C~~~~f~C~~G~Ci~~~~~Cdg~~DC~DgsDE  174 (563)
                      |...+|+|.+|.||+..++|||..||.|||||
T Consensus         2 C~~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE   33 (33)
T smart00192        2 CPPGEFQCDNGRCIPLSWVCDGVDDCSDGSDE   33 (33)
T ss_pred             CCCCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence            66679999999999999999999999999998


No 14 
>PF10096 DUF2334:  Uncharacterized protein conserved in bacteria (DUF2334);  InterPro: IPR018763 This group of proteins has no known function.
Probab=98.17  E-value=6.3e-05  Score=75.56  Aligned_cols=117  Identities=19%  Similarity=0.345  Sum_probs=78.0

Q ss_pred             EEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--------------CH-HHHHHHHHcCCeecccCCCC
Q psy16054        222 TITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--------------NY-ASVQELHRKGHEISVFSLTH  286 (563)
Q Consensus       222 ~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--------------~y-~~v~~i~~~GHEIa~H~~~H  286 (563)
                      +|.++|---..+.+.+..|.+    ...+.+|+.+.+|.-.+.              .+ +.++.+.++|.+|+.||++|
T Consensus         3 lirleDVsP~~~~~~l~~i~d----~l~~~~ipf~v~vIP~~~d~~~~~~~~l~~~~~f~~~L~~~~~~Gg~I~lHGYtH   78 (243)
T PF10096_consen    3 LIRLEDVSPFSDLEKLKEIAD----YLYKYGIPFSVAVIPVYVDPNGGITVNLSDNPEFVEYLRYLQARGGEIVLHGYTH   78 (243)
T ss_pred             EEEEeCCCCCCCHHHHHHHHH----HHHHCCCCEEEEEEecccCCCCcccccchhhHHHHHHHHHHHhcCCEEEEEecce
Confidence            466666322233444433221    123468998888876542              12 47888889999999999999


Q ss_pred             CCCc-------CCCC-----CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCe
Q psy16054        287 KDDP-------KYWS-----GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFV  348 (563)
Q Consensus       287 ~~~~-------~~~~-----~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~  348 (563)
                      ....       .+|.     ..+.++.++-|..+.++|++. |+   .++||-+|....+. .+++++++ .|.
T Consensus        79 q~~~~~sg~~~ef~~~~~~~~~~~~~a~~ri~~gl~~l~~~-gi---~p~~f~aP~y~~s~-~~~~~~~~-~f~  146 (243)
T PF10096_consen   79 QYGNSVSGDGFEFWDSEFDAGDSEEEAKERIEKGLEILAEL-GI---YPVGFEAPHYWASP-ETYKVLEK-YFT  146 (243)
T ss_pred             ecCCCcccccceeccccccccCCHHHHHHHHHHHHHHHHHC-CC---cccEEECCccccCH-HHHHHHHh-hCc
Confidence            9831       2333     256677777788888888774 66   48999999999995 56666644 444


No 15 
>PF01607 CBM_14:  Chitin binding Peritrophin-A domain;  InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=97.90  E-value=3.7e-06  Score=63.25  Aligned_cols=53  Identities=36%  Similarity=0.773  Sum_probs=34.3

Q ss_pred             CCCCCCCCceecCCCCCCCCeeeCCCCCCCCccccceeeCCCCccccCCCCCCCCCCCcCCC
Q psy16054         29 CEGRPPEEYFRLTIEGDCRDVVRCDQAGENGITRLAGVRCPNGLAFDIDRQTCDWKTNVKNC   90 (563)
Q Consensus        29 C~~~~~~~~f~~~~~~~C~~~~~C~~~~~~~~~~~~~~~Cp~Gl~~d~~~~~Cd~~~~v~~C   90 (563)
                      |.++. .++|..  ..+|..||.|.+|.      +...+||.|+.||...+.|+++..+..|
T Consensus         1 C~~~~-~~~~~~--~~~C~~Y~~C~~g~------~~~~~C~~g~~fd~~~~~C~~~~~~~~C   53 (53)
T PF01607_consen    1 CPGRG-DGFYPH--PDDCRKYYQCVNGQ------AVEQRCPEGLYFDPSSQRCVPPSNVVQC   53 (53)
T ss_dssp             ----S-SEEE----SS-SSEEEEEETTE------EEEEE-TTS-EE-TTTSSEE-TTT-TT-
T ss_pred             CCccC-CeeEeC--CCCCCEEEEeeCCc------EECCCCcCCCEECcCcCEEcCCccCCCC
Confidence            44554 456655  67999999999998      8899999999999999999998764444


No 16 
>COG3233 Predicted deacetylase [General function prediction only]
Probab=97.74  E-value=0.00028  Score=68.15  Aligned_cols=125  Identities=11%  Similarity=0.101  Sum_probs=85.3

Q ss_pred             ceEEeecCCcCC-CCchHHHHHhccCCCCCCCCCCceEE-EEeccCC-------CH---HHHHHHHHcCCeecccCCCCC
Q psy16054        220 MITITFNGAVNV-DNSDLYDEIFNGNRQNPNGCQIKGTF-FVSHKYT-------NY---ASVQELHRKGHEISVFSLTHK  287 (563)
Q Consensus       220 ~V~lTFDD~~~~-~~~~~~~~ll~~~r~n~n~~~ikaTF-Fv~~~~~-------~y---~~v~~i~~~GHEIa~H~~~H~  287 (563)
                      .|.|-.|-++.- ..+..+..+++..      +....|+ +|.-...       +.   +.+.+..++|+|++.|||.|.
T Consensus         5 ~iillhdVSpv~~~~~~~i~~~ide~------~~~~~t~lLViPn~~~~~~l~~d~rf~~~l~~r~e~Gdel~lHGy~h~   78 (233)
T COG3233           5 LIILLHDVSPVYWPTLSNIDAAIDEY------GAQNSTVLLVIPNHANDYPLSKDPRFVDLLTEREEEGDELVLHGYDHI   78 (233)
T ss_pred             ceEEEEecCcccchhHHHHHHHHHHh------CCCCceEEEEeeccCCCCCcccChHHHHHHHHHHhcCCEEEEechhhc
Confidence            466666765432 2223344455431      2356676 6654421       22   678888999999999999998


Q ss_pred             CCcC--CCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEEeecCC
Q psy16054        288 DDPK--YWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYDASITA  355 (563)
Q Consensus       288 ~~~~--~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YDsS~~~  355 (563)
                      +...  =+.-+...+-+.-+..+.+.|++ .|++   .+||-+|-+.+|. .++++|-++||+|-+|+..
T Consensus        79 d~~~~gEF~~l~~~eA~~RL~~a~~~l~~-~G~~---~~~FvpPrwlmS~-gt~~aL~~~~frv~a~l~~  143 (233)
T COG3233          79 DTKRRGEFACLRAHEARLRLMAAIEELEA-LGFP---LRGFVPPRWLMSE-GTRQALLENNFRVTADLRG  143 (233)
T ss_pred             cccCccccccchHHHHHHHHHHHHHHHHH-cCCc---ceeecCcceecCH-hHHHHhccCCcEEEecccc
Confidence            7210  01234455556667788888876 4665   7999999999996 6999999999999999853


No 17 
>PF09960 DUF2194:  Uncharacterized protein conserved in bacteria (DUF2194);  InterPro: IPR018695 This family of prokaryotic proteins has no known function; however it may be a membrane protein.
Probab=97.68  E-value=0.0025  Score=71.35  Aligned_cols=196  Identities=14%  Similarity=0.072  Sum_probs=122.1

Q ss_pred             CCCCCceEEEEeccCC---CH--------------HHHHHHHHcCCeecccCCCCCCCcCCCCC----------C-CHHH
Q psy16054        249 NGCQIKGTFFVSHKYT---NY--------------ASVQELHRKGHEISVFSLTHKDDPKYWSG----------G-SYDD  300 (563)
Q Consensus       249 n~~~ikaTFFv~~~~~---~y--------------~~v~~i~~~GHEIa~H~~~H~~~~~~~~~----------~-s~~~  300 (563)
                      +++++|-|.++...|.   ++              -..++|.+.|||||.|||.|+.+..  .+          | ++++
T Consensus       297 ~~y~ikyTg~~I~~Y~~~~~~pf~~~~~~~~~~~~~~g~~Ll~~ggElG~HGYNHqpL~~--~~~~~~~~~Y~~W~~~~~  374 (585)
T PF09960_consen  297 KKYGIKYTGVMIENYNDQTNPPFEFLEQEDTDRFIYFGRELLKSGGELGLHGYNHQPLTL--EGDYGDEYGYKPWPSKED  374 (585)
T ss_pred             HHcCCeEEEEEccccCCCCCCCcccccccchhhHHHHHHHHHhcCCceEEecccCCCCcC--CCcccccccCcCCCCHHH
Confidence            4689999999998763   11              2678899999999999999998752  22          3 5555


Q ss_pred             HHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHc--CCeEEeecCCCCCCCCccceeccCCCCCCcCCC
Q psy16054        301 WLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQ--FFVYDASITASLGRVPIWPYTLYFRMPHKCNGN  378 (563)
Q Consensus       301 ~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~--Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~~  378 (563)
                      +.+-|...++.++++  .+...++.+=+|.=-.+. .-+++|++.  -+.+-+|+......  --+|..+|+..      
T Consensus       375 m~~sl~~l~~f~~~l--~p~~~~~~YVPPSNils~-eG~e~L~~~~P~ik~IaS~Y~~~~~--~~~y~QEF~~~------  443 (585)
T PF09960_consen  375 MAESLKELKRFVKSL--FPNYEPSSYVPPSNILSE-EGREALKKAFPEIKTIASLYFGDDE--EGEYVQEFEIA------  443 (585)
T ss_pred             HHHHHHHHHHHHHHh--CCCcceEEecCCccccCH-HHHHHHHHhCCCeEEEEEeeecCCc--CCcceEEeeEC------
Confidence            555555555555553  344567778888877774 467777774  35677776542111  11455555431      


Q ss_pred             CCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCc
Q psy16054        379 AHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSK  458 (563)
Q Consensus       379 ~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~  458 (563)
                             .-+++|+|=..             .|.  .      .+  +  .++...+..+-.  -..+.-.+||.-+-..
T Consensus       444 -------~dgi~e~PRis-------------SG~--~------~~--~--~~~~~~~s~l~~--~Gv~sHFiHPDDvld~  489 (585)
T PF09960_consen  444 -------PDGIVEFPRIS-------------SGY--Y------PD--D--YMLWAIVSALNN--HGVFSHFIHPDDVLDE  489 (585)
T ss_pred             -------CCCeEEeCccc-------------cCC--c------cC--h--HHHHHHHHHHHh--heeeecccCCcccccc
Confidence                   12588888321             121  0      11  1  122333333221  2356777887554311


Q ss_pred             --------hhHHHHHHHHHHHHHc-CCCeEEEeHHHHHHHhh
Q psy16054        459 --------KEFKDELIKFIEEMLD-RNDVYFVTMLQVIQWMQ  491 (563)
Q Consensus       459 --------p~r~~al~~fl~~i~~-~~dVwfvT~~ei~~w~~  491 (563)
                              ..+.+.|++||++|.+ .|-...-|.+|++.-++
T Consensus       490 dR~~~~~W~~l~~~l~~~~~~i~~~~P~Lr~~T~Se~~~~~~  531 (585)
T PF09960_consen  490 DRNAGKGWEELYKSLDKFMSWIKESYPWLRNLTASEAAGALQ  531 (585)
T ss_pred             ccCCCCCHHHHHHHHHHHHHHHHHhCCCcccCcHHHHHHHHH
Confidence                    2367899999999986 68899999999987554


No 18 
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=97.50  E-value=4.9e-05  Score=52.60  Aligned_cols=33  Identities=15%  Similarity=0.409  Sum_probs=28.4

Q ss_pred             cCcccCCCCceecCCCCCcCCCcCCCCCCccCCCCCc
Q psy16054         96 LKQISCPSGLAFDVDKQTCDWKAKVTNCNQLDRPRKV  132 (563)
Q Consensus        96 ~~~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde~~~  132 (563)
                      +.+|+|.+|.|| +..++||   |+.||.|+|||..|
T Consensus         3 ~~~f~C~~~~Ci-~~~~~CD---g~~DC~dgsDE~~C   35 (35)
T cd00112           3 PNEFRCANGRCI-PSSWVCD---GEDDCGDGSDEENC   35 (35)
T ss_pred             CCeEEcCCCCee-CHHHcCC---CccCCCCCcccccC
Confidence            358999999999 6667899   89999999999743


No 19 
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=97.49  E-value=9e-05  Score=69.78  Aligned_cols=67  Identities=27%  Similarity=0.416  Sum_probs=54.3

Q ss_pred             CcccCCCCcee-cCCCCCcCCCcCCCCCCccCCCCCccCcCCCCCCCCCCCCcccCCC----ceecCccccCCCCC---C
Q psy16054         97 KQISCPSGLAF-DVDKQTCDWKAKVTNCNQLDRPRKVLPILKTDEPVCPEGKLSCGNG----ECIDKTLFCDDKPD---C  168 (563)
Q Consensus        97 ~~~~C~~G~~~-~~~~~~CD~~~~v~dC~d~sde~~~~p~~~~~~~~C~~~~f~C~~G----~Ci~~~~~Cdg~~D---C  168 (563)
                      ..|.|-+|.-+ .+-.++.|   +.=||.|||||..        ...|..+.|.|.|.    .-|+..++-||.=|   |
T Consensus        36 ~~f~Cl~~~~~~I~~~~iND---dyCDC~DGSDEPG--------TsAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~C  104 (176)
T PF12999_consen   36 GKFTCLDGSKIVIPFSQIND---DYCDCPDGSDEPG--------TSACSNGKFYCENKGHIPRYIPSSRVNDGICDYDIC  104 (176)
T ss_pred             CceEecCCCCceecHHHccC---cceeCCCCCCccc--------cccCcCceEeeccCCCCCceeehhhhcCCcCccccc
Confidence            35888777543 46667788   6778999999962        23698899999874    68999999999999   9


Q ss_pred             CCCCcc
Q psy16054        169 KDESDE  174 (563)
Q Consensus       169 ~DgsDE  174 (563)
                      =|||||
T Consensus       105 CDGSDE  110 (176)
T PF12999_consen  105 CDGSDE  110 (176)
T ss_pred             CCCCCC
Confidence            999999


No 20 
>PF00057 Ldl_recept_a:  Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor;  InterPro: IPR002172  The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing:    The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins [].      The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor [].     The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains.     The fourth domain is the hydrophobic transmembrane region.     The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits.   LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=97.48  E-value=5e-05  Score=53.24  Aligned_cols=33  Identities=15%  Similarity=0.426  Sum_probs=28.6

Q ss_pred             cCcccCCCCceecCCCCCcCCCcCCCCCCccCCCCCc
Q psy16054         96 LKQISCPSGLAFDVDKQTCDWKAKVTNCNQLDRPRKV  132 (563)
Q Consensus        96 ~~~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde~~~  132 (563)
                      ..+|+|.+|.|| +..++||   ++.||.|+|||.+|
T Consensus         5 ~~~f~C~~~~CI-~~~~~CD---g~~DC~dgsDE~~C   37 (37)
T PF00057_consen    5 PGEFRCGNGQCI-PKSWVCD---GIPDCPDGSDEQNC   37 (37)
T ss_dssp             TTEEEETTSSEE-EGGGTTS---SSCSSSSSTTTSSH
T ss_pred             CCeeEcCCCCEE-ChHHcCC---CCCCCCCCcccccC
Confidence            568999999999 5567899   89999999999754


No 21 
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=97.25  E-value=0.00037  Score=52.56  Aligned_cols=50  Identities=36%  Similarity=0.774  Sum_probs=41.2

Q ss_pred             cCCCCCCCCceecCCCCCCCCeeeCCCCCCCCccccceeeCCCCccccCCCCCCCCCCC
Q psy16054         28 LCEGRPPEEYFRLTIEGDCRDVVRCDQAGENGITRLAGVRCPNGLAFDIDRQTCDWKTN   86 (563)
Q Consensus        28 ~C~~~~~~~~f~~~~~~~C~~~~~C~~~~~~~~~~~~~~~Cp~Gl~~d~~~~~Cd~~~~   86 (563)
                      .|..+. .++|..+  .+|..|+.|..+.      ...+.||.|+.|+...+.|++++.
T Consensus         2 ~C~~~~-~g~~~~~--~~C~~y~~C~~~~------~~~~~C~~g~~f~~~~~~C~~~~~   51 (56)
T smart00494        2 ECPGRG-DGLYPHP--TDCSKYYQCSNGR------PIVGSCPAGLVFDPATQTCDWPDN   51 (56)
T ss_pred             cCCCCC-CccccCc--ccCCeeEEcCCCc------EEeccCcCCCeECCCCCCcCCccc
Confidence            466633 5676654  4899999999997      889999999999999999998864


No 22 
>KOG1215|consensus
Probab=97.06  E-value=0.0015  Score=77.56  Aligned_cols=75  Identities=23%  Similarity=0.626  Sum_probs=62.3

Q ss_pred             cCcccCCC-CceecCCCCCcCCCcCCCCCCccCCCC--CccCcCCCCCCCCCCCCcccCCCceecCccccCCCCCCCCCC
Q psy16054         96 LKQISCPS-GLAFDVDKQTCDWKAKVTNCNQLDRPR--KVLPILKTDEPVCPEGKLSCGNGECIDKTLFCDDKPDCKDES  172 (563)
Q Consensus        96 ~~~~~C~~-G~~~~~~~~~CD~~~~v~dC~d~sde~--~~~p~~~~~~~~C~~~~f~C~~G~Ci~~~~~Cdg~~DC~Dgs  172 (563)
                      ...++|.. +.|+ ..++.||   +..+|.+++||.  .+      ....|...++.|.++.|++..++|||..||.||+
T Consensus       220 ~~~~~c~g~~~~i-~~~~~~D---g~~dc~~~~de~~~~~------~~~~~~~~e~~~~~~~~~~~~~~~~g~~d~pdg~  289 (877)
T KOG1215|consen  220 IEVTRCDGSSRCI-LISEVCD---GPRDCVDGPDEGVMNC------SDATCEAPEIECADGDCSDRQKLCDGDLDCPDGL  289 (877)
T ss_pred             eeEEEecCCCcEE-eehhccC---CCcccccCCcCceeEe------eccccCCcceeecCCCCccceEEecCccCCCCcc
Confidence            35788877 4888 5666799   888999999995  32      2236888899999999999999999999999999


Q ss_pred             cccccCCC
Q psy16054        173 DENACSVE  180 (563)
Q Consensus       173 DE~~C~~~  180 (563)
                      ||..|...
T Consensus       290 de~~~~~~  297 (877)
T KOG1215|consen  290 DEDYCKKK  297 (877)
T ss_pred             cccccccc
Confidence            99988743


No 23 
>KOG1215|consensus
Probab=97.02  E-value=0.0012  Score=78.29  Aligned_cols=118  Identities=21%  Similarity=0.381  Sum_probs=78.7

Q ss_pred             cCcccCC--CCceecCCCCCcCCCcCCCCCCccCCCCCccCcCCCCC-------------------------------CC
Q psy16054         96 LKQISCP--SGLAFDVDKQTCDWKAKVTNCNQLDRPRKVLPILKTDE-------------------------------PV  142 (563)
Q Consensus        96 ~~~~~C~--~G~~~~~~~~~CD~~~~v~dC~d~sde~~~~p~~~~~~-------------------------------~~  142 (563)
                      ...|+|.  ++.|+ +..|.||   +..+|.+++||..|.... ...                               +.
T Consensus       140 ~~~~~c~~~~~~Ci-p~~~~cd---~~~~C~dg~de~~~~~~~-~~~~~~~~~~~~~~~~~d~~~~~~~~~d~~~~~~~~  214 (877)
T KOG1215|consen  140 LDKFSCRTGSCKCI-PGDWLCD---GEADCPDGSDELNCAVRR-CEPRGASLDCIVAIKVCDIQHDCADDYDESEGRIYW  214 (877)
T ss_pred             CCCCCCcCccccCC-CCceeCC---CCCccccchhhhcccccc-cCccccccccceeeeecCcccccccccccccCcccc
Confidence            5688998  78899 6667799   777999999998775110 000                               00


Q ss_pred             CC---CCCcccCC-CceecCccccCCCCCCCCCCccc--ccCCCCCCCCCCCCCCCCCCCCCcccCCCCCcccCCCCCCC
Q psy16054        143 CP---EGKLSCGN-GECIDKTLFCDDKPDCKDESDEN--ACSVEQDPNRAPDCDPTQCALPDCFCSADGTRIPGGIEPNQ  216 (563)
Q Consensus       143 C~---~~~f~C~~-G~Ci~~~~~Cdg~~DC~DgsDE~--~C~~~~~p~~~~~C~~~~C~~p~~~c~~~g~ciP~~~~~~~  216 (563)
                      +.   ...++|.. ++||...+.|||..||.|++||.  .|...........|....|.--..+|...-.| |.++....
T Consensus       215 ~~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc~~~~de~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~g~~d~-pdg~de~~  293 (877)
T KOG1215|consen  215 TDDSRIEVTRCDGSSRCILISEVCDGPRDCVDGPDEGVMNCSDATCEAPEIECADGDCSDRQKLCDGDLDC-PDGLDEDY  293 (877)
T ss_pred             cCCcceeEEEecCCCcEEeehhccCCCcccccCCcCceeEeeccccCCcceeecCCCCccceEEecCccCC-CCcccccc
Confidence            11   35788876 49999999999999999999994  56554333344456666665444556655556 65555544


Q ss_pred             CCC
Q psy16054        217 VPQ  219 (563)
Q Consensus       217 ~pq  219 (563)
                      .++
T Consensus       294 ~~~  296 (877)
T KOG1215|consen  294 CKK  296 (877)
T ss_pred             ccc
Confidence            443


No 24 
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that  plays a central role in mammalian cholesterol metabolism. The N-terminal  type A repeats in LDL receptor bind the lipoproteins. Other homologous  domains occur in related receptors, including the very low-density  lipoprotein receptor and the LDL receptor-related protein/alpha  2-macroglobulin receptor, and in proteins which are functionally  unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=96.83  E-value=0.00066  Score=46.28  Aligned_cols=29  Identities=21%  Similarity=0.464  Sum_probs=25.8

Q ss_pred             CcccCCCCceecCCCCCcCCCcCCCCCCccCCC
Q psy16054         97 KQISCPSGLAFDVDKQTCDWKAKVTNCNQLDRP  129 (563)
Q Consensus        97 ~~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde  129 (563)
                      .+|+|.+|.|| +..++||   |+.||.|+|||
T Consensus         5 ~~f~C~~~~Ci-~~~~~Cd---g~~dC~dgsDE   33 (33)
T smart00192        5 GEFQCDNGRCI-PLSWVCD---GVDDCSDGSDE   33 (33)
T ss_pred             CeEECCCCCEE-CchhhCC---CcCcCcCCCCC
Confidence            48999999999 6677899   89999999997


No 25 
>PF04748 Polysacc_deac_2:  Divergent polysaccharide deacetylase;  InterPro: IPR006837 This is a family of uncharacterised proteins that includes YibQ.; PDB: 2QV5_A 2NLY_A.
Probab=96.38  E-value=0.024  Score=55.88  Aligned_cols=182  Identities=13%  Similarity=0.093  Sum_probs=89.1

Q ss_pred             CCCceEEEEeccCCCH-HHHHHHHHcCCeecccCC------CCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcc
Q psy16054        251 CQIKGTFFVSHKYTNY-ASVQELHRKGHEISVFSL------THKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSV  323 (563)
Q Consensus       251 ~~ikaTFFv~~~~~~y-~~v~~i~~~GHEIa~H~~------~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~  323 (563)
                      -+.+.||-|.-...+- +.++..+++||||-.|=-      .+.........++.++++.-+..+.   .+   +|  ..
T Consensus        19 Lp~pvT~ai~P~~~~~~~~a~~a~~~G~EvllhlPMep~~~~~~gp~~L~~~~~~~~i~~~l~~al---~~---vp--~a   90 (213)
T PF04748_consen   19 LPFPVTFAILPYAPYSREWAERARAAGHEVLLHLPMEPKGYKDPGPGALLTGMSEEEIRKRLEAAL---AR---VP--GA   90 (213)
T ss_dssp             SSTTCEEEEETTSTTHHHHHHHHHHCT-EEEEEEEE--TTTT---TT-B-TTS-HHHHHHHHHHHH---CC---ST--T-
T ss_pred             CCCCeEEEECCCCCChHHHHHHHHHcCCEEEEeCCCCCCCCCCcccccccCCCCHHHHHHHHHHHH---HH---CC--Cc
Confidence            4789999987776655 778888899999998842      1111111113467666655444433   32   22  24


Q ss_pred             eEEecCccc---cCh---hhHHHHHHHcCCeEEeecCCCCCCCCccceeccCCCCCCcCCCCCCCCCCCCCcEEEecccc
Q psy16054        324 IGVRAPYLR---VGG---NKQFSMMTDQFFVYDASITASLGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNEL  397 (563)
Q Consensus       324 ~G~R~P~l~---~~~---~~~~~~L~e~Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~~~~pcp~~~~~lweiP~~~l  397 (563)
                      +|.-...++   .+.   ..+++.|++.|+-|--|.+.+.+  -.+-....+|                     +|... 
T Consensus        91 ~GvnNhmGS~~T~~~~~m~~vl~~l~~~gl~FvDS~T~~~s--~a~~~A~~~g---------------------vp~~~-  146 (213)
T PF04748_consen   91 VGVNNHMGSRFTSDREAMRWVLEVLKERGLFFVDSRTTPRS--VAPQVAKELG---------------------VPAAR-  146 (213)
T ss_dssp             SEEEEEE-CCHHC-HHHHHHHHHHHHHTT-EEEE-S--TT---SHHHHHHHCT-----------------------EEE-
T ss_pred             EEEecCCCccccCCHHHHHHHHHHHHHcCCEEEeCCCCccc--HHHHHHHHcC---------------------CCEEe-
Confidence            555443332   111   24688999999888755544211  0000001111                     12110 


Q ss_pred             ccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCC
Q psy16054        398 DRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRND  477 (563)
Q Consensus       398 ~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~d  477 (563)
                       + +.           ++|   +..+.+.+.+.|......... ++ ....--| ++    |.-+++|++++..+. ..+
T Consensus       147 -r-dv-----------fLD---~~~~~~~I~~ql~~~~~~A~~-~G-~aI~Igh-~~----p~Tl~~L~~~~~~l~-~~g  202 (213)
T PF04748_consen  147 -R-DV-----------FLD---NDQDEAAIRRQLDQAARIARK-QG-SAIAIGH-PR----PETLEALEEWLPELE-AQG  202 (213)
T ss_dssp             ---SE-----------ETT---ST-SHHHHHHHHHHHHHHHHC-CS-EEEEEEE--S----CCHHHHHHHHHHHHH-HCT
T ss_pred             -e-ce-----------ecC---CCCCHHHHHHHHHHHHHhhhh-cC-cEEEEEc-CC----HHHHHHHHHHHhHHh-hCC
Confidence             0 00           112   123455555555544433333 22 2222336 42    668999999999998 568


Q ss_pred             eEEEeHHHHHH
Q psy16054        478 VYFVTMLQVIQ  488 (563)
Q Consensus       478 VwfvT~~ei~~  488 (563)
                      |-+|+.++++.
T Consensus       203 i~lVpvS~llr  213 (213)
T PF04748_consen  203 IELVPVSALLR  213 (213)
T ss_dssp             EEE--HHHH--
T ss_pred             EEEEEhHHhhC
Confidence            99999999863


No 26 
>PF03065 Glyco_hydro_57:  Glycosyl hydrolase family 57;  InterPro: IPR004300 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 57 GH57 from CAZY comprises enzymes with two known activities; alpha-amylase (3.2.1.1 from EC) and 4-alpha-glucanotransferase (2.4.1 from EC).; GO: 0003824 catalytic activity, 0005975 carbohydrate metabolic process; PDB: 1K1X_B 1K1W_A 1K1Y_A 3N98_A 3N8T_A 3N92_A 1UFA_A 3P0B_A.
Probab=95.74  E-value=0.023  Score=60.45  Aligned_cols=195  Identities=16%  Similarity=0.141  Sum_probs=110.9

Q ss_pred             HHHHHHHHHcCC-eecccCCCCCCCcCCCCCCCH-HHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHH
Q psy16054        266 YASVQELHRKGH-EISVFSLTHKDDPKYWSGGSY-DDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMT  343 (563)
Q Consensus       266 y~~v~~i~~~GH-EIa~H~~~H~~~~~~~~~~s~-~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~  343 (563)
                      -+.++++.++|+ ||-.-+++|.-++.    ++. ++.+.+|...++.+++..|.   .|+||..|-..++. .+.++|+
T Consensus       113 i~~~~~l~~~G~iEll~~~~~h~ilpl----~~~~~d~~~Qi~~~~~~~~~~FG~---~P~G~W~pE~a~~~-~l~~~l~  184 (360)
T PF03065_consen  113 IEAFRELAERGQIELLTSPYYHPILPL----LPDPEDFRAQIEMGREYFKKHFGR---RPRGFWLPECAYSP-GLEEILA  184 (360)
T ss_dssp             HHHHHHHHHTTSEEEEEE-TT-B-GGG----SSHHHHHHHHHHHHHHHHHHHHSS-----SBEE-GGG-B-T-THHHHHH
T ss_pred             HHHHHHHHHCCCEEEEeCchHHHHHHH----hCCHHHHHHHHHHHHHHHHHHhCC---CCCceECcccccCH-HHHHHHH
Confidence            468899999997 99999999998863    566 99999999999999999994   58999999999995 6999999


Q ss_pred             HcCCeEEeecCCC------CCCCCccceeccCCCCCCcCCCCCCCCCCCCCcEEEeccc-cccCCCCCCCCCCCCccccC
Q psy16054        344 DQFFVYDASITAS------LGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNE-LDRRDDPTFDESLPGCHMVD  416 (563)
Q Consensus       344 e~Gf~YDsS~~~~------~~~~p~wPytld~~~p~~c~~~~~pcp~~~~~lweiP~~~-l~~~d~p~~~~~~~g~~~~d  416 (563)
                      ++||.|--.-...      ....++.||....              ....+|..++.+. +.+  ...|..  .|.+   
T Consensus       185 ~~Gi~~~i~d~~~~~~~~~~~~~~y~py~~~~--------------~~~~~l~vf~rd~~ls~--~I~f~~--~g~~---  243 (360)
T PF03065_consen  185 EAGIRYTILDGHQLLGAGSPPNNPYRPYRVED--------------QGGSGLAVFFRDQELSD--DIWFRY--SGYP---  243 (360)
T ss_dssp             HTT--EEEEECHHHHHHT--CGGCSSEEEECE---------------TTCSEEEEEB-HHHHH--HHHTTT--TSGG---
T ss_pred             HcCCEEEEECcHHhCCCCCccCCCCceeEeec--------------CCCceEEEEecCHHHHH--hcCCcc--cccc---
Confidence            9999987543211      0123455664321              0122377777653 211  111211  1111   


Q ss_pred             CCCCCCChHHHHHHHHHHHHHHHccCCc-ceEEeeccccc-----cCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHh
Q psy16054        417 SCSNIQTGEQFGRLLRHNFNRHFSTNRA-PLGLHFHASWL-----KSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWM  490 (563)
Q Consensus       417 ~~~~~~~~~~~~~~l~~~F~~~y~~nra-p~~l~lH~~~l-----~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~  490 (563)
                         ...+..++++.+...+......... .+.+.+-..-+     ......++.|.+.+..+.+.+.|-++|.+|+++=+
T Consensus       244 ---~~~~~~~~~~~l~~~~~~~~~~~~~~~v~i~~DgE~~~~~~~~~~~~~l~~l~~~~~~~~~~~~i~~~t~se~l~~~  320 (360)
T PF03065_consen  244 ---GDPAYREFYRYLLYEYASWISYDPGPVVVIALDGENFWGHYWDEGEGFLEFLYQLLEELHSRGFIETVTPSEYLERH  320 (360)
T ss_dssp             ---GSTTSBHSSCHHC----HC-BS-TT-EEEECCEGGGCCCCCCCCTCEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             ---cccCHHHHHHhhhhhhhhhhccCCCCeEEEeECCcccCCcCCcccCCHHHHHHHHHHHhhcCCeEEecCHHHHHhhC
Confidence               1223445555444113333332222 23343321111     22345777777777777766668899999998865


Q ss_pred             hC
Q psy16054        491 QN  492 (563)
Q Consensus       491 ~~  492 (563)
                      ..
T Consensus       321 ~~  322 (360)
T PF03065_consen  321 PP  322 (360)
T ss_dssp             ST
T ss_pred             CC
Confidence            33


No 27 
>PF11959 DUF3473:  Domain of unknown function (DUF3473);  InterPro: IPR022560  This domain, found in bacteria and archaea, is functionally uncharacterised. It is about 130 amino acids in length and is found C-terminal to PF01522 from PFAM. It contains two completely conserved residues (P and H) that may be functionally important. 
Probab=94.81  E-value=0.13  Score=46.82  Aligned_cols=55  Identities=24%  Similarity=0.299  Sum_probs=31.3

Q ss_pred             HHHHHHHHccCCcceEEeecccccc-CchhH--------------HHHHHHHHHHHHcCCCeEEEeHHHHHH
Q psy16054        432 RHNFNRHFSTNRAPLGLHFHASWLK-SKKEF--------------KDELIKFIEEMLDRNDVYFVTMLQVIQ  488 (563)
Q Consensus       432 ~~~F~~~y~~nrap~~l~lH~~~l~-g~p~r--------------~~al~~fl~~i~~~~dVwfvT~~ei~~  488 (563)
                      +..+.+....++.|+++|||+..|. .+|..              ++.+++=|+.|.  .|.-|+|+.|+++
T Consensus        62 ~~~~~~~~~~~~~~~~~YfHPwE~dp~qpr~~~~~~~~rf~~y~~l~~~~~rl~~Ll--~~f~f~t~~~~~~  131 (133)
T PF11959_consen   62 RWLIRRINRRGGQPAVFYFHPWEFDPDQPRVPGAPLKSRFRHYNNLDRMEKRLDRLL--SDFRFGTMREVLA  131 (133)
T ss_pred             HHHHHHHHhCCCcceEEEEeceecCCCCcccCCCCcceeEEEEcCHHHHHHHHHHHH--hhCceEEHHHHhh
Confidence            3333333333447999999966663 22210              133333344443  5788999999975


No 28 
>COG1449 Alpha-amylase/alpha-mannosidase [Carbohydrate transport and metabolism]
Probab=94.21  E-value=0.83  Score=51.99  Aligned_cols=80  Identities=16%  Similarity=0.241  Sum_probs=65.6

Q ss_pred             HHHHHHHHcCC-eecccCCCCCCCcCCCCCC-CHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHH
Q psy16054        267 ASVQELHRKGH-EISVFSLTHKDDPKYWSGG-SYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTD  344 (563)
Q Consensus       267 ~~v~~i~~~GH-EIa~H~~~H~~~~~~~~~~-s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e  344 (563)
                      ++.|++.+.|+ |+-.+++.|.-.+-+ .+. -.+++.+++...+++..++.|.   .++|||+|.+.++ +.+++++++
T Consensus       183 ~~~r~l~e~g~~e~~~~pyyHsl~pl~-~d~gw~~d~~~qv~~~~~~~~elfG~---~p~~~~~tEl~y~-~~i~~~~~e  257 (615)
T COG1449         183 EAFRELAESGKVELTASPYYHSLIPLL-ADDGWYEDFKEQVMMSRELYKELFGV---WPSGFWNTELAYN-DQILEYFEE  257 (615)
T ss_pred             HHHHHHHhcCceEEEecccccccchhc-ccCCchHHHHHHHHHHHHHHHHHhCC---CCccccChhhhcc-HHHHHHHHH
Confidence            57888888886 999999999876532 112 1347888999999999999996   5899999999999 579999999


Q ss_pred             cCCeEEe
Q psy16054        345 QFFVYDA  351 (563)
Q Consensus       345 ~Gf~YDs  351 (563)
                      .||.|-.
T Consensus       258 ~G~~~~~  264 (615)
T COG1449         258 SGFSWIA  264 (615)
T ss_pred             cCCcccc
Confidence            9999743


No 29 
>PF15421 Polysacc_deac_3:  Putative polysaccharide deacetylase; PDB: 4DWE_A.
Probab=92.48  E-value=3.3  Score=43.83  Aligned_cols=173  Identities=16%  Similarity=0.300  Sum_probs=75.0

Q ss_pred             CHHHHHHHHHcCCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHH-HHHHHhCCCCCcceEEecCccccChhhHHHHHH
Q psy16054        265 NYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRL-IIERFANISDGSVIGVRAPYLRVGGNKQFSMMT  343 (563)
Q Consensus       265 ~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~-~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~  343 (563)
                      .|+.|++|.+-|.+||.|.+.+...     ..+.+.++.+..-+.. +++++.|..   .+-+..|    +++...- .+
T Consensus       131 ~W~evkem~nyG~~ia~Hdv~~~~~-----~~t~~~~~~~~~i~~~~i~~~l~GR~---~K~l~eP----nGn~~Yi-~a  197 (425)
T PF15421_consen  131 SWDEVKEMLNYGTGIAFHDVNTYDV-----DTTDDQILAHFCIAQNSIFKNLSGRG---CKMLAEP----NGNKKYI-TA  197 (425)
T ss_dssp             -HHHHHHHHTTT-EEEB---SS-GG-----G-SHHHHHHHHHHHHHHHHHHTTT-------EEE------TT-HHHH-HH
T ss_pred             cHHHHHHHHhcCcceEecccccccc-----cccchhhhhhhhhHHHHHHHHhcCcc---cceeeCC----CCCHHHH-HH
Confidence            5689999999999999999988764     2354555555443333 556666633   3334445    3332221 12


Q ss_pred             HcCCeEEeecCCCCCCCCccceeccCCCCCCcCCCCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCC
Q psy16054        344 DQFFVYDASITASLGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQT  423 (563)
Q Consensus       344 e~Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~  423 (563)
                      .+++.=--++.+..|...++|+.-+..+                .=..++     +    .|         ++    ...
T Consensus       198 a~~~d~Iqtitaq~G~~kv~pf~~~~dL----------------~k~~~~-----R----~F---------~~----~~~  239 (425)
T PF15421_consen  198 ALVYDPIQTITAQSGAIKVYPFKVNGDL----------------EKVVIE-----R----GF---------YD----SPD  239 (425)
T ss_dssp             HTTSTT--EEEESSS-B---TTT--S------------------TTEEEE----------B------------------T
T ss_pred             HHcCCchhhhHhhcCCcceeccccCCch----------------hhheee-----e----ee---------cC----Ccc
Confidence            2233222345555555566776533211                001111     1    01         11    122


Q ss_pred             hHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHc------CCCeEEEeHHHHHH--HhhCCch
Q psy16054        424 GEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLD------RNDVYFVTMLQVIQ--WMQNPTE  495 (563)
Q Consensus       424 ~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~------~~dVwfvT~~ei~~--w~~~p~~  495 (563)
                      .+++.+.|.+.....- .+|.-..|+.|-     +...+..   ||.||..      .+.|||.+.+|.-+  |||.-..
T Consensus       240 ~~~~k~~I~~~l~~p~-e~r~ai~iG~h~-----td~~W~~---fl~wLn~~YG~~G~D~lWfp~~eEyyEY~Y~R~~~~  310 (425)
T PF15421_consen  240 NDQYKQDIEDQLKNPV-EEREAINIGVHG-----TDSNWED---FLLWLNDTYGKDGDDSLWFPSQEEYYEYNYYRMHTK  310 (425)
T ss_dssp             HHHHHHHHHHHHTS-G-GG---EEEEESS-------HHHHH---HHHHHHHHTSTTTT--EEEE-HHHHHHHHHHHHH--
T ss_pred             hHHHHHHHHHHhccch-hchhheeecccC-----CCCcHHH---HHHHHHHHhCcCCCcccccCCHHHHHHHHHHhhccc
Confidence            4556666666554333 346556899992     2334443   7777765      46699999999998  5555444


Q ss_pred             hh
Q psy16054        496 LT  497 (563)
Q Consensus       496 ~~  497 (563)
                      ++
T Consensus       311 i~  312 (425)
T PF15421_consen  311 IE  312 (425)
T ss_dssp             -E
T ss_pred             ee
Confidence            43


No 30 
>COG2861 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=92.42  E-value=4.4  Score=40.33  Aligned_cols=212  Identities=13%  Similarity=0.146  Sum_probs=109.4

Q ss_pred             CCCCCCCCceEEeecC-CcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC-CHHHHHHHHHcCCeecccCC----C
Q psy16054        212 IEPNQVPQMITITFNG-AVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT-NYASVQELHRKGHEISVFSL----T  285 (563)
Q Consensus       212 ~~~~~~pq~V~lTFDD-~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~-~y~~v~~i~~~GHEIa~H~~----~  285 (563)
                      |.....|+ ++|-.|| |++....+.   |++        -+.+.|.-+..... .-+..++++++||||-.|--    +
T Consensus        23 ~~~~~~~k-laIVIddlG~~~~~~~~---i~~--------Lp~eVTlA~~P~~~~~~~~~~~A~~~G~evlih~PmeP~~   90 (250)
T COG2861          23 WSGARGPK-LAIVIDDLGLSQTGTQA---ILA--------LPPEVTLAFAPFAPHAREWAQKARNAGHEVLIHMPMEPFS   90 (250)
T ss_pred             CCcCCCce-EEEEECCccccHHHHHH---HHh--------CCccceEEecCCCchhHHHHHHHHhcCCEEEEeccCCccc
Confidence            33344454 6777888 666554443   333        14555654433332 23788999999999999963    2


Q ss_pred             CCCCc--CCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEEeecCCCCCCCCcc
Q psy16054        286 HKDDP--KYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYDASITASLGRVPIW  363 (563)
Q Consensus       286 H~~~~--~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YDsS~~~~~~~~p~w  363 (563)
                      +....  ..-..++.++..+.|..+...+..+.|+.  .-.|-|.-.-...=+..++.|++.|+-|--|.+...      
T Consensus        91 ~~~~e~gtL~~~~s~~e~~~rl~~a~~~v~~~~Gln--NhmGs~~tsn~~aM~~~m~~Lk~r~l~flDs~T~a~------  162 (250)
T COG2861          91 YPKIEPGTLRPGMSAEEILRRLRKAMNKVPDAVGLN--NHMGSRFTSNEDAMEKLMEALKERGLYFLDSGTIAN------  162 (250)
T ss_pred             CCCCCCCCcccCCCHHHHHHHHHHHHhhCccceeeh--hhhhhhhcCcHHHHHHHHHHHHHCCeEEEccccccc------
Confidence            22111  00124677777777777666655444432  122333211010002357788888866544443211      


Q ss_pred             ceeccCCCCCCcCCCCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHc---
Q psy16054        364 PYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFS---  440 (563)
Q Consensus       364 Pytld~~~p~~c~~~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~---  440 (563)
                        ++--++               -.=.+||+-  .+ |.           ++|   +..+..++.    .+|+++-.   
T Consensus       163 --S~a~~i---------------Ak~~gVp~~--~r-dv-----------fLD---~e~~~~~V~----kql~~~~~~Ar  204 (250)
T COG2861         163 --SLAGKI---------------AKEIGVPVI--KR-DV-----------FLD---DEDTEAAVL----KQLDAAEKLAR  204 (250)
T ss_pred             --chhhhh---------------HhhcCCcee--ee-ee-----------eec---CcCCHHHHH----HHHHHHHHHHH
Confidence              000000               001123331  11 10           112   123444444    44444332   


Q ss_pred             cCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHH
Q psy16054        441 TNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQ  488 (563)
Q Consensus       441 ~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~  488 (563)
                      .|+..++|. | +    .+.-+++|++.++.+. -.+|-+|..++++.
T Consensus       205 k~G~ai~IG-h-~----~~~Tv~vl~~~~~~l~-~~gIelV~~s~L~~  245 (250)
T COG2861         205 KNGSAIGIG-H-P----HKNTVAVLQQWLDELP-ARGIELVPVSALLG  245 (250)
T ss_pred             hcCceEEec-C-C----chhHHHHHHHHHHhCC-CCCeEEecHHHhhc
Confidence            234344333 5 3    3678999999999986 46999999998864


No 31 
>KOG2397|consensus
Probab=92.30  E-value=0.099  Score=56.30  Aligned_cols=70  Identities=27%  Similarity=0.377  Sum_probs=51.3

Q ss_pred             cccCCCCceecCCCCCcCCCcCCCCCCccCCCCCccCcCCCCCCCCCCCCcccCCC----ceecCccccCCCCCCCCCCc
Q psy16054         98 QISCPSGLAFDVDKQTCDWKAKVTNCNQLDRPRKVLPILKTDEPVCPEGKLSCGNG----ECIDKTLFCDDKPDCKDESD  173 (563)
Q Consensus        98 ~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde~~~~p~~~~~~~~C~~~~f~C~~G----~Ci~~~~~Cdg~~DC~DgsD  173 (563)
                      .|.|-.|.-...-.++=|   +.=||.|||||..        ...|+.+.|.|.|-    .=|+..-+-||.-||-||||
T Consensus        44 ~~~CLdgs~~i~f~qlND---d~CDC~DGsDEPG--------tsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSD  112 (480)
T KOG2397|consen   44 MFKCLDGSKTISFSQLND---DSCDCLDGSDEPG--------TSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSD  112 (480)
T ss_pred             ceeeccCCcccCHHHhcc---ccccCCCCCCCCc--------cccCCCCceeeeecCCCceeeechhccCcccccccCCC
Confidence            566666554333344444   3446999999952        23699999999862    58889999999999999999


Q ss_pred             ccccC
Q psy16054        174 ENACS  178 (563)
Q Consensus       174 E~~C~  178 (563)
                      |..-+
T Consensus       113 E~~Sg  117 (480)
T KOG2397|consen  113 EYLSG  117 (480)
T ss_pred             CccCC
Confidence            96544


No 32 
>PF12999 PRKCSH-like:  Glucosidase II beta subunit-like
Probab=86.05  E-value=0.57  Score=44.45  Aligned_cols=33  Identities=39%  Similarity=0.634  Sum_probs=27.9

Q ss_pred             CCCCcccCCCc--eecCccccCCCCCCCCCCcccc
Q psy16054        144 PEGKLSCGNGE--CIDKTLFCDDKPDCKDESDENA  176 (563)
Q Consensus       144 ~~~~f~C~~G~--Ci~~~~~Cdg~~DC~DgsDE~~  176 (563)
                      ..+.|+|-+|.  =|+.....|+.=||.|||||-.
T Consensus        34 ~~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPG   68 (176)
T PF12999_consen   34 ENGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPG   68 (176)
T ss_pred             CCCceEecCCCCceecHHHccCcceeCCCCCCccc
Confidence            45679998874  3899999999999999999943


No 33 
>PF01607 CBM_14:  Chitin binding Peritrophin-A domain;  InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=76.85  E-value=1.3  Score=32.65  Aligned_cols=28  Identities=36%  Similarity=0.689  Sum_probs=18.7

Q ss_pred             cCcccCCCCceecCCCCCcCCCcCCCCC
Q psy16054         96 LKQISCPSGLAFDVDKQTCDWKAKVTNC  123 (563)
Q Consensus        96 ~~~~~C~~G~~~~~~~~~CD~~~~v~dC  123 (563)
                      +...+|+.|+.||+..++|++++.|..|
T Consensus        26 ~~~~~C~~g~~fd~~~~~C~~~~~~~~C   53 (53)
T PF01607_consen   26 AVEQRCPEGLYFDPSSQRCVPPSNVVQC   53 (53)
T ss_dssp             EEEEE-TTS-EE-TTTSSEE-TTT-TT-
T ss_pred             EECCCCcCCCEECcCcCEEcCCccCCCC
Confidence            4589999999999999999988764444


No 34 
>PF01074 Glyco_hydro_38:  Glycosyl hydrolases family 38 N-terminal domain;  InterPro: IPR000602 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 38 GH38 from CAZY comprises enzymes with only one known activity; alpha-mannosidase (3.2.1.24 from EC) (3.2.1.114 from EC). Lysosomal alpha-mannosidase is necessary for the catabolism of N-linked carbohydrates released during glycoprotein turnover. The enzyme catalyses the hydrolysis of terminal, non-reducing alpha-D-mannose residues in alpha-D-mannosides, and can cleave all known types of alpha-mannosidic linkages. Defects in the gene cause lysosomal alpha-mannosidosis (AM), a lysosomal storage disease characterised by the accumulation of unbranched oligo-saccharide chains.; GO: 0004559 alpha-mannosidase activity, 0005975 carbohydrate metabolic process; PDB: 2WYI_A 2WYH_A 1O7D_A 3CZN_A 2FYV_A 3D50_A 3EJU_A 3EJS_A 3DX3_A 3BVX_A ....
Probab=72.07  E-value=22  Score=35.97  Aligned_cols=80  Identities=10%  Similarity=0.086  Sum_probs=54.6

Q ss_pred             HHHHHHHHcCC-eecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHHH
Q psy16054        267 ASVQELHRKGH-EISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMTD  344 (563)
Q Consensus       267 ~~v~~i~~~GH-EIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~e  344 (563)
                      +.+++++++|- ||..=+|.-.+..    -.+.|.+.+.|...++.+++..|+.  ..+||-+ +||...  .+.++|+.
T Consensus        64 ~~~~~lv~~Gri~~vgg~~~~~D~~----l~~~Eslirql~~G~~~~~~~fg~~--~~~~~~~D~FG~~~--~lP~il~~  135 (275)
T PF01074_consen   64 KRIKKLVKEGRIEIVGGWYVQPDEN----LPSGESLIRQLLYGHKYLRKEFGVR--PKVAWQPDSFGHSA--QLPQILKQ  135 (275)
T ss_dssp             HHHHHHHHTTSEEESSSBSS-B-SS----SS-HHHHHHHHHHHHHHHHHHHTG----SEEEESSSSSB-T--CHHHHHHT
T ss_pred             HHHHHHHHhceeEEeCceeeecccc----CCCHHHHHHHHhhhHHHHHHhcCCC--CCeEEeCCCCCCch--hhHHHHhc
Confidence            46688888885 5544445433321    2578889999999999888777754  3467766 888865  58999999


Q ss_pred             cCCeEEeecC
Q psy16054        345 QFFVYDASIT  354 (563)
Q Consensus       345 ~Gf~YDsS~~  354 (563)
                      .||.|-....
T Consensus       136 ~Gi~~~v~~r  145 (275)
T PF01074_consen  136 FGIKYFVIWR  145 (275)
T ss_dssp             TT-SEEEESS
T ss_pred             cCcceEEEec
Confidence            9999988875


No 35 
>KOG1958|consensus
Probab=67.03  E-value=1.2e+02  Score=36.02  Aligned_cols=74  Identities=15%  Similarity=0.202  Sum_probs=53.0

Q ss_pred             HHHHHHHHcC-CeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHHH
Q psy16054        267 ASVQELHRKG-HEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMTD  344 (563)
Q Consensus       267 ~~v~~i~~~G-HEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~e  344 (563)
                      ..|+.+..+| -||..-||.-.+-    ++...-...+++......|+...|++++  .||.. |||..+  ++.=+|+.
T Consensus       229 ~a~k~lv~~GqlEIvtGGWVMpDE----An~Hy~~~i~qliEGh~Wl~~~igv~P~--s~WaiDPFG~S~--TmpYLL~~  300 (1129)
T KOG1958|consen  229 NAVKRLVKNGQLEIVTGGWVMPDE----ANSHYFAMIDQLIEGHQWLKNNIGVTPQ--SGWAIDPFGYSS--TMPYLLRR  300 (1129)
T ss_pred             HHHHHHHhcCcEEEEeCccccCcc----cchhHHHHHHHHhhhhhHHhccCCCCCC--cccccCCCCCCc--chHHHHhh
Confidence            6899999999 5999999976654    2445667788888889999998887643  35543 666533  35556666


Q ss_pred             cCCe
Q psy16054        345 QFFV  348 (563)
Q Consensus       345 ~Gf~  348 (563)
                      +||.
T Consensus       301 ag~~  304 (1129)
T KOG1958|consen  301 AGFE  304 (1129)
T ss_pred             cCch
Confidence            6654


No 36 
>KOG3509|consensus
Probab=62.45  E-value=14  Score=43.84  Aligned_cols=89  Identities=22%  Similarity=0.412  Sum_probs=63.7

Q ss_pred             CCCCcCCCcc---ccccCcccCCCCceecCCCCCcCCCcCCCCCCccCCCCCccCcCCCCCCCCCCCCcccCCC-ceecC
Q psy16054         83 WKTNVKNCDL---LESLKQISCPSGLAFDVDKQTCDWKAKVTNCNQLDRPRKVLPILKTDEPVCPEGKLSCGNG-ECIDK  158 (563)
Q Consensus        83 ~~~~v~~C~~---~~~~~~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde~~~~p~~~~~~~~C~~~~f~C~~G-~Ci~~  158 (563)
                      ..+++.+++-   .+.+.++.|.++.+--+.. .||   ..-.|.+.+.+..|.-..  ....|.+..++|-+- +|...
T Consensus        18 ~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~--~~s~~~~~~~~c~~~~~~~~~   91 (964)
T KOG3509|consen   18 DRSDVANDPAIGSACSPNEFKCNNPRCVQPEA-LLD---ADSTCGPNSTPSGCNAKP--SASDCKPTETQCRDRLRCNPQ   91 (964)
T ss_pred             hhcccCCCccccccCCcchhccCCccccCchh-hhc---cccccCCCCCcCCccccc--cccccCCcccccccchhcCCc
Confidence            3444554432   2347789999998885444 688   455788888776553211  112577889999876 79999


Q ss_pred             ccccCCCCCCCCCCccccc
Q psy16054        159 TLFCDDKPDCKDESDENAC  177 (563)
Q Consensus       159 ~~~Cdg~~DC~DgsDE~~C  177 (563)
                      ...|+|.+||.|+++|..+
T Consensus        92 ~~~~~g~~~~~~~~~~~~~  110 (964)
T KOG3509|consen   92 SFQCDGTNDCKDGSDEVGC  110 (964)
T ss_pred             cccccCCCCCCccchhccc
Confidence            9999999999999999766


No 37 
>PF07315 DUF1462:  Protein of unknown function (DUF1462);  InterPro: IPR009190 There are currently no experimental data for members of this group of bacterial proteins or their homologues. A crystal structure of Q7A6J8 from SWISSPROT revealed a thioredoxin-like fold, its core consisting of three layers alpha/beta/alpha.; PDB: 1XG8_A.
Probab=60.05  E-value=38  Score=28.61  Aligned_cols=53  Identities=19%  Similarity=0.411  Sum_probs=34.3

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHc
Q psy16054        417 SCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLD  474 (563)
Q Consensus       417 ~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~  474 (563)
                      +|.+..|..+.++||..+..|-|-.  .||.++.= . |. +|.--..-++|++.|..
T Consensus        11 SCVn~PsSkeTyeWL~aal~RKyp~--~~f~~~Yi-D-i~-~p~~~~~~~~~a~~I~e   63 (93)
T PF07315_consen   11 SCVNAPSSKETYEWLEAALKRKYPD--QPFEFTYI-D-IE-NPPENDHDQQFAERILE   63 (93)
T ss_dssp             GGSSS--HHHHHHHHHHHHHHH-TT--S-EEEEEE-E-TT-T----HHHHHHHHHHHT
T ss_pred             hhcCCCCchhHHHHHHHHHhCcCCC--CceEEEEE-e-cC-CCCccHHHHHHHHHHHh
Confidence            4888899999999999999999865  57866542 2 22 23333667889999985


No 38 
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=53.81  E-value=11  Score=27.64  Aligned_cols=24  Identities=50%  Similarity=0.942  Sum_probs=20.7

Q ss_pred             cCcccCCCCceecCCCCCcCCCcC
Q psy16054         96 LKQISCPSGLAFDVDKQTCDWKAK  119 (563)
Q Consensus        96 ~~~~~C~~G~~~~~~~~~CD~~~~  119 (563)
                      ...+.|+.|+.|+...+.|+++..
T Consensus        28 ~~~~~C~~g~~f~~~~~~C~~~~~   51 (56)
T smart00494       28 PIVGSCPAGLVFDPATQTCDWPDN   51 (56)
T ss_pred             EEeccCcCCCeECCCCCCcCCccc
Confidence            457899999999999999998754


No 39 
>COG4878 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=53.80  E-value=22  Score=36.05  Aligned_cols=98  Identities=18%  Similarity=0.250  Sum_probs=56.3

Q ss_pred             ceEEEEeccCC-CHH-HHHHHHHcCCeecccCCCCCCCcCCCCCCCHHHHHH-H-HHHHHHHHHHHhC---CCCCcceEE
Q psy16054        254 KGTFFVSHKYT-NYA-SVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLA-E-MAGGRLIIERFAN---ISDGSVIGV  326 (563)
Q Consensus       254 kaTFFv~~~~~-~y~-~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~-E-i~~~~~~l~~~~g---i~~~~~~G~  326 (563)
                      +.-||...+.. +.. .=.+|...|-|+|.|||.|..++.   .....-|+. | |.++...|..+..   +++-+..-|
T Consensus        34 ~~gff~~~~~~~dl~~~g~el~~~gge~gIhgYNh~sl~p---k~~~knw~see~m~ka~~~L~~y~~~ls~~~y~~~vf  110 (309)
T COG4878          34 KGGFFSLEQSFDDLSAKGTELTFDGGEKGIHGYNHKSLQP---KDILKNWESEEIMNKANYTLADYGDILSITGYDTFVF  110 (309)
T ss_pred             CCccchhhccchhhhhcCcEEEecCCceeEeeccCCCCCc---cccccchhhHHHHHHHHHHHHHHHHHHhccccceEEE
Confidence            45567665544 222 223466789999999999998641   122233332 2 6666666666533   233456667


Q ss_pred             ecCccccChhhHHHHHHHcC-CeEEeecC
Q psy16054        327 RAPYLRVGGNKQFSMMTDQF-FVYDASIT  354 (563)
Q Consensus       327 R~P~l~~~~~~~~~~L~e~G-f~YDsS~~  354 (563)
                      -+|.-...++...++++.+- ..--||+.
T Consensus       111 VppSnil~q~gk~alvk~~p~lktissiy  139 (309)
T COG4878         111 VPPSNILLQKGKFALVKQAPSLKTISSIY  139 (309)
T ss_pred             eCcccccchhHHHHHHHhCCCcceeeeEe
Confidence            88877766555556666542 33445543


No 40 
>COG1543 Uncharacterized conserved protein [Function unknown]
Probab=51.56  E-value=36  Score=37.22  Aligned_cols=78  Identities=23%  Similarity=0.200  Sum_probs=61.8

Q ss_pred             HHHHHHc-CCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCC
Q psy16054        269 VQELHRK-GHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFF  347 (563)
Q Consensus       269 v~~i~~~-GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf  347 (563)
                      .+.+.+. +-||-.-+.+|-.++-+  +. ++.+.++|..+.+.-++..|.   .|.|+=.|-..+.+ .+-.+|+++|+
T Consensus       129 f~~~~~~g~ieilts~athg~lPll--~~-peAi~AQi~~g~~~ye~~fg~---~P~GiWlPEcay~p-gie~~l~~~Gi  201 (504)
T COG1543         129 FKELQRSGGIEILTSAATHGYLPLL--GG-PEAIEAQILTGIELYEEHFGL---APKGIWLPECAYAP-GIERILKDAGI  201 (504)
T ss_pred             HHHHHHcCCceeeeehhhheehhhc--CC-chhhHHHHHHHHHHHHHHhCC---CCCceechhhcccc-chHHHHHhcCc
Confidence            4445554 57999999999887642  23 788999999999999988885   38999999988885 57889999999


Q ss_pred             eEEeec
Q psy16054        348 VYDASI  353 (563)
Q Consensus       348 ~YDsS~  353 (563)
                      .|--.-
T Consensus       202 ~yf~vd  207 (504)
T COG1543         202 EYFFVD  207 (504)
T ss_pred             eEEEec
Confidence            986544


No 41 
>PF12340 DUF3638:  Protein of unknown function (DUF3638);  InterPro: IPR022099  This domain family is found in eukaryotes, and is approximately 230 amino acids in length. There are two conserved sequence motifs: LLE and NMG. 
Probab=48.52  E-value=85  Score=31.33  Aligned_cols=63  Identities=19%  Similarity=0.293  Sum_probs=48.5

Q ss_pred             hHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHH
Q psy16054        424 GEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQW  489 (563)
Q Consensus       424 ~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w  489 (563)
                      .++.++.|...|-..+  +|..+.+.|+ ....-.+..++.+++.++.+....+|.++|.+.+...
T Consensus        82 l~q~~~~L~~~lg~l~--~r~i~~lpFs-R~~~~~~~~~~~~~~l~~~~~~~~gill~~PEhilSf  144 (229)
T PF12340_consen   82 LEQMRQMLRSRLGGLL--NRRIYHLPFS-RSTPLTPETLEKIRQLLEECMRSGGILLATPEHILSF  144 (229)
T ss_pred             HHHHHHHHHHHHHHHh--CCeeEEeccc-CCCCCCHHHHHHHHHHHHHHHHcCCEEEeChHHHHHH
Confidence            3556777777775544  5666778887 5455567788999999999999999999999999653


No 42 
>PF04794 YdjC:  YdjC-like protein;  InterPro: IPR006879 This entry contains ChbG (YdjC), which is is an uncharacterised protein encoded by the chb (N,N'-diacetylchitobiose, also called [GlcNAc]2) or cel operon, which encodes enzymes involved in growth on an N,N'-diacetylchitobiose carbon source [, ]. The entry also contains HpnK, which is a protein associated with hopanoid biosynthesis.; PDB: 2E67_C 2I5I_B.
Probab=43.06  E-value=63  Score=32.56  Aligned_cols=119  Identities=8%  Similarity=0.080  Sum_probs=65.8

Q ss_pred             ceEEeecC-CcCCCCchHHHHHhccCCCCCCCCCCceE-EEEeccCCCHHHHHHHHHc-CCeecccCC-CC---------
Q psy16054        220 MITITFNG-AVNVDNSDLYDEIFNGNRQNPNGCQIKGT-FFVSHKYTNYASVQELHRK-GHEISVFSL-TH---------  286 (563)
Q Consensus       220 ~V~lTFDD-~~~~~~~~~~~~ll~~~r~n~n~~~ikaT-FFv~~~~~~y~~v~~i~~~-GHEIa~H~~-~H---------  286 (563)
                      .|+|.=|| |........+++++...+       +.+| ..|++.+. .+.++.+.+. +-.||.|-. +.         
T Consensus         2 ~lii~ADDfG~s~~vn~gI~~~~~~G~-------vtstslM~n~p~~-~~a~~~~k~~~~~~vGlHl~Lt~g~p~~~p~~   73 (261)
T PF04794_consen    2 KLIINADDFGLSPGVNRGIIEAFENGI-------VTSTSLMVNMPAF-EEAAALAKNNPGLDVGLHLNLTSGWPVLSPTQ   73 (261)
T ss_dssp             EEEEEEEEETSSHHHHHHHHHHHHCCT--------SEEEEETTSTTH-HHHHHHHHH-TTTEEEEEE-S---SSSS-TTT
T ss_pred             EEEEECCCCCCCHHHHHHHHHHHHcCC-------ceEeeeccCChhH-HHHHHHHHhCCCCCEEEEeEccCCCCCCCccC
Confidence            36778888 666665566778876422       4444 55555543 3455555554 478888842 11         


Q ss_pred             ------------CCC----cCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccCh---hhHHHHHHHcCC
Q psy16054        287 ------------KDD----PKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGG---NKQFSMMTDQFF  347 (563)
Q Consensus       287 ------------~~~----~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~---~~~~~~L~e~Gf  347 (563)
                                  ...    ......++.++++.||..+.+.+.+ .|+.+..+-|.--  ....+   +.+++++++.|+
T Consensus        74 vpsLvd~~G~F~~~~~~~~~~~~~~~~~~ev~~E~~AQi~~f~~-~G~~P~hiD~H~h--~h~~p~v~~~~~~~a~~y~l  150 (261)
T PF04794_consen   74 VPSLVDEDGYFPRSKRLLRRALLRRADYEEVERELRAQIERFLD-MGRKPDHIDGHQH--VHFHPPVREALLDLAKEYGL  150 (261)
T ss_dssp             SGGGBBTTTBB-SCC----HCHCCC--HHHHHHHHHHHHHHHHC-CSS-ECEEEEGGG--GGCSHHHHHHHHHHHHHCT-
T ss_pred             CCCccCCCCccCCccccchhhhhccCCHHHHHHHHHHHHHHHHH-cCCCCCeEecccc--cccChHHHHHHHHHHHHcCC
Confidence                        111    0111456789999999999988887 7876554444210  01111   346777888888


Q ss_pred             eE
Q psy16054        348 VY  349 (563)
Q Consensus       348 ~Y  349 (563)
                      .+
T Consensus       151 p~  152 (261)
T PF04794_consen  151 PW  152 (261)
T ss_dssp             EB
T ss_pred             Cc
Confidence            55


No 43 
>COG4837 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=42.52  E-value=45  Score=28.32  Aligned_cols=54  Identities=19%  Similarity=0.404  Sum_probs=40.3

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcC
Q psy16054        417 SCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDR  475 (563)
Q Consensus       417 ~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~  475 (563)
                      +|.+..|..++++||...+.|-|-+  .||-+..- . +. +|..-++..+|++.|.+.
T Consensus        18 SCV~aPtsKdt~eWLeaalkRKyp~--~~F~~~Yi-D-I~-n~~~e~~~~~~aekI~~d   71 (106)
T COG4837          18 SCVNAPTSKDTYEWLEAALKRKYPN--QPFKYTYI-D-IT-NPPLEDHDLQFAEKIEQD   71 (106)
T ss_pred             HhcCCCcchhHHHHHHHHHhccCCC--CCcEEEEE-E-cC-CCccHHHHHHHHHHHhcc
Confidence            4778889999999999999998864  57744332 1 22 455677888999999853


No 44 
>cd07286 PX_SNX18 The phosphoinositide binding Phox Homology domain of Sorting Nexin 18. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX18, like SNX9, contains an N-terminal Src Homology 3 (SH3) domain, a PX domain, and a C-terminal Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature. The PX-BAR structural unit helps determine specific membrane localization. SNX18 is localized to peripheral endosomal structures, and acts in a trafficki
Probab=41.38  E-value=46  Score=30.02  Aligned_cols=73  Identities=12%  Similarity=0.195  Sum_probs=44.5

Q ss_pred             HHHHHHHHHHHHccCCcc-e-----EEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHhhCC-chhhhhc
Q psy16054        428 GRLLRHNFNRHFSTNRAP-L-----GLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP-TELTSLR  500 (563)
Q Consensus       428 ~~~l~~~F~~~y~~nrap-~-----~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~~~p-~~~~~~~  500 (563)
                      +.+|.+.+.+.|-+--.| +     +-.+.+.|+.   .|..+|++||..|+.+|-.+  ...++-.++..+ +.....+
T Consensus        41 F~~L~~~L~~~~p~~~IPpLP~K~~~g~f~~~FIe---~Rr~~Lq~FL~ria~hp~L~--~s~~~~~FL~~~~~~~~~~~  115 (127)
T cd07286          41 FDWLYARLAEKFPVISVPHIPEKQATGRFEEDFIS---KRRKGLIWWMDHMCSHPVLA--RCDAFQHFLTCPSTDEKAWK  115 (127)
T ss_pred             HHHHHHHHHHHCCCcEeCCCcCCCcCCCCCHHHHH---HHHHHHHHHHHHHHcCcccc--cChHHHHHhcCCcchhHHHh
Confidence            566777777777542112 1     1234545433   68999999999999998755  345566677665 3444444


Q ss_pred             cchhh
Q psy16054        501 DFQEW  505 (563)
Q Consensus       501 ~~~~w  505 (563)
                      ....|
T Consensus       116 ~~~~~  120 (127)
T cd07286         116 QGKRK  120 (127)
T ss_pred             hcchH
Confidence            44433


No 45 
>PLN02701 alpha-mannosidase
Probab=40.79  E-value=63  Score=39.49  Aligned_cols=79  Identities=11%  Similarity=0.124  Sum_probs=60.3

Q ss_pred             HHHHHHHHcCC-eecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHHH
Q psy16054        267 ASVQELHRKGH-EISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMTD  344 (563)
Q Consensus       267 ~~v~~i~~~GH-EIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~e  344 (563)
                      +.|++++++|. ||..=+|.-.+..    ....+.+.++|......|.+..|+.+  .+||.. |||..+  .+..+|++
T Consensus       106 ~~vk~LV~~GrLE~vgGgwvm~DEa----~~~~esiI~Ql~~G~~~l~~~fG~~P--~~~W~iDpFGhs~--~~P~Ll~~  177 (1050)
T PLN02701        106 EAFTKLVKNGQLEIVGGGWVMNDEA----NSHYFAIIEQITEGNMWLNDTIGVAP--KNSWAIDPFGYSS--TMAYLLRR  177 (1050)
T ss_pred             HHHHHHHHcCCEEEECCceeccccc----ccCHHHHHHHHHhhhHHHHhhcCCCC--CcCccCCCCCCCH--HHHHHHHh
Confidence            68999999995 7888888665542    35688888998888888877777653  467776 888754  58999999


Q ss_pred             cCCeEEeec
Q psy16054        345 QFFVYDASI  353 (563)
Q Consensus       345 ~Gf~YDsS~  353 (563)
                      .||.+---+
T Consensus       178 ~G~~~~~~~  186 (1050)
T PLN02701        178 MGFENMLIQ  186 (1050)
T ss_pred             CCCceEEEe
Confidence            999875433


No 46 
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=40.62  E-value=8.9  Score=42.67  Aligned_cols=84  Identities=18%  Similarity=0.346  Sum_probs=48.3

Q ss_pred             CCCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC--CCHHHHHHHHHcCCeecccCCCC-------
Q psy16054        216 QVPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY--TNYASVQELHRKGHEISVFSLTH-------  286 (563)
Q Consensus       216 ~~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~--~~y~~v~~i~~~GHEIa~H~~~H-------  286 (563)
                      -+||+|.--+|-..|.-.++.  .||            .+-||=+..-  .||.-|-.++  ||||+ ||..-       
T Consensus       446 M~pq~VNAYYnp~~N~IVFPA--AIL------------Q~PFfd~ea~~a~NYGgIGaVI--gHEI~-HgFDdqGakfD~  508 (654)
T COG3590         446 MPPQTVNAYYNPQKNEIVFPA--AIL------------QAPFFDPEADSAANYGGIGAVI--GHEIG-HGFDDQGAKFDG  508 (654)
T ss_pred             CCHHHhhhhcCCCCceEeeeH--Hhc------------CCCCCCCCcchhhcccCcccee--hhhhc-ccccCCccccCC
Confidence            378887655555555444443  244            4456655543  4887666665  99998 55533       


Q ss_pred             -CCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCC
Q psy16054        287 -KDDPKYWSGGSYDDWLAEMAGGRLIIERFANIS  319 (563)
Q Consensus       287 -~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~  319 (563)
                       -++.+||..-+.+.+.   +++.+++++|-|..
T Consensus       509 ~GnL~dWWT~eD~~aF~---~~T~~lvaqf~~~e  539 (654)
T COG3590         509 DGNLNDWWTDEDAAAFK---ERTKKLVAQFDGYE  539 (654)
T ss_pred             CCcHHhhcCHHHHHHHH---HHHHHHHHHhcCcc
Confidence             2334566544444433   36667777776643


No 47 
>COG3536 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.31  E-value=10  Score=32.99  Aligned_cols=78  Identities=15%  Similarity=0.306  Sum_probs=50.6

Q ss_pred             cccCCCCCCCCCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC----------CHHHHHHHHHcC
Q psy16054        207 RIPGGIEPNQVPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT----------NYASVQELHRKG  276 (563)
Q Consensus       207 ciP~~~~~~~~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~----------~y~~v~~i~~~G  276 (563)
                      -+|..++...-.+..+|||+||-.......++.++... ..-.|++..--.-|.|.-.          || .||-+.+.|
T Consensus         3 ~~P~~lrvsk~r~~L~l~f~dg~~~~L~ae~LRv~sPs-AEvqghgp~q~v~v~gkr~V~i~~i~p~GnY-avri~FdDg   80 (120)
T COG3536           3 QWPTELRVSKDRKVLSLTFADGAPFRLPAEFLRVLSPS-AEVQGHGPGQRVLVPGKRNVQIRDIEPVGNY-AVRILFDDG   80 (120)
T ss_pred             CCCceeEEeccCcEEEEEeCCCCcccCCHHHHHhcCch-HHhcccCCCCceeeeccccceEEEeEecCce-EEEEEecCC
Confidence            46777777777888999999977665555556666431 1113455544455555421          44 467778899


Q ss_pred             CeecccCCCC
Q psy16054        277 HEISVFSLTH  286 (563)
Q Consensus       277 HEIa~H~~~H  286 (563)
                      |+-|.-+|+-
T Consensus        81 HDsGiy~W~Y   90 (120)
T COG3536          81 HDSGIYTWAY   90 (120)
T ss_pred             cccCcccHHH
Confidence            9999888753


No 48 
>TIGR01358 DAHP_synth_II 3-deoxy-7-phosphoheptulonate synthase, class II. Homologs scoring between trusted and noise cutoff include proteins involved in antibiotic biosynthesis; one example is active as this enzyme, while another acts on an amino analog.
Probab=40.27  E-value=1.5e+02  Score=32.26  Aligned_cols=84  Identities=14%  Similarity=0.136  Sum_probs=59.8

Q ss_pred             CcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC---C--CHHHHHHHHHcCCeec-----ccCCCCCCCcCCCCCCC
Q psy16054        228 AVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY---T--NYASVQELHRKGHEIS-----VFSLTHKDDPKYWSGGS  297 (563)
Q Consensus       228 ~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~---~--~y~~v~~i~~~GHEIa-----~H~~~H~~~~~~~~~~s  297 (563)
                      |++-. -+.++.|++  +-||++-+-|.|+.+--..   .  -+.+|+++.++||.|.     .||.+......| ..-.
T Consensus       287 GP~~~-p~~l~~L~~--~LnP~~epGRlTLI~RmGa~kV~~~LP~li~aV~~~G~~VvW~cDPMHGNT~~t~~G~-KTR~  362 (443)
T TIGR01358       287 GPSMT-PDELLRLIE--RLNPENEPGRLTLISRMGADKIADKLPPLLRAVKAAGRRVVWVCDPMHGNTEEAASGY-KTRR  362 (443)
T ss_pred             CCCCC-HHHHHHHHH--HhCCCCCCceEEEEeccCchHHHHhHHHHHHHHHHcCCceEEeecCCCCCceeCCCCc-cCCc
Confidence            45433 345556765  5688888888887763322   1  4589999999999997     799887754332 3456


Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy16054        298 YDDWLAEMAGGRLIIERF  315 (563)
Q Consensus       298 ~~~~~~Ei~~~~~~l~~~  315 (563)
                      .+.+.+||.+..++.++.
T Consensus       363 f~~Il~Ev~~ff~vh~~~  380 (443)
T TIGR01358       363 FDDIRSEVKGFFEVHRAE  380 (443)
T ss_pred             HHHHHHHHHHHHHHHHHc
Confidence            788999999999988764


No 49 
>PRK09819 alpha-mannosidase; Provisional
Probab=39.74  E-value=1.1e+02  Score=36.68  Aligned_cols=79  Identities=15%  Similarity=0.040  Sum_probs=54.4

Q ss_pred             HHHHHHHHHcCCeecccCC-CCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHH
Q psy16054        266 YASVQELHRKGHEISVFSL-THKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMT  343 (563)
Q Consensus       266 y~~v~~i~~~GHEIa~H~~-~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~  343 (563)
                      ++.|++++++|-=|. =+| .-.+.    .-.+.|.+.+.|.-.++.+++| |+.  ..+||-+ |||..+  .+.++|+
T Consensus        69 ~~~ik~lV~~Grl~~-G~W~v~~D~----~l~sgEsliRqll~G~~~~~~f-G~~--~~vgwlpD~FG~s~--~lPqIl~  138 (875)
T PRK09819         69 KERVKKLVQAGKLII-GPWYTQTDQ----LVVSGESIVRNLLYGIRDCREF-GEP--MKIGYLPDSFGQSG--QMPQIYN  138 (875)
T ss_pred             HHHHHHHHHcCCEeE-CcEEeeccc----ccCCHHHHHHHHHHHHHHHHHc-CCC--CceeeecCCCCCcH--HHHHHHH
Confidence            368999999997443 334 21111    1246788888888777778765 543  3567776 676644  6899999


Q ss_pred             HcCCeEEeecC
Q psy16054        344 DQFFVYDASIT  354 (563)
Q Consensus       344 e~Gf~YDsS~~  354 (563)
                      ..||.|-..|.
T Consensus       139 ~~Gi~~~~~wr  149 (875)
T PRK09819        139 GFGITRTLFWR  149 (875)
T ss_pred             hCCCCeEEEEe
Confidence            99999987774


No 50 
>cd02995 PDI_a_PDI_a'_C PDIa family, C-terminal TRX domain (a') subfamily; composed of the C-terminal redox active a' domains of PDI, ERp72, ERp57 (or ERp60) and EFP1. PDI, ERp72 and ERp57 are endoplasmic reticulum (ER)-resident eukaryotic proteins involved in oxidative protein folding. They are oxidases, catalyzing the formation of disulfide bonds of newly synthesized polypeptides in the ER. They also exhibit reductase activity in acting as isomerases to correct any non-native disulfide bonds, as well as chaperone activity to prevent protein aggregation and facilitate the folding of newly synthesized proteins. PDI and ERp57 have the abb'a' domain structure (where a and a' are redox active TRX domains while b and b' are redox inactive TRX-like domains). PDI also contains an acidic region (c domain) after the a' domain that is absent in ERp57. ERp72 has an additional a domain at the N-terminus (a"abb'a' domain structure). ERp57 interacts with the lectin chaperones, calnexin and calreticu
Probab=38.59  E-value=72  Score=26.26  Aligned_cols=51  Identities=18%  Similarity=0.288  Sum_probs=33.3

Q ss_pred             HHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeH
Q psy16054        433 HNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTM  483 (563)
Q Consensus       433 ~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~  483 (563)
                      ++|+......+.++.+.|+++|-..-......|+++.+.+.....|-|+..
T Consensus         8 ~~f~~~i~~~~~~~~v~f~~~~C~~C~~~~~~~~~~~~~~~~~~~~~~~~i   58 (104)
T cd02995           8 KNFDEVVLDSDKDVLVEFYAPWCGHCKALAPIYEELAEKLKGDDNVVIAKM   58 (104)
T ss_pred             hhhHHHHhCCCCcEEEEEECCCCHHHHHHhhHHHHHHHHhcCCCCEEEEEE
Confidence            456555544456888999999966444456677777776655456766653


No 51 
>PLN02291 phospho-2-dehydro-3-deoxyheptonate aldolase
Probab=37.96  E-value=1.7e+02  Score=32.21  Aligned_cols=84  Identities=15%  Similarity=0.100  Sum_probs=59.7

Q ss_pred             CcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC---C--CHHHHHHHHHcCCeec-----ccCCCCCCCcCCCCCCC
Q psy16054        228 AVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY---T--NYASVQELHRKGHEIS-----VFSLTHKDDPKYWSGGS  297 (563)
Q Consensus       228 ~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~---~--~y~~v~~i~~~GHEIa-----~H~~~H~~~~~~~~~~s  297 (563)
                      |++-. -+.++.|++  +-||++-+-|.|+-+--..   .  -+.+|+++.+.||.|.     .||.+......| ..-.
T Consensus       307 GPs~~-pdel~~L~~--~LnP~~epGRlTLI~RmGa~kV~~~LP~Li~aV~~~G~~VvW~cDPMHGNT~~t~~G~-KTR~  382 (474)
T PLN02291        307 SDKMD-PEELVKLIE--ILNPQNKPGRLTIIVRMGAEKLRVKLPHLIRAVRRAGQIVTWVSDPMHGNTIKAPSGL-KTRP  382 (474)
T ss_pred             CCCCC-HHHHHHHHH--HhCCCCCCceEEEEeccchHHHHHHHHHHHHHHHHcCCceEEeecCCCCCceeCCCCc-cCCc
Confidence            55443 345566775  5688888888887763221   2  4489999999999997     799887654322 3456


Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy16054        298 YDDWLAEMAGGRLIIERF  315 (563)
Q Consensus       298 ~~~~~~Ei~~~~~~l~~~  315 (563)
                      .+.+.+||.+..++.++.
T Consensus       383 f~~Il~Ev~~ff~vh~~~  400 (474)
T PLN02291        383 FDAIRAEVRAFFDVHEQE  400 (474)
T ss_pred             HHHHHHHHHHHHHHHHHc
Confidence            788999999999988764


No 52 
>PF11071 DUF2872:  Protein of unknown function (DUF2872);  InterPro: IPR019884 This entry represents a family of uncharacterised proteins, including YtoQ from Bacillus subtilis. This family shows some sequence similarity to a family of nucleoside 2-deoxyribosyltransferases (COG3613 as iterated through CDD), but sufficiently remote that PSI-BLAST starting from YtoQ and exploring outwards does not discover the relationship. 
Probab=33.57  E-value=52  Score=29.78  Aligned_cols=52  Identities=17%  Similarity=0.178  Sum_probs=36.2

Q ss_pred             HHHHHHHHHHHHcc-CCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHh
Q psy16054        428 GRLLRHNFNRHFST-NRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWM  490 (563)
Q Consensus       428 ~~~l~~~F~~~y~~-nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~  490 (563)
                      |+.|..+||.-|.. -+.|+ |.+|++-+.      .+|++    +.+.-.+|.-|..|+++-+
T Consensus        84 YKQWNaAfDAg~a~AlgKpl-I~lh~~~~~------HpLKE----vda~A~a~~et~~Qvv~iL  136 (141)
T PF11071_consen   84 YKQWNAAFDAGYAAALGKPL-ITLHPEELH------HPLKE----VDAAALAVAETPEQVVEIL  136 (141)
T ss_pred             HHHHHHHhhHHHHHHcCCCe-EEecchhcc------ccHHH----HhHhhHhhhCCHHHHHHHH
Confidence            78899999999963 45688 889966444      23333    3344458888999987643


No 53 
>KOG3509|consensus
Probab=32.97  E-value=52  Score=39.38  Aligned_cols=61  Identities=18%  Similarity=0.530  Sum_probs=49.0

Q ss_pred             CcCCCcCCCCCCccCCCCCccCcCCCCCCCCCCCCcccCCCceecCccccCCCCCCCCCCcccccCCC
Q psy16054        113 TCDWKAKVTNCNQLDRPRKVLPILKTDEPVCPEGKLSCGNGECIDKTLFCDDKPDCKDESDENACSVE  180 (563)
Q Consensus       113 ~CD~~~~v~dC~d~sde~~~~p~~~~~~~~C~~~~f~C~~G~Ci~~~~~Cdg~~DC~DgsDE~~C~~~  180 (563)
                      .|+   +..+|.+.+|+.++.-    -...|.+.+++|.+++|.-..+.||+..+|.++|+..+|..+
T Consensus         9 ~~~---~~~~~~~~~~~~~~~~----~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   69 (964)
T KOG3509|consen    9 ACD---RQPDCRDRSDVANDPA----IGSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAK   69 (964)
T ss_pred             hhc---cchhhHhhcccCCCcc----ccccCCcchhccCCccccCchhhhccccccCCCCCcCCcccc
Confidence            466   5668888888865322    123688999999999999999999999999999988888654


No 54 
>TIGR03473 HpnK hopanoid biosynthesis associated protein HpnK. The sequences represented by this model are members of the pfam04794 "YdjC-like" family of uncharacterized proteins. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0976) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnK) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopanoid biosynthesis locus was described consisting of the genes HpnA-E and SHC (HpnF). Continuing past SHC are found a ph
Probab=31.79  E-value=2.9e+02  Score=28.31  Aligned_cols=117  Identities=9%  Similarity=0.106  Sum_probs=64.4

Q ss_pred             eEEeecC-CcCCCCchHHHHHhccCCCCCCCCCCc-eEEEEeccCCCHHHHHHHHH-cCCeecccC-CCCCC-------C
Q psy16054        221 ITITFNG-AVNVDNSDLYDEIFNGNRQNPNGCQIK-GTFFVSHKYTNYASVQELHR-KGHEISVFS-LTHKD-------D  289 (563)
Q Consensus       221 V~lTFDD-~~~~~~~~~~~~ll~~~r~n~n~~~ik-aTFFv~~~~~~y~~v~~i~~-~GHEIa~H~-~~H~~-------~  289 (563)
                      |.|.=|| |........+.++++..+       +. +|..+++.+. .+.++.+.+ .+--||.|- .+...       .
T Consensus         4 LIinADDfG~s~~vn~gI~~~~~~G~-------v~sts~M~n~p~~-~~a~~~~~~~~~~~vGlHl~Lt~g~pl~~~~~i   75 (283)
T TIGR03473         4 LIVTADDFGLSLEVNEAVEKAHRDGI-------LTAASLMVGAPAA-EDAVERARRLPGLGVGLHLVLVDGRPVLPPDQI   75 (283)
T ss_pred             EEEecccCCCCHHHHHHHHHHHHcCc-------eeeeeeccCChhH-HHHHHHHHhCCCCCEEEEEEecCCCCCCChhhC
Confidence            5677788 666665566677776422       33 3455555543 244555554 477899992 22211       1


Q ss_pred             cC-------CC------------CCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEe----cCccccChhhHHHHHHHcC
Q psy16054        290 PK-------YW------------SGGSYDDWLAEMAGGRLIIERFANISDGSVIGVR----APYLRVGGNKQFSMMTDQF  346 (563)
Q Consensus       290 ~~-------~~------------~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R----~P~l~~~~~~~~~~L~e~G  346 (563)
                      +.       ++            .....++++.||..+.+.+.+ .|+.+..+-|.-    .|...   +..++++++.|
T Consensus        76 ~sLvd~~G~F~~~~~~~~~~~~~~~~~~~~v~~E~~aQi~~f~~-~G~~p~hiD~H~hvh~~P~v~---~~~~~la~~y~  151 (283)
T TIGR03473        76 PDLVDGQGRFGDDMVRDGVRYFFLPAVRAQLAREIRAQFEAFAA-TGLPLDHVNAHKHFHLHPTVL---SLILEIGREYG  151 (283)
T ss_pred             cCccCCCCCccHHHHHHhhHHhhccccHHHHHHHHHHHHHHHHH-hCCCCCeecCCCccccChhHH---HHHHHHHHHcC
Confidence            11       01            111346888899888888776 577655544421    23211   34566677766


Q ss_pred             CeE
Q psy16054        347 FVY  349 (563)
Q Consensus       347 f~Y  349 (563)
                      +.|
T Consensus       152 lp~  154 (283)
T TIGR03473       152 LRA  154 (283)
T ss_pred             CCe
Confidence            544


No 55 
>COG3603 Uncharacterized conserved protein [Function unknown]
Probab=31.68  E-value=73  Score=28.39  Aligned_cols=14  Identities=36%  Similarity=0.648  Sum_probs=12.2

Q ss_pred             HHHHHHHHcCCeec
Q psy16054        267 ASVQELHRKGHEIS  280 (563)
Q Consensus       267 ~~v~~i~~~GHEIa  280 (563)
                      ..|+.|.++||||.
T Consensus       114 kAv~~L~eaGhev~  127 (128)
T COG3603         114 KAVKALEEAGHEVL  127 (128)
T ss_pred             HHHHHHHHcCCccc
Confidence            47899999999985


No 56 
>PRK02134 hypothetical protein; Provisional
Probab=31.45  E-value=4e+02  Score=26.81  Aligned_cols=117  Identities=9%  Similarity=-0.003  Sum_probs=67.9

Q ss_pred             eEEeecC-CcCCCCchHHHHHhccCCCCCCCCCCc-eEEEEeccCCCHHHHHHHHHc-CCeecccC-CCCC-------CC
Q psy16054        221 ITITFNG-AVNVDNSDLYDEIFNGNRQNPNGCQIK-GTFFVSHKYTNYASVQELHRK-GHEISVFS-LTHK-------DD  289 (563)
Q Consensus       221 V~lTFDD-~~~~~~~~~~~~ll~~~r~n~n~~~ik-aTFFv~~~~~~y~~v~~i~~~-GHEIa~H~-~~H~-------~~  289 (563)
                      +.+.=|| |........++++++..+       +. +|..+++.+.. +.++.+... +--||.|- .+..       .+
T Consensus         5 lIinADDfG~s~~vn~gI~~~~~~G~-------ltstslM~n~p~~~-~a~~l~~~~~~l~vGlHl~Lt~g~P~~~~psL   76 (249)
T PRK02134          5 LIVNADDFGLSKGQNYGIIEAHRNGV-------VTSTTAMVNMPAID-HAVQLSREYPSLGVGMHFVLTAGKPLTAMPSL   76 (249)
T ss_pred             EEEECCcCCCCHHHHHHHHHHHHCCC-------eeEEEeecCCchHH-HHHHHHHhCCCCCEEEEEEecCCCccCCCCCC
Confidence            6677888 666655556667765322       33 44666666543 344444443 67889882 2211       11


Q ss_pred             c----C----CC-----CCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEE----ecCccccChhhHHHHHHHcCCe
Q psy16054        290 P----K----YW-----SGGSYDDWLAEMAGGRLIIERFANISDGSVIGV----RAPYLRVGGNKQFSMMTDQFFV  348 (563)
Q Consensus       290 ~----~----~~-----~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~----R~P~l~~~~~~~~~~L~e~Gf~  348 (563)
                      .    .    .+     ..++.++++.||..+.+.+.+..|+.+..+-|.    ..|...   +...++.++.|+.
T Consensus        77 ~d~~G~F~~~~~~~~~~~~~~~~ev~~E~~aQi~~f~~~~G~~p~hiD~H~h~h~~p~v~---~~~~~la~ey~lp  149 (249)
T PRK02134         77 TDRDGLLGKWIWQLAEEGTLPLEEIAQELAAQYERFLSLFGRKPTHLDSHHHVHMLPQIF---PIVARFAAEYGVP  149 (249)
T ss_pred             cCCCCCccHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHhhCCCCCeeecccccccChhHH---HHHHHHHHHcCCc
Confidence            1    0    11     246688999999999999988888876666552    122111   2345566666653


No 57 
>KOG1959|consensus
Probab=30.44  E-value=1.4e+02  Score=35.30  Aligned_cols=76  Identities=17%  Similarity=0.211  Sum_probs=58.8

Q ss_pred             HHHHHHHHcC-CeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHHH
Q psy16054        267 ASVQELHRKG-HEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMTD  344 (563)
Q Consensus       267 ~~v~~i~~~G-HEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~e  344 (563)
                      .+||+|.+.| -|+..-||.-.+-.    ..-+..+.+++......|+...|.-+...+||-. |||...  .+..++++
T Consensus       111 ~~Vk~LV~~GrLEFi~Ggw~MnDEA----~~hY~dvIDQ~TlGlrfL~~tFG~cgrPrvgWqIDPFGHSr--eqAslfAq  184 (996)
T KOG1959|consen  111 EQVKKLVNEGRLEFIGGGWSMNDEA----TTHYQDVIDQFTLGLRFLKDTFGECGRPRVGWQIDPFGHSR--EQASLFAQ  184 (996)
T ss_pred             HHHHHHHhcCCEEEecCeeeecchh----hhhHHHHHHHHHHHHHHHHHHhcccCCcceeeeeCCCCcch--HHHHHHHH
Confidence            6899999999 59999998655431    2456788899999999999888855555788876 888843  57888899


Q ss_pred             cCCe
Q psy16054        345 QFFV  348 (563)
Q Consensus       345 ~Gf~  348 (563)
                      +||.
T Consensus       185 mGfd  188 (996)
T KOG1959|consen  185 MGFD  188 (996)
T ss_pred             hCcc
Confidence            9983


No 58 
>COG3200 AroG 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism]
Probab=27.83  E-value=3.2e+02  Score=29.07  Aligned_cols=97  Identities=21%  Similarity=0.230  Sum_probs=64.0

Q ss_pred             cccCCCCCcccCCCCCCCCCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC---C--CHHHHHHHH
Q psy16054        199 CFCSADGTRIPGGIEPNQVPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY---T--NYASVQELH  273 (563)
Q Consensus       199 ~~c~~~g~ciP~~~~~~~~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~---~--~y~~v~~i~  273 (563)
                      .||.  |.+-|-++.|             |++-. -+.++.+.+  +-||+.-+-+.|..+--..   .  -+.+|+++.
T Consensus       277 e~~r--gv~NPig~K~-------------GP~~~-~d~ll~l~d--~LnP~nepGRLtLi~RmG~dKV~d~LP~li~av~  338 (445)
T COG3200         277 EFLR--GVKNPIGVKI-------------GPSMT-PDELLELID--RLNPHNEPGRLTLIARMGADKVGDRLPPLVEAVE  338 (445)
T ss_pred             HHHH--hcCCcccccc-------------CCCCC-HHHHHHHHH--hcCCCCCCceEEeehhhcchHHhhhhhHHHHHHH
Confidence            4565  6777877777             55433 344445554  5578777778887764322   1  458999999


Q ss_pred             HcCCeec-----ccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHH
Q psy16054        274 RKGHEIS-----VFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIER  314 (563)
Q Consensus       274 ~~GHEIa-----~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~  314 (563)
                      ++||.|-     .||.++.....| ..-..+.+.+|+....++-..
T Consensus       339 ~eG~~VvWs~DPMHGNTi~a~~gy-KTR~fd~Il~EV~sFfeihra  383 (445)
T COG3200         339 AEGHQVIWSSDPMHGNTIKASTGY-KTRPFDRILDEVQSFFEIHRA  383 (445)
T ss_pred             HcCCceEEecCCCCCceeecCCCC-ccccHHHHHHHHHHHHHHHHh
Confidence            9999986     688888765433 234566777777777766554


No 59 
>cd07295 PX_Grd19 The phosphoinositide binding Phox Homology domain of fungal Grd19. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Grd19 is involved in the localization of late Golgi membrane proteins in yeast. Grp19 associates with the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi, and functions as a cargo-specific adaptor for the retromer.
Probab=25.89  E-value=84  Score=27.72  Aligned_cols=62  Identities=16%  Similarity=0.216  Sum_probs=34.7

Q ss_pred             HHHHHHHHHHHHccCCcc-e-----EEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHhhCC
Q psy16054        428 GRLLRHNFNRHFSTNRAP-L-----GLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP  493 (563)
Q Consensus       428 ~~~l~~~F~~~y~~nrap-~-----~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~~~p  493 (563)
                      +.+|...+.+.|.+..-| +     +-.+.+.|+   ..|..+|++||..|+.+|. +-.....+.++++.|
T Consensus        47 F~~L~~~L~~~~~~~~iPplP~K~~~~~~~~~~i---e~Rr~~Le~fL~~i~~~p~-l~~~s~~~~~FL~~~  114 (116)
T cd07295          47 FEYFRDILERESPRVMIPPLPGKIFTNRFSDEVI---EERRQGLETFLQSVAGHPL-LQTGSKVLAAFLQDP  114 (116)
T ss_pred             HHHHHHHHHHHCCCCccCCCCCCccccCCCHHHH---HHHHHHHHHHHHHHhcCHh-hhhCCHHHHHhcCCC
Confidence            455667776666442211 1     112333333   3588999999999998763 211233445565554


No 60 
>COG3531 Predicted protein-disulfide isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=25.85  E-value=1.6e+02  Score=28.77  Aligned_cols=72  Identities=15%  Similarity=0.181  Sum_probs=49.8

Q ss_pred             HHHHHHHHc-CCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHc
Q psy16054        267 ASVQELHRK-GHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQ  345 (563)
Q Consensus       267 ~~v~~i~~~-GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~  345 (563)
                      .+|.++.+. |-||..|+=.-.....  ..++ ..+++.|..--..|++.+|.          ||++   +..-.+|...
T Consensus        20 Pll~~l~~~~gl~~~L~~GGl~aG~r--~pl~-~~~r~~i~~hd~rIa~lTGq----------pFg~---~y~~~lL~~~   83 (212)
T COG3531          20 PLLEALSAQPGLEVVLHGGGLRAGNR--RPLD-PAFRAYILPHDQRIAQLTGQ----------PFGQ---AYFDGLLRDT   83 (212)
T ss_pred             HHHHHHHhcCCceEEEecCCccCCCC--CCCC-hHHHHHhhhhHHHHHHHhCC----------ccch---hHHhhhhccc
Confidence            477777776 9999999843221100  1244 45789999999999999885          4443   2234788899


Q ss_pred             CCeEEeecC
Q psy16054        346 FFVYDASIT  354 (563)
Q Consensus       346 Gf~YDsS~~  354 (563)
                      ||.|||--.
T Consensus        84 ~~v~DSepa   92 (212)
T COG3531          84 GFVYDSEPA   92 (212)
T ss_pred             CccccccHH
Confidence            999998753


No 61 
>cd02992 PDI_a_QSOX PDIa family, Quiescin-sulfhydryl oxidase (QSOX) subfamily; QSOX is a eukaryotic protein containing an N-terminal redox active TRX domain, similar to that of PDI, and a small C-terminal flavin adenine dinucleotide (FAD)-binding domain homologous to the yeast ERV1p protein. QSOX oxidizes thiol groups to disulfides like PDI, however, unlike PDI, this oxidation is accompanied by the reduction of oxygen to hydrogen peroxide. QSOX is localized in high concentrations in cells with heavy secretory load and prefers peptides and proteins as substrates, not monothiols like glutathione. Inside the cell, QSOX is found in the endoplasmic reticulum and Golgi. The flow of reducing equivalents in a QSOX-catalyzed reaction goes from the dithiol substrate - dithiol of the QSOX TRX domain - dithiols of the QSOX ERV1p domain - FAD - oxygen.
Probab=25.72  E-value=1.6e+02  Score=25.51  Aligned_cols=50  Identities=18%  Similarity=0.268  Sum_probs=30.7

Q ss_pred             HHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCC-eEEEe
Q psy16054        433 HNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRND-VYFVT  482 (563)
Q Consensus       433 ~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~d-VwfvT  482 (563)
                      ++|+.....++.+..+.||++|-..-..+...|+++.+.+...++ |.|+.
T Consensus         9 ~~f~~~i~~~~~~vvV~f~a~wC~~C~~~~~~~~~la~~~~~~~~~v~~~~   59 (114)
T cd02992           9 ASFNSALLGSPSAWLVEFYASWCGHCRAFAPTWKKLARDLRKWRPVVRVAA   59 (114)
T ss_pred             HhHHHHHhcCCCeEEEEEECCCCHHHHHHhHHHHHHHHHHHhcCCceEEEE
Confidence            344554455567899999999954333455666666666654444 65554


No 62 
>PF13788 DUF4180:  Domain of unknown function (DUF4180)
Probab=24.02  E-value=88  Score=27.65  Aligned_cols=30  Identities=30%  Similarity=0.485  Sum_probs=23.8

Q ss_pred             HHHHHHHHHHHHHcCCCeEEEeHH-HHHHHh
Q psy16054        461 FKDELIKFIEEMLDRNDVYFVTML-QVIQWM  490 (563)
Q Consensus       461 r~~al~~fl~~i~~~~dVwfvT~~-ei~~w~  490 (563)
                      .-++|+.|+-+-..-.+|||+.-. ++++|+
T Consensus        83 ~S~~l~dfi~EsN~G~~~~F~~~~~eA~~~L  113 (113)
T PF13788_consen   83 TSKSLRDFIYESNRGNHFFFVPDEEEAIAWL  113 (113)
T ss_pred             cchhHHHHHHHhcCCCeEEEECCHHHHHhhC
Confidence            457899999999999999999654 455574


No 63 
>cd02334 ZZ_dystrophin Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
Probab=23.73  E-value=94  Score=23.13  Aligned_cols=36  Identities=22%  Similarity=0.185  Sum_probs=28.7

Q ss_pred             CceEEEEeccCCCHHHHHHHHHcCCeecccCCCCCC
Q psy16054        253 IKGTFFVSHKYTNYASVQELHRKGHEISVFSLTHKD  288 (563)
Q Consensus       253 ikaTFFv~~~~~~y~~v~~i~~~GHEIa~H~~~H~~  288 (563)
                      |.+.-|-...-.+|++-+.-+..|.+.+.|..+|..
T Consensus        11 i~g~RykC~~C~d~DLC~~Cf~~g~~~~~H~~~Hp~   46 (49)
T cd02334          11 ITGFRYRCLKCFNYDLCQSCFFSGRTSKSHKNSHPM   46 (49)
T ss_pred             ceeeeEECCCCCCcCchHHHHhCCCcCCCCCCCCCe
Confidence            445445544556999999999999999999999974


No 64 
>TIGR02026 BchE magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase. This model respresents the cobalamin-dependent oxidative cyclase responsible for forming the distinctive E-ring of the chlorin ring system under anaerobic conditions. This step is essential in the biosynthesis of both bacteriochlorophyll and chlorophyll under anaerobic conditions (a separate enzyme, AcsF, acts under aerobic conditions). This model identifies two clades of sequences, one from photosynthetic, non-cyanobacterial bacteria and another including Synechocystis and several non-photosynthetic bacteria. The function of the Synechocystis gene is supported by gene clustering with other photosynthetic genes, so the purpose of the gene in the non-photosynthetic bacteria is uncertain. Note that homologs of this gene are not found in plants which rely solely on the aerobic cyclase.
Probab=23.11  E-value=1.9e+02  Score=32.20  Aligned_cols=62  Identities=11%  Similarity=0.149  Sum_probs=37.8

Q ss_pred             CCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHH
Q psy16054        421 IQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVI  487 (563)
Q Consensus       421 ~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~  487 (563)
                      ..+.+++.++++.++.+.|.. ++-+...++..=+. +..++.+|..|+...-   .-||..+..+.
T Consensus       410 ~m~~~El~~~~~~~~~~fy~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~---~~~~~~~~~~~  471 (497)
T TIGR02026       410 HMPRWEILLGVKLNYIRFYMR-KALWRYPFQPNKFS-RRYMLGCLKAFLKAGV---GRWFYDLGRRV  471 (497)
T ss_pred             CCCHHHHHHHHHHHHHHHHHH-HHHHhcccccCHHH-HHHHHHHHHHHhhccc---ceeeeecCcee
Confidence            468899999999999998875 54443444422122 2345666667766544   34666554443


No 65 
>PF13592 HTH_33:  Winged helix-turn helix
Probab=22.64  E-value=1.4e+02  Score=22.91  Aligned_cols=18  Identities=11%  Similarity=0.263  Sum_probs=14.5

Q ss_pred             hhHHHHHHHcCCeEEeec
Q psy16054        336 NKQFSMMTDQFFVYDASI  353 (563)
Q Consensus       336 ~~~~~~L~e~Gf~YDsS~  353 (563)
                      ..+.++|+++||.|-..-
T Consensus        25 ~~v~~lL~r~G~s~~kp~   42 (60)
T PF13592_consen   25 SGVYRLLKRLGFSYQKPR   42 (60)
T ss_pred             HHHHHHHHHcCCccccCC
Confidence            468999999999886544


No 66 
>cd06860 PX_SNX7_30_like The phosphoinositide binding Phox Homology domain of Sorting Nexins 7 and 30. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. This subfamily consists of SNX7, SNX30, and similar proteins. They harbor a Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature, C-terminal
Probab=22.17  E-value=1.2e+02  Score=26.74  Aligned_cols=49  Identities=18%  Similarity=0.289  Sum_probs=31.5

Q ss_pred             HHHHHHHHHHHHccCCc-ce----E-----EeeccccccCchhHHHHHHHHHHHHHcCCCeE
Q psy16054        428 GRLLRHNFNRHFSTNRA-PL----G-----LHFHASWLKSKKEFKDELIKFIEEMLDRNDVY  479 (563)
Q Consensus       428 ~~~l~~~F~~~y~~nra-p~----~-----l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVw  479 (563)
                      +.+|.+.+.+.|-+... |+    .     -.+.+.|+.   .|..+|++||..|+.+|.++
T Consensus        46 F~~L~~~L~~~~p~~~iPpLP~K~~~~~~~~~~~~~fie---~Rr~~Le~fL~~i~~hp~l~  104 (116)
T cd06860          46 FLWLRQKLEESHPTHIIPPLPEKHSVKGLLDRFSPEFVA---TRMRALHKFLNRIVEHPVLS  104 (116)
T ss_pred             HHHHHHHHHHHCCCCccCCCCCcchhhhhcccCCHHHHH---HHHHHHHHHHHHHHcCcccc
Confidence            56677777776654322 21    1     014544433   58899999999999988754


No 67 
>PF12979 DUF3863:  Domain of Unknown Function with PDB structure (DUF3863);  InterPro: IPR024334 This entry represents a functionally uncharacterised domain found in bacterial sequences. The structure of one of the sequences, A6LGL1 from SWISSPROT from Parabacteroides distasonis atcc 8503, has been determined.; PDB: 3LM3_A.
Probab=20.98  E-value=2.5e+02  Score=28.44  Aligned_cols=101  Identities=17%  Similarity=0.216  Sum_probs=48.4

Q ss_pred             HHHHhccCCCCCCCCCCceEEEEeccC-----CCHHHHHHHH-----HcCCeecccCCCCCCCcCCCCCCCHHHHHHHHH
Q psy16054        237 YDEIFNGNRQNPNGCQIKGTFFVSHKY-----TNYASVQELH-----RKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMA  306 (563)
Q Consensus       237 ~~~ll~~~r~n~n~~~ikaTFFv~~~~-----~~y~~v~~i~-----~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~  306 (563)
                      +.+.|.  +.||++   +.|+=++-..     .+|.+|++-.     +-|-||.---=-...+  |   ++.++..+||.
T Consensus        46 ~reAfa--~~nP~g---rLTWgft~nALed~~~nyr~IR~y~v~c~~k~GDevsyfPGyFpam--Y---lpR~rVnRems  115 (351)
T PF12979_consen   46 LREAFA--SNNPDG---RLTWGFTLNALEDGRPNYRQIRDYVVECQQKYGDEVSYFPGYFPAM--Y---LPRERVNREMS  115 (351)
T ss_dssp             HHHHHH--TT-TT------EEEE-HHHHH--SHHHHHHHHHHHHHHHHH--EEEE---S-HHH--H---S-HHHHHHHHH
T ss_pred             HHHHHH--hcCCCc---eEEEeeecchhhcCChhHHHHHHHHHHHHHHhCCceeeccccchhh--c---CcHHHHHHHHH
Confidence            344554  457765   6776554432     2786665533     3588887432222222  2   68899999999


Q ss_pred             HHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEE
Q psy16054        307 GGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYD  350 (563)
Q Consensus       307 ~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YD  350 (563)
                      .+.++|.++.|.. ..|+..-.-||+.  ++..-+.++.|+.-.
T Consensus       116 eAi~lis~~vg~g-yRP~si~gGFLsA--~Nl~ylaeke~Ih~a  156 (351)
T PF12979_consen  116 EAIQLISKMVGNG-YRPQSIMGGFLSA--DNLRYLAEKENIHTA  156 (351)
T ss_dssp             HHHHHHHHHH-TT----SEEE-SS--H--HHHHHHHHHS---EE
T ss_pred             HHHHHHHHHhcCC-CCcHHHHHhhcch--hhhhhhhhhcCceee
Confidence            9999999987632 2222223334443  455556777887543


No 68 
>PF02456 Adeno_IVa2:  Adenovirus IVa2 protein;  InterPro: IPR003389 Va2 protein can interact with the adenoviral packaging signal and this interaction involves DNA sequences that have previously been demonstrated to be required for packaging []. During the course of lytic infection, the adenovirus major late promoter (MLP) is induced to high levels after replication of viral DNA has started. IVa2 is a transcriptional activator of the major late promoter [].; GO: 0019083 viral transcription
Probab=20.58  E-value=55  Score=34.19  Aligned_cols=31  Identities=26%  Similarity=0.533  Sum_probs=23.7

Q ss_pred             CCCCcccCCCCCCCCCCceEEeecCCcCCCCch
Q psy16054        203 ADGTRIPGGIEPNQVPQMITITFNGAVNVDNSD  235 (563)
Q Consensus       203 ~~g~ciP~~~~~~~~pq~V~lTFDD~~~~~~~~  235 (563)
                      .+||.+|  ..+.-.|.||.+||||+..+.|++
T Consensus       151 ~~gTi~P--~t~t~~P~Fv~msy~e~t~~~Nld  181 (369)
T PF02456_consen  151 PDGTIVP--QTGTFRPKFVEMSYDEATSPENLD  181 (369)
T ss_pred             CCCeecc--ccccccccceeecHhhhCCccccC
Confidence            3466666  344568999999999999988765


No 69 
>TIGR03646 YtoQ_fam YtoQ family protein. Members of this family are uncharacterized proteins, including YtoQ from Bacillus subtilis. This family shows some sequence similarity to a family of nucleoside 2-deoxyribosyltransferases (COG3613 as iterated through CDD), but sufficiently remote that PSI-BLAST starting from YtoQ and exploring outwards does not discover the relationship.
Probab=20.44  E-value=95  Score=28.22  Aligned_cols=52  Identities=15%  Similarity=0.155  Sum_probs=34.4

Q ss_pred             HHHHHHHHHHHHcc-CCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHh
Q psy16054        428 GRLLRHNFNRHFST-NRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWM  490 (563)
Q Consensus       428 ~~~l~~~F~~~y~~-nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~  490 (563)
                      |+.|..+||.-|.. -+.|+ |.+|++-+.      .+|++.=.    .--+|.-|..|+++-+
T Consensus        87 YKQWNaAfDAg~aaAlgKpl-I~lh~~~~~------HpLKEvda----aA~avaetp~Qvv~iL  139 (144)
T TIGR03646        87 YKQWNAAFDAGYAAALGKPL-IILRPEELI------HPLKEVDN----KAQAVVETPEQAIETL  139 (144)
T ss_pred             HHHHHHHhhHHHHHHcCCCe-EEecchhcc------ccHHHHhH----HHHHHhcCHHHHHHHH
Confidence            78899999999864 45688 889966444      23444332    2336777888887643


No 70 
>COG4637 Predicted ATPase [General function prediction only]
Probab=20.13  E-value=2.4e+02  Score=29.75  Aligned_cols=60  Identities=15%  Similarity=0.112  Sum_probs=44.6

Q ss_pred             HHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHH
Q psy16054        425 EQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQV  486 (563)
Q Consensus       425 ~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei  486 (563)
                      |..+++|.-.  .+...-|.|-.|.+..|..+-.|..+.+|.+++..++.+.-||++|.+.=
T Consensus       273 DGTlRfl~l~--t~Llsp~~p~ll~ldEPE~sLHP~lL~~La~~~~sAak~sQv~VsTHS~r  332 (373)
T COG4637         273 DGTLRFLALA--TLLLSPRPPPLLLLDEPETSLHPDLLPALAELMRSAAKRSQVIVSTHSPR  332 (373)
T ss_pred             ccHHHHHHHH--HHHcCCCCCceeEecCcccccCHhHHHHHHHHHHHhhccceEEEEeCCHH
Confidence            3345554332  22333456666888888888889999999999999999999999998764


Done!