Query psy6580
Match_columns 490
No_of_seqs 308 out of 1563
Neff 6.1
Searched_HMMs 46136
Date Fri Aug 16 21:47:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6580.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6580hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1065|consensus 100.0 1.4E-78 2.9E-83 661.6 33.0 364 121-489 16-406 (805)
2 PLN02763 hydrolase, hydrolyzin 100.0 1.3E-49 2.9E-54 448.0 27.2 228 254-489 61-295 (978)
3 PRK10658 putative alpha-glucos 100.0 4.5E-48 9.8E-53 427.7 33.6 222 255-488 147-376 (665)
4 KOG1066|consensus 100.0 1.5E-46 3.2E-51 399.9 25.9 335 136-488 31-465 (915)
5 COG1501 Alpha-glucosidases, fa 100.0 4.4E-43 9.4E-48 391.7 29.2 225 254-489 140-373 (772)
6 PRK10426 alpha-glucosidase; Pr 100.0 1.8E-43 3.8E-48 390.0 24.5 218 257-488 72-320 (635)
7 PF01055 Glyco_hydro_31: Glyco 100.0 5.1E-31 1.1E-35 280.1 13.2 137 352-489 1-138 (441)
8 cd06600 GH31_MGAM-like This fa 100.0 5.3E-30 1.1E-34 262.2 11.2 117 371-489 1-118 (317)
9 cd06603 GH31_GANC_GANAB_alpha 100.0 7.1E-30 1.5E-34 263.5 11.2 117 371-489 1-118 (339)
10 cd06602 GH31_MGAM_SI_GAA This 100.0 2.5E-29 5.5E-34 259.4 11.7 119 371-489 1-123 (339)
11 cd06604 GH31_glucosidase_II_Ma 100.0 3.5E-29 7.7E-34 258.2 11.6 117 371-489 1-118 (339)
12 cd06591 GH31_xylosidase_XylS X 100.0 9.5E-29 2.1E-33 253.2 11.0 112 371-486 1-115 (319)
13 cd06599 GH31_glycosidase_Aec37 100.0 2.6E-28 5.6E-33 249.7 11.2 114 371-488 1-125 (317)
14 cd06598 GH31_transferase_CtsZ 99.9 8.2E-28 1.8E-32 246.0 11.3 115 371-489 1-123 (317)
15 cd06593 GH31_xylosidase_YicI Y 99.9 1.6E-27 3.6E-32 242.5 11.4 114 371-488 1-117 (308)
16 cd06595 GH31_xylosidase_XylS-l 99.9 8.5E-27 1.8E-31 236.0 10.2 99 370-470 1-109 (292)
17 cd06594 GH31_glucosidase_YihQ 99.9 3.1E-26 6.7E-31 234.5 12.3 114 371-489 1-124 (317)
18 cd06601 GH31_lyase_GLase GLase 99.9 2.2E-26 4.8E-31 236.8 8.9 89 371-459 1-90 (332)
19 cd06597 GH31_transferase_CtsY 99.9 5.3E-26 1.1E-30 234.9 11.5 117 371-487 1-142 (340)
20 cd06589 GH31 The enzymes of gl 99.9 2.2E-23 4.7E-28 208.3 7.6 87 371-457 1-90 (265)
21 cd06592 GH31_glucosidase_KIAA1 99.8 1.9E-19 4.1E-24 183.4 11.3 109 374-488 12-122 (303)
22 PF13802 Gal_mutarotas_2: Gala 99.5 4E-14 8.6E-19 113.4 6.1 65 265-333 1-68 (68)
23 cd00111 Trefoil P or trefoil o 99.4 2.1E-13 4.6E-18 100.2 2.9 43 115-161 2-44 (44)
24 smart00018 PD P or trefoil or 99.4 3.6E-13 7.8E-18 99.8 3.1 44 115-162 2-45 (46)
25 PF00088 Trefoil: Trefoil (P-t 99.3 2.7E-12 5.9E-17 93.8 2.7 43 115-160 1-43 (43)
26 COG3345 GalA Alpha-galactosida 97.4 0.00089 1.9E-08 72.5 9.7 127 352-486 260-402 (687)
27 PF02065 Melibiase: Melibiase; 97.3 0.00041 8.8E-09 73.7 6.9 68 391-460 55-131 (394)
28 cd06596 GH31_CPE1046 CPE1046 i 97.0 0.00061 1.3E-08 68.1 4.1 77 371-456 1-98 (261)
29 PLN02808 alpha-galactosidase 96.8 0.0062 1.3E-07 64.5 9.7 86 370-460 26-124 (386)
30 PLN02229 alpha-galactosidase 96.7 0.0048 1E-07 66.0 8.1 87 370-460 57-155 (427)
31 PLN02692 alpha-galactosidase 96.5 0.012 2.7E-07 62.6 9.3 97 359-460 39-148 (412)
32 PLN03231 putative alpha-galact 94.4 0.11 2.4E-06 54.6 7.4 67 392-459 16-112 (357)
33 PLN02899 alpha-galactosidase 93.9 0.19 4E-06 56.0 8.0 67 392-459 46-139 (633)
34 PLN02219 probable galactinol-- 90.4 1 2.2E-05 51.4 8.6 101 351-451 160-304 (775)
35 PLN02355 probable galactinol-- 90.2 1.9 4.2E-05 49.2 10.6 102 350-451 163-312 (758)
36 PLN02711 Probable galactinol-- 83.4 37 0.0008 39.3 15.7 165 255-421 66-257 (777)
37 PLN02684 Probable galactinol-- 82.8 5 0.00011 45.9 8.6 101 351-451 163-303 (750)
38 PF14871 GHL6: Hypothetical gl 74.8 3.6 7.7E-05 37.3 3.7 43 435-485 46-88 (132)
39 PF05691 Raffinose_syn: Raffin 74.8 56 0.0012 37.9 13.9 71 351-421 156-239 (747)
40 PF13200 DUF4015: Putative gly 72.2 21 0.00046 37.1 9.1 89 384-488 3-108 (316)
41 PF07485 DUF1529: Domain of Un 62.5 29 0.00064 31.2 6.8 70 374-444 36-118 (123)
42 PF01301 Glyco_hydro_35: Glyco 58.5 17 0.00037 37.7 5.3 74 383-460 16-90 (319)
43 smart00642 Aamy Alpha-amylase 58.3 38 0.00083 31.6 7.2 64 390-455 15-92 (166)
44 cd04883 ACT_AcuB C-terminal AC 52.3 70 0.0015 24.6 6.8 56 394-450 13-69 (72)
45 TIGR02456 treS_nterm trehalose 50.0 87 0.0019 34.8 9.5 87 390-481 24-127 (539)
46 PLN00196 alpha-amylase; Provis 49.5 61 0.0013 35.1 8.0 60 392-453 42-112 (428)
47 cd06416 GH25_Lys1-like Lys-1 i 47.6 30 0.00066 32.8 4.8 72 383-456 58-133 (196)
48 PRK10933 trehalose-6-phosphate 44.9 49 0.0011 37.0 6.5 63 391-455 30-103 (551)
49 PF08924 DUF1906: Domain of un 43.5 34 0.00073 31.1 4.1 69 375-451 64-134 (136)
50 KOG3111|consensus 42.0 60 0.0013 31.7 5.6 72 362-459 53-125 (224)
51 PF10096 DUF2334: Uncharacteri 39.3 97 0.0021 30.8 7.0 63 390-453 12-75 (243)
52 PRK09441 cytoplasmic alpha-amy 35.9 78 0.0017 34.6 6.3 59 393-453 21-101 (479)
53 PF00128 Alpha-amylase: Alpha 33.9 43 0.00094 32.8 3.6 61 393-455 3-74 (316)
54 PRK08883 ribulose-phosphate 3- 33.8 67 0.0014 31.6 4.8 35 425-459 84-119 (220)
55 PF01373 Glyco_hydro_14: Glyco 33.6 1.7E+02 0.0037 31.6 8.0 85 394-484 16-111 (402)
56 PF12138 Spherulin4: Spherulat 32.7 57 0.0012 32.8 4.2 75 378-459 68-152 (253)
57 PRK12563 sulfate adenylyltrans 32.5 1.3E+02 0.0028 31.3 6.9 73 360-453 27-101 (312)
58 PLN02645 phosphoglycolate phos 32.5 1.4E+02 0.003 30.6 7.1 51 392-454 17-68 (311)
59 PRK15447 putative protease; Pr 32.2 78 0.0017 32.5 5.2 57 394-457 15-71 (301)
60 PF10566 Glyco_hydro_97: Glyco 31.7 1.5E+02 0.0031 30.4 6.9 64 391-456 29-96 (273)
61 TIGR02764 spore_ybaN_pdaB poly 31.5 1.7E+02 0.0036 27.4 7.1 19 434-452 169-187 (191)
62 TIGR02401 trehalose_TreY malto 31.3 1.9E+02 0.004 34.3 8.5 81 392-474 14-106 (825)
63 PRK10785 maltodextrin glucosid 31.2 1E+02 0.0022 34.9 6.3 63 391-455 176-248 (598)
64 COG0036 Rpe Pentose-5-phosphat 30.9 86 0.0019 31.0 5.0 35 425-459 87-122 (220)
65 PF13344 Hydrolase_6: Haloacid 30.3 95 0.002 26.4 4.6 27 430-456 14-40 (101)
66 TIGR00559 pdxJ pyridoxine 5'-p 29.6 44 0.00095 33.4 2.7 29 430-458 107-135 (237)
67 PRK05265 pyridoxine 5'-phospha 28.8 45 0.00098 33.4 2.7 29 430-458 110-138 (239)
68 TIGR02403 trehalose_treC alpha 28.4 1.9E+02 0.0042 32.2 7.8 77 391-472 24-111 (543)
69 PF02638 DUF187: Glycosyl hydr 27.9 1.9E+02 0.0042 29.8 7.3 65 392-458 17-95 (311)
70 PF04468 PSP1: PSP1 C-terminal 27.8 2.2E+02 0.0047 23.8 6.3 50 394-445 26-78 (88)
71 cd00003 PNPsynthase Pyridoxine 27.7 48 0.001 33.1 2.6 29 430-458 107-135 (234)
72 COG0366 AmyA Glycosidases [Car 27.5 2.1E+02 0.0046 30.5 7.8 78 390-472 25-113 (505)
73 PLN02361 alpha-amylase 27.3 1.4E+02 0.0029 32.3 6.2 58 394-453 29-96 (401)
74 PRK06769 hypothetical protein; 27.2 1.1E+02 0.0023 28.5 4.8 46 409-454 3-52 (173)
75 PRK08745 ribulose-phosphate 3- 26.4 1.1E+02 0.0024 30.2 4.9 35 425-459 88-123 (223)
76 cd02742 GH20_hexosaminidase Be 25.5 2.2E+02 0.0048 29.1 7.2 64 385-454 8-93 (303)
77 PF03537 Glyco_hydro_114: Glyc 25.2 83 0.0018 25.4 3.2 24 434-457 37-60 (74)
78 TIGR01459 HAD-SF-IIA-hyp4 HAD- 24.5 1.3E+02 0.0027 29.4 5.0 39 410-454 8-48 (242)
79 COG0647 NagD Predicted sugar p 24.1 1.3E+02 0.0029 30.6 5.1 43 410-457 8-51 (269)
80 PF13546 DDE_5: DDE superfamil 23.9 2.2E+02 0.0049 28.2 6.7 49 394-455 170-218 (273)
81 PRK13745 anaerobic sulfatase-m 23.5 2.3E+02 0.005 30.2 7.1 48 391-450 150-198 (412)
82 TIGR01656 Histidinol-ppas hist 23.2 1.2E+02 0.0026 27.1 4.3 25 430-454 27-51 (147)
83 PRK11767 SpoVR family protein; 23.2 1.4E+02 0.003 33.1 5.3 47 358-416 44-92 (498)
84 PF03943 TAP_C: TAP C-terminal 22.9 36 0.00078 25.7 0.6 49 360-413 1-50 (51)
85 cd04902 ACT_3PGDH-xct C-termin 22.6 3.2E+02 0.007 20.6 6.1 59 394-453 11-71 (73)
86 TIGR01668 YqeG_hyp_ppase HAD s 22.5 1.5E+02 0.0032 27.4 4.8 43 408-454 23-67 (170)
87 PF02449 Glyco_hydro_42: Beta- 22.5 83 0.0018 33.0 3.4 58 393-454 9-68 (374)
88 TIGR02039 CysD sulfate adenyly 22.1 2.5E+02 0.0054 29.0 6.7 73 360-453 9-83 (294)
89 TIGR00262 trpA tryptophan synt 22.0 2.4E+02 0.0053 28.2 6.5 68 391-458 69-152 (256)
90 smart00354 HTH_LACI helix_turn 21.9 1.2E+02 0.0027 23.8 3.6 50 6-55 3-54 (70)
91 PF03740 PdxJ: Pyridoxal phosp 21.6 55 0.0012 32.8 1.8 29 430-458 108-136 (239)
92 KOG3974|consensus 20.7 1.5E+02 0.0033 30.3 4.6 58 352-410 62-135 (306)
93 PRK14511 maltooligosyl trehalo 20.6 4E+02 0.0087 31.8 8.7 79 392-473 18-109 (879)
94 PF14871 GHL6: Hypothetical gl 20.5 1.6E+02 0.0034 26.7 4.3 23 394-417 110-132 (132)
No 1
>KOG1065|consensus
Probab=100.00 E-value=1.4e-78 Score=661.56 Aligned_cols=364 Identities=43% Similarity=0.772 Sum_probs=337.1
Q ss_pred CCCccccccCCCCCCHhhhccCCeeeecC-CCCCCCeeecCCCCCcceEEEEeecCceEEEEEec--cCCCCCCCCCcce
Q psy6580 121 PDKERFDCFPNGQVTEESCTARGCCWSIS-NNSKVPACFYPHGLQSYKVVHIDKHSYGLDVYWKN--TIKSPYGSDVQML 197 (490)
Q Consensus 121 ~~~~R~dC~P~~~~t~~~C~~rgCcw~~~-~~~g~P~C~~p~~~~gY~v~~v~~t~~G~~a~L~r--~~~~~yg~DI~~L 197 (490)
+...|++|.|+...++.+|+.|+|||.+. ...+.|||||+.++ ||.+.+..+..+|++|.|++ ..++.||.|++.+
T Consensus 16 ~~~~ri~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-gY~~~~~~~~t~~~~a~l~~~~~~~~~~~~d~~~~ 94 (805)
T KOG1065|consen 16 SSITRINCLPELLHTKASCERRYKCPTSPYKIVLFPWCRFLIGY-GYQAESMQVSTSGQTAKLNNLDPSNPIYGTDIKSL 94 (805)
T ss_pred cccceecccchhhhhhhhhhhceeecCCCcCcccceeEEeecCC-ceEEEecccccceEEEEeeccCCCcccccCCccce
Confidence 44689999999999999999887666554 46788999999996 99999877777799999998 4789999999999
Q ss_pred EEEEEEeeccEEEEEEecCCCCeeecCCCCCC-----------C-c-------cccccCccEEeecCCCCceeEecceEE
Q psy6580 198 QMSVKFETVQRLHVKITDANATRYEPSFPEVP-----------M-F-------NNRVKSVDCLFDSRNLGGFMYSNQFIQ 258 (490)
Q Consensus 198 ~l~V~~~t~~~LrvkI~d~~~~r~evP~~~p~-----------~-~-------~~R~stg~vlfdT~~~~~lif~dqflq 258 (490)
.+.+.++|..++||+|+|.++.|||+|....+ . . ..|+++|+++|||+ .++|+|+|||||
T Consensus 95 ~l~~~~~T~~r~~i~i~d~~~~r~e~p~~~~~~~~~~s~~~~~~~s~~pF~f~v~R~s~g~~lfdTs-~~~liF~DQflq 173 (805)
T KOG1065|consen 95 LLTVSYETVERLHIKITDLNNQRWEVPHVTIRRVEGRSHNSPVSFSTGPFSFSVTRWSDGRVLFDTS-IGPLIFEDQFLQ 173 (805)
T ss_pred EeeeecccceeeEEeecCcCCCceecccccccceeccccCCCcccccCCeEEEEEEecCCceeeecC-CCceeeeeeeee
Confidence 99999999999999999999999999984311 0 0 12999999999999 999999999999
Q ss_pred EEeeCCCCc-EEecCCCCC-CCccCCCCeEEEeeeCCCCCC-CCCCCccccCeeeeeecCCCCeeEEEEecCcccccccc
Q psy6580 259 ISSRLSSPY-IYGLGEHRN-QFLLDTDWKTIVLWPLDGPPQ-DGVNGYGYHPFYLNLNASSGLAHGVFLRTSNALEIVLQ 335 (490)
Q Consensus 259 ist~lp~e~-iYGLGE~~~-~l~~~~~g~~~~l~n~D~~~~-~~~nlYgs~Pfy~s~~~~~g~~~Gvfl~Ns~~~~v~~~ 335 (490)
+++.||+++ +||+||+.+ ++.+..++.+++||++|..+. ...|+||+|||||+.+ ..+.+||||+.|||+|||.++
T Consensus 174 lsT~LPs~~~iyG~gE~~~~~~~~~~~~~t~~l~~rd~~p~~~~~NLYGshPfymcle-~~~~~~GVflLnSNamEv~~~ 252 (805)
T KOG1065|consen 174 LSTYLPSENEIYGLGEHTHQSYRHDMNNLTWTLFARDYSPNLPGVNLYGSHPFYLCLE-DDSGAHGVFLLNSNAMEVTLR 252 (805)
T ss_pred eeccCCCcccccccccccchhhhcccCCceeeeeecccCCCCCcccccccccceeEee-ccCCCceEEEeccCCceEEec
Confidence 999999998 999999887 556666899999999999984 7899999999999999 778899999999999999999
Q ss_pred CCCcEEEEEeCCeEEEEEEeCCCHHHHHHHHHHhcCCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEe
Q psy6580 336 PTPAITYRVLGGILDFYYFLGPKPGDVISQYLDLIGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWI 415 (490)
Q Consensus 336 ~~~~lt~~~~gG~LD~y~f~GptP~~Vi~qYt~LtGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~ 415 (490)
+.+.++||++||++|+|||+||+|+.|++||.++||+|+|||||+||||+|||||++.+++++++++++++|||+|++|+
T Consensus 253 p~p~L~YR~iGGilD~y~flGptPe~vvqQy~q~iG~P~m~pYWslGf~~~RwgY~nls~~~dvv~~~~~agiPld~~~~ 332 (805)
T KOG1065|consen 253 PGPSLTYRTIGGILDFYVFLGPTPEGVVQQYLQLIGRPAMPPYWSLGFQLCRWGYKNLSVVRDVVENYRAAGIPLDVIVI 332 (805)
T ss_pred cCCeeEEEeecceEEEEEecCCChHHHHHHHHHHhCCccCCchhhccceecccccccHHHHHHHHHHHHHcCCCcceeee
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCc-eecC
Q psy6580 416 DIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLP-AEGK 489 (490)
Q Consensus 416 Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~-y~G~ 489 (490)
|||||++++|||+| ..||++++|++.||++|+|+|+|+||+|+.+. .|.+|++|.++++||++.+|++ ++|+
T Consensus 333 DiDyMd~ykDFTvd~~~fp~~~~fv~~Lh~~G~kyvliidP~is~~~--~y~~y~~g~~~~v~I~~~~g~~~~lg~ 406 (805)
T KOG1065|consen 333 DIDYMDGYKDFTVDKVWFPDLKDFVDDLHARGFKYVLIIDPFISTNS--SYGPYDRGVAKDVLIKNREGSPKMLGE 406 (805)
T ss_pred ehhhhhcccceeeccccCcchHHHHHHHHhCCCeEEEEeCCccccCc--cchhhhhhhhhceeeecccCchhhhcc
Confidence 99999999999999 99999999999999999999999999999876 3899999999999999999988 7764
No 2
>PLN02763 hydrolase, hydrolyzing O-glycosyl compounds
Probab=100.00 E-value=1.3e-49 Score=447.96 Aligned_cols=228 Identities=28% Similarity=0.515 Sum_probs=204.3
Q ss_pred cceEEEEeeCC-CCcEEecCCCCCCCccCCCCeEEEeeeCCCCCC--CCCCCccccCeeeeeecCCCCeeEEEEecCccc
Q psy6580 254 NQFIQISSRLS-SPYIYGLGEHRNQFLLDTDWKTIVLWPLDGPPQ--DGVNGYGYHPFYLNLNASSGLAHGVFLRTSNAL 330 (490)
Q Consensus 254 dqflqist~lp-~e~iYGLGE~~~~l~~~~~g~~~~l~n~D~~~~--~~~nlYgs~Pfy~s~~~~~g~~~Gvfl~Ns~~~ 330 (490)
+....+++.++ ++++|||||+.+.|++ +|++++|||+|+.++ ...++|++|||||+++ .++.+||||++|++.+
T Consensus 61 ~~~~~~~~~l~~~e~~YGlGE~~g~L~r--rG~~~~lwN~D~~gy~~~~~~LY~siPf~l~~~-~~g~~yGVf~dns~~~ 137 (978)
T PLN02763 61 GDQQIVTFELPSGTSFYGTGEVSGPLER--TGKRVYTWNTDAWGYGQNTTSLYQSHPWVFVVL-PNGEALGVLADTTRRC 137 (978)
T ss_pred CCEEEEEEEcCCCCeEEeCCccCCcccc--CCcEEEEEECCCCccCCCCCcccccEeEEEEEe-cCCcEEEEEEeCCCcE
Confidence 33334555565 5899999999998877 499999999998754 4578999999999987 5668999999999999
Q ss_pred cccccCCCcEEEEEeCCeEEEEEE-eC--CCHHHHHHHHHHhcCCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcC
Q psy6580 331 EIVLQPTPAITYRVLGGILDFYYF-LG--PKPGDVISQYLDLIGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAG 407 (490)
Q Consensus 331 ~v~~~~~~~lt~~~~gG~LD~y~f-~G--ptP~~Vi~qYt~LtGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~ 407 (490)
+|+++....+.|.+.+ ++|++ .| |+|++|+++|++|||+|+|||+||||||||||+|.++++|++++++||+++
T Consensus 138 ~fDlg~~~~~~f~a~~---~y~~i~~G~gptp~eVi~~Yt~LTGrp~mpP~WALGy~qSR~~Y~sq~eV~eva~~fre~~ 214 (978)
T PLN02763 138 EIDLRKESIIRIIAPA---SYPVITFGPFPSPEALLTSLSHAIGTVFMPPKWALGYQQCRWSYESAKRVAEIARTFREKK 214 (978)
T ss_pred EEEEcCCceEEEEecC---ceEEEEecCCCCHHHHHHHHHHHhCCCCCCchHHhheeeccCCCCCHHHHHHHHHHHHHcC
Confidence 9999877666777654 66664 45 799999999999999999999999999999999999999999999999999
Q ss_pred CCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCce
Q psy6580 408 IPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPA 486 (490)
Q Consensus 408 IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y 486 (490)
||||+||+|+|||+++++|||| ++||+|++|+++||++|+|+|+|+||+|+.+. .|..|++|.++++|||+++|++|
T Consensus 215 IP~DvIwlDidYm~g~~~FTwD~~rFPdP~~mv~~Lh~~G~kvv~iidPgI~~d~--gY~~y~eg~~~~~fvk~~~G~~y 292 (978)
T PLN02763 215 IPCDVVWMDIDYMDGFRCFTFDKERFPDPKGLADDLHSIGFKAIWMLDPGIKAEE--GYFVYDSGCENDVWIQTADGKPF 292 (978)
T ss_pred CCceEEEEehhhhcCCCceeECcccCCCHHHHHHHHHHCCCEEEEEEcCCCccCC--CCHHHHhHhhcCeeEECCCCCee
Confidence 9999999999999999999999 99999999999999999999999999998643 49999999999999999999999
Q ss_pred ecC
Q psy6580 487 EGK 489 (490)
Q Consensus 487 ~G~ 489 (490)
+|+
T Consensus 293 ~G~ 295 (978)
T PLN02763 293 VGE 295 (978)
T ss_pred Eee
Confidence 874
No 3
>PRK10658 putative alpha-glucosidase; Provisional
Probab=100.00 E-value=4.5e-48 Score=427.69 Aligned_cols=222 Identities=25% Similarity=0.502 Sum_probs=201.2
Q ss_pred ceEEEEeeCC-CCcEEecCCCCCCCccCCCCeEEEeeeCCCCCCCCCCCccccCeeeeeecCCCCeeEEEEecCcccccc
Q psy6580 255 QFIQISSRLS-SPYIYGLGEHRNQFLLDTDWKTIVLWPLDGPPQDGVNGYGYHPFYLNLNASSGLAHGVFLRTSNALEIV 333 (490)
Q Consensus 255 qflqist~lp-~e~iYGLGE~~~~l~~~~~g~~~~l~n~D~~~~~~~nlYgs~Pfy~s~~~~~g~~~Gvfl~Ns~~~~v~ 333 (490)
.++...+.++ +|++|||||+++.|++ +|+++.|||+|.+ ....++|+++||||+++ +||||++|++.+.|+
T Consensus 147 ~~~~~~~~l~~~E~~yGlGE~~~~l~~--~G~~~~~wn~D~~-~~~~~~Y~~iPf~~s~~-----gyGvf~dn~~~~~fd 218 (665)
T PRK10658 147 NYMREQLDLGVGETVYGLGERFTAFVK--NGQTVDIWNRDGG-TSSEQAYKNIPFYLTNR-----GYGVFVNHPQCVSFE 218 (665)
T ss_pred EEEEEEEecCCCCeEECCCccCCcccc--CCcEEEEEECCCC-CCCccccccccEEEecC-----cEEEEEcCCCceEEE
Confidence 3566666664 6899999999999886 4999999999964 45678999999999998 999999999999999
Q ss_pred ccCC--CcEEEEEeCCeEEEEEEeCCCHHHHHHHHHHhcCCCCCCCcccCCccccc-ccc-CCHHHHHHHHHHHHHcCCC
Q psy6580 334 LQPT--PAITYRVLGGILDFYYFLGPKPGDVISQYLDLIGYPELPPYWSLGFHLCR-YGY-KNLSHIQSVVDRNVKAGIP 409 (490)
Q Consensus 334 ~~~~--~~lt~~~~gG~LD~y~f~GptP~~Vi~qYt~LtGrp~mpP~WaLGfhqsr-~gY-~s~~eV~~vv~~~r~~~IP 409 (490)
++.. ..++|.++||.||+||+.||+|++|+++|++|||+|+|||+||||||||| |.+ .++++|++++++|++++||
T Consensus 219 ~g~~~~~~~~~~~~g~~ldyy~~~G~tp~~v~~~Yt~LTGrp~lpP~WalG~w~s~~~~~~~~e~~v~~~~~~~r~~~iP 298 (665)
T PRK10658 219 VGSEKVSKVQFSVEGEYLEYFVIDGPTPKEVLDRYTALTGRPALPPAWSFGLWLTTSFTTNYDEATVNSFIDGMAERDLP 298 (665)
T ss_pred EecCccceEEEEecCCcEEEEEEeCCCHHHHHHHHHHHhCCCCCCchhhhheeeecccccCCCHHHHHHHHHHHHHcCCC
Confidence 8753 46888999999999999999999999999999999999999999999999 333 3688999999999999999
Q ss_pred eeEEEeeccccCC--Ccceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCce
Q psy6580 410 LDTVWIDIDYMER--HNNFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPA 486 (490)
Q Consensus 410 lDviw~Didym~~--~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y 486 (490)
||+||+|++||++ +++|+|| ++||+|++|+++||++|+|+++|++|+|+.. ++.|+||.++|+|||+++|+++
T Consensus 299 ~d~i~lD~~w~~~~~~~~f~wd~~~FPdp~~mi~~L~~~G~k~~~~i~P~i~~~----s~~f~e~~~~gy~vk~~~G~~~ 374 (665)
T PRK10658 299 LHVFHFDCFWMKEFQWCDFEWDPRTFPDPEGMLKRLKAKGLKICVWINPYIAQK----SPLFKEGKEKGYLLKRPDGSVW 374 (665)
T ss_pred ceEEEEchhhhcCCceeeeEEChhhCCCHHHHHHHHHHCCCEEEEeccCCcCCC----chHHHHHHHCCeEEECCCCCEe
Confidence 9999999999997 4699999 9999999999999999999999999999853 7889999999999999999998
Q ss_pred ec
Q psy6580 487 EG 488 (490)
Q Consensus 487 ~G 488 (490)
.+
T Consensus 375 ~~ 376 (665)
T PRK10658 375 QW 376 (665)
T ss_pred ee
Confidence 75
No 4
>KOG1066|consensus
Probab=100.00 E-value=1.5e-46 Score=399.91 Aligned_cols=335 Identities=24% Similarity=0.442 Sum_probs=260.6
Q ss_pred HhhhccCCeeeecCCCCCCCeeecCCCCCcceEE--EEeecCceEEEEEeccCCCCCCCCCcceEEEEEEeeccEEEEEE
Q psy6580 136 EESCTARGCCWSISNNSKVPACFYPHGLQSYKVV--HIDKHSYGLDVYWKNTIKSPYGSDVQMLQMSVKFETVQRLHVKI 213 (490)
Q Consensus 136 ~~~C~~rgCcw~~~~~~g~P~C~~p~~~~gY~v~--~v~~t~~G~~a~L~r~~~~~yg~DI~~L~l~V~~~t~~~LrvkI 213 (490)
...|++-|-|=...... ++ ..+|.+. ++.....+++|.|....+.. .+.++++..-++.+|++|
T Consensus 31 FKtCeQs~FCkR~R~i~-------~~-~~~y~l~~~si~~~~~~l~a~l~~~~~~~------~l~~sl~~Lkd~~vR~~I 96 (915)
T KOG1066|consen 31 FKTCEQSGFCKRHRAIT-------PS-LTGYELLADSITHYEDVLTANLINKQNKV------LLPLSLSGLKDSTVRFQI 96 (915)
T ss_pred cccccccchhhhhhhhc-------CC-CCceeeecccceecCCceEEeeeecCCCc------eeeEEEEEecCceEEEEE
Confidence 45788777776554221 11 2367664 45566788999987533222 688899999999999999
Q ss_pred ec---CCCCeeecCCCC---CCC---c-ccccc---------------------------CccEE--eecCC--------
Q psy6580 214 TD---ANATRYEPSFPE---VPM---F-NNRVK---------------------------SVDCL--FDSRN-------- 246 (490)
Q Consensus 214 ~d---~~~~r~evP~~~---p~~---~-~~R~s---------------------------tg~vl--fdT~~-------- 246 (490)
.. +-.+||++|... |.. . .++.+ +++.+ .+..+
T Consensus 97 DE~~~~~rkRy~~~~~lv~~~~~~~~~~~~~~~t~~~~~~~~g~~~vvv~~~PF~v~~~~~~~lv~svN~~~~L~fE~~r 176 (915)
T KOG1066|consen 97 DEKESALRKRYQVPDALVSEPEEVRISVSKNDETATKIVGTNGKYKVVVTAKPFRVDFFEDDELVVSVNARGLLNFEHFR 176 (915)
T ss_pred CCcccccccccCCchhhcCcchheeEEeeecCCcccEEEEecCCeEEEEEeCCeEEEEEeCCcEEEEEcccceeehhhcc
Confidence 75 346899998642 110 0 00000 01111 01000
Q ss_pred ---------CCceeEecce-------------EEEEeeCC-CCcEEecCCCCCCCccCC---CCeEEEeeeCCCCCC---
Q psy6580 247 ---------LGGFMYSNQF-------------IQISSRLS-SPYIYGLGEHRNQFLLDT---DWKTIVLWPLDGPPQ--- 297 (490)
Q Consensus 247 ---------~~~lif~dqf-------------lqist~lp-~e~iYGLGE~~~~l~~~~---~g~~~~l~n~D~~~~--- 297 (490)
..+-+.++.| +.+-+.++ ..++||+.||..+|.++- ....|+++|.|.+.|
T Consensus 177 ~K~~~~~e~~~~g~WeE~Fk~~~DskP~Gp~SVglD~sF~~~~~vyGIPEHA~s~~Lk~T~g~~ePYRLyNlDVFEYe~~ 256 (915)
T KOG1066|consen 177 TKEEGPEETDENGFWEETFKTHHDSKPHGPESVGLDFSFVGSKNVYGIPEHADSLRLKDTSGGSEPYRLYNLDVFEYELN 256 (915)
T ss_pred ccccCccccccccchhhhhhcccCCCCCCCceeEEeEEecccceeecCcccccceEeeccCCCCCceeEeecceEEEecC
Confidence 0001111112 11223333 579999999999988852 245899999999876
Q ss_pred CCCCCccccCeeeeeecCCCCeeEEEEecCccccccccCC---------------------CcEEEEEeCCeEEEEEEeC
Q psy6580 298 DGVNGYGYHPFYLNLNASSGLAHGVFLRTSNALEIVLQPT---------------------PAITYRVLGGILDFYYFLG 356 (490)
Q Consensus 298 ~~~nlYgs~Pfy~s~~~~~g~~~Gvfl~Ns~~~~v~~~~~---------------------~~lt~~~~gG~LD~y~f~G 356 (490)
....+|||+||++++. ....-|+||.|.+.+.|++..+ ....|.++.|++|.+||+|
T Consensus 257 spmalYGSIP~m~ah~--~~r~~g~fW~NAAeTWVDi~~~t~~~~~s~~~~~~~~s~~~p~~~thw~SEsGiiDvFi~lG 334 (915)
T KOG1066|consen 257 SPMALYGSIPFMLAHG--PNRDVGIFWLNAAETWVDIQSSTTDGLFSKMVDPVSGSQKVPQTDTHWMSESGIIDVFIFLG 334 (915)
T ss_pred CcchheecccEEEecC--CCCceeeEEecchhheEEeeccccccccccccccCCCCCCCCCccceeeccCCcEEEEEEeC
Confidence 4567999999999998 3367899999999998887531 1357889999999999999
Q ss_pred CCHHHHHHHHHHhcCCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCH
Q psy6580 357 PKPGDVISQYLDLIGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGL 435 (490)
Q Consensus 357 ptP~~Vi~qYt~LtGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~ 435 (490)
|+|.+|++||++|||+|+|||..|+|||||||.|+++++|.+|-++|.+++||+|+||+||.|.++++.|||| .+||+|
T Consensus 335 P~~~Dv~~qyaaLTG~~~LPplFsiGYHQcRWNY~DE~DV~~Vd~~FDehdiP~DviWLDIEhtdgKrYFTWDk~~FP~P 414 (915)
T KOG1066|consen 335 PKPSDVFRQYAALTGTTPLPPLFSIGYHQCRWNYNDEEDVLTVDQGFDEHDIPYDVIWLDIEHTDGKRYFTWDKHKFPNP 414 (915)
T ss_pred CChhHHHHHHHhhcCCCCCCchhhcchhhccccccchhhhhhhhcCccccCCccceEEEeeeecCCceeEeeccccCCCH
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 999999
Q ss_pred HHHHHHHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCceec
Q psy6580 436 KEYVQDLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPAEG 488 (490)
Q Consensus 436 ~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y~G 488 (490)
++|+++|.++|.|+|+||||+|+.++ .|..++|++.+|++||+.+|+.|.|
T Consensus 415 ~~Ml~kLa~kgRklV~IvDPHIKkD~--~Y~v~ke~~~~gy~VKd~~G~DyeG 465 (915)
T KOG1066|consen 415 KDMLKKLASKGRKLVTIVDPHIKKDD--GYFVHKEAKDKGYYVKDRDGSDYEG 465 (915)
T ss_pred HHHHHHHHhcCCceEEEeCcccccCC--CeEEhHHhhhCCeEEEecCCCcccc
Confidence 99999999999999999999999764 4999999999999999999999988
No 5
>COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism]
Probab=100.00 E-value=4.4e-43 Score=391.70 Aligned_cols=225 Identities=31% Similarity=0.567 Sum_probs=205.0
Q ss_pred cceEEEEeeC-CCCcEEecCCCCCCCccCCCCeEEEeeeCCCCCC---CCCCCccccCeeeeeecCCCCeeEEEEecCcc
Q psy6580 254 NQFIQISSRL-SSPYIYGLGEHRNQFLLDTDWKTIVLWPLDGPPQ---DGVNGYGYHPFYLNLNASSGLAHGVFLRTSNA 329 (490)
Q Consensus 254 dqflqist~l-p~e~iYGLGE~~~~l~~~~~g~~~~l~n~D~~~~---~~~nlYgs~Pfy~s~~~~~g~~~Gvfl~Ns~~ 329 (490)
++.+...+.+ +++++|||||++.++.. +++++.+||.|.+.+ ...++|.++|||++.. +||+|++|+..
T Consensus 140 ~~~~~~~~~~~~~e~~yGlge~~~~l~~--~~~~~~~wn~D~~~~~~~~~~~~y~~iPf~~s~~-----gyg~~~~n~~~ 212 (772)
T COG1501 140 GGYVFLYFDLGGGEHVYGLGERAGGLGR--NGQTYRLWNADAGTYTGGDYDNLYASIPFYLSSR-----GYGLFVDNSAY 212 (772)
T ss_pred CccEEEEEEecCCceEeecccccccccc--CcceEEeeecCCCcccccccccccccCCEEEEcc-----ceEEEEECCCc
Confidence 3445555554 46899999999998866 488999999999865 4578999999999997 99999999999
Q ss_pred ccccccCCC--cEEEEEeCCeEEEEEEeCCCHHHHHHHHHHhcCCCCCCCcccCC-ccccccccCCHHHHHHHHHHHHHc
Q psy6580 330 LEIVLQPTP--AITYRVLGGILDFYYFLGPKPGDVISQYLDLIGYPELPPYWSLG-FHLCRYGYKNLSHIQSVVDRNVKA 406 (490)
Q Consensus 330 ~~v~~~~~~--~lt~~~~gG~LD~y~f~GptP~~Vi~qYt~LtGrp~mpP~WaLG-fhqsr~gY~s~~eV~~vv~~~r~~ 406 (490)
+.|+++... ...|.+++|.|||||+.||+|++|+++|++|||||+|||+|||| +|+||+.|.++++|.++++.++++
T Consensus 213 ~~fd~~~~~~~~~~~~~e~~~ldyyv~~G~~~~~vi~~yt~lTGkp~l~P~Wa~G~~~~~~~~~~~e~~v~~~i~~~~~~ 292 (772)
T COG1501 213 GSFDVGSEEYSYVQFSVEGGQLDYYVIAGPTPKDVLEKYTDLTGKPPLPPKWALGWLWTSRYTYYDEDEVLEFIDEMRER 292 (772)
T ss_pred eEEEcCCcceEEEEEEecCCcEEEEEEeCCCHHHHHHHHHHhhCCCCCCCceecCCCceeccccccHHHHHHHHhhcccc
Confidence 999998632 35567788899999999999999999999999999999999999 999999999999999999999999
Q ss_pred CCCeeEEEeecc-ccCCCcceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCC
Q psy6580 407 GIPLDTVWIDID-YMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGL 484 (490)
Q Consensus 407 ~IPlDviw~Did-ym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~ 484 (490)
+||+|+||+|+| ||+++++|+|| .+||+|++|+++||++|+|+++||+|+|+.+ .+.|++++++|+|||+++|+
T Consensus 293 ~IP~d~~~lD~~~~~~~~~~F~wd~~~FP~pk~mi~~l~~~Gikl~~~i~P~i~~d----~~~~~e~~~~Gy~~k~~~g~ 368 (772)
T COG1501 293 DIPLDVFVLDIDFWMDNWGDFTWDPDRFPDPKQMIAELHEKGIKLIVIINPYIKQD----SPLFKEAIEKGYFVKDPDGE 368 (772)
T ss_pred cCcceEEEEeehhhhccccceEECcccCCCHHHHHHHHHhcCceEEEEeccccccC----CchHHHHHHCCeEEECCCCC
Confidence 999999999998 99999999999 9999999999999999999999999999976 36899999999999999999
Q ss_pred ceecC
Q psy6580 485 PAEGK 489 (490)
Q Consensus 485 ~y~G~ 489 (490)
+|.+.
T Consensus 369 ~~~~~ 373 (772)
T COG1501 369 IYQAD 373 (772)
T ss_pred Eeeec
Confidence 98764
No 6
>PRK10426 alpha-glucosidase; Provisional
Probab=100.00 E-value=1.8e-43 Score=389.98 Aligned_cols=218 Identities=19% Similarity=0.237 Sum_probs=192.6
Q ss_pred EEEEee-CCCCcEEecCCCCCCCccCCCCeEEEeeeCCCCCC-------------------CCCCCccccCeeeeeecCC
Q psy6580 257 IQISSR-LSSPYIYGLGEHRNQFLLDTDWKTIVLWPLDGPPQ-------------------DGVNGYGYHPFYLNLNASS 316 (490)
Q Consensus 257 lqist~-lp~e~iYGLGE~~~~l~~~~~g~~~~l~n~D~~~~-------------------~~~nlYgs~Pfy~s~~~~~ 316 (490)
+.+.+. .++|++|||||+...|+++ |+++.||++|++.. ...++|.++|||||++
T Consensus 72 ~~~~~~~~~~E~~yGlGE~~~~l~~r--G~~~~~w~~d~~~g~~~~~~~~~~~d~~~~~gg~~~~tY~~iPf~iSs~--- 146 (635)
T PRK10426 72 IWLRLAADPDEHIYGCGEQFSYFDLR--GKPFPLWTSEQGVGRNKQTYVTWQADCKENAGGDYYWTYFPQPTFVSSQ--- 146 (635)
T ss_pred EEEEEecCCCCeEECCCcCCChHhhC--CCEEeeeecccccCcCccccccccccccccccCccccccceeCEEEEcC---
Confidence 344444 4579999999999988875 99999999998621 1257899999999988
Q ss_pred CCeeEEEEecCccccccccCCCcEEEEEeCCeEEEEEEeCCCHHHHHHHHHHhcCCCCCCCcccC-CccccccccCCHHH
Q psy6580 317 GLAHGVFLRTSNALEIVLQPTPAITYRVLGGILDFYYFLGPKPGDVISQYLDLIGYPELPPYWSL-GFHLCRYGYKNLSH 395 (490)
Q Consensus 317 g~~~Gvfl~Ns~~~~v~~~~~~~lt~~~~gG~LD~y~f~GptP~~Vi~qYt~LtGrp~mpP~WaL-Gfhqsr~gY~s~~e 395 (490)
+||||++|++.++|+++......+...+|.+|+|||.|++|++|+++|++|||||+|||+||| |+|++.. .++++
T Consensus 147 --~ygv~~dn~~~~~fd~~~~~~~~~~~~~~~~d~y~~~G~~~~~vi~~yt~ltGr~p~~P~Wal~G~~~g~~--~~~~~ 222 (635)
T PRK10426 147 --KYYCHVDNSAYMNFDFSAPEYHELELWEDKATLRFECADTYISLLEKLTALFGRQPELPDWAYDGVTLGIQ--GGTEV 222 (635)
T ss_pred --CEEEEEcCCCcEEEEecCCCccEEEEEeCCeeEEEEeCCCHHHHHHHHHHhhCCCCCCChhhccCcccccc--CCHHH
Confidence 899999999999999987666666667899999999999999999999999999999999999 9997653 36889
Q ss_pred HHHHHHHHHHcCCCeeEEEeeccccCCC---------cceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccccCCCCCCC
Q psy6580 396 IQSVVDRNVKAGIPLDTVWIDIDYMERH---------NNFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGVASREDSNY 465 (490)
Q Consensus 396 V~~vv~~~r~~~IPlDviw~Didym~~~---------~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y 465 (490)
|++++++||+++||||+||+| ||++.+ .+|+|| ++||+|++||++||++|+|+|+|++|+|+.+ |
T Consensus 223 v~~v~~~~r~~~IP~d~i~ld-dw~~~~~~~~g~~~~~~~~~d~~~FPdp~~mi~~L~~~G~k~v~~i~P~v~~~----~ 297 (635)
T PRK10426 223 VQKKLDTMRNAGVKVNGIWAQ-DWSGIRMTSFGKRLMWNWKWDSERYPQLDSRIKQLNEEGIQFLGYINPYLASD----G 297 (635)
T ss_pred HHHHHHHHHHcCCCeeEEEEe-cccccccccccccccccceEChhhCCCHHHHHHHHHHCCCEEEEEEcCccCCC----C
Confidence 999999999999999999999 887531 368999 9999999999999999999999999999863 7
Q ss_pred hhhHHHHHcccEEEcCCCCceec
Q psy6580 466 LPYVEGVEKGIFVMNSSGLPAEG 488 (490)
Q Consensus 466 ~~y~eg~e~g~fvk~~~G~~y~G 488 (490)
+.|+||.++|+|||+++|++|++
T Consensus 298 ~~y~e~~~~gy~vk~~~g~~~~~ 320 (635)
T PRK10426 298 DLCEEAAEKGYLAKDADGGDYLV 320 (635)
T ss_pred HHHHHHHHCCcEEECCCCCEEEe
Confidence 89999999999999999999875
No 7
>PF01055 Glyco_hydro_31: Glycosyl hydrolases family 31 ; InterPro: IPR000322 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 31 GH31 from CAZY comprises enzymes with several known activities; alpha-glucosidase (3.2.1.20 from EC), alpha-galactosidase (3.2.1.22 from EC); glucoamylase (3.2.1.3 from EC), sucrase-isomaltase (3.2.1.48 from EC); isomaltase (3.2.1.10 from EC); alpha-xylosidase (3.2.1 from EC); alpha-glucan lyase (4.2.2.13 from EC). Glycoside hydrolase family 31 groups a number of glycosyl hydrolases on the basis of sequence similarities [, , ] An aspartic acid has been implicated [] in the catalytic activity of sucrase, isomaltase, and lysosomal alpha-glucosidase.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3L4U_A 3L4X_A 3L4W_A 3L4V_A 3CTT_A 2QMJ_A 2QLY_A 3L4Z_A 3L4Y_A 3L4T_A ....
Probab=99.97 E-value=5.1e-31 Score=280.11 Aligned_cols=137 Identities=39% Similarity=0.785 Sum_probs=119.5
Q ss_pred EEEeCCCHHHHHHHHHHhcCCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-C
Q psy6580 352 YYFLGPKPGDVISQYLDLIGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-K 430 (490)
Q Consensus 352 y~f~GptP~~Vi~qYt~LtGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~ 430 (490)
|||.|++|++|+++|++|||||+|||+||||||+|||+|.++++|++++++|++++||+|++|+|++|+.++++|+|| +
T Consensus 1 y~~~G~~~~~v~~~y~~ltG~~~~pP~walG~~~~~~~~~~~~~v~~~i~~~~~~~iP~d~~~iD~~~~~~~~~f~~d~~ 80 (441)
T PF01055_consen 1 YFFSGPTPKEVLRQYTELTGRPPLPPRWALGFWQSRWGYYNQDEVREVIDRYRSNGIPLDVIWIDDDYQDGYGDFTWDPE 80 (441)
T ss_dssp EEEEESSHHHHHHHHHHHHSSS----GGGGSEEEEESTBTSHHHHHHHHHHHHHTT--EEEEEE-GGGSBTTBTT-B-TT
T ss_pred CEEeCcCHHHHHHHHHHHHCCCCCCchhhhceEeecCcCCCHHHHHHHHHHHHHcCCCccceeccccccccccccccccc
Confidence 799999999999999999999999999999999999999999999999999999999999999999999999999999 9
Q ss_pred CCCCHHHHHHHHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCceecC
Q psy6580 431 PFYGLKEYVQDLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPAEGK 489 (490)
Q Consensus 431 ~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y~G~ 489 (490)
+||++++|++.||++|+|+++|++|+|...++ .|+.|+++.++|+|||+++|++|.|.
T Consensus 81 ~FPd~~~~~~~l~~~G~~~~~~~~P~v~~~~~-~~~~~~~~~~~~~~v~~~~g~~~~~~ 138 (441)
T PF01055_consen 81 RFPDPKQMIDELHDQGIKVVLWVHPFVSNDSP-DYENYDEAKEKGYLVKNPDGSPYIGR 138 (441)
T ss_dssp TTTTHHHHHHHHHHTT-EEEEEEESEEETTTT-B-HHHHHHHHTT-BEBCTTSSB-EEE
T ss_pred cccchHHHHHhHhhCCcEEEEEeecccCCCCC-cchhhhhHhhcCceeecccCCccccc
Confidence 99999999999999999999999999997654 47899999999999999999998874
No 8
>cd06600 GH31_MGAM-like This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal do
Probab=99.96 E-value=5.3e-30 Score=262.22 Aligned_cols=117 Identities=38% Similarity=0.746 Sum_probs=112.3
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEE
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHF 449 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~ 449 (490)
|+|+|||+||||||||||+|.++++|++++++|++++||+|+||+|++||+++++|+|| ++||+|++||++||++|+|+
T Consensus 1 G~~~lpP~walG~~~sr~~y~~~~~v~~~~~~~~~~~iP~d~i~lD~~~~~~~~~f~~d~~~FPdp~~~i~~l~~~g~k~ 80 (317)
T cd06600 1 GKPALPPMWALGYHISRYSYYPQDKVVEVVDIMQKEGFPYDVVFLDIHYMDSYRLFTWDPYRFPEPKKLIDELHKRNVKL 80 (317)
T ss_pred CCCCCCchHHHHHHhcCCCCCCHHHHHHHHHHHHHcCCCcceEEEChhhhCCCCceeechhcCCCHHHHHHHHHHCCCEE
Confidence 89999999999999999999999999999999999999999999999999999999999 99999999999999999999
Q ss_pred EEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCceecC
Q psy6580 450 IPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPAEGK 489 (490)
Q Consensus 450 vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y~G~ 489 (490)
++|++|+|+.+. .|+.|.+|.++|+|||+.+|++++|+
T Consensus 81 ~~~~~P~i~~~~--~~~~~~~~~~~~~~v~~~~g~~~~~~ 118 (317)
T cd06600 81 VTIVDPGIRVDQ--NYSPFLSGMDKGKFCEIESGELFVGK 118 (317)
T ss_pred EEEeeccccCCC--CChHHHHHHHCCEEEECCCCCeEEEe
Confidence 999999998653 37889999999999999999998874
No 9
>cd06603 GH31_GANC_GANAB_alpha This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
Probab=99.96 E-value=7.1e-30 Score=263.47 Aligned_cols=117 Identities=34% Similarity=0.667 Sum_probs=112.3
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEE
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHF 449 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~ 449 (490)
|+|+|||+||||||||||+|.|+++|++++++|++++||+|+||+|++||.++++|+|| ++||||++||++||++|+|+
T Consensus 1 G~~~lpP~walG~~~sr~~y~~~~ev~~~~~~~~~~~iP~d~i~lD~~~~~~~~~f~~d~~~FPdp~~mi~~L~~~G~k~ 80 (339)
T cd06603 1 GRPPLPPLFSLGYHQCRWNYKDQEDVKEVDAGFDEHDIPYDVIWLDIEHTDGKRYFTWDKKKFPDPEKMQEKLASKGRKL 80 (339)
T ss_pred CCCCCCchHHHHHHhcCCCCCCHHHHHHHHHHHHHcCCCceEEEEChHHhCCCCceEeCcccCCCHHHHHHHHHHCCCEE
Confidence 89999999999999999999999999999999999999999999999999999999999 99999999999999999999
Q ss_pred EEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCceecC
Q psy6580 450 IPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPAEGK 489 (490)
Q Consensus 450 vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y~G~ 489 (490)
++|++|+|..++ .|++|+||.++|+|||+++|++|+|.
T Consensus 81 ~~~~~P~v~~~~--~~~~y~e~~~~g~~vk~~~g~~~~~~ 118 (339)
T cd06603 81 VTIVDPHIKRDD--GYYVYKEAKDKGYLVKNSDGGDFEGW 118 (339)
T ss_pred EEEecCceecCC--CCHHHHHHHHCCeEEECCCCCEEEEE
Confidence 999999998653 38899999999999999999998763
No 10
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of
Probab=99.96 E-value=2.5e-29 Score=259.42 Aligned_cols=119 Identities=50% Similarity=1.069 Sum_probs=113.2
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCH--HHHHHHHHHCCC
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGL--KEYVQDLHKEGR 447 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~--~~mv~~Lh~~G~ 447 (490)
|||+|||+||||||||||+|.++++|++++++|++++||+|+||+|++||+++++|+|| ++||+| ++||++||++|+
T Consensus 1 G~p~lpP~walG~~~s~~~y~~~~~v~~~~~~~r~~~iP~d~i~lD~~~~~~~~~f~~d~~~FPdp~~~~mi~~L~~~G~ 80 (339)
T cd06602 1 GRPAMPPYWALGFHLCRWGYKNVDEVKEVVENMRAAGIPLDVQWNDIDYMDRRRDFTLDPVRFPGLKMPEFVDELHANGQ 80 (339)
T ss_pred CCCCCCchHHhhhHhcCCCCCCHHHHHHHHHHHHHhCCCcceEEECcccccCccceecccccCCCccHHHHHHHHHHCCC
Confidence 89999999999999999999999999999999999999999999999999999999999 999999 999999999999
Q ss_pred EEEEEeCccccCCCC-CCChhhHHHHHcccEEEcCCCCceecC
Q psy6580 448 HFIPILDPGVASRED-SNYLPYVEGVEKGIFVMNSSGLPAEGK 489 (490)
Q Consensus 448 k~vlivdP~I~~~~~-~~y~~y~eg~e~g~fvk~~~G~~y~G~ 489 (490)
|+++|++|+|...+. ..|+.|+||+++|+|||+.+|++|+|.
T Consensus 81 k~~~~i~P~v~~~~~~~~~~~~~e~~~~g~~v~~~~g~~~~~~ 123 (339)
T cd06602 81 HYVPILDPAISANEPTGSYPPYDRGLEMDVFIKNDDGSPYIGK 123 (339)
T ss_pred EEEEEEeCccccCcCCCCCHHHHHHHHCCeEEECCCCCEEEEE
Confidence 999999999997532 358999999999999999999998764
No 11
>cd06604 GH31_glucosidase_II_MalA Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.
Probab=99.96 E-value=3.5e-29 Score=258.24 Aligned_cols=117 Identities=34% Similarity=0.684 Sum_probs=112.4
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEE
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHF 449 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~ 449 (490)
|||+|||+||||||||||+|.++++|+++++++++++||+|+||+|++||+++++|+|| ++||+|++|+++||++|+|+
T Consensus 1 G~p~l~P~walG~~~s~~~y~~~~~v~~~~~~~~~~~iP~d~i~lD~~~~~~~~~f~~d~~~fPdp~~m~~~l~~~g~~~ 80 (339)
T cd06604 1 GRPPLPPKWALGYQQSRWSYYPEEEVREIADEFRERDIPCDAIYLDIDYMDGYRVFTWDKERFPDPKELIKELHEQGFKV 80 (339)
T ss_pred CCCCCCchHHHhHHhcCCCCCCHHHHHHHHHHHHHhCCCcceEEECchhhCCCCceeeccccCCCHHHHHHHHHHCCCEE
Confidence 89999999999999999999999999999999999999999999999999999999999 99999999999999999999
Q ss_pred EEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCceecC
Q psy6580 450 IPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPAEGK 489 (490)
Q Consensus 450 vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y~G~ 489 (490)
++|++|+|..+. .|++|+||.++|+|||+++|++|+|+
T Consensus 81 ~~~~~P~v~~~~--~~~~~~e~~~~g~~v~~~~g~~~~~~ 118 (339)
T cd06604 81 VTIIDPGVKVDP--GYDVYEEGLENDYFVKDPDGELYIGR 118 (339)
T ss_pred EEEEeCceeCCC--CChHHHHHHHCCeEEECCCCCEEEEE
Confidence 999999998643 38899999999999999999998764
No 12
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=99.95 E-value=9.5e-29 Score=253.18 Aligned_cols=112 Identities=25% Similarity=0.408 Sum_probs=107.7
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCc--ceeec-CCCCCHHHHHHHHHHCCC
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHN--NFVLA-KPFYGLKEYVQDLHKEGR 447 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~--dFT~D-~~Fpd~~~mv~~Lh~~G~ 447 (490)
|||+|||+||||||||||+|.++++|++++++|++++||||+||+|++||.+++ +|+|| ++||+|++||++||++|+
T Consensus 1 G~~~m~P~walG~~~sr~~y~~~~ev~~~~~~~~~~~iP~d~i~lD~~~~~~~~~~~f~~d~~~FPdp~~mi~~L~~~G~ 80 (319)
T cd06591 1 GKAPMMPKWAYGFWQSKERYKTQEELLDVAKEYRKRGIPLDVIVQDWFYWPKQGWGEWKFDPERFPDPKAMVRELHEMNA 80 (319)
T ss_pred CCCCCCchHHHHHHHhcccCCCHHHHHHHHHHHHHhCCCccEEEEechhhcCCCceeEEEChhhCCCHHHHHHHHHHCCC
Confidence 899999999999999999999999999999999999999999999999999998 99999 999999999999999999
Q ss_pred EEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCce
Q psy6580 448 HFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPA 486 (490)
Q Consensus 448 k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y 486 (490)
|+++|++|+|+.. ++.|++|.++|+|||+.+|+++
T Consensus 81 kv~~~i~P~v~~~----~~~y~e~~~~g~~v~~~~g~~~ 115 (319)
T cd06591 81 ELMISIWPTFGPE----TENYKEMDEKGYLIKTDRGPRV 115 (319)
T ss_pred EEEEEecCCcCCC----ChhHHHHHHCCEEEEcCCCCee
Confidence 9999999999853 5789999999999999998765
No 13
>cd06599 GH31_glycosidase_Aec37 Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.95 E-value=2.6e-28 Score=249.74 Aligned_cols=114 Identities=25% Similarity=0.473 Sum_probs=106.2
Q ss_pred CCCCCCCcccCCccccccccCC-----HHHHHHHHHHHHHcCCCeeEEEeeccccCC----Ccceeec-CCCCCHHHHHH
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKN-----LSHIQSVVDRNVKAGIPLDTVWIDIDYMER----HNNFVLA-KPFYGLKEYVQ 440 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s-----~~eV~~vv~~~r~~~IPlDviw~Didym~~----~~dFT~D-~~Fpd~~~mv~ 440 (490)
|+|+|||+||||||||||+|.+ |++|+++++++|+++||+|+||+|++||.. +++|+|| ++||+|++||+
T Consensus 1 G~~~mpP~walG~~~sr~~Y~~~~~~~q~~v~~~~~~~r~~~iP~d~i~ld~~~~~~~~~~~~~f~~d~~~FPdp~~mi~ 80 (317)
T cd06599 1 GKPAMVPRWSLGYSGSTMYYTELDPDAQEALLEFIDKCREHDIPCDSFHLSSGYTSIEGGKRYVFNWNKDRFPDPAAFVA 80 (317)
T ss_pred CCCCCCchHHHHHHhcCCCCCCCCccHHHHHHHHHHHHHHcCCCeeEEEEeccccccCCCceeeeecCcccCCCHHHHHH
Confidence 8999999999999999999988 999999999999999999999999999975 5689999 99999999999
Q ss_pred HHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCC-ceec
Q psy6580 441 DLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGL-PAEG 488 (490)
Q Consensus 441 ~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~-~y~G 488 (490)
+||++|+|+++|++|+|... ++.|+||.++|+|||+.+|+ +++|
T Consensus 81 ~L~~~g~k~~~~i~P~i~~~----~~~y~e~~~~g~~v~~~~g~~~~~~ 125 (317)
T cd06599 81 KFHERGIRLAPNIKPGLLQD----HPRYKELKEAGAFIKPPDGREPSIG 125 (317)
T ss_pred HHHHCCCEEEEEeCCcccCC----CHHHHHHHHCCcEEEcCCCCCccee
Confidence 99999999999999999864 57899999999999998875 4554
No 14
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.95 E-value=8.2e-28 Score=246.04 Aligned_cols=115 Identities=27% Similarity=0.472 Sum_probs=106.3
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccC------CCcceeec-CCCCCHHHHHHHHH
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYME------RHNNFVLA-KPFYGLKEYVQDLH 443 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~------~~~dFT~D-~~Fpd~~~mv~~Lh 443 (490)
|+|+|||+||||||||||+|.++++|++++++|++++||+|+||+|++||. .+++|+|| ++||+|++|+++||
T Consensus 1 G~~~l~P~wa~G~~~s~~~y~~~~~v~~~~~~~~~~~iP~d~i~lD~~w~~~~~~~~~~~~f~wd~~~FPdp~~mi~~L~ 80 (317)
T cd06598 1 GRQPLPPRWALGNWASRFGYRNWQEVDDTIKTLREKDFPLDAAILDLYWFGKDIDKGHMGNLDWDRKAFPDPAGMIADLA 80 (317)
T ss_pred CCCCCCchHHHHHHHhcCCCCCHHHHHHHHHHHHHhCCCceEEEEechhhcCcccCCceeeeEeccccCCCHHHHHHHHH
Confidence 899999999999999999999999999999999999999999999999997 35699999 99999999999999
Q ss_pred HCCCEEEEEeCccccCCCCCCChhhHHHHHcccE-EEcCCCCceecC
Q psy6580 444 KEGRHFIPILDPGVASREDSNYLPYVEGVEKGIF-VMNSSGLPAEGK 489 (490)
Q Consensus 444 ~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~f-vk~~~G~~y~G~ 489 (490)
++|+|+++|++|+|..+ .+.|+|+.++|+| ++..+|++++|+
T Consensus 81 ~~G~k~~~~v~P~v~~~----~~~y~e~~~~g~l~~~~~~~~~~~~~ 123 (317)
T cd06598 81 KKGVKTIVITEPFVLKN----SKNWGEAVKAGALLKKDQGGVPTLFD 123 (317)
T ss_pred HcCCcEEEEEcCcccCC----chhHHHHHhCCCEEEECCCCCEeeee
Confidence 99999999999999864 4679999999995 555678888763
No 15
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=99.94 E-value=1.6e-27 Score=242.51 Aligned_cols=114 Identities=25% Similarity=0.432 Sum_probs=108.9
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCc--ceeec-CCCCCHHHHHHHHHHCCC
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHN--NFVLA-KPFYGLKEYVQDLHKEGR 447 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~--dFT~D-~~Fpd~~~mv~~Lh~~G~ 447 (490)
|+|+|||+||||||||||+|.++++|+++++++++++||+|++|+|++||.+++ +|+|| ++||++++|+++||++|+
T Consensus 1 G~~~l~P~wa~G~~~~~~~y~~~~~v~~~~~~~~~~~iP~d~~~lD~~w~~~~~~~~f~~d~~~FPd~~~~i~~l~~~G~ 80 (308)
T cd06593 1 GRPALPPAWSFGLWLSRSFYYDEEEVNEFADGMRERNLPCDVIHLDCFWMKEFQWCDFEFDPDRFPDPEGMLSRLKEKGF 80 (308)
T ss_pred CCCCCCchHHHHHHHHcccCCCHHHHHHHHHHHHHcCCCeeEEEEecccccCCcceeeEECcccCCCHHHHHHHHHHCCC
Confidence 899999999999999999999999999999999999999999999999998876 99999 999999999999999999
Q ss_pred EEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCceec
Q psy6580 448 HFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPAEG 488 (490)
Q Consensus 448 k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y~G 488 (490)
|+++|++|+|..+ .++|+|+.++|+|||+.+|+++.+
T Consensus 81 ~~~~~~~P~i~~~----~~~~~e~~~~g~~v~~~~g~~~~~ 117 (308)
T cd06593 81 KVCLWINPYIAQK----SPLFKEAAEKGYLVKKPDGSVWQW 117 (308)
T ss_pred eEEEEecCCCCCC----chhHHHHHHCCeEEECCCCCeeee
Confidence 9999999999864 357999999999999999998875
No 16
>cd06595 GH31_xylosidase_XylS-like This family represents an uncharacterized glycosyl hydrolase family 31 (GH31) enzyme found in bacteria and eukaryotes that is related to the XylS xylosidase of Sulfolobus solfataricus. Alpha-xylosidases catalyze the release of an alpha-xylose residue from the non-reducing end of alpha-xyloside substrates. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.94 E-value=8.5e-27 Score=236.03 Aligned_cols=99 Identities=28% Similarity=0.364 Sum_probs=91.6
Q ss_pred cCCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccC---------CCcceeec-CCCCCHHHHH
Q psy6580 370 IGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYME---------RHNNFVLA-KPFYGLKEYV 439 (490)
Q Consensus 370 tGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~---------~~~dFT~D-~~Fpd~~~mv 439 (490)
||+|+|||+||||||||||+|.|+++|++++++||+++||||+||+|+|||. .+++|||| ++||+|++||
T Consensus 1 TG~~~~~P~walG~~qsr~~y~s~~ev~~v~~~~r~~~iP~D~i~lD~dw~~~~~~~~~~~~~~~ft~d~~~FPdp~~mi 80 (292)
T cd06595 1 AGKIPLLPRYAFGNWWSRYWPYSDEEYLALMDRFKKHNIPLDVLVIDMDWHVTDIPSKYGSGWTGYSWNRKLFPDPEKLL 80 (292)
T ss_pred CCCCCCCchHHHHhHhhCCcCCCHHHHHHHHHHHHHhCCCccEEEEecccccccccccccCCcceeEEChhcCCCHHHHH
Confidence 7999999999999999999999999999999999999999999999999975 56799999 9999999999
Q ss_pred HHHHHCCCEEEEEeCccccCCCCCCChhhHH
Q psy6580 440 QDLHKEGRHFIPILDPGVASREDSNYLPYVE 470 (490)
Q Consensus 440 ~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~e 470 (490)
++||++|+|+|+|++|+|.... .++.|++
T Consensus 81 ~~Lh~~G~k~v~~v~P~~~~~~--~~~~y~~ 109 (292)
T cd06595 81 QDLHDRGLKVTLNLHPADGIRA--HEDQYPE 109 (292)
T ss_pred HHHHHCCCEEEEEeCCCcccCC--CcHHHHH
Confidence 9999999999999999987543 2456776
No 17
>cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.
Probab=99.93 E-value=3.1e-26 Score=234.49 Aligned_cols=114 Identities=21% Similarity=0.411 Sum_probs=103.9
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCC---------Ccceeec-CCCCCHHHHHH
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMER---------HNNFVLA-KPFYGLKEYVQ 440 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~---------~~dFT~D-~~Fpd~~~mv~ 440 (490)
|+++|||+||||+||||+.+ |+++|++++++|++++||||+||+| |||+. .++|||| ++||+|++||+
T Consensus 1 G~~p~~P~wa~G~~~~~~~~-s~~~v~~~~~~~~~~~iP~d~i~ld-dw~~~~~~~~g~~~~~~f~~d~~~FPdp~~mi~ 78 (317)
T cd06594 1 GRQPELPDWAYGGAILGLQG-GTDKVLEALEKARAAGVKVAGLWLQ-DWTGRRETSFGDRLWWNWEWDPERYPGLDELIE 78 (317)
T ss_pred CCCCCCchhhhCcEEeeeeC-CHHHHHHHHHHHHHcCCCeeEEEEc-cccCcccccccceeeeeeEEChhhCCCHHHHHH
Confidence 89999999999999999865 9999999999999999999999999 88863 2379999 99999999999
Q ss_pred HHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCceecC
Q psy6580 441 DLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPAEGK 489 (490)
Q Consensus 441 ~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y~G~ 489 (490)
+||++|+|+|+||+|+|+.+. ...|++|.++|+|||+++|++|++.
T Consensus 79 ~Lh~~G~~~~~~i~P~v~~~~---~~~y~~~~~~g~~vk~~~g~~~~~~ 124 (317)
T cd06594 79 ELKARGIRVLTYINPYLADDG---PLYYEEAKDAGYLVKDADGSPYLVD 124 (317)
T ss_pred HHHHCCCEEEEEecCceecCC---chhHHHHHHCCeEEECCCCCeeeec
Confidence 999999999999999999753 2237999999999999999999874
No 18
>cd06601 GH31_lyase_GLase GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 (GH31) enzymes that degrade alpha-1,4-glucans and maltooligosaccharides via a nonhydrolytic pathway to yield 1,5-D-anhydrofructose from the nonreducing end. GLases cleave the bond between C1 and O1 of the nonreducing sugar residue of alpha-glucans to generate a monosaccharide product with a double bond between C1 and C2. This family corresponds to subgroup 2 in the Ernst et al classification of GH31 enzymes.
Probab=99.93 E-value=2.2e-26 Score=236.76 Aligned_cols=89 Identities=31% Similarity=0.572 Sum_probs=87.9
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEE
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHF 449 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~ 449 (490)
|+|+|||+||||||||||+|.++++|++|+++||+++||+|+||+|+|||+++++|||| ++||+|++|+++||++|+|+
T Consensus 1 G~p~mpP~WalG~~qsr~~Y~~~~ev~~v~~~~r~~~IP~D~i~lDidy~~~~~~Ft~d~~~FPdp~~mv~~L~~~G~kl 80 (332)
T cd06601 1 GRSKLKPRYALGFHQGCYGYSNRSDLEEVVEGYRDNNIPLDGLHVDVDFQDNYRTFTTNGGGFPNPKEMFDNLHNKGLKC 80 (332)
T ss_pred CCCCCCchHHhhhhhCCCCCCCHHHHHHHHHHHHHcCCCCceEEEcCchhcCCCceeecCCCCCCHHHHHHHHHHCCCeE
Confidence 89999999999999999999999999999999999999999999999999999999999 99999999999999999999
Q ss_pred EEEeCccccC
Q psy6580 450 IPILDPGVAS 459 (490)
Q Consensus 450 vlivdP~I~~ 459 (490)
|+|++|+|+.
T Consensus 81 v~~i~P~i~~ 90 (332)
T cd06601 81 STNITPVISY 90 (332)
T ss_pred EEEecCceec
Confidence 9999999985
No 19
>cd06597 GH31_transferase_CtsY CtsY (cyclic tetrasaccharide-synthesizing enzyme Y) is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.93 E-value=5.3e-26 Score=234.87 Aligned_cols=117 Identities=21% Similarity=0.379 Sum_probs=101.6
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEee-------------ccccC--C-----Ccceeec-
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWID-------------IDYME--R-----HNNFVLA- 429 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~D-------------idym~--~-----~~dFT~D- 429 (490)
|||+|||+||||||||||+|.++++|++++++|++++||||+||+| |+||. + +.+|+|+
T Consensus 1 G~p~lpP~walG~~~sr~~Y~~~~ev~~v~~~~~~~~iP~d~i~lD~W~~~~~~~~w~d~~y~~~~~~~~~~~~~~~f~~ 80 (340)
T cd06597 1 GKPELLPKWAFGLWMSANEWDTQAEVMRQMDAHEEHGIPVTVVVIEQWSDEATFYVFNDAQYTPKDGGAPLSYDDFSFPV 80 (340)
T ss_pred CCCCCCchHHhhhhhhccCCCCHHHHHHHHHHHHHcCCCeeEEEEecccCcceeeeeccchhcccccCCcceecccccCc
Confidence 8999999999999999999999999999999999999999998775 77773 3 3455565
Q ss_pred -CCCCCHHHHHHHHHHCCCEEEEEeCccccCCCC---CCChhhHHHHHcccEEEcCCCCcee
Q psy6580 430 -KPFYGLKEYVQDLHKEGRHFIPILDPGVASRED---SNYLPYVEGVEKGIFVMNSSGLPAE 487 (490)
Q Consensus 430 -~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~~~---~~y~~y~eg~e~g~fvk~~~G~~y~ 487 (490)
++||+|++||++||++|+|+++|++|+|..... ..+..|.++.++|+|||+.+|++|.
T Consensus 81 ~~~FPdp~~mi~~Lh~~G~kv~l~v~P~i~~~~~~~~~~~~~~~~~~~~g~~vk~~~G~~~~ 142 (340)
T cd06597 81 EGRWPNPKGMIDELHEQGVKVLLWQIPIIKLRPHPHGQADNDEDYAVAQNYLVQRGVGKPYR 142 (340)
T ss_pred cccCCCHHHHHHHHHHCCCEEEEEecCccccccccccccchhHHHHHHCCEEEEcCCCCccc
Confidence 479999999999999999999999999975321 1234688999999999999999876
No 20
>cd06589 GH31 The enzymes of glycosyl hydrolase family 31 (GH31) occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes
Probab=99.88 E-value=2.2e-23 Score=208.31 Aligned_cols=87 Identities=33% Similarity=0.679 Sum_probs=86.2
Q ss_pred CCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcce--eec-CCCCCHHHHHHHHHHCCC
Q psy6580 371 GYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNF--VLA-KPFYGLKEYVQDLHKEGR 447 (490)
Q Consensus 371 Grp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dF--T~D-~~Fpd~~~mv~~Lh~~G~ 447 (490)
|+|+|||+||||||||||+|.++++|++++++|++++||+|++|+|++||.++++| +|| ++||+|++||++||++|+
T Consensus 1 G~p~~~P~wa~G~~~~~~~~~~~~~v~~~~~~~~~~~iP~d~~~lD~~~~~~~~~f~~~~d~~~Fpdp~~~i~~l~~~g~ 80 (265)
T cd06589 1 GKPALPPKWAFGYWLSRYGYGDQDKVLEVIDGMRENDIPLDGFVLDDDYTDGYGDFTFDWDAGKFPNPKSMIDELHDNGV 80 (265)
T ss_pred CCCCCCcHHHHHHHHhcCCCCCHHHHHHHHHHHHHcCCCccEEEECcccccCCceeeeecChhhCCCHHHHHHHHHHCCC
Confidence 89999999999999999999999999999999999999999999999999999999 999 999999999999999999
Q ss_pred EEEEEeCccc
Q psy6580 448 HFIPILDPGV 457 (490)
Q Consensus 448 k~vlivdP~I 457 (490)
|+++|++|+|
T Consensus 81 ~~~~~~~P~v 90 (265)
T cd06589 81 KLVLWIDPYI 90 (265)
T ss_pred EEEEEeChhH
Confidence 9999999999
No 21
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.80 E-value=1.9e-19 Score=183.41 Aligned_cols=109 Identities=23% Similarity=0.301 Sum_probs=100.0
Q ss_pred CCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEEEE
Q psy6580 374 ELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFIPI 452 (490)
Q Consensus 374 ~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vli 452 (490)
.++|.|.+ |.+...+.|+++|+++++++++++||+|+||+|++||..+++|+|| ++||+|++|+++||++|+|+++|
T Consensus 12 ~~~p~W~~--W~~~~~~~s~~~v~~~~~~~~~~~iP~d~i~iD~~w~~~~g~f~~d~~~FPdp~~mi~~l~~~G~k~~l~ 89 (303)
T cd06592 12 FRSPIWST--WARYKADINQETVLNYAQEIIDNGFPNGQIEIDDNWETCYGDFDFDPTKFPDPKGMIDQLHDLGFRVTLW 89 (303)
T ss_pred hCCCccCC--hhhhccCcCHHHHHHHHHHHHHcCCCCCeEEeCCCccccCCccccChhhCCCHHHHHHHHHHCCCeEEEE
Confidence 67999997 4445788999999999999999999999999999999999999999 99999999999999999999999
Q ss_pred eCccccCCCCCCChhhHHHHHcccEEEcCCC-Cceec
Q psy6580 453 LDPGVASREDSNYLPYVEGVEKGIFVMNSSG-LPAEG 488 (490)
Q Consensus 453 vdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G-~~y~G 488 (490)
++|+|+.+ .+.|+||.++|+|||+++| .++.|
T Consensus 90 i~P~i~~~----s~~~~e~~~~g~~vk~~~g~~~~~~ 122 (303)
T cd06592 90 VHPFINTD----SENFREAVEKGYLVSEPSGDIPALT 122 (303)
T ss_pred ECCeeCCC----CHHHHhhhhCCeEEECCCCCCCccc
Confidence 99999864 4679999999999999998 66654
No 22
>PF13802 Gal_mutarotas_2: Galactose mutarotase-like; PDB: 1XSI_F 1XSK_D 1WE5_E 2F2H_F 1XSJ_F 2G3N_C 2G3M_F 3POC_A 3MKK_A 3PHA_A ....
Probab=99.49 E-value=4e-14 Score=113.37 Aligned_cols=65 Identities=26% Similarity=0.348 Sum_probs=46.7
Q ss_pred CCcEEecCCCCCCCccCCCCeEEEeeeCCCCCCC---CCCCccccCeeeeeecCCCCeeEEEEecCcccccc
Q psy6580 265 SPYIYGLGEHRNQFLLDTDWKTIVLWPLDGPPQD---GVNGYGYHPFYLNLNASSGLAHGVFLRTSNALEIV 333 (490)
Q Consensus 265 ~e~iYGLGE~~~~l~~~~~g~~~~l~n~D~~~~~---~~nlYgs~Pfy~s~~~~~g~~~Gvfl~Ns~~~~v~ 333 (490)
+|++||||||.+.++++ |+++.+||+|...+. ..++|+++||||..+ ++.+||||++|++.+.|+
T Consensus 1 de~~yGlGE~~~~l~~~--g~~~~~~n~D~~~~~~~~~~~lY~siPf~~~~~--s~~~yGvf~dn~~~~~~D 68 (68)
T PF13802_consen 1 DERFYGLGERFGPLNRR--GKRYRLWNTDAFGYNGPSSDPLYKSIPFFIAHR--SGKGYGVFFDNPSRTFFD 68 (68)
T ss_dssp T-EEEEESS-SS-SS-T--TEEEEE-B---TSS-GTTBS-BSEEEEEEEEEE--CTCEEEEEEE-SS-EEEE
T ss_pred CCcEEcccccCCCcccC--CcEEEEEECcCCCcCCCcCccccccEeEEEEeC--CCCEEEEEEeCCCceEEC
Confidence 58999999999988875 999999999998653 679999999999776 567999999999998774
No 23
>cd00111 Trefoil P or trefoil or TFF domain; Trefoil factor family domain peptides are mucin-associated molecules, largely found in epithelia of gastrointestinal tissues. Function is not known but it was originally identified from mucosal tissues, where it may have a regulatory or structural role and has also been implicated as a growth fractor in other tissues.The domain is found in 1 to 6 copies where it occurs.
Probab=99.38 E-value=2.1e-13 Score=100.17 Aligned_cols=43 Identities=56% Similarity=1.142 Sum_probs=38.9
Q ss_pred cCcCCCCCCccccccCCCCCCHhhhccCCeeeecCCCCCCCeeecCC
Q psy6580 115 VCHRNVPDKERFDCFPNGQVTEESCTARGCCWSISNNSKVPACFYPH 161 (490)
Q Consensus 115 ~C~~~v~~~~R~dC~P~~~~t~~~C~~rgCcw~~~~~~g~P~C~~p~ 161 (490)
.|. |+.+.||||+| +++|+++|++|||||+++. .++||||||.
T Consensus 2 ~C~--v~~~~R~dCg~-~git~~~C~~rgCC~~~~~-~~~p~Cfy~~ 44 (44)
T cd00111 2 WCS--VPPSERIDCGP-PGITQEECEARGCCFDPSI-SGVPWCFYPK 44 (44)
T ss_pred ccc--CCcccCcccCC-CCCCHHHHhhCCCccCCCC-CCCccCcCcC
Confidence 576 67899999998 8999999999999999884 8999999984
No 24
>smart00018 PD P or trefoil or TFF domain. Proposed role in renewal and pathology of mucous epithelia.
Probab=99.36 E-value=3.6e-13 Score=99.85 Aligned_cols=44 Identities=52% Similarity=1.100 Sum_probs=39.4
Q ss_pred cCcCCCCCCccccccCCCCCCHhhhccCCeeeecCCCCCCCeeecCCC
Q psy6580 115 VCHRNVPDKERFDCFPNGQVTEESCTARGCCWSISNNSKVPACFYPHG 162 (490)
Q Consensus 115 ~C~~~v~~~~R~dC~P~~~~t~~~C~~rgCcw~~~~~~g~P~C~~p~~ 162 (490)
+|. |+.+.|+||+|. ++|+++|++|||||+++. .++||||||..
T Consensus 2 ~C~--v~~~~R~dCg~~-~it~~~C~~rgCC~~~~~-~~~p~Cf~~~~ 45 (46)
T smart00018 2 QCS--VPPSERINCGPP-GITEAECEARGCCFDSSI-SGVPWCFYPNT 45 (46)
T ss_pred ccc--cCcccccccCCC-CCCHHHHhhCCCccCCCC-CCCccCcCCCC
Confidence 575 688999999985 999999999999999884 89999999974
No 25
>PF00088 Trefoil: Trefoil (P-type) domain; InterPro: IPR000519 A cysteine-rich domain of approximately forty five amino-acid residues has been found in some extracellular eukaryotic proteins [, , , ]. It is known as either the 'P', 'trefoil' or 'TFF' domain, and contains six cysteines linked by three disulphide bonds with connectivity 1-5, 2-4, 3-6. This leads to a characteristic three leafed structure ('trefoil'). The P-type domain is clearly composed of three looplike regions. The central core of the domain consists of a short two-stranded antiparallel beta-sheet, which is capped by an irregular loop and forms a central hairpin (loop 3). The beta-sheet is preceded by a short alpha-helix, with majority of the remainder of the domain contained in two loops, which lie on either side of the central hairpin. This domain has been found in a variety of extracellular eukaryotic proteins [, , ], including: protein pS2 (TFF1), a protein secreted by the stomach mucosa spasmolytic polypeptide (SP) (TFF2), a protein of about 115 residues that inhibits gastrointestinal motility and gastric acid secretion intestinal trefoil factor (ITF) (TFF3) Xenopus laevis stomach proteins xP1 and xP4 xenopus integumentary mucins A.1 (FIM-A.1 or preprospasmolysin) and C.1 (FIM-C.1), proteins which may be involved in defence against microbial infections by protecting the epithelia from the external environment xenopus skin protein xp2 (or APEG) Zona pellucida sperm-binding protein B (ZP-B) intestinal sucrase-isomaltase (3.2.1.48 from EC / 3.2.1.10 from EC), a vertebrate membrane bound, multifunctional enzyme complex which hydrolyses sucrose, maltose and isomaltose lysosomal alpha-glucosidase (3.2.1.20 from EC) ; PDB: 1PS2_A 1HI7_A 2PSP_B 1PSP_A 1PCP_A 3LPO_B 3LPP_C 3TOP_B 3TON_A 1PE3_2 ....
Probab=99.25 E-value=2.7e-12 Score=93.84 Aligned_cols=43 Identities=56% Similarity=1.275 Sum_probs=33.4
Q ss_pred cCcCCCCCCccccccCCCCCCHhhhccCCeeeecCCCCCCCeeecC
Q psy6580 115 VCHRNVPDKERFDCFPNGQVTEESCTARGCCWSISNNSKVPACFYP 160 (490)
Q Consensus 115 ~C~~~v~~~~R~dC~P~~~~t~~~C~~rgCcw~~~~~~g~P~C~~p 160 (490)
+|.. |+.+.|++|+|. ++++++|++|||||+++ ..++||||||
T Consensus 1 ~C~~-v~~~~R~~Cg~~-~it~~~C~~rgCCwd~~-~~~~p~Cy~p 43 (43)
T PF00088_consen 1 TCSQ-VPPSDRIDCGPP-GITEEECEARGCCWDPS-TSGVPWCYYP 43 (43)
T ss_dssp GCHT-S-GGGSCBSSTT-T--HHHHHHCTCEE-TS-STTS-SEEES
T ss_pred CCCC-cCcccCccCCCC-CCCHHHHccCCCCCCCC-CCCCCcCcCc
Confidence 4664 788999999966 69999999999999998 6799999997
No 26
>COG3345 GalA Alpha-galactosidase [Carbohydrate transport and metabolism]
Probab=97.35 E-value=0.00089 Score=72.54 Aligned_cols=127 Identities=14% Similarity=0.235 Sum_probs=96.7
Q ss_pred EEEeCCCHHHHHHHHHHhcCCCCCCCcccC---Ccccccc-cc---CCHHHHHHHHHHHHHcCCCeeEEEeeccccCCC-
Q psy6580 352 YYFLGPKPGDVISQYLDLIGYPELPPYWSL---GFHLCRY-GY---KNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERH- 423 (490)
Q Consensus 352 y~f~GptP~~Vi~qYt~LtGrp~mpP~WaL---Gfhqsr~-gY---~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~- 423 (490)
.+..++....+.+.|++++++--+|--|+= -.+.|.| +| -+.+++.+.++..++ ++++.+.+|-.|..++
T Consensus 260 v~~edqgl~~lsq~y~~~v~~~i~~~~~~~kprPi~~nsWea~Yfd~t~e~ile~vk~akk--~gvE~FvlDDGwfg~rn 337 (687)
T COG3345 260 VVLEDQGLNGLSQKYAELVRMEIVPRPRVKKPRPIGWNSWEAYYFDFTEEEILENVKEAKK--FGVELFVLDDGWFGGRN 337 (687)
T ss_pred EEEcCCCcchHHHHHHHHHHhhcCcccccCCCCcceeeceeeeeecCCHHHHHHHHHHHhh--cCeEEEEEccccccccC
Confidence 344577889999999999998877744552 1345666 33 467888888887665 5599999999999543
Q ss_pred ------cceeec-CCCCC-HHHHHHHHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCce
Q psy6580 424 ------NNFVLA-KPFYG-LKEYVQDLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLPA 486 (490)
Q Consensus 424 ------~dFT~D-~~Fpd-~~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y 486 (490)
+||-.+ ++||. .+++++.++++|+.+-+|+-|.+...+. ..|.+ .=+++|| .+|.|.
T Consensus 338 dd~~slGDWlv~seKfPsgiE~li~~I~e~Gl~fGIWlePemvs~dS---dlfrq--HPDWvvk-~~G~p~ 402 (687)
T COG3345 338 DDLKSLGDWLVNSEKFPSGIEELIEAIAENGLIFGIWLEPEMVSEDS---DLFRQ--HPDWVVK-VNGYPL 402 (687)
T ss_pred cchhhhhceecchhhccccHHHHHHHHHHcCCccceeecchhcccch---HHHhh--CCCeEEe-cCCccc
Confidence 688889 99997 9999999999999999999999876432 33332 1278898 678764
No 27
>PF02065 Melibiase: Melibiase; InterPro: IPR000111 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycosyl hydrolase family 27, family 31 and family 36 alpha-galactosidases form the glycosyl hydrolase clan GH-D (acc_GH from CAZY), a superfamily of alpha-galactosidases, alpha-N-acetylgalactosaminidases, and isomaltodextranases which are likely to share a common catalytic mechanism and structural topology. Alpha-galactosidase (3.2.1.22 from EC) (melibiase) [] catalyzes the hydrolysis of melibiose into galactose and glucose. In man, the deficiency of this enzyme is the cause of Fabry's disease (X-linked sphingolipidosis). Alpha-galactosidase is present in a variety of organisms. There is a considerable degree of similarity in the sequence of alpha-galactosidase from various eukaryotic species. Escherichia coli alpha-galactosidase (gene melA), which requires NAD and magnesium as cofactors, is not structurally related to the eukaryotic enzymes; by contrast, an Escherichia coli plasmid encoded alpha-galactosidase (gene rafA P16551 from SWISSPROT) [] contains a region of about 50 amino acids which is similar to a domain of the eukaryotic alpha-galactosidases. Alpha-N-acetylgalactosaminidase (3.2.1.49 from EC) [] catalyzes the hydrolysis of terminal non-reducing N-acetyl-D-galactosamine residues in N-acetyl-alpha-D- galactosaminides. In man, the deficiency of this enzyme is the cause of Schindler and Kanzaki diseases. The sequence of this enzyme is highly related to that of the eukaryotic alpha-galactosidases.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1KTC_A 1KTB_A 1UAS_A 3H55_A 3H53_A 3IGU_B 3H54_A 3LRM_A 3LRL_A 3LRK_A ....
Probab=97.33 E-value=0.00041 Score=73.72 Aligned_cols=68 Identities=21% Similarity=0.266 Sum_probs=55.9
Q ss_pred CCHHHHHHHHHHHHHcCCCeeEEEeeccccC-------CCcceeec-CCCCC-HHHHHHHHHHCCCEEEEEeCccccCC
Q psy6580 391 KNLSHIQSVVDRNVKAGIPLDTVWIDIDYME-------RHNNFVLA-KPFYG-LKEYVQDLHKEGRHFIPILDPGVASR 460 (490)
Q Consensus 391 ~s~~eV~~vv~~~r~~~IPlDviw~Didym~-------~~~dFT~D-~~Fpd-~~~mv~~Lh~~G~k~vlivdP~I~~~ 460 (490)
-+.+.+++.++.+++.|+ +.+.+|..|+. ..++|..| ++||+ ++.+++.+|++|+++.+|+.|.+...
T Consensus 55 ~~e~~i~~~a~~~~~~G~--e~fviDDGW~~~r~~d~~~~GdW~~~~~kFP~Gl~~l~~~i~~~Gmk~GlW~ePe~v~~ 131 (394)
T PF02065_consen 55 ITEEKILELADAAAELGY--EYFVIDDGWFGGRDDDNAGLGDWEPDPKKFPNGLKPLADYIHSLGMKFGLWFEPEMVSP 131 (394)
T ss_dssp --HHHHHHHHHHHHHHT---SEEEE-SSSBCTESTTTSTTSBECBBTTTSTTHHHHHHHHHHHTT-EEEEEEETTEEES
T ss_pred CCHHHHHHHHHHHHHhCC--EEEEEcCccccccCCCcccCCceeEChhhhCCcHHHHHHHHHHCCCeEEEEeccccccc
Confidence 467899999999988765 78999999985 24789999 99998 99999999999999999999988643
No 28
>cd06596 GH31_CPE1046 CPE1046 is an uncharacterized Clostridium perfringens protein with a glycosyl hydrolase family 31 (GH31) domain. The domain architecture of CPE1046 and its orthologs includes a C-terminal fibronectin type 3 (FN3) domain and a coagulation factor 5/8 type C domain in addition to the GH31 domain. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=97.02 E-value=0.00061 Score=68.13 Aligned_cols=77 Identities=25% Similarity=0.304 Sum_probs=59.9
Q ss_pred CCCCCCCcccCCcc--cc----cccc---------------CCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec
Q psy6580 371 GYPELPPYWSLGFH--LC----RYGY---------------KNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA 429 (490)
Q Consensus 371 Grp~mpP~WaLGfh--qs----r~gY---------------~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D 429 (490)
|+|.|||.|||++- -| -|.+ ..+..+++|+++|+++++|+.-|.-+..|--
T Consensus 1 g~p~~~p~~a~~~g~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~y~~~~~plgw~lpndgyg~-------- 72 (261)
T cd06596 1 GRPFLLPEYALYLGDADCYNRDYWRKVPEGTSDGGLNGEKESTTDDARKVADKYKENDMPLGWMLPNDGYGC-------- 72 (261)
T ss_pred CCCccchHHHHhccchhhhccCccccCCCccccccCCCCcccchhhHHHHHHHHHhcCCCceeeccCCCCcc--------
Confidence 89999999998541 11 1222 3457899999999999999999877744433
Q ss_pred CCCCCHHHHHHHHHHCCCEEEEEeCcc
Q psy6580 430 KPFYGLKEYVQDLHKEGRHFIPILDPG 456 (490)
Q Consensus 430 ~~Fpd~~~mv~~Lh~~G~k~vlivdP~ 456 (490)
-|.+++++++.++++|++.-+|..-.
T Consensus 73 -~y~~l~~~~~~~~~~g~~~glwt~~~ 98 (261)
T cd06596 73 -GYENLKEVVDYLHANGVETGLWTQSG 98 (261)
T ss_pred -hHHHHHHHHHHHHHcCCccccccccc
Confidence 34589999999999999999997654
No 29
>PLN02808 alpha-galactosidase
Probab=96.80 E-value=0.0062 Score=64.50 Aligned_cols=86 Identities=21% Similarity=0.334 Sum_probs=68.8
Q ss_pred cCCCCCCCcccCCcccccc-cc---CCHHHHHHHHHHHHHcCCC---eeEEEeeccccC----CCcceeec-CCCCC-HH
Q psy6580 370 IGYPELPPYWSLGFHLCRY-GY---KNLSHIQSVVDRNVKAGIP---LDTVWIDIDYME----RHNNFVLA-KPFYG-LK 436 (490)
Q Consensus 370 tGrp~mpP~WaLGfhqsr~-gY---~s~~eV~~vv~~~r~~~IP---lDviw~Didym~----~~~dFT~D-~~Fpd-~~ 436 (490)
.|.+..|| +||- .| .| -+.+.+++.++.|.+.|+. .+-|.+|..|+. ..+.+..| ++||+ ++
T Consensus 26 ngla~tPp---mGWn--sW~~~~~~i~e~~i~~~a~~mv~~Gl~~~Gy~yv~iDd~W~~~~rd~~G~~~~d~~rFP~G~~ 100 (386)
T PLN02808 26 NGLGLTPQ---MGWN--SWNHFQCNINETLIKQTADAMVSSGLAALGYKYINLDDCWAELKRDSQGNLVPKASTFPSGIK 100 (386)
T ss_pred CcccCCCc---ceEE--chHHHCCCCCHHHHHHHHHHHHHcchHHhCCEEEEEcCCcCCCCcCCCCCEeeChhhcCccHH
Confidence 46677787 4542 22 12 3678999999998877765 889999999975 35789999 99996 99
Q ss_pred HHHHHHHHCCCEEEEEeCccccCC
Q psy6580 437 EYVQDLHKEGRHFIPILDPGVASR 460 (490)
Q Consensus 437 ~mv~~Lh~~G~k~vlivdP~I~~~ 460 (490)
.+.+.+|++|+|+-++.+++..+-
T Consensus 101 ~lad~iH~~GlkfGiy~~~G~~tC 124 (386)
T PLN02808 101 ALADYVHSKGLKLGIYSDAGTLTC 124 (386)
T ss_pred HHHHHHHHCCCceEEEecCCcccc
Confidence 999999999999999999987653
No 30
>PLN02229 alpha-galactosidase
Probab=96.70 E-value=0.0048 Score=66.01 Aligned_cols=87 Identities=23% Similarity=0.267 Sum_probs=67.6
Q ss_pred cCCCCCCCcccCCcccccccc---CCHHHHHHHHHHHHHcCCC---eeEEEeeccccCC----Ccceeec-CCCCC-HHH
Q psy6580 370 IGYPELPPYWSLGFHLCRYGY---KNLSHIQSVVDRNVKAGIP---LDTVWIDIDYMER----HNNFVLA-KPFYG-LKE 437 (490)
Q Consensus 370 tGrp~mpP~WaLGfhqsr~gY---~s~~eV~~vv~~~r~~~IP---lDviw~Didym~~----~~dFT~D-~~Fpd-~~~ 437 (490)
.|....|| +||.. -+.| -+++.+++.++.+.+.|+. .+-+.+|..|+.. .+.+..| ++||+ ++.
T Consensus 57 ngla~tPp---mGWnS-Wn~~~~~i~E~~i~~~ad~~v~~Gl~~~Gy~yv~iDDgW~~~~rd~~G~l~~d~~rFP~G~k~ 132 (427)
T PLN02229 57 NGLARTPQ---MGWNS-WNFFACNINETVIKETADALVSTGLADLGYIHVNIDDCWSNLKRDSKGQLVPDPKTFPSGIKL 132 (427)
T ss_pred CCccCCCC---ceEEc-hhhhCcccCHHHHHHHHHHHHHhHHHhCCCEEEEEcCCcCCCCcCCCCCEEEChhhcCCcHHH
Confidence 56677777 44321 1122 3678899999988665555 7889999989743 4678999 99995 999
Q ss_pred HHHHHHHCCCEEEEEeCccccCC
Q psy6580 438 YVQDLHKEGRHFIPILDPGVASR 460 (490)
Q Consensus 438 mv~~Lh~~G~k~vlivdP~I~~~ 460 (490)
+.+.+|++|+|+-++.++++.+-
T Consensus 133 ladyiH~~GlKfGIy~d~G~~TC 155 (427)
T PLN02229 133 LADYVHSKGLKLGIYSDAGVFTC 155 (427)
T ss_pred HHHHHHHCCCceEEeccCCCccc
Confidence 99999999999999999998754
No 31
>PLN02692 alpha-galactosidase
Probab=96.48 E-value=0.012 Score=62.64 Aligned_cols=97 Identities=22% Similarity=0.321 Sum_probs=73.3
Q ss_pred HHHHHHHHHHhcCCCCCCCcccCCcccccc-cc---CCHHHHHHHHHHHHHcCCC---eeEEEeeccccC----CCccee
Q psy6580 359 PGDVISQYLDLIGYPELPPYWSLGFHLCRY-GY---KNLSHIQSVVDRNVKAGIP---LDTVWIDIDYME----RHNNFV 427 (490)
Q Consensus 359 P~~Vi~qYt~LtGrp~mpP~WaLGfhqsr~-gY---~s~~eV~~vv~~~r~~~IP---lDviw~Didym~----~~~dFT 427 (490)
-.++......--|.+..|| +|| +.| .| -+++.+++.++.+.+.|+. .+-+.+|..|+. ..+.+.
T Consensus 39 ~~~~~~~~~~~ngla~tPp---mGW--nSW~~~~~~i~E~~i~~~ad~~~~~gl~~~Gy~yv~iDDgW~~~~rd~~G~~~ 113 (412)
T PLN02692 39 DSEILRRNLLANGLGITPP---MGW--NSWNHFSCKIDEKMIKETADALVSTGLSKLGYTYVNIDDCWAEIARDEKGNLV 113 (412)
T ss_pred hhhhhhhhcccCcCcCCCc---ceE--EchhhhCcccCHHHHHHHHHHHHhccchhcCcEEEEEcCCcCCCCCCCCCCee
Confidence 3344444444567777788 443 223 22 3678899999988777664 788999988974 456788
Q ss_pred ec-CCCCC-HHHHHHHHHHCCCEEEEEeCccccCC
Q psy6580 428 LA-KPFYG-LKEYVQDLHKEGRHFIPILDPGVASR 460 (490)
Q Consensus 428 ~D-~~Fpd-~~~mv~~Lh~~G~k~vlivdP~I~~~ 460 (490)
.| ++||+ ++.+.+.+|++|+|+-++.+++..+-
T Consensus 114 ~d~~kFP~G~k~ladyiH~~GLKfGIy~d~G~~tC 148 (412)
T PLN02692 114 PKKSTFPSGIKALADYVHSKGLKLGIYSDAGYFTC 148 (412)
T ss_pred eChhhcCCcHHHHHHHHHHCCCceEEEecCCcccc
Confidence 99 99996 99999999999999999999997753
No 32
>PLN03231 putative alpha-galactosidase; Provisional
Probab=94.41 E-value=0.11 Score=54.58 Aligned_cols=67 Identities=12% Similarity=0.115 Sum_probs=52.9
Q ss_pred CHHHHHHHHHHHHHcCC---CeeEEEeeccccC--------------------CCcceeec-CCCCC------HHHHHHH
Q psy6580 392 NLSHIQSVVDRNVKAGI---PLDTVWIDIDYME--------------------RHNNFVLA-KPFYG------LKEYVQD 441 (490)
Q Consensus 392 s~~eV~~vv~~~r~~~I---PlDviw~Didym~--------------------~~~dFT~D-~~Fpd------~~~mv~~ 441 (490)
++++|++.++ +.+.|+ -.+-|.+|..|+. .++.+.-| ++||+ ++.+.+.
T Consensus 16 ~E~~i~~~Ad-~v~~gL~~~GY~Yv~iDd~W~~~~~~g~~~~~~~~~~~~~~d~~G~l~pd~~rFPs~~~~~G~k~lADy 94 (357)
T PLN03231 16 SEEQFLENAK-IVSETLKPHGYEYVVIDYLWYRKLKHGWFKTSAKSPGYDLIDKWGRPLPDPKRWPSTTGGKGFAPIAAK 94 (357)
T ss_pred CHHHHHHHHH-HHHcchHHhCCEEEEECCcccccccccccccccccccccccCCCCCcccCcccCCCCccccCcHHHHHH
Confidence 6778888886 555442 3466777877763 35667788 99995 9999999
Q ss_pred HHHCCCEEEEEeCccccC
Q psy6580 442 LHKEGRHFIPILDPGVAS 459 (490)
Q Consensus 442 Lh~~G~k~vlivdP~I~~ 459 (490)
+|++|+|+-+.++|++.+
T Consensus 95 vHs~GLKfGIY~~~G~~t 112 (357)
T PLN03231 95 VHALGLKLGIHVMRGIST 112 (357)
T ss_pred HHhCCcceEEEecCCccc
Confidence 999999999999999875
No 33
>PLN02899 alpha-galactosidase
Probab=93.85 E-value=0.19 Score=56.04 Aligned_cols=67 Identities=13% Similarity=0.183 Sum_probs=52.1
Q ss_pred CHHHHHHHHHHHHHcC---CCeeEEEeeccccCC-----------------Ccceeec-CCCC------CHHHHHHHHHH
Q psy6580 392 NLSHIQSVVDRNVKAG---IPLDTVWIDIDYMER-----------------HNNFVLA-KPFY------GLKEYVQDLHK 444 (490)
Q Consensus 392 s~~eV~~vv~~~r~~~---IPlDviw~Didym~~-----------------~~dFT~D-~~Fp------d~~~mv~~Lh~ 444 (490)
++++|++.++ +...| .-.+-|.+|.-|+.. ++.+..| ++|| +++.+.+.+|+
T Consensus 46 ~E~~i~~~Ad-~vs~GLk~~GY~YVnIDDcW~~~~~~g~~~~s~g~~~~D~~GrLvPDp~RFPSs~~g~GmK~LADYVHs 124 (633)
T PLN02899 46 SEEEFLQNAE-IVSQRLLPFGYEYVVVDYLWYRKKVEGAYVDSLGFDVIDEWGRPIPDPGRWPSSRGGKGFTEVAEKVHA 124 (633)
T ss_pred CHHHHHHHHH-HHHcchHhhCCeEEEEccccccccccccccccccccccCCCCCCccCcccCCCCccCCCcHHHHHHHHh
Confidence 6777888776 34434 234567888888642 4567789 9999 59999999999
Q ss_pred CCCEEEEEeCccccC
Q psy6580 445 EGRHFIPILDPGVAS 459 (490)
Q Consensus 445 ~G~k~vlivdP~I~~ 459 (490)
+|+|+-+.+++++.+
T Consensus 125 kGLKFGIY~~~Gi~t 139 (633)
T PLN02899 125 MGLKFGIHVMRGIST 139 (633)
T ss_pred CCcceEEEecCCCcc
Confidence 999999999999864
No 34
>PLN02219 probable galactinol--sucrose galactosyltransferase 2
Probab=90.44 E-value=1 Score=51.40 Aligned_cols=101 Identities=20% Similarity=0.275 Sum_probs=74.2
Q ss_pred EEEEeCCCHHHHHHHHHHhcCCC---------CCCCcccCCccccccc--cC--CHHHHHHHHHHHHHcCCCeeEEEeec
Q psy6580 351 FYYFLGPKPGDVISQYLDLIGYP---------ELPPYWSLGFHLCRYG--YK--NLSHIQSVVDRNVKAGIPLDTVWIDI 417 (490)
Q Consensus 351 ~y~f~GptP~~Vi~qYt~LtGrp---------~mpP~WaLGfhqsr~g--Y~--s~~eV~~vv~~~r~~~IPlDviw~Di 417 (490)
.|+-.|.+|-++++++.+.+-+- .-+|.|-=||--|.|. |. +.+.|++.++.+.+.++|...+.+|-
T Consensus 160 v~~~~G~dPy~li~~a~~av~~h~~tf~~re~K~~p~~~D~~GWCTWdafy~dVt~~~I~~~l~~l~e~gip~~~viIDD 239 (775)
T PLN02219 160 VYMHAGTNPFEVIRQAVKAVEKHMQTFLHREKKKLPSFLDWFGWCTWDAFYTDVTAEGVDEGLKSLSEGGTPPKFLIIDD 239 (775)
T ss_pred EEEecCCCHHHHHHHHHHHHHHhcccccccccccCccccceeeEEEhhHhhccCCHHHHHHHHHHHHhCCCCceEEEEcc
Confidence 45567999999999998765211 1267888777788884 33 56789999999999999999999998
Q ss_pred cccCCCc--------------------ceeecCC---------CC-CHHHHHHHHHH-CCCEEEE
Q psy6580 418 DYMERHN--------------------NFVLAKP---------FY-GLKEYVQDLHK-EGRHFIP 451 (490)
Q Consensus 418 dym~~~~--------------------dFT~D~~---------Fp-d~~~mv~~Lh~-~G~k~vl 451 (490)
.|+.-.. .|.=+.+ || +++.+|+.+++ .|+|+|-
T Consensus 240 GwQsi~~~~~~~~~~~~~g~qf~~rL~~f~en~KF~~~~~~~~fp~Glk~~V~~iK~~~~vk~V~ 304 (775)
T PLN02219 240 GWQQIENKEKDENCVVQEGAQFATRLTGIKENAKFQKNDQKNEQVSGLKHVVDDAKQRHNVKQVY 304 (775)
T ss_pred CccccccccccccccccccchhhhhhccccccccccccccccCCCCcHHHHHHHHHhccCCcEEE
Confidence 8874211 1221233 43 78999999985 6888774
No 35
>PLN02355 probable galactinol--sucrose galactosyltransferase 1
Probab=90.25 E-value=1.9 Score=49.23 Aligned_cols=102 Identities=18% Similarity=0.266 Sum_probs=70.7
Q ss_pred EEEEEeCCCHHHHHHHHHHhcCCC---------CCCCcccCCccccccc--cC--CHHHHHHHHHHHHHcCCCeeEEEee
Q psy6580 350 DFYYFLGPKPGDVISQYLDLIGYP---------ELPPYWSLGFHLCRYG--YK--NLSHIQSVVDRNVKAGIPLDTVWID 416 (490)
Q Consensus 350 D~y~f~GptP~~Vi~qYt~LtGrp---------~mpP~WaLGfhqsr~g--Y~--s~~eV~~vv~~~r~~~IPlDviw~D 416 (490)
=+|+-.|.+|-++++++.+.+=+- .-+|--.=+|--|.|. |. +.+.|++.++.+.+.++|..-+-+|
T Consensus 163 ~v~v~~g~dpy~li~~a~~~v~~hl~tf~~re~K~~P~~ld~~GWCTW~afy~~Vt~~~I~~~l~~l~~~g~p~~~viID 242 (758)
T PLN02355 163 LVFVAAGSDPFDVITNAVKAVEKHLQTFSHRERKKMPDMLNWFGWCTWDAFYTNVTAEGVKQGLESLEKGGVTPKFVIID 242 (758)
T ss_pred EEEEEcCCCHHHHHHHHHHHHHHHhccccchhhccCCcccceeeEEehhHhhccCCHHHHHHHHHHHHhCCCCccEEEEe
Confidence 357778999999999998753322 1133222134457774 33 5678999999999999999999999
Q ss_pred ccccCCC--------------------cceeecCCCC--------------CHHHHHHHHHH-CCCEEEE
Q psy6580 417 IDYMERH--------------------NNFVLAKPFY--------------GLKEYVQDLHK-EGRHFIP 451 (490)
Q Consensus 417 idym~~~--------------------~dFT~D~~Fp--------------d~~~mv~~Lh~-~G~k~vl 451 (490)
-.|..-. ..|.=+.+|| +++.+|+.+++ .|+|+|-
T Consensus 243 DGwQs~~~d~~~~~~~~~~~~q~~~rL~~f~~n~KF~~~~~~~~~~~~~~~Glk~~V~~iK~~~~vk~V~ 312 (758)
T PLN02355 243 DGWQSVGMDPTGIECLADNSANFANRLTHIKENHKFQKNGKEGHRVDDPALGLGHIVTEIKEKHSLKYVY 312 (758)
T ss_pred ccccccccccccccccccccchhhhhhccccccccccccccccccccCCCCcHHHHHHHHHhhcCCcEEE
Confidence 8887521 0122224555 78899999986 6888874
No 36
>PLN02711 Probable galactinol--sucrose galactosyltransferase
Probab=83.42 E-value=37 Score=39.29 Aligned_cols=165 Identities=19% Similarity=0.276 Sum_probs=89.7
Q ss_pred ceEEEEeeCCC-CcEEecCCCCC-CCccCCCCeEEEeeeCCCCCCCCCCCccccCeeeeeecCCCCeeEEEEecCc-ccc
Q psy6580 255 QFIQISSRLSS-PYIYGLGEHRN-QFLLDTDWKTIVLWPLDGPPQDGVNGYGYHPFYLNLNASSGLAHGVFLRTSN-ALE 331 (490)
Q Consensus 255 qflqist~lp~-e~iYGLGE~~~-~l~~~~~g~~~~l~n~D~~~~~~~nlYgs~Pfy~s~~~~~g~~~Gvfl~Ns~-~~~ 331 (490)
-|+-.+..-|. -+++-||.-.+ .|.--++-+.| |.+.-.+..+..+---.-|+|.-...++..|-||+--.. ...
T Consensus 66 ~flG~~~~~~~srhv~~~G~l~~~rfm~~fRfK~W--Wmt~~~G~~g~dip~eTQ~ll~e~~~~~~~y~~~lP~~eg~fR 143 (777)
T PLN02711 66 SFVGFDAGEPKSRHVVPIGKLKNIRFMSIFRFKVW--WTTHWVGSNGRDVENETQMMILDKSDSGRPYVLLLPLIEGPFR 143 (777)
T ss_pred eEEeeecCCCCcceeeecccccCcEeeeeehhhhh--ccchhhcCCCCCCChhheEEEEEccCCCceEEEEEeecCCceE
Confidence 35555554444 57888886554 22111111111 222111111111111223444432124667888766433 223
Q ss_pred ccccCC--CcEEEEEeCCe---------EEEEEEeCCCHHHHHHHHHHh----cCCCC-----CCCcccCCccccccc--
Q psy6580 332 IVLQPT--PAITYRVLGGI---------LDFYYFLGPKPGDVISQYLDL----IGYPE-----LPPYWSLGFHLCRYG-- 389 (490)
Q Consensus 332 v~~~~~--~~lt~~~~gG~---------LD~y~f~GptP~~Vi~qYt~L----tGrp~-----mpP~WaLGfhqsr~g-- 389 (490)
..+++. +.+.+-++.|. -=+|+-.|.+|-++|++=.+. +|.-. -+|.+-=||--|.|.
T Consensus 144 a~Lq~~~~d~~~ic~esg~~~v~~~~~~~~~~i~~g~~Py~~i~~A~~~~~~~l~tf~~reeK~~P~~~D~fGWCTWdAf 223 (777)
T PLN02711 144 ASLQPGEDDNVDICVESGSTKVCGSEFRSVLYMHAGDDPYKLVKDAMKVVRVHLGTFKLLEEKTPPGIVDKFGWCTWDAF 223 (777)
T ss_pred EEecCCCCCcEEEEEecCCcceeccccceEEEEEcCCCHHHHHHHHHHHHHHHhcccchhhhccCCcccccceEEehhHh
Confidence 334332 34444443331 114777899999998877654 33211 245454455668884
Q ss_pred cC--CHHHHHHHHHHHHHcCCCeeEEEeeccccC
Q psy6580 390 YK--NLSHIQSVVDRNVKAGIPLDTVWIDIDYME 421 (490)
Q Consensus 390 Y~--s~~eV~~vv~~~r~~~IPlDviw~Didym~ 421 (490)
|. +.+.|.+-++.+++.|||..-+.+|-.|++
T Consensus 224 y~~Vt~egI~~gl~~L~~~Gip~~~vIIDDGWQs 257 (777)
T PLN02711 224 YLTVHPQGVWEGVKGLVDGGCPPGLVLIDDGWQS 257 (777)
T ss_pred cccCCHHHHHHHHHHHHhCCCCccEEEEcCCccc
Confidence 33 568899999999999999999999988875
No 37
>PLN02684 Probable galactinol--sucrose galactosyltransferase
Probab=82.79 E-value=5 Score=45.94 Aligned_cols=101 Identities=18% Similarity=0.276 Sum_probs=73.9
Q ss_pred EEEEeCCCHHHHHHHHHHhcCCC---------CCCCcccCCccccccc--cC--CHHHHHHHHHHHHHcCCCeeEEEeec
Q psy6580 351 FYYFLGPKPGDVISQYLDLIGYP---------ELPPYWSLGFHLCRYG--YK--NLSHIQSVVDRNVKAGIPLDTVWIDI 417 (490)
Q Consensus 351 ~y~f~GptP~~Vi~qYt~LtGrp---------~mpP~WaLGfhqsr~g--Y~--s~~eV~~vv~~~r~~~IPlDviw~Di 417 (490)
+|+-.|.+|-++|++=.+.+-+. .-+|-|-=||--|.|. |. +.+.|++.++.+.+.++|..-+-+|-
T Consensus 163 ~~v~~g~~Py~~i~~a~~~v~~~l~tf~~reeK~~P~~~D~fGWCTWdafy~dVt~~~I~~~l~~l~~~g~p~~~vIIDD 242 (750)
T PLN02684 163 LFIHAGTDPFQTITDAIRAVKLHLKSFRQRHEKKLPGIVDYFGWCTWDAFYQEVTQEGVEAGLKSLAAGGTPPKFVIIDD 242 (750)
T ss_pred EEEecCCCHHHHHHHHHHHHHHHhhccchhhhccCccccceeeEEEhhHhhccCCHHHHHHHHHHHHhCCCCceEEEEec
Confidence 47778999999998875432211 1277888777788884 33 56789999999999999999999998
Q ss_pred cccCCCc-----------------ceeec--CCCC-------CHHHHHHHHH-HCCCEEEE
Q psy6580 418 DYMERHN-----------------NFVLA--KPFY-------GLKEYVQDLH-KEGRHFIP 451 (490)
Q Consensus 418 dym~~~~-----------------dFT~D--~~Fp-------d~~~mv~~Lh-~~G~k~vl 451 (490)
.|..-.. -..|. .+|| +++.+|+.|+ +.|+|+|-
T Consensus 243 GwQs~~~d~~~~~~~~~~~q~~~rL~~f~en~KF~~~~~p~~Glk~~V~~iK~~~~vk~V~ 303 (750)
T PLN02684 243 GWQSVGGDPTVEAGDEKKEQPLLRLTGIKENEKFKKKDDPNVGIKNIVNIAKEKHGLKYVY 303 (750)
T ss_pred ccccccccccccccccccchhhhhhccCccccccccccCCCccHHHHHHHHHhhcCCcEEE
Confidence 8875221 11132 5664 7899999997 56898774
No 38
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=74.83 E-value=3.6 Score=37.31 Aligned_cols=43 Identities=23% Similarity=0.272 Sum_probs=32.9
Q ss_pred HHHHHHHHHHCCCEEEEEeCccccCCCCCCChhhHHHHHcccEEEcCCCCc
Q psy6580 435 LKEYVQDLHKEGRHFIPILDPGVASREDSNYLPYVEGVEKGIFVMNSSGLP 485 (490)
Q Consensus 435 ~~~mv~~Lh~~G~k~vlivdP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~ 485 (490)
+.++|+..|+.|++++..++...... .| -..=++++++++|+|
T Consensus 46 lge~v~a~h~~Girv~ay~~~~~d~~------~~--~~HPeW~~~~~~G~~ 88 (132)
T PF14871_consen 46 LGEQVEACHERGIRVPAYFDFSWDED------AA--ERHPEWFVRDADGRP 88 (132)
T ss_pred HHHHHHHHHHCCCEEEEEEeeecChH------HH--HhCCceeeECCCCCC
Confidence 48999999999999999999883211 11 123379999999985
No 39
>PF05691 Raffinose_syn: Raffinose synthase or seed imbibition protein Sip1; InterPro: IPR008811 This family consists of several raffinose synthase proteins, also known as seed imbibition (Sip1) proteins. Raffinose (O-alpha- D-galactopyranosyl- (1-->6)- O-alpha- D-glucopyranosyl-(1-->2)- O-beta- D-fructofuranoside) is a widespread oligosaccharide in plant seeds and other tissues. Raffinose synthase (2.4.1.82 from EC) is the key enzyme that channels sucrose into the raffinose oligosaccharide pathway [].
Probab=74.78 E-value=56 Score=37.86 Aligned_cols=71 Identities=23% Similarity=0.364 Sum_probs=53.6
Q ss_pred EEEEeCCCHHHHHHHHHHhc----C-----CCCCCCcccCCccccccc--c--CCHHHHHHHHHHHHHcCCCeeEEEeec
Q psy6580 351 FYYFLGPKPGDVISQYLDLI----G-----YPELPPYWSLGFHLCRYG--Y--KNLSHIQSVVDRNVKAGIPLDTVWIDI 417 (490)
Q Consensus 351 ~y~f~GptP~~Vi~qYt~Lt----G-----rp~mpP~WaLGfhqsr~g--Y--~s~~eV~~vv~~~r~~~IPlDviw~Di 417 (490)
+|+-.|.+|-++|++=.+.. | ...-+|-|.=||--|.|. | -+.+.|++.++.+++.|||.-.+.+|-
T Consensus 156 l~v~~g~dPy~~i~~A~~~~~~~l~tf~~r~~K~~P~~~d~lGwCTWdaf~~~v~~~~i~~~l~~L~~~gi~~~~viIDD 235 (747)
T PF05691_consen 156 LYVHAGDDPYELIREAVKAVRKHLGTFRLREEKKYPEFLDGLGWCTWDAFYQDVTEEGILEGLKSLEEGGIPPRFVIIDD 235 (747)
T ss_pred EEEeccCCHHHHHHHHHHHHHhcccccccccccchhhhhhhhccccHHHhccccCHHHHHHHHHHHHhCCCCceEEEEec
Confidence 45667889999888765542 1 112367777677778884 3 356789999999999999999999998
Q ss_pred cccC
Q psy6580 418 DYME 421 (490)
Q Consensus 418 dym~ 421 (490)
.|++
T Consensus 236 GWQ~ 239 (747)
T PF05691_consen 236 GWQS 239 (747)
T ss_pred chhc
Confidence 8874
No 40
>PF13200 DUF4015: Putative glycosyl hydrolase domain
Probab=72.22 E-value=21 Score=37.13 Aligned_cols=89 Identities=22% Similarity=0.325 Sum_probs=57.7
Q ss_pred cccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec------------CCC-CCHHHHHHHHHHCCCEEE
Q psy6580 384 HLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA------------KPF-YGLKEYVQDLHKEGRHFI 450 (490)
Q Consensus 384 hqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D------------~~F-pd~~~mv~~Lh~~G~k~v 450 (490)
..+.+.-.+.+.+.++++..++.++ .++.+|+- ++.+.-+|+ ..+ .|++++++.||++|+.+|
T Consensus 3 Ylt~~~a~~~~~~~~~~~~i~~t~l--NavVIDvK--dd~G~i~y~s~~~~~~~~ga~~~~i~D~~~l~~~l~e~gIY~I 78 (316)
T PF13200_consen 3 YLTAYSAGSPERLDKLLDLIKRTEL--NAVVIDVK--DDDGNITYDSQVPLAREIGAVKPYIKDLKALVKKLKEHGIYPI 78 (316)
T ss_pred EechhhcCCHHHHHHHHHHHHhcCC--ceEEEEEe--cCCceEEecCCCchhhhcccccccccCHHHHHHHHHHCCCEEE
Confidence 3445555677788888888888865 46666653 333333331 223 689999999999999888
Q ss_pred EEe----CccccCCCCCCChhhHHHHHcccEEEcCCCCceec
Q psy6580 451 PIL----DPGVASREDSNYLPYVEGVEKGIFVMNSSGLPAEG 488 (490)
Q Consensus 451 liv----dP~I~~~~~~~y~~y~eg~e~g~fvk~~~G~~y~G 488 (490)
..| ||.++. . .-.+-++..+|+++.+
T Consensus 79 ARIv~FkD~~la~-----------~-~pe~av~~~~G~~w~d 108 (316)
T PF13200_consen 79 ARIVVFKDPVLAE-----------A-HPEWAVKTKDGSVWRD 108 (316)
T ss_pred EEEEEecChHHhh-----------h-ChhhEEECCCCCcccC
Confidence 876 333322 1 2346667788877653
No 41
>PF07485 DUF1529: Domain of Unknown Function (DUF1259); InterPro: IPR011094 This family is the lppY/lpqO homologue family. They are related to 'probable conserved lipoproteins' LppY and LpqO from Mycobacterium bovis.
Probab=62.49 E-value=29 Score=31.16 Aligned_cols=70 Identities=13% Similarity=0.116 Sum_probs=49.0
Q ss_pred CCCCcccCCccccccccCC------------HHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHH
Q psy6580 374 ELPPYWSLGFHLCRYGYKN------------LSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQ 440 (490)
Q Consensus 374 ~mpP~WaLGfhqsr~gY~s------------~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~ 440 (490)
.++|.++++-|.+-.+..+ ++||..|++.++++||.+.++++-+=+-+- +-|=.. ..-.+|..|.+
T Consensus 36 ~~~~~~~l~~~~aF~~~~~G~A~~~Gd~vll~~EV~pvi~aL~~~GI~vtAlHNH~l~e~P-rl~ymH~~~~gdp~~lA~ 114 (123)
T PF07485_consen 36 RLPPPMGLTSWIAFEPDGDGKAMVMGDFVLLEDEVNPVISALRKNGIEVTALHNHWLFEQP-RLFYMHIWGVGDPAKLAR 114 (123)
T ss_pred EcCCccccceeEEEEECCCCcEEEeecEEecHHHHHHHHHHHHHCCceEEEEecccccCCC-CEEEEEEEecCCHHHHHH
Confidence 3677788887766655433 799999999999999999999885322222 223223 33457888888
Q ss_pred HHHH
Q psy6580 441 DLHK 444 (490)
Q Consensus 441 ~Lh~ 444 (490)
.+++
T Consensus 115 ~vr~ 118 (123)
T PF07485_consen 115 KVRA 118 (123)
T ss_pred HHHH
Confidence 8765
No 42
>PF01301 Glyco_hydro_35: Glycosyl hydrolases family 35; InterPro: IPR001944 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 35 GH35 from CAZY comprises enzymes with only one known activity; beta-galactosidase (3.2.1.23 from EC). Mammalian beta-galactosidase is a lysosomal enzyme (gene GLB1) which cleaves the terminal galactose from gangliosides, glycoproteins, and glycosaminoglycans and whose deficiency is the cause of the genetic disease Gm(1) gangliosidosis (Morquio disease type B).; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3OGS_A 3OGV_A 3OGR_A 3OG2_A 1TG7_A 1XC6_A 3THC_C 3THD_D 3D3A_A 4E8D_B ....
Probab=58.46 E-value=17 Score=37.66 Aligned_cols=74 Identities=15% Similarity=0.213 Sum_probs=48.3
Q ss_pred ccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCC-cceeecCCCCCHHHHHHHHHHCCCEEEEEeCccccCC
Q psy6580 383 FHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERH-NNFVLAKPFYGLKEYVQDLHKEGRHFIPILDPGVASR 460 (490)
Q Consensus 383 fhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~-~dFT~D~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~ 460 (490)
+|-+|.- .+..+++++++++.|+-.=.+++-+.+.... +.|.|. -..|+..|++..+++|+++++..-|+|...
T Consensus 16 ~hy~r~p---~~~W~~~l~k~ka~G~n~v~~yv~W~~he~~~g~~df~-g~~dl~~f~~~a~~~gl~vilrpGpyi~aE 90 (319)
T PF01301_consen 16 FHYFRIP---PEYWRDRLQKMKAAGLNTVSTYVPWNLHEPEEGQFDFT-GNRDLDRFLDLAQENGLYVILRPGPYICAE 90 (319)
T ss_dssp E-GGGS----GGGHHHHHHHHHHTT-SEEEEE--HHHHSSBTTB---S-GGG-HHHHHHHHHHTT-EEEEEEES---TT
T ss_pred eccccCC---hhHHHHHHHHHHhCCcceEEEeccccccCCCCCccccc-chhhHHHHHHHHHHcCcEEEecccceeccc
Confidence 4555533 6788899999999999998888877766533 444444 235899999999999999999999999764
No 43
>smart00642 Aamy Alpha-amylase domain.
Probab=58.34 E-value=38 Score=31.62 Aligned_cols=64 Identities=19% Similarity=0.279 Sum_probs=42.0
Q ss_pred cCCHHHHHHHHHHHHHcCCCeeEEEeeccccC--------CCc--ce-eecCCC---CCHHHHHHHHHHCCCEEEEEeCc
Q psy6580 390 YKNLSHIQSVVDRNVKAGIPLDTVWIDIDYME--------RHN--NF-VLAKPF---YGLKEYVQDLHKEGRHFIPILDP 455 (490)
Q Consensus 390 Y~s~~eV~~vv~~~r~~~IPlDviw~Didym~--------~~~--dF-T~D~~F---pd~~~mv~~Lh~~G~k~vlivdP 455 (490)
..+...+.+-++.+++.| +++||+-.-+.. +|. +| ..+.+| .++++||+.+|++|+++|+=+-|
T Consensus 15 ~G~~~gi~~~l~yl~~lG--~~~I~l~Pi~~~~~~~~~~~gY~~~d~~~i~~~~Gt~~d~~~lv~~~h~~Gi~vilD~V~ 92 (166)
T smart00642 15 GGDLQGIIEKLDYLKDLG--VTAIWLSPIFESPQGYPSYHGYDISDYKQIDPRFGTMEDFKELVDAAHARGIKVILDVVI 92 (166)
T ss_pred CcCHHHHHHHHHHHHHCC--CCEEEECcceeCCCCCCCCCCcCccccCCCCcccCCHHHHHHHHHHHHHCCCEEEEEECC
Confidence 456777887788778876 678877643221 121 11 222344 35789999999999999986643
No 44
>cd04883 ACT_AcuB C-terminal ACT domain of the Bacillus subtilis acetoin utilization protein, AcuB. This CD includes the C-terminal ACT domain of the Bacillus subtilis acetoin utilization protein, AcuB. AcuB is putatively involved in the anaerobic catabolism of acetoin, and related proteins. Studies report the induction of AcuB by nitrate respiration and also by fermentation. Since acetoin can be secreted and later serve as a source of carbon, it has been proposed that, during anaerobic growth when other carbon sources are exhausted, the induction of the AcuB protein results in acetoin catabolism. AcuB-like proteins have two N-terminal tandem CBS domains and a single C-terminal ACT domain. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=52.30 E-value=70 Score=24.56 Aligned_cols=56 Identities=13% Similarity=0.255 Sum_probs=40.2
Q ss_pred HHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEE
Q psy6580 394 SHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFI 450 (490)
Q Consensus 394 ~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~v 450 (490)
..+.++.+-+.++|+-+..++....-..+.....+. +. .++.+.++.|.++|+++.
T Consensus 13 G~l~~i~~~l~~~~inI~~i~~~~~~~~~~~~v~i~v~~-~~~~~~~~~L~~~G~~v~ 69 (72)
T cd04883 13 GQLADIAAIFKDRGVNIVSVLVYPSKEEDNKILVFRVQT-MNPRPIIEDLRRAGYEVL 69 (72)
T ss_pred CHHHHHHHHHHHcCCCEEEEEEeccCCCCeEEEEEEEec-CCHHHHHHHHHHCCCeee
Confidence 367778888999999999987653222344445555 44 567799999999998754
No 45
>TIGR02456 treS_nterm trehalose synthase. Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PubMed:15378530).
Probab=49.95 E-value=87 Score=34.80 Aligned_cols=87 Identities=16% Similarity=0.214 Sum_probs=53.0
Q ss_pred cCCHHHHHHHHHHHHHcCCCeeEEEeeccccC-----CC--cc-eeecCCCC---CHHHHHHHHHHCCCEEEEEeCcccc
Q psy6580 390 YKNLSHIQSVVDRNVKAGIPLDTVWIDIDYME-----RH--NN-FVLAKPFY---GLKEYVQDLHKEGRHFIPILDPGVA 458 (490)
Q Consensus 390 Y~s~~eV~~vv~~~r~~~IPlDviw~Didym~-----~~--~d-FT~D~~Fp---d~~~mv~~Lh~~G~k~vlivdP~I~ 458 (490)
-.+...|.+-++-+++. -+++||+-.-+-. +| .+ |..|.+|. ++++||+.+|++|+++|+=+-+.=.
T Consensus 24 ~Gdl~gi~~~Ldyl~~L--Gv~~i~L~Pi~~~~~~~~gY~~~dy~~vd~~~Gt~~df~~Lv~~ah~~Gi~vilD~V~NH~ 101 (539)
T TIGR02456 24 IGDFPGLTSKLDYLKWL--GVDALWLLPFFQSPLRDDGYDVSDYRAILPEFGTIDDFKDFVDEAHARGMRVIIDLVLNHT 101 (539)
T ss_pred ccCHHHHHHhHHHHHHC--CCCEEEECCCcCCCCCCCCCCcccccccChhhCCHHHHHHHHHHHHHCCCEEEEEeccCcC
Confidence 35677777777777764 4577877643311 11 12 34455563 6789999999999999986644322
Q ss_pred CCCCCCChhhHHHHH------cccEEEcC
Q psy6580 459 SREDSNYLPYVEGVE------KGIFVMNS 481 (490)
Q Consensus 459 ~~~~~~y~~y~eg~e------~g~fvk~~ 481 (490)
.. ..+.|+++.+ .++|+...
T Consensus 102 s~---~~~~f~~~~~~~~~~~~~~~~~~~ 127 (539)
T TIGR02456 102 SD---QHPWFQEARSNPDGPYRDFYVWSD 127 (539)
T ss_pred CC---CCHHHHHHhhCCCCCCCceEEecC
Confidence 11 2456665543 36676543
No 46
>PLN00196 alpha-amylase; Provisional
Probab=49.54 E-value=61 Score=35.11 Aligned_cols=60 Identities=20% Similarity=0.321 Sum_probs=41.9
Q ss_pred CHHHHHHHHHHHHHcCCCeeEEEeecccc----CCC---cceeec-CCCC---CHHHHHHHHHHCCCEEEEEe
Q psy6580 392 NLSHIQSVVDRNVKAGIPLDTVWIDIDYM----ERH---NNFVLA-KPFY---GLKEYVQDLHKEGRHFIPIL 453 (490)
Q Consensus 392 s~~eV~~vv~~~r~~~IPlDviw~Didym----~~~---~dFT~D-~~Fp---d~~~mv~~Lh~~G~k~vliv 453 (490)
....+.+-+..+++.| +++||+-.-+- .+| ..|.+| .+|. ++++||+.+|++|+|+|+=+
T Consensus 42 ~~~~i~~kldyL~~LG--vtaIWL~P~~~s~s~hGY~~~D~y~ld~~~fGt~~elk~Lv~~aH~~GIkVilDv 112 (428)
T PLN00196 42 WYNFLMGKVDDIAAAG--ITHVWLPPPSHSVSEQGYMPGRLYDLDASKYGNEAQLKSLIEAFHGKGVQVIADI 112 (428)
T ss_pred CHHHHHHHHHHHHHcC--CCEEEeCCCCCCCCCCCCCccccCCCCcccCCCHHHHHHHHHHHHHCCCEEEEEE
Confidence 3566777788878776 47788774221 122 225566 5785 68899999999999998855
No 47
>cd06416 GH25_Lys1-like Lys-1 is a lysozyme encoded by the Caenorhabditis elegans lys-1 gene. This gene is one of a several lysozyme genes upregulated upon infection by the Gram-negative bacterial pathogen Serratia marcescens. Lys-1 contains a glycosyl hydrolase family 25 (GH25) catalytic domain. This family also includes Lys-5 from Caenorhabditis elegans.
Probab=47.61 E-value=30 Score=32.84 Aligned_cols=72 Identities=18% Similarity=0.154 Sum_probs=47.4
Q ss_pred ccccccc--cCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CC-CCCHHHHHHHHHHCCCEEEEEeCcc
Q psy6580 383 FHLCRYG--YKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KP-FYGLKEYVQDLHKEGRHFIPILDPG 456 (490)
Q Consensus 383 fhqsr~g--Y~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~-Fpd~~~mv~~Lh~~G~k~vlivdP~ 456 (490)
||-++-. ..-.+++..+++..+..++..+.+|+|+..... .++.+ .. =.-+.+|++.+++.|.+.++...+.
T Consensus 58 Yhf~~~~~~~~~~~Qa~~f~~~~~~~~~~~~~i~lDiE~~~~--~~~~~~~~~~~~~~~f~~~~~~~G~~~~iYt~~~ 133 (196)
T cd06416 58 YFFPCINCCGSAAGQVQTFLQYLKANGIKYGTVWIDIEQNPC--QWSSDVASNCQFLQELVSAAKALGLKVGIYSSQY 133 (196)
T ss_pred EEEecCCCCCCHHHHHHHHHHHHHhCCCceeEEEEEEecCCC--CCcCCHHHHHHHHHHHHHHHHHhCCeEEEEcCcc
Confidence 5555432 233567888888888877777888999886522 12222 10 0235688888888899999887764
No 48
>PRK10933 trehalose-6-phosphate hydrolase; Provisional
Probab=44.86 E-value=49 Score=36.99 Aligned_cols=63 Identities=19% Similarity=0.290 Sum_probs=42.1
Q ss_pred CCHHHHHHHHHHHHHcCCCeeEEEeeccccC-----CC--cc-eeecCCCC---CHHHHHHHHHHCCCEEEEEeCc
Q psy6580 391 KNLSHIQSVVDRNVKAGIPLDTVWIDIDYME-----RH--NN-FVLAKPFY---GLKEYVQDLHKEGRHFIPILDP 455 (490)
Q Consensus 391 ~s~~eV~~vv~~~r~~~IPlDviw~Didym~-----~~--~d-FT~D~~Fp---d~~~mv~~Lh~~G~k~vlivdP 455 (490)
.+...+.+-++.+++.| +++||+-.-+-. +| .+ +..|.+|. ++++||+.+|++|+|+|+=+-+
T Consensus 30 Gdl~gi~~~ldyl~~lG--v~~i~l~P~~~~~~~~~gY~~~d~~~id~~~Gt~~d~~~lv~~~h~~gi~vilD~V~ 103 (551)
T PRK10933 30 GDLRGVTQRLDYLQKLG--VDAIWLTPFYVSPQVDNGYDVANYTAIDPTYGTLDDFDELVAQAKSRGIRIILDMVF 103 (551)
T ss_pred cCHHHHHHhhHHHHhCC--CCEEEECCCCCCCCCCCCCCcccCCCcCcccCCHHHHHHHHHHHHHCCCEEEEEECC
Confidence 46777777788777766 477887543321 12 12 23345553 6889999999999999986644
No 49
>PF08924 DUF1906: Domain of unknown function (DUF1906); InterPro: IPR015020 This entry represents a family of uncharacterised hypothetical bacterial proteins. ; PDB: 1SFS_A.
Probab=43.47 E-value=34 Score=31.08 Aligned_cols=69 Identities=12% Similarity=0.172 Sum_probs=35.8
Q ss_pred CCCcccCCccccccccCCHHHHHHHHHHHHHcCCCe-eEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEEE
Q psy6580 375 LPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPL-DTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFIP 451 (490)
Q Consensus 375 mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPl-Dviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vl 451 (490)
...+|+-|+.|.+ .+-++.+...++.|+|- ..||.|+|| +-.+..++ .--|=++...+.||..|.+.-+
T Consensus 64 ~~~~~~~~~~~G~------~dA~~A~~~A~~lG~p~gt~IYfavD~--d~~~~~~~~~i~~Y~~g~~~~l~~~gY~~Gv 134 (136)
T PF08924_consen 64 ETSDFTYGYAQGV------ADARDAVAAARALGFPAGTPIYFAVDY--DATDAECDSAILPYFRGWNSALGASGYRPGV 134 (136)
T ss_dssp ---S-B--HHHHH------HHHHHHHHHHHHTT--SS-EEEEE--T--S-B-HH-------HHHHHHHHHGGGT-EEEE
T ss_pred cccccccHHHHHH------HHHHHHHHHHHHcCCCCCCEEEEEeec--CCCchhhhhHHHHHHHHHHHHHhhCCCccee
Confidence 3445555555543 67778888999999999 778999983 22233333 3334567888899998877654
No 50
>KOG3111|consensus
Probab=42.01 E-value=60 Score=31.72 Aligned_cols=72 Identities=24% Similarity=0.293 Sum_probs=49.2
Q ss_pred HHHHHHHhcCCCCCCCcccCCccccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHH
Q psy6580 362 VISQYLDLIGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQ 440 (490)
Q Consensus 362 Vi~qYt~LtGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~ 440 (490)
||...+.-+|+++ |.-|..--.+. ++.++.|.++|.-+ |||. +.-.++.++++
T Consensus 53 vV~slR~~~~~~~--------ffD~HmMV~~P---eq~V~~~a~agas~---------------~tfH~E~~q~~~~lv~ 106 (224)
T KOG3111|consen 53 VVESLRKHTGADP--------FFDVHMMVENP---EQWVDQMAKAGASL---------------FTFHYEATQKPAELVE 106 (224)
T ss_pred HHHHHHhccCCCc--------ceeEEEeecCH---HHHHHHHHhcCcce---------------EEEEEeeccCHHHHHH
Confidence 7777777777765 33333332333 33455666665443 6666 55556899999
Q ss_pred HHHHCCCEEEEEeCccccC
Q psy6580 441 DLHKEGRHFIPILDPGVAS 459 (490)
Q Consensus 441 ~Lh~~G~k~vlivdP~I~~ 459 (490)
++|++|+++-+-+.|+-..
T Consensus 107 ~ir~~Gmk~G~alkPgT~V 125 (224)
T KOG3111|consen 107 KIREKGMKVGLALKPGTPV 125 (224)
T ss_pred HHHHcCCeeeEEeCCCCcH
Confidence 9999999999999999765
No 51
>PF10096 DUF2334: Uncharacterized protein conserved in bacteria (DUF2334); InterPro: IPR018763 This group of proteins has no known function.
Probab=39.32 E-value=97 Score=30.79 Aligned_cols=63 Identities=19% Similarity=0.345 Sum_probs=45.8
Q ss_pred cCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEEEEe
Q psy6580 390 YKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFIPIL 453 (490)
Q Consensus 390 Y~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vliv 453 (490)
+.+.+.++++++-+.++|||.-+-|+= .|.+....-+++ ...++.-+.++.+.++|-.+++--
T Consensus 12 ~~~~~~l~~i~d~l~~~~ipf~v~vIP-~~~d~~~~~~~~l~~~~~f~~~L~~~~~~Gg~I~lHG 75 (243)
T PF10096_consen 12 FSDLEKLKEIADYLYKYGIPFSVAVIP-VYVDPNGGITVNLSDNPEFVEYLRYLQARGGEIVLHG 75 (243)
T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEEEe-cccCCCCcccccchhhHHHHHHHHHHHhcCCEEEEEe
Confidence 457889999999999999998877774 455544433555 556666667777778898777643
No 52
>PRK09441 cytoplasmic alpha-amylase; Reviewed
Probab=35.92 E-value=78 Score=34.56 Aligned_cols=59 Identities=15% Similarity=0.300 Sum_probs=38.7
Q ss_pred HHHHHHHHHHHHHcCCCeeEEEeecccc-------CCC--cce----------eecCCCC---CHHHHHHHHHHCCCEEE
Q psy6580 393 LSHIQSVVDRNVKAGIPLDTVWIDIDYM-------ERH--NNF----------VLAKPFY---GLKEYVQDLHKEGRHFI 450 (490)
Q Consensus 393 ~~eV~~vv~~~r~~~IPlDviw~Didym-------~~~--~dF----------T~D~~Fp---d~~~mv~~Lh~~G~k~v 450 (490)
...|.+-++-+++.| +++||+-.-+. .+| .+| +.|.+|. ++++||+.+|++|+|+|
T Consensus 21 ~~~I~~kldyl~~LG--vtaIwl~P~~~~~~~~~~hgY~~~D~~~~~~~~~~~~id~~fGt~~dl~~Li~~~H~~Gi~vi 98 (479)
T PRK09441 21 WNRLAERAPELAEAG--ITAVWLPPAYKGTSGGYDVGYGVYDLFDLGEFDQKGTVRTKYGTKEELLNAIDALHENGIKVY 98 (479)
T ss_pred HHHHHHHHHHHHHcC--CCEEEeCCCccCCCCCCCCCCCeecccccccccccCCcCcCcCCHHHHHHHHHHHHHCCCEEE
Confidence 455666677666654 67888874322 122 122 3455663 68899999999999988
Q ss_pred EEe
Q psy6580 451 PIL 453 (490)
Q Consensus 451 liv 453 (490)
+=+
T Consensus 99 ~D~ 101 (479)
T PRK09441 99 ADV 101 (479)
T ss_pred EEE
Confidence 755
No 53
>PF00128 Alpha-amylase: Alpha amylase, catalytic domain; InterPro: IPR006047 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain, such as alpha-amylase, belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. The maltogenic alpha-amylase is an enzyme which catalyses hydrolysis of (1-4)-alpha-D-glucosidic linkages in polysaccharides so as to remove successive alpha-maltose residues from the non-reducing ends of the chains in the conversion of starch to maltose. Other enzymes include neopullulanase, which hydrolyses pullulan to panose, and cyclomaltodextrinase, which hydrolyses cyclodextrins. This entry represents the catalytic domain found in several protein members of this family. It has a structure consisting of an 8 stranded alpha/beta barrel that contains the active site, interrupted by a ~70 amino acid calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0043169 cation binding, 0005975 carbohydrate metabolic process; PDB: 3FAX_A 3FAW_A 2DH3_B 2DH2_A 1CIU_A 1A47_A 3BMW_A 3BMV_A 2FH8_A 2FH6_A ....
Probab=33.86 E-value=43 Score=32.81 Aligned_cols=61 Identities=21% Similarity=0.381 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHHcCCCeeEEEeecccc-----CCCc--ce-eecCCCC---CHHHHHHHHHHCCCEEEEEeCc
Q psy6580 393 LSHIQSVVDRNVKAGIPLDTVWIDIDYM-----ERHN--NF-VLAKPFY---GLKEYVQDLHKEGRHFIPILDP 455 (490)
Q Consensus 393 ~~eV~~vv~~~r~~~IPlDviw~Didym-----~~~~--dF-T~D~~Fp---d~~~mv~~Lh~~G~k~vlivdP 455 (490)
..-+.+-++.+++.| +++||+-.-+- .+|. +| ..|.+|. ++++||+.+|++|+++|+=+-|
T Consensus 3 ~~gi~~kLdyl~~lG--v~~I~l~Pi~~~~~~~~gY~~~d~~~vd~~~Gt~~d~~~Lv~~~h~~gi~VilD~V~ 74 (316)
T PF00128_consen 3 FRGIIDKLDYLKDLG--VNAIWLSPIFESPNGYHGYDPSDYYAVDPRFGTMEDFKELVDAAHKRGIKVILDVVP 74 (316)
T ss_dssp HHHHHHTHHHHHHHT--ESEEEESS-EESSSSTTTTSESEEEEESTTTBHHHHHHHHHHHHHHTTCEEEEEEET
T ss_pred HHHHHHhhHHHHHcC--CCceecccccccccccccccceeeeccccccchhhhhhhhhhccccccceEEEeeec
Confidence 344555566666654 56777753221 2232 33 4556774 6889999999999999987765
No 54
>PRK08883 ribulose-phosphate 3-epimerase; Provisional
Probab=33.81 E-value=67 Score=31.56 Aligned_cols=35 Identities=11% Similarity=0.131 Sum_probs=27.8
Q ss_pred ceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccccC
Q psy6580 425 NFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGVAS 459 (490)
Q Consensus 425 dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~ 459 (490)
-.|+. +.=+++.++++.+|+.|++..+-++|.-..
T Consensus 84 ~i~~H~Ea~~~~~~~l~~ik~~g~k~GlalnP~Tp~ 119 (220)
T PRK08883 84 MITFHVEASEHVDRTLQLIKEHGCQAGVVLNPATPL 119 (220)
T ss_pred EEEEcccCcccHHHHHHHHHHcCCcEEEEeCCCCCH
Confidence 45666 544678899999999999999999998543
No 55
>PF01373 Glyco_hydro_14: Glycosyl hydrolase family 14; InterPro: IPR001554 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 14 GH14 from CAZY comprises enzymes with only one known activity; beta-amylase (3.2.1.2 from EC). A Glu residue has been proposed as a catalytic residue, but it is not known if it is the nucleophile or the proton donor. Beta-amylase [, ] is an enzyme that hydrolyses 1,4-alpha-glucosidic linkages in starch-type polysaccharide substrates so as to remove successive maltose units from the non-reducing ends of the chains. Beta-amylase is present in certain bacteria as well as in plants. Three highly conserved sequence regions are found in all known beta-amylases. The first of these regions is located in the N-terminal section of the enzymes and contains an aspartate which is known [] to be involved in the catalytic mechanism. The second, located in a more central location, is centred around a glutamate which is also involved [] in the catalytic mechanism. The 3D structure of a complex of soybean beta-amylase with an inhibitor (alpha-cyclodextrin) has been determined to 3.0A resolution by X-ray diffraction []. The enzyme folds into large and small domains: the large domain has a (beta alpha)8 super-secondary structural core, while the smaller is formed from two long loops extending from the beta-3 and beta-4 strands of the (beta alpha)8 fold []. The interface of the two domains, together with shorter loops from the (beta alpha)8 core, form a deep cleft, in which the inhibitor binds []. Two maltose molecules also bind in the cleft, one sharing a binding site with alpha-cyclodextrin, and the other sitting more deeply in the cleft [].; GO: 0016161 beta-amylase activity, 0000272 polysaccharide catabolic process; PDB: 1FA2_A 2DQX_A 1WDP_A 1UKP_C 1BYC_A 1BYA_A 1Q6C_A 1V3I_A 1BTC_A 1BYB_A ....
Probab=33.59 E-value=1.7e+02 Score=31.63 Aligned_cols=85 Identities=16% Similarity=0.256 Sum_probs=52.3
Q ss_pred HHHHHHHHHHHHcCCCeeEEEeecccc--CCC--cceeecCCCCCHHHHHHHHHHCCCEEEEEeCccccCCCCC-----C
Q psy6580 394 SHIQSVVDRNVKAGIPLDTVWIDIDYM--ERH--NNFVLAKPFYGLKEYVQDLHKEGRHFIPILDPGVASREDS-----N 464 (490)
Q Consensus 394 ~eV~~vv~~~r~~~IPlDviw~Didym--~~~--~dFT~D~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~~~~~-----~ 464 (490)
..++.-+++++..| +|+|-+|+-|- ++. +.|.|. .-+++++-+++.|+|+.+++-=+-...+.| .
T Consensus 16 ~~~~~~L~~LK~~G--V~GVmvdvWWGiVE~~~p~~ydWs----~Y~~l~~~vr~~GLk~~~vmsfH~cGgNvgD~~~Ip 89 (402)
T PF01373_consen 16 NALEAQLRALKSAG--VDGVMVDVWWGIVEGEGPQQYDWS----GYRELFEMVRDAGLKLQVVMSFHQCGGNVGDDCNIP 89 (402)
T ss_dssp HHHHHHHHHHHHTT--EEEEEEEEEHHHHTGSSTTB---H----HHHHHHHHHHHTT-EEEEEEE-S-BSSSTTSSSEB-
T ss_pred HHHHHHHHHHHHcC--CcEEEEEeEeeeeccCCCCccCcH----HHHHHHHHHHHcCCeEEEEEeeecCCCCCCCccCCc
Confidence 47888888888876 78888887653 222 334443 567888889999999999883221110001 0
Q ss_pred Ch--hhHHHHHcccEEEcCCCC
Q psy6580 465 YL--PYVEGVEKGIFVMNSSGL 484 (490)
Q Consensus 465 y~--~y~eg~e~g~fvk~~~G~ 484 (490)
-| ..+.+.+.++|.++..|.
T Consensus 90 LP~Wv~~~~~~~di~ytd~~G~ 111 (402)
T PF01373_consen 90 LPSWVWEIGKKDDIFYTDRSGN 111 (402)
T ss_dssp S-HHHHHHHHHSGGEEE-TTS-
T ss_pred CCHHHHhccccCCcEEECCCCC
Confidence 12 356778889999999885
No 56
>PF12138 Spherulin4: Spherulation-specific family 4; InterPro: IPR021986 This protein is found in bacteria, archaea and eukaryotes. Proteins in this family are typically between 250 and 398 amino acids in length. There is a conserved NPG sequence motif and there are two completely conserved G residues that may be functionally important. Starvation will often induce spherulation - the production of spores - and this process may involve DNA-methylation. Changes in the methylation of spherulin4 are associated with the formation of spherules, but these changes are probably transient. Methylation of the gene accompanies its transcriptional activation, and spherulin4 mRNA is only detectable in late spherulating cultures and mature spherules. It is a spherulation-specific protein.
Probab=32.71 E-value=57 Score=32.81 Aligned_cols=75 Identities=21% Similarity=0.368 Sum_probs=53.5
Q ss_pred cccCCccccccccCCHHHHHHHHHHHHHc-------CCCeeEEEeeccccCCCcceeecCCCCCHHHHHHHHHH---CCC
Q psy6580 378 YWSLGFHLCRYGYKNLSHIQSVVDRNVKA-------GIPLDTVWIDIDYMERHNNFVLAKPFYGLKEYVQDLHK---EGR 447 (490)
Q Consensus 378 ~WaLGfhqsr~gY~s~~eV~~vv~~~r~~-------~IPlDviw~Didym~~~~dFT~D~~Fpd~~~mv~~Lh~---~G~ 447 (490)
.=.|||--+.||-++.++|++-+++|..- ++.+|+|++|= + -.+. ...+-++.+.+.+|+ .+-
T Consensus 68 v~vlGYV~T~Yg~R~~~~V~~dI~~Y~~W~~~~~~~~~~vdGIFfDE--~--p~~~---~~~~y~~~l~~~vk~~~~~~~ 140 (253)
T PF12138_consen 68 VRVLGYVHTSYGSRPLSEVKADIDTYASWYGQSEDYGYRVDGIFFDE--A--PNDY---ANLPYYQNLYNYVKSAFGLGG 140 (253)
T ss_pred CcEEEEEEccccCCCHHHHHHHHHHHhhccccccCCCcccceEEEec--C--CCcH---HHHHHHHHHHHHHHhccccCC
Confidence 45689999999999999999999999987 69999999992 0 0000 223334555555666 445
Q ss_pred EEEEEeCccccC
Q psy6580 448 HFIPILDPGVAS 459 (490)
Q Consensus 448 k~vlivdP~I~~ 459 (490)
.-.++.+|+...
T Consensus 141 ~~~VV~NPGt~~ 152 (253)
T PF12138_consen 141 DGLVVLNPGTAV 152 (253)
T ss_pred CCEEEeCCCCCC
Confidence 566678999844
No 57
>PRK12563 sulfate adenylyltransferase subunit 2; Provisional
Probab=32.52 E-value=1.3e+02 Score=31.35 Aligned_cols=73 Identities=12% Similarity=0.201 Sum_probs=49.2
Q ss_pred HHHHHHHHHhcCCCCCCCcccCCccccccccCCHHHHHHHHHHH-HHcCCCeeEEEeeccccCCCcceeecCCCCCHHHH
Q psy6580 360 GDVISQYLDLIGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRN-VKAGIPLDTVWIDIDYMERHNNFVLAKPFYGLKEY 438 (490)
Q Consensus 360 ~~Vi~qYt~LtGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~-r~~~IPlDviw~Didym~~~~dFT~D~~Fpd~~~m 438 (490)
.++++...+..++|.| .||+| ++...+..++.+. ...++|+.++++|-. -.||..-+|
T Consensus 27 i~ilrea~~~f~~~~v--~~SgG--------KDS~VlLhLa~kaf~~~~~~~pvl~VDTG-----------~~FpEt~ef 85 (312)
T PRK12563 27 IHILREVVAECSKPVM--LYSIG--------KDSVVMLHLAMKAFRPTRPPFPLLHVDTT-----------WKFREMIDF 85 (312)
T ss_pred HHHHHHHHHhcCCcEE--EecCC--------hHHHHHHHHHHHhhcccCCCeeEEEeCCC-----------CCCHHHHHH
Confidence 4455555444566544 67776 4455677777664 345789999999843 368888899
Q ss_pred HHHHHH-CCCEEEEEe
Q psy6580 439 VQDLHK-EGRHFIPIL 453 (490)
Q Consensus 439 v~~Lh~-~G~k~vliv 453 (490)
++++.+ .|+.+++..
T Consensus 86 rD~~a~~~gl~Liv~~ 101 (312)
T PRK12563 86 RDRRAKELGLDLVVHH 101 (312)
T ss_pred HHHHHHHhCCcEEEec
Confidence 998854 688877654
No 58
>PLN02645 phosphoglycolate phosphatase
Probab=32.52 E-value=1.4e+02 Score=30.60 Aligned_cols=51 Identities=20% Similarity=0.338 Sum_probs=38.4
Q ss_pred CHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEEEEeC
Q psy6580 392 NLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFIPILD 454 (490)
Q Consensus 392 s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivd 454 (490)
+.+++.+++++ .+++.+|+| +.+--+ .-||+..+.++.|+++|+++++..+
T Consensus 17 ~~~~~~~~~~~-------~~~~~~D~D-----Gtl~~~~~~~~ga~e~l~~lr~~g~~~~~~TN 68 (311)
T PLN02645 17 TLENADELIDS-------VETFIFDCD-----GVIWKGDKLIEGVPETLDMLRSMGKKLVFVTN 68 (311)
T ss_pred CHHHHHHHHHh-------CCEEEEeCc-----CCeEeCCccCcCHHHHHHHHHHCCCEEEEEeC
Confidence 34566666664 679999987 333212 6789999999999999999999765
No 59
>PRK15447 putative protease; Provisional
Probab=32.22 E-value=78 Score=32.47 Aligned_cols=57 Identities=11% Similarity=0.215 Sum_probs=42.1
Q ss_pred HHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeecCCCCCHHHHHHHHHHCCCEEEEEeCccc
Q psy6580 394 SHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLAKPFYGLKEYVQDLHKEGRHFIPILDPGV 457 (490)
Q Consensus 394 ~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D~~Fpd~~~mv~~Lh~~G~k~vlivdP~I 457 (490)
..++.+...+.+.| .|+||+...-...+..|+. .++++.++.+|+.|.++.+.+ |.|
T Consensus 15 ~~~~~~~~~~~~~g--aDaVY~g~~~~~~R~~f~~----~~l~e~v~~~~~~gkkvyva~-p~i 71 (301)
T PRK15447 15 ETVRDFYQRAADSP--VDIVYLGETVCSKRRELKV----GDWLELAERLAAAGKEVVLST-LAL 71 (301)
T ss_pred CCHHHHHHHHHcCC--CCEEEECCccCCCccCCCH----HHHHHHHHHHHHcCCEEEEEe-ccc
Confidence 34556666666776 9999999766665555654 378899999999999987755 555
No 60
>PF10566 Glyco_hydro_97: Glycoside hydrolase 97 ; InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=31.69 E-value=1.5e+02 Score=30.40 Aligned_cols=64 Identities=17% Similarity=0.190 Sum_probs=43.0
Q ss_pred CCHHHHHHHHHHHHHcCCCeeEEEeeccccCC----CcceeecCCCCCHHHHHHHHHHCCCEEEEEeCcc
Q psy6580 391 KNLSHIQSVVDRNVKAGIPLDTVWIDIDYMER----HNNFVLAKPFYGLKEYVQDLHKEGRHFIPILDPG 456 (490)
Q Consensus 391 ~s~~eV~~vv~~~r~~~IPlDviw~Didym~~----~~dFT~D~~Fpd~~~mv~~Lh~~G~k~vlivdP~ 456 (490)
.+.+..+..++--.++||+.-.| |..|-.. ..+|+-...-.+++++++.-+++|+.+.+|.+-.
T Consensus 29 ~~t~~~k~yIDfAa~~G~eYvlv--D~GW~~~~~~~~~d~~~~~~~~dl~elv~Ya~~KgVgi~lw~~~~ 96 (273)
T PF10566_consen 29 ATTETQKRYIDFAAEMGIEYVLV--DAGWYGWEKDDDFDFTKPIPDFDLPELVDYAKEKGVGIWLWYHSE 96 (273)
T ss_dssp SSHHHHHHHHHHHHHTT-SEEEE--BTTCCGS--TTT--TT-B-TT--HHHHHHHHHHTT-EEEEEEECC
T ss_pred CCHHHHHHHHHHHHHcCCCEEEe--ccccccccccccccccccCCccCHHHHHHHHHHcCCCEEEEEeCC
Confidence 46788899999999999998666 8777531 2222211223579999999999999999998643
No 61
>TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB. This model describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reducted sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterized polysaccharide deacetylase; the exact function this protein family is unknown.
Probab=31.54 E-value=1.7e+02 Score=27.36 Aligned_cols=19 Identities=26% Similarity=0.607 Sum_probs=12.7
Q ss_pred CHHHHHHHHHHCCCEEEEE
Q psy6580 434 GLKEYVQDLHKEGRHFIPI 452 (490)
Q Consensus 434 d~~~mv~~Lh~~G~k~vli 452 (490)
-++.+++.|+++|.+++.+
T Consensus 169 ~l~~~i~~l~~~Gy~~vtl 187 (191)
T TIGR02764 169 ALPTIIKKLKEKGYEFVTI 187 (191)
T ss_pred HHHHHHHHHHHCCCEEEEH
Confidence 3566777777777777653
No 62
>TIGR02401 trehalose_TreY malto-oligosyltrehalose synthase. This enzyme, formally named (1-4)-alpha-D-glucan 1-alpha-D-glucosylmutase, is the TreY enzyme of the TreYZ pathway of trehalose biosynthesis, an alternative to the OtsAB pathway. Trehalose may be incorporated into more complex compounds but is best known as compatible solute. It is one of the most effective osmoprotectants, and unlike the various betaines does not require nitrogen for its synthesis.
Probab=31.32 E-value=1.9e+02 Score=34.26 Aligned_cols=81 Identities=7% Similarity=0.092 Sum_probs=51.5
Q ss_pred CHHHHHHHHHHHHHcCCCeeEEEeecccc------CCCc--c-eeecCCCC---CHHHHHHHHHHCCCEEEEEeCccccC
Q psy6580 392 NLSHIQSVVDRNVKAGIPLDTVWIDIDYM------ERHN--N-FVLAKPFY---GLKEYVQDLHKEGRHFIPILDPGVAS 459 (490)
Q Consensus 392 s~~eV~~vv~~~r~~~IPlDviw~Didym------~~~~--d-FT~D~~Fp---d~~~mv~~Lh~~G~k~vlivdP~I~~ 459 (490)
+.+.+.+.+.-+++.|| ++||+-.-+. .+|. | +..|..|. +++.|++.+|++|+++|+=+=|.=..
T Consensus 14 tf~~~~~~L~YL~~LGv--~~V~lsPi~~a~~gs~hGYdv~D~~~idp~lGt~edf~~Lv~aah~~Gm~vIlDiVpNH~a 91 (825)
T TIGR02401 14 TFDDAAALLPYLKSLGV--SHLYLSPILTAVPGSTHGYDVVDHSEINPELGGEEGLRRLSEAARARGLGLIVDIVPNHMA 91 (825)
T ss_pred CHHHHHHhhHHHHHcCC--CEEEeCcCccCCCCCCCCCCCCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEEEecccccc
Confidence 56778888888888886 7887774332 1221 2 22344553 57899999999999999976553322
Q ss_pred CCCCCChhhHHHHHc
Q psy6580 460 REDSNYLPYVEGVEK 474 (490)
Q Consensus 460 ~~~~~y~~y~eg~e~ 474 (490)
......+.|.+.++.
T Consensus 92 ~~~~~n~wf~dvl~~ 106 (825)
T TIGR02401 92 VHLEQNPWWWDVLKN 106 (825)
T ss_pred cccccChHHHHHHHh
Confidence 221124667766654
No 63
>PRK10785 maltodextrin glucosidase; Provisional
Probab=31.15 E-value=1e+02 Score=34.85 Aligned_cols=63 Identities=10% Similarity=0.084 Sum_probs=43.3
Q ss_pred CCHHHHHHHHHHHHHcCCCeeEEEeeccccC----CC--cc-eeecCCCC---CHHHHHHHHHHCCCEEEEEeCc
Q psy6580 391 KNLSHIQSVVDRNVKAGIPLDTVWIDIDYME----RH--NN-FVLAKPFY---GLKEYVQDLHKEGRHFIPILDP 455 (490)
Q Consensus 391 ~s~~eV~~vv~~~r~~~IPlDviw~Didym~----~~--~d-FT~D~~Fp---d~~~mv~~Lh~~G~k~vlivdP 455 (490)
.+..-|.+-++-+++.| +++||+-.=+-. +| .| |..|.+|. ++++||+.+|++|+|+|+=+-+
T Consensus 176 GDl~GI~~kLdYL~~LG--v~~I~L~Pif~s~s~hgYd~~Dy~~iDp~~Gt~~df~~Lv~~aH~rGikVilD~V~ 248 (598)
T PRK10785 176 GDLDGISEKLPYLKKLG--VTALYLNPIFTAPSVHKYDTEDYRHVDPQLGGDAALLRLRHATQQRGMRLVLDGVF 248 (598)
T ss_pred cCHHHHHHHHHHHHHcC--CCEEEeCCcccCCCCCCcCcccccccCcccCCHHHHHHHHHHHHHCCCEEEEEECC
Confidence 57788888888888765 578888743221 11 12 23445554 6789999999999999986643
No 64
>COG0036 Rpe Pentose-5-phosphate-3-epimerase [Carbohydrate transport and metabolism]
Probab=30.89 E-value=86 Score=31.04 Aligned_cols=35 Identities=11% Similarity=0.115 Sum_probs=30.6
Q ss_pred ceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccccC
Q psy6580 425 NFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGVAS 459 (490)
Q Consensus 425 dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~ 459 (490)
..|+. +..+++.+.++.+++.|.|.-+.++|+-..
T Consensus 87 ~It~H~E~~~~~~r~i~~Ik~~G~kaGv~lnP~Tp~ 122 (220)
T COG0036 87 IITFHAEATEHIHRTIQLIKELGVKAGLVLNPATPL 122 (220)
T ss_pred EEEEEeccCcCHHHHHHHHHHcCCeEEEEECCCCCH
Confidence 45777 888899999999999999999999998664
No 65
>PF13344 Hydrolase_6: Haloacid dehalogenase-like hydrolase; PDB: 2HO4_B 1YV9_A 1WVI_B 3EPR_A 2P27_A 2OYC_A 2CFT_A 2P69_A 2CFS_A 2CFR_A ....
Probab=30.34 E-value=95 Score=26.39 Aligned_cols=27 Identities=19% Similarity=0.386 Sum_probs=20.9
Q ss_pred CCCCCHHHHHHHHHHCCCEEEEEeCcc
Q psy6580 430 KPFYGLKEYVQDLHKEGRHFIPILDPG 456 (490)
Q Consensus 430 ~~Fpd~~~mv~~Lh~~G~k~vlivdP~ 456 (490)
+-+|+..++++.|+++|.+++++.+-.
T Consensus 14 ~~ipga~e~l~~L~~~g~~~~~lTNns 40 (101)
T PF13344_consen 14 EPIPGAVEALDALRERGKPVVFLTNNS 40 (101)
T ss_dssp EE-TTHHHHHHHHHHTTSEEEEEES-S
T ss_pred CcCcCHHHHHHHHHHcCCCEEEEeCCC
Confidence 667899999999999999988876543
No 66
>TIGR00559 pdxJ pyridoxine 5'-phosphate synthase. PdxJ is required in the biosynthesis of pyridoxine (vitamin B6), a precursor to the enzyme cofactor pyridoxal phosphate. ECOCYC describes the predicted reaction equation as 1-amino-propan-2-one-3-phosphate + deoxyxylulose-5-phosphate = pyridoxine-5'-phosphate. The product of that reaction is oxidized by PdxH to pyridoxal 5'-phosphate.
Probab=29.62 E-value=44 Score=33.42 Aligned_cols=29 Identities=21% Similarity=0.205 Sum_probs=23.8
Q ss_pred CCCCCHHHHHHHHHHCCCEEEEEeCcccc
Q psy6580 430 KPFYGLKEYVQDLHKEGRHFIPILDPGVA 458 (490)
Q Consensus 430 ~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~ 458 (490)
..+..+...++.||+.|+++.+.+||-..
T Consensus 107 ~~~~~l~~~i~~l~~~gI~VSLFiDP~~~ 135 (237)
T TIGR00559 107 RLKDKLCELVKRFHAAGIEVSLFIDADKD 135 (237)
T ss_pred hCHHHHHHHHHHHHHCCCEEEEEeCCCHH
Confidence 45667888899999999999999998754
No 67
>PRK05265 pyridoxine 5'-phosphate synthase; Provisional
Probab=28.76 E-value=45 Score=33.37 Aligned_cols=29 Identities=24% Similarity=0.337 Sum_probs=24.7
Q ss_pred CCCCCHHHHHHHHHHCCCEEEEEeCcccc
Q psy6580 430 KPFYGLKEYVQDLHKEGRHFIPILDPGVA 458 (490)
Q Consensus 430 ~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~ 458 (490)
..+..++.+++.||+.|+++.+.|||-..
T Consensus 110 ~~~~~l~~~i~~L~~~gIrVSLFidP~~~ 138 (239)
T PRK05265 110 GQFDKLKPAIARLKDAGIRVSLFIDPDPE 138 (239)
T ss_pred cCHHHHHHHHHHHHHCCCEEEEEeCCCHH
Confidence 45667889999999999999999999764
No 68
>TIGR02403 trehalose_treC alpha,alpha-phosphotrehalase. Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
Probab=28.35 E-value=1.9e+02 Score=32.21 Aligned_cols=77 Identities=17% Similarity=0.248 Sum_probs=48.3
Q ss_pred CCHHHHHHHHHHHHHcCCCeeEEEeeccccC-----CC--cc-eeecCCCC---CHHHHHHHHHHCCCEEEEEeCccccC
Q psy6580 391 KNLSHIQSVVDRNVKAGIPLDTVWIDIDYME-----RH--NN-FVLAKPFY---GLKEYVQDLHKEGRHFIPILDPGVAS 459 (490)
Q Consensus 391 ~s~~eV~~vv~~~r~~~IPlDviw~Didym~-----~~--~d-FT~D~~Fp---d~~~mv~~Lh~~G~k~vlivdP~I~~ 459 (490)
.+...+.+-++.+++.| +++||+-.-+.. +| .+ |.+|.+|. ++++||+.+|++|+++|+=+-|.=..
T Consensus 24 G~~~gi~~~l~yl~~lG--~~~i~l~Pi~~~~~~~~gY~~~d~~~id~~~Gt~~~~~~lv~~ah~~gi~vilD~v~NH~~ 101 (543)
T TIGR02403 24 GDLRGIIEKLDYLKKLG--VDYIWLNPFYVSPQKDNGYDVSDYYAINPLFGTMADFEELVSEAKKRNIKIMLDMVFNHTS 101 (543)
T ss_pred cCHHHHHHhHHHHHHcC--CCEEEECCcccCCCCCCCCCccccCccCcccCCHHHHHHHHHHHHHCCCEEEEEECccccc
Confidence 56677777777777654 577888754432 22 12 33444553 67899999999999999876543221
Q ss_pred CCCCCChhhHHHH
Q psy6580 460 REDSNYLPYVEGV 472 (490)
Q Consensus 460 ~~~~~y~~y~eg~ 472 (490)
. .++.|.++.
T Consensus 102 ~---~~~~f~~~~ 111 (543)
T TIGR02403 102 T---EHEWFKKAL 111 (543)
T ss_pred c---chHHHHHhh
Confidence 1 245565554
No 69
>PF02638 DUF187: Glycosyl hydrolase like GH101; InterPro: IPR003790 This entry describes proteins of unknown function.
Probab=27.86 E-value=1.9e+02 Score=29.80 Aligned_cols=65 Identities=17% Similarity=0.256 Sum_probs=44.4
Q ss_pred CHHHHHHHHHHHHHcCCCeeEEEeeccccC------CCc----ceeec----CCCCCHHHHHHHHHHCCCEEEEEeCccc
Q psy6580 392 NLSHIQSVVDRNVKAGIPLDTVWIDIDYME------RHN----NFVLA----KPFYGLKEYVQDLHKEGRHFIPILDPGV 457 (490)
Q Consensus 392 s~~eV~~vv~~~r~~~IPlDviw~Didym~------~~~----dFT~D----~~Fpd~~~mv~~Lh~~G~k~vlivdP~I 457 (490)
+.+++.++++.++++| +-+|+.+.-... ... ..+-. ..|.=++.||++-|++|+++-.|+..++
T Consensus 17 ~~~~~~~~l~~l~~~~--~N~V~~qVr~~gda~Y~S~~~p~s~~~~g~~~~~pg~DpL~~~I~eaHkrGlevHAW~~~~~ 94 (311)
T PF02638_consen 17 SKEQIDEMLDDLKSAG--FNAVFVQVRPRGDALYPSDIEPWSGYLTGKQGKDPGFDPLEFMIEEAHKRGLEVHAWFRVGF 94 (311)
T ss_pred CHHHHHHHHHHHHHcC--CCEEEEEEEeCcEEEecccccccccccCCCCCCCCCccHHHHHHHHHHHcCCEEEEEEEeec
Confidence 6789999999999999 667777653321 111 11111 1233478999999999999999994444
Q ss_pred c
Q psy6580 458 A 458 (490)
Q Consensus 458 ~ 458 (490)
.
T Consensus 95 ~ 95 (311)
T PF02638_consen 95 N 95 (311)
T ss_pred C
Confidence 3
No 70
>PF04468 PSP1: PSP1 C-terminal conserved region; InterPro: IPR007557 The yeast polymerase suppressor 1 (PSP1) protein partially suppresses mutations in DNA polymerases alpha and delta []. The C-terminal half of PSP1 contains a domain, which is also found in several hypothetical proteins from both eukaryotic and prokaryotic sources: Crithidia fasciculata RBP45 and RBP33, subunits of the cycling sequence binding protein (CSBP) II. RBP45 and RBP33 proteins bind specifically to the cycling sequences present in several mRNAs that accumulate periodically during the cell cycle. RBP45 and RBP33 are phosphoproteins, which are phosphorylated differentially during progression through the cell cycle. Hypothetical proteins with high sequence similarity have been identified in other kinetoplastid organisms []. Bacillus subtilis yaaT protein, which plays a significant role in phosphorelay during initiation of sporulation. It is possible that the yaaT protein is also related to DNA replication. The sequence of the yaaT protein is widely conserved in prokaryotes (bacteria and archaea), but the functions of the protein are unknown []. The actual biological significance of the PSP1 C-terminal domain has not yet been clearly established.
Probab=27.82 E-value=2.2e+02 Score=23.80 Aligned_cols=50 Identities=22% Similarity=0.404 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHcCCCeeEEEeeccccC--CCcceeec-CCCCCHHHHHHHHHHC
Q psy6580 394 SHIQSVVDRNVKAGIPLDTVWIDIDYME--RHNNFVLA-KPFYGLKEYVQDLHKE 445 (490)
Q Consensus 394 ~eV~~vv~~~r~~~IPlDviw~Didym~--~~~dFT~D-~~Fpd~~~mv~~Lh~~ 445 (490)
+.+...-+..+++++|+..+ |..|+- ..-.|-|. +...|..++|+.|+..
T Consensus 26 ~al~~c~~~~~~~~L~m~lv--d~e~~~D~~k~~fyy~a~~rvDFR~Lvr~L~~~ 78 (88)
T PF04468_consen 26 EALKFCRELVKELGLPMKLV--DVEYQFDGSKLTFYYTAESRVDFRELVRDLARE 78 (88)
T ss_pred HHHHHHHHHHHHcCCCeEEE--EEEEEcCCCEEEEEEEeCCcCcHHHHHHHHHHH
Confidence 34445556677899877766 555553 33466677 8889999999999863
No 71
>cd00003 PNPsynthase Pyridoxine 5'-phosphate (PNP) synthase domain; pyridoxal 5'-phosphate is the active form of vitamin B6 that acts as an essential, ubiquitous coenzyme in amino acid metabolism. In bacteria, formation of pyridoxine 5'-phosphate is a step in the biosynthesis of vitamin B6. PNP synthase, a homooctameric enzyme, catalyzes the final step in PNP biosynthesis, the condensation of 1-amino-acetone 3-phosphate and 1-deoxy-D-xylulose 5-phosphate. PNP synthase adopts a TIM barrel topology, intersubunit contacts are mediated by three ''extra'' helices, generating a tetramer of symmetric dimers with shared active sites; the open state has been proposed to accept substrates and to release products, while most of the catalytic events are likely to occur in the closed state; a hydrophilic channel running through the center of the barrel was identified as the essential structural feature that enables PNP synthase to release water molecules produced during the reaction from the closed,
Probab=27.66 E-value=48 Score=33.08 Aligned_cols=29 Identities=21% Similarity=0.271 Sum_probs=24.2
Q ss_pred CCCCCHHHHHHHHHHCCCEEEEEeCcccc
Q psy6580 430 KPFYGLKEYVQDLHKEGRHFIPILDPGVA 458 (490)
Q Consensus 430 ~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~ 458 (490)
..+..+..+++.||+.|+++.+.+||-..
T Consensus 107 ~~~~~l~~~i~~l~~~gI~VSLFiDPd~~ 135 (234)
T cd00003 107 GQAEKLKPIIERLKDAGIRVSLFIDPDPE 135 (234)
T ss_pred cCHHHHHHHHHHHHHCCCEEEEEeCCCHH
Confidence 45667888999999999999999998764
No 72
>COG0366 AmyA Glycosidases [Carbohydrate transport and metabolism]
Probab=27.48 E-value=2.1e+02 Score=30.53 Aligned_cols=78 Identities=23% Similarity=0.318 Sum_probs=47.3
Q ss_pred cCCHHHHHHHHHHHHHcCCCeeEEEeeccccC---CC----ccee-ecCCCC---CHHHHHHHHHHCCCEEEEEeCcccc
Q psy6580 390 YKNLSHIQSVVDRNVKAGIPLDTVWIDIDYME---RH----NNFV-LAKPFY---GLKEYVQDLHKEGRHFIPILDPGVA 458 (490)
Q Consensus 390 Y~s~~eV~~vv~~~r~~~IPlDviw~Didym~---~~----~dFT-~D~~Fp---d~~~mv~~Lh~~G~k~vlivdP~I~ 458 (490)
-.+..-|.+-++-+++.| +++||+-.-+-. .. .+|+ .|..|. ++.++++.+|++|+++|+=+-.--.
T Consensus 25 ~Gdl~Gi~~~LdYl~~LG--v~aiwl~Pi~~s~~~~~gY~~~Dy~~id~~~Gt~~d~~~li~~~H~~gi~vi~D~V~NH~ 102 (505)
T COG0366 25 GGDLKGITEKLDYLKELG--VDAIWLSPIFESPQADHGYDVSDYTKVDPHFGTEEDFKELVEEAHKRGIKVILDLVFNHT 102 (505)
T ss_pred cccHHhHHHhhhHHHHhC--CCEEEeCCCCCCCccCCCccccchhhcCcccCCHHHHHHHHHHHHHCCCEEEEEeccCcC
Confidence 345666666677777665 688988854432 11 2222 224453 6789999999999999986642222
Q ss_pred CCCCCCChhhHHHH
Q psy6580 459 SREDSNYLPYVEGV 472 (490)
Q Consensus 459 ~~~~~~y~~y~eg~ 472 (490)
. ..++.+.+..
T Consensus 103 s---~~~~~f~~~~ 113 (505)
T COG0366 103 S---DEHPWFKEAR 113 (505)
T ss_pred C---CccHHHHHHh
Confidence 1 1255565555
No 73
>PLN02361 alpha-amylase
Probab=27.25 E-value=1.4e+02 Score=32.26 Aligned_cols=58 Identities=14% Similarity=0.223 Sum_probs=39.5
Q ss_pred HHHHHHHHHHHHcCCCeeEEEeecccc----CCCc---ceeecCCCC---CHHHHHHHHHHCCCEEEEEe
Q psy6580 394 SHIQSVVDRNVKAGIPLDTVWIDIDYM----ERHN---NFVLAKPFY---GLKEYVQDLHKEGRHFIPIL 453 (490)
Q Consensus 394 ~eV~~vv~~~r~~~IPlDviw~Didym----~~~~---dFT~D~~Fp---d~~~mv~~Lh~~G~k~vliv 453 (490)
..+.+-++.+++.|| ++||+-.-+- .+|. -|..|.+|. ++++||+.+|++|+++|+=+
T Consensus 29 ~~i~~kl~~l~~lG~--t~iwl~P~~~~~~~~GY~~~d~y~~~~~~Gt~~el~~li~~~h~~gi~vi~D~ 96 (401)
T PLN02361 29 RNLEGKVPDLAKSGF--TSAWLPPPSQSLAPEGYLPQNLYSLNSAYGSEHLLKSLLRKMKQYNVRAMADI 96 (401)
T ss_pred HHHHHHHHHHHHcCC--CEEEeCCCCcCCCCCCCCcccccccCcccCCHHHHHHHHHHHHHcCCEEEEEE
Confidence 567777777777664 6787764332 1222 234456775 68899999999999998755
No 74
>PRK06769 hypothetical protein; Validated
Probab=27.23 E-value=1.1e+02 Score=28.48 Aligned_cols=46 Identities=15% Similarity=0.157 Sum_probs=30.3
Q ss_pred CeeEEEeeccccCC-Ccceee-c--CCCCCHHHHHHHHHHCCCEEEEEeC
Q psy6580 409 PLDTVWIDIDYMER-HNNFVL-A--KPFYGLKEYVQDLHKEGRHFIPILD 454 (490)
Q Consensus 409 PlDviw~Didym~~-~~dFT~-D--~~Fpd~~~mv~~Lh~~G~k~vlivd 454 (490)
|.-++.+|.|=.=. +..+.. + .-||+..++++.||++|+++.+..+
T Consensus 3 ~~~~~~~d~d~~~~~~~~~~~~~~~~~~pgv~e~L~~Lk~~G~~l~I~Tn 52 (173)
T PRK06769 3 NIQAIFIDRDGTIGGDTTIHYPGSFTLFPFTKASLQKLKANHIKIFSFTN 52 (173)
T ss_pred CCcEEEEeCCCcccCCCCCCCHHHeEECCCHHHHHHHHHHCCCEEEEEEC
Confidence 45567777554321 211222 2 5689999999999999999888754
No 75
>PRK08745 ribulose-phosphate 3-epimerase; Provisional
Probab=26.36 E-value=1.1e+02 Score=30.22 Aligned_cols=35 Identities=11% Similarity=0.164 Sum_probs=27.4
Q ss_pred ceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccccC
Q psy6580 425 NFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGVAS 459 (490)
Q Consensus 425 dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~~ 459 (490)
..|+. +.=+++.++++.+|+.|++..+-++|.-..
T Consensus 88 ~I~~H~Ea~~~~~~~l~~Ir~~g~k~GlalnP~T~~ 123 (223)
T PRK08745 88 TISFHPEASRHVHRTIQLIKSHGCQAGLVLNPATPV 123 (223)
T ss_pred EEEEcccCcccHHHHHHHHHHCCCceeEEeCCCCCH
Confidence 34555 544678899999999999999999998653
No 76
>cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself.
Probab=25.49 E-value=2.2e+02 Score=29.06 Aligned_cols=64 Identities=20% Similarity=0.295 Sum_probs=47.5
Q ss_pred ccccccCCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCC---------------------CHHHHHHHH
Q psy6580 385 LCRYGYKNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFY---------------------GLKEYVQDL 442 (490)
Q Consensus 385 qsr~gY~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fp---------------------d~~~mv~~L 442 (490)
.+| .|-+.+.++++++.|...++-.=.+++--+ ..|.+. ..|| +.+++++.-
T Consensus 8 ~aR-~~~~~~~lk~~id~ma~~K~N~lhlHl~D~-----~~~~le~~~~p~l~~~g~~~~~~~~~~~yT~~di~elv~yA 81 (303)
T cd02742 8 VSR-HFLSVESIKRTIDVLARYKINTFHWHLTDD-----QAWRIESKKFPELAEKGGQINPRSPGGFYTYAQLKDIIEYA 81 (303)
T ss_pred ccc-cCcCHHHHHHHHHHHHHhCCcEEEEeeecC-----CCceEeeCccchhhhhcccccCCCCCCeECHHHHHHHHHHH
Confidence 467 888999999999999999988766665522 123332 2222 567888888
Q ss_pred HHCCCEEEEEeC
Q psy6580 443 HKEGRHFIPILD 454 (490)
Q Consensus 443 h~~G~k~vlivd 454 (490)
+++|+.+|+-||
T Consensus 82 ~~rgI~viPEiD 93 (303)
T cd02742 82 AARGIEVIPEID 93 (303)
T ss_pred HHcCCEEEEecc
Confidence 999999999886
No 77
>PF03537 Glyco_hydro_114: Glycoside-hydrolase family GH114; InterPro: IPR004352 Eighty-one archaeal-like genes, ranging in size from 4-20kb, are clustered in 15 regions of the Thermotoga maritima genome []. Conservation of gene order between T. maritima and Archaea in many of these regions suggests that lateral gene transfer may have occurred between thermophilic Eubacteria and Archaea []. One of the T. maritima sequences (hypothetical protein TM1410) shares similarity with Methanocaldococcus jannaschii (Methanococcus jannaschii) hypothetical protein MJ1477 and with hypothetical protein DR0705 from Deinococcus radiodurans. The sequences are characterised by relatively variable N- and C-terminal domains, and a more conserved central domain. They share no similarity with any other known, functionally or structurally characterised proteins. ; PDB: 2AAM_F.
Probab=25.16 E-value=83 Score=25.44 Aligned_cols=24 Identities=25% Similarity=0.293 Sum_probs=17.6
Q ss_pred CHHHHHHHHHHCCCEEEEEeCccc
Q psy6580 434 GLKEYVQDLHKEGRHFIPILDPGV 457 (490)
Q Consensus 434 d~~~mv~~Lh~~G~k~vlivdP~I 457 (490)
..++-|+.||++|.+++-.++-+-
T Consensus 37 ~~~~~I~~L~~~G~~vicY~s~Gs 60 (74)
T PF03537_consen 37 FSKEEIARLKAQGKKVICYFSIGS 60 (74)
T ss_dssp --HHHHHHHHHTT-EEEEEEESSE
T ss_pred CCHHHHHHHHHCCCEEEEEEeCce
Confidence 357889999999999998876543
No 78
>TIGR01459 HAD-SF-IIA-hyp4 HAD-superfamily class IIA hydrolase, TIGR01459. This hypothetical equivalog is a member of the Class IIA subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. The sequences modelled by this equivalog are all gram negative and primarily alpha proteobacteria. Only one sequence hase been annotated as other than "hypothetical." That one, from Brucella, is annotated as related to NagD, but only by sequence similarity and should be treated with some skepticism. (See comments for Class IIA subfamily)
Probab=24.54 E-value=1.3e+02 Score=29.41 Aligned_cols=39 Identities=10% Similarity=0.234 Sum_probs=29.2
Q ss_pred eeEEEeeccccCCCcceeec--CCCCCHHHHHHHHHHCCCEEEEEeC
Q psy6580 410 LDTVWIDIDYMERHNNFVLA--KPFYGLKEYVQDLHKEGRHFIPILD 454 (490)
Q Consensus 410 lDviw~Didym~~~~dFT~D--~~Fpd~~~mv~~Lh~~G~k~vlivd 454 (490)
.+++.+|+|= . -|+ .-||+..++++.|+++|+++.+..+
T Consensus 8 ~~~~~~D~dG-----~-l~~~~~~~pga~e~L~~L~~~G~~~~ivTN 48 (242)
T TIGR01459 8 YDVFLLDLWG-----V-IIDGNHTYPGAVQNLNKIIAQGKPVYFVSN 48 (242)
T ss_pred CCEEEEeccc-----c-cccCCccCccHHHHHHHHHHCCCEEEEEeC
Confidence 4577777651 1 122 6689999999999999999988765
No 79
>COG0647 NagD Predicted sugar phosphatases of the HAD superfamily [Carbohydrate transport and metabolism]
Probab=24.10 E-value=1.3e+02 Score=30.56 Aligned_cols=43 Identities=19% Similarity=0.298 Sum_probs=31.7
Q ss_pred eeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEEEEeCccc
Q psy6580 410 LDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFIPILDPGV 457 (490)
Q Consensus 410 lDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivdP~I 457 (490)
.+++.+|+| +..--. +.||+..++++.|+++|++++++.+-..
T Consensus 8 y~~~l~DlD-----Gvl~~G~~~ipga~e~l~~L~~~g~~~iflTNn~~ 51 (269)
T COG0647 8 YDGFLFDLD-----GVLYRGNEAIPGAAEALKRLKAAGKPVIFLTNNST 51 (269)
T ss_pred cCEEEEcCc-----CceEeCCccCchHHHHHHHHHHcCCeEEEEeCCCC
Confidence 456777764 222123 7899999999999999999999875443
No 80
>PF13546 DDE_5: DDE superfamily endonuclease
Probab=23.87 E-value=2.2e+02 Score=28.17 Aligned_cols=49 Identities=24% Similarity=0.338 Sum_probs=40.0
Q ss_pred HHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeecCCCCCHHHHHHHHHHCCCEEEEEeCc
Q psy6580 394 SHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLAKPFYGLKEYVQDLHKEGRHFIPILDP 455 (490)
Q Consensus 394 ~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D~~Fpd~~~mv~~Lh~~G~k~vlivdP 455 (490)
+...+.+++....|+|...+..|.. |.....|++.|.+.|+++|.-+-+
T Consensus 170 ~la~~~i~~a~~~g~~~~~vvaDs~-------------y~~~~~f~~~l~~~~~~~i~~vr~ 218 (273)
T PF13546_consen 170 ELALEMIDRARQAGIPARWVVADSW-------------YGSSPAFRKALRERGLHYIGRVRS 218 (273)
T ss_pred HHHHHHHHHHHhcccccceEEeccc-------------cCChHHHHHHHHHCCceEEEeecc
Confidence 4567889999999999999988843 445678999999999999987743
No 81
>PRK13745 anaerobic sulfatase-maturase; Provisional
Probab=23.46 E-value=2.3e+02 Score=30.25 Aligned_cols=48 Identities=10% Similarity=0.119 Sum_probs=38.7
Q ss_pred CCHHHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHHCCCEEE
Q psy6580 391 KNLSHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHKEGRHFI 450 (490)
Q Consensus 391 ~s~~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~v 450 (490)
.+-+.|.+-++.++++|+++.+. .+++ .+...++++++.+++.|++.+
T Consensus 150 gsf~~v~~~i~~l~~~gi~~~i~------------~vv~~~n~~~~~e~~~~~~~lg~~~~ 198 (412)
T PRK13745 150 PSFVKVMKGINLLKKHGVEWNAM------------AVVNDFNADYPLDFYHFFKELDCHYI 198 (412)
T ss_pred ccHHHHHHHHHHHHHcCCCEEEE------------EEEcCCccccHHHHHHHHHHcCCCeE
Confidence 35678888899999999988665 4555 666788999999999999765
No 82
>TIGR01656 Histidinol-ppas histidinol-phosphate phosphatase family domain. This domain is a member of the haloacid-dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. This superfamily is distinguished by the presence of three motifs: an N-terminal motif containing the nucleophilic aspartate, a central motif containing an conserved serine or threonine, and a C-terminal motif containing a conserved lysine (or arginine) and conserved aspartates. More specifically, the domian modelled here is a member of subfamily III of the HAD-superfamily by virtue of lacking a "capping" domain in either of the two common positions, between motifs 1 and 2, or between motifs 2 and 3.
Probab=23.18 E-value=1.2e+02 Score=27.11 Aligned_cols=25 Identities=12% Similarity=0.111 Sum_probs=21.9
Q ss_pred CCCCCHHHHHHHHHHCCCEEEEEeC
Q psy6580 430 KPFYGLKEYVQDLHKEGRHFIPILD 454 (490)
Q Consensus 430 ~~Fpd~~~mv~~Lh~~G~k~vlivd 454 (490)
.-||+..++++.|+++|+++++..+
T Consensus 27 ~~~~g~~~~l~~Lk~~g~~~~I~Sn 51 (147)
T TIGR01656 27 QLRPGAVPALLTLRAAGYTVVVVTN 51 (147)
T ss_pred EEcCChHHHHHHHHHCCCEEEEEeC
Confidence 5589999999999999999998764
No 83
>PRK11767 SpoVR family protein; Provisional
Probab=23.17 E-value=1.4e+02 Score=33.07 Aligned_cols=47 Identities=19% Similarity=0.358 Sum_probs=34.0
Q ss_pred CHHHHHHHHHHhcCCCCCCCcccCCccccccccCCHHHHHHHHHHHHH--cCCCeeEEEee
Q psy6580 358 KPGDVISQYLDLIGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVK--AGIPLDTVWID 416 (490)
Q Consensus 358 tP~~Vi~qYt~LtGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~--~~IPlDviw~D 416 (490)
+..+++..|.++ |-|.+-|.|+||-+-- ..-..|+. .|||.+.+.+.
T Consensus 44 ~~eqml~~~as~-GmP~rY~HWsfGk~y~-----------~~~~~Y~~gl~glaYEiVINS 92 (498)
T PRK11767 44 TAEQMMDAYSSV-GMPINYRHWSFGKHFI-----------ETEQLYRRGQMGLAYEIVINS 92 (498)
T ss_pred CHHHHHHHHHhc-CCCCCCCcccccHHHH-----------HHHHHHhcCCCCCceEEEecC
Confidence 577899999888 9999999999994332 22334443 67888877655
No 84
>PF03943 TAP_C: TAP C-terminal domain; InterPro: IPR005637 This entry contains the NXF family of shuttling transport receptors for nuclear export of mRNA, which include: vertebrate mRNA export factor TAP or nuclear RNA export factor 1 (NXF1). Caenorhabditis elegans nuclear RNA export factor 1 (nxf-1). yeast mRNA export factor MEX67. Members of the NXF family have a modular structure. A nuclear localization sequence and a noncanonical RNA recognition motif (RRM) (see PDOC00030 from PROSITEDOC) followed by four LRR repeats are located in its N-terminal half. The C-terminal half contains a NTF2 domain (see PDOC50177 from PROSITEDOC) followed by a second domain, TAP-C. The TAP-C domain is important for binding to FG repeat-containing nuclear pore proteins (FG-nucleoporins) and is sufficient to mediate nuclear shuttling [,]. The Tap-C domain is made of four alpha helices packed against each other. The arrangement of helices 1, 2 and 3 is similar to that seen in a UBA fold. and is joined to the next module by flexible 12-residue Pro-rich linker [, ].; GO: 0051028 mRNA transport, 0005634 nucleus; PDB: 1OAI_A 1GO5_A 2KHH_A 2JP7_A.
Probab=22.92 E-value=36 Score=25.73 Aligned_cols=49 Identities=16% Similarity=0.396 Sum_probs=30.2
Q ss_pred HHHHHHHHHhcCCCCCCCcccCC-ccccccccCCHHHHHHHHHHHHHcCCCeeEE
Q psy6580 360 GDVISQYLDLIGYPELPPYWSLG-FHLCRYGYKNLSHIQSVVDRNVKAGIPLDTV 413 (490)
Q Consensus 360 ~~Vi~qYt~LtGrp~mpP~WaLG-fhqsr~gY~s~~eV~~vv~~~r~~~IPlDvi 413 (490)
.++|.++...|| |=+.||.= .+++-|.|. ..+..+.+-..+..||-+++
T Consensus 1 q~mv~~~s~~Tg---mn~~~s~~CL~~n~Wd~~--~A~~~F~~l~~~~~IP~eAF 50 (51)
T PF03943_consen 1 QEMVQQFSQQTG---MNLEWSQKCLEENNWDYE--RALQNFEELKAQGKIPPEAF 50 (51)
T ss_dssp HHHHHHHHHHCS---S-CCHHHHHHHHTTT-CC--HHHHHHHHCCCTT-S-CCCC
T ss_pred CHHHHHHHHHHC---CCHHHHHHHHHHcCCCHH--HHHHHHHHHHHcCCCChHhc
Confidence 368899999999 99999963 466777775 33333444444556998764
No 85
>cd04902 ACT_3PGDH-xct C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH). The C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH), with an extended C-terminal (xct) region from bacteria, archaea, fungi, and plants. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In bacteria, 3PGDH is feedback-controlled by the end product L-serine in an allosteric manner. Some 3PGDH enzymes have an additional domain formed by an extended C-terminal region. This additional domain introduces significant asymmetry to the homotetramer. Adjacent ACT (regulatory) domains interact, creating two serine-binding sites, however, this asymmetric arrangement results in the formation of two different and distinct domain interfaces between iden
Probab=22.56 E-value=3.2e+02 Score=20.60 Aligned_cols=59 Identities=19% Similarity=0.112 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHHcCCCeeEEEeeccccCCCcceeec-CCCCCHHHHHHHHHH-CCCEEEEEe
Q psy6580 394 SHIQSVVDRNVKAGIPLDTVWIDIDYMERHNNFVLA-KPFYGLKEYVQDLHK-EGRHFIPIL 453 (490)
Q Consensus 394 ~eV~~vv~~~r~~~IPlDviw~Didym~~~~dFT~D-~~Fpd~~~mv~~Lh~-~G~k~vliv 453 (490)
..+.++.+-+.++|+.+..+...-...++.....+. +. +.+.++++.|++ .|++.+.++
T Consensus 11 G~l~~i~~~l~~~~inI~~~~~~~~~~~~~~~~~i~v~~-~~~~~~~~~l~~~~~v~~v~~~ 71 (73)
T cd04902 11 GVIGKVGTILGEAGINIAGMQVGRDEPGGEALMVLSVDE-PVPDEVLEELRALPGILSAKVV 71 (73)
T ss_pred CHHHHHHHHHHHcCcChhheEeeccCCCCEEEEEEEeCC-CCCHHHHHHHHcCCCccEEEEE
Confidence 356677788899999998876532222344445556 44 557799999988 588777654
No 86
>TIGR01668 YqeG_hyp_ppase HAD superfamily (subfamily IIIA) phosphatase, TIGR01668. This family consists of sequences from fungi, plants, cyanobacteria, gram-positive bacteria and Deinococcus. There is presently no characterization of any sequence in this family.
Probab=22.48 E-value=1.5e+02 Score=27.42 Aligned_cols=43 Identities=16% Similarity=0.393 Sum_probs=31.6
Q ss_pred CCeeEEEeeccccCCCcceeec--CCCCCHHHHHHHHHHCCCEEEEEeC
Q psy6580 408 IPLDTVWIDIDYMERHNNFVLA--KPFYGLKEYVQDLHKEGRHFIPILD 454 (490)
Q Consensus 408 IPlDviw~Didym~~~~dFT~D--~~Fpd~~~mv~~Lh~~G~k~vlivd 454 (490)
..+..+++|.|-+-. .++ .-||+..++++.|+++|+++++..+
T Consensus 23 ~~v~~vv~D~Dgtl~----~~~~~~~~pgv~e~L~~Lk~~g~~l~I~Sn 67 (170)
T TIGR01668 23 VGIKGVVLDKDNTLV----YPDHNEAYPALRDWIEELKAAGRKLLIVSN 67 (170)
T ss_pred CCCCEEEEecCCccc----cCCCCCcChhHHHHHHHHHHcCCEEEEEeC
Confidence 467788888763211 112 5789999999999999999887654
No 87
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=22.46 E-value=83 Score=33.01 Aligned_cols=58 Identities=12% Similarity=0.054 Sum_probs=38.8
Q ss_pred HHHHHHHHHHHHHcCCCeeEE-EeeccccCC-CcceeecCCCCCHHHHHHHHHHCCCEEEEEeC
Q psy6580 393 LSHIQSVVDRNVKAGIPLDTV-WIDIDYMER-HNNFVLAKPFYGLKEYVQDLHKEGRHFIPILD 454 (490)
Q Consensus 393 ~~eV~~vv~~~r~~~IPlDvi-w~Didym~~-~~dFT~D~~Fpd~~~mv~~Lh~~G~k~vlivd 454 (490)
.+++.+.++.|+++||-+=.| ++.+..+.. .+.|.| .-+..+++.++++|+++++.+-
T Consensus 9 ~e~~~~d~~~m~~~G~n~vri~~~~W~~lEP~eG~ydF----~~lD~~l~~a~~~Gi~viL~~~ 68 (374)
T PF02449_consen 9 EEEWEEDLRLMKEAGFNTVRIGEFSWSWLEPEEGQYDF----SWLDRVLDLAAKHGIKVILGTP 68 (374)
T ss_dssp CCHHHHHHHHHHHHT-SEEEE-CCEHHHH-SBTTB-------HHHHHHHHHHHCTT-EEEEEEC
T ss_pred HHHHHHHHHHHHHcCCCEEEEEEechhhccCCCCeeec----HHHHHHHHHHHhccCeEEEEec
Confidence 378899999999999987765 345444432 222322 2488999999999999998763
No 88
>TIGR02039 CysD sulfate adenylyltransferase, small subunit. In Escherichia coli, ATP sulfurylase is a heterodimer composed of two subunits encoded by cysD and cysN, with APS kinase encoded by cysC. These genes are located in a unidirectionally transcribed gene cluster, and have been shown to be required for the synthesis of sulfur-containing amino acids. Homologous to this E.coli activation pathway are nodPQH gene products found among members of the Rhizobiaceae family. These gene products have been shown to exhibit ATP sulfurase and APS kinase activity, yet are involved in Nod factor sulfation, and sulfation of other macromolecules.
Probab=22.14 E-value=2.5e+02 Score=29.00 Aligned_cols=73 Identities=15% Similarity=0.271 Sum_probs=48.2
Q ss_pred HHHHHHHHHhcCCCCCCCcccCCccccccccCCHHHHHHHHHHHHH-cCCCeeEEEeeccccCCCcceeecCCCCCHHHH
Q psy6580 360 GDVISQYLDLIGYPELPPYWSLGFHLCRYGYKNLSHIQSVVDRNVK-AGIPLDTVWIDIDYMERHNNFVLAKPFYGLKEY 438 (490)
Q Consensus 360 ~~Vi~qYt~LtGrp~mpP~WaLGfhqsr~gY~s~~eV~~vv~~~r~-~~IPlDviw~Didym~~~~dFT~D~~Fpd~~~m 438 (490)
.++++.-.+..++|.|. ||.| ++...+..++.+.-. .++|+.++++|-.| .||..-+|
T Consensus 9 i~ilRe~~~~f~~~vv~--~SGG--------KDS~VlLhLa~kaf~~~~~p~~vl~IDTG~-----------~F~Et~ef 67 (294)
T TIGR02039 9 IHIIREVAAEFERPVML--YSIG--------KDSSVLLHLARKAFYPGPLPFPLLHVDTGW-----------KFREMIAF 67 (294)
T ss_pred HHHHHHHHHhcCCcEEE--EecC--------hHHHHHHHHHHHHhcccCCCeEEEEEecCC-----------CCHHHHHH
Confidence 34555544445666553 6766 555667777766533 37899999998432 47888889
Q ss_pred HHHHHH-CCCEEEEEe
Q psy6580 439 VQDLHK-EGRHFIPIL 453 (490)
Q Consensus 439 v~~Lh~-~G~k~vliv 453 (490)
++++.+ .|+.+++..
T Consensus 68 rd~~a~~~gl~l~v~~ 83 (294)
T TIGR02039 68 RDHMVAKYGLRLIVHS 83 (294)
T ss_pred HHHHHHHhCCCEEEEe
Confidence 988854 588877754
No 89
>TIGR00262 trpA tryptophan synthase, alpha subunit. Tryptophan synthase catalyzes the last step in the biosynthesis of tryptophan. The alpha chain is responsible for the aldol cleavage of indoleglycerol phosphate to indole and glyceraldehyde 3-phosphate. In bacteria and plants each domain is found on a separate subunit (alpha and beta chains), while in fungi the two domains are fused together on a single multifunctional protein. The signature pattern for trpA contains three conserved acidic residues. [LIVM]-E-[LIVM]-G-x(2)-[FYC]-[ST]-[DE]-[PA]-[LIVMY]-[AGLI]-[DE]-G and this is located between residues 43-58 of the model. The Sulfolobus solfataricus trpA is known to be quite divergent from other known trpA sequences.
Probab=22.02 E-value=2.4e+02 Score=28.24 Aligned_cols=68 Identities=13% Similarity=0.313 Sum_probs=43.7
Q ss_pred CCHHHHHHHHHHHHHc--CCCeeEE-Eeec-------cccCC-----Ccceeec-CCCCCHHHHHHHHHHCCCEEEEEeC
Q psy6580 391 KNLSHIQSVVDRNVKA--GIPLDTV-WIDI-------DYMER-----HNNFVLA-KPFYGLKEYVQDLHKEGRHFIPILD 454 (490)
Q Consensus 391 ~s~~eV~~vv~~~r~~--~IPlDvi-w~Di-------dym~~-----~~dFT~D-~~Fpd~~~mv~~Lh~~G~k~vlivd 454 (490)
.+.+.+.+.++++|+. ++|+..+ |... .|... -.-..+. .-+....++++.+++.|+..++.+.
T Consensus 69 ~~~~~~~~~v~~ir~~~~~~plv~m~Y~Npi~~~G~e~f~~~~~~aGvdgviipDlp~ee~~~~~~~~~~~gl~~i~lv~ 148 (256)
T TIGR00262 69 MTPEKCFELLKKVRQKHPNIPIGLLTYYNLIFRKGVEEFYAKCKEVGVDGVLVADLPLEESGDLVEAAKKHGVKPIFLVA 148 (256)
T ss_pred CCHHHHHHHHHHHHhcCCCCCEEEEEeccHHhhhhHHHHHHHHHHcCCCEEEECCCChHHHHHHHHHHHHCCCcEEEEEC
Confidence 4567889999999864 8998754 2221 11111 1112222 3334567888999999999999988
Q ss_pred cccc
Q psy6580 455 PGVA 458 (490)
Q Consensus 455 P~I~ 458 (490)
|.-.
T Consensus 149 P~T~ 152 (256)
T TIGR00262 149 PNAD 152 (256)
T ss_pred CCCC
Confidence 8754
No 90
>smart00354 HTH_LACI helix_turn _helix lactose operon repressor.
Probab=21.91 E-value=1.2e+02 Score=23.84 Aligned_cols=50 Identities=16% Similarity=0.332 Sum_probs=39.6
Q ss_pred hhhhHhhhhcChhhhHHhhccccchhhhhh--HHHHHHHhccChhhhccccc
Q psy6580 6 FRDIVDELALDKDTINEILATDKDINENLA--LDRAIKKLALDKDTINKNLA 55 (490)
Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 55 (490)
.+||-..+.+...|+..+|.....|.+..+ +.++++++........+.+.
T Consensus 3 ~~~iA~~~gvS~~TVSr~ln~~~~v~~~t~~~i~~~~~~~gy~~~~~~~~~~ 54 (70)
T smart00354 3 IKDVARLAGVSKATVSRVLNGNGRVSEETREKVLAAMEELGYIPNRVARSLK 54 (70)
T ss_pred HHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHhCCCCCHHHHHHh
Confidence 468889999999999999998888876665 77888888887665555443
No 91
>PF03740 PdxJ: Pyridoxal phosphate biosynthesis protein PdxJ; InterPro: IPR004569 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). Pyridoxal 5'-phosphate (PLP) is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination [, , ]. PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors []. Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy []. PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalysed reactions, enzymatic and non-enzymatic []. In Escherichia coli, the pdx genes involved in vitamin B6 have been characterised [, , ]. This entry represents PdxJ, which catalyses the condensation of 1-amino-3-oxo-4-(phosphohydroxy)propan-2-one and 1-deoxy-D-xylulose-5-phosphate to form pyridoxine-5'-phosphate. The product of the PdxJ reaction is then oxidized by PdxH to pyridoxal 5'-phosphate.; GO: 0008615 pyridoxine biosynthetic process, 0005737 cytoplasm; PDB: 3F4N_B 3O6D_A 3O6C_A 1M5W_G 1IXQ_D 1IXP_B 1IXN_A 1HO4_C 1HO1_A 1IXO_D ....
Probab=21.62 E-value=55 Score=32.79 Aligned_cols=29 Identities=21% Similarity=0.245 Sum_probs=21.9
Q ss_pred CCCCCHHHHHHHHHHCCCEEEEEeCcccc
Q psy6580 430 KPFYGLKEYVQDLHKEGRHFIPILDPGVA 458 (490)
Q Consensus 430 ~~Fpd~~~mv~~Lh~~G~k~vlivdP~I~ 458 (490)
.....+.+++++||+.|+++.+.+||-..
T Consensus 108 ~~~~~l~~~i~~L~~~gIrvSLFiDP~~~ 136 (239)
T PF03740_consen 108 GNRDRLKPVIKRLKDAGIRVSLFIDPDPE 136 (239)
T ss_dssp GGHHHHHHHHHHHHHTT-EEEEEE-S-HH
T ss_pred cCHHHHHHHHHHHHhCCCEEEEEeCCCHH
Confidence 34467889999999999999999999864
No 92
>KOG3974|consensus
Probab=20.68 E-value=1.5e+02 Score=30.27 Aligned_cols=58 Identities=29% Similarity=0.329 Sum_probs=43.3
Q ss_pred EEEeCCCHHHHHHHHH-HhcCCCCCCCccc---------------CCccccccccCCHHHHHHHHHHHHHcCCCe
Q psy6580 352 YYFLGPKPGDVISQYL-DLIGYPELPPYWS---------------LGFHLCRYGYKNLSHIQSVVDRNVKAGIPL 410 (490)
Q Consensus 352 y~f~GptP~~Vi~qYt-~LtGrp~mpP~Wa---------------LGfhqsr~gY~s~~eV~~vv~~~r~~~IPl 410 (490)
++|-.|....||+.|+ +|+=.|.|--.|+ .|.-+.|. =.-+.++.++++-.++.++|+
T Consensus 62 HiFCe~~Aa~vIKsYsPdLIVHP~l~~~~av~~i~k~L~RlhavVIGPGLGRd-p~~~k~i~~iley~~~~dvP~ 135 (306)
T KOG3974|consen 62 HIFCEPEAAVVIKSYSPDLIVHPVLDQENAVDIIEKLLQRLHAVVIGPGLGRD-PAILKEIAKILEYLRGKDVPL 135 (306)
T ss_pred eeeechhHHHHHhhcCCceeecccccCCchHhHHHHHHhheeEEEECCCCCCC-HHHHHHHHHHHHHHhcCCCcE
Confidence 7888899999999998 6888998876662 23334441 123567888888889999997
No 93
>PRK14511 maltooligosyl trehalose synthase; Provisional
Probab=20.61 E-value=4e+02 Score=31.79 Aligned_cols=79 Identities=13% Similarity=0.179 Sum_probs=48.3
Q ss_pred CHHHHHHHHHHHHHcCCCeeEEEeecccc------CCCc--ce-eecCCCC---CHHHHHHHHHHCCCEEEEEeCcc-cc
Q psy6580 392 NLSHIQSVVDRNVKAGIPLDTVWIDIDYM------ERHN--NF-VLAKPFY---GLKEYVQDLHKEGRHFIPILDPG-VA 458 (490)
Q Consensus 392 s~~eV~~vv~~~r~~~IPlDviw~Didym------~~~~--dF-T~D~~Fp---d~~~mv~~Lh~~G~k~vlivdP~-I~ 458 (490)
+.+.+.+.+.-+++.|| ++||+-.-+- .+|. || ..|..|. ++++|++.+|++|+++|+=+=|. ++
T Consensus 18 tf~~~~~~l~YL~~LGi--s~IyLsPi~~a~~gs~hGYdv~D~~~idp~lGt~e~f~~Lv~aah~~Gi~VIlDiV~NH~~ 95 (879)
T PRK14511 18 TFDDAAELVPYFADLGV--SHLYLSPILAARPGSTHGYDVVDHTRINPELGGEEGLRRLAAALRAHGMGLILDIVPNHMA 95 (879)
T ss_pred CHHHHHHHhHHHHHcCC--CEEEECcCccCCCCCCCCCCcCCCCCcCCCCCCHHHHHHHHHHHHHCCCEEEEEecccccc
Confidence 45677788888888764 6777764221 1221 22 2344453 57899999999999999877553 33
Q ss_pred CCCCCCChhhHHHHH
Q psy6580 459 SREDSNYLPYVEGVE 473 (490)
Q Consensus 459 ~~~~~~y~~y~eg~e 473 (490)
...+ ..+.|.+.++
T Consensus 96 ~~~~-~n~ww~dvl~ 109 (879)
T PRK14511 96 VGGP-DNPWWWDVLE 109 (879)
T ss_pred CcCc-cCHHHHHHHH
Confidence 2222 2455665554
No 94
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=20.45 E-value=1.6e+02 Score=26.68 Aligned_cols=23 Identities=13% Similarity=0.322 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHcCCCeeEEEeec
Q psy6580 394 SHIQSVVDRNVKAGIPLDTVWIDI 417 (490)
Q Consensus 394 ~eV~~vv~~~r~~~IPlDviw~Di 417 (490)
+-+.+.+++.-+ ..|+|+||+||
T Consensus 110 e~~~~~i~Ei~~-~y~~DGiF~D~ 132 (132)
T PF14871_consen 110 EFLLEQIREILD-RYDVDGIFFDI 132 (132)
T ss_pred HHHHHHHHHHHH-cCCCCEEEecC
Confidence 444444444332 23999999996
Done!