Query psy16054
Match_columns 563
No_of_seqs 512 out of 3123
Neff 7.3
Searched_HMMs 46136
Date Fri Aug 16 19:40:53 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16054.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16054hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR03212 uraD_N-term-dom puta 100.0 1.3E-35 2.9E-40 303.5 20.7 223 221-493 66-294 (297)
2 PRK15394 4-deoxy-4-formamido-L 100.0 2.9E-27 6.3E-32 241.2 21.7 216 231-499 17-278 (296)
3 TIGR03006 pepcterm_polyde poly 99.9 2.7E-24 5.9E-29 217.0 21.0 211 233-487 29-264 (265)
4 TIGR02764 spore_ybaN_pdaB poly 99.9 3.6E-23 7.7E-28 199.8 20.3 186 217-487 4-191 (191)
5 TIGR02884 spore_pdaA delta-lac 99.9 1.7E-22 3.7E-27 200.0 19.8 119 217-349 35-155 (224)
6 TIGR02873 spore_ylxY probable 99.9 3.8E-22 8.3E-27 201.9 20.2 118 217-349 83-202 (268)
7 PF01522 Polysacc_deac_1: Poly 99.9 6.1E-23 1.3E-27 183.1 9.2 118 217-350 4-123 (123)
8 COG0726 CDA1 Predicted xylanas 99.8 2.6E-18 5.7E-23 172.1 20.4 115 219-348 65-181 (267)
9 PRK14581 hmsF outer membrane N 99.6 3.8E-15 8.3E-20 166.1 13.7 126 217-354 105-275 (672)
10 PRK14582 pgaB outer membrane N 99.6 6.1E-15 1.3E-19 164.7 13.8 126 217-354 105-275 (671)
11 PF00057 Ldl_recept_a: Low-den 99.0 1.5E-10 3.3E-15 81.1 2.6 36 142-177 2-37 (37)
12 cd00112 LDLa Low Density Lipop 99.0 3.5E-10 7.7E-15 78.4 2.4 35 143-177 1-35 (35)
13 smart00192 LDLa Low-density li 98.6 2.1E-08 4.6E-13 68.5 2.6 32 143-174 2-33 (33)
14 PF10096 DUF2334: Uncharacteri 98.2 6.3E-05 1.4E-09 75.6 16.1 117 222-348 3-146 (243)
15 PF01607 CBM_14: Chitin bindin 97.9 3.7E-06 8E-11 63.2 1.1 53 29-90 1-53 (53)
16 COG3233 Predicted deacetylase 97.7 0.00028 6.2E-09 68.1 11.2 125 220-355 5-143 (233)
17 PF09960 DUF2194: Uncharacteri 97.7 0.0025 5.4E-08 71.3 19.1 196 249-491 297-531 (585)
18 cd00112 LDLa Low Density Lipop 97.5 4.9E-05 1.1E-09 52.6 1.7 33 96-132 3-35 (35)
19 PF12999 PRKCSH-like: Glucosid 97.5 9E-05 2E-09 69.8 3.8 67 97-174 36-110 (176)
20 PF00057 Ldl_recept_a: Low-den 97.5 5E-05 1.1E-09 53.2 1.5 33 96-132 5-37 (37)
21 smart00494 ChtBD2 Chitin-bindi 97.2 0.00037 8.1E-09 52.6 4.1 50 28-86 2-51 (56)
22 KOG1215|consensus 97.1 0.0015 3.2E-08 77.6 8.7 75 96-180 220-297 (877)
23 KOG1215|consensus 97.0 0.0012 2.6E-08 78.3 7.5 118 96-219 140-296 (877)
24 smart00192 LDLa Low-density li 96.8 0.00066 1.4E-08 46.3 1.8 29 97-129 5-33 (33)
25 PF04748 Polysacc_deac_2: Dive 96.4 0.024 5.1E-07 55.9 10.1 182 251-488 19-213 (213)
26 PF03065 Glyco_hydro_57: Glyco 95.7 0.023 5E-07 60.4 7.2 195 266-492 113-322 (360)
27 PF11959 DUF3473: Domain of un 94.8 0.13 2.8E-06 46.8 7.9 55 432-488 62-131 (133)
28 COG1449 Alpha-amylase/alpha-ma 94.2 0.83 1.8E-05 52.0 14.4 80 267-351 183-264 (615)
29 PF15421 Polysacc_deac_3: Puta 92.5 3.3 7E-05 43.8 14.1 173 265-497 131-312 (425)
30 COG2861 Uncharacterized protei 92.4 4.4 9.6E-05 40.3 14.3 212 212-488 23-245 (250)
31 KOG2397|consensus 92.3 0.099 2.1E-06 56.3 3.0 70 98-178 44-117 (480)
32 PF12999 PRKCSH-like: Glucosid 86.1 0.57 1.2E-05 44.5 2.7 33 144-176 34-68 (176)
33 PF01607 CBM_14: Chitin bindin 76.9 1.3 2.7E-05 32.6 1.1 28 96-123 26-53 (53)
34 PF01074 Glyco_hydro_38: Glyco 72.1 22 0.00048 36.0 9.3 80 267-354 64-145 (275)
35 KOG1958|consensus 67.0 1.2E+02 0.0026 36.0 14.1 74 267-348 229-304 (1129)
36 KOG3509|consensus 62.4 14 0.00031 43.8 6.1 89 83-177 18-110 (964)
37 PF07315 DUF1462: Protein of u 60.0 38 0.00082 28.6 6.5 53 417-474 11-63 (93)
38 smart00494 ChtBD2 Chitin-bindi 53.8 11 0.00023 27.6 2.2 24 96-119 28-51 (56)
39 COG4878 Uncharacterized protei 53.8 22 0.00048 36.1 5.0 98 254-354 34-139 (309)
40 COG1543 Uncharacterized conser 51.6 36 0.00079 37.2 6.4 78 269-353 129-207 (504)
41 PF12340 DUF3638: Protein of u 48.5 85 0.0018 31.3 8.1 63 424-489 82-144 (229)
42 PF04794 YdjC: YdjC-like prote 43.1 63 0.0014 32.6 6.5 119 220-349 2-152 (261)
43 COG4837 Uncharacterized protei 42.5 45 0.00098 28.3 4.3 54 417-475 18-71 (106)
44 cd07286 PX_SNX18 The phosphoin 41.4 46 0.001 30.0 4.6 73 428-505 41-120 (127)
45 PLN02701 alpha-mannosidase 40.8 63 0.0014 39.5 7.0 79 267-353 106-186 (1050)
46 COG3590 PepO Predicted metallo 40.6 8.9 0.00019 42.7 -0.1 84 216-319 446-539 (654)
47 COG3536 Uncharacterized protei 40.3 10 0.00022 33.0 0.3 78 207-286 3-90 (120)
48 TIGR01358 DAHP_synth_II 3-deox 40.3 1.5E+02 0.0033 32.3 9.0 84 228-315 287-380 (443)
49 PRK09819 alpha-mannosidase; Pr 39.7 1.1E+02 0.0024 36.7 8.8 79 266-354 69-149 (875)
50 cd02995 PDI_a_PDI_a'_C PDIa fa 38.6 72 0.0016 26.3 5.3 51 433-483 8-58 (104)
51 PLN02291 phospho-2-dehydro-3-d 38.0 1.7E+02 0.0036 32.2 8.9 84 228-315 307-400 (474)
52 PF11071 DUF2872: Protein of u 33.6 52 0.0011 29.8 3.6 52 428-490 84-136 (141)
53 KOG3509|consensus 33.0 52 0.0011 39.4 4.5 61 113-180 9-69 (964)
54 TIGR03473 HpnK hopanoid biosyn 31.8 2.9E+02 0.0063 28.3 9.4 117 221-349 4-154 (283)
55 COG3603 Uncharacterized conser 31.7 73 0.0016 28.4 4.1 14 267-280 114-127 (128)
56 PRK02134 hypothetical protein; 31.4 4E+02 0.0087 26.8 10.1 117 221-348 5-149 (249)
57 KOG1959|consensus 30.4 1.4E+02 0.003 35.3 7.1 76 267-348 111-188 (996)
58 COG3200 AroG 3-deoxy-D-arabino 27.8 3.2E+02 0.0069 29.1 8.6 97 199-314 277-383 (445)
59 cd07295 PX_Grd19 The phosphoin 25.9 84 0.0018 27.7 3.6 62 428-493 47-114 (116)
60 COG3531 Predicted protein-disu 25.9 1.6E+02 0.0034 28.8 5.6 72 267-354 20-92 (212)
61 cd02992 PDI_a_QSOX PDIa family 25.7 1.6E+02 0.0034 25.5 5.4 50 433-482 9-59 (114)
62 PF13788 DUF4180: Domain of un 24.0 88 0.0019 27.7 3.3 30 461-490 83-113 (113)
63 cd02334 ZZ_dystrophin Zinc fin 23.7 94 0.002 23.1 3.0 36 253-288 11-46 (49)
64 TIGR02026 BchE magnesium-proto 23.1 1.9E+02 0.0041 32.2 6.6 62 421-487 410-471 (497)
65 PF13592 HTH_33: Winged helix- 22.6 1.4E+02 0.0029 22.9 3.8 18 336-353 25-42 (60)
66 cd06860 PX_SNX7_30_like The ph 22.2 1.2E+02 0.0025 26.7 3.8 49 428-479 46-104 (116)
67 PF12979 DUF3863: Domain of Un 21.0 2.5E+02 0.0054 28.4 6.0 101 237-350 46-156 (351)
68 PF02456 Adeno_IVa2: Adenoviru 20.6 55 0.0012 34.2 1.6 31 203-235 151-181 (369)
69 TIGR03646 YtoQ_fam YtoQ family 20.4 95 0.0021 28.2 2.8 52 428-490 87-139 (144)
70 COG4637 Predicted ATPase [Gene 20.1 2.4E+02 0.0052 29.8 6.0 60 425-486 273-332 (373)
No 1
>TIGR03212 uraD_N-term-dom putative urate catabolism protein. This model represents a protein that is predominantly found just upstream of the UraD protein (OHCU decarboxylase) and in a number of instances as a N-terminal fusion with it. UraD itself catalyzes the last step in the catabolism of urate to allantoate. The function of this protein is presently unknown. It shows homology with the pfam01522 polysaccharide deacetylase domain family.
Probab=100.00 E-value=1.3e-35 Score=303.54 Aligned_cols=223 Identities=17% Similarity=0.210 Sum_probs=181.4
Q ss_pred eEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCCCCCH
Q psy16054 221 ITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWSGGSY 298 (563)
Q Consensus 221 V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~ 298 (563)
-.+|+=+..+.+....+++||++ ++|||||||+|... ++++|++|.++|||||+||++|..+. .++.
T Consensus 66 ~~~s~~~YG~rvG~~RiLdlL~~-------~gv~aTffv~g~~~e~~P~~v~~i~~~GHEIg~Hg~~H~~~~----~ls~ 134 (297)
T TIGR03212 66 SMESLYEYGSRAGFWRLLRLFTE-------RGIPVTVFGVAMALARNPEAVAAMKEAGWEIASHGLRWIDYQ----DMDE 134 (297)
T ss_pred cchhhhhhcchhCHHHHHHHHHH-------cCCCEEEEeEHHHHHHCHHHHHHHHHcCCEEeeccccCcccc----cCCH
Confidence 35667776666778889999974 79999999999985 67999999999999999999999864 6899
Q ss_pred HHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHH-cCCeEEeecCCCCCCCCccceeccCCCCCCcCC
Q psy16054 299 DDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTD-QFFVYDASITASLGRVPIWPYTLYFRMPHKCNG 377 (563)
Q Consensus 299 ~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e-~Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~ 377 (563)
++++++|.++.++|++++|+ .|+|||++ ..+. +|+++|++ .||+|+||++++ +.|||--+
T Consensus 135 ~~e~~~i~~s~~~i~~~tG~---~P~G~~~~--~~s~-~T~~LL~e~~Gf~Y~sd~~~d--D~Py~~~~----------- 195 (297)
T TIGR03212 135 AQEREHIAEAIRLHTEVTGE---RPLGWYTG--RTSP-NTRRLVAEEGGFLYDADSYAD--DLPYWDEV----------- 195 (297)
T ss_pred HHHHHHHHHHHHHHHHHhCC---CCceEECC--CCCh-hHHHHHHHhcCceEeCchhhc--CCCeEeec-----------
Confidence 99999999999999998885 48999974 6674 69999999 999999999875 44444332
Q ss_pred CCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCC---cceEEeecccc
Q psy16054 378 NAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNR---APLGLHFHASW 454 (563)
Q Consensus 378 ~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nr---ap~~l~lH~~~ 454 (563)
...++++||++ |+.+|. .|.. .....++++++++|+++|+.+|++.+ ..|+|.|| ++
T Consensus 196 -------~~~~~l~lP~~-~~~nD~-~~~~----------~~~~~~~~~~~~~~~d~fd~l~~eg~~~~~~~~i~lH-p~ 255 (297)
T TIGR03212 196 -------AGRPQLIVPYT-LDANDM-RFAT----------PQGFNTGEQFFTYLRDAFDVLYAEGEGAPKMMSIGLH-CR 255 (297)
T ss_pred -------CCCCeEEEecc-cccCcH-HHHh----------ccCCCCHHHHHHHHHHHHHHHHHhCCCCCceEEEecC-cc
Confidence 11236779987 444442 2210 11235789999999999999998643 36799999 88
Q ss_pred ccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHhhCC
Q psy16054 455 LKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP 493 (563)
Q Consensus 455 l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~~~p 493 (563)
++|+|+|+.+|++||+||++++||||+|.+||++|++.-
T Consensus 256 i~G~p~R~~~L~~~l~~i~~~~~VW~at~~eIA~~~~~~ 294 (297)
T TIGR03212 256 LVGRPGRIAALQRFLDYVQSHDKVWVARRIDIARHWHET 294 (297)
T ss_pred ccCCHHHHHHHHHHHHHHHhCCCEEEEcHHHHHHHHHHh
Confidence 999999999999999999999999999999999998543
No 2
>PRK15394 4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol deformylase ArnD; Provisional
Probab=99.95 E-value=2.9e-27 Score=241.20 Aligned_cols=216 Identities=16% Similarity=0.190 Sum_probs=165.8
Q ss_pred CCCchHHHHHhccCCCCCCCCCCceEEEEecc----------------------------C------------------C
Q psy16054 231 VDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHK----------------------------Y------------------T 264 (563)
Q Consensus 231 ~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~----------------------------~------------------~ 264 (563)
....+.+++||++ ++|+||||.+.. | .
T Consensus 17 ~~g~~~~~~~~~~-------~~~~a~f~~~~gpd~~g~~~~r~~~~~fl~k~~r~~~~~~yg~~~l~~g~~~PG~~I~~~ 89 (296)
T PRK15394 17 REGVPRLLEILSK-------HGIQASFFFSVGPDNMGRHLWRLLKPRFLWKMLRSNAASLYGWDILLAGTAWPGKEIGKA 89 (296)
T ss_pred ccCHHHHHHHHHH-------cCCCEEEEeccCCCchhHHHHHHhhhHHHHHHHhcCchhhcCHHHHhhcccCCCchHHHH
Confidence 3445778899974 789999999853 0 0
Q ss_pred CHHHHHHHHHcCCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHH
Q psy16054 265 NYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTD 344 (563)
Q Consensus 265 ~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e 344 (563)
++++|++|.++ ||||+|||+|..+......++.+++++||.++.++|++.+|. .++|||+|++..+. +++++|++
T Consensus 90 ~P~lvr~i~~~-HEIG~Hg~~H~~wq~~~~~ls~~e~~~eI~ra~~~Le~itG~---~p~g~raPgw~~~~-~tl~ll~e 164 (296)
T PRK15394 90 LADIIREAAKA-HEVGLHAWDHHAWQAWSGVWSRQQLIEQIARGVDALEEIIGQ---PVTCSAAAGWRADQ-RVVEAKEA 164 (296)
T ss_pred CHHHHHHHHhc-CEehhcccCccchhcccccCCHHHHHHHHHHHHHHHHHHhCC---CCCEEeCCCccCCH-HHHHHHHH
Confidence 34699999999 999999999996432223689999999999999999998884 58999999999995 69999999
Q ss_pred cCCeEEeecCCCCCCCCccceeccCCCCCCcCCCCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCCh
Q psy16054 345 QFFVYDASITASLGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTG 424 (563)
Q Consensus 345 ~Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~ 424 (563)
.||.||||+.. +.||||.+-+ ...+++|||++. ..+| +. .| ...+.
T Consensus 165 ~Gf~Y~Ss~~~---d~Py~~~~~~----------------g~~~~veIP~tl-~t~D-----~~-~g--------~~~~~ 210 (296)
T PRK15394 165 FGFRYNSDCRG---THPFRPLLPD----------------GSLGTVQIPVTL-PTWD-----EV-IG--------RDVKA 210 (296)
T ss_pred cCCeeecCCCC---CCceeeecCC----------------CcCCeEEEcccc-ccch-----hh-hc--------CCCCH
Confidence 99999999965 4566665422 123689999973 3222 21 11 12567
Q ss_pred HHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHhhCCchhhhh
Q psy16054 425 EQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNPTELTSL 499 (563)
Q Consensus 425 ~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~~~p~~~~~~ 499 (563)
+++.+++++.|++ ..+.+ +++|| ++++|.| ++..|++||+++++ .+|||+|++||+.-.++..|+.++
T Consensus 211 ~~f~~~~~d~~~~---~~g~~-V~tiH-P~v~G~~-~~~~fe~lL~~~~~-~gv~F~tl~el~~~~~~~~p~~~~ 278 (296)
T PRK15394 211 EDFNDFILDRILR---DKGTP-VYTIH-AEVEGIA-YAHNFEDLLKRAAQ-EGITFCPLSELLPDDLETLPLGKV 278 (296)
T ss_pred HHHHHHHHHHhcc---ccCce-EEEEC-CCcCCcc-hHHHHHHHHHHHhc-CCcEEeeHHHHHhhchhhCCccce
Confidence 8899999888865 22334 69999 6799998 99999999999997 669999999998844455555444
No 3
>TIGR03006 pepcterm_polyde polysaccharide deactylase family protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide deacetylases (pfam01522). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene. The highest scoring homologs below the trusted cutoff for this model are found in several species of Methanosarcina, an archaeal genus.
Probab=99.92 E-value=2.7e-24 Score=217.01 Aligned_cols=211 Identities=19% Similarity=0.293 Sum_probs=144.4
Q ss_pred CchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHH
Q psy16054 233 NSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRL 310 (563)
Q Consensus 233 ~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~ 310 (563)
++..+++||++ +++||||||+|... .+++|++|.++|||||+||++|..+. .++.+++++||.++++
T Consensus 29 nt~riL~lL~~-------~gikATFFv~g~~~e~~p~lir~i~~~GhEIgsHg~sH~~l~----~ls~ee~~~eI~~s~~ 97 (265)
T TIGR03006 29 NTDRILDLLDR-------HGVKATFFTLGWVAERYPELVRRIVAAGHELASHGYGHERVT----TQTPEAFRADIRRSKA 97 (265)
T ss_pred hHHHHHHHHHH-------cCCcEEEEEeccchhhCHHHHHHHHHcCCEeeeccccCcCch----hCCHHHHHHHHHHHHH
Confidence 46778899974 79999999999875 45999999999999999999999763 6899999999999999
Q ss_pred HHHHHhCCCCCcceEEecCccccChhh--HHHHHHHcCCeEEeecCCCCCCCCccceecc-CCCCCCcCCCCCCCCCC-C
Q psy16054 311 IIERFANISDGSVIGVRAPYLRVGGNK--QFSMMTDQFFVYDASITASLGRVPIWPYTLY-FRMPHKCNGNAHNCPSK-S 386 (563)
Q Consensus 311 ~l~~~~gi~~~~~~G~R~P~l~~~~~~--~~~~L~e~Gf~YDsS~~~~~~~~p~wPytld-~~~p~~c~~~~~pcp~~-~ 386 (563)
+|++++|. .++|||+|++.++..+ .+++|+++||.||||+. |-..| ||+|.. ++.|-+ .
T Consensus 98 ~Le~itG~---~~~gfRaP~~s~~~~t~~a~~iL~e~Gy~YdsS~~---------p~~~d~~g~~~~-----~~~p~~~~ 160 (265)
T TIGR03006 98 LLEDLSGQ---PVRGYRAPSFSIGKKNLWALDVLAEAGYRYSSSIY---------PVRHDHYGMPDA-----PRFPFRPP 160 (265)
T ss_pred HHHHHhCC---CceEEECCCCCCCCCcHHHHHHHHHCCCEEEEeec---------cCcCCcCCCCCC-----CCCceecC
Confidence 99998874 5899999999887532 37999999999999994 11122 454422 233422 2
Q ss_pred C-CcEEEeccccc--cCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCCcceEEeecccccc-CchhHH
Q psy16054 387 H-PTWEMVMNELD--RRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLK-SKKEFK 462 (563)
Q Consensus 387 ~-~lweiP~~~l~--~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~-g~p~r~ 462 (563)
. +|||||++.+. ....|.... | ++ ..-+-.+++++.....+ ..+.|+.+|||+..|. ++| |+
T Consensus 161 ~g~l~e~P~s~~~~~~~~~p~~gg---~--~~-----r~~P~~~~~~~~~~~~~---~~~~~~~~y~Hpwe~dp~~~-~~ 226 (265)
T TIGR03006 161 NGRLLEVPVTTVRLGGRNLPAAGG---G--YF-----RLLPYALSRWALRRVNG---REGRPAIFYFHPWEIDPDQP-RI 226 (265)
T ss_pred CCcEEEEccchhhhcccccccCCC---c--ch-----hcChHHHHHHHHHHHHh---cCCceEEEEEchhhcCCCCC-CC
Confidence 2 39999998643 223333211 0 11 12344444444443332 2357999999966664 222 22
Q ss_pred ---------------HHHHHHHHHHHcCCCeEEEeHHHHH
Q psy16054 463 ---------------DELIKFIEEMLDRNDVYFVTMLQVI 487 (563)
Q Consensus 463 ---------------~al~~fl~~i~~~~dVwfvT~~ei~ 487 (563)
...+.=|+.|+ .|.-|+|+.|++
T Consensus 227 ~~~~~~~~fr~y~~~~~~~~~l~~ll--~~~~~~~~~~~~ 264 (265)
T TIGR03006 227 AGISMKSRFRHYLNLDRMEGRLRRLL--GDFRWGRMDRVF 264 (265)
T ss_pred CCCCccceeeeecCHHHHHHHHHHHH--hcCceeeHHHhh
Confidence 11111134443 578899999885
No 4
>TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB. This model describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reducted sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterized polysaccharide deacetylase; the exact function this protein family is unknown.
Probab=99.91 E-value=3.6e-23 Score=199.80 Aligned_cols=186 Identities=19% Similarity=0.196 Sum_probs=142.2
Q ss_pred CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCC
Q psy16054 217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWS 294 (563)
Q Consensus 217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~ 294 (563)
..+.|+||||||+.......++++|++ +++||||||+|.+. +++++++|.++|||||+|||+|..+.
T Consensus 4 ~~k~V~LTFDDgp~~~~t~~~l~~L~~-------~~ikaTfFv~g~~~~~~~~~~~~i~~~Gheig~Ht~~H~~~~---- 72 (191)
T TIGR02764 4 SDKKIALTFDISWGNDYTEPILDTLKE-------YDVKATFFLSGSWAERHPELVKEIVKDGHEIGSHGYRHKNYT---- 72 (191)
T ss_pred CCCEEEEEEECCCCcccHHHHHHHHHH-------cCCCEEEEeccHHHHHCHHHHHHHHhCCCEEEECCcCCCCcc----
Confidence 346899999999987766778899974 78999999999986 56999999999999999999999763
Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEEeecCCCCCCCCccceeccCCCCCC
Q psy16054 295 GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYDASITASLGRVPIWPYTLYFRMPHK 374 (563)
Q Consensus 295 ~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YDsS~~~~~~~~p~wPytld~~~p~~ 374 (563)
.++.+++++||.++.++|++.+|. .+++||+||+.++. .++++|+++||.+- .|..+ +.|
T Consensus 73 ~~~~~~~~~ei~~~~~~l~~~~g~---~~~~fr~P~G~~~~-~~~~~l~~~G~~~v-~w~~~---------~~D------ 132 (191)
T TIGR02764 73 TLEDEKIKKDILRAQEIIEKLTGK---KPTLFRPPSGAFNK-AVLKAAESLGYTVV-HWSVD---------SRD------ 132 (191)
T ss_pred cCCHHHHHHHHHHHHHHHHHHhCC---CCCEEECCCcCCCH-HHHHHHHHcCCeEE-EecCC---------CCc------
Confidence 689999999999999999998885 47899999999995 79999999999842 11110 000
Q ss_pred cCCCCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCCcceEEeecccc
Q psy16054 375 CNGNAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASW 454 (563)
Q Consensus 375 c~~~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~ 454 (563)
|. ..+++++++.++.+. . ++ -.+.||..
T Consensus 133 ----------------------~~----------------------~~~~~~i~~~~~~~~---~--~g--~Iil~Hd~- 160 (191)
T TIGR02764 133 ----------------------WK----------------------NPGVESIVDRVVKNT---K--PG--DIILLHAS- 160 (191)
T ss_pred ----------------------cC----------------------CCCHHHHHHHHHhcC---C--CC--CEEEEeCC-
Confidence 00 023445555444332 1 12 25778832
Q ss_pred ccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHH
Q psy16054 455 LKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVI 487 (563)
Q Consensus 455 l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~ 487 (563)
...+.-.++|.++|++++++ +.-|||++|++
T Consensus 161 -~~~~~t~~~l~~~i~~l~~~-Gy~~vtl~~l~ 191 (191)
T TIGR02764 161 -DSAKQTVKALPTIIKKLKEK-GYEFVTISELI 191 (191)
T ss_pred -CCcHhHHHHHHHHHHHHHHC-CCEEEEHHHhC
Confidence 12455689999999999965 88999999874
No 5
>TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase. Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores in the peptidoglycan wall, or spore cortex. The proteins in this family are PdaA (yfjS), a member of a larger family of polysaccharide deacetylases, and are specificially involved in delta-lactam biosynthesis. PdaA acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the following transpeptidation for lactam ring formation, as heterologous expression in E. coli of CwlD and PdaA together is sufficient for delta-lactam production.
Probab=99.90 E-value=1.7e-22 Score=199.97 Aligned_cols=119 Identities=13% Similarity=0.214 Sum_probs=105.4
Q ss_pred CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCC
Q psy16054 217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWS 294 (563)
Q Consensus 217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~ 294 (563)
.++.|+||||||+.......++++|++ +++||||||+|.+. +.+++++|+++|||||+|||+|..+.
T Consensus 35 ~~k~VaLTFDDGp~~~~t~~lL~~L~~-------~~vkATFFv~G~~~~~~p~~ir~i~~~GheIgnHt~~H~~~~---- 103 (224)
T TIGR02884 35 SKKVIYLTFDNGYENGYTPKILDVLKE-------KKVPAAFFVTGHYIKTQPDLIKRMVDEGHIVGNHSVHHPSLT---- 103 (224)
T ss_pred CCCEEEEEEECCCCccchHHHHHHHHH-------cCCCeEEEeechhhHHCHHHHHHHHHcCCEeeecCccCcCcc----
Confidence 457899999999987666778899974 78999999999986 56999999999999999999999764
Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeE
Q psy16054 295 GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVY 349 (563)
Q Consensus 295 ~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~Y 349 (563)
.++.+++++||.++.++|++.+|.. .+..||+||+.++. .++++|+++||.+
T Consensus 104 ~ls~~~~~~ei~~~~~~i~~~~G~~--~~~~fR~P~G~~~~-~~~~~l~~~Gy~~ 155 (224)
T TIGR02884 104 AVNDEKFKEELTGVEEEFKKVTGQK--EMKYFRPPRGVFSE-RTLAYTKELGYYT 155 (224)
T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCC--CCCEEeCCCCCcCH-HHHHHHHHcCCcE
Confidence 6899999999999999999988853 27899999999995 6999999999986
No 6
>TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family. Members of this protein family are most closely related to TIGR02764, a subset of polysaccharide deacetylase family proteins found in a species if and only if the species forms endospores like those of Bacillus subtilis or Clostridium tetani. This family is likewise restricted to spore-formers, but is not universal among them in having sequences with full-length matches to the model.
Probab=99.89 E-value=3.8e-22 Score=201.87 Aligned_cols=118 Identities=19% Similarity=0.192 Sum_probs=105.6
Q ss_pred CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCC
Q psy16054 217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWS 294 (563)
Q Consensus 217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~ 294 (563)
..++|+||||||+....+..++++|++ +++||||||+|.+. +++++++|+++|||||+|||+|..+.
T Consensus 83 ~~k~VaLTFDdg~~~~~t~~iL~iLkk-------~~vkATFFv~G~~i~~~p~l~k~i~~~GheIGnHT~sH~~l~---- 151 (268)
T TIGR02873 83 EKPMVALLINVAWGNEYLPEILQILKK-------HDVKATFFLEGKWVKENSQLAKMIVEQGHEIGNHAYNHPDMA---- 151 (268)
T ss_pred CCCEEEEEEeCCCCcchHHHHHHHHHH-------CCCCEEEEeehHhhhHCHHHHHHHHHCCCEEEecCCcCCCcc----
Confidence 447899999999988777788999974 78999999999985 56999999999999999999999874
Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeE
Q psy16054 295 GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVY 349 (563)
Q Consensus 295 ~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~Y 349 (563)
.++.+++++||.++.++|++.+|. .++.||+|||.++. .++++++++||..
T Consensus 152 ~ls~~~~~~Ei~~~~~~i~~~~G~---~p~~fRpP~G~~n~-~~~~~l~~~G~~~ 202 (268)
T TIGR02873 152 TLSKEEIYDQINQTNEIIEATIGV---TPKWFAPPSGSFND-NVVQIAADLQMGT 202 (268)
T ss_pred cCCHHHHHHHHHHHHHHHHHHhCC---CCCEEECCCCCCCH-HHHHHHHHCCCeE
Confidence 589999999999999999998885 57899999999994 7999999999964
No 7
>PF01522 Polysacc_deac_1: Polysaccharide deacetylase; InterPro: IPR002509 This domain is found in polysaccharide deacetylase. This family of polysaccharide deacetylases includes NodB (nodulation protein B from Rhizobium) which is a chitooligosaccharide deacetylase []. It also includes chitin deacetylase from yeast [], and endoxylanases which hydrolyses glucosidic bonds in xylan [].; GO: 0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, 0005975 carbohydrate metabolic process; PDB: 2IW0_A 2CC0_B 2VYO_A 2J13_A 2C71_A 2C79_A 1W1A_1 1W1B_1 1W17_A 1NY1_B ....
Probab=99.88 E-value=6.1e-23 Score=183.13 Aligned_cols=118 Identities=22% Similarity=0.321 Sum_probs=97.5
Q ss_pred CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCC
Q psy16054 217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWS 294 (563)
Q Consensus 217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~ 294 (563)
.+++|+||||||+... +..++++|++ ++++|||||++.++ .++++++|.++|||||+||++|..+ .
T Consensus 4 ~~~~v~ltfDdg~~~~-~~~~~~~l~~-------~~i~at~fv~~~~~~~~~~~l~~l~~~G~ei~~H~~~H~~~----~ 71 (123)
T PF01522_consen 4 PKKSVALTFDDGYRDN-YDRLLPLLKK-------YGIPATFFVIGSWVERYPDQLRELAAAGHEIGNHGWSHPNL----S 71 (123)
T ss_dssp TSSEEEEEEESHCHTH-HHHHHHHHHH-------TT--EEEEE-HHHHHHHHHHHHHHHHTT-EEEEE-SSSSCG----G
T ss_pred CCCEEEEEEecCchhh-HHHHHHHHHh-------cccceeeeecccccccccccchhHHHHHHHHHhcCCccccc----c
Confidence 5689999999999554 5667789874 78999999999986 3489999999999999999999986 3
Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEE
Q psy16054 295 GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYD 350 (563)
Q Consensus 295 ~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YD 350 (563)
.++.+++++||.++++.|++.+|. .+.+||+|++..+. +++++|+++||.||
T Consensus 72 ~~~~~~~~~ei~~~~~~l~~~~g~---~~~~f~~P~g~~~~-~~~~~l~~~G~~y~ 123 (123)
T PF01522_consen 72 TLSPEELRREIERSREILEEITGR---PPKGFRYPFGSYDD-NTLQALREAGYKYD 123 (123)
T ss_dssp GS-HHHHHHHHHHHHHHHHHHHSS---EESEEE-GGGEECH-HHHHHHHHTT-EEE
T ss_pred cCCHHHHHHHHHHHHHHHHHHhCC---CCcEEECCCCCCCH-HHHHHHHHcCCCcC
Confidence 689999999999999999999885 58999999999994 79999999999997
No 8
>COG0726 CDA1 Predicted xylanase/chitin deacetylase [Carbohydrate transport and metabolism]
Probab=99.80 E-value=2.6e-18 Score=172.13 Aligned_cols=115 Identities=23% Similarity=0.304 Sum_probs=103.3
Q ss_pred CceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--CHHHHHHHHHcCCeecccCCCCCCCcCCCCCC
Q psy16054 219 QMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--NYASVQELHRKGHEISVFSLTHKDDPKYWSGG 296 (563)
Q Consensus 219 q~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~ 296 (563)
+.|+||||||+...++..+++||++ +++||||||.|.+. ..++++++.++|||||+||++|..+. .+
T Consensus 65 k~v~lTFDDg~~~~~~~~il~iL~k-------~~i~ATfFv~g~~i~~~p~~~~~~~~~Gheig~H~~~h~~~~----~~ 133 (267)
T COG0726 65 KAVALTFDDGPLDGNTPRILPLLKK-------YGIKATFFVVGSWIERNPDLVKRIAEAGHEIGNHGYDHPDLQ----DL 133 (267)
T ss_pred CeEEEEeecCCCCCCcHHHHHHHHH-------cCCceEEEEehHhhHHCHHHHHHHHhccCeehhCcccCCCcc----cC
Confidence 7899999999998678889999974 78999999999886 45999999999999999999999874 58
Q ss_pred CHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCe
Q psy16054 297 SYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFV 348 (563)
Q Consensus 297 s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~ 348 (563)
+.++.++||.++...|.+.+|.. +++||+|++.++ ..+..++++.|+.
T Consensus 134 ~~~~~~~~i~~~~~~l~~~~g~~---~~~~r~p~g~~~-~~~~~~~~~~g~~ 181 (267)
T COG0726 134 SLEELGAEIARAHDILKKITGGR---PRGFRPPYGSYN-ESSLALARRLGYR 181 (267)
T ss_pred CHHHHHHHHHHHHHHHHHHhCCC---CceEECCCCccC-HHHHHHHHhCCCc
Confidence 89999999999999999987754 899999999998 4689999999985
No 9
>PRK14581 hmsF outer membrane N-deacetylase; Provisional
Probab=99.61 E-value=3.8e-15 Score=166.13 Aligned_cols=126 Identities=10% Similarity=0.093 Sum_probs=100.3
Q ss_pred CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC---------------------CCHHHHHHHHHc
Q psy16054 217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY---------------------TNYASVQELHRK 275 (563)
Q Consensus 217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~---------------------~~y~~v~~i~~~ 275 (563)
+++.|+||||||+.+ ++..+++||++ +++||||||++.+ .+|++|++|.+.
T Consensus 105 P~KaV~LTFDDGy~s-ny~~AlPILKk-------yg~pATfFvVg~wid~~~~~~v~~~~~~~~r~~~LTW~qIrEM~~s 176 (672)
T PRK14581 105 PDKAVLLTFDDGYSS-FYRRVYPLLKA-------YKWSAVLAPVGTWIDTATDKKVDFGGLSTDRDRFATWKQITEMSKS 176 (672)
T ss_pred CCCeEEEEEEcCCcc-hHHHHHHHHHH-------cCCCEEEEEechhhccCcccccccccccccccccCCHHHHHHHHHC
Confidence 468999999999986 57777899974 7999999998754 246799999999
Q ss_pred C-CeecccCCCCCCCcC---------------CCCC----CC----HHHHHHHHHHHHHHHHHHhCCCCCcceEEecCcc
Q psy16054 276 G-HEISVFSLTHKDDPK---------------YWSG----GS----YDDWLAEMAGGRLIIERFANISDGSVIGVRAPYL 331 (563)
Q Consensus 276 G-HEIa~H~~~H~~~~~---------------~~~~----~s----~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l 331 (563)
| ||||+||++|..... |... .+ .+++++||.++++.|++.+|. .++.|++|||
T Consensus 177 GLvEIGSHT~sHh~~~~~np~g~~~pa~~~~~y~~~~~~yEs~~~~~~rl~~Di~~s~~~Ie~~lG~---~p~~FayPyG 253 (672)
T PRK14581 177 GLVEIGAHTYASHYGVIANPQGNTEPAAANLQYDPKTKQYETVEAFKQRMEKDVALITQRIVQATGK---QPRVWVWPYG 253 (672)
T ss_pred CCcEEEcCCcccccccccCccccccchhhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHhCC---CCCEEEcCCC
Confidence 9 699999999863210 0000 11 244677899999999998884 5788999999
Q ss_pred ccChhhHHHHHHHcCCeEEeecC
Q psy16054 332 RVGGNKQFSMMTDQFFVYDASIT 354 (563)
Q Consensus 332 ~~~~~~~~~~L~e~Gf~YDsS~~ 354 (563)
.++. .+.++++++||.+..+..
T Consensus 254 ~yn~-~~~~iak~aGy~~afTt~ 275 (672)
T PRK14581 254 APNG-TVLNILRQHGYQLAMTLD 275 (672)
T ss_pred CcCH-HHHHHHHHCCCcEEEECC
Confidence 9995 799999999999888764
No 10
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=99.60 E-value=6.1e-15 Score=164.73 Aligned_cols=126 Identities=13% Similarity=0.095 Sum_probs=99.9
Q ss_pred CCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC---------------------CHHHHHHHHHc
Q psy16054 217 VPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT---------------------NYASVQELHRK 275 (563)
Q Consensus 217 ~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~---------------------~y~~v~~i~~~ 275 (563)
+++.|+||||||+.+. +..+++||++ +++|||||+++.+. ++++|++|.+.
T Consensus 105 P~K~VaLTFDDGy~s~-yt~A~PILkk-------ygvpATfFlvg~wi~~~~~~~~~~~~~~~~r~~~LtWeqIreM~~s 176 (671)
T PRK14582 105 PEKAVLLTFDDGYSSF-YTRVFPILQA-------FQWPAVWAPVGSWVDTPADQPVKFGGEMVPREYFATWQQVREVARS 176 (671)
T ss_pred CCCeEEEEEEcCCCch-HHHHHHHHHH-------cCCCEEEEEechhhccCccccccccccccccccCCCHHHHHHHHhC
Confidence 4678999999999875 5667899974 79999999987653 34799999999
Q ss_pred C-CeecccCC-CCCCCcC--------------CCC----CCCH----HHHHHHHHHHHHHHHHHhCCCCCcceEEecCcc
Q psy16054 276 G-HEISVFSL-THKDDPK--------------YWS----GGSY----DDWLAEMAGGRLIIERFANISDGSVIGVRAPYL 331 (563)
Q Consensus 276 G-HEIa~H~~-~H~~~~~--------------~~~----~~s~----~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l 331 (563)
| ||||+||+ +|..+.. |.. -.+. +.+++||.++++.|++.+|. .++.||+|||
T Consensus 177 GlvEIGSHT~~sH~~l~anp~g~~~pa~~~r~~~~~~~~yEs~~e~~~ri~~DL~~s~~~Ie~~tG~---~p~~FayPyG 253 (671)
T PRK14582 177 RLVEIASHTWNSHYGIQANPQGSLLPAAVNRAYFTDHARYETAAEYRERIRLDAVKMTEYIRTKAGK---NPRVWVWPYG 253 (671)
T ss_pred CCeEEEcCCchhccccccccccccccchhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHhCC---CCcEEecCCC
Confidence 9 69999998 5974311 000 0122 34678999999999998885 4788999999
Q ss_pred ccChhhHHHHHHHcCCeEEeecC
Q psy16054 332 RVGGNKQFSMMTDQFFVYDASIT 354 (563)
Q Consensus 332 ~~~~~~~~~~L~e~Gf~YDsS~~ 354 (563)
.++. .++++++++||.+.-+..
T Consensus 254 ~~n~-~~~~iakeaGY~~afT~~ 275 (671)
T PRK14582 254 EANG-IALEELKKLGYDMAFTLE 275 (671)
T ss_pred CCCH-HHHHHHHHCCCeEEEEeC
Confidence 9995 799999999999887753
No 11
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=99.02 E-value=1.5e-10 Score=81.07 Aligned_cols=36 Identities=56% Similarity=1.304 Sum_probs=34.3
Q ss_pred CCCCCCcccCCCceecCccccCCCCCCCCCCccccc
Q psy16054 142 VCPEGKLSCGNGECIDKTLFCDDKPDCKDESDENAC 177 (563)
Q Consensus 142 ~C~~~~f~C~~G~Ci~~~~~Cdg~~DC~DgsDE~~C 177 (563)
.|...+|+|.+|.||+..|+|||..||.|||||.+|
T Consensus 2 ~C~~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~C 37 (37)
T PF00057_consen 2 TCPPGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQNC 37 (37)
T ss_dssp SSSTTEEEETTSSEEEGGGTTSSSCSSSSSTTTSSH
T ss_pred cCcCCeeEcCCCCEEChHHcCCCCCCCCCCcccccC
Confidence 588999999999999999999999999999999876
No 12
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.95 E-value=3.5e-10 Score=78.35 Aligned_cols=35 Identities=46% Similarity=1.132 Sum_probs=32.9
Q ss_pred CCCCCcccCCCceecCccccCCCCCCCCCCccccc
Q psy16054 143 CPEGKLSCGNGECIDKTLFCDDKPDCKDESDENAC 177 (563)
Q Consensus 143 C~~~~f~C~~G~Ci~~~~~Cdg~~DC~DgsDE~~C 177 (563)
|.+.+|+|.+|.||+..++|||..||.|||||..|
T Consensus 1 C~~~~f~C~~~~Ci~~~~~CDg~~DC~dgsDE~~C 35 (35)
T cd00112 1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35 (35)
T ss_pred CCCCeEEcCCCCeeCHHHcCCCccCCCCCcccccC
Confidence 56789999999999999999999999999999876
No 13
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.62 E-value=2.1e-08 Score=68.51 Aligned_cols=32 Identities=50% Similarity=1.136 Sum_probs=30.2
Q ss_pred CCCCCcccCCCceecCccccCCCCCCCCCCcc
Q psy16054 143 CPEGKLSCGNGECIDKTLFCDDKPDCKDESDE 174 (563)
Q Consensus 143 C~~~~f~C~~G~Ci~~~~~Cdg~~DC~DgsDE 174 (563)
|...+|+|.+|.||+..++|||..||.|||||
T Consensus 2 C~~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE 33 (33)
T smart00192 2 CPPGEFQCDNGRCIPLSWVCDGVDDCSDGSDE 33 (33)
T ss_pred CCCCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence 66679999999999999999999999999998
No 14
>PF10096 DUF2334: Uncharacterized protein conserved in bacteria (DUF2334); InterPro: IPR018763 This group of proteins has no known function.
Probab=98.17 E-value=6.3e-05 Score=75.56 Aligned_cols=117 Identities=19% Similarity=0.345 Sum_probs=78.0
Q ss_pred EEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC--------------CH-HHHHHHHHcCCeecccCCCC
Q psy16054 222 TITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT--------------NY-ASVQELHRKGHEISVFSLTH 286 (563)
Q Consensus 222 ~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~--------------~y-~~v~~i~~~GHEIa~H~~~H 286 (563)
+|.++|---..+.+.+..|.+ ...+.+|+.+.+|.-.+. .+ +.++.+.++|.+|+.||++|
T Consensus 3 lirleDVsP~~~~~~l~~i~d----~l~~~~ipf~v~vIP~~~d~~~~~~~~l~~~~~f~~~L~~~~~~Gg~I~lHGYtH 78 (243)
T PF10096_consen 3 LIRLEDVSPFSDLEKLKEIAD----YLYKYGIPFSVAVIPVYVDPNGGITVNLSDNPEFVEYLRYLQARGGEIVLHGYTH 78 (243)
T ss_pred EEEEeCCCCCCCHHHHHHHHH----HHHHCCCCEEEEEEecccCCCCcccccchhhHHHHHHHHHHHhcCCEEEEEecce
Confidence 466666322233444433221 123468998888876542 12 47888889999999999999
Q ss_pred CCCc-------CCCC-----CCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCe
Q psy16054 287 KDDP-------KYWS-----GGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFV 348 (563)
Q Consensus 287 ~~~~-------~~~~-----~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~ 348 (563)
.... .+|. ..+.++.++-|..+.++|++. |+ .++||-+|....+. .+++++++ .|.
T Consensus 79 q~~~~~sg~~~ef~~~~~~~~~~~~~a~~ri~~gl~~l~~~-gi---~p~~f~aP~y~~s~-~~~~~~~~-~f~ 146 (243)
T PF10096_consen 79 QYGNSVSGDGFEFWDSEFDAGDSEEEAKERIEKGLEILAEL-GI---YPVGFEAPHYWASP-ETYKVLEK-YFT 146 (243)
T ss_pred ecCCCcccccceeccccccccCCHHHHHHHHHHHHHHHHHC-CC---cccEEECCccccCH-HHHHHHHh-hCc
Confidence 9831 2333 256677777788888888774 66 48999999999995 56666644 444
No 15
>PF01607 CBM_14: Chitin binding Peritrophin-A domain; InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=97.90 E-value=3.7e-06 Score=63.25 Aligned_cols=53 Identities=36% Similarity=0.773 Sum_probs=34.3
Q ss_pred CCCCCCCCceecCCCCCCCCeeeCCCCCCCCccccceeeCCCCccccCCCCCCCCCCCcCCC
Q psy16054 29 CEGRPPEEYFRLTIEGDCRDVVRCDQAGENGITRLAGVRCPNGLAFDIDRQTCDWKTNVKNC 90 (563)
Q Consensus 29 C~~~~~~~~f~~~~~~~C~~~~~C~~~~~~~~~~~~~~~Cp~Gl~~d~~~~~Cd~~~~v~~C 90 (563)
|.++. .++|.. ..+|..||.|.+|. +...+||.|+.||...+.|+++..+..|
T Consensus 1 C~~~~-~~~~~~--~~~C~~Y~~C~~g~------~~~~~C~~g~~fd~~~~~C~~~~~~~~C 53 (53)
T PF01607_consen 1 CPGRG-DGFYPH--PDDCRKYYQCVNGQ------AVEQRCPEGLYFDPSSQRCVPPSNVVQC 53 (53)
T ss_dssp ----S-SEEE----SS-SSEEEEEETTE------EEEEE-TTS-EE-TTTSSEE-TTT-TT-
T ss_pred CCccC-CeeEeC--CCCCCEEEEeeCCc------EECCCCcCCCEECcCcCEEcCCccCCCC
Confidence 44554 456655 67999999999998 8899999999999999999998764444
No 16
>COG3233 Predicted deacetylase [General function prediction only]
Probab=97.74 E-value=0.00028 Score=68.15 Aligned_cols=125 Identities=11% Similarity=0.101 Sum_probs=85.3
Q ss_pred ceEEeecCCcCC-CCchHHHHHhccCCCCCCCCCCceEE-EEeccCC-------CH---HHHHHHHHcCCeecccCCCCC
Q psy16054 220 MITITFNGAVNV-DNSDLYDEIFNGNRQNPNGCQIKGTF-FVSHKYT-------NY---ASVQELHRKGHEISVFSLTHK 287 (563)
Q Consensus 220 ~V~lTFDD~~~~-~~~~~~~~ll~~~r~n~n~~~ikaTF-Fv~~~~~-------~y---~~v~~i~~~GHEIa~H~~~H~ 287 (563)
.|.|-.|-++.- ..+..+..+++.. +....|+ +|.-... +. +.+.+..++|+|++.|||.|.
T Consensus 5 ~iillhdVSpv~~~~~~~i~~~ide~------~~~~~t~lLViPn~~~~~~l~~d~rf~~~l~~r~e~Gdel~lHGy~h~ 78 (233)
T COG3233 5 LIILLHDVSPVYWPTLSNIDAAIDEY------GAQNSTVLLVIPNHANDYPLSKDPRFVDLLTEREEEGDELVLHGYDHI 78 (233)
T ss_pred ceEEEEecCcccchhHHHHHHHHHHh------CCCCceEEEEeeccCCCCCcccChHHHHHHHHHHhcCCEEEEechhhc
Confidence 466666765432 2223344455431 2356676 6654421 22 678888999999999999998
Q ss_pred CCcC--CCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEEeecCC
Q psy16054 288 DDPK--YWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYDASITA 355 (563)
Q Consensus 288 ~~~~--~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YDsS~~~ 355 (563)
+... =+.-+...+-+.-+..+.+.|++ .|++ .+||-+|-+.+|. .++++|-++||+|-+|+..
T Consensus 79 d~~~~gEF~~l~~~eA~~RL~~a~~~l~~-~G~~---~~~FvpPrwlmS~-gt~~aL~~~~frv~a~l~~ 143 (233)
T COG3233 79 DTKRRGEFACLRAHEARLRLMAAIEELEA-LGFP---LRGFVPPRWLMSE-GTRQALLENNFRVTADLRG 143 (233)
T ss_pred cccCccccccchHHHHHHHHHHHHHHHHH-cCCc---ceeecCcceecCH-hHHHHhccCCcEEEecccc
Confidence 7210 01234455556667788888876 4665 7999999999996 6999999999999999853
No 17
>PF09960 DUF2194: Uncharacterized protein conserved in bacteria (DUF2194); InterPro: IPR018695 This family of prokaryotic proteins has no known function; however it may be a membrane protein.
Probab=97.68 E-value=0.0025 Score=71.35 Aligned_cols=196 Identities=14% Similarity=0.072 Sum_probs=122.1
Q ss_pred CCCCCceEEEEeccCC---CH--------------HHHHHHHHcCCeecccCCCCCCCcCCCCC----------C-CHHH
Q psy16054 249 NGCQIKGTFFVSHKYT---NY--------------ASVQELHRKGHEISVFSLTHKDDPKYWSG----------G-SYDD 300 (563)
Q Consensus 249 n~~~ikaTFFv~~~~~---~y--------------~~v~~i~~~GHEIa~H~~~H~~~~~~~~~----------~-s~~~ 300 (563)
+++++|-|.++...|. ++ -..++|.+.|||||.|||.|+.+.. .+ | ++++
T Consensus 297 ~~y~ikyTg~~I~~Y~~~~~~pf~~~~~~~~~~~~~~g~~Ll~~ggElG~HGYNHqpL~~--~~~~~~~~~Y~~W~~~~~ 374 (585)
T PF09960_consen 297 KKYGIKYTGVMIENYNDQTNPPFEFLEQEDTDRFIYFGRELLKSGGELGLHGYNHQPLTL--EGDYGDEYGYKPWPSKED 374 (585)
T ss_pred HHcCCeEEEEEccccCCCCCCCcccccccchhhHHHHHHHHHhcCCceEEecccCCCCcC--CCcccccccCcCCCCHHH
Confidence 4689999999998763 11 2678899999999999999998752 22 3 5555
Q ss_pred HHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHc--CCeEEeecCCCCCCCCccceeccCCCCCCcCCC
Q psy16054 301 WLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQ--FFVYDASITASLGRVPIWPYTLYFRMPHKCNGN 378 (563)
Q Consensus 301 ~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~--Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~~ 378 (563)
+.+-|...++.++++ .+...++.+=+|.=-.+. .-+++|++. -+.+-+|+...... --+|..+|+..
T Consensus 375 m~~sl~~l~~f~~~l--~p~~~~~~YVPPSNils~-eG~e~L~~~~P~ik~IaS~Y~~~~~--~~~y~QEF~~~------ 443 (585)
T PF09960_consen 375 MAESLKELKRFVKSL--FPNYEPSSYVPPSNILSE-EGREALKKAFPEIKTIASLYFGDDE--EGEYVQEFEIA------ 443 (585)
T ss_pred HHHHHHHHHHHHHHh--CCCcceEEecCCccccCH-HHHHHHHHhCCCeEEEEEeeecCCc--CCcceEEeeEC------
Confidence 555555555555553 344567778888877774 467777774 35677776542111 11455555431
Q ss_pred CCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCc
Q psy16054 379 AHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSK 458 (563)
Q Consensus 379 ~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~ 458 (563)
.-+++|+|=.. .|. . .+ + .++...+..+-. -..+.-.+||.-+-..
T Consensus 444 -------~dgi~e~PRis-------------SG~--~------~~--~--~~~~~~~s~l~~--~Gv~sHFiHPDDvld~ 489 (585)
T PF09960_consen 444 -------PDGIVEFPRIS-------------SGY--Y------PD--D--YMLWAIVSALNN--HGVFSHFIHPDDVLDE 489 (585)
T ss_pred -------CCCeEEeCccc-------------cCC--c------cC--h--HHHHHHHHHHHh--heeeecccCCcccccc
Confidence 12588888321 121 0 11 1 122333333221 2356777887554311
Q ss_pred --------hhHHHHHHHHHHHHHc-CCCeEEEeHHHHHHHhh
Q psy16054 459 --------KEFKDELIKFIEEMLD-RNDVYFVTMLQVIQWMQ 491 (563)
Q Consensus 459 --------p~r~~al~~fl~~i~~-~~dVwfvT~~ei~~w~~ 491 (563)
..+.+.|++||++|.+ .|-...-|.+|++.-++
T Consensus 490 dR~~~~~W~~l~~~l~~~~~~i~~~~P~Lr~~T~Se~~~~~~ 531 (585)
T PF09960_consen 490 DRNAGKGWEELYKSLDKFMSWIKESYPWLRNLTASEAAGALQ 531 (585)
T ss_pred ccCCCCCHHHHHHHHHHHHHHHHHhCCCcccCcHHHHHHHHH
Confidence 2367899999999986 68899999999987554
No 18
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=97.50 E-value=4.9e-05 Score=52.60 Aligned_cols=33 Identities=15% Similarity=0.409 Sum_probs=28.4
Q ss_pred cCcccCCCCceecCCCCCcCCCcCCCCCCccCCCCCc
Q psy16054 96 LKQISCPSGLAFDVDKQTCDWKAKVTNCNQLDRPRKV 132 (563)
Q Consensus 96 ~~~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde~~~ 132 (563)
+.+|+|.+|.|| +..++|| |+.||.|+|||..|
T Consensus 3 ~~~f~C~~~~Ci-~~~~~CD---g~~DC~dgsDE~~C 35 (35)
T cd00112 3 PNEFRCANGRCI-PSSWVCD---GEDDCGDGSDEENC 35 (35)
T ss_pred CCeEEcCCCCee-CHHHcCC---CccCCCCCcccccC
Confidence 358999999999 6667899 89999999999743
No 19
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=97.49 E-value=9e-05 Score=69.78 Aligned_cols=67 Identities=27% Similarity=0.416 Sum_probs=54.3
Q ss_pred CcccCCCCcee-cCCCCCcCCCcCCCCCCccCCCCCccCcCCCCCCCCCCCCcccCCC----ceecCccccCCCCC---C
Q psy16054 97 KQISCPSGLAF-DVDKQTCDWKAKVTNCNQLDRPRKVLPILKTDEPVCPEGKLSCGNG----ECIDKTLFCDDKPD---C 168 (563)
Q Consensus 97 ~~~~C~~G~~~-~~~~~~CD~~~~v~dC~d~sde~~~~p~~~~~~~~C~~~~f~C~~G----~Ci~~~~~Cdg~~D---C 168 (563)
..|.|-+|.-+ .+-.++.| +.=||.|||||.. ...|..+.|.|.|. .-|+..++-||.=| |
T Consensus 36 ~~f~Cl~~~~~~I~~~~iND---dyCDC~DGSDEPG--------TsAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~C 104 (176)
T PF12999_consen 36 GKFTCLDGSKIVIPFSQIND---DYCDCPDGSDEPG--------TSACSNGKFYCENKGHIPRYIPSSRVNDGICDYDIC 104 (176)
T ss_pred CceEecCCCCceecHHHccC---cceeCCCCCCccc--------cccCcCceEeeccCCCCCceeehhhhcCCcCccccc
Confidence 35888777543 46667788 6778999999962 23698899999874 68999999999999 9
Q ss_pred CCCCcc
Q psy16054 169 KDESDE 174 (563)
Q Consensus 169 ~DgsDE 174 (563)
=|||||
T Consensus 105 CDGSDE 110 (176)
T PF12999_consen 105 CDGSDE 110 (176)
T ss_pred CCCCCC
Confidence 999999
No 20
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=97.48 E-value=5e-05 Score=53.24 Aligned_cols=33 Identities=15% Similarity=0.426 Sum_probs=28.6
Q ss_pred cCcccCCCCceecCCCCCcCCCcCCCCCCccCCCCCc
Q psy16054 96 LKQISCPSGLAFDVDKQTCDWKAKVTNCNQLDRPRKV 132 (563)
Q Consensus 96 ~~~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde~~~ 132 (563)
..+|+|.+|.|| +..++|| ++.||.|+|||.+|
T Consensus 5 ~~~f~C~~~~CI-~~~~~CD---g~~DC~dgsDE~~C 37 (37)
T PF00057_consen 5 PGEFRCGNGQCI-PKSWVCD---GIPDCPDGSDEQNC 37 (37)
T ss_dssp TTEEEETTSSEE-EGGGTTS---SSCSSSSSTTTSSH
T ss_pred CCeeEcCCCCEE-ChHHcCC---CCCCCCCCcccccC
Confidence 568999999999 5567899 89999999999754
No 21
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=97.25 E-value=0.00037 Score=52.56 Aligned_cols=50 Identities=36% Similarity=0.774 Sum_probs=41.2
Q ss_pred cCCCCCCCCceecCCCCCCCCeeeCCCCCCCCccccceeeCCCCccccCCCCCCCCCCC
Q psy16054 28 LCEGRPPEEYFRLTIEGDCRDVVRCDQAGENGITRLAGVRCPNGLAFDIDRQTCDWKTN 86 (563)
Q Consensus 28 ~C~~~~~~~~f~~~~~~~C~~~~~C~~~~~~~~~~~~~~~Cp~Gl~~d~~~~~Cd~~~~ 86 (563)
.|..+. .++|..+ .+|..|+.|..+. ...+.||.|+.|+...+.|++++.
T Consensus 2 ~C~~~~-~g~~~~~--~~C~~y~~C~~~~------~~~~~C~~g~~f~~~~~~C~~~~~ 51 (56)
T smart00494 2 ECPGRG-DGLYPHP--TDCSKYYQCSNGR------PIVGSCPAGLVFDPATQTCDWPDN 51 (56)
T ss_pred cCCCCC-CccccCc--ccCCeeEEcCCCc------EEeccCcCCCeECCCCCCcCCccc
Confidence 466633 5676654 4899999999997 889999999999999999998864
No 22
>KOG1215|consensus
Probab=97.06 E-value=0.0015 Score=77.56 Aligned_cols=75 Identities=23% Similarity=0.626 Sum_probs=62.3
Q ss_pred cCcccCCC-CceecCCCCCcCCCcCCCCCCccCCCC--CccCcCCCCCCCCCCCCcccCCCceecCccccCCCCCCCCCC
Q psy16054 96 LKQISCPS-GLAFDVDKQTCDWKAKVTNCNQLDRPR--KVLPILKTDEPVCPEGKLSCGNGECIDKTLFCDDKPDCKDES 172 (563)
Q Consensus 96 ~~~~~C~~-G~~~~~~~~~CD~~~~v~dC~d~sde~--~~~p~~~~~~~~C~~~~f~C~~G~Ci~~~~~Cdg~~DC~Dgs 172 (563)
...++|.. +.|+ ..++.|| +..+|.+++||. .+ ....|...++.|.++.|++..++|||..||.||+
T Consensus 220 ~~~~~c~g~~~~i-~~~~~~D---g~~dc~~~~de~~~~~------~~~~~~~~e~~~~~~~~~~~~~~~~g~~d~pdg~ 289 (877)
T KOG1215|consen 220 IEVTRCDGSSRCI-LISEVCD---GPRDCVDGPDEGVMNC------SDATCEAPEIECADGDCSDRQKLCDGDLDCPDGL 289 (877)
T ss_pred eeEEEecCCCcEE-eehhccC---CCcccccCCcCceeEe------eccccCCcceeecCCCCccceEEecCccCCCCcc
Confidence 35788877 4888 5666799 888999999995 32 2236888899999999999999999999999999
Q ss_pred cccccCCC
Q psy16054 173 DENACSVE 180 (563)
Q Consensus 173 DE~~C~~~ 180 (563)
||..|...
T Consensus 290 de~~~~~~ 297 (877)
T KOG1215|consen 290 DEDYCKKK 297 (877)
T ss_pred cccccccc
Confidence 99988743
No 23
>KOG1215|consensus
Probab=97.02 E-value=0.0012 Score=78.29 Aligned_cols=118 Identities=21% Similarity=0.381 Sum_probs=78.7
Q ss_pred cCcccCC--CCceecCCCCCcCCCcCCCCCCccCCCCCccCcCCCCC-------------------------------CC
Q psy16054 96 LKQISCP--SGLAFDVDKQTCDWKAKVTNCNQLDRPRKVLPILKTDE-------------------------------PV 142 (563)
Q Consensus 96 ~~~~~C~--~G~~~~~~~~~CD~~~~v~dC~d~sde~~~~p~~~~~~-------------------------------~~ 142 (563)
...|+|. ++.|+ +..|.|| +..+|.+++||..|.... ... +.
T Consensus 140 ~~~~~c~~~~~~Ci-p~~~~cd---~~~~C~dg~de~~~~~~~-~~~~~~~~~~~~~~~~~d~~~~~~~~~d~~~~~~~~ 214 (877)
T KOG1215|consen 140 LDKFSCRTGSCKCI-PGDWLCD---GEADCPDGSDELNCAVRR-CEPRGASLDCIVAIKVCDIQHDCADDYDESEGRIYW 214 (877)
T ss_pred CCCCCCcCccccCC-CCceeCC---CCCccccchhhhcccccc-cCccccccccceeeeecCcccccccccccccCcccc
Confidence 5688998 78899 6667799 777999999998775110 000 00
Q ss_pred CC---CCCcccCC-CceecCccccCCCCCCCCCCccc--ccCCCCCCCCCCCCCCCCCCCCCcccCCCCCcccCCCCCCC
Q psy16054 143 CP---EGKLSCGN-GECIDKTLFCDDKPDCKDESDEN--ACSVEQDPNRAPDCDPTQCALPDCFCSADGTRIPGGIEPNQ 216 (563)
Q Consensus 143 C~---~~~f~C~~-G~Ci~~~~~Cdg~~DC~DgsDE~--~C~~~~~p~~~~~C~~~~C~~p~~~c~~~g~ciP~~~~~~~ 216 (563)
+. ...++|.. ++||...+.|||..||.|++||. .|...........|....|.--..+|...-.| |.++....
T Consensus 215 ~~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc~~~~de~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~g~~d~-pdg~de~~ 293 (877)
T KOG1215|consen 215 TDDSRIEVTRCDGSSRCILISEVCDGPRDCVDGPDEGVMNCSDATCEAPEIECADGDCSDRQKLCDGDLDC-PDGLDEDY 293 (877)
T ss_pred cCCcceeEEEecCCCcEEeehhccCCCcccccCCcCceeEeeccccCCcceeecCCCCccceEEecCccCC-CCcccccc
Confidence 11 35788876 49999999999999999999994 56554333344456666665444556655556 65555544
Q ss_pred CCC
Q psy16054 217 VPQ 219 (563)
Q Consensus 217 ~pq 219 (563)
.++
T Consensus 294 ~~~ 296 (877)
T KOG1215|consen 294 CKK 296 (877)
T ss_pred ccc
Confidence 443
No 24
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=96.83 E-value=0.00066 Score=46.28 Aligned_cols=29 Identities=21% Similarity=0.464 Sum_probs=25.8
Q ss_pred CcccCCCCceecCCCCCcCCCcCCCCCCccCCC
Q psy16054 97 KQISCPSGLAFDVDKQTCDWKAKVTNCNQLDRP 129 (563)
Q Consensus 97 ~~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde 129 (563)
.+|+|.+|.|| +..++|| |+.||.|+|||
T Consensus 5 ~~f~C~~~~Ci-~~~~~Cd---g~~dC~dgsDE 33 (33)
T smart00192 5 GEFQCDNGRCI-PLSWVCD---GVDDCSDGSDE 33 (33)
T ss_pred CeEECCCCCEE-CchhhCC---CcCcCcCCCCC
Confidence 48999999999 6677899 89999999997
No 25
>PF04748 Polysacc_deac_2: Divergent polysaccharide deacetylase; InterPro: IPR006837 This is a family of uncharacterised proteins that includes YibQ.; PDB: 2QV5_A 2NLY_A.
Probab=96.38 E-value=0.024 Score=55.88 Aligned_cols=182 Identities=13% Similarity=0.093 Sum_probs=89.1
Q ss_pred CCCceEEEEeccCCCH-HHHHHHHHcCCeecccCC------CCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcc
Q psy16054 251 CQIKGTFFVSHKYTNY-ASVQELHRKGHEISVFSL------THKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSV 323 (563)
Q Consensus 251 ~~ikaTFFv~~~~~~y-~~v~~i~~~GHEIa~H~~------~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~ 323 (563)
-+.+.||-|.-...+- +.++..+++||||-.|=- .+.........++.++++.-+..+. .+ +| ..
T Consensus 19 Lp~pvT~ai~P~~~~~~~~a~~a~~~G~EvllhlPMep~~~~~~gp~~L~~~~~~~~i~~~l~~al---~~---vp--~a 90 (213)
T PF04748_consen 19 LPFPVTFAILPYAPYSREWAERARAAGHEVLLHLPMEPKGYKDPGPGALLTGMSEEEIRKRLEAAL---AR---VP--GA 90 (213)
T ss_dssp SSTTCEEEEETTSTTHHHHHHHHHHCT-EEEEEEEE--TTTT---TT-B-TTS-HHHHHHHHHHHH---CC---ST--T-
T ss_pred CCCCeEEEECCCCCChHHHHHHHHHcCCEEEEeCCCCCCCCCCcccccccCCCCHHHHHHHHHHHH---HH---CC--Cc
Confidence 4789999987776655 778888899999998842 1111111113467666655444433 32 22 24
Q ss_pred eEEecCccc---cCh---hhHHHHHHHcCCeEEeecCCCCCCCCccceeccCCCCCCcCCCCCCCCCCCCCcEEEecccc
Q psy16054 324 IGVRAPYLR---VGG---NKQFSMMTDQFFVYDASITASLGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNEL 397 (563)
Q Consensus 324 ~G~R~P~l~---~~~---~~~~~~L~e~Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~~~~pcp~~~~~lweiP~~~l 397 (563)
+|.-...++ .+. ..+++.|++.|+-|--|.+.+.+ -.+-....+| +|...
T Consensus 91 ~GvnNhmGS~~T~~~~~m~~vl~~l~~~gl~FvDS~T~~~s--~a~~~A~~~g---------------------vp~~~- 146 (213)
T PF04748_consen 91 VGVNNHMGSRFTSDREAMRWVLEVLKERGLFFVDSRTTPRS--VAPQVAKELG---------------------VPAAR- 146 (213)
T ss_dssp SEEEEEE-CCHHC-HHHHHHHHHHHHHTT-EEEE-S--TT---SHHHHHHHCT-----------------------EEE-
T ss_pred EEEecCCCccccCCHHHHHHHHHHHHHcCCEEEeCCCCccc--HHHHHHHHcC---------------------CCEEe-
Confidence 555443332 111 24688999999888755544211 0000001111 12110
Q ss_pred ccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCC
Q psy16054 398 DRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRND 477 (563)
Q Consensus 398 ~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~d 477 (563)
+ +. ++| +..+.+.+.+.|......... ++ ....--| ++ |.-+++|++++..+. ..+
T Consensus 147 -r-dv-----------fLD---~~~~~~~I~~ql~~~~~~A~~-~G-~aI~Igh-~~----p~Tl~~L~~~~~~l~-~~g 202 (213)
T PF04748_consen 147 -R-DV-----------FLD---NDQDEAAIRRQLDQAARIARK-QG-SAIAIGH-PR----PETLEALEEWLPELE-AQG 202 (213)
T ss_dssp ---SE-----------ETT---ST-SHHHHHHHHHHHHHHHHC-CS-EEEEEEE--S----CCHHHHHHHHHHHHH-HCT
T ss_pred -e-ce-----------ecC---CCCCHHHHHHHHHHHHHhhhh-cC-cEEEEEc-CC----HHHHHHHHHHHhHHh-hCC
Confidence 0 00 112 123455555555544433333 22 2222336 42 668999999999998 568
Q ss_pred eEEEeHHHHHH
Q psy16054 478 VYFVTMLQVIQ 488 (563)
Q Consensus 478 VwfvT~~ei~~ 488 (563)
|-+|+.++++.
T Consensus 203 i~lVpvS~llr 213 (213)
T PF04748_consen 203 IELVPVSALLR 213 (213)
T ss_dssp EEE--HHHH--
T ss_pred EEEEEhHHhhC
Confidence 99999999863
No 26
>PF03065 Glyco_hydro_57: Glycosyl hydrolase family 57; InterPro: IPR004300 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 57 GH57 from CAZY comprises enzymes with two known activities; alpha-amylase (3.2.1.1 from EC) and 4-alpha-glucanotransferase (2.4.1 from EC).; GO: 0003824 catalytic activity, 0005975 carbohydrate metabolic process; PDB: 1K1X_B 1K1W_A 1K1Y_A 3N98_A 3N8T_A 3N92_A 1UFA_A 3P0B_A.
Probab=95.74 E-value=0.023 Score=60.45 Aligned_cols=195 Identities=16% Similarity=0.141 Sum_probs=110.9
Q ss_pred HHHHHHHHHcCC-eecccCCCCCCCcCCCCCCCH-HHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHH
Q psy16054 266 YASVQELHRKGH-EISVFSLTHKDDPKYWSGGSY-DDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMT 343 (563)
Q Consensus 266 y~~v~~i~~~GH-EIa~H~~~H~~~~~~~~~~s~-~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~ 343 (563)
-+.++++.++|+ ||-.-+++|.-++. ++. ++.+.+|...++.+++..|. .|+||..|-..++. .+.++|+
T Consensus 113 i~~~~~l~~~G~iEll~~~~~h~ilpl----~~~~~d~~~Qi~~~~~~~~~~FG~---~P~G~W~pE~a~~~-~l~~~l~ 184 (360)
T PF03065_consen 113 IEAFRELAERGQIELLTSPYYHPILPL----LPDPEDFRAQIEMGREYFKKHFGR---RPRGFWLPECAYSP-GLEEILA 184 (360)
T ss_dssp HHHHHHHHHTTSEEEEEE-TT-B-GGG----SSHHHHHHHHHHHHHHHHHHHHSS-----SBEE-GGG-B-T-THHHHHH
T ss_pred HHHHHHHHHCCCEEEEeCchHHHHHHH----hCCHHHHHHHHHHHHHHHHHHhCC---CCCceECcccccCH-HHHHHHH
Confidence 468899999997 99999999998863 566 99999999999999999994 58999999999995 6999999
Q ss_pred HcCCeEEeecCCC------CCCCCccceeccCCCCCCcCCCCCCCCCCCCCcEEEeccc-cccCCCCCCCCCCCCccccC
Q psy16054 344 DQFFVYDASITAS------LGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNE-LDRRDDPTFDESLPGCHMVD 416 (563)
Q Consensus 344 e~Gf~YDsS~~~~------~~~~p~wPytld~~~p~~c~~~~~pcp~~~~~lweiP~~~-l~~~d~p~~~~~~~g~~~~d 416 (563)
++||.|--.-... ....++.||.... ....+|..++.+. +.+ ...|.. .|.+
T Consensus 185 ~~Gi~~~i~d~~~~~~~~~~~~~~y~py~~~~--------------~~~~~l~vf~rd~~ls~--~I~f~~--~g~~--- 243 (360)
T PF03065_consen 185 EAGIRYTILDGHQLLGAGSPPNNPYRPYRVED--------------QGGSGLAVFFRDQELSD--DIWFRY--SGYP--- 243 (360)
T ss_dssp HTT--EEEEECHHHHHHT--CGGCSSEEEECE---------------TTCSEEEEEB-HHHHH--HHHTTT--TSGG---
T ss_pred HcCCEEEEECcHHhCCCCCccCCCCceeEeec--------------CCCceEEEEecCHHHHH--hcCCcc--cccc---
Confidence 9999987543211 0123455664321 0122377777653 211 111211 1111
Q ss_pred CCCCCCChHHHHHHHHHHHHHHHccCCc-ceEEeeccccc-----cCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHh
Q psy16054 417 SCSNIQTGEQFGRLLRHNFNRHFSTNRA-PLGLHFHASWL-----KSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWM 490 (563)
Q Consensus 417 ~~~~~~~~~~~~~~l~~~F~~~y~~nra-p~~l~lH~~~l-----~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~ 490 (563)
...+..++++.+...+......... .+.+.+-..-+ ......++.|.+.+..+.+.+.|-++|.+|+++=+
T Consensus 244 ---~~~~~~~~~~~l~~~~~~~~~~~~~~~v~i~~DgE~~~~~~~~~~~~~l~~l~~~~~~~~~~~~i~~~t~se~l~~~ 320 (360)
T PF03065_consen 244 ---GDPAYREFYRYLLYEYASWISYDPGPVVVIALDGENFWGHYWDEGEGFLEFLYQLLEELHSRGFIETVTPSEYLERH 320 (360)
T ss_dssp ---GSTTSBHSSCHHC----HC-BS-TT-EEEECCEGGGCCCCCCCCTCEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ---cccCHHHHHHhhhhhhhhhhccCCCCeEEEeECCcccCCcCCcccCCHHHHHHHHHHHhhcCCeEEecCHHHHHhhC
Confidence 1223445555444113333332222 23343321111 22345777777777777766668899999998865
Q ss_pred hC
Q psy16054 491 QN 492 (563)
Q Consensus 491 ~~ 492 (563)
..
T Consensus 321 ~~ 322 (360)
T PF03065_consen 321 PP 322 (360)
T ss_dssp ST
T ss_pred CC
Confidence 33
No 27
>PF11959 DUF3473: Domain of unknown function (DUF3473); InterPro: IPR022560 This domain, found in bacteria and archaea, is functionally uncharacterised. It is about 130 amino acids in length and is found C-terminal to PF01522 from PFAM. It contains two completely conserved residues (P and H) that may be functionally important.
Probab=94.81 E-value=0.13 Score=46.82 Aligned_cols=55 Identities=24% Similarity=0.299 Sum_probs=31.3
Q ss_pred HHHHHHHHccCCcceEEeecccccc-CchhH--------------HHHHHHHHHHHHcCCCeEEEeHHHHHH
Q psy16054 432 RHNFNRHFSTNRAPLGLHFHASWLK-SKKEF--------------KDELIKFIEEMLDRNDVYFVTMLQVIQ 488 (563)
Q Consensus 432 ~~~F~~~y~~nrap~~l~lH~~~l~-g~p~r--------------~~al~~fl~~i~~~~dVwfvT~~ei~~ 488 (563)
+..+.+....++.|+++|||+..|. .+|.. ++.+++=|+.|. .|.-|+|+.|+++
T Consensus 62 ~~~~~~~~~~~~~~~~~YfHPwE~dp~qpr~~~~~~~~rf~~y~~l~~~~~rl~~Ll--~~f~f~t~~~~~~ 131 (133)
T PF11959_consen 62 RWLIRRINRRGGQPAVFYFHPWEFDPDQPRVPGAPLKSRFRHYNNLDRMEKRLDRLL--SDFRFGTMREVLA 131 (133)
T ss_pred HHHHHHHHhCCCcceEEEEeceecCCCCcccCCCCcceeEEEEcCHHHHHHHHHHHH--hhCceEEHHHHhh
Confidence 3333333333447999999966663 22210 133333344443 5788999999975
No 28
>COG1449 Alpha-amylase/alpha-mannosidase [Carbohydrate transport and metabolism]
Probab=94.21 E-value=0.83 Score=51.99 Aligned_cols=80 Identities=16% Similarity=0.241 Sum_probs=65.6
Q ss_pred HHHHHHHHcCC-eecccCCCCCCCcCCCCCC-CHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHH
Q psy16054 267 ASVQELHRKGH-EISVFSLTHKDDPKYWSGG-SYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTD 344 (563)
Q Consensus 267 ~~v~~i~~~GH-EIa~H~~~H~~~~~~~~~~-s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e 344 (563)
++.|++.+.|+ |+-.+++.|.-.+-+ .+. -.+++.+++...+++..++.|. .++|||+|.+.++ +.+++++++
T Consensus 183 ~~~r~l~e~g~~e~~~~pyyHsl~pl~-~d~gw~~d~~~qv~~~~~~~~elfG~---~p~~~~~tEl~y~-~~i~~~~~e 257 (615)
T COG1449 183 EAFRELAESGKVELTASPYYHSLIPLL-ADDGWYEDFKEQVMMSRELYKELFGV---WPSGFWNTELAYN-DQILEYFEE 257 (615)
T ss_pred HHHHHHHhcCceEEEecccccccchhc-ccCCchHHHHHHHHHHHHHHHHHhCC---CCccccChhhhcc-HHHHHHHHH
Confidence 57888888886 999999999876532 112 1347888999999999999996 5899999999999 579999999
Q ss_pred cCCeEEe
Q psy16054 345 QFFVYDA 351 (563)
Q Consensus 345 ~Gf~YDs 351 (563)
.||.|-.
T Consensus 258 ~G~~~~~ 264 (615)
T COG1449 258 SGFSWIA 264 (615)
T ss_pred cCCcccc
Confidence 9999743
No 29
>PF15421 Polysacc_deac_3: Putative polysaccharide deacetylase; PDB: 4DWE_A.
Probab=92.48 E-value=3.3 Score=43.83 Aligned_cols=173 Identities=16% Similarity=0.300 Sum_probs=75.0
Q ss_pred CHHHHHHHHHcCCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHH-HHHHHhCCCCCcceEEecCccccChhhHHHHHH
Q psy16054 265 NYASVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRL-IIERFANISDGSVIGVRAPYLRVGGNKQFSMMT 343 (563)
Q Consensus 265 ~y~~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~-~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~ 343 (563)
.|+.|++|.+-|.+||.|.+.+... ..+.+.++.+..-+.. +++++.|.. .+-+..| +++...- .+
T Consensus 131 ~W~evkem~nyG~~ia~Hdv~~~~~-----~~t~~~~~~~~~i~~~~i~~~l~GR~---~K~l~eP----nGn~~Yi-~a 197 (425)
T PF15421_consen 131 SWDEVKEMLNYGTGIAFHDVNTYDV-----DTTDDQILAHFCIAQNSIFKNLSGRG---CKMLAEP----NGNKKYI-TA 197 (425)
T ss_dssp -HHHHHHHHTTT-EEEB---SS-GG-----G-SHHHHHHHHHHHHHHHHHHTTT-------EEE------TT-HHHH-HH
T ss_pred cHHHHHHHHhcCcceEecccccccc-----cccchhhhhhhhhHHHHHHHHhcCcc---cceeeCC----CCCHHHH-HH
Confidence 5689999999999999999988764 2354555555443333 556666633 3334445 3332221 12
Q ss_pred HcCCeEEeecCCCCCCCCccceeccCCCCCCcCCCCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCC
Q psy16054 344 DQFFVYDASITASLGRVPIWPYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQT 423 (563)
Q Consensus 344 e~Gf~YDsS~~~~~~~~p~wPytld~~~p~~c~~~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~ 423 (563)
.+++.=--++.+..|...++|+.-+..+ .=..++ + .| ++ ...
T Consensus 198 a~~~d~Iqtitaq~G~~kv~pf~~~~dL----------------~k~~~~-----R----~F---------~~----~~~ 239 (425)
T PF15421_consen 198 ALVYDPIQTITAQSGAIKVYPFKVNGDL----------------EKVVIE-----R----GF---------YD----SPD 239 (425)
T ss_dssp HTTSTT--EEEESSS-B---TTT--S------------------TTEEEE----------B------------------T
T ss_pred HHcCCchhhhHhhcCCcceeccccCCch----------------hhheee-----e----ee---------cC----Ccc
Confidence 2233222345555555566776533211 001111 1 01 11 122
Q ss_pred hHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHc------CCCeEEEeHHHHHH--HhhCCch
Q psy16054 424 GEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLD------RNDVYFVTMLQVIQ--WMQNPTE 495 (563)
Q Consensus 424 ~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~------~~dVwfvT~~ei~~--w~~~p~~ 495 (563)
.+++.+.|.+.....- .+|.-..|+.|- +...+.. ||.||.. .+.|||.+.+|.-+ |||.-..
T Consensus 240 ~~~~k~~I~~~l~~p~-e~r~ai~iG~h~-----td~~W~~---fl~wLn~~YG~~G~D~lWfp~~eEyyEY~Y~R~~~~ 310 (425)
T PF15421_consen 240 NDQYKQDIEDQLKNPV-EEREAINIGVHG-----TDSNWED---FLLWLNDTYGKDGDDSLWFPSQEEYYEYNYYRMHTK 310 (425)
T ss_dssp HHHHHHHHHHHHTS-G-GG---EEEEESS-------HHHHH---HHHHHHHHTSTTTT--EEEE-HHHHHHHHHHHHH--
T ss_pred hHHHHHHHHHHhccch-hchhheeecccC-----CCCcHHH---HHHHHHHHhCcCCCcccccCCHHHHHHHHHHhhccc
Confidence 4556666666554333 346556899992 2334443 7777765 46699999999998 5555444
Q ss_pred hh
Q psy16054 496 LT 497 (563)
Q Consensus 496 ~~ 497 (563)
++
T Consensus 311 i~ 312 (425)
T PF15421_consen 311 IE 312 (425)
T ss_dssp -E
T ss_pred ee
Confidence 43
No 30
>COG2861 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=92.42 E-value=4.4 Score=40.33 Aligned_cols=212 Identities=13% Similarity=0.146 Sum_probs=109.4
Q ss_pred CCCCCCCCceEEeecC-CcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC-CHHHHHHHHHcCCeecccCC----C
Q psy16054 212 IEPNQVPQMITITFNG-AVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT-NYASVQELHRKGHEISVFSL----T 285 (563)
Q Consensus 212 ~~~~~~pq~V~lTFDD-~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~-~y~~v~~i~~~GHEIa~H~~----~ 285 (563)
|.....|+ ++|-.|| |++....+. |++ -+.+.|.-+..... .-+..++++++||||-.|-- +
T Consensus 23 ~~~~~~~k-laIVIddlG~~~~~~~~---i~~--------Lp~eVTlA~~P~~~~~~~~~~~A~~~G~evlih~PmeP~~ 90 (250)
T COG2861 23 WSGARGPK-LAIVIDDLGLSQTGTQA---ILA--------LPPEVTLAFAPFAPHAREWAQKARNAGHEVLIHMPMEPFS 90 (250)
T ss_pred CCcCCCce-EEEEECCccccHHHHHH---HHh--------CCccceEEecCCCchhHHHHHHHHhcCCEEEEeccCCccc
Confidence 33344454 6777888 666554443 333 14555654433332 23788999999999999963 2
Q ss_pred CCCCc--CCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEEeecCCCCCCCCcc
Q psy16054 286 HKDDP--KYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYDASITASLGRVPIW 363 (563)
Q Consensus 286 H~~~~--~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YDsS~~~~~~~~p~w 363 (563)
+.... ..-..++.++..+.|..+...+..+.|+. .-.|-|.-.-...=+..++.|++.|+-|--|.+...
T Consensus 91 ~~~~e~gtL~~~~s~~e~~~rl~~a~~~v~~~~Gln--NhmGs~~tsn~~aM~~~m~~Lk~r~l~flDs~T~a~------ 162 (250)
T COG2861 91 YPKIEPGTLRPGMSAEEILRRLRKAMNKVPDAVGLN--NHMGSRFTSNEDAMEKLMEALKERGLYFLDSGTIAN------ 162 (250)
T ss_pred CCCCCCCCcccCCCHHHHHHHHHHHHhhCccceeeh--hhhhhhhcCcHHHHHHHHHHHHHCCeEEEccccccc------
Confidence 22111 00124677777777777666655444432 122333211010002357788888866544443211
Q ss_pred ceeccCCCCCCcCCCCCCCCCCCCCcEEEeccccccCCCCCCCCCCCCccccCCCCCCCChHHHHHHHHHHHHHHHc---
Q psy16054 364 PYTLYFRMPHKCNGNAHNCPSKSHPTWEMVMNELDRRDDPTFDESLPGCHMVDSCSNIQTGEQFGRLLRHNFNRHFS--- 440 (563)
Q Consensus 364 Pytld~~~p~~c~~~~~pcp~~~~~lweiP~~~l~~~d~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~F~~~y~--- 440 (563)
++--++ -.=.+||+- .+ |. ++| +..+..++. .+|+++-.
T Consensus 163 --S~a~~i---------------Ak~~gVp~~--~r-dv-----------fLD---~e~~~~~V~----kql~~~~~~Ar 204 (250)
T COG2861 163 --SLAGKI---------------AKEIGVPVI--KR-DV-----------FLD---DEDTEAAVL----KQLDAAEKLAR 204 (250)
T ss_pred --chhhhh---------------HhhcCCcee--ee-ee-----------eec---CcCCHHHHH----HHHHHHHHHHH
Confidence 000000 001123331 11 10 112 123444444 44444332
Q ss_pred cCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHH
Q psy16054 441 TNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQ 488 (563)
Q Consensus 441 ~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~ 488 (563)
.|+..++|. | + .+.-+++|++.++.+. -.+|-+|..++++.
T Consensus 205 k~G~ai~IG-h-~----~~~Tv~vl~~~~~~l~-~~gIelV~~s~L~~ 245 (250)
T COG2861 205 KNGSAIGIG-H-P----HKNTVAVLQQWLDELP-ARGIELVPVSALLG 245 (250)
T ss_pred hcCceEEec-C-C----chhHHHHHHHHHHhCC-CCCeEEecHHHhhc
Confidence 234344333 5 3 3678999999999986 46999999998864
No 31
>KOG2397|consensus
Probab=92.30 E-value=0.099 Score=56.30 Aligned_cols=70 Identities=27% Similarity=0.377 Sum_probs=51.3
Q ss_pred cccCCCCceecCCCCCcCCCcCCCCCCccCCCCCccCcCCCCCCCCCCCCcccCCC----ceecCccccCCCCCCCCCCc
Q psy16054 98 QISCPSGLAFDVDKQTCDWKAKVTNCNQLDRPRKVLPILKTDEPVCPEGKLSCGNG----ECIDKTLFCDDKPDCKDESD 173 (563)
Q Consensus 98 ~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde~~~~p~~~~~~~~C~~~~f~C~~G----~Ci~~~~~Cdg~~DC~DgsD 173 (563)
.|.|-.|.-...-.++=| +.=||.|||||.. ...|+.+.|.|.|- .=|+..-+-||.-||-||||
T Consensus 44 ~~~CLdgs~~i~f~qlND---d~CDC~DGsDEPG--------tsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSD 112 (480)
T KOG2397|consen 44 MFKCLDGSKTISFSQLND---DSCDCLDGSDEPG--------TSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSD 112 (480)
T ss_pred ceeeccCCcccCHHHhcc---ccccCCCCCCCCc--------cccCCCCceeeeecCCCceeeechhccCcccccccCCC
Confidence 566666554333344444 3446999999952 23699999999862 58889999999999999999
Q ss_pred ccccC
Q psy16054 174 ENACS 178 (563)
Q Consensus 174 E~~C~ 178 (563)
|..-+
T Consensus 113 E~~Sg 117 (480)
T KOG2397|consen 113 EYLSG 117 (480)
T ss_pred CccCC
Confidence 96544
No 32
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=86.05 E-value=0.57 Score=44.45 Aligned_cols=33 Identities=39% Similarity=0.634 Sum_probs=27.9
Q ss_pred CCCCcccCCCc--eecCccccCCCCCCCCCCcccc
Q psy16054 144 PEGKLSCGNGE--CIDKTLFCDDKPDCKDESDENA 176 (563)
Q Consensus 144 ~~~~f~C~~G~--Ci~~~~~Cdg~~DC~DgsDE~~ 176 (563)
..+.|+|-+|. =|+.....|+.=||.|||||-.
T Consensus 34 ~~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEPG 68 (176)
T PF12999_consen 34 ENGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEPG 68 (176)
T ss_pred CCCceEecCCCCceecHHHccCcceeCCCCCCccc
Confidence 45679998874 3899999999999999999943
No 33
>PF01607 CBM_14: Chitin binding Peritrophin-A domain; InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=76.85 E-value=1.3 Score=32.65 Aligned_cols=28 Identities=36% Similarity=0.689 Sum_probs=18.7
Q ss_pred cCcccCCCCceecCCCCCcCCCcCCCCC
Q psy16054 96 LKQISCPSGLAFDVDKQTCDWKAKVTNC 123 (563)
Q Consensus 96 ~~~~~C~~G~~~~~~~~~CD~~~~v~dC 123 (563)
+...+|+.|+.||+..++|++++.|..|
T Consensus 26 ~~~~~C~~g~~fd~~~~~C~~~~~~~~C 53 (53)
T PF01607_consen 26 AVEQRCPEGLYFDPSSQRCVPPSNVVQC 53 (53)
T ss_dssp EEEEE-TTS-EE-TTTSSEE-TTT-TT-
T ss_pred EECCCCcCCCEECcCcCEEcCCccCCCC
Confidence 4589999999999999999988764444
No 34
>PF01074 Glyco_hydro_38: Glycosyl hydrolases family 38 N-terminal domain; InterPro: IPR000602 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 38 GH38 from CAZY comprises enzymes with only one known activity; alpha-mannosidase (3.2.1.24 from EC) (3.2.1.114 from EC). Lysosomal alpha-mannosidase is necessary for the catabolism of N-linked carbohydrates released during glycoprotein turnover. The enzyme catalyses the hydrolysis of terminal, non-reducing alpha-D-mannose residues in alpha-D-mannosides, and can cleave all known types of alpha-mannosidic linkages. Defects in the gene cause lysosomal alpha-mannosidosis (AM), a lysosomal storage disease characterised by the accumulation of unbranched oligo-saccharide chains.; GO: 0004559 alpha-mannosidase activity, 0005975 carbohydrate metabolic process; PDB: 2WYI_A 2WYH_A 1O7D_A 3CZN_A 2FYV_A 3D50_A 3EJU_A 3EJS_A 3DX3_A 3BVX_A ....
Probab=72.07 E-value=22 Score=35.97 Aligned_cols=80 Identities=10% Similarity=0.086 Sum_probs=54.6
Q ss_pred HHHHHHHHcCC-eecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHHH
Q psy16054 267 ASVQELHRKGH-EISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMTD 344 (563)
Q Consensus 267 ~~v~~i~~~GH-EIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~e 344 (563)
+.+++++++|- ||..=+|.-.+.. -.+.|.+.+.|...++.+++..|+. ..+||-+ +||... .+.++|+.
T Consensus 64 ~~~~~lv~~Gri~~vgg~~~~~D~~----l~~~Eslirql~~G~~~~~~~fg~~--~~~~~~~D~FG~~~--~lP~il~~ 135 (275)
T PF01074_consen 64 KRIKKLVKEGRIEIVGGWYVQPDEN----LPSGESLIRQLLYGHKYLRKEFGVR--PKVAWQPDSFGHSA--QLPQILKQ 135 (275)
T ss_dssp HHHHHHHHTTSEEESSSBSS-B-SS----SS-HHHHHHHHHHHHHHHHHHHTG----SEEEESSSSSB-T--CHHHHHHT
T ss_pred HHHHHHHHhceeEEeCceeeecccc----CCCHHHHHHHHhhhHHHHHHhcCCC--CCeEEeCCCCCCch--hhHHHHhc
Confidence 46688888885 5544445433321 2578889999999999888777754 3467766 888865 58999999
Q ss_pred cCCeEEeecC
Q psy16054 345 QFFVYDASIT 354 (563)
Q Consensus 345 ~Gf~YDsS~~ 354 (563)
.||.|-....
T Consensus 136 ~Gi~~~v~~r 145 (275)
T PF01074_consen 136 FGIKYFVIWR 145 (275)
T ss_dssp TT-SEEEESS
T ss_pred cCcceEEEec
Confidence 9999988875
No 35
>KOG1958|consensus
Probab=67.03 E-value=1.2e+02 Score=36.02 Aligned_cols=74 Identities=15% Similarity=0.202 Sum_probs=53.0
Q ss_pred HHHHHHHHcC-CeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHHH
Q psy16054 267 ASVQELHRKG-HEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMTD 344 (563)
Q Consensus 267 ~~v~~i~~~G-HEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~e 344 (563)
..|+.+..+| -||..-||.-.+- ++...-...+++......|+...|++++ .||.. |||..+ ++.=+|+.
T Consensus 229 ~a~k~lv~~GqlEIvtGGWVMpDE----An~Hy~~~i~qliEGh~Wl~~~igv~P~--s~WaiDPFG~S~--TmpYLL~~ 300 (1129)
T KOG1958|consen 229 NAVKRLVKNGQLEIVTGGWVMPDE----ANSHYFAMIDQLIEGHQWLKNNIGVTPQ--SGWAIDPFGYSS--TMPYLLRR 300 (1129)
T ss_pred HHHHHHHhcCcEEEEeCccccCcc----cchhHHHHHHHHhhhhhHHhccCCCCCC--cccccCCCCCCc--chHHHHhh
Confidence 6899999999 5999999976654 2445667788888889999998887643 35543 666533 35556666
Q ss_pred cCCe
Q psy16054 345 QFFV 348 (563)
Q Consensus 345 ~Gf~ 348 (563)
+||.
T Consensus 301 ag~~ 304 (1129)
T KOG1958|consen 301 AGFE 304 (1129)
T ss_pred cCch
Confidence 6654
No 36
>KOG3509|consensus
Probab=62.45 E-value=14 Score=43.84 Aligned_cols=89 Identities=22% Similarity=0.412 Sum_probs=63.7
Q ss_pred CCCCcCCCcc---ccccCcccCCCCceecCCCCCcCCCcCCCCCCccCCCCCccCcCCCCCCCCCCCCcccCCC-ceecC
Q psy16054 83 WKTNVKNCDL---LESLKQISCPSGLAFDVDKQTCDWKAKVTNCNQLDRPRKVLPILKTDEPVCPEGKLSCGNG-ECIDK 158 (563)
Q Consensus 83 ~~~~v~~C~~---~~~~~~~~C~~G~~~~~~~~~CD~~~~v~dC~d~sde~~~~p~~~~~~~~C~~~~f~C~~G-~Ci~~ 158 (563)
..+++.+++- .+.+.++.|.++.+--+.. .|| ..-.|.+.+.+..|.-.. ....|.+..++|-+- +|...
T Consensus 18 ~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~--~~s~~~~~~~~c~~~~~~~~~ 91 (964)
T KOG3509|consen 18 DRSDVANDPAIGSACSPNEFKCNNPRCVQPEA-LLD---ADSTCGPNSTPSGCNAKP--SASDCKPTETQCRDRLRCNPQ 91 (964)
T ss_pred hhcccCCCccccccCCcchhccCCccccCchh-hhc---cccccCCCCCcCCccccc--cccccCCcccccccchhcCCc
Confidence 3444554432 2347789999998885444 688 455788888776553211 112577889999876 79999
Q ss_pred ccccCCCCCCCCCCccccc
Q psy16054 159 TLFCDDKPDCKDESDENAC 177 (563)
Q Consensus 159 ~~~Cdg~~DC~DgsDE~~C 177 (563)
...|+|.+||.|+++|..+
T Consensus 92 ~~~~~g~~~~~~~~~~~~~ 110 (964)
T KOG3509|consen 92 SFQCDGTNDCKDGSDEVGC 110 (964)
T ss_pred cccccCCCCCCccchhccc
Confidence 9999999999999999766
No 37
>PF07315 DUF1462: Protein of unknown function (DUF1462); InterPro: IPR009190 There are currently no experimental data for members of this group of bacterial proteins or their homologues. A crystal structure of Q7A6J8 from SWISSPROT revealed a thioredoxin-like fold, its core consisting of three layers alpha/beta/alpha.; PDB: 1XG8_A.
Probab=60.05 E-value=38 Score=28.61 Aligned_cols=53 Identities=19% Similarity=0.411 Sum_probs=34.3
Q ss_pred CCCCCCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHc
Q psy16054 417 SCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLD 474 (563)
Q Consensus 417 ~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~ 474 (563)
+|.+..|..+.++||..+..|-|-. .||.++.= . |. +|.--..-++|++.|..
T Consensus 11 SCVn~PsSkeTyeWL~aal~RKyp~--~~f~~~Yi-D-i~-~p~~~~~~~~~a~~I~e 63 (93)
T PF07315_consen 11 SCVNAPSSKETYEWLEAALKRKYPD--QPFEFTYI-D-IE-NPPENDHDQQFAERILE 63 (93)
T ss_dssp GGSSS--HHHHHHHHHHHHHHH-TT--S-EEEEEE-E-TT-T----HHHHHHHHHHHT
T ss_pred hhcCCCCchhHHHHHHHHHhCcCCC--CceEEEEE-e-cC-CCCccHHHHHHHHHHHh
Confidence 4888899999999999999999865 57866542 2 22 23333667889999985
No 38
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=53.81 E-value=11 Score=27.64 Aligned_cols=24 Identities=50% Similarity=0.942 Sum_probs=20.7
Q ss_pred cCcccCCCCceecCCCCCcCCCcC
Q psy16054 96 LKQISCPSGLAFDVDKQTCDWKAK 119 (563)
Q Consensus 96 ~~~~~C~~G~~~~~~~~~CD~~~~ 119 (563)
...+.|+.|+.|+...+.|+++..
T Consensus 28 ~~~~~C~~g~~f~~~~~~C~~~~~ 51 (56)
T smart00494 28 PIVGSCPAGLVFDPATQTCDWPDN 51 (56)
T ss_pred EEeccCcCCCeECCCCCCcCCccc
Confidence 457899999999999999998754
No 39
>COG4878 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=53.80 E-value=22 Score=36.05 Aligned_cols=98 Identities=18% Similarity=0.250 Sum_probs=56.3
Q ss_pred ceEEEEeccCC-CHH-HHHHHHHcCCeecccCCCCCCCcCCCCCCCHHHHHH-H-HHHHHHHHHHHhC---CCCCcceEE
Q psy16054 254 KGTFFVSHKYT-NYA-SVQELHRKGHEISVFSLTHKDDPKYWSGGSYDDWLA-E-MAGGRLIIERFAN---ISDGSVIGV 326 (563)
Q Consensus 254 kaTFFv~~~~~-~y~-~v~~i~~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~-E-i~~~~~~l~~~~g---i~~~~~~G~ 326 (563)
+.-||...+.. +.. .=.+|...|-|+|.|||.|..++. .....-|+. | |.++...|..+.. +++-+..-|
T Consensus 34 ~~gff~~~~~~~dl~~~g~el~~~gge~gIhgYNh~sl~p---k~~~knw~see~m~ka~~~L~~y~~~ls~~~y~~~vf 110 (309)
T COG4878 34 KGGFFSLEQSFDDLSAKGTELTFDGGEKGIHGYNHKSLQP---KDILKNWESEEIMNKANYTLADYGDILSITGYDTFVF 110 (309)
T ss_pred CCccchhhccchhhhhcCcEEEecCCceeEeeccCCCCCc---cccccchhhHHHHHHHHHHHHHHHHHHhccccceEEE
Confidence 45567665544 222 223466789999999999998641 122233332 2 6666666666533 233456667
Q ss_pred ecCccccChhhHHHHHHHcC-CeEEeecC
Q psy16054 327 RAPYLRVGGNKQFSMMTDQF-FVYDASIT 354 (563)
Q Consensus 327 R~P~l~~~~~~~~~~L~e~G-f~YDsS~~ 354 (563)
-+|.-...++...++++.+- ..--||+.
T Consensus 111 VppSnil~q~gk~alvk~~p~lktissiy 139 (309)
T COG4878 111 VPPSNILLQKGKFALVKQAPSLKTISSIY 139 (309)
T ss_pred eCcccccchhHHHHHHHhCCCcceeeeEe
Confidence 88877766555556666542 33445543
No 40
>COG1543 Uncharacterized conserved protein [Function unknown]
Probab=51.56 E-value=36 Score=37.22 Aligned_cols=78 Identities=23% Similarity=0.200 Sum_probs=61.8
Q ss_pred HHHHHHc-CCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCC
Q psy16054 269 VQELHRK-GHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFF 347 (563)
Q Consensus 269 v~~i~~~-GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf 347 (563)
.+.+.+. +-||-.-+.+|-.++-+ +. ++.+.++|..+.+.-++..|. .|.|+=.|-..+.+ .+-.+|+++|+
T Consensus 129 f~~~~~~g~ieilts~athg~lPll--~~-peAi~AQi~~g~~~ye~~fg~---~P~GiWlPEcay~p-gie~~l~~~Gi 201 (504)
T COG1543 129 FKELQRSGGIEILTSAATHGYLPLL--GG-PEAIEAQILTGIELYEEHFGL---APKGIWLPECAYAP-GIERILKDAGI 201 (504)
T ss_pred HHHHHHcCCceeeeehhhheehhhc--CC-chhhHHHHHHHHHHHHHHhCC---CCCceechhhcccc-chHHHHHhcCc
Confidence 4445554 57999999999887642 23 788999999999999988885 38999999988885 57889999999
Q ss_pred eEEeec
Q psy16054 348 VYDASI 353 (563)
Q Consensus 348 ~YDsS~ 353 (563)
.|--.-
T Consensus 202 ~yf~vd 207 (504)
T COG1543 202 EYFFVD 207 (504)
T ss_pred eEEEec
Confidence 986544
No 41
>PF12340 DUF3638: Protein of unknown function (DUF3638); InterPro: IPR022099 This domain family is found in eukaryotes, and is approximately 230 amino acids in length. There are two conserved sequence motifs: LLE and NMG.
Probab=48.52 E-value=85 Score=31.33 Aligned_cols=63 Identities=19% Similarity=0.293 Sum_probs=48.5
Q ss_pred hHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHH
Q psy16054 424 GEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQW 489 (563)
Q Consensus 424 ~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w 489 (563)
.++.++.|...|-..+ +|..+.+.|+ ....-.+..++.+++.++.+....+|.++|.+.+...
T Consensus 82 l~q~~~~L~~~lg~l~--~r~i~~lpFs-R~~~~~~~~~~~~~~l~~~~~~~~gill~~PEhilSf 144 (229)
T PF12340_consen 82 LEQMRQMLRSRLGGLL--NRRIYHLPFS-RSTPLTPETLEKIRQLLEECMRSGGILLATPEHILSF 144 (229)
T ss_pred HHHHHHHHHHHHHHHh--CCeeEEeccc-CCCCCCHHHHHHHHHHHHHHHHcCCEEEeChHHHHHH
Confidence 3556777777775544 5666778887 5455567788999999999999999999999999653
No 42
>PF04794 YdjC: YdjC-like protein; InterPro: IPR006879 This entry contains ChbG (YdjC), which is is an uncharacterised protein encoded by the chb (N,N'-diacetylchitobiose, also called [GlcNAc]2) or cel operon, which encodes enzymes involved in growth on an N,N'-diacetylchitobiose carbon source [, ]. The entry also contains HpnK, which is a protein associated with hopanoid biosynthesis.; PDB: 2E67_C 2I5I_B.
Probab=43.06 E-value=63 Score=32.56 Aligned_cols=119 Identities=8% Similarity=0.080 Sum_probs=65.8
Q ss_pred ceEEeecC-CcCCCCchHHHHHhccCCCCCCCCCCceE-EEEeccCCCHHHHHHHHHc-CCeecccCC-CC---------
Q psy16054 220 MITITFNG-AVNVDNSDLYDEIFNGNRQNPNGCQIKGT-FFVSHKYTNYASVQELHRK-GHEISVFSL-TH--------- 286 (563)
Q Consensus 220 ~V~lTFDD-~~~~~~~~~~~~ll~~~r~n~n~~~ikaT-FFv~~~~~~y~~v~~i~~~-GHEIa~H~~-~H--------- 286 (563)
.|+|.=|| |........+++++...+ +.+| ..|++.+. .+.++.+.+. +-.||.|-. +.
T Consensus 2 ~lii~ADDfG~s~~vn~gI~~~~~~G~-------vtstslM~n~p~~-~~a~~~~k~~~~~~vGlHl~Lt~g~p~~~p~~ 73 (261)
T PF04794_consen 2 KLIINADDFGLSPGVNRGIIEAFENGI-------VTSTSLMVNMPAF-EEAAALAKNNPGLDVGLHLNLTSGWPVLSPTQ 73 (261)
T ss_dssp EEEEEEEEETSSHHHHHHHHHHHHCCT--------SEEEEETTSTTH-HHHHHHHHH-TTTEEEEEE-S---SSSS-TTT
T ss_pred EEEEECCCCCCCHHHHHHHHHHHHcCC-------ceEeeeccCChhH-HHHHHHHHhCCCCCEEEEeEccCCCCCCCccC
Confidence 36778888 666665566778876422 4444 55555543 3455555554 478888842 11
Q ss_pred ------------CCC----cCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccCh---hhHHHHHHHcCC
Q psy16054 287 ------------KDD----PKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGG---NKQFSMMTDQFF 347 (563)
Q Consensus 287 ------------~~~----~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~---~~~~~~L~e~Gf 347 (563)
... ......++.++++.||..+.+.+.+ .|+.+..+-|.-- ....+ +.+++++++.|+
T Consensus 74 vpsLvd~~G~F~~~~~~~~~~~~~~~~~~ev~~E~~AQi~~f~~-~G~~P~hiD~H~h--~h~~p~v~~~~~~~a~~y~l 150 (261)
T PF04794_consen 74 VPSLVDEDGYFPRSKRLLRRALLRRADYEEVERELRAQIERFLD-MGRKPDHIDGHQH--VHFHPPVREALLDLAKEYGL 150 (261)
T ss_dssp SGGGBBTTTBB-SCC----HCHCCC--HHHHHHHHHHHHHHHHC-CSS-ECEEEEGGG--GGCSHHHHHHHHHHHHHCT-
T ss_pred CCCccCCCCccCCccccchhhhhccCCHHHHHHHHHHHHHHHHH-cCCCCCeEecccc--cccChHHHHHHHHHHHHcCC
Confidence 111 0111456789999999999988887 7876554444210 01111 346777888888
Q ss_pred eE
Q psy16054 348 VY 349 (563)
Q Consensus 348 ~Y 349 (563)
.+
T Consensus 151 p~ 152 (261)
T PF04794_consen 151 PW 152 (261)
T ss_dssp EB
T ss_pred Cc
Confidence 55
No 43
>COG4837 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=42.52 E-value=45 Score=28.32 Aligned_cols=54 Identities=19% Similarity=0.404 Sum_probs=40.3
Q ss_pred CCCCCCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcC
Q psy16054 417 SCSNIQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDR 475 (563)
Q Consensus 417 ~~~~~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~ 475 (563)
+|.+..|..++++||...+.|-|-+ .||-+..- . +. +|..-++..+|++.|.+.
T Consensus 18 SCV~aPtsKdt~eWLeaalkRKyp~--~~F~~~Yi-D-I~-n~~~e~~~~~~aekI~~d 71 (106)
T COG4837 18 SCVNAPTSKDTYEWLEAALKRKYPN--QPFKYTYI-D-IT-NPPLEDHDLQFAEKIEQD 71 (106)
T ss_pred HhcCCCcchhHHHHHHHHHhccCCC--CCcEEEEE-E-cC-CCccHHHHHHHHHHHhcc
Confidence 4778889999999999999998864 57744332 1 22 455677888999999853
No 44
>cd07286 PX_SNX18 The phosphoinositide binding Phox Homology domain of Sorting Nexin 18. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX18, like SNX9, contains an N-terminal Src Homology 3 (SH3) domain, a PX domain, and a C-terminal Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature. The PX-BAR structural unit helps determine specific membrane localization. SNX18 is localized to peripheral endosomal structures, and acts in a trafficki
Probab=41.38 E-value=46 Score=30.02 Aligned_cols=73 Identities=12% Similarity=0.195 Sum_probs=44.5
Q ss_pred HHHHHHHHHHHHccCCcc-e-----EEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHhhCC-chhhhhc
Q psy16054 428 GRLLRHNFNRHFSTNRAP-L-----GLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP-TELTSLR 500 (563)
Q Consensus 428 ~~~l~~~F~~~y~~nrap-~-----~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~~~p-~~~~~~~ 500 (563)
+.+|.+.+.+.|-+--.| + +-.+.+.|+. .|..+|++||..|+.+|-.+ ...++-.++..+ +.....+
T Consensus 41 F~~L~~~L~~~~p~~~IPpLP~K~~~g~f~~~FIe---~Rr~~Lq~FL~ria~hp~L~--~s~~~~~FL~~~~~~~~~~~ 115 (127)
T cd07286 41 FDWLYARLAEKFPVISVPHIPEKQATGRFEEDFIS---KRRKGLIWWMDHMCSHPVLA--RCDAFQHFLTCPSTDEKAWK 115 (127)
T ss_pred HHHHHHHHHHHCCCcEeCCCcCCCcCCCCCHHHHH---HHHHHHHHHHHHHHcCcccc--cChHHHHHhcCCcchhHHHh
Confidence 566777777777542112 1 1234545433 68999999999999998755 345566677665 3444444
Q ss_pred cchhh
Q psy16054 501 DFQEW 505 (563)
Q Consensus 501 ~~~~w 505 (563)
....|
T Consensus 116 ~~~~~ 120 (127)
T cd07286 116 QGKRK 120 (127)
T ss_pred hcchH
Confidence 44433
No 45
>PLN02701 alpha-mannosidase
Probab=40.79 E-value=63 Score=39.49 Aligned_cols=79 Identities=11% Similarity=0.124 Sum_probs=60.3
Q ss_pred HHHHHHHHcCC-eecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHHH
Q psy16054 267 ASVQELHRKGH-EISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMTD 344 (563)
Q Consensus 267 ~~v~~i~~~GH-EIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~e 344 (563)
+.|++++++|. ||..=+|.-.+.. ....+.+.++|......|.+..|+.+ .+||.. |||..+ .+..+|++
T Consensus 106 ~~vk~LV~~GrLE~vgGgwvm~DEa----~~~~esiI~Ql~~G~~~l~~~fG~~P--~~~W~iDpFGhs~--~~P~Ll~~ 177 (1050)
T PLN02701 106 EAFTKLVKNGQLEIVGGGWVMNDEA----NSHYFAIIEQITEGNMWLNDTIGVAP--KNSWAIDPFGYSS--TMAYLLRR 177 (1050)
T ss_pred HHHHHHHHcCCEEEECCceeccccc----ccCHHHHHHHHHhhhHHHHhhcCCCC--CcCccCCCCCCCH--HHHHHHHh
Confidence 68999999995 7888888665542 35688888998888888877777653 467776 888754 58999999
Q ss_pred cCCeEEeec
Q psy16054 345 QFFVYDASI 353 (563)
Q Consensus 345 ~Gf~YDsS~ 353 (563)
.||.+---+
T Consensus 178 ~G~~~~~~~ 186 (1050)
T PLN02701 178 MGFENMLIQ 186 (1050)
T ss_pred CCCceEEEe
Confidence 999875433
No 46
>COG3590 PepO Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones]
Probab=40.62 E-value=8.9 Score=42.67 Aligned_cols=84 Identities=18% Similarity=0.346 Sum_probs=48.3
Q ss_pred CCCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC--CCHHHHHHHHHcCCeecccCCCC-------
Q psy16054 216 QVPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY--TNYASVQELHRKGHEISVFSLTH------- 286 (563)
Q Consensus 216 ~~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~--~~y~~v~~i~~~GHEIa~H~~~H------- 286 (563)
-+||+|.--+|-..|.-.++. .|| .+-||=+..- .||.-|-.++ ||||+ ||..-
T Consensus 446 M~pq~VNAYYnp~~N~IVFPA--AIL------------Q~PFfd~ea~~a~NYGgIGaVI--gHEI~-HgFDdqGakfD~ 508 (654)
T COG3590 446 MPPQTVNAYYNPQKNEIVFPA--AIL------------QAPFFDPEADSAANYGGIGAVI--GHEIG-HGFDDQGAKFDG 508 (654)
T ss_pred CCHHHhhhhcCCCCceEeeeH--Hhc------------CCCCCCCCcchhhcccCcccee--hhhhc-ccccCCccccCC
Confidence 378887655555555444443 244 4456655543 4887666665 99998 55533
Q ss_pred -CCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCC
Q psy16054 287 -KDDPKYWSGGSYDDWLAEMAGGRLIIERFANIS 319 (563)
Q Consensus 287 -~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~ 319 (563)
-++.+||..-+.+.+. +++.+++++|-|..
T Consensus 509 ~GnL~dWWT~eD~~aF~---~~T~~lvaqf~~~e 539 (654)
T COG3590 509 DGNLNDWWTDEDAAAFK---ERTKKLVAQFDGYE 539 (654)
T ss_pred CCcHHhhcCHHHHHHHH---HHHHHHHHHhcCcc
Confidence 2334566544444433 36667777776643
No 47
>COG3536 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.31 E-value=10 Score=32.99 Aligned_cols=78 Identities=15% Similarity=0.306 Sum_probs=50.6
Q ss_pred cccCCCCCCCCCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccCC----------CHHHHHHHHHcC
Q psy16054 207 RIPGGIEPNQVPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKYT----------NYASVQELHRKG 276 (563)
Q Consensus 207 ciP~~~~~~~~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~~----------~y~~v~~i~~~G 276 (563)
-+|..++...-.+..+|||+||-.......++.++... ..-.|++..--.-|.|.-. || .||-+.+.|
T Consensus 3 ~~P~~lrvsk~r~~L~l~f~dg~~~~L~ae~LRv~sPs-AEvqghgp~q~v~v~gkr~V~i~~i~p~GnY-avri~FdDg 80 (120)
T COG3536 3 QWPTELRVSKDRKVLSLTFADGAPFRLPAEFLRVLSPS-AEVQGHGPGQRVLVPGKRNVQIRDIEPVGNY-AVRILFDDG 80 (120)
T ss_pred CCCceeEEeccCcEEEEEeCCCCcccCCHHHHHhcCch-HHhcccCCCCceeeeccccceEEEeEecCce-EEEEEecCC
Confidence 46777777777888999999977665555556666431 1113455544455555421 44 467778899
Q ss_pred CeecccCCCC
Q psy16054 277 HEISVFSLTH 286 (563)
Q Consensus 277 HEIa~H~~~H 286 (563)
|+-|.-+|+-
T Consensus 81 HDsGiy~W~Y 90 (120)
T COG3536 81 HDSGIYTWAY 90 (120)
T ss_pred cccCcccHHH
Confidence 9999888753
No 48
>TIGR01358 DAHP_synth_II 3-deoxy-7-phosphoheptulonate synthase, class II. Homologs scoring between trusted and noise cutoff include proteins involved in antibiotic biosynthesis; one example is active as this enzyme, while another acts on an amino analog.
Probab=40.27 E-value=1.5e+02 Score=32.26 Aligned_cols=84 Identities=14% Similarity=0.136 Sum_probs=59.8
Q ss_pred CcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC---C--CHHHHHHHHHcCCeec-----ccCCCCCCCcCCCCCCC
Q psy16054 228 AVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY---T--NYASVQELHRKGHEIS-----VFSLTHKDDPKYWSGGS 297 (563)
Q Consensus 228 ~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~---~--~y~~v~~i~~~GHEIa-----~H~~~H~~~~~~~~~~s 297 (563)
|++-. -+.++.|++ +-||++-+-|.|+.+--.. . -+.+|+++.++||.|. .||.+......| ..-.
T Consensus 287 GP~~~-p~~l~~L~~--~LnP~~epGRlTLI~RmGa~kV~~~LP~li~aV~~~G~~VvW~cDPMHGNT~~t~~G~-KTR~ 362 (443)
T TIGR01358 287 GPSMT-PDELLRLIE--RLNPENEPGRLTLISRMGADKIADKLPPLLRAVKAAGRRVVWVCDPMHGNTEEAASGY-KTRR 362 (443)
T ss_pred CCCCC-HHHHHHHHH--HhCCCCCCceEEEEeccCchHHHHhHHHHHHHHHHcCCceEEeecCCCCCceeCCCCc-cCCc
Confidence 45433 345556765 5688888888887763322 1 4589999999999997 799887754332 3456
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy16054 298 YDDWLAEMAGGRLIIERF 315 (563)
Q Consensus 298 ~~~~~~Ei~~~~~~l~~~ 315 (563)
.+.+.+||.+..++.++.
T Consensus 363 f~~Il~Ev~~ff~vh~~~ 380 (443)
T TIGR01358 363 FDDIRSEVKGFFEVHRAE 380 (443)
T ss_pred HHHHHHHHHHHHHHHHHc
Confidence 788999999999988764
No 49
>PRK09819 alpha-mannosidase; Provisional
Probab=39.74 E-value=1.1e+02 Score=36.68 Aligned_cols=79 Identities=15% Similarity=0.040 Sum_probs=54.4
Q ss_pred HHHHHHHHHcCCeecccCC-CCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHH
Q psy16054 266 YASVQELHRKGHEISVFSL-THKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMT 343 (563)
Q Consensus 266 y~~v~~i~~~GHEIa~H~~-~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~ 343 (563)
++.|++++++|-=|. =+| .-.+. .-.+.|.+.+.|.-.++.+++| |+. ..+||-+ |||..+ .+.++|+
T Consensus 69 ~~~ik~lV~~Grl~~-G~W~v~~D~----~l~sgEsliRqll~G~~~~~~f-G~~--~~vgwlpD~FG~s~--~lPqIl~ 138 (875)
T PRK09819 69 KERVKKLVQAGKLII-GPWYTQTDQ----LVVSGESIVRNLLYGIRDCREF-GEP--MKIGYLPDSFGQSG--QMPQIYN 138 (875)
T ss_pred HHHHHHHHHcCCEeE-CcEEeeccc----ccCCHHHHHHHHHHHHHHHHHc-CCC--CceeeecCCCCCcH--HHHHHHH
Confidence 368999999997443 334 21111 1246788888888777778765 543 3567776 676644 6899999
Q ss_pred HcCCeEEeecC
Q psy16054 344 DQFFVYDASIT 354 (563)
Q Consensus 344 e~Gf~YDsS~~ 354 (563)
..||.|-..|.
T Consensus 139 ~~Gi~~~~~wr 149 (875)
T PRK09819 139 GFGITRTLFWR 149 (875)
T ss_pred hCCCCeEEEEe
Confidence 99999987774
No 50
>cd02995 PDI_a_PDI_a'_C PDIa family, C-terminal TRX domain (a') subfamily; composed of the C-terminal redox active a' domains of PDI, ERp72, ERp57 (or ERp60) and EFP1. PDI, ERp72 and ERp57 are endoplasmic reticulum (ER)-resident eukaryotic proteins involved in oxidative protein folding. They are oxidases, catalyzing the formation of disulfide bonds of newly synthesized polypeptides in the ER. They also exhibit reductase activity in acting as isomerases to correct any non-native disulfide bonds, as well as chaperone activity to prevent protein aggregation and facilitate the folding of newly synthesized proteins. PDI and ERp57 have the abb'a' domain structure (where a and a' are redox active TRX domains while b and b' are redox inactive TRX-like domains). PDI also contains an acidic region (c domain) after the a' domain that is absent in ERp57. ERp72 has an additional a domain at the N-terminus (a"abb'a' domain structure). ERp57 interacts with the lectin chaperones, calnexin and calreticu
Probab=38.59 E-value=72 Score=26.26 Aligned_cols=51 Identities=18% Similarity=0.288 Sum_probs=33.3
Q ss_pred HHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeH
Q psy16054 433 HNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTM 483 (563)
Q Consensus 433 ~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~ 483 (563)
++|+......+.++.+.|+++|-..-......|+++.+.+.....|-|+..
T Consensus 8 ~~f~~~i~~~~~~~~v~f~~~~C~~C~~~~~~~~~~~~~~~~~~~~~~~~i 58 (104)
T cd02995 8 KNFDEVVLDSDKDVLVEFYAPWCGHCKALAPIYEELAEKLKGDDNVVIAKM 58 (104)
T ss_pred hhhHHHHhCCCCcEEEEEECCCCHHHHHHhhHHHHHHHHhcCCCCEEEEEE
Confidence 456555544456888999999966444456677777776655456766653
No 51
>PLN02291 phospho-2-dehydro-3-deoxyheptonate aldolase
Probab=37.96 E-value=1.7e+02 Score=32.21 Aligned_cols=84 Identities=15% Similarity=0.100 Sum_probs=59.7
Q ss_pred CcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC---C--CHHHHHHHHHcCCeec-----ccCCCCCCCcCCCCCCC
Q psy16054 228 AVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY---T--NYASVQELHRKGHEIS-----VFSLTHKDDPKYWSGGS 297 (563)
Q Consensus 228 ~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~---~--~y~~v~~i~~~GHEIa-----~H~~~H~~~~~~~~~~s 297 (563)
|++-. -+.++.|++ +-||++-+-|.|+-+--.. . -+.+|+++.+.||.|. .||.+......| ..-.
T Consensus 307 GPs~~-pdel~~L~~--~LnP~~epGRlTLI~RmGa~kV~~~LP~Li~aV~~~G~~VvW~cDPMHGNT~~t~~G~-KTR~ 382 (474)
T PLN02291 307 SDKMD-PEELVKLIE--ILNPQNKPGRLTIIVRMGAEKLRVKLPHLIRAVRRAGQIVTWVSDPMHGNTIKAPSGL-KTRP 382 (474)
T ss_pred CCCCC-HHHHHHHHH--HhCCCCCCceEEEEeccchHHHHHHHHHHHHHHHHcCCceEEeecCCCCCceeCCCCc-cCCc
Confidence 55443 345566775 5688888888887763221 2 4489999999999997 799887654322 3456
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy16054 298 YDDWLAEMAGGRLIIERF 315 (563)
Q Consensus 298 ~~~~~~Ei~~~~~~l~~~ 315 (563)
.+.+.+||.+..++.++.
T Consensus 383 f~~Il~Ev~~ff~vh~~~ 400 (474)
T PLN02291 383 FDAIRAEVRAFFDVHEQE 400 (474)
T ss_pred HHHHHHHHHHHHHHHHHc
Confidence 788999999999988764
No 52
>PF11071 DUF2872: Protein of unknown function (DUF2872); InterPro: IPR019884 This entry represents a family of uncharacterised proteins, including YtoQ from Bacillus subtilis. This family shows some sequence similarity to a family of nucleoside 2-deoxyribosyltransferases (COG3613 as iterated through CDD), but sufficiently remote that PSI-BLAST starting from YtoQ and exploring outwards does not discover the relationship.
Probab=33.57 E-value=52 Score=29.78 Aligned_cols=52 Identities=17% Similarity=0.178 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHcc-CCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHh
Q psy16054 428 GRLLRHNFNRHFST-NRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWM 490 (563)
Q Consensus 428 ~~~l~~~F~~~y~~-nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~ 490 (563)
|+.|..+||.-|.. -+.|+ |.+|++-+. .+|++ +.+.-.+|.-|..|+++-+
T Consensus 84 YKQWNaAfDAg~a~AlgKpl-I~lh~~~~~------HpLKE----vda~A~a~~et~~Qvv~iL 136 (141)
T PF11071_consen 84 YKQWNAAFDAGYAAALGKPL-ITLHPEELH------HPLKE----VDAAALAVAETPEQVVEIL 136 (141)
T ss_pred HHHHHHHhhHHHHHHcCCCe-EEecchhcc------ccHHH----HhHhhHhhhCCHHHHHHHH
Confidence 78899999999963 45688 889966444 23333 3344458888999987643
No 53
>KOG3509|consensus
Probab=32.97 E-value=52 Score=39.38 Aligned_cols=61 Identities=18% Similarity=0.530 Sum_probs=49.0
Q ss_pred CcCCCcCCCCCCccCCCCCccCcCCCCCCCCCCCCcccCCCceecCccccCCCCCCCCCCcccccCCC
Q psy16054 113 TCDWKAKVTNCNQLDRPRKVLPILKTDEPVCPEGKLSCGNGECIDKTLFCDDKPDCKDESDENACSVE 180 (563)
Q Consensus 113 ~CD~~~~v~dC~d~sde~~~~p~~~~~~~~C~~~~f~C~~G~Ci~~~~~Cdg~~DC~DgsDE~~C~~~ 180 (563)
.|+ +..+|.+.+|+.++.- -...|.+.+++|.+++|.-..+.||+..+|.++|+..+|..+
T Consensus 9 ~~~---~~~~~~~~~~~~~~~~----~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 69 (964)
T KOG3509|consen 9 ACD---RQPDCRDRSDVANDPA----IGSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAK 69 (964)
T ss_pred hhc---cchhhHhhcccCCCcc----ccccCCcchhccCCccccCchhhhccccccCCCCCcCCcccc
Confidence 466 5668888888865322 123688999999999999999999999999999988888654
No 54
>TIGR03473 HpnK hopanoid biosynthesis associated protein HpnK. The sequences represented by this model are members of the pfam04794 "YdjC-like" family of uncharacterized proteins. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0976) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnK) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopanoid biosynthesis locus was described consisting of the genes HpnA-E and SHC (HpnF). Continuing past SHC are found a ph
Probab=31.79 E-value=2.9e+02 Score=28.31 Aligned_cols=117 Identities=9% Similarity=0.106 Sum_probs=64.4
Q ss_pred eEEeecC-CcCCCCchHHHHHhccCCCCCCCCCCc-eEEEEeccCCCHHHHHHHHH-cCCeecccC-CCCCC-------C
Q psy16054 221 ITITFNG-AVNVDNSDLYDEIFNGNRQNPNGCQIK-GTFFVSHKYTNYASVQELHR-KGHEISVFS-LTHKD-------D 289 (563)
Q Consensus 221 V~lTFDD-~~~~~~~~~~~~ll~~~r~n~n~~~ik-aTFFv~~~~~~y~~v~~i~~-~GHEIa~H~-~~H~~-------~ 289 (563)
|.|.=|| |........+.++++..+ +. +|..+++.+. .+.++.+.+ .+--||.|- .+... .
T Consensus 4 LIinADDfG~s~~vn~gI~~~~~~G~-------v~sts~M~n~p~~-~~a~~~~~~~~~~~vGlHl~Lt~g~pl~~~~~i 75 (283)
T TIGR03473 4 LIVTADDFGLSLEVNEAVEKAHRDGI-------LTAASLMVGAPAA-EDAVERARRLPGLGVGLHLVLVDGRPVLPPDQI 75 (283)
T ss_pred EEEecccCCCCHHHHHHHHHHHHcCc-------eeeeeeccCChhH-HHHHHHHHhCCCCCEEEEEEecCCCCCCChhhC
Confidence 5677788 666665566677776422 33 3455555543 244555554 477899992 22211 1
Q ss_pred cC-------CC------------CCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEe----cCccccChhhHHHHHHHcC
Q psy16054 290 PK-------YW------------SGGSYDDWLAEMAGGRLIIERFANISDGSVIGVR----APYLRVGGNKQFSMMTDQF 346 (563)
Q Consensus 290 ~~-------~~------------~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R----~P~l~~~~~~~~~~L~e~G 346 (563)
+. ++ .....++++.||..+.+.+.+ .|+.+..+-|.- .|... +..++++++.|
T Consensus 76 ~sLvd~~G~F~~~~~~~~~~~~~~~~~~~~v~~E~~aQi~~f~~-~G~~p~hiD~H~hvh~~P~v~---~~~~~la~~y~ 151 (283)
T TIGR03473 76 PDLVDGQGRFGDDMVRDGVRYFFLPAVRAQLAREIRAQFEAFAA-TGLPLDHVNAHKHFHLHPTVL---SLILEIGREYG 151 (283)
T ss_pred cCccCCCCCccHHHHHHhhHHhhccccHHHHHHHHHHHHHHHHH-hCCCCCeecCCCccccChhHH---HHHHHHHHHcC
Confidence 11 01 111346888899888888776 577655544421 23211 34566677766
Q ss_pred CeE
Q psy16054 347 FVY 349 (563)
Q Consensus 347 f~Y 349 (563)
+.|
T Consensus 152 lp~ 154 (283)
T TIGR03473 152 LRA 154 (283)
T ss_pred CCe
Confidence 544
No 55
>COG3603 Uncharacterized conserved protein [Function unknown]
Probab=31.68 E-value=73 Score=28.39 Aligned_cols=14 Identities=36% Similarity=0.648 Sum_probs=12.2
Q ss_pred HHHHHHHHcCCeec
Q psy16054 267 ASVQELHRKGHEIS 280 (563)
Q Consensus 267 ~~v~~i~~~GHEIa 280 (563)
..|+.|.++||||.
T Consensus 114 kAv~~L~eaGhev~ 127 (128)
T COG3603 114 KAVKALEEAGHEVL 127 (128)
T ss_pred HHHHHHHHcCCccc
Confidence 47899999999985
No 56
>PRK02134 hypothetical protein; Provisional
Probab=31.45 E-value=4e+02 Score=26.81 Aligned_cols=117 Identities=9% Similarity=-0.003 Sum_probs=67.9
Q ss_pred eEEeecC-CcCCCCchHHHHHhccCCCCCCCCCCc-eEEEEeccCCCHHHHHHHHHc-CCeecccC-CCCC-------CC
Q psy16054 221 ITITFNG-AVNVDNSDLYDEIFNGNRQNPNGCQIK-GTFFVSHKYTNYASVQELHRK-GHEISVFS-LTHK-------DD 289 (563)
Q Consensus 221 V~lTFDD-~~~~~~~~~~~~ll~~~r~n~n~~~ik-aTFFv~~~~~~y~~v~~i~~~-GHEIa~H~-~~H~-------~~ 289 (563)
+.+.=|| |........++++++..+ +. +|..+++.+.. +.++.+... +--||.|- .+.. .+
T Consensus 5 lIinADDfG~s~~vn~gI~~~~~~G~-------ltstslM~n~p~~~-~a~~l~~~~~~l~vGlHl~Lt~g~P~~~~psL 76 (249)
T PRK02134 5 LIVNADDFGLSKGQNYGIIEAHRNGV-------VTSTTAMVNMPAID-HAVQLSREYPSLGVGMHFVLTAGKPLTAMPSL 76 (249)
T ss_pred EEEECCcCCCCHHHHHHHHHHHHCCC-------eeEEEeecCCchHH-HHHHHHHhCCCCCEEEEEEecCCCccCCCCCC
Confidence 6677888 666655556667765322 33 44666666543 344444443 67889882 2211 11
Q ss_pred c----C----CC-----CCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEE----ecCccccChhhHHHHHHHcCCe
Q psy16054 290 P----K----YW-----SGGSYDDWLAEMAGGRLIIERFANISDGSVIGV----RAPYLRVGGNKQFSMMTDQFFV 348 (563)
Q Consensus 290 ~----~----~~-----~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~----R~P~l~~~~~~~~~~L~e~Gf~ 348 (563)
. . .+ ..++.++++.||..+.+.+.+..|+.+..+-|. ..|... +...++.++.|+.
T Consensus 77 ~d~~G~F~~~~~~~~~~~~~~~~ev~~E~~aQi~~f~~~~G~~p~hiD~H~h~h~~p~v~---~~~~~la~ey~lp 149 (249)
T PRK02134 77 TDRDGLLGKWIWQLAEEGTLPLEEIAQELAAQYERFLSLFGRKPTHLDSHHHVHMLPQIF---PIVARFAAEYGVP 149 (249)
T ss_pred cCCCCCccHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHhhCCCCCeeecccccccChhHH---HHHHHHHHHcCCc
Confidence 1 0 11 246688999999999999988888876666552 122111 2345566666653
No 57
>KOG1959|consensus
Probab=30.44 E-value=1.4e+02 Score=35.30 Aligned_cols=76 Identities=17% Similarity=0.211 Sum_probs=58.8
Q ss_pred HHHHHHHHcC-CeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEec-CccccChhhHHHHHHH
Q psy16054 267 ASVQELHRKG-HEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRA-PYLRVGGNKQFSMMTD 344 (563)
Q Consensus 267 ~~v~~i~~~G-HEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~-P~l~~~~~~~~~~L~e 344 (563)
.+||+|.+.| -|+..-||.-.+-. ..-+..+.+++......|+...|.-+...+||-. |||... .+..++++
T Consensus 111 ~~Vk~LV~~GrLEFi~Ggw~MnDEA----~~hY~dvIDQ~TlGlrfL~~tFG~cgrPrvgWqIDPFGHSr--eqAslfAq 184 (996)
T KOG1959|consen 111 EQVKKLVNEGRLEFIGGGWSMNDEA----TTHYQDVIDQFTLGLRFLKDTFGECGRPRVGWQIDPFGHSR--EQASLFAQ 184 (996)
T ss_pred HHHHHHHhcCCEEEecCeeeecchh----hhhHHHHHHHHHHHHHHHHHHhcccCCcceeeeeCCCCcch--HHHHHHHH
Confidence 6899999999 59999998655431 2456788899999999999888855555788876 888843 57888899
Q ss_pred cCCe
Q psy16054 345 QFFV 348 (563)
Q Consensus 345 ~Gf~ 348 (563)
+||.
T Consensus 185 mGfd 188 (996)
T KOG1959|consen 185 MGFD 188 (996)
T ss_pred hCcc
Confidence 9983
No 58
>COG3200 AroG 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism]
Probab=27.83 E-value=3.2e+02 Score=29.07 Aligned_cols=97 Identities=21% Similarity=0.230 Sum_probs=64.0
Q ss_pred cccCCCCCcccCCCCCCCCCCceEEeecCCcCCCCchHHHHHhccCCCCCCCCCCceEEEEeccC---C--CHHHHHHHH
Q psy16054 199 CFCSADGTRIPGGIEPNQVPQMITITFNGAVNVDNSDLYDEIFNGNRQNPNGCQIKGTFFVSHKY---T--NYASVQELH 273 (563)
Q Consensus 199 ~~c~~~g~ciP~~~~~~~~pq~V~lTFDD~~~~~~~~~~~~ll~~~r~n~n~~~ikaTFFv~~~~---~--~y~~v~~i~ 273 (563)
.||. |.+-|-++.| |++-. -+.++.+.+ +-||+.-+-+.|..+--.. . -+.+|+++.
T Consensus 277 e~~r--gv~NPig~K~-------------GP~~~-~d~ll~l~d--~LnP~nepGRLtLi~RmG~dKV~d~LP~li~av~ 338 (445)
T COG3200 277 EFLR--GVKNPIGVKI-------------GPSMT-PDELLELID--RLNPHNEPGRLTLIARMGADKVGDRLPPLVEAVE 338 (445)
T ss_pred HHHH--hcCCcccccc-------------CCCCC-HHHHHHHHH--hcCCCCCCceEEeehhhcchHHhhhhhHHHHHHH
Confidence 4565 6777877777 55433 344445554 5578777778887764322 1 458999999
Q ss_pred HcCCeec-----ccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHH
Q psy16054 274 RKGHEIS-----VFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIER 314 (563)
Q Consensus 274 ~~GHEIa-----~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~ 314 (563)
++||.|- .||.++.....| ..-..+.+.+|+....++-..
T Consensus 339 ~eG~~VvWs~DPMHGNTi~a~~gy-KTR~fd~Il~EV~sFfeihra 383 (445)
T COG3200 339 AEGHQVIWSSDPMHGNTIKASTGY-KTRPFDRILDEVQSFFEIHRA 383 (445)
T ss_pred HcCCceEEecCCCCCceeecCCCC-ccccHHHHHHHHHHHHHHHHh
Confidence 9999986 688888765433 234566777777777766554
No 59
>cd07295 PX_Grd19 The phosphoinositide binding Phox Homology domain of fungal Grd19. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Grd19 is involved in the localization of late Golgi membrane proteins in yeast. Grp19 associates with the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi, and functions as a cargo-specific adaptor for the retromer.
Probab=25.89 E-value=84 Score=27.72 Aligned_cols=62 Identities=16% Similarity=0.216 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHccCCcc-e-----EEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHhhCC
Q psy16054 428 GRLLRHNFNRHFSTNRAP-L-----GLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWMQNP 493 (563)
Q Consensus 428 ~~~l~~~F~~~y~~nrap-~-----~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~~~p 493 (563)
+.+|...+.+.|.+..-| + +-.+.+.|+ ..|..+|++||..|+.+|. +-.....+.++++.|
T Consensus 47 F~~L~~~L~~~~~~~~iPplP~K~~~~~~~~~~i---e~Rr~~Le~fL~~i~~~p~-l~~~s~~~~~FL~~~ 114 (116)
T cd07295 47 FEYFRDILERESPRVMIPPLPGKIFTNRFSDEVI---EERRQGLETFLQSVAGHPL-LQTGSKVLAAFLQDP 114 (116)
T ss_pred HHHHHHHHHHHCCCCccCCCCCCccccCCCHHHH---HHHHHHHHHHHHHHhcCHh-hhhCCHHHHHhcCCC
Confidence 455667776666442211 1 112333333 3588999999999998763 211233445565554
No 60
>COG3531 Predicted protein-disulfide isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=25.85 E-value=1.6e+02 Score=28.77 Aligned_cols=72 Identities=15% Similarity=0.181 Sum_probs=49.8
Q ss_pred HHHHHHHHc-CCeecccCCCCCCCcCCCCCCCHHHHHHHHHHHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHc
Q psy16054 267 ASVQELHRK-GHEISVFSLTHKDDPKYWSGGSYDDWLAEMAGGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQ 345 (563)
Q Consensus 267 ~~v~~i~~~-GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~ 345 (563)
.+|.++.+. |-||..|+=.-..... ..++ ..+++.|..--..|++.+|. ||++ +..-.+|...
T Consensus 20 Pll~~l~~~~gl~~~L~~GGl~aG~r--~pl~-~~~r~~i~~hd~rIa~lTGq----------pFg~---~y~~~lL~~~ 83 (212)
T COG3531 20 PLLEALSAQPGLEVVLHGGGLRAGNR--RPLD-PAFRAYILPHDQRIAQLTGQ----------PFGQ---AYFDGLLRDT 83 (212)
T ss_pred HHHHHHHhcCCceEEEecCCccCCCC--CCCC-hHHHHHhhhhHHHHHHHhCC----------ccch---hHHhhhhccc
Confidence 477777776 9999999843221100 1244 45789999999999999885 4443 2234788899
Q ss_pred CCeEEeecC
Q psy16054 346 FFVYDASIT 354 (563)
Q Consensus 346 Gf~YDsS~~ 354 (563)
||.|||--.
T Consensus 84 ~~v~DSepa 92 (212)
T COG3531 84 GFVYDSEPA 92 (212)
T ss_pred CccccccHH
Confidence 999998753
No 61
>cd02992 PDI_a_QSOX PDIa family, Quiescin-sulfhydryl oxidase (QSOX) subfamily; QSOX is a eukaryotic protein containing an N-terminal redox active TRX domain, similar to that of PDI, and a small C-terminal flavin adenine dinucleotide (FAD)-binding domain homologous to the yeast ERV1p protein. QSOX oxidizes thiol groups to disulfides like PDI, however, unlike PDI, this oxidation is accompanied by the reduction of oxygen to hydrogen peroxide. QSOX is localized in high concentrations in cells with heavy secretory load and prefers peptides and proteins as substrates, not monothiols like glutathione. Inside the cell, QSOX is found in the endoplasmic reticulum and Golgi. The flow of reducing equivalents in a QSOX-catalyzed reaction goes from the dithiol substrate - dithiol of the QSOX TRX domain - dithiols of the QSOX ERV1p domain - FAD - oxygen.
Probab=25.72 E-value=1.6e+02 Score=25.51 Aligned_cols=50 Identities=18% Similarity=0.268 Sum_probs=30.7
Q ss_pred HHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCC-eEEEe
Q psy16054 433 HNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRND-VYFVT 482 (563)
Q Consensus 433 ~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~d-VwfvT 482 (563)
++|+.....++.+..+.||++|-..-..+...|+++.+.+...++ |.|+.
T Consensus 9 ~~f~~~i~~~~~~vvV~f~a~wC~~C~~~~~~~~~la~~~~~~~~~v~~~~ 59 (114)
T cd02992 9 ASFNSALLGSPSAWLVEFYASWCGHCRAFAPTWKKLARDLRKWRPVVRVAA 59 (114)
T ss_pred HhHHHHHhcCCCeEEEEEECCCCHHHHHHhHHHHHHHHHHHhcCCceEEEE
Confidence 344554455567899999999954333455666666666654444 65554
No 62
>PF13788 DUF4180: Domain of unknown function (DUF4180)
Probab=24.02 E-value=88 Score=27.65 Aligned_cols=30 Identities=30% Similarity=0.485 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHHcCCCeEEEeHH-HHHHHh
Q psy16054 461 FKDELIKFIEEMLDRNDVYFVTML-QVIQWM 490 (563)
Q Consensus 461 r~~al~~fl~~i~~~~dVwfvT~~-ei~~w~ 490 (563)
.-++|+.|+-+-..-.+|||+.-. ++++|+
T Consensus 83 ~S~~l~dfi~EsN~G~~~~F~~~~~eA~~~L 113 (113)
T PF13788_consen 83 TSKSLRDFIYESNRGNHFFFVPDEEEAIAWL 113 (113)
T ss_pred cchhHHHHHHHhcCCCeEEEECCHHHHHhhC
Confidence 457899999999999999999654 455574
No 63
>cd02334 ZZ_dystrophin Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
Probab=23.73 E-value=94 Score=23.13 Aligned_cols=36 Identities=22% Similarity=0.185 Sum_probs=28.7
Q ss_pred CceEEEEeccCCCHHHHHHHHHcCCeecccCCCCCC
Q psy16054 253 IKGTFFVSHKYTNYASVQELHRKGHEISVFSLTHKD 288 (563)
Q Consensus 253 ikaTFFv~~~~~~y~~v~~i~~~GHEIa~H~~~H~~ 288 (563)
|.+.-|-...-.+|++-+.-+..|.+.+.|..+|..
T Consensus 11 i~g~RykC~~C~d~DLC~~Cf~~g~~~~~H~~~Hp~ 46 (49)
T cd02334 11 ITGFRYRCLKCFNYDLCQSCFFSGRTSKSHKNSHPM 46 (49)
T ss_pred ceeeeEECCCCCCcCchHHHHhCCCcCCCCCCCCCe
Confidence 445445544556999999999999999999999974
No 64
>TIGR02026 BchE magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase. This model respresents the cobalamin-dependent oxidative cyclase responsible for forming the distinctive E-ring of the chlorin ring system under anaerobic conditions. This step is essential in the biosynthesis of both bacteriochlorophyll and chlorophyll under anaerobic conditions (a separate enzyme, AcsF, acts under aerobic conditions). This model identifies two clades of sequences, one from photosynthetic, non-cyanobacterial bacteria and another including Synechocystis and several non-photosynthetic bacteria. The function of the Synechocystis gene is supported by gene clustering with other photosynthetic genes, so the purpose of the gene in the non-photosynthetic bacteria is uncertain. Note that homologs of this gene are not found in plants which rely solely on the aerobic cyclase.
Probab=23.11 E-value=1.9e+02 Score=32.20 Aligned_cols=62 Identities=11% Similarity=0.149 Sum_probs=37.8
Q ss_pred CCChHHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHH
Q psy16054 421 IQTGEQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVI 487 (563)
Q Consensus 421 ~~~~~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~ 487 (563)
..+.+++.++++.++.+.|.. ++-+...++..=+. +..++.+|..|+...- .-||..+..+.
T Consensus 410 ~m~~~El~~~~~~~~~~fy~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 471 (497)
T TIGR02026 410 HMPRWEILLGVKLNYIRFYMR-KALWRYPFQPNKFS-RRYMLGCLKAFLKAGV---GRWFYDLGRRV 471 (497)
T ss_pred CCCHHHHHHHHHHHHHHHHHH-HHHHhcccccCHHH-HHHHHHHHHHHhhccc---ceeeeecCcee
Confidence 468899999999999998875 54443444422122 2345666667766544 34666554443
No 65
>PF13592 HTH_33: Winged helix-turn helix
Probab=22.64 E-value=1.4e+02 Score=22.91 Aligned_cols=18 Identities=11% Similarity=0.263 Sum_probs=14.5
Q ss_pred hhHHHHHHHcCCeEEeec
Q psy16054 336 NKQFSMMTDQFFVYDASI 353 (563)
Q Consensus 336 ~~~~~~L~e~Gf~YDsS~ 353 (563)
..+.++|+++||.|-..-
T Consensus 25 ~~v~~lL~r~G~s~~kp~ 42 (60)
T PF13592_consen 25 SGVYRLLKRLGFSYQKPR 42 (60)
T ss_pred HHHHHHHHHcCCccccCC
Confidence 468999999999886544
No 66
>cd06860 PX_SNX7_30_like The phosphoinositide binding Phox Homology domain of Sorting Nexins 7 and 30. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. This subfamily consists of SNX7, SNX30, and similar proteins. They harbor a Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature, C-terminal
Probab=22.17 E-value=1.2e+02 Score=26.74 Aligned_cols=49 Identities=18% Similarity=0.289 Sum_probs=31.5
Q ss_pred HHHHHHHHHHHHccCCc-ce----E-----EeeccccccCchhHHHHHHHHHHHHHcCCCeE
Q psy16054 428 GRLLRHNFNRHFSTNRA-PL----G-----LHFHASWLKSKKEFKDELIKFIEEMLDRNDVY 479 (563)
Q Consensus 428 ~~~l~~~F~~~y~~nra-p~----~-----l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVw 479 (563)
+.+|.+.+.+.|-+... |+ . -.+.+.|+. .|..+|++||..|+.+|.++
T Consensus 46 F~~L~~~L~~~~p~~~iPpLP~K~~~~~~~~~~~~~fie---~Rr~~Le~fL~~i~~hp~l~ 104 (116)
T cd06860 46 FLWLRQKLEESHPTHIIPPLPEKHSVKGLLDRFSPEFVA---TRMRALHKFLNRIVEHPVLS 104 (116)
T ss_pred HHHHHHHHHHHCCCCccCCCCCcchhhhhcccCCHHHHH---HHHHHHHHHHHHHHcCcccc
Confidence 56677777776654322 21 1 014544433 58899999999999988754
No 67
>PF12979 DUF3863: Domain of Unknown Function with PDB structure (DUF3863); InterPro: IPR024334 This entry represents a functionally uncharacterised domain found in bacterial sequences. The structure of one of the sequences, A6LGL1 from SWISSPROT from Parabacteroides distasonis atcc 8503, has been determined.; PDB: 3LM3_A.
Probab=20.98 E-value=2.5e+02 Score=28.44 Aligned_cols=101 Identities=17% Similarity=0.216 Sum_probs=48.4
Q ss_pred HHHHhccCCCCCCCCCCceEEEEeccC-----CCHHHHHHHH-----HcCCeecccCCCCCCCcCCCCCCCHHHHHHHHH
Q psy16054 237 YDEIFNGNRQNPNGCQIKGTFFVSHKY-----TNYASVQELH-----RKGHEISVFSLTHKDDPKYWSGGSYDDWLAEMA 306 (563)
Q Consensus 237 ~~~ll~~~r~n~n~~~ikaTFFv~~~~-----~~y~~v~~i~-----~~GHEIa~H~~~H~~~~~~~~~~s~~~~~~Ei~ 306 (563)
+.+.|. +.||++ +.|+=++-.. .+|.+|++-. +-|-||.---=-...+ | ++.++..+||.
T Consensus 46 ~reAfa--~~nP~g---rLTWgft~nALed~~~nyr~IR~y~v~c~~k~GDevsyfPGyFpam--Y---lpR~rVnRems 115 (351)
T PF12979_consen 46 LREAFA--SNNPDG---RLTWGFTLNALEDGRPNYRQIRDYVVECQQKYGDEVSYFPGYFPAM--Y---LPRERVNREMS 115 (351)
T ss_dssp HHHHHH--TT-TT------EEEE-HHHHH--SHHHHHHHHHHHHHHHHH--EEEE---S-HHH--H---S-HHHHHHHHH
T ss_pred HHHHHH--hcCCCc---eEEEeeecchhhcCChhHHHHHHHHHHHHHHhCCceeeccccchhh--c---CcHHHHHHHHH
Confidence 344554 457765 6776554432 2786665533 3588887432222222 2 68899999999
Q ss_pred HHHHHHHHHhCCCCCcceEEecCccccChhhHHHHHHHcCCeEE
Q psy16054 307 GGRLIIERFANISDGSVIGVRAPYLRVGGNKQFSMMTDQFFVYD 350 (563)
Q Consensus 307 ~~~~~l~~~~gi~~~~~~G~R~P~l~~~~~~~~~~L~e~Gf~YD 350 (563)
.+.++|.++.|.. ..|+..-.-||+. ++..-+.++.|+.-.
T Consensus 116 eAi~lis~~vg~g-yRP~si~gGFLsA--~Nl~ylaeke~Ih~a 156 (351)
T PF12979_consen 116 EAIQLISKMVGNG-YRPQSIMGGFLSA--DNLRYLAEKENIHTA 156 (351)
T ss_dssp HHHHHHHHHH-TT----SEEE-SS--H--HHHHHHHHHS---EE
T ss_pred HHHHHHHHHhcCC-CCcHHHHHhhcch--hhhhhhhhhcCceee
Confidence 9999999987632 2222223334443 455556777887543
No 68
>PF02456 Adeno_IVa2: Adenovirus IVa2 protein; InterPro: IPR003389 Va2 protein can interact with the adenoviral packaging signal and this interaction involves DNA sequences that have previously been demonstrated to be required for packaging []. During the course of lytic infection, the adenovirus major late promoter (MLP) is induced to high levels after replication of viral DNA has started. IVa2 is a transcriptional activator of the major late promoter [].; GO: 0019083 viral transcription
Probab=20.58 E-value=55 Score=34.19 Aligned_cols=31 Identities=26% Similarity=0.533 Sum_probs=23.7
Q ss_pred CCCCcccCCCCCCCCCCceEEeecCCcCCCCch
Q psy16054 203 ADGTRIPGGIEPNQVPQMITITFNGAVNVDNSD 235 (563)
Q Consensus 203 ~~g~ciP~~~~~~~~pq~V~lTFDD~~~~~~~~ 235 (563)
.+||.+| ..+.-.|.||.+||||+..+.|++
T Consensus 151 ~~gTi~P--~t~t~~P~Fv~msy~e~t~~~Nld 181 (369)
T PF02456_consen 151 PDGTIVP--QTGTFRPKFVEMSYDEATSPENLD 181 (369)
T ss_pred CCCeecc--ccccccccceeecHhhhCCccccC
Confidence 3466666 344568999999999999988765
No 69
>TIGR03646 YtoQ_fam YtoQ family protein. Members of this family are uncharacterized proteins, including YtoQ from Bacillus subtilis. This family shows some sequence similarity to a family of nucleoside 2-deoxyribosyltransferases (COG3613 as iterated through CDD), but sufficiently remote that PSI-BLAST starting from YtoQ and exploring outwards does not discover the relationship.
Probab=20.44 E-value=95 Score=28.22 Aligned_cols=52 Identities=15% Similarity=0.155 Sum_probs=34.4
Q ss_pred HHHHHHHHHHHHcc-CCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHHHHHh
Q psy16054 428 GRLLRHNFNRHFST-NRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQVIQWM 490 (563)
Q Consensus 428 ~~~l~~~F~~~y~~-nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei~~w~ 490 (563)
|+.|..+||.-|.. -+.|+ |.+|++-+. .+|++.=. .--+|.-|..|+++-+
T Consensus 87 YKQWNaAfDAg~aaAlgKpl-I~lh~~~~~------HpLKEvda----aA~avaetp~Qvv~iL 139 (144)
T TIGR03646 87 YKQWNAAFDAGYAAALGKPL-IILRPEELI------HPLKEVDN----KAQAVVETPEQAIETL 139 (144)
T ss_pred HHHHHHHhhHHHHHHcCCCe-EEecchhcc------ccHHHHhH----HHHHHhcCHHHHHHHH
Confidence 78899999999864 45688 889966444 23444332 2336777888887643
No 70
>COG4637 Predicted ATPase [General function prediction only]
Probab=20.13 E-value=2.4e+02 Score=29.75 Aligned_cols=60 Identities=15% Similarity=0.112 Sum_probs=44.6
Q ss_pred HHHHHHHHHHHHHHHccCCcceEEeeccccccCchhHHHHHHHHHHHHHcCCCeEEEeHHHH
Q psy16054 425 EQFGRLLRHNFNRHFSTNRAPLGLHFHASWLKSKKEFKDELIKFIEEMLDRNDVYFVTMLQV 486 (563)
Q Consensus 425 ~~~~~~l~~~F~~~y~~nrap~~l~lH~~~l~g~p~r~~al~~fl~~i~~~~dVwfvT~~ei 486 (563)
|..+++|.-. .+...-|.|-.|.+..|..+-.|..+.+|.+++..++.+.-||++|.+.=
T Consensus 273 DGTlRfl~l~--t~Llsp~~p~ll~ldEPE~sLHP~lL~~La~~~~sAak~sQv~VsTHS~r 332 (373)
T COG4637 273 DGTLRFLALA--TLLLSPRPPPLLLLDEPETSLHPDLLPALAELMRSAAKRSQVIVSTHSPR 332 (373)
T ss_pred ccHHHHHHHH--HHHcCCCCCceeEecCcccccCHhHHHHHHHHHHHhhccceEEEEeCCHH
Confidence 3345554332 22333456666888888888889999999999999999999999998764
Done!