RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14882
(743 letters)
>gnl|CDD|119351 cd02872, GH18_chitolectin_chitotriosidase, This conserved domain
family includes a large number of catalytically inactive
chitinase-like lectins (chitolectins) including YKL-39,
YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic
mammalian chitinase), as well as catalytically active
chitotriosidases. The conserved domain is an
eight-stranded alpha/beta barrel fold belonging to the
family 18 glycosyl hydrolases. The fold has a
pronounced active-site cleft at the C-terminal end of
the beta-barrel. The chitolectins lack a key active
site glutamate (the proton donor required for hydrolytic
activity) but retain highly conserved residues involved
in oligosaccharide binding. Chitotriosidase is a
chitinolytic enzyme expressed in maturing macrophages,
which suggests that it plays a part in antimicrobial
defense. Chitotriosidase hydrolyzes chitotriose, as
well as colloidal chitin to yield chitobiose and is
therefore considered an exochitinase. Chitotriosidase
occurs in two major forms, the large form being
converted to the small form by either RNA or
post-translational processing. Although the small form,
containing the chitinase domain alone, is sufficient for
the chitinolytic activity, the additional C-terminal
chitin-binding domain of the large form plays a role in
processing colloidal chitin. The chitotriosidase gene is
nonessential in humans, as about 35% of the population
are heterozygous and 6% homozygous for an inactivated
form of the gene. HCGP39 is a 39-kDa human cartilage
glycoprotein thought to play a role in connective tissue
remodeling and defense against pathogens.
Length = 362
Score = 347 bits (892), Expect = e-113
Identities = 125/221 (56%), Positives = 153/221 (69%), Gaps = 11/221 (4%)
Query: 30 VVCYYTNWSQYRTKIGKFQPEDIEPDLCTHIIFAFGWLKKGKLSSFESNDETKDGKVGLY 89
VVCY+TNW+QYR GKF PE+I+P LCTHII+AF L + DE D +GLY
Sbjct: 1 VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGL--NPDGNIIILDEWNDIDLGLY 58
Query: 90 ERIEQLKKANPKLKTLLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDI 149
ER LK+ NP LKTLLAIGGW+FG+ KF M+ S +R+TFI SA+ FLR+ FDGLD+
Sbjct: 59 ERFNALKEKNPNLKTLLAIGGWNFGSAKFSAMAASPENRKTFIKSAIAFLRKYGFDGLDL 118
Query: 150 DWEYPAG----TDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDV 205
DWEYP +DK+N+V LLKELREAFE EA PRLLLTAAV G + I YD+
Sbjct: 119 DWEYPGQRGGPPEDKENFVTLLKELREAFEPEA-----PRLLLTAAVSAGKETIDAAYDI 173
Query: 206 PAVAGYLDFINLMAYDFHGKWESQVGHNAPLRSPSSDSEWQ 246
P ++ YLDFIN+M YDFHG WE GHN+PL + S+D+ Q
Sbjct: 174 PEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSADTGDQ 214
Score = 346 bits (891), Expect = e-113
Identities = 127/312 (40%), Positives = 162/312 (51%), Gaps = 92/312 (29%)
Query: 247 LREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYDFHGKWESQ 306
LREAFE EA PRLLLTAAV G + I YD+P ++ YLDFIN+M YDFHG WE
Sbjct: 143 LREAFEPEA-----PRLLLTAAVSAGKETIDAAYDIPEISKYLDFINVMTYDFHGSWEGV 197
Query: 307 VGHNAPLRSPSSDSEWRKQLSVENAANMWVRLGAPKEQASFTLVDPTKFDIGAPASGGGE 366
GHN+PL + S+D+ +K L+V+ A W+ GA
Sbjct: 198 TGHNSPLYAGSADTGDQKYLNVDYAIKYWLSKGA-------------------------- 231
Query: 367 PGKYTAEAGFMSYYEKYLPKEKLIIGMPTYGRTFTLTNPAVFKVNSPASGGGKAGEYTIS 426
P EKL++G+PTYGR+FTL +
Sbjct: 232 ------------------PPEKLVLGIPTYGRSFTLAS---------------------- 251
Query: 427 KEKLMIGMPTYGRSFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEVCEMLRNGAGY 486
P+ +GAPASG G G YT EAGF++YYE+CE L++G
Sbjct: 252 -------------------PSNTGVGAPASGPGTAGPYTREAGFLAYYEICEFLKSGWTV 292
Query: 487 VWDDEMKVPYLVHGDQWVGFDDEKSIRYKMNWLKDNGYGGAMVWTVDMDDFTGTVCGGDV 546
VWDDE KVPY G+QWVG+DDE+SI K+ +LK G GGAMVW++D+DDF GT G
Sbjct: 293 VWDDEQKVPYAYKGNQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDLDDFRGTCGQG-- 350
Query: 547 KYPLIGAMREEL 558
KYPL+ A+ L
Sbjct: 351 KYPLLNAINRAL 362
Score = 29.4 bits (67), Expect = 7.4
Identities = 7/9 (77%), Positives = 9/9 (100%)
Query: 731 VVCYYTNWS 739
VVCY+TNW+
Sbjct: 1 VVCYFTNWA 9
>gnl|CDD|214753 smart00636, Glyco_18, Glyco_18 domain.
Length = 334
Score = 254 bits (650), Expect = 3e-78
Identities = 105/218 (48%), Positives = 128/218 (58%), Gaps = 12/218 (5%)
Query: 29 EVVCYYTNWSQYRTKIGKFQPEDIEPDLCTHIIFAFGWLKKGKLSSFESNDETKDGKVGL 88
VV Y+TNW Y F +DI THII+AF + + DE D +G
Sbjct: 1 RVVGYFTNWGVYGRN---FPVDDIPASKLTHIIYAFANIDPDG--TVTIGDEWAD--IGN 53
Query: 89 YERIEQLKKANPKLKTLLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLD 148
+ +++ LKK NP LK LL+IGGW+ F M SR+ FI S V FL++ FDG+D
Sbjct: 54 FGQLKALKKKNPGLKVLLSIGGWTES-DNFSSMLSDPASRKKFIDSIVSFLKKYGFDGID 112
Query: 149 IDWEYPAG-TDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYD-VP 206
IDWEYP G DD++NY LLKELREA + E E LLT AVP GPD I GY +P
Sbjct: 113 IDWEYPGGRGDDRENYTALLKELREALDKEGAE--GKGYLLTIAVPAGPDKIDKGYGDLP 170
Query: 207 AVAGYLDFINLMAYDFHGKWESQVGHNAPLRSPSSDSE 244
A+A YLDFINLM YDFHG W + GHNAPL + D E
Sbjct: 171 AIAKYLDFINLMTYDFHGAWSNPTGHNAPLYAGPGDPE 208
Score = 212 bits (542), Expect = 1e-62
Identities = 94/292 (32%), Positives = 125/292 (42%), Gaps = 95/292 (32%)
Query: 247 LREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYD-VPAVAGYLDFINLMAYDFHGKWES 305
LREA + E E LLT AVP GPD I GY +PA+A YLDFINLM YDFHG W +
Sbjct: 135 LREALDKEGAE--GKGYLLTIAVPAGPDKIDKGYGDLPAIAKYLDFINLMTYDFHGAWSN 192
Query: 306 QVGHNAPLRSPSSDSEWRKQLSVENAANMWVRLGAPKEQASFTLVDPTKFDIGAPASGGG 365
GHNAPL + D ++ +V+ A ++ G
Sbjct: 193 PTGHNAPLYAGPGD---PEKYNVDYAVKYYLCKGV------------------------- 224
Query: 366 EPGKYTAEAGFMSYYEKYLPKEKLIIGMPTYGRTFTLTNPAVFKVNSPASGGGKAGEYTI 425
P KL++G+P Y
Sbjct: 225 -------------------PPSKLVLGIPFY----------------------------- 236
Query: 426 SKEKLMIGMPTYGRSFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEVCEMLRNGAG 485
GR +TLVD + GAP +G G T E G + Y E+C++L GA
Sbjct: 237 ------------GRGWTLVDGSNNGPGAPFTGPATGGPGTWEGGVVDYREICKLL--GAT 282
Query: 486 YVWDDEMKVPYLVHGD--QWVGFDDEKSIRYKMNWLKDNGYGGAMVWTVDMD 535
V+DD K PY + QWV +DD +SI+ K +++KD G GG M+W +D D
Sbjct: 283 VVYDDTAKAPYAYNPGTGQWVSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334
>gnl|CDD|216071 pfam00704, Glyco_hydro_18, Glycosyl hydrolases family 18.
Length = 325
Score = 223 bits (570), Expect = 7e-67
Identities = 98/217 (45%), Positives = 124/217 (57%), Gaps = 13/217 (5%)
Query: 29 EVVCYYTNWSQYRTKIGKFQPEDIEPDLCTHIIFAFGWLKKGKLSSFESNDETKDGKVGL 88
+V YYT W Y F +DI D THII+AF + G + + D DG G
Sbjct: 2 RIVGYYTQWGNYGEG---FPLDDIPTDKLTHIIYAFANID-GNGTFGNNADTEDDGLKGC 57
Query: 89 YERIEQLK-KANPKLKTLLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGL 147
+E+++ LK NP +K LL+IGGW+F F ++ R+TF S + FL++ FDG+
Sbjct: 58 FEQLKDLKKCQNPGVKVLLSIGGWTFSG-GFSLLASDDAKRKTFADSIIDFLKKYGFDGI 116
Query: 148 DIDWEYPAGT-DDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVP 206
DIDWEYP G DDK NY LLKELR A + E K LL+AAVP GPD I G D+
Sbjct: 117 DIDWEYPGGKGDDKDNYTALLKELRAALKKE----AKAGYLLSAAVPAGPDKID-GSDIA 171
Query: 207 AVAGYLDFINLMAYDFHGKWESQVGHNAPLRSPSSDS 243
+ YLDFINLM YDFHG W + G NAPL S +
Sbjct: 172 KIGKYLDFINLMTYDFHG-WSNITGPNAPLYDGSWQN 207
Score = 167 bits (425), Expect = 2e-46
Identities = 88/290 (30%), Positives = 121/290 (41%), Gaps = 103/290 (35%)
Query: 246 QLREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYDFHGKWES 305
+LR A + E K LL+AAVP GPD I G D+ + YLDFINLM YDFHG W +
Sbjct: 139 ELRAALKKE----AKAGYLLSAAVPAGPDKIDG-SDIAKIGKYLDFINLMTYDFHG-WSN 192
Query: 306 QVGHNAPLRSPSSDSEWRKQLSVENAANMWVRLGAPKEQASFTLVDPTKFDIGAPASGGG 365
G NAPL S +V+ A +++ G
Sbjct: 193 ITGPNAPLYDGSWQ-------NVDYAVQYYLKAGV------------------------- 220
Query: 366 EPGKYTAEAGFMSYYEKYLPKEKLIIGMPTYGRTFTLTNPAVFKVNSPASGGGKAGEYTI 425
P KL++G+P YGR +TL N + +PA
Sbjct: 221 -------------------PASKLVLGIPFYGRGWTLVNGSGNGGGAPA----------- 250
Query: 426 SKEKLMIGMPTYGRSFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEVCEMLRNGAG 485
PG T E G +SY E+C +L++GAG
Sbjct: 251 -----------------------------------PGPGTWEGGILSYKEICALLKSGAG 275
Query: 486 YVWDDEMKVPYLVHGDQWVGFDDEKSIRYKMNWLKDNGYGGAMVWTVDMD 535
+DD K PY+ G Q+V +DD +SI+ K ++KD G GG M+W++D D
Sbjct: 276 PGYDDTAKAPYIYKGKQFVSYDDPRSIKAKAKYVKDKGLGGVMIWSLDQD 325
>gnl|CDD|119365 cd06548, GH18_chitinase, The GH18 (glycosyl hydrolases, family 18)
type II chitinases hydrolyze chitin, an abundant polymer
of N-acetylglucosamine and have been identified in
bacteria, fungi, insects, plants, viruses, and protozoan
parasites. The structure of this domain is an
eight-stranded alpha/beta barrel with a pronounced
active-site cleft at the C-terminal end of the
beta-barrel.
Length = 322
Score = 207 bits (530), Expect = 5e-61
Identities = 91/237 (38%), Positives = 122/237 (51%), Gaps = 30/237 (12%)
Query: 30 VVCYYTNWSQYRTKIGKFQPE-DIEPDLCTHIIFAFG--------------WLKKGKLSS 74
VV Y+TNW Y + DI D THI +AF + S
Sbjct: 1 VVGYFTNWGIY---GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSV 57
Query: 75 FESNDETKDGKVGLYERIEQLKKANPKLKTLLAIGGWSFGTQKFKEMSKSRYSRQTFIYS 134
D G + ++ +LK+ NP LK LL+IGGW++ F + + + SR F S
Sbjct: 58 DGGADTDDQPLKGNFGQLRKLKQKNPHLKILLSIGGWTWS-GGFSDAAATEASRAKFADS 116
Query: 135 AVPFLRQRNFDGLDIDWEYPAGT---------DDKKNYVLLLKELREAFEAEAQEVKKPR 185
AV F+R+ FDG+DIDWEYP +DK+N+ LLLKELREA +A E +
Sbjct: 117 AVDFIRKYGFDGIDIDWEYPGSGGAPGNVARPEDKENFTLLLKELREALDALGAETGRK- 175
Query: 186 LLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYDFHGKWESQVGHNAPLRSPSSD 242
LLT A P GPD + +V +A YLDFINLM YDFHG W + GH++ L + +D
Sbjct: 176 YLLTIAAPAGPDKLDKL-EVAEIAKYLDFINLMTYDFHGAWSNTTGHHSNLYASPAD 231
Score = 99.2 bits (248), Expect = 1e-22
Identities = 49/155 (31%), Positives = 66/155 (42%), Gaps = 48/155 (30%)
Query: 247 LREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYDFHGKWESQ 306
LREA +A E + LLT A P GPD + +V +A YLDFINLM YDFHG W +
Sbjct: 161 LREALDALGAETGRK-YLLTIAAPAGPDKLDKL-EVAEIAKYLDFINLMTYDFHGAWSNT 218
Query: 307 VGHNAPLRSPSSDSEWRKQLSVENAANMWVRLGAPKEQASFTLVDPTKFDIGAPASGGGE 366
GH++ L + +D SV+ A N ++ G
Sbjct: 219 TGHHSNLYASPADPPGG--YSVDAAVNYYLSAGV-------------------------- 250
Query: 367 PGKYTAEAGFMSYYEKYLPKEKLIIGMPTYGRTFT 401
P EKL++G+P YGR +T
Sbjct: 251 ------------------PPEKLVLGVPFYGRGWT 267
Score = 75.0 bits (185), Expect = 1e-14
Identities = 29/113 (25%), Positives = 45/113 (39%), Gaps = 42/113 (37%)
Query: 425 ISKEKLMIGMPTYGRSFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEVCEMLRNGA 484
+ EKL++G+P YGR +T G
Sbjct: 250 VPPEKLVLGVPFYGRGWT----------------------------------------GY 269
Query: 485 GYVWDDEMKVPYLVHGD--QWVGFDDEKSIRYKMNWLKDNGYGGAMVWTVDMD 535
WD+ K PYL + ++ +DD +SI+ K +++KD G GG M W + D
Sbjct: 270 TRYWDEVAKAPYLYNPSTKTFISYDDPRSIKAKADYVKDKGLGGVMFWELSGD 322
>gnl|CDD|225862 COG3325, ChiA, Chitinase [Carbohydrate transport and metabolism].
Length = 441
Score = 178 bits (452), Expect = 8e-49
Identities = 91/242 (37%), Positives = 119/242 (49%), Gaps = 33/242 (13%)
Query: 30 VVCYYTNWSQYRTKIGKFQPEDIEPDLCTHIIFAF-GWLKKGKLSSFESNDETKDGKV-- 86
VV YYT+WSQY + P DI D THI +AF GK DE V
Sbjct: 40 VVGYYTSWSQY--DRQDYFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAALYGVPN 97
Query: 87 --------------GLYERIEQLKKANPKLKTLLAIGGWSFGTQKFKEMSKSRYSRQTFI 132
G + + LK P LKTL++IGGWS + F +M+ SR+ F
Sbjct: 98 IEGVELDPWSDPLKGHFGALFDLKATYPDLKTLISIGGWSD-SGGFSDMAADDASRENFA 156
Query: 133 YSAVPFLRQRNFDGLDIDWEYPAG---------TDDKKNYVLLLKELREA-FEAEAQEVK 182
SAV F+R FDG+DIDWEYP DK NYVLLL+ELR+ +A ++ +
Sbjct: 157 KSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKDKANYVLLLQELRKKLDKAGVEDGR 216
Query: 183 KPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYDFHGKWESQVGHNAPLRSPSSD 242
LT A P D ++ G + +A Y+D+IN+M YDFHG W +GH+A L D
Sbjct: 217 --HYQLTIAAPASKDKLE-GLNHAEIAQYVDYINIMTYDFHGAWNETLGHHAALYGTPKD 273
Query: 243 SE 244
Sbjct: 274 PP 275
Score = 102 bits (257), Expect = 3e-23
Identities = 72/320 (22%), Positives = 111/320 (34%), Gaps = 86/320 (26%)
Query: 247 LREA-FEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYDFHGKWES 305
LR+ +A ++ + LT A P D ++G + +A Y+D+IN+M YDFHG W
Sbjct: 203 LRKKLDKAGVEDGR--HYQLTIAAPASKDKLEG-LNHAEIAQYVDYINIMTYDFHGAWNE 259
Query: 306 QVGHNAPLRSPSSDSEWRKQLSVENAANMWVRLGAPKEQASFTLVDPTKFDIGAPASGGG 365
+GH+A L D A G
Sbjct: 260 TLGHHAALYGTPKDPP------------------------------LANGGFYVDAEVDG 289
Query: 366 EPGKYTAEAGFMSYYEKYLPKEKLIIGMPTYGRTFTLTNPAVFK-VNSPASGGGKAGEYT 424
AG + P KL++GMP YGR + + G +G
Sbjct: 290 IDWLEEGFAGDV-------PPSKLVLGMPFYGRGWNGVDGGSLGTCPGLYQGLDNSGIPK 342
Query: 425 ISKEKLMIGMPTYGRSFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEVCEMLRNGA 484
+ E G YG+++ L + G+ Y
Sbjct: 343 GTWEA-GNGDKDYGKAYDL-----------------DANNAGKNGYERY----------- 373
Query: 485 GYVWDDEMKVPYLVHGD--QWVGFDDEKSIRYKMNWLKDNGYGGAMVWTVDMDDFTGTVC 542
WDD K PYL + + ++ +DD +S++ K ++ DN GG M W +
Sbjct: 374 ---WDDVAKAPYLYNPEKGVFISYDDPRSVKAKAEYVADNNLGGMMFWEIS--------- 421
Query: 543 GGDVKYPLIGAMREELNGIP 562
GD L+ A+ E L P
Sbjct: 422 -GDENGVLLNAVNEGLGFNP 440
Score = 29.7 bits (67), Expect = 5.8
Identities = 15/46 (32%), Positives = 22/46 (47%), Gaps = 4/46 (8%)
Query: 698 KANKDDPKYKVSLVVVGSSKSSV----KANKDDSKYKVVCYYTNWS 739
KAN + +V + KA+ D ++KVV YYT+WS
Sbjct: 3 KANGRSDIIRDDFSLVVHRVPADVVNNKAHTSDDQFKVVGYYTSWS 48
>gnl|CDD|119349 cd00598, GH18_chitinase-like, The GH18 (glycosyl hydrolase, family
18) type II chitinases hydrolyze chitin, an abundant
polymer of beta-1,4-linked N-acetylglucosamine (GlcNAc)
which is a major component of the cell wall of fungi and
the exoskeleton of arthropods. Chitinases have been
identified in viruses, bacteria, fungi, protozoan
parasites, insects, and plants. The structure of the
GH18 domain is an eight-stranded beta/alpha barrel with
a pronounced active-site cleft at the C-terminal end of
the beta-barrel. The GH18 family includes
chitotriosidase, chitobiase, hevamine, zymocin-alpha,
narbonin, SI-CLP (stabilin-1 interacting chitinase-like
protein), IDGF (imaginal disc growth factor), CFLE
(cortical fragment-lytic enzyme) spore hydrolase, the
type III and type V plant chitinases, the
endo-beta-N-acetylglucosaminidases, and the
chitolectins. The GH85 (glycosyl hydrolase, family 85)
ENGases (endo-beta-N-acetylglucosaminidases) are closely
related to the GH18 chitinases and are included in this
alignment model.
Length = 210
Score = 166 bits (423), Expect = 2e-47
Identities = 77/195 (39%), Positives = 100/195 (51%), Gaps = 21/195 (10%)
Query: 30 VVCYYTNWSQYRTKIGKFQPEDIEPDLCTHIIFAF-GWLKKGKLSSFESNDETKDGKVGL 88
V+CYY WS R P DI LCTHII+AF G L+ F E
Sbjct: 1 VICYYDGWSSGRG----PDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDKSEEPL----- 51
Query: 89 YERIEQLKKANPKLKTLLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLD 148
+E+L P LK L++IGGW+ + + SR F S V FL+ FDG+D
Sbjct: 52 KGALEELASKKPGLKVLISIGGWTDSSPFTLASDPA--SRAAFANSLVSFLKTYGFDGVD 109
Query: 149 IDWEYP--AGTDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVP 206
IDWEYP A D++N++ LL+ELR A A LLT AVP ++ YDVP
Sbjct: 110 IDWEYPGAADNSDRENFITLLRELRSALGAA-------NYLLTIAVPASYFDLGYAYDVP 162
Query: 207 AVAGYLDFINLMAYD 221
A+ Y+DF+N+M YD
Sbjct: 163 AIGDYVDFVNVMTYD 177
Score = 59.3 bits (144), Expect = 6e-10
Identities = 42/162 (25%), Positives = 60/162 (37%), Gaps = 30/162 (18%)
Query: 151 WEYPAGTDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVAG 210
E + + L+ A E A + ++L++ G + A
Sbjct: 32 AEISSDGSLNLFGDKSEEPLKGALEELASKKPGLKVLISI---GGWTDSSPFTLASDPAS 88
Query: 211 YLDFIN-----LMAYDFHG---KWESQVGHNAPLRSPSSDSEW------QLREAFEAEAQ 256
F N L Y F G WE P + +SD E +LR A A
Sbjct: 89 RAAFANSLVSFLKTYGFDGVDIDWE------YPGAADNSDRENFITLLRELRSALGAA-- 140
Query: 257 EVKKPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYD 298
LLT AVP ++ YDVPA+ Y+DF+N+M YD
Sbjct: 141 -----NYLLTIAVPASYFDLGYAYDVPAIGDYVDFVNVMTYD 177
Score = 39.3 bits (92), Expect = 0.003
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 511 SIRYKMNWLKDNGYGGAMVWTVDMD 535
S+ K + K G GG M+W +D D
Sbjct: 186 SLGAKAKYAKQKGLGGVMIWELDQD 210
>gnl|CDD|119358 cd02879, GH18_plant_chitinase_class_V, The class V plant chitinases
have a glycosyl hydrolase family 18 (GH18) domain, but
lack the chitin-binding domain present in other GH18
enzymes. The GH18 domain of the class V chitinases has
endochitinase activity in some cases and no catalytic
activity in others. Included in this family is a lectin
found in black locust (Robinia pseudoacacia) bark, which
binds chitin but lacks chitinase activity. Also
included is a chitinase-related receptor-like kinase
(CHRK1) from tobacco (Nicotiana tabacum), with an
N-terminal GH18 domain and a C-terminal kinase domain,
which is thought to be part of a plant signaling
pathway. The GH18 domain of CHRK1 is expressed
extracellularly where it binds chitin but lacks
chitinase activity.
Length = 299
Score = 147 bits (374), Expect = 1e-39
Identities = 74/203 (36%), Positives = 111/203 (54%), Gaps = 9/203 (4%)
Query: 45 GKFQPEDIEPDLCTHIIFAFGWLKKGKLSSFESNDETKDGKVGLYERIEQLKKANPKLKT 104
+F P +I+ L TH+ +AF L S++E D E +K+ NP +KT
Sbjct: 14 EEFPPSNIDSSLFTHLFYAFADLDP---STYEVVISPSDESE-FSTFTETVKRKNPSVKT 69
Query: 105 LLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDIDWEYPAGTDDKKNYV 164
LL+IGG + F M+ +R+ FI S++ R+ FDGLD+DWE+P+ + +N+
Sbjct: 70 LLSIGGGGSDSSAFAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFPSSQVEMENFG 129
Query: 165 LLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGG----YDVPAVAGYLDFINLMAY 220
LL+E R A + EA+ +P LLLTAAV P Y + A+ LD++N+MAY
Sbjct: 130 KLLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSDDSVSYPIEAINKNLDWVNVMAY 189
Query: 221 DFHGKWESQV-GHNAPLRSPSSD 242
D++G WES G A L P+S+
Sbjct: 190 DYYGSWESNTTGPAAALYDPNSN 212
Score = 80.9 bits (200), Expect = 1e-16
Identities = 52/204 (25%), Positives = 75/204 (36%), Gaps = 76/204 (37%)
Query: 220 YDFHG---KWESQVGHNAPLRSPSSDSEW--------QLREAFEAEAQEVKKPRLLLTAA 268
Y F G WE PSS E + R A + EA+ +P LLLTAA
Sbjct: 107 YGFDGLDLDWEF----------PSSQVEMENFGKLLEEWRAAVKDEARSSGRPPLLLTAA 156
Query: 269 VPVGPDNIKGG----YDVPAVAGYLDFINLMAYDFHGKWESQV-GHNAPLRSPSSDSEWR 323
V P Y + A+ LD++N+MAYD++G WES G A L P+S
Sbjct: 157 VYFSPILFLSDDSVSYPIEAINKNLDWVNVMAYDYYGSWESNTTGPAAALYDPNS----- 211
Query: 324 KQLSVENAANMWVRLGAPKEQASFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEKY 383
+ + G S+ +
Sbjct: 212 ---------------------------------------------NVSTDYGIKSWIKAG 226
Query: 384 LPKEKLIIGMPTYGRTFTLTNPAV 407
+P +KL++G+P YGR +TL +
Sbjct: 227 VPAKKLVLGLPLYGRAWTLYDTTT 250
Score = 50.1 bits (120), Expect = 2e-06
Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 44/109 (40%)
Query: 428 EKLMIGMPTYGRSFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEVCEMLRNGAGYV 487
+KL++G+P YGR++TL D T +S Y Y
Sbjct: 230 KKLVLGLPLYGRAWTLYDTT----------------------TVSSY----------VYA 257
Query: 488 WDDEMKVPYLVHGDQWVGFDDEKSIRYKMNWLKDNGYGGAMVWTVDMDD 536
G W+G+DD +SI K+ + K G G W V DD
Sbjct: 258 ------------GTTWIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294
>gnl|CDD|119352 cd02873, GH18_IDGF, The IDGF's (imaginal disc growth factors) are a
family of growth factors identified in insects that
include at least five members, some of which are encoded
by genes in a tight cluster. The IDGF's have an
eight-stranded alpha/beta barrel fold and are related to
the glycosyl hydrolase family 18 (GH18) chitinases, but
they have an amino acid substitution known to abolish
chitinase catalytic activity. IDGFs may have evolved
from chitinases to gain new functions as growth factors,
interacting with cell surface glycoproteins involved in
growth-promoting processes.
Length = 413
Score = 113 bits (284), Expect = 1e-26
Identities = 68/231 (29%), Positives = 103/231 (44%), Gaps = 48/231 (20%)
Query: 30 VVCYYTNWSQYRTKIGKFQPEDIEPDL--CTHIIFAFGWLKKGKLSSFESNDETKDGKVG 87
+VCYY + S R + K ED+EP L CTH+++ + + S +E D
Sbjct: 2 LVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIK-SLNEDLDLDKS 60
Query: 88 LYERIEQLKKANPKLKTLLAIGGWSF-----GTQKFKEMSKSRYSRQTFIYSAVPFLRQR 142
Y I LK+ P LK LL++GG +K+ + +S SR FI SA L+
Sbjct: 61 HYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKTY 120
Query: 143 NFDGLDIDWEYP-------------------------AGTDDK-----KNYVLLLKELRE 172
FDGLD+ W++P + D+K + + L++EL+
Sbjct: 121 GFDGLDLAWQFPKNKPKKVRGTFGSAWHSFKKLFTGDSVVDEKAAEHKEQFTALVRELKN 180
Query: 173 AFEAEAQEVKKPRLLLTAAVPVGPD-NIKGGYDVPAVAGYLDFINLMAYDF 222
A + LLT + V P N +DVPA+A +DF+NL +DF
Sbjct: 181 ALRPDGL-------LLT--LTVLPHVNSTWYFDVPAIANNVDFVNLATFDF 222
Score = 88.9 bits (221), Expect = 1e-18
Identities = 66/296 (22%), Positives = 102/296 (34%), Gaps = 102/296 (34%)
Query: 280 YDVPAVAGYLDFINLMAYDFHGKWESQVGHNAPLRSPSSDSEWRKQLSVENAANMWVRLG 339
+DVPA+A +DF+NL +DF P R+P
Sbjct: 203 FDVPAIANNVDFVNLATFDFL----------TPERNP----------------------- 229
Query: 340 APKEQASFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEKYLPKEKLIIGMPTYGRT 399
E+A +T P + + + + P KL +G+ TYGR
Sbjct: 230 ---EEADYTA--PIYELYERNPH-------HNVDYQVKYWLNQGTPASKLNLGIATYGRA 277
Query: 400 FTLTNPAVFKVNSPASGGGKAGEYTISKEKLMIGMPTYGRSFTLVDPTKFDIGAPASGGG 459
+ LT +G I+ G+P G G
Sbjct: 278 WKLT--------------KDSG---IT------GVP---------------PVLETDGPG 299
Query: 460 EPGKYTAEAGFMSYYEVCEMLRNGAGYVWDDEM--KV-------------PYLVHGDQ-- 502
G T G +S+ E+C L N A D KV P +G+
Sbjct: 300 PAGPQTKTPGLLSWPEICSKLPNPANLKGADAPLRKVGDPTKRFGSYAYRPADENGEHGI 359
Query: 503 WVGFDDEKSIRYKMNWLKDNGYGGAMVWTVDMDDFTGTVCGGDVKYPLIGAMREEL 558
WV ++D + K + K G GG ++ + +DDF G C GD K+P++ + + L
Sbjct: 360 WVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDFRGQ-CTGD-KFPILRSAKYRL 413
>gnl|CDD|119357 cd02878, GH18_zymocin_alpha, Zymocin, alpha subunit. Zymocin is a
heterotrimeric enzyme that inhibits yeast cell cycle
progression. The zymocin alpha subunit has a chitinase
activity that is essential for holoenzyme action from
the cell exterior while the gamma subunit contains the
intracellular toxin responsible for G1 phase cell cycle
arrest. The zymocin alpha and beta subunits are thought
to act from the cell's exterior by docking to the cell
wall-associated chitin, thus mediating gamma-toxin
translocation. The alpha subunit has an eight-stranded
TIM barrel fold similar to that of family 18 glycosyl
hydrolases such as hevamine, chitolectin, and
chitobiase.
Length = 345
Score = 85.8 bits (213), Expect = 5e-18
Identities = 57/185 (30%), Positives = 86/185 (46%), Gaps = 36/185 (19%)
Query: 58 THIIFAFGWLKKGKLSSFESNDETKDGKVGLYERIEQLKKANPKLKTLLAIGGWSFGT-- 115
THI FAF + S F + + + + ++LK +K +L+ GGW F T
Sbjct: 29 THIHFAFANIT----SDFSVDVSSVQEQ---FSDFKKLKG----VKKILSFGGWDFSTSP 77
Query: 116 ---QKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDIDWEYPAGTD----------DKKN 162
Q F++ K +R TF + V F+ + N DG+D DWEYP D D KN
Sbjct: 78 STYQIFRDAVKPA-NRDTFANNVVNFVNKYNLDGVDFDWEYPGAPDIPGIPAGDPDDGKN 136
Query: 163 YVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYDF 222
Y+ LK L+ + L+ A P +K G+ + +A Y+D+I M YD
Sbjct: 137 YLEFLKLLKSKLPSG--------KSLSIAAPASYWYLK-GFPIKDMAKYVDYIVYMTYDL 187
Query: 223 HGKWE 227
HG+W+
Sbjct: 188 HGQWD 192
Score = 58.1 bits (141), Expect = 6e-09
Identities = 35/123 (28%), Positives = 59/123 (47%), Gaps = 9/123 (7%)
Query: 419 KAGEYTISKEKLMIGMPTYGRSFTLVDPTKFDIGAPASG---GGEPGKYTAEAGFMSYYE 475
KAG + K+++G+ +YGRSF + DP G +G G E G+ T AG+ + E
Sbjct: 226 KAG---VPSNKVVVGVASYGRSFKMADPGCTGPGCTFTGPGSGAEAGRCTCTAGYGAISE 282
Query: 476 VCEMLRNGAGY--VWDDEMKVPYLVHGD-QWVGFDDEKSIRYKMNWLKDNGYGGAMVWTV 532
+ + + + +D + LV+ D QWV + + ++ W K +GG W V
Sbjct: 283 IEIIDISKSKNKRWYDTDSDSDILVYDDDQWVAYMSPATKAARIEWYKGLNFGGTSDWAV 342
Query: 533 DMD 535
D+
Sbjct: 343 DLQ 345
>gnl|CDD|119353 cd02874, GH18_CFLE_spore_hydrolase, Cortical fragment-lytic enzyme
(CFLE) is a peptidoglycan hydrolase involved in
bacterial endospore germination. CFLE is expressed as
an inactive preprotein (called SleB) in the forespore
compartment of sporulating cells. SleB translocates
across the forespore inner membrane and is deposited as
a mature enzyme in the cortex layer of the spore. As
part of a sensory mechanism capable of initiating
germination, CFLE degrades a spore-specific
peptidoglycan constituent called muramic-acid
delta-lactam that comprises the outer cortex. CFLE has
a C-terminal glycosyl hydrolase family 18 (GH18)
catalytic domain as well as two N-terminal LysM
peptidoglycan-binding domains. In addition to SleB,
this family includes YaaH, YdhD, and YvbX from Bacillus
subtilis.
Length = 313
Score = 66.5 bits (163), Expect = 9e-12
Identities = 30/103 (29%), Positives = 48/103 (46%), Gaps = 11/103 (10%)
Query: 128 RQTFIYSAVPFLRQRNFDGLDIDWEYPAGTDDKKNYVLLLKELREAFEAEAQEVKKPRLL 187
RQ I + + ++ +DG++ID+E +D++ Y L+EL + L
Sbjct: 88 RQRLINNILALAKKYGYDGVNIDFEN-VPPEDREAYTQFLRELSDRLHPAG------YTL 140
Query: 188 LTAAVPV----GPDNIKGGYDVPAVAGYLDFINLMAYDFHGKW 226
TA VP N G YD A+ +DF+ LM YD+H +
Sbjct: 141 STAVVPKTSADQFGNWSGAYDYAAIGKIVDFVVLMTYDWHWRG 183
Score = 53.0 bits (128), Expect = 2e-07
Identities = 30/124 (24%), Positives = 57/124 (45%), Gaps = 26/124 (20%)
Query: 422 EYTIS---KEKLMIGMPTYGRSFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEVCE 478
+Y ++ +EK+++G+P YG +TL P K +A +S +
Sbjct: 200 QYAVTQIPREKILLGIPLYGYDWTL-----------------PYKKGGKASTISPQQAIN 242
Query: 479 ML-RNGAGYVWDDEMKVPYLVHGDQ-----WVGFDDEKSIRYKMNWLKDNGYGGAMVWTV 532
+ R GA +D+E + P+ + D+ V F+D +S++ K K+ G G W +
Sbjct: 243 LAKRYGAEIQYDEEAQSPFFRYVDEQGRRHEVWFEDARSLQAKFELAKEYGLRGVSYWRL 302
Query: 533 DMDD 536
++D
Sbjct: 303 GLED 306
Score = 44.9 bits (107), Expect = 9e-05
Identities = 23/69 (33%), Positives = 32/69 (46%), Gaps = 11/69 (15%)
Query: 245 WQLREAFEAEAQEVKKPRL------LLTAAVPV----GPDNIKGGYDVPAVAGYLDFINL 294
+ REA+ +E+ RL L TA VP N G YD A+ +DF+ L
Sbjct: 116 PEDREAYTQFLRELS-DRLHPAGYTLSTAVVPKTSADQFGNWSGAYDYAAIGKIVDFVVL 174
Query: 295 MAYDFHGKW 303
M YD+H +
Sbjct: 175 MTYDWHWRG 183
>gnl|CDD|226376 COG3858, COG3858, Predicted glycosyl hydrolase [General function
prediction only].
Length = 423
Score = 57.8 bits (140), Expect = 1e-08
Identities = 33/142 (23%), Positives = 58/142 (40%), Gaps = 14/142 (9%)
Query: 92 IEQLKKANPKLKTLLAIGGWS---FGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLD 148
IE + K ++ G FG + + + + ++ I + + L R + G++
Sbjct: 153 IEIAQCRKIKPVPGISNGTRPGANFGGELAQLLLNNETAKNRLINNIITLLDARGYRGVN 212
Query: 149 IDWEYPAGTDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKG----GYD 204
ID+E G D++ Y L+++R+A + V A D G YD
Sbjct: 213 IDFEN-VGPGDRELYTDFLRQVRDALHSGGYTVS------IAVAAKTSDLQVGSWHGAYD 265
Query: 205 VPAVAGYLDFINLMAYDFHGKW 226
A+ DF+ LM YD+H
Sbjct: 266 YVALGKIADFVILMTYDWHYSG 287
Score = 51.7 bits (124), Expect = 8e-07
Identities = 30/123 (24%), Positives = 52/123 (42%), Gaps = 23/123 (18%)
Query: 422 EYTISK---EKLMIGMPTYGRSFTLVDPTKFDIGAPASGGGEPGKYTAEAGFMSYYEVCE 478
EY ++ EK+M+G+P YG +TL +P Y A A +
Sbjct: 304 EYALTVIPAEKVMMGIPLYGYDWTL--------------PYDPLGYLARA-ISPDEAIDI 348
Query: 479 MLRNGAGYVWDDEMKVPYLVHGDQ-----WVGFDDEKSIRYKMNWLKDNGYGGAMVWTVD 533
R A +D + P+ + D+ V F+D +S + K++ +K+ G G W +
Sbjct: 349 ANRYNATIQYDATSQSPFFYYVDKEGRYHEVWFEDARSFQTKLDLIKEYGLRGVSYWVLG 408
Query: 534 MDD 536
+D
Sbjct: 409 QED 411
Score = 41.3 bits (97), Expect = 0.001
Identities = 17/62 (27%), Positives = 23/62 (37%), Gaps = 10/62 (16%)
Query: 246 QLREAFEAEAQEVKKPRLLLTAAVPVGPDNIKG----GYDVPAVAGYLDFINLMAYDFHG 301
Q+R+A + V A D G YD A+ DF+ LM YD+H
Sbjct: 232 QVRDALHSGGYTVS------IAVAAKTSDLQVGSWHGAYDYVALGKIADFVILMTYDWHY 285
Query: 302 KW 303
Sbjct: 286 SG 287
>gnl|CDD|119362 cd06545, GH18_3CO4_chitinase, The Bacteroides thetaiotaomicron
protein represented by pdb structure 3CO4 is an
uncharacterized bacterial member of the family 18
glycosyl hydrolases with homologs found in
Flavobacterium, Stigmatella, and Pseudomonas.
Length = 253
Score = 54.4 bits (131), Expect = 5e-08
Identities = 35/125 (28%), Positives = 57/125 (45%), Gaps = 15/125 (12%)
Query: 102 LKTLLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDIDWEYPAGTDDKK 161
+K L+++ G S ++ R+ + + ++ N DG+D+D E P T
Sbjct: 60 VKILISLAGGSPPEFTAALNDPAK--RKALVDKIINYVVSYNLDGIDVDLEGPDVT--FG 115
Query: 162 NYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYD 221
+Y++ ++ L A + E LLTAAV G +A Y DFIN+M+YD
Sbjct: 116 DYLVFIRALYAALKKEG-------KLLTAAVS---SWNGGAVSDSTLA-YFDFINIMSYD 164
Query: 222 FHGKW 226
G W
Sbjct: 165 ATGPW 169
Score = 37.4 bits (87), Expect = 0.015
Identities = 20/49 (40%), Positives = 24/49 (48%), Gaps = 4/49 (8%)
Query: 255 AQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVAGYLDFINLMAYDFHGKW 303
+KK LLTAAV G +A Y DFIN+M+YD G W
Sbjct: 125 YAALKKEGKLLTAAVS---SWNGGAVSDSTLA-YFDFINIMSYDATGPW 169
>gnl|CDD|119354 cd02875, GH18_chitobiase, Chitobiase (also known as
di-N-acetylchitobiase) is a lysosomal glycosidase that
hydrolyzes the reducing-end N-acetylglucosamine from the
chitobiose core of oligosaccharides during the ordered
degradation of asparagine-linked glycoproteins in
eukaryotes. Chitobiase can only do so if the asparagine
that joins the oligosaccharide to protein is previously
removed by a glycosylasparaginase. Chitobiase is
therefore the final step in the lysosomal degradation of
the protein/carbohydrate linkage component of
asparagine-linked glycoproteins. The catalytic domain of
chitobiase is an eight-stranded alpha/beta barrel fold
similar to that of other family 18 glycosyl hydrolases
such as hevamine and chitotriosidase.
Length = 358
Score = 52.0 bits (125), Expect = 6e-07
Identities = 31/106 (29%), Positives = 48/106 (45%), Gaps = 9/106 (8%)
Query: 119 KEMSKSRYSRQTFIYSAVPFLRQRNFDGLDIDWEYPAGTDDKKNYVL--LLKELREAFEA 176
+++S Y R +I V + + DG++ID E P + Y L L+KE +AF+
Sbjct: 89 EQISNPTY-RTQWIQQKVELAKSQFMDGINIDIEQPITKGSPEYYALTELVKETTKAFKK 147
Query: 177 EAQEVKKPRLLLTAAVPVGPDNIKG-GYDVPAVAGYLDFINLMAYD 221
E P ++ V P I YD +A DF+ +M YD
Sbjct: 148 E-----NPGYQISFDVAWSPSCIDKRCYDYTGIADASDFLVVMDYD 188
Score = 39.3 bits (92), Expect = 0.005
Identities = 37/160 (23%), Positives = 63/160 (39%), Gaps = 30/160 (18%)
Query: 403 TNPAVFKVNSPAS----GGGKAGEYTISKEKLMIGMPTYGRSFT-----------LVDPT 447
+ NSP S G + I +KL++G+P YG + +
Sbjct: 195 GKECIAGANSPYSQTLSGYNNFTKLGIDPKKLVMGLPWYGYDYPCLNGNLEDVVCTIPKV 254
Query: 448 KFDIGAPASGGGEPGKYTAEAGFMSYYEVCEMLRNGAGYV-WDDEMKVPYLVHGD----- 501
F GA S A + Y E+ + + + G WD E K P+ + D
Sbjct: 255 PF-RGANCS--------DAAGRQIPYSEIMKQINSSIGGRLWDSEQKSPFYNYKDKQGNL 305
Query: 502 QWVGFDDEKSIRYKMNWLKDNGYGGAMVWTVDMDDFTGTV 541
V +D+ +S+ K+ + K+ G G +W D+ D++G
Sbjct: 306 HQVWYDNPQSLSIKVAYAKNLGLKGIGMWNGDLLDYSGLP 345
Score = 32.0 bits (73), Expect = 1.1
Identities = 11/35 (31%), Positives = 16/35 (45%), Gaps = 1/35 (2%)
Query: 265 LTAAVPVGPDNIKG-GYDVPAVAGYLDFINLMAYD 298
++ V P I YD +A DF+ +M YD
Sbjct: 154 ISFDVAWSPSCIDKRCYDYTGIADASDFLVVMDYD 188
>gnl|CDD|119350 cd02871, GH18_chitinase_D-like, GH18 domain of Chitinase D (ChiD).
ChiD, a chitinase found in Bacillus circulans,
hydrolyzes the 1,4-beta-linkages of N-acetylglucosamine
in chitin and chitodextrins. The domain architecture of
ChiD includes a catalytic glycosyl hydrolase family 18
(GH18) domain, a chitin-binding domain, and a
fibronectin type III domain. The chitin-binding and
fibronectin type III domains are located either
N-terminal or C-terminal to the catalytic domain. This
family includes exochitinase Chi36 from Bacillus cereus.
Length = 312
Score = 47.7 bits (114), Expect = 1e-05
Identities = 32/133 (24%), Positives = 51/133 (38%), Gaps = 29/133 (21%)
Query: 103 KTLLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDIDWE----YPAGTD 158
K L++IGG + + F+ S V +++ FDGLDID E T
Sbjct: 75 KVLISIGGANGHVDLNHTAQE-----DNFVDSIVAIIKEYGFDGLDIDLESGSNPLNATP 129
Query: 159 DKKNYVLLLKELREAFEAEAQEVKKPRLLLTAA--------VPVGPDNIKGGYDVP---A 207
N + LK+L++ + P +LT A I G Y +P
Sbjct: 130 VITNLISALKQLKDHY--------GPNFILTMAPETPYVQGGYAAYGGIWGAY-LPLIDN 180
Query: 208 VAGYLDFINLMAY 220
+ L ++N+ Y
Sbjct: 181 LRDDLTWLNVQYY 193
>gnl|CDD|119366 cd06549, GH18_trifunctional, GH18 domain of an uncharacterized
family of bacterial proteins, which share a common
three-domain architecture: an N-terminal glycosyl
hydrolase family 18 (GH18) domain, a glycosyl
transferase family 2 domain, and a C-terminal
polysaccharide deacetylase domain.
Length = 298
Score = 47.4 bits (113), Expect = 1e-05
Identities = 41/151 (27%), Positives = 61/151 (40%), Gaps = 23/151 (15%)
Query: 92 IEQLKKANPKLKTLL---AIGGWSF-GTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGL 147
I KA+PK+ L+ + G W + +R FI + +L + DG+
Sbjct: 52 IIAAAKAHPKVLPLVQNISGGAWDGKNIARLLADPSAR---AKFIANIAAYLERNQADGI 108
Query: 148 DIDWEYPAGTDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPA 207
+D+E DD YV L ELR + + LT VP +++ A
Sbjct: 109 VLDFE-ELPADDLPKYVAFLSELR-------RRLPAQGKQLTVTVPADEA----DWNLKA 156
Query: 208 VAGYLDFINLMAYDFHGKWESQVGHNAPLRS 238
+A D + LMAYD H Q G P+ S
Sbjct: 157 LARNADKLILMAYDEHY----QGGAPGPIAS 183
>gnl|CDD|119355 cd02876, GH18_SI-CLP, Stabilin-1 interacting chitinase-like protein
(SI-CLP) is a eukaryotic chitinase-like protein of
unknown function that interacts with the
endocytic/sorting transmembrane receptor stabilin-1 and
is secreted from the lysosome. SI-CLP has a glycosyl
hydrolase family 18 (GH18) domain but lacks a
chitin-binding domain. The catalytic amino acids of the
GH18 domain are not conserved in SI-CLP, similar to the
chitolectins YKL-39, YKL-40, and YM1/2. Human SI-CLP is
sorted to late endosomes and secretory lysosomes in
alternatively activated macrophages.
Length = 318
Score = 47.3 bits (113), Expect = 1e-05
Identities = 39/151 (25%), Positives = 64/151 (42%), Gaps = 12/151 (7%)
Query: 92 IEQLKKANPKLKTL--LAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDI 149
IE+++KAN +K L + GWS+ Q + + R+ I V ++ +FDG+ +
Sbjct: 57 IEEVRKANKNIKILPRVLFEGWSY--QDLQSLLNDEQEREKLIKLLVTTAKKNHFDGIVL 114
Query: 150 D-WEYPAGTDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGY---DV 205
+ W A L++ + E K L+L P N G + D
Sbjct: 115 EVWSQLAAYGVPDKRKELIQLVIHLGETLHSANLK--LILVIPPPREKGNQNGLFTRKDF 172
Query: 206 PAVAGYLDFINLMAYDFHGKWESQVGHNAPL 236
+A ++D +LM YD+ G NAPL
Sbjct: 173 EKLAPHVDGFSLMTYDYSSPQRP--GPNAPL 201
>gnl|CDD|119361 cd06544, GH18_narbonin, Narbonin is a plant 2S protein from the
globulin fraction of narbon bean (Vicia narbonensis L.)
cotyledons with unknown function. Narbonin has a
glycosyl hydrolase family 18 (GH18) domain without the
conserved catalytic residues and with no known enzymatic
activity. Narbonin amounts to up to 3% of the total
seed globulins of mature seeds and was thought to be a
storage protein but was found to degrade too slowly
during germination. This family also includes the
VfNOD32 nodulin from Vicia faba.
Length = 253
Score = 43.5 bits (103), Expect = 2e-04
Identities = 47/206 (22%), Positives = 78/206 (37%), Gaps = 37/206 (17%)
Query: 46 KFQPEDIEPDLCTHIIFAFGWLKKGKLSSFESNDETKDGKVGLY--------ERIEQLKK 97
F I P + H I +F +++ +GK Y E ++ +K
Sbjct: 14 TFSDVPINPKVEFHFILSFA-------IDYDTESNPTNGKFNPYWDTENLTPEAVKSIKA 66
Query: 98 ANPKLKTLLAIGGWSFGTQK--FKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDIDWE-YP 154
+P +K +++IGG F + + S ++ N DG+DID+E +P
Sbjct: 67 QHPNVKVVISIGGRGVQNNPTPFDPSNVDSWVSNAV-SSLTSIIQTYNLDGIDIDYEHFP 125
Query: 155 AGTDDKKNYV-LLLKELRE----------AFEAEAQEVKKPRLLLTAAVPVGPDNIK--- 200
A D + L+ EL+ E Q L L A D +
Sbjct: 126 ADPDTFVECIGQLITELKNNGVIKVASIAPSEDAEQ---SHYLALYNAYGDYIDYVNYQF 182
Query: 201 GGYDVP-AVAGYLDFINLMAYDFHGK 225
Y VP VA Y++F + +A ++ GK
Sbjct: 183 YNYGVPTTVAKYVEFYDEVANNYPGK 208
>gnl|CDD|119359 cd06542, GH18_EndoS-like, Endo-beta-N-acetylglucosaminidases are
bacterial chitinases that hydrolyze the chitin core of
various asparagine (N)-linked glycans and glycoproteins.
The endo-beta-N-acetylglucosaminidases have a glycosyl
hydrolase family 18 (GH18) catalytic domain. Some
members also have an additional C-terminal glycosyl
hydrolase family 20 (GH20) domain while others have an
N-terminal domain of unknown function (pfam08522).
Members of this family include
endo-beta-N-acetylglucosaminidase S (EndoS) from
Streptococcus pyogenes, EndoF1, EndoF2, EndoF3, and
EndoH from Flavobacterium meningosepticum, and EndoE
from Enterococcus faecalis. EndoS is a secreted
endoglycosidase from Streptococcus pyogenes that
specifically hydrolyzes the glycan on human IgG between
two core N-acetylglucosamine residues. EndoE is a
secreted endoglycosidase, encoded by the ndoE gene in
Enterococcus faecalis, that hydrolyzes the glycan on
human RNase B.
Length = 255
Score = 38.1 bits (89), Expect = 0.011
Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 6/38 (15%)
Query: 143 NFDGLDIDWEY------PAGTDDKKNYVLLLKELREAF 174
DG+D D EY + +V L+KELR+
Sbjct: 104 GLDGVDFDDEYSGYGKNGTSQPSNEAFVRLIKELRKYM 141
>gnl|CDD|226000 COG3469, COG3469, Chitinase [Carbohydrate transport and
metabolism].
Length = 332
Score = 37.5 bits (87), Expect = 0.020
Identities = 36/152 (23%), Positives = 59/152 (38%), Gaps = 27/152 (17%)
Query: 94 QLKKANPKLKT-LLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDIDWE 152
Q+ N + K LL++GG E+ + Q F+ + + FDGLDID E
Sbjct: 90 QVGALNAEGKAVLLSLGG----ADGHIELKAGQ--EQAFVNEIIRLIETYGFDGLDIDLE 143
Query: 153 YPAGT--DDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVP---A 207
A D++ LK +++ ++ + + +T A P P G +P
Sbjct: 144 QSAILAADNQTVIPAALKAVKDHYKNQGKN-----FFITMA-PEFPYLQGWGAYIPYINE 197
Query: 208 VAGYLDFINLMAYDFHG---------KWESQV 230
+ Y DFI Y+ G W +Q
Sbjct: 198 LRDYYDFIAPQLYNQGGDGNWVTESNAWIAQN 229
>gnl|CDD|119363 cd06546, GH18_CTS3_chitinase, GH18 domain of CTS3 (chitinase 3), an
uncharacterized protein from the human fungal pathogen
Coccidioides posadasii. CTS3 has a chitinase-like
glycosyl hydrolase family 18 (GH18) domain; and has
homologs in bacteria as well as fungi.
Length = 256
Score = 33.8 bits (78), Expect = 0.25
Identities = 22/76 (28%), Positives = 32/76 (42%), Gaps = 11/76 (14%)
Query: 102 LKTLLAIGGW---SFGTQKFKEMSKSRYSRQTFIYSAVPFLRQRNFDGLDIDWEYPAGTD 158
+K + +GG SF + RY Q + +R+R DGLD+D E P D
Sbjct: 73 VKVMGMLGGAAPGSFSRLDDDDEDFERYYGQ--LRD---MIRRRGLDGLDLDVEEPMSLD 127
Query: 159 DKKNYVLLLKELREAF 174
+ L+ LR F
Sbjct: 128 ---GIIRLIDRLRSDF 140
>gnl|CDD|171770 PRK12851, groEL, chaperonin GroEL; Reviewed.
Length = 541
Score = 33.6 bits (77), Expect = 0.42
Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 10/87 (11%)
Query: 596 ELLSKKKTTTIQKQETSNLIDTKAARRRL--RRPSIKSKPSEDSGDQDVSASINRFKLKN 653
E L + K ++K+ T+ +ID ++ + R I+++ E + D D R KL+
Sbjct: 315 EQLGRAKKVVVEKENTT-IIDGAGSKTEIEGRVAQIRAQIEETTSDYD------REKLQE 367
Query: 654 RTAASSANSAVIAAG-SSKSSVKASKD 679
R A + AVI G S++ VK KD
Sbjct: 368 RLAKLAGGVAVIRVGASTEVEVKEKKD 394
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 33.8 bits (78), Expect = 0.44
Identities = 25/121 (20%), Positives = 46/121 (38%), Gaps = 3/121 (2%)
Query: 588 APIKIDVSELLSKKKTTTIQKQETSNLIDTKAARRRLRRPSIKSKPSEDSGDQD---VSA 644
A +K + + + S +I + AR+ R K + + D V+A
Sbjct: 493 ARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAA 552
Query: 645 SINRFKLKNRTAASSANSAVIAAGSSKSSVKASKDDPKYKVSLLVVRSSKSSVKANKDDP 704
+I R K K ++ A K++V A+ K K + S++ + + DP
Sbjct: 553 AIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDP 612
Query: 705 K 705
K
Sbjct: 613 K 613
>gnl|CDD|237232 PRK12852, groEL, chaperonin GroEL; Reviewed.
Length = 545
Score = 32.9 bits (75), Expect = 0.71
Identities = 30/136 (22%), Positives = 56/136 (41%), Gaps = 27/136 (19%)
Query: 596 ELLSKKKTTTIQKQETSNLIDTKAARRRL--RRPSIKSKPSEDSGDQDVSASINRFKLKN 653
++L + K I K E + +++ + + R IK++ E + D D R KL+
Sbjct: 315 KMLGRAKKVVIDK-ENTTIVNGAGKKADIEARVGQIKAQIEETTSDYD------REKLQE 367
Query: 654 RTAASSANSAVI-AAGSSKSSVKASKDD-----------------PKYKVSLLVVRSSKS 695
R A + AVI G+++ VK KD P V+LL + +
Sbjct: 368 RLAKLAGGVAVIRVGGATEVEVKEKKDRVEDALNATRAAVQEGIVPGGGVALLRAKKAVG 427
Query: 696 SVKANKDDPKYKVSLV 711
+ + D + +++V
Sbjct: 428 RINNDNADVQAGINIV 443
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 32.5 bits (74), Expect = 1.00
Identities = 20/103 (19%), Positives = 36/103 (34%), Gaps = 16/103 (15%)
Query: 624 LRRPSIKSKPSEDSGDQDVSASINRFKLKNRTAASSANSAVIAAGSSKSSVKASKDDPKY 683
L S KS S +S ++ K K ++ + + ++ DD +
Sbjct: 396 LIGSSEKSLGSNESSPAASNSDKGSKKKKGKST-----------STKGGTAESIPDDEED 444
Query: 684 KVSLLVVRSSKSSVKANKDDPKYKVSLVVVGSSKSSVKANKDD 726
+ K + K +D S G K SVK+ +D+
Sbjct: 445 -----APKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDN 482
>gnl|CDD|152481 pfam12046, DUF3529, Protein of unknown function (DUF3529). This
family of proteins is functionally uncharacterized. This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 176 to 190 amino acids
in length.
Length = 173
Score = 30.7 bits (70), Expect = 1.5
Identities = 17/60 (28%), Positives = 23/60 (38%), Gaps = 13/60 (21%)
Query: 102 LKTLLAIGGWSFGTQKFKEMSKSRYSRQTFIYSAVP---------FLRQRNFDGLDIDWE 152
L LLAIG + F + K R +R TFI P + +R + D E
Sbjct: 11 LTLLLAIGLFFF----LRASVKDRTTRVTFISPRPPLEVLGGLKEYFERRGYRVAGGDAE 66
>gnl|CDD|130078 TIGR01005, eps_transp_fam, exopolysaccharide transport protein
family. The model describes the exopolysaccharide
transport protein family in bacteria. The transport
protein is part of a large genetic locus which is
associated with exopolysaccharide (EPS) biosynthesis.
Detailed molecular characterization and gene fusion
analysis revealed atleast seven gene products are
involved in the overall regulation, which among other
things, include exopolysaccharide biosynthesis, property
of conferring virulence and exopolysaccharide export
[Transport and binding proteins, Carbohydrates, organic
alcohols, and acids].
Length = 754
Score = 31.5 bits (71), Expect = 1.7
Identities = 13/51 (25%), Positives = 19/51 (37%)
Query: 159 DKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKGGYDVPAVA 209
++ Y L R+A + R+ A+VP P K G V A
Sbjct: 388 KRQLYESYLTNYRQAASRQNYVPVDARVASPASVPSEPYFPKKGPIVGLAA 438
>gnl|CDD|218667 pfam05634, APO_RNA-bind, APO RNA-binding. This domain contains
conserved cysteine and histidine residues. It resembles
zinc fingers, and binds to zinc. This domain functions
as an RNA-binding domain.
Length = 207
Score = 30.2 bits (68), Expect = 2.9
Identities = 24/88 (27%), Positives = 32/88 (36%), Gaps = 14/88 (15%)
Query: 141 QRNFDGLDIDWEYPAGTDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIK 200
QRN + D P +KK Y + LK L A+E KK R + V P N
Sbjct: 14 QRNESPQNADLPKPYPKREKKPYPMPLKRLIR----RAKEEKKLRKMKPCRVLYPPKN-- 67
Query: 201 GGYDVPAVAGYLDFINLMAYDFHGKWES 228
G V + +A + H E
Sbjct: 68 -GLLVKELVP-------VAEEVHNAREL 87
>gnl|CDD|236309 PRK08599, PRK08599, coproporphyrinogen III oxidase; Provisional.
Length = 377
Score = 30.6 bits (70), Expect = 3.4
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 11/64 (17%)
Query: 98 ANP------KLKTLLAIG--GWSFGTQKFK-EMSKS--RYSRQTFIYSAVPFLRQRNFDG 146
ANP KL+ L G S G Q F E+ K R + +Y A+ ++ FD
Sbjct: 93 ANPGDLTKEKLQVLKDSGVNRISLGVQTFNDELLKKIGRTHNEEDVYEAIANAKKAGFDN 152
Query: 147 LDID 150
+ ID
Sbjct: 153 ISID 156
>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5. Members of this
family of proteins are involved in maintaining Golgi
structure. They stimulate the formation of Golgi stacks
and ribbons, and are involved in intra-Golgi retrograde
transport. Two main interactions have been
characterized: one with RAB1A that has been activated by
GTP-binding and another with isoform CASP of CUTL1.
Length = 509
Score = 29.4 bits (66), Expect = 7.6
Identities = 18/82 (21%), Positives = 30/82 (36%), Gaps = 7/82 (8%)
Query: 654 RTAASSANSAVIAAGSSKS---SVKASKDDPKYKVSLLVVRSSKSSVKANKDDPKYKVSL 710
S + S A S+K+ S K + D P +VS + +S + L
Sbjct: 38 SPVGSISWSVRETASSNKARSRSEKWNPDQPGSRVSSPSSKKDGTSRSLSSQVD----DL 93
Query: 711 VVVGSSKSSVKANKDDSKYKVV 732
SS+SS + + K+
Sbjct: 94 ASAVSSQSSSDLEDELAALKIR 115
>gnl|CDD|132143 TIGR03099, dCO2ase_PEP1, pyridoxal-dependent decarboxylase,
exosortase A system-associated. The sequences in this
family contain the pyridoxal binding domain (pfam02784)
and C-terminal sheet domain (pfam00278) of a family of
Pyridoxal-dependent decarboxylases. Characterized
enzymes in this family decarboxylate substrates such as
ornithine, diaminopimelate and arginine. The genes of
the family modeled here, with the exception of those
observed in certain Burkholderia species, are all found
in the context of exopolysaccharide biosynthesis loci
containing the exosortase/PEP-CTERM protein sorting
system. More specifically, these are characteristic of
the type 1 exosortase system represented by the Genome
Property GenProp0652. The substrate of these enzymes may
be a precursor of the carrier or linker which is
hypothesized to release the PEP-CTERM protein from the
exosortase enzyme. These enzymes are apparently most
closely related to the diaminopimelate decarboxylase
modeled by TIGR01048 which may suggest a similarity (or
identity) of substrate.
Length = 398
Score = 29.3 bits (66), Expect = 8.9
Identities = 36/136 (26%), Positives = 43/136 (31%), Gaps = 39/136 (28%)
Query: 167 LKELREAFEAEAQEVKKPRLLLTAAVPVGPD-NIKG------------GYDVPAVAGYLD 213
L+EL + R+ AV V PD +KG G D V L
Sbjct: 123 LRELNRLAALSEALGLRARV----AVRVNPDFELKGSGMKMGGGAKQFGIDAEQVPAALA 178
Query: 214 FINLMAYDFHG---KWESQVGHNAPLRSPSSDSEWQLREAFEAEAQEVK---KPRLLLTA 267
FI DF G SQ L EAQ RL +A
Sbjct: 179 FIKAADLDFQGFHIFAGSQ----------------NLNAEAIIEAQAKTLALALRLAESA 222
Query: 268 AVPVGPDNIKGGYDVP 283
PV NI GG+ +P
Sbjct: 223 PAPVRVINIGGGFGIP 238
>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).
This family consists of several hypothetical proteins
from Arabidopsis thaliana and Oryza sativa. The function
of this family is unknown.
Length = 564
Score = 29.4 bits (66), Expect = 9.1
Identities = 26/120 (21%), Positives = 42/120 (35%), Gaps = 5/120 (4%)
Query: 586 KPAPIKIDVSELLSKKKTTTIQKQETSNLIDTKAARRRLRRPSI---KSKPSEDSGDQDV 642
P+P S + ++I + + L P +S + S
Sbjct: 191 APSPSGGTSCPSSSGGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARS 250
Query: 643 SASINRFKLKNRTAASSANSAVIAAGSSKSSVKASK--DDPKYKVSLLVVRSSKSSVKAN 700
S++ + FK + A+ A S + S K + K+SK P K V SSK N
Sbjct: 251 SSAKSPFKSSIQRKATKALSKLSLRASPKDTSKSSKSEVAPPKKSEAKVPSSSKKWTDGN 310
>gnl|CDD|151829 pfam11389, Porin_OmpL1, Leptospira porin protein OmpL1. OmpL1 is a
member of the outer membrane (OM) proteins in the
mammalian pathogen Leptospira. Specifically, it is a
porin.
Length = 267
Score = 28.8 bits (64), Expect = 9.9
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 195 GPDNIKGGYDVPAVAGYLDFINLMA 219
G +NIKGG+D+ A AG NL++
Sbjct: 166 GSNNIKGGHDILAAAGAGSVANLLS 190
Score = 28.8 bits (64), Expect = 9.9
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 272 GPDNIKGGYDVPAVAGYLDFINLMA 296
G +NIKGG+D+ A AG NL++
Sbjct: 166 GSNNIKGGHDILAAAGAGSVANLLS 190
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.133 0.396
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,449,456
Number of extensions: 3786089
Number of successful extensions: 2440
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2365
Number of HSP's successfully gapped: 79
Length of query: 743
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 639
Effective length of database: 6,324,786
Effective search space: 4041538254
Effective search space used: 4041538254
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 63 (28.0 bits)