RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4449
(2417 letters)
>gnl|CDD|119351 cd02872, GH18_chitolectin_chitotriosidase, This conserved domain
family includes a large number of catalytically inactive
chitinase-like lectins (chitolectins) including YKL-39,
YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic
mammalian chitinase), as well as catalytically active
chitotriosidases. The conserved domain is an
eight-stranded alpha/beta barrel fold belonging to the
family 18 glycosyl hydrolases. The fold has a pronounced
active-site cleft at the C-terminal end of the
beta-barrel. The chitolectins lack a key active site
glutamate (the proton donor required for hydrolytic
activity) but retain highly conserved residues involved
in oligosaccharide binding. Chitotriosidase is a
chitinolytic enzyme expressed in maturing macrophages,
which suggests that it plays a part in antimicrobial
defense. Chitotriosidase hydrolyzes chitotriose, as well
as colloidal chitin to yield chitobiose and is therefore
considered an exochitinase. Chitotriosidase occurs in two
major forms, the large form being converted to the small
form by either RNA or post-translational processing.
Although the small form, containing the chitinase domain
alone, is sufficient for the chitinolytic activity, the
additional C-terminal chitin-binding domain of the large
form plays a role in processing colloidal chitin. The
chitotriosidase gene is nonessential in humans, as about
35% of the population are heterozygous and 6% homozygous
for an inactivated form of the gene. HCGP39 is a 39-kDa
human cartilage glycoprotein thought to play a role in
connective tissue remodeling and defense against
pathogens.
Length = 362
Score = 531 bits (1370), Expect = e-172
Identities = 183/371 (49%), Positives = 240/371 (64%), Gaps = 15/371 (4%)
Query: 1713 IVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFYE 1772
+VCYFTNWA YR GK++P +ID LCTH+IY FA L+ D +I D W D+D YE
Sbjct: 1 VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNPDGNIIIL-DEWNDIDLGLYE 59
Query: 1773 KVTALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDL 1830
+ ALK+K +K LAIGGWN + K+S + S + R FI + F+ ++ FDGLDL
Sbjct: 60 RFNALKEKNPNLKTLLAIGGWNFGSA-KFSAMAASPENRKTFIKSAIAFLRKYGFDGLDL 118
Query: 1831 DWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHD--LLLSAAVSPSKAVIDNAYDI 1888
DWEYP GP DK+ F L+KELR AF P LLL+AAVS K ID AYDI
Sbjct: 119 DWEYPGQRG-----GPPEDKENFVTLLKELREAFEPEAPRLLLTAAVSAGKETIDAAYDI 173
Query: 1889 PVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTT--PTFNANYSLHYWVSHGADR 1946
P +S+ LD+I+VMTYD+HG W+ TGH +P+YA DT N +Y++ YW+S GA
Sbjct: 174 PEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSADTGDQKYLNVDYAIKYWLSKGAPP 233
Query: 1947 KKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKIQKDGWVV 2006
+K++ G+P YG+SFTLA + G+ + G AG TR GFLAYYEIC+ K GW V
Sbjct: 234 EKLVLGIPTYGRSFTLASPSNTGVGAPASGPGTAGPYTREAGFLAYYEICEF-LKSGWTV 292
Query: 2007 VRDRKRRIGPYAFKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDFKNFCGCES 2066
V D +++ PYA+KG+QWVG+DD+ I K +++K LGGAM+W++DLDDF+ CG
Sbjct: 293 VWDDEQK-VPYAYKGNQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDLDDFRGTCGQGK 351
Query: 2067 YPLLKTINRVL 2077
YPLL INR L
Sbjct: 352 YPLLNAINRAL 362
Score = 376 bits (968), Expect = e-117
Identities = 128/310 (41%), Positives = 183/310 (59%), Gaps = 13/310 (4%)
Query: 385 VVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFYQ 444
VVCY TNW+ R +GKFVPENID LCTHI+YAFA L+P+ +I + DID Y+
Sbjct: 1 VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNPD-GNIIILDEWNDIDLGLYE 59
Query: 445 RISSSPLVTHGKVKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLH 504
R ++ + +K L+AIGGW S K+S + +S NRK FIKS + FLR++ F GL
Sbjct: 60 RFNALKE-KNPNLKTLLAIGGWNFGS-AKFSAMAASPENRKTFIKSAIAFLRKYGFDGLD 117
Query: 505 FDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDKHD--YSIAVGISGYKEILEVAYD 562
DW YP DK NFV L++EL+ F+ + +S KE ++ AYD
Sbjct: 118 LDWEYPGQRGGP-----PEDKENFVTLLKELREAFEPEAPRLLLTAAVSAGKETIDAAYD 172
Query: 563 FPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEP--YPNYNINTALKLIDELGGD 620
P ++ +++F+++M+YD+HG+WEG+TG SPL + + N++ A+K G
Sbjct: 173 IPEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSADTGDQKYLNVDYAIKYWLSKGAP 232
Query: 621 KRKIVIGVPFYGQSYTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTRKWT 680
K+V+G+P YG+S+TL G+GA GPG AG YT++ G LAYYEIC +K WT
Sbjct: 233 PEKLVLGIPTYGRSFTLASPSNTGVGAPASGPGTAGPYTREAGFLAYYEICEFLK-SGWT 291
Query: 681 VQRNTKSLEP 690
V + + P
Sbjct: 292 VVWDDEQKVP 301
Score = 302 bits (775), Expect = 1e-91
Identities = 125/379 (32%), Positives = 197/379 (51%), Gaps = 34/379 (8%)
Query: 15 VVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAIDPVSRALIP-EDLEYDVIKGGY 73
VVCY + YR F +N+ P CTH+IYA+A ++P +I E + D G Y
Sbjct: 1 VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNPDGNIIILDEWNDID--LGLY 58
Query: 74 KSFLGLKEANPELKVYLAV------KSNFVSITSDRESRLNFISSVLEMFDMYKFDGLDL 127
+ F LKE NP LK LA+ + F ++ + E+R FI S + Y FDGLDL
Sbjct: 59 ERFNALKEKNPNLKTLLAIGGWNFGSAKFSAMAASPENRKTFIKSAIAFLRKYGFDGLDL 118
Query: 128 NVKDPALNDEDDDDLESIANERSDFSTFIQELSSTLRRNNYQLTLT-----SPGVIDRKT 182
+ + P +D + +F T ++EL +L LT ID
Sbjct: 119 DWEYPGQRGGPPED-------KENFVTLLKELREAFEPEAPRLLLTAAVSAGKETIDAA- 170
Query: 183 SLVDISVVAPLVDLILLKSFN----NDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAV 238
DI ++ +D I + +++ + + P+ + + + N++ A+
Sbjct: 171 --YDIPEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSA-----DTGDQKYLNVDYAI 223
Query: 239 YNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGPGTEGKYTQMPGYLAFFEVC 298
W+ KGA PE++++GIP +G+S+ L + S G+GA GPGT G YT+ G+LA++E+C
Sbjct: 224 KYWLSKGAPPEKLVLGIPTYGRSFTLASPSNTGVGAPASGPGTAGPYTREAGFLAYYEIC 283
Query: 299 NKFKDKTWRHFTDSNGEPFMVKKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDDF 358
K + D P+ K ++W+ Y++ +SI K+ Y+K + LGGAM+W++DLDDF
Sbjct: 284 EFLKSGWTVVWDDEQKVPYAYKGNQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDLDDF 343
Query: 359 RGFCGQ-KYPLLSAVVSNL 376
RG CGQ KYPLL+A+ L
Sbjct: 344 RGTCGQGKYPLLNAINRAL 362
Score = 292 bits (750), Expect = 3e-88
Identities = 112/267 (41%), Positives = 146/267 (54%), Gaps = 49/267 (18%)
Query: 1342 IVCYFTNWAWYRPGKGKYVLKKV----------------PWINILKLYVTLQTGLGTDQE 1385
+VCYFTNWA YRPG GK+V + + P NI+ L LG
Sbjct: 1 VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNPDGNIIILDEWNDIDLG---- 56
Query: 1386 KENTFLKISELTYVHILFYERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNSATARQ 1443
YER LK+K +K LAIGGWN K+S + S R+
Sbjct: 57 -----------------LYERFNALKEKNPNLKTLLAIGGWNFG-SAKFSAMAASPENRK 98
Query: 1444 RFIEHVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPH--G 1501
FI+ + FL KY FDGLDLDWEYP GP DKE+F ++EL +AF+P
Sbjct: 99 TFIKSAIAFLRKYGFDGLDLDWEYPGQRG-----GPPEDKENFVTLLKELREAFEPEAPR 153
Query: 1502 LLLSAAVSPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDD--D 1559
LLL+AAVS K+ I+AAYD+ +S+ LD+I+VMTYD+HG W+ TGH +PLY D D
Sbjct: 154 LLLTAAVSAGKETIDAAYDIPEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSADTGD 213
Query: 1560 FFYFNANFTMNYWMKKGAPSRKLVMGV 1586
Y N ++ + YW+ KGAP KLV+G+
Sbjct: 214 QKYLNVDYAIKYWLSKGAPPEKLVLGI 240
Score = 175 bits (445), Expect = 2e-47
Identities = 54/116 (46%), Positives = 72/116 (62%), Gaps = 1/116 (0%)
Query: 695 TLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTRKWTVQRNTKSLEPFTFQG 754
TL G+GA GPG AG YT++ G LAYYEIC +K WTV + + P+ ++G
Sbjct: 248 TLASPSNTGVGAPASGPGTAGPYTREAGFLAYYEICEFLK-SGWTVVWDDEQKVPYAYKG 306
Query: 755 DQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLDDFHNKCCMESFPLLRAVNRAF 810
+QWV Y+D S+ KV+++KS G G M W+IDLDDF C +PLL A+NRA
Sbjct: 307 NQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDLDDFRGTCGQGKYPLLNAINRAL 362
Score = 155 bits (393), Expect = 1e-40
Identities = 51/137 (37%), Positives = 77/137 (56%), Gaps = 4/137 (2%)
Query: 2192 VICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFYE 2251
V+CY+T+WA R GK+ PE+ID LCTH++YAFA L+ + I + D++ G YE
Sbjct: 1 VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNP-DGNIIILDEWNDIDLGLYE 59
Query: 2252 KITKFNER--GIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEF 2309
+ E+ ++ +LAIGGW+ S K+S + R+ F++ FL YGF+GL+
Sbjct: 60 RFNALKEKNPNLKTLLAIGGWNFG-SAKFSAMAASPENRKTFIKSAIAFLRKYGFDGLDL 118
Query: 2310 AWEYPKCWQTCSTVKSH 2326
WEYP K +
Sbjct: 119 DWEYPGQRGGPPEDKEN 135
Score = 113 bits (286), Expect = 1e-26
Identities = 44/82 (53%), Positives = 52/82 (63%), Gaps = 2/82 (2%)
Query: 2320 CSTVKSHEWTVVRDSLGARGPYAYNGKLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDL 2379
C +KS WTVV D + PYAY G W+ YDD I K + +K GLGGAMVWSIDL
Sbjct: 283 CEFLKS-GWTVVWDD-EQKVPYAYKGNQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDL 340
Query: 2380 DDFQNLCSCGPYPLLSAVNYEL 2401
DDF+ C G YPLL+A+N L
Sbjct: 341 DDFRGTCGQGKYPLLNAINRAL 362
Score = 58.3 bits (142), Expect = 2e-08
Identities = 18/35 (51%), Positives = 23/35 (65%), Gaps = 1/35 (2%)
Query: 867 SI-LQVKFIKSNGYAGIMAWTIDLDDFHNKCCMDK 900
SI L+V+++KS G G M W+IDLDDF C K
Sbjct: 317 SIALKVQYLKSKGLGGAMVWSIDLDDFRGTCGQGK 351
>gnl|CDD|214753 smart00636, Glyco_18, Glyco_18 domain.
Length = 334
Score = 381 bits (981), Expect = e-119
Identities = 136/355 (38%), Positives = 198/355 (55%), Gaps = 31/355 (8%)
Query: 1712 KIVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFY 1771
++V YFTNW Y G + DI + TH+IY FA +D D + D WAD+ N +
Sbjct: 1 RVVGYFTNWGVY---GRNFPVDDIPASKLTHIIYAFANIDPD-GTVTIGDEWADIGN--F 54
Query: 1772 EKVTALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLD 1829
++ ALKKK G+KV L+IGGW +S + +S +++ +R KFI +V+F+ ++ FDG+D
Sbjct: 55 GQLKALKKKNPGLKVLLSIGGWTES--DNFSSMLSDPASRKKFIDSIVSFLKKYGFDGID 112
Query: 1830 LDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFN-----PHDLLLSAAVSPSKAVIDN 1884
+DWEYP D++ + L+KELR A + LL+ AV ID
Sbjct: 113 IDWEYPGGRG--------DDRENYTALLKELREALDKEGAEGKGYLLTIAVPAGPDKIDK 164
Query: 1885 AYD-IPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHG 1943
Y +P +++ LD+I++MTYD+HG W TGH AP+YA P D +N +Y++ Y++ G
Sbjct: 165 GYGDLPAIAKYLDFINLMTYDFHGAWSNPTGHNAPLYAGPGDPEK-YNVDYAVKYYLCKG 223
Query: 1944 ADRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKIQKDG 2003
K++ G+P YG+ +TL D + NG + G A G T G + Y EIC + G
Sbjct: 224 VPPSKLVLGIPFYGRGWTLVDGSNNGPGAPFTGPATGGPGTWEGGVVDYREICKLL---G 280
Query: 2004 WVVVRDRKRRIGPYAFKGD--QWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLD 2056
VV D + PYA+ QWV +DD I KA++VK LGG MIW LD D
Sbjct: 281 ATVVYDDTAK-APYAYNPGTGQWVSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334
Score = 290 bits (744), Expect = 5e-88
Identities = 123/413 (29%), Positives = 196/413 (47%), Gaps = 86/413 (20%)
Query: 384 KVVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFY 443
+VV Y TNW F ++I THI+YAFA++DP+ ++ G+ ADI NF
Sbjct: 1 RVVGYFTNWG---VYGRNFPVDDIPASKLTHIIYAFANIDPD-GTVTIGDEWADIG-NFG 55
Query: 444 QRISSSPLV-THGKVKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAG 502
Q + L + +K+L++IGGWT+S + +S ++S ++RKKFI S+++FL+++ F G
Sbjct: 56 QLKA---LKKKNPGLKVLLSIGGWTES--DNFSSMLSDPASRKKFIDSIVSFLKKYGFDG 110
Query: 503 LHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDK-----HDYSIAVGISGYKEIL 557
+ DW YP D + N+ L++EL+ DK Y + + + + +
Sbjct: 111 IDIDWEYPGGRGDD--------RENYTALLKELREALDKEGAEGKGYLLTIAVPAGPDKI 162
Query: 558 EVAYDF-PALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNYNINTALKLIDE 616
+ Y PA+ +++F++LM+YD+HGAW TG +PL + PG+P YN++ A+K
Sbjct: 163 DKGYGDLPAIAKYLDFINLMTYDFHGAWSNPTGHNAPLYAGPGDP-EKYNVDYAVKYYLC 221
Query: 617 LGGDKRKIVIGVPFYGQSYTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKT 676
G K+V+G+PFYG+ +TL +G GA GP G T + G++ Y EIC K
Sbjct: 222 KGVPPSKLVLGIPFYGRGWTLVDGSNNGPGAPFTGPATGGPGTWEGGVVDYREIC---KL 278
Query: 677 RKWTVQRNTKSLEPRSMGTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTR 736
TV + + Y Y T
Sbjct: 279 LGATVVYDDTA--------------------------KAPY------------AYNPGTG 300
Query: 737 KWTVQRNTKSLEPFTFQGDQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLD 789
+W VSY+DP S+ K ++K G G+M W +D D
Sbjct: 301 QW-------------------VSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334
Score = 230 bits (588), Expect = 5e-67
Identities = 84/259 (32%), Positives = 131/259 (50%), Gaps = 39/259 (15%)
Query: 1341 KIVCYFTNWAWYRPGKGKYVLKKVPW-----INILKLYVTLQTGLGTDQEKENTFLKISE 1395
++V YFTNW Y + + +P I + G T ++ +
Sbjct: 1 RVVGYFTNWGVYGR---NFPVDDIPASKLTHIIYAFANID-PDGTVTIGDEWADIGNFGQ 56
Query: 1396 LTYVHILFYERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFL 1453
L LKKK G+KV L+IGGW +S +S +++ +R++FI+ +V FL
Sbjct: 57 LKA-----------LKKKNPGLKVLLSIGGWTES--DNFSSMLSDPASRKKFIDSIVSFL 103
Query: 1454 LKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFK-----PHGLLLSAAV 1508
KY FDG+D+DWEYP D+E++ ++EL +A G LL+ AV
Sbjct: 104 KKYGFDGIDIDWEYPGGR--------GDDRENYTALLKELREALDKEGAEGKGYLLTIAV 155
Query: 1509 SPSKQVINAAYD-VKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANF 1567
I+ Y + A+++ LD+I++MTYD+HG W TGH APLY P D +N ++
Sbjct: 156 PAGPDKIDKGYGDLPAIAKYLDFINLMTYDFHGAWSNPTGHNAPLYAGPGDP-EKYNVDY 214
Query: 1568 TMNYWMKKGAPSRKLVMGV 1586
+ Y++ KG P KLV+G+
Sbjct: 215 AVKYYLCKGVPPSKLVLGI 233
Score = 185 bits (473), Expect = 1e-51
Identities = 100/356 (28%), Positives = 161/356 (45%), Gaps = 35/356 (9%)
Query: 14 QVVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAIDPVSRALIPEDLEYDVIKGGY 73
+VV Y Y F V ++ TH+IYA+A IDP + D D+ G
Sbjct: 1 RVVGYFTNWGVYGRN---FPVDDIPASKLTHIIYAFANIDP-DGTVTIGDEWADIGNFGQ 56
Query: 74 KSFLGLKEANPELKVYLAV-----KSNFVSITSDRESRLNFISSVLEMFDMYKFDGLDLN 128
LK+ NP LKV L++ NF S+ SD SR FI S++ Y FDG+D++
Sbjct: 57 --LKALKKKNPGLKVLLSIGGWTESDNFSSMLSDPASRKKFIDSIVSFLKKYGFDGIDID 114
Query: 129 VKDPALNDEDDDDLESIANERSDFSTFIQELSSTLRR-----NNYQLTLTSPGVIDRKTS 183
+ P +D ++ ++ ++EL L + Y LT+ P D+
Sbjct: 115 WEYPGGRGDDREN----------YTALLKELREALDKEGAEGKGYLLTIAVPAGPDKIDK 164
Query: 184 LVD-ISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAVYNWI 242
+ +A +D I L ++ D P N + N++ AV ++
Sbjct: 165 GYGDLPAIAKYLDFINLMTY--DFHGAWSNPTGHNA--PLYAGPGDPEKYNVDYAVKYYL 220
Query: 243 KKGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGPGTEGKYTQMPGYLAFFEVCNKFK 302
KG P ++++GIPF+G+ + L + S G GA GP T G T G + + E+C
Sbjct: 221 CKGVPPSKLVLGIPFYGRGWTLVDGSNNGPGAPFTGPATGGPGTWEGGVVDYREICKLLG 280
Query: 303 DKTWRHFTDSNGEPFMVKKD--EWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLD 356
+ D+ P+ +W++Y++ SI+ K +Y+KD+ LGG M+W LD D
Sbjct: 281 AT--VVYDDTAKAPYAYNPGTGQWVSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334
Score = 119 bits (300), Expect = 9e-29
Identities = 40/129 (31%), Positives = 73/129 (56%), Gaps = 6/129 (4%)
Query: 2191 KVICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFY 2250
+V+ Y+T+W + + +DI S TH++YAFA++D + + + D+
Sbjct: 1 RVVGYFTNWGVYGR---NFPVDDIPASKLTHIIYAFANIDP-DGTVTIGDEWADIGNFGQ 56
Query: 2251 EKITKFNERGIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEFA 2310
K K G++V+L+IGGW++S D +S +++D +R+ F+ + FL+ YGF+G++
Sbjct: 57 LKALKKKNPGLKVLLSIGGWTES--DNFSSMLSDPASRKKFIDSIVSFLKKYGFDGIDID 114
Query: 2311 WEYPKCWQT 2319
WEYP
Sbjct: 115 WEYPGGRGD 123
Score = 67.7 bits (166), Expect = 2e-11
Identities = 28/60 (46%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
Query: 2323 VKSHEWTVVRDSLGARGPYAYNG--KLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLD 2380
K TVV D A+ PYAYN W+SYDD R I+ K +K GLGG M+W +D D
Sbjct: 276 CKLLGATVVYDDT-AKAPYAYNPGTGQWVSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334
Score = 35.0 bits (81), Expect = 0.44
Identities = 7/20 (35%), Positives = 11/20 (55%)
Query: 871 VKFIKSNGYAGIMAWTIDLD 890
++K G G+M W +D D
Sbjct: 315 ADYVKDKGLGGVMIWELDAD 334
>gnl|CDD|216071 pfam00704, Glyco_hydro_18, Glycosyl hydrolases family 18.
Length = 325
Score = 332 bits (852), Expect = e-102
Identities = 128/352 (36%), Positives = 179/352 (50%), Gaps = 33/352 (9%)
Query: 1711 FKIVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKF 1770
+IV Y+T W Y + + DI +D TH+IY FA +D + DT D
Sbjct: 1 GRIVGYYTQWGNYGEG---FPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKGC 57
Query: 1771 YEKVTALKKK---GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDG 1827
+E++ LKK GVKV L+IGGW S +S L + R F +++F+ ++ FDG
Sbjct: 58 FEQLKDLKKCQNPGVKVLLSIGGWTFS--GGFSLLASDDAKRKTFADSIIDFLKKYGFDG 115
Query: 1828 LDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPH---DLLLSAAVSPSKAVIDN 1884
+D+DWEYP DK + L+KELRAA LLSAAV ID
Sbjct: 116 IDIDWEYPGGKG--------DDKDNYTALLKELRAALKKEAKAGYLLSAAVPAGPDKIDG 167
Query: 1885 AYDIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHGA 1944
DI + + LD+I++MTYD+HG W TG AP+Y N +Y++ Y++ G
Sbjct: 168 -SDIAKIGKYLDFINLMTYDFHG-WSNITGPNAPLYDGSWQ-----NVDYAVQYYLKAGV 220
Query: 1945 DRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKIQKDGW 2004
K++ G+P YG+ +TL + + NG G G T G L+Y EIC + K G
Sbjct: 221 PASKLVLGIPFYGRGWTLVNGSGNG-----GGAPAPGPGTWEGGILSYKEICALL-KSGA 274
Query: 2005 VVVRDRKRRIGPYAFKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLD 2056
D PY +KG Q+V +DD I KA++VK LGG MIW+LD D
Sbjct: 275 GPGYD-DTAKAPYIYKGKQFVSYDDPRSIKAKAKYVKDKGLGGVMIWSLDQD 325
Score = 227 bits (580), Expect = 4e-66
Identities = 107/412 (25%), Positives = 170/412 (41%), Gaps = 92/412 (22%)
Query: 383 PKVVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNF 442
++V Y T W + F ++I THI+YAFA++D N + E D
Sbjct: 1 GRIVGYYTQWGNYGEG---FPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKGC 57
Query: 443 YQRISSSPLVTHGKVKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAG 502
++++ + VK+L++IGGWT S G +S L S + RK F S++ FL+++ F G
Sbjct: 58 FEQLKDLKKCQNPGVKVLLSIGGWTFSGG--FSLLASDDAKRKTFADSIIDFLKKYGFDG 115
Query: 503 LHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDKHDY-----SIAVGISGYKEIL 557
+ DW YP DK N+ L++EL+A K S AV K
Sbjct: 116 IDIDWEYPGGKG--------DDKDNYTALLKELRAALKKEAKAGYLLSAAVPAGPDK--- 164
Query: 558 EVAYDFPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNYNINTALKLIDEL 617
D + +++F++LM+YD+HG W ITG +PL + N++ A++ +
Sbjct: 165 IDGSDIAKIGKYLDFINLMTYDFHG-WSNITGPNAPLYDGSWQ-----NVDYAVQYYLKA 218
Query: 618 GGDKRKIVIGVPFYGQSYTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTR 677
G K+V+G+PFYG+ +TL +G GA GP T + G+L+Y EIC +K+
Sbjct: 219 GVPASKLVLGIPFYGRGWTLVNGSGNGGGAPAPGP-----GTWEGGILSYKEICALLKS- 272
Query: 678 KWTVQRNTKSLEPRSMGTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTRK 737
+ P Y + Y+ IK
Sbjct: 273 GAGPGYD---------------------DTAKAP-----YIYKGKQFVSYDDPRSIKA-- 304
Query: 738 WTVQRNTKSLEPFTFQGDQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLD 789
K K++K G G+M W++D D
Sbjct: 305 -------------------------------KAKYVKDKGLGGVMIWSLDQD 325
Score = 205 bits (524), Expect = 1e-58
Identities = 86/258 (33%), Positives = 127/258 (49%), Gaps = 40/258 (15%)
Query: 1340 FKIVCYFTNWAWYRPGKGKYVLKKVPW-----INILKLYVTLQTGLGTDQEKENTFLKIS 1394
+IV Y+T W Y G + L +P I + G + + E+ LK
Sbjct: 1 GRIVGYYTQWGNYGEG---FPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKG- 56
Query: 1395 ELTYVHILFYERVVTLKKK---GVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVK 1451
+E++ LKK GVKV L+IGGW S G +S L + R+ F + ++
Sbjct: 57 --------CFEQLKDLKKCQNPGVKVLLSIGGWTFS--GGFSLLASDDAKRKTFADSIID 106
Query: 1452 FLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPH---GLLLSAAV 1508
FL KY FDG+D+DWEYP DK+++ ++EL A K G LLSAAV
Sbjct: 107 FLKKYGFDGIDIDWEYPGGKG--------DDKDNYTALLKELRAALKKEAKAGYLLSAAV 158
Query: 1509 SPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANFT 1568
I D+ + + LD+I++MTYD+HG W TG APLY+ N ++
Sbjct: 159 PAGPDKI-DGSDIAKIGKYLDFINLMTYDFHG-WSNITGPNAPLYDGSWQ-----NVDYA 211
Query: 1569 MNYWMKKGAPSRKLVMGV 1586
+ Y++K G P+ KLV+G+
Sbjct: 212 VQYYLKAGVPASKLVLGI 229
Score = 183 bits (465), Expect = 9e-51
Identities = 95/353 (26%), Positives = 164/353 (46%), Gaps = 37/353 (10%)
Query: 13 HQVVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAIDPVSRALIPEDLEYDVIKGG 72
++V Y Y F + ++ TH+IYA+A ID D E D +KG
Sbjct: 1 GRIVGYYTQWGNYGE---GFPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKGC 57
Query: 73 YKSFLGLKEA-NPELKVYLAV-----KSNFVSITSDRESRLNFISSVLEMFDMYKFDGLD 126
++ LK+ NP +KV L++ F + SD R F S+++ Y FDG+D
Sbjct: 58 FEQLKDLKKCQNPGVKVLLSIGGWTFSGGFSLLASDDAKRKTFADSIIDFLKKYGFDGID 117
Query: 127 LNVKDPALNDEDDDDLESIANERSDFSTFIQELSS---TLRRNNYQLTLTSPGVIDRKTS 183
++ + P +D + +++ ++EL + + Y L+ P D K
Sbjct: 118 IDWEYPGGKGDD----------KDNYTALLKELRAALKKEAKAGYLLSAAVPAGPD-KID 166
Query: 184 LVDISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAVYNWIK 243
DI+ + +D I L +++ + P P + Q N++ AV ++K
Sbjct: 167 GSDIAKIGKYLDFINLMTYDFHGWSNITGPNAPLYDGSWQ---------NVDYAVQYYLK 217
Query: 244 KGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGPGTEGKYTQMPGYLAFFEVCNKFKD 303
G ++++GIPF+G+ + L N S G GA GPGT G L++ E+C K
Sbjct: 218 AGVPASKLVLGIPFYGRGWTLVNGSGNGGGAPAPGPGTWE-----GGILSYKEICALLKS 272
Query: 304 KTWRHFTDSNGEPFMVKKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLD 356
+ D+ P++ K ++++Y++ SI+ K Y+KD+ LGG M+W+LD D
Sbjct: 273 GAGPGYDDTAKAPYIYKGKQFVSYDDPRSIKAKAKYVKDKGLGGVMIWSLDQD 325
Score = 112 bits (281), Expect = 2e-26
Identities = 39/133 (29%), Positives = 70/133 (52%), Gaps = 8/133 (6%)
Query: 2190 YKVICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGF 2249
+++ YYT W + +DI TH++YAFA++D + + + D +G
Sbjct: 1 GRIVGYYTQWG---NYGEGFPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKGC 57
Query: 2250 YEKITKFNE---RGIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNG 2306
+E++ + G++V+L+IGGW+ S +S L +D R+ F + +FL+ YGF+G
Sbjct: 58 FEQLKDLKKCQNPGVKVLLSIGGWTFS--GGFSLLASDDAKRKTFADSIIDFLKKYGFDG 115
Query: 2307 LEFAWEYPKCWQT 2319
++ WEYP
Sbjct: 116 IDIDWEYPGGKGD 128
Score = 67.1 bits (164), Expect = 3e-11
Identities = 28/64 (43%), Positives = 38/64 (59%), Gaps = 2/64 (3%)
Query: 2317 WQTCSTVKSHEWTVVRDSLGARGPYAYNGKLWLSYDDTRDIRRKCKSIKQDGLGGAMVWS 2376
+ C+ +KS D A+ PY Y GK ++SYDD R I+ K K +K GLGG M+WS
Sbjct: 264 KEICALLKS-GAGPGYDD-TAKAPYIYKGKQFVSYDDPRSIKAKAKYVKDKGLGGVMIWS 321
Query: 2377 IDLD 2380
+D D
Sbjct: 322 LDQD 325
Score = 34.0 bits (78), Expect = 1.0
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 871 VKFIKSNGYAGIMAWTIDLD 890
K++K G G+M W++D D
Sbjct: 306 AKYVKDKGLGGVMIWSLDQD 325
>gnl|CDD|119352 cd02873, GH18_IDGF, The IDGF's (imaginal disc growth factors) are a
family of growth factors identified in insects that
include at least five members, some of which are encoded
by genes in a tight cluster. The IDGF's have an
eight-stranded alpha/beta barrel fold and are related to
the glycosyl hydrolase family 18 (GH18) chitinases, but
they have an amino acid substitution known to abolish
chitinase catalytic activity. IDGFs may have evolved from
chitinases to gain new functions as growth factors,
interacting with cell surface glycoproteins involved in
growth-promoting processes.
Length = 413
Score = 243 bits (622), Expect = 2e-70
Identities = 123/416 (29%), Positives = 204/416 (49%), Gaps = 61/416 (14%)
Query: 1712 KIVCYFTNWAWYRQSGGKYLPSDIDSDL--CTHVIYGFAVLDTDQLVIKPHDTWADLDNK 1769
K+VCY+ + ++ R+ K D++ L CTH++YG+A +D D IK + DLD
Sbjct: 1 KLVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIKSLNEDLDLDKS 60
Query: 1770 FYEKVTALKKK--GVKVTLAIGGWNDS----AGNKYSRLVNSQQARSKFIAHVVNFILEH 1823
Y +T+LK+K +KV L++GG D+ KY L+ S ++R+ FI + + +
Sbjct: 61 HYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKTY 120
Query: 1824 NFDGLDLDWEYPK------------CWQ-----------VDCKQGPASDKQGFADLIKEL 1860
FDGLDL W++PK W VD K A K+ F L++EL
Sbjct: 121 GFDGLDLAWQFPKNKPKKVRGTFGSAWHSFKKLFTGDSVVDEKA--AEHKEQFTALVREL 178
Query: 1861 RAAFNPHDLLLSAAVSP---SKAVIDNAYDIPVMSENLDWISVMTYDYHG-QWDKKTGH- 1915
+ A P LLL+ V P S +D+P ++ N+D++++ T+D+ + + +
Sbjct: 179 KNALRPDGLLLTLTVLPHVNSTWY----FDVPAIANNVDFVNLATFDFLTPERNPEEADY 234
Query: 1916 VAPMYALPNDTTPTFNANYSLHYWVSHGADRKKVIFGMPMYGQSFTL-ADKNKNGL--NS 1972
AP+Y + P N +Y + YW++ G K+ G+ YG+++ L D G+
Sbjct: 235 TAPIYE-LYERNPHHNVDYQVKYWLNQGTPASKLNLGIATYGRAWKLTKDSGITGVPPVL 293
Query: 1973 QTYGGAEAGENTRARGFLAYYEICDKIQKDGWVV--------VRDRKRRIGPYAF----- 2019
+T G AG T+ G L++ EIC K+ + V D +R G YA+
Sbjct: 294 ETDGPGPAGPQTKTPGLLSWPEICSKLPNPANLKGADAPLRKVGDPTKRFGSYAYRPADE 353
Query: 2020 KGDQ--WVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDFKNFCGCESYPLLKTI 2073
G+ WV ++D +KA + K LGG ++ L LDDF+ C + +P+L++
Sbjct: 354 NGEHGIWVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDFRGQCTGDKFPILRSA 409
Score = 174 bits (442), Expect = 1e-46
Identities = 118/423 (27%), Positives = 186/423 (43%), Gaps = 70/423 (16%)
Query: 14 QVVCYVEAKSAYRHRPATFNVKNVIP--QTCTHVIYAYAAIDPVSRALIPEDLEYDVIKG 71
++VCY ++KS R A +++++ P Q CTH++Y YA ID + + + + D+ K
Sbjct: 1 KLVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIKSLNEDLDLDKS 60
Query: 72 GYKSFLGLKEANPELKVYLAVKSN-----------FVSITSDRESRLNFISSVLEMFDMY 120
Y++ LK P LKV L+V + ++ + ESR FI+S + Y
Sbjct: 61 HYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKTY 120
Query: 121 KFDGLDLNVKDPALNDE--DDDDLESI------------------ANERSDFSTFIQELS 160
FDGLDL + P N S A + F+ ++EL
Sbjct: 121 GFDGLDLAWQFPK-NKPKKVRGTFGSAWHSFKKLFTGDSVVDEKAAEHKEQFTALVRELK 179
Query: 161 STLRRNNYQLTLTS-PGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTK 219
+ LR + LTLT P V T D+ +A VD + L +F D + E P + +
Sbjct: 180 NALRPDGLLLTLTVLPHV--NSTWYFDVPAIANNVDFVNLATF--DFLTPERNPEEADYT 235
Query: 220 VNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSE---YGLGATV 276
I +N++ V W+ +G ++ +GI +G++++L S
Sbjct: 236 APIYELYERNPHHNVDYQVKYWLNQGTPASKLNLGIATYGRAWKLTKDSGITGVPPVLET 295
Query: 277 KGPGTEGKYTQMPGYLAFFEVCNKFK----------------DKTWRHFT------DSNG 314
GPG G T+ PG L++ E+C+K D T R + D NG
Sbjct: 296 DGPGPAGPQTKTPGLLSWPEICSKLPNPANLKGADAPLRKVGDPTKRFGSYAYRPADENG 355
Query: 315 EPFMVKKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDDFRGFC-GQKYPLLSAVV 373
E + W++YE+ D+ K Y K + LGG L+ L LDDFRG C G K+P+L +
Sbjct: 356 EHGI-----WVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDFRGQCTGDKFPILRSAK 410
Query: 374 SNL 376
L
Sbjct: 411 YRL 413
Score = 155 bits (393), Expect = 3e-40
Identities = 90/330 (27%), Positives = 157/330 (47%), Gaps = 49/330 (14%)
Query: 384 KVVCYVTNWSGSRKSDGKFVPENIDYKL--CTHIVYAFASLDPNTLSIQAGNPEADIDDN 441
K+VCY + S R+ K E+++ L CTH+VY +A +D +T I++ N + D+D +
Sbjct: 1 KLVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIKSLNEDLDLDKS 60
Query: 442 FYQRISSSPLVTHGKVKILIAIGGWTDSSGE----KYSQLISSGSNRKKFIKSVLTFLRR 497
Y+ I+S + +K+L+++GG D+ E KY L+ S +R FI S + L+
Sbjct: 61 HYRAITSLKR-KYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKT 119
Query: 498 FDFAGLHFDWNYPV------------CWQA-------DCSKQHKAD--KGNFVKLIQELK 536
+ F GL W +P W + D KA K F L++ELK
Sbjct: 120 YGFDGLDLAWQFPKNKPKKVRGTFGSAWHSFKKLFTGDSVVDEKAAEHKEQFTALVRELK 179
Query: 537 AEFDKHDYSIAVGISGYKEIL-----EVAYDFPALNDHVEFMSLMSYDYHGAWEG--ITG 589
+ + + L +D PA+ ++V+F++L ++D+
Sbjct: 180 NALRPDGLLLTLTV------LPHVNSTWYFDVPAIANNVDFVNLATFDFLTPERNPEEAD 233
Query: 590 LVSPLNSRPGEPYPNYNINTALKLIDELGGDKRKIVIGVPFYGQSYTLKLDKEHGL---- 645
+P+ P++N++ +K G K+ +G+ YG+++ KL K+ G+
Sbjct: 234 YTAPIYELYER-NPHHNVDYQVKYWLNQGTPASKLNLGIATYGRAW--KLTKDSGITGVP 290
Query: 646 -GAETDGPGLAGEYTQQPGMLAYYEICYRI 674
ETDGPG AG T+ PG+L++ EIC ++
Sbjct: 291 PVLETDGPGPAGPQTKTPGLLSWPEICSKL 320
Score = 123 bits (311), Expect = 2e-29
Identities = 79/285 (27%), Positives = 130/285 (45%), Gaps = 53/285 (18%)
Query: 1341 KIVCYFTNWAWYRPGKGKYVLKKVPWINILKLYVTLQTGL-----GTDQEKENTFLKISE 1395
K+VCY+ + ++ R G K L L+ + T L G D + KI
Sbjct: 1 KLVCYYDSKSYLREGLAKMSL------EDLEPALQFCTHLVYGYAGIDADT----YKIKS 50
Query: 1396 LTY---VHILFYERVVTLKKK--GVKVSLAIGGWNDSLGG----KYSRLVNSATARQRFI 1446
L + Y + +LK+K +KV L++GG D+ KY L+ S+ +R FI
Sbjct: 51 LNEDLDLDKSHYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFI 110
Query: 1447 EHVVKFLLKYQFDGLDLDWEYPT------------CWQ-----VNCDAGPDSD----KES 1485
L Y FDGLDL W++P W D+ D KE
Sbjct: 111 NSAHSLLKTYGFDGLDLAWQFPKNKPKKVRGTFGSAWHSFKKLFTGDSVVDEKAAEHKEQ 170
Query: 1486 FGLFVRELHQAFKPHGLLLSAAVSPSKQVINAA--YDVKALSESLDWISVMTYDYHG-QW 1542
F VREL A +P GLLL+ V P +N+ +DV A++ ++D++++ T+D+ +
Sbjct: 171 FTALVRELKNALRPDGLLLTLTVLPH---VNSTWYFDVPAIANNVDFVNLATFDFLTPER 227
Query: 1543 DKKTGH-VAPLYEHPDDDFFYFNANFTMNYWMKKGAPSRKLVMGV 1586
+ + AP+Y + + N ++ + YW+ +G P+ KL +G+
Sbjct: 228 NPEEADYTAPIY-ELYERNPHHNVDYQVKYWLNQGTPASKLNLGI 271
Score = 112 bits (282), Expect = 8e-26
Identities = 50/133 (37%), Positives = 79/133 (59%), Gaps = 8/133 (6%)
Query: 2191 KVICYYTSWAYLRQAEGKYSPEDIDGSL--CTHLVYAFADLDEQELVIRSHNNKLDLEQG 2248
K++CYY S +YLR+ K S ED++ +L CTHLVY +A +D I+S N LDL++
Sbjct: 1 KLVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIKSLNEDLDLDKS 60
Query: 2249 FYEKITKFNER--GIQVILAIGGWSDS----RSDKYSRLVNDIPARRNFVRHVTEFLEFY 2302
Y IT + ++V+L++GG D+ ++KY L+ +R F+ L+ Y
Sbjct: 61 HYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKTY 120
Query: 2303 GFNGLEFAWEYPK 2315
GF+GL+ AW++PK
Sbjct: 121 GFDGLDLAWQFPK 133
Score = 81.2 bits (201), Expect = 1e-15
Identities = 45/135 (33%), Positives = 66/135 (48%), Gaps = 25/135 (18%)
Query: 695 TLKLDKEHGL-----GAETDGPGLAGEYTQQPGMLAYYEICYRIKT-----------RKW 738
KL K+ G+ ETDGPG AG T+ PG+L++ EIC ++ RK
Sbjct: 277 AWKLTKDSGITGVPPVLETDGPGPAGPQTKTPGLLSWPEICSKLPNPANLKGADAPLRK- 335
Query: 739 TVQRNTKSLEPFTF-----QGDQ--WVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLDDF 791
V TK + + G+ WVSYEDP + K + K+ G G+ + + LDDF
Sbjct: 336 -VGDPTKRFGSYAYRPADENGEHGIWVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDF 394
Query: 792 HNKCCMESFPLLRAV 806
+C + FP+LR+
Sbjct: 395 RGQCTGDKFPILRSA 409
Score = 50.0 bits (120), Expect = 1e-05
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 7/70 (10%)
Query: 2339 GPYAY-----NGK--LWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLDDFQNLCSCGPY 2391
G YAY NG+ +W+SY+D K K GLGG ++ + LDDF+ C+ +
Sbjct: 344 GSYAYRPADENGEHGIWVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDFRGQCTGDKF 403
Query: 2392 PLLSAVNYEL 2401
P+L + Y L
Sbjct: 404 PILRSAKYRL 413
>gnl|CDD|119365 cd06548, GH18_chitinase, The GH18 (glycosyl hydrolases, family 18)
type II chitinases hydrolyze chitin, an abundant polymer
of N-acetylglucosamine and have been identified in
bacteria, fungi, insects, plants, viruses, and protozoan
parasites. The structure of this domain is an
eight-stranded alpha/beta barrel with a pronounced
active-site cleft at the C-terminal end of the
beta-barrel.
Length = 322
Score = 230 bits (588), Expect = 4e-67
Identities = 108/370 (29%), Positives = 167/370 (45%), Gaps = 74/370 (20%)
Query: 1713 IVCYFTNWAWYRQSGGKYLPS-DIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFY 1771
+V YFTNW Y G Y + DI +D TH+ Y FA +D D V+ D AD +
Sbjct: 1 VVGYFTNWGIY---GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSV 57
Query: 1772 EKVT---------------ALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIA 1814
+ LK+K +K+ L+IGGW S G +S ++ +R+KF
Sbjct: 58 DGGADTDDQPLKGNFGQLRKLKQKNPHLKILLSIGGWTWSGG--FSDAAATEASRAKFAD 115
Query: 1815 HVVNFILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFN------PHD 1868
V+FI ++ FDG+D+DWEYP DK+ F L+KELR A +
Sbjct: 116 SAVDFIRKYGFDGIDIDWEYPGSGGAPGNVARPEDKENFTLLLKELREALDALGAETGRK 175
Query: 1869 LLLSAAVSPSKAVIDNAYDIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTP 1928
LL+ A +D ++ +++ LD+I++MTYD+HG W TGH + +YA P D
Sbjct: 176 YLLTIAAPAGPDKLDKL-EVAEIAKYLDFINLMTYDFHGAWSNTTGHHSNLYASPADPPG 234
Query: 1929 TFNANYSLHYWVSHGADRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARG 1988
++ + +++Y++S G +K++ G+P YG RG
Sbjct: 235 GYSVDAAVNYYLSAGVPPEKLVLGVPFYG-----------------------------RG 265
Query: 1989 FLAYYEICDKIQKDGWVVVRDRKRRIGPYAFKGD--QWVGFDDQAMIHHKAEFVKYNDLG 2046
+ G+ D + PY + ++ +DD I KA++VK LG
Sbjct: 266 W------------TGYTRYWDEVAK-APYLYNPSTKTFISYDDPRSIKAKADYVKDKGLG 312
Query: 2047 GAMIWALDLD 2056
G M W L D
Sbjct: 313 GVMFWELSGD 322
Score = 176 bits (448), Expect = 2e-48
Identities = 82/274 (29%), Positives = 128/274 (46%), Gaps = 37/274 (13%)
Query: 385 VVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDD---- 440
VV Y TNW FV ++I THI YAFA +D + + + + AD
Sbjct: 1 VVGYFTNWGIY--GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSVD 58
Query: 441 ------------NFYQRISSSPLVT----HGKVKILIAIGGWTDSSGEKYSQLISSGSNR 484
NF Q L + +KIL++IGGWT S +S ++ ++R
Sbjct: 59 GGADTDDQPLKGNFGQ------LRKLKQKNPHLKILLSIGGWTWS--GGFSDAAATEASR 110
Query: 485 KKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDKH-- 542
KF S + F+R++ F G+ DW YP A + DK NF L++EL+ D
Sbjct: 111 AKFADSAVDFIRKYGFDGIDIDWEYPGSGGAPGNVARPEDKENFTLLLKELREALDALGA 170
Query: 543 ----DYSIAVGISGYKEILEVAYDFPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRP 598
Y + + + L+ + + +++F++LM+YD+HGAW TG S L + P
Sbjct: 171 ETGRKYLLTIAAPAGPDKLDKL-EVAEIAKYLDFINLMTYDFHGAWSNTTGHHSNLYASP 229
Query: 599 GEPYPNYNINTALKLIDELGGDKRKIVIGVPFYG 632
+P Y+++ A+ G K+V+GVPFYG
Sbjct: 230 ADPPGGYSVDAAVNYYLSAGVPPEKLVLGVPFYG 263
Score = 170 bits (434), Expect = 1e-46
Identities = 78/265 (29%), Positives = 128/265 (48%), Gaps = 26/265 (9%)
Query: 1342 IVCYFTNWAWYRPGKGKYVLKKVPWINILKLY------------VTLQTGLGTDQEKENT 1389
+V YFTNW Y G+ +V +P + + VT + +
Sbjct: 1 VVGYFTNWGIY--GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSVD 58
Query: 1390 FLKISELTYVHILFYERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIE 1447
++ + F + + LK+K +K+ L+IGGW + G +S + +R +F +
Sbjct: 59 GGADTDDQPLKGNFGQ-LRKLKQKNPHLKILLSIGGW--TWSGGFSDAAATEASRAKFAD 115
Query: 1448 HVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGL----- 1502
V F+ KY FDG+D+DWEYP + DKE+F L ++EL +A G
Sbjct: 116 SAVDFIRKYGFDGIDIDWEYPGSGGAPGNVARPEDKENFTLLLKELREALDALGAETGRK 175
Query: 1503 -LLSAAVSPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFF 1561
LL+ A ++ +V +++ LD+I++MTYD+HG W TGH + LY P D
Sbjct: 176 YLLTIAAPAGPDKLDKL-EVAEIAKYLDFINLMTYDFHGAWSNTTGHHSNLYASPADPPG 234
Query: 1562 YFNANFTMNYWMKKGAPSRKLVMGV 1586
++ + +NY++ G P KLV+GV
Sbjct: 235 GYSVDAAVNYYLSAGVPPEKLVLGV 259
Score = 115 bits (289), Expect = 2e-27
Identities = 78/379 (20%), Positives = 140/379 (36%), Gaps = 94/379 (24%)
Query: 15 VVCYVEAKSAYRHRPATFNVKNVIPQT-CTHVIYAYAAIDP-----------VSRALIPE 62
VV Y Y + V + IP TH+ YA+A ID A
Sbjct: 1 VVGYFTNWGIY---GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSV 57
Query: 63 DLEYDVIKGGYKSFLG----LKEANPELKVYLAV----KS-NFVSITSDRESRLNFISSV 113
D D K G LK+ NP LK+ L++ S F + SR F S
Sbjct: 58 DGGADTDDQPLKGNFGQLRKLKQKNPHLKILLSIGGWTWSGGFSDAAATEASRAKFADSA 117
Query: 114 LEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFSTFIQELSSTL------RRNN 167
++ Y FDG+D++ + P + ++ +F+ ++EL L
Sbjct: 118 VDFIRKYGFDGIDIDWEYP--GSGGAPGNVARPEDKENFTLLLKELREALDALGAETGRK 175
Query: 168 YQLTLTSPGVIDRKTSLVDISVVAPLVDLILLKSF--------NNDHMDDEVVPVKPNTK 219
Y LT+ +P K ++++ +A +D I L ++ H + +
Sbjct: 176 YLLTIAAPAG-PDKLDKLEVAEIAKYLDFINLMTYDFHGAWSNTTGHHS----NLYASPA 230
Query: 220 VNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGP 279
++++AV ++ G PE++++G+PF+G+ + +
Sbjct: 231 DP-------PGGYSVDAAVNYYLSAGVPPEKLVLGVPFYGRGWTGYT------------- 270
Query: 280 GTEGKYTQMPGYLAFFEVCNKFKDKTWRHFTDSNGEPFMVKKD--EWITYENNDSIRRKM 337
R++ + P++ +I+Y++ SI+ K
Sbjct: 271 ---------------------------RYWDEVAKAPYLYNPSTKTFISYDDPRSIKAKA 303
Query: 338 NYIKDRHLGGAMLWTLDLD 356
+Y+KD+ LGG M W L D
Sbjct: 304 DYVKDKGLGGVMFWELSGD 322
Score = 75.7 bits (187), Expect = 4e-14
Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 23/141 (16%)
Query: 2192 VICYYTSWAYLRQAEGKYSPE-DIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFY 2250
V+ Y+T+W Y DI TH+ YAFAD+D V+ S + D
Sbjct: 1 VVGYFTNW---GIYGRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSV 57
Query: 2251 EKITKFNERGIQ-----------------VILAIGGWSDSRSDKYSRLVNDIPARRNFVR 2293
+ +++ ++ ++L+IGGW+ S +S +R F
Sbjct: 58 DGGADTDDQPLKGNFGQLRKLKQKNPHLKILLSIGGWT--WSGGFSDAAATEASRAKFAD 115
Query: 2294 HVTEFLEFYGFNGLEFAWEYP 2314
+F+ YGF+G++ WEYP
Sbjct: 116 SAVDFIRKYGFDGIDIDWEYP 136
Score = 61.5 bits (150), Expect = 2e-09
Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 3/55 (5%)
Query: 2328 WTVVRDSLGARGPYAYNG--KLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLD 2380
+T D + A+ PY YN K ++SYDD R I+ K +K GLGG M W + D
Sbjct: 269 YTRYWDEV-AKAPYLYNPSTKTFISYDDPRSIKAKADYVKDKGLGGVMFWELSGD 322
Score = 57.3 bits (139), Expect = 4e-08
Identities = 14/56 (25%), Positives = 27/56 (48%), Gaps = 2/56 (3%)
Query: 736 RKWTVQRNTKSLEPFTFQGD--QWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLD 789
+T + + P+ + ++SY+DP S+ K ++K G G+M W + D
Sbjct: 267 TGYTRYWDEVAKAPYLYNPSTKTFISYDDPRSIKAKADYVKDKGLGGVMFWELSGD 322
Score = 31.4 bits (72), Expect = 5.3
Identities = 6/20 (30%), Positives = 10/20 (50%)
Query: 871 VKFIKSNGYAGIMAWTIDLD 890
++K G G+M W + D
Sbjct: 303 ADYVKDKGLGGVMFWELSGD 322
>gnl|CDD|225862 COG3325, ChiA, Chitinase [Carbohydrate transport and metabolism].
Length = 441
Score = 185 bits (472), Expect = 3e-50
Identities = 106/397 (26%), Positives = 169/397 (42%), Gaps = 58/397 (14%)
Query: 1709 DEFKIVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFA------------VLDTDQLV 1756
D+FK+V Y+T+W+ Y Y P DI D TH+ Y F V D L
Sbjct: 36 DQFKVVGYYTSWSQY--DRQDYFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAALY 93
Query: 1757 IKP------HDTWADLDNKFYEKVTALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQA 1808
P D W+D + + LK +K ++IGGW+DS G +S + +
Sbjct: 94 GVPNIEGVELDPWSDPLKGHFGALFDLKATYPDLKTLISIGGWSDSGG--FSDMAADDAS 151
Query: 1809 RSKFIAHVVNFILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAF---- 1864
R F V F+ + FDG+D+DWEYP G DK + L++ELR
Sbjct: 152 RENFAKSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKDKANYVLLLQELRKKLDKAG 211
Query: 1865 --NPHDLLLSAAVSPSKAVIDNAYDIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYAL 1922
+ L+ A SK ++ + +++ +D+I++MTYD+HG W++ GH A +Y
Sbjct: 212 VEDGRHYQLTIAAPASKDKLEGL-NHAEIAQYVDYINIMTYDFHGAWNETLGHHAALYGT 270
Query: 1923 PNDTTPT-----FNANYSLHYWVSHGAD----RKKVIFGMPMYGQSFTLADKNKNGLNSQ 1973
P D +A W+ G K++ GMP YG+ + D G
Sbjct: 271 PKDPPLANGGFYVDAEVDGIDWLEEGFAGDVPPSKLVLGMPFYGRGWNGVD---GGSLGT 327
Query: 1974 TYGGAEAGENTRAR-----------GFLAYYEICD-KIQKDGWVVVRDRKRRIGPYAF-- 2019
G + +N+ + Y++ K+G+ D + PY +
Sbjct: 328 CPGLYQGLDNSGIPKGTWEAGNGDKDYGKAYDLDANNAGKNGYERYWDDVAKA-PYLYNP 386
Query: 2020 KGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLD 2056
+ ++ +DD + KAE+V N+LGG M W + D
Sbjct: 387 EKGVFISYDDPRSVKAKAEYVADNNLGGMMFWEISGD 423
Score = 162 bits (413), Expect = 1e-42
Identities = 100/444 (22%), Positives = 174/444 (39%), Gaps = 91/444 (20%)
Query: 381 PEPKVVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEAD--- 437
+ KVV Y T+WS + P +I THI YAF ++ + SI++ +
Sbjct: 36 DQFKVVGYYTSWSQY--DRQDYFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAALY 93
Query: 438 IDDNFYQRI---SSSPLV-----------THGKVKILIAIGGWTDSSGEKYSQLISSGSN 483
N S PL T+ +K LI+IGGW+DS G +S + + ++
Sbjct: 94 GVPNIEGVELDPWSDPLKGHFGALFDLKATYPDLKTLISIGGWSDSGG--FSDMAADDAS 151
Query: 484 RKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDK-- 541
R+ F KS + F+R + F G+ DW YP + DK N+V L+QEL+ + DK
Sbjct: 152 RENFAKSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKDKANYVLLLQELRKKLDKAG 211
Query: 542 ----HDYSIAVGISGYKEILEVAYDFPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSR 597
Y + + K+ LE + + +V+++++M+YD+HGAW G + L
Sbjct: 212 VEDGRHYQLTIAAPASKDKLE-GLNHAEIAQYVDYINIMTYDFHGAWNETLGHHAALYGT 270
Query: 598 PGEPYPN---------YNINTALKLIDELGGDKRKIVIGVPFYGQSYTLKLDKEHGLGAE 648
P +P + L+ K+V+G+PFYG+ + +G+
Sbjct: 271 PKDPPLANGGFYVDAEVDGIDWLEEGFAGDVPPSKLVLGMPFYGRGW-------NGVDGG 323
Query: 649 TDGPGLAGEYTQQPGMLAYYEICYRIKTRKWTVQRNTKSLEPRSMGTLKLDKEHGLGAET 708
+ G G Y + + DK++G
Sbjct: 324 SLGTCP-GLYQGLDNS----------------------GIPKGTWEAGNGDKDYG----- 355
Query: 709 DGPGLAGEYTQQPGMLAYYEICY-RIKTRKWTVQRNTKSLEPFTF--QGDQWVSYEDPTS 765
Y++ + + + P+ + + ++SY+DP S
Sbjct: 356 ----------------KAYDLDANNAGKNGYERYWDDVAKAPYLYNPEKGVFISYDDPRS 399
Query: 766 LTEKVKFIKSNGYAGIMAWTIDLD 789
+ K +++ N G+M W I D
Sbjct: 400 VKAKAEYVADNNLGGMMFWEISGD 423
Score = 137 bits (346), Expect = 7e-34
Identities = 77/283 (27%), Positives = 126/283 (44%), Gaps = 40/283 (14%)
Query: 1337 DQYFKIVCYFTNWAWYRPGKGKYVLKKVPWINILKLY----------------VTLQTGL 1380
D FK+V Y+T+W+ Y + Y +P + + V + L
Sbjct: 35 DDQFKVVGYYTSWSQY--DRQDYFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAAL 92
Query: 1381 GTDQEKENTFLKISELTYVHILFYERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNS 1438
E L + F + LK +K ++IGGW+DS G +S +
Sbjct: 93 YGVPNIEGVEL-DPWSDPLKGHFGA-LFDLKATYPDLKTLISIGGWSDS--GGFSDMAAD 148
Query: 1439 ATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFK 1498
+R+ F + V+F+ Y FDG+D+DWEYP + G DK ++ L ++EL +
Sbjct: 149 DASRENFAKSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKDKANYVLLLQELRKKLD 208
Query: 1499 PHGL------LLSAAVSPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPL 1552
G+ L+ A SK + +++ +D+I++MTYD+HG W++ GH A L
Sbjct: 209 KAGVEDGRHYQLTIAAPASKD-KLEGLNHAEIAQYVDYINIMTYDFHGAWNETLGHHAAL 267
Query: 1553 YEHPDD-----DFFYFNANFTMNYWMKKG----APSRKLVMGV 1586
Y P D FY +A W+++G P KLV+G+
Sbjct: 268 YGTPKDPPLANGGFYVDAEVDGIDWLEEGFAGDVPPSKLVLGM 310
Score = 96.0 bits (239), Expect = 3e-20
Identities = 84/419 (20%), Positives = 160/419 (38%), Gaps = 88/419 (21%)
Query: 3 NTAYFSQSTKHQVVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAI---------- 52
N + + +VV Y + S Y + ++ TH+ YA+ I
Sbjct: 28 NNKAHTSDDQFKVVGYYTSWSQYD--RQDYFPGDIPLDQLTHINYAFLDINSDGKSIESW 85
Query: 53 -----------DPVSRALIPEDLEYDVIKGGYKSFLGLKEANPELKVYLAV----KSN-F 96
+ L P D +KG + + LK P+LK +++ S F
Sbjct: 86 VADEAALYGVPNIEGVELDPWS---DPLKGHFGALFDLKATYPDLKTLISIGGWSDSGGF 142
Query: 97 VSITSDRESRLNFISSVLEMFDMYKFDGLDL------NVKDPALNDEDDDDLESIANERS 150
+ +D SR NF S +E Y FDG+D+ + D D ++
Sbjct: 143 SDMAADDASRENFAKSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKD--------KA 194
Query: 151 DFSTFIQELSSTLRR------NNYQLTLTSPGVIDRKTSLVDISVVAPLVDLILLKSFN- 203
++ +QEL L + +YQLT+ +P D K ++ + +A VD I + +++
Sbjct: 195 NYVLLLQELRKKLDKAGVEDGRHYQLTIAAPASKD-KLEGLNHAEIAQYVDYINIMTYDF 253
Query: 204 ----NDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAV------YNWIKKGAR----PE 249
N+ + + + T N V +W+++G P
Sbjct: 254 HGAWNET-------LGHHA--ALYGTPKDPPLANGGFYVDAEVDGIDWLEEGFAGDVPPS 304
Query: 250 QIIIGIPFFGKSYRLFNRSEY--------GLGATVKGPGTEGKYTQMPGYLAFFEV-CNK 300
++++G+PF+G+ + + GL + GT Y +++ N
Sbjct: 305 KLVLGMPFYGRGWNGVDGGSLGTCPGLYQGLDNSGIPKGTWEAGNGDKDYGKAYDLDANN 364
Query: 301 FKDKTWRHFTDSNGE-PFMV--KKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLD 356
+ + D + P++ +K +I+Y++ S++ K Y+ D +LGG M W + D
Sbjct: 365 AGKNGYERYWDDVAKAPYLYNPEKGVFISYDDPRSVKAKAEYVADNNLGGMMFWEISGD 423
Score = 73.6 bits (181), Expect = 5e-13
Identities = 44/148 (29%), Positives = 67/148 (45%), Gaps = 26/148 (17%)
Query: 2188 EPYKVICYYTSWA-YLRQAEGKYSPEDIDGSLCTHLVYAF------------------AD 2228
+ +KV+ YYTSW+ Y RQ Y P DI TH+ YAF A
Sbjct: 36 DQFKVVGYYTSWSQYDRQD---YFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAAL 92
Query: 2229 LDEQELVIRSHNNKLDLEQGFYEKITKFNER--GIQVILAIGGWSDSRSDKYSRLVNDIP 2286
+ + D +G + + ++ +++IGGWSDS +S + D
Sbjct: 93 YGVPNIEGVELDPWSDPLKGHFGALFDLKATYPDLKTLISIGGWSDS--GGFSDMAADDA 150
Query: 2287 ARRNFVRHVTEFLEFYGFNGLEFAWEYP 2314
+R NF + EF+ YGF+G++ WEYP
Sbjct: 151 SRENFAKSAVEFMRTYGFDGVDIDWEYP 178
Score = 44.0 bits (104), Expect = 9e-04
Identities = 19/55 (34%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 2328 WTVVRDSLGARGPYAYN--GKLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLD 2380
+ D + A+ PY YN +++SYDD R ++ K + + + LGG M W I D
Sbjct: 370 YERYWDDV-AKAPYLYNPEKGVFISYDDPRSVKAKAEYVADNNLGGMMFWEISGD 423
>gnl|CDD|119349 cd00598, GH18_chitinase-like, The GH18 (glycosyl hydrolase, family
18) type II chitinases hydrolyze chitin, an abundant
polymer of beta-1,4-linked N-acetylglucosamine (GlcNAc)
which is a major component of the cell wall of fungi and
the exoskeleton of arthropods. Chitinases have been
identified in viruses, bacteria, fungi, protozoan
parasites, insects, and plants. The structure of the GH18
domain is an eight-stranded beta/alpha barrel with a
pronounced active-site cleft at the C-terminal end of the
beta-barrel. The GH18 family includes chitotriosidase,
chitobiase, hevamine, zymocin-alpha, narbonin, SI-CLP
(stabilin-1 interacting chitinase-like protein), IDGF
(imaginal disc growth factor), CFLE (cortical
fragment-lytic enzyme) spore hydrolase, the type III and
type V plant chitinases, the
endo-beta-N-acetylglucosaminidases, and the chitolectins.
The GH85 (glycosyl hydrolase, family 85) ENGases
(endo-beta-N-acetylglucosaminidases) are closely related
to the GH18 chitinases and are included in this alignment
model.
Length = 210
Score = 164 bits (417), Expect = 8e-46
Identities = 66/192 (34%), Positives = 106/192 (55%), Gaps = 15/192 (7%)
Query: 1713 IVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFYE 1772
++CY+ W+ R P+DI LCTH+IY FA + +D + D + E
Sbjct: 1 VICYYDGWSSGR----GPDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDKSEEPLKGALE 56
Query: 1773 KVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLDW 1832
++ + KK G+KV ++IGGW DS L + +R+ F +V+F+ + FDG+D+DW
Sbjct: 57 ELAS-KKPGLKVLISIGGWTDS---SPFTLASDPASRAAFANSLVSFLKTYGFDGVDIDW 112
Query: 1833 EYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDNAYDIPVMS 1892
EYP SD++ F L++ELR+A + LL+ AV S + AYD+P +
Sbjct: 113 EYPG-------AADNSDRENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYDVPAIG 165
Query: 1893 ENLDWISVMTYD 1904
+ +D+++VMTYD
Sbjct: 166 DYVDFVNVMTYD 177
Score = 144 bits (364), Expect = 1e-38
Identities = 64/197 (32%), Positives = 92/197 (46%), Gaps = 21/197 (10%)
Query: 1342 IVCYFTNWAWYRPGKGKYV-LKKVPWINILKLYVTLQTGLGTDQEKENTFLKISELTYVH 1400
++CY+ W+ R + L I ++ L N F SE
Sbjct: 1 VICYYDGWSSGRGPDPTDIPLSLCTHIIYAFAEISSDGSL-------NLFGDKSEEPLKG 53
Query: 1401 ILFYERVVTLKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDG 1460
L + KK G+KV ++IGGW DS L + +R F +V FL Y FDG
Sbjct: 54 AL---EELASKKPGLKVLISIGGWTDS---SPFTLASDPASRAAFANSLVSFLKTYGFDG 107
Query: 1461 LDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVINAAYD 1520
+D+DWEYP A +SD+E+F +REL A LL+ AV S + AYD
Sbjct: 108 VDIDWEYP-------GAADNSDRENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYD 160
Query: 1521 VKALSESLDWISVMTYD 1537
V A+ + +D+++VMTYD
Sbjct: 161 VPAIGDYVDFVNVMTYD 177
Score = 139 bits (352), Expect = 5e-37
Identities = 62/195 (31%), Positives = 101/195 (51%), Gaps = 18/195 (9%)
Query: 385 VVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFYQ 444
V+CY WS S P +I LCTHI+YAFA + + G+ + +
Sbjct: 1 VICYYDGWS----SGRGPDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDKSEEPLKGALE 56
Query: 445 RISSSPLVTHGKVKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLH 504
++S +K+LI+IGGWTDSS L S ++R F S+++FL+ + F G+
Sbjct: 57 ELASK----KPGLKVLISIGGWTDSS---PFTLASDPASRAAFANSLVSFLKTYGFDGVD 109
Query: 505 FDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDKHDYSIAVGISGYKEILEVAYDFP 564
DW YP +D+ NF+ L++EL++ +Y + + + L AYD P
Sbjct: 110 IDWEYPG-------AADNSDRENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYDVP 162
Query: 565 ALNDHVEFMSLMSYD 579
A+ D+V+F+++M+YD
Sbjct: 163 AIGDYVDFVNVMTYD 177
Score = 101 bits (254), Expect = 7e-24
Identities = 62/224 (27%), Positives = 94/224 (41%), Gaps = 49/224 (21%)
Query: 2192 VICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFYE 2251
VICYY W+ + P DI SLCTH++YAFA++ + + E+
Sbjct: 1 VICYYDGWS----SGRGPDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDK---SEEPLKG 53
Query: 2252 KITKFNER--GIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEF 2309
+ + + G++V+++IGGW+DS L +D +R F + FL+ YGF+G++
Sbjct: 54 ALEELASKKPGLKVLISIGGWTDSS---PFTLASDPASRAAFANSLVSFLKTYGFDGVDI 110
Query: 2310 AWEYPKCWQTCSTVKSHEWTV-----VRDSLGARG------PYAYNGKLWLSYDDTR--- 2355
WEYP E + +R +LGA A L +YD
Sbjct: 111 DWEYPGAADNSDR----ENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYDVPAIGD 166
Query: 2356 -------------------DIRRKCKSIKQDGLGGAMVWSIDLD 2380
+ K K KQ GLGG M+W +D D
Sbjct: 167 YVDFVNVMTYDLVLGVPFYSLGAKAKYAKQKGLGGVMIWELDQD 210
Score = 97.8 bits (244), Expect = 2e-22
Identities = 49/193 (25%), Positives = 87/193 (45%), Gaps = 20/193 (10%)
Query: 15 VVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAIDPVSRALIPEDLEYDVIKGGYK 74
V+CY + S+ R + ++ CTH+IYA+A I + D + +KG +
Sbjct: 1 VICYYDGWSSGRG----PDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDKSEEPLKGALE 56
Query: 75 SFLGLKEANPELKVYLAV----KSNFVSITSDRESRLNFISSVLEMFDMYKFDGLDLNVK 130
P LKV +++ S+ ++ SD SR F +S++ Y FDG+D++ +
Sbjct: 57 EL---ASKKPGLKVLISIGGWTDSSPFTLASDPASRAAFANSLVSFLKTYGFDGVDIDWE 113
Query: 131 DPALNDEDDDDLESIANERSDFSTFIQELSSTLRRNNYQLTLTSPGVIDRKTSLVDISVV 190
P D ++R +F T ++EL S L NY LT+ P D+ +
Sbjct: 114 YPGAAD---------NSDRENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYDVPAI 164
Query: 191 APLVDLILLKSFN 203
VD + + +++
Sbjct: 165 GDYVDFVNVMTYD 177
Score = 40.8 bits (96), Expect = 0.003
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 332 SIRRKMNYIKDRHLGGAMLWTLDLD 356
S+ K Y K + LGG M+W LD D
Sbjct: 186 SLGAKAKYAKQKGLGGVMIWELDQD 210
Score = 38.5 bits (90), Expect = 0.024
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 764 TSLTEKVKFIKSNGYAGIMAWTIDLD 789
SL K K+ K G G+M W +D D
Sbjct: 185 YSLGAKAKYAKQKGLGGVMIWELDQD 210
Score = 32.7 bits (75), Expect = 1.8
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 2036 KAEFVKYNDLGGAMIWALDLD 2056
KA++ K LGG MIW LD D
Sbjct: 190 KAKYAKQKGLGGVMIWELDQD 210
Score = 32.3 bits (74), Expect = 2.2
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 871 VKFIKSNGYAGIMAWTIDLD 890
K+ K G G+M W +D D
Sbjct: 191 AKYAKQKGLGGVMIWELDQD 210
>gnl|CDD|119358 cd02879, GH18_plant_chitinase_class_V, The class V plant chitinases
have a glycosyl hydrolase family 18 (GH18) domain, but
lack the chitin-binding domain present in other GH18
enzymes. The GH18 domain of the class V chitinases has
endochitinase activity in some cases and no catalytic
activity in others. Included in this family is a lectin
found in black locust (Robinia pseudoacacia) bark, which
binds chitin but lacks chitinase activity. Also included
is a chitinase-related receptor-like kinase (CHRK1) from
tobacco (Nicotiana tabacum), with an N-terminal GH18
domain and a C-terminal kinase domain, which is thought
to be part of a plant signaling pathway. The GH18 domain
of CHRK1 is expressed extracellularly where it binds
chitin but lacks chitinase activity.
Length = 299
Score = 156 bits (397), Expect = 6e-42
Identities = 99/349 (28%), Positives = 149/349 (42%), Gaps = 76/349 (21%)
Query: 1722 WYRQSGGKYLPSDIDSDLCTHVIYGFAVLD--TDQLVIKPHDTWADLDNKFYEKVTALKK 1779
W ++ PS+IDS L TH+ Y FA LD T ++VI ++ F E V K
Sbjct: 9 WP-AWSEEFPPSNIDSSLFTHLFYAFADLDPSTYEVVI-SPSDESEFST-FTETVKR-KN 64
Query: 1780 KGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLDWEYPKCWQ 1839
VK L+IGG + ++ + + AR FI + ++ FDGLDLDWE+P
Sbjct: 65 PSVKTLLSIGGGGSDSSA-FAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFPS--- 120
Query: 1840 VDCKQGPASDKQGFADLIKELRAAFNPHD-------LLLSAAVSPSKAVIDNA----YDI 1888
+ + F L++E RAA LLL+AAV S + + Y I
Sbjct: 121 ------SQVEMENFGKLLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSDDSVSYPI 174
Query: 1889 PVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHGADRKK 1948
+++NLDW++VM YDY+G W+ T A A D + +Y + W+ G KK
Sbjct: 175 EAINKNLDWVNVMAYDYYGSWESNTTGPA---AALYDPNSNVSTDYGIKSWIKAGVPAKK 231
Query: 1949 VIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKIQKDGWVVVR 2008
++ G+P+YG+++TL D T +
Sbjct: 232 LVLGLPLYGRAWTLYDT--------TTVSS------------------------------ 253
Query: 2009 DRKRRIGPYAFKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDD 2057
Y + G W+G+DD I K ++ K L G WA+ DD
Sbjct: 254 --------YVYAGTTWIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294
Score = 122 bits (308), Expect = 4e-30
Identities = 64/192 (33%), Positives = 100/192 (52%), Gaps = 26/192 (13%)
Query: 1406 RVVTLKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDW 1465
V K VK L+IGG ++ + + TAR+ FI +K KY FDGLDLDW
Sbjct: 58 ETVKRKNPSVKTLLSIGGGGSD-SSAFAAMASDPTARKAFINSSIKVARKYGFDGLDLDW 116
Query: 1466 EYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPH-------GLLLSAAVSPSKQVI--- 1515
E+P+ + E+FG + E A K LLL+AAV S +
Sbjct: 117 EFPS---------SQVEMENFGKLLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSD 167
Query: 1516 -NAAYDVKALSESLDWISVMTYDYHGQWDKK-TGHVAPLYEHPDDDFFYFNANFTMNYWM 1573
+ +Y ++A++++LDW++VM YDY+G W+ TG A LY+ + + ++ + W+
Sbjct: 168 DSVSYPIEAINKNLDWVNVMAYDYYGSWESNTTGPAAALYDPNSN----VSTDYGIKSWI 223
Query: 1574 KKGAPSRKLVMG 1585
K G P++KLV+G
Sbjct: 224 KAGVPAKKLVLG 235
Score = 108 bits (273), Expect = 1e-25
Identities = 73/253 (28%), Positives = 122/253 (48%), Gaps = 32/253 (12%)
Query: 398 SDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFYQRI-SSSPLVTHGK 456
+F P NID L TH+ YAFA LDP+T + + F + + +P V
Sbjct: 12 WSEEFPPSNIDSSLFTHLFYAFADLDPSTYEVVISPSDESEFSTFTETVKRKNPSV---- 67
Query: 457 VKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQAD 516
K L++IGG S ++ + S + RK FI S + R++ F GL DW +P
Sbjct: 68 -KTLLSIGGGGSDS-SAFAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFP------ 119
Query: 517 CSKQHKADKGNFVKLIQELKAEFDKHDYS---------IAVGISGYKEILEVAYDFP--A 565
+ + NF KL++E +A S AV S + + + +P A
Sbjct: 120 ---SSQVEMENFGKLLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSDDSVSYPIEA 176
Query: 566 LNDHVEFMSLMSYDYHGAWEGI-TGLVSPLNSRPGEPYPNYNINTALKLIDELGGDKRKI 624
+N +++++++M+YDY+G+WE TG + L +Y I + +K G +K+
Sbjct: 177 INKNLDWVNVMAYDYYGSWESNTTGPAAALYDPNSNVSTDYGIKSWIK----AGVPAKKL 232
Query: 625 VIGVPFYGQSYTL 637
V+G+P YG+++TL
Sbjct: 233 VLGLPLYGRAWTL 245
Score = 72.8 bits (179), Expect = 2e-13
Identities = 69/334 (20%), Positives = 132/334 (39%), Gaps = 85/334 (25%)
Query: 43 THVIYAYAAIDPVSRAL-IPEDLEYDVIKGGYKSFLG-LKEANPELKVYLAV------KS 94
TH+ YA+A +DP + + I E + + +F +K NP +K L++ S
Sbjct: 27 THLFYAFADLDPSTYEVVISPSDESE-----FSTFTETVKRKNPSVKTLLSIGGGGSDSS 81
Query: 95 NFVSITSDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFST 154
F ++ SD +R FI+S +++ Y FDGLDL+ + P ++E +F
Sbjct: 82 AFAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFP----SSQVEME-------NFGK 130
Query: 155 FIQELSSTLRR-----NNYQLTLTSPGVIDRKTSLVDISVVAPL------VDLILLKSFN 203
++E + ++ L LT+ L D SV P+ +D + + +++
Sbjct: 131 LLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSDDSVSYPIEAINKNLDWVNVMAYD 190
Query: 204 NDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYR 263
+ E P + ++ ++ + +WIK G +++++G+P +G+++
Sbjct: 191 Y-YGSWESNTTGPAAALYDPNSNVSTDY-----GIKSWIKAGVPAKKLVLGLPLYGRAWT 244
Query: 264 LFNRSEYGLGATVKGPGTEGKYTQMPGYLAFFEVCNKFKDKTWRHFTDSNGEPFMVKKDE 323
L++ T Y + T
Sbjct: 245 LYD------------TTTVSSYV--------------YAGTT------------------ 260
Query: 324 WITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDD 357
WI Y++ SI K+ Y K + L G W + DD
Sbjct: 261 WIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294
Score = 59.7 bits (145), Expect = 5e-09
Identities = 39/118 (33%), Positives = 60/118 (50%), Gaps = 6/118 (5%)
Query: 2200 AYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQ--ELVIRSHNNKLDLEQGFYEKITKFN 2257
Y ++ P +ID SL THL YAFADLD E+VI + E + + K
Sbjct: 7 GYWPAWSEEFPPSNIDSSLFTHLFYAFADLDPSTYEVVISPSDES---EFSTFTETVKRK 63
Query: 2258 ERGIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEFAWEYPK 2315
++ +L+IGG S S ++ + +D AR+ F+ + YGF+GL+ WE+P
Sbjct: 64 NPSVKTLLSIGG-GGSDSSAFAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFPS 120
Score = 43.1 bits (102), Expect = 0.001
Identities = 19/55 (34%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 736 RKWTVQRNTKSLEPFTFQGDQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLDD 790
R WT+ +T ++ + + G W+ Y+D S+ KVK+ K G G AW + DD
Sbjct: 241 RAWTLY-DTTTVSSYVYAGTTWIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294
Score = 42.0 bits (99), Expect = 0.003
Identities = 18/41 (43%), Positives = 22/41 (53%)
Query: 2341 YAYNGKLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLDD 2381
Y Y G W+ YDD + I K K KQ GL G W++ DD
Sbjct: 254 YVYAGTTWIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294
>gnl|CDD|119357 cd02878, GH18_zymocin_alpha, Zymocin, alpha subunit. Zymocin is a
heterotrimeric enzyme that inhibits yeast cell cycle
progression. The zymocin alpha subunit has a chitinase
activity that is essential for holoenzyme action from the
cell exterior while the gamma subunit contains the
intracellular toxin responsible for G1 phase cell cycle
arrest. The zymocin alpha and beta subunits are thought
to act from the cell's exterior by docking to the cell
wall-associated chitin, thus mediating gamma-toxin
translocation. The alpha subunit has an eight-stranded
TIM barrel fold similar to that of family 18 glycosyl
hydrolases such as hevamine, chitolectin, and chitobiase.
Length = 345
Score = 128 bits (325), Expect = 6e-32
Identities = 98/361 (27%), Positives = 145/361 (40%), Gaps = 32/361 (8%)
Query: 1712 KIVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVL--DTDQLVIKPHDTWADLDNK 1769
K + YF + R + ID+ TH+ + FA + D V + + +
Sbjct: 1 KNIAYFEAYNLDR-PCLNMDVTQIDTSKYTHIHFAFANITSDFSVDVSSVQEQF----SD 55
Query: 1770 FYEKVTALKKKGVKVTLAIGGWNDSAG-NKYSRLVN--SQQARSKFIAHVVNFILEHNFD 1826
F K KGVK L+ GGW+ S + Y + R F +VVNF+ ++N D
Sbjct: 56 F------KKLKGVKKILSFGGWDFSTSPSTYQIFRDAVKPANRDTFANNVVNFVNKYNLD 109
Query: 1827 GLDLDWEYPKCWQV-DCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDNA 1885
G+D DWEYP + G D + + + +K L++ P LS A P+
Sbjct: 110 GVDFDWEYPGAPDIPGIPAGDPDDGKNYLEFLKLLKSKL-PSGKSLSIAA-PASYWYLKG 167
Query: 1886 YDIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPN------DTTPTFNANYSLHYW 1939
+ I M++ +D+I MTYD HGQWD +P N + T T +A L
Sbjct: 168 FPIKDMAKYVDYIVYMTYDLHGQWDYGNKWASPGCPAGNCLRSHVNKTETLDA---LSMI 224
Query: 1940 VSHGADRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKI 1999
G KV+ G+ YG+SF +AD G G E R Y I +
Sbjct: 225 TKAGVPSNKVVVGVASYGRSFKMADPGCTGPGCTFTGPGSGAEAGRCTCTAGYGAISEIE 284
Query: 2000 QKDGWVVVRDRKRRIGPYA----FKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDL 2055
D R + + DQWV + A + E+ K + GG WA+DL
Sbjct: 285 IIDISKSKNKRWYDTDSDSDILVYDDDQWVAYMSPATKAARIEWYKGLNFGGTSDWAVDL 344
Query: 2056 D 2056
Sbjct: 345 Q 345
Score = 86.6 bits (215), Expect = 1e-17
Identities = 71/287 (24%), Positives = 114/287 (39%), Gaps = 46/287 (16%)
Query: 404 PENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFYQRISSSPLVTHGKVKILIAI 463
ID THI +AFA++ + +D + Q S VK +++
Sbjct: 20 VTQIDTSKYTHIHFAFANI----------TSDFSVDVSSVQ-EQFSDFKKLKGVKKILSF 68
Query: 464 GGWTDSSGEKYSQLISSG---SNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQA----D 516
GGW S+ Q+ +NR F +V+ F+ +++ G+ FDW YP A
Sbjct: 69 GGWDFSTSPSTYQIFRDAVKPANRDTFANNVVNFVNKYNLDGVDFDWEYP---GAPDIPG 125
Query: 517 CSKQHKADKGNFVKLIQELKAEF-DKHDYSIAVGISGYKEILEVAY--DFP--ALNDHVE 571
D N+++ ++ LK++ SIA S + Y FP + +V+
Sbjct: 126 IPAGDPDDGKNYLEFLKLLKSKLPSGKSLSIAAPASYW-------YLKGFPIKDMAKYVD 178
Query: 572 FMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNY--------NINTALKLIDELGGDKRK 623
++ M+YD HG W+ + PG P N AL +I + G K
Sbjct: 179 YIVYMTYDLHGQWD-----YGNKWASPGCPAGNCLRSHVNKTETLDALSMITKAGVPSNK 233
Query: 624 IVIGVPFYGQSYTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEI 670
+V+GV YG+S+ + G G GPG E + Y I
Sbjct: 234 VVVGVASYGRSFKMADPGCTGPGCTFTGPGSGAEAGRCTCTAGYGAI 280
Score = 82.4 bits (204), Expect = 3e-16
Identities = 64/195 (32%), Positives = 87/195 (44%), Gaps = 31/195 (15%)
Query: 1410 LKKKGVKVSLAIGGWNDSLG-GKYSRL---VNSATARQRFIEHVVKFLLKYQFDGLDLDW 1465
K KGVK L+ GGW+ S Y V A R F +VV F+ KY DG+D DW
Sbjct: 57 KKLKGVKKILSFGGWDFSTSPSTYQIFRDAVKPAN-RDTFANNVVNFVNKYNLDGVDFDW 115
Query: 1466 EYPTCWQVNCDAGPD---------SDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVIN 1516
EYP PD D +++ F++ L P G LS A P+
Sbjct: 116 EYP--------GAPDIPGIPAGDPDDGKNYLEFLKLLKSKL-PSGKSLSIAA-PASYWYL 165
Query: 1517 AAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANFTMNYW---M 1573
+ +K +++ +D+I MTYD HGQWD +P P + + N T M
Sbjct: 166 KGFPIKDMAKYVDYIVYMTYDLHGQWDYGNKWASP--GCPAGNCLRSHVNKTETLDALSM 223
Query: 1574 --KKGAPSRKLVMGV 1586
K G PS K+V+GV
Sbjct: 224 ITKAGVPSNKVVVGV 238
Score = 50.8 bits (122), Expect = 5e-06
Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 14/128 (10%)
Query: 2191 KVICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQ-ELVIRSHNNKLDLEQGF 2249
K I Y+ ++ R ID S TH+ +AFA++ + + S +
Sbjct: 1 KNIAYFEAYNLDR-PCLNMDVTQIDTSKYTHIHFAFANITSDFSVDVSSVQEQFSD---- 55
Query: 2250 YEKITKFNERGIQVILAIGGWSDSRS-DKYSRLVNDI-PARRN-FVRHVTEFLEFYGFNG 2306
++K+ G++ IL+ GGW S S Y + + PA R+ F +V F+ Y +G
Sbjct: 56 FKKLK-----GVKKILSFGGWDFSTSPSTYQIFRDAVKPANRDTFANNVVNFVNKYNLDG 110
Query: 2307 LEFAWEYP 2314
++F WEYP
Sbjct: 111 VDFDWEYP 118
Score = 48.1 bits (115), Expect = 4e-05
Identities = 30/123 (24%), Positives = 57/123 (46%), Gaps = 12/123 (9%)
Query: 243 KKGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGPG---TEGKYTQMPGYLAFFEVCN 299
K G ++++G+ +G+S+++ + G G T GPG G+ T GY A E+
Sbjct: 226 KAGVPSNKVVVGVASYGRSFKMADPGCTGPGCTFTGPGSGAEAGRCTCTAGYGAISEIEI 285
Query: 300 KFKDKT-----WRHFTDSNGEPFMV-KKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTL 353
K+ + +DS+ +V D+W+ Y + + ++ + K + GG W +
Sbjct: 286 IDISKSKNKRWYDTDSDSD---ILVYDDDQWVAYMSPATKAARIEWYKGLNFGGTSDWAV 342
Query: 354 DLD 356
DL
Sbjct: 343 DLQ 345
Score = 35.7 bits (83), Expect = 0.26
Identities = 11/39 (28%), Positives = 20/39 (51%)
Query: 751 TFQGDQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLD 789
+ DQWV+Y P + ++++ K + G W +DL
Sbjct: 307 VYDDDQWVAYMSPATKAARIEWYKGLNFGGTSDWAVDLQ 345
>gnl|CDD|119353 cd02874, GH18_CFLE_spore_hydrolase, Cortical fragment-lytic enzyme
(CFLE) is a peptidoglycan hydrolase involved in
bacterial endospore germination. CFLE is expressed as an
inactive preprotein (called SleB) in the forespore
compartment of sporulating cells. SleB translocates
across the forespore inner membrane and is deposited as a
mature enzyme in the cortex layer of the spore. As part
of a sensory mechanism capable of initiating germination,
CFLE degrades a spore-specific peptidoglycan constituent
called muramic-acid delta-lactam that comprises the outer
cortex. CFLE has a C-terminal glycosyl hydrolase family
18 (GH18) catalytic domain as well as two N-terminal LysM
peptidoglycan-binding domains. In addition to SleB, this
family includes YaaH, YdhD, and YvbX from Bacillus
subtilis.
Length = 313
Score = 98.5 bits (246), Expect = 7e-22
Identities = 57/222 (25%), Positives = 98/222 (44%), Gaps = 63/222 (28%)
Query: 1396 LTYVHILFY-------------ERVVTL-KKKGVKVSLAI-----GGWNDSLGGKYSRLV 1436
LTY+ +Y ER++ K++GVK L I G ++ L ++
Sbjct: 26 LTYIAPFWYGVDADGTLTGLPDERLIEAAKRRGVKPLLVITNLTNGNFDSELA---HAVL 82
Query: 1437 NSATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQA 1496
++ ARQR I +++ KY +DG+++D+E V D+E++ F+REL
Sbjct: 83 SNPEARQRLINNILALAKKYGYDGVNIDFEN-----VPP-----EDREAYTQFLRELSDR 132
Query: 1497 FKPHGLLLSAAVSPSK-----QVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAP 1551
P G LS AV P + AYD A+ + +D++ +MTYD+H + G VAP
Sbjct: 133 LHPAGYTLSTAVVPKTSADQFGNWSGAYDYAAIGKIVDFVVLMTYDWHWRG-GPPGPVAP 191
Query: 1552 LYEHPDDDFFYFNANFTMNYWMKK-------GAPSRKLVMGV 1586
+ W+++ P K+++G+
Sbjct: 192 I------------------GWVERVLQYAVTQIPREKILLGI 215
Score = 98.5 bits (246), Expect = 7e-22
Identities = 71/307 (23%), Positives = 125/307 (40%), Gaps = 74/307 (24%)
Query: 1774 VTALKKKGVKVTLAIGGWNDS--AGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLD 1831
+ A K++GVK L I + ++++ +AR + I +++ ++ +DG+++D
Sbjct: 51 IEAAKRRGVKPLLVITNLTNGNFDSELAHAVLSNPEARQRLINNILALAKKYGYDGVNID 110
Query: 1832 WEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDN-----AY 1886
+E P D++ + ++EL +P LS AV P + AY
Sbjct: 111 FENV----------PPEDREAYTQFLRELSDRLHPAGYTLSTAVVPKTSADQFGNWSGAY 160
Query: 1887 DIPVMSENLDWISVMTYDYHGQWDKKTGHVAPM--------YAL---PNDTTPTFNANYS 1935
D + + +D++ +MTYD+H + G VAP+ YA+ P
Sbjct: 161 DYAAIGKIVDFVVLMTYDWHWRG-GPPGPVAPIGWVERVLQYAVTQIP------------ 207
Query: 1936 LHYWVSHGADRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEI 1995
R+K++ G+P+YG +TL K G + T +A A+ + A
Sbjct: 208 ----------REKILLGIPLYGYDWTL--PYKKGGKASTISPQQAIN--LAKRYGA---- 249
Query: 1996 CDKIQKDGWVVVRDRKRRIGPYAFKGDQ-----WVGFDDQAMIHHKAEFVKYNDLGGAMI 2050
+IQ D + P+ D+ V F+D + K E K L G
Sbjct: 250 --EIQYD--------EEAQSPFFRYVDEQGRRHEVWFEDARSLQAKFELAKEYGLRGVSY 299
Query: 2051 WALDLDD 2057
W L L+D
Sbjct: 300 WRLGLED 306
Score = 77.7 bits (192), Expect = 7e-15
Identities = 60/271 (22%), Positives = 109/271 (40%), Gaps = 54/271 (19%)
Query: 456 KVKILIAIGGWTDS--SGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFDWN--YPV 511
VK L+ I T+ E ++S+ R++ I ++L +++ + G++ D+ P
Sbjct: 58 GVKPLLVITNLTNGNFDSELAHAVLSNPEARQRLINNILALAKKYGYDGVNIDFENVPP- 116
Query: 512 CWQADCSKQHKADKGNFVKLIQELKAEFDKHDYSIAVGI-----SGYKEILEVAYDFPAL 566
D+ + + ++EL Y+++ + + AYD+ A+
Sbjct: 117 -----------EDREAYTQFLRELSDRLHPAGYTLSTAVVPKTSADQFGNWSGAYDYAAI 165
Query: 567 NDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNYNINTALKLIDELGGDKRKIVI 626
V+F+ LM+YD+H G G V+P+ + + A+ I + KI++
Sbjct: 166 GKIVDFVVLMTYDWHWRG-GPPGPVAPIG------WVERVLQYAVTQI-----PREKILL 213
Query: 627 GVPFYGQSYTLKLDKEHGLGAETDGPG----LAGEY--------TQQPGMLAYYEICYRI 674
G+P YG +TL K G A T P LA Y Q Y +
Sbjct: 214 GIPLYGYDWTLPYKK--GGKASTISPQQAINLAKRYGAEIQYDEEAQSPFFRYVD---EQ 268
Query: 675 KTRKWTVQRNTKSLEPRSMGTLKLDKEHGLG 705
R + +SL+ + +L KE+GL
Sbjct: 269 GRRHEVWFEDARSLQAK----FELAKEYGLR 295
Score = 57.3 bits (139), Expect = 4e-08
Identities = 54/276 (19%), Positives = 103/276 (37%), Gaps = 65/276 (23%)
Query: 98 SITSDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANE-RSDFSTFI 156
++ S+ E+R I+++L + Y +DG+++ D E++ E R ++ F+
Sbjct: 80 AVLSNPEARQRLINNILALAKKYGYDGVNI-------------DFENVPPEDREAYTQFL 126
Query: 157 QELSSTLRRNNYQLTLTS-PGVIDRKTSLV----DISVVAPLVDLILLKSFNNDHMDDEV 211
+ELS L Y L+ P + D + + +VD ++L +++
Sbjct: 127 RELSDRLHPAGYTLSTAVVPKTSADQFGNWSGAYDYAAIGKIVDFVVLMTYDWHWRGGPP 186
Query: 212 VPVKPNTKVNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYG 271
PV P V ++ AV I + E+I++GIP +G + Y
Sbjct: 187 GPVAPIGWVE----------RVLQYAV-TQIPR----EKILLGIPLYG---YDWT-LPYK 227
Query: 272 LGATVKGPGTEG------KYTQMPGYLAFFEVCNKFKDKTWRH----FTDSNGEPFMVKK 321
G + +Y Y D+ + + D G +
Sbjct: 228 KGGKASTISPQQAINLAKRYGAEIQY-----------DEEAQSPFFRYVDEQG------R 270
Query: 322 DEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDD 357
+ +E+ S++ K K+ L G W L L+D
Sbjct: 271 RHEVWFEDARSLQAKFELAKEYGLRGVSYWRLGLED 306
>gnl|CDD|119362 cd06545, GH18_3CO4_chitinase, The Bacteroides thetaiotaomicron
protein represented by pdb structure 3CO4 is an
uncharacterized bacterial member of the family 18
glycosyl hydrolases with homologs found in
Flavobacterium, Stigmatella, and Pseudomonas.
Length = 253
Score = 88.7 bits (220), Expect = 5e-19
Identities = 67/273 (24%), Positives = 109/273 (39%), Gaps = 48/273 (17%)
Query: 1713 IVCYFTNWAW-YRQSGGKYLPSDIDSDLCTHVIYGFAVLDTD-QLVIKPHDTWADLDNKF 1770
+V Y N+ S ID TH+ FA D + L P ++L+
Sbjct: 1 VVGYLPNYDDLNALSPT------IDFSKLTHINLAFANPDANGTLNANP--VRSELN--- 49
Query: 1771 YEKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDL 1830
V A VK+ +++ G + +++ +N R + ++N+++ +N DG+D+
Sbjct: 50 -SVVNAAHAHNVKILISLAG---GSPPEFTAALNDPAKRKALVDKIINYVVSYNLDGIDV 105
Query: 1831 DWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDNAYDIPV 1890
D E P D + I+ L AA LL+AAVS N +
Sbjct: 106 DLEGPDVTFGD-----------YLVFIRALYAALKKEGKLLTAAVSSW-----NGGAVSD 149
Query: 1891 MSENL-DWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHG-ADRKK 1948
+ D+I++M+YD G W P + +A L+YW G A + K
Sbjct: 150 STLAYFDFINIMSYDATGPWWGDN---------PGQHSSYDDAVNDLNYWNERGLASKDK 200
Query: 1949 VIFGMPMYGQSFTLAD----KNKNGLNSQTYGG 1977
++ G+P YG F +NK Q YGG
Sbjct: 201 LVLGLPFYGYGFYYNGIPTIRNKVAFAKQNYGG 233
Score = 85.2 bits (211), Expect = 9e-18
Identities = 63/252 (25%), Positives = 100/252 (39%), Gaps = 54/252 (21%)
Query: 1342 IVCYFTNWAWYRPGKGKYVLKKVPWINILKLYV----TLQTGLGTDQEKENTFLKISELT 1397
+V Y N+ K+ IN+ TL SEL
Sbjct: 1 VVGYLPNYDDLNALSPTIDFSKLTHINLAFANPDANGTLNANPVR-----------SELN 49
Query: 1398 YVHILFYERVVTLKKKGVKV--SLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLK 1455
V V VK+ SLA G +++ +N R+ ++ ++ +++
Sbjct: 50 SV-------VNAAHAHNVKILISLAGGS-----PPEFTAALNDPAKRKALVDKIINYVVS 97
Query: 1456 YQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVI 1515
Y DG+D+D E GPD + +F+R L+ A K G LL+AAVS
Sbjct: 98 YNLDGIDVDLE-----------GPDVTFGDYLVFIRALYAALKKEGKLLTAAVSSWNG-- 144
Query: 1516 NAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANFTMNYWMKK 1575
A L+ D+I++M+YD G W ++P Y +A +NYW ++
Sbjct: 145 -GAVSDSTLA-YFDFINIMSYDATGPWWG---------DNPGQHSSYDDAVNDLNYWNER 193
Query: 1576 G-APSRKLVMGV 1586
G A KLV+G+
Sbjct: 194 GLASKDKLVLGL 205
Score = 65.2 bits (159), Expect = 5e-11
Identities = 60/258 (23%), Positives = 100/258 (38%), Gaps = 51/258 (19%)
Query: 385 VVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPN-TLSIQAGNPE--ADIDDN 441
VV Y+ N+ ID+ THI AFA+ D N TL+ E + ++
Sbjct: 1 VVGYLPNYDDLNALSPT-----IDFSKLTHINLAFANPDANGTLNANPVRSELNSVVNAA 55
Query: 442 FYQRISSSPLVTHGK-VKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDF 500
H VKILI++ G S +++ ++ + RK + ++ ++ ++
Sbjct: 56 ------------HAHNVKILISLAG---GSPPEFTAALNDPAKRKALVDKIINYVVSYNL 100
Query: 501 AGLHFDWNYPVCWQADCSKQHKADKGNFV-KLIQELKAEFDKHDYSIAVGISGYKEILEV 559
G+ D P D F+ L LK E +++ G +
Sbjct: 101 DGIDVDLEGPDVTFGD--------YLVFIRALYAALKKEGKLLTAAVSSWNGGAVSDSTL 152
Query: 560 AYDFPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNYNINTALKLIDELG- 618
AY +F+++MSYD G W G PG+ + L +E G
Sbjct: 153 AY--------FDFINIMSYDATGPWWG---------DNPGQHSSYDDAVNDLNYWNERGL 195
Query: 619 GDKRKIVIGVPFYGQSYT 636
K K+V+G+PFYG +
Sbjct: 196 ASKDKLVLGLPFYGYGFY 213
Score = 38.6 bits (90), Expect = 0.027
Identities = 30/158 (18%), Positives = 59/158 (37%), Gaps = 30/158 (18%)
Query: 106 RLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFSTFIQELSSTLRR 165
R + ++ Y DG+D+ DLE D+ FI+ L + L++
Sbjct: 84 RKALVDKIINYVVSYNLDGIDV-------------DLEGPDVTFGDYLVFIRALYAALKK 130
Query: 166 NNYQLTLTSPGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTKVNIQVT 225
L + V V S +A D I + S++ P N
Sbjct: 131 EG---KLLTAAVSSWNGGAVSDSTLA-YFDFINIMSYDA---------TGPWWGDNPGQH 177
Query: 226 STIANFNNIESAVYNWIKKGARP-EQIIIGIPFFGKSY 262
S+ + + + W ++G +++++G+PF+G +
Sbjct: 178 SSYDDA---VNDLNYWNERGLASKDKLVLGLPFYGYGF 212
Score = 30.9 bits (70), Expect = 6.7
Identities = 23/126 (18%), Positives = 51/126 (40%), Gaps = 13/126 (10%)
Query: 2192 VICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFYE 2251
V+ Y ++ L ID S TH+ AFA+ D + + +L
Sbjct: 1 VVGYLPNYDDLNALSPT-----IDFSKLTHINLAFANPDANGTLNANPVRS-ELN----S 50
Query: 2252 KITKFNERGIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEFAW 2311
+ + +++++++ G S +++ +ND R+ V + ++ Y +G++
Sbjct: 51 VVNAAHAHNVKILISLAGGS---PPEFTAALNDPAKRKALVDKIINYVVSYNLDGIDVDL 107
Query: 2312 EYPKCW 2317
E P
Sbjct: 108 EGPDVT 113
>gnl|CDD|119359 cd06542, GH18_EndoS-like, Endo-beta-N-acetylglucosaminidases are
bacterial chitinases that hydrolyze the chitin core of
various asparagine (N)-linked glycans and glycoproteins.
The endo-beta-N-acetylglucosaminidases have a glycosyl
hydrolase family 18 (GH18) catalytic domain. Some
members also have an additional C-terminal glycosyl
hydrolase family 20 (GH20) domain while others have an
N-terminal domain of unknown function (pfam08522).
Members of this family include
endo-beta-N-acetylglucosaminidase S (EndoS) from
Streptococcus pyogenes, EndoF1, EndoF2, EndoF3, and
EndoH from Flavobacterium meningosepticum, and EndoE
from Enterococcus faecalis. EndoS is a secreted
endoglycosidase from Streptococcus pyogenes that
specifically hydrolyzes the glycan on human IgG between
two core N-acetylglucosamine residues. EndoE is a
secreted endoglycosidase, encoded by the ndoE gene in
Enterococcus faecalis, that hydrolyzes the glycan on
human RNase B.
Length = 255
Score = 68.9 bits (169), Expect = 2e-12
Identities = 37/143 (25%), Positives = 57/143 (39%), Gaps = 13/143 (9%)
Query: 1772 EKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLD 1831
+ L+ KG KV L+I G + AG + S A + +V+ + ++ DG+D D
Sbjct: 55 TYIRPLQAKGTKVLLSILGNHLGAGFANN---LSDAAAKAYAKAIVDTVDKYGLDGVDFD 111
Query: 1832 WEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDNAYDIPVM 1891
EY + F LIKELR P D LL+ + D +
Sbjct: 112 DEY---SGYGKNGTSQPSNEAFVRLIKELRKYMGPTDKLLTIDGYGQAL----SNDGEEV 164
Query: 1892 SENLDWISVMTYDYHGQWDKKTG 1914
S +D++ Y Y+G T
Sbjct: 165 SPYVDYV---IYQYYGSSSSSTQ 184
Score = 55.1 bits (133), Expect = 1e-07
Identities = 32/144 (22%), Positives = 52/144 (36%), Gaps = 13/144 (9%)
Query: 1410 LKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYPT 1469
L+ KG KV L+I G + G + S A + + + +V + KY DG+D D EY
Sbjct: 60 LQAKGTKVLLSILGNHLGAGFANN---LSDAAAKAYAKAIVDTVDKYGLDGVDFDDEY-- 114
Query: 1470 CWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVINAAYDVKALSESLD 1529
+ E+F ++EL + P LL+ E
Sbjct: 115 -SGYGKNGTSQPSNEAFVRLIKELRKYMGPTDKLLTIDGYGQ-------ALSNDGEEVSP 166
Query: 1530 WISVMTYDYHGQWDKKTGHVAPLY 1553
++ + Y Y+G T
Sbjct: 167 YVDYVIYQYYGSSSSSTQRNWNTN 190
>gnl|CDD|226376 COG3858, COG3858, Predicted glycosyl hydrolase [General function
prediction only].
Length = 423
Score = 68.6 bits (168), Expect = 2e-11
Identities = 65/348 (18%), Positives = 128/348 (36%), Gaps = 64/348 (18%)
Query: 1741 THVIYGFAVLDTDQLVIK---PHDTWADLDNKFYEKV------------TALKKKGVKVT 1785
T +T +LV + P+ T+ DL + ++ + + +K
Sbjct: 105 TAYYAPRPPTETGELVRRHAGPYLTYVDLFSYHAQENGNLTETPNENVIEIAQCRKIKPV 164
Query: 1786 LAIGGWNDSAGNKY----SRLVNSQQARSKFIAHVVNFILEHNFDGLDLDWEYPKCWQVD 1841
I N L+N++ A+++ I +++ + + G+++D+E
Sbjct: 165 PGISNGTRPGANFGGELAQLLLNNETAKNRLINNIITLLDARGYRGVNIDFE-------- 216
Query: 1842 CKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDN-----AYDIPVMSENLD 1896
D++ + D ++++R A + +S AV+ + + AYD + + D
Sbjct: 217 --NVGPGDRELYTDFLRQVRDALHSGGYTVSIAVAAKTSDLQVGSWHGAYDYVALGKIAD 274
Query: 1897 WISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHGADRKKVIFGMPMY 1956
++ +MTYD+H G VA + + Y ++ KV+ G+P+Y
Sbjct: 275 FVILMTYDWHYSGGPP-GPVASI-------GWVRK---VIEYALTVIPAE-KVMMGIPLY 322
Query: 1957 GQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEIC---DKIQKDGWVVVRDRKRR 2013
G +TL + Y A A A Y D + + D++ R
Sbjct: 323 GYDWTLPY------DPLGY-LARAISPDEAIDIANRYNATIQYDATSQSPFFYYVDKEGR 375
Query: 2014 IGPYAFKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDFKNF 2061
V F+D K + +K L G W L +D +N+
Sbjct: 376 --------YHEVWFEDARSFQTKLDLIKEYGLRGVSYWVLGQEDPRNW 415
Score = 65.5 bits (160), Expect = 1e-10
Identities = 42/204 (20%), Positives = 75/204 (36%), Gaps = 36/204 (17%)
Query: 1374 VTLQTGLGTDQEKENTFLKISE--LTYVHILFYER--------------VVTLKKKGVKV 1417
T E + + LTYV + Y + + + +K
Sbjct: 104 TTAYYAPRPPTETGELVRRHAGPYLTYVDLFSYHAQENGNLTETPNENVIEIAQCRKIKP 163
Query: 1418 SLAIGGWNDSLGGKY----SRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCWQV 1473
I L+N+ TA+ R I +++ L + G+++D+E
Sbjct: 164 VPGISNGTRPGANFGGELAQLLLNNETAKNRLINNIITLLDARGYRGVNIDFENVG---- 219
Query: 1474 NCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVI-----NAAYDVKALSESL 1528
D+E + F+R++ A G +S AV+ + + AYD AL +
Sbjct: 220 ------PGDRELYTDFLRQVRDALHSGGYTVSIAVAAKTSDLQVGSWHGAYDYVALGKIA 273
Query: 1529 DWISVMTYDYHGQWDKKTGHVAPL 1552
D++ +MTYD+H G VA +
Sbjct: 274 DFVILMTYDWHYSGGPP-GPVASI 296
Score = 54.7 bits (132), Expect = 4e-07
Identities = 44/193 (22%), Positives = 80/193 (41%), Gaps = 28/193 (14%)
Query: 468 DSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGN 527
+ GE L+++ + + + I +++T L + G++ D+ D+
Sbjct: 176 NFGGELAQLLLNNETAKNRLINNIITLLDARGYRGVNIDFEN----------VGPGDREL 225
Query: 528 FVKLIQELKAEFDKHDY--SIAVGISGYKEILEV---AYDFPALNDHVEFMSLMSYDYHG 582
+ +++++ Y SIAV + AYD+ AL +F+ LM+YD+H
Sbjct: 226 YTDFLRQVRDALHSGGYTVSIAVAAKTSDLQVGSWHGAYDYVALGKIADFVILMTYDWHY 285
Query: 583 AWEGITGLVSPLNSRPGEPYPNYNINTALKLIDELGGDKRKIVIGVPFYGQSYTLKLDKE 642
+ G G V+ + Y AL +I K+++G+P YG +TL D
Sbjct: 286 SG-GPPGPVASIG--WVRKVIEY----ALTVI-----PAEKVMMGIPLYGYDWTLPYD-P 332
Query: 643 HGLGAETDGPGLA 655
G A P A
Sbjct: 333 LGYLARAISPDEA 345
Score = 49.0 bits (117), Expect = 2e-05
Identities = 44/273 (16%), Positives = 98/273 (35%), Gaps = 52/273 (19%)
Query: 99 ITSDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANE-RSDFSTFIQ 157
+ ++ ++ I++++ + D + G+++ D E++ R ++ F++
Sbjct: 185 LLNNETAKNRLINNIITLLDARGYRGVNI-------------DFENVGPGDRELYTDFLR 231
Query: 158 ELSSTLRRNNYQLTL-----TSPGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVV 212
++ L Y +++ TS + D + + D ++L +++ +
Sbjct: 232 QVRDALHSGGYTVSIAVAAKTSDLQVGSWHGAYDYVALGKIADFVILMTYDWHYSGGPPG 291
Query: 213 PVKPNTKVNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYGL 272
PV V IE A+ E++++GIP +G Y L
Sbjct: 292 PVASIGWVR----------KVIEYAL-----TVIPAEKVMMGIPLYG--YDWT------L 328
Query: 273 GATVKGPGTEGKYTQMPGYLAFFEVCNKFKDKT----WRHFTDSNGEPFMVKKDEWITYE 328
G +A D T + ++ D G + + +E
Sbjct: 329 PYDPLGYLARAISPDEAIDIANRYNATIQYDATSQSPFFYYVDKEG------RYHEVWFE 382
Query: 329 NNDSIRRKMNYIKDRHLGGAMLWTLDLDDFRGF 361
+ S + K++ IK+ L G W L +D R +
Sbjct: 383 DARSFQTKLDLIKEYGLRGVSYWVLGQEDPRNW 415
Score = 39.7 bits (93), Expect = 0.020
Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 5/64 (7%)
Query: 732 RIKTRKWTVQRNTKSLEPFTFQGDQ-----WVSYEDPTSLTEKVKFIKSNGYAGIMAWTI 786
T+Q + S PF + D+ V +ED S K+ IK G G+ W +
Sbjct: 348 IANRYNATIQYDATSQSPFFYYVDKEGRYHEVWFEDARSFQTKLDLIKEYGLRGVSYWVL 407
Query: 787 DLDD 790
+D
Sbjct: 408 GQED 411
>gnl|CDD|119355 cd02876, GH18_SI-CLP, Stabilin-1 interacting chitinase-like protein
(SI-CLP) is a eukaryotic chitinase-like protein of
unknown function that interacts with the
endocytic/sorting transmembrane receptor stabilin-1 and
is secreted from the lysosome. SI-CLP has a glycosyl
hydrolase family 18 (GH18) domain but lacks a
chitin-binding domain. The catalytic amino acids of the
GH18 domain are not conserved in SI-CLP, similar to the
chitolectins YKL-39, YKL-40, and YM1/2. Human SI-CLP is
sorted to late endosomes and secretory lysosomes in
alternatively activated macrophages.
Length = 318
Score = 67.3 bits (165), Expect = 2e-11
Identities = 51/206 (24%), Positives = 83/206 (40%), Gaps = 36/206 (17%)
Query: 455 GKVKIL--IAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFD-WNYPV 511
+KIL + GW+ L++ R+K IK ++T ++ F G+ + W+
Sbjct: 65 KNIKILPRVLFEGWSYQD---LQSLLNDEQEREKLIKLLVTTAKKNHFDGIVLEVWSQ-- 119
Query: 512 CWQADCSKQHKADKGNFVKLIQELKAEFDKHDYSIAVGISGYKEILEVAY-----DFPAL 566
A + + ++L+ L + + + I +E DF L
Sbjct: 120 -LAAYGVPDKRKE---LIQLVIHLGETLHSANLKLILVIPPPREKGNQNGLFTRKDFEKL 175
Query: 567 NDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPN---YNINTALKLIDELGGDKR- 622
HV+ SLM+YDY S P P PN + + L+L+ G KR
Sbjct: 176 APHVDGFSLMTYDY---------------SSPQRPGPNAPLSWVRSCLELLLPESGKKRA 220
Query: 623 KIVIGVPFYGQSYTLKLDKEHGLGAE 648
KI++G+ FYG YTL G+E
Sbjct: 221 KILLGLNFYGNDYTLPGGGGAITGSE 246
Score = 55.4 bits (134), Expect = 1e-07
Identities = 48/202 (23%), Positives = 78/202 (38%), Gaps = 30/202 (14%)
Query: 1423 GWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCW-QVNCDAGPDS 1481
GW+ L+N R++ I+ +V K FDG+ L E W Q+ PD
Sbjct: 77 GWSYQD---LQSLLNDEQEREKLIKLLVTTAKKNHFDGIVL--EV---WSQLAAYGVPDK 128
Query: 1482 DKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVINAAY-----DVKALSESLDWISVMTY 1536
KE V L + L L + P ++ N D + L+ +D S+MTY
Sbjct: 129 RKELIQ-LVIHLGETLHSANLKLILVIPPPREKGNQNGLFTRKDFEKLAPHVDGFSLMTY 187
Query: 1537 DYHGQWDKKTGHVAPLYEHPDDDFFYFNANFTMNYWMKKGAPSR-KLVMGVHPLLSTITE 1595
DY ++ G APL + + + + + R K+++G++ + T
Sbjct: 188 DYSSP--QRPGPNAPLS--------WVRSC--LELLLPESGKKRAKILLGLNFYGNDYTL 235
Query: 1596 VLGHGP--GGNYESTTEEYKPT 1615
G G G Y + KP
Sbjct: 236 PGGGGAITGSEYLKLLKSNKPK 257
Score = 54.6 bits (132), Expect = 3e-07
Identities = 47/217 (21%), Positives = 82/217 (37%), Gaps = 47/217 (21%)
Query: 1765 DLDNKFYEKVTALKKKGVKVT--LAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILE 1822
D+D + E+V K +K+ + GW+ L+N +Q R K I +V +
Sbjct: 51 DIDKGWIEEVRK-ANKNIKILPRVLFEGWS---YQDLQSLLNDEQEREKLIKLLVTTAKK 106
Query: 1823 HNFDGLDLD-WEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAV 1881
++FDG+ L+ W G ++ L+ L + +L L + P +
Sbjct: 107 NHFDGIVLEVWSQ---LAA---YGVPDKRKELIQLVIHLGETLHSANLKLILVIPPPREK 160
Query: 1882 IDNAY-----DIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSL 1936
+ D ++ ++D S+MTYDY ++ G AP+
Sbjct: 161 GNQNGLFTRKDFEKLAPHVDGFSLMTYDYSSP--QRPGPNAPLS---------------- 202
Query: 1937 HYWV---------SHGADRKKVIFGMPMYGQSFTLAD 1964
WV G R K++ G+ YG +TL
Sbjct: 203 --WVRSCLELLLPESGKKRAKILLGLNFYGNDYTLPG 237
>gnl|CDD|216601 pfam01607, CBM_14, Chitin binding Peritrophin-A domain. This domain
is called the Peritrophin-A domain and is found in chitin
binding proteins particularly peritrophic matrix proteins
of insects and animal chitinases. Copies of the domain
are also found in some baculoviruses. Relevant references
that describe proteins with this domain include. It is an
extracellular domain that contains six conserved
cysteines that probably form three disulphide bridges.
Chitin binding has been demonstrated for a protein
containing only two of these domains.
Length = 53
Score = 56.7 bits (137), Expect = 5e-10
Identities = 16/53 (30%), Positives = 24/53 (45%), Gaps = 3/53 (5%)
Query: 1138 CVNGD--YLPDPDDCRSFLICSHGNLLKQSCGPSLLWNAKKKLCDWSYNV-QC 1187
C PDP DC + CS+G + +C L+++ CD+ NV C
Sbjct: 1 CEGRPDGLYPDPGDCSKYYQCSNGKAVVFTCPAGLVFDPALGTCDYPDNVVDC 53
Score = 54.0 bits (130), Expect = 4e-09
Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 2/51 (3%)
Query: 983 CTSGE--YYPVDGECGSYYRCILGTLKKESCAPGLHWNKVNKICDWPKSAK 1031
C YP G+C YY+C G +C GL ++ CD+P +
Sbjct: 1 CEGRPDGLYPDPGDCSKYYQCSNGKAVVFTCPAGLVFDPALGTCDYPDNVV 51
Score = 53.6 bits (129), Expect = 6e-09
Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 1205 CQEGE--FAAYPSDCNKYQYCIWGSYQVASCSPGLYWNDKMKTCDWP 1249
C+ P DC+KY C G V +C GL ++ + TCD+P
Sbjct: 1 CEGRPDGLYPDPGDCSKYYQCSNGKAVVFTCPAGLVFDPALGTCDYP 47
Score = 42.0 bits (99), Expect = 7e-05
Identities = 15/36 (41%), Positives = 20/36 (55%)
Query: 2129 NRTGELYQHEGDCAKFASCENGLIKIYDCPPGLHFN 2164
R LY GDC+K+ C NG ++ CP GL F+
Sbjct: 3 GRPDGLYPDPGDCSKYYQCSNGKAVVFTCPAGLVFD 38
>gnl|CDD|214696 smart00494, ChtBD2, Chitin-binding domain type 2.
Length = 49
Score = 53.2 bits (128), Expect = 8e-09
Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 2/49 (4%)
Query: 1136 ETCVNGD--YLPDPDDCRSFLICSHGNLLKQSCGPSLLWNAKKKLCDWS 1182
C P P DC + CS+G + SC L++N + CDW
Sbjct: 1 NECPGRGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDWP 49
Score = 52.8 bits (127), Expect = 1e-08
Identities = 17/47 (36%), Positives = 21/47 (44%), Gaps = 2/47 (4%)
Query: 983 CTSGE--YYPVDGECGSYYRCILGTLKKESCAPGLHWNKVNKICDWP 1027
C YP +C YY+C G SC GL +N + CDWP
Sbjct: 3 CPGRGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDWP 49
Score = 52.1 bits (125), Expect = 2e-08
Identities = 20/47 (42%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 1205 CQEGEFAAYPS--DCNKYQYCIWGSYQVASCSPGLYWNDKMKTCDWP 1249
C YP DC+KY C G V SC GL +N +TCDWP
Sbjct: 3 CPGRGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDWP 49
Score = 44.0 bits (104), Expect = 1e-05
Identities = 18/41 (43%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 2126 NRC-NRTGELYQHEGDCAKFASCENGLIKIYDCPPGLHFNF 2165
N C R LY H DC+K+ C NG + CP GL FN
Sbjct: 1 NECPGRGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNP 41
>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein. This family of
trypanosomal proteins resemble vertebrate mucins. The
protein consists of three regions. The N and C terminii
are conserved between all members of the family, whereas
the central region is not well conserved and contains a
large number of threonine residues which can be
glycosylated. Indirect evidence suggested that these
genes might encode the core protein of parasite mucins,
glycoproteins that were proposed to be involved in the
interaction with, and invasion of, mammalian host cells.
This family contains an N-terminal signal peptide.
Length = 143
Score = 51.8 bits (123), Expect = 3e-07
Identities = 24/63 (38%), Positives = 33/63 (52%)
Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
T+ + P T +T+T TT T TTTTT+TTTTT + P+ +TT+ P T
Sbjct: 48 TTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDT 107
Query: 1675 IKP 1677
P
Sbjct: 108 RAP 110
Score = 50.2 bits (119), Expect = 8e-07
Identities = 34/90 (37%), Positives = 42/90 (46%), Gaps = 8/90 (8%)
Query: 1603 GNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT 1662
G Y++ E ++ TT +T+ TT T ITTTTT TTTTT TTT
Sbjct: 29 GQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTT---TTT 85
Query: 1663 STTTTRPKPITTIKPKPTTVKPKPTTTVKP 1692
+TTT P TT T+ P T T P
Sbjct: 86 TTTTEAPSKNTT-----TSEAPTTTDTRAP 110
Score = 48.3 bits (114), Expect = 5e-06
Identities = 24/66 (36%), Positives = 28/66 (42%)
Query: 1631 TTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTV 1690
+ + T TTT +TTTTT TT+TT T TT TT P TT
Sbjct: 39 AEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTT 98
Query: 1691 KPKPTT 1696
PTT
Sbjct: 99 SEAPTT 104
Score = 46.0 bits (108), Expect = 3e-05
Identities = 23/90 (25%), Positives = 31/90 (34%), Gaps = 1/90 (1%)
Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEE-PITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
T++ + + T T TT+TTTTT TTT+ TTT T
Sbjct: 24 TAQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTT 83
Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVKPKPVI 1703
T TT+ P T + I
Sbjct: 84 TTTTTTEAPSKNTTTSEAPTTTDTRAPSSI 113
Score = 43.3 bits (101), Expect = 2e-04
Identities = 26/80 (32%), Positives = 33/80 (41%), Gaps = 6/80 (7%)
Query: 1630 VTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
E + + TTTTT+TT T TTT+TTTT TT TT TTT
Sbjct: 33 AAVVEAAEGQSQTTTTTTTTTPPT------TTTTTTTTTTTITTTTTKTTTTTTTTTTTT 86
Query: 1690 VKPKPTTVKPKPVIPPSTKD 1709
+ + P+T D
Sbjct: 87 TTTEAPSKNTTTSEAPTTTD 106
Score = 42.2 bits (98), Expect = 5e-04
Identities = 29/70 (41%), Positives = 34/70 (48%), Gaps = 11/70 (15%)
Query: 1627 STVVTTEEPEQTEEPITT-------TTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKP 1679
S T + Q + + TTT+TTTTT PPTTT+TTTT ITT K
Sbjct: 20 SVCATAQGEGQYDAAVVEAAEGQSQTTTTTTTTT----PPTTTTTTTTTTTTITTTTTKT 75
Query: 1680 TTVKPKPTTT 1689
TT TTT
Sbjct: 76 TTTTTTTTTT 85
Score = 37.1 bits (85), Expect = 0.028
Identities = 22/70 (31%), Positives = 28/70 (40%), Gaps = 7/70 (10%)
Query: 1259 ATTTSEQVPKPTKKPTKPTKKPTTTT--EYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
A + + T T T PTTTT TT + TT+TTT T T
Sbjct: 34 AVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTT-----TTTTTTTT 88
Query: 1317 WHPPTEPTTT 1326
P++ TTT
Sbjct: 89 TEAPSKNTTT 98
Score = 37.1 bits (85), Expect = 0.033
Identities = 23/70 (32%), Positives = 26/70 (37%)
Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
A Q T T PT TTTT TT +TTT+TTT + T
Sbjct: 36 VEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKN 95
Query: 1317 WHPPTEPTTT 1326
PTTT
Sbjct: 96 TTTSEAPTTT 105
Score = 36.8 bits (84), Expect = 0.035
Identities = 25/71 (35%), Positives = 30/71 (42%), Gaps = 7/71 (9%)
Query: 1256 QTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEW 1315
QT+ TTT T PT T TTTT T +TTT+TTT + P+
Sbjct: 44 QTTTTTT------TTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTT 97
Query: 1316 VWHPPTEPTTT 1326
PT T T
Sbjct: 98 TSEAPT-TTDT 107
Score = 35.2 bits (80), Expect = 0.13
Identities = 18/72 (25%), Positives = 26/72 (36%)
Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPN 1311
T T+ TTT+ T T TTTT EA +K +TT+ T + P+
Sbjct: 52 TTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPS 111
Query: 1312 PTEWVWHPPTEP 1323
+
Sbjct: 112 SIREIDGSLGSS 123
Score = 34.8 bits (79), Expect = 0.17
Identities = 15/69 (21%), Positives = 17/69 (24%)
Query: 1639 EEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
E + TTT+TT TT T TTT TT
Sbjct: 27 GEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTT 86
Query: 1699 PKPVIPPST 1707
P
Sbjct: 87 TTTEAPSKN 95
Score = 34.8 bits (79), Expect = 0.21
Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 4/70 (5%)
Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTS----TTTTTKRPKPPTTT 1662
+TT P + + TT +T+ TT + T TTTTT+ + TT P TT
Sbjct: 47 TTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTD 106
Query: 1663 STTTTRPKPI 1672
+ + + I
Sbjct: 107 TRAPSSIREI 116
Score = 32.5 bits (73), Expect = 1.1
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 6/63 (9%)
Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPN 1311
T + TTT+ T T TK TTTT TT+ + +TT+++ P
Sbjct: 50 TTTTPPTTTTTTTT--TTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEA----PT 103
Query: 1312 PTE 1314
T+
Sbjct: 104 TTD 106
Score = 31.8 bits (71), Expect = 2.1
Identities = 18/57 (31%), Positives = 23/57 (40%)
Query: 1249 PYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDS 1305
P T T+ TT + K T T T TTT + T++ TTT T S
Sbjct: 55 PTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPS 111
Score = 31.0 bits (69), Expect = 3.4
Identities = 22/84 (26%), Positives = 29/84 (34%), Gaps = 7/84 (8%)
Query: 1254 CKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPT 1313
C AT E + TTTT TT + T+TTT + T T
Sbjct: 18 CASVCATAQGEGQYDAAVVEAAEGQSQTTTT-------TTTTTPPTTTTTTTTTTTTITT 70
Query: 1314 EWVWHPPTEPTTTHISVTEKSPLD 1337
T TTT + T ++P
Sbjct: 71 TTTKTTTTTTTTTTTTTTTEAPSK 94
>gnl|CDD|119350 cd02871, GH18_chitinase_D-like, GH18 domain of Chitinase D (ChiD).
ChiD, a chitinase found in Bacillus circulans, hydrolyzes
the 1,4-beta-linkages of N-acetylglucosamine in chitin
and chitodextrins. The domain architecture of ChiD
includes a catalytic glycosyl hydrolase family 18 (GH18)
domain, a chitin-binding domain, and a fibronectin type
III domain. The chitin-binding and fibronectin type III
domains are located either N-terminal or C-terminal to
the catalytic domain. This family includes exochitinase
Chi36 from Bacillus cereus.
Length = 312
Score = 54.6 bits (132), Expect = 3e-07
Identities = 46/206 (22%), Positives = 77/206 (37%), Gaps = 30/206 (14%)
Query: 1713 IVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDN---- 1768
+V Y+ NW +G D + FA +D + +
Sbjct: 3 LVGYWHNWDNG--AGSGRQDLDDVPSKYNVINVAFAEPTSDGGGEVTFNNGSSPGGYSPA 60
Query: 1769 KFYEKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGL 1828
+F + AL+ KG KV ++IGG N +N F+ +V I E+ FDGL
Sbjct: 61 EFKADIKALQAKGKKVLISIGGANGHV------DLNHTAQEDNFVDSIVAIIKEYGFDGL 114
Query: 1829 DLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAA------VSPSKAV- 1881
D+D E +K+L+ + P + +L+ A A
Sbjct: 115 DIDLE-----SGSNPLNATPVITNLISALKQLKDHYGP-NFILTMAPETPYVQGGYAAYG 168
Query: 1882 -IDNAYDIPV---MSENLDWISVMTY 1903
I AY +P+ + ++L W++V Y
Sbjct: 169 GIWGAY-LPLIDNLRDDLTWLNVQYY 193
Score = 50.4 bits (121), Expect = 6e-06
Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 6/57 (10%)
Query: 1410 LKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWE 1466
L+ KG KV ++IGG N + +N F++ +V + +Y FDGLD+D E
Sbjct: 69 LQAKGKKVLISIGGANGHV------DLNHTAQEDNFVDSIVAIIKEYGFDGLDIDLE 119
Score = 40.4 bits (95), Expect = 0.010
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 8/68 (11%)
Query: 2246 EQGFYEKITKFNERGIQVILAIGGWSDSRSDKYSR-LVNDIPARRNFVRHVTEFLEFYGF 2304
F I +G +V+++IGG +N NFV + ++ YGF
Sbjct: 59 PAEFKADIKALQAKGKKVLISIGG-------ANGHVDLNHTAQEDNFVDSIVAIIKEYGF 111
Query: 2305 NGLEFAWE 2312
+GL+ E
Sbjct: 112 DGLDIDLE 119
>gnl|CDD|119354 cd02875, GH18_chitobiase, Chitobiase (also known as
di-N-acetylchitobiase) is a lysosomal glycosidase that
hydrolyzes the reducing-end N-acetylglucosamine from the
chitobiose core of oligosaccharides during the ordered
degradation of asparagine-linked glycoproteins in
eukaryotes. Chitobiase can only do so if the asparagine
that joins the oligosaccharide to protein is previously
removed by a glycosylasparaginase. Chitobiase is
therefore the final step in the lysosomal degradation of
the protein/carbohydrate linkage component of
asparagine-linked glycoproteins. The catalytic domain of
chitobiase is an eight-stranded alpha/beta barrel fold
similar to that of other family 18 glycosyl hydrolases
such as hevamine and chitotriosidase.
Length = 358
Score = 52.0 bits (125), Expect = 2e-06
Identities = 64/292 (21%), Positives = 105/292 (35%), Gaps = 35/292 (11%)
Query: 1778 KKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLDWEYPKC 1837
KGV++ L ++ N R+++I V DG+++D E P
Sbjct: 74 HSKGVRLVL-------KGDVPLEQISNPTY-RTQWIQQKVELAKSQFMDGINIDIEQP-- 123
Query: 1838 WQVDCKQGPASDKQGFADLIKELRAAF---NPHDLLLSAAVSPSKAVID-NAYDIPVMSE 1893
+ G + +L+KE AF NP +S V+ S + ID YD +++
Sbjct: 124 --ITK--GS-PEYYALTELVKETTKAFKKENPGY-QISFDVAWSPSCIDKRCYDYTGIAD 177
Query: 1894 NLDWISVMTYDYHGQ-WDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHGADRKKVIFG 1952
D++ VM YD Q W K + +P + + G D KK++ G
Sbjct: 178 ASDFLVVMDYDEQSQIWGK--------ECIAGANSPYSQTLSGYNNFTKLGIDPKKLVMG 229
Query: 1953 MPMYGQSFTLADKNKNGLNSQTYGGAEAGEN-TRARGFLAYYEICDKIQKDGWVVVRDRK 2011
+P YG + + N + G N + A G Y K
Sbjct: 230 LPWYGYDYPCLNGNLEDVVCTIPKVPFRGANCSDAAGRQIPYSEIMKQINSSIGGRLWDS 289
Query: 2012 RRIGPYAFKGD-----QWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDF 2058
+ P+ D V +D+ + K + K L G +W DL D+
Sbjct: 290 EQKSPFYNYKDKQGNLHQVWYDNPQSLSIKVAYAKNLGLKGIGMWNGDLLDY 341
Score = 42.8 bits (101), Expect = 0.002
Identities = 40/166 (24%), Positives = 69/166 (41%), Gaps = 25/166 (15%)
Query: 478 ISSGSNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKLIQE--- 534
IS+ + R ++I+ + + G++ D P+ + +L++E
Sbjct: 91 ISNPTYRTQWIQQKVELAKSQFMDGINIDIEQPITKGSPEYYA-------LTELVKETTK 143
Query: 535 -LKAEFDKHDYSIAVGISGYKEILEVAYDFPALNDHVEFMSLMSYDYHG-AWEG--ITGL 590
K E + S V S I + YD+ + D +F+ +M YD W I G
Sbjct: 144 AFKKENPGYQISFDVAWSP-SCIDKRCYDYTGIADASDFLVVMDYDEQSQIWGKECIAGA 202
Query: 591 VSPLNSRPGEPYPNYNINTALKLIDELGGDKRKIVIGVPFYGQSYT 636
SP S+ Y N+ +LG D +K+V+G+P+YG Y
Sbjct: 203 NSP-YSQTLSGYNNFT---------KLGIDPKKLVMGLPWYGYDYP 238
Score = 42.4 bits (100), Expect = 0.002
Identities = 47/161 (29%), Positives = 74/161 (45%), Gaps = 30/161 (18%)
Query: 1435 LVNSATARQRFIEHVVKFLLKYQF-DGLDLDWEYPTCWQVNCDAGPDSDKESFGL--FVR 1491
+ N T R ++I+ V+ L K QF DG+++D E P + + E + L V+
Sbjct: 91 ISNP-TYRTQWIQQKVE-LAKSQFMDGINIDIEQP----ITKGS-----PEYYALTELVK 139
Query: 1492 ELHQAFKPH--GLLLSAAVSPSKQVIN-AAYDVKALSESLDWISVMTYDYHGQ-WDKK-- 1545
E +AFK G +S V+ S I+ YD ++++ D++ VM YD Q W K+
Sbjct: 140 ETTKAFKKENPGYQISFDVAWSPSCIDKRCYDYTGIADASDFLVVMDYDEQSQIWGKECI 199
Query: 1546 TGHVAPLYEHPDDDFFYFNANFTMNYWMKKGAPSRKLVMGV 1586
G +P Y N + K G +KLVMG+
Sbjct: 200 AGANSP----------YSQTLSGYNNFTKLGIDPKKLVMGL 230
Score = 41.6 bits (98), Expect = 0.004
Identities = 51/276 (18%), Positives = 107/276 (38%), Gaps = 40/276 (14%)
Query: 101 SDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFSTFIQELS 160
S+ R +I +E+ DG++++++ P + + ++E +
Sbjct: 92 SNPTYRTQWIQQKVELAKSQFMDGINIDIEQPITKGSPEYYA---------LTELVKETT 142
Query: 161 STLRRNN--YQLTLT---SPGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVVPVK 215
++ N YQ++ SP ID+ D + +A D +++ ++ +
Sbjct: 143 KAFKKENPGYQISFDVAWSPSCIDK--RCYDYTGIADASDFLVVMDYD----------EQ 190
Query: 216 PNTKVNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYGLGAT 275
+ + ++ S N+ K G P+++++G+P++G Y N + + T
Sbjct: 191 SQIWGKECIAGANSPYSQTLSGYNNFTKLGIDPKKLVMGLPWYGYDYPCLNGNLEDVVCT 250
Query: 276 VK------GPGTEGKYTQMPGYLAFFEVCNKFKDKTWRHFTDSNGEPFMVKKD-----EW 324
+ ++ Q+P Y + N R + PF KD
Sbjct: 251 IPKVPFRGANCSDAAGRQIP-YSEIMKQIN--SSIGGRLWDSEQKSPFYNYKDKQGNLHQ 307
Query: 325 ITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDDFRG 360
+ Y+N S+ K+ Y K+ L G +W DL D+ G
Sbjct: 308 VWYDNPQSLSIKVAYAKNLGLKGIGMWNGDLLDYSG 343
Score = 37.0 bits (86), Expect = 0.10
Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 5/71 (7%)
Query: 726 YYEICYRIKTRKWTVQRNTKSLEPFTFQGD-----QWVSYEDPTSLTEKVKFIKSNGYAG 780
Y EI +I + +++ PF D V Y++P SL+ KV + K+ G G
Sbjct: 271 YSEIMKQINSSIGGRLWDSEQKSPFYNYKDKQGNLHQVWYDNPQSLSIKVAYAKNLGLKG 330
Query: 781 IMAWTIDLDDF 791
I W DL D+
Sbjct: 331 IGMWNGDLLDY 341
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is the
most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 52.9 bits (126), Expect = 2e-06
Identities = 28/102 (27%), Positives = 32/102 (31%), Gaps = 12/102 (11%)
Query: 1601 PGGNYESTT-EEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP 1659
P G ST E+ PTS + T +TS P T TT TS T P P
Sbjct: 474 PTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATS--PTTQKTSDTPNATSPTPI 531
Query: 1660 TTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
TTT P T P+ T P
Sbjct: 532 VIGVTTTA---------TSPPTGTTSVPNATSPQVTEESPVN 564
Score = 44.4 bits (104), Expect = 9e-04
Identities = 23/96 (23%), Positives = 33/96 (34%), Gaps = 4/96 (4%)
Query: 1608 TTEEYKPTSEESKPTT-VSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTT 1666
TT++ T + PT V T T P T TS T + P T T T
Sbjct: 515 TTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSV--PNATSPQVTEESPVNNTNTPVVT 572
Query: 1667 TRPKPIT-TIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
+ P +T + PT+ P++ P
Sbjct: 573 SAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTP 608
Score = 43.2 bits (101), Expect = 0.002
Identities = 22/109 (20%), Positives = 27/109 (24%), Gaps = 8/109 (7%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
T + PTT TS P +TTT TS T T T+ T P T
Sbjct: 507 TTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNT 566
Query: 1674 TIK-----PKPTTVKPKPT---TTVKPKPTTVKPKPVIPPSTKDEFKIV 1714
P T T P + +
Sbjct: 567 NTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTST 615
Score = 41.3 bits (96), Expect = 0.008
Identities = 25/103 (24%), Positives = 37/103 (35%), Gaps = 9/103 (8%)
Query: 1608 TTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST--- 1664
++ P S + T + TS T E E P +TT +T + P P TT+
Sbjct: 602 SSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGP 661
Query: 1665 --TTTRPKP----ITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
++T P +T P P P + K TV
Sbjct: 662 GNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTG 704
Score = 39.0 bits (90), Expect = 0.039
Identities = 39/196 (19%), Positives = 55/196 (28%), Gaps = 15/196 (7%)
Query: 1530 WISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANFTMNYWMKKGAPSRKLVMGVHPL 1589
W +V D+ +W T P FF N F + ++
Sbjct: 353 WGNVTQSDFKCKWVLATHGQNPEGCERSLGFFNSNRTFEVTVANPVADAKTLIITRTATN 412
Query: 1590 LSTITE--VLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTT 1647
+T T V P E K T V T+ + T T+
Sbjct: 413 ATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSG 472
Query: 1648 STTTTTKRPKP----PTTTSTTTTRPKPI-TTIKPKP-----TTVKPKPTTTVK-PKPTT 1696
+ T TT P PT+ +T+ T T P TT K T P P
Sbjct: 473 TPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIV 532
Query: 1697 VKPKPV--IPPSTKDE 1710
+ PP+
Sbjct: 533 IGVTTTATSPPTGTTS 548
Score = 35.9 bits (82), Expect = 0.29
Identities = 31/132 (23%), Positives = 46/132 (34%), Gaps = 10/132 (7%)
Query: 1588 PLLSTITEVLGHG-------PGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEE 1640
P T ++V G G PG + + + S P+ T+ T +
Sbjct: 650 PGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANS 709
Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPK 1700
T+ ST + P T T P TT P T+ K +P T P T K
Sbjct: 710 TTKETSGSTLMAST--SPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQA 767
Query: 1701 PV-IPPSTKDEF 1711
V +PP+ +
Sbjct: 768 TVPVPPTQHPDH 779
Score = 35.9 bits (82), Expect = 0.29
Identities = 24/88 (27%), Positives = 28/88 (31%), Gaps = 9/88 (10%)
Query: 1249 PYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAW 1308
T A TT PT + T T T+ T TT S T TT+ A
Sbjct: 495 ATPNATSPTPAVTTPN-ATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNAT 553
Query: 1309 TPNPTEWVWHPPTEPT--TTHISVTEKS 1334
+P TE P T VT
Sbjct: 554 SPQVTE------ESPVNNTNTPVVTSAP 575
Score = 32.8 bits (74), Expect = 2.5
Identities = 25/84 (29%), Positives = 32/84 (38%), Gaps = 10/84 (11%)
Query: 1261 TTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNP-----TEW 1315
+T + PT + T T T+ T T P TT T+ A +P P T
Sbjct: 480 STLPEDTSPTSRTTSATPNATSPTPAVTTPNATSP-TTQKTSDTPNATSPTPIVIGVTTT 538
Query: 1316 VWHPP---TEPTT-THISVTEKSP 1335
PP T T VTE+SP
Sbjct: 539 ATSPPTGTTSVPNATSPQVTEESP 562
Score = 32.8 bits (74), Expect = 2.9
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 1256 QTSATTTSEQVPKPTKKPTKPTKKPTT-----TTEYNPPEATTKPSTTTSTTTDSGAWTP 1310
+T+A T+ +P + PT T PT+ TT PE T+ S TTS T ++ + TP
Sbjct: 445 KTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTP 504
Query: 1311 -NPTEWVWHPPTEPTTTHISVTEKSPL 1336
T P T+ T+ + T +P+
Sbjct: 505 AVTTPNATSPTTQKTSDTPNATSPTPI 531
Score = 31.3 bits (70), Expect = 9.3
Identities = 25/126 (19%), Positives = 38/126 (30%), Gaps = 16/126 (12%)
Query: 1588 PLLSTITEVLGHGPGGNYESTTEEY-KPTSEESKPTT---------------VSTSTVVT 1631
P++ +T P G + T E T V+T T
Sbjct: 530 PIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGT 589
Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
P + I +++ ST + P TS T + IT P + T +
Sbjct: 590 GSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPG 649
Query: 1692 PKPTTV 1697
P P T
Sbjct: 650 PGPGTT 655
>gnl|CDD|119366 cd06549, GH18_trifunctional, GH18 domain of an uncharacterized family
of bacterial proteins, which share a common three-domain
architecture: an N-terminal glycosyl hydrolase family 18
(GH18) domain, a glycosyl transferase family 2 domain,
and a C-terminal polysaccharide deacetylase domain.
Length = 298
Score = 51.3 bits (123), Expect = 3e-06
Identities = 41/151 (27%), Positives = 67/151 (44%), Gaps = 26/151 (17%)
Query: 1434 RLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVREL 1493
RL+ +AR +FI ++ +L + Q DG+ LD+E P D + F+ EL
Sbjct: 81 RLLADPSARAKFIANIAAYLERNQADGIVLDFE----------ELPADDLPKYVAFLSEL 130
Query: 1494 HQAFKPHGLLLSAAVSPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLY 1553
+ G L+ V P+ A +++KAL+ + D + +M YD H Q G P+
Sbjct: 131 RRRLPAQGKQLTVTV-PAD---EADWNLKALARNADKLILMAYDEHYQG----GAPGPI- 181
Query: 1554 EHPDDDFFYFNANFTMNYWMKKGAPSRKLVM 1584
D+F N + K P KL++
Sbjct: 182 --ASQDWFESNLAQAV-----KKLPPEKLIV 205
Score = 45.1 bits (107), Expect = 3e-04
Identities = 29/108 (26%), Positives = 51/108 (47%), Gaps = 14/108 (12%)
Query: 1801 RLVNSQQARSKFIAHVVNFILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKEL 1860
RL+ AR+KFIA++ ++ + DG+ LD+E + PA D + + EL
Sbjct: 81 RLLADPSARAKFIANIAAYLERNQADGIVLDFE----------ELPADDLPKYVAFLSEL 130
Query: 1861 RAAFNPHDLLLSAAVSPSKAVIDNAYDIPVMSENLDWISVMTYDYHGQ 1908
R L+ V P+ + +++ ++ N D + +M YD H Q
Sbjct: 131 RRRLPAQGKQLTVTV-PAD---EADWNLKALARNADKLILMAYDEHYQ 174
Score = 35.8 bits (83), Expect = 0.26
Identities = 22/102 (21%), Positives = 39/102 (38%), Gaps = 14/102 (13%)
Query: 482 SNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDK 541
S R KFI ++ +L R G+ D+ + D +V + EL+
Sbjct: 87 SARAKFIANIAAYLERNQADGIVLDFE----------ELPADDLPKYVAFLSELRRRLPA 136
Query: 542 HDYSIAVGISGYKEILEVAYDFPALNDHVEFMSLMSYDYHGA 583
+ V + E ++ AL + + + LM+YD H
Sbjct: 137 QGKQLTVTVP----ADEADWNLKALARNADKLILMAYDEHYQ 174
Score = 33.5 bits (77), Expect = 1.1
Identities = 31/159 (19%), Positives = 62/159 (38%), Gaps = 33/159 (20%)
Query: 102 DRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESI-ANERSDFSTFIQELS 160
D +R FI+++ + + DG+ L D E + A++ + F+ EL
Sbjct: 85 DPSARAKFIANIAAYLERNQADGIVL-------------DFEELPADDLPKYVAFLSELR 131
Query: 161 STLRRNNYQLTLTSPGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTKV 220
L QLT+T P + ++ +A D ++L +++ + P+
Sbjct: 132 RRLPAQGKQLTVTVP----ADEADWNLKALARNADKLILMAYDEHYQGGAPGPIASQDWF 187
Query: 221 NIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFG 259
+N+ AV K PE++I+ + +G
Sbjct: 188 ----------ESNLAQAV-----KKLPPEKLIVALGSYG 211
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in the
binding of extracellular matrix components and growth
factors.
Length = 207
Score = 47.0 bits (112), Expect = 3e-05
Identities = 23/88 (26%), Positives = 35/88 (39%), Gaps = 5/88 (5%)
Query: 1612 YKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
P+ +E +++T + T + TT+ +T+TK +TT TTTT P
Sbjct: 60 ATPSDDEDSEPVTTSATP---PKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSE 116
Query: 1672 ITTIKPKPTTVKPKPTTTVKPKPTTVKP 1699
T + TTV T T P
Sbjct: 117 TDT-EEATTTV-STETPTEGGSSAATDP 142
Score = 35.1 bits (81), Expect = 0.30
Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 6/82 (7%)
Query: 1260 TTTSEQVPKPTKKPTKPTKKPTT---TTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
TTS PK T + P+ TT +T+ +P +TT +TT+ + TD+ T
Sbjct: 71 VTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDT---EEATTTVS 127
Query: 1317 WHPPTEPTTTHISVTEKSPLDQ 1338
PTE ++ + K+ L++
Sbjct: 128 TETPTEGGSSAATDPSKNLLER 149
Score = 34.3 bits (79), Expect = 0.44
Identities = 25/94 (26%), Positives = 37/94 (39%), Gaps = 15/94 (15%)
Query: 1619 SKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPK 1678
+ P+ S VTT T +TTT++S + T TT ST+T ++T
Sbjct: 60 ATPSDDEDSEPVTTS---ATPPKLTTTSSSPSNDT------TTASTSTKTSPTVSTTVTT 110
Query: 1679 PTTVKPKPTTTVKPKPTTV---KPKPVIPPSTKD 1709
T+ P T + TTV P + D
Sbjct: 111 TTS--PS-ETDTEEATTTVSTETPTEGGSSAATD 141
Score = 34.3 bits (79), Expect = 0.53
Identities = 18/66 (27%), Positives = 27/66 (40%)
Query: 1239 WNDKMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTT 1298
D K TS++ +++ T T PT T TT +P E T+ +TT
Sbjct: 65 DEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATT 124
Query: 1299 TSTTTD 1304
T +T
Sbjct: 125 TVSTET 130
Score = 33.2 bits (76), Expect = 1.3
Identities = 16/78 (20%), Positives = 23/78 (29%), Gaps = 5/78 (6%)
Query: 1635 PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKP 1694
P E+ TT++T + TTT T+ PT TTT P
Sbjct: 62 PSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTS----PTVSTTVTTTTS-PSE 116
Query: 1695 TTVKPKPVIPPSTKDEFK 1712
T + +
Sbjct: 117 TDTEEATTTVSTETPTEG 134
Score = 31.2 bits (71), Expect = 4.5
Identities = 16/59 (27%), Positives = 21/59 (35%), Gaps = 7/59 (11%)
Query: 1279 KPTTTTEYNPPEATTKPS------TTTSTTTDSGAWTPNPTEWVWHPPTEPTTTHISVT 1331
TT PP+ TT S TT ST+T + P+E T + T
Sbjct: 68 SEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTT-TSPSETDTEEATTT 125
Score = 30.5 bits (69), Expect = 8.2
Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 1/51 (1%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTE-EPITTTTTSTTTTTKRPKPPTTTS 1663
S +K + ++TV TT P +T+ E TTT ++ T T T S
Sbjct: 93 TASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTETPTEGGSSAATDPS 143
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 47.1 bits (112), Expect = 4e-05
Identities = 21/84 (25%), Positives = 27/84 (32%), Gaps = 3/84 (3%)
Query: 1629 VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
+V + + T + P+ T P T KPK K
Sbjct: 36 LVPLAVFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPK 95
Query: 1689 TVKPKPTTVKPKPVIPPSTKDEFK 1712
KPKP KPKP P K + K
Sbjct: 96 KPKPKP---KPKPKPKPKVKPQPK 116
Score = 45.5 bits (108), Expect = 1e-04
Identities = 30/99 (30%), Positives = 36/99 (36%), Gaps = 12/99 (12%)
Query: 1613 KPTSEESKPTTVSTS-TVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
P + V + T EPE EE T + P PT +PK
Sbjct: 37 VPLAVFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTE-----PETPPEPTPP-----KPKE 86
Query: 1672 ITTIKPKPTTVKPKPTTTVKPKPTTV-KPKPVIPPSTKD 1709
+ KP KPKP KPKP +PKP PPS
Sbjct: 87 KPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTA 125
Score = 33.6 bits (77), Expect = 0.90
Identities = 23/102 (22%), Positives = 30/102 (29%), Gaps = 6/102 (5%)
Query: 1608 TTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTT 1667
PT EE +P +P E T +P+
Sbjct: 46 AKVLEAPT-EEPQPEP---EPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKP 101
Query: 1668 RPKPITTIKPKPT-TVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
+PKP K KP K P+ T P +P PPS
Sbjct: 102 KPKPKPKPKVKPQPKPKKPPSKTAAKAPAA-PNQPARPPSAA 142
>gnl|CDD|119363 cd06546, GH18_CTS3_chitinase, GH18 domain of CTS3 (chitinase 3), an
uncharacterized protein from the human fungal pathogen
Coccidioides posadasii. CTS3 has a chitinase-like
glycosyl hydrolase family 18 (GH18) domain; and has
homologs in bacteria as well as fungi.
Length = 256
Score = 46.6 bits (111), Expect = 6e-05
Identities = 39/168 (23%), Positives = 59/168 (35%), Gaps = 39/168 (23%)
Query: 1712 KIVCYFTNWAWYRQSGGKYLPSDIDSDL----CTHVIYGFAVLDTDQLVI----KPHDT- 1762
++V Y+ S G + S + TH+I ++ D + P
Sbjct: 1 RLVIYYQT---THPSNGDPISSLLLVTEKGIALTHLIVAALHINDDGNIHLNDHPPDHPR 57
Query: 1763 ----WADLDNKFYEKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVN 1818
W +L L+ GVKV +GG A +SRL + + ++ + +
Sbjct: 58 FTTLWTELA--------ILQSSGVKVMGMLGGA---APGSFSRLDDDDEDFERYYGQLRD 106
Query: 1819 FILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNP 1866
I DGLDLD E P G LI LR+ F P
Sbjct: 107 MIRRRGLDGLDLDVEEPM------SLD------GIIRLIDRLRSDFGP 142
Score = 41.9 bits (99), Expect = 0.002
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%)
Query: 1409 TLKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYP 1468
L+ GVKV +GG G +SRL + +R+ + + + DGLDLD E P
Sbjct: 67 ILQSSGVKVMGMLGGAAP---GSFSRLDDDDEDFERYYGQLRDMIRRRGLDGLDLDVEEP 123
Score = 35.0 bits (81), Expect = 0.36
Identities = 25/130 (19%), Positives = 48/130 (36%), Gaps = 19/130 (14%)
Query: 413 TH-IVYAFASLDPNTLSIQAGNPEADIDDNFYQRISSSPLVTHGKVKILIAIGGWTDSSG 471
TH IV A D + + P+ F + ++ VK++ +GG S
Sbjct: 31 THLIVAALHINDDGNIHLNDHPPD---HPRFTTLWTELAILQSSGVKVMGMLGGAAPGS- 86
Query: 472 EKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKL 531
+S+L + +++ + +RR GL D P+ ++L
Sbjct: 87 --FSRLDDDDEDFERYYGQLRDMIRRRGLDGLDLDVEEPM------------SLDGIIRL 132
Query: 532 IQELKAEFDK 541
I L+++F
Sbjct: 133 IDRLRSDFGP 142
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 47.4 bits (112), Expect = 7e-05
Identities = 28/135 (20%), Positives = 48/135 (35%), Gaps = 20/135 (14%)
Query: 1613 KPTSEESKPTTVSTSTVVTTEE--PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPK 1670
K TS ++ + + + VT + Q + T ++T T +TT++ P+
Sbjct: 176 KTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSST----EPVGTQGTTTSSNPE 231
Query: 1671 PITTIKPK-----------PTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKIVCYFTN 1719
P T P P+T +TT + T + PP+T + T
Sbjct: 232 PQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHT--QRRKTPPATSNRRSPHSTATP 289
Query: 1720 WAW-YRQSGGKYLPS 1733
RQ G+ P
Sbjct: 290 PPTTKRQETGRPTPR 304
Score = 47.0 bits (111), Expect = 9e-05
Identities = 30/108 (27%), Positives = 41/108 (37%), Gaps = 6/108 (5%)
Query: 1600 GPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP 1659
P G +TT E P+ S + + P T +TT T+R K P
Sbjct: 217 EPVGTQGTTTSSNPEPQTEPPPSQRGPSG--SPQHPPSTTSQDQSTTGDGQEHTQRRKTP 274
Query: 1660 TTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
TS + P +T P PTT K + T P+PT PP +
Sbjct: 275 PATSNRRS---PHSTATPPPTT-KRQETGRPTPRPTATTQSGSSPPHS 318
Score = 38.5 bits (89), Expect = 0.038
Identities = 20/92 (21%), Positives = 29/92 (31%), Gaps = 6/92 (6%)
Query: 1238 YWNDKMKTCDWPYRTK----CKQT-SATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEA- 1291
Y C R K CK+ +A TTS+ PT+ T +
Sbjct: 145 YNPRDRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGH 204
Query: 1292 TTKPSTTTSTTTDSGAWTPNPTEWVWHPPTEP 1323
T + ++T+ T P TEP
Sbjct: 205 QTATANQRLSSTEPVGTQGTTTSSNPEPQTEP 236
Score = 31.2 bits (70), Expect = 7.3
Identities = 19/77 (24%), Positives = 28/77 (36%), Gaps = 11/77 (14%)
Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
T TTTS P+P +P + P+ + + + S STT D T
Sbjct: 221 TQGTTTS-SNPEPQTEPPPSQRGPSGSPQ----HPPSTTSQDQSTTGDGQEHTQRRKT-- 273
Query: 1317 WHPPT--EPTTTHISVT 1331
PP + H + T
Sbjct: 274 --PPATSNRRSPHSTAT 288
Score = 30.8 bits (69), Expect = 8.9
Identities = 12/62 (19%), Positives = 22/62 (35%), Gaps = 3/62 (4%)
Query: 1242 KMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTST 1301
+ KT + +T T ++ +PT +PT TT+ + P +
Sbjct: 270 RRKTPPATSNRR--SPHSTATPPPTT-KRQETGRPTPRPTATTQSGSSPPHSSPPGVQAN 326
Query: 1302 TT 1303
T
Sbjct: 327 PT 328
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General function
prediction only].
Length = 872
Score = 47.2 bits (112), Expect = 1e-04
Identities = 33/112 (29%), Positives = 42/112 (37%), Gaps = 6/112 (5%)
Query: 1572 WMKKGAPSRKLVMGVHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVT 1631
W R V + P T T + G T E + +++ T ++ T
Sbjct: 742 WGFIPTTKRVKVRIMDPASGTGTSITTSGT------FTAEVPQSPTKTETTLSYSAYSNT 795
Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK 1683
+ E T IT T T T TTT P P TTS T T TT P TT
Sbjct: 796 SILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTG 847
Score = 42.9 bits (101), Expect = 0.002
Identities = 28/91 (30%), Positives = 32/91 (35%), Gaps = 17/91 (18%)
Query: 1623 TVSTSTVVTTEEPEQTEEPITTTTTSTT---------------TTTKRPKPPTTTSTTTT 1667
T T +TT E P + T T TT T+ K T T TTT+
Sbjct: 759 ASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTS 818
Query: 1668 RPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
P P T PT TTT P TT
Sbjct: 819 SPSPTQT--TSPTQTSTSTTTTTSPSQTTTG 847
Score = 42.2 bits (99), Expect = 0.004
Identities = 27/113 (23%), Positives = 35/113 (30%), Gaps = 16/113 (14%)
Query: 1611 EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITT----------------TTTSTTTTTK 1654
+ T + + TV S V P T TTS T T +
Sbjct: 716 KNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAE 775
Query: 1655 RPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
P+ PT T TT + T TT T + + TT P P S
Sbjct: 776 VPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSP 828
Score = 35.6 bits (82), Expect = 0.37
Identities = 15/55 (27%), Positives = 20/55 (36%), Gaps = 3/55 (5%)
Query: 1602 GGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTT---STTTTT 1653
TT + TT S+ + T P QT TTTT+ +TT
Sbjct: 795 TSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGG 849
Score = 34.8 bits (80), Expect = 0.76
Identities = 12/59 (20%), Positives = 23/59 (38%)
Query: 1248 WPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSG 1306
Y + T+ V T T+ T + T+ P T+ +TTT++ + +
Sbjct: 787 LSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTT 845
Score = 34.1 bits (78), Expect = 1.3
Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 3/56 (5%)
Query: 1258 SATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTS---TTTDSGAWTP 1310
TTS + K + T P+ T +P + +T +TTTS TTT G P
Sbjct: 798 LIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGGICGP 853
Score = 33.3 bits (76), Expect = 2.0
Identities = 18/81 (22%), Positives = 26/81 (32%), Gaps = 5/81 (6%)
Query: 1260 TTTSEQVPKPTKK-----PTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTE 1314
T Q P T+ T TT + T+ TTTS+ + + +P T
Sbjct: 773 TAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTS 832
Query: 1315 WVWHPPTEPTTTHISVTEKSP 1335
T P+ T P
Sbjct: 833 TSTTTTTSPSQTTTGGGICGP 853
Score = 33.3 bits (76), Expect = 2.4
Identities = 15/52 (28%), Positives = 19/52 (36%), Gaps = 1/52 (1%)
Query: 1256 QTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGA 1307
T T Q T P+ PT+ + T TT PS TT+ G
Sbjct: 803 SVVITKTVTQTQTTTSSPS-PTQTTSPTQTSTSTTTTTSPSQTTTGGGICGP 853
>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
partitioning].
Length = 264
Score = 46.0 bits (109), Expect = 1e-04
Identities = 20/79 (25%), Positives = 25/79 (31%), Gaps = 2/79 (2%)
Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
T+ EQ E+ T + TT T +PKP KP+P P P K
Sbjct: 112 TQLGEQPEQARIEEQPRTQSQKA-QSQATTVQTQPVKPKPRPE-KPQPVAPAPAPEPVEK 169
Query: 1692 PKPTTVKPKPVIPPSTKDE 1710
P P E
Sbjct: 170 APKAEAAPPPKPKAEDAAE 188
Score = 36.7 bits (85), Expect = 0.097
Identities = 17/104 (16%), Positives = 34/104 (32%), Gaps = 12/104 (11%)
Query: 1619 SKPTTVSTST---------VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP 1669
++P V + + E ++ + ++P+ + + +
Sbjct: 81 TEPAAVKDAERLTPEQRQLLEQMEVDQKAQPTQLGEQPEQARIEEQPRTQSQKAQSQATT 140
Query: 1670 KPITTIKPKPTTVKPK---PTTTVKPKPTTVKPKPVIPPSTKDE 1710
+KPKP KP+ P +P K + PP K E
Sbjct: 141 VQTQPVKPKPRPEKPQPVAPAPAPEPVEKAPKAEAAPPPKPKAE 184
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 46.7 bits (111), Expect = 1e-04
Identities = 14/75 (18%), Positives = 21/75 (28%), Gaps = 2/75 (2%)
Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKP-KPTTVKP 1699
P T+ + RP P +T I +P T P +P P
Sbjct: 366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRET-ATPPPVPPRPVAPPVPHT 424
Query: 1700 KPVIPPSTKDEFKIV 1714
P T+ +
Sbjct: 425 PESAPKLTRAAIPVD 439
Score = 42.9 bits (101), Expect = 0.002
Identities = 24/99 (24%), Positives = 33/99 (33%), Gaps = 7/99 (7%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
P + +KPT + S V T P + + P P+P+
Sbjct: 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVP-----PRPVA 418
Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
P P T + P T P KPK P K+E K
Sbjct: 419 P--PVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEK 455
Score = 35.9 bits (83), Expect = 0.31
Identities = 17/82 (20%), Positives = 25/82 (30%), Gaps = 2/82 (2%)
Query: 1249 PYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTT--STTTDSG 1306
+ + T A +T + P K + T T PP P T S +
Sbjct: 374 AAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTR 433
Query: 1307 AWTPNPTEWVWHPPTEPTTTHI 1328
A P + + PP P
Sbjct: 434 AAIPVDEKPKYTPPAPPKEEEK 455
Score = 35.6 bits (82), Expect = 0.40
Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 9/59 (15%)
Query: 1656 PKPPTTT------STTTTRPKPITTIKPKPTTVKPKPTTTVKPK---PTTVKPKPVIPP 1705
P P + + RP P + +PK P + P V P+PV PP
Sbjct: 362 PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPP 420
Score = 32.5 bits (74), Expect = 3.4
Identities = 14/77 (18%), Positives = 22/77 (28%), Gaps = 7/77 (9%)
Query: 1257 TSATTTSEQVPKPT-KKPTK-PTKKPTTTTEYNPPEATTKPSTTT-STTTDSGAWTPNPT 1313
+ P+ +PT P+ +P N P T T P P
Sbjct: 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPH 423
Query: 1314 EWVWHPPTEPTTTHISV 1330
P + P T ++
Sbjct: 424 T----PESAPKLTRAAI 436
Score = 32.1 bits (73), Expect = 4.0
Identities = 15/87 (17%), Positives = 23/87 (26%), Gaps = 4/87 (4%)
Query: 1601 PGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPT 1660
P + +PT S + + + +EP E P P
Sbjct: 368 PAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEP-VRETATPPPVPPRPVAPPVPHTPE 426
Query: 1661 TTSTTTTRPKPITTIKPKPTTVKPKPT 1687
+ T P+ KP P P
Sbjct: 427 SAPKLTRAAIPVDE---KPKYTPPAPP 450
>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp).
Length = 305
Score = 45.5 bits (108), Expect = 2e-04
Identities = 26/105 (24%), Positives = 33/105 (31%), Gaps = 8/105 (7%)
Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT--S 1663
T Y + P ++ V T T+ P TT ++TT T T
Sbjct: 4 TELTFSYNLSDTTLFPNA-TSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTLHD 62
Query: 1664 TTTTRPKPITTIKPKPTTV---KPKPTT--TVKPKPTTVKPKPVI 1703
T T T P PTT T P PT V P +
Sbjct: 63 VTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPTPVPSSPAV 107
Score = 40.5 bits (95), Expect = 0.009
Identities = 24/103 (23%), Positives = 31/103 (30%), Gaps = 18/103 (17%)
Query: 1618 ESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT-----STTTTRPKPI 1672
T S + TT P T + + T T+ST T K PT T S+TT +
Sbjct: 2 SVTELTFSYNLSDTTLFPNATSKGVKTVTSSTDT-----KAPTNTTYRCVSSTTVPMTNV 56
Query: 1673 TTIK--------PKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
T T T P+ P T
Sbjct: 57 TVTLHDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPT 99
>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24). This
family consists of several MGC-24 (or Cd164 antigen)
proteins from eukaryotic organisms. MGC-24/CD164 is a
sialomucin expressed in many normal and cancerous
tissues. In humans, soluble and transmembrane forms of
MGC-24 are produced by alternative splicing.
Length = 187
Score = 43.5 bits (102), Expect = 4e-04
Identities = 23/87 (26%), Positives = 33/87 (37%), Gaps = 8/87 (9%)
Query: 1608 TTEEYKPTSEESKPTTVSTSTVV--TTEEPEQTEEPITTTTT-----STTTTTKRPKPPT 1660
+T ++ E S + VV + T+ TTT ST T P P T
Sbjct: 73 STCVWQQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTT 132
Query: 1661 TTSTTTTRPKPITTIKPKPTTVKPKPT 1687
+ T+ PK TT+ P + K T
Sbjct: 133 SHHHVTSEPKTNTTVTPTSQPDR-KST 158
Score = 33.4 bits (76), Expect = 0.78
Identities = 26/108 (24%), Positives = 33/108 (30%), Gaps = 23/108 (21%)
Query: 1217 CNKYQYCIWGSYQVASCSPGLYWNDKMKTCDWPYRTKCK----------QTSATTTSEQV 1266
N C+ ++ ++C W + C C Q TT S V
Sbjct: 59 LNSCVSCVNRTHNNSTC----VW----QQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSV 110
Query: 1267 PKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTE 1314
T PT T KPT T P TT TS + TP
Sbjct: 111 ATTTPVPTNSTAKPTIT-----PSPTTSHHHVTSEPKTNTTVTPTSQP 153
Score = 33.4 bits (76), Expect = 0.91
Identities = 28/101 (27%), Positives = 37/101 (36%), Gaps = 22/101 (21%)
Query: 1623 TVSTSTVV----TTEEPEQ--TEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK 1676
T + ST V EEP ++ + + TT TTT T +T K
Sbjct: 69 THNNSTCVWQQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTN-----STAK 123
Query: 1677 PKPTTVKPKPTTTVKPKPTTVKPK------PVIPPSTKDEF 1711
P T+ P PTT+ T +PK P P K F
Sbjct: 124 P---TITPSPTTS--HHHVTSEPKTNTTVTPTSQPDRKSTF 159
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 44.1 bits (103), Expect = 4e-04
Identities = 27/94 (28%), Positives = 38/94 (40%), Gaps = 3/94 (3%)
Query: 1603 GNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT 1662
GN TT P+ S P+T ++T+ TT P T ++T TT+ T+T P TT
Sbjct: 35 GNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTT 94
Query: 1663 STTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
S +T T K + T T
Sbjct: 95 SNAST---INVTTKVTAQNITATEAGTGTSTGVT 125
Score = 41.4 bits (96), Expect = 0.003
Identities = 29/110 (26%), Positives = 47/110 (42%), Gaps = 2/110 (1%)
Query: 1590 LSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTV-VTTEEPEQTEEPITT-TTT 1647
L+T + + + +TT T+ PTT + ST+ VTT+ Q T T
Sbjct: 61 LTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGT 120
Query: 1648 STTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTV 1697
ST T+ ++T++ TTR TT+ P ++ T + TV
Sbjct: 121 STGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTV 170
Score = 41.4 bits (96), Expect = 0.003
Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 7/82 (8%)
Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
S + V+ +T VTT P + +TT TTT+ P TTT+ +T TT
Sbjct: 28 GSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSA----PITTTAILSTN---TTT 80
Query: 1675 IKPKPTTVKPKPTTTVKPKPTT 1696
+ TTV P PTT+
Sbjct: 81 VTSTGTTVTPVPTTSNASTINV 102
Score = 34.9 bits (79), Expect = 0.34
Identities = 27/78 (34%), Positives = 35/78 (44%), Gaps = 7/78 (8%)
Query: 1626 TSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTT--TSTTTTRPKPITTIKPKPTTVK 1683
TS + T+ T TT+ TT + P+T ++T TT PITT T +
Sbjct: 20 TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITT-----TAIL 74
Query: 1684 PKPTTTVKPKPTTVKPKP 1701
TTTV TTV P P
Sbjct: 75 STNTTTVTSTGTTVTPVP 92
Score = 34.9 bits (79), Expect = 0.36
Identities = 25/115 (21%), Positives = 39/115 (33%), Gaps = 4/115 (3%)
Query: 1591 STITEVLGHGPGGNYESTTEEYKPTSEESKPTTV----STSTVVTTEEPEQTEEPITTTT 1646
+ T V P + ST + T+ + TT + +T VT+ T P T+
Sbjct: 38 TGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNA 97
Query: 1647 TSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
++ TTK T + T T + TT TT+ P
Sbjct: 98 STINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTL 152
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 43.5 bits (103), Expect = 5e-04
Identities = 19/76 (25%), Positives = 26/76 (34%), Gaps = 4/76 (5%)
Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
E + + +T P P +PKP+ KPKP + K
Sbjct: 72 AEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKP-KPQQKVEAPPA 130
Query: 1692 PKPTTVKPKPVIPPST 1707
PKP PKPV+
Sbjct: 131 PKPE---PKPVVEEKA 143
Score = 42.3 bits (100), Expect = 0.001
Identities = 21/78 (26%), Positives = 28/78 (35%), Gaps = 5/78 (6%)
Query: 1635 PE-QTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPK 1693
PE E + + PP T P P+ KPKP KPKP + K
Sbjct: 68 PEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPE--PAPVEPPKPKPVE-KPKPKPKPQQK 124
Query: 1694 PT-TVKPKPVIPPSTKDE 1710
PKP P +++
Sbjct: 125 VEAPPAPKPEPKPVVEEK 142
Score = 30.4 bits (69), Expect = 9.5
Identities = 14/40 (35%), Positives = 19/40 (47%), Gaps = 5/40 (12%)
Query: 1673 TTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
T+ P T V+P+P PKP KPV P K + +
Sbjct: 88 ATVAPPNTPVEPEPAPVEPPKP-----KPVEKPKPKPKPQ 122
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 43.6 bits (102), Expect = 9e-04
Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 9/108 (8%)
Query: 1607 STTEEYKPTSEESKPTT---VSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
+T +E P E++ T S S + +P+Q E + S T K+ + TTT
Sbjct: 67 NTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEE--QKSLNISATPAPKQEQSQTTTE 124
Query: 1664 TTTTRPKPITTIKPKPTTVKP-KPTTTVKPK-PTTVKPKPVIPPSTKD 1709
+TT PK T P T +P + T + P+ PT + + + P +D
Sbjct: 125 STT--PKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMTPKYED 170
Score = 36.6 bits (84), Expect = 0.17
Identities = 18/97 (18%), Positives = 28/97 (28%), Gaps = 19/97 (19%)
Query: 1624 VSTSTVVTTEEPEQTEEPIT----------TTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
++ T + T + T TS + K +P +
Sbjct: 54 ITAGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISAT-- 111
Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
P + + TT + TT K K PPST
Sbjct: 112 -----PAPKQEQSQTT--TESTTPKTKVTTPPSTNTP 141
Score = 35.5 bits (81), Expect = 0.33
Identities = 30/136 (22%), Positives = 55/136 (40%), Gaps = 11/136 (8%)
Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPST-TTSTTTDSGAWTPNPTEW 1315
S Q + +K + P E + TT+ +T T TT TP P +
Sbjct: 89 ASKIEKISQPKQEEQKSLNISATPAPKQEQ--SQTTTESTTPKTKVTTPPSTNTPQPMQS 146
Query: 1316 VWHP-PTEPTTTHISVTEKSPLDQYFKIVCYFTNWAWYRPGKGKYVLKK---VPWINILK 1371
P PT + T+ +P +Y + Y+T ++ + ++LK V ++N++
Sbjct: 147 TKSDTPQSPTIKQ-AQTDMTP--KYEDLRAYYTKPSFEFEKQFGFLLKPWTTVRFMNVIP 203
Query: 1372 LYVTLQTGL-GTDQEK 1386
+ L G D++K
Sbjct: 204 NRFIYKIALVGKDEKK 219
Score = 33.6 bits (76), Expect = 1.5
Identities = 27/97 (27%), Positives = 38/97 (39%), Gaps = 7/97 (7%)
Query: 1594 TEVLGHGPGGNYEST-----TEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTS 1648
T L P N E T + +P EE K +S + E+ + T E T T
Sbjct: 73 TPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKV 132
Query: 1649 TTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPK 1685
TT + P ++ + T P TIK T + PK
Sbjct: 133 TTPPSTNTPQPMQSTKSDTPQSP--TIKQAQTDMTPK 167
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium (Slime
molds). REP protein is likely involved in transcription
regulation and control of DNA replication, specifically
amplification of plasmid at low copy numbers. The
formation of homomultimers may be required for their
regulatory activity.
Length = 910
Score = 44.0 bits (104), Expect = 0.001
Identities = 26/120 (21%), Positives = 40/120 (33%), Gaps = 13/120 (10%)
Query: 1589 LLSTITEVLGHGPGGNYESTTEEYK--PTSE-ESKPTTVSTSTVVTT----EEPEQTEEP 1641
+LS + + E +E Y P E + + ++ + + ++ P
Sbjct: 193 MLSLFHPIKVDSQNRDEEILSEIYSLYPFVNLEVNESDIEQISINSENIQRINSQPSKRP 252
Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
TTTT+TTTTT TT T + K I K K KP
Sbjct: 253 NNTTTTTTTTTT------TTFQPRTRKRKSIDDHKLSLNQAPEKFKNNTKPDDDPQSDFS 306
Score = 36.3 bits (84), Expect = 0.22
Identities = 28/116 (24%), Positives = 38/116 (32%), Gaps = 10/116 (8%)
Query: 1630 VTTEEPEQTEEPITTTTTSTTTTTKRP-KPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
V E E E I+ + + +P K P T+TTTT TT +P T K K
Sbjct: 222 VNLEVNESDIEQISINSENIQRINSQPSKRPNNTTTTTTT---TTTTTFQPRTRKRKSID 278
Query: 1689 TVKPKPTTVK-PKPVIPPSTKDEFKIVCYFTNWAWYRQSGG-KYLPSDIDSDLCTH 1742
K D + R+SG K + D S +H
Sbjct: 279 DHKLSLNQAPEKFKNNTKPDDDPQSDFSDKGS----RKSGSLKDVRIDNISCSVSH 330
Score = 32.1 bits (73), Expect = 5.4
Identities = 22/93 (23%), Positives = 33/93 (35%), Gaps = 14/93 (15%)
Query: 1255 KQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTT------STTTDSGAW 1308
T+ TTT+ P+ K+ + K + + TKP + SG+
Sbjct: 257 TTTTTTTTTTFQPRTRKRKSIDDHKLSLNQAPEKFKNNTKPDDDPQSDFSDKGSRKSGSL 316
Query: 1309 TP----NPTEWVWHPPT----EPTTTHISVTEK 1333
N + V H PT T ISV+ K
Sbjct: 317 KDVRIDNISCSVSHNGVNISISPTETIISVSNK 349
>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
Length = 328
Score = 42.6 bits (100), Expect = 0.002
Identities = 23/98 (23%), Positives = 35/98 (35%), Gaps = 5/98 (5%)
Query: 1602 GGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTT 1661
GN T + + + V+ ++P+ T + T T KR +P
Sbjct: 135 NGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQATAK---TEPKPVAQTPKRTEPAAP 191
Query: 1662 TSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKP 1699
++T P +T PK T P T P TT P
Sbjct: 192 VASTKA-PAATSTPAPKETA-TTAPVQTASPAQTTATP 227
Score = 37.6 bits (87), Expect = 0.062
Identities = 19/68 (27%), Positives = 26/68 (38%), Gaps = 12/68 (17%)
Query: 1256 QTSATTTSEQVPKPTKK-----PTKPTKKPTTTTEYNP-------PEATTKPSTTTSTTT 1303
Q +A T + V + K+ P TK P T+ P P T P+ TT+T
Sbjct: 169 QATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPA 228
Query: 1304 DSGAWTPN 1311
G N
Sbjct: 229 AGGKTAGN 236
Score = 36.5 bits (84), Expect = 0.14
Identities = 22/76 (28%), Positives = 29/76 (38%), Gaps = 6/76 (7%)
Query: 1637 QTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK-----PKPTTTVK 1691
QT E TT + PK P T+ T +P T + +P P T+T
Sbjct: 146 QTAERPATTRPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPA 205
Query: 1692 PKPT-TVKPKPVIPPS 1706
PK T T P P+
Sbjct: 206 PKETATTAPVQTASPA 221
Score = 34.1 bits (78), Expect = 0.88
Identities = 28/97 (28%), Positives = 39/97 (40%), Gaps = 12/97 (12%)
Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP-KP 1671
+P +++ ST+ T EP T P+ S T K T TTRP +
Sbjct: 107 QPQNQQQLNNVAVNSTLPT--EP-ATVAPVRNGNASRQT----AKTQTAERPATTRPARK 159
Query: 1672 ITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
I+PK KP+ T +PKP PK P +
Sbjct: 160 QAVIEPK----KPQATAKTEPKPVAQTPKRTEPAAPV 192
Score = 33.4 bits (76), Expect = 1.6
Identities = 20/95 (21%), Positives = 28/95 (29%), Gaps = 28/95 (29%)
Query: 1255 KQTSATTTSEQVP--------------KPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTS 1300
+QT+ T T+E+ KP K T + P A + +
Sbjct: 140 RQTAKTQTAERPATTRPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPA 199
Query: 1301 TTTDSGAWTPNPTEWVWHPPTEPTTTHISVTEKSP 1335
T+ TP P T T V SP
Sbjct: 200 ATS-----TPAPK---------ETATTAPVQTASP 220
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is a
family of proteins conserved in yeasts. The function is
not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 43.0 bits (101), Expect = 0.002
Identities = 21/97 (21%), Positives = 32/97 (32%), Gaps = 12/97 (12%)
Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPI 1672
+ E+ S + T E E ++ + RPK T+ R P
Sbjct: 482 RDVWED-----APESQELVTTEETPEEVKSSSPGVTKPAIPSRPKKGK--PTSEKRKPPP 534
Query: 1673 TTIKPKPTT----VKPKPTTTVK-PKPTTVKPKPVIP 1704
KPKP K + + + KPKP +P
Sbjct: 535 VPKKPKPQIPARPAKLQKQQAGEEANSSAFKPKPRVP 571
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 41.6 bits (98), Expect = 0.003
Identities = 18/76 (23%), Positives = 26/76 (34%), Gaps = 3/76 (3%)
Query: 1636 EQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKP- 1694
EQT + +T ++ +PP TT+ P T P P P T P
Sbjct: 109 EQTPQVPRSTVQIQQQA-QQQQPPATTAQPQPVTPPRQTTAPVQPQ-TPAPVRTQPAAPV 166
Query: 1695 TTVKPKPVIPPSTKDE 1710
T P + + E
Sbjct: 167 TQAVEAPKVEAEKEKE 182
Score = 35.8 bits (83), Expect = 0.23
Identities = 16/72 (22%), Positives = 22/72 (30%)
Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
T + + T + P T +P T TT +P+ P T P
Sbjct: 111 TPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAV 170
Query: 1675 IKPKPTTVKPKP 1686
PK K K
Sbjct: 171 EAPKVEAEKEKE 182
Score = 33.9 bits (78), Expect = 0.82
Identities = 16/72 (22%), Positives = 19/72 (26%), Gaps = 1/72 (1%)
Query: 1631 TTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKP-KPTTVKPKPTTT 1689
T + P T + T P T TT P T P + P
Sbjct: 111 TPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAV 170
Query: 1690 VKPKPTTVKPKP 1701
PK K K
Sbjct: 171 EAPKVEAEKEKE 182
Score = 31.9 bits (73), Expect = 3.0
Identities = 20/79 (25%), Positives = 29/79 (36%), Gaps = 6/79 (7%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTT----TTKRPKPPTTTSTTTTRP 1669
P +E++ V STV ++ +Q + P TT T TT +P T T
Sbjct: 106 PYNEQTPQ--VPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPA 163
Query: 1670 KPITTIKPKPTTVKPKPTT 1688
P+T P K
Sbjct: 164 APVTQAVEAPKVEAEKEKE 182
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 41.9 bits (98), Expect = 0.003
Identities = 21/93 (22%), Positives = 28/93 (30%), Gaps = 4/93 (4%)
Query: 1620 KPTTVSTSTVVTTEE--PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKP 1677
T T+ TTEE P + + T T T K K ++K
Sbjct: 1 MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTS-TKAAKKAAKVKKT-KSVKT 58
Query: 1678 KPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
V K T K +V K V + E
Sbjct: 59 TTKKVTVKFEKTESVKKESVAKKTVKKEAVSAE 91
Score = 34.2 bits (78), Expect = 0.89
Identities = 15/56 (26%), Positives = 21/56 (37%), Gaps = 8/56 (14%)
Query: 1255 KQTSATTTSEQVPKPTKKPTK-----PTKKPTTTTE---YNPPEATTKPSTTTSTT 1302
K+T+ T+E+ KK T TKK TT K + +TT
Sbjct: 5 KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 40.6 bits (95), Expect = 0.005
Identities = 19/103 (18%), Positives = 27/103 (26%), Gaps = 4/103 (3%)
Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
++T+ T + T + E P P + P
Sbjct: 98 KATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPAKPR 157
Query: 1666 TTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
P+P PKPT K K T PKP +
Sbjct: 158 AKSPRPAAKAAPKPTETITAK----KAKKTAAAPKPTADKTAT 196
Score = 35.2 bits (81), Expect = 0.27
Identities = 29/108 (26%), Positives = 31/108 (28%), Gaps = 20/108 (18%)
Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTT---------TTTKRPKPPTTTS 1663
K TS T + V T EP E PK P
Sbjct: 98 KATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPAKPR 157
Query: 1664 TTTTRP------KP---ITTIKPKPTTVKPKPTTTVKPKPT--TVKPK 1700
+ RP KP IT K K T PKPT P T K K
Sbjct: 158 AKSPRPAAKAAPKPTETITAKKAKKTAAAPKPTADKTATPAKKTTKKK 205
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein. Members
of this family are the S-layer-forming halobacterial
major cell surface glycoprotein. The highest scores below
model cutoffs are fragmentary paralogs to actual members
of the family. Modifications include at N-linked and
O-linked glycosylation, a C-terminal diphytanylglyceryl
modification, and probable cleavage of the PGF-CTERM
tail.
Length = 782
Score = 41.4 bits (97), Expect = 0.007
Identities = 22/65 (33%), Positives = 27/65 (41%), Gaps = 2/65 (3%)
Query: 1602 GGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTT 1661
G N + E +E TT T TT EE T T TTTT+ P TT
Sbjct: 697 GDNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTTGPEET--TETAEPTTTTEEPTEETT 754
Query: 1662 TSTTT 1666
T ++T
Sbjct: 755 TGSST 759
Score = 39.1 bits (91), Expect = 0.038
Identities = 20/55 (36%), Positives = 26/55 (47%), Gaps = 7/55 (12%)
Query: 1623 TVSTSTVVTTEEPEQT--EEPITTTTTSTTTTTKRPKPPTTTST-TTTRPKPITT 1674
V+ V TE P+ T E+P TTTT +TT T T+ TTT +P
Sbjct: 702 RVNVEVVEETERPDTTTSEDPTTTTTPTTTGPE----ETTETAEPTTTTEEPTEE 752
Score = 37.2 bits (86), Expect = 0.14
Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 7/53 (13%)
Query: 1616 SEESKPTTVSTSTVVTTEEPEQTEEPITT-TTTSTTTTTKRPKPPTTTSTTTT 1667
E +P T T+E+P T P TT +T T T TTT
Sbjct: 709 EETERPDTT------TSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTT 755
Score = 37.2 bits (86), Expect = 0.15
Identities = 21/60 (35%), Positives = 25/60 (41%), Gaps = 8/60 (13%)
Query: 1630 VTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
V E E+TE P TTT+ TTT T T+T P TT +P TT
Sbjct: 703 VNVEVVEETERPDTTTSEDPTTT------TTPTTTGPEETTETAE--PTTTTEEPTEETT 754
Score = 36.4 bits (84), Expect = 0.20
Identities = 20/47 (42%), Positives = 22/47 (46%), Gaps = 3/47 (6%)
Query: 1260 TTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSG 1306
TTTSE PT T T P TTE P TT+ T +TT S
Sbjct: 716 TTTSED---PTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSST 759
Score = 32.9 bits (75), Expect = 2.7
Identities = 22/73 (30%), Positives = 32/73 (43%), Gaps = 3/73 (4%)
Query: 1624 VSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKP-PTTTSTTTTRPKPITTIKPKPTTV 1682
V S V T + ++ T + + +P TT+ TT P TT P+ TT
Sbjct: 680 VDLSDVETGNYTVEADDGDNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTTG-PEETTE 738
Query: 1683 KPKPTTTVKPKPT 1695
+PTTT +PT
Sbjct: 739 TAEPTTTT-EEPT 750
Score = 32.5 bits (74), Expect = 3.4
Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 2/53 (3%)
Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTT-STTTTTKRPKP 1658
TE T+ E PTT +T T EE +T EP TTT + TTT P
Sbjct: 709 EETERPDTTTSE-DPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSSTP 760
Score = 31.4 bits (71), Expect = 8.0
Identities = 18/49 (36%), Positives = 23/49 (46%), Gaps = 4/49 (8%)
Query: 1277 TKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWVWHPPTEPTT 1325
T++P TTT +P TT +T TT+ T PT P E TT
Sbjct: 711 TERPDTTTSEDPTTTTTPTTTGPEETTE----TAEPTTTTEEPTEETTT 755
Score = 31.4 bits (71), Expect = 8.2
Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 5/52 (9%)
Query: 1264 EQVPKP-TKKPTKPTKKPTTTTEY-NPPEATTKPSTTT---STTTDSGAWTP 1310
E+ +P T PT T TT T +P+TTT + T +G+ TP
Sbjct: 709 EETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSSTP 760
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid protein
may act as a link between the external capsid and the
internal DNA-protein core. The C-terminal 11 residues may
function as a protease cofactor leading to enzyme
activation.
Length = 238
Score = 40.2 bits (94), Expect = 0.008
Identities = 17/89 (19%), Positives = 26/89 (29%), Gaps = 2/89 (2%)
Query: 1621 PTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPT 1680
T +P Q +P P+PP+ T P P+
Sbjct: 111 GEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMA 170
Query: 1681 TVKPKP--TTTVKPKPTTVKPKPVIPPST 1707
P T++ P P PV+P +
Sbjct: 171 IAVPAIDTPVTLELPPAPQPPPPVVPQPS 199
>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
This family includes attachment proteins from respiratory
synctial virus. Glycoprotein G has not been shown to have
any neuraminidase or hemagglutinin activity. The amino
terminus is thought to be cytoplasmic, and the carboxyl
terminus extracellular. The extracellular region contains
four completely conserved cysteine residues.
Length = 263
Score = 40.1 bits (93), Expect = 0.009
Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 1275 KPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWVWHPPTEPTT-THISVTEK 1333
KP K PTT P TTK T T TT A T +PT PT+ TT T+ + +
Sbjct: 184 KPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTS----EPTKLTTKTNTTTPQF 239
Query: 1334 SPLD 1337
+PL
Sbjct: 240 TPLS 243
Score = 36.2 bits (83), Expect = 0.17
Identities = 27/98 (27%), Positives = 35/98 (35%), Gaps = 9/98 (9%)
Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTT---RPKP 1671
+S K T +T T T + + T +T+ TTT T P
Sbjct: 54 SSANHKVTPTTTPTQQITNQIQNHTSTYLTQHNQLSTSP--SNQSTTTPLIHTILDDTTP 111
Query: 1672 ITTIKPKPTTVKPKPTTTVKP---KPTTVKPKPVIPPS 1706
T + TTV K TT KP T KP+ PP
Sbjct: 112 GTKSTYQHTTVGTKGRTTTPAQTNKPPT-KPRQSNPPE 148
Score = 35.0 bits (80), Expect = 0.34
Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 1/56 (1%)
Query: 1249 PYRTKCKQTSATTTSEQVPKPTKKPTKP-TKKPTTTTEYNPPEATTKPSTTTSTTT 1303
P + K + T + PK TKK TK T K T ++P TK +T T+TTT
Sbjct: 181 PEKKPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTT 236
Score = 34.3 bits (78), Expect = 0.60
Identities = 22/49 (44%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 1649 TTTTTKRPKPPTT---TSTTTTRPKPITTIKP--KPTTVKPKPTTTVKP 1692
TT TK+PKP TT T T TT+ K TT P +PT + K TT
Sbjct: 190 TTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTTPQ 238
Score = 33.1 bits (75), Expect = 1.4
Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 1650 TTTTKRPKPPTTTSTTTTRPKPITTIKP-KPTTVKPKPTTTVKP--KPTTVKPK 1700
K+P TT T +PKP TT K K T K K TT P +PT + K
Sbjct: 179 RIPEKKPGKAPTT-KPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTK 231
Score = 32.7 bits (74), Expect = 1.9
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 6/69 (8%)
Query: 1618 ESKPTTVSTSTVVTTEEPEQTEEPITTTTTST--TTTTKRPKPPT--TTSTTTTRPK--P 1671
E KP T+ +P+ T++ T TT + TT PT TT T TT P+ P
Sbjct: 182 EKKPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTTPQFTP 241
Query: 1672 ITTIKPKPT 1680
++T +
Sbjct: 242 LSTTTTRNP 250
Score = 31.2 bits (70), Expect = 6.6
Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 5/66 (7%)
Query: 1242 KMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTK-----PTKKPTTTTEYNPPEATTKPS 1296
K KT +T+ ++ TT +PTK TK P P +TT PE T++
Sbjct: 198 KPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTTPQFTPLSTTTTRNPELTSQME 257
Query: 1297 TTTSTT 1302
T ST
Sbjct: 258 TFHSTN 263
Score = 30.8 bits (69), Expect = 7.2
Identities = 19/68 (27%), Positives = 23/68 (33%), Gaps = 2/68 (2%)
Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKP-KPTTTVKPKPTTVKPK 1700
I TTK K P TT K TT + TT P T + K T P+
Sbjct: 180 IPEKKPGKAPTTKPTKKP-KPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTTPQ 238
Query: 1701 PVIPPSTK 1708
+T
Sbjct: 239 FTPLSTTT 246
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 41.0 bits (96), Expect = 0.009
Identities = 25/134 (18%), Positives = 38/134 (28%), Gaps = 26/134 (19%)
Query: 1569 MNYWMKKGAPSRKLVMGVHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTST 1628
M ++ K A + + + LL+ L E T+ + T +T
Sbjct: 2 MKHYFSKSA----VALTLA-LLTASNPKLAQ----------AEEIVTTTPATSTEAEQTT 46
Query: 1629 VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
E E T T + TT + P T T P + T+
Sbjct: 47 --PVESDATEEADNTETPVAATTAAEAPSSSETAET----SDPTSEATDTTTSEARTV-- 98
Query: 1689 TVKPKPTTVKPKPV 1702
T KPV
Sbjct: 99 ---TPAATETSKPV 109
Score = 39.8 bits (93), Expect = 0.019
Identities = 18/90 (20%), Positives = 32/90 (35%), Gaps = 7/90 (7%)
Query: 1625 STSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK- 1683
+++ + E T P T+T TT + T + T P TT P++ +
Sbjct: 20 ASNPKLAQAEEIVTTTPATSTEAEQTTPVE--SDATEEADNTETPVAATTAAEAPSSSET 77
Query: 1684 ---PKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
PT+ T+ P +T+
Sbjct: 78 AETSDPTSEATDTTTSEARTVT-PAATETS 106
Score = 32.9 bits (75), Expect = 2.8
Identities = 16/76 (21%), Positives = 23/76 (30%), Gaps = 2/76 (2%)
Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEA--TTKPSTTTSTTTDSGAWT 1309
T + T + P T + E T+ P++ + TT S A T
Sbjct: 38 TSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEART 97
Query: 1310 PNPTEWVWHPPTEPTT 1325
P P E T
Sbjct: 98 VTPAATETSKPVEGQT 113
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 40.6 bits (96), Expect = 0.013
Identities = 9/80 (11%), Positives = 11/80 (13%), Gaps = 4/80 (5%)
Query: 1631 TTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT----STTTTRPKPITTIKPKPTTVKPKP 1686
P T P + + P PKP
Sbjct: 34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93
Query: 1687 TTTVKPKPTTVKPKPVIPPS 1706
P P
Sbjct: 94 AAPAAPPAAAAAAAPAAAAV 113
Score = 36.0 bits (84), Expect = 0.31
Identities = 11/95 (11%), Positives = 13/95 (13%), Gaps = 9/95 (9%)
Query: 1610 EEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP 1669
+Y P T T+ P P
Sbjct: 34 ADYGPG-----STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAP--APPAAAAPAAP--P 84
Query: 1670 KPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIP 1704
KP P V P
Sbjct: 85 KPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTP 119
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 40.3 bits (94), Expect = 0.016
Identities = 15/93 (16%), Positives = 29/93 (31%), Gaps = 1/93 (1%)
Query: 1616 SEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTI 1675
++ + + P P T+ + P PP+ + P
Sbjct: 2805 ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRR 2864
Query: 1676 KPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
+P + KP +P P +P + ST+
Sbjct: 2865 RPPSRSPAAKPAAPARP-PVRRLARPAVSRSTE 2896
Score = 40.3 bits (94), Expect = 0.019
Identities = 20/98 (20%), Positives = 28/98 (28%), Gaps = 5/98 (5%)
Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKP-----PTTTSTTTT 1667
+P ++PT S +++ P T EP S T P P +
Sbjct: 2682 RPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP 2741
Query: 1668 RPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPP 1705
P P +P TT P P P
Sbjct: 2742 PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
Score = 36.8 bits (85), Expect = 0.21
Identities = 17/99 (17%), Positives = 26/99 (26%), Gaps = 5/99 (5%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRP--KPPTTTSTTTTRPKP 1671
P PT+ + P P+ + +RP + P RP
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883
Query: 1672 ITTIKP---KPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
+P + T P + P P P P
Sbjct: 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922
>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 46 to 94 amino acids in length. This domain is
found associated with pfam01462, pfam00560.
Length = 122
Score = 37.2 bits (86), Expect = 0.019
Identities = 18/28 (64%), Positives = 20/28 (71%), Gaps = 3/28 (10%)
Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTR 1668
P TTTTT+TTTTT P TTT TTT+
Sbjct: 46 PTTTTTTTTTTTT---MPTTTTLPTTTK 70
Score = 34.2 bits (78), Expect = 0.21
Identities = 23/47 (48%), Positives = 28/47 (59%), Gaps = 11/47 (23%)
Query: 1659 PTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPP 1705
PTTT+TTTT TT P TT+ PTTT K + VK P++PP
Sbjct: 46 PTTTTTTTT----TTTTMPTTTTL---PTTT---KMSMVK-VPLVPP 81
Score = 32.2 bits (73), Expect = 0.89
Identities = 19/34 (55%), Positives = 20/34 (58%), Gaps = 10/34 (29%)
Query: 1643 TTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK 1676
TTTTT+TTTTT P TTTT P TT K
Sbjct: 47 TTTTTTTTTTTTMP-------TTTTLP---TTTK 70
Score = 30.3 bits (68), Expect = 4.5
Identities = 19/47 (40%), Positives = 21/47 (44%), Gaps = 7/47 (14%)
Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP 1659
KP PTT +T+T TT P TTTT TTT K P
Sbjct: 38 KPVRSIICPTTTTTTTTTTTTMP-------TTTTLPTTTKMSMVKVP 77
Score = 30.3 bits (68), Expect = 5.4
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTT 1667
I TT+TTTTT PTTT+ TT
Sbjct: 43 IICPTTTTTTTTTTTTMPTTTTLPTT 68
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 39.3 bits (92), Expect = 0.021
Identities = 18/63 (28%), Positives = 23/63 (36%), Gaps = 2/63 (3%)
Query: 1645 TTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIP 1704
T +T +K K P T T T P+ T KP T V P + KPK
Sbjct: 176 TAPATVAPSKGAKVPATAETHPTPPQKPATKKP-AVNHHKTATVAVPPATSG-KPKSGAA 233
Query: 1705 PST 1707
+
Sbjct: 234 SAR 236
Score = 34.3 bits (79), Expect = 0.86
Identities = 15/69 (21%), Positives = 18/69 (26%), Gaps = 8/69 (11%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTT--TTKRPKPPTTTSTTTTRPKP 1671
PT P TV+ S + P E T T K T T KP
Sbjct: 175 PT----APATVAPSK--GAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKP 228
Query: 1672 ITTIKPKPT 1680
+
Sbjct: 229 KSGAASARA 237
Score = 33.5 bits (77), Expect = 1.2
Identities = 11/68 (16%), Positives = 16/68 (23%), Gaps = 1/68 (1%)
Query: 1640 EPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTT-VKPKPTTTVKPKPTTVK 1698
P T + + T KP T V P + K + +
Sbjct: 177 APATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASAR 236
Query: 1699 PKPVIPPS 1706
P S
Sbjct: 237 ALSSAPAS 244
Score = 32.0 bits (73), Expect = 4.5
Identities = 13/70 (18%), Positives = 18/70 (25%), Gaps = 3/70 (4%)
Query: 1626 TSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPK 1685
TST+ T + T+ T T KP T + P K
Sbjct: 171 TSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPAT---SGK 227
Query: 1686 PTTTVKPKPT 1695
P +
Sbjct: 228 PKSGAASARA 237
Score = 31.6 bits (72), Expect = 5.2
Identities = 15/57 (26%), Positives = 16/57 (28%), Gaps = 3/57 (5%)
Query: 1251 RTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGA 1307
T A + PT TKKP AT TS SGA
Sbjct: 179 ATVAPSKGAKVPATAETHPTPPQKPATKKPAVNH---HKTATVAVPPATSGKPKSGA 232
Score = 31.2 bits (71), Expect = 7.2
Identities = 12/47 (25%), Positives = 15/47 (31%)
Query: 1662 TSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
TST T P + K + T KP T KP +
Sbjct: 171 TSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTAT 217
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 38.8 bits (90), Expect = 0.024
Identities = 16/80 (20%), Positives = 21/80 (26%), Gaps = 8/80 (10%)
Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTRP-------KPITTIKP-KPTTVKPKPTTTVKP 1692
T +T + PT TRP +P K K T T P
Sbjct: 126 STLPTEPATVAPVRNASVPTAERPAITRPVRAQAVSEPAVEPKAAKTATATEAKVQTASP 185
Query: 1693 KPTTVKPKPVIPPSTKDEFK 1712
T P + + K
Sbjct: 186 AQTPATPPAGKGAAASGQLK 205
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 39.6 bits (92), Expect = 0.027
Identities = 24/108 (22%), Positives = 38/108 (35%), Gaps = 19/108 (17%)
Query: 1618 ESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTK---------RPKPPTTTSTTTTR 1668
++ TT + + T + + I+ T S+ TK P T+
Sbjct: 303 KNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIA 362
Query: 1669 PKPITTIKPKPTTVKPKPTT-TVKPKPTT-------VKPKPVIP--PS 1706
++ P+T P T V+ TT VKP P +P PS
Sbjct: 363 SATFRGLEKNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPS 410
Score = 35.4 bits (81), Expect = 0.56
Identities = 27/118 (22%), Positives = 38/118 (32%), Gaps = 25/118 (21%)
Query: 1615 TSEESKPTTVSTSTVVTTEEPE------QTEEPITTTTTSTTTTT--------KRPKPPT 1660
TSE T+S T + E + + P++ T+ K P
Sbjct: 320 TSEGQ--VTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATFRGLEKNPSTAP 377
Query: 1661 TTSTT-TTRPKPIT------TIKPKPTT-VKPKPTTTVKPKPTTVKPKP-VIPPSTKD 1709
+T T R T +KP P P P+ T P P P +PP D
Sbjct: 378 STPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPD 435
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 38.1 bits (89), Expect = 0.038
Identities = 24/84 (28%), Positives = 30/84 (35%), Gaps = 27/84 (32%)
Query: 1634 EPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPK 1693
EPE EPI P+PP + P+ KP+P KPKP KPK
Sbjct: 73 EPEPEPEPI-------------PEPP--------KEAPVVIPKPEP---KPKPKPKPKPK 108
Query: 1694 P---TTVKPKPVIPPSTKDEFKIV 1714
P +PK + P
Sbjct: 109 PVKKVEEQPKREVKPVEPRPASPF 132
Score = 31.6 bits (72), Expect = 4.2
Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 2/50 (4%)
Query: 1267 PKPTKKPTKPTKKPTTTTEYNP--PEATTKPSTTTSTTTDSGAWTPNPTE 1314
PKP KK + K+ E P P T P+ TS+T + A P +
Sbjct: 107 PKPVKKVEEQPKREVKPVEPRPASPFENTAPARPTSSTATAAASKPVTSV 156
Score = 30.8 bits (70), Expect = 8.8
Identities = 19/79 (24%), Positives = 28/79 (35%), Gaps = 4/79 (5%)
Query: 1627 STVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
++V E +PI+ T + P+ P+P P+P P
Sbjct: 33 TSVHQVIELPAPAQPISVTMVAPADLEP-PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 91
Query: 1687 TTTVKPKPTTV---KPKPV 1702
+PKP KPKPV
Sbjct: 92 IPKPEPKPKPKPKPKPKPV 110
>gnl|CDD|221145 pfam11596, DUF3246, Protein of unknown function (DUF3246). This is a
small family of fungal proteins one of whose members from
Pichia stipitis is described as being an extremely serine
rich protein-mucin-like protein.
Length = 208
Score = 37.8 bits (87), Expect = 0.039
Identities = 24/84 (28%), Positives = 31/84 (36%), Gaps = 10/84 (11%)
Query: 1620 KPTTVSTSTVVTT--------EEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
PT ++ TVV T ++ E I+ TT TT T PT T TT
Sbjct: 72 DPTGTTSQTVVPTKPTTTDDDDDTTCVETTISDPTTITTPTGTVNGNPTGT--VTTNGTA 129
Query: 1672 ITTIKPKPTTVKPKPTTTVKPKPT 1695
TT+ V T T + T
Sbjct: 130 TTTVITTVEGVAVTYTGTGQTFTT 153
Score = 35.5 bits (81), Expect = 0.18
Identities = 26/105 (24%), Positives = 31/105 (29%), Gaps = 15/105 (14%)
Query: 1621 PTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK---- 1676
PT +T +E E T T T TT P TTS T KP TT
Sbjct: 40 PTGSATDDDDYDDEETDCEGQTTANPTGTVTT----DPTGTTSQTVVPTKPTTTDDDDDT 95
Query: 1677 -------PKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKIV 1714
PTT+ T TV + + V
Sbjct: 96 TCVETTISDPTTITTPTGTVNGNPTGTVTTNGTATTTVITTVEGV 140
Score = 32.4 bits (73), Expect = 2.3
Identities = 27/109 (24%), Positives = 34/109 (31%), Gaps = 6/109 (5%)
Query: 1241 DKMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTS 1300
D+ C+ + TT T PTKPT + E T TT +
Sbjct: 52 DEETDCEGQTTAN-PTGTVTTDPTGTTSQTVVPTKPTTT-DDDDDTTCVETTISDPTTIT 109
Query: 1301 TTTDSGAWTPNPTEWVWHPPTEPTTTHISVTEKSPLDQYFKIVCYFTNW 1349
T T G NPT V T TT +V + FT
Sbjct: 110 TPT--GTVNGNPTGTVTTNGTATTTVITTVEGVAVTYTGTGQT--FTTD 154
Score = 31.7 bits (71), Expect = 3.2
Identities = 22/75 (29%), Positives = 26/75 (34%), Gaps = 6/75 (8%)
Query: 1613 KPTSEESKPTTVSTSTVVTT---EEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP 1669
T E T T TT E+ E +E T TTT T T T + T
Sbjct: 134 ITTVEGVAVTYTGTGQTFTTDGTEDDEDCDETTTYTTTYYTPYTTVIHGGTVYTNGVT-- 191
Query: 1670 KPITTIKPKPTTVKP 1684
I T PT V+
Sbjct: 192 -VIATHTVYPTDVED 205
Score = 30.9 bits (69), Expect = 6.1
Identities = 26/96 (27%), Positives = 30/96 (31%), Gaps = 9/96 (9%)
Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
E+ E + TT T T T P +P TT TT + TT TT
Sbjct: 54 ETDCEGQTTANPTGTVTTDPTGTTSQTVVPT---KPTTTDDDDDTTCVETTISDPTTITT 110
Query: 1666 TTRPKPITTIKPKPTTV--KPKPTTTVKPKPTTVKP 1699
T T TV TTTV V
Sbjct: 111 PTG----TVNGNPTGTVTTNGTATTTVITTVEGVAV 142
>gnl|CDD|226000 COG3469, COG3469, Chitinase [Carbohydrate transport and metabolism].
Length = 332
Score = 37.9 bits (88), Expect = 0.048
Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 7/69 (10%)
Query: 1765 DLDNKFYEKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHN 1824
D D +F +V AL +G V L++GG + + L Q+ F+ ++ I +
Sbjct: 82 DPDAEFRAQVGALNAEGKAVLLSLGG-----ADGHIELKAGQEQ--AFVNEIIRLIETYG 134
Query: 1825 FDGLDLDWE 1833
FDGLD+D E
Sbjct: 135 FDGLDIDLE 143
Score = 33.3 bits (76), Expect = 1.6
Identities = 24/99 (24%), Positives = 37/99 (37%), Gaps = 18/99 (18%)
Query: 2225 AFADLDEQELVIRSHNNK-----------LDLEQGFYEKITKFNERGIQVILAIGGWSDS 2273
A AD V+ K D + F ++ N G V+L++GG
Sbjct: 52 ALADTPRNYNVVTVSFMKGAGDIPTFKPYNDPDAEFRAQVGALNAEGKAVLLSLGG---- 107
Query: 2274 RSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEFAWE 2312
+D + L FV + +E YGF+GL+ E
Sbjct: 108 -ADGHIELKAGQEQ--AFVNEIIRLIETYGFDGLDIDLE 143
Score = 31.4 bits (71), Expect = 5.9
Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
Query: 1403 FYERVVTLKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLD 1462
F +V L +G V L SLGG + A Q F+ +++ + Y FDGLD
Sbjct: 87 FRAQVGALNAEGKAVLL-------SLGGADGHIELKAGQEQAFVNEIIRLIETYGFDGLD 139
Query: 1463 LDWE 1466
+D E
Sbjct: 140 IDLE 143
Score = 31.4 bits (71), Expect = 6.0
Identities = 15/68 (22%), Positives = 29/68 (42%), Gaps = 15/68 (22%)
Query: 97 VSITSDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFSTFI 156
+ + + +E F++ ++ + + Y FDGLD+ DLE A +D T I
Sbjct: 112 IELKAGQEQ--AFVNEIIRLIETYGFDGLDI-------------DLEQSAILAADNQTVI 156
Query: 157 QELSSTLR 164
++
Sbjct: 157 PAALKAVK 164
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 38.4 bits (89), Expect = 0.051
Identities = 19/103 (18%), Positives = 41/103 (39%), Gaps = 1/103 (0%)
Query: 1596 VLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKR 1655
+L + G + S+ E ++ E + T + + + +T + T +T +
Sbjct: 21 ILTYASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQ 80
Query: 1656 PKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
P + TT T K + ++ P +V TT+V + +
Sbjct: 81 ASQPHSHETTITCTKSLISV-PYYKSVDMNCTTSVGVNYSEYR 122
Score = 35.4 bits (81), Expect = 0.41
Identities = 15/77 (19%), Positives = 24/77 (31%)
Query: 1623 TVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTV 1682
ST + E + + + +T + T TT P T T + T + +
Sbjct: 25 ASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQP 84
Query: 1683 KPKPTTTVKPKPTTVKP 1699
TT K P
Sbjct: 85 HSHETTITCTKSLISVP 101
>gnl|CDD|217603 pfam03537, DUF297, TM1410 hypothetical-related protein.
Length = 203
Score = 36.9 bits (86), Expect = 0.075
Identities = 35/202 (17%), Positives = 66/202 (32%), Gaps = 59/202 (29%)
Query: 1719 NWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFYEKVTALK 1778
+WA+Y Y + ++L D +VI D + ++ ALK
Sbjct: 1 SWAFY------YGGALPVAELDAF----------DVVVI-------DPFSFSKAEIAALK 37
Query: 1779 KKGVKV---------------------TLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVV 1817
G KV + G G ++ + S R + +
Sbjct: 38 AAGTKVIAYLSVGEVEDWRPDASQFPKSWLGGENPGWPGERWVD-IRSPAWRDIIMDARL 96
Query: 1818 NFILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSP 1877
+ + FDG+ LD +Q G LI+EL A L +
Sbjct: 97 DRLWAKGFDGVFLDN--VDSYQNANGTGRPLTAADLVALIRELAARA--PGLAI------ 146
Query: 1878 SKAVIDNAYDI-PVMSENLDWI 1898
+++N +++ P ++ +D +
Sbjct: 147 ---ILNNGFELLPALAPLVDGV 165
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 36.5 bits (84), Expect = 0.076
Identities = 25/99 (25%), Positives = 39/99 (39%), Gaps = 6/99 (6%)
Query: 1614 PTSEESKPT-TVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRP--KPPTTTSTTTTRPK 1670
P +E SK T T T T + + T ++T T++ PT+T TT R +
Sbjct: 35 PPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTTNRQQ 94
Query: 1671 PI--TTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
+ T P + + K K T + PP+T
Sbjct: 95 TVDRHTTPPSSSRTQTTQAVHEK-KNTRTTSRTQTPPTT 132
Score = 35.8 bits (82), Expect = 0.13
Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 3/92 (3%)
Query: 1619 SKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTI--- 1675
++ TV T + QT + + + TT+ + P T+T+ T T+
Sbjct: 91 NRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGK 150
Query: 1676 KPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
+P T+V+P+ + T + T + P ST
Sbjct: 151 EPTTTSVQPRSSATTQSHEETSQANPQSSAST 182
Score = 32.7 bits (74), Expect = 1.6
Identities = 25/93 (26%), Positives = 36/93 (38%), Gaps = 1/93 (1%)
Query: 1605 YESTTEEYKPTSEESKP-TTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
+T+E + T S P TT TV P + TT T+ T
Sbjct: 71 PAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTPP 130
Query: 1664 TTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
TT+T TT +T K TT+V+P+ +
Sbjct: 131 TTSTAAVQTTTTTNTSSTGKEPTTTSVQPRSSA 163
Score = 31.5 bits (71), Expect = 3.0
Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 3/66 (4%)
Query: 1261 TTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWVWHPP 1320
TSE + PT+ +KK TT NP T P++ T + + T H
Sbjct: 24 NTSESEHHTSSPPTESSKKTPTTPTDNP---DTNPNSQHPTQQSTESSTLPAATSESHLE 80
Query: 1321 TEPTTT 1326
TEPT+T
Sbjct: 81 TEPTST 86
>gnl|CDD|221091 pfam11355, DUF3157, Protein of unknown function (DUF3157). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 197
Score = 36.6 bits (85), Expect = 0.081
Identities = 14/56 (25%), Positives = 23/56 (41%), Gaps = 1/56 (1%)
Query: 1609 TEEYKPTSEESKPTTVSTSTVVTTE-EPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
T EY ++ T ++ + ++ P P+ T TT TT +PK S
Sbjct: 39 TWEYVILETQTTATAATSQSAASSSAAPAVVAIPVATATTLTTIALAQPKLLLQLS 94
>gnl|CDD|217490 pfam03318, ETX_MTX2, Clostridium epsilon toxin ETX/Bacillus
mosquitocidal toxin MTX2. This family appears to be
distantly related to pfam01117.
Length = 228
Score = 36.6 bits (85), Expect = 0.086
Identities = 29/103 (28%), Positives = 38/103 (36%), Gaps = 26/103 (25%)
Query: 1604 NYESTTEEYKPTS-EESKPTTVSTSTV------VTTEE----PEQTEEPITTTTT----S 1648
N +T+ + S + TT ST+T P E IT + T
Sbjct: 38 NNTDSTQTLQTQSFSKKVTTTTSTTTTHGFKIGAKASGKFGIPFVAEGGITLSVTGEYNF 97
Query: 1649 TTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
++TTT TTSTT T T + TV P T TV
Sbjct: 98 SSTTT------NTTSTTET-----YTAPSQKVTVPPHTTVTVT 129
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 37.6 bits (87), Expect = 0.090
Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 11/81 (13%)
Query: 1640 EPITTTTTSTTTT--------TKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
E I TT T ++R P + + +P P + P+ + +
Sbjct: 304 EVIAETTAPLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEE---P 360
Query: 1692 PKPTTVKPKPVIPPSTKDEFK 1712
P+P V P+P+ P + ++ K
Sbjct: 361 PQPKAVVPRPLSPYTAYEDLK 381
Score = 36.8 bits (85), Expect = 0.14
Identities = 20/99 (20%), Positives = 33/99 (33%), Gaps = 12/99 (12%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRP----------KPPTTTS 1663
PT+ +VS+++ V P+ T + RP KPPT+ S
Sbjct: 455 PTAPTGVSPSVSSTSSV-PAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPS 513
Query: 1664 TTTTRPKPITTIKPK-PTTVKPKPTTTVKPKPTTVKPKP 1701
K + + P T + + +PKP
Sbjct: 514 PAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKP 552
Score = 32.2 bits (73), Expect = 3.9
Identities = 22/125 (17%), Positives = 36/125 (28%), Gaps = 26/125 (20%)
Query: 1610 EEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST----- 1664
E+ KP + S PT + + + P T T+ T PP
Sbjct: 445 EDLKPPTSPS-PTAPTGVSPSVSSTSSVPAVPDTAPATAATDA---AAPPPANMRPLSPY 500
Query: 1665 -------TTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTV----------KPKPVIPPST 1707
T P P + + + P T KP+P+ P +
Sbjct: 501 AVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPYTM 560
Query: 1708 KDEFK 1712
++ K
Sbjct: 561 YEDLK 565
Score = 32.2 bits (73), Expect = 4.3
Identities = 22/107 (20%), Positives = 33/107 (30%), Gaps = 12/107 (11%)
Query: 1613 KPTSEESKPTTVSTSTVVTTE---EPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP 1669
KP + P+ S S V E + P+ S + KPPT+ S T
Sbjct: 406 KPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPL-----SPYARYEDLKPPTSPSPTAPTG 460
Query: 1670 KPITTIKPKPTT----VKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
+ P T P +P+ P + D+ K
Sbjct: 461 VSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLK 507
Score = 31.8 bits (72), Expect = 5.8
Identities = 18/78 (23%), Positives = 28/78 (35%), Gaps = 8/78 (10%)
Query: 1627 STVVTTEEPEQTEEPITTTTTSTT---TTTKRPKPPTTTSTTTTRP-KPITT---IKPKP 1679
S V +E + + P T T + + P RP P T +KP P
Sbjct: 325 SQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKP-P 383
Query: 1680 TTVKPKPTTTVKPKPTTV 1697
T+ P P ++ +V
Sbjct: 384 TSPIPTPPSSSPASSKSV 401
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 36.9 bits (85), Expect = 0.096
Identities = 23/72 (31%), Positives = 28/72 (38%), Gaps = 10/72 (13%)
Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTI---KPKPTTVKPKPTTTVKPKPTTVK 1698
ITTTT + TT P +T P T + + KP P T P+ T
Sbjct: 21 ITTTTQAANATT----PSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQT--- 73
Query: 1699 PKPVIPPSTKDE 1710
P P STK E
Sbjct: 74 PNATTPSSTKVE 85
Score = 36.5 bits (84), Expect = 0.15
Identities = 21/90 (23%), Positives = 36/90 (40%), Gaps = 4/90 (4%)
Query: 1624 VSTSTVVTTEEPEQTEEPITTTTTS--TTTTTKRPKPPTTTSTTTTRPKP-ITTIKPKPT 1680
++T + TT + P +T + +T + + + P + TT P + + P
Sbjct: 16 LTTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPN 75
Query: 1681 TVKPKPTTTVKPK-PTTVKPKPVIPPSTKD 1709
P T P+ PTT + I P KD
Sbjct: 76 ATTPSSTKVETPQSPTTKQVPTEINPKFKD 105
Score = 35.0 bits (80), Expect = 0.40
Identities = 22/81 (27%), Positives = 31/81 (38%), Gaps = 3/81 (3%)
Query: 1607 STTEEYKPTSEESKP--TTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST 1664
+TT+ T+ S ST E P+ T +T + P T +ST
Sbjct: 23 TTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSST 82
Query: 1665 TTTRPKPITTIKPKPTTVKPK 1685
P+ TT K PT + PK
Sbjct: 83 KVETPQSPTT-KQVPTEINPK 102
Score = 34.2 bits (78), Expect = 0.67
Identities = 32/138 (23%), Positives = 57/138 (41%), Gaps = 25/138 (18%)
Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNP----PEATTKPSTTTSTTTDSGAWTPNP 1312
+S + Q P+ K P KP TT + P+ T +T +ST ++
Sbjct: 34 SSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVET------- 86
Query: 1313 TEWVWHPPTEPTTTHISVTEKSPLDQYFKIVCYFTNWAWYRPGKGKYVLKK---VPWINI 1369
P PTT + TE +P ++ + Y+T + + +LKK + ++N+
Sbjct: 87 -------PQSPTTKQVP-TEINP--KFKDLRAYYTKPSLEFKNEIGIILKKWTTIRFMNV 136
Query: 1370 LKLYVTLQTGL-GTDQEK 1386
+ Y + L G D +K
Sbjct: 137 VPDYFIYKIALVGKDDKK 154
Score = 33.5 bits (76), Expect = 1.4
Identities = 27/102 (26%), Positives = 43/102 (42%), Gaps = 8/102 (7%)
Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
+ST K + +SKP +T E P+QT TT ++T + P+ PTT
Sbjct: 42 QSTPPSTKVEAPQSKPN-ATTPPSTKVEAPQQTPNA----TTPSSTKVETPQSPTTKQVP 96
Query: 1666 TT---RPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIP 1704
T + K + KP+ + K TT++ V+P
Sbjct: 97 TEINPKFKDLRAYYTKPSLEFKNEIGIILKKWTTIRFMNVVP 138
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 36.8 bits (85), Expect = 0.17
Identities = 19/123 (15%), Positives = 31/123 (25%), Gaps = 10/123 (8%)
Query: 1595 EVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTK 1654
G + E + S +V + + + P + + T
Sbjct: 492 TATPESSGSDSEGGASGRE-DETSSDAESVVSIKELRPRIGFINKSPPPKSPPKSRRTLI 550
Query: 1655 RPKPPTTTSTT-TTRPKPI---TTIKPKPTTVKPKPTTT-----VKPKPTTVKPKPVIPP 1705
+ ST + RPKP I P T ++VKPK
Sbjct: 551 VALSLASPSTAGSPRPKPSLGKFVIGTDPFAFANTVRLTDNMRGGNGVGSSVKPKGSASS 610
Query: 1706 STK 1708
Sbjct: 611 KPL 613
Score = 31.8 bits (72), Expect = 5.7
Identities = 17/109 (15%), Positives = 25/109 (22%), Gaps = 12/109 (11%)
Query: 1601 PGGNYESTTEEYKPTSE-ESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP 1659
E T S+ E + T E I K P P
Sbjct: 485 DAKRPEVTATPESSGSDSEGGASGREDETSSDAESVVS----IKELRPRIGFINKSPPPK 540
Query: 1660 TTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
+ + T + P + +PKP+ K P
Sbjct: 541 SPPKSRRTLI-------VALSLASPSTAGSPRPKPSLGKFVIGTDPFAF 582
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 35.6 bits (82), Expect = 0.21
Identities = 18/103 (17%), Positives = 31/103 (30%), Gaps = 7/103 (6%)
Query: 1608 TTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTT 1667
T S+ S + E E ++ +ST+ T + P T
Sbjct: 84 ATAPSDAGSQASPDDDAQPAA-----EAEAADQSAPPEASSTSATDEAATDPPAT--AAA 136
Query: 1668 RPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
R P +P T + + +P + P P P +
Sbjct: 137 RDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAG 179
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this family
are typically between 146 and 237 amino acids in length.
There are two conserved sequence motifs: QYGP and PKLY.
Length = 155
Score = 34.7 bits (80), Expect = 0.24
Identities = 16/62 (25%), Positives = 21/62 (33%), Gaps = 5/62 (8%)
Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
EE E EE T T P+ T + R P + + +PK K
Sbjct: 96 DEEEETEEES-----TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150
Query: 1692 PK 1693
PK
Sbjct: 151 PK 152
Score = 32.8 bits (75), Expect = 1.1
Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 5/68 (7%)
Query: 1611 EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPK 1670
E + +E + +T+E EQ + P T T + + PKP T T PK
Sbjct: 90 ELSSSDDEE-----EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144
Query: 1671 PITTIKPK 1678
KPK
Sbjct: 145 KPKPSKPK 152
Score = 31.6 bits (72), Expect = 2.7
Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 1/60 (1%)
Query: 1646 TTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT-VKPKPTTVKPKPVIP 1704
++S + + T + P+ T K K PKP T KPK KPKP P
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151
Score = 30.9 bits (70), Expect = 4.2
Identities = 12/62 (19%), Positives = 17/62 (27%)
Query: 1618 ESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKP 1677
E + E +E T T + KR P T + +P
Sbjct: 90 ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPS 149
Query: 1678 KP 1679
KP
Sbjct: 150 KP 151
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 36.1 bits (84), Expect = 0.24
Identities = 25/125 (20%), Positives = 36/125 (28%), Gaps = 14/125 (11%)
Query: 1572 WMKKGAPSRKLVMGVHPLL-STITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVV 1630
W+ SRK PLL +VLG N ES + S
Sbjct: 483 WLGM-VQSRK------PLLEQAFAKVLGRSIKLNLESQS------GSASNTAKTPPPPQK 529
Query: 1631 TTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTV 1690
+ P T T T+ T P P T +++ + P P + +
Sbjct: 530 SPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPT 589
Query: 1691 KPKPT 1695
K
Sbjct: 590 KDSSP 594
Score = 36.1 bits (84), Expect = 0.25
Identities = 10/72 (13%), Positives = 16/72 (22%)
Query: 1639 EEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
+ + + P + K TT P P P P +
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAE 420
Query: 1699 PKPVIPPSTKDE 1710
P P +
Sbjct: 421 PTEPSPTPPANA 432
Score = 36.1 bits (84), Expect = 0.26
Identities = 16/77 (20%), Positives = 23/77 (29%)
Query: 1635 PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKP 1694
+T P + T P+P T T P P T + P + P P
Sbjct: 520 TAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPP 579
Query: 1695 TTVKPKPVIPPSTKDEF 1711
+P P + E
Sbjct: 580 IPEEPTPSPTKDSSPEE 596
Score = 35.7 bits (83), Expect = 0.33
Identities = 12/83 (14%), Positives = 17/83 (20%), Gaps = 8/83 (9%)
Query: 1627 STVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
S + P + T T P P P V +P
Sbjct: 366 SEIANASAPANPTPAPNPSPPPAP----IQPSAPKTKQAATTPSPPPAKASPPIPVPAEP 421
Query: 1687 TTTVKPKPTTVKPKPVIPPSTKD 1709
T T P + +
Sbjct: 422 TEP----SPTPPANAANAPPSLN 440
Score = 35.7 bits (83), Expect = 0.35
Identities = 11/76 (14%), Positives = 16/76 (21%), Gaps = 1/76 (1%)
Query: 1635 PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKP 1694
P I + T P + PK P
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAE 420
Query: 1695 TTVKPKPVIPPSTKDE 1710
T +P P P + +
Sbjct: 421 PT-EPSPTPPANAANA 435
Score = 33.0 bits (76), Expect = 2.3
Identities = 18/84 (21%), Positives = 28/84 (33%), Gaps = 3/84 (3%)
Query: 1256 QTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEW 1315
++ T K P P T PP P T T ++++ A P +
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSS- 575
Query: 1316 VWHPPTEPTTTHISVTEKSPLDQY 1339
PP P S T+ S ++
Sbjct: 576 --PPPPIPEEPTPSPTKDSSPEEI 597
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 35.8 bits (82), Expect = 0.27
Identities = 23/101 (22%), Positives = 37/101 (36%), Gaps = 12/101 (11%)
Query: 1608 TTEEYKPTSEESKPTTVSTSTVVTTEEPEQT-----------EEPITTTTTSTTTTTKRP 1656
T S P++V+T + TT+ P T + +P
Sbjct: 317 TPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQP 376
Query: 1657 KPPTTTSTTTTRPKPITTIKPKPTTVKP-KPTTTVKPKPTT 1696
+P +TT T + IT+ PTT P P + + PT+
Sbjct: 377 QPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTS 417
Score = 32.4 bits (73), Expect = 3.4
Identities = 14/70 (20%), Positives = 22/70 (31%), Gaps = 2/70 (2%)
Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
+SA T S + + P P++ T TT+ S ++ V
Sbjct: 307 SSAVTQSSAITPSSAAIPSPAVIPSSVT--TQSATTTQASAVALSSAGVLPSDVTLPGTV 364
Query: 1317 WHPPTEPTTT 1326
P EP
Sbjct: 365 ALPAAEPVNM 374
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 35.5 bits (81), Expect = 0.33
Identities = 23/78 (29%), Positives = 28/78 (35%), Gaps = 5/78 (6%)
Query: 1623 TVSTSTVVTTEEP--EQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK--PK 1678
T + V TE P E+ P+ T+T T T K TTT T P K P
Sbjct: 249 TATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKE-TTTQQQTAPKAPTEAAKPAPA 307
Query: 1679 PTTVKPKPTTTVKPKPTT 1696
P+T T T
Sbjct: 308 PSTNTNANKTNTNTNTNT 325
Score = 35.5 bits (81), Expect = 0.41
Identities = 25/97 (25%), Positives = 32/97 (32%), Gaps = 5/97 (5%)
Query: 1606 ESTTEEYKPTSEESKPTTVSTST---VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT 1662
E TE P +E+ V +T TTE+ E T + T T P P T T
Sbjct: 255 EVKTE--APAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNT 312
Query: 1663 STTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKP 1699
+ T T T+ K T T T
Sbjct: 313 NANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNS 349
Score = 32.9 bits (74), Expect = 2.6
Identities = 17/64 (26%), Positives = 24/64 (37%), Gaps = 4/64 (6%)
Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPP---EATTKPSTTTST-TTDSGA 1307
T +Q +A + KP P+ T T T N T+ PS T+T T +
Sbjct: 287 TTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTN 346
Query: 1308 WTPN 1311
N
Sbjct: 347 TNSN 350
Score = 31.7 bits (71), Expect = 5.0
Identities = 15/54 (27%), Positives = 21/54 (38%)
Query: 1260 TTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPT 1313
TT + PK + KP P+T T N T +T + T+ T T
Sbjct: 288 TTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNT 341
Score = 31.7 bits (71), Expect = 5.9
Identities = 15/74 (20%), Positives = 23/74 (31%), Gaps = 2/74 (2%)
Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
+ E K TT T + P + +P +T+T T T+ T T T
Sbjct: 280 ATTEKKETTTQQQT--APKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNT 337
Query: 1675 IKPKPTTVKPKPTT 1688
+ T
Sbjct: 338 NTNTNSNTNTNSNT 351
Score = 31.7 bits (71), Expect = 6.4
Identities = 17/62 (27%), Positives = 21/62 (33%), Gaps = 1/62 (1%)
Query: 1638 TEEPITTTT-TSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
T P+ T TTT++ P T T + TT PK K P
Sbjct: 144 TSTPVAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHA 203
Query: 1697 VK 1698
VK
Sbjct: 204 VK 205
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 35.7 bits (82), Expect = 0.33
Identities = 25/92 (27%), Positives = 31/92 (33%), Gaps = 3/92 (3%)
Query: 1597 LGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVV--TTEEPEQTEEPITTTTTSTTTTTK 1654
L P G S P S P V T +T TT T +TTT+
Sbjct: 172 LAAPPLGE-GSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSP 230
Query: 1655 RPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
STT P+ TT + + T P P
Sbjct: 231 PSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262
Score = 35.3 bits (81), Expect = 0.35
Identities = 23/69 (33%), Positives = 26/69 (37%), Gaps = 7/69 (10%)
Query: 1635 PEQTEEPITTTTTSTTTTTKRPKPPTTTST--TTTRPKPITTIKPKPTTVKPKPTTTVKP 1692
P P T T TT P TT T TTT P T P T P+ TT +
Sbjct: 199 PADVFVPATPRPTPRTTA-----SPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEA 253
Query: 1693 KPTTVKPKP 1701
+ T P P
Sbjct: 254 EGTPAPPTP 262
Score = 34.2 bits (78), Expect = 1.0
Identities = 20/63 (31%), Positives = 21/63 (33%), Gaps = 7/63 (11%)
Query: 1273 PTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWVWHPPTEPTTTHISVTE 1332
P P P TT PE T PSTTTS + T P TT T
Sbjct: 205 PATPRPTPRTTA---SPETTPTPSTTTSPPST----TIPAPSTTIAAPQAGTTPEAEGTP 257
Query: 1333 KSP 1335
P
Sbjct: 258 APP 260
Score = 33.0 bits (75), Expect = 2.2
Identities = 23/57 (40%), Positives = 25/57 (43%), Gaps = 6/57 (10%)
Query: 1653 TKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK-----PKPVIP 1704
T RP P TT S TT P P TT P TT+ TT P+ T P P P
Sbjct: 207 TPRPTPRTTASPETT-PTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262
Score = 32.2 bits (73), Expect = 3.4
Identities = 16/52 (30%), Positives = 18/52 (34%), Gaps = 3/52 (5%)
Query: 1656 PKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
P P T TT P+ T TT P TT P T P+ P
Sbjct: 205 PATPRPTPRTTASPE---TTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEA 253
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein TopJ.
This model describes TopJ (MG_200, CbpA), a DnaJ homolog
and probable assembly protein of the Mycoplasma terminal
organelle. The terminal organelle is involved in both
cytadherence and gliding motility [Cellular processes,
Chemotaxis and motility].
Length = 871
Score = 35.6 bits (81), Expect = 0.37
Identities = 29/84 (34%), Positives = 35/84 (41%), Gaps = 7/84 (8%)
Query: 1256 QTSATTTSEQVPKPTKK-PTKPTKKPTTTTEYNPPE------ATTKPSTTTSTTTDSGAW 1308
+T A T +E+ P+PT+ TKPT P +T E N PE A S T STT
Sbjct: 283 ETVAETKAEEEPQPTQTVETKPTSAPESTVEENLPEINQPTQAVQPTSETISTTPVEPTD 342
Query: 1309 TPNPTEWVWHPPTEPTTTHISVTE 1332
P E T I V E
Sbjct: 343 QLKPKEVDQIQEELKKTKEIEVEE 366
Score = 31.7 bits (71), Expect = 7.0
Identities = 24/109 (22%), Positives = 40/109 (36%), Gaps = 12/109 (11%)
Query: 1609 TEEYKPTSEESKPTTVSTSTVVTTEEPEQTE----------EPITTTTTSTTTTTKRPKP 1658
T E +P + + + ++++ E T EP+ T + T + P+P
Sbjct: 237 TRELEPQDDSEDDYVIPDAEIISSPTLEVTAPKEVEQPLQPEPVDEETVAETKAEEEPQP 296
Query: 1659 PTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
T T T T + P +PT V+P T+ PV P
Sbjct: 297 TQTVETKPTSAPESTVEENLPEI--NQPTQAVQPTSETISTTPVEPTDQ 343
>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein. This domain is found
in the aflatoxin regulatory protein (AflR) which is
involved in the regulation of the biosynthesis of
aflatoxin in the fungal genus Aspergillus. It occurs
together with the fungal Zn(2)-Cys(6) binuclear cluster
domain (pfam00172).
Length = 275
Score = 34.9 bits (80), Expect = 0.39
Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 2/68 (2%)
Query: 1632 TEEPEQTEEP-ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTT-VKPKPTTT 1689
E P P I TT+ TT++ P+PP + ++ +P T P T+ PK +
Sbjct: 2 LETPNTASSPTIPANTTANTTSSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPSPKFSHQ 61
Query: 1690 VKPKPTTV 1697
P +
Sbjct: 62 SPPAEPEL 69
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 35.4 bits (81), Expect = 0.40
Identities = 31/128 (24%), Positives = 42/128 (32%), Gaps = 14/128 (10%)
Query: 1574 KKGAPSRKL-----VMGVHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTST 1628
KK S+K + GV + ++IT PG + +E P E SK S
Sbjct: 17 KKNKKSKKFKTGATIAGVTAIATSITV-----PGIEVIVSADETAPADEASK----SAEA 67
Query: 1629 VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
T E P TT T T+ + T K + +P TV
Sbjct: 68 NTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKAPAEPATVSNPDNA 127
Query: 1689 TVKPKPTT 1696
T P T
Sbjct: 128 TSSSTPAT 135
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends to
by Lys/Arg-rich, and is followed by a membrane-spanning
region. This is followed by an acidic low-complexity
region of variable length and a well-conserved C-terminal
domain of two tandem regions matched by pfam05036
(Sporulation related repeat), found in several cell
division and sporulation proteins. The role of FtsN as a
suppressor for other cell division mutations is poorly
understood; it may involve cell wall hydrolysis [Cellular
processes, Cell division].
Length = 298
Score = 35.1 bits (80), Expect = 0.41
Identities = 24/120 (20%), Positives = 35/120 (29%), Gaps = 7/120 (5%)
Query: 1588 PLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTT 1647
L++ E G T E + E+ + + V+ T EQT T
Sbjct: 94 VLINDPEEPSNGGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQT 153
Query: 1648 STTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
+ K P T + P+ T K + K PK T P T
Sbjct: 154 AE-------KKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIET 206
Score = 33.1 bits (75), Expect = 1.9
Identities = 17/89 (19%), Positives = 26/89 (29%), Gaps = 3/89 (3%)
Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKP---PTTTS 1663
T + T + E + P+ T ++ + K P T+
Sbjct: 136 LATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTA 195
Query: 1664 TTTTRPKPITTIKPKPTTVKPKPTTTVKP 1692
T + KPI T K KP K
Sbjct: 196 ETQSNSKPIETAPKADKADKTKPKPKEKA 224
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 34.5 bits (79), Expect = 0.46
Identities = 15/83 (18%), Positives = 20/83 (24%), Gaps = 2/83 (2%)
Query: 1629 VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST--TTTRPKPITTIKPKPTTVKPKP 1686
V P Q E + P + + P + P VKP
Sbjct: 61 VQVVALPTQPPEGVAQEIQDAGDAAAASVDPQPVAQPPVESTPAGVPVAAQTPKPVKPPK 120
Query: 1687 TTTVKPKPTTVKPKPVIPPSTKD 1709
P PKP P +
Sbjct: 121 QPPAGAVPAKPTPKPEPKPVAEP 143
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 35.1 bits (81), Expect = 0.50
Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 5/85 (5%)
Query: 1634 EPEQTEEPITT-----TTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
P+Q +P+ T S ++ + + P+ T P TV P
Sbjct: 374 GPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPA 433
Query: 1689 TVKPKPTTVKPKPVIPPSTKDEFKI 1713
V P + P+ V P K+E KI
Sbjct: 434 AVPVNPPSTAPQAVRPAQFKEEKKI 458
Score = 32.1 bits (73), Expect = 4.1
Identities = 22/123 (17%), Positives = 37/123 (30%), Gaps = 13/123 (10%)
Query: 1622 TTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKP----PTTTSTTTTRPKPITTIKP 1677
+ S S +P + T T + P P +T+ RP K
Sbjct: 398 ASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKK 457
Query: 1678 KPTTVKPK--PTTTVKPKPTTVKPKPVIP----PSTKDEF---KIVCYFTNWAWYRQSGG 1728
P + P+T + + I + K+ F + Y+ +A R
Sbjct: 458 IPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQKEIFTEEDLQYYWQEFAGTRPQEE 517
Query: 1729 KYL 1731
K L
Sbjct: 518 KAL 520
>gnl|CDD|163190 TIGR03234, OH-pyruv-isom, hydroxypyruvate isomerase. This enzyme
interconverts tartronate semi-aldehyde (TSA, aka
2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E.
coli enzyme has been characterized and found to be
specific for TSA, contain no cofactors, and have a
rather high Km for hydroxypyruvate of 12.5 mM. The gene
is ofter found in association with glyoxalate
carboligase (which produces TSA), but has been shown to
have no effect on growth on glyoxalate when knocked out.
This is consistent with the fact that the gene for
tartronate semialdehyde reductase (glxR) is also
associated and may have primary responsibility for the
catabolism of TSA.
Length = 254
Score = 34.6 bits (80), Expect = 0.51
Identities = 14/42 (33%), Positives = 23/42 (54%), Gaps = 4/42 (9%)
Query: 585 EGITGLVSPLNSR--PGEPYPNYNINTALKLIDELGGDKRKI 624
G+T L+ P+NS PG + AL +ID++G + K+
Sbjct: 135 IGLTLLIEPINSFDMPG--FFLTTTEQALAVIDDVGRENLKL 174
>gnl|CDD|145047 pfam01692, Paramyxo_C, Paramyxovirus non-structural protein c. This
family consist of the C proteins (C', C, Y1, Y2) found in
Paramyxovirinae; human parainfluenza, and sendai virus.
The C proteins effect viral RNA synthesis having both a
positive and negative effect during the course of
infection. Paramyxovirus have a negative strand ssRNA
genome of 15.3kb form which six mRNAs are transcribed,
five of these are monocistronic. The P/C mRNA is
polycistronic and has two overlapping open reading frames
P and C, C encodes the nested C proteins C', C, Y1 and
Y2.
Length = 204
Score = 34.2 bits (78), Expect = 0.58
Identities = 14/66 (21%), Positives = 26/66 (39%), Gaps = 2/66 (3%)
Query: 1637 QTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
Q + +++ + ++ + PT T TT PK + KP+ K K
Sbjct: 16 QENKNLSSMRSDSSLNSYPTSAPTPEKTEAGSMVSSTT--PKDSAHHAKPSVNTKTKQQK 73
Query: 1697 VKPKPV 1702
+PK +
Sbjct: 74 RRPKII 79
>gnl|CDD|119361 cd06544, GH18_narbonin, Narbonin is a plant 2S protein from the
globulin fraction of narbon bean (Vicia narbonensis L.)
cotyledons with unknown function. Narbonin has a
glycosyl hydrolase family 18 (GH18) domain without the
conserved catalytic residues and with no known enzymatic
activity. Narbonin amounts to up to 3% of the total seed
globulins of mature seeds and was thought to be a storage
protein but was found to degrade too slowly during
germination. This family also includes the VfNOD32
nodulin from Vicia faba.
Length = 253
Score = 34.3 bits (79), Expect = 0.60
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 2/64 (3%)
Query: 1772 EKVTALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLD 1829
E V ++K + VKV ++IGG N S ++ + + I +N DG+D
Sbjct: 59 EAVKSIKAQHPNVKVVISIGGRGVQNNPTPFDPSNVDSWVSNAVSSLTSIIQTYNLDGID 118
Query: 1830 LDWE 1833
+D+E
Sbjct: 119 IDYE 122
Score = 33.5 bits (77), Expect = 1.2
Identities = 33/138 (23%), Positives = 62/138 (44%), Gaps = 15/138 (10%)
Query: 1405 ERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLD 1462
E V ++K + VKV ++IGG N + + + + Y DG+D
Sbjct: 59 EAVKSIKAQHPNVKVVISIGGRGVQNNPTPFDPSNVDSWVSNAVSSLTSIIQTYNLDGID 118
Query: 1463 LDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVINAAYDV- 1521
+D+E + A PD+ E G + EL K +G++ A+++PS+ + Y
Sbjct: 119 IDYE-------HFPADPDTFVECIGQLITEL----KNNGVIKVASIAPSEDAEQSHYLAL 167
Query: 1522 -KALSESLDWISVMTYDY 1538
A + +D+++ Y+Y
Sbjct: 168 YNAYGDYIDYVNYQFYNY 185
Score = 32.7 bits (75), Expect = 1.8
Identities = 19/96 (19%), Positives = 35/96 (36%), Gaps = 20/96 (20%)
Query: 79 LKEANPELKVYLAV----KSNFVSITSDRESRL---NFISSVLEMFDMYKFDGLDLNVKD 131
+K +P +KV +++ N + N +SS+ + Y DG+D+
Sbjct: 64 IKAQHPNVKVVISIGGRGVQNNPTPFDPSNVDSWVSNAVSSLTSIIQTYNLDGIDI---- 119
Query: 132 PALNDEDDDDLESIANERSDFSTFIQELSSTLRRNN 167
D E + F I +L + L+ N
Sbjct: 120 ---------DYEHFPADPDTFVECIGQLITELKNNG 146
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 34.7 bits (80), Expect = 0.64
Identities = 18/86 (20%), Positives = 31/86 (36%), Gaps = 6/86 (6%)
Query: 1624 VSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPT--TTSTTTTRPKPITTIKPKPTT 1681
+ T +V P+ P++ +T + T P + + TT P + +P PT
Sbjct: 204 MGTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTA 263
Query: 1682 VKPKPTTTVKPKPTTVKPKPVIPPST 1707
P T P P+ S+
Sbjct: 264 PSSAPATA----PAAAAPQAAATSSS 285
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 35.1 bits (80), Expect = 0.65
Identities = 26/122 (21%), Positives = 47/122 (38%), Gaps = 8/122 (6%)
Query: 1598 GHGPGGNYESTTEEYKPTSEESKPTTVSTST--------VVTTEEPEQTEEPITTTTTST 1649
G PG E + ++E KP+ + T + ++PE+ ++P +
Sbjct: 546 GGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQR 605
Query: 1650 TTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKD 1709
T K PK P + +P + PK +P++ +P+ + P P S K
Sbjct: 606 PTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKP 665
Query: 1710 EF 1711
F
Sbjct: 666 PF 667
Score = 33.5 bits (76), Expect = 1.6
Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 23/86 (26%)
Query: 1251 RTKCKQTSATT-----TSEQVPKP-----TKKPTKPTKKPTTTTEYNPPEATTKPS---- 1296
RT +T A T +E+ + T +P + K+P + +E+ PS
Sbjct: 757 RTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPSLPKK 816
Query: 1297 ---------TTTSTTTDSGAWTPNPT 1313
+TT +D+G + +
Sbjct: 817 RHRLDGLALSTTDLESDAGRIAKDAS 842
Score = 33.1 bits (75), Expect = 2.5
Identities = 22/110 (20%), Positives = 32/110 (29%), Gaps = 7/110 (6%)
Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
E ++Y + +SK T + + E + P T T TT PK P
Sbjct: 673 EKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFP 732
Query: 1666 TTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPS---TKDEFK 1712
+P P + T P P +EFK
Sbjct: 733 FEPIGDPDAEQPDDIEFFTPPEE----ERTFFHETPADTPLPDILAEEFK 778
Score = 33.1 bits (75), Expect = 2.6
Identities = 23/133 (17%), Positives = 37/133 (27%), Gaps = 5/133 (3%)
Query: 1572 WMKKGAPSRKLVMGV---HPLLSTITEVLGHGPGGNYESTTE--EYKPTSEESKPTTVST 1626
++ A S++ V S + E L PG + + P EE +
Sbjct: 679 YLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGD 738
Query: 1627 STVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
++ E P T T P P T +P +P
Sbjct: 739 PDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS 798
Query: 1687 TTTVKPKPTTVKP 1699
+ + KP P
Sbjct: 799 PSEHEDKPPGDHP 811
Score = 32.0 bits (72), Expect = 5.2
Identities = 20/78 (25%), Positives = 27/78 (34%), Gaps = 2/78 (2%)
Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTK--KPTTTTEYNPPEATTKPSTTTSTTTDSGAWT 1309
K K+T T ++ + K T P P TT PP+ D A
Sbjct: 684 AKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQ 743
Query: 1310 PNPTEWVWHPPTEPTTTH 1327
P+ E+ P E T H
Sbjct: 744 PDDIEFFTPPEEERTFFH 761
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 34.7 bits (79), Expect = 0.71
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 6/41 (14%)
Query: 1634 EPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
P P TT+++TTT TTT+TTT P TT
Sbjct: 669 APGSVTVPAAANTTTSSTTT------TTTTTTTAAPTTTTT 703
Score = 34.0 bits (77), Expect = 1.1
Identities = 14/54 (25%), Positives = 19/54 (35%)
Query: 1636 EQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
+TTTT+TTTTT P TT + P+ + T
Sbjct: 678 AANTTTSSTTTTTTTTTTAAPTTTTTKAANAPFTYPLCNLIMSAACSAGGAGCT 731
Score = 32.1 bits (72), Expect = 4.7
Identities = 14/35 (40%), Positives = 18/35 (51%)
Query: 1639 EEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
TTT+++TTTTT TT+TT P T
Sbjct: 677 AAANTTTSSTTTTTTTTTTAAPTTTTTKAANAPFT 711
Score = 31.3 bits (70), Expect = 7.6
Identities = 15/57 (26%), Positives = 25/57 (43%)
Query: 1619 SKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTI 1675
S S+S T+ T TT++ STTTT+ +TS+ T + ++
Sbjct: 104 SASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSGLTSGASVVSL 160
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 34.5 bits (79), Expect = 0.74
Identities = 21/105 (20%), Positives = 44/105 (41%), Gaps = 18/105 (17%)
Query: 59 LIPEDLEYDVIKGG------YKSFLGLKEANPELKVYLAVKSNFVSITSDRESRLNFISS 112
L+P +Y+ +GG F ++A P L V L N + +D ++
Sbjct: 274 LMPLPGDYNERRGGIILDISLNEFE--QDAVPLL-VDLERNGNRIKELADF---GDYYED 327
Query: 113 VLEMFDMYKFDGLDLNVKDPALNDEDDDD----LESIANERSDFS 153
+ E+ ++ + D+ ++ DE+ +D ES+ E ++
Sbjct: 328 IFEVVEVVEKQEGDVVTEE--STDEESEDEVEIDESVIEEVAEME 370
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 34.7 bits (79), Expect = 0.78
Identities = 17/86 (19%), Positives = 25/86 (29%), Gaps = 15/86 (17%)
Query: 1636 EQTEEPITTTTTSTTTTTKRPKP-PTTTSTTTTRPKP------ITTIKPKPTTVKPKPTT 1688
I T+ T++P P P + P P + KP P P
Sbjct: 20 ANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLA-QAPTPAA 78
Query: 1689 TVKPKP-------TTVKPKPVIPPST 1707
+ K P + P P + P
Sbjct: 79 SEKFDPAPAPHQAASRAPDPAVAPQL 104
Score = 33.5 bits (76), Expect = 1.5
Identities = 15/113 (13%), Positives = 31/113 (27%), Gaps = 19/113 (16%)
Query: 1609 TEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTR 1668
T + T + + T+ ++ + ++ P P ++ +R
Sbjct: 11 TIACINLIIANLNTNIPIPELHTSAATQKPDPAPAPHQAASR----APDPAVAPTSAASR 66
Query: 1669 -------PKPITTIKPKPTTVKPKPTTTVKPKPTTV-------KPKPVIPPST 1707
P P + K P P + P P KP ++
Sbjct: 67 KPDLAQAPTPAASEKFDP-APAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTS 118
>gnl|CDD|221562 pfam12406, DUF3664, Surface protein. This family of proteins is
found in eukaryotes. Proteins in this family are
typically between 131 and 312 amino acids in length.
Length = 100
Score = 31.9 bits (72), Expect = 0.79
Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 4/76 (5%)
Query: 1633 EEPEQTEEPITTTTTSTTTTTK---RPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
++ EQ++EP T+ P+ + T P+P+T+ +PK + +PT T
Sbjct: 25 DQAEQSQEPTQQEPIEPQQPTQPETEPEELEPETVTVEVPEPVTSEEPKESDQTEEPTET 84
Query: 1690 VKPKPTTVKPKPVIPP 1705
K +PV P
Sbjct: 85 QDSKQEPT-QQPVDEP 99
>gnl|CDD|140307 PTZ00284, PTZ00284, protein kinase; Provisional.
Length = 467
Score = 34.6 bits (79), Expect = 0.82
Identities = 24/122 (19%), Positives = 41/122 (33%), Gaps = 10/122 (8%)
Query: 1590 LSTITEVLGHGPGGNYESTTEEYKPTSEESKP-TTVSTSTVVTTEEPEQTEEPITTTTTS 1648
+ T L G Y+ T+ + P S ST TT +T ++
Sbjct: 1 MDTRAAGLSAARAGLYQYTSGAPVNALSGNSPKANNSASTGQTTSR---------STNSA 51
Query: 1649 TTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
+ +KR + T++ + K T P V P P K +P ++
Sbjct: 52 RRSGSKRDRETATSTDSGRTKSHEGAATTKQATTTPTTNVEVAPPPKKKKVTYALPNQSR 111
Query: 1709 DE 1710
+E
Sbjct: 112 EE 113
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 33.2 bits (76), Expect = 0.92
Identities = 14/44 (31%), Positives = 17/44 (38%)
Query: 1620 KPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
K TT S V PE E P T+ TT ++ K T
Sbjct: 25 KSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTK 68
>gnl|CDD|200461 cd11322, AmyAc_Glg_BE, Alpha amylase catalytic domain found in the
Glycogen branching enzyme (also called 1,4-alpha-glucan
branching enzyme). The glycogen branching enzyme
catalyzes the third step of glycogen biosynthesis by the
cleavage of an alpha-(1,4)-glucosidic linkage and the
formation a new alpha-(1,6)-branch by subsequent
transfer of cleaved oligosaccharide. They are part of a
group called branching enzymes which catalyze the
formation of alpha-1,6 branch points in either glycogen
or starch. This group includes proteins from bacteria,
eukaryotes, and archaea. The Alpha-amylase family
comprises the largest family of glycoside hydrolases
(GH), with the majority of enzymes acting on starch,
glycogen, and related oligo- and polysaccharides. These
proteins catalyze the transformation of alpha-1,4 and
alpha-1,6 glucosidic linkages with retention of the
anomeric center. The protein is described as having 3
domains: A, B, C. A is a (beta/alpha) 8-barrel; B is a
loop between the beta 3 strand and alpha 3 helix of A; C
is the C-terminal extension characterized by a Greek
key. The majority of the enzymes have an active site
cleft found between domains A and B where a triad of
catalytic residues (Asp, Glu and Asp) performs
catalysis. Other members of this family have lost the
catalytic activity as in the case of the human 4F2hc, or
only have 2 residues that serve as the catalytic
nucleophile and the acid/base, such as Thermus A4
beta-galactosidase with 2 Glu residues (GH42) and human
alpha-galactosidase with 2 Asp residues (GH31). The
family members are quite extensive and include: alpha
amylase, maltosyltransferase, cyclodextrin
glycotransferase, maltogenic amylase, neopullulanase,
isoamylase, 1,4-alpha-D-glucan maltotetrahydrolase,
4-alpha-glucotransferase, oligo-1,6-glucosidase,
amylosucrase, sucrose phosphorylase, and amylomaltase.
Length = 402
Score = 34.0 bits (79), Expect = 0.93
Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
Query: 569 HVEFMSLMSYDYHGAWE-GITGLVSPLNSRPGEP 601
HVE M +M + + G+W +TG +P SR G P
Sbjct: 75 HVELMPVMEHPFDGSWGYQVTGYFAP-TSRYGTP 107
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 34.5 bits (79), Expect = 0.97
Identities = 19/113 (16%), Positives = 33/113 (29%), Gaps = 9/113 (7%)
Query: 1602 GGNYESTTEEYKPTSEESKPTTV---STSTVVTTEEPEQTEE---PITTTTTSTTTTTKR 1655
G + ++ + +KP +SK S + +T S+ K
Sbjct: 467 GKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKANLQDVGECSSPPNNKE 526
Query: 1656 P---KPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPP 1705
+ T++S + I P K + TT PT K
Sbjct: 527 KNDKQTSTSSSVLKSDRSSIEVRNPNANVKKLEDTTYNAKFPTVSKNNAYTLV 579
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and is
typically between 384 and 430 amino acids in length.
Length = 414
Score = 34.1 bits (78), Expect = 1.0
Identities = 27/106 (25%), Positives = 39/106 (36%), Gaps = 8/106 (7%)
Query: 1607 STTEEYKPTSEESKPTTVSTSTV--VTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST 1664
T+ KP E S T +S T +P+ E+ TT K P+ S
Sbjct: 234 EKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAPPIDTTEEEL--KSPEASPKESE 291
Query: 1665 TTTRPKPI-TTIKPKPTTVKPKPTTTVKPKPT---TVKPKPVIPPS 1706
+ K + + P P PKP + P + +PKP PP
Sbjct: 292 EASARKRSPSLLSPSPKAESPKPLASPGKSPRDPLSPRPKPQSPPV 337
Score = 33.7 bits (77), Expect = 1.3
Identities = 23/96 (23%), Positives = 36/96 (37%), Gaps = 8/96 (8%)
Query: 1614 PTSEESKPTTVSTST-VVTTEEPEQT--EEPITTTTTSTTTTTKRPKPPTTTSTTTTRPK 1670
P SK + S + ++ + E+T E+P T++ T + PKP T PK
Sbjct: 211 PPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKP-----RETLDPK 265
Query: 1671 PITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPS 1706
P TT + + PK + PS
Sbjct: 266 SPEKAPPIDTTEEELKSPEASPKESEEASARKRSPS 301
Score = 33.7 bits (77), Expect = 1.4
Identities = 19/113 (16%), Positives = 36/113 (31%), Gaps = 6/113 (5%)
Query: 1601 PGGNYESTTE----EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRP 1656
PG + +S ++ + + + K ++ + E++ P T + K P
Sbjct: 212 PGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAP 271
Query: 1657 KPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKD 1709
TT + + K P+ P P PKP+ P
Sbjct: 272 PIDTTEEELKSPEASPKESEEASAR-KRSPSLLS-PSPKAESPKPLASPGKSP 322
Score = 32.1 bits (73), Expect = 3.7
Identities = 20/101 (19%), Positives = 33/101 (32%), Gaps = 7/101 (6%)
Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTT-TTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
+ +E P + + + ++ I ++ + TK KP TS+ T
Sbjct: 197 SFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSA- 255
Query: 1674 TIKPKPTTV-KPKPTTTVKPKPTTV-KPKPVIPPSTKDEFK 1712
PKP PK P TT + K + E
Sbjct: 256 ---PKPRETLDPKSPEKAPPIDTTEEELKSPEASPKESEEA 293
>gnl|CDD|212516 cd09990, Agmatinase-like, Agmatinase-like family. Agmatinase
subfamily currently includes metalloenzymes such as
agmatinase, guanidinobutyrase, guanidopropionase,
formimidoylglutamase and proclavaminate
amidinohydrolase. Agmatinase (agmatine ureohydrolase;
SpeB; EC=3.5.3.11) is the key enzyme in the synthesis of
polyamine putrescine; it catalyzes hydrolysis of
agmatine to yield putrescine and urea. This enzyme has
been found in bacteria, archaea and eukaryotes,
requiring divalent Mn and sometimes Zn, Co or Ca for
activity. In mammals, the highest level of agmatinase
mRNA was found in liver and kidney. However, catabolism
of agmatine via agmatinase apparently is a not major
path; it is mostly catabolized via diamine oxidase.
Agmatinase has been shown to be down-regulated in tumor
renal cells. Guanidinobutyrase (Gbh, EC=3.5.3.7)
catalyzes hydrolysis of 4-guanidinobutanoate to yield
4-aminobutanoate and urea in arginine degradation
pathway. Activity has been shown for purified enzyme
from Arthrobacter sp. KUJ 8602. Additionally,
guanidinobutyrase is able to hydrolyze D-arginine,
3-guanidinopropionate, 5-guanidinovaleriate and
L-arginine with much less affinity, having divalent Zn
ions for catalysis. Proclavaminate amidinohydrolase
(Pah, EC 3.5.3.22) hydrolyzes amidinoproclavaminate to
yield proclavaminate and urea in clavulanic acid
biosynthesis. Activity has been shown for purified
enzyme from Streptomyces clavuligerus. Clavulanic acid
is the effective inhibitor of beta-lactamases. This acid
is used in combination with the penicillin amoxicillin
to prevent antibiotic's beta-lactam rings from
hydrolysis, thus keeping the antibiotics biologically
active.
Length = 275
Score = 33.7 bits (78), Expect = 1.1
Identities = 10/35 (28%), Positives = 15/35 (42%), Gaps = 2/35 (5%)
Query: 220 VNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIG 254
V + F+ I AV + GA P I++G
Sbjct: 55 VPVDPGDIEKTFDRIREAVAEIAEAGAIP--IVLG 87
>gnl|CDD|119356 cd02877, GH18_hevamine_XipI_class_III, This conserved domain family
includes xylanase inhibitor Xip-I, and the class III
plant chitinases such as hevamine, concanavalin B, and
PPL2, all of which have a glycosyl hydrolase family 18
(GH18) domain. Hevamine is a class III endochitinase that
hydrolyzes the linear polysaccharide chains of chitin and
peptidoglycan and is important for defense against
pathogenic bacteria and fungi. PPL2 (Parkia platycephala
lectin 2) is a class III chitinase from Parkia
platycephala seeds that hydrolyzes beta(1-4) glycosidic
bonds linking 2-acetoamido-2-deoxy-beta-D-glucopyranose
units in chitin.
Length = 280
Score = 33.8 bits (78), Expect = 1.1
Identities = 35/149 (23%), Positives = 48/149 (32%), Gaps = 50/149 (33%)
Query: 1411 KKKGVKVSLAIGG--------------------WNDSLGGKYSRLVNSATARQRFIEHVV 1450
+ KG KV L+IGG WN GG + + R V
Sbjct: 69 QSKGKKVLLSIGGAGGSYSLSSDADAKDFADYLWNAFGGG-------TDSGVPRPFGDAV 121
Query: 1451 KFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSP 1510
DG D D E+ + N DA +R L + L+AA
Sbjct: 122 -------VDGFDFDIEHGS--PENYDA--------LAKRLRSLFASDPSKKYYLTAAP-- 162
Query: 1511 SKQ-VINAAYDVKALSESL-DWISVMTYD 1537
Q A A++ L D+I V Y+
Sbjct: 163 --QCPYPDASLGDAIATGLFDFIFVQFYN 189
Score = 33.4 bits (77), Expect = 1.4
Identities = 35/143 (24%), Positives = 51/143 (35%), Gaps = 38/143 (26%)
Query: 1778 KKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHN------------F 1825
+ KG KV L+IGG AG YS L + A+ F ++ N
Sbjct: 69 QSKGKKVLLSIGG----AGGSYS-LSSDADAK-DFADYLWNAFGGGTDSGVPRPFGDAVV 122
Query: 1826 DGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFN---PHDLLLSAAVSPSKAVI 1882
DG D D E + + L K LR+ F L+AA +
Sbjct: 123 DGFDFDIE-------------HGSPENYDALAKRLRSLFASDPSKKYYLTAAP---QCPY 166
Query: 1883 DNAYDIPVMSENL-DWISVMTYD 1904
+A ++ L D+I V Y+
Sbjct: 167 PDASLGDAIATGLFDFIFVQFYN 189
>gnl|CDD|218902 pfam06121, DUF959, Domain of Unknown Function (DUF959). This
N-terminal domain is not expressed in the 'Short' isoform
of Collagen A.
Length = 202
Score = 33.2 bits (75), Expect = 1.2
Identities = 25/98 (25%), Positives = 34/98 (34%), Gaps = 2/98 (2%)
Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPI 1672
K + KP T + T + P Q P+ +T ++TT RP T STT +
Sbjct: 20 KKPTWLWKPYTELSPTASSAAVP-QASTPVQSTESTTTHVVPRPGE-TEESTTPASSEEP 77
Query: 1673 TTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
I K T P T V P +E
Sbjct: 78 KEIVEKGKQNVVPGTVATTPTVTPVAMDVASSPDLSEE 115
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 33.9 bits (78), Expect = 1.2
Identities = 14/79 (17%), Positives = 19/79 (24%), Gaps = 1/79 (1%)
Query: 1605 YESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST 1664
E T P + + + P S T KPP
Sbjct: 116 SEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTP-EPPAASKPTPPAAAKPPEPAPA 174
Query: 1665 TTTRPKPITTIKPKPTTVK 1683
P P+ P+ T V
Sbjct: 175 AKPPPTPVARADPRETRVP 193
Score = 33.5 bits (77), Expect = 1.6
Identities = 15/64 (23%), Positives = 15/64 (23%), Gaps = 2/64 (3%)
Query: 1656 PKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKIVC 1715
PP K T KP P P KPT P PP K
Sbjct: 122 GAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPT--PPAAAKPPEPAPAAKPPP 179
Query: 1716 YFTN 1719
Sbjct: 180 TPVA 183
Score = 32.7 bits (75), Expect = 2.5
Identities = 15/77 (19%), Positives = 20/77 (25%), Gaps = 1/77 (1%)
Query: 1623 TVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTV 1682
+ T P TT P P ++ T P +P P
Sbjct: 117 EIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAK 176
Query: 1683 KP-KPTTTVKPKPTTVK 1698
P P P+ T V
Sbjct: 177 PPPTPVARADPRETRVP 193
Score = 32.3 bits (74), Expect = 3.2
Identities = 14/73 (19%), Positives = 18/73 (24%)
Query: 1627 STVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
S + T P + TT ++PK T KP KP P
Sbjct: 116 SEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAA 175
Query: 1687 TTTVKPKPTTVKP 1699
P
Sbjct: 176 KPPPTPVARADPR 188
Score = 32.0 bits (73), Expect = 4.4
Identities = 14/61 (22%), Positives = 15/61 (24%)
Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
I T TT +PK P KP P KP KP
Sbjct: 118 IDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKP 177
Query: 1702 V 1702
Sbjct: 178 P 178
Score = 31.2 bits (71), Expect = 7.0
Identities = 15/71 (21%), Positives = 19/71 (26%), Gaps = 1/71 (1%)
Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK-PKPTTTVKPKPTTVKP 1699
P++ T P T P+ P P KPT KP P
Sbjct: 114 PLSEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAP 173
Query: 1700 KPVIPPSTKDE 1710
PP+
Sbjct: 174 AAKPPPTPVAR 184
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 33.5 bits (77), Expect = 1.3
Identities = 31/126 (24%), Positives = 36/126 (28%), Gaps = 7/126 (5%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
S +S P ST+T T P TT T S T P P
Sbjct: 156 QNSGQSVPLDTSTTTDPATT-PAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQ 214
Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK--DEFKIVCYFTNWAWY--RQSGGK 1729
T P T P P ST D +V FT W + GK
Sbjct: 215 ANVDTAATPAPAAPAT--PDGAAPLPTDQAGVSTPAADPNALVMNFTADCWLEVTDATGK 272
Query: 1730 YLPSDI 1735
L S +
Sbjct: 273 KLFSGM 278
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This family
includes gbp a protein from Soybean that binds to GAGA
element dinucleotide repeat DNA. It seems likely that the
this domain mediates DNA binding. This putative domain
contains several conserved cysteines and a histidine
suggesting this may be a zinc-binding DNA interaction
domain.
Length = 301
Score = 33.3 bits (76), Expect = 1.4
Identities = 20/81 (24%), Positives = 26/81 (32%), Gaps = 14/81 (17%)
Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
T E E P +T K+PK + PK PKP K K
Sbjct: 126 TREMHHLEVPPISTAPPEAKEVKKPK----KGQSPKVPK-----APKPKKPKKKG----- 171
Query: 1692 PKPTTVKPKPVIPPSTKDEFK 1712
P I P +K ++K
Sbjct: 172 SVSNRSVKMPGIDPRSKPDWK 192
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 33.9 bits (77), Expect = 1.4
Identities = 30/129 (23%), Positives = 46/129 (35%), Gaps = 20/129 (15%)
Query: 1609 TEEYKPTSEESKPTTVSTSTVVTTEEPEQTE-EPITTTTTS---------TTTTTKRPKP 1658
TE+ S+E T ++ + +P+T+ TTS T P P
Sbjct: 543 TEDLDIESDEPASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHP 602
Query: 1659 PTTTSTTTTRPKPITTIKPK--PTTVKPKPTTTVKPKPTT----VKPKPVIPPSTKDEFK 1712
T TT+ T P+ P ++P P ++ +P T V P P PP +
Sbjct: 603 SQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITP- 661
Query: 1713 IVCYFTNWA 1721
Y W
Sbjct: 662 ---YKPTWT 667
Score = 31.6 bits (71), Expect = 7.3
Identities = 18/72 (25%), Positives = 24/72 (33%), Gaps = 6/72 (8%)
Query: 1635 PEQTEEPITTTTTST-TTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPK 1693
P Q + T T T P P+ T T P I+ P T++P P +
Sbjct: 651 PHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLP-----IQWAPGTMQPPPRAPTPMR 705
Query: 1694 PTTVKPKPVIPP 1705
P P P
Sbjct: 706 PPAAPPGRAQRP 717
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 33.5 bits (77), Expect = 1.4
Identities = 15/97 (15%), Positives = 25/97 (25%), Gaps = 2/97 (2%)
Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
+S + +S + T E E +T K +
Sbjct: 18 KSKLQPISYIYSNVL--VLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKK 75
Query: 1666 TTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPV 1702
+ K + + P + K K KPKP
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112
Score = 33.2 bits (76), Expect = 1.8
Identities = 15/88 (17%), Positives = 22/88 (25%), Gaps = 4/88 (4%)
Query: 1602 GGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTT 1661
+ E SEE +++ E+ + + K PK
Sbjct: 29 SNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGE 88
Query: 1662 TSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
T PK K PKP
Sbjct: 89 TKLGFKTPKK----SKKTKKKPPKPKPN 112
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of AF4
(Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain. Lilliputian
represents a novel pair-rule gene that acts in
cytoskeleton regulation, segmentation and morphogenesis
in Drosophila.
Length = 1154
Score = 33.7 bits (77), Expect = 1.5
Identities = 31/123 (25%), Positives = 45/123 (36%), Gaps = 19/123 (15%)
Query: 1603 GNYESTTEEYKPTSEESKPTTVST-------------STVVTTEEPEQTEEPITTTTTST 1649
G +S+ +Y P S+E P + S S + P Q+E P T
Sbjct: 479 GKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSPAQSEAPPQRRTVGK 538
Query: 1650 TTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTV-----KPKPTTTVKPKPTTVK-PKPVI 1703
K K T RP+ P ++V +PK T KP+ K PK +
Sbjct: 539 KQPKKPEKASAGDERTGLRPESEPGTLPYGSSVQTPPDRPKAATKGSRKPSPRKEPKSSV 598
Query: 1704 PPS 1706
PP+
Sbjct: 599 PPA 601
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved from
plants to fungi to humans - the number and representation
of individual subunits varying with species. It is
arranged into four different sections, a core, a head, a
tail and a kinase-activity part, and the number of
subunits within each of these is what varies with
species. Overall, Mediator regulates the transcriptional
activity of RNA polymerase II but it would appear that
each of the four different sections has a slightly
different function. Mediator subunit Hrs1/Med3 is a
physical target for Cyc8-Tup1, a yeast transcriptional
co-repressor.
Length = 381
Score = 33.5 bits (76), Expect = 1.6
Identities = 19/82 (23%), Positives = 29/82 (35%)
Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTT 1666
S T+ + + +T +T + P TTT + TT T T+ TT
Sbjct: 130 SITKTSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTAAISATTATTPTT 189
Query: 1667 TRPKPITTIKPKPTTVKPKPTT 1688
T+ KP + K T
Sbjct: 190 TQKKPRKPRQTKKTGPAAAAKA 211
Score = 31.5 bits (71), Expect = 6.5
Identities = 19/82 (23%), Positives = 26/82 (31%), Gaps = 14/82 (17%)
Query: 1640 EPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK-------- 1691
+ +++ T T TTS+T P + P TT V
Sbjct: 121 QLGNAGASASITKTSNGSDAATTSSTANTPA-AAKVLKANAASAPNTTTGVGSAATTAAI 179
Query: 1692 -----PKPTTVKPKPVIPPSTK 1708
PTT + KP P TK
Sbjct: 180 SATTATTPTTTQKKPRKPRQTK 201
>gnl|CDD|119360 cd06543, GH18_PF-ChiA-like, PF-ChiA is an uncharacterized chitinase
found in the hyperthermophilic archaeon Pyrococcus
furiosus with a glycosyl hydrolase family 18 (GH18)
catalytic domain as well as a cellulose-binding domain.
Members of this domain family are found not only in
archaea but also in eukaryotes and prokaryotes. PF-ChiA
exhibits hydrolytic activity toward both colloidal and
crystalline (beta/alpha) chitins at high temperature.
Length = 294
Score = 33.0 bits (76), Expect = 1.7
Identities = 35/216 (16%), Positives = 63/216 (29%), Gaps = 50/216 (23%)
Query: 1716 YFTNWAWYRQSGGKY-LPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWA-----DLDNK 1769
Y Y + + F V KP W D
Sbjct: 6 YVDVTLNPPPDLTTYAAATGVK-----AFTLAFIVASGGC---KP--AWGGSYPLDQGGW 55
Query: 1770 FYEKVTALKKKGVKVTLAIGGWNDS--AGNKYSRLVNSQQARSKFIAHVVNFILEHNFDG 1827
+ AL+ G V ++ GG + + A + ++ Q + + + +
Sbjct: 56 IKSDIAALRAAGGDVIVSFGGASGTPLATSC----TSADQLAAAYQKVIDAY----GLTH 107
Query: 1828 LDLDWEYPKCWQVDCKQGPASDKQGFADL----IKELRAAFNPHDLLLSAAVSPS----- 1878
LD D E G A D + L+ + + + V P+
Sbjct: 108 LDFDIE-----------GGALTDTAAIDRRAQALALLQKEYPDLKISFTLPVLPTGLTPD 156
Query: 1879 -KAVIDNAYDIPVMSENLDWISVMTYDYHGQWDKKT 1913
V++ A V +LD +++MT DY +
Sbjct: 157 GLNVLEAAAANGV---DLDTVNIMTMDYGSSAGSQD 189
>gnl|CDD|215044 PLN00063, PLN00063, photosystem II core complex proteins psbY;
Provisional.
Length = 194
Score = 32.2 bits (73), Expect = 1.9
Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 3/51 (5%)
Query: 1643 TTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKP---TTVKPKPTTTV 1690
T +T + + S T T+P + +++ KP T K T V
Sbjct: 3 TMAALATKCLSINARKNPNPSKTKTKPIILLSMQNKPKGLTISKSADNTNV 53
>gnl|CDD|115671 pfam07032, DUF1322, Protein of unknown function (DUF1322). This
family consists of several hypothetical 9.4 kDa Borrelia
burgdorferi (Lyme disease spirochete) proteins of around
78 residues in length. The function of this family is
unknown.
Length = 80
Score = 30.5 bits (68), Expect = 1.9
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 527 NFVKLIQELKAEFDKHDYSIAVGISGYKEILEVAYD 562
+ KLI E+ E DK+ + I +GI Y ++ ++ YD
Sbjct: 21 KYFKLIDEITREADKYYFPIIIGICSYDDVKKLPYD 56
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 33.0 bits (75), Expect = 2.3
Identities = 12/61 (19%), Positives = 18/61 (29%)
Query: 1653 TKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
+ P + + P P T P P KP KP+ P +K
Sbjct: 15 EQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWK 74
Query: 1713 I 1713
+
Sbjct: 75 L 75
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins in
this family for which functions are known are components
of a multiprotein complex used for targeting nucleotide
excision repair to specific parts of the genome. In
humans, Rad23 complexes with the XPC protein. This family
is based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 378
Score = 32.9 bits (75), Expect = 2.3
Identities = 12/71 (16%), Positives = 22/71 (30%), Gaps = 6/71 (8%)
Query: 1642 ITTTTTSTTTTTKRPKPPT--TTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKP 1699
++ T T PT T T + P + + P + + + + + T
Sbjct: 74 VSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPE 133
Query: 1700 KPVIPPSTKDE 1710
PST
Sbjct: 134 ----SPSTSVP 140
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 31.5 bits (71), Expect = 2.3
Identities = 16/98 (16%), Positives = 27/98 (27%), Gaps = 4/98 (4%)
Query: 1611 EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP----TTTSTTT 1666
E++P E ++ + V + + +P + +
Sbjct: 53 EHEPNQEVTEVEVIIEKEPVPAVAVAPVPVAVVAPVVAPKPKKSQPVMSQEKTASPQKSV 112
Query: 1667 TRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIP 1704
P P K K V+P P V K PV
Sbjct: 113 PAPSPKEKKKKKVAKVEPAPAKAVAVPVLASKSAPVPA 150
>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47.
Length = 627
Score = 32.9 bits (74), Expect = 2.4
Identities = 28/130 (21%), Positives = 45/130 (34%), Gaps = 26/130 (20%)
Query: 1604 NYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
N++S ++ S ++++ + +T P P+T T T T + P T T
Sbjct: 232 NFDSILTTTPSSTPSSTSASITSPHIPSTNIPTPEPPPVTKNFTELHTDTIKVTPNTPTI 291
Query: 1664 TTTTR------------PKPI--------------TTIKPKPTTVKPKPTTTVKPKPTTV 1697
T T P+P+ TIK + T P PKPT
Sbjct: 292 TAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPKSPPKPTNF 351
Query: 1698 KPKPVIPPST 1707
+ + P T
Sbjct: 352 ENTTIRIPET 361
>gnl|CDD|115650 pfam07010, Endomucin, Endomucin. This family consists of several
mammalian endomucin proteins. Endomucin is an early
endothelial-specific antigen that is also expressed on
putative hematopoietic progenitor cells.
Length = 259
Score = 32.4 bits (73), Expect = 2.6
Identities = 23/119 (19%), Positives = 35/119 (29%), Gaps = 5/119 (4%)
Query: 1585 GVHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITT 1644
L + G P G S + S + TT TT + E +
Sbjct: 45 NTVSLKNVNKPTTGTPPKGTTTSELLKTSLMSTATSLTTPKHELKTTTTGVRKNESSTSK 104
Query: 1645 TTTSTTTT-----TKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
T + T T + T + ++ R I+ PK T T TT +
Sbjct: 105 VTVTNVTLSNAVSTLQSSQNKTENQSSIRTTEISPTSVLQPDASPKKTGTTSASLTTAE 163
Score = 32.0 bits (72), Expect = 3.9
Identities = 28/118 (23%), Positives = 41/118 (34%), Gaps = 8/118 (6%)
Query: 1604 NYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTT------TSTTTTTKRPK 1657
N +E K + S TT +T +TT + TT T+T+ K
Sbjct: 15 NSLCNSEGVKEAANNSLVTTSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELLKTSL 74
Query: 1658 PPTTTSTTTTRPKPITTIKP--KPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKI 1713
T TS TT + + TT K + K T T V T+++ I
Sbjct: 75 MSTATSLTTPKHELKTTTTGVRKNESSTSKVTVTNVTLSNAVSTLQSSQNKTENQSSI 132
>gnl|CDD|99803 cd06207, CyPoR_like, NADPH cytochrome p450 reductase (CYPOR) serves
as an electron donor in several oxygenase systems and is
a component of nitric oxide synthases and methionine
synthase reductases. CYPOR transfers two electrons from
NADPH to the heme of cytochrome p450 via FAD and FMN.
Ferredoxin-NADP+ (oxido)reductase is an FAD-containing
enzyme that catalyzes the reversible electron transfer
between NADP(H) and electron carrier proteins such as
ferredoxin and flavodoxin. Isoforms of these
flavoproteins (i.e. having a non-covalently bound FAD as
a prosthetic group) are present in chloroplasts,
mitochondria, and bacteria in which they participate in
a wide variety of redox metabolic pathways. The
C-terminal domain contains most of the NADP(H) binding
residues and the N-terminal domain interacts
non-covalently with the isoalloxazine rings of the
flavin molecule which lies largely in a large gap
betweed the two domains. Ferredoxin-NADP+ reductase
first accepts one electron from reduced ferredoxin to
form a flavin semiquinone intermediate. The enzyme then
accepts a second electron to form FADH2 which then
transfers two electrons and a proton to NADP+ to form
NADPH.
Length = 382
Score = 32.6 bits (75), Expect = 3.0
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 446 ISSSPLVTHGKVKILIAIGGWTDSSGEK 473
ISSSPL +V +L+++ W SG
Sbjct: 169 ISSSPLKNPNEVHLLVSLVSWKTPSGRS 196
>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is about 100
amino acids in length. This domain is found associated
with pfam00096. This domain has two conserved sequence
motifs: YPSPV and PSP.
Length = 100
Score = 30.5 bits (68), Expect = 3.1
Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 1/70 (1%)
Query: 1638 TEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKP-TTVKPKPTTTVKPKPTT 1696
P++T + + + + P + T+ P P+ T P ++ P P T P P+
Sbjct: 3 KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSPVHTSFPSPSI 62
Query: 1697 VKPKPVIPPS 1706
P + P+
Sbjct: 63 ATTYPSVSPT 72
>gnl|CDD|212088 cd11519, SLC5sbd_SMCT1, Na(+)/monocarboxylate cotransporter SMCT1 and
related proteins; solute-binding domain. SMCT1 is a
high-affinity transporter of various monocarboxylates
including lactate and pyruvate, short-chain fatty acids,
ketone bodies, nicotinate and its structural analogs,
pyroglutamate, benzoate and its derivatives, and iodide.
Human SMCT1 (hSMCT1, also called AIT) is encoded by the
tumor suppressor gene SLC5A8. Its expression is under the
control of the C/EBP transcription factor. Its
tumor-suppressive role is related to uptake of butyrate,
propionate, and pyruvate, these latter are inhibitors of
histone deacetylases. SMCT1 is expressed in the colon,
small intestine, kidney, thyroid gland, retina, and
brain. SMCT1 may contribute to the intestinal/colonic and
oral absorption of monocarboxylate drugs. SMCT1 also
mediates iodide transport from thyrocyte into the colloid
lumen in thyroid gland and through transporting l-lactate
and ketone bodies helps maintain the energy status and
the function of neurons. In the kidney its expression is
limited to the S3 segment of the proximal convoluted
tubule (in contrast to the low-affinity monocarboxylate
transporter SMCT2, belonging to a different family, which
is expressed along the entire length of the tubule). In
the retina, SMCT1 and SMCT2 may play a differential role
in monocarboxylate transport in a cell type-specific
manner, SMCT1 is expressed predominantly in retinal
neurons and in retinal pigmented epithelial (RPE) cells.
This subgroup belongs to the solute carrier 5 (SLC5)
transporter family.
Length = 541
Score = 32.4 bits (74), Expect = 3.6
Identities = 19/60 (31%), Positives = 23/60 (38%), Gaps = 14/60 (23%)
Query: 1612 YKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
Y P E + P +ST E T T S TTTT + P T T P+P
Sbjct: 457 YPPLPERTLPLPLST------------EGCNFTNTESNTTTT--TEMPFTIFPTQPSPRP 502
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 32.3 bits (73), Expect = 4.1
Identities = 14/90 (15%), Positives = 28/90 (31%), Gaps = 2/90 (2%)
Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTT 1666
+ + ++ + T T + Q + I + + + T PP
Sbjct: 355 APSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMT--AYPPVPQFCGD 412
Query: 1667 TRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
+ T+ P+P V P+PT
Sbjct: 413 PGLVSPYNPQSPGTSYGPEPVGPVPPQPTN 442
>gnl|CDD|145556 pfam02478, Pneumo_phosprot, Pneumovirus phosphoprotein. This family
represents the phosphoprotein of Paramyxoviridae, a
putative RNA polymerase alpha subunit that may function
in template binding.
Length = 266
Score = 31.8 bits (72), Expect = 4.2
Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 4/77 (5%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP---- 1669
+ S+ + T + TT + ++ I + + KP T TS T
Sbjct: 38 KVNTISEKFELPTISKPTTPKGPCIKDEIISVNKVKDIESIYEKPVTPTSDGKTPTEKSD 97
Query: 1670 KPITTIKPKPTTVKPKP 1686
T K K + +P+P
Sbjct: 98 DTPTKKKKKVSFKEPEP 114
Score = 30.6 bits (69), Expect = 9.2
Identities = 14/62 (22%), Positives = 20/62 (32%), Gaps = 3/62 (4%)
Query: 1650 TTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKD 1709
T + K P + TT PK + K K ++ KP T P D
Sbjct: 41 TISEKFELPTISKPTT---PKGPCIKDEIISVNKVKDIESIYEKPVTPTSDGKTPTEKSD 97
Query: 1710 EF 1711
+
Sbjct: 98 DT 99
>gnl|CDD|212074 cd11505, SLC5sbd_SMCT, Na(+)/monocarboxylate cotransporters SMCT1 and
2 and related proteins; solute-binding domain. SMCT1 is
a high-affinity transporter of various monocarboxylates
including lactate and pyruvate, short-chain fatty acids,
ketone bodies, nicotinate and its structural analogs,
pyroglutamate, benzoate and its derivatives, and iodide.
Human SMCT1 (hSMCT1, also called AIT) is encoded by the
tumor suppressor gene SLC5A8. SMCT1 is expressed in the
colon, small intestine, kidney, thyroid gland, retina,
and brain. SMCT1 may contribute to the intestinal/colonic
and oral absorption of monocarboxylate drugs. It also
mediates iodide transport from thyrocyte into the colloid
lumen in thyroid gland and, through transporting
L-lactate and ketone bodies, helps maintain the energy
status and the function of neurons. SMCT2 is a
low-affinity transporter for short-chain fatty acids,
lactate, pyruvate, and nicotinate. hSMCT2 is encoded by
the SLC5A12 gene. SMCT2 is expressed in the kidney, small
intestine, skeletal muscle, and retina. In the kidney,
SMCT2 may initiate lactate absorption in the early parts
of the tubule, SMCT1 in the latter parts of the tubule.
In the retina, SMCT1 and SMCT2 may play a differential
role in monocarboxylate transport in a cell type-specific
manner. This subgroup belongs to the solute carrier 5
(SLC5) transporter family.
Length = 536
Score = 32.0 bits (73), Expect = 4.3
Identities = 19/60 (31%), Positives = 23/60 (38%), Gaps = 14/60 (23%)
Query: 1612 YKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
Y P E + P +ST E T T S TTTT + PTT T +P
Sbjct: 457 YPPPPERTLPLPLST------------EGCNFTNTESNTTTT--TEMPTTIFPTQPSSRP 502
>gnl|CDD|237052 PRK12313, PRK12313, glycogen branching enzyme; Provisional.
Length = 633
Score = 32.2 bits (74), Expect = 4.7
Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 4/35 (11%)
Query: 569 HVEFMSLMSYDYHGAWEG--ITGLVSPLNSRPGEP 601
HVEFM LM + G+W G +TG +P SR G P
Sbjct: 187 HVEFMPLMEHPLDGSW-GYQLTGYFAP-TSRYGTP 219
>gnl|CDD|201431 pfam00763, THF_DHG_CYH, Tetrahydrofolate
dehydrogenase/cyclohydrolase, catalytic domain.
Length = 117
Score = 30.2 bits (69), Expect = 5.2
Identities = 14/45 (31%), Positives = 24/45 (53%), Gaps = 8/45 (17%)
Query: 1772 EKVTALKKKGVKVTLAI---GGWNDSAGNKYSRLVNSQQARSKFI 1813
E+V LK+KG+ LA+ G +D A Y V S++ ++ +
Sbjct: 18 EEVAKLKEKGITPKLAVILVG--DDPASQVY---VRSKRKAAEEL 57
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is a
family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 31.7 bits (72), Expect = 6.2
Identities = 18/74 (24%), Positives = 27/74 (36%), Gaps = 3/74 (4%)
Query: 1242 KMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTST 1301
K K P + + S+ T +KP KP K +T +P K +T S
Sbjct: 52 KPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKS---KAPSTESE 108
Query: 1302 TTDSGAWTPNPTEW 1315
+ TP+P
Sbjct: 109 EEEEPEETPDPIAS 122
>gnl|CDD|183673 PRK12676, PRK12676, bifunctional inositol-1
monophosphatase/fructose-1,6-bisphosphatase; Reviewed.
Length = 263
Score = 31.0 bits (71), Expect = 6.5
Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 5/47 (10%)
Query: 252 IIGIPFFGKSYRLFNRSE--YGLGATVKGPGTEGKYTQMPGYLAFFE 296
I GIPF+ S +F + YG V T Y +PG A+
Sbjct: 96 INGIPFYAISIAVFKGGKPVYGY---VYNLATGDFYEAIPGKGAYLN 139
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 31.4 bits (71), Expect = 6.5
Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 6/53 (11%)
Query: 1657 KPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKD 1709
KP ++ + + KP P KP ++ +PK KP PPS+ D
Sbjct: 201 KPSSSAAPAAPKAKPSPP--PPKEEEVEKPASSPEPK----ASKPSAPPSSGD 247
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This protein,
which interacts with both microtubules and TRAF3 (tumour
necrosis factor receptor-associated factor 3), is
conserved from worms to humans. The N-terminal region is
the microtubule binding domain and is well-conserved; the
C-terminal 100 residues, also well-conserved, constitute
the coiled-coil region which binds to TRAF3. The central
region of the protein is rich in lysine and glutamic acid
and carries KKE motifs which may also be necessary for
tubulin-binding, but this region is the least
well-conserved.
Length = 506
Score = 31.4 bits (71), Expect = 6.6
Identities = 14/78 (17%), Positives = 22/78 (28%), Gaps = 9/78 (11%)
Query: 1633 EEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKP 1692
+E + E K +PP + + + K V+
Sbjct: 121 KEKPKEEPK----DRKPKEEAKEKRPPKEKEKEKEKKV-----EEPRDREEEKKRERVRA 171
Query: 1693 KPTTVKPKPVIPPSTKDE 1710
K KP PP+ K E
Sbjct: 172 KSRPKKPPKKKPPNKKKE 189
>gnl|CDD|212004 TIGR04281, peripla_PGF_1, putative ABC transporter PGF-CTERM-modified
substrate-binding protein. Members of this archaeal
protein family resemble periplasmic substrate-binding
proteins of ABC transporters and appear in gene
neighborhoods with permease and ATP-binding cassette
proteins. Notably, essentially all members also have the
PGF-CTERM putative protein-sorting domain at the
C-terminus, while more distant homologs (excluded by the
trusted cutoff) instead have what appear to be
lipoprotein signal peptides at the N-terminus.
Length = 330
Score = 31.2 bits (71), Expect = 6.6
Identities = 21/76 (27%), Positives = 28/76 (36%), Gaps = 9/76 (11%)
Query: 1601 PGGNYESTT--EEYKPTSEESK------PTTV-STSTVVTTEEPEQTEEPITTTTTSTTT 1651
P Y STT EE + + P V + T+ PE EE T T + TT
Sbjct: 223 PTPAYNSTTAVEEGNVVAVNANYLSQPAPRVVEAVETLAEAFHPEAYEEAETADTEADTT 282
Query: 1652 TTKRPKPPTTTSTTTT 1667
+TTT+
Sbjct: 283 YDDSTTDAEDGTTTTS 298
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 31.4 bits (71), Expect = 6.6
Identities = 21/110 (19%), Positives = 37/110 (33%), Gaps = 17/110 (15%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTT-----------TTKRPKPPTTT 1662
P S E P +S T P+ ++ + +T +++TT T T+T
Sbjct: 91 PPSSEPTP-APPSSESTATRTPDPNQQALESTESTSTTSADCNDSEQSSTPNLNSSDTST 149
Query: 1663 STTTTRPKPITT----IKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
S++ P ++ + P PT +P P K
Sbjct: 150 SSSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKS-AEPTAAPQAK 198
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 31.3 bits (70), Expect = 6.6
Identities = 22/84 (26%), Positives = 33/84 (39%), Gaps = 10/84 (11%)
Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKP------TTVKPK 1685
T EP I+ + + + T K P TT +P P K KP KPK
Sbjct: 155 TVEPGTKVAIISKSEDAASQVTPSQKIPETTDP---KPSPPAEDKQKPKVESAPVAEKPK 211
Query: 1686 PTTTVKPKPTTVKPKPVIPPSTKD 1709
++ P P +P +PP ++
Sbjct: 212 APSS-PPPPKQSAKEPQLPPKERE 234
>gnl|CDD|178321 PLN02719, PLN02719, triacylglycerol lipase.
Length = 518
Score = 31.2 bits (70), Expect = 7.1
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 6/58 (10%)
Query: 2007 VRDRKRRIGPYAFKG-DQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDFKNFCG 2063
+RD R+I +G D W G D ++E ++Y ++ A A D D F +CG
Sbjct: 88 LRDTWRKI-----QGEDDWAGLMDPMDPVLRSELIRYGEMAQACYDAFDFDPFSRYCG 140
>gnl|CDD|146962 pfam04574, DUF592, Protein of unknown function (DUF592). This region
is found in some SIR2 family proteins (pfam02146).
Length = 153
Score = 30.1 bits (68), Expect = 7.7
Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 18/139 (12%)
Query: 1680 TTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKIVCYFTNWAWYRQSGGKYLPSDIDSDL 1739
+ V P + K V PP TKD+ F + R+ KYLP++++S
Sbjct: 1 SLVPQIPKKPIIIGKNPKTGKFVFPPITKDDSLNARMFLKYYGLRKFLDKYLPTELNSLY 60
Query: 1740 CTHVI--YGFAVLDTDQLVIKPHDTWADLDNKFYEKVTALKKKGVKVTLAIGGWNDSAGN 1797
++I GF + D +L N Y+ V L+ + D
Sbjct: 61 IYYLIKLLGFELKDQ------------ELLNAIYKNVD--VDANSAEKLSYTDFEDPLEK 106
Query: 1798 KY-SRLV-NSQQARSKFIA 1814
K+ RL+ + Q+A +K ++
Sbjct: 107 KHIVRLIKDLQKAMNKVLS 125
>gnl|CDD|218481 pfam05177, RCSD, RCSD region. Proteins contain this region include
C.elegans UNC-89. This region is found repeated in UNC-89
and shows conservation in prolines, lysines and glutamic
acids. Proteins with RCSD are involved in muscle M-line
assembly, but the function of this region RCSD is not
clear.
Length = 101
Score = 29.1 bits (64), Expect = 7.9
Identities = 20/85 (23%), Positives = 28/85 (32%)
Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
P +E P S S TEE TEE + + KP + T + K T
Sbjct: 5 PGKKEKSPLRRSPSRTEKTEEDRATEEAKSPEKKEKSPEKVDGKPKSPTKKEKSPEKSAT 64
Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVK 1698
PT + P + + K
Sbjct: 65 EEVKSPTKKEKSPEKVEEKPASPTK 89
>gnl|CDD|215584 PLN03113, PLN03113, DNA ligase 1; Provisional.
Length = 744
Score = 31.5 bits (71), Expect = 8.1
Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 7/77 (9%)
Query: 1644 TTTTSTTTTTKRPKPPT-------TTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
T + +++ KR T T + T PK TI+PK + KP+ +T+ + +
Sbjct: 25 QTQSQSSSPKKRKIGETQDANLGKTNVSEGTLPKTEDTIEPKSDSAKPRSSTSSIAEDSK 84
Query: 1697 VKPKPVIPPSTKDEFKI 1713
K S + ++
Sbjct: 85 TGTKKAQTLSKPKKDEM 101
>gnl|CDD|165461 PHA03191, PHA03191, UL14 tegument protein; Provisional.
Length = 238
Score = 30.8 bits (69), Expect = 8.6
Identities = 20/81 (24%), Positives = 26/81 (32%), Gaps = 7/81 (8%)
Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK---PKPTT 1688
EE ++ E + T R T T T PT V+ + T
Sbjct: 137 IEELDEEAEALLTKWILEQKPRPRLPTAKTAPPHTRDGTRTTEKARVPTGVQDTNSQATD 196
Query: 1689 TVKPKPTTVKP----KPVIPP 1705
P PTT P + VI P
Sbjct: 197 QNLPPPTTSTPSISREHVIHP 217
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 30.1 bits (68), Expect = 9.0
Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 5/59 (8%)
Query: 1641 PITTTTTSTTTTTKRPKPPTTTST---TTTRPKPITTIKPKPTTVKPKPTTT--VKPKP 1694
T + T TT P T+T P+P+ T + TV P + T P+P
Sbjct: 70 FRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAAREVKTVVPVSSVTPVTPPRP 128
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 30.8 bits (69), Expect = 9.3
Identities = 25/130 (19%), Positives = 44/130 (33%), Gaps = 6/130 (4%)
Query: 1586 VHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEP-ITT 1644
V+ L T ++V + G Y Y P +++ T + P + P +
Sbjct: 131 VNAFLRTKSQVGHYAADGTYVPAGGTYIPAG---GTYILASGTYIPPNPPREAPAPGLPK 187
Query: 1645 TTTSTTTTTKR--PKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPV 1702
T TS+ R PKP + +P + +P+ + T +
Sbjct: 188 TFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQ 247
Query: 1703 IPPSTKDEFK 1712
PP T+ K
Sbjct: 248 QPPQTEQGHK 257
>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
Length = 417
Score = 31.0 bits (70), Expect = 9.6
Identities = 21/105 (20%), Positives = 40/105 (38%), Gaps = 10/105 (9%)
Query: 1611 EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPK 1670
+Y+ T + + + + +T ++T+ + + T K+ P + PK
Sbjct: 64 KYRKTCKRCRVSDEDINRFLTKTFEDKTQVKVKVVSAPTK---KKKAMPKSVV---RAPK 117
Query: 1671 PITTIKPK---PTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
P+ P + KP P+ V P P + PS KD +
Sbjct: 118 PLENPVPAQAESSGSKPVPSIPVSTPEVKA-PAPALTPSQKDRLE 161
>gnl|CDD|227600 COG5275, COG5275, BRCT domain type II [General function prediction
only].
Length = 276
Score = 30.5 bits (68), Expect = 9.8
Identities = 17/84 (20%), Positives = 28/84 (33%), Gaps = 1/84 (1%)
Query: 1617 EESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK 1676
+ TT + I+ T+T+ P +T+ ++P T
Sbjct: 10 SDGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRA 69
Query: 1677 -PKPTTVKPKPTTTVKPKPTTVKP 1699
KP K + +TT K K T
Sbjct: 70 TRKPAQPKAEKSTTSKSKSHTTTA 93
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.134 0.421
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 121,802,524
Number of extensions: 11888196
Number of successful extensions: 13606
Number of sequences better than 10.0: 1
Number of HSP's gapped: 12065
Number of HSP's successfully gapped: 462
Length of query: 2417
Length of database: 10,937,602
Length adjustment: 113
Effective length of query: 2304
Effective length of database: 5,925,600
Effective search space: 13652582400
Effective search space used: 13652582400
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (29.6 bits)