BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015062
(414 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255547996|ref|XP_002515055.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
gi|223546106|gb|EEF47609.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
Length = 566
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/414 (78%), Positives = 364/414 (87%), Gaps = 1/414 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHISKPGWCCSN DGNGFFGD YFNPDLWIKGLT+MAT+FNGV NV+GMSLRNEL
Sbjct: 152 MVILDNHISKPGWCCSNFDGNGFFGDTYFNPDLWIKGLTQMATLFNGVTNVIGMSLRNEL 211
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG KQNV DWYRYM+ GAEAVH+ANP+VLVILSGLN+DKD SF+RN+ VNL+FTGK+VFE
Sbjct: 212 RGQKQNVNDWYRYMEKGAEAVHSANPDVLVILSGLNYDKDFSFLRNRPVNLSFTGKVVFE 271
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DGQAW GNPNQVCGRVVDN+MR+SGFLLEQGWP+FVSEFG D RG NVNDNR
Sbjct: 272 VHWYGFSDGQAWRSGNPNQVCGRVVDNLMRISGFLLEQGWPMFVSEFGVDQRGTNVNDNR 331
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF GVAAELDWDWALWTLVGSYYLR+GVIGLNEYYG+ +WNWCD+RNSSFL++IS+L
Sbjct: 332 YLGCFIGVAAELDWDWALWTLVGSYYLRQGVIGLNEYYGVLNWNWCDVRNSSFLQQISAL 391
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
QSPF+GPG+ ET HKVI+HP+TGLCVQRKS L+PL LG CT+SEAW YT T++L+G
Sbjct: 392 QSPFQGPGLSETNPHKVIFHPSTGLCVQRKSMLEPLRLGSCTDSEAWRYTSENTLTLRGT 451
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNT 360
YFCLQA +GKPAKLGIICTD S W++ISDSKMHLSSK NGT VCLDVDS+NTIV +T
Sbjct: 452 YFCLQADELGKPAKLGIICTDSTSKWDVISDSKMHLSSKITNGTAVCLDVDSNNTIVIST 511
Query: 361 CKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPI-LDLPGKDFIWKFFGL 413
CKCLSRD TCDP SQWFKLV+STRSS T K + N I LDLP K+F WKF GL
Sbjct: 512 CKCLSRDNTCDPESQWFKLVNSTRSSATAKPSLRINSILLDLPAKEFFWKFLGL 565
>gi|224102379|ref|XP_002312657.1| predicted protein [Populus trichocarpa]
gi|222852477|gb|EEE90024.1| predicted protein [Populus trichocarpa]
Length = 544
Score = 682 bits (1760), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/413 (76%), Positives = 360/413 (87%), Gaps = 1/413 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHISKPGWCCSNSDGNGFFGDQYF+PDLWI GLT+MA++FNGV NVVGMSLRNEL
Sbjct: 130 MVILDNHISKPGWCCSNSDGNGFFGDQYFDPDLWITGLTRMASMFNGVPNVVGMSLRNEL 189
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DWYRYMQ GAEAVH+ANP+V+VILSGLN+DKDLSF+RN+ VNLTF+ K+VFE
Sbjct: 190 RGPKQNVNDWYRYMQKGAEAVHSANPDVIVILSGLNYDKDLSFLRNRPVNLTFSRKIVFE 249
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGFTDGQAW +GNPNQVCGRVVDN+MR+SGFLL+QGWPLF+SEFG D RG NVNDNR
Sbjct: 250 VHWYGFTDGQAWKNGNPNQVCGRVVDNMMRISGFLLDQGWPLFMSEFGVDQRGTNVNDNR 309
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF GVAAELD+DWALWTLVGSYY R+GVIG+NEYYG+ + NW + RNS+FL++IS+L
Sbjct: 310 YLGCFLGVAAELDFDWALWTLVGSYYFRQGVIGMNEYYGVLNSNWRETRNSTFLQQISAL 369
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
QSPFRGPGV E LHKVI+HP+TGLCV RKS +PL LGPCT+SEAW+YTP K +S+KG
Sbjct: 370 QSPFRGPGVSEVHLHKVIFHPSTGLCVLRKSMFEPLRLGPCTQSEAWNYTPQKILSVKGT 429
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNT 360
YFCLQ + KPAKLGIICTD S WE ISDSKMHLSSKA NGT VCLD+ +NTIVT+T
Sbjct: 430 YFCLQTDELAKPAKLGIICTDSNSKWEAISDSKMHLSSKAPNGTAVCLDIGYNNTIVTST 489
Query: 361 CKCLSRDKTCDPASQWFKLVDST-RSSTTTKSFFQFNPILDLPGKDFIWKFFG 412
CKCLS+D TCDP SQWFKLV+ST RSST TK + IL+ P KDF+WKF G
Sbjct: 490 CKCLSKDNTCDPESQWFKLVNSTRRSSTMTKPSSLISSILNFPAKDFLWKFLG 542
>gi|224110842|ref|XP_002315653.1| predicted protein [Populus trichocarpa]
gi|222864693|gb|EEF01824.1| predicted protein [Populus trichocarpa]
Length = 513
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/390 (78%), Positives = 342/390 (87%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHISKPGWCCSNSDGNGFFGDQYF+PDLWI GLT+MA++F GV NVVGMSLRNEL
Sbjct: 123 MVILDNHISKPGWCCSNSDGNGFFGDQYFDPDLWITGLTRMASMFKGVPNVVGMSLRNEL 182
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DWYRYMQ GAEAVH+ANP+V+VILSGLN+DKDLSF+RN+ V+LTF+GK+VFE
Sbjct: 183 RGPKQNVNDWYRYMQKGAEAVHSANPDVIVILSGLNYDKDLSFLRNRPVHLTFSGKIVFE 242
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGFTDG+AW GN NQVCGRVVDN+MR+SGFLL+QGWPLFVSEFG D RG NVNDNR
Sbjct: 243 VHWYGFTDGEAWKSGNSNQVCGRVVDNMMRVSGFLLDQGWPLFVSEFGVDQRGTNVNDNR 302
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAELD DWALWTLVGSYYLR+GVIG+NEYYG+ +WNW ++RNS+FL+ IS+L
Sbjct: 303 YLGCFLSVAAELDLDWALWTLVGSYYLRQGVIGMNEYYGVLNWNWREVRNSTFLQLISAL 362
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
QSPFRGPG+ E HKVI+HP+TGLCV RKS L PL LG CTESEAWSYTP K +S+KG
Sbjct: 363 QSPFRGPGLSEANPHKVIFHPSTGLCVLRKSMLAPLRLGRCTESEAWSYTPQKILSVKGT 422
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNT 360
YFCLQ KPAKLGIICTD S WE ISDSKMHLSS A +G TVCLD+DS+NTIVTNT
Sbjct: 423 YFCLQTDDAAKPAKLGIICTDSNSKWETISDSKMHLSSNASSGITVCLDIDSNNTIVTNT 482
Query: 361 CKCLSRDKTCDPASQWFKLVDSTRSSTTTK 390
CKCLS+D CDP SQWFKLV+STRSST TK
Sbjct: 483 CKCLSKDNACDPESQWFKLVNSTRSSTMTK 512
>gi|225424600|ref|XP_002285424.1| PREDICTED: uncharacterized protein LOC100244027 [Vitis vinifera]
Length = 552
Score = 635 bits (1639), Expect = e-180, Method: Compositional matrix adjust.
Identities = 298/405 (73%), Positives = 341/405 (84%), Gaps = 2/405 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCCS+ DGNGFFGDQYFNPDLW++GLT+MAT+F GV NVVGMSLRNEL
Sbjct: 147 MVILDNHLSKPGWCCSSFDGNGFFGDQYFNPDLWVQGLTRMATMFRGVTNVVGMSLRNEL 206
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNVKDWYRYMQ GAEAVH+ANP+VLVI+SGL++D DLSFV Q + LTFTGKLVFE
Sbjct: 207 RGPKQNVKDWYRYMQKGAEAVHSANPDVLVIVSGLSYDTDLSFVLKQELELTFTGKLVFE 266
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGFTDG AW G+PNQVCGRVV++VMR G LLE+GWPLFVSEFG D RG NVNDNR
Sbjct: 267 MHWYGFTDGSAWETGSPNQVCGRVVESVMRRGGVLLEKGWPLFVSEFGVDQRGTNVNDNR 326
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YLNCFFG+AAELD+DWALWTLVGSYY REGVIGL E+YGL +WNWC++RNSSFL+RIS+L
Sbjct: 327 YLNCFFGLAAELDFDWALWTLVGSYYTREGVIGLEEFYGLLNWNWCEVRNSSFLQRISAL 386
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
QSPF+GP + + HKVI+HPATGLC+ KS +PLTLGPC ES+AWSYTP KT+ +K
Sbjct: 387 QSPFQGPDLSDARPHKVIFHPATGLCIVWKSVFEPLTLGPCPESDAWSYTPQKTLIMKET 446
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNT 360
YFCLQA G P KLGIICT+ GS WE ISDSKMHLS+K +GTTVCLD+DSSN IVTN
Sbjct: 447 YFCLQADGPGNPGKLGIICTEPGSKWESISDSKMHLSTKLGDGTTVCLDIDSSNNIVTNA 506
Query: 361 CKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKD 405
CKCLS D CDP SQWFK+V++T S T+ Q + LDL G +
Sbjct: 507 CKCLSGDNKCDPGSQWFKIVNATNIS--TRPLIQISSNLDLNGAE 549
>gi|356511061|ref|XP_003524250.1| PREDICTED: uncharacterized protein LOC100790415 [Glycine max]
Length = 557
Score = 625 bits (1613), Expect = e-177, Method: Compositional matrix adjust.
Identities = 284/391 (72%), Positives = 333/391 (85%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHI++PGWCCSNSDGNGFFGD++F+P+ WI GLTKMA++FNGV NVVGMSLRNEL
Sbjct: 153 MVILDNHITQPGWCCSNSDGNGFFGDKFFDPNQWILGLTKMASLFNGVTNVVGMSLRNEL 212
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DWY+YM GAEA+HAANP+VLVILSGLNFDKDLSF++N+ V+LTF GKLV+E
Sbjct: 213 RGPKQNVNDWYKYMVKGAEAIHAANPDVLVILSGLNFDKDLSFIQNRPVSLTFKGKLVYE 272
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
AHWY FTDGQAWV+GNPNQVCG+V N+MR SGFL+ QGWPLF+SEFG DLRG NVNDNR
Sbjct: 273 AHWYAFTDGQAWVNGNPNQVCGQVAGNMMRTSGFLVNQGWPLFISEFGGDLRGTNVNDNR 332
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YLNCF VAAELD DWALWTLVGSYY R+GVIG+ E+YG+ W+W +RN++FL RIS+L
Sbjct: 333 YLNCFLAVAAELDLDWALWTLVGSYYFRQGVIGMEEFYGILSWDWTQVRNTTFLNRISAL 392
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
Q PFRGPG+ +K+I+HP TGLCV RKS LDPLTLGPC S+ W YTP K +S+KG
Sbjct: 393 QLPFRGPGITRGNPYKLIFHPLTGLCVIRKSLLDPLTLGPCYLSDGWKYTPQKILSIKGT 452
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNT 360
YFC+QA++ G PAKLGIIC+D S WE+ISDSK+HLSSK + + VCLDVD +N IVTN
Sbjct: 453 YFCIQAENEGMPAKLGIICSDPNSRWEMISDSKLHLSSKLSDDSNVCLDVDDNNNIVTNA 512
Query: 361 CKCLSRDKTCDPASQWFKLVDSTRSSTTTKS 391
CKCLSRD+TCDP+SQWFKL+DS R S T S
Sbjct: 513 CKCLSRDRTCDPSSQWFKLIDSGRRSMLTTS 543
>gi|449435438|ref|XP_004135502.1| PREDICTED: uncharacterized protein LOC101217177 [Cucumis sativus]
Length = 549
Score = 606 bits (1563), Expect = e-171, Method: Compositional matrix adjust.
Identities = 283/396 (71%), Positives = 340/396 (85%), Gaps = 1/396 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHISKPGWCCSN DGNGFFGDQYFNPDLWIKGLT++AT+FNGV +VV MSLRNEL
Sbjct: 139 MVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLTRIATMFNGVNHVVAMSLRNEL 198
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DWYRYMQ GAEAVH+ANP++L+ILSGL++D+DLSF++NQ +NLTFT K V+E
Sbjct: 199 RGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSYDRDLSFLKNQPINLTFTSKTVYE 258
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+DG +W GN NQVCGR +N+M++SGFLL+QG+PLF+SEFG D RG NVNDNR
Sbjct: 259 VHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQGFPLFISEFGIDQRGTNVNDNR 318
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL+CF VAAELD DWA+WTLVGSYYLREGV+GLNE+YG+ DWNWC++RNS+FL+RIS+L
Sbjct: 319 YLSCFLAVAAELDLDWAVWTLVGSYYLREGVVGLNEFYGILDWNWCNLRNSTFLQRISAL 378
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
QSPF+GPG+ E + VI+HP +GLCV RKS LDPLTLGPC +++AW YTP K ++LKG
Sbjct: 379 QSPFQGPGLAERREYNVIFHPLSGLCVVRKSLLDPLTLGPCVDTDAWYYTPQKFLTLKGT 438
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSS-NTIVTN 359
YFC+QA +GK AKLGIICT + W++ISDSK+HLSSK+ NG+ VCLDVDSS N IVTN
Sbjct: 439 YFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSKSSNGSLVCLDVDSSTNEIVTN 498
Query: 360 TCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
+CKCLSRD +CDP+SQWFKLV+STRS +S
Sbjct: 499 SCKCLSRDSSCDPSSQWFKLVNSTRSLGRGRSMINM 534
>gi|449527929|ref|XP_004170960.1| PREDICTED: uncharacterized LOC101217177, partial [Cucumis sativus]
Length = 421
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 283/396 (71%), Positives = 340/396 (85%), Gaps = 1/396 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHISKPGWCCSN DGNGFFGDQYFNPDLWIKGLT++AT+FNGV +VV MSLRNEL
Sbjct: 11 MVILDNHISKPGWCCSNFDGNGFFGDQYFNPDLWIKGLTRIATMFNGVNHVVAMSLRNEL 70
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DWYRYMQ GAEAVH+ANP++L+ILSGL++D+DLSF++NQ +NLTFT K V+E
Sbjct: 71 RGPKQNVNDWYRYMQRGAEAVHSANPDILIILSGLSYDRDLSFLKNQPINLTFTSKTVYE 130
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+DG +W GN NQVCGR +N+M++SGFLL+QG+PLF+SEFG D RG NVNDNR
Sbjct: 131 VHWYAFSDGSSWESGNSNQVCGRTTNNLMKMSGFLLQQGFPLFISEFGIDQRGTNVNDNR 190
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL+CF VAAELD DWA+WTLVGSYYLREGV+GLNE+YG+ DWNWC++RNS+FL+RIS+L
Sbjct: 191 YLSCFLAVAAELDLDWAVWTLVGSYYLREGVVGLNEFYGILDWNWCNLRNSTFLQRISAL 250
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
QSPF+GPG+ E + VI+HP +GLCV RKS LDPLTLGPC +++AW YTP K ++LKG
Sbjct: 251 QSPFQGPGLAERREYNVIFHPLSGLCVVRKSLLDPLTLGPCVDTDAWYYTPQKFLTLKGT 310
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSS-NTIVTN 359
YFC+QA +GK AKLGIICT + W++ISDSK+HLSSK+ NG+ VCLDVDSS N IVTN
Sbjct: 311 YFCIQADEIGKQAKLGIICTVNNAKWDMISDSKLHLSSKSSNGSLVCLDVDSSTNEIVTN 370
Query: 360 TCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
+CKCLSRD +CDP+SQWFKLV+STRS +S
Sbjct: 371 SCKCLSRDSSCDPSSQWFKLVNSTRSLGRGRSMINM 406
>gi|356541382|ref|XP_003539156.1| PREDICTED: endoglucanase-like [Glycine max]
Length = 544
Score = 590 bits (1520), Expect = e-166, Method: Compositional matrix adjust.
Identities = 269/387 (69%), Positives = 323/387 (83%), Gaps = 1/387 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S+PGWCCSN DGNGFFGDQYF+PDLWI GLTKMATIF GV NVV MSLRNEL
Sbjct: 150 MVILDNHVSQPGWCCSNLDGNGFFGDQYFDPDLWIMGLTKMATIFKGVTNVVAMSLRNEL 209
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QNV WYRYM GAEAVHAANP+VLVILSGLNFD +LSF+RN+AV L+F GKLVFE
Sbjct: 210 RGPRQNVNVWYRYMPKGAEAVHAANPDVLVILSGLNFDTNLSFIRNEAVKLSFNGKLVFE 269
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+DGQAW GNPNQVCG+V +NVMR +GFLL+QGWPLFVSEFG DLRG +VNDNR
Sbjct: 270 VHWYSFSDGQAWTLGNPNQVCGQVTENVMRRAGFLLDQGWPLFVSEFGVDLRGTSVNDNR 329
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YLNCF + A+LD DWALWTL G+YY+R+G +G+ EY+G+ + +W +RN+SFL+RIS++
Sbjct: 330 YLNCFMALVAQLDLDWALWTLGGNYYIRQGDVGMEEYFGILNSDWIQVRNTSFLQRISAI 389
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLT-LGPCTESEAWSYTPHKTISLKG 299
Q PF+GPG+ E +KVI+HP TGLC+ R S ++PL LGPC+ S+AW YT K +S+KG
Sbjct: 390 QLPFKGPGLSEAKPYKVIFHPLTGLCILRNSPVEPLMRLGPCSNSDAWEYTDQKILSIKG 449
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTN 359
YFCLQA+ GK AKLG C+ S WE+ISDSKMHLS++ +N + VCLDVD++N IVTN
Sbjct: 450 TYFCLQAEEEGKQAKLGNACSGSNSRWEMISDSKMHLSTQTNNASGVCLDVDTNNIIVTN 509
Query: 360 TCKCLSRDKTCDPASQWFKLVDSTRSS 386
C+CLS+D TCDPA+QWFKLVDSTR S
Sbjct: 510 ICQCLSKDNTCDPATQWFKLVDSTRKS 536
>gi|356528434|ref|XP_003532808.1| PREDICTED: uncharacterized protein LOC100818309 [Glycine max]
Length = 574
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 272/409 (66%), Positives = 321/409 (78%), Gaps = 5/409 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++ PGWCC SDGNGFFGD++FNPD WI GLTKMAT+FNGV NVVGMSLRNEL
Sbjct: 166 MVILDNHLTNPGWCCGYSDGNGFFGDKFFNPDQWIFGLTKMATLFNGVTNVVGMSLRNEL 225
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DWY+YM GAEAVHAANP+VLVILSG+NFD LSF+R++ V+LTF GKLVFE
Sbjct: 226 RGPKQNVNDWYKYMVKGAEAVHAANPDVLVILSGINFDTSLSFIRDRPVSLTFKGKLVFE 285
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
H YGFTDG AW DGNPNQVCG+V ++ + S FL++QGWPLFVSEFG DLRG NVNDNR
Sbjct: 286 VHRYGFTDGGAWADGNPNQVCGKVTADIKQTSTFLVDQGWPLFVSEFGGDLRGTNVNDNR 345
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YLNCF + AELD DWA WTLVGSYY REGVIG+ E+YGL W+W +R++SFL RIS+L
Sbjct: 346 YLNCFLALVAELDLDWAYWTLVGSYYFREGVIGMEEFYGLLTWDWTQVRSTSFLNRISAL 405
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
Q PFRGPG+ E +K+I+HP TGLCV KS L LTLGPC+ S+AW+YTP KT+ +
Sbjct: 406 QIPFRGPGIIEGSAYKLIFHPLTGLCVISKSQLTSLTLGPCSSSDAWTYTPQKTLLINNT 465
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNT 360
FC+ A+ GKPA L I C+D S WE+ISDS MHLSSK +G+ +CLDVD +N IVT
Sbjct: 466 NFCIHAEQEGKPATLSITCSDANSKWEMISDSNMHLSSKLSDGSNLCLDVDDNNIIVTTA 525
Query: 361 CKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKDFIWK 409
CKCL++DKTCDPASQWFKL+DS R S +T S L D +W+
Sbjct: 526 CKCLNQDKTCDPASQWFKLIDSGRRSISTTSTLSM-----LNSPDILWQ 569
>gi|356511059|ref|XP_003524249.1| PREDICTED: endoglucanase E1-like [Glycine max]
Length = 571
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 267/409 (65%), Positives = 318/409 (77%), Gaps = 5/409 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+++PGWCC N+DGNGFFGD++F+P+ WI GLTKMAT+F GV VVG+SLRNEL
Sbjct: 163 MVILDNHVTQPGWCCGNTDGNGFFGDKFFDPNQWILGLTKMATLFKGVTAVVGISLRNEL 222
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG +QNV DWY+YM GAEA HAANP+VLVILSGLNFD DLSF+R++ V+LTF GKLVFE
Sbjct: 223 RGSRQNVNDWYKYMVKGAEAAHAANPDVLVILSGLNFDTDLSFLRDRPVSLTFKGKLVFE 282
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
H YGFTDG AW DGNPNQVCG+V N+ + SGFL++QGWPLFVSEFG DLRG NVNDNR
Sbjct: 283 VHRYGFTDGGAWADGNPNQVCGKVTANIKKTSGFLVDQGWPLFVSEFGGDLRGTNVNDNR 342
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YLNCF + AELD DWA WTLVGSYY REGVIG+ E+YGL W+W +R++SFL RIS+L
Sbjct: 343 YLNCFLALVAELDLDWAYWTLVGSYYFREGVIGMEEFYGLLTWDWNQVRSTSFLNRISAL 402
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
Q PFRGPG+ E HK+I+HP TGLCV KS L LTL C+ S+AW+YTP KT+ +
Sbjct: 403 QIPFRGPGIIEGNPHKLIFHPLTGLCVISKSQLTSLTLAACSSSDAWTYTPQKTLLVNNT 462
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNT 360
FC+ A+ KPA L + C+D S WE+ISDS MHLSSK +G+ +CLDVD +N IVTN
Sbjct: 463 DFCIHAEEERKPATLSMTCSDPNSKWEMISDSNMHLSSKLSDGSNLCLDVDDNNIIVTNA 522
Query: 361 CKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKDFIWK 409
CKCLS+DKTCDP SQWFKL+DS R S +T S L D +WK
Sbjct: 523 CKCLSKDKTCDPGSQWFKLIDSGRRSISTTSTLSM-----LNSPDLLWK 566
>gi|357463207|ref|XP_003601885.1| Endoglucanase [Medicago truncatula]
gi|355490933|gb|AES72136.1| Endoglucanase [Medicago truncatula]
Length = 655
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 253/387 (65%), Positives = 307/387 (79%), Gaps = 3/387 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+++PGWCCSNSDGNGFFGDQYF+PDLW+ GLTKMAT+FNGV NVVGMSLRNEL
Sbjct: 140 MVILDNHVTQPGWCCSNSDGNGFFGDQYFDPDLWLMGLTKMATLFNGVTNVVGMSLRNEL 199
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN+ DWYRYM GAE VHAAN VLVILSGLNFD DLSF+ NQ V LTF GKLVFE
Sbjct: 200 RGPRQNLNDWYRYMPKGAEVVHAANSNVLVILSGLNFDTDLSFITNQPVKLTFNGKLVFE 259
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+D QAW NPNQVCG+V + +R SGFLL+QGWPLFVSEFG DLRG N+N NR
Sbjct: 260 EHWYSFSDSQAWTLENPNQVCGQVTSSFVRNSGFLLDQGWPLFVSEFGLDLRGKNMNQNR 319
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
+ NCF VAAELD DWA WTL GSYY+R+GV+ ++E +G+ + NW +RN+SFL+RIS++
Sbjct: 320 FFNCFMAVAAELDLDWAYWTLSGSYYIRQGVVDVDETFGILNGNWSQVRNTSFLQRISAI 379
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
Q PF+GPG+ E+ +KVI+HP GLCV KS +PL +GPC+ S+ W YTP + LKG
Sbjct: 380 QHPFQGPGLLESEPYKVIFHPLRGLCVLSKSLHEPLKMGPCSNSDGWEYTPQNILLLKGT 439
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNG---TTVCLDVDSSNTIV 357
FCLQ + GK KLG C+ STWE+ISDSKMHLS +N ++VCLDVD++N +V
Sbjct: 440 KFCLQGEGEGKQVKLGTTCSGPESTWEMISDSKMHLSYNVNNNGSTSSVCLDVDANNIVV 499
Query: 358 TNTCKCLSRDKTCDPASQWFKLVDSTR 384
TN+CKC+S+ TCDPASQWFKL+ ++
Sbjct: 500 TNSCKCISKVNTCDPASQWFKLLGASE 526
>gi|356569219|ref|XP_003552802.1| PREDICTED: uncharacterized protein LOC100805196 [Glycine max]
Length = 533
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 258/388 (66%), Positives = 300/388 (77%), Gaps = 7/388 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKP WCCSN DGNGFFGDQYF+PDLWIKGLTKMAT+F GV NVV MSLRNEL
Sbjct: 152 MVILDNHVSKPQWCCSNDDGNGFFGDQYFDPDLWIKGLTKMATLFKGVTNVVAMSLRNEL 211
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN DW++YM GAEAVH ANP+VLVI+SGLN+D DLSF+R Q V L+F+ KLVFE
Sbjct: 212 RGPRQNANDWFKYMPKGAEAVHGANPDVLVIMSGLNYDLDLSFLRKQQVKLSFSRKLVFE 271
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+DG +W NPNQVCG+V VMR +G+LLEQG+PL +SEFG DLRG N NDN
Sbjct: 272 LHWYSFSDGDSWTTENPNQVCGKVTGRVMRSAGYLLEQGYPLVLSEFGWDLRGTNQNDNS 331
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y NC +AA+LD+DWA WTL GSYYLREG +GL E YG+ N + L+RIS++
Sbjct: 332 YFNCLLPLAAQLDFDWAYWTLAGSYYLREGTVGLIEVYGILTQNTTLPTTTFLLQRISAI 391
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
Q P+RGPG+ E HKVI+HP TGLC+ K L+PL LGPC+ SE W YT K +S+KG
Sbjct: 392 QLPYRGPGLSEVEAHKVIFHPLTGLCISGK--LEPLKLGPCSNSEGWEYTAQKVLSVKGR 449
Query: 301 -YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTN 359
CLQA+ GK AKLG C S WEI+SDSK+HLSSK +N + VCLDVDS+N IVTN
Sbjct: 450 NSTCLQAEGEGKEAKLGNEC----SVWEIVSDSKLHLSSKINNASDVCLDVDSNNNIVTN 505
Query: 360 TCKCLSRDKTCDPASQWFKLVDSTRSST 387
CKCLS DKTCDPASQWFKLVDSTR ST
Sbjct: 506 ACKCLSGDKTCDPASQWFKLVDSTRKST 533
>gi|222629070|gb|EEE61202.1| hypothetical protein OsJ_15212 [Oryza sativa Japonica Group]
Length = 571
Score = 506 bits (1303), Expect = e-141, Method: Compositional matrix adjust.
Identities = 244/428 (57%), Positives = 304/428 (71%), Gaps = 20/428 (4%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCC N+DGNGFFGD YF+PD+W+ GLTKMAT+F V +VV MSLRNEL
Sbjct: 149 MVILDNHVSKPGWCCGNNDGNGFFGDAYFDPDVWVDGLTKMATMFAAVPSVVAMSLRNEL 208
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN DWY+YMQ GAEAVHAANP V+VILSG++FD DL+F+ ++ VN++F GK+ FE
Sbjct: 209 RGPRQNSADWYKYMQRGAEAVHAANPRVVVILSGMSFDNDLAFLNSRQVNVSFAGKVAFE 268
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DGQAW GN NQVC RV +V R + +LL+QGWP+F+SEFG D RG NVNDNR
Sbjct: 269 VHWYGFSDGQAWRAGNANQVCARVAASVSRRALYLLDQGWPVFLSEFGVDNRGGNVNDNR 328
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C VAA+LD DWALWTL GSYYLREGV+GL+E YG+ DW WC RN + L R+ +L
Sbjct: 329 YYGCVAAVAADLDLDWALWTLQGSYYLREGVLGLDEVYGVLDWAWCKPRNDTALTRLHAL 388
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKS---FLDPLTLGPCTESEAWSYTP-HKTIS 296
Q PFRGPG+ E + V++HP TG CV R+S L LG C E+EAW+YT + +S
Sbjct: 389 QRPFRGPGLAEAAPYTVMFHPTTGRCVVRRSSSVVQTTLELGSCGEAEAWAYTASQQRLS 448
Query: 297 LKGA-YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKA---------DNGTTV 346
+ + CL+A+ G+PA+LG+ C D + W + SDSK+HL+ A NG +
Sbjct: 449 PRDSPLLCLRAEGAGRPARLGLSCGDELARWSLTSDSKLHLAVNASSSSSSPETSNGGML 508
Query: 347 CLDV-DSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKD 405
CLDV D ++VTN C+CLS D +CDP SQWFKLV STRS T + + LP K
Sbjct: 509 CLDVGDDGRSLVTNPCRCLSADNSCDPESQWFKLVTSTRSVAATNTM-----LAQLPPKL 563
Query: 406 FIWKFFGL 413
WK L
Sbjct: 564 RSWKIRSL 571
>gi|38345246|emb|CAD41090.2| OSJNBb0011N17.7 [Oryza sativa Japonica Group]
Length = 555
Score = 506 bits (1302), Expect = e-141, Method: Compositional matrix adjust.
Identities = 244/428 (57%), Positives = 304/428 (71%), Gaps = 20/428 (4%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCC N+DGNGFFGD YF+PD+W+ GLTKMAT+F V +VV MSLRNEL
Sbjct: 133 MVILDNHVSKPGWCCGNNDGNGFFGDAYFDPDVWVDGLTKMATMFAAVPSVVAMSLRNEL 192
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN DWY+YMQ GAEAVHAANP V+VILSG++FD DL+F+ ++ VN++F GK+ FE
Sbjct: 193 RGPRQNSADWYKYMQRGAEAVHAANPRVVVILSGMSFDNDLAFLNSRQVNVSFAGKVAFE 252
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DGQAW GN NQVC RV +V R + +LL+QGWP+F+SEFG D RG NVNDNR
Sbjct: 253 VHWYGFSDGQAWRAGNANQVCARVAASVSRRALYLLDQGWPVFLSEFGVDNRGGNVNDNR 312
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C VAA+LD DWALWTL GSYYLREGV+GL+E YG+ DW WC RN + L R+ +L
Sbjct: 313 YYGCVAAVAADLDLDWALWTLQGSYYLREGVLGLDEVYGVLDWAWCKPRNDTALTRLHAL 372
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKS---FLDPLTLGPCTESEAWSYTP-HKTIS 296
Q PFRGPG+ E + V++HP TG CV R+S L LG C E+EAW+YT + +S
Sbjct: 373 QRPFRGPGLAEAAPYTVMFHPTTGRCVVRRSSSVVQTTLELGSCGEAEAWAYTASQQRLS 432
Query: 297 LKGA-YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKA---------DNGTTV 346
+ + CL+A+ G+PA+LG+ C D + W + SDSK+HL+ A NG +
Sbjct: 433 PRDSPLLCLRAEGAGRPARLGLSCGDELARWSLTSDSKLHLAVNASSSSSSPETSNGGML 492
Query: 347 CLDV-DSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKD 405
CLDV D ++VTN C+CLS D +CDP SQWFKLV STRS T + + LP K
Sbjct: 493 CLDVGDDGRSLVTNPCRCLSADNSCDPESQWFKLVTSTRSVAATNTM-----LAQLPPKL 547
Query: 406 FIWKFFGL 413
WK L
Sbjct: 548 RSWKIRSL 555
>gi|218195068|gb|EEC77495.1| hypothetical protein OsI_16343 [Oryza sativa Indica Group]
Length = 571
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 242/428 (56%), Positives = 300/428 (70%), Gaps = 20/428 (4%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILD+H+SKPGWCC N+DGNGFFGD YF+PD+W+ GLTKMAT+F V +VV MSLRNEL
Sbjct: 149 MVILDDHVSKPGWCCGNNDGNGFFGDAYFDPDVWVDGLTKMATMFAAVPSVVAMSLRNEL 208
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN DWY+YMQ GAEAVHAANP V+VILSG++FD DL+F+ ++ VN++F GK+ FE
Sbjct: 209 RGPRQNSADWYKYMQRGAEAVHAANPRVVVILSGMSFDNDLAFLNSRQVNVSFAGKVAFE 268
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DGQAW GN NQVC RV +V R + +LL+QGWP+F+SEFG D RG NVNDNR
Sbjct: 269 VHWYGFSDGQAWRAGNANQVCARVAASVSRRALYLLDQGWPVFLSEFGVDNRGGNVNDNR 328
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C VAA+LD DWALWTL GSYYLREGV+GL+E YG+ DW WC RN + L R+ +L
Sbjct: 329 YYGCVAAVAADLDLDWALWTLQGSYYLREGVLGLDEVYGVLDWAWCKPRNDTALTRLHAL 388
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKS---FLDPLTLGPCTESEAWSYTP--HKTI 295
Q PFRGPG+ E + V++HP TG CV R+S L LG C E+EAW+YT +
Sbjct: 389 QRPFRGPGLAEAAPYTVMFHPTTGRCVVRRSSSVVQTTLELGSCGEAEAWAYTASQQRLS 448
Query: 296 SLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKA---------DNGTTV 346
CL+A+ G+PA+LG+ C D + W + SDSK+HL+ A NG +
Sbjct: 449 PRDSPLLCLRAEGAGRPARLGLSCGDELARWSLTSDSKLHLAVNASSSSSSPETSNGGML 508
Query: 347 CLDV-DSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKD 405
CLDV D ++VTN C+CLS D +CDP SQWFKLV STRS T + + LP K
Sbjct: 509 CLDVGDDGRSLVTNPCRCLSADNSCDPESQWFKLVTSTRSVAATNTM-----LAQLPPKL 563
Query: 406 FIWKFFGL 413
WK L
Sbjct: 564 RSWKIRSL 571
>gi|90265052|emb|CAH67677.1| H0510A06.2 [Oryza sativa Indica Group]
gi|116309846|emb|CAH66882.1| OSIGBa0158F13.13 [Oryza sativa Indica Group]
Length = 555
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 242/428 (56%), Positives = 301/428 (70%), Gaps = 20/428 (4%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCC N+DGNGFFGD YF+PD+W+ GLTKMAT+F V +VV MSLRNEL
Sbjct: 133 MVILDNHVSKPGWCCGNNDGNGFFGDAYFDPDVWVDGLTKMATMFAAVPSVVAMSLRNEL 192
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN DWY+YMQ GAEAVHAANP V+VILSG++FD DL+F+ ++ VN++F GK+ FE
Sbjct: 193 RGPRQNSADWYKYMQRGAEAVHAANPRVVVILSGMSFDNDLAFLNSRQVNVSFAGKVAFE 252
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DGQAW GN NQVC RV +V R + +LL+QGWP+F+S+FG D RG NVNDNR
Sbjct: 253 VHWYGFSDGQAWRAGNANQVCARVAASVSRRALYLLDQGWPVFLSDFGVDNRGGNVNDNR 312
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C VAA+LD DWALWTL GSYYLREGV+GL+E YG+ DW WC RN + L R+ +L
Sbjct: 313 YYGCVAAVAADLDLDWALWTLQGSYYLREGVLGLDEVYGVLDWAWCKPRNDTALTRLHAL 372
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKS---FLDPLTLGPCTESEAWSYTP--HKTI 295
Q PFRGPG+ E + V++HP TG CV R+S L LG C E+EAW+YT +
Sbjct: 373 QRPFRGPGLAEAAPYTVMFHPTTGRCVVRRSSSVVQTTLELGSCGEAEAWAYTASQQRLS 432
Query: 296 SLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKA---------DNGTTV 346
CL+A+ G+PA+LG+ C D + W + SDSK+HL+ A NG +
Sbjct: 433 PRDSPLLCLRAEGAGRPARLGLSCGDELARWSLTSDSKLHLAVNASSSSSSPETSNGGML 492
Query: 347 CLDV-DSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKD 405
CLDV D ++VTN C+CLS D +CDP SQWFKLV STRS +T + + LP K
Sbjct: 493 CLDVGDDGRSLVTNPCRCLSADNSCDPESQWFKLVTSTRSVASTNTM-----LAQLPPKL 547
Query: 406 FIWKFFGL 413
WK L
Sbjct: 548 RSWKIRSL 555
>gi|42562033|ref|NP_172772.2| Cellulase (glycosyl hydrolase family 5) protein [Arabidopsis
thaliana]
gi|51536480|gb|AAU05478.1| At1g13130 [Arabidopsis thaliana]
gi|332190852|gb|AEE28973.1| Cellulase (glycosyl hydrolase family 5) protein [Arabidopsis
thaliana]
Length = 552
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 234/402 (58%), Positives = 299/402 (74%), Gaps = 6/402 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++KPGWCC+N DGNGFFGDQ+F+P +W+ L KMA FNGV NVVGMSLRNEL
Sbjct: 154 MVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFNGVSNVVGMSLRNEL 213
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DW++YMQ GAEAVH+AN +VLVILSGL+FD DLSFVR++ V L+FTGKLVFE
Sbjct: 214 RGPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDADLSFVRSRPVKLSFTGKLVFE 273
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+DG +W NPN +CGRV++ + G+LL QG+PLF+SEFG D RG N NDNR
Sbjct: 274 LHWYSFSDGNSWAANNPNDICGRVLNRIGNGGGYLLNQGFPLFLSEFGIDERGVNTNDNR 333
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C G AAE D DW+LW L GSYYLR+G +G+NEYYG+ D +W +RNSSFL++IS L
Sbjct: 334 YFGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMNEYYGVLDSDWISVRNSSFLQKISFL 393
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDP--LTLGPCTESEAWSYTPHKTISLK 298
QSP +GPG T + +++HP TGLC+ R S DP LTLGPC SE WSYT K + +K
Sbjct: 394 QSPLQGPGP-RTDAYNLVFHPLTGLCIVR-SLDDPKMLTLGPCNSSEPWSYT-KKALRIK 450
Query: 299 GAYFCLQAKHVGKPAKLG-IICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIV 357
CLQ+ P + C+ GS W+ IS S+MHL+S N T++CLDVD++N +V
Sbjct: 451 DQQLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASRMHLASTTSNKTSLCLDVDTANNVV 510
Query: 358 TNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPIL 399
N CKCLS+DK+C+P SQWFK++ +TR +++ + + + +L
Sbjct: 511 ANACKCLSKDKSCEPMSQWFKIIKATRPLKSSRLYKEISSML 552
>gi|4850396|gb|AAD31066.1|AC007357_15 F3F19.15 [Arabidopsis thaliana]
Length = 522
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 234/402 (58%), Positives = 299/402 (74%), Gaps = 6/402 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++KPGWCC+N DGNGFFGDQ+F+P +W+ L KMA FNGV NVVGMSLRNEL
Sbjct: 124 MVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFNGVSNVVGMSLRNEL 183
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DW++YMQ GAEAVH+AN +VLVILSGL+FD DLSFVR++ V L+FTGKLVFE
Sbjct: 184 RGPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDADLSFVRSRPVKLSFTGKLVFE 243
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+DG +W NPN +CGRV++ + G+LL QG+PLF+SEFG D RG N NDNR
Sbjct: 244 LHWYSFSDGNSWAANNPNDICGRVLNRIGNGGGYLLNQGFPLFLSEFGIDERGVNTNDNR 303
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C G AAE D DW+LW L GSYYLR+G +G+NEYYG+ D +W +RNSSFL++IS L
Sbjct: 304 YFGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMNEYYGVLDSDWISVRNSSFLQKISFL 363
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDP--LTLGPCTESEAWSYTPHKTISLK 298
QSP +GPG T + +++HP TGLC+ R S DP LTLGPC SE WSYT K + +K
Sbjct: 364 QSPLQGPGP-RTDAYNLVFHPLTGLCIVR-SLDDPKMLTLGPCNSSEPWSYT-KKALRIK 420
Query: 299 GAYFCLQAKHVGKPAKLG-IICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIV 357
CLQ+ P + C+ GS W+ IS S+MHL+S N T++CLDVD++N +V
Sbjct: 421 DQQLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASRMHLASTTSNKTSLCLDVDTANNVV 480
Query: 358 TNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPIL 399
N CKCLS+DK+C+P SQWFK++ +TR +++ + + + +L
Sbjct: 481 ANACKCLSKDKSCEPMSQWFKIIKATRPLKSSRLYKEISSML 522
>gi|297844216|ref|XP_002889989.1| glycosyl hydrolase family 5 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297335831|gb|EFH66248.1| glycosyl hydrolase family 5 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 552
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 233/402 (57%), Positives = 298/402 (74%), Gaps = 6/402 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++KPGWCC+N DGNGFFGDQ+F+P +W+ L KMA F+GV NVVGMSLRNEL
Sbjct: 154 MVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFDGVSNVVGMSLRNEL 213
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DW++YMQ GAEAVH++N +VLVILSGL+FD DLSFVR++ VNL+FTGKLVFE
Sbjct: 214 RGPKQNVNDWFKYMQQGAEAVHSSNKKVLVILSGLSFDADLSFVRSRPVNLSFTGKLVFE 273
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+DG +W NPN +CGRV++ + G+LL QG+PLF+SEFG D RG N NDNR
Sbjct: 274 LHWYSFSDGNSWAANNPNDICGRVLNRIGNGGGYLLNQGFPLFLSEFGIDERGVNTNDNR 333
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C G AAE D DW+LW L GSYYLR+GV+G+ EYYG+ D +W +RNSSFL++IS L
Sbjct: 334 YFGCVTGWAAENDVDWSLWALTGSYYLRQGVVGMIEYYGVLDSDWISVRNSSFLQKISFL 393
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDP--LTLGPCTESEAWSYTPHKTISLK 298
QSP +GPG + +++HP TGLC+ R S DP LTLGPC SE WSYT K + +K
Sbjct: 394 QSPLQGPGP-RIDAYNLVFHPLTGLCIVR-SLDDPKMLTLGPCNSSEPWSYT-KKALRIK 450
Query: 299 GAYFCLQAKHVGKPAKLG-IICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIV 357
+ CLQ+ P + C+ GS W+ IS S+MHLSS N T++CLDVD++N IV
Sbjct: 451 DQHLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASRMHLSSTTSNKTSLCLDVDTANNIV 510
Query: 358 TNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPIL 399
N CKCLS+D +C+P SQWFK++ +TR +++ + + +L
Sbjct: 511 ANACKCLSKDSSCEPMSQWFKIIKATRPLKSSRLHKEISSML 552
>gi|225461419|ref|XP_002284888.1| PREDICTED: uncharacterized protein LOC100258876 isoform 1 [Vitis
vinifera]
Length = 509
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 247/396 (62%), Positives = 292/396 (73%), Gaps = 34/396 (8%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILD+H S+P + GNG FGDQ+FNPDLW+KGLT++AT+F+GV NVVGMSLRNEL
Sbjct: 146 MVILDSHFSEPSF-----HGNGVFGDQHFNPDLWVKGLTRIATMFSGVPNVVGMSLRNEL 200
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
R P QNVKDWYRYMQ GAEAVH+ANP+VLVI+SGL+ DLSF+ NQ + LTFTGKLV E
Sbjct: 201 RCPNQNVKDWYRYMQKGAEAVHSANPDVLVIISGLSDGTDLSFLLNQQLELTFTGKLVLE 260
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL-FVSEFGADLRGNNVNDN 179
HW+G G+A NPN+VCGRVVD++MR G LL+QGWPL FVSE G D DN
Sbjct: 261 MHWHGSRVGRAGETSNPNKVCGRVVDSIMRGGGVLLQQGWPLMFVSELGVD-------DN 313
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
R+LNCFFG+AAELD+DWALWTL E GL +WN S+F +RIS+
Sbjct: 314 RHLNCFFGLAAELDFDWALWTL-------------EETNGLMNWN------SNFFQRISA 354
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
LQSP +GP V HK+I HP+TGLC+ R+S +PL LGPCTESEAW YTP K + +KG
Sbjct: 355 LQSPLQGPDVSRVRPHKIILHPSTGLCILRESLSEPLKLGPCTESEAWGYTPQKILIVKG 414
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTN 359
YFCLQA +GKPAKL IICT GS WEIISDSKM+LS+K +GTTVCLDVDSS+ IVT+
Sbjct: 415 TYFCLQAVGLGKPAKLSIICTQPGSNWEIISDSKMYLSTKLGDGTTVCLDVDSSSNIVTD 474
Query: 360 TCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
CKCL RD CDP SQWFK+VDST + T + Q
Sbjct: 475 ACKCLGRDDMCDPGSQWFKVVDST--NITRRPILQI 508
>gi|242073476|ref|XP_002446674.1| hypothetical protein SORBIDRAFT_06g020260 [Sorghum bicolor]
gi|241937857|gb|EES11002.1| hypothetical protein SORBIDRAFT_06g020260 [Sorghum bicolor]
Length = 556
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 233/397 (58%), Positives = 287/397 (72%), Gaps = 3/397 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS DGNGFFGD YF+PD W+KGL+ MAT+FN +NVVGMSLRNEL
Sbjct: 148 MVILDNQMTTPGWCCSRIDGNGFFGDVYFDPDEWLKGLSAMATMFNNTKNVVGMSLRNEL 207
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV WYRYMQ+GAEAVHAANP VLVILSGL+FD LSF+ + V+L+F+GKLV+E
Sbjct: 208 RGPKQNVSLWYRYMQMGAEAVHAANPNVLVILSGLDFDNTLSFLFKEKVHLSFSGKLVYE 267
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DG W N N VC VVD + FLL+QGWPLF SEFG D+ G ++ DNR
Sbjct: 268 QHWYGFSDGGNWETQNQNDVCAMVVDFIWAKGLFLLQQGWPLFFSEFGFDMSGTHIGDNR 327
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DW++W L GSYY+REG++ +E YGL W+WC RN SF++RI+SL
Sbjct: 328 YLTCFLSVAAEMDLDWSIWALQGSYYIREGILAYDESYGLLSWDWCTARNPSFIKRINSL 387
Query: 241 QSPFRGPGVFETG-LHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + + +I+HP +GLCV +S L LGPC ES AW+YT + +K
Sbjct: 388 QSPFQGPGLPNSQEPYNIIFHPQSGLCVLARSS-KLLELGPCDESNAWNYTSAYNLVVKN 446
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CLQ K VGK AKLG C+ S W +IS+SKMH+S++ NGT VCLD + I+T
Sbjct: 447 TGQCLQVKSVGKNAKLGTDCSKPSSKWNLISNSKMHVSAELTKNGTRVCLDASPAGAIIT 506
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
N CKCLS D TCDP SQWFK++ S+R S Q
Sbjct: 507 NQCKCLSVDPTCDPESQWFKVIVSSRDIPGGDSILQL 543
>gi|225461423|ref|XP_002284895.1| PREDICTED: uncharacterized protein LOC100253729 isoform 1 [Vitis
vinifera]
Length = 512
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 244/396 (61%), Positives = 291/396 (73%), Gaps = 34/396 (8%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILD+H S+P + NG FGDQ+FNPDLW+KGLT++AT+F+GV NVVGMSLRNEL
Sbjct: 149 MVILDSHFSEPSF-----HDNGVFGDQHFNPDLWVKGLTRIATMFSGVPNVVGMSLRNEL 203
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
R P QNVKDWYRYMQ GAEAVH+ANP+VLVI+SGL+ DLSF+ NQ + LTFTGKLV E
Sbjct: 204 RCPNQNVKDWYRYMQKGAEAVHSANPDVLVIISGLSDGTDLSFLLNQQLELTFTGKLVLE 263
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL-FVSEFGADLRGNNVNDN 179
HW+G G+A NPN+VCGRVVD++MR G LL+QGWPL FVSE G D DN
Sbjct: 264 MHWHGSRVGRAGETSNPNKVCGRVVDSIMRRGGVLLQQGWPLMFVSELGVD-------DN 316
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
R+LNCFFG+AAELD+DWALWTL E GL +WN SSF +RIS+
Sbjct: 317 RHLNCFFGLAAELDFDWALWTL-------------EETNGLMNWN------SSFFQRISA 357
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
LQSP +GP V HK+I+HP+TGLC+ R+S +PL LGPCT+SEAW YTP K +++KG
Sbjct: 358 LQSPLQGPDVSRVRRHKIIFHPSTGLCILRESGSEPLKLGPCTKSEAWGYTPQKLLTVKG 417
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTN 359
YFCLQA +GKPAKL IICT GS WE ISDSKM+LS+K +GT VCLDVDSS+TIVT+
Sbjct: 418 TYFCLQAVGLGKPAKLSIICTKPGSNWENISDSKMYLSTKLGDGTRVCLDVDSSSTIVTD 477
Query: 360 TCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
CKCL R CDP SQWFK+VDST + T + Q
Sbjct: 478 ACKCLGRGDMCDPGSQWFKVVDST--NITRRPILQI 511
>gi|326526611|dbj|BAK00694.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 561
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 231/397 (58%), Positives = 284/397 (71%), Gaps = 3/397 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS +DGNGFFGD+YF+P+ W+KGL+ MAT+F +NVVGMSLRNEL
Sbjct: 152 MVILDNQMTTPGWCCSRTDGNGFFGDKYFDPEEWLKGLSAMATMFRHAKNVVGMSLRNEL 211
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP QNV WYRYMQ GAEAVHAANP VLVILSGL+FD LSF+ + V L+FTGKLVFE
Sbjct: 212 RGPYQNVSLWYRYMQQGAEAVHAANPNVLVILSGLDFDNSLSFLSPKQVKLSFTGKLVFE 271
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DG W + N N CG V+++ FLL+QGWPLF SE G D+ G ++ DNR
Sbjct: 272 QHWYGFSDGTDWENSNQNDACGEAVESIRTKGLFLLQQGWPLFFSEIGFDMSGTHIADNR 331
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG++ +E YGL W+W RN SF+ERI+SL
Sbjct: 332 YLTCFLSVAAEMDLDWAVWALEGSYYIREGILAYDETYGLLTWDWYTARNPSFIERINSL 391
Query: 241 QSPFRGPGVFET-GLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + +KVI+HP TGLCV +S + L LGPC ES AW+YT + LK
Sbjct: 392 QSPFQGPGLSSSHKSYKVIFHPLTGLCVLVES-ANVLKLGPCDESNAWNYTSTYELVLKQ 450
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CL+AK VG AKLG C+ S W++ISDS MH+S++ NGT VCLD I T
Sbjct: 451 TGQCLEAKSVGDTAKLGTGCSKSCSKWQLISDSGMHVSAELTKNGTRVCLDAGPDGVITT 510
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
+ CKCL+ D TCDP SQWFK++ S+R S Q
Sbjct: 511 DQCKCLTEDPTCDPESQWFKVISSSRGMPGEASVLQL 547
>gi|225461415|ref|XP_002284881.1| PREDICTED: uncharacterized protein LOC100264047 isoform 1 [Vitis
vinifera]
Length = 509
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 243/387 (62%), Positives = 286/387 (73%), Gaps = 32/387 (8%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M+ILD+H SKP + N FGDQYFNPDLW+KGLT+MAT+F+GV NVVGMSLRNEL
Sbjct: 146 MIILDSHFSKPSF-----HSNDIFGDQYFNPDLWVKGLTRMATVFSGVPNVVGMSLRNEL 200
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
R PKQNVKDWYRYMQ GAEAVH+ANP+VLVI+SGL+ DLSF+ NQ + LTFTGKLV E
Sbjct: 201 RCPKQNVKDWYRYMQKGAEAVHSANPDVLVIISGLSDGTDLSFLLNQQLELTFTGKLVLE 260
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL-FVSEFGADLRGNNVNDN 179
HW G G+A NPN+VCGRVVD++MR G LL+QGWPL FVSE G D DN
Sbjct: 261 MHWDGLRAGRAGETSNPNKVCGRVVDSIMRRGGVLLQQGWPLMFVSELGVD-------DN 313
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
R LNCFFG+AAELD+DWALWT+ E GL +WN SS +RIS+
Sbjct: 314 RNLNCFFGLAAELDFDWALWTV-------------EETNGLMNWN------SSVFQRISA 354
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
LQSP +GP V HK+I HP TGLC+ +S+ +PL LGPCT+S+AW YTP K + +KG
Sbjct: 355 LQSPLQGPDVSRVRPHKIILHPPTGLCILWESWTEPLKLGPCTKSDAWGYTPQKLLIVKG 414
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTN 359
+YFCLQA +GKPAKL IICT GS W+IISDSKM+LS+K + TTVCLDVDSS+TIVT+
Sbjct: 415 SYFCLQAVELGKPAKLSIICTKPGSNWDIISDSKMYLSTKLGDSTTVCLDVDSSSTIVTD 474
Query: 360 TCKCLSRDKTCDPASQWFKLVDSTRSS 386
CKCL RD TCDP SQWFK+VDST S
Sbjct: 475 ACKCLGRDDTCDPGSQWFKVVDSTNIS 501
>gi|212721186|ref|NP_001132121.1| hypothetical protein precursor [Zea mays]
gi|194693482|gb|ACF80825.1| unknown [Zea mays]
gi|413918672|gb|AFW58604.1| hypothetical protein ZEAMMB73_652830 [Zea mays]
Length = 555
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 234/397 (58%), Positives = 285/397 (71%), Gaps = 3/397 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS +DGNGFFGD YF+P+ W+KGL+ MAT+FN + VVGMSLRNEL
Sbjct: 147 MVILDNQMTTPGWCCSRTDGNGFFGDVYFDPNEWLKGLSAMATMFNNTKYVVGMSLRNEL 206
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV WYRYMQLGAEAVHAANP VLVILSGL+FD LSF+ + V L+F+GKLV+E
Sbjct: 207 RGPKQNVSLWYRYMQLGAEAVHAANPNVLVILSGLDFDNTLSFLFKEKVRLSFSGKLVYE 266
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DG W N N VCG VVD + FLL+ GWPLF SEFG D+ G ++ DNR
Sbjct: 267 QHWYGFSDGGNWETQNQNDVCGMVVDFIWAKGLFLLQHGWPLFFSEFGFDMSGTHIGDNR 326
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG++ +E YGL W+WC RN SF++RI+SL
Sbjct: 327 YLTCFLSVAAEMDLDWAIWALQGSYYIREGILAYDESYGLLTWDWCTARNPSFIKRINSL 386
Query: 241 QSPFRGPGVFETGL-HKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QS F+GPG+ T + +I+HP +GLCV +S L LGPC ES AW+YT + +K
Sbjct: 387 QSAFQGPGLPNTQQPYYIIFHPQSGLCVLARSS-KLLELGPCDESNAWNYTSAYELVVKS 445
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CLQA+ VGK AKLG C+ S W +ISDSKMH+S++ A NGT VCL+ I T
Sbjct: 446 TGQCLQAQSVGKNAKLGTDCSKPSSKWHLISDSKMHVSAELAKNGTRVCLEASPDGAIRT 505
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
N CKCLS D TCDP SQWFK++ S+R S Q
Sbjct: 506 NQCKCLSVDPTCDPESQWFKVILSSRDIPGGDSILQL 542
>gi|11994430|dbj|BAB02432.1| unnamed protein product [Arabidopsis thaliana]
Length = 561
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 233/413 (56%), Positives = 302/413 (73%), Gaps = 11/413 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVR-NVVGMSLRNE 59
MVILDNHIS+PGWCCS++DGNGFFGD++ NP +WIKGL KMA++F V NVVGMSLRNE
Sbjct: 152 MVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKGLKKMASMFANVSSNVVGMSLRNE 211
Query: 60 LRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVF 119
LRGPKQN+KDWY+YM+ GAEAVH+ NP VLVI+SGLN+ DLSF+R + ++F K+VF
Sbjct: 212 LRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLNYATDLSFLRERPFEVSFRRKVVF 271
Query: 120 EAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDN 179
E HWYGF + W N N++CG+ + +M++SGFLLE+G PLFVSEFG D RGNN NDN
Sbjct: 272 EIHWYGFWN--TWEGDNLNKICGKETEKMMKMSGFLLEKGIPLFVSEFGIDQRGNNANDN 329
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
++L+CF +AA+ D DW+LWTL GSYY+RE IG +E YG+ D+NW IRNS+ L+ IS+
Sbjct: 330 KFLSCFMALAADRDLDWSLWTLAGSYYIREKSIGSDESYGVLDFNWSSIRNSTILQMISA 389
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISL-K 298
+Q+PF GPG+ ET K+++HP+TGLC+ RKS L LG C SE+W + H+ +SL +
Sbjct: 390 IQTPFIGPGLMETQPKKIMFHPSTGLCIVRKSLFQ-LKLGSCNRSESWRLSSHRVLSLAE 448
Query: 299 GAYFCLQAKHVGKPAKLGIICTDC-GSTWEIISDSKMHLSSKADNGTTVCLDVDS-SNTI 356
CL+A GK KL + ++ S W++ SDSKM LSS NG +VCLDVD+ +N I
Sbjct: 449 EQILCLKAYEKGKSVKLRLFFSESYCSKWKLFSDSKMQLSSITKNGFSVCLDVDTENNNI 508
Query: 357 VTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKDFIWK 409
VTN+CKCL + +CDP SQWFKLV STR + K F Q + L K F+ K
Sbjct: 509 VTNSCKCLRGNSSCDPRSQWFKLVTSTRRRSRPKPFLQ----ISLYSKTFLQK 557
>gi|297818128|ref|XP_002876947.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322785|gb|EFH53206.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 546
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 232/402 (57%), Positives = 301/402 (74%), Gaps = 11/402 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVR-NVVGMSLRNE 59
MVILDNHIS+PGWCCS++DGNGFFGD++ NP +WIKGL KMA++F V NVVGM LRNE
Sbjct: 144 MVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKGLKKMASMFANVSSNVVGMCLRNE 203
Query: 60 LRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVF 119
LRGPKQN+KDWY YM+ GAEAVH+ NP+VLVI+SGLN+ DLSF+R++ ++F KLVF
Sbjct: 204 LRGPKQNIKDWYTYMRKGAEAVHSMNPDVLVIVSGLNYATDLSFLRDRPFEVSFRRKLVF 263
Query: 120 EAHWYGFTDGQAWVDG-NPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVND 178
E HWYGF + ++G N N++CG+ + +M++SGFLLE+G PLFVSEFG D RGNN ND
Sbjct: 264 EIHWYGFWNS---LEGDNLNKICGKETEKMMKMSGFLLEKGVPLFVSEFGIDQRGNNAND 320
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
N++L+CF +AA+LD DW+LWTL GSYY+RE IG +E YG+ DWNW IRNS+ L+ IS
Sbjct: 321 NKFLSCFMALAADLDLDWSLWTLAGSYYIREKTIGTDETYGVLDWNWSSIRNSTILQMIS 380
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISL- 297
++QSPF+ G+ ET K+++HP+TGLC+ RKS L LG C SE+W + H+ +SL
Sbjct: 381 AIQSPFQ--GLMETQPKKIMFHPSTGLCIVRKSLFQ-LKLGSCNRSESWKLSSHRVLSLT 437
Query: 298 KGAYFCLQAKHVGKPAKLGIICTDC-GSTWEIISDSKMHLSSKADNGTTVCLDVDS-SNT 355
+ CL+A GK KL + +D S W+++SDSKM LSS NG ++CLDVDS +N
Sbjct: 438 EEKILCLKAYEDGKSVKLRLFFSDSYCSKWKLLSDSKMQLSSITKNGVSICLDVDSNNNN 497
Query: 356 IVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNP 397
IVTN+CKCL + +CDP SQWFKLV STR + K F Q +P
Sbjct: 498 IVTNSCKCLLGNSSCDPRSQWFKLVTSTRRRSKPKPFLQISP 539
>gi|357164198|ref|XP_003579979.1| PREDICTED: endoglucanase-like [Brachypodium distachyon]
Length = 558
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 228/397 (57%), Positives = 283/397 (71%), Gaps = 3/397 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS SDGNGFFGD+YF+P+ W+KGL+ MAT+F +NVVGMSLRNEL
Sbjct: 151 MVILDNQMTTPGWCCSRSDGNGFFGDKYFDPEEWLKGLSAMATMFRYTKNVVGMSLRNEL 210
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP QNV WYRYMQ GAEAVHAANP VLVILSGL+FD LSF+ ++ V L+FTGKLVFE
Sbjct: 211 RGPYQNVSLWYRYMQQGAEAVHAANPNVLVILSGLDFDNTLSFLFSKQVELSFTGKLVFE 270
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DG W D N N CG V+++ FLL+QGWPLF SE G D+ G ++ DNR
Sbjct: 271 QHWYGFSDGTDWEDWNQNDACGVAVESIRTKGLFLLKQGWPLFFSEIGFDMSGTHIPDNR 330
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG++ +E YGL W+W RN SF++RI+ L
Sbjct: 331 YLTCFISVAAEMDLDWAIWALQGSYYIREGILAYDESYGLLSWDWYTARNPSFIKRINYL 390
Query: 241 QSPFRGPGVFETGL-HKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + + I+HP TGLCV KS + L LGPC ES AW+YT + LK
Sbjct: 391 QSPFQGPGLPNSKKPYNAIFHPLTGLCVLVKS-PNSLELGPCDESNAWNYTSTCELVLKH 449
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
+ CL+AK VG AKLG + + S W++ISDS+MH+S++ NGT VCLD I T
Sbjct: 450 SGQCLEAKSVGDIAKLGTVSSKSCSKWQLISDSRMHVSTELTKNGTRVCLDASPDGVIST 509
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
N CKCL D TC+P SQWFK++ S++ S Q
Sbjct: 510 NQCKCLIEDPTCNPESQWFKVILSSKGIPGEASVLQL 546
>gi|326532384|dbj|BAK05121.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 228/399 (57%), Positives = 286/399 (71%), Gaps = 9/399 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCC N+DGNGFFGD YF PD+W+ GLTKMAT+F GV +VVGMSLRNEL
Sbjct: 155 MVILDNHVSKPGWCCDNADGNGFFGDGYFEPDVWVDGLTKMATMFAGVPHVVGMSLRNEL 214
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN DWY+YMQ GAEAVHAANP VLVILSGL+FD DL+F+ ++ V+L+F K FE
Sbjct: 215 RGPRQNSNDWYKYMQRGAEAVHAANPRVLVILSGLSFDNDLAFLNSRQVSLSFARKAAFE 274
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F++GQ W GNPN+VC R+ +V R + +LL+QGWP+F+SEFG D RG N NDNR
Sbjct: 275 VHWYSFSNGQEWASGNPNEVCARIGASVSRRALYLLDQGWPVFLSEFGVDNRGGNANDNR 334
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C VAA+LD DWALWTL GSYYLR+GV+ L+E YG+ D W RN + L R+ L
Sbjct: 335 YYGCAAAVAADLDLDWALWTLQGSYYLRQGVLDLDEVYGVLDRAWSSPRNHTALRRVQPL 394
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHK-TISLKG 299
Q P RGPG E + V++HPATGLCV R+S PL LG C +EAW Y P + ++L+
Sbjct: 395 QRPLRGPGYAEAAPYTVLFHPATGLCVLRRSPTQPLELGACANTEAWEYAPQQGRLALRD 454
Query: 300 A-YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKA------DNGTTVCLDVDS 352
+ CL A+ G P + G+ C D S W ++SDSK+H++ A G +CLDVD+
Sbjct: 455 SPLMCLHAQGAGLPVRFGMPCDDDMSRWRLVSDSKLHVAVNASSSGSGSGGGMLCLDVDA 514
Query: 353 -SNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTK 390
++VTN C+CLS D +CDP +QWFKLV STRS +
Sbjct: 515 DGQSVVTNPCRCLSADNSCDPENQWFKLVTSTRSVAASS 553
>gi|15231513|ref|NP_189244.1| Cellulase (glycosyl hydrolase family 5) protein [Arabidopsis
thaliana]
gi|332643603|gb|AEE77124.1| Cellulase (glycosyl hydrolase family 5) protein [Arabidopsis
thaliana]
Length = 551
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 231/413 (55%), Positives = 300/413 (72%), Gaps = 13/413 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVR-NVVGMSLRNE 59
MVILDNHIS+PGWCCS++DGNGFFGD++ NP +WIKGL KMA++F V NVVGMSLRNE
Sbjct: 144 MVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKGLKKMASMFANVSSNVVGMSLRNE 203
Query: 60 LRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVF 119
LRGPKQN+KDWY+YM+ GAEAVH+ NP VLVI+SGLN+ DLSF+R + ++F K+VF
Sbjct: 204 LRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLNYATDLSFLRERPFEVSFRRKVVF 263
Query: 120 EAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDN 179
E HWYGF + W N N++CG+ + +M++SGFLLE+G PLFVSEFG D RGNN NDN
Sbjct: 264 EIHWYGFWN--TWEGDNLNKICGKETEKMMKMSGFLLEKGIPLFVSEFGIDQRGNNANDN 321
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
++L+CF +AA+ D DW+LWTL GSYY+RE IG +E YG+ D+NW IRNS+ L+ IS+
Sbjct: 322 KFLSCFMALAADRDLDWSLWTLAGSYYIREKSIGSDESYGVLDFNWSSIRNSTILQMISA 381
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISL-K 298
+Q+PF G+ ET K+++HP+TGLC+ RKS L LG C SE+W + H+ +SL +
Sbjct: 382 IQTPF--IGLMETQPKKIMFHPSTGLCIVRKSLFQ-LKLGSCNRSESWRLSSHRVLSLAE 438
Query: 299 GAYFCLQAKHVGKPAKLGIICTDC-GSTWEIISDSKMHLSSKADNGTTVCLDVDS-SNTI 356
CL+A GK KL + ++ S W++ SDSKM LSS NG +VCLDVD+ +N I
Sbjct: 439 EQILCLKAYEKGKSVKLRLFFSESYCSKWKLFSDSKMQLSSITKNGFSVCLDVDTENNNI 498
Query: 357 VTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKDFIWK 409
VTN+CKCL + +CDP SQWFKLV STR + K F Q + L K F+ K
Sbjct: 499 VTNSCKCLRGNSSCDPRSQWFKLVTSTRRRSRPKPFLQ----ISLYSKTFLQK 547
>gi|125548741|gb|EAY94563.1| hypothetical protein OsI_16341 [Oryza sativa Indica Group]
Length = 552
Score = 466 bits (1200), Expect = e-129, Method: Compositional matrix adjust.
Identities = 228/404 (56%), Positives = 283/404 (70%), Gaps = 11/404 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS +DGNGFFGD+YF+P+ W+ GL MAT+F +NVVGMSLRNEL
Sbjct: 147 MVILDNQMTTPGWCCSTTDGNGFFGDKYFDPEEWLNGLKTMATMFRKTKNVVGMSLRNEL 206
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP NV WYRYM+ GAEAVH ANP+VLVILSGL FD L+FV ++L+FTGKLVFE
Sbjct: 207 RGPYANVSLWYRYMKEGAEAVHTANPDVLVILSGLEFDNTLNFVVPNQIHLSFTGKLVFE 266
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DG W N N VCG VV + FLL+QGWPLF SEFG D+ G + DNR
Sbjct: 267 QHWYGFSDGGNWESQNQNDVCGMVVGFIKNKGLFLLQQGWPLFFSEFGFDMSGTHTGDNR 326
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG + +E YGL W+WC RN SF++RI+SL
Sbjct: 327 YLTCFLSVAAEMDLDWAIWALQGSYYIREGTLAYDESYGLLSWDWCTARNPSFIKRINSL 386
Query: 241 QSPFRGPGVFETG-LHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + + VI+HP +GLCV KS + L LGPC ES AW+YT + L+
Sbjct: 387 QSPFQGPGLPNSQEPYNVIFHPLSGLCVVVKSS-EALELGPCDESNAWNYTSTHELVLQH 445
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CLQAK VG+ A+LG C+ S W++IS+S MH+S++ NGT VCLD I T
Sbjct: 446 TGQCLQAKSVGENAQLGTDCSKSSSKWQLISNSGMHVSTELTKNGTGVCLDATPDGIITT 505
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLP 402
N CKCL+ D C+P SQWFK++ S+R + T+ IL LP
Sbjct: 506 NPCKCLTGDPICNPESQWFKIILSSRHTGTS--------ILQLP 541
>gi|212274843|ref|NP_001130915.1| hydrolase, hydrolyzing O-glycosyl compound precursor [Zea mays]
gi|194702216|gb|ACF85192.1| unknown [Zea mays]
gi|195604160|gb|ACG23910.1| hydrolase, hydrolyzing O-glycosyl compounds [Zea mays]
gi|223947731|gb|ACN27949.1| unknown [Zea mays]
gi|223948787|gb|ACN28477.1| unknown [Zea mays]
gi|224031097|gb|ACN34624.1| unknown [Zea mays]
gi|224031425|gb|ACN34788.1| unknown [Zea mays]
gi|414586726|tpg|DAA37297.1| TPA: hydrolase, hydrolyzing O-glycosyl compound [Zea mays]
Length = 576
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 229/413 (55%), Positives = 294/413 (71%), Gaps = 15/413 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCCSN+DG+GFFGD YF+PD+W+ GLTKMAT+F GV NVVGMSLRNEL
Sbjct: 154 MVILDNHLSKPGWCCSNTDGSGFFGDTYFDPDVWVDGLTKMATMFAGVPNVVGMSLRNEL 213
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QNV DWY+YMQ GAEAVHAANP VLVILSGL+FD DL+F+ ++ VNL+FTGK+ FE
Sbjct: 214 RGPRQNVNDWYKYMQRGAEAVHAANPRVLVILSGLSFDNDLAFLNSRPVNLSFTGKVAFE 273
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F++G W GN NQ C R+ + R + +LL++GWP+ +SEFG D RG N NDNR
Sbjct: 274 VHWYSFSNGPEWSSGNANQACARITAGITRRAFYLLDRGWPVILSEFGVDNRGVNTNDNR 333
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C AA+LD DWALW L GSYYLREGV+GL+E YG+ DW WC RN + L R+ +L
Sbjct: 334 YYGCAAAAAADLDLDWALWALQGSYYLREGVLGLDEVYGVLDWAWCRPRNDTALRRVQAL 393
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLT------LGPCTESEAWSYTPHKT 294
Q P RGPG+ E + V++HP +G+CV R+S LT LGPC E+EAW Y+ +
Sbjct: 394 QRPLRGPGLAEAAPYTVLFHPVSGMCVVRRSPTPSLTQPFQLSLGPCNETEAWEYSAQQQ 453
Query: 295 -ISLK-GAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGT------TV 346
++L+ A CL+A+ G+PA L + C D + W +SDSK+H++ A + +
Sbjct: 454 RLALRDAALLCLRAEGAGRPASLAVSCGDAMARWGFVSDSKLHVAVNATSSAGSVGDGLL 513
Query: 347 CLDVDS-SNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPI 398
CLDV + ++VTN C+CLS D +CDP QWFKLV STRS T ++ P+
Sbjct: 514 CLDVGADGRSVVTNPCRCLSADNSCDPQGQWFKLVSSTRSVVTKQTMLAQLPL 566
>gi|194690434|gb|ACF79301.1| unknown [Zea mays]
Length = 576
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 229/413 (55%), Positives = 294/413 (71%), Gaps = 15/413 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCCSN+DG+GFFGD YF+PD+W+ GLTKMAT+F GV NVVGMSLRNEL
Sbjct: 154 MVILDNHLSKPGWCCSNTDGSGFFGDTYFDPDVWVDGLTKMATMFAGVPNVVGMSLRNEL 213
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QNV DWY+YMQ GAEAVHAANP VLVILSGL+FD DL+F+ ++ VNL+FTGK+ FE
Sbjct: 214 RGPRQNVNDWYKYMQRGAEAVHAANPRVLVILSGLSFDNDLAFLNSRPVNLSFTGKVAFE 273
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F++G W GN NQ C R+ + R + +LL++GWP+ +SEFG D RG N NDNR
Sbjct: 274 VHWYSFSNGPEWSSGNANQACARITAGITRRAFYLLDRGWPVILSEFGVDNRGVNTNDNR 333
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C AA+LD DWALW L GSYYLREGV+GL+E YG+ DW WC RN + L R+ +L
Sbjct: 334 YYGCAAAAAADLDLDWALWALQGSYYLREGVLGLDEVYGVLDWAWCRPRNDTALRRVQAL 393
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLT------LGPCTESEAWSYTPHKT 294
Q P RGPG+ E + V++HP +G+CV R+S LT LGPC E+EAW Y+ +
Sbjct: 394 QRPLRGPGLAEAAPYTVLFHPVSGMCVVRRSPTPSLTQPFQLSLGPCNETEAWEYSAQQQ 453
Query: 295 -ISLK-GAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGT------TV 346
++L+ A CL+A+ G+PA L + C D + W +SDSK+H++ A + +
Sbjct: 454 RLALRNAALLCLRAEGAGRPASLAVSCGDAMARWGFVSDSKLHVAVNATSSAGSVGDGLL 513
Query: 347 CLDVDS-SNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPI 398
CLDV + ++VTN C+CLS D +CDP QWFKLV STRS T ++ P+
Sbjct: 514 CLDVGADGRSVVTNPCRCLSADNSCDPQGQWFKLVSSTRSVVTKQTMLAQLPL 566
>gi|194702100|gb|ACF85134.1| unknown [Zea mays]
Length = 492
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 229/413 (55%), Positives = 294/413 (71%), Gaps = 15/413 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCCSN+DG+GFFGD YF+PD+W+ GLTKMAT+F GV NVVGMSLRNEL
Sbjct: 70 MVILDNHLSKPGWCCSNTDGSGFFGDTYFDPDVWVDGLTKMATMFAGVPNVVGMSLRNEL 129
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QNV DWY+YMQ GAEAVHAANP VLVILSGL+FD DL+F+ ++ VNL+FTGK+ FE
Sbjct: 130 RGPRQNVNDWYKYMQRGAEAVHAANPRVLVILSGLSFDNDLAFLNSRPVNLSFTGKVAFE 189
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F++G W GN NQ C R+ + R + +LL++GWP+ +SEFG D RG N NDNR
Sbjct: 190 VHWYSFSNGPEWSSGNANQACARITAGITRRAFYLLDRGWPVILSEFGVDNRGVNTNDNR 249
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C AA+LD DWALW L GSYYLREGV+GL+E YG+ DW WC RN + L R+ +L
Sbjct: 250 YYGCAAAAAADLDLDWALWALQGSYYLREGVLGLDEVYGVLDWAWCRPRNDTALRRVQAL 309
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLT------LGPCTESEAWSYTPHKT 294
Q P RGPG+ E + V++HP +G+CV R+S LT LGPC E+EAW Y+ +
Sbjct: 310 QRPLRGPGLAEAAPYTVLFHPVSGMCVVRRSPTPSLTQPFQLSLGPCNETEAWEYSAQQQ 369
Query: 295 -ISLK-GAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGT------TV 346
++L+ A CL+A+ G+PA L + C D + W +SDSK+H++ A + +
Sbjct: 370 RLALRDAALLCLRAEGAGRPASLAVSCGDAMARWGFVSDSKLHVAVNATSSAGSVGDGLL 429
Query: 347 CLDVDS-SNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPI 398
CLDV + ++VTN C+CLS D +CDP QWFKLV STRS T ++ P+
Sbjct: 430 CLDVGADGRSVVTNPCRCLSADNSCDPQGQWFKLVSSTRSVVTKQTMLAQLPL 482
>gi|242073478|ref|XP_002446675.1| hypothetical protein SORBIDRAFT_06g020270 [Sorghum bicolor]
gi|241937858|gb|EES11003.1| hypothetical protein SORBIDRAFT_06g020270 [Sorghum bicolor]
Length = 586
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 234/433 (54%), Positives = 298/433 (68%), Gaps = 28/433 (6%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCCSN+DGNGFFGD F+PD+W+ GLTKMAT+F GV NVVGMSLRNEL
Sbjct: 156 MVILDNHLSKPGWCCSNTDGNGFFGDALFDPDVWVDGLTKMATMFAGVPNVVGMSLRNEL 215
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN DWY+YMQ GAEAVHAANP VLVILSGL FD DL+F+ ++ VNL+FTGK+ FE
Sbjct: 216 RGPRQNANDWYKYMQRGAEAVHAANPRVLVILSGLQFDNDLAFLNSRPVNLSFTGKVAFE 275
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F++ Q W GN NQ C R+ + R + +LL++GWP+ +SEFG D RG N NDNR
Sbjct: 276 VHWYSFSNTQEWSSGNANQACARITAGIARRAFYLLDRGWPVILSEFGVDNRGGNTNDNR 335
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C AA+LD DWALW L GSYYLREGV+GL+E YG+ DW WC RN + L R+ +L
Sbjct: 336 YYGCAAAAAADLDLDWALWALQGSYYLREGVLGLDEVYGVLDWAWCKPRNDTALRRVQAL 395
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLC--VQRKS----FLDP--LTLGPCTESEAWSYTPH 292
Q P RGPG+ E + V++HP +G+C V+R+S P L LGPC+E+ AW Y+
Sbjct: 396 QRPLRGPGLAEAAPYAVLFHPVSGMCVVVRRRSPTPTLAQPFELGLGPCSETGAWEYSAQ 455
Query: 293 KT-ISLK--GAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKA--------- 340
+ + L+ A CL+A+ G+PA LG+ C D + W ++SDSK+H++ A
Sbjct: 456 QQRLGLRDTAALLCLRAEGAGRPATLGVTCGDAMARWSLVSDSKLHVAVNATSSSSAGVS 515
Query: 341 --DNGTTVCLDVDS-SNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNP 397
D +CLDV + ++VTN C+CLS D +CDP SQWFKLV STRS T ++
Sbjct: 516 DSDGNGLLCLDVGADGRSVVTNPCRCLSADNSCDPQSQWFKLVSSTRSVVTKQTM----- 570
Query: 398 ILDLPGKDFIWKF 410
+ LP K WK
Sbjct: 571 LAQLPLKLKSWKI 583
>gi|296081386|emb|CBI16819.3| unnamed protein product [Vitis vinifera]
Length = 310
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 211/267 (79%), Positives = 240/267 (89%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCCS+ DGNGFFGDQYFNPDLW++GLT+MAT+F GV NVVGMSLRNEL
Sbjct: 23 MVILDNHLSKPGWCCSSFDGNGFFGDQYFNPDLWVQGLTRMATMFRGVTNVVGMSLRNEL 82
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNVKDWYRYMQ GAEAVH+ANP+VLVI+SGL++D DLSFV Q + LTFTGKLVFE
Sbjct: 83 RGPKQNVKDWYRYMQKGAEAVHSANPDVLVIVSGLSYDTDLSFVLKQELELTFTGKLVFE 142
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGFTDG AW G+PNQVCGRVV++VMR G LLE+GWPLFVSEFG D RG NVNDNR
Sbjct: 143 MHWYGFTDGSAWETGSPNQVCGRVVESVMRRGGVLLEKGWPLFVSEFGVDQRGTNVNDNR 202
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YLNCFFG+AAELD+DWALWTLVGSYY REGVIGL E+YGL +WNWC++RNSSFL+RIS+L
Sbjct: 203 YLNCFFGLAAELDFDWALWTLVGSYYTREGVIGLEEFYGLLNWNWCEVRNSSFLQRISAL 262
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCV 267
QSPF+GP + + HKVI+HPATGLC+
Sbjct: 263 QSPFQGPDLSDARPHKVIFHPATGLCI 289
>gi|357167845|ref|XP_003581360.1| PREDICTED: uncharacterized protein LOC100839976 [Brachypodium
distachyon]
Length = 601
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 220/390 (56%), Positives = 285/390 (73%), Gaps = 6/390 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S P WCCSN+DGNGFFGD YF PD+W+ GLTKMAT F V NVVGMSLRNEL
Sbjct: 189 MVILDNHVSMPKWCCSNTDGNGFFGDAYFQPDVWVDGLTKMATTFAAVPNVVGMSLRNEL 248
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN DWY+YMQ GAEAVHAANP LVILSGL++D DL+F+ ++ V L+F K FE
Sbjct: 249 RGPRQNANDWYKYMQRGAEAVHAANPRALVILSGLSYDNDLAFLSSRQVTLSFARKAAFE 308
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F++ Q W NPN+ C R+ +V R + +LL+QGWP+F+SEFG D RG N NDNR
Sbjct: 309 VHWYSFSNSQDWASSNPNEACARIGASVSRRALYLLDQGWPVFLSEFGVDNRGGNANDNR 368
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C VAA+LD DWALWTL GSYYLREGV L+E YG+ D +W + RN++ L R+ +L
Sbjct: 369 YYGCAAAVAADLDLDWALWTLQGSYYLREGVRDLDEVYGVLDRSWRNPRNATALRRVRAL 428
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSF-LDPLTLGPCTESEAWSYTPHKT-ISLK 298
Q FRGPG E + ++HP+TGLCV R+S + PL LG C +EAW+Y + ++L+
Sbjct: 429 QRAFRGPGFAEAAPYVALFHPSTGLCVARRSSPVSPLELGSCEGAEAWAYDAQRQRLALR 488
Query: 299 GA-YFCLQAKHVGKPAKLGIIC-TDCGSTWEIISDSKMHLSSKADNGT-TVCLDVDS-SN 354
+ CL+A+ G+P ++G+ C TD + W ++SDSK+H+++ A +G +CLDV +
Sbjct: 489 DSPLMCLRAEGAGRPVRVGMACATDEMARWRLVSDSKLHVAANASSGAGMLCLDVGADGR 548
Query: 355 TIVTNTCKCLSRDKTCDPASQWFKLVDSTR 384
++VTNTC+CLS D +CDP SQWFKLV STR
Sbjct: 549 SVVTNTCRCLSADNSCDPESQWFKLVSSTR 578
>gi|38345245|emb|CAD41089.2| OSJNBb0011N17.6 [Oryza sativa Japonica Group]
gi|125590763|gb|EAZ31113.1| hypothetical protein OsJ_15211 [Oryza sativa Japonica Group]
Length = 550
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 224/404 (55%), Positives = 281/404 (69%), Gaps = 13/404 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS +DGNGFFGD+YF+P+ W+ GL MAT+F +NVVGMSLRNEL
Sbjct: 147 MVILDNQMTTPGWCCSTTDGNGFFGDKYFDPEEWLNGLKTMATMFRKTKNVVGMSLRNEL 206
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP +NV WY M+ GAE+VH ANP+VLVILSGL FD L+FV ++L+FTGKLVFE
Sbjct: 207 RGPYENVSLWY--MKEGAESVHTANPDVLVILSGLEFDNTLNFVVPNQIHLSFTGKLVFE 264
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DG W N N VCG VV + FLL+QGWPLF SEFG D+ G + DNR
Sbjct: 265 QHWYGFSDGGNWESQNQNDVCGMVVGFIKNKGLFLLQQGWPLFFSEFGFDMSGTHTGDNR 324
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG + +E YGL W+WC RN SF++RI+SL
Sbjct: 325 YLTCFLSVAAEMDLDWAIWALQGSYYIREGTLAYDESYGLLSWDWCTARNPSFIKRINSL 384
Query: 241 QSPFRGPGVFETG-LHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + + VI+HP +GLCV KS + L LGPC ES AW+YT + L+
Sbjct: 385 QSPFQGPGLPNSQEPYNVIFHPLSGLCVVVKSS-EALELGPCDESNAWNYTSTHELVLQH 443
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CLQ K VG+ A+LG C+ S W++IS+S MH+S++ NGT VCLD I T
Sbjct: 444 TGQCLQVKSVGENAQLGTDCSKSSSKWQLISNSGMHVSTELTKNGTRVCLDATPDGIITT 503
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLP 402
N CKCL+ D C+P SQWFK++ S+R + T+ IL LP
Sbjct: 504 NPCKCLTGDPNCNPESQWFKIILSSRHTGTS--------ILQLP 539
>gi|90265051|emb|CAH67676.1| H0510A06.1 [Oryza sativa Indica Group]
gi|116309845|emb|CAH66881.1| OSIGBa0158F13.12 [Oryza sativa Indica Group]
Length = 553
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 224/404 (55%), Positives = 281/404 (69%), Gaps = 13/404 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS +DGNGFFGD+YF+P+ W+ GL MAT+F +NVVGMSLRNEL
Sbjct: 150 MVILDNQMTTPGWCCSTTDGNGFFGDKYFDPEEWLNGLKTMATMFRKTKNVVGMSLRNEL 209
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP +NV WY M+ GAE+VH ANP+VLVILSGL FD L+FV ++L+FTGKLVFE
Sbjct: 210 RGPYENVSLWY--MKEGAESVHTANPDVLVILSGLEFDNTLNFVVPNQIHLSFTGKLVFE 267
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+DG W N N VCG VV + FLL+QGWPLF SEFG D+ G + DNR
Sbjct: 268 QHWYGFSDGGNWESQNQNDVCGMVVGFIKNKGLFLLQQGWPLFFSEFGFDMSGTHTGDNR 327
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG + +E YGL W+WC RN SF++RI+SL
Sbjct: 328 YLTCFLSVAAEMDLDWAIWALQGSYYIREGTLAYDESYGLLSWDWCTARNPSFIKRINSL 387
Query: 241 QSPFRGPGVFETG-LHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + + VI+HP +GLCV KS + L LGPC ES AW+YT + L+
Sbjct: 388 QSPFQGPGLPNSQEPYNVIFHPLSGLCVVVKSS-EALELGPCDESNAWNYTSTHELVLQH 446
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CLQ K VG+ A+LG C+ S W++IS+S MH+S++ NGT VCLD I T
Sbjct: 447 TGQCLQVKSVGENAQLGTDCSKSSSKWQLISNSGMHVSTELTKNGTRVCLDATPDGIITT 506
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQFNPILDLP 402
N CKCL+ D C+P SQWFK++ S+R + T+ IL LP
Sbjct: 507 NPCKCLTGDPNCNPESQWFKIILSSRHTGTS--------ILQLP 542
>gi|115459010|ref|NP_001053105.1| Os04g0480900 [Oryza sativa Japonica Group]
gi|38345244|emb|CAD41088.2| OSJNBb0011N17.5 [Oryza sativa Japonica Group]
gi|113564676|dbj|BAF15019.1| Os04g0480900 [Oryza sativa Japonica Group]
gi|215766720|dbj|BAG98948.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222629069|gb|EEE61201.1| hypothetical protein OsJ_15210 [Oryza sativa Japonica Group]
Length = 554
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 225/397 (56%), Positives = 281/397 (70%), Gaps = 4/397 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS +DGNGFFGD+YF P+ W+KGL+ MAT+F +NVVGMSLRNEL
Sbjct: 148 MVILDNQMTTPGWCCSENDGNGFFGDKYFGPEEWLKGLSAMATMFRNTKNVVGMSLRNEL 207
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG K NV W+RYMQLGAE VHAANP VLVILSGLNFD L F+ + LTFTGKLVFE
Sbjct: 208 RGSK-NVSLWFRYMQLGAEVVHAANPGVLVILSGLNFDNTLDFLVPNQIQLTFTGKLVFE 266
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+D W N N CG VVD++ + FLL+QGWPLF SEFG D+ G +V DNR
Sbjct: 267 QHWYGFSDDGNWGSQNQNDACGMVVDSIKKKGLFLLQQGWPLFFSEFGFDMSGTHVADNR 326
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG + +E YGL W+WC RN SF++RI++L
Sbjct: 327 YLTCFLTVAAEMDLDWAIWALQGSYYIREGTLAYDESYGLLSWDWCTARNPSFIKRINAL 386
Query: 241 QSPFRGPGVFETGL-HKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + + VI+HP +GLCV KS + L LGPC +S AW+YT + LK
Sbjct: 387 QSPFQGPGLPNSQQPYNVIFHPLSGLCVLVKSS-EALELGPCDKSNAWNYTKGYELILKQ 445
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CLQAK VG+ AKLG C+ S W+++S+S MH+S++ +NGT VCL+ I T
Sbjct: 446 TGQCLQAKSVGENAKLGTSCSKSSSKWKLLSNSGMHVSTELTNNGTRVCLEASPDGNITT 505
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
N CKCL+ D C+P SQWFK++ S++ S Q
Sbjct: 506 NQCKCLTVDPNCNPESQWFKIILSSKHIPGGTSILQL 542
>gi|116309844|emb|CAH66880.1| OSIGBa0158F13.11 [Oryza sativa Indica Group]
Length = 554
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 225/397 (56%), Positives = 281/397 (70%), Gaps = 4/397 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS +DGNGFFGD+YF P+ W+KGL+ MAT+F +NVVGMSLRNEL
Sbjct: 148 MVILDNQMTTPGWCCSENDGNGFFGDKYFGPEEWLKGLSAMATMFRNTKNVVGMSLRNEL 207
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG K NV W+RYMQLGAE VHAANP VLVILSGLNFD L F+ + LTFTGKLVFE
Sbjct: 208 RGSK-NVSLWFRYMQLGAEVVHAANPGVLVILSGLNFDNTLDFLVPNQIQLTFTGKLVFE 266
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+D W N N CG VVD++ + FLL+QGWPLF SEFG D+ G +V DNR
Sbjct: 267 QHWYGFSDDGNWGSQNQNDACGMVVDSIKKKGLFLLQQGWPLFFSEFGFDMSGTHVADNR 326
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG + +E YGL W+WC RN SF++RI++L
Sbjct: 327 YLTCFLTVAAEMDLDWAIWALQGSYYIREGTLAYDESYGLLSWDWCTARNPSFIKRINAL 386
Query: 241 QSPFRGPGVFETGL-HKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + + VI+HP +GLCV KS + L LGPC +S AW+YT + LK
Sbjct: 387 QSPFQGPGLPNSQQPYNVIFHPLSGLCVLVKSS-EALELGPCDKSNAWNYTKGYELILKQ 445
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CLQAK VG+ AKLG C+ S W+++S+S MH+S++ +NGT VCL+ I T
Sbjct: 446 TGQCLQAKSVGENAKLGTSCSKSSSKWKLLSNSGMHVSTELTNNGTRVCLEASPDGNITT 505
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
N CKCL+ D C+P SQWFK++ S++ S Q
Sbjct: 506 NQCKCLTVDPNCNPESQWFKIILSSKHIPGGTSILQL 542
>gi|218195067|gb|EEC77494.1| hypothetical protein OsI_16340 [Oryza sativa Indica Group]
Length = 554
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 225/397 (56%), Positives = 280/397 (70%), Gaps = 4/397 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDN ++ PGWCCS +DGNGFFGD+YF P+ W+KGL+ MAT+F +NVVGMSLRNEL
Sbjct: 148 MVILDNQMTTPGWCCSENDGNGFFGDKYFGPEEWLKGLSAMATMFRNTKNVVGMSLRNEL 207
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG K NV W+RYMQLGAE VHAANP VLVILSGLNFD L F+ + LTFTGKLVFE
Sbjct: 208 RGSK-NVSLWFRYMQLGAEVVHAANPGVLVILSGLNFDNTLDFLVPNQIQLTFTGKLVFE 266
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWYGF+D W N N CG VVD + + F+L+QGWPLF SEFG D+ G +V DNR
Sbjct: 267 QHWYGFSDDGNWGSRNQNDACGMVVDFIKKKGLFILQQGWPLFFSEFGFDMSGTHVADNR 326
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL CF VAAE+D DWA+W L GSYY+REG + +E YGL W+WC RN SF++RI++L
Sbjct: 327 YLTCFLTVAAEMDLDWAIWALQGSYYIREGTLAYDESYGLLSWDWCTARNPSFIKRINAL 386
Query: 241 QSPFRGPGVFETGL-HKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
QSPF+GPG+ + + VI+HP +GLCV KS + L LGPC +S AW+YT + LK
Sbjct: 387 QSPFQGPGLPNSHQPYNVIFHPLSGLCVLVKSS-EALELGPCDKSNAWNYTKGYELILKQ 445
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK-ADNGTTVCLDVDSSNTIVT 358
CLQAK VG+ AKLG C+ S W+++S+S MH+S++ +NGT VCLD I T
Sbjct: 446 TGQCLQAKSVGENAKLGTSCSKSSSKWKLLSNSGMHVSTELTNNGTRVCLDASPDGNITT 505
Query: 359 NTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
N CKCL+ D C+P SQWFK++ S++ S Q
Sbjct: 506 NQCKCLTVDPDCNPESQWFKIILSSKHIPGGTSILQL 542
>gi|15231514|ref|NP_189245.1| Cellulase (glycosyl hydrolase family 5) protein [Arabidopsis
thaliana]
gi|11994432|dbj|BAB02434.1| unnamed protein product [Arabidopsis thaliana]
gi|332643604|gb|AEE77125.1| Cellulase (glycosyl hydrolase family 5) protein [Arabidopsis
thaliana]
Length = 508
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 217/391 (55%), Positives = 283/391 (72%), Gaps = 13/391 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++KPGWCC +DGNGFFGD +F+P WI GLTK+A F G NVVGMSLRNEL
Sbjct: 124 MVILDNHVTKPGWCCGYNDGNGFFGDTFFDPTTWIAGLTKIAMTFKGATNVVGMSLRNEL 183
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DW++YMQ GAEAVH ANP VLVILSGL++D DLSFVR++ VNLTFT KLVFE
Sbjct: 184 RGPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRHVNLTFTRKLVFE 243
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
H Y FT+ W NPN+ CG ++ ++ GF L + +P+F+SEFG DLRG NVNDNR
Sbjct: 244 LHRYSFTNTNTWSSKNPNEACGEILKSIENGGGFNL-RDFPVFLSEFGIDLRGKNVNDNR 302
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y+ C G AAE D DW++WTL GSYYLREGV+G++E+YG+ D +W +R+ SFL+R+S +
Sbjct: 303 YIGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEFYGILDSDWVRVRSQSFLQRLSLI 362
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDP--LTLGPCTESEAWSYTPHKTISLK 298
SP +GPG ++ ++ +++HP TGLC+ +S LDP +TLG C ES+ WSYTP T++LK
Sbjct: 363 LSPLQGPGS-QSKVYNLVFHPLTGLCM-LQSILDPTKVTLGLCNESQPWSYTPQNTLTLK 420
Query: 299 GAYFCLQAKHVGKPAKLGIICTDCG----STWEIISDSKMHLSSKADNGTTVCLDVDSSN 354
CL++ P KL T C S WE IS S M L++K+ N ++CLDVD +N
Sbjct: 421 DKSLCLESTGPNAPVKLSE--TSCSSPNLSEWETISASNMLLAAKSTNN-SLCLDVDETN 477
Query: 355 TIVTNTCKCLS-RDKTCDPASQWFKLVDSTR 384
++ + CKC+ D +CDP SQWFK+V ++
Sbjct: 478 NLMASNCKCVKGEDSSCDPISQWFKIVKVSK 508
>gi|125540130|gb|EAY86525.1| hypothetical protein OsI_07905 [Oryza sativa Indica Group]
Length = 582
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 221/418 (52%), Positives = 281/418 (67%), Gaps = 33/418 (7%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S+PGWCC++ DGNGFFGD++F PD W++GL MA +F GV NVVGMSLRNEL
Sbjct: 147 MVILDNHVSRPGWCCADDDGNGFFGDRHFVPDAWVRGLGAMAALFAGVPNVVGMSLRNEL 206
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFT----GK 116
RGP+QN DWYRYMQ+GAEAVHAANP LVI+ GL +D DLSF+ + V+++F GK
Sbjct: 207 RGPRQNADDWYRYMQMGAEAVHAANPAALVIMGGLGYDTDLSFLAARPVDVSFAAAERGK 266
Query: 117 LVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNV 176
LVFE HWY F D +AW + N+VCGRV V R GFLL+ G+PLF+SEFGAD RG +
Sbjct: 267 LVFELHWYSFADARAWESEDANEVCGRVARGVARRGGFLLDAGFPLFLSEFGADTRGGSR 326
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
D+RYL C VAAELD DWALW L GSY LR+GV G +E YG+ DW+W RN++ L R
Sbjct: 327 KDDRYLPCAAAVAAELDLDWALWALQGSYALRQGVAGADEVYGVLDWSWSKPRNATALSR 386
Query: 237 ISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRK------SFLDPLTLGPCTESEAWSYT 290
I SLQ P RGPG E + V++HP TG CV R+ + L LG C +++AW+YT
Sbjct: 387 IQSLQRPLRGPGYDEARPYTVLFHPLTGRCVVRRAADDAAAAAATLELGRCEDTDAWAYT 446
Query: 291 -PHKTISLKGA-----YFCLQAKHVGKPAKL-----GIICTDCGSTWEIISDSKMHLS-- 337
P T++++GA CL+A+ G+PA+L G D STW ++S S MH++
Sbjct: 447 QPASTLAMRGAGRGSPPLCLRAEGSGRPARLATSDAGGCRGDALSTWRLVSGSTMHVAVN 506
Query: 338 -------SKADNGTTVCLDV-DSSNTIVTNTCKCL--SRDKTCDPASQWFKLVDSTRS 385
S+ G +CLDV D ++VTN C+CL + CDP +QWFKLV STRS
Sbjct: 507 ATTTTTPSRDGGGGLLCLDVGDDGRSVVTNPCRCLDDAAAGECDPETQWFKLVTSTRS 564
>gi|297818130|ref|XP_002876948.1| glycosyl hydrolase family 5 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297322786|gb|EFH53207.1| glycosyl hydrolase family 5 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 509
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 217/386 (56%), Positives = 282/386 (73%), Gaps = 10/386 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++KPGWCC +DGNGFFGD +F+P WI GL K+AT F G NVVGMSLRNEL
Sbjct: 124 MVILDNHVTKPGWCCGYNDGNGFFGDTFFDPATWISGLIKIATTFKGASNVVGMSLRNEL 183
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQNV DW++YMQ GAEAVH ANP VLVILSGL++D DLSFVR++ VNLTF+ KLVFE
Sbjct: 184 RGPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRPVNLTFSRKLVFE 243
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
H Y FT+ + W NPN+ CG ++ ++ GF L + +P+F+SEFG DLRG NVNDNR
Sbjct: 244 LHRYSFTNTKTWSSKNPNEACGEILQSIENGGGFNL-RDFPVFLSEFGIDLRGKNVNDNR 302
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y+ C G AAE D DW++WTL GSYYLREGV+G++EYYG+ D +W +R+ +FL+R+S +
Sbjct: 303 YIGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEYYGILDSDWVRVRSQNFLQRLSLI 362
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDP--LTLGPCTESEAWSYTPHKT-ISL 297
QSP +GPG ++ ++ +++HP TGLC+ +S LDP +TLG C ES+ WSYTP T ++L
Sbjct: 363 QSPLQGPGS-QSKVYNLVFHPLTGLCM-LQSILDPTKVTLGLCNESQPWSYTPENTLLTL 420
Query: 298 KGAYFCLQAKHVGKPAKLG-IICTDCG-STWEIISDSKMHLSSKADNGTTVCLDVDSSNT 355
K CL+ P KL C+ S WE IS S M L++K+ N ++CLDVD SN
Sbjct: 421 KDKSLCLETTGPNAPVKLSEASCSSPNLSKWETISASNMLLAAKSTN-NSLCLDVDESNN 479
Query: 356 IVTNTCKCLS-RDKTCDPASQWFKLV 380
++ + CKC+ D +CDP SQWFK+V
Sbjct: 480 LIASNCKCVKGEDSSCDPISQWFKIV 505
>gi|302143023|emb|CBI20318.3| unnamed protein product [Vitis vinifera]
Length = 364
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 231/387 (59%), Positives = 272/387 (70%), Gaps = 54/387 (13%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M+ILD+H SKP + N FGDQYFNPDLW+KGLT+MAT+F+GV NVVGMSLRNEL
Sbjct: 23 MIILDSHFSKPSF-----HSNDIFGDQYFNPDLWVKGLTRMATVFSGVPNVVGMSLRNEL 77
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
R PKQNVKDWYRYMQ GAEAVH+ANP+VLVI+SGL+ DLSF+ NQ
Sbjct: 78 RCPKQNVKDWYRYMQKGAEAVHSANPDVLVIISGLSDGTDLSFLLNQ------------- 124
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL-FVSEFGADLRGNNVNDN 179
QA NPN+VCGRVVD++MR G LL+QGWPL FVSE G D DN
Sbjct: 125 ---------QAGETSNPNKVCGRVVDSIMRRGGVLLQQGWPLMFVSELGVD-------DN 168
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
R LNCFFG+AAELD+DWALWT+ E GL +WN SS +RIS+
Sbjct: 169 RNLNCFFGLAAELDFDWALWTV-------------EETNGLMNWN------SSVFQRISA 209
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKG 299
LQSP +GP V HK+I HP TGLC+ +S+ +PL LGPCT+S+AW YTP K + +KG
Sbjct: 210 LQSPLQGPDVSRVRPHKIILHPPTGLCILWESWTEPLKLGPCTKSDAWGYTPQKLLIVKG 269
Query: 300 AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTN 359
+YFCLQA +GKPAKL IICT GS W+IISDSKM+LS+K + TTVCLDVDSS+TIVT+
Sbjct: 270 SYFCLQAVELGKPAKLSIICTKPGSNWDIISDSKMYLSTKLGDSTTVCLDVDSSSTIVTD 329
Query: 360 TCKCLSRDKTCDPASQWFKLVDSTRSS 386
CKCL RD TCDP SQWFK+VDST S
Sbjct: 330 ACKCLGRDDTCDPGSQWFKVVDSTNIS 356
>gi|15237972|ref|NP_197252.1| glycosyl hydrolase family 5 protein [Arabidopsis thaliana]
gi|9755774|emb|CAC01894.1| putative protein [Arabidopsis thaliana]
gi|332005051|gb|AED92434.1| glycosyl hydrolase family 5 protein [Arabidopsis thaliana]
Length = 526
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 215/384 (55%), Positives = 263/384 (68%), Gaps = 10/384 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH + PGWCCSN D + FFGD FNPDLW+ GL KMATIF V+NVVGMSLRNEL
Sbjct: 147 MVILDNHKTVPGWCCSNDDPDAFFGDPKFNPDLWMLGLKKMATIFMNVKNVVGMSLRNEL 206
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG KDWY+YMQ GAEAVH +NP VLVILSGLNFD DLSF++++ VNL+F KLV E
Sbjct: 207 RGYNHTSKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLSFLKDRPVNLSFKKKLVLE 266
Query: 121 AHWYGFTDGQA-WVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDN 179
HWY FTDG W N N C ++ R GF+L+QG+PLF+SEFG D RG ++ N
Sbjct: 267 LHWYSFTDGTGQWKSHNVNDFCSQMFSKERRTGGFVLDQGFPLFLSEFGTDQRGGDLEGN 326
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
RY+NC AAE D DWA+W + G YY REG G+ E YG+ D NW ++ N ++L R+S
Sbjct: 327 RYMNCMLAWAAEKDLDWAVWAVTGVYYFREGKRGVVEAYGMLDANWHNVHNYTYLRRLSV 386
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFL--DPLTLGPCTESEAWSYTPHKTISL 297
+Q P GPGV + HK I+HP TGLC+ RKS LTLGPCT+ E WSY+ + +
Sbjct: 387 IQPPHTGPGV-KHNHHKKIFHPLTGLCLVRKSHCHESELTLGPCTKDEPWSYSHGGILEI 445
Query: 298 -KGAYFCLQAK-HVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNT 355
+G CL+ + VGK KLG ICT E IS +KMHLS +G+ VCLDVDS N
Sbjct: 446 RRGHKSCLEGETAVGKSVKLGRICTKI----EQISATKMHLSFNTSDGSLVCLDVDSDNN 501
Query: 356 IVTNTCKCLSRDKTCDPASQWFKL 379
+V N+C CL+ D TC+PASQWFK+
Sbjct: 502 VVANSCNCLTGDTTCEPASQWFKI 525
>gi|297807773|ref|XP_002871770.1| glycosyl hydrolase family 5 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297317607|gb|EFH48029.1| glycosyl hydrolase family 5 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 527
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 214/384 (55%), Positives = 265/384 (69%), Gaps = 10/384 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH + PGWCCSN+D + FFGD FNPDLW+ GL KMATIF V NVVGMSLRNEL
Sbjct: 148 MVILDNHKTVPGWCCSNNDPDAFFGDPKFNPDLWMLGLKKMATIFMNVNNVVGMSLRNEL 207
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG KDWY+YMQ GAEAVH +NP VLVILSGLNFD DLSF++++ VNL+F KLV E
Sbjct: 208 RGYNHTAKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLSFLQDRPVNLSFKKKLVLE 267
Query: 121 AHWYGFTDGQA-WVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDN 179
HWY FTDG W N N C ++ R GF+LEQG+PLF+SEFG D RG + N
Sbjct: 268 LHWYSFTDGTGQWKSHNVNDFCSQMFAKEHRTGGFVLEQGFPLFLSEFGTDQRGGDFEGN 327
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
RY++C AAE D DWA+W L G YY REG G+ E YG+ D NW ++ N ++L R+S
Sbjct: 328 RYMSCMLAWAAEKDIDWAVWALTGVYYFREGKRGVLEAYGMLDANWHNVHNYTYLRRLSV 387
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKS--FLDPLTLGPCTESEAWSYTPHKTISL 297
+Q P +GPG+ + HK I+HP TGLC+ RKS + L LGPCT+ E WSY+ + +
Sbjct: 388 IQPPHKGPGI-KHNHHKKIFHPLTGLCLVRKSSCYESELMLGPCTKGEPWSYSHGGILEI 446
Query: 298 KGAY-FCLQAK-HVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNT 355
KG + CL+ + VG+ KLG CT IS +KMHLS K ++G+ VCLDVDS N
Sbjct: 447 KGGHKSCLEGETAVGRSVKLGKKCTKIMQ----ISATKMHLSLKTNDGSLVCLDVDSDNN 502
Query: 356 IVTNTCKCLSRDKTCDPASQWFKL 379
+V N+CKCL+ D TC+PASQWFK+
Sbjct: 503 VVANSCKCLTGDITCEPASQWFKI 526
>gi|413922902|gb|AFW62834.1| hypothetical protein ZEAMMB73_750687 [Zea mays]
Length = 548
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/406 (53%), Positives = 271/406 (66%), Gaps = 22/406 (5%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S PGWCCSN DGNGFFGD+ F+P++W+ GL MATIF V NVVGMSLRNEL
Sbjct: 133 MVILDNHVSTPGWCCSNDDGNGFFGDRDFDPNVWVDGLGSMATIFADVPNVVGMSLRNEL 192
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTG--KLV 118
RGP+QN +DWY YMQ GAEAVHAA+P LVI+ GL++D DLSF+ + V ++F KLV
Sbjct: 193 RGPRQNPQDWYTYMQRGAEAVHAASPRALVIMGGLSYDYDLSFLAARQVGVSFAAENKLV 252
Query: 119 FEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVND 178
FE HWY F+D +AW + N+VCGR + R GFLL +G+PLF+SEFGAD RG + D
Sbjct: 253 FEVHWYSFSDARAWETESANEVCGRAARDFARRGGFLLARGFPLFLSEFGADSRGGDPKD 312
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NR+ C VAAE D DWA W L GSY LR+GV G++E YG+ DW+W RN + L RI
Sbjct: 313 NRFFPCAAAVAAEHDVDWAYWALQGSYALRQGVAGMDEVYGVLDWSWSKPRNETVLPRIQ 372
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLCVQRK--SFLDPLTLGPCTESEAWSYT-PHKTI 295
+LQ P +GPG E + V++HP TGLC R+ + L LGPC E++AW+Y P T+
Sbjct: 373 ALQRPLQGPGYGEALPYTVLFHPLTGLCAVRRAAAVTTTLELGPCNETDAWAYAPPSSTL 432
Query: 296 SLKGAYF----CLQAKHVGKPAKLGII-CTDCGSTWEIISDSKMHLSSKA---------- 340
L+ A CL+A+ G+PA+LGI C D STW + +DS MH++ A
Sbjct: 433 VLRDAAAAGLPCLRAEGPGQPARLGIRDCGDPMSTWRLATDSGMHVAVDAALGLGGGEDG 492
Query: 341 -DNGTTVCLDVDS-SNTIVTNTCKCLSRDKTCDPASQWFKLVDSTR 384
G +CLDV + +IVTN C C D TCDP QWFKLV STR
Sbjct: 493 GGGGGMLCLDVGTDGRSIVTNPCACQRGDGTCDPEGQWFKLVTSTR 538
>gi|242062208|ref|XP_002452393.1| hypothetical protein SORBIDRAFT_04g024910 [Sorghum bicolor]
gi|241932224|gb|EES05369.1| hypothetical protein SORBIDRAFT_04g024910 [Sorghum bicolor]
Length = 570
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 213/411 (51%), Positives = 263/411 (63%), Gaps = 27/411 (6%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHIS PGWCCSN DGNGFFGD+ F+P++W+ GL MAT F V NVVGMSLRNEL
Sbjct: 150 MVILDNHISTPGWCCSNDDGNGFFGDRDFDPNVWVDGLGSMATTFADVPNVVGMSLRNEL 209
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTG--KLV 118
RGP+QN +DWY YMQ GAEAVHAANP LVI+ GL++D DLSF+ + V ++F KLV
Sbjct: 210 RGPRQNSEDWYTYMQRGAEAVHAANPRALVIMGGLSYDYDLSFLAARQVGVSFAAENKLV 269
Query: 119 FEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVND 178
FE HWY F+D +AW N+VCGR + R FLL G+PLF+SEFGAD RG + D
Sbjct: 270 FEVHWYSFSDARAWEAEGANEVCGRAARDFARRGAFLLAGGFPLFLSEFGADSRGGDRKD 329
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NRY C VAAE D DWA W L GSY LR+GV+G++E YG+ DW+W RN + L RI
Sbjct: 330 NRYFPCAAAVAAEHDVDWAYWALQGSYALRQGVVGMDEVYGVLDWSWSKPRNQTVLPRIQ 389
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLC---VQRKSFLDPLTLGPCTESEAWSYT-PHKT 294
+LQ P +GPG E + V++HP TGLC + L LGPC E++AW+Y P T
Sbjct: 390 ALQRPLQGPGYGEALPYTVLFHPLTGLCAVRRASAAAATTLELGPCNETDAWAYAPPSST 449
Query: 295 ISLK-------GAYFCLQAKHVGKPAKLGI-ICTDCGSTWEIISDSKMHLSSKA------ 340
+ L+ G CL+A+ G+PA+LG C D STW + +DS MH++ A
Sbjct: 450 LVLRDAAAAAGGLPTCLRAEGRGQPARLGTNACGDPLSTWRLATDSAMHVAVDAAALGLG 509
Query: 341 ------DNGTTVCLDVDS-SNTIVTNTCKCLSRDKTCDPASQWFKLVDSTR 384
+CLDV + +IVTN C C D CDP QWFKLV STR
Sbjct: 510 GGEDGGGGSGMLCLDVGTDGRSIVTNPCACQRGDGACDPEGQWFKLVTSTR 560
>gi|224028863|gb|ACN33507.1| unknown [Zea mays]
Length = 369
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 207/357 (57%), Positives = 251/357 (70%), Gaps = 3/357 (0%)
Query: 41 MATIFNGVRNVVGMSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKD 100
MAT+FN + VVGMSLRNELRGPKQNV WYRYMQLGAEAVHAANP VLVILSGL+FD
Sbjct: 1 MATMFNNTKYVVGMSLRNELRGPKQNVSLWYRYMQLGAEAVHAANPNVLVILSGLDFDNT 60
Query: 101 LSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGW 160
LSF+ + V L+F+GKLV+E HWYGF+DG W N N VCG VVD + FLL+ GW
Sbjct: 61 LSFLFKEKVRLSFSGKLVYEQHWYGFSDGGNWETQNQNDVCGMVVDFIWAKGLFLLQHGW 120
Query: 161 PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGL 220
PLF SEFG D+ G ++ DNRYL CF VAAE+D DWA+W L GSYY+REG++ +E YGL
Sbjct: 121 PLFFSEFGFDMSGTHIGDNRYLTCFLSVAAEMDLDWAIWALQGSYYIREGILAYDESYGL 180
Query: 221 FDWNWCDIRNSSFLERISSLQSPFRGPGVFETGL-HKVIYHPATGLCVQRKSFLDPLTLG 279
W+WC RN SF++RI+SLQS F+GPG+ T + +I+HP +GLCV +S L LG
Sbjct: 181 LTWDWCTARNPSFIKRINSLQSAFQGPGLPNTQQPYYIIFHPQSGLCVLARSS-KLLELG 239
Query: 280 PCTESEAWSYTPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK 339
PC ES AW+YT + +K CLQA+ VGK AKLG C+ S W +ISDSKMH+S++
Sbjct: 240 PCDESNAWNYTSAYELVVKSTGQCLQAQSVGKNAKLGTDCSKPSSKWHLISDSKMHVSAE 299
Query: 340 -ADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
A NGT VCL+ I TN CKCLS D TCDP SQWFK++ S+R S Q
Sbjct: 300 LAKNGTRVCLEASPDGAIRTNQCKCLSVDPTCDPESQWFKVILSSRDIPGGDSILQL 356
>gi|148908776|gb|ABR17494.1| unknown [Picea sitchensis]
Length = 543
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 195/401 (48%), Positives = 257/401 (64%), Gaps = 14/401 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHIS+P WCC +DGNGFFGD F+P +WI+GL+ MAT F G VVGMSLRNEL
Sbjct: 145 MVILDNHISRPQWCCGYNDGNGFFGDADFDPGVWIQGLSAMATAFKGTAAVVGMSLRNEL 204
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG + N DWY+YMQ GA+AVH ANP+VLVI+SGLN+ DL F+ ++ V+L FT K+V+E
Sbjct: 205 RGSRANAADWYKYMQQGAQAVHDANPDVLVIMSGLNYAADLKFLASKPVSLEFTNKIVYE 264
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFL---LEQGWPLFVSEFGADLRGNNVN 177
HWY FTDG+AW + N++CG V V + F+ L PLF+SEFG D RG+NV
Sbjct: 265 MHWYSFTDGKAWENMPANKLCGTVTARVNDHTAFVAKTLSPPAPLFISEFGIDERGSNVG 324
Query: 178 DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERI 237
DNR++NCF AAE D+DWALWTL GSYYLR G G E YG+F+ W D+R+ F+ R+
Sbjct: 325 DNRFINCFLAFAAEGDFDWALWTLQGSYYLRNGQPGFEETYGIFNGRWDDLRDPPFVSRL 384
Query: 238 SSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSY-TPHKTIS 296
SLQ PF+ L+K +YHPATGLC+ S L L C + Y +P +
Sbjct: 385 KSLQKPFQESFSSSEPLYKTLYHPATGLCLA-TSGEGGLKLDSCDSPTLFEYKSPQGAFT 443
Query: 297 LKG-AYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLS-SKADNGTT--VCLDVDS 352
L + C+ AK G A+L C S W+ +S S + +S + + NGT+ +CLD S
Sbjct: 444 LSDESSTCIAAKGPGLAAELSTQCAASNSKWQRVSSSNLQVSVTLSVNGTSQMLCLDGKS 503
Query: 353 SNTIVTNTCKCLSRDKTC----DPASQWFKLVDSTRSSTTT 389
S ++TN C CL +D C +P QWFKL+ + R +++
Sbjct: 504 SPQVLTNDCICL-KDSDCRKDENPEMQWFKLITTNRKGSSS 543
>gi|125582736|gb|EAZ23667.1| hypothetical protein OsJ_07369 [Oryza sativa Japonica Group]
Length = 583
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 167/274 (60%), Positives = 204/274 (74%), Gaps = 4/274 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S+PGWCC++ DGNGFFGD++F+PD W++GL MA +F GV NVVGMSLRNEL
Sbjct: 147 MVILDNHVSRPGWCCADDDGNGFFGDRHFDPDAWVRGLGAMAALFAGVPNVVGMSLRNEL 206
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFT----GK 116
RGP+QN DWYRYMQ+GAEAVHAANP LVI+ GL +D DLSF+ + V+++F GK
Sbjct: 207 RGPRQNADDWYRYMQMGAEAVHAANPAALVIMGGLGYDTDLSFLAARPVDVSFAAAERGK 266
Query: 117 LVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNV 176
LVFE HWY F D +AW + N+VCGRV V R GFLL+ G+PLF+SEFGAD RG +
Sbjct: 267 LVFELHWYSFADARAWESEDANEVCGRVARGVARRGGFLLDAGFPLFLSEFGADTRGGSR 326
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
D+RYL C VAAELD DWALW L GSY LR+GV G +E YG+ DW+W RN++ L R
Sbjct: 327 KDDRYLPCAAAVAAELDLDWALWALQGSYALRQGVAGADEVYGVLDWSWSKPRNATALSR 386
Query: 237 ISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRK 270
I SLQ P RGPG E + V++HP TG CV R+
Sbjct: 387 IQSLQRPLRGPGYDEARPYTVLFHPLTGRCVVRR 420
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 338 SKADNGTTVCLDV-DSSNTIVTNTCKCL--SRDKTCDPASQWFKLVDSTRS 385
S+ G +CLDV D ++VTN C+CL + CDP +QWFKLV STRS
Sbjct: 515 SRDGGGGLLCLDVGDDGRSVVTNPCRCLDDAAAGECDPETQWFKLVTSTRS 565
>gi|225426108|ref|XP_002272386.1| PREDICTED: uncharacterized protein LOC100250178 [Vitis vinifera]
Length = 532
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 179/390 (45%), Positives = 246/390 (63%), Gaps = 7/390 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M++LDNH+SKP WCC++ DGNGFFGD +F+P WI+GLT++A+ F G VV MS+RNEL
Sbjct: 145 MLVLDNHVSKPMWCCASEDGNGFFGDMFFDPKEWIEGLTQVASRFKGKPQVVAMSMRNEL 204
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN+ DWY M GA+A+H+ NP+VLV++SGLNFD DLSF+ LT K+V+E
Sbjct: 205 RGPRQNLPDWYTNMTEGAKAIHSTNPDVLVLVSGLNFDLDLSFLNTTPFGLTLDNKVVYE 264
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNVND 178
AHWY F Q W N+VCG+ D R + FL+ + PL +SEFG D+RG N +D
Sbjct: 265 AHWYSFDFTQQWQTQPLNRVCGQCADEFQREAAFLITGDNAAPLILSEFGVDVRGVNQDD 324
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NRY NC A+ D DWALWTL GSYY REG G E Y + D+NW R+ +LER++
Sbjct: 325 NRYFNCLLPTVADKDLDWALWTLQGSYYYREGKAGPEEVYSVLDYNWDKPRDPKYLERLT 384
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKT-ISL 297
LQ + P + V+ HP +G CV + D + C E WS++ + I L
Sbjct: 385 ILQQTIQDPNSIALSYY-VLVHPESGFCVNVEG-QDNVHGSSCRERSKWSHSGDGSPIQL 442
Query: 298 KGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSN--T 355
G+ CL+A G P L C +TW+++SDS +H+++ + G ++CL+ SSN +
Sbjct: 443 VGSELCLKAVGDGVPVALSTDCKSPWATWKLVSDSMLHIAAMDEQGNSLCLESTSSNYSS 502
Query: 356 IVTNTCKCLSRDKTCDPASQWFKLVDSTRS 385
I+T +C CL+ + CDP SQWF+LV S S
Sbjct: 503 ILTRSCACLNNETNCDPQSQWFRLVPSNLS 532
>gi|297742258|emb|CBI34407.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 361 bits (927), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 179/390 (45%), Positives = 240/390 (61%), Gaps = 7/390 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M++LDNH+SKP WCC+ DGNGFFGD +F+P WI+GLT++A F G VV MS+RNE+
Sbjct: 90 MLVLDNHVSKPMWCCAREDGNGFFGDMFFDPKEWIEGLTQVAKRFKGKPQVVAMSMRNEI 149
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN+ DWY+ M+ GA+A+H+ NP+VLV++SGLNFDKDLSF+ LT K+V+E
Sbjct: 150 RGPRQNLPDWYKNMREGAKAIHSTNPDVLVLVSGLNFDKDLSFLSTTPFGLTLDKKVVYE 209
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNVND 178
AHWY F Q W N+VC + D R + FL+ + PL +SEFG DLRG N D
Sbjct: 210 AHWYSFDFTQQWQTQPLNRVCRQRADEFQREAAFLITGDNAAPLIISEFGVDLRGVNQAD 269
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NRY NC AE D DWALWTL SYY REG G E Y + D+NW R+ +LER++
Sbjct: 270 NRYFNCLLPTVAEKDLDWALWTLQASYYYREGKAGPEEVYSVLDYNWDKPRDPKYLERLT 329
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHK-TISL 297
LQ + P T + +I H +G CV + D + C E WS+ I L
Sbjct: 330 ILQQTIQDPNS-TTLSYYLIVHTESGFCVNVEG-EDNVHGSSCRERSKWSHGGDGWPIRL 387
Query: 298 KGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSN--T 355
G CL+A G P L C +TW+++SDS++H+++ + G ++CL+ SSN +
Sbjct: 388 VGGELCLKAVGDGVPVTLSTDCKSPRATWKLVSDSRLHVAAMDEQGNSLCLEATSSNYSS 447
Query: 356 IVTNTCKCLSRDKTCDPASQWFKLVDSTRS 385
I+T C C+ + CDP SQWFKLV S S
Sbjct: 448 ILTRRCACVKNEANCDPQSQWFKLVPSNLS 477
>gi|359474310|ref|XP_002272422.2| PREDICTED: uncharacterized protein LOC100245045 [Vitis vinifera]
Length = 530
Score = 361 bits (926), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 179/390 (45%), Positives = 240/390 (61%), Gaps = 7/390 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M++LDNH+SKP WCC+ DGNGFFGD +F+P WI+GLT++A F G VV MS+RNE+
Sbjct: 143 MLVLDNHVSKPMWCCAREDGNGFFGDMFFDPKEWIEGLTQVAKRFKGKPQVVAMSMRNEI 202
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP+QN+ DWY+ M+ GA+A+H+ NP+VLV++SGLNFDKDLSF+ LT K+V+E
Sbjct: 203 RGPRQNLPDWYKNMREGAKAIHSTNPDVLVLVSGLNFDKDLSFLSTTPFGLTLDKKVVYE 262
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNVND 178
AHWY F Q W N+VC + D R + FL+ + PL +SEFG DLRG N D
Sbjct: 263 AHWYSFDFTQQWQTQPLNRVCRQRADEFQREAAFLITGDNAAPLIISEFGVDLRGVNQAD 322
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NRY NC AE D DWALWTL SYY REG G E Y + D+NW R+ +LER++
Sbjct: 323 NRYFNCLLPTVAEKDLDWALWTLQASYYYREGKAGPEEVYSVLDYNWDKPRDPKYLERLT 382
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHK-TISL 297
LQ + P T + +I H +G CV + D + C E WS+ I L
Sbjct: 383 ILQQTIQDPNS-TTLSYYLIVHTESGFCVNVEG-EDNVHGSSCRERSKWSHGGDGWPIRL 440
Query: 298 KGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSN--T 355
G CL+A G P L C +TW+++SDS++H+++ + G ++CL+ SSN +
Sbjct: 441 VGGELCLKAVGDGVPVTLSTDCKSPRATWKLVSDSRLHVAAMDEQGNSLCLEATSSNYSS 500
Query: 356 IVTNTCKCLSRDKTCDPASQWFKLVDSTRS 385
I+T C C+ + CDP SQWFKLV S S
Sbjct: 501 ILTRRCACVKNEANCDPQSQWFKLVPSNLS 530
>gi|297742256|emb|CBI34405.3| unnamed protein product [Vitis vinifera]
Length = 970
Score = 342 bits (876), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 176/391 (45%), Positives = 234/391 (59%), Gaps = 11/391 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M+ILDNH+SKP WCC DGNGFFGD++F+P+ WI+GL+++A F +V+GMS+RNEL
Sbjct: 579 MLILDNHVSKPMWCCEREDGNGFFGDKFFDPEEWIEGLSQVANRFKDESHVIGMSMRNEL 638
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG + N KDWY+YM+ GA+A+H+ NP +LV++SGLNFD DLSF++ + LT K+VFE
Sbjct: 639 RGQRSNQKDWYKYMREGAKAIHSINPNLLVVVSGLNFDNDLSFLKKKPFRLTLNNKVVFE 698
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFGADLRGNNVND 178
AHWY F + W D P Q C R +GFL E PLF+SEFG DLR NV D
Sbjct: 699 AHWYSFDATEQWNDKPPEQECKDRAREFYRDAGFLTEGDNPAPLFISEFGMDLRMGNVVD 758
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NRY NC+ A D DWALWTL SYY R+G + L E Y + D+NW RN FLE+++
Sbjct: 759 NRYFNCYLPTVAAKDLDWALWTLQASYYYRQGSLFLEEVYSVLDFNWEKPRNPQFLEKLT 818
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHK-TISL 297
LQ + P E + VI H +GLCV G C W+++ I L
Sbjct: 819 ILQEMLQDPNS-EASPYYVIVHAKSGLCVNVDGEKKVHGSG-CRHRTKWTHSGDGWPIEL 876
Query: 298 KGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK--ADNGTTVCLDVDSSN- 354
G+ CL+A G+P K+ C S W +S K+HL+ + D+ ++CLD SS+
Sbjct: 877 MGSNLCLKAVGDGRPVKVSTDCRSQQSRWRRVSKEKLHLAVRDEHDHTESLCLDASSSDP 936
Query: 355 -TIVTNTCKCLSRDKTC--DPASQWFKLVDS 382
+IVT C C D C + S WF+LV S
Sbjct: 937 SSIVTRKCACKEGDLNCHHEEQSNWFQLVPS 967
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 126/250 (50%), Positives = 165/250 (66%), Gaps = 2/250 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M++LDNH+SKP WCC+ DGNGFFGD YF+P WIKGLT ++T F VV +S+RNE+
Sbjct: 146 MLVLDNHVSKPLWCCAREDGNGFFGDMYFDPKEWIKGLTDVSTRFKNNPQVVALSMRNEM 205
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG + NV DWYRYM+ GA+A+H ANP+VLVI+SGLNFDKDLSF+ ++ T K+VFE
Sbjct: 206 RGARSNVPDWYRYMRKGAKAIHKANPKVLVIVSGLNFDKDLSFLGRKSFGFTLNNKVVFE 265
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNVND 178
AHWY F Q W PN+ C + D R + FL ++ PLF+SE+G +L+ + D
Sbjct: 266 AHWYTFDFTQQWQQLPPNRACRQRADEFQRDAAFLTTGDKAAPLFISEYGINLQETSQVD 325
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
+RY CF AE D DWALWTL SY+ R+G G E Y + D++W R FLER+
Sbjct: 326 SRYFTCFLPTVAEKDLDWALWTLQASYHYRQGHAGGGESYSVLDYSWSKPRYPQFLERMV 385
Query: 239 SLQSPFRGPG 248
LQ+ + G
Sbjct: 386 ILQNLLQESG 395
>gi|225426114|ref|XP_002272491.1| PREDICTED: major extracellular endoglucanase [Vitis vinifera]
Length = 541
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 176/391 (45%), Positives = 234/391 (59%), Gaps = 11/391 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M+ILDNH+SKP WCC DGNGFFGD++F+P+ WI+GL+++A F +V+GMS+RNEL
Sbjct: 150 MLILDNHVSKPMWCCEREDGNGFFGDKFFDPEEWIEGLSQVANRFKDESHVIGMSMRNEL 209
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG + N KDWY+YM+ GA+A+H+ NP +LV++SGLNFD DLSF++ + LT K+VFE
Sbjct: 210 RGQRSNQKDWYKYMREGAKAIHSINPNLLVVVSGLNFDNDLSFLKKKPFRLTLNNKVVFE 269
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFGADLRGNNVND 178
AHWY F + W D P Q C R +GFL E PLF+SEFG DLR NV D
Sbjct: 270 AHWYSFDATEQWNDKPPEQECKDRAREFYRDAGFLTEGDNPAPLFISEFGMDLRMGNVVD 329
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NRY NC+ A D DWALWTL SYY R+G + L E Y + D+NW RN FLE+++
Sbjct: 330 NRYFNCYLPTVAAKDLDWALWTLQASYYYRQGSLFLEEVYSVLDFNWEKPRNPQFLEKLT 389
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHK-TISL 297
LQ + P E + VI H +GLCV G C W+++ I L
Sbjct: 390 ILQEMLQDPNS-EASPYYVIVHAKSGLCVNVDGEKKVHGSG-CRHRTKWTHSGDGWPIEL 447
Query: 298 KGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSK--ADNGTTVCLDVDSSN- 354
G+ CL+A G+P K+ C S W +S K+HL+ + D+ ++CLD SS+
Sbjct: 448 MGSNLCLKAVGDGRPVKVSTDCRSQQSRWRRVSKEKLHLAVRDEHDHTESLCLDASSSDP 507
Query: 355 -TIVTNTCKCLSRDKTC--DPASQWFKLVDS 382
+IVT C C D C + S WF+LV S
Sbjct: 508 SSIVTRKCACKEGDLNCHHEEQSNWFQLVPS 538
>gi|225426112|ref|XP_002272454.1| PREDICTED: uncharacterized protein LOC100267356 [Vitis vinifera]
Length = 521
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 169/391 (43%), Positives = 237/391 (60%), Gaps = 11/391 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M++LDNH+SKP WCC+ DGNGFFGD YF+P WIKGLT ++T F VV +S+RNE+
Sbjct: 130 MLVLDNHVSKPLWCCAREDGNGFFGDMYFDPKEWIKGLTDVSTRFKNNPQVVALSMRNEM 189
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG + NV DWYRYM+ GA+A+H ANP+VLVI+SGLNFDKDLSF+ ++ T K+VFE
Sbjct: 190 RGARSNVPDWYRYMRKGAKAIHKANPKVLVIVSGLNFDKDLSFLGRKSFGFTLNNKVVFE 249
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNVND 178
AHWY F Q W PN+ C + D R + FL ++ PLF+SE+G +L+ + D
Sbjct: 250 AHWYTFDFTQQWQQLPPNRACRQRADEFQRDAAFLTTGDKAAPLFISEYGINLQETSQVD 309
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
+RY CF AE D DWALWTL SY+ R+G G E Y + D++W R FLER+
Sbjct: 310 SRYFTCFLPTVAEKDLDWALWTLQASYHYRQGHAGGGESYSVLDYSWSKPRYPQFLERMV 369
Query: 239 SLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPC---TESEAWSYTPHKT- 294
LQ+ + P + ++ H +G CV K D +++ C + + W++ ++
Sbjct: 370 ILQNLLQDPNS-NVSPYYLLVHAQSGFCVNVKHN-DVVSVSSCRKNSRNSRWNHEGDRSP 427
Query: 295 ISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDV---D 351
I G CL+A G P L C+ +TW+++SDS + +++ + G +CLD
Sbjct: 428 IKKLGRKHCLKAVGDGVPLTLSDDCSSPRATWQLVSDSMLQIAAMDEQGNPLCLDATPSS 487
Query: 352 SSNTIVTNTCKCLSRDKTCDPASQWFKLVDS 382
+S I+T C CL+ + CDP SQWFKLV S
Sbjct: 488 NSTPILTRACACLNNEAGCDPLSQWFKLVPS 518
>gi|357475987|ref|XP_003608279.1| Endoglucanase [Medicago truncatula]
gi|87240496|gb|ABD32354.1| Glycoside hydrolase, family 5; Ricin B-related lectin [Medicago
truncatula]
gi|355509334|gb|AES90476.1| Endoglucanase [Medicago truncatula]
Length = 536
Score = 330 bits (846), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 164/393 (41%), Positives = 242/393 (61%), Gaps = 9/393 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV++DNH+S WCC N+D NGFFGD++F+PD W++GL +A F G NV+ M LRNEL
Sbjct: 145 MVLIDNHVSMAEWCCDNNDQNGFFGDRHFHPDEWLQGLAFIAKHFRGKPNVIAMDLRNEL 204
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG +QN+ DWY+Y+ GA +H NP++L+++SGLNFD DLSF++ + ++L FT KLV+E
Sbjct: 205 RGGRQNLPDWYKYVSQGASTIHKHNPDLLIVISGLNFDNDLSFLKKKTLDLNFTNKLVYE 264
Query: 121 AHWYGFTDGQAWVDGNP-NQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFGADLRGNNVN 177
AH Y F+ Q + P N VC V++N+ +GFL+ PLF+SEFG D+ G N
Sbjct: 265 AHIYSFSGNQDRWNLQPMNWVCSSVIENLNDQAGFLISGNNPVPLFISEFGYDMTGGNAV 324
Query: 178 DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERI 237
DN+++ CF AA +D DW+LW+ GSYY REG +G E Y + D++W + R+ F ++
Sbjct: 325 DNKFMPCFVSYAASVDLDWSLWSFGGSYYFREGSVGAGETYAVMDYDWKNYRDPKFPQKF 384
Query: 238 SSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKT-IS 296
LQ + P + H +++HP TG C + L LG C WS+ + I
Sbjct: 385 QLLQKKIQDPTSNLSKSH-IMFHPLTGKCAHANGSNNELVLGDCKSHSEWSFEGDGSPIR 443
Query: 297 LKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDS-SNT 355
L + CL+A+ G PA L C S+W+ +S + +HL++ NG CL+++S S+
Sbjct: 444 LMDSAMCLKAEGEGLPATLSEHCLSPQSSWKSVSKTGLHLATSHGNGPLFCLEMESDSSK 503
Query: 356 IVTNTCKCLSR-DKTC--DPASQWFKLVDSTRS 385
IVT C C+ D +C +P SQWF+L+ + S
Sbjct: 504 IVTRKCICIDENDSSCLDNPQSQWFQLISTNFS 536
>gi|297807695|ref|XP_002871731.1| glycosyl hydrolase family 5 protein [Arabidopsis lyrata subsp.
lyrata]
gi|297317568|gb|EFH47990.1| glycosyl hydrolase family 5 protein [Arabidopsis lyrata subsp.
lyrata]
Length = 488
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 175/385 (45%), Positives = 237/385 (61%), Gaps = 46/385 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++ PGWCC +D + FFG F+P +W KGL KMAT+F NV+GMSLRNE
Sbjct: 145 MVILDNHLTTPGWCCGENDLDAFFGYPKFDPTVWAKGLGKMATLFRNFTNVIGMSLRNEP 204
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG + W+R+M GAE VHAANP++LVILSG++FD +LSF+R+++VN++FT KLVFE
Sbjct: 205 RGARDYPDLWFRHMPQGAEVVHAANPKLLVILSGIDFDTNLSFLRDRSVNVSFTNKLVFE 264
Query: 121 AHWYGFTDGQ-AWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDN 179
HWY F+DG+ +W N N C ++++ V GFL+ +G+PL +SEFG D RG +++ N
Sbjct: 265 LHWYSFSDGRDSWRKHNSNDFCVKIIEKVTHNGGFLIGRGFPLILSEFGTDQRGGDISGN 324
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
RY+NC AAE D DWA+W L G YYLR
Sbjct: 325 RYMNCLVAWAAENDLDWAVWALTGDYYLRT------------------------------ 354
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSF--LDPLTLGPCTESEAWSYTPHKTISL 297
GPG+ + +++HP+TGLCV L L LGPC +S+ W++ P + I L
Sbjct: 355 ------GPGLRPN--NNILFHPSTGLCVTNNPSDNLPTLGLGPCPKSDPWTFNPQEGI-L 405
Query: 298 KGAYFCLQAKHV-GKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDS-SNT 355
C++A +V G+ KLG T C + IS +KMHLS K NG +CLD D N+
Sbjct: 406 WVNKMCVEAPNVAGQKVKLG-FGTKCSKLGQ-ISATKMHLSFKTSNGLLLCLDADERDNS 463
Query: 356 IVTNTCKCLSRDKTCDPASQWFKLV 380
+V N CKCL+ D +CDPASQWFK++
Sbjct: 464 VVANPCKCLTMDASCDPASQWFKVL 488
>gi|15237397|ref|NP_197172.1| glycosyl hydrolase family 5 protein [Arabidopsis thaliana]
gi|9755722|emb|CAC01834.1| putative protein [Arabidopsis thaliana]
gi|332004944|gb|AED92327.1| glycosyl hydrolase family 5 protein [Arabidopsis thaliana]
Length = 488
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 175/385 (45%), Positives = 240/385 (62%), Gaps = 46/385 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++ PGWCC ++D + FFG +F+P +W KGL KMAT+F +V+GMSLRNE
Sbjct: 145 MVILDNHLTTPGWCCGDNDLDAFFGYPHFDPLVWAKGLRKMATLFRNFTHVIGMSLRNEP 204
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG + W+R+M GAEAVHAANP++LVILSG++FD +LSF+R+++VN++FT KLVFE
Sbjct: 205 RGARDYPDLWFRHMPQGAEAVHAANPKLLVILSGIDFDTNLSFLRDRSVNVSFTDKLVFE 264
Query: 121 AHWYGFTDGQ-AWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDN 179
HWY F+DG+ +W N N C ++++ V GFLL +G+PL +SEFG D RG +++ N
Sbjct: 265 LHWYSFSDGRDSWRKHNSNDFCVKIIEKVTHNGGFLLGRGFPLILSEFGTDQRGGDMSGN 324
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
RY+NC AAE D DWA+W L G YYLR
Sbjct: 325 RYMNCLVAWAAENDLDWAVWALTGDYYLRT------------------------------ 354
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSF--LDPLTLGPCTESEAWSYTPHKTISL 297
GPG+ +++HP+TGLCV + L LGPC +S+ W++ P + I L
Sbjct: 355 ------GPGLRPN--KNLLFHPSTGLCVTNNPSDNIPTLRLGPCPKSDPWTFNPSEGI-L 405
Query: 298 KGAYFCLQAKH-VGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDS-SNT 355
C++A + VG+ KLG + T C + IS +KMHLS K NG +CLDVD N+
Sbjct: 406 WINKMCVEAPNVVGQKVKLG-VGTKCSKLGQ-ISATKMHLSFKTSNGLLLCLDVDERDNS 463
Query: 356 IVTNTCKCLSRDKTCDPASQWFKLV 380
+V N CK L+ D +CDPASQWFK++
Sbjct: 464 VVANRCKFLTMDASCDPASQWFKVL 488
>gi|225426116|ref|XP_002272534.1| PREDICTED: endoglucanase/exoglucanase B [Vitis vinifera]
Length = 545
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 174/398 (43%), Positives = 233/398 (58%), Gaps = 17/398 (4%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV+LDNH+SKP WCC+ DGNGFFGD+YF+P W++GLTK+A F VV MS+RNEL
Sbjct: 146 MVVLDNHVSKPKWCCAEDDGNGFFGDKYFHPKEWLQGLTKVAHRFKDKSQVVAMSMRNEL 205
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP Q WY+ ++ G +H+ NP +LVI+SGL FD DLSF++ + + L K+VFE
Sbjct: 206 RGPNQTEHIWYKQIRKGGRTIHSINPNLLVIVSGLEFDTDLSFLKQKPLKLKLPNKVVFE 265
Query: 121 AHWYGF---TDGQAWVDGNPNQVCGRVVDNVMRLSGFL--LEQGWPLFVSEFGADLRGNN 175
AHWY F T G W + NQ+C + + F+ P F+SEFG DLRG N
Sbjct: 266 AHWYSFSSITGGVEWTEQPLNQICHNRTEWFESGAAFVGTGSNPAPFFLSEFGIDLRGVN 325
Query: 176 VNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
DNR+ CF A+ D DWALWTL GSYY R+GV+G E YG+ D+NW RN FL+
Sbjct: 326 PRDNRFFGCFLAFVAQRDLDWALWTLQGSYYYRDGVVGQEETYGVLDYNWDKPRNPKFLK 385
Query: 236 RISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRK-SFLDPLTLGPCTESEAWSYTPH-K 293
RI LQ + P H +I+HP +G CV K D + C + WS+ +
Sbjct: 386 RIRILQDILQDPNSNVPKYH-LIFHPRSGRCVSVKGEGQDQIHGRNCKKGSRWSHDGNGG 444
Query: 294 TISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSS 353
I L G+ CL+A G PAKL C+ + W++IS SK+H+++ + G +CLD +S
Sbjct: 445 AIRLMGSGLCLKAVGDGLPAKLSTDCSSPQARWKLISKSKLHIAAMDEQGKPLCLDGKNS 504
Query: 354 N--TIVTNTCKCLSRDKT-------CDPASQWFKLVDS 382
N +I+T TC C +D +P QWFK V S
Sbjct: 505 NSSSILTRTCICALQDGDGSDLDCLTNPQRQWFKFVPS 542
>gi|297742255|emb|CBI34404.3| unnamed protein product [Vitis vinifera]
Length = 561
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 174/398 (43%), Positives = 233/398 (58%), Gaps = 17/398 (4%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV+LDNH+SKP WCC+ DGNGFFGD+YF+P W++GLTK+A F VV MS+RNEL
Sbjct: 162 MVVLDNHVSKPKWCCAEDDGNGFFGDKYFHPKEWLQGLTKVAHRFKDKSQVVAMSMRNEL 221
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP Q WY+ ++ G +H+ NP +LVI+SGL FD DLSF++ + + L K+VFE
Sbjct: 222 RGPNQTEHIWYKQIRKGGRTIHSINPNLLVIVSGLEFDTDLSFLKQKPLKLKLPNKVVFE 281
Query: 121 AHWYGF---TDGQAWVDGNPNQVCGRVVDNVMRLSGFL--LEQGWPLFVSEFGADLRGNN 175
AHWY F T G W + NQ+C + + F+ P F+SEFG DLRG N
Sbjct: 282 AHWYSFSSITGGVEWTEQPLNQICHNRTEWFESGAAFVGTGSNPAPFFLSEFGIDLRGVN 341
Query: 176 VNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
DNR+ CF A+ D DWALWTL GSYY R+GV+G E YG+ D+NW RN FL+
Sbjct: 342 PRDNRFFGCFLAFVAQRDLDWALWTLQGSYYYRDGVVGQEETYGVLDYNWDKPRNPKFLK 401
Query: 236 RISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRK-SFLDPLTLGPCTESEAWSYTPH-K 293
RI LQ + P H +I+HP +G CV K D + C + WS+ +
Sbjct: 402 RIRILQDILQDPNSNVPKYH-LIFHPRSGRCVSVKGEGQDQIHGRNCKKGSRWSHDGNGG 460
Query: 294 TISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSS 353
I L G+ CL+A G PAKL C+ + W++IS SK+H+++ + G +CLD +S
Sbjct: 461 AIRLMGSGLCLKAVGDGLPAKLSTDCSSPQARWKLISKSKLHIAAMDEQGKPLCLDGKNS 520
Query: 354 N--TIVTNTCKCLSRDK-------TCDPASQWFKLVDS 382
N +I+T TC C +D +P QWFK V S
Sbjct: 521 NSSSILTRTCICALQDGDGSDLDCLTNPQRQWFKFVPS 558
>gi|413922904|gb|AFW62836.1| hypothetical protein ZEAMMB73_750687 [Zea mays]
Length = 396
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 148/248 (59%), Positives = 182/248 (73%), Gaps = 2/248 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S PGWCCSN DGNGFFGD+ F+P++W+ GL MATIF V NVVGMSLRNEL
Sbjct: 149 MVILDNHVSTPGWCCSNDDGNGFFGDRDFDPNVWVDGLGSMATIFADVPNVVGMSLRNEL 208
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTG--KLV 118
RGP+QN +DWY YMQ GAEAVHAA+P LVI+ GL++D DLSF+ + V ++F KLV
Sbjct: 209 RGPRQNPQDWYTYMQRGAEAVHAASPRALVIMGGLSYDYDLSFLAARQVGVSFAAENKLV 268
Query: 119 FEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVND 178
FE HWY F+D +AW + N+VCGR + R GFLL +G+PLF+SEFGAD RG + D
Sbjct: 269 FEVHWYSFSDARAWETESANEVCGRAARDFARRGGFLLARGFPLFLSEFGADSRGGDPKD 328
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NR+ C VAAE D DWA W L GSY LR+GV G++E YG+ DW+W RN + L RI
Sbjct: 329 NRFFPCAAAVAAEHDVDWAYWALQGSYALRQGVAGMDEVYGVLDWSWSKPRNETVLPRIQ 388
Query: 239 SLQSPFRG 246
+LQ P +G
Sbjct: 389 ALQRPLQG 396
>gi|413922903|gb|AFW62835.1| hypothetical protein ZEAMMB73_750687 [Zea mays]
Length = 380
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 148/248 (59%), Positives = 182/248 (73%), Gaps = 2/248 (0%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S PGWCCSN DGNGFFGD+ F+P++W+ GL MATIF V NVVGMSLRNEL
Sbjct: 133 MVILDNHVSTPGWCCSNDDGNGFFGDRDFDPNVWVDGLGSMATIFADVPNVVGMSLRNEL 192
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTG--KLV 118
RGP+QN +DWY YMQ GAEAVHAA+P LVI+ GL++D DLSF+ + V ++F KLV
Sbjct: 193 RGPRQNPQDWYTYMQRGAEAVHAASPRALVIMGGLSYDYDLSFLAARQVGVSFAAENKLV 252
Query: 119 FEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVND 178
FE HWY F+D +AW + N+VCGR + R GFLL +G+PLF+SEFGAD RG + D
Sbjct: 253 FEVHWYSFSDARAWETESANEVCGRAARDFARRGGFLLARGFPLFLSEFGADSRGGDPKD 312
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERIS 238
NR+ C VAAE D DWA W L GSY LR+GV G++E YG+ DW+W RN + L RI
Sbjct: 313 NRFFPCAAAVAAEHDVDWAYWALQGSYALRQGVAGMDEVYGVLDWSWSKPRNETVLPRIQ 372
Query: 239 SLQSPFRG 246
+LQ P +G
Sbjct: 373 ALQRPLQG 380
>gi|449527495|ref|XP_004170746.1| PREDICTED: endoglucanase E1-like [Cucumis sativus]
Length = 539
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 163/391 (41%), Positives = 233/391 (59%), Gaps = 12/391 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVI DNH+S+P WCCS DGNGFFG++YF+P W++GL+ +A FN VVGMSLRNEL
Sbjct: 144 MVIADNHMSQPRWCCSLDDGNGFFGNRYFDPQEWLQGLSLVAQRFNNKSTVVGMSLRNEL 203
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL-TFTGKLVF 119
RG +N DW Y+ G +H NP VLVI+SGLN+D DL ++++ +N+ T KL F
Sbjct: 204 RGMMENANDWNNYVTQGVTTIHKINPAVLVIVSGLNYDNDLRCLKDKPLNVSTLDNKLAF 263
Query: 120 EAHWYGFT-DGQAWVDGNP-NQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFGADLRGNN 175
E H Y F+ D ++ P N +C +++ + + F++E +PLFVSE+G D R +
Sbjct: 264 EVHLYSFSGDSESKFVQQPLNNICAKIMHEFIDHAEFVIEGPNPFPLFVSEYGYDQREVD 323
Query: 176 VNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
+NR+++CF A+ D DWALWT GSYY REG L E +G+ D NW I+N +F++
Sbjct: 324 DAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQAELAETFGVLDSNWTQIKNPNFVQ 383
Query: 236 RISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKT- 294
+ LQ+ + P VIYH +G C++ + + L C+ S WS+ T
Sbjct: 384 KFQLLQTMLQDPNS-NASFSYVIYHVQSGQCIEVSNDNKEIFLTNCSTSSRWSHDNDSTP 442
Query: 295 ISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLD-VDSS 353
I + CL+A G A L C S W IS+S +HL + ++G ++CL ++SS
Sbjct: 443 IKMSSTGLCLKASGEGLEASLSTDCVGKQSLWSAISNSNLHLGTVTEDGKSLCLQIIESS 502
Query: 354 NT--IVTNTCKCLSRDKTC--DPASQWFKLV 380
N+ IVTN+C C + D TC D SQWF+LV
Sbjct: 503 NSSKIVTNSCICTANDPTCLQDTQSQWFELV 533
>gi|449451950|ref|XP_004143723.1| PREDICTED: uncharacterized protein LOC101213113 [Cucumis sativus]
Length = 539
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 168/392 (42%), Positives = 229/392 (58%), Gaps = 13/392 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVI DNHIS+P WCCS DGNGFFGD+YF+ + W++GL +A F VV MSLRNEL
Sbjct: 144 MVIADNHISQPRWCCSLEDGNGFFGDRYFDTEEWLEGLRLVARRFYNKSAVVAMSLRNEL 203
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL-TFTGKLVF 119
RG KDW +Y+ GA +H NP++LVI+SGLNFD DL R + L KLVF
Sbjct: 204 RGASSKSKDWNKYITQGATTIHNINPKILVIISGLNFDNDLRCQRQYPLQLNNLHNKLVF 263
Query: 120 EAHWYGFT-DGQAWVDGNP-NQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFGADLRGNN 175
E H Y F+ + Q+ NP N++C +V++ + + F++E + PLFVSEFG D RG N
Sbjct: 264 EVHLYSFSGESQSKFIHNPLNKICSKVINGFVERAEFVMEGAEAVPLFVSEFGLDQRGVN 323
Query: 176 VNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
D+R+L+CF E D DWALW GSYY R+G +G E +G+ ++NW D+RN F +
Sbjct: 324 EADDRFLSCFSAHLVEKDLDWALWGWQGSYYYRQGKVGPEEVFGVLNYNWSDVRNPHFSQ 383
Query: 236 RISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLD-PLTLGPCTESEAWSYTPHKT 294
LQ+ + P + + V+YHP +G CV + + L C+ + WSY T
Sbjct: 384 MFQLLQTMLQDPNSNSSNTY-VMYHPQSGQCVLVQDMKHMQIYLNDCSNASHWSYEGDGT 442
Query: 295 -ISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTT-VCLDVDS 352
I L FCL+A G P L C S W ISDSK+HL++ G +CL+ +S
Sbjct: 443 PIMLASTNFCLKASGDGLPPSLSRDCFGEQSVWTAISDSKLHLATLTKQGNNGMCLEKES 502
Query: 353 SNT--IVTNTCKCLSRDKTC--DPASQWFKLV 380
SN+ I+ +C C+ D C D +QWF+LV
Sbjct: 503 SNSSRILMRSCVCVGNDSNCLQDTQAQWFQLV 534
>gi|449523497|ref|XP_004168760.1| PREDICTED: uncharacterized protein LOC101223816 [Cucumis sativus]
Length = 535
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 167/390 (42%), Positives = 224/390 (57%), Gaps = 11/390 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV+ DNHIS+P WCCSN DGNGFFGD+YFN W++GL+ VV MSLRNE
Sbjct: 144 MVVSDNHISQPRWCCSNDDGNGFFGDRYFNSQEWLQGLSLATQSLKTKPQVVAMSLRNEP 203
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP QNV+ W++YM GA+ VH NP LV++SGL++D LSF++N+++ KLVFE
Sbjct: 204 RGPNQNVEMWFQYMSQGAKLVHQINPNALVVVSGLSYDTYLSFLKNRSMGFNLDNKLVFE 263
Query: 121 AHWYGFTD--GQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNV 176
AH Y FT+ G W N C V +GFL+ + PLFVSEFG + G N
Sbjct: 264 AHLYSFTNNMGDYWTSKPLNTFCANVNQGFEDRAGFLVRGQNPIPLFVSEFGINQMGVNE 323
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
NR+L+CFF + D+DW LW L GSYY REGV E +G+ D + +++N FL++
Sbjct: 324 GQNRFLSCFFTYLTKNDFDWGLWALQGSYYYREGVKNDEETFGVLDSKFTNVKNPKFLQK 383
Query: 237 ISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKT-I 295
+Q+ + P T ++YHP +G CV R + L + C S WS+ T I
Sbjct: 384 FQLMQTKLQDPSSNLTTSF-IMYHPLSGECV-RMNKKYQLGVSSCKTSNRWSHEQDDTPI 441
Query: 296 SLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDS-SN 354
L G+ CLQA G P L C+ S W+ S++K+ L++ + G +CL S S+
Sbjct: 442 KLAGSILCLQAVGDGLPPILSKDCSSQQSAWKYASNAKLQLATVDEEGQALCLQRASHSH 501
Query: 355 TIVTNTCKCLSRDKTC--DPASQWFKLVDS 382
I+TN C C D C DP SQWF LV S
Sbjct: 502 QILTNKCMC-PNDSECQGDPQSQWFTLVPS 530
>gi|115447081|ref|NP_001047320.1| Os02g0596200 [Oryza sativa Japonica Group]
gi|46805307|dbj|BAD16839.1| putative cellulase [Oryza sativa Japonica Group]
gi|47847822|dbj|BAD21617.1| putative cellulase [Oryza sativa Japonica Group]
gi|113536851|dbj|BAF09234.1| Os02g0596200 [Oryza sativa Japonica Group]
Length = 449
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 135/211 (63%), Positives = 162/211 (76%), Gaps = 4/211 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S+PGWCC++ DGNGFFGD++F+PD W++GL MA +F GV NVVGMSLRNEL
Sbjct: 147 MVILDNHVSRPGWCCADDDGNGFFGDRHFDPDAWVRGLGAMAALFAGVPNVVGMSLRNEL 206
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFT----GK 116
RGP+QN DWYRYMQ+GAEAVHAANP LVI+ GL +D DLSF+ + V+++F GK
Sbjct: 207 RGPRQNADDWYRYMQMGAEAVHAANPAALVIMGGLGYDTDLSFLAARPVDVSFAAAERGK 266
Query: 117 LVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNV 176
LVFE HWY F D +AW + N+VCGRV V R GFLL+ G+PLF+SEFGAD RG +
Sbjct: 267 LVFELHWYSFADARAWESEDANEVCGRVARGVARRGGFLLDAGFPLFLSEFGADTRGGSR 326
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
D+RYL C VAAELD DWALW L GSY L
Sbjct: 327 KDDRYLPCAAAVAAELDLDWALWALQGSYAL 357
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 44/74 (59%), Gaps = 12/74 (16%)
Query: 324 STWEIISDSKMHLS---------SKADNGTTVCLDV-DSSNTIVTNTCKCL--SRDKTCD 371
STW ++S S MH++ S+ G +CLDV D ++VTN C+CL + CD
Sbjct: 358 STWRLVSGSTMHVAVNATTTTTPSRDGGGGLLCLDVGDDGRSVVTNPCRCLDDAAAGECD 417
Query: 372 PASQWFKLVDSTRS 385
P +QWFKLV STRS
Sbjct: 418 PETQWFKLVTSTRS 431
>gi|356527943|ref|XP_003532565.1| PREDICTED: uncharacterized protein LOC100811735 [Glycine max]
Length = 531
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 156/390 (40%), Positives = 223/390 (57%), Gaps = 10/390 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV++DNH+S WCC+N D NGFFGD++FN W++GL +A F G NV M LRNEL
Sbjct: 142 MVLIDNHVSLAKWCCANDDQNGFFGDRHFNTSEWLQGLAFIAHHFKGKPNVFAMDLRNEL 201
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG +QN DWY+YM GA +H NP+ +VI+SGL FD DLSF++ + ++L F K+V+E
Sbjct: 202 RGSRQNHHDWYKYMTQGANTIHDINPDFIVIISGLAFDNDLSFLKKKPLDLNFPHKIVYE 261
Query: 121 AHWYGFT-DGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNVN 177
+H Y + D W N +C + + + S FLL + PL VSEFG D+ G +
Sbjct: 262 SHIYSVSGDTHRWRVQPVNWICNATIQLLHQQSSFLLSGKNPAPLLVSEFGYDMTGGSFA 321
Query: 178 DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERI 237
DN YL C A +D DW+LW GSYY R+G +GL E Y + D +W R+ +F ++
Sbjct: 322 DNMYLPCIVSYFASVDLDWSLWAFQGSYYYRQGKVGLGESYAVMDDDWKSYRDPNFTQKF 381
Query: 238 SSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKT-IS 296
LQ + P +I+HP TG C + + L +G C + WSY + I
Sbjct: 382 ELLQRMVQDP-TSNVSKSNIIFHPLTGYCAHVNNSKE-LVMGDCKSNSLWSYEGDGSPIR 439
Query: 297 LKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDS-SNT 355
L + CL+A P L C S+W+ +S + +HL++ +G +CL+ DS S+
Sbjct: 440 LMNSAKCLKAVGERLPPSLSEDCLSPQSSWKTVSMTGLHLATFDKDGDLLCLEKDSNSSK 499
Query: 356 IVTNTCKCLSRDKTC---DPASQWFKLVDS 382
IVT+ C C+S D + +P SQWFKLV +
Sbjct: 500 IVTSKCICISDDDSSCLDNPQSQWFKLVST 529
>gi|357502161|ref|XP_003621369.1| Endoglucanase E1 [Medicago truncatula]
gi|355496384|gb|AES77587.1| Endoglucanase E1 [Medicago truncatula]
Length = 530
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 150/388 (38%), Positives = 221/388 (56%), Gaps = 11/388 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV+ DNH+S P WCC N+DGNGFFGDQYFNP+ W++GL+ +A G VV + LRNEL
Sbjct: 142 MVLADNHVSDPKWCCDNNDGNGFFGDQYFNPEEWLQGLSNVANRVKGKPQVVAIGLRNEL 201
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGPKQN +W++YM GA VH ANP+VLV +SGLN+D DLSF++ + +N+ KLV+E
Sbjct: 202 RGPKQNENNWHKYMSQGATTVHKANPDVLVFVSGLNYDTDLSFLKTKPLNVNIGNKLVYE 261
Query: 121 AHWYGFTDGQA--WVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNV 176
H Y ++ G+ W N+ C V++N+ +GFL+ PL +SEFG ++
Sbjct: 262 VHSYAWSSGERSDWDKQPLNKKCANVMNNLNDKAGFLMSGSNPKPLVMSEFGINMENKTD 321
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
+ ++L+C +D DWALW G+YY+R+ I ++E +G++ +++ +R F +R
Sbjct: 322 MNEKFLSCMLAYLVGVDLDWALWAAQGAYYIRKNEIIVSETFGIWSYDFRTLRYIEFPQR 381
Query: 237 ISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTIS 296
+ P + + +IYHP +G CV+ + L LG C W+ + I
Sbjct: 382 FQLMHKKLLEPSSNSSKSY-IIYHPLSGQCVKVNK-RNKLVLGDCDGKSKWNQVGQQ-IK 438
Query: 297 LKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDS--SN 354
L G C++A G KL C S W+ +S + +HL + G +CL S S
Sbjct: 439 LVGNDACIEAIKDGSQVKLSNDCRSKQSFWKTLSATNLHLGTLDSQGQNLCLQRKSPTSP 498
Query: 355 TIVTNTCKCLSRDKTC--DPASQWFKLV 380
IVT C C+ + C DP SQWF+LV
Sbjct: 499 KIVTKKCICIDDNPACLEDPQSQWFQLV 526
>gi|255537625|ref|XP_002509879.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
gi|223549778|gb|EEF51266.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
Length = 542
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 164/400 (41%), Positives = 232/400 (58%), Gaps = 14/400 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV+LDNH+S+P WCC D NGFFGD +F+P W++GL +A IF G VV MS+RNEL
Sbjct: 144 MVLLDNHVSQPKWCCPQDDENGFFGDIHFHPKEWLRGLAIVAKIFQGKSQVVAMSMRNEL 203
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN--LTFTGKLV 118
RGP QN DWY+Y+Q GA VH NPEVLV++SGL + DLSF++ + ++ L KLV
Sbjct: 204 RGPYQNEHDWYKYIQEGARMVHKLNPEVLVLVSGLVWGTDLSFLKKKPLHLGLNLDNKLV 263
Query: 119 FEAHWYGFT-DGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNN 175
+EAHWY F+ D + W N++C + LSGF++ E PLF+ E G D RG N
Sbjct: 264 YEAHWYSFSGDPKVWEVQPLNRICDLKTQIQVDLSGFVITGENPVPLFLGEVGIDQRGVN 323
Query: 176 VNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
DNR+ CF AE D DW LW GSYY +EG+ G +E YGL +++W +R+ F +
Sbjct: 324 RADNRFFTCFLAYVAENDLDWGLWAFQGSYYFKEGIAGPDENYGLMNFDWNYLRSPEFDD 383
Query: 236 RISSLQSPFRGP-GVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPH-K 293
RI ++ + P + T ++YHP +G CV S + + + WS+
Sbjct: 384 RIWLIKRMIQDPDSILSTSY--LMYHPLSGNCVH-ASEKNEIYASRFQQHSRWSHDGDGA 440
Query: 294 TISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSS 353
I L G+ CL+A G L C S+W+++S SK+HL K ++G +CL+ +S
Sbjct: 441 PIRLMGSALCLKAIGDGLEPVLSNDCFSQQSSWKLLSSSKLHLGVKDEHGEYLCLEKESF 500
Query: 354 NT--IVTNTCKCLSRDKTC--DPASQWFKLVDSTRSSTTT 389
N+ + T C C+ D C +P SQWFKL+ + T+
Sbjct: 501 NSSKVFTRKCICIEDDSDCQENPQSQWFKLIKTNIKVNTS 540
>gi|147789205|emb|CAN62580.1| hypothetical protein VITISV_036569 [Vitis vinifera]
Length = 221
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 144/236 (61%), Positives = 169/236 (71%), Gaps = 27/236 (11%)
Query: 149 MRLSGFLLEQGWPL-FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
MR G LL+QGWPL FVSE G D DNR+LNCFFG+AAELD+DWALWTL
Sbjct: 1 MRRGGVLLQQGWPLXFVSELGVD-------DNRHLNCFFGLAAELDFDWALWTL------ 47
Query: 208 REGVIGLNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCV 267
E GL +WN SSF +RIS+LQSP +GP V HK+I+HP+TGLC+
Sbjct: 48 -------EETNGLMNWN------SSFFQRISALQSPLQGPDVSRVRRHKIIFHPSTGLCI 94
Query: 268 QRKSFLDPLTLGPCTESEAWSYTPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWE 327
R+S +PL LGPCT+SEAW YTP K +++KG YFCLQA +GKPAKL IICT GS WE
Sbjct: 95 LRESGSEPLKLGPCTKSEAWGYTPQKLLTVKGTYFCLQAVGLGKPAKLSIICTKPGSNWE 154
Query: 328 IISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQWFKLVDST 383
IISDSKM+LS+K +GT VCLDVDSS+TIVT+ CKCL R CDP SQWFK+VDST
Sbjct: 155 IISDSKMYLSTKLGDGTRVCLDVDSSSTIVTDACKCLGRGDMCDPGSQWFKVVDST 210
>gi|297852122|ref|XP_002893942.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297339784|gb|EFH70201.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 420
Score = 282 bits (721), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 161/342 (47%), Positives = 196/342 (57%), Gaps = 72/342 (21%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH++KPGWCC+++DGN F GD++F+P +W L+KM F+GV NVVGMSLRNEL
Sbjct: 137 MVILDNHLTKPGWCCADNDGNDFSGDKFFDPTVWAAALSKMTATFDGVSNVVGMSLRNEL 196
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
GPKQNV DW++YMQ GAEAVH+AN VLVILSGL FD DLSFVR VNL+FTGKLVFE
Sbjct: 197 TGPKQNVNDWFKYMQQGAEAVHSANKNVLVILSGLTFDTDLSFVRCLPVNLSFTGKLVFE 256
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
HWY F+ P + G GFLL QG+PLF+SEFG D R N ND+R
Sbjct: 257 LHWYSFS-------SKPERNSG----------GFLLNQGFPLFLSEFGMDKREINANDDR 299
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
Y C G AE D DW++W L GSY L V R SSFLE+IS L
Sbjct: 300 YFGCLTGWTAENDVDWSIWALTGSYILNTSV-----------------RKSSFLEKISLL 342
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
QS +GPG T + ++ P T LC L+ GP
Sbjct: 343 QSTLQGPGP-RTDDYNLVLDPLTRLC---------LSSGP-------------------- 372
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADN 342
+++ K + G C+ GS W+ S S+MHLSS N
Sbjct: 373 ------ENLVKMTRTG--CSGPGSKWQTDSASRMHLSSTTSN 406
>gi|147826489|emb|CAN72915.1| hypothetical protein VITISV_020826 [Vitis vinifera]
Length = 221
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 145/248 (58%), Positives = 170/248 (68%), Gaps = 29/248 (11%)
Query: 149 MRLSGFLLEQGWPL-FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
MR G LL+QGWPL FVSE G D DNR+LNCFFG+AAELD+DWALWTL
Sbjct: 1 MRGGGVLLQQGWPLMFVSELGVD-------DNRHLNCFFGLAAELDFDWALWTL------ 47
Query: 208 REGVIGLNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCV 267
E GL +WN S+F +RIS+ QSP +GP V HK+I HP+TGLC+
Sbjct: 48 -------EETNGLMNWN------SNFFQRISAXQSPLQGPDVSRVRPHKIILHPSTGLCI 94
Query: 268 QRKSFLDPLTLGPCTESEAWSYTPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWE 327
R+S +PL LGPCTESEAW YTP K + +KG YFCLQA +GKPAKL IICT GS WE
Sbjct: 95 LRESLSEPLKLGPCTESEAWGYTPQKILIVKGTYFCLQAVGLGKPAKLSIICTQPGSNWE 154
Query: 328 IISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSST 387
IISDSKM+LS+K +GTTVCLDVDSS+ IVT+ CKCL RD CDP SQWF +VDST +
Sbjct: 155 IISDSKMYLSTKLGDGTTVCLDVDSSSNIVTDACKCLGRDDMCDPGSQWFNVVDST--NI 212
Query: 388 TTKSFFQF 395
T + Q
Sbjct: 213 TRRPILQI 220
>gi|357502207|ref|XP_003621392.1| Endoglucanase E1 [Medicago truncatula]
gi|355496407|gb|AES77610.1| Endoglucanase E1 [Medicago truncatula]
Length = 534
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 152/392 (38%), Positives = 221/392 (56%), Gaps = 15/392 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV+ DNH+S P WCC N+DGNGFFGDQYFNP+ W++GL+ ++ G VV + LRNEL
Sbjct: 142 MVLADNHVSDPKWCCDNNDGNGFFGDQYFNPEEWLQGLSNVSNRVKGKSQVVAIGLRNEL 201
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP QN+ W +YM GA VH NP VLV +SG N+D DLSF++ +N + KLV+E
Sbjct: 202 RGPSQNISSWQKYMSQGATTVHKENPNVLVFVSGFNYDIDLSFLKTNPLNTSIGDKLVYE 261
Query: 121 AHWYGFTDGQA--WVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRG--- 173
H Y ++ G W N+ C V++N+ +GFL+ PL +SEFGADL
Sbjct: 262 VHSYAWSTGDRSDWDKQPLNKKCANVMNNLNDKAGFLMSGSNPNPLVMSEFGADLTAIDD 321
Query: 174 -NNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSS 232
N + R+L+C A +D DWALWT GSYY+R+ +E +G+++ ++ +R
Sbjct: 322 KNQTFNQRFLSCMLAYLAGVDLDWALWTAQGSYYIRDKESNASEPFGIWNIDFKSLRYPD 381
Query: 233 FLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPH 292
F +R +Q P + + + +IYHP +G CV+ + + L LG C + W+
Sbjct: 382 FSQRFQLVQKKLLDPSLNSSKSY-IIYHPLSGQCVKVNTN-NELELGDCEWASKWNQEGQ 439
Query: 293 KTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDS 352
+ I L G ++A G K+ C S W+ +S + +HL + + G +CL +S
Sbjct: 440 Q-IKLVGNGTYIEAVSDGSQVKVSNDCKSKQSFWKTLSATNLHLGTLDEQGQKLCLQRES 498
Query: 353 --SNTIVTNTCKCLSRDKTC--DPASQWFKLV 380
S IVT C C+ + C DP SQWF+LV
Sbjct: 499 PTSPKIVTKRCICIDDNPACLEDPQSQWFQLV 530
>gi|224053839|ref|XP_002298005.1| predicted protein [Populus trichocarpa]
gi|222845263|gb|EEE82810.1| predicted protein [Populus trichocarpa]
Length = 535
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 156/392 (39%), Positives = 220/392 (56%), Gaps = 15/392 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M+ DNHIS+P WCCSN D NGF D +F+P+ WI+GL +A F G ++ + LRNE
Sbjct: 145 MIDADNHISEPIWCCSNDDENGFPNDPHFDPEEWIEGLKLVAQRFKGKSQLISIGLRNEP 204
Query: 61 RGPKQNVKDWY-RYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVF 119
RG QN W+ Y+ A VH ANP+VLVI SGLNF DL++ + ++ F KL+F
Sbjct: 205 RGKNQNATLWFDHYIMEAAAQVHQANPDVLVIASGLNFATDLTYFKKHSLKSNFDNKLIF 264
Query: 120 EAHWYGFT-DGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNV 176
E H Y + G WVDG+ N+ C + ++ F+ E PLF SEFG D +
Sbjct: 265 EGHSYSWGGKGNPWVDGSVNKACADKIGSLNNNLAFVTDGENAVPLFFSEFGIDRKQMPA 324
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
D+R+L+CF AAE D DW LW L GSYYLR+ V + EY+G+ + +W ++N R
Sbjct: 325 GDDRFLSCFSTWAAEKDLDWGLWALQGSYYLRQNVTNMEEYFGVLEIDWDRVKNPEVERR 384
Query: 237 ISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHK-TI 295
+ L+ P L+ ++YHP +G CV + + G C W++ H+ +
Sbjct: 385 LGLLKQTLLDPKS-TAPLNYIMYHPQSGACVG-EGMDGQIRAGNCKGLTRWTHNGHEGPL 442
Query: 296 SLKGAYFCLQAKHVGKPAKLGIICTDCG-STWEIISDSKMHLSSKADNGTTVCLDVDS-- 352
LK CL+A G P I+ DC +TW+ IS SK+HL+SK G +CL ++
Sbjct: 443 ELKRTGLCLKAIGDGLPP---ILTPDCSQTTWKPISASKLHLASKDHRGEYLCLHLEPPF 499
Query: 353 SNTIVTNTCKCLSRDKTC--DPASQWFKLVDS 382
+ IVT C C+ D TC +P SQWFKLV++
Sbjct: 500 AGNIVTKKCICVGDDPTCKDNPTSQWFKLVET 531
>gi|147800141|emb|CAN73208.1| hypothetical protein VITISV_009079 [Vitis vinifera]
Length = 221
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/236 (60%), Positives = 166/236 (70%), Gaps = 27/236 (11%)
Query: 149 MRLSGFLLEQGWPL-FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
MR G LL+QGWPL FVSE G D DNR+LNCFFG+AA LD+DWALWTL
Sbjct: 1 MRGGGVLLQQGWPLMFVSELGVD-------DNRHLNCFFGLAAXLDFDWALWTL------ 47
Query: 208 REGVIGLNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCV 267
E GL +WN S+F +RIS+ QSP +GP V HK+I HP+TGLC+
Sbjct: 48 -------EETNGLMNWN------SNFFQRISAXQSPLQGPDVSRVRPHKIILHPSTGLCI 94
Query: 268 QRKSFLDPLTLGPCTESEAWSYTPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWE 327
R+S +PL LGPCTESEAW YTP K + +KG YFCLQA +GKPAKL IICT GS WE
Sbjct: 95 LRESLSEPLKLGPCTESEAWGYTPQKILIVKGTYFCLQAVGLGKPAKLSIICTQPGSNWE 154
Query: 328 IISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQWFKLVDST 383
IISDSKM+LS+K +GTTVCLDVDSS+ IVT+ CKCL RD CDP SQWFK+VDST
Sbjct: 155 IISDSKMYLSTKLGDGTTVCLDVDSSSNIVTDACKCLGRDDXCDPGSQWFKVVDST 210
>gi|449452430|ref|XP_004143962.1| PREDICTED: uncharacterized protein LOC101211675 [Cucumis sativus]
gi|449527485|ref|XP_004170741.1| PREDICTED: uncharacterized protein LOC101224959 [Cucumis sativus]
Length = 479
Score = 277 bits (709), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 144/390 (36%), Positives = 222/390 (56%), Gaps = 13/390 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M++LDNHIS+P WCCS DGNGFFGD++F+ W++GL +A F V+GMSLRNEL
Sbjct: 88 MIVLDNHISQPRWCCSLDDGNGFFGDRHFDTLEWLQGLDYVARHFTWHSQVIGMSLRNEL 147
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP N+ +WY+Y++ G+ +H N ++LVI+SG+ FD DLSF++ +++ K+V E
Sbjct: 148 RGPYTNMDNWYKYVKEGSHLIHTKNRKLLVIISGITFDNDLSFLKKKSLGYNLHNKVVLE 207
Query: 121 AHWYGFTDGQ-----AWVDGNPNQVCGRVVDNVMRLSGFLL---EQGWPLFVSEFGADLR 172
AH Y F+ + +V N VC +V++ R +GF++ ++ +PL++SEFG DLR
Sbjct: 208 AHLYPFSGSEKLPESKFVKKPLNIVCNQVMEKFEREAGFVVNMEDEPYPLWLSEFGYDLR 267
Query: 173 GNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSS 232
G+N NR+++CF D DWA W G+Y R+G ++E +G+ D W + RN
Sbjct: 268 GDNKAQNRFMSCFLAHIVAKDLDWAYWAFQGTYMYRQGQESVDETFGVMDSTWTNDRNPQ 327
Query: 233 FLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPH 292
+ + + + P + +I HP +G C+ R + + LG C W+++
Sbjct: 328 LNQMLQLAKRINQDPNS-NASMSYIILHPVSGQCI-RSNGQGGIVLGDCLTPTHWTHSGD 385
Query: 293 KTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDS 352
+ CLQ+ GKP + C+ GS W + S +K+ L++K+ G CL+ S
Sbjct: 386 GSSMKLSNGQCLQSAGDGKPLIVSAECSSDGSKWTMASKAKLQLATKS-GGENFCLEKKS 444
Query: 353 SNTIVTNTCKCLSRDKTC--DPASQWFKLV 380
+ IV C CL + C DP QWFKLV
Sbjct: 445 NTKIVVKRCICLEDELNCFNDPQPQWFKLV 474
>gi|255537621|ref|XP_002509877.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
gi|223549776|gb|EEF51264.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
Length = 547
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 144/407 (35%), Positives = 221/407 (54%), Gaps = 27/407 (6%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+V+LDN +S+P WCC DGNGFFGD+ F+ + W++ L +A F G V+G+S RNEL
Sbjct: 140 LVVLDNQVSRPTWCCGYDDGNGFFGDKDFDAEEWLQALATVADRFKGKSQVIGISTRNEL 199
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP N DWY+YM G +H ANP+VL+ +SG+ + DL++++N++++ F KLV+E
Sbjct: 200 RGPSSNEDDWYKYMHEGGSTIHRANPDVLIFVSGIGYASDLTYLQNKSLDTNFDNKLVYE 259
Query: 121 AHWYGFT--DGQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNV 176
AHWY F+ G+ W + N C + +GF++ E+ +P+F+ EFG D RG +
Sbjct: 260 AHWYPFSWGVGKTWDLEDVNGACYDNTQYFVNHTGFVINGEKPFPMFLGEFGIDQRGLSR 319
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
D ++ CF AA+ D DW W GSYY RE G E +G+ ++NW +RN F R
Sbjct: 320 GDEHFMACFMAYAADTDVDWGFWAWQGSYYYRENQTGTEETFGVMNFNWNRVRNPEFQNR 379
Query: 237 ISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWS-----YTP 291
+ + + P + +++H +G C+ + + C WS P
Sbjct: 380 MQLITKKLQDPSS-NSSTSYIMFHALSGSCIHTDGNKE-IYATSCKAPRRWSDPAGDVIP 437
Query: 292 HK--------TISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNG 343
K I LKG CL+A G L C+ S+W+ +S +K+HL++ +NG
Sbjct: 438 MKWLHDGDGAPIRLKGTKLCLKALGEGLAPILSEDCSSPQSSWKFLSKTKLHLAATDENG 497
Query: 344 TTVCLDVDSSNT--IVTNTCKCLSRDKTC------DPASQWFKLVDS 382
+CL +S T I+TN C ++ + C DP +QWFKLV +
Sbjct: 498 EYLCLQKESPYTSKILTNACIFMNEEPECGKDPQKDPVTQWFKLVKT 544
>gi|449452432|ref|XP_004143963.1| PREDICTED: uncharacterized protein LOC101211910 [Cucumis sativus]
Length = 525
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 154/389 (39%), Positives = 222/389 (57%), Gaps = 16/389 (4%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHIS+P WCCS DGNGFFGD+ F P W++GL +A F+ VVGMSLRNEL
Sbjct: 139 MVILDNHISQPRWCCSLHDGNGFFGDRNFKPIEWLRGLAYVARHFSWHPKVVGMSLRNEL 198
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RG NV W +Y++LG+ +H NP +LV++SGLN+D DLS+++ + + K+V E
Sbjct: 199 RG-SNNVGVWRKYVKLGSHLIHRINPRLLVVISGLNYDNDLSYLKKKPLGYNLNNKVVLE 257
Query: 121 AHWYGFT--DGQAWVDGNPNQVCGRVVDNVMRLSGFL--LEQGWPLFVSEFGADLRGNNV 176
AH Y F+ +V N VC +V++ R +GF+ ++ +PLF+SEFG DLRG N
Sbjct: 258 AHLYSFSGEPESKFVKKPLNIVCNQVMEKFEREAGFVVDMKDPYPLFLSEFGYDLRGGNK 317
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
NR+++CF D DWA W GSY R+G ++E +G+ D +W R S L++
Sbjct: 318 AQNRFMSCFLARIIGKDIDWAYWAFQGSYMYRQGQQDVDESFGIMDSSWTKDR-SPRLQQ 376
Query: 237 ISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLD---PLTLGPCTESEAWSYTPHK 293
+ L + + ++ HP +G CV+ LD + LG C W +T
Sbjct: 377 MLQLAKRINQDPNSKGPMSYIMLHPVSGQCVK----LDGKGGIELGDCETPTLWDHTGDG 432
Query: 294 TISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSS 353
+ CL++ GKP + C+ GS+W + S +K+ LS+K+ G +CL+ +S
Sbjct: 433 SPMKLWNGQCLKSAGDGKPPVVSAECSGDGSSWTVASKAKLQLSTKS-GGENICLEKESD 491
Query: 354 NTIVTNTCKCLSRDKTC--DPASQWFKLV 380
+IV C CL + C DP SQWFKLV
Sbjct: 492 TSIVVKKCICLKDEWNCFDDPQSQWFKLV 520
>gi|41323933|gb|AAS00040.1| unknown [Vitis riparia]
Length = 221
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 142/239 (59%), Positives = 168/239 (70%), Gaps = 27/239 (11%)
Query: 149 MRLSGFLLEQGWPL-FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
MR G LL+QGWPL FVSE G D DNR LNCFFG+AAELD+DWALWT+
Sbjct: 1 MRRGGVLLQQGWPLMFVSELGVD-------DNRNLNCFFGLAAELDFDWALWTV------ 47
Query: 208 REGVIGLNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCV 267
E GL +WN SS +RIS+LQSP +GP V HK+I HP TGLC+
Sbjct: 48 -------EETNGLMNWN------SSVFQRISALQSPLQGPDVSRVRPHKIILHPPTGLCI 94
Query: 268 QRKSFLDPLTLGPCTESEAWSYTPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWE 327
+S+ +PL LGPCT+S+AW YTP K + +KG+YFCLQA +GKPAKL IICT GS W+
Sbjct: 95 LWESWTEPLKLGPCTKSDAWGYTPQKLLIVKGSYFCLQAVELGKPAKLSIICTKPGSNWD 154
Query: 328 IISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSS 386
IISDSKM+LS+K + TTVCLDVDSS+TIVT+ CKCL RD TCDP SQWFK+VDST S
Sbjct: 155 IISDSKMYLSTKLADSTTVCLDVDSSSTIVTDACKCLGRDDTCDPGSQWFKVVDSTNIS 213
>gi|147810105|emb|CAN60394.1| hypothetical protein VITISV_015741 [Vitis vinifera]
Length = 221
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 141/239 (58%), Positives = 166/239 (69%), Gaps = 27/239 (11%)
Query: 149 MRLSGFLLEQGWPL-FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
MR G LL+QGWPL FVSE G D DNR LNCFFG+AAELD+DWALWT
Sbjct: 1 MRRGGVLLQQGWPLMFVSELGVD-------DNRNLNCFFGLAAELDFDWALWTX------ 47
Query: 208 REGVIGLNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCV 267
E GL +WN S +RIS+LQSP +GP V HK+I HP TGLC+
Sbjct: 48 -------EETNGLMNWN------SXXFQRISALQSPLQGPXVSRVRPHKIILHPPTGLCI 94
Query: 268 QRKSFLDPLTLGPCTESEAWSYTPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWE 327
+S+ +PL LGPCT+S+AW YTP K + +KG+YFCLQA +GKPAKL IICT GS W+
Sbjct: 95 LWESWTEPLKLGPCTKSDAWGYTPQKLLIVKGSYFCLQAVELGKPAKLSIICTKPGSNWD 154
Query: 328 IISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSS 386
IISDSKM+LS+K + TTVCLDVDSS+TIVT+ CKCL RD TCDP SQWFK+VDST S
Sbjct: 155 IISDSKMYLSTKLGDSTTVCLDVDSSSTIVTDACKCLGRDDTCDPGSQWFKVVDSTNIS 213
>gi|255537623|ref|XP_002509878.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
gi|223549777|gb|EEF51265.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus
communis]
Length = 521
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 143/395 (36%), Positives = 212/395 (53%), Gaps = 33/395 (8%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
M +LDNH+SKP WCCS +DGN FFGD YF+ WI+GLT +AT F G V+ +S RNEL
Sbjct: 145 MTLLDNHVSKPTWCCSETDGNTFFGDTYFDVKEWIQGLTTVATRFRGKPKVMAISTRNEL 204
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP QN DWY+Y+ GA A+H ANP+VL+ SGL++ DL++++++ + F KLV++
Sbjct: 205 RGPLQNQDDWYKYILEGASAIHNANPDVLIFASGLSYANDLTYLKSKTLGTNFDDKLVYD 264
Query: 121 AHWYGFT--DGQAWVDGNPNQVCGRVVDNVMRLSGF---LLEQGWPLFVSEFGADLRGNN 175
AHWY ++ D W N C R + +GF L E+ PLF+ EFG D RG +
Sbjct: 265 AHWYPWSWEDVSTWDVELLNDACYRKTQYFINQTGFTYALHEEPIPLFMGEFGMDQRGLS 324
Query: 176 VNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
D+ +L+CF A ++D DW +W GSYY RE ++E +G D +W IR+ ++
Sbjct: 325 RADDHFLSCFLAFATDIDLDWGMWGWQGSYYFRENKTNIDETFGAMDHHWNRIRSPQYMS 384
Query: 236 RISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTI 295
R+ +++ G T K+ LC +R+S + + + W +T
Sbjct: 385 RVDFVKNKLIG-----TSYRKMQATTKMELCRRRES--NQIGRKQSVLASCWRWTSXXVG 437
Query: 296 SLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNT 355
G PA L CT S+W +++ +K+H+++ + G +CL +S T
Sbjct: 438 D-------------GHPAILSKNCTSQKSSWRLLTATKLHVATVDETGEYLCLQKESPLT 484
Query: 356 IVTNTCKCL--------SRDKTCDPASQWFKLVDS 382
T KC+ RD DP QWFKL +
Sbjct: 485 TKVITSKCVVTLSEPDCQRDPLQDPTMQWFKLTST 519
>gi|357493709|ref|XP_003617143.1| Oligoribonuclease [Medicago truncatula]
gi|355518478|gb|AET00102.1| Oligoribonuclease [Medicago truncatula]
Length = 1551
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/392 (36%), Positives = 220/392 (56%), Gaps = 30/392 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V+LDNH+S+P WCC++ D NGFF DQ+F+P WI GLT A F G + +V MSLRNELR
Sbjct: 220 VLLDNHVSEPKWCCNDDDDNGFFHDQHFDPQEWIHGLTLAAKHFYGHQPIVAMSLRNELR 279
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFEA 121
GP+QN++DWY+YM A +H NP VLV++SGLN+D +L F+RN + + K+V+EA
Sbjct: 280 GPRQNLRDWYKYMSHAALVIHKTNPNVLVVISGLNYDTELQFLRNNPLKIDLGEKMVYEA 339
Query: 122 HWYGFTD------GQAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRG 173
H Y ++ + W N++C ++ + + +GFL + PL ++EFG D G
Sbjct: 340 HLYSWSGIGTLKLKEFWSKQPLNRICAENIEGLDQSAGFLTSGKNAVPLIITEFGFDQTG 399
Query: 174 NNVNDNRYLNCF--FGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNS 231
++V DNR+L C + V +LD+ E + L+E +G+ D W +R
Sbjct: 400 SSVEDNRFLTCLQTYLVGRDLDF--------------EDKLQLDESFGVVDATWHKLRYP 445
Query: 232 SFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTP 291
+F ++ LQ + P + + ++YHP TG C Q + L +G C W Y
Sbjct: 446 NFADKFQLLQRKNQDPTSKVSEAY-IMYHPLTGQCGQVND-KNELEIGSCENQTRWIYNG 503
Query: 292 HKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVD 351
+ I L + CL A G P + + S+W+ S S++HL++ NG +CL D
Sbjct: 504 SQ-ILLNDSKKCLTAIGEGLPVAISDDYENKNSSWKSESLSRLHLATVDQNGKHLCLHKD 562
Query: 352 -SSNTIVTNTCKCLSRDKTC--DPASQWFKLV 380
+S+ +VT+ C C++ D C DP SQWF+LV
Sbjct: 563 YNSSFVVTSKCICINDDSLCLDDPQSQWFQLV 594
>gi|449467349|ref|XP_004151386.1| PREDICTED: uncharacterized protein LOC101207450 [Cucumis sativus]
Length = 364
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 139/353 (39%), Positives = 203/353 (57%), Gaps = 13/353 (3%)
Query: 39 TKMATIFNGVRNVVGMSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFD 98
T + T+ NG + VVGMSLRNELRG +N DW Y+ G +H NP VLVI+SGLN+D
Sbjct: 8 TAILTLKNGYK-VVGMSLRNELRGMMENANDWNNYVTQGVTTIHKINPAVLVIVSGLNYD 66
Query: 99 KDLSFVRNQAVNL-TFTGKLVFEAHWYGFT-DGQAWVDGNP-NQVCGRVVDNVMRLSGFL 155
DL ++++ +N+ T KL FE H Y F+ D ++ P N +C +++ + + F+
Sbjct: 67 NDLRCLKDKPLNVSTLDNKLAFEVHLYSFSGDSESKFVQQPLNNICAKIMHEFIDHAEFV 126
Query: 156 LE--QGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIG 213
+E +PLFVSE+G D R + +NR+++CF A+ D DWALWT GSYY REG
Sbjct: 127 IEGPNPFPLFVSEYGYDQREVDDAENRFMSCFTAHLAQKDLDWALWTWQGSYYYREGQAE 186
Query: 214 LNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFL 273
L E +G+ D NW I+N +F+++ LQ+ + P VIYH +G C++ +
Sbjct: 187 LAETFGVLDSNWTQIKNPNFVQKFQLLQTMLQDP-YSNASFSYVIYHVQSGQCIEVSNDN 245
Query: 274 DPLTLGPCTESEAWSYTPHKT-ISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDS 332
+ L C+ S WS+ T I + CL+A G A L C S W IS+S
Sbjct: 246 KEIFLTNCSTSSRWSHDNDSTPIKMSSTGLCLKASGEGLEASLSTDCIGKQSLWSAISNS 305
Query: 333 KMHLSSKADNGTTVCLD-VDSSNT--IVTNTCKCLSRDKTC--DPASQWFKLV 380
+HL + ++G ++CL ++SSN+ IVTN+C C + D TC D SQWF+LV
Sbjct: 306 NLHLGTVTEDGKSLCLQIIESSNSSKIVTNSCICTTNDPTCLQDTQSQWFELV 358
>gi|125548742|gb|EAY94564.1| hypothetical protein OsI_16342 [Oryza sativa Indica Group]
Length = 326
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 110/174 (63%), Positives = 129/174 (74%)
Query: 73 YMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAW 132
YMQLGAE VHAANP VLVILSGLNFD L F+ + LTFTGKLVFE HWYGF+D W
Sbjct: 142 YMQLGAEVVHAANPGVLVILSGLNFDNTLDFLVPNQIQLTFTGKLVFEQHWYGFSDDGNW 201
Query: 133 VDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAEL 192
N N CG VVD++ + FLL+QGWPLF SEFG D+ G +V DNRYL CF VAAE+
Sbjct: 202 GSQNQNDACGMVVDSIKKKGLFLLQQGWPLFFSEFGFDMSGTHVADNRYLTCFLTVAAEM 261
Query: 193 DWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSLQSPFRG 246
D DWA+W L GSYY+REG + +E YGL W+WC RN SF++RI+SLQSPF+G
Sbjct: 262 DLDWAIWALQGSYYIREGTLAYDESYGLLSWDWCTARNPSFIKRINSLQSPFQG 315
>gi|449488685|ref|XP_004158142.1| PREDICTED: uncharacterized LOC101213113 [Cucumis sativus]
Length = 400
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 119/251 (47%), Positives = 159/251 (63%), Gaps = 5/251 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVI DNHIS+P WCCS DGNGFFGD+YF+ + W++GL +A F VV MSLRNEL
Sbjct: 144 MVIADNHISQPRWCCSLEDGNGFFGDRYFDTEEWLEGLRLVARRFYNKSAVVAMSLRNEL 203
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL-TFTGKLVF 119
RG KDW +Y+ GA +H NP++LVI+SGLNFD DL R + L KLVF
Sbjct: 204 RGASSKSKDWNKYITQGATTIHNINPKILVIISGLNFDNDLRCQRQYPLQLNNLHNKLVF 263
Query: 120 EAHWYGFT-DGQAWVDGNP-NQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFGADLRGNN 175
E H Y F+ + Q+ NP N++C +V++ + + F++E + PLFVSEFG D RG N
Sbjct: 264 EVHLYSFSGESQSKFIHNPLNKICSKVINGFVERAEFVMEGAEAVPLFVSEFGLDQRGVN 323
Query: 176 VNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
D+R+L+CF E D DWALW GSYY R+G +G E +G+ ++NW D+RN F +
Sbjct: 324 EADDRFLSCFSAHLVEKDLDWALWGWQGSYYYRQGKVGPEEVFGVLNYNWSDVRNPHFSQ 383
Query: 236 RISSLQSPFRG 246
LQ+ +G
Sbjct: 384 MFQLLQTMLQG 394
>gi|449467699|ref|XP_004151560.1| PREDICTED: uncharacterized protein LOC101211788 [Cucumis sativus]
Length = 341
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 190/339 (56%), Gaps = 11/339 (3%)
Query: 54 MSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL-T 112
MSLRNE+RG +N DW +Y+ G +H N EVLVI+SGLN+D DL ++ + +N+ T
Sbjct: 1 MSLRNEIRGFMENANDWNKYITQGVTTIHNINSEVLVIVSGLNYDNDLRCLKEKPLNVGT 60
Query: 113 FTGKLVFEAHWYGFT-DGQAWVDGNP-NQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFG 168
KLVFE H Y F+ D ++ P N +C +++ + +GF+++ +PLFVSE+G
Sbjct: 61 LDNKLVFEVHLYSFSGDSESKFVKQPLNNICANIMNGFIDHAGFVMQGPNPFPLFVSEYG 120
Query: 169 ADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDI 228
D R N +NR+++CF A+ D DWALW GSYY REG E +G+ D NW I
Sbjct: 121 YDQREVNDAENRFMSCFTAHLAQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQI 180
Query: 229 RNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWS 288
+N +F+++ LQ+ P VIYHP + C+Q + + L C+ WS
Sbjct: 181 KNPNFVQKFQLLQTMLHDPNS-NASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWS 239
Query: 289 YTPHKT-ISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVC 347
+ T I + L+A G A L S W IS+SK+HL++ G ++C
Sbjct: 240 HNNDGTPIEMSSTGLYLKASGEGLEASLSTDTLSQQSVWSAISNSKLHLATFTQGGKSLC 299
Query: 348 LDVDSSNT--IVTNTCKCLSRDKTC--DPASQWFKLVDS 382
L +DSSN+ +VTN+C C + D C D SQWF+LV++
Sbjct: 300 LQMDSSNSSKVVTNSCICTNGDPNCLQDTRSQWFELVET 338
>gi|357502159|ref|XP_003621368.1| Endoglucanase [Medicago truncatula]
gi|355496383|gb|AES77586.1| Endoglucanase [Medicago truncatula]
Length = 457
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 128/333 (38%), Positives = 177/333 (53%), Gaps = 28/333 (8%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV+ DNH+S P WCC N+DGNG FGDQYFN + W++GL+ +A G +V + LRNEL
Sbjct: 143 MVLADNHVSDPKWCCDNNDGNGCFGDQYFNLEEWLQGLSNVANRVKGKPQIVAVGLRNEL 202
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP QN +WY+YM G VH ANP VLV +SGLN+D DLSF++ + +N+ KLV+E
Sbjct: 203 RGPGQNNDNWYKYMSQGVTTVHKANPNVLVFVSGLNYDTDLSFLKTKPLNVNIGDKLVYE 262
Query: 121 AHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNR 180
H Y G NPN PL ++EFG D+ + + R
Sbjct: 263 VHSYA---GFLMSGSNPN----------------------PLVMTEFGMDMENIDDQNQR 297
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSL 240
YL+C +D DWALW GSYY+RE + E+YGL+ ++ +R F +R L
Sbjct: 298 YLSCILAYLGGVDLDWALWAAQGSYYIREKENIVREHYGLWSIDFSSLRYQEFPQRFQLL 357
Query: 241 QSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGA 300
Q P + + +IYHP +G CV+ S + L LG C + W+ + I L G
Sbjct: 358 QKKLLEPSSNSSKSY-IIYHPLSGQCVKVNSN-NELELGHCEWASKWN-QEGQHIKLVGN 414
Query: 301 YFCLQAKHVGKPAKLGIICTDCGSTWEIISDSK 333
++A G KL C S W+ +S S+
Sbjct: 415 GTYIEANSDGSKVKLSKDCKSKQSFWKTLSTSQ 447
>gi|449523495|ref|XP_004168759.1| PREDICTED: uncharacterized protein LOC101223591 [Cucumis sativus]
Length = 522
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 140/367 (38%), Positives = 193/367 (52%), Gaps = 37/367 (10%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVI DNH+S+P WCCS DGNGFFG+ F+P W++GL+ +A F VVGMSLRNE+
Sbjct: 88 MVIADNHMSQPRWCCSLDDGNGFFGNNNFDPQEWLQGLSLVAQRFRNKSTVVGMSLRNEI 147
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL-TFTGKLVF 119
RG +N DW +Y+ G +H N EVLVI+SGLN+D DL ++ + +N+ T KLVF
Sbjct: 148 RGFMENANDWNKYITQGVTTIHNINSEVLVIVSGLNYDNDLRCLKEKPLNVSTLDNKLVF 207
Query: 120 EAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDN 179
E H Y F+ D N E+G D R N +N
Sbjct: 208 EVHLYSFSG-----DSESN---------------------------EYGYDQREVNDAEN 235
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISS 239
R+++CF A+ D DWALW GSYY REG E +G+ D NW I+N +F+++
Sbjct: 236 RFMSCFTAHLAQRDLDWALWAWQGSYYFREGQAEPGESFGVLDSNWTQIKNPNFVQKFQL 295
Query: 240 LQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKT-ISLK 298
LQ+ + P VIYHP + C+Q + + L C+ WS+ T I +
Sbjct: 296 LQTMLQDPNS-NASFSYVIYHPQSSQCIQVSNDNKEIFLTNCSTPTRWSHNNDGTPIEMS 354
Query: 299 GAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNT--I 356
L+A G A L S W IS+SK+HL++ G +CL +DSSN+ +
Sbjct: 355 STGLYLKASGEGLEASLSSDTLSQQSVWSAISNSKLHLATFTQGGKNLCLQIDSSNSSKV 414
Query: 357 VTNTCKC 363
VTN+C C
Sbjct: 415 VTNSCIC 421
>gi|357502195|ref|XP_003621386.1| Endoglucanase E1 [Medicago truncatula]
gi|355496401|gb|AES77604.1| Endoglucanase E1 [Medicago truncatula]
Length = 416
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 108/250 (43%), Positives = 154/250 (61%), Gaps = 4/250 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV++DNH+S P WCC N DGNGFFGDQY +P W++GL+ +A G VV + LRNEL
Sbjct: 142 MVLVDNHVSDPKWCCHNKDGNGFFGDQYSDPKEWLQGLSNVANRVKGKSQVVAVGLRNEL 201
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
RGP QN+ +W +YM GA VH NP VLV +SG N+D DLSF++ +N + KLV+E
Sbjct: 202 RGPGQNIDNWQKYMSQGATTVHKENPNVLVFVSGFNYDTDLSFLKTNPLNTSIGDKLVYE 261
Query: 121 AHWYGFTDG--QAWVDGNPNQVCGRVVDNVMRLSGFLL--EQGWPLFVSEFGADLRGNNV 176
H Y ++ G + W+ NQ C V++ + +GFL+ PL +SEFG D+ +
Sbjct: 262 VHSYAWSTGSPKDWIVKPLNQKCANVMNGLNDRAGFLMSGSNPNPLVMSEFGLDMTDMDD 321
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
+ R+L+C A +D DWALWT GSYY+RE ++E+YGL++ ++ +R F +R
Sbjct: 322 KNQRFLSCMLAYLAGVDLDWALWTAQGSYYIREKESNVSEHYGLWNIDFKSLRYPDFPQR 381
Query: 237 ISSLQSPFRG 246
+Q G
Sbjct: 382 FQLVQKKLLG 391
>gi|302143021|emb|CBI20316.3| unnamed protein product [Vitis vinifera]
Length = 168
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 106/166 (63%), Positives = 126/166 (75%), Gaps = 2/166 (1%)
Query: 230 NSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSY 289
NS+F +RIS+LQSP +GP V HK+I HP+TGLC+ R+S +PL LGPCTESEAW Y
Sbjct: 4 NSNFFQRISALQSPLQGPDVSRVRPHKIILHPSTGLCILRESLSEPLKLGPCTESEAWGY 63
Query: 290 TPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLD 349
TP K + +KG YFCLQA +GKPAKL IICT GS WEIISDSKM+LS+K +GTTVCLD
Sbjct: 64 TPQKILIVKGTYFCLQAVGLGKPAKLSIICTQPGSNWEIISDSKMYLSTKLGDGTTVCLD 123
Query: 350 VDSSNTIVTNTCKCLSRDKTCDPASQWFKLVDSTRSSTTTKSFFQF 395
VDSS+ IVT+ CKCL RD CDP SQWFK+VDST + T + Q
Sbjct: 124 VDSSSNIVTDACKCLGRDDMCDPGSQWFKVVDST--NITRRPILQI 167
>gi|168029996|ref|XP_001767510.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681216|gb|EDQ67645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 359
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 115/240 (47%), Positives = 147/240 (61%), Gaps = 6/240 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKP WCCS++DGNGF+GD+YF+ + W+ GLT +AT F+ VV MSLRN L
Sbjct: 120 MVILDNHVSKPKWCCSDTDGNGFWGDEYFDVETWMLGLTTVATTFSSNPFVVAMSLRNAL 179
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL---TFTGKL 117
RGP+QN DW M AEAVH NP+VL+I GL+F DLSF+ N +L F KL
Sbjct: 180 RGPRQNAIDWRVLMAQAAEAVHDVNPDVLIIAGGLSFATDLSFLNNNDNHLDTSRFPNKL 239
Query: 118 VFEAHWYGFT---DGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGN 174
V+E HWY ++ G + + + C NV +GFLLEQG P+ VSEFG +L
Sbjct: 240 VYEFHWYSWSRLARGSNFYNASDPSACSNTQRNVNLDNGFLLEQGTPVIVSEFGINLEST 299
Query: 175 NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFL 234
N +NR+L+C D DWA W L GSYYLR G +E +GL W RN + +
Sbjct: 300 NFAENRFLDCVIDYLERGDIDWAFWALQGSYYLRYGREDDDEVFGLLQSTWRSFRNPTIV 359
>gi|449527487|ref|XP_004170742.1| PREDICTED: uncharacterized protein LOC101225189, partial [Cucumis
sativus]
Length = 337
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/339 (36%), Positives = 186/339 (54%), Gaps = 16/339 (4%)
Query: 51 VVGMSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
VVGMSLRNELRG NV W +Y++LG+ +H NP +LV++SGLN+D DLS+++ + +
Sbjct: 1 VVGMSLRNELRG-SNNVGVWRKYVKLGSHLIHRINPRLLVVISGLNYDNDLSYLKKKPLG 59
Query: 111 LTFTGKLVFEAHWYGFT--DGQAWVDGNPNQVCGRVVDNVMRLSGFL--LEQGWPLFVSE 166
K+V EAH Y F+ +V N C +V+D R +GF+ ++ +PLF+SE
Sbjct: 60 YNLNNKVVLEAHLYSFSGEPESKFVKKPLNIACNQVMDKFEREAGFVVDMKDPYPLFLSE 119
Query: 167 FGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWC 226
FG DLRG N NR+++CF D DWA W GSY R+G ++E +G+ D +W
Sbjct: 120 FGYDLRGGNKAQNRFMSCFLARIIGKDIDWAYWAFQGSYMYRQGQQDVDESFGIMDSSWT 179
Query: 227 DIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLD---PLTLGPCTE 283
R S L+++ L + + ++ HP +G CV+ LD + LG C
Sbjct: 180 KDR-SPRLQQMLQLAKRINQDPNSKGPMSYIMLHPVSGQCVK----LDGKGGIELGDCET 234
Query: 284 SEAWSYTPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNG 343
W +T + CL++ GKP + C+ GS+W + S +K+ LS+K+ G
Sbjct: 235 PTLWDHTGDGSPMKLWNGQCLKSAGDGKPPVVSAECSGDGSSWTVASKAKLQLSTKS-GG 293
Query: 344 TTVCLDVDSSNTIVTNTCKCLSRDKTC--DPASQWFKLV 380
+CL+ +S +IV C CL + C DP SQWFKLV
Sbjct: 294 ENICLEKESDTSIVVKKCICLKDEWNCFDDPQSQWFKLV 332
>gi|302143019|emb|CBI20314.3| unnamed protein product [Vitis vinifera]
Length = 168
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 104/160 (65%), Positives = 124/160 (77%), Gaps = 4/160 (2%)
Query: 224 NWCDIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPLTLGPCTE 283
NW NSSF +RIS+LQSP +GP V HK+I+HP+TGLC+ R+S +PL LGPCT+
Sbjct: 2 NW----NSSFFQRISALQSPLQGPDVSRVRRHKIIFHPSTGLCILRESGSEPLKLGPCTK 57
Query: 284 SEAWSYTPHKTISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEIISDSKMHLSSKADNG 343
SEAW YTP K +++KG YFCLQA +GKPAKL IICT GS WE ISDSKM+LS+K +G
Sbjct: 58 SEAWGYTPQKLLTVKGTYFCLQAVGLGKPAKLSIICTKPGSNWENISDSKMYLSTKLGDG 117
Query: 344 TTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQWFKLVDST 383
T VCLDVDSS+TIVT+ CKCL R CDP SQWFK+VDST
Sbjct: 118 TRVCLDVDSSSTIVTDACKCLGRGDMCDPGSQWFKVVDST 157
>gi|168033792|ref|XP_001769398.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679318|gb|EDQ65767.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 359
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 106/241 (43%), Positives = 147/241 (60%), Gaps = 4/241 (1%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKP WCC+++DGNGF+GD++F+ + W+ GLT +A+ F+ VV MSLRNEL
Sbjct: 119 MVILDNHVSKPQWCCASNDGNGFWGDEFFDAETWLMGLTTVASEFSNNPFVVAMSLRNEL 178
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT-FTGKLVF 119
RG +QN W M A+AVH NP +L+I GLNF DLSF+ A+ + F KL++
Sbjct: 179 RGSRQNSYLWRVLMSTAAQAVHDVNPNILIIAGGLNFGTDLSFLNKGALETSRFPNKLMY 238
Query: 120 EAHWYG---FTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNV 176
E HWY G + + + C NV +GFLLE+ PL +SEFG +L N+
Sbjct: 239 EFHWYKTSRLARGGNFNNASDQNACAISQSNVHADNGFLLEKNAPLLLSEFGINLDSTNI 298
Query: 177 NDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
++ ++L+C D DWA W L GSYYLR+ V +E YGL + W RN + + R
Sbjct: 299 SETQFLDCVIDYLERADLDWAFWALQGSYYLRDAVRDADEVYGLLESTWRSFRNPTVVSR 358
Query: 237 I 237
+
Sbjct: 359 L 359
>gi|359474379|ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
vinifera]
Length = 596
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 105/203 (51%), Positives = 134/203 (66%), Gaps = 12/203 (5%)
Query: 203 GSYYLREGVIGLNEYYGLFDWNWCDIRNSSF---------LERISSLQSPFRGPGVFETG 253
GS LR+ V +NE + ++ D+R S F LE+ + ++ GP
Sbjct: 95 GSADLRKAV-HVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLTGPDAPHAR 153
Query: 254 LHKVIYHPATGLCVQRKSFLDPLTLGPCTESEAWSYTPHKTISLKGAYFCLQAKHVGKPA 313
+K++ HP+TGLCV R S +PL LGPCTESEAW YTP K + +KG Y CLQA +GKPA
Sbjct: 154 PYKMVLHPSTGLCVLRGSLSEPLKLGPCTESEAWGYTPQKILIIKGTYLCLQAVGLGKPA 213
Query: 314 KLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPA 373
KL +IC++ GS WE ISDSKM+LS+K NGTTVCLDVDSS+ IVTN+CKCLSRD CDP
Sbjct: 214 KLSVICSNPGSNWESISDSKMYLSTKLGNGTTVCLDVDSSDDIVTNSCKCLSRDDMCDPG 273
Query: 374 SQWFKLVDSTRSSTTTKSFFQFN 396
SQWFK+ +ST T+K + N
Sbjct: 274 SQWFKIANST--DITSKPILRIN 294
>gi|218191090|gb|EEC73517.1| hypothetical protein OsI_07900 [Oryza sativa Indica Group]
Length = 277
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 89/143 (62%), Positives = 110/143 (76%), Gaps = 4/143 (2%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+S+PGW C++ DGNGFFGD++F PD W++GL MA +F GV NVVGMSLRNEL
Sbjct: 112 MVILDNHVSRPGWSCADDDGNGFFGDRHFVPDAWVRGLGAMAALFAGVPNVVGMSLRNEL 171
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFT----GK 116
RGP+QN DWYRYMQ+GAEAVHAANP LVI+ GL +D DLSF+ + V+++F GK
Sbjct: 172 RGPRQNADDWYRYMQMGAEAVHAANPAALVIMGGLGYDTDLSFLAARPVDVSFAAAERGK 231
Query: 117 LVFEAHWYGFTDGQAWVDGNPNQ 139
LVFE HW F D +AW + N+
Sbjct: 232 LVFERHWNSFADARAWESEDANR 254
>gi|302143020|emb|CBI20315.3| unnamed protein product [Vitis vinifera]
Length = 147
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 89/130 (68%), Positives = 105/130 (80%), Gaps = 5/130 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILD+H S+P + NG FGDQ+FNPDLW+KGLT++AT+F+GV NVVGMSLRNEL
Sbjct: 23 MVILDSHFSEPSF-----HDNGVFGDQHFNPDLWVKGLTRIATMFSGVPNVVGMSLRNEL 77
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
R P QNVKDWYRYMQ GAEAVH+ANP+VLVI+SGL+ DLSF+ NQ + LTFTGKLV E
Sbjct: 78 RCPNQNVKDWYRYMQKGAEAVHSANPDVLVIISGLSDGTDLSFLLNQQLELTFTGKLVLE 137
Query: 121 AHWYGFTDGQ 130
HW+G DG
Sbjct: 138 MHWHGSRDGH 147
>gi|302143022|emb|CBI20317.3| unnamed protein product [Vitis vinifera]
Length = 147
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 89/130 (68%), Positives = 105/130 (80%), Gaps = 5/130 (3%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILD+H S+P + GNG FGDQ+FNPDLW+KGLT++AT+F+GV NVVGMSLRNEL
Sbjct: 23 MVILDSHFSEPSF-----HGNGVFGDQHFNPDLWVKGLTRIATMFSGVPNVVGMSLRNEL 77
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFE 120
R P QNVKDWYRYMQ GAEAVH+ANP+VLVI+SGL+ DLSF+ NQ + LTFTGKLV E
Sbjct: 78 RCPNQNVKDWYRYMQKGAEAVHSANPDVLVIISGLSDGTDLSFLLNQQLELTFTGKLVLE 137
Query: 121 AHWYGFTDGQ 130
HW+G G
Sbjct: 138 MHWHGSRGGH 147
>gi|322709424|gb|EFZ01000.1| cellulase family protein [Metarhizium anisopliae ARSEF 23]
Length = 417
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 106/270 (39%), Positives = 137/270 (50%), Gaps = 28/270 (10%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+SK WCC SDGNG++ D +YFN W+ GL MA +
Sbjct: 153 MTILDNHVSKASWCCDLSDGNGWWNDANFYVAATSRYFNTQKWLNGLKSMARWSASRPGI 212
Query: 52 VGMSLRNELRGPKQNV----KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VGMSLRNELR + W +YM + VHA NP +LVI+ G+N DLS +RN
Sbjct: 213 VGMSLRNELRAHVAQIPWAPSTWLKYMPRAGDVVHAENPRLLVIVGGINGGTDLSPLRNG 272
Query: 108 AVNL-TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG----WPL 162
A+ L + K V+EAH Y FT V C L GF+LEQ PL
Sbjct: 273 AMKLGNWADKRVWEAHAYSFT-----VVTPSLGSCDIRKAEFGGLFGFVLEQNKASTGPL 327
Query: 163 FVSEFGADLRGN-----NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
F+SEFG + G + DN YL C G D DWA W + GSYY+R+ + NE
Sbjct: 328 FLSEFGVGMTGGPHDGLSDQDNDYLTCLVGYMENNDADWAHWAVQGSYYVRDKTVDYNET 387
Query: 218 YGLFDWNWCDIRNSSFLERISSLQSPFRGP 247
+G D+ W D RN F + ++ + +GP
Sbjct: 388 WGALDYEWSDWRNPKFKGMLGNMFAVTQGP 417
>gi|322693763|gb|EFY85612.1| cellulase family protein [Metarhizium acridum CQMa 102]
Length = 414
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 106/270 (39%), Positives = 136/270 (50%), Gaps = 28/270 (10%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+SK WCC SDGNG++ D +YF+ W+ GL MA +
Sbjct: 150 MTILDNHVSKASWCCDLSDGNGWWDDAKYYAAATSRYFDTQKWLDGLKAMARWSASRSGI 209
Query: 52 VGMSLRNELRGPKQNV----KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VGMSLRNELR + W YM + VHA NP +LVI+ G+N DLS +RN
Sbjct: 210 VGMSLRNELRAHVTQIPWAPSTWLEYMPRAGDVVHAENPRLLVIVGGINGGTDLSPLRND 269
Query: 108 AVNL-TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG----WPL 162
A+ L + KLV+EAH Y FT V C L GF+LEQ PL
Sbjct: 270 AMKLGNWADKLVWEAHAYSFT-----VVTPSLGSCDIRKAEYGGLFGFVLEQDKPSTGPL 324
Query: 163 FVSEFGADLRGN-----NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
F+SEFG + G + DN YL C G D DWA W + GSYY+R + NE
Sbjct: 325 FLSEFGVGMTGGPHDGLSDEDNDYLTCLVGYMESNDADWAHWAVQGSYYVRGKTVDYNET 384
Query: 218 YGLFDWNWCDIRNSSFLERISSLQSPFRGP 247
+G D+ W D RN F + ++ + +GP
Sbjct: 385 WGALDYEWSDWRNPKFKGMLGNMFAVTQGP 414
>gi|393226861|gb|EJD34572.1| glycosyl hydrolase family 5 protein/cellulase [Auricularia delicata
TFB-10046 SS5]
Length = 402
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 96/265 (36%), Positives = 144/265 (54%), Gaps = 26/265 (9%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
+V+LDNH S+ GWCCS SDGNG++ QYF+ + W+ GL MAT+ NV
Sbjct: 146 LVMLDNHNSRAGWCCSTSDGNGWWASASGYNAANSQYFDVNNWLNGLRAMATLAKSHSNV 205
Query: 52 VGMSLRNELRG----PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VG++LRNELR N DWY ++ G AVH+AN ++L+++ G+++ D SF+ +
Sbjct: 206 VGLALRNELRAVGSQDGNNHADWYNFVGQGLNAVHSANADLLIVVGGVSYATDFSFLGSN 265
Query: 108 AVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGW----PL 162
+N T F K+V+E H Y +T C R + + +G+LL Q PL
Sbjct: 266 PLNRTAFGNKVVYEFHNYQWTF--------STNDCNRHKSLLGQRAGYLLTQNQAYTGPL 317
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++SEFG + ++N Y +C D +WA W L+GSYY+R + +E +GL +
Sbjct: 318 WLSEFGWAQSNPSADENAYASCLVSYMQSNDAEWAYWALMGSYYVRSQSVNFDEGFGLLN 377
Query: 223 WNWCDIRNSSFLERISSLQSPFRGP 247
+W RN SFL I + +GP
Sbjct: 378 HDWSGWRNGSFLNTIGKMMQVTQGP 402
>gi|299750425|ref|XP_001836746.2| glycosyl hydrolase family 5 protein/cellulase [Coprinopsis cinerea
okayama7#130]
gi|298408897|gb|EAU84963.2| glycosyl hydrolase family 5 protein/cellulase [Coprinopsis cinerea
okayama7#130]
Length = 452
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 97/269 (36%), Positives = 146/269 (54%), Gaps = 34/269 (12%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
+V+LDNH+S+ WCCS +DGNG++ +YF+ + W+ GL MA+ + NV
Sbjct: 196 LVVLDNHVSRASWCCSTNDGNGWWASASGYDANNSRYFDTNNWLNGLRSMASFSHSHPNV 255
Query: 52 VGMSLRNELRG----PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VG++LRNELR + N DWY Y+ GA+AV AN +LV++ G+N+ DLSFV N+
Sbjct: 256 VGIALRNELRAVGNQDRNNHADWYNYITQGAQAVRGANSNLLVVVGGVNYALDLSFVWNR 315
Query: 108 AVNL-TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRL----SGFLLEQGW-- 160
++ F KLV+E H Y ++ G GR +N ++ GF+L Q
Sbjct: 316 MLDRNVFGDKLVWELHNYAWSGG------------GRDCNNHQKMMGDRGGFVLIQNREF 363
Query: 161 --PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYY 218
PLF+SEFG + ++ Y +C D +WA W L GSYY+RE + +E +
Sbjct: 364 TGPLFLSEFGWAQNNPSADEIAYSSCLVRYLESNDAEWAYWALQGSYYVRERRVNFDETF 423
Query: 219 GLFDWNWCDIRNSSFLERISSLQSPFRGP 247
GL + +W RN SF+ + + +GP
Sbjct: 424 GLLNHDWSGWRNGSFVRTLGKMWDVTQGP 452
>gi|346320699|gb|EGX90299.1| cellulase family protein [Cordyceps militaris CM01]
Length = 509
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/263 (37%), Positives = 138/263 (52%), Gaps = 28/263 (10%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+S+ WCC+ DGNG++ D ++F+ W+ GL MA G +
Sbjct: 243 MTILDNHVSRASWCCNLDDGNGWWSDAPGDVDANSRFFDHGRWLDGLRAMAAWARGKPGI 302
Query: 52 VGMSLRNELRGPKQNVKD----WYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
+GMSLRNELR + W M A+AVH ANP+VLVI+ GLN DL+ +R++
Sbjct: 303 IGMSLRNELRAHVTQIPAAAGVWRENMPAAAKAVHGANPDVLVIMGGLNGGTDLTPLRDR 362
Query: 108 AVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG----WPL 162
+ + + + GK V+EAH Y FT + C N L GF+LEQ PL
Sbjct: 363 SRDTSAWAGKNVWEAHAYSFT-----ITTPDFGSCSIRKVNYGLLFGFVLEQNKGNTGPL 417
Query: 163 FVSEFGADLRGN-----NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
F+SEFG + G N D+ YL C + D DW+LW L G+YY+R+G + E
Sbjct: 418 FLSEFGVGMTGGDKDGLNDKDSSYLTCLREYMEDNDADWSLWALQGTYYVRDGKVDAEET 477
Query: 218 YGLFDWNWCDIRNSSFLERISSL 240
+G D +W D RN F + +
Sbjct: 478 WGALDHDWKDWRNPKFKAMLGKM 500
>gi|400598879|gb|EJP66586.1| cellulase-like protein [Beauveria bassiana ARSEF 2860]
Length = 425
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 105/266 (39%), Positives = 138/266 (51%), Gaps = 38/266 (14%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNG---- 47
+ ILDNH+S+ WCC DGNG++ D +YF+ D W++GL MA G
Sbjct: 151 VTILDNHLSRASWCCDLDDGNGWWADAPGYLAANSRYFDRDEWLRGLRAMAAWVRGDGSS 210
Query: 48 ------VRNVVGMSLRNELRGPKQNVKD----WYRYMQLGAEAVHAANPEVLVILSGLNF 97
+VGMSLRNELR + W M++ A AVHA NP+VLVI+ GLN
Sbjct: 211 SSSSSSRPGIVGMSLRNELRAHVTQLPAAAGVWRENMRVAAAAVHATNPDVLVIVGGLNG 270
Query: 98 DKDLSFVRNQAVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLL 156
DL+ +R Q ++ T + K V+EAH Y FT V C N L GF+L
Sbjct: 271 GTDLTPLRGQTLDTTAWAAKNVWEAHAYSFTLTTPDVGS-----CSVRKANYGLLFGFVL 325
Query: 157 EQG----WPLFVSEFGADLRGN-----NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
EQG PLF+SEFG + G N D+RYL C D DW+LW L G+YY+
Sbjct: 326 EQGKDSTGPLFLSEFGVGMTGGDEDGLNDKDSRYLACLREYMEGNDADWSLWALQGTYYV 385
Query: 208 REGVIGLNEYYGLFDWNWCDIRNSSF 233
R+G + E +G+ D +W D RN F
Sbjct: 386 RDGKVDAEETWGVLDNDWKDWRNPKF 411
>gi|212530076|ref|XP_002145195.1| cellulase family protein [Talaromyces marneffei ATCC 18224]
gi|210074593|gb|EEA28680.1| cellulase family protein [Talaromyces marneffei ATCC 18224]
Length = 409
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 102/270 (37%), Positives = 135/270 (50%), Gaps = 28/270 (10%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFF---------GDQYFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+SK WCC+ +DGNG++ +YFN W+ GL MAT V
Sbjct: 145 MTILDNHVSKASWCCNLTDGNGWWDTATGYIASNSRYFNTTEWLSGLDAMATFALDHPGV 204
Query: 52 VGMSLRNELRG----PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VGMSLRNELR DWY Y+ GA VH ANP VLVI+ G DLSF++
Sbjct: 205 VGMSLRNELRPFPILQDLTHSDWYNYVTQGARTVHKANPNVLVIIGGSQSATDLSFIKTS 264
Query: 108 AVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG----WPL 162
++ + + GK V+E H Y FT V N C GFLL Q PL
Sbjct: 265 NLDFSHWAGKHVWEFHAYSFT-----VTFPGNTDCTVADAEYGFFDGFLLTQNESYTAPL 319
Query: 163 FVSEFGADLRGN-----NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
+SEFG G + D YL C D +WA+W + GSYY+R+G + +E
Sbjct: 320 ILSEFGVGQSGGPNSGLSDKDYDYLQCLVKYMESNDAEWAVWAVQGSYYIRDGTVDYDEA 379
Query: 218 YGLFDWNWCDIRNSSFLERISSLQSPFRGP 247
+GL + +W RN +FL + + + +GP
Sbjct: 380 WGLLNHDWSGSRNGNFLGMLGQMWNVTQGP 409
>gi|116191715|ref|XP_001221670.1| hypothetical protein CHGG_05575 [Chaetomium globosum CBS 148.51]
gi|88181488|gb|EAQ88956.1| hypothetical protein CHGG_05575 [Chaetomium globosum CBS 148.51]
Length = 424
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 99/274 (36%), Positives = 139/274 (50%), Gaps = 31/274 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
+ ILDNH+S+ WCC+ +DGNG++ + ++FN W+ GL MA G V
Sbjct: 155 LTILDNHVSRAAWCCNLTDGNGWWDEGFGYNTWNSRFFNTQQWLAGLEAMAAWGKGHAGV 214
Query: 52 VGMSLRNELR----GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VG LRNE+R DWY Y++ G + VHAA+P+ LV++ G+ DL +R+
Sbjct: 215 VGYGLRNEIREFLLQGLNGRADWYTYVKKGGDVVHAAHPDALVLVGGVQSTTDLLHLRSG 274
Query: 108 AVNLT-----FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG--- 159
+ GK V+E H Y FT + D N C V GF+LEQG
Sbjct: 275 EAEAMLDTSGWAGKHVWEMHAYSFT--VTFPDAFNN--CDIVKAQYGAFVGFVLEQGKPY 330
Query: 160 -WPLFVSEFGADLRGNNVN-----DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIG 213
PL +SEFG + G + DNRYLNC D DWA+W + GSYY+R+G
Sbjct: 331 TGPLILSEFGVGMEGGEFDGLSEKDNRYLNCLVSYMQSNDADWAVWAIQGSYYVRDGKTD 390
Query: 214 LNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGP 247
+E +GL D NW D RN +F ++ + + P
Sbjct: 391 YDETWGLMDRNWTDWRNPAFSGKLGDMWKMTQQP 424
>gi|380495036|emb|CCF32701.1| cellulase [Colletotrichum higginsianum]
Length = 416
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 99/271 (36%), Positives = 144/271 (53%), Gaps = 30/271 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
M +LDNH+S+ WCC+ DGNG++ D ++F+ W+ GL +++ V
Sbjct: 152 MTVLDNHVSRASWCCNLDDGNGWWKDARFYWAANSRFFDTGDWLAGLQQVSFWARTRPGV 211
Query: 52 VGMSLRNELRGPKQNV----KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
+SLRNELR + WY Y+ GA+AVH ANP+VLV++ GLN D + +R +
Sbjct: 212 AAISLRNELRATWTQIPFAADQWYGYIARGAKAVHEANPDVLVVIGGLNSATDFTPLRTR 271
Query: 108 AVN-LTFTGKLVFEAHWYGFTDGQAWVDGNPN-QVCGRVVDNVMRLSGFLLEQG----WP 161
+++ + GK V+EAH Y FT PN CG L GF+LEQG P
Sbjct: 272 SLDTAAWRGKNVWEAHSYSFTVT------TPNFGDCGVERAQYGALFGFVLEQGKGYTGP 325
Query: 162 LFVSEFGADLRGNNVN-----DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNE 216
LF+SEFG + G N ++ YL C G E D DWALW + G+YY+R V+ +E
Sbjct: 326 LFMSEFGVAMSGGPENGLSAEEHAYLTCLVGYLEENDADWALWAIQGTYYVRNKVVNFDE 385
Query: 217 YYGLFDWNWCDIRNSSFLERISSLQSPFRGP 247
+G D W RN +F + ++ + +GP
Sbjct: 386 TWGAMDREWRGWRNPAFKSMLGNIFAVTQGP 416
>gi|367051100|ref|XP_003655929.1| glycoside hydrolase family 5 protein [Thielavia terrestris NRRL
8126]
gi|347003193|gb|AEO69593.1| glycoside hydrolase family 5 protein [Thielavia terrestris NRRL
8126]
Length = 420
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 99/259 (38%), Positives = 137/259 (52%), Gaps = 30/259 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGF---------FGDQYFNPDLWIKGLTKMAT-IFNGVRN 50
M ILDNH+SK WCC+ +DGNG+ F +YF+ W+ GL MA+ +
Sbjct: 152 MTILDNHVSKASWCCNLTDGNGWWDTAFGYNSFNSRYFHTADWLAGLQAMASWAASSHPG 211
Query: 51 VVGMSLRNELRG----PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN 106
VV MSLRNELR DWY+Y+Q + VHA +P+VLVI+ G+ DL+ +R+
Sbjct: 212 VVAMSLRNELRAFLLQGLNGRADWYKYVQQAGDLVHATHPDVLVIVGGVQSATDLTHLRD 271
Query: 107 QAVNLT---FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG---- 159
L + GK V+E H Y FT +P Q C V GF+LEQG
Sbjct: 272 GVGMLDTSGWAGKHVWEMHAYSFTV----TFPDPFQSCDLVQAEYGAFVGFVLEQGKPYT 327
Query: 160 WPLFVSEFGADLRGN-----NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGL 214
PL +SEFG ++G + D+RYL+C D DWA+W + GSYY+R+ +
Sbjct: 328 GPLILSEFGVGMQGGPNDGLSDQDSRYLSCLVSYMQNNDADWAVWAVQGSYYVRDRTVDA 387
Query: 215 NEYYGLFDWNWCDIRNSSF 233
+E +GL +W D RN +F
Sbjct: 388 DESWGLLTHDWSDWRNPAF 406
>gi|310798401|gb|EFQ33294.1| cellulase [Glomerella graminicola M1.001]
Length = 416
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/257 (38%), Positives = 135/257 (52%), Gaps = 30/257 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+S+ WCC+ DGNG++ D +YF+ D W+ GL +++ V
Sbjct: 152 MTILDNHVSRASWCCNLDDGNGWWKDARFYWAANSRYFDTDEWLAGLRQVSLWAKTRPGV 211
Query: 52 VGMSLRNELRGPKQNVK----DWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
+SLRNELR + WY Y+ GA+AVH ANP+VLV++ GLN D + +R +
Sbjct: 212 AAISLRNELRATWTQIPFAAGQWYDYVARGAKAVHEANPDVLVVIGGLNSATDFTPLRTR 271
Query: 108 AVN-LTFTGKLVFEAHWYGFTDGQAWVDGNPN-QVCGRVVDNVMRLSGFLLEQG----WP 161
+++ + GK V+EAH Y FT PN C L GF+LEQG P
Sbjct: 272 SLDTAAWQGKNVWEAHSYSFTVT------TPNLGDCNIERAEYGALFGFVLEQGVGYTGP 325
Query: 162 LFVSEFGADLRGN-----NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNE 216
LF+SEFG + G + D+ YL C G D DWALW + G+YY+R V+ E
Sbjct: 326 LFMSEFGVAMSGGPEDGLSAEDHAYLTCLVGYLEGNDADWALWAIQGTYYVRNKVVDFEE 385
Query: 217 YYGLFDWNWCDIRNSSF 233
+G D W RN +F
Sbjct: 386 TWGAMDREWKGWRNPAF 402
>gi|242761487|ref|XP_002340189.1| cellulase family protein [Talaromyces stipitatus ATCC 10500]
gi|218723385|gb|EED22802.1| cellulase family protein [Talaromyces stipitatus ATCC 10500]
Length = 416
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 98/256 (38%), Positives = 130/256 (50%), Gaps = 28/256 (10%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFF---------GDQYFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+SK WCC+ +DGNG++ +YFN W+ GL MAT V
Sbjct: 145 MTILDNHVSKASWCCNLTDGNGWWDTASGYIASNSRYFNTTEWLAGLDAMATFARSHPGV 204
Query: 52 VGMSLRNELRG----PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VGMS+RNELR DWY Y+ GA VH ANP+VL+I+ G DLSF+RN
Sbjct: 205 VGMSIRNELRPFPLLQDLTHSDWYNYVTQGALTVHNANPDVLIIIGGSQSATDLSFIRNS 264
Query: 108 AVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG----WPL 162
++ + + GK V+E H Y FT V N C GFLL Q PL
Sbjct: 265 NLDFSQWAGKHVWEFHAYSFT-----VTFPGNTDCMVASAEYGLFDGFLLTQDKSYTAPL 319
Query: 163 FVSEFGADLRGNNVN-----DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
+SEFG G + + D YL C D +W++W + GSYY+R+G + +E
Sbjct: 320 LLSEFGVGQSGGSNSGLSDSDYNYLQCLVQYMESNDAEWSVWAVQGSYYIRDGTVDYDET 379
Query: 218 YGLFDWNWCDIRNSSF 233
+GL + W RN +F
Sbjct: 380 WGLLNHEWSGWRNGNF 395
>gi|367026724|ref|XP_003662646.1| glycoside hydrolase family 5 protein [Myceliophthora thermophila
ATCC 42464]
gi|347009915|gb|AEO57401.1| glycoside hydrolase family 5 protein [Myceliophthora thermophila
ATCC 42464]
Length = 427
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 99/269 (36%), Positives = 141/269 (52%), Gaps = 33/269 (12%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
+ +LDNH+S+ WCCS +DGNG++ + ++F+ W+ GL MA G V
Sbjct: 156 LTVLDNHVSRASWCCSLTDGNGWWDEAAGYNPWNSRFFDTRAWLAGLEAMAAWARGHPGV 215
Query: 52 VGMSLRNELRGPK--QNV----KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVR 105
V MSLRNELR Q+V DWY ++ VHAANP+VLV++ G+ DLS +R
Sbjct: 216 VAMSLRNELREFPLLQDVDPARPDWYAFVGRAGAVVHAANPDVLVVVGGVQSSTDLSHLR 275
Query: 106 -----NQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGW 160
+ GK V+E H Y FT V G+ C V LSGF+LEQ
Sbjct: 276 LPGGGGMLDTSAWAGKHVWEMHAYSFTVTFPDVFGS----CDVVRAEYGALSGFVLEQDR 331
Query: 161 P----LFVSEFGADLRGNNVN-----DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGV 211
P L +SEFG + G + + D+RYL C D DWA+W + GSYY+R+G
Sbjct: 332 PYTGPLILSEFGVGMEGGDKDGLSDEDDRYLRCLVSYMENNDADWAVWAVQGSYYVRDGQ 391
Query: 212 IGLNEYYGLFDWNWCDIRNSSFLERISSL 240
+ +E +GL + NW RN +F ++ ++
Sbjct: 392 VDYDETWGLMNHNWTGWRNPAFPAKLGNM 420
>gi|407921172|gb|EKG14335.1| Glycoside hydrolase family 5 [Macrophomina phaseolina MS6]
Length = 408
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 92/267 (34%), Positives = 143/267 (53%), Gaps = 26/267 (9%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFF---------GDQYFNPDLWIKGLTKMATIFNGVRNV 51
+V+LDNH+S WCC +SDGNG++ +YFN W+ GL MA NV
Sbjct: 150 LVVLDNHVSHASWCCGSSDGNGWWDQASGYNDANSRYFNTTAWLAGLAAMAEFSAAHPNV 209
Query: 52 VGMSLRNELRGP----KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VGM+LRNELR + DW +++ G A+H ANP++L+++ G ++ DLSF+ +
Sbjct: 210 VGMALRNELRATGSQDRDGHADWRKFVGQGLNAIHGANPDLLMVVGGPSYATDLSFLYSS 269
Query: 108 AVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGW----PL 162
++ + + K+V+E H Y ++ W + C + + +GFLL + PL
Sbjct: 270 PLDRSAYPDKVVWEFHNYQWS----WSYSD----CADHQTKLGQKTGFLLSENQAYTGPL 321
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++SEFG +L + + Y C D DWA W L GSYY+REG E +GL +
Sbjct: 322 WLSEFGWNLDTPSAVEIDYYTCLVQYMQGNDADWAFWALQGSYYIREGQTNFEETFGLVN 381
Query: 223 WNWCDIRNSSFLERISSLQSPFRGPGV 249
+W RN+SF + + S+ +GPG+
Sbjct: 382 KDWSAWRNASFPDILGSMFDVTQGPGI 408
>gi|429863477|gb|ELA37928.1| cellulase family protein [Colletotrichum gloeosporioides Nara gc5]
Length = 416
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 100/271 (36%), Positives = 143/271 (52%), Gaps = 30/271 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+SK WCC+ DGNG++ D +YF+ + W+ GL +++ + V
Sbjct: 152 MTILDNHVSKASWCCNLDDGNGWWKDARFYWAANSRYFDTNEWLAGLQQVSMWASTKPGV 211
Query: 52 VGMSLRNELRGPKQNV----KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
+ +S+RNELR + WY Y+ GA+AVH AN +VLVI+ GLN D S + +
Sbjct: 212 IAISIRNELRATWTQIPFAADQWYGYVTRGAKAVHEANADVLVIIGGLNSATDFSPLHTK 271
Query: 108 AVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPN-QVCGRVVDNVMRLSGFLLEQGW----P 161
++ + + GK V+EAH Y FT PN C L GF+LEQG P
Sbjct: 272 TLDTSAWKGKNVWEAHSYSFTVT------TPNFGDCNVERTEYGALFGFVLEQGEGYTGP 325
Query: 162 LFVSEFGADLRGN-----NVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNE 216
LF+SEFG + G + +D+ YL C G D DWALW + G+YY+R V+ NE
Sbjct: 326 LFLSEFGVGMSGGPEDGLSADDHAYLTCLVGYLEGNDADWALWAIQGTYYVRNKVVDFNE 385
Query: 217 YYGLFDWNWCDIRNSSFLERISSLQSPFRGP 247
+G D +W RN + ++ S +GP
Sbjct: 386 TWGAMDSDWKGWRNPLVKGLLGNIFSVTQGP 416
>gi|171687058|ref|XP_001908470.1| hypothetical protein [Podospora anserina S mat+]
gi|170943490|emb|CAP69143.1| unnamed protein product [Podospora anserina S mat+]
Length = 433
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 95/274 (34%), Positives = 141/274 (51%), Gaps = 31/274 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGF---------FGDQYFNPDLWIKGLTKMATIFNGVRNV 51
+ ILDNH+SK WCC+ +DGNG+ F ++F+ W+ L MAT V
Sbjct: 164 ITILDNHVSKASWCCNLTDGNGWWDTASGYNPFNSRFFSTSSWLSSLAFMATWAKSHPGV 223
Query: 52 VGMSLRNELRG----PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
VG+ LRNELR +DWY +Q VH AN ++L+++ G DL ++ +
Sbjct: 224 VGLGLRNELRAFLLQDLNGRRDWYANIQRAGNLVHQANKDLLILVGGAQSSTDLVHLKTR 283
Query: 108 AVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWP----L 162
++ + + GK V+E H Y FT +P + C V SGF+LEQ P L
Sbjct: 284 MLDTSGWEGKNVWEMHAYSFTV----TFPDPFKNCDLVKAGYGFWSGFVLEQDRPYTGPL 339
Query: 163 FVSEFGADLRGNNVN---------DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIG 213
+SEFG ++G+ V+ D+RYL+C G D +WA+W + G YY+REG +
Sbjct: 340 IMSEFGVGMQGSEVDSQYGGLNEQDHRYLDCLVGYLEGNDAEWAVWAIQGGYYIREGTVD 399
Query: 214 LNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGP 247
+E +GL D W RN F ++I L + +GP
Sbjct: 400 YDETWGLMDREWKGWRNERFRQKIQGLYAVTQGP 433
>gi|389639070|ref|XP_003717168.1| cellulase [Magnaporthe oryzae 70-15]
gi|351642987|gb|EHA50849.1| cellulase [Magnaporthe oryzae 70-15]
gi|440475729|gb|ELQ44392.1| glycosyl hydrolase family 5 protein / cellulase family protein
[Magnaporthe oryzae Y34]
gi|440486441|gb|ELQ66304.1| glycosyl hydrolase family 5 protein / cellulase family protein
[Magnaporthe oryzae P131]
Length = 411
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 94/267 (35%), Positives = 141/267 (52%), Gaps = 28/267 (10%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGD---------QYFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+S+ WCC+ DGNG++ + ++F + W+ GL MA+ G + V
Sbjct: 153 MTILDNHVSRASWCCNLDDGNGWWAEAPGYNDNNSRFFKTNEWLAGLQAMASWAQGHKGV 212
Query: 52 VGMSLRNELR-----GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN 106
VGM +RNE+R G +DWY + A +H+ +P+VL+++ G DLS VR+
Sbjct: 213 VGMGVRNEIREFLTQGTFNGRQDWYDQVSAAARLIHSTHPDVLILIGGTLSSTDLSHVRS 272
Query: 107 QAVNLT-FTGKLVFEAHWYGFTDGQAWVDGNPN-QVCGRVVDNVMRLSGFLLEQG----W 160
+ ++++ + K V+E H Y FT V+ +P + C + GFLLEQG
Sbjct: 273 RPLDVSEWKEKHVWEWHAYSFT-----VNFHPIIKTCWYMRQLYGAFDGFLLEQGKSYTG 327
Query: 161 PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGL 220
PL +SEFG D N DN YL+C D DWA+W L GSYY R +E +G+
Sbjct: 328 PLILSEFGFD---QNSPDNFYLDCLRDYVVGNDGDWAIWALQGSYYARNKQADFDESFGV 384
Query: 221 FDWNWCDIRNSSFLERISSLQSPFRGP 247
+ +W D RNS ++ + +GP
Sbjct: 385 LNRDWSDWRNSDVKNKLGKMWDTTQGP 411
>gi|361067931|gb|AEW08277.1| Pinus taeda anonymous locus 2_5099_01 genomic sequence
gi|383171959|gb|AFG69337.1| Pinus taeda anonymous locus 2_5099_01 genomic sequence
gi|383171961|gb|AFG69338.1| Pinus taeda anonymous locus 2_5099_01 genomic sequence
gi|383171963|gb|AFG69339.1| Pinus taeda anonymous locus 2_5099_01 genomic sequence
gi|383171965|gb|AFG69340.1| Pinus taeda anonymous locus 2_5099_01 genomic sequence
Length = 133
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 89/133 (66%), Gaps = 3/133 (2%)
Query: 43 TIFNGVRNVVGMSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS 102
T F G VVGMSLRNELRG + N DWY+YMQ GA+AVH ANP+VLVI+SGLN+D DL
Sbjct: 1 TAFKGTSAVVGMSLRNELRGKRSNPADWYKYMQQGAQAVHDANPDVLVIMSGLNYDADLK 60
Query: 103 FVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFL---LEQG 159
F+ ++ VNL+FT K+V+E HWY FTDG AW + +C V + F+ L
Sbjct: 61 FLASEPVNLSFTNKIVYEMHWYSFTDGDAWAKMPVDTLCQTVTARINDHLAFVTKTLSPP 120
Query: 160 WPLFVSEFGADLR 172
PLF+SEFG D R
Sbjct: 121 APLFISEFGIDER 133
>gi|383171957|gb|AFG69336.1| Pinus taeda anonymous locus 2_5099_01 genomic sequence
Length = 133
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 89/133 (66%), Gaps = 3/133 (2%)
Query: 43 TIFNGVRNVVGMSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS 102
T F G VVGMSLRNELRG + N DWY+YMQ GA+AVH ANP+VLVI+SGLN+D DL
Sbjct: 1 TAFKGTSAVVGMSLRNELRGKRSNPADWYKYMQQGAQAVHDANPDVLVIMSGLNYDADLK 60
Query: 103 FVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFL---LEQG 159
F+ ++ VNL+FT K+V+E HWY FTDG AW + +C V + F+ L
Sbjct: 61 FLASEPVNLSFTNKIVYEMHWYSFTDGDAWEKMPVDTLCQTVTARINDHLAFVTKTLSPP 120
Query: 160 WPLFVSEFGADLR 172
PLF+SEFG D R
Sbjct: 121 APLFISEFGIDER 133
>gi|402079105|gb|EJT74370.1| hypothetical protein GGTG_08211 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 414
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 93/267 (34%), Positives = 137/267 (51%), Gaps = 28/267 (10%)
Query: 1 MVILDNHISKPGWCCSNSDGNGF------FGDQ---YFNPDLWIKGLTKMATIFNGVRNV 51
M ILDNH+S+ WCC+ DGNG+ + DQ YFN W+ GL MA G + +
Sbjct: 156 MTILDNHVSRASWCCNLDDGNGWWNTADGYNDQNSRYFNTAEWLAGLRAMAEWAKGHKGI 215
Query: 52 VGMSLRNELR-----GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN 106
VGM +RNE+R G +DWY+ + A +H +P++L+++ G DL+ VR
Sbjct: 216 VGMGIRNEIREFLAQGIINARQDWYQLVSDAARLIHETHPDLLILIGGTMSSTDLTHVRT 275
Query: 107 QAVNLT-FTGKLVFEAHWYGFTDGQAWVDGNP-NQVCGRVVDNVMRLSGFLLEQG----W 160
+ ++++ + K V+E H Y FT V P + C V +GFLL+Q
Sbjct: 276 RPLDVSAWKDKHVWEWHAYSFT-----VTFYPLIKNCWYVHQLYGAFNGFLLQQNKAYTA 330
Query: 161 PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGL 220
PL +SEFG + G D YL+C A D DWA+W L GSYY+R+ I +E YG+
Sbjct: 331 PLILSEFGFAMDGP---DRFYLDCLRDYATSNDGDWAIWALQGSYYVRDKQIDQDESYGV 387
Query: 221 FDWNWCDIRNSSFLERISSLQSPFRGP 247
+ +W RN + + +GP
Sbjct: 388 MNRDWSGWRNGDLKNLMGKMWETTQGP 414
>gi|255635797|gb|ACU18247.1| unknown [Glycine max]
Length = 195
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 61/74 (82%), Positives = 70/74 (94%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNHI++PGWCCSNSDGNGFFGD++F+P+ WI GLTKMA++FNGV NVVGMSLRNEL
Sbjct: 113 MVILDNHITQPGWCCSNSDGNGFFGDKFFDPNQWILGLTKMASLFNGVTNVVGMSLRNEL 172
Query: 61 RGPKQNVKDWYRYM 74
RGPKQNV DWY+YM
Sbjct: 173 RGPKQNVNDWYKYM 186
>gi|429849189|gb|ELA24599.1| beta-1,6-galactanase [Colletotrichum gloeosporioides Nara gc5]
Length = 419
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 91/264 (34%), Positives = 129/264 (48%), Gaps = 41/264 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
++LDNH+SK WCCS DGN ++GD FN W++GLT MA V MSLRNELR
Sbjct: 157 ILLDNHMSKGKWCCSGDDGNTWWGDTEFNAMNWVRGLTYMANHGKSWPAFVAMSLRNELR 216
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFV-------- 104
N +DWY+Y+Q G A++ ANP+VL+ LSGLN+D ++ V
Sbjct: 217 QATNNPDLVSASYHWQDWYKYIQQGTNAINGANPDVLIYLSGLNYDTTVAPVFEGTALEP 276
Query: 105 RNQAVNLT----FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG- 159
+ NL + KLV E H Y + G C + N+ GF
Sbjct: 277 GTETFNLADFEGYANKLVLEIHNYEGSIGS----------CDSLKYNLYH-KGFQAANAS 325
Query: 160 -------WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVI 212
+P+ ++EFG ++ +N N Y C E W +W LVGSYY R+G
Sbjct: 326 DPETVNVFPVALTEFGFNMN-DNTYQNVYSTCLSEYLPEYKASWFIWVLVGSYYTRQGTQ 384
Query: 213 GLNEYYGLFDWNWCDIRNSSFLER 236
+E +GL +W D RN ++++
Sbjct: 385 NFDESWGLLKADWSDWRNQDYVDQ 408
>gi|449446131|ref|XP_004140825.1| PREDICTED: uncharacterized protein LOC101208873 [Cucumis sativus]
Length = 288
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 121/232 (52%), Gaps = 7/232 (3%)
Query: 153 GFLLE--QGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREG 210
GF+++ +PLFVSE+G D R N +NR+++CF + D DWALW GSYY REG
Sbjct: 9 GFVMQGPNPFPLFVSEYGYDQREVNDAENRFMSCFTAHLTQRDLDWALWAWQGSYYFREG 68
Query: 211 VIGLNEYYGLFDWNWCDIRNSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRK 270
E +G+ D NW I+N +F+ + LQ+ + P VIYHP + C+Q
Sbjct: 69 QAEPGESFGVLDSNWTQIKNPNFVRKFQLLQTMLQDPNS-NASFSYVIYHPQSSQCIQVS 127
Query: 271 SFLDPLTLGPCTESEAWSYTPHKT-ISLKGAYFCLQAKHVGKPAKLGIICTDCGSTWEII 329
+ + L C+ WS+ T I + L+A G A L S W I
Sbjct: 128 NDNKEIFLTNCSTPTRWSHNNDGTPIEMSSTGLYLKASGKGLEASLSSDTLSQQSVWSAI 187
Query: 330 SDSKMHLSSKADNGTTVCLDVDSSNT--IVTNTCKCLSRDKTCDPASQWFKL 379
S+SK+HL++ G ++CL +DSSN+ +VTN+C C + K D +W ++
Sbjct: 188 SNSKLHLATFTQGGKSLCLQIDSSNSSKVVTNSCICTNVSKIGD-GGEWRQM 238
>gi|156045355|ref|XP_001589233.1| hypothetical protein SS1G_09866 [Sclerotinia sclerotiorum 1980]
gi|154694261|gb|EDN93999.1| hypothetical protein SS1G_09866 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 422
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 91/262 (34%), Positives = 129/262 (49%), Gaps = 38/262 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+SK WCCSN+DGN +FGD YFN W +GL MAT N++ MSLRNELR
Sbjct: 160 VHLDNHVSKAEWCCSNTDGNSWFGDTYFNVTNWQRGLAYMATRGASWGNLMSMSLRNELR 219
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAV--- 109
PK N ++WY + G ++H ANP VL+ LSGLNFD DLS + N +
Sbjct: 220 EPKSNPTLDSSSYNWENWYANVVPGMNSIHNANPNVLIFLSGLNFDVDLSPIPNAGLLTP 279
Query: 110 ---------NLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGF-LLEQG 159
+ +F KLV E H Y + +++ +R SG+ L G
Sbjct: 280 STTQKFLKSSFSFANKLVLELHNYASS-----------TSSCSSLESSLRNSGYNALGDG 328
Query: 160 ----WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLN 215
P+ ++E+G ++ Y C + W +W L GSYY+R G L+
Sbjct: 329 ATNQMPVVLTEWGHAQTASDY-ATVYSTCLKTYLTSIKGGWMVWVLSGSYYIRSGAQDLD 387
Query: 216 EYYGLFDWNWCDIRNSSFLERI 237
E +GL +W R+ + + +
Sbjct: 388 ESWGLLTHDWSAWRDPTDITNV 409
>gi|310794879|gb|EFQ30340.1| beta-1,6-galactanase [Glomerella graminicola M1.001]
Length = 418
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/255 (33%), Positives = 131/255 (51%), Gaps = 24/255 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNH+SK WCCS+ DGN ++GD+ F+ D W++GL MA V MSLRNELR
Sbjct: 157 IHLDNHMSKAKWCCSSDDGNTWWGDREFDVDNWVRGLAYMAEHGKSWPAFVSMSLRNELR 216
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS-FVRNQAV-- 109
N +DWY Y++LG +A++ ANP+VL+ LSGLN+D ++ VR A+
Sbjct: 217 QASDNPDLVAASYHWQDWYEYIKLGTDAINGANPDVLIYLSGLNYDTTVTPVVRGTALEP 276
Query: 110 --------NLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWP 161
+ +T KLV E H + +G D + + M + +P
Sbjct: 277 GNGTFSRSDFAYTDKLVLEIH--NYNNGATSCDSLKSALYNSGF-QAMNATDPATVNVFP 333
Query: 162 LFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLF 221
+ ++E+G ++ ++ Y C E + +W LVGSYY+R G +E +GL
Sbjct: 334 VALTEYGFNMEDDSYQ-GVYATCLSEYLPEARASFFIWVLVGSYYVRSGTQNYDESWGLL 392
Query: 222 DWNWCDIRNSSFLER 236
+W D RN +++E
Sbjct: 393 TVDWSDWRNPAYVEE 407
>gi|389637876|ref|XP_003716571.1| beta-1,6-galactanase [Magnaporthe oryzae 70-15]
gi|351642390|gb|EHA50252.1| beta-1,6-galactanase [Magnaporthe oryzae 70-15]
gi|440465783|gb|ELQ35084.1| beta-1,6-galactanase [Magnaporthe oryzae Y34]
gi|440486957|gb|ELQ66776.1| beta-1,6-galactanase [Magnaporthe oryzae P131]
Length = 423
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 131/261 (50%), Gaps = 38/261 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
++LDNHIS WCC DGN F+GD F+ DLW++GL MA + + MSLRNE R
Sbjct: 164 IVLDNHISTGKWCCGADDGNAFWGDTQFSADLWVRGLEYMAKHGKKWQALTSMSLRNEPR 223
Query: 62 GPKQ--------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS-FVRNQAV--- 109
P N + WY YM+ GA AVHAAN +VLV LSGLN+D ++ VR +A+
Sbjct: 224 TPAAGSPALNNYNWQSWYEYMKRGANAVHAANADVLVFLSGLNYDTYMTPVVRGEALTPG 283
Query: 110 --NLTF-------TGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLL---- 156
+F KLV E H Y + G C + +N+ +GF
Sbjct: 284 TGKFSFDDFPNWGRDKLVVELHNYENSQGD----------CTNLQNNLYN-NGFQALTDP 332
Query: 157 -EQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLN 215
+P+ ++E+G ++ N+ + Y C + +W + GSYY R+G
Sbjct: 333 SVTTFPVMLTEYGFNMMDNSW-QSVYATCIAQYMPAQKASFFIWVVAGSYYTRQGTQDFE 391
Query: 216 EYYGLFDWNWCDIRNSSFLER 236
E +G+ + +W D RN +++E+
Sbjct: 392 ESWGVLNHDWSDWRNPTYVEQ 412
>gi|367028963|ref|XP_003663765.1| glycoside hydrolase family 5 protein [Myceliophthora thermophila
ATCC 42464]
gi|347011035|gb|AEO58520.1| glycoside hydrolase family 5 protein [Myceliophthora thermophila
ATCC 42464]
Length = 422
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 92/266 (34%), Positives = 132/266 (49%), Gaps = 46/266 (17%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHISK WCCS++DGN ++GD YF+ + W++GL+ MA + + LRNE R
Sbjct: 159 IHLDNHISKGMWCCSSTDGNSWWGDTYFSTENWVRGLSYMAEHGKSWPALTSIGLRNEPR 218
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS-FVRNQAV-- 109
P N + WYRYM+ GA+AVH ANP++L+ LSGLNFD L+ VR + +
Sbjct: 219 EPTSNPALAQSSYNWQSWYRYMRQGADAVHGANPDLLIFLSGLNFDTYLTPVVRGEPLAP 278
Query: 110 ----------NLTFTGKLVFEAHWY--GFTDGQAW---VDGNPNQVCGRVVDNVMRLSGF 154
GKLV E H Y G T A +D N + D V +
Sbjct: 279 GTDRFDVADFAAGPAGKLVLELHNYETGATSCDALRANLDRNGFEALLSPDDAVANV--- 335
Query: 155 LLEQGWPLFVSEFGADLRGNNVNDNR-----YLNCFFGVAAELDWDWALWTLVGSYYLRE 209
P+ ++EFG + D+R Y +C E W +W L GSYY+R
Sbjct: 336 -----LPVVMTEFGFQM------DDRTWRGVYASCLAQYLPERKAGWTIWVLAGSYYVRS 384
Query: 210 GVIGLNEYYGLFDWNWCDIRNSSFLE 235
G +E +GL + +W D R+ +++
Sbjct: 385 GTQDYDEGWGLLNHDWSDWRSPGYVD 410
>gi|380488102|emb|CCF37606.1| beta-1,6-galactanase [Colletotrichum higginsianum]
Length = 425
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/263 (33%), Positives = 133/263 (50%), Gaps = 39/263 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
++LDNH+SK WCCS +DGN ++GD+ F+ W++GL MA +V MSLRNELR
Sbjct: 163 ILLDNHMSKGKWCCSGNDGNTWWGDREFDAANWVRGLAYMAEHGKSWPALVAMSLRNELR 222
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFV-RNQAV-- 109
N +DWY+Y+Q G +AV+ ANP+VL+ LSGLN+D ++ V R A+
Sbjct: 223 QATDNPELVAASYHWQDWYKYIQQGTDAVNGANPDVLIYLSGLNYDTTVAPVFRGTALTP 282
Query: 110 -NLTFT--------GKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNV-------MRLSG 153
N TF+ KLV E H Y + G C + N+ M +
Sbjct: 283 GNATFSRADFDGYADKLVLEIHNYEGSIGS----------CASLRYNLYNRGFQAMNATD 332
Query: 154 FLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIG 213
+P+ ++EFG ++ + Y C E + +W LVGSYY R+G
Sbjct: 333 PATADVFPVALTEFGFNMN-DATYQGVYSTCLAEYLPEARASFFIWVLVGSYYTRQGTQN 391
Query: 214 LNEYYGLFDWNWCDIRNSSFLER 236
+E +GL + +W RN +++E
Sbjct: 392 FDESWGLLNVDWSAWRNPAYVEE 414
>gi|154318702|ref|XP_001558669.1| hypothetical protein BC1G_02740 [Botryotinia fuckeliana B05.10]
gi|347830620|emb|CCD46317.1| glycoside hydrolase family 5 protein [Botryotinia fuckeliana]
Length = 422
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 82/257 (31%), Positives = 120/257 (46%), Gaps = 28/257 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+SK WCCSN+DGN +FGD YFN W +G+ MAT N++ M+LRNELR
Sbjct: 160 VHLDNHVSKAIWCCSNTDGNSWFGDTYFNVSNWQRGIAYMATRGASWGNLMSMALRNELR 219
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAV--- 109
P N ++WY + G ++H ANP VL+ +SGLNFD DLS + N +
Sbjct: 220 EPTSNPSLDSSSYNWENWYTNVVAGMNSIHTANPNVLIFISGLNFDTDLSTIPNAGLLTP 279
Query: 110 ---------NLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGW 160
+ ++ KLV E H Y + + + G N +
Sbjct: 280 STTQKFLKSSFSYANKLVLELHNYASSTSSCSSLQSSLKSQGYNALNAG------VTNQM 333
Query: 161 PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGL 220
P+ ++E+G + Y C + W +W L GSYY+R G E +GL
Sbjct: 334 PVVLTEWGHAQTAADY-ATVYSTCLKTYLTSIKGGWMVWVLSGSYYIRSGTQDFEETWGL 392
Query: 221 FDWNWCDIRNSSFLERI 237
+W R+ + + +
Sbjct: 393 LTHDWSAWRDPTDVANV 409
>gi|116206102|ref|XP_001228860.1| hypothetical protein CHGG_02344 [Chaetomium globosum CBS 148.51]
gi|88182941|gb|EAQ90409.1| hypothetical protein CHGG_02344 [Chaetomium globosum CBS 148.51]
Length = 425
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/262 (34%), Positives = 129/262 (49%), Gaps = 41/262 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHISK WCC ++DGN ++GD YF+ W++GL+ MA + + LRNE R
Sbjct: 165 VHLDNHISKGMWCCGSTDGNSWWGDTYFSTSNWVRGLSYMAEHGKSWPALTSIGLRNEPR 224
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT 112
P N + WY +M+ GA+AVHAANP +L+ LSGL+FD L+ V Q LT
Sbjct: 225 EPTNNPPLSRSSYNWQAWYGFMRQGADAVHAANPSLLIFLSGLSFDTYLTPVV-QGTALT 283
Query: 113 ----------FTG----KLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQ 158
F G KLV E H Y G C + N+ R +GF Q
Sbjct: 284 PGNGRFNRNDFAGYGDTKLVLELHNYETGIGS----------CATLQGNLDR-NGFEALQ 332
Query: 159 -----GWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIG 213
+P+ ++EFG + + Y +C WA+W L GSYY+R G
Sbjct: 333 ASGAGAFPVMMTEFGFQMDASTWR-GVYASCLAQYLPARKAGWAIWVLAGSYYMRSGTQD 391
Query: 214 LNEYYGLFDWNWCDIRNSSFLE 235
+E +GL + +W R+ S+++
Sbjct: 392 YDENWGLLNHDWSGWRSQSYVD 413
>gi|346979872|gb|EGY23324.1| beta-1,6-galactanase [Verticillium dahliae VdLs.17]
Length = 422
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 41/266 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHIS WCC DGN ++GD F+ D W++G MA ++ SLRNE R
Sbjct: 158 ITLDNHISSGEWCCGGEDGNTWWGDTKFDVDNWVRGGAYMAAHAAKWPALISQSLRNEPR 217
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFD-------------- 98
PK N +DWY+YM+ GA+AVHAAN +V +ILSGL++D
Sbjct: 218 PPKNNDDLLQSSYNWRDWYKYMRAGADAVHAANKDVPIILSGLDYDTFVTPVFRGTPLEP 277
Query: 99 KDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNV-------MRL 151
D F R+ V KL+ E H Y + N N C + N+ M
Sbjct: 278 SDQVFSRDDFVGYG-EDKLILEIHNY---------ETNTNS-CDSLRYNLYNKGFQAMNA 326
Query: 152 SGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGV 211
S +P+ ++E+G + + Y C E+ +W +W +VG YY R+GV
Sbjct: 327 SDPATVNVFPVQLTEYGHSMEDGSWKTKVYQPCLAEYLPEVKANWFIWVIVGRYYTRQGV 386
Query: 212 IGLNEYYGLFDWNWCDIRNSSFLERI 237
++ +GL + +W RN ++E++
Sbjct: 387 QEFDDSWGLMNPDWSGWRNPEYVEQM 412
>gi|336467838|gb|EGO56002.1| hypothetical protein NEUTE1DRAFT_148368 [Neurospora tetrasperma
FGSC 2508]
gi|350287497|gb|EGZ68733.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 422
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 84/254 (33%), Positives = 121/254 (47%), Gaps = 26/254 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHISK WCCS DGN ++GD YF+ W +GL MA +++ + LRNE R
Sbjct: 165 VHLDNHISKGMWCCSTDDGNSWWGDTYFSAANWTRGLAYMANHGKQWTSLMSIGLRNEPR 224
Query: 62 GPKQ-------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS-FVRNQAVN--- 110
P N + WY YM+ GAEAVH++NP++L+ LSGL+FD L+ VR A+
Sbjct: 225 EPTSGAAKSTYNWQTWYTYMKQGAEAVHSSNPDLLIFLSGLSFDTFLTPVVRGTALTPGT 284
Query: 111 --------LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL 162
++ KLV E H Y + N G ++ F P+
Sbjct: 285 GKFSFNDFPSYANKLVLELHNYETSANSCNNLQNNLYNNGFEALTSSAVNQF------PV 338
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++EFG + + Y +C W +W L GSYY+R G +E +GL
Sbjct: 339 MLTEFGFQMDASTWKGT-YASCLASYLPAQKAGWFIWVLAGSYYIRSGTQDYDEGWGLLT 397
Query: 223 WNWCDIRNSSFLER 236
+W R+ S++
Sbjct: 398 HDWSTWRSPSYVNE 411
>gi|302404994|ref|XP_003000334.1| beta-1,6-galactanase [Verticillium albo-atrum VaMs.102]
gi|261360991|gb|EEY23419.1| beta-1,6-galactanase [Verticillium albo-atrum VaMs.102]
Length = 420
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/266 (32%), Positives = 128/266 (48%), Gaps = 43/266 (16%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHIS WCC DGN ++GD F+ D W++G MA+ ++ SLRNE R
Sbjct: 158 ITLDNHISSGEWCCGGEDGNTWWGDTKFDVDNWVRGGAYMASA--KWPALISQSLRNEPR 215
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFD-------------- 98
PK N +DWY+YM+ GA+AVHAAN +V +ILSGL++D
Sbjct: 216 PPKNNDALLQSSYNWRDWYKYMRAGADAVHAANKDVPIILSGLDYDTFVTPVFRGTPLEP 275
Query: 99 KDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNV-------MRL 151
D F RN V KL+ E H Y + N N C + N+ M
Sbjct: 276 SDQVFSRNDFVGYG-EDKLILEIHNY---------ETNTNS-CDSLRYNLYNKGFQAMNA 324
Query: 152 SGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGV 211
S +P+ ++E+G + + Y C E+ +W +W +VG YY R+GV
Sbjct: 325 SDPATVNVFPVQLTEYGHSMEDGSWKTKVYQPCLAEYLPEVKANWFIWVIVGRYYTRQGV 384
Query: 212 IGLNEYYGLFDWNWCDIRNSSFLERI 237
++ +GL + +W RN ++E++
Sbjct: 385 QEFDDSWGLMNPDWSGWRNPEYVEQM 410
>gi|297602933|ref|NP_001053106.2| Os04g0481000 [Oryza sativa Japonica Group]
gi|255675564|dbj|BAF15020.2| Os04g0481000, partial [Oryza sativa Japonica Group]
Length = 100
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 54/72 (75%), Positives = 63/72 (87%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILDNH+SKPGWCC N+DGNGFFGD YF+PD+W+ GLTKMAT+F V +VV MSLRNEL
Sbjct: 29 MVILDNHVSKPGWCCGNNDGNGFFGDAYFDPDVWVDGLTKMATMFAAVPSVVAMSLRNEL 88
Query: 61 RGPKQNVKDWYR 72
RGP+QN DWY+
Sbjct: 89 RGPRQNSADWYK 100
>gi|320585874|gb|EFW98553.1| cellulase family protein [Grosmannia clavigera kw1407]
Length = 388
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/262 (33%), Positives = 123/262 (46%), Gaps = 47/262 (17%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFF---------GDQYFNPDLWIKGLTKMATIFNGVRNV 51
M +LDNH+S+ GWCC +DGNG++ ++F+ D W++GL MA
Sbjct: 152 MTVLDNHVSRAGWCCDLTDGNGWWDVATGYDAANSRFFHTDRWLRGLQAMAD-------- 203
Query: 52 VGMSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN-QAVN 110
DWY + VH NP+VLVI+ G DLSFVR Q +N
Sbjct: 204 -----------ADGRHADWYHLLGRAGALVHQVNPDVLVIIGGTFMATDLSFVRLLQPLN 252
Query: 111 LT-FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRL-SGFLLEQGWP----LFV 164
T + GK V+E H Y F+ P+ + R + L +GF+L Q P L +
Sbjct: 253 TTGWAGKHVWEWHTYSFSLTV------PHVLPCRARQQLYGLFAGFVLRQHQPYTGPLLL 306
Query: 165 SEFGADLRGNNVN------DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYY 218
SEFG D G + D+RYL+C D +WA+W L GSYY+R NE +
Sbjct: 307 SEFGVDQTGGHGPNGLSHLDSRYLSCLVDYLRRNDAEWAVWALQGSYYVRNKQTNYNETW 366
Query: 219 GLFDWNWCDIRNSSFLERISSL 240
L + NW +RN F E + +
Sbjct: 367 DLLNHNWTALRNPRFFELLRGI 388
>gi|85094362|ref|XP_959873.1| hypothetical protein NCU05882 [Neurospora crassa OR74A]
gi|28921329|gb|EAA30637.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 422
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 120/252 (47%), Gaps = 26/252 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHISK WCCS DGN ++GD YF+ W +GL MA +++ + LRNE R
Sbjct: 165 VHLDNHISKGMWCCSTGDGNSWWGDTYFSAANWTRGLAYMANHGKQWTSLMSIGLRNEPR 224
Query: 62 GPKQ-------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS-FVRNQAVN--- 110
P N + WY YM+ GAEAVH++NP++L+ LSGL+FD L+ VR A+
Sbjct: 225 EPTSGAAKSTYNWQTWYTYMKQGAEAVHSSNPDLLIFLSGLSFDTFLTPVVRGTALTPGT 284
Query: 111 --------LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL 162
+ KLV E H Y + N G ++ F P+
Sbjct: 285 GKFSFNDFPGYADKLVLELHNYETSANNCNNLQNNLYNNGFEALTSSTVNQF------PV 338
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++EFG + + Y +C W +W L GSYY+R G +E +GL
Sbjct: 339 MLTEFGFQMDASTWK-GTYASCLASYLPAQKAGWFIWVLAGSYYIRSGTQDYDEGWGLLT 397
Query: 223 WNWCDIRNSSFL 234
+W R+ S++
Sbjct: 398 HDWSTWRSPSYV 409
>gi|336262547|ref|XP_003346057.1| hypothetical protein SMAC_08559 [Sordaria macrospora k-hell]
gi|380087626|emb|CCC05307.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 419
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 119/252 (47%), Gaps = 26/252 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHISK WCCS DGN ++GD YF+ W +GL MA +++ + LRNE R
Sbjct: 162 VHLDNHISKGAWCCSTGDGNSWWGDTYFSASNWTRGLAYMANHGKQWTSLMSIGLRNEPR 221
Query: 62 GPKQ-------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS-FVRNQAVN--- 110
P N + WY YM+ GAEAVH +NP++L+ LSGL+FD L+ VR A+
Sbjct: 222 EPTSGAAKSTYNWQTWYTYMKQGAEAVHFSNPDLLIFLSGLSFDTFLTPVVRGTALTPGS 281
Query: 111 --------LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL 162
+ KLV E H Y + N G ++ F P+
Sbjct: 282 GKFSFNDFPGYANKLVLELHNYENSANNCNNLQNNLYNNGFQALTSSAVNKF------PV 335
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++EFG + + Y +C W +W L GSYY+R G +E +GL
Sbjct: 336 MLTEFGFQMDASTWKGT-YASCLASYLPAQKAGWFIWVLAGSYYIRSGTQDYDEGWGLLT 394
Query: 223 WNWCDIRNSSFL 234
+W R+ S++
Sbjct: 395 RDWSAWRSPSYV 406
>gi|396459113|ref|XP_003834169.1| similar to glycosyl hydrolase family 5 protein/cellulase
[Leptosphaeria maculans JN3]
gi|312210718|emb|CBX90804.1| similar to glycosyl hydrolase family 5 protein/cellulase
[Leptosphaeria maculans JN3]
Length = 410
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 85/259 (32%), Positives = 124/259 (47%), Gaps = 27/259 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVR---NVVGMSLRN 58
V LDNH+SK WCCS +DGN +FGD FN W +GL MA + N+V + LRN
Sbjct: 150 VHLDNHMSKGKWCCSGNDGNTWFGDTDFNVAKWERGLVHMAEHIIQSKTWPNMVSLGLRN 209
Query: 59 ELRGPKQNVKD-------WYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ---- 107
ELR P Q WY M A +V+AANPE+L+ LSGLNFD L + +
Sbjct: 210 ELRRPDQAGNTLPYTWSVWYDNMIKAANSVNAANPEILIFLSGLNFDTTLQPIPSSDDLG 269
Query: 108 ------AVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWP 161
N ++ KLV E H Y + N + G + +N R + P
Sbjct: 270 DGKKFVLKNFKYSDKLVLELHNYQNS------ATNCGDIEGGLWNNGFRATWPTAVNRMP 323
Query: 162 LFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLF 221
+ ++EFG + +N Y C + + W +W L GSYY+R G+ E +GL
Sbjct: 324 VLLTEFGFS-QADNSYSGVYSTCLKKLMPQWQSGWMVWALGGSYYIRSGIQDYEETWGLV 382
Query: 222 DWNWCDIRNSSFLERISSL 240
+ +W R+++ + + +
Sbjct: 383 NHDWSGWRSANSIRSLKEM 401
>gi|302906325|ref|XP_003049455.1| hypothetical protein NECHADRAFT_45063 [Nectria haematococca mpVI
77-13-4]
gi|256730390|gb|EEU43742.1| hypothetical protein NECHADRAFT_45063 [Nectria haematococca mpVI
77-13-4]
Length = 421
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 81/263 (30%), Positives = 127/263 (48%), Gaps = 39/263 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHIS WCC +DGN ++GD F+ D W +G MA + SLRNE R
Sbjct: 153 INLDNHISSGKWCCGGTDGNTWWGDTEFDADRWARGNAYMAAHSRSWPAKISQSLRNEPR 212
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAV--- 109
P N +D Y++M+ GA+AVH A+P+ L+I+SGLN+D ++ + A
Sbjct: 213 PPTNNEDLRSKSYNWRDLYKFMRQGADAVHEADPDSLIIISGLNYDTFVTPLFTGAALDP 272
Query: 110 -NLTFT--------GKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNV-------MRLSG 153
TF+ KLV E H Y + G C + N+ M S
Sbjct: 273 STQTFSRDDFAGYGEKLVLEIHNYENSIGS----------CSSLRYNLYTKGFQAMNASD 322
Query: 154 FLLEQGWPLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLVGSYYLREGVI 212
+ +P+ ++EFG + G+N N + Y++C E W +W +VG YY R+G+
Sbjct: 323 PNTKNVFPVMLTEFGHSMEGDNYNKAKTYMSCLSEYLPEAQASWFIWVIVGRYYTRQGIQ 382
Query: 213 GLNEYYGLFDWNWCDIRNSSFLE 235
++ +G+ +W RN +++
Sbjct: 383 EFDDSWGIKKPDWSGWRNDQYIK 405
>gi|67904358|ref|XP_682435.1| hypothetical protein AN9166.2 [Aspergillus nidulans FGSC A4]
gi|40742809|gb|EAA61999.1| hypothetical protein AN9166.2 [Aspergillus nidulans FGSC A4]
gi|259485409|tpe|CBF82407.1| TPA: cellulase family protein (AFU_orthologue; AFUA_5G14560)
[Aspergillus nidulans FGSC A4]
Length = 412
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 122/265 (46%), Gaps = 47/265 (17%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHISK WCCS +DGN +FGD YF+ D W +GL MA+ + LRNELR
Sbjct: 153 VHLDNHISKAMWCCSTTDGNAWFGDTYFDVDNWKRGLQYMASHAASWETFTSIGLRNELR 212
Query: 62 GPKQ-------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFV---------- 104
P N + WY M A+ V+AANP+ L+ LSGLN+D L+ +
Sbjct: 213 QPASANPSYPYNWETWYSQMTEAADLVNAANPDALIFLSGLNYDTTLAPIPTGSDLGEGT 272
Query: 105 RNQAVNLTFTGKLVFEAHWY-------GFTDGQAW------VDGNPNQVCGRVVDNVMRL 151
R + ++ KLV E H Y G W +D N + + N+M
Sbjct: 273 RFYLEDFSYADKLVLELHNYDTGASSCANLSGALWNGGFKALDTNSSDIV-----NIM-- 325
Query: 152 SGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGV 211
P+ ++EFG + + Y +C + W +WT+ GSYY+R G
Sbjct: 326 ---------PVVLTEFGF-AQDETTWQSVYASCIREWIPQQQAGWMVWTISGSYYIRSGG 375
Query: 212 IGLNEYYGLFDWNWCDIRNSSFLER 236
++ +G+ D W RN +E+
Sbjct: 376 QDYDDTWGILDHTWSGWRNEEAIEQ 400
>gi|115449373|ref|XP_001218590.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114187539|gb|EAU29239.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 628
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/240 (35%), Positives = 113/240 (47%), Gaps = 25/240 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHISK WCCS +DGN +FGD YF+ W++GL M + N V + LRNELR
Sbjct: 150 VHLDNHISKAMWCCSTTDGNAWFGDTYFDVPKWMRGLEYMVSHAKSWPNFVSIGLRNELR 209
Query: 62 GPKQ-------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS----------FV 104
P N WY M A+ V+AANP L+ LSGLN+D LS V
Sbjct: 210 QPATANPEYPYNWGTWYTQMTTAAKRVNAANPNALIFLSGLNYDTTLSPIPTASDLGNGV 269
Query: 105 RNQAVNLTFTGKLVFEAHWY--GFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL 162
+ + + F KLV E H Y G T A N + S + P+
Sbjct: 270 KFRLSDFAFASKLVMEIHNYETGATSCSALSSALWNAGL-----KALNSSDPSIVNSMPV 324
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++EFG L+ + N Y +C E W W + GSYY+R+G ++E +G D
Sbjct: 325 VLTEFGF-LQDSTTWKNVYASCLRTWIPEQHVGWMTWVIAGSYYIRQGNQDMDETWGEKD 383
>gi|119476840|ref|XP_001259223.1| cellulase, putative [Neosartorya fischeri NRRL 181]
gi|119407377|gb|EAW17326.1| cellulase, putative [Neosartorya fischeri NRRL 181]
Length = 426
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/242 (35%), Positives = 114/242 (47%), Gaps = 35/242 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHISK WCCS +DGN +FGD Y++ W++GL M + N V + LRNELR
Sbjct: 150 IHLDNHISKAMWCCSRTDGNTWFGDTYYDVAKWMRGLEYMVSHAKKWPNFVSIGLRNELR 209
Query: 62 GPKQ-------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS----------FV 104
P N WY M A+ V+AANP L+ LSGLN+D L+ V
Sbjct: 210 EPSTINTQYPYNWGTWYTQMTTAAKRVNAANPNALIFLSGLNYDTTLAPIPTASDLGNGV 269
Query: 105 RNQAVNLTFTGKLVFEAHWY--GFT-----DGQAWVDGNPNQVCGRVVDNVMRLSGFLLE 157
R + + +F KLV E H Y G T G W G Q S +
Sbjct: 270 RFRLSDFSFANKLVLELHNYETGATSCSALSGALWNAGFKAQ----------NSSDPSIV 319
Query: 158 QGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
P+ ++EFG L+ + N Y +C E W W + GSYY+R+G L+E
Sbjct: 320 NSMPVVLTEFGF-LQDSTTWKNVYASCLRTWIPEQQAGWTTWVIAGSYYIRQGTQDLDET 378
Query: 218 YG 219
+G
Sbjct: 379 WG 380
>gi|119507928|dbj|BAF42338.1| beta-1,6-galactanase [Fusarium oxysporum]
Length = 420
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 127/266 (47%), Gaps = 42/266 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHIS+ WCC +DGN ++GD FN D W++G MA + SLRNE R
Sbjct: 154 IDLDNHISEAKWCCGGTDGNTWWGDTQFNVDNWVRGGKYMAAHSKKWPAKITQSLRNEPR 213
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS---------- 102
P N D Y+YM+ GA+AVH A+PE ++++SG+N+D ++
Sbjct: 214 EPTNNNALRDKSYNWSDLYKYMRQGADAVHEADPEAIIVISGMNYDTYVTPLYSGEKLQP 273
Query: 103 ----FVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNV-------MRL 151
F R+ V KLV E H Y N C + N+ M
Sbjct: 274 GGEVFNRDDFVGYG-KDKLVLEIHNY----------ENKGTSCSSLRYNLYNKGFQAMNE 322
Query: 152 SGFLLEQGWPLFVSEFGADLRGNNVND-NRYLNCFFGVAAELDWDWALWTLVGSYYLREG 210
S + + +P+ ++EFG + G + + N Y++C E+ W +W +VG YY R+G
Sbjct: 323 SDPNVAEVFPVMLTEFGQAMNGADYDTANTYVSCLSEYLPEIKASWFIWVIVGRYYTRQG 382
Query: 211 VIGLNEYYGLFDWNWCDIRNSSFLER 236
+ ++ +G+ +W +N ++ +
Sbjct: 383 IQEFDDSWGMKKADWSGWKNDDYIAK 408
>gi|70996893|ref|XP_753201.1| cellulase family protein [Aspergillus fumigatus Af293]
gi|66850837|gb|EAL91163.1| cellulase family protein [Aspergillus fumigatus Af293]
gi|159127069|gb|EDP52185.1| cellulase family protein [Aspergillus fumigatus A1163]
Length = 392
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 115/242 (47%), Gaps = 35/242 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHISK WCCS +DGN +FGD YF+ W++GL M + N V + LRNELR
Sbjct: 150 IHLDNHISKAMWCCSQTDGNAWFGDTYFDVAKWMRGLEYMVSHAKKWPNFVSIGLRNELR 209
Query: 62 GPKQ-------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS----------FV 104
P N WY + A+ V+AANP+ L+ LSGLN+D L+ V
Sbjct: 210 QPSTSNTQYPYNWGTWYTQVTTAAKRVNAANPDALIFLSGLNYDTTLAPIPTASDLGNGV 269
Query: 105 RNQAVNLTFTGKLVFEAHWY--GFT-----DGQAWVDGNPNQVCGRVVDNVMRLSGFLLE 157
R + + +F KLV E H Y G T G W G Q S +
Sbjct: 270 RFRLSDFSFANKLVLELHNYDTGATSCSALSGALWNAGFKAQ----------NSSDPSIV 319
Query: 158 QGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
P+ ++EFG L+ + N Y +C E W W + GSYY+R+G ++E
Sbjct: 320 NPMPVVLTEFGF-LQDSTTWKNVYASCLRTWIPEQRAGWTTWVIAGSYYIRQGTQDMDET 378
Query: 218 YG 219
+G
Sbjct: 379 WG 380
>gi|389643616|ref|XP_003719440.1| cellulase [Magnaporthe oryzae 70-15]
gi|351639209|gb|EHA47073.1| cellulase [Magnaporthe oryzae 70-15]
Length = 471
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/294 (32%), Positives = 134/294 (45%), Gaps = 61/294 (20%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V++DNH+SK WCCS DGN ++GD YF+ W +GLT M N++G+SLRNELR
Sbjct: 151 VLVDNHVSKAQWCCSPLDGNSWWGDTYFSTANWTRGLTYMTEHTRDWPNLIGLSLRNELR 210
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT 112
P N+ + WY + GA A+H ANP+ LV LSGL+ D L V + NLT
Sbjct: 211 QPLTNITLYQQAYNWETWYARTREGAAAIHKANPKALVFLSGLDSDTTLQPVV-EGSNLT 269
Query: 113 -----------FTG---KLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQ 158
F G KLV E H Y N + + N L L++
Sbjct: 270 PGKGMFRPGDYFEGAENKLVLELHSYA------------NIINEGLAKNCTGLKETLVQG 317
Query: 159 GW-----------------PLFVSEFGADLRGNNVNDNRYLNCFFG-VAAELDWDWALWT 200
GW P ++EFG N N Y C + ++ W +W
Sbjct: 318 GWSGLSTAAPGGTKVKNRMPTLMTEFGWGQNDQEWNSN-YSTCIQDFLRDDVRAGWMIWA 376
Query: 201 LVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSF----LERI--SSLQSPFRGPG 248
+ GSYY+RE +E +G+ + +W D R+ L+R+ S+L + G G
Sbjct: 377 ISGSYYIRECKQDYDEPWGILNHDWSDWRSEKHAMGELKRLVNSTLSTTMDGAG 430
>gi|440471530|gb|ELQ40532.1| cellulase family protein [Magnaporthe oryzae Y34]
gi|440489917|gb|ELQ69525.1| cellulase family protein [Magnaporthe oryzae P131]
Length = 459
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 95/294 (32%), Positives = 134/294 (45%), Gaps = 61/294 (20%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V++DNH+SK WCCS DGN ++GD YF+ W +GLT M N++G+SLRNELR
Sbjct: 139 VLVDNHVSKAQWCCSPLDGNSWWGDTYFSTANWTRGLTYMTEHTRDWPNLIGLSLRNELR 198
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT 112
P N+ + WY + GA A+H ANP+ LV LSGL+ D L V + NLT
Sbjct: 199 QPLTNITLYQQAYNWETWYARTREGAAAIHKANPKALVFLSGLDSDTTLQPVV-EGSNLT 257
Query: 113 -----------FTG---KLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQ 158
F G KLV E H Y N + + N L L++
Sbjct: 258 PGKGMFRPGDYFEGAENKLVLELHSYA------------NIINEGLAKNCTGLKETLVQG 305
Query: 159 GW-----------------PLFVSEFGADLRGNNVNDNRYLNCFFG-VAAELDWDWALWT 200
GW P ++EFG N N Y C + ++ W +W
Sbjct: 306 GWSGLSTAAPGGTKVKNRMPTLMTEFGWGQNDQEWNSN-YSTCIQDFLRDDVRAGWMIWA 364
Query: 201 LVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSF----LERI--SSLQSPFRGPG 248
+ GSYY+RE +E +G+ + +W D R+ L+R+ S+L + G G
Sbjct: 365 ISGSYYIRECKQDYDEPWGILNHDWSDWRSEKHAMGELKRLVNSTLSTTMDGAG 418
>gi|46139339|ref|XP_391360.1| hypothetical protein FG11184.1 [Gibberella zeae PH-1]
Length = 421
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 126/264 (47%), Gaps = 40/264 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHIS+ WCC DGN ++GD F+ W++G MA V SLRNE R
Sbjct: 156 INLDNHISEGKWCCGGEDGNTWWGDTQFDTAKWVRGGAYMAAHAKKWPAKVSQSLRNEPR 215
Query: 62 GPKQNVK---------DWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFV-------- 104
P N K D Y+YM+ GA+AVH A+P ++++SG+N+D ++ +
Sbjct: 216 EPTNNNKLRDASYNWSDLYKYMRQGADAVHQADPNAIIVISGMNYDTYVTPLYSGEKMQP 275
Query: 105 RNQAVNL-TFTG----KLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNV-------MRLS 152
R + N F G KLV E H Y N C + N+ M S
Sbjct: 276 RGEVFNRDDFVGYGKDKLVLEIHTY----------ENKGTSCPSLRYNLYNKGFQAMNES 325
Query: 153 GFLLEQGWPLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLVGSYYLREGV 211
+ + +P+ ++EFG + G + + Y++C E+ W +W +VG YY R+G+
Sbjct: 326 DPNVAEVFPVMLTEFGQAMNGADYETAKTYVSCLSQYLPEMQASWFIWVIVGRYYTRQGI 385
Query: 212 IGLNEYYGLFDWNWCDIRNSSFLE 235
++ +G+ +W RN ++E
Sbjct: 386 QEFDDSWGMKKPDWSGWRNDQYIE 409
>gi|361129842|gb|EHL01724.1| putative endoglucanase E1 [Glarea lozoyensis 74030]
Length = 420
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 118/264 (44%), Gaps = 36/264 (13%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHIS+ WCC +DGN +FGD YFN W +GL MA +V + LRNE R
Sbjct: 157 VHLDNHISRAEWCCGETDGNAWFGDTYFNVANWKRGLAYMADHGKAWPALVSLGLRNEPR 216
Query: 62 GPKQNV--------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQA----- 108
N + WY+ M A V+AANP +L+ SGLN+D L+ + A
Sbjct: 217 NATNNPTISKTYNWETWYKNMIPAANGVNAANPGLLIYFSGLNYDTTLAPITTGANLGSG 276
Query: 109 -----VNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVD-------NVMRLSGFLL 156
+ ++ KLV E H Y T C + + N + + +
Sbjct: 277 TVFRKSDFPYSNKLVLELHNYQTTA----------TTCPSITNGLYNGGYNALNATDPKV 326
Query: 157 EQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNE 216
+ P+ ++E+G D+ NN Y C L W W + GSYY R+G +E
Sbjct: 327 KNVLPVVMTEWGHDM-SNNEYKGVYQTCLDSYLGGLKAGWMFWEVGGSYYTRQGKQDFDE 385
Query: 217 YYGLFDWNWCDIRNSSFLERISSL 240
+ LF+ +W + R + E I +
Sbjct: 386 TWALFNHDWTNWRQKASAESIKEM 409
>gi|449302780|gb|EMC98788.1| glycoside hydrolase family 5 protein [Baudoinia compniacensis UAMH
10762]
Length = 440
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 118/256 (46%), Gaps = 28/256 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+SK WCC+N+DGN +FGD YF+ W +GL MA + MSLRNELR
Sbjct: 167 VHLDNHVSKAQWCCNNNDGNAWFGDTYFSVQKWHRGLGFMAAHAASWQGFASMSLRNELR 226
Query: 62 GPKQNV-------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF-VRNQAVN--- 110
P + W + A +H NP L+ SG ++ D ++ ++ Q N
Sbjct: 227 QPDSDSPASPYTWNTWIKNALPAASVIHKHNPSPLIFFSGEGYNTDDTYAIQRQTWNHKT 286
Query: 111 -----LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDN---VMRLSGFLLEQGWPL 162
F K+V+E H Y Q N + R+ M LS P+
Sbjct: 287 FTPEKYAFRNKVVYEIHNY-----QTKAKSCSNDIQPRLYHTAYCAMNLSDSSCPNHGPV 341
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAEL---DWDWALWTLVGSYYLREGVIGLNEYYG 219
++EFG D + ++ + + Y C + W W L GSYY+R G+ +E +G
Sbjct: 342 IMTEFGFD-QTDSSDQSVYAQCISSTLLQQPGGPGGWMQWVLAGSYYIRSGIKDYDETWG 400
Query: 220 LFDWNWCDIRNSSFLE 235
L + +W D RN S ++
Sbjct: 401 LLNHDWSDFRNRSVID 416
>gi|402087567|gb|EJT82465.1| cellulase [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 482
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/262 (32%), Positives = 120/262 (45%), Gaps = 36/262 (13%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V+LDNH+SK WCC+ DGN ++GD F+ W +GL+ MA N++ MSLRNELR
Sbjct: 160 VVLDNHVSKAQWCCTPLDGNSWWGDTLFSAANWTRGLSYMANHTKSWPNLLAMSLRNELR 219
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS-FVRNQAVN- 110
P N+ + WY++ + G A+ NP+ LV LSGL D L VR + +
Sbjct: 220 QPFTNITLYRSSYNWETWYQHTKDGVAAIRKENPDALVFLSGLESDTTLQPVVRGEVLTP 279
Query: 111 -------LTFTG-----KLVFEAHWYG--FTDGQ---------AWVDGNPNQVCGRVVDN 147
F G KLV E H Y GQ A DG + +
Sbjct: 280 GSARFSVADFPGGSSENKLVLELHSYSNVINQGQANNCTALRAALRDGGFEALLSNSTEG 339
Query: 148 VMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
G +++ P+ ++EFG + + Y C G + W +W L GSYY
Sbjct: 340 --GAGGAVVKNRLPVLLTEFGWAQQDEKEWGSAYATCLRGFVGDTGAGWTVWVLAGSYYT 397
Query: 208 REGVIGLNEYYGLFDWNWCDIR 229
R+G +E +GL + +W D R
Sbjct: 398 RQGSQDNDEAWGLLNHDWSDWR 419
>gi|342883750|gb|EGU84192.1| hypothetical protein FOXB_05301 [Fusarium oxysporum Fo5176]
Length = 420
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 127/266 (47%), Gaps = 42/266 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHIS+ WCC +DGN ++GD FN D W++G MA + SLRNE R
Sbjct: 154 IDLDNHISEAKWCCGGTDGNTWWGDTQFNVDNWVRGGKYMAAHSKKWPAKITQSLRNEPR 213
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS---------- 102
P N D Y+YM+ GA+AVH A+P+ ++++SG+N+D ++
Sbjct: 214 EPTNNNALRDKSYNWSDLYKYMRQGADAVHEADPKAIIVISGMNYDTYVTPLYSGEKLQP 273
Query: 103 ----FVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNV-------MRL 151
F R+ V KLV E H Y N C + N+ M
Sbjct: 274 GGEVFNRDDFVGYG-KDKLVLEIHNY----------ENKGTSCSSLRYNLYNKGFQAMNE 322
Query: 152 SGFLLEQGWPLFVSEFGADLRGNNVND-NRYLNCFFGVAAELDWDWALWTLVGSYYLREG 210
S + + +P+ ++EFG + G + + N Y++C E+ W +W +VG YY R+G
Sbjct: 323 SDPNVAEVFPVMLTEFGQAMNGADYDTANTYVSCLSEYLPEIKASWFIWVIVGRYYTRQG 382
Query: 211 VIGLNEYYGLFDWNWCDIRNSSFLER 236
+ ++ +G+ +W +N ++ +
Sbjct: 383 IQEFDDSWGMKKADWSGWKNDDYIAK 408
>gi|169618910|ref|XP_001802868.1| hypothetical protein SNOG_12647 [Phaeosphaeria nodorum SN15]
gi|160703705|gb|EAT79945.2| hypothetical protein SNOG_12647 [Phaeosphaeria nodorum SN15]
Length = 302
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/253 (31%), Positives = 121/253 (47%), Gaps = 25/253 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+S+ WCC DGN +FGD FN W++GL MA N++ + LRNELR
Sbjct: 52 VHLDNHMSRGAWCCGGGDGNTWFGDTDFNVAKWVRGLEFMADHSKKWANMISIGLRNELR 111
Query: 62 GPKQ-------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFD--KDLSFVRN-QAVNL 111
P N WY + A AV+ ANP +L+ LSGL++D +DL + Q +
Sbjct: 112 KPDNAGSSLAYNWPTWYDQVIPAANAVNKANPNILIFLSGLDYDTPEDLGNGKKFQLGDF 171
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVV----DNVMRLSGFLLEQGWPLFVSEF 167
+ KLV E H Y N CG + +N R + P+ ++EF
Sbjct: 172 AYKNKLVLELHNY----------QNDITNCGSMESGLWNNGFRATWPTAINKMPVVLTEF 221
Query: 168 GADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCD 227
G + + +N Y C + + W +W + GSYY+R G E +GL + +W
Sbjct: 222 GFN-QADNSYTKTYATCIKKLMPQWQTGWTVWVISGSYYIRSGTQDFEETWGLVNHDWSA 280
Query: 228 IRNSSFLERISSL 240
RN+ + ++ ++
Sbjct: 281 WRNNDAINQLKAM 293
>gi|408397822|gb|EKJ76960.1| hypothetical protein FPSE_02835 [Fusarium pseudograminearum CS3096]
Length = 419
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/264 (29%), Positives = 126/264 (47%), Gaps = 40/264 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ LDNHIS+ WCC DGN ++GD F+ W++G MA V SLRNE R
Sbjct: 154 IDLDNHISEGKWCCGGEDGNTWWGDTQFDTAKWVRGGAYMAAHAKKWPAKVTQSLRNEPR 213
Query: 62 GPKQNVK---------DWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFV-------- 104
P N K D Y+YM+ GA+AVH A+P ++++SG+N+D ++ +
Sbjct: 214 EPTNNNKLRDASYNWSDLYKYMRQGADAVHQADPNAIIVISGMNYDTYVTPLYSGEKMKP 273
Query: 105 RNQAVNL-TFTG----KLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNV-------MRLS 152
R + N F G KLV E H Y N C + N+ M S
Sbjct: 274 RGEVFNRDDFKGYGKDKLVLEIHTY----------ENKGTSCPSLRYNLYNKGFQAMNES 323
Query: 153 GFLLEQGWPLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLVGSYYLREGV 211
+ + +P+ ++EFG + G + + Y++C E+ W +W +VG YY R+G+
Sbjct: 324 DPNVAEVFPVMLTEFGQAMNGADYETAKTYVSCLSQYLPEMQASWFIWVIVGRYYTRQGI 383
Query: 212 IGLNEYYGLFDWNWCDIRNSSFLE 235
++ +G+ +W +N ++E
Sbjct: 384 QEFDDSWGMKKADWSGWKNDQYIE 407
>gi|317136538|ref|XP_001727126.2| cellulase family protein [Aspergillus oryzae RIB40]
Length = 412
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/257 (31%), Positives = 119/257 (46%), Gaps = 39/257 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHISK WCCS +DGN +FGD+YF+ W +G MA + V + +RNELR
Sbjct: 152 VHLDNHISKAMWCCSGTDGNTWFGDRYFDVAKWHRGWQYMAEHVKSLPAVKSVGMRNELR 211
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAV--- 109
+ N + WYR+M A +HAAN +L+ SGL++D LS + A
Sbjct: 212 KAEDNPTLVNTTYNWRYWYRHMVENANQIHAANRNLLIYFSGLDYDTRLSPIPTGAELGN 271
Query: 110 -------NLTFTGKLVFEAHWYGFT-------DGQAWVDG-NPNQVCGRVVDNVMRLSGF 154
+ + K+V E H Y T W G N + N+M
Sbjct: 272 GTAFRKDDFEYADKIVLELHNYERTATSCEDLKSSLWNAGFNALDTQNSSIVNIM----- 326
Query: 155 LLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGL 214
P+ ++EFG + N + Y +C L W +W + GSYY+R+G+
Sbjct: 327 ------PVQMTEFGFP-QDNTTYTDVYASCLREWLPSLQAGWMVWVIAGSYYIRKGIQDD 379
Query: 215 NEYYGLFDWNWCDIRNS 231
+E +GL D W D R++
Sbjct: 380 DELWGLLDHTWSDWRST 396
>gi|358375972|dbj|GAA92545.1| cellulase family protein [Aspergillus kawachii IFO 4308]
Length = 412
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 76/253 (30%), Positives = 117/253 (46%), Gaps = 23/253 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+SK WCCS +DGN +FGD YFN W +G MA + V + +RNELR
Sbjct: 152 VHLDNHVSKASWCCSTTDGNAWFGDTYFNVSNWHRGWKYMAEHVKSLPAVKSIGMRNELR 211
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN------ 106
P N + WY M A VH+ N ++L+ SGLN+D LS +
Sbjct: 212 SPDDNTTLADDTYNWETWYSNMVENANQVHSVNSDLLLFFSGLNYDVTLSPIPTASDLGD 271
Query: 107 ----QAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL 162
+ + F K+V E H Y + + G + + + P+
Sbjct: 272 GTVFKKSDFDFEDKIVLELHNYDSSATSCSSLSSSLLSDGF---DALETDDSSIVNVLPV 328
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++EFG + + ++ Y +C + W +W L GSYY+R+G+ +E +GL D
Sbjct: 329 VMTEFGYE-QDDSTYTGVYASCLREWLPSVHAGWMIWVLSGSYYIRQGIQDYDETWGLLD 387
Query: 223 WNWCDIRNSSFLE 235
W D R++ ++
Sbjct: 388 HTWSDWRSTEAID 400
>gi|402080598|gb|EJT75743.1| beta-1,6-galactanase [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 428
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/258 (31%), Positives = 125/258 (48%), Gaps = 36/258 (13%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHIS WCC +SD N F+GD F+P W +GL MA ++ +LRNE R
Sbjct: 170 VHLDNHISTGKWCCGSSDDNAFWGDLAFHPGNWTRGLAYMAAHGKAWPALMSSALRNEPR 229
Query: 62 ----GPKQNVKDW---YRYMQLGAEAVHAANPEVLVILSGLNFDKDLS-FVRNQAVN--- 110
GP + +W Y+Y + GA A++ ANP +L+ +SGLN+D ++ VR QA++
Sbjct: 230 EPSSGPAKANYNWETLYKYAKQGAAAINKANPNLLIFISGLNYDTFVTPIVRGQALSPGQ 289
Query: 111 --------LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQ---- 158
F KLV E H Y + NQ + N + +GF
Sbjct: 290 TRFARSDFAGFENKLVLEIHNY-----------DNNQGSCSALRNNLYNNGFEAHSDSAV 338
Query: 159 -GWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
+P+ ++E+G + Y C A+ + +W +VGSYY R G +E
Sbjct: 339 NKFPVLLTEYGFAMDATTW-QRVYATCIAQFTADNKAGFFIWVIVGSYYTRSGTQDFDEA 397
Query: 218 YGLFDWNWCDIRNSSFLE 235
+G+ + +W D RN S+++
Sbjct: 398 WGVLNHDWSDWRNPSYVQ 415
>gi|238488229|ref|XP_002375352.1| cellulase family protein [Aspergillus flavus NRRL3357]
gi|220697740|gb|EED54080.1| cellulase family protein [Aspergillus flavus NRRL3357]
Length = 412
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 82/257 (31%), Positives = 118/257 (45%), Gaps = 39/257 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNHISK WCCS +DGN +FGD+YF+ W +G MA + V + +RNELR
Sbjct: 152 VHLDNHISKAMWCCSGTDGNTWFGDRYFDVAKWHRGWQYMAEHVKSLPAVKSVGMRNELR 211
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAV--- 109
+ N + WYR+M A +HAAN +L+ SGL++D LS + A
Sbjct: 212 KAEDNPTLVNTTYNWRYWYRHMVENANQIHAANRNLLIYFSGLDYDTRLSPIPTGAELGN 271
Query: 110 -------NLTFTGKLVFEAHWYGFT-------DGQAWVDG-NPNQVCGRVVDNVMRLSGF 154
+ + K+V E H Y T W G N + N+M
Sbjct: 272 GTAFRKDDFEYADKIVLELHNYERTATSCEDLKSSLWNAGFNALDTQNSSIVNIM----- 326
Query: 155 LLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGL 214
P+ ++EFG + N + Y +C L W +W + GSYY+R+G
Sbjct: 327 ------PVQMTEFGFP-QDNTTYTDVYASCLREWLPSLQAGWMVWVIAGSYYIRKGTQDD 379
Query: 215 NEYYGLFDWNWCDIRNS 231
+E +GL D W D R++
Sbjct: 380 DELWGLLDHTWSDWRST 396
>gi|317028141|ref|XP_001390007.2| cellulase family protein [Aspergillus niger CBS 513.88]
gi|350632615|gb|EHA20982.1| hypothetical protein ASPNIDRAFT_194447 [Aspergillus niger ATCC
1015]
Length = 412
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 115/254 (45%), Gaps = 23/254 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+SK WCCS +DGN +FGD YFN W +G MA + V + +RNELR
Sbjct: 152 VHLDNHVSKASWCCSTTDGNAWFGDTYFNVSNWHRGWKYMAEHVKSLPAVKSIGMRNELR 211
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN------ 106
P N + WY M A VH+ N ++L+ SGLN+D LS +
Sbjct: 212 SPDDNTTLADDTYNWETWYSNMVENANQVHSVNSDLLLFFSGLNYDVTLSPIPTASDLGD 271
Query: 107 ----QAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL 162
+ + F K+V E H Y + + G + + + + P+
Sbjct: 272 GTVFKKSDFDFADKIVLELHNYDSSATSCSSLSSSLLSDGF---DALETNDSSIVNVLPV 328
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++EFG + + Y +C + W +W L GSYY+REG+ +E +GL D
Sbjct: 329 VMTEFGY-AQDDTTYTEVYASCLREWLPSVHAGWMIWVLSGSYYVREGIQDYDETWGLLD 387
Query: 223 WNWCDIRNSSFLER 236
W R++ ++
Sbjct: 388 HTWSGWRSTEAIDN 401
>gi|134057680|emb|CAK38078.1| unnamed protein product [Aspergillus niger]
Length = 426
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 115/254 (45%), Gaps = 23/254 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+SK WCCS +DGN +FGD YFN W +G MA + V + +RNELR
Sbjct: 152 VHLDNHVSKASWCCSTTDGNAWFGDTYFNVSNWHRGWKYMAEHVKSLPAVKSIGMRNELR 211
Query: 62 GPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN------ 106
P N + WY M A VH+ N ++L+ SGLN+D LS +
Sbjct: 212 SPDDNTTLADDTYNWETWYSNMVENANQVHSVNSDLLLFFSGLNYDVTLSPIPTASDLGD 271
Query: 107 ----QAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL 162
+ + F K+V E H Y + + G + + + + P+
Sbjct: 272 GTVFKKSDFDFADKIVLELHNYDSSATSCSSLSSSLLSDGF---DALETNDSSIVNVLPV 328
Query: 163 FVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFD 222
++EFG + + Y +C + W +W L GSYY+REG+ +E +GL D
Sbjct: 329 VMTEFGY-AQDDTTYTEVYASCLREWLPSVHAGWMIWVLSGSYYVREGIQDYDETWGLLD 387
Query: 223 WNWCDIRNSSFLER 236
W R++ ++
Sbjct: 388 HTWSGWRSTEAIDN 401
>gi|367048791|ref|XP_003654775.1| glycoside hydrolase family 5 protein [Thielavia terrestris NRRL
8126]
gi|347002038|gb|AEO68439.1| glycoside hydrolase family 5 protein [Thielavia terrestris NRRL
8126]
Length = 426
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/260 (34%), Positives = 125/260 (48%), Gaps = 41/260 (15%)
Query: 4 LDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGP 63
LDNH+SK WCCS++DGN ++GD YF+ W +GL MA + M LRNE R P
Sbjct: 168 LDNHVSKAMWCCSSTDGNSWWGDTYFSVANWTRGLAYMANHGKSWPALTSMGLRNEPREP 227
Query: 64 KQNV---------KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT-- 112
N + WY YM+ GA AVHAANP L+ LSGL+FD L+ V Q LT
Sbjct: 228 TNNAALDRSSYNWQSWYEYMRQGAAAVHAANPAPLIFLSGLSFDTFLTPVV-QGTALTPG 286
Query: 113 -----------FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-- 159
+ KLV E H Y + G C + N++R GF QG
Sbjct: 287 SAVFSRADFPGYADKLVLELHNYENSIGS----------CASLQSNLLR-DGFQALQGGS 335
Query: 160 ----WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLN 215
+P+ ++EFG + + Y +C W +W L GSYY+R G+
Sbjct: 336 GINVFPVMMTEFGFQMDASTWR-GVYASCLASFLPAQKAGWLIWVLAGSYYIRSGIQDYE 394
Query: 216 EYYGLFDWNWCDIRNSSFLE 235
E +GL + +W R+ S+++
Sbjct: 395 EGWGLLNHDWSAWRSPSYVD 414
>gi|402080301|gb|EJT75446.1| hypothetical protein GGTG_05380 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 476
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 125/286 (43%), Gaps = 57/286 (19%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
I D HI K WCCS+ DGN +F D YFN W +GL +A NVVG+SLRNELR
Sbjct: 182 AIPDVHIHKAMWCCSHIDGNAWFDDVYFNTSNWKRGLAYVANFAKSHPNVVGLSLRNELR 241
Query: 62 GPKQNVKDWYRY------MQLGAEAVHAANPEVLVILSGLNFDKDLS------------- 102
+Y + M GA+A+HAANP++L+ SG+ +D+DLS
Sbjct: 242 ESWNRTDLYYNWQTLVGNMTAGADAIHAANPDLLITWSGMQYDQDLSALTARRNLLSAPC 301
Query: 103 ----FVRNQAVNLTF---------TGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVM 149
VR+ A KLV+E H Y ++ Q C V ++
Sbjct: 302 YRCTAVRDAARREPLYFDLDAHPWADKLVWELHMYHMSEDQ------DTGTCEAVEASLY 355
Query: 150 R----------------LSGFLLEQGW---PLFVSEFGADLRGNNVNDNRYLNCFFGVAA 190
R L G+ E P+ SEFG + + + C G
Sbjct: 356 RNGFNALGMDAPKPGCDLPGYDCEPAHRQTPVIFSEFGYEQNAATLPRDTLQACLRGFLT 415
Query: 191 ELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
+ +W +W L GSY +R G G+++ +GL +++ R+ +E+
Sbjct: 416 KHKVNWMVWALAGSYRIRSGGQGVDDTWGLTTYDYSAWRSPELIEK 461
>gi|169606922|ref|XP_001796881.1| hypothetical protein SNOG_06513 [Phaeosphaeria nodorum SN15]
gi|160707111|gb|EAT86344.2| hypothetical protein SNOG_06513 [Phaeosphaeria nodorum SN15]
Length = 433
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 123/272 (45%), Gaps = 43/272 (15%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRN---VVGMSLR 57
++ LDNH+SK WCC+++D NG+FG +YFN + W +GL MA +G N + LR
Sbjct: 157 IIHLDNHMSKAFWCCADNDENGWFGSKYFNVENWKRGLAFMAK--HGKENWPTFASIGLR 214
Query: 58 NELRGPKQNVK----DWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFV--------- 104
NELR WY +M A+AVH A+P+ L+ SGL++D + +
Sbjct: 215 NELRNAANAEPIDWYTWYIHMTAAADAVHKADPDALIFFSGLSYDTYIDPIPLGKTLNGT 274
Query: 105 ----------RNQAVNLTFTGKLVFEAHWYGFTDGQ---AWVDGNPNQVCGRVVDNVMRL 151
R + + K+V E H Y F Q A Q + V+
Sbjct: 275 AGTSTAGKTARFVPSSFAYKDKIVLEIHKYDFEATQDDCATFKRKWYQKGFQAVNPQDPA 334
Query: 152 SGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL---- 207
+ +L P+ +SE+G G N Y C + E W W L GS+Y+
Sbjct: 335 TKYLF----PMVISEWGFISNGEYWNQTTYAKCLVEMVKEYKVSWQHWELSGSFYIQTRP 390
Query: 208 ----REGVIGLNEYYGLFDWNWCDIRNSSFLE 235
+E + G E++GL ++NW IR+ LE
Sbjct: 391 GRMPKETLQGSEEFWGLLNYNWTAIRSPITLE 422
>gi|389625153|ref|XP_003710230.1| beta-1,6-galactanase [Magnaporthe oryzae 70-15]
gi|351649759|gb|EHA57618.1| beta-1,6-galactanase [Magnaporthe oryzae 70-15]
gi|440464814|gb|ELQ34179.1| beta-1,6-galactanase [Magnaporthe oryzae Y34]
gi|440490556|gb|ELQ70101.1| beta-1,6-galactanase [Magnaporthe oryzae P131]
Length = 480
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 132/289 (45%), Gaps = 62/289 (21%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
I D H+ K WCCS+ DGN +FGD +F+ + W +GL+ +A NV+G+SLRNELR
Sbjct: 184 AIPDVHVHKAQWCCSHIDGNAWFGDVHFDVENWKRGLSYVAGWAKAHPNVLGLSLRNELR 243
Query: 62 GPKQNVKDWYRY------MQLGAEAVHAANPEVLVILSGLNFDKDLSFV---RN------ 106
+Y + M GA+A+HAANP++L+ SG+ +D+DLS + RN
Sbjct: 244 ESWNRTDLYYNWQTLVGNMTAGADAIHAANPDLLITWSGMQYDQDLSALTAGRNLLSAPC 303
Query: 107 --------------QAVNL---TFTGKLVFEAHWYGFTDGQAWVDGN------------- 136
Q +L ++ KLV+E H Y ++ Q D +
Sbjct: 304 YRCTAIRDAGRRQPQVFDLDAHPWSDKLVWELHMYHMSEDQDTGDCDVIEANLYRNGFNA 363
Query: 137 ---------PNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFG 187
N+ G + +RL+ P+ SEFG + + ++ C G
Sbjct: 364 LGIDAPAACANETIGLDCEKAVRLT--------PVIFSEFGYEQNVTTLPNDVLQKCLRG 415
Query: 188 VAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
+ W W++ GSY R+G G + +GL ++++ R+ +ER
Sbjct: 416 FNTKHKVSWMTWSIAGSYRSRQGRQGFEDTWGLTNYDFSGWRDEELVER 464
>gi|388499420|gb|AFK37776.1| unknown [Lotus japonicus]
Length = 99
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 59/98 (60%), Positives = 71/98 (72%), Gaps = 5/98 (5%)
Query: 312 PAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCD 371
PAKL IIC+ + WE+ISDSK+HLS K +NG +VCLDVD +N IVTN CKCLSRD CD
Sbjct: 2 PAKLSIICSGPNNKWEMISDSKLHLSFKVNNGFSVCLDVDENNNIVTNFCKCLSRDVKCD 61
Query: 372 PASQWFKLVDSTRSSTTTKSFFQFNPILDLPGKDFIWK 409
P SQWFKL+DS R S +T S +L+LP D +WK
Sbjct: 62 PGSQWFKLIDSGRRSMSTTSALS---MLNLP--DLLWK 94
>gi|398397119|ref|XP_003852017.1| hypothetical protein MYCGRDRAFT_71681 [Zymoseptoria tritici IPO323]
gi|339471898|gb|EGP86993.1| hypothetical protein MYCGRDRAFT_71681 [Zymoseptoria tritici IPO323]
Length = 409
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 124/260 (47%), Gaps = 39/260 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+SK WCC+ +DGN +FGD F+ W +GL MA + + LRNELR
Sbjct: 152 VHLDNHMSKGKWCCNGNDGNTWFGDSEFDVSKWQRGLAYMADHGKQWSSFASIGLRNELR 211
Query: 62 -----GPKQ--NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQA------ 108
P Q + WY M AEAV+ NP +L+ SGL++DKD++ + N+
Sbjct: 212 HHDSSSPAQPYDWAHWYEAMVPAAEAVYKTNPNILIFFSGLDYDKDITALVNKQHLGSGH 271
Query: 109 ----VNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGF----LLEQGW 160
+ F ++V E H Y N + C + + +GF + G+
Sbjct: 272 YFDPRSFDFGNRVVMELHNY----------QNNAKTCDSITSG-LNHAGFNAMDVNNSGY 320
Query: 161 ---PLFVSEFGADLRGNNVNDNRYLNC---FFGVAAELDWDWALWTLVGSYYLREGVIGL 214
P+ ++EFG D G + N + Y C F W W L GSYY+REG
Sbjct: 321 YHVPVVLTEFGFDQNGKDYN-SVYAQCIKTFMTGQPGGPGGWMQWALGGSYYIREGTQDF 379
Query: 215 NEYYGLFDWNWCDIRNSSFL 234
+E +GL++ +W R+++ +
Sbjct: 380 DETWGLYNHDWSAWRDANAI 399
>gi|453082703|gb|EMF10750.1| glycoside hydrolase family 5 protein [Mycosphaerella populorum
SO2202]
Length = 427
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 121/258 (46%), Gaps = 29/258 (11%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+V LDNH+SK WCCS DGN +FG ++FN + W +G MA V LRNEL
Sbjct: 156 LVHLDNHVSKGEWCCSTGDGNAWFGSEFFNVENWRRGNAFMANHAKSWPAYVSQGLRNEL 215
Query: 61 RGPKQNV-----KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT--- 112
R P + WY+ + A+AV+AANP+ L+ SGL+FD DL+ V + LT
Sbjct: 216 RDPNNTALNYGWESWYQNVIPTADAVNAANPDPLIFYSGLHFDTDLANV-TAGIPLTPDG 274
Query: 113 ---------FTGKLVFEAHWYGFTDGQAWVDGN-PNQVCGRVVDNVMRLSGFLLEQGW-P 161
+ K+VFE H Y A D N N + N M +S + + P
Sbjct: 275 SVFDISDFCYADKIVFELHNY----NNALADANCANFNLTKSGYNAMDISSTSTAKNFAP 330
Query: 162 LFVSEFGADLRGNNVNDNRYLNCFFGVAAEL----DWDWALWTLVGSYYLREGVIGLNEY 217
+ ++EFG N Y C W W + GSYY+REGV +E
Sbjct: 331 VVLTEFGFPQTPTNYTL-PYAQCIKEYLTTALPGGPGGWIHWAVGGSYYIREGVQDNDET 389
Query: 218 YGLFDWNWCDIRNSSFLE 235
+GL + +W R+ + +E
Sbjct: 390 WGLLNHDWSAWRSPAAVE 407
>gi|391866656|gb|EIT75924.1| hypothetical protein Ao3042_07759 [Aspergillus oryzae 3.042]
Length = 430
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 122/275 (44%), Gaps = 57/275 (20%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMAT-IFNGVRN---------- 50
V LDNHISK WCCS +DGN +FGD+YF+ W +G MA +F R+
Sbjct: 152 VHLDNHISKAMWCCSGTDGNTWFGDRYFDVAKWHRGWQYMAEHVFKTPRSPASSVTDATT 211
Query: 51 -------VVGMSLRNELRGPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSG 94
V + +RNELR + N + WYR+M A +HAAN +L+ SG
Sbjct: 212 QVKSLPAVKSVGMRNELRKAEDNPTLVNTTYNWRYWYRHMVENANQIHAANRNLLIYFSG 271
Query: 95 LNFDKDLSFVRNQAV----------NLTFTGKLVFEAHWYGFT-------DGQAWVDG-N 136
L++D LS + A + + K+V E H Y T W G N
Sbjct: 272 LDYDTRLSPIPTGAELGNGTAFRKDDFEYADKIVLELHNYERTATSCEDLKSSLWNAGFN 331
Query: 137 PNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDW 196
+ N+M P+ ++EFG + N + Y +C L W
Sbjct: 332 ALDTQNSSIVNIM-----------PVQMTEFGFP-QDNTTYTDVYASCLREWLPSLQAGW 379
Query: 197 ALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNS 231
+W + GSYY+R+G+ +E +GL D W D R++
Sbjct: 380 MVWVIAGSYYIRKGIQDDDELWGLLDHTWSDWRST 414
>gi|406865966|gb|EKD19006.1| putative glycosyl hydrolase family 5 protein/cellulase [Marssonina
brunnea f. sp. 'multigermtubi' MB_m1]
Length = 407
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 120/261 (45%), Gaps = 35/261 (13%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V LDNH+S WCCS DGN +FGD F+ W +GL MAT + + +RNELR
Sbjct: 153 VHLDNHMSSGYWCCSADDGNTWFGDTQFDVTKWKRGLEYMATHGKSWPALTSIGMRNELR 212
Query: 62 GPKQNVK--------DWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN------- 106
P N WY M ++ +HAANP L+ SGL++D L+ V
Sbjct: 213 KPSNNASLQATYGWPLWYTNMVSASQTIHAANPTPLIFFSGLDYDTTLTPVVEGTDLGSG 272
Query: 107 ---QAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG---- 159
+ + + K+V E H Y + G C + +N++R GF G
Sbjct: 273 VVFRKSDFPYADKIVLELHNYEGSVGS----------CSTLQENLVRY-GFSTLSGTKAN 321
Query: 160 -WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYY 218
PL ++E+G + + ++ Y +C W W + GSYY+R G ++ +
Sbjct: 322 TLPLLLTEWGHN-QMDSSGTGVYASCLRSYLPAQKVGWFYWVVAGSYYIRSGTQDYDDAW 380
Query: 219 GLFDWNWCDIRNSSFLERISS 239
GL++ +W R++S + + +
Sbjct: 381 GLYNHDWSGWRSASNIAAMKT 401
>gi|83770154|dbj|BAE60287.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 430
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 122/275 (44%), Gaps = 57/275 (20%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMAT-IFNGVRN---------- 50
V LDNHISK WCCS +DGN +FGD+YF+ W +G MA +F R+
Sbjct: 152 VHLDNHISKAMWCCSGTDGNTWFGDRYFDVAKWHRGWQYMAEHVFKTPRSLASSVTDATT 211
Query: 51 -------VVGMSLRNELRGPKQNV---------KDWYRYMQLGAEAVHAANPEVLVILSG 94
V + +RNELR + N + WYR+M A +HAAN +L+ SG
Sbjct: 212 QVKSLPAVKSVGMRNELRKAEDNPTLVNTTYNWRYWYRHMVENANQIHAANRNLLIYFSG 271
Query: 95 LNFDKDLSFVRNQAV----------NLTFTGKLVFEAHWYGFT-------DGQAWVDG-N 136
L++D LS + A + + K+V E H Y T W G N
Sbjct: 272 LDYDTRLSPIPTGAELGNGTAFRKDDFEYADKIVLELHNYERTATSCEDLKSSLWNAGFN 331
Query: 137 PNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDW 196
+ N+M P+ ++EFG + N + Y +C L W
Sbjct: 332 ALDTQNSSIVNIM-----------PVQMTEFGFP-QDNTTYTDVYASCLREWLPSLQAGW 379
Query: 197 ALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNS 231
+W + GSYY+R+G+ +E +GL D W D R++
Sbjct: 380 MVWVIAGSYYIRKGIQDDDELWGLLDHTWSDWRST 414
>gi|452001078|gb|EMD93538.1| glycoside hydrolase family 5 protein [Cochliobolus heterostrophus
C5]
Length = 422
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 117/273 (42%), Gaps = 62/273 (22%)
Query: 4 LDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGP 63
LDNH+SK WCC +DGNG+FG++YF+ + W +GL MA SLRNELR
Sbjct: 160 LDNHVSKAFWCCGENDGNGWFGEKYFDVEKWKRGLAFMAK---------HASLRNELRSA 210
Query: 64 KQNVK----DWYRYMQLGAEAVHAANPEVLVILSGLNFD--------------------- 98
K WY +M A AVH A+P+ L+ SGL++D
Sbjct: 211 KDAEPVDWYTWYIHMTSAANAVHQADPDALIFFSGLSYDTYIDPIPLGKTLSGTAGTSTA 270
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGF--------TDGQAWVDGNPNQVCGRVVDNVM 149
K +FV + + K+V E H Y F T + W V
Sbjct: 271 NKKATFV---PTDFAWKSKIVLEIHKYDFEGTKDDCQTFKRKWYQNGFQAVNA------- 320
Query: 150 RLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLR- 208
S + +P+ +SE+G G N Y C + E W W L GS+YL+
Sbjct: 321 --SDPATKYVFPMVISEWGFINNGVYWNQTTYAKCLVEMVKEYKVGWQHWELCGSFYLQT 378
Query: 209 ------EGVIGLNEYYGLFDWNWCDIRNSSFLE 235
++G +E +G+ +++W IR+ LE
Sbjct: 379 RPNRQPNTLVGADEAWGILNYDWSAIRSPITLE 411
>gi|310790111|gb|EFQ25644.1| endo-beta-1 [Glomerella graminicola M1.001]
Length = 402
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 82/244 (33%), Positives = 107/244 (43%), Gaps = 40/244 (16%)
Query: 4 LDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVR--NVVGMSLRNELR 61
LDNH+SK GWCC+ DGNG+FGD YFN W++ L MA + + M LRNEL
Sbjct: 165 LDNHVSKAGWCCNPYDGNGWFGDTYFNTSNWVRSLAFMAEHGKAAQWPSFSSMGLRNELH 224
Query: 62 GP----------KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS------FVR 105
P W M GA A+H ANP+VL+ G FD D+S F
Sbjct: 225 DPFSAPPAPGLENTTWTTWKTRMVQGANAIHGANPDVLIFFGGRIFDFDISAPVQGRFGS 284
Query: 106 NQAVN-----LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVD-------NVMRLSG 153
N L F K VFE H Y Q VD C D N M ++
Sbjct: 285 EPGFNFSLAELPFRDKFVFEQHQY----DQGLVD----DACASYRDILTAFGSNAMTIAD 336
Query: 154 FLLEQGWPLFVSEFGAD-LRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVI 212
+ PL +SE+G + + + +++ C + W +W L GSYY REGV
Sbjct: 337 AGSNRA-PLVMSEWGHEQTDASGMYKDKFRRCLMDFMVDKQIGWMVWVLGGSYYTREGVQ 395
Query: 213 GLNE 216
+E
Sbjct: 396 DKDE 399
>gi|343426309|emb|CBQ69840.1| related to cellulase [Sporisorium reilianum SRZ2]
Length = 470
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 124/297 (41%), Gaps = 67/297 (22%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
++ LDNHIS+ WCCS+ DGN +FG+ FN D W +G MA N+V MSLRNEL
Sbjct: 162 LMHLDNHISRAIWCCSHDDGNAWFGNVDFNVDHWHRGHRYMANWAKSHSNIVSMSLRNEL 221
Query: 61 RGP------KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQA------ 108
R + N K W+ + AE +HA NP++L+ +SGL++D DLS + QA
Sbjct: 222 RKSALRPHLEYNWKTWWGNVSQAAENIHATNPDLLITISGLDYDIDLSALTTQANLLDAP 281
Query: 109 ----------------------VNLTF--TGKLVFEAHWYGFTDGQAWVDGNPNQVCGRV 144
N TF K V E H Y + G+ CG +
Sbjct: 282 YVNTDIGAIAHADSMPPVYADIENTTFGRAKKAVLELHAYKMST----YYGDHLDDCGAI 337
Query: 145 VDNVMRLSGFLLEQG--------------------------WPLFVSEFGADLRGNNVND 178
+ R L + P+ ++EFG D + +
Sbjct: 338 QAGLYRFGFNALGESARPAACTSGNPSADTDPYSCPPAKVTLPVLLTEFG-DAQDAGFAN 396
Query: 179 NRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
C + WA W L GSY +R+ V+ N+ +GL + W D ++ ++
Sbjct: 397 VTMQKCLREYTTKHKIGWAHWALAGSYRIRQNVVFFNDTWGLTNPTWSDYQSRETVD 453
>gi|451854914|gb|EMD68206.1| glycoside hydrolase family 5 protein [Cochliobolus sativus ND90Pr]
Length = 422
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/268 (30%), Positives = 119/268 (44%), Gaps = 52/268 (19%)
Query: 4 LDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGL---TKMATIFNGVRNVVGMSLRNEL 60
LDNH+SK WCC +DGNG+FG++YF+ + W +GL TK A SLRNEL
Sbjct: 160 LDNHVSKAFWCCGENDGNGWFGERYFDVEKWKRGLAFITKHA------------SLRNEL 207
Query: 61 RGPKQNVK----DWYRYMQLGAEAVHAANPEVLVILSGLNFD---------KDLSFVRNQ 107
R K WY +M A AVH A+P+ L+ SGL++D K LS
Sbjct: 208 RSAKGAEPVDWYTWYIHMTSAANAVHQADPDALIFFSGLSYDTYIDPIPLGKTLSGTAGT 267
Query: 108 AV----------NLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDN---VMRLSGF 154
+ + + K+V E H Y F + N + N + S
Sbjct: 268 STANKKATFIPSDFAWKNKIVLEIHKYDFEGTK----DNCQTFKRKWYQNGFQAVNASDP 323
Query: 155 LLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLR------ 208
+ +P+ +SE+G G N Y C + E W W L GS+YL+
Sbjct: 324 ATKYVFPMVISEWGFINNGVYWNQTTYAKCLVEMVKEYKVGWQHWELCGSFYLQTRPNRQ 383
Query: 209 -EGVIGLNEYYGLFDWNWCDIRNSSFLE 235
++G +E +G+ ++NW IR+ LE
Sbjct: 384 PNTLVGADEAWGILNYNWSAIRSPVTLE 411
>gi|429852659|gb|ELA27784.1| cellulase family protein [Colletotrichum gloeosporioides Nara gc5]
Length = 421
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 114/252 (45%), Gaps = 33/252 (13%)
Query: 4 LDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGV-RNVVGMSLRNELR- 61
LDNH+SK GWCC+ +DGNG+FGD YFN WI+ L+ MA ++ + LRNE+R
Sbjct: 164 LDNHMSKAGWCCNVADGNGWFGDAYFNTTNWIRSLSYMAEHGKANWKSFSSIGLRNEIRE 223
Query: 62 ---------GPKQNVKDWY---RYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ-A 108
P K W + M GA A++AANP+VL+ G D+ N
Sbjct: 224 RFSYDNPPVAPAIEPKTWETWGKRMVQGASAINAANPDVLIFFGGRLGGIDIEAPANGWY 283
Query: 109 VNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFG 168
+ + + +YG DG V + NV PL +SE+G
Sbjct: 284 AETSRSRRCRSRTRYYGMLDGWG------ADVMTKGQANVA-----------PLIMSEWG 326
Query: 169 ADLRGNNVN-DNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCD 227
D + + C + + W +W++ GSYY+REG + +++GL D W
Sbjct: 327 HDQSDTKREYEKNFHKCLMDLMVQRSVGWMVWSMGGSYYIREGKADVEDFWGLLDRTWSG 386
Query: 228 IRNSSFLERISS 239
R ++++ +
Sbjct: 387 YRGRDSIKKLQA 398
>gi|356515008|ref|XP_003526193.1| PREDICTED: TBC1 domain family member 15-like [Glycine max]
Length = 521
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/67 (74%), Positives = 57/67 (85%)
Query: 72 RYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQA 131
RYM GAEAVHAANP+VLVILSGLNFD LSF++++ V+LTF GKLVFE + YGFTDG A
Sbjct: 455 RYMVKGAEAVHAANPDVLVILSGLNFDTSLSFIQDRPVSLTFKGKLVFEVNRYGFTDGGA 514
Query: 132 WVDGNPN 138
WVDGN N
Sbjct: 515 WVDGNLN 521
>gi|388857140|emb|CCF49355.1| related to cellulase [Ustilago hordei]
Length = 468
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 125/295 (42%), Gaps = 65/295 (22%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
++ LDNHIS+ WCCS+ DGN +FGD F+ W +G MA N+V MSLRNEL
Sbjct: 162 LMHLDNHISRAIWCCSHDDGNAWFGDMDFDVSKWHRGHRYMAEWAKKHSNIVSMSLRNEL 221
Query: 61 R------GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQA------ 108
R G + N W+ + AE +HA NP +L+ +SGL++D DLS + QA
Sbjct: 222 RKSAVKPGLEYNWNTWWGNVSEAAENIHATNPNLLITISGLDYDIDLSALTTQANLYTAP 281
Query: 109 ----------------------VNLTF--TGKLVFEAHWYGFTDGQAWVDGNPNQVCGRV 144
N F K V E H Y + + D N CG +
Sbjct: 282 YVNTDMDRVANAESLPPVYADIANTKFGKANKAVLELHAYKMS--TYYQDHLEN--CGAI 337
Query: 145 VDNVMRLSGFLLEQG------------------------WPLFVSEFGADLRGNNVNDNR 180
+ R L + P ++EFG D + ++ +
Sbjct: 338 QAGLYRFGFNALGESARPDSCNNSTDFSDPNSCPPAKITLPTLLTEFG-DAQDSSYGNVI 396
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
C + WA W+L GSY++R+ V+ N+ +GL + W + ++ ++
Sbjct: 397 MQKCLRDFTTKNKIGWAHWSLAGSYHIRQNVVFNNDSWGLTNPTWTEYQSKETVD 451
>gi|330920733|ref|XP_003299126.1| hypothetical protein PTT_10061 [Pyrenophora teres f. teres 0-1]
gi|311327313|gb|EFQ92773.1| hypothetical protein PTT_10061 [Pyrenophora teres f. teres 0-1]
Length = 468
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 119/268 (44%), Gaps = 52/268 (19%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK 64
D H K WCCS+ DG +F D +FN W +GL+ +A N+ MSLRNELR
Sbjct: 180 DVHTGKAQWCCSHYDGAAWFDDLFFNATHWRRGLSYVANWAKDHPNIASMSLRNELRD-S 238
Query: 65 QNVKDWYR-------YMQLGAEAVHAANPEVLVILSGLNFDKDLS--------------- 102
NV D + M GA+A+HAANP+VL++ SG+ + +DLS
Sbjct: 239 YNVTDLHYNWQTLVGNMTAGADAIHAANPDVLILWSGMQYGQDLSALTSGKNILSAPCYK 298
Query: 103 --FVRNQAV---------NLTFTGKLVFEAHWYGFTDGQAWVDGNP-NQVCGRVVDNVMR 150
+RN A + + KLV+E H Y ++ VD + + V + N
Sbjct: 299 CTAIRNAARLEPVHFNIDDHAWADKLVWELHLYKMSED---VDTDRCDIVKASLYRNGFN 355
Query: 151 LSGFLLEQG-------------WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWA 197
G + P+ +SEFG ND NC E + W
Sbjct: 356 ALGIDAPEACNITNDCPKAVRETPVILSEFGTAQDETLFNDT-LQNCLKEYTVENNVSWM 414
Query: 198 LWTLVGSYYLREGVIGLNEYYGLFDWNW 225
+W + GSY +R GV GL + +G+ +++W
Sbjct: 415 MWAIAGSYRIRSGVQGLPDTWGMTNYDW 442
>gi|302417628|ref|XP_003006645.1| cellulase family protein [Verticillium albo-atrum VaMs.102]
gi|261354247|gb|EEY16675.1| cellulase family protein [Verticillium albo-atrum VaMs.102]
Length = 374
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 78/269 (28%), Positives = 124/269 (46%), Gaps = 43/269 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRN---VVGMSLRN 58
+I++NH+SK WCCS DGNG+F D YF+ W + L M + +N SLRN
Sbjct: 98 MIVNNHVSKAVWCCSPDDGNGWFDDTYFDVVKWKRSLDFMTR--HAKQNWPAATAHSLRN 155
Query: 59 ELRGPKQNVKD------WYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ----- 107
ELR D WY M A V+AA+P+ LVI SGLN+D +L+ + +
Sbjct: 156 ELRIVNSAANDTYGWSRWYDEMTDAAAIVNAADPDALVIYSGLNYDHELNPITTKLPLSE 215
Query: 108 -------AVNLTFTGKLVFEAHWYGFTDGQAWVDG---NPNQVCGRVVDNVMRLSGFLLE 157
+ + K+VFE H Y W+ +++ R + + +
Sbjct: 216 TDPRVFDLADFDYADKIVFELHTY-----DNWMTNCTFFQSELYNRGFNALDTSAESTSR 270
Query: 158 QGWPLFVSEFGADLRGNNVNDNRYLNCF---------FGVAAELD--WDWALWTLVGSYY 206
P+F SEFG G + Y C G A+ ++ W WT+ GSYY
Sbjct: 271 NIAPVFFSEFGFAQNGTDYL-GVYAQCLKDFITGDKRIGAASRIEGPIGWLQWTIGGSYY 329
Query: 207 LREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
+R+G+ ++++ + + +W + RN + L+
Sbjct: 330 MRQGIPDFDDWWAILNHDWSNWRNETVLK 358
>gi|443900016|dbj|GAC77343.1| hypothetical protein PANT_26c00006 [Pseudozyma antarctica T-34]
Length = 467
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 115/286 (40%), Gaps = 66/286 (23%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
++ LDNHIS+ WCCS+ DGN +F D FN + W +G + MA N+V MSLRNEL
Sbjct: 162 LMHLDNHISRAIWCCSHDDGNAWFDDVDFNVENWHRGHSYMANWAKKHSNIVSMSLRNEL 221
Query: 61 RGPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQA------- 108
R N + W+ + AE +HA NP++L+ +SGL++D DLS + QA
Sbjct: 222 RKSALRPLDYNWQTWWGNVSAAAENIHATNPDLLITISGLDYDIDLSALTTQANLLDAPY 281
Query: 109 ---------------------VNLTF--TGKLVFEAHWYGFTDG---------------- 129
N F K V E H Y +
Sbjct: 282 VNTDMDKIANAESMPPQYADIANTKFGKANKAVLELHAYKMSTYYQDHLEDCDAIQTGLY 341
Query: 130 ----QAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLN-- 183
A + + C D S + P ++EFG + D+ Y N
Sbjct: 342 RFGFNALGESARPEACNNSTDFADPYSCPSAKITLPTLLTEFG------DAQDSSYANVT 395
Query: 184 ---CFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWC 226
C + WA W L GSY +R+ V+ N+ +GL W
Sbjct: 396 MQKCLRDFTTKNKIGWAHWALAGSYRIRQNVVFNNDTWGLTTPTWS 441
>gi|189201748|ref|XP_001937210.1| beta-1,6-galactanase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984309|gb|EDU49797.1| beta-1,6-galactanase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 468
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 117/268 (43%), Gaps = 52/268 (19%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK 64
D H K WCCS+ DG +F D +FN W +GL+ +A N+ MSLRNELR
Sbjct: 180 DVHTGKAQWCCSHYDGAAWFDDLFFNATHWRRGLSYVANWAKDHPNIASMSLRNELRD-S 238
Query: 65 QNVKD----WYRY---MQLGAEAVHAANPEVLVILSGLNFDKDLS--------------- 102
NV D W M GA+A+H ANP++L++ SG+ + +DLS
Sbjct: 239 YNVTDLNYNWQTLVGNMTAGADAIHEANPDILILWSGMQYGQDLSALTSGKNILSAPCYK 298
Query: 103 --FVRNQAV---------NLTFTGKLVFEAHWYGFTDGQAWVDGNP-NQVCGRVVDNVMR 150
+RN A + + KLV+E H Y ++ VD + + V + N
Sbjct: 299 CTAIRNAARLEPVYFNVDDHAWADKLVWELHLYKMSED---VDTDRCDIVKASLYRNGFN 355
Query: 151 LSGFLLEQG-------------WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWA 197
G + P+ +SEFG ND NC E D W
Sbjct: 356 ALGIDAPEACNITNDCPKAVRETPVILSEFGTAQDETLFNDT-LQNCLREYTIENDVSWM 414
Query: 198 LWTLVGSYYLREGVIGLNEYYGLFDWNW 225
+W + GSY +R G+ G + +G+ +++W
Sbjct: 415 MWAIAGSYRIRSGIQGFPDTWGMTNYDW 442
>gi|429855728|gb|ELA30671.1| beta-1,6-galactanase [Colletotrichum gloeosporioides Nara gc5]
Length = 465
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 119/265 (44%), Gaps = 69/265 (26%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK 64
DNH+SK WCCSN DGN +F F D W++GL +A NV MSLRNELR P
Sbjct: 171 DNHVSKAMWCCSNIDGNAWFDSFNFPVDKWLRGLAYVARWAGKHPNVASMSLRNELR-PS 229
Query: 65 QNVKD---------WYRY---MQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT 112
N+ + W Y M GA A+H ANP++L+ SG+ FD+DLS + NQ +T
Sbjct: 230 YNITELADAPLQYNWQTYVGNMTAGAYAIHEANPDILISWSGMQFDEDLSGLINQKNLMT 289
Query: 113 --------------------------FTGKLVFEAHWY-GFTDGQA------------WV 133
F K+ +E H Y G T+ A W
Sbjct: 290 HDCYHCDTIKNGWVSDPVYFDLDEHPFKDKVFWELHMYDGLTETVATENCKLTEAELYWR 349
Query: 134 DGNP-----NQVCGRV---VDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCF 185
N + CG + ++ R++ P+ +SEFG + +++ N +++C
Sbjct: 350 GLNALGIESPEYCGLLDGKCESATRIT--------PVVISEFGWG-QDSSMLSNTFVSCM 400
Query: 186 FGVAAELDWDWALWTLVGSYYLREG 210
+ + +W +W L G Y +REG
Sbjct: 401 TNMTVKYGVNWMMWALPGIYRVREG 425
>gi|451997151|gb|EMD89616.1| glycoside hydrolase family 5 protein [Cochliobolus heterostrophus
C5]
Length = 469
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 120/272 (44%), Gaps = 60/272 (22%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK 64
D H K WCCS+ DG +F D +FN W +GL+ +A N+ MSLRNELR
Sbjct: 181 DVHTGKAQWCCSHIDGTAWFDDLHFNATHWRRGLSYVANWAKAHPNIASMSLRNELR-ES 239
Query: 65 QNVKDWYR-------YMQLGAEAVHAANPEVLVILSGLNFDKDLSFV------------- 104
NV D Y M GA+A+H ANP++L++ G+ + +DLS +
Sbjct: 240 WNVTDLYYNWDTLVGNMTAGADAIHEANPDILILWGGMQYAQDLSALTSGKNILTAPCYK 299
Query: 105 --------RNQAV-----NLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRL 151
R + V + + KLV+E H Y ++ VD C V N+ R
Sbjct: 300 CTAIRDAARREPVYFNLDDHAWADKLVWEIHLYSMSED---VD---TGTCDIVKANLYR- 352
Query: 152 SGF------------------LLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELD 193
+GF + P+ +SEFG+ + + +N NC +
Sbjct: 353 NGFNALGIDAPPACNVTGDCPKAVRETPVILSEFGS-AQDETLFNNTLQNCLKDYTIDNG 411
Query: 194 WDWALWTLVGSYYLREGVIGLNEYYGLFDWNW 225
W +W + GSY +R G+ G + +GL +++W
Sbjct: 412 VSWMMWAIAGSYRIRSGIQGFPDTWGLTNYDW 443
>gi|451852598|gb|EMD65893.1| glycoside hydrolase family 5 protein [Cochliobolus sativus ND90Pr]
Length = 469
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 119/271 (43%), Gaps = 58/271 (21%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK 64
D H K WCCS+ DG +F D +FN W +GL+ +A N+V MSLRNELR
Sbjct: 181 DVHTGKAQWCCSHIDGTAWFDDLHFNATHWRRGLSYVANWAKAHPNIVSMSLRNELRESW 240
Query: 65 QNVKDWYRY------MQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT------ 112
+Y + M GA+A+H ANP++L++ G+ + +DLS + + LT
Sbjct: 241 NVTNLYYNWDTLVSNMTAGADAIHEANPDLLILWGGMQYGQDLSALTSGKNILTAPCYKC 300
Query: 113 --------------------FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS 152
+ KLV+E H Y ++ +D C V N+ R +
Sbjct: 301 TAIRDAARREPVYFNLDDHPWADKLVWEIHLYSMSED---ID---TGTCDIVKANLYR-N 353
Query: 153 GF------------------LLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDW 194
GF + P+ +SEFG+ + + +N NC +
Sbjct: 354 GFNALGIDAPPACNITGDCPKAVRETPVILSEFGS-AQDETLFNNTLQNCLKDYTIDHGV 412
Query: 195 DWALWTLVGSYYLREGVIGLNEYYGLFDWNW 225
W +W + GSY +R G+ G + +G+ +++W
Sbjct: 413 SWMMWAIAGSYRIRSGIQGFPDTWGMTNYDW 443
>gi|396481222|ref|XP_003841187.1| hypothetical protein LEMA_P091170.1 [Leptosphaeria maculans JN3]
gi|312217761|emb|CBX97708.1| hypothetical protein LEMA_P091170.1 [Leptosphaeria maculans JN3]
Length = 467
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 125/284 (44%), Gaps = 62/284 (21%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK 64
D H K WCCS+ DG +F D +FN W +GL+ +A N+V MSLRNELR
Sbjct: 179 DVHTGKAQWCCSHIDGTAWFDDIHFNASDWRRGLSYVANWARSHPNIVSMSLRNELRESW 238
Query: 65 QNVKDWYRY------MQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT------ 112
+Y + M GA+A+H ANP++L++ SG+ + +D+S + + LT
Sbjct: 239 NVTNLYYNWQTLVGNMTQGADAIHEANPDILILWSGMQYGQDVSALTSGKNILTAPCYKC 298
Query: 113 --------------------FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS 152
+ K+V+E H Y ++ VD C + N+ R +
Sbjct: 299 TAIRDAARREPEYFNLDDHPWANKVVYEVHLYSMSED---VD---TGTCPIIQANLFR-N 351
Query: 153 GF--------------------LLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAEL 192
GF + E P+ +SEFG+ + + + NC +
Sbjct: 352 GFNALGIDAPAACNTTNDCPKAIRET--PVILSEFGS-AQDETLYNATLQNCLREFTIDN 408
Query: 193 DWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLER 236
W +W++ GSY +R GV G + +GL +++W + +E+
Sbjct: 409 GISWMMWSVAGSYRIRSGVQGFPDTWGLTNYDWTGWNYAEGIEK 452
>gi|71021601|ref|XP_761031.1| hypothetical protein UM04884.1 [Ustilago maydis 521]
gi|46100885|gb|EAK86118.1| hypothetical protein UM04884.1 [Ustilago maydis 521]
Length = 577
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 79/295 (26%), Positives = 123/295 (41%), Gaps = 65/295 (22%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
++ LDNHIS+ WCCS+ DGN +F D F+ + W +GL MA N+V MSLRNEL
Sbjct: 162 LMHLDNHISRAIWCCSHDDGNAWFDDLDFDVNKWHRGLRYMANWAKNHTNIVSMSLRNEL 221
Query: 61 RGP------KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLS------------ 102
R + N K W+ + AE +H+ NP++L+ +SGL++D DLS
Sbjct: 222 RKSAIEPNLEYNWKTWWGNISEAAEIIHSTNPDLLITISGLDYDIDLSALTGQTNLLDAP 281
Query: 103 FVRNQAVNLT------------------FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRV 144
+V N+T K V E H Y + + + CG +
Sbjct: 282 YVNKDMDNITNAESMPAMYADIESTKFGKAKKAVLELHAYKMSTYYQ----DHLEDCGAI 337
Query: 145 VDNVMRL-----------------SGFLLEQGWP-------LFVSEFGADLRGNNVNDNR 180
+ R +GF P + ++EFG D + + +
Sbjct: 338 QAGLYRFGFNALGKSARPQACSNSAGFSDPYSCPDAKVLLPVLLTEFG-DAQDSGYANVT 396
Query: 181 YLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLE 235
C WA W L GSY +R+ V N+ +GL + W + ++ ++
Sbjct: 397 MQKCLRDFTTRNKIGWAHWALAGSYRIRQNVGFFNDTWGLTNPTWTEYQSKDTVD 451
>gi|398394515|ref|XP_003850716.1| Exo-beta-1,3-glucanase [Zymoseptoria tritici IPO323]
gi|339470595|gb|EGP85692.1| Exo-beta-1,3-glucanase [Zymoseptoria tritici IPO323]
Length = 446
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 118/272 (43%), Gaps = 52/272 (19%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR--- 61
D H+ K WCC+N+DGN +F D F W +GL +A RNVV M+LRNELR
Sbjct: 158 DVHVGKAQWCCNNTDGNAWFDDYNFPVANWKRGLQFVAEWAKDHRNVVSMALRNELRRAI 217
Query: 62 ---GPKQNVKDWYRYMQL------GAEAVHAANPEVLVILSGLNFDKDLS---------- 102
P + Y ++ L +A+H NP++LV SG+ +D+DL+
Sbjct: 218 NETAPTSTIG--YNWLSLVGNNTAATDAIHEVNPDILVSWSGMQYDQDLAALTTGLDLNT 275
Query: 103 ------------FVRNQAV----NLTFTGKLVFEAHWYGFTDG---------QAWVDGNP 137
+ R+ + + + K+V+E H Y ++ QA +
Sbjct: 276 APCYKCDVVRDGYRRDPIIFNLDDHPWADKVVYELHLYSMSEDLDTGSCPLIQAQFYSSG 335
Query: 138 NQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWD 195
G ++G +E + P+ +SEFG ND C + +
Sbjct: 336 FNAMGIDKPEACNITGDCVEAVRQTPVILSEFGWAQDETLFNDT-LQTCLRDFTTQNNVS 394
Query: 196 WALWTLVGSYYLREGVIGLNEYYGLFDWNWCD 227
WA+W GSY +R G G+ + +GL ++N+ D
Sbjct: 395 WAVWAFAGSYRVRSGGQGVPDTWGLTNYNFSD 426
>gi|429849867|gb|ELA25202.1| cellulase family protein [Colletotrichum gloeosporioides Nara gc5]
Length = 486
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 115/265 (43%), Gaps = 59/265 (22%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK 64
D H+SK WCCS++DGNG+F D YFN W +GL+ +A N+V MSLRNE+R
Sbjct: 211 DVHVSKAMWCCSHTDGNGWFDDVYFNTTHWRRGLSYVANWARDHPNIVSMSLRNEVRESW 270
Query: 65 QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT------------ 112
+Y + L ANP+VL++ G+ + +DLS + + LT
Sbjct: 271 NRTDLYYNWDTL-------ANPDVLILWGGMQYGQDLSALTSGKNYLTAPCYKCTAIRDA 323
Query: 113 --------------FTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGF---- 154
+ KLV+E H Y ++ VD C +++ SGF
Sbjct: 324 ARREPKVFDVDAHAWADKLVWELHLYKMSED---VD---TGTCP-IIEAGFYRSGFNALG 376
Query: 155 --------------LLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWT 200
+ P+ +SEFG + + + + ++ C + W +W
Sbjct: 377 IDKPAACNVTGDCPKASRLTPVILSEFG-NAQDDTMRNDTLQVCLREFTVKNKISWMMWG 435
Query: 201 LVGSYYLREGVIGLNEYYGLFDWNW 225
L GSY +R G G+ + + L ++ W
Sbjct: 436 LAGSYRIRSGAQGVFDGWALGNYEW 460
>gi|383170168|gb|AFG68326.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170172|gb|AFG68328.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170174|gb|AFG68329.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170176|gb|AFG68330.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170178|gb|AFG68331.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170180|gb|AFG68332.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170182|gb|AFG68333.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170186|gb|AFG68335.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170188|gb|AFG68336.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170192|gb|AFG68338.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170194|gb|AFG68339.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170196|gb|AFG68340.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
Length = 72
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 39/72 (54%), Positives = 55/72 (76%)
Query: 316 GIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQ 375
G+ C + W ++SDS++HLS+K ++GTTVCLD S ++VTN C CLS D +CDPASQ
Sbjct: 1 GMFCMGDNTQWSMMSDSQLHLSAKVEDGTTVCLDGSSMPSVVTNKCNCLSGDSSCDPASQ 60
Query: 376 WFKLVDSTRSST 387
WFK++ +TRSS+
Sbjct: 61 WFKIITTTRSSS 72
>gi|383170184|gb|AFG68334.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
gi|383170190|gb|AFG68337.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
Length = 72
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 39/72 (54%), Positives = 55/72 (76%)
Query: 316 GIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQ 375
G+ C + W ++SDS++HLS+K ++GTTVCLD S ++VTN C CLS D +CDPASQ
Sbjct: 1 GMFCMGDNTQWSMMSDSQLHLSAKVEDGTTVCLDGSSIPSVVTNKCNCLSGDSSCDPASQ 60
Query: 376 WFKLVDSTRSST 387
WFK++ +TRSS+
Sbjct: 61 WFKIITTTRSSS 72
>gi|322708830|gb|EFZ00407.1| cellulase [Metarhizium anisopliae ARSEF 23]
Length = 498
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 101/213 (47%), Gaps = 15/213 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF--------NPDLWIKGLTKMATIFNGVRNVVG 53
VI++NHI++ WCC + + + + + + D WI+ + T F V+G
Sbjct: 249 VIVNNHITRATWCCGANPCDAAWANDHLLGICKVTQSEDEWIRNWETVMTRFTNDPLVIG 308
Query: 54 MSLRNELRGPKQNVKDWYRYMQLGAE---AVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
+ LRNE+RG W+++ + + A NP+ LV++ G DL R + VN
Sbjct: 309 VDLRNEVRG-LWGTMPWHKWAEAAERCGNRLLAMNPDWLVVVEGTESANDLRGARKRPVN 367
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQG-WPLFVSEFG 168
L KLV+ AH Y ++ G + G ++ MR G+LLE+G P++V E G
Sbjct: 368 LDIANKLVYSAHVYAWS-GWGSLGGRFSKRTYESFRESMRGHWGYLLEEGVAPVWVGEIG 426
Query: 169 ADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
A +V D RY + E+D D+ W L
Sbjct: 427 APRLPASVGDVRYWKNLWRYLGEVDADFGYWAL 459
>gi|452982740|gb|EME82499.1| glycoside hydrolase family 5 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 466
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 113/271 (41%), Gaps = 54/271 (19%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR--- 61
D H+ K WCCS +DGN +F D F W +GL A NVV MSLRNE R
Sbjct: 175 DVHVGKAQWCCSKTDGNAWFDDYNFPVKNWHRGLQHTAEWAKKHPNVVSMSLRNEFRRNI 234
Query: 62 ---GPKQNVKDWYRYMQL------GAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT 112
P N Y ++ L +A++ NP++L+ SG+ FD+DLS + + +NL
Sbjct: 235 NLTDPTINYG--YNWVNLVGNSTAATDAIYQTNPDILISWSGMQFDQDLSALAS-GLNLN 291
Query: 113 ---------------------------FTGKLVFEAHWYGFTDGQ---------AWVDGN 136
+ K+ +E H Y ++ Q A + N
Sbjct: 292 TAPCYKCDAIRDGYTREPIVFDLDAHPWADKVFYELHMYSMSEDQDTDNCAITQAQLYQN 351
Query: 137 PNQVCGRVVDNVMRLSGFLLE--QGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDW 194
G ++G ++ + P+ ++EFG ND C +
Sbjct: 352 GYNALGIDPPAACNITGACMKPVRVTPVVMTEFGWSQDETLFNDT-LQGCIRNFTTQHGI 410
Query: 195 DWALWTLVGSYYLREGVIGLNEYYGLFDWNW 225
WA+W+L GSY +REG G+ + + L ++ W
Sbjct: 411 SWAMWSLAGSYRIREGGQGVPDTWALSNYEW 441
>gi|342882756|gb|EGU83355.1| hypothetical protein FOXB_06135 [Fusarium oxysporum Fo5176]
Length = 323
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 71/140 (50%), Gaps = 36/140 (25%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
++LDNHISK WCCS SDGN ++GD FN W++GL+ MAT
Sbjct: 163 ILLDNHISKAEWCCSGSDGNTWWGDSQFNVANWLRGLSYMAT------------------ 204
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ------------AV 109
WY+Y+Q GA+A++ AN +VLV LSGL +D ++ V Q A
Sbjct: 205 ------HTWYKYVQQGAKAINNANGDVLVYLSGLGYDTWITPVFTQTALTPGKEVFDKAA 258
Query: 110 NLTFTGKLVFEAHWYGFTDG 129
F+ KLV E H Y T G
Sbjct: 259 FAGFSDKLVLEIHNYEKTVG 278
>gi|444918299|ref|ZP_21238377.1| hypothetical protein D187_01097 [Cystobacter fuscus DSM 2262]
gi|444710195|gb|ELW51184.1| hypothetical protein D187_01097 [Cystobacter fuscus DSM 2262]
Length = 630
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 111/260 (42%), Gaps = 47/260 (18%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+VIL+NH +K WCC DGNGF+ D W +AT + G + V G LRNE+
Sbjct: 168 VVILNNHTTKSMWCCG-WDGNGFW-DSSQALQRWQDDWVMLATRYQGNKWVAGADLRNEV 225
Query: 61 R-----GPK---QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNF-------DKDLSFVR 105
R P +N DW+ Q + NP++L+++ +N+ L +R
Sbjct: 226 RPDGLDSPNWGMRNQHDWHMAAQTMGNLLLRTNPDLLIVVEAVNWWGLLDGARPQLKPMR 285
Query: 106 NQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRV--------VDNVMRLS----- 152
+ V L KLV+ H YG+T PNQ G + +D
Sbjct: 286 QRPVALLRGDKLVYAVHNYGYT--------GPNQSGGSLGSGPKYSDMDKATLHGTLDQE 337
Query: 153 -GFLLEQGW----PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
GF+LE P+++SEFG N + N + D DWA W + + +
Sbjct: 338 WGFVLEANQAYTAPVWMSEFGVGYNEQAANSRAWFNHLADYLIDKDVDWAYWA-INAAKI 396
Query: 208 REGVIGLNEYYGLF---DWN 224
+ V G E YGL+ DWN
Sbjct: 397 QNSVQGEEETYGLWSYPDWN 416
>gi|383170170|gb|AFG68327.1| Pinus taeda anonymous locus 0_4489_01 genomic sequence
Length = 72
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/72 (52%), Positives = 54/72 (75%)
Query: 316 GIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNTIVTNTCKCLSRDKTCDPASQ 375
G+ C + W ++SDS++HLS+K ++GTTVCLD S ++VTN C CLS D +CDPASQ
Sbjct: 1 GMFCMGDNTQWSMMSDSQLHLSAKVEDGTTVCLDGSSMPSVVTNKCNCLSGDSSCDPASQ 60
Query: 376 WFKLVDSTRSST 387
W K++ +TRSS+
Sbjct: 61 WLKIITTTRSSS 72
>gi|310800104|gb|EFQ34997.1| beta-1,6-galactanase [Glomerella graminicola M1.001]
Length = 458
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/280 (26%), Positives = 120/280 (42%), Gaps = 63/280 (22%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNG-VRNVVGMSLRNE-LRG 62
D H+SK WCCS++DGN +FGD +F+ W +GL +A N+V + L E L G
Sbjct: 181 DVHVSKASWCCSHTDGNAWFGDVHFDAARWRRGLAHVAGWARAHPNNMVWLDLGGETLVG 240
Query: 63 PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLT---------- 112
GA+A+HAANP+VL++ G+ + +DLS + + A LT
Sbjct: 241 ----------NFSAGADAIHAANPDVLILWGGMQYGQDLSALTSGANYLTAPCYKCTAIR 290
Query: 113 ----------------FTGKLVFEAHWYGFTDGQ-----AWVDG------------NPNQ 139
++ KLVFE H Y ++ Q A ++ +
Sbjct: 291 DAARREPRVFDIDAHAWSRKLVFELHLYKMSEDQDTGTCAVIEAGLYRNGFNALGIDTPA 350
Query: 140 VCGRVVD--NVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWA 197
CG + R S P+ SEFG + + + + + C + W
Sbjct: 351 ACGNATTGGDCPRASRLT-----PVIFSEFG-NGQDDTLWTDTLQTCLREYTVKHGVSWM 404
Query: 198 LWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERI 237
+W L GSY +R G G+ + +GL ++ W + + +E +
Sbjct: 405 VWGLAGSYRVRSGAQGVPDSWGLTNYEWNGWNHPATIEGV 444
>gi|115377032|ref|ZP_01464249.1| cellulase (glycosyl hydrolase family 5), putative [Stigmatella
aurantiaca DW4/3-1]
gi|310820368|ref|YP_003952726.1| cellulase [Stigmatella aurantiaca DW4/3-1]
gi|115365940|gb|EAU64958.1| cellulase (glycosyl hydrolase family 5), putative [Stigmatella
aurantiaca DW4/3-1]
gi|309393440|gb|ADO70899.1| Cellulase (Glycosyl hydrolase family 5) [Stigmatella aurantiaca
DW4/3-1]
Length = 624
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 113/267 (42%), Gaps = 45/267 (16%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+VIL+NH +K WCC DGNGF+ D W MA + G + V G LRNE+
Sbjct: 162 VVILNNHTTKSMWCCG-WDGNGFW-DSSQALQRWQDDWVMMAGRYQGNKWVAGADLRNEV 219
Query: 61 R-----GPK---QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNF-------DKDLSFVR 105
R P +N DW+ Q V NP++L+++ +N+ L VR
Sbjct: 220 RPDGLDSPNWGMRNQHDWHMAAQTMGNLVLRMNPDLLIVVEAVNWWGLLDGERPQLKPVR 279
Query: 106 NQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCG------RVVDNVMRLSGFLLEQG 159
+ + L KLV+ H YG+T PNQ G + D L+Q
Sbjct: 280 QRPIALLRGDKLVYAVHNYGYT--------GPNQSGGSLGSGPKYSDMDRPTLHGTLDQE 331
Query: 160 W------------PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
W P+++SEFG N + + + D DWA W + + L
Sbjct: 332 WGFVLAANQAYTAPVWMSEFGIGYNEQAANSRAWFSNLADYLIDKDVDWAYWA-INAAKL 390
Query: 208 REGVIGLNEYYGLFDW-NWCDIRNSSF 233
+ V G +E YGL+ + +W +R+ +
Sbjct: 391 QNSVQGEDETYGLWSYPDWSGVRSGDW 417
>gi|19572317|emb|CAD19084.1| cellulase [Stigmatella aurantiaca]
Length = 630
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 109/260 (41%), Gaps = 47/260 (18%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+VIL+NH +K WCC DGNGF+ D W +A + G + V G LRNE+
Sbjct: 168 VVILNNHTTKSMWCCG-WDGNGFW-DSSQALQRWQDDWVMLAARYQGNKWVAGADLRNEV 225
Query: 61 R-----GPK---QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNF-------DKDLSFVR 105
R P +N DW+ Q + NP++L+++ +N+ L VR
Sbjct: 226 RPDGLDSPNWGMRNQHDWHMAAQTMGNLLLRTNPDLLIVVEAVNWWGLLDGSRPQLKPVR 285
Query: 106 NQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCG------RVVDNVMRLSGFLLEQG 159
+ V L KLV+ H YG+T PNQ G + D L+Q
Sbjct: 286 QRPVALLRGDKLVYAVHNYGYT--------GPNQSGGSLGSGPKYSDMDKPTLYGTLDQE 337
Query: 160 W------------PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
W P+++SEFG N + + E D DWA W + + L
Sbjct: 338 WGFALAANQAYTAPVWMSEFGVGYNEQAANSRAWFSNLADYLIEKDVDWAYWA-INAAKL 396
Query: 208 REGVIGLNEYYGLF---DWN 224
+ V G +E YGL+ DWN
Sbjct: 397 QNSVQGEDETYGLWSYPDWN 416
>gi|169606616|ref|XP_001796728.1| hypothetical protein SNOG_06354 [Phaeosphaeria nodorum SN15]
gi|160707041|gb|EAT86185.2| hypothetical protein SNOG_06354 [Phaeosphaeria nodorum SN15]
Length = 459
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 74/268 (27%), Positives = 113/268 (42%), Gaps = 62/268 (23%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK 64
D H+ K WCCS+ DGN +GL+ +A NVV MSLRNELR
Sbjct: 181 DVHVGKAQWCCSHIDGNACGR----------RGLSYVAKWAQDHPNVVSMSLRNELR-ES 229
Query: 65 QNVKDWYRY-------MQLGAEAVHAANPEVLVILSGLNFDKDLS--------------- 102
NV D Y M GA+A+HAANP +L+I SG+ + +DLS
Sbjct: 230 WNVTDLYYNWETLVGNMTAGADAIHAANPNILIIWSGMQYGQDLSALTTGKNYLTAPCYK 289
Query: 103 --FVRNQA---------VNLTFTGKLVFEAHWYGFTDGQAWVD-GNPNQVCGRVVDNVMR 150
+R+ A + + KLV+E H Y ++ +D G + + N
Sbjct: 290 CTAIRDAARREPKVFDLASHAWADKLVWEIHLYSMSED---IDTGTCDIIKANFFRNGFN 346
Query: 151 LSGF-------------LLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWA 197
GF + P+ +SEFG + + ++ C E W
Sbjct: 347 AMGFDAPPACNVTGDCPKAVRETPVIISEFGW-AQDETLFNHTLTQCLKEYTLEHKISWM 405
Query: 198 LWTLVGSYYLREGVIGLNEYYGLFDWNW 225
+W++ GSY +R G+ G + +G+ + +W
Sbjct: 406 MWSVAGSYRIRSGIQGFVDTWGMTNSDW 433
>gi|322699119|gb|EFY90883.1| cellulase [Metarhizium acridum CQMa 102]
Length = 317
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 98/213 (46%), Gaps = 15/213 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF--------NPDLWIKGLTKMATIFNGVRNVVG 53
VI++NHI++ WCC + + + + Y + D WI+ + T F V+G
Sbjct: 68 VIVNNHITRATWCCGANPCDAAWANDYLLGICRVTQSEDEWIRNWETVMTRFTNNPLVIG 127
Query: 54 MSLRNELRGPKQNVKDWYRYMQLGAE---AVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
+ LRNE+RG W+++ + + A NP+ LV++ G DL R + V
Sbjct: 128 VDLRNEVRG-LWGTMPWHKWAEAAERCGNRLLAMNPDWLVVVEGTESANDLRGARRRPVK 186
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQG-WPLFVSEFG 168
L +LV+ AH Y ++ G + G ++ MR G+LLE P++V E G
Sbjct: 187 LDIADRLVYSAHVYAWS-GWGSLGGRFSKRTYESFRESMREHWGYLLEGDVAPVWVGEIG 245
Query: 169 ADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
A +V D RY + E+D D+ W L
Sbjct: 246 APRVPASVGDVRYWKNLWRYLGEVDADFGYWAL 278
>gi|380472044|emb|CCF46982.1| cellulase, partial [Colletotrichum higginsianum]
Length = 272
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 88/209 (42%), Gaps = 30/209 (14%)
Query: 54 MSLRNELRGP----------KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF 103
+ LRNELR P W M A AVHAANP++L+ SG FD D+S
Sbjct: 12 IGLRNELRDPFSGSPPASLENTTWTTWKTRMVQAANAVHAANPDLLIFFSGRIFDFDISA 71
Query: 104 -VRNQ----------AVNLTFTGKLVFEAHWYGFTDGQAWVD---GNPNQVCGRVVDNVM 149
V+ + L F K VFE H Y Q VD N + N M
Sbjct: 72 PVQGRFGSEPGFGFALAELPFRHKFVFEQHQYD----QGLVDDACANYRDILTAFGSNAM 127
Query: 150 RLSGFLLEQGWPLFVSEFGADLRGNN-VNDNRYLNCFFGVAAELDWDWALWTLVGSYYLR 208
++ + PL +SE+G D + +++ C + W +W + GSYY R
Sbjct: 128 TIADAGSNRA-PLVMSEWGHDQADEGGMFKDKFRRCLMDFMVDQQMSWMVWVMGGSYYTR 186
Query: 209 EGVIGLNEYYGLFDWNWCDIRNSSFLERI 237
EGV +E +GL D W R ++++
Sbjct: 187 EGVQDRDEPWGLLDHTWSSYRGKDSIKQL 215
>gi|256390071|ref|YP_003111635.1| glycoside hydrolase family protein [Catenulispora acidiphila DSM
44928]
gi|256356297|gb|ACU69794.1| glycoside hydrolase family 5 [Catenulispora acidiphila DSM 44928]
Length = 642
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 103/241 (42%), Gaps = 45/241 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP----DLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W M + + VVG LR
Sbjct: 174 VILNNHTTSYRWCC------GLDGNERWNSGQSTQQWESDWLFMVNRYRANKRVVGADLR 227
Query: 58 NELRGPKQNVKDW--------YRYMQLGAEAVHAANPEVLVILSGLNF-----------D 98
NE+R + +W Y + + AA+P++L+++ G+N+
Sbjct: 228 NEVRRDTWDDPNWGWYDAHDEYAAFEEAGNQILAADPDMLIVMEGINWYGIPAAGFSHGR 287
Query: 99 KDLSFVRNQAVNLTFTGKLVFEAHWYGFT---------------DGQAWVDGNPNQVCGR 143
L+ N + L + KLV+ AH+Y +T + D P+Q+
Sbjct: 288 PMLTPAANLSATLIASNKLVYSAHFYSYTGPNNSGAAAGSAGSTSDPRYEDMTPDQLASA 347
Query: 144 VVDNVMRLSGFLLEQGWPLFVSEFGADLRGN-NVNDNRYLNCFFGVAAELDWDWALWTLV 202
V + ++ P++VSEFGA RG + + +L+ F + A D D+A+W L+
Sbjct: 348 VNQEALFVTQSGQHFTAPVWVSEFGAAGRGETDTKEQTWLDTFTTILAANDTDFAIWPLI 407
Query: 203 G 203
G
Sbjct: 408 G 408
>gi|400595342|gb|EJP63147.1| cellulase-like protein [Beauveria bassiana ARSEF 2860]
Length = 504
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 103/235 (43%), Gaps = 16/235 (6%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF--------NPDLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + + + WI+ + V+G
Sbjct: 257 VIVNNHITTATWCCGANPCDAGWANDHLGALCSVRQTEESWIRNWEAVMGRLVANPLVIG 316
Query: 54 MSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE+RG + + W + E + A NP+ LV++ G D+S R + V L
Sbjct: 317 ADLRNEVRGLWGTMPWEKWAAAAERCGERLLAMNPDWLVVVEGTESANDVSGARRRPVTL 376
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGAD 170
+ KLV+ AH Y ++ +W + V + R +LLEQ P++V E GA
Sbjct: 377 SVKDKLVYSAHVYAWSGWGSWGGRFAQRGYDSFVATMRRSWLYLLEQDVAPVWVGELGAS 436
Query: 171 LRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNW 225
R + RY + + E+D D+ W + + + V E Y L + +W
Sbjct: 437 -RHPSRGGARYWQNLWRLLKEVDADFGYWAINPNKAYKSTV----ETYSLVESDW 486
>gi|345564719|gb|EGX47679.1| hypothetical protein AOL_s00083g187 [Arthrobotrys oligospora ATCC
24927]
Length = 518
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 65/244 (26%), Positives = 105/244 (43%), Gaps = 21/244 (8%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF--------NPDLWIKGLTKMATIFNGVRNVVG 53
VI++NHI++ WCC + + + + Y + WI+ K+ + V+G
Sbjct: 231 VIVNNHITQARWCCDANLCDATWANDYLGKVCRISQTEEQWIQNWEKVMRNYVDNPLVIG 290
Query: 54 MSLRNELRGP--KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE R P K W + A +HA P L+ + G+ L VR++ ++L
Sbjct: 291 ADLRNENRSPLGKMLWPSWATAAEKAANRLHALQPNWLMFVEGVASANYLQGVRSRPISL 350
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNP-NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGA 169
++V+ AH YG++ + + G ++ ++ GFLLE+ P++V EFGA
Sbjct: 351 PIPHRVVYSAHVYGWSGWGSLLKGPYWSRDYASFAYDMYDNWGFLLEENIAPVWVGEFGA 410
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNW---- 225
N D Y E+D DW W + R+ E Y L +W
Sbjct: 411 P-DVPNTGDVNYWTNLMNFLEEMDADWGYWAINP----RKPQGNETETYALVQDDWVTPK 465
Query: 226 CDIR 229
CD R
Sbjct: 466 CDYR 469
>gi|398345192|ref|ZP_10529895.1| cellulase [Leptospira inadai serovar Lyme str. 10]
Length = 634
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 65/249 (26%), Positives = 105/249 (42%), Gaps = 48/249 (19%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFF----GDQYFNP--DLWIKGLTKMATIFNGVRNVVGM 54
+V+L+NH + WCC D NG + +N D+W M + + VV
Sbjct: 177 VVVLNNHTTLAEWCCG-FDYNGLWYHTGSSLAYNSSTDIWQADWLMMVNRYKNNKMVVAA 235
Query: 55 SLRNELRGPKQ--------------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNF--- 97
LRNE+R ++ +V DW++ Q + +NP++L+I+ G+N+
Sbjct: 236 DLRNEVRTQRRGDSYIPDSPNWGSGDVNDWHKASQDLGGLITQSNPDLLIIVEGINWWGL 295
Query: 98 --------DKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVM 149
L ++ V+L + KLV+ AH YG+ +G+P + N M
Sbjct: 296 IPILGSGERPHLKPIKGLPVHLPLSNKLVYSAHNYGYIGPNN--NGDPKTSGNNITYNQM 353
Query: 150 RLSGF----LLEQGW----------PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWD 195
+ F E G+ P++VSEFGA N D ++L E D D
Sbjct: 354 DPTTFRNTVYSEWGYVVDPELYYTAPVWVSEFGASPSTTNNQDKQWLQNLVDYLIEKDLD 413
Query: 196 WALWTLVGS 204
+A W L G+
Sbjct: 414 FAFWPLNGN 422
>gi|452955476|gb|EME60874.1| glycoside hydrolase family protein [Amycolatopsis decaplanina DSM
44594]
Length = 612
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 99/240 (41%), Gaps = 44/240 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFN----PDLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W MA + + VVG L
Sbjct: 155 VILNNHTTTSRWCC------GLDGNERWNTAQTEQRWQDDWLFMARRYAANKRVVGADLY 208
Query: 58 NELR-----GPK---QNVKDWYRYMQLGAEAVHA-ANPEVLVILSGLNFD---------- 98
NE+R P N DW R Q A+ + ANP +L+I+ G+N+
Sbjct: 209 NEVRRNVFDDPNWGWGNATDWQRASQRVADRIQTEANPGLLLIVEGINWTGLPVDGLPHG 268
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFT-DGQAWVDGNPNQVCGRVVD-------NVM 149
L R + L +GKLV+ AH+YG+T + G R D +V+
Sbjct: 269 RPTLEPARTLSHTLVDSGKLVYAAHFYGYTGPNHSGATGTGETHDPRYRDLSPQELRDVL 328
Query: 150 RLSGFLL--EQG----WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVG 203
F + E G PL++SEFG + + F G AE D D+A W VG
Sbjct: 329 HRQAFFVSAETGKHYTAPLWISEFGEGRHTTDAASRAWFENFVGYLAETDTDFAYWPAVG 388
>gi|353239719|emb|CCA71619.1| related to cellulase-Streptomyces pristinaespiralis [Piriformospora
indica DSM 11827]
Length = 627
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 104/248 (41%), Gaps = 46/248 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VIL+NH +K WCC DGN + + W M +F VVG L NE+R
Sbjct: 160 VILNNHTNKNRWCCGVGDGNERWNESQTTQQ-WADDWVMMVKLFKDNPRVVGADLYNEVR 218
Query: 62 ------------GPKQNVKDWYRYMQ-LGAEAVHAANPEVLVILSGLNF----------- 97
GP DW Q G + ANP++L+I+ G+N+
Sbjct: 219 RDVLIDPNWGWGGPV----DWQAASQWAGDRILTEANPDILIIIEGINWVGLPVDGLPHG 274
Query: 98 DKDLSFVRNQAVNLTFTGKLVFEAHWYGFTD---------GQAWVDGNPNQVCGRVVDNV 148
LS + L + KLV+ AH+Y +T G+ + ++ D V
Sbjct: 275 RPTLSGAATISHTLVQSDKLVYAAHFYSYTGPNHSGAFGIGETHDPRYRDLNLQQLRDVV 334
Query: 149 MRLSGFLLEQ-----GWPLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLV 202
+ F++ Q PL++SEFG RG + +R + + F + D D+A W L+
Sbjct: 335 ESSATFVVNQPDKHFTRPLWISEFGIPGRGIISDADRNWFDNFLSILKTNDLDYAFWPLI 394
Query: 203 GSYYLREG 210
G YLR G
Sbjct: 395 G--YLRNG 400
>gi|398347173|ref|ZP_10531876.1| cellulase [Leptospira broomii str. 5399]
Length = 633
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 103/248 (41%), Gaps = 48/248 (19%)
Query: 2 VILDNHISKPGWCCSNSDGNGFF----GDQYFNP--DLWIKGLTKMATIFNGVRNVVGMS 55
V+L+NH + WCC D NG + +N D+W M + + VV
Sbjct: 177 VVLNNHTTLAEWCCG-FDYNGLWYHTGSSLAYNSSTDMWQADWLMMVNRYKNNKMVVAAD 235
Query: 56 LRNELRGPKQ--------------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNF---- 97
LRNE+R ++ +V DW++ Q + NP++LVI+ G+N+
Sbjct: 236 LRNEVRTQRRGDSYIPDSPNWGSGDVNDWHKASQDLGGLITQNNPDLLVIVEGINWWGLI 295
Query: 98 -------DKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMR 150
L ++ V+L + KLV+ AH YG+ +G+P + + M
Sbjct: 296 PILGSGERPHLKPIKGLPVHLPLSNKLVYSAHNYGYIGPNN--NGDPKTSGSNITYSQMD 353
Query: 151 LSGF----LLEQGW----------PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDW 196
+ F E G+ P++VSEFGA N D ++L E D D+
Sbjct: 354 PTTFRNTVYSEWGYVVDPELYYTAPVWVSEFGASPSTTNDQDKQWLQNLVDYLIERDLDF 413
Query: 197 ALWTLVGS 204
A W L G+
Sbjct: 414 AFWPLNGN 421
>gi|451338671|ref|ZP_21909201.1| Endoglucanase [Amycolatopsis azurea DSM 43854]
gi|449418655|gb|EMD24226.1| Endoglucanase [Amycolatopsis azurea DSM 43854]
Length = 617
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 99/240 (41%), Gaps = 44/240 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFN----PDLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W M + + VVG L
Sbjct: 160 VILNNHTTTSRWCC------GLDGNERWNTAQSEQQWQDDWLFMVRRYAANKRVVGADLY 213
Query: 58 NELR-----GPK---QNVKDWYRYMQLGAEAVHA-ANPEVLVILSGLNFD---------- 98
NE+R P N DW R Q A+ + ANP++L+I+ G+N+
Sbjct: 214 NEVRRNVFDDPNWGWGNGTDWQRASQRVADRIQTEANPDLLLIVEGINWTGLPVDGLPHG 273
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFT-DGQAWVDGNPNQVCGRVVD-------NVM 149
L R + L +GKLV+ AH+YG+T + G R D +V+
Sbjct: 274 RPTLEPARTLSHTLVDSGKLVYAAHFYGYTGPNHSGATGTGETHDPRYRDLSPQELRDVL 333
Query: 150 RLSGFLL--EQG----WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVG 203
F + E G PL++SEFG + + F G AE D D+A W VG
Sbjct: 334 HRQAFFVSAETGKHYTAPLWISEFGEGRHTTDAASRAWFENFVGHLAENDTDFAYWPAVG 393
>gi|380491298|emb|CCF35420.1| cellulase [Colletotrichum higginsianum]
Length = 507
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 107/250 (42%), Gaps = 22/250 (8%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + P + WI+ + F V+G
Sbjct: 260 VIVNNHITHSTWCCGADPCDAHWANDHLGPLCRIKQTEEDWIQHWEAIMLRFVDNPRVIG 319
Query: 54 MSLRNELRGPKQNVK--DWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
+ LRNE+RG ++ W + + A N + LVI+ G DL+ V + V L
Sbjct: 320 VDLRNEVRGVWGSMTWDKWATAAEKAGNRLLAMNKDWLVIVGGTESGNDLTGVAKRPVIL 379
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG--WPLFVSEFGA 169
++V+ AH Y ++ G ++G ++ MR + L +G P++V EFGA
Sbjct: 380 DVPDRVVYSAHVYAWS-GWGSLEGRYSKRSYASFVQSMRHNWAYLVEGDQSPVWVGEFGA 438
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWC--- 226
R N+ D Y N +D D+ W + R+ + E Y L + +W
Sbjct: 439 PHR-PNIGDANYWNNLMRYLKVIDADFGYWAV----NPRKPKDNVKETYALVEDDWVTPV 493
Query: 227 -DIRNSSFLE 235
D R +E
Sbjct: 494 LDYRMKDMVE 503
>gi|449300277|gb|EMC96289.1| glycoside hydrolase family 5 protein [Baudoinia compniacensis UAMH
10762]
Length = 429
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 112/260 (43%), Gaps = 21/260 (8%)
Query: 1 MVILDNHISKPGWC-----CSNSDGNGFFGD---QYFNPDLWIKGLTKMAT--IFNGVRN 50
+VI +NHI+ WC C +S N +G + WI+ + I NG+
Sbjct: 177 VVIPNNHITNAHWCDGKNLCDSSWKNDQYGSLCKVRQTTESWIEHWKTIMEPHIDNGL-- 234
Query: 51 VVGMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQA 108
V+G LRNE RG + W + +E + P L+ + G+ D S R +
Sbjct: 235 VIGADLRNEPRGLWGTMSWDMWATAAEQASEELLKMQPNWLMFVEGIGSANDCSGARERP 294
Query: 109 VNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEF 167
V L+ ++V+ AH Y ++ V +D + R G+LL P++V EF
Sbjct: 295 VKLSIPERVVYSAHVYKWSGWSTLVPYGKRPYPSFALD-MERNWGYLLRGDIAPVWVGEF 353
Query: 168 GADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCD 227
GA GN+ D+ Y N ++D DW W L R+ NE YGL +W
Sbjct: 354 GAPHGGND-GDHHYWNNLMKYLRDVDADWGYWAL----NPRKPENYDNETYGLLADDWET 408
Query: 228 IRNSSFLERISSLQSPFRGP 247
+ + ++ + L +P R P
Sbjct: 409 VVDDWRIKDLLKLVAPRRHP 428
>gi|171680400|ref|XP_001905145.1| hypothetical protein [Podospora anserina S mat+]
gi|170939827|emb|CAP65052.1| unnamed protein product [Podospora anserina S mat+]
Length = 264
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 40/57 (70%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
D H+SK GWCCS +DGN +FGD++F+ + W +GL +A N+V MSLRNELR
Sbjct: 157 DAHVSKAGWCCSQTDGNAWFGDEHFDIENWTRGLGYVARWAGDHTNIVSMSLRNELR 213
>gi|359687775|ref|ZP_09257776.1| cellulase [Leptospira licerasiae serovar Varillal str. MMD0835]
gi|418748763|ref|ZP_13305055.1| cellulase (glycosyl hydrolase family 5) [Leptospira licerasiae str.
MMD4847]
gi|418758988|ref|ZP_13315169.1| cellulase (glycosyl hydrolase family 5) [Leptospira licerasiae
serovar Varillal str. VAR 010]
gi|384114205|gb|EIE00469.1| cellulase (glycosyl hydrolase family 5) [Leptospira licerasiae
serovar Varillal str. VAR 010]
gi|404275832|gb|EJZ43146.1| cellulase (glycosyl hydrolase family 5) [Leptospira licerasiae str.
MMD4847]
Length = 762
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/248 (24%), Positives = 99/248 (39%), Gaps = 46/248 (18%)
Query: 1 MVILDNHISKPGWCCS-NSDGNGFFGDQYF----NPDLWIKGLTKMATIFNGVRNVVGMS 55
+V+L+NH + WCC + +G + F P++W + + + V
Sbjct: 164 VVVLNNHTTFSEWCCGFDYNGQWYHTGSSFAYNQTPEMWKADWVFLVNRYKDNKLVAAAD 223
Query: 56 LRNELRGPK--------------QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNF---- 97
LRNE+R + N+ DW + + ANP++++++ G+N+
Sbjct: 224 LRNEVRTQRFNDTHLPNSPNWGWNNIDDWRKAAGEAGNDILRANPDMVIVVEGINWWGAI 283
Query: 98 -------DKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMR 150
L VR+ V++ KLV+ AH YGF + +G+ G + M
Sbjct: 284 PILGSGERPHLKPVRDLQVHIRNVNKLVYAAHNYGFIGPKH--NGDDATSGGNIKYKDMD 341
Query: 151 LSGF--LLEQGW------------PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDW 196
L+ F + W P++VSEFGA N D +L E D D+
Sbjct: 342 LTTFRNTITDEWGYVTDPDAVTTAPVWVSEFGASPGETNPADREWLKRLVDYLIEKDLDF 401
Query: 197 ALWTLVGS 204
A W L G
Sbjct: 402 AFWPLNGE 409
>gi|452842788|gb|EME44724.1| glycoside hydrolase family 5 protein [Dothistroma septosporum
NZE10]
Length = 188
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 10/101 (9%)
Query: 5 DNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR--- 61
D H+ K WCC+N+DGN +F D F D+W +GL MA G +V+ MSLRNELR
Sbjct: 88 DMHVGKAQWCCNNTDGNAWFNDYDFPVDVWKRGLKYMANWAQGHDDVLSMSLRNELRRAI 147
Query: 62 ---GPKQNVK-DWYRYM---QLGAEAVHAANPEVLVILSGL 95
P + DW + +A++ N ++LV S +
Sbjct: 148 NITSPTSTIDYDWLSLVGDDAAATDAIYETNSDILVTWSSM 188
>gi|256378266|ref|YP_003101926.1| glycoside hydrolase family protein [Actinosynnema mirum DSM 43827]
gi|255922569|gb|ACU38080.1| glycoside hydrolase family 5 [Actinosynnema mirum DSM 43827]
Length = 632
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 105/243 (43%), Gaps = 47/243 (19%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFN----PDLWIKGLTKMATIFNGVRNVVGMSL 56
VIL+NH WCC G G++ +N W + MA + + VVG L
Sbjct: 175 AVILNNHTGTSRWCC------GVDGNERWNTARTEAQWQEDWLFMARRYASNKRVVGADL 228
Query: 57 RNELR-----GPK---QNVKDWYRY-MQLGAEAVHAANPEVLVILSGLNFD--------- 98
NE+R P N DW+R Q+ + ANP++LVI+ G+N+
Sbjct: 229 YNEVRRTITDDPNWGWGNDHDWHRASQQVANRILTEANPDLLVIVEGINWTGLPVDGFPH 288
Query: 99 --KDLSFVRNQAVNLTFTGKLVFEAHWYGFTD---------GQAWVDGNPNQVCGRVVDN 147
L VR + L + KLV+ AH+YG+T G+ D + + + +
Sbjct: 289 GRPTLEPVRTLSHALVESDKLVYSAHFYGYTGPNHSGATGIGET-TDPRYQDLSPQELKD 347
Query: 148 VMRLSGFLL--EQGW----PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
V+R + E G P+++SEFG R + N + F + E D D+A W +
Sbjct: 348 VVRRQALYVSAETGQHFTAPVWISEFGVG-RDSAANTRDWFERFVDLLVEADADFAYWPV 406
Query: 202 VGS 204
VG+
Sbjct: 407 VGN 409
>gi|342877360|gb|EGU78828.1| hypothetical protein FOXB_10650 [Fusarium oxysporum Fo5176]
Length = 646
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 104/253 (41%), Gaps = 20/253 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + P D WI+ + V+G
Sbjct: 399 VIVNNHITSATWCCGADPCDAGWSNDHLGPICRVRQTEDEWIQHWETIMLPHINNPLVIG 458
Query: 54 MSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
+ LRNE+RG + W + + N + LVI+ G DLS V + + L
Sbjct: 459 VDLRNEIRGLWGTMPWSKWAPAAERCGNRLLQMNRDWLVIVEGTESSNDLSKVCKRPILL 518
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLE-QGWPLFVSEFGAD 170
T +LV+ AH Y ++ +W + + G++++ Q P++V E GA
Sbjct: 519 DVTHRLVYSAHVYAWSGWGSWEGRFLQRDYDSFAKTMHHNWGYIVDKQIAPVWVGEIGAP 578
Query: 171 LRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNW----C 226
++ +V D Y E D D+ W L R+ E YGL +W
Sbjct: 579 VQP-SVGDANYWQHLVRFLQEKDSDFGYWALNA----RKPKGNATERYGLLQDDWKTPVL 633
Query: 227 DIRNSSFLERISS 239
D R LE +S+
Sbjct: 634 DYRMKDMLELMSA 646
>gi|393232644|gb|EJD40223.1| cellulase [Auricularia delicata TFB-10046 SS5]
Length = 614
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 66/146 (45%), Gaps = 21/146 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VIL+NH +K WCC +DGN + + D WIK + F + VVG L NE+R
Sbjct: 154 VILNNHTNKSRWCCGVADGNERWNESQSEED-WIKDWVSIVKRFKNNKRVVGADLYNEVR 212
Query: 62 -----GPK---QNVKDWY-RYMQLGAEAVHAANPEVLVILSGLNF-----------DKDL 101
P N DWY G + ANP++L+I+ G+N+ L
Sbjct: 213 RNILSDPNWGGYNKYDWYLAAHHAGDRILTEANPDILIIIEGINWVGIPTDITPHSRPTL 272
Query: 102 SFVRNQAVNLTFTGKLVFEAHWYGFT 127
+ L KLV+ AH+YG+T
Sbjct: 273 VGANQLSHTLVQPNKLVYSAHFYGYT 298
>gi|310793874|gb|EFQ29335.1| cellulase [Glomerella graminicola M1.001]
Length = 409
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 101/237 (42%), Gaps = 18/237 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + P + WI+ + F V+G
Sbjct: 162 VIVNNHITHSTWCCGADPCDAHWANDHLGPLCRIKQTEEDWIQHWETIMLRFVDNPRVIG 221
Query: 54 MSLRNELRGP--KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
+ LRNE+RG W + + N + L+I+ G DL+ V + V L
Sbjct: 222 VDLRNEVRGVWGSMTWDKWATAAEKAGNRLLEMNKDWLIIVGGTESGNDLTGVAKRPVVL 281
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG--WPLFVSEFGA 169
T ++V+ AH Y ++ G ++G ++ N MR + L +G P++V EFGA
Sbjct: 282 TVPDRVVYSAHVYAWS-GWGSLEGRYSKRGYTSFVNSMRHNWAYLVEGDQAPVWVGEFGA 340
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWC 226
++ D Y N +D D+ W + R+ + E Y L + +W
Sbjct: 341 P-HHPSIGDANYWNNLLRYLKVIDADFGYWAV----NPRKPKDNVKETYSLVEDDWV 392
>gi|452985724|gb|EME85480.1| glycoside hydrolase family 5 protein, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 388
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 108/254 (42%), Gaps = 21/254 (8%)
Query: 1 MVILDNHISKPGWC-----CSNS---DGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVV 52
+VI +NHI+ WC C +S D G F WI + F V+
Sbjct: 139 IVIPNNHITNAHWCDGMNLCDSSWKNDHYGPFCKIKQTTATWIDHWRTVMAPFVHNPFVI 198
Query: 53 GMSLRNELRGPKQNVKDWYRYM---QLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAV 109
G LRNE RG W + + +EA+ A P L+ + G++ D S +N+ V
Sbjct: 199 GADLRNEPRG-LWGTMTWSMFASASEKASEALLAIQPNWLMFVEGISSANDCSGAKNRPV 257
Query: 110 NLTFTGKLVFEAHWYGFTDGQAWVDGNP--NQVCGRVVDNVMRLSGFLLEQG-WPLFVSE 166
L+ ++V+ +H Y ++ G + P + ++ + +L+E+ P++V E
Sbjct: 258 KLSIPNRVVYSSHIYSWS-GWGSIPSVPYGKRPYPSFAADMRKNWAYLIEENTAPVWVGE 316
Query: 167 FGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWC 226
FGA GN+ +D Y + E D DW W L R+ NE YGL +W
Sbjct: 317 FGAPHSGND-HDYHYWENSMRILNETDADWGYWALNP----RKPEGYDNETYGLLHDDWE 371
Query: 227 DIRNSSFLERISSL 240
+ L+ + L
Sbjct: 372 TVVGDYRLKSLQKL 385
>gi|408396252|gb|EKJ75414.1| hypothetical protein FPSE_04433 [Fusarium pseudograminearum CS3096]
Length = 403
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 113/256 (44%), Gaps = 26/256 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRN--V 51
VI++NHI+ WCC + + + + + P + WI+ + + ++N V
Sbjct: 156 VIVNNHITSATWCCGANPCDAGWSNDHLGPICRVKQTEEEWIQHWETI--MLPHIKNPLV 213
Query: 52 VGMSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAV 109
VG+ +RNE+RG + W + + N + LVI+ G DLS + + V
Sbjct: 214 VGVDVRNEIRGLWGTMPWSKWAPAAERCGNRLLQMNRDWLVIVEGTESSNDLSSICKRPV 273
Query: 110 NLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLE-QGWPLFVSEF 167
L +LV+ AH Y ++ +W +G Q + MR + ++LE Q P++V E
Sbjct: 274 LLDVAHRLVYSAHVYAWSGWGSW-EGRFLQRDYKSFAKTMRHNWAYILEKQIAPVWVGEI 332
Query: 168 GADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNW-- 225
GA + +V D Y E+D D+ W L R+ + E YGL +W
Sbjct: 333 GAPGQP-SVGDINYWQHLTRFLQEMDADFGYWALNA----RKPKGNVTERYGLLHDDWKT 387
Query: 226 --CDIRNSSFLERISS 239
D R LE ++S
Sbjct: 388 PVLDYRMKDMLELMAS 403
>gi|440694740|ref|ZP_20877327.1| cellulase (glycosyl hydrolase family 5) [Streptomyces
turgidiscabies Car8]
gi|440283205|gb|ELP70528.1| cellulase (glycosyl hydrolase family 5) [Streptomyces
turgidiscabies Car8]
Length = 645
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 71/255 (27%), Positives = 110/255 (43%), Gaps = 59/255 (23%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP----DLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N + W MA + + VVG L
Sbjct: 177 VILNNHTNTTRWCC------GVDGNERWNASRSTETWENDWLFMARRYRDNQRVVGADLY 230
Query: 58 NELRGPKQNV-----------KDWYRYMQ-LGAEAVHAANPEVLVILSGLNFD------- 98
NE+R +N+ DW+ Q +G + A+P++L+I+ G+N+
Sbjct: 231 NEVR---RNILDDPNWGSGDDHDWFAASQHVGDRILTEADPDLLIIVEGINWTGVPVDGL 287
Query: 99 ----KDLSFVRNQAVNLTFTGKLVFEAHWY-------------GFTDGQAWVDGNPNQVC 141
L VR + L +GKLV+ AH+Y G T + D +P ++
Sbjct: 288 PHERPTLEPVRRLSHTLVDSGKLVYSAHFYDYTGPNHSGATGTGETSDPRYRDLSPAELI 347
Query: 142 GRVVDNVMRLSGFLLEQ-----GWPLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWD 195
RV+D R + F+ + PL++SEFG R + R + F D D
Sbjct: 348 -RVLD---RQAFFVTAETDKHFTAPLWISEFGVGGRDETGSKQRAWFENFVDQLIRTDAD 403
Query: 196 WALWTLVGSYYLREG 210
+A W LVG + REG
Sbjct: 404 FAYWPLVGWHENREG 418
>gi|302411452|ref|XP_003003559.1| endoglucanase [Verticillium albo-atrum VaMs.102]
gi|261357464|gb|EEY19892.1| endoglucanase [Verticillium albo-atrum VaMs.102]
Length = 326
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 58/236 (24%), Positives = 101/236 (42%), Gaps = 18/236 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
V+++NHI+ WCC + + + + P + WI+ + F V+G
Sbjct: 70 VVVNNHITHATWCCGADPCDAGWANDHLGPLCRVRQTEEGWIRNWETVMVRFVDNPLVIG 129
Query: 54 MSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE+RG + W + + NP+ L+++ G DL V + V L
Sbjct: 130 ADLRNEVRGVWGTMPWSKWAAAAERAGNRLLKMNPDWLIVVGGTESGNDLRGVAERPVRL 189
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLE-QGWPLFVSEFGA 169
G++V+ AH Y ++ G ++G + + MR + G+L+E P++V EFGA
Sbjct: 190 DVPGRVVYSAHVYAWS-GWGSLEGRFAKRGYASFVHAMRENWGYLVEGDRAPVWVGEFGA 248
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNW 225
R ++ D Y +D D+ W + R+ E Y L + +W
Sbjct: 249 P-RDPSMGDANYWQNLVRYLKVVDADFGYWAI----NPRKPKANTTETYALVEDDW 299
>gi|345855822|ref|ZP_08808461.1| glycosyl hydrolase [Streptomyces zinciresistens K42]
gi|345632711|gb|EGX54579.1| glycosyl hydrolase [Streptomyces zinciresistens K42]
Length = 645
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 104/248 (41%), Gaps = 45/248 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD----LWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W + MA + VVG L
Sbjct: 177 VILNNHTNTTRWCC------GVDGNERWNTSQSTAAWEEDWLFMARRYRANPRVVGADLY 230
Query: 58 NELR-----GPKQNV---KDWYRYMQ-LGAEAVHAANPEVLVILSGLNFD---------- 98
NE+R P + +DW+ Q +G + ANP++L+++ G+N+
Sbjct: 231 NEVRRSVWDDPNWGLGDDRDWFAASQRVGDRILTEANPDLLIVVEGINWTGIPVDGLPHE 290
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWY-------------GFTDGQAWVDGNPNQVCGRV 144
L R + L +GKLV+ AH+Y G T + D +P+++ +
Sbjct: 291 RPTLEPARRLSHTLVDSGKLVYSAHFYDYTGPNHTGATGTGETSDPRYRDLSPDELIAVL 350
Query: 145 VDNVMRLSGFLLEQ-GWPLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLV 202
+SG + PL++SEFG R R + F D D+A W LV
Sbjct: 351 NRQAFFISGEQNQHFTAPLWISEFGVGGRAETGQKQRAWFENFVDQLIRTDADFAYWPLV 410
Query: 203 GSYYLREG 210
G + R+G
Sbjct: 411 GWHENRQG 418
>gi|255537619|ref|XP_002509876.1| hypothetical protein RCOM_1689350 [Ricinus communis]
gi|223549775|gb|EEF51263.1| hypothetical protein RCOM_1689350 [Ricinus communis]
Length = 139
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 32/39 (82%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLT 39
MVIL+NH+S+P WCCS SDGNGF GD F+P++W++G
Sbjct: 65 MVILENHMSQPLWCCSESDGNGFHGDAQFHPEVWLRGFV 103
>gi|346978266|gb|EGY21718.1| endoglucanase [Verticillium dahliae VdLs.17]
Length = 418
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/261 (24%), Positives = 108/261 (41%), Gaps = 24/261 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
V+++NHI+ WCC + + + + P + WI+ + F V+G
Sbjct: 162 VVVNNHITHATWCCGADPCDAGWANDHLGPLCRVRQTEEGWIRNWEAVMARFVDNPLVIG 221
Query: 54 MSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE+RG + W + + NP+ L+++ G DL V + V L
Sbjct: 222 ADLRNEVRGVWGTMPWSKWAAAAERAGNRLLRMNPDWLIVVGGTESGNDLRGVAERPVRL 281
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLE-QGWPLFVSEFGA 169
G++V+ AH Y ++ G ++G + + MR + G+L+E P++V EFGA
Sbjct: 282 DVPGRVVYSAHVYAWS-GWGSLEGRFAKRGYASFVHAMRENWGYLVEGDEAPVWVGEFGA 340
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNW---- 225
R ++ D Y +D D+ W + R+ E Y L + +W
Sbjct: 341 P-RDPSMGDANYWQNLVRYLKVVDADFGYWAI----NPRKPKANTTETYALVEDDWKTPV 395
Query: 226 CDIRNSSFLERISSLQSPFRG 246
D R E + Q F G
Sbjct: 396 LDYRMKDMAEMMK--QGTFEG 414
>gi|386844464|ref|YP_006249522.1| cellulase [Streptomyces hygroscopicus subsp. jinggangensis 5008]
gi|374104765|gb|AEY93649.1| cellulase [Streptomyces hygroscopicus subsp. jinggangensis 5008]
gi|451797758|gb|AGF67807.1| cellulase [Streptomyces hygroscopicus subsp. jinggangensis TL01]
Length = 641
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/239 (26%), Positives = 98/239 (41%), Gaps = 43/239 (17%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD----LWIKGLTKMATIFNGVRNVVGMSLR 57
V+L+NH + WCC G G++ +N W MA + V VVG L
Sbjct: 179 VVLNNHTNTSRWCC------GIDGNERWNSGQSTARWADDWVFMARRYASVPRVVGADLY 232
Query: 58 NELR-----GPKQNV---KDWYRYMQLGAEAV-HAANPEVLVILSGLNFD---------- 98
NE+R P + DWY +L A+ + ANP +L+++ G+N+
Sbjct: 233 NEVRRDVLDDPNWGLGDDHDWYGAARLAADRILTEANPRLLIVIEGINWTGVPVDGLPHG 292
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFT----DGQAWV----DGNPNQVCGRVVDNVM 149
L+ R + L GKLV+ AH+YG+T G + D + + +
Sbjct: 293 RPVLTPARTLSHTLVDAGKLVYSAHFYGYTGPRHSGATGIGETHDARYQDLSRAELAAAL 352
Query: 150 RLSGFLLEQG-----WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVG 203
F +E+ P++VSEFG N + A+ D D+A W LVG
Sbjct: 353 DDEAFFVEETGRHYTAPVWVSEFGIGSGETNAAARAWFGHVTDYFADHDADFAFWPLVG 411
>gi|398407351|ref|XP_003855141.1| Exo-beta-1,3-glucanase [Zymoseptoria tritici IPO323]
gi|339475025|gb|EGP90117.1| Exo-beta-1,3-glucanase [Zymoseptoria tritici IPO323]
Length = 424
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 111/259 (42%), Gaps = 25/259 (9%)
Query: 1 MVILDNHISKPGWC-----CSNSDGNGF---FGDQYFNPDLWIKGLTKMATIFNGVRNVV 52
MVI+++HI+ WC C +S N F + WI+ + F VV
Sbjct: 173 MVIVNDHITNAAWCDGFNLCDSSWKNDHLLGFCKIRQTTESWIENWKTIMKPFVDNPLVV 232
Query: 53 GMSLRNELRG-----PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQ 107
G LRNE RG P +N W + +EA+ P+ L+ + G + D S V+ +
Sbjct: 233 GADLRNEPRGIWGTFPWEN---WAAAAEKASEALLKMQPDWLMFVEGTSSANDCSGVKRR 289
Query: 108 AVNLTFTGKLVFEAHWYGFTDGQAWVDGNP--NQVCGRVVDNVMRLSGFLLEQG-WPLFV 164
V L+ G++V+ +H Y ++ G + P + +++R GFLL++ P++V
Sbjct: 290 PVRLSIPGRVVYSSHVYSWS-GWGTLPSVPYKERPYPSFRADMVRKWGFLLDRDEAPVWV 348
Query: 165 SEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWN 224
EFG D+ Y N +E D DW W L R+ E YGL +
Sbjct: 349 GEFGNPGHAGK-GDHHYWNGLMRYLSETDADWGYWALNP----RKPGTFDEETYGLLQDD 403
Query: 225 WCDIRNSSFLERISSLQSP 243
W ++ + + L P
Sbjct: 404 WTTPKDDYRMRDLQRLMMP 422
>gi|453087736|gb|EMF15777.1| glycoside hydrolase family 5 protein [Mycosphaerella populorum
SO2202]
Length = 447
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 110/258 (42%), Gaps = 19/258 (7%)
Query: 1 MVILDNHISKPGWC-----CSNSDGNGFFG---DQYFNPDLWIKGLTKMATIFNGVRNVV 52
+VI +NHI+ WC C +S N +G WI + F V+
Sbjct: 196 IVIPNNHITNAHWCDGLNLCDSSWKNSQYGPICKIRQTTSSWIAHWVTIMAPFISNPFVI 255
Query: 53 GMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
G LRNE RG W + +EA+ P+ L+ + G++ D S + + V
Sbjct: 256 GADLRNEPRGIWGTMPWSSWASAAEEASEALLTLQPDWLMFVEGVSSANDCSGAKKRPVE 315
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNP--NQVCGRVVDNVMRLSGFLL-EQGWPLFVSEF 167
L+ ++V+ +H Y ++ G + P + +++ + GFLL E P++V EF
Sbjct: 316 LSIPHRVVYSSHVYSWS-GWGHIPSVPYGKRPYASFAEDMKQNWGFLLHENVAPVWVGEF 374
Query: 168 GADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCD 227
GA GN D+ Y V E D DW W L R+ NE YGL +W
Sbjct: 375 GAPQAGNEA-DHHYWENLMRVLKETDADWGYWALNP----RKPEGYDNETYGLLRDDWET 429
Query: 228 IRNSSFLERISSLQSPFR 245
+ + L+ + +L P +
Sbjct: 430 VVDDWRLKDLRTLMEPLQ 447
>gi|398336822|ref|ZP_10521527.1| cellulase [Leptospira kmetyi serovar Malaysia str. Bejo-Iso9]
Length = 640
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 103/249 (41%), Gaps = 48/249 (19%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFF---GDQY-FNP--DLWIKGLTKMATIFNGVRNVVGM 54
+VIL+NH + WCC D NG + G + +N ++W M + V G
Sbjct: 181 IVILNNHTTFSEWCCG-YDYNGLWYHTGSSFAYNQTVEMWQADWLTMIRRYKNNPFVAGA 239
Query: 55 SLRNELRGPKQ--------------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNF--- 97
LRNE+R ++ + DW++ Q + NP +LV++ G+N+
Sbjct: 240 DLRNEVRTMRKGDTHIPDSPNWGMNDQNDWHKASQDMGVLITQENPNILVVVEGINWWGL 299
Query: 98 --------DKDLSFVRNQAVNLTFTGKLVFEAHWYGFT----DGQAWVDGNPNQVCGRVV 145
L +++ V+L + KLV+ AH Y + +G GN + R +
Sbjct: 300 IPVLGSGERPHLKPIKDLPVHLPLSNKLVYAAHNYAYIGPNHNGDDSTSGN--NIKYRDM 357
Query: 146 DNVMRLSGFLLEQGW----------PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWD 195
D S E G+ P+++SEFGA + D +L V E D D
Sbjct: 358 DENTFKSTVFNEWGYVVTPETYYSAPVWLSEFGASPSNSGATDQEWLRRLTDVLIERDMD 417
Query: 196 WALWTLVGS 204
+A W L G+
Sbjct: 418 FAYWPLNGN 426
>gi|284041630|ref|YP_003391970.1| cellulase [Conexibacter woesei DSM 14684]
gi|283945851|gb|ADB48595.1| Cellulase [Conexibacter woesei DSM 14684]
Length = 542
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 76/305 (24%), Positives = 116/305 (38%), Gaps = 42/305 (13%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+++LDNH ++ G + Q F+ D W+ ++A + RNV+G L+NE
Sbjct: 138 LILLDNHSH-----ADDAYQQGLWYGQGFSEDDWVATWKRLAARYRDQRNVIGADLKNEP 192
Query: 61 RGPKQ----NVKDWYRYMQLGAEAVHAANPEVLVILSGLN--------------FDKDLS 102
DW R + AV + P+ LV++ G+ + +L
Sbjct: 193 HAEATWGTGGPTDWRRAAERAGNAVLSVAPQWLVVVEGVGGGAPVPGQRLDTHWWGGNLE 252
Query: 103 FVRNQAVNLTFTGKLVFEAHWYG---FTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQ 158
VR V L +LV+ H YG F Q W G PN +++ R GF+ EQ
Sbjct: 253 GVRTHPVRLDRANRLVYSPHEYGPGVFP--QPWF-GKPNTPA--LLEERWRTGFGFIAEQ 307
Query: 159 G-WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
G P+ V EFG + R+ FF W W L + GV+
Sbjct: 308 GIAPILVGEFGGRNVDRESAEGRWQRQFFDFIGRTGASWTYWALNPNSGDTGGVL----- 362
Query: 218 YGLFDWNWCDIRNSSFLERISSLQS-PFRGPGVFETGLHKVIYHPATGLCVQRKSFLDPL 276
DW+ ++ L+R+ + Q FRG G T + P G K+
Sbjct: 363 --KDDWSSVQPAKTALLQRMIARQRIAFRGSGAVFTAPRRATT-PRRGGKAAPKTPARSQ 419
Query: 277 TLGPC 281
T P
Sbjct: 420 TAAPT 424
>gi|380479975|emb|CCF42701.1| endo-beta-1 [Colletotrichum higginsianum]
Length = 254
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 38/62 (61%), Gaps = 2/62 (3%)
Query: 4 LDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVR--NVVGMSLRNELR 61
LDNH+SK GWCC+ DGNG+FGD FN W++ L+ MA + + LRNELR
Sbjct: 166 LDNHVSKAGWCCNPYDGNGWFGDTNFNTSNWVRSLSFMAEHGKAANWPSFSSIGLRNELR 225
Query: 62 GP 63
P
Sbjct: 226 DP 227
>gi|302926443|ref|XP_003054296.1| hypothetical protein NECHADRAFT_90120 [Nectria haematococca mpVI
77-13-4]
gi|256735237|gb|EEU48583.1| hypothetical protein NECHADRAFT_90120 [Nectria haematococca mpVI
77-13-4]
Length = 489
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 65/259 (25%), Positives = 106/259 (40%), Gaps = 32/259 (12%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF--------NPDLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + Y + WI+ + + V+G
Sbjct: 242 VIVNNHITSATWCCGADPCDAAWANDYLGLLCPVKQTEEEWIQHWETIMLPYVNNPLVIG 301
Query: 54 MSLRNELRG-------PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN 106
+ LRNE+RG PK W + + N LVI+ G DLS V
Sbjct: 302 VDLRNEIRGLWGTMPWPK-----WATAAEHCGNRLLRMNKNWLVIVEGTESSNDLSHVGK 356
Query: 107 QAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLL-EQGWPLFV 164
+ + L ++V+ AH Y ++ +W +G MR + G++L E+ P++V
Sbjct: 357 RPIKLDVDHRVVYSAHVYAWSGWGSW-EGRFLLREYESFAKTMRHNWGYILDEEMAPVWV 415
Query: 165 SEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWN 224
EFGA + +V D Y E + D+ W L R+ E YGL +
Sbjct: 416 GEFGAPSKP-SVGDANYWQHLMRFLREQNADFGYWALNA----RKPKGNTTETYGLLHDD 470
Query: 225 WC----DIRNSSFLERISS 239
W D R L+ +++
Sbjct: 471 WVTPVLDYRMKDMLDLMAA 489
>gi|388520409|gb|AFK48266.1| unknown [Lotus japonicus]
Length = 197
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/46 (50%), Positives = 34/46 (73%)
Query: 3 ILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGV 48
+LDNH+S+P WCC++ D NGFF D++F+P W++GLT A F +
Sbjct: 147 LLDNHVSEPKWCCNDDDENGFFHDRHFDPQEWVQGLTLAAKHFADI 192
>gi|455650611|gb|EMF29381.1| glycosyl hydrolase [Streptomyces gancidicus BKS 13-15]
Length = 642
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 68/248 (27%), Positives = 105/248 (42%), Gaps = 45/248 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD----LWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W MA + VVG L
Sbjct: 173 VILNNHTNTTRWCC------GVDGNERWNASQSDGTWENDWLFMARRYRDNPRVVGADLY 226
Query: 58 NELR-----GPKQNV---KDWYRYMQ-LGAEAVHAANPEVLVILSGLNFD---------- 98
NE+R P + DW+ Q G + ANP +L+I+ G+N+
Sbjct: 227 NEVRRSVWDDPNWGLGDGHDWFAASQRAGDRILTEANPHLLIIVEGINWTGLPVDGFPHG 286
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFTD---------GQAWVDGNPNQVCGRVVDNV 148
L VR+ + L +GKLV+ AH+YG+T G+ + +VD +
Sbjct: 287 RPTLEPVRDLSHTLVDSGKLVYSAHFYGYTGPRHSGATGIGETSDPRYQDLTPAELVDVL 346
Query: 149 MRLSGFL---LEQGW--PLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLV 202
R + F+ +++ + P+++SEFG R R + F D D+A W LV
Sbjct: 347 NRQAFFVTGDMDRHYTAPVWISEFGVGGREETGAAPRAWFERFVDHLIRTDADFAYWPLV 406
Query: 203 GSYYLREG 210
G + R G
Sbjct: 407 GRHENRRG 414
>gi|336464317|gb|EGO52557.1| hypothetical protein NEUTE1DRAFT_72302 [Neurospora tetrasperma FGSC
2508]
Length = 396
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 93/213 (43%), Gaps = 15/213 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDL---------WIKGLTKMATIFNGVRNVV 52
VI++NHI+ WCC + + + + L WI + T F V+
Sbjct: 148 VIVNNHITTATWCCGADPCDAGWANDFLPGSLCRVRQTEAEWIDHWVTVMTRFVDNPLVI 207
Query: 53 GMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
G LRNE+RG + W + A+ NP+ L+I+ G DL+ VR + V
Sbjct: 208 GADLRNEVRGLWGTMPWERWAAAAERAGNALLRINPDWLIIVGGTESGNDLTGVRRRPVE 267
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQGW-PLFVSEFG 168
L +LV+ AH Y ++ G + G ++ MR + G+L+E+ P+++ EFG
Sbjct: 268 LDVANRLVYSAHVYSWS-GWGSMGGRYSKRTYASFVKAMRQNWGYLVEENLAPVWIGEFG 326
Query: 169 ADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
A R + D Y +D D+ W +
Sbjct: 327 APDR-PGIGDANYWQNLLRYLKAIDADFGYWAI 358
>gi|254385974|ref|ZP_05001291.1| cellulase [Streptomyces sp. Mg1]
gi|194344836|gb|EDX25802.1| cellulase [Streptomyces sp. Mg1]
Length = 624
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 65/253 (25%), Positives = 98/253 (38%), Gaps = 50/253 (19%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDL----WIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH WCC G G++ +N W +A + VVG L
Sbjct: 154 VILNNHTGTTRWCC------GLDGNERWNSGRSTAQWADDWVFLARRYRDNPRVVGADLY 207
Query: 58 NELR-----GPKQNV---KDWYRYMQLGAEAV-HAANPEVLVILSGLNFD---------- 98
NE+R P + DW+ Q A+ + ANP +L+++ G+N+
Sbjct: 208 NEVRRDVFDDPNWGLGDGHDWHAAAQEAADRILTEANPNLLIVIEGINWTGIPVDGFAHG 267
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQ-------------AWVDGNPNQVCGRV 144
L+ VR + L + KLV+ AH+YG+T + D P Q+ +
Sbjct: 268 RPALTPVRTLSHTLAVSHKLVYAAHFYGYTGPHHSGATGIGETSDLRYQDMTPAQLEQTL 327
Query: 145 VDNVMRLSGFLLEQG----WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWT 200
D +S E G P+++SEFG + G D D+A W
Sbjct: 328 YDQAFFVSA---ETGTHFTAPVWISEFGIGADETGAKPRAWFENLTGYLTRADADFAYWP 384
Query: 201 LVGSYYLREGVIG 213
LVG +G G
Sbjct: 385 LVGWSTTAQGAPG 397
>gi|389641787|ref|XP_003718526.1| endoglucanase E1 [Magnaporthe oryzae 70-15]
gi|351641079|gb|EHA48942.1| endoglucanase E1 [Magnaporthe oryzae 70-15]
gi|440469775|gb|ELQ38872.1| endoglucanase E1 [Magnaporthe oryzae Y34]
gi|440482281|gb|ELQ62788.1| endoglucanase E1 [Magnaporthe oryzae P131]
Length = 411
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 88/212 (41%), Gaps = 14/212 (6%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + P D WI + + F V+G
Sbjct: 164 VIINNHITHATWCCGADPCDAGWANDHLGPLCRVSQTEDDWIAHWETVMSRFTENPRVIG 223
Query: 54 MSLRNELRGP--KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE+RG W + A + N + L+I+ G DL R++ V L
Sbjct: 224 ADLRNEVRGVWGTMTWDRWATAAERCANRLLKMNSDWLMIVGGTESGNDLRGARDRPVRL 283
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG--WPLFVSEFGA 169
++V+ AH Y ++ G +G ++ MR + L +G P++V EFGA
Sbjct: 284 DVDDRVVYSAHVYAWS-GWGSREGRYSKRPYPSFVASMRENWAYLVEGDVAPVWVGEFGA 342
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
+V D Y +D D+ W +
Sbjct: 343 PAH-PSVGDANYWQNLMRYLKAIDADFGYWAI 373
>gi|393242514|gb|EJD50032.1| cytochrome P450 [Auricularia delicata TFB-10046 SS5]
Length = 1042
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 66/146 (45%), Gaps = 21/146 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VIL+NH +K WCC +DGN + + + + WI+ + F + VVG L NE+R
Sbjct: 578 VILNNHTNKSRWCCGIADGNERWNESQ-SEEQWIQDWVTIVRRFKDNKRVVGADLYNEVR 636
Query: 62 GPKQNVKDWYRY---------MQLGAEAVHAANPEVLVILSGLNF-----------DKDL 101
N +W Y + G + ANP++L+++ G+N+ L
Sbjct: 637 RDILNDPNWGGYNSYDWYLAAHRAGDRILTEANPDILIVIEGINWVGIPTDITPHSRPTL 696
Query: 102 SFVRNQAVNLTFTGKLVFEAHWYGFT 127
+ L KLV+ AH+Y +T
Sbjct: 697 VGANQLSHTLVQPNKLVYSAHFYSYT 722
>gi|291447816|ref|ZP_06587206.1| cellulase [Streptomyces roseosporus NRRL 15998]
gi|291350763|gb|EFE77667.1| cellulase [Streptomyces roseosporus NRRL 15998]
Length = 622
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/240 (26%), Positives = 102/240 (42%), Gaps = 45/240 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP----DLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH ++ WCC G G + +N + WI+ +A + VVG LR
Sbjct: 161 VILNNHSTQTRWCC------GVDGSERWNSGQSTEEWIEDWVLLAERYRSNPRVVGADLR 214
Query: 58 NELRGP--------KQNVKDWYRYMQ-LGAEAVHAANPEVLVILSGLN--------FDKD 100
NE+R + + DW Q G + ANP++L+++ G+N F +D
Sbjct: 215 NEVRRDVWDDPNWGRGDAHDWAAAAQRAGDRILKDANPDLLIMVEGINWAGIPVDGFWRD 274
Query: 101 ---LSFVRNQAVNLTFTGKLVFEAHWYGFTDGQ-AWVDGNPNQVCGRVVD--------NV 148
L V + L + KLV+ AH+YG+T + + G R D +
Sbjct: 275 RPHLKPVAELSHTLVRSHKLVYAAHYYGYTGPRHSGATGIGETSDPRYRDLSRTELFAEM 334
Query: 149 MRLSGFLLEQ-----GWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVG 203
R S ++ + P+++SEFG + + D + E D D+A W +VG
Sbjct: 335 HRSSAYVADTPDRHFTAPVWISEFGV-AKDADATDRAWFANTVDFLVEHDLDFAYWPVVG 393
>gi|239990802|ref|ZP_04711466.1| cellulase CelA [Streptomyces roseosporus NRRL 11379]
Length = 633
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/240 (26%), Positives = 102/240 (42%), Gaps = 45/240 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP----DLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH ++ WCC G G + +N + WI+ +A + VVG LR
Sbjct: 172 VILNNHSTQTRWCC------GVDGSERWNSGQSTEEWIEDWVLLAERYRSNPRVVGADLR 225
Query: 58 NELRGP--------KQNVKDWYRYMQ-LGAEAVHAANPEVLVILSGLN--------FDKD 100
NE+R + + DW Q G + ANP++L+++ G+N F +D
Sbjct: 226 NEVRRDVWDDPNWGRGDAHDWAAAAQRAGDRILKDANPDLLIMVEGINWAGIPVDGFWRD 285
Query: 101 ---LSFVRNQAVNLTFTGKLVFEAHWYGFTDGQ-AWVDGNPNQVCGRVVD--------NV 148
L V + L + KLV+ AH+YG+T + + G R D +
Sbjct: 286 RPHLKPVAELSHTLVRSHKLVYAAHYYGYTGPRHSGATGIGETSDPRYRDLSRTELFAEM 345
Query: 149 MRLSGFLLEQ-----GWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVG 203
R S ++ + P+++SEFG + + D + E D D+A W +VG
Sbjct: 346 HRSSAYVADTPDRHFTAPVWISEFGV-AKDADATDRAWFANTVDFLVEHDLDFAYWPVVG 404
>gi|408528867|emb|CCK27041.1| cellulase [Streptomyces davawensis JCM 4913]
Length = 620
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/248 (27%), Positives = 101/248 (40%), Gaps = 45/248 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD----LWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W MA + + VVG L
Sbjct: 157 VILNNHTNTTRWCC------GVDGNERWNASQSTAAWETDWLFMANRYKDNKRVVGADLY 210
Query: 58 NELR-----GPK---QNVKDWYRYMQLGAEAVH-AANPEVLVILSGLNFD---------- 98
NE+R P N DW+ Q A+ + ANP++L+I+ G+N+
Sbjct: 211 NEVRRNIWDDPNWGWGNDHDWFAASQRVADRLLLEANPDLLIIVEGINWTGIPIDGFAHE 270
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFTD-GQAWVDGNPNQVCGRVVD-------NVM 149
L VR + L +GKLV+ AH+Y +T + G R D +V+
Sbjct: 271 RPTLEPVRRLSHTLVDSGKLVYSAHFYDYTGPNHSGATGTGETTDPRYRDLSPSELISVL 330
Query: 150 RLSGFLLEQ------GWPLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLV 202
F + P+++SEFG R R + F +D D+A W LV
Sbjct: 331 NRQAFYVASEPDQHFTAPVWISEFGVGGREETGARQRAWFENFVDHLIRMDADFAYWPLV 390
Query: 203 GSYYLREG 210
G + R G
Sbjct: 391 GWHEGRTG 398
>gi|429857273|gb|ELA32145.1| cellulase [Colletotrichum gloeosporioides Nara gc5]
Length = 484
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 93/217 (42%), Gaps = 19/217 (8%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + P + WI+ K+ V+G
Sbjct: 232 VIVNNHITHSTWCCGADPCDAHWANDHLGPLCRIKQTEEDWIQHWEKIMLRLVDNPRVIG 291
Query: 54 MSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVIL-----SGLNFDKDLSFVRN 106
+ LRNE+RG + W + + N + LVI G DL+ V +
Sbjct: 292 VDLRNEVRGLWGTMSWDKWATAAEKAGNRLLEMNKDWLVIDWLVIGGGTESGNDLTGVAD 351
Query: 107 QAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG--WPLFV 164
+ V L+ ++V+ AH Y ++ G V+G ++ MR + L +G P++V
Sbjct: 352 RPVVLSVPDRVVYSAHVYAWS-GWGSVEGRYSKRGYASFVKAMRKNWAYLVEGDQAPVWV 410
Query: 165 SEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
EFGA R ++ D Y N +D D+ W +
Sbjct: 411 GEFGAPHR-PSIGDANYWNNLLRYLKVIDADFGYWAV 446
>gi|346322492|gb|EGX92091.1| cellulase (glycosyl hydrolase family 5), putative [Cordyceps
militaris CM01]
Length = 489
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 104/250 (41%), Gaps = 21/250 (8%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF--------NPDLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + + + WI+ + V+
Sbjct: 241 VIVNNHITTATWCCGANPCDAGWANDHLGGLCAVRQTEEGWIRNWETVMARLRHDPLVIA 300
Query: 54 MSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
+ LRNE+RG + W + + + A NP+ LV++ G D+S R + V L
Sbjct: 301 VDLRNEVRGLWGTMPWATWAAAAERCGDRLLAMNPDWLVVVEGTESANDVSGARRRPVVL 360
Query: 112 TFT-GKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGA 169
KLV+ AH Y ++ +W + V + +LLEQ P++V E GA
Sbjct: 361 RRAPDKLVYSAHVYAWSGWGSWGGRFAQRGYASFVGTMRHNWLWLLEQDVAPVWVGELGA 420
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNW---- 225
R + RY + + ++D D+ W + + + V E Y L + +W
Sbjct: 421 -ARHTSRGGARYWRNLWRLLKDVDADFGYWAVNPTKAYQSTV----ETYALLEADWETPV 475
Query: 226 CDIRNSSFLE 235
D R +E
Sbjct: 476 LDYRMKDMVE 485
>gi|297190765|ref|ZP_06908163.1| cellulase [Streptomyces pristinaespiralis ATCC 25486]
gi|297150611|gb|EDY66963.2| cellulase [Streptomyces pristinaespiralis ATCC 25486]
Length = 673
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 104/248 (41%), Gaps = 45/248 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDL----WIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W MA + VVG L
Sbjct: 206 VILNNHTNTTRWCC------GVDGNERWNASRSDVEWEADWLFMARRYRDDPRVVGADLY 259
Query: 58 NELR-----GPK---QNVKDWYRYMQ-LGAEAVHAANPEVLVILSGLNF----------- 97
NE+R P + DW+ Q LG + ANP++L+++ G+N+
Sbjct: 260 NEVRRSVWDDPNWGLGDRHDWFAASQRLGDRILQEANPDLLIVVEGINWTGVPVDGLPHG 319
Query: 98 DKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVC-------GR--VVDNV 148
L V+ + L +GKLV+ AH+Y +T P + GR + D +
Sbjct: 320 RPTLEPVQRLSHTLVRSGKLVYSAHFYDYTGPHHTGATGPGETSDNRYRDFGRDELFDVL 379
Query: 149 MRLSGFLLEQG-----WPLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLV 202
R + ++ Q P+++SEFG R + R + F D D+A W +V
Sbjct: 380 DRQAFYVAGQADRHFTAPVWISEFGIGGRDEHGEKPRAWFENFVDHLIARDADFAYWPMV 439
Query: 203 GSYYLREG 210
G + R+G
Sbjct: 440 GWHEDRKG 447
>gi|345010863|ref|YP_004813217.1| glycoside hydrolase [Streptomyces violaceusniger Tu 4113]
gi|344037212|gb|AEM82937.1| glycoside hydrolase family 5 [Streptomyces violaceusniger Tu 4113]
Length = 642
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 95/240 (39%), Gaps = 44/240 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP----DLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W+ MA + VVG L
Sbjct: 179 VILNNHTNTTRWCC------GVDGNERWNSGQTTQRWVDDWVFMARRYADEPGVVGADLY 232
Query: 58 NELR-----GPK---QNVKDWYRYMQLGAEAVHA-ANPEVLVILSGLNFD---------- 98
NE+R P + DWY Q A+ + NP++L+++ G+N+
Sbjct: 233 NEVRRDILDDPNWGLGDAHDWYAAAQQAADRILTDGNPDLLIVIEGINWTGIPVDGLPHE 292
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMR--LSGFL 155
L+ R + L KLV+ AH+YG+T + + ++ R L L
Sbjct: 293 RPTLAPARTLSHTLVNAHKLVYSAHFYGYTGPRHSGATGVGETHDPRYQDLSRDELRKVL 352
Query: 156 LEQGW------------PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVG 203
+Q + P+++SEFG + A+ D D+A W LVG
Sbjct: 353 TDQAFFVAEDTGRHYTAPVWISEFGIGAAETGTRPRAWFTNLTDYFADHDADFAYWPLVG 412
>gi|290954732|ref|YP_003485914.1| glycosyl hydrolase [Streptomyces scabiei 87.22]
gi|260644258|emb|CBG67339.1| putative secreted glycosyl hydrolase [Streptomyces scabiei 87.22]
Length = 635
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/244 (27%), Positives = 101/244 (41%), Gaps = 37/244 (15%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VIL+NH + W C DGN + + + W MA + VVG L NE+R
Sbjct: 166 VILNNHTNTTRWYC-GVDGNERWNASRSDRE-WEDDWLFMARRYRDNPRVVGADLYNEVR 223
Query: 62 GPK--------QNVKDWYRYMQ-LGAEAVHAANPEVLVILSGLNFD-----------KDL 101
+ DW+ Q +G + ANPE+L+I+ G+N+ L
Sbjct: 224 RDIWDDANWGLGDAHDWFAASQRVGDRILTEANPELLIIVEGINWTGVPVDGLPHERPTL 283
Query: 102 SFVRNQAVNLTFTGKLVFEAHWYGFTD---------GQAWVDGNPNQVCGRVVDNVMRLS 152
VR + L +GKLV+ AH+Y +T G+ + +VD + R +
Sbjct: 284 EPVRRLSHTLVDSGKLVYSAHFYDYTGPRHSGATGIGETSDPRYRDLTPAELVDVLTRQA 343
Query: 153 GFLL-EQG----WPLFVSEFGADLRGNN-VNDNRYLNCFFGVAAELDWDWALWTLVGSYY 206
F+ EQ P+++SEFG R V + F D D+A W LVG +
Sbjct: 344 FFVTGEQNRHFTAPVWISEFGVGGRAETGVRPRAWFENFVDQLIRTDADFAYWPLVGRHE 403
Query: 207 LREG 210
R G
Sbjct: 404 NRTG 407
>gi|340924098|gb|EGS19001.1| hypothetical protein CTHT_0056210 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 426
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 99/214 (46%), Gaps = 17/214 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDL---------WIKG-LTKMATIFNGVRNV 51
VI++NHI++ WCC + + + Y + WI+ T M+ + N V
Sbjct: 174 VIVNNHITRATWCCGLDLCDTGWANDYLPKSICRVRQTEEEWIQHWKTVMSRVVNNSL-V 232
Query: 52 VGMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAV 109
+G LRNE+RG K W + A+ NP L+I+ G + DLS V + +
Sbjct: 233 IGADLRNEVRGLWGTMPWKKWAAAAEKAGNALLKLNPNWLIIVGGTESNNDLSGVAKRPI 292
Query: 110 NLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLE-QGWPLFVSEF 167
L ++V+EAH Y ++ G ++G+ ++ MR + +LLE P++V EF
Sbjct: 293 MLAVPNRVVYEAHVYSWS-GWGSIEGSYSRRSYESFARSMRKNWAYLLEGDVAPVWVGEF 351
Query: 168 GADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
GA N + N + N E+D D+ W L
Sbjct: 352 GAPDHPNKGDANYWAN-LVRFLKEVDADFGYWAL 384
>gi|411003499|ref|ZP_11379828.1| glycoside hydrolase [Streptomyces globisporus C-1027]
Length = 632
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 105/241 (43%), Gaps = 47/241 (19%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP----DLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH ++ WCC G G + +N + WI+ +A + VVG LR
Sbjct: 171 VILNNHSTQTRWCC------GVDGSERWNSGQSTEEWIEDWVLLAKRYRSNPRVVGADLR 224
Query: 58 NELRGP--------KQNVKDWYRYMQ-LGAEAVHAANPEVLVILSGLN--------FDKD 100
NE+R + + DW Q G + ANP++L+++ G+N F +D
Sbjct: 225 NEVRRDVWDDPNWGRGDAHDWAAAAQRAGDRILKDANPDLLIMVEGINWAGIPVDGFWRD 284
Query: 101 ---LSFVRNQAVNLTFTGKLVFEAHWYGFTDGQ-AWVDGNPNQVCGRVVD--------NV 148
L V + L + KLV+ AH+YG+T + + G R D +
Sbjct: 285 RPHLKPVAELSHTLVRSHKLVYAAHYYGYTGPRHSGATGIGETSDPRYRDLSRAELFAEM 344
Query: 149 MRLSGFLLEQ-----GWPLFVSEFGADLRGNNVNDNRYLNCF-FGVAAELDWDWALWTLV 202
R S ++ + P+++SEFG + + + N F VA +L D+A W +V
Sbjct: 345 HRSSAYVADTPDRHFTAPVWISEFGVAKDADATDRAWFANTVDFLVAHDL--DFAYWPVV 402
Query: 203 G 203
G
Sbjct: 403 G 403
>gi|171678816|ref|XP_001904357.1| hypothetical protein [Podospora anserina S mat+]
gi|170937478|emb|CAP62136.1| unnamed protein product [Podospora anserina S mat+]
Length = 273
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 32/43 (74%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATI 44
V LDNHISK WCCS +DGN ++GD YFN W++GL+ MA +
Sbjct: 138 VHLDNHISKGMWCCSGTDGNTWWGDTYFNTANWVRGLSYMAGL 180
Score = 46.2 bits (108), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 44/84 (52%), Gaps = 7/84 (8%)
Query: 156 LEQGWPL--FVSEFGADLRGNNVNDNR--YLNCFFGVAAELDWDWALWTLVGSYYLREGV 211
L++ W L +SEFG + +VN R Y NC W +W L GSYY+REG+
Sbjct: 180 LKERWSLGFLLSEFGFAM---DVNTWRGTYANCLASYVRSEKAGWTMWVLAGSYYVREGI 236
Query: 212 IGLNEYYGLFDWNWCDIRNSSFLE 235
+E +GL +W + R+ +++
Sbjct: 237 QDYDEGWGLLTRDWREWRSEGYVD 260
>gi|452845726|gb|EME47659.1| glycoside hydrolase family 5 protein [Dothistroma septosporum
NZE10]
Length = 466
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 95/252 (37%), Gaps = 15/252 (5%)
Query: 2 VILDNHISKPGWC-----CSNS---DGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVG 53
VI ++HI+ WC C +S D G F D WI + V+G
Sbjct: 209 VIPNDHITNAHWCDGMNLCDSSWKNDHLGPFCKIKQTTDSWIDHWKTIMEPLADNPLVIG 268
Query: 54 MSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE RG W + +EA+ + P L+ + G++ D R++ + L
Sbjct: 269 ADLRNEPRGLWGTMTWNMWATAAEKASEALLSIQPNWLMFVEGISSANDCGGARSRPIKL 328
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADL 171
+ +LV+ +H Y ++ V +D + + P++V EFGA
Sbjct: 329 SVADRLVYSSHVYSWSGWSTLVPYGKRPYPSFAMDMDKNWAYLIRGNIAPVWVGEFGAPH 388
Query: 172 RGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNS 231
D Y + + E D D+ W L R+ NE YGL +W +
Sbjct: 389 HAGE-QDRHYWDNLMKILEETDADYGYWALNP----RKPEGYDNETYGLLRDDWETPVDD 443
Query: 232 SFLERISSLQSP 243
L + L P
Sbjct: 444 YRLADLKKLMKP 455
>gi|440633127|gb|ELR03046.1| hypothetical protein GMDG_05893 [Geomyces destructans 20631-21]
Length = 421
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/213 (23%), Positives = 90/213 (42%), Gaps = 15/213 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDL---------WIKGLTKMATIFNGVRNVV 52
V+++NHI+ WCC + +G + + + W+ + V+
Sbjct: 171 VVVNNHITHATWCCGANPCDGLWYNTHLPASACRIHQSESGWVDNWVTVMEPHVNNPLVI 230
Query: 53 GMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
LRNE+R + + W R + + + L+++ GL L+ VR + V
Sbjct: 231 AADLRNEVRALWGTMSWERWARAAERAGNRLLGMRSDWLIVVGGLGSQNFLNEVRERPVR 290
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGN-PNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFG 168
L G++V+ +H Y ++ G DG + V ++ G+L+E P++V EFG
Sbjct: 291 LVVPGRVVYSSHVYSWS-GWGSRDGRYAKRPYASFVKSMRENWGYLVEGDIAPVWVGEFG 349
Query: 169 ADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
A + D RY ++D D+A W +
Sbjct: 350 APAKPGE-GDARYWRHLMRYLKKIDADFAYWAI 381
>gi|358398748|gb|EHK48099.1| glycoside hydrolase family 5 protein [Trichoderma atroviride IMI
206040]
Length = 409
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 97/250 (38%), Gaps = 22/250 (8%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF--------NPDLWIKGLTKMATIFNGVRNVVG 53
VI++NHI++ WCC + + + + + WI+ + F V+G
Sbjct: 162 VIINNHITRATWCCGADPCDAGWANDHLGSLCRVKQTEEEWIQHWESVMERFVHNPLVIG 221
Query: 54 MSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE+RG + W + + + L+++ G D+S R + V L
Sbjct: 222 ADLRNEVRGLWGTMPWSKWAAAAEKCGNRLLKMKSDWLIVVEGTESANDVSGARERPVQL 281
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG--WPLFVSEFGA 169
+LV+ AH Y ++ +W DG Q + MR + L +G P++ E GA
Sbjct: 282 DVDNRLVYSAHGYKWSGWGSW-DGRFAQRSYKSFVKTMRHNWAYLLEGDVAPVWFGELGA 340
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWC--- 226
+V D Y +D D+ W L R+ E Y L +W
Sbjct: 341 P-NDPSVGDVHYWRNLLKYLKSIDADFGYWALNP----RKPKDNARESYSLMHDDWITPV 395
Query: 227 -DIRNSSFLE 235
D R LE
Sbjct: 396 LDYRMKDLLE 405
>gi|456387362|gb|EMF52875.1| glycosyl hydrolase [Streptomyces bottropensis ATCC 25435]
Length = 653
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 101/248 (40%), Gaps = 45/248 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD----LWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W MA + + VVG L
Sbjct: 184 VILNNHTNTTRWCC------GVDGNERWNASQTAAAWENDWLFMARRYRDNKRVVGADLY 237
Query: 58 NELR-----GPK---QNVKDWYRYMQ-LGAEAVHAANPEVLVILSGLNFD---------- 98
NE+R P + DW+ Q +G + A P++L+I+ G+N+
Sbjct: 238 NEVRRDIWDDPNWGLGDAHDWFAASQRVGDRILTEAAPDLLIIVEGINWTGIPVDGLPHE 297
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQ-AWVDGNPNQVCGRVVD-------NVM 149
L VR + L + KLV+ AH+Y +T + + G R D +V+
Sbjct: 298 RPTLEPVRRLSHTLVDSDKLVYSAHFYDYTGPRHSGATGTGETSDPRYRDLTPTELTDVL 357
Query: 150 RLSGFLL--EQGW----PLFVSEFGADLRGNNVNDNR-YLNCFFGVAAELDWDWALWTLV 202
F + EQ P+++SEFG R R + F D D+A W LV
Sbjct: 358 NRQAFFVTAEQDRHFTAPVWISEFGVGGRAETGARQRAWFENFVDQLVRADADFAYWPLV 417
Query: 203 GSYYLREG 210
G + R G
Sbjct: 418 GWHENRTG 425
>gi|291442689|ref|ZP_06582079.1| cellulase [Streptomyces ghanaensis ATCC 14672]
gi|291345584|gb|EFE72540.1| cellulase [Streptomyces ghanaensis ATCC 14672]
Length = 624
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 102/248 (41%), Gaps = 45/248 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP----DLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N + W MA + VVG L
Sbjct: 155 VILNNHTNTTRWCC------GVDGNERWNTSQSTEAWENDWLFMARRYRDNDRVVGADLY 208
Query: 58 NELR-----GPKQNV---KDWYRYMQ-LGAEAVHAANPEVLVILSGLNFD---------- 98
NE+R P + DW+ Q +G + A+ ++L+I+ G+N+
Sbjct: 209 NEVRRTVWDDPNWGLGDHHDWFAASQRVGDRILTEADSDLLIIVEGINWTGIPVDGFAHG 268
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFTD---------GQAWVDGNPNQVCGRVVDNV 148
L VR + L +GKLV+ AH+YG+T G+ + ++D +
Sbjct: 269 RPTLEPVRRLSHTLVASGKLVYSAHFYGYTGPNHSGATGIGETSDPRYQDLSPAELIDVL 328
Query: 149 MRLSGFLLEQ-----GWPLFVSEFGADLRGNN-VNDNRYLNCFFGVAAELDWDWALWTLV 202
R + F + P+++SEFG R V + F D D+A W LV
Sbjct: 329 NRQAFFAAAEPDRHFTAPVWISEFGVGGREETGVAQRAWFENFVDHLIRTDADFAYWPLV 388
Query: 203 GSYYLREG 210
G + R G
Sbjct: 389 GWHDNRRG 396
>gi|363581940|ref|ZP_09314750.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
HQM9]
Length = 812
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 78/187 (41%), Gaps = 32/187 (17%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQY-----FNPDLWIKGLTKMATIFNGVRNVVGMSL 56
++LDNH + +GF + + ++ + WI +A + G VV M L
Sbjct: 144 IVLDNH---------SRAADGFLNEAFWYTPEYSEERWINDWIFLAERYKGKSAVVAMDL 194
Query: 57 RNELRGP----KQNVKDWYRYMQLGAEAVHAANPEVLVILSGL-NFDKD-------LSFV 104
NE G DW + + AV A NP+VL+I+ G+ F+ D L
Sbjct: 195 NNEPHGSTWGNSNPATDWNKAAERCGNAVLAVNPDVLIIIEGVGEFEGDSYWWGGQLKGA 254
Query: 105 RNQAVNLTFTGKLVFEAHWYG-FTDGQAWVDGN--PNQVCGRVVDNVMRLSGFLLEQGWP 161
+ L+ KLV+ AH YG Q W + N PN + G ++ L P
Sbjct: 255 EKYPIQLSNQQKLVYSAHEYGPEVSEQDWFNANNFPNNMPGLWEEHFHYL---YKNNASP 311
Query: 162 LFVSEFG 168
+F+ EFG
Sbjct: 312 IFIGEFG 318
>gi|350296405|gb|EGZ77382.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
Length = 396
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 92/213 (43%), Gaps = 15/213 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDL---------WIKGLTKMATIFNGVRNVV 52
VI++NHI+ WCC + + + + L WI + T F V+
Sbjct: 148 VIVNNHITTATWCCGADPCDAGWANDFLPGSLCRVRQTEAQWIDHWVTVMTRFVDNPLVI 207
Query: 53 GMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
G LRNE+RG + W + A+ N + L+++ G DL+ V+ + V
Sbjct: 208 GADLRNEVRGLWGTMPWERWAAAAERAGNALLGINSDWLIVVGGTESGNDLTGVKRRPVE 267
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQGW-PLFVSEFG 168
L +LV+ AH Y ++ G + G ++ MR + G+L+E+ P+++ EFG
Sbjct: 268 LDVANRLVYSAHVYSWS-GWGSMGGRYSKRTYASFVKAMRQNWGYLVEENLAPVWIGEFG 326
Query: 169 ADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
A R + D Y +D D+ W +
Sbjct: 327 APDR-PGIGDANYWQNLLRYLKAIDADFGYWAI 358
>gi|402086611|gb|EJT81509.1| endoglucanase E1 [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 409
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 91/212 (42%), Gaps = 14/212 (6%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + P + WI+ + F G VVG
Sbjct: 162 VIVNNHITTATWCCGADPCDAGWANDHLGPICRVSQTEEDWIRHWEIVMERFVGNPRVVG 221
Query: 54 MSLRNELRGPKQNVK--DWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE+RG + W + + N + L+++ G DL V + V L
Sbjct: 222 ADLRNEVRGLWGTMPWDRWASAAERCGNRLLRMNRDWLIVVGGTESGNDLRGVERRPVRL 281
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQG-WPLFVSEFGA 169
++V+ AH Y ++ G +G Q MR + +L+E+ P++V EFGA
Sbjct: 282 DVEDRVVYSAHVYAWS-GWGSTEGRFAQRPYASFVRTMRENWAYLVERDIAPVWVGEFGA 340
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
+ N + N + A +D D+ W +
Sbjct: 341 PHLPGAGDVNYWQNLMRYLKA-IDADFGYWAI 371
>gi|323356589|ref|YP_004222985.1| endoglucanase [Microbacterium testaceum StLB037]
gi|323272960|dbj|BAJ73105.1| endoglucanase [Microbacterium testaceum StLB037]
Length = 564
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 88/217 (40%), Gaps = 30/217 (13%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD H +P S + ++ QY + WI +A + V+G L NE
Sbjct: 151 VILDRH--RPD---SGAQSELWYTAQY-SEKRWIDDWKMLAARYKNESAVIGADLHNEPH 204
Query: 62 GPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKD---------LSFVRNQ 107
GP +DW AV + NP++L+I+ G+ +D L VR+
Sbjct: 205 GPACWNCGDASRDWRAAATRAGNAVLSVNPKLLIIVEGVENQRDGSATWWGGGLKDVRSA 264
Query: 108 AVNLTFTGKLVFEAHWYGFT-DGQAWVDGN--PNQVCGRVVDNVMRLSGFLLEQG-WPLF 163
V L+ ++V+ H Y T Q W + P + G N G+L+ + P+
Sbjct: 265 PVTLSVKNRVVYSPHEYPATVHAQPWFSASNYPANLEGIWDANW----GYLVRENIAPVL 320
Query: 164 VSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWT 200
+ EFG L +D +L A+ +A W+
Sbjct: 321 LGEFGTKLE--TTSDQAWLKTLVAYLAKTKMSYAYWS 355
>gi|302546879|ref|ZP_07299221.1| putative cellulase (glycosyl hydrolase family 5) [Streptomyces
hygroscopicus ATCC 53653]
gi|302464497|gb|EFL27590.1| putative cellulase (glycosyl hydrolase family 5) [Streptomyces
himastatinicus ATCC 53653]
Length = 623
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 95/240 (39%), Gaps = 44/240 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP----DLWIKGLTKMATIFNGVRNVVGMSLR 57
VIL+NH + WCC G G++ +N W MA + VVG L
Sbjct: 160 VILNNHTNTSRWCC------GIDGNERWNSGQSTQAWTDDWVFMARRYADDPRVVGADLY 213
Query: 58 NELR-----GPK---QNVKDWYRYMQLGAEAV-HAANPEVLVILSGLNFD---------- 98
NE+R P + DWY Q A+ + +P++L+++ G+N+
Sbjct: 214 NEVRRDVFDDPNWGLGDEHDWYAAAQRAADRILTEGDPDLLIVIEGINWTGLPVDGLPHG 273
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFT----DGQAWV----DGNPNQVCGRVVDNVM 149
L+ R + L T KLV+ AH+YG+T G + D + + V+
Sbjct: 274 RPTLAPARTLSHTLVDTHKLVYSAHFYGYTGPRHSGATGIGETHDPRYQDLSREDLFRVL 333
Query: 150 RLSGFLLEQG------WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVG 203
F + + P+++SEFG + A+ D D+A W LVG
Sbjct: 334 SDQAFFVAEDTGKHYTAPVWISEFGIGSGETGTASRAWFANITDYFADHDADFAYWPLVG 393
>gi|358387472|gb|EHK25067.1| glycoside hydrolase family 5 protein, partial [Trichoderma virens
Gv29-8]
Length = 409
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 86/212 (40%), Gaps = 14/212 (6%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
VI++NHI+ WCC + + + + P + WI+ + F V+G
Sbjct: 162 VIINNHITSATWCCGADPCDAGWANDHLGPLCRVKQTEEEWIQHWESVMDRFVHNPLVIG 221
Query: 54 MSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE+RG + W + + + L+I+ G D+S ++ V L
Sbjct: 222 ADLRNEVRGLWGTMPWSKWAAAAEKCGNRLLKMKSDWLIIVEGTESANDVSGALDRPVRL 281
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQGW-PLFVSEFGA 169
+LV+ AH Y ++ +W G Q + MR + +LL+ P++ E GA
Sbjct: 282 DVENRLVYSAHGYKWSGWGSW-QGRFAQRSYKSFVKTMRHNWAYLLDLDIAPVWFGELGA 340
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
V D RY +D D+ W L
Sbjct: 341 P-NDPTVGDARYWRHLLKYLKSIDGDFGYWAL 371
>gi|148245203|ref|YP_001220664.1| cellulase [Clavibacter michiganensis subsp. michiganensis NCPPB
382]
gi|147829031|emb|CAM98473.1| cellulase [Clavibacter michiganensis subsp. michiganensis NCPPB
382]
Length = 746
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 85/204 (41%), Gaps = 30/204 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD H +P S ++ QY WI +A + V+G+ L NE
Sbjct: 173 VILDQH--RPD---SGGQSELWYTSQYPE-SRWISDWRMLAKRYAAEPTVIGVDLHNEPH 226
Query: 62 GPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKD---------LSFVRNQ 107
G DW + G AV A NP +LV++ G++ + D L V N
Sbjct: 227 GAATWGTGAATTDWRAAAERGGNAVLAENPNLLVLVEGIDHEADGSGTWWGGALGLVGNA 286
Query: 108 AVNLTFTGKLVFEAHWYGFTD-GQAWVDGN--PNQVCGRVVDNVMRLSGFLLEQG-WPLF 163
V L+ ++V+ H Y T GQ+W + P + G + D G+L ++ P+
Sbjct: 287 PVRLSVANRVVYSPHDYPSTIYGQSWFSASNYPANLPG-IWDAHW---GYLAKKDIAPVL 342
Query: 164 VSEFGADLRGNNVNDNRYLNCFFG 187
V EFG +D ++LN G
Sbjct: 343 VGEFGTKFE--TTSDKQWLNTLVG 364
>gi|8980306|emb|CAA44467.2| Cellulase [Clavibacter michiganensis subsp. michiganensis NCPPB
382]
Length = 745
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 85/204 (41%), Gaps = 30/204 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD H +P S ++ QY WI +A + V+G+ L NE
Sbjct: 173 VILDQH--RPD---SGGQSELWYTSQYPE-SRWISDWRMLAKRYAAEPTVIGVDLHNEPH 226
Query: 62 GPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKD---------LSFVRNQ 107
G DW + G AV A NP +LV++ G++ + D L V N
Sbjct: 227 GAATWGTGAATTDWRAAAERGGNAVLAENPNLLVLVEGIDHEADGSGTWWGGALGLVGNA 286
Query: 108 AVNLTFTGKLVFEAHWYGFTD-GQAWVDGN--PNQVCGRVVDNVMRLSGFLLEQG-WPLF 163
V L+ ++V+ H Y T GQ+W + P + G + D G+L ++ P+
Sbjct: 287 PVRLSVANRVVYSPHDYPSTIYGQSWFSASNYPANLPG-IWDAHW---GYLAKKDIAPVL 342
Query: 164 VSEFGADLRGNNVNDNRYLNCFFG 187
V EFG +D ++LN G
Sbjct: 343 VGEFGTKFE--TTSDKQWLNTLVG 364
>gi|164427183|ref|XP_964407.2| hypothetical protein NCU03254 [Neurospora crassa OR74A]
gi|157071641|gb|EAA35171.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 413
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 91/213 (42%), Gaps = 15/213 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDL---------WIKGLTKMATIFNGVRNVV 52
VI++NHI+ WCC + + + + L WI + F V+
Sbjct: 164 VIVNNHITTATWCCGADPCDAGWANDFLPGSLCRVRQTEAQWIDHWVTVMNRFVDNPLVI 223
Query: 53 GMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
G LRNE+RG + W + A+ N + L+I+ G DL+ V+ + V
Sbjct: 224 GADLRNEVRGLWGTMPWERWAAAAERAGNALLGINSDWLIIVGGTESGNDLTGVKRRPVE 283
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQGW-PLFVSEFG 168
L +LV+ AH Y ++ G + G ++ MR + G+L+E+ P+++ EFG
Sbjct: 284 LDVANRLVYSAHVYSWS-GWGSMGGRYSKRTYASFVKAMRQNWGYLVEENLAPVWIGEFG 342
Query: 169 ADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
A R + D Y +D D+ W +
Sbjct: 343 APDR-PGIGDANYWQNLLRYLKAIDADFGYWAI 374
>gi|340521941|gb|EGR52174.1| glycoside hydrolase family 16 [Trichoderma reesei QM6a]
Length = 407
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 49/211 (23%), Positives = 86/211 (40%), Gaps = 12/211 (5%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNP--------DLWIKGLTKMATIFNGVRNVVG 53
VI++NHI++ WCC + + + + P + WI+ ++ F V+G
Sbjct: 160 VIINNHITRATWCCGADPCDAGWANDHLGPLCAVKQTEEEWIQHWEEVMGRFVHNPLVIG 219
Query: 54 MSLRNELRGPKQNV--KDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNL 111
LRNE+RG + W + + + L+I+ G D+S ++ V L
Sbjct: 220 ADLRNEVRGLWGTMPWSKWAAAAENCGNRLLKMKSDWLIIVEGTESANDVSGALDRPVRL 279
Query: 112 TFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGW-PLFVSEFGAD 170
++V+ AH Y ++ +W + V + R +LLE P++ E GA
Sbjct: 280 DVENRVVYSAHGYKWSGWGSWQGRFAQRSYKSFVKTMRRNWAYLLEVDIAPVWFGELGAP 339
Query: 171 LRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
+V D Y + +D D W L
Sbjct: 340 -SNPSVGDVHYWSNLMRYLKSIDADLGYWAL 369
>gi|329937512|ref|ZP_08287070.1| glycosyl hydrolase [Streptomyces griseoaurantiacus M045]
gi|329303388|gb|EGG47275.1| glycosyl hydrolase [Streptomyces griseoaurantiacus M045]
Length = 635
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 96/240 (40%), Gaps = 44/240 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD----LWIKGLTKMATIFNGVRNVVGMSLR 57
V+L+NH + WCC G G++ +N W MA + VVG L
Sbjct: 174 VVLNNHTNTSRWCC------GLDGNERWNSGQDTRRWTDDWVFMARRYADDPRVVGADLY 227
Query: 58 NELRGP--------KQNVKDWYRYMQLGAEAV-HAANPEVLVILSGLNFD---------- 98
NE+R + DWY Q A+ + +P++LV++ G+N+
Sbjct: 228 NEVRRDVLHDPDWGGGDDYDWYAAAQWAADRILTEGDPDLLVVIEGINWTGLPVDGFPHG 287
Query: 99 -KDLSFVRNQAVNLTFTGKLVFEAHWYGFT----DGQAWVDGNPNQVCGRVVDNVMRLS- 152
L+ R + L + KLV+ AH+YG+T G + + + + +R +
Sbjct: 288 RPTLTPARTLSHTLVESDKLVYSAHFYGYTGPHHSGATGLGETSDPRYQDLTEQELRQAL 347
Query: 153 ---GFLLEQG------WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVG 203
F + Q P+++SEFG N + A+ D D+A W LVG
Sbjct: 348 YDEAFFVSQDTGRHYTAPVWISEFGIGSGETNGAARTWFGRITDYFADQDADFAYWPLVG 407
>gi|402494289|ref|ZP_10841031.1| glycoside hydrolase [Aquimarina agarilytica ZC1]
Length = 819
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 79/188 (42%), Gaps = 34/188 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQY-----FNPDLWIKGLTKMATIFNGVRNVVGMSL 56
++LDNH +GF + + ++ + WI MA + G VV M L
Sbjct: 144 IVLDNHSRA---------ADGFLNEAFWYTPEYSEERWINDWIFMAERYKGKSAVVAMDL 194
Query: 57 RNELRGP----KQNVKDWYRYMQLGAEAVHAANPEVLVILSGL-NFDKD-------LSFV 104
NE G DW + + AV A NP+VL+I+ G+ F+ D L
Sbjct: 195 NNEPHGSTWGNSTPATDWNKAAERCGNAVLAVNPDVLIIIEGVGEFEGDSYWWGGQLKGA 254
Query: 105 RNQAVNLTFTGKLVFEAHWYGFTDGQA-WVDGN--PNQVCGRVVDNVMRLSGFLLE-QGW 160
+ L+ KLV+ AH YG Q W + + P+ + G ++ FL E
Sbjct: 255 EKYPIQLSNQEKLVYSAHEYGPEVAQQDWFEASNFPSNMPGLWEEHFH----FLYENNSS 310
Query: 161 PLFVSEFG 168
P+F+ EFG
Sbjct: 311 PIFIGEFG 318
>gi|367052039|ref|XP_003656398.1| glycoside hydrolase family 5 protein [Thielavia terrestris NRRL
8126]
gi|347003663|gb|AEO70062.1| glycoside hydrolase family 5 protein [Thielavia terrestris NRRL
8126]
Length = 416
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/212 (23%), Positives = 89/212 (41%), Gaps = 13/212 (6%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF---------NPDLWIKGLTKMATIFNGVRNVV 52
VI++NHI+ WCC + + + + + W++ ++ F R V+
Sbjct: 164 VIVNNHITSATWCCGADPCDSGWANDHLPAVLCRVRQTEEQWVQHWEEVMGRFVKNRLVI 223
Query: 53 GMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
G LRNE+RG + W + + NP+ L+I+ G DLS V + V
Sbjct: 224 GADLRNEVRGFWGTMPWEKWAAAAEKAGNRLLKMNPDWLIIVGGTESCNDLSGVARRPVV 283
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGA 169
L ++V+ AH Y ++ + + V ++ + +L+E P++V EFGA
Sbjct: 284 LDVPDRVVYSAHVYSWSGWGSLGGRYSKRTYASFVQSMRKNWAYLVEGNIAPVWVGEFGA 343
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
+ D Y N +D D+ W +
Sbjct: 344 PDKPKR-GDANYWNNLMRYLKTIDADFGYWAI 374
>gi|443626585|ref|ZP_21111001.1| putative Glycoside hydrolase [Streptomyces viridochromogenes Tue57]
gi|443339890|gb|ELS54116.1| putative Glycoside hydrolase [Streptomyces viridochromogenes Tue57]
Length = 505
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 74/170 (43%), Gaps = 25/170 (14%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG+ L NE P +DW Q AV + NPE+
Sbjct: 171 WIANLKALAARYKGDSTVVGIDLHNEPHDPACWGCGDTARDWRLAAQRAGNAVLSVNPEL 230
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQV 140
L+++ G+ + +L V V L +LV+ AH Y + Q G+P+
Sbjct: 231 LIMVEGVQSFNGANGWWGGNLMGVAQYPVQLDVPNRLVYSAHDYATSVAQQPWFGDPS-- 288
Query: 141 CGRVVDNVM----RLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
DN+ R G++ +Q P+++ EFG L+ + D ++L
Sbjct: 289 ---FPDNMPGVWDRYWGYVFKQNIAPVWLGEFGTTLQ--STVDQKWLAAL 333
>gi|367018270|ref|XP_003658420.1| glycoside hydrolase family 5 protein [Myceliophthora thermophila
ATCC 42464]
gi|347005687|gb|AEO53175.1| glycoside hydrolase family 5 protein [Myceliophthora thermophila
ATCC 42464]
Length = 420
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/237 (23%), Positives = 93/237 (39%), Gaps = 17/237 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF---------NPDLWIKGLTKMATIFNGVRNVV 52
V+++NHI+ WCC + + + + D WI+ + V+
Sbjct: 168 VVVNNHITSATWCCGADPCDAGWANDHLPAALCRVRQTEDQWIEHWETVMARLADNPLVI 227
Query: 53 GMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
G LRNE+RG W + + NP+ L+I+ G DL+ V + V
Sbjct: 228 GADLRNEVRGLWGTMPWARWAAAAERAGNRLLRMNPDWLIIVGGTESQNDLTGVATRPVV 287
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLE-QGWPLFVSEFGA 169
L ++V+ AH Y ++ +W + V + + +L+E P++V EFGA
Sbjct: 288 LDLPDRVVYSAHVYSWSGWGSWGGRYATRSYPSFVQSARKNWAYLVEGDVAPVWVGEFGA 347
Query: 170 DLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWC 226
R D Y N +D D+ W L R+ E Y L + +W
Sbjct: 348 P-RSPGRGDANYWNNLLRYLKTIDADFGYWAL----NPRKPRHNETETYALLEDDWV 399
>gi|189016689|ref|YP_001705759.1| cellulase CelA [Clavibacter michiganensis subsp. sepedonicus]
gi|169158032|emb|CAQ03244.1| cellulase CelA [Clavibacter michiganensis subsp. sepedonicus]
Length = 684
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 84/204 (41%), Gaps = 30/204 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD H +P S ++ +Y WI +A + V+G+ L NE
Sbjct: 109 VILDQH--RPD---SGGQSELWYTSEYPE-SRWISDWRMLAKRYASDPTVIGVDLHNEPH 162
Query: 62 GPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKD---------LSFVRNQ 107
G DW + G AV A NP++LV++ G++ D L
Sbjct: 163 GAATWGTGAATTDWRAAAERGGNAVLAENPKLLVLVEGIDHQADGTGTWWGGALDSAATA 222
Query: 108 AVNLTFTGKLVFEAHWYGFTD-GQAWVDGN--PNQVCGRVVDNVMRLSGFLLEQG-WPLF 163
+V LT ++V+ H Y T GQ W + P + G + D G+L ++ P+
Sbjct: 223 SVRLTVANRVVYSPHDYPSTIYGQPWFSASNYPTNLPG-IWDAHW---GYLAKKDIAPVL 278
Query: 164 VSEFGADLRGNNVNDNRYLNCFFG 187
V EFG L +D ++LN G
Sbjct: 279 VGEFGTKLE--TASDKQWLNTLVG 300
>gi|13277513|gb|AAK16222.1| cellulase CelA [Clavibacter michiganensis subsp. sepedonicus]
Length = 727
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 84/204 (41%), Gaps = 30/204 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD H +P S ++ +Y WI +A + V+G+ L NE
Sbjct: 152 VILDQH--RPD---SGGQSELWYTSEYPE-SRWISDWRMLAKRYASDPTVIGVDLHNEPH 205
Query: 62 GPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKD---------LSFVRNQ 107
G DW + G AV A NP++LV++ G++ D L
Sbjct: 206 GAATWGTGAATTDWRAAAERGGNAVLAENPKLLVLVEGIDHQADGTGTWWGGALDSAATA 265
Query: 108 AVNLTFTGKLVFEAHWYGFTD-GQAWVDGN--PNQVCGRVVDNVMRLSGFLLEQG-WPLF 163
+V LT ++V+ H Y T GQ W + P + G + D G+L ++ P+
Sbjct: 266 SVRLTVANRVVYSPHDYPSTIYGQPWFSASNYPTNLPG-IWDAHW---GYLAKKDIAPVL 321
Query: 164 VSEFGADLRGNNVNDNRYLNCFFG 187
V EFG L +D ++LN G
Sbjct: 322 VGEFGTKLE--TASDKQWLNTLVG 343
>gi|422935934|ref|YP_007005831.1| glycoside hydrolase [Erwinia phage vB_EamP-S6]
gi|339508054|gb|AEJ81614.1| glycoside hydrolase [Erwinia phage vB_EamP-S6]
Length = 782
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 82/181 (45%), Gaps = 21/181 (11%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
++LD+H K G + + GN GD + W+ +K+A + N+VG + NE
Sbjct: 518 IVLDHHRGKAG---TGTIGNPVDGD--YTLAKWLDTWSKLAQRYASYPNIVGADVHNEPH 572
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF--------VRNQAVNLTF 113
W Y++ A+HA NP ++ G+ D S+ V+++ V L+
Sbjct: 573 --DLTWTTWANYVEQAGAAIHAVNPNWIIFCQGVAQYNDESYWWGGQLQGVKDRPVKLSL 630
Query: 114 TGKLVFEAHWYGFTDG-QAWVDGNPNQVCGRVVDNVMRL----SGFL-LEQGWPLFVSEF 167
K+ + H YG + G Q+W+ + +N+ + GF+ E+ PL+V EF
Sbjct: 631 PNKVAYSPHEYGQSVGTQSWLAYDGQTPPANWPNNLASVWDTHWGFIYYEKIAPLWVGEF 690
Query: 168 G 168
G
Sbjct: 691 G 691
>gi|429196036|ref|ZP_19188026.1| cellulase (glycosyl hydrolase family 5) [Streptomyces ipomoeae
91-03]
gi|428668263|gb|EKX67296.1| cellulase (glycosyl hydrolase family 5) [Streptomyces ipomoeae
91-03]
Length = 510
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 76/169 (44%), Gaps = 23/169 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +AT + G V+G+ L NE P +DW Q AV + NP++
Sbjct: 171 WITNLKALATRYKGNPTVIGIDLHNEPHDPACWGCGDTTRDWRLAAQRAGNAVLSVNPDL 230
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
L+++ G+ + +L V V L ++V+ AH Y + Q+W P
Sbjct: 231 LILVEGVQSFNGVSGWWGGNLMGVAQYPVQLDVANRVVYSAHDYATSVAQQSWFSDPTFP 290
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
+ G + D R G++ +Q P+++ EFG L+ + D R+L+
Sbjct: 291 ANMPG-IWD---RYWGYIFKQNIAPVWIGEFGTTLQ--STIDQRWLSAL 333
>gi|171695112|ref|XP_001912480.1| hypothetical protein [Podospora anserina S mat+]
gi|170947798|emb|CAP59961.1| unnamed protein product [Podospora anserina S mat+]
Length = 430
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/214 (23%), Positives = 88/214 (41%), Gaps = 17/214 (7%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF---------NPDLWIKGLTKMATIFNGVRNVV 52
VI+++HI+ WCC + + + + + WI+ + G V+
Sbjct: 179 VIVNDHITTATWCCGADPCDASWSNDWLPGVVCKVRQTEEDWIRNWETVMGRLAGNELVI 238
Query: 53 GMSLRNELRGPKQNVKDWYRYMQLGAEA---VHAANPEVLVILSGLNFDKDLSFVRNQAV 109
G LRNE+RG W R+ +A + + L+I+ G DL R + V
Sbjct: 239 GADLRNEVRG-VWGTMPWDRWASAAEKAGNRLLKMKKDWLIIVGGTESGNDLRGARTRPV 297
Query: 110 NLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLE-QGWPLFVSEF 167
L ++V+ AH Y ++ G ++G ++ MR + G+L+E P+++ EF
Sbjct: 298 RLEVEDRVVYSAHVYSWS-GWGSLEGRYSKRTYPSFVASMRSNWGYLVEGDVAPVWIGEF 356
Query: 168 GADLRGNNVNDNRYLNCFFGVAAELDWDWALWTL 201
GA D Y +D D+ W +
Sbjct: 357 GAPSEPGR-GDANYWENLMRYLKSIDADFGYWAI 389
>gi|418048811|ref|ZP_12686898.1| glycoside hydrolase family 5 [Mycobacterium rhodesiae JS60]
gi|353189716|gb|EHB55226.1| glycoside hydrolase family 5 [Mycobacterium rhodesiae JS60]
Length = 1099
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 78/184 (42%), Gaps = 20/184 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD+H S G S NG + + ++ D W+ +A ++ V+G L NE
Sbjct: 488 VILDHHRSAAGAGTSE---NGLWYNSQYSEDQWVADWQMLANRYSDNPTVIGFDLHNEPY 544
Query: 62 GPK---QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF--------VRNQAVN 110
DW R + AV NP +L+ + G+ + S+ V+++ +
Sbjct: 545 NGTWGGGGANDWARAAERAGNAVLQENPNLLIFVEGVGTYQGQSYWWGGNLMGVKDRPIV 604
Query: 111 LTFTGKLVFEAHWY-GFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQG-WPLFVSEF 167
L G++V+ H Y Q W G + NV R + G++ E P++V EF
Sbjct: 605 LDVPGRVVYSPHDYPNSVYAQPWFQ---TANFGANLPNVFRNAWGYIYEDNIAPIYVGEF 661
Query: 168 GADL 171
G L
Sbjct: 662 GTKL 665
>gi|440695674|ref|ZP_20878200.1| cellulase (glycosyl hydrolase family 5) [Streptomyces
turgidiscabies Car8]
gi|440282184|gb|ELP69669.1| cellulase (glycosyl hydrolase family 5) [Streptomyces
turgidiscabies Car8]
Length = 513
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 74/169 (43%), Gaps = 23/169 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +AT + G V+G+ L NE P KDW Q AV NP++
Sbjct: 179 WIANLKALATRYVGQDTVIGIDLHNEPHDPACWGCGDTTKDWRLAAQRAGNAVLGVNPQL 238
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
LV + G+ + +L V V L+ ++V+ AH Y + Q+W P
Sbjct: 239 LVFVEGVQTFNGVSGWWGGNLMGVAQYPVQLSVANRVVYSAHDYATSVAQQSWFSDPSFP 298
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
+ G + D + G++ +Q P++V EFG L+ + D R+L
Sbjct: 299 ANMPG-IWD---KYWGYIFKQNIAPVWVGEFGTTLQ--STVDQRWLAAL 341
>gi|374531863|ref|YP_005098419.1| cel gene product [Erwinia phage PEp14]
gi|373429479|gb|AEY69604.1| cellulase [Erwinia phage PEp14]
Length = 619
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 78/182 (42%), Gaps = 21/182 (11%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ILD+H G + +DG+ D + D W+ MA + G V+G L NE
Sbjct: 356 IILDHHRRHAG---TGADGSPV--DATYTVDKWLASWRFMAERYAGNEWVIGADLHNEPH 410
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDK--------DLSFVRNQAVNLTF 113
N W Y +L A+H P+ ++ + G+ K L+ VR++ V L
Sbjct: 411 NLDWN--QWATYAELAGNAIHEIQPQWIMFVQGVATYKGVGAWWGGQLAGVRDRPVVLKH 468
Query: 114 TGKLVFEAHWYGFTDG-QAWVDGNPNQVCGRVVDNVMRL----SGFLLEQG-WPLFVSEF 167
+L + H YG + G Q W+ + N+ + GFL E+ P++V EF
Sbjct: 469 RDRLAYSPHEYGQSVGQQTWLADEAGNMPANWPLNLYAIWRERWGFLFEEAIAPVWVGEF 528
Query: 168 GA 169
G
Sbjct: 529 GG 530
>gi|395769309|ref|ZP_10449824.1| cellulase [Streptomyces acidiscabies 84-104]
Length = 507
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 71/158 (44%), Gaps = 21/158 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A+ + G VVG+ L NE P DW Q AV + NP++
Sbjct: 171 WIANLKSLASRYAGNPAVVGIDLHNEPHDPACWGCGDTATDWRLAAQRAGNAVLSVNPDL 230
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
L+ + G+ + +L V V L+ +LV+ AH Y + Q+W P
Sbjct: 231 LIFVEGIQTVNGVSGWWGGNLMGVAQYPVQLSVPNRLVYSAHDYATSVAQQSWFSDPTFP 290
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGN 174
N + G + D + G++ +Q P++V EFG L+ +
Sbjct: 291 NNMPG-IWD---KYWGYIFKQNIAPVWVGEFGTTLQAS 324
>gi|42408138|dbj|BAD09277.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 202
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 53/100 (53%), Gaps = 14/100 (14%)
Query: 230 NSSFLERISSLQSPFRGPGVFETGLHKVIYHPATGLCVQRK--------SFLDPLTLGPC 281
N +FL + + GPG + + V++HP TG CV R+ + L LG C
Sbjct: 16 NVAFLTDGVAGDAMHAGPGYDDARPYTVLFHPLTGRCVVRRAADDAAAATAAGTLELGWC 75
Query: 282 TESEAWSYT-PHKTISLKGAY-----FCLQAKHVGKPAKL 315
+++AW+YT P T++++G + CL+A G+PA+L
Sbjct: 76 EDTDAWAYTQPVSTLAMQGVWRGSPPLCLRAVGSGRPARL 115
>gi|50955752|ref|YP_063040.1| cellulase [Leifsonia xyli subsp. xyli str. CTCB07]
gi|50952234|gb|AAT89935.1| cellulase [Leifsonia xyli subsp. xyli str. CTCB07]
Length = 541
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 63/146 (43%), Gaps = 9/146 (6%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI +A + R V+G+ L NE P DW AV A NP +
Sbjct: 176 WIADWGALAARYRENRTVIGVDLHNEPHQPACWGCGDPAVDWRLAATRAGNAVLAVNPNL 235
Query: 89 LVILSGLNFDKDLSFVRNQAVNLTFTGKLVFEAHWYG-FTDGQAWVDGNPNQVCGRVVDN 147
LVI+ G+ + DL+ V + V L + ++V+ H Y +W +P G +
Sbjct: 236 LVIVEGMEWGGDLTGVASAQVVLNVSNRVVYSPHDYPQEVANHSWFH-DPT-YPGNLKKV 293
Query: 148 VMRLSGFLLEQG-WPLFVSEFGADLR 172
G++ QG P+ + EFG DL+
Sbjct: 294 WSSRWGYIAAQGIAPVLLGEFGTDLQ 319
>gi|336267424|ref|XP_003348478.1| hypothetical protein SMAC_02972 [Sordaria macrospora k-hell]
gi|380092133|emb|CCC10401.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 379
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/201 (20%), Positives = 80/201 (39%), Gaps = 24/201 (11%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VI++NHI+ WCC + + + Y L + A G+ +
Sbjct: 164 VIINNHITTATWCCGADPCDAGWANDY---------LPQRAVSVRGLWGTMPW------- 207
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFEA 121
+ W + A+ NP+ L+++ G DL+ VR + + L ++V+ A
Sbjct: 208 ------EKWASAAERAGNALLMMNPDWLMVVGGTESGNDLTGVRRRPIELDVEDRVVYSA 261
Query: 122 HWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGW-PLFVSEFGADLRGNNVNDNR 180
H Y ++ + + V + + +L+E+ P+++ EFGA R + D
Sbjct: 262 HVYSWSGWGSAGGRYSKRTYASFVKAMRKNWAYLVEENMAPVWIGEFGAPDR-PGIGDAN 320
Query: 181 YLNCFFGVAAELDWDWALWTL 201
Y E+D D+ W +
Sbjct: 321 YWRNLLRYLKEIDADFGYWAV 341
>gi|456385535|gb|EMF51103.1| cellulase [Streptomyces bottropensis ATCC 25435]
Length = 520
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 74/169 (43%), Gaps = 23/169 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG+ L NE P +DW Q AV + NP++
Sbjct: 180 WITNLKALAARYKGNATVVGIDLHNEPHDPACWGCGDTARDWRLAAQRAGNAVLSVNPDL 239
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
L+++ G+ + +L V V L ++V+ AH Y + Q+W P
Sbjct: 240 LILVEGVQTFNGVSGWWGGNLMGVAQYPVQLDVANRVVYSAHDYATSVAQQSWFSDPSFP 299
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
+ G V D + G++ +Q P++V EFG L+ + D R+L
Sbjct: 300 ANMPG-VWD---KYWGYIFKQNIAPVWVGEFGTTLQ--STVDQRWLAAL 342
>gi|146197143|dbj|BAF57335.1| putative glycosyl hydrolase family5 [uncultured symbiotic protist
of Hodotermopsis sjoestedti]
Length = 357
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/176 (25%), Positives = 80/176 (45%), Gaps = 24/176 (13%)
Query: 30 NPDLWI-KGLTKMATIFNGVRNVVGMSLRNELRG-----------PKQNVKDWYRYMQLG 77
+P WI GL A + + +VG+ ++NE G ++ +W +++
Sbjct: 137 HPAEWIYAGLEWFADHYKSEQTIVGIDVKNEPHGNCDSSDGAKWDDSKDANNWKYFIETA 196
Query: 78 AEAVHAANPEVLVILSGLNFDK--------DLSFVRNQAVNL-TFTGKLVFEAHWYGFT- 127
A +HA NPE+L+I+ G+ K +L VR+ +NL + KLV+ H YG +
Sbjct: 197 ATRIHAKNPELLIIVEGIACYKGHGGWWGGNLYAVRDYPINLGSHQNKLVYSPHEYGPSV 256
Query: 128 DGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYL 182
Q W +G+ D+ +++E PL + E+G + N + +Y+
Sbjct: 257 SQQTWFEGSFT-YDSLYADHWKDSWLYIVEDNIAPLLIGEWGGHIEEPNTSWMKYM 311
>gi|158422925|ref|YP_001524217.1| endoglucanase [Azorhizobium caulinodans ORS 571]
gi|158329814|dbj|BAF87299.1| endoglucanase [Azorhizobium caulinodans ORS 571]
Length = 1835
Score = 50.1 bits (118), Expect = 0.002, Method: Composition-based stats.
Identities = 49/183 (26%), Positives = 74/183 (40%), Gaps = 18/183 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ILD+H S G + NG + D + WI +AT + G +V+G L NE
Sbjct: 1368 IILDHHRSSAG---DGTSANGLWYDSSHSEASWIADWQMLATRYAGDTSVIGADLHNEPY 1424
Query: 62 GPK---QNVKDWYRYMQLGAEAVHAANPEVLVILSGLN--------FDKDLSFVRNQAVN 110
DW R A+ A N L+ + G+ + +L VR+ +
Sbjct: 1425 NGTWGDGGTNDWARAATAAGNAIGAVNDNWLIFVEGIGTYQGQNYWWGGNLMGVRDHPIE 1484
Query: 111 LTFTGKLVFEAHWY-GFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLL-EQGWPLFVSEFG 168
L KLV+ AH Y Q W G + ++ GF+ EQ P+++ EFG
Sbjct: 1485 LNVDNKLVYSAHDYPNSVYAQPWFQG--ANFAADLPAKFDQMWGFIYKEQIAPVYIGEFG 1542
Query: 169 ADL 171
L
Sbjct: 1543 TKL 1545
>gi|392382818|ref|YP_005032015.1| endoglucanase [Azospirillum brasilense Sp245]
gi|356877783|emb|CCC98631.1| endoglucanase [Azospirillum brasilense Sp245]
Length = 441
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 80/188 (42%), Gaps = 24/188 (12%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD+H S G ++++ NG + D + I+ +A + G VVG L NE
Sbjct: 111 VILDHHRSSDG---ASANENGLWYDDKYPESKMIENWKMLAERYKGNDTVVGADLHNEPH 167
Query: 62 GPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDK--------DLSFVRNQA 108
G DW + + + N + L+++ G+ + +L +++
Sbjct: 168 GQATWGGGDKATDWAWAAERIGNEIQSVNKDWLLLVEGVEIHQNQWEWWGGNLRGAKDRP 227
Query: 109 VNLTFTGKLVFEAHWYGFTDGQ-AWVDGN--PNQVCGRVVDNVMRLSGFLL-EQGWPLFV 164
+ KLV+ H YG + + W + PN + G+ DN G+LL E P+ V
Sbjct: 228 IEFDHPDKLVYSVHSYGPSIHEMEWFKASDYPNNLPGQYTDNW----GWLLKEDKAPVLV 283
Query: 165 SEFGADLR 172
EFG L
Sbjct: 284 GEFGGKLE 291
>gi|255547530|ref|XP_002514822.1| hypothetical protein RCOM_1077730 [Ricinus communis]
gi|223545873|gb|EEF47376.1| hypothetical protein RCOM_1077730 [Ricinus communis]
Length = 263
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 26/46 (56%)
Query: 161 PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYY 206
P+FV E G D RG + + Y +C A+ D DWA WT GSYY
Sbjct: 120 PMFVGEIGLDQRGLSQTEEHYYSCVLAYLADFDMDWAWWTWPGSYY 165
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 46/97 (47%), Gaps = 15/97 (15%)
Query: 299 GAYFCLQAKHVGKPAKLGIIC-----TDCGSTWEIISD-SKMHLSSKA-DNGTTVCLDVD 351
G+Y+ L + + +IC TDC S + S+ +HL++ + G +CL +
Sbjct: 162 GSYYYLHTSELKSERTMALICKPILSTDCSSQLSLWSEIDHLHLAAAVGEKGEPLCLQKE 221
Query: 352 SSNT--IVTNTCKCLSRDKTC------DPASQWFKLV 380
S T I+T+ C D C DP SQWFKLV
Sbjct: 222 SPYTSKILTSKCIFTQEDPACRKDPQKDPTSQWFKLV 258
>gi|290959511|ref|YP_003490693.1| cellulase [Streptomyces scabiei 87.22]
gi|260649037|emb|CBG72151.1| putative secreted cellulase [Streptomyces scabiei 87.22]
Length = 516
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 73/169 (43%), Gaps = 23/169 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +AT + G VVG+ L NE P KDW Q AV + N ++
Sbjct: 178 WITNLKALATRYKGNPTVVGIDLHNEPHDPACWGCGDTAKDWRLAAQRAGNAVLSVNADL 237
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQA-WVDGN--P 137
L+++ G+ + +L V V L +LV+ AH Y + Q W P
Sbjct: 238 LILVEGVQTFNGVSGWWGGNLMGVAQYPVQLDVANRLVYSAHDYATSVAQQDWFSDPSFP 297
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
+ G + D + G++ +Q P++V EFG L+ + D ++L
Sbjct: 298 ANMPG-IWD---KYWGYIFKQNIAPVWVGEFGTTLQ--STTDQKWLAAL 340
>gi|302532779|ref|ZP_07285121.1| LOW QUALITY PROTEIN: cellulase CelA [Streptomyces sp. C]
gi|302441674|gb|EFL13490.1| LOW QUALITY PROTEIN: cellulase CelA [Streptomyces sp. C]
Length = 475
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/218 (22%), Positives = 84/218 (38%), Gaps = 34/218 (15%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELR-----GPKQNVKDWYRYMQLGAEAVH----AA 84
W +A + VVG L NE+R P + D + + EA A
Sbjct: 37 WADDWVSIARRYRDNPRVVGADLYNEVRRDTWDDPNWGLGDGHDWQAASQEAADRILTEA 96
Query: 85 NPEVLVILSGLNFD-----------KDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQAWV 133
NP++L+++ G+N+ L+ VR + L +G+LV+ AH+YG+T +
Sbjct: 97 NPDLLIVVEGINWTGIPLDGFPHERPTLTPVRTLSHTLAVSGRLVYSAHFYGYTGPRHSG 156
Query: 134 DGNPNQVCGRVVDNVMR--LSGFLLEQGW------------PLFVSEFGADLRGNNVNDN 179
+ ++ R L L +Q + P+++SEFG
Sbjct: 157 ATGLGETHDPRYQDMTRAQLEQTLYDQAFFVSAETGTHYTAPVWISEFGIGADETGAAPR 216
Query: 180 RYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEY 217
+ A D D+A W LVG +G G + +
Sbjct: 217 AWFQNLTAYLAGSDADFAYWPLVGWSTTAQGAPGGDSW 254
>gi|288962579|ref|YP_003452874.1| endoglucanase [Azospirillum sp. B510]
gi|288914845|dbj|BAI76330.1| endoglucanase [Azospirillum sp. B510]
Length = 787
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 97/242 (40%), Gaps = 44/242 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD-LWIKGLTKMATIFNGVRNVVGMSLRNEL 60
VILD+H + G +N G ++ D+ NP+ I +A + G VVG L NE
Sbjct: 439 VILDHHRNDGG-AGTNESGL-WYTDK--NPESTVIDHWQTLAKRYAGNEAVVGADLHNEP 494
Query: 61 R-----GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTG 115
G DW Q A+ + N + L+++ G + L+ V++ +
Sbjct: 495 SVSATWGDGNQATDWAAAAQRIGNAIQSVNKDWLMLVEGTEWSSTLAGVKDHPIQFDVPN 554
Query: 116 KLVFEAHWYGFTDGQ-AWV-DGN-PNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADL 171
KLV+ H YG + GQ +W+ D N PN + + + G++ + P+ V EFG +
Sbjct: 555 KLVYSPHAYGQSVGQFSWLSDPNYPNNLPAQ----YDSMWGYIYKNNIAPILVGEFGGQM 610
Query: 172 RGNNVNDNRYLNCFFGVAAELDWDWAL-------WTLVGSYYLREGVIGLNEYYGLFDWN 224
AAE W L W L G L G G+N Y W
Sbjct: 611 HS---------------AAEQTWASTLIKYMNGDWNLDGKSDLAPGQQGMNWTY----WA 651
Query: 225 WC 226
W
Sbjct: 652 WT 653
>gi|383639631|ref|ZP_09952037.1| cellulase [Streptomyces chartreusis NRRL 12338]
Length = 504
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 74/169 (43%), Gaps = 23/169 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG+ L NE P +DW Q AV + NPE+
Sbjct: 169 WITNLKSLAARYKGNPTVVGIDLHNEPHDPACWGCGDTTRDWRLAAQRAGNAVLSVNPEL 228
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
L+++ G+ + +L V V L +LV+ AH Y + Q+W P
Sbjct: 229 LIMVEGVQSFNGANGWWGGNLMGVAQYPVQLDVPNRLVYSAHDYATSVAQQSWFSDPSFP 288
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
+ G + D + G++ +Q P+++ EFG L+ D ++L+
Sbjct: 289 ANMPG-IWD---KYWGYIFKQDIAPVWLGEFGTTLQ--PAVDQKWLSAL 331
>gi|329941376|ref|ZP_08290655.1| cellulase [Streptomyces griseoaurantiacus M045]
gi|329299907|gb|EGG43806.1| cellulase [Streptomyces griseoaurantiacus M045]
Length = 516
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 75/169 (44%), Gaps = 23/169 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A+ + G V+G+ L NE P DW + AV + NP++
Sbjct: 181 WIADLKSIASRYAGQDTVIGIDLHNEPHDPACWGCGDQATDWRLAAERAGNAVLSVNPDL 240
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
LV + G+ + +L+ V L +LV+ AH Y + Q+W + P
Sbjct: 241 LVFVEGVQTVNGVSGWWGGNLTGAGQYPVRLDVPNRLVYSAHDYATSVAQQSWFEDASFP 300
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
+ + G + D + G+L +Q P++V EFG L+ + D ++L
Sbjct: 301 DNMPG-IWD---KYWGYLFKQNIAPVWVGEFGTTLQ--STTDQKWLAAL 343
>gi|444909402|ref|ZP_21229593.1| Endoglucanase E1 precursor (Endo-1,4-beta-glucanase E1) Cellulase
E1 [Cystobacter fuscus DSM 2262]
gi|444720351|gb|ELW61135.1| Endoglucanase E1 precursor (Endo-1,4-beta-glucanase E1) Cellulase
E1 [Cystobacter fuscus DSM 2262]
Length = 570
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 82/188 (43%), Gaps = 26/188 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPKQ----NV-KDWYRYMQLGAEAVHAANPEV 88
WI +A + G VVG+ L NE G N+ DW + A+ NP++
Sbjct: 354 WIDDWKMLAQRYKGNPTVVGVDLHNEPHGRATWGDGNLDTDWRLAAERAGNAILGVNPDL 413
Query: 89 LVILSGLNFDK--------DLSFVRNQAVNLTFTGKLVFEAHWYGFTD-GQAWVD----- 134
L+I+ G+ + +L R+ V L G++V+ AH Y + GQ W
Sbjct: 414 LIIVEGIEIYQNNWYWWGGNLRGARDYPVRLNVPGRVVYSAHDYPESVYGQPWFQNKGST 473
Query: 135 GNPNQVCGRVVDNVMRLSGFLL-EQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELD 193
G P + G V D GFL+ E P+++ EFG L + +D +++ G A
Sbjct: 474 GYPANLPG-VWDATW---GFLVKENRAPVWLGEFGTKLLID--SDKQWIQTLTGYLANNG 527
Query: 194 WDWALWTL 201
+A W+L
Sbjct: 528 MSFAFWSL 535
>gi|297200481|ref|ZP_06917878.1| glycoside hydrolase [Streptomyces sviceus ATCC 29083]
gi|297147663|gb|EDY53640.2| glycoside hydrolase [Streptomyces sviceus ATCC 29083]
Length = 528
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 73/169 (43%), Gaps = 23/169 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +AT + G VVG+ L NE P DW Q AV NP++
Sbjct: 194 WIANLKALATRYKGQDTVVGIDLHNEPHDPACWGCGDTSVDWRLAAQRAGNAVLGVNPDL 253
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
L+ + G+ + +L V V L+ ++V+ AH Y + Q+W P
Sbjct: 254 LIFVEGVQTFNGVSGWWGGNLMGVAQYPVQLSVANRVVYSAHDYATSVAQQSWFSDPSFP 313
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
+ G + D + G++ +Q P++V EFG L+ + D ++L
Sbjct: 314 ANMPG-IWD---KYWGYIFKQNIAPVWVGEFGTTLQ--STVDQKWLAAL 356
>gi|225155020|ref|ZP_03723516.1| Endoglucanase-like protein [Diplosphaera colitermitum TAV2]
gi|224804190|gb|EEG22417.1| Endoglucanase-like protein [Diplosphaera colitermitum TAV2]
Length = 531
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 105/289 (36%), Gaps = 69/289 (23%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VI+D H SKPG + S ++ F + + TI ++ + + +
Sbjct: 270 VIIDLH-SKPGGNKTPSKTTRMLMEKEFADEF----VAIWETIVRRHKDHPALYAYDLVN 324
Query: 62 GPKQ------NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTG 115
P Q V DWY A A+ A +P+ ++ FD +F + V+L
Sbjct: 325 EPVQYMPSPAGVDDWYALQVRAAHAIRAIDPKTPIMFETEYFDSPWAFAWLKPVDLP--- 381
Query: 116 KLVFEAHWY--------------GFTDGQAWVDGN--PNQVCGRVVD------NVMRLSG 153
++++AH Y G DG+ + G P + G D +V +
Sbjct: 382 NIIYQAHVYFPHTYTHQGVKTDQGLADGKNYAAGQRYPGVISGTHYDKESLRKHVQPIRD 441
Query: 154 FLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIG 213
F L G P+F+ EF A A W + Y+ + I
Sbjct: 442 FQLATGAPIFIGEFSA---------------------------ARWAPGAAQYI-DDCIS 473
Query: 214 LNEYYGLFDWNWCDIRNSSF--LERI--SSLQSPFRGPGVFETGLHKVI 258
L E YG +DW + R S+ +E + S P R P T KV+
Sbjct: 474 LFEEYG-WDWTYHAFRESTIWDVEHVDDSPRAKPVRAPADQPTERQKVL 521
>gi|146197301|dbj|BAF57414.1| putative glycosyl hydrolase family5 [uncultured symbiotic protist
of Mastotermes darwiniensis]
Length = 369
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 46/200 (23%), Positives = 80/200 (40%), Gaps = 39/200 (19%)
Query: 31 PDLWIKGLTKMATIFNGVRNVVGMSLRNELRGP-----------KQNVKDWYRYMQLGAE 79
P+ + L +A F +G+ ++NE G +N +W +++ A
Sbjct: 149 PEYILSALEWIADYFKDEPAFIGIDVKNEPHGTCDDPTAAHWDGTKNDNNWKYFVETAAA 208
Query: 80 AVHAANPEVLVILSGLNFDK--------DLSFVRNQAVNL-TFTGKLVFEAHWYGFTDGQ 130
+HA NP++L+ + G+ K L+ V++ + L T+ KLV+ H YG +
Sbjct: 209 RIHAKNPKLLIFVEGIECYKGVEGWWGGQLAAVKDYPIKLGTYQNKLVYSPHDYGPS--- 265
Query: 131 AWVDGNPNQVCGRVVDNVMRLSGFLLEQGW---------PLFVSEFGADLRGNNVNDNRY 181
NP Q R L + E W P+F+ E+G L N N +
Sbjct: 266 ----VNPKQTWLRDNMTYDSLMAEVWEPQWLFIHENSIAPIFIGEWGGHLEKRN---NLW 318
Query: 182 LNCFFGVAAELDWDWALWTL 201
+ F + A+ + W L
Sbjct: 319 MGPFVQLIAKYKLSFTHWCL 338
>gi|256393355|ref|YP_003114919.1| glycoside hydrolase family protein [Catenulispora acidiphila DSM
44928]
gi|256359581|gb|ACU73078.1| glycoside hydrolase family 5 [Catenulispora acidiphila DSM 44928]
Length = 608
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 64/155 (41%), Gaps = 21/155 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +AT + G VVG+ L NE P DW + G AV + NP +
Sbjct: 186 WINDLKAIATRYQGNPAVVGIDLHNEPHDPACWGCGDTTIDWRLAAERGGNAVLSVNPSL 245
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFT-DGQAWVDGN--P 137
L+ + G+ + +L V L+ ++V+ AH Y + Q W P
Sbjct: 246 LIFVEGVQTFNGSSYWWGGNLQGAGQYPVQLSVANRVVYSAHDYATSVASQPWFTDPSFP 305
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADL 171
+ + G N G+L Q P++V EFG L
Sbjct: 306 SNMAGIWDKNW----GYLFNQNIAPVWVGEFGTTL 336
>gi|302552263|ref|ZP_07304605.1| glycoside hydrolase [Streptomyces viridochromogenes DSM 40736]
gi|302469881|gb|EFL32974.1| glycoside hydrolase [Streptomyces viridochromogenes DSM 40736]
Length = 498
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 69/156 (44%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG+ L NE P +DW Q AV + NP++
Sbjct: 162 WITNLKSLAARYKGNPTVVGIDLHNEPHDPACWGCGDTTRDWRLAAQRAGNAVLSVNPDL 221
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVD--GNP 137
L+++ G+ + +L V V L +LV+ AH Y + Q W P
Sbjct: 222 LIMVEGVQSYNGTNGWWGGNLMGVAQYPVQLDVPNRLVYSAHDYATSVAQQPWFSDPAFP 281
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
+ + G + D + G++ +Q P+++ EFG L+
Sbjct: 282 SNLPG-IWD---KYWGYVFKQNIAPVWLGEFGTTLQ 313
>gi|255084097|ref|XP_002508623.1| glycoside hydrolase family 5 protein [Micromonas sp. RCC299]
gi|226523900|gb|ACO69881.1| glycoside hydrolase family 5 protein [Micromonas sp. RCC299]
Length = 593
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 8/68 (11%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDL--------WIKGLTKMATIFNGVRNVVG 53
VILDNH++ WCC +D +G + D+ W+ LTK++ + G NVV
Sbjct: 229 VILDNHMTVADWCCQRADCDGLWYHSTAMGDVNSTGTERAWLAALTKISARYVGAANVVA 288
Query: 54 MSLRNELR 61
+ L+NE R
Sbjct: 289 VELKNEPR 296
>gi|294629964|ref|ZP_06708524.1| endoglucanase E1 [Streptomyces sp. e14]
gi|292833297|gb|EFF91646.1| endoglucanase E1 [Streptomyces sp. e14]
Length = 540
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 74/169 (43%), Gaps = 23/169 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +AT + G V+G+ L NE P DW + +AV + NP +
Sbjct: 200 WIADLKALATRYQGQDTVIGIDLHNEPHDPACWGCGDQATDWRLAAERAGDAVLSVNPNL 259
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
L+ + G+ + +L V V L ++V+ AH Y + Q+W P
Sbjct: 260 LIFVEGVQTVDGVSGWWGGNLMGVGQYPVRLDVANRVVYSAHDYATSVAQQSWFSDPAFP 319
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCF 185
+ G V D + G++ +Q P++V EFG L+ + D ++L+
Sbjct: 320 ANMPG-VWD---KYWGYVFKQNIAPVWVGEFGTTLQ--STVDQKWLSAL 362
>gi|298251924|ref|ZP_06975727.1| glycoside hydrolase family 5 [Ktedonobacter racemifer DSM 44963]
gi|297546516|gb|EFH80384.1| glycoside hydrolase family 5 [Ktedonobacter racemifer DSM 44963]
Length = 614
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 64/259 (24%), Positives = 97/259 (37%), Gaps = 56/259 (21%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ILD H +PG + ++ QY WI +A + V+G L NE R
Sbjct: 181 IILDRH--RPG---ATQQEPLWYTAQYPE-SRWIADWQMLAQHYKNNSMVIGADLHNEPR 234
Query: 62 GPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGL------NFDK----------- 99
P DW + Q A+ + NP L+ + G+ N DK
Sbjct: 235 APACWGCGDPKLDWQQAAQRAGNAILSVNPNWLIFIEGVECYPDPNADKQVPGNCSWWGS 294
Query: 100 DLSFVRNQAVNLTFTGKLVFEAHWYGFT-DGQAWV-DGNPNQVCGRVVDNVMRLSGFLLE 157
+L V V L +LV+ H Y + Q W D N Q + D+ G++ +
Sbjct: 295 NLKGVATHPVQLQVAHRLVYSVHDYPVSVYPQLWFKDPNFPQNLSAIWDSYW---GYIQK 351
Query: 158 QG-WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNE 216
QG P++V EFG L+ N D ++L YL GV G+N
Sbjct: 352 QGIAPVWVGEFGTRLQ--NAQDKQWLTSLVS------------------YLGRGVNGIN- 390
Query: 217 YYGLFDWNWCDIRNSSFLE 235
+ + WN + L+
Sbjct: 391 -WTFWSWNPDSVDTGGLLQ 408
>gi|424887242|ref|ZP_18310847.1| endoglucanase [Rhizobium leguminosarum bv. trifolii WSM2012]
gi|393175014|gb|EJC75057.1| endoglucanase [Rhizobium leguminosarum bv. trifolii WSM2012]
Length = 501
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 74/187 (39%), Gaps = 31/187 (16%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V+LD+H G S G Y + W MA + G NVVG + NE
Sbjct: 237 VVLDHHRRTAGVGADGSP----VGIGYTLAN-WKASWAIMAKRYGGRINVVGADVHNEPH 291
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF--------VRNQAVNLTF 113
N W Y++ +H P+ ++ G+ D +F V + V LT
Sbjct: 292 --DLNWATWAGYVEECGNQIHTIAPDWIIFCEGVGSDASGAFWWGGALGGVATRPVVLTH 349
Query: 114 TGKLVFEAHWYGFT---------DGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQG-WPL 162
+G+L + H YG + DGQ G PN + V R GF+ EQ P+
Sbjct: 350 SGRLAYSPHEYGQSVGVQSWLAYDGQTAPAGWPNNLYA-----VWRAHWGFIFEQNIAPI 404
Query: 163 FVSEFGA 169
+V EFG
Sbjct: 405 WVGEFGG 411
>gi|297623672|ref|YP_003705106.1| glycoside hydrolase [Truepera radiovictrix DSM 17093]
gi|297164852|gb|ADI14563.1| glycoside hydrolase family 5 [Truepera radiovictrix DSM 17093]
Length = 335
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 79/182 (43%), Gaps = 22/182 (12%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+V+L H P + G F GD Y D W+ L +A + NV+G+ L NE
Sbjct: 102 LVLLSFHTYDPNRLGHDLPGRPF-GDGYTKAD-WLADLRTLAELAKRYPNVLGIDLCNE- 158
Query: 61 RGPKQNVKDWYRYMQLGAEA---VHAANPEVLVILSGL-NFDKDLSFVRNQAVNLTFT-- 114
+ W + +L AEA V NP++LV + G+ N + + N NLT
Sbjct: 159 ----PHKLTWRAWAKLAAEAGREVLKTNPDLLVFVEGVANASDNGGYGANWGENLTRAGE 214
Query: 115 ------GKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQGWPLFVSEF 167
GKLV+ H YG + G P + ++ + G L+++G+ L + EF
Sbjct: 215 LPGIPGGKLVYSPHVYGPSVHHKPYFGAPEFPAN--LPHIWDVHFGHLVDKGYTLVIGEF 272
Query: 168 GA 169
G
Sbjct: 273 GG 274
>gi|357502209|ref|XP_003621393.1| hypothetical protein MTR_7g012850 [Medicago truncatula]
gi|355496408|gb|AES77611.1| hypothetical protein MTR_7g012850 [Medicago truncatula]
Length = 100
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 4/75 (5%)
Query: 310 GKPAKLGIICTDCGSTWEIISDSKMHLSSKADNGTTVCLDVDSSNT--IVTNTCKCLSRD 367
G K+ C S W+ +S + +HL + + G +CL +S+ + IVT C C+ +
Sbjct: 22 GSQVKVSNDCKSKQSFWKTLSATNLHLGTLDEQGQNLCLQRESTTSPKIVTKKCICIDDN 81
Query: 368 KTC--DPASQWFKLV 380
C DP SQWF+LV
Sbjct: 82 PACLDDPQSQWFQLV 96
>gi|332705490|ref|ZP_08425568.1| endoglucanase [Moorea producens 3L]
gi|332355850|gb|EGJ35312.1| endoglucanase [Moorea producens 3L]
Length = 767
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 18/120 (15%)
Query: 23 FFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR-----GPKQNVKDWYRYMQLG 77
++GD + D WI+ +AT + NV+G L+NE G DW Q
Sbjct: 210 WYGDGFTETD-WIETWKMLATRYRNQPNVIGADLKNEPHGRASWGTNDLATDWRLAAQRA 268
Query: 78 AEAVHAANPEVLVILSGLNFD------------KDLSFVRNQAVNLTFTGKLVFEAHWYG 125
A+ A NP L+++ G+ + +L VR V L+ KLV+ H YG
Sbjct: 269 GNAILAVNPNWLIVVEGVEKNVPRQRLATHWHGGNLEGVREYPVRLSNANKLVYSPHEYG 328
>gi|330722943|gb|AEC45566.1| cellulase precursor [uncultured organism]
Length = 385
Score = 45.8 bits (107), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 4/99 (4%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V+LD H P G F D Y D W+ L +MA + ++ G+ L NE
Sbjct: 152 VLLDFHTYDPNRIGGKLPGRPF-ADGYTQAD-WLADLRRMAELSREFPHIFGVDLCNEPY 209
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKD 100
++W R + GAEAV + NP VLVI+ G+ D
Sbjct: 210 --ALTWREWKRLAREGAEAVLSVNPSVLVIVEGVGNASD 246
>gi|374988645|ref|YP_004964140.1| glycoside hydrolase [Streptomyces bingchenggensis BCW-1]
gi|297159297|gb|ADI09009.1| glycoside hydrolase family protein [Streptomyces bingchenggensis
BCW-1]
Length = 424
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 56/141 (39%), Gaps = 25/141 (17%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MV+LD H +P + + Q W+ T++A + V+G L NE
Sbjct: 175 MVLLDQH--RPDQYGQSE----LWYSQNLTEKQWVDDWTQLAKRYKDNPRVIGADLHNEP 228
Query: 61 RG--------PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDK--------DLSFV 104
RG PK DW + A+HA N L+ + G + K DL V
Sbjct: 229 RGQATWGDGNPKT---DWRLAAEKAGNAIHAVNKNWLIFVEGTDRHKNEQFWWGGDLQGV 285
Query: 105 RNQAVNLTFTGKLVFEAHWYG 125
+ V L K+V+ AH YG
Sbjct: 286 KKYPVKLKEPNKVVYSAHDYG 306
>gi|357393773|ref|YP_004908614.1| putative endoglucanase precursor [Kitasatospora setae KM-6054]
gi|311900250|dbj|BAJ32658.1| putative endoglucanase precursor [Kitasatospora setae KM-6054]
Length = 636
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 66/156 (42%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
W+ L ++ + V+G+ L NE P DW Q G EAV +ANP++
Sbjct: 209 WLANLKTISARYANNSAVIGIDLHNEPHDPACWGCGDLATDWRLAAQRGGEAVLSANPKL 268
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
LV + G+ + +L V L+ ++V+ AH Y + Q W P
Sbjct: 269 LVFVEGVQSFNGSSYWWGGNLQGAGQYPVQLSVPNRVVYSAHDYATSVAQQTWFSDPSFP 328
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
+ + G N G+L Q P++V EFG L+
Sbjct: 329 SNMPGVWDKNW----GYLFNQNIAPVWVGEFGTTLQ 360
>gi|452004268|gb|EMD96724.1| glycoside hydrolase family 5 protein [Cochliobolus heterostrophus
C5]
Length = 482
Score = 45.4 bits (106), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 49/213 (23%), Positives = 84/213 (39%), Gaps = 22/213 (10%)
Query: 2 VILDNHISKPG----WCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLR 57
V+LD H G W C + F D D I+ +A +NG V G +L
Sbjct: 136 VVLDLHAVPGGQNQDWHCDSGLNKALFWDFRVFQDQAIELWKAIAAHYNGNPVVCGYNLL 195
Query: 58 NELRGPKQ-NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGK 116
NE P+ + WY + + + A +PE ++ + G + D ++ +N N +
Sbjct: 196 NEPADPQHVRLISWY---ERAEKEIRAIDPETMLFIDGNTYAMDFTYFQNVLPNSVYA-- 250
Query: 117 LVFEAHWYGF-TDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFG------- 168
+ +GF GQ +G+ ++ R F+ E+ P++ EFG
Sbjct: 251 -CHDYAMFGFPIPGQPLYNGSEEHKT-KLKRQFDRKVFFMREKNVPIWNGEFGPVYANEH 308
Query: 169 ADLRGNNVNDNRY--LNCFFGVAAELDWDWALW 199
D +N RY L + A+ W++W
Sbjct: 309 TDPEATEINSARYHMLQDQLKIYAQDQVSWSIW 341
>gi|427737620|ref|YP_007057164.1| endoglucanase [Rivularia sp. PCC 7116]
gi|427372661|gb|AFY56617.1| endoglucanase [Rivularia sp. PCC 7116]
Length = 430
Score = 45.1 bits (105), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 58/142 (40%), Gaps = 23/142 (16%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+++LD+H K G G+GF + WI +A + NV+G L+NE
Sbjct: 190 LILLDSHCLKDGHISEVWYGDGFTEED------WINTWILLAKRYKNQPNVIGADLKNEP 243
Query: 61 R-----GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLN------------FDKDLSF 103
G DW +Q + + NP+ L+++ G+ + +L
Sbjct: 244 HGRASWGTDDTSTDWRLAVQKAGDKILDINPDWLIVVEGIEKNVPNQIQHGYFWGANLEG 303
Query: 104 VRNQAVNLTFTGKLVFEAHWYG 125
VR V+L KLV+ H YG
Sbjct: 304 VREYPVHLKKPSKLVYSPHEYG 325
>gi|391231506|ref|ZP_10267712.1| endoglucanase [Opitutaceae bacterium TAV1]
gi|391221167|gb|EIP99587.1| endoglucanase [Opitutaceae bacterium TAV1]
Length = 534
Score = 44.7 bits (104), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 51/218 (23%), Positives = 80/218 (36%), Gaps = 58/218 (26%)
Query: 67 VKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFEAHWY-- 124
V DWY A A+ A +P+ ++ FD +F + V+L ++++AH Y
Sbjct: 339 VDDWYALQVRAARAIRAIDPKTPIMFETEYFDSPWAFAWLKPVDLP---NIIYQAHVYFP 395
Query: 125 ------------GFTDGQAWVDGN--PNQVCGRVVD------NVMRLSGFLLEQGWPLFV 164
G DG+ + G P + G D +V + F L G P+F+
Sbjct: 396 HTYTHQGVITDQGLADGKNYAAGQRYPGIISGAHYDKESLRKHVQPIRDFQLATGAPIFI 455
Query: 165 SEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWN 224
EF A A W + Y+ + I L E YG ++W
Sbjct: 456 GEFSA---------------------------ARWAPGAAQYI-DDCISLFEEYG-WNWT 486
Query: 225 WCDIRNSSF--LERI--SSLQSPFRGPGVFETGLHKVI 258
+ R S+ +E + S P R P T KV+
Sbjct: 487 YHAFRESTIWDVEHVDDSPRAKPVRTPADQPTERQKVL 524
>gi|428312591|ref|YP_007123568.1| endoglucanase [Microcoleus sp. PCC 7113]
gi|428254203|gb|AFZ20162.1| endoglucanase [Microcoleus sp. PCC 7113]
Length = 534
Score = 44.7 bits (104), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 48/207 (23%), Positives = 76/207 (36%), Gaps = 42/207 (20%)
Query: 23 FFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR-----GPKQNVKDWYRYMQLG 77
++GD + D WI +AT + N++G L+NE G + DW +
Sbjct: 163 WYGDGFTETD-WIDTWKMLATRYKNYPNIIGADLKNEPHGRASWGTNNSETDWRLAAERA 221
Query: 78 AEAVHAANPEVLVILSGLN------------FDKDLSFVRNQAVNLTFTGKLVFEAHWYG 125
A+ A NP L+++ G+ +L V V L+ + KLV+ H YG
Sbjct: 222 GNAILAINPNWLIVVEGVEKNVPGQKLAAHWMGANLEGVEKFPVRLSKSNKLVYSPHEYG 281
Query: 126 F-TDGQAWVDGN--PNQVCGRVVDNVMRLSGFLLEQGW---------PLFVSEFGADLRG 173
Q W + PN + R E GW P+ V EFG
Sbjct: 282 AGVYNQPWFNEPSFPNNLESR------------WETGWNYIATKGIAPILVGEFGGRQVD 329
Query: 174 NNVNDNRYLNCFFGVAAELDWDWALWT 200
+ + + F + D +A W+
Sbjct: 330 TSSKEGIWQRKFVDYIRQKDLSFAYWS 356
>gi|392378287|ref|YP_004985447.1| endoglucanase [Azospirillum brasilense Sp245]
gi|356879769|emb|CCD00695.1| endoglucanase [Azospirillum brasilense Sp245]
Length = 825
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 59/136 (43%), Gaps = 15/136 (11%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
+ILD+H S G ++++ NG + + I+ +AT + G V+G L NE
Sbjct: 418 IILDHHRSGDG---ASANENGLWYTSQYPESKMIENWKMLATRYKGNDAVIGADLHNEPH 474
Query: 62 GPKQ----NVKDWYRYMQLGAEAVHAANPEVLVILSGLN--------FDKDLSFVRNQAV 109
P DW R + A+ + N + L+I+ G+ + +L +N V
Sbjct: 475 NPATWGDGGPNDWARAAERIGNAIQSVNKDWLLIVEGIETYQNQWYWWGGNLLGEKNYEV 534
Query: 110 NLTFTGKLVFEAHWYG 125
GKLV+ H YG
Sbjct: 535 KFNEPGKLVYSVHDYG 550
>gi|148273627|ref|YP_001223188.1| cellulase [Clavibacter michiganensis subsp. michiganensis NCPPB
382]
gi|147831557|emb|CAN02524.1| putative secreted cellulase containing cellulose-binding domain
(endo-1,4-beta-glucanase) [Clavibacter michiganensis
subsp. michiganensis NCPPB 382]
Length = 535
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 72/171 (42%), Gaps = 24/171 (14%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPKQ-----NVKDWYRYMQLGAEAVHAANPEV 88
WI +A + V+G+ L NE G DW + G +AV + NP++
Sbjct: 184 WIADWKMLADRYKDDPTVIGVDLHNEPHGQASWGTGDTATDWRLAAERGGDAVLSVNPKL 243
Query: 89 LVILSGLNFDKD---------LSFVRNQAVNLTFTGKLVFEAHWYGFTD-GQAWVDGN-- 136
LVI+ G + D L ++ V L+ ++V+ H Y + Q+W
Sbjct: 244 LVIVEGTDKQPDGSGTWWGGALGAAGDKPVELSVPNRVVYSPHDYPASIYAQSWFSSPDY 303
Query: 137 PNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNVNDNRYLNCFF 186
PN + G V D G+L ++ P+ + EFG L +D ++L+
Sbjct: 304 PNNLPG-VWDAHW---GYLAKKSIAPVLLGEFGTKLE--TTSDKQWLSTLV 348
>gi|71041793|pdb|1VRX|A Chain A, Endocellulase E1 From Acidothermus Cellulolyticus Mutant
Y245g
gi|71041794|pdb|1VRX|B Chain B, Endocellulase E1 From Acidothermus Cellulolyticus Mutant
Y245g
Length = 358
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 62/156 (39%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG L NE P DW + AV + NP +
Sbjct: 137 WISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNL 196
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDG-QAWVDGN--P 137
L+ + G+ + +L V L +LV+ AH Y + G Q W P
Sbjct: 197 LIFVEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDYATSVGPQTWFSDPTFP 256
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
N + G N G+L Q P+++ EFG L+
Sbjct: 257 NNMPGIWNKNW----GYLFNQNIAPVWLGEFGTTLQ 288
>gi|424883299|ref|ZP_18306927.1| endoglucanase [Rhizobium leguminosarum bv. trifolii WU95]
gi|392514960|gb|EIW39693.1| endoglucanase [Rhizobium leguminosarum bv. trifolii WU95]
Length = 419
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 75/183 (40%), Gaps = 23/183 (12%)
Query: 2 VILDNHISKPGWCCSNSD-GNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
V+LD+H G S G G+ + W MA + NVVG + NE
Sbjct: 155 VVLDHHRRTAGVGADGSPVGTGYTLAK------WKSSWAIMANRYRDKINVVGADVHNEP 208
Query: 61 RGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF--------VRNQAVNLT 112
N W Y++ +H P+ ++ G+ D +F V + V LT
Sbjct: 209 H--DLNWATWAGYVEECGNQIHTIAPDWIIFCEGVGSDASGAFWWGGALGGVATRPVVLT 266
Query: 113 FTGKLVFEAHWYGFTDG-QAWVDGNPNQVCGRVVDNVMRL----SGFLLEQG-WPLFVSE 166
+G+L + H YG + G Q+W+ + +N+ + GF+ EQ P++V E
Sbjct: 267 RSGRLAYSPHEYGQSVGVQSWLAYDGQTAPAGWPNNLYAVWQAHWGFIFEQNIAPIWVGE 326
Query: 167 FGA 169
FG
Sbjct: 327 FGG 329
>gi|396493030|ref|XP_003843939.1| hypothetical protein LEMA_uP015900.1 [Leptosphaeria maculans JN3]
gi|312220519|emb|CBY00460.1| hypothetical protein LEMA_uP015900.1 [Leptosphaeria maculans JN3]
Length = 92
Score = 43.5 bits (101), Expect = 0.18, Method: Composition-based stats.
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 7/81 (8%)
Query: 162 LFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLR-------EGVIGL 214
+ VSE+G G N Y C + E W W L GS+YL+ E + GL
Sbjct: 1 MVVSEWGFIQNGKYWNQTTYARCLVEMVKEYQVSWQHWELSGSFYLQTRPNRKPETIQGL 60
Query: 215 NEYYGLFDWNWCDIRNSSFLE 235
+E +GL + +W +R+ +E
Sbjct: 61 DEAWGLLNHDWTAVRSPITVE 81
>gi|320592055|gb|EFX04494.1| cellulase family protein [Grosmannia clavigera kw1407]
Length = 202
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 78/193 (40%), Gaps = 30/193 (15%)
Query: 49 RNVVGMSLRNELRG--PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN 106
+ V+G LRNE+RG + + W + + + A NP L+++ G DLS V
Sbjct: 8 KRVIGADLRNEVRGLWGTMSWERWATAAEHCGKRLLALNPAWLIVVGGTGSGNDLSGVAA 67
Query: 107 QAVNLTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGW------ 160
+ V L + ++V+ H YG++ + GR V R ++ W
Sbjct: 68 RPVQLQVSHRVVYSVHVYGWS--------GWGSLGGRFVQRPYRSFAAAMQSAWSYLLES 119
Query: 161 --------PLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVI 212
P++V+EFGA R +V D Y A +D D+ W +
Sbjct: 120 ETGDSGPVPVWVAEFGAPHR-PSVGDVCYWQHLLRFLAAVDADFGYWAVNP-----RKTD 173
Query: 213 GLNEYYGLFDWNW 225
G E Y L D +W
Sbjct: 174 GSEETYKLVDDDW 186
>gi|402495575|ref|ZP_10842299.1| glycoside hydrolase [Aquimarina agarilytica ZC1]
Length = 521
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 60/148 (40%), Gaps = 29/148 (19%)
Query: 2 VILDNHISKP-GWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
VILDNH P G+ + + + + + WI+ M + V+GM + NE
Sbjct: 166 VILDNHSRNPDGYLVEKT-----WFTESTSEEQWIEDWVFMTNRYKDKSAVIGMDINNEP 220
Query: 61 RGPKQNVK----------DWYRYMQLGAEAVHAANPEVLVILSGLNFDK----------- 99
G N K DW + A+ ANP VL+++ G+ +
Sbjct: 221 NGEIDNPKGARWGTGDKYDWRLAAEKCGNAILEANPNVLIMVEGIEAYRKPNGDLTSYWW 280
Query: 100 --DLSFVRNQAVNLTFTGKLVFEAHWYG 125
+L VR+ V L+ KL++ H YG
Sbjct: 281 GGNLQGVRDFPVRLSDPSKLMYSPHEYG 308
>gi|381396970|ref|ZP_09922384.1| glycoside hydrolase family 5 [Microbacterium laevaniformans OR221]
gi|380775929|gb|EIC09219.1| glycoside hydrolase family 5 [Microbacterium laevaniformans OR221]
Length = 540
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 77/203 (37%), Gaps = 30/203 (14%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD H +S + + ++ WI +A + VVG L NE
Sbjct: 146 VILDRHR------IESSGQSELWYTSTYSEKRWIDDWVMLANRYRDDPTVVGADLHNEPH 199
Query: 62 GPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF------VRNQA-- 108
G +DW AV AANP +L+++ G+ D S +R+
Sbjct: 200 GSACWGCGDASRDWQAAATRAGNAVLAANPRLLIVVEGVERQNDASTTWWGGGLRDAGAT 259
Query: 109 -VNLTFTGKLVFEAHWY-GFTDGQAWVDGN--PNQVCGRVVDNVMRLSGFLLEQG-WPLF 163
V L ++V+ H Y GQ+W + P + N G+L G P+F
Sbjct: 260 PVVLATPNRVVYSPHDYPASVYGQSWFSASNYPANLPSVWDANW----GYLARTGTAPIF 315
Query: 164 VSEFGADLRGNNVNDNRYLNCFF 186
V EFG L +D ++L
Sbjct: 316 VGEFGTKLE--TTSDAQWLKSLV 336
>gi|375006300|ref|YP_004975084.1| endoglucanase (Cellulase, modular protein) [Azospirillum lipoferum
4B]
gi|357427558|emb|CBS90502.1| endoglucanase (Cellulase, modular protein) [Azospirillum lipoferum
4B]
Length = 788
Score = 42.7 bits (99), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 57/234 (24%), Positives = 98/234 (41%), Gaps = 29/234 (12%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD-LWIKGLTKMATIFNGVRNVVGMSLRNEL 60
VILD+H + G +N G ++ D NP+ I+ +A + G V+G L NE
Sbjct: 441 VILDHHRNDGG-AGTNEHGL-WYTDN--NPESKVIENWKMLAKHYAGNEAVIGADLHNEP 496
Query: 61 RGPK----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGK 116
DW + A+ + N + L+++ G + L+ V+++ + K
Sbjct: 497 SVSATWGGNAATDWASAAERIGNAIQSVNKDWLMLVEGTEWSSTLAGVKDRPIEFDVPNK 556
Query: 117 LVFEAHWYGFTDGQ-AWVD--GNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
LV+ H YG + G +W++ PN + + + G++ + P+ + EFG +
Sbjct: 557 LVYSPHAYGHSVGNFSWLNDPAYPNNLPAQ----YDSMWGYIYKNNIAPVLIGEFGGQM- 611
Query: 173 GNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWC 226
N+ + + N F DWD L G L G G+N Y W W
Sbjct: 612 -NSAAEQTWANAFVKY-MNGDWD-----LDGKSDLPAGQQGMNWTY----WAWT 654
>gi|332981694|ref|YP_004463135.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
gi|332699372|gb|AEE96313.1| glycoside hydrolase family 5 [Mahella australiensis 50-1 BON]
Length = 586
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 57/144 (39%), Gaps = 27/144 (18%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPD-----LWIKGLTKMATIFNGVRNVVGMSL 56
+ILDNH G G+ +Q + D WI +A + NVVG L
Sbjct: 143 IILDNHSRAAG---------GYMNEQLWYTDSYSEQKWISDWVMLAERYKNNTNVVGFDL 193
Query: 57 RNELR-----GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDK--------DLSF 103
NE G + +W +A+ NP+ L+++ G+ K +LS
Sbjct: 194 NNEPHGIATWGEGNSSTNWNEAASRCGQAILKVNPKALIMIEGVEKYKGDSYWWGGNLSG 253
Query: 104 VRNQAVNLTFTGKLVFEAHWYGFT 127
V++ + G LV+ H YG T
Sbjct: 254 VKDYPITGIPAGNLVYSPHDYGPT 277
>gi|182412911|ref|YP_001817977.1| glycoside hydrolase [Opitutus terrae PB90-1]
gi|177840125|gb|ACB74377.1| glycoside hydrolase family 5 [Opitutus terrae PB90-1]
Length = 566
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 18/65 (27%), Positives = 34/65 (52%), Gaps = 3/65 (4%)
Query: 69 DWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTD 128
+W R M+ + AANP V+ +++G ++ DL+ +R + +N ++ + H Y
Sbjct: 313 EWKRIMETQITMIRAANPHVIPVVAGFDWAYDLTPLRQEPINAE---RIAYAVHPYANKR 369
Query: 129 GQAWV 133
Q WV
Sbjct: 370 PQPWV 374
>gi|345016157|ref|YP_004818511.1| glycoside hydrolase [Streptomyces violaceusniger Tu 4113]
gi|344042506|gb|AEM88231.1| glycoside hydrolase family 5 [Streptomyces violaceusniger Tu 4113]
Length = 425
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 52/140 (37%), Gaps = 25/140 (17%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD H +P + + Q W+ K+A + V+G L NE R
Sbjct: 179 VILDQH--RPDQYGQSE----LWYSQSLTEKQWLDDWVKLAKRYQNNDRVIGADLHNEPR 232
Query: 62 G--------PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDK--------DLSFVR 105
G PK DW A+H N L+ + G + K DL V+
Sbjct: 233 GQATWGDGNPKT---DWQLASTKAGNAIHKVNKNWLIFVEGTDRHKNEQFWWGGDLQGVK 289
Query: 106 NQAVNLTFTGKLVFEAHWYG 125
V L K+V+ AH YG
Sbjct: 290 QHPVKLKEPNKVVYSAHDYG 309
>gi|316936065|ref|YP_004111047.1| glycoside hydrolase family protein [Rhodopseudomonas palustris
DX-1]
gi|315603779|gb|ADU46314.1| glycoside hydrolase family 5 [Rhodopseudomonas palustris DX-1]
Length = 344
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 36/160 (22%), Positives = 64/160 (40%), Gaps = 18/160 (11%)
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGK-LVFE 120
G + W ++ V ++NPE +V+++ LN + +++ L T + ++
Sbjct: 163 GGHMTIAQWNSLLRELLATVRSSNPERIVVVAALNTPE----ASIESLELPATDRRIIVT 218
Query: 121 AHWYGFTD----GQAWVD---GNP------NQVCGRVVDNVMRLSGFLLEQGWPLFVSEF 167
H+Y D G W + G P Q +V + R+ F + P+++ EF
Sbjct: 219 FHYYEPFDFTHQGAPWSERLAGLPAREWGSEQDRAKVAADFDRVQRFAAREARPVYLGEF 278
Query: 168 GADLRGNNVNDNRYLNCFFGVAAELDWDWALWTLVGSYYL 207
G R + RYL+ A W WA W + L
Sbjct: 279 GVYERAPRASRLRYLDDVARAAERRGWPWAYWQFDHDFAL 318
>gi|302544326|ref|ZP_07296668.1| cellulase [Streptomyces hygroscopicus ATCC 53653]
gi|302461944|gb|EFL25037.1| cellulase [Streptomyces himastatinicus ATCC 53653]
Length = 399
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 51/138 (36%), Gaps = 19/138 (13%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
MVILD H +P + + Q + W+ K+A + V+G L NE
Sbjct: 150 MVILDQH--RPDQYGQSE----LWYSQSLTENKWLSDWVKLAKRYKSNDRVIGADLHNEP 203
Query: 61 RGPK-----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDK--------DLSFVRNQ 107
RG DW +H N L+ + G + K DL V+
Sbjct: 204 RGQATWGDGNKKTDWRMAATKAGNEIHKVNKNWLIFVEGTDRHKNEQFWWGGDLQGVKKY 263
Query: 108 AVNLTFTGKLVFEAHWYG 125
V L K+V+ AH YG
Sbjct: 264 PVKLDEGNKVVYSAHDYG 281
>gi|300787066|ref|YP_003767357.1| glycoside hydrolase [Amycolatopsis mediterranei U32]
gi|384150408|ref|YP_005533224.1| glycoside hydrolase [Amycolatopsis mediterranei S699]
gi|399538949|ref|YP_006551611.1| glycoside hydrolase [Amycolatopsis mediterranei S699]
gi|299796580|gb|ADJ46955.1| glycoside hydrolase family protein [Amycolatopsis mediterranei U32]
gi|340528562|gb|AEK43767.1| glycoside hydrolase [Amycolatopsis mediterranei S699]
gi|398319719|gb|AFO78666.1| glycoside hydrolase [Amycolatopsis mediterranei S699]
Length = 509
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 48/118 (40%), Gaps = 22/118 (18%)
Query: 29 FNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPK----------QNVKDWYRYMQLGA 78
++ WI T +A + G V+G L NE + DW +
Sbjct: 165 YSESRWISDWTMLAQHYKGNPTVIGADLHNEPHSIQGGGGACWGCGDTATDWRLAAERAG 224
Query: 79 EAVHAANPEVLVILSGLN------------FDKDLSFVRNQAVNLTFTGKLVFEAHWY 124
AV AANP+ L+++ G++ + +LS R V L+ KLV+ AH Y
Sbjct: 225 NAVLAANPDWLIVVEGVDCVSGTGDPQCGWWGGNLSGARQFPVRLSKPDKLVYSAHEY 282
>gi|365858582|ref|ZP_09398505.1| cellulase [Acetobacteraceae bacterium AT-5844]
gi|363713893|gb|EHL97453.1| cellulase [Acetobacteraceae bacterium AT-5844]
Length = 909
Score = 42.0 bits (97), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 54/134 (40%), Gaps = 14/134 (10%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNE-L 60
V+LD H + G + NG + D+ + WI +A + G V+G L NE
Sbjct: 331 VLLDRHRGEAG---DGPNDNGLWYDEVYTEQAWIDDWVMLADRYAGNPTVLGADLSNEPF 387
Query: 61 RGP--KQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF--------VRNQAVN 110
G + DW + A+ ANP L+++ G + K+ + V
Sbjct: 388 NGTWGDGSATDWKAAAERAGNAILEANPNWLIVVEGTGWYKNEGYWWGGNLMGAAEAPVV 447
Query: 111 LTFTGKLVFEAHWY 124
L +LV+ AH Y
Sbjct: 448 LNVANRLVYSAHDY 461
>gi|296270290|ref|YP_003652922.1| family 5 glycoside hydrolase [Thermobispora bispora DSM 43833]
gi|296093077|gb|ADG89029.1| glycoside hydrolase family 5 [Thermobispora bispora DSM 43833]
Length = 456
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 77/197 (39%), Gaps = 18/197 (9%)
Query: 13 WCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPKQNVKDWYR 72
W N FF D I +A + R V G + NE P V +
Sbjct: 141 WHSDNPTHVAFFWQHRHFQDRVINLWEAIADRYKDNRWVAGYNPVNEPGDPTGRVIGPF- 199
Query: 73 YMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFT------GKLVFEAHWYGF 126
Y +L +AV A +P+ ++ L G + D S R N F L + G+
Sbjct: 200 YDRL-VKAVRAVDPDHVLFLDGNTYSTDFSIFREVYENTVFVCHDYALAGLAHGGPYPGY 258
Query: 127 TDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGN-NVNDNRY--LN 183
T G+ W D N + ++ + F E G P++V EFG G+ V++ RY L
Sbjct: 259 TRGE-WCDRNT------LEQTFLKRTQFQRETGTPIWVGEFGPVYTGDPAVDEQRYQVLR 311
Query: 184 CFFGVAAELDWDWALWT 200
+ + W+LWT
Sbjct: 312 DQLEIYDAYNAGWSLWT 328
>gi|146197055|dbj|BAF57291.1| putative glycosyl hydrolase family5 [uncultured symbiotic protist
of Reticulitermes speratus]
Length = 348
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 78/196 (39%), Gaps = 34/196 (17%)
Query: 19 DGNGFFGDQYFN-----PDLWIKGLTKMATIFNGVRNVVGMSLRNELRG----------- 62
D +G DQY + P+ L A F ++G+ ++NE G
Sbjct: 129 DIHGVEPDQYQDATWGKPEYIYTALEWFANEFKSDDTIIGIDIKNEPHGQCDYADKAIWD 188
Query: 63 PKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLN------------FDKDLSFVRNQAVN 110
+W A +HA NP +L+ + G+ + L+FV++ ++
Sbjct: 189 SSTADNNWRSTAATAASRIHAKNPNLLIFVEGIECYNGPRGIESGWWGGVLTFVKDLPLD 248
Query: 111 L-TFTGKLVFEAHWYGFTD-GQAWVDGNPNQVCGRVVDNVMRLS-GFLLEQG-WPLFVSE 166
L + KLV+ H YG T Q W NP + ++ + S F+ EQ PL E
Sbjct: 249 LGSHQDKLVYSPHEYGPTVFNQTWF--NPTFTYDSIYNDHWKDSWMFIHEQDIAPLLFGE 306
Query: 167 FGADLRGNNVNDNRYL 182
+G L G N + Y+
Sbjct: 307 WGGKLEGTNTDWMTYM 322
>gi|1827681|pdb|1ECE|A Chain A, Acidothermus Cellulolyticus Endocellulase E1 Catalytic
Domain In Complex With A Cellotetraose
gi|1827682|pdb|1ECE|B Chain B, Acidothermus Cellulolyticus Endocellulase E1 Catalytic
Domain In Complex With A Cellotetraose
Length = 358
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 61/156 (39%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG L NE P DW + AV + NP +
Sbjct: 137 WISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNL 196
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFT-DGQAWVDGN--P 137
L+ + G+ + +L V L +LV+ AH Y + Q W P
Sbjct: 197 LIFVEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFP 256
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
N + G N G+L Q P+++ EFG L+
Sbjct: 257 NNMPGIWNKNW----GYLFNQNIAPVWLGEFGTTLQ 288
>gi|383779069|ref|YP_005463635.1| putative glycosyl hydrolase [Actinoplanes missouriensis 431]
gi|381372301|dbj|BAL89119.1| putative glycosyl hydrolase [Actinoplanes missouriensis 431]
Length = 660
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 95/245 (38%), Gaps = 55/245 (22%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYF----NPDLWIKGLTKMATIFNGVRNVVGMSLR 57
V+LD H + +D +G ++ P+L+ + + + +V M L+
Sbjct: 168 VLLDVH-------SAEADNSGHIHPVWYKGTITPELYYQAWEWVTARYKANDTIVAMDLK 220
Query: 58 NELRG-----PKQ------NVKDWYRYMQLGAEAVHAANPEVLVILSGL----------- 95
NE G P+ +V +W + A+ + A NPEVLV++ G
Sbjct: 221 NEPHGTPGTTPRAKWDNSTDVDNWKHTAETAAKRILAINPEVLVLVEGQEVYPREGETWN 280
Query: 96 --NFDKDLS-------------FVRNQAVNL-TFTGKLVFEAHWYG-FTDGQAWVDGNPN 138
N D DLS VR+ +NL +LV+ H YG Q W D +
Sbjct: 281 SPNTDPDLSPNYYYNWWGGNLRGVRDYPINLGANQDQLVYSPHDYGPLVFNQPWFDKPFD 340
Query: 139 QVCGRVVDNVMRLSGFLLEQG--WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWDW 196
+ + ++V R + + + PL + E+G L G + +R++ + E
Sbjct: 341 KTT--LTNDVWRPNWLYIHENDTAPLLIGEWGGRL-GQDERQDRWMTALRDLIVEHRLHQ 397
Query: 197 ALWTL 201
W L
Sbjct: 398 TFWVL 402
>gi|304281821|gb|ADM21138.1| E1-Cat [synthetic construct]
Length = 381
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 61/156 (39%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG L NE P DW + AV + NP +
Sbjct: 137 WISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNL 196
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFT-DGQAWVDGN--P 137
L+ + G+ + +L V L +LV+ AH Y + Q W P
Sbjct: 197 LIFVEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFP 256
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
N + G N G+L Q P+++ EFG L+
Sbjct: 257 NNMPGIWNKNW----GYLFNQNIAPVWLGEFGTTLQ 288
>gi|304281815|gb|ADM21135.1| Z130-E1-Cat fusion protein [synthetic construct]
Length = 410
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 61/156 (39%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG L NE P DW + AV + NP +
Sbjct: 166 WISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNL 225
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTD-GQAWVDGN--P 137
L+ + G+ + +L V L +LV+ AH Y + Q W P
Sbjct: 226 LIFVEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFP 285
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
N + G N G+L Q P+++ EFG L+
Sbjct: 286 NNMPGIWNKNW----GYLFNQNIAPVWLGEFGTTLQ 317
>gi|14325794|gb|AAK60011.1|AF289822_1 cellulase [Thermus caldophilus]
Length = 425
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 76/188 (40%), Gaps = 23/188 (12%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V+LD H + G F + + D W+ L ++A + NV G+ L NE
Sbjct: 143 VLLDFHTFRCDLIGGRLPGRPFDPSRGYTKDDWLADLRRLAGLSLEFPNVFGIDLANEPY 202
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLN-----------FDKDLSFVRNQAVN 110
+W Q GA AV NP VLV + G+ + ++L+ R+ +
Sbjct: 203 --DLTWAEWKALAQEGARAVLGVNPRVLVAVEGVGNLSPNGGYNAFWGENLAEARD---D 257
Query: 111 LTFTGKLVFEAHWYGFT--DGQAWVDGN-PNQVCGRVVDNVMRLSGFLLEQGWPLFVSEF 167
L +L++ H YG + D + D PN + + LSG +G P + EF
Sbjct: 258 LGLGDRLLYLPHVYGPSVYDQPYFSDSTFPNNMPAVWDAHFGHLSG----RGLPWGIGEF 313
Query: 168 GADLRGNN 175
G G +
Sbjct: 314 GGKYTGQD 321
>gi|428299635|ref|YP_007137941.1| glycoside hydrolase family protein [Calothrix sp. PCC 6303]
gi|428236179|gb|AFZ01969.1| glycoside hydrolase family 5 [Calothrix sp. PCC 6303]
Length = 536
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 54/247 (21%), Positives = 88/247 (35%), Gaps = 33/247 (13%)
Query: 23 FFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGP-----KQNVKDWYRYMQLG 77
++GD + D WI +A + NV+G L+NE G DW +
Sbjct: 176 WYGDGFTEAD-WIDTWKMLAQRYKNQANVIGADLKNEPHGKASWGTSNLATDWRLAAERA 234
Query: 78 AEAVHAANPEVLVILSGLNFD------------KDLSFVRNQAVNLTFTGKLVFEAHWYG 125
A+ NP L+++ G+ + +L VR V L+ KLV+ H YG
Sbjct: 235 GNAILGINPNWLIVVEGVADNVPGQKLKHHWQGGNLEGVRRYPVRLSRRNKLVYSPHEYG 294
Query: 126 ---FTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYL 182
+ G P + R ++ +Q P+ + EFG + +
Sbjct: 295 PGVYKMPYFQEKGFPGNMRSRWETGFYYIA---RKQIAPILIGEFGGRNVDTKSTEGIWQ 351
Query: 183 NCFFGVAAELDWDWALWTLVGSYYLREGVIGLNEYYGLFDWNWCDIRNSSFLERISSLQS 242
N AE + + W+ + GV+ L DW D+ L +I L
Sbjct: 352 NQLVKFIAEKNLSYTYWSWNPNSGDTGGVL-------LDDWQTVDVPKQVLLSKI--LPV 402
Query: 243 PFRGPGV 249
F P +
Sbjct: 403 KFNPPSI 409
>gi|314910730|gb|ADT63057.1| b-1,4-endo-D-glucanase [synthetic construct]
Length = 552
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 61/156 (39%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG L NE P DW + AV + NP +
Sbjct: 162 WISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNL 221
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTD-GQAWVDGN--P 137
L+ + G+ + +L V L +LV+ AH Y + Q W P
Sbjct: 222 LIFVEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFP 281
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
N + G N G+L Q P+++ EFG L+
Sbjct: 282 NNMPGIWNKNW----GYLFNQNIAPVWLGEFGTTLQ 313
>gi|427718343|ref|YP_007066337.1| glycoside hydrolase family protein [Calothrix sp. PCC 7507]
gi|427350779|gb|AFY33503.1| glycoside hydrolase family 5 [Calothrix sp. PCC 7507]
Length = 527
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 18/120 (15%)
Query: 23 FFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGP-----KQNVKDWYRYMQLG 77
++GD + D WI T +A + NV+G L+NE G DW +
Sbjct: 176 WYGDGFTEED-WIDTWTMLAQRYKNQANVIGADLKNEPHGQASWGTNNLATDWRLAAERA 234
Query: 78 AEAVHAANPEVLVILSGLNFD------------KDLSFVRNQAVNLTFTGKLVFEAHWYG 125
A+ NP L+++ G+ + +L V+ V L+ KLV+ H YG
Sbjct: 235 GNAILRVNPNWLIVVEGVEKNVPGQKLKQHWQGGNLEGVKRYPVRLSRRNKLVYSPHEYG 294
>gi|117927822|ref|YP_872373.1| glycoside hydrolase family protein [Acidothermus cellulolyticus
11B]
gi|1708075|sp|P54583.1|GUN1_ACIC1 RecName: Full=Endoglucanase E1; AltName: Full=Cellulase E1;
AltName: Full=Endo-1,4-beta-glucanase E1; AltName:
Full=Endocellulase E1; Flags: Precursor
gi|988300|gb|AAA75477.1| E I beta-1,4-endoglucanase precursor [Acidothermus cellulolyticus]
gi|117648285|gb|ABK52387.1| glycoside hydrolase, family 5 [Acidothermus cellulolyticus 11B]
Length = 562
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 61/156 (39%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG L NE P DW + AV + NP +
Sbjct: 178 WISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNL 237
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTD-GQAWVDGN--P 137
L+ + G+ + +L V L +LV+ AH Y + Q W P
Sbjct: 238 LIFVEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFP 297
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
N + G N G+L Q P+++ EFG L+
Sbjct: 298 NNMPGIWNKNW----GYLFNQNIAPVWLGEFGTTLQ 329
>gi|304281817|gb|ADM21136.1| Z331-E1-Cat fusion protein [synthetic construct]
Length = 411
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 61/156 (39%), Gaps = 21/156 (13%)
Query: 34 WIKGLTKMATIFNGVRNVVGMSLRNELRGPK-----QNVKDWYRYMQLGAEAVHAANPEV 88
WI L +A + G VVG L NE P DW + AV + NP +
Sbjct: 167 WISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNL 226
Query: 89 LVILSGLN--------FDKDLSFVRNQAVNLTFTGKLVFEAHWYGFT-DGQAWVDGN--P 137
L+ + G+ + +L V L +LV+ AH Y + Q W P
Sbjct: 227 LIFVEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFP 286
Query: 138 NQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLR 172
N + G N G+L Q P+++ EFG L+
Sbjct: 287 NNMPGIWNKNW----GYLFNQNIAPVWLGEFGTTLQ 318
>gi|342673919|gb|AEL31247.1| endoglucanase Cel5B [bacterium enrichment culture clone
FXVII-Cel5B]
Length = 848
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 83/213 (38%), Gaps = 26/213 (12%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
V+LD H + G F + + W+ L +MA + NV G+ L NE
Sbjct: 134 VLLDFHTFRCDLVREQLPGKPFDPARGYTKADWLADLERMARLSLEFPNVFGVDLANEPH 193
Query: 62 GPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKD-----------LSFVRNQAVN 110
+W Q GA+AV NP +LV + G+ D L+ R+ +
Sbjct: 194 A--LTWAEWKALAQEGAQAVLRVNPRILVAVEGVGNASDSGGYPAFWGENLTEARD---D 248
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRL----SGFLLEQGWPLFVSE 166
L +L++ H YG + V P +N+ + G L +G P + E
Sbjct: 249 LGLGDRLLYLPHVYGPS-----VSSQPYFSDPTFPENMPAIWDTHFGHLSTRGLPWGIGE 303
Query: 167 FGADLRGNN-VNDNRYLNCFFGVAAELDWDWAL 198
FG G + V +R++ G + + WAL
Sbjct: 304 FGGWYTGQDRVWQDRFVEYLRGKGVRVWFYWAL 336
>gi|434394228|ref|YP_007129175.1| glycoside hydrolase family 5 [Gloeocapsa sp. PCC 7428]
gi|428266069|gb|AFZ32015.1| glycoside hydrolase family 5 [Gloeocapsa sp. PCC 7428]
Length = 567
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 18/120 (15%)
Query: 23 FFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR-----GPKQNVKDWYRYMQLG 77
++GD + D WI +A + NV+G L+NE G DW +
Sbjct: 216 WYGDGFTEQD-WIDTWKMLAVRYQNQANVIGADLKNEPHGRASWGTNDLATDWRLAAERA 274
Query: 78 AEAVHAANPEVLVILSGLNFD------------KDLSFVRNQAVNLTFTGKLVFEAHWYG 125
A+ A NP L+++ G+ + +L VR V L+ +LV+ H YG
Sbjct: 275 GNAILAINPNWLIVVEGVENNVPGQRLSGHWQGGNLEGVRRYPVRLSRPRQLVYSPHEYG 334
>gi|315443935|ref|YP_004076814.1| endoglucanase [Mycobacterium gilvum Spyr1]
gi|315262238|gb|ADT98979.1| endoglucanase [Mycobacterium gilvum Spyr1]
Length = 2310
Score = 39.3 bits (90), Expect = 3.5, Method: Composition-based stats.
Identities = 42/183 (22%), Positives = 73/183 (39%), Gaps = 18/183 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNE-- 59
VILD+H S G + + NG + + + WI T +A + V+G L NE
Sbjct: 1957 VILDHHRSSKG---AGPNSNGLWYEGSYTEAAWIDDWTMLAQRYANTPTVIGADLHNEPH 2013
Query: 60 -LRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLN--------FDKDLSFVRNQAVN 110
DW + AV + N L+ + G+ + +L V+++ +
Sbjct: 2014 AGTWGGGGATDWAAAAERAGNAVLSVNSNWLIFVEGVADYQGHSYWWGGNLMGVKDRPIV 2073
Query: 111 LTFTGKLVFEAHWY-GFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFG 168
L ++V+ H Y Q W + D ++ G++ EQ P+++ EFG
Sbjct: 2074 LNTPNRVVYSPHDYPNSVYAQPWF--QTANFGAALPDKFEQMWGYIYEQKIAPIYLGEFG 2131
Query: 169 ADL 171
L
Sbjct: 2132 TRL 2134
>gi|304408016|ref|ZP_07389666.1| glycoside hydrolase family 5 [Paenibacillus curdlanolyticus YK9]
gi|304343035|gb|EFM08879.1| glycoside hydrolase family 5 [Paenibacillus curdlanolyticus YK9]
Length = 646
Score = 39.3 bits (90), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 35/141 (24%), Positives = 53/141 (37%), Gaps = 22/141 (15%)
Query: 1 MVILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNEL 60
+ILD H G ++ QY + WI +A + V+G L NE
Sbjct: 144 QIILDRHRPDSG-----GQSELWYTAQY-SEQRWINDWVMLAQRYANNPTVIGADLHNEP 197
Query: 61 RGPKQ-----NVKDWYRYMQLGAEAVHAANPEVLVILSGLNFD-----------KDLSFV 104
G DW + A+ AANP L+++ G++ + +L
Sbjct: 198 HGTASWGTGVAATDWRLAAERAGNAILAANPNWLILVEGVSSNVQGESSNYWWGGNLKGA 257
Query: 105 RNQAVNLTFTGKLVFEAHWYG 125
RN V L +LV+ H YG
Sbjct: 258 RNYPVRLDVPNRLVYSPHDYG 278
>gi|426195750|gb|EKV45679.1| hypothetical protein AGABI2DRAFT_186400 [Agaricus bisporus var.
bisporus H97]
Length = 855
Score = 39.3 bits (90), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 56/132 (42%), Gaps = 15/132 (11%)
Query: 143 RVVDNVMRLSGFLLEQG---WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWD--WA 197
R VDN R S +L+E+ +PL S F + G V Y FG+ A +WD WA
Sbjct: 372 RWVDNNSRYSLWLVERPPFVFPLMSSSFACNRSGELVPKKAYSTSPFGIPAREEWDNSWA 431
Query: 198 LW-----TLVGSYYLREGVIGLNE---YYGLFDWNWCDIRNSSFLERISSLQSPFRGPGV 249
LW ++ L E I L +Y + DI+ S L + P R +
Sbjct: 432 LWDFITLRMIPRSMLHEKPIHLRHICLFYIGHIPAFLDIQLSRLLNEPHT--EPERYKDI 489
Query: 250 FETGLHKVIYHP 261
FE G+ + +P
Sbjct: 490 FERGIDPDVDNP 501
>gi|409078850|gb|EKM79212.1| hypothetical protein AGABI1DRAFT_120644 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 870
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 56/132 (42%), Gaps = 15/132 (11%)
Query: 143 RVVDNVMRLSGFLLEQG---WPLFVSEFGADLRGNNVNDNRYLNCFFGVAAELDWD--WA 197
R VDN R S +L+E+ +PL S F + G V Y FG+ A +WD WA
Sbjct: 387 RWVDNNSRYSLWLVERPPFVFPLMSSSFSCNRSGELVPKKAYSTSPFGIPAREEWDNSWA 446
Query: 198 LW-----TLVGSYYLREGVIGLNE---YYGLFDWNWCDIRNSSFLERISSLQSPFRGPGV 249
LW ++ L E I L +Y + DI+ S L + P R +
Sbjct: 447 LWDFITLRMIPRSMLHEKPIHLRHICLFYIGHIPAFLDIQLSRLLNEPHT--EPERYKDI 504
Query: 250 FETGLHKVIYHP 261
FE G+ + +P
Sbjct: 505 FERGIDPDVDNP 516
>gi|427734313|ref|YP_007053857.1| endoglucanase [Rivularia sp. PCC 7116]
gi|427369354|gb|AFY53310.1| endoglucanase [Rivularia sp. PCC 7116]
Length = 564
Score = 38.5 bits (88), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 30/120 (25%), Positives = 48/120 (40%), Gaps = 18/120 (15%)
Query: 23 FFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELRGPKQ-----NVKDWYRYMQLG 77
++GD Y D WI ++A + N++G L+NE G DW +
Sbjct: 207 WYGDGYSEAD-WINTWLQLAQRYKNQTNIIGADLKNEPHGKASWGTNDLATDWRLAAERA 265
Query: 78 AEAVHAANPEVLVILSGLNFD------------KDLSFVRNQAVNLTFTGKLVFEAHWYG 125
A+ NP L+++ G+ + +L V V L+ KLV+ H YG
Sbjct: 266 GNAIGKINPNWLIVVEGVENNVPGQKLAHHWQGGNLEGVLRYPVRLSQRNKLVYSPHEYG 325
>gi|254786853|ref|YP_003074282.1| glycoside hydrolase family 5 domain-containing protein
[Teredinibacter turnerae T7901]
gi|237686533|gb|ACR13797.1| glycoside hydrolase family 5 domain protein [Teredinibacter
turnerae T7901]
Length = 814
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 58/156 (37%), Gaps = 19/156 (12%)
Query: 47 GVRNVVGMSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRN 106
G + V +++ NE G + DW + + AA +++ N+ +D +V
Sbjct: 206 GEEDYVIINIANEPFGNTASADDWIDAHKEAITRLRAAGLTHTLMVDAANWGQDWQYVMR 265
Query: 107 QAVNLTFT----GKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPL 162
F +VF H Y + + VD L F+ + PL
Sbjct: 266 DHAQEIFAHDPLANIVFSIHMYQIFNNRQAVDS--------------YLKTFVEDYKLPL 311
Query: 163 FVSEFGADLRGNNVNDNRYLN-CFFGVAAELDWDWA 197
V EFGAD G +V++ L C L W W+
Sbjct: 312 VVGEFGADHGGEDVDEASILELCELYNLGYLGWSWS 347
>gi|120404477|ref|YP_954306.1| chitinase, cellulase [Mycobacterium vanbaalenii PYR-1]
gi|119957295|gb|ABM14300.1| Chitinase, Cellulase [Mycobacterium vanbaalenii PYR-1]
Length = 2305
Score = 38.1 bits (87), Expect = 7.1, Method: Composition-based stats.
Identities = 41/183 (22%), Positives = 73/183 (39%), Gaps = 18/183 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD+H S G + +GNG + + + WI +A + V+G L NE
Sbjct: 1952 VILDHHRSSAG---AGPNGNGLWYEGSYTEAAWIADWKMLAQRYANDPTVIGADLHNEPH 2008
Query: 62 GPK---QNVKDWYRYMQLGAEAVHAANPEVLVILSGLN--------FDKDLSFVRNQAVN 110
DW + AV + N L+ + G+ + +L V+++ +
Sbjct: 2009 NGTWGGGGATDWAAAAERAGNAVLSVNSNWLIFVEGVETYQGNNYWWGGNLMGVKDRPIV 2068
Query: 111 LTFTGKLVFEAHWY-GFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFG 168
L ++V+ H Y Q W + D ++ G++ EQ P+++ EFG
Sbjct: 2069 LNVPDRVVYSPHDYPNSVYNQPWF--QTANFGAALPDKFEQMWGYIYEQNIAPIYLGEFG 2126
Query: 169 ADL 171
+
Sbjct: 2127 TRM 2129
>gi|408393732|gb|EKJ72991.1| hypothetical protein FPSE_06779 [Fusarium pseudograminearum CS3096]
Length = 390
Score = 38.1 bits (87), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 57/135 (42%), Gaps = 15/135 (11%)
Query: 72 RYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLVFEAHWYGFTDGQA 131
RY Q V NP++ ++L G F + +N A N +VF+ H Y F A
Sbjct: 231 RYFQGVVSRVQKINPKIPIMLQG-GFRPVDFWAKNFAANTN----IVFDVHHYYFAGRPA 285
Query: 132 WVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFGADLRGNNVNDNRYLNCFFGVAAE 191
P+ +C + S +E +P+FV E+ +N +R +N G+ A
Sbjct: 286 TSQNLPDLICTDA-----KSSAVTVEPKFPVFVGEWSIQATSDNNFSSRAINLNAGLKA- 339
Query: 192 LDWDWALWTLVGSYY 206
W+ +T +Y+
Sbjct: 340 ----WSKYTRGSAYW 350
>gi|284030047|ref|YP_003379978.1| glycoside hydrolase family 5 [Kribbella flavida DSM 17836]
gi|283809340|gb|ADB31179.1| glycoside hydrolase family 5 [Kribbella flavida DSM 17836]
Length = 462
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 62/159 (38%), Gaps = 18/159 (11%)
Query: 51 VVGMSLRNELRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVN 110
+ G + NE P V + Y +L A A+ A +P ++ L G + D S N
Sbjct: 184 IAGYNPVNEPADPTGEVIGPF-YQRLTA-AIRAIDPGHVLFLDGNRYSTDFSMFTEVGEN 241
Query: 111 LTFTGKLVFEAHWYGFTDGQAWVDGNPNQVCGRVVDN------VMRLSGFLLEQGWPLFV 164
FT AH Y G A P G+ D + + F+ G P+++
Sbjct: 242 TVFT------AHDYALP-GIAAGSEYPGITRGKYFDRGVVEQTFLERTEFMRSTGTPIWI 294
Query: 165 SEFGADLRGNNVND-NRY--LNCFFGVAAELDWDWALWT 200
EFG G+ D +RY L + E D W LWT
Sbjct: 295 GEFGPVYTGDPATDASRYQLLRDQLDIYREHDASWTLWT 333
>gi|2980984|gb|AAC06197.1| CMC-xylanase [Fibrobacter succinogenes subsp. succinogenes S85]
Length = 635
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 50/226 (22%), Positives = 90/226 (39%), Gaps = 39/226 (17%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD H+ N +G+ +++F T+ AT + ++NV+ + NE
Sbjct: 163 VILDWHV-----LNYNPNGDADKAEEFF---------TRYATKYKNLKNVL-YEIDNEPT 207
Query: 62 GPK---QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKLV 118
+ D Y Y + +A+ A + +VI + +D+ V + ++ G +
Sbjct: 208 STSWYDGSGNDLYTYSKRITKAIRATGNQSVVICGTNTWSQDVDAVAAKPLSADGIGNVA 267
Query: 119 FEAHWYGFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQGWPLFVSEFG-ADLRGNNVN 177
+ H+Y G DN+ L G P+FVSEFG D G
Sbjct: 268 YTLHFY----------------AGTHYDNIKNKLRTALAAGTPVFVSEFGITDASGWGGI 311
Query: 178 DNRYLNCFFGVAAELDWDWALWTLV----GSYYLREGVIGLNEYYG 219
D N + + + +A W+L G+ +L+E +++ G
Sbjct: 312 DIANANDWMTLLTRNNISYAAWSLCNKGEGASFLKESTSKTSKWTG 357
>gi|145223588|ref|YP_001134266.1| chitinase [Mycobacterium gilvum PYR-GCK]
gi|145216074|gb|ABP45478.1| chitinase [Mycobacterium gilvum PYR-GCK]
Length = 2310
Score = 37.7 bits (86), Expect = 9.8, Method: Composition-based stats.
Identities = 41/183 (22%), Positives = 72/183 (39%), Gaps = 18/183 (9%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNE-- 59
VILD+H S G + + NG + + + WI T +A + V+G L NE
Sbjct: 1957 VILDHHRSSKG---AGPNSNGLWYEGSYTEAAWIDDWTMLAQRYANTPTVIGADLHNEPH 2013
Query: 60 -LRGPKQNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSF--------VRNQAVN 110
DW + AV + N L+ + G+ + S+ V+++ +
Sbjct: 2014 AGTWGGGGATDWAAAAERAGNAVLSVNSNWLIFVEGVADHQGHSYWWGGNLMGVKDRPIV 2073
Query: 111 LTFTGKLVFEAHWY-GFTDGQAWVDGNPNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFG 168
L ++V+ H Y Q W + D ++ G++ EQ P+++ E G
Sbjct: 2074 LNTPNRVVYSPHDYPNSVYAQPWF--QTANFGAALPDKFEQMWGYIYEQNIAPIYLGELG 2131
Query: 169 ADL 171
L
Sbjct: 2132 TRL 2134
>gi|374998476|ref|YP_004973975.1| Putative endoglucanase (Cellulase) [Azospirillum lipoferum 4B]
gi|357425901|emb|CBS88800.1| Putative endoglucanase (Cellulase) [Azospirillum lipoferum 4B]
Length = 839
Score = 37.7 bits (86), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 30/133 (22%), Positives = 55/133 (41%), Gaps = 7/133 (5%)
Query: 2 VILDNHISKPGWCCSNSDGNGFFGDQYFNPDLWIKGLTKMATIFNGVRNVVGMSLRNELR 61
VILD+H + G + ++ +G + F I+ +A + G V+G L NE
Sbjct: 420 VILDHHRNDGG---AGTNEHGLWYSNRFPESTVIENWKMLAKHYAGNETVIGADLHNEPS 476
Query: 62 GPK----QNVKDWYRYMQLGAEAVHAANPEVLVILSGLNFDKDLSFVRNQAVNLTFTGKL 117
DW + A+ + N + L+ + G + L+ +++ + KL
Sbjct: 477 VTATWGGSAATDWASAAERIGNAIQSVNKDWLLFVEGTEWSSTLAGAQSRPIEFDTPNKL 536
Query: 118 VFEAHWYGFTDGQ 130
V+ H YG + G
Sbjct: 537 VYSPHAYGHSVGN 549
>gi|146197125|dbj|BAF57326.1| putative glycosyl hydrolase family5 [uncultured symbiotic protist
of Hodotermopsis sjoestedti]
Length = 372
Score = 37.7 bits (86), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 41/175 (23%), Positives = 72/175 (41%), Gaps = 33/175 (18%)
Query: 31 PDLWIKGLTKMATIFNGVRNVVGMSLRNELRGP-----------KQNVKDWYRYMQLGAE 79
P+ K + A + V+G+ ++NE G +W+ A
Sbjct: 144 PEYLYKAVEFFADAYKDDDTVIGIDIKNEPHGECGSSSGATWDSSTTDNNWHYVSLTAAN 203
Query: 80 AVHAANPEVLVILSGLNFDKD------------LSFVRNQAVNL-TFTGKLVFEAHWYGF 126
+ A NP +L+++ G+ K LSFV++ ++L + KLV+ H YG
Sbjct: 204 RILAKNPNLLILVEGIACYKGPRGQDNGWWGGVLSFVKDLPLDLGKYQNKLVYSPHEYGP 263
Query: 127 TD-GQAWVDGN---PNQVCGRVVDNVMRLSGFLLEQG-WPLFVSEFGADLRGNNV 176
+ Q+W G+ + D+ M F+ E G PL + E+G ++G N
Sbjct: 264 SVFDQSWFQGDFTYDSLYNDHWKDSWM----FIHEDGIAPLLIGEWGGHIQGTNT 314
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.139 0.461
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,145,473,125
Number of Sequences: 23463169
Number of extensions: 311477331
Number of successful extensions: 579013
Number of sequences better than 100.0: 384
Number of HSP's better than 100.0 without gapping: 205
Number of HSP's successfully gapped in prelim test: 179
Number of HSP's that attempted gapping in prelim test: 578007
Number of HSP's gapped (non-prelim): 444
length of query: 414
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 269
effective length of database: 8,957,035,862
effective search space: 2409442646878
effective search space used: 2409442646878
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)