RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4449
         (2417 letters)



>gnl|CDD|119351 cd02872, GH18_chitolectin_chitotriosidase, This conserved domain
            family includes a large number of catalytically inactive
            chitinase-like lectins (chitolectins) including YKL-39,
            YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic
            mammalian chitinase), as well as catalytically active
            chitotriosidases.  The conserved domain is an
            eight-stranded alpha/beta barrel fold belonging to the
            family 18 glycosyl hydrolases.  The fold has a pronounced
            active-site cleft at the C-terminal end of the
            beta-barrel.  The chitolectins lack a key active site
            glutamate (the proton donor required for hydrolytic
            activity) but retain highly conserved residues involved
            in oligosaccharide binding.  Chitotriosidase is a
            chitinolytic enzyme expressed in maturing macrophages,
            which suggests that it plays a part in antimicrobial
            defense.  Chitotriosidase hydrolyzes chitotriose, as well
            as colloidal chitin to yield chitobiose and is therefore
            considered an exochitinase. Chitotriosidase occurs in two
            major forms, the large form being converted to the small
            form by either RNA or post-translational processing.
            Although the small form, containing the chitinase domain
            alone, is sufficient for the chitinolytic activity, the
            additional C-terminal chitin-binding domain of the large
            form plays a role in processing colloidal chitin. The
            chitotriosidase gene is nonessential in humans, as about
            35% of the population are heterozygous and 6% homozygous
            for an inactivated form of the gene.  HCGP39 is a 39-kDa
            human cartilage glycoprotein thought to play a role in
            connective tissue remodeling and defense against
            pathogens.
          Length = 362

 Score =  531 bits (1370), Expect = e-172
 Identities = 183/371 (49%), Positives = 240/371 (64%), Gaps = 15/371 (4%)

Query: 1713 IVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFYE 1772
            +VCYFTNWA YR   GK++P +ID  LCTH+IY FA L+ D  +I   D W D+D   YE
Sbjct: 1    VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNPDGNIIIL-DEWNDIDLGLYE 59

Query: 1773 KVTALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDL 1830
            +  ALK+K   +K  LAIGGWN  +  K+S +  S + R  FI   + F+ ++ FDGLDL
Sbjct: 60   RFNALKEKNPNLKTLLAIGGWNFGSA-KFSAMAASPENRKTFIKSAIAFLRKYGFDGLDL 118

Query: 1831 DWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHD--LLLSAAVSPSKAVIDNAYDI 1888
            DWEYP         GP  DK+ F  L+KELR AF P    LLL+AAVS  K  ID AYDI
Sbjct: 119  DWEYPGQRG-----GPPEDKENFVTLLKELREAFEPEAPRLLLTAAVSAGKETIDAAYDI 173

Query: 1889 PVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTT--PTFNANYSLHYWVSHGADR 1946
            P +S+ LD+I+VMTYD+HG W+  TGH +P+YA   DT      N +Y++ YW+S GA  
Sbjct: 174  PEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSADTGDQKYLNVDYAIKYWLSKGAPP 233

Query: 1947 KKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKIQKDGWVV 2006
            +K++ G+P YG+SFTLA  +  G+ +   G   AG  TR  GFLAYYEIC+   K GW V
Sbjct: 234  EKLVLGIPTYGRSFTLASPSNTGVGAPASGPGTAGPYTREAGFLAYYEICEF-LKSGWTV 292

Query: 2007 VRDRKRRIGPYAFKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDFKNFCGCES 2066
            V D +++  PYA+KG+QWVG+DD+  I  K +++K   LGGAM+W++DLDDF+  CG   
Sbjct: 293  VWDDEQK-VPYAYKGNQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDLDDFRGTCGQGK 351

Query: 2067 YPLLKTINRVL 2077
            YPLL  INR L
Sbjct: 352  YPLLNAINRAL 362



 Score =  376 bits (968), Expect = e-117
 Identities = 128/310 (41%), Positives = 183/310 (59%), Gaps = 13/310 (4%)

Query: 385 VVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFYQ 444
           VVCY TNW+  R  +GKFVPENID  LCTHI+YAFA L+P+  +I   +   DID   Y+
Sbjct: 1   VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNPD-GNIIILDEWNDIDLGLYE 59

Query: 445 RISSSPLVTHGKVKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLH 504
           R ++     +  +K L+AIGGW   S  K+S + +S  NRK FIKS + FLR++ F GL 
Sbjct: 60  RFNALKE-KNPNLKTLLAIGGWNFGS-AKFSAMAASPENRKTFIKSAIAFLRKYGFDGLD 117

Query: 505 FDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDKHD--YSIAVGISGYKEILEVAYD 562
            DW YP             DK NFV L++EL+  F+       +   +S  KE ++ AYD
Sbjct: 118 LDWEYPGQRGGP-----PEDKENFVTLLKELREAFEPEAPRLLLTAAVSAGKETIDAAYD 172

Query: 563 FPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEP--YPNYNINTALKLIDELGGD 620
            P ++ +++F+++M+YD+HG+WEG+TG  SPL +   +       N++ A+K     G  
Sbjct: 173 IPEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSADTGDQKYLNVDYAIKYWLSKGAP 232

Query: 621 KRKIVIGVPFYGQSYTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTRKWT 680
             K+V+G+P YG+S+TL      G+GA   GPG AG YT++ G LAYYEIC  +K   WT
Sbjct: 233 PEKLVLGIPTYGRSFTLASPSNTGVGAPASGPGTAGPYTREAGFLAYYEICEFLK-SGWT 291

Query: 681 VQRNTKSLEP 690
           V  + +   P
Sbjct: 292 VVWDDEQKVP 301



 Score =  302 bits (775), Expect = 1e-91
 Identities = 125/379 (32%), Positives = 197/379 (51%), Gaps = 34/379 (8%)

Query: 15  VVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAIDPVSRALIP-EDLEYDVIKGGY 73
           VVCY    + YR     F  +N+ P  CTH+IYA+A ++P    +I  E  + D   G Y
Sbjct: 1   VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNPDGNIIILDEWNDID--LGLY 58

Query: 74  KSFLGLKEANPELKVYLAV------KSNFVSITSDRESRLNFISSVLEMFDMYKFDGLDL 127
           + F  LKE NP LK  LA+       + F ++ +  E+R  FI S +     Y FDGLDL
Sbjct: 59  ERFNALKEKNPNLKTLLAIGGWNFGSAKFSAMAASPENRKTFIKSAIAFLRKYGFDGLDL 118

Query: 128 NVKDPALNDEDDDDLESIANERSDFSTFIQELSSTLRRNNYQLTLT-----SPGVIDRKT 182
           + + P       +D       + +F T ++EL         +L LT         ID   
Sbjct: 119 DWEYPGQRGGPPED-------KENFVTLLKELREAFEPEAPRLLLTAAVSAGKETIDAA- 170

Query: 183 SLVDISVVAPLVDLILLKSFN----NDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAV 238
              DI  ++  +D I + +++     + +     P+   +       +    + N++ A+
Sbjct: 171 --YDIPEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSA-----DTGDQKYLNVDYAI 223

Query: 239 YNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGPGTEGKYTQMPGYLAFFEVC 298
             W+ KGA PE++++GIP +G+S+ L + S  G+GA   GPGT G YT+  G+LA++E+C
Sbjct: 224 KYWLSKGAPPEKLVLGIPTYGRSFTLASPSNTGVGAPASGPGTAGPYTREAGFLAYYEIC 283

Query: 299 NKFKDKTWRHFTDSNGEPFMVKKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDDF 358
              K      + D    P+  K ++W+ Y++ +SI  K+ Y+K + LGGAM+W++DLDDF
Sbjct: 284 EFLKSGWTVVWDDEQKVPYAYKGNQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDLDDF 343

Query: 359 RGFCGQ-KYPLLSAVVSNL 376
           RG CGQ KYPLL+A+   L
Sbjct: 344 RGTCGQGKYPLLNAINRAL 362



 Score =  292 bits (750), Expect = 3e-88
 Identities = 112/267 (41%), Positives = 146/267 (54%), Gaps = 49/267 (18%)

Query: 1342 IVCYFTNWAWYRPGKGKYVLKKV----------------PWINILKLYVTLQTGLGTDQE 1385
            +VCYFTNWA YRPG GK+V + +                P  NI+ L       LG    
Sbjct: 1    VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNPDGNIIILDEWNDIDLG---- 56

Query: 1386 KENTFLKISELTYVHILFYERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNSATARQ 1443
                              YER   LK+K   +K  LAIGGWN     K+S +  S   R+
Sbjct: 57   -----------------LYERFNALKEKNPNLKTLLAIGGWNFG-SAKFSAMAASPENRK 98

Query: 1444 RFIEHVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPH--G 1501
             FI+  + FL KY FDGLDLDWEYP         GP  DKE+F   ++EL +AF+P    
Sbjct: 99   TFIKSAIAFLRKYGFDGLDLDWEYPGQRG-----GPPEDKENFVTLLKELREAFEPEAPR 153

Query: 1502 LLLSAAVSPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDD--D 1559
            LLL+AAVS  K+ I+AAYD+  +S+ LD+I+VMTYD+HG W+  TGH +PLY    D  D
Sbjct: 154  LLLTAAVSAGKETIDAAYDIPEISKYLDFINVMTYDFHGSWEGVTGHNSPLYAGSADTGD 213

Query: 1560 FFYFNANFTMNYWMKKGAPSRKLVMGV 1586
              Y N ++ + YW+ KGAP  KLV+G+
Sbjct: 214  QKYLNVDYAIKYWLSKGAPPEKLVLGI 240



 Score =  175 bits (445), Expect = 2e-47
 Identities = 54/116 (46%), Positives = 72/116 (62%), Gaps = 1/116 (0%)

Query: 695 TLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTRKWTVQRNTKSLEPFTFQG 754
           TL      G+GA   GPG AG YT++ G LAYYEIC  +K   WTV  + +   P+ ++G
Sbjct: 248 TLASPSNTGVGAPASGPGTAGPYTREAGFLAYYEICEFLK-SGWTVVWDDEQKVPYAYKG 306

Query: 755 DQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLDDFHNKCCMESFPLLRAVNRAF 810
           +QWV Y+D  S+  KV+++KS G  G M W+IDLDDF   C    +PLL A+NRA 
Sbjct: 307 NQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDLDDFRGTCGQGKYPLLNAINRAL 362



 Score =  155 bits (393), Expect = 1e-40
 Identities = 51/137 (37%), Positives = 77/137 (56%), Gaps = 4/137 (2%)

Query: 2192 VICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFYE 2251
            V+CY+T+WA  R   GK+ PE+ID  LCTH++YAFA L+  +  I   +   D++ G YE
Sbjct: 1    VVCYFTNWAQYRPGNGKFVPENIDPFLCTHIIYAFAGLNP-DGNIIILDEWNDIDLGLYE 59

Query: 2252 KITKFNER--GIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEF 2309
            +     E+   ++ +LAIGGW+   S K+S +      R+ F++    FL  YGF+GL+ 
Sbjct: 60   RFNALKEKNPNLKTLLAIGGWNFG-SAKFSAMAASPENRKTFIKSAIAFLRKYGFDGLDL 118

Query: 2310 AWEYPKCWQTCSTVKSH 2326
             WEYP         K +
Sbjct: 119  DWEYPGQRGGPPEDKEN 135



 Score =  113 bits (286), Expect = 1e-26
 Identities = 44/82 (53%), Positives = 52/82 (63%), Gaps = 2/82 (2%)

Query: 2320 CSTVKSHEWTVVRDSLGARGPYAYNGKLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDL 2379
            C  +KS  WTVV D    + PYAY G  W+ YDD   I  K + +K  GLGGAMVWSIDL
Sbjct: 283  CEFLKS-GWTVVWDD-EQKVPYAYKGNQWVGYDDEESIALKVQYLKSKGLGGAMVWSIDL 340

Query: 2380 DDFQNLCSCGPYPLLSAVNYEL 2401
            DDF+  C  G YPLL+A+N  L
Sbjct: 341  DDFRGTCGQGKYPLLNAINRAL 362



 Score = 58.3 bits (142), Expect = 2e-08
 Identities = 18/35 (51%), Positives = 23/35 (65%), Gaps = 1/35 (2%)

Query: 867 SI-LQVKFIKSNGYAGIMAWTIDLDDFHNKCCMDK 900
           SI L+V+++KS G  G M W+IDLDDF   C   K
Sbjct: 317 SIALKVQYLKSKGLGGAMVWSIDLDDFRGTCGQGK 351


>gnl|CDD|214753 smart00636, Glyco_18, Glyco_18 domain. 
          Length = 334

 Score =  381 bits (981), Expect = e-119
 Identities = 136/355 (38%), Positives = 198/355 (55%), Gaps = 31/355 (8%)

Query: 1712 KIVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFY 1771
            ++V YFTNW  Y   G  +   DI +   TH+IY FA +D D   +   D WAD+ N  +
Sbjct: 1    RVVGYFTNWGVY---GRNFPVDDIPASKLTHIIYAFANIDPD-GTVTIGDEWADIGN--F 54

Query: 1772 EKVTALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLD 1829
             ++ ALKKK  G+KV L+IGGW +S  + +S +++   +R KFI  +V+F+ ++ FDG+D
Sbjct: 55   GQLKALKKKNPGLKVLLSIGGWTES--DNFSSMLSDPASRKKFIDSIVSFLKKYGFDGID 112

Query: 1830 LDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFN-----PHDLLLSAAVSPSKAVIDN 1884
            +DWEYP             D++ +  L+KELR A +         LL+ AV      ID 
Sbjct: 113  IDWEYPGGRG--------DDRENYTALLKELREALDKEGAEGKGYLLTIAVPAGPDKIDK 164

Query: 1885 AYD-IPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHG 1943
             Y  +P +++ LD+I++MTYD+HG W   TGH AP+YA P D    +N +Y++ Y++  G
Sbjct: 165  GYGDLPAIAKYLDFINLMTYDFHGAWSNPTGHNAPLYAGPGDPEK-YNVDYAVKYYLCKG 223

Query: 1944 ADRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKIQKDG 2003
                K++ G+P YG+ +TL D + NG  +   G A  G  T   G + Y EIC  +   G
Sbjct: 224  VPPSKLVLGIPFYGRGWTLVDGSNNGPGAPFTGPATGGPGTWEGGVVDYREICKLL---G 280

Query: 2004 WVVVRDRKRRIGPYAFKGD--QWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLD 2056
              VV D   +  PYA+     QWV +DD   I  KA++VK   LGG MIW LD D
Sbjct: 281  ATVVYDDTAK-APYAYNPGTGQWVSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334



 Score =  290 bits (744), Expect = 5e-88
 Identities = 123/413 (29%), Positives = 196/413 (47%), Gaps = 86/413 (20%)

Query: 384 KVVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFY 443
           +VV Y TNW         F  ++I     THI+YAFA++DP+  ++  G+  ADI  NF 
Sbjct: 1   RVVGYFTNWG---VYGRNFPVDDIPASKLTHIIYAFANIDPD-GTVTIGDEWADIG-NFG 55

Query: 444 QRISSSPLV-THGKVKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAG 502
           Q  +   L   +  +K+L++IGGWT+S  + +S ++S  ++RKKFI S+++FL+++ F G
Sbjct: 56  QLKA---LKKKNPGLKVLLSIGGWTES--DNFSSMLSDPASRKKFIDSIVSFLKKYGFDG 110

Query: 503 LHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDK-----HDYSIAVGISGYKEIL 557
           +  DW YP     D        + N+  L++EL+   DK       Y + + +    + +
Sbjct: 111 IDIDWEYPGGRGDD--------RENYTALLKELREALDKEGAEGKGYLLTIAVPAGPDKI 162

Query: 558 EVAYDF-PALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNYNINTALKLIDE 616
           +  Y   PA+  +++F++LM+YD+HGAW   TG  +PL + PG+P   YN++ A+K    
Sbjct: 163 DKGYGDLPAIAKYLDFINLMTYDFHGAWSNPTGHNAPLYAGPGDP-EKYNVDYAVKYYLC 221

Query: 617 LGGDKRKIVIGVPFYGQSYTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKT 676
            G    K+V+G+PFYG+ +TL     +G GA   GP   G  T + G++ Y EIC   K 
Sbjct: 222 KGVPPSKLVLGIPFYGRGWTLVDGSNNGPGAPFTGPATGGPGTWEGGVVDYREIC---KL 278

Query: 677 RKWTVQRNTKSLEPRSMGTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTR 736
              TV  +  +                             Y             Y   T 
Sbjct: 279 LGATVVYDDTA--------------------------KAPY------------AYNPGTG 300

Query: 737 KWTVQRNTKSLEPFTFQGDQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLD 789
           +W                   VSY+DP S+  K  ++K  G  G+M W +D D
Sbjct: 301 QW-------------------VSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334



 Score =  230 bits (588), Expect = 5e-67
 Identities = 84/259 (32%), Positives = 131/259 (50%), Gaps = 39/259 (15%)

Query: 1341 KIVCYFTNWAWYRPGKGKYVLKKVPW-----INILKLYVTLQTGLGTDQEKENTFLKISE 1395
            ++V YFTNW  Y      + +  +P      I      +    G  T  ++        +
Sbjct: 1    RVVGYFTNWGVYGR---NFPVDDIPASKLTHIIYAFANID-PDGTVTIGDEWADIGNFGQ 56

Query: 1396 LTYVHILFYERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFL 1453
            L             LKKK  G+KV L+IGGW +S    +S +++   +R++FI+ +V FL
Sbjct: 57   LKA-----------LKKKNPGLKVLLSIGGWTES--DNFSSMLSDPASRKKFIDSIVSFL 103

Query: 1454 LKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFK-----PHGLLLSAAV 1508
             KY FDG+D+DWEYP             D+E++   ++EL +A         G LL+ AV
Sbjct: 104  KKYGFDGIDIDWEYPGGR--------GDDRENYTALLKELREALDKEGAEGKGYLLTIAV 155

Query: 1509 SPSKQVINAAYD-VKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANF 1567
                  I+  Y  + A+++ LD+I++MTYD+HG W   TGH APLY  P D    +N ++
Sbjct: 156  PAGPDKIDKGYGDLPAIAKYLDFINLMTYDFHGAWSNPTGHNAPLYAGPGDP-EKYNVDY 214

Query: 1568 TMNYWMKKGAPSRKLVMGV 1586
             + Y++ KG P  KLV+G+
Sbjct: 215  AVKYYLCKGVPPSKLVLGI 233



 Score =  185 bits (473), Expect = 1e-51
 Identities = 100/356 (28%), Positives = 161/356 (45%), Gaps = 35/356 (9%)

Query: 14  QVVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAIDPVSRALIPEDLEYDVIKGGY 73
           +VV Y      Y      F V ++     TH+IYA+A IDP    +   D   D+   G 
Sbjct: 1   RVVGYFTNWGVYGRN---FPVDDIPASKLTHIIYAFANIDP-DGTVTIGDEWADIGNFGQ 56

Query: 74  KSFLGLKEANPELKVYLAV-----KSNFVSITSDRESRLNFISSVLEMFDMYKFDGLDLN 128
                LK+ NP LKV L++       NF S+ SD  SR  FI S++     Y FDG+D++
Sbjct: 57  --LKALKKKNPGLKVLLSIGGWTESDNFSSMLSDPASRKKFIDSIVSFLKKYGFDGIDID 114

Query: 129 VKDPALNDEDDDDLESIANERSDFSTFIQELSSTLRR-----NNYQLTLTSPGVIDRKTS 183
            + P    +D ++          ++  ++EL   L +       Y LT+  P   D+   
Sbjct: 115 WEYPGGRGDDREN----------YTALLKELREALDKEGAEGKGYLLTIAVPAGPDKIDK 164

Query: 184 LVD-ISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAVYNWI 242
               +  +A  +D I L ++  D       P   N    +          N++ AV  ++
Sbjct: 165 GYGDLPAIAKYLDFINLMTY--DFHGAWSNPTGHNA--PLYAGPGDPEKYNVDYAVKYYL 220

Query: 243 KKGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGPGTEGKYTQMPGYLAFFEVCNKFK 302
            KG  P ++++GIPF+G+ + L + S  G GA   GP T G  T   G + + E+C    
Sbjct: 221 CKGVPPSKLVLGIPFYGRGWTLVDGSNNGPGAPFTGPATGGPGTWEGGVVDYREICKLLG 280

Query: 303 DKTWRHFTDSNGEPFMVKKD--EWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLD 356
                 + D+   P+       +W++Y++  SI+ K +Y+KD+ LGG M+W LD D
Sbjct: 281 AT--VVYDDTAKAPYAYNPGTGQWVSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334



 Score =  119 bits (300), Expect = 9e-29
 Identities = 40/129 (31%), Positives = 73/129 (56%), Gaps = 6/129 (4%)

Query: 2191 KVICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFY 2250
            +V+ Y+T+W    +    +  +DI  S  TH++YAFA++D  +  +   +   D+     
Sbjct: 1    RVVGYFTNWGVYGR---NFPVDDIPASKLTHIIYAFANIDP-DGTVTIGDEWADIGNFGQ 56

Query: 2251 EKITKFNERGIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEFA 2310
             K  K    G++V+L+IGGW++S  D +S +++D  +R+ F+  +  FL+ YGF+G++  
Sbjct: 57   LKALKKKNPGLKVLLSIGGWTES--DNFSSMLSDPASRKKFIDSIVSFLKKYGFDGIDID 114

Query: 2311 WEYPKCWQT 2319
            WEYP     
Sbjct: 115  WEYPGGRGD 123



 Score = 67.7 bits (166), Expect = 2e-11
 Identities = 28/60 (46%), Positives = 34/60 (56%), Gaps = 3/60 (5%)

Query: 2323 VKSHEWTVVRDSLGARGPYAYNG--KLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLD 2380
             K    TVV D   A+ PYAYN     W+SYDD R I+ K   +K  GLGG M+W +D D
Sbjct: 276  CKLLGATVVYDDT-AKAPYAYNPGTGQWVSYDDPRSIKAKADYVKDKGLGGVMIWELDAD 334



 Score = 35.0 bits (81), Expect = 0.44
 Identities = 7/20 (35%), Positives = 11/20 (55%)

Query: 871 VKFIKSNGYAGIMAWTIDLD 890
             ++K  G  G+M W +D D
Sbjct: 315 ADYVKDKGLGGVMIWELDAD 334


>gnl|CDD|216071 pfam00704, Glyco_hydro_18, Glycosyl hydrolases family 18. 
          Length = 325

 Score =  332 bits (852), Expect = e-102
 Identities = 128/352 (36%), Positives = 179/352 (50%), Gaps = 33/352 (9%)

Query: 1711 FKIVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKF 1770
             +IV Y+T W  Y +    +   DI +D  TH+IY FA +D +       DT  D     
Sbjct: 1    GRIVGYYTQWGNYGEG---FPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKGC 57

Query: 1771 YEKVTALKKK---GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDG 1827
            +E++  LKK    GVKV L+IGGW  S    +S L +    R  F   +++F+ ++ FDG
Sbjct: 58   FEQLKDLKKCQNPGVKVLLSIGGWTFS--GGFSLLASDDAKRKTFADSIIDFLKKYGFDG 115

Query: 1828 LDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPH---DLLLSAAVSPSKAVIDN 1884
            +D+DWEYP             DK  +  L+KELRAA         LLSAAV      ID 
Sbjct: 116  IDIDWEYPGGKG--------DDKDNYTALLKELRAALKKEAKAGYLLSAAVPAGPDKIDG 167

Query: 1885 AYDIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHGA 1944
              DI  + + LD+I++MTYD+HG W   TG  AP+Y          N +Y++ Y++  G 
Sbjct: 168  -SDIAKIGKYLDFINLMTYDFHG-WSNITGPNAPLYDGSWQ-----NVDYAVQYYLKAGV 220

Query: 1945 DRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKIQKDGW 2004
               K++ G+P YG+ +TL + + NG      G    G  T   G L+Y EIC  + K G 
Sbjct: 221  PASKLVLGIPFYGRGWTLVNGSGNG-----GGAPAPGPGTWEGGILSYKEICALL-KSGA 274

Query: 2005 VVVRDRKRRIGPYAFKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLD 2056
                D      PY +KG Q+V +DD   I  KA++VK   LGG MIW+LD D
Sbjct: 275  GPGYD-DTAKAPYIYKGKQFVSYDDPRSIKAKAKYVKDKGLGGVMIWSLDQD 325



 Score =  227 bits (580), Expect = 4e-66
 Identities = 107/412 (25%), Positives = 170/412 (41%), Gaps = 92/412 (22%)

Query: 383 PKVVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNF 442
            ++V Y T W    +    F  ++I     THI+YAFA++D N       + E D     
Sbjct: 1   GRIVGYYTQWGNYGEG---FPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKGC 57

Query: 443 YQRISSSPLVTHGKVKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAG 502
           ++++       +  VK+L++IGGWT S G  +S L S  + RK F  S++ FL+++ F G
Sbjct: 58  FEQLKDLKKCQNPGVKVLLSIGGWTFSGG--FSLLASDDAKRKTFADSIIDFLKKYGFDG 115

Query: 503 LHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDKHDY-----SIAVGISGYKEIL 557
           +  DW YP             DK N+  L++EL+A   K        S AV     K   
Sbjct: 116 IDIDWEYPGGKG--------DDKDNYTALLKELRAALKKEAKAGYLLSAAVPAGPDK--- 164

Query: 558 EVAYDFPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNYNINTALKLIDEL 617
               D   +  +++F++LM+YD+HG W  ITG  +PL     +     N++ A++   + 
Sbjct: 165 IDGSDIAKIGKYLDFINLMTYDFHG-WSNITGPNAPLYDGSWQ-----NVDYAVQYYLKA 218

Query: 618 GGDKRKIVIGVPFYGQSYTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTR 677
           G    K+V+G+PFYG+ +TL     +G GA   GP      T + G+L+Y EIC  +K+ 
Sbjct: 219 GVPASKLVLGIPFYGRGWTLVNGSGNGGGAPAPGP-----GTWEGGILSYKEICALLKS- 272

Query: 678 KWTVQRNTKSLEPRSMGTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEICYRIKTRK 737
                 +                          P     Y  +      Y+    IK   
Sbjct: 273 GAGPGYD---------------------DTAKAP-----YIYKGKQFVSYDDPRSIKA-- 304

Query: 738 WTVQRNTKSLEPFTFQGDQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLD 789
                                          K K++K  G  G+M W++D D
Sbjct: 305 -------------------------------KAKYVKDKGLGGVMIWSLDQD 325



 Score =  205 bits (524), Expect = 1e-58
 Identities = 86/258 (33%), Positives = 127/258 (49%), Gaps = 40/258 (15%)

Query: 1340 FKIVCYFTNWAWYRPGKGKYVLKKVPW-----INILKLYVTLQTGLGTDQEKENTFLKIS 1394
             +IV Y+T W  Y  G   + L  +P      I      +      G + + E+  LK  
Sbjct: 1    GRIVGYYTQWGNYGEG---FPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKG- 56

Query: 1395 ELTYVHILFYERVVTLKKK---GVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVK 1451
                     +E++  LKK    GVKV L+IGGW  S  G +S L +    R+ F + ++ 
Sbjct: 57   --------CFEQLKDLKKCQNPGVKVLLSIGGWTFS--GGFSLLASDDAKRKTFADSIID 106

Query: 1452 FLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPH---GLLLSAAV 1508
            FL KY FDG+D+DWEYP             DK+++   ++EL  A K     G LLSAAV
Sbjct: 107  FLKKYGFDGIDIDWEYPGGKG--------DDKDNYTALLKELRAALKKEAKAGYLLSAAV 158

Query: 1509 SPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANFT 1568
                  I    D+  + + LD+I++MTYD+HG W   TG  APLY+         N ++ 
Sbjct: 159  PAGPDKI-DGSDIAKIGKYLDFINLMTYDFHG-WSNITGPNAPLYDGSWQ-----NVDYA 211

Query: 1569 MNYWMKKGAPSRKLVMGV 1586
            + Y++K G P+ KLV+G+
Sbjct: 212  VQYYLKAGVPASKLVLGI 229



 Score =  183 bits (465), Expect = 9e-51
 Identities = 95/353 (26%), Positives = 164/353 (46%), Gaps = 37/353 (10%)

Query: 13  HQVVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAIDPVSRALIPEDLEYDVIKGG 72
            ++V Y      Y      F + ++     TH+IYA+A ID         D E D +KG 
Sbjct: 1   GRIVGYYTQWGNYGE---GFPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKGC 57

Query: 73  YKSFLGLKEA-NPELKVYLAV-----KSNFVSITSDRESRLNFISSVLEMFDMYKFDGLD 126
           ++    LK+  NP +KV L++        F  + SD   R  F  S+++    Y FDG+D
Sbjct: 58  FEQLKDLKKCQNPGVKVLLSIGGWTFSGGFSLLASDDAKRKTFADSIIDFLKKYGFDGID 117

Query: 127 LNVKDPALNDEDDDDLESIANERSDFSTFIQELSS---TLRRNNYQLTLTSPGVIDRKTS 183
           ++ + P    +D          + +++  ++EL +      +  Y L+   P   D K  
Sbjct: 118 IDWEYPGGKGDD----------KDNYTALLKELRAALKKEAKAGYLLSAAVPAGPD-KID 166

Query: 184 LVDISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAVYNWIK 243
             DI+ +   +D I L +++     +   P  P    + Q         N++ AV  ++K
Sbjct: 167 GSDIAKIGKYLDFINLMTYDFHGWSNITGPNAPLYDGSWQ---------NVDYAVQYYLK 217

Query: 244 KGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGPGTEGKYTQMPGYLAFFEVCNKFKD 303
            G    ++++GIPF+G+ + L N S  G GA   GPGT        G L++ E+C   K 
Sbjct: 218 AGVPASKLVLGIPFYGRGWTLVNGSGNGGGAPAPGPGTWE-----GGILSYKEICALLKS 272

Query: 304 KTWRHFTDSNGEPFMVKKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLD 356
                + D+   P++ K  ++++Y++  SI+ K  Y+KD+ LGG M+W+LD D
Sbjct: 273 GAGPGYDDTAKAPYIYKGKQFVSYDDPRSIKAKAKYVKDKGLGGVMIWSLDQD 325



 Score =  112 bits (281), Expect = 2e-26
 Identities = 39/133 (29%), Positives = 70/133 (52%), Gaps = 8/133 (6%)

Query: 2190 YKVICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGF 2249
             +++ YYT W         +  +DI     TH++YAFA++D       + + + D  +G 
Sbjct: 1    GRIVGYYTQWG---NYGEGFPLDDIPTDKLTHIIYAFANIDGNGTFGNNADTEDDGLKGC 57

Query: 2250 YEKITKFNE---RGIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNG 2306
            +E++    +    G++V+L+IGGW+ S    +S L +D   R+ F   + +FL+ YGF+G
Sbjct: 58   FEQLKDLKKCQNPGVKVLLSIGGWTFS--GGFSLLASDDAKRKTFADSIIDFLKKYGFDG 115

Query: 2307 LEFAWEYPKCWQT 2319
            ++  WEYP     
Sbjct: 116  IDIDWEYPGGKGD 128



 Score = 67.1 bits (164), Expect = 3e-11
 Identities = 28/64 (43%), Positives = 38/64 (59%), Gaps = 2/64 (3%)

Query: 2317 WQTCSTVKSHEWTVVRDSLGARGPYAYNGKLWLSYDDTRDIRRKCKSIKQDGLGGAMVWS 2376
             + C+ +KS       D   A+ PY Y GK ++SYDD R I+ K K +K  GLGG M+WS
Sbjct: 264  KEICALLKS-GAGPGYDD-TAKAPYIYKGKQFVSYDDPRSIKAKAKYVKDKGLGGVMIWS 321

Query: 2377 IDLD 2380
            +D D
Sbjct: 322  LDQD 325



 Score = 34.0 bits (78), Expect = 1.0
 Identities = 8/20 (40%), Positives = 13/20 (65%)

Query: 871 VKFIKSNGYAGIMAWTIDLD 890
            K++K  G  G+M W++D D
Sbjct: 306 AKYVKDKGLGGVMIWSLDQD 325


>gnl|CDD|119352 cd02873, GH18_IDGF, The IDGF's (imaginal disc growth factors) are a
            family of growth factors identified in insects that
            include at least five members, some of which are encoded
            by genes in a tight cluster. The IDGF's have an
            eight-stranded alpha/beta barrel fold and are related to
            the glycosyl hydrolase family 18 (GH18) chitinases, but
            they have an amino acid substitution known to abolish
            chitinase catalytic activity. IDGFs may have evolved from
            chitinases to gain new functions as growth factors,
            interacting with cell surface glycoproteins involved in
            growth-promoting processes.
          Length = 413

 Score =  243 bits (622), Expect = 2e-70
 Identities = 123/416 (29%), Positives = 204/416 (49%), Gaps = 61/416 (14%)

Query: 1712 KIVCYFTNWAWYRQSGGKYLPSDIDSDL--CTHVIYGFAVLDTDQLVIKPHDTWADLDNK 1769
            K+VCY+ + ++ R+   K    D++  L  CTH++YG+A +D D   IK  +   DLD  
Sbjct: 1    KLVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIKSLNEDLDLDKS 60

Query: 1770 FYEKVTALKKK--GVKVTLAIGGWNDS----AGNKYSRLVNSQQARSKFIAHVVNFILEH 1823
             Y  +T+LK+K   +KV L++GG  D+       KY  L+ S ++R+ FI    + +  +
Sbjct: 61   HYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKTY 120

Query: 1824 NFDGLDLDWEYPK------------CWQ-----------VDCKQGPASDKQGFADLIKEL 1860
             FDGLDL W++PK             W            VD K   A  K+ F  L++EL
Sbjct: 121  GFDGLDLAWQFPKNKPKKVRGTFGSAWHSFKKLFTGDSVVDEKA--AEHKEQFTALVREL 178

Query: 1861 RAAFNPHDLLLSAAVSP---SKAVIDNAYDIPVMSENLDWISVMTYDYHG-QWDKKTGH- 1915
            + A  P  LLL+  V P   S       +D+P ++ N+D++++ T+D+   + + +    
Sbjct: 179  KNALRPDGLLLTLTVLPHVNSTWY----FDVPAIANNVDFVNLATFDFLTPERNPEEADY 234

Query: 1916 VAPMYALPNDTTPTFNANYSLHYWVSHGADRKKVIFGMPMYGQSFTL-ADKNKNGL--NS 1972
             AP+Y    +  P  N +Y + YW++ G    K+  G+  YG+++ L  D    G+    
Sbjct: 235  TAPIYE-LYERNPHHNVDYQVKYWLNQGTPASKLNLGIATYGRAWKLTKDSGITGVPPVL 293

Query: 1973 QTYGGAEAGENTRARGFLAYYEICDKIQKDGWVV--------VRDRKRRIGPYAF----- 2019
            +T G   AG  T+  G L++ EIC K+     +         V D  +R G YA+     
Sbjct: 294  ETDGPGPAGPQTKTPGLLSWPEICSKLPNPANLKGADAPLRKVGDPTKRFGSYAYRPADE 353

Query: 2020 KGDQ--WVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDFKNFCGCESYPLLKTI 2073
             G+   WV ++D     +KA + K   LGG  ++ L LDDF+  C  + +P+L++ 
Sbjct: 354  NGEHGIWVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDFRGQCTGDKFPILRSA 409



 Score =  174 bits (442), Expect = 1e-46
 Identities = 118/423 (27%), Positives = 186/423 (43%), Gaps = 70/423 (16%)

Query: 14  QVVCYVEAKSAYRHRPATFNVKNVIP--QTCTHVIYAYAAIDPVSRALIPEDLEYDVIKG 71
           ++VCY ++KS  R   A  +++++ P  Q CTH++Y YA ID  +  +   + + D+ K 
Sbjct: 1   KLVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIKSLNEDLDLDKS 60

Query: 72  GYKSFLGLKEANPELKVYLAVKSN-----------FVSITSDRESRLNFISSVLEMFDMY 120
            Y++   LK   P LKV L+V  +           ++ +    ESR  FI+S   +   Y
Sbjct: 61  HYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKTY 120

Query: 121 KFDGLDLNVKDPALNDE--DDDDLESI------------------ANERSDFSTFIQELS 160
            FDGLDL  + P  N          S                   A  +  F+  ++EL 
Sbjct: 121 GFDGLDLAWQFPK-NKPKKVRGTFGSAWHSFKKLFTGDSVVDEKAAEHKEQFTALVRELK 179

Query: 161 STLRRNNYQLTLTS-PGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTK 219
           + LR +   LTLT  P V    T   D+  +A  VD + L +F  D +  E  P + +  
Sbjct: 180 NALRPDGLLLTLTVLPHV--NSTWYFDVPAIANNVDFVNLATF--DFLTPERNPEEADYT 235

Query: 220 VNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSE---YGLGATV 276
             I         +N++  V  W+ +G    ++ +GI  +G++++L   S           
Sbjct: 236 APIYELYERNPHHNVDYQVKYWLNQGTPASKLNLGIATYGRAWKLTKDSGITGVPPVLET 295

Query: 277 KGPGTEGKYTQMPGYLAFFEVCNKFK----------------DKTWRHFT------DSNG 314
            GPG  G  T+ PG L++ E+C+K                  D T R  +      D NG
Sbjct: 296 DGPGPAGPQTKTPGLLSWPEICSKLPNPANLKGADAPLRKVGDPTKRFGSYAYRPADENG 355

Query: 315 EPFMVKKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDDFRGFC-GQKYPLLSAVV 373
           E  +     W++YE+ D+   K  Y K + LGG  L+ L LDDFRG C G K+P+L +  
Sbjct: 356 EHGI-----WVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDFRGQCTGDKFPILRSAK 410

Query: 374 SNL 376
             L
Sbjct: 411 YRL 413



 Score =  155 bits (393), Expect = 3e-40
 Identities = 90/330 (27%), Positives = 157/330 (47%), Gaps = 49/330 (14%)

Query: 384 KVVCYVTNWSGSRKSDGKFVPENIDYKL--CTHIVYAFASLDPNTLSIQAGNPEADIDDN 441
           K+VCY  + S  R+   K   E+++  L  CTH+VY +A +D +T  I++ N + D+D +
Sbjct: 1   KLVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIKSLNEDLDLDKS 60

Query: 442 FYQRISSSPLVTHGKVKILIAIGGWTDSSGE----KYSQLISSGSNRKKFIKSVLTFLRR 497
            Y+ I+S     +  +K+L+++GG  D+  E    KY  L+ S  +R  FI S  + L+ 
Sbjct: 61  HYRAITSLKR-KYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKT 119

Query: 498 FDFAGLHFDWNYPV------------CWQA-------DCSKQHKAD--KGNFVKLIQELK 536
           + F GL   W +P              W +       D     KA   K  F  L++ELK
Sbjct: 120 YGFDGLDLAWQFPKNKPKKVRGTFGSAWHSFKKLFTGDSVVDEKAAEHKEQFTALVRELK 179

Query: 537 AEFDKHDYSIAVGISGYKEIL-----EVAYDFPALNDHVEFMSLMSYDYHGAWEG--ITG 589
                    + + +      L        +D PA+ ++V+F++L ++D+           
Sbjct: 180 NALRPDGLLLTLTV------LPHVNSTWYFDVPAIANNVDFVNLATFDFLTPERNPEEAD 233

Query: 590 LVSPLNSRPGEPYPNYNINTALKLIDELGGDKRKIVIGVPFYGQSYTLKLDKEHGL---- 645
             +P+        P++N++  +K     G    K+ +G+  YG+++  KL K+ G+    
Sbjct: 234 YTAPIYELYER-NPHHNVDYQVKYWLNQGTPASKLNLGIATYGRAW--KLTKDSGITGVP 290

Query: 646 -GAETDGPGLAGEYTQQPGMLAYYEICYRI 674
              ETDGPG AG  T+ PG+L++ EIC ++
Sbjct: 291 PVLETDGPGPAGPQTKTPGLLSWPEICSKL 320



 Score =  123 bits (311), Expect = 2e-29
 Identities = 79/285 (27%), Positives = 130/285 (45%), Gaps = 53/285 (18%)

Query: 1341 KIVCYFTNWAWYRPGKGKYVLKKVPWINILKLYVTLQTGL-----GTDQEKENTFLKISE 1395
            K+VCY+ + ++ R G  K  L        L+  +   T L     G D +      KI  
Sbjct: 1    KLVCYYDSKSYLREGLAKMSL------EDLEPALQFCTHLVYGYAGIDADT----YKIKS 50

Query: 1396 LTY---VHILFYERVVTLKKK--GVKVSLAIGGWNDSLGG----KYSRLVNSATARQRFI 1446
            L     +    Y  + +LK+K   +KV L++GG  D+       KY  L+ S+ +R  FI
Sbjct: 51   LNEDLDLDKSHYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFI 110

Query: 1447 EHVVKFLLKYQFDGLDLDWEYPT------------CWQ-----VNCDAGPDSD----KES 1485
                  L  Y FDGLDL W++P              W         D+  D      KE 
Sbjct: 111  NSAHSLLKTYGFDGLDLAWQFPKNKPKKVRGTFGSAWHSFKKLFTGDSVVDEKAAEHKEQ 170

Query: 1486 FGLFVRELHQAFKPHGLLLSAAVSPSKQVINAA--YDVKALSESLDWISVMTYDYHG-QW 1542
            F   VREL  A +P GLLL+  V P    +N+   +DV A++ ++D++++ T+D+   + 
Sbjct: 171  FTALVRELKNALRPDGLLLTLTVLPH---VNSTWYFDVPAIANNVDFVNLATFDFLTPER 227

Query: 1543 DKKTGH-VAPLYEHPDDDFFYFNANFTMNYWMKKGAPSRKLVMGV 1586
            + +     AP+Y    +   + N ++ + YW+ +G P+ KL +G+
Sbjct: 228  NPEEADYTAPIY-ELYERNPHHNVDYQVKYWLNQGTPASKLNLGI 271



 Score =  112 bits (282), Expect = 8e-26
 Identities = 50/133 (37%), Positives = 79/133 (59%), Gaps = 8/133 (6%)

Query: 2191 KVICYYTSWAYLRQAEGKYSPEDIDGSL--CTHLVYAFADLDEQELVIRSHNNKLDLEQG 2248
            K++CYY S +YLR+   K S ED++ +L  CTHLVY +A +D     I+S N  LDL++ 
Sbjct: 1    KLVCYYDSKSYLREGLAKMSLEDLEPALQFCTHLVYGYAGIDADTYKIKSLNEDLDLDKS 60

Query: 2249 FYEKITKFNER--GIQVILAIGGWSDS----RSDKYSRLVNDIPARRNFVRHVTEFLEFY 2302
             Y  IT    +   ++V+L++GG  D+     ++KY  L+    +R  F+      L+ Y
Sbjct: 61   HYRAITSLKRKYPHLKVLLSVGGDRDTDEEGENEKYLLLLESSESRNAFINSAHSLLKTY 120

Query: 2303 GFNGLEFAWEYPK 2315
            GF+GL+ AW++PK
Sbjct: 121  GFDGLDLAWQFPK 133



 Score = 81.2 bits (201), Expect = 1e-15
 Identities = 45/135 (33%), Positives = 66/135 (48%), Gaps = 25/135 (18%)

Query: 695 TLKLDKEHGL-----GAETDGPGLAGEYTQQPGMLAYYEICYRIKT-----------RKW 738
             KL K+ G+       ETDGPG AG  T+ PG+L++ EIC ++             RK 
Sbjct: 277 AWKLTKDSGITGVPPVLETDGPGPAGPQTKTPGLLSWPEICSKLPNPANLKGADAPLRK- 335

Query: 739 TVQRNTKSLEPFTF-----QGDQ--WVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLDDF 791
            V   TK    + +      G+   WVSYEDP +   K  + K+ G  G+  + + LDDF
Sbjct: 336 -VGDPTKRFGSYAYRPADENGEHGIWVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDF 394

Query: 792 HNKCCMESFPLLRAV 806
             +C  + FP+LR+ 
Sbjct: 395 RGQCTGDKFPILRSA 409



 Score = 50.0 bits (120), Expect = 1e-05
 Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 7/70 (10%)

Query: 2339 GPYAY-----NGK--LWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLDDFQNLCSCGPY 2391
            G YAY     NG+  +W+SY+D      K    K  GLGG  ++ + LDDF+  C+   +
Sbjct: 344  GSYAYRPADENGEHGIWVSYEDPDTAANKAGYAKAKGLGGVALFDLSLDDFRGQCTGDKF 403

Query: 2392 PLLSAVNYEL 2401
            P+L +  Y L
Sbjct: 404  PILRSAKYRL 413


>gnl|CDD|119365 cd06548, GH18_chitinase, The GH18 (glycosyl hydrolases, family 18)
            type II chitinases hydrolyze chitin, an abundant polymer
            of N-acetylglucosamine and have been identified in
            bacteria, fungi, insects, plants, viruses, and protozoan
            parasites.  The structure of this domain is an
            eight-stranded alpha/beta barrel with a pronounced
            active-site cleft at the C-terminal end of the
            beta-barrel.
          Length = 322

 Score =  230 bits (588), Expect = 4e-67
 Identities = 108/370 (29%), Positives = 167/370 (45%), Gaps = 74/370 (20%)

Query: 1713 IVCYFTNWAWYRQSGGKYLPS-DIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFY 1771
            +V YFTNW  Y   G  Y  + DI +D  TH+ Y FA +D D  V+   D  AD   +  
Sbjct: 1    VVGYFTNWGIY---GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSV 57

Query: 1772 EKVT---------------ALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIA 1814
            +                   LK+K   +K+ L+IGGW  S G  +S    ++ +R+KF  
Sbjct: 58   DGGADTDDQPLKGNFGQLRKLKQKNPHLKILLSIGGWTWSGG--FSDAAATEASRAKFAD 115

Query: 1815 HVVNFILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFN------PHD 1868
              V+FI ++ FDG+D+DWEYP             DK+ F  L+KELR A +         
Sbjct: 116  SAVDFIRKYGFDGIDIDWEYPGSGGAPGNVARPEDKENFTLLLKELREALDALGAETGRK 175

Query: 1869 LLLSAAVSPSKAVIDNAYDIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTP 1928
             LL+ A       +D   ++  +++ LD+I++MTYD+HG W   TGH + +YA P D   
Sbjct: 176  YLLTIAAPAGPDKLDKL-EVAEIAKYLDFINLMTYDFHGAWSNTTGHHSNLYASPADPPG 234

Query: 1929 TFNANYSLHYWVSHGADRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARG 1988
             ++ + +++Y++S G   +K++ G+P YG                             RG
Sbjct: 235  GYSVDAAVNYYLSAGVPPEKLVLGVPFYG-----------------------------RG 265

Query: 1989 FLAYYEICDKIQKDGWVVVRDRKRRIGPYAFKGD--QWVGFDDQAMIHHKAEFVKYNDLG 2046
            +             G+    D   +  PY +      ++ +DD   I  KA++VK   LG
Sbjct: 266  W------------TGYTRYWDEVAK-APYLYNPSTKTFISYDDPRSIKAKADYVKDKGLG 312

Query: 2047 GAMIWALDLD 2056
            G M W L  D
Sbjct: 313  GVMFWELSGD 322



 Score =  176 bits (448), Expect = 2e-48
 Identities = 82/274 (29%), Positives = 128/274 (46%), Gaps = 37/274 (13%)

Query: 385 VVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDD---- 440
           VV Y TNW         FV ++I     THI YAFA +D +   + + +  AD       
Sbjct: 1   VVGYFTNWGIY--GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSVD 58

Query: 441 ------------NFYQRISSSPLVT----HGKVKILIAIGGWTDSSGEKYSQLISSGSNR 484
                       NF Q      L      +  +KIL++IGGWT S    +S   ++ ++R
Sbjct: 59  GGADTDDQPLKGNFGQ------LRKLKQKNPHLKILLSIGGWTWS--GGFSDAAATEASR 110

Query: 485 KKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDKH-- 542
            KF  S + F+R++ F G+  DW YP    A  +     DK NF  L++EL+   D    
Sbjct: 111 AKFADSAVDFIRKYGFDGIDIDWEYPGSGGAPGNVARPEDKENFTLLLKELREALDALGA 170

Query: 543 ----DYSIAVGISGYKEILEVAYDFPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRP 598
                Y + +      + L+   +   +  +++F++LM+YD+HGAW   TG  S L + P
Sbjct: 171 ETGRKYLLTIAAPAGPDKLDKL-EVAEIAKYLDFINLMTYDFHGAWSNTTGHHSNLYASP 229

Query: 599 GEPYPNYNINTALKLIDELGGDKRKIVIGVPFYG 632
            +P   Y+++ A+      G    K+V+GVPFYG
Sbjct: 230 ADPPGGYSVDAAVNYYLSAGVPPEKLVLGVPFYG 263



 Score =  170 bits (434), Expect = 1e-46
 Identities = 78/265 (29%), Positives = 128/265 (48%), Gaps = 26/265 (9%)

Query: 1342 IVCYFTNWAWYRPGKGKYVLKKVPWINILKLY------------VTLQTGLGTDQEKENT 1389
            +V YFTNW  Y  G+  +V   +P   +  +             VT       +  +   
Sbjct: 1    VVGYFTNWGIY--GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSVD 58

Query: 1390 FLKISELTYVHILFYERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIE 1447
                ++   +   F + +  LK+K   +K+ L+IGGW  +  G +S    +  +R +F +
Sbjct: 59   GGADTDDQPLKGNFGQ-LRKLKQKNPHLKILLSIGGW--TWSGGFSDAAATEASRAKFAD 115

Query: 1448 HVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGL----- 1502
              V F+ KY FDG+D+DWEYP       +     DKE+F L ++EL +A    G      
Sbjct: 116  SAVDFIRKYGFDGIDIDWEYPGSGGAPGNVARPEDKENFTLLLKELREALDALGAETGRK 175

Query: 1503 -LLSAAVSPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFF 1561
             LL+ A       ++   +V  +++ LD+I++MTYD+HG W   TGH + LY  P D   
Sbjct: 176  YLLTIAAPAGPDKLDKL-EVAEIAKYLDFINLMTYDFHGAWSNTTGHHSNLYASPADPPG 234

Query: 1562 YFNANFTMNYWMKKGAPSRKLVMGV 1586
             ++ +  +NY++  G P  KLV+GV
Sbjct: 235  GYSVDAAVNYYLSAGVPPEKLVLGV 259



 Score =  115 bits (289), Expect = 2e-27
 Identities = 78/379 (20%), Positives = 140/379 (36%), Gaps = 94/379 (24%)

Query: 15  VVCYVEAKSAYRHRPATFNVKNVIPQT-CTHVIYAYAAIDP-----------VSRALIPE 62
           VV Y      Y      + V + IP    TH+ YA+A ID               A    
Sbjct: 1   VVGYFTNWGIY---GRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSV 57

Query: 63  DLEYDVIKGGYKSFLG----LKEANPELKVYLAV----KS-NFVSITSDRESRLNFISSV 113
           D   D      K   G    LK+ NP LK+ L++     S  F    +   SR  F  S 
Sbjct: 58  DGGADTDDQPLKGNFGQLRKLKQKNPHLKILLSIGGWTWSGGFSDAAATEASRAKFADSA 117

Query: 114 LEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFSTFIQELSSTL------RRNN 167
           ++    Y FDG+D++ + P           +   ++ +F+  ++EL   L          
Sbjct: 118 VDFIRKYGFDGIDIDWEYP--GSGGAPGNVARPEDKENFTLLLKELREALDALGAETGRK 175

Query: 168 YQLTLTSPGVIDRKTSLVDISVVAPLVDLILLKSF--------NNDHMDDEVVPVKPNTK 219
           Y LT+ +P     K   ++++ +A  +D I L ++           H       +  +  
Sbjct: 176 YLLTIAAPAG-PDKLDKLEVAEIAKYLDFINLMTYDFHGAWSNTTGHHS----NLYASPA 230

Query: 220 VNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGP 279
                        ++++AV  ++  G  PE++++G+PF+G+ +  +              
Sbjct: 231 DP-------PGGYSVDAAVNYYLSAGVPPEKLVLGVPFYGRGWTGYT------------- 270

Query: 280 GTEGKYTQMPGYLAFFEVCNKFKDKTWRHFTDSNGEPFMVKKD--EWITYENNDSIRRKM 337
                                      R++ +    P++       +I+Y++  SI+ K 
Sbjct: 271 ---------------------------RYWDEVAKAPYLYNPSTKTFISYDDPRSIKAKA 303

Query: 338 NYIKDRHLGGAMLWTLDLD 356
           +Y+KD+ LGG M W L  D
Sbjct: 304 DYVKDKGLGGVMFWELSGD 322



 Score = 75.7 bits (187), Expect = 4e-14
 Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 23/141 (16%)

Query: 2192 VICYYTSWAYLRQAEGKYSPE-DIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFY 2250
            V+ Y+T+W         Y    DI     TH+ YAFAD+D    V+ S +   D      
Sbjct: 1    VVGYFTNW---GIYGRNYFVTDDIPADKLTHINYAFADIDGDGGVVTSDDEAADEAAQSV 57

Query: 2251 EKITKFNERGIQ-----------------VILAIGGWSDSRSDKYSRLVNDIPARRNFVR 2293
            +     +++ ++                 ++L+IGGW+   S  +S       +R  F  
Sbjct: 58   DGGADTDDQPLKGNFGQLRKLKQKNPHLKILLSIGGWT--WSGGFSDAAATEASRAKFAD 115

Query: 2294 HVTEFLEFYGFNGLEFAWEYP 2314
               +F+  YGF+G++  WEYP
Sbjct: 116  SAVDFIRKYGFDGIDIDWEYP 136



 Score = 61.5 bits (150), Expect = 2e-09
 Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 3/55 (5%)

Query: 2328 WTVVRDSLGARGPYAYNG--KLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLD 2380
            +T   D + A+ PY YN   K ++SYDD R I+ K   +K  GLGG M W +  D
Sbjct: 269  YTRYWDEV-AKAPYLYNPSTKTFISYDDPRSIKAKADYVKDKGLGGVMFWELSGD 322



 Score = 57.3 bits (139), Expect = 4e-08
 Identities = 14/56 (25%), Positives = 27/56 (48%), Gaps = 2/56 (3%)

Query: 736 RKWTVQRNTKSLEPFTFQGD--QWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLD 789
             +T   +  +  P+ +      ++SY+DP S+  K  ++K  G  G+M W +  D
Sbjct: 267 TGYTRYWDEVAKAPYLYNPSTKTFISYDDPRSIKAKADYVKDKGLGGVMFWELSGD 322



 Score = 31.4 bits (72), Expect = 5.3
 Identities = 6/20 (30%), Positives = 10/20 (50%)

Query: 871 VKFIKSNGYAGIMAWTIDLD 890
             ++K  G  G+M W +  D
Sbjct: 303 ADYVKDKGLGGVMFWELSGD 322


>gnl|CDD|225862 COG3325, ChiA, Chitinase [Carbohydrate transport and metabolism].
          Length = 441

 Score =  185 bits (472), Expect = 3e-50
 Identities = 106/397 (26%), Positives = 169/397 (42%), Gaps = 58/397 (14%)

Query: 1709 DEFKIVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFA------------VLDTDQLV 1756
            D+FK+V Y+T+W+ Y      Y P DI  D  TH+ Y F             V D   L 
Sbjct: 36   DQFKVVGYYTSWSQY--DRQDYFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAALY 93

Query: 1757 IKP------HDTWADLDNKFYEKVTALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQA 1808
              P       D W+D     +  +  LK     +K  ++IGGW+DS G  +S +     +
Sbjct: 94   GVPNIEGVELDPWSDPLKGHFGALFDLKATYPDLKTLISIGGWSDSGG--FSDMAADDAS 151

Query: 1809 RSKFIAHVVNFILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAF---- 1864
            R  F    V F+  + FDG+D+DWEYP         G   DK  +  L++ELR       
Sbjct: 152  RENFAKSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKDKANYVLLLQELRKKLDKAG 211

Query: 1865 --NPHDLLLSAAVSPSKAVIDNAYDIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYAL 1922
              +     L+ A   SK  ++   +   +++ +D+I++MTYD+HG W++  GH A +Y  
Sbjct: 212  VEDGRHYQLTIAAPASKDKLEGL-NHAEIAQYVDYINIMTYDFHGAWNETLGHHAALYGT 270

Query: 1923 PNDTTPT-----FNANYSLHYWVSHGAD----RKKVIFGMPMYGQSFTLADKNKNGLNSQ 1973
            P D          +A      W+  G        K++ GMP YG+ +   D    G    
Sbjct: 271  PKDPPLANGGFYVDAEVDGIDWLEEGFAGDVPPSKLVLGMPFYGRGWNGVD---GGSLGT 327

Query: 1974 TYGGAEAGENTRAR-----------GFLAYYEICD-KIQKDGWVVVRDRKRRIGPYAF-- 2019
              G  +  +N+               +   Y++      K+G+    D   +  PY +  
Sbjct: 328  CPGLYQGLDNSGIPKGTWEAGNGDKDYGKAYDLDANNAGKNGYERYWDDVAKA-PYLYNP 386

Query: 2020 KGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLD 2056
            +   ++ +DD   +  KAE+V  N+LGG M W +  D
Sbjct: 387  EKGVFISYDDPRSVKAKAEYVADNNLGGMMFWEISGD 423



 Score =  162 bits (413), Expect = 1e-42
 Identities = 100/444 (22%), Positives = 174/444 (39%), Gaps = 91/444 (20%)

Query: 381 PEPKVVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEAD--- 437
            + KVV Y T+WS        + P +I     THI YAF  ++ +  SI++   +     
Sbjct: 36  DQFKVVGYYTSWSQY--DRQDYFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAALY 93

Query: 438 IDDNFYQRI---SSSPLV-----------THGKVKILIAIGGWTDSSGEKYSQLISSGSN 483
              N         S PL            T+  +K LI+IGGW+DS G  +S + +  ++
Sbjct: 94  GVPNIEGVELDPWSDPLKGHFGALFDLKATYPDLKTLISIGGWSDSGG--FSDMAADDAS 151

Query: 484 RKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDK-- 541
           R+ F KS + F+R + F G+  DW YP       +     DK N+V L+QEL+ + DK  
Sbjct: 152 RENFAKSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKDKANYVLLLQELRKKLDKAG 211

Query: 542 ----HDYSIAVGISGYKEILEVAYDFPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSR 597
                 Y + +     K+ LE   +   +  +V+++++M+YD+HGAW    G  + L   
Sbjct: 212 VEDGRHYQLTIAAPASKDKLE-GLNHAEIAQYVDYINIMTYDFHGAWNETLGHHAALYGT 270

Query: 598 PGEPYPN---------YNINTALKLIDELGGDKRKIVIGVPFYGQSYTLKLDKEHGLGAE 648
           P +P             +    L+          K+V+G+PFYG+ +       +G+   
Sbjct: 271 PKDPPLANGGFYVDAEVDGIDWLEEGFAGDVPPSKLVLGMPFYGRGW-------NGVDGG 323

Query: 649 TDGPGLAGEYTQQPGMLAYYEICYRIKTRKWTVQRNTKSLEPRSMGTLKLDKEHGLGAET 708
           + G    G Y                             +   +      DK++G     
Sbjct: 324 SLGTCP-GLYQGLDNS----------------------GIPKGTWEAGNGDKDYG----- 355

Query: 709 DGPGLAGEYTQQPGMLAYYEICY-RIKTRKWTVQRNTKSLEPFTF--QGDQWVSYEDPTS 765
                             Y++         +    +  +  P+ +  +   ++SY+DP S
Sbjct: 356 ----------------KAYDLDANNAGKNGYERYWDDVAKAPYLYNPEKGVFISYDDPRS 399

Query: 766 LTEKVKFIKSNGYAGIMAWTIDLD 789
           +  K +++  N   G+M W I  D
Sbjct: 400 VKAKAEYVADNNLGGMMFWEISGD 423



 Score =  137 bits (346), Expect = 7e-34
 Identities = 77/283 (27%), Positives = 126/283 (44%), Gaps = 40/283 (14%)

Query: 1337 DQYFKIVCYFTNWAWYRPGKGKYVLKKVPWINILKLY----------------VTLQTGL 1380
            D  FK+V Y+T+W+ Y   +  Y    +P   +  +                 V  +  L
Sbjct: 35   DDQFKVVGYYTSWSQY--DRQDYFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAAL 92

Query: 1381 GTDQEKENTFLKISELTYVHILFYERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNS 1438
                  E   L       +   F   +  LK     +K  ++IGGW+DS  G +S +   
Sbjct: 93   YGVPNIEGVEL-DPWSDPLKGHFGA-LFDLKATYPDLKTLISIGGWSDS--GGFSDMAAD 148

Query: 1439 ATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFK 1498
              +R+ F +  V+F+  Y FDG+D+DWEYP       + G   DK ++ L ++EL +   
Sbjct: 149  DASRENFAKSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKDKANYVLLLQELRKKLD 208

Query: 1499 PHGL------LLSAAVSPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPL 1552
              G+       L+ A   SK       +   +++ +D+I++MTYD+HG W++  GH A L
Sbjct: 209  KAGVEDGRHYQLTIAAPASKD-KLEGLNHAEIAQYVDYINIMTYDFHGAWNETLGHHAAL 267

Query: 1553 YEHPDD-----DFFYFNANFTMNYWMKKG----APSRKLVMGV 1586
            Y  P D       FY +A      W+++G     P  KLV+G+
Sbjct: 268  YGTPKDPPLANGGFYVDAEVDGIDWLEEGFAGDVPPSKLVLGM 310



 Score = 96.0 bits (239), Expect = 3e-20
 Identities = 84/419 (20%), Positives = 160/419 (38%), Gaps = 88/419 (21%)

Query: 3   NTAYFSQSTKHQVVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAI---------- 52
           N    +   + +VV Y  + S Y      +   ++     TH+ YA+  I          
Sbjct: 28  NNKAHTSDDQFKVVGYYTSWSQYD--RQDYFPGDIPLDQLTHINYAFLDINSDGKSIESW 85

Query: 53  -----------DPVSRALIPEDLEYDVIKGGYKSFLGLKEANPELKVYLAV----KSN-F 96
                      +     L P     D +KG + +   LK   P+LK  +++     S  F
Sbjct: 86  VADEAALYGVPNIEGVELDPWS---DPLKGHFGALFDLKATYPDLKTLISIGGWSDSGGF 142

Query: 97  VSITSDRESRLNFISSVLEMFDMYKFDGLDL------NVKDPALNDEDDDDLESIANERS 150
             + +D  SR NF  S +E    Y FDG+D+      +  D        D        ++
Sbjct: 143 SDMAADDASRENFAKSAVEFMRTYGFDGVDIDWEYPGSGGDAGNCGRPKD--------KA 194

Query: 151 DFSTFIQELSSTLRR------NNYQLTLTSPGVIDRKTSLVDISVVAPLVDLILLKSFN- 203
           ++   +QEL   L +       +YQLT+ +P   D K   ++ + +A  VD I + +++ 
Sbjct: 195 NYVLLLQELRKKLDKAGVEDGRHYQLTIAAPASKD-KLEGLNHAEIAQYVDYINIMTYDF 253

Query: 204 ----NDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAV------YNWIKKGAR----PE 249
               N+        +  +    +  T       N    V       +W+++G      P 
Sbjct: 254 HGAWNET-------LGHHA--ALYGTPKDPPLANGGFYVDAEVDGIDWLEEGFAGDVPPS 304

Query: 250 QIIIGIPFFGKSYRLFNRSEY--------GLGATVKGPGTEGKYTQMPGYLAFFEV-CNK 300
           ++++G+PF+G+ +   +            GL  +    GT         Y   +++  N 
Sbjct: 305 KLVLGMPFYGRGWNGVDGGSLGTCPGLYQGLDNSGIPKGTWEAGNGDKDYGKAYDLDANN 364

Query: 301 FKDKTWRHFTDSNGE-PFMV--KKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLD 356
                +  + D   + P++   +K  +I+Y++  S++ K  Y+ D +LGG M W +  D
Sbjct: 365 AGKNGYERYWDDVAKAPYLYNPEKGVFISYDDPRSVKAKAEYVADNNLGGMMFWEISGD 423



 Score = 73.6 bits (181), Expect = 5e-13
 Identities = 44/148 (29%), Positives = 67/148 (45%), Gaps = 26/148 (17%)

Query: 2188 EPYKVICYYTSWA-YLRQAEGKYSPEDIDGSLCTHLVYAF------------------AD 2228
            + +KV+ YYTSW+ Y RQ    Y P DI     TH+ YAF                  A 
Sbjct: 36   DQFKVVGYYTSWSQYDRQD---YFPGDIPLDQLTHINYAFLDINSDGKSIESWVADEAAL 92

Query: 2229 LDEQELVIRSHNNKLDLEQGFYEKITKFNER--GIQVILAIGGWSDSRSDKYSRLVNDIP 2286
                 +     +   D  +G +  +         ++ +++IGGWSDS    +S +  D  
Sbjct: 93   YGVPNIEGVELDPWSDPLKGHFGALFDLKATYPDLKTLISIGGWSDS--GGFSDMAADDA 150

Query: 2287 ARRNFVRHVTEFLEFYGFNGLEFAWEYP 2314
            +R NF +   EF+  YGF+G++  WEYP
Sbjct: 151  SRENFAKSAVEFMRTYGFDGVDIDWEYP 178



 Score = 44.0 bits (104), Expect = 9e-04
 Identities = 19/55 (34%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 2328 WTVVRDSLGARGPYAYN--GKLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLD 2380
            +    D + A+ PY YN    +++SYDD R ++ K + +  + LGG M W I  D
Sbjct: 370  YERYWDDV-AKAPYLYNPEKGVFISYDDPRSVKAKAEYVADNNLGGMMFWEISGD 423


>gnl|CDD|119349 cd00598, GH18_chitinase-like, The GH18 (glycosyl hydrolase, family
            18) type II chitinases hydrolyze chitin, an abundant
            polymer of beta-1,4-linked N-acetylglucosamine (GlcNAc)
            which is a major component of the cell wall of fungi and
            the exoskeleton of arthropods.  Chitinases have been
            identified in viruses, bacteria, fungi, protozoan
            parasites, insects, and plants. The structure of the GH18
            domain is an eight-stranded beta/alpha barrel with a
            pronounced active-site cleft at the C-terminal end of the
            beta-barrel.  The GH18 family includes chitotriosidase,
            chitobiase, hevamine, zymocin-alpha, narbonin, SI-CLP
            (stabilin-1 interacting chitinase-like protein), IDGF
            (imaginal disc growth factor), CFLE (cortical
            fragment-lytic enzyme) spore hydrolase, the type III and
            type V plant chitinases, the
            endo-beta-N-acetylglucosaminidases, and the chitolectins.
             The GH85 (glycosyl hydrolase, family 85) ENGases
            (endo-beta-N-acetylglucosaminidases) are closely related
            to the GH18 chitinases and are included in this alignment
            model.
          Length = 210

 Score =  164 bits (417), Expect = 8e-46
 Identities = 66/192 (34%), Positives = 106/192 (55%), Gaps = 15/192 (7%)

Query: 1713 IVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFYE 1772
            ++CY+  W+  R       P+DI   LCTH+IY FA + +D  +    D   +      E
Sbjct: 1    VICYYDGWSSGR----GPDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDKSEEPLKGALE 56

Query: 1773 KVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLDW 1832
            ++ + KK G+KV ++IGGW DS       L +   +R+ F   +V+F+  + FDG+D+DW
Sbjct: 57   ELAS-KKPGLKVLISIGGWTDS---SPFTLASDPASRAAFANSLVSFLKTYGFDGVDIDW 112

Query: 1833 EYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDNAYDIPVMS 1892
            EYP            SD++ F  L++ELR+A    + LL+ AV  S   +  AYD+P + 
Sbjct: 113  EYPG-------AADNSDRENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYDVPAIG 165

Query: 1893 ENLDWISVMTYD 1904
            + +D+++VMTYD
Sbjct: 166  DYVDFVNVMTYD 177



 Score =  144 bits (364), Expect = 1e-38
 Identities = 64/197 (32%), Positives = 92/197 (46%), Gaps = 21/197 (10%)

Query: 1342 IVCYFTNWAWYRPGKGKYV-LKKVPWINILKLYVTLQTGLGTDQEKENTFLKISELTYVH 1400
            ++CY+  W+  R      + L     I      ++    L       N F   SE     
Sbjct: 1    VICYYDGWSSGRGPDPTDIPLSLCTHIIYAFAEISSDGSL-------NLFGDKSEEPLKG 53

Query: 1401 ILFYERVVTLKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDG 1460
             L     +  KK G+KV ++IGGW DS       L +   +R  F   +V FL  Y FDG
Sbjct: 54   AL---EELASKKPGLKVLISIGGWTDS---SPFTLASDPASRAAFANSLVSFLKTYGFDG 107

Query: 1461 LDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVINAAYD 1520
            +D+DWEYP        A  +SD+E+F   +REL  A      LL+ AV  S   +  AYD
Sbjct: 108  VDIDWEYP-------GAADNSDRENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYD 160

Query: 1521 VKALSESLDWISVMTYD 1537
            V A+ + +D+++VMTYD
Sbjct: 161  VPAIGDYVDFVNVMTYD 177



 Score =  139 bits (352), Expect = 5e-37
 Identities = 62/195 (31%), Positives = 101/195 (51%), Gaps = 18/195 (9%)

Query: 385 VVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFYQ 444
           V+CY   WS    S     P +I   LCTHI+YAFA +  +      G+   +      +
Sbjct: 1   VICYYDGWS----SGRGPDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDKSEEPLKGALE 56

Query: 445 RISSSPLVTHGKVKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLH 504
            ++S        +K+LI+IGGWTDSS      L S  ++R  F  S+++FL+ + F G+ 
Sbjct: 57  ELASK----KPGLKVLISIGGWTDSS---PFTLASDPASRAAFANSLVSFLKTYGFDGVD 109

Query: 505 FDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDKHDYSIAVGISGYKEILEVAYDFP 564
            DW YP            +D+ NF+ L++EL++     +Y + + +      L  AYD P
Sbjct: 110 IDWEYPG-------AADNSDRENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYDVP 162

Query: 565 ALNDHVEFMSLMSYD 579
           A+ D+V+F+++M+YD
Sbjct: 163 AIGDYVDFVNVMTYD 177



 Score =  101 bits (254), Expect = 7e-24
 Identities = 62/224 (27%), Positives = 94/224 (41%), Gaps = 49/224 (21%)

Query: 2192 VICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFYE 2251
            VICYY  W+    +     P DI  SLCTH++YAFA++     +    +     E+    
Sbjct: 1    VICYYDGWS----SGRGPDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDK---SEEPLKG 53

Query: 2252 KITKFNER--GIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEF 2309
             + +   +  G++V+++IGGW+DS       L +D  +R  F   +  FL+ YGF+G++ 
Sbjct: 54   ALEELASKKPGLKVLISIGGWTDSS---PFTLASDPASRAAFANSLVSFLKTYGFDGVDI 110

Query: 2310 AWEYPKCWQTCSTVKSHEWTV-----VRDSLGARG------PYAYNGKLWLSYDDTR--- 2355
             WEYP            E  +     +R +LGA          A    L  +YD      
Sbjct: 111  DWEYPGAADNSDR----ENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYDVPAIGD 166

Query: 2356 -------------------DIRRKCKSIKQDGLGGAMVWSIDLD 2380
                                +  K K  KQ GLGG M+W +D D
Sbjct: 167  YVDFVNVMTYDLVLGVPFYSLGAKAKYAKQKGLGGVMIWELDQD 210



 Score = 97.8 bits (244), Expect = 2e-22
 Identities = 49/193 (25%), Positives = 87/193 (45%), Gaps = 20/193 (10%)

Query: 15  VVCYVEAKSAYRHRPATFNVKNVIPQTCTHVIYAYAAIDPVSRALIPEDLEYDVIKGGYK 74
           V+CY +  S+ R      +  ++    CTH+IYA+A I       +  D   + +KG  +
Sbjct: 1   VICYYDGWSSGRG----PDPTDIPLSLCTHIIYAFAEISSDGSLNLFGDKSEEPLKGALE 56

Query: 75  SFLGLKEANPELKVYLAV----KSNFVSITSDRESRLNFISSVLEMFDMYKFDGLDLNVK 130
                    P LKV +++     S+  ++ SD  SR  F +S++     Y FDG+D++ +
Sbjct: 57  EL---ASKKPGLKVLISIGGWTDSSPFTLASDPASRAAFANSLVSFLKTYGFDGVDIDWE 113

Query: 131 DPALNDEDDDDLESIANERSDFSTFIQELSSTLRRNNYQLTLTSPGVIDRKTSLVDISVV 190
            P   D          ++R +F T ++EL S L   NY LT+  P          D+  +
Sbjct: 114 YPGAAD---------NSDRENFITLLRELRSALGAANYLLTIAVPASYFDLGYAYDVPAI 164

Query: 191 APLVDLILLKSFN 203
              VD + + +++
Sbjct: 165 GDYVDFVNVMTYD 177



 Score = 40.8 bits (96), Expect = 0.003
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 332 SIRRKMNYIKDRHLGGAMLWTLDLD 356
           S+  K  Y K + LGG M+W LD D
Sbjct: 186 SLGAKAKYAKQKGLGGVMIWELDQD 210



 Score = 38.5 bits (90), Expect = 0.024
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 764 TSLTEKVKFIKSNGYAGIMAWTIDLD 789
            SL  K K+ K  G  G+M W +D D
Sbjct: 185 YSLGAKAKYAKQKGLGGVMIWELDQD 210



 Score = 32.7 bits (75), Expect = 1.8
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 2036 KAEFVKYNDLGGAMIWALDLD 2056
            KA++ K   LGG MIW LD D
Sbjct: 190  KAKYAKQKGLGGVMIWELDQD 210



 Score = 32.3 bits (74), Expect = 2.2
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query: 871 VKFIKSNGYAGIMAWTIDLD 890
            K+ K  G  G+M W +D D
Sbjct: 191 AKYAKQKGLGGVMIWELDQD 210


>gnl|CDD|119358 cd02879, GH18_plant_chitinase_class_V, The class V plant chitinases
            have a glycosyl hydrolase family 18 (GH18) domain, but
            lack the chitin-binding domain present in other GH18
            enzymes.  The GH18 domain of the class V chitinases has
            endochitinase activity in some cases and no catalytic
            activity in others.  Included in this family is a lectin
            found in black locust (Robinia pseudoacacia) bark, which
            binds chitin but lacks chitinase activity.  Also included
            is a chitinase-related receptor-like kinase (CHRK1) from
            tobacco (Nicotiana tabacum), with an N-terminal GH18
            domain and a C-terminal kinase domain, which is thought
            to be part of a plant signaling pathway.  The GH18 domain
            of CHRK1 is expressed extracellularly where it binds
            chitin but lacks chitinase activity.
          Length = 299

 Score =  156 bits (397), Expect = 6e-42
 Identities = 99/349 (28%), Positives = 149/349 (42%), Gaps = 76/349 (21%)

Query: 1722 WYRQSGGKYLPSDIDSDLCTHVIYGFAVLD--TDQLVIKPHDTWADLDNKFYEKVTALKK 1779
            W      ++ PS+IDS L TH+ Y FA LD  T ++VI      ++    F E V   K 
Sbjct: 9    WP-AWSEEFPPSNIDSSLFTHLFYAFADLDPSTYEVVI-SPSDESEFST-FTETVKR-KN 64

Query: 1780 KGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLDWEYPKCWQ 1839
              VK  L+IGG    +   ++ + +   AR  FI   +    ++ FDGLDLDWE+P    
Sbjct: 65   PSVKTLLSIGGGGSDSSA-FAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFPS--- 120

Query: 1840 VDCKQGPASDKQGFADLIKELRAAFNPHD-------LLLSAAVSPSKAVIDNA----YDI 1888
                     + + F  L++E RAA            LLL+AAV  S  +  +     Y I
Sbjct: 121  ------SQVEMENFGKLLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSDDSVSYPI 174

Query: 1889 PVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHGADRKK 1948
              +++NLDW++VM YDY+G W+  T   A   A   D     + +Y +  W+  G   KK
Sbjct: 175  EAINKNLDWVNVMAYDYYGSWESNTTGPA---AALYDPNSNVSTDYGIKSWIKAGVPAKK 231

Query: 1949 VIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKIQKDGWVVVR 2008
            ++ G+P+YG+++TL D         T   +                              
Sbjct: 232  LVLGLPLYGRAWTLYDT--------TTVSS------------------------------ 253

Query: 2009 DRKRRIGPYAFKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDD 2057
                    Y + G  W+G+DD   I  K ++ K   L G   WA+  DD
Sbjct: 254  --------YVYAGTTWIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294



 Score =  122 bits (308), Expect = 4e-30
 Identities = 64/192 (33%), Positives = 100/192 (52%), Gaps = 26/192 (13%)

Query: 1406 RVVTLKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDW 1465
              V  K   VK  L+IGG        ++ + +  TAR+ FI   +K   KY FDGLDLDW
Sbjct: 58   ETVKRKNPSVKTLLSIGGGGSD-SSAFAAMASDPTARKAFINSSIKVARKYGFDGLDLDW 116

Query: 1466 EYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPH-------GLLLSAAVSPSKQVI--- 1515
            E+P+            + E+FG  + E   A K          LLL+AAV  S  +    
Sbjct: 117  EFPS---------SQVEMENFGKLLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSD 167

Query: 1516 -NAAYDVKALSESLDWISVMTYDYHGQWDKK-TGHVAPLYEHPDDDFFYFNANFTMNYWM 1573
             + +Y ++A++++LDW++VM YDY+G W+   TG  A LY+   +     + ++ +  W+
Sbjct: 168  DSVSYPIEAINKNLDWVNVMAYDYYGSWESNTTGPAAALYDPNSN----VSTDYGIKSWI 223

Query: 1574 KKGAPSRKLVMG 1585
            K G P++KLV+G
Sbjct: 224  KAGVPAKKLVLG 235



 Score =  108 bits (273), Expect = 1e-25
 Identities = 73/253 (28%), Positives = 122/253 (48%), Gaps = 32/253 (12%)

Query: 398 SDGKFVPENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFYQRI-SSSPLVTHGK 456
              +F P NID  L TH+ YAFA LDP+T  +     +      F + +   +P V    
Sbjct: 12  WSEEFPPSNIDSSLFTHLFYAFADLDPSTYEVVISPSDESEFSTFTETVKRKNPSV---- 67

Query: 457 VKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQAD 516
            K L++IGG    S   ++ + S  + RK FI S +   R++ F GL  DW +P      
Sbjct: 68  -KTLLSIGGGGSDS-SAFAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFP------ 119

Query: 517 CSKQHKADKGNFVKLIQELKAEFDKHDYS---------IAVGISGYKEILEVAYDFP--A 565
                + +  NF KL++E +A       S          AV  S    + + +  +P  A
Sbjct: 120 ---SSQVEMENFGKLLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSDDSVSYPIEA 176

Query: 566 LNDHVEFMSLMSYDYHGAWEGI-TGLVSPLNSRPGEPYPNYNINTALKLIDELGGDKRKI 624
           +N +++++++M+YDY+G+WE   TG  + L         +Y I + +K     G   +K+
Sbjct: 177 INKNLDWVNVMAYDYYGSWESNTTGPAAALYDPNSNVSTDYGIKSWIK----AGVPAKKL 232

Query: 625 VIGVPFYGQSYTL 637
           V+G+P YG+++TL
Sbjct: 233 VLGLPLYGRAWTL 245



 Score = 72.8 bits (179), Expect = 2e-13
 Identities = 69/334 (20%), Positives = 132/334 (39%), Gaps = 85/334 (25%)

Query: 43  THVIYAYAAIDPVSRAL-IPEDLEYDVIKGGYKSFLG-LKEANPELKVYLAV------KS 94
           TH+ YA+A +DP +  + I    E +     + +F   +K  NP +K  L++       S
Sbjct: 27  THLFYAFADLDPSTYEVVISPSDESE-----FSTFTETVKRKNPSVKTLLSIGGGGSDSS 81

Query: 95  NFVSITSDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFST 154
            F ++ SD  +R  FI+S +++   Y FDGLDL+ + P        ++E       +F  
Sbjct: 82  AFAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFP----SSQVEME-------NFGK 130

Query: 155 FIQELSSTLRR-----NNYQLTLTSPGVIDRKTSLVDISVVAPL------VDLILLKSFN 203
            ++E  + ++          L LT+         L D SV  P+      +D + + +++
Sbjct: 131 LLEEWRAAVKDEARSSGRPPLLLTAAVYFSPILFLSDDSVSYPIEAINKNLDWVNVMAYD 190

Query: 204 NDHMDDEVVPVKPNTKVNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYR 263
             +   E     P   +    ++   ++      + +WIK G   +++++G+P +G+++ 
Sbjct: 191 Y-YGSWESNTTGPAAALYDPNSNVSTDY-----GIKSWIKAGVPAKKLVLGLPLYGRAWT 244

Query: 264 LFNRSEYGLGATVKGPGTEGKYTQMPGYLAFFEVCNKFKDKTWRHFTDSNGEPFMVKKDE 323
           L++              T   Y               +   T                  
Sbjct: 245 LYD------------TTTVSSYV--------------YAGTT------------------ 260

Query: 324 WITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDD 357
           WI Y++  SI  K+ Y K + L G   W +  DD
Sbjct: 261 WIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294



 Score = 59.7 bits (145), Expect = 5e-09
 Identities = 39/118 (33%), Positives = 60/118 (50%), Gaps = 6/118 (5%)

Query: 2200 AYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQ--ELVIRSHNNKLDLEQGFYEKITKFN 2257
             Y      ++ P +ID SL THL YAFADLD    E+VI   +     E   + +  K  
Sbjct: 7    GYWPAWSEEFPPSNIDSSLFTHLFYAFADLDPSTYEVVISPSDES---EFSTFTETVKRK 63

Query: 2258 ERGIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEFAWEYPK 2315
               ++ +L+IGG   S S  ++ + +D  AR+ F+    +    YGF+GL+  WE+P 
Sbjct: 64   NPSVKTLLSIGG-GGSDSSAFAAMASDPTARKAFINSSIKVARKYGFDGLDLDWEFPS 120



 Score = 43.1 bits (102), Expect = 0.001
 Identities = 19/55 (34%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 736 RKWTVQRNTKSLEPFTFQGDQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLDD 790
           R WT+  +T ++  + + G  W+ Y+D  S+  KVK+ K  G  G  AW +  DD
Sbjct: 241 RAWTLY-DTTTVSSYVYAGTTWIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294



 Score = 42.0 bits (99), Expect = 0.003
 Identities = 18/41 (43%), Positives = 22/41 (53%)

Query: 2341 YAYNGKLWLSYDDTRDIRRKCKSIKQDGLGGAMVWSIDLDD 2381
            Y Y G  W+ YDD + I  K K  KQ GL G   W++  DD
Sbjct: 254  YVYAGTTWIGYDDVQSIAVKVKYAKQKGLLGYFAWAVGYDD 294


>gnl|CDD|119357 cd02878, GH18_zymocin_alpha, Zymocin, alpha subunit.  Zymocin is a
            heterotrimeric enzyme that inhibits yeast cell cycle
            progression. The zymocin alpha subunit has a chitinase
            activity that is essential for holoenzyme action from the
            cell exterior while the gamma subunit contains the
            intracellular toxin responsible for G1 phase cell cycle
            arrest.  The zymocin alpha and beta subunits are thought
            to act from the cell's exterior by docking to the cell
            wall-associated chitin, thus mediating gamma-toxin
            translocation.  The alpha subunit has an eight-stranded
            TIM barrel fold similar to that of family 18 glycosyl
            hydrolases such as hevamine, chitolectin, and chitobiase.
          Length = 345

 Score =  128 bits (325), Expect = 6e-32
 Identities = 98/361 (27%), Positives = 145/361 (40%), Gaps = 32/361 (8%)

Query: 1712 KIVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVL--DTDQLVIKPHDTWADLDNK 1769
            K + YF  +   R        + ID+   TH+ + FA +  D    V    + +    + 
Sbjct: 1    KNIAYFEAYNLDR-PCLNMDVTQIDTSKYTHIHFAFANITSDFSVDVSSVQEQF----SD 55

Query: 1770 FYEKVTALKKKGVKVTLAIGGWNDSAG-NKYSRLVN--SQQARSKFIAHVVNFILEHNFD 1826
            F       K KGVK  L+ GGW+ S   + Y    +      R  F  +VVNF+ ++N D
Sbjct: 56   F------KKLKGVKKILSFGGWDFSTSPSTYQIFRDAVKPANRDTFANNVVNFVNKYNLD 109

Query: 1827 GLDLDWEYPKCWQV-DCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDNA 1885
            G+D DWEYP    +     G   D + + + +K L++   P    LS A  P+       
Sbjct: 110  GVDFDWEYPGAPDIPGIPAGDPDDGKNYLEFLKLLKSKL-PSGKSLSIAA-PASYWYLKG 167

Query: 1886 YDIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPN------DTTPTFNANYSLHYW 1939
            + I  M++ +D+I  MTYD HGQWD      +P     N      + T T +A   L   
Sbjct: 168  FPIKDMAKYVDYIVYMTYDLHGQWDYGNKWASPGCPAGNCLRSHVNKTETLDA---LSMI 224

Query: 1940 VSHGADRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEICDKI 1999
               G    KV+ G+  YG+SF +AD    G      G     E  R      Y  I +  
Sbjct: 225  TKAGVPSNKVVVGVASYGRSFKMADPGCTGPGCTFTGPGSGAEAGRCTCTAGYGAISEIE 284

Query: 2000 QKDGWVVVRDRKRRIGPYA----FKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDL 2055
              D       R       +    +  DQWV +   A    + E+ K  + GG   WA+DL
Sbjct: 285  IIDISKSKNKRWYDTDSDSDILVYDDDQWVAYMSPATKAARIEWYKGLNFGGTSDWAVDL 344

Query: 2056 D 2056
             
Sbjct: 345  Q 345



 Score = 86.6 bits (215), Expect = 1e-17
 Identities = 71/287 (24%), Positives = 114/287 (39%), Gaps = 46/287 (16%)

Query: 404 PENIDYKLCTHIVYAFASLDPNTLSIQAGNPEADIDDNFYQRISSSPLVTHGKVKILIAI 463
              ID    THI +AFA++            +  +D +  Q    S       VK +++ 
Sbjct: 20  VTQIDTSKYTHIHFAFANI----------TSDFSVDVSSVQ-EQFSDFKKLKGVKKILSF 68

Query: 464 GGWTDSSGEKYSQLISSG---SNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQA----D 516
           GGW  S+     Q+       +NR  F  +V+ F+ +++  G+ FDW YP    A     
Sbjct: 69  GGWDFSTSPSTYQIFRDAVKPANRDTFANNVVNFVNKYNLDGVDFDWEYP---GAPDIPG 125

Query: 517 CSKQHKADKGNFVKLIQELKAEF-DKHDYSIAVGISGYKEILEVAY--DFP--ALNDHVE 571
                  D  N+++ ++ LK++       SIA   S +       Y   FP   +  +V+
Sbjct: 126 IPAGDPDDGKNYLEFLKLLKSKLPSGKSLSIAAPASYW-------YLKGFPIKDMAKYVD 178

Query: 572 FMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNY--------NINTALKLIDELGGDKRK 623
           ++  M+YD HG W+          + PG P  N             AL +I + G    K
Sbjct: 179 YIVYMTYDLHGQWD-----YGNKWASPGCPAGNCLRSHVNKTETLDALSMITKAGVPSNK 233

Query: 624 IVIGVPFYGQSYTLKLDKEHGLGAETDGPGLAGEYTQQPGMLAYYEI 670
           +V+GV  YG+S+ +      G G    GPG   E  +      Y  I
Sbjct: 234 VVVGVASYGRSFKMADPGCTGPGCTFTGPGSGAEAGRCTCTAGYGAI 280



 Score = 82.4 bits (204), Expect = 3e-16
 Identities = 64/195 (32%), Positives = 87/195 (44%), Gaps = 31/195 (15%)

Query: 1410 LKKKGVKVSLAIGGWNDSLG-GKYSRL---VNSATARQRFIEHVVKFLLKYQFDGLDLDW 1465
             K KGVK  L+ GGW+ S     Y      V  A  R  F  +VV F+ KY  DG+D DW
Sbjct: 57   KKLKGVKKILSFGGWDFSTSPSTYQIFRDAVKPAN-RDTFANNVVNFVNKYNLDGVDFDW 115

Query: 1466 EYPTCWQVNCDAGPD---------SDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVIN 1516
            EYP          PD          D +++  F++ L     P G  LS A  P+     
Sbjct: 116  EYP--------GAPDIPGIPAGDPDDGKNYLEFLKLLKSKL-PSGKSLSIAA-PASYWYL 165

Query: 1517 AAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANFTMNYW---M 1573
              + +K +++ +D+I  MTYD HGQWD      +P    P  +    + N T       M
Sbjct: 166  KGFPIKDMAKYVDYIVYMTYDLHGQWDYGNKWASP--GCPAGNCLRSHVNKTETLDALSM 223

Query: 1574 --KKGAPSRKLVMGV 1586
              K G PS K+V+GV
Sbjct: 224  ITKAGVPSNKVVVGV 238



 Score = 50.8 bits (122), Expect = 5e-06
 Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 14/128 (10%)

Query: 2191 KVICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQ-ELVIRSHNNKLDLEQGF 2249
            K I Y+ ++   R          ID S  TH+ +AFA++     + + S   +       
Sbjct: 1    KNIAYFEAYNLDR-PCLNMDVTQIDTSKYTHIHFAFANITSDFSVDVSSVQEQFSD---- 55

Query: 2250 YEKITKFNERGIQVILAIGGWSDSRS-DKYSRLVNDI-PARRN-FVRHVTEFLEFYGFNG 2306
            ++K+      G++ IL+ GGW  S S   Y    + + PA R+ F  +V  F+  Y  +G
Sbjct: 56   FKKLK-----GVKKILSFGGWDFSTSPSTYQIFRDAVKPANRDTFANNVVNFVNKYNLDG 110

Query: 2307 LEFAWEYP 2314
            ++F WEYP
Sbjct: 111  VDFDWEYP 118



 Score = 48.1 bits (115), Expect = 4e-05
 Identities = 30/123 (24%), Positives = 57/123 (46%), Gaps = 12/123 (9%)

Query: 243 KKGARPEQIIIGIPFFGKSYRLFNRSEYGLGATVKGPG---TEGKYTQMPGYLAFFEVCN 299
           K G    ++++G+  +G+S+++ +    G G T  GPG     G+ T   GY A  E+  
Sbjct: 226 KAGVPSNKVVVGVASYGRSFKMADPGCTGPGCTFTGPGSGAEAGRCTCTAGYGAISEIEI 285

Query: 300 KFKDKT-----WRHFTDSNGEPFMV-KKDEWITYENNDSIRRKMNYIKDRHLGGAMLWTL 353
               K+     +   +DS+    +V   D+W+ Y +  +   ++ + K  + GG   W +
Sbjct: 286 IDISKSKNKRWYDTDSDSD---ILVYDDDQWVAYMSPATKAARIEWYKGLNFGGTSDWAV 342

Query: 354 DLD 356
           DL 
Sbjct: 343 DLQ 345



 Score = 35.7 bits (83), Expect = 0.26
 Identities = 11/39 (28%), Positives = 20/39 (51%)

Query: 751 TFQGDQWVSYEDPTSLTEKVKFIKSNGYAGIMAWTIDLD 789
            +  DQWV+Y  P +   ++++ K   + G   W +DL 
Sbjct: 307 VYDDDQWVAYMSPATKAARIEWYKGLNFGGTSDWAVDLQ 345


>gnl|CDD|119353 cd02874, GH18_CFLE_spore_hydrolase, Cortical fragment-lytic enzyme
            (CFLE) is a peptidoglycan hydrolase involved in
            bacterial endospore germination.  CFLE is expressed as an
            inactive preprotein (called SleB) in the forespore
            compartment of sporulating cells.  SleB translocates
            across the forespore inner membrane and is deposited as a
            mature enzyme in the cortex layer of the spore.  As part
            of a sensory mechanism capable of initiating germination,
            CFLE degrades a spore-specific peptidoglycan constituent
            called muramic-acid delta-lactam that comprises the outer
            cortex.  CFLE has a C-terminal glycosyl hydrolase family
            18 (GH18) catalytic domain as well as two N-terminal LysM
            peptidoglycan-binding domains.  In addition to SleB, this
            family includes YaaH, YdhD, and YvbX from Bacillus
            subtilis.
          Length = 313

 Score = 98.5 bits (246), Expect = 7e-22
 Identities = 57/222 (25%), Positives = 98/222 (44%), Gaps = 63/222 (28%)

Query: 1396 LTYVHILFY-------------ERVVTL-KKKGVKVSLAI-----GGWNDSLGGKYSRLV 1436
            LTY+   +Y             ER++   K++GVK  L I     G ++  L      ++
Sbjct: 26   LTYIAPFWYGVDADGTLTGLPDERLIEAAKRRGVKPLLVITNLTNGNFDSELA---HAVL 82

Query: 1437 NSATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQA 1496
            ++  ARQR I +++    KY +DG+++D+E      V        D+E++  F+REL   
Sbjct: 83   SNPEARQRLINNILALAKKYGYDGVNIDFEN-----VPP-----EDREAYTQFLRELSDR 132

Query: 1497 FKPHGLLLSAAVSPSK-----QVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAP 1551
              P G  LS AV P          + AYD  A+ + +D++ +MTYD+H +     G VAP
Sbjct: 133  LHPAGYTLSTAVVPKTSADQFGNWSGAYDYAAIGKIVDFVVLMTYDWHWRG-GPPGPVAP 191

Query: 1552 LYEHPDDDFFYFNANFTMNYWMKK-------GAPSRKLVMGV 1586
            +                   W+++         P  K+++G+
Sbjct: 192  I------------------GWVERVLQYAVTQIPREKILLGI 215



 Score = 98.5 bits (246), Expect = 7e-22
 Identities = 71/307 (23%), Positives = 125/307 (40%), Gaps = 74/307 (24%)

Query: 1774 VTALKKKGVKVTLAIGGWNDS--AGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLD 1831
            + A K++GVK  L I    +          ++++ +AR + I +++    ++ +DG+++D
Sbjct: 51   IEAAKRRGVKPLLVITNLTNGNFDSELAHAVLSNPEARQRLINNILALAKKYGYDGVNID 110

Query: 1832 WEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDN-----AY 1886
            +E            P  D++ +   ++EL    +P    LS AV P  +         AY
Sbjct: 111  FENV----------PPEDREAYTQFLRELSDRLHPAGYTLSTAVVPKTSADQFGNWSGAY 160

Query: 1887 DIPVMSENLDWISVMTYDYHGQWDKKTGHVAPM--------YAL---PNDTTPTFNANYS 1935
            D   + + +D++ +MTYD+H +     G VAP+        YA+   P            
Sbjct: 161  DYAAIGKIVDFVVLMTYDWHWRG-GPPGPVAPIGWVERVLQYAVTQIP------------ 207

Query: 1936 LHYWVSHGADRKKVIFGMPMYGQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEI 1995
                      R+K++ G+P+YG  +TL    K G  + T    +A     A+ + A    
Sbjct: 208  ----------REKILLGIPLYGYDWTL--PYKKGGKASTISPQQAIN--LAKRYGA---- 249

Query: 1996 CDKIQKDGWVVVRDRKRRIGPYAFKGDQ-----WVGFDDQAMIHHKAEFVKYNDLGGAMI 2050
              +IQ D        +    P+    D+      V F+D   +  K E  K   L G   
Sbjct: 250  --EIQYD--------EEAQSPFFRYVDEQGRRHEVWFEDARSLQAKFELAKEYGLRGVSY 299

Query: 2051 WALDLDD 2057
            W L L+D
Sbjct: 300  WRLGLED 306



 Score = 77.7 bits (192), Expect = 7e-15
 Identities = 60/271 (22%), Positives = 109/271 (40%), Gaps = 54/271 (19%)

Query: 456 KVKILIAIGGWTDS--SGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFDWN--YPV 511
            VK L+ I   T+     E    ++S+   R++ I ++L   +++ + G++ D+    P 
Sbjct: 58  GVKPLLVITNLTNGNFDSELAHAVLSNPEARQRLINNILALAKKYGYDGVNIDFENVPP- 116

Query: 512 CWQADCSKQHKADKGNFVKLIQELKAEFDKHDYSIAVGI-----SGYKEILEVAYDFPAL 566
                       D+  + + ++EL        Y+++  +     +        AYD+ A+
Sbjct: 117 -----------EDREAYTQFLRELSDRLHPAGYTLSTAVVPKTSADQFGNWSGAYDYAAI 165

Query: 567 NDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNYNINTALKLIDELGGDKRKIVI 626
              V+F+ LM+YD+H    G  G V+P+       +    +  A+  I      + KI++
Sbjct: 166 GKIVDFVVLMTYDWHWRG-GPPGPVAPIG------WVERVLQYAVTQI-----PREKILL 213

Query: 627 GVPFYGQSYTLKLDKEHGLGAETDGPG----LAGEY--------TQQPGMLAYYEICYRI 674
           G+P YG  +TL   K  G  A T  P     LA  Y          Q     Y +     
Sbjct: 214 GIPLYGYDWTLPYKK--GGKASTISPQQAINLAKRYGAEIQYDEEAQSPFFRYVD---EQ 268

Query: 675 KTRKWTVQRNTKSLEPRSMGTLKLDKEHGLG 705
             R      + +SL+ +     +L KE+GL 
Sbjct: 269 GRRHEVWFEDARSLQAK----FELAKEYGLR 295



 Score = 57.3 bits (139), Expect = 4e-08
 Identities = 54/276 (19%), Positives = 103/276 (37%), Gaps = 65/276 (23%)

Query: 98  SITSDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANE-RSDFSTFI 156
           ++ S+ E+R   I+++L +   Y +DG+++             D E++  E R  ++ F+
Sbjct: 80  AVLSNPEARQRLINNILALAKKYGYDGVNI-------------DFENVPPEDREAYTQFL 126

Query: 157 QELSSTLRRNNYQLTLTS-PGVIDRKTSLV----DISVVAPLVDLILLKSFNNDHMDDEV 211
           +ELS  L    Y L+    P     +        D + +  +VD ++L +++        
Sbjct: 127 RELSDRLHPAGYTLSTAVVPKTSADQFGNWSGAYDYAAIGKIVDFVVLMTYDWHWRGGPP 186

Query: 212 VPVKPNTKVNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYG 271
            PV P   V             ++ AV   I +    E+I++GIP +G     +    Y 
Sbjct: 187 GPVAPIGWVE----------RVLQYAV-TQIPR----EKILLGIPLYG---YDWT-LPYK 227

Query: 272 LGATVKGPGTEG------KYTQMPGYLAFFEVCNKFKDKTWRH----FTDSNGEPFMVKK 321
            G        +       +Y     Y           D+  +     + D  G      +
Sbjct: 228 KGGKASTISPQQAINLAKRYGAEIQY-----------DEEAQSPFFRYVDEQG------R 270

Query: 322 DEWITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDD 357
              + +E+  S++ K    K+  L G   W L L+D
Sbjct: 271 RHEVWFEDARSLQAKFELAKEYGLRGVSYWRLGLED 306


>gnl|CDD|119362 cd06545, GH18_3CO4_chitinase, The Bacteroides thetaiotaomicron
            protein represented by pdb structure 3CO4 is an
            uncharacterized bacterial member of the family 18
            glycosyl hydrolases with homologs found in
            Flavobacterium, Stigmatella, and Pseudomonas.
          Length = 253

 Score = 88.7 bits (220), Expect = 5e-19
 Identities = 67/273 (24%), Positives = 109/273 (39%), Gaps = 48/273 (17%)

Query: 1713 IVCYFTNWAW-YRQSGGKYLPSDIDSDLCTHVIYGFAVLDTD-QLVIKPHDTWADLDNKF 1770
            +V Y  N+      S        ID    TH+   FA  D +  L   P    ++L+   
Sbjct: 1    VVGYLPNYDDLNALSPT------IDFSKLTHINLAFANPDANGTLNANP--VRSELN--- 49

Query: 1771 YEKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDL 1830
               V A     VK+ +++ G    +  +++  +N    R   +  ++N+++ +N DG+D+
Sbjct: 50   -SVVNAAHAHNVKILISLAG---GSPPEFTAALNDPAKRKALVDKIINYVVSYNLDGIDV 105

Query: 1831 DWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDNAYDIPV 1890
            D E P     D           +   I+ L AA      LL+AAVS       N   +  
Sbjct: 106  DLEGPDVTFGD-----------YLVFIRALYAALKKEGKLLTAAVSSW-----NGGAVSD 149

Query: 1891 MSENL-DWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHG-ADRKK 1948
             +    D+I++M+YD  G W             P   +   +A   L+YW   G A + K
Sbjct: 150  STLAYFDFINIMSYDATGPWWGDN---------PGQHSSYDDAVNDLNYWNERGLASKDK 200

Query: 1949 VIFGMPMYGQSFTLAD----KNKNGLNSQTYGG 1977
            ++ G+P YG  F        +NK     Q YGG
Sbjct: 201  LVLGLPFYGYGFYYNGIPTIRNKVAFAKQNYGG 233



 Score = 85.2 bits (211), Expect = 9e-18
 Identities = 63/252 (25%), Positives = 100/252 (39%), Gaps = 54/252 (21%)

Query: 1342 IVCYFTNWAWYRPGKGKYVLKKVPWINILKLYV----TLQTGLGTDQEKENTFLKISELT 1397
            +V Y  N+             K+  IN+         TL                 SEL 
Sbjct: 1    VVGYLPNYDDLNALSPTIDFSKLTHINLAFANPDANGTLNANPVR-----------SELN 49

Query: 1398 YVHILFYERVVTLKKKGVKV--SLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLK 1455
             V       V       VK+  SLA G        +++  +N    R+  ++ ++ +++ 
Sbjct: 50   SV-------VNAAHAHNVKILISLAGGS-----PPEFTAALNDPAKRKALVDKIINYVVS 97

Query: 1456 YQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVI 1515
            Y  DG+D+D E           GPD     + +F+R L+ A K  G LL+AAVS      
Sbjct: 98   YNLDGIDVDLE-----------GPDVTFGDYLVFIRALYAALKKEGKLLTAAVSSWNG-- 144

Query: 1516 NAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANFTMNYWMKK 1575
              A     L+   D+I++M+YD  G W           ++P     Y +A   +NYW ++
Sbjct: 145  -GAVSDSTLA-YFDFINIMSYDATGPWWG---------DNPGQHSSYDDAVNDLNYWNER 193

Query: 1576 G-APSRKLVMGV 1586
            G A   KLV+G+
Sbjct: 194  GLASKDKLVLGL 205



 Score = 65.2 bits (159), Expect = 5e-11
 Identities = 60/258 (23%), Positives = 100/258 (38%), Gaps = 51/258 (19%)

Query: 385 VVCYVTNWSGSRKSDGKFVPENIDYKLCTHIVYAFASLDPN-TLSIQAGNPE--ADIDDN 441
           VV Y+ N+              ID+   THI  AFA+ D N TL+      E  + ++  
Sbjct: 1   VVGYLPNYDDLNALSPT-----IDFSKLTHINLAFANPDANGTLNANPVRSELNSVVNAA 55

Query: 442 FYQRISSSPLVTHGK-VKILIAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDF 500
                       H   VKILI++ G    S  +++  ++  + RK  +  ++ ++  ++ 
Sbjct: 56  ------------HAHNVKILISLAG---GSPPEFTAALNDPAKRKALVDKIINYVVSYNL 100

Query: 501 AGLHFDWNYPVCWQADCSKQHKADKGNFV-KLIQELKAEFDKHDYSIAVGISGYKEILEV 559
            G+  D   P     D           F+  L   LK E      +++    G      +
Sbjct: 101 DGIDVDLEGPDVTFGD--------YLVFIRALYAALKKEGKLLTAAVSSWNGGAVSDSTL 152

Query: 560 AYDFPALNDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPNYNINTALKLIDELG- 618
           AY         +F+++MSYD  G W G           PG+     +    L   +E G 
Sbjct: 153 AY--------FDFINIMSYDATGPWWG---------DNPGQHSSYDDAVNDLNYWNERGL 195

Query: 619 GDKRKIVIGVPFYGQSYT 636
             K K+V+G+PFYG  + 
Sbjct: 196 ASKDKLVLGLPFYGYGFY 213



 Score = 38.6 bits (90), Expect = 0.027
 Identities = 30/158 (18%), Positives = 59/158 (37%), Gaps = 30/158 (18%)

Query: 106 RLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFSTFIQELSSTLRR 165
           R   +  ++     Y  DG+D+             DLE       D+  FI+ L + L++
Sbjct: 84  RKALVDKIINYVVSYNLDGIDV-------------DLEGPDVTFGDYLVFIRALYAALKK 130

Query: 166 NNYQLTLTSPGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTKVNIQVT 225
                 L +  V       V  S +A   D I + S++            P    N    
Sbjct: 131 EG---KLLTAAVSSWNGGAVSDSTLA-YFDFINIMSYDA---------TGPWWGDNPGQH 177

Query: 226 STIANFNNIESAVYNWIKKGARP-EQIIIGIPFFGKSY 262
           S+  +     + +  W ++G    +++++G+PF+G  +
Sbjct: 178 SSYDDA---VNDLNYWNERGLASKDKLVLGLPFYGYGF 212



 Score = 30.9 bits (70), Expect = 6.7
 Identities = 23/126 (18%), Positives = 51/126 (40%), Gaps = 13/126 (10%)

Query: 2192 VICYYTSWAYLRQAEGKYSPEDIDGSLCTHLVYAFADLDEQELVIRSHNNKLDLEQGFYE 2251
            V+ Y  ++  L           ID S  TH+  AFA+ D    +  +     +L      
Sbjct: 1    VVGYLPNYDDLNALSPT-----IDFSKLTHINLAFANPDANGTLNANPVRS-ELN----S 50

Query: 2252 KITKFNERGIQVILAIGGWSDSRSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEFAW 2311
             +   +   +++++++ G S     +++  +ND   R+  V  +  ++  Y  +G++   
Sbjct: 51   VVNAAHAHNVKILISLAGGS---PPEFTAALNDPAKRKALVDKIINYVVSYNLDGIDVDL 107

Query: 2312 EYPKCW 2317
            E P   
Sbjct: 108  EGPDVT 113


>gnl|CDD|119359 cd06542, GH18_EndoS-like, Endo-beta-N-acetylglucosaminidases are
            bacterial chitinases that hydrolyze the chitin core of
            various asparagine (N)-linked glycans and glycoproteins.
            The endo-beta-N-acetylglucosaminidases have a glycosyl
            hydrolase family 18 (GH18) catalytic domain.  Some
            members also have an additional C-terminal glycosyl
            hydrolase family 20 (GH20) domain while others have an
            N-terminal domain of unknown function (pfam08522).
            Members of this family include
            endo-beta-N-acetylglucosaminidase S (EndoS) from
            Streptococcus pyogenes, EndoF1, EndoF2, EndoF3, and
            EndoH from Flavobacterium meningosepticum, and  EndoE
            from Enterococcus faecalis.  EndoS is a secreted
            endoglycosidase from Streptococcus pyogenes that
            specifically hydrolyzes the glycan on human IgG between
            two core N-acetylglucosamine residues.  EndoE is a
            secreted endoglycosidase, encoded by the ndoE gene in
            Enterococcus faecalis, that hydrolyzes the glycan on
            human RNase B.
          Length = 255

 Score = 68.9 bits (169), Expect = 2e-12
 Identities = 37/143 (25%), Positives = 57/143 (39%), Gaps = 13/143 (9%)

Query: 1772 EKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLD 1831
              +  L+ KG KV L+I G +  AG   +    S  A   +   +V+ + ++  DG+D D
Sbjct: 55   TYIRPLQAKGTKVLLSILGNHLGAGFANN---LSDAAAKAYAKAIVDTVDKYGLDGVDFD 111

Query: 1832 WEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDNAYDIPVM 1891
             EY                + F  LIKELR    P D LL+            + D   +
Sbjct: 112  DEY---SGYGKNGTSQPSNEAFVRLIKELRKYMGPTDKLLTIDGYGQAL----SNDGEEV 164

Query: 1892 SENLDWISVMTYDYHGQWDKKTG 1914
            S  +D++    Y Y+G     T 
Sbjct: 165  SPYVDYV---IYQYYGSSSSSTQ 184



 Score = 55.1 bits (133), Expect = 1e-07
 Identities = 32/144 (22%), Positives = 52/144 (36%), Gaps = 13/144 (9%)

Query: 1410 LKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYPT 1469
            L+ KG KV L+I G +   G   +    S  A + + + +V  + KY  DG+D D EY  
Sbjct: 60   LQAKGTKVLLSILGNHLGAGFANN---LSDAAAKAYAKAIVDTVDKYGLDGVDFDDEY-- 114

Query: 1470 CWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVINAAYDVKALSESLD 1529
                  +       E+F   ++EL +   P   LL+                    E   
Sbjct: 115  -SGYGKNGTSQPSNEAFVRLIKELRKYMGPTDKLLTIDGYGQ-------ALSNDGEEVSP 166

Query: 1530 WISVMTYDYHGQWDKKTGHVAPLY 1553
            ++  + Y Y+G     T       
Sbjct: 167  YVDYVIYQYYGSSSSSTQRNWNTN 190


>gnl|CDD|226376 COG3858, COG3858, Predicted glycosyl hydrolase [General function
            prediction only].
          Length = 423

 Score = 68.6 bits (168), Expect = 2e-11
 Identities = 65/348 (18%), Positives = 128/348 (36%), Gaps = 64/348 (18%)

Query: 1741 THVIYGFAVLDTDQLVIK---PHDTWADLDNKFYEKV------------TALKKKGVKVT 1785
            T         +T +LV +   P+ T+ DL +   ++                + + +K  
Sbjct: 105  TAYYAPRPPTETGELVRRHAGPYLTYVDLFSYHAQENGNLTETPNENVIEIAQCRKIKPV 164

Query: 1786 LAIGGWNDSAGNKY----SRLVNSQQARSKFIAHVVNFILEHNFDGLDLDWEYPKCWQVD 1841
              I        N        L+N++ A+++ I +++  +    + G+++D+E        
Sbjct: 165  PGISNGTRPGANFGGELAQLLLNNETAKNRLINNIITLLDARGYRGVNIDFE-------- 216

Query: 1842 CKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAVIDN-----AYDIPVMSENLD 1896
                   D++ + D ++++R A +     +S AV+   + +       AYD   + +  D
Sbjct: 217  --NVGPGDRELYTDFLRQVRDALHSGGYTVSIAVAAKTSDLQVGSWHGAYDYVALGKIAD 274

Query: 1897 WISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHGADRKKVIFGMPMY 1956
            ++ +MTYD+H       G VA +                + Y ++      KV+ G+P+Y
Sbjct: 275  FVILMTYDWHYSGGPP-GPVASI-------GWVRK---VIEYALTVIPAE-KVMMGIPLY 322

Query: 1957 GQSFTLADKNKNGLNSQTYGGAEAGENTRARGFLAYYEIC---DKIQKDGWVVVRDRKRR 2013
            G  +TL        +   Y  A A     A      Y      D   +  +    D++ R
Sbjct: 323  GYDWTLPY------DPLGY-LARAISPDEAIDIANRYNATIQYDATSQSPFFYYVDKEGR 375

Query: 2014 IGPYAFKGDQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDFKNF 2061
                       V F+D      K + +K   L G   W L  +D +N+
Sbjct: 376  --------YHEVWFEDARSFQTKLDLIKEYGLRGVSYWVLGQEDPRNW 415



 Score = 65.5 bits (160), Expect = 1e-10
 Identities = 42/204 (20%), Positives = 75/204 (36%), Gaps = 36/204 (17%)

Query: 1374 VTLQTGLGTDQEKENTFLKISE--LTYVHILFYER--------------VVTLKKKGVKV 1417
             T         E      + +   LTYV +  Y                +   + + +K 
Sbjct: 104  TTAYYAPRPPTETGELVRRHAGPYLTYVDLFSYHAQENGNLTETPNENVIEIAQCRKIKP 163

Query: 1418 SLAIGGWNDSLGGKY----SRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCWQV 1473
               I                 L+N+ TA+ R I +++  L    + G+++D+E       
Sbjct: 164  VPGISNGTRPGANFGGELAQLLLNNETAKNRLINNIITLLDARGYRGVNIDFENVG---- 219

Query: 1474 NCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVI-----NAAYDVKALSESL 1528
                    D+E +  F+R++  A    G  +S AV+     +     + AYD  AL +  
Sbjct: 220  ------PGDRELYTDFLRQVRDALHSGGYTVSIAVAAKTSDLQVGSWHGAYDYVALGKIA 273

Query: 1529 DWISVMTYDYHGQWDKKTGHVAPL 1552
            D++ +MTYD+H       G VA +
Sbjct: 274  DFVILMTYDWHYSGGPP-GPVASI 296



 Score = 54.7 bits (132), Expect = 4e-07
 Identities = 44/193 (22%), Positives = 80/193 (41%), Gaps = 28/193 (14%)

Query: 468 DSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGN 527
           +  GE    L+++ + + + I +++T L    + G++ D+                D+  
Sbjct: 176 NFGGELAQLLLNNETAKNRLINNIITLLDARGYRGVNIDFEN----------VGPGDREL 225

Query: 528 FVKLIQELKAEFDKHDY--SIAVGISGYKEILEV---AYDFPALNDHVEFMSLMSYDYHG 582
           +   +++++       Y  SIAV        +     AYD+ AL    +F+ LM+YD+H 
Sbjct: 226 YTDFLRQVRDALHSGGYTVSIAVAAKTSDLQVGSWHGAYDYVALGKIADFVILMTYDWHY 285

Query: 583 AWEGITGLVSPLNSRPGEPYPNYNINTALKLIDELGGDKRKIVIGVPFYGQSYTLKLDKE 642
           +  G  G V+ +          Y    AL +I        K+++G+P YG  +TL  D  
Sbjct: 286 SG-GPPGPVASIG--WVRKVIEY----ALTVI-----PAEKVMMGIPLYGYDWTLPYD-P 332

Query: 643 HGLGAETDGPGLA 655
            G  A    P  A
Sbjct: 333 LGYLARAISPDEA 345



 Score = 49.0 bits (117), Expect = 2e-05
 Identities = 44/273 (16%), Positives = 98/273 (35%), Gaps = 52/273 (19%)

Query: 99  ITSDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANE-RSDFSTFIQ 157
           + ++  ++   I++++ + D   + G+++             D E++    R  ++ F++
Sbjct: 185 LLNNETAKNRLINNIITLLDARGYRGVNI-------------DFENVGPGDRELYTDFLR 231

Query: 158 ELSSTLRRNNYQLTL-----TSPGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVV 212
           ++   L    Y +++     TS   +       D   +  + D ++L +++  +      
Sbjct: 232 QVRDALHSGGYTVSIAVAAKTSDLQVGSWHGAYDYVALGKIADFVILMTYDWHYSGGPPG 291

Query: 213 PVKPNTKVNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYGL 272
           PV     V             IE A+          E++++GIP +G  Y         L
Sbjct: 292 PVASIGWVR----------KVIEYAL-----TVIPAEKVMMGIPLYG--YDWT------L 328

Query: 273 GATVKGPGTEGKYTQMPGYLAFFEVCNKFKDKT----WRHFTDSNGEPFMVKKDEWITYE 328
                G             +A         D T    + ++ D  G      +   + +E
Sbjct: 329 PYDPLGYLARAISPDEAIDIANRYNATIQYDATSQSPFFYYVDKEG------RYHEVWFE 382

Query: 329 NNDSIRRKMNYIKDRHLGGAMLWTLDLDDFRGF 361
           +  S + K++ IK+  L G   W L  +D R +
Sbjct: 383 DARSFQTKLDLIKEYGLRGVSYWVLGQEDPRNW 415



 Score = 39.7 bits (93), Expect = 0.020
 Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 5/64 (7%)

Query: 732 RIKTRKWTVQRNTKSLEPFTFQGDQ-----WVSYEDPTSLTEKVKFIKSNGYAGIMAWTI 786
                  T+Q +  S  PF +  D+      V +ED  S   K+  IK  G  G+  W +
Sbjct: 348 IANRYNATIQYDATSQSPFFYYVDKEGRYHEVWFEDARSFQTKLDLIKEYGLRGVSYWVL 407

Query: 787 DLDD 790
             +D
Sbjct: 408 GQED 411


>gnl|CDD|119355 cd02876, GH18_SI-CLP, Stabilin-1 interacting chitinase-like protein
           (SI-CLP) is a eukaryotic chitinase-like protein of
           unknown function that interacts with the
           endocytic/sorting transmembrane receptor stabilin-1 and
           is secreted from the lysosome.  SI-CLP has a glycosyl
           hydrolase family 18 (GH18) domain but lacks a
           chitin-binding domain. The catalytic amino acids of the
           GH18 domain are not conserved in SI-CLP, similar to the
           chitolectins YKL-39, YKL-40, and YM1/2.  Human SI-CLP is
           sorted to late endosomes and secretory lysosomes in
           alternatively activated macrophages.
          Length = 318

 Score = 67.3 bits (165), Expect = 2e-11
 Identities = 51/206 (24%), Positives = 83/206 (40%), Gaps = 36/206 (17%)

Query: 455 GKVKIL--IAIGGWTDSSGEKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFD-WNYPV 511
             +KIL  +   GW+         L++    R+K IK ++T  ++  F G+  + W+   
Sbjct: 65  KNIKILPRVLFEGWSYQD---LQSLLNDEQEREKLIKLLVTTAKKNHFDGIVLEVWSQ-- 119

Query: 512 CWQADCSKQHKADKGNFVKLIQELKAEFDKHDYSIAVGISGYKEILEVAY-----DFPAL 566
              A      + +    ++L+  L       +  + + I   +E           DF  L
Sbjct: 120 -LAAYGVPDKRKE---LIQLVIHLGETLHSANLKLILVIPPPREKGNQNGLFTRKDFEKL 175

Query: 567 NDHVEFMSLMSYDYHGAWEGITGLVSPLNSRPGEPYPN---YNINTALKLIDELGGDKR- 622
             HV+  SLM+YDY               S P  P PN     + + L+L+    G KR 
Sbjct: 176 APHVDGFSLMTYDY---------------SSPQRPGPNAPLSWVRSCLELLLPESGKKRA 220

Query: 623 KIVIGVPFYGQSYTLKLDKEHGLGAE 648
           KI++G+ FYG  YTL        G+E
Sbjct: 221 KILLGLNFYGNDYTLPGGGGAITGSE 246



 Score = 55.4 bits (134), Expect = 1e-07
 Identities = 48/202 (23%), Positives = 78/202 (38%), Gaps = 30/202 (14%)

Query: 1423 GWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCW-QVNCDAGPDS 1481
            GW+         L+N    R++ I+ +V    K  FDG+ L  E    W Q+     PD 
Sbjct: 77   GWSYQD---LQSLLNDEQEREKLIKLLVTTAKKNHFDGIVL--EV---WSQLAAYGVPDK 128

Query: 1482 DKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVINAAY-----DVKALSESLDWISVMTY 1536
             KE     V  L +      L L   + P ++  N        D + L+  +D  S+MTY
Sbjct: 129  RKELIQ-LVIHLGETLHSANLKLILVIPPPREKGNQNGLFTRKDFEKLAPHVDGFSLMTY 187

Query: 1537 DYHGQWDKKTGHVAPLYEHPDDDFFYFNANFTMNYWMKKGAPSR-KLVMGVHPLLSTITE 1595
            DY     ++ G  APL         +  +   +   + +    R K+++G++   +  T 
Sbjct: 188  DYSSP--QRPGPNAPLS--------WVRSC--LELLLPESGKKRAKILLGLNFYGNDYTL 235

Query: 1596 VLGHGP--GGNYESTTEEYKPT 1615
              G G   G  Y    +  KP 
Sbjct: 236  PGGGGAITGSEYLKLLKSNKPK 257



 Score = 54.6 bits (132), Expect = 3e-07
 Identities = 47/217 (21%), Positives = 82/217 (37%), Gaps = 47/217 (21%)

Query: 1765 DLDNKFYEKVTALKKKGVKVT--LAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILE 1822
            D+D  + E+V     K +K+   +   GW+         L+N +Q R K I  +V    +
Sbjct: 51   DIDKGWIEEVRK-ANKNIKILPRVLFEGWS---YQDLQSLLNDEQEREKLIKLLVTTAKK 106

Query: 1823 HNFDGLDLD-WEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSPSKAV 1881
            ++FDG+ L+ W            G    ++    L+  L    +  +L L   + P +  
Sbjct: 107  NHFDGIVLEVWSQ---LAA---YGVPDKRKELIQLVIHLGETLHSANLKLILVIPPPREK 160

Query: 1882 IDNAY-----DIPVMSENLDWISVMTYDYHGQWDKKTGHVAPMYALPNDTTPTFNANYSL 1936
             +        D   ++ ++D  S+MTYDY     ++ G  AP+                 
Sbjct: 161  GNQNGLFTRKDFEKLAPHVDGFSLMTYDYSSP--QRPGPNAPLS---------------- 202

Query: 1937 HYWV---------SHGADRKKVIFGMPMYGQSFTLAD 1964
              WV           G  R K++ G+  YG  +TL  
Sbjct: 203  --WVRSCLELLLPESGKKRAKILLGLNFYGNDYTLPG 237


>gnl|CDD|216601 pfam01607, CBM_14, Chitin binding Peritrophin-A domain.  This domain
            is called the Peritrophin-A domain and is found in chitin
            binding proteins particularly peritrophic matrix proteins
            of insects and animal chitinases. Copies of the domain
            are also found in some baculoviruses. Relevant references
            that describe proteins with this domain include. It is an
            extracellular domain that contains six conserved
            cysteines that probably form three disulphide bridges.
            Chitin binding has been demonstrated for a protein
            containing only two of these domains.
          Length = 53

 Score = 56.7 bits (137), Expect = 5e-10
 Identities = 16/53 (30%), Positives = 24/53 (45%), Gaps = 3/53 (5%)

Query: 1138 CVNGD--YLPDPDDCRSFLICSHGNLLKQSCGPSLLWNAKKKLCDWSYNV-QC 1187
            C        PDP DC  +  CS+G  +  +C   L+++     CD+  NV  C
Sbjct: 1    CEGRPDGLYPDPGDCSKYYQCSNGKAVVFTCPAGLVFDPALGTCDYPDNVVDC 53



 Score = 54.0 bits (130), Expect = 4e-09
 Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 2/51 (3%)

Query: 983  CTSGE--YYPVDGECGSYYRCILGTLKKESCAPGLHWNKVNKICDWPKSAK 1031
            C       YP  G+C  YY+C  G     +C  GL ++     CD+P +  
Sbjct: 1    CEGRPDGLYPDPGDCSKYYQCSNGKAVVFTCPAGLVFDPALGTCDYPDNVV 51



 Score = 53.6 bits (129), Expect = 6e-09
 Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 1205 CQEGE--FAAYPSDCNKYQYCIWGSYQVASCSPGLYWNDKMKTCDWP 1249
            C+         P DC+KY  C  G   V +C  GL ++  + TCD+P
Sbjct: 1    CEGRPDGLYPDPGDCSKYYQCSNGKAVVFTCPAGLVFDPALGTCDYP 47



 Score = 42.0 bits (99), Expect = 7e-05
 Identities = 15/36 (41%), Positives = 20/36 (55%)

Query: 2129 NRTGELYQHEGDCAKFASCENGLIKIYDCPPGLHFN 2164
             R   LY   GDC+K+  C NG   ++ CP GL F+
Sbjct: 3    GRPDGLYPDPGDCSKYYQCSNGKAVVFTCPAGLVFD 38


>gnl|CDD|214696 smart00494, ChtBD2, Chitin-binding domain type 2. 
          Length = 49

 Score = 53.2 bits (128), Expect = 8e-09
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 2/49 (4%)

Query: 1136 ETCVNGD--YLPDPDDCRSFLICSHGNLLKQSCGPSLLWNAKKKLCDWS 1182
              C        P P DC  +  CS+G  +  SC   L++N   + CDW 
Sbjct: 1    NECPGRGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDWP 49



 Score = 52.8 bits (127), Expect = 1e-08
 Identities = 17/47 (36%), Positives = 21/47 (44%), Gaps = 2/47 (4%)

Query: 983  CTSGE--YYPVDGECGSYYRCILGTLKKESCAPGLHWNKVNKICDWP 1027
            C       YP   +C  YY+C  G     SC  GL +N   + CDWP
Sbjct: 3    CPGRGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDWP 49



 Score = 52.1 bits (125), Expect = 2e-08
 Identities = 20/47 (42%), Positives = 23/47 (48%), Gaps = 2/47 (4%)

Query: 1205 CQEGEFAAYPS--DCNKYQYCIWGSYQVASCSPGLYWNDKMKTCDWP 1249
            C       YP   DC+KY  C  G   V SC  GL +N   +TCDWP
Sbjct: 3    CPGRGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDWP 49



 Score = 44.0 bits (104), Expect = 1e-05
 Identities = 18/41 (43%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 2126 NRC-NRTGELYQHEGDCAKFASCENGLIKIYDCPPGLHFNF 2165
            N C  R   LY H  DC+K+  C NG   +  CP GL FN 
Sbjct: 1    NECPGRGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNP 41


>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein.  This family of
            trypanosomal proteins resemble vertebrate mucins. The
            protein consists of three regions. The N and C terminii
            are conserved between all members of the family, whereas
            the central region is not well conserved and contains a
            large number of threonine residues which can be
            glycosylated. Indirect evidence suggested that these
            genes might encode the core protein of parasite mucins,
            glycoproteins that were proposed to be involved in the
            interaction with, and invasion of, mammalian host cells.
            This family contains an N-terminal signal peptide.
          Length = 143

 Score = 51.8 bits (123), Expect = 3e-07
 Identities = 24/63 (38%), Positives = 33/63 (52%)

Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
            T+  + P T +T+T  TT     T    TTTTT+TTTTT   + P+  +TT+  P    T
Sbjct: 48   TTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDT 107

Query: 1675 IKP 1677
              P
Sbjct: 108  RAP 110



 Score = 50.2 bits (119), Expect = 8e-07
 Identities = 34/90 (37%), Positives = 42/90 (46%), Gaps = 8/90 (8%)

Query: 1603 GNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT 1662
            G Y++   E      ++  TT +T+   TT     T   ITTTTT TTTTT      TTT
Sbjct: 29   GQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTT---TTT 85

Query: 1663 STTTTRPKPITTIKPKPTTVKPKPTTTVKP 1692
            +TTT  P   TT     T+  P  T T  P
Sbjct: 86   TTTTEAPSKNTT-----TSEAPTTTDTRAP 110



 Score = 48.3 bits (114), Expect = 5e-06
 Identities = 24/66 (36%), Positives = 28/66 (42%)

Query: 1631 TTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTV 1690
               + + T    TTT  +TTTTT       TT+TT T     TT     TT  P   TT 
Sbjct: 39   AEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTT 98

Query: 1691 KPKPTT 1696
               PTT
Sbjct: 99   SEAPTT 104



 Score = 46.0 bits (108), Expect = 3e-05
 Identities = 23/90 (25%), Positives = 31/90 (34%), Gaps = 1/90 (1%)

Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEE-PITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
            T++       +       +    T     T  TT+TTTTT      TTT+ TTT     T
Sbjct: 24   TAQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTT 83

Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVKPKPVI 1703
            T            TT+  P  T  +    I
Sbjct: 84   TTTTTTEAPSKNTTTSEAPTTTDTRAPSSI 113



 Score = 43.3 bits (101), Expect = 2e-04
 Identities = 26/80 (32%), Positives = 33/80 (41%), Gaps = 6/80 (7%)

Query: 1630 VTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
                E  + +   TTTTT+TT  T      TTT+TTTT     TT     TT     TTT
Sbjct: 33   AAVVEAAEGQSQTTTTTTTTTPPT------TTTTTTTTTTTITTTTTKTTTTTTTTTTTT 86

Query: 1690 VKPKPTTVKPKPVIPPSTKD 1709
               +  +        P+T D
Sbjct: 87   TTTEAPSKNTTTSEAPTTTD 106



 Score = 42.2 bits (98), Expect = 5e-04
 Identities = 29/70 (41%), Positives = 34/70 (48%), Gaps = 11/70 (15%)

Query: 1627 STVVTTEEPEQTEEPITT-------TTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKP 1679
            S   T +   Q +  +         TTT+TTTTT    PPTTT+TTTT    ITT   K 
Sbjct: 20   SVCATAQGEGQYDAAVVEAAEGQSQTTTTTTTTT----PPTTTTTTTTTTTTITTTTTKT 75

Query: 1680 TTVKPKPTTT 1689
            TT     TTT
Sbjct: 76   TTTTTTTTTT 85



 Score = 37.1 bits (85), Expect = 0.028
 Identities = 22/70 (31%), Positives = 28/70 (40%), Gaps = 7/70 (10%)

Query: 1259 ATTTSEQVPKPTKKPTKPTKKPTTTT--EYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
            A   + +    T   T  T  PTTTT         TT  + TT+TTT     T   T   
Sbjct: 34   AVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTT-----TTTTTTTT 88

Query: 1317 WHPPTEPTTT 1326
               P++ TTT
Sbjct: 89   TEAPSKNTTT 98



 Score = 37.1 bits (85), Expect = 0.033
 Identities = 23/70 (32%), Positives = 26/70 (37%)

Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
              A     Q    T   T PT   TTTT       TT  +TTT+TTT +   T       
Sbjct: 36   VEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKN 95

Query: 1317 WHPPTEPTTT 1326
                  PTTT
Sbjct: 96   TTTSEAPTTT 105



 Score = 36.8 bits (84), Expect = 0.035
 Identities = 25/71 (35%), Positives = 30/71 (42%), Gaps = 7/71 (9%)

Query: 1256 QTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEW 1315
            QT+ TTT       T  PT  T   TTTT        T  +TTT+TTT +    P+    
Sbjct: 44   QTTTTTT------TTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTT 97

Query: 1316 VWHPPTEPTTT 1326
                PT  T T
Sbjct: 98   TSEAPT-TTDT 107



 Score = 35.2 bits (80), Expect = 0.13
 Identities = 18/72 (25%), Positives = 26/72 (36%)

Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPN 1311
            T    T+ TTT+      T      T   TTTT     EA +K +TT+   T +    P+
Sbjct: 52   TTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPS 111

Query: 1312 PTEWVWHPPTEP 1323
                +       
Sbjct: 112  SIREIDGSLGSS 123



 Score = 34.8 bits (79), Expect = 0.17
 Identities = 15/69 (21%), Positives = 17/69 (24%)

Query: 1639 EEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
             E             +     TTT+TT       TT      T     TTT     TT  
Sbjct: 27   GEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTT 86

Query: 1699 PKPVIPPST 1707
                 P   
Sbjct: 87   TTTEAPSKN 95



 Score = 34.8 bits (79), Expect = 0.21
 Identities = 20/70 (28%), Positives = 31/70 (44%), Gaps = 4/70 (5%)

Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTS----TTTTTKRPKPPTTT 1662
            +TT    P +  +  TT +T+   TT +   T    TTTTT+    +  TT    P TT 
Sbjct: 47   TTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTD 106

Query: 1663 STTTTRPKPI 1672
            +   +  + I
Sbjct: 107  TRAPSSIREI 116



 Score = 32.5 bits (73), Expect = 1.1
 Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 6/63 (9%)

Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPN 1311
            T     + TTT+      T   T  TK  TTTT       TT+  +  +TT+++    P 
Sbjct: 50   TTTTPPTTTTTTTT--TTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEA----PT 103

Query: 1312 PTE 1314
             T+
Sbjct: 104  TTD 106



 Score = 31.8 bits (71), Expect = 2.1
 Identities = 18/57 (31%), Positives = 23/57 (40%)

Query: 1249 PYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDS 1305
            P  T    T+ TT +    K T   T  T   TTT   +    T++  TTT T   S
Sbjct: 55   PTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPS 111



 Score = 31.0 bits (69), Expect = 3.4
 Identities = 22/84 (26%), Positives = 29/84 (34%), Gaps = 7/84 (8%)

Query: 1254 CKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPT 1313
            C    AT   E             +  TTTT       TT  +  T+TTT +   T   T
Sbjct: 18   CASVCATAQGEGQYDAAVVEAAEGQSQTTTT-------TTTTTPPTTTTTTTTTTTTITT 70

Query: 1314 EWVWHPPTEPTTTHISVTEKSPLD 1337
                   T  TTT  + T ++P  
Sbjct: 71   TTTKTTTTTTTTTTTTTTTEAPSK 94


>gnl|CDD|119350 cd02871, GH18_chitinase_D-like, GH18 domain of Chitinase D (ChiD).
            ChiD, a chitinase found in Bacillus circulans, hydrolyzes
            the 1,4-beta-linkages of N-acetylglucosamine in chitin
            and chitodextrins.  The domain architecture of ChiD
            includes a catalytic glycosyl hydrolase family 18 (GH18)
            domain, a chitin-binding domain, and a fibronectin type
            III domain. The chitin-binding and fibronectin type III
            domains are located either N-terminal or C-terminal to
            the catalytic domain.  This family includes exochitinase
            Chi36 from Bacillus cereus.
          Length = 312

 Score = 54.6 bits (132), Expect = 3e-07
 Identities = 46/206 (22%), Positives = 77/206 (37%), Gaps = 30/206 (14%)

Query: 1713 IVCYFTNWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDN---- 1768
            +V Y+ NW     +G      D        +   FA   +D       +  +        
Sbjct: 3    LVGYWHNWDNG--AGSGRQDLDDVPSKYNVINVAFAEPTSDGGGEVTFNNGSSPGGYSPA 60

Query: 1769 KFYEKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGL 1828
            +F   + AL+ KG KV ++IGG N          +N       F+  +V  I E+ FDGL
Sbjct: 61   EFKADIKALQAKGKKVLISIGGANGHV------DLNHTAQEDNFVDSIVAIIKEYGFDGL 114

Query: 1829 DLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAA------VSPSKAV- 1881
            D+D E                       +K+L+  + P + +L+ A           A  
Sbjct: 115  DIDLE-----SGSNPLNATPVITNLISALKQLKDHYGP-NFILTMAPETPYVQGGYAAYG 168

Query: 1882 -IDNAYDIPV---MSENLDWISVMTY 1903
             I  AY +P+   + ++L W++V  Y
Sbjct: 169  GIWGAY-LPLIDNLRDDLTWLNVQYY 193



 Score = 50.4 bits (121), Expect = 6e-06
 Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 6/57 (10%)

Query: 1410 LKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWE 1466
            L+ KG KV ++IGG N  +       +N       F++ +V  + +Y FDGLD+D E
Sbjct: 69   LQAKGKKVLISIGGANGHV------DLNHTAQEDNFVDSIVAIIKEYGFDGLDIDLE 119



 Score = 40.4 bits (95), Expect = 0.010
 Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 8/68 (11%)

Query: 2246 EQGFYEKITKFNERGIQVILAIGGWSDSRSDKYSR-LVNDIPARRNFVRHVTEFLEFYGF 2304
               F   I     +G +V+++IGG             +N      NFV  +   ++ YGF
Sbjct: 59   PAEFKADIKALQAKGKKVLISIGG-------ANGHVDLNHTAQEDNFVDSIVAIIKEYGF 111

Query: 2305 NGLEFAWE 2312
            +GL+   E
Sbjct: 112  DGLDIDLE 119


>gnl|CDD|119354 cd02875, GH18_chitobiase, Chitobiase (also known as
            di-N-acetylchitobiase) is a lysosomal glycosidase that
            hydrolyzes the reducing-end N-acetylglucosamine from the
            chitobiose core of oligosaccharides during the ordered
            degradation of asparagine-linked glycoproteins in
            eukaryotes. Chitobiase can only do so if the asparagine
            that joins the oligosaccharide to protein is previously
            removed by a glycosylasparaginase. Chitobiase is
            therefore the final step in the lysosomal degradation of
            the protein/carbohydrate linkage component of
            asparagine-linked glycoproteins. The catalytic domain of
            chitobiase is an eight-stranded alpha/beta barrel fold
            similar to that of other family 18 glycosyl hydrolases
            such as hevamine and chitotriosidase.
          Length = 358

 Score = 52.0 bits (125), Expect = 2e-06
 Identities = 64/292 (21%), Positives = 105/292 (35%), Gaps = 35/292 (11%)

Query: 1778 KKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLDLDWEYPKC 1837
              KGV++ L              ++ N    R+++I   V        DG+++D E P  
Sbjct: 74   HSKGVRLVL-------KGDVPLEQISNPTY-RTQWIQQKVELAKSQFMDGINIDIEQP-- 123

Query: 1838 WQVDCKQGPASDKQGFADLIKELRAAF---NPHDLLLSAAVSPSKAVID-NAYDIPVMSE 1893
              +    G   +     +L+KE   AF   NP    +S  V+ S + ID   YD   +++
Sbjct: 124  --ITK--GS-PEYYALTELVKETTKAFKKENPGY-QISFDVAWSPSCIDKRCYDYTGIAD 177

Query: 1894 NLDWISVMTYDYHGQ-WDKKTGHVAPMYALPNDTTPTFNANYSLHYWVSHGADRKKVIFG 1952
              D++ VM YD   Q W K          +    +P        + +   G D KK++ G
Sbjct: 178  ASDFLVVMDYDEQSQIWGK--------ECIAGANSPYSQTLSGYNNFTKLGIDPKKLVMG 229

Query: 1953 MPMYGQSFTLADKNKNGLNSQTYGGAEAGEN-TRARGFLAYYEICDKIQKDGWVVVRDRK 2011
            +P YG  +   + N   +          G N + A G    Y    K             
Sbjct: 230  LPWYGYDYPCLNGNLEDVVCTIPKVPFRGANCSDAAGRQIPYSEIMKQINSSIGGRLWDS 289

Query: 2012 RRIGPYAFKGD-----QWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDF 2058
             +  P+    D       V +D+   +  K  + K   L G  +W  DL D+
Sbjct: 290  EQKSPFYNYKDKQGNLHQVWYDNPQSLSIKVAYAKNLGLKGIGMWNGDLLDY 341



 Score = 42.8 bits (101), Expect = 0.002
 Identities = 40/166 (24%), Positives = 69/166 (41%), Gaps = 25/166 (15%)

Query: 478 ISSGSNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKLIQE--- 534
           IS+ + R ++I+  +   +     G++ D   P+   +              +L++E   
Sbjct: 91  ISNPTYRTQWIQQKVELAKSQFMDGINIDIEQPITKGSPEYYA-------LTELVKETTK 143

Query: 535 -LKAEFDKHDYSIAVGISGYKEILEVAYDFPALNDHVEFMSLMSYDYHG-AWEG--ITGL 590
             K E   +  S  V  S    I +  YD+  + D  +F+ +M YD     W    I G 
Sbjct: 144 AFKKENPGYQISFDVAWSP-SCIDKRCYDYTGIADASDFLVVMDYDEQSQIWGKECIAGA 202

Query: 591 VSPLNSRPGEPYPNYNINTALKLIDELGGDKRKIVIGVPFYGQSYT 636
            SP  S+    Y N+          +LG D +K+V+G+P+YG  Y 
Sbjct: 203 NSP-YSQTLSGYNNFT---------KLGIDPKKLVMGLPWYGYDYP 238



 Score = 42.4 bits (100), Expect = 0.002
 Identities = 47/161 (29%), Positives = 74/161 (45%), Gaps = 30/161 (18%)

Query: 1435 LVNSATARQRFIEHVVKFLLKYQF-DGLDLDWEYPTCWQVNCDAGPDSDKESFGL--FVR 1491
            + N  T R ++I+  V+ L K QF DG+++D E P    +   +      E + L   V+
Sbjct: 91   ISNP-TYRTQWIQQKVE-LAKSQFMDGINIDIEQP----ITKGS-----PEYYALTELVK 139

Query: 1492 ELHQAFKPH--GLLLSAAVSPSKQVIN-AAYDVKALSESLDWISVMTYDYHGQ-WDKK-- 1545
            E  +AFK    G  +S  V+ S   I+   YD   ++++ D++ VM YD   Q W K+  
Sbjct: 140  ETTKAFKKENPGYQISFDVAWSPSCIDKRCYDYTGIADASDFLVVMDYDEQSQIWGKECI 199

Query: 1546 TGHVAPLYEHPDDDFFYFNANFTMNYWMKKGAPSRKLVMGV 1586
             G  +P          Y       N + K G   +KLVMG+
Sbjct: 200  AGANSP----------YSQTLSGYNNFTKLGIDPKKLVMGL 230



 Score = 41.6 bits (98), Expect = 0.004
 Identities = 51/276 (18%), Positives = 107/276 (38%), Gaps = 40/276 (14%)

Query: 101 SDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFSTFIQELS 160
           S+   R  +I   +E+      DG++++++ P      +             +  ++E +
Sbjct: 92  SNPTYRTQWIQQKVELAKSQFMDGINIDIEQPITKGSPEYYA---------LTELVKETT 142

Query: 161 STLRRNN--YQLTLT---SPGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVVPVK 215
              ++ N  YQ++     SP  ID+     D + +A   D +++  ++           +
Sbjct: 143 KAFKKENPGYQISFDVAWSPSCIDK--RCYDYTGIADASDFLVVMDYD----------EQ 190

Query: 216 PNTKVNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFGKSYRLFNRSEYGLGAT 275
                   +    + ++   S   N+ K G  P+++++G+P++G  Y   N +   +  T
Sbjct: 191 SQIWGKECIAGANSPYSQTLSGYNNFTKLGIDPKKLVMGLPWYGYDYPCLNGNLEDVVCT 250

Query: 276 VK------GPGTEGKYTQMPGYLAFFEVCNKFKDKTWRHFTDSNGEPFMVKKD-----EW 324
           +          ++    Q+P Y    +  N       R +      PF   KD       
Sbjct: 251 IPKVPFRGANCSDAAGRQIP-YSEIMKQIN--SSIGGRLWDSEQKSPFYNYKDKQGNLHQ 307

Query: 325 ITYENNDSIRRKMNYIKDRHLGGAMLWTLDLDDFRG 360
           + Y+N  S+  K+ Y K+  L G  +W  DL D+ G
Sbjct: 308 VWYDNPQSLSIKVAYAKNLGLKGIGMWNGDLLDYSG 343



 Score = 37.0 bits (86), Expect = 0.10
 Identities = 22/71 (30%), Positives = 33/71 (46%), Gaps = 5/71 (7%)

Query: 726 YYEICYRIKTRKWTVQRNTKSLEPFTFQGD-----QWVSYEDPTSLTEKVKFIKSNGYAG 780
           Y EI  +I +       +++   PF    D       V Y++P SL+ KV + K+ G  G
Sbjct: 271 YSEIMKQINSSIGGRLWDSEQKSPFYNYKDKQGNLHQVWYDNPQSLSIKVAYAKNLGLKG 330

Query: 781 IMAWTIDLDDF 791
           I  W  DL D+
Sbjct: 331 IGMWNGDLLDY 341


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
            glycoprotein (BLLF1).  This family consists of the BLLF1
            viral late glycoprotein, also termed gp350/220. It is the
            most abundantly expressed glycoprotein in the viral
            envelope of the Herpesviruses and is the major antigen
            responsible for stimulating the production of
            neutralising antibodies in vivo.
          Length = 830

 Score = 52.9 bits (126), Expect = 2e-06
 Identities = 28/102 (27%), Positives = 32/102 (31%), Gaps = 12/102 (11%)

Query: 1601 PGGNYESTT-EEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP 1659
            P G   ST  E+  PTS  +  T  +TS       P  T    TT  TS T     P P 
Sbjct: 474  PTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATS--PTTQKTSDTPNATSPTPI 531

Query: 1660 TTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
                TTT            P T          P+ T   P  
Sbjct: 532  VIGVTTTA---------TSPPTGTTSVPNATSPQVTEESPVN 564



 Score = 44.4 bits (104), Expect = 9e-04
 Identities = 23/96 (23%), Positives = 33/96 (34%), Gaps = 4/96 (4%)

Query: 1608 TTEEYKPTSEESKPTT-VSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTT 1666
            TT++   T   + PT  V   T   T  P  T        TS   T + P   T T   T
Sbjct: 515  TTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSV--PNATSPQVTEESPVNNTNTPVVT 572

Query: 1667 TRPKPIT-TIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
            + P  +T  +          PT+     P++    P
Sbjct: 573  SAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTP 608



 Score = 43.2 bits (101), Expect = 0.002
 Identities = 22/109 (20%), Positives = 27/109 (24%), Gaps = 8/109 (7%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
             T   + PTT  TS       P      +TTT TS  T T      T+   T   P   T
Sbjct: 507  TTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNT 566

Query: 1674 TIK-----PKPTTVKPKPT---TTVKPKPTTVKPKPVIPPSTKDEFKIV 1714
                    P   T         T   P             + +      
Sbjct: 567  NTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTST 615



 Score = 41.3 bits (96), Expect = 0.008
 Identities = 25/103 (24%), Positives = 37/103 (35%), Gaps = 9/103 (8%)

Query: 1608 TTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST--- 1664
            ++    P S  +  T + TS   T  E    E P   +TT  +T +  P P TT+     
Sbjct: 602  SSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGP 661

Query: 1665 --TTTRPKP----ITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
              ++T   P    +T   P P    P   +  K    TV    
Sbjct: 662  GNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTG 704



 Score = 39.0 bits (90), Expect = 0.039
 Identities = 39/196 (19%), Positives = 55/196 (28%), Gaps = 15/196 (7%)

Query: 1530 WISVMTYDYHGQWDKKTGHVAPLYEHPDDDFFYFNANFTMNYWMKKGAPSRKLVMGVHPL 1589
            W +V   D+  +W   T    P        FF  N  F +            ++      
Sbjct: 353  WGNVTQSDFKCKWVLATHGQNPEGCERSLGFFNSNRTFEVTVANPVADAKTLIITRTATN 412

Query: 1590 LSTITE--VLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTT 1647
             +T T   V    P               E  K T V T+  +       T      T+ 
Sbjct: 413  ATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSG 472

Query: 1648 STTTTTKRPKP----PTTTSTTTTRPKPI-TTIKPKP-----TTVKPKPTTTVK-PKPTT 1696
            + T TT    P    PT+ +T+ T      T     P     TT K   T     P P  
Sbjct: 473  TPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIV 532

Query: 1697 VKPKPV--IPPSTKDE 1710
            +        PP+    
Sbjct: 533  IGVTTTATSPPTGTTS 548



 Score = 35.9 bits (82), Expect = 0.29
 Identities = 31/132 (23%), Positives = 46/132 (34%), Gaps = 10/132 (7%)

Query: 1588 PLLSTITEVLGHG-------PGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEE 1640
            P   T ++V G G       PG  + +        +  S P+   T+    T    +   
Sbjct: 650  PGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANS 709

Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPK 1700
                T+ ST   +    P T      T P   TT  P  T+ K +P  T    P T K  
Sbjct: 710  TTKETSGSTLMAST--SPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQA 767

Query: 1701 PV-IPPSTKDEF 1711
             V +PP+   + 
Sbjct: 768  TVPVPPTQHPDH 779



 Score = 35.9 bits (82), Expect = 0.29
 Identities = 24/88 (27%), Positives = 28/88 (31%), Gaps = 9/88 (10%)

Query: 1249 PYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAW 1308
                    T A TT      PT + T  T   T+ T       TT  S  T TT+   A 
Sbjct: 495  ATPNATSPTPAVTTPN-ATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNAT 553

Query: 1309 TPNPTEWVWHPPTEPT--TTHISVTEKS 1334
            +P  TE        P   T    VT   
Sbjct: 554  SPQVTE------ESPVNNTNTPVVTSAP 575



 Score = 32.8 bits (74), Expect = 2.5
 Identities = 25/84 (29%), Positives = 32/84 (38%), Gaps = 10/84 (11%)

Query: 1261 TTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNP-----TEW 1315
            +T  +   PT + T  T   T+ T        T P TT  T+    A +P P     T  
Sbjct: 480  STLPEDTSPTSRTTSATPNATSPTPAVTTPNATSP-TTQKTSDTPNATSPTPIVIGVTTT 538

Query: 1316 VWHPP---TEPTT-THISVTEKSP 1335
               PP   T     T   VTE+SP
Sbjct: 539  ATSPPTGTTSVPNATSPQVTEESP 562



 Score = 32.8 bits (74), Expect = 2.9
 Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 1256 QTSATTTSEQVPKPTKKPTKPTKKPTT-----TTEYNPPEATTKPSTTTSTTTDSGAWTP 1310
            +T+A  T+  +P  +  PT  T  PT+     TT    PE T+  S TTS T ++ + TP
Sbjct: 445  KTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTP 504

Query: 1311 -NPTEWVWHPPTEPTTTHISVTEKSPL 1336
               T     P T+ T+   + T  +P+
Sbjct: 505  AVTTPNATSPTTQKTSDTPNATSPTPI 531



 Score = 31.3 bits (70), Expect = 9.3
 Identities = 25/126 (19%), Positives = 38/126 (30%), Gaps = 16/126 (12%)

Query: 1588 PLLSTITEVLGHGPGGNYESTTEEY-KPTSEESKPTT---------------VSTSTVVT 1631
            P++  +T      P G          + T E     T               V+T    T
Sbjct: 530  PIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGT 589

Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
               P   +  I +++ ST  +      P  TS   T  + IT   P   +     T +  
Sbjct: 590  GSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPG 649

Query: 1692 PKPTTV 1697
            P P T 
Sbjct: 650  PGPGTT 655


>gnl|CDD|119366 cd06549, GH18_trifunctional, GH18 domain of an uncharacterized family
            of bacterial proteins, which share a common three-domain
            architecture: an N-terminal glycosyl hydrolase family 18
            (GH18) domain, a glycosyl transferase family 2 domain,
            and a C-terminal polysaccharide deacetylase domain.
          Length = 298

 Score = 51.3 bits (123), Expect = 3e-06
 Identities = 41/151 (27%), Positives = 67/151 (44%), Gaps = 26/151 (17%)

Query: 1434 RLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVREL 1493
            RL+   +AR +FI ++  +L + Q DG+ LD+E            P  D   +  F+ EL
Sbjct: 81   RLLADPSARAKFIANIAAYLERNQADGIVLDFE----------ELPADDLPKYVAFLSEL 130

Query: 1494 HQAFKPHGLLLSAAVSPSKQVINAAYDVKALSESLDWISVMTYDYHGQWDKKTGHVAPLY 1553
             +     G  L+  V P+     A +++KAL+ + D + +M YD H Q     G   P+ 
Sbjct: 131  RRRLPAQGKQLTVTV-PAD---EADWNLKALARNADKLILMAYDEHYQG----GAPGPI- 181

Query: 1554 EHPDDDFFYFNANFTMNYWMKKGAPSRKLVM 1584
                 D+F  N    +     K  P  KL++
Sbjct: 182  --ASQDWFESNLAQAV-----KKLPPEKLIV 205



 Score = 45.1 bits (107), Expect = 3e-04
 Identities = 29/108 (26%), Positives = 51/108 (47%), Gaps = 14/108 (12%)

Query: 1801 RLVNSQQARSKFIAHVVNFILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKEL 1860
            RL+    AR+KFIA++  ++  +  DG+ LD+E          + PA D   +   + EL
Sbjct: 81   RLLADPSARAKFIANIAAYLERNQADGIVLDFE----------ELPADDLPKYVAFLSEL 130

Query: 1861 RAAFNPHDLLLSAAVSPSKAVIDNAYDIPVMSENLDWISVMTYDYHGQ 1908
            R         L+  V P+    +  +++  ++ N D + +M YD H Q
Sbjct: 131  RRRLPAQGKQLTVTV-PAD---EADWNLKALARNADKLILMAYDEHYQ 174



 Score = 35.8 bits (83), Expect = 0.26
 Identities = 22/102 (21%), Positives = 39/102 (38%), Gaps = 14/102 (13%)

Query: 482 SNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKLIQELKAEFDK 541
           S R KFI ++  +L R    G+  D+           +    D   +V  + EL+     
Sbjct: 87  SARAKFIANIAAYLERNQADGIVLDFE----------ELPADDLPKYVAFLSELRRRLPA 136

Query: 542 HDYSIAVGISGYKEILEVAYDFPALNDHVEFMSLMSYDYHGA 583
               + V +       E  ++  AL  + + + LM+YD H  
Sbjct: 137 QGKQLTVTVP----ADEADWNLKALARNADKLILMAYDEHYQ 174



 Score = 33.5 bits (77), Expect = 1.1
 Identities = 31/159 (19%), Positives = 62/159 (38%), Gaps = 33/159 (20%)

Query: 102 DRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESI-ANERSDFSTFIQELS 160
           D  +R  FI+++    +  + DG+ L             D E + A++   +  F+ EL 
Sbjct: 85  DPSARAKFIANIAAYLERNQADGIVL-------------DFEELPADDLPKYVAFLSELR 131

Query: 161 STLRRNNYQLTLTSPGVIDRKTSLVDISVVAPLVDLILLKSFNNDHMDDEVVPVKPNTKV 220
             L     QLT+T P       +  ++  +A   D ++L +++  +      P+      
Sbjct: 132 RRLPAQGKQLTVTVP----ADEADWNLKALARNADKLILMAYDEHYQGGAPGPIASQDWF 187

Query: 221 NIQVTSTIANFNNIESAVYNWIKKGARPEQIIIGIPFFG 259
                      +N+  AV     K   PE++I+ +  +G
Sbjct: 188 ----------ESNLAQAV-----KKLPPEKLIVALGSYG 211


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
            heparin sulfate proteoglycans which are implicated in the
            binding of extracellular matrix components and growth
            factors.
          Length = 207

 Score = 47.0 bits (112), Expect = 3e-05
 Identities = 23/88 (26%), Positives = 35/88 (39%), Gaps = 5/88 (5%)

Query: 1612 YKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
              P+ +E      +++T     +   T    +  TT+ +T+TK     +TT TTTT P  
Sbjct: 60   ATPSDDEDSEPVTTSATP---PKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSE 116

Query: 1672 ITTIKPKPTTVKPKPTTTVKPKPTTVKP 1699
              T +   TTV    T T         P
Sbjct: 117  TDT-EEATTTV-STETPTEGGSSAATDP 142



 Score = 35.1 bits (81), Expect = 0.30
 Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 6/82 (7%)

Query: 1260 TTTSEQVPKPTKKPTKPTKKPTT---TTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
             TTS   PK T   + P+   TT   +T+ +P  +TT  +TT+ + TD+       T   
Sbjct: 71   VTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDT---EEATTTVS 127

Query: 1317 WHPPTEPTTTHISVTEKSPLDQ 1338
               PTE  ++  +   K+ L++
Sbjct: 128  TETPTEGGSSAATDPSKNLLER 149



 Score = 34.3 bits (79), Expect = 0.44
 Identities = 25/94 (26%), Positives = 37/94 (39%), Gaps = 15/94 (15%)

Query: 1619 SKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPK 1678
            + P+    S  VTT     T   +TTT++S +  T      TT ST+T     ++T    
Sbjct: 60   ATPSDDEDSEPVTTS---ATPPKLTTTSSSPSNDT------TTASTSTKTSPTVSTTVTT 110

Query: 1679 PTTVKPKPTTTVKPKPTTV---KPKPVIPPSTKD 1709
             T+  P   T  +   TTV    P      +  D
Sbjct: 111  TTS--PS-ETDTEEATTTVSTETPTEGGSSAATD 141



 Score = 34.3 bits (79), Expect = 0.53
 Identities = 18/66 (27%), Positives = 27/66 (40%)

Query: 1239 WNDKMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTT 1298
              D           K   TS++ +++     T   T PT   T TT  +P E  T+ +TT
Sbjct: 65   DEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATT 124

Query: 1299 TSTTTD 1304
            T +T  
Sbjct: 125  TVSTET 130



 Score = 33.2 bits (76), Expect = 1.3
 Identities = 16/78 (20%), Positives = 23/78 (29%), Gaps = 5/78 (6%)

Query: 1635 PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKP 1694
            P   E+    TT++T           +  TTT      T+    PT      TTT  P  
Sbjct: 62   PSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTS----PTVSTTVTTTTS-PSE 116

Query: 1695 TTVKPKPVIPPSTKDEFK 1712
            T  +       +      
Sbjct: 117  TDTEEATTTVSTETPTEG 134



 Score = 31.2 bits (71), Expect = 4.5
 Identities = 16/59 (27%), Positives = 21/59 (35%), Gaps = 7/59 (11%)

Query: 1279 KPTTTTEYNPPEATTKPS------TTTSTTTDSGAWTPNPTEWVWHPPTEPTTTHISVT 1331
                TT   PP+ TT  S      TT ST+T +              P+E  T   + T
Sbjct: 68   SEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTT-TSPSETDTEEATTT 125



 Score = 30.5 bits (69), Expect = 8.2
 Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 1/51 (1%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTE-EPITTTTTSTTTTTKRPKPPTTTS 1663
              S  +K +   ++TV TT  P +T+ E  TTT ++ T T       T  S
Sbjct: 93   TASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTETPTEGGSSAATDPS 143


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
            membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 47.1 bits (112), Expect = 4e-05
 Identities = 21/84 (25%), Positives = 27/84 (32%), Gaps = 3/84 (3%)

Query: 1629 VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
            +V         + +   T       + P+      T    P   T  KPK      K   
Sbjct: 36   LVPLAVFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPK 95

Query: 1689 TVKPKPTTVKPKPVIPPSTKDEFK 1712
              KPKP   KPKP   P  K + K
Sbjct: 96   KPKPKP---KPKPKPKPKVKPQPK 116



 Score = 45.5 bits (108), Expect = 1e-04
 Identities = 30/99 (30%), Positives = 36/99 (36%), Gaps = 12/99 (12%)

Query: 1613 KPTSEESKPTTVSTS-TVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
             P +       V  + T     EPE  EE     T       + P  PT       +PK 
Sbjct: 37   VPLAVFLLAAKVLEAPTEEPQPEPEPPEEQPKPPTE-----PETPPEPTPP-----KPKE 86

Query: 1672 ITTIKPKPTTVKPKPTTTVKPKPTTV-KPKPVIPPSTKD 1709
                + KP   KPKP    KPKP    +PKP  PPS   
Sbjct: 87   KPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTA 125



 Score = 33.6 bits (77), Expect = 0.90
 Identities = 23/102 (22%), Positives = 30/102 (29%), Gaps = 6/102 (5%)

Query: 1608 TTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTT 1667
                  PT EE +P            +P    E     T        +P+          
Sbjct: 46   AKVLEAPT-EEPQPEP---EPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKP 101

Query: 1668 RPKPITTIKPKPT-TVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
            +PKP    K KP    K  P+ T    P     +P  PPS  
Sbjct: 102  KPKPKPKPKVKPQPKPKKPPSKTAAKAPAA-PNQPARPPSAA 142


>gnl|CDD|119363 cd06546, GH18_CTS3_chitinase, GH18 domain of CTS3 (chitinase 3), an
            uncharacterized protein from the human fungal pathogen
            Coccidioides posadasii.  CTS3 has a chitinase-like
            glycosyl hydrolase family 18 (GH18) domain; and has
            homologs in bacteria as well as fungi.
          Length = 256

 Score = 46.6 bits (111), Expect = 6e-05
 Identities = 39/168 (23%), Positives = 59/168 (35%), Gaps = 39/168 (23%)

Query: 1712 KIVCYFTNWAWYRQSGGKYLPSDIDSDL----CTHVIYGFAVLDTDQLVI----KPHDT- 1762
            ++V Y+        S G  + S +         TH+I     ++ D  +      P    
Sbjct: 1    RLVIYYQT---THPSNGDPISSLLLVTEKGIALTHLIVAALHINDDGNIHLNDHPPDHPR 57

Query: 1763 ----WADLDNKFYEKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVN 1818
                W +L          L+  GVKV   +GG    A   +SRL +  +   ++   + +
Sbjct: 58   FTTLWTELA--------ILQSSGVKVMGMLGGA---APGSFSRLDDDDEDFERYYGQLRD 106

Query: 1819 FILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNP 1866
             I     DGLDLD E P                G   LI  LR+ F P
Sbjct: 107  MIRRRGLDGLDLDVEEPM------SLD------GIIRLIDRLRSDFGP 142



 Score = 41.9 bits (99), Expect = 0.002
 Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 1409 TLKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLDLDWEYP 1468
             L+  GVKV   +GG      G +SRL +     +R+   +   + +   DGLDLD E P
Sbjct: 67   ILQSSGVKVMGMLGGAAP---GSFSRLDDDDEDFERYYGQLRDMIRRRGLDGLDLDVEEP 123



 Score = 35.0 bits (81), Expect = 0.36
 Identities = 25/130 (19%), Positives = 48/130 (36%), Gaps = 19/130 (14%)

Query: 413 TH-IVYAFASLDPNTLSIQAGNPEADIDDNFYQRISSSPLVTHGKVKILIAIGGWTDSSG 471
           TH IV A    D   + +    P+      F    +   ++    VK++  +GG    S 
Sbjct: 31  THLIVAALHINDDGNIHLNDHPPD---HPRFTTLWTELAILQSSGVKVMGMLGGAAPGS- 86

Query: 472 EKYSQLISSGSNRKKFIKSVLTFLRRFDFAGLHFDWNYPVCWQADCSKQHKADKGNFVKL 531
             +S+L     + +++   +   +RR    GL  D   P+                 ++L
Sbjct: 87  --FSRLDDDDEDFERYYGQLRDMIRRRGLDGLDLDVEEPM------------SLDGIIRL 132

Query: 532 IQELKAEFDK 541
           I  L+++F  
Sbjct: 133 IDRLRSDFGP 142


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
            glycoprotein G. 
          Length = 408

 Score = 47.4 bits (112), Expect = 7e-05
 Identities = 28/135 (20%), Positives = 48/135 (35%), Gaps = 20/135 (14%)

Query: 1613 KPTSEESKPTTVSTSTVVTTEE--PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPK 1670
            K TS  ++ +  +  + VT +     Q  +  T     ++T        T  +TT++ P+
Sbjct: 176  KTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSST----EPVGTQGTTTSSNPE 231

Query: 1671 PITTIKPK-----------PTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKIVCYFTN 1719
            P T   P            P+T     +TT   +  T   +   PP+T +        T 
Sbjct: 232  PQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHT--QRRKTPPATSNRRSPHSTATP 289

Query: 1720 WAW-YRQSGGKYLPS 1733
                 RQ  G+  P 
Sbjct: 290  PPTTKRQETGRPTPR 304



 Score = 47.0 bits (111), Expect = 9e-05
 Identities = 30/108 (27%), Positives = 41/108 (37%), Gaps = 6/108 (5%)

Query: 1600 GPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP 1659
             P G   +TT        E  P+    S   + + P  T     +TT      T+R K P
Sbjct: 217  EPVGTQGTTTSSNPEPQTEPPPSQRGPSG--SPQHPPSTTSQDQSTTGDGQEHTQRRKTP 274

Query: 1660 TTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
              TS   +   P +T  P PTT K + T    P+PT        PP +
Sbjct: 275  PATSNRRS---PHSTATPPPTT-KRQETGRPTPRPTATTQSGSSPPHS 318



 Score = 38.5 bits (89), Expect = 0.038
 Identities = 20/92 (21%), Positives = 29/92 (31%), Gaps = 6/92 (6%)

Query: 1238 YWNDKMKTCDWPYRTK----CKQT-SATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEA- 1291
            Y       C    R K    CK+  +A TTS+    PT+          T       +  
Sbjct: 145  YNPRDRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGH 204

Query: 1292 TTKPSTTTSTTTDSGAWTPNPTEWVWHPPTEP 1323
             T  +    ++T+        T     P TEP
Sbjct: 205  QTATANQRLSSTEPVGTQGTTTSSNPEPQTEP 236



 Score = 31.2 bits (70), Expect = 7.3
 Identities = 19/77 (24%), Positives = 28/77 (36%), Gaps = 11/77 (14%)

Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
            T  TTTS   P+P  +P    + P+ + +       +  S   STT D    T       
Sbjct: 221  TQGTTTS-SNPEPQTEPPPSQRGPSGSPQ----HPPSTTSQDQSTTGDGQEHTQRRKT-- 273

Query: 1317 WHPPT--EPTTTHISVT 1331
              PP      + H + T
Sbjct: 274  --PPATSNRRSPHSTAT 288



 Score = 30.8 bits (69), Expect = 8.9
 Identities = 12/62 (19%), Positives = 22/62 (35%), Gaps = 3/62 (4%)

Query: 1242 KMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTST 1301
            + KT       +     +T T        ++  +PT +PT TT+       + P    + 
Sbjct: 270  RRKTPPATSNRR--SPHSTATPPPTT-KRQETGRPTPRPTATTQSGSSPPHSSPPGVQAN 326

Query: 1302 TT 1303
             T
Sbjct: 327  PT 328


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General function
            prediction only].
          Length = 872

 Score = 47.2 bits (112), Expect = 1e-04
 Identities = 33/112 (29%), Positives = 42/112 (37%), Gaps = 6/112 (5%)

Query: 1572 WMKKGAPSRKLVMGVHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVT 1631
            W       R  V  + P   T T +   G        T E   +  +++ T   ++   T
Sbjct: 742  WGFIPTTKRVKVRIMDPASGTGTSITTSGT------FTAEVPQSPTKTETTLSYSAYSNT 795

Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK 1683
            +   E T   IT T T T TTT  P P  TTS T T     TT  P  TT  
Sbjct: 796  SILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTG 847



 Score = 42.9 bits (101), Expect = 0.002
 Identities = 28/91 (30%), Positives = 32/91 (35%), Gaps = 17/91 (18%)

Query: 1623 TVSTSTVVTTEEPEQTEEPITTTTTSTT---------------TTTKRPKPPTTTSTTTT 1667
               T T +TT      E P + T T TT               T+    K  T T TTT+
Sbjct: 759  ASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTS 818

Query: 1668 RPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
             P P  T    PT      TTT  P  TT  
Sbjct: 819  SPSPTQT--TSPTQTSTSTTTTTSPSQTTTG 847



 Score = 42.2 bits (99), Expect = 0.004
 Identities = 27/113 (23%), Positives = 35/113 (30%), Gaps = 16/113 (14%)

Query: 1611 EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITT----------------TTTSTTTTTK 1654
            +   T +  +  TV  S  V          P T                  TTS T T +
Sbjct: 716  KNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAE 775

Query: 1655 RPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
             P+ PT T TT +      T     TT      T  + + TT  P P    S 
Sbjct: 776  VPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSP 828



 Score = 35.6 bits (82), Expect = 0.37
 Identities = 15/55 (27%), Positives = 20/55 (36%), Gaps = 3/55 (5%)

Query: 1602 GGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTT---STTTTT 1653
                  TT      +     TT S+ +   T  P QT    TTTT+   +TT   
Sbjct: 795  TSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGG 849



 Score = 34.8 bits (80), Expect = 0.76
 Identities = 12/59 (20%), Positives = 23/59 (38%)

Query: 1248 WPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSG 1306
              Y      +    T+  V   T   T+ T    + T+   P  T+  +TTT++ + + 
Sbjct: 787  LSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTT 845



 Score = 34.1 bits (78), Expect = 1.3
 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 3/56 (5%)

Query: 1258 SATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTS---TTTDSGAWTP 1310
               TTS  + K   +    T  P+ T   +P + +T  +TTTS   TTT  G   P
Sbjct: 798  LIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGGICGP 853



 Score = 33.3 bits (76), Expect = 2.0
 Identities = 18/81 (22%), Positives = 26/81 (32%), Gaps = 5/81 (6%)

Query: 1260 TTTSEQVPKPTKK-----PTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTE 1314
            T    Q P  T+          T     TT     +  T+  TTTS+ + +   +P  T 
Sbjct: 773  TAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTS 832

Query: 1315 WVWHPPTEPTTTHISVTEKSP 1335
                  T P+ T        P
Sbjct: 833  TSTTTTTSPSQTTTGGGICGP 853



 Score = 33.3 bits (76), Expect = 2.4
 Identities = 15/52 (28%), Positives = 19/52 (36%), Gaps = 1/52 (1%)

Query: 1256 QTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGA 1307
                T T  Q    T  P+ PT+  + T        TT PS TT+     G 
Sbjct: 803  SVVITKTVTQTQTTTSSPS-PTQTTSPTQTSTSTTTTTSPSQTTTGGGICGP 853


>gnl|CDD|225629 COG3087, FtsN, Cell division protein [Cell division and chromosome
            partitioning].
          Length = 264

 Score = 46.0 bits (109), Expect = 1e-04
 Identities = 20/79 (25%), Positives = 25/79 (31%), Gaps = 2/79 (2%)

Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
            T+  EQ E+        T +        TT  T   +PKP    KP+P    P P    K
Sbjct: 112  TQLGEQPEQARIEEQPRTQSQKA-QSQATTVQTQPVKPKPRPE-KPQPVAPAPAPEPVEK 169

Query: 1692 PKPTTVKPKPVIPPSTKDE 1710
                   P P        E
Sbjct: 170  APKAEAAPPPKPKAEDAAE 188



 Score = 36.7 bits (85), Expect = 0.097
 Identities = 17/104 (16%), Positives = 34/104 (32%), Gaps = 12/104 (11%)

Query: 1619 SKPTTVSTST---------VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP 1669
            ++P  V  +          +   E  ++ +              ++P+  +  + +    
Sbjct: 81   TEPAAVKDAERLTPEQRQLLEQMEVDQKAQPTQLGEQPEQARIEEQPRTQSQKAQSQATT 140

Query: 1670 KPITTIKPKPTTVKPK---PTTTVKPKPTTVKPKPVIPPSTKDE 1710
                 +KPKP   KP+   P    +P     K +   PP  K E
Sbjct: 141  VQTQPVKPKPRPEKPQPVAPAPAPEPVEKAPKAEAAPPPKPKAE 184


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 585

 Score = 46.7 bits (111), Expect = 1e-04
 Identities = 14/75 (18%), Positives = 21/75 (28%), Gaps = 2/75 (2%)

Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKP-KPTTVKP 1699
            P     T+   +  RP P  +T         I   +P   T    P    +P  P     
Sbjct: 366  PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRET-ATPPPVPPRPVAPPVPHT 424

Query: 1700 KPVIPPSTKDEFKIV 1714
                P  T+    + 
Sbjct: 425  PESAPKLTRAAIPVD 439



 Score = 42.9 bits (101), Expect = 0.002
 Identities = 24/99 (24%), Positives = 33/99 (33%), Gaps = 7/99 (7%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
            P  + +KPT  + S V  T  P    +             +    P         P+P+ 
Sbjct: 364  PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVP-----PRPVA 418

Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
               P P T +  P  T    P   KPK   P   K+E K
Sbjct: 419  P--PVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEK 455



 Score = 35.9 bits (83), Expect = 0.31
 Identities = 17/82 (20%), Positives = 25/82 (30%), Gaps = 2/82 (2%)

Query: 1249 PYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTT--STTTDSG 1306
               +  + T A +T  +       P K   + T T    PP     P   T  S    + 
Sbjct: 374  AAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTR 433

Query: 1307 AWTPNPTEWVWHPPTEPTTTHI 1328
            A  P   +  + PP  P     
Sbjct: 434  AAIPVDEKPKYTPPAPPKEEEK 455



 Score = 35.6 bits (82), Expect = 0.40
 Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 9/59 (15%)

Query: 1656 PKPPTTT------STTTTRPKPITTIKPKPTTVKPKPTTTVKPK---PTTVKPKPVIPP 1705
            P P          + +  RP P  + +PK       P      +   P  V P+PV PP
Sbjct: 362  PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPP 420



 Score = 32.5 bits (74), Expect = 3.4
 Identities = 14/77 (18%), Positives = 22/77 (28%), Gaps = 7/77 (9%)

Query: 1257 TSATTTSEQVPKPT-KKPTK-PTKKPTTTTEYNPPEATTKPSTTT-STTTDSGAWTPNPT 1313
             +          P+  +PT  P+ +P      N P       T T           P P 
Sbjct: 364  PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPH 423

Query: 1314 EWVWHPPTEPTTTHISV 1330
                 P + P  T  ++
Sbjct: 424  T----PESAPKLTRAAI 436



 Score = 32.1 bits (73), Expect = 4.0
 Identities = 15/87 (17%), Positives = 23/87 (26%), Gaps = 4/87 (4%)

Query: 1601 PGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPT 1660
            P     +     +PT   S     + +  +  +EP   E                P  P 
Sbjct: 368  PAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEP-VRETATPPPVPPRPVAPPVPHTPE 426

Query: 1661 TTSTTTTRPKPITTIKPKPTTVKPKPT 1687
            +    T    P+     KP    P P 
Sbjct: 427  SAPKLTRAAIPVDE---KPKYTPPAPP 450


>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp). 
          Length = 305

 Score = 45.5 bits (108), Expect = 2e-04
 Identities = 26/105 (24%), Positives = 33/105 (31%), Gaps = 8/105 (7%)

Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT--S 1663
               T  Y  +     P   ++  V T      T+ P  TT    ++TT      T T   
Sbjct: 4    TELTFSYNLSDTTLFPNA-TSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTLHD 62

Query: 1664 TTTTRPKPITTIKPKPTTV---KPKPTT--TVKPKPTTVKPKPVI 1703
             T        T     T      P PTT  T  P PT V   P +
Sbjct: 63   VTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPTPVPSSPAV 107



 Score = 40.5 bits (95), Expect = 0.009
 Identities = 24/103 (23%), Positives = 31/103 (30%), Gaps = 18/103 (17%)

Query: 1618 ESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT-----STTTTRPKPI 1672
                 T S +   TT  P  T + + T T+ST T     K PT T     S+TT     +
Sbjct: 2    SVTELTFSYNLSDTTLFPNATSKGVKTVTSSTDT-----KAPTNTTYRCVSSTTVPMTNV 56

Query: 1673 TTIK--------PKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
            T               T     T      P+        P  T
Sbjct: 57   TVTLHDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPT 99


>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).  This
            family consists of several MGC-24 (or Cd164 antigen)
            proteins from eukaryotic organisms. MGC-24/CD164 is a
            sialomucin expressed in many normal and cancerous
            tissues. In humans, soluble and transmembrane forms of
            MGC-24 are produced by alternative splicing.
          Length = 187

 Score = 43.5 bits (102), Expect = 4e-04
 Identities = 23/87 (26%), Positives = 33/87 (37%), Gaps = 8/87 (9%)

Query: 1608 TTEEYKPTSEESKPTTVSTSTVV--TTEEPEQTEEPITTTTT-----STTTTTKRPKPPT 1660
            +T  ++    E      S + VV    +    T+     TTT     ST   T  P P T
Sbjct: 73   STCVWQQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTT 132

Query: 1661 TTSTTTTRPKPITTIKPKPTTVKPKPT 1687
            +    T+ PK  TT+ P     + K T
Sbjct: 133  SHHHVTSEPKTNTTVTPTSQPDR-KST 158



 Score = 33.4 bits (76), Expect = 0.78
 Identities = 26/108 (24%), Positives = 33/108 (30%), Gaps = 23/108 (21%)

Query: 1217 CNKYQYCIWGSYQVASCSPGLYWNDKMKTCDWPYRTKCK----------QTSATTTSEQV 1266
             N    C+  ++  ++C     W    + C       C           Q   TT S  V
Sbjct: 59   LNSCVSCVNRTHNNSTC----VW----QQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSV 110

Query: 1267 PKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTE 1314
               T  PT  T KPT T     P  TT     TS    +   TP    
Sbjct: 111  ATTTPVPTNSTAKPTIT-----PSPTTSHHHVTSEPKTNTTVTPTSQP 153



 Score = 33.4 bits (76), Expect = 0.91
 Identities = 28/101 (27%), Positives = 37/101 (36%), Gaps = 22/101 (21%)

Query: 1623 TVSTSTVV----TTEEPEQ--TEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK 1676
            T + ST V      EEP    ++  +  +      TT      TTT   T      +T K
Sbjct: 69   THNNSTCVWQQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTN-----STAK 123

Query: 1677 PKPTTVKPKPTTTVKPKPTTVKPK------PVIPPSTKDEF 1711
            P   T+ P PTT+      T +PK      P   P  K  F
Sbjct: 124  P---TITPSPTTS--HHHVTSEPKTNTTVTPTSQPDRKSTF 159


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 44.1 bits (103), Expect = 4e-04
 Identities = 27/94 (28%), Positives = 38/94 (40%), Gaps = 3/94 (3%)

Query: 1603 GNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT 1662
            GN   TT    P+   S P+T  ++T+ TT  P  T   ++T TT+ T+T     P  TT
Sbjct: 35   GNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTT 94

Query: 1663 STTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
            S  +T      T K     +      T      T
Sbjct: 95   SNAST---INVTTKVTAQNITATEAGTGTSTGVT 125



 Score = 41.4 bits (96), Expect = 0.003
 Identities = 29/110 (26%), Positives = 47/110 (42%), Gaps = 2/110 (1%)

Query: 1590 LSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTV-VTTEEPEQTEEPITT-TTT 1647
            L+T +  +      +  +TT     T+    PTT + ST+ VTT+   Q        T T
Sbjct: 61   LTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGT 120

Query: 1648 STTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTV 1697
            ST  T+      ++T++ TTR    TT+ P  ++      T    +  TV
Sbjct: 121  STGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTV 170



 Score = 41.4 bits (96), Expect = 0.003
 Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 7/82 (8%)

Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
             S  +    V+ +T VTT  P  +      +TT TTT+     P TTT+  +T     TT
Sbjct: 28   GSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSA----PITTTAILSTN---TTT 80

Query: 1675 IKPKPTTVKPKPTTTVKPKPTT 1696
            +    TTV P PTT+       
Sbjct: 81   VTSTGTTVTPVPTTSNASTINV 102



 Score = 34.9 bits (79), Expect = 0.34
 Identities = 27/78 (34%), Positives = 35/78 (44%), Gaps = 7/78 (8%)

Query: 1626 TSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTT--TSTTTTRPKPITTIKPKPTTVK 1683
            TS + T+           T TT+ TT +     P+T  ++T TT   PITT     T + 
Sbjct: 20   TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITT-----TAIL 74

Query: 1684 PKPTTTVKPKPTTVKPKP 1701
               TTTV    TTV P P
Sbjct: 75   STNTTTVTSTGTTVTPVP 92



 Score = 34.9 bits (79), Expect = 0.36
 Identities = 25/115 (21%), Positives = 39/115 (33%), Gaps = 4/115 (3%)

Query: 1591 STITEVLGHGPGGNYESTTEEYKPTSEESKPTTV----STSTVVTTEEPEQTEEPITTTT 1646
            +  T V    P  +  ST +    T+  +  TT     + +T VT+     T  P T+  
Sbjct: 38   TGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNA 97

Query: 1647 TSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
            ++   TTK      T +   T      T      +      TT     TT+ P  
Sbjct: 98   STINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTL 152


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 43.5 bits (103), Expect = 5e-04
 Identities = 19/76 (25%), Positives = 26/76 (34%), Gaps = 4/76 (5%)

Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
             E     +    +   +T      P  P        +PKP+   KPKP   + K      
Sbjct: 72   AEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPKPKPVEKPKPKP-KPQQKVEAPPA 130

Query: 1692 PKPTTVKPKPVIPPST 1707
            PKP    PKPV+    
Sbjct: 131  PKPE---PKPVVEEKA 143



 Score = 42.3 bits (100), Expect = 0.001
 Identities = 21/78 (26%), Positives = 28/78 (35%), Gaps = 5/78 (6%)

Query: 1635 PE-QTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPK 1693
            PE   E        + +       PP T       P P+   KPKP   KPKP    + K
Sbjct: 68   PEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPE--PAPVEPPKPKPVE-KPKPKPKPQQK 124

Query: 1694 PT-TVKPKPVIPPSTKDE 1710
                  PKP   P  +++
Sbjct: 125  VEAPPAPKPEPKPVVEEK 142



 Score = 30.4 bits (69), Expect = 9.5
 Identities = 14/40 (35%), Positives = 19/40 (47%), Gaps = 5/40 (12%)

Query: 1673 TTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
             T+ P  T V+P+P     PKP     KPV  P  K + +
Sbjct: 88   ATVAPPNTPVEPEPAPVEPPKP-----KPVEKPKPKPKPQ 122


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 43.6 bits (102), Expect = 9e-04
 Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 9/108 (8%)

Query: 1607 STTEEYKPTSEESKPTT---VSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
            +T +E  P  E++  T     S S +    +P+Q E    +   S T   K+ +  TTT 
Sbjct: 67   NTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEE--QKSLNISATPAPKQEQSQTTTE 124

Query: 1664 TTTTRPKPITTIKPKPTTVKP-KPTTTVKPK-PTTVKPKPVIPPSTKD 1709
            +TT  PK   T  P   T +P + T +  P+ PT  + +  + P  +D
Sbjct: 125  STT--PKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMTPKYED 170



 Score = 36.6 bits (84), Expect = 0.17
 Identities = 18/97 (18%), Positives = 28/97 (28%), Gaps = 19/97 (19%)

Query: 1624 VSTSTVVTTEEPEQTEEPIT----------TTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
            ++      T +   T +  T             TS +   K  +P      +        
Sbjct: 54   ITAGANSATTQAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISAT-- 111

Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
                 P   + +  TT   + TT K K   PPST   
Sbjct: 112  -----PAPKQEQSQTT--TESTTPKTKVTTPPSTNTP 141



 Score = 35.5 bits (81), Expect = 0.33
 Identities = 30/136 (22%), Positives = 55/136 (40%), Gaps = 11/136 (8%)

Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPST-TTSTTTDSGAWTPNPTEW 1315
             S      Q  +  +K    +  P    E    + TT+ +T  T  TT     TP P + 
Sbjct: 89   ASKIEKISQPKQEEQKSLNISATPAPKQEQ--SQTTTESTTPKTKVTTPPSTNTPQPMQS 146

Query: 1316 VWHP-PTEPTTTHISVTEKSPLDQYFKIVCYFTNWAWYRPGKGKYVLKK---VPWINILK 1371
                 P  PT    + T+ +P  +Y  +  Y+T  ++    +  ++LK    V ++N++ 
Sbjct: 147  TKSDTPQSPTIKQ-AQTDMTP--KYEDLRAYYTKPSFEFEKQFGFLLKPWTTVRFMNVIP 203

Query: 1372 LYVTLQTGL-GTDQEK 1386
                 +  L G D++K
Sbjct: 204  NRFIYKIALVGKDEKK 219



 Score = 33.6 bits (76), Expect = 1.5
 Identities = 27/97 (27%), Positives = 38/97 (39%), Gaps = 7/97 (7%)

Query: 1594 TEVLGHGPGGNYEST-----TEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTS 1648
            T  L   P  N E T      +  +P  EE K   +S +     E+ + T E  T  T  
Sbjct: 73   TPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKV 132

Query: 1649 TTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPK 1685
            TT  +     P  ++ + T   P  TIK   T + PK
Sbjct: 133  TTPPSTNTPQPMQSTKSDTPQSP--TIKQAQTDMTPK 167


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
            family consists of REP proteins from Dictyostelium (Slime
            molds). REP protein is likely involved in transcription
            regulation and control of DNA replication, specifically
            amplification of plasmid at low copy numbers. The
            formation of homomultimers may be required for their
            regulatory activity.
          Length = 910

 Score = 44.0 bits (104), Expect = 0.001
 Identities = 26/120 (21%), Positives = 40/120 (33%), Gaps = 13/120 (10%)

Query: 1589 LLSTITEVLGHGPGGNYESTTEEYK--PTSE-ESKPTTVSTSTVVTT----EEPEQTEEP 1641
            +LS    +       + E  +E Y   P    E   + +   ++ +        + ++ P
Sbjct: 193  MLSLFHPIKVDSQNRDEEILSEIYSLYPFVNLEVNESDIEQISINSENIQRINSQPSKRP 252

Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
              TTTT+TTTTT      TT    T + K I   K        K     KP         
Sbjct: 253  NNTTTTTTTTTT------TTFQPRTRKRKSIDDHKLSLNQAPEKFKNNTKPDDDPQSDFS 306



 Score = 36.3 bits (84), Expect = 0.22
 Identities = 28/116 (24%), Positives = 38/116 (32%), Gaps = 10/116 (8%)

Query: 1630 VTTEEPEQTEEPITTTTTSTTTTTKRP-KPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
            V  E  E   E I+  + +      +P K P  T+TTTT     TT   +P T K K   
Sbjct: 222  VNLEVNESDIEQISINSENIQRINSQPSKRPNNTTTTTTT---TTTTTFQPRTRKRKSID 278

Query: 1689 TVKPKPTTVK-PKPVIPPSTKDEFKIVCYFTNWAWYRQSGG-KYLPSDIDSDLCTH 1742
              K                  D         +    R+SG  K +  D  S   +H
Sbjct: 279  DHKLSLNQAPEKFKNNTKPDDDPQSDFSDKGS----RKSGSLKDVRIDNISCSVSH 330



 Score = 32.1 bits (73), Expect = 5.4
 Identities = 22/93 (23%), Positives = 33/93 (35%), Gaps = 14/93 (15%)

Query: 1255 KQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTT------STTTDSGAW 1308
              T+ TTT+   P+  K+ +    K +        +  TKP            +  SG+ 
Sbjct: 257  TTTTTTTTTTFQPRTRKRKSIDDHKLSLNQAPEKFKNNTKPDDDPQSDFSDKGSRKSGSL 316

Query: 1309 TP----NPTEWVWHPPT----EPTTTHISVTEK 1333
                  N +  V H        PT T ISV+ K
Sbjct: 317  KDVRIDNISCSVSHNGVNISISPTETIISVSNK 349


>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
          Length = 328

 Score = 42.6 bits (100), Expect = 0.002
 Identities = 23/98 (23%), Positives = 35/98 (35%), Gaps = 5/98 (5%)

Query: 1602 GGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTT 1661
             GN    T + +     +         V+  ++P+ T +   T       T KR +P   
Sbjct: 135  NGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQATAK---TEPKPVAQTPKRTEPAAP 191

Query: 1662 TSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKP 1699
             ++T   P   +T  PK T     P  T  P  TT  P
Sbjct: 192  VASTKA-PAATSTPAPKETA-TTAPVQTASPAQTTATP 227



 Score = 37.6 bits (87), Expect = 0.062
 Identities = 19/68 (27%), Positives = 26/68 (38%), Gaps = 12/68 (17%)

Query: 1256 QTSATTTSEQVPKPTKK-----PTKPTKKPTTTTEYNP-------PEATTKPSTTTSTTT 1303
            Q +A T  + V +  K+     P   TK P  T+   P       P  T  P+ TT+T  
Sbjct: 169  QATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPA 228

Query: 1304 DSGAWTPN 1311
              G    N
Sbjct: 229  AGGKTAGN 236



 Score = 36.5 bits (84), Expect = 0.14
 Identities = 22/76 (28%), Positives = 29/76 (38%), Gaps = 6/76 (7%)

Query: 1637 QTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK-----PKPTTTVK 1691
            QT E   TT  +       PK P  T+ T  +P   T  + +P         P  T+T  
Sbjct: 146  QTAERPATTRPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPA 205

Query: 1692 PKPT-TVKPKPVIPPS 1706
            PK T T  P     P+
Sbjct: 206  PKETATTAPVQTASPA 221



 Score = 34.1 bits (78), Expect = 0.88
 Identities = 28/97 (28%), Positives = 39/97 (40%), Gaps = 12/97 (12%)

Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP-KP 1671
            +P +++        ST+ T  EP  T  P+     S  T     K  T     TTRP + 
Sbjct: 107  QPQNQQQLNNVAVNSTLPT--EP-ATVAPVRNGNASRQT----AKTQTAERPATTRPARK 159

Query: 1672 ITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
               I+PK    KP+ T   +PKP    PK   P +  
Sbjct: 160  QAVIEPK----KPQATAKTEPKPVAQTPKRTEPAAPV 192



 Score = 33.4 bits (76), Expect = 1.6
 Identities = 20/95 (21%), Positives = 28/95 (29%), Gaps = 28/95 (29%)

Query: 1255 KQTSATTTSEQVP--------------KPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTS 1300
            +QT+ T T+E+                KP        K    T +   P A    +   +
Sbjct: 140  RQTAKTQTAERPATTRPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPA 199

Query: 1301 TTTDSGAWTPNPTEWVWHPPTEPTTTHISVTEKSP 1335
             T+     TP P           T T   V   SP
Sbjct: 200  ATS-----TPAPK---------ETATTAPVQTASP 220


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is a
            family of proteins conserved in yeasts. The function is
            not known. The Schizosaccharomyces pombe member is
            SPBC18E5.07 and the Saccharomyces cerevisiae member is
            AIM21.
          Length = 671

 Score = 43.0 bits (101), Expect = 0.002
 Identities = 21/97 (21%), Positives = 32/97 (32%), Gaps = 12/97 (12%)

Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPI 1672
            +   E+        S  + T E    E   ++   +      RPK      T+  R  P 
Sbjct: 482  RDVWED-----APESQELVTTEETPEEVKSSSPGVTKPAIPSRPKKGK--PTSEKRKPPP 534

Query: 1673 TTIKPKPTT----VKPKPTTTVK-PKPTTVKPKPVIP 1704
               KPKP       K +     +    +  KPKP +P
Sbjct: 535  VPKKPKPQIPARPAKLQKQQAGEEANSSAFKPKPRVP 571


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 41.6 bits (98), Expect = 0.003
 Identities = 18/76 (23%), Positives = 26/76 (34%), Gaps = 3/76 (3%)

Query: 1636 EQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKP- 1694
            EQT +   +T        ++ +PP TT+       P  T  P      P P  T    P 
Sbjct: 109  EQTPQVPRSTVQIQQQA-QQQQPPATTAQPQPVTPPRQTTAPVQPQ-TPAPVRTQPAAPV 166

Query: 1695 TTVKPKPVIPPSTKDE 1710
            T     P +    + E
Sbjct: 167  TQAVEAPKVEAEKEKE 182



 Score = 35.8 bits (83), Expect = 0.23
 Identities = 16/72 (22%), Positives = 22/72 (30%)

Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
            T +  + T          + P  T +P   T    TT   +P+ P    T    P     
Sbjct: 111  TPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAV 170

Query: 1675 IKPKPTTVKPKP 1686
              PK    K K 
Sbjct: 171  EAPKVEAEKEKE 182



 Score = 33.9 bits (78), Expect = 0.82
 Identities = 16/72 (22%), Positives = 19/72 (26%), Gaps = 1/72 (1%)

Query: 1631 TTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKP-KPTTVKPKPTTT 1689
            T + P  T +            T     P T    TT P    T  P +     P     
Sbjct: 111  TPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAV 170

Query: 1690 VKPKPTTVKPKP 1701
              PK    K K 
Sbjct: 171  EAPKVEAEKEKE 182



 Score = 31.9 bits (73), Expect = 3.0
 Identities = 20/79 (25%), Positives = 29/79 (36%), Gaps = 6/79 (7%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTT----TTKRPKPPTTTSTTTTRP 1669
            P +E++    V  STV   ++ +Q + P TT      T    TT   +P T     T   
Sbjct: 106  PYNEQTPQ--VPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPA 163

Query: 1670 KPITTIKPKPTTVKPKPTT 1688
             P+T     P     K   
Sbjct: 164  APVTQAVEAPKVEAEKEKE 182


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 41.9 bits (98), Expect = 0.003
 Identities = 21/93 (22%), Positives = 28/93 (30%), Gaps = 4/93 (4%)

Query: 1620 KPTTVSTSTVVTTEE--PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKP 1677
               T  T+   TTEE  P   +   +  T  T  T K               K   ++K 
Sbjct: 1    MAETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTS-TKAAKKAAKVKKT-KSVKT 58

Query: 1678 KPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
                V  K   T   K  +V  K V   +   E
Sbjct: 59   TTKKVTVKFEKTESVKKESVAKKTVKKEAVSAE 91



 Score = 34.2 bits (78), Expect = 0.89
 Identities = 15/56 (26%), Positives = 21/56 (37%), Gaps = 8/56 (14%)

Query: 1255 KQTSATTTSEQVPKPTKKPTK-----PTKKPTTTTE---YNPPEATTKPSTTTSTT 1302
            K+T+   T+E+     KK T       TKK   TT            K  +  +TT
Sbjct: 5    KKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTT 60


>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
          Length = 209

 Score = 40.6 bits (95), Expect = 0.005
 Identities = 19/103 (18%), Positives = 27/103 (26%), Gaps = 4/103 (3%)

Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
            ++T+     T  +    T     +   E P     P      +         P       
Sbjct: 98   KATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPAKPR 157

Query: 1666 TTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
               P+P     PKPT          K K T   PKP    +  
Sbjct: 158  AKSPRPAAKAAPKPTETITAK----KAKKTAAAPKPTADKTAT 196



 Score = 35.2 bits (81), Expect = 0.27
 Identities = 29/108 (26%), Positives = 31/108 (28%), Gaps = 20/108 (18%)

Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTT---------TTTKRPKPPTTTS 1663
            K TS     T    + V T  EP    E                        PK P    
Sbjct: 98   KATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPAKPR 157

Query: 1664 TTTTRP------KP---ITTIKPKPTTVKPKPTTTVKPKPT--TVKPK 1700
              + RP      KP   IT  K K T   PKPT      P   T K K
Sbjct: 158  AKSPRPAAKAAPKPTETITAKKAKKTAAAPKPTADKTATPAKKTTKKK 205


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.  Members
            of this family are the S-layer-forming halobacterial
            major cell surface glycoprotein. The highest scores below
            model cutoffs are fragmentary paralogs to actual members
            of the family. Modifications include at N-linked and
            O-linked glycosylation, a C-terminal diphytanylglyceryl
            modification, and probable cleavage of the PGF-CTERM
            tail.
          Length = 782

 Score = 41.4 bits (97), Expect = 0.007
 Identities = 22/65 (33%), Positives = 27/65 (41%), Gaps = 2/65 (3%)

Query: 1602 GGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTT 1661
            G N +    E    +E    TT    T  TT      EE   T T   TTTT+ P   TT
Sbjct: 697  GDNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTTGPEET--TETAEPTTTTEEPTEETT 754

Query: 1662 TSTTT 1666
            T ++T
Sbjct: 755  TGSST 759



 Score = 39.1 bits (91), Expect = 0.038
 Identities = 20/55 (36%), Positives = 26/55 (47%), Gaps = 7/55 (12%)

Query: 1623 TVSTSTVVTTEEPEQT--EEPITTTTTSTTTTTKRPKPPTTTST-TTTRPKPITT 1674
             V+   V  TE P+ T  E+P TTTT +TT         T T+  TTT  +P   
Sbjct: 702  RVNVEVVEETERPDTTTSEDPTTTTTPTTTGPE----ETTETAEPTTTTEEPTEE 752



 Score = 37.2 bits (86), Expect = 0.14
 Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 7/53 (13%)

Query: 1616 SEESKPTTVSTSTVVTTEEPEQTEEPITT-TTTSTTTTTKRPKPPTTTSTTTT 1667
             E  +P T       T+E+P  T  P TT    +T T          T  TTT
Sbjct: 709  EETERPDTT------TSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTT 755



 Score = 37.2 bits (86), Expect = 0.15
 Identities = 21/60 (35%), Positives = 25/60 (41%), Gaps = 8/60 (13%)

Query: 1630 VTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
            V  E  E+TE P TTT+   TTT       T T+T            P  TT +P   TT
Sbjct: 703  VNVEVVEETERPDTTTSEDPTTT------TTPTTTGPEETTETAE--PTTTTEEPTEETT 754



 Score = 36.4 bits (84), Expect = 0.20
 Identities = 20/47 (42%), Positives = 22/47 (46%), Gaps = 3/47 (6%)

Query: 1260 TTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSG 1306
            TTTSE    PT   T  T  P  TTE   P  TT+  T  +TT  S 
Sbjct: 716  TTTSED---PTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSST 759



 Score = 32.9 bits (75), Expect = 2.7
 Identities = 22/73 (30%), Positives = 32/73 (43%), Gaps = 3/73 (4%)

Query: 1624 VSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKP-PTTTSTTTTRPKPITTIKPKPTTV 1682
            V  S V T     + ++   T   +     +  +P  TT+   TT   P TT  P+ TT 
Sbjct: 680  VDLSDVETGNYTVEADDGDNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTTG-PEETTE 738

Query: 1683 KPKPTTTVKPKPT 1695
              +PTTT   +PT
Sbjct: 739  TAEPTTTT-EEPT 750



 Score = 32.5 bits (74), Expect = 3.4
 Identities = 21/53 (39%), Positives = 25/53 (47%), Gaps = 2/53 (3%)

Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTT-STTTTTKRPKP 1658
              TE    T+ E  PTT +T T    EE  +T EP TTT   +  TTT    P
Sbjct: 709  EETERPDTTTSE-DPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSSTP 760



 Score = 31.4 bits (71), Expect = 8.0
 Identities = 18/49 (36%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 1277 TKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWVWHPPTEPTT 1325
            T++P TTT  +P   TT  +T    TT+    T  PT     P  E TT
Sbjct: 711  TERPDTTTSEDPTTTTTPTTTGPEETTE----TAEPTTTTEEPTEETTT 755



 Score = 31.4 bits (71), Expect = 8.2
 Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 5/52 (9%)

Query: 1264 EQVPKP-TKKPTKPTKKPTTTTEY-NPPEATTKPSTTT---STTTDSGAWTP 1310
            E+  +P T     PT   T TT        T +P+TTT   +  T +G+ TP
Sbjct: 709  EETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSSTP 760


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid protein
            may act as a link between the external capsid and the
            internal DNA-protein core. The C-terminal 11 residues may
            function as a protease cofactor leading to enzyme
            activation.
          Length = 238

 Score = 40.2 bits (94), Expect = 0.008
 Identities = 17/89 (19%), Positives = 26/89 (29%), Gaps = 2/89 (2%)

Query: 1621 PTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPT 1680
                      T  +P Q  +P              P+PP+   T    P P+        
Sbjct: 111  GEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMA 170

Query: 1681 TVKPKP--TTTVKPKPTTVKPKPVIPPST 1707
               P      T++  P    P PV+P  +
Sbjct: 171  IAVPAIDTPVTLELPPAPQPPPPVVPQPS 199


>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
            This family includes attachment proteins from respiratory
            synctial virus. Glycoprotein G has not been shown to have
            any neuraminidase or hemagglutinin activity. The amino
            terminus is thought to be cytoplasmic, and the carboxyl
            terminus extracellular. The extracellular region contains
            four completely conserved cysteine residues.
          Length = 263

 Score = 40.1 bits (93), Expect = 0.009
 Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 1275 KPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWVWHPPTEPTT-THISVTEK 1333
            KP K PTT     P   TTK  T T TT    A T +PT      PT+ TT T+ +  + 
Sbjct: 184  KPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTS----EPTKLTTKTNTTTPQF 239

Query: 1334 SPLD 1337
            +PL 
Sbjct: 240  TPLS 243



 Score = 36.2 bits (83), Expect = 0.17
 Identities = 27/98 (27%), Positives = 35/98 (35%), Gaps = 9/98 (9%)

Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTT---RPKP 1671
            +S   K T  +T T   T + +       T     +T+       TTT    T      P
Sbjct: 54   SSANHKVTPTTTPTQQITNQIQNHTSTYLTQHNQLSTSP--SNQSTTTPLIHTILDDTTP 111

Query: 1672 ITTIKPKPTTVKPKPTTTVKP---KPTTVKPKPVIPPS 1706
             T    + TTV  K  TT      KP T KP+   PP 
Sbjct: 112  GTKSTYQHTTVGTKGRTTTPAQTNKPPT-KPRQSNPPE 148



 Score = 35.0 bits (80), Expect = 0.34
 Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 1/56 (1%)

Query: 1249 PYRTKCKQTSATTTSEQVPKPTKKPTKP-TKKPTTTTEYNPPEATTKPSTTTSTTT 1303
            P +   K  +   T +  PK TKK TK  T K    T ++P    TK +T T+TTT
Sbjct: 181  PEKKPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTT 236



 Score = 34.3 bits (78), Expect = 0.60
 Identities = 22/49 (44%), Positives = 26/49 (53%), Gaps = 5/49 (10%)

Query: 1649 TTTTTKRPKPPTT---TSTTTTRPKPITTIKP--KPTTVKPKPTTTVKP 1692
            TT  TK+PKP TT   T T TT+ K  TT  P  +PT +  K  TT   
Sbjct: 190  TTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTTPQ 238



 Score = 33.1 bits (75), Expect = 1.4
 Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 1650 TTTTKRPKPPTTTSTTTTRPKPITTIKP-KPTTVKPKPTTTVKP--KPTTVKPK 1700
                K+P    TT   T +PKP TT K  K  T K K  TT  P  +PT +  K
Sbjct: 179  RIPEKKPGKAPTT-KPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTK 231



 Score = 32.7 bits (74), Expect = 1.9
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 6/69 (8%)

Query: 1618 ESKPTTVSTSTVVTTEEPEQTEEPITTTTTST--TTTTKRPKPPT--TTSTTTTRPK--P 1671
            E KP    T+      +P+ T++   T TT +   TT      PT  TT T TT P+  P
Sbjct: 182  EKKPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTTPQFTP 241

Query: 1672 ITTIKPKPT 1680
            ++T   +  
Sbjct: 242  LSTTTTRNP 250



 Score = 31.2 bits (70), Expect = 6.6
 Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 5/66 (7%)

Query: 1242 KMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTK-----PTKKPTTTTEYNPPEATTKPS 1296
            K KT     +T+  ++   TT     +PTK  TK     P   P +TT    PE T++  
Sbjct: 198  KPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTTPQFTPLSTTTTRNPELTSQME 257

Query: 1297 TTTSTT 1302
            T  ST 
Sbjct: 258  TFHSTN 263



 Score = 30.8 bits (69), Expect = 7.2
 Identities = 19/68 (27%), Positives = 23/68 (33%), Gaps = 2/68 (2%)

Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKP-KPTTTVKPKPTTVKPK 1700
            I         TTK  K P    TT    K  TT   + TT  P    T +  K  T  P+
Sbjct: 180  IPEKKPGKAPTTKPTKKP-KPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTNTTTPQ 238

Query: 1701 PVIPPSTK 1708
                 +T 
Sbjct: 239  FTPLSTTT 246


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
            2'-phosphodiesterase/3'-nucleotidase precursor protein;
            Reviewed.
          Length = 814

 Score = 41.0 bits (96), Expect = 0.009
 Identities = 25/134 (18%), Positives = 38/134 (28%), Gaps = 26/134 (19%)

Query: 1569 MNYWMKKGAPSRKLVMGVHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTST 1628
            M ++  K A    + + +  LL+     L             E   T+  +  T    +T
Sbjct: 2    MKHYFSKSA----VALTLA-LLTASNPKLAQ----------AEEIVTTTPATSTEAEQTT 46

Query: 1629 VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
                E     E   T T  + TT  + P    T  T      P +      T+       
Sbjct: 47   --PVESDATEEADNTETPVAATTAAEAPSSSETAET----SDPTSEATDTTTSEARTV-- 98

Query: 1689 TVKPKPTTVKPKPV 1702
                   T   KPV
Sbjct: 99   ---TPAATETSKPV 109



 Score = 39.8 bits (93), Expect = 0.019
 Identities = 18/90 (20%), Positives = 32/90 (35%), Gaps = 7/90 (7%)

Query: 1625 STSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK- 1683
            +++  +   E   T  P T+T    TT  +     T  +  T  P   TT    P++ + 
Sbjct: 20   ASNPKLAQAEEIVTTTPATSTEAEQTTPVE--SDATEEADNTETPVAATTAAEAPSSSET 77

Query: 1684 ---PKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
                 PT+      T+       P +T+  
Sbjct: 78   AETSDPTSEATDTTTSEARTVT-PAATETS 106



 Score = 32.9 bits (75), Expect = 2.8
 Identities = 16/76 (21%), Positives = 23/76 (30%), Gaps = 2/76 (2%)

Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEA--TTKPSTTTSTTTDSGAWT 1309
            T  +    T       +       P    T     +  E   T+ P++  + TT S A T
Sbjct: 38   TSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEART 97

Query: 1310 PNPTEWVWHPPTEPTT 1325
              P       P E  T
Sbjct: 98   VTPAATETSKPVEGQT 113


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 40.6 bits (96), Expect = 0.013
 Identities = 9/80 (11%), Positives = 11/80 (13%), Gaps = 4/80 (5%)

Query: 1631 TTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT----STTTTRPKPITTIKPKPTTVKPKP 1686
                P  T  P      +    +     P                     PKP       
Sbjct: 34   ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93

Query: 1687 TTTVKPKPTTVKPKPVIPPS 1706
                 P        P     
Sbjct: 94   AAPAAPPAAAAAAAPAAAAV 113



 Score = 36.0 bits (84), Expect = 0.31
 Identities = 11/95 (11%), Positives = 13/95 (13%), Gaps = 9/95 (9%)

Query: 1610 EEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP 1669
             +Y P       T   T+                              P          P
Sbjct: 34   ADYGPG-----STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAP--APPAAAAPAAP--P 84

Query: 1670 KPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIP 1704
            KP            P                 V P
Sbjct: 85   KPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTP 119


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 40.3 bits (94), Expect = 0.016
 Identities = 15/93 (16%), Positives = 29/93 (31%), Gaps = 1/93 (1%)

Query: 1616 SEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTI 1675
            ++         + +     P     P T+   +       P PP+     +  P      
Sbjct: 2805 ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRR 2864

Query: 1676 KPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
            +P   +   KP    +P P     +P +  ST+
Sbjct: 2865 RPPSRSPAAKPAAPARP-PVRRLARPAVSRSTE 2896



 Score = 40.3 bits (94), Expect = 0.019
 Identities = 20/98 (20%), Positives = 28/98 (28%), Gaps = 5/98 (5%)

Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKP-----PTTTSTTTT 1667
            +P    ++PT  S +++     P  T EP      S T     P       P   +    
Sbjct: 2682 RPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP 2741

Query: 1668 RPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPP 1705
               P     P       +P TT  P        P   P
Sbjct: 2742 PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779



 Score = 36.8 bits (85), Expect = 0.21
 Identities = 17/99 (17%), Positives = 26/99 (26%), Gaps = 5/99 (5%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRP--KPPTTTSTTTTRPKP 1671
            P      PT+   +       P     P+  +        +RP  + P        RP  
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883

Query: 1672 ITTIKP---KPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
                +P   + T     P    +  P    P P  P   
Sbjct: 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ 2922


>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439).  This
            presumed domain is functionally uncharacterized. This
            domain is found in eukaryotes. This domain is typically
            between 46 to 94 amino acids in length. This domain is
            found associated with pfam01462, pfam00560.
          Length = 122

 Score = 37.2 bits (86), Expect = 0.019
 Identities = 18/28 (64%), Positives = 20/28 (71%), Gaps = 3/28 (10%)

Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTR 1668
            P TTTTT+TTTTT    P TTT  TTT+
Sbjct: 46   PTTTTTTTTTTTT---MPTTTTLPTTTK 70



 Score = 34.2 bits (78), Expect = 0.21
 Identities = 23/47 (48%), Positives = 28/47 (59%), Gaps = 11/47 (23%)

Query: 1659 PTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPP 1705
            PTTT+TTTT     TT  P  TT+   PTTT   K + VK  P++PP
Sbjct: 46   PTTTTTTTT----TTTTMPTTTTL---PTTT---KMSMVK-VPLVPP 81



 Score = 32.2 bits (73), Expect = 0.89
 Identities = 19/34 (55%), Positives = 20/34 (58%), Gaps = 10/34 (29%)

Query: 1643 TTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK 1676
            TTTTT+TTTTT  P       TTTT P   TT K
Sbjct: 47   TTTTTTTTTTTTMP-------TTTTLP---TTTK 70



 Score = 30.3 bits (68), Expect = 4.5
 Identities = 19/47 (40%), Positives = 21/47 (44%), Gaps = 7/47 (14%)

Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP 1659
            KP      PTT +T+T  TT  P       TTTT  TTT     K P
Sbjct: 38   KPVRSIICPTTTTTTTTTTTTMP-------TTTTLPTTTKMSMVKVP 77



 Score = 30.3 bits (68), Expect = 5.4
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTT 1667
            I   TT+TTTTT     PTTT+  TT
Sbjct: 43   IICPTTTTTTTTTTTTMPTTTTLPTT 68


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 39.3 bits (92), Expect = 0.021
 Identities = 18/63 (28%), Positives = 23/63 (36%), Gaps = 2/63 (3%)

Query: 1645 TTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIP 1704
            T  +T   +K  K P T  T  T P+   T KP         T  V P  +  KPK    
Sbjct: 176  TAPATVAPSKGAKVPATAETHPTPPQKPATKKP-AVNHHKTATVAVPPATSG-KPKSGAA 233

Query: 1705 PST 1707
             + 
Sbjct: 234  SAR 236



 Score = 34.3 bits (79), Expect = 0.86
 Identities = 15/69 (21%), Positives = 18/69 (26%), Gaps = 8/69 (11%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTT--TTKRPKPPTTTSTTTTRPKP 1671
            PT     P TV+ S     + P   E   T      T        K  T      T  KP
Sbjct: 175  PT----APATVAPSK--GAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKP 228

Query: 1672 ITTIKPKPT 1680
             +       
Sbjct: 229  KSGAASARA 237



 Score = 33.5 bits (77), Expect = 1.2
 Identities = 11/68 (16%), Positives = 16/68 (23%), Gaps = 1/68 (1%)

Query: 1640 EPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTT-VKPKPTTTVKPKPTTVK 1698
             P T   +         +   T        KP        T  V P  +   K    + +
Sbjct: 177  APATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASAR 236

Query: 1699 PKPVIPPS 1706
                 P S
Sbjct: 237  ALSSAPAS 244



 Score = 32.0 bits (73), Expect = 4.5
 Identities = 13/70 (18%), Positives = 18/70 (25%), Gaps = 3/70 (4%)

Query: 1626 TSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPK 1685
            TST+ T        +      T+ T  T   KP T              + P       K
Sbjct: 171  TSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPAT---SGK 227

Query: 1686 PTTTVKPKPT 1695
            P +       
Sbjct: 228  PKSGAASARA 237



 Score = 31.6 bits (72), Expect = 5.2
 Identities = 15/57 (26%), Positives = 16/57 (28%), Gaps = 3/57 (5%)

Query: 1251 RTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGA 1307
             T      A   +     PT      TKKP          AT      TS    SGA
Sbjct: 179  ATVAPSKGAKVPATAETHPTPPQKPATKKPAVNH---HKTATVAVPPATSGKPKSGA 232



 Score = 31.2 bits (71), Expect = 7.2
 Identities = 12/47 (25%), Positives = 15/47 (31%)

Query: 1662 TSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
            TST  T P  +   K        +   T   KP T KP      +  
Sbjct: 171  TSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTAT 217


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 292

 Score = 38.8 bits (90), Expect = 0.024
 Identities = 16/80 (20%), Positives = 21/80 (26%), Gaps = 8/80 (10%)

Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTRP-------KPITTIKP-KPTTVKPKPTTTVKP 1692
                T  +T    +    PT      TRP       +P    K  K  T       T  P
Sbjct: 126  STLPTEPATVAPVRNASVPTAERPAITRPVRAQAVSEPAVEPKAAKTATATEAKVQTASP 185

Query: 1693 KPTTVKPKPVIPPSTKDEFK 1712
              T   P      +   + K
Sbjct: 186  AQTPATPPAGKGAAASGQLK 205


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
            binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 39.6 bits (92), Expect = 0.027
 Identities = 24/108 (22%), Positives = 38/108 (35%), Gaps = 19/108 (17%)

Query: 1618 ESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTK---------RPKPPTTTSTTTTR 1668
            ++  TT   + +  T    + +  I+  T S+   TK          P   T+       
Sbjct: 303  KNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIA 362

Query: 1669 PKPITTIKPKPTTVKPKPTT-TVKPKPTT-------VKPKPVIP--PS 1706
                  ++  P+T    P T  V+   TT       VKP P +P  PS
Sbjct: 363  SATFRGLEKNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPS 410



 Score = 35.4 bits (81), Expect = 0.56
 Identities = 27/118 (22%), Positives = 38/118 (32%), Gaps = 25/118 (21%)

Query: 1615 TSEESKPTTVSTSTVVTTEEPE------QTEEPITTTTTSTTTTT--------KRPKPPT 1660
            TSE     T+S  T  +  E +      +   P++ T+               K P    
Sbjct: 320  TSEGQ--VTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATFRGLEKNPSTAP 377

Query: 1661 TTSTT-TTRPKPIT------TIKPKPTT-VKPKPTTTVKPKPTTVKPKP-VIPPSTKD 1709
            +T  T   R    T       +KP P     P P+ T    P    P P  +PP   D
Sbjct: 378  STPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPD 435


>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
          Length = 246

 Score = 38.1 bits (89), Expect = 0.038
 Identities = 24/84 (28%), Positives = 30/84 (35%), Gaps = 27/84 (32%)

Query: 1634 EPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPK 1693
            EPE   EPI             P+PP        +  P+   KP+P   KPKP    KPK
Sbjct: 73   EPEPEPEPI-------------PEPP--------KEAPVVIPKPEP---KPKPKPKPKPK 108

Query: 1694 P---TTVKPKPVIPPSTKDEFKIV 1714
            P      +PK  + P         
Sbjct: 109  PVKKVEEQPKREVKPVEPRPASPF 132



 Score = 31.6 bits (72), Expect = 4.2
 Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 2/50 (4%)

Query: 1267 PKPTKKPTKPTKKPTTTTEYNP--PEATTKPSTTTSTTTDSGAWTPNPTE 1314
            PKP KK  +  K+     E  P  P   T P+  TS+T  + A  P  + 
Sbjct: 107  PKPVKKVEEQPKREVKPVEPRPASPFENTAPARPTSSTATAAASKPVTSV 156



 Score = 30.8 bits (70), Expect = 8.8
 Identities = 19/79 (24%), Positives = 28/79 (35%), Gaps = 4/79 (5%)

Query: 1627 STVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
            ++V    E     +PI+ T  +       P+           P+P     P+P    P  
Sbjct: 33   TSVHQVIELPAPAQPISVTMVAPADLEP-PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 91

Query: 1687 TTTVKPKPTTV---KPKPV 1702
                +PKP      KPKPV
Sbjct: 92   IPKPEPKPKPKPKPKPKPV 110


>gnl|CDD|221145 pfam11596, DUF3246, Protein of unknown function (DUF3246).  This is a
            small family of fungal proteins one of whose members from
            Pichia stipitis is described as being an extremely serine
            rich protein-mucin-like protein.
          Length = 208

 Score = 37.8 bits (87), Expect = 0.039
 Identities = 24/84 (28%), Positives = 31/84 (36%), Gaps = 10/84 (11%)

Query: 1620 KPTTVSTSTVVTT--------EEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
             PT  ++ TVV T        ++    E  I+  TT TT T      PT T   TT    
Sbjct: 72   DPTGTTSQTVVPTKPTTTDDDDDTTCVETTISDPTTITTPTGTVNGNPTGT--VTTNGTA 129

Query: 1672 ITTIKPKPTTVKPKPTTTVKPKPT 1695
             TT+      V    T T +   T
Sbjct: 130  TTTVITTVEGVAVTYTGTGQTFTT 153



 Score = 35.5 bits (81), Expect = 0.18
 Identities = 26/105 (24%), Positives = 31/105 (29%), Gaps = 15/105 (14%)

Query: 1621 PTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK---- 1676
            PT  +T      +E    E   T   T T TT     P  TTS T    KP TT      
Sbjct: 40   PTGSATDDDDYDDEETDCEGQTTANPTGTVTT----DPTGTTSQTVVPTKPTTTDDDDDT 95

Query: 1677 -------PKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKIV 1714
                     PTT+     T       TV        +     + V
Sbjct: 96   TCVETTISDPTTITTPTGTVNGNPTGTVTTNGTATTTVITTVEGV 140



 Score = 32.4 bits (73), Expect = 2.3
 Identities = 27/109 (24%), Positives = 34/109 (31%), Gaps = 6/109 (5%)

Query: 1241 DKMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTS 1300
            D+   C+          + TT        T  PTKPT       +    E T    TT +
Sbjct: 52   DEETDCEGQTTAN-PTGTVTTDPTGTTSQTVVPTKPTTT-DDDDDTTCVETTISDPTTIT 109

Query: 1301 TTTDSGAWTPNPTEWVWHPPTEPTTTHISVTEKSPLDQYFKIVCYFTNW 1349
            T T  G    NPT  V    T  TT   +V   +           FT  
Sbjct: 110  TPT--GTVNGNPTGTVTTNGTATTTVITTVEGVAVTYTGTGQT--FTTD 154



 Score = 31.7 bits (71), Expect = 3.2
 Identities = 22/75 (29%), Positives = 26/75 (34%), Gaps = 6/75 (8%)

Query: 1613 KPTSEESKPTTVSTSTVVTT---EEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP 1669
              T E    T   T    TT   E+ E  +E  T TTT  T  T      T  +   T  
Sbjct: 134  ITTVEGVAVTYTGTGQTFTTDGTEDDEDCDETTTYTTTYYTPYTTVIHGGTVYTNGVT-- 191

Query: 1670 KPITTIKPKPTTVKP 1684
              I T    PT V+ 
Sbjct: 192  -VIATHTVYPTDVED 205



 Score = 30.9 bits (69), Expect = 6.1
 Identities = 26/96 (27%), Positives = 30/96 (31%), Gaps = 9/96 (9%)

Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
            E+  E     +     TT  T T   T  P    +P TT     TT  +      TT TT
Sbjct: 54   ETDCEGQTTANPTGTVTTDPTGTTSQTVVPT---KPTTTDDDDDTTCVETTISDPTTITT 110

Query: 1666 TTRPKPITTIKPKPTTV--KPKPTTTVKPKPTTVKP 1699
             T     T       TV      TTTV      V  
Sbjct: 111  PTG----TVNGNPTGTVTTNGTATTTVITTVEGVAV 142


>gnl|CDD|226000 COG3469, COG3469, Chitinase [Carbohydrate transport and metabolism].
          Length = 332

 Score = 37.9 bits (88), Expect = 0.048
 Identities = 22/69 (31%), Positives = 35/69 (50%), Gaps = 7/69 (10%)

Query: 1765 DLDNKFYEKVTALKKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHN 1824
            D D +F  +V AL  +G  V L++GG      + +  L   Q+    F+  ++  I  + 
Sbjct: 82   DPDAEFRAQVGALNAEGKAVLLSLGG-----ADGHIELKAGQEQ--AFVNEIIRLIETYG 134

Query: 1825 FDGLDLDWE 1833
            FDGLD+D E
Sbjct: 135  FDGLDIDLE 143



 Score = 33.3 bits (76), Expect = 1.6
 Identities = 24/99 (24%), Positives = 37/99 (37%), Gaps = 18/99 (18%)

Query: 2225 AFADLDEQELVIRSHNNK-----------LDLEQGFYEKITKFNERGIQVILAIGGWSDS 2273
            A AD      V+     K            D +  F  ++   N  G  V+L++GG    
Sbjct: 52   ALADTPRNYNVVTVSFMKGAGDIPTFKPYNDPDAEFRAQVGALNAEGKAVLLSLGG---- 107

Query: 2274 RSDKYSRLVNDIPARRNFVRHVTEFLEFYGFNGLEFAWE 2312
             +D +  L         FV  +   +E YGF+GL+   E
Sbjct: 108  -ADGHIELKAGQEQ--AFVNEIIRLIETYGFDGLDIDLE 143



 Score = 31.4 bits (71), Expect = 5.9
 Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 7/64 (10%)

Query: 1403 FYERVVTLKKKGVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLD 1462
            F  +V  L  +G  V L       SLGG    +   A   Q F+  +++ +  Y FDGLD
Sbjct: 87   FRAQVGALNAEGKAVLL-------SLGGADGHIELKAGQEQAFVNEIIRLIETYGFDGLD 139

Query: 1463 LDWE 1466
            +D E
Sbjct: 140  IDLE 143



 Score = 31.4 bits (71), Expect = 6.0
 Identities = 15/68 (22%), Positives = 29/68 (42%), Gaps = 15/68 (22%)

Query: 97  VSITSDRESRLNFISSVLEMFDMYKFDGLDLNVKDPALNDEDDDDLESIANERSDFSTFI 156
           + + + +E    F++ ++ + + Y FDGLD+             DLE  A   +D  T I
Sbjct: 112 IELKAGQEQ--AFVNEIIRLIETYGFDGLDI-------------DLEQSAILAADNQTVI 156

Query: 157 QELSSTLR 164
                 ++
Sbjct: 157 PAALKAVK 164


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 38.4 bits (89), Expect = 0.051
 Identities = 19/103 (18%), Positives = 41/103 (39%), Gaps = 1/103 (0%)

Query: 1596 VLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKR 1655
            +L +  G +  S+ E    ++ E + T  + +   +            +T  + T +T +
Sbjct: 21   ILTYASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQ 80

Query: 1656 PKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
               P +  TT T  K + ++ P   +V    TT+V    +  +
Sbjct: 81   ASQPHSHETTITCTKSLISV-PYYKSVDMNCTTSVGVNYSEYR 122



 Score = 35.4 bits (81), Expect = 0.41
 Identities = 15/77 (19%), Positives = 24/77 (31%)

Query: 1623 TVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTV 1682
                ST  + E  + +   + +T  + T TT     P  T T  +     T    + +  
Sbjct: 25   ASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQP 84

Query: 1683 KPKPTTTVKPKPTTVKP 1699
                TT    K     P
Sbjct: 85   HSHETTITCTKSLISVP 101


>gnl|CDD|217603 pfam03537, DUF297, TM1410 hypothetical-related protein. 
          Length = 203

 Score = 36.9 bits (86), Expect = 0.075
 Identities = 35/202 (17%), Positives = 66/202 (32%), Gaps = 59/202 (29%)

Query: 1719 NWAWYRQSGGKYLPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWADLDNKFYEKVTALK 1778
            +WA+Y      Y  +   ++L             D +VI       D  +    ++ ALK
Sbjct: 1    SWAFY------YGGALPVAELDAF----------DVVVI-------DPFSFSKAEIAALK 37

Query: 1779 KKGVKV---------------------TLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVV 1817
              G KV                     +   G      G ++   + S   R   +   +
Sbjct: 38   AAGTKVIAYLSVGEVEDWRPDASQFPKSWLGGENPGWPGERWVD-IRSPAWRDIIMDARL 96

Query: 1818 NFILEHNFDGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFNPHDLLLSAAVSP 1877
            + +    FDG+ LD      +Q     G          LI+EL A      L +      
Sbjct: 97   DRLWAKGFDGVFLDN--VDSYQNANGTGRPLTAADLVALIRELAARA--PGLAI------ 146

Query: 1878 SKAVIDNAYDI-PVMSENLDWI 1898
               +++N +++ P ++  +D +
Sbjct: 147  ---ILNNGFELLPALAPLVDGV 165


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
            attachment glycoprotein from region G of metaviruses. It
            is high in serine and threonine suggesting it is highly
            glycosylated.
          Length = 183

 Score = 36.5 bits (84), Expect = 0.076
 Identities = 25/99 (25%), Positives = 39/99 (39%), Gaps = 6/99 (6%)

Query: 1614 PTSEESKPT-TVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRP--KPPTTTSTTTTRPK 1670
            P +E SK T T  T    T    +   +  T ++T    T++      PT+T  TT R +
Sbjct: 35   PPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTTNRQQ 94

Query: 1671 PI--TTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
             +   T  P  +  +       K K T    +   PP+T
Sbjct: 95   TVDRHTTPPSSSRTQTTQAVHEK-KNTRTTSRTQTPPTT 132



 Score = 35.8 bits (82), Expect = 0.13
 Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 3/92 (3%)

Query: 1619 SKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTI--- 1675
            ++  TV   T   +    QT + +     + TT+  +  P T+T+   T     T+    
Sbjct: 91   NRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGK 150

Query: 1676 KPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
            +P  T+V+P+ + T +    T +  P    ST
Sbjct: 151  EPTTTSVQPRSSATTQSHEETSQANPQSSAST 182



 Score = 32.7 bits (74), Expect = 1.6
 Identities = 25/93 (26%), Positives = 36/93 (38%), Gaps = 1/93 (1%)

Query: 1605 YESTTEEYKPTSEESKP-TTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
              +T+E +  T   S P TT    TV     P  +    TT        T+      T  
Sbjct: 71   PAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTPP 130

Query: 1664 TTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
            TT+T     TT     +T K   TT+V+P+ + 
Sbjct: 131  TTSTAAVQTTTTTNTSSTGKEPTTTSVQPRSSA 163



 Score = 31.5 bits (71), Expect = 3.0
 Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 3/66 (4%)

Query: 1261 TTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWVWHPP 1320
             TSE     +  PT+ +KK  TT   NP    T P++   T   + + T        H  
Sbjct: 24   NTSESEHHTSSPPTESSKKTPTTPTDNP---DTNPNSQHPTQQSTESSTLPAATSESHLE 80

Query: 1321 TEPTTT 1326
            TEPT+T
Sbjct: 81   TEPTST 86


>gnl|CDD|221091 pfam11355, DUF3157, Protein of unknown function (DUF3157).  This
            family of proteins with unknown function appears to be
            restricted to Gammaproteobacteria.
          Length = 197

 Score = 36.6 bits (85), Expect = 0.081
 Identities = 14/56 (25%), Positives = 23/56 (41%), Gaps = 1/56 (1%)

Query: 1609 TEEYKPTSEESKPTTVSTSTVVTTE-EPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
            T EY     ++  T  ++ +  ++   P     P+ T TT TT    +PK     S
Sbjct: 39   TWEYVILETQTTATAATSQSAASSSAAPAVVAIPVATATTLTTIALAQPKLLLQLS 94


>gnl|CDD|217490 pfam03318, ETX_MTX2, Clostridium epsilon toxin ETX/Bacillus
            mosquitocidal toxin MTX2.  This family appears to be
            distantly related to pfam01117.
          Length = 228

 Score = 36.6 bits (85), Expect = 0.086
 Identities = 29/103 (28%), Positives = 38/103 (36%), Gaps = 26/103 (25%)

Query: 1604 NYESTTEEYKPTS-EESKPTTVSTSTV------VTTEE----PEQTEEPITTTTT----S 1648
            N   +T+  +  S  +   TT ST+T                P   E  IT + T     
Sbjct: 38   NNTDSTQTLQTQSFSKKVTTTTSTTTTHGFKIGAKASGKFGIPFVAEGGITLSVTGEYNF 97

Query: 1649 TTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
            ++TTT       TTSTT T      T   +  TV P  T TV 
Sbjct: 98   SSTTT------NTTSTTET-----YTAPSQKVTVPPHTTVTVT 129


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
            subunit 62; Provisional.
          Length = 576

 Score = 37.6 bits (87), Expect = 0.090
 Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 11/81 (13%)

Query: 1640 EPITTTTTSTTTT--------TKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
            E I  TT   T          ++R  P  + +    +P P   + P+  +   +      
Sbjct: 304  EVIAETTAPLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEE---P 360

Query: 1692 PKPTTVKPKPVIPPSTKDEFK 1712
            P+P  V P+P+ P +  ++ K
Sbjct: 361  PQPKAVVPRPLSPYTAYEDLK 381



 Score = 36.8 bits (85), Expect = 0.14
 Identities = 20/99 (20%), Positives = 33/99 (33%), Gaps = 12/99 (12%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRP----------KPPTTTS 1663
            PT+      +VS+++ V    P+       T   +      RP          KPPT+ S
Sbjct: 455  PTAPTGVSPSVSSTSSV-PAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPS 513

Query: 1664 TTTTRPKPITTIKPK-PTTVKPKPTTTVKPKPTTVKPKP 1701
                  K   +   +        P T +  +    +PKP
Sbjct: 514  PAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKP 552



 Score = 32.2 bits (73), Expect = 3.9
 Identities = 22/125 (17%), Positives = 36/125 (28%), Gaps = 26/125 (20%)

Query: 1610 EEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST----- 1664
            E+ KP +  S PT  +  +   +        P T   T+ T       PP          
Sbjct: 445  EDLKPPTSPS-PTAPTGVSPSVSSTSSVPAVPDTAPATAATDA---AAPPPANMRPLSPY 500

Query: 1665 -------TTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTV----------KPKPVIPPST 1707
                     T P P   +     +   +        P T           KP+P+ P + 
Sbjct: 501  AVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPYTM 560

Query: 1708 KDEFK 1712
             ++ K
Sbjct: 561  YEDLK 565



 Score = 32.2 bits (73), Expect = 4.3
 Identities = 22/107 (20%), Positives = 33/107 (30%), Gaps = 12/107 (11%)

Query: 1613 KPTSEESKPTTVSTSTVVTTE---EPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP 1669
            KP   +  P+  S S V   E      +   P+     S     +  KPPT+ S T    
Sbjct: 406  KPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPL-----SPYARYEDLKPPTSPSPTAPTG 460

Query: 1670 KPITTIKPKPTT----VKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
               +              P    T    P     +P+ P +  D+ K
Sbjct: 461  VSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLK 507



 Score = 31.8 bits (72), Expect = 5.8
 Identities = 18/78 (23%), Positives = 28/78 (35%), Gaps = 8/78 (10%)

Query: 1627 STVVTTEEPEQTEEPITTTTTSTT---TTTKRPKPPTTTSTTTTRP-KPITT---IKPKP 1679
            S  V  +E +  + P    T   T    +    + P        RP  P T    +KP P
Sbjct: 325  SQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKP-P 383

Query: 1680 TTVKPKPTTTVKPKPTTV 1697
            T+  P P ++      +V
Sbjct: 384  TSPIPTPPSSSPASSKSV 401


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 36.9 bits (85), Expect = 0.096
 Identities = 23/72 (31%), Positives = 28/72 (38%), Gaps = 10/72 (13%)

Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTI---KPKPTTVKPKPTTTVKPKPTTVK 1698
            ITTTT +   TT    P +T         P T +   + KP    P  T    P+ T   
Sbjct: 21   ITTTTQAANATT----PSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQT--- 73

Query: 1699 PKPVIPPSTKDE 1710
            P    P STK E
Sbjct: 74   PNATTPSSTKVE 85



 Score = 36.5 bits (84), Expect = 0.15
 Identities = 21/90 (23%), Positives = 36/90 (40%), Gaps = 4/90 (4%)

Query: 1624 VSTSTVVTTEEPEQTEEPITTTTTS--TTTTTKRPKPPTTTSTTTTRPKP-ITTIKPKPT 1680
            ++T  + TT +      P +T   +  +T  + + + P +    TT P   +   +  P 
Sbjct: 16   LTTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPN 75

Query: 1681 TVKPKPTTTVKPK-PTTVKPKPVIPPSTKD 1709
               P  T    P+ PTT +    I P  KD
Sbjct: 76   ATTPSSTKVETPQSPTTKQVPTEINPKFKD 105



 Score = 35.0 bits (80), Expect = 0.40
 Identities = 22/81 (27%), Positives = 31/81 (38%), Gaps = 3/81 (3%)

Query: 1607 STTEEYKPTSEESKP--TTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST 1664
            +TT+    T+  S       ST      E P+      T  +T      + P   T +ST
Sbjct: 23   TTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSST 82

Query: 1665 TTTRPKPITTIKPKPTTVKPK 1685
                P+  TT K  PT + PK
Sbjct: 83   KVETPQSPTT-KQVPTEINPK 102



 Score = 34.2 bits (78), Expect = 0.67
 Identities = 32/138 (23%), Positives = 57/138 (41%), Gaps = 25/138 (18%)

Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNP----PEATTKPSTTTSTTTDSGAWTPNP 1312
            +S    + Q   P+ K   P  KP  TT  +     P+ T   +T +ST  ++       
Sbjct: 34   SSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVET------- 86

Query: 1313 TEWVWHPPTEPTTTHISVTEKSPLDQYFKIVCYFTNWAWYRPGKGKYVLKK---VPWINI 1369
                   P  PTT  +  TE +P  ++  +  Y+T  +     +   +LKK   + ++N+
Sbjct: 87   -------PQSPTTKQVP-TEINP--KFKDLRAYYTKPSLEFKNEIGIILKKWTTIRFMNV 136

Query: 1370 LKLYVTLQTGL-GTDQEK 1386
            +  Y   +  L G D +K
Sbjct: 137  VPDYFIYKIALVGKDDKK 154



 Score = 33.5 bits (76), Expect = 1.4
 Identities = 27/102 (26%), Positives = 43/102 (42%), Gaps = 8/102 (7%)

Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
            +ST    K  + +SKP   +T      E P+QT       TT ++T  + P+ PTT    
Sbjct: 42   QSTPPSTKVEAPQSKPN-ATTPPSTKVEAPQQTPNA----TTPSSTKVETPQSPTTKQVP 96

Query: 1666 TT---RPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIP 1704
            T    + K +     KP+         +  K TT++   V+P
Sbjct: 97   TEINPKFKDLRAYYTKPSLEFKNEIGIILKKWTTIRFMNVVP 138


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
            UL32.  The large phosphorylated protein (UL32-like) of
            herpes viruses is the polypeptide most frequently
            reactive in immuno-blotting analyses with antisera when
            compared with other viral proteins.
          Length = 777

 Score = 36.8 bits (85), Expect = 0.17
 Identities = 19/123 (15%), Positives = 31/123 (25%), Gaps = 10/123 (8%)

Query: 1595 EVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTK 1654
                   G + E      +     S   +V +   +        + P   +   +  T  
Sbjct: 492  TATPESSGSDSEGGASGRE-DETSSDAESVVSIKELRPRIGFINKSPPPKSPPKSRRTLI 550

Query: 1655 RPKPPTTTSTT-TTRPKPI---TTIKPKPTTVKPKPTTT-----VKPKPTTVKPKPVIPP 1705
                  + ST  + RPKP      I   P         T          ++VKPK     
Sbjct: 551  VALSLASPSTAGSPRPKPSLGKFVIGTDPFAFANTVRLTDNMRGGNGVGSSVKPKGSASS 610

Query: 1706 STK 1708
               
Sbjct: 611  KPL 613



 Score = 31.8 bits (72), Expect = 5.7
 Identities = 17/109 (15%), Positives = 25/109 (22%), Gaps = 12/109 (11%)

Query: 1601 PGGNYESTTEEYKPTSE-ESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP 1659
                 E T       S+ E   +     T    E        I           K P P 
Sbjct: 485  DAKRPEVTATPESSGSDSEGGASGREDETSSDAESVVS----IKELRPRIGFINKSPPPK 540

Query: 1660 TTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
            +   +  T            +   P    + +PKP+  K      P   
Sbjct: 541  SPPKSRRTLI-------VALSLASPSTAGSPRPKPSLGKFVIGTDPFAF 582


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 35.6 bits (82), Expect = 0.21
 Identities = 18/103 (17%), Positives = 31/103 (30%), Gaps = 7/103 (6%)

Query: 1608 TTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTT 1667
             T      S+ S       +      E E  ++      +ST+ T +    P  T     
Sbjct: 84   ATAPSDAGSQASPDDDAQPAA-----EAEAADQSAPPEASSTSATDEAATDPPAT--AAA 136

Query: 1668 RPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
            R  P      +P T   + +   +P  +   P P  P +    
Sbjct: 137  RDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAG 179


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
            includes the B. subtilis YqfQ protein, also known as
            VrrA, which is functionally uncharacterized. This family
            of proteins is found in bacteria. Proteins in this family
            are typically between 146 and 237 amino acids in length.
            There are two conserved sequence motifs: QYGP and PKLY.
          Length = 155

 Score = 34.7 bits (80), Expect = 0.24
 Identities = 16/62 (25%), Positives = 21/62 (33%), Gaps = 5/62 (8%)

Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
             EE E  EE      T  T     P+  T +     R  P    + +    +PK     K
Sbjct: 96   DEEEETEEES-----TDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150

Query: 1692 PK 1693
            PK
Sbjct: 151  PK 152



 Score = 32.8 bits (75), Expect = 1.1
 Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 5/68 (7%)

Query: 1611 EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPK 1670
            E   + +E        +   +T+E EQ + P T T +      + PKP T      T PK
Sbjct: 90   ELSSSDDEE-----EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144

Query: 1671 PITTIKPK 1678
                 KPK
Sbjct: 145  KPKPSKPK 152



 Score = 31.6 bits (72), Expect = 2.7
 Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 1/60 (1%)

Query: 1646 TTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT-VKPKPTTVKPKPVIP 1704
            ++S     +  +  T  +     P+  T  K K     PKP T   KPK    KPKP  P
Sbjct: 92   SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKP 151



 Score = 30.9 bits (70), Expect = 4.2
 Identities = 12/62 (19%), Positives = 17/62 (27%)

Query: 1618 ESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKP 1677
            E   +             E  +E    T T +    KR  P   T     + +P      
Sbjct: 90   ELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPS 149

Query: 1678 KP 1679
            KP
Sbjct: 150  KP 151


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 620

 Score = 36.1 bits (84), Expect = 0.24
 Identities = 25/125 (20%), Positives = 36/125 (28%), Gaps = 14/125 (11%)

Query: 1572 WMKKGAPSRKLVMGVHPLL-STITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVV 1630
            W+     SRK      PLL     +VLG     N ES +         S           
Sbjct: 483  WLGM-VQSRK------PLLEQAFAKVLGRSIKLNLESQS------GSASNTAKTPPPPQK 529

Query: 1631 TTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTV 1690
            +   P  T      T T+   T   P P  T +++    +      P P   +    +  
Sbjct: 530  SPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPT 589

Query: 1691 KPKPT 1695
            K    
Sbjct: 590  KDSSP 594



 Score = 36.1 bits (84), Expect = 0.25
 Identities = 10/72 (13%), Positives = 16/72 (22%)

Query: 1639 EEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
                 +   + +        P  +             K   TT  P P     P P   +
Sbjct: 361  PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAE 420

Query: 1699 PKPVIPPSTKDE 1710
            P    P    + 
Sbjct: 421  PTEPSPTPPANA 432



 Score = 36.1 bits (84), Expect = 0.26
 Identities = 16/77 (20%), Positives = 23/77 (29%)

Query: 1635 PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKP 1694
              +T  P   +      T   P+P  T    T  P P T  +         P  +  P P
Sbjct: 520  TAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPP 579

Query: 1695 TTVKPKPVIPPSTKDEF 1711
               +P P     +  E 
Sbjct: 580  IPEEPTPSPTKDSSPEE 596



 Score = 35.7 bits (83), Expect = 0.33
 Identities = 12/83 (14%), Positives = 17/83 (20%), Gaps = 8/83 (9%)

Query: 1627 STVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
            S +     P         +                T    T P P       P  V  +P
Sbjct: 366  SEIANASAPANPTPAPNPSPPPAP----IQPSAPKTKQAATTPSPPPAKASPPIPVPAEP 421

Query: 1687 TTTVKPKPTTVKPKPVIPPSTKD 1709
            T        T        P + +
Sbjct: 422  TEP----SPTPPANAANAPPSLN 440



 Score = 35.7 bits (83), Expect = 0.35
 Identities = 11/76 (14%), Positives = 16/76 (21%), Gaps = 1/76 (1%)

Query: 1635 PEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKP 1694
            P      I   +     T      P       + PK                     P  
Sbjct: 361  PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAE 420

Query: 1695 TTVKPKPVIPPSTKDE 1710
             T +P P  P +  + 
Sbjct: 421  PT-EPSPTPPANAANA 435



 Score = 33.0 bits (76), Expect = 2.3
 Identities = 18/84 (21%), Positives = 28/84 (33%), Gaps = 3/84 (3%)

Query: 1256 QTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEW 1315
             ++   T     K    P      P  T    PP     P T T  ++++ A  P  +  
Sbjct: 517  ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSS- 575

Query: 1316 VWHPPTEPTTTHISVTEKSPLDQY 1339
               PP  P     S T+ S  ++ 
Sbjct: 576  --PPPPIPEEPTPSPTKDSSPEEI 597


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 35.8 bits (82), Expect = 0.27
 Identities = 23/101 (22%), Positives = 37/101 (36%), Gaps = 12/101 (11%)

Query: 1608 TTEEYKPTSEESKPTTVSTSTVVTTEEPEQT-----------EEPITTTTTSTTTTTKRP 1656
            T       S    P++V+T +  TT+                  P T    +      +P
Sbjct: 317  TPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQP 376

Query: 1657 KPPTTTSTTTTRPKPITTIKPKPTTVKP-KPTTTVKPKPTT 1696
            +P +TT T  +    IT+    PTT  P  P + +   PT+
Sbjct: 377  QPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTS 417



 Score = 32.4 bits (73), Expect = 3.4
 Identities = 14/70 (20%), Positives = 22/70 (31%), Gaps = 2/70 (2%)

Query: 1257 TSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWV 1316
            +SA T S  +   +     P   P++ T       TT+ S    ++             V
Sbjct: 307  SSAVTQSSAITPSSAAIPSPAVIPSSVT--TQSATTTQASAVALSSAGVLPSDVTLPGTV 364

Query: 1317 WHPPTEPTTT 1326
              P  EP   
Sbjct: 365  ALPAAEPVNM 374


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
            Provisional.
          Length = 481

 Score = 35.5 bits (81), Expect = 0.33
 Identities = 23/78 (29%), Positives = 28/78 (35%), Gaps = 5/78 (6%)

Query: 1623 TVSTSTVVTTEEP--EQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK--PK 1678
            T +    V TE P  E+   P+    T+T T T   K  TTT   T    P    K  P 
Sbjct: 249  TATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKE-TTTQQQTAPKAPTEAAKPAPA 307

Query: 1679 PTTVKPKPTTTVKPKPTT 1696
            P+T      T       T
Sbjct: 308  PSTNTNANKTNTNTNTNT 325



 Score = 35.5 bits (81), Expect = 0.41
 Identities = 25/97 (25%), Positives = 32/97 (32%), Gaps = 5/97 (5%)

Query: 1606 ESTTEEYKPTSEESKPTTVSTST---VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTT 1662
            E  TE   P +E+     V  +T     TTE+ E T +  T     T      P P T T
Sbjct: 255  EVKTE--APAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNT 312

Query: 1663 STTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKP 1699
            +   T     T      T+   K T T     T    
Sbjct: 313  NANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNS 349



 Score = 32.9 bits (74), Expect = 2.6
 Identities = 17/64 (26%), Positives = 24/64 (37%), Gaps = 4/64 (6%)

Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPP---EATTKPSTTTST-TTDSGA 1307
            T  +Q +A     +  KP   P+  T    T T  N       T+ PS  T+T T  +  
Sbjct: 287  TTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTN 346

Query: 1308 WTPN 1311
               N
Sbjct: 347  TNSN 350



 Score = 31.7 bits (71), Expect = 5.0
 Identities = 15/54 (27%), Positives = 21/54 (38%)

Query: 1260 TTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPT 1313
            TT  +  PK   +  KP   P+T T  N     T  +T  + T+     T   T
Sbjct: 288  TTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNT 341



 Score = 31.7 bits (71), Expect = 5.9
 Identities = 15/74 (20%), Positives = 23/74 (31%), Gaps = 2/74 (2%)

Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
             + E K TT    T    + P +  +P    +T+T          T T+ T T      T
Sbjct: 280  ATTEKKETTTQQQT--APKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNT 337

Query: 1675 IKPKPTTVKPKPTT 1688
                 +       T
Sbjct: 338  NTNTNSNTNTNSNT 351



 Score = 31.7 bits (71), Expect = 6.4
 Identities = 17/62 (27%), Positives = 21/62 (33%), Gaps = 1/62 (1%)

Query: 1638 TEEPITTTT-TSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
            T  P+  T      TTT++  P   T T   +    TT  PK    K  P          
Sbjct: 144  TSTPVAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHA 203

Query: 1697 VK 1698
            VK
Sbjct: 204  VK 205


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 35.7 bits (82), Expect = 0.33
 Identities = 25/92 (27%), Positives = 31/92 (33%), Gaps = 3/92 (3%)

Query: 1597 LGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVV--TTEEPEQTEEPITTTTTSTTTTTK 1654
            L   P G   S      P    S P        V  T     +T     TT T +TTT+ 
Sbjct: 172  LAAPPLGE-GSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSP 230

Query: 1655 RPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
                    STT   P+  TT + + T   P P
Sbjct: 231  PSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262



 Score = 35.3 bits (81), Expect = 0.35
 Identities = 23/69 (33%), Positives = 26/69 (37%), Gaps = 7/69 (10%)

Query: 1635 PEQTEEPITTTTTSTTTTTKRPKPPTTTST--TTTRPKPITTIKPKPTTVKPKPTTTVKP 1692
            P     P T   T  TT       P TT T  TTT P   T   P  T   P+  TT + 
Sbjct: 199  PADVFVPATPRPTPRTTA-----SPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEA 253

Query: 1693 KPTTVKPKP 1701
            + T   P P
Sbjct: 254  EGTPAPPTP 262



 Score = 34.2 bits (78), Expect = 1.0
 Identities = 20/63 (31%), Positives = 21/63 (33%), Gaps = 7/63 (11%)

Query: 1273 PTKPTKKPTTTTEYNPPEATTKPSTTTSTTTDSGAWTPNPTEWVWHPPTEPTTTHISVTE 1332
            P  P   P TT     PE T  PSTTTS  +     T          P   TT     T 
Sbjct: 205  PATPRPTPRTTA---SPETTPTPSTTTSPPST----TIPAPSTTIAAPQAGTTPEAEGTP 257

Query: 1333 KSP 1335
              P
Sbjct: 258  APP 260



 Score = 33.0 bits (75), Expect = 2.2
 Identities = 23/57 (40%), Positives = 25/57 (43%), Gaps = 6/57 (10%)

Query: 1653 TKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK-----PKPVIP 1704
            T RP P TT S  TT P P TT  P  TT+    TT   P+  T       P P  P
Sbjct: 207  TPRPTPRTTASPETT-PTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262



 Score = 32.2 bits (73), Expect = 3.4
 Identities = 16/52 (30%), Positives = 18/52 (34%), Gaps = 3/52 (5%)

Query: 1656 PKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
            P  P  T  TT  P+   T     TT  P  TT   P  T   P+    P  
Sbjct: 205  PATPRPTPRTTASPE---TTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEA 253


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein TopJ.
             This model describes TopJ (MG_200, CbpA), a DnaJ homolog
            and probable assembly protein of the Mycoplasma terminal
            organelle. The terminal organelle is involved in both
            cytadherence and gliding motility [Cellular processes,
            Chemotaxis and motility].
          Length = 871

 Score = 35.6 bits (81), Expect = 0.37
 Identities = 29/84 (34%), Positives = 35/84 (41%), Gaps = 7/84 (8%)

Query: 1256 QTSATTTSEQVPKPTKK-PTKPTKKPTTTTEYNPPE------ATTKPSTTTSTTTDSGAW 1308
            +T A T +E+ P+PT+   TKPT  P +T E N PE      A    S T STT      
Sbjct: 283  ETVAETKAEEEPQPTQTVETKPTSAPESTVEENLPEINQPTQAVQPTSETISTTPVEPTD 342

Query: 1309 TPNPTEWVWHPPTEPTTTHISVTE 1332
               P E          T  I V E
Sbjct: 343  QLKPKEVDQIQEELKKTKEIEVEE 366



 Score = 31.7 bits (71), Expect = 7.0
 Identities = 24/109 (22%), Positives = 40/109 (36%), Gaps = 12/109 (11%)

Query: 1609 TEEYKPTSEESKPTTVSTSTVVTTEEPEQTE----------EPITTTTTSTTTTTKRPKP 1658
            T E +P  +      +  + ++++   E T           EP+   T + T   + P+P
Sbjct: 237  TRELEPQDDSEDDYVIPDAEIISSPTLEVTAPKEVEQPLQPEPVDEETVAETKAEEEPQP 296

Query: 1659 PTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
              T  T  T     T  +  P     +PT  V+P   T+   PV P   
Sbjct: 297  TQTVETKPTSAPESTVEENLPEI--NQPTQAVQPTSETISTTPVEPTDQ 343


>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein.  This domain is found
            in the aflatoxin regulatory protein (AflR) which is
            involved in the regulation of the biosynthesis of
            aflatoxin in the fungal genus Aspergillus. It occurs
            together with the fungal Zn(2)-Cys(6) binuclear cluster
            domain (pfam00172).
          Length = 275

 Score = 34.9 bits (80), Expect = 0.39
 Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 2/68 (2%)

Query: 1632 TEEPEQTEEP-ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTT-VKPKPTTT 1689
             E P     P I   TT+ TT++  P+PP  +  ++ +P   T   P  T+   PK +  
Sbjct: 2    LETPNTASSPTIPANTTANTTSSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPSPKFSHQ 61

Query: 1690 VKPKPTTV 1697
              P    +
Sbjct: 62   SPPAEPEL 69


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 35.4 bits (81), Expect = 0.40
 Identities = 31/128 (24%), Positives = 42/128 (32%), Gaps = 14/128 (10%)

Query: 1574 KKGAPSRKL-----VMGVHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTST 1628
            KK   S+K      + GV  + ++IT      PG     + +E  P  E SK    S   
Sbjct: 17   KKNKKSKKFKTGATIAGVTAIATSITV-----PGIEVIVSADETAPADEASK----SAEA 67

Query: 1629 VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
              T E P       TT  T     T+  +   T        K +     +P TV      
Sbjct: 68   NTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKAPAEPATVSNPDNA 127

Query: 1689 TVKPKPTT 1696
            T    P T
Sbjct: 128  TSSSTPAT 135


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
            conserved protein active in cell division in a number of
            Proteobacteria. The N-terminal 30 residue region tends to
            by Lys/Arg-rich, and is followed by a membrane-spanning
            region. This is followed by an acidic low-complexity
            region of variable length and a well-conserved C-terminal
            domain of two tandem regions matched by pfam05036
            (Sporulation related repeat), found in several cell
            division and sporulation proteins. The role of FtsN as a
            suppressor for other cell division mutations is poorly
            understood; it may involve cell wall hydrolysis [Cellular
            processes, Cell division].
          Length = 298

 Score = 35.1 bits (80), Expect = 0.41
 Identities = 24/120 (20%), Positives = 35/120 (29%), Gaps = 7/120 (5%)

Query: 1588 PLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTT 1647
             L++   E    G        T E +   E+ +    +   V+ T   EQT        T
Sbjct: 94   VLINDPEEPSNGGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVAVEARKQT 153

Query: 1648 STTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPST 1707
            +        K P    T   +  P+ T K      + K      PK T        P  T
Sbjct: 154  AE-------KKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIET 206



 Score = 33.1 bits (75), Expect = 1.9
 Identities = 17/89 (19%), Positives = 26/89 (29%), Gaps = 3/89 (3%)

Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKP---PTTTS 1663
              T   + T         +          E  + P+ T   ++     + K    P  T+
Sbjct: 136  LATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTA 195

Query: 1664 TTTTRPKPITTIKPKPTTVKPKPTTTVKP 1692
             T +  KPI T        K KP    K 
Sbjct: 196  ETQSNSKPIETAPKADKADKTKPKPKEKA 224


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 226

 Score = 34.5 bits (79), Expect = 0.46
 Identities = 15/83 (18%), Positives = 20/83 (24%), Gaps = 2/83 (2%)

Query: 1629 VVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST--TTTRPKPITTIKPKPTTVKPKP 1686
            V     P Q  E +                P   +     + P  +      P  VKP  
Sbjct: 61   VQVVALPTQPPEGVAQEIQDAGDAAAASVDPQPVAQPPVESTPAGVPVAAQTPKPVKPPK 120

Query: 1687 TTTVKPKPTTVKPKPVIPPSTKD 1709
                   P    PKP   P  + 
Sbjct: 121  QPPAGAVPAKPTPKPEPKPVAEP 143


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 614

 Score = 35.1 bits (81), Expect = 0.50
 Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 5/85 (5%)

Query: 1634 EPEQTEEPITT-----TTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTT 1688
             P+Q  +P+ T        S           ++ +   + P+  T     P TV   P  
Sbjct: 374  GPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPA 433

Query: 1689 TVKPKPTTVKPKPVIPPSTKDEFKI 1713
             V   P +  P+ V P   K+E KI
Sbjct: 434  AVPVNPPSTAPQAVRPAQFKEEKKI 458



 Score = 32.1 bits (73), Expect = 4.1
 Identities = 22/123 (17%), Positives = 37/123 (30%), Gaps = 13/123 (10%)

Query: 1622 TTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKP----PTTTSTTTTRPKPITTIKP 1677
             + S S      +P   +       T  T +   P      P +T+    RP      K 
Sbjct: 398  ASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKK 457

Query: 1678 KPTTVKPK--PTTTVKPKPTTVKPKPVIP----PSTKDEF---KIVCYFTNWAWYRQSGG 1728
             P +      P+T    +    +    I      + K+ F    +  Y+  +A  R    
Sbjct: 458  IPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQKEIFTEEDLQYYWQEFAGTRPQEE 517

Query: 1729 KYL 1731
            K L
Sbjct: 518  KAL 520


>gnl|CDD|163190 TIGR03234, OH-pyruv-isom, hydroxypyruvate isomerase.  This enzyme
           interconverts tartronate semi-aldehyde (TSA, aka
           2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E.
           coli enzyme has been characterized and found to be
           specific for TSA, contain no cofactors, and have a
           rather high Km for hydroxypyruvate of 12.5 mM. The gene
           is ofter found in association with glyoxalate
           carboligase (which produces TSA), but has been shown to
           have no effect on growth on glyoxalate when knocked out.
           This is consistent with the fact that the gene for
           tartronate semialdehyde reductase (glxR) is also
           associated and may have primary responsibility for the
           catabolism of TSA.
          Length = 254

 Score = 34.6 bits (80), Expect = 0.51
 Identities = 14/42 (33%), Positives = 23/42 (54%), Gaps = 4/42 (9%)

Query: 585 EGITGLVSPLNSR--PGEPYPNYNINTALKLIDELGGDKRKI 624
            G+T L+ P+NS   PG  +       AL +ID++G +  K+
Sbjct: 135 IGLTLLIEPINSFDMPG--FFLTTTEQALAVIDDVGRENLKL 174


>gnl|CDD|145047 pfam01692, Paramyxo_C, Paramyxovirus non-structural protein c.  This
            family consist of the C proteins (C', C, Y1, Y2) found in
            Paramyxovirinae; human parainfluenza, and sendai virus.
            The C proteins effect viral RNA synthesis having both a
            positive and negative effect during the course of
            infection. Paramyxovirus have a negative strand ssRNA
            genome of 15.3kb form which six mRNAs are transcribed,
            five of these are monocistronic. The P/C mRNA is
            polycistronic and has two overlapping open reading frames
            P and C, C encodes the nested C proteins C', C, Y1 and
            Y2.
          Length = 204

 Score = 34.2 bits (78), Expect = 0.58
 Identities = 14/66 (21%), Positives = 26/66 (39%), Gaps = 2/66 (3%)

Query: 1637 QTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
            Q  + +++  + ++  +     PT   T        TT  PK +    KP+   K K   
Sbjct: 16   QENKNLSSMRSDSSLNSYPTSAPTPEKTEAGSMVSSTT--PKDSAHHAKPSVNTKTKQQK 73

Query: 1697 VKPKPV 1702
             +PK +
Sbjct: 74   RRPKII 79


>gnl|CDD|119361 cd06544, GH18_narbonin, Narbonin is a plant 2S protein from the
            globulin fraction of narbon bean (Vicia narbonensis L.)
            cotyledons with unknown function.  Narbonin has a
            glycosyl hydrolase family 18 (GH18) domain without the
            conserved catalytic residues and with no known enzymatic
            activity.  Narbonin amounts to up to 3% of the total seed
            globulins of mature seeds and was thought to be a storage
            protein but was found to degrade too slowly during
            germination.  This family also includes the VfNOD32
            nodulin from Vicia faba.
          Length = 253

 Score = 34.3 bits (79), Expect = 0.60
 Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 2/64 (3%)

Query: 1772 EKVTALKKK--GVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHNFDGLD 1829
            E V ++K +   VKV ++IGG             N     S  ++ + + I  +N DG+D
Sbjct: 59   EAVKSIKAQHPNVKVVISIGGRGVQNNPTPFDPSNVDSWVSNAVSSLTSIIQTYNLDGID 118

Query: 1830 LDWE 1833
            +D+E
Sbjct: 119  IDYE 122



 Score = 33.5 bits (77), Expect = 1.2
 Identities = 33/138 (23%), Positives = 62/138 (44%), Gaps = 15/138 (10%)

Query: 1405 ERVVTLKKK--GVKVSLAIGGWNDSLGGKYSRLVNSATARQRFIEHVVKFLLKYQFDGLD 1462
            E V ++K +   VKV ++IGG             N  +     +  +   +  Y  DG+D
Sbjct: 59   EAVKSIKAQHPNVKVVISIGGRGVQNNPTPFDPSNVDSWVSNAVSSLTSIIQTYNLDGID 118

Query: 1463 LDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSPSKQVINAAYDV- 1521
            +D+E       +  A PD+  E  G  + EL    K +G++  A+++PS+    + Y   
Sbjct: 119  IDYE-------HFPADPDTFVECIGQLITEL----KNNGVIKVASIAPSEDAEQSHYLAL 167

Query: 1522 -KALSESLDWISVMTYDY 1538
              A  + +D+++   Y+Y
Sbjct: 168  YNAYGDYIDYVNYQFYNY 185



 Score = 32.7 bits (75), Expect = 1.8
 Identities = 19/96 (19%), Positives = 35/96 (36%), Gaps = 20/96 (20%)

Query: 79  LKEANPELKVYLAV----KSNFVSITSDRESRL---NFISSVLEMFDMYKFDGLDLNVKD 131
           +K  +P +KV +++      N  +            N +SS+  +   Y  DG+D+    
Sbjct: 64  IKAQHPNVKVVISIGGRGVQNNPTPFDPSNVDSWVSNAVSSLTSIIQTYNLDGIDI---- 119

Query: 132 PALNDEDDDDLESIANERSDFSTFIQELSSTLRRNN 167
                    D E    +   F   I +L + L+ N 
Sbjct: 120 ---------DYEHFPADPDTFVECIGQLITELKNNG 146


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 34.7 bits (80), Expect = 0.64
 Identities = 18/86 (20%), Positives = 31/86 (36%), Gaps = 6/86 (6%)

Query: 1624 VSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPT--TTSTTTTRPKPITTIKPKPTT 1681
            + T +V     P+    P++ +T  +   T  P   +    + TT  P  +   +P PT 
Sbjct: 204  MGTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTA 263

Query: 1682 VKPKPTTTVKPKPTTVKPKPVIPPST 1707
                P T     P    P+     S+
Sbjct: 264  PSSAPATA----PAAAAPQAAATSSS 285


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 35.1 bits (80), Expect = 0.65
 Identities = 26/122 (21%), Positives = 47/122 (38%), Gaps = 8/122 (6%)

Query: 1598 GHGPGGNYESTTEEYKPTSEESKPTTVSTST--------VVTTEEPEQTEEPITTTTTST 1649
            G  PG   E    +    ++E KP+ + T +            ++PE+ ++P    +   
Sbjct: 546  GGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQR 605

Query: 1650 TTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKD 1709
             T  K PK P       +  +P +   PK      +P++  +P+   +   P  P S K 
Sbjct: 606  PTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKP 665

Query: 1710 EF 1711
             F
Sbjct: 666  PF 667



 Score = 33.5 bits (76), Expect = 1.6
 Identities = 17/86 (19%), Positives = 32/86 (37%), Gaps = 23/86 (26%)

Query: 1251 RTKCKQTSATT-----TSEQVPKP-----TKKPTKPTKKPTTTTEYNPPEATTKPS---- 1296
            RT   +T A T      +E+  +      T +P +  K+P + +E+        PS    
Sbjct: 757  RTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPSLPKK 816

Query: 1297 ---------TTTSTTTDSGAWTPNPT 1313
                     +TT   +D+G    + +
Sbjct: 817  RHRLDGLALSTTDLESDAGRIAKDAS 842



 Score = 33.1 bits (75), Expect = 2.5
 Identities = 22/110 (20%), Positives = 32/110 (29%), Gaps = 7/110 (6%)

Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
            E   ++Y   + +SK T  +     + E   +   P T  T  TT     PK P      
Sbjct: 673  EKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFP 732

Query: 1666 TTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPS---TKDEFK 1712
                      +P        P      + T     P   P      +EFK
Sbjct: 733  FEPIGDPDAEQPDDIEFFTPPEE----ERTFFHETPADTPLPDILAEEFK 778



 Score = 33.1 bits (75), Expect = 2.6
 Identities = 23/133 (17%), Positives = 37/133 (27%), Gaps = 5/133 (3%)

Query: 1572 WMKKGAPSRKLVMGV---HPLLSTITEVLGHGPGGNYESTTE--EYKPTSEESKPTTVST 1626
            ++   A S++    V       S + E L   PG  + +        P  EE     +  
Sbjct: 679  YLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGD 738

Query: 1627 STVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
                  ++ E    P    T    T    P P               T +P     +P  
Sbjct: 739  PDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS 798

Query: 1687 TTTVKPKPTTVKP 1699
             +  + KP    P
Sbjct: 799  PSEHEDKPPGDHP 811



 Score = 32.0 bits (72), Expect = 5.2
 Identities = 20/78 (25%), Positives = 27/78 (34%), Gaps = 2/78 (2%)

Query: 1252 TKCKQTSATTTSEQVPKPTKKPTKPTK--KPTTTTEYNPPEATTKPSTTTSTTTDSGAWT 1309
             K K+T  T   ++  +   K T P     P TT    PP+             D  A  
Sbjct: 684  AKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQ 743

Query: 1310 PNPTEWVWHPPTEPTTTH 1327
            P+  E+   P  E T  H
Sbjct: 744  PDDIEFFTPPEEERTFFH 761


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
            family consists of several special lobe-specific silk
            protein SSP160 sequences which appear to be specific to
            Chironomus (Midge) species.
          Length = 758

 Score = 34.7 bits (79), Expect = 0.71
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 6/41 (14%)

Query: 1634 EPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITT 1674
             P     P    TT+++TTT      TTT+TTT  P   TT
Sbjct: 669  APGSVTVPAAANTTTSSTTT------TTTTTTTAAPTTTTT 703



 Score = 34.0 bits (77), Expect = 1.1
 Identities = 14/54 (25%), Positives = 19/54 (35%)

Query: 1636 EQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
                   +TTTT+TTTTT  P   TT +       P+  +             T
Sbjct: 678  AANTTTSSTTTTTTTTTTAAPTTTTTKAANAPFTYPLCNLIMSAACSAGGAGCT 731



 Score = 32.1 bits (72), Expect = 4.7
 Identities = 14/35 (40%), Positives = 18/35 (51%)

Query: 1639 EEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
                TTT+++TTTTT       TT+TT     P T
Sbjct: 677  AAANTTTSSTTTTTTTTTTAAPTTTTTKAANAPFT 711



 Score = 31.3 bits (70), Expect = 7.6
 Identities = 15/57 (26%), Positives = 25/57 (43%)

Query: 1619 SKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTI 1675
            S     S+S   T+     T    TT++ STTTT+       +TS+  T    + ++
Sbjct: 104  SASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSGLTSGASVVSL 160


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 34.5 bits (79), Expect = 0.74
 Identities = 21/105 (20%), Positives = 44/105 (41%), Gaps = 18/105 (17%)

Query: 59  LIPEDLEYDVIKGG------YKSFLGLKEANPELKVYLAVKSNFVSITSDRESRLNFISS 112
           L+P   +Y+  +GG         F   ++A P L V L    N +   +D     ++   
Sbjct: 274 LMPLPGDYNERRGGIILDISLNEFE--QDAVPLL-VDLERNGNRIKELADF---GDYYED 327

Query: 113 VLEMFDMYKFDGLDLNVKDPALNDEDDDD----LESIANERSDFS 153
           + E+ ++ +    D+  ++    DE+ +D     ES+  E ++  
Sbjct: 328 IFEVVEVVEKQEGDVVTEE--STDEESEDEVEIDESVIEEVAEME 370


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 34.7 bits (79), Expect = 0.78
 Identities = 17/86 (19%), Positives = 25/86 (29%), Gaps = 15/86 (17%)

Query: 1636 EQTEEPITTTTTSTTTTTKRPKP-PTTTSTTTTRPKP------ITTIKPKPTTVKPKPTT 1688
                  I      T+  T++P P P      +  P P        + KP      P P  
Sbjct: 20   ANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLA-QAPTPAA 78

Query: 1689 TVKPKP-------TTVKPKPVIPPST 1707
            + K  P        +  P P + P  
Sbjct: 79   SEKFDPAPAPHQAASRAPDPAVAPQL 104



 Score = 33.5 bits (76), Expect = 1.5
 Identities = 15/113 (13%), Positives = 31/113 (27%), Gaps = 19/113 (16%)

Query: 1609 TEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTR 1668
            T         +  T +    + T+   ++ +        ++      P P    ++  +R
Sbjct: 11   TIACINLIIANLNTNIPIPELHTSAATQKPDPAPAPHQAASR----APDPAVAPTSAASR 66

Query: 1669 -------PKPITTIKPKPTTVKPKPTTTVKPKPTTV-------KPKPVIPPST 1707
                   P P  + K  P    P    +  P P          KP      ++
Sbjct: 67   KPDLAQAPTPAASEKFDP-APAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTS 118


>gnl|CDD|221562 pfam12406, DUF3664, Surface protein.  This family of proteins is
            found in eukaryotes. Proteins in this family are
            typically between 131 and 312 amino acids in length.
          Length = 100

 Score = 31.9 bits (72), Expect = 0.79
 Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 4/76 (5%)

Query: 1633 EEPEQTEEPITTTTTSTTTTTK---RPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
            ++ EQ++EP           T+    P+     + T   P+P+T+ +PK +    +PT T
Sbjct: 25   DQAEQSQEPTQQEPIEPQQPTQPETEPEELEPETVTVEVPEPVTSEEPKESDQTEEPTET 84

Query: 1690 VKPKPTTVKPKPVIPP 1705
               K      +PV  P
Sbjct: 85   QDSKQEPT-QQPVDEP 99


>gnl|CDD|140307 PTZ00284, PTZ00284, protein kinase; Provisional.
          Length = 467

 Score = 34.6 bits (79), Expect = 0.82
 Identities = 24/122 (19%), Positives = 41/122 (33%), Gaps = 10/122 (8%)

Query: 1590 LSTITEVLGHGPGGNYESTTEEYKPTSEESKP-TTVSTSTVVTTEEPEQTEEPITTTTTS 1648
            + T    L     G Y+ T+         + P    S ST  TT           +T ++
Sbjct: 1    MDTRAAGLSAARAGLYQYTSGAPVNALSGNSPKANNSASTGQTTSR---------STNSA 51

Query: 1649 TTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
              + +KR +   T++ +            K  T  P     V P P   K    +P  ++
Sbjct: 52   RRSGSKRDRETATSTDSGRTKSHEGAATTKQATTTPTTNVEVAPPPKKKKVTYALPNQSR 111

Query: 1709 DE 1710
            +E
Sbjct: 112  EE 113


>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
          Length = 191

 Score = 33.2 bits (76), Expect = 0.92
 Identities = 14/44 (31%), Positives = 17/44 (38%)

Query: 1620 KPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
            K TT S   V     PE  E P   T+    TT ++ K    T 
Sbjct: 25   KSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTK 68


>gnl|CDD|200461 cd11322, AmyAc_Glg_BE, Alpha amylase catalytic domain found in the
           Glycogen branching enzyme (also called 1,4-alpha-glucan
           branching enzyme).  The glycogen branching enzyme
           catalyzes the third step of glycogen biosynthesis by the
           cleavage of an alpha-(1,4)-glucosidic linkage and the
           formation a new alpha-(1,6)-branch by subsequent
           transfer of cleaved oligosaccharide. They are part of a
           group called branching enzymes which catalyze the
           formation of alpha-1,6 branch points in either glycogen
           or starch. This group includes proteins from bacteria,
           eukaryotes, and archaea. The Alpha-amylase family
           comprises the largest family of glycoside hydrolases
           (GH), with the majority of enzymes acting on starch,
           glycogen, and related oligo- and polysaccharides. These
           proteins catalyze the transformation of alpha-1,4 and
           alpha-1,6 glucosidic linkages with retention of the
           anomeric center. The protein is described as having 3
           domains: A, B, C. A is a (beta/alpha) 8-barrel; B is a
           loop between the beta 3 strand and alpha 3 helix of A; C
           is the C-terminal extension characterized by a Greek
           key. The majority of the enzymes have an active site
           cleft found between domains A and B where a triad of
           catalytic residues (Asp, Glu and Asp) performs
           catalysis. Other members of this family have lost the
           catalytic activity as in the case of the human 4F2hc, or
           only have 2 residues that serve as the catalytic
           nucleophile and the acid/base, such as Thermus A4
           beta-galactosidase with 2 Glu residues (GH42) and human
           alpha-galactosidase with 2 Asp residues (GH31). The
           family members are quite extensive and include: alpha
           amylase, maltosyltransferase, cyclodextrin
           glycotransferase, maltogenic amylase, neopullulanase,
           isoamylase, 1,4-alpha-D-glucan maltotetrahydrolase,
           4-alpha-glucotransferase, oligo-1,6-glucosidase,
           amylosucrase, sucrose phosphorylase, and amylomaltase.
          Length = 402

 Score = 34.0 bits (79), Expect = 0.93
 Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 2/34 (5%)

Query: 569 HVEFMSLMSYDYHGAWE-GITGLVSPLNSRPGEP 601
           HVE M +M + + G+W   +TG  +P  SR G P
Sbjct: 75  HVELMPVMEHPFDGSWGYQVTGYFAP-TSRYGTP 107


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
            recombination and chromosome segregation during
            homologous chromosome formation. This protein localises
            to the synaptonemal complex in S. cerevisiae and the
            analogous structures (linear elements) in S. pombe. This
            family is currently only found in fungi.
          Length = 706

 Score = 34.5 bits (79), Expect = 0.97
 Identities = 19/113 (16%), Positives = 33/113 (29%), Gaps = 9/113 (7%)

Query: 1602 GGNYESTTEEYKPTSEESKPTTV---STSTVVTTEEPEQTEE---PITTTTTSTTTTTKR 1655
            G + ++  + +KP   +SK       S +         +T            S+    K 
Sbjct: 467  GKSKQTKLKNFKPVPNKSKKQLANNNSQNIKSKKVVKAKTNNKANLQDVGECSSPPNNKE 526

Query: 1656 P---KPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPP 1705
                +  T++S   +    I    P     K + TT     PT  K       
Sbjct: 527  KNDKQTSTSSSVLKSDRSSIEVRNPNANVKKLEDTTYNAKFPTVSKNNAYTLV 579


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
            presumed domain is functionally uncharacterized. This
            domain family is found in bacteria and eukaryotes, and is
            typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 34.1 bits (78), Expect = 1.0
 Identities = 27/106 (25%), Positives = 39/106 (36%), Gaps = 8/106 (7%)

Query: 1607 STTEEYKPTSEESKPTTVSTSTV--VTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST 1664
              T+  KP  E S   T  +S      T +P+  E+     TT      K P+     S 
Sbjct: 234  EKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAPPIDTTEEEL--KSPEASPKESE 291

Query: 1665 TTTRPKPI-TTIKPKPTTVKPKPTTTVKPKPT---TVKPKPVIPPS 1706
              +  K   + + P P    PKP  +    P    + +PKP  PP 
Sbjct: 292  EASARKRSPSLLSPSPKAESPKPLASPGKSPRDPLSPRPKPQSPPV 337



 Score = 33.7 bits (77), Expect = 1.3
 Identities = 23/96 (23%), Positives = 36/96 (37%), Gaps = 8/96 (8%)

Query: 1614 PTSEESKPTTVSTST-VVTTEEPEQT--EEPITTTTTSTTTTTKRPKPPTTTSTTTTRPK 1670
            P    SK  + S    + ++ + E+T  E+P   T++  T  +  PKP       T  PK
Sbjct: 211  PPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKP-----RETLDPK 265

Query: 1671 PITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPS 1706
                  P  TT +   +    PK +         PS
Sbjct: 266  SPEKAPPIDTTEEELKSPEASPKESEEASARKRSPS 301



 Score = 33.7 bits (77), Expect = 1.4
 Identities = 19/113 (16%), Positives = 36/113 (31%), Gaps = 6/113 (5%)

Query: 1601 PGGNYESTTE----EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRP 1656
            PG + +S ++    +   + +  K          ++ + E++  P    T    +  K P
Sbjct: 212  PGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAP 271

Query: 1657 KPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKD 1709
               TT     +        +      K  P+    P P    PKP+  P    
Sbjct: 272  PIDTTEEELKSPEASPKESEEASAR-KRSPSLLS-PSPKAESPKPLASPGKSP 322



 Score = 32.1 bits (73), Expect = 3.7
 Identities = 20/101 (19%), Positives = 33/101 (32%), Gaps = 7/101 (6%)

Query: 1615 TSEESKPTTVSTSTVVTTEEPEQTEEPITTT-TTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
            + +E  P  +  +    +     ++  I    ++  +  TK  KP   TS+  T      
Sbjct: 197  SFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSA- 255

Query: 1674 TIKPKPTTV-KPKPTTTVKPKPTTV-KPKPVIPPSTKDEFK 1712
               PKP     PK      P  TT  + K       + E  
Sbjct: 256  ---PKPRETLDPKSPEKAPPIDTTEEELKSPEASPKESEEA 293


>gnl|CDD|212516 cd09990, Agmatinase-like, Agmatinase-like family.  Agmatinase
           subfamily currently includes metalloenzymes such as
           agmatinase, guanidinobutyrase, guanidopropionase,
           formimidoylglutamase and proclavaminate
           amidinohydrolase. Agmatinase (agmatine ureohydrolase;
           SpeB; EC=3.5.3.11) is the key enzyme in the synthesis of
           polyamine putrescine; it catalyzes hydrolysis of
           agmatine to yield putrescine and urea. This enzyme has
           been found in bacteria, archaea and eukaryotes,
           requiring divalent Mn and sometimes Zn, Co or Ca for
           activity. In mammals, the highest level of agmatinase
           mRNA was found in liver and kidney. However, catabolism
           of agmatine via agmatinase apparently is a not major
           path; it is mostly catabolized via diamine oxidase.
           Agmatinase has been shown to be down-regulated in tumor
           renal cells. Guanidinobutyrase (Gbh, EC=3.5.3.7)
           catalyzes hydrolysis of 4-guanidinobutanoate to yield
           4-aminobutanoate and urea in arginine degradation
           pathway. Activity has been shown for purified enzyme
           from Arthrobacter sp. KUJ 8602. Additionally,
           guanidinobutyrase is able to hydrolyze D-arginine,
           3-guanidinopropionate, 5-guanidinovaleriate and
           L-arginine with much less affinity, having divalent Zn
           ions for catalysis. Proclavaminate amidinohydrolase
           (Pah, EC 3.5.3.22) hydrolyzes amidinoproclavaminate to
           yield proclavaminate and urea in clavulanic acid
           biosynthesis. Activity has been shown for purified
           enzyme from Streptomyces clavuligerus. Clavulanic acid
           is the effective inhibitor of beta-lactamases. This acid
           is used in combination with the penicillin amoxicillin
           to prevent antibiotic's beta-lactam rings from
           hydrolysis, thus keeping the antibiotics biologically
           active.
          Length = 275

 Score = 33.7 bits (78), Expect = 1.1
 Identities = 10/35 (28%), Positives = 15/35 (42%), Gaps = 2/35 (5%)

Query: 220 VNIQVTSTIANFNNIESAVYNWIKKGARPEQIIIG 254
           V +        F+ I  AV    + GA P  I++G
Sbjct: 55  VPVDPGDIEKTFDRIREAVAEIAEAGAIP--IVLG 87


>gnl|CDD|119356 cd02877, GH18_hevamine_XipI_class_III, This conserved domain family
            includes xylanase inhibitor Xip-I, and the class III
            plant chitinases such as hevamine, concanavalin B, and
            PPL2, all of which have a glycosyl hydrolase family 18
            (GH18) domain. Hevamine is a class III endochitinase that
            hydrolyzes the linear polysaccharide chains of chitin and
            peptidoglycan and is important for defense against
            pathogenic bacteria and fungi.  PPL2 (Parkia platycephala
            lectin 2) is a class III chitinase from Parkia
            platycephala seeds that hydrolyzes beta(1-4) glycosidic
            bonds linking 2-acetoamido-2-deoxy-beta-D-glucopyranose
            units in chitin.
          Length = 280

 Score = 33.8 bits (78), Expect = 1.1
 Identities = 35/149 (23%), Positives = 48/149 (32%), Gaps = 50/149 (33%)

Query: 1411 KKKGVKVSLAIGG--------------------WNDSLGGKYSRLVNSATARQRFIEHVV 1450
            + KG KV L+IGG                    WN   GG       + +   R     V
Sbjct: 69   QSKGKKVLLSIGGAGGSYSLSSDADAKDFADYLWNAFGGG-------TDSGVPRPFGDAV 121

Query: 1451 KFLLKYQFDGLDLDWEYPTCWQVNCDAGPDSDKESFGLFVRELHQAFKPHGLLLSAAVSP 1510
                    DG D D E+ +    N DA            +R L  +       L+AA   
Sbjct: 122  -------VDGFDFDIEHGS--PENYDA--------LAKRLRSLFASDPSKKYYLTAAP-- 162

Query: 1511 SKQ-VINAAYDVKALSESL-DWISVMTYD 1537
              Q     A    A++  L D+I V  Y+
Sbjct: 163  --QCPYPDASLGDAIATGLFDFIFVQFYN 189



 Score = 33.4 bits (77), Expect = 1.4
 Identities = 35/143 (24%), Positives = 51/143 (35%), Gaps = 38/143 (26%)

Query: 1778 KKKGVKVTLAIGGWNDSAGNKYSRLVNSQQARSKFIAHVVNFILEHN------------F 1825
            + KG KV L+IGG    AG  YS L +   A+  F  ++ N                   
Sbjct: 69   QSKGKKVLLSIGG----AGGSYS-LSSDADAK-DFADYLWNAFGGGTDSGVPRPFGDAVV 122

Query: 1826 DGLDLDWEYPKCWQVDCKQGPASDKQGFADLIKELRAAFN---PHDLLLSAAVSPSKAVI 1882
            DG D D E                 + +  L K LR+ F         L+AA    +   
Sbjct: 123  DGFDFDIE-------------HGSPENYDALAKRLRSLFASDPSKKYYLTAAP---QCPY 166

Query: 1883 DNAYDIPVMSENL-DWISVMTYD 1904
             +A     ++  L D+I V  Y+
Sbjct: 167  PDASLGDAIATGLFDFIFVQFYN 189


>gnl|CDD|218902 pfam06121, DUF959, Domain of Unknown Function (DUF959).  This
            N-terminal domain is not expressed in the 'Short' isoform
            of Collagen A.
          Length = 202

 Score = 33.2 bits (75), Expect = 1.2
 Identities = 25/98 (25%), Positives = 34/98 (34%), Gaps = 2/98 (2%)

Query: 1613 KPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPI 1672
            K  +   KP T  + T  +   P Q   P+ +T ++TT    RP   T  STT    +  
Sbjct: 20   KKPTWLWKPYTELSPTASSAAVP-QASTPVQSTESTTTHVVPRPGE-TEESTTPASSEEP 77

Query: 1673 TTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDE 1710
              I  K        T    P  T V       P   +E
Sbjct: 78   KEIVEKGKQNVVPGTVATTPTVTPVAMDVASSPDLSEE 115


>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
            Provisional.
          Length = 418

 Score = 33.9 bits (78), Expect = 1.2
 Identities = 14/79 (17%), Positives = 19/79 (24%), Gaps = 1/79 (1%)

Query: 1605 YESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTST 1664
             E  T    P +  +             +       P      S  T     KPP     
Sbjct: 116  SEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTP-EPPAASKPTPPAAAKPPEPAPA 174

Query: 1665 TTTRPKPITTIKPKPTTVK 1683
                P P+    P+ T V 
Sbjct: 175  AKPPPTPVARADPRETRVP 193



 Score = 33.5 bits (77), Expect = 1.6
 Identities = 15/64 (23%), Positives = 15/64 (23%), Gaps = 2/64 (3%)

Query: 1656 PKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKIVC 1715
              PP          K   T   KP    P P      KPT   P    PP      K   
Sbjct: 122  GAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPT--PPAAAKPPEPAPAAKPPP 179

Query: 1716 YFTN 1719
                
Sbjct: 180  TPVA 183



 Score = 32.7 bits (75), Expect = 2.5
 Identities = 15/77 (19%), Positives = 20/77 (25%), Gaps = 1/77 (1%)

Query: 1623 TVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTV 1682
             + T        P          TT        P P    ++  T P      +P P   
Sbjct: 117  EIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAK 176

Query: 1683 KP-KPTTTVKPKPTTVK 1698
             P  P     P+ T V 
Sbjct: 177  PPPTPVARADPRETRVP 193



 Score = 32.3 bits (74), Expect = 3.2
 Identities = 14/73 (19%), Positives = 18/73 (24%)

Query: 1627 STVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKP 1686
            S + T   P            +  TT ++PK    T       KP      KP    P  
Sbjct: 116  SEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAA 175

Query: 1687 TTTVKPKPTTVKP 1699
                 P       
Sbjct: 176  KPPPTPVARADPR 188



 Score = 32.0 bits (73), Expect = 4.4
 Identities = 14/61 (22%), Positives = 15/61 (24%)

Query: 1642 ITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKP 1701
            I T                   TT  +PK        P   KP P    KP       KP
Sbjct: 118  IDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKP 177

Query: 1702 V 1702
             
Sbjct: 178  P 178



 Score = 31.2 bits (71), Expect = 7.0
 Identities = 15/71 (21%), Positives = 19/71 (26%), Gaps = 1/71 (1%)

Query: 1641 PITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK-PKPTTTVKPKPTTVKP 1699
            P++   T        P          T P+      P P      KPT     KP    P
Sbjct: 114  PLSEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAP 173

Query: 1700 KPVIPPSTKDE 1710
                PP+    
Sbjct: 174  AAKPPPTPVAR 184


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 33.5 bits (77), Expect = 1.3
 Identities = 31/126 (24%), Positives = 36/126 (28%), Gaps = 7/126 (5%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
              S +S P   ST+T   T  P       TT T S T        P           P  
Sbjct: 156  QNSGQSVPLDTSTTTDPATT-PAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQ 214

Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK--DEFKIVCYFTNWAWY--RQSGGK 1729
                   T  P    T  P      P      ST   D   +V  FT   W     + GK
Sbjct: 215  ANVDTAATPAPAAPAT--PDGAAPLPTDQAGVSTPAADPNALVMNFTADCWLEVTDATGK 272

Query: 1730 YLPSDI 1735
             L S +
Sbjct: 273  KLFSGM 278


>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family.  This family
            includes gbp a protein from Soybean that binds to GAGA
            element dinucleotide repeat DNA. It seems likely that the
            this domain mediates DNA binding. This putative domain
            contains several conserved cysteines and a histidine
            suggesting this may be a zinc-binding DNA interaction
            domain.
          Length = 301

 Score = 33.3 bits (76), Expect = 1.4
 Identities = 20/81 (24%), Positives = 26/81 (32%), Gaps = 14/81 (17%)

Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK 1691
            T E    E P  +T        K+PK       +   PK      PKP   K K      
Sbjct: 126  TREMHHLEVPPISTAPPEAKEVKKPK----KGQSPKVPK-----APKPKKPKKKG----- 171

Query: 1692 PKPTTVKPKPVIPPSTKDEFK 1712
                     P I P +K ++K
Sbjct: 172  SVSNRSVKMPGIDPRSKPDWK 192


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 33.9 bits (77), Expect = 1.4
 Identities = 30/129 (23%), Positives = 46/129 (35%), Gaps = 20/129 (15%)

Query: 1609 TEEYKPTSEESKPTTVSTSTVVTTEEPEQTE-EPITTTTTS---------TTTTTKRPKP 1658
            TE+    S+E   T      ++        + +P+T+ TTS           T    P P
Sbjct: 543  TEDLDIESDEPASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHP 602

Query: 1659 PTTTSTTTTRPKPITTIKPK--PTTVKPKPTTTVKPKPTT----VKPKPVIPPSTKDEFK 1712
              T    TT+     T  P+  P  ++P P   ++ +P T    V P P  PP  +    
Sbjct: 603  SQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITP- 661

Query: 1713 IVCYFTNWA 1721
               Y   W 
Sbjct: 662  ---YKPTWT 667



 Score = 31.6 bits (71), Expect = 7.3
 Identities = 18/72 (25%), Positives = 24/72 (33%), Gaps = 6/72 (8%)

Query: 1635 PEQTEEPITTTTTST-TTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPK 1693
            P Q  +   T    T T     P  P+ T   T  P     I+  P T++P P      +
Sbjct: 651  PHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLP-----IQWAPGTMQPPPRAPTPMR 705

Query: 1694 PTTVKPKPVIPP 1705
            P    P     P
Sbjct: 706  PPAAPPGRAQRP 717


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 33.5 bits (77), Expect = 1.4
 Identities = 15/97 (15%), Positives = 25/97 (25%), Gaps = 2/97 (2%)

Query: 1606 ESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTT 1665
            +S  +             +S   + T  E E      +T            K  +     
Sbjct: 18   KSKLQPISYIYSNVL--VLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKK 75

Query: 1666 TTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPV 1702
              + K     + +       P  + K K    KPKP 
Sbjct: 76   KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPN 112



 Score = 33.2 bits (76), Expect = 1.8
 Identities = 15/88 (17%), Positives = 22/88 (25%), Gaps = 4/88 (4%)

Query: 1602 GGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTT 1661
                  + E     SEE      +++     E+     +  +          K PK    
Sbjct: 29   SNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGE 88

Query: 1662 TSTTTTRPKPITTIKPKPTTVKPKPTTT 1689
            T      PK       K     PKP   
Sbjct: 89   TKLGFKTPKK----SKKTKKKPPKPKPN 112


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of AF4
            (Proto-oncogene AF4) and FMR2 (Fragile X E mental
            retardation syndrome) nuclear proteins. These proteins
            have been linked to human diseases such as acute
            lymphoblastic leukaemia and mental retardation. The
            family also contains a Drosophila AF4 protein homologue
            Lilliputian which contains an AT-hook domain. Lilliputian
            represents a novel pair-rule gene that acts in
            cytoskeleton regulation, segmentation and morphogenesis
            in Drosophila.
          Length = 1154

 Score = 33.7 bits (77), Expect = 1.5
 Identities = 31/123 (25%), Positives = 45/123 (36%), Gaps = 19/123 (15%)

Query: 1603 GNYESTTEEYKPTSEESKPTTVST-------------STVVTTEEPEQTEEPITTTTTST 1649
            G  +S+  +Y P S+E  P + S              S     + P Q+E P    T   
Sbjct: 479  GKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESGRGKQKSPAQSEAPPQRRTVGK 538

Query: 1650 TTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTV-----KPKPTTTVKPKPTTVK-PKPVI 1703
                K  K       T  RP+      P  ++V     +PK  T    KP+  K PK  +
Sbjct: 539  KQPKKPEKASAGDERTGLRPESEPGTLPYGSSVQTPPDRPKAATKGSRKPSPRKEPKSSV 598

Query: 1704 PPS 1706
            PP+
Sbjct: 599  PPA 601


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
            large complex of up to 33 proteins that is conserved from
            plants to fungi to humans - the number and representation
            of individual subunits varying with species. It is
            arranged into four different sections, a core, a head, a
            tail and a kinase-activity part, and the number of
            subunits within each of these is what varies with
            species. Overall, Mediator regulates the transcriptional
            activity of RNA polymerase II but it would appear that
            each of the four different sections has a slightly
            different function. Mediator subunit Hrs1/Med3 is a
            physical target for Cyc8-Tup1, a yeast transcriptional
            co-repressor.
          Length = 381

 Score = 33.5 bits (76), Expect = 1.6
 Identities = 19/82 (23%), Positives = 29/82 (35%)

Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTT 1666
            S T+    +   +  +T +T       +      P TTT   +  TT      T T+ TT
Sbjct: 130  SITKTSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTAAISATTATTPTT 189

Query: 1667 TRPKPITTIKPKPTTVKPKPTT 1688
            T+ KP    + K T        
Sbjct: 190  TQKKPRKPRQTKKTGPAAAAKA 211



 Score = 31.5 bits (71), Expect = 6.5
 Identities = 19/82 (23%), Positives = 26/82 (31%), Gaps = 14/82 (17%)

Query: 1640 EPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVK-------- 1691
            +      +++ T T       TTS+T   P     +        P  TT V         
Sbjct: 121  QLGNAGASASITKTSNGSDAATTSSTANTPA-AAKVLKANAASAPNTTTGVGSAATTAAI 179

Query: 1692 -----PKPTTVKPKPVIPPSTK 1708
                   PTT + KP  P  TK
Sbjct: 180  SATTATTPTTTQKKPRKPRQTK 201


>gnl|CDD|119360 cd06543, GH18_PF-ChiA-like, PF-ChiA is an uncharacterized chitinase
            found in the hyperthermophilic archaeon Pyrococcus
            furiosus with a glycosyl hydrolase family 18 (GH18)
            catalytic domain as well as a cellulose-binding domain.
            Members of this domain family are found not only in
            archaea but also in eukaryotes and prokaryotes. PF-ChiA
            exhibits hydrolytic activity toward both colloidal and
            crystalline (beta/alpha) chitins at high temperature.
          Length = 294

 Score = 33.0 bits (76), Expect = 1.7
 Identities = 35/216 (16%), Positives = 63/216 (29%), Gaps = 50/216 (23%)

Query: 1716 YFTNWAWYRQSGGKY-LPSDIDSDLCTHVIYGFAVLDTDQLVIKPHDTWA-----DLDNK 1769
            Y             Y   + +           F V        KP   W      D    
Sbjct: 6    YVDVTLNPPPDLTTYAAATGVK-----AFTLAFIVASGGC---KP--AWGGSYPLDQGGW 55

Query: 1770 FYEKVTALKKKGVKVTLAIGGWNDS--AGNKYSRLVNSQQARSKFIAHVVNFILEHNFDG 1827
                + AL+  G  V ++ GG + +  A +      ++ Q  + +   +  +        
Sbjct: 56   IKSDIAALRAAGGDVIVSFGGASGTPLATSC----TSADQLAAAYQKVIDAY----GLTH 107

Query: 1828 LDLDWEYPKCWQVDCKQGPASDKQGFADL----IKELRAAFNPHDLLLSAAVSPS----- 1878
            LD D E           G A       D     +  L+  +    +  +  V P+     
Sbjct: 108  LDFDIE-----------GGALTDTAAIDRRAQALALLQKEYPDLKISFTLPVLPTGLTPD 156

Query: 1879 -KAVIDNAYDIPVMSENLDWISVMTYDYHGQWDKKT 1913
               V++ A    V   +LD +++MT DY      + 
Sbjct: 157  GLNVLEAAAANGV---DLDTVNIMTMDYGSSAGSQD 189


>gnl|CDD|215044 PLN00063, PLN00063, photosystem II core complex proteins psbY;
            Provisional.
          Length = 194

 Score = 32.2 bits (73), Expect = 1.9
 Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 3/51 (5%)

Query: 1643 TTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKP---TTVKPKPTTTV 1690
            T    +T   +   +     S T T+P  + +++ KP   T  K    T V
Sbjct: 3    TMAALATKCLSINARKNPNPSKTKTKPIILLSMQNKPKGLTISKSADNTNV 53


>gnl|CDD|115671 pfam07032, DUF1322, Protein of unknown function (DUF1322).  This
           family consists of several hypothetical 9.4 kDa Borrelia
           burgdorferi (Lyme disease spirochete) proteins of around
           78 residues in length. The function of this family is
           unknown.
          Length = 80

 Score = 30.5 bits (68), Expect = 1.9
 Identities = 13/36 (36%), Positives = 22/36 (61%)

Query: 527 NFVKLIQELKAEFDKHDYSIAVGISGYKEILEVAYD 562
            + KLI E+  E DK+ + I +GI  Y ++ ++ YD
Sbjct: 21  KYFKLIDEITREADKYYFPIIIGICSYDDVKKLPYD 56


>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 475

 Score = 33.0 bits (75), Expect = 2.3
 Identities = 12/61 (19%), Positives = 18/61 (29%)

Query: 1653 TKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
             +    P + +     P P  T  P      P        KP   KP+    P     +K
Sbjct: 15   EQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWK 74

Query: 1713 I 1713
            +
Sbjct: 75   L 75


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins in
            this family for which functions are known are components
            of a multiprotein complex used for targeting nucleotide
            excision repair to specific parts of the genome. In
            humans, Rad23 complexes with the XPC protein. This family
            is based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
            replication, recombination, and repair].
          Length = 378

 Score = 32.9 bits (75), Expect = 2.3
 Identities = 12/71 (16%), Positives = 22/71 (30%), Gaps = 6/71 (8%)

Query: 1642 ITTTTTSTTTTTKRPKPPT--TTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKP 1699
            ++   T T         PT   T T +    P + +   P +   + + + +    T   
Sbjct: 74   VSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPE 133

Query: 1700 KPVIPPSTKDE 1710
                 PST   
Sbjct: 134  ----SPSTSVP 140


>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
            region.  This highly conserved region is found towards
            the C-terminus of the transmembrane domain. The function
            is unclear.
          Length = 151

 Score = 31.5 bits (71), Expect = 2.3
 Identities = 16/98 (16%), Positives = 27/98 (27%), Gaps = 4/98 (4%)

Query: 1611 EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPP----TTTSTTT 1666
            E++P  E ++   +     V           +     +      +P        +   + 
Sbjct: 53   EHEPNQEVTEVEVIIEKEPVPAVAVAPVPVAVVAPVVAPKPKKSQPVMSQEKTASPQKSV 112

Query: 1667 TRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIP 1704
              P P    K K   V+P P   V       K  PV  
Sbjct: 113  PAPSPKEKKKKKVAKVEPAPAKAVAVPVLASKSAPVPA 150


>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47. 
          Length = 627

 Score = 32.9 bits (74), Expect = 2.4
 Identities = 28/130 (21%), Positives = 45/130 (34%), Gaps = 26/130 (20%)

Query: 1604 NYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTS 1663
            N++S       ++  S   ++++  + +T  P     P+T   T   T T +  P T T 
Sbjct: 232  NFDSILTTTPSSTPSSTSASITSPHIPSTNIPTPEPPPVTKNFTELHTDTIKVTPNTPTI 291

Query: 1664 TTTTR------------PKPI--------------TTIKPKPTTVKPKPTTTVKPKPTTV 1697
            T  T             P+P+               TIK +  T  P       PKPT  
Sbjct: 292  TAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPKSPPKPTNF 351

Query: 1698 KPKPVIPPST 1707
            +   +  P T
Sbjct: 352  ENTTIRIPET 361


>gnl|CDD|115650 pfam07010, Endomucin, Endomucin.  This family consists of several
            mammalian endomucin proteins. Endomucin is an early
            endothelial-specific antigen that is also expressed on
            putative hematopoietic progenitor cells.
          Length = 259

 Score = 32.4 bits (73), Expect = 2.6
 Identities = 23/119 (19%), Positives = 35/119 (29%), Gaps = 5/119 (4%)

Query: 1585 GVHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITT 1644
                L +      G  P G   S   +    S  +  TT       TT    + E   + 
Sbjct: 45   NTVSLKNVNKPTTGTPPKGTTTSELLKTSLMSTATSLTTPKHELKTTTTGVRKNESSTSK 104

Query: 1645 TTTSTTTT-----TKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVK 1698
             T +  T      T +     T + ++ R   I+          PK T T     TT +
Sbjct: 105  VTVTNVTLSNAVSTLQSSQNKTENQSSIRTTEISPTSVLQPDASPKKTGTTSASLTTAE 163



 Score = 32.0 bits (72), Expect = 3.9
 Identities = 28/118 (23%), Positives = 41/118 (34%), Gaps = 8/118 (6%)

Query: 1604 NYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTT------TSTTTTTKRPK 1657
            N    +E  K  +  S  TT +T   +TT      +     TT      T+T+   K   
Sbjct: 15   NSLCNSEGVKEAANNSLVTTSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELLKTSL 74

Query: 1658 PPTTTSTTTTRPKPITTIKP--KPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKI 1713
              T TS TT + +  TT     K  +   K T T       V         T+++  I
Sbjct: 75   MSTATSLTTPKHELKTTTTGVRKNESSTSKVTVTNVTLSNAVSTLQSSQNKTENQSSI 132


>gnl|CDD|99803 cd06207, CyPoR_like, NADPH cytochrome p450 reductase (CYPOR) serves
           as an electron donor in several oxygenase systems and is
           a component of nitric oxide synthases and methionine
           synthase reductases. CYPOR transfers two electrons from
           NADPH to the heme of cytochrome p450 via FAD and FMN.
           Ferredoxin-NADP+ (oxido)reductase is an FAD-containing
           enzyme that catalyzes the reversible electron transfer
           between NADP(H) and electron carrier proteins such as
           ferredoxin and flavodoxin. Isoforms of these
           flavoproteins (i.e. having a non-covalently bound FAD as
           a prosthetic group) are present in chloroplasts,
           mitochondria, and bacteria in which they participate in
           a wide variety of redox metabolic pathways. The
           C-terminal domain contains most of the NADP(H) binding
           residues and the N-terminal domain interacts
           non-covalently with the isoalloxazine rings of the
           flavin molecule which lies largely in a large gap
           betweed the two domains. Ferredoxin-NADP+ reductase
           first accepts one electron from reduced ferredoxin to
           form a flavin semiquinone intermediate. The enzyme then
           accepts a second electron to form FADH2 which then
           transfers two electrons and a proton to NADP+ to form
           NADPH.
          Length = 382

 Score = 32.6 bits (75), Expect = 3.0
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 446 ISSSPLVTHGKVKILIAIGGWTDSSGEK 473
           ISSSPL    +V +L+++  W   SG  
Sbjct: 169 ISSSPLKNPNEVHLLVSLVSWKTPSGRS 196


>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432).  This
            presumed domain is functionally uncharacterized. This
            domain is found in eukaryotes. This domain is about 100
            amino acids in length. This domain is found associated
            with pfam00096. This domain has two conserved sequence
            motifs: YPSPV and PSP.
          Length = 100

 Score = 30.5 bits (68), Expect = 3.1
 Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 1/70 (1%)

Query: 1638 TEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKP-TTVKPKPTTTVKPKPTT 1696
               P++T + + +  +  P     +   T+ P P+ T    P ++  P P  T  P P+ 
Sbjct: 3    KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSPVHTSFPSPSI 62

Query: 1697 VKPKPVIPPS 1706
                P + P+
Sbjct: 63   ATTYPSVSPT 72


>gnl|CDD|212088 cd11519, SLC5sbd_SMCT1, Na(+)/monocarboxylate cotransporter SMCT1 and
            related proteins; solute-binding domain.  SMCT1 is a
            high-affinity transporter of various monocarboxylates
            including lactate and pyruvate, short-chain fatty acids,
            ketone bodies, nicotinate and its structural analogs,
            pyroglutamate, benzoate and its derivatives, and iodide.
            Human SMCT1 (hSMCT1, also called AIT) is encoded by the
            tumor suppressor gene SLC5A8. Its expression is under the
            control of the C/EBP transcription factor. Its
            tumor-suppressive role is related to uptake of butyrate,
            propionate, and pyruvate, these latter are inhibitors of
            histone deacetylases. SMCT1 is expressed in the colon,
            small intestine, kidney, thyroid gland, retina, and
            brain. SMCT1 may contribute to the intestinal/colonic and
            oral absorption of monocarboxylate drugs. SMCT1 also
            mediates iodide transport from thyrocyte into the colloid
            lumen in thyroid gland and through transporting l-lactate
            and ketone bodies helps maintain the energy status and
            the function of neurons. In the kidney its expression is
            limited to the S3 segment of the proximal convoluted
            tubule (in contrast to the low-affinity monocarboxylate
            transporter SMCT2, belonging to a different family, which
            is expressed along the entire length of the tubule). In
            the retina, SMCT1 and SMCT2 may play a differential role
            in monocarboxylate transport in a cell type-specific
            manner, SMCT1 is expressed predominantly in retinal
            neurons and in retinal pigmented epithelial (RPE) cells.
            This subgroup belongs to the solute carrier 5 (SLC5)
            transporter family.
          Length = 541

 Score = 32.4 bits (74), Expect = 3.6
 Identities = 19/60 (31%), Positives = 23/60 (38%), Gaps = 14/60 (23%)

Query: 1612 YKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
            Y P  E + P  +ST            E    T T S TTTT   + P T   T   P+P
Sbjct: 457  YPPLPERTLPLPLST------------EGCNFTNTESNTTTT--TEMPFTIFPTQPSPRP 502


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 32.3 bits (73), Expect = 4.1
 Identities = 14/90 (15%), Positives = 28/90 (31%), Gaps = 2/90 (2%)

Query: 1607 STTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTT 1666
            + +      ++ +      T T     +  Q  + I  +  + +  T    PP       
Sbjct: 355  APSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMT--AYPPVPQFCGD 412

Query: 1667 TRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
                     +   T+  P+P   V P+PT 
Sbjct: 413  PGLVSPYNPQSPGTSYGPEPVGPVPPQPTN 442


>gnl|CDD|145556 pfam02478, Pneumo_phosprot, Pneumovirus phosphoprotein.  This family
            represents the phosphoprotein of Paramyxoviridae, a
            putative RNA polymerase alpha subunit that may function
            in template binding.
          Length = 266

 Score = 31.8 bits (72), Expect = 4.2
 Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 4/77 (5%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRP---- 1669
              +  S+   + T +  TT +    ++ I +        +   KP T TS   T      
Sbjct: 38   KVNTISEKFELPTISKPTTPKGPCIKDEIISVNKVKDIESIYEKPVTPTSDGKTPTEKSD 97

Query: 1670 KPITTIKPKPTTVKPKP 1686
               T  K K +  +P+P
Sbjct: 98   DTPTKKKKKVSFKEPEP 114



 Score = 30.6 bits (69), Expect = 9.2
 Identities = 14/62 (22%), Positives = 20/62 (32%), Gaps = 3/62 (4%)

Query: 1650 TTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKD 1709
            T + K   P  +  TT   PK         +  K K   ++  KP T       P    D
Sbjct: 41   TISEKFELPTISKPTT---PKGPCIKDEIISVNKVKDIESIYEKPVTPTSDGKTPTEKSD 97

Query: 1710 EF 1711
            + 
Sbjct: 98   DT 99


>gnl|CDD|212074 cd11505, SLC5sbd_SMCT, Na(+)/monocarboxylate cotransporters SMCT1 and
            2 and related proteins; solute-binding domain.  SMCT1 is
            a high-affinity transporter of various monocarboxylates
            including lactate and pyruvate, short-chain fatty acids,
            ketone bodies, nicotinate and its structural analogs,
            pyroglutamate, benzoate and its derivatives, and iodide.
            Human SMCT1 (hSMCT1, also called AIT) is encoded by the
            tumor suppressor gene SLC5A8. SMCT1 is expressed in the
            colon, small intestine, kidney, thyroid gland, retina,
            and brain. SMCT1 may contribute to the intestinal/colonic
            and oral absorption of monocarboxylate drugs. It also
            mediates iodide transport from thyrocyte into the colloid
            lumen in thyroid gland and, through transporting
            L-lactate and ketone bodies, helps maintain the energy
            status and the function of neurons. SMCT2 is a
            low-affinity transporter for short-chain fatty acids,
            lactate, pyruvate, and nicotinate. hSMCT2 is encoded by
            the SLC5A12 gene. SMCT2 is expressed in the kidney, small
            intestine, skeletal muscle, and retina. In the kidney,
            SMCT2 may initiate lactate absorption in the early parts
            of the tubule, SMCT1 in the latter parts of the tubule.
            In the retina, SMCT1 and SMCT2 may play a differential
            role in monocarboxylate transport in a cell type-specific
            manner. This subgroup belongs to the solute carrier 5
            (SLC5) transporter family.
          Length = 536

 Score = 32.0 bits (73), Expect = 4.3
 Identities = 19/60 (31%), Positives = 23/60 (38%), Gaps = 14/60 (23%)

Query: 1612 YKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKP 1671
            Y P  E + P  +ST            E    T T S TTTT   + PTT   T    +P
Sbjct: 457  YPPPPERTLPLPLST------------EGCNFTNTESNTTTT--TEMPTTIFPTQPSSRP 502


>gnl|CDD|237052 PRK12313, PRK12313, glycogen branching enzyme; Provisional.
          Length = 633

 Score = 32.2 bits (74), Expect = 4.7
 Identities = 17/35 (48%), Positives = 21/35 (60%), Gaps = 4/35 (11%)

Query: 569 HVEFMSLMSYDYHGAWEG--ITGLVSPLNSRPGEP 601
           HVEFM LM +   G+W G  +TG  +P  SR G P
Sbjct: 187 HVEFMPLMEHPLDGSW-GYQLTGYFAP-TSRYGTP 219


>gnl|CDD|201431 pfam00763, THF_DHG_CYH, Tetrahydrofolate
            dehydrogenase/cyclohydrolase, catalytic domain. 
          Length = 117

 Score = 30.2 bits (69), Expect = 5.2
 Identities = 14/45 (31%), Positives = 24/45 (53%), Gaps = 8/45 (17%)

Query: 1772 EKVTALKKKGVKVTLAI---GGWNDSAGNKYSRLVNSQQARSKFI 1813
            E+V  LK+KG+   LA+   G  +D A   Y   V S++  ++ +
Sbjct: 18   EEVAKLKEKGITPKLAVILVG--DDPASQVY---VRSKRKAAEEL 57


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is a
            family of proteins conserved in fungi. The function is
            not known.
          Length = 436

 Score = 31.7 bits (72), Expect = 6.2
 Identities = 18/74 (24%), Positives = 27/74 (36%), Gaps = 3/74 (4%)

Query: 1242 KMKTCDWPYRTKCKQTSATTTSEQVPKPTKKPTKPTKKPTTTTEYNPPEATTKPSTTTST 1301
            K K    P   +  + S+  T        +KP KP K   +T   +P     K  +T S 
Sbjct: 52   KPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKS---KAPSTESE 108

Query: 1302 TTDSGAWTPNPTEW 1315
              +    TP+P   
Sbjct: 109  EEEEPEETPDPIAS 122


>gnl|CDD|183673 PRK12676, PRK12676, bifunctional inositol-1
           monophosphatase/fructose-1,6-bisphosphatase; Reviewed.
          Length = 263

 Score = 31.0 bits (71), Expect = 6.5
 Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 5/47 (10%)

Query: 252 IIGIPFFGKSYRLFNRSE--YGLGATVKGPGTEGKYTQMPGYLAFFE 296
           I GIPF+  S  +F   +  YG    V    T   Y  +PG  A+  
Sbjct: 96  INGIPFYAISIAVFKGGKPVYGY---VYNLATGDFYEAIPGKGAYLN 139


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
            component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 31.4 bits (71), Expect = 6.5
 Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 6/53 (11%)

Query: 1657 KPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTKD 1709
            KP ++ +    + KP     P       KP ++ +PK      KP  PPS+ D
Sbjct: 201  KPSSSAAPAAPKAKPSPP--PPKEEEVEKPASSPEPK----ASKPSAPPSSGD 247


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This protein,
            which interacts with both microtubules and TRAF3 (tumour
            necrosis factor receptor-associated factor 3), is
            conserved from worms to humans. The N-terminal region is
            the microtubule binding domain and is well-conserved; the
            C-terminal 100 residues, also well-conserved, constitute
            the coiled-coil region which binds to TRAF3. The central
            region of the protein is rich in lysine and glutamic acid
            and carries KKE motifs which may also be necessary for
            tubulin-binding, but this region is the least
            well-conserved.
          Length = 506

 Score = 31.4 bits (71), Expect = 6.6
 Identities = 14/78 (17%), Positives = 22/78 (28%), Gaps = 9/78 (11%)

Query: 1633 EEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKP 1692
            +E  + E              K  +PP        +       +      + K    V+ 
Sbjct: 121  KEKPKEEPK----DRKPKEEAKEKRPPKEKEKEKEKKV-----EEPRDREEEKKRERVRA 171

Query: 1693 KPTTVKPKPVIPPSTKDE 1710
            K    KP    PP+ K E
Sbjct: 172  KSRPKKPPKKKPPNKKKE 189


>gnl|CDD|212004 TIGR04281, peripla_PGF_1, putative ABC transporter PGF-CTERM-modified
            substrate-binding protein.  Members of this archaeal
            protein family resemble periplasmic substrate-binding
            proteins of ABC transporters and appear in gene
            neighborhoods with permease and ATP-binding cassette
            proteins. Notably, essentially all members also have the
            PGF-CTERM putative protein-sorting domain at the
            C-terminus, while more distant homologs (excluded by the
            trusted cutoff) instead have what appear to be
            lipoprotein signal peptides at the N-terminus.
          Length = 330

 Score = 31.2 bits (71), Expect = 6.6
 Identities = 21/76 (27%), Positives = 28/76 (36%), Gaps = 9/76 (11%)

Query: 1601 PGGNYESTT--EEYKPTSEESK------PTTV-STSTVVTTEEPEQTEEPITTTTTSTTT 1651
            P   Y STT  EE    +  +       P  V +  T+     PE  EE  T  T + TT
Sbjct: 223  PTPAYNSTTAVEEGNVVAVNANYLSQPAPRVVEAVETLAEAFHPEAYEEAETADTEADTT 282

Query: 1652 TTKRPKPPTTTSTTTT 1667
                       +TTT+
Sbjct: 283  YDDSTTDAEDGTTTTS 298


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
            family is conserved in fungi but the function is not
            known.
          Length = 509

 Score = 31.4 bits (71), Expect = 6.6
 Identities = 21/110 (19%), Positives = 37/110 (33%), Gaps = 17/110 (15%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTT-----------TTKRPKPPTTT 1662
            P S E  P    +S    T  P+  ++ + +T +++TT           T       T+T
Sbjct: 91   PPSSEPTP-APPSSESTATRTPDPNQQALESTESTSTTSADCNDSEQSSTPNLNSSDTST 149

Query: 1663 STTTTRPKPITT----IKPKPTTVKPKPTTTVKPKPTTVKPKPVIPPSTK 1708
            S++   P              ++ +        P PT    +P   P  K
Sbjct: 150  SSSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKS-AEPTAAPQAK 198


>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
          Length = 463

 Score = 31.3 bits (70), Expect = 6.6
 Identities = 22/84 (26%), Positives = 33/84 (39%), Gaps = 10/84 (11%)

Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKP------TTVKPK 1685
            T EP      I+ +  + +  T   K P TT     +P P    K KP         KPK
Sbjct: 155  TVEPGTKVAIISKSEDAASQVTPSQKIPETTDP---KPSPPAEDKQKPKVESAPVAEKPK 211

Query: 1686 PTTTVKPKPTTVKPKPVIPPSTKD 1709
              ++  P P     +P +PP  ++
Sbjct: 212  APSS-PPPPKQSAKEPQLPPKERE 234


>gnl|CDD|178321 PLN02719, PLN02719, triacylglycerol lipase.
          Length = 518

 Score = 31.2 bits (70), Expect = 7.1
 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 6/58 (10%)

Query: 2007 VRDRKRRIGPYAFKG-DQWVGFDDQAMIHHKAEFVKYNDLGGAMIWALDLDDFKNFCG 2063
            +RD  R+I     +G D W G  D      ++E ++Y ++  A   A D D F  +CG
Sbjct: 88   LRDTWRKI-----QGEDDWAGLMDPMDPVLRSELIRYGEMAQACYDAFDFDPFSRYCG 140


>gnl|CDD|146962 pfam04574, DUF592, Protein of unknown function (DUF592).  This region
            is found in some SIR2 family proteins (pfam02146).
          Length = 153

 Score = 30.1 bits (68), Expect = 7.7
 Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 18/139 (12%)

Query: 1680 TTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFKIVCYFTNWAWYRQSGGKYLPSDIDSDL 1739
            + V   P   +         K V PP TKD+      F  +   R+   KYLP++++S  
Sbjct: 1    SLVPQIPKKPIIIGKNPKTGKFVFPPITKDDSLNARMFLKYYGLRKFLDKYLPTELNSLY 60

Query: 1740 CTHVI--YGFAVLDTDQLVIKPHDTWADLDNKFYEKVTALKKKGVKVTLAIGGWNDSAGN 1797
              ++I   GF + D             +L N  Y+ V           L+   + D    
Sbjct: 61   IYYLIKLLGFELKDQ------------ELLNAIYKNVD--VDANSAEKLSYTDFEDPLEK 106

Query: 1798 KY-SRLV-NSQQARSKFIA 1814
            K+  RL+ + Q+A +K ++
Sbjct: 107  KHIVRLIKDLQKAMNKVLS 125


>gnl|CDD|218481 pfam05177, RCSD, RCSD region.  Proteins contain this region include
            C.elegans UNC-89. This region is found repeated in UNC-89
            and shows conservation in prolines, lysines and glutamic
            acids. Proteins with RCSD are involved in muscle M-line
            assembly, but the function of this region RCSD is not
            clear.
          Length = 101

 Score = 29.1 bits (64), Expect = 7.9
 Identities = 20/85 (23%), Positives = 28/85 (32%)

Query: 1614 PTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPIT 1673
            P  +E  P   S S    TEE   TEE  +      +      KP + T    +  K  T
Sbjct: 5    PGKKEKSPLRRSPSRTEKTEEDRATEEAKSPEKKEKSPEKVDGKPKSPTKKEKSPEKSAT 64

Query: 1674 TIKPKPTTVKPKPTTTVKPKPTTVK 1698
                 PT  +  P    +   +  K
Sbjct: 65   EEVKSPTKKEKSPEKVEEKPASPTK 89


>gnl|CDD|215584 PLN03113, PLN03113, DNA ligase 1; Provisional.
          Length = 744

 Score = 31.5 bits (71), Expect = 8.1
 Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 7/77 (9%)

Query: 1644 TTTTSTTTTTKRPKPPT-------TTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTT 1696
             T + +++  KR    T       T  +  T PK   TI+PK  + KP+ +T+   + + 
Sbjct: 25   QTQSQSSSPKKRKIGETQDANLGKTNVSEGTLPKTEDTIEPKSDSAKPRSSTSSIAEDSK 84

Query: 1697 VKPKPVIPPSTKDEFKI 1713
               K     S   + ++
Sbjct: 85   TGTKKAQTLSKPKKDEM 101


>gnl|CDD|165461 PHA03191, PHA03191, UL14 tegument protein; Provisional.
          Length = 238

 Score = 30.8 bits (69), Expect = 8.6
 Identities = 20/81 (24%), Positives = 26/81 (32%), Gaps = 7/81 (8%)

Query: 1632 TEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIKPKPTTVK---PKPTT 1688
             EE ++  E + T          R     T    T      T     PT V+    + T 
Sbjct: 137  IEELDEEAEALLTKWILEQKPRPRLPTAKTAPPHTRDGTRTTEKARVPTGVQDTNSQATD 196

Query: 1689 TVKPKPTTVKP----KPVIPP 1705
               P PTT  P    + VI P
Sbjct: 197  QNLPPPTTSTPSISREHVIHP 217


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 30.1 bits (68), Expect = 9.0
 Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 5/59 (8%)

Query: 1641 PITTTTTSTTTTTKRPKPPTTTST---TTTRPKPITTIKPKPTTVKPKPTTT--VKPKP 1694
              T    + T TT     P  T+T       P+P+ T   +  TV P  + T    P+P
Sbjct: 70   FRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAAREVKTVVPVSSVTPVTPPRP 128


>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106).  This
            family of proteins are found in large numbers in the
            Trichomonas vaginalis proteome. The function of this
            protein is unknown.
          Length = 422

 Score = 30.8 bits (69), Expect = 9.3
 Identities = 25/130 (19%), Positives = 44/130 (33%), Gaps = 6/130 (4%)

Query: 1586 VHPLLSTITEVLGHGPGGNYESTTEEYKPTSEESKPTTVSTSTVVTTEEPEQTEEP-ITT 1644
            V+  L T ++V  +   G Y      Y P         +++ T +    P +   P +  
Sbjct: 131  VNAFLRTKSQVGHYAADGTYVPAGGTYIPAG---GTYILASGTYIPPNPPREAPAPGLPK 187

Query: 1645 TTTSTTTTTKR--PKPPTTTSTTTTRPKPITTIKPKPTTVKPKPTTTVKPKPTTVKPKPV 1702
            T TS+     R  PKP    +      +P      +    +P+       +  T +    
Sbjct: 188  TFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNPAQ 247

Query: 1703 IPPSTKDEFK 1712
             PP T+   K
Sbjct: 248  QPPQTEQGHK 257


>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
          Length = 417

 Score = 31.0 bits (70), Expect = 9.6
 Identities = 21/105 (20%), Positives = 40/105 (38%), Gaps = 10/105 (9%)

Query: 1611 EYKPTSEESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPK 1670
            +Y+ T +  + +    +  +T    ++T+  +   +  T    K+   P +       PK
Sbjct: 64   KYRKTCKRCRVSDEDINRFLTKTFEDKTQVKVKVVSAPTK---KKKAMPKSVV---RAPK 117

Query: 1671 PITTIKPK---PTTVKPKPTTTVKPKPTTVKPKPVIPPSTKDEFK 1712
            P+    P     +  KP P+  V        P P + PS KD  +
Sbjct: 118  PLENPVPAQAESSGSKPVPSIPVSTPEVKA-PAPALTPSQKDRLE 161


>gnl|CDD|227600 COG5275, COG5275, BRCT domain type II [General function prediction
            only].
          Length = 276

 Score = 30.5 bits (68), Expect = 9.8
 Identities = 17/84 (20%), Positives = 28/84 (33%), Gaps = 1/84 (1%)

Query: 1617 EESKPTTVSTSTVVTTEEPEQTEEPITTTTTSTTTTTKRPKPPTTTSTTTTRPKPITTIK 1676
             +   TT        +         I+   T+T+     P      +T+ ++P    T  
Sbjct: 10   SDGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRA 69

Query: 1677 -PKPTTVKPKPTTTVKPKPTTVKP 1699
              KP   K + +TT K K  T   
Sbjct: 70   TRKPAQPKAEKSTTSKSKSHTTTA 93


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.134    0.421 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 121,802,524
Number of extensions: 11888196
Number of successful extensions: 13606
Number of sequences better than 10.0: 1
Number of HSP's gapped: 12065
Number of HSP's successfully gapped: 462
Length of query: 2417
Length of database: 10,937,602
Length adjustment: 113
Effective length of query: 2304
Effective length of database: 5,925,600
Effective search space: 13652582400
Effective search space used: 13652582400
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (29.6 bits)