BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015393
(408 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255547716|ref|XP_002514915.1| conserved hypothetical protein [Ricinus communis]
gi|223545966|gb|EEF47469.1| conserved hypothetical protein [Ricinus communis]
Length = 394
Score = 585 bits (1508), Expect = e-164, Method: Compositional matrix adjust.
Identities = 305/418 (72%), Positives = 337/418 (80%), Gaps = 34/418 (8%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGF--------SSGEPKTTYRKKARFSPD-V 51
M+T+RGLCYPR + L +R+V KRR ++GE + RK+ R SP+ +
Sbjct: 1 MKTRRGLCYPRTDALDLDKRLV-----KRRKLGEIIAAATATGE-QMVCRKRQRLSPEKI 54
Query: 52 AG-KSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKT 110
AG K+D F ALPDDLV ILCKLSS+A CPSDFVNVL TCKR+NGL LNS+VLSKAS KT
Sbjct: 55 AGDKTDFFYALPDDLVTCILCKLSSSASCPSDFVNVLATCKRLNGLGLNSIVLSKASPKT 114
Query: 111 FAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQA 170
FA+KA NWS+SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHA A
Sbjct: 115 FALKAKNWSDSAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAPA 174
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG
Sbjct: 175 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 234
Query: 231 RRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGC 290
RRFLVQANARELAAVLS+ ++TR+W+ WNP + + NH H +G GC
Sbjct: 235 RRFLVQANARELAAVLSNPN-SDLNTRAWVNWNP-----------HAHSNHR-HASGPGC 281
Query: 291 PLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSV 350
PLLSDFGCNVPAPEAHPASRF+A+WFA RGG+PG GLRLCSHVGCGRPETR+HEFRRCSV
Sbjct: 282 PLLSDFGCNVPAPEAHPASRFMADWFATRGGSPGQGLRLCSHVGCGRPETRKHEFRRCSV 341
Query: 351 CGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGEDNVVADS 408
CGAVNYCSRACQALDWKLRHK +CAP ERW+ E+G G G G NVV DS
Sbjct: 342 CGAVNYCSRACQALDWKLRHKEECAPVERWVVEDGEGG-----EAGVGDGNQNVVVDS 394
>gi|225425418|ref|XP_002271442.1| PREDICTED: F-box protein At1g67340-like [Vitis vinifera]
Length = 385
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 300/386 (77%), Positives = 325/386 (84%), Gaps = 21/386 (5%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+RG+CY R VD+C RVV KRRG SG+ RK+ R +P+ AGKSD F+A
Sbjct: 1 MRTRRGVCY-REVDMCTDNRVV-----KRRGDFSGDNVVCCRKRQRLAPESAGKSDFFEA 54
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
LPDDLV+ IL KLSS+A CPSDF+NVL+TCKR NGL L+SLVLS+AS K+ A++A NWSE
Sbjct: 55 LPDDLVICILSKLSSSAGCPSDFINVLLTCKRFNGLGLHSLVLSRASSKSLAVRAKNWSE 114
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
SA RFLKLCADAGNVEACYTLGMIRFYCLQNR SGASLMAKAAISSHA ALYSLAVIQFN
Sbjct: 115 SALRFLKLCADAGNVEACYTLGMIRFYCLQNRVSGASLMAKAAISSHAPALYSLAVIQFN 174
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GS GSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN+AEGRRFLVQANAR
Sbjct: 175 GSVGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNVAEGRRFLVQANAR 234
Query: 241 ELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNV 300
ELAAVLSSA+ +S+RSWLTWNP H+ HVTG GCPLLSDFGCNV
Sbjct: 235 ELAAVLSSAS-SAVSSRSWLTWNP--------------HQHHRHVTGSGCPLLSDFGCNV 279
Query: 301 PAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRA 360
PAPEAHPASRFLAEWFAARGG PGPGLRLCSH GCGRPE RRHEFRRCSVCG VNYCSRA
Sbjct: 280 PAPEAHPASRFLAEWFAARGGVPGPGLRLCSHAGCGRPEMRRHEFRRCSVCGIVNYCSRA 339
Query: 361 CQALDWKLRHKADCAPAERWLDEEGN 386
CQALDWKLRHKA CAP ERWL ++G+
Sbjct: 340 CQALDWKLRHKAACAPVERWLVDDGD 365
>gi|224053499|ref|XP_002297844.1| predicted protein [Populus trichocarpa]
gi|222845102|gb|EEE82649.1| predicted protein [Populus trichocarpa]
Length = 403
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 285/387 (73%), Positives = 322/387 (83%), Gaps = 17/387 (4%)
Query: 1 MRTKRGLCYPR---VVDVC---LGQRVVPVN-DKKRRGFSSGEPKTTYRKKARFSPDVAG 53
MRT+RG+ YPR V+ C +R ++ + F++GE ++ S G
Sbjct: 1 MRTRRGISYPRRGVAVNACNTAAEKRTSSTTYNRGKPDFTAGEYMVCRKRNRLVSTGKKG 60
Query: 54 KSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAI 113
++DLF++LPDDLV+SILCKLSS+A CPSDF+NVLITC+R+NGL L+SLVLSKAS K+FAI
Sbjct: 61 ETDLFESLPDDLVISILCKLSSSASCPSDFINVLITCRRLNGLGLHSLVLSKASPKSFAI 120
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
KANNWS+SAHRFLKLCADAGN EACYTLGMIRFYCLQNRGSGASLMAKAAISSHA ALYS
Sbjct: 121 KANNWSDSAHRFLKLCADAGNAEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAPALYS 180
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LAVIQFNGSGGSK+DKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN+ EGRRF
Sbjct: 181 LAVIQFNGSGGSKSDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNVTEGRRF 240
Query: 234 LVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLL 293
LVQANARELAAVLS+ G+ TR+WLTWNP + HP +++P+ NG GCPLL
Sbjct: 241 LVQANARELAAVLSNPD-SGLPTRAWLTWNPHA------HPNHRHPSENGPS---GCPLL 290
Query: 294 SDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGA 353
SDFGCNVPAPEAHPASRF+ EWFA RGG+ G GLRLCSH GCGRPETR+HEFRRCSVCGA
Sbjct: 291 SDFGCNVPAPEAHPASRFMTEWFAIRGGSAGSGLRLCSHTGCGRPETRKHEFRRCSVCGA 350
Query: 354 VNYCSRACQALDWKLRHKADCAPAERW 380
VNYCSRACQALDWKLRHK +CAP ERW
Sbjct: 351 VNYCSRACQALDWKLRHKEECAPVERW 377
>gi|224075553|ref|XP_002304680.1| predicted protein [Populus trichocarpa]
gi|222842112|gb|EEE79659.1| predicted protein [Populus trichocarpa]
Length = 403
Score = 566 bits (1458), Expect = e-159, Method: Compositional matrix adjust.
Identities = 283/387 (73%), Positives = 316/387 (81%), Gaps = 16/387 (4%)
Query: 1 MRTKRGLCYPR-----VVDVCLGQRVVPVNDKKRR-GFSSGEPKTTYRKKARFSPDVAGK 54
MRT+RGL YPR D G+R + K+ R F++G+ +K S G+
Sbjct: 1 MRTRRGLSYPRGAAVNACDTAAGKRTATTSYKRERPDFAAGDYLVCRKKNRLISTQKTGE 60
Query: 55 SDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIK 114
+DLFD+LPDDLV+SILCKLSS+A CPSDF+NVL+TCKR+NGL L+SLVLSKAS KTFA+K
Sbjct: 61 TDLFDSLPDDLVISILCKLSSSASCPSDFINVLLTCKRLNGLGLHSLVLSKASPKTFAVK 120
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
A NWS+SAHRFLKL ADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISS+A ALYSL
Sbjct: 121 AKNWSDSAHRFLKLFADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSYAPALYSL 180
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
AVIQFNGSGGSK+DKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN+ EGRRFL
Sbjct: 181 AVIQFNGSGGSKSDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNVTEGRRFL 240
Query: 235 VQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLS 294
VQANARELAAVLS G TR+W +WNP + HP +++P NG GCPLLS
Sbjct: 241 VQANARELAAVLSKHN-SGFPTRTWFSWNPHA------HPNHRHPTGNGPS---GCPLLS 290
Query: 295 DFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAV 354
DFGCNVPAPE+HPASRF+ EWFA RGG+ G GLRLCSH GCGR ETR+HEFRRCSVCGAV
Sbjct: 291 DFGCNVPAPESHPASRFMTEWFAIRGGSSGSGLRLCSHTGCGRSETRKHEFRRCSVCGAV 350
Query: 355 NYCSRACQALDWKLRHKADCAPAERWL 381
NYCSRACQALDWKLRHK CAP ERW+
Sbjct: 351 NYCSRACQALDWKLRHKEGCAPVERWV 377
>gi|356512373|ref|XP_003524894.1| PREDICTED: F-box protein At1g67340-like [Glycine max]
Length = 394
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 281/418 (67%), Positives = 323/418 (77%), Gaps = 34/418 (8%)
Query: 1 MRTKRGLCYPRVVD-VCLGQRVVPVNDK-------KRRGFSSGEPKTTYRKKARFSPDVA 52
MRT+RG CY VV +C RV +K +G+ RK+ + +P+
Sbjct: 1 MRTRRGACYSGVVSRMCSDARVSKKKNKDLHMHMHMHMHVPAGDSIIYSRKRQKKTPEKT 60
Query: 53 GKSD--LFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKT 110
+D F++LPDDLV+SI CKLSSTA PSDFVN+LITCKR+N LAL+SLVLSKAS KT
Sbjct: 61 AGADYEFFESLPDDLVISIFCKLSSTATKPSDFVNILITCKRLNRLALHSLVLSKASPKT 120
Query: 111 FAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQA 170
F IKA +W +SAH+FLK CADAGNVEACYTLGMIRFYCLQNRGSGAS MAKAAI+SHA+A
Sbjct: 121 FTIKARDWCDSAHKFLKHCADAGNVEACYTLGMIRFYCLQNRGSGASFMAKAAINSHARA 180
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
LY+LAVIQFNGSGG+K+DKDLRAGVALCARAAFLGH+DA+RELGHCLQDGYGVRQNIAEG
Sbjct: 181 LYALAVIQFNGSGGTKSDKDLRAGVALCARAAFLGHVDAMRELGHCLQDGYGVRQNIAEG 240
Query: 231 RRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGC 290
RRFLVQANARELAAVLS+ G + R+WL+WN QP G GC
Sbjct: 241 RRFLVQANARELAAVLST----GAAARTWLSWNL--QPQQLRQ-------------GSGC 281
Query: 291 PLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSV 350
PLLSDFGCNVPAPE HPAS F+AEWFAARGG+PGPGLRLCSH GCGRPETR+HEFRRCSV
Sbjct: 282 PLLSDFGCNVPAPEVHPASLFMAEWFAARGGSPGPGLRLCSHAGCGRPETRKHEFRRCSV 341
Query: 351 CGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGEDNVVADS 408
CG VNYCSRACQALDWK RHKA+C+P +RWL+E+G D+ + +G GE V+ DS
Sbjct: 342 CGVVNYCSRACQALDWKFRHKAECSPVQRWLEEDGE-----DVGNDDGDGEVEVMVDS 394
>gi|356525160|ref|XP_003531195.1| PREDICTED: F-box protein At1g67340-like [Glycine max]
Length = 391
Score = 547 bits (1410), Expect = e-153, Method: Compositional matrix adjust.
Identities = 282/411 (68%), Positives = 319/411 (77%), Gaps = 30/411 (7%)
Query: 1 MRTKRGLCYPRVVD-VCLGQRVVPVNDKK---RRGFSSGEPKTTYRKKARFSPD--VAGK 54
MRT+RG CYP VV +C RV K +G+ RKK + +P+ AG+
Sbjct: 1 MRTRRGACYPGVVSRMCSNVRVTKKKSKDLHMHMHVPAGDSIIYSRKKQKKTPEKLAAGR 60
Query: 55 SDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIK 114
D F++LPDDLV+SI CKLSSTA PSDFVNVLIT KR+N LAL+SLVLSKAS +TF IK
Sbjct: 61 YDFFESLPDDLVISIFCKLSSTATKPSDFVNVLITSKRLNRLALHSLVLSKASPRTFTIK 120
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
A W +SAH+FLK CADAGN+EACYTLGMIRFYCLQNRGSGASLMAKAAI+SHA+ALY+L
Sbjct: 121 AKYWCDSAHKFLKHCADAGNIEACYTLGMIRFYCLQNRGSGASLMAKAAINSHARALYAL 180
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
AVIQFNGSGG+K+DKDLRAGVALCARAAFLGH+DALRELGHCLQDGYGVRQNIAEGRRFL
Sbjct: 181 AVIQFNGSGGTKSDKDLRAGVALCARAAFLGHVDALRELGHCLQDGYGVRQNIAEGRRFL 240
Query: 235 VQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLS 294
VQANARELAAVLS+ G + R+ L+WN QP G GCPLLS
Sbjct: 241 VQANARELAAVLST----GAAARTLLSWN--LQPQQLRQ-------------GSGCPLLS 281
Query: 295 DFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAV 354
DFGCNVPAP+ HPAS F+ EWFAA GG+PGPGLRLCS+ GCGRPETR+HEFRRCSVCG V
Sbjct: 282 DFGCNVPAPDVHPASLFMTEWFAAHGGSPGPGLRLCSNAGCGRPETRKHEFRRCSVCGEV 341
Query: 355 NYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGEDNVV 405
NYCSRACQALDWK RHKA+CAP ERWLDE+G D+ + +G GE VV
Sbjct: 342 NYCSRACQALDWKFRHKAECAPVERWLDEDGE-----DVGNDDGDGEVEVV 387
>gi|449435083|ref|XP_004135325.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
gi|449494931|ref|XP_004159686.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
Length = 382
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 282/392 (71%), Positives = 310/392 (79%), Gaps = 19/392 (4%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+ GL YP + N K++ ++ P R K + SDLFD+
Sbjct: 1 MRTRTGLSYPPLQPTAFASG----NRKRKHHVAADRPFCRKRNKLSYHIKTP-TSDLFDS 55
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
LPDDLV++IL LSS A PSDF+N+L+TCKR+N L LN +VLS+AS+KTFAI+A NW+E
Sbjct: 56 LPDDLVITILSNLSSAASSPSDFINILLTCKRLNNLGLNPMVLSRASQKTFAIRAKNWTE 115
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
SAHRFLK C+DAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI SHA ALYSLAVIQFN
Sbjct: 116 SAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFN 175
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI EGRRFLVQANAR
Sbjct: 176 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANAR 235
Query: 241 ELAAVLSSAACQGISTRSW-LTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCN 299
ELAAVLSS A ++ LTW N P H+ HVTG GCPLLSDFGCN
Sbjct: 236 ELAAVLSSEAASSVAASRSCLTW-------------NTQPPHHRHVTGSGCPLLSDFGCN 282
Query: 300 VPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSR 359
+PAPEAHPAS+FLAEWF ARGG+PG GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSR
Sbjct: 283 IPAPEAHPASQFLAEWFEARGGSPGNGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSR 342
Query: 360 ACQALDWKLRHKADCAPAERWLDEEGNGDVDV 391
ACQALDWKLRHK DCAP ERWLD+ G+G DV
Sbjct: 343 ACQALDWKLRHKIDCAPVERWLDDNGDGMDDV 374
>gi|356540140|ref|XP_003538548.1| PREDICTED: F-box protein At1g67340-like [Glycine max]
Length = 350
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 264/364 (72%), Positives = 299/364 (82%), Gaps = 21/364 (5%)
Query: 42 RKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSL 101
RK+ + SPD SD FD+LPDDLV+SILCKLSSTA PSDF++VLITCKR+N L L+SL
Sbjct: 4 RKRQKTSPD--KTSDCFDSLPDDLVLSILCKLSSTATSPSDFISVLITCKRLNSLGLHSL 61
Query: 102 VLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAK 161
VLSKAS KTF++KA NW +S HRFLK CADAGN+EACYTLG+IRFYCLQNRGSGA+LMAK
Sbjct: 62 VLSKASHKTFSVKAKNWCDSLHRFLKHCADAGNIEACYTLGLIRFYCLQNRGSGAALMAK 121
Query: 162 AAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 221
AA++SHA ALYSLAVIQFNGSGG+KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY
Sbjct: 122 AAMNSHAPALYSLAVIQFNGSGGTKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 181
Query: 222 GVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNH 281
GV+QN+ EGRRFLVQAN RELAAVLS+ G + R WLTW+ P PHPR ++
Sbjct: 182 GVKQNVTEGRRFLVQANVRELAAVLSA----GNNKRPWLTWS----HQPVPHPRLRSE-- 231
Query: 282 NGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETR 341
TG GCPLLSD+GCNVPAPE HPAS+F++EWF RGG P G+RLCS GCGRPETR
Sbjct: 232 ----TGSGCPLLSDYGCNVPAPEVHPASQFMSEWFDIRGGFPSAGMRLCSQSGCGRPETR 287
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGE 401
RHEFRRCSVCG VNYCSRACQALDWK RHKA+CAP ERWLD++ D D+ GG+
Sbjct: 288 RHEFRRCSVCGVVNYCSRACQALDWKFRHKAECAPVERWLDDD-----DEDVAGENDGGD 342
Query: 402 DNVV 405
D V+
Sbjct: 343 DGVM 346
>gi|356566106|ref|XP_003551276.1| PREDICTED: F-box protein At1g67340-like [Glycine max]
Length = 348
Score = 529 bits (1363), Expect = e-148, Method: Compositional matrix adjust.
Identities = 259/346 (74%), Positives = 291/346 (84%), Gaps = 19/346 (5%)
Query: 42 RKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSL 101
RK+ + SPD D FD+LPDDLV+SILCKLSSTA PSDF++VLITCKR+N L L+SL
Sbjct: 4 RKRQKTSPDKT-SDDFFDSLPDDLVLSILCKLSSTATSPSDFISVLITCKRLNSLGLHSL 62
Query: 102 VLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAK 161
VLSKAS KTF++KA NW +S HRFLK CADAGN+EACYTLGMIRFYCLQNRGSGASLMAK
Sbjct: 63 VLSKASHKTFSVKAKNWCDSLHRFLKHCADAGNIEACYTLGMIRFYCLQNRGSGASLMAK 122
Query: 162 AAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 221
AA++SHA ALYSLAVIQFNGSGG+KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY
Sbjct: 123 AAMNSHAPALYSLAVIQFNGSGGTKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 182
Query: 222 GVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNH 281
GV+ N+ EGRRFLVQANARELAAVLS+ G + R WLTW+ HP HPR ++
Sbjct: 183 GVKLNVTEGRRFLVQANARELAAVLSA----GNTKRPWLTWS----HHPVTHPRLRS--- 231
Query: 282 NGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETR 341
GCPL+SD+GCNVPAP+AHPAS+F++EWF RGG PGP +RLCSH GCGRPETR
Sbjct: 232 -------GCPLMSDYGCNVPAPDAHPASQFMSEWFDIRGGFPGPDMRLCSHSGCGRPETR 284
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNG 387
RHEFRRCSVCG VNYCSRACQALDWK RHKA+CA ERWLD++G G
Sbjct: 285 RHEFRRCSVCGLVNYCSRACQALDWKFRHKAECATVERWLDDDGEG 330
>gi|297838467|ref|XP_002887115.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297332956|gb|EFH63374.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 266/362 (73%), Positives = 297/362 (82%), Gaps = 20/362 (5%)
Query: 48 SPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKAS 107
+P+ +DL D++PDDLVVSILCKL ST+RCP+DF+NVL+TCKR+ GLA+N LVLS+ S
Sbjct: 33 APEHTTGADLLDSIPDDLVVSILCKLGSTSRCPADFINVLMTCKRLKGLAMNPLVLSRLS 92
Query: 108 KKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSH 167
K A+KA+NWSE +HRFLK C DAG++EACYTLGMIRFYCLQNRG+GASLMAKAAISSH
Sbjct: 93 PKAIAVKAHNWSEYSHRFLKRCVDAGSLEACYTLGMIRFYCLQNRGNGASLMAKAAISSH 152
Query: 168 AQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
A ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGH+DALRELGHCLQDGYGV QN+
Sbjct: 153 APALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQDGYGVPQNV 212
Query: 228 AEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTG 287
+EGRRFLVQANARELAAVLSS GI RS TW LSQ P PNH G
Sbjct: 213 SEGRRFLVQANARELAAVLSS----GIQARS--TWLSLSQTPPV-------PNH-----G 254
Query: 288 IGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGT-PGPGLRLCSHVGCGRPETRRHEFR 346
CPLLSDFGCNVPAPE HPA+RFLA+WFA RGG PG GLRLCSH GCGRPETR+HEFR
Sbjct: 255 QSCPLLSDFGCNVPAPETHPANRFLADWFAVRGGDCPGDGLRLCSHAGCGRPETRKHEFR 314
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGEDNVVA 406
RCSVCG VNYCSRACQALDWKLRHK DCAP ERWL+E G+ +V + D G GE+ ++
Sbjct: 315 RCSVCGVVNYCSRACQALDWKLRHKMDCAPVERWLEEGDGGEGNVQI-DGNGNGENVLLQ 373
Query: 407 DS 408
S
Sbjct: 374 MS 375
>gi|297738456|emb|CBI27657.3| unnamed protein product [Vitis vinifera]
Length = 437
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 281/386 (72%), Positives = 304/386 (78%), Gaps = 46/386 (11%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+RG+CY R VD+C RVV KRRG SG+ RK+ R +P+ AGKSD F+A
Sbjct: 1 MRTRRGVCY-REVDMCTDNRVV-----KRRGDFSGDNVVCCRKRQRLAPESAGKSDFFEA 54
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
LPDDLV+ IL KLSS+A CPSDF+NVL+TCKR NGL L+SLVLS+AS K+ A++A NWSE
Sbjct: 55 LPDDLVICILSKLSSSAGCPSDFINVLLTCKRFNGLGLHSLVLSRASSKSLAVRAKNWSE 114
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
SA RFLKLCADAGNVEACYTLGMIRFYCLQNR SGASLMAKAAISSHA ALYSLAVIQFN
Sbjct: 115 SALRFLKLCADAGNVEACYTLGMIRFYCLQNRVSGASLMAKAAISSHAPALYSLAVIQFN 174
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GS GSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN+AEGRRFLVQANAR
Sbjct: 175 GSVGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNVAEGRRFLVQANAR 234
Query: 241 ELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNV 300
ELAAVLSSA+ +SDFGCNV
Sbjct: 235 ELAAVLSSAS----------------------------------------SAVSDFGCNV 254
Query: 301 PAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRA 360
PAPEAHPASRFLAEWFAARGG PGPGLRLCSH GCGRPE RRHEFRRCSVCG VNYCSRA
Sbjct: 255 PAPEAHPASRFLAEWFAARGGVPGPGLRLCSHAGCGRPEMRRHEFRRCSVCGIVNYCSRA 314
Query: 361 CQALDWKLRHKADCAPAERWLDEEGN 386
CQALDWKLRHKA CAP ERWL ++G+
Sbjct: 315 CQALDWKLRHKAACAPVERWLVDDGD 340
>gi|357463003|ref|XP_003601783.1| F-box protein [Medicago truncatula]
gi|355490831|gb|AES72034.1| F-box protein [Medicago truncatula]
Length = 381
Score = 525 bits (1351), Expect = e-146, Method: Compositional matrix adjust.
Identities = 266/398 (66%), Positives = 305/398 (76%), Gaps = 30/398 (7%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+ GLCYP+ + L N+ KR P+ T SD F+
Sbjct: 1 MRTRTGLCYPQTLQ--LSNMFSDTNNNKRH---KRMPQNT--------------SDFFEL 41
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
LPDD+V+SIL KLSSTA PSDF++VLITCKR+NGL+LNS+VLSKAS KT ++KA NW +
Sbjct: 42 LPDDIVLSILGKLSSTATSPSDFISVLITCKRLNGLSLNSVVLSKASNKTLSVKAKNWCD 101
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
SAHRFLK C DAGN+EACYTLGMIRFYCLQNRGSGASLMAKAA+ SHA +LYSLAVIQFN
Sbjct: 102 SAHRFLKRCVDAGNIEACYTLGMIRFYCLQNRGSGASLMAKAAMKSHAPSLYSLAVIQFN 161
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGG+KNDKDLRAGVALCARAAFLGHID LRELGHCLQDGYGV+QN+ EGRRFLVQANAR
Sbjct: 162 GSGGTKNDKDLRAGVALCARAAFLGHIDGLRELGHCLQDGYGVKQNVIEGRRFLVQANAR 221
Query: 241 ELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNV 300
ELAAVLS+ + + LTW+ P + P+ + G V+G GCPLLSDFGCNV
Sbjct: 222 ELAAVLSNG---NNNKQPLLTWS--VNPGSSQPPQLRVLAGAGSVSGSGCPLLSDFGCNV 276
Query: 301 PAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRA 360
P E HPA+RF+ EWF RGG PGPG+RLCS+ GCGRPETR+HEFRRCSVCG VNYCSRA
Sbjct: 277 PVQETHPANRFMTEWFDIRGGFPGPGMRLCSNSGCGRPETRKHEFRRCSVCGVVNYCSRA 336
Query: 361 CQALDWKLRHKADCAPAERWLDE------EGNGDVDVD 392
CQALDWK RHKA+CAP ERWL+E EG+G V+VD
Sbjct: 337 CQALDWKFRHKAECAPVERWLEEEEHIAGEGDGGVNVD 374
>gi|18408746|ref|NP_564894.1| zinc finger (MYND type) family protein / F-box family protein
[Arabidopsis thaliana]
gi|75173237|sp|Q9FYF9.1|FB76_ARATH RecName: Full=F-box protein At1g67340
gi|9828618|gb|AAG00241.1|AC002130_6 F1N21.16 [Arabidopsis thaliana]
gi|16209726|gb|AAL14418.1| At1g67340/F1N21_16 [Arabidopsis thaliana]
gi|20260406|gb|AAM13101.1| unknown protein [Arabidopsis thaliana]
gi|23197876|gb|AAN15465.1| unknown protein [Arabidopsis thaliana]
gi|332196512|gb|AEE34633.1| zinc finger (MYND type) family protein / F-box family protein
[Arabidopsis thaliana]
Length = 379
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 264/354 (74%), Positives = 293/354 (82%), Gaps = 22/354 (6%)
Query: 55 SDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIK 114
+DL D++PDDLV+SILCKL ST+RCP+DF+NVL+TCKR+ GLA+N +VLS+ S K A+K
Sbjct: 41 ADLLDSIPDDLVISILCKLGSTSRCPADFINVLLTCKRLKGLAMNPIVLSRLSPKAIAVK 100
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
A+NWSE +HRFLK C DAG++EACYTLGMIRFYCLQNRG+GASLMAKAAISSHA ALYSL
Sbjct: 101 AHNWSEYSHRFLKRCVDAGSLEACYTLGMIRFYCLQNRGNGASLMAKAAISSHAPALYSL 160
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
AVIQFNGSGGSKNDKDLRAGVALCARAAFLGH+DALRELGHCLQDGYGV QN++EGRRFL
Sbjct: 161 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQDGYGVPQNVSEGRRFL 220
Query: 235 VQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLS 294
VQANARELAAVLSS GI RS TW LSQP P PNH CPLLS
Sbjct: 221 VQANARELAAVLSS----GIQARS--TWLSLSQPPPP-----VVPNHGQQT----CPLLS 265
Query: 295 DFGCNVPAPEAHPASRFLAEWFAARGGT-PGPGLRLCSHVGCGRPETRRHEFRRCSVCGA 353
DFGCNVPAPE HPA+RFLA+WFA RGG PG GLRLCSH GCGRPETR+HEFRRCSVCG
Sbjct: 266 DFGCNVPAPETHPANRFLADWFAVRGGDCPGDGLRLCSHAGCGRPETRKHEFRRCSVCGV 325
Query: 354 VNYCSRACQALDWKLRHKADCAPAERWLDE--EGNGDVDVDLNDAEGGGEDNVV 405
VNYCSRACQALDWKLRHK DCAP +RWL+E G G+V +D N G DNV+
Sbjct: 326 VNYCSRACQALDWKLRHKMDCAPVQRWLEEGDGGEGNVQIDGN----GNGDNVL 375
>gi|449450584|ref|XP_004143042.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
gi|449494547|ref|XP_004159577.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
Length = 371
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 259/342 (75%), Positives = 278/342 (81%), Gaps = 26/342 (7%)
Query: 43 KKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLV 102
K+ RF P K D FD+LPDDL++SIL KL S+A PS F+N LITCKR N L +SLV
Sbjct: 16 KRNRFYP----KFDFFDSLPDDLLISILSKLVSSASSPSHFINALITCKRFNHLGRHSLV 71
Query: 103 LSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKA 162
LSKAS++T I A NWSESAHRFLK C DAGNVEACY LGMIRFYCLQNRGSGASLMAKA
Sbjct: 72 LSKASQRTLGISAKNWSESAHRFLKQCVDAGNVEACYILGMIRFYCLQNRGSGASLMAKA 131
Query: 163 AISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYG 222
AISS+A ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGH+DALRELGHCLQDGYG
Sbjct: 132 AISSYAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHVDALRELGHCLQDGYG 191
Query: 223 VRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHN 282
V QNI EGRRFLVQANARELAAVLSS + ++RS +T NP H
Sbjct: 192 VCQNIMEGRRFLVQANARELAAVLSSPS--SAASRSLVTCNP----------------HA 233
Query: 283 GHVTGIG----CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRP 338
H T +G CPLLSDFGCNVPAPEAHPASRF+AEWFA RGG PGPGLRLCSHVGCGRP
Sbjct: 234 IHRTAVGSSSDCPLLSDFGCNVPAPEAHPASRFMAEWFALRGGAPGPGLRLCSHVGCGRP 293
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
ETRRHEFRRCSVCG VNYCSRACQALDWKL+HKA+CAP E W
Sbjct: 294 ETRRHEFRRCSVCGIVNYCSRACQALDWKLQHKAECAPLEIW 335
>gi|255540243|ref|XP_002511186.1| conserved hypothetical protein [Ricinus communis]
gi|223550301|gb|EEF51788.1| conserved hypothetical protein [Ricinus communis]
Length = 349
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 245/346 (70%), Positives = 269/346 (77%), Gaps = 17/346 (4%)
Query: 42 RKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSL 101
RK+ + S KSDLFD LPDD+VV IL KLSS+A CPSDF+N+L TCKR+N LAL +
Sbjct: 4 RKRQKTSRRTPEKSDLFDELPDDIVVCILSKLSSSASCPSDFINILFTCKRLNRLALQPV 63
Query: 102 VLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAK 161
VLSKA +TFA+KA NWS+SAHRFLKLC +AGN EA YTLGMIRFYCLQNRG GASLMAK
Sbjct: 64 VLSKAGPQTFAVKAKNWSDSAHRFLKLCINAGNTEASYTLGMIRFYCLQNRGVGASLMAK 123
Query: 162 AAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 221
AAI SHA ALYSLAV+QFNGSGGSK DKDLRAGV+LCARAA LGHIDALRELGHCLQDGY
Sbjct: 124 AAIKSHAPALYSLAVMQFNGSGGSKIDKDLRAGVSLCARAAVLGHIDALRELGHCLQDGY 183
Query: 222 GVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNH 281
GV QNIAEGRR LVQANARELA+ S RS LTW P +Q H +
Sbjct: 184 GVAQNIAEGRRLLVQANARELAS----------SLRSMLTWQPHNQHH-------RQYAS 226
Query: 282 NGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETR 341
+ GCPLLSDFGCNVPA E HPA+RFL+EWF + G GPGLRLCSH GCGRPETR
Sbjct: 227 CEVMESAGCPLLSDFGCNVPAREVHPANRFLSEWFESGRGLLGPGLRLCSHSGCGRPETR 286
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNG 387
HEFRRCSVCG VNYCSR CQALDWKLRHK +C P E+WL E +G
Sbjct: 287 PHEFRRCSVCGTVNYCSRGCQALDWKLRHKMECVPLEQWLVVEDDG 332
>gi|224136173|ref|XP_002322257.1| predicted protein [Populus trichocarpa]
gi|222869253|gb|EEF06384.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 244/366 (66%), Positives = 273/366 (74%), Gaps = 23/366 (6%)
Query: 39 TTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLAL 98
T RK+ R ++ K DLFD LPDDLV+ IL KLSS+A CPSDF+N+LITCKR+ LAL
Sbjct: 3 TAQRKRLR---KISEKPDLFDQLPDDLVLYILFKLSSSASCPSDFINILITCKRLKQLAL 59
Query: 99 NSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL 158
+ LVLSK KTFA+KA WS+SAH+FLK C +AGN EA YTLGMIRFYCLQNRGSGASL
Sbjct: 60 DPLVLSKVGVKTFAVKAERWSDSAHQFLKQCVNAGNREALYTLGMIRFYCLQNRGSGASL 119
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
MAKAAI SHA ALYSLAVIQFNGSGGSKNDK+LRAGVALCARAA LGHIDALRELGHCLQ
Sbjct: 120 MAKAAIKSHASALYSLAVIQFNGSGGSKNDKNLRAGVALCARAAVLGHIDALRELGHCLQ 179
Query: 219 DGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQN 278
DGYGV Q+I +GRRFLVQANA+ELA +S RS LTW P Q H QN
Sbjct: 180 DGYGVPQDIVQGRRFLVQANAKELA----------LSLRSMLTWKP-QQQHV---DEQQN 225
Query: 279 PNHNGHVTG---IGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGC 335
++ V G GCPLLSDFGCNVPA E HP + FL EWF +R G GLRLCSH GC
Sbjct: 226 LHYACSVMGTAITGCPLLSDFGCNVPAREVHPVNEFLKEWFESRPGMLDHGLRLCSHSGC 285
Query: 336 GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW---LDEEGNGDVDVD 392
GRPETR EFRRCSVCG VNYCSR CQALDWKLRHK +C P E+W L++ G+ +
Sbjct: 286 GRPETRPREFRRCSVCGTVNYCSRGCQALDWKLRHKVECVPMEQWQGVLEDGGDNGIGGM 345
Query: 393 LNDAEG 398
+ EG
Sbjct: 346 VEIEEG 351
>gi|224122012|ref|XP_002318728.1| f-box family protein [Populus trichocarpa]
gi|222859401|gb|EEE96948.1| f-box family protein [Populus trichocarpa]
Length = 348
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 238/359 (66%), Positives = 267/359 (74%), Gaps = 26/359 (7%)
Query: 42 RKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSL 101
RK+ R + K D FD LPDDLV++ILCKLSS+A CPSDF+NVLITCK + LA N L
Sbjct: 4 RKRLR---TITEKPDYFDQLPDDLVLNILCKLSSSASCPSDFINVLITCKTLKRLATNPL 60
Query: 102 VLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAK 161
VLSKA K FA+KA WS+SAH+FLK C AGN EA YTLGMIRFYCLQNRGSGASLMAK
Sbjct: 61 VLSKAGAKMFAMKAERWSDSAHQFLKQCVKAGNSEASYTLGMIRFYCLQNRGSGASLMAK 120
Query: 162 AAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 221
AAI SHA A+YSLAVIQFNGSGGSK DK+LRAGVALCARA+ LGH+DALRELGHCLQDGY
Sbjct: 121 AAIKSHALAMYSLAVIQFNGSGGSKTDKNLRAGVALCARASVLGHVDALRELGHCLQDGY 180
Query: 222 GVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNH 281
GV QNI EGRR LVQANA+ELA +S RS +TW P Q QN +H
Sbjct: 181 GVPQNIVEGRRLLVQANAKELA----------LSLRSMMTWKPQQQ--------QQNLHH 222
Query: 282 NGHVTG---IGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRP 338
V G GCPLLSDFGCN+ A E HPAS+FL EWF +R G GLRLCSH GCGRP
Sbjct: 223 ACTVMGTAITGCPLLSDFGCNLLAREVHPASKFLREWFESRSGALDDGLRLCSHSGCGRP 282
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW--LDEEGNGDVDVDLND 395
ETR HEFRRCSVCG VNYCSR CQALDWK RHK +C P ++W + E+G V+++ D
Sbjct: 283 ETRTHEFRRCSVCGTVNYCSRGCQALDWKARHKVECVPMDQWHGVLEDGGDMVEMEEGD 341
>gi|449454830|ref|XP_004145157.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
gi|449474252|ref|XP_004154118.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
Length = 351
Score = 446 bits (1147), Expect = e-123, Method: Compositional matrix adjust.
Identities = 226/340 (66%), Positives = 259/340 (76%), Gaps = 24/340 (7%)
Query: 43 KKARFSPDVA-GKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSL 101
K+ R S A G SDLFD LPDDLVV +L KLS+TA P D VNV+ITCKR N LAL+ +
Sbjct: 5 KRRRTSTSAATGISDLFDGLPDDLVVVVLGKLSATASSPLDLVNVMITCKRFNRLALHPI 64
Query: 102 VLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAK 161
VLSKA K F I+ NWSES HRFLKLC AGNVEACYTLGMI FYCL+NRG+GASLMAK
Sbjct: 65 VLSKAGPKAFEIQTKNWSESTHRFLKLCVSAGNVEACYTLGMIEFYCLKNRGTGASLMAK 124
Query: 162 AAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 221
AAI SH ALYSLA++QFNGSGGSK+DK+LRAGVALC RAA+LGHIDALRELGHCLQDGY
Sbjct: 125 AAIKSHPLALYSLAIVQFNGSGGSKSDKNLRAGVALCGRAAYLGHIDALRELGHCLQDGY 184
Query: 222 GVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNH 281
GV QN++EGRR L++ANARELA ++T + L + H + H ++
Sbjct: 185 GVPQNVSEGRRLLIEANARELA---------------YITNSSLRRLHCSRHRQS----- 224
Query: 282 NGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETR 341
T G LLSDFGCN PAPEAHPA++FL +WFA+ G+ GLRLCS+ GCGRPETR
Sbjct: 225 ---ATDSGGSLLSDFGCNFPAPEAHPANQFLRDWFASGRGSIAEGLRLCSNSGCGRPETR 281
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
HEFRRCSVCG VNYCSR CQALDWKLRHK +CAP +RWL
Sbjct: 282 LHEFRRCSVCGNVNYCSRGCQALDWKLRHKTECAPFDRWL 321
>gi|449440552|ref|XP_004138048.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
gi|449519733|ref|XP_004166889.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
Length = 354
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 228/364 (62%), Positives = 264/364 (72%), Gaps = 31/364 (8%)
Query: 43 KKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLV 102
K+ R + SDLFD LPDDL++ +L +L+++A PSD +N+L+TCKR+N L LN +V
Sbjct: 5 KRPRTCRNFTVDSDLFDVLPDDLLIHLLSRLAASASSPSDLLNLLLTCKRLNRLVLNPMV 64
Query: 103 LSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKA 162
L KA K FA++ NWS+SAHRFLK C DAGN EA YTLGMIRFYCL+NRGSGASLMAKA
Sbjct: 65 LCKAGPKAFAVRMRNWSDSAHRFLKRCVDAGNSEASYTLGMIRFYCLRNRGSGASLMAKA 124
Query: 163 AISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYG 222
AI SHA ALYSLAVIQFNGSGGSK+DKDL+AGVALCARAAFLGH+DALRELGHCLQDGYG
Sbjct: 125 AIKSHAPALYSLAVIQFNGSGGSKSDKDLQAGVALCARAAFLGHVDALRELGHCLQDGYG 184
Query: 223 VRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHN 282
VRQN EGRR LVQANARELA VL S++ TW + Q N +
Sbjct: 185 VRQNSDEGRRLLVQANARELATVLLSSSS---------TW------------QQQRHNQS 223
Query: 283 GH---VTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPG-PGLRLCSHVGCGRP 338
G+ +T C LLSDFGCNVPAPE HP + FL EWF + G GLRLCSH GCGR
Sbjct: 224 GNLPDLTATRCSLLSDFGCNVPAPEPHPVNLFLREWFESEGEVAARVGLRLCSHSGCGRG 283
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN------GDVDVD 392
ETR HEFRRCSVCG VNYCSR CQA DWK+RHK +C +RW DE+ N G V+ +
Sbjct: 284 ETRPHEFRRCSVCGTVNYCSRGCQAQDWKVRHKEECTTVQRWRDEDANNAGEMFGIVEEE 343
Query: 393 LNDA 396
+ DA
Sbjct: 344 VEDA 347
>gi|356508806|ref|XP_003523145.1| PREDICTED: F-box protein At1g67340-like [Glycine max]
Length = 344
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 228/337 (67%), Positives = 256/337 (75%), Gaps = 15/337 (4%)
Query: 54 KSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAI 113
K DLF+ LPDDL+V IL KLSSTA PSDF+N+++TCKR+N L L+ LVLSKA+ K FAI
Sbjct: 11 KHDLFECLPDDLLVLILSKLSSTASSPSDFINIILTCKRLNRLGLHRLVLSKAASKLFAI 70
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
K NWSE AH FLK CA+AGNV+ACYTLGMIRFYCL+NRGSG SLMAKAA+ HA ALYS
Sbjct: 71 KPKNWSEYAHSFLKHCANAGNVDACYTLGMIRFYCLKNRGSGLSLMAKAAMKLHAPALYS 130
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LAVIQFNGSGGSK DKDLRAGVAL ARA+ LGHIDALRELGHCLQDGYGVRQN+ EGRR
Sbjct: 131 LAVIQFNGSGGSKRDKDLRAGVALSARASLLGHIDALRELGHCLQDGYGVRQNVTEGRRL 190
Query: 234 LVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLL 293
LVQAN RELA VL + A S LTW L R Q N+N + + CPLL
Sbjct: 191 LVQANVRELAYVLRAVAPSRCYETSLLTWRAL---------RCQRNNNNNNAV-VACPLL 240
Query: 294 SDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGA 353
SD+G NV APE PA+ FL EWF + G GLRLC+H+GCGRPETR HEFRRCSVCG
Sbjct: 241 SDYGYNVAAPEVQPANWFLREWFESGKGKLEEGLRLCAHIGCGRPETRPHEFRRCSVCGK 300
Query: 354 VNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVD 390
VNYCSR CQALDWKLRHK +C+P E + +G VD
Sbjct: 301 VNYCSRGCQALDWKLRHKMECSPVEMY-----DGGVD 332
>gi|356516551|ref|XP_003526957.1| PREDICTED: F-box protein At1g67340-like [Glycine max]
Length = 348
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 228/344 (66%), Positives = 258/344 (75%), Gaps = 21/344 (6%)
Query: 54 KSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAI 113
K DLF+ LPDDLVV IL KLS+TA PSDF+N+++TCKR+N L+ LVLSKA K FAI
Sbjct: 11 KHDLFECLPDDLVVLILSKLSATASSPSDFINIILTCKRLNRSGLHRLVLSKAGPKVFAI 70
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
K NWSE AH FLK C +AGNV+ACYTLGMIRFYCL+NRGSG SLMAKAA+ HA ALYS
Sbjct: 71 KPKNWSEYAHSFLKHCVNAGNVDACYTLGMIRFYCLKNRGSGLSLMAKAAMKLHAPALYS 130
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LAVIQFNGSGGSK DKDLRAGVAL ARA+ LGHIDALRELGHCLQDGYGVRQN+ EGRR
Sbjct: 131 LAVIQFNGSGGSKRDKDLRAGVALSARASLLGHIDALRELGHCLQDGYGVRQNVTEGRRL 190
Query: 234 LVQANARELAAVLSSAA-----CQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGI 288
LVQAN RELA VL + A C + S LTW L R Q N+N V+ +
Sbjct: 191 LVQANVRELAYVLRAVAPPSRCCD--ESSSLLTWGAL---------RCQRKNNNA-VSSL 238
Query: 289 GCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRC 348
CPLLSD+G +V PE PA+ FL EWF + G GLRLC+H+GCGRPETR HEFRRC
Sbjct: 239 ACPLLSDYGYSVAPPEVQPANWFLREWFESGKGKLEEGLRLCAHIGCGRPETRPHEFRRC 298
Query: 349 SVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVD 392
SVCG VNYCSR CQALDWKLRHK +C+P E + G+G VDV+
Sbjct: 299 SVCGKVNYCSRGCQALDWKLRHKIECSPVEMY----GDGGVDVE 338
>gi|294461488|gb|ADE76305.1| unknown [Picea sitchensis]
Length = 382
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 225/381 (59%), Positives = 266/381 (69%), Gaps = 28/381 (7%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
M+T+RG CY P +R E +++K + + + FD
Sbjct: 1 MQTRRGTCYSSPAPA------APSTGCRRESKRRRENSPCHKRKRQV------RGNHFDD 48
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
+PDDL+VSIL KLSST+R P+DF++ L+TC+R + + VL KAS FA+KA++W E
Sbjct: 49 IPDDLLVSILSKLSSTSRRPADFISALLTCRRFHAMGFQPFVLVKASTSAFAVKASSWCE 108
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
+HRF+K CADAGNVEACYTLGMIRFYCLQNRG GASLMAKAAI+SHA ALYSLA+IQFN
Sbjct: 109 GSHRFIKKCADAGNVEACYTLGMIRFYCLQNRGRGASLMAKAAIASHASALYSLAIIQFN 168
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGG K DKDL+AGVALCARAAFLGHIDALRELGHCLQDGYG+ +N+AEGRRFLVQANAR
Sbjct: 169 GSGGGKKDKDLKAGVALCARAAFLGHIDALRELGHCLQDGYGISKNVAEGRRFLVQANAR 228
Query: 241 ELAAVLSSAACQGISTRSWLTWNPLSQPHPA-----PHPRNQNPNHNGHVTGIGCPLLSD 295
ELA+V + W + L PA NP + CPLLSD
Sbjct: 229 ELASVY--------PPQYWKSQQNLRALRPAWMKSLRTTTTLNPAFSSSSLR-ACPLLSD 279
Query: 296 FGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVN 355
FGCNVP PE+HPA++F+ EWF R P GLRLCS+ GCGRPETR+HEFRRCSVCG VN
Sbjct: 280 FGCNVPPPESHPAAKFMVEWFKKR--PPCAGLRLCSYAGCGRPETRKHEFRRCSVCGTVN 337
Query: 356 YCSRACQALDWKLRHKADCAP 376
YCSRACQALDWK+RHK C P
Sbjct: 338 YCSRACQALDWKIRHKHTCMP 358
>gi|357465071|ref|XP_003602817.1| F-box family protein [Medicago truncatula]
gi|355491865|gb|AES73068.1| F-box family protein [Medicago truncatula]
Length = 446
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/338 (59%), Positives = 234/338 (69%), Gaps = 36/338 (10%)
Query: 55 SDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIK 114
D F+ PDD+VV IL KLSSTA PS F N L+TCKR+N L + +VLSKA K A+K
Sbjct: 15 QDFFEDFPDDIVVFILTKLSSTASSPSHFFNTLLTCKRLNRLGFHPMVLSKAGPKVLAVK 74
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
NWSE++H FLK C + GNV+ACYTLGMIRFYCL+NR SG SL+AKAA+ HA ALYSL
Sbjct: 75 PKNWSENSHHFLKRCVNVGNVDACYTLGMIRFYCLRNRRSGVSLIAKAAMKLHAPALYSL 134
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
AVIQFNGSGG+K DKDLRAGVAL ARA+ LGHIDALRELGHCLQDGYGV+QN+ EGRR L
Sbjct: 135 AVIQFNGSGGTKQDKDLRAGVALSARASLLGHIDALRELGHCLQDGYGVKQNVTEGRRLL 194
Query: 235 VQANARELAAVL---------SSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHV 285
VQAN RE+ V + +C+GI +R +
Sbjct: 195 VQANVREILLVYRFLELASPSRAGSCEGILSR---------------------------L 227
Query: 286 TGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEF 345
+ + PL+ + NV PE HP + FL EWF + G GLRLC+H+GCGR ETR HEF
Sbjct: 228 SNMAVPLIKENRYNVKVPEVHPVNWFLREWFESGLGILEDGLRLCAHIGCGRAETRPHEF 287
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE 383
RRCSVCG VNYCSRACQALDWKLRHK +C+P E WLDE
Sbjct: 288 RRCSVCGKVNYCSRACQALDWKLRHKMECSPMEWWLDE 325
>gi|388508880|gb|AFK42506.1| unknown [Medicago truncatula]
Length = 351
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 200/338 (59%), Positives = 234/338 (69%), Gaps = 36/338 (10%)
Query: 55 SDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIK 114
D F+ PDD+VV IL KLSSTA PS F N L+TCKR+N L + +VLSKA K A+K
Sbjct: 15 QDFFEDFPDDIVVFILTKLSSTASSPSHFFNTLLTCKRLNRLGFHPMVLSKAGPKVLAVK 74
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
NWSE++H FLK C + GNV+ACYTLGMIRFYCL+NR SG SL+AKAA+ HA ALYSL
Sbjct: 75 PKNWSENSHHFLKRCVNVGNVDACYTLGMIRFYCLRNRRSGVSLIAKAAMKLHAPALYSL 134
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
AVIQFNGSGG+K DKDLRAGVAL ARA+ LGHIDALRELGHCLQDGYGV+QN+ EGRR L
Sbjct: 135 AVIQFNGSGGTKQDKDLRAGVALSARASLLGHIDALRELGHCLQDGYGVKQNVTEGRRLL 194
Query: 235 VQANARELAAVL---------SSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHV 285
VQAN RE+ V + +C+GI +R +
Sbjct: 195 VQANVREILLVYRFLELASPSRAGSCEGILSR---------------------------L 227
Query: 286 TGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEF 345
+ + PL+ + NV PE HP + FL EWF + G GLRLC+H+GCGR ETR HEF
Sbjct: 228 SNMAVPLIKENRYNVKVPEVHPVNWFLREWFESGLGILEDGLRLCAHIGCGRAETRPHEF 287
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE 383
RRCSVCG VNYCSRACQALDWKLRHK +C+P E WLDE
Sbjct: 288 RRCSVCGKVNYCSRACQALDWKLRHKMECSPMEWWLDE 325
>gi|15240674|ref|NP_199856.1| MYND-type zinc finger protein [Arabidopsis thaliana]
gi|75171164|sp|Q9FK27.1|FB342_ARATH RecName: Full=F-box protein At5g50450
gi|9758927|dbj|BAB09464.1| unnamed protein product [Arabidopsis thaliana]
gi|20466554|gb|AAM20594.1| putative protein [Arabidopsis thaliana]
gi|23198134|gb|AAN15594.1| putative protein [Arabidopsis thaliana]
gi|332008564|gb|AED95947.1| MYND-type zinc finger protein [Arabidopsis thaliana]
Length = 336
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 197/346 (56%), Positives = 244/346 (70%), Gaps = 38/346 (10%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
F+ L DDL++SIL KL+++A PSDF+ VL TCKR+N L L+ LVLSKA +T A+ A
Sbjct: 22 FEDLHDDLIISILRKLATSASSPSDFLTVLSTCKRLNRLGLHPLVLSKAGTQTLAVTAEK 81
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
WS+S+H+FLKLC +AGN++A Y+LGMIRFYCLQN SGASLMAKAAI SHA ALYSL+VI
Sbjct: 82 WSDSSHKFLKLCVNAGNIDASYSLGMIRFYCLQNPVSGASLMAKAAIKSHAPALYSLSVI 141
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
QFNGSGGSK DK+LRAGVALCAR+A+LGH+DALRELGHCLQDGYGV ++++EGRR L+QA
Sbjct: 142 QFNGSGGSKTDKNLRAGVALCARSAYLGHVDALRELGHCLQDGYGVPRDVSEGRRLLIQA 201
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
NARELA S RS+L+ ++ + N +T +
Sbjct: 202 NARELAC----------SLRSYLSL--------------KSGDENETLTDLSV------- 230
Query: 298 CNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYC 357
VP E HP +RFL EWF++ GLR+CSH GCGRPETR HEFRRCSVCG VNYC
Sbjct: 231 --VPVQEIHPVNRFLKEWFSSGRVDLAEGLRMCSHGGCGRPETRAHEFRRCSVCGKVNYC 288
Query: 358 SRACQALDWKLRHKADCAPAERWLDEEGN-GD----VDVDLNDAEG 398
SR CQALDW+ +HK +C P + W+ GD V V+++D G
Sbjct: 289 SRGCQALDWRAKHKVECTPLDLWVAAAAEIGDDGEAVAVEIDDNHG 334
>gi|297795845|ref|XP_002865807.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297311642|gb|EFH42066.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 311
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 197/343 (57%), Positives = 236/343 (68%), Gaps = 39/343 (11%)
Query: 39 TTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLAL 98
T KK R + + FD L DDL++SIL KL+S+A PSDF+ VL TCKR+N L L
Sbjct: 2 THLNKKQRLEKN----HNHFDNLHDDLIISILRKLASSASSPSDFLTVLSTCKRLNRLGL 57
Query: 99 NSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL 158
LVLSKA +T A+ A WS+S+H+FLKLC +AGN++A Y LGMIRFYCLQN SGASL
Sbjct: 58 QPLVLSKAGTQTLAVTAEKWSDSSHKFLKLCVNAGNIDASYALGMIRFYCLQNPVSGASL 117
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
MAKAAI SHA ALYSL+VIQFNGSGGSK DK+LRAGV LCAR+A+LGH+DALRELGHCLQ
Sbjct: 118 MAKAAIKSHAPALYSLSVIQFNGSGGSKTDKNLRAGVTLCARSAYLGHVDALRELGHCLQ 177
Query: 219 DGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQN 278
DGYGV ++++EGRR L+QANARELA S RS+L+
Sbjct: 178 DGYGVPRDVSEGRRLLIQANARELAC----------SLRSYLSLK--------------- 212
Query: 279 PNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRP 338
G +L+D VP E HP +RFL EWF++ GLR+CSH GCGRP
Sbjct: 213 ---------CGDEILTDLSV-VPVQEIHPVNRFLKEWFSSGRVDLAEGLRMCSHGGCGRP 262
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
ETR HEFRRCSVCG VNYCSR CQALDW+ +HK +C P + W+
Sbjct: 263 ETRAHEFRRCSVCGKVNYCSRGCQALDWRAKHKIECTPLDLWI 305
>gi|357131579|ref|XP_003567414.1| PREDICTED: F-box protein At1g67340-like [Brachypodium distachyon]
Length = 386
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 211/401 (52%), Positives = 264/401 (65%), Gaps = 46/401 (11%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVP--VNDKKRRG---FSSGEPKTTYRKKARFSPDVAGKS 55
MRT+ G Y C G+ P V K++R ++ ++ ++ R +P +
Sbjct: 1 MRTRSGSLY----SPCGGREPAPPAVGQKRKRSPAQSAAAADGESHGRRKRQAPGL---- 52
Query: 56 DLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKA 115
D D LPDDLV+SIL KL+++A PSD ++V +TCKR+NGL +V + AS + A+KA
Sbjct: 53 DYLDELPDDLVLSILSKLAASASSPSDLLSVHLTCKRLNGLGHQDMVFANASPASLAVKA 112
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLA 175
WSE A RFLK CADAGN+EACY LGMIRFYCL +R GA+++AKAA+ H ALYSLA
Sbjct: 113 AAWSEPAQRFLKRCADAGNLEACYILGMIRFYCLGSRSGGAAMLAKAAVGGHPAALYSLA 172
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
VIQFNGSGG+K+D+DLRAG ALCAR+A LGH+DALRELGHCLQDGYGVR++ AEGRR LV
Sbjct: 173 VIQFNGSGGAKSDRDLRAGAALCARSAALGHVDALRELGHCLQDGYGVRRDAAEGRRLLV 232
Query: 236 QANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSD 295
ANAREL+ L+SA + ++ L P + G G GCPLLSD
Sbjct: 233 AANARELSLALASANHRAFAS---LPLVPGAN--------------AGAGAGAGCPLLSD 275
Query: 296 FGCNVPAPEAHPASRFLAEWFAARG---------------GTPGPGLRLCSHVGCGRPET 340
FG ++P E H A+ F+++W+A+RG G G LRLCSH+ CGR ET
Sbjct: 276 FGWSLPEAEPHTANLFMSDWWASRGVQATAKKSSGTDEAAGAEGE-LRLCSHLRCGRKET 334
Query: 341 RRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
RRHEFRRCSVCGA NYCSRACQALDWK HK C P +RWL
Sbjct: 335 RRHEFRRCSVCGAANYCSRACQALDWKRAHKVQCVPMDRWL 375
>gi|302773327|ref|XP_002970081.1| hypothetical protein SELMODRAFT_171141 [Selaginella moellendorffii]
gi|300162592|gb|EFJ29205.1| hypothetical protein SELMODRAFT_171141 [Selaginella moellendorffii]
Length = 398
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 209/382 (54%), Positives = 261/382 (68%), Gaps = 26/382 (6%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+RG YP D + + P K+R P T + A+ A FD+
Sbjct: 1 MRTRRGN-YPSPSDG-ITFSIKPSVQKRRL------PSGTSHRPAK----AARAPCFFDS 48
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
L DD++++I+ +SSTA P+D VN ++TC+R A N VL+ AS A+KA++WS+
Sbjct: 49 LHDDVLITIITAVSSTASSPADLVNTMLTCRRFCAAATNPHVLANASTAALAVKASSWSD 108
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
+ RFLK CADAGN+EACYTLGMIRFYCL NRG GASLMAKAA++SHA AL+SLAVIQFN
Sbjct: 109 GSSRFLKQCADAGNIEACYTLGMIRFYCLHNRGGGASLMAKAAMASHAAALHSLAVIQFN 168
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGGS+ DKDL+AGVALCARAA LGH+DA+RELGHCLQDGYGV QN+ +GR L++AN R
Sbjct: 169 GSGGSRKDKDLKAGVALCARAASLGHVDAMRELGHCLQDGYGVAQNVVKGRGLLLEANTR 228
Query: 241 ELAAVLSSAACQGISTRSWLTWN--PLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGC 298
E AA ++ + + + LT ++ H +H + GC LLSDFGC
Sbjct: 229 EAAAAVAQSPRRFMEAALHLTAKGGAMACLH----------HHLHYYASGGCSLLSDFGC 278
Query: 299 NVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCS 358
NVP P+ H A+RF+ +WF A+ P GLRLCSH CGRPE+RRHEFRRCS CG+VNYCS
Sbjct: 279 NVPPPKLHVANRFMVDWFVAK--PPESGLRLCSHSNCGRPESRRHEFRRCSACGSVNYCS 336
Query: 359 RACQALDWKLRHKADCAPAERW 380
RACQALDWK+RHK DC P W
Sbjct: 337 RACQALDWKIRHKCDCTPVPDW 358
>gi|302807084|ref|XP_002985273.1| hypothetical protein SELMODRAFT_121825 [Selaginella moellendorffii]
gi|300147101|gb|EFJ13767.1| hypothetical protein SELMODRAFT_121825 [Selaginella moellendorffii]
Length = 399
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 208/382 (54%), Positives = 261/382 (68%), Gaps = 26/382 (6%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+RG YP D + + P K+R P T + A+ A FD+
Sbjct: 1 MRTRRGN-YPSPSDG-ITFSIKPSVQKRRL------PSGTSDRPAK----AARAPCFFDS 48
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
L DD++++I+ +SSTA P+D VN ++TC+R A N VL+ AS A+KA++WS+
Sbjct: 49 LHDDVLITIITAVSSTASSPADLVNTMLTCRRFCAAATNPHVLANASTAALAVKASSWSD 108
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
+ RFLK CADAGN+EACYT+GMIRFYCL NRG GASLMAKAA++SHA AL+SLAVIQFN
Sbjct: 109 GSSRFLKQCADAGNIEACYTVGMIRFYCLHNRGGGASLMAKAAMASHAAALHSLAVIQFN 168
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGGS+ DKDL+AGVALCARAA LGH+DA+RELGHCLQDGYGV QN+ +GR L++AN R
Sbjct: 169 GSGGSRKDKDLKAGVALCARAASLGHVDAMRELGHCLQDGYGVAQNVVKGRGLLLEANTR 228
Query: 241 ELAAVLSSAACQGISTRSWLTWN--PLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGC 298
E AA ++ + + + LT ++ H +H + GC LLSDFGC
Sbjct: 229 EAAAAVAQSPRRFMEAALHLTAKGGAMACLH----------HHLHYYASGGCSLLSDFGC 278
Query: 299 NVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCS 358
NVP P+ H A+RF+ +WF A+ P GLRLCSH CGRPE+RRHEFRRCS CG+VNYCS
Sbjct: 279 NVPPPKLHVANRFMVDWFVAK--PPESGLRLCSHSNCGRPESRRHEFRRCSACGSVNYCS 336
Query: 359 RACQALDWKLRHKADCAPAERW 380
RACQALDWK+RHK DC P W
Sbjct: 337 RACQALDWKIRHKCDCTPVPDW 358
>gi|326495470|dbj|BAJ85831.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 378
Score = 361 bits (926), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 216/397 (54%), Positives = 260/397 (65%), Gaps = 46/397 (11%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+ G YP C PV +KR+ S RK+ +AG D D
Sbjct: 1 MRTRSGSLYP-----C--DPAAPVAGQKRKRSPSAAADDGCRKR------LAGGPDYLDG 47
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
+PDDLV+SI KL+++A PSD ++V +TCKR+NGL +V +KAS + A+KA WSE
Sbjct: 48 IPDDLVLSIFSKLAASASSPSDLLSVHLTCKRLNGLGQQDMVFAKASPASLAVKAAAWSE 107
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
RFLK CADAGN+EACY LGMIRFYCL +R GA+L+AKAA+ H ALYSLAVIQFN
Sbjct: 108 PVQRFLKRCADAGNLEACYILGMIRFYCLGSRSGGAALLAKAAVGGHPAALYSLAVIQFN 167
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGG+K+D+DLRAG ALCAR+A LGH+DALRELGHCLQDGYGVR++ AEGRR LV ANAR
Sbjct: 168 GSGGAKSDRDLRAGAALCARSAALGHVDALRELGHCLQDGYGVRRDPAEGRRLLVAANAR 227
Query: 241 ELAAVLSSAACQGISTRSWLTWNPLSQPHP-APHPRNQNPNHNGHVTGIGCPLLSDFGCN 299
EL+ L++AA S +P A P G T PLLSDFG +
Sbjct: 228 ELSLALAAAA--------------ASAAYPFASLPIGAVAGGGGVSTS---PLLSDFGWS 270
Query: 300 VPAPEAHPASRFLAEWFAAR--------------GGTPGPG-LRLCSHVGCGRPETRRHE 344
+P E H A++F+++W+AAR GG G G LRLCSH+ CGR ETRRHE
Sbjct: 271 LPEAEPHTANQFMSDWWAARGVQACAKKPDAAATGGDGGEGELRLCSHMRCGRKETRRHE 330
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
FRRCSVCGA NYCSRACQALDWK HKA C P +RWL
Sbjct: 331 FRRCSVCGAANYCSRACQALDWKRAHKAQCVPMDRWL 367
>gi|242059707|ref|XP_002458999.1| hypothetical protein SORBIDRAFT_03g044140 [Sorghum bicolor]
gi|241930974|gb|EES04119.1| hypothetical protein SORBIDRAFT_03g044140 [Sorghum bicolor]
Length = 394
Score = 356 bits (914), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 213/400 (53%), Positives = 261/400 (65%), Gaps = 40/400 (10%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGF--SSGEPKTTYRKKARFSPDVAGKSDLF 58
MRT+ G Y G+ V V K++R +GE R+K +AG D
Sbjct: 1 MRTRSGSRYSNG-----GEVVALVGQKRKRSALAVAGECACGGRRK-----RLAGGPDYL 50
Query: 59 DALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNW 118
DALPDDLV+SIL KL++++ PSD ++V +TCKR+N L + +V +KAS + A+KA W
Sbjct: 51 DALPDDLVLSILTKLAASSSAPSDLLSVHLTCKRLNELGGHDMVFAKASPASLAVKAAAW 110
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
SE A RFLK CADAGN+EACY LGMIRFYC +R GA+L+A+AA+ HA ALYSLAVIQ
Sbjct: 111 SEPAQRFLKRCADAGNLEACYILGMIRFYCQGSRSGGATLLARAAVGGHAAALYSLAVIQ 170
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
NGSGG+K+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR++ AEGRR LV AN
Sbjct: 171 VNGSGGAKSDRDLRAGAALCARAAALGHVDALRELGHCLQDGYGVRRDPAEGRRLLVAAN 230
Query: 239 ARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGC 298
AREL LS+A ++R +T P + A CPLLSDFG
Sbjct: 231 ARELTLALSAA----TASRHAVTALPFAAAAAAAAAAGGVGGGG-------CPLLSDFGW 279
Query: 299 NVPAPEAHPASRFLAEWFAAR-----------------GGTPGPGLRLCSHVGCGRPETR 341
++P E H A++F+ +W+A+R G+ G LRLCSHV CGR ETR
Sbjct: 280 SLPEAEPHAANQFMVDWWASRCAAQAGGKKPDGDSSGVDGSDGAELRLCSHVRCGRRETR 339
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
RHEFRRCSVCGA NYCSRACQALDWK HKA C +RWL
Sbjct: 340 RHEFRRCSVCGAANYCSRACQALDWKRAHKAQCVAMDRWL 379
>gi|19386803|dbj|BAB86182.1| OJ1485_B09.11 [Oryza sativa Japonica Group]
Length = 388
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 205/353 (58%), Positives = 243/353 (68%), Gaps = 38/353 (10%)
Query: 53 GKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFA 112
G D D LPDDLV+++L KL+++A PSD ++V +TCKR+NGL + +V +KAS + A
Sbjct: 53 GGPDYLDELPDDLVLAVLSKLAASASSPSDLLSVHLTCKRLNGLGRHDMVFAKASPASLA 112
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALY 172
+KA +WSE RFLKLCADAGN+EACY LGMIRFYCL NR GA+L+A+AA+ HA ALY
Sbjct: 113 VKAASWSEPVQRFLKLCADAGNLEACYILGMIRFYCLGNRSGGAALLARAAVGGHAAALY 172
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
SLAVIQFNGSGG+K+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR++ AEGRR
Sbjct: 173 SLAVIQFNGSGGAKSDRDLRAGAALCARAAALGHVDALRELGHCLQDGYGVRRDPAEGRR 232
Query: 233 FLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPL 292
FLV ANAREL L++AA P + IGCPL
Sbjct: 233 FLVAANARELTLALAAAASH----------RPFA---------ALPLAGGAAAGAIGCPL 273
Query: 293 LSDFGCNVPAPEAHPASRFLAEWFAARG--------GTPGPG-----------LRLCSHV 333
LSDFG ++P E HPA+ F+A+W+A+RG G P LRLCSHV
Sbjct: 274 LSDFGWSLPEAEPHPANLFMADWWASRGVQATAKKPGLEAPAAATGDSDGGGELRLCSHV 333
Query: 334 GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN 386
CGR ETRRHEFRRCSVCGA NYCSRACQALDWK HKA C P +RWL G
Sbjct: 334 RCGRRETRRHEFRRCSVCGAANYCSRACQALDWKRAHKAQCVPMDRWLLAAGE 386
>gi|297598207|ref|NP_001045230.2| Os01g0921800 [Oryza sativa Japonica Group]
gi|57899437|dbj|BAD88375.1| hypothetical protein [Oryza sativa Japonica Group]
gi|255674009|dbj|BAF07144.2| Os01g0921800 [Oryza sativa Japonica Group]
Length = 369
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 205/353 (58%), Positives = 243/353 (68%), Gaps = 38/353 (10%)
Query: 53 GKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFA 112
G D D LPDDLV+++L KL+++A PSD ++V +TCKR+NGL + +V +KAS + A
Sbjct: 34 GGPDYLDELPDDLVLAVLSKLAASASSPSDLLSVHLTCKRLNGLGRHDMVFAKASPASLA 93
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALY 172
+KA +WSE RFLKLCADAGN+EACY LGMIRFYCL NR GA+L+A+AA+ HA ALY
Sbjct: 94 VKAASWSEPVQRFLKLCADAGNLEACYILGMIRFYCLGNRSGGAALLARAAVGGHAAALY 153
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
SLAVIQFNGSGG+K+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR++ AEGRR
Sbjct: 154 SLAVIQFNGSGGAKSDRDLRAGAALCARAAALGHVDALRELGHCLQDGYGVRRDPAEGRR 213
Query: 233 FLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPL 292
FLV ANAREL L++AA P + IGCPL
Sbjct: 214 FLVAANARELTLALAAAASH----------RPFA---------ALPLAGGAAAGAIGCPL 254
Query: 293 LSDFGCNVPAPEAHPASRFLAEWFAARG--------GTPGPG-----------LRLCSHV 333
LSDFG ++P E HPA+ F+A+W+A+RG G P LRLCSHV
Sbjct: 255 LSDFGWSLPEAEPHPANLFMADWWASRGVQATAKKPGLEAPAAATGDSDGGGELRLCSHV 314
Query: 334 GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN 386
CGR ETRRHEFRRCSVCGA NYCSRACQALDWK HKA C P +RWL G
Sbjct: 315 RCGRRETRRHEFRRCSVCGAANYCSRACQALDWKRAHKAQCVPMDRWLLAAGE 367
>gi|357450275|ref|XP_003595414.1| F-box family protein [Medicago truncatula]
gi|355484462|gb|AES65665.1| F-box family protein [Medicago truncatula]
Length = 344
Score = 347 bits (889), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 178/321 (55%), Positives = 212/321 (66%), Gaps = 39/321 (12%)
Query: 56 DLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKA 115
D F+ PDD+VV IL KLSSTA PS F N L+TCKR+N L + +VLSKA K AIK
Sbjct: 16 DFFEDFPDDIVVFILTKLSSTASSPSHFFNTLLTCKRLNRLGFHPMVLSKAGPKVLAIKP 75
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLA 175
NWSE++H FLK C + GNV+ACYTLGMIRFYCL+NR SG SL+AKAA+ HA ALYSLA
Sbjct: 76 KNWSENSHHFLKRCVNVGNVDACYTLGMIRFYCLRNRRSGVSLIAKAAMKLHAPALYSLA 135
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
VIQFNGSGG+K DKDLRAGVAL ARA+ LGHIDALRELGHCLQDGYGV+QN+ EGRR LV
Sbjct: 136 VIQFNGSGGTKQDKDLRAGVALSARASLLGHIDALRELGHCLQDGYGVKQNVTEGRRLLV 195
Query: 236 QANARELAAVL---------SSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVT 286
QAN RE+ V + +C+GI +R ++
Sbjct: 196 QANVREILLVYRFLELASPSRAGSCEGILSR---------------------------LS 228
Query: 287 GIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFR 346
+ PL+ + NV PE HP + FL EWF + G GLRLC+H+GCGR ETR HEFR
Sbjct: 229 NMAVPLIKENRYNVKVPEVHPVNWFLREWFESGLGILEDGLRLCAHIGCGRAETRPHEFR 288
Query: 347 RCSVC---GAVNYCSRACQAL 364
RCSVC G ++ C + Q
Sbjct: 289 RCSVCGISGVISVCVQLVQIF 309
>gi|414878966|tpg|DAA56097.1| TPA: hypothetical protein ZEAMMB73_508510 [Zea mays]
Length = 390
Score = 344 bits (882), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 215/400 (53%), Positives = 258/400 (64%), Gaps = 46/400 (11%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSS--GEPKTTYRKKARFSPDVAGKSDLF 58
MRT+ G Y G+ V V K++R S+ GE R+K AG+ D
Sbjct: 1 MRTRSGSRYSNG-----GEVVALVGQKRKRSPSAVAGECDCGGRRK-----RPAGEPDYL 50
Query: 59 DALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNW 118
DALPDDLV+SIL KL++ + PSD ++V +TCKR+N L + +V +KAS + A+KA W
Sbjct: 51 DALPDDLVLSILSKLAAASSAPSDLLSVHLTCKRLNELGSHDMVFAKASPASLAVKAAAW 110
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
SE A RFLK CADAGN+EACY LGMIRFYCL +R GA+L+A+AA+ HA ALYSLAVIQ
Sbjct: 111 SEPAQRFLKRCADAGNLEACYNLGMIRFYCLGSRSGGAALLARAAVGGHAAALYSLAVIQ 170
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
FNGSGG+K+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR++ AEGRR LV AN
Sbjct: 171 FNGSGGAKSDRDLRAGAALCARAASLGHVDALRELGHCLQDGYGVRRDPAEGRRLLVAAN 230
Query: 239 ARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGC 298
AREL LS+AA + G GCPLLSDFG
Sbjct: 231 ARELTLALSAAASRHAVAAL-----------------PFAAAAGVVGGGGGCPLLSDFGW 273
Query: 299 NVPAPEAHPASRFLAEWFAARGGTPGPG-----------------LRLCSHVGCGRPETR 341
++P E H A++F+ +W+A+R G LRLCSHV CGR ETR
Sbjct: 274 SLPEAEPHAANQFMVDWWASRCAAQAGGKKDGDGSGVDGDGDGAELRLCSHVRCGRRETR 333
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
RHEFRRCSVCGA NYCSRACQALDWK HKA C P +RWL
Sbjct: 334 RHEFRRCSVCGAANYCSRACQALDWKRAHKAQCVPMDRWL 373
>gi|255639003|gb|ACU19802.1| unknown [Glycine max]
Length = 216
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 169/207 (81%), Positives = 187/207 (90%), Gaps = 1/207 (0%)
Query: 42 RKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSL 101
RK+ + SPD D FD+LPDDLV+SILCKLSSTA PSDF++VLITCKR+N L L+SL
Sbjct: 4 RKRQKTSPDKTS-DDFFDSLPDDLVLSILCKLSSTATSPSDFISVLITCKRLNSLGLHSL 62
Query: 102 VLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAK 161
VLSKAS KTF++KA NW +S HRFLK CADAGN+EACYTLGMIRFYCLQNRGSGASLMAK
Sbjct: 63 VLSKASHKTFSVKAKNWCDSLHRFLKHCADAGNIEACYTLGMIRFYCLQNRGSGASLMAK 122
Query: 162 AAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 221
AA++SHA ALYSLAVIQFNGSGG+KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY
Sbjct: 123 AAMNSHAPALYSLAVIQFNGSGGTKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGY 182
Query: 222 GVRQNIAEGRRFLVQANARELAAVLSS 248
GV+ N+ EGRRFLVQANARELAAVLS+
Sbjct: 183 GVKLNVTEGRRFLVQANARELAAVLSA 209
>gi|125528903|gb|EAY77017.1| hypothetical protein OsI_04973 [Oryza sativa Indica Group]
Length = 389
Score = 342 bits (878), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 204/350 (58%), Positives = 242/350 (69%), Gaps = 38/350 (10%)
Query: 56 DLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKA 115
D D LPDDLV+++L KL+++A PSD ++V +TCKR+NGL + +V +KAS + A+KA
Sbjct: 57 DYLDELPDDLVLAVLSKLAASASSPSDLLSVHLTCKRLNGLGRHDMVFAKASPASLAVKA 116
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLA 175
+WSE RFLKLCADAGN+EACY LGMIRFYCL NR GA+L+A+AA+ HA ALYSLA
Sbjct: 117 ASWSEPVQRFLKLCADAGNLEACYILGMIRFYCLGNRSGGAALLARAAVGGHAAALYSLA 176
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
VIQFNGSGG+K+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR++ AEGRRFLV
Sbjct: 177 VIQFNGSGGAKSDRDLRAGAALCARAAALGHVDALRELGHCLQDGYGVRRDPAEGRRFLV 236
Query: 236 QANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSD 295
ANAREL L++AA P + IGCPLLSD
Sbjct: 237 AANARELTLALAAAASH----------RPFA---------ALPLAGGAGAGAIGCPLLSD 277
Query: 296 FGCNVPAPEAHPASRFLAEWFAARG--------GTPGPG-----------LRLCSHVGCG 336
FG ++P E HPA+ F+A+W+A+RG G P LRLCSHV CG
Sbjct: 278 FGWSLPEAEPHPANLFMADWWASRGVQATAKKPGLEAPAAATGDSDGGGELRLCSHVRCG 337
Query: 337 RPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN 386
R ETRRHEFRRCSVCGA NYCSRACQALDWK HKA C P +RWL G
Sbjct: 338 RRETRRHEFRRCSVCGAANYCSRACQALDWKRAHKAQCVPMDRWLLAAGE 387
>gi|357163051|ref|XP_003579609.1| PREDICTED: F-box protein At1g67340-like [Brachypodium distachyon]
Length = 396
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 218/414 (52%), Positives = 265/414 (64%), Gaps = 59/414 (14%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVV---PVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDL 57
MRT+RG CY C G P +++R ++ E VAG D+
Sbjct: 1 MRTRRGACYS-----CHGAEAASDGPEMHRRKRRKTATEGSLAAVSGPPGCGAVAG--DM 53
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
F+ LPDDLVVSIL ++++A P+D ++TCKR L + +VL++ S + A++A +
Sbjct: 54 FEELPDDLVVSILADVAASAGSPADLAGAILTCKRFRQLGQSKVVLARVSSRCLAVRAKS 113
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
WS+SAHRFL+ CADAGN++ACY LGMIRFYCL +RGSGA+LMA AA+ H +ALYSLAVI
Sbjct: 114 WSDSAHRFLQRCADAGNLDACYLLGMIRFYCLGSRGSGAALMAAAAVGGHREALYSLAVI 173
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
QFNGSGGSK D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR+++ +GRR L+QA
Sbjct: 174 QFNGSGGSKEDRDLRAGAALCARAASLGHVDALRELGHCLQDGYGVRRSVLDGRRLLIQA 233
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHV--TGIGCPLLSD 295
NARELAA ++++A +T G V TG LLSD
Sbjct: 234 NARELAAAVTASASLLRAT------------------------GGGKVSRTGRHSCLLSD 269
Query: 296 FGCN----VPAPEAHPASRFLAEWFAARG-GTPGPG------------------LRLCSH 332
FGC A EAH A+RFLAEWFAAR G PG LRLCSH
Sbjct: 270 FGCRAAAAAAAGEAHAANRFLAEWFAARPLGPPGAAESPAAAQEEDGSGSGSGGLRLCSH 329
Query: 333 VGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN 386
CGRPETRRHEFRRCSVCG VNYCSRACQAL WK+ HKA+C P +RWLD N
Sbjct: 330 GLCGRPETRRHEFRRCSVCGVVNYCSRACQALHWKMAHKAECTPMDRWLDANPN 383
>gi|115458036|ref|NP_001052618.1| Os04g0385600 [Oryza sativa Japonica Group]
gi|32489984|emb|CAE05014.1| OSJNBa0044M19.1 [Oryza sativa Japonica Group]
gi|38347478|emb|CAE05298.2| OSJNBa0084N21.16 [Oryza sativa Japonica Group]
gi|113564189|dbj|BAF14532.1| Os04g0385600 [Oryza sativa Japonica Group]
Length = 395
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 210/404 (51%), Positives = 259/404 (64%), Gaps = 50/404 (12%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+RG CY C R K+RR G + A +D+F+
Sbjct: 1 MRTRRGACYSPAS--CQDGR-----RKRRRIAGGGGEGSAAAAAAVAGGAEGPANDMFEE 53
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
LPDDLVVSIL ++++AR P D ++TCKR L + +VL++AS + A++A WS+
Sbjct: 54 LPDDLVVSILADVAASARSPGDLAGAMLTCKRFRELGQSKVVLARASPRCLAVRAKAWSD 113
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
+AHRFL+ CADAGN++ACY LGMIRFYCL +RGSGA+LMA AA+ H +ALYSLAVIQFN
Sbjct: 114 AAHRFLQRCADAGNLDACYLLGMIRFYCLGSRGSGAALMAAAAVGGHREALYSLAVIQFN 173
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGGSK+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR+++ +GRR L+QANAR
Sbjct: 174 GSGGSKDDRDLRAGAALCARAASLGHVDALRELGHCLQDGYGVRRSVLDGRRLLIQANAR 233
Query: 241 ELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNV 300
ELAA ++++A + + S+ H LLSDFGC+
Sbjct: 234 ELAAAVAASA-SLLRAATGKPAAAASRRHSC--------------------LLSDFGCHA 272
Query: 301 PAP----EAHPASRFLAEWFAAR------------------GGTPGPGLRLCSHVGCGRP 338
AP EAH A+RFL +WFA+R GLRLCSH CGRP
Sbjct: 273 AAPKAGGEAHAANRFLVDWFASRPLAGSTAAAAAAPTPGSAAEDEAAGLRLCSHALCGRP 332
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
ETRRHEFRRCSVCG VNYCSRACQAL WK HKA+C P +RWLD
Sbjct: 333 ETRRHEFRRCSVCGVVNYCSRACQALHWKTAHKAECTPMDRWLD 376
>gi|242075442|ref|XP_002447657.1| hypothetical protein SORBIDRAFT_06g011760 [Sorghum bicolor]
gi|241938840|gb|EES11985.1| hypothetical protein SORBIDRAFT_06g011760 [Sorghum bicolor]
Length = 411
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 197/355 (55%), Positives = 240/355 (67%), Gaps = 50/355 (14%)
Query: 56 DLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKA 115
D+F+ LPDDLVVSIL ++++A P+D ++TCKR L +VL++AS + A++A
Sbjct: 50 DMFEELPDDLVVSILRDVAASAGSPADLAGAMLTCKRFRELGQTKVVLARASPRCLAVRA 109
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQ-NRGSGASLMAKAAISSHAQALYSL 174
WS+ AHRFL+ CADAGN+EACY LGMIRFYCL +RGSGA+LMA AA+ H +ALYSL
Sbjct: 110 KAWSDDAHRFLQRCADAGNLEACYLLGMIRFYCLGGSRGSGAALMAAAAVGGHREALYSL 169
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
AVIQFNGSGG K+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR+++ +GRR L
Sbjct: 170 AVIQFNGSGGGKDDRDLRAGAALCARAASLGHVDALRELGHCLQDGYGVRRSVLDGRRLL 229
Query: 235 VQANARELAAVLSSA---ACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCP 291
+QANARELAA ++++ A G+ S + PR +
Sbjct: 230 IQANARELAAAVTASPPVAAAGVG----------SGKTTSAAPRRHS------------C 267
Query: 292 LLSDFGCNVPAPEAHPASRFLAEWFAAR------------GGTP------------GPGL 327
LLSDFGC EAH A+RFL +WFA+R G P G L
Sbjct: 268 LLSDFGCRAAGGEAHAANRFLVDWFASRPLGAPAAAAPSGNGNPAAASPEDEEEEAGGAL 327
Query: 328 RLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
RLCS CGRPETRRHEFRRCSVCG VNYCSRACQAL WK+ HKA+C PA+RWLD
Sbjct: 328 RLCSQALCGRPETRRHEFRRCSVCGVVNYCSRACQALHWKMAHKAECTPADRWLD 382
>gi|302775612|ref|XP_002971223.1| hypothetical protein SELMODRAFT_66305 [Selaginella moellendorffii]
gi|300161205|gb|EFJ27821.1| hypothetical protein SELMODRAFT_66305 [Selaginella moellendorffii]
Length = 289
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 190/323 (58%), Positives = 221/323 (68%), Gaps = 34/323 (10%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
FD +PDDL+V I L STA P+D N+++ C+R A + VL++ S + FA+KA +
Sbjct: 1 FDFVPDDLLVDIAMALVSTASSPADLFNMMLICRRFCAAATHPQVLARVSPQAFAVKAGS 60
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
WS +HRFLK CAD GNVEA YTLGMIRFYCL N SGASLMAKAA++SHA AL+SLAVI
Sbjct: 61 WSAGSHRFLKQCADVGNVEALYTLGMIRFYCLNN-SSGASLMAKAAVASHAAALHSLAVI 119
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
FNGSGG + DK+L+AGVALC RAA LGH+DA+RELGHCLQDGYGV +N+ +GRR L++A
Sbjct: 120 HFNGSGGRRKDKNLKAGVALCMRAASLGHVDAIRELGHCLQDGYGVVKNVLQGRRLLLEA 179
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
NARE A + HP NG GC LLSDFG
Sbjct: 180 NAREAA----------------------AARHPV-----FKLLQNG-----GCALLSDFG 207
Query: 298 CNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYC 357
CNVP + H A++FL EWFA T GLRLCSH CGRPETRRHEFRRCS CG VNYC
Sbjct: 208 CNVPPAKVHVANKFLVEWFALH-PTSAAGLRLCSHANCGRPETRRHEFRRCSACGRVNYC 266
Query: 358 SRACQALDWKLRHKADCAPAERW 380
SRACQALDWKLRHK DC P W
Sbjct: 267 SRACQALDWKLRHKYDCIPVADW 289
>gi|297788564|ref|XP_002862364.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297307802|gb|EFH38622.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 305
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 173/294 (58%), Positives = 204/294 (69%), Gaps = 35/294 (11%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
FD L DDL++SIL KL+S+A PSDF+ VL TCKR+N L L LVLSKA +T A+ A
Sbjct: 17 FDDLHDDLIISILRKLASSASSPSDFLTVLSTCKRLNRLGLQPLVLSKAGTQTLAVTAEK 76
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
WS+S+H+FLKLC +AGN++A Y LGMIRFYCLQN SGASLMAKAAI SHA ALYSL+VI
Sbjct: 77 WSDSSHKFLKLCVNAGNIDASYALGMIRFYCLQNPVSGASLMAKAAIKSHAPALYSLSVI 136
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
QFNGSGGSK DK+LRAGV LCAR+A+LGH+DALRELGH LQDGYGV ++++EGRR L+QA
Sbjct: 137 QFNGSGGSKTDKNLRAGVTLCARSAYLGHVDALRELGHRLQDGYGVPRDVSEGRRLLIQA 196
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
ARELA S RS+L+ G +L+D
Sbjct: 197 YARELAC----------SLRSYLSLK------------------------YGDEILTDLS 222
Query: 298 CNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVC 351
VP E HP RFL EWF++ GLR+CSH GCGRPETR HEFRRCSVC
Sbjct: 223 V-VPVQEIHPVYRFLKEWFSSGRVDLSEGLRMCSHGGCGRPETRAHEFRRCSVC 275
>gi|413951485|gb|AFW84134.1| hypothetical protein ZEAMMB73_570285 [Zea mays]
Length = 382
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 204/376 (54%), Positives = 246/376 (65%), Gaps = 52/376 (13%)
Query: 35 GEPKTTYRKKARFSPDVAGK-------------SDLFDALPDDLVVSILCKLSSTARCPS 81
GE +K+ R + VAG+ D DALPDDLV+SIL K+++ + P+
Sbjct: 14 GEDALVGQKRKRAASAVAGECACCERRKRPADGPDYLDALPDDLVLSILTKVAADSSAPA 73
Query: 82 DFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTL 141
D ++V +TCKR+N L + +V +KAS + ++KA WSE+A RFLK ADAGN+EACY L
Sbjct: 74 DLLSVHLTCKRLNELGSHDMVFAKASPASLSVKAAAWSEAAQRFLKRSADAGNLEACYIL 133
Query: 142 GMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARA 201
GMIRFYCL +R GA+L+AKAA+ HA ALYSLAVIQFNGSGG+K+D+DLRAG ALCARA
Sbjct: 134 GMIRFYCLGSRSGGAALLAKAAVGGHAAALYSLAVIQFNGSGGAKSDRDLRAGAALCARA 193
Query: 202 AFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLT 261
A LGH+DALRELGHCLQDGYGVR++ AEGRR LV ANAREL LS+AA
Sbjct: 194 AALGHVDALRELGHCLQDGYGVRRDPAEGRRLLVSANARELTLALSAAA----------- 242
Query: 262 WNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAA--- 318
R+ G G GCPLLSDFG ++P E H A++F+ +W+ A
Sbjct: 243 ------------SRHAAGVVVGGGGGGGCPLLSDFGWSLPEAEPHAANQFMVDWWWASDR 290
Query: 319 -----------RGGTPGPG--LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALD 365
G G G LRLCSHV CGR ETRRHEFRRCSVCGA NYCSRACQALD
Sbjct: 291 CAAQAGGKITGESGVDGDGAELRLCSHVRCGRRETRRHEFRRCSVCGAANYCSRACQALD 350
Query: 366 WKLRHKADCAPAERWL 381
WK HK C P +RWL
Sbjct: 351 WKRAHKVQCVPVDRWL 366
>gi|326499025|dbj|BAK06003.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 383
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 205/371 (55%), Positives = 256/371 (69%), Gaps = 47/371 (12%)
Query: 36 EPKTTYRKKARFSPDVAGKS-------DLFDALPDDLVVSILCKLSSTARCPSDFVNVLI 88
EP+ RK+ R + + AG + D+F+ LPDDL++SIL ++++AR P+D +
Sbjct: 18 EPEMHRRKRRRTAMEAAGCAAAAAAVGDMFEDLPDDLLLSILADVAASARSPADLAGATM 77
Query: 89 TCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYC 148
TCKR L + +VL++ S + A++A +WS+SAHRFL+ CADAGN++ACY LGMIRFYC
Sbjct: 78 TCKRFRELGQSKVVLAEVSPRCLAVRAKSWSDSAHRFLQRCADAGNLDACYLLGMIRFYC 137
Query: 149 LQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHID 208
L +RGSGA+LMA AA+ H +ALYSLAVIQFNGSGGSK+D+DLRAG ALCARAA LGH+D
Sbjct: 138 LGSRGSGAALMAAAAVGGHREALYSLAVIQFNGSGGSKDDRDLRAGAALCARAASLGHVD 197
Query: 209 ALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQP 268
ALRELGHCLQDGYGVR+++ +GRR L+QANARELAA ++++A R+ + S+
Sbjct: 198 ALRELGHCLQDGYGVRRSLLDGRRLLIQANARELAAAVTTSASL---LRAAASSGKASRR 254
Query: 269 HPAPHPRNQNPNHNGHVTGIGCPLLSDFGCN---VPAPEAHPASRFLAEWFAAR--GGT- 322
H LLSDFGC A EAH A+RFL EWFA+R GG
Sbjct: 255 HSC--------------------LLSDFGCRAAVAAAGEAHAANRFLVEWFASRPLGGES 294
Query: 323 -----------PGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK 371
G GLRLCSH CGRPETRRHEFRRCSVCG VNYCSRACQAL WK+ HK
Sbjct: 295 SSPAAAPAPADDGGGLRLCSHALCGRPETRRHEFRRCSVCGVVNYCSRACQALHWKMAHK 354
Query: 372 ADCAPAERWLD 382
A+C P +RWLD
Sbjct: 355 AECTPMDRWLD 365
>gi|302756315|ref|XP_002961581.1| hypothetical protein SELMODRAFT_66287 [Selaginella moellendorffii]
gi|300170240|gb|EFJ36841.1| hypothetical protein SELMODRAFT_66287 [Selaginella moellendorffii]
Length = 289
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 189/323 (58%), Positives = 220/323 (68%), Gaps = 34/323 (10%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
FD +PDDL+V I L STA P+D N+++ C+R A + VL++ S + FA+KA +
Sbjct: 1 FDFVPDDLLVDIAMALVSTASSPADLFNMMLICRRFCAAATHPQVLARVSPQAFAVKAGS 60
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
WS +HRFLK CAD GNVEA YTLGMIRFYCL N SGASLMAKAA++SHA AL+SLAVI
Sbjct: 61 WSAGSHRFLKQCADVGNVEALYTLGMIRFYCLNN-SSGASLMAKAAVASHAAALHSLAVI 119
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
FNGSGG + DK+L+AGVALC RAA LGH+DA+RELGHCLQDGYGV +N+ +GRR L++A
Sbjct: 120 HFNGSGGRRKDKNLKAGVALCMRAASLGHVDAIRELGHCLQDGYGVVKNVLQGRRLLLEA 179
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
NARE A + HP NG GC LLSDFG
Sbjct: 180 NAREAA----------------------AARHPV-----FKLLQNG-----GCALLSDFG 207
Query: 298 CNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYC 357
CNVP + H A++FL EWFA T GLRLCSH CGRPETRRHEFRRCS CG VNYC
Sbjct: 208 CNVPPAKVHVANKFLVEWFALH-PTSAAGLRLCSHANCGRPETRRHEFRRCSACGRVNYC 266
Query: 358 SRACQALDWKLRHKADCAPAERW 380
SRACQALDWKLRHK C P W
Sbjct: 267 SRACQALDWKLRHKYHCIPVADW 289
>gi|413918111|gb|AFW58043.1| MYND finger family protein [Zea mays]
Length = 373
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 207/404 (51%), Positives = 251/404 (62%), Gaps = 70/404 (17%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
M+T+RG CY G+ K+RR + P R G D+F+
Sbjct: 1 MKTRRGACYS--CHAAPGEGPEVQRRKRRRTAAEASPAAAEDPGVRL-----GARDMFED 53
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNS--LVLSKASKKTFAIKANNW 118
LPDDLVVSIL ++++A P+D ++TCKR L S +VL++AS + +++A W
Sbjct: 54 LPDDLVVSILRDVAASAGSPADLAGAMLTCKRFRELVARSKVVVLARASPRCLSVRAKAW 113
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
S+ AHRFL+ CADAGN+EACY LGMIRFYCL +RGSGA+LMA AA+ H +ALYSLAVIQ
Sbjct: 114 SDEAHRFLQRCADAGNLEACYLLGMIRFYCLGSRGSGAALMAAAAVGGHREALYSLAVIQ 173
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
FNGSGG K+D+DLRAG ALCARAA GH+DALRELGHCLQDGYGVR+++ +GRR L+QAN
Sbjct: 174 FNGSGGGKDDRDLRAGAALCARAAARGHVDALRELGHCLQDGYGVRRSVLDGRRLLIQAN 233
Query: 239 ARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGC 298
ARELAA S HP C L+SD GC
Sbjct: 234 ARELAAA--------------------SVAHP-------------------CLLMSDLGC 254
Query: 299 NVPAPEAHPASRFLAEWFAAR--GGTPG------------------PGLRLCSHVGCGRP 338
A EAH A+RFL +WFA+R G TPG GLRLCS CGRP
Sbjct: 255 R--AGEAHAANRFLVDWFASRPLGLTPGGNGNGSAGTAGSPEEDGVGGLRLCSQALCGRP 312
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
ETRRHEFRRCSVCG VNYCSRACQAL W++ HKA+C P +RWLD
Sbjct: 313 ETRRHEFRRCSVCGVVNYCSRACQALHWRMAHKAECTPMDRWLD 356
>gi|125548043|gb|EAY93865.1| hypothetical protein OsI_15641 [Oryza sativa Indica Group]
Length = 395
Score = 325 bits (832), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 208/404 (51%), Positives = 257/404 (63%), Gaps = 50/404 (12%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+RG CY C R K+RR G + A +D+F+
Sbjct: 1 MRTRRGACYSPAS--CQDGR-----RKRRRIAGGGGEGSAAAAAAVAGGAEGPANDMFEE 53
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
LPDDLVVSIL ++++AR P D ++TCKR L + +VL++AS + A++A WS+
Sbjct: 54 LPDDLVVSILADVAASARSPGDLAGAMLTCKRFRELGQSKVVLARASPRCLAVRAKAWSD 113
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
+AHRFL+ CADAGN++ACY LGMIRFYCL +RGSGA+LMA AA+ H +ALYSLAVIQFN
Sbjct: 114 AAHRFLQRCADAGNLDACYLLGMIRFYCLGSRGSGAALMAAAAVGGHREALYSLAVIQFN 173
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGGSK+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR+++ +GRR L+QANAR
Sbjct: 174 GSGGSKDDRDLRAGAALCARAASLGHVDALRELGHCLQDGYGVRRSVLDGRRLLIQANAR 233
Query: 241 ELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCN- 299
ELAA ++++A + + S+ H LLSDFGC+
Sbjct: 234 ELAAAVAASA-SLLRAATGKPAAAASRRHSC--------------------LLSDFGCHA 272
Query: 300 ---VPAPEAHPASRFLAEWFAAR------------------GGTPGPGLRLCSHVGCGRP 338
EAH A+RFL +WFA+R GLRLCSH CGRP
Sbjct: 273 AAPKAGAEAHAANRFLVDWFASRPLAGSTAAAAAAPTPGSAAEDEAAGLRLCSHALCGRP 332
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
ETRRHEFRRCSVCG VNYCSRACQAL WK HKA+C P +RWLD
Sbjct: 333 ETRRHEFRRCSVCGVVNYCSRACQALHWKTAHKAECTPMDRWLD 376
>gi|125590155|gb|EAZ30505.1| hypothetical protein OsJ_14552 [Oryza sativa Japonica Group]
Length = 396
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 204/400 (51%), Positives = 251/400 (62%), Gaps = 41/400 (10%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
MRT+RG CY C R K+RR G + A +D+F+
Sbjct: 1 MRTRRGACYSPAS--CQDGR-----RKRRRIAGGGGEGSAAAAAAVAGGAEGPANDMFEE 53
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
LPDDLVVSIL ++++AR P D ++TCKR L + +VL++AS + A++A WS+
Sbjct: 54 LPDDLVVSILADVAASARSPGDLAGAMLTCKRFRELGQSKVVLARASPRCLAVRAKAWSD 113
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
+AHRFL+ CADAGN++ACY LGMIRFYCL +RGSGA+LMA AA+ H +ALYSLAVIQFN
Sbjct: 114 AAHRFLQRCADAGNLDACYLLGMIRFYCLGSRGSGAALMAAAAVGGHREALYSLAVIQFN 173
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GSGGSK+D+DLRAG ALCARAA LGH+DALRELGHCLQDGYGVR+++ +GRR L+QANAR
Sbjct: 174 GSGGSKDDRDLRAGAALCARAASLGHVDALRELGHCLQDGYGVRRSVLDGRRLLIQANAR 233
Query: 241 ELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNV 300
ELAA ++++A + P P R +
Sbjct: 234 ELAAAVAASASLLRAATRQARRRP-----PPAATRASSATSAATPRR-----------PR 277
Query: 301 PAPEAHPASRFLAEWFAAR-----------GGTPGP-------GLRLCSHVGCGRPETRR 342
PA A+RFL +WFA+R TPG GLRLCSH CGRPETRR
Sbjct: 278 PAARRTAANRFLVDWFASRPLAGSTVASAAAPTPGSAAEDEAAGLRLCSHALCGRPETRR 337
Query: 343 HEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
HEFRRCSVCG VNYCSRACQAL WK HKA+C P +RWLD
Sbjct: 338 HEFRRCSVCGVVNYCSRACQALHWKTAHKAECTPMDRWLD 377
>gi|449519282|ref|XP_004166664.1| PREDICTED: F-box protein At1g67340-like [Cucumis sativus]
Length = 246
Score = 317 bits (812), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 158/239 (66%), Positives = 184/239 (76%), Gaps = 23/239 (9%)
Query: 143 MIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA 202
MI FYCL+NRG+GASLMAKAAI SH ALYSLA++QFNGSGGSK+DK+LRAGVALC RAA
Sbjct: 1 MIEFYCLKNRGTGASLMAKAAIKSHPLALYSLAIVQFNGSGGSKSDKNLRAGVALCGRAA 60
Query: 203 FLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTW 262
+LGHIDALRELGHCLQDGYGV QN++EGRR L++ANARELA ++T
Sbjct: 61 YLGHIDALRELGHCLQDGYGVPQNVSEGRRLLIEANARELA---------------YITN 105
Query: 263 NPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGT 322
+ L + H + H ++ T G LLSDFGCN PAPEAHPA++FL +WFA+ G+
Sbjct: 106 SSLRRLHCSRHRQS--------ATDSGGSLLSDFGCNFPAPEAHPANQFLRDWFASGRGS 157
Query: 323 PGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
GLRLCS+ GCGRPETR HEFRRCSVCG VNYCSR CQALDWKLRHK +CAP +RWL
Sbjct: 158 IAEGLRLCSNSGCGRPETRLHEFRRCSVCGNVNYCSRGCQALDWKLRHKTECAPFDRWL 216
>gi|115446277|ref|NP_001046918.1| Os02g0506400 [Oryza sativa Japonica Group]
gi|48716133|dbj|BAD23173.1| F-box protein-like [Oryza sativa Japonica Group]
gi|113536449|dbj|BAF08832.1| Os02g0506400 [Oryza sativa Japonica Group]
Length = 379
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 165/330 (50%), Positives = 212/330 (64%), Gaps = 26/330 (7%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
FDAL D+LVVSIL ++++A P+D ++TC+R L + LVL++AS A++A
Sbjct: 46 FDALHDELVVSILADVAASAGSPADLAAAMLTCRRFRELGKHGLVLARASPSAVAVRAAA 105
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
W + AHRFL CA+AGNVEA Y LGMI FYC +NR GA L+ AA H +ALYS+A+I
Sbjct: 106 WCDDAHRFLVRCAEAGNVEASYLLGMIMFYCFENRKLGAELLGAAARRGHGEALYSMAII 165
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
QFNGSG K+ ++L+AG LCARAA GH DALRELGHC+ DGYGVR++++ GRR L+QA
Sbjct: 166 QFNGSGLPKDGRNLQAGAQLCARAASRGHTDALRELGHCVSDGYGVRRSLSGGRRLLIQA 225
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
N REL A +++ G + L + +P P PH LLSD+G
Sbjct: 226 NFRELCAAVANG---GARFAAALGRSGECKP-PGPHM----------------CLLSDYG 265
Query: 298 CNV---PAPEAHPASRFLAEWFAAR---GGTPGPGLRLCSHVGCGRPETRRHEFRRCSVC 351
C+V AH A+ FLA W+A+R G LR+CS CGRPETR+HEFRRCSVC
Sbjct: 266 CHVAGAAGRRAHAANAFLAGWYASRPLASGAGAAALRMCSQPTCGRPETRKHEFRRCSVC 325
Query: 352 GAVNYCSRACQALDWKLRHKADCAPAERWL 381
V YCSRACQA+ WK+ HK+ C P WL
Sbjct: 326 SGVIYCSRACQAMHWKVAHKSACVPMAHWL 355
>gi|357142382|ref|XP_003572553.1| PREDICTED: F-box protein At1g67340-like [Brachypodium distachyon]
Length = 364
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 163/326 (50%), Positives = 207/326 (63%), Gaps = 39/326 (11%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
FD L D+LVVSIL +++TA P+D +TCKR L + LVL++AS + A++A
Sbjct: 29 FDELHDELVVSILADVAATANSPADLAAATLTCKRFRELGQHKLVLARASPRCVAVRAKG 88
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
WS+ AHRFL C+DAGN +A Y LGMI +YC NR +G+ L+A+AA+ HA+ALYS+A+I
Sbjct: 89 WSDDAHRFLLRCSDAGNTDASYLLGMILYYCAGNRPAGSELLAQAALRGHAEALYSMAII 148
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
QFNGSGGSK+ ++L LCA AA GH DALRELGHC+ DGYGVR++++ GRR LVQA
Sbjct: 149 QFNGSGGSKDSRNLLVAAHLCAHAAGRGHTDALRELGHCVSDGYGVRKSVSAGRRLLVQA 208
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
N E+ A L + A QG+ Q P+H LLSDFG
Sbjct: 209 NFNEMCAALRAGAGQGV----------------------QRPSHE--------CLLSDFG 238
Query: 298 CN----VPAPEAHPASRFLAEWFAAR-----GGTPGPGLRLCSHVGCGRPETRRHEFRRC 348
C+ A A+ A+ FLAEWFA R G GLRLCS CGRPETR++EFRRC
Sbjct: 239 CHHVAAGRARAANGANEFLAEWFATRPAAVAEAAAGAGLRLCSQPACGRPETRKNEFRRC 298
Query: 349 SVCGAVNYCSRACQALDWKLRHKADC 374
S CG V YCSRACQAL W+ H+ +C
Sbjct: 299 SACGVVVYCSRACQALHWRAGHRTEC 324
>gi|125539593|gb|EAY85988.1| hypothetical protein OsI_07351 [Oryza sativa Indica Group]
Length = 379
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 166/330 (50%), Positives = 212/330 (64%), Gaps = 26/330 (7%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
FDAL D+LVVSIL ++++A P+D ++TC+R L + LVL++AS A++A
Sbjct: 46 FDALHDELVVSILADVAASAGSPADLAAAMLTCRRFRELGKHGLVLARASPSAVAVRAAA 105
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
W + AHRFL CA+AGNVEA Y LGMI FYC +NR GA L+ AA H +ALYS+A+I
Sbjct: 106 WCDDAHRFLVRCAEAGNVEASYLLGMIMFYCFENRKLGAELLGAAARRGHGEALYSMAII 165
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
QFNGSG K+ ++L+AG LCARAA GH DALRELGHC+ DGYGVR++++ GRR L+QA
Sbjct: 166 QFNGSGLPKDGRNLQAGAQLCARAASRGHNDALRELGHCVSDGYGVRRSLSGGRRLLIQA 225
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
N REL A +++ G + L + +P P PH LLSD+G
Sbjct: 226 NFRELCAAVANG---GARFAAALGRSGECKP-PGPHM----------------CLLSDYG 265
Query: 298 CNV---PAPEAHPASRFLAEWFAAR---GGTPGPGLRLCSHVGCGRPETRRHEFRRCSVC 351
C+V AH A+ FLA W+A+R G LR+CS CGRPETR+HEFRRCSVC
Sbjct: 266 CHVAGAAGRRAHAANAFLAGWYASRPLASGAGAAALRMCSQPTCGRPETRKHEFRRCSVC 325
Query: 352 GAVNYCSRACQALDWKLRHKADCAPAERWL 381
V YCSRACQAL WK+ HK+ C P WL
Sbjct: 326 SGVIYCSRACQALHWKVAHKSACVPMAHWL 355
>gi|413951484|gb|AFW84133.1| hypothetical protein ZEAMMB73_570285 [Zea mays]
Length = 304
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 180/308 (58%), Positives = 209/308 (67%), Gaps = 39/308 (12%)
Query: 90 CKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL 149
CKR+N L + +V +KAS + ++KA WSE+A RFLK ADAGN+EACY LGMIRFYCL
Sbjct: 4 CKRLNELGSHDMVFAKASPASLSVKAAAWSEAAQRFLKRSADAGNLEACYILGMIRFYCL 63
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
+R GA+L+AKAA+ HA ALYSLAVIQFNGSGG+K+D+DLRAG ALCARAA LGH+DA
Sbjct: 64 GSRSGGAALLAKAAVGGHAAALYSLAVIQFNGSGGAKSDRDLRAGAALCARAAALGHVDA 123
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPH 269
LRELGHCLQDGYGVR++ AEGRR LV ANAREL LS+AA
Sbjct: 124 LRELGHCLQDGYGVRRDPAEGRRLLVSANARELTLALSAAA------------------- 164
Query: 270 PAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAA----------- 318
R+ G G GCPLLSDFG ++P E H A++F+ +W+ A
Sbjct: 165 ----SRHAAGVVVGGGGGGGCPLLSDFGWSLPEAEPHAANQFMVDWWWASDRCAAQAGGK 220
Query: 319 ---RGGTPGPG--LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKAD 373
G G G LRLCSHV CGR ETRRHEFRRCSVCGA NYCSRACQALDWK HK
Sbjct: 221 ITGESGVDGDGAELRLCSHVRCGRRETRRHEFRRCSVCGAANYCSRACQALDWKRAHKVQ 280
Query: 374 CAPAERWL 381
C P +RWL
Sbjct: 281 CVPVDRWL 288
>gi|356523044|ref|XP_003530152.1| PREDICTED: LOW QUALITY PROTEIN: F-box protein At1g67340-like
[Glycine max]
Length = 249
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 141/242 (58%), Positives = 162/242 (66%), Gaps = 30/242 (12%)
Query: 90 CKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL 149
CKR+N L ++ VLSKAS KTF+ KA NW + HRFLK C DA N+EACYTLGMIRFYCL
Sbjct: 35 CKRLNSLGIHPFVLSKASHKTFSYKAKNWCDFLHRFLKHCPDAENIEACYTLGMIRFYCL 94
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
QNRGS SLMAK ++SHA ALYSL VIQFNGSG KNDKDLRAGV LC AFLGHIDA
Sbjct: 95 QNRGSSVSLMAKVTMNSHASALYSLTVIQFNGSGDMKNDKDLRAGVILCPHVAFLGHIDA 154
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPH 269
L++LGHCLQDGY ++QN EGRRFLVQAN EL VL +
Sbjct: 155 LQKLGHCLQDGYNIKQNAMEGRRFLVQANVHELTVVLFTE-------------------- 194
Query: 270 PAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRL 329
N+ VT +GCP+LSD+ CN+P E H S+F+ EWF RG P PG+RL
Sbjct: 195 ----------NNKVLVTRLGCPMLSDYDCNIPTLEIHLMSQFILEWFDIRGEFPVPGMRL 244
Query: 330 CS 331
CS
Sbjct: 245 CS 246
>gi|108864399|gb|ABA93801.2| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 358
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 169/361 (46%), Positives = 214/361 (59%), Gaps = 63/361 (17%)
Query: 31 GFSSGEPKTTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITC 90
G +S +K+A SD+FD LPDD+V+ +L +L++ A P+D + +TC
Sbjct: 26 GAASPRSPPAAKKRAVVVAGDGDDSDVFDRLPDDIVLVVLSRLAANAASPADVASAALTC 85
Query: 91 KRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQ 150
+R LA + VLS+AS A++ WSE+AHRFL+ CA AG++ ACY LGM+RFYCL
Sbjct: 86 RRFRELATHPAVLSRASAAAVAVRWGAWSEAAHRFLRRCAAAGSLHACYFLGMVRFYCLG 145
Query: 151 NRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDAL 210
+R +GA+L+ +AA HA ALY+LAV+QFNGSGG K DKD RAGVALCARAA+LGH AL
Sbjct: 146 SRATGAALLGRAAGGGHAPALYALAVVQFNGSGGGKADKDARAGVALCARAAWLGHTPAL 205
Query: 211 RELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHP 270
RELGHCLQDGYG R++ GRR L+ A ARE L+W
Sbjct: 206 RELGHCLQDGYGARRDAPAGRRLLLHAAAREH-----------------LSWK------- 241
Query: 271 APHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWF-------AARGGTP 323
++ + +H+G A +A SRF+ W+ AARG P
Sbjct: 242 ----KHNHGHHDGSA----------------AEDA--VSRFMVAWWDSHRAKAAARGCLP 279
Query: 324 ----------GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKAD 373
G LRLCSH CGR ETRRHEFRRCSVCGA +YCSRACQALDWK H+A
Sbjct: 280 GEHGDGEHDGGEDLRLCSHARCGRRETRRHEFRRCSVCGAASYCSRACQALDWKRAHRAQ 339
Query: 374 C 374
C
Sbjct: 340 C 340
>gi|125534405|gb|EAY80953.1| hypothetical protein OsI_36133 [Oryza sativa Indica Group]
Length = 335
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 166/361 (45%), Positives = 211/361 (58%), Gaps = 63/361 (17%)
Query: 31 GFSSGEPKTTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITC 90
G +S +K+A SD+FD LPDD+V+ +L +L++ A P+D + +TC
Sbjct: 4 GAASPRSPPAAKKRAVVVAGDGDDSDVFDRLPDDIVLVVLSRLAANAASPADVASAALTC 63
Query: 91 KRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQ 150
+R LA + VLS+AS A++ WSE+AHRFL+ CA AG++ ACY LGM+RFYCL
Sbjct: 64 RRFRELAAHPAVLSRASAAAVAVRWGAWSEAAHRFLRRCAAAGSLHACYFLGMVRFYCLG 123
Query: 151 NRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDAL 210
+R +GA+L+ +AA HA ALY+LAV+QFNGSGG K DKD RAGVALCARAA+LGH AL
Sbjct: 124 SRATGAALLGRAAGGGHAPALYALAVVQFNGSGGGKADKDARAGVALCARAAWLGHTPAL 183
Query: 211 RELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHP 270
RELGH LQDGYG R++ GRR L+ A ARE L+W
Sbjct: 184 RELGHWLQDGYGARRDAPAGRRLLLHAAAREH-----------------LSWK------- 219
Query: 271 APHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWF-------AARGGTP 323
++ + +H+G + SRF+ W+ AARG P
Sbjct: 220 ----KHNHGHHDG------------------SAAEDAVSRFMVAWWDSHRAKAAARGCLP 257
Query: 324 ----------GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKAD 373
G LRLCSH CGR ETRRHEFRRCSVCGA +YCSRACQALDWK H+A
Sbjct: 258 GEHGDGEHDGGEDLRLCSHARCGRRETRRHEFRRCSVCGAASYCSRACQALDWKRAHRAQ 317
Query: 374 C 374
C
Sbjct: 318 C 318
>gi|297728367|ref|NP_001176547.1| Os11g0488900 [Oryza sativa Japonica Group]
gi|255680104|dbj|BAH95275.1| Os11g0488900 [Oryza sativa Japonica Group]
Length = 372
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 165/397 (41%), Positives = 211/397 (53%), Gaps = 99/397 (24%)
Query: 31 GFSSGEPKTTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNV---- 86
G +S +K+A SD+FD LPDD+V+ +L +L++ A P+D +
Sbjct: 4 GAASPRSPPAAKKRAVVVAGDGDDSDVFDRLPDDIVLVVLSRLAANAASPADVASAALTY 63
Query: 87 ----------------LITCKR----------------MNGLALNSLVLSKASKKTFAIK 114
++ +R LA + VLS+AS A++
Sbjct: 64 VAAAAAALFFLLFPLGVVFVRRGEEISHGCARGRRCRRFRELATHPAVLSRASAAAVAVR 123
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
WSE+AHRFL+ CA AG++ ACY LGM+RFYCL +R +GA+L+ +AA HA ALY+L
Sbjct: 124 WGAWSEAAHRFLRRCAAAGSLHACYFLGMVRFYCLGSRATGAALLGRAAGGGHAPALYAL 183
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
AV+QFNGSGG K DKD RAGVALCARAA+LGH ALRELGHCLQDGYG R++ GRR L
Sbjct: 184 AVVQFNGSGGGKADKDARAGVALCARAAWLGHTPALRELGHCLQDGYGARRDAPAGRRLL 243
Query: 235 VQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLS 294
+ A ARE L+W ++ + +H+G
Sbjct: 244 LHAAAREH-----------------LSWK-----------KHNHGHHDG----------- 264
Query: 295 DFGCNVPAPEAHPASRFLAEWF-------AARGGTP----------GPGLRLCSHVGCGR 337
+ SRF+ W+ AARG P G LRLCSH CGR
Sbjct: 265 -------SAAEDAVSRFMVAWWDSHRAKAAARGCLPGEHGDGEHDGGEDLRLCSHARCGR 317
Query: 338 PETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
ETRRHEFRRCSVCGA +YCSRACQALDWK H+A C
Sbjct: 318 RETRRHEFRRCSVCGAASYCSRACQALDWKRAHRAQC 354
>gi|212722836|ref|NP_001131425.1| uncharacterized protein LOC100192755 [Zea mays]
gi|194691482|gb|ACF79825.1| unknown [Zea mays]
Length = 338
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 151/270 (55%), Positives = 188/270 (69%), Gaps = 24/270 (8%)
Query: 56 DLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKA 115
D+F+ LPDDLVVSIL ++++A P+D ++TCKR L + +VL++AS + A++A
Sbjct: 52 DMFEELPDDLVVSILRDVAASASSPADLAGAMLTCKRFRELGQSKVVLARASPRCLAVRA 111
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLA 175
WSE AHRFL+ CADAG++EACY LGMIRFYCL RGSGA+LMA AA+ H +ALYSLA
Sbjct: 112 KAWSEEAHRFLQRCADAGSLEACYLLGMIRFYCLGIRGSGAALMAAAAVGGHREALYSLA 171
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
V+QFNGSGG K+D+DLRAG ALCARAA LGH+ ALRELGHCLQDGYGVR+++ +GRR L+
Sbjct: 172 VVQFNGSGGGKDDRDLRAGAALCARAASLGHVGALRELGHCLQDGYGVRRSVLDGRRLLI 231
Query: 236 QANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSD 295
QANARELAAV +S Q + APHP ++ C LLSD
Sbjct: 232 QANARELAAVAASPPPQLLGK-------------AAPHPTRRH----------SC-LLSD 267
Query: 296 FGCNVPAPEAHPASRFLAEWFAARGGTPGP 325
FGC A E H A+RFLA+WFA P P
Sbjct: 268 FGCRAAAGEPHAANRFLADWFAPPAQVPSP 297
>gi|195611864|gb|ACG27762.1| MYND finger family protein [Zea mays]
Length = 328
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 163/342 (47%), Positives = 202/342 (59%), Gaps = 61/342 (17%)
Query: 53 GKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFA 112
G D FD LPDDLV+++L +++ ARCP+D + C+R LA + VLS+AS A
Sbjct: 38 GTVDAFDCLPDDLVLAVLAGIAARARCPADLAAAALPCRRFRDLAAHPAVLSRASAAAVA 97
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALY 172
+ A WS++AHRFL+ CA +GN+ ACY LGM+ FYC+ R +GA+L+A++A HA ALY
Sbjct: 98 VPAGRWSDAAHRFLRRCAASGNLHACYLLGMVLFYCIGGRATGAALLARSAAGGHAAALY 157
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+LAV+QFNGSGG K DKD RAGVALCARAA+LGH+ ALRELGHCLQDGYG R+ A GR
Sbjct: 158 ALAVVQFNGSGGDKADKDPRAGVALCARAAWLGHVPALRELGHCLQDGYGARRAAATGRY 217
Query: 233 FLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPL 292
FL+ A AREL + H RN
Sbjct: 218 FLLHAAARELV---------------------------SSHCRN---------------- 234
Query: 293 LSDFGCNVPAPEAHPASRFLAEWFA--ARGGTPGPG------LRLCSHVGCGRPETRRHE 344
E ASRF+AEW+A ++ G G G RLCSH CGR ETRRHE
Sbjct: 235 ----------GEDDAASRFMAEWWALPSKTGAQGDGNDADADARLCSHPRCGRRETRRHE 284
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN 386
FR+CS CG+ YCSRACQAL WK H++ CA A GN
Sbjct: 285 FRQCSACGSAIYCSRACQALHWKRAHRSQCAAAASRWLAAGN 326
>gi|226531261|ref|NP_001152415.1| MYND finger family protein [Zea mays]
gi|195640994|gb|ACG39965.1| MYND finger family protein [Zea mays]
gi|195656045|gb|ACG47490.1| MYND finger family protein [Zea mays]
gi|413920826|gb|AFW60758.1| MYND finger family protein [Zea mays]
Length = 328
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 162/342 (47%), Positives = 201/342 (58%), Gaps = 61/342 (17%)
Query: 53 GKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFA 112
G D FD LPDDLV+++L +++ ARCP+D + C+R LA + VLS+AS A
Sbjct: 38 GTVDAFDCLPDDLVLAVLAGIAARARCPADLAAAALPCRRFRDLAAHPAVLSRASAAAVA 97
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALY 172
+ A WS++AHRFL+ CA +GN+ ACY LGM+ FYC+ R +GA+L+A++A HA ALY
Sbjct: 98 VPAGRWSDAAHRFLRRCAASGNLHACYLLGMVLFYCIGGRATGAALLARSAAGGHAAALY 157
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+LAV+QFNGSGG K DKD RAGVALCARAA+LGH+ ALRELGHCLQDGYG R+ A GR
Sbjct: 158 ALAVVQFNGSGGDKADKDPRAGVALCARAAWLGHVPALRELGHCLQDGYGARRAAATGRY 217
Query: 233 FLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPL 292
FL+ A AREL + H RN
Sbjct: 218 FLLHAAARELV---------------------------SSHCRN---------------- 234
Query: 293 LSDFGCNVPAPEAHPASRFLAEWFA--ARGGTPGPG------LRLCSHVGCGRPETRRHE 344
E ASRF+ EW+A ++ G G G RLCSH CGR ETRRHE
Sbjct: 235 ----------GEDDAASRFMVEWWALPSKTGAQGDGNDADADARLCSHPRCGRRETRRHE 284
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN 386
FR+CS CG+ YCSRACQAL WK H++ CA A GN
Sbjct: 285 FRQCSACGSAIYCSRACQALHWKRAHRSQCAAAASRWLAAGN 326
>gi|357450279|ref|XP_003595416.1| F-box family protein [Medicago truncatula]
gi|355484464|gb|AES65667.1| F-box family protein [Medicago truncatula]
Length = 242
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 125/232 (53%), Positives = 149/232 (64%), Gaps = 39/232 (16%)
Query: 143 MIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA 202
MIRFYCL+NR SG SL+AKAA+ HA ALYSLAVIQFNGSGG+K DKDLRAGVAL ARA+
Sbjct: 1 MIRFYCLRNRRSGVSLIAKAAMKLHAPALYSLAVIQFNGSGGTKQDKDLRAGVALSARAS 60
Query: 203 FLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVL---------SSAACQG 253
LGHIDALRELGHCLQDGYGV+QN+ EGRR LVQAN RE+ V + +C+G
Sbjct: 61 LLGHIDALRELGHCLQDGYGVKQNVTEGRRLLVQANVREILLVYRFLELASPSRAGSCEG 120
Query: 254 ISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLA 313
I +R ++ + PL+ + NV PE HP + FL
Sbjct: 121 ILSR---------------------------LSNMAVPLIKENRYNVKVPEVHPVNWFLR 153
Query: 314 EWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVC---GAVNYCSRACQ 362
EWF + G GLRLC+H+GCGR ETR HEFRRCSVC G ++ C + Q
Sbjct: 154 EWFESGLGILEDGLRLCAHIGCGRAETRPHEFRRCSVCGISGVISVCVQLVQ 205
>gi|242070927|ref|XP_002450740.1| hypothetical protein SORBIDRAFT_05g016760 [Sorghum bicolor]
gi|241936583|gb|EES09728.1| hypothetical protein SORBIDRAFT_05g016760 [Sorghum bicolor]
Length = 335
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 165/335 (49%), Positives = 202/335 (60%), Gaps = 53/335 (15%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
FD LPDDLV+++L L++ A CP+D + C+R LA + VLS+AS A+ A
Sbjct: 46 FDCLPDDLVLAVLAGLAARAGCPADLAAAALPCRRFRDLAAHPAVLSRASAAAVAVPAGR 105
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
WS++AH+FL+ CA AGN+ ACY LGM+RFYC+ R +GA+L+A+AA HA ALY+LAV+
Sbjct: 106 WSDAAHQFLRRCAAAGNLHACYFLGMVRFYCVGGRATGAALLARAAAGGHAAALYALAVV 165
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
QFNGSGG K DKD RAGVALCARAA+LGH+ ALRELGHC+QDGYG R++ A GRRFL+ A
Sbjct: 166 QFNGSGGVKADKDPRAGVALCARAAWLGHVPALRELGHCIQDGYGARRDAATGRRFLLHA 225
Query: 238 NARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
ARE +S H H RN
Sbjct: 226 AAREFL---------------------VSSSH--THRRNGQ------------------- 243
Query: 298 CNVPAPEAHPASRFLAEWF---AARGGTPGPG---LRLCSHVGCGRPETRRHEFRRCSVC 351
E ASRF+ EW+ AA+ G G G +RLCSH CGR ETRRHEFRRCS C
Sbjct: 244 -----EEEDAASRFMVEWWALAAAKTGGEGDGDADVRLCSHPRCGRRETRRHEFRRCSAC 298
Query: 352 GAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN 386
G+ YCSR CQALDWK H+ CA A GN
Sbjct: 299 GSAVYCSRECQALDWKRVHRGQCAAAASRWLAAGN 333
>gi|294462461|gb|ADE76778.1| unknown [Picea sitchensis]
Length = 264
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 134/268 (50%), Positives = 170/268 (63%), Gaps = 26/268 (9%)
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
MA+AAI+SH AL+SLAVIQFNGSGGS+ DKDL+AGV LCA+AA LG +DA+RELGHCLQ
Sbjct: 1 MAEAAIASHPAALHSLAVIQFNGSGGSRKDKDLKAGVVLCAKAAALGQVDAMRELGHCLQ 60
Query: 219 DGYGVRQNIAEGRRFLVQANAREL-------------AAVLSSAACQG----ISTRSWLT 261
DGYGV++N+AEGR++L++ANARE A + S C G ++ +S
Sbjct: 61 DGYGVKKNVAEGRQYLLEANAREAEVVREPSMQASAHAELQSLIICNGGYAHLNLQSQSL 120
Query: 262 WNPLSQPHPAPHPR-NQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARG 320
N LS H + + + LSDF C + + A+ FL +WFA
Sbjct: 121 LNKLSGSLVQGHTSVSDQYSVQRLIERDDSSSLSDFSCKMMPSNIYVANAFLVDWFAEH- 179
Query: 321 GTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
P GLRLCSH CGRPETRRHEFRRCS CG+VNYCSRACQA+DWKL H+ CAP W
Sbjct: 180 -PPELGLRLCSHSKCGRPETRRHEFRRCSACGSVNYCSRACQAMDWKLYHRHHCAPVADW 238
Query: 381 LDEEGNGDVDVDLNDAEGGGEDNVVADS 408
+ E ++D D E ED+++AD+
Sbjct: 239 EEHE---EID---RDEEDNIEDHILADN 260
>gi|414587613|tpg|DAA38184.1| TPA: hypothetical protein ZEAMMB73_264095 [Zea mays]
Length = 386
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 137/243 (56%), Positives = 172/243 (70%), Gaps = 24/243 (9%)
Query: 56 DLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKA 115
D+F+ LPDDLVVSIL ++++A P+D ++TCKR L + +VL++AS + A++A
Sbjct: 52 DMFEELPDDLVVSILRDVAASASSPADLAGAMLTCKRFRELGQSKVVLARASPRCLAVRA 111
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLA 175
WSE AHRFL+ CADAG++EACY LGMIRFYCL RGSGA+LMA AA+ H +ALYSLA
Sbjct: 112 KAWSEEAHRFLQRCADAGSLEACYLLGMIRFYCLGIRGSGAALMAAAAVGGHREALYSLA 171
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
V+QFNGSGG K+D+DLRAG ALCARAA LGH+ ALRELGHCLQDGYGVR+++ +GRR L+
Sbjct: 172 VVQFNGSGGGKDDRDLRAGAALCARAASLGHVGALRELGHCLQDGYGVRRSVLDGRRLLI 231
Query: 236 QANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSD 295
QANARELAAV +S Q + APHP ++ C LLSD
Sbjct: 232 QANARELAAVAASPPPQLLGK-------------AAPHPTRRH----------SC-LLSD 267
Query: 296 FGC 298
FG
Sbjct: 268 FGV 270
>gi|413936916|gb|AFW71467.1| hypothetical protein ZEAMMB73_748197 [Zea mays]
Length = 377
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 134/341 (39%), Positives = 177/341 (51%), Gaps = 35/341 (10%)
Query: 51 VAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKT 110
VA + F+ LP +L++S+L LSS A P+D NV++ C+ L N V+ AS
Sbjct: 20 VANQMGAFEVLPQELILSVLTALSSAADKPADLFNVMLVCQEFCRLVTNPQVIRAASAAC 79
Query: 111 FAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQA 170
++ W +A RFL+ C D GN +A + LGMI+FYCL + G S M A S HA A
Sbjct: 80 VSVLPRRWCAAADRFLRFCGDNGNADANFFLGMIQFYCLGLQTHGWSRMLMAVRSGHAHA 139
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
++ AV+ +NGS N + A L A AA GH+ ALR+L C+ +G GVRQ+ G
Sbjct: 140 AFAAAVVYYNGSCIDHNQRQPLAAANLFALAACRGHMGALRDLAFCVSNGLGVRQDAEAG 199
Query: 231 RRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGC 290
RR AN RE A +AA Q ++ HV G
Sbjct: 200 RRLTFTANLREFRATYPTAAEQAVAY--------------------------AHVVGNPG 233
Query: 291 PLLSDFGCNVPAPE-----AHPASRFLAEWFAARGGTPGPGLRL---CSHVGCGRPETRR 342
L S+ GC P AHPA+RFL +WF+A P P RL CS CGRPETR
Sbjct: 234 CLTSELGCFTTVPASLRGWAHPANRFLCDWFSA-NPLPFPPRRLLLMCSMPTCGRPETRG 292
Query: 343 HEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE 383
E+RRC VCG YCSR+CQ+L W++ H +C P +WL E
Sbjct: 293 LEYRRCPVCGIARYCSRSCQSLHWRMGHTRECVPTHQWLME 333
>gi|357450277|ref|XP_003595415.1| F-box family protein [Medicago truncatula]
gi|355484463|gb|AES65666.1| F-box family protein [Medicago truncatula]
Length = 221
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 108/208 (51%), Positives = 129/208 (62%), Gaps = 39/208 (18%)
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
HA ALYSLAVIQFNGSGG+K DKDLRAGVAL ARA+ LGHIDALRELGHCLQDGYGV+QN
Sbjct: 4 HAPALYSLAVIQFNGSGGTKQDKDLRAGVALSARASLLGHIDALRELGHCLQDGYGVKQN 63
Query: 227 IAEGRRFLVQANARELAAVL---------SSAACQGISTRSWLTWNPLSQPHPAPHPRNQ 277
+ EGRR LVQAN RE+ V + +C+GI +R
Sbjct: 64 VTEGRRLLVQANVREILLVYRFLELASPSRAGSCEGILSR-------------------- 103
Query: 278 NPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGR 337
++ + PL+ + NV PE HP + FL EWF + G GLRLC+H+GCGR
Sbjct: 104 -------LSNMAVPLIKENRYNVKVPEVHPVNWFLREWFESGLGILEDGLRLCAHIGCGR 156
Query: 338 PETRRHEFRRCSVC---GAVNYCSRACQ 362
ETR HEFRRCSVC G ++ C + Q
Sbjct: 157 AETRPHEFRRCSVCGISGVISVCVQLVQ 184
>gi|226509286|ref|NP_001149759.1| MYND finger family protein [Zea mays]
gi|195632060|gb|ACG36688.1| MYND finger family protein [Zea mays]
Length = 383
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 137/274 (50%), Positives = 179/274 (65%), Gaps = 10/274 (3%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDA 60
M+T+RG CY G+ K+RR + P R G D+F+
Sbjct: 1 MKTRRGACYS--CHAAPGEGPEVQRRKRRRTAAEASPAAAEDPGVRL-----GARDMFED 53
Query: 61 LPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNS--LVLSKASKKTFAIKANNW 118
LPDDLVVSIL ++++A P+D ++TCKR L S +VL++AS + +++A W
Sbjct: 54 LPDDLVVSILRDVAASAGSPADLAGAMLTCKRFRELVARSKVVVLARASPRCLSVRAKAW 113
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
S+ AHRFL+ CADAGN+EACY LGMIRFYCL +RGSGA+LMA AA+ H +ALYSLAVIQ
Sbjct: 114 SDEAHRFLQRCADAGNLEACYLLGMIRFYCLGSRGSGAALMAAAAVGGHREALYSLAVIQ 173
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
FNGSG K+D+DLRAG ALCARAA GH+ ALRELGH LQDGYGVR+++ +GR L+ AN
Sbjct: 174 FNGSGXGKDDRDLRAGAALCARAAARGHVXALRELGHXLQDGYGVRRSVLDGRXLLIXAN 233
Query: 239 ARELAAVLSSAACQGISTRSWLTWNPLSQPHPAP 272
ARELAA ++ C +S R + ++ P P
Sbjct: 234 ARELAAASAAHPCL-LSERLGVPRRRGARGQPVP 266
>gi|255082424|ref|XP_002504198.1| predicted protein [Micromonas sp. RCC299]
gi|226519466|gb|ACO65456.1| predicted protein [Micromonas sp. RCC299]
Length = 502
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 129/357 (36%), Positives = 180/357 (50%), Gaps = 26/357 (7%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFA-IKAN 116
F + DD++ + L+ A P+ + + ++T KR ++ LS +K + A
Sbjct: 122 FCSCSDDVLEVVFGHLARDA-SPAQYFDAMLTSKRFRDAGMSKRALSNVGEKVLSRATAE 180
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQ-----------NRGSGASLMAKAAIS 165
NW+E A F + DAG+V A + +G + FYC R GASL+A+AA++
Sbjct: 181 NWNEGASAFAEAAVDAGSVFAAFLVGSVDFYCRDAVGETTELAAARRTRGASLLARAAVA 240
Query: 166 SHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFL-GHIDALRELGHCLQDGYGVR 224
+ A ++LA++ FNGSGG + DKD G A CAR L G DA RELGHCLQDG+GV
Sbjct: 241 GSSDAHHTLAIMHFNGSGGKRKDKDPETGAAFCARGNILFGSADAKRELGHCLQDGFGVE 300
Query: 225 QNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGH 284
Q++ GR+ L +A A ++A AA + + L + RN N
Sbjct: 301 QDVPLGRKLLAEAAAADVATPAMDAAVVAATVAAAAAEERLRK-------RNDINTLNAA 353
Query: 285 VTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAAR--GGTPGPGLRLCSHVGCGRPETRR 342
V + A P +RFL +W+ G PG CSH CGR ETRR
Sbjct: 354 VAAAAEVSAAAKHAAEQHLTAPPVARFLLDWYDPNVCGNVLQPGTHACSHPLCGRVETRR 413
Query: 343 HEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGG 399
HEFRRCS CG YCSR+CQ+LDW+L+HK C P L+ G + D + D G
Sbjct: 414 HEFRRCSCCGRTRYCSRSCQSLDWRLQHKFACLP---LLELYGWIESDTESQDTTAG 467
>gi|303281746|ref|XP_003060165.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458820|gb|EEH56117.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 123/324 (37%), Positives = 172/324 (53%), Gaps = 14/324 (4%)
Query: 59 DALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNW 118
DA+PDD++ I L+ + P ++ + +TCKR + LS A ++ A +A +W
Sbjct: 49 DAVPDDVLEVIFGHLARDS-TPREYHDATLTCKRFRDAGTSKRALSDAGERVLATRAKDW 107
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLA 175
+ A F AG+V A + +G ++FY + R +GA+L+AKAA++ A ++LA
Sbjct: 108 NNGASAFADAAVAAGSVFASFFVGSVKFYAAKTKEERTAGAALLAKAAVNGSPDAHHTLA 167
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFL-GHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
++ FNGSGG + DKD AG ALCAR L + A RELGHCLQDG+GV +++A G++ L
Sbjct: 168 IMHFNGSGGRRKDKDPEAGAALCARGNILFNSVPAKRELGHCLQDGFGVERDVALGKKLL 227
Query: 235 VQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLS 294
V+A A + I + RN VT +
Sbjct: 228 VEAAAED-------TQTPAIDIAVSAATAAAAAVEERLRKRNDINTLQEAVTASREVSNA 280
Query: 295 DFGCNVPAPEAHPASRFLAEWFAAR--GGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCG 352
A P +RFL +WF G PG+ CSH CGR ETRRHEFRRCS CG
Sbjct: 281 ARAAAREHLHATPENRFLLDWFNPDVCGEVLPPGMHACSHPLCGRVETRRHEFRRCSCCG 340
Query: 353 AVNYCSRACQALDWKLRHKADCAP 376
V YCSR+CQ+LDW+L+HK C P
Sbjct: 341 RVRYCSRSCQSLDWRLQHKFACLP 364
>gi|383164812|gb|AFG65194.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164818|gb|AFG65197.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
Length = 120
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 66/88 (75%), Positives = 73/88 (82%), Gaps = 2/88 (2%)
Query: 289 GCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRC 348
CPLLSDFGCNVP PE+HPA++F+ EWF R P GLRLCS+ GCGRPETR+HEFRRC
Sbjct: 9 ACPLLSDFGCNVPPPESHPAAKFMVEWFKLRA--PCAGLRLCSYAGCGRPETRKHEFRRC 66
Query: 349 SVCGAVNYCSRACQALDWKLRHKADCAP 376
SVCG VNYCSRACQALDWK+RHK C P
Sbjct: 67 SVCGTVNYCSRACQALDWKIRHKHSCMP 94
>gi|383164802|gb|AFG65189.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164804|gb|AFG65190.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164814|gb|AFG65195.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164820|gb|AFG65198.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164824|gb|AFG65200.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
Length = 119
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 66/88 (75%), Positives = 73/88 (82%), Gaps = 2/88 (2%)
Query: 289 GCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRC 348
CPLLSDFGCNVP PE+HPA++F+ EWF R P GLRLCS+ GCGRPETR+HEFRRC
Sbjct: 9 ACPLLSDFGCNVPPPESHPAAKFMVEWFKLRA--PCAGLRLCSYAGCGRPETRKHEFRRC 66
Query: 349 SVCGAVNYCSRACQALDWKLRHKADCAP 376
SVCG VNYCSRACQALDWK+RHK C P
Sbjct: 67 SVCGTVNYCSRACQALDWKIRHKHSCMP 94
>gi|383164800|gb|AFG65188.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
Length = 119
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 66/88 (75%), Positives = 73/88 (82%), Gaps = 2/88 (2%)
Query: 289 GCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRC 348
CPLLSDFGCNVP PE+HPA++F+ EWF R P GLRLCS+ GCGRPETR+HEFRRC
Sbjct: 9 ACPLLSDFGCNVPPPESHPAAKFMVEWFKLRA--PCAGLRLCSYAGCGRPETRKHEFRRC 66
Query: 349 SVCGAVNYCSRACQALDWKLRHKADCAP 376
SVCG VNYCSRACQALDWK+RHK C P
Sbjct: 67 SVCGTVNYCSRACQALDWKIRHKHSCMP 94
>gi|383164796|gb|AFG65186.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164798|gb|AFG65187.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164806|gb|AFG65191.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164808|gb|AFG65192.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164810|gb|AFG65193.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164816|gb|AFG65196.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
gi|383164822|gb|AFG65199.1| Pinus taeda anonymous locus 2_4883_01 genomic sequence
Length = 119
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 66/88 (75%), Positives = 73/88 (82%), Gaps = 2/88 (2%)
Query: 289 GCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRC 348
CPLLSDFGCNVP PE+HPA++F+ EWF R P GLRLCS+ GCGRPETR+HEFRRC
Sbjct: 9 ACPLLSDFGCNVPPPESHPAAKFMVEWFKLRA--PCAGLRLCSYAGCGRPETRKHEFRRC 66
Query: 349 SVCGAVNYCSRACQALDWKLRHKADCAP 376
SVCG VNYCSRACQALDWK+RHK C P
Sbjct: 67 SVCGTVNYCSRACQALDWKIRHKHSCMP 94
>gi|413936915|gb|AFW71466.1| hypothetical protein ZEAMMB73_748197 [Zea mays]
Length = 289
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 99/248 (39%), Positives = 125/248 (50%), Gaps = 35/248 (14%)
Query: 144 IRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAF 203
I+FYCL + G S M A S HA A ++ AV+ +NGS N + A L A AA
Sbjct: 25 IQFYCLGLQTHGWSRMLMAVRSGHAHAAFAAAVVYYNGSCIDHNQRQPLAAANLFALAAC 84
Query: 204 LGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWN 263
GH+ ALR+L C+ +G GVRQ+ GRR AN RE A +AA Q ++
Sbjct: 85 RGHMGALRDLAFCVSNGLGVRQDAEAGRRLTFTANLREFRATYPTAAEQAVAY------- 137
Query: 264 PLSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPE-----AHPASRFLAEWFAA 318
HV G L S+ GC P AHPA+RFL +WF+A
Sbjct: 138 -------------------AHVVGNPGCLTSELGCFTTVPASLRGWAHPANRFLCDWFSA 178
Query: 319 RGGTPGPGLRL---CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
P P RL CS CGRPETR E+RRC VCG YCSR+CQ+L W++ H +C
Sbjct: 179 -NPLPFPPRRLLLMCSMPTCGRPETRGLEYRRCPVCGIARYCSRSCQSLHWRMGHTRECV 237
Query: 376 PAERWLDE 383
P +WL E
Sbjct: 238 PTHQWLME 245
>gi|388505912|gb|AFK41022.1| unknown [Medicago truncatula]
Length = 122
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 61/96 (63%), Positives = 70/96 (72%)
Query: 288 IGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRR 347
+ PL+ + NV PE HP + FL EWF + G GLRLC+H+GCGR ETR HEFRR
Sbjct: 1 MAVPLIKENRYNVKVPEVHPVNWFLREWFESGLGILEDGLRLCAHIGCGRAETRPHEFRR 60
Query: 348 CSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE 383
CSVCG VNYC RACQALDWKLRHK +C+P E WLDE
Sbjct: 61 CSVCGKVNYCFRACQALDWKLRHKMECSPMEWWLDE 96
>gi|226496882|ref|NP_001141028.1| uncharacterized protein LOC100273107 [Zea mays]
gi|194702268|gb|ACF85218.1| unknown [Zea mays]
gi|414878965|tpg|DAA56096.1| TPA: hypothetical protein ZEAMMB73_508510 [Zea mays]
Length = 261
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/154 (49%), Positives = 98/154 (63%), Gaps = 12/154 (7%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSS--GEPKTTYRKKARFSPDVAGKSDLF 58
MRT+ G Y G+ V V K++R S+ GE R+K AG+ D
Sbjct: 1 MRTRSGSRYSNG-----GEVVALVGQKRKRSPSAVAGECDCGGRRK-----RPAGEPDYL 50
Query: 59 DALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNW 118
DALPDDLV+SIL KL++ + PSD ++V +TCKR+N L + +V +KAS + A+KA W
Sbjct: 51 DALPDDLVLSILSKLAAASSAPSDLLSVHLTCKRLNELGSHDMVFAKASPASLAVKAAAW 110
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNR 152
SE A RFLK CADAGN+EACY LGMIRFYCL +R
Sbjct: 111 SEPAQRFLKRCADAGNLEACYNLGMIRFYCLGSR 144
>gi|376336497|gb|AFB32865.1| hypothetical protein 0_4764_01, partial [Pinus mugo]
Length = 149
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/93 (66%), Positives = 69/93 (74%), Gaps = 2/93 (2%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C LLSDFGCNVP PE H A++FL +WF+ P PGLRLCS+ CGRPETRRHEFRRCS
Sbjct: 44 CSLLSDFGCNVPPPEPHIANKFLVDWFSLYP--PQPGLRLCSNEHCGRPETRRHEFRRCS 101
Query: 350 VCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
C +VNYCSRACQALDWK H+ C P W D
Sbjct: 102 ACNSVNYCSRACQALDWKTCHRYVCMPQRDWDD 134
>gi|361066465|gb|AEW07544.1| Pinus taeda anonymous locus 0_4764_01 genomic sequence
Length = 149
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 61/93 (65%), Positives = 69/93 (74%), Gaps = 2/93 (2%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C LLSDFGCNVP E+H A++FL +WF+ P PGLRLCS+ CGRPETRRHEFRRCS
Sbjct: 44 CSLLSDFGCNVPPTESHIANKFLVDWFSLYP--PQPGLRLCSNEHCGRPETRRHEFRRCS 101
Query: 350 VCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
C +VNYCSRACQALDWK H+ C P W D
Sbjct: 102 ACSSVNYCSRACQALDWKTCHRYVCMPQHDWDD 134
>gi|383130443|gb|AFG45954.1| Pinus taeda anonymous locus 0_4764_01 genomic sequence
Length = 149
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 61/93 (65%), Positives = 69/93 (74%), Gaps = 2/93 (2%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C LLSDFGCNVP E+H A++FL +WF+ P PGLRLCS+ CGRPETRRHEFRRCS
Sbjct: 44 CSLLSDFGCNVPPTESHIANKFLVDWFSLYP--PQPGLRLCSNEHCGRPETRRHEFRRCS 101
Query: 350 VCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
C +VNYCSRACQALDWK H+ C P W D
Sbjct: 102 ACSSVNYCSRACQALDWKTCHRYVCMPQHDWDD 134
>gi|376336485|gb|AFB32859.1| hypothetical protein 0_4764_01, partial [Pinus cembra]
Length = 148
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 61/93 (65%), Positives = 68/93 (73%), Gaps = 2/93 (2%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C LLSDFGCNVP PE H A++FL +WF+ P GLRLCS+ CGRPETRRHEFRRCS
Sbjct: 43 CSLLSDFGCNVPPPEPHIANKFLVDWFSL--CPPQTGLRLCSNEHCGRPETRRHEFRRCS 100
Query: 350 VCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
C +VNYCSRACQALDWK H+ C P W D
Sbjct: 101 ACSSVNYCSRACQALDWKTSHRYVCMPQHDWDD 133
>gi|376336487|gb|AFB32860.1| hypothetical protein 0_4764_01, partial [Pinus cembra]
gi|376336489|gb|AFB32861.1| hypothetical protein 0_4764_01, partial [Pinus cembra]
gi|376336491|gb|AFB32862.1| hypothetical protein 0_4764_01, partial [Pinus cembra]
gi|376336493|gb|AFB32863.1| hypothetical protein 0_4764_01, partial [Pinus cembra]
gi|376336495|gb|AFB32864.1| hypothetical protein 0_4764_01, partial [Pinus cembra]
Length = 148
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 61/93 (65%), Positives = 68/93 (73%), Gaps = 2/93 (2%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C LLSDFGCNVP PE H A++FL +WF+ P GLRLCS+ CGRPETRRHEFRRCS
Sbjct: 43 CSLLSDFGCNVPPPEPHIANKFLVDWFSL--CPPQTGLRLCSNEHCGRPETRRHEFRRCS 100
Query: 350 VCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
C +VNYCSRACQALDWK H+ C P W D
Sbjct: 101 ACSSVNYCSRACQALDWKTSHRYVCMPQHDWDD 133
>gi|361066467|gb|AEW07545.1| Pinus taeda anonymous locus 0_4764_01 genomic sequence
Length = 148
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 60/93 (64%), Positives = 67/93 (72%), Gaps = 2/93 (2%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C LLSDFGCNVP PE H A++FL +WF+ P GLRLCS+ CGR ETRRHEFRRCS
Sbjct: 43 CSLLSDFGCNVPPPEPHIANKFLVDWFSL--CPPQTGLRLCSNEHCGRSETRRHEFRRCS 100
Query: 350 VCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
C +VNYCSRACQALDWK H+ C P W D
Sbjct: 101 ACSSVNYCSRACQALDWKTSHRYVCMPQHDWDD 133
>gi|125582237|gb|EAZ23168.1| hypothetical protein OsJ_06852 [Oryza sativa Japonica Group]
Length = 380
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 58/120 (48%), Positives = 80/120 (66%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
FDAL D+LVVSIL ++++A P+D ++TC+R L + LVL++AS A++A
Sbjct: 46 FDALHDELVVSILADVAASAGSPADLAAAMLTCRRFRELGKHGLVLARASPSAVAVRAAA 105
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
W + AHRFL CA+AGNVEA Y LGMI FYC +NR GA L+ AA + + LYS+ +I
Sbjct: 106 WCDDAHRFLVRCAEAGNVEASYLLGMIMFYCFENRKLGAELLGAAARPGNGKELYSMGII 165
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/54 (66%), Positives = 41/54 (75%)
Query: 328 RLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
R+CS CGRPETR+HEFRRCSVC V YCSRACQA+ WK+ HK+ C P WL
Sbjct: 303 RMCSQPTCGRPETRKHEFRRCSVCSGVIYCSRACQAMHWKVAHKSACVPMAHWL 356
>gi|238011454|gb|ACR36762.1| unknown [Zea mays]
Length = 147
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 92/147 (62%), Gaps = 12/147 (8%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVNDKKRRGFSS--GEPKTTYRKKARFSPDVAGKSDLF 58
MRT+ G Y G+ V V K++R S+ GE R+K AG+ D
Sbjct: 1 MRTRSGSRYSNG-----GEVVALVGQKRKRSPSAVAGECDCGGRRK-----RPAGEPDYL 50
Query: 59 DALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNW 118
DALPDDLV+SIL KL++ + PSD ++V +TCKR+N L + +V +KAS + A+KA W
Sbjct: 51 DALPDDLVLSILSKLAAASSAPSDLLSVHLTCKRLNELGSHDMVFAKASPASLAVKAAAW 110
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIR 145
SE A RFLK CADAGN+EACY LGM+R
Sbjct: 111 SEPAQRFLKRCADAGNLEACYNLGMVR 137
>gi|449533314|ref|XP_004173621.1| PREDICTED: F-box protein At5g50450-like, partial [Cucumis sativus]
Length = 95
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/91 (63%), Positives = 65/91 (71%), Gaps = 1/91 (1%)
Query: 43 KKARFSPDVA-GKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSL 101
K+ R S A G SDLFD LPDDLVV +L KLS+TA P D VNV+ITCKR N LAL+ +
Sbjct: 5 KRRRTSTSAATGISDLFDGLPDDLVVVVLGKLSATASSPLDLVNVMITCKRFNRLALHPI 64
Query: 102 VLSKASKKTFAIKANNWSESAHRFLKLCADA 132
VLSKA K F I+ NWSES HRFLKLC A
Sbjct: 65 VLSKAGPKAFEIQTKNWSESTHRFLKLCVSA 95
>gi|57899057|dbj|BAD87831.1| hypothetical protein [Oryza sativa Japonica Group]
gi|218188054|gb|EEC70481.1| hypothetical protein OsI_01548 [Oryza sativa Indica Group]
Length = 246
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 80/165 (48%), Gaps = 51/165 (30%)
Query: 228 AEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTG 287
A+GR FLV A+A EL L++AA ++ P + H+TG
Sbjct: 72 AKGRHFLVVASAWELTLALAAAA-------------------------SRRPFASLHLTG 106
Query: 288 ----IGCPLLSDFGCNVPAPEAHPASRFLAE-------------------WFAARGGTPG 324
GCPLLSDF ++P E +PA+ F+A+ A +
Sbjct: 107 GGAISGCPLLSDFRWSLPEVEPYPANLFMADSSMSCGVQATAKKLDLEAPVVATTATSDN 166
Query: 325 PG---LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G L+LCS + CGR ETRRHEFRRCSVCGA NY SR Q LDW
Sbjct: 167 NGHSELQLCSQMWCGRRETRRHEFRRCSVCGAANYSSRVSQVLDW 211
>gi|294463938|gb|ADE77490.1| unknown [Picea sitchensis]
Length = 194
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 66/127 (51%), Gaps = 23/127 (18%)
Query: 1 MRTKRGLCYPRVVDVCLGQRVVPVND-----KKRRGFSSGEPKTTYRKKARFSPDVAGKS 55
MRT+ G CYP PV D K++R S E K+ R P+
Sbjct: 1 MRTRAGSCYPS-----------PVRDGRENCKRKRQMMSREQMKG--KRLRREPNGVNH- 46
Query: 56 DLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVL--SKASKKTFAI 113
FD LPDD+++S+L K S +A+ ++FVNV++TCKR N L L+ LVL K ++ +
Sbjct: 47 --FDDLPDDVLISVLAKCSLSAKSAAEFVNVILTCKRFNALGLHLLVLPQEKVYRELQNL 104
Query: 114 KANNWSE 120
KA + E
Sbjct: 105 KAMYYPE 111
>gi|397563822|gb|EJK43954.1| hypothetical protein THAOC_37552 [Thalassiosira oceanica]
Length = 438
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 106/279 (37%), Gaps = 60/279 (21%)
Query: 134 NVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRA 193
NV C TLGM R + +Q + +G + KAA ++ ALY+L+++ G RA
Sbjct: 173 NVGWCMTLGM-RGFEIQVQ-TGMEWINKAAAQNYPAALYALSILYREGIASELERSQGRA 230
Query: 194 GVALCARAAFLGHIDALRELGHCLQDGY-GVRQNIAEGRRFLVQANARELAAVLSSAACQ 252
L AA LGH+ A +L C Q G G +N E F +A+ +A L + +
Sbjct: 231 N-ELLLEAANLGHVLANSDLADCHQKGADGFEKNPDE---FYFRAS---VAFALDNTHEE 283
Query: 253 GISTRSWLTWNP---LSQPHPAPHPRNQNPNHNGHVTGIGCPLLS--------------- 294
L +N L++P P +N + TG S
Sbjct: 284 AARILGGLHFNKNQILAEPSPYLACYYRNIAASEDTTGSASYFYSKSLLRLAYNDRKTLK 343
Query: 295 ----------------DFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRP 338
D GCNV A L +W T G CG+
Sbjct: 344 GSNELPAAFFWLRKSRDMGCNV-------AREMLKKWEIFAQKTCG---------NCGKK 387
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
+F++CS C A YCS+ CQ W+ HK DC A
Sbjct: 388 VQADEKFKQCSKCKAQWYCSKECQIEAWRAGHKQDCKRA 426
>gi|299741263|ref|XP_001834348.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
gi|298404633|gb|EAU87325.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
Length = 434
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 315 WFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
W +R LR C+HVGC ET +F+RCS C V YC + CQ DWK RHK +C
Sbjct: 330 WVNSREDVKEIILRTCNHVGCSARETEVAQFKRCSACRLVVYCGQECQKEDWK-RHKPEC 388
>gi|224122932|ref|XP_002318952.1| predicted protein [Populus trichocarpa]
gi|222857328|gb|EEE94875.1| predicted protein [Populus trichocarpa]
Length = 238
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 58/137 (42%), Gaps = 2/137 (1%)
Query: 52 AGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTF 111
+ + F P DL+ +L +++S + SD ++CK A + + + +
Sbjct: 8 TSRHEFFKYFPKDLLTEVLARVASAS--ISDLFTAKLSCKEFLEAASENYIFEHITIEKL 65
Query: 112 AIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQAL 171
+ S A FL C + GN EA + GM+ F+ SG + AA H +A+
Sbjct: 66 PVIPWRISHGASSFLARCKENGNPEALFRQGMVEFFSSNKPESGFQHLKNAARKGHVEAI 125
Query: 172 YSLAVIQFNGSGGSKND 188
Y+ VI G K +
Sbjct: 126 YTCGVILVCHGGQFKQE 142
>gi|357478767|ref|XP_003609669.1| F-box protein [Medicago truncatula]
gi|355510724|gb|AES91866.1| F-box protein [Medicago truncatula]
Length = 256
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 62/123 (50%), Gaps = 4/123 (3%)
Query: 57 LFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKAN 116
+ LP+DL+V+I+ K++S R +D + ++CK + + V A+ FA+
Sbjct: 24 IVKTLPNDLLVNIVGKVAS--RSMADLCKIKLSCKEFLNASEDGYVYQHAAMDKFALVPL 81
Query: 117 NW--SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
W E FL C ++GN+E Y GM++++ G + KAA+ H +A Y
Sbjct: 82 PWFTDEKETSFLSRCKESGNLEITYREGMVQYFSTLMVDLGLKNLKKAALEGHHEAKYVY 141
Query: 175 AVI 177
+++
Sbjct: 142 SML 144
>gi|123492730|ref|XP_001326130.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121909040|gb|EAY13907.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 2444
Score = 57.0 bits (136), Expect = 2e-05, Method: Composition-based stats.
Identities = 42/158 (26%), Positives = 73/158 (46%), Gaps = 20/158 (12%)
Query: 94 NGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRG 153
N L +N+ L S I ++A ++ K+ A+ GN+ + +G+ + +G
Sbjct: 2208 NALGMNNYALMCCS----GIGVRRNYDTAAKYFKMAAEKGNIYSLNNIGL-----MLRKG 2258
Query: 154 SG--------ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLG 205
+G A KAAIS A+Y+LA + +NG G +D R V L AA G
Sbjct: 2259 TGMDKDPVAAAQYFEKAAISGEVDAMYNLATMYYNGEG---IKRDRRKAVQLIKMAADRG 2315
Query: 206 HIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELA 243
++ A+ + + G V+QN+ E ++ A ++ A
Sbjct: 2316 NVKAIVQFASFCKHGKLVKQNLEEAAKYYRMATQKDSA 2353
>gi|219112695|ref|XP_002178099.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410984|gb|EEC50913.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 374
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 107/265 (40%), Gaps = 67/265 (25%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALY 172
++ ++ E+A L AGN+EA Y LG+I+ YC Q+ G +L+ +A+ + + +A Y
Sbjct: 171 LQEHDTPETAAVALTPFCQAGNLEAIYMLGIIKSYCHQDVQDGIALLKRASRAGYVRATY 230
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALREL--GHCLQDGYGVRQNIAEG 230
+L ++ D D + +AA G++ AL+E ++ YG + +AE
Sbjct: 231 ALGLVL--------RDNDPKEASRYMRKAAHKGYLPALQEWLPAREMKRRYG--EPLAE- 279
Query: 231 RRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGC 290
N RE L + R ++ PL RN N +H +
Sbjct: 280 -------NLRE---YLDPFGLNRLLARHYVRDKPL---------RNLNTSHCWN------ 314
Query: 291 PLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSV 350
PL C A +AH A HV + R + CS
Sbjct: 315 PL-----CGRWAFKAHTA-----------------------HVVEEPVDVRVSRMKMCSR 346
Query: 351 CGAVNYCSRACQALDWKL-RHKADC 374
C YCS+ CQ DW+ RHK +C
Sbjct: 347 CCRAKYCSKLCQVYDWRSGRHKTEC 371
>gi|328766838|gb|EGF76890.1| hypothetical protein BATDEDRAFT_28100 [Batrachochytrium
dendrobatidis JAM81]
Length = 449
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
R CS++ C + E+ HEF +CS C V YCS++CQ + W + H+ C P+E E
Sbjct: 381 SFRRCSNLKCSKTESHLHEFSKCSRCRRVTYCSKSCQCIAWTM-HRCWCMPSESHFTAES 439
Query: 386 NG 387
Sbjct: 440 TA 441
>gi|147773103|emb|CAN71689.1| hypothetical protein VITISV_039291 [Vitis vinifera]
Length = 241
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/159 (22%), Positives = 71/159 (44%), Gaps = 13/159 (8%)
Query: 59 DALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTF-AIKANN 117
++L DL++ +L +++S++ +D N ++C+ + ++ V+ S + F I
Sbjct: 16 ESLSSDLLIEVLTRVASSS--FTDLFNAKLSCRNFLEASKDTFVMKNISIEKFPTIPWWQ 73
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
+ FLK C ++GN E Y GM+ F+ + S + +A H +A Y +I
Sbjct: 74 VKDEVSSFLKACKESGNPEVLYRQGMMEFFSWKKVESXEEYLKRAMEMGHMEASYVYGII 133
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
G S ++ ++ L + R+LG C
Sbjct: 134 LLCKGGDSNEEQGMK----------LLNAVKKSRKLGEC 162
>gi|389643696|ref|XP_003719480.1| hypothetical protein MGG_04327 [Magnaporthe oryzae 70-15]
gi|351639249|gb|EHA47113.1| hypothetical protein MGG_04327 [Magnaporthe oryzae 70-15]
Length = 1240
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 26/44 (59%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
CGRPE + + RC C V YCS CQ DWK RHK +C A+
Sbjct: 1196 CGRPEGPQQKLNRCKACLEVKYCSVDCQRADWKTRHKRECQTAQ 1239
>gi|440472357|gb|ELQ41222.1| hypothetical protein OOU_Y34scaffold00290g19 [Magnaporthe oryzae Y34]
gi|440489372|gb|ELQ69030.1| hypothetical protein OOW_P131scaffold00197g25 [Magnaporthe oryzae
P131]
Length = 1240
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 26/44 (59%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
CGRPE + + RC C V YCS CQ DWK RHK +C A+
Sbjct: 1196 CGRPEGPQQKLNRCKACLEVKYCSVDCQRADWKTRHKRECQTAQ 1239
>gi|338814246|ref|ZP_08626276.1| hypothetical protein ALO_18502 [Acetonema longum DSM 6540]
gi|337273767|gb|EGO62374.1| hypothetical protein ALO_18502 [Acetonema longum DSM 6540]
Length = 707
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 66/129 (51%), Gaps = 7/129 (5%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALY 172
++S++AH +LK AD GN A TLG++ + N L +AA +H QA Y
Sbjct: 387 KDYSQAAH-WLKQSADQGNGWAQVTLGVLYRNGYGVANNDSEAVKLWEQAAKQNHVQAQY 445
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
L + F+G+G +N + +AA G+++A +LG+ + G G N+AE +
Sbjct: 446 LLGLSYFDGTGVVRN---YATALEWFKKAADQGYLEAQVQLGYMYERGLGATSNLAEAMK 502
Query: 233 FLVQANARE 241
+ +A ++
Sbjct: 503 WYQKAAEQQ 511
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 55/118 (46%), Gaps = 3/118 (2%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
E A + + AD G+++A + LG FY + G +AA + ++ Y L +
Sbjct: 285 EEALLWCRKAADQGHIQAQFLLGRAYFYSKKEYAEGVKWWQRAASAGEKESQYELGIAYQ 344
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G +++D + V +AA GH DA LG G GV ++ ++ +L Q+
Sbjct: 345 LGKGIAQDDVE---SVKWFQKAAEQGHPDAQHMLGRAYYYGKGVPKDYSQAAHWLKQS 399
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 64/176 (36%), Gaps = 11/176 (6%)
Query: 124 RFLKLCADAGNVEACYTLG---MIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
++ + A+ G+++A Y LG + + +N A K+A H Y++
Sbjct: 73 KWWRKAAEQGHMDAQYVLGNAAIFGYGMDRNPVEAAKWWRKSAEQGHISGQYTIGNAYMY 132
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
G G KD V AA GH A LG DG GV Q+ E R+ +A +
Sbjct: 133 GEG---VKKDPTEAVRWWKEAAAQGHGGAQYVLGMAYNDGIGVEQDFTEAVRWWQKAVEQ 189
Query: 241 ELAAV-----LSSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCP 291
A L+ A G+ Q H R+Q + G G P
Sbjct: 190 NYAPAHYPLGLAYAEGHGVKKDHDAALKHWQQGAEKEHARSQFALGGAYAHGYGVP 245
>gi|300122313|emb|CBK22885.2| unnamed protein product [Blastocystis hominis]
Length = 178
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 36/58 (62%), Gaps = 3/58 (5%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE 383
+R CS+ CG E+R EF++CS C + YCS+ CQ+ WK H+ +C P ++DE
Sbjct: 3 AVRSCSNPLCGNTESRPGEFKKCSRCKSACYCSKKCQSAHWKNGHREECKP---FVDE 57
>gi|384490648|gb|EIE81870.1| hypothetical protein RO3G_06575 [Rhizopus delemar RA 99-880]
Length = 484
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 69/149 (46%), Gaps = 18/149 (12%)
Query: 88 ITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY 147
++ ++ N L NSL + + AN E A ++ KL A+ G A LG Y
Sbjct: 150 LSAEQGNALGQNSL--GYCYEDGIGVAAN--FEEAVKWYKLSAEQGYPWAECNLG----Y 201
Query: 148 CLQN-------RGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCAR 200
C QN GA KAA+ HA+A ++L NG G +++K+ V R
Sbjct: 202 CYQNGIGLIKDETQGAYWYKKAALQGHARAQHNLGFCLQNGIGTERDEKE---AVKWYRR 258
Query: 201 AAFLGHIDALRELGHCLQDGYGVRQNIAE 229
AA G+I A LG+C Q+G GV N E
Sbjct: 259 AADRGNIFAYHSLGYCYQNGVGVEVNKQE 287
>gi|329118664|ref|ZP_08247367.1| Sel1 repeat superfamily protein [Neisseria bacilliformis ATCC
BAA-1200]
gi|327465222|gb|EGF11504.1| Sel1 repeat superfamily protein [Neisseria bacilliformis ATCC
BAA-1200]
Length = 328
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 72/152 (47%), Gaps = 13/152 (8%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQAL 171
A N+ ++A + K AD GN +A Y LG++ + QN A+ KAA A A
Sbjct: 178 AKNYRQAAAWYQK-AADQGNADAQYNLGLMYYNGQGVAQNYRQAAAWYQKAANQGDAAAQ 236
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGV----RQNI 227
++L ++ NG G ++ND R A +AA GH A LG +G G+ RQ
Sbjct: 237 FNLGLMYDNGQGVAQND---RQAAAWYQKAANQGHAKAQYNLGVMYYNGQGMARNYRQAA 293
Query: 228 AEGRRFLVQANARE--LAAVLSSAACQGISTR 257
A ++ L Q + E A L+ QG+S R
Sbjct: 294 AWYKKVLAQPDTPENAEAKALARENLQGLSKR 325
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 56/128 (43%), Gaps = 22/128 (17%)
Query: 131 DAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
D G+ EA Y LG++ Y +N A+ KAA +A A Y+L ++ +NG G +
Sbjct: 157 DQGDAEAQYNLGLM--YDNGRGVAKNYRQAAAWYQKAADQGNADAQYNLGLMYYNGQGVA 214
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAV 245
+N R A +AA G A LG +G GV QN R+ AA
Sbjct: 215 QN---YRQAAAWYQKAANQGDAAAQFNLGLMYDNGQGVAQND------------RQAAAW 259
Query: 246 LSSAACQG 253
AA QG
Sbjct: 260 YQKAANQG 267
>gi|294669975|ref|ZP_06734967.1| putative TPR repeat protein [Neisseria elongata subsp. glycolytica
ATCC 29315]
gi|291308183|gb|EFE49426.1| putative TPR repeat protein [Neisseria elongata subsp. glycolytica
ATCC 29315]
Length = 296
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 66/139 (47%), Gaps = 9/139 (6%)
Query: 103 LSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKA 162
L +A + + IK N E A ++ + A G+ EA +LG + + + A
Sbjct: 78 LGRARQYGYGIKPN--PEKAEKYYRRAAKLGHAEAQESLGCLYEFAEKPDYRRARKWYTR 135
Query: 163 AISSHAQ----ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
A+ H+ A Y L + G GG K D++ L +AA GH DA R LG+C +
Sbjct: 136 ALKQHSSDTPDAAYRLGWLYERGLGGKK---DIQKACQLYRKAAKNGHADAQRALGYCYE 192
Query: 219 DGYGVRQNIAEGRRFLVQA 237
G G+ +N A+ R++ +A
Sbjct: 193 KGLGLHKNYAKARKWSARA 211
>gi|421653723|ref|ZP_16094055.1| Sel1 repeat protein [Acinetobacter baumannii Naval-72]
gi|408512425|gb|EKK14068.1| Sel1 repeat protein [Acinetobacter baumannii Naval-72]
Length = 230
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 82/180 (45%), Gaps = 19/180 (10%)
Query: 66 VVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRF 125
+ +++ L S+A ++ ++ +A L + S + +K N+ E A ++
Sbjct: 10 IFTVIFSLHSSAFAEPSVADIQYRAEQGQSVAQYHLGMMLLSGEQGVVK--NY-EQAFKW 66
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLAVI 177
L G+V A Y+LGM+ F G+G AKAA SHA+A Y+LAV+
Sbjct: 67 LTAADQNGSVGAKYSLGMMYF-----TGTGVEKDMKRAFEYFAKAADKSHAKAQYNLAVL 121
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G ++N +RAA G+ A L H + G+GV Q+ + ++ +A
Sbjct: 122 YDRGEGTAQN---YEQAFEWYSRAAEQGYPPAEYNLAHLYKKGHGVAQSDEQALKWYTKA 178
>gi|397624427|gb|EJK67380.1| hypothetical protein THAOC_11597 [Thalassiosira oceanica]
Length = 305
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 53/114 (46%), Gaps = 2/114 (1%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSG 183
R + C A + Y R C N +++ +A+Y L F G
Sbjct: 107 RICEGCEMAAHKRGMYDCAFCRTPCPDNDADRLAMIMTRVRKKDHEAIYFLGQQYFFGEL 166
Query: 184 GSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G + KD+R V L AA LG I AL +LG+ + GYGV+Q++A+G F +A
Sbjct: 167 GLQ--KDMRRAVELWTEAAELGSIQALFDLGNAYRQGYGVQQDMAKGVEFYTKA 218
>gi|329849411|ref|ZP_08264257.1| sel1 repeat family protein [Asticcacaulis biprosthecum C19]
gi|328841322|gb|EGF90892.1| sel1 repeat family protein [Asticcacaulis biprosthecum C19]
Length = 555
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 70/147 (47%), Gaps = 14/147 (9%)
Query: 94 NGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRG 153
GL + + LS+ + + ++ N+ E++ ++++ A+ G+ + Y G+I + RG
Sbjct: 125 QGLVYSQIRLSEMYSRGYGVEKND--ETSAKWMRKAAEQGDAASQYNFGII---LSKGRG 179
Query: 154 ------SGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHI 207
+ AA A Y+L V NG+G K+DK A VA +AA GH
Sbjct: 180 VAEDDVEAVKWFSLAAEQGDIDAQYALGVAFINGAGVEKSDK---AAVAWFRKAAEQGHA 236
Query: 208 DALRELGHCLQDGYGVRQNIAEGRRFL 234
A R+ L G G+R+N E +++
Sbjct: 237 LAQRQFARMLGQGRGIRKNDGEAFKWM 263
>gi|225561064|gb|EEH09345.1| enzyme activator [Ajellomyces capsulatus G186AR]
Length = 889
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 82/191 (42%), Gaps = 22/191 (11%)
Query: 65 LVVSILCKLSSTARC-PSDFVNVLITCKRMNGLALNSLVL----SKASKKTFAIKANNWS 119
L + L ++ TA+ PSD L K+ L S+VL +A +T +
Sbjct: 513 LTLEELNRIQQTAKNNPSDHKAQLFLAKK---LVEASVVLVDDNGRADLRTKNKNRERYI 569
Query: 120 ESAHRFLKLCADAGNVEA------CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
AH+ +K AG A CY G + Q SL AA + H ++ Y
Sbjct: 570 FDAHKIVKKLVSAGYPPAMFYLADCYGEGQLGLEVDQR--EAFSLYQSAAKAGHPESAYR 627
Query: 174 LAV---IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNIAE 229
LAV + + G GG+K +D V RAA LG A+ ++G L G G ++N E
Sbjct: 628 LAVCCEMGYEGGGGTK--RDPMKAVQWYRRAAALGDTPAMYKMGMILLKGLLGQQKNPRE 685
Query: 230 GRRFLVQANAR 240
G +L +A R
Sbjct: 686 GVSWLKRAAER 696
>gi|325096551|gb|EGC49861.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 889
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 82/191 (42%), Gaps = 22/191 (11%)
Query: 65 LVVSILCKLSSTARC-PSDFVNVLITCKRMNGLALNSLVL----SKASKKTFAIKANNWS 119
L + L ++ TA+ PSD L K+ L S+VL +A +T +
Sbjct: 513 LTLEELNRIQQTAKNNPSDHKAQLFLAKK---LVEASVVLVDDNGRADLRTKNKNRERYI 569
Query: 120 ESAHRFLKLCADAGNVEA------CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
AH+ +K AG A CY G + Q SL AA + H ++ Y
Sbjct: 570 FDAHKIVKKLVSAGYPPAMFYLADCYGEGQLGLEVDQR--EAFSLYQSAAKAGHPESAYR 627
Query: 174 LAV---IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNIAE 229
LAV + + G GG+K +D V RAA LG A+ ++G L G G ++N E
Sbjct: 628 LAVCCEMGYEGGGGTK--RDPMKAVQWYRRAAALGDTPAMYKMGMILLKGLLGQQKNPRE 685
Query: 230 GRRFLVQANAR 240
G +L +A R
Sbjct: 686 GVSWLKRAAER 696
>gi|255068262|ref|ZP_05320117.1| TPR repeat protein [Neisseria sicca ATCC 29256]
gi|255047454|gb|EET42918.1| TPR repeat protein [Neisseria sicca ATCC 29256]
Length = 384
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 59/123 (47%), Gaps = 6/123 (4%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++L+ A+ G+V A LG++ QN A +KAA+ A A Y L V+
Sbjct: 140 EATKWLQKAAEQGSVNAQKNLGLMYEQGQGVRQNYEEAARWYSKAAVQGDANAQYHLGVM 199
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
NG G +N ++ +AA G +DA LG +G GVRQ+ AE R+ +A
Sbjct: 200 YANGRGVRQNYEE---AAQWYRKAAEQGDVDAQNNLGALYDEGQGVRQDSAEAVRWYRKA 256
Query: 238 NAR 240
R
Sbjct: 257 AER 259
>gi|240280380|gb|EER43884.1| enzyme activator [Ajellomyces capsulatus H143]
Length = 876
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 82/191 (42%), Gaps = 22/191 (11%)
Query: 65 LVVSILCKLSSTARC-PSDFVNVLITCKRMNGLALNSLVL----SKASKKTFAIKANNWS 119
L + L ++ TA+ PSD L K+ L S+VL +A +T +
Sbjct: 500 LTLEELNRIQQTAKNNPSDHKAQLFLAKK---LVEASVVLVDDNGRADLRTKNKNRERYI 556
Query: 120 ESAHRFLKLCADAGNVEA------CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
AH+ +K AG A CY G + Q SL AA + H ++ Y
Sbjct: 557 FDAHKIVKKLVSAGYPPAMFYLADCYGEGQLGLEVDQR--EAFSLYQSAAKAGHPESAYR 614
Query: 174 LAV---IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNIAE 229
LAV + + G GG+K +D V RAA LG A+ ++G L G G ++N E
Sbjct: 615 LAVCCEMGYEGGGGTK--RDPMKAVQWYRRAAALGDTPAMYKMGMILLKGLLGQQKNPRE 672
Query: 230 GRRFLVQANAR 240
G +L +A R
Sbjct: 673 GVSWLKRAAER 683
>gi|388578936|gb|EIM19267.1| hypothetical protein WALSEDRAFT_34141 [Wallemia sebi CBS 633.66]
Length = 676
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 31/49 (63%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G+R C++V CG+ E+ EF +C +C YCS+ CQ+ W++ H+ C
Sbjct: 453 GIRQCANVACGKWESSPKEFAKCRICRKAKYCSKECQSRAWRVGHRFWC 501
>gi|428165092|gb|EKX34096.1| hypothetical protein GUITHDRAFT_98165 [Guillardia theta CCMP2712]
Length = 347
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 60/130 (46%), Gaps = 7/130 (5%)
Query: 126 LKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
LK A+ G+V+A + GM C N KAA H A Y+L V + G
Sbjct: 71 LKFDAEKGDVKAMFETGMSLLAGRGCEANESEAVKWYTKAAEKGHLDAQYNLGVCMYYGR 130
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ-ANARE 241
G D+++ VA +AA GH+ A LG C + G GV++++ + ++ ANA +
Sbjct: 131 G---VDRNIEQAVAWYLKAADQGHVAAEYALGVCYEKGRGVQKDLVKSIKYYTNAANAGD 187
Query: 242 LAAVLSSAAC 251
+A C
Sbjct: 188 ASAQYKLGEC 197
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 57/125 (45%), Gaps = 7/125 (5%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYC--LQNRGS-GASLMAKAAISSHAQALY 172
N+ E+A F K C + G +A Y LG+ +Y ++N S A L KAA H QA Y
Sbjct: 207 ENFQEAASLFSKAC-EQGLAKAQYMLGLCHYYGQGVENSESKAADLFLKAAEQGHPQAEY 265
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+A ++G G KN L V +AA H A LG C G GV ++ +
Sbjct: 266 QIAACYYSGRGVEKN---LEKAVEWFEKAAKQSHPVAQYSLGQCYYYGRGVPKSEEKALE 322
Query: 233 FLVQA 237
+ A
Sbjct: 323 YYTMA 327
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 55/118 (46%), Gaps = 7/118 (5%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFY----CLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
++ A+AG+ A Y LG +Y +N ASL +KA A+A Y L + +
Sbjct: 177 KYYTNAANAGDASAQYKLGECFYYGRSGLKENFQEAASLFSKACEQGLAKAQYMLGLCHY 236
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G +N + A + L +AA GH A ++ C G GV +N+ + + +A
Sbjct: 237 YGQ-GVENSESKAADLFL--KAAEQGHPQAEYQIAACYYSGRGVEKNLEKAVEWFEKA 291
>gi|18416380|ref|NP_567705.1| ubiquitin carboxyl-terminal hydrolase 16 [Arabidopsis thaliana]
gi|75265551|sp|Q9SB51.1|UBP16_ARATH RecName: Full=Ubiquitin carboxyl-terminal hydrolase 16; AltName:
Full=Deubiquitinating enzyme 16; Short=AtUBP16; AltName:
Full=Ubiquitin thioesterase 16; AltName:
Full=Ubiquitin-specific-processing protease 16
gi|4220534|emb|CAA23007.1| putative protein [Arabidopsis thaliana]
gi|7269306|emb|CAB79366.1| putative protein [Arabidopsis thaliana]
gi|332659526|gb|AEE84926.1| ubiquitin carboxyl-terminal hydrolase 16 [Arabidopsis thaliana]
Length = 1008
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 28/52 (53%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEG 398
RCS C AV YCS CQ + W+ HK +C PA D E D D+ L + G
Sbjct: 84 RCSRCKAVRYCSGKCQIIHWRQGHKDECHPASIVYDSEDESDSDLRLGEENG 135
>gi|11993477|gb|AAG42757.1|AF302666_1 ubiquitin-specific protease 16 [Arabidopsis thaliana]
Length = 1008
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 28/52 (53%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEG 398
RCS C AV YCS CQ + W+ HK +C PA D E D D+ L + G
Sbjct: 84 RCSRCKAVRYCSGKCQIIHWRQGHKDECHPASIVYDSEDESDSDLRLGEENG 135
>gi|395330558|gb|EJF62941.1| hypothetical protein DICSQDRAFT_83701 [Dichomitus squalens LYAD-421
SS1]
Length = 398
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/48 (52%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
LR C+ GCG E R EF+RCS C V YCS +CQ DW HK C
Sbjct: 307 LRTCARSGCGAREVRAAEFKRCSGCKEVIYCSPSCQKEDWPA-HKKKC 353
>gi|299534490|ref|ZP_07047823.1| hypothetical protein BFZC1_00610 [Lysinibacillus fusiformis ZC1]
gi|424739801|ref|ZP_18168217.1| hypothetical protein C518_4040 [Lysinibacillus fusiformis ZB2]
gi|298730118|gb|EFI70660.1| hypothetical protein BFZC1_00610 [Lysinibacillus fusiformis ZC1]
gi|422946536|gb|EKU40944.1| hypothetical protein C518_4040 [Lysinibacillus fusiformis ZB2]
Length = 428
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 58/129 (44%), Gaps = 10/129 (7%)
Query: 127 KLCADAGNVEACYTLGMI----RFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
K A+AG VEA Y LG I R +++ G +AA H A Y L I NG
Sbjct: 299 KKSAEAGYVEAQYRLGGIYLEGRLGQVKDVNRGLFWYERAAEQYHVDAFYDLGYIWSNGL 358
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL---VQANA 239
G +N L G+ +AA G ++A +LGH G+ +N+ E ++ A +
Sbjct: 359 AGIRN---LEKGIHWFKQAALQGDVEAKLQLGHIFNKDTGLDRNLKEAIKWFGLAADAGS 415
Query: 240 RELAAVLSS 248
E A +L
Sbjct: 416 EEAAKILQE 424
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 54/124 (43%), Gaps = 15/124 (12%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSH--------AQALYS 173
A ++ A EA +TLG+I +G G + K A +++ +A Y
Sbjct: 258 AKKWFDYAAQKNVAEAMFTLGIIY-----EQGLGVEVDVKEAFNAYKKSAEAGYVEAQYR 312
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L I G G KD+ G+ RAA H+DA +LG+ +G +N+ +G +
Sbjct: 313 LGGIYLEGRLGQV--KDVNRGLFWYERAAEQYHVDAFYDLGYIWSNGLAGIRNLEKGIHW 370
Query: 234 LVQA 237
QA
Sbjct: 371 FKQA 374
>gi|170114662|ref|XP_001888527.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164636440|gb|EDR00735.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 354
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 315 WFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
W + G L+ CS GCG+ ET+ EF+ CS C YC+ CQ DWK +HK C
Sbjct: 250 WANSDPGAKESVLKTCSREGCGKRETKATEFKLCSGCRLAFYCNSTCQKEDWK-KHKPAC 308
>gi|237748202|ref|ZP_04578682.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
gi|229379564|gb|EEO29655.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
Length = 398
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 58/119 (48%), Gaps = 16/119 (13%)
Query: 129 CADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLAVIQFN 180
A G V+A LGM+ + +G+G + +AA+S +ALY L +
Sbjct: 125 AAKRGEVKAQKRLGMMYY-----KGTGVARNVHEARFWFNQAALSDDPEALYYLGIAYLK 179
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANA 239
G GG +KD L RAA GH++A+ +L +G GVRQ+ E ++L++ A
Sbjct: 180 GIGG---EKDFHQAHDLFERAADEGHVNAMWKLYEMFNEGTGVRQDRQEAFKWLMKLAA 235
>gi|414168802|ref|ZP_11424765.1| hypothetical protein HMPREF9696_02620 [Afipia clevelandensis ATCC
49720]
gi|410887538|gb|EKS35348.1| hypothetical protein HMPREF9696_02620 [Afipia clevelandensis ATCC
49720]
Length = 345
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 6/116 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K AD G+ EA + LGM+R NR GA L+A +A A A Y+L ++
Sbjct: 122 AAEWYKQAADRGDREAMFALGMMRIAGRAGPANREEGARLLASSAKLGKAAAAYNLGLLY 181
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
G +D++ L +AA G+ +A L ++G GV +N+ E + +
Sbjct: 182 LE---GQTFPQDVKRAAELFRQAATAGNPEAQYALATLYKEGRGVEKNLTEAAKLM 234
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 51/121 (42%), Gaps = 6/121 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKAAISSHAQALYSLAV 176
E R L A G A Y LG++ Q+ A L +AA + + +A Y+LA
Sbjct: 156 EEGARLLASSAKLGKAAAAYNLGLLYLEGQTFPQDVKRAAELFRQAATAGNPEAQYALAT 215
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ G G KN L L AA + ++DA E L +G G +++ L +
Sbjct: 216 LYKEGRGVEKN---LTEAAKLMRLAAMVDNLDAEVEYAIALYNGTGTPKDVPTAVALLTR 272
Query: 237 A 237
A
Sbjct: 273 A 273
>gi|338972073|ref|ZP_08627452.1| TPR repeat-containing protein [Bradyrhizobiaceae bacterium SG-6C]
gi|338234967|gb|EGP10078.1| TPR repeat-containing protein [Bradyrhizobiaceae bacterium SG-6C]
Length = 365
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 6/116 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K AD G+ EA + LGM+R NR GA L+A +A A A Y+L ++
Sbjct: 142 AAEWYKQAADRGDREAMFALGMMRIAGRAGPANREEGARLLASSAKLGKAAAAYNLGLLY 201
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
G +D++ L +AA G+ +A L ++G GV +N+ E + +
Sbjct: 202 LE---GQTFPQDVKRAAELFRQAATAGNPEAQYALATLYKEGRGVEKNLTEAAKLM 254
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 51/121 (42%), Gaps = 6/121 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKAAISSHAQALYSLAV 176
E R L A G A Y LG++ Q+ A L +AA + + +A Y+LA
Sbjct: 176 EEGARLLASSAKLGKAAAAYNLGLLYLEGQTFPQDVKRAAELFRQAATAGNPEAQYALAT 235
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ G G KN L L AA + ++DA E L +G G +++ L +
Sbjct: 236 LYKEGRGVEKN---LTEAAKLMRLAAMVDNLDAEVEYAIALYNGTGTPKDVPTAVALLTR 292
Query: 237 A 237
A
Sbjct: 293 A 293
>gi|391230754|ref|ZP_10266960.1| TPR repeat-containing protein [Opitutaceae bacterium TAV1]
gi|391220415|gb|EIP98835.1| TPR repeat-containing protein [Opitutaceae bacterium TAV1]
Length = 376
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 59/123 (47%), Gaps = 6/123 (4%)
Query: 129 CADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
A+ GNV+A LG++ Q+ S + KAA HA + Y L + G G
Sbjct: 107 AAEQGNVDAQCMLGVLLMTGRGIRQDPAEAVSWLRKAAEQGHANSQYFLGLALDEGVG-- 164
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAV 245
+KD V +AA H DA LG+ LQ G GV ++ AEG ++ +A AR A+
Sbjct: 165 -VEKDETGAVEWWRKAAEQDHPDAQERLGYALQKGIGVTKDEAEGLKWYYKAAARGHASA 223
Query: 246 LSS 248
+S
Sbjct: 224 QNS 226
>gi|114321729|ref|YP_743412.1| Sel1 domain-containing protein [Alkalilimnicola ehrlichii MLHE-1]
gi|114228123|gb|ABI57922.1| Sel1 domain protein repeat-containing protein [Alkalilimnicola
ehrlichii MLHE-1]
Length = 359
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 57/120 (47%), Gaps = 6/120 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLAVI 177
+A R+ + A+ + +A + LG I Y L + A+L +AA H Q L++LA +
Sbjct: 60 AARRWWEQAAEKDHADAWFNLGQITEYGLDGTPDPAQAAALYRRAADQGHPQGLHALAAL 119
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
F G G +D VAL RAA G DA LG Q G GV ++ + R +A
Sbjct: 120 LFQGQGVP---EDPAQAVALWRRAAEAGLPDAENSLGVAHQMGRGVEEDFSAAVRHYRRA 176
>gi|392559543|gb|EIW52727.1| hypothetical protein TRAVEDRAFT_53166 [Trametes versicolor
FP-101664 SS1]
Length = 322
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 1/49 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
LRLC GCG+PE+ + ++C+ C V YCS+ CQ DW HK C
Sbjct: 6 ALRLCYGPGCGKPESEELKLKKCARCAVVVYCSKECQKKDWA-AHKPVC 53
>gi|299541678|ref|ZP_07052008.1| suppressor of lin-12-like protein [Lysinibacillus fusiformis ZC1]
gi|298725773|gb|EFI66407.1| suppressor of lin-12-like protein [Lysinibacillus fusiformis ZC1]
Length = 638
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 80/207 (38%), Gaps = 27/207 (13%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
SA ++L L AD G++ A + LG + Y L++ S AA +A A Y L ++ +
Sbjct: 412 SAEKWLLLAADNGHISAGFELGRLYAYQLEDPVKAMSYFRTAAEKGYADAQYELGLLLTS 471
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
G+G N K+ V A HI A +LG + G GV N+ E RR
Sbjct: 472 GTGVPINYKE---AVKWWRAATDQSHIQAEYQLGLLYEQGLGVALNLEEARR------CY 522
Query: 241 ELAAVLSSAACQ-----------GISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIG 289
LAA A Q G+ + Q H + Q H+ G G
Sbjct: 523 RLAATQGHAGAQYQLGNLFDKGKGVEQDYTEAAKWIEQAASQGHMKAQYQLAQMHIHGQG 582
Query: 290 CPLLSDFG-----CNVPAPEAHPASRF 311
P DF + A + H ++F
Sbjct: 583 VP--KDFAKAAQLYRLSANQGHQKAQF 607
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 54/123 (43%), Gaps = 16/123 (13%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG--------ASLMAKAAISSHAQAL 171
E A R +L A G+ A Y LG L ++G G A + +AA H +A
Sbjct: 516 EEARRCYRLAATQGHAGAQYQLG-----NLFDKGKGVEQDYTEAAKWIEQAASQGHMKAQ 570
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y LA + +G G K D L +A GH A +LG + G+GV Q+ E
Sbjct: 571 YQLAQMHIHGQGVPK---DFAKAAQLYRLSANQGHQKAQFQLGMLYKKGHGVAQDYQEAT 627
Query: 232 RFL 234
R+L
Sbjct: 628 RWL 630
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 10/116 (8%)
Query: 125 FLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+ K+ A G+ A Y LG + QN+ A AA+ H A + LA++ G
Sbjct: 312 YFKMAALTGDASAQYNLGYMHLKGCGIPQNQEEAAKWFHMAALQDHVNAEFQLAMLYNTG 371
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G +K+ + L A GH++A +CL Y QN+ ++L+ A
Sbjct: 372 QGMTKDHIEALKWFKLAAHK---GHLNA----QYCLGLLYEKEQNLVSAEKWLLLA 420
>gi|257063145|ref|YP_003142817.1| hypothetical protein Shel_04070 [Slackia heliotrinireducens DSM
20476]
gi|256790798|gb|ACV21468.1| TPR repeat-containing protein [Slackia heliotrinireducens DSM
20476]
Length = 1032
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 57/113 (50%), Gaps = 6/113 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
+ A R+ + A+AG ++A Y LG++ +NR S KAA + +A A++ LA
Sbjct: 326 KEAARWYRKAAEAGRMDAMYELGLLYERGNGVTENRREAVSWYRKAADAGNADAMFRLAS 385
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
I+ +G+G KDL L RAA GH A+ G G GV+++ E
Sbjct: 386 IRLHGNGAK---KDLAEAFDLFKRAAEAGHPQAMFNTGVMYAHGDGVKKDATE 435
Score = 44.7 bits (104), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 77/181 (42%), Gaps = 29/181 (16%)
Query: 78 RCPSDFVNVLITCK--RMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNV 135
R +D NV+ C RM L K + K ++A W A F G
Sbjct: 796 RKAADAGNVMAMCNMGRM-------LSTGKEASKNL-MEAAQWYRKAAEF-------GET 840
Query: 136 EACYTLGMIRFYCLQNRGSG------ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDK 189
E+ Y LG + +G+G A +AA H A+Y L V+ NG G ++N
Sbjct: 841 ESMYNLGRM---LANGQGTGKNPLEAAQWFRRAAEDGHELAMYHLGVMYANGEGVARNPH 897
Query: 190 DLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSA 249
+ + +AA LG+ +A+ LG L G GV +N + R+ +A + A +++
Sbjct: 898 E---ALTWYRKAADLGNANAMYNLGVMLAGGIGVERNPQQAARWYRKAIGKGHVAAMNNL 954
Query: 250 A 250
A
Sbjct: 955 A 955
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 17/123 (13%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG--------ASLMAKAAISSHAQA 170
+E+A + K ADAG A LG+ + RG G ASL KA+ +A
Sbjct: 434 TEAASWYRK-AADAGVTGAMCNLGI-----MHERGDGVAKDPQEAASLYRKASDLDNALG 487
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L ++ NGSG +KN ++ RAA LG+ +A+ ++G + G GVR+N
Sbjct: 488 AYNLGIMLLNGSGVAKNPQEA---ALHLRRAAALGNTEAMIKMGEAYESGEGVRKNKKSA 544
Query: 231 RRF 233
+F
Sbjct: 545 VKF 547
>gi|424737685|ref|ZP_18166135.1| suppressor of lin-12-like protein [Lysinibacillus fusiformis ZB2]
gi|422948324|gb|EKU42707.1| suppressor of lin-12-like protein [Lysinibacillus fusiformis ZB2]
Length = 638
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 80/207 (38%), Gaps = 27/207 (13%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
SA ++L L AD G++ A + LG + Y L++ S AA +A A Y L ++ +
Sbjct: 412 SAEKWLLLAADNGHISAGFELGRLYAYQLEDPVKAMSYFRTAAEKGYADAQYELGLLLTS 471
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
G+G N K+ V A HI A +LG + G GV N+ E RR
Sbjct: 472 GTGVPINYKE---AVKWWRAATDQSHIQAEYQLGLLYEQGLGVALNLEEARR------CY 522
Query: 241 ELAAVLSSAACQ-----------GISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIG 289
LAA A Q G+ + Q H + Q H+ G G
Sbjct: 523 RLAATQGHAGAQYQLGNLFDKGKGVEQDYTEAAKWIEQAASQGHMKAQYQLAQMHIHGQG 582
Query: 290 CPLLSDFG-----CNVPAPEAHPASRF 311
P DF + A + H ++F
Sbjct: 583 VP--KDFAKAAQLYRLSANQGHQKAQF 607
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 54/123 (43%), Gaps = 16/123 (13%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG--------ASLMAKAAISSHAQAL 171
E A R +L A G+ A Y LG L ++G G A + +AA H +A
Sbjct: 516 EEARRCYRLAATQGHAGAQYQLG-----NLFDKGKGVEQDYTEAAKWIEQAASQGHMKAQ 570
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y LA + +G G K D L +A GH A +LG + G+GV Q+ E
Sbjct: 571 YQLAQMHIHGQGVPK---DFAKAAQLYRLSANQGHQKAQFQLGMLYKKGHGVAQDYQEAT 627
Query: 232 RFL 234
R+L
Sbjct: 628 RWL 630
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 50/116 (43%), Gaps = 10/116 (8%)
Query: 125 FLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+ K+ A G+ A Y LG + QN+ A AA+ H A + LA++ G
Sbjct: 312 YFKMAALTGDASAQYNLGYMHLKGRGIPQNQEEAAKWFHMAALQDHVNAEFQLAMLYNTG 371
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G +K+ + L A GH++A +CL Y QN+ ++L+ A
Sbjct: 372 QGMTKDHIEALKWFKLAAHK---GHLNA----QYCLGLLYEKEQNLVSAEKWLLLA 420
>gi|237748454|ref|ZP_04578934.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
gi|229379816|gb|EEO29907.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
Length = 544
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 62/125 (49%), Gaps = 10/125 (8%)
Query: 109 KTFAIKA---NNWSESAHRFLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKA 162
KTF I A N ++ + F+K A+ G EA +GMI F Y +N G + KA
Sbjct: 378 KTFYIGAGINKNTDKAVYWFIK-AANQGFTEAQAYIGMIYFKGKYVAKNEKKGFYWLKKA 436
Query: 163 AISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYG 222
A A+A L + G+ N ++ GVAL +AA G+ +A LG C ++G
Sbjct: 437 AEKDSAKAQAFLGALYIAGNEVKPN---IKEGVALTKKAALQGNYEAQTLLGFCYENGLE 493
Query: 223 VRQNI 227
V++++
Sbjct: 494 VKKDL 498
Score = 38.1 bits (87), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 49/112 (43%), Gaps = 6/112 (5%)
Query: 129 CADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
A +GN EA Y GM+ + Q+ KAA +HA+AL L + G +
Sbjct: 76 AAKSGNAEAQYLFGMLVYDGRGVQQDNCVAMLWWMKAAEQNHAKALVMLGNLHRKGQCIA 135
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+N +A RAA ++ A LG DG GV +N E R+ +A
Sbjct: 136 EN---YPKAIAYWKRAAVQNNVWAYHNLGTAYYDGIGVDKNPHEAVRWWKKA 184
>gi|323135730|ref|ZP_08070813.1| Sel1 domain protein repeat-containing protein [Methylocystis sp.
ATCC 49242]
gi|322398821|gb|EFY01340.1| Sel1 domain protein repeat-containing protein [Methylocystis sp.
ATCC 49242]
Length = 304
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 57/117 (48%), Gaps = 6/117 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
++A + + A+ GN +A Y LG++ ++ + A +AA A+ A+
Sbjct: 153 DTAANYFRRAAEGGNADADYALGVLYKTGKGVAKDDKAAAEWFRRAADLGLTAAMVEYAI 212
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+QFNG G ++D + V L +AA G+ A L H L +G GV N+ E R +
Sbjct: 213 MQFNGVG---VERDRLSAVDLLRKAAVKGNAVAQNRLAHLLAEGLGVETNLREAREW 266
>gi|397611217|gb|EJK61229.1| hypothetical protein THAOC_18323 [Thalassiosira oceanica]
Length = 467
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 61/153 (39%), Gaps = 25/153 (16%)
Query: 263 NPLSQPHPAPHP--RNQNPNHNGHVTGIGCPLLSDF-----------GCNVPAPEAHPAS 309
N S P P+P+ N NG GI C L SD +P + PA
Sbjct: 311 NYESIPEPSPYLACHYLNIAANGDEDGIACHLYSDELLKLIERLHDGYIEIPGFDVIPAV 370
Query: 310 RFLAEWFAARGGTPG-PGLRLC-----SHVG-CGRPETRRHEFRRCSVCGAVNYCSRACQ 362
F G + G L+ C S+ G C + +F++CS C A YCS+ CQ
Sbjct: 371 LFWKRKSRDIGYSAGNEHLKQCEFAAKSYCGNCKKEAQGDEKFKQCSKCKAQWYCSKECQ 430
Query: 363 ALDWKLRHKADCAPA-----ERWLDEEGNGDVD 390
WK HK DC A E +LD E +D
Sbjct: 431 VEAWKAVHKKDCKSAGILNFEDYLDAECGMSID 463
>gi|357485797|ref|XP_003613186.1| F-box protein [Medicago truncatula]
gi|355514521|gb|AES96144.1| F-box protein [Medicago truncatula]
Length = 254
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 60/124 (48%), Gaps = 5/124 (4%)
Query: 57 LFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKAN 116
+ A P+DL+V I+ K++S + D + ++CK + + V AS FA+
Sbjct: 24 IVKAFPNDLLVDIVGKVASGSMV--DLYKIKLSCKEFLSASEDRFVYQHASLDKFALIPL 81
Query: 117 NW--SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGS-GASLMAKAAISSHAQALYS 173
W E FL+ C ++GN+E Y GM +++ + G + KAA+ H A Y
Sbjct: 82 PWFTEEKETSFLRRCRESGNLEILYREGMEQYFSTSMLNNLGFENLKKAALEGHHDAKYV 141
Query: 174 LAVI 177
+++
Sbjct: 142 YSML 145
>gi|154277542|ref|XP_001539612.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150413197|gb|EDN08580.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 806
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 81/193 (41%), Gaps = 22/193 (11%)
Query: 63 DDLVVSILCKLSSTARC-PSDFVNVLITCKRMNGLALNSLVL----SKASKKTFAIKANN 117
+ L L ++ TA+ PSD L K+ L S+VL +A +T
Sbjct: 428 EPLTFEELNRIQQTAKNNPSDHKAQLFLAKK---LVEASVVLVDDNGRADLRTKNKNRER 484
Query: 118 WSESAHRFLKLCADAGNVEA------CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQAL 171
+ AH+ K AG A CY G + Q SL AA + H ++
Sbjct: 485 YIFDAHKIAKKLVSAGYPPAMFYLADCYGEGQLGLEVDQREA--FSLYQSAAKAGHPESA 542
Query: 172 YSLAV---IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNI 227
Y LAV + + G GG+K +D V RAA LG A+ ++G L G G ++N
Sbjct: 543 YRLAVCCEMGYEGGGGTK--RDPMKAVQWYRRAAALGDTPAMYKMGMILLKGLLGQQKNP 600
Query: 228 AEGRRFLVQANAR 240
EG +L +A R
Sbjct: 601 REGVSWLKRAAER 613
>gi|169825777|ref|YP_001695935.1| hypothetical protein Bsph_0171 [Lysinibacillus sphaericus C3-41]
gi|168990265|gb|ACA37805.1| Hypothetical ybeQ protein [Lysinibacillus sphaericus C3-41]
Length = 434
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 54/119 (45%), Gaps = 7/119 (5%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMI----RFYCLQNRGSGASLMAKAAISSHAQALYSL 174
+ +A K A+AG VEA Y LG I R ++ G +AA H A Y L
Sbjct: 297 TNAAFNMYKKSAEAGYVEAQYRLGGIYLEGRLDQAKDINRGLFWYERAAEQFHIDAFYDL 356
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
I G G +N + G+ +AA G DA +LGH +G G+ +N+ E ++
Sbjct: 357 GFIWSKGLTGIRN---IEKGIHWFKQAALQGDADAKLQLGHIYNNGEGIARNLKEALKW 412
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 53/119 (44%), Gaps = 5/119 (4%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A ++ A EA +TLG+I L ++ + ++ K+A + + +A Y L I
Sbjct: 264 ARKWFDFAASKNVAEAMFTLGIIYEQGLGVNKDTNAAFNMYKKSAEAGYVEAQYRLGGIY 323
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G KD+ G+ RAA HIDA +LG G +NI +G + QA
Sbjct: 324 LEGRLDQA--KDINRGLFWYERAAEQFHIDAFYDLGFIWSKGLTGIRNIEKGIHWFKQA 380
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAV 176
E + ++ +L A GN +A Y LG F + +N SL +AA+ HA A +LA
Sbjct: 190 EKSLKWFELAAAQGNADAQYVLGNFYFEGVGVEENYTQAFSLYERAALQGHADAANNLAD 249
Query: 177 IQFNGSG 183
+ FNG G
Sbjct: 250 MYFNGEG 256
>gi|282856612|ref|ZP_06265881.1| Sel1 repeat protein [Pyramidobacter piscolens W5455]
gi|282585522|gb|EFB90821.1| Sel1 repeat protein [Pyramidobacter piscolens W5455]
Length = 811
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 70/148 (47%), Gaps = 18/148 (12%)
Query: 101 LVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--- 157
++L + S+K +K + +A R+ K+ ADAG+ C LG L +GSG +
Sbjct: 285 VLLGRMSEKGKTVKKD--PAAAARYYKMAADAGDPAGCNELGR-----LIEKGSGIAQSY 337
Query: 158 -----LMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRE 212
L A A + +ALY++ ++ G G KN+ RAG+A +AA +G +A+
Sbjct: 338 TNAYRLYALGAEGGNGEALYNVGRMEIYGLGTEKNE---RAGLAKLKKAAEMGESEAMTA 394
Query: 213 LGHCLQDGYGVRQNIAEGRRFLVQANAR 240
L +G V N E R +A R
Sbjct: 395 LATFYMEGKIVAANRGEALRLYREAAER 422
>gi|296444561|ref|ZP_06886525.1| Sel1 domain protein repeat-containing protein [Methylosinus
trichosporium OB3b]
gi|296257829|gb|EFH04892.1| Sel1 domain protein repeat-containing protein [Methylosinus
trichosporium OB3b]
Length = 285
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 57/108 (52%), Gaps = 3/108 (2%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
Q+ G A + +A S+H+ A+ LA+++FNG+G +++ D V L RAA G+ A
Sbjct: 181 QDAGEAAQWLRRAVASAHSAAMVELAILEFNGAGVARDRAD---AVKLLRRAAEAGNAVA 237
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTR 257
L L +G GV +++ E R+ +A + L A QG++ +
Sbjct: 238 QNRLARLLAEGVGVEKDLIEAARWNNRARSAGLVDAKLDALLQGLAAK 285
>gi|393234842|gb|EJD42401.1| hypothetical protein AURDEDRAFT_168471 [Auricularia delicata
TFB-10046 SS5]
Length = 429
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 38/86 (44%), Gaps = 13/86 (15%)
Query: 310 RFLAEWFAARGGTPGP---------GLRLCSHVGCGRPE-TRRHEFRRCSVCGAVNYCSR 359
R+L W GGT G R+C+ GCGRPE CS C + YCSR
Sbjct: 308 RWLFTWLNQVGGTYGTIYDLLRSVSARRVCAATGCGRPEHALETRLLVCSQCHVLRYCSR 367
Query: 360 ACQALDW---KLRHKADCAPAERWLD 382
ACQ W K HK C ++ LD
Sbjct: 368 ACQKAHWKSGKTPHKEACPLIKQVLD 393
>gi|148232732|ref|NP_001080804.1| programmed cell death 2 [Xenopus laevis]
gi|28838500|gb|AAH47974.1| Pdcd2-prov protein [Xenopus laevis]
Length = 361
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 30/50 (60%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
GLRLC GC P+T CS C VNYCS+ Q +DWKL+HK C+
Sbjct: 132 GLRLCRVCGCLGPKT-------CSKCHKVNYCSKDHQLMDWKLQHKKVCS 174
>gi|384155749|ref|YP_005538564.1| hypothetical protein [Arcobacter butzleri ED-1]
gi|345469303|dbj|BAK70754.1| conserved hypothetical protei [Arcobacter butzleri ED-1]
Length = 180
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 71/143 (49%), Gaps = 11/143 (7%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMA------KAAISSHAQALYSL 174
+A +F + A NV+A ++LG I +++G A+L+ K A + Q L+++
Sbjct: 40 TAIKFYEKAASKNNVDAIWSLGFIYDNVQEHKGINANLLEAFKWYKKCADLNDEQCLFNV 99
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
+ G G + D + +AA G+IDA LG ++G GVR+NI E ++
Sbjct: 100 GNMYHEGHGVKQ---DYVEAIKWYLKAADKGYIDAYYNLGLMYEEGLGVRKNIPEAIKWY 156
Query: 235 VQANARELAAVLSSAACQGISTR 257
+ A +L S +A + +S++
Sbjct: 157 QK--AADLGDEESLSAVKELSSK 177
>gi|225575064|ref|ZP_03783674.1| hypothetical protein RUMHYD_03153 [Blautia hydrogenotrophica DSM
10507]
gi|225037734|gb|EEG47980.1| Sel1 repeat protein [Blautia hydrogenotrophica DSM 10507]
Length = 521
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 12/114 (10%)
Query: 120 ESAHRFLKLCADAGN------VEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
E A ++ + A GN + ACY+ G+ ++ KAA HA + Y+
Sbjct: 349 EEAAQWYEKAAKQGNAVGQNELGACYSSGLG---VEEDAAKAVEWFQKAANQGHAVSQYN 405
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
L ++G G ++D + V +AA G DA RELG+C DG GV Q+
Sbjct: 406 LGKHYYDGEG---VERDYQKAVQWYEKAANQGDADAQRELGNCYYDGKGVEQDY 456
>gi|316932480|ref|YP_004107462.1| Sel1 domain-containing protein repeat-containing protein
[Rhodopseudomonas palustris DX-1]
gi|315600194|gb|ADU42729.1| Sel1 domain protein repeat-containing protein [Rhodopseudomonas
palustris DX-1]
Length = 364
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 55/117 (47%), Gaps = 10/117 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A L++ ADAGN EA Y L FY +N L+ AA++ + A A+
Sbjct: 212 AAELLRMSADAGNPEAQYALAT--FYKEGTGVTKNIEQSVRLLQAAALAGNVPAQVEYAI 269
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+NG+G KN+ A VAL +AA + A L H L G G ++I E ++
Sbjct: 270 ALYNGTGTPKNEP---AAVALLRKAARANNPIAQNRLAHVLVTGQGAPRDINEAMKW 323
>gi|39934015|ref|NP_946291.1| hypothetical protein RPA0938 [Rhodopseudomonas palustris CGA009]
gi|39647862|emb|CAE26382.1| conserved unknown protein [Rhodopseudomonas palustris CGA009]
Length = 363
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 57/120 (47%), Gaps = 16/120 (13%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYS 173
A L++ ADAGN EA Y L FY G+G + L+ AA++ + A
Sbjct: 211 AAELLRMAADAGNPEAQYALAT--FY---KEGTGVTKSIEQSVRLLQAAALAGNVPAQVE 265
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
A+ +NG+G KN+ A VAL +AA + A L H L G G ++I E ++
Sbjct: 266 YAIALYNGTGTPKNEP---AAVALLRKAARANNPIAQNRLAHVLVSGQGAPRDINEAMKW 322
>gi|192289435|ref|YP_001990040.1| Sel1 domain-containing protein repeat-containing protein
[Rhodopseudomonas palustris TIE-1]
gi|192283184|gb|ACE99564.1| Sel1 domain protein repeat-containing protein [Rhodopseudomonas
palustris TIE-1]
Length = 363
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 57/120 (47%), Gaps = 16/120 (13%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYS 173
A L++ ADAGN EA Y L FY G+G + L+ AA++ + A
Sbjct: 211 AAELLRMAADAGNPEAQYALAT--FY---KEGTGVTKSIEQSVRLLQAAALAGNVPAQVE 265
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
A+ +NG+G KN+ A VAL +AA + A L H L G G ++I E ++
Sbjct: 266 YAIALYNGTGTPKNEP---AAVALLRKAARANNPIAQNRLAHVLVSGQGAPRDINEAMKW 322
>gi|301603821|ref|XP_002931535.1| PREDICTED: programmed cell death protein 2 [Xenopus (Silurana)
tropicalis]
Length = 369
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 29/49 (59%), Gaps = 7/49 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
GLRLC GC P+T CS C VNYCS+ Q +DWKL+HK C
Sbjct: 140 GLRLCRVCGCLGPKT-------CSKCHKVNYCSKEHQLMDWKLQHKKVC 181
>gi|336371133|gb|EGN99473.1| hypothetical protein SERLA73DRAFT_182428 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383880|gb|EGO25029.1| hypothetical protein SERLADRAFT_469083 [Serpula lacrymans var.
lacrymans S7.9]
Length = 473
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 41/82 (50%), Gaps = 8/82 (9%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
G+R C+++ CGR ET EF +C C YC + CQ+ W H+ C+ ++G
Sbjct: 232 GIRQCANMLCGRWETYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS------AKDG 285
Query: 386 NGDVDVDLNDAEGGGE--DNVV 405
+ D D+ A G E D VV
Sbjct: 286 DDDADLVSEQANGASEASDGVV 307
>gi|328864091|gb|EGG13190.1| hypothetical protein MELLADRAFT_114804 [Melampsora larici-populina
98AG31]
Length = 1165
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 29/46 (63%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK 371
G+R C+++ CG+ ET EF +C C YCS+ CQ+ W+L H+
Sbjct: 913 GIRQCANMQCGKWETYPREFAKCRRCRKAKYCSKPCQSKAWQLGHR 958
>gi|255547876|ref|XP_002514995.1| conserved hypothetical protein [Ricinus communis]
gi|223546046|gb|EEF47549.1| conserved hypothetical protein [Ricinus communis]
Length = 182
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 55/124 (44%), Gaps = 3/124 (2%)
Query: 55 SDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIK 114
S+ +P +L+ IL +++S + +DF N TCK A + + AS F +
Sbjct: 19 SNRLSCMPRELLTEILARVASESF--TDFFNAKTTCKEFLEAASDDYIFKHASIDCFPVI 76
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQ-NRGSGASLMAKAAISSHAQALYS 173
++ A FL C + N E+ + GMI ++ + SG + +A H A Y
Sbjct: 77 PWKITDDASTFLDKCKKSENPESLFRQGMIDYFSTTLEKNSGLDYLKRATNKGHRVATYV 136
Query: 174 LAVI 177
+I
Sbjct: 137 YGII 140
>gi|90426246|ref|YP_534616.1| Sel1-like protein [Rhodopseudomonas palustris BisB18]
gi|90108260|gb|ABD90297.1| Sel1-like [Rhodopseudomonas palustris BisB18]
Length = 368
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 55/119 (46%), Gaps = 10/119 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A L+L ADAGN EA Y L FY +N L+ A++S + A A+
Sbjct: 217 AAELLRLAADAGNAEAQYALAT--FYKEGTGVEKNLDQAVRLLQAASLSGNVDAEVEYAI 274
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
+NG+G KN+ A VAL +AA + A L L G G ++ EG ++ V
Sbjct: 275 ALYNGTGTPKNEV---AAVALLRKAARQNNPIAQNRLARVLLTGKGPPEDRLEGLKWHV 330
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 52/110 (47%), Gaps = 6/110 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKAAISSHAQALYSLAV 176
E A R+L A GN +A Y L ++ Q+ A L+ AA + +A+A Y+LA
Sbjct: 179 EDAARWLASSAKLGNPKAAYNLALLYLDGQTFPQDTRRAAELLRLAADAGNAEAQYALAT 238
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
G+G KN L V L A+ G++DA E L +G G +N
Sbjct: 239 FYKEGTGVEKN---LDQAVRLLQAASLSGNVDAEVEYAIALYNGTGTPKN 285
>gi|440790750|gb|ELR12022.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 354
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 45/107 (42%), Gaps = 17/107 (15%)
Query: 288 IGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGT-------PGPGLRLCSHVGCGRPET 340
C +LS+ VP E PA+ L A P LR CS+ C ET
Sbjct: 182 FACKVLSEV-LGVPFDELAPATSLLQPLKAVESENVRGMLFKETPNLRTCSNPKCDNRET 240
Query: 341 RRHEFRRCSVCG----AVNYCSRACQALDWKLRHKADC----APAER 379
+ F CS+C + YCSR CQA DW HK C APA R
Sbjct: 241 KEKTFLVCSLCRKAGLSTPYCSRGCQAGDWP-EHKKTCGVATAPAGR 286
>gi|358348963|ref|XP_003638510.1| F-box protein [Medicago truncatula]
gi|355504445|gb|AES85648.1| F-box protein [Medicago truncatula]
Length = 250
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 60/123 (48%), Gaps = 4/123 (3%)
Query: 57 LFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKAN 116
L +ALPD+L+ +I+ K+++ + +D ++CK + V A+ FA+
Sbjct: 18 LVEALPDELLANIVGKVATFSM--ADLFKTTLSCKDFCKALEDPHVYQHAALDKFALIPL 75
Query: 117 NW--SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
W SE FL C + GN+E Y GM++++ G + AA+ H +A Y
Sbjct: 76 AWFTSEKETMFLSRCREMGNLEIIYREGMVQYFSTLMVNLGFDNLKNAALGGHHEAKYVY 135
Query: 175 AVI 177
+++
Sbjct: 136 SML 138
>gi|308047791|ref|YP_003911357.1| Sel1 domain-containing protein repeat-containing protein
[Ferrimonas balearica DSM 9799]
gi|307629981|gb|ADN74283.1| Sel1 domain protein repeat-containing protein [Ferrimonas balearica
DSM 9799]
Length = 484
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 55/111 (49%), Gaps = 6/111 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMA---KAAISSHAQALYSLAVI 177
A +LKL A+ EA LG++ + L + A A +AA++ + A Y LAV+
Sbjct: 274 KAEHYLKLAAEQDQPEAMANLGVLYYQGLLGQPDLAQARACSEQAALAGYPHAQYHLAVM 333
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
F G GG D D AG++ +AA + DAL +L DG GV N A
Sbjct: 334 LFAGEGG---DADPEAGLSWLRQAAAQHYPDALADLAQRTLDGDGVDTNPA 381
>gi|387125689|ref|YP_006291571.1| hypothetical protein ABTJ_03728 [Acinetobacter baumannii MDR-TJ]
gi|385880181|gb|AFI97276.1| TPR repeat-containing protein [Acinetobacter baumannii MDR-TJ]
Length = 230
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 80/180 (44%), Gaps = 19/180 (10%)
Query: 66 VVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRF 125
+ +++ L S+A +V ++ +A L + + + +K N+ E A ++
Sbjct: 10 IFTMIFSLHSSAFAEPSVADVQYRAQQGQPIAQYHLGIMLLTGEQGVVK--NY-EQAFKW 66
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLAVI 177
L G+V A Y+LGM+ F G+G AKAA HA+A Y+L V+
Sbjct: 67 LTAADQNGSVGAKYSLGMMYF-----TGTGVEKDMKRAFEYFAKAADKGHAKAQYNLGVL 121
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G ++N +RAA G+ A L H + G+GV Q+ + ++ +A
Sbjct: 122 YVRGEGTAQN---YEQAFEWYSRAAEQGYPPAEYNLAHLYKKGHGVAQSDEQALKWYTKA 178
>gi|402487246|ref|ZP_10834068.1| peptidase C14 caspase catalytic subunit p20 [Rhizobium sp. CCGE
510]
gi|401813820|gb|EJT06160.1| peptidase C14 caspase catalytic subunit p20 [Rhizobium sp. CCGE
510]
Length = 859
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 6/122 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKAAISSHAQAL 171
AN +++A +K DAG+V A L + + N + + A A AL
Sbjct: 555 ANRDTKTAFATIKKAMDAGHVRAISELSALYLVGAFVPANLDKSSEIAAIGAKKGDPYAL 614
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+ + G G N ++ G+ L +AA LGH A+ ELG+ +G V +I G
Sbjct: 615 YAYGKSLYYGRGTKANTQE---GLKLMLQAADLGHTYAMNELGYIFFNGVSVPADIERGI 671
Query: 232 RF 233
RF
Sbjct: 672 RF 673
>gi|126640192|ref|YP_001083176.1| signal peptide [Acinetobacter baumannii ATCC 17978]
Length = 183
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 58/126 (46%), Gaps = 16/126 (12%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQAL 171
E A ++L G+V A Y+LGM+ F G+G AKAA HA+A
Sbjct: 14 EQAFKWLTAADQNGSVGAKYSLGMMYF-----TGTGVEKDMKRAFEYFAKAADKGHAKAQ 68
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+L V+ G G ++N +RAA G+ A L H + G+GV Q+ +
Sbjct: 69 YNLGVLYDRGEGTAQN---YEQAFEWYSRAAEQGYPPAEYNLAHLYKKGHGVAQSDEQAL 125
Query: 232 RFLVQA 237
++ +A
Sbjct: 126 KWYTKA 131
>gi|299740414|ref|XP_001838814.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
gi|298404233|gb|EAU83041.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
Length = 1320
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%)
Query: 331 SHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
S VG G + + R CS C +V YCSR CQ DWK RHK++C
Sbjct: 1122 STVGAGVDPSTQRNHRACSQCRSVFYCSRECQKEDWKRRHKSEC 1165
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 3/40 (7%)
Query: 348 CSVCGAVNYCSRACQALDWKLRHKADC---APAERWLDEE 384
CS C +V YCSR CQ DWK H+ +C A + +D+E
Sbjct: 458 CSGCKSVAYCSRECQKQDWKRVHREECKLLATQRKSMDKE 497
>gi|157737339|ref|YP_001490022.1| TPR repeat-containing SEL1 subfamily protein [Arcobacter butzleri
RM4018]
gi|157699193|gb|ABV67353.1| TPR repeat protein, SEL1 subfamily [Arcobacter butzleri RM4018]
Length = 180
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 72/143 (50%), Gaps = 11/143 (7%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMA------KAAISSHAQALYSL 174
+A +F + A NV+A ++LG I +++G A+L+ K A + Q L+++
Sbjct: 40 TAIKFYEKAASKNNVDAIWSLGFIYDNVQEHKGINANLVEAFKWYKKCADLNDEQCLFNV 99
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
+ G G + D + +AA G+IDA LG ++G GVR+NI+E ++
Sbjct: 100 GNMYHEGHGVKQ---DYVEAIKWYLKAADKGYIDAYYNLGLMHEEGLGVRKNISEAIKWY 156
Query: 235 VQANARELAAVLSSAACQGISTR 257
+ A +L S +A + +S++
Sbjct: 157 QK--AADLGDEESLSAVKELSSK 177
>gi|123437585|ref|XP_001309587.1| TKL family protein kinase [Trichomonas vaginalis G3]
gi|121891320|gb|EAX96657.1| TKL family protein kinase [Trichomonas vaginalis G3]
Length = 1103
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 74/144 (51%), Gaps = 8/144 (5%)
Query: 111 FAIKANNWS-ESAHRFLKLCADAGNVEACYTLG--MIR-FYCLQNRGSGASLMAKAAISS 166
A ++N +S + + ++K AD G++EA Y G ++R F Q+ A AA +
Sbjct: 780 LAAESNKYSAQQSAIYMKKAADKGHLEAKYQCGNYLVRGFGTKQDLSKAAIYFYDAAKNG 839
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
H+ A+ + NG G K DLR +C +A G+++A+ + +++G GV+ N
Sbjct: 840 HSGAMLECSNFLSNGMGVKK---DLRKAAYMCKLSAESGNVEAMYTFANIVKNGQGVQSN 896
Query: 227 IAEGR-RFLVQANARELAAVLSSA 249
+ F++ A+ + A+L++A
Sbjct: 897 SVDAHSSFVLAADRGHVKAMLAAA 920
>gi|299749049|ref|XP_001838462.2| hypothetical protein CC1G_09090 [Coprinopsis cinerea okayama7#130]
gi|298408258|gb|EAU83396.2| hypothetical protein CC1G_09090 [Coprinopsis cinerea okayama7#130]
Length = 716
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%)
Query: 338 PETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDV 389
P T+R+ +CS C AV YCS ACQ DWK RH++ C+ + + +G +
Sbjct: 468 PRTKRNRRPQCSGCAAVFYCSEACQREDWKRRHRSVCSSIRSYYNGHKHGQI 519
>gi|297839713|ref|XP_002887738.1| hypothetical protein ARALYDRAFT_340002 [Arabidopsis lyrata subsp.
lyrata]
gi|297333579|gb|EFH63997.1| hypothetical protein ARALYDRAFT_340002 [Arabidopsis lyrata subsp.
lyrata]
Length = 169
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 28/57 (49%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGEDN 403
RCS C +V YCS CQ + W+L HK +C P E G D +L G DN
Sbjct: 79 RCSRCKSVRYCSGECQTIHWRLSHKDECVPVEYCSTSTGKLSCDEELASLGGSQVDN 135
>gi|393234411|gb|EJD41974.1| hypothetical protein AURDEDRAFT_105699 [Auricularia delicata
TFB-10046 SS5]
Length = 531
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 340 TRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
T+ RRCS CGA +YCS +CQ DW+ HK +C ++W
Sbjct: 51 TQEAPLRRCSKCGAAHYCSSSCQTSDWQF-HKTECVSLQQW 90
>gi|238613981|ref|XP_002398576.1| hypothetical protein MPER_00805 [Moniliophthora perniciosa FA553]
gi|215475394|gb|EEB99506.1| hypothetical protein MPER_00805 [Moniliophthora perniciosa FA553]
Length = 311
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 33/59 (55%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEE 384
G+R C+++ CGR ET EF +C C YC + CQ+ W H+ C+ E +DE+
Sbjct: 188 GIRQCANMLCGRWETYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCSAKEVGVDED 246
>gi|397642774|gb|EJK75446.1| hypothetical protein THAOC_02829, partial [Thalassiosira oceanica]
Length = 365
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 49/98 (50%), Gaps = 2/98 (2%)
Query: 151 NRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDAL 210
N G +++ + +A+Y L + F+GS G + KD++ V L AA LG I+AL
Sbjct: 207 NDADGLAMIQTRVLKKDPEAIYFLGLKYFHGSLGLQ--KDMQRAVKLWEEAAELGSIEAL 264
Query: 211 RELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSS 248
LGH G GV+Q+ A+ F +A + L S
Sbjct: 265 FCLGHAHHKGDGVQQDNAKAVEFFTKAAMQGHVLALES 302
>gi|340352026|ref|ZP_08674916.1| TPR repeat protein [Prevotella pallens ATCC 700821]
gi|339615761|gb|EGQ20431.1| TPR repeat protein [Prevotella pallens ATCC 700821]
Length = 198
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 5/121 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLAV 176
++A ++ + A+AGN++A LG Y L N S KAA + A Y L +
Sbjct: 56 DTAFKYYQKAAEAGNIDAYTGLGRCYAYGLGVAFNAQSALHWYQKAAYAGSALGQYYLGI 115
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ F G G K DK + G+ +AA A+ +G+C + G GV +NI E + +
Sbjct: 116 LYFEGWNGEKPDK--KRGLEWLYKAANSCEPWAMFYIGNCYRRGDGVEKNIDEAIHWYKK 173
Query: 237 A 237
A
Sbjct: 174 A 174
>gi|237748075|ref|ZP_04578555.1| TPR repeat-containing protein [Oxalobacter formigenes OXCC13]
gi|229379437|gb|EEO29528.1| TPR repeat-containing protein [Oxalobacter formigenes OXCC13]
Length = 311
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 58/120 (48%), Gaps = 6/120 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++L+L A G +A LG++ Q+ G A L+ +AA+ A +L ++
Sbjct: 159 QAEKWLRLAAKQGEPDAQTYLGLLYLDGTELPQDVGEAARLLKEAAVKGDPNAQSALGMM 218
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
F+G G D+D+ +AA G++DA LG+ G GV ++ R +L +A
Sbjct: 219 YFSGKG---VDQDMNESEKWLEKAAIAGNVDAQTFLGNLYYKGIGVAKDDTRARYWLQKA 275
>gi|365920272|ref|ZP_09444615.1| Sel1 repeat protein [Cardiobacterium valvarum F0432]
gi|364578339|gb|EHM55551.1| Sel1 repeat protein [Cardiobacterium valvarum F0432]
Length = 389
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 70/148 (47%), Gaps = 16/148 (10%)
Query: 104 SKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTL--------GMIRFYCLQNRGSG 155
S A ++ I ++ A R L+ A GNVEA L G+ + Y + G
Sbjct: 133 SAAVQRAIRILREGDADGARRKLEALAREGNVEAIRYLAVMFQRGIGVAQDYAIAREWWG 192
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
+L + ++ Y++ + FNG G +++ R +AA G+ +A R LG
Sbjct: 193 VALALQDDVAQ-----YAMGELYFNGDGVTRDYAQARH---YWEQAAAQGNSEAQRGLGV 244
Query: 216 CLQDGYGVRQNIAEGRRFLVQANARELA 243
+DG+GV Q+ AE RR+L QA + LA
Sbjct: 245 LFRDGHGVAQDYAEARRWLAQAAEQGLA 272
>gi|239614668|gb|EEQ91655.1| chitin synthase activator [Ajellomyces dermatitidis ER-3]
Length = 884
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 81/191 (42%), Gaps = 22/191 (11%)
Query: 65 LVVSILCKLSSTARC-PSDFVNVLITCKRMNGLALNSLVL----SKASKKTFAIKANNWS 119
L L ++ A+ PSD L K+ L S+VL +A KT +
Sbjct: 508 LTHEELNRIQHAAKSNPSDHKAQLFLAKK---LVEASMVLIDDNGRADLKTRNKNRERYI 564
Query: 120 ESAHRFLKLCADAGNVEA------CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
AH+ +K AG A CY G + + SL A + H+++ Y
Sbjct: 565 FDAHKIVKKLVSAGYPPAMFYLADCYGEGQLGLEV--DPKEAFSLYQSGAKAGHSESAYR 622
Query: 174 LAV---IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNIAE 229
LAV + + G GG+K +D V RAA LG A+ ++G L G G ++N E
Sbjct: 623 LAVCCEMGYEGGGGTK--RDPMKAVQWYRRAAALGDTPAMYKMGMILLKGLLGQQRNPRE 680
Query: 230 GRRFLVQANAR 240
G +L +A+ R
Sbjct: 681 GVSWLKRASER 691
>gi|239503112|ref|ZP_04662422.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii AB900]
gi|421679060|ref|ZP_16118940.1| Sel1 repeat protein [Acinetobacter baumannii OIFC111]
gi|410391751|gb|EKP44117.1| Sel1 repeat protein [Acinetobacter baumannii OIFC111]
Length = 230
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 80/180 (44%), Gaps = 19/180 (10%)
Query: 66 VVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRF 125
+ +++ L S+A +V ++ +A L + + + +K N+ E A ++
Sbjct: 10 IFTMIFSLHSSAFAEPSVADVQYRAQQGQPIAQYHLGIMLLTGEQGVVK--NY-EQAFKW 66
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLAVI 177
L G+V A Y+LGM+ F G+G AKAA HA+A Y+L V+
Sbjct: 67 LTAADQNGSVGAKYSLGMMYF-----TGTGVEKDMKRAFEYFAKAADKGHAKAQYNLGVL 121
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G ++N +RAA G+ A L H + G+GV Q+ + ++ +A
Sbjct: 122 YDRGEGTAQN---YEQAFEWYSRAAEQGYPPAEYNLAHLYKKGHGVSQSNEQALKWYTKA 178
>gi|261196343|ref|XP_002624575.1| chitin synthase activator [Ajellomyces dermatitidis SLH14081]
gi|239587708|gb|EEQ70351.1| chitin synthase activator [Ajellomyces dermatitidis SLH14081]
Length = 912
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 81/191 (42%), Gaps = 22/191 (11%)
Query: 65 LVVSILCKLSSTARC-PSDFVNVLITCKRMNGLALNSLVL----SKASKKTFAIKANNWS 119
L L ++ A+ PSD L K+ L S+VL +A KT +
Sbjct: 536 LTHEELNRIQHAAKSNPSDHKAQLFLAKK---LVEASMVLIDDNGRADLKTRNKNRERYI 592
Query: 120 ESAHRFLKLCADAGNVEA------CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
AH+ +K AG A CY G + + SL A + H+++ Y
Sbjct: 593 FDAHKIVKKLVSAGYPPAMFYLADCYGEGQLGLEV--DPKEAFSLYQSGAKAGHSESAYR 650
Query: 174 LAV---IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNIAE 229
LAV + + G GG+K +D V RAA LG A+ ++G L G G ++N E
Sbjct: 651 LAVCCEMGYEGGGGTK--RDPMKAVQWYRRAAALGDTPAMYKMGMILLKGLLGQQRNPRE 708
Query: 230 GRRFLVQANAR 240
G +L +A+ R
Sbjct: 709 GVSWLKRASER 719
>gi|327358165|gb|EGE87022.1| chitin synthase activator [Ajellomyces dermatitidis ATCC 18188]
Length = 898
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 81/191 (42%), Gaps = 22/191 (11%)
Query: 65 LVVSILCKLSSTARC-PSDFVNVLITCKRMNGLALNSLVL----SKASKKTFAIKANNWS 119
L L ++ A+ PSD L K+ L S+VL +A KT +
Sbjct: 522 LTHEELNRIQHAAKSNPSDHKAQLFLAKK---LVEASMVLIDDNGRADLKTRNKNRERYI 578
Query: 120 ESAHRFLKLCADAGNVEA------CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
AH+ +K AG A CY G + + SL A + H+++ Y
Sbjct: 579 FDAHKIVKKLVSAGYPPAMFYLADCYGEGQLGLEV--DPKEAFSLYQSGAKAGHSESAYR 636
Query: 174 LAV---IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNIAE 229
LAV + + G GG+K +D V RAA LG A+ ++G L G G ++N E
Sbjct: 637 LAVCCEMGYEGGGGTK--RDPMKAVQWYRRAAALGDTPAMYKMGMILLKGLLGQQRNPRE 694
Query: 230 GRRFLVQANAR 240
G +L +A+ R
Sbjct: 695 GVSWLKRASER 705
>gi|315637052|ref|ZP_07892275.1| Sel1 repeat superfamily protein [Arcobacter butzleri JV22]
gi|315478588|gb|EFU69298.1| Sel1 repeat superfamily protein [Arcobacter butzleri JV22]
Length = 180
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 71/143 (49%), Gaps = 11/143 (7%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMA------KAAISSHAQALYSL 174
+A +F + A NV+A ++LG I +++G A+L+ K A + Q L+++
Sbjct: 40 TAIKFYEKAASKNNVDAIWSLGFIYDNVQEHKGINANLVEAFKWYKKCADLNDEQCLFNV 99
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
+ G G + D + +AA G+IDA LG ++G GVR+NI E ++
Sbjct: 100 GNMYHEGHGVKQ---DYVEAIKWYLKAADKGYIDAYYNLGLMHEEGLGVRKNIPEAIKWY 156
Query: 235 VQANARELAAVLSSAACQGISTR 257
+ A +L S +A + +S++
Sbjct: 157 QK--AADLGDEESLSAVKELSSK 177
>gi|237729590|ref|ZP_04560071.1| Sel1 domain-containing protein repeat-containing protein
[Citrobacter sp. 30_2]
gi|226908196|gb|EEH94114.1| Sel1 domain-containing protein repeat-containing protein
[Citrobacter sp. 30_2]
Length = 470
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 63/127 (49%), Gaps = 6/127 (4%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKAAISSHAQA 170
+A W++ L+ A+AG+ +A + L + QN + + K+A +A A
Sbjct: 24 QAKEWNKEPVAVLQQGAEAGDAQAQFVLATVYLKGKEVEQNFSKVSEWLTKSAEQGNASA 83
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
+ LA++ + G GG K+DK ++AA G++DA LG +G GV+QN +
Sbjct: 84 QHLLALMYYEGKGGLKDDKK---AAEWFSKAAAQGNVDAQYYLGVLYFEGKGVQQNDKKA 140
Query: 231 RRFLVQA 237
+L +A
Sbjct: 141 VEWLTRA 147
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 47/104 (45%), Gaps = 6/104 (5%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYC---LQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
+L A+ GN A + L ++ + L++ A +KAA + A Y L V+ F
Sbjct: 70 EWLTKSAEQGNASAQHLLALMYYEGKGGLKDDKKAAEWFSKAAAQGNVDAQYYLGVLYFE 129
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
G G +NDK V RAA G++DA L DG G R
Sbjct: 130 GKGVQQNDKK---AVEWLTRAAEQGNLDARYLLALVYFDGEGAR 170
>gi|184156430|ref|YP_001844769.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii ACICU]
gi|260557655|ref|ZP_05829869.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii ATCC 19606 = CIP 70.34]
gi|332875557|ref|ZP_08443370.1| Sel1 repeat protein [Acinetobacter baumannii 6014059]
gi|417548213|ref|ZP_12199294.1| Sel1 repeat protein [Acinetobacter baumannii Naval-18]
gi|417567536|ref|ZP_12218408.1| Sel1 repeat protein [Acinetobacter baumannii OIFC143]
gi|417576511|ref|ZP_12227356.1| Sel1 repeat protein [Acinetobacter baumannii Naval-17]
gi|421631487|ref|ZP_16072156.1| Sel1 repeat protein [Acinetobacter baumannii OIFC180]
gi|445450083|ref|ZP_21444418.1| Sel1 repeat protein [Acinetobacter baumannii WC-A-92]
gi|445491657|ref|ZP_21459888.1| Sel1 repeat protein [Acinetobacter baumannii AA-014]
gi|183208024|gb|ACC55422.1| TPR repeat, SEL1 subfamily protein [Acinetobacter baumannii ACICU]
gi|193075991|gb|ABO10574.2| putative signal peptide [Acinetobacter baumannii ATCC 17978]
gi|260408828|gb|EEX02132.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii ATCC 19606 = CIP 70.34]
gi|332736260|gb|EGJ67274.1| Sel1 repeat protein [Acinetobacter baumannii 6014059]
gi|395553208|gb|EJG19216.1| Sel1 repeat protein [Acinetobacter baumannii OIFC143]
gi|395569732|gb|EJG30394.1| Sel1 repeat protein [Acinetobacter baumannii Naval-17]
gi|400388512|gb|EJP51584.1| Sel1 repeat protein [Acinetobacter baumannii Naval-18]
gi|408692759|gb|EKL38373.1| Sel1 repeat protein [Acinetobacter baumannii OIFC180]
gi|444756066|gb|ELW80625.1| Sel1 repeat protein [Acinetobacter baumannii WC-A-92]
gi|444764195|gb|ELW88518.1| Sel1 repeat protein [Acinetobacter baumannii AA-014]
gi|452950078|gb|EME55543.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii MSP4-16]
Length = 230
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 80/180 (44%), Gaps = 19/180 (10%)
Query: 66 VVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRF 125
+ +++ L S+A +V ++ +A L + + + +K N+ E A ++
Sbjct: 10 IFTMIFSLHSSAFAEPSVADVQYRAQQGQPIAQYHLGIMLLTGEQGVVK--NY-EQAFKW 66
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLAVI 177
L G+V A Y+LGM+ F G+G AKAA HA+A Y+L V+
Sbjct: 67 LTAADQNGSVGAKYSLGMMYF-----TGTGVEKDMKRAFEYFAKAADKGHAKAQYNLGVL 121
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G ++N +RAA G+ A L H + G+GV Q+ + ++ +A
Sbjct: 122 YDRGEGTAQN---YEQAFEWYSRAAEQGYPPAEYNLAHLYKKGHGVAQSDEQALKWYTKA 178
>gi|123481611|ref|XP_001323598.1| CAMK family protein kinase [Trichomonas vaginalis G3]
gi|121906466|gb|EAY11375.1| CAMK family protein kinase [Trichomonas vaginalis G3]
Length = 2004
Score = 48.1 bits (113), Expect = 0.008, Method: Composition-based stats.
Identities = 47/139 (33%), Positives = 61/139 (43%), Gaps = 12/139 (8%)
Query: 108 KKTFAIKANNWSESAHRFLKLCADAGNVEACYTLG-MIR--FYCLQNRGSGASLMAKAAI 164
+ F +K N E A + + A + +ACY G MIR F QN A AA
Sbjct: 1644 RNGFGVKQN--LEGAVEYFRRAAKLNHADACYNCGLMIRLGFGAKQNLSRAAYYYYLAAK 1701
Query: 165 SSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
H A Y+LA++ NG G SKN+ + AR G A LG L++G GV
Sbjct: 1702 QRHIYASYNLAILHSNGWGVSKNESLAAYYFGIAARG---GDAAAQANLGLMLKNGIGVE 1758
Query: 225 QNIAEG----RRFLVQANA 239
+NI RR Q NA
Sbjct: 1759 KNIFGAVKYFRRSARQGNA 1777
Score = 40.4 bits (93), Expect = 1.4, Method: Composition-based stats.
Identities = 32/122 (26%), Positives = 53/122 (43%), Gaps = 6/122 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
E A F + A GNV A Y + C +N A ++A ++ + + L
Sbjct: 1799 EKATIFFRFAAKQGNVSAMYNYAIALLKGVGCKRNPKKAAKILALSSREGDIDSQFKLGY 1858
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ + G + KD G+ A AA GHI A+ +G L++G + QNI ++
Sbjct: 1859 MLYKGE---RIRKDPIRGLQYLAMAARQGHILAMIMIGRALKNGDFIGQNIELSLKYFKA 1915
Query: 237 AN 238
A+
Sbjct: 1916 AS 1917
Score = 38.9 bits (89), Expect = 4.8, Method: Composition-based stats.
Identities = 36/134 (26%), Positives = 62/134 (46%), Gaps = 7/134 (5%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQ 169
I+ +++ + H FL++ A G EA Y LG++ +N A+ AA S H +
Sbjct: 368 IECHSFQDVQH-FLQIAAQNGCSEANYQLGILYQNGISVEKNIELAANYYRLAAQSGHRE 426
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A ++ NG KN K+ + + +A G+ A+ + L++G GV +NI E
Sbjct: 427 AQLQYGLMLQNGYDIQKNIKE---AANIYSESAKQGNPGAMNQYALLLKEGIGVDKNIKE 483
Query: 230 GRRFLVQANARELA 243
+ A +E A
Sbjct: 484 AAKLFKNAADKENA 497
Score = 38.5 bits (88), Expect = 6.8, Method: Composition-based stats.
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 3/88 (3%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
+N A L AA +A+A + A++ NG G KN ++ ++A G+I+A
Sbjct: 479 KNIKEAAKLFKNAADKENAEAQNNFAIMLQNGQGVPKN---IKMAAKYFEKSAKNGNIEA 535
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G CL+ G GV ++I ++ Q+
Sbjct: 536 QSNYGWCLKVGAGVEKDIELSTKYFKQS 563
>gi|293411917|ref|ZP_06654642.1| predicted protein [Escherichia coli B354]
gi|291469472|gb|EFF11961.1| predicted protein [Escherichia coli B354]
Length = 381
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 55/108 (50%), Gaps = 6/108 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A ++ KL +DAG V+A LG Y +N + KAA ++A+ L ++
Sbjct: 134 AIKYFKLASDAGYVKATSVLGAYYQYGKGVKKNYKKAFTYYKKAADQGSSEAMIGLGILY 193
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+G G +ND + V +AA LG+ DA+ LG ++G GV+++
Sbjct: 194 DDGLGVKRNDAE---AVKWYKKAAELGNADAITNLGIMYENGEGVKKD 238
>gi|295104276|emb|CBL01820.1| FOG: TPR repeat, SEL1 subfamily [Faecalibacterium prausnitzii
SL3/3]
Length = 734
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 62/125 (49%), Gaps = 16/125 (12%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG--------ASLMAKAAISSHAQALY 172
A ++ + A+ GN +A LG +R+ +RG G A AK+A H +A
Sbjct: 160 EAAKWYRKAAEQGNADAQNRLG-VRY----DRGEGVSKDVKEAAKWYAKSAAQGHPKAQC 214
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+LA+ G G +KDL+ V L +A G+ +A LG C +G GV Q+ AE +
Sbjct: 215 NLALDYKTGKG---VEKDLKKAVELFYNSAIQGYANAQSNLGECYYNGEGVEQDHAEAFK 271
Query: 233 FLVQA 237
++ +A
Sbjct: 272 WMKKA 276
>gi|75676923|ref|YP_319344.1| Sel1 repeat-containing protein [Nitrobacter winogradskyi Nb-255]
gi|74421793|gb|ABA05992.1| sel1-like repeat [Nitrobacter winogradskyi Nb-255]
Length = 346
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 56/122 (45%), Gaps = 10/122 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A L+ ADAG+ EA Y L FY +N L+ A+++ + A A+
Sbjct: 195 AAELLRFAADAGSPEAQYALAT--FYKEGTGVEKNLYKSVRLLQAASLAGNVDAEVEYAI 252
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
FNGSG KN+ A V+L +AA A L H L +G GV + EG ++ +
Sbjct: 253 ALFNGSGTGKNEA---AAVSLLRKAARRNSAIAQNRLAHALVEGMGVPMDKVEGLKWHIV 309
Query: 237 AN 238
A
Sbjct: 310 AK 311
>gi|50085803|ref|YP_047313.1| signal peptide [Acinetobacter sp. ADP1]
gi|49531779|emb|CAG69491.1| conserved hypothetical protein; putative signal peptide
[Acinetobacter sp. ADP1]
Length = 240
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 56/122 (45%), Gaps = 6/122 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLA 175
+ A ++L L G+V A Y+LG++ QN+ + +KAA HAQA Y++
Sbjct: 70 TTQALKWLTLADQNGSVGAKYSLGLMYMTGTGVSQNQSTAFEWFSKAAKFGHAQAQYTVG 129
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
+ G G KN + +AA G+ A LG DG GV QN + ++
Sbjct: 130 RMYSEGVGVEKN---MPQAFEWIQKAALQGYPPAEFSLGLMYNDGRGVAQNKQQAIKWYT 186
Query: 236 QA 237
QA
Sbjct: 187 QA 188
>gi|86360401|ref|YP_472289.1| hypothetical protein RHE_PE00124 [Rhizobium etli CFN 42]
gi|86284503|gb|ABC93562.1| hypothetical conserved protein [Rhizobium etli CFN 42]
Length = 862
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 54/122 (44%), Gaps = 6/122 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQAL 171
AN +++A +K DAG+V A L + +N G + + A A AL
Sbjct: 558 ANRDAKTAFATIKKAMDAGHVRAISELAALYLVGASVPENLGKSSEIAAIGAKKGDPYAL 617
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+ + G G D G+ L +AA LGH A+ ELG+ +G V ++ G
Sbjct: 618 YAYGKSLYYGRGVKA---DTEEGLKLMLQAADLGHTYAMNELGYIFSNGVNVPPDMERGI 674
Query: 232 RF 233
RF
Sbjct: 675 RF 676
>gi|397587063|gb|EJK53796.1| hypothetical protein THAOC_26690, partial [Thalassiosira oceanica]
Length = 271
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 87/240 (36%), Gaps = 51/240 (21%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDK---DLRAGVAL------CARAAFLGHI 207
+L+ KAA H QA LA+ +G G + D+ RA VA C+ A LG +
Sbjct: 46 NLLVKAANLGHGQANSRLAMRYLSGDNGFEKDRVKGYFRASVAFALDDTDCSAAQMLGLM 105
Query: 208 DALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQ 267
Y + E +F ++ +L A AC ++ + + Q
Sbjct: 106 -------------YLHGSELLEESKFSQESEFLDLEAS-PYLACYYMNIAANVVDVYGGQ 151
Query: 268 PHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPG--- 324
A +++ HNG C VP PA F A + G
Sbjct: 152 YSNALFSLDKHL-HNGDY------------CEVPGYNVLPAVFFWASKYRDSDMNKGCKM 198
Query: 325 ----------PGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G R CS CGR +F++CS C A YCS+ CQ WK H DC
Sbjct: 199 ELEVPKEIEKLGQRFCS--SCGREAQADEKFKQCSRCKAQWYCSKECQVEAWKAGHNKDC 256
>gi|393199170|ref|YP_006461012.1| hypothetical protein SSIL_0443 [Solibacillus silvestris StLB046]
gi|327438501|dbj|BAK14866.1| FOG: TPR repeat [Solibacillus silvestris StLB046]
Length = 280
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLA 175
+ A + K+ A G EA +TLG++ L + + +++A +A+Y +
Sbjct: 121 EQQALSWFKMAAQFGIAEAMFTLGIMYEQGLGTECDESQAFAYYSRSAEKQDEEAMYRMG 180
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+I F+G G + D + L A F H+DA +G+C ++G+GV +N
Sbjct: 181 MIYFSGELGQQQDNEKALEWFLKASGQF--HVDATFNIGYCYENGHGVARN 229
>gi|406667973|ref|ZP_11075722.1| Putative beta-lactamase hcpC precursor [Bacillus isronensis B3W22]
gi|405384179|gb|EKB43629.1| Putative beta-lactamase hcpC precursor [Bacillus isronensis B3W22]
Length = 280
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLA 175
+ A + K+ A G EA +TLG++ L + + +++A +A+Y +
Sbjct: 121 EQQALSWFKMAAQFGIAEAMFTLGIMYEQGLGTECDESQAFAYYSRSAEKQDEEAMYRMG 180
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+I F+G G + D + L A F H+DA +G+C ++G+GV +N
Sbjct: 181 MIYFSGELGQQQDNEKALEWFLKASGQF--HVDATFNIGYCYENGHGVARN 229
>gi|399154275|ref|ZP_10754342.1| TPR repeat-containing SEL1 subfamily protein [gamma proteobacterium
SCGC AAA007-O20]
Length = 198
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/144 (24%), Positives = 69/144 (47%), Gaps = 9/144 (6%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
E A + +L A G+++A + +G++ + QN G + KAA SH +A Y+L
Sbjct: 37 EKALKAFELAAKTGDLDAMHAVGIMYIGGWGIEQNDKKGLEYIIKAANQSHLKAQYTLGA 96
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ + G G + D + + +A G++DA L ++G GV++N+ + + +
Sbjct: 97 MYYLGIGVT---LDFEKAFSWISLSANQGYLDAQHNLAEMYENGKGVKKNLEKAYEYYLL 153
Query: 237 ANAR---ELAAVLSSAACQGISTR 257
A + + ++S +GI T
Sbjct: 154 AARKGNLDSQKMVSEMYKEGIGTE 177
>gi|384483828|gb|EIE76008.1| hypothetical protein RO3G_00712 [Rhizopus delemar RA 99-880]
Length = 610
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 52/113 (46%), Gaps = 11/113 (9%)
Query: 147 YCLQN-------RGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCA 199
YC QN GA +AAI HA+A ++L NG G KN+++ V
Sbjct: 322 YCFQNGIGVQKDDIRGAYWYRRAAIQGHARAQHNLGFCYQNGIGVEKNEEE---AVKWYK 378
Query: 200 RAAFLGHIDALRELGHCLQDGYGVRQNIAEG-RRFLVQANARELAAVLSSAAC 251
R+A G+I A LG+C Q+G GV N E +++ A A LS C
Sbjct: 379 RSAERGNIFAYHSLGYCYQNGIGVSTNEEEAVFWYMLSAKENHAPAQLSLGYC 431
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 56/116 (48%), Gaps = 16/116 (13%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI--------SSHAQA 170
E A ++ K A+ GN+ A ++LG YC QN G G S + A+ +HA A
Sbjct: 370 EEEAVKWYKRSAERGNIFAYHSLG----YCYQN-GIGVSTNEEEAVFWYMLSAKENHAPA 424
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
SL NG G KN+ R V ++A G+ A LG C ++G GV+++
Sbjct: 425 QLSLGYCYRNGIGVPKNE---REAVKWFRKSAEQGNALAQNSLGFCYEEGLGVKKD 477
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 56/128 (43%), Gaps = 14/128 (10%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL-------QNRGSGASLMAKAAISSHAQALY 172
+ A ++ +L A+ GN A +LG YC Q+ K+A + A
Sbjct: 263 QEAVKWYRLSAEQGNAFAQNSLG----YCYEDGIGVDQDFNQATFWYQKSADQGYPWAEC 318
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+L NG G K+D +R G RAA GH A LG C Q+G GV +N E +
Sbjct: 319 NLGYCFQNGIGVQKDD--IR-GAYWYRRAAIQGHARAQHNLGFCYQNGIGVEKNEEEAVK 375
Query: 233 FLVQANAR 240
+ ++ R
Sbjct: 376 WYKRSAER 383
>gi|414175530|ref|ZP_11429934.1| hypothetical protein HMPREF9695_03580 [Afipia broomeae ATCC 49717]
gi|410889359|gb|EKS37162.1| hypothetical protein HMPREF9695_03580 [Afipia broomeae ATCC 49717]
Length = 374
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 57/116 (49%), Gaps = 6/116 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K AD G+ EA + LGM++ NR GA L+A +A A A Y+L ++
Sbjct: 151 AAEWYKQAADRGDREAMFALGMMKIAGRGGPANRDEGARLLASSAKLGKAAAAYNLGLLY 210
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
G +D++ L +AA G+ +A L ++G GV +++AE + +
Sbjct: 211 LE---GQVFPQDIKRAAELFRQAANAGNPEAQYALATFYKEGRGVEKDLAEAAKLM 263
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 51/117 (43%), Gaps = 10/117 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A + A+AGN EA Y L FY ++ A LM AA+ + A A+
Sbjct: 223 AAELFRQAANAGNPEAQYALAT--FYKEGRGVEKDLAEAAKLMRAAAMVDNLDAEVEYAI 280
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+NG+G KD+ +AL RAA A L L DG GV + +G ++
Sbjct: 281 ALYNGTG---TPKDVPTAIALLNRAARQNSPIAQNRLARILVDGLGVPVDKVQGFKW 334
>gi|345869667|ref|ZP_08821624.1| serine/threonine protein kinase [Thiorhodococcus drewsii AZ1]
gi|343923050|gb|EGV33747.1| serine/threonine protein kinase [Thiorhodococcus drewsii AZ1]
Length = 981
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 61/127 (48%), Gaps = 7/127 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
+++H F K A+ G+ A +LG+ FY +Q+ KAA + +A Y+L +
Sbjct: 740 QASHWFRK-AAEQGHAAAQNSLGIAFFYGRGVIQSDHQALKWFHKAAEQGYIEAQYNLGL 798
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
I G G K+D+ +AA GH +A LG +G GV++N ++ R+ +
Sbjct: 799 INTFGRGTKKDDQQ---SAEWFHKAASQGHTEAQYNLGIMYSEGRGVKKNQSQAARWYRK 855
Query: 237 ANARELA 243
A + A
Sbjct: 856 AAEQGFA 862
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 3/112 (2%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSL 174
A NW A + +D +++ + Q+ L KAAI+ HA A Y+L
Sbjct: 593 AANWIAKAEEINPINSDLIGLQSHLGFIYLETNTDQDNRKALRLFNKAAINGHAMAQYNL 652
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
V+ NG G +K+D R V +AA G+ DA LG +G G+ ++
Sbjct: 653 GVMYANGLGTTKDD---RQAVEWYRKAAEQGNADAQNNLGVMYANGLGITKD 701
>gi|84387130|ref|ZP_00990152.1| TPR repeat protein, SEL1 subfamily [Vibrio splendidus 12B01]
gi|84377991|gb|EAP94852.1| TPR repeat protein, SEL1 subfamily [Vibrio splendidus 12B01]
Length = 335
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 56/118 (47%), Gaps = 4/118 (3%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
E + ++ A++G V+A TL ++Y + + KA S ALYSL V+ F
Sbjct: 68 ERGYEYMNQLAESGEVKAIITLA-DKYYYDEQYEKALAWYHKAEPSKDPYALYSLGVMYF 126
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+G G DL+ G +A G+ DA+ +L +G GV Q+ ++ + Q+
Sbjct: 127 DGEG---TPVDLKKGNDYYLASAQAGYSDAMYQLAFSYDEGQGVAQDFSKSAYWFEQS 181
>gi|169826234|ref|YP_001696392.1| suppressor of lin-12-like protein (Sel-1L) [Lysinibacillus
sphaericus C3-41]
gi|168990722|gb|ACA38262.1| Suppressor of lin-12-like protein (Sel-1L) [Lysinibacillus
sphaericus C3-41]
Length = 619
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 59/126 (46%), Gaps = 18/126 (14%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAIS--------SHAQALYSLAV 176
+LK A+ G V+A Y LG+ L G G + A+ SH QA Y L +
Sbjct: 430 YLKAAAEKGYVDAQYELGL-----LLTAGDGVPVNYPEAVQWWRAATDQSHIQAEYQLGL 484
Query: 177 IQFNGSGGSKNDKDLRAGVALCAR-AAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
+ G G S + ++ R C R AA GH A +LG+ G GV Q+ E +++
Sbjct: 485 VYEQGLGVSIDLEEARR----CYRLAAIQGHAGAQYQLGNLFDKGKGVTQDYTEAAKWIE 540
Query: 236 QANARE 241
QA A+E
Sbjct: 541 QAAAQE 546
Score = 42.0 bits (97), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 53/123 (43%), Gaps = 16/123 (13%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG--------ASLMAKAAISSHAQAL 171
E A R +L A G+ A Y LG L ++G G A + +AA H +A
Sbjct: 497 EEARRCYRLAAIQGHAGAQYQLG-----NLFDKGKGVTQDYTEAAKWIEQAAAQEHIKAQ 551
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y LA + +G G K D L AA GH A +LG + G GV Q+ E
Sbjct: 552 YQLAQMHIHGQGVPK---DFAKAAQLYRLAANQGHQKAQFQLGLLYKKGQGVAQDYQEAT 608
Query: 232 RFL 234
++L
Sbjct: 609 KWL 611
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 75/206 (36%), Gaps = 27/206 (13%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
A ++L L A+ G++ A + LG + Y L + AA + A Y L ++ G
Sbjct: 394 AEKWLLLAAEKGHISAGFELGRLYVYQLHQPDKALPYLKAAAEKGYVDAQYELGLLLTAG 453
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARE 241
G N + V A HI A +LG + G GV ++ E RR
Sbjct: 454 DGVPVNYPE---AVQWWRAATDQSHIQAEYQLGLVYEQGLGVSIDLEEARR------CYR 504
Query: 242 LAAVLSSAACQ-----------GISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGC 290
LAA+ A Q G++ + Q H + Q H+ G G
Sbjct: 505 LAAIQGHAGAQYQLGNLFDKGKGVTQDYTEAAKWIEQAAAQEHIKAQYQLAQMHIHGQGV 564
Query: 291 PLLSDFG-----CNVPAPEAHPASRF 311
P DF + A + H ++F
Sbjct: 565 P--KDFAKAAQLYRLAANQGHQKAQF 588
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 55/122 (45%), Gaps = 7/122 (5%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALY 172
N++E+AH + ++ AD GN +A Y LG++ L QN A AA + A
Sbjct: 213 QNYTEAAHHY-QIAADLGNADAQYNLGVLYNQGLGMSQNFLEAAKWYTLAADQGNTSAQN 271
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+L + NG+G ++ + + A A G A LG+ G G+ QN E +
Sbjct: 272 NLGFLYHNGTGVEQSYVEASTYFEMAALA---GDASAQYNLGYMHLKGRGIPQNFTEAAK 328
Query: 233 FL 234
+
Sbjct: 329 WF 330
>gi|384491727|gb|EIE82923.1| hypothetical protein RO3G_07628 [Rhizopus delemar RA 99-880]
Length = 644
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 56/117 (47%), Gaps = 14/117 (11%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGS-------GASLMAKAAISSHAQALY 172
+ A ++ KL A+ G A LG YC QN G+ KAA+ HA+A +
Sbjct: 292 QEAVKWYKLSAEQGYPWAECNLG----YCYQNGIGVEKDDVLGSYWYKKAALQGHARAQH 347
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
+L NG G +N+K+ V R+A G+I A LG+C Q+G GV N E
Sbjct: 348 NLGFCYQNGIGIERNEKE---AVKWYRRSAERGNIFAYHSLGYCYQNGIGVDVNEQE 401
>gi|392568936|gb|EIW62110.1| hypothetical protein TRAVEDRAFT_27483 [Trametes versicolor
FP-101664 SS1]
Length = 399
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
L+ C GC E R EF+RC C V YC +ACQ DWK HK C
Sbjct: 308 LKRCVRAGCDTREVRVAEFKRCGGCKEVVYCGQACQKEDWKA-HKPRC 354
>gi|331082196|ref|ZP_08331323.1| hypothetical protein HMPREF0992_00247 [Lachnospiraceae bacterium
6_1_63FAA]
gi|330402990|gb|EGG82555.1| hypothetical protein HMPREF0992_00247 [Lachnospiraceae bacterium
6_1_63FAA]
Length = 215
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 77/176 (43%), Gaps = 13/176 (7%)
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
LK+ A+ G + Y +G ++ N A KAA HA+A LA++ G
Sbjct: 13 LKILAENGQKDFIYAVGYYYENGIRTSVNLTEAAVWYEKAAQKGHAEAAMRLALLYAQGK 72
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAREL 242
G KN+K + A+A G+ +AL +G C ++G GV+Q++++ + +A A
Sbjct: 73 GVEKNEKKAFTYMEQAAKA---GNTNALYNVGRCYEEGIGVKQDLSKAFDWYKKAAAE-- 127
Query: 243 AAVLSSAACQGISTRSWLTWNPL-SQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
+ C G +LT NP+ +P A + N N +L +G
Sbjct: 128 -GDFRAMCCMG---GYFLTGNPVPYEPAKAFQLFEKAANANIPAAQYNLSVLYRYG 179
>gi|338737836|ref|YP_004674798.1| Sel1 domain-containing protein [Hyphomicrobium sp. MC1]
gi|337758399|emb|CCB64224.1| Sel1 domain protein repeat-containing protein [Hyphomicrobium sp.
MC1]
Length = 360
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 12/128 (9%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRG------SGASLMAKAAISSHAQALYSLA 175
A+ + K +D G+++ + LGM Q +G L KAA+S A A Y+L
Sbjct: 127 ANEYFKRASDMGDIQGTFALGMA---YAQGKGVKKDYTQAGELFEKAALSGSADANYNLG 183
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
++ NG+G +N +RA AA G A +L Q G GV N E R+L
Sbjct: 184 LLFLNGTGKPQN--PIRA-YQHIRYAAEKGVPQAEYDLAELYQTGTGVDANALEAARWLS 240
Query: 236 QANARELA 243
+A+ + LA
Sbjct: 241 RASEQGLA 248
>gi|392575125|gb|EIW68259.1| hypothetical protein TREMEDRAFT_44669 [Tremella mesenterica DSM
1558]
Length = 773
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 26/46 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK 371
G R C+ + CG+ ET EF +C C YCS+ CQ++ W H+
Sbjct: 675 GTRQCARMDCGKKETVAREFSKCRKCRKAKYCSKECQSMAWAEGHR 720
>gi|30249930|ref|NP_842000.1| hypothetical protein NE1983 [Nitrosomonas europaea ATCC 19718]
gi|30139037|emb|CAD85894.1| conserved hypothetical protein [Nitrosomonas europaea ATCC 19718]
Length = 232
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 56/118 (47%), Gaps = 17/118 (14%)
Query: 123 HRFLKL--CADAGNVEACYTLGMIRFYCL------------QNRGSGASLMAKAAISSHA 168
RFL L A+AGN EA +LG + F ++ + A +AA HA
Sbjct: 77 QRFLNLRKDAEAGNAEAQNSLGSMYFSGEAISRDAQGKVKDKDPETAAGWFFRAAEQGHA 136
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
A ++L ++ F+G G ++ D V L ++A G+IDA LG G GV+QN
Sbjct: 137 GAQFNLGLLYFSGEGVTR---DTAKAVELFTKSAEQGNIDAQNNLGVIYLMGEGVKQN 191
>gi|189502274|ref|YP_001957991.1| hypothetical protein Aasi_0895 [Candidatus Amoebophilus asiaticus
5a2]
gi|189497715|gb|ACE06262.1| hypothetical protein Aasi_0895 [Candidatus Amoebophilus asiaticus
5a2]
Length = 1493
Score = 47.0 bits (110), Expect = 0.015, Method: Composition-based stats.
Identities = 40/124 (32%), Positives = 56/124 (45%), Gaps = 7/124 (5%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYS 173
N+ E+ F K AD G +A LG + + QN KAA A A S
Sbjct: 666 NYQEAIKWFQK-AADQGLADAQNNLGFMYQNGYGLSQNYQEAIKWFQKAADQGLAAAQNS 724
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L + NG G S+N ++ + +AA GH DA LG Q+GYG+ QN E ++
Sbjct: 725 LGFMYQNGYGLSQNYQE---AIKWYQKAAEQGHADAQNNLGFTYQNGYGLSQNYQEAIKW 781
Query: 234 LVQA 237
+A
Sbjct: 782 YQKA 785
Score = 40.8 bits (94), Expect = 1.2, Method: Composition-based stats.
Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 3/89 (3%)
Query: 160 AKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQD 219
AKAA + A L + +G GG +N ++ + +AA G DA LG Q+
Sbjct: 639 AKAADQGNMHAQNWLGFMYQHGQGGPQNYQE---AIKWFQKAADQGLADAQNNLGFMYQN 695
Query: 220 GYGVRQNIAEGRRFLVQANARELAAVLSS 248
GYG+ QN E ++ +A + LAA +S
Sbjct: 696 GYGLSQNYQEAIKWFQKAADQGLAAAQNS 724
>gi|366994436|ref|XP_003676982.1| hypothetical protein NCAS_0F01430 [Naumovozyma castellii CBS 4309]
gi|342302850|emb|CCC70627.1| hypothetical protein NCAS_0F01430 [Naumovozyma castellii CBS 4309]
Length = 612
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 33/62 (53%), Gaps = 3/62 (4%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
G+R C++ CG+ E +F +C C + YCSRACQ WK H+ C E L +G
Sbjct: 486 GVRQCANFACGKWEEYPKQFAKCRRCKSTKYCSRACQLKSWKF-HRYWC--NETKLPSQG 542
Query: 386 NG 387
N
Sbjct: 543 NS 544
>gi|397640489|gb|EJK74149.1| hypothetical protein THAOC_04191, partial [Thalassiosira oceanica]
Length = 205
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 42/76 (55%), Gaps = 3/76 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A +M + H +A+Y L F G G + KD+R V L AA LG I AL +LG+
Sbjct: 5 AMIMTRVRKKDH-EAIYFLGQQYFFGELGLQ--KDMRRAVELWTEAADLGSIQALFDLGN 61
Query: 216 CLQDGYGVRQNIAEGR 231
+ GYGV+Q++A R
Sbjct: 62 AYRQGYGVQQDMANKR 77
>gi|344925535|ref|ZP_08778996.1| Sel1 domain-containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 170
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 6/104 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAV 176
+ A ++ K+ AD GN +A + LG + L Q+ AA +HAQA Y++ V
Sbjct: 52 KEAFKWYKMAADQGNADAKFYLGFLYDDGLGVPQDYTEAMKWYLLAAEQNHAQAQYNIGV 111
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
+ +G G +KD + V AA GH+DA + L +++G
Sbjct: 112 LYNDGRG---VEKDYKEAVKWFRLAAEQGHVDAQQNLKLLMEEG 152
>gi|117924831|ref|YP_865448.1| hypothetical protein Mmc1_1533 [Magnetococcus marinus MC-1]
gi|117608587|gb|ABK44042.1| TPR repeat SEL1 subfamily-like protein [Magnetococcus marinus MC-1]
Length = 976
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 59/133 (44%), Gaps = 6/133 (4%)
Query: 110 TFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISS 166
TF E A R+LK A+ G+ EA + LG + + N KAA
Sbjct: 700 TFKTPLKQDYEQALRWLKKGAEGGSPEAQFHLGYMLHLGVGLAPNAHRAVHWYRKAAEQG 759
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
A+A +L + F G+G D+D+ V R A LGH+ AL LG+ + G GV +
Sbjct: 760 FAEAANNLGTLYFQGNG---VDRDVFKAVEWYTRGAKLGHVPALHNLGNHYRHGLGVAVD 816
Query: 227 IAEGRRFLVQANA 239
R + +A A
Sbjct: 817 ARLARHYFEKAQA 829
>gi|260588529|ref|ZP_05854442.1| TPR repeat protein [Blautia hansenii DSM 20583]
gi|260541004|gb|EEX21573.1| TPR repeat protein [Blautia hansenii DSM 20583]
Length = 215
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 76/176 (43%), Gaps = 13/176 (7%)
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
LK+ A+ G + Y +G ++ N A KAA HA+A LA++ G
Sbjct: 13 LKILAENGQKDFIYAVGYYYENGIRTSVNLTEAAVWYEKAAQKGHAEAAMRLALLYAQGK 72
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAREL 242
G KN+K + A+A G+ +AL +G C ++G GV+Q+ ++ + +A A
Sbjct: 73 GVEKNEKKAFTYMEQAAKA---GNTNALYNVGRCYEEGIGVKQDFSKAFDWYKKAAAE-- 127
Query: 243 AAVLSSAACQGISTRSWLTWNPL-SQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFG 297
+ C G +LT NP+ +P A + N N +L +G
Sbjct: 128 -GDFRAMCCMG---GYFLTGNPVPYEPAKAFQLFEKAANANIPAAQYNLSVLYRYG 179
>gi|307105600|gb|EFN53848.1| hypothetical protein CHLNCDRAFT_135935 [Chlorella variabilis]
Length = 923
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 326 GLRLCSHVGCGR---PETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
GLR CS+ GCG P + RRCS C V YC AC DW+ RHK C
Sbjct: 849 GLRRCSYPGCGNLAGPAEAALKTRRCSACRVVRYCGEACSHEDWR-RHKKGC 899
>gi|238604759|ref|XP_002396286.1| hypothetical protein MPER_03512 [Moniliophthora perniciosa FA553]
gi|215468545|gb|EEB97216.1| hypothetical protein MPER_03512 [Moniliophthora perniciosa FA553]
Length = 263
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 4/54 (7%)
Query: 324 GPGLRLCSHVGC---GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
GP + C + C P+ ++ ++RCSVC A YCSR+CQ DW HK C
Sbjct: 138 GPQMNFCLNEVCPLRENPQKKKIRWKRCSVCFATTYCSRSCQKQDWA-EHKKCC 190
>gi|222631994|gb|EEE64126.1| hypothetical protein OsJ_18958 [Oryza sativa Japonica Group]
Length = 144
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 66/132 (50%), Gaps = 5/132 (3%)
Query: 49 PDVAGKSDLFDALPDDLVVSILCKLSSTARCP-SDFVNVLITCKRMNGLALNSLVLSKAS 107
P +A + AL D +V I C +++T+ P +D +++ TCK M+ +A V S
Sbjct: 11 PSMASATMKELALRHDNIVHIACLVAATSSEPITDLLSLCATCKAMHAVAKECDVGSYVP 70
Query: 108 KKTFAIKANNWSESAHRFLKL--CADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAIS 165
+ + W E+ F+ + A N++AC+ +G+ + Q+ G + KAAI+
Sbjct: 71 LER--LDNMKWMENERYFIVVNHLVTADNLDACFIVGVTLVFAHQDMEQGLLFLDKAAIT 128
Query: 166 SHAQALYSLAVI 177
H A+Y L ++
Sbjct: 129 GHKAAVYVLGLL 140
>gi|409047429|gb|EKM56908.1| hypothetical protein PHACADRAFT_208075 [Phanerochaete carnosa
HHB-10118-sp]
Length = 606
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 31/62 (50%), Gaps = 4/62 (6%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDV 389
CSH PET +RC+ C YC+ CQ DW + HK +C +RW D + DV
Sbjct: 119 CSHCAGPAPET---GLKRCTGCRTTRYCNAMCQNSDWAV-HKMECQALQRWADAAPSEDV 174
Query: 390 DV 391
V
Sbjct: 175 AV 176
>gi|365989636|ref|XP_003671648.1| hypothetical protein NDAI_0H02310 [Naumovozyma dairenensis CBS 421]
gi|343770421|emb|CCD26405.1| hypothetical protein NDAI_0H02310 [Naumovozyma dairenensis CBS 421]
Length = 608
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
G+R C++V CG+ E +F +C C YCS+ CQ W+ H+ C PA
Sbjct: 477 GVRQCANVSCGKWEEYPKQFSKCHRCRRTKYCSKQCQVNSWEY-HRYWCHPA 527
>gi|46445788|ref|YP_007153.1| hypothetical protein pc0154 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46399429|emb|CAF22878.1| conserved hypothetical protein [Candidatus Protochlamydia
amoebophila UWE25]
Length = 299
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 16/121 (13%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSH--------AQALYS 173
A ++ KL AD G+ A Y LG+I + G G + + AI + A A Y+
Sbjct: 133 AFKYFKLAADQGHAVAQYNLGVIY-----DNGQGITQSEQEAIKYYKLAADQGDADAQYN 187
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L VI NG G +++D + L A G DA ELG +G G+ Q+ E ++
Sbjct: 188 LGVIYANGQGITQSDAEAFKYFKLAADQ---GDADAQYELGVRYANGQGITQSDTEAFKY 244
Query: 234 L 234
Sbjct: 245 F 245
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 54/114 (47%), Gaps = 10/114 (8%)
Query: 126 LKLCADAGNVEACYTLGMIRFYC-----LQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
+KL AD G+ +A Y LG+I Y Q+ AA HA A Y+L VI N
Sbjct: 101 IKLLADQGDAKAQYKLGVI--YANGRGITQSDTEAFKYFKLAADQGHAVAQYNLGVIYDN 158
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
G G ++++++ + AA G DA LG +G G+ Q+ AE ++
Sbjct: 159 GQGITQSEQE---AIKYYKLAADQGDADAQYNLGVIYANGQGITQSDAEAFKYF 209
>gi|402772856|ref|YP_006592393.1| Sel1 domain-containing protein repeat-containing protein
[Methylocystis sp. SC2]
gi|401774876|emb|CCJ07742.1| Sel1 domain protein repeat-containing protein [Methylocystis sp.
SC2]
Length = 291
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 52/115 (45%), Gaps = 6/115 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A+ + + A GN ++ Y LG++ ++ A + A A A+ AV+Q
Sbjct: 150 AYDYFRRAAAKGNADSLYALGVLYKTGRGVPKDEREAAEWFRRGAELDFAPAMVEFAVLQ 209
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
FNG G + D A +AA G+ A L H L +G GV N+AE R +
Sbjct: 210 FNGVGAPR---DRAAAAQWFRKAAAKGNAVAQNRLAHILAEGLGVEANLAEAREW 261
>gi|299740520|ref|XP_002910328.1| hypothetical protein CC1G_15236 [Coprinopsis cinerea okayama7#130]
gi|298404285|gb|EFI26834.1| hypothetical protein CC1G_15236 [Coprinopsis cinerea okayama7#130]
Length = 642
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADCAPAER-WLDEEGNG 387
+ CS C VNYCS+ACQ DW RH+ +CA R ++D++ G
Sbjct: 412 KTCSACHWVNYCSQACQRADWDRRHRTECASMRRCYIDDKLGG 454
>gi|91977402|ref|YP_570061.1| hypothetical protein RPD_2933 [Rhodopseudomonas palustris BisB5]
gi|91683858|gb|ABE40160.1| Tetratricopeptide TPR_2 [Rhodopseudomonas palustris BisB5]
Length = 287
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 61/133 (45%), Gaps = 7/133 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
A + + ADAGN L + + G +L+AK A ++A+A Y L ++ G
Sbjct: 142 ARQLFERAADAGNPRGVSNLAALGGGTTSDPGKTRALLAKGA-ETNAEAQYQLGMMLAEG 200
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANA-- 239
GG K+D R L +A+ GH AL +G Q G G Q+ A + F +A A
Sbjct: 201 VGGPKDDVAAR---TLFEKASAQGHAGALERMGAFAQAGRGGPQDTAAAKGFYEKAAALG 257
Query: 240 -RELAAVLSSAAC 251
+ AA L A C
Sbjct: 258 NSDAAAALKRADC 270
>gi|393229522|gb|EJD37143.1| hypothetical protein AURDEDRAFT_173777 [Auricularia delicata
TFB-10046 SS5]
Length = 444
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 59/134 (44%), Gaps = 24/134 (17%)
Query: 284 HVTGIGCPLLSDFGC--NVP-APEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPET 340
++ +G + D GC NVP P+ +FL + GG R C GCG+ E
Sbjct: 295 YLCWLGTAVGDDAGCTHNVPDIPKLELLYKFL---LSTIGG------RWCCGPGCGKNEV 345
Query: 341 RRHE----FRRCSVCGAVNYCSRACQALDWK---LRHKADC---APAER--WLDEEGNGD 388
E FRRC +C V YC R CQ DWK RH+ C A A R + EG+
Sbjct: 346 SFEEGVKHFRRCELCLLVRYCGRDCQRKDWKHGAPRHRDICPILAAAHRRGVVITEGHEK 405
Query: 389 VDVDLNDAEGGGED 402
+ DAE ED
Sbjct: 406 FLESILDAEMEAED 419
>gi|169855749|ref|XP_001834540.1| hypothetical protein CC1G_10858 [Coprinopsis cinerea okayama7#130]
gi|116504369|gb|EAU87264.1| hypothetical protein CC1G_10858 [Coprinopsis cinerea okayama7#130]
Length = 678
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 6/77 (7%)
Query: 310 RFLAEWFAARGGTP--GPGLRLCSHVGCGRP--ETRRHEF--RRCSVCGAVNYCSRACQA 363
RF A+ TP P + C + C + R + R+CS C + YCS CQ
Sbjct: 434 RFPDNPVASYHSTPRFFPKVYFCDNFQCTTSPLDLRSQDLSSRKCSACSLMVYCSLECQH 493
Query: 364 LDWKLRHKADCAPAERW 380
DW LRH+ +C PA +
Sbjct: 494 QDWVLRHREECIPARFY 510
>gi|237746450|ref|ZP_04576930.1| Sel1 repeat-containing protein [Oxalobacter formigenes HOxBLS]
gi|229377801|gb|EEO27892.1| Sel1 repeat-containing protein [Oxalobacter formigenes HOxBLS]
Length = 1527
Score = 46.6 bits (109), Expect = 0.021, Method: Composition-based stats.
Identities = 37/116 (31%), Positives = 58/116 (50%), Gaps = 6/116 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGM---IRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A ++L+ A+ G+ A Y LG+ I +N + KAA SHA+A Y L V+
Sbjct: 745 ALQWLEKAAENGDPAAQYRLGVENHIGKRLPENPEAARQWYRKAADQSHAEAQYWLGVLT 804
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
NG GG KN + ++A GH +A +LG +DG + +N + R++L
Sbjct: 805 LNGEGGEKNPAE---AFRWMEKSAKNGHAEAQYQLGLAFRDGDIIPENKPKARQWL 857
Score = 38.5 bits (88), Expect = 6.6, Method: Composition-based stats.
Identities = 31/91 (34%), Positives = 43/91 (47%), Gaps = 10/91 (10%)
Query: 147 YCLQNRGSGASLMAKA-------AISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCA 199
+CLQ S A ++KA A + +A A Y+L ++ NG G KN L A
Sbjct: 622 HCLQEGTSDAQDVSKALVWFEKAAKNGNADAQYTLGILNHNGEGIPKN---LEAARRWYR 678
Query: 200 RAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
+AA GH+ A L CL +G G +N E
Sbjct: 679 QAAEQGHVKAQYWLATCLLEGLGGPENPEEA 709
>gi|237748063|ref|ZP_04578543.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
gi|229379425|gb|EEO29516.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
Length = 1238
Score = 46.6 bits (109), Expect = 0.021, Method: Composition-based stats.
Identities = 41/133 (30%), Positives = 61/133 (45%), Gaps = 23/133 (17%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL--------MAKAAI 164
IKA W E A A G+ A Y LG+ L + G G + ++ AA
Sbjct: 49 IKARIWFEKA-------ATEGDTRAAYPLGL-----LYSAGLGTPIDYDKAFYWLSIAAR 96
Query: 165 SSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
+ A Y LA + G G +K++++ V +AA GHIDA R +G L G GV
Sbjct: 97 QNIPDAQYRLAGLYQEGKGTAKSEREFAYWVK---KAAGNGHIDAQRAMGMILHYGLGVH 153
Query: 225 QNIAEGRRFLVQA 237
+N+ E ++ +A
Sbjct: 154 KNLPESVKWFEKA 166
Score = 40.8 bits (94), Expect = 1.2, Method: Composition-based stats.
Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 6/93 (6%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
++K A G+++A +GMI Y L +N KAA + +A A Y L + NG
Sbjct: 126 WVKKAAGNGHIDAQRAMGMILHYGLGVHKNLPESVKWFEKAANAGNATAQYYLGMDYMNG 185
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELG 214
+G +KN+ R G RAA H++A LG
Sbjct: 186 NGLAKNE---REGEKWLYRAAMQDHLEAQTYLG 215
Score = 38.9 bits (89), Expect = 4.8, Method: Composition-based stats.
Identities = 31/120 (25%), Positives = 57/120 (47%), Gaps = 6/120 (5%)
Query: 125 FLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
L AD G+ A YTLG+ ++ S + K+A + A A Y L ++ +G
Sbjct: 345 LLMRAADKGDSFAQYTLGLAYIDGNGIAKDERKAFSWLEKSASQNRASAQYFLGLMYLDG 404
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARE 241
+G N++ G+ L A G++ A +LG G + +++AE R++ A +++
Sbjct: 405 TGTPVNEEK---GIRLLKELAKTGYVYAQYKLGTYAHSGLHMAKDLAEARKWYQLAASQD 461
>gi|242208886|ref|XP_002470292.1| predicted protein [Postia placenta Mad-698-R]
gi|220730599|gb|EED84453.1| predicted protein [Postia placenta Mad-698-R]
Length = 380
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
L+ CS GCGR ET +F+ C C YC + CQ DWK HK C
Sbjct: 314 LKTCSRPGCGRRETTVAQFKCCGACKDAVYCGQECQKKDWKA-HKPKC 360
>gi|344343559|ref|ZP_08774427.1| Sel1 domain protein repeat-containing protein [Marichromatium
purpuratum 984]
gi|343804982|gb|EGV22880.1| Sel1 domain protein repeat-containing protein [Marichromatium
purpuratum 984]
Length = 276
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 54/121 (44%), Gaps = 9/121 (7%)
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNR------GSGASLMAKAAISSHAQALYSLAVIQF 179
L AD G+ EA Y LG+ R G A +AKAA H +A + +
Sbjct: 110 LLTAADTGDAEAQYQLGLAIVQRHWERGEQRRLGEAADWIAKAAEQDHVRAQLVMGGLYE 169
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANA 239
G G +D + +A RAA G A+ LG L+ G GV +N+ E +L A+A
Sbjct: 170 KGRG---VIQDYESALAWYRRAATQGDPQAMARLGRMLRTGRGVEKNLVEAYVWLNLASA 226
Query: 240 R 240
R
Sbjct: 227 R 227
>gi|393237072|gb|EJD44617.1| hypothetical protein AURDEDRAFT_152115 [Auricularia delicata
TFB-10046 SS5]
Length = 700
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 39/86 (45%), Gaps = 19/86 (22%)
Query: 308 ASRFLAEWFAARGGTPG----------------PGLRLCSHV---GCGRPETRRHEFRRC 348
+S+ LA W AA GG P L+L + V CG R RRC
Sbjct: 610 SSKQLAAWRAALGGDISANANISWRHAPMSLLNPMLQLNNPVEMRKCGHCAKRSTLLRRC 669
Query: 349 SVCGAVNYCSRACQALDWKLRHKADC 374
C V YCS+ CQA DW++ HKA C
Sbjct: 670 GSCRDVRYCSKDCQAKDWRIGHKAAC 695
>gi|422341261|ref|ZP_16422202.1| hypothetical protein HMPREF9353_00865 [Treponema denticola F0402]
gi|325474832|gb|EGC78018.1| hypothetical protein HMPREF9353_00865 [Treponema denticola F0402]
Length = 784
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 49/126 (38%), Gaps = 16/126 (12%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGA--------SLMAKAAISSHAQAL 171
E A + A+ GNV A Y+LG + N G G KAA H+ A
Sbjct: 311 EKAFEWYTKAAEQGNVYAQYSLGF-----MYNNGQGTKKDEKKAVEWYTKAAEQGHSSAQ 365
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y L + NG G KD + V +AA GH A LG +G GV N +
Sbjct: 366 YFLGFMYDNGQGTK---KDEKKAVEWYTKAAEQGHSSAQNNLGTIYANGTGVEINYKKAF 422
Query: 232 RFLVQA 237
+A
Sbjct: 423 ELYTRA 428
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 5/121 (4%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
N SE+ + K AD G A LG I + + A++ AA HA +L
Sbjct: 205 NTSEAIKWYTK-AADQGVANAQNNLGWI-YEDREEYNRAAAMYLMAAQQGHASGQNNLGR 262
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ +NG G DKD + +AA G++ A LG DGYGV +N + + +
Sbjct: 263 MYYNGYGV---DKDYKQAFEWYTKAAEQGNVYAQSNLGGMYYDGYGVDKNYEKAFEWYTK 319
Query: 237 A 237
A
Sbjct: 320 A 320
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 56/124 (45%), Gaps = 16/124 (12%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYS 173
A + K AD GN A LG+ C N G G L KA +A+A +
Sbjct: 101 AFEWFKKGADQGNANAQNALGV----CYAN-GQGVEKNYTIAIDLYKKAIEQGNAKAQNN 155
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L + ++G+G DK+ L +AA G+ A LG+ ++G GV +N +E ++
Sbjct: 156 LGNMYYDGNGV---DKNYEKAFELYKKAAEQGNAYAQDNLGYMYENGEGVEKNTSEAIKW 212
Query: 234 LVQA 237
+A
Sbjct: 213 YTKA 216
>gi|344925131|ref|ZP_08778592.1| Sel1 domain-containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 900
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 54/111 (48%), Gaps = 6/111 (5%)
Query: 119 SESAHRFLKLCADAGNVEACYTLG---MIRFYCLQNRGSGASLMAKAAISSHAQALYSLA 175
E A + K AD G+ EA + L +I + ++ G +L KAA H A + +A
Sbjct: 97 QEKAFEWFKRAADQGHAEAQFRLSNCYVIGYGVTKDLEKGLALCEKAAEQGHVAAQFIVA 156
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
F G G +KD++ G A +AA G+ A LG+C G+G+ ++
Sbjct: 157 RSYFRGEG---VNKDVKQGYAWAEKAAAQGYAKAQLILGNCYLTGFGIEKD 204
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 66/136 (48%), Gaps = 7/136 (5%)
Query: 109 KTFAIKANNWSES-AHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAI 164
K ++IK ++ES + L L A+A + +A Y L I +++ +AA
Sbjct: 50 KDYSIKKRRFTESNSFEALLLQAEANDPQAQYELAEIYKKGEIISEDQEKAFEWFKRAAD 109
Query: 165 SSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
HA+A + L+ G G +K DL G+ALC +AA GH+ A + G GV
Sbjct: 110 QGHAEAQFRLSNCYVIGYGVTK---DLEKGLALCEKAAEQGHVAAQFIVARSYFRGEGVN 166
Query: 225 QNIAEGRRFLVQANAR 240
+++ +G + +A A+
Sbjct: 167 KDVKQGYAWAEKAAAQ 182
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KAA +HA+A LA+ + G G + D ++LC +AA G +A LG+C G
Sbjct: 430 KAAAQNHAKAQVELALCYYTGHGVTA---DPVQAISLCQKAAEQGLAEAQCRLGNCYLSG 486
Query: 221 YGVRQNIAEGRRFLVQANARELA 243
YGV +N+ + + +A + LA
Sbjct: 487 YGVERNVEKAFEWFRKAADQGLA 509
>gi|359298722|ref|ZP_09184561.1| Sel1 domain-containing protein [Haemophilus [parainfluenzae] CCUG
13788]
gi|402306033|ref|ZP_10825085.1| Sel1 repeat protein [Haemophilus sputorum HK 2154]
gi|400375448|gb|EJP28347.1| Sel1 repeat protein [Haemophilus sputorum HK 2154]
Length = 321
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 58/122 (47%), Gaps = 16/122 (13%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLA 175
++ + A+ G+++A + LG+ + ++G G KAA H A Y+L
Sbjct: 121 KWYRKAAEQGHIDAQFNLGV-----MYSKGEGVKQDDIEAVKWYRKAAEQGHKNAQYNLG 175
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
V+ ++G G + D +AA GHI+AL LG DG GV+Q+ E ++
Sbjct: 176 VMYYDGRGVKQ---DYLEAAKWYRKAADQGHINALFNLGVIYYDGRGVKQDYLETAKWYR 232
Query: 236 QA 237
+A
Sbjct: 233 KA 234
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 55/120 (45%), Gaps = 6/120 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ + A+ G+ A Y LG++ + Q+ A KAA H AL++L VI
Sbjct: 154 EAVKWYRKAAEQGHKNAQYNLGVMYYDGRGVKQDYLEAAKWYRKAADQGHINALFNLGVI 213
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++G G + D +AA GH DA LG G GV+Q+ E ++ +A
Sbjct: 214 YYDGRGVKQ---DYLETAKWYRKAAEQGHRDAQFNLGVMYSKGEGVKQDYFEAAKWYRKA 270
>gi|393243137|gb|EJD50653.1| hypothetical protein AURDEDRAFT_112233, partial [Auricularia
delicata TFB-10046 SS5]
Length = 257
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 31/69 (44%), Gaps = 16/69 (23%)
Query: 312 LAEWFAARGGTPGP---------GLRLCSHVGCGRPETRRHEFRR----CSVCGAVNYCS 358
L W GGT P R+C+ GCGRPE HE + CS C V YCS
Sbjct: 126 LDAWLERAGGTYEPMHDLLRTVGDRRICAAAGCGRPE---HELEKRLLVCSQCHLVRYCS 182
Query: 359 RACQALDWK 367
+ CQ WK
Sbjct: 183 KECQKAHWK 191
>gi|397629558|gb|EJK69411.1| hypothetical protein THAOC_09337 [Thalassiosira oceanica]
Length = 302
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A+Y L +NG+ G + KD+R V L +AA LG I A ELG+ +G GV+Q+ A
Sbjct: 149 AIYFLGQRYYNGTHGLQ--KDMRKAVELWTKAAELGSIGAHFELGNAYDNGKGVQQDNAT 206
Query: 230 GRRFLVQA 237
+F V+A
Sbjct: 207 AVKFFVKA 214
>gi|397639664|gb|EJK73694.1| hypothetical protein THAOC_04668 [Thalassiosira oceanica]
Length = 447
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 58/254 (22%), Positives = 92/254 (36%), Gaps = 38/254 (14%)
Query: 154 SGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALREL 213
+G + KAA ++ ALY L+ +G S+ +K L RAA LG A L
Sbjct: 190 TGLEWVNKAAAQNYPPALYELSFFHRHGIT-SELEKSEEKANELLLRAANLGFALANNAL 248
Query: 214 GHCL---QDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHP 270
C DG+ + ++ F +A+ + A + + T + P P+
Sbjct: 249 AKCYFHGADGFEMDRD-----EFYFRASVAFALDNTNEEAAKMLGTLHHMKRVPEPSPYL 303
Query: 271 APHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGP----- 325
A + N N + + S + C++ + + L +A G P
Sbjct: 304 ACYYLNIVVNESTGSNIVASEDNSGYACHLYSKSLLHLAEHLHGGYARNGSNAVPTVFFW 363
Query: 326 ----------------------GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQA 363
G LC V C + +F++CS C A YCS+ CQ
Sbjct: 364 LRKSLDLGFNDARELLKDLDVVGQSLC--VNCSKKAQTGEKFKQCSKCKAQWYCSKECQV 421
Query: 364 LDWKLRHKADCAPA 377
W+ HK DC A
Sbjct: 422 EAWRAGHKKDCKRA 435
>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 562
Score = 46.6 bits (109), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Query: 329 LCSHVG-CGRPET-RRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGN 386
L SH C P + R + +RC+ C V+YCS+ CQ DWK HK +C +RW +
Sbjct: 73 LSSHCSSCAAPASPERPQLKRCAKCHVVHYCSQKCQNADWKA-HKPECDALQRWATAAPS 131
Query: 387 GDVDV 391
D+ V
Sbjct: 132 PDLAV 136
>gi|344229702|gb|EGV61587.1| hypothetical protein CANTEDRAFT_94477 [Candida tenuis ATCC 10573]
Length = 812
Score = 46.2 bits (108), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
G+R C ++ CG+ ET EF +C C YC+R CQ W H+ C P+
Sbjct: 696 GVRQCGNLDCGKWETYPREFSKCRRCKRTKYCTRECQKKSWHC-HRNWCIPS 746
>gi|397565871|gb|EJK44804.1| hypothetical protein THAOC_36627 [Thalassiosira oceanica]
Length = 450
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 35/90 (38%), Gaps = 15/90 (16%)
Query: 298 CNVPAPEAHPASRFLAEWFAARGGTPG-------------PGLRLCSHVGCGRPETRRHE 344
C VP PA F A + G G R CS CGR +
Sbjct: 348 CEVPGYNVLPAVFFWASKYRDSDMNKGCKMELEVPKEIEKLGQRFCS--SCGREAQADEK 405
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADC 374
F++CS C A YCS+ CQ WK H DC
Sbjct: 406 FKQCSRCKAQWYCSKECQVEAWKAGHNKDC 435
>gi|307106233|gb|EFN54479.1| hypothetical protein CHLNCDRAFT_135138 [Chlorella variabilis]
Length = 470
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 23/41 (56%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
CG+ E+ RRCS C V YC CQ DW+ RH +CA
Sbjct: 152 CGKAESAVLALRRCSACKCVAYCCTPCQHADWQARHSGECA 192
>gi|430814114|emb|CCJ28601.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 555
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 9/55 (16%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
G+R C+++ CGR ET +F +C C YCS+ CQ+ W P RW
Sbjct: 502 GIRQCAYIDCGRWETFNRQFAKCRRCRRTKYCSKQCQSRAW---------PGHRW 547
>gi|403172974|ref|XP_003332082.2| hypothetical protein PGTG_13449 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375170076|gb|EFP87663.2| hypothetical protein PGTG_13449 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1295
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 31/50 (62%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E+ EF +C C YCS++CQ+ W+ H+ C+
Sbjct: 1065 GIRQCANMQCGRWESVPREFAKCRRCRKAKYCSKSCQSRAWQHGHRFWCS 1114
>gi|303282879|ref|XP_003060731.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458202|gb|EEH55500.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 375
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
CG + +C C V YC +ACQ DWK RHKA+C PA+ +
Sbjct: 262 CGARAAGVERYSQCGGCHLVAYCGKACQKEDWKKRHKAECRPAKSF 307
>gi|260783057|ref|XP_002586595.1| hypothetical protein BRAFLDRAFT_106189 [Branchiostoma floridae]
gi|229271713|gb|EEN42606.1| hypothetical protein BRAFLDRAFT_106189 [Branchiostoma floridae]
Length = 246
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEE 384
C + CG+P R+ +RCS C YCSR CQ W + HK C ++++DEE
Sbjct: 175 CCNPKCGKPGYRK-TLKRCSQCKLTRYCSRDCQIQHWSVGHKK-CCGQDKYIDEE 227
>gi|303280167|ref|XP_003059376.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459212|gb|EEH56508.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 363
Score = 46.2 bits (108), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 33/61 (54%), Gaps = 8/61 (13%)
Query: 348 CSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDL---NDAEGGGEDNV 404
C+ CG V YC + CQ DW RHKA+C P + E GD+ V L + EGG ED
Sbjct: 262 CAACGLVAYCGKPCQRADWAARHKAECRPKK----ELRRGDM-VQLPPTREGEGGTEDAT 316
Query: 405 V 405
V
Sbjct: 317 V 317
>gi|123474537|ref|XP_001320451.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121903256|gb|EAY08228.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 793
Score = 46.2 bits (108), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 12/125 (9%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGS------GASLMAKAAISSHAQALY 172
+ A RF L AD G+ A + G I ++ +G G M AA +++ A+
Sbjct: 471 EDDALRFYMLAADQGHELALFNAGRI---LMKKKGDTDAISRGIEYMRLAAEKNNSLAML 527
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
L + +G +N ++ G L + L + AL +LG C G G +QNI E RR
Sbjct: 528 KLGQLLLSGKLVERNAEE---GFQLIKSSVELKNALALNDLGECYLYGNGCQQNITEARR 584
Query: 233 FLVQA 237
+ QA
Sbjct: 585 YFEQA 589
>gi|449542372|gb|EMD33351.1| hypothetical protein CERSUDRAFT_108147 [Ceriporiopsis subvermispora
B]
Length = 885
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEE 384
G+R C+++ CGR E+ EF +C C YC + CQ+ W H+ C+ E D E
Sbjct: 658 GIRQCANMLCGRWESYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCSAKETEEDPE 716
>gi|343506976|ref|ZP_08744429.1| hypothetical protein VII00023_01845 [Vibrio ichthyoenteri ATCC
700023]
gi|342800878|gb|EGU36382.1| hypothetical protein VII00023_01845 [Vibrio ichthyoenteri ATCC
700023]
Length = 335
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 52/115 (45%), Gaps = 4/115 (3%)
Query: 123 HRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
+++LK AD+GN EA + +Y + + KA LYSL V+ F+G
Sbjct: 71 YQYLKTVADSGNQEAIIAMA-DHYYDEEEYQQSLAWYHKAEAGKDPYVLYSLGVMYFDGE 129
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G D+ G AA G+ DA+ +L DG G+ Q+ ++ + Q+
Sbjct: 130 GTVA---DMVKGNEYYRAAAEAGYSDAMYQLAFSYNDGAGIGQDYSKAAYWFEQS 181
>gi|449550074|gb|EMD41039.1| hypothetical protein CERSUDRAFT_111609 [Ceriporiopsis subvermispora
B]
Length = 397
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 1/53 (1%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
LR+C CG+ E + F+RC C YCS+ACQ LDW H+ C +++
Sbjct: 309 LRICCRKSCGKREDKPALFKRCGGCKEAIYCSQACQKLDWPA-HRRGCKESQQ 360
>gi|343513484|ref|ZP_08750586.1| hypothetical protein VIBRN418_04178 [Vibrio sp. N418]
gi|342802035|gb|EGU37479.1| hypothetical protein VIBRN418_04178 [Vibrio sp. N418]
Length = 335
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 4/115 (3%)
Query: 123 HRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
+++LK A+AG+ +A +L +Y + + KA S+ LYSL V+ F+G
Sbjct: 71 YQYLKTAAEAGSHQAAISLA-DYYYDEEEYQQSLAWYHKAESSNDPYVLYSLGVMYFDGE 129
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G D+ G AA G+ DA+ +L DG G++Q+ + + Q+
Sbjct: 130 G---TPVDMAKGNEYYLAAAKAGYSDAMYQLAFSYNDGDGIKQDYTKAAYWFEQS 181
>gi|390594082|gb|EIN03497.1| hypothetical protein PUNSTDRAFT_147032 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1013
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 34/65 (52%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
G+R C+++ CGR E+ EF +C C YC + CQ+ W H+ C+ + D
Sbjct: 764 GIRQCANMLCGRWESYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCSAKDGDEDPHH 823
Query: 386 NGDVD 390
NG+ +
Sbjct: 824 NGNTE 828
>gi|307102755|gb|EFN51023.1| hypothetical protein CHLNCDRAFT_59342 [Chlorella variabilis]
Length = 175
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 29/54 (53%), Gaps = 5/54 (9%)
Query: 321 GTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G G+R+C+ GCG RRCS C AV YCS AC WK HKA+C
Sbjct: 115 GEVAAGVRVCAAEGCGNTSG----LRRCSRCRAVRYCSEACSHAHWKA-HKAEC 163
>gi|164659153|ref|XP_001730701.1| hypothetical protein MGL_2155 [Malassezia globosa CBS 7966]
gi|159104598|gb|EDP43487.1| hypothetical protein MGL_2155 [Malassezia globosa CBS 7966]
Length = 781
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ CQ+ W++ H+ C+
Sbjct: 422 GIRQCANMLCGKWEAYPREFAKCRRCRKAKYCSKQCQSKGWQMGHRFWCS 471
>gi|86146215|ref|ZP_01064540.1| FOG: TPR repeat protein, SEL1 subfamily protein [Vibrio sp. MED222]
gi|85835926|gb|EAQ54059.1| FOG: TPR repeat protein, SEL1 subfamily protein [Vibrio sp. MED222]
Length = 335
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 4/115 (3%)
Query: 123 HRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
+ ++ A++G V+A TL +Y + + KA S LYSL V+ F+G
Sbjct: 71 YEYMNQVAESGEVKAMITLA-DNYYYEEQYEKALAWYHKAETSKDPYVLYSLGVMYFDGE 129
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G DL+ G +A G+ DA+ +L +G GV Q+ ++ + Q+
Sbjct: 130 G---TPVDLKKGNDYYLASAKAGYSDAMYQLAFSYDEGQGVTQDFSKSAYWFEQS 181
>gi|406876717|gb|EKD26186.1| hypothetical protein ACD_79C01317G0003 [uncultured bacterium]
Length = 282
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 56/128 (43%), Gaps = 8/128 (6%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQ 169
+K NN + A F A G+ + Y +G +N K+A+ +
Sbjct: 142 VKKNN--KKAFEFFLKAAHLGHSRSLYKVGKCYVEGKGVTKNYKEALKWFKKSALEKDPE 199
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
AL L + NG G K D + + C +A LG A LG C DG GV++NI +
Sbjct: 200 ALSYLGICFHNGWGVKK---DFKKSIEYCKKALSLGDDIANYNLGLCYLDGEGVKKNINK 256
Query: 230 GRRFLVQA 237
+++ ++A
Sbjct: 257 AKKYFIKA 264
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 56/126 (44%), Gaps = 16/126 (12%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQ--------AL 171
++A ++ L A G EA TLG+ FY G G K AI + A
Sbjct: 75 KNAFKYYLLAAKLGEPEAITTLGVCYFY-----GKGIKKNVKKAIKYYRLATKGKDICAY 129
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
++L + G G KN+K +AA LGH +L ++G C +G GV +N E
Sbjct: 130 FNLGLCYEWGDGVKKNNKK---AFEFFLKAAHLGHSRSLYKVGKCYVEGKGVTKNYKEAL 186
Query: 232 RFLVQA 237
++ ++
Sbjct: 187 KWFKKS 192
>gi|313230254|emb|CBY07958.1| unnamed protein product [Oikopleura dioica]
Length = 153
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 2/59 (3%)
Query: 333 VGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDV 391
+ C + + F+ C+ C + YCSR CQA DW++ HK DC + W + GD V
Sbjct: 1 MACNHCKKKESSFQ-CAACNRIRYCSRKCQASDWRM-HKQDCKKKDNWEPKHKCGDCTV 57
>gi|260786628|ref|XP_002588359.1| hypothetical protein BRAFLDRAFT_81498 [Branchiostoma floridae]
gi|229273520|gb|EEN44370.1| hypothetical protein BRAFLDRAFT_81498 [Branchiostoma floridae]
Length = 296
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEE 384
C + CG+P R+ +RCS C YCSR CQ W + HK C ++++DEE
Sbjct: 175 CCNPKCGKPGYRK-TLKRCSQCKLTRYCSRDCQIQHWTVGHKK-CCGRDKYIDEE 227
>gi|46446345|ref|YP_007710.1| hypothetical protein pc0711 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46399986|emb|CAF23435.1| conserved hypothetical protein [Candidatus Protochlamydia
amoebophila UWE25]
Length = 412
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 6/118 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY--CLQNRGSGASLMAK-AAISSHAQALYSLAV 176
+ A ++ KL AD G+ A Y LG+I Y C+ N A K AA H A YSL +
Sbjct: 203 QEAFKYFKLAADQGHATAQYKLGIIYGYGRCVTNSEQEAFKYYKLAADQGHVMAQYSLGL 262
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
G G ++ ++ L A GH A +LG ++G GV+++ E ++
Sbjct: 263 TYAYGWGVKQSKQEAFKYFKLAADQ---GHAKAQYQLGDTYKNGRGVKRSKQEAIKYY 317
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 46/102 (45%), Gaps = 6/102 (5%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALY 172
N + A ++ KL AD G+V A Y+LG+ Y Q++ AA HA+A Y
Sbjct: 235 TNSEQEAFKYYKLAADQGHVMAQYSLGLTYAYGWGVKQSKQEAFKYFKLAADQGHAKAQY 294
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELG 214
L NG G ++ ++ + AA G+ DA LG
Sbjct: 295 QLGDTYKNGRGVKRSKQE---AIKYYKLAADQGYADAQYYLG 333
>gi|389748518|gb|EIM89695.1| hypothetical protein STEHIDRAFT_73908 [Stereum hirsutum FP-91666
SS1]
Length = 401
Score = 45.8 bits (107), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKA 372
LR C GC + ET+ +EF+RC+ C YCS+ CQ W KA
Sbjct: 309 LRECCMPGCKKSETKPNEFKRCADCKKTWYCSKECQKKHWPDHRKA 354
>gi|218709734|ref|YP_002417355.1| hypothetical protein VS_1745 [Vibrio splendidus LGP32]
gi|218322753|emb|CAV18930.1| Conserved hypothetical protein [Vibrio splendidus LGP32]
Length = 335
Score = 45.8 bits (107), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 4/115 (3%)
Query: 123 HRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
+ ++ A++G V+A TL +Y + + KA S LYSL V+ F+G
Sbjct: 71 YEYMNQVAESGEVKAMITLA-DNYYYEEQYEKALAWYHKAETSKDPYVLYSLGVMYFDGE 129
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G DL+ G +A G+ DA+ +L +G GV Q+ ++ + Q+
Sbjct: 130 G---TPIDLKKGNDYYLASAKAGYSDAMYQLAFSYDEGQGVTQDFSKSAYWFEQS 181
>gi|320159276|ref|YP_004191654.1| hypothetical protein VVMO6_04429 [Vibrio vulnificus MO6-24/O]
gi|319934588|gb|ADV89451.1| TPR repeat protein SEL1 subfamily [Vibrio vulnificus MO6-24/O]
Length = 334
Score = 45.8 bits (107), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 50/113 (44%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+LK AD N+EA TL +Y + L K A A ALY L V+ F+G G
Sbjct: 73 YLKQSADLDNLEALSTLAE-HYYYEEAYQQALPLYQKLAEKGDAIALYYLGVMYFDGEG- 130
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++D + G +A G DA+ +L DG G+ Q+ + + A
Sbjct: 131 --VEQDQKKGNEYYLASAQKGDEDAMFQLAFSYNDGLGIEQDFKQANHWFASA 181
>gi|254472523|ref|ZP_05085923.1| Sel1 domain protein repeat-containing protein [Pseudovibrio sp.
JE062]
gi|211958806|gb|EEA94006.1| Sel1 domain protein repeat-containing protein [Pseudovibrio sp.
JE062]
Length = 601
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 58/123 (47%), Gaps = 7/123 (5%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQAL 171
+ N+ ++A +F +L A+ G+ A + LG I L Q+ +AA A
Sbjct: 311 SQNYPKAAEKF-RLAAEQGHANAQFNLGRIYEIGLGVDQDYNEALKWYIRAAEQGVVDAQ 369
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+LAV+ NG+G S +DL A AA GH DA LG G GV Q+ A
Sbjct: 370 YNLAVMYANGTGIS---QDLAEAAAWYHFAAKQGHPDAQYNLGFLYATGQGVEQDEATAA 426
Query: 232 RFL 234
R++
Sbjct: 427 RWV 429
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 55/112 (49%), Gaps = 10/112 (8%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLA 175
+A R+++L A+ G+ EA Y +G R Y QN A+ AA +HA+A ++L
Sbjct: 424 TAARWVRLAANQGHAEAQYRIG--RAYEDGVGVEQNHTEAANWYYLAATQNHAKAQFALG 481
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
+ G G +++ + AA G+++A +G G GV+QNI
Sbjct: 482 RVYAIGLGVPQDEVE---AAKWVLHAAERGYLEAQYRIGLAFYKGSGVKQNI 530
Score = 37.7 bits (86), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 60/140 (42%), Gaps = 13/140 (9%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALY 172
+++E+ +++ A+ G V+A Y L ++ Q+ A+ AA H A Y
Sbjct: 348 QDYNEALKWYIR-AAEQGVVDAQYNLAVMYANGTGISQDLAEAAAWYHFAAKQGHPDAQY 406
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+L + G G +++ V L A GH +A +G +DG GV QN E
Sbjct: 407 NLGFLYATGQGVEQDEATAARWVRLAANQ---GHAEAQYRIGRAYEDGVGVEQNHTEA-- 461
Query: 233 FLVQANARELAAVLSSAACQ 252
AN LAA + A Q
Sbjct: 462 ----ANWYYLAATQNHAKAQ 477
>gi|421591576|ref|ZP_16036409.1| hypothetical protein RCCGEPOP_20912 [Rhizobium sp. Pop5]
gi|403702937|gb|EJZ19324.1| hypothetical protein RCCGEPOP_20912 [Rhizobium sp. Pop5]
Length = 859
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 54/122 (44%), Gaps = 6/122 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACY---TLGMIRFYCLQNRGSGASLMAKAAISSHAQAL 171
AN +++A +K DAG+V A L +I N + + A A AL
Sbjct: 555 ANRDTKTAFETIKKAMDAGHVRAISEISALYLIGASVPTNLEKSSEIAAMGAKKGDPFAL 614
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+ + G G D + G+ L +AA LGH A+ ELG+ +G V ++ G
Sbjct: 615 YAYGKSLYYGRGAKA---DTQEGLKLMLQAADLGHTYAMNELGYIFLNGVNVTADVERGV 671
Query: 232 RF 233
RF
Sbjct: 672 RF 673
>gi|237748455|ref|ZP_04578935.1| Sel1 domain-containing protein [Oxalobacter formigenes OXCC13]
gi|229379817|gb|EEO29908.1| Sel1 domain-containing protein [Oxalobacter formigenes OXCC13]
Length = 399
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 59/126 (46%), Gaps = 6/126 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAV 176
++A + K A+ G +A Y LG+I L Q+ S +AA A Y+L +
Sbjct: 102 KAAIEWYKKAAEQGFAQAQYNLGVIYQNGLGIKQDFDSAVQWYERAANQGFVLAQYNLGM 161
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ G+G KN K G+ +AA G+ A LG +G GVR+N E ++ +
Sbjct: 162 LYITGAGVGKNPKR---GILWLRKAAEGGYGQAQHNLGTVYYEGIGVRKNYPEAVQWFAK 218
Query: 237 ANAREL 242
A +EL
Sbjct: 219 AAKQEL 224
>gi|340714736|ref|XP_003395881.1| PREDICTED: hypothetical protein LOC100649411 [Bombus terrestris]
Length = 376
Score = 45.8 bits (107), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 34/60 (56%), Gaps = 2/60 (3%)
Query: 316 FAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+R TPG +++ + VGC R + RCS C AV+YC++ Q DWK RHK CA
Sbjct: 5 IVSRAQTPGGDVQIGTTVGCVIC-NRTDKLLRCSRCKAVSYCTKEHQRRDWK-RHKEFCA 62
>gi|392545490|ref|ZP_10292627.1| Sel1 domain-containing protein [Pseudoalteromonas rubra ATCC 29570]
Length = 555
Score = 45.8 bits (107), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 62/134 (46%), Gaps = 11/134 (8%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLG-MIRF--YCLQNRGSGASLMAKAAISSHAQALY 172
N+ + + F K A+ G A Y+LG M R Y Q+ + L +AA HA A Y
Sbjct: 42 QNYERAVYWFGK-AAEQGEAHAQYSLGYMYRMGQYFEQSDEAAVKLYQQAAEQGHASAQY 100
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+L V+ G G +++D V L +AA G+ A ELG G G+++N E +
Sbjct: 101 NLGVLCVRGEGITQSDT---KAVKLYRKAAQQGNALAQYELGFMYARGNGIKKNDTEAVK 157
Query: 233 FLV----QANAREL 242
+ Q N R L
Sbjct: 158 WYRLASDQGNTRAL 171
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 56/120 (46%), Gaps = 12/120 (10%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS------LMAKAAISSHAQALYS 173
E+A + + A+ G+ A Y LG++ C++ G S L KAA +A A Y
Sbjct: 81 EAAVKLYQQAAEQGHASAQYNLGVL---CVRGEGITQSDTKAVKLYRKAAQQGNALAQYE 137
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L + G+G KND + V A+ G+ AL LG+ G GV+Q+ E ++
Sbjct: 138 LGFMYARGNGIKKNDTE---AVKWYRLASDQGNTRALCNLGYMYDFGRGVKQSSKEANKY 194
>gi|343500082|ref|ZP_08737998.1| hypothetical protein VITU9109_08932 [Vibrio tubiashii ATCC 19109]
gi|418478556|ref|ZP_13047656.1| hypothetical protein VT1337_09147 [Vibrio tubiashii NCIMB 1337 =
ATCC 19106]
gi|342821286|gb|EGU56073.1| hypothetical protein VITU9109_08932 [Vibrio tubiashii ATCC 19109]
gi|384573798|gb|EIF04285.1| hypothetical protein VT1337_09147 [Vibrio tubiashii NCIMB 1337 =
ATCC 19106]
Length = 320
Score = 45.8 bits (107), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 57/125 (45%), Gaps = 4/125 (3%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
AH +L+ +G+ + TL Y N S KA LYSL V+ F+G
Sbjct: 55 AHEYLEQALKSGH-DGAETLKADLEYSDGNYEEALSWYHKAEDGQDPYVLYSLGVMYFDG 113
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARE 241
G ++D++ G +A LG DA+ +L DG GV ++ A+ + QA ++
Sbjct: 114 EG---TEQDVKKGNEYYLASAKLGDDDAMYQLAFSYNDGVGVEKDYAKAAYWFEQAAKQQ 170
Query: 242 LAAVL 246
A+ +
Sbjct: 171 NASAI 175
>gi|406695032|gb|EKC98347.1| regulation of budding-related protein [Trichosporon asahii var.
asahii CBS 8904]
Length = 963
Score = 45.8 bits (107), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 28/50 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E EF +C C YCS+ CQ+ W H+ C+
Sbjct: 731 GIRQCAYMSCGRWEVFPREFAKCRRCRKAKYCSKECQSKAWSEGHRFWCS 780
>gi|397591040|gb|EJK55255.1| hypothetical protein THAOC_25029, partial [Thalassiosira oceanica]
Length = 207
Score = 45.8 bits (107), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 2/109 (1%)
Query: 129 CADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKND 188
C A + R C N +++ +A++ L F GS G +
Sbjct: 93 CNMAAQKRGMFDCAFCRTPCPDNDDDKLAMLQARVKKKDPEAIFHLGQKYFFGSLGLQ-- 150
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD R V L A LG IDAL LG+ +G GV+Q+ +G FL +A
Sbjct: 151 KDTRKAVELFTEAVELGSIDALFSLGNAYFNGDGVQQDKDKGVYFLTKA 199
>gi|344925543|ref|ZP_08779004.1| Sel1 domain-containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 856
Score = 45.8 bits (107), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 62/129 (48%), Gaps = 11/129 (8%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQAL 171
N+ E+ ++ KL A GN+ + LG +Y +N +AA H+++
Sbjct: 715 NYKEAV-KWCKLAAKQGNIACQHDLG--HYYDVGQGVTKNLRKAFKWYMRAAEQGHSESQ 771
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y++ + + G G ++ D V RAA G DA ELGHC G+GV +++AE
Sbjct: 772 YNVGICFYEGQGVTR---DHHEAVKWYRRAAEQGDSDAYCELGHCYIYGHGVPRDLAEAL 828
Query: 232 RFLVQANAR 240
++ A A+
Sbjct: 829 KYYRMAAAK 837
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 119 SESAHRFLKLCADAGNVEA------CYTLGM-IRFYCLQNRGSGASLMAKAAISSHAQAL 171
++ A ++ K A G+VEA CY LG+ I+ QN L AA H A
Sbjct: 644 ADQAVKWYKCAAKQGSVEAQFWLGVCYDLGVGIK----QNYKEAVKLYRLAAEKGHEGAQ 699
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
+L+ G+G +N K+ V C AA G+I +LGH G GV +N+ +
Sbjct: 700 LNLSTCYHEGTGVERNYKE---AVKWCKLAAKQGNIACQHDLGHYYDVGQGVTKNLRKAF 756
Query: 232 RFLVQA 237
++ ++A
Sbjct: 757 KWYMRA 762
>gi|406660037|ref|ZP_11068172.1| Putative beta-lactamase hcpC precursor [Cecembia lonarensis LW9]
gi|405556171|gb|EKB51121.1| Putative beta-lactamase hcpC precursor [Cecembia lonarensis LW9]
Length = 445
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 60/124 (48%), Gaps = 7/124 (5%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYS 173
N+ E+ R +L AD G+ A + +G++ QN + +AA A A +
Sbjct: 236 NYQET-ERLYRLAADQGHPAAEFNMGILYRDGLSVQQNFAEAYNWFERAAKKGFANAEFE 294
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+ ++ +NG+G K+D + V +AA + A R LG +GYGV+++ E ++
Sbjct: 295 IGLLYYNGTGVKKDDTE---AVKWYLKAAEQEYTLAERSLGIMYANGYGVKKDDTEAVKW 351
Query: 234 LVQA 237
+A
Sbjct: 352 YRKA 355
>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 552
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 1/47 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDV 391
+RC+ C + YCS ACQ DW + HK +CA +RW + DV +
Sbjct: 80 LKRCTRCRVIWYCSTACQNNDWTM-HKPECAAIQRWAANAPSPDVSI 125
>gi|325267697|ref|ZP_08134348.1| TPR repeat protein [Kingella denitrificans ATCC 33394]
gi|324980821|gb|EGC16482.1| TPR repeat protein [Kingella denitrificans ATCC 33394]
Length = 702
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 52/117 (44%), Gaps = 6/117 (5%)
Query: 124 RFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
++ + A+ G+ EA LG++ + QN +AA AQA Y+L V+
Sbjct: 102 KWYRQAAEQGDAEAQSNLGVMYDKGYGVRQNYAEAVKWYRQAAEQGFAQAQYNLGVMYET 161
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G + D V +AA G +A LG G GVRQN AE R+ QA
Sbjct: 162 GRGVRQ---DYAEAVKWYRQAAEQGDAEAQNNLGAMYDSGQGVRQNYAEALRWYRQA 215
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 56/119 (47%), Gaps = 10/119 (8%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
R+ + A+ G A Y LG + Y Q+ +AA +A+A L V+
Sbjct: 462 RWFRKAAEQGVAVAQYNLGAM--YDNGDGVRQDYAEALRWYRQAAEQEYAEAQNDLGVMY 519
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+NGSG + D + +AA G+++A LG +G+GVRQ+ AE R+ +A
Sbjct: 520 YNGSGVRQ---DYAEALRWYRKAAEQGNVEAQNNLGVMYDNGHGVRQDYAEALRWFRKA 575
Score = 41.2 bits (95), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 55/119 (46%), Gaps = 10/119 (8%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
++ + A+ G+ EA LG + Y QN +AA HA+A ++L +
Sbjct: 174 KWYRQAAEQGDAEAQNNLGAM--YDSGQGVRQNYAEALRWYRQAAEQGHAEAQFNLGSMY 231
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+N G +D V +AA G+ +A LG ++G GVRQ+ AE R+ +A
Sbjct: 232 YN---GQDVQQDYAEAVKWYRQAADQGNAEAQNNLGLLYENGRGVRQDYAEALRWYRKA 287
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 55/122 (45%), Gaps = 10/122 (8%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLA 175
A R+ + A+ G+ A + LG + Y Q+ +AA HA+A ++L
Sbjct: 351 EALRWYRKAAEQGHAAAQFNLGAM--YDSGRGVRQDYAEAFRWYRQAAEQGHAEAQFNLG 408
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
+ NG G + D +AA G+ +A LG +GYGVRQ+ AE R+
Sbjct: 409 AMYDNGDGVRQ---DYAEAFRWFHKAAEQGYAEAQNNLGVMYYNGYGVRQDYAESFRWFR 465
Query: 236 QA 237
+A
Sbjct: 466 KA 467
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 55/125 (44%), Gaps = 8/125 (6%)
Query: 118 WSESAHRFLKLC--ADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALY 172
W++ F + A+ G+ EA + LG++ + Q+ +AA A A
Sbjct: 22 WADDVSDFRQTLQLAEQGDAEAQFNLGLMYYNGQGVRQDYAEAVKWYRQAAEQGDASAQN 81
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+L ++ NG G + D V +AA G +A LG GYGVRQN AE +
Sbjct: 82 NLGLMYDNGYGVRQ---DYAEAVKWYRQAAEQGDAEAQSNLGVMYDKGYGVRQNYAEAVK 138
Query: 233 FLVQA 237
+ QA
Sbjct: 139 WYRQA 143
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 60/124 (48%), Gaps = 7/124 (5%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYS 173
N++E A R+ + A+ G+ EA + LG + + Q+ +AA +A+A +
Sbjct: 204 NYAE-ALRWYRQAAEQGHAEAQFNLGSMYYNGQDVQQDYAEAVKWYRQAADQGNAEAQNN 262
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L ++ NG G + D + +AA GH +A LG +G+GV Q+ AE ++
Sbjct: 263 LGLLYENGRGVRQ---DYAEALRWYRKAAEQGHTEAQNNLGAMYGNGHGVHQDDAEAVKW 319
Query: 234 LVQA 237
QA
Sbjct: 320 YRQA 323
>gi|443921550|gb|ELU41142.1| Sel1 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 804
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 42/85 (49%)
Query: 155 GASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELG 214
G L+ + A H +A Y LA NG G +++ +D + AA GH DA G
Sbjct: 383 GVQLLKRLADRGHGEAQYFLADCMANGIGTNRSRQDFDRAYPMFVLAAKHGHADAAYRAG 442
Query: 215 HCLQDGYGVRQNIAEGRRFLVQANA 239
C ++G+G R+ A+ +F +A A
Sbjct: 443 TCCENGWGCRRESAKAVQFFRKAAA 467
>gi|440894480|gb|ELR46922.1| Programmed cell death protein 2 [Bos grunniens mutus]
Length = 340
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 41/101 (40%), Gaps = 24/101 (23%)
Query: 275 RNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVG 334
RNQ P N DF P E P+ + + + G LC G
Sbjct: 93 RNQLPRQN------------DFYSYEPPSEDPPSETGESVYLHLKSGA-----HLCRVCG 135
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
C P +RCS C +YCS+ Q+LDW+L HK CA
Sbjct: 136 CSGP-------KRCSRCHKAHYCSKEHQSLDWRLGHKQACA 169
>gi|114797087|ref|YP_759395.1| putative localization factor protein PodJ [Hyphomonas neptunium ATCC
15444]
gi|114737261|gb|ABI75386.1| putative localization factor protein PodJ [Hyphomonas neptunium ATCC
15444]
Length = 1238
Score = 45.8 bits (107), Expect = 0.043, Method: Composition-based stats.
Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 6/114 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAV 176
+SA L+ A G A Y LG + + Q+ SL++KAA + H A+Y LA+
Sbjct: 1000 DSAIPLLRRAALKGAAPAQYDLGKLYEQGIGVDQDMIQARSLISKAAEAGHVGAMYDLAL 1059
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
G GG +D V +AA G +DA LG +G G Q++AE
Sbjct: 1060 FMAEGEGGELDD---LGAVEWFRKAADHGFLDAQYNLGVMFAEGIGAEQDLAEA 1110
>gi|24376033|ref|NP_720076.1| Sel1 domain-containing protein [Shewanella oneidensis MR-1]
gi|24351041|gb|AAN57520.1| Sel1 domain-containing protein [Shewanella oneidensis MR-1]
Length = 484
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 56/115 (48%), Gaps = 5/115 (4%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
A FL+L A+ EA LG++ +Y Q+ ++KAA + + A Y LA++ N
Sbjct: 286 QAKHFLQLAAEGEQTEAMANLGVL-YYQTQDLPQAYHFISKAAQAGYPHAQYHLALMLAN 344
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFL 234
G G +D A A AA G +DA L H L D G+ ++++ R+L
Sbjct: 345 GDGCQ---RDPIASEYWMAEAAEQGQLDAMLTRAQHMLNDESGLGGDVSQAERYL 396
>gi|340375070|ref|XP_003386060.1| PREDICTED: programmed cell death protein 2-like [Amphimedon
queenslandica]
Length = 374
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 25/46 (54%), Gaps = 7/46 (15%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
LC GC P +C C V+YCSR Q LDWK+ HKADC
Sbjct: 149 LCGLCGCPAP-------NKCGQCRKVHYCSRYHQLLDWKINHKADC 187
>gi|92118996|ref|YP_578725.1| Sel1 [Nitrobacter hamburgensis X14]
gi|91801890|gb|ABE64265.1| Sel1 [Nitrobacter hamburgensis X14]
Length = 357
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 10/122 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A L+ ADAG+ EA Y L FY QN L+ A+++ + A A+
Sbjct: 206 AAELLRFAADAGSPEAQYALAT--FYKEGTGVEQNLYKSVRLLQAASLAGNVDAEVEYAI 263
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
FNG+G KN+ A V+L +AA A L H L G G + EG ++ +
Sbjct: 264 ALFNGAGTEKNET---AAVSLLRKAARQNSAIAQNRLAHVLVTGMGAPVDKVEGLKWHIV 320
Query: 237 AN 238
A
Sbjct: 321 AK 322
>gi|388853626|emb|CCF52798.1| related to SamB protein [Ustilago hordei]
Length = 1433
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ CQ+ W++ H+ C+
Sbjct: 752 GIRQCANMLCGKWEEYPREFAKCRRCRKAKYCSKQCQSKGWQMGHRYWCS 801
>gi|37677192|ref|NP_937588.1| hypothetical protein VVA1532 [Vibrio vulnificus YJ016]
gi|326424333|ref|NP_762954.2| hypothetical protein VV2_1036 [Vibrio vulnificus CMCP6]
gi|37201737|dbj|BAC97558.1| conserved hypothetical protein [Vibrio vulnificus YJ016]
gi|319999719|gb|AAO07944.2| FOG: TPR repeat protein, SEL1 subfamily [Vibrio vulnificus CMCP6]
Length = 347
Score = 45.4 bits (106), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 50/113 (44%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+LK AD N+EA TL +Y + L K A A ALY L V+ F+G G
Sbjct: 86 YLKQSADLDNLEALSTLAE-HYYYEEAYQQALPLYQKLAEKGDAIALYYLGVMYFDGEG- 143
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++D + G +A G DA+ +L DG G+ Q+ + + A
Sbjct: 144 --VEQDQKKGNEYYLASAQKGDEDAMFQLAFSYNDGLGIEQDFKQANHWFASA 194
>gi|71014596|ref|XP_758734.1| hypothetical protein UM02587.1 [Ustilago maydis 521]
gi|46098524|gb|EAK83757.1| hypothetical protein UM02587.1 [Ustilago maydis 521]
gi|154761386|gb|ABS85543.1| Fuz1 [Ustilago maydis]
Length = 1421
Score = 45.4 bits (106), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ CQ+ W++ H+ C+
Sbjct: 741 GIRQCANMLCGKWEEYPREFAKCRRCRKAKYCSKQCQSKGWQMGHRYWCS 790
>gi|393239660|gb|EJD47190.1| hypothetical protein AURDEDRAFT_163797 [Auricularia delicata
TFB-10046 SS5]
Length = 722
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 334 GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
C E RRCS+C V YC R CQA DW+ +H+ +C+ A
Sbjct: 677 ACAHCELESAVLRRCSLCREVRYCGRECQAEDWQNQHREECSRA 720
>gi|315045996|ref|XP_003172373.1| hypothetical protein MGYG_04963 [Arthroderma gypseum CBS 118893]
gi|311342759|gb|EFR01962.1| hypothetical protein MGYG_04963 [Arthroderma gypseum CBS 118893]
Length = 1210
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 328 RLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
RLCS+ CG+ + RCS C YCS+ CQ DWK+ HK+ C
Sbjct: 1159 RLCSN--CGKKPGSGTKLLRCSGCKIAEYCSKECQREDWKVTHKSVC 1203
>gi|384497176|gb|EIE87667.1| hypothetical protein RO3G_12378 [Rhizopus delemar RA 99-880]
Length = 288
Score = 45.4 bits (106), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 31/59 (52%)
Query: 317 AARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
A R GLR C+++ C R ET+ EF +C C YCS++CQ+ W H+ C
Sbjct: 162 ACRKDETRGGLRRCANMSCQRWETQPREFAKCRRCRKAKYCSKSCQSKAWADGHRWWCV 220
>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
Length = 565
Score = 45.4 bits (106), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 1/53 (1%)
Query: 340 TRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVD 392
T RRC+ C V YCS CQ DW + HK +C RW + G+ D
Sbjct: 95 TSEENVRRCTRCKKVAYCSTECQTADWGI-HKQECQSLRRWAEASGSDSTPAD 146
>gi|115526708|ref|YP_783619.1| Sel1-like protein [Rhodopseudomonas palustris BisA53]
gi|115520655|gb|ABJ08639.1| Sel1-like protein [Rhodopseudomonas palustris BisA53]
Length = 386
Score = 45.4 bits (106), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 57/116 (49%), Gaps = 6/116 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + L A+AG+ EA + L M+R NR GA +A +A + +A Y+LA++
Sbjct: 163 AVEWYTLAAEAGDREAMFALAMLRLAGRAGPPNRAEGARWLASSAKLGNPKAAYNLALLY 222
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
+ G +D++ L +A G+ +A L ++G GV +N+ + R L
Sbjct: 223 LD---GQTFPQDVKRAAELLRLSADAGNPEAQYALATFYKEGTGVEKNLEQSVRLL 275
Score = 44.7 bits (104), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 55/122 (45%), Gaps = 10/122 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A L+L ADAGN EA Y L FY +N L+ AA++ + A A+
Sbjct: 235 AAELLRLSADAGNPEAQYALAT--FYKEGTGVEKNLEQSVRLLQAAAVAGNVDAEVEYAI 292
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
FNG+G KN+ A VAL +AA + A L L G G + +G ++ +
Sbjct: 293 ALFNGAGTPKNEA---AAVALLRKAARRNNPIAQNRLARVLVTGQGAPIDRIDGLKWHIV 349
Query: 237 AN 238
A
Sbjct: 350 AK 351
>gi|91978778|ref|YP_571437.1| hypothetical protein RPD_4319 [Rhodopseudomonas palustris BisB5]
gi|91685234|gb|ABE41536.1| Sel1 [Rhodopseudomonas palustris BisB5]
Length = 367
Score = 45.4 bits (106), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 55/117 (47%), Gaps = 10/117 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A L++ ADAGN EA Y L FY +N L+ AA++ + A A+
Sbjct: 216 AAELLRVAADAGNSEAQYALAT--FYKEGTGVEKNLDQAVRLLQSAALAGNVPAQVEYAI 273
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+NG+G KN+ A VA+ +AA + A L H L +G G ++ E ++
Sbjct: 274 ALYNGTGTVKNEP---AAVAMLRKAARANNPIAQNRLAHVLLNGQGAPRDPVEAIKW 327
>gi|343429396|emb|CBQ72969.1| related to SamB protein [Sporisorium reilianum SRZ2]
Length = 1443
Score = 45.4 bits (106), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ CQ+ W++ H+ C+
Sbjct: 763 GIRQCANMLCGKWEEYPREFAKCRRCRKAKYCSKQCQSKGWQMGHRYWCS 812
>gi|117925293|ref|YP_865910.1| Sel1 domain-containing protein [Magnetococcus marinus MC-1]
gi|117609049|gb|ABK44504.1| Sel1 domain protein repeat-containing protein [Magnetococcus
marinus MC-1]
Length = 831
Score = 45.4 bits (106), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 61/127 (48%), Gaps = 16/127 (12%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI--------SSHAQA 170
++ A ++ + A+ G+ +A + LG + ++G G + AK A+ HAQA
Sbjct: 362 AKEAVKWFRKSAEQGHAQAQHNLG-----VMYDKGQGVTKDAKEAVKWFRKSAEQGHAQA 416
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
++L V+ NG G + KD + V +AA GH A LG +G GV ++ E
Sbjct: 417 QHNLGVMYNNGEGVT---KDAKEAVKWYRKAAEQGHARAQNNLGVMYNNGEGVTKDAKEA 473
Query: 231 RRFLVQA 237
++ +A
Sbjct: 474 VKWYRKA 480
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 53/119 (44%), Gaps = 16/119 (13%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI--------SSHAQA 170
++ A ++ + A+ G +A + LG + N G G + AK A+ A+A
Sbjct: 614 AKEAVKWFRKSAEQGEAKAQHNLG-----VMYNNGEGVTKDAKEAVKWFRKSAEQGEAEA 668
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
+L + NG G +K D + V +AA GH +A LG GYGV ++ E
Sbjct: 669 QNNLGFMYDNGEGVTK---DAKEAVKWLRKAAEQGHANAQAFLGQSYDVGYGVTKDAKE 724
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 16/127 (12%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI--------SSHAQA 170
++ A ++ + A+ G EA LG + ++G G + AK A+ HA+A
Sbjct: 470 AKEAVKWYRKAAEQGQAEAQNDLG-----VMYDKGEGVTKDAKEAVKWYRKAAEQGHARA 524
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
+L V+ NG G + KD + V +AA G +A LG G GV ++ E
Sbjct: 525 QNNLGVMYNNGEGVT---KDAKEAVKWYRKAAEQGQAEAQHNLGFMYDKGEGVTKDAKEA 581
Query: 231 RRFLVQA 237
++ +A
Sbjct: 582 VKWYRKA 588
>gi|440793753|gb|ELR14928.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 252
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
C++ CGR E F RCS C V YC R CQ DW +HK C
Sbjct: 208 CANGKCGRSEEDGETFNRCSRCKKVYYCGRECQRADWP-KHKPTC 251
>gi|443894529|dbj|GAC71877.1| hypothetical protein PANT_5d00113 [Pseudozyma antarctica T-34]
Length = 1350
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 30/50 (60%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ CQ+ W++ H+ C+
Sbjct: 741 GIRQCANMLCGKWEEYPREFAKCRRCRKAKYCSKQCQSKGWQMGHRYWCS 790
>gi|126651405|ref|ZP_01723609.1| hypothetical protein BB14905_06974 [Bacillus sp. B14905]
gi|126591658|gb|EAZ85754.1| hypothetical protein BB14905_06974 [Bacillus sp. B14905]
Length = 428
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 49/111 (44%), Gaps = 7/111 (6%)
Query: 127 KLCADAGNVEACYTLGMI----RFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
K A+ G V+A Y LG I R ++ G +AA H A Y L I G
Sbjct: 299 KKSAETGYVDAQYRLGGIYLEGRLDQAKDINRGLFWYERAAEQFHIDAFYDLGFIWSKGL 358
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
G +N + G+ +AA G DA +LGH G GV +N+ E ++
Sbjct: 359 TGIRN---IEKGIHWFKQAALQGDADAKLQLGHIYNKGVGVARNLKEAIKW 406
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 51/110 (46%), Gaps = 6/110 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAV 176
E + ++ +L A GN +A Y LG F + +N SL KAA+ HA A +LA
Sbjct: 184 EKSLKWFELAAAQGNADAQYVLGNFYFEGVGVEENYTQAFSLYEKAALQGHADAANNLAD 243
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+ FNG G +D AA +A+ LG + G GV+++
Sbjct: 244 MYFNGEGVP---QDFTLARKWFDFAASKNVAEAMFTLGIMYEQGLGVKKD 290
>gi|393227837|gb|EJD35500.1| hypothetical protein AURDEDRAFT_188722 [Auricularia delicata
TFB-10046 SS5]
Length = 523
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 28/56 (50%), Gaps = 6/56 (10%)
Query: 325 PGLRLCSHVGCGRPETR---RHEFRRCSVCGAVNYCSRACQALDWK---LRHKADC 374
P R CS GC + RH F +C+ C V YCSRACQ DW L HK C
Sbjct: 347 PETRDCSAPGCAKTVHEPGMRHAFAKCAKCRVVQYCSRACQKADWTGLPLPHKEVC 402
>gi|343508863|ref|ZP_08746169.1| hypothetical protein VIS19158_10664 [Vibrio scophthalmi LMG 19158]
gi|342806644|gb|EGU41862.1| hypothetical protein VIS19158_10664 [Vibrio scophthalmi LMG 19158]
Length = 335
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 55/115 (47%), Gaps = 4/115 (3%)
Query: 123 HRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
+++L+ A+AG+ +A +L +Y + + KA S+ LYSL V+ F+G
Sbjct: 71 YQYLQTAAEAGSHQAAISLA-DYYYDEEEYQQSLAWYHKAEASNDPYVLYSLGVMYFDGE 129
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G D+ G AA G+ DA+ +L DG G++Q+ + + Q+
Sbjct: 130 G---TPADMAKGNEYYLAAAKAGYSDAMYQLAFSYNDGDGIKQDYTKAAYWFEQS 181
>gi|424891627|ref|ZP_18315210.1| hypothetical protein Rleg4DRAFT_7545 [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|393185622|gb|EJC85658.1| hypothetical protein Rleg4DRAFT_7545 [Rhizobium leguminosarum bv.
trifolii WSM2297]
Length = 860
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 60/122 (49%), Gaps = 6/122 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY--CLQNRGSGASLMAKA-AISSHAQAL 171
AN +++A +K DAG+V A Y L + + + + AS +AK A AL
Sbjct: 556 ANRDTKTAFATIKKAMDAGHVRAIYELSSLYEFGASVPRDAAKASEIAKGGAAKGDPFAL 615
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
+S + G G +K D+ L G+ L +AA LGH A+ ELG+ +G V + G
Sbjct: 616 HSYGKSLYYGRG-TKADQQL--GLRLMLQAADLGHTYAMNELGYIFLNGVNVPADPDRGI 672
Query: 232 RF 233
RF
Sbjct: 673 RF 674
>gi|402848518|ref|ZP_10896775.1| TPR repeat, SEL1 subfamily [Rhodovulum sp. PH10]
gi|402501265|gb|EJW12920.1| TPR repeat, SEL1 subfamily [Rhodovulum sp. PH10]
Length = 371
Score = 45.4 bits (106), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 52/113 (46%), Gaps = 10/113 (8%)
Query: 133 GNVEACYTLGMIRFYC-----LQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKN 187
G+ A Y LG+ FY Q+ G A L ++A +A Y+LA++ G G
Sbjct: 190 GHASAAYDLGL--FYMEGQIFPQDIGRAAELFRQSAAGGSPEAQYALAILYKEGRG---V 244
Query: 188 DKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
+DL L +AA G +DA E G L +G G +N AE + +A R
Sbjct: 245 QQDLGEAAKLLGQAALAGIVDAQVEYGIALFNGTGTAKNEAEAAAWFRRAAMR 297
Score = 38.9 bits (89), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 53/121 (43%), Gaps = 6/121 (4%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + A G+ EA Y L ++ Q+ G A L+ +AA++ A +
Sbjct: 215 AAELFRQSAAGGSPEAQYALAILYKEGRGVQQDLGEAAKLLGQAALAGIVDAQVEYGIAL 274
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
FNG+G +KN+ + A RAA G A L L G GV + + ++ + A+
Sbjct: 275 FNGTGTAKNEAE---AAAWFRRAAMRGSPIAQNRLARLLSVGRGVAADPVQAIKWHLVAS 331
Query: 239 A 239
A
Sbjct: 332 A 332
>gi|319639039|ref|ZP_07993797.1| hypothetical protein HMPREF0604_01421 [Neisseria mucosa C102]
gi|317399943|gb|EFV80606.1| hypothetical protein HMPREF0604_01421 [Neisseria mucosa C102]
Length = 339
Score = 45.4 bits (106), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 66/151 (43%), Gaps = 17/151 (11%)
Query: 118 WSESAHRFLKL--CADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQA 170
W+++ F + A+ G V A Y LG + Y ++ KAA +AQA
Sbjct: 22 WADNVPNFQETLRAAEQGVVAAQYNLGQM--YRNGQGVRKDYAEAVKWYRKAAEQGYAQA 79
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L V+ NG G + D V +AA G DA LG +G GVRQ+ AE
Sbjct: 80 QYNLGVMYDNGRGVRQ---DYIQAVQWYRKAAEQGLADAQYNLGMMYANGQGVRQDYAEA 136
Query: 231 RRFLVQANARELAAV-----LSSAACQGIST 256
R+ + + LA LS A QG+S
Sbjct: 137 VRWFRKTAEQGLAKAQYNLGLSYAQGQGVSQ 167
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 48/108 (44%), Gaps = 6/108 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
+A ++ + A+ G+ A LGM+ QN KAA HAQA +L V+
Sbjct: 207 NAVQWYRKAAEQGHAGAQINLGMMYEKGQGVHQNYAKAVEWYHKAAEQGHAQAQNNLGVM 266
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQ 225
NG G +D V +AA G+ DA LG + G GVRQ
Sbjct: 267 YDNGQGVR---QDYAQAVQWYLKAAEQGYADAQYNLGLMYEKGQGVRQ 311
>gi|32169828|emb|CAD99199.1| hypothetical protein [Mucor circinelloides]
Length = 202
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
Query: 155 GASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELG 214
GA +AA HA+A ++L NG G ++N L+ VA ++A G+I A LG
Sbjct: 70 GAYWYGQAARQGHARAQHNLGFCYHNGIGVTRN---LQMAVAWYKKSANQGNIFAYHSLG 126
Query: 215 HCLQDGYGVRQNIAE 229
+C Q+G GV N+ E
Sbjct: 127 YCFQNGLGVEVNLRE 141
>gi|119947297|ref|YP_944977.1| Sel1-like repeat-containing serine/threonine protein kinase
[Psychromonas ingrahamii 37]
gi|119865901|gb|ABM05378.1| Tyrosine protein kinase:Serine/threonine protein kinase:Sel1-like
repeat protein [Psychromonas ingrahamii 37]
Length = 448
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 58/121 (47%), Gaps = 7/121 (5%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYS 173
N++E A R+ + A GN +A Y +G++ +Q+ KAA HA A Y+
Sbjct: 284 NYTE-ATRWYRKAAKQGNADAQYNMGLMYNNGHGVIQDYKQALQWYNKAAEQQHAGAQYN 342
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+ ++ G G S+ D + +AA GH +A LG ++G+GV Q+ + +
Sbjct: 343 MGMMYDYGQGVSQ---DYKQAADWYHKAAEQGHANAQYYLGMMYENGHGVLQDYRQAYMW 399
Query: 234 L 234
L
Sbjct: 400 L 400
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 61/128 (47%), Gaps = 6/128 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYSLA 175
SE A ++ A+ + EA YTLG+I + Q+ KAA ++ A Y++
Sbjct: 213 SEQAAQWYHKAAEQEHAEAQYTLGLIYTSGYGVTQSYKQATYWYNKAAEQGYSDAQYNMG 272
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
++ +G+ G KN + +AA G+ DA +G +G+GV Q+ + ++
Sbjct: 273 LMYNSGNNGFKNYTE---ATRWYRKAAKQGNADAQYNMGLMYNNGHGVIQDYKQALQWYN 329
Query: 236 QANARELA 243
+A ++ A
Sbjct: 330 KAAEQQHA 337
>gi|449517999|ref|XP_004166031.1| PREDICTED: putative F-box protein At1g67623-like, partial [Cucumis
sativus]
Length = 246
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 29/123 (23%), Positives = 56/123 (45%), Gaps = 2/123 (1%)
Query: 57 LFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKAN 116
+ +LP+DL+ +L K+++++ D V + K + + + AS F
Sbjct: 33 VIKSLPNDLLTEVLAKVATSSYI--DLVQAKLATKHFLEASNDRYIFQHASLGNFRNLLW 90
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
N S F++ C ++ N E+ Y GM+ F+ SG + + ++A + A Y V
Sbjct: 91 NNSPKFWSFMETCNNSENPESLYRKGMLEFFTHCKEASGMAYLKRSAQKGYVDACYVFGV 150
Query: 177 IQF 179
I +
Sbjct: 151 ILY 153
>gi|290985106|ref|XP_002675267.1| predicted protein [Naegleria gruberi]
gi|284088862|gb|EFC42523.1| predicted protein [Naegleria gruberi]
Length = 1914
Score = 45.4 bits (106), Expect = 0.055, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 5/40 (12%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CG+P +++ +C C AV YCS+ CQA WK+ HK C
Sbjct: 1877 CGKPNSKK----KCGACQAVVYCSKECQASHWKV-HKTQC 1911
>gi|344925128|ref|ZP_08778589.1| hypothetical protein COdytL_10840 [Candidatus Odyssella
thessalonicensis L13]
Length = 126
Score = 45.4 bits (106), Expect = 0.056, Method: Composition-based stats.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 3/91 (3%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
+N ASL KA +S+ A+ALY+L + S + D + V L +AA GH +A
Sbjct: 10 RNEQEAASLRHKAGLSTQAEALYNLGCM---FSDAQELDPNFAEAVVLWEKAAAYGHAEA 66
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
L + G GV Q+ A R + +A A+
Sbjct: 67 QNNLAVMYKQGKGVAQDFARAREWYQKAAAQ 97
>gi|392566683|gb|EIW59859.1| hypothetical protein TRAVEDRAFT_64687 [Trametes versicolor
FP-101664 SS1]
Length = 335
Score = 45.1 bits (105), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Query: 322 TPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
T P LR+C++ C +P+ + + RCS C + YC+R CQ W + HK C+
Sbjct: 22 TALPSLRICTY--CAKPQDQYMKLMRCSRCASALYCNRVCQKAAWPV-HKLSCS 72
>gi|302692620|ref|XP_003035989.1| expressed protein [Schizophyllum commune H4-8]
gi|300109685|gb|EFJ01087.1| expressed protein [Schizophyllum commune H4-8]
Length = 232
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 25/50 (50%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLND 395
+RC+ C V YCS+ CQ WK HK C P DE+GN + D
Sbjct: 18 KRCARCRLVYYCSKDCQVASWKASHKKSCRPHASLYDEDGNPRTKIKRED 67
>gi|392569247|gb|EIW62420.1| hypothetical protein TRAVEDRAFT_42789 [Trametes versicolor
FP-101664 SS1]
Length = 429
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 4/69 (5%)
Query: 307 PASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
P S + +WF R C+ CG+ T + + C+ C A +YC +ACQ +D+
Sbjct: 15 PYSVIMKQWFDDRKDEERVAKTHCTF--CGKQGT--NPLKSCARCKAAHYCDQACQRMDF 70
Query: 367 KLRHKADCA 375
K RH+ +CA
Sbjct: 71 KARHRDECA 79
>gi|404378836|ref|ZP_10983915.1| hypothetical protein HMPREF9021_01251 [Simonsiella muelleri ATCC
29453]
gi|294483337|gb|EFG31023.1| hypothetical protein HMPREF9021_01251 [Simonsiella muelleri ATCC
29453]
Length = 269
Score = 45.1 bits (105), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL-----QNRGSGASLMAKAAISSHAQALYSLAV 176
A ++ + A+ G+V+A L R Y Q+ AA+ +ALY+L V
Sbjct: 93 AAKWYAVAAEQGDVDAQNNLA--RLYAEGLGFEQDYDKAEQYWQMAAVQGDDEALYNLGV 150
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
I + G D + +AA LG+ DA+ LG Q+G+GV ++ A+ + +Q
Sbjct: 151 IHDDALG---RDPNYEKAADFYLQAAHLGNADAMVNLGMLYQEGFGVPEDAAQANEWFLQ 207
Query: 237 A 237
A
Sbjct: 208 A 208
>gi|169863681|ref|XP_001838459.1| hypothetical protein CC1G_09087 [Coprinopsis cinerea okayama7#130]
gi|116500498|gb|EAU83393.1| hypothetical protein CC1G_09087 [Coprinopsis cinerea okayama7#130]
Length = 699
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 24/38 (63%)
Query: 338 PETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
P T+R +CS C AV YCS ACQ DWK RH+ C+
Sbjct: 458 PRTKRDRRPQCSGCAAVFYCSEACQREDWKRRHRPVCS 495
>gi|449108565|ref|ZP_21745207.1| polymorphic outer membrane protein [Treponema denticola ATCC 33520]
gi|448961366|gb|EMB42071.1| polymorphic outer membrane protein [Treponema denticola ATCC 33520]
Length = 793
Score = 45.1 bits (105), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 55/124 (44%), Gaps = 16/124 (12%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYS 173
A + K AD GN A LG+ C N G G L KA +A+A +
Sbjct: 101 AFEWFKKGADQGNANAQNALGV----CYDN-GQGVEKNYTIAIDLYKKAIEQGNAKAQNN 155
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L V+ +NG G DK L +AA G+ A LG+ ++G GV +N +E ++
Sbjct: 156 LGVMYYNGHG---VDKSYEKAFELYKKAAEQGNAYAQNNLGYMYENGEGVEKNTSEAIKW 212
Query: 234 LVQA 237
+A
Sbjct: 213 YTKA 216
Score = 41.2 bits (95), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 7/122 (5%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMI-RFYCLQNRGSGASLMAKAAISSHAQALYSLA 175
N SE+ + K AD G A +LG I N+ + LMA A HA +L
Sbjct: 277 NTSEAIKWYTK-AADQGVANAQDSLGWICEVREEYNKAAAMYLMA--AQQGHANGQNNLG 333
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
+ +NG G DKD + + ++A GH A LG+ +GYGV +N + +
Sbjct: 334 RMYYNGYG---VDKDYKQALEWYTKSAEQGHAYAQNNLGYMYYNGYGVDKNYKQALEWYT 390
Query: 236 QA 237
++
Sbjct: 391 KS 392
>gi|299738408|ref|XP_001838336.2| hypothetical protein CC1G_04780 [Coprinopsis cinerea okayama7#130]
gi|298403294|gb|EAU83524.2| hypothetical protein CC1G_04780 [Coprinopsis cinerea okayama7#130]
Length = 604
Score = 45.1 bits (105), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE 383
+RC C V YC ACQ+ DW L HK +C+ +RW+ +
Sbjct: 118 LKRCPNCKIVMYCGSACQSRDWSL-HKLECSALQRWMSQ 155
>gi|238603294|ref|XP_002395908.1| hypothetical protein MPER_03951 [Moniliophthora perniciosa FA553]
gi|215467458|gb|EEB96838.1| hypothetical protein MPER_03951 [Moniliophthora perniciosa FA553]
Length = 222
Score = 45.1 bits (105), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 33/70 (47%), Gaps = 2/70 (2%)
Query: 307 PASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHE-FRRCSVCGAVNYCSRACQALD 365
P R E A G GP + C CG+ E R E RRC C V YCS+ CQ
Sbjct: 119 PRPRIHIEVEAKIGMDGGPLYKSCYAENCGKAEKRDVESLRRCGGCELVWYCSKECQTKS 178
Query: 366 WKLR-HKADC 374
WK HK++C
Sbjct: 179 WKEDGHKSEC 188
>gi|340714716|ref|XP_003395871.1| PREDICTED: egl nine homolog 1-like [Bombus terrestris]
Length = 486
Score = 45.1 bits (105), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 34/60 (56%), Gaps = 2/60 (3%)
Query: 316 FAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+R TPG +++ + VGC R + RCS C AV+YC++ Q DWK RHK CA
Sbjct: 5 IVSRAQTPGGDVQIGTTVGCVIC-NRTDKLLRCSRCKAVSYCTKEHQRRDWK-RHKEFCA 62
>gi|46446348|ref|YP_007713.1| hypothetical protein pc0714 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46399989|emb|CAF23438.1| conserved hypothetical protein [Candidatus Protochlamydia
amoebophila UWE25]
Length = 445
Score = 45.1 bits (105), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 11/129 (8%)
Query: 113 IKANNW-----SESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAI 164
+ AN W + A ++ KL A+ G+ +A Y LG I L Q+ AA
Sbjct: 191 MYANRWGIAQSEQEAIKYYKLAAEQGHADAQYALGFIYANGLGVTQSDAEAFKYFKLAAE 250
Query: 165 SSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
HA A Y+L V NG G +++D++ L A GH DA LG +G GV
Sbjct: 251 QGHANAQYNLGVRYSNGRGVTQSDQEAFKYYKLAADQ---GHADAQYNLGVRYVNGQGVM 307
Query: 225 QNIAEGRRF 233
++ E ++
Sbjct: 308 RSEQEAAKY 316
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 10/112 (8%)
Query: 127 KLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
KL AD G+ +A Y LG++ Y Q+ AA HA A Y+L V NG
Sbjct: 102 KLLADQGDAKAQYKLGLM--YDESCGVTQSDAEAIKYFKLAAKQGHADAQYNLGVRYANG 159
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
G +++D++ + AA GH DA LG + +G+ Q+ E ++
Sbjct: 160 RGVTQSDQE---AIKYYKLAAEQGHADAQYALGFMYANRWGIAQSEQEAIKY 208
Score = 41.2 bits (95), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 8/117 (6%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY----CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ KL A G+ +A Y LG +R+ Q+ AA HA A Y+L +
Sbjct: 133 AIKYFKLAAKQGHADAQYNLG-VRYANGRGVTQSDQEAIKYYKLAAEQGHADAQYALGFM 191
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
N G ++++++ + AA GH DA LG +G GV Q+ AE ++
Sbjct: 192 YANRWGIAQSEQE---AIKYYKLAAEQGHADAQYALGFIYANGLGVTQSDAEAFKYF 245
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 55/118 (46%), Gaps = 8/118 (6%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY----CLQNRGSGASLMAKAAISSHAQALYSLA 175
+ A ++ KL AD G V+A Y LG +R+ +Q+ AA HA+A Y+L
Sbjct: 311 QEAAKYYKLAADQGYVDAQYNLG-VRYSNGRGVMQSDQEAIKYYKLAADQGHAKAQYNLG 369
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
V NG G ++++++ L A G A LG +G GV Q+ E ++
Sbjct: 370 VRYSNGRGVTQSEQEATKYYKLAADQ---GDAKAQYNLGARYANGRGVTQSEQEAAKY 424
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 53/116 (45%), Gaps = 8/116 (6%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY----CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ KL A+ G+ A Y LG +R+ Q+ AA HA A Y+L V
Sbjct: 241 AFKYFKLAAEQGHANAQYNLG-VRYSNGRGVTQSDQEAFKYYKLAADQGHADAQYNLGVR 299
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
NG G +++++ L A G++DA LG +G GV Q+ E ++
Sbjct: 300 YVNGQGVMRSEQEAAKYYKLAADQ---GYVDAQYNLGVRYSNGRGVMQSDQEAIKY 352
>gi|408399594|gb|EKJ78693.1| hypothetical protein FPSE_01181 [Fusarium pseudograminearum CS3096]
Length = 1189
Score = 45.1 bits (105), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 30/55 (54%), Gaps = 4/55 (7%)
Query: 321 GTPGPGLRLCSHVGCGRPETRRH-EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
GTP + C CG+PE ++CS C V YCS CQ DWK +H+A+C
Sbjct: 1137 GTPVTRVERCR--SCGKPENEEGVTLKKCSRCQRVKYCSGECQKKDWK-KHRAEC 1188
>gi|397593560|gb|EJK55991.1| hypothetical protein THAOC_24202 [Thalassiosira oceanica]
Length = 302
Score = 45.1 bits (105), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
+A+Y L F GS G + KD+R V L +AA LG IDAL LG G GV++++
Sbjct: 150 EAIYHLGQKYFFGSLGLQ--KDMRKAVELYTKAAELGSIDALFSLGDAYFSGNGVQEDVT 207
Query: 229 EGRRFLVQA 237
+ F +A
Sbjct: 208 KATAFFTKA 216
>gi|428175244|gb|EKX44135.1| hypothetical protein GUITHDRAFT_72490 [Guillardia theta CCMP2712]
Length = 419
Score = 45.1 bits (105), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 7/108 (6%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A +L+ A GNVEA Y LG++ Y ++R +L KAA + AL LA+
Sbjct: 98 ALEYLQRAARDGNVEAQYNLGVMYAYGHGVPKDRNESLNLFRKAAAQGYVAALNGLALSL 157
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+GS D + ++A G+ D L G L+DG GV +N
Sbjct: 158 TDGSA----DNNFTEAFHYFNQSALSGNADGLYNAGLLLKDGRGVERN 201
>gi|409048362|gb|EKM57840.1| hypothetical protein PHACADRAFT_251713 [Phanerochaete carnosa
HHB-10118-sp]
Length = 757
Score = 45.1 bits (105), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E+ EF +C C YC + CQ++ W H+ C+
Sbjct: 512 GIRQCANMMCGRWESFPREFAKCRRCRKAKYCGKECQSIAWSEGHRFWCS 561
>gi|344301422|gb|EGW31734.1| hypothetical protein SPAPADRAFT_51716 [Spathaspora passalidarum
NRRL Y-27907]
Length = 779
Score = 45.1 bits (105), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G+R C ++ CGR E+ EF +C C YCSR CQ W H+ C P
Sbjct: 691 GVRQCGNLECGRWESYPREFSKCRRCKRTKYCSRDCQMRAWHC-HRNWCIP 740
>gi|407925050|gb|EKG18072.1| Sel1-like protein [Macrophomina phaseolina MS6]
Length = 1100
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 70/161 (43%), Gaps = 11/161 (6%)
Query: 63 DDLVVSILCKLSSTARC-PSD-FVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSE 120
D + + L +L + A PSD + + + K + A+ + +A KT ++
Sbjct: 714 DPITMESLARLRNDAASRPSDKELQLRLAKKLVEAAAILADEGGRADPKTRNKNRERYTV 773
Query: 121 SAHRFLKLCADAGNVEACY----TLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
AH+ +K A EA + +LG + N SL +AA HAQA Y AV
Sbjct: 774 EAHKIVKKLAHGKYPEAMFYMADSLGSGQLGLAPNEKEAFSLYLEAAKLGHAQAAYRTAV 833
Query: 177 ---IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELG 214
I GG++ KD V RAA LG + A+ ++G
Sbjct: 834 CCEIGAEDGGGTR--KDPVKAVQWYKRAAALGDVAAMYKMG 872
>gi|397594670|gb|EJK56308.1| hypothetical protein THAOC_23842 [Thalassiosira oceanica]
Length = 430
Score = 45.1 bits (105), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 33/70 (47%), Gaps = 9/70 (12%)
Query: 308 ASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWK 367
ASR L EW G C++ CG +F++CS C A YCS+ CQ W+
Sbjct: 358 ASRLLKEW-------ETNGQNFCAN--CGMVAQAGEKFKQCSKCKAQWYCSKECQVEAWR 408
Query: 368 LRHKADCAPA 377
HK DC A
Sbjct: 409 EGHKKDCKRA 418
>gi|374328612|ref|YP_005078796.1| Sel1 domain-containing protein repeat-containing protein
[Pseudovibrio sp. FO-BEG1]
gi|359341400|gb|AEV34774.1| Sel1 domain protein repeat-containing protein [Pseudovibrio sp.
FO-BEG1]
Length = 601
Score = 45.1 bits (105), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 58/122 (47%), Gaps = 7/122 (5%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALY 172
N ++A +F +L A+ G+ A + LG I L Q+ +AA A A +
Sbjct: 312 QNHPKAAEKF-RLAAEQGHANAQFNLGRIYEIGLGVDQDYSEALKWYIRAAEQGVADAQF 370
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+LAV+ NG+G S+ DL VA AA GH DA LG G GV Q+ A R
Sbjct: 371 NLAVMYANGTGISQ---DLVEAVAWYHFAAKQGHPDAQYNLGFLYATGQGVEQDEATAAR 427
Query: 233 FL 234
++
Sbjct: 428 WV 429
Score = 41.6 bits (96), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 55/112 (49%), Gaps = 10/112 (8%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLA 175
+A R+++L A+ G+ EA Y +G R Y QN A+ AA +HA+A ++L
Sbjct: 424 TAARWVRLAANQGHAEAQYRIG--RAYEDGVGVEQNHTEAANWYYLAATQNHAKAQFTLG 481
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
+ G G +++ + AA G+++A +G G GV+QNI
Sbjct: 482 RVYAIGLGVPQDEVE---AAKWVLHAAERGYLEAQYRIGLAFYKGSGVKQNI 530
Score = 39.3 bits (90), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 56/124 (45%), Gaps = 12/124 (9%)
Query: 120 ESAHRFLKLCADAGN------VEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYS 173
+ A ++ +L AD G+ +E + G+I Q+ A A +A A Y
Sbjct: 99 KEAEKWYQLAADKGHPKALSELEYLFNKGLI---GPQSDDKVAMWTLHKAKQGYASAQYR 155
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LA++ +G G K D RAA GH A ELG+ +G GV+Q+ E ++
Sbjct: 156 LALLYHDGKGVPK---DYSEAEKWYRRAASNGHSGAQLELGYMYANGQGVQQDYQEAEKW 212
Query: 234 LVQA 237
++A
Sbjct: 213 YLKA 216
>gi|145541335|ref|XP_001456356.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424167|emb|CAK88959.1| unnamed protein product [Paramecium tetraurelia]
Length = 475
Score = 44.7 bits (104), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 328 RLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
++CS+ C E+ CSVC + YCS+ C+ DW L HK +C P +
Sbjct: 3 QVCSYKYCTNKTVLELEYI-CSVCQIMKYCSQKCRDTDWTLSHKNNCRPCQ 52
>gi|449119237|ref|ZP_21755633.1| polymorphic outer membrane protein [Treponema denticola H1-T]
gi|449121627|ref|ZP_21757973.1| polymorphic outer membrane protein [Treponema denticola MYR-T]
gi|448949068|gb|EMB29893.1| polymorphic outer membrane protein [Treponema denticola MYR-T]
gi|448950227|gb|EMB31049.1| polymorphic outer membrane protein [Treponema denticola H1-T]
Length = 721
Score = 44.7 bits (104), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 55/124 (44%), Gaps = 16/124 (12%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYS 173
A + K AD GN A LG+ C N G G L KA +A+A +
Sbjct: 101 AFEWFKKGADQGNANAQNALGV----CYDN-GQGVEKNYTIAIDLYKKAIEQGNAKAQNN 155
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L V+ +NG G DK L +AA G+ A LG+ ++G GV +N +E ++
Sbjct: 156 LGVMYYNGHG---VDKSYEKAFELYKKAAEQGNAYAQNNLGYMYENGEGVEKNTSEAIKW 212
Query: 234 LVQA 237
+A
Sbjct: 213 YTKA 216
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 7/122 (5%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMI-RFYCLQNRGSGASLMAKAAISSHAQALYSLA 175
N SE+ + K AD G A +LG I N+ + LMA A HA +L
Sbjct: 205 NTSEAIKWYTK-AADQGVANAQDSLGWICEVREEYNKAAAMYLMA--AQQGHANGQNNLG 261
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
+ +NG G DKD + + ++A GH A LG+ +GYGV +N + +
Sbjct: 262 RMYYNGYG---VDKDYKQALEWYTKSAEQGHAYAQNNLGYMYYNGYGVDKNYKQALEWYT 318
Query: 236 QA 237
++
Sbjct: 319 KS 320
>gi|353238866|emb|CCA70798.1| related to SamB protein [Piriformospora indica DSM 11827]
Length = 1096
Score = 44.7 bits (104), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 39/87 (44%), Gaps = 13/87 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA---------P 376
G+R C+++ CG+ E EF +C C YC + CQ+ W + H+ C+ P
Sbjct: 710 GIRQCANMTCGKWEKTPREFAKCRRCRKAKYCGKECQSKAWSMGHRYWCSAREGGEPGQP 769
Query: 377 AERWLDEEGNGDVDVDLNDAEGGGEDN 403
A LD E D N A G+ N
Sbjct: 770 ATDNLDHEN----DETPNTATIAGQTN 792
>gi|325267702|ref|ZP_08134353.1| Sel1 repeat protein [Kingella denitrificans ATCC 33394]
gi|324980826|gb|EGC16487.1| Sel1 repeat protein [Kingella denitrificans ATCC 33394]
Length = 321
Score = 44.7 bits (104), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 57/125 (45%), Gaps = 8/125 (6%)
Query: 118 WSESAHRFLKLC--ADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALY 172
W++ A F + A+ GN EA Y LG + + Q+ +AA A+A +
Sbjct: 36 WADDASDFRQTLQLAEQGNAEAQYNLGWMYYNGQGVRQDYAEAVKWYRQAAEQGVAEAQF 95
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
SL ++ NG G + D +AA GH +A LG +G GVRQ+ AE +
Sbjct: 96 SLGLMYDNGQGVRQ---DYAEAFRWYRQAAEQGHAEAQYNLGVMYDNGDGVRQDYAEALK 152
Query: 233 FLVQA 237
+ QA
Sbjct: 153 WYRQA 157
>gi|299756319|ref|XP_001829247.2| Sel1 domain-containing protein [Coprinopsis cinerea okayama7#130]
gi|298411623|gb|EAU92573.2| Sel1 domain-containing protein [Coprinopsis cinerea okayama7#130]
Length = 542
Score = 44.7 bits (104), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 40/83 (48%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A HA + Y LA NG G KN +D L AA GH DA G C
Sbjct: 224 SLLKRLADRGHAPSQYFLADCYANGIGTYKNKQDFDRAYPLFVLAAKHGHPDAAYRAGTC 283
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ F +A A
Sbjct: 284 CENGWGCRRESAKALGFYRKAGA 306
>gi|416237175|ref|ZP_11630697.1| tetratricopeptide repeat family protein, partial [Moraxella
catarrhalis BC1]
gi|326571297|gb|EGE21319.1| tetratricopeptide repeat family protein [Moraxella catarrhalis BC1]
Length = 126
Score = 44.7 bits (104), Expect = 0.079, Method: Composition-based stats.
Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 3/88 (3%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
Q+ KAA HA+A Y+L V+ + G G + D V +AA G A
Sbjct: 22 QDDQEAVEWYTKAAGQGHAEAQYNLGVMYYEGQGVRQ---DYHKAVEWFTKAANQGFAQA 78
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQA 237
LG +G GVRQNIA + + QA
Sbjct: 79 QNNLGVMYDEGQGVRQNIATAKIYYGQA 106
>gi|404378894|ref|ZP_10983970.1| hypothetical protein HMPREF9021_01187 [Simonsiella muelleri ATCC
29453]
gi|294483273|gb|EFG30959.1| hypothetical protein HMPREF9021_01187 [Simonsiella muelleri ATCC
29453]
Length = 889
Score = 44.7 bits (104), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 6/121 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
++A K A+ G+ EA Y LG++ + Q+ A KAA HA A Y+L +
Sbjct: 538 DNAFNEFKALAEQGHAEAQYYLGILYAQGWGVEQDERQAAVWYLKAADQGHAAAQYNLGM 597
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
G G +N + +AA LG++ A LG G GV Q+ A+ + +
Sbjct: 598 AYAKGLGIMQN---MVEASYWYTQAAKLGYVQAQNNLGELYTSGEGVNQDYAQAAEWFTK 654
Query: 237 A 237
A
Sbjct: 655 A 655
>gi|269104631|ref|ZP_06157327.1| tPR repeat protein SEL1 subfamily [Photobacterium damselae subsp.
damselae CIP 102761]
gi|268161271|gb|EEZ39768.1| tPR repeat protein SEL1 subfamily [Photobacterium damselae subsp.
damselae CIP 102761]
Length = 333
Score = 44.7 bits (104), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 52/115 (45%), Gaps = 4/115 (3%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
E ++L A+ + +A TL +Y ++ L + S LYSL ++ F
Sbjct: 68 EQGIKYLNQLAENNHPKATLTLA-DYYYEEEDYTKALKLYHQLESSQDPYVLYSLGIMYF 126
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
+G G ++ D G AA LG+ DA+ +L DG GV+Q+ E ++
Sbjct: 127 DGEGTAQ---DYEKGNEYYLAAAKLGYSDAMYQLAFSYNDGQGVKQDFTEAAKWF 178
>gi|440799995|gb|ELR21038.1| Sel1 repeatcontaining protein [Acanthamoeba castellanii str. Neff]
Length = 810
Score = 44.7 bits (104), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 90/206 (43%), Gaps = 27/206 (13%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFA--IKA 115
+D +PD ++ + S+ A P ++T K G AL LS+ + KA
Sbjct: 499 YDDVPD--TMNDIIGDSAAALLP------ILTEKAEEGSALGQYYLSQVYTRLTPPDTKA 550
Query: 116 NN-WSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQAL 171
++ WSE A A G+ EA Y LG + +N G + + +A + A+
Sbjct: 551 SSYWSERA-------AGQGHKEAMYDLGNLLLTGDGVERNTEKGIAWLKRAIEAGSIDAI 603
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y L ++ + G + +D R RAA G + A +G G GV QN +
Sbjct: 604 YRLGLLYYEGK---EVGRDYRKAFKHFTRAANAGDVTAAYRVGKMYARGIGVEQNGKKAA 660
Query: 232 RFLVQANAR-ELAAVLSSAAC--QGI 254
++ V+A A+ ++ A + AA QG+
Sbjct: 661 KWFVRAAAQGQVGAFYAEAALYDQGV 686
>gi|192289310|ref|YP_001989915.1| Sel1 domain-containing protein repeat-containing protein
[Rhodopseudomonas palustris TIE-1]
gi|192283059|gb|ACE99439.1| Sel1 domain protein repeat-containing protein [Rhodopseudomonas
palustris TIE-1]
Length = 1110
Score = 44.7 bits (104), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 97/219 (44%), Gaps = 26/219 (11%)
Query: 39 TTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKR-----M 93
T +++ ++P +++ A P D+ +I + S+ P F V I +
Sbjct: 838 TPVERQSLYTPPAPPQTE--PAAPSDITGTIPSQPSA---APEKFGTVAIPSAERLPDTI 892
Query: 94 NGLALNSLVLSKASKKTFAIKAN---------NWSESAHRFLKLCADAGNVEACYTLGMI 144
G L +L L + + + N+ E+A ++ + ADAG A + +G +
Sbjct: 893 GGATLRTLALKGDAAAAYEVATRYVEGKGVPVNYDEAA-KWYQRAADAGVTPAIFRIGTL 951
Query: 145 RFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARA 201
L ++ +L + AA +A+A+++LAV+ +G GSK + + A +A
Sbjct: 952 YEKGLGVKRDLDVARTLYSTAADRGNAKAMHNLAVLYADG--GSKG-ANYKTAAAWFRKA 1008
Query: 202 AFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
A G D+ LG G GV QN+AE ++ A A+
Sbjct: 1009 AERGVADSQFNLGILYARGIGVDQNLAESYKWFSLAAAQ 1047
>gi|389750408|gb|EIM91579.1| hypothetical protein STEHIDRAFT_144801 [Stereum hirsutum FP-91666
SS1]
Length = 344
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 3/67 (4%)
Query: 328 RLCSHVGCGRPETRRHE---FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEE 384
R C + C + E RC C YCSR CQ DWK K CAP E ++ +
Sbjct: 269 RFCFNPACNLAWLKDDEGKTLHRCGKCKWAFYCSRECQTADWKRHKKEPCAPLEEIVEND 328
Query: 385 GNGDVDV 391
+VD+
Sbjct: 329 DLWEVDI 335
>gi|320583857|gb|EFW98070.1| hypothetical protein HPODL_0700 [Ogataea parapolymorpha DL-1]
Length = 696
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 38/93 (40%), Gaps = 13/93 (13%)
Query: 291 PLLSDFGCNVPAPEAHPASRFLAEWFAA------RGGTPGPGLRLCSHVGCGRPETRRHE 344
PL+ F H S+ + W + R G R C++ GCG+ E +
Sbjct: 570 PLVEKFTV------KHLHSKDITYWASVIVRNSNRKDESKGGRRQCAYFGCGKWEDEPRQ 623
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
F +C C YCS+ CQ+ W HK C A
Sbjct: 624 FAKCRRCKRAKYCSKECQSKAWTY-HKYWCNAA 655
>gi|426235294|ref|XP_004011619.1| PREDICTED: programmed cell death protein 2 [Ovis aries]
Length = 317
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P +RCS C +YCS+ Q+LDW+L HK CA
Sbjct: 104 GAHLCRVCGCSGP-------KRCSRCHKAHYCSKEHQSLDWRLGHKQACA 146
>gi|302677072|ref|XP_003028219.1| hypothetical protein SCHCODRAFT_258325 [Schizophyllum commune H4-8]
gi|300101907|gb|EFI93316.1| hypothetical protein SCHCODRAFT_258325 [Schizophyllum commune H4-8]
Length = 1253
Score = 44.7 bits (104), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 7/76 (9%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP-------AE 378
G+R C+++ CG+ E+ EF +C C YC + CQ+ W H+ C+ AE
Sbjct: 988 GIRQCANMLCGKWESYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCSAKEGDDERAE 1047
Query: 379 RWLDEEGNGDVDVDLN 394
R E D D D +
Sbjct: 1048 RRASEAQAADGDADTS 1063
>gi|170102677|ref|XP_001882554.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642451|gb|EDR06707.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 645
Score = 44.7 bits (104), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 24/37 (64%), Gaps = 1/37 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
+RC++C V+YC CQ+ DW + HK +C +RW+
Sbjct: 128 LKRCTICKVVHYCDSKCQSADW-IFHKQECVALQRWV 163
>gi|30248264|ref|NP_840334.1| hypothetical protein NE0240 [Nitrosomonas europaea ATCC 19718]
gi|30180149|emb|CAD84151.1| hypothetical protein NE0240 [Nitrosomonas europaea ATCC 19718]
Length = 1032
Score = 44.7 bits (104), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 59/125 (47%), Gaps = 10/125 (8%)
Query: 137 ACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVA 196
ACY G + QNR S L AA ++ A+ +L NG G +N K+ V
Sbjct: 853 ACYENG---YGVKQNRSSAIELYRMAANQGNSNAMVNLGFYYRNGIGVKQNRKE---AVK 906
Query: 197 LCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN-ARELAAVLSSAACQGIS 255
L RAA +G A+ LG C ++G GV Q+ + QA A E+ A+ + Q I
Sbjct: 907 LFQRAAKVGDYRAMCNLGVCYENGEGVDQDWNKAISLYQQATKAGEIRAI---SNIQNIL 963
Query: 256 TRSWL 260
R++L
Sbjct: 964 LRNFL 968
>gi|440797399|gb|ELR18486.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 488
Score = 44.7 bits (104), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 33/62 (53%), Gaps = 3/62 (4%)
Query: 317 AARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
AA TP +++C CG+ E+ RC C V YCSR CQ DW RHK +C+
Sbjct: 56 AAPTPTPKQQVKVCK--SCGKTESGDRALVRCGRCHKVYYCSRDCQRQDWG-RHKPECSS 112
Query: 377 AE 378
++
Sbjct: 113 SD 114
>gi|301109493|ref|XP_002903827.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262096830|gb|EEY54882.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 316
Score = 44.7 bits (104), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 64/150 (42%), Gaps = 20/150 (13%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
E HR +KL DAGN +A + LG++ Y QN + A L +A+ A Y+L +
Sbjct: 148 EDVHRAVKLLQDAGNPQAKFHLGVMYEYGRGVAQNFKTAAELYRQASEHQVPDAFYNLGL 207
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG----RR 232
+ G G +N + R +A LG A+ LG G G + ++ +R
Sbjct: 208 LHLQGRGVEQNFEHAR---EYFQQAVDLGSAQAMYALGQMHVHGQGSSIDYSQALYWLKR 264
Query: 233 FLVQANAR----------ELAAVLSSAACQ 252
VQ + R E+ VLS A Q
Sbjct: 265 AAVQDDVRVSSTAHTVANEIELVLSQAELQ 294
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 6/105 (5%)
Query: 136 EACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLR 192
E+ Y L M++FY QN + +L+++AA H A ++L V+ G G +D
Sbjct: 59 ESLYLLAMMKFYGHGVDQNVEAAVTLLSRAAERGHRDAEFALGVLYGRGEGVVHSDT--- 115
Query: 193 AGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+ A++A GHIDA L +G GV +++ + L A
Sbjct: 116 LSASWLAKSAARGHIDAKWMLAAMYNEGRGVDEDVHRAVKLLQDA 160
>gi|150864377|ref|XP_001383162.2| hypothetical protein PICST_82184 [Scheffersomyces stipitis CBS
6054]
gi|149385633|gb|ABN65133.2| predicted protein, partial [Scheffersomyces stipitis CBS 6054]
Length = 763
Score = 44.7 bits (104), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G+R C ++ CGR E EF +C C YCSR CQ W H+ C P
Sbjct: 706 GVRQCGNLECGRWEKYPREFSKCRRCKRTKYCSRECQMRAWHC-HRNWCIP 755
>gi|323455258|gb|EGB11127.1| hypothetical protein AURANDRAFT_62041 [Aureococcus anophagefferens]
Length = 529
Score = 44.7 bits (104), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 27/57 (47%), Gaps = 2/57 (3%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL--RHKADCAPAERW 380
G R C C E+ H+F RC+ C YCS+ CQ LDW L HK C W
Sbjct: 409 GKRKCQLKQCDTLESDDHKFLRCARCKDAAYCSKECQRLDWSLVGNHKRMCGTVPMW 465
>gi|410630928|ref|ZP_11341612.1| hypothetical protein GARC_1506 [Glaciecola arctica BSs20135]
gi|410149437|dbj|GAC18479.1| hypothetical protein GARC_1506 [Glaciecola arctica BSs20135]
Length = 627
Score = 44.7 bits (104), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 53/127 (41%), Gaps = 13/127 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A NW A A G+ +A TLG + Q+ A KAA H A
Sbjct: 494 QAANWYTQA-------AKHGHNQAQATLGFMYSKGNGVDQDYSQAAYWYQKAAEQGHMNA 546
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+LA + G G KN + +AA G DA LG + G GV Q++AE
Sbjct: 547 QYNLAYLYSLGQGIVKNHQQ---AAYWFEKAAIQGDADAQNSLGKLYERGLGVSQDLAEA 603
Query: 231 RRFLVQA 237
+R QA
Sbjct: 604 KRLYQQA 610
>gi|397598232|gb|EJK57182.1| hypothetical protein THAOC_22806 [Thalassiosira oceanica]
Length = 610
Score = 44.7 bits (104), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 2/109 (1%)
Query: 129 CADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKND 188
C A + R C N +++ +A+Y L F GS G +
Sbjct: 443 CNMAAQKRGMFDCAFCRTPCPDNDDDKLAMLQARVKKKDPEAIYFLGQKYFFGSLGLQ-- 500
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD R V L A LG IDAL LG+ +G GV+Q+ +G FL ++
Sbjct: 501 KDTRKAVELFTEAVELGSIDALFSLGNAYFNGDGVQQDKDKGVYFLTKS 549
>gi|340363634|ref|ZP_08685957.1| TPR repeat protein [Neisseria macacae ATCC 33926]
gi|339885313|gb|EGQ75042.1| TPR repeat protein [Neisseria macacae ATCC 33926]
Length = 420
Score = 44.7 bits (104), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 56/124 (45%), Gaps = 4/124 (3%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
E A + + AD G + A LGM+ + LQN + KAA A Y+L +
Sbjct: 107 EQAMEWCRSAADKGYLPAQNNLGMM-YGVLQNYVEATKWLQKAAEQGSVNAQYNLGLRYE 165
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANA 239
G G +ND++ V +AA G A LG +G GVRQN E R+ +A
Sbjct: 166 QGQGVRQNDEE---AVRWYRKAAEQGLATAQYHLGVMYANGRGVRQNDEEAVRWYRKAAE 222
Query: 240 RELA 243
+ LA
Sbjct: 223 QGLA 226
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 55/121 (45%), Gaps = 8/121 (6%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A ++L+ A+ G+V A Y LG +R+ QN KAA A A Y L V
Sbjct: 140 EATKWLQKAAEQGSVNAQYNLG-LRYEQGQGVRQNDEEAVRWYRKAAEQGLATAQYHLGV 198
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ NG G +ND++ V +AA G A LG + GVRQN E ++ +
Sbjct: 199 MYANGRGVRQNDEE---AVRWYRKAAEQGLATAQYHLGVMYANRRGVRQNYEEAAQWYRK 255
Query: 237 A 237
A
Sbjct: 256 A 256
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 54/124 (43%), Gaps = 6/124 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
E A R+ + A+ G A Y LG++ QN KAA A A Y L V
Sbjct: 175 EEAVRWYRKAAEQGLATAQYHLGVMYANGRGVRQNDEEAVRWYRKAAEQGLATAQYHLGV 234
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ N G +N ++ +AA G +DA LG +G GVRQ+ AE R+ +
Sbjct: 235 MYANRRGVRQNYEE---AAQWYRKAAEQGDVDAQNNLGALYDEGQGVRQDSAEAVRWYRK 291
Query: 237 ANAR 240
A R
Sbjct: 292 AAER 295
>gi|58269600|ref|XP_571956.1| regulation of budding-related protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|134114033|ref|XP_774264.1| hypothetical protein CNBG2450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256899|gb|EAL19617.1| hypothetical protein CNBG2450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57228192|gb|AAW44649.1| regulation of budding-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 812
Score = 44.7 bits (104), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ CQ+ W+ H+ C+
Sbjct: 682 GIRQCANMSCGKWEKFSREFAKCRRCRKAKYCSKECQSRAWQEGHRFWCS 731
>gi|440789665|gb|ELR10969.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 205
Score = 44.7 bits (104), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Query: 323 PGPGLRLCSHVGCGRPETRRHEFRRCSVCG----AVNYCSRACQALDWKLRHKADCAPAE 378
P L+ CS C + ET + +F+ C+ C V YCSR+CQA DW H + C E
Sbjct: 117 PPRDLKKCSASNCDKYETDQVKFKLCAPCKDVGKKVPYCSRSCQAYDWTQGHVSVCGKPE 176
Query: 379 RWL 381
L
Sbjct: 177 EQL 179
>gi|405121801|gb|AFR96569.1| hypothetical protein CNAG_03348 [Cryptococcus neoformans var.
grubii H99]
Length = 811
Score = 44.7 bits (104), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ CQ+ W+ H+ C+
Sbjct: 681 GIRQCANMSCGKWEKFSREFAKCRRCRKAKYCSKECQSRAWQEGHRFWCS 730
>gi|448101313|ref|XP_004199530.1| Piso0_002066 [Millerozyma farinosa CBS 7064]
gi|359380952|emb|CCE81411.1| Piso0_002066 [Millerozyma farinosa CBS 7064]
Length = 772
Score = 44.7 bits (104), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 2/58 (3%)
Query: 319 RGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
R T G G+R C ++ CG+ E EF +C C YCSR CQ W H+ C P
Sbjct: 676 RNETKG-GVRQCGNLECGKWERYPREFSKCRRCKRTKYCSRECQMRAWHC-HRNWCIP 731
>gi|321261069|ref|XP_003195254.1| regulation of budding-related protein [Cryptococcus gattii WM276]
gi|317461727|gb|ADV23467.1| Regulation of budding-related protein, putative [Cryptococcus
gattii WM276]
Length = 795
Score = 44.7 bits (104), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 29/50 (58%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ CQ+ W+ H+ C+
Sbjct: 682 GIRQCANMSCGKWEKFSREFAKCRRCRKAKYCSKECQSRAWQEGHRFWCS 731
>gi|170101963|ref|XP_001882198.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164643013|gb|EDR07267.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 750
Score = 44.7 bits (104), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEE 384
G+R C+++ CGR ET EF +C C YC + CQ+ W H+ C+ + +DE+
Sbjct: 439 GIRQCANMLCGRWETYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCSAKD--VDED 495
>gi|448097471|ref|XP_004198682.1| Piso0_002066 [Millerozyma farinosa CBS 7064]
gi|359380104|emb|CCE82345.1| Piso0_002066 [Millerozyma farinosa CBS 7064]
Length = 772
Score = 44.7 bits (104), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 29/58 (50%), Gaps = 2/58 (3%)
Query: 319 RGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
R T G G+R C ++ CG+ E EF +C C YCSR CQ W H+ C P
Sbjct: 676 RNETKG-GVRQCGNLECGKWERYPREFSKCRRCKRTKYCSRECQMRAWHC-HRNWCIP 731
>gi|169846426|ref|XP_001829928.1| hypothetical protein CC1G_04617 [Coprinopsis cinerea okayama7#130]
gi|116508955|gb|EAU91850.1| hypothetical protein CC1G_04617 [Coprinopsis cinerea okayama7#130]
Length = 257
Score = 44.7 bits (104), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP 376
RCS+C V+YC + CQ LDWK H+ +C+P
Sbjct: 38 RCSLCKGVSYCGQRCQKLDWKENHRWNCSP 67
>gi|397647092|gb|EJK77557.1| hypothetical protein THAOC_00606 [Thalassiosira oceanica]
Length = 396
Score = 44.7 bits (104), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
+A++ L + F G G + KD R GV L AA LG +DAL LGH G GV+++
Sbjct: 244 EAIFYLGLKYFFGHLGLQ--KDARKGVVLYTEAAELGSVDALFNLGHAYDTGEGVQEDKV 301
Query: 229 EGRRFLVQA 237
+ F +A
Sbjct: 302 KATEFYTKA 310
>gi|237748058|ref|ZP_04578538.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
gi|229379420|gb|EEO29511.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
Length = 414
Score = 44.7 bits (104), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 44/82 (53%), Gaps = 3/82 (3%)
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
AKAA ++ A Y LA++QF+G+GG++N A + + A G+ A LG+
Sbjct: 122 FAKAAGQNYQPAQYQLALMQFSGTGGTENK---SAAIEQFKKLASEGYAPAQYTLGYLNL 178
Query: 219 DGYGVRQNIAEGRRFLVQANAR 240
G G+ QN E R + +A A+
Sbjct: 179 KGDGIPQNSGEARFWFEKAAAK 200
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 39/79 (49%), Gaps = 3/79 (3%)
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
+ KAA + A Y +A++ GSG KN + V +AAF G++DA L
Sbjct: 266 IEKAANQQYPVAEYHMAMMYLTGSGTEKNPE---LAVKWLEKAAFHGNVDAQNFYASLLY 322
Query: 219 DGYGVRQNIAEGRRFLVQA 237
GYG++Q+I + +A
Sbjct: 323 LGYGIKQDIPRAIGYFTEA 341
>gi|260950759|ref|XP_002619676.1| hypothetical protein CLUG_00835 [Clavispora lusitaniae ATCC 42720]
gi|238847248|gb|EEQ36712.1| hypothetical protein CLUG_00835 [Clavispora lusitaniae ATCC 42720]
Length = 754
Score = 44.3 bits (103), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
G+R C ++ CG+ E EF +C C YCSR CQ W H+ C P+
Sbjct: 659 GVRQCGNLECGKWEKYPKEFSKCKKCKRTKYCSRECQMRAWHC-HRNWCVPS 709
>gi|213512745|ref|NP_001134666.1| programmed cell death protein 2 [Salmo salar]
gi|209735098|gb|ACI68418.1| Programmed cell death protein 2 [Salmo salar]
Length = 346
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 32/68 (47%), Gaps = 7/68 (10%)
Query: 307 PASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
P S W G G G++LC GC + CS C AV+YCS+ Q +DW
Sbjct: 113 PPSDEDPNWTERDPGVHGSGVKLCKLCGCPGQKV-------CSKCHAVSYCSKEHQTIDW 165
Query: 367 KLRHKADC 374
K HK +C
Sbjct: 166 KHCHKKEC 173
>gi|148922942|ref|NP_001092246.1| LRP2-binding protein [Danio rerio]
gi|166199146|sp|A5PLI4.1|LR2BP_DANRE RecName: Full=LRP2-binding protein
gi|148744648|gb|AAI42916.1| Zgc:165631 protein [Danio rerio]
Length = 343
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 56/125 (44%), Gaps = 12/125 (9%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
E LK A+ G+ +A + LG + + +Q + A L+ QALY LAVI +
Sbjct: 43 EKTMNLLKEKAETGDSQATFLLGQLHY--VQGCYAEAELIFDRIKDKDPQALYQLAVIYY 100
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHID-------ALRELGHCLQDGYGVRQNIAEGRR 232
+G G +DL V R AF + AL LG +G+GV+ + +E R
Sbjct: 101 DGLG---TKEDLGRAVEYMGRVAFWDSSEAGSVRYAALYNLGQAYLEGFGVQASSSEAER 157
Query: 233 FLVQA 237
+ A
Sbjct: 158 LWLLA 162
>gi|316932352|ref|YP_004107334.1| Sel1 domain-containing protein repeat-containing protein
[Rhodopseudomonas palustris DX-1]
gi|315600066|gb|ADU42601.1| Sel1 domain protein repeat-containing protein [Rhodopseudomonas
palustris DX-1]
Length = 1127
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 67/128 (52%), Gaps = 7/128 (5%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALY 172
N+ E+A ++ + ADAG A + +G + L ++ + L + AA +A+A++
Sbjct: 936 TNYDEAA-KWYQRAADAGVTPAIFRIGTLYEKGLGVKKDLDAARKLYSTAADRGNAKAMH 994
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+LAV+ +G GSK + + A ++AA G D+ LG G GV QN+AE +
Sbjct: 995 NLAVLYADG--GSKG-ANYKTAAAWFSKAAERGVADSQFNLGILYARGIGVDQNLAESYK 1051
Query: 233 FLVQANAR 240
+ A+A+
Sbjct: 1052 WFALASAQ 1059
>gi|398863568|ref|ZP_10619128.1| TPR repeat-containing protein [Pseudomonas sp. GM78]
gi|398247343|gb|EJN32792.1| TPR repeat-containing protein [Pseudomonas sp. GM78]
Length = 448
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + + A+ G +A + LG++ F Q+R SL KAA + +A Y+L V+
Sbjct: 315 AASWFRKAAEQGFAKAQFNLGVLYFNGRGVAQDRQQAVSLYQKAAEQGYVEAQYNLGVLY 374
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGV 223
F G G + +DL+ +AA G+ +A LG G G+
Sbjct: 375 FRGEGLT---RDLKQAAYWYQKAAEQGYANAQYNLGLMYAKGEGL 416
>gi|307108050|gb|EFN56291.1| hypothetical protein CHLNCDRAFT_144678 [Chlorella variabilis]
Length = 505
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 21/35 (60%)
Query: 340 TRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
T + + C+ C AV YCSRACQ DWK HK C
Sbjct: 448 TPTQQLKSCAACKAVAYCSRACQVQDWKQGHKQLC 482
>gi|392571478|gb|EIW64650.1| hypothetical protein TRAVEDRAFT_112059 [Trametes versicolor
FP-101664 SS1]
Length = 167
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CS+ C + +C+VC A+ YCS+ CQ DWK RHK C
Sbjct: 114 CSNYACANLSNKLSALSQCTVCKAIYYCSKPCQMADWK-RHKKFC 157
>gi|46123869|ref|XP_386488.1| hypothetical protein FG06312.1 [Gibberella zeae PH-1]
Length = 1174
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 4/55 (7%)
Query: 321 GTPGPGLRLCSHVGCGRPETRRH-EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
GTP + C CG+PE ++CS C V YCS CQ DW+ +H+A+C
Sbjct: 1122 GTPVTRVERCR--SCGKPENEEGVTLKKCSRCQKVKYCSGECQKKDWR-KHRAEC 1173
>gi|397630866|gb|EJK69944.1| hypothetical protein THAOC_08749 [Thalassiosira oceanica]
Length = 556
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 2/51 (3%)
Query: 328 RLCSHVGCGRPETR-RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
R+C + P +H +CS C + YCS+ CQ DWK RHK C P
Sbjct: 289 RMCRYCNKNDPSAEFQHALMKCSRCKSTYYCSKECQRADWK-RHKKSCIPV 338
>gi|299754756|ref|XP_001828169.2| hypothetical protein CC1G_02750 [Coprinopsis cinerea okayama7#130]
gi|298410909|gb|EAU93520.2| hypothetical protein CC1G_02750 [Coprinopsis cinerea okayama7#130]
Length = 625
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
LR+C ++ P R + C+ C V YCS CQ DWK RH+ +C
Sbjct: 416 LRICDNLNHTTPTRRFSISKTCADCQTVVYCSPECQRQDWKSRHRTEC 463
>gi|393236307|gb|EJD43857.1| hypothetical protein AURDEDRAFT_185421 [Auricularia delicata
TFB-10046 SS5]
Length = 501
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 324 GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G R C+ CGR E R + +RCS C A+ YC+ CQ L WK HK C
Sbjct: 427 GTFFRRCALPTCGRVEEVRGKHQRCSGCKAMYYCNGKCQMLHWKT-HKTVC 476
>gi|397635442|gb|EJK71862.1| hypothetical protein THAOC_06657, partial [Thalassiosira oceanica]
Length = 867
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 3/82 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A + A+ A QA+ L ++G G + KD+R V L + AA LG DAL LG+
Sbjct: 701 AKIQARVA-KKDPQAILFLGRKYYHGDDGLQ--KDMRKAVELYSEAAELGLTDALFSLGN 757
Query: 216 CLQDGYGVRQNIAEGRRFLVQA 237
+ G+GV+Q+ A+ F +A
Sbjct: 758 AYRQGHGVQQDKAKAVEFYAKA 779
>gi|330804439|ref|XP_003290202.1| hypothetical protein DICPUDRAFT_154698 [Dictyostelium purpureum]
gi|325079666|gb|EGC33255.1| hypothetical protein DICPUDRAFT_154698 [Dictyostelium purpureum]
Length = 229
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 34/76 (44%), Gaps = 5/76 (6%)
Query: 320 GGTPGPGLRL---CSHVGC--GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G P L L CS+ C G+ + + + CS CG +YCSR CQ W HK C
Sbjct: 62 GELFNPTLALAAKCSNNKCKSGKIDNTKTKLLLCSACGTASYCSRDCQVEHWSNGHKEKC 121
Query: 375 APAERWLDEEGNGDVD 390
+ +EE + D
Sbjct: 122 IKIRQKKEEEAKKNFD 137
>gi|397573337|gb|EJK48653.1| hypothetical protein THAOC_32530 [Thalassiosira oceanica]
Length = 289
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 2/68 (2%)
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A++ L + + G G + KD+R V L AA LG I+AL +LG+ + GYGV++++A+
Sbjct: 150 AIHYLGELYYYGGLGLQ--KDIRRAVELWTEAAELGSIEALFDLGNAYRKGYGVKRDMAK 207
Query: 230 GRRFLVQA 237
F +A
Sbjct: 208 AVEFWTKA 215
>gi|39933891|ref|NP_946167.1| hypothetical protein RPA0814 [Rhodopseudomonas palustris CGA009]
gi|39647738|emb|CAE26258.1| conserved unknown protein [Rhodopseudomonas palustris CGA009]
Length = 1131
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 100/219 (45%), Gaps = 26/219 (11%)
Query: 39 TTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLI-TCKRM---- 93
T +++ ++P +++ A P D+ +I + S+ P F V I + +R+
Sbjct: 859 TPVERQSLYTPPAPPQTE--PAAPSDITGTIPSQPSA---APEKFGTVAIPSAERLPDTI 913
Query: 94 NGLALNSLVLSKASKKTFAIKAN---------NWSESAHRFLKLCADAGNVEACYTLGMI 144
G L +L L + + + N+ E+A ++ + ADAG A + +G +
Sbjct: 914 GGATLRTLALKGDAAAAYEVATRYAEGKGVPVNYDEAA-KWYQRAADAGVTPAIFRIGTL 972
Query: 145 RFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARA 201
L ++ +L + AA +A+A+++LAV+ +G GSK + + A +A
Sbjct: 973 YEKGLGVKRDLDVARTLYSTAADRGNAKAMHNLAVLYADG--GSKG-ANYKTAAAWFRKA 1029
Query: 202 AFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
A G D+ LG G GV QN+AE ++ A A+
Sbjct: 1030 AERGVADSQFNLGILYARGIGVDQNLAESYKWFSLAAAQ 1068
>gi|354594571|ref|ZP_09012610.1| hypothetical protein CIN_13060 [Commensalibacter intestini A911]
gi|353672247|gb|EHD13947.1| hypothetical protein CIN_13060 [Commensalibacter intestini A911]
Length = 256
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 61/130 (46%), Gaps = 16/130 (12%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQA 170
SE A + K AD G++EA Y LG + G G + L KAA A
Sbjct: 49 SEKAFEYYKKAADQGSIEAQYNLG-----WMYANGQGTTKDYEKAYELFQKAADEGLPAA 103
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
YSLA++ ++G G ++D + + +AA G+ +A LG+ ++G V ++ +
Sbjct: 104 QYSLAIMYWHGQG---VEQDRQKSIEYYEKAAAQGYPEAEANLGYFYKNGNEVTKDSFKA 160
Query: 231 RRFLVQANAR 240
L +A A+
Sbjct: 161 VELLQKAAAQ 170
>gi|397611044|gb|EJK61143.1| hypothetical protein THAOC_18413, partial [Thalassiosira oceanica]
Length = 430
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 2/85 (2%)
Query: 153 GSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRE 212
S +++ K A+A+Y L F G G KD+ + L +AA LG +DAL +
Sbjct: 106 ASALAMIQKRVNKGDAEAIYHLGNQYFYGLLGF--TKDVPRAIELWTQAAELGSLDALND 163
Query: 213 LGHCLQDGYGVRQNIAEGRRFLVQA 237
LGH +G GV+Q+ G R QA
Sbjct: 164 LGHMYYNGDGVQQDKPRGIRHWQQA 188
>gi|375335521|ref|ZP_09776865.1| hypothetical protein SbacW_00584 [Succinivibrionaceae bacterium
WG-1]
Length = 363
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 48/111 (43%), Gaps = 6/111 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
E A R+L N E Y L + + + + +G +LM KAA ++ ALY LA
Sbjct: 164 EEAARWLTKGVQQNNYECYYRLAKMHIDGMFSIADFNTGINLMTKAAEENNTNALYFLAS 223
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
G KN + + C +AA L HI A LG G V QN+
Sbjct: 224 AYKEGKIIQKN---ISKAINYCQKAASLNHIKATTMLGKMFYKGEDVEQNL 271
>gi|318063738|ref|NP_001188026.1| programmed cell death protein 2 [Ictalurus punctatus]
gi|308324631|gb|ADO29450.1| programmed cell death protein 2 [Ictalurus punctatus]
Length = 355
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 7/51 (13%)
Query: 324 GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G G++LC GC + CS C V YCS+ QA+DWK +HK +C
Sbjct: 138 GSGVKLCRLCGCAG-------HKACSRCHTVTYCSKEHQAIDWKKQHKKEC 181
>gi|237746461|ref|ZP_04576941.1| Sel1 repeat-containing protein [Oxalobacter formigenes HOxBLS]
gi|229377812|gb|EEO27903.1| Sel1 repeat-containing protein [Oxalobacter formigenes HOxBLS]
Length = 827
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 66/142 (46%), Gaps = 9/142 (6%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVI 177
+A ++L+ A + +A + LGM+ + + S KAA S H A Y LA++
Sbjct: 470 AARKWLEKAAARNDRQARFELGMMAKNGIGMPADPASAREWFEKAAQSGHRNAAYQLALL 529
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
QF G GG +N + A + R A G A LG+ G G+ + E + +A
Sbjct: 530 QFAGKGGPENRQ---AAIKEFTRLAEEGFPPAQYTLGYLTLKGDGIPPDPEEAATWFSKA 586
Query: 238 NAR-ELAAV--LSSAACQGIST 256
A+ ++ A+ L + +GI T
Sbjct: 587 AAQNDIRAISALGYLSLKGIGT 608
>gi|294661164|ref|YP_003573039.1| hypothetical protein Aasi_1584 [Candidatus Amoebophilus asiaticus
5a2]
gi|227336314|gb|ACP20911.1| hypothetical protein Aasi_1584 [Candidatus Amoebophilus asiaticus
5a2]
Length = 185
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 56/125 (44%), Gaps = 23/125 (18%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGA--------SLMAKAAI 164
+KA W + A A G+ EA Y LG C+ + G G KAA
Sbjct: 17 VKAVEWYQKA-------AKQGDAEAQYILG-----CMYDDGRGVIKDEQKAFKWYQKAAG 64
Query: 165 SSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
+A+A ++L V NG G ++++K V +AA GH+ A LG + G G++
Sbjct: 65 QGYAKAQFNLGVSYANGQGIAEDEKK---AVEWYQKAAEQGHVGAQYNLGVIYEGGMGIK 121
Query: 225 QNIAE 229
QN +
Sbjct: 122 QNYKQ 126
>gi|194868459|ref|XP_001972296.1| GG15447 [Drosophila erecta]
gi|190654079|gb|EDV51322.1| GG15447 [Drosophila erecta]
Length = 403
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 34/86 (39%), Gaps = 8/86 (9%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C + A +A PA L G G CS G +P+ ++CS
Sbjct: 294 CTIFRQIATQFGAKDAPPALTILRNAINGMRGFVDEG-SYCSTCGAEKPD------KKCS 346
Query: 350 VCGAVNYCSRACQALDWKLRHKADCA 375
C AV YC R CQ L W HK CA
Sbjct: 347 KCKAVQYCDRECQRLHW-FMHKKSCA 371
>gi|397571619|gb|EJK47880.1| hypothetical protein THAOC_33372, partial [Thalassiosira oceanica]
Length = 182
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 2/74 (2%)
Query: 164 ISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGV 223
+ +A++ LAV NG G K K++R L AA LG I+AL LG+ +G GV
Sbjct: 104 LKKDPEAMFFLAVQYINGDLGLK--KNMRKAFELYTEAAELGSIEALFSLGNAYHEGKGV 161
Query: 224 RQNIAEGRRFLVQA 237
+++ A+ F +A
Sbjct: 162 QEDKAKAVEFFAKA 175
>gi|169864414|ref|XP_001838816.1| hypothetical protein CC1G_08980 [Coprinopsis cinerea okayama7#130]
gi|116500148|gb|EAU83043.1| hypothetical protein CC1G_08980 [Coprinopsis cinerea okayama7#130]
Length = 626
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 22/31 (70%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
F+ CS C +V YCSR CQ DWK RH+ +C+
Sbjct: 426 FKMCSGCHSVCYCSRTCQKEDWKRRHRNECS 456
>gi|395329216|gb|EJF61604.1| hypothetical protein DICSQDRAFT_147058 [Dichomitus squalens LYAD-421
SS1]
Length = 1179
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
C + + E RRCS C V YCS ACQ DWK HK DC
Sbjct: 1139 CRKQRSISGELRRCSGCKIVYYCSEACQKSDWK-SHKLDC 1177
>gi|302687600|ref|XP_003033480.1| hypothetical protein SCHCODRAFT_108392 [Schizophyllum commune H4-8]
gi|300107174|gb|EFI98577.1| hypothetical protein SCHCODRAFT_108392, partial [Schizophyllum
commune H4-8]
Length = 316
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 25/46 (54%), Gaps = 5/46 (10%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
LC++V C +P E R+CS C YCSR CQ DW HK C
Sbjct: 17 LCTYVKCNKP----GELRKCSKCQLAWYCSRDCQKADWP-EHKKTC 57
>gi|288963182|ref|YP_003453461.1| hypothetical protein AZL_f01570 [Azospirillum sp. B510]
gi|288915434|dbj|BAI76917.1| hypothetical protein AZL_f01570 [Azospirillum sp. B510]
Length = 452
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 63/152 (41%), Gaps = 43/152 (28%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMI------------------RFYCLQNR------ 152
N+ ++AH +L+ A G+VEA +TLG+ R NR
Sbjct: 69 NFGDAAH-WLRKAAAQGHVEAQFTLGLFLANGEGAPQDNAPPVRWQRSAAQHNRAAAEAN 127
Query: 153 -------GSGA--------SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVAL 197
G GA S KAA +HA+A+Y L + G G +D A V
Sbjct: 128 LALLFPHGLGAAPDMAEALSWYEKAAAQNHAEAIYHLGLFHTFGQG---VPQDYAAAVPW 184
Query: 198 CARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
+AA LGH A +LG G GV Q++AE
Sbjct: 185 FRKAAELGHATAQMKLGSLYAQGNGVAQDMAE 216
>gi|308322465|gb|ADO28370.1| programmed cell death protein 2 [Ictalurus furcatus]
Length = 355
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 7/51 (13%)
Query: 324 GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G G++LC GC + CS C V YCS+ QA+DWK +HK +C
Sbjct: 138 GSGVKLCRLCGCAG-------HKACSRCHTVTYCSKEHQAIDWKKQHKKEC 181
>gi|420159452|ref|ZP_14666254.1| Sel1 repeat protein, partial [Capnocytophaga ochracea str. Holt 25]
gi|394762191|gb|EJF44469.1| Sel1 repeat protein, partial [Capnocytophaga ochracea str. Holt 25]
Length = 696
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NVEA LG ++ QN G + K+A +AQ Y+LA +NG+G
Sbjct: 569 WLAKAAEQNNVEALNELGSY-YFEKQNFGQALANFQKSAQRDYAQGQYNLANCYYNGNGI 627
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++ + L AR + A LGHC G G+ Q+ + + QA
Sbjct: 628 DRSYEKAANYYKLSARKDY---APAQFRLGHCYYHGEGIEQSDSRAADWFEQA 677
>gi|392564964|gb|EIW58141.1| hypothetical protein TRAVEDRAFT_148195 [Trametes versicolor
FP-101664 SS1]
Length = 388
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 30/70 (42%), Gaps = 4/70 (5%)
Query: 311 FLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFR----RCSVCGAVNYCSRACQALDW 366
++ E R P +C+ GCG R R RC +YCS+ CQ DW
Sbjct: 221 YVEERLRRRKAAADPAAYVCAAEGCGIRGEERRALRSCAGRCPPDLKPHYCSKECQTKDW 280
Query: 367 KLRHKADCAP 376
K RHK C P
Sbjct: 281 KQRHKTICKP 290
>gi|291522026|emb|CBK80319.1| Sel1 repeat [Coprococcus catus GD/7]
Length = 457
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 40/80 (50%), Gaps = 3/80 (3%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
+AA H +A Y L+V G G +N + C AAF GH A E+G+C + G
Sbjct: 290 QAASQGHTEAQYQLSVCYDRGIGVRRN---ITEAAKWCQMAAFGGHAKAQSEIGYCYEYG 346
Query: 221 YGVRQNIAEGRRFLVQANAR 240
GV +NI E + A+A+
Sbjct: 347 QGVVRNIKEAVSWYEMASAQ 366
>gi|390604850|gb|EIN14241.1| HCP-like protein [Punctularia strigosozonata HHB-11173 SS5]
Length = 406
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 41/83 (49%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A HA + Y LA NG G +K +D L AA GH DA G C
Sbjct: 86 SLLKRLADRGHAASQYFLADCYANGIGTTKGKQDFDRAYPLFVLAAKHGHPDAAYRAGTC 145
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 146 CENGWGCRRESAKALQFFRKAAA 168
>gi|160942306|ref|ZP_02089615.1| hypothetical protein CLOBOL_07192 [Clostridium bolteae ATCC
BAA-613]
gi|158434863|gb|EDP12630.1| hypothetical protein CLOBOL_07192 [Clostridium bolteae ATCC
BAA-613]
Length = 973
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 57/123 (46%), Gaps = 12/123 (9%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG-----ASLMAKAAISSHAQALYSLAV 176
A + + A GN+ A Y LG + L+N G+G + + KAA + HA A Y+LA
Sbjct: 671 AEAYWERAAQLGNMNAQYALGKL---WLEN-GTGDQKQAVAWLEKAAEAEHASAQYALAN 726
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
I G + KD+ L RAA GH A +LG G +++ ++L Q
Sbjct: 727 IYLAGEAVA---KDVTKATELFTRAAKQGHDYAAYQLGKQFLQGEETEKDVEAAIKWLKQ 783
Query: 237 ANA 239
+ A
Sbjct: 784 SAA 786
>gi|443696861|gb|ELT97476.1| hypothetical protein CAPTEDRAFT_221453 [Capitella teleta]
Length = 1099
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 48/112 (42%), Gaps = 11/112 (9%)
Query: 265 LSQPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEW--FAARGGT 322
L Q A H R+++P+ N TG G S + +H + F+ F
Sbjct: 892 LEQEEAAGH-RSRSPSANFVYTGSG----SAVPPGIIKRSSHMTANFIESHVSFGQEEEI 946
Query: 323 PGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
P LR C CGR R C+ C VNYCS+AC+ W RHK +C
Sbjct: 947 RKPLLRYCYE--CGRSVGVR--LSACTRCKEVNYCSKACKLKAWNARHKEEC 994
>gi|195493207|ref|XP_002094318.1| GE21757 [Drosophila yakuba]
gi|194180419|gb|EDW94030.1| GE21757 [Drosophila yakuba]
Length = 403
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 35/86 (40%), Gaps = 8/86 (9%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C + A +A PA L G G CS G +P+ ++CS
Sbjct: 294 CTIFRQIATQFGAKDAPPALTILRNAINGMRGFVDEG-GYCSTCGAEKPD------KKCS 346
Query: 350 VCGAVNYCSRACQALDWKLRHKADCA 375
C AV YC R CQ L W HK +CA
Sbjct: 347 KCKAVQYCDRECQRLHW-FMHKKNCA 371
>gi|114330400|ref|YP_746622.1| Sel1 domain-containing protein [Nitrosomonas eutropha C91]
gi|114307414|gb|ABI58657.1| Sel1 domain protein repeat-containing protein [Nitrosomonas
eutropha C91]
Length = 232
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 52/119 (43%), Gaps = 17/119 (14%)
Query: 123 HRFLKL--CADAGNVEACYTLGMIRFY------------CLQNRGSGASLMAKAAISSHA 168
RFL L A+AG+ EA LG + F ++ + A +AA HA
Sbjct: 77 QRFLNLRKKAEAGDAEAQNGLGSMYFSGEAVSHDAQGNPLSKDPEAAAGWFYRAAEQGHA 136
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
A ++L ++ F G G + D V L +AA G+IDA LG G GV QN
Sbjct: 137 DAQFNLGLLYFTGEGVPQ---DKTKAVELFTKAAEQGNIDAQNNLGVIYLLGEGVEQNT 192
>gi|336377035|gb|EGO05370.1| hypothetical protein SERLA73DRAFT_43858 [Serpula lacrymans var.
lacrymans S7.3]
Length = 498
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 40/83 (48%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A H + Y LA NG G KN +D L AA GH DA G C
Sbjct: 179 SLLKRLADRGHPASQYFLADCYANGLGTVKNRQDFDRAYPLFVLAAKHGHPDAAYRAGTC 238
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 239 CENGWGCRRESAKALQFFRKAAA 261
>gi|284006465|emb|CBA71725.1| conserved Sel1 repeat protein [Arsenophonus nasoniae]
Length = 487
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 55/130 (42%), Gaps = 14/130 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQA 170
KA W E + AD G++ A L I + +N L +AA S A A
Sbjct: 195 KAKEWYEKS-------ADLGDLGALNNLAQIYEKGYGVKKNPAYAIELYRRAAYSGDAIA 247
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y++ I +G KN+ RAA GHIDA L Q GYGV QN
Sbjct: 248 QYNMGFIYDDGEYLEKNN---YQAFYWYKRAAEQGHIDAQYHLAEFYQHGYGVEQNAILA 304
Query: 231 RR-FLVQANA 239
R+ + V ANA
Sbjct: 305 RQWYEVLANA 314
>gi|114053207|ref|NP_001039574.1| programmed cell death protein 2 [Bos taurus]
gi|122138157|sp|Q2YDC9.1|PDCD2_BOVIN RecName: Full=Programmed cell death protein 2
gi|82571672|gb|AAI10284.1| Programmed cell death 2 [Bos taurus]
gi|296483811|tpg|DAA25926.1| TPA: programmed cell death protein 2 [Bos taurus]
Length = 344
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 41/101 (40%), Gaps = 24/101 (23%)
Query: 275 RNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVG 334
RNQ P N DF P E P+ + + + G LC G
Sbjct: 97 RNQLPRQN------------DFYSYEPPSEDPPSETGESVYLHLKSGA-----HLCRVCG 139
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
C P +RCS C +YCS+ Q+LDW+L HK CA
Sbjct: 140 CSGP-------KRCSRCHKAHYCSKEHQSLDWRLGHKQACA 173
>gi|397614361|gb|EJK62752.1| hypothetical protein THAOC_16624, partial [Thalassiosira oceanica]
Length = 297
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A+Y L F+G G + KD R L AA LG I+AL +LG+ + GYGV+Q++A+
Sbjct: 142 AIYYLGGKYFHGRLGLQ--KDARKAFELWTEAAELGSINALFDLGNAYRQGYGVQQDMAK 199
Query: 230 GRRFLVQA 237
+F A
Sbjct: 200 AAQFYSNA 207
>gi|328350139|emb|CCA36539.1| MYND-type zinc finger protein MUB1 [Komagataella pastoris CBS 7435]
Length = 791
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
G+R C+ + CG+ ET +F +C C YCS+ CQ+ W+ HK C +
Sbjct: 534 GVRQCACISCGKWETMPKQFAKCRRCKRTKYCSKECQSKAWEY-HKHWCVAVQ 585
>gi|297850108|ref|XP_002892935.1| ubiquitin-specific protease 15 [Arabidopsis lyrata subsp. lyrata]
gi|297338777|gb|EFH69194.1| ubiquitin-specific protease 15 [Arabidopsis lyrata subsp. lyrata]
Length = 926
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
RCS C +V YCS CQ + W++ HK +C P E
Sbjct: 140 RCSRCKSVRYCSGKCQIIHWRIAHKDECVPLE 171
>gi|254565219|ref|XP_002489720.1| Protein of unknown function, deletion causes multi-budding
phenotype [Komagataella pastoris GS115]
gi|238029516|emb|CAY67439.1| Protein of unknown function, deletion causes multi-budding
phenotype [Komagataella pastoris GS115]
Length = 783
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
G+R C+ + CG+ ET +F +C C YCS+ CQ+ W+ HK C +
Sbjct: 526 GVRQCACISCGKWETMPKQFAKCRRCKRTKYCSKECQSKAWEY-HKHWCVAVQ 577
>gi|384221991|ref|YP_005613157.1| hypothetical protein BJ6T_83260 [Bradyrhizobium japonicum USDA 6]
gi|354960890|dbj|BAL13569.1| hypothetical protein BJ6T_83260 [Bradyrhizobium japonicum USDA 6]
Length = 360
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 56/120 (46%), Gaps = 6/120 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K ADAG+ EA + L M+R ++G LMA AA +A Y+LA++
Sbjct: 137 ALEWYKRAADAGDREAMFALAMLRISGRGGSVDKGEAVKLMASAAKLGEPKAAYNLALLY 196
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
+ G +D++ L +AA G +A L ++G GV ++ R L A+
Sbjct: 197 LD---GQTLPQDVKRSAELLRQAADAGLPEAQYALATFYKEGTGVPKDPERAVRLLQAAS 253
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 51/118 (43%), Gaps = 16/118 (13%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLA 175
L+ ADAG EA Y L FY G+G L+ A+++ + A A
Sbjct: 211 ELLRQAADAGLPEAQYALAT--FY---KEGTGVPKDPERAVRLLQAASLADNVDAEVEYA 265
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+ FNG+G KN A VAL +A+ G A L L +G G + EG ++
Sbjct: 266 IAMFNGTGTPKNQP---AAVALLRKASRQGSAIAQNRLAWVLINGMGTSVDKVEGFKW 320
>gi|384500117|gb|EIE90608.1| hypothetical protein RO3G_15319 [Rhizopus delemar RA 99-880]
Length = 444
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 24/47 (51%)
Query: 328 RLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+ C++ C + E F+RC C V YCSR CQ WK HK C
Sbjct: 392 KQCANGECDKKEEEPGSFKRCGRCQRVAYCSRECQKEHWKTGHKIVC 438
>gi|298713247|emb|CBJ33540.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 301
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/53 (47%), Positives = 27/53 (50%), Gaps = 2/53 (3%)
Query: 323 PGPGLRLCSHVGCGRPETRR-HEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
PG L C C + E F CS CGA YCSR CQ +DWK HKA C
Sbjct: 84 PGLCLASCQRAACNKTEEPTGRPFCCCSKCGA-KYCSRECQVIDWKKDHKAMC 135
>gi|384497433|gb|EIE87924.1| hypothetical protein RO3G_12635 [Rhizopus delemar RA 99-880]
Length = 382
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 31/58 (53%)
Query: 317 AARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
A R GLR C+++ C + ET+ EF +C C YCS++CQ+ W H+ C
Sbjct: 235 ACRKDETRGGLRRCANLSCRKWETQPREFAKCRRCRKAKYCSKSCQSKAWADGHRWWC 292
>gi|374572876|ref|ZP_09645972.1| TPR repeat-containing protein [Bradyrhizobium sp. WSM471]
gi|374421197|gb|EHR00730.1| TPR repeat-containing protein [Bradyrhizobium sp. WSM471]
Length = 364
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 55/116 (47%), Gaps = 6/116 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A ++ K +DAG+ EA + L M+R ++ LMA AA +A Y+LA++
Sbjct: 141 ALQWYKRASDAGDREAMFALAMLRMTGRGGPVDKNEAVKLMASAAKLGEPKAAYNLALLY 200
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
+ G +D++ L +AA G +A L ++G GV +++ R L
Sbjct: 201 LD---GQTLPQDVKRSAELLRQAADAGLPEAQYALATFYKEGTGVPKDLERAVRLL 253
>gi|409052411|gb|EKM61887.1| hypothetical protein PHACADRAFT_248791 [Phanerochaete carnosa
HHB-10118-sp]
Length = 451
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 40/83 (48%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A H + Y LA NG G KN +D L AA GH DA G C
Sbjct: 132 SLLKRNADRGHMPSQYFLADCYANGIGTHKNRQDFDRAYPLFVLAAKHGHPDAAYRAGTC 191
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 192 CENGWGCRRESAKALQFYRKAAA 214
>gi|429747727|ref|ZP_19280977.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 380 str. F0488]
gi|429162529|gb|EKY04844.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 380 str. F0488]
Length = 815
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NVEA LG ++ QN G + K+A +AQ Y+LA +NG+G
Sbjct: 688 WLAKAAEQNNVEALNELGSY-YFEKQNFGQALANFQKSAQRDYAQGQYNLANCYYNGNGI 746
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++ + L AR + A LGHC G G+ Q+ + + QA
Sbjct: 747 DRSYEKAANYYKLSARKDY---APAQFRLGHCYYHGEGIEQSDSRAADWFEQA 796
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 49/110 (44%), Gaps = 17/110 (15%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLA 175
+ +KL A+ G+V+A L F +G G + K A + A Y LA
Sbjct: 25 KLIKL-AEKGDVQAQTELADAYF-----KGKGVRRSYQEAVVWLEKVAETGDLNAQYQLA 78
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQ 225
FNG G K+ + GV + A G+ +A REL C +DG GV Q
Sbjct: 79 QCYFNGKGVPKSP---QKGVEWLTKVADAGNPEAQRELALCYRDGKGVEQ 125
>gi|389750409|gb|EIM91580.1| hypothetical protein STEHIDRAFT_137218 [Stereum hirsutum FP-91666
SS1]
Length = 328
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL--DEEGNGDV 389
RCS C YCS ACQ+ DWK K CAP E + DE +GD+
Sbjct: 272 RCSRCKWAFYCSPACQSADWKRHKKEPCAPLEEMVANDELWHGDI 316
>gi|50552107|ref|XP_503528.1| YALI0E04114p [Yarrowia lipolytica]
gi|49649397|emb|CAG79107.1| YALI0E04114p [Yarrowia lipolytica CLIB122]
Length = 749
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+H CG+ E +F +C C YCSR CQ W HK C+
Sbjct: 606 GIRQCAHFDCGKWEEFPRQFAKCRRCKRTKYCSRECQLKAWNY-HKHWCS 654
>gi|393227475|gb|EJD35152.1| hypothetical protein AURDEDRAFT_130788 [Auricularia delicata
TFB-10046 SS5]
Length = 466
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 33/63 (52%), Gaps = 9/63 (14%)
Query: 328 RLCSHVGCGRPETRRHE----FRRCSVCGAVNYCSRACQALDW----KLRHKADCAPAER 379
R CS GCG + HE FR C+ C V+YCSRACQ DW ++ H CA R
Sbjct: 336 RRCSGPGCGL-YVQEHEPGMAFRPCADCLVVHYCSRACQRRDWNGGSRVAHAQVCAAIRR 394
Query: 380 WLD 382
+D
Sbjct: 395 LVD 397
>gi|383769362|ref|YP_005448425.1| hypothetical protein S23_10940 [Bradyrhizobium sp. S23321]
gi|381357483|dbj|BAL74313.1| hypothetical protein S23_10940 [Bradyrhizobium sp. S23321]
Length = 360
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 56/120 (46%), Gaps = 6/120 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K ADAG+ EA + L M+R ++G LMA +A +A Y+LA++
Sbjct: 137 ALEWYKRAADAGDREAMFALAMMRMSGRGGPVDKGEAVKLMASSAKLGEPKAAYNLALLY 196
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
+ G +D++ L +AA G +A L ++G GV ++ R L A+
Sbjct: 197 LD---GQTLPQDVKRSAELLRQAADAGLPEAQYALATFYKEGTGVPKDAERAVRLLQAAS 253
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 51/117 (43%), Gaps = 16/117 (13%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLAV 176
L+ ADAG EA Y L FY G+G L+ A+++ + A A+
Sbjct: 212 LLRQAADAGLPEAQYALAT--FY---KEGTGVPKDAERAVRLLQAASLADNVDAEVEYAI 266
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
FNG+G KN A VAL +A+ G A L L +G G + EG ++
Sbjct: 267 AMFNGTGTPKNQP---AAVALLRKASRQGSAIAQNRLAWVLINGMGTSVDKVEGFKW 320
>gi|291235927|ref|XP_002737895.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 552
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 66/164 (40%), Gaps = 26/164 (15%)
Query: 54 KSDLFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAI 113
+ D + P+ L++ + K S A G LN++ L+ K+ F
Sbjct: 162 QQDSTEQTPEQLLLEVRSKFESLA-------------DESTGTVLNAVGLNYVEKRNFVK 208
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQA 170
A + +++ D G+ +A Y LG+ + Q+ A +AA H A
Sbjct: 209 AAEEFRQAS--------DMGHGKAMYNLGICYEQGMGVSQSLAKAAEYYKQAADKGHPMA 260
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELG 214
LY+LAV G G K KD + + L AA G + A LG
Sbjct: 261 LYNLAVFHLMGLAGLK--KDTQKAIDLMENAAEQGLLQAQSYLG 302
>gi|118350855|ref|XP_001008706.1| sel1 repeat protein [Tetrahymena thermophila]
gi|89290473|gb|EAR88461.1| sel1 repeat protein [Tetrahymena thermophila SB210]
Length = 287
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 61/135 (45%), Gaps = 6/135 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
E A + A+ G ++ Y LG + + QN A+ H +AL+ L +
Sbjct: 52 EKAIEWYTKSAEKGEAKSQYCLGNLYYQGAAVQQNFQEAIKWYNLASKQGHDKALFQLGL 111
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+Q G G + D + G+ ++ G+ DA LG+ ++G GV+ N E ++ +
Sbjct: 112 MQIFGQGFKQ---DFQKGIDYFKKSGERGNQDAYNNLGNMYREGTGVKVNYEEAVKYYLM 168
Query: 237 ANARELAAVLSSAAC 251
A E AA +++ A
Sbjct: 169 ACEGECAAAMANLAT 183
>gi|393240860|gb|EJD48384.1| hypothetical protein AURDEDRAFT_162347 [Auricularia delicata
TFB-10046 SS5]
Length = 377
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 31/61 (50%), Gaps = 5/61 (8%)
Query: 328 RLCSHVGCGRP--ETRRHEFRRCSVCGAVNYCSRACQALDWK---LRHKADCAPAERWLD 382
R CS GCG+ + F C+ C AV YCSRACQ DWK RHK C R L
Sbjct: 195 RGCSSPGCGKTVHDNGGRPFPTCANCKAVQYCSRACQQRDWKTGQYRHKDICPLLCRLLA 254
Query: 383 E 383
E
Sbjct: 255 E 255
>gi|190348373|gb|EDK40815.2| hypothetical protein PGUG_04913 [Meyerozyma guilliermondii ATCC
6260]
Length = 736
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G+R C ++ CG+ E+ EF +C C YCSR CQ W H+ C P
Sbjct: 644 GVRQCGNLECGKWESYPREFSKCRRCKRTKYCSRECQMRAWHC-HRNWCIP 693
>gi|429755011|ref|ZP_19287692.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 324 str. F0483]
gi|429175949|gb|EKY17361.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 324 str. F0483]
Length = 815
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NVEA LG ++ QN G + K+A +AQ Y+LA +NG+G
Sbjct: 688 WLAKAAEQNNVEALNELGSY-YFEKQNFGQALANFQKSAQRDYAQGQYNLANCYYNGNGI 746
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++ + L AR + A LGHC G G+ Q+ + + QA
Sbjct: 747 DRSYEKAANYYKLSARKDY---APAQFRLGHCYYHGEGIEQSDSRAADWFEQA 796
Score = 38.9 bits (89), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 50/110 (45%), Gaps = 17/110 (15%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLA 175
+ +KL A+ G+V+A L F +G G + K A + +A Y LA
Sbjct: 25 KLIKL-AEKGDVQAQTELADAYF-----KGKGVRRSYQEAVVWLEKVAETGDLKAQYQLA 78
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQ 225
FNG G K+ + GV + A G+ +A REL C +DG GV Q
Sbjct: 79 QCYFNGKGIPKSP---QKGVEWLTKVADAGNPEAQRELALCYRDGKGVEQ 125
>gi|397647870|gb|EJK77894.1| hypothetical protein THAOC_00243 [Thalassiosira oceanica]
Length = 317
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
QA+Y LA G G + KD+R V L AA LG +DAL LG+ G+GV+Q+ A
Sbjct: 149 QAIYFLAQQYCFGKKGLQ--KDMRKAVELFTEAAELGSVDALFSLGNAYHQGHGVQQDNA 206
Query: 229 EGRRFLVQA 237
+ F +A
Sbjct: 207 KAVEFYKKA 215
>gi|116255798|ref|YP_771631.1| putative exported peptidase [Rhizobium leguminosarum bv. viciae
3841]
gi|115260446|emb|CAK03550.1| putative exported peptidase [Rhizobium leguminosarum bv. viciae
3841]
Length = 859
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 60/122 (49%), Gaps = 6/122 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY--CLQNRGSGASLMAK-AAISSHAQAL 171
AN +++A +K DAG+V A L + + + + AS +AK A AL
Sbjct: 555 ANRDTKTAFATIKKAMDAGHVRAIQELSRLYEFGASVPRDAAKASQIAKDGAAKGDPFAL 614
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
+S ++G G +K D+ L G+ L +AA LGH A+ ELG+ +G V + G
Sbjct: 615 HSYGKALYHGRG-TKADQQL--GLRLMLQAADLGHTYAMNELGYIFLNGVNVPADPERGI 671
Query: 232 RF 233
RF
Sbjct: 672 RF 673
>gi|401415092|ref|XP_003872042.1| MYND finger domain-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488264|emb|CBZ23509.1| MYND finger domain-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 169
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 35/81 (43%), Gaps = 3/81 (3%)
Query: 298 CNVPAPEAHPASRFLAEWFAA--RGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVN 355
CN EA SR+L + A G P P H G T +RCS C
Sbjct: 89 CNSANCEAVAKSRYLMDMEDAVTAQGMPSPNGCFHCHKGAQGAATTTAPLQRCSRCKVAK 148
Query: 356 YCSRACQALDWKLRHKADCAP 376
YCS CQ DWK+ HK C+P
Sbjct: 149 YCSVECQKADWKV-HKQVCSP 168
>gi|302677412|ref|XP_003028389.1| hypothetical protein SCHCODRAFT_112758 [Schizophyllum commune H4-8]
gi|300102077|gb|EFI93486.1| hypothetical protein SCHCODRAFT_112758 [Schizophyllum commune H4-8]
Length = 560
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 31/49 (63%), Gaps = 3/49 (6%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+R C + C R + HE +RC+ CG V YCS++CQA D + RHKA C
Sbjct: 426 SIRFCHNNSCLRSHDQ-HELKRCA-CGRVYYCSKSCQAAD-RHRHKAIC 471
>gi|290990111|ref|XP_002677680.1| predicted protein [Naegleria gruberi]
gi|284091289|gb|EFC44936.1| predicted protein [Naegleria gruberi]
Length = 1179
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 1/45 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
CG+ + +RCS C +V YCS CQ DWK HK C +
Sbjct: 1005 CGKGSSSESSLKRCSACNSVRYCSVECQKKDWK-NHKEKCKKIQE 1048
>gi|222055441|ref|YP_002537803.1| Sel1 domain-containing protein repeat-containing protein [Geobacter
daltonii FRC-32]
gi|221564730|gb|ACM20702.1| Sel1 domain protein repeat-containing protein [Geobacter daltonii
FRC-32]
Length = 271
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 40/85 (47%), Gaps = 6/85 (7%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
QNR SL+ KAA +AQA Y L + F G G +KD R + AA H A
Sbjct: 64 QNR---VSLIKKAARDGYAQAQYELGCMLFTGWG---IEKDRREAIRWFLEAAGHHHAQA 117
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFL 234
LG G GVRQ+ EG R+
Sbjct: 118 QNALGLAYSSGEGVRQDDTEGARWF 142
>gi|315224578|ref|ZP_07866404.1| Sel1 domain protein repeat-containing protein [Capnocytophaga
ochracea F0287]
gi|314945444|gb|EFS97467.1| Sel1 domain protein repeat-containing protein [Capnocytophaga
ochracea F0287]
Length = 815
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NVEA LG ++ QN G + K+A +AQ Y+LA +NG+G
Sbjct: 688 WLAKAAEQNNVEALNELGSY-YFEKQNFGQALANFQKSAQRDYAQGQYNLANCYYNGNGI 746
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++ + L AR + A LGHC G G+ Q+ + + QA
Sbjct: 747 DRSYEKAANYYKLSARKDY---APAQFRLGHCYYHGEGIEQSDSRAADWFEQA 796
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 49/110 (44%), Gaps = 17/110 (15%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLA 175
+ +KL A+ G+V+A L F +G G + K A + A Y LA
Sbjct: 25 KLIKL-AEKGDVQAQTELADAYF-----KGKGVRRSYQEAVVWLEKVAETGDLNAQYQLA 78
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQ 225
FNG G K+ + GV + A G+ +A REL C +DG GV Q
Sbjct: 79 QCYFNGKGVPKSP---QKGVEWLTKVADAGNPEAQRELALCYRDGKGVEQ 125
>gi|303237720|ref|ZP_07324280.1| Sel1 repeat protein [Prevotella disiens FB035-09AN]
gi|302482172|gb|EFL45207.1| Sel1 repeat protein [Prevotella disiens FB035-09AN]
Length = 267
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 54/127 (42%), Gaps = 6/127 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
E AH + L A+ G V A +GM F QN + K+A A + L
Sbjct: 44 EEAHHYYLLAANEGLVAAYLEVGMDNFSGCGTAQNYAEALKWVRKSAEVGEANGQFQLGE 103
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR-FLV 235
F G+ G +KD V AA + DA+ L +C + G GV++N E +L+
Sbjct: 104 FYFYGNCGL--EKDATKAVEWFLLAAKQSYSDAITALAYCYEQGIGVQENKEEAYNWYLL 161
Query: 236 QANAREL 242
A EL
Sbjct: 162 LAKEGEL 168
>gi|432897213|ref|ZP_20108209.1| hypothetical protein A13U_00950 [Escherichia coli KTE192]
gi|433027468|ref|ZP_20215344.1| hypothetical protein WIA_00551 [Escherichia coli KTE109]
gi|431430023|gb|ELH11857.1| hypothetical protein A13U_00950 [Escherichia coli KTE192]
gi|431545978|gb|ELI20621.1| hypothetical protein WIA_00551 [Escherichia coli KTE109]
Length = 717
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 5/115 (4%)
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
+K A GN A +TLG F C N G + K+A + A+Y LA + G+G
Sbjct: 555 IKDNATKGNSSAQFTLGKYEFTCA-NIDEGIKWLTKSAEQKNTDAIYFLATA-YKGNGIP 612
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
N++ + +AA LG+ +A E+G+ G + QN+ + + +A A+
Sbjct: 613 ANNEKY---ITYLQQAAMLGNSNAQAEIGYLYLIGKELPQNLPDAGDWFKKAAAQ 664
>gi|393227470|gb|EJD35147.1| hypothetical protein AURDEDRAFT_175780 [Auricularia delicata
TFB-10046 SS5]
Length = 466
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 34/71 (47%), Gaps = 7/71 (9%)
Query: 328 RLCSHVGCGRPETRRHE---FRRCSVCGAVNYCSRACQALDW----KLRHKADCAPAERW 380
R CS GCG +R FR C+ C V+YCSR CQ DW ++ H CA R
Sbjct: 336 RRCSGPGCGLFVHQREAGVVFRPCAGCRVVHYCSRGCQRHDWHGGAQVTHARVCAAIRRL 395
Query: 381 LDEEGNGDVDV 391
+D G V V
Sbjct: 396 VDAPDYGAVYV 406
>gi|354594378|ref|ZP_09012417.1| hypothetical protein CIN_11130 [Commensalibacter intestini A911]
gi|353672054|gb|EHD13754.1| hypothetical protein CIN_11130 [Commensalibacter intestini A911]
Length = 213
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 65/139 (46%), Gaps = 12/139 (8%)
Query: 100 SLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL-----QNRGS 154
++ L S KTFA + ++S+ +K AD GNV A LGM Y L +
Sbjct: 16 TVTLIHFSSKTFADET--VTKSSLMLVKERADQGNVNAQLKLGMT--YVLGQGVSADYQK 71
Query: 155 GASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELG 214
A KAA +A A Y+L + + G G ++D + + +AA G++ AL +LG
Sbjct: 72 AAEYFNKAANQGNAFAQYNLGSMYYYGKGVPQDD---QKAIEYFNKAADQGNVSALTQLG 128
Query: 215 HCLQDGYGVRQNIAEGRRF 233
+G GV Q+ + +
Sbjct: 129 VIYAEGQGVSQDYQKAAEY 147
>gi|345309407|ref|XP_001520462.2| PREDICTED: programmed cell death protein 2-like [Ornithorhynchus
anatinus]
Length = 264
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 42/89 (47%), Gaps = 16/89 (17%)
Query: 324 GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA-------- 375
G G+ LC GC P + C+ C +YCSR Q LDWK+ HK C
Sbjct: 38 GSGVWLCRVCGCPGP-------KACARCHRAHYCSRHHQTLDWKMGHKQACPLPEDLDGA 90
Query: 376 -PAERWLDEEGNGDVDVDLNDAEGGGEDN 403
P ++L E ++ + D++G GE++
Sbjct: 91 IPDHQFLFPEYEIVIEAEEPDSDGEGEED 119
>gi|86751571|ref|YP_488067.1| Sel1-like protein [Rhodopseudomonas palustris HaA2]
gi|86574599|gb|ABD09156.1| Sel1-like protein [Rhodopseudomonas palustris HaA2]
Length = 366
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 10/115 (8%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
L++ ADAGN EA Y L FY +N L+ AA++ + A A+
Sbjct: 217 ELLRVAADAGNPEAQYALAT--FYKEGTGVEKNVEQSVRLLQAAAVAGNVPAEVEYAIAL 274
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+NG+G KN+ A VAL +AA + A L H L G G ++ E ++
Sbjct: 275 YNGTGTVKNEP---AAVALLRKAARANNPIAQNRLAHVLLSGQGAPRDPVEAIKW 326
>gi|416242044|ref|ZP_11633178.1| tetratricopeptide repeat family protein [Moraxella catarrhalis BC7]
gi|326571605|gb|EGE21620.1| tetratricopeptide repeat family protein [Moraxella catarrhalis BC7]
Length = 302
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KAA HA+A Y+L V+ + G G + D V +AA G A LG +G
Sbjct: 209 KAAGQGHAEAQYNLGVMYYEGQGVRQ---DYHKAVEWFTKAANQGFAQAQNNLGVMYDEG 265
Query: 221 YGVRQNIAEGRRFLVQA 237
GVRQNIA + + QA
Sbjct: 266 QGVRQNIATAKIYYGQA 282
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 16/124 (12%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGA--------SLMAKAAISSHAQALYS 173
A + A+ G EA ++LG++ + G G KAA +A A ++
Sbjct: 131 AFEWFTKAANQGLAEAQFSLGVMY-----DEGQGVRQDYYKAVEWYTKAANQGYAGAQFN 185
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LA++ + G G ++D++ V +AA GH +A LG +G GVRQ+ + +
Sbjct: 186 LALMYYEGQGVRQDDQE---AVEWYTKAAGQGHAEAQYNLGVMYYEGQGVRQDYHKAVEW 242
Query: 234 LVQA 237
+A
Sbjct: 243 FTKA 246
>gi|237750801|ref|ZP_04581281.1| Sel1 domain-containing protein repeat-containing protein
[Helicobacter bilis ATCC 43879]
gi|229373246|gb|EEO23637.1| Sel1 domain-containing protein repeat-containing protein
[Helicobacter bilis ATCC 43879]
Length = 326
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 54/115 (46%), Gaps = 7/115 (6%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQAL 171
A ++ ++ F K+CAD ACY LG++ Y Q+ L KA H +A
Sbjct: 193 AKDYDKAREFFTKVCAD-NRAGACYNLGILFDYGYGVEQSYPEAIRLYTKACDMHHIKAC 251
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
YSL ++ G G + D + L ++ +GH +A +G DG GVR++
Sbjct: 252 YSLGIMYNKGDG---VNIDYPKALGLYLKSCNMGHSNACYNIGAMYYDGMGVRRD 303
Score = 41.2 bits (95), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 75/196 (38%), Gaps = 18/196 (9%)
Query: 119 SESAHRFLKLCADAGNVE---ACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALY 172
S+ + L L A NV+ AC LG + L ++ L K+ A+ Y
Sbjct: 85 SQDYKQSLDLYTQACNVDHAAACLNLGYLYMGGLGVEKDEKKARDLYIKSCDGKKAEGCY 144
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
SL + F G GG D+D L A+A GH DA LG G G+ ++ + R
Sbjct: 145 SLGNLYFYGKGG---DRDYEKAADLYAKACEYGHDDACDNLGVMYAKGEGIAKDYDKARE 201
Query: 233 FLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAP---HPRNQNPNHNGHVTGIG 289
F + A A AC + + + Q +P + + + +H +G
Sbjct: 202 FFTKVCADNRA-----GACYNLGILFDYGYG-VEQSYPEAIRLYTKACDMHHIKACYSLG 255
Query: 290 CPLLSDFGCNVPAPEA 305
G N+ P+A
Sbjct: 256 IMYNKGDGVNIDYPKA 271
>gi|149240561|ref|XP_001526156.1| hypothetical protein LELG_02714 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450279|gb|EDK44535.1| hypothetical protein LELG_02714 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 809
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G+R C ++ CG+ E EF +C C YCSR CQ W H+ C P
Sbjct: 718 GIRQCGNLECGKWEKYPREFLKCRRCKRTKYCSRDCQMKSWHC-HRNWCIP 767
>gi|50291873|ref|XP_448369.1| hypothetical protein [Candida glabrata CBS 138]
gi|49527681|emb|CAG61330.1| unnamed protein product [Candida glabrata]
Length = 602
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 23/42 (54%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWK 367
G+R C++ CG+ E EF +C C YCSR CQ WK
Sbjct: 479 GVRQCANFSCGKWERYPREFAKCRRCKRTKYCSRECQLEAWK 520
>gi|365891959|ref|ZP_09430314.1| putative Beta-lactamase [Bradyrhizobium sp. STM 3809]
gi|365332055|emb|CCE02845.1| putative Beta-lactamase [Bradyrhizobium sp. STM 3809]
Length = 365
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 12/121 (9%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRG------SGASLMAKAAISSHAQALYS 173
+ A + K A+AG+ EA + L M+R L RG L+A AA +A Y+
Sbjct: 139 DKAAIWYKRAAEAGDREAMFALAMMR---LAGRGGPVDKPEAVKLLASAAKLGEPKAAYN 195
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LA++ + G +DLR L AA G+ +A L ++G GV +++ + R
Sbjct: 196 LALLYLD---GQTLPQDLRRSAELLRMAADAGNAEAQYALATFYKEGTGVPKDLEKAARL 252
Query: 234 L 234
L
Sbjct: 253 L 253
Score = 38.1 bits (87), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 34/69 (49%), Gaps = 7/69 (10%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
L++ ADAGN EA Y L FY ++ A L+ AAI+ + A A+
Sbjct: 215 ELLRMAADAGNAEAQYALAT--FYKEGTGVPKDLEKAARLLQAAAIADNVDAEVEYAIAL 272
Query: 179 FNGSGGSKN 187
FNG+G K+
Sbjct: 273 FNGTGVPKD 281
>gi|307109958|gb|EFN58195.1| hypothetical protein CHLNCDRAFT_142043 [Chlorella variabilis]
Length = 377
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 26/54 (48%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 323 PGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKADCA 375
P P + +CS GCG+ T ++CS C AV YCSRAC A WK HK CA
Sbjct: 316 PPPPVLVCS--GCGQLST---SLKKCSRCRAVAYCSRACHARHWKEGGHKQQCA 364
>gi|424875215|ref|ZP_18298877.1| hypothetical protein Rleg5DRAFT_6787 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393170916|gb|EJC70963.1| hypothetical protein Rleg5DRAFT_6787 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 859
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 52/122 (42%), Gaps = 6/122 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQAL 171
AN +++A + DAG+V A L + N + + A A AL
Sbjct: 555 ANRDAKTAFATINKAMDAGHVRAISELSALYLVGASVPANLDKSSEIAAIGAKKGDPYAL 614
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+ + G G D + G+ L +AA LGH A+ ELG+ +G V +I G
Sbjct: 615 YAYGKSLYYGRG---TKADTQEGLKLMLQAADLGHTYAMNELGYIFFNGVSVPPDIERGI 671
Query: 232 RF 233
RF
Sbjct: 672 RF 673
>gi|303247415|ref|ZP_07333687.1| Sel1 domain protein repeat-containing protein [Desulfovibrio
fructosovorans JJ]
gi|302491111|gb|EFL51003.1| Sel1 domain protein repeat-containing protein [Desulfovibrio
fructosovorans JJ]
Length = 461
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 47/105 (44%), Gaps = 8/105 (7%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A R + AD GNV A LG++ + Q+ ++ KAA A+ALY L +++
Sbjct: 76 AARLYRRAADGGNVSAMVALGLLYYRGEGVGQSDARASAYFRKAADKGSARALYLLGLMR 135
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGV 223
+G GG D V RA G +A LG L G GV
Sbjct: 136 LSGRGGPTADA-----VGYLRRAVRAGSPEAAAVLGELLLAGRGV 175
>gi|256819152|ref|YP_003140431.1| Sel1 domain-containing protein repeat-containing protein
[Capnocytophaga ochracea DSM 7271]
gi|256580735|gb|ACU91870.1| Sel1 domain protein repeat-containing protein [Capnocytophaga
ochracea DSM 7271]
Length = 811
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NVEA LG ++ QN G + K+A +AQ Y+LA +NG+G
Sbjct: 684 WLAKAAEQNNVEALNELGSY-YFEKQNFGQALANFQKSAQRDYAQGQYNLANCYYNGNGI 742
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++ + L AR + A LGHC G G+ Q+ + + QA
Sbjct: 743 DRSYEKAANYYKLSARKDY---APAQFRLGHCYYHGEGIEQSDSRAADWFEQA 792
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 50/110 (45%), Gaps = 17/110 (15%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLA 175
+ +KL A+ G+V+A L F +G G + K A + A Y LA
Sbjct: 25 KLIKL-AEKGDVQAQTELADAYF-----KGKGVRRSYQEAVVWLEKVAETGDLNAQYQLA 78
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQ 225
FNG G SK+ + GV + A G+ +A REL C +DG GV Q
Sbjct: 79 QCYFNGKGVSKSP---QKGVEWLTKVADAGNPEAQRELALCYRDGKGVEQ 125
>gi|397631833|gb|EJK70303.1| hypothetical protein THAOC_08347, partial [Thalassiosira oceanica]
Length = 351
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A+ +LA F+G G + KD R V L AA LG IDAL ELG+ +G GV+Q+ A
Sbjct: 196 AINNLAQQYFHGGLGLQ--KDARRAVKLWEEAAELGSIDALFELGNTYHEGEGVQQDEAM 253
Query: 230 GRRFLVQA 237
F ++A
Sbjct: 254 AVEFYMKA 261
>gi|393237003|gb|EJD44548.1| hypothetical protein AURDEDRAFT_166334 [Auricularia delicata
TFB-10046 SS5]
Length = 296
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 5/50 (10%)
Query: 329 LCSHV--GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
+HV CG+ R +RCS+C YC++ CQ DW RHK C P
Sbjct: 239 FATHVCYKCGK---RDVAMKRCSLCRKATYCTKECQKADWAARHKEACIP 285
>gi|336390143|gb|EGO31286.1| protoplast regeneration and killer toxin resistance protein SKT5
[Serpula lacrymans var. lacrymans S7.9]
Length = 356
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 41/85 (48%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A H + Y LA NG G KN +D L AA GH DA G C
Sbjct: 37 SLLKRLADRGHPASQYFLADCYANGLGTVKNRQDFDRAYPLFVLAAKHGHPDAAYRAGTC 96
Query: 217 LQDGYGVRQNIAEGRRFLVQANARE 241
++G+G R+ A+ +F +A A +
Sbjct: 97 CENGWGCRRESAKALQFFRKAAAAQ 121
>gi|303271413|ref|XP_003055068.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463042|gb|EEH60320.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 224
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 1/94 (1%)
Query: 145 RFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG-SGGSKNDKDLRAGVALCARAAF 203
F G K+A + ++ A F G +GG+ +++ V RA+
Sbjct: 64 EFRSAHKHGKAIYWFEKSADHGDPEGMFWFASYTFAGYTGGACTEENYNKAVHWWKRASE 123
Query: 204 LGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
LGH +A + + C +DG+GV ++ A+ + V+A
Sbjct: 124 LGHAEATKMVATCYEDGHGVERDFAKAIEWHVKA 157
>gi|389749476|gb|EIM90647.1| hypothetical protein STEHIDRAFT_90479 [Stereum hirsutum FP-91666
SS1]
Length = 1075
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 28/50 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E+ EF +C C YC + CQ+ W H+ C+
Sbjct: 736 GIRQCANMLCGRWESYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS 785
>gi|357451869|ref|XP_003596211.1| Ubiquitin carboxyl-terminal hydrolase [Medicago truncatula]
gi|355485259|gb|AES66462.1| Ubiquitin carboxyl-terminal hydrolase [Medicago truncatula]
Length = 983
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 20/32 (62%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
RCS C V YCS CQ + W+L HK +C P E
Sbjct: 97 RCSRCKLVRYCSGNCQIIHWRLFHKQECLPLE 128
>gi|346970125|gb|EGY13577.1| hypothetical protein VDAG_00259 [Verticillium dahliae VdLs.17]
Length = 220
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 31/59 (52%), Gaps = 7/59 (11%)
Query: 317 AARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW-KLRHKADC 374
AARGG +C+ GC + ET RC+ CG YCS+ CQ DW + HK DC
Sbjct: 156 AARGGE----AAVCA--GCEKEETEGDALMRCTGCGRARYCSKVCQKKDWGEGGHKVDC 208
>gi|384500587|gb|EIE91078.1| hypothetical protein RO3G_15789 [Rhizopus delemar RA 99-880]
Length = 623
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 49/101 (48%), Gaps = 14/101 (13%)
Query: 130 ADAGNVEACYTLGMIRFYCLQN-------RGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
A+ G + A LG YC QN GA ++AA HA+A ++L NG
Sbjct: 278 AEQGYIWAQCNLG----YCYQNGIGIDKDVVQGAYWYSQAATQGHARAQHNLGFCYQNGI 333
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGV 223
G +K DL+ + +AA G+I A LG+C Q+G GV
Sbjct: 334 GVTK---DLKMAIFWYKKAAEQGNIFAYHSLGYCYQNGLGV 371
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 50/110 (45%), Gaps = 14/110 (12%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCL-------QNRGSGASLMAKAAISSHAQALYSLAVI 177
+ KL AD GN A +LG YC Q++ A ++A + A +L
Sbjct: 237 WYKLAADQGNAFAQNSLG----YCYEDGIGIKQDKAMAAFWYRRSAEQGYIWAQCNLGYC 292
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
NG G DKD+ G ++AA GH A LG C Q+G GV +++
Sbjct: 293 YQNGIG---IDKDVVQGAYWYSQAATQGHARAQHNLGFCYQNGIGVTKDL 339
>gi|397623678|gb|EJK67100.1| hypothetical protein THAOC_11911 [Thalassiosira oceanica]
Length = 183
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 3/82 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A + A+ A H +A++ L F G G + KD+R GV L A + G +DAL LG
Sbjct: 10 AMVRARVAKRDH-EAIFFLGQQYFYGQLGLQ--KDMRKGVELYTEAVYFGSVDALFNLGL 66
Query: 216 CLQDGYGVRQNIAEGRRFLVQA 237
G GV++++A+ F +A
Sbjct: 67 AYYKGEGVQKDVAKAAEFYEKA 88
>gi|393227360|gb|EJD35042.1| hypothetical protein AURDEDRAFT_140435 [Auricularia delicata
TFB-10046 SS5]
Length = 447
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 307 PASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRR---CSVCGAVNYCSRACQA 363
P++ L FA R PG G+ C++ CG P+ H R+ C+ C V YC CQ
Sbjct: 324 PSTELLVFRFACRINVPGQGV--CANTACGTPQHSAHGTRKLFYCNRCRLVRYCGADCQK 381
Query: 364 LDW---KLRHKADCAPAERWL 381
DW ++ HK C + L
Sbjct: 382 ADWSGAEVPHKLVCVTLAKLL 402
>gi|386400239|ref|ZP_10085017.1| TPR repeat-containing protein [Bradyrhizobium sp. WSM1253]
gi|385740865|gb|EIG61061.1| TPR repeat-containing protein [Bradyrhizobium sp. WSM1253]
Length = 364
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 55/116 (47%), Gaps = 6/116 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A ++ K +DAG+ EA + L M+R ++ LMA AA +A Y+LA++
Sbjct: 141 ALQWYKRASDAGDREAMFALAMLRMSGRGGPVDKNEAVKLMASAAKLGEPKAAYNLALLY 200
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
+ G +D++ L +AA G +A L ++G GV +++ R L
Sbjct: 201 LD---GQTLPQDVKRSAELLRQAADAGLPEAQYALATFYKEGTGVPKDLERAVRLL 253
>gi|426198854|gb|EKV48779.1| hypothetical protein AGABI2DRAFT_191011 [Agaricus bisporus var.
bisporus H97]
Length = 559
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 27/50 (54%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E EF +C C YC + CQ+ W H+ C+
Sbjct: 381 GIRQCANMLCGRWEAYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS 430
>gi|409077524|gb|EKM77889.1| hypothetical protein AGABI1DRAFT_114791 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 559
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 27/50 (54%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E EF +C C YC + CQ+ W H+ C+
Sbjct: 381 GIRQCANMLCGRWEAYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS 430
>gi|420150362|ref|ZP_14657522.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 335 str. F0486]
gi|394752421|gb|EJF36123.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 335 str. F0486]
Length = 811
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NVEA LG ++ QN G + K+A +AQ Y+LA +NG+G
Sbjct: 684 WLAKAAEQNNVEALNELGSY-YFEKQNFGQALANFQKSAQRDYAQGQYNLANCYYNGNGI 742
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++ + L AR + A LGHC G G+ Q+ + + QA
Sbjct: 743 DRSYEKAANYYKLSARKDY---APAQFRLGHCYYHGEGIEQSDSRAADWFEQA 792
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 58/130 (44%), Gaps = 17/130 (13%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLA 175
+ +KL A+ G+V+A L F +G G + K A + A Y LA
Sbjct: 25 KLIKL-AEKGDVQAQTELADAYF-----KGKGVRRSYQEAVVWLEKVAETGDLNAQYQLA 78
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
FNG G K+ + GV + A G+ +A REL C +DG GV Q+ + R +
Sbjct: 79 QCYFNGKGVPKSP---QKGVEWLTKVADAGNPEAQRELALCYRDGKGVEQSKEKYYRLIE 135
Query: 236 QANARELAAV 245
+ +E + V
Sbjct: 136 KHAEKENSEV 145
>gi|296112632|ref|YP_003626570.1| hypothetical protein MCR_0405 [Moraxella catarrhalis RH4]
gi|295920326|gb|ADG60677.1| tetratricopeptide repeat family protein [Moraxella catarrhalis
BBH18]
Length = 302
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KAA HA+A Y+L V+ + G G + D V +AA G A LG +G
Sbjct: 209 KAAGQGHAEAQYNLGVMYYEGQGVRQ---DYHKAVEWFTKAANQGFAQAQNNLGVMYDEG 265
Query: 221 YGVRQNIAEGRRFLVQA 237
GVRQNIA + + QA
Sbjct: 266 QGVRQNIATAKIYYGQA 282
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 16/124 (12%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGA--------SLMAKAAISSHAQALYS 173
A + A+ G EA ++LG++ + G G KAA +A A ++
Sbjct: 131 AFEWFTKAANQGLAEAQFSLGVMY-----DEGQGVRQDYYKAVEWYTKAANQGYAGAQFN 185
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LA++ + G G ++D++ V +AA GH +A LG +G GVRQ+ + +
Sbjct: 186 LALMYYEGQGVRQDDQE---AVEWYTKAAGQGHAEAQYNLGVMYYEGQGVRQDYHKAVEW 242
Query: 234 LVQA 237
+A
Sbjct: 243 FTKA 246
>gi|54288360|gb|AAV31648.1| conserved hypothetical protein [uncultured alpha proteobacterium
EBAC2C11]
Length = 321
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 14/112 (12%)
Query: 136 EACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQF----NGSGGSKND 188
EA + L M+ ++ L Q + + L+ A H+ AL+ LA F NGSG
Sbjct: 106 EATFELAMMYWHGLGVKQCKATAVLLLDVAGKQDHSLALFKLAEYFFEVAENGSG----- 160
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
L+ G C R+A LG ++A LG ++G G QN + + +A R
Sbjct: 161 --LKKGFDSCHRSAELGLVEAQFNLGVMFENGLGTEQNFKKAATWYHRAATR 210
>gi|307102765|gb|EFN51033.1| hypothetical protein CHLNCDRAFT_59345 [Chlorella variabilis]
Length = 385
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 27/49 (55%), Gaps = 5/49 (10%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G+R+C+ GCG RRCS C AV YCS AC WK HK +C
Sbjct: 330 GVRVCAAEGCGNTSG----LRRCSGCRAVRYCSEACSHAHWKA-HKTEC 373
>gi|294656294|ref|XP_458552.2| DEHA2D01936p [Debaryomyces hansenii CBS767]
gi|199431357|emb|CAG86684.2| DEHA2D01936p [Debaryomyces hansenii CBS767]
Length = 821
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G+R C ++ CG+ E EF +C C YCSR CQ W H+ C P
Sbjct: 719 GVRQCGNLECGKWEKYPREFSKCRRCKRTKYCSRECQMRAWHC-HRNWCIP 768
>gi|397615120|gb|EJK63234.1| hypothetical protein THAOC_16120 [Thalassiosira oceanica]
Length = 528
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 22/41 (53%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
CGR F +C C AV+YC ++CQ WK HK +C
Sbjct: 439 CGREAKDDETFMKCGKCQAVHYCQKSCQVAAWKSGHKINCT 479
>gi|301111688|ref|XP_002904923.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095253|gb|EEY53305.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 453
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 59/129 (45%), Gaps = 15/129 (11%)
Query: 122 AHRFLKLCADAGNVEACYTL-GMIRFYCLQNRGSGA------SLMAKAAISSHAQALYSL 174
A+ + A+AGNV A + GM+ L+ G+ KA+ +H +AL L
Sbjct: 173 AYNYFNRAAEAGNVNAQSAVAGML----LKGEGTAQDNVTAIEWYEKASEKNHTRALNGL 228
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHI-DALRELGHCLQDGYGVRQNIAEGRRF 233
+ F+GSGG +K L + L RAAF D++ G+C G G NI+ F
Sbjct: 229 GFVHFHGSGGVLENKTL--ALELFERAAFNKEDGDSIFNAGYCHAKGLGTSVNISRAMEF 286
Query: 234 LVQANAREL 242
A ARE
Sbjct: 287 YHMA-AREF 294
>gi|375336756|ref|ZP_09778100.1| hypothetical protein SbacW_07215 [Succinivibrionaceae bacterium
WG-1]
Length = 387
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 75/175 (42%), Gaps = 13/175 (7%)
Query: 71 CKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCA 130
KL A C S I C+ N LA N + + I+ +++ E A L+ +
Sbjct: 1 MKLKLLALCLS----FTIMCQFNNALAENDDISEYTEQAIIHIENDDYQE-AKVLLEKAS 55
Query: 131 DAGNVEACYTLGMIRFYCLQ----NRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSK 186
+ G+ A Y LG + +Y + + L KAA ++ A+ L ++ GS K
Sbjct: 56 NLGDGLASYLLG-VMYYSGEVGNVDYQKAYQLFEKAASANVPNAITYLGIMHLEGSFVKK 114
Query: 187 NDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARE 241
D A +AA L DA+ LG+ G GV +N A+ + F +A + E
Sbjct: 115 ---DFTKAKAYFEKAANLEDEDAIFNLGYIYHLGLGVPKNYAKAKSFYEKAGSFE 166
>gi|328778894|ref|XP_397368.4| PREDICTED: egl nine homolog 1-like [Apis mellifera]
Length = 486
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 32/60 (53%), Gaps = 2/60 (3%)
Query: 316 FAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+R TPG +++ VGC R + RCS C AV YC++ Q DWK RHK CA
Sbjct: 5 IVSRAQTPGGDVQIGITVGCVVC-NRTDKLLRCSRCKAVFYCTKEHQKRDWK-RHKEFCA 62
>gi|146413975|ref|XP_001482958.1| hypothetical protein PGUG_04913 [Meyerozyma guilliermondii ATCC
6260]
Length = 736
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G+R C ++ CG+ E+ EF +C C YCSR CQ W H+ C P
Sbjct: 644 GVRQCGNLECGKWESYPREFLKCRRCKRTKYCSRECQMRAWHC-HRNWCIP 693
>gi|296806407|ref|XP_002844013.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845315|gb|EEQ34977.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 1201
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CS+ G +P + + RCS C YCS+ CQ DW+L+HK C
Sbjct: 1151 CSNCGGTKPGSGT-KLLRCSGCKVTKYCSKECQREDWRLKHKLTC 1194
>gi|393779651|ref|ZP_10367888.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 412 str. F0487]
gi|392609846|gb|EIW92644.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 412 str. F0487]
Length = 811
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NVEA LG ++ QN G + K+A +AQ Y+LA +NG+G
Sbjct: 684 WLAKAAEQNNVEALNELGSY-YFEKQNFGQALANFQKSAQRDYAQGQYNLANCYYNGNGI 742
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++ + L AR + A LGHC G G+ Q+ + + QA
Sbjct: 743 DRSYEKAANYYKLSARKDY---APAQFRLGHCYYHGEGIEQSDSRAADWFEQA 792
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 49/110 (44%), Gaps = 17/110 (15%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLA 175
+ +KL A+ G+V+A L F +G G + K A + A Y LA
Sbjct: 25 KLIKL-AEKGDVQAQTELADAYF-----KGKGVRRSYQEAVVWLEKVAETGDLNAQYQLA 78
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQ 225
FNG G K+ + GV + A G+ +A REL C +DG GV Q
Sbjct: 79 QCYFNGKGVPKSP---QKGVEWLTKVADAGNPEAQRELALCYRDGKGVEQ 125
>gi|386266002|ref|YP_005829494.1| Putative TPR repeat protein [Haemophilus influenzae R2846]
gi|309973238|gb|ADO96439.1| Putative TPR repeat protein [Haemophilus influenzae R2846]
Length = 348
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 70/148 (47%), Gaps = 12/148 (8%)
Query: 95 GLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY-----CL 149
G+A L L K +K ++ A ++ + A+ GN +A + LG++ Y
Sbjct: 80 GVADAQLNLGNMYAKGLGVKQDD--VEAVKWYRQAAEQGNAKAQFNLGLM--YDNGRGVK 135
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
Q+ KAA +A A ++L + +NG G ++D + V +AA G+ DA
Sbjct: 136 QDYFEAVKWFRKAAEQGYADAQFNLGNMYYNGHGVKQDDFE---AVKWYRKAAEQGYADA 192
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQA 237
LG+ +G+GV+Q+ E ++ +A
Sbjct: 193 QFNLGNMYYNGHGVKQDDFEAVKWYRKA 220
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 6/120 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ + A+ G +A + LG + + Q+ KAA HA+A Y+L +
Sbjct: 176 EAVKWYRKAAEQGYADAQFNLGNMYYNGHGVKQDDFEAVKWYRKAAEQGHAKAQYNLGNM 235
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
NG G + D V +AA G+ DA LG G+GVRQ+ E ++ +A
Sbjct: 236 YANGRGVKQ---DYFEAVKWYRKAAEQGYADAQANLGSAYSAGHGVRQDYIEAVKWFKKA 292
>gi|410074105|ref|XP_003954635.1| hypothetical protein KAFR_0A00620 [Kazachstania africana CBS 2517]
gi|372461217|emb|CCF55500.1| hypothetical protein KAFR_0A00620 [Kazachstania africana CBS 2517]
Length = 664
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G+R C++ CG+ E +F +C C YCSR CQ W+ H+ C
Sbjct: 483 GVRQCANFSCGKWEEYPRQFAKCRRCKRTKYCSRKCQLESWQF-HRYWC 530
>gi|422368691|ref|ZP_16449096.1| MORN repeat variant [Escherichia coli MS 16-3]
gi|315299548|gb|EFU58796.1| MORN repeat variant [Escherichia coli MS 16-3]
Length = 641
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 5/115 (4%)
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
+K A GN A +TLG F C N G + K+A + A+Y LA + G+G
Sbjct: 479 IKDNATKGNSSAQFTLGKYEFTCA-NIDEGIKWLTKSAEQKNTDAIYFLATA-YKGNGIP 536
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
N++ + +AA LG+ +A E+G+ G + QN+ + + +A A+
Sbjct: 537 ANNEKY---ITYLQQAAMLGNSNAQAEIGYLYLIGKELPQNLPDAGDWFKKAAAQ 588
>gi|312113403|ref|YP_004010999.1| Sel1 domain-containing protein repeat-containing protein
[Rhodomicrobium vannielii ATCC 17100]
gi|311218532|gb|ADP69900.1| Sel1 domain protein repeat-containing protein [Rhodomicrobium
vannielii ATCC 17100]
Length = 326
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 6/113 (5%)
Query: 125 FLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
F + A G+ A Y L + F Q+ G A + AA HAQA Y LA + +G
Sbjct: 137 FFEAAAQQGHTLAAYNLAQVYVEGFARPQDMGRAAYWLGIAAEKDHAQAAYDLATLLRHG 196
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
G K++ A+AA +G+++A E L +G GV ++ A + L
Sbjct: 197 DGVQKDEAK---AAQFMAKAADMGNVEAQVEYAIMLGNGKGVPKDEAGAVKLL 246
>gi|154302509|ref|XP_001551664.1| predicted protein [Botryotinia fuckeliana B05.10]
Length = 200
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 325 PGLRLCSHVGC----GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
P +CS+ C ++R+E + C+ C YCS+ CQ DWK RHK C
Sbjct: 128 PENMICSNFNCPLGSMNVNSKRNELKNCARCKCTWYCSKDCQTEDWKARHKRWC 181
>gi|392562474|gb|EIW55654.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 499
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 31/62 (50%), Gaps = 4/62 (6%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDV 389
CS PET +RC C AV YC +ACQ DW + HK +C+ + W + DV
Sbjct: 19 CSACAAPAPET---GLKRCPKCKAVYYCDKACQNRDW-VWHKHECSALQTWAASAPSPDV 74
Query: 390 DV 391
+
Sbjct: 75 MI 76
>gi|148258139|ref|YP_001242724.1| beta-lactamase [Bradyrhizobium sp. BTAi1]
gi|146410312|gb|ABQ38818.1| putative Beta-lactamase [Bradyrhizobium sp. BTAi1]
Length = 368
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 12/121 (9%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRG------SGASLMAKAAISSHAQALYS 173
E A + K A+AG+ EA + L M+R L RG L+A AA +A Y+
Sbjct: 142 EKAAIWYKRAAEAGDREAMFALAMMR---LAGRGGPVDKQEAVKLLASAAKLGEPKAAYN 198
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LA++ + G +DL+ L AA G+ +A L ++G GV ++ + R
Sbjct: 199 LALLYLD---GQTLPQDLKRSAELLRMAADAGNPEAQYALATFYKEGTGVPKDPEKATRL 255
Query: 234 L 234
L
Sbjct: 256 L 256
>gi|416155954|ref|ZP_11604247.1| tetratricopeptide repeat family protein [Moraxella catarrhalis
101P30B1]
gi|416220293|ref|ZP_11625385.1| tetratricopeptide repeat family protein [Moraxella catarrhalis
103P14B1]
gi|416249994|ref|ZP_11637003.1| tetratricopeptide repeat family protein [Moraxella catarrhalis
CO72]
gi|326566881|gb|EGE17020.1| tetratricopeptide repeat family protein [Moraxella catarrhalis
103P14B1]
gi|326575117|gb|EGE25045.1| tetratricopeptide repeat family protein [Moraxella catarrhalis
CO72]
gi|326576797|gb|EGE26704.1| tetratricopeptide repeat family protein [Moraxella catarrhalis
101P30B1]
Length = 302
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 49/112 (43%), Gaps = 6/112 (5%)
Query: 129 CADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
A+ G A + L ++ + Q+ KAA HA+A Y+L V+ + G G
Sbjct: 174 AANQGYAGAQFNLALMYYEGQGVRQDDQEAVEWYTKAAGQGHAEAQYNLGVMYYEGQGVR 233
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+ D V +AA G A LG +G GVRQNIA + + QA
Sbjct: 234 Q---DYHKAVEWFTKAANQGFAQAQNNLGVMYDEGQGVRQNIATAKIYYGQA 282
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 16/124 (12%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGA--------SLMAKAAISSHAQALYS 173
A + A+ G EA ++LG++ + G G KAA +A A ++
Sbjct: 131 AFEWFTKAANQGLAEAQFSLGVMY-----DEGQGVRQDYYKAVEWYTKAANQGYAGAQFN 185
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LA++ + G G ++D++ V +AA GH +A LG +G GVRQ+ + +
Sbjct: 186 LALMYYEGQGVRQDDQE---AVEWYTKAAGQGHAEAQYNLGVMYYEGQGVRQDYHKAVEW 242
Query: 234 LVQA 237
+A
Sbjct: 243 FTKA 246
>gi|449439561|ref|XP_004137554.1| PREDICTED: putative F-box protein At1g67623-like [Cucumis sativus]
Length = 260
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 27/123 (21%), Positives = 55/123 (44%), Gaps = 2/123 (1%)
Query: 57 LFDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKAN 116
+ +LP+DL+ +L K+++++ D + + K + + + S F
Sbjct: 33 VIKSLPNDLLTEVLAKVATSSYI--DLIQAKLATKHFLEASNDRYIFQHVSLGNFRNLLW 90
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
N S F++ C ++ N E+ Y GM+ F+ SG + + ++A + A Y V
Sbjct: 91 NNSPKFWSFMETCNNSENPESLYRKGMLEFFTHCKEASGMAYLKRSAQKGYVDACYVFGV 150
Query: 177 IQF 179
I +
Sbjct: 151 ILY 153
>gi|403052608|ref|ZP_10907092.1| hypothetical protein AberL1_13916 [Acinetobacter bereziniae LMG
1003]
Length = 282
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 60/116 (51%), Gaps = 7/116 (6%)
Query: 126 LKLCADAGNVEACYTLGMIR---FYCLQ-NRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+K A+ G+ EA Y +G++ + L+ N +A +A+A Y+LA++ +
Sbjct: 45 IKSKAEQGDPEAQYNMGVLFTDGYKDLKPNIIEAFKWYKLSADKGYAKAKYNLAIL-LSS 103
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G KND L AA LG I +L ELG+ +DG GV++N A+ + ++A
Sbjct: 104 DSGIKND--YAQAKKLFEDAAALGDIPSLNELGNFYKDGIGVQENYAQASEYYLKA 157
>gi|401827984|ref|XP_003888284.1| Sel1 repeat domain-containing protein [Encephalitozoon hellem ATCC
50504]
gi|392999556|gb|AFM99303.1| Sel1 repeat domain-containing protein [Encephalitozoon hellem ATCC
50504]
Length = 588
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 53/110 (48%), Gaps = 6/110 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
E + +K A+ N A TLG F +N +A ++ AL++L+
Sbjct: 404 EKSFSLMKRAAEMNNSSAQNTLGYYYEEGFGTSRNIKEAVRWYETSAKQDNSWALFNLSS 463
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+ FNG+ G ++K GV L +R+ LG+ A LG+C + G GV +N
Sbjct: 464 LYFNGAHGPPDEK---LGVKLLSRSRDLGNPRAANTLGYCFEKGIGVEKN 510
>gi|297799550|ref|XP_002867659.1| ubiquitin-specific protease 16 [Arabidopsis lyrata subsp. lyrata]
gi|297313495|gb|EFH43918.1| ubiquitin-specific protease 16 [Arabidopsis lyrata subsp. lyrata]
Length = 1007
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDL 393
RCS C +V YCS CQ + W+ HK +C PA D E D DL
Sbjct: 84 RCSRCKSVRYCSGKCQIIHWRQGHKDECRPASIVYDSEDEKS-DSDL 129
>gi|221235062|ref|YP_002517498.1| polar development protein podJ [Caulobacter crescentus NA1000]
gi|29429226|sp|Q9ZG88.2|PODJ_CAUCR RecName: Full=Localization factor PodJL; AltName: Full=Polar
organelle development protein; Contains: RecName:
Full=Localization factor PodJS
gi|251764811|sp|B8GXA0.1|PODJ_CAUCN RecName: Full=Localization factor PodJL; AltName: Full=Polar
organelle development protein; Contains: RecName:
Full=Localization factor PodJS
gi|24110907|tpg|DAA00313.1| TPA_exp: polar organelle development protein [Caulobacter
vibrioides]
gi|220964234|gb|ACL95590.1| polar development protein podJ [Caulobacter crescentus NA1000]
Length = 974
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 12/116 (10%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
+AA +A+++LA+ F G GG +N + +AA +G +D+ L + G
Sbjct: 787 RAANGGDPRAMHNLALYYFKGEGGPRNST---TAASWFRKAADMGLVDSQFNLAQLYESG 843
Query: 221 YGVRQNIAEGRRFLVQA------NARELAAVLSS---AACQGISTRSWLTWNPLSQ 267
GV QN AE ++ V A AR A L S A Q + RS L + P +Q
Sbjct: 844 LGVSQNPAEAYKWYVIAGRAGDSTARGRATALRSQLTAEAQQTADRSALAFRPQTQ 899
>gi|432731150|ref|ZP_19965989.1| hypothetical protein WGK_00981 [Escherichia coli KTE45]
gi|432758210|ref|ZP_19992733.1| hypothetical protein A1S1_00344 [Escherichia coli KTE46]
gi|431278554|gb|ELF69544.1| hypothetical protein WGK_00981 [Escherichia coli KTE45]
gi|431311996|gb|ELG00144.1| hypothetical protein A1S1_00344 [Escherichia coli KTE46]
Length = 717
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 5/115 (4%)
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
+K A GN A +TLG F C N G + K+A + A+Y LA + G+G
Sbjct: 555 IKDNATKGNSSAQFTLGKYEFTCA-NIDEGIKWLTKSAEQKNTDAIYFLATA-YKGNGIP 612
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
N++ + +AA LG+ +A E+G+ G + QN+ + + +A A+
Sbjct: 613 ANNEKY---ITYLQQAAMLGNSNAQAEIGYLYLIGKELPQNLPDAGVWFKKAAAQ 664
>gi|432464438|ref|ZP_19706546.1| hypothetical protein A15K_00375 [Escherichia coli KTE205]
gi|432582537|ref|ZP_19818947.1| hypothetical protein A1SM_01751 [Escherichia coli KTE57]
gi|433071475|ref|ZP_20258177.1| hypothetical protein WIS_00447 [Escherichia coli KTE129]
gi|433118975|ref|ZP_20304689.1| hypothetical protein WKC_00411 [Escherichia coli KTE157]
gi|433181959|ref|ZP_20366262.1| hypothetical protein WGO_00412 [Escherichia coli KTE85]
gi|430997189|gb|ELD13456.1| hypothetical protein A15K_00375 [Escherichia coli KTE205]
gi|431119553|gb|ELE22552.1| hypothetical protein A1SM_01751 [Escherichia coli KTE57]
gi|431593662|gb|ELI63954.1| hypothetical protein WIS_00447 [Escherichia coli KTE129]
gi|431649324|gb|ELJ16682.1| hypothetical protein WKC_00411 [Escherichia coli KTE157]
gi|431711855|gb|ELJ76162.1| hypothetical protein WGO_00412 [Escherichia coli KTE85]
Length = 717
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 5/115 (4%)
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
+K A GN A +TLG F C N G + K+A + A+Y LA + G+G
Sbjct: 555 IKDNATKGNSSAQFTLGKYEFTCA-NIDEGIKWLTKSAEQKNTDAIYFLATA-YKGNGIP 612
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
N++ + +AA LG+ +A E+G+ G + QN+ + + +A A+
Sbjct: 613 ANNEKY---ITYLQQAAMLGNSNAQAEIGYLYLIGKELPQNLPDAGVWFKKAAAQ 664
>gi|189502156|ref|YP_001957873.1| hypothetical protein Aasi_0764 [Candidatus Amoebophilus asiaticus
5a2]
gi|189497597|gb|ACE06144.1| hypothetical protein Aasi_0764 [Candidatus Amoebophilus asiaticus
5a2]
Length = 850
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 8/121 (6%)
Query: 122 AHRFLKLCADAGNVEACYTLGMI--RFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A R+ A + EA Y+LG++ R + LQ+ KAA HA+A L
Sbjct: 551 AKRYYLQKAAQEDAEAQYSLGIMYSRGFEGSVLQDFEEAREWYTKAARQGHAEAQRELGK 610
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ +G GG +KD + AA G ++A RE+G+ + YGV Q+ ++ +
Sbjct: 611 MYRSGLGG---NKDYAESLKWLKNAAKQGDVNAQREVGYMYEHAYGVEQHYTRALKWYKR 667
Query: 237 A 237
A
Sbjct: 668 A 668
>gi|169866977|ref|XP_001840071.1| hypothetical protein CC1G_12125 [Coprinopsis cinerea okayama7#130]
gi|116498849|gb|EAU81744.1| hypothetical protein CC1G_12125 [Coprinopsis cinerea okayama7#130]
Length = 574
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 322 TPGPG-LRLCSHVGC----GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
TP P L LC ++G R ++ R+CS C +V YCS ACQ DW+ HK +C
Sbjct: 365 TPYPKRLVLCQNIGHLDRGFRAPSQGEGVRQCSRCRSVVYCSEACQKQDWERLHKREC 422
>gi|449551139|gb|EMD42103.1| hypothetical protein CERSUDRAFT_79710 [Ceriporiopsis subvermispora
B]
Length = 173
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 324 GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
P C + C R +R E +C C ++ YCS CQ +DWK RHK C
Sbjct: 119 APDKIACGNYKCIRETLKRKELMKCGNCKSIFYCSAPCQKVDWK-RHKKYC 168
>gi|397574117|gb|EJK49043.1| hypothetical protein THAOC_32119, partial [Thalassiosira oceanica]
Length = 1069
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 3/82 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A + A+ A QA+ L ++G G + KD+R V L AA LG I+AL +LG+
Sbjct: 525 AKIQARVA-KKDPQAILFLGRKYYHGDDGLQ--KDMRKAVELYTEAAELGSIEALFDLGN 581
Query: 216 CLQDGYGVRQNIAEGRRFLVQA 237
+G GV+Q+ A+ F +A
Sbjct: 582 AYNEGDGVQQDKAKAVEFWTKA 603
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%)
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD+R V L AA LG IDAL LG+ +G GV+Q+ + +F +A
Sbjct: 934 KDMRRAVELWTEAAELGSIDALFSLGNAYDEGDGVQQDKPKAAQFWTKA 982
>gi|403420436|emb|CCM07136.1| predicted protein [Fibroporia radiculosa]
Length = 927
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 28/50 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E+ EF +C C YC + CQ+ W H+ C+
Sbjct: 646 GIRQCANMLCGRWESFPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS 695
>gi|290984414|ref|XP_002674922.1| predicted protein [Naegleria gruberi]
gi|284088515|gb|EFC42178.1| predicted protein [Naegleria gruberi]
Length = 504
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 58/128 (45%), Gaps = 15/128 (11%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS---LMAKAAI---SS 166
IK+N S + H +L+ A+ GN A Y LG CL G G L AK+ S+
Sbjct: 324 IKSNRISRAIH-WLEQSANQGNSIAQYQLG-----CLYQTGKGVKSDILTAKSWFEKASN 377
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+ AL L I NG G N K R L ++A GH + LG +G GV Q+
Sbjct: 378 NPDALNDLGCIYSNGLAGEINHKKARE---LFEKSANQGHALGQKNLGGLYLNGMGVEQD 434
Query: 227 IAEGRRFL 234
+ + +L
Sbjct: 435 YDKAKEWL 442
>gi|424844207|ref|ZP_18268818.1| TPR repeat-containing protein [Jonquetella anthropi DSM 22815]
gi|363985645|gb|EHM12475.1| TPR repeat-containing protein [Jonquetella anthropi DSM 22815]
Length = 829
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 60/125 (48%), Gaps = 10/125 (8%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSL 174
+ A + A +G+ EA LG R Y +N+ A + KAA + +Y++
Sbjct: 291 KKAFQLYTSAATSGDSEAYIDLG--RLYETGVGVEKNKAKAAEMYKKAASFGLPEGMYNM 348
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
I G G ++ D +AGV +AA G +A R+LG G V +N+A+ ++ L
Sbjct: 349 GRIAIIGKGVAQ---DRKAGVQWLEKAAAAGSTNAQRDLGVLCARGDWVPKNLAKAKKLL 405
Query: 235 VQANA 239
+A+A
Sbjct: 406 KEASA 410
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 51/116 (43%), Gaps = 2/116 (1%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
A K AD G++ + LG+++ R G L+++AA + A LA I NG
Sbjct: 223 AETLYKQAADEGSLPSLMALGILKMQK-DARAEGLRLLSQAAAGGYTPAKKFLAQIYENG 281
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
GG + + L AA G +A +LG + G GV +N A+ +A
Sbjct: 282 -GGMGFTPEPKKAFQLYTSAATSGDSEAYIDLGRLYETGVGVEKNKAKAAEMYKKA 336
>gi|307108778|gb|EFN57017.1| hypothetical protein CHLNCDRAFT_143686 [Chlorella variabilis]
Length = 303
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
E ++CS C AV YC R CQ W+ HK +CAPA
Sbjct: 267 ELKKCSRCRAVAYCCRECQMAHWRRGHKRECAPA 300
>gi|197117496|ref|YP_002137923.1| SEL1 repeat-containing lipoprotein [Geobacter bemidjiensis Bem]
gi|197086856|gb|ACH38127.1| SEL1 repeat-containing protein [Geobacter bemidjiensis Bem]
Length = 270
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 56/120 (46%), Gaps = 6/120 (5%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
R+ + A G+V A Y L ++ Q++ A KAA + A Y L ++
Sbjct: 140 RWYRRDALLGHVTAQYNLADAYYHGEEVPQDKREAAKWYLKAAEQGNVPAQYMLGLMLLQ 199
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
G G + GVA +AA GH +A ++G CL G GVR++ E +L +A A+
Sbjct: 200 GDGVQNASAE---GVAWLRKAASSGHREAQYQMGRCLLQGIGVRRDSEEAVLWLRKAAAQ 256
>gi|303237422|ref|ZP_07323988.1| Sel1 repeat protein [Prevotella disiens FB035-09AN]
gi|302482372|gb|EFL45401.1| Sel1 repeat protein [Prevotella disiens FB035-09AN]
Length = 429
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 46/78 (58%), Gaps = 3/78 (3%)
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
+ KAA +AQA+Y++AV NG G +++ L ++ +AA G DA+ ++G
Sbjct: 266 LHKAAEQENAQAMYNIAVSYENGWGVAQS---LENAISWYRKAALAGDADAMLQMGLAYA 322
Query: 219 DGYGVRQNIAEGRRFLVQ 236
DG GV+Q+ + ++L++
Sbjct: 323 DGRGVKQSWTDATQWLLK 340
>gi|299750436|ref|XP_001836755.2| hypothetical protein CC1G_04068 [Coprinopsis cinerea okayama7#130]
gi|298408902|gb|EAU84972.2| hypothetical protein CC1G_04068 [Coprinopsis cinerea okayama7#130]
Length = 1096
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 28/50 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E+ EF +C C YC + CQ+ W H+ C+
Sbjct: 843 GIRQCANMLCGRWESYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS 892
>gi|345020394|ref|ZP_08784007.1| hypothetical protein OTW25_03551 [Ornithinibacillus scapharcae
TW25]
Length = 333
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 61/128 (47%), Gaps = 9/128 (7%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQ 169
I+ ++W A + + A AGN +A Y LG F Q+ + +AA HA
Sbjct: 129 IEHDDW---AFEWFLVAAQAGNSDALYWLGNYYFVGTVVDQDLEKTYNCYKQAAEKGHAD 185
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A+ + A + G K+++ + L +AA LG +A+ LG+ Q+G G +++
Sbjct: 186 AMNNYADMYLRGEYVPKDEE---KALELFMKAAELGVPEAMYTLGYMYQNGVGTEKDLEV 242
Query: 230 GRRFLVQA 237
R + V++
Sbjct: 243 SREWFVKS 250
>gi|448522350|ref|XP_003868664.1| Mub1 protein [Candida orthopsilosis Co 90-125]
gi|380353004|emb|CCG25760.1| Mub1 protein [Candida orthopsilosis]
Length = 789
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G+R C ++ CG+ E EF +C C YCSR CQ W H+ C P
Sbjct: 681 GVRQCGNLDCGKWEKYPREFSKCRRCKRTKYCSRDCQMKAWHC-HRNWCIP 730
>gi|329120626|ref|ZP_08249289.1| Sel1 repeat superfamily protein [Neisseria bacilliformis ATCC
BAA-1200]
gi|327460850|gb|EGF07184.1| Sel1 repeat superfamily protein [Neisseria bacilliformis ATCC
BAA-1200]
Length = 227
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 9/120 (7%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIR-------FYCLQNRGSGASLMAKAAISSHAQALYS 173
A R+L A+ G++ + Y LG++ F + A AKAA HA A +
Sbjct: 100 EAARWLHKAAEGGDIGSQYYLGLLYLRGRDRDFDIKPDAAQAAKWFAKAAEQGHADAALT 159
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
LA I G+ G +D+ A +AA LG +A L G GV++++ E ++
Sbjct: 160 LASIYEEGADGLP--QDMAAAARWYRKAAELGKPNAAAVLAQMYAQGVGVKRDLREAEKW 217
>gi|395335123|gb|EJF67499.1| HCP-like protein [Dichomitus squalens LYAD-421 SS1]
Length = 502
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 39/82 (47%)
Query: 158 LMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCL 217
L+ + A H A Y LA NG G KN +D L AA GH DA G C
Sbjct: 177 LLKRLADRGHMPAQYFLADCYANGIGTHKNRQDFDRAFPLFVLAAKHGHPDAAYRAGTCC 236
Query: 218 QDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 237 ENGWGCRRESAKAVQFYKKAAA 258
>gi|358054559|dbj|GAA99485.1| hypothetical protein E5Q_06185 [Mixia osmundae IAM 14324]
Length = 903
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 28/50 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C++ CG+ E EF +C C YCS+ CQ+ W+ H+ C+
Sbjct: 669 GIRQCANTQCGKWEAYAREFAKCRRCRKAKYCSKECQSKAWQGGHRYWCS 718
>gi|300940986|ref|ZP_07155510.1| tetratricopeptide repeat protein [Escherichia coli MS 21-1]
gi|300454310|gb|EFK17803.1| tetratricopeptide repeat protein [Escherichia coli MS 21-1]
Length = 375
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 57/130 (43%), Gaps = 13/130 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 208 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGDGVSQNFQQAREWFEKAASQNQLNA 260
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GV Q+ +
Sbjct: 261 QYNLGQIYYYGQGVTQS---YRKAKEWFEKAAGEGHVDAQYNLGVIYENGEGVGQDFHQA 317
Query: 231 RRFLVQANAR 240
R + +A AR
Sbjct: 318 RAWYEKAAAR 327
>gi|212533065|ref|XP_002146689.1| MYND domain protein (SamB), putative [Talaromyces marneffei ATCC
18224]
gi|210072053|gb|EEA26142.1| MYND domain protein (SamB), putative [Talaromyces marneffei ATCC
18224]
Length = 388
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C+H CG+ E + +F +C C YCS+ CQ W
Sbjct: 337 GIRQCAHFECGKWEDYQRQFAKCRRCRRTKYCSKECQKAAW 377
>gi|449461409|ref|XP_004148434.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 16-like [Cucumis
sativus]
Length = 1113
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 40/93 (43%), Gaps = 9/93 (9%)
Query: 313 AEWFAARGGTPGPGLRLCSHVG-CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK 371
E A+ G P P + L H C P T R C+ C AV YCS CQ + W+ HK
Sbjct: 55 VELQASFGCIPAP-ISLNHHCALCLSPTTTR-----CARCKAVRYCSGRCQIIHWRQGHK 108
Query: 372 ADCAPAER--WLDEEGNGDVDVDLNDAEGGGED 402
+C P + W E+ N D N G D
Sbjct: 109 NECQPPKTLDWSLEKENDPKDNLENQNHSGSYD 141
>gi|402224519|gb|EJU04581.1| hypothetical protein DACRYDRAFT_98489 [Dacryopinax sp. DJM-731 SS1]
Length = 860
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 26/49 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G+R C+++ CGR E EF +C C YC + CQ+ W H+ C
Sbjct: 693 GIRQCANMSCGRWEEFPREFAKCRRCRKAKYCGKECQSRAWAEGHRFWC 741
>gi|393229558|gb|EJD37179.1| hypothetical protein AURDEDRAFT_173814 [Auricularia delicata
TFB-10046 SS5]
Length = 206
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 25/50 (50%), Gaps = 5/50 (10%)
Query: 330 CSHVGCGRPETRR----HEFRRCSVCGAVNYCSRACQALDWK-LRHKADC 374
C GCG E F RC +C V YCS+ACQ DWK RHK C
Sbjct: 91 CLRPGCGETELSEVHGVRRFARCELCRIVRYCSKACQRADWKRARHKDMC 140
>gi|389742016|gb|EIM83203.1| HCP-like protein [Stereum hirsutum FP-91666 SS1]
Length = 430
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 40/83 (48%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A H + Y LA NG G KN +D L AA GH DA G C
Sbjct: 105 SLLKRNADRGHMPSQYFLADCYANGIGTHKNKQDFDRAFPLFILAAKHGHPDASYRAGTC 164
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 165 CENGWGCRRESAKAVQFYRKAAA 187
>gi|397620868|gb|EJK65964.1| hypothetical protein THAOC_13135 [Thalassiosira oceanica]
Length = 401
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 1/76 (1%)
Query: 102 VLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAK 161
VL++ + F I + + SESA R L A NV+A Y L +I YC +N G +L+
Sbjct: 214 VLARPDIRGFII-STDTSESAARRLLPFVLARNVQATYMLSIILCYCHENLKEGLALLRL 272
Query: 162 AAISSHAQALYSLAVI 177
AA +H ++ ++L+++
Sbjct: 273 AASQNHLESSFTLSLL 288
>gi|397618594|gb|EJK64965.1| hypothetical protein THAOC_14241, partial [Thalassiosira oceanica]
Length = 417
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 53/110 (48%), Gaps = 2/110 (1%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
+N G S++ + +A+Y L + F G G + +D++ V L A AA LG I A
Sbjct: 247 ENAGDMISMIQARVLKKDPEAIYYLGLKYFFGELGLQ--RDIQRAVELWAEAAELGSIQA 304
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGISTRSW 259
L LG G GV+Q++ + +F +A + A ++ C + ++
Sbjct: 305 LYNLGLAYDLGDGVQQDMMKAFQFYRRAAMQGHALSRNNLGCIELKKENY 354
>gi|255085856|ref|XP_002505359.1| predicted protein [Micromonas sp. RCC299]
gi|226520628|gb|ACO66617.1| predicted protein [Micromonas sp. RCC299]
Length = 435
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 21/36 (58%), Gaps = 2/36 (5%)
Query: 343 HEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
HEF CS C YCSR CQ WK RHK +C P E
Sbjct: 313 HEF--CSKCHLACYCSRDCQKQHWKWRHKRECRPKE 346
>gi|242766081|ref|XP_002341102.1| hypothetical protein TSTA_071180 [Talaromyces stipitatus ATCC
10500]
gi|218724298|gb|EED23715.1| hypothetical protein TSTA_071180 [Talaromyces stipitatus ATCC
10500]
Length = 288
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 30/60 (50%), Gaps = 2/60 (3%)
Query: 318 ARGGTPGPGLRLCSHVGCGRPETRRHE--FRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
AR + R C+H P R E +RC+ C + YCSRACQ DWK H+ C+
Sbjct: 30 ARRKSTSKSARTCTHCKKREPLNPRAEQKLKRCARCRSALYCSRACQRADWKRGHQRACS 89
>gi|157864173|ref|XP_001680799.1| MYND finger domain-like protein [Leishmania major strain Friedlin]
gi|68124091|emb|CAJ02073.1| MYND finger domain-like protein [Leishmania major strain Friedlin]
Length = 167
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 33/79 (41%), Gaps = 1/79 (1%)
Query: 298 CNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYC 357
CN EA SR+L + A P C H G +RCS C YC
Sbjct: 89 CNSANCEAVAKSRYLMDMEDATTAQGMPSPNGCFHCHKGARGAATVPLQRCSRCKVAKYC 148
Query: 358 SRACQALDWKLRHKADCAP 376
S CQ DWK+ HK CAP
Sbjct: 149 SVECQKADWKV-HKQVCAP 166
>gi|384494170|gb|EIE84661.1| hypothetical protein RO3G_09371 [Rhizopus delemar RA 99-880]
Length = 432
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 28/50 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ACQ+ W H+ C
Sbjct: 254 GVRQCANMQCGKWERVPREFAKCRRCRKAKYCSKACQSKAWADGHRWWCV 303
>gi|354547905|emb|CCE44640.1| hypothetical protein CPAR2_404440 [Candida parapsilosis]
Length = 783
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G+R C ++ CG+ E EF +C C YCSR CQ W H+ C P
Sbjct: 682 GVRQCGNLDCGKWEKYPREFSKCRRCKRTKYCSRDCQMKAWHC-HRNWCIP 731
>gi|444323345|ref|XP_004182313.1| hypothetical protein TBLA_0I01340 [Tetrapisispora blattae CBS 6284]
gi|387515360|emb|CCH62794.1| hypothetical protein TBLA_0I01340 [Tetrapisispora blattae CBS 6284]
Length = 795
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 1/51 (1%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
+R C ++ CG E+ EF +C C YCSR CQ WK H+ C A
Sbjct: 577 IRQCGNLFCGEWESYPREFSKCRRCKRTKYCSRECQIKAWK-NHRHWCQEA 626
>gi|333367586|ref|ZP_08459838.1| TPR repeat protein [Psychrobacter sp. 1501(2011)]
gi|332978563|gb|EGK15270.1| TPR repeat protein [Psychrobacter sp. 1501(2011)]
Length = 310
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 68/147 (46%), Gaps = 9/147 (6%)
Query: 96 LALNSLVLSKASKKTFAIK--ANNWSESAHRFLKLCADAGNVEACYTLGMIRF---YCLQ 150
L +N + +KAS ++ A W++ H+ LK A+ G+ EA + LG+I Q
Sbjct: 2 LTINPKIFTKASLFLLSLLLSATLWAQDIHQ-LKTLANKGDSEAQFELGVIYDNGDQLPQ 60
Query: 151 NRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDAL 210
+ A K+A H A Y++ + G G ++ D V + AA G I+A
Sbjct: 61 DLKKAAYWYTKSAQQGHVDAQYNIGDMYRTGDGVTQ---DYVQAVQWLSNAADQGSIEAQ 117
Query: 211 RELGHCLQDGYGVRQNIAEGRRFLVQA 237
+LG+ +G GV Q+ + L Q+
Sbjct: 118 NDLGYLYMEGIGVPQDYRKAFELLSQS 144
>gi|307111685|gb|EFN59919.1| hypothetical protein CHLNCDRAFT_132967 [Chlorella variabilis]
Length = 981
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 26/48 (54%), Gaps = 4/48 (8%)
Query: 330 CSHVGCGRPE-TRRHEFR--RCSVCGAVNYCSRACQALDWKLRHKADC 374
C+HVGC T E R RC CGAV YC AC DW+ HKA C
Sbjct: 917 CAHVGCTTVSGTSEAEIRGKRCGGCGAVRYCGTACSRADWR-AHKAVC 963
>gi|189502556|ref|YP_001958273.1| hypothetical protein Aasi_1217 [Candidatus Amoebophilus asiaticus
5a2]
gi|189497997|gb|ACE06544.1| hypothetical protein Aasi_1217 [Candidatus Amoebophilus asiaticus
5a2]
Length = 1402
Score = 42.7 bits (99), Expect = 0.29, Method: Composition-based stats.
Identities = 38/141 (26%), Positives = 61/141 (43%), Gaps = 13/141 (9%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
KA W + A A+ G A Y+LG + + +Q+ A KAA +A A
Sbjct: 796 KAVEWFQKA-------AEQGLASAQYSLGFMYYNGYGVVQDDAKAAEWFQKAAGQGNASA 848
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L + G G +++DK V +AA G +DA LG +G+GV Q+ +
Sbjct: 849 QYNLGRMYREGRGVAQDDKK---AVEWYGKAAEQGQVDAQNSLGAMYYNGHGVAQDDRKA 905
Query: 231 RRFLVQANARELAAVLSSAAC 251
+ +A + +S C
Sbjct: 906 VEWFQKAAEKGNVLAQNSLGC 926
Score = 42.0 bits (97), Expect = 0.53, Method: Composition-based stats.
Identities = 42/139 (30%), Positives = 62/139 (44%), Gaps = 20/139 (14%)
Query: 103 LSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY----CLQNRGSGASL 158
L K KK F W + A A+ G+V A Y LG +R+ ++ A
Sbjct: 646 LEKDDKKAF-----EWYQKA-------AEQGHVTAQYNLG-VRYGNGRGVAKDERKAAEW 692
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
KAA +A A Y+L + +G G +KD V +AA GHI+A LG +
Sbjct: 693 FQKAAGQGNASAQYNLGRMYDDGEG---LEKDHAKAVVWYTKAAEQGHINAQYNLGISYE 749
Query: 219 DGYGVRQNIAEGRRFLVQA 237
DG GV ++ + R + +A
Sbjct: 750 DGEGVEKDDNKAREWYQKA 768
Score = 39.3 bits (90), Expect = 4.0, Method: Composition-based stats.
Identities = 35/116 (30%), Positives = 49/116 (42%), Gaps = 13/116 (11%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
KA W + A AD G+ +A Y LG+I Q+ KAA A A
Sbjct: 760 KAREWYQKA-------ADQGDTDAQYKLGIIYRNGRDVAQDDRKAVEWFQKAAEQGLASA 812
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
YSL + +NG G ++D +AA G+ A LG ++G GV Q+
Sbjct: 813 QYSLGFMYYNGYGVVQDDAK---AAEWFQKAAGQGNASAQYNLGRMYREGRGVAQD 865
>gi|403376502|gb|EJY88230.1| TPR repeat protein [Oxytricha trifallax]
Length = 486
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 24/45 (53%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
C PE+ + + CS C YCS CQ DW+ +HK +C ++
Sbjct: 438 CSAPESDLRKHKLCSACKQAFYCSPDCQKYDWQKKHKLECKELQK 482
>gi|237748064|ref|ZP_04578544.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
gi|229379426|gb|EEO29517.1| Sel1 repeat-containing protein [Oxalobacter formigenes OXCC13]
Length = 1181
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 57/145 (39%), Gaps = 17/145 (11%)
Query: 85 NVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMI 144
NV I +M L ++ K KK A + L G +A Y L ++
Sbjct: 690 NVSIAQNKMGYLYETGHIVPKDMKK------------AIEWYTLAEQNGFTDAAYHLALL 737
Query: 145 RFYC---LQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARA 201
LQN + KAA +++ ALY L F+G + KD + RA
Sbjct: 738 YLASSPPLQNDPLALRYLEKAASANNTNALYKLGTFYFHGQYSA--TKDRKKAAEYFRRA 795
Query: 202 AFLGHIDALRELGHCLQDGYGVRQN 226
A LGH ++ LQ G GV +N
Sbjct: 796 AKLGHKNSQIAYADILQKGKGVEKN 820
>gi|393227836|gb|EJD35499.1| hypothetical protein AURDEDRAFT_117365 [Auricularia delicata
TFB-10046 SS5]
Length = 210
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 25/55 (45%), Positives = 29/55 (52%), Gaps = 6/55 (10%)
Query: 326 GLRLCSHVGCGRP---ETRRHEFRRCSVCGAVNYCSRACQALDWK---LRHKADC 374
G R C GCG+ +T F RCS C A YCSR CQ DW+ + HKA C
Sbjct: 70 GNRGCDGPGCGKTVHDKTTPGGFPRCSTCRAAQYCSRVCQDADWRRGAVPHKAVC 124
>gi|452985978|gb|EME85734.1| hypothetical protein MYCFIDRAFT_161381 [Pseudocercospora fijiensis
CIRAD86]
Length = 643
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
G+R C++ CG+ E +F +C C YCS+ CQ W H+ C PAE
Sbjct: 566 GIRQCAYWQCGKWEEYTRQFAKCRRCRRTKYCSKECQKGAWG-SHRFWCVPAE 617
>gi|393229543|gb|EJD37164.1| hypothetical protein AURDEDRAFT_173799 [Auricularia delicata
TFB-10046 SS5]
Length = 454
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 30/70 (42%), Gaps = 8/70 (11%)
Query: 301 PAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGC---GRPETRRHEFRRCSVCGAVNYC 357
PAP+ P SR +F + CS+ C G ET F RC C V YC
Sbjct: 327 PAPQT-PTSRLFHNFFHSTRSK----YSFCSNRACAAAGVTETHVQPFSRCQQCSLVRYC 381
Query: 358 SRACQALDWK 367
R CQ DWK
Sbjct: 382 GRECQKQDWK 391
>gi|449511450|ref|XP_004163959.1| PREDICTED: LOW QUALITY PROTEIN: ubiquitin carboxyl-terminal
hydrolase 16-like, partial [Cucumis sativus]
Length = 1115
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 40/93 (43%), Gaps = 9/93 (9%)
Query: 313 AEWFAARGGTPGPGLRLCSHVG-CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK 371
E A+ G P P + L H C P T R C+ C AV YCS CQ + W+ HK
Sbjct: 57 VELQASFGCIPAP-ISLNHHCALCLSPTTTR-----CARCKAVRYCSGRCQIIHWRQGHK 110
Query: 372 ADCAPAER--WLDEEGNGDVDVDLNDAEGGGED 402
+C P + W E+ N D N G D
Sbjct: 111 NECQPPKTLDWSLEKENDPKDNLENQNHSGSYD 143
>gi|356960360|ref|ZP_09063342.1| TPR repeat-containing SEL1 subfamily protein [gamma proteobacterium
SCGC AAA001-B15]
Length = 198
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 58/121 (47%), Gaps = 6/121 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
E A + +L A G+++A +G++ + QN G + KAA SH +A Y+L
Sbjct: 37 EKALKAFELAAKTGDLDALTAVGIMYIGGWGIEQNDAKGLEYILKAANQSHPKAQYTLGA 96
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ + G G + + + + L A +L DA L ++G GV++N+ + + +
Sbjct: 97 LYYLGIGVPLDFEKAFSWINLSANQDYL---DAQHNLAEMYENGKGVKKNLEKAYEYYLL 153
Query: 237 A 237
A
Sbjct: 154 A 154
>gi|384499491|gb|EIE89982.1| hypothetical protein RO3G_14693 [Rhizopus delemar RA 99-880]
Length = 470
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 28/50 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YCS+ACQ+ W H+ C
Sbjct: 289 GVRQCANMQCGKWERVPREFAKCRRCRKAKYCSKACQSKAWADGHRWWCV 338
>gi|225024092|ref|ZP_03713284.1| hypothetical protein EIKCOROL_00960 [Eikenella corrodens ATCC
23834]
gi|224943117|gb|EEG24326.1| hypothetical protein EIKCOROL_00960 [Eikenella corrodens ATCC
23834]
Length = 578
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 54/125 (43%), Gaps = 6/125 (4%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
AH + + A + EA +LG++ + Q+ KAA +HA Y L ++
Sbjct: 334 QAHYWYEKAAAQNHPEAQNSLGIMYYAGHGVPQDYAQARMWFEKAAAQNHADGQYYLGLL 393
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
NG G ++ R +AA H DA LG +G GV QN + R + +A
Sbjct: 394 YDNGHGVPQDYTQARM---WFEKAAAQNHPDAQNNLGAMYYEGQGVTQNYTQARIWFEKA 450
Query: 238 NAREL 242
A+ L
Sbjct: 451 AAQNL 455
>gi|21355925|ref|NP_648406.1| CG8003, isoform A [Drosophila melanogaster]
gi|442631568|ref|NP_001261683.1| CG8003, isoform B [Drosophila melanogaster]
gi|7294808|gb|AAF50142.1| CG8003, isoform A [Drosophila melanogaster]
gi|16769502|gb|AAL28970.1| LD35289p [Drosophila melanogaster]
gi|220946846|gb|ACL85966.1| CG8003-PA [synthetic construct]
gi|220956438|gb|ACL90762.1| CG8003-PA [synthetic construct]
gi|440215604|gb|AGB94377.1| CG8003, isoform B [Drosophila melanogaster]
Length = 407
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 35/92 (38%), Gaps = 20/92 (21%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFL------AEWFAARGGTPGPGLRLCSHVGCGRPETRRH 343
C + A +A PA L W G CS G +P+
Sbjct: 294 CTIFRQIATQFGAKDAPPALTVLRNAINGMRWITDEGN-------YCSTCGAEKPD---- 342
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
++CS C AV YC R CQ L W HK +CA
Sbjct: 343 --KKCSKCKAVQYCDRECQRLHW-FMHKKNCA 371
>gi|397661633|ref|YP_006502333.1| hypothetical protein KUI_0644 [Taylorella equigenitalis ATCC 35865]
gi|394349812|gb|AFN35726.1| hypothetical protein KUI_0644 [Taylorella equigenitalis ATCC 35865]
gi|399115026|emb|CCG17823.1| hypothetical protein KUK_0513 [Taylorella equigenitalis 14/56]
Length = 215
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 57/134 (42%), Gaps = 6/134 (4%)
Query: 107 SKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAA 163
S+K + E A +LK AD G V+A +G++ L +N K+A
Sbjct: 33 SQKGTELYKQQKYEEAFEYLKPAADQGQVQAQVAVGVMYLKGLGVPKNYDEATKYFEKSA 92
Query: 164 ISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGV 223
+ +AQ +Y++ + + G G DL AA LG DA + G G G
Sbjct: 93 KAGNAQGIYNIGTMYYKGEG---VKLDLNKAADYFEAAAKLGLRDAQHDYGAMYLFGKGR 149
Query: 224 RQNIAEGRRFLVQA 237
+NI E ++ +A
Sbjct: 150 PENIKEAASWMQKA 163
>gi|51596307|ref|YP_070498.1| hypothetical protein YPTB1981 [Yersinia pseudotuberculosis IP
32953]
gi|186895347|ref|YP_001872459.1| Sel1 domain-containing protein [Yersinia pseudotuberculosis PB1/+]
gi|51589589|emb|CAH21219.1| putative exported protein [Yersinia pseudotuberculosis IP 32953]
gi|186698373|gb|ACC89002.1| Sel1 domain protein repeat-containing protein [Yersinia
pseudotuberculosis PB1/+]
Length = 720
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 137 ACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVA 196
A Y LG + C Q+ G + K+A + +AL+ LA + +N G K D+
Sbjct: 569 AQYELGKYHYTC-QDYDRGLKWLEKSADHKNIKALFLLAQM-YNEGDGVKEDQTQYFSYL 626
Query: 197 LCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
L +AA LG DA E+G+ G GV +N+ E ++ ++A
Sbjct: 627 L--KAAQLGLSDAQVEIGYLYLVGEGVEKNLPEAYQWHIKA 665
>gi|423113617|ref|ZP_17101308.1| hypothetical protein HMPREF9689_01365 [Klebsiella oxytoca 10-5245]
gi|376387888|gb|EHT00590.1| hypothetical protein HMPREF9689_01365 [Klebsiella oxytoca 10-5245]
Length = 375
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 53/121 (43%), Gaps = 13/121 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 208 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGEGVNQNFQQTREWFEKAASQNQLNA 260
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GVRQ+ +
Sbjct: 261 QYNLGQIYYYGQGVTQS---YRKAKEWFEKAAGEGHVDAQYNLGVIYENGEGVRQDFHQA 317
Query: 231 R 231
R
Sbjct: 318 R 318
>gi|238591206|ref|XP_002392540.1| hypothetical protein MPER_07865 [Moniliophthora perniciosa FA553]
gi|215458738|gb|EEB93470.1| hypothetical protein MPER_07865 [Moniliophthora perniciosa FA553]
Length = 303
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 7/63 (11%)
Query: 327 LRLCSHVGC------GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
+R+C + GC ETR+ + +CS C +V+YCSR C W H+ C+ +
Sbjct: 143 IRMCGYEGCPNERIPADEETRKIRYLKCSACSSVSYCSRECGKKGWP-NHREQCSSNRQL 201
Query: 381 LDE 383
L E
Sbjct: 202 LKE 204
>gi|350405143|ref|XP_003487339.1| PREDICTED: hypothetical protein LOC100742335 [Bombus impatiens]
Length = 1663
Score = 42.7 bits (99), Expect = 0.32, Method: Composition-based stats.
Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 6/45 (13%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
CG+P T C +C YCS+ C L W+ H DC+P ER
Sbjct: 1621 CGKPSTVA-----CIICLEAKYCSKECSQLHWR-DHYKDCSPVER 1659
>gi|238880968|gb|EEQ44606.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 751
Score = 42.7 bits (99), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 56/133 (42%), Gaps = 18/133 (13%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYC--------LQNRGSGASLMAKAA---ISSH 167
S A FLK+ A + A Y LG+ FY + + G + +AA
Sbjct: 494 SRKAVEFLKIAASRNHPAAMYKLGVYSFYGRMGLPANDMNTKKMGIKWLTRAANVATELT 553
Query: 168 AQALYSLAVIQFNGSGGSKNDK---DLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
A A Y L + +NG D D + G+ L A+AA LGH+ + LGH + G V
Sbjct: 554 AAAPYELGKLYYNGF----EDIVLIDKKYGLELFAQAAALGHLQSAAILGHHYEIGEIVP 609
Query: 225 QNIAEGRRFLVQA 237
Q+ + QA
Sbjct: 610 QDSNLSIHYYTQA 622
>gi|424741229|ref|ZP_18169590.1| Sel1 repeat protein [Acinetobacter baumannii WC-141]
gi|422945162|gb|EKU40133.1| Sel1 repeat protein [Acinetobacter baumannii WC-141]
Length = 230
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 77/180 (42%), Gaps = 19/180 (10%)
Query: 66 VVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRF 125
+ +++ L S+A ++ + +A L + S + +K N+ E A ++
Sbjct: 10 IFTVIFSLHSSAFAEPSVADIQYRAEHGQSVAQYHLGMMLLSGEQGVVK--NY-EQAFKW 66
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHAQALYSLAVI 177
L G++ A Y+LGM+ + G+G KAA H +A Y+L V+
Sbjct: 67 LTAADQNGSMGAKYSLGMMYY-----TGTGVEKDAKRAFDYFTKAAAKDHVKAQYNLGVL 121
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G + +D +RAA G+ A L H + G+GV Q+ + ++ +A
Sbjct: 122 YDRGEGTA---QDYGKAFEWFSRAAKQGYPPAEYNLAHLYKKGHGVSQSNEQALKWYTKA 178
>gi|419699342|ref|ZP_14226958.1| hypothetical protein OQA_02258 [Escherichia coli SCI-07]
gi|422378666|ref|ZP_16458873.1| MORN repeat variant [Escherichia coli MS 57-2]
gi|324010056|gb|EGB79275.1| MORN repeat variant [Escherichia coli MS 57-2]
gi|380349357|gb|EIA37629.1| hypothetical protein OQA_02258 [Escherichia coli SCI-07]
Length = 641
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 5/115 (4%)
Query: 126 LKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
+K A GN A +TLG F C N G + K+A + A+Y LA + G+G
Sbjct: 479 IKDNATKGNSSAQFTLGKYEFTCA-NIDEGIKWLTKSAEQKNTDAIYFLATA-YKGNGIP 536
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
N++ + +AA LG+ +A E+G+ G + QN+ + + +A A+
Sbjct: 537 ANNEKY---ITYLQQAAMLGNSNAQAEIGYLYLIGKELPQNLPDAGVWFKKAAAQ 588
>gi|397642568|gb|EJK75315.1| hypothetical protein THAOC_02962 [Thalassiosira oceanica]
Length = 506
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 32/66 (48%), Gaps = 9/66 (13%)
Query: 312 LAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK 371
L EW A G LC++ C + +F++CS C A YCS+ CQ W+ HK
Sbjct: 438 LKEWETA-------GRSLCAY--CAKKAETGEKFKQCSKCKAQWYCSKECQVEAWRAGHK 488
Query: 372 ADCAPA 377
DC A
Sbjct: 489 KDCKRA 494
>gi|397628008|gb|EJK68704.1| hypothetical protein THAOC_10093 [Thalassiosira oceanica]
Length = 355
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A +M++ A H QA+ L +G G + KD+R GV L AA LG I+AL LG+
Sbjct: 182 AMIMSRVA-KKHPQAINHLGHKYCHGRLGLQ--KDMRKGVELYTEAAELGSIEALCHLGY 238
Query: 216 CLQDGYGVRQNIAEGRRFLVQANAR 240
G GV Q+ A+G F +A R
Sbjct: 239 AHVTGCGVEQDKAKGIHFWRKAAMR 263
>gi|344925132|ref|ZP_08778593.1| Sel1 domain-containing protein [Candidatus Odyssella
thessalonicensis L13]
Length = 616
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 6/101 (5%)
Query: 138 CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVAL 197
CYT G +N+ L +AA H +A Y++ G G +K DL A
Sbjct: 218 CYTKGTG---TRKNKNQAFQLYREAAEQGHLEAQYTVGRCYAEGRGVAK---DLNQAFAW 271
Query: 198 CARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
+AA G +A ++G C +DGYGV +++ E R + A+
Sbjct: 272 FKKAALSGSEEARYKVGCCYRDGYGVNKDVIEAVRHFLPAD 312
Score = 38.5 bits (88), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 4/111 (3%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLA 175
++ A R+ AD G+ +A Y LG + + G L+ KAA ++ A +L
Sbjct: 119 AKEAIRWFTKAADQGHAKALYELGGYYYSGESLEYDAEKGFKLIEKAAQKNYVVAQITLG 178
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
G G D DL+ V +AA G DA +L C G G R+N
Sbjct: 179 DYYLLG-GSDYVDIDLKKAVFWYKKAAAQGDCDAQFKLAECYTKGTGTRKN 228
>gi|291414590|ref|XP_002723542.1| PREDICTED: programmed cell death 2 [Oryctolagus cuniculus]
Length = 344
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 26/50 (52%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P + CS C +YCSR QALDWK+ HK CA
Sbjct: 131 GAHLCRVCGCSGP-------KACSRCHKAHYCSREHQALDWKMGHKQVCA 173
>gi|299746830|ref|XP_001839517.2| hypothetical protein CC1G_08896 [Coprinopsis cinerea okayama7#130]
gi|298407242|gb|EAU82284.2| hypothetical protein CC1G_08896 [Coprinopsis cinerea okayama7#130]
Length = 743
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 19/27 (70%)
Query: 348 CSVCGAVNYCSRACQALDWKLRHKADC 374
CS C V+YCSR CQ DWK RH+ +C
Sbjct: 452 CSSCHTVSYCSRGCQKHDWKARHRDEC 478
>gi|424920034|ref|ZP_18343397.1| hypothetical protein Rleg9DRAFT_0305 [Rhizobium leguminosarum bv.
trifolii WSM597]
gi|392849049|gb|EJB01571.1| hypothetical protein Rleg9DRAFT_0305 [Rhizobium leguminosarum bv.
trifolii WSM597]
Length = 860
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 56/124 (45%), Gaps = 10/124 (8%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY----CLQNRGSGASLMAKA-AISSHAQ 169
AN +++A +K DAG+V A Y L FY + + AS +AKA A
Sbjct: 556 ANRDTKTAFATIKKAMDAGHVRAIYELS--SFYEFGASVPRDAAKASEIAKAGAAKGDPF 613
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
AL S + G G D + G+ L +AA LGH A+ ELG+ +G V +
Sbjct: 614 ALLSYGKSLYYGRGAKA---DQQVGLRLMLQAADLGHTYAMNELGYIFLNGVNVPADPER 670
Query: 230 GRRF 233
G F
Sbjct: 671 GIHF 674
>gi|335041823|ref|ZP_08534850.1| sel1 domain protein repeat-containing protein [Methylophaga
aminisulfidivorans MP]
gi|333788437|gb|EGL54319.1| sel1 domain protein repeat-containing protein [Methylophaga
aminisulfidivorans MP]
Length = 356
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 8/112 (7%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSL 174
+SA +L + A G+ EA Y LGM+ + +++ L K+A A A +++
Sbjct: 76 DSAMSWLDIAAQKGHDEAAYALGMLYYTGNEPDVKRDQKKAFELFKKSAERGSAVAQFNV 135
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+ NG G +KDL+A +A GH A LG G GV+Q+
Sbjct: 136 GAMLMNGQG---VEKDLKAAADWFEKAGEQGHSQAQFNLGLLYLSGSGVKQD 184
>gi|297794145|ref|XP_002864957.1| hypothetical protein ARALYDRAFT_332761 [Arabidopsis lyrata subsp.
lyrata]
gi|297310792|gb|EFH41216.1| hypothetical protein ARALYDRAFT_332761 [Arabidopsis lyrata subsp.
lyrata]
Length = 731
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGED 402
RCS C +V YCS CQ L W+ HK +C + DEE V D +DA+ D
Sbjct: 63 RCSQCKSVRYCSSKCQILHWRRGHKEECRSPDS--DEEKEESVKSD-DDAKESNMD 115
>gi|348682468|gb|EGZ22284.1| hypothetical protein PHYSODRAFT_330115 [Phytophthora sojae]
Length = 322
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 57/127 (44%), Gaps = 16/127 (12%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSG--------ASLMAKAAISSHAQALYSLAV 176
L A+ G+ +A + LG L RG G AS +AK+A+ H A + LA+
Sbjct: 87 LLGRAAERGHRDAEFALG-----VLYGRGEGVPRSDSLSASWLAKSAVRGHTDAKWMLAI 141
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ G G ++ D+ V L AA G A LG + G GVRQN + Q
Sbjct: 142 MYNEGRGVAE---DVDRAVELLQEAATSGSDQAKFHLGVMYEYGRGVRQNFKQAAELYQQ 198
Query: 237 ANARELA 243
A+ ++A
Sbjct: 199 AHEHQVA 205
>gi|241953493|ref|XP_002419468.1| chitin synthase regulatory factor, putative [Candida dubliniensis
CD36]
gi|223642808|emb|CAX43063.1| chitin synthase regulatory factor, putative [Candida dubliniensis
CD36]
Length = 744
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 55/130 (42%), Gaps = 12/130 (9%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYC--------LQNRGSGASLMAKAA---ISSH 167
S A FLK+ A + A Y LG+ FY + + G + +AA
Sbjct: 480 SRKAIEFLKIAASRNHPAAMYKLGVYSFYGRMGLPTNDMNTKKMGIKWLTRAANVATELT 539
Query: 168 AQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
A A Y L + +NG D + G+ L A+AA LGH+ + LGH + G V Q+
Sbjct: 540 AAAPYELGKLYYNGFEDIV-LIDKKYGLELFAQAAALGHLQSAAILGHHYEIGEIVPQDS 598
Query: 228 AEGRRFLVQA 237
+ QA
Sbjct: 599 NLSIHYYTQA 608
>gi|393227841|gb|EJD35504.1| hypothetical protein AURDEDRAFT_130598 [Auricularia delicata
TFB-10046 SS5]
Length = 505
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 27/52 (51%), Gaps = 11/52 (21%)
Query: 334 GCGRPE--------TRRHEFRRCSVCGAVNYCSRACQALDWK---LRHKADC 374
GC RPE T F RCS C AV YCSR CQ DW+ + HKA C
Sbjct: 372 GCQRPECGMTVHDKTTPGGFPRCSTCRAVRYCSRECQDADWRCGAVPHKAVC 423
>gi|384411625|ref|YP_005620990.1| Sel1 domain-containing protein repeat-containing protein [Zymomonas
mobilis subsp. mobilis ATCC 10988]
gi|335931999|gb|AEH62539.1| Sel1 domain protein repeat-containing protein [Zymomonas mobilis
subsp. mobilis ATCC 10988]
Length = 455
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 6/119 (5%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYC---LQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+ K AD G+V+A +G + Q+ G L KAA ++A Y+LA+ +NG
Sbjct: 323 WYKKAADQGHVKAQLNMGFAYYQARGVAQDYARGIFLYRKAAEQGDSKAEYNLAIAYYNG 382
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
G KDL + RAA G + A LG G GV ++ E +L +A A+
Sbjct: 383 VG---EPKDLAQSIYWFQRAASHGEMSAQYNLGAFYMRGEGVPKDRNEAIFWLEKAAAQ 438
>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
Length = 498
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 21/32 (65%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+ R CS C V YCS+ CQA +W L HK +CA
Sbjct: 75 KLRACSGCKVVKYCSKGCQAENWALIHKHECA 106
>gi|417666054|ref|ZP_12315615.1| tetratricopeptide repeat family protein [Escherichia coli STEC_O31]
gi|397786390|gb|EJK97227.1| tetratricopeptide repeat family protein [Escherichia coli STEC_O31]
Length = 328
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 13/127 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 158 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGEGVKQNFRQAREWFEKAASQNQLNA 210
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GV QN +
Sbjct: 211 QYNLGQIYYYGQGVTQS---YRQAKDWFEKAAEKGHVDAQYNLGVIYENGEGVSQNYQQA 267
Query: 231 RRFLVQA 237
+ + +A
Sbjct: 268 KAWYEKA 274
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 47/117 (40%), Gaps = 6/117 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A + + A + A Y LG I +Y Q+ KAA H A Y+L VI
Sbjct: 194 QAREWFEKAASQNQLNAQYNLGQIYYYGQGVTQSYRQAKDWFEKAAEKGHVDAQYNLGVI 253
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
NG G S+N + A +AA A ELG + G G ++ + R +
Sbjct: 254 YENGEGVSQN---YQQAKAWYEKAASQNDAQAQFELGVMNELGQGESIDLKQARHYY 307
>gi|153948984|ref|YP_001401070.1| hypothetical protein YpsIP31758_2097 [Yersinia pseudotuberculosis
IP 31758]
gi|152960479|gb|ABS47940.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP
31758]
Length = 720
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 137 ACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVA 196
A Y LG + C Q+ G + K+A + +AL+ LA + +N G K D+ +
Sbjct: 569 AQYELGKYHYTC-QDYDRGLKWLEKSADHKNIKALFLLAQM-YNEGDGVKEDQ--TKYFS 624
Query: 197 LCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+AA LG DA E+G+ G GV +N+ E ++ ++A
Sbjct: 625 YLLKAAQLGLSDAQVEIGYLYLVGEGVEKNLPEAYQWHIKA 665
>gi|393239221|gb|EJD46754.1| hypothetical protein AURDEDRAFT_113626 [Auricularia delicata
TFB-10046 SS5]
Length = 392
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 25/48 (52%), Gaps = 3/48 (6%)
Query: 330 CSHVGCGR---PETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
C+ GCGR T H F C+ CG YCSRACQ DW H+ C
Sbjct: 278 CAAPGCGRHSHESTTGHMFPVCARCGLATYCSRACQKRDWAGAHRDVC 325
>gi|170024436|ref|YP_001720941.1| Sel1 domain-containing protein [Yersinia pseudotuberculosis YPIII]
gi|169750970|gb|ACA68488.1| Sel1 domain protein repeat-containing protein [Yersinia
pseudotuberculosis YPIII]
Length = 720
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 137 ACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVA 196
A Y LG + C Q+ G + K+A + +AL+ LA + +N G K D+ +
Sbjct: 569 AQYELGKYHYTC-QDYDRGLKWLEKSADHKNIKALFLLAQM-YNEGDGVKEDQ--TKYFS 624
Query: 197 LCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+AA LG DA E+G+ G GV +N+ E ++ ++A
Sbjct: 625 YLLKAAQLGLSDAQVEIGYLYLVGEGVEKNLPEAYQWHIKA 665
>gi|148826087|ref|YP_001290840.1| Sel1 domain-containing protein [Haemophilus influenzae PittEE]
gi|148716247|gb|ABQ98457.1| Sel1 domain protein repeat-containing protein [Haemophilus
influenzae PittEE]
Length = 384
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 60/122 (49%), Gaps = 10/122 (8%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLA 175
A ++ + A+ GN +A + LG++ Y Q+ KAA +A A ++L
Sbjct: 104 EAVKWYRQAAEQGNAKAQFNLGLM--YDNGRGVKQDYFEAVKWFRKAAEQGYADAQFNLG 161
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
+ +NG G ++D + V +AA G+ DA LG+ +G+GV+Q+ E ++
Sbjct: 162 NMYYNGHGVKQDDFE---AVKWYRKAAEQGYADAQFNLGNMYYNGHGVKQDDFEAVKWYR 218
Query: 236 QA 237
+A
Sbjct: 219 KA 220
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 6/120 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ + A+ G +A + LG + + Q+ KAA HA+A Y+L +
Sbjct: 212 EAVKWYRKAAEQGYADAQFNLGNMYYNGHGVKQDDFEAVKWYRKAAEQGHAKAQYNLGNM 271
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
NG G + D V +AA G+ DA LG G+GVRQ+ E ++ +A
Sbjct: 272 YANGRGVKQ---DYFEAVKWYRKAAEQGYADAQANLGSAYSAGHGVRQDYIEAVKWFKKA 328
>gi|260654388|ref|ZP_05859878.1| Sel1 repeat family protein [Jonquetella anthropi E3_33 E1]
gi|260631021|gb|EEX49215.1| Sel1 repeat family protein [Jonquetella anthropi E3_33 E1]
Length = 831
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 60/125 (48%), Gaps = 10/125 (8%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSL 174
+ A + A +G+ EA LG R Y +N+ A + KAA + +Y++
Sbjct: 293 KKAFQLYTSAATSGDNEAYIDLG--RLYETGVGVEKNKAKAAEMYKKAASFGLPEGMYNM 350
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
I G G ++ D +AGV +AA G +A R+LG G V +N+A+ ++ L
Sbjct: 351 GRIAIIGKGVAQ---DRKAGVQWLEKAAAAGSTNAQRDLGVLCARGDWVPKNLAKAKKLL 407
Query: 235 VQANA 239
+A+A
Sbjct: 408 KEASA 412
Score = 38.5 bits (88), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 49/116 (42%), Gaps = 2/116 (1%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
A K AD G++ + LG+++ G LM KAA +A A LA I NG
Sbjct: 225 AEALYKQAADEGSLPSVMALGILKMQK-DAHTEGLRLMTKAADGGYAPAKKFLAQIYENG 283
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
GG + L AA G +A +LG + G GV +N A+ +A
Sbjct: 284 -GGMGFTPQPKKAFQLYTSAATSGDNEAYIDLGRLYETGVGVEKNKAKAAEMYKKA 338
>gi|320165350|gb|EFW42249.1| hypothetical protein CAOG_07634 [Capsaspora owczarzaki ATCC 30864]
Length = 1693
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 37/71 (52%), Gaps = 3/71 (4%)
Query: 166 SHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQ 225
+H ALY LA+ +G N + RAA G++ +LR +GHC + G GV++
Sbjct: 1086 NHPIALYELAMRNLHGIDMPANGNN---AFRYFFRAARAGNMSSLRWVGHCYKTGTGVKE 1142
Query: 226 NIAEGRRFLVQ 236
N+AE R +
Sbjct: 1143 NVAEALRLFSE 1153
>gi|209875943|ref|XP_002139414.1| MYND finger domain-containing protein [Cryptosporidium muris RN66]
gi|209555020|gb|EEA05065.1| MYND finger domain-containing protein [Cryptosporidium muris RN66]
Length = 867
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 36/83 (43%), Gaps = 10/83 (12%)
Query: 298 CNVPAPEAHP-ASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNY 356
C++ P HP RF G G +L ++ +P++ +FR C C +Y
Sbjct: 662 CSIKVPCFHPDCVRFTV------GSRVKIGEKLDENL---KPDSYEIDFRYCKNCLVASY 712
Query: 357 CSRACQALDWKLRHKADCAPAER 379
CS ACQ W+L H C R
Sbjct: 713 CSNACQKAHWELSHSTTCGLMSR 735
>gi|45441640|ref|NP_993179.1| hypothetical protein YP_1836 [Yersinia pestis biovar Microtus str.
91001]
gi|145598429|ref|YP_001162505.1| hypothetical protein YPDSF_1134 [Yersinia pestis Pestoides F]
gi|162421304|ref|YP_001606911.1| hypothetical protein YpAngola_A2494 [Yersinia pestis Angola]
gi|229894696|ref|ZP_04509876.1| putative exported protein [Yersinia pestis Pestoides A]
gi|45436502|gb|AAS62056.1| putative exported protein [Yersinia pestis biovar Microtus str.
91001]
gi|145210125|gb|ABP39532.1| hypothetical protein YPDSF_1134 [Yersinia pestis Pestoides F]
gi|162354119|gb|ABX88067.1| conserved hypothetical protein [Yersinia pestis Angola]
gi|229702169|gb|EEO90188.1| putative exported protein [Yersinia pestis Pestoides A]
Length = 720
Score = 42.7 bits (99), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 137 ACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVA 196
A Y LG + C Q+ G + K+A + +AL+ LA + +N G K D+ +
Sbjct: 569 AQYELGKYHYTC-QDYDRGLKWLEKSADHKNIKALFLLAQM-YNEGDGVKEDQ--TKYFS 624
Query: 197 LCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+AA LG DA E+G+ G GV +N+ E ++ ++A
Sbjct: 625 YLLKAAQLGLSDAQVEIGYLYLVGEGVEKNLPEAYQWHIKA 665
>gi|397568300|gb|EJK46064.1| hypothetical protein THAOC_35290, partial [Thalassiosira oceanica]
Length = 397
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 45/93 (48%), Gaps = 2/93 (2%)
Query: 145 RFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFL 204
R C N +++ + +A++SLA F+G G + K+++ V L AA +
Sbjct: 209 RTPCPDNDADKLAMIRARVVKKDPEAIFSLAQFYFHGELGLQ--KNMQKAVELYTEAAEV 266
Query: 205 GHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G +DAL +G G GV +N A+ F +A
Sbjct: 267 GSVDALFNIGVAYYFGSGVEKNTAKAVEFYEKA 299
>gi|386638523|ref|YP_006105321.1| hypothetical protein ECABU_c12240 [Escherichia coli ABU 83972]
gi|442603690|ref|ZP_21018559.1| FIG00639943: hypothetical protein [Escherichia coli Nissle 1917]
gi|307553015|gb|ADN45790.1| conserved hypothetical protein [Escherichia coli ABU 83972]
gi|441715593|emb|CCQ04536.1| FIG00639943: hypothetical protein [Escherichia coli Nissle 1917]
Length = 328
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 13/127 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 158 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGEGVKQNFRQAREWFEKAASQNQPNA 210
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GV QN +
Sbjct: 211 QYNLGQIYYYGQGVTQS---YRQAKDWFEKAAEKGHVDAQYNLGVIYENGEGVSQNYQQA 267
Query: 231 RRFLVQA 237
+ + +A
Sbjct: 268 KAWYEKA 274
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 46/117 (39%), Gaps = 6/117 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A + + A A Y LG I +Y Q+ KAA H A Y+L VI
Sbjct: 194 QAREWFEKAASQNQPNAQYNLGQIYYYGQGVTQSYRQAKDWFEKAAEKGHVDAQYNLGVI 253
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
NG G S+N + A +AA A ELG + G G ++ + R +
Sbjct: 254 YENGEGVSQN---YQQAKAWYEKAASQNDAQAQFELGVMNELGQGESIDLKQARHYY 307
>gi|392559282|gb|EIW52467.1| ankyrin [Trametes versicolor FP-101664 SS1]
Length = 406
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%), Gaps = 1/32 (3%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
E + C+ CG++ YCS+ CQ DW RHK DC
Sbjct: 230 ELKFCAKCGSIRYCSKECQKTDWP-RHKLDCV 260
>gi|270490908|ref|ZP_06207982.1| hypothetical protein YPD27_4517 [Yersinia pestis KIM D27]
gi|270339412|gb|EFA50189.1| hypothetical protein YPD27_4517 [Yersinia pestis KIM D27]
Length = 715
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 137 ACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVA 196
A Y LG + C Q+ G + K+A + +AL+ LA + +N G K D+ +
Sbjct: 564 AQYELGKYHYTC-QDYDRGLKWLEKSADHKNIKALFLLAQM-YNEGDGVKEDQ--TKYFS 619
Query: 197 LCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+AA LG DA E+G+ G GV +N+ E ++ ++A
Sbjct: 620 YLLKAAQLGLSDAQVEIGYLYLVGEGVEKNLPEAYQWHIKA 660
>gi|22126207|ref|NP_669630.1| hypothetical protein y2322 [Yersinia pestis KIM10+]
gi|108807367|ref|YP_651283.1| hypothetical protein YPA_1371 [Yersinia pestis Antiqua]
gi|108811632|ref|YP_647399.1| hypothetical protein YPN_1469 [Yersinia pestis Nepal516]
gi|149366080|ref|ZP_01888115.1| putative exported protein [Yersinia pestis CA88-4125]
gi|165926483|ref|ZP_02222315.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
str. F1991016]
gi|165938901|ref|ZP_02227455.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
str. IP275]
gi|166009423|ref|ZP_02230321.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
E1979001]
gi|166210791|ref|ZP_02236826.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
B42003004]
gi|167401377|ref|ZP_02306874.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
UG05-0454]
gi|167420429|ref|ZP_02312182.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
str. MG05-1020]
gi|167424508|ref|ZP_02316261.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis
str. K1973002]
gi|167466848|ref|ZP_02331552.1| hypothetical protein YpesF_02910 [Yersinia pestis FV-1]
gi|218929099|ref|YP_002346974.1| hypothetical protein YPO1989 [Yersinia pestis CO92]
gi|229897397|ref|ZP_04512553.1| putative exported protein [Yersinia pestis biovar Orientalis str.
PEXU2]
gi|229898040|ref|ZP_04513191.1| putative exported protein [Yersinia pestis biovar Orientalis str.
India 195]
gi|229901910|ref|ZP_04517031.1| putative exported protein [Yersinia pestis Nepal516]
gi|294503939|ref|YP_003568001.1| hypothetical protein YPZ3_1829 [Yersinia pestis Z176003]
gi|384122344|ref|YP_005504964.1| hypothetical protein YPD4_1753 [Yersinia pestis D106004]
gi|384126257|ref|YP_005508871.1| hypothetical protein YPD8_1795 [Yersinia pestis D182038]
gi|384140187|ref|YP_005522889.1| hypothetical protein A1122_16240 [Yersinia pestis A1122]
gi|384414832|ref|YP_005624194.1| hypothetical protein YPC_2333 [Yersinia pestis biovar Medievalis
str. Harbin 35]
gi|420546877|ref|ZP_15044820.1| MORN repeat variant family protein [Yersinia pestis PY-01]
gi|420552175|ref|ZP_15049552.1| MORN repeat variant family protein [Yersinia pestis PY-02]
gi|420557689|ref|ZP_15054405.1| MORN repeat variant family protein [Yersinia pestis PY-03]
gi|420563198|ref|ZP_15059273.1| MORN repeat variant family protein [Yersinia pestis PY-04]
gi|420568240|ref|ZP_15063851.1| MORN repeat variant family protein [Yersinia pestis PY-05]
gi|420571475|ref|ZP_15066801.1| MORN repeat variant family protein [Yersinia pestis PY-06]
gi|420579241|ref|ZP_15073827.1| MORN repeat variant family protein [Yersinia pestis PY-07]
gi|420584554|ref|ZP_15078645.1| MORN repeat variant family protein [Yersinia pestis PY-08]
gi|420589682|ref|ZP_15083258.1| MORN repeat variant family protein [Yersinia pestis PY-09]
gi|420595083|ref|ZP_15088126.1| MORN repeat variant family protein [Yersinia pestis PY-10]
gi|420600729|ref|ZP_15093158.1| MORN repeat variant family protein [Yersinia pestis PY-11]
gi|420606167|ref|ZP_15098044.1| MORN repeat variant family protein [Yersinia pestis PY-12]
gi|420611559|ref|ZP_15102905.1| MORN repeat variant family protein [Yersinia pestis PY-13]
gi|420622274|ref|ZP_15112386.1| MORN repeat variant family protein [Yersinia pestis PY-15]
gi|420627340|ref|ZP_15116979.1| MORN repeat variant family protein [Yersinia pestis PY-16]
gi|420632426|ref|ZP_15121563.1| MORN repeat variant family protein [Yersinia pestis PY-19]
gi|420637637|ref|ZP_15126234.1| MORN repeat variant family protein [Yersinia pestis PY-25]
gi|420643195|ref|ZP_15131281.1| MORN repeat variant family protein [Yersinia pestis PY-29]
gi|420648386|ref|ZP_15135999.1| MORN repeat variant family protein [Yersinia pestis PY-32]
gi|420654041|ref|ZP_15141083.1| MORN repeat variant family protein [Yersinia pestis PY-34]
gi|420659507|ref|ZP_15145993.1| MORN repeat variant family protein [Yersinia pestis PY-36]
gi|420664851|ref|ZP_15150779.1| MORN repeat variant family protein [Yersinia pestis PY-42]
gi|420669752|ref|ZP_15155232.1| MORN repeat variant family protein [Yersinia pestis PY-45]
gi|420675074|ref|ZP_15160070.1| MORN repeat variant family protein [Yersinia pestis PY-46]
gi|420680667|ref|ZP_15165136.1| MORN repeat variant family protein [Yersinia pestis PY-47]
gi|420685947|ref|ZP_15169853.1| MORN repeat variant family protein [Yersinia pestis PY-48]
gi|420691144|ref|ZP_15174442.1| MORN repeat variant family protein [Yersinia pestis PY-52]
gi|420708234|ref|ZP_15188962.1| MORN repeat variant family protein [Yersinia pestis PY-55]
gi|420713606|ref|ZP_15193767.1| MORN repeat variant family protein [Yersinia pestis PY-56]
gi|420724600|ref|ZP_15203318.1| MORN repeat variant family protein [Yersinia pestis PY-59]
gi|420730216|ref|ZP_15208343.1| MORN repeat variant family protein [Yersinia pestis PY-60]
gi|420735237|ref|ZP_15212888.1| MORN repeat variant family protein [Yersinia pestis PY-61]
gi|420740713|ref|ZP_15217816.1| MORN repeat variant family protein [Yersinia pestis PY-63]
gi|420746231|ref|ZP_15222575.1| MORN repeat variant family protein [Yersinia pestis PY-64]
gi|420751850|ref|ZP_15227478.1| MORN repeat variant family protein [Yersinia pestis PY-65]
gi|420762920|ref|ZP_15236777.1| MORN repeat variant family protein [Yersinia pestis PY-71]
gi|420768085|ref|ZP_15241429.1| MORN repeat variant family protein [Yersinia pestis PY-72]
gi|420773134|ref|ZP_15245975.1| MORN repeat variant family protein [Yersinia pestis PY-76]
gi|420778664|ref|ZP_15250876.1| MORN repeat variant family protein [Yersinia pestis PY-88]
gi|420784247|ref|ZP_15255766.1| MORN repeat variant family protein [Yersinia pestis PY-89]
gi|420789487|ref|ZP_15260428.1| MORN repeat variant family protein [Yersinia pestis PY-90]
gi|420794988|ref|ZP_15265382.1| MORN repeat variant family protein [Yersinia pestis PY-91]
gi|420800041|ref|ZP_15269919.1| MORN repeat variant family protein [Yersinia pestis PY-92]
gi|420805431|ref|ZP_15274788.1| MORN repeat variant family protein [Yersinia pestis PY-93]
gi|420810774|ref|ZP_15279612.1| MORN repeat variant family protein [Yersinia pestis PY-94]
gi|420816318|ref|ZP_15284594.1| MORN repeat variant family protein [Yersinia pestis PY-95]
gi|420821585|ref|ZP_15289344.1| MORN repeat variant family protein [Yersinia pestis PY-96]
gi|420826681|ref|ZP_15293909.1| MORN repeat variant family protein [Yersinia pestis PY-98]
gi|420832402|ref|ZP_15299078.1| MORN repeat variant family protein [Yersinia pestis PY-99]
gi|420837253|ref|ZP_15303462.1| MORN repeat variant family protein [Yersinia pestis PY-100]
gi|420842424|ref|ZP_15308148.1| MORN repeat variant family protein [Yersinia pestis PY-101]
gi|420848069|ref|ZP_15313230.1| MORN repeat variant family protein [Yersinia pestis PY-102]
gi|420858925|ref|ZP_15322615.1| MORN repeat variant family protein [Yersinia pestis PY-113]
gi|421763474|ref|ZP_16200269.1| hypothetical protein INS_10362 [Yersinia pestis INS]
gi|21959174|gb|AAM85881.1|AE013834_6 hypothetical [Yersinia pestis KIM10+]
gi|108775280|gb|ABG17799.1| hypothetical protein YPN_1469 [Yersinia pestis Nepal516]
gi|108779280|gb|ABG13338.1| hypothetical protein YPA_1371 [Yersinia pestis Antiqua]
gi|115347710|emb|CAL20625.1| putative exported protein [Yersinia pestis CO92]
gi|149292493|gb|EDM42567.1| putative exported protein [Yersinia pestis CA88-4125]
gi|165913264|gb|EDR31887.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
str. IP275]
gi|165921704|gb|EDR38901.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
str. F1991016]
gi|165991978|gb|EDR44279.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
E1979001]
gi|166207971|gb|EDR52451.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
B42003004]
gi|166962124|gb|EDR58145.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
str. MG05-1020]
gi|167049073|gb|EDR60481.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
UG05-0454]
gi|167056390|gb|EDR66159.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis
str. K1973002]
gi|229680806|gb|EEO76901.1| putative exported protein [Yersinia pestis Nepal516]
gi|229689081|gb|EEO81146.1| putative exported protein [Yersinia pestis biovar Orientalis str.
India 195]
gi|229693734|gb|EEO83783.1| putative exported protein [Yersinia pestis biovar Orientalis str.
PEXU2]
gi|262361940|gb|ACY58661.1| hypothetical protein YPD4_1753 [Yersinia pestis D106004]
gi|262365921|gb|ACY62478.1| hypothetical protein YPD8_1795 [Yersinia pestis D182038]
gi|294354398|gb|ADE64739.1| hypothetical protein YPZ3_1829 [Yersinia pestis Z176003]
gi|320015336|gb|ADV98907.1| putative exported protein [Yersinia pestis biovar Medievalis str.
Harbin 35]
gi|342855316|gb|AEL73869.1| hypothetical protein A1122_16240 [Yersinia pestis A1122]
gi|391426435|gb|EIQ88616.1| MORN repeat variant family protein [Yersinia pestis PY-01]
gi|391428125|gb|EIQ90135.1| MORN repeat variant family protein [Yersinia pestis PY-02]
gi|391429398|gb|EIQ91254.1| MORN repeat variant family protein [Yersinia pestis PY-03]
gi|391441946|gb|EIR02385.1| MORN repeat variant family protein [Yersinia pestis PY-04]
gi|391443618|gb|EIR03913.1| MORN repeat variant family protein [Yersinia pestis PY-05]
gi|391452176|gb|EIR11607.1| MORN repeat variant family protein [Yersinia pestis PY-06]
gi|391459078|gb|EIR17885.1| MORN repeat variant family protein [Yersinia pestis PY-07]
gi|391460124|gb|EIR18851.1| MORN repeat variant family protein [Yersinia pestis PY-08]
gi|391462348|gb|EIR20869.1| MORN repeat variant family protein [Yersinia pestis PY-09]
gi|391475130|gb|EIR32356.1| MORN repeat variant family protein [Yersinia pestis PY-10]
gi|391476917|gb|EIR33996.1| MORN repeat variant family protein [Yersinia pestis PY-11]
gi|391477290|gb|EIR34326.1| MORN repeat variant family protein [Yersinia pestis PY-12]
gi|391490939|gb|EIR46544.1| MORN repeat variant family protein [Yersinia pestis PY-13]
gi|391492069|gb|EIR47572.1| MORN repeat variant family protein [Yersinia pestis PY-15]
gi|391506661|gb|EIR60562.1| MORN repeat variant family protein [Yersinia pestis PY-16]
gi|391507551|gb|EIR61370.1| MORN repeat variant family protein [Yersinia pestis PY-19]
gi|391511867|gb|EIR65234.1| MORN repeat variant family protein [Yersinia pestis PY-25]
gi|391522306|gb|EIR74701.1| MORN repeat variant family protein [Yersinia pestis PY-29]
gi|391524860|gb|EIR77038.1| MORN repeat variant family protein [Yersinia pestis PY-34]
gi|391525942|gb|EIR78033.1| MORN repeat variant family protein [Yersinia pestis PY-32]
gi|391537884|gb|EIR88730.1| MORN repeat variant family protein [Yersinia pestis PY-36]
gi|391540548|gb|EIR91166.1| MORN repeat variant family protein [Yersinia pestis PY-42]
gi|391542426|gb|EIR92884.1| MORN repeat variant family protein [Yersinia pestis PY-45]
gi|391555759|gb|EIS04898.1| MORN repeat variant family protein [Yersinia pestis PY-46]
gi|391556180|gb|EIS05278.1| MORN repeat variant family protein [Yersinia pestis PY-47]
gi|391557563|gb|EIS06547.1| MORN repeat variant family protein [Yersinia pestis PY-48]
gi|391571065|gb|EIS18466.1| MORN repeat variant family protein [Yersinia pestis PY-52]
gi|391583990|gb|EIS29580.1| MORN repeat variant family protein [Yersinia pestis PY-55]
gi|391587068|gb|EIS32283.1| MORN repeat variant family protein [Yersinia pestis PY-56]
gi|391600400|gb|EIS43932.1| MORN repeat variant family protein [Yersinia pestis PY-60]
gi|391602087|gb|EIS45422.1| MORN repeat variant family protein [Yersinia pestis PY-59]
gi|391614982|gb|EIS56795.1| MORN repeat variant family protein [Yersinia pestis PY-61]
gi|391615639|gb|EIS57382.1| MORN repeat variant family protein [Yersinia pestis PY-63]
gi|391620767|gb|EIS61888.1| MORN repeat variant family protein [Yersinia pestis PY-64]
gi|391627155|gb|EIS67394.1| MORN repeat variant family protein [Yersinia pestis PY-65]
gi|391638411|gb|EIS77218.1| MORN repeat variant family protein [Yersinia pestis PY-71]
gi|391640885|gb|EIS79374.1| MORN repeat variant family protein [Yersinia pestis PY-72]
gi|391650413|gb|EIS87696.1| MORN repeat variant family protein [Yersinia pestis PY-76]
gi|391655057|gb|EIS91835.1| MORN repeat variant family protein [Yersinia pestis PY-88]
gi|391659608|gb|EIS95871.1| MORN repeat variant family protein [Yersinia pestis PY-89]
gi|391663482|gb|EIS99318.1| MORN repeat variant family protein [Yersinia pestis PY-90]
gi|391670819|gb|EIT05819.1| MORN repeat variant family protein [Yersinia pestis PY-91]
gi|391680861|gb|EIT14871.1| MORN repeat variant family protein [Yersinia pestis PY-93]
gi|391682352|gb|EIT16237.1| MORN repeat variant family protein [Yersinia pestis PY-92]
gi|391683024|gb|EIT16836.1| MORN repeat variant family protein [Yersinia pestis PY-94]
gi|391694775|gb|EIT27404.1| MORN repeat variant family protein [Yersinia pestis PY-95]
gi|391698199|gb|EIT30524.1| MORN repeat variant family protein [Yersinia pestis PY-96]
gi|391699543|gb|EIT31726.1| MORN repeat variant family protein [Yersinia pestis PY-98]
gi|391709421|gb|EIT40600.1| MORN repeat variant family protein [Yersinia pestis PY-99]
gi|391715550|gb|EIT46076.1| MORN repeat variant family protein [Yersinia pestis PY-100]
gi|391716261|gb|EIT46723.1| MORN repeat variant family protein [Yersinia pestis PY-101]
gi|391727707|gb|EIT56890.1| MORN repeat variant family protein [Yersinia pestis PY-102]
gi|391735121|gb|EIT63311.1| MORN repeat variant family protein [Yersinia pestis PY-113]
gi|411176375|gb|EKS46395.1| hypothetical protein INS_10362 [Yersinia pestis INS]
Length = 720
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 137 ACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVA 196
A Y LG + C Q+ G + K+A + +AL+ LA + +N G K D+ +
Sbjct: 569 AQYELGKYHYTC-QDYDRGLKWLEKSADHKNIKALFLLAQM-YNEGDGVKEDQ--TKYFS 624
Query: 197 LCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+AA LG DA E+G+ G GV +N+ E ++ ++A
Sbjct: 625 YLLKAAQLGLSDAQVEIGYLYLVGEGVEKNLPEAYQWHIKA 665
>gi|425770916|gb|EKV09375.1| hypothetical protein PDIP_64570 [Penicillium digitatum Pd1]
gi|425776732|gb|EKV14940.1| hypothetical protein PDIG_30190 [Penicillium digitatum PHI26]
Length = 495
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 343 HEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+E + C+ C V YC RACQ+ DWK H +C
Sbjct: 72 NELKACTGCRVVKYCDRACQSKDWKFAHSVEC 103
>gi|375335721|ref|ZP_09777065.1| hypothetical protein SbacW_01602 [Succinivibrionaceae bacterium
WG-1]
Length = 492
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 59/144 (40%), Gaps = 18/144 (12%)
Query: 126 LKLCADAGNVEACYTLGMIRFY--------CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
LK AD GN AC+ LG R Y +N + KAA + +A Y LA +
Sbjct: 241 LKKAADKGNTLACFELG--RMYLNGDKNANIQENLPEAVEYIGKAASENMIEAQYLLATL 298
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL--- 234
NG G D R + L AA HI A +LG G I EG ++L
Sbjct: 299 YENGIGVK---VDYRRAMELYRLAASHDHIGAESKLGQLYLTK-GDSTQIEEGIQWLNKA 354
Query: 235 -VQANARELAAVLSSAACQGISTR 257
V+ + E L+ A QGI +
Sbjct: 355 IVEGDDAEAKTFLAYAYSQGIGVK 378
>gi|68478643|ref|XP_716590.1| hypothetical protein CaO19.7349 [Candida albicans SC5314]
gi|46438262|gb|EAK97595.1| hypothetical protein CaO19.7349 [Candida albicans SC5314]
Length = 751
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 56/133 (42%), Gaps = 18/133 (13%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYC--------LQNRGSGASLMAKAA---ISSH 167
S A FLK+ A + A Y LG+ FY + + G + +AA
Sbjct: 494 SRKAVEFLKIAASRNHPAAMYKLGVYSFYGRMGLPANDMNTKKMGIKWLTRAANVATELT 553
Query: 168 AQALYSLAVIQFNGSGGSKNDK---DLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
A A Y L + +NG D D + G+ L A+AA LGH+ + LGH + G V
Sbjct: 554 AAAPYELGKLYYNGF----EDIVLIDKKYGLELFAQAAALGHLQSAAILGHHYEIGEIVP 609
Query: 225 QNIAEGRRFLVQA 237
Q+ + QA
Sbjct: 610 QDSNLSIHYYTQA 622
>gi|86749631|ref|YP_486127.1| Sel1-like protein [Rhodopseudomonas palustris HaA2]
gi|86572659|gb|ABD07216.1| Sel1-like protein [Rhodopseudomonas palustris HaA2]
Length = 287
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 53/112 (47%), Gaps = 4/112 (3%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
A + + A+AGN L + + G +L+AK A ++A+A Y L ++ G
Sbjct: 142 ARQLFERAAEAGNPRGVSNLAALGGGTPSDPGKTRALLAKGA-ETNAEAQYQLGMMLAEG 200
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
GG K+D RA L +AA GH AL +G Q G G ++ A + F
Sbjct: 201 LGGPKDDVAARA---LFEKAAAQGHPAALERMGAFAQSGRGGPKDSAAAKGF 249
>gi|395334343|gb|EJF66719.1| hypothetical protein DICSQDRAFT_164560 [Dichomitus squalens
LYAD-421 SS1]
Length = 181
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
C + C + + +CS C A+ YCS+ACQ DWK RHK C
Sbjct: 135 CGNYNCLKLQNLLSSLSKCSACKAILYCSQACQKADWK-RHKKYC 178
>gi|393234157|gb|EJD41722.1| hypothetical protein AURDEDRAFT_186343 [Auricularia delicata
TFB-10046 SS5]
Length = 452
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+ + CS C ++NYCS ACQ WK HKADC
Sbjct: 48 KLQTCSRCKSINYCSAACQREHWKAGHKADC 78
>gi|19075782|ref|NP_588282.1| chitin synthase regulatory factor Chr2 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|78099951|sp|O94486.1|CHR2_SCHPO RecName: Full=Chitin synthase regulatory factor 2; AltName:
Full=Chs four homolog 2; Flags: Precursor
gi|4107282|emb|CAA22651.1| chitin synthase regulatory factor Chr2 (predicted)
[Schizosaccharomyces pombe]
Length = 512
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+++ALY LAV G+G + + + + L AA L H+ A + CLQ G+GV QN
Sbjct: 223 YSEALYLLAVCY--GTGALRTEINEKEAYRLYKMAADLNHVQAAYRVAICLQMGFGVTQN 280
Query: 227 IAEGRRFLVQA 237
E + +A
Sbjct: 281 TEEAIHYFFRA 291
>gi|397607763|gb|EJK59797.1| hypothetical protein THAOC_19939, partial [Thalassiosira oceanica]
Length = 341
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 47/114 (41%), Gaps = 2/114 (1%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSG 183
R + C A R C N +++ A+Y L I F G
Sbjct: 144 RICRGCDSAAQKRGMDDCAFCRTPCPDNDADRLAMIMTRVRKKDPDAIYFLGKIYFFGEL 203
Query: 184 GSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G + KD+R V L AA G I AL +LG DG GV++++A+ F +A
Sbjct: 204 GLQ--KDMRRAVELWTEAAEHGLIQALFKLGSAYYDGDGVQKDVAKAAEFYEKA 255
>gi|392594963|gb|EIW84287.1| hypothetical protein CONPUDRAFT_135786 [Coniophora puteana
RWD-64-598 SS2]
Length = 981
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 28/50 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E+ EF +C C YC + CQ+ W H+ C+
Sbjct: 726 GIRQCANMLCGKWESYPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS 775
>gi|445427002|ref|ZP_21437723.1| Sel1 repeat protein [Acinetobacter sp. WC-743]
gi|444752104|gb|ELW76795.1| Sel1 repeat protein [Acinetobacter sp. WC-743]
Length = 150
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 47/111 (42%), Gaps = 7/111 (6%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRF----YCLQNRGSGASLMAKAAISSHAQALYSLA 175
E A++ L L G+ EA Y + + Q+ G + KAA A+Y LA
Sbjct: 22 EQAYKLLDLAYKRGSPEATYAIATWYLHGGHFLEQDYVKGTEYLKKAAEMKWPDAMYDLA 81
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
V G SKN K L ++A G +A+ E+ C G G R+N
Sbjct: 82 VSYERGVYVSKNKK---KAFLLYLQSALRGSSEAVYEVSRCYYFGIGTRKN 129
>gi|260753304|ref|YP_003226197.1| Sel1 domain-containing protein repeat-containing protein [Zymomonas
mobilis subsp. mobilis NCIMB 11163]
gi|258552667|gb|ACV75613.1| Sel1 domain protein repeat-containing protein [Zymomonas mobilis
subsp. mobilis NCIMB 11163]
Length = 455
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 6/119 (5%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYC---LQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+ K AD G+V+A +G + Q+ G L KAA ++A Y+LA+ +NG
Sbjct: 323 WYKKAADQGHVKAQLNMGFAYYQARGVAQDYARGIFLYRKAAEQGDSKAEYNLAIAYYNG 382
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
G KDL + RAA G + A LG G GV ++ E +L +A A+
Sbjct: 383 VG---EPKDLAQSIYWFQRAASHGEMSAQYNLGAFYMRGEGVPKDRNEAIFWLEKAAAQ 438
>gi|357465295|ref|XP_003602929.1| Ubiquitin carboxyl-terminal hydrolase [Medicago truncatula]
gi|355491977|gb|AES73180.1| Ubiquitin carboxyl-terminal hydrolase [Medicago truncatula]
Length = 1116
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 22/43 (51%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDV 389
RC+ C +V YCS CQ + W HK DC P + G D+
Sbjct: 86 RCAKCKSVRYCSTGCQTVHWHQGHKFDCRPPSKTHRSNGVSDI 128
>gi|397647725|gb|EJK77826.1| hypothetical protein THAOC_00315 [Thalassiosira oceanica]
Length = 305
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 55/122 (45%), Gaps = 9/122 (7%)
Query: 151 NRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDAL 210
N+ +++ A A++ L + F+G G + KD+R V + AA LG ++AL
Sbjct: 133 NKADQLAMVQARAAKRDPAAIHFLGLTYFHGEMGIQ--KDIRKAVEITEEAAELGSVEAL 190
Query: 211 RELGHCLQDGYGVRQNIAEGRRF----LVQANA--RELAAVLSSAACQ-GISTRSWLTWN 263
+G+ +G GV + A+G F +Q NA R + C ++ R W+
Sbjct: 191 SHIGYWYTEGVGVEVDEAKGMEFSRKAAMQGNAGSRHFLGMYEFEKCNYDLALRHWMISA 250
Query: 264 PL 265
L
Sbjct: 251 KL 252
>gi|410960407|ref|XP_003986781.1| PREDICTED: programmed cell death protein 2 [Felis catus]
Length = 268
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 8/57 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
G LC GC P+T CS C +YCS+ Q LDW+L HK C +E +LD
Sbjct: 55 GAHLCRVCGCLGPKT-------CSRCHKAHYCSKEHQTLDWRLGHKQACTQSE-YLD 103
>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 315
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 20/33 (60%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
RCS C V YCSR CQ DWK HK DC +R
Sbjct: 66 RCSSCKYVRYCSRPCQKGDWKRCHKQDCKTLKR 98
>gi|336262610|ref|XP_003346088.1| hypothetical protein SMAC_07743 [Sordaria macrospora k-hell]
gi|380088111|emb|CCC05087.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 737
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 22/31 (70%), Gaps = 1/31 (3%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+ +CS C +YCS+ACQ +DWK RHK +C
Sbjct: 663 DLNKCSKCKKAHYCSKACQVVDWK-RHKKEC 692
>gi|386598823|ref|YP_006100329.1| hypothetical protein ECOK1_1122 [Escherichia coli IHE3034]
gi|294491583|gb|ADE90339.1| conserved hypothetical protein [Escherichia coli IHE3034]
Length = 346
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 13/127 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 176 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGEGVKQNFRQAREWFEKAASQNQLNA 228
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GV QN +
Sbjct: 229 QYNLGQIYYYGQGVTQS---YRQAKYWFEKAAEKGHVDAQYNLGVIYENGEGVSQNYQQA 285
Query: 231 RRFLVQA 237
+ + +A
Sbjct: 286 KAWYEKA 292
Score = 38.9 bits (89), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 48/117 (41%), Gaps = 6/117 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A + + A + A Y LG I +Y Q+ KAA H A Y+L VI
Sbjct: 212 QAREWFEKAASQNQLNAQYNLGQIYYYGQGVTQSYRQAKYWFEKAAEKGHVDAQYNLGVI 271
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
NG G S+N + + A +AA A ELG + G G ++ + R +
Sbjct: 272 YENGEGVSQNYQQAK---AWYEKAASQNDAQAQFELGVMNELGQGESIDLKQARHYY 325
>gi|389749456|gb|EIM90627.1| hypothetical protein STEHIDRAFT_72241 [Stereum hirsutum FP-91666
SS1]
Length = 305
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 24/40 (60%), Gaps = 3/40 (7%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CG+ R + RCS C YCS+ CQ DWK+ HKA+C
Sbjct: 265 CGK---RGVKLERCSRCRKAWYCSKECQTKDWKIGHKANC 301
>gi|2073105|dbj|BAA19847.1| chitin synthase regulatory factor [Candida albicans]
Length = 687
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 56/133 (42%), Gaps = 18/133 (13%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYC--------LQNRGSGASLMAKAA---ISSH 167
S A FLK+ A + A Y LG+ FY + + G + +AA
Sbjct: 430 SRKAVEFLKIAASRNHPAAMYKLGVYSFYGRMGLPANDMNTKKMGIKWLTRAANVATELT 489
Query: 168 AQALYSLAVIQFNGSGGSKNDK---DLRAGVALCARAAFLGHIDALRELGHCLQDGYGVR 224
A A Y L + +NG D D + G+ L A+AA LGH+ + LGH + G V
Sbjct: 490 AAAPYELGKLYYNGF----EDIVLIDKKYGLELFAQAAALGHLQSAAILGHHYEIGEIVP 545
Query: 225 QNIAEGRRFLVQA 237
Q+ + QA
Sbjct: 546 QDSNLSIHYYTQA 558
>gi|397645305|gb|EJK76768.1| hypothetical protein THAOC_01453 [Thalassiosira oceanica]
Length = 302
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 2/68 (2%)
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A++ L ++G+ G + K +R G+ L A AA LG IDAL LG+ G G ++N A+
Sbjct: 128 AIFLLGKTYYHGALGLQ--KVMRKGIELYAEAAQLGSIDALYSLGNVYFLGRGAQENQAK 185
Query: 230 GRRFLVQA 237
G +F +A
Sbjct: 186 GLKFWTKA 193
>gi|319779397|ref|YP_004130310.1| hypothetical protein TEQUI_1247 [Taylorella equigenitalis MCE9]
gi|317109421|gb|ADU92167.1| hypothetical protein TEQUI_1247 [Taylorella equigenitalis MCE9]
Length = 215
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 34/167 (20%), Positives = 63/167 (37%), Gaps = 36/167 (21%)
Query: 107 SKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAA 163
S+K + E A +LK AD G V+A +G++ L +N K+A
Sbjct: 33 SQKGTELYKQQKYEEAFEYLKPAADQGQVQAQVAVGVMYLKGLGVPKNYDEATKYFEKSA 92
Query: 164 ISSHAQALYSLAVIQFNGSG---------------------------------GSKNDKD 190
+ +AQ +Y++ + + G G G +D
Sbjct: 93 KAGNAQGIYNIGTMYYKGEGVKLDLNKAADYFEAAAKLGLRDAQHDYGAMYLFGKGRPED 152
Query: 191 LRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++ + +AA GH +L LG + G G+ QN+ + + ++
Sbjct: 153 IKEAASWMQKAATQGHTGSLYNLGVMYETGKGMPQNLEKSKEMYTKS 199
>gi|32450448|gb|AAH54132.1| Programmed cell death 2 [Danio rerio]
Length = 358
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 28/52 (53%), Gaps = 9/52 (17%)
Query: 324 GPGLRLCSHVGC-GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G GL+LC GC G+ + CS C +V YC + Q DWK RHK +C
Sbjct: 139 GSGLKLCRLCGCLGQ--------KACSRCHSVTYCCKEHQTTDWKQRHKKEC 182
>gi|440798272|gb|ELR19340.1| Sel1 repeat/Ubox domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 918
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 55/121 (45%), Gaps = 6/121 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
+ + + + ++G V+A Y LG + +N L KAA A + L +
Sbjct: 794 DKSFQLFQQATESGFVDALYNLGRCYHHGEGVAKNMAEAVELYTKAAALGQRSAQWKLGM 853
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
G +KD++ V L +AA GH A R L HC + G GV ++A+ +++ +
Sbjct: 854 CY---ECGEAVEKDIQKAVHLYIQAAKQGHSRAQRHLSHCYRTGKGVPVDLAKAKKWRQR 910
Query: 237 A 237
A
Sbjct: 911 A 911
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 50/110 (45%), Gaps = 7/110 (6%)
Query: 129 CADAGN-VEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKN 187
CA+A N + CY G + +N L +A S ALY+L +G G +KN
Sbjct: 772 CAEAQNYLGLCYQFGKL---VQRNPDKSFQLFQQATESGFVDALYNLGRCYHHGEGVAKN 828
Query: 188 DKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+ V L +AA LG A +LG C + G V ++I + +QA
Sbjct: 829 ---MAEAVELYTKAAALGQRSAQWKLGMCYECGEAVEKDIQKAVHLYIQA 875
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 41/93 (44%), Gaps = 3/93 (3%)
Query: 158 LMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCL 217
+ +AA A+Y L F G GG K + A+ LGH+ A +LG C
Sbjct: 370 FLEEAADLGSIDAMYKLGSFYFKGRGGQKTPQQ---AFRRWKEASALGHLKATAKLGFCY 426
Query: 218 QDGYGVRQNIAEGRRFLVQANARELAAVLSSAA 250
+G GV ++ E R + A + + A ++ A
Sbjct: 427 LNGVGVSKDNGEAIRLYMVAASEDCPAAMNDLA 459
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 34/63 (53%), Gaps = 3/63 (4%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KA+ +SHA A SLA F G G +KDL + L +AA G+ A LG C ++G
Sbjct: 586 KASNASHAPATASLASCYFYGKG---VEKDLNKAIPLFVQAAEQGNKQAEFRLGQCYEEG 642
Query: 221 YGV 223
GV
Sbjct: 643 DGV 645
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 58/159 (36%), Gaps = 46/159 (28%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAIS--------SHAQALYSLAV 176
FL+ AD G+++A Y LG F +G G + A H +A L
Sbjct: 370 FLEEAADLGSIDAMYKLGSFYF-----KGRGGQKTPQQAFRRWKEASALGHLKATAKLGF 424
Query: 177 IQFNGSGGSKND---------------------------------KDLRAGVALCARAAF 203
NG G SK++ KD+ A VAL R+A
Sbjct: 425 CYLNGVGVSKDNGEAIRLYMVAASEDCPAAMNDLAWCYQNGQGIAKDMAAAVALLTRSAK 484
Query: 204 LGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAREL 242
G+ AL L Q+G G ++I + +A+ + L
Sbjct: 485 QGNKHALNSLAWAYQNGEGCEKDIGKAIELYTKASDKGL 523
>gi|342321104|gb|EGU13041.1| Hypothetical Protein RTG_00754 [Rhodotorula glutinis ATCC 204091]
Length = 1150
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 27/49 (55%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G+R C+++ CG+ E EF +C C YCS+ CQ+ W H+ C
Sbjct: 905 GIRQCANMLCGKWEGYAREFAKCRRCRRAKYCSKMCQSEAWNQGHRHWC 953
>gi|113681653|ref|NP_001038603.1| programmed cell death protein 2 [Danio rerio]
Length = 358
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 28/52 (53%), Gaps = 9/52 (17%)
Query: 324 GPGLRLCSHVGC-GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G GL+LC GC G+ + CS C +V YC + Q DWK RHK +C
Sbjct: 139 GSGLKLCRLCGCLGQ--------KACSRCHSVTYCCKEHQTTDWKQRHKKEC 182
>gi|306814141|ref|ZP_07448309.1| hypothetical protein ECNC101_18901 [Escherichia coli NC101]
gi|432513295|ref|ZP_19750530.1| hypothetical protein A17M_01152 [Escherichia coli KTE224]
gi|432698408|ref|ZP_19933573.1| hypothetical protein A31M_01154 [Escherichia coli KTE169]
gi|432745027|ref|ZP_19979725.1| hypothetical protein WGG_01152 [Escherichia coli KTE43]
gi|432971215|ref|ZP_20160090.1| hypothetical protein A15O_01786 [Escherichia coli KTE207]
gi|433082886|ref|ZP_20269350.1| hypothetical protein WIW_02035 [Escherichia coli KTE133]
gi|433144595|ref|ZP_20329741.1| hypothetical protein WKO_02130 [Escherichia coli KTE168]
gi|305852506|gb|EFM52956.1| hypothetical protein ECNC101_18901 [Escherichia coli NC101]
gi|431044334|gb|ELD54614.1| hypothetical protein A17M_01152 [Escherichia coli KTE224]
gi|431245732|gb|ELF40011.1| hypothetical protein A31M_01154 [Escherichia coli KTE169]
gi|431294003|gb|ELF84285.1| hypothetical protein WGG_01152 [Escherichia coli KTE43]
gi|431485471|gb|ELH65133.1| hypothetical protein A15O_01786 [Escherichia coli KTE207]
gi|431602493|gb|ELI71925.1| hypothetical protein WIW_02035 [Escherichia coli KTE133]
gi|431662072|gb|ELJ28881.1| hypothetical protein WKO_02130 [Escherichia coli KTE168]
Length = 378
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 13/127 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 208 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGEGVKQNFRQAREWFEKAASQNQLNA 260
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GV QN +
Sbjct: 261 QYNLGQIYYYGQGVTQS---YRQAKYWFEKAAEKGHVDAQYNLGVIYENGEGVSQNYQQA 317
Query: 231 RRFLVQA 237
+ + +A
Sbjct: 318 KAWYEKA 324
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 48/116 (41%), Gaps = 6/116 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A + + A + A Y LG I +Y Q+ KAA H A Y+L VI
Sbjct: 244 QAREWFEKAASQNQLNAQYNLGQIYYYGQGVTQSYRQAKYWFEKAAEKGHVDAQYNLGVI 303
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
NG G S+N + + A +AA A ELG + G G ++ + R +
Sbjct: 304 YENGEGVSQNYQQAK---AWYEKAASQNDAQAQFELGVMNELGQGESIDLKQARHY 356
>gi|395323527|gb|EJF55994.1| hypothetical protein DICSQDRAFT_158065 [Dichomitus squalens
LYAD-421 SS1]
Length = 995
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 27/50 (54%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CGR E EF +C C YC + CQ+ W H+ C+
Sbjct: 708 GIRQCANMLCGRWERFPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS 757
>gi|452989908|gb|EME89663.1| hypothetical protein MYCFIDRAFT_150257 [Pseudocercospora fijiensis
CIRAD86]
Length = 457
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 68/161 (42%), Gaps = 15/161 (9%)
Query: 80 PSDFVNVLITCKRMNGLALNSLVLS---KASKKTFAIKANNWSESAHRFLKLCADAGNVE 136
P++ LI K++ A S++ S +A KT A W A + LK ++G E
Sbjct: 88 PNNPKQTLIYIKKLVEAA--SVLASDGGRADPKTTAKNREKWIMEALKRLKRIVNSGYPE 145
Query: 137 A------CYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI-QFNGSGGSKNDK 189
A CY GM+ + L AA + H QA Y AV + G +
Sbjct: 146 AQFYLADCYGQGMLGLEV--DTKEAFKLYQAAAKAGHPQAAYRTAVCCEMGPEEGGGTSR 203
Query: 190 DLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNIAE 229
DL V RAA LG A+ ++G L G G ++N+ E
Sbjct: 204 DLAKAVQWYRRAATLGDGPAMYKIGAVLLKGLLGQQRNVTE 244
>gi|397598947|gb|EJK57319.1| hypothetical protein THAOC_22651 [Thalassiosira oceanica]
Length = 197
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 40/80 (50%), Gaps = 3/80 (3%)
Query: 158 LMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCL 217
+ + A HA A++ L F G G + KD R GV L A LG IDAL LG
Sbjct: 25 IQTRVAKKDHA-AIFYLGQKYFFGELGLQ--KDARKGVELWTEAVELGSIDALYNLGLAY 81
Query: 218 QDGYGVRQNIAEGRRFLVQA 237
DG GV+Q+ + +F +A
Sbjct: 82 YDGNGVQQDKKKSFQFFEKA 101
>gi|91210161|ref|YP_540147.1| hypothetical protein UTI89_C1133 [Escherichia coli UTI89]
gi|386605031|ref|YP_006111331.1| hypothetical protein UM146_12250 [Escherichia coli UM146]
gi|432553054|ref|ZP_19789783.1| hypothetical protein A1S3_01445 [Escherichia coli KTE47]
gi|432573037|ref|ZP_19809527.1| hypothetical protein A1SI_01731 [Escherichia coli KTE55]
gi|91071735|gb|ABE06616.1| hypothetical protein UTI89_C1133 [Escherichia coli UTI89]
gi|307627515|gb|ADN71819.1| hypothetical protein UM146_12250 [Escherichia coli UM146]
gi|431085771|gb|ELD91875.1| hypothetical protein A1S3_01445 [Escherichia coli KTE47]
gi|431110245|gb|ELE14172.1| hypothetical protein A1SI_01731 [Escherichia coli KTE55]
Length = 378
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 13/127 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 208 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGEGVKQNFRQAREWFEKAASQNQLNA 260
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GV QN +
Sbjct: 261 QYNLGQIYYYGQGVTQS---YRQAKYWFEKAAEKGHVDAQYNLGVIYENGEGVSQNYQQA 317
Query: 231 RRFLVQA 237
+ + +A
Sbjct: 318 KAWYEKA 324
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 48/116 (41%), Gaps = 6/116 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A + + A + A Y LG I +Y Q+ KAA H A Y+L VI
Sbjct: 244 QAREWFEKAASQNQLNAQYNLGQIYYYGQGVTQSYRQAKYWFEKAAEKGHVDAQYNLGVI 303
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
NG G S+N + + A +AA A ELG + G G ++ + R +
Sbjct: 304 YENGEGVSQNYQQAK---AWYEKAASQNDAQAQFELGVMNELGQGESIDLKQARHY 356
>gi|440791520|gb|ELR12758.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 547
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 23/41 (56%)
Query: 334 GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G + R+ + C C AV YCSR CQA DWK H+A C
Sbjct: 71 GSAPTDDRKKKLMVCGSCRAVRYCSRDCQAADWKAGHRALC 111
>gi|381195649|ref|ZP_09902991.1| putative signal peptide [Acinetobacter lwoffii WJ10621]
Length = 187
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 77/185 (41%), Gaps = 24/185 (12%)
Query: 69 ILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKL 128
++C + + + P++ VN+ T L L + ++ + A W+ LK
Sbjct: 13 VIC-MPKSPKIPAELVNLATTGNSDAQFELAELYMQSENEDDITL-AEEWA------LKA 64
Query: 129 CADAGNVEACYTLGM-IRFYCLQNR-----------GSGASLMAKAAISSHAQALYSLAV 176
A G+VEA Y LG FY + R G + +AA H A+ LA
Sbjct: 65 AA-LGHVEAMYWLGEGYAFYAKELREEDPSEAETYFGHAHHWLKQAAELKHPTAILELAG 123
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
G +KD+ + L +AA LG A+R+L ++ GV + A+ + + +
Sbjct: 124 FY---RRGDVVEKDVAKSIELVQQAAELGEAQAMRDLAFIYENALGVDTDEAKAKYWHDK 180
Query: 237 ANARE 241
+A E
Sbjct: 181 TDAAE 185
>gi|367048451|ref|XP_003654605.1| hypothetical protein THITE_52613 [Thielavia terrestris NRRL 8126]
gi|347001868|gb|AEO68269.1| hypothetical protein THITE_52613 [Thielavia terrestris NRRL 8126]
Length = 1236
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 318 ARGGTPGPGLRLCSHVGCGRPETRRH-EFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
A G G ++ C + CG+ E + +RC C V YCS CQ LDWK +H+ +C
Sbjct: 1171 ALGDGSGDVMKACRN--CGKTEEQDGVRLKRCLRCLEVAYCSNVCQKLDWK-KHRMECEE 1227
Query: 377 AERWL 381
AE ++
Sbjct: 1228 AEVYV 1232
>gi|397615049|gb|EJK63183.1| hypothetical protein THAOC_16180 [Thalassiosira oceanica]
Length = 531
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 38/78 (48%), Gaps = 3/78 (3%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KAA H Q +Y LAV + G G ++D V L RA A LG CL DG
Sbjct: 407 KAADQDHPQGIYELAVAYYTGEG---VEEDEELAVHLFRRAGEKNVPAAAYMLGDCLLDG 463
Query: 221 YGVRQNIAEGRRFLVQAN 238
GV + A+ +LV+A+
Sbjct: 464 VGVAMDRAQALEWLVRAS 481
>gi|126311314|ref|XP_001381663.1| PREDICTED: programmed cell death protein 2-like [Monodelphis
domestica]
Length = 368
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 26/55 (47%), Gaps = 7/55 (12%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
G+ LC GC P+ CS C +YCS+ Q LDWKL HK C W
Sbjct: 149 GVHLCRVCGCLGPKV-------CSKCHKAHYCSKDHQTLDWKLGHKQSCTSDNLW 196
>gi|393234159|gb|EJD41724.1| hypothetical protein AURDEDRAFT_186345 [Auricularia delicata
TFB-10046 SS5]
Length = 443
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CGR T + CS C VNYCS CQ WK HKADC
Sbjct: 40 CGRGATGTR-LQTCSRCKGVNYCSAVCQREHWKAGHKADC 78
>gi|409045448|gb|EKM54928.1| hypothetical protein PHACADRAFT_208474 [Phanerochaete carnosa
HHB-10118-sp]
Length = 307
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 335 CGRPETR-RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
CGR ET F+ CS C + YCS CQ DW +RHK+ CA
Sbjct: 149 CGRWETTGAPNFKCCSSCKSRYYCSVQCQRTDWLMRHKSICA 190
>gi|421605071|ref|ZP_16047044.1| hypothetical protein BCCGELA001_40007, partial [Bradyrhizobium sp.
CCGE-LA001]
gi|404262818|gb|EJZ28525.1| hypothetical protein BCCGELA001_40007, partial [Bradyrhizobium sp.
CCGE-LA001]
Length = 271
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 56/120 (46%), Gaps = 6/120 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K ADAG+ EA + L M+R ++G LMA AA +A Y+LA++
Sbjct: 133 AAEWYKRAADAGDREAMFALSMLRMSGRGGPVDKGEAVKLMASAAKLGEPKAAYNLALLY 192
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
+ G +D++ L +AA G +A L ++G GV ++ R L A+
Sbjct: 193 LD---GQTLPQDVKRSAELLRQAADAGLPEAQYALATFYKEGTGVPKDAERSVRLLQAAS 249
>gi|296420192|ref|XP_002839659.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635853|emb|CAZ83850.1| unnamed protein product [Tuber melanosporum]
Length = 601
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 10/89 (11%)
Query: 291 PLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSV 350
PL+ F AP+ ++ A +RGG +R C++ CG+ E +F +C
Sbjct: 523 PLVEKFTVRHHAPDM----QYWAGVDDSRGG-----IRQCAYYECGKGEEYTRQFAKCRR 573
Query: 351 CGAVNYCSRACQALDWKLRHKADCAPAER 379
C YCS+ CQ W + H+ C A++
Sbjct: 574 CRRTKYCSKECQKSAW-VYHRHWCVAAQQ 601
>gi|27376629|ref|NP_768158.1| hypothetical protein bll1518 [Bradyrhizobium japonicum USDA 110]
gi|27349770|dbj|BAC46783.1| bll1518 [Bradyrhizobium japonicum USDA 110]
Length = 352
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 56/120 (46%), Gaps = 6/120 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K +DAG+ EA + L M+R ++G LMA AA +A Y+LA++
Sbjct: 129 ALEWYKRASDAGDREAMFALAMMRIAGRGGPVDKGEAVKLMASAAKLGEPKAAYNLALLY 188
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
+ G +D++ L +AA G +A L ++G GV ++ R L A+
Sbjct: 189 LD---GQTLPQDVKRSAELLRQAADAGLPEAQYALATFYKEGTGVPKDPERAVRLLQAAS 245
>gi|238583891|ref|XP_002390387.1| hypothetical protein MPER_10339 [Moniliophthora perniciosa FA553]
gi|215453737|gb|EEB91317.1| hypothetical protein MPER_10339 [Moniliophthora perniciosa FA553]
Length = 206
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 36/76 (47%), Gaps = 7/76 (9%)
Query: 300 VPAPEAHPASRFLAEWFAARG-GTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCS 358
+P P + PA+ A W A T G+R+C CG ET FR C C YCS
Sbjct: 1 MPTPYSQPAA---ANWNAPPSHSTKQRGVRICDQ--CGAVETPMARFRLCGGCMTTQYCS 55
Query: 359 RACQALDWKLRHKADC 374
+ CQ + W HK+ C
Sbjct: 56 QDCQKIHWP-GHKSIC 70
>gi|50304311|ref|XP_452105.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641237|emb|CAH02498.1| KLLA0B12892p [Kluyveromyces lactis]
Length = 625
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
G+R C++ CG+ E +F +C C YCSR CQ W+ H+ C +
Sbjct: 485 GVRQCANFSCGKWEDHPKQFPKCRRCKRTKYCSRLCQLKSWEF-HRYWCQES 535
>gi|393227468|gb|EJD35145.1| hypothetical protein AURDEDRAFT_117443 [Auricularia delicata
TFB-10046 SS5]
Length = 149
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 328 RLCSHVGCGRPETR---RHEFRRCSVCGAVNYCSRACQALDWK--LRHKADCAPAERWLD 382
R+C + CG E +H+F+RC C YCSR CQ DW + HK+ C R
Sbjct: 45 RMCGNHACGMTEASSAVKHKFQRCGRCMFTLYCSRNCQRQDWAEGVTHKSVCPLLSRIAA 104
Query: 383 EEGN 386
GN
Sbjct: 105 ATGN 108
>gi|348680691|gb|EGZ20507.1| hypothetical protein PHYSODRAFT_354314 [Phytophthora sojae]
Length = 169
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 30/54 (55%), Gaps = 2/54 (3%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G R+C++ C + + + C+ C V YC + CQ DWK RH A C PA++
Sbjct: 117 GKRICANEACEALDY--EKVQACTRCRCVYYCGKTCQKADWKSRHSALCRPAKK 168
>gi|258573871|ref|XP_002541117.1| predicted protein [Uncinocarpus reesii 1704]
gi|237901383|gb|EEP75784.1| predicted protein [Uncinocarpus reesii 1704]
Length = 236
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 10/70 (14%)
Query: 310 RFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLR 369
+ +E F GP +C V H+ + CS C + YCS CQA W +
Sbjct: 171 KIFSERFPTMAVPDGPCCEVCGFV---------HDLKWCSRCNMIQYCSVECQAHHWPM- 220
Query: 370 HKADCAPAER 379
HK DC AER
Sbjct: 221 HKKDCRNAER 230
>gi|367477446|ref|ZP_09476797.1| putative Beta-lactamase [Bradyrhizobium sp. ORS 285]
gi|365270200|emb|CCD89265.1| putative Beta-lactamase [Bradyrhizobium sp. ORS 285]
Length = 362
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 54/114 (47%), Gaps = 12/114 (10%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRG------SGASLMAKAAISSHAQALYS 173
E A + K A+AG+ EA + L M+R L RG L+A AA +A Y+
Sbjct: 136 EKAAIWYKRAAEAGDREAMFALAMMR---LAGRGGPVDKAEAVKLLASAAKLGEPKAAYN 192
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
LA++ + G +DLR L AA G+ +A L ++G GV +++
Sbjct: 193 LALLYLD---GQTFPQDLRRSAELLRVAADAGNPEAQYALATFYKEGTGVPKDL 243
>gi|410688508|ref|YP_006961777.1| truncated hypothetical protein, Sel1 domain protein [Psychrobacter
sp. DAB_AL60]
gi|380861024|gb|AFF18226.1| truncated hypothetical protein, Sel1 domain protein [Psychrobacter
sp. DAB_AL60]
Length = 346
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KAA H+ A Y LAV+ G G ++ D+ + +AA GH DA LG + G
Sbjct: 104 KAAEQGHSDAQYHLAVMYKKGQGIAQ---DMTKAIEWYTKAAEQGHSDAQYNLGDMYEKG 160
Query: 221 YGVRQNIAEGRRFLVQA 237
GV Q+I + ++A
Sbjct: 161 QGVPQDITKALELYLEA 177
>gi|307105804|gb|EFN54052.1| hypothetical protein CHLNCDRAFT_136131 [Chlorella variabilis]
Length = 356
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 26/47 (55%), Gaps = 5/47 (10%)
Query: 328 RLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
R+C+ GCG H R+CS C +V YCS C W+ HKA+C
Sbjct: 302 RVCAAEGCGN----THSLRKCSRCRSVRYCSETCSHAHWRA-HKAEC 343
>gi|167043546|gb|ABZ08242.1| putative TPR repeat region [uncultured marine microorganism
HF4000_APKG2J17]
Length = 369
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 49/110 (44%), Gaps = 6/110 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
++A ++ KL A G A Y LG++ QN + AA A A ++L +
Sbjct: 74 KTAMKWYKLAAKQGLANAQYNLGLMYRKGQGVPQNDKTAVKWFRLAAEQGFALAQFNLGL 133
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+ G G +NDK + L A GH DA LG ++G GV QN
Sbjct: 134 MYGKGQGVPQNDKTAVKWITLAAEQ---GHADAQNSLGLMYENGDGVPQN 180
>gi|445412297|ref|ZP_21433166.1| Sel1 repeat protein [Acinetobacter sp. WC-743]
gi|444767359|gb|ELW91608.1| Sel1 repeat protein [Acinetobacter sp. WC-743]
Length = 282
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 60/116 (51%), Gaps = 7/116 (6%)
Query: 126 LKLCADAGNVEACYTLGMIR---FYCLQ-NRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+K A+ G+ EA Y +G++ + L+ N +A +A+A Y+LA++ +
Sbjct: 46 IKSKAEKGDPEAQYNMGVLFTDGYKDLKPNIIEAFKWYELSADKGYAKAKYNLAIL-LSS 104
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G KND L AA LG + +L ELG+ +DG GV++N A+ + ++A
Sbjct: 105 DSGIKND--YAQAKKLFEDAAALGDVPSLNELGNFYKDGIGVQENYAQASEYYLKA 158
>gi|320169426|gb|EFW46325.1| UPF0682 protein [Capsaspora owczarzaki ATCC 30864]
Length = 1116
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
L C H C + E +F CS C V YC CQ LDWK H+ +C
Sbjct: 1040 LYECHHSPCCKLEPDLRQFMVCSRCRHVRYCGERCQQLDWK-HHRKEC 1086
>gi|302683498|ref|XP_003031430.1| hypothetical protein SCHCODRAFT_109938 [Schizophyllum commune H4-8]
gi|300105122|gb|EFI96527.1| hypothetical protein SCHCODRAFT_109938, partial [Schizophyllum
commune H4-8]
Length = 547
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 3/46 (6%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
C ++ C P R + + C +C YCS+ CQ DW+LRH+ CA
Sbjct: 417 CFNLEC--PLEREAQVKTC-ICKEAYYCSKVCQQADWRLRHRMTCA 459
>gi|397647732|gb|EJK77829.1| hypothetical protein THAOC_00312, partial [Thalassiosira oceanica]
Length = 618
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 68/252 (26%), Positives = 93/252 (36%), Gaps = 41/252 (16%)
Query: 137 ACYTLGMIRFYCL-----QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKD- 190
A Y L I Y + ++ L+ K+A HA A L+ NG+ G + D D
Sbjct: 385 ALYELSKIYRYGIASELEKSEEKANELLLKSANLGHALANSVLSNFYINGTNGFEADPDE 444
Query: 191 --LRAGVALCARAAFLGHID--ALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVL 246
RA VA L + D A R LG + Q +AE +L A
Sbjct: 445 FYFRASVAFA-----LDNTDKNAARLLGSL---HFKKHQILAEPSSYLACYYTNIWAKDE 496
Query: 247 SSAACQGISTRSWLTWNPLSQPHPAPHPRNQNPNHNGHVTGIGCPLLS-DFGCNVPAPEA 305
S+ A +S L L+ + RN N C +S D G N
Sbjct: 497 STGAAYYFYGQSLLR---LANHLHGGYARN---GSNAVPKVSFCLRISRDLGFN------ 544
Query: 306 HPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALD 365
A L EW G LC + C + ++++CS C A YCS+ CQ
Sbjct: 545 -DARELLKEW-------ETIGQNLCDN--CSKEAKSGEKYKQCSKCKAQWYCSKECQVEA 594
Query: 366 WKLRHKADCAPA 377
W+ HK DC A
Sbjct: 595 WRAGHKTDCKRA 606
>gi|397576522|gb|EJK50308.1| hypothetical protein THAOC_30745, partial [Thalassiosira oceanica]
Length = 456
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 29/52 (55%), Gaps = 2/52 (3%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
G LC++ CG+ +F++CS C A YCS+ CQ W+ HK +C A
Sbjct: 395 GQSLCAN--CGKEVQAGEKFKQCSKCKAQWYCSKECQVEAWRAGHKKNCKSA 444
>gi|403413353|emb|CCM00053.1| predicted protein [Fibroporia radiculosa]
Length = 573
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 40/83 (48%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A H + Y LA NG G K+ +D L AA GH DA G C
Sbjct: 252 SLLKRLADRGHMPSQYFLADCYVNGIGTHKSRQDFDRAFPLFVLAAKHGHPDAAYRAGTC 311
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 312 CENGWGCRRESAKAIQFYRKAAA 334
>gi|15238468|ref|NP_201348.1| ubiquitin carboxyl-terminal hydrolase 17 [Arabidopsis thaliana]
gi|75262625|sp|Q9FKP5.1|UBP17_ARATH RecName: Full=Ubiquitin carboxyl-terminal hydrolase 17; AltName:
Full=Deubiquitinating enzyme 17; Short=AtUBP17; AltName:
Full=Ubiquitin thioesterase 17; AltName:
Full=Ubiquitin-specific-processing protease 17
gi|9759625|dbj|BAB11567.1| unnamed protein product [Arabidopsis thaliana]
gi|332010675|gb|AED98058.1| ubiquitin carboxyl-terminal hydrolase 17 [Arabidopsis thaliana]
Length = 731
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEG 398
RCS C +V YCS CQ L W+ HK +C + DEE V D + E
Sbjct: 67 RCSQCKSVRYCSSKCQILHWRRGHKEECRSPD--YDEEKEEYVQSDYDAKES 116
>gi|433006100|ref|ZP_20194527.1| hypothetical protein A17S_03681 [Escherichia coli KTE227]
gi|433154718|ref|ZP_20339655.1| hypothetical protein WKS_02648 [Escherichia coli KTE176]
gi|431513158|gb|ELH91243.1| hypothetical protein A17S_03681 [Escherichia coli KTE227]
gi|431672978|gb|ELJ39210.1| hypothetical protein WKS_02648 [Escherichia coli KTE176]
Length = 375
Score = 42.0 bits (97), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 55/124 (44%), Gaps = 6/124 (4%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
+ A + + A+ A + LGM+ + QN KAA + A Y+L
Sbjct: 207 QQAKGWYEKAAEQNFANAQFNLGMLYYKGEGVSQNFQQAREWFEKAASQNQLNAQYNLGQ 266
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
I + G G +++ R +AA GH+DA LG ++G GV Q+ + R + +
Sbjct: 267 IYYYGQGVTQS---YRKAKEWFEKAAGEGHVDAQYNLGVIYENGEGVGQDFHQARAWYEK 323
Query: 237 ANAR 240
A AR
Sbjct: 324 AAAR 327
>gi|452981426|gb|EME81186.1| hypothetical protein MYCFIDRAFT_204223 [Pseudocercospora fijiensis
CIRAD86]
Length = 349
Score = 42.0 bits (97), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 18/29 (62%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADC 374
+RCS CG YC+R CQ DWK K DC
Sbjct: 152 KRCSACGKAYYCNRDCQKADWKRHKKPDC 180
>gi|432903884|ref|ZP_20113155.1| hypothetical protein A13Y_01518 [Escherichia coli KTE194]
gi|433038951|ref|ZP_20226554.1| hypothetical protein WIE_02303 [Escherichia coli KTE113]
gi|431434318|gb|ELH15968.1| hypothetical protein A13Y_01518 [Escherichia coli KTE194]
gi|431551858|gb|ELI25824.1| hypothetical protein WIE_02303 [Escherichia coli KTE113]
Length = 378
Score = 42.0 bits (97), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 13/127 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 208 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGEGVKQNFRQAREWFEKAASQNQPNA 260
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GV QN +
Sbjct: 261 QYNLGQIYYYGQGVTQS---YRQAKDWFEKAAEKGHVDAQYNLGVIYENGEGVSQNYQQA 317
Query: 231 RRFLVQA 237
+ + +A
Sbjct: 318 KAWYEKA 324
>gi|393227838|gb|EJD35501.1| hypothetical protein AURDEDRAFT_175431 [Auricularia delicata
TFB-10046 SS5]
Length = 508
Score = 42.0 bits (97), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 29/54 (53%), Gaps = 6/54 (11%)
Query: 328 RLCSHVGCGRP---ETRRHEFRRCSVCGAVNYCSRACQALDWK---LRHKADCA 375
R C GCG+ +T F RCS C AV YCSR CQ DW+ + HK C+
Sbjct: 374 RGCHGPGCGKTVHDKTTPGAFPRCSTCRAVQYCSRECQNTDWRRGAVPHKKVCS 427
>gi|392568012|gb|EIW61186.1| hypothetical protein TRAVEDRAFT_17980 [Trametes versicolor
FP-101664 SS1]
Length = 425
Score = 42.0 bits (97), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 28/51 (54%), Gaps = 3/51 (5%)
Query: 327 LRLCSHVGCGRPE-TRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
+R+C+ CGR E T R FRRC C + YCS CQ DWK + D P
Sbjct: 139 IRVCAR--CGRWEATSRPRFRRCGRCKSNYYCSEQCQKDDWKPGYHRDECP 187
>gi|407043395|gb|EKE41930.1| protein kinase domain containing protein [Entamoeba nuttalli P19]
Length = 1079
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 46/104 (44%), Gaps = 17/104 (16%)
Query: 133 GNVEACYTLGMIRFYCL----------QNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
GNVEA Y L YCL + + + L+ KAA + + L FNG+
Sbjct: 695 GNVEALYNLS----YCLLHGEGCEGTEKEKKTAIKLLLKAAKKGYVPSQLLLGNCYFNGN 750
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
G K++K + V +AA + A+ LG C G GV +N
Sbjct: 751 GTLKDEK---SAVYWYTKAALQNNSTAINNLGSCFYKGNGVEKN 791
>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
Length = 464
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 339 ETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
E R +RC+ C A+ YCS ACQ DWK HK +CA +R
Sbjct: 75 EQLRRPLQRCAGCKALRYCSAACQKADWK-DHKPECAALKR 114
>gi|345430184|ref|YP_004823304.1| hypothetical protein PARA_16180 [Haemophilus parainfluenzae T3T1]
gi|301156247|emb|CBW15718.1| unnamed protein product [Haemophilus parainfluenzae T3T1]
Length = 418
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 55/114 (48%), Gaps = 7/114 (6%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
SE + +L A+ N++A L +Y Q+ KAA + +QA+ L+ +
Sbjct: 233 SEESLYWLNKAAENNNLQAIMYLA--SYYHNQDINKAIYYYQKAAKLNDSQAMLELSYLY 290
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN--IAEG 230
NG G K+DK V L A L +++A+ EL +G GV +N IAEG
Sbjct: 291 ENGEGVEKDDK---KAVELLEEAFRLQNLEAMNELSIRYLEGRGVERNYEIAEG 341
>gi|340520522|gb|EGR50758.1| predicted protein [Trichoderma reesei QM6a]
Length = 727
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 56/126 (44%), Gaps = 16/126 (12%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSG 183
++L+ AD + E GM++ N + A+ A+S +Y L V NG
Sbjct: 572 QWLRKAADMASGEVAEDEGMVKEGKHINIAENKTHKAQFALS-----IYELGVSHMNG-W 625
Query: 184 GSKNDKDLRAGVALCAR----AAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANA 239
G + DK AL R A G +DAL E G C G G ++N+ + +F A A
Sbjct: 626 GIEQDK------ALALRCFEIAGTWGDVDALAEAGFCYAQGIGCKKNLKKSAKFYRMAEA 679
Query: 240 RELAAV 245
+ ++ V
Sbjct: 680 KGMSMV 685
>gi|307203234|gb|EFN82389.1| Egl nine-like protein 1 [Harpegnathos saltator]
Length = 473
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 319 RGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
R TPG G++ + V C E R + RCS C AV YC++ Q DWK RH+ CA
Sbjct: 8 RAQTPGDGVQTGATVTCVVCE-RTDKLLRCSRCKAVFYCTKEHQRSDWK-RHRDFCA 62
>gi|229847119|ref|ZP_04467224.1| Sel1 domain protein repeat-containing protein [Haemophilus
influenzae 7P49H1]
gi|229809948|gb|EEP45669.1| Sel1 domain protein repeat-containing protein [Haemophilus
influenzae 7P49H1]
Length = 384
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 6/120 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ + A+ G +A + LG + + Q+ KAA HA+A Y+L +
Sbjct: 212 EAVKWYRKAAEQGYADAQFNLGNMYYNGHGVKQDDFEAVKWYRKAAEQGHAKAQYNLGNM 271
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
NG G + D V +AA G+ DA LG G+GVRQ+ E ++ +A
Sbjct: 272 YANGRGVKQ---DYFEAVKWYRKAAEQGYADAQANLGSAYSAGHGVRQDYIEAVKWFKKA 328
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 69/148 (46%), Gaps = 12/148 (8%)
Query: 95 GLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY-----CL 149
G+A L L K +K ++ A ++ + A+ GN +A + LG++ Y
Sbjct: 80 GVADAQLNLGNMYAKGLGVKQDD--VEAVKWYRQAAEQGNAKAQFNLGLM--YDNGRGVK 135
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
Q+ KAA +A A ++L + + G G ++D + V +AA G+ DA
Sbjct: 136 QDYFEAVKWFRKAAEQGYADAQFNLGNMYYKGHGVKQDDFE---AVKWYRKAAEQGYADA 192
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQA 237
LG+ +G+GV+Q+ E ++ +A
Sbjct: 193 QFNLGNMYYNGHGVKQDDFEAVKWYRKA 220
>gi|303289683|ref|XP_003064129.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454445|gb|EEH51751.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 529
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 11/138 (7%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAV 176
E A R+ +L A G+V A + + + +N + A KAA+ + ++
Sbjct: 213 EGARRWWELAAAQGDVIAMSNIAQLYANGVGVEKNISTAAEWFLKAAMKGNHES------ 266
Query: 177 IQFN-GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
QFN G +KN K G R+A G++ A+ ELG G GV QN+++ R
Sbjct: 267 -QFNYGYHLAKNLKQYAEGALWYERSAAQGNVRAMNELGTLYFRGDGVEQNVSKAREMWE 325
Query: 236 QANARELAAVLSSAACQG 253
+A A + AA G
Sbjct: 326 KAAANISDEMWEKAAANG 343
>gi|353242384|emb|CCA74033.1| hypothetical protein PIIN_07987 [Piriformospora indica DSM 11827]
Length = 513
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 26/63 (41%), Gaps = 5/63 (7%)
Query: 316 FAARGGTPGPGLRLCSHVGCGR----PETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK 371
+ G P C V C R E +RCS C YCS CQ DWK RHK
Sbjct: 446 YRPSGAKPSVTRDGCWEVSCQRHGAIDEQTEQVLQRCSGCKTARYCSTGCQTKDWK-RHK 504
Query: 372 ADC 374
+C
Sbjct: 505 VEC 507
>gi|334182643|ref|NP_001185019.1| ubiquitin carboxyl-terminal hydrolase 15 [Arabidopsis thaliana]
gi|332191422|gb|AEE29543.1| ubiquitin carboxyl-terminal hydrolase 15 [Arabidopsis thaliana]
Length = 928
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP 376
RCS C +V YCS CQ + W++ HK +C P
Sbjct: 140 RCSRCKSVRYCSGKCQIIHWRVAHKDECVP 169
>gi|328872421|gb|EGG20788.1| hypothetical protein DFA_00653 [Dictyostelium fasciculatum]
Length = 508
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 22/39 (56%)
Query: 343 HEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
H +C C VNYCSR Q++DW+L H+ C A L
Sbjct: 249 HATNKCGKCKQVNYCSRDHQSIDWELHHREHCGKATTLL 287
>gi|195589318|ref|XP_002084399.1| GD14254 [Drosophila simulans]
gi|194196408|gb|EDX09984.1| GD14254 [Drosophila simulans]
Length = 407
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 34/92 (36%), Gaps = 20/92 (21%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFL------AEWFAARGGTPGPGLRLCSHVGCGRPETRRH 343
C + A +A PA L W G CS G +P+
Sbjct: 294 CTIFRQIATQFGAKDAPPALTVLRNAINGMRWITDEGN-------YCSTCGAEKPD---- 342
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
++CS C AV YC R CQ L W HK CA
Sbjct: 343 --KKCSKCKAVQYCDRECQRLHW-FMHKKSCA 371
>gi|302673602|ref|XP_003026487.1| hypothetical protein SCHCODRAFT_238489 [Schizophyllum commune H4-8]
gi|300100170|gb|EFI91584.1| hypothetical protein SCHCODRAFT_238489 [Schizophyllum commune H4-8]
Length = 622
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 326 GLRLCSHVGC-GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
L+ C C G E RH CGAV YCSR CQ DW+ RH+ C
Sbjct: 443 ALQHCHRAECPGSTEPIRH----ICACGAVYYCSRICQKADWRARHRITCT 489
>gi|11993475|gb|AAG42756.1|AF302665_1 ubiquitin-specific protease 15 [Arabidopsis thaliana]
Length = 924
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP 376
RCS C +V YCS CQ + W++ HK +C P
Sbjct: 140 RCSRCKSVRYCSGKCQIIHWRVAHKDECVP 169
>gi|262368638|ref|ZP_06061967.1| conserved hypothetical protein [Acinetobacter johnsonii SH046]
gi|262316316|gb|EEY97354.1| conserved hypothetical protein [Acinetobacter johnsonii SH046]
Length = 187
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 77/185 (41%), Gaps = 24/185 (12%)
Query: 69 ILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKL 128
++C + + + P++ VN+ T L L + ++ + A W+ LK
Sbjct: 13 VIC-MPKSPKIPAELVNLATTGNSDAQFELAELYMQSENEDDITL-AEEWA------LKA 64
Query: 129 CADAGNVEACYTLGM-IRFYCLQNR-----------GSGASLMAKAAISSHAQALYSLAV 176
A G+VEA Y LG FY + R G + +AA H A+ LA
Sbjct: 65 AA-LGHVEAMYWLGEGYAFYAKELREEDPSEAETYFGHAHHWLKQAAELKHPSAILELAG 123
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
G +KD+ + L +AA LG A+R+L ++ GV + + + + +
Sbjct: 124 FY---RRGDVVEKDVAKSIELVQQAAELGEAQAMRDLAFIYENALGVDADEVKAKYWHDK 180
Query: 237 ANARE 241
A+A E
Sbjct: 181 ADAAE 185
>gi|195326597|ref|XP_002030012.1| GM25221 [Drosophila sechellia]
gi|194118955|gb|EDW40998.1| GM25221 [Drosophila sechellia]
Length = 407
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 34/92 (36%), Gaps = 20/92 (21%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFL------AEWFAARGGTPGPGLRLCSHVGCGRPETRRH 343
C + A +A PA L W G CS G +P+
Sbjct: 294 CTIFRQIATQFGAKDAPPALTVLRNAINGMRWITDEGN-------YCSTCGAEKPD---- 342
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
++CS C AV YC R CQ L W HK CA
Sbjct: 343 --KKCSKCKAVQYCDRECQRLHW-FMHKKSCA 371
>gi|217971401|ref|YP_002356152.1| Sel1 domain-containing protein repeat-containing protein
[Shewanella baltica OS223]
gi|217496536|gb|ACK44729.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
OS223]
Length = 491
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
S+ A ++L+ A+ EA LG++ +Y +N ++KAA + + A Y LA++
Sbjct: 291 SKLAQQYLQRAAEGEQTEAMANLGVL-YYQQKNLTQAYHFISKAAQAGYPHAQYHLALML 349
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFLVQ 236
NG G + +D+ A A AA G +DA L H L D ++ + +L Q
Sbjct: 350 ANGDGCT---RDMIASEYWMAEAAEQGQLDAMLTRAQHMLNDDNAFGSDLTQAEDYLRQ 405
>gi|18394440|ref|NP_564014.1| ubiquitin carboxyl-terminal hydrolase 15 [Arabidopsis thaliana]
gi|166201354|sp|Q9FPS9.2|UBP15_ARATH RecName: Full=Ubiquitin carboxyl-terminal hydrolase 15; AltName:
Full=Deubiquitinating enzyme 15; Short=AtUBP15; AltName:
Full=Ubiquitin thioesterase 15; AltName:
Full=Ubiquitin-specific-processing protease 15
gi|332191421|gb|AEE29542.1| ubiquitin carboxyl-terminal hydrolase 15 [Arabidopsis thaliana]
Length = 924
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP 376
RCS C +V YCS CQ + W++ HK +C P
Sbjct: 140 RCSRCKSVRYCSGKCQIIHWRVAHKDECVP 169
>gi|373471213|ref|ZP_09562276.1| Sel1 repeat protein [Lachnospiraceae bacterium oral taxon 082 str.
F0431]
gi|371760834|gb|EHO49502.1| Sel1 repeat protein [Lachnospiraceae bacterium oral taxon 082 str.
F0431]
Length = 609
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 67/144 (46%), Gaps = 14/144 (9%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGS-----GASLMAKAAISSHAQALYSL 174
E A +L+ A+ G+ ++ TLG +FY L N G+ + KAA + +A Y L
Sbjct: 330 ELARLWLEKAANTGDAKSQTTLG--KFY-LSNFGNNDEKKAFEMFEKAAEQGYTEAEYLL 386
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
N + +KD+ + +AA GHI ++ +G YG+ +N +G R+L
Sbjct: 387 GTCYLNQN---LLEKDIELALLWFDKAAKKGHIKSMYNIGLIYMGNYGMIRNPEKGIRYL 443
Query: 235 VQA---NARELAAVLSSAACQGIS 255
+ A N+ + +L+ GI
Sbjct: 444 IMAADENSEDACFMLTQLYMNGIE 467
>gi|421695866|ref|ZP_16135463.1| Sel1 repeat protein [Acinetobacter baumannii WC-692]
gi|404564102|gb|EKA69293.1| Sel1 repeat protein [Acinetobacter baumannii WC-692]
Length = 159
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 55/116 (47%), Gaps = 7/116 (6%)
Query: 126 LKLCADAGNVEACYTLGMIRF---YCLQ-NRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
LK A+ G++ A +G + Y L + + KAA S A++ Y+LA++ NG
Sbjct: 30 LKSAAEQGDIRAQSQMGDVYLFGEYDLNVDYKQAINWYQKAADQSDAKSQYNLAIMYLNG 89
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G K DL V ++A G D+ +LG +G GV +NI + + +A
Sbjct: 90 YGVKK---DLSKSVEYYRKSALQGDTDSQLQLGIRYLNGEGVERNIETAKEWFKKA 142
>gi|378821902|ref|ZP_09844749.1| Sel1 repeat protein [Sutterella parvirubra YIT 11816]
gi|378599253|gb|EHY32294.1| Sel1 repeat protein [Sutterella parvirubra YIT 11816]
Length = 249
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 33/63 (52%), Gaps = 2/63 (3%)
Query: 168 AQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
A+ LY L + F G G +K + G+ RAA LGH+ + ELG ++G GVR
Sbjct: 56 AECLYQLGRLYFYGLGSTKPQR--LTGIGWWERAALLGHVGSQLELGKVFREGLGVRPEA 113
Query: 228 AEG 230
+G
Sbjct: 114 RQG 116
>gi|260950971|ref|XP_002619782.1| hypothetical protein CLUG_00941 [Clavispora lusitaniae ATCC 42720]
gi|238847354|gb|EEQ36818.1| hypothetical protein CLUG_00941 [Clavispora lusitaniae ATCC 42720]
Length = 563
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 53/123 (43%), Gaps = 11/123 (8%)
Query: 125 FLKLCADAGNVEACYTLGMIRFY-------CLQNRGSGASLMAKA---AISSHAQALYSL 174
FL++ A + A Y LGM FY + + +G + +A A A A Y L
Sbjct: 323 FLRMAASKNHPAAMYKLGMYSFYARMGLPDTVNTKKAGIKWLERASNVATELTAAAPYEL 382
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
+ NG KD + + L A+AA LGH+ + LG C + G V Q+ +
Sbjct: 383 GKLYQNGFMDILL-KDEKYALELYAQAAALGHVQSSAILGKCYEIGEVVPQDANLSIHYY 441
Query: 235 VQA 237
QA
Sbjct: 442 TQA 444
>gi|26247143|ref|NP_753183.1| hypothetical protein c1269 [Escherichia coli CFT073]
gi|227888009|ref|ZP_04005814.1| Sel1 family repeat-containing serine/threonine protein kinase
[Escherichia coli 83972]
gi|300982506|ref|ZP_07176160.1| tetratricopeptide repeat protein [Escherichia coli MS 45-1]
gi|386628648|ref|YP_006148368.1| hypothetical protein i02_1162 [Escherichia coli str. 'clone D i2']
gi|386633568|ref|YP_006153287.1| hypothetical protein i14_1162 [Escherichia coli str. 'clone D i14']
gi|419915803|ref|ZP_14434149.1| hypothetical protein ECKD1_21452 [Escherichia coli KD1]
gi|432380724|ref|ZP_19623674.1| hypothetical protein WCU_00866 [Escherichia coli KTE15]
gi|432411103|ref|ZP_19653782.1| hypothetical protein WG9_01588 [Escherichia coli KTE39]
gi|432431147|ref|ZP_19673588.1| hypothetical protein A13K_01437 [Escherichia coli KTE187]
gi|432435678|ref|ZP_19678073.1| hypothetical protein A13M_01384 [Escherichia coli KTE188]
gi|432440478|ref|ZP_19682827.1| hypothetical protein A13O_01303 [Escherichia coli KTE189]
gi|432445592|ref|ZP_19687896.1| hypothetical protein A13S_01628 [Escherichia coli KTE191]
gi|432455974|ref|ZP_19698170.1| hypothetical protein A15C_01763 [Escherichia coli KTE201]
gi|432494902|ref|ZP_19736718.1| hypothetical protein A173_02070 [Escherichia coli KTE214]
gi|432503743|ref|ZP_19745477.1| hypothetical protein A17E_00797 [Escherichia coli KTE220]
gi|432526741|ref|ZP_19763843.1| hypothetical protein A17Y_04881 [Escherichia coli KTE230]
gi|432567826|ref|ZP_19804349.1| hypothetical protein A1SE_01403 [Escherichia coli KTE53]
gi|432592153|ref|ZP_19828480.1| hypothetical protein A1SS_01577 [Escherichia coli KTE60]
gi|432606864|ref|ZP_19843055.1| hypothetical protein A1U7_01861 [Escherichia coli KTE67]
gi|432610779|ref|ZP_19846947.1| hypothetical protein A1UG_01133 [Escherichia coli KTE72]
gi|432645537|ref|ZP_19881335.1| hypothetical protein A1W5_01286 [Escherichia coli KTE86]
gi|432654635|ref|ZP_19890352.1| hypothetical protein A1WE_00742 [Escherichia coli KTE93]
gi|432843551|ref|ZP_20076734.1| hypothetical protein A1YS_01468 [Escherichia coli KTE141]
gi|432936923|ref|ZP_20135615.1| hypothetical protein A13C_00030 [Escherichia coli KTE183]
gi|432999249|ref|ZP_20187786.1| hypothetical protein A17K_01587 [Escherichia coli KTE223]
gi|433014192|ref|ZP_20202549.1| hypothetical protein WI5_02023 [Escherichia coli KTE104]
gi|433026251|ref|ZP_20214206.1| hypothetical protein WI9_04421 [Escherichia coli KTE106]
gi|433058438|ref|ZP_20245495.1| hypothetical protein WIM_02214 [Escherichia coli KTE124]
gi|433127978|ref|ZP_20313504.1| hypothetical protein WKE_04481 [Escherichia coli KTE160]
gi|433139626|ref|ZP_20324894.1| hypothetical protein WKM_01913 [Escherichia coli KTE167]
gi|433148496|ref|ZP_20333550.1| hypothetical protein WKQ_01160 [Escherichia coli KTE174]
gi|433191172|ref|ZP_20375242.1| hypothetical protein WGS_04268 [Escherichia coli KTE88]
gi|433201237|ref|ZP_20385085.1| hypothetical protein WGW_04769 [Escherichia coli KTE94]
gi|433210544|ref|ZP_20394194.1| hypothetical protein WI1_04331 [Escherichia coli KTE97]
gi|433212868|ref|ZP_20396468.1| hypothetical protein WI3_02054 [Escherichia coli KTE99]
gi|433327623|ref|ZP_20403903.1| hypothetical protein B185_024045 [Escherichia coli J96]
gi|26107544|gb|AAN79743.1|AE016759_17 Hypothetical protein c1269 [Escherichia coli CFT073]
gi|47600612|emb|CAE55733.1| hypothetical protein [Escherichia coli Nissle 1917]
gi|227835005|gb|EEJ45471.1| Sel1 family repeat-containing serine/threonine protein kinase
[Escherichia coli 83972]
gi|300408738|gb|EFJ92276.1| tetratricopeptide repeat protein [Escherichia coli MS 45-1]
gi|355419547|gb|AER83744.1| hypothetical protein i02_1162 [Escherichia coli str. 'clone D i2']
gi|355424467|gb|AER88663.1| hypothetical protein i14_1162 [Escherichia coli str. 'clone D i14']
gi|388383034|gb|EIL44845.1| hypothetical protein ECKD1_21452 [Escherichia coli KD1]
gi|430910211|gb|ELC31565.1| hypothetical protein WCU_00866 [Escherichia coli KTE15]
gi|430937025|gb|ELC57288.1| hypothetical protein WG9_01588 [Escherichia coli KTE39]
gi|430954942|gb|ELC73735.1| hypothetical protein A13K_01437 [Escherichia coli KTE187]
gi|430965352|gb|ELC82777.1| hypothetical protein A13M_01384 [Escherichia coli KTE188]
gi|430968543|gb|ELC85769.1| hypothetical protein A13O_01303 [Escherichia coli KTE189]
gi|430974828|gb|ELC91742.1| hypothetical protein A13S_01628 [Escherichia coli KTE191]
gi|430984203|gb|ELD00844.1| hypothetical protein A15C_01763 [Escherichia coli KTE201]
gi|431027507|gb|ELD40570.1| hypothetical protein A173_02070 [Escherichia coli KTE214]
gi|431041058|gb|ELD51590.1| hypothetical protein A17E_00797 [Escherichia coli KTE220]
gi|431046255|gb|ELD56373.1| hypothetical protein A17Y_04881 [Escherichia coli KTE230]
gi|431102351|gb|ELE07175.1| hypothetical protein A1SE_01403 [Escherichia coli KTE53]
gi|431132069|gb|ELE34085.1| hypothetical protein A1SS_01577 [Escherichia coli KTE60]
gi|431139210|gb|ELE41006.1| hypothetical protein A1U7_01861 [Escherichia coli KTE67]
gi|431150324|gb|ELE51379.1| hypothetical protein A1UG_01133 [Escherichia coli KTE72]
gi|431182255|gb|ELE82076.1| hypothetical protein A1W5_01286 [Escherichia coli KTE86]
gi|431195176|gb|ELE94384.1| hypothetical protein A1WE_00742 [Escherichia coli KTE93]
gi|431396431|gb|ELG79909.1| hypothetical protein A1YS_01468 [Escherichia coli KTE141]
gi|431465878|gb|ELH45958.1| hypothetical protein A13C_00030 [Escherichia coli KTE183]
gi|431513018|gb|ELH91105.1| hypothetical protein A17K_01587 [Escherichia coli KTE223]
gi|431528041|gb|ELI04753.1| hypothetical protein WI9_04421 [Escherichia coli KTE106]
gi|431531485|gb|ELI08148.1| hypothetical protein WI5_02023 [Escherichia coli KTE104]
gi|431570354|gb|ELI43269.1| hypothetical protein WIM_02214 [Escherichia coli KTE124]
gi|431638271|gb|ELJ06311.1| hypothetical protein WKE_04481 [Escherichia coli KTE160]
gi|431660843|gb|ELJ27702.1| hypothetical protein WKM_01913 [Escherichia coli KTE167]
gi|431674795|gb|ELJ40946.1| hypothetical protein WKQ_01160 [Escherichia coli KTE174]
gi|431699777|gb|ELJ64772.1| hypothetical protein WGS_04268 [Escherichia coli KTE88]
gi|431713658|gb|ELJ77882.1| hypothetical protein WGW_04769 [Escherichia coli KTE94]
gi|431726866|gb|ELJ90633.1| hypothetical protein WI1_04331 [Escherichia coli KTE97]
gi|431734656|gb|ELJ98035.1| hypothetical protein WI3_02054 [Escherichia coli KTE99]
gi|432344820|gb|ELL39374.1| hypothetical protein B185_024045 [Escherichia coli J96]
Length = 378
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 13/127 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQA 170
+A +W E A A+ A + LGM+ + QN KAA + A
Sbjct: 208 QAKDWYEKA-------AEQNFANAQFNLGMLYYKGEGVKQNFRQAREWFEKAASQNQPNA 260
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L I + G G +++ R +AA GH+DA LG ++G GV QN +
Sbjct: 261 QYNLGQIYYYGQGVTQS---YRQAKDWFEKAAEKGHVDAQYNLGVIYENGEGVSQNYQQA 317
Query: 231 RRFLVQA 237
+ + +A
Sbjct: 318 KAWYEKA 324
>gi|5734755|gb|AAD50020.1|AC007651_15 Unknown protein [Arabidopsis thaliana]
Length = 891
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP 376
RCS C +V YCS CQ + W++ HK +C P
Sbjct: 140 RCSRCKSVRYCSGKCQIIHWRVAHKDECVP 169
>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
Length = 498
Score = 42.0 bits (97), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 348 CSVCGAVNYCSRACQALDWKLRHKADCA 375
CS C V YC R CQ+ +WKL HK +CA
Sbjct: 79 CSGCKVVKYCDRKCQSANWKLIHKHECA 106
>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
Length = 498
Score = 42.0 bits (97), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 348 CSVCGAVNYCSRACQALDWKLRHKADCA 375
CS C V YC R CQ+ +WKL HK +CA
Sbjct: 79 CSGCKVVKYCDRKCQSANWKLIHKHECA 106
>gi|146070845|ref|XP_001463112.1| MYND finger domain-like protein [Leishmania infantum JPCM5]
gi|398010281|ref|XP_003858338.1| MYND finger domain-like protein [Leishmania donovani]
gi|134067195|emb|CAM65461.1| MYND finger domain-like protein [Leishmania infantum JPCM5]
gi|322496545|emb|CBZ31614.1| MYND finger domain-like protein [Leishmania donovani]
Length = 169
Score = 42.0 bits (97), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 33/81 (40%), Gaps = 3/81 (3%)
Query: 298 CNVPAPEAHPASRFLAEWFAARG--GTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVN 355
CN E SR+L + A G P P H G T +RCS C
Sbjct: 89 CNSANCEVVAKSRYLMDMEDATTAQGMPSPNGCFHCHKGARGAATTSVPLQRCSRCKVAK 148
Query: 356 YCSRACQALDWKLRHKADCAP 376
YCS CQ DWK+ HK C P
Sbjct: 149 YCSVECQKADWKV-HKQVCTP 168
>gi|407043678|gb|EKE42083.1| actin binding protein, putative [Entamoeba nuttalli P19]
Length = 775
Score = 42.0 bits (97), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 48/108 (44%), Gaps = 9/108 (8%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
E A +K AG A G RF N+G G L+ A I +ALY+LA+
Sbjct: 643 EEAMTLIKEAVSAGYWVAESWYG--RFIYATNKGEGLRLIRNACIMEDKRALYALAIHLL 700
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
++D+ + + L ++DA+ LG C G GV++++
Sbjct: 701 -------KNQDVSHAIEILFHLRKLNYVDAIVTLGVCFMKGVGVKKDV 741
>gi|424878045|ref|ZP_18301685.1| hypothetical protein Rleg8DRAFT_5898 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392520537|gb|EIW45266.1| hypothetical protein Rleg8DRAFT_5898 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 859
Score = 42.0 bits (97), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 54/129 (41%), Gaps = 6/129 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQAL 171
AN +++A +K DAG+V A L + N ++ AA + AL
Sbjct: 555 ANRDTKTAFATIKKAMDAGHVRAIDQLSSLYIVGASVPANPAKANEIVQAAAKKNDPYAL 614
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+ + G G D G+ L ++A LGH A+ ELG+ +G V + G
Sbjct: 615 YTYGKSLYYGRGVKA---DTEQGLKLMLQSADLGHTFAMNELGYIFLNGVNVPADPERGI 671
Query: 232 RFLVQANAR 240
RF AR
Sbjct: 672 RFYEAGLAR 680
>gi|392569186|gb|EIW62360.1| hypothetical protein TRAVEDRAFT_35680 [Trametes versicolor
FP-101664 SS1]
Length = 619
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 27/51 (52%), Gaps = 2/51 (3%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKADCAPAER 379
C +V C R R +FRRCS C YCS+ CQA WK HK C +R
Sbjct: 396 CDNVKCHRIGPR-EQFRRCSACLNTFYCSKDCQATAWKEGDHKTMCKLKQR 445
>gi|225023636|ref|ZP_03712828.1| hypothetical protein EIKCOROL_00496 [Eikenella corrodens ATCC
23834]
gi|224943518|gb|EEG24727.1| hypothetical protein EIKCOROL_00496 [Eikenella corrodens ATCC
23834]
Length = 267
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 63/139 (45%), Gaps = 9/139 (6%)
Query: 103 LSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQ--NRGSGASLMA 160
L +A + + +K N A ++ + A G+ EA LG + + + R +
Sbjct: 49 LGRARQYGYGVKPN--PAKAEKYYRRAAKLGHAEAQEALGCLYEFAEKPDYRRARKWYAR 106
Query: 161 KAAISSHA--QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
AA S A A Y L + G GG K D++ + A GH DA R LG+C +
Sbjct: 107 NAAQRSSATPDAAYRLGYLYEKGLGGKK---DIQMACQFYRKDAKAGHPDAQRALGYCYE 163
Query: 219 DGYGVRQNIAEGRRFLVQA 237
G G+ +N A+ R++ +A
Sbjct: 164 KGLGLPENHAKARKWYARA 182
>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
Length = 498
Score = 42.0 bits (97), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 348 CSVCGAVNYCSRACQALDWKLRHKADCA 375
CS C V YC R CQ+ +WKL HK +CA
Sbjct: 79 CSGCKVVKYCDRKCQSANWKLIHKHECA 106
>gi|118602485|ref|YP_903700.1| Sel1 domain-containing protein [Candidatus Ruthia magnifica str. Cm
(Calyptogena magnifica)]
gi|118567424|gb|ABL02229.1| Sel1 domain protein repeat-containing protein [Candidatus Ruthia
magnifica str. Cm (Calyptogena magnifica)]
Length = 149
Score = 42.0 bits (97), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 55/120 (45%), Gaps = 6/120 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A++ L A GN EA + +GM++ L +N+ G +AA HA A + L V
Sbjct: 29 AYQLLAPLATKGNTEALWRVGMMQMNGLGMVENQPLGFENFLQAASQGHAFAHHMLGVAY 88
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
G G +KD+ + + A G + LG +D V+Q++ + + + +A+
Sbjct: 89 MTGEG---VEKDIVKSIEWFEKGAEFGIPGPMYALGMLFEDDKEVKQDLEKAKYWFDKAD 145
>gi|397640854|gb|EJK74353.1| hypothetical protein THAOC_03972, partial [Thalassiosira oceanica]
Length = 301
Score = 42.0 bits (97), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 30/49 (61%)
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD+R V L AA LG AL LG +GYGV+Q++A+G FL +A
Sbjct: 66 KDMRKAVELWTEAAELGSAKALCNLGVAYYNGYGVQQDMAKGVEFLSEA 114
>gi|383934141|ref|ZP_09987583.1| hypothetical protein RNAN_0643 [Rheinheimera nanhaiensis E407-8]
gi|383704597|dbj|GAB57674.1| hypothetical protein RNAN_0643 [Rheinheimera nanhaiensis E407-8]
Length = 717
Score = 42.0 bits (97), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 62/127 (48%), Gaps = 14/127 (11%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRG------SGASLMAKAAISS 166
++AN+ A FL+L D ++ ACY L + + L+ RG GA L+ +AA
Sbjct: 397 VRAND--SKAVEFLQLACDKQDLAACYHLAL---HILEGRGIKADAVRGAQLLQRAANEG 451
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+++A + L V+ G G + ++ A +AA GH +A G L +G G QN
Sbjct: 452 NSEAQFRLGVMLSQGQGVAVDET---AAFGWLLKAAEQGHAEAQYLTGLRLANGTGTAQN 508
Query: 227 IAEGRRF 233
AE ++
Sbjct: 509 DAEAVKW 515
>gi|397613212|gb|EJK62084.1| hypothetical protein THAOC_17317, partial [Thalassiosira oceanica]
Length = 490
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
CG+ +F++CS C A YCS+ CQ W+ HK DC A
Sbjct: 436 CGKEAQGGEKFKQCSKCKAQWYCSKECQVEAWRDGHKKDCKRA 478
>gi|298713925|emb|CBJ33785.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 462
Score = 42.0 bits (97), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
R + ++CS+C V YC CQA DW HK DC+
Sbjct: 366 RSQVKQCSLCNEVWYCGNECQASDWSKGHKNDCS 399
>gi|255713302|ref|XP_002552933.1| KLTH0D04862p [Lachancea thermotolerans]
gi|238934313|emb|CAR22495.1| KLTH0D04862p [Lachancea thermotolerans CBS 6340]
Length = 567
Score = 41.6 bits (96), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 29/65 (44%), Gaps = 3/65 (4%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
G+R C++ CG+ E F +C C YCSR CQ W H+ C G
Sbjct: 447 GVRQCANFACGKWEDYPKRFAKCRRCKRTKYCSRECQLQSWNF-HRYWCQEVTS--SSAG 503
Query: 386 NGDVD 390
NG V+
Sbjct: 504 NGSVN 508
>gi|397571974|gb|EJK48053.1| hypothetical protein THAOC_33186 [Thalassiosira oceanica]
Length = 472
Score = 41.6 bits (96), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 27/42 (64%), Gaps = 6/42 (14%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRH 370
LC++ CG P ++ ++C CG+V YCSR CQA WKL H
Sbjct: 320 LCAN--CGSP----NDLKQCGGCGSVYYCSRECQAAHWKLVH 355
>gi|395839041|ref|XP_003792411.1| PREDICTED: programmed cell death protein 2 [Otolemur garnettii]
Length = 343
Score = 41.6 bits (96), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C V+YC + QALDW+L HK C+
Sbjct: 130 GAHLCRVCGCLGPKT-------CSRCHKVHYCGKVHQALDWRLGHKQACS 172
>gi|282891327|ref|ZP_06299829.1| hypothetical protein pah_c050o115 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|281498824|gb|EFB41141.1| hypothetical protein pah_c050o115 [Parachlamydia acanthamoebae str.
Hall's coccus]
Length = 812
Score = 41.6 bits (96), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 62/143 (43%), Gaps = 16/143 (11%)
Query: 100 SLVLSK-ASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL 158
+L+L + KK F +KA +F K A+ N+EA Y L + ++ + +
Sbjct: 261 TLILGQFYQKKEFNLKA-------IKFFKQAAEKDNIEAQYRLAQ-SYLDEEDLKNASYW 312
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
++KAA S H A + L + KD +AA GH + ++G
Sbjct: 313 LSKAAQSGHILAQHELG-------SSYEEKKDFANAFQWFLKAAEQGHSKSQVKIGGMYY 365
Query: 219 DGYGVRQNIAEGRRFLVQANARE 241
+G+GV QN ++ ++ +E
Sbjct: 366 EGHGVEQNYPMAEKYFLEVTKKE 388
>gi|237746753|ref|ZP_04577233.1| TPR repeat-containing protein [Oxalobacter formigenes HOxBLS]
gi|229378104|gb|EEO28195.1| TPR repeat-containing protein [Oxalobacter formigenes HOxBLS]
Length = 230
Score = 41.6 bits (96), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 73/171 (42%), Gaps = 16/171 (9%)
Query: 78 RCPSDFV--NVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNV 135
R P F+ L+ C +G++L + + A + +S++ H F +L A GN
Sbjct: 4 RFPEKFLPCGKLLACVVFSGISLIAFA-DTYTDAEIAYRNGQYSKAFH-FFQLAAKEGNK 61
Query: 136 EACYTLGMIRFYCLQNRGSGASL------MAKAAISSHAQALYSLAVIQFNGSGGSKNDK 189
+A Y LG + LQ +G+ +L +AA + A Y L + + G G K
Sbjct: 62 DALYNLGHMH---LQGKGTQQNLTEARNCFLQAAQKADPGAQYQLGNLYYYGYGIP---K 115
Query: 190 DLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
D + +AA + AL LG ++G GV +N + QA R
Sbjct: 116 DCQTAYKWYMKAAGQHYTPALYALGTLHEEGCGVSKNPYKATMRFRQAAER 166
>gi|170092020|ref|XP_001877232.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164648725|gb|EDR12968.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 592
Score = 41.6 bits (96), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKADCA 375
C +V C + + ++ FRRC C YCS+ CQ + WK HK CA
Sbjct: 370 CDNVKCQKID-EKNNFRRCGACSTTLYCSKECQTVSWKEGGHKTMCA 415
>gi|374333040|ref|YP_005083224.1| Sel1 domain-containing protein repeat-containing protein
[Pseudovibrio sp. FO-BEG1]
gi|359345828|gb|AEV39202.1| Sel1 domain protein repeat-containing protein [Pseudovibrio sp.
FO-BEG1]
Length = 346
Score = 41.6 bits (96), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + +L ++ G+ +A LG++ ++ A + KAA ++ +ALY+LA++
Sbjct: 113 AASWYELASNDGDPQAALRLGLLYLAGAGTKADKDKAAQQLEKAAAANIPEALYNLALLH 172
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
G + K ++ +L RA+ G DA+ ELG L+DG
Sbjct: 173 QEGKVRPNDPKQIK---SLLERASETGDADAMLELGIYLKDG 211
>gi|345560153|gb|EGX43278.1| hypothetical protein AOL_s00215g14 [Arthrobotrys oligospora ATCC
24927]
Length = 1153
Score = 41.6 bits (96), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 21/38 (55%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKA 372
CG+ E + R+C+ C V YCS+ CQ DWK K
Sbjct: 1113 CGKEEDGDRKLRKCTACMNVEYCSKECQKKDWKAHKKT 1150
>gi|117922384|ref|YP_871576.1| Sel1 domain-containing protein [Shewanella sp. ANA-3]
gi|117614716|gb|ABK50170.1| Sel1 domain protein repeat-containing protein [Shewanella sp.
ANA-3]
Length = 485
Score = 41.6 bits (96), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 55/115 (47%), Gaps = 5/115 (4%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
A +L+L A+ EA LG++ +Y ++ ++KAA + + A Y LA++ N
Sbjct: 287 QAKHYLQLAAEGEQTEAMANLGVL-YYQTKDLTQAYHFISKAAQAGYPHAQYHLALMLAN 345
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFL 234
G G +D A AA G +DA L H L D G+ ++++ R+L
Sbjct: 346 GDGCK---RDPIVSEYWMAEAAEQGQLDAMLTRAQHMLNDESGLGGDVSQAERYL 397
>gi|78778186|ref|YP_394501.1| Sel1 repeat-containing protein [Sulfurimonas denitrificans DSM
1251]
gi|78498726|gb|ABB45266.1| Sel1-like repeat [Sulfurimonas denitrificans DSM 1251]
Length = 233
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 49/98 (50%), Gaps = 10/98 (10%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
+L+ A +G+ A Y LGM FY QN L++K++ ++ A Y+LA++
Sbjct: 72 EWLEKSAQSGDANAQYDLGM--FYLKGNNVEQNSKKAFELLSKSSAQNNINAQYNLALMY 129
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
+ G G D + V L +AA GH+ A + +GH
Sbjct: 130 YKGDGV---DLSVPKAVELLDKAATSGHVGAAQNVGHI 164
>gi|389746341|gb|EIM87521.1| hypothetical protein STEHIDRAFT_168243 [Stereum hirsutum FP-91666
SS1]
Length = 393
Score = 41.6 bits (96), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
CS+ CG+ + + + C+ C YC R CQ +K +HK DC
Sbjct: 41 CSY--CGKEDDEKKPLKACTQCREARYCDRECQVAHYKYQHKKDCT 84
>gi|344924771|ref|ZP_08778232.1| hypothetical protein COdytL_09001 [Candidatus Odyssella
thessalonicensis L13]
Length = 1945
Score = 41.6 bits (96), Expect = 0.67, Method: Composition-based stats.
Identities = 46/156 (29%), Positives = 68/156 (43%), Gaps = 22/156 (14%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS------LMAKAAISSHAQALYSLA 175
A ++L A+ GN++A LG I C G+ L KAA +A A +L
Sbjct: 1232 ALKWLTRAAEQGNLDALSILGYI---CATGEGAAKDEDKAIRLYTKAAEQGNASAQSNLG 1288
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
++ NG G K+D +A L A+AA G+ A LG G GV+Q+ + +
Sbjct: 1289 LMYMNGQGVDKDDT--KAAYWL-AKAAKQGNAFAQTNLGAMYGKGQGVKQDDTKAIEWYT 1345
Query: 236 QANARELAAV-----LSSAACQGISTR-----SWLT 261
+A +E A +S + QGI WLT
Sbjct: 1346 KAAQQENAGAQYNLGVSYLSGQGIEQNYGNAFYWLT 1381
>gi|381394956|ref|ZP_09920667.1| sel1 domain protein repeat-containing protein [Glaciecola punicea
DSM 14233 = ACAM 611]
gi|379329563|dbj|GAB55800.1| sel1 domain protein repeat-containing protein [Glaciecola punicea
DSM 14233 = ACAM 611]
Length = 401
Score = 41.6 bits (96), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 54/118 (45%), Gaps = 6/118 (5%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLA 175
S+ A ++ +L ++ GN EA + L ++ LQ+ A AA A A Y+L
Sbjct: 60 SKEAAKWYRLASEQGNAEAQFNLALMYVSGEDVLQDSKEAAKWFKLAAEQGDASAQYNLG 119
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
++ ++G G KD + G +A GH +A LG G GV Q+ E ++
Sbjct: 120 IMYYSGQGVL---KDFKEGAKWFKLSAEQGHANAQSNLGLMYYFGDGVLQDSKEAAKW 174
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 54/119 (45%), Gaps = 6/119 (5%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLA 175
S+ A ++ KL A+ G+ A Y LG++ + L++ GA +A HA A +L
Sbjct: 96 SKEAAKWFKLAAEQGDASAQYNLGIMYYSGQGVLKDFKEGAKWFKLSAEQGHANAQSNLG 155
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
++ + G G ++ K+ L A G A LG DG GV Q+ E ++
Sbjct: 156 LMYYFGDGVLQDSKEAAKWYRLAAEQ---GDASAQFVLGGIYYDGQGVIQDYKEAVKWF 211
>gi|392568621|gb|EIW61795.1| hypothetical protein TRAVEDRAFT_143901 [Trametes versicolor
FP-101664 SS1]
Length = 404
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 25/46 (54%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKA 372
++ C+ GCGR E R +F++C C YC CQ DW+ +A
Sbjct: 304 VKTCTREGCGRREMRPLDFKKCGGCRQAFYCGPECQREDWQAHRQA 349
>gi|356510092|ref|XP_003523774.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 16-like [Glycine
max]
Length = 1063
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 13/65 (20%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAE-------------RWLDEEGNGDVDVDL 393
RC+ C +V YCS CQ + W+ HK +C P R L E+G + V+
Sbjct: 86 RCAQCKSVRYCSFECQTVHWRQGHKLECRPPSTTCWSDDVASDHGRKLVEQGYSGIHVEK 145
Query: 394 NDAEG 398
+++EG
Sbjct: 146 SESEG 150
>gi|328794268|ref|XP_001120235.2| PREDICTED: protein sel-1 homolog 2-like, partial [Apis mellifera]
Length = 351
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 51/108 (47%), Gaps = 10/108 (9%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
+ KL A+ G+ +A ++LG++ Y QN +AA +A A ++L V+
Sbjct: 10 EYYKLAAEQGDSDAQFSLGLM--YEDGEGTEQNYTEAYKYYMEAARKGNANAQFALGVMF 67
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
NG +N + L A+ G+IDA LG +GYGV QN
Sbjct: 68 ENGEETEQNYAEAYKYYKLAAKQ---GNIDAQFNLGLLYSEGYGVEQN 112
>gi|254473185|ref|ZP_05086583.1| Sel1 [Pseudovibrio sp. JE062]
gi|211957906|gb|EEA93108.1| Sel1 [Pseudovibrio sp. JE062]
Length = 360
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + +L ++ G+ +A LG++ ++ A + KAA ++ +ALY+LA++
Sbjct: 127 AASWYELASNDGDPQAALRLGLLYLAGAGTKADKDKAAQQLEKAAAANIPEALYNLALLH 186
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
G + K ++ +L RA+ G DA+ ELG L+DG
Sbjct: 187 QEGKVRPNDPKQIK---SLLERASETGDADAMLELGIYLKDG 225
>gi|389745101|gb|EIM86283.1| hypothetical protein STEHIDRAFT_157783 [Stereum hirsutum FP-91666
SS1]
Length = 701
Score = 41.6 bits (96), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+ F++C+ C YCS+ CQ WK +HKA+C
Sbjct: 441 KDTFKQCTGCKVTLYCSKECQVASWKKQHKAEC 473
>gi|378728452|gb|EHY54911.1| hypothetical protein HMPREF1120_03070 [Exophiala dermatitidis
NIH/UT8656]
Length = 252
Score = 41.6 bits (96), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 23/35 (65%), Gaps = 1/35 (2%)
Query: 340 TRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
++ + +RCS C + YCSR CQA DWK +HK C
Sbjct: 22 SKTTDLKRCSKCHIIQYCSRQCQAADWK-QHKKPC 55
>gi|134056128|emb|CAK96303.1| unnamed protein product [Aspergillus niger]
Length = 922
Score = 41.6 bits (96), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 25/45 (55%), Gaps = 2/45 (4%)
Query: 331 SHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+ V C R T + CS CG+ YCSR CQ W LRHK DC+
Sbjct: 495 TSVACARC-TATEDLTPCSKCGSTFYCSRECQTTHW-LRHKFDCS 537
>gi|187735941|ref|YP_001878053.1| Sel1 domain-containing protein repeat-containing protein
[Akkermansia muciniphila ATCC BAA-835]
gi|187425993|gb|ACD05272.1| Sel1 domain protein repeat-containing protein [Akkermansia
muciniphila ATCC BAA-835]
Length = 380
Score = 41.6 bits (96), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 59/129 (45%), Gaps = 17/129 (13%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG--------ASLMAKAAISSHA 168
NW ++ + + K A+ G+ A Y L +C N GSG A +AA+ +HA
Sbjct: 160 NWEQAVY-WYKQAAEQGDPRAQYNLA----WCYGN-GSGTPKNPRKAAYWYEEAAMQNHA 213
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
A Y+L NG G + DL + ++A G I A LG C +G G+ ++A
Sbjct: 214 TAQYNLGWCYENGFG---VEPDLDKALVWYHKSALQGQITAQYTLGWCYGNGRGMEVDMA 270
Query: 229 EGRRFLVQA 237
+ + +A
Sbjct: 271 KAVHWYTKA 279
>gi|308509934|ref|XP_003117150.1| CRE-SET-14 protein [Caenorhabditis remanei]
gi|308242064|gb|EFO86016.1| CRE-SET-14 protein [Caenorhabditis remanei]
Length = 429
Score = 41.6 bits (96), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVD 392
C + T E ++CS C V YCS CQ DWKL HK +C + +G+V +D
Sbjct: 26 CNQCLTNMVELKKCSACRKVAYCSAECQRADWKL-HKRECKAI------QAHGEVAID 76
>gi|392566737|gb|EIW59913.1| hypothetical protein TRAVEDRAFT_37131 [Trametes versicolor
FP-101664 SS1]
Length = 323
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 5/70 (7%)
Query: 311 FLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFR----RCSVCGAVNYCSRACQALDW 366
++ EW R P +C+ GCG +R R RC YCS+ CQ DW
Sbjct: 148 YVEEWKKQRRVARDPAAYVCAAEGCGIRGEKRQALRSCAGRCPPGLKPRYCSKKCQKQDW 207
Query: 367 KLRHKADCAP 376
RHK+ C P
Sbjct: 208 S-RHKSICKP 216
>gi|392561582|gb|EIW54763.1| hypothetical protein TRAVEDRAFT_172379 [Trametes versicolor
FP-101664 SS1]
Length = 204
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 334 GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
GC E+ E RRC C YCS ACQ DW+ RHK C
Sbjct: 150 GCQEDESMDKEMRRCGGCKVTRYCSVACQKEDWR-RHKPTC 189
>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
Length = 499
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 20/32 (62%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+ R CS C V YC CQ+ +WKL HK +CA
Sbjct: 76 KLRACSGCKVVKYCDEKCQSANWKLIHKHECA 107
>gi|183233344|ref|XP_651501.2| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|169801621|gb|EAL46113.2| hypothetical protein EHI_178100 [Entamoeba histolytica HM-1:IMSS]
gi|449709974|gb|EMD49133.1| protein kinase domain containing protein [Entamoeba histolytica
KU27]
Length = 1079
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 46/104 (44%), Gaps = 17/104 (16%)
Query: 133 GNVEACYTLGMIRFYCL----------QNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
GNVEA Y L YCL + + + L+ KAA + + L FNG+
Sbjct: 695 GNVEALYNLS----YCLLHGEGCEGTEKEKKTAIKLLLKAAKKGYIPSQLLLGNCYFNGN 750
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
G K++K + V +AA + A+ LG C G GV +N
Sbjct: 751 GTLKDEK---SAVYWYTKAALQNNSTAINNLGSCFYKGNGVEKN 791
>gi|67483170|ref|XP_656868.1| actin binding protein [Entamoeba histolytica HM-1:IMSS]
gi|56474094|gb|EAL51482.1| actin binding protein, putative [Entamoeba histolytica HM-1:IMSS]
gi|449708613|gb|EMD48039.1| actin binding protein, putative [Entamoeba histolytica KU27]
Length = 775
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 9/108 (8%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
E A +K DAG A G RF N+ G L+ A I +ALY+LA+
Sbjct: 643 EEAMTLIKEAVDAGYWVAESWYG--RFIYAFNKREGLRLIRNACIMEDKRALYALAIHLL 700
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
++D+ + + R L ++DA+ +G C G GV++++
Sbjct: 701 K-------NQDVNHAIEILFRLCKLNYVDAIVTVGVCFMKGVGVKKDV 741
>gi|46446343|ref|YP_007708.1| hypothetical protein pc0709 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46399984|emb|CAF23433.1| conserved hypothetical protein [Candidatus Protochlamydia
amoebophila UWE25]
Length = 309
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 57/126 (45%), Gaps = 16/126 (12%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG--------ASLMAKAAISSHAQAL 171
+ A ++ +L A+ +A Y+LG C+ G G A AA +H +A
Sbjct: 131 QEAFKYFQLAANQNYADAQYSLG-----CMYENGQGVAQSDQNAAQCYQLAANQNHVKAQ 185
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+L V+ +G G +++D++ L A+ GH A LG G GV Q+ +
Sbjct: 186 YNLGVMYMHGQGVAQSDQEAARYYQLAAKQ---GHAKAQFSLGFIYAHGKGVEQSDQKAV 242
Query: 232 RFLVQA 237
++ +A
Sbjct: 243 KYYQRA 248
>gi|299738005|ref|XP_001841523.2| hypothetical protein CC1G_13255 [Coprinopsis cinerea okayama7#130]
gi|298403102|gb|EAU80291.2| hypothetical protein CC1G_13255 [Coprinopsis cinerea okayama7#130]
Length = 535
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 3/46 (6%)
Query: 332 HVGCGRPETRRH---EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
H+ P+T R +F+ CS C YCS CQ DW H+++C
Sbjct: 333 HIELTAPKTERRMKRKFKVCSRCKVATYCSAQCQVADWAQFHRSEC 378
>gi|72148780|ref|XP_788420.1| PREDICTED: sel1 repeat-containing protein 1-like
[Strongylocentrotus purpuratus]
Length = 232
Score = 41.6 bits (96), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 31/130 (23%), Positives = 57/130 (43%), Gaps = 12/130 (9%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---------QNRGSGASLMAKAAISSH 167
N ++ H F+K C + GN +AC+ G + Q+R G L+ A H
Sbjct: 86 NKEKALHYFIKGC-NLGNQDACFGAGSVFLSSKKGTPDSDLKQDRDKGLKLLETACKKDH 144
Query: 168 AQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
+ ++L+ + G GG +KD++ + R+ LGH+ + G G+ +N
Sbjct: 145 QASCFNLSAVYLKGLGGM--EKDMKKAIEYSTRSCELGHMYGCVNASRMYKLGDGIAKND 202
Query: 228 AEGRRFLVQA 237
A ++ +A
Sbjct: 203 ALANKYKERA 212
>gi|46446347|ref|YP_007712.1| hypothetical protein pc0713 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46399988|emb|CAF23437.1| conserved hypothetical protein [Candidatus Protochlamydia
amoebophila UWE25]
Length = 447
Score = 41.6 bits (96), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 6/117 (5%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAV 176
+ A ++ +L AD G A LG + F Q+ A AA A A +L
Sbjct: 310 QEAIKYFELIADQGEAIAQCALGFMYFQGKGITQSHQEAAKYFKFAADQGEADAQCALGF 369
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
+ NG G +++D++ L A GH DA ELG + G GV Q+ E R+
Sbjct: 370 MYANGLGVTQSDQEAAKYYKLAADQ---GHADAQYELGTMYKKGLGVEQSSQEALRY 423
>gi|338175039|ref|YP_004651849.1| hypothetical protein PUV_10450 [Parachlamydia acanthamoebae UV-7]
gi|336479397|emb|CCB85995.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV-7]
Length = 812
Score = 41.6 bits (96), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 62/143 (43%), Gaps = 16/143 (11%)
Query: 100 SLVLSK-ASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL 158
+L+L + KK F +KA +F K A+ N+EA Y L + ++ + +
Sbjct: 261 TLILGQFYQKKEFNLKA-------IKFFKQAAEKDNIEAQYRLAQ-SYLDEEDLKNASYW 312
Query: 159 MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQ 218
++KAA S H A + L + KD +AA GH + ++G
Sbjct: 313 LSKAAQSGHILAQHELG-------SSYEEKKDFANAFQWFLKAAEQGHSKSQVKIGGMYY 365
Query: 219 DGYGVRQNIAEGRRFLVQANARE 241
+G+GV QN ++ ++ +E
Sbjct: 366 EGHGVEQNYPMAEKYFLEVTKKE 388
>gi|319639040|ref|ZP_07993798.1| Sel1 domain-containing protein repeat-containing protein [Neisseria
mucosa C102]
gi|317399944|gb|EFV80607.1| Sel1 domain-containing protein repeat-containing protein [Neisseria
mucosa C102]
Length = 299
Score = 41.6 bits (96), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 52/117 (44%), Gaps = 6/117 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRF---YCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ + A+ G+ +A Y LG++ Q+ KA +A Y+L V+
Sbjct: 164 EAAKWYRKAAEQGDAQAQYNLGVMYTDGQGVRQDYVEAVKWYRKATKQGDVKAQYNLGVM 223
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
NG G +N V +AA GH+ A +G +G GVRQN+ + +L
Sbjct: 224 YANGQGVRQN---YVQAVKWIEKAAMQGHVKAQYNVGAMYANGQGVRQNLRVAKAWL 277
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 59/125 (47%), Gaps = 7/125 (5%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALY 172
+++E+A F K A+ G+ EA LG++ +Q+ A KAA AQA Y
Sbjct: 124 QDYAEAAKWFHK-AAEQGSAEAQLNLGVMYANGQGMIQDYVEAAKWYRKAAEQGDAQAQY 182
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
+L V+ +G G +D V +A G + A LG +G GVRQN + +
Sbjct: 183 NLGVMYTDGQGVR---QDYVEAVKWYRKATKQGDVKAQYNLGVMYANGQGVRQNYVQAVK 239
Query: 233 FLVQA 237
++ +A
Sbjct: 240 WIEKA 244
>gi|393227835|gb|EJD35498.1| hypothetical protein AURDEDRAFT_130594 [Auricularia delicata
TFB-10046 SS5]
Length = 505
Score = 41.6 bits (96), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 6/54 (11%)
Query: 327 LRLCSHVGCGRP---ETRRHEFRRCSVCGAVNYCSRACQALDWK---LRHKADC 374
+R C GC + +T F RCS+C AV YCSR CQ DW+ + HKA C
Sbjct: 370 IRGCQRPGCDKTVHDKTTPGGFPRCSMCRAVRYCSRECQDADWRCGAVPHKAVC 423
>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
127.97]
Length = 499
Score = 41.6 bits (96), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 20/32 (62%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+ R CS C V YC CQ+ +WKL HK +CA
Sbjct: 76 KLRACSGCKVVKYCDEKCQSANWKLIHKHECA 107
>gi|393220832|gb|EJD06317.1| hypothetical protein FOMMEDRAFT_138352 [Fomitiporia mediterranea
MF3/22]
Length = 1068
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 29/53 (54%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
G+R C+++ CG+ E+ EF +C C YC + CQ+ W H+ C+ E
Sbjct: 723 GIRQCANMLCGKWESFPREFAKCRRCRKAKYCGKECQSRAWSEGHRFWCSARE 775
>gi|393213325|gb|EJC98822.1| HCP-like protein [Fomitiporia mediterranea MF3/22]
Length = 473
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 40/83 (48%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
+L+ + A H + Y LA NG G +K +D + AA GH DA G C
Sbjct: 148 TLLKRLADRGHGPSQYFLADCYANGIGTAKGKQDFDRAYPMFVLAAKHGHPDAAYRAGTC 207
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 208 CENGWGCRRESAKAVQFFRKAAA 230
>gi|340369302|ref|XP_003383187.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like
[Amphimedon queenslandica]
Length = 1041
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADC 374
F+RCS C +V YCSR CQ DW RH+ C
Sbjct: 716 FKRCSRCKSVEYCSRKCQEDDWS-RHRKRC 744
>gi|365920613|ref|ZP_09444940.1| Sel1 repeat protein [Cardiobacterium valvarum F0432]
gi|364577857|gb|EHM55101.1| Sel1 repeat protein [Cardiobacterium valvarum F0432]
Length = 139
Score = 41.6 bits (96), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 57/128 (44%), Gaps = 7/128 (5%)
Query: 116 NNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALY 172
++ E+ R+ K A G+ A Y+LG + + Q+ AA+ A A
Sbjct: 2 QDYGEARERWEK-AATLGSSYAQYSLGFLYYNGQGVAQDYSKAQQWYELAALQGEATAQN 60
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
SLA + + G G +N R +AA G+ DA LG +G GV Q+I + R
Sbjct: 61 SLATLYYEGKGVVQNYDKAR---QWWEKAAIQGYGDAQFNLGALYYNGNGVPQDIDKARE 117
Query: 233 FLVQANAR 240
+ QA A+
Sbjct: 118 YFAQAAAQ 125
>gi|323452359|gb|EGB08233.1| hypothetical protein AURANDRAFT_64241 [Aureococcus anophagefferens]
Length = 1574
Score = 41.6 bits (96), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 34/69 (49%), Gaps = 7/69 (10%)
Query: 324 GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER-WLD 382
G + C+H CG + + RCS C V YCS+ACQ DW HK C E L
Sbjct: 663 GASIFACAH--CGVAAEKSY---RCSGCKQVTYCSKACQRGDWP-EHKIQCESVEEALLA 716
Query: 383 EEGNGDVDV 391
+ G GD +V
Sbjct: 717 KSGEGDRNV 725
>gi|170044634|ref|XP_001849946.1| zinc finger MYND domain-containing protein 10 [Culex
quinquefasciatus]
gi|167867700|gb|EDS31083.1| zinc finger MYND domain-containing protein 10 [Culex
quinquefasciatus]
Length = 442
Score = 41.6 bits (96), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 28/55 (50%), Gaps = 8/55 (14%)
Query: 320 GGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
GT G R+CS ++ +RCS C +V YCSR CQ DW RHK C
Sbjct: 390 SGTDDAGHRICST-------CKKSAEKRCSKCESVFYCSRECQVADWP-RHKELC 436
>gi|397636713|gb|EJK72385.1| hypothetical protein THAOC_06088, partial [Thalassiosira oceanica]
Length = 653
Score = 41.6 bits (96), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 23/43 (53%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
C + +F++CS C A YCS+ CQ W+ HK DC A
Sbjct: 599 CAKESQEGEKFKQCSKCKAQWYCSKECQVEAWRAGHKKDCKRA 641
>gi|170118182|ref|XP_001890273.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164634749|gb|EDQ99071.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 603
Score = 41.6 bits (96), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 33/91 (36%), Gaps = 16/91 (17%)
Query: 284 HVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRH 343
H GI N P HP R G R+C C PET+
Sbjct: 224 HSKGIMETYFQPAAANYHTPPVHPQQR-------------QRGYRICDQ--CRTPETQSL 268
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+FR C C YCS+ CQ W HKA C
Sbjct: 269 KFRLCGGCLVTQYCSQDCQRDHWP-HHKAIC 298
>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
Length = 555
Score = 41.6 bits (96), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
E + C+ C V YC++ CQA DWKL H +C
Sbjct: 120 ELKACTGCRVVKYCNKTCQAKDWKLFHSLEC 150
>gi|189500441|ref|YP_001959911.1| Sel1 domain-containing protein repeat-containing protein
[Chlorobium phaeobacteroides BS1]
gi|189495882|gb|ACE04430.1| Sel1 domain protein repeat-containing protein [Chlorobium
phaeobacteroides BS1]
Length = 528
Score = 41.6 bits (96), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 6/115 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A ++ ++ A+ G+ +A LG+I F Q+ A AA H +A + L +
Sbjct: 102 AAKWFRMAAEQGHAKAQSNLGLIYFSNQGVQQDYVEAAKWFGMAADQGHTRAQFFLGRMY 161
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
++G G +KN K L AA A LG +G GV QN E R+
Sbjct: 162 YSGEGVTKNHK---TAARLFQLAAKNNDAKAQHNLGVMYAEGQGVEQNYTEAARW 213
>gi|409047583|gb|EKM57062.1| hypothetical protein PHACADRAFT_172745 [Phanerochaete carnosa
HHB-10118-sp]
Length = 357
Score = 41.6 bits (96), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 5/62 (8%)
Query: 328 RLCSHVGCGRPETRRH-----EFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
++C++ C R TR + RCS C NYCS CQ DWK + C P E L+
Sbjct: 276 KICNNPECLRTRTRGKSSQDVKMLRCSRCTVANYCSAECQRADWKRHKEMPCKPFEVLLE 335
Query: 383 EE 384
++
Sbjct: 336 DD 337
>gi|335036713|ref|ZP_08530036.1| enhanced entry protein [Agrobacterium sp. ATCC 31749]
gi|333791861|gb|EGL63235.1| enhanced entry protein [Agrobacterium sp. ATCC 31749]
Length = 370
Score = 41.6 bits (96), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 9/124 (7%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMA-----KAAISSHAQALYSLAV 176
A F++ A+AGN A + G I +N G+ LMA K+A A A Y+++
Sbjct: 183 ADDFMRRAAEAGNASAQFNWGQI--LVSENPGAKGLLMALPFYEKSAEQGIADAQYAVSQ 240
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
I ++ DK +A L RAA G+ A +LG L +G+G +N+ EG R+L
Sbjct: 241 IYWS-VKDVPTDKKAKARDWLM-RAAKAGYDTAQVDLGVWLVNGFGGERNLDEGFRWLYG 298
Query: 237 ANAR 240
A R
Sbjct: 299 AAQR 302
>gi|327297246|ref|XP_003233317.1| MYND domain-containing protein [Trichophyton rubrum CBS 118892]
gi|326464623|gb|EGD90076.1| MYND domain-containing protein [Trichophyton rubrum CBS 118892]
Length = 101
Score = 41.6 bits (96), Expect = 0.78, Method: Composition-based stats.
Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKADCAP 376
G+R C++ CG+ E + +F +C C YCS+ CQ W RH +P
Sbjct: 48 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKECQKSAWVYHRHWCHSSP 99
>gi|222086265|ref|YP_002544797.1| hypothetical protein Arad_2749 [Agrobacterium radiobacter K84]
gi|221723713|gb|ACM26869.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
Length = 393
Score = 41.6 bits (96), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 50/108 (46%), Gaps = 6/108 (5%)
Query: 125 FLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+ + AD GNV+A Y L ++ ++ G S KAA +A Y+LAV+ +G
Sbjct: 184 WYRKAADQGNVDAQYNLAIMYDSGEGITKDSGQALSWYRKAADQGKIEAQYNLAVMYRDG 243
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
+G KD V +AA G DA LG DG G+ ++ E
Sbjct: 244 AGVP---KDGARAVTWFRKAADQGDADAQYNLGTMYADGDGIAEDDVE 288
>gi|397632620|gb|EJK70634.1| hypothetical protein THAOC_07990 [Thalassiosira oceanica]
Length = 440
Score = 41.6 bits (96), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 23/43 (53%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
C R +F++CS C A YCS+ CQ W+ H+ DC A
Sbjct: 386 CARKAQSDEKFKQCSKCNAQWYCSKECQVGSWRAGHRKDCKRA 428
>gi|299754820|ref|XP_002912036.1| hypothetical protein CC1G_13564 [Coprinopsis cinerea okayama7#130]
gi|298410939|gb|EFI28542.1| hypothetical protein CC1G_13564 [Coprinopsis cinerea okayama7#130]
Length = 592
Score = 41.6 bits (96), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 22/35 (62%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
F+ CS C V YCS+ CQ DW+ RH+ +C+ R
Sbjct: 383 FKGCSDCHMVIYCSKKCQLEDWRNRHRDECSDMRR 417
>gi|290982293|ref|XP_002673865.1| Sel1 domain protein repeat-containing protein [Naegleria gruberi]
gi|284087451|gb|EFC41121.1| Sel1 domain protein repeat-containing protein [Naegleria gruberi]
Length = 404
Score = 41.6 bits (96), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 4/119 (3%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
E A + K AD G+V+A Y G + +Y + + S K AQ++ + ++
Sbjct: 138 QEMALYWYKKAADNGSVDAIYRCGNV-YYMGDDYSNALSWFWKGNALGDAQSMEKIGLMY 196
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+G G + DL+ +A LG ID + +G+ G GV+ + +E + + A
Sbjct: 197 RHGFGLKR---DLKKAFEYFENSADLGCIDGIFRVGYAFHSGEGVKLDYSEAMEYYLDA 252
>gi|15220485|ref|NP_176929.1| putative F-box protein [Arabidopsis thaliana]
gi|75263152|sp|Q9FXC7.1|FB79_ARATH RecName: Full=Putative F-box protein At1g67623
gi|11072021|gb|AAG28900.1|AC008113_16 F12A21.25 [Arabidopsis thaliana]
gi|332196551|gb|AEE34672.1| putative F-box protein [Arabidopsis thaliana]
Length = 296
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 59/130 (45%), Gaps = 4/130 (3%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
D+LP+DL+V I + A S N+ + K + V + S K I+
Sbjct: 24 LDSLPEDLLVEI--SSCTGASSLSAVRNLRLVSKSFRRICDEKYVFYRLSLK--EIEFLP 79
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
W E++ +F++ C ++ N EA + G I ++ + + G +A+AA +A Y VI
Sbjct: 80 WHENSAKFIERCTESRNPEALFQKGFINYFRDKLQDRGLEYLAEAAEKGIKEAKYVYGVI 139
Query: 178 QFNGSGGSKN 187
G +K
Sbjct: 140 LICLGGKTKQ 149
>gi|414869932|tpg|DAA48489.1| TPA: putative ubiquitin carboxyl-terminal hydrolase superfamily
protein [Zea mays]
Length = 656
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 27/58 (46%), Gaps = 8/58 (13%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE---RWLDEEGNGDV 389
C RP T R C C AV YCS CQ W+ HK +C P R DEE +V
Sbjct: 118 CFRPTTFR-----CKQCKAVKYCSFKCQIAHWRQGHKDECHPPSVNTRPDDEEEKAEV 170
>gi|397565442|gb|EJK44625.1| hypothetical protein THAOC_36821, partial [Thalassiosira oceanica]
Length = 447
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 2/88 (2%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
+N G +++ + A+ L F G G + KD+R V L A AA LG + A
Sbjct: 276 KNDADGMAMVQSRVLKKDPAAILFLGEKYFLGELGLQ--KDMRRAVDLFAEAAKLGSVGA 333
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQA 237
L LG G GV+++ A+G +F +A
Sbjct: 334 LYHLGIAYYHGEGVQEDKAKGVQFFTKA 361
>gi|345483028|ref|XP_001605003.2| PREDICTED: hypothetical protein LOC100121386 isoform 1 [Nasonia
vitripennis]
Length = 1219
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 28/62 (45%), Gaps = 1/62 (1%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGEDNVVA 406
RC CG YCS+ CQ DW HK +C P + N + +D E ED+ V
Sbjct: 251 RCQKCGITFYCSKPCQVNDWA-NHKLECQPIPPLVKRVSNKSQVQNGHDIESSTEDHKVE 309
Query: 407 DS 408
S
Sbjct: 310 SS 311
>gi|158294421|ref|XP_315591.4| AGAP005582-PA [Anopheles gambiae str. PEST]
gi|157015557|gb|EAA11866.4| AGAP005582-PA [Anopheles gambiae str. PEST]
Length = 351
Score = 41.6 bits (96), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 26/50 (52%), Gaps = 7/50 (14%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
LC GC P + CS C AVNYCS A Q LDWK HK+ C E
Sbjct: 131 LCVVCGCRGP-------KLCSRCKAVNYCSAAHQRLDWKAVHKSVCCSEE 173
>gi|154418755|ref|XP_001582395.1| TKL family protein kinase [Trichomonas vaginalis G3]
gi|121916630|gb|EAY21409.1| TKL family protein kinase [Trichomonas vaginalis G3]
Length = 1834
Score = 41.6 bits (96), Expect = 0.81, Method: Composition-based stats.
Identities = 40/150 (26%), Positives = 64/150 (42%), Gaps = 13/150 (8%)
Query: 112 AIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISS 166
A+ +N +A ++LK+ +D GN+EA T +F C +N L AA +
Sbjct: 888 AVAPDNDDATAFQYLKMASDKGNIEA--TAKYAQFLRKGRGCQKNLSQSLQLFKTAADKN 945
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
A A + +G G S N LR AA G D + L C G G ++
Sbjct: 946 CLSAQVGYAEMLHDGEGTSPN---LREAFYYTKEAAKHGGADEIYNLAKCYHTGDGTSKD 1002
Query: 227 IAEG-RRFLVQANARELAA--VLSSAACQG 253
+ + + + A+ L A V+S+ A +G
Sbjct: 1003 LRQALENYKIAADKGSLLAVGVISNVAKEG 1032
>gi|321473768|gb|EFX84735.1| hypothetical protein DAPPUDRAFT_314623 [Daphnia pulex]
Length = 1216
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+ LC C R R + +RCS C AV YC+ CQ DW HKA C
Sbjct: 1036 NIPLCDPSVCARCMRRSAKLKRCSRCKAVKYCTVDCQKADWS-THKACCV 1084
>gi|113460299|ref|YP_718359.1| TPR repeat-containing protein [Haemophilus somnus 129PT]
gi|112822342|gb|ABI24431.1| conserved hypothetical protein, with TPR repeat (tetratrico peptide
repeat) [Haemophilus somnus 129PT]
Length = 181
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 60/124 (48%), Gaps = 6/124 (4%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQ 169
IK N+S+ A K A+ G+ A + LG++ Y QN KAA +A
Sbjct: 36 IKQQNYSD-AFPLFKQLAEQGDANAQHNLGLMYEYGDGITQNDQQAVYWYTKAAEQGYAN 94
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A +L ++ +G G ++ + V +AA G+ +A LG +G GV++N+++
Sbjct: 95 AQNNLGLMYTDGGKGIT--QNYKQAVYWYTKAAEQGYANAQYNLGVMYANGQGVQRNVSK 152
Query: 230 GRRF 233
+++
Sbjct: 153 AKQY 156
>gi|406606330|emb|CCH42321.1| hypothetical protein BN7_1865 [Wickerhamomyces ciferrii]
Length = 703
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+ CG+ E EF +C C YCS+ CQ W + HK C
Sbjct: 622 GVRQCASFQCGKWENFPREFAKCRRCKRTKYCSKECQLKAW-IYHKHWCV 670
>gi|397627574|gb|EJK68529.1| hypothetical protein THAOC_10280, partial [Thalassiosira oceanica]
Length = 1375
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 36/65 (55%), Gaps = 2/65 (3%)
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
QA+ L F GS GS+ KD+R V L AA LG I+AL +LG+ G GV+Q+
Sbjct: 1223 QAIDMLGKKYFFGSLGSQ--KDVRRAVELFTEAAELGSIEALFDLGNAYFHGEGVQQDKG 1280
Query: 229 EGRRF 233
+ F
Sbjct: 1281 KANEF 1285
>gi|330934908|ref|XP_003304752.1| hypothetical protein PTT_17423 [Pyrenophora teres f. teres 0-1]
gi|311318500|gb|EFQ87148.1| hypothetical protein PTT_17423 [Pyrenophora teres f. teres 0-1]
Length = 255
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 26/52 (50%), Gaps = 1/52 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
G+R C++ CGR E +F +C C YCS+ CQ W H+ C A
Sbjct: 204 GIRQCAYYQCGRWEEYTRQFAKCRRCRRTKYCSKECQKSAWVF-HRHWCVAA 254
>gi|334343014|ref|YP_004555618.1| Sel1 domain-containing protein repeat-containing protein
[Sphingobium chlorophenolicum L-1]
gi|334103689|gb|AEG51112.1| Sel1 domain protein repeat-containing protein [Sphingobium
chlorophenolicum L-1]
Length = 261
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 62/127 (48%), Gaps = 8/127 (6%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMI---RFYCLQNRGSGASLMAKAAISSHAQALYSLAV 176
++A ++ + A G +A Y LG I F Q+ A +AA H A ++LAV
Sbjct: 95 KAAMKWFRRAAALGVADAQYNLGEIYLREFGVDQDLVEAARWYTRAAEQGHVGAQFTLAV 154
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGV-RQNIAEGRRF-L 234
+ G G S++ L+A V RAA G +A +LG G GV R ++ + F L
Sbjct: 155 LYMIGQGVSRSP--LKA-VYWFERAASQGSAEAQVQLGIIYGAGQGVARDSVVAYKWFAL 211
Query: 235 VQANARE 241
QANARE
Sbjct: 212 GQANARE 218
>gi|260786582|ref|XP_002588336.1| hypothetical protein BRAFLDRAFT_81475 [Branchiostoma floridae]
gi|229273497|gb|EEN44347.1| hypothetical protein BRAFLDRAFT_81475 [Branchiostoma floridae]
Length = 267
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEE 384
C + CG+P R+ + C C YCSR CQ W + HK C ++++DEE
Sbjct: 190 CCNPKCGKPGYRK-TLKLCGRCKLTRYCSRDCQIQHWSVGHKK-CCGRDKYIDEE 242
>gi|153875215|ref|ZP_02003111.1| Sel1-like repeat [Beggiatoa sp. PS]
gi|152068320|gb|EDN66888.1| Sel1-like repeat [Beggiatoa sp. PS]
Length = 249
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 61/138 (44%), Gaps = 10/138 (7%)
Query: 103 LSKASKKTFAIKAN---NWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGA 156
L K K ++IK + N++E+ F K A+ + EA Y LGM+ ++ Q+ G
Sbjct: 74 LKKTLGKIYSIKDSPIVNFAEAEKLFGK-AAEQNDAEAQYLLGMMYYHGDGVAQSYGYAE 132
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
S KAA ++A A L + F G G +N +AA G DA L
Sbjct: 133 SWFRKAAQQNYAPAQSGLGQLFFEGKGVKQN---FTEAFIWTEKAAEQGLADAQNNLAMM 189
Query: 217 LQDGYGVRQNIAEGRRFL 234
G GV Q+ + +++
Sbjct: 190 YYKGQGVPQDSVKAYQWI 207
>gi|345483030|ref|XP_003424728.1| PREDICTED: hypothetical protein LOC100121386 isoform 2 [Nasonia
vitripennis]
Length = 1210
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 28/62 (45%), Gaps = 1/62 (1%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDAEGGGEDNVVA 406
RC CG YCS+ CQ DW HK +C P + N + +D E ED+ V
Sbjct: 251 RCQKCGITFYCSKPCQVNDWA-NHKLECQPIPPLVKRVSNKSQVQNGHDIESSTEDHKVE 309
Query: 407 DS 408
S
Sbjct: 310 SS 311
>gi|169618661|ref|XP_001802744.1| hypothetical protein SNOG_12524 [Phaeosphaeria nodorum SN15]
gi|160703659|gb|EAT80337.2| hypothetical protein SNOG_12524 [Phaeosphaeria nodorum SN15]
Length = 129
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
RRCS C A YC R+CQ DWK H+ C PA++
Sbjct: 74 RRCSRCKAAYYCDRSCQKTDWKA-HRNVCEPAQQ 106
>gi|169863706|ref|XP_001838471.1| hypothetical protein CC1G_12395 [Coprinopsis cinerea okayama7#130]
gi|116500439|gb|EAU83334.1| hypothetical protein CC1G_12395 [Coprinopsis cinerea okayama7#130]
Length = 242
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G R+C CG PET +FR C C YCS+ CQ W HKA C
Sbjct: 24 GYRICD--ACGVPETPSVKFRLCGGCMTTQYCSQDCQKSHWP-SHKAIC 69
>gi|451851643|gb|EMD64941.1| hypothetical protein COCSADRAFT_316329 [Cochliobolus sativus
ND90Pr]
Length = 608
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CGR E +F +C C YCS+ CQ W RH
Sbjct: 557 GIRQCAYYQCGRWEEYTRQFAKCRRCRRTKYCSKECQKSAWVFHRH 602
>gi|448930606|gb|AGE54170.1| histone-lysine N-methyltransferase / SET domain containing protein
[Paramecium bursaria Chlorella virus IL-5-2s1]
gi|448934740|gb|AGE58292.1| histone-lysine N-methyltransferase / SET domain containing protein
[Paramecium bursaria Chlorella virus NY-2B]
Length = 190
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
+RCS C + YCS CQ DWK HK C P E+
Sbjct: 122 LKRCSCCRMIRYCSEECQKRDWK-EHKNSCVPKEK 155
>gi|393240863|gb|EJD48387.1| hypothetical protein AURDEDRAFT_112881 [Auricularia delicata
TFB-10046 SS5]
Length = 459
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 26/52 (50%), Gaps = 5/52 (9%)
Query: 328 RLCSHVGCGRP--ETRRHEFRRCSVCGAVNYCSRACQALDWKL---RHKADC 374
R CS GCGR + F C+ C V YCSR CQ DW RHKA C
Sbjct: 356 RGCSSPGCGRTIHDNGGRPFPTCARCKTVRYCSRECQQRDWTTGQNRHKAIC 407
>gi|449540361|gb|EMD31354.1| hypothetical protein CERSUDRAFT_89230 [Ceriporiopsis subvermispora
B]
Length = 292
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 27/58 (46%), Gaps = 1/58 (1%)
Query: 317 AARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
A G G + C GC E+R RC C + YCS+ CQ W+ RHKA C
Sbjct: 139 AKLGKNLGALIARCEADGCSAVESRSVRLLRCMKCQQILYCSKQCQRAHWR-RHKAAC 195
>gi|157953435|ref|YP_001498326.1| hypothetical protein AR158_c245L [Paramecium bursaria Chlorella
virus AR158]
gi|156068083|gb|ABU43790.1| hypothetical protein AR158_c245L [Paramecium bursaria Chlorella
virus AR158]
Length = 190
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
+RCS C + YCS CQ DWK HK C P E+
Sbjct: 122 LKRCSCCRMIRYCSEECQKRDWK-EHKNSCVPKEK 155
>gi|238589601|ref|XP_002392067.1| hypothetical protein MPER_08410 [Moniliophthora perniciosa FA553]
gi|215457608|gb|EEB92997.1| hypothetical protein MPER_08410 [Moniliophthora perniciosa FA553]
Length = 147
Score = 41.2 bits (95), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 25/52 (48%), Gaps = 6/52 (11%)
Query: 335 CGRPETRRHEFRRCSVCGAVN------YCSRACQALDWKLRHKADCAPAERW 380
CG+ E + + RCS C YCS+ CQ DWK RHK + A W
Sbjct: 89 CGKFENKESKLFRCSKCSTERRSRSALYCSKDCQRADWKARHKEEHAGVREW 140
>gi|451995569|gb|EMD88037.1| hypothetical protein COCHEDRAFT_1183264 [Cochliobolus
heterostrophus C5]
Length = 585
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CGR E +F +C C YCS+ CQ W RH
Sbjct: 534 GIRQCAYYQCGRWEEYTRQFAKCRRCRRTKYCSKECQKSAWVFHRH 579
>gi|448931253|gb|AGE54815.1| histone-lysine N-methyltransferase / SET domain containing protein
[Paramecium bursaria Chlorella virus MA-1D]
gi|448935115|gb|AGE58666.1| histone-lysine N-methyltransferase / SET domain containing protein
[Paramecium bursaria Chlorella virus NYs1]
Length = 190
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
+RCS C + YCS CQ DWK HK C P E+
Sbjct: 122 LKRCSCCRMIRYCSEECQKRDWK-EHKNSCVPKEK 155
>gi|429753148|ref|ZP_19285964.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 326 str. F0382]
gi|429174242|gb|EKY15724.1| Sel1 repeat protein [Capnocytophaga sp. oral taxon 326 str. F0382]
Length = 807
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 63/132 (47%), Gaps = 13/132 (9%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL------MAKAAISSHAQALYSL 174
S + +KL A+ G+V+A L L+ +G S + K A + AQA Y L
Sbjct: 22 SVKKLMKL-AEKGDVKAQSELAEA---YLKGKGVKRSFQDAALWLEKVAETGDAQAQYQL 77
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
A + +G G K+++ G A+AA G+ A +EL C +DG GV Q+ + ++
Sbjct: 78 AHLHLDGKGMPKSEEK---GTEWLAKAAENGNQKAEQELALCYRDGRGVPQSTEKYYAWI 134
Query: 235 VQANARELAAVL 246
+ +E A L
Sbjct: 135 EKNADKEKAETL 146
Score = 39.7 bits (91), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 4/102 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NV A LG + Q+ G + KAA +A+ Y L +NGSG
Sbjct: 680 WLSKAAEQNNVAALTELGAY-YESKQDFGQAVTQYQKAAHREYAEGQYKLGNCYYNGSGL 738
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
++++ RAA G+ A LG+C G G++Q+
Sbjct: 739 ERSNEK---AADYYKRAARQGYAPAQFRLGNCYYHGEGIQQS 777
>gi|389742410|gb|EIM83597.1| hypothetical protein STEHIDRAFT_141295 [Stereum hirsutum FP-91666
SS1]
Length = 678
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 34/73 (46%), Gaps = 22/73 (30%)
Query: 321 GTPGPGLRLCSHVGCGRPETRRHE-------------FRRCSVCGAVNYCSRACQALDWK 367
GT P +LC RP+ R + +RCS+C YCS CQ DWK
Sbjct: 435 GTFSPTAKLC------RPKENRDDCQVCSKTPEGDESLKRCSICKNRFYCSSRCQKRDWK 488
Query: 368 LRHKADCA--PAE 378
+HK DC+ P+E
Sbjct: 489 -KHKVDCSLLPSE 500
>gi|302422266|ref|XP_003008963.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261352109|gb|EEY14537.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 231
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 31/59 (52%), Gaps = 7/59 (11%)
Query: 317 AARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW-KLRHKADC 374
AARGG +C+ GC + ET RC+ CG YCS+ CQ DW + HK DC
Sbjct: 167 AARGGEAA----VCA--GCEKEETEGDVLMRCTGCGRARYCSKVCQKKDWGEGGHKVDC 219
>gi|148244585|ref|YP_001219279.1| hypothetical protein COSY_0436 [Candidatus Vesicomyosocius okutanii
HA]
gi|146326412|dbj|BAF61555.1| conserved hypothetical protein [Candidatus Vesicomyosocius okutanii
HA]
Length = 149
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 6/120 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A++ L A GN EA + +GM++ L +N+ G +AA HA A + L V
Sbjct: 29 AYQLLAPLATKGNAEALWRVGMMQMNGLGMVENQPLGFENFLQAAGKGHAFAHHMLGVAY 88
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
G G +KD+ + + A G + LG +DG V+Q++ + + + +A+
Sbjct: 89 MTGEG---VEKDIIQSIEWFKKGAKFGISGPMYTLGMLFEDGKEVKQDLEKAQFWFDKAD 145
>gi|397568327|gb|EJK46081.1| hypothetical protein THAOC_35274, partial [Thalassiosira oceanica]
Length = 548
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 47/109 (43%), Gaps = 2/109 (1%)
Query: 129 CADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKND 188
CA A + R N +++ QA+ L F G G +
Sbjct: 203 CALAAQKRGMFDCAFCRTPLSDNDADTMAMVRARVAKKDPQAILFLGNKYFFGELGLQ-- 260
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD+R V L +AA LG I AL ELG+ +G GV ++ A+ +F +A
Sbjct: 261 KDMRKAVELWTKAAELGSIGALFELGNAYDNGDGVEEDNAKAAQFWTKA 309
>gi|303286119|ref|XP_003062349.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455866|gb|EEH53168.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 405
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 35/74 (47%), Gaps = 3/74 (4%)
Query: 324 GPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE 383
G G C ++ C + R ++CS CG V YCS+ CQ W + HKA C + E
Sbjct: 57 GSGTLKCGNLACEDRDVSR--VKKCSRCGIVAYCSKECQVAHWPI-HKAMCDEQKLMHAE 113
Query: 384 EGNGDVDVDLNDAE 397
G ++ D N E
Sbjct: 114 TGKSPLEKDNNTLE 127
>gi|414869933|tpg|DAA48490.1| TPA: putative ubiquitin carboxyl-terminal hydrolase superfamily
protein [Zea mays]
Length = 973
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 27/58 (46%), Gaps = 8/58 (13%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE---RWLDEEGNGDV 389
C RP T R C C AV YCS CQ W+ HK +C P R DEE +V
Sbjct: 118 CFRPTTFR-----CKQCKAVKYCSFKCQIAHWRQGHKDECHPPSVNTRPDDEEEKAEV 170
>gi|307104799|gb|EFN53051.1| hypothetical protein CHLNCDRAFT_137302 [Chlorella variabilis]
Length = 1214
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 19/33 (57%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
RRCS C +YC ACQ DW+ HK C PA
Sbjct: 1180 LRRCSSCRQASYCGAACQTRDWRCWHKTVCQPA 1212
>gi|449435498|ref|XP_004135532.1| PREDICTED: putative F-box protein At1g67623-like [Cucumis sativus]
Length = 255
Score = 41.2 bits (95), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 33/157 (21%), Positives = 68/157 (43%), Gaps = 7/157 (4%)
Query: 25 NDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFV 84
N KR+ S + + R K P S L +LP+DL++ +L K+++++ D V
Sbjct: 3 NQLKRKSEPSFNSQISTRTKKMCRP---FSSSLIVSLPNDLLIEVLAKVAASSYI--DLV 57
Query: 85 NVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSES--AHRFLKLCADAGNVEACYTLG 142
+ K + + + S + + + W+ + + F++ + N E+ Y G
Sbjct: 58 QAKLATKLFLHASNDGYIFQHVSLENKSFRNLLWNNTPQSRSFMETLNNNENPESLYRKG 117
Query: 143 MIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
+ F+ SG S + ++A + A Y V+ +
Sbjct: 118 TMEFFSHGKEASGLSYLKQSAQKGYVDACYVYGVVMY 154
>gi|421596302|ref|ZP_16040153.1| hypothetical protein BCCGELA001_04002, partial [Bradyrhizobium sp.
CCGE-LA001]
gi|404271588|gb|EJZ35417.1| hypothetical protein BCCGELA001_04002, partial [Bradyrhizobium sp.
CCGE-LA001]
Length = 311
Score = 41.2 bits (95), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 80/192 (41%), Gaps = 29/192 (15%)
Query: 69 ILCKLSSTARCPSDFVNVLITCK---RMNGLALNSLVLSKASKKTFAIK---------AN 116
+ LS T+R + V + K + G AL + + + + I A
Sbjct: 54 VTGALSGTSRAKLGLIQVPPSEKLPEAIGGPALRTAAMKGDATAAYEIGMRFAEGKGVAT 113
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISSHA 168
N+ E+A ++ A AG V A + LG L +G G +AA +A
Sbjct: 114 NYDEAA-KWYDRAAQAGVVPATFRLG-----TLYEKGLGVKKDADIARRYYTQAAERGNA 167
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
+A+++LAV+ +G G N ++ +AA G D+ LG G GV QN+A
Sbjct: 168 KAMHNLAVLDADGGGRGAN---YKSAAQWFRKAADRGVADSQFNLGILYARGIGVEQNLA 224
Query: 229 EGRRFLVQANAR 240
E ++ A A+
Sbjct: 225 ESYKWFSLAAAQ 236
>gi|329119951|ref|ZP_08248623.1| Sel1 repeat superfamily protein [Neisseria bacilliformis ATCC
BAA-1200]
gi|327463864|gb|EGF10178.1| Sel1 repeat superfamily protein [Neisseria bacilliformis ATCC
BAA-1200]
Length = 354
Score = 41.2 bits (95), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 7/115 (6%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHAQAL 171
A N+ ++A F K AD G ++ + LG++ +QN A+ KAA HA A
Sbjct: 83 AQNYQQAAAWFQK-AADQGYADSQFNLGIMSAEGLGMMQNHQQAATWFQKAAGQGHADAQ 141
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+ LA + G G +N + A +AA GH DA L +G GV Q+
Sbjct: 142 FRLAKLYAWGLGVPQNHQQ---AAAWFQKAANQGHADAQLFLASMYAEGIGVAQD 193
>gi|255545936|ref|XP_002514028.1| conserved hypothetical protein [Ricinus communis]
gi|223547114|gb|EEF48611.1| conserved hypothetical protein [Ricinus communis]
Length = 1060
Score = 41.2 bits (95), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
RC+ C AV YCS CQ + W+ HK +C PA
Sbjct: 86 RCARCKAVRYCSGKCQIIHWRQGHKEECRPA 116
>gi|86139341|ref|ZP_01057910.1| hypothetical protein MED193_12863 [Roseobacter sp. MED193]
gi|85823844|gb|EAQ44050.1| hypothetical protein MED193_12863 [Roseobacter sp. MED193]
Length = 778
Score = 41.2 bits (95), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 3/99 (3%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
Q+ S KAA H +A Y+LA + NG G S++ +AA +++A
Sbjct: 566 QDDALAVSWYRKAANRDHPRAQYNLAWMYENGRGTSQS---YSRAYDWYQKAAQADYLNA 622
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSS 248
++G ++GYGV QN E R+ A AR+L +S
Sbjct: 623 QYKIGVFHREGYGVSQNDVEAVRWFRMAAARDLPKAQTS 661
>gi|357148128|ref|XP_003574640.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 16-like
[Brachypodium distachyon]
Length = 970
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 11/76 (14%)
Query: 301 PAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRA 360
P AH +AE P P + +C+ C +P T R C C AV YCS
Sbjct: 89 PVAPAHEEPEVVAET----QQQPQPPVGVCA--VCFKPTTFR-----CKQCKAVKYCSFK 137
Query: 361 CQALDWKLRHKADCAP 376
CQ W+ HK +C P
Sbjct: 138 CQIAHWRQGHKDECHP 153
>gi|407923773|gb|EKG16837.1| Zinc finger MYND-type protein [Macrophomina phaseolina MS6]
Length = 614
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 29/54 (53%), Gaps = 1/54 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G+R C++ CG+ E +F +C C YCS+ CQ W + H+ C A++
Sbjct: 562 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAW-VYHRHWCVAAQQ 614
>gi|397570438|gb|EJK47299.1| hypothetical protein THAOC_33990 [Thalassiosira oceanica]
Length = 420
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 68/145 (46%), Gaps = 25/145 (17%)
Query: 98 LNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGS 154
L L K ++K F + W+E+A + G+++A + LG++ Q+
Sbjct: 281 FGQLGLQKDARKAFEL----WAEAA--------ELGSIQALFNLGVVYDTGNGVKQDESK 328
Query: 155 GASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELG 214
GA L KAA+ A++ YSL I+ G K + D RAG ++ +GH D++ +
Sbjct: 329 GAKLYKKAAMQGCAESRYSLGCIE-----GEKGNYD-RAGRHFLI-SSKMGHRDSVETIK 381
Query: 215 HCLQDGYGVRQNIAE---GRRFLVQ 236
L +G + + + G +F V+
Sbjct: 382 RMLMEGAATKDHFTQALKGYQFAVE 406
>gi|392597868|gb|EIW87190.1| HCP-like protein [Coniophora puteana RWD-64-598 SS2]
Length = 436
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 40/83 (48%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
+L+ + A H + Y LA NG G K +D + L AA GH DA G C
Sbjct: 116 ALLKRLADRGHPSSQYFLADCYANGLGTRKGKQDFDSAYPLFVLAAKHGHPDAAYRAGTC 175
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 176 CENGWGCRRESAKALQFFRKAAA 198
>gi|418022709|ref|ZP_12661695.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
OS625]
gi|353537711|gb|EHC07267.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
OS625]
Length = 491
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
S+ A ++L+ A+ EA LG++ +Y +N ++KAA + + A Y LA++
Sbjct: 291 SKLALQYLQRAAEGEQTEAMANLGVL-YYQQKNLAQAYHFISKAAQAGYPHAQYHLALML 349
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFLVQ 236
NG G + +D+ A A AA G +DA L H L D ++ + +L Q
Sbjct: 350 ANGDGCT---RDMIASEYWMAEAAEQGQLDAMLTRAQHMLNDDNAFGSDLTQAEDYLRQ 405
>gi|169625527|ref|XP_001806167.1| hypothetical protein SNOG_16037 [Phaeosphaeria nodorum SN15]
gi|160705670|gb|EAT76616.2| hypothetical protein SNOG_16037 [Phaeosphaeria nodorum SN15]
Length = 571
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CGR E +F +C C YCS+ CQ W RH
Sbjct: 519 GIRQCAYYQCGRWEEFTRQFAKCRRCRRTKYCSKECQKSAWVFHRH 564
>gi|397563667|gb|EJK43891.1| hypothetical protein THAOC_37619, partial [Thalassiosira oceanica]
Length = 504
Score = 41.2 bits (95), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 12/125 (9%)
Query: 153 GSGASLMAKAAISSHAQ---ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
GS A ++A+ A A+ LA ++G+ G + +D+ + L AA LG ++A
Sbjct: 333 GSDAGIIARVRKRVDANDPVAIQRLASAYYHGNWGLQ--QDVPRAIELWTEAARLGDLNA 390
Query: 210 LRELGHCLQDGYGVRQNIAEGRRFLVQANAR-------ELAAVLSSAACQGISTRSWLTW 262
+LGH DG GV Q++A G R A + EL + + ++ R W+
Sbjct: 391 HYKLGHRYYDGDGVEQDVARGVRHFQYAAMQGHPDSRFELGCIEYDSGNHELAVRHWMVS 450
Query: 263 NPLSQ 267
+ Q
Sbjct: 451 AKMGQ 455
>gi|297825441|ref|XP_002880603.1| ubiquitin carboxyl-terminal hydrolase family protein [Arabidopsis
lyrata subsp. lyrata]
gi|297326442|gb|EFH56862.1| ubiquitin carboxyl-terminal hydrolase family protein [Arabidopsis
lyrata subsp. lyrata]
Length = 650
Score = 41.2 bits (95), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 40/94 (42%), Gaps = 19/94 (20%)
Query: 310 RFLAEWFAARGGTPG-----------PGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCS 358
R A++F GG+ G P CS CG+ T++ CS C +V YCS
Sbjct: 33 RTAAKYFEVGGGSGGFDRDHRRDFMVPDTVECS--VCGKTSTKK-----CSRCKSVRYCS 85
Query: 359 RACQALDWKLRHKADCAPAERWLDEEGNGDVDVD 392
CQ DWK HK C R D G D+D
Sbjct: 86 ADCQTSDWKSGHKLKCK-VFRSTDSSPVGRDDID 118
>gi|367003675|ref|XP_003686571.1| hypothetical protein TPHA_0G02960 [Tetrapisispora phaffii CBS 4417]
gi|357524872|emb|CCE64137.1| hypothetical protein TPHA_0G02960 [Tetrapisispora phaffii CBS 4417]
Length = 659
Score = 41.2 bits (95), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
+R C++ CG+ E+ EF +C C YCSR CQ W
Sbjct: 552 NIRQCANFECGKWESYPREFAKCRRCKRTKYCSRECQLKGW 592
>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
SO2202]
Length = 558
Score = 41.2 bits (95), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 23/39 (58%), Gaps = 3/39 (7%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC---APAER 379
E + C+ C V YCS+ CQ WK HK +C APAER
Sbjct: 78 ETKACTGCKRVRYCSKTCQTRAWKREHKYECKVLAPAER 116
>gi|393219982|gb|EJD05468.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 635
Score = 41.2 bits (95), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
++C+ C A++YC CQ DW+ HK +C +RW
Sbjct: 140 LKKCTKCKAIHYCGTTCQNADWQF-HKLECEALQRW 174
>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
Length = 484
Score = 41.2 bits (95), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
+ C CG V YC + CQA DWKL H +C ++
Sbjct: 66 LKGCRGCGVVKYCDKTCQAKDWKLGHSFECTIYQK 100
>gi|242064616|ref|XP_002453597.1| hypothetical protein SORBIDRAFT_04g008750 [Sorghum bicolor]
gi|241933428|gb|EES06573.1| hypothetical protein SORBIDRAFT_04g008750 [Sorghum bicolor]
Length = 966
Score = 41.2 bits (95), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 33/71 (46%), Gaps = 18/71 (25%)
Query: 323 PGPGLRLCSHVGCGRP-ETRRHEFR-----------RCSVCGAVNYCSRACQALDWKLRH 370
PGP LR + RP + R E+ RCS C +V YCS CQ + W+ H
Sbjct: 65 PGPSLRTMPYF---RPAPSLRQEYHECATCHAPAKTRCSRCKSVRYCSGKCQIVHWRQGH 121
Query: 371 KADCAPAERWL 381
K C ++WL
Sbjct: 122 KETC---QKWL 129
>gi|284799409|ref|ZP_05983910.2| TPR repeat protein [Neisseria subflava NJ9703]
gi|284797776|gb|EFC53123.1| TPR repeat protein [Neisseria subflava NJ9703]
Length = 420
Score = 41.2 bits (95), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 51/112 (45%), Gaps = 6/112 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ + A+ G+ +A + LGM+ Q+ +AA HA+A Y+L V+
Sbjct: 248 EAIKWYRKAAEQGDAQAQFNLGMMYLQGQGVRQDNAQAVQWFGRAAEQGHAKAQYNLGVM 307
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
NG G ++D V +AA G A LG G GVRQ+ A+
Sbjct: 308 YANGQGIRQDDVQ---AVRWYHKAAEQGVAQAQFNLGIMYDQGQGVRQDDAQ 356
>gi|392562202|gb|EIW55383.1| hypothetical protein TRAVEDRAFT_130973 [Trametes versicolor
FP-101664 SS1]
Length = 402
Score = 41.2 bits (95), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
C R + + ++C+ C AV YCS+ACQ DWK HK CA R
Sbjct: 358 CKRCDKTGSDVKQCNRCKAVMYCSKACQRADWK-SHKPACATVSR 401
>gi|356509862|ref|XP_003523663.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
max]
Length = 742
Score = 41.2 bits (95), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADC 374
++CS C +V YCS+ACQ + WK HK C
Sbjct: 69 KKCSRCKSVRYCSQACQQMHWKSEHKVRC 97
>gi|350287419|gb|EGZ68666.1| hypothetical protein NEUTE2DRAFT_118689 [Neurospora tetrasperma FGSC
2509]
Length = 2045
Score = 41.2 bits (95), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 31/66 (46%), Gaps = 9/66 (13%)
Query: 317 AARGGTPGPGLRLC-SHVG-----CGRPETRRH--EFRRCSVCGAVNYCSRACQALDWKL 368
A+ PGP S VG C P+ R + +CS C YCS+ CQ DWK
Sbjct: 1949 ASANAKPGPATATTKSTVGYRCNFCNEPKNRATGGDLNQCSKCKEARYCSKECQVADWK- 2007
Query: 369 RHKADC 374
RHK +C
Sbjct: 2008 RHKKEC 2013
>gi|336465755|gb|EGO53920.1| hypothetical protein NEUTE1DRAFT_86887 [Neurospora tetrasperma FGSC
2508]
Length = 2058
Score = 41.2 bits (95), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 31/66 (46%), Gaps = 9/66 (13%)
Query: 317 AARGGTPGPGLRLC-SHVG-----CGRPETRRH--EFRRCSVCGAVNYCSRACQALDWKL 368
A+ PGP S VG C P+ R + +CS C YCS+ CQ DWK
Sbjct: 1962 ASANAKPGPATATTKSTVGYRCNFCNEPKNRATGGDLNQCSKCKEARYCSKECQVADWK- 2020
Query: 369 RHKADC 374
RHK +C
Sbjct: 2021 RHKKEC 2026
>gi|260783903|ref|XP_002587010.1| hypothetical protein BRAFLDRAFT_103826 [Branchiostoma floridae]
gi|229272144|gb|EEN43021.1| hypothetical protein BRAFLDRAFT_103826 [Branchiostoma floridae]
Length = 110
Score = 41.2 bits (95), Expect = 0.99, Method: Composition-based stats.
Identities = 18/46 (39%), Positives = 23/46 (50%), Gaps = 1/46 (2%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
C + CGRP R+ + C+ C YCSR CQ W + HK C
Sbjct: 37 CCNPKCGRPGYRK-TLKLCAGCKLTRYCSRDCQKQHWSVGHKKCCG 81
>gi|449532447|ref|XP_004173192.1| PREDICTED: putative F-box protein At1g67623-like, partial [Cucumis
sativus]
Length = 240
Score = 41.2 bits (95), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 33/157 (21%), Positives = 68/157 (43%), Gaps = 7/157 (4%)
Query: 25 NDKKRRGFSSGEPKTTYRKKARFSPDVAGKSDLFDALPDDLVVSILCKLSSTARCPSDFV 84
N KR+ S + + R K P S L +LP+DL++ +L K+++++ D V
Sbjct: 3 NQLKRKSEPSFNSQISTRTKKMCRP---FSSSLIVSLPNDLLIEVLAKVAASSYI--DLV 57
Query: 85 NVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSES--AHRFLKLCADAGNVEACYTLG 142
+ K + + + S + + + W+ + + F++ + N E+ Y G
Sbjct: 58 QAKLATKLFLHASNDGYIFQHVSLENKSFRNLLWNNTPQSRSFMETLNNNENPESLYRKG 117
Query: 143 MIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
+ F+ SG S + ++A + A Y V+ +
Sbjct: 118 TMEFFSHGKEASGLSYLKQSAQKGYVDACYVYGVVMY 154
>gi|397563455|gb|EJK43804.1| hypothetical protein THAOC_37714 [Thalassiosira oceanica]
Length = 433
Score = 41.2 bits (95), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 108/301 (35%), Gaps = 39/301 (12%)
Query: 96 LALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSG 155
A+NS L K T A + +W++ L + G + +G
Sbjct: 141 FAVNSAALYKWQVMTLAKRGVSWAQREAGILMIDGMEG--------------LQKQEKAG 186
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
+ KAA + AL+ L+ + G S+ K L +AA LG +A ELG
Sbjct: 187 LEWLNKAAAQNDPSALHELSSLYREGMA-SELGKSQEKANELLLKAANLGFANANSELGD 245
Query: 216 CLQDGY-GVRQNIAEGR-----RFLVQANARELAAVLSSAACQGISTRS---WLTWNPLS 266
C G G ++ E F + E A +L S +L + L+
Sbjct: 246 CYYRGVNGFEEDDDEAYFRASVAFALDGTDEEAAEILGYFHFDAESIPEPSLYLACHYLN 305
Query: 267 QPHPAPHPRNQNPNHNGHVTGIGCPLLSDFGCNVPAPEAHPASRFLAEWF-AARGGTPGP 325
+++ N++ + + L + N +A PA F W +R
Sbjct: 306 IAASEDSSGDESFNYSIALHKLNQHLHDGYKAN--GSDATPALLF---WLRKSRDLGHEN 360
Query: 326 GLRLCSHV---------GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
G RL + C + + ++CS C A YCS+ CQ W+ HK DC
Sbjct: 361 GTRLLKKLESNGQSRCANCSKKVKADEKLKQCSKCRAKWYCSKECQVEAWRAGHKKDCKR 420
Query: 377 A 377
A
Sbjct: 421 A 421
>gi|420616899|ref|ZP_15107605.1| sel1 repeat family protein [Yersinia pestis PY-14]
gi|391494344|gb|EIR49584.1| sel1 repeat family protein [Yersinia pestis PY-14]
Length = 228
Score = 41.2 bits (95), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 51/101 (50%), Gaps = 4/101 (3%)
Query: 137 ACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVA 196
A Y LG + C Q+ G + K+A + +AL+ LA + +N G K D+
Sbjct: 77 AQYELGKYHYTC-QDYDRGLKWLEKSADHKNIKALFLLAQM-YNEGDGVKEDQTKYFSYL 134
Query: 197 LCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
L +AA LG DA E+G+ G GV +N+ E ++ ++A
Sbjct: 135 L--KAAQLGLSDAQVEIGYLYLVGEGVEKNLPEAYQWHIKA 173
>gi|350270102|ref|YP_004881410.1| hypothetical protein OBV_17060, partial [Oscillibacter
valericigenes Sjm18-20]
gi|348594944|dbj|BAK98904.1| hypothetical protein OBV_17060 [Oscillibacter valericigenes
Sjm18-20]
Length = 969
Score = 41.2 bits (95), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 67/165 (40%), Gaps = 37/165 (22%)
Query: 121 SAHRFLKLCADAGNVEACYTL--------GM-------IRFYCLQ-NRGSGASLMAKAAI 164
A R +L AD G++ A Y L GM +R Y + +G ++L
Sbjct: 760 EAARLYRLAADNGDMHAVYNLAVLYDYGRGMPQDQVEAVRLYRIAAEQGQPSALANLGYA 819
Query: 165 SSHAQA-----------LYSLAVIQFNG----------SGGSKNDKDLRAGVALCARAAF 203
+HA+ LY LA + N G ++DL+ V L AA
Sbjct: 820 YNHAEGGLEKDSQEAFRLYRLAAEKGNAVAQCNLGVMYKNGENVERDLQEAVRLYRLAAE 879
Query: 204 LGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSS 248
G++ AL LG C ++G GV Q+ A+ + QA R A SS
Sbjct: 880 QGNLTALNNLGECYENGEGVEQDYAQAMQLYRQAFERGHAFAASS 924
>gi|18400597|ref|NP_565576.1| ubiquitin carboxyl-terminal hydrolase 19 [Arabidopsis thaliana]
gi|75265927|sp|Q9SJA1.2|UBP19_ARATH RecName: Full=Ubiquitin carboxyl-terminal hydrolase 19; AltName:
Full=Deubiquitinating enzyme 19; Short=AtUBP19; AltName:
Full=Ubiquitin thioesterase 19; AltName:
Full=Ubiquitin-specific-processing protease 19
gi|13430806|gb|AAK26025.1|AF360315_1 putative ubiquitin carboxyl terminal hydrolase [Arabidopsis
thaliana]
gi|15810655|gb|AAL07252.1| putative ubiquitin carboxyl terminal hydrolase [Arabidopsis
thaliana]
gi|20197976|gb|AAD23896.2| putative ubiquitin carboxyl terminal hydrolase [Arabidopsis
thaliana]
gi|330252513|gb|AEC07607.1| ubiquitin carboxyl-terminal hydrolase 19 [Arabidopsis thaliana]
Length = 672
Score = 41.2 bits (95), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 5/40 (12%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CG+ T++ CS C +V YCS ACQ DWK HK C
Sbjct: 67 CGKATTKK-----CSRCKSVRYCSAACQTSDWKSGHKLKC 101
>gi|213963117|ref|ZP_03391375.1| Sel1 domain protein repeat-containing protein [Capnocytophaga
sputigena Capno]
gi|213954201|gb|EEB65525.1| Sel1 domain protein repeat-containing protein [Capnocytophaga
sputigena Capno]
Length = 810
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 51/113 (45%), Gaps = 4/113 (3%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGG 184
+L A+ NV A LG + Q+ G + KAA +A+ Y L +NGSG
Sbjct: 683 WLSKAAEQNNVAALTELGAY-YESKQDFGQAVTQYQKAAHREYAEGQYKLGNCYYNGSGL 741
Query: 185 SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
++++ RAA G+ A LG+C G G++Q+ A + QA
Sbjct: 742 ERSNEK---AADYYKRAARQGYAPAQFRLGNCYYHGEGIQQSDARAIDWFDQA 791
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 13/113 (11%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL------MAKAAISSHAQALYSL 174
S + +KL A+ G+V+A L L+ +G S + K A + AQA Y L
Sbjct: 22 SVKKLMKL-AEKGDVKAQSELAEA---YLKGKGVKRSFQDAALWLEKVAETGDAQAQYQL 77
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
A + +G G K+++ G A+AA G+ A +EL C +DG GV Q+
Sbjct: 78 AHLHLDGKGMPKSEEK---GAEWLAKAAENGNQKAEQELALCYRDGRGVAQST 127
>gi|307107210|gb|EFN55453.1| hypothetical protein CHLNCDRAFT_133779 [Chlorella variabilis]
Length = 695
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 29/64 (45%), Gaps = 3/64 (4%)
Query: 326 GLRLCSHVGCGR---PETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLD 382
LR C++ C P RRC CG V YCS AC DW+ H+A C R
Sbjct: 624 ALRSCANPRCTNLSGPSEAALRGRRCGGCGVVRYCSEACSHADWRAGHRAACRLLLRQQQ 683
Query: 383 EEGN 386
E G+
Sbjct: 684 EAGS 687
>gi|398850377|ref|ZP_10607083.1| TPR repeat-containing protein [Pseudomonas sp. GM80]
gi|398248914|gb|EJN34310.1| TPR repeat-containing protein [Pseudomonas sp. GM80]
Length = 443
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 3/83 (3%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KAA +AQA Y+LA FN + G+ D+D +AA GH +A LG + G
Sbjct: 291 KAAEEGYAQAQYNLA---FNYAQGTGVDQDFEQAAGWYRKAAEQGHAEAQNNLGASYERG 347
Query: 221 YGVRQNIAEGRRFLVQANARELA 243
GV QNI + + +A + LA
Sbjct: 348 EGVVQNIKQAVLWYRKAAEQGLA 370
>gi|327304899|ref|XP_003237141.1| hypothetical protein TERG_01862 [Trichophyton rubrum CBS 118892]
gi|326460139|gb|EGD85592.1| hypothetical protein TERG_01862 [Trichophyton rubrum CBS 118892]
Length = 1216
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
CS+ G +P + + RCS C YCS+ CQ DW+ RHK C R
Sbjct: 1166 CSNCGGKKPGSGT-KLLRCSGCKVAEYCSKDCQREDWRARHKLACKMMAR 1214
>gi|299134157|ref|ZP_07027350.1| Sel1 domain protein repeat-containing protein [Afipia sp. 1NLS2]
gi|298590904|gb|EFI51106.1| Sel1 domain protein repeat-containing protein [Afipia sp. 1NLS2]
Length = 364
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 53/117 (45%), Gaps = 10/117 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A + AD GN EA Y L FY +N A L+ AA++ + A A+
Sbjct: 213 AAELFRQAADKGNPEAQYALAT--FYKEGRGVEKNPVEAAKLLGAAALADNLDAEVEYAI 270
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
FNG+G ++D R VAL RAA G A L H L +G QN EG ++
Sbjct: 271 ALFNGTG---VNRDERTAVALLNRAARQGSPIAQNRLAHVLIEGKAAPQNKVEGFKW 324
>gi|241667086|ref|YP_002985170.1| peptidase C14 caspase catalytic subunit p20 [Rhizobium
leguminosarum bv. trifolii WSM1325]
gi|240862543|gb|ACS60208.1| peptidase C14 caspase catalytic subunit p20 [Rhizobium
leguminosarum bv. trifolii WSM1325]
Length = 859
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 54/129 (41%), Gaps = 6/129 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQAL 171
AN +++A +K DAG+V A L + N ++ AA + AL
Sbjct: 555 ANRDAKTAFATIKKAMDAGHVRAIDQLSSLYIVGASVPANPAKANEIVQAAAKKNDPYAL 614
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
Y+ + G G D G+ L ++A LGH A+ ELG+ +G V + G
Sbjct: 615 YTYGKSLYYGRGVKA---DTEQGLKLMLQSADLGHTFAMNELGYIFLNGVNVPADPERGI 671
Query: 232 RFLVQANAR 240
RF AR
Sbjct: 672 RFYEAGLAR 680
>gi|424844565|ref|ZP_18269176.1| Sel1 repeat protein [Jonquetella anthropi DSM 22815]
gi|363986003|gb|EHM12833.1| Sel1 repeat protein [Jonquetella anthropi DSM 22815]
Length = 191
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 65/142 (45%), Gaps = 8/142 (5%)
Query: 103 LSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLM 159
L+ A +K +K + A + +L A G VEA Y L ++ + +N A
Sbjct: 43 LATAWEKGVGVKRD--PSQAFFWCRLAASGGLVEAQYHLSVMYSSGIGTPRNVKEAAKWC 100
Query: 160 AKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQD 219
KAA A + V+ NG G S+ D +AGV AA LG++DA+ L C +
Sbjct: 101 LKAAEGGDPSARFDYGVMCMNGQGVSR---DYQAGVKWFLSAAALGNVDAMNCLSLCYRL 157
Query: 220 GYGVRQNIAEGRRFLVQANARE 241
G GV ++ +L +A ++
Sbjct: 158 GMGVDRSPQTADVWLRRAQKQQ 179
>gi|356556906|ref|XP_003546761.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 15-like [Glycine
max]
Length = 989
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
RCS C V YCS CQ + W+L HK +C E
Sbjct: 86 RCSRCKIVRYCSGNCQIIHWRLNHKQECQQLE 117
>gi|440291922|gb|ELP85164.1| hypothetical protein EIN_082260 [Entamoeba invadens IP1]
Length = 1077
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 11/111 (9%)
Query: 122 AHRFLKLCADAGNVEACYTLG---MIRFYCLQNRGSGASLMAKAAISSH---AQALYSLA 175
A + K A+ GNV+ Y + + F C N+ ++ K SH AQ L LA
Sbjct: 684 AFKCFKKAAELGNVKGQYNMAYCLLNGFGCATNKARAVDILQKILKQSHLVNAQLL--LA 741
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
F G G ++ D+ V L +AA L + +A+ LG C G GV++N
Sbjct: 742 NCYFKGEGVAR---DMAQAVNLYVKAACLRNSNAMNNLGSCFFSGNGVKKN 789
>gi|356502067|ref|XP_003519843.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 17-like [Glycine
max]
Length = 938
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 19/30 (63%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP 376
RCS C AV YCS CQ W+L HK +C P
Sbjct: 95 RCSRCKAVRYCSGKCQISHWRLGHKDECCP 124
>gi|346974529|gb|EGY17981.1| MYND-type zinc finger protein MUB1 [Verticillium dahliae VdLs.17]
Length = 638
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 2/81 (2%)
Query: 291 PLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSV 350
PL+ +F +P+ + + + T G G+R C++ CG+ E +F +C
Sbjct: 552 PLVENFTVRHHSPDMQYWAGVVMRNLCRKDDTRG-GIRQCAYYQCGKWEEFTRQFAKCRR 610
Query: 351 CGAVNYCSRACQALDWKL-RH 370
C YCS+ CQ W RH
Sbjct: 611 CRRTKYCSKECQKSAWAFHRH 631
>gi|85714410|ref|ZP_01045398.1| sel1-like repeat protein [Nitrobacter sp. Nb-311A]
gi|85698857|gb|EAQ36726.1| sel1-like repeat protein [Nitrobacter sp. Nb-311A]
Length = 342
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 53/122 (43%), Gaps = 10/122 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A L+ ADAG+ EA Y L FY +N L+ A+++ + A A+
Sbjct: 191 AAELLRFAADAGSPEAQYALAT--FYKEGTGVEKNLYKSVRLLQAASLAGNVDAEVEYAI 248
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
FNGSG KN+ A V+L +AA A L L G G + EG ++ +
Sbjct: 249 ALFNGSGTGKNEA---AAVSLLRKAARQNSAIAQNRLARVLVTGMGAPMDKIEGLKWHIV 305
Query: 237 AN 238
A
Sbjct: 306 AK 307
>gi|378726188|gb|EHY52647.1| hypothetical protein HMPREF1120_00856 [Exophiala dermatitidis
NIH/UT8656]
Length = 624
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 29/62 (46%), Gaps = 1/62 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
G+R C++ CG+ E +F +C C YCS+ CQ W + H+ C +L
Sbjct: 564 GIRQCANYQCGKWEEYTRQFAKCRRCRRTKYCSKECQRESWSM-HRHWCHLENVYLSRRS 622
Query: 386 NG 387
N
Sbjct: 623 NA 624
>gi|421656441|ref|ZP_16096748.1| Sel1 repeat protein [Acinetobacter baumannii Naval-72]
gi|408505425|gb|EKK07148.1| Sel1 repeat protein [Acinetobacter baumannii Naval-72]
Length = 159
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KAA + A++ Y+LA++ NG G K DL V ++A G D+ +LG +G
Sbjct: 69 KAADQNDAKSQYNLAIMYLNGYGAKK---DLSKSVEYYRKSALQGDTDSQLQLGIRYLNG 125
Query: 221 YGVRQNIAEGRRFLVQA 237
GV +NI + + +A
Sbjct: 126 EGVEKNIETAKEWFQKA 142
>gi|428171965|gb|EKX40878.1| hypothetical protein GUITHDRAFT_113138 [Guillardia theta CCMP2712]
Length = 518
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 54/126 (42%), Gaps = 7/126 (5%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY----CLQNRGSGASLMAKAAISSHA 168
++ E +FL A G+ EA + LG + F+ + +AA H+
Sbjct: 382 MREEGKEEDMKKFLSKAAGLGDAEALFCLGDMHFHGSDGFQMDHPKAFRFFQEAARLQHS 441
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
A+ + V+ +NG G K D+ R+A LG+I+A+R L G G +N A
Sbjct: 442 AAMCCVGVMCYNGFGVKK---DVNKAFLAYQRSAELGNIEAMRNLAAMHLSGEGCEKNEA 498
Query: 229 EGRRFL 234
R L
Sbjct: 499 TARYLL 504
>gi|119474305|ref|XP_001259028.1| MYND domain protein, putative [Neosartorya fischeri NRRL 181]
gi|119407181|gb|EAW17131.1| MYND domain protein, putative [Neosartorya fischeri NRRL 181]
Length = 250
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
C + E+ R CS C +V YCSR CQ DWK HK CA
Sbjct: 15 CKKAESESVPLRNCSRCKSVVYCSRDCQKADWKT-HKKVCA 54
>gi|356518197|ref|XP_003527768.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
max]
Length = 734
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 32/67 (47%), Gaps = 6/67 (8%)
Query: 308 ASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWK 367
AS++ E A PGP L H C R ++CS C +V YCS+ACQ WK
Sbjct: 34 ASKYF-EIEATFHAMPGP---LTDHSLCAV--CRNPANKKCSRCKSVRYCSQACQQAHWK 87
Query: 368 LRHKADC 374
HK C
Sbjct: 88 SEHKMRC 94
>gi|350272852|ref|YP_004884160.1| hypothetical protein OBV_44560 [Oscillibacter valericigenes
Sjm18-20]
gi|348597694|dbj|BAL01655.1| hypothetical protein OBV_44560 [Oscillibacter valericigenes
Sjm18-20]
Length = 933
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 69/146 (47%), Gaps = 16/146 (10%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQ----NRGSGASLMAKAAISSHAQALYSLA 175
++A +L A GNV A Y LG + CL+ N + M KAA + +A A Y+L
Sbjct: 628 QAAVSYLGQSAQLGNVNAQYLLGKV---CLETGIGNPMQAVAWMTKAAEAGNAGAQYALG 684
Query: 176 VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLV 235
+ +G+ +KD++ VA+ AA + A +LG G + +N+ E ++L
Sbjct: 685 KLYRDGT---HVEKDIQKAVAMFTVAAKQKNEYAAYQLGRLYIAGTDIPKNVPEAVKWLT 741
Query: 236 QANARELAAVLSSAACQGISTRSWLT 261
L++ L +A Q + +LT
Sbjct: 742 ------LSSDLGNAYAQYALAKLYLT 761
>gi|119497225|ref|XP_001265374.1| cell cycle inhibitor Nif1, putative [Neosartorya fischeri NRRL 181]
gi|119413536|gb|EAW23477.1| cell cycle inhibitor Nif1, putative [Neosartorya fischeri NRRL 181]
Length = 759
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 3/90 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A M KA + A ++Y L V NG G + DK L + A G DAL E G
Sbjct: 619 AKEMQKAYRAQFALSIYELGVSHLNG-WGIEQDKSL--ALRCFEIAGQWGDADALAEAGF 675
Query: 216 CLQDGYGVRQNIAEGRRFLVQANARELAAV 245
C +G G ++++ + +F QA A+ ++ V
Sbjct: 676 CYAEGIGCKKDMKKAAKFYRQAEAKGISMV 705
>gi|307107217|gb|EFN55460.1| hypothetical protein CHLNCDRAFT_133802 [Chlorella variabilis]
Length = 281
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 334 GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKADCAPAE 378
GCGR T ++CS C YCSRACQ WK HK +CA E
Sbjct: 226 GCGRQAT---SLKKCSRCRVAAYCSRACQVHHWKEGGHKQECAQLE 268
>gi|78188258|ref|YP_378596.1| Sel1 repeat-containing protein [Chlorobium chlorochromatii CaD3]
gi|78170457|gb|ABB27553.1| Sel1-like repeat [Chlorobium chlorochromatii CaD3]
Length = 271
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 8/121 (6%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQN----RGSGASLMAKAAISSHAQALYSLAV 176
A ++ +L A+ GNVEA LG I FY +N AA ++A A L
Sbjct: 133 EAIKWFRLAAEQGNVEAQAMLGSI-FYVGKNVQRDEFEAIKWFKLAAQQNYAYAQMMLGT 191
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ 236
+ G G + D + AA G+++A +LG +GYGVRQN A + + +
Sbjct: 192 MYATGEGVRQ---DYVEAIKWYRFAAEQGNVEAQYDLGLLYLNGYGVRQNKAIAKEWFGK 248
Query: 237 A 237
A
Sbjct: 249 A 249
>gi|392564901|gb|EIW58079.1| hypothetical protein TRAVEDRAFT_150742 [Trametes versicolor
FP-101664 SS1]
Length = 997
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 27/50 (54%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C+++ CG+ E EF +C C YC + CQ+ W H+ C+
Sbjct: 710 GIRQCANMLCGKWERFPREFAKCRRCRKAKYCGKECQSTAWSEGHRFWCS 759
>gi|443702950|gb|ELU00773.1| hypothetical protein CAPTEDRAFT_224225 [Capitella teleta]
Length = 1175
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
C R T + +RCS C YC R CQ DW +RHK +C
Sbjct: 678 CRRDCTANMQLKRCSKCFKTAYCDRQCQTQDW-VRHKPNC 716
>gi|242014702|ref|XP_002428024.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212512543|gb|EEB15286.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 420
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 58/136 (42%), Gaps = 18/136 (13%)
Query: 91 KRMNGLALNSLVLSKASKKTFAIKANNW----------SESAHRFLKLCADAGNVEACYT 140
K+ G LN VL KA +K NNW + A F AD N +
Sbjct: 185 KQSTGEDLN--VLEKAHRKVIGT-INNWLGLQHMECGNFKQALEFFIQSADEENPSGTFN 241
Query: 141 LGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVAL 197
LG+ L QN ASL +A HA A+Y+L V G GG + D D RA L
Sbjct: 242 LGICHERGLGTSQNYKKAASLYQRATDLGHASAMYNLGVFYARGLGGFEPDVD-RA-RQL 299
Query: 198 CARAAFLGHIDALREL 213
AA LG +A++ L
Sbjct: 300 FKSAARLGQKEAVKAL 315
>gi|308807931|ref|XP_003081276.1| MYND domain protein, putative (ISS) [Ostreococcus tauri]
gi|116059738|emb|CAL55445.1| MYND domain protein, putative (ISS) [Ostreococcus tauri]
Length = 307
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 21/30 (70%), Gaps = 1/30 (3%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+RC+ C AV YCS+ CQ DWK RHK CA
Sbjct: 94 KRCARCRAVRYCSKECQHADWK-RHKTSCA 122
>gi|157875169|ref|XP_001685989.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129062|emb|CAJ06614.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 1035
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 29/62 (46%), Gaps = 10/62 (16%)
Query: 323 PGPGLRL---------CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKA 372
P PG L SH CG RR RC C AV+YC + QALDWK H+
Sbjct: 623 PAPGYTLHVPSVCFSAISHKKCGWCGRRREVLLRCGGCKAVSYCCKRHQALDWKEGSHRL 682
Query: 373 DC 374
+C
Sbjct: 683 EC 684
>gi|67515593|ref|XP_657682.1| hypothetical protein AN0078.2 [Aspergillus nidulans FGSC A4]
gi|74582073|sp|O42631.1|MUB1_EMENI RecName: Full=MYND-type zinc finger protein samB; AltName:
Full=Suppressor of anucleate metulae protein B
gi|2661416|emb|CAA04448.1| SamB protein [Emericella nidulans]
gi|40746100|gb|EAA65256.1| hypothetical protein AN0078.2 [Aspergillus nidulans FGSC A4]
gi|259489732|tpe|CBF90245.1| TPA: Putative uncharacterized proteinSamB protein ;
[Source:UniProtKB/TrEMBL;Acc:O42631] [Aspergillus
nidulans FGSC A4]
Length = 590
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 27/50 (54%), Gaps = 1/50 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G+R C++ CG+ E +F +C C YCS+ CQ W L H+ CA
Sbjct: 540 GIRQCANYKCGKWEEFTRQFAKCRRCRRTKYCSKDCQKAAW-LYHRHWCA 588
>gi|393237067|gb|EJD44612.1| hypothetical protein AURDEDRAFT_114308 [Auricularia delicata
TFB-10046 SS5]
Length = 694
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 25/48 (52%), Gaps = 3/48 (6%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
LR+ S CG T RRC C A+ YC R CQA DW+ H+ C
Sbjct: 645 LRMHSCGHCGIDST---VLRRCGACKAIRYCGRDCQAEDWRQDHQHSC 689
>gi|359476454|ref|XP_002271840.2| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Vitis
vinifera]
Length = 940
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADC 374
++CS C AV YCS+ CQA WK HK C
Sbjct: 96 KKCSGCKAVRYCSQTCQATHWKSGHKTKC 124
>gi|307213312|gb|EFN88764.1| Ankyrin repeat and MYND domain-containing protein 2 [Harpegnathos
saltator]
Length = 406
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 29/61 (47%), Gaps = 9/61 (14%)
Query: 333 VGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNG-DVDV 391
V CG + ++CS C AV YC R CQ L W HK CA R NG VDV
Sbjct: 336 VTCGEDKA----IKKCSKCKAVQYCDRECQRLHW-FMHKKACA---RLGQSSNNGKSVDV 387
Query: 392 D 392
D
Sbjct: 388 D 388
>gi|195017279|ref|XP_001984571.1| GH14954 [Drosophila grimshawi]
gi|193898053|gb|EDV96919.1| GH14954 [Drosophila grimshawi]
Length = 399
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 34/86 (39%), Gaps = 8/86 (9%)
Query: 290 CPLLSDFGCNVPAPEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCS 349
C + + A +A PA L G CS G +P+ ++CS
Sbjct: 290 CTIFRQIVSQLSAKDAPPALGLLRNAINGMRGFVDES-SYCSSCGQEKPD------KKCS 342
Query: 350 VCGAVNYCSRACQALDWKLRHKADCA 375
C AV YC R CQ L W HK CA
Sbjct: 343 KCKAVQYCDRECQRLHW-FMHKKSCA 367
>gi|302653465|ref|XP_003018558.1| hypothetical protein TRV_07403 [Trichophyton verrucosum HKI 0517]
gi|291182212|gb|EFE37913.1| hypothetical protein TRV_07403 [Trichophyton verrucosum HKI 0517]
Length = 1143
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CS+ G RP + + RCS C YCS+ CQ DW RHK C
Sbjct: 1093 CSNCGGERPGSGT-KLLRCSGCKVAEYCSKDCQREDWMARHKLAC 1136
>gi|401428002|ref|XP_003878484.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494732|emb|CBZ30035.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 1025
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 29/62 (46%), Gaps = 10/62 (16%)
Query: 323 PGPGLRL---------CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKA 372
P PG L SH CG RR RC C AV+YC + QALDWK H+
Sbjct: 613 PAPGYTLHVPPVCFSAISHKKCGWCGRRREVLLRCGGCKAVSYCCKRHQALDWKEGSHRL 672
Query: 373 DC 374
+C
Sbjct: 673 EC 674
>gi|396477210|ref|XP_003840223.1| hypothetical protein LEMA_P110090.1 [Leptosphaeria maculans JN3]
gi|312216794|emb|CBX96744.1| hypothetical protein LEMA_P110090.1 [Leptosphaeria maculans JN3]
Length = 907
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CGR E +F +C C YCS+ CQ W
Sbjct: 856 GIRQCAYYQCGRWEEYTRQFAKCRRCRRTKYCSKECQKSAW 896
>gi|323453765|gb|EGB09636.1| hypothetical protein AURANDRAFT_24370, partial [Aureococcus
anophagefferens]
Length = 147
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 16/128 (12%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLA--- 175
++ A + K + GNV+A LG L G G L + A LY +A
Sbjct: 16 AKKAAKIYKRAVELGNVDAMVNLGF-----LLETGDGVKLDVR-----KANQLYKMAAEL 65
Query: 176 ---VIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRR 232
Q+N + + A + +A G+ +A LG CL++GYGV +++ E +R
Sbjct: 66 GDATAQYNLGNNNARAGNFDAAFSYFKSSASQGYFNAFYGLGKCLENGYGVDRDLDEAKR 125
Query: 233 FLVQANAR 240
+ + A+
Sbjct: 126 WYARVAAK 133
>gi|159130505|gb|EDP55618.1| cell cycle inhibitor Nif1, putative [Aspergillus fumigatus A1163]
Length = 756
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 3/90 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A M KA + A ++Y L V NG G + DK L + A G DAL E G
Sbjct: 616 AKEMQKAYRAQFALSIYELGVSHLNG-WGIEQDKSL--ALRCFEIAGQWGDADALAEAGF 672
Query: 216 CLQDGYGVRQNIAEGRRFLVQANARELAAV 245
C +G G ++++ + +F QA A+ ++ V
Sbjct: 673 CYAEGIGCKKDMKKAAKFYRQAEAKGISMV 702
>gi|145519996|ref|XP_001445859.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413325|emb|CAK78462.1| unnamed protein product [Paramecium tetraurelia]
Length = 476
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 8/49 (16%)
Query: 348 CSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDVDVDLNDA 396
CS C + YCS+ C+ DW L HK +C P + G +LND+
Sbjct: 22 CSYCQNMKYCSQKCRDTDWTLSHKNNCHPVK--------GKSITELNDS 62
>gi|109073369|ref|XP_001086056.1| PREDICTED: programmed cell death protein 2 isoform 5 [Macaca
mulatta]
Length = 311
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 98 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQTCA 140
>gi|169846458|ref|XP_001829944.1| hypothetical protein CC1G_04633 [Coprinopsis cinerea okayama7#130]
gi|116508971|gb|EAU91866.1| hypothetical protein CC1G_04633 [Coprinopsis cinerea okayama7#130]
Length = 256
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCA 375
RCSVC +YCS+ CQ L WK HK C+
Sbjct: 39 RCSVCKDYSYCSQKCQKLHWKQHHKWGCS 67
>gi|398380210|ref|ZP_10538328.1| TPR repeat-containing protein [Rhizobium sp. AP16]
gi|397721526|gb|EJK82074.1| TPR repeat-containing protein [Rhizobium sp. AP16]
Length = 393
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 6/108 (5%)
Query: 125 FLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+ + AD GNV+A Y L ++ ++ G + KAA +A Y+LAV+ +G
Sbjct: 184 WYRKAADQGNVDAQYNLAIMYDSGEGITKDSGQALAWYRKAADQGKIEAQYNLAVMYRDG 243
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
+G KD V +AA G DA LG DG G+ ++ E
Sbjct: 244 AGVP---KDGAQAVTWFRKAADQGDADAQYNLGTMYADGDGIAEDDVE 288
>gi|194751029|ref|XP_001957829.1| GF10604 [Drosophila ananassae]
gi|190625111|gb|EDV40635.1| GF10604 [Drosophila ananassae]
Length = 407
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 25/46 (54%), Gaps = 7/46 (15%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
CS G +P+ ++CS C AV YC R CQ L W HK +CA
Sbjct: 335 CSTCGAEKPD------KKCSKCRAVQYCDRECQRLHW-FMHKKNCA 373
>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 495
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 343 HEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+E + C+ C V YC R CQ+ DWK H +C
Sbjct: 72 NELKACTGCRVVKYCDRTCQSKDWKFAHSLEC 103
>gi|197106231|ref|YP_002131608.1| localization factor podJL (Polar organelle development protein)
[Phenylobacterium zucineum HLK1]
gi|196479651|gb|ACG79179.1| localization factor podJL (Polar organelle development protein)
[Phenylobacterium zucineum HLK1]
Length = 702
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 54/119 (45%), Gaps = 7/119 (5%)
Query: 126 LKLCADAGNVEACYTLGMI----RFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
LK A+ G A + L + R +QN A+AA A+++L + F G
Sbjct: 567 LKAVAEDGYAPAQFYLAQLYESGRLGVVQNLAEARRWTARAAEGGEPNAMHNLGLYFFRG 626
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
GG + DL + +AA G +D+ LG Q G GV+++ AE ++ A A+
Sbjct: 627 EGGPQ---DLASAAQWFRKAAEAGVVDSQYNLGLMYQAGSGVQRDPAEALKWFSLAAAQ 682
>gi|397605791|gb|EJK59116.1| hypothetical protein THAOC_20701 [Thalassiosira oceanica]
Length = 369
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 2/87 (2%)
Query: 151 NRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDAL 210
N +++ + +A+Y L + F+G G + KD+R + L A AA LG I AL
Sbjct: 201 NDADALAMIQARVLKRDPEAIYYLGMKYFHGHLGLQ--KDMRRAIELWAEAAELGSIQAL 258
Query: 211 RELGHCLQDGYGVRQNIAEGRRFLVQA 237
LG G GV+Q+ A+ +A
Sbjct: 259 YNLGAAYYQGEGVQQDKAKAVEIFERA 285
>gi|392587935|gb|EIW77268.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 656
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 4/62 (6%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDV 389
CS+ P ++ +RC+ C V YC CQ DW+L HK +C + W + D+
Sbjct: 110 CSYCANAAP---KNPLKRCTGCQRVRYCDAECQKSDWRL-HKHECGALQNWAKAAPSADL 165
Query: 390 DV 391
V
Sbjct: 166 AV 167
>gi|410688509|ref|YP_006961773.1| truncated hypothetical protein, Sel1 domain protein [Psychrobacter
sp. DAB_AL60]
gi|380861025|gb|AFF18227.1| truncated hypothetical protein, Sel1 domain protein [Psychrobacter
sp. DAB_AL60]
Length = 265
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 64/139 (46%), Gaps = 9/139 (6%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A R+L+ A+ +V+A + +G++ F Q+ + KAA HA A Y LA
Sbjct: 69 EAFRWLEKAANQDHVDAQFQVGIMCFRGTGTRQDEARAVNWYKKAANQGHANAQYFLADR 128
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+NG ++ C +AA ++A +LG +DG GV Q+ A+ + +A
Sbjct: 129 FYNGIALEQS---YIKAFEWCQKAANQNLVEAQIDLGDMYKDGKGVEQDYAKAFEWYQKA 185
Query: 238 ---NARELAAVLSSAACQG 253
+ + A+++ C G
Sbjct: 186 VDFDDPKAKALIAEMYCHG 204
>gi|413936571|gb|AFW71122.1| putative ubiquitin carboxyl-terminal hydrolase superfamily protein
[Zea mays]
Length = 821
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 28/61 (45%), Gaps = 10/61 (16%)
Query: 323 PGPGLRLCSH--VGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
P P LR H C P R CS C +V YCS CQ + W+ HK C ++W
Sbjct: 77 PAPSLRQEYHECATCHAPAKTR-----CSRCKSVRYCSGKCQIVHWRQGHKETC---QKW 128
Query: 381 L 381
L
Sbjct: 129 L 129
>gi|403216005|emb|CCK70503.1| hypothetical protein KNAG_0E02420 [Kazachstania naganishii CBS
8797]
Length = 624
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 493 GVRQCANFACGKWEEYPRQFAKCRRCKRTKYCSRKCQLKAW 533
>gi|397620582|gb|EJK65792.1| hypothetical protein THAOC_13313 [Thalassiosira oceanica]
Length = 263
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/177 (23%), Positives = 79/177 (44%), Gaps = 28/177 (15%)
Query: 71 CKLSSTAR-----CPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANNW------- 118
C L++T R CP F I + LA+ +SK + + + +
Sbjct: 67 CGLAATRRGMYNSCP--FCRTNIPTDDASELAMIQKRVSKGDAEAIYLLGDKYYFGDLGL 124
Query: 119 SESAHRFLKL---CADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALY 172
+++ R ++L A+ G+V+A + LG I +Y +++ G L +AAI H Q +
Sbjct: 125 AKNVPRAIELWTEAAELGSVDAHHNLGRIYYYGDGIEEDKPRGIRLWQQAAIKGHVQGRH 184
Query: 173 SLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
SL V+++N ++ + A+ +G+ +L E+ H ++G + AE
Sbjct: 185 SLGVVEYNNG-----HYEVAVQRMISAK---MGYERSLNEIKHMFKEGQATKAQYAE 233
>gi|397572083|gb|EJK48099.1| hypothetical protein THAOC_33133 [Thalassiosira oceanica]
Length = 207
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 29/49 (59%)
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD R V L AA +G IDAL LG + G GV+Q++AEG F +A
Sbjct: 66 KDTRMAVKLWEEAAEIGSIDALFNLGLAHERGEGVKQDMAEGAEFYKKA 114
>gi|340386762|ref|XP_003391877.1| PREDICTED: LRP2-binding protein-like, partial [Amphimedon
queenslandica]
Length = 304
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 49/106 (46%), Gaps = 6/106 (5%)
Query: 133 GNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKD-- 190
GN A ++L I F G S + AA AQALY LAVI++ G G++ +
Sbjct: 42 GNAMAQFSLAKI-FLSQGQTGKAMSYLTSAARQEDAQALYELAVIRYQGLQGTEASSEEG 100
Query: 191 --LRAGVALCARAAFLGHIDALR-ELGHCLQDGYGVRQNIAEGRRF 233
L VA C A + A + +G G+GV+Q+ E ++
Sbjct: 101 FKLMLKVAQCKGAKNTDLVSAAQFNIGRAYFQGFGVKQDPEEALKW 146
>gi|255078648|ref|XP_002502904.1| predicted protein [Micromonas sp. RCC299]
gi|226518170|gb|ACO64162.1| predicted protein [Micromonas sp. RCC299]
Length = 843
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 326 GLRLCSHVGC-GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
GLR CS+ C R +T + RCS C A YCS CQ W+ H+ C P
Sbjct: 526 GLRRCSNPRCVEREDTIAGKMLRCSRCKAACYCSSHCQRTHWRDGHRESCKP 577
>gi|70990354|ref|XP_750026.1| cell cycle inhibitor Nif1 [Aspergillus fumigatus Af293]
gi|66847658|gb|EAL87988.1| cell cycle inhibitor Nif1, putative [Aspergillus fumigatus Af293]
Length = 756
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 3/90 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A M KA + A ++Y L V NG G + DK L + A G DAL E G
Sbjct: 616 AKEMQKAYRAQFALSIYELGVSHLNG-WGIEQDKSL--ALRCFEIAGQWGDADALAEAGF 672
Query: 216 CLQDGYGVRQNIAEGRRFLVQANARELAAV 245
C +G G ++++ + +F QA A+ ++ V
Sbjct: 673 CYAEGIGCKKDMKKAAKFYRQAEAKGISMV 702
>gi|413936572|gb|AFW71123.1| putative ubiquitin carboxyl-terminal hydrolase superfamily protein
[Zea mays]
Length = 890
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 28/61 (45%), Gaps = 10/61 (16%)
Query: 323 PGPGLRLCSH--VGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
P P LR H C P R CS C +V YCS CQ + W+ HK C ++W
Sbjct: 77 PAPSLRQEYHECATCHAPAKTR-----CSRCKSVRYCSGKCQIVHWRQGHKETC---QKW 128
Query: 381 L 381
L
Sbjct: 129 L 129
>gi|302142049|emb|CBI19252.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 3/43 (6%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP---AERWLDEEGN 386
RC+ C AV YCS CQ + W+ HK +C P + +DE N
Sbjct: 79 RCARCKAVRYCSGKCQIIHWRQGHKEECNPPSITHQIIDESIN 121
>gi|159481955|ref|XP_001699040.1| hypothetical protein CHLREDRAFT_152090 [Chlamydomonas reinhardtii]
gi|158273303|gb|EDO99094.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1161
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 25/55 (45%), Gaps = 3/55 (5%)
Query: 323 PGPGLRLCSHVGC---GRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
P P LR+C + C G + + C C V YC CQ WK HKADC
Sbjct: 1082 PLPLLRVCGNPACERFGSGSEAALDLKLCGRCRCVRYCCSECQTTHWKAGHKADC 1136
>gi|153002642|ref|YP_001368323.1| Sel1 domain-containing protein [Shewanella baltica OS185]
gi|151367260|gb|ABS10260.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
OS185]
Length = 491
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
S+ A ++L+ A+ EA LG++ +Y +N ++KAA + + A Y LA++
Sbjct: 291 SKLALQYLQRAAEGEQTEAMANLGVL-YYQQKNLTQAYHFISKAAQAGYPHAQYHLALML 349
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFLVQ 236
NG G + +D+ A A AA G +DA L H L D ++ + +L Q
Sbjct: 350 ANGDGCT---RDMIASEYWMAEAAEQGQLDAMLTRAQHMLNDDNAFGSDLTQAEDYLRQ 405
>gi|355562212|gb|EHH18844.1| hypothetical protein EGK_15530, partial [Macaca mulatta]
Length = 292
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 27/55 (49%), Gaps = 7/55 (12%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
G LC GC P+T CS C YCS+ Q LDW+L HK CA + +
Sbjct: 79 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQTCAQPDHF 126
>gi|340371217|ref|XP_003384142.1| PREDICTED: LRP2-binding protein-like [Amphimedon queenslandica]
Length = 329
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 49/106 (46%), Gaps = 6/106 (5%)
Query: 133 GNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKD-- 190
GN A ++L I F G S + AA AQALY LAVI++ G G++ +
Sbjct: 42 GNAMAQFSLAKI-FLSQGQTGKAMSYLTSAARQEDAQALYELAVIRYQGLQGTEASSEEG 100
Query: 191 --LRAGVALCARAAFLGHIDALR-ELGHCLQDGYGVRQNIAEGRRF 233
L VA C A + A + +G G+GV+Q+ E ++
Sbjct: 101 FKLMLKVAQCKGAKNTDLVSAAQFNIGRAYFQGFGVKQDPEEALKW 146
>gi|238591587|ref|XP_002392650.1| hypothetical protein MPER_07737 [Moniliophthora perniciosa FA553]
gi|215459009|gb|EEB93580.1| hypothetical protein MPER_07737 [Moniliophthora perniciosa FA553]
Length = 328
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 337 RPETRRHEFRRCSVCGAVNYCSRACQALDWK 367
+P+ + +++RCS C A YCS++CQ DWK
Sbjct: 213 KPKHKAVKYKRCSRCYAATYCSKSCQIFDWK 243
>gi|169603682|ref|XP_001795262.1| hypothetical protein SNOG_04849 [Phaeosphaeria nodorum SN15]
gi|160706436|gb|EAT87240.2| hypothetical protein SNOG_04849 [Phaeosphaeria nodorum SN15]
Length = 472
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 35/77 (45%), Gaps = 3/77 (3%)
Query: 314 EWFAARGGTPGPGLRLCSHVG--CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK 371
E A GG+ P L L + G C R +C+ C YCS+ CQ LDW HK
Sbjct: 9 ERILASGGSDVPSLPLGAVRGILCFRCFEPSENILKCAGCRRAGYCSKECQKLDWTAVHK 68
Query: 372 ADCAPAERWLDEEGNGD 388
C +R ++EE D
Sbjct: 69 KQCKILQR-INEEDLKD 84
>gi|397612864|gb|EJK61918.1| hypothetical protein THAOC_17496, partial [Thalassiosira oceanica]
Length = 515
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 62/255 (24%), Positives = 97/255 (38%), Gaps = 36/255 (14%)
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG 220
KAA + ALY L++ +G + +A L AA LG+ A +L + G
Sbjct: 235 KAAAQNFPPALYDLSIFHRSGIAFVLEKSEEKAN-ELLLEAANLGYAKANSDLSKVFKFG 293
Query: 221 YG-VRQNIAEGRR-----FLVQANARELAAVLSSAACQGISTRSWLTWNPLSQPHPAPHP 274
+G +++ E F + + + A +L+ + S S P+ A +
Sbjct: 294 FGGFEEDLGEAYSRASVAFAIDRSNEDAAKMLAIMFHENASEYS---------PYLACYY 344
Query: 275 RNQNPNHNGHVTGIGCPLLSD---------FGC-NVPAPEAHPASRFLAEWFAARGGTPG 324
N N +G C L S C N+ A PA+ F A G G
Sbjct: 345 LNIAANE--ETSGDSCFLYSQALIEMNSHLHNCLNLIGFNATPAAFFWARKSCDMGFDCG 402
Query: 325 -------PGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
G R CG+ ++++CS C A Y S+ CQ W HK DC A
Sbjct: 403 REMLKKWEGERQSFCANCGKNAQSGEKYKQCSNCRAQWYVSKECQVEAWNAGHKKDCKRA 462
Query: 378 ERWLDEEG-NGDVDV 391
+ E+ N +++V
Sbjct: 463 RILMFEDYLNAELEV 477
>gi|390357051|ref|XP_003728916.1| PREDICTED: uncharacterized protein LOC100893269 [Strongylocentrotus
purpuratus]
Length = 257
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 21/35 (60%), Gaps = 1/35 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
+RC C + YCSR CQ DW HK +C+P +R
Sbjct: 26 LKRCKSCKSTFYCSRECQINDWPT-HKLECSPKDR 59
>gi|397614166|gb|EJK62635.1| hypothetical protein THAOC_16742 [Thalassiosira oceanica]
Length = 218
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 325 PGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
G LC++ C + ++++CS C A YCS+ CQ W+ HK DC A
Sbjct: 156 TGQSLCAN--CAKETETGEKYKQCSKCRAQWYCSKECQVESWRTGHKKDCKRA 206
>gi|302915433|ref|XP_003051527.1| hypothetical protein NECHADRAFT_122473 [Nectria haematococca mpVI
77-13-4]
gi|256732466|gb|EEU45814.1| hypothetical protein NECHADRAFT_122473 [Nectria haematococca mpVI
77-13-4]
Length = 1179
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CG+PE+ ++C C YCS CQ DW+ +H+ +C
Sbjct: 1138 CGKPESDSVTLKKCMRCQRAKYCSGECQKKDWR-KHRGEC 1176
>gi|169846422|ref|XP_001829926.1| hypothetical protein CC1G_04615 [Coprinopsis cinerea okayama7#130]
gi|116508953|gb|EAU91848.1| hypothetical protein CC1G_04615 [Coprinopsis cinerea okayama7#130]
Length = 257
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCA 375
RCSVC +YCS+ CQ L WK HK C+
Sbjct: 39 RCSVCKDYSYCSQKCQKLHWKQHHKWGCS 67
>gi|323336296|gb|EGA77567.1| Mub1p [Saccharomyces cerevisiae Vin13]
Length = 487
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 375 GVRQCANFSCGKWEDFPRQFAKCRRCKRTKYCSRKCQLKAW 415
>gi|170118408|ref|XP_001890383.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164634653|gb|EDQ98981.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1168
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 26/49 (53%), Gaps = 8/49 (16%)
Query: 328 RLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
R+C G G+P+ + CSVC V YCS CQ DWK HK C P
Sbjct: 1128 RVCR--GKGKPK-----IKECSVCQKVRYCSPECQKKDWK-AHKPKCKP 1168
>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
1015]
Length = 386
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+ C CG V YC + CQA DWKL H +C+
Sbjct: 66 LKGCRGCGIVKYCDKTCQAKDWKLGHSLECS 96
>gi|323307741|gb|EGA61004.1| Mub1p [Saccharomyces cerevisiae FostersO]
Length = 487
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 375 GVRQCANFSCGKWEDFPRQFAKCRRCKRTKYCSRKCQLKAW 415
>gi|449540132|gb|EMD31128.1| hypothetical protein CERSUDRAFT_120095 [Ceriporiopsis subvermispora
B]
Length = 518
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 39/83 (46%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
+L+ + A H + Y LA NG G K +D L AA GH DA G C
Sbjct: 197 ALLKRLADRGHMPSQYFLADCYANGIGTHKGRQDFDRAFPLFVLAAKHGHPDAAYRAGTC 256
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 257 CENGWGCRREAAKAVQFYRKAAA 279
>gi|397616941|gb|EJK64208.1| hypothetical protein THAOC_15081, partial [Thalassiosira oceanica]
Length = 467
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 35/73 (47%), Gaps = 2/73 (2%)
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A+Y L + G G +ND R V L AA LG IDAL LG + G GV +N A+
Sbjct: 95 AIYHLGGNYWGGKHGLQND--FRKAVKLWTEAAELGSIDALYNLGVAYESGEGVEKNRAK 152
Query: 230 GRRFLVQANAREL 242
F R++
Sbjct: 153 ATEFYKGRPCRDM 165
>gi|302675362|ref|XP_003027365.1| hypothetical protein SCHCODRAFT_113792 [Schizophyllum commune H4-8]
gi|300101051|gb|EFI92462.1| hypothetical protein SCHCODRAFT_113792 [Schizophyllum commune H4-8]
Length = 529
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 22/46 (47%), Gaps = 2/46 (4%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
C PE + RC C V+YCS CQ W+ H+ DC P W
Sbjct: 438 CQSPEAIK--LLRCEQCHDVSYCSVECQKSHWRDTHRIDCEPLTGW 481
>gi|393238670|gb|EJD46206.1| hypothetical protein AURDEDRAFT_151705 [Auricularia delicata
TFB-10046 SS5]
Length = 741
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 19/33 (57%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
R+C+ C V YCS+ CQ DWK H+ C A
Sbjct: 703 LRKCAACRDVRYCSKQCQKKDWKFTHREQCTGA 735
>gi|365759013|gb|EHN00827.1| Mub1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 621
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 509 GVRQCANFSCGKWEDFPRQFAKCRRCKRTKYCSRKCQLKAW 549
>gi|410465884|ref|ZP_11319057.1| TPR repeat-containing protein, partial [Desulfovibrio magneticus
str. Maddingley MBC34]
gi|409981090|gb|EKO37711.1| TPR repeat-containing protein, partial [Desulfovibrio magneticus
str. Maddingley MBC34]
Length = 695
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 62/118 (52%), Gaps = 6/118 (5%)
Query: 141 LGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCAR 200
LGM+R + +GA+L+++AA + QA+Y++A++ +G G S N + CA+
Sbjct: 364 LGMVRGRLPGDPAAGAALLSQAAWAGQPQAMYAMALLHLSGQGVSGNPAE---AFRWCAK 420
Query: 201 AAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ-ANARELAA--VLSSAACQGIS 255
AA G +A L ++ + N+A+ + + ANA +L A L S +G++
Sbjct: 421 AAEAGVPEAKGLLAALSEEEFPSAANLAKAVTYYREAANAGDLEAGFALGSLLSKGLA 478
>gi|296083782|emb|CBI23999.3| unnamed protein product [Vitis vinifera]
Length = 892
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADC 374
++CS C AV YCS+ CQA WK HK C
Sbjct: 20 KKCSGCKAVRYCSQTCQATHWKSGHKTKC 48
>gi|146338219|ref|YP_001203267.1| beta-lactamase [Bradyrhizobium sp. ORS 278]
gi|146191025|emb|CAL75030.1| putative Beta-lactamase [Bradyrhizobium sp. ORS 278]
Length = 364
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 54/114 (47%), Gaps = 12/114 (10%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRG------SGASLMAKAAISSHAQALYS 173
E A + K A+AG+ EA + L M+R L RG L+A AA +A Y+
Sbjct: 138 EKAAIWYKRAAEAGDREAMFALAMMR---LAGRGGPVDKPEAVKLLASAAKLGEPKAAYN 194
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
LA++ + G +DL+ L AA G+ +A L ++G GV +++
Sbjct: 195 LALLYLD---GQTLPQDLKRSAELLRIAADAGNAEAQYALATFYKEGTGVPKDL 245
>gi|427402303|ref|ZP_18893375.1| hypothetical protein HMPREF9710_02971 [Massilia timonae CCUG 45783]
gi|425718836|gb|EKU81779.1| hypothetical protein HMPREF9710_02971 [Massilia timonae CCUG 45783]
Length = 767
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 6/117 (5%)
Query: 125 FLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+LK A AG+ EA Y LG+ +NR + + +AA +A A +++ + +G
Sbjct: 570 WLKRAAQAGDPEAQYELGVCHIEGRGVAKNRRTASEWQVRAAEQGYAAAQHAIGLAYLHG 629
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQAN 238
S G+++D V AA + +A +LGHC +G GV N A + +A
Sbjct: 630 SEGAEDDA---LAVQWFRLAAQQDYPEAQTDLGHCYLNGRGVAVNRAHAASWFRKAE 683
>gi|342887815|gb|EGU87244.1| hypothetical protein FOXB_02226 [Fusarium oxysporum Fo5176]
Length = 653
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G+R C++ CG+ E +F +C C YCS+ CQ W H+ C A +
Sbjct: 601 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAF-HRHWCVAATQ 653
>gi|402305749|ref|ZP_10824808.1| Sel1 repeat protein [Haemophilus sputorum HK 2154]
gi|400376862|gb|EJP29749.1| Sel1 repeat protein [Haemophilus sputorum HK 2154]
Length = 383
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 52/123 (42%), Gaps = 6/123 (4%)
Query: 137 ACYTLGMIRFYCLQNRGSGASL-MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGV 195
A Y L M F GA + +AA + +ALY L V G + DL +
Sbjct: 218 AQYLLAMNFFDLYSENNKGALFWLERAANNDEPEALYQLGVYYEEGV-----EADLAKAI 272
Query: 196 ALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGIS 255
RAA L H DA EL + +G V Q+ + FL +A + + A Q ++
Sbjct: 273 NYFQRAAELNHSDAALELSYIYDEGIIVEQDDDKALFFLKKAAELDNQEAIDELAAQALA 332
Query: 256 TRS 258
+R+
Sbjct: 333 SRA 335
>gi|147838237|emb|CAN78307.1| hypothetical protein VITISV_005598 [Vitis vinifera]
Length = 494
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADC 374
++CS C AV YCS+ CQA WK HK C
Sbjct: 99 KKCSGCKAVRYCSQTCQATHWKSGHKTKC 127
>gi|405959674|gb|EKC25686.1| SET and MYND domain-containing protein 2 [Crassostrea gigas]
Length = 243
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+RC+ C V YCSR CQA DW + HK +C+
Sbjct: 1 MKRCTRCRQVYYCSRECQAKDWPM-HKTECS 30
>gi|363580151|ref|ZP_09312961.1| Sel1 domain-containing protein repeat-containing protein
[Flavobacteriaceae bacterium HQM9]
Length = 475
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 49/116 (42%), Gaps = 8/116 (6%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K A AGN A + LG++ C N K+A + A+YSL +
Sbjct: 87 AFKLYKEGAKAGNAIAMFNLGVLYKNGKGCQLNYNKARKWFEKSAEQGNEMAIYSLGYMY 146
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
G G D+D V+ ++ + A LG C +GYGV +NI + L
Sbjct: 147 LKGLGSI--DQDYNKAVSHFEKSEYAM---AKYWLGVCYLNGYGVSKNIQKANELL 197
>gi|373947688|ref|ZP_09607649.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
OS183]
gi|386326466|ref|YP_006022583.1| Sel1 domain-containing protein repeat-containing protein
[Shewanella baltica BA175]
gi|333820611|gb|AEG13277.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
BA175]
gi|373884288|gb|EHQ13180.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
OS183]
Length = 491
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
S+ A ++L+ A+ EA LG++ +Y +N ++KAA + + A Y LA++
Sbjct: 291 SKLALQYLQRAAEGEQTEAMANLGVL-YYQQKNLTQAYHFISKAAQAGYPHAQYHLALML 349
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFLVQ 236
NG G + +D+ A A AA G +DA L H L D ++ + +L Q
Sbjct: 350 ANGDGCT---RDMIASEYWMAEAAEQGQLDAMLTRAQHMLNDDNAFGSDLTQAEDYLRQ 405
>gi|403050408|ref|ZP_10904892.1| hypothetical protein AberL1_02428 [Acinetobacter bereziniae LMG
1003]
Length = 478
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 59/127 (46%), Gaps = 13/127 (10%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMA-----KAAISSHAQALYSL 174
++A ++ ++ + CY ++ Q RG G + +A KAA + H ++ Y
Sbjct: 209 DAAFKYAEIAQQHNDPFGCYVRALVE----QQRGQGDATLAQEFLLKAAKNGHIESAYLA 264
Query: 175 AVIQFNGSGGSKNDKDLRA----GVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
AV + +K ++ + V L + AA GH+ A L CL+ G G +N+ +G
Sbjct: 265 AVNLLKNAENAKTEQKHQEYATLAVELLSHAAVAGHVPAQYSLAQCLRYGLGTEKNLDQG 324
Query: 231 RRFLVQA 237
+L +A
Sbjct: 325 VSWLERA 331
>gi|397588049|gb|EJK54121.1| hypothetical protein THAOC_26317, partial [Thalassiosira oceanica]
Length = 503
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 23/43 (53%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
C + +F++CS C A YCS+ CQ W+ HK DC A
Sbjct: 449 CAKKSQAGEKFKQCSKCKAQWYCSKECQVEAWRAGHKKDCKRA 491
>gi|342217685|ref|ZP_08710324.1| Sel1 repeat protein [Megasphaera sp. UPII 135-E]
gi|341593348|gb|EGS36198.1| Sel1 repeat protein [Megasphaera sp. UPII 135-E]
Length = 1112
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 5/117 (4%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
A + + A GNV+A M + Q KAA +A+A Y L V
Sbjct: 558 KAVEWYQKAATQGNVDA--QNSMDKLQKEQKEQKAIEEYQKAAAQGNAEAQYQLGVCYEE 615
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G ++D+ +AG +AA GHI A +LG C + G GV Q+ A+ + +A
Sbjct: 616 GKGVVQDDE--KAG-EWYQKAAVKGHIAAQYQLGVCYEKGNGVAQDDAKAVEWYQKA 669
>gi|151946256|gb|EDN64487.1| multi budding protein [Saccharomyces cerevisiae YJM789]
gi|392297261|gb|EIW08361.1| Mub1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 620
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 508 GVRQCANFSCGKWEDFPRQFAKCRRCKRTKYCSRKCQLKAW 548
>gi|237746442|ref|ZP_04576922.1| Sel1 repeat-containing protein [Oxalobacter formigenes HOxBLS]
gi|229377793|gb|EEO27884.1| Sel1 repeat-containing protein [Oxalobacter formigenes HOxBLS]
Length = 237
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 63/156 (40%), Gaps = 17/156 (10%)
Query: 96 LALNSLVLSKASKKTFAIKANNWSESAHRFL-------KLC----ADAGNVEACYTLGMI 144
+A +LVL SK FA+ N+ + + +LC A AGN EA +G++
Sbjct: 32 VAFLTLVLIIISKTAFAVGENDLKQGMQHYASRQFDRARLCFEKAAKAGNTEAQTMMGVL 91
Query: 145 RFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARA 201
F ++ A KAA + HA A + V+ G G KN K + +A
Sbjct: 92 YFKGNGVEKDDSQAARWFEKAAKAGHADAQTFIGVMNLEGQGIPKNGKK---ALEWFEKA 148
Query: 202 AFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
A G +A LG G Q+ + + +A
Sbjct: 149 AQAGETNAQNYLGTAYLKGTETAQDTGKAVYWFTRA 184
>gi|209546425|ref|YP_002278315.1| peptidase C14 caspase catalytic subunit p20 [Rhizobium
leguminosarum bv. trifolii WSM2304]
gi|209539282|gb|ACI59215.1| peptidase C14 caspase catalytic subunit p20 [Rhizobium
leguminosarum bv. trifolii WSM2304]
Length = 860
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 6/122 (4%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFY--CLQNRGSGASLMAKA-AISSHAQAL 171
AN +++A +K DAG+V A L + + + + AS ++K A AL
Sbjct: 556 ANRDTKTAFATIKKAMDAGHVRAIQELSSLYEFGASVPRDAAKASEISKGGAAKGDPFAL 615
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
+S + G G N ++ G+ + +AA LGH A+ ELG+ +G V + G
Sbjct: 616 HSYGKALYYGRGTKANQQE---GLRMMLQAADLGHTYAMNELGYIFLNGVNVPADPERGI 672
Query: 232 RF 233
RF
Sbjct: 673 RF 674
>gi|160877378|ref|YP_001556694.1| Sel1 domain-containing protein [Shewanella baltica OS195]
gi|378710592|ref|YP_005275486.1| Sel1 domain-containing protein repeat-containing protein
[Shewanella baltica OS678]
gi|160862900|gb|ABX51434.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
OS195]
gi|315269581|gb|ADT96434.1| Sel1 domain protein repeat-containing protein [Shewanella baltica
OS678]
Length = 491
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 56/119 (47%), Gaps = 5/119 (4%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
S+ A ++L+ A+ EA LG++ Y +N ++KAA + + A Y LA++
Sbjct: 291 SKLAQQYLQRAAEGEQTEAMANLGVLH-YQQKNLTQAYHFISKAAQAGYPHAQYHLALML 349
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFLVQ 236
NG G + +D+ A A AA G +DA L H L D ++ + +L Q
Sbjct: 350 ANGDGCT---RDMIASEYWMAEAAEQGQLDAMLTRAQHMLNDDNAFGSDLTQAEDYLRQ 405
>gi|6323747|ref|NP_013818.1| Mub1p [Saccharomyces cerevisiae S288c]
gi|2497149|sp|Q03162.1|MUB1_YEAST RecName: Full=MYND-type zinc finger protein MUB1; AltName:
Full=Multi-budding protein
gi|854437|emb|CAA89901.1| unknown [Saccharomyces cerevisiae]
gi|190408329|gb|EDV11594.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|256269981|gb|EEU05231.1| Mub1p [Saccharomyces cerevisiae JAY291]
gi|259148673|emb|CAY81918.1| Mub1p [Saccharomyces cerevisiae EC1118]
gi|285814102|tpg|DAA09997.1| TPA: Mub1p [Saccharomyces cerevisiae S288c]
gi|323332064|gb|EGA73475.1| Mub1p [Saccharomyces cerevisiae AWRI796]
gi|323347183|gb|EGA81458.1| Mub1p [Saccharomyces cerevisiae Lalvin QA23]
gi|323353078|gb|EGA85378.1| Mub1p [Saccharomyces cerevisiae VL3]
gi|365763822|gb|EHN05348.1| Mub1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 620
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 508 GVRQCANFSCGKWEDFPRQFAKCRRCKRTKYCSRKCQLKAW 548
>gi|393786768|ref|ZP_10374900.1| hypothetical protein HMPREF1068_01180 [Bacteroides nordii
CL02T12C05]
gi|392658003|gb|EIY51633.1| hypothetical protein HMPREF1068_01180 [Bacteroides nordii
CL02T12C05]
Length = 828
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 3/77 (3%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
QN L +KAA + +Y + + S G K D G A +AA G DA
Sbjct: 476 QNYEQAFELFSKAAEQEYPYGMYRVG---YYLSEGIKGDAQPEEGAAWYLKAAEAGDTDA 532
Query: 210 LRELGHCLQDGYGVRQN 226
+ +G C ++GYGV ++
Sbjct: 533 MFAMGRCYKNGYGVEED 549
>gi|297838503|ref|XP_002887133.1| hypothetical protein ARALYDRAFT_894509 [Arabidopsis lyrata subsp.
lyrata]
gi|297332974|gb|EFH63392.1| hypothetical protein ARALYDRAFT_894509 [Arabidopsis lyrata subsp.
lyrata]
Length = 248
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 54/120 (45%), Gaps = 4/120 (3%)
Query: 58 FDALPDDLVVSILCKLSSTARCPSDFVNVLITCKRMNGLALNSLVLSKASKKTFAIKANN 117
D+LP+DL+V I + A S N+ + K + V + S K I
Sbjct: 23 LDSLPEDLLVEI--SSCTGASSLSAVRNLRVVSKSFRRICDERYVFYRLSLK--EIDYPP 78
Query: 118 WSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVI 177
W +++ F + C ++GN EA Y G + ++ + G +A+AA + +A Y +I
Sbjct: 79 WHQNSEYFFERCRNSGNPEALYRKGFMNYFRDNLKHEGLKYLAEAAEKGNREANYVYGLI 138
>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
bisporus H97]
Length = 674
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
+RC C ++YC CQ+ DW L HK +C ++W
Sbjct: 144 LKRCPTCRIIHYCDSECQSRDWTL-HKRECPALQKW 178
>gi|365896718|ref|ZP_09434778.1| putative Beta-lactamase [Bradyrhizobium sp. STM 3843]
gi|365422536|emb|CCE07320.1| putative Beta-lactamase [Bradyrhizobium sp. STM 3843]
Length = 365
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 68/154 (44%), Gaps = 14/154 (9%)
Query: 87 LITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRF 146
L T K G A +L + + I+ + + A + K ++AG+ EA + L M+R
Sbjct: 108 LATTKAQAGDAKAMTMLGQLYENAMGIRRD--YDKAALWYKRASEAGDREAMFALAMMR- 164
Query: 147 YCLQNRG------SGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCAR 200
L RG L+A AA +A Y+LA++ + G +DL+ L
Sbjct: 165 --LAGRGGPVDKQEAVKLLASAAKLGEPKAAYNLALLYLD---GQTLPQDLKRSAELLQM 219
Query: 201 AAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
AA G+ +A L ++G GV ++ + R L
Sbjct: 220 AADAGNAEAQYALATFYKEGTGVPKDPEKAARLL 253
>gi|109073367|ref|XP_001086513.1| PREDICTED: programmed cell death protein 2 isoform 9 [Macaca
mulatta]
Length = 344
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQTCA 173
>gi|402868793|ref|XP_003898472.1| PREDICTED: programmed cell death protein 2 [Papio anubis]
Length = 344
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQTCA 173
>gi|397627830|gb|EJK68628.1| hypothetical protein THAOC_10172, partial [Thalassiosira oceanica]
Length = 433
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 30/70 (42%), Gaps = 9/70 (12%)
Query: 308 ASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWK 367
A L +W T G CG+ ++++CS C A YCS+ CQ W+
Sbjct: 361 ARELLKQWETIEQDTCG---------NCGKEVQSDMKYKQCSKCRAQWYCSKECQIEAWR 411
Query: 368 LRHKADCAPA 377
HK DC A
Sbjct: 412 AGHKKDCKRA 421
>gi|349580381|dbj|GAA25541.1| K7_Mub1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 620
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 508 GVRQCANFSCGKWEDFPRQFAKCRRCKRTKYCSRKCQLKAW 548
>gi|448927302|gb|AGE50876.1| histone-lysine N-methyltransferase / SET domain containing protein
[Paramecium bursaria Chlorella virus CVB-1]
Length = 175
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 26/51 (50%), Gaps = 5/51 (9%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
+C GC E +RCSVC V YCS CQ WK HK CAP ++
Sbjct: 104 VCYAEGCSNTEN----LKRCSVCRNVRYCSAECQKAHWKC-HKQCCAPKKK 149
>gi|406880646|gb|EKD28944.1| hypothetical protein ACD_79C00146G0001, partial [uncultured
bacterium]
Length = 666
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 60/134 (44%), Gaps = 13/134 (9%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRG----SGASLM------AKAAISS 166
N++E A++ LK AD G+ EA Y LG + Y + + + M KAA
Sbjct: 74 NYAERANKILKPLADKGDPEAQYNLGEMYKYGIGVKQDIELANTYFMPAFEKYKKAAEKG 133
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
++A + ++ G G S DL +AA G+++A +G + GYGV Q+
Sbjct: 134 DSEAQKKVGLMYLKGYGVS---TDLTQAFQWYLKAAEQGNLEAQVNIGGAYRTGYGVNQD 190
Query: 227 IAEGRRFLVQANAR 240
+ + +A +
Sbjct: 191 YNKALEWFTKATEQ 204
>gi|392570551|gb|EIW63723.1| HCP-like protein [Trametes versicolor FP-101664 SS1]
Length = 502
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 38/82 (46%)
Query: 158 LMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCL 217
L+ + A H A Y LA NG G K +D L AA GH DA G C
Sbjct: 179 LLKRLADRGHMPAQYFLADCYANGIGTHKGRQDFDRAFPLFVLAAKHGHPDASYRAGTCC 238
Query: 218 QDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ A+ +F +A A
Sbjct: 239 ENGWGCRRESAKAVQFYKKAAA 260
>gi|171692093|ref|XP_001910971.1| hypothetical protein [Podospora anserina S mat+]
gi|170945995|emb|CAP72796.1| unnamed protein product [Podospora anserina S mat+]
Length = 620
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 568 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 613
>gi|401624326|gb|EJS42388.1| mub1p [Saccharomyces arboricola H-6]
Length = 621
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 509 GVRQCANFSCGKWEDFPKQFAKCRRCKRTKYCSRKCQLKAW 549
>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
Length = 429
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
C + T E ++CS C + YCS+ CQ DWKL HK +C
Sbjct: 26 CNQCLTSMAELKKCSACRRLAYCSQECQRADWKL-HKVEC 64
>gi|451856758|gb|EMD70049.1| hypothetical protein COCSADRAFT_342177 [Cochliobolus sativus
ND90Pr]
Length = 308
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 23/49 (46%), Gaps = 1/49 (2%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
L L + CG +RC CG YCS ACQ DWK+ HK C
Sbjct: 204 LDLSTCTSCGSAGAADKALQRCGKCGTATYCSDACQKGDWKV-HKGVCT 251
>gi|414163928|ref|ZP_11420175.1| hypothetical protein HMPREF9697_02076 [Afipia felis ATCC 53690]
gi|410881708|gb|EKS29548.1| hypothetical protein HMPREF9697_02076 [Afipia felis ATCC 53690]
Length = 363
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 55/117 (47%), Gaps = 10/117 (8%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY-----CLQNRGSGASLMAKAAISSHAQALYSLAV 176
A + ADAGN EA Y L FY +N A L+ AA++++ A A+
Sbjct: 212 AAELFRQAADAGNPEAQYALAT--FYKEGRGVEKNPAEAAKLLGAAALANNLDAEVEYAI 269
Query: 177 IQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
FNG+G ++ D R VAL RAA G A L H L +G QN EG ++
Sbjct: 270 ALFNGTGIAR---DERTAVALLNRAARQGSPIAQNRLAHVLIEGKAAPQNKVEGFKW 323
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 52/108 (48%), Gaps = 6/108 (5%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQ 178
A + K AD G+ EA + LGM++ +R +GA L+ AA +A Y+L ++
Sbjct: 140 AASWYKKAADRGDREALFALGMMQIGGRGMPVDRSAGAKLLVAAAKLGKPEAAYNLGLLY 199
Query: 179 FNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
G +D++ L +AA G+ +A L ++G GV +N
Sbjct: 200 LE---GQVFPQDIKRAAELFRQAADAGNPEAQYALATFYKEGRGVEKN 244
>gi|397564868|gb|EJK44379.1| hypothetical protein THAOC_37083 [Thalassiosira oceanica]
Length = 684
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 46/109 (42%), Gaps = 2/109 (1%)
Query: 129 CADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKND 188
C A + R +N +++ A A++ L F G G +
Sbjct: 111 CNLAAQRRGMFDCAFCRTPIARNDADSLAMIQARAAKKDPTAIHFLGKKYFFGRLGLQ-- 168
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD+R V L AA LG DAL ELG+ +G GV+Q+ F ++A
Sbjct: 169 KDMRRAVKLWEEAAELGSTDALFELGNAYLEGEGVQQDEPMAVEFYMKA 217
>gi|254569888|ref|XP_002492054.1| hypothetical protein [Komagataella pastoris GS115]
gi|238031851|emb|CAY69774.1| Hypothetical protein PAS_chr2-2_0152 [Komagataella pastoris GS115]
gi|328351455|emb|CCA37854.1| SET and MYND domain-containing protein 2 [Komagataella pastoris CBS
7435]
Length = 736
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 20/35 (57%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWL 381
RCS C VNYCS +CQ DW H +C + +L
Sbjct: 141 RCSGCRVVNYCSLSCQLQDWNEFHSKECTYLKDYL 175
>gi|451849236|gb|EMD62540.1| hypothetical protein COCSADRAFT_221991 [Cochliobolus sativus
ND90Pr]
Length = 302
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
CGR F C+ C YCS+ CQ +DWK+ HK C
Sbjct: 261 CGRSPKGEKTFPACAKCKKARYCSQECQKVDWKM-HKKTCV 300
>gi|338739900|ref|YP_004676862.1| hypothetical protein HYPMC_3079 [Hyphomicrobium sp. MC1]
gi|337760463|emb|CCB66296.1| exported protein of unknown function [Hyphomicrobium sp. MC1]
Length = 384
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 61/123 (49%), Gaps = 13/123 (10%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMA-----KAAISSH--AQALYSL 174
A L ADAGN A L I + G A L A +AA ++ ++A++SL
Sbjct: 50 ARDLLSHAADAGNASAMNLLASI---LQKGLGGDADLPAARKWYEAAANNGDLSEAMFSL 106
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
A++ NG GG K D+ + + +AA G++ A+ +G+ QDG G Q+ A R +
Sbjct: 107 AMMLHNGQGGPK---DVASARSWFEKAANAGNVPAMDWMGYLSQDGEGGPQDFAVARSWF 163
Query: 235 VQA 237
+A
Sbjct: 164 QRA 166
>gi|189502234|ref|YP_001957951.1| hypothetical protein Aasi_0851 [Candidatus Amoebophilus asiaticus
5a2]
gi|189497675|gb|ACE06222.1| hypothetical protein Aasi_0851 [Candidatus Amoebophilus asiaticus
5a2]
Length = 961
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 54/120 (45%), Gaps = 13/120 (10%)
Query: 113 IKANNWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQ 169
+KA W E A A+ + ++ Y LG+I ++ KAA HA+
Sbjct: 675 VKAIEWYEKA-------ANQEHAKSQYALGVIYESGEGVEKDEKKAIEWYEKAANQGHAR 727
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A +SL V+ G G +KD R V +AA GH A +LG +G GV Q+ A+
Sbjct: 728 AQFSLGVMYGEGEG---VEKDERKAVEWYEKAANQGHARAQFKLGWMYGEGRGVSQDYAK 784
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 53/124 (42%), Gaps = 13/124 (10%)
Query: 114 KANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQA 170
KA W E AD G+ A Y LG + L ++ + +AA HA++
Sbjct: 532 KAKEWYEK-------TADQGHAHAQYNLGYMYEKGLGVAKDYVKAIAWYKQAANQGHAKS 584
Query: 171 LYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
Y+L VI G G + KD+R + +AA GH D +L G G+ ++ A+
Sbjct: 585 QYALGVIYIEGQGVA---KDVRKAIEWYEKAANQGHADVQLKLAARYFKGEGIAKDYAKA 641
Query: 231 RRFL 234
+
Sbjct: 642 IEWF 645
>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 643
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
+RC C ++YC CQ+ DW L HK +C ++W
Sbjct: 109 LKRCPTCRIIHYCDTECQSRDWTL-HKRECPALQKW 143
>gi|189502471|ref|YP_001958188.1| hypothetical protein Aasi_1118 [Candidatus Amoebophilus asiaticus
5a2]
gi|189497912|gb|ACE06459.1| hypothetical protein Aasi_1118 [Candidatus Amoebophilus asiaticus
5a2]
Length = 329
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 51/107 (47%), Gaps = 6/107 (5%)
Query: 134 NVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKD 190
++EA Y LG++ + C +N KAA +A+A ++LA + NG G +KD
Sbjct: 127 DMEAQYNLGVMYYKCWGVDKNYQEAKEWYEKAAEQGYAKAQHTLAAMYINGEGV---EKD 183
Query: 191 LRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
C +AA G+ A L +G GV ++ A+ ++ +A
Sbjct: 184 HVKAFKWCQKAAKQGYARAQHNLAAMYINGEGVEKDHAKAFKWCQKA 230
>gi|426201389|gb|EKV51312.1| hypothetical protein AGABI2DRAFT_189560 [Agaricus bisporus var.
bisporus H97]
Length = 475
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 38/83 (45%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A H + Y LA NG G K +D L AA GH DA G C
Sbjct: 156 SLLKRLADRGHPASQYFLADCYANGIGTVKGKQDFDKAYPLFVLAAKHGHPDAAYRAGTC 215
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ + +F +A A
Sbjct: 216 CENGWGCRRESPKAVQFFRKAAA 238
>gi|380794651|gb|AFE69201.1| programmed cell death protein 2 isoform 1, partial [Macaca mulatta]
Length = 307
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 27/55 (49%), Gaps = 7/55 (12%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERW 380
G LC GC P+T CS C YCS+ Q LDW+L HK CA + +
Sbjct: 94 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQTCAQPDHF 141
>gi|431904622|gb|ELK10004.1| Programmed cell death protein 2 [Pteropus alecto]
Length = 302
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 24/50 (48%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCSR Q LDW+L HK C
Sbjct: 89 GAHLCRVCGCLGPKT-------CSRCHKAFYCSREHQTLDWRLGHKQACT 131
>gi|409083572|gb|EKM83929.1| hypothetical protein AGABI1DRAFT_110539 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 475
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 38/83 (45%)
Query: 157 SLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHC 216
SL+ + A H + Y LA NG G K +D L AA GH DA G C
Sbjct: 156 SLLKRLADRGHPASQYFLADCYANGIGTVKGKQDFDKAYPLFVLAAKHGHPDAAYRAGTC 215
Query: 217 LQDGYGVRQNIAEGRRFLVQANA 239
++G+G R+ + +F +A A
Sbjct: 216 CENGWGCRRESPKAVQFFRKAAA 238
>gi|397563864|gb|EJK43971.1| hypothetical protein THAOC_37538 [Thalassiosira oceanica]
Length = 316
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 38/82 (46%), Gaps = 13/82 (15%)
Query: 169 QALYSLAVIQFNGSGGSKN-------------DKDLRAGVALCARAAFLGHIDALRELGH 215
+ALY L + F G G +N KDL+ V L AA LG IDAL LG
Sbjct: 149 EALYCLGLKYFFGKFGLQNSIGIQSSRIGHWTQKDLKKAVELWKEAAELGSIDALYHLGA 208
Query: 216 CLQDGYGVRQNIAEGRRFLVQA 237
G GV+Q+ A+ F +A
Sbjct: 209 AHDLGEGVQQDEAKAAEFWSKA 230
>gi|448926282|gb|AGE49859.1| histone-lysine N-methyltransferase / SET domain containing protein
[Paramecium bursaria Chlorella virus Can18-4]
gi|448928305|gb|AGE51876.1| histone-lysine N-methyltransferase / SET domain containing protein
[Paramecium bursaria Chlorella virus CVM-1]
Length = 175
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 26/51 (50%), Gaps = 5/51 (9%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
+C GC E +RCSVC V YCS CQ WK HK CAP ++
Sbjct: 104 VCYAEGCSNTEN----LKRCSVCRNVRYCSAECQKAHWKC-HKQCCAPKKK 149
>gi|307111060|gb|EFN59295.1| hypothetical protein CHLNCDRAFT_137646 [Chlorella variabilis]
Length = 890
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 27/50 (54%), Gaps = 5/50 (10%)
Query: 328 RLCSHVGCGRPET---RRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
R C+++ C P T +R+ RCS C V YC R C DW+ RHK C
Sbjct: 827 RSCANLRCANPATVGGKRNP--RCSGCRTVRYCCRQCSVADWRPRHKLVC 874
>gi|327262224|ref|XP_003215925.1| PREDICTED: LOW QUALITY PROTEIN: programmed cell death protein
2-like [Anolis carolinensis]
Length = 362
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 25/49 (51%), Gaps = 7/49 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
G LC GC P +RCS C +YCS+ Q LDWK HKA C
Sbjct: 137 GANLCRVCGCLGP-------KRCSKCHKAHYCSQDHQLLDWKAGHKASC 178
>gi|260812637|ref|XP_002601027.1| hypothetical protein BRAFLDRAFT_96937 [Branchiostoma floridae]
gi|229286317|gb|EEN57039.1| hypothetical protein BRAFLDRAFT_96937 [Branchiostoma floridae]
Length = 241
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
C + CG+P R+ + + C+ C YCSR CQ W + HK C ER
Sbjct: 181 CCNPKCGKPGYRK-DLKLCAGCKLTRYCSRDCQIQHWSVGHKKCCGHDER 229
>gi|92115879|ref|YP_575608.1| Sel1-like protein [Nitrobacter hamburgensis X14]
gi|91798773|gb|ABE61148.1| Sel1-like protein [Nitrobacter hamburgensis X14]
Length = 1086
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 62/134 (46%), Gaps = 17/134 (12%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGAS--------LMAKAAISS 166
A N+ E+A ++ A AG V A + LG L +G GAS +AA
Sbjct: 883 AVNYDEAA-KWYDRAAQAGVVPAMFRLG-----TLHEKGLGASKDVDAARRYYMQAAERG 936
Query: 167 HAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
+A+A+++LAV+ +G G N + +AA G D+ LG G G+ QN
Sbjct: 937 NAKAMHNLAVLDADGGGKGAN---YTSAAQWFRKAAERGVADSQFNLGILYARGIGIEQN 993
Query: 227 IAEGRRFLVQANAR 240
+AE ++ A A+
Sbjct: 994 LAESYKWFSLAAAQ 1007
>gi|393227480|gb|EJD35157.1| hypothetical protein AURDEDRAFT_175791 [Auricularia delicata
TFB-10046 SS5]
Length = 436
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 27/51 (52%), Gaps = 6/51 (11%)
Query: 328 RLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWK----LRHKADC 374
R+C GCG + R F RCS CG YCS++CQ DW + HK C
Sbjct: 289 RVCVAPGCGASVSGR--FVRCSDCGVAVYCSKSCQRRDWNRDSVVAHKVVC 337
>gi|408391041|gb|EKJ70425.1| hypothetical protein FPSE_09419 [Fusarium pseudograminearum CS3096]
Length = 657
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 605 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 650
>gi|397570946|gb|EJK47543.1| hypothetical protein THAOC_33728 [Thalassiosira oceanica]
Length = 445
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 23/43 (53%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
C + +F++CS C A YCS+ CQ W+ HK DC A
Sbjct: 391 CSKKAETGEKFKQCSKCKAQWYCSKECQVEAWRAGHKKDCKRA 433
>gi|395330297|gb|EJF62681.1| hypothetical protein DICSQDRAFT_179962 [Dichomitus squalens
LYAD-421 SS1]
Length = 331
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 31/69 (44%), Gaps = 1/69 (1%)
Query: 325 PGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEE 384
P LR C H + E R+C+ C YCS+ACQ W+ +HK C A +
Sbjct: 21 PALRRCHHCARAQEEDPATRLRKCAGCSRALYCSKACQKAAWQ-KHKLVCCDATDFTARN 79
Query: 385 GNGDVDVDL 393
+G + L
Sbjct: 80 NSGQLHSGL 88
>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3-like [Apis florea]
Length = 414
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 9/62 (14%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE------EGNGDVDVDLNDAEGGG 400
RCSVC + YC+++CQ + W + H +CA +R+ + + + LN +GGG
Sbjct: 44 RCSVCKCIYYCNQSCQQMSWTI-HSKECASLKRFSSKVIPDVARLMARIIIKLN--QGGG 100
Query: 401 ED 402
E+
Sbjct: 101 EE 102
>gi|353236014|emb|CCA68017.1| related to SKT5-protoplast regeneration and killer toxin resistance
protein [Piriformospora indica DSM 11827]
Length = 679
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 39/84 (46%)
Query: 158 LMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCL 217
L+ K A H A Y+LA NG G K +D L AA H+DA G C
Sbjct: 356 LLKKIADRGHVMAQYTLADCYSNGIGTVKGRQDFDRAFPLFVAAAKHYHVDAAYRAGTCC 415
Query: 218 QDGYGVRQNIAEGRRFLVQANARE 241
+ G+G R++ A+ F A +++
Sbjct: 416 EYGWGCRKDSAKAAHFYRAAASQQ 439
>gi|397616553|gb|EJK64030.1| hypothetical protein THAOC_15280 [Thalassiosira oceanica]
Length = 323
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
+A+Y L + + G G + KD++ V LC AA LG I+AL L + +G V+Q+ A
Sbjct: 149 EAIYFLGMKYYQGDLGLQ--KDMQKAVELCTEAAELGSIEALFSLANAYNEGDDVQQDTA 206
Query: 229 EGRRFLVQA 237
+ + +A
Sbjct: 207 KAIKLYTRA 215
>gi|393236301|gb|EJD43851.1| hypothetical protein AURDEDRAFT_114606 [Auricularia delicata
TFB-10046 SS5]
Length = 635
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKADC 374
C ++ C + + ++ F++CS CG YCS+ CQ WK HK+ C
Sbjct: 400 CDNIQCNKIDDKK-SFKKCSGCGMTYYCSKECQVTAWKQGDHKSVC 444
>gi|392380396|ref|YP_004987553.1| conserved hypothetical protein; putative TPR repeats; putative
Beta-lactamase [Azospirillum brasilense Sp245]
gi|356882926|emb|CCD03945.1| conserved hypothetical protein; putative TPR repeats; putative
Beta-lactamase [Azospirillum brasilense Sp245]
Length = 567
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 53/114 (46%), Gaps = 8/114 (7%)
Query: 125 FLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
+ + AD GN +A Y +GM+ L ++ S +AA A + L + +G
Sbjct: 384 WFRRAADQGNSDAVYNIGMLHSLGLGVPRDPAGALSWYQRAAEQGSVLAQFRLGAMLASG 443
Query: 182 SGGSKNDKDLRAGVALCAR-AAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
G ++ G AL +R AA GH+ A+ +G G GV ++ AE R+L
Sbjct: 444 DGVPQD----YPGAALWSRKAAEQGHVGAMVNIGRFSMQGLGVERDTAEALRWL 493
>gi|347840926|emb|CCD55498.1| similar to MYND domain protein (SamB) [Botryotinia fuckeliana]
Length = 357
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G+R C++ CG+ E +F +C C YCS+ CQ W H+ C A +
Sbjct: 305 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWVF-HRHWCVQASQ 357
>gi|356546765|ref|XP_003541793.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 15-like [Glycine
max]
Length = 990
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
RCS C V YCS CQ + W+L HK +C E
Sbjct: 86 RCSRCKIVRYCSGNCQIIHWRLIHKQECQQLE 117
>gi|297735922|emb|CBI18698.3| unnamed protein product [Vitis vinifera]
Length = 669
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 14/76 (18%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP-----AERWLDEEGNGDVDVDLN------- 394
RCS C AV YCS CQ W+ HK +C P A ++ D G G N
Sbjct: 78 RCSQCKAVRYCSGKCQIKHWRQGHKNECVPPTPTSAMQFNDGSGFGGKTTSQNHDVLGSS 137
Query: 395 --DAEGGGEDNVVADS 408
D+ G D++ DS
Sbjct: 138 TPDSSKGLSDDISCDS 153
>gi|313851173|ref|NP_001186391.1| programmed cell death protein 2 isoform 4 [Homo sapiens]
gi|61661003|gb|AAX51225.1| programmed cell death 2 isoform 1 [Homo sapiens]
gi|306921397|dbj|BAJ17778.1| programmed cell death 2 [synthetic construct]
Length = 311
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 98 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 140
>gi|343504481|ref|ZP_08742194.1| Sel1 domain-containing protein [Vibrio ichthyoenteri ATCC 700023]
gi|342811471|gb|EGU46509.1| Sel1 domain-containing protein [Vibrio ichthyoenteri ATCC 700023]
Length = 443
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 49/104 (47%), Gaps = 6/104 (5%)
Query: 129 CADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGS 185
A+ G++EA Y +GM+ + L N+ AA + HA A +SLA + G G
Sbjct: 274 AAEQGHIEAQYNVGMMYDFGLGVEPNKTKALIWYHNAAENGHADAQFSLASLYELGVGTP 333
Query: 186 KNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
N K+ +AA G + A LG L+ G G+ QNI E
Sbjct: 334 VNKKE---AYRWYVKAAKQGSVAAQYNLGVMLEAGKGIEQNIDE 374
>gi|322699759|gb|EFY91518.1| MYND domain protein (SamB) [Metarhizium acridum CQMa 102]
Length = 648
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G+R C++ CG+ E +F +C C YCS+ CQ W H+ C A +
Sbjct: 596 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAF-HRHWCVAASQ 648
>gi|398021807|ref|XP_003864066.1| hypothetical protein, conserved [Leishmania donovani]
gi|322502300|emb|CBZ37384.1| hypothetical protein, conserved [Leishmania donovani]
Length = 1028
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 29/62 (46%), Gaps = 10/62 (16%)
Query: 323 PGPGLRL---------CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKA 372
P PG L SH CG RR RC C AV+YC + QALDWK H+
Sbjct: 616 PAPGYTLHVPPVCFSAISHKKCGWCGRRREVLLRCGGCKAVSYCCKRHQALDWKEGPHRL 675
Query: 373 DC 374
+C
Sbjct: 676 EC 677
>gi|429850176|gb|ELA25474.1| mynd finger family protein [Colletotrichum gloeosporioides Nara gc5]
Length = 1227
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 334 GCGRPETRR-HEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
CG+ E + ++C+ C V YCS CQ DW+ +H+A+CA ++
Sbjct: 1172 ACGKSEAESGTKLKKCTRCLQVKYCSGECQKKDWR-KHRAECAESD 1216
>gi|397582835|gb|EJK52436.1| hypothetical protein THAOC_28284 [Thalassiosira oceanica]
Length = 377
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 31/55 (56%)
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
GG KD+R V L AA LG I+AL LG+ G GV+Q++A+ F +A
Sbjct: 164 GGLGLQKDMRGAVELYTEAAELGSIEALYNLGNAYDLGKGVQQDMAKAVEFWTKA 218
>gi|358391583|gb|EHK40987.1| hypothetical protein TRIATDRAFT_28657 [Trichoderma atroviride IMI
206040]
Length = 732
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 66/149 (44%), Gaps = 18/149 (12%)
Query: 101 LVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMA 160
L+ + A + + +KAN + ++L+ AD + E G+ + N + A
Sbjct: 557 LLYALACRHGWGMKANQ--KEGVQWLRKAADMASAEVAEDEGVAKEGKRINVAENKTHKA 614
Query: 161 KAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCAR----AAFLGHIDALRELGHC 216
+ A+S +Y L V NG G + DK AL R A G +DAL E G C
Sbjct: 615 QFALS-----IYELGVSHMNG-WGIEQDK------ALALRCFEIAGTWGDVDALAEAGFC 662
Query: 217 LQDGYGVRQNIAEGRRFLVQANARELAAV 245
G G ++N+ + +F A A+ ++ V
Sbjct: 663 YAQGIGCKKNLKKSAKFYRMAEAKGMSMV 691
>gi|299744301|ref|XP_001840749.2| hypothetical protein CC1G_10363 [Coprinopsis cinerea okayama7#130]
gi|298406059|gb|EAU81072.2| hypothetical protein CC1G_10363 [Coprinopsis cinerea okayama7#130]
Length = 705
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 23/36 (63%)
Query: 342 RHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
R++ + C+ C +V YCS CQ DWKL H+ +C A
Sbjct: 462 RYKSKTCAGCTSVAYCSSLCQKEDWKLFHRYECKTA 497
>gi|336266678|ref|XP_003348106.1| hypothetical protein SMAC_03952 [Sordaria macrospora k-hell]
gi|380091041|emb|CCC11247.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 622
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 570 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 615
>gi|184157441|ref|YP_001845780.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii ACICU]
gi|332874098|ref|ZP_08442032.1| Sel1 repeat protein [Acinetobacter baumannii 6014059]
gi|385236838|ref|YP_005798177.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii TCDC-AB0715]
gi|387124611|ref|YP_006290493.1| Sel1 repeat protein [Acinetobacter baumannii MDR-TJ]
gi|407932178|ref|YP_006847821.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii TYTH-1]
gi|416149193|ref|ZP_11602754.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii AB210]
gi|417579102|ref|ZP_12229935.1| Sel1 repeat protein [Acinetobacter baumannii Naval-17]
gi|421629687|ref|ZP_16070409.1| Sel1 repeat protein [Acinetobacter baumannii OIFC180]
gi|421686539|ref|ZP_16126289.1| Sel1 repeat protein [Acinetobacter baumannii IS-143]
gi|421702962|ref|ZP_16142433.1| Sel1 repeat protein [Acinetobacter baumannii ZWS1122]
gi|421709169|ref|ZP_16148531.1| Sel1 repeat protein [Acinetobacter baumannii ZWS1219]
gi|424053124|ref|ZP_17790656.1| hypothetical protein W9G_01813 [Acinetobacter baumannii Ab11111]
gi|445473166|ref|ZP_21452717.1| Sel1 repeat protein [Acinetobacter baumannii OIFC338]
gi|183209035|gb|ACC56433.1| TPR repeat, SEL1 subfamily protein [Acinetobacter baumannii ACICU]
gi|323517335|gb|ADX91716.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii TCDC-AB0715]
gi|332737710|gb|EGJ68603.1| Sel1 repeat protein [Acinetobacter baumannii 6014059]
gi|333364609|gb|EGK46623.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii AB210]
gi|385879103|gb|AFI96198.1| Sel1 repeat protein [Acinetobacter baumannii MDR-TJ]
gi|395568240|gb|EJG28914.1| Sel1 repeat protein [Acinetobacter baumannii Naval-17]
gi|404568109|gb|EKA73218.1| Sel1 repeat protein [Acinetobacter baumannii IS-143]
gi|404670765|gb|EKB38648.1| hypothetical protein W9G_01813 [Acinetobacter baumannii Ab11111]
gi|407188460|gb|EKE59706.1| Sel1 repeat protein [Acinetobacter baumannii ZWS1219]
gi|407193338|gb|EKE64505.1| Sel1 repeat protein [Acinetobacter baumannii ZWS1122]
gi|407900759|gb|AFU37590.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii TYTH-1]
gi|408700355|gb|EKL45814.1| Sel1 repeat protein [Acinetobacter baumannii OIFC180]
gi|444769453|gb|ELW93638.1| Sel1 repeat protein [Acinetobacter baumannii OIFC338]
Length = 159
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 55/116 (47%), Gaps = 7/116 (6%)
Query: 126 LKLCADAGNVEACYTLG---MIRFYCLQ-NRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
LK A+ G++ A +G + Y L + + KAA S A++ Y+LA++ NG
Sbjct: 30 LKSAAEQGDIRAQSQMGDAYLFGEYDLNVDYKQAINWYQKAADQSDAKSQYNLAIMYLNG 89
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G K DL V ++A G D+ +LG +G GV +NI + + +A
Sbjct: 90 YGVKK---DLSKSVEYYRKSALQGDADSQLQLGIRYLNGEGVERNIETAKEWFKKA 142
>gi|359299880|ref|ZP_09185719.1| hypothetical protein Haemo_06997 [Haemophilus [parainfluenzae] CCUG
13788]
Length = 383
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 52/123 (42%), Gaps = 6/123 (4%)
Query: 137 ACYTLGMIRFYCLQNRGSGASL-MAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGV 195
A Y L M F GA + +AA + +ALY L V G+ + DL +
Sbjct: 218 AQYLLAMNFFDLYSENNKGALFWLERAANNDEPEALYQLGVYYSEGA-----EADLAKSI 272
Query: 196 ALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAACQGIS 255
RAA L H DA L + +G V Q+ + FL +A + + A Q ++
Sbjct: 273 KYYQRAAELNHADAALALSYIYDEGISVEQDEDKALFFLKKAAELDNQEAIDELAAQALA 332
Query: 256 TRS 258
+R+
Sbjct: 333 SRA 335
>gi|332253851|ref|XP_003276046.1| PREDICTED: programmed cell death protein 2-like [Nomascus
leucogenys]
Length = 300
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 87 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 129
>gi|123484570|ref|XP_001324301.1| TPR repeat protein [Trichomonas vaginalis G3]
gi|121907181|gb|EAY12078.1| TPR repeat protein, putative [Trichomonas vaginalis G3]
Length = 364
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 60/155 (38%), Gaps = 25/155 (16%)
Query: 115 ANNWSESAH---------RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAIS 165
AN W + H +LK A AGN +AC LG+ FY Q R LM +
Sbjct: 111 ANVWLKGKHFAPNIPKGLYYLKKAARAGNGKACEMLGL--FY-EQGRHVEQDLMEAYDLF 167
Query: 166 SHAQ-------ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA---FLGHIDALRELGH 215
A + Y A NG GG KD G +C + A ++D + LG
Sbjct: 168 EIASQYPDCTFSAYKFAYFLINGIGGL---KDQNRGFTVCKQGADNKNYNNLDCINYLGE 224
Query: 216 CLQDGYGVRQNIAEGRRFLVQANARELAAVLSSAA 250
+ GYG ++I + + + A L + A
Sbjct: 225 LYEYGYGTAKDIKKAKEYYKLAAKSNFPKALYNIA 259
>gi|322706029|gb|EFY97611.1| MYND domain protein (SamB) [Metarhizium anisopliae ARSEF 23]
Length = 637
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 585 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 630
>gi|307610906|emb|CBX00523.1| TPR repeat protein [Legionella pneumophila 130b]
Length = 305
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 64/148 (43%), Gaps = 10/148 (6%)
Query: 117 NWSESAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYS 173
N+SE+ F K A+ G+ A LG+ QN KAA + A ++
Sbjct: 60 NYSEAVKWFQK-AAEQGDTMAQRNLGLTYTTGTGVAQNHSKAMKWFRKAAEKNDPVAEFN 118
Query: 174 LAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRF 233
L V+ G G S ND + V +AA G DA R LG G GV+QN E +
Sbjct: 119 LGVLYIEGIGISHNDGE---AVKWIHKAAEQGFPDAERTLGILYLTGKGVKQNDGEAIIW 175
Query: 234 LVQANAR-ELAA--VLSSAACQGISTRS 258
+A +LA+ LS+ +G T+
Sbjct: 176 FRKAGEHGDLASQRTLSAMYVEGNHTQQ 203
>gi|393232093|gb|EJD39679.1| hypothetical protein AURDEDRAFT_128150 [Auricularia delicata
TFB-10046 SS5]
Length = 921
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 33/74 (44%), Gaps = 8/74 (10%)
Query: 307 PASRFLAEWFAARGGTPGPGLRLCSHVGCGRPETRRHEFRR---CSVCGAVNYCSRACQA 363
P++ L FA+R G G C+H C P+ H R+ C+ C V YC CQ
Sbjct: 798 PSTELLVYRFASRTNVAGQGA--CAHTACRAPQLSAHGTRKLFYCNRCRLVRYCGADCQR 855
Query: 364 LDWK---LRHKADC 374
DW + HK C
Sbjct: 856 ADWSGAAVPHKLVC 869
>gi|336472673|gb|EGO60833.1| hypothetical protein NEUTE1DRAFT_57649 [Neurospora tetrasperma FGSC
2508]
gi|350294091|gb|EGZ75176.1| hypothetical protein NEUTE2DRAFT_83246 [Neurospora tetrasperma FGSC
2509]
Length = 622
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 570 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 615
>gi|223992863|ref|XP_002286115.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977430|gb|EED95756.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 275
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 27/56 (48%), Gaps = 5/56 (8%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWK-----LRHKADCAPAER 379
+C H G E + RCS CG YCSR CQ +WK + HK CA +R
Sbjct: 56 VCFHCGKSAEEGGVPKLSRCSKCGVACYCSRDCQVTNWKGGDGVVGHKFLCAAYKR 111
>gi|149909665|ref|ZP_01898318.1| hypothetical protein PE36_12787 [Moritella sp. PE36]
gi|149807369|gb|EDM67322.1| hypothetical protein PE36_12787 [Moritella sp. PE36]
Length = 1046
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
Q++ A+L KAA H + Y+LA++ ++ S G + DK + AL +AA LGH+ A
Sbjct: 461 QDKQKAAALYQKAANLDHPGSTYNLAIM-YDSSDGIEQDK--QKAAALYQKAADLGHLGA 517
Query: 210 LRELGHCLQDGYGVRQN 226
+ L G G+ Q+
Sbjct: 518 MYNLAIMYDIGDGIEQD 534
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 3/78 (3%)
Query: 150 QNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDA 209
Q++ A+L KAA H+ A +LA++ +G G ++D + L +AA LGH A
Sbjct: 569 QDKQKAAALYQKAADLGHSGATLNLAIMYDSGDGI---EQDKQKAADLYQKAADLGHSGA 625
Query: 210 LRELGHCLQDGYGVRQNI 227
L DG G+ QNI
Sbjct: 626 TLNLAIMYNDGDGIEQNI 643
>gi|339899009|ref|XP_003392748.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321398619|emb|CBZ08946.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 1028
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 29/62 (46%), Gaps = 10/62 (16%)
Query: 323 PGPGLRL---------CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKA 372
P PG L SH CG RR RC C AV+YC + QALDWK H+
Sbjct: 616 PAPGYTLHVPPVCFSAISHKKCGWCGRRREVLLRCGGCKAVSYCCKRHQALDWKEGPHRL 675
Query: 373 DC 374
+C
Sbjct: 676 EC 677
>gi|302654728|ref|XP_003019164.1| MYND domain protein (SamB), putative [Trichophyton verrucosum HKI
0517]
gi|291182869|gb|EFE38519.1| MYND domain protein (SamB), putative [Trichophyton verrucosum HKI
0517]
Length = 725
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 672 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKECQKSAW 712
>gi|85096529|ref|XP_960277.1| hypothetical protein NCU07116 [Neurospora crassa OR74A]
gi|28921762|gb|EAA31041.1| predicted protein [Neurospora crassa OR74A]
Length = 2055
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 23/42 (54%), Gaps = 3/42 (7%)
Query: 335 CGRPETRRHE--FRRCSVCGAVNYCSRACQALDWKLRHKADC 374
C P+ R + +CS C YCS+ CQ DWK RHK +C
Sbjct: 1987 CNEPKNRENGGGLNQCSKCKEARYCSKECQVADWK-RHKKEC 2027
>gi|118588908|ref|ZP_01546315.1| hypothetical protein SIAM614_12688 [Stappia aggregata IAM 12614]
gi|118438237|gb|EAV44871.1| hypothetical protein SIAM614_12688 [Stappia aggregata IAM 12614]
Length = 874
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 10/113 (8%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL-----QNRGSGASLMAKAAISSHAQALYSL 174
+ A + + G+V A LG R Y L ++R L KAA A AL+SL
Sbjct: 559 QQAIGYFEQAYQMGHVRAGQVLG--RMYFLGAGIDRDRKKAVELYRKAAERGDAYALHSL 616
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
+ + G G ++N+KD G+ ++ GH + +G +G V +N+
Sbjct: 617 GMAEIKGEGTAQNEKD---GLQKLLQSVEAGHTFSFNAIGGFYLNGQHVEENV 666
>gi|398824728|ref|ZP_10583049.1| Sel1 repeat protein [Bradyrhizobium sp. YR681]
gi|398224595|gb|EJN10896.1| Sel1 repeat protein [Bradyrhizobium sp. YR681]
Length = 1133
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 62/129 (48%), Gaps = 7/129 (5%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL---QNRGSGASLMAKAAISSHAQAL 171
A N+ E+A ++ A AG V A + LG + L ++ + +AA +A+A+
Sbjct: 934 AANYDEAA-KWYDRAAQAGVVPATFRLGTLYEKGLGVKKDADTARRYYTQAAERGNAKAM 992
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
++LAV+ +G G N K + +AA G D+ LG G GV QN+AE
Sbjct: 993 HNLAVLDADGGGRGANYK---SAAQWFRKAADRGVADSQFNLGILYARGIGVEQNLAESY 1049
Query: 232 RFLVQANAR 240
++ A A+
Sbjct: 1050 KWFSLAAAQ 1058
>gi|397636925|gb|EJK72461.1| hypothetical protein THAOC_06008, partial [Thalassiosira oceanica]
Length = 328
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 29/49 (59%)
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD R V L AA +G IDAL LG + G GV+Q++AEG F +A
Sbjct: 191 KDTRMAVKLWEEAAEIGSIDALFNLGLAHERGEGVKQDMAEGAEFYKKA 239
>gi|254585105|ref|XP_002498120.1| ZYRO0G02706p [Zygosaccharomyces rouxii]
gi|238941014|emb|CAR29187.1| ZYRO0G02706p [Zygosaccharomyces rouxii]
Length = 693
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E +F +C C YCSR CQ W
Sbjct: 494 GVRQCANFACGKWEDYPRQFAKCRRCKRTKYCSRDCQLKAW 534
>gi|389623179|ref|XP_003709243.1| SamB protein [Magnaporthe oryzae 70-15]
gi|351648772|gb|EHA56631.1| SamB protein [Magnaporthe oryzae 70-15]
gi|440465847|gb|ELQ35147.1| SamB protein [Magnaporthe oryzae Y34]
gi|440486485|gb|ELQ66346.1| SamB protein [Magnaporthe oryzae P131]
Length = 636
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 584 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 629
>gi|39936091|ref|NP_948367.1| hypothetical protein RPA3028 [Rhodopseudomonas palustris CGA009]
gi|39649945|emb|CAE28469.1| conserved unknown protein [Rhodopseudomonas palustris CGA009]
Length = 279
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 49/101 (48%), Gaps = 4/101 (3%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
A + L+ A AGN L + + +L+AK A S+A+A Y L ++ +G
Sbjct: 134 ARQLLERAAAAGNPRGISNLAALGGGASSDPVKTRALLAKGA-ESNAEAQYQLGMMMADG 192
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYG 222
GG K+D R AL +AA GH AL ++G Q G G
Sbjct: 193 IGGPKDDAGAR---ALFEKAAGQGHPGALMQMGAFAQAGRG 230
>gi|398406847|ref|XP_003854889.1| hypothetical protein MYCGRDRAFT_68176 [Zymoseptoria tritici IPO323]
gi|339474773|gb|EGP89865.1| hypothetical protein MYCGRDRAFT_68176 [Zymoseptoria tritici IPO323]
Length = 680
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 32/65 (49%), Gaps = 2/65 (3%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
G+R C++ CG+ E +F +C C YCS+ CQ W H+ C AE E G
Sbjct: 585 GIRQCAYWQCGKWEEYTRQFAKCRRCRRTKYCSKECQKGAWG-SHRFWCVAAEEHA-EGG 642
Query: 386 NGDVD 390
+G D
Sbjct: 643 SGRAD 647
>gi|258567220|ref|XP_002584354.1| hypothetical protein UREG_05043 [Uncinocarpus reesii 1704]
gi|237905800|gb|EEP80201.1| hypothetical protein UREG_05043 [Uncinocarpus reesii 1704]
Length = 604
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 553 GIRQCAYYKCGKWEEYQRQFAKCRRCRRTKYCSKECQKNAW 593
>gi|168334224|ref|ZP_02692425.1| Sel1 domain protein repeat-containing protein [Epulopiscium sp.
'N.t. morphotype B']
Length = 648
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 66/150 (44%), Gaps = 21/150 (14%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL--------MAKAAISSHAQAL 171
E A +F + A G+V+A L + C + +G G +L +AA+S A+A
Sbjct: 468 EMAFKFYQKAATLGSVKAQTNLAL----CYE-KGIGTTLDLDKAFEWYVRAAVSGFAKAQ 522
Query: 172 YSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGR 231
+L + NG G +KN +AA GH A L C + G GV +N+ E
Sbjct: 523 NNLGYLYENGKGATKN---YSKAFEWYQKAAIQGHAKAQYNLALCYEYGKGVIKNLDETF 579
Query: 232 RFL----VQANARELAAVLSSAACQGISTR 257
++ Q N A L +A +G+ T+
Sbjct: 580 KWFKESAEQGNMYAQYA-LGAAYIKGLGTK 608
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 46/114 (40%), Gaps = 6/114 (5%)
Query: 127 KLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSG 183
KL A+ G+ A Y + Y +N G KAA H +A Y L G+G
Sbjct: 403 KLMAERGDATAQYRIASAYIYGNGTEKNLIQGFRWYQKAAEQEHVEAQYKLGYCYEKGTG 462
Query: 184 GSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
D DL +AA LG + A L C + G G ++ + + V+A
Sbjct: 463 V---DSDLEMAFKFYQKAATLGSVKAQTNLALCYEKGIGTTLDLDKAFEWYVRA 513
>gi|68304974|gb|AAY89985.1| hypothetical protein PM1611 [uncultured bacterium BAC13K9BAC]
Length = 145
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 58/129 (44%), Gaps = 7/129 (5%)
Query: 112 AIKANNWSESAHRFLKLCADAGNVEACYTLGMIR---FYCLQNRGSGASLMAKAAISSHA 168
A + N++ A + L A+ GNV+A Y + ++ CL N M KAA
Sbjct: 17 AFETKNFA-HATKLLSPIAEEGNVDALYRMAIMLQNGLGCLANEDKAFLYMTKAAEDGLP 75
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
A+++L + F G +KD + RAA G + + LG +DG V+Q++
Sbjct: 76 LAMHALGFMYFEGEC---TEKDSNLCIKWFERAAAEGMMGSATTLGMIYEDGKIVKQDLK 132
Query: 229 EGRRFLVQA 237
+ + +A
Sbjct: 133 KAEEWYKKA 141
>gi|397591373|gb|EJK55350.1| hypothetical protein THAOC_24925 [Thalassiosira oceanica]
Length = 192
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 34/55 (61%)
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
GG K+++ V L A AA LG IDAL +LG+ +G GV+Q+ A+ +F +A
Sbjct: 80 GGLGLQKNVQKAVELYAEAAELGSIDALFDLGNAYYEGNGVQQDKAKAAQFWTKA 134
>gi|384142516|ref|YP_005525226.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii MDR-ZJ06]
gi|347593009|gb|AEP05730.1| TPR repeat-containing SEL1 subfamily protein [Acinetobacter
baumannii MDR-ZJ06]
Length = 156
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 55/116 (47%), Gaps = 7/116 (6%)
Query: 126 LKLCADAGNVEACYTLG---MIRFYCLQ-NRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
LK A+ G++ A +G + Y L + + KAA S A++ Y+LA++ NG
Sbjct: 30 LKSAAEQGDIRAQSQMGDAYLFGEYDLNVDYKQAINWYQKAADQSDAKSQYNLAIMYLNG 89
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G K DL V ++A G D+ +LG +G GV +NI + + +A
Sbjct: 90 YGVKK---DLSKSVEYYRKSALQGDADSQLQLGIRYLNGEGVERNIETAKEWFKKA 142
>gi|323447117|gb|EGB03067.1| hypothetical protein AURANDRAFT_34508 [Aureococcus anophagefferens]
Length = 345
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 57/140 (40%), Gaps = 17/140 (12%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASL--------MAKAAISSHAQAL 171
E R+ L AD G A LG C RG G + +AA H +A
Sbjct: 140 EEGFRYYALAADQGYTNAENNLG-----CCYERGKGTEVDLGKARYWFERAAAKGHEKAT 194
Query: 172 YSLAVIQFNGSGG-SKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEG 230
SLA+ G G ++DK + RA LG++DA+ LG + G GV+ + +
Sbjct: 195 QSLALAYRRGYFGLVQSDKK---AAKIYRRAVELGNVDAMSRLGEMTEYGSGVKLDKKKA 251
Query: 231 RRFLVQANARELAAVLSSAA 250
R A R A SS A
Sbjct: 252 ERLYRAAADRGCAIAQSSLA 271
>gi|302677194|ref|XP_003028280.1| hypothetical protein SCHCODRAFT_237338 [Schizophyllum commune H4-8]
gi|300101968|gb|EFI93377.1| hypothetical protein SCHCODRAFT_237338 [Schizophyllum commune H4-8]
Length = 301
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 19/31 (61%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
RC+ C AV YCS+ CQ DWK H+ C A
Sbjct: 269 RCAKCKAVKYCSKECQMQDWKGGHRMQCYEA 299
>gi|62320711|dbj|BAD95379.1| putative ubiquitin carboxyl terminal hydrolase [Arabidopsis
thaliana]
gi|227202736|dbj|BAH56841.1| AT2G24640 [Arabidopsis thaliana]
Length = 418
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 5/40 (12%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CG+ T++ CS C +V YCS ACQ DWK HK C
Sbjct: 67 CGKATTKK-----CSRCKSVRYCSAACQTSDWKSGHKLKC 101
>gi|393228864|gb|EJD36499.1| hypothetical protein AURDEDRAFT_174458 [Auricularia delicata
TFB-10046 SS5]
Length = 364
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 28/51 (54%), Gaps = 5/51 (9%)
Query: 330 CSHVGCGRPETRRHEFRRCSV-CGA---VNYCSRACQALDWKLRHKADCAP 376
C+ GCG T++ RCS C A +YCS+ CQ DWK RHK C P
Sbjct: 238 CAAPGCGITATKKAALLRCSGGCPAETKAHYCSKECQTADWK-RHKPACRP 287
>gi|365920829|ref|ZP_09445138.1| Sel1 repeat protein [Cardiobacterium valvarum F0432]
gi|364577292|gb|EHM54573.1| Sel1 repeat protein [Cardiobacterium valvarum F0432]
Length = 364
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 3/88 (3%)
Query: 156 ASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGH 215
A+ KAA+ A A ++L ++ G G ++ D A +AA G + A LG
Sbjct: 194 AAWYEKAAVQGDAAAQFNLGLLYDEGKGVTQ---DYTQAAAWYEKAAVQGLVQAQYNLGV 250
Query: 216 CLQDGYGVRQNIAEGRRFLVQANARELA 243
+DG GV Q+ + RR+ +A A+ A
Sbjct: 251 LYRDGQGVAQDYGKARRWFEKAAAQNFA 278
>gi|302692150|ref|XP_003035754.1| hypothetical protein SCHCODRAFT_105161 [Schizophyllum commune H4-8]
gi|300109450|gb|EFJ00852.1| hypothetical protein SCHCODRAFT_105161, partial [Schizophyllum
commune H4-8]
Length = 571
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 26/48 (54%), Gaps = 6/48 (12%)
Query: 330 CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
CS GC T RRCS CG YCS CQ++DW RH+A C P
Sbjct: 438 CSREGCLGDAT----LRRCS-CGFALYCSTDCQSVDWS-RHRAHCQPV 479
>gi|85098706|ref|XP_960655.1| hypothetical protein NCU01252 [Neurospora crassa OR74A]
gi|12718251|emb|CAC28640.1| related to samB [Neurospora crassa]
gi|28922167|gb|EAA31419.1| hypothetical protein NCU01252 [Neurospora crassa OR74A]
Length = 622
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 570 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 615
>gi|392562241|gb|EIW55422.1| hypothetical protein TRAVEDRAFT_131337 [Trametes versicolor FP-101664
SS1]
Length = 1177
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 4/45 (8%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
C P+ + R+CS C V+YCS CQ DWK +HK C A+
Sbjct: 1137 CSWPDNK---MRKCSRCKTVSYCSEGCQKADWK-KHKPACHVAQE 1177
>gi|344249389|gb|EGW05493.1| Programmed cell death protein 2 [Cricetulus griseus]
Length = 708
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 27/53 (50%), Gaps = 7/53 (13%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
G LC GC P+T CS C +YC + Q LDW+L HK CA ++
Sbjct: 494 GAHLCRVCGCLGPKT-------CSRCKQAHYCGKEHQTLDWRLGHKHACAQSD 539
>gi|426355241|ref|XP_004045037.1| PREDICTED: programmed cell death protein 2 isoform 1 [Gorilla
gorilla gorilla]
Length = 344
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 173
>gi|395738003|ref|XP_002817656.2| PREDICTED: programmed cell death protein 2-like isoform 2 [Pongo
abelii]
Length = 344
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 173
>gi|114049311|ref|YP_739861.1| Sel1 domain-containing protein [Shewanella sp. MR-7]
gi|113890753|gb|ABI44804.1| Sel1 domain protein repeat-containing protein [Shewanella sp. MR-7]
Length = 485
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 5/115 (4%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
A +L+L A+ EA LG++ +Y ++ + KAA + + A Y LA++ N
Sbjct: 287 QAKHYLQLAAEGEQTEAMANLGVL-YYQAKDLTQAYHFINKAAQAGYPHAQYHLALMLAN 345
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFL 234
G G +D A AA G +DA L H L D G+ ++++ R+L
Sbjct: 346 GDGCQ---RDPIVSEYWMAEAAEQGQLDAMLTRAQHMLNDESGLGGDVSQAERYL 397
>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
Length = 530
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 7/57 (12%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
+CSH ++ E R CS C AV+YC ACQA +W H +C R + E+G
Sbjct: 48 ICSHC------FKQAEVRACSRCHAVSYCDAACQAANWTAVHSKECK-VLRKVTEQG 97
>gi|332264011|ref|XP_003281042.1| PREDICTED: programmed cell death protein 2 isoform 1 [Nomascus
leucogenys]
Length = 344
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 173
>gi|119567801|gb|EAW47416.1| programmed cell death 2, isoform CRA_d [Homo sapiens]
Length = 334
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 121 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 163
>gi|169863653|ref|XP_001838445.1| hypothetical protein CC1G_09073 [Coprinopsis cinerea okayama7#130]
gi|116500484|gb|EAU83379.1| hypothetical protein CC1G_09073 [Coprinopsis cinerea okayama7#130]
Length = 684
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 28/64 (43%), Gaps = 1/64 (1%)
Query: 312 LAEWFAARGGTPGPGLRLCSHVGCGR-PETRRHEFRRCSVCGAVNYCSRACQALDWKLRH 370
+ E +A G+ C + R P R + +CS C V YCS CQ DW H
Sbjct: 393 IDERYAMMEAVENDGIDFCDNYAHKRRPRKRLSQLSQCSRCRTVVYCSHECQKEDWVAGH 452
Query: 371 KADC 374
+ +C
Sbjct: 453 RDEC 456
>gi|21735592|ref|NP_002589.2| programmed cell death protein 2 isoform 1 [Homo sapiens]
gi|28202261|sp|Q16342.2|PDCD2_HUMAN RecName: Full=Programmed cell death protein 2; AltName: Full=Zinc
finger MYND domain-containing protein 7; AltName:
Full=Zinc finger protein Rp-8
gi|14249983|gb|AAH08378.1| Programmed cell death 2 [Homo sapiens]
gi|119567802|gb|EAW47417.1| programmed cell death 2, isoform CRA_e [Homo sapiens]
gi|123993599|gb|ABM84401.1| programmed cell death 2 [synthetic construct]
gi|123999602|gb|ABM87344.1| programmed cell death 2 [synthetic construct]
gi|410221952|gb|JAA08195.1| programmed cell death 2 [Pan troglodytes]
gi|410251694|gb|JAA13814.1| programmed cell death 2 [Pan troglodytes]
gi|410303276|gb|JAA30238.1| programmed cell death 2 [Pan troglodytes]
Length = 344
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 173
>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
mellifera]
Length = 440
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 34/62 (54%), Gaps = 9/62 (14%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDE------EGNGDVDVDLNDAEGGG 400
RCSVC + YC+++CQ + W + H +CA +R+ + + + LN +GGG
Sbjct: 44 RCSVCKCIYYCNQSCQQMSWTI-HSKECASLKRFSSKVIPDVARLMARIIIKLN--QGGG 100
Query: 401 ED 402
E+
Sbjct: 101 EE 102
>gi|320032209|gb|EFW14164.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 240
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 343 HEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
H+ ++C+ C + YCS CQA W + HK DC AER
Sbjct: 195 HKLKQCTRCKMIQYCSIECQAHHWPI-HKKDCRNAER 230
>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
Length = 544
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
E + C+ C V YC++ CQA DWK H +C
Sbjct: 120 ELKACTGCRVVKYCNKTCQAKDWKFAHSFEC 150
>gi|998901|gb|AAB34865.1| programmed cell death-2/Rp8 homolog [Homo sapiens]
Length = 344
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 173
>gi|393229535|gb|EJD37156.1| hypothetical protein AURDEDRAFT_173790 [Auricularia delicata
TFB-10046 SS5]
Length = 468
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 34/79 (43%), Gaps = 16/79 (20%)
Query: 303 PEAHPASRFLAEWFAARGGTPGPGLRLCSHVGCGRPE-TRRHEFRR---CSVCGAVNYCS 358
P +H FL A R C+ GC E + H RR C +C V YCS
Sbjct: 332 PRSHAVHDFLERCMANRA---------CAAPGCDEAELSETHGVRRFLSCELCRIVRYCS 382
Query: 359 RACQALDWK---LRHKADC 374
+ACQ DWK RHK C
Sbjct: 383 KACQRADWKHGAPRHKDVC 401
>gi|348673062|gb|EGZ12881.1| hypothetical protein PHYSODRAFT_334723 [Phytophthora sojae]
Length = 1501
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 20/41 (48%)
Query: 334 GCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
CG P + C C AV YCSR CQ + WK H+ C
Sbjct: 1254 ACGNPAYPGEKLSACGACKAVKYCSRDCQRVHWKRIHRHMC 1294
>gi|328849245|gb|EGF98429.1| hypothetical protein MELLADRAFT_69335 [Melampsora larici-populina
98AG31]
Length = 324
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 16/31 (51%), Positives = 21/31 (67%), Gaps = 1/31 (3%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+ RRC+ C V+YC CQ LDW + HKA+C
Sbjct: 123 QIRRCTKCRVVSYCGENCQRLDW-VSHKAEC 152
>gi|302677442|ref|XP_003028404.1| hypothetical protein SCHCODRAFT_112780 [Schizophyllum commune H4-8]
gi|300102092|gb|EFI93501.1| hypothetical protein SCHCODRAFT_112780 [Schizophyllum commune H4-8]
Length = 558
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
++ C +V CG ++ + +RC+ CG YC + CQ DW RHKA C P
Sbjct: 422 AIQYCHNVSCGNRDSGA-KLKRCA-CGYAYYCCKTCQVADWS-RHKAACRPV 470
>gi|393243888|gb|EJD51402.1| hypothetical protein AURDEDRAFT_143169 [Auricularia delicata
TFB-10046 SS5]
Length = 534
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 327 LRLCSHVGCGRPETRR-HEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
+R C + CG E H+F+RC+ C V YCS ACQ W H+A C
Sbjct: 430 IRRCWNQECGEMEMLAGHKFQRCACCKVVRYCSSACQRAAWD-THQAPC 477
>gi|384171495|ref|YP_005552872.1| hypothetical protein [Arcobacter sp. L]
gi|345471105|dbj|BAK72555.1| conserved hypothetical protein [Arcobacter sp. L]
Length = 243
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 45/103 (43%), Gaps = 4/103 (3%)
Query: 84 VNVLITCKRMNGLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGM 143
+ +LI + L+L +L S + +K N E+A+ FLK C D G CY LG+
Sbjct: 2 LKILIKILLFSILSLQTLSASVVNDGFTELKKGNVLEAANLFLKAC-DEGATSGCYNLGL 60
Query: 144 IRF---YCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSG 183
+ + QN KA H A Y+LA + N G
Sbjct: 61 LYYKGDKIEQNYPKAIEYFTKACNDGHTTACYNLAYMYQNAQG 103
>gi|326923979|ref|XP_003208210.1| PREDICTED: tudor domain-containing protein 1-like [Meleagris
gallopavo]
Length = 946
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 24/55 (43%), Gaps = 8/55 (14%)
Query: 322 TPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
TP P +R C H G RCS C + YCS CQ DW + H C P
Sbjct: 68 TPPPQMRTCHHCGLSGS-------LRCSQCKQIYYCSVDCQKRDWSV-HSVVCEP 114
>gi|189501781|ref|YP_001957498.1| hypothetical protein Aasi_0340 [Candidatus Amoebophilus asiaticus
5a2]
gi|189497222|gb|ACE05769.1| hypothetical protein Aasi_0340 [Candidatus Amoebophilus asiaticus
5a2]
Length = 2413
Score = 40.0 bits (92), Expect = 2.0, Method: Composition-based stats.
Identities = 31/105 (29%), Positives = 48/105 (45%), Gaps = 3/105 (2%)
Query: 122 AHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNG 181
A ++ + A+ GN +A + LG++ N KAA HA A + L V+ NG
Sbjct: 2218 AIKWFQKAAEQGNADAQFNLGVMYEKVEGNYKKAIKWFQKAAEQGHADAQFKLGVMYHNG 2277
Query: 182 SGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
G +K+D V +AA ++ A +LG G GV Q+
Sbjct: 2278 EGVAKDDNQ---AVFWYRKAAGQRNVKAQFKLGVMYYHGQGVGQD 2319
>gi|167386456|ref|XP_001737759.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165899211|gb|EDR25850.1| hypothetical protein EDI_044060 [Entamoeba dispar SAW760]
Length = 1079
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 45/104 (43%), Gaps = 17/104 (16%)
Query: 133 GNVEACYTLGMIRFYCL----------QNRGSGASLMAKAAISSHAQALYSLAVIQFNGS 182
GN+EA Y L YCL + + + L+ KAA + + L FNG
Sbjct: 695 GNIEALYNLS----YCLLHGEGCEGTEKEKKTAIKLLLKAAKKGYIPSQLLLGNCYFNGI 750
Query: 183 GGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQN 226
G K++K + V +AA + A+ LG C G GV +N
Sbjct: 751 GTLKDEK---SAVYWYTKAALQNNSTAINNLGSCFYKGNGVEKN 791
>gi|358378079|gb|EHK15762.1| hypothetical protein TRIVIDRAFT_38220 [Trichoderma virens Gv29-8]
Length = 731
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 55/126 (43%), Gaps = 16/126 (12%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSG 183
++L+ AD + E GM + N + A+ A+S +Y L V NG
Sbjct: 574 QWLRKAADMASGEVAEDEGMAKEGKRINIAENKTHKAQFALS-----IYELGVSHMNG-W 627
Query: 184 GSKNDKDLRAGVALCAR----AAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQANA 239
G + DK AL R A G +DAL E G C G G ++N+ + +F A A
Sbjct: 628 GIEQDK------ALALRCFEIAGTWGDVDALAEAGFCYAQGIGCKKNLKKSAKFYRMAEA 681
Query: 240 RELAAV 245
+ ++ V
Sbjct: 682 KGMSMV 687
>gi|242081757|ref|XP_002445647.1| hypothetical protein SORBIDRAFT_07g023330 [Sorghum bicolor]
gi|241941997|gb|EES15142.1| hypothetical protein SORBIDRAFT_07g023330 [Sorghum bicolor]
Length = 998
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 26/56 (46%), Gaps = 7/56 (12%)
Query: 335 CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAP--AERWLDEEGNGD 388
C +P T R C C AV YCS CQ W+ HK DC P D+EG +
Sbjct: 124 CFKPTTFR-----CKQCKAVKYCSFKCQIAHWRQGHKDDCHPPSVNTRPDDEGKAE 174
>gi|156041138|ref|XP_001587555.1| hypothetical protein SS1G_11548 [Sclerotinia sclerotiorum 1980]
gi|154695931|gb|EDN95669.1| hypothetical protein SS1G_11548 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 632
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G+R C++ CG+ E +F +C C YCS+ CQ W H+ C A +
Sbjct: 580 GIRQCAYYQCGKWEEYTRQFAKCRRCRRTKYCSKECQKSAWVF-HRHWCVQASQ 632
>gi|118638406|gb|ABL09362.1| programmed cell death 2 alternative transcript [Mus musculus]
Length = 258
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 27/54 (50%), Gaps = 7/54 (12%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G LC GC P T CS C +YCS+ Q LDW+L HK C +++
Sbjct: 130 GAHLCRVCGCLAPMT-------CSRCKQAHYCSKEHQTLDWRLGHKQACTQSDK 176
>gi|115387447|ref|XP_001211229.1| hypothetical protein ATEG_02051 [Aspergillus terreus NIH2624]
gi|121741341|sp|Q0CW83.1|MUB1_ASPTN RecName: Full=MYND-type zinc finger protein samB; AltName:
Full=Suppressor of anucleate metulae protein B
gi|114195313|gb|EAU37013.1| hypothetical protein ATEG_02051 [Aspergillus terreus NIH2624]
Length = 603
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 552 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKDCQKAAW 592
>gi|428185949|gb|EKX54800.1| hypothetical protein GUITHDRAFT_99450 [Guillardia theta CCMP2712]
Length = 712
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 65/121 (53%), Gaps = 5/121 (4%)
Query: 125 FLKLCADAGNVEACYTLG--MIRFYCLQ-NRGSGASLMAKAAISSHAQALYSLA-VIQFN 180
F + A G+ +A Y L M++ ++ + G ++ +AA ++ AQA + LA +++
Sbjct: 49 FERRAAKKGHAKAQYNLAVCMLKGTGMEADEEGGMRMLRRAADNNSAQAQFQLAAMLEEA 108
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDG-YGVRQNIAEGRRFLVQANA 239
G+K+ + +L AR+A LG ++A L C +DG YG++ N R++ +A A
Sbjct: 109 LRAGAKDKEKAEEIRSLLARSAGLGLVEAQMRLARCFKDGLYGLQVNFGAYVRWMEKAAA 168
Query: 240 R 240
+
Sbjct: 169 Q 169
>gi|507906|gb|AAA83433.1| zinc finger protein [Mus musculus]
Length = 343
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 27/54 (50%), Gaps = 7/54 (12%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G LC GC P T CS C +YCS+ Q LDW+L HK C +++
Sbjct: 130 GAHLCRVCGCLAPMT-------CSRCKQAHYCSKEHQTLDWRLGHKQACTQSDK 176
>gi|47117852|sp|P46718.2|PDCD2_MOUSE RecName: Full=Programmed cell death protein 2; AltName: Full=Zinc
finger protein Rp-8
Length = 343
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 27/54 (50%), Gaps = 7/54 (12%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G LC GC P T CS C +YCS+ Q LDW+L HK C +++
Sbjct: 130 GAHLCRVCGCLAPMT-------CSRCKQAHYCSKEHQTLDWRLGHKQACTQSDK 176
>gi|357485573|ref|XP_003613074.1| Ubiquitin carboxyl-terminal hydrolase [Medicago truncatula]
gi|355514409|gb|AES96032.1| Ubiquitin carboxyl-terminal hydrolase [Medicago truncatula]
Length = 1096
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 9/62 (14%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEGNGDV---------DVDLNDAE 397
RC+ C AV YCS CQ W+ HK +C P+ E N +V + DL++++
Sbjct: 95 RCARCKAVRYCSGKCQISHWRQGHKDECCPSITTTREVNNENVTSRASASGTEFDLHESK 154
Query: 398 GG 399
G
Sbjct: 155 GN 156
>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
Length = 544
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
E + C+ C V YC++ CQA DWK H +C
Sbjct: 120 ELKACTGCRVVKYCNKTCQAKDWKFAHSFEC 150
>gi|441602293|ref|XP_004087725.1| PREDICTED: programmed cell death protein 2 [Nomascus leucogenys]
Length = 298
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 173
>gi|397676942|ref|YP_006518480.1| Sel1 domain-containing protein repeat-containing protein [Zymomonas
mobilis subsp. mobilis ATCC 29191]
gi|395397631|gb|AFN56958.1| Sel1 domain protein repeat-containing protein [Zymomonas mobilis
subsp. mobilis ATCC 29191]
Length = 373
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 67/149 (44%), Gaps = 10/149 (6%)
Query: 95 GLALNSLVLSKASKKTFAIKANNWSESAHRFLKLCADAGNVEACYTLGMIRFYC---LQN 151
GLA +L L A +K I ++ A + K AD G+V+A +G + Q+
Sbjct: 215 GLAYRNLGL--AYRKGEGISQDD--AKAAFWYKKAADQGHVKAQLNMGFAYYQARGVAQD 270
Query: 152 RGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALR 211
G L KAA ++A Y+LA+ +N G +DL + RAA G + A
Sbjct: 271 YARGIFLYRKAAEQGDSKAEYNLAIAYYNAEG---EPRDLAQSIDWFQRAASHGEVSAQY 327
Query: 212 ELGHCLQDGYGVRQNIAEGRRFLVQANAR 240
LG G GV ++ + +L +A AR
Sbjct: 328 NLGIFYLKGEGVPKDRNQAIFWLEKAAAR 356
>gi|393227832|gb|EJD35495.1| hypothetical protein AURDEDRAFT_188720 [Auricularia delicata
TFB-10046 SS5]
Length = 628
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 26/53 (49%), Gaps = 6/53 (11%)
Query: 325 PGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWK---LRHKADC 374
P R CS GC + R +C C +YCSR CQ DWK L HKA C
Sbjct: 443 PRQRACSRPGCTTADPR---LAQCGKCHVASYCSRDCQRADWKDELLPHKAVC 492
>gi|367037107|ref|XP_003648934.1| hypothetical protein THITE_2106957 [Thielavia terrestris NRRL 8126]
gi|346996195|gb|AEO62598.1| hypothetical protein THITE_2106957 [Thielavia terrestris NRRL 8126]
Length = 617
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 565 GIRQCAYYQCGKWEEFTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 610
>gi|120407033|ref|NP_032825.2| programmed cell death protein 2 [Mus musculus]
gi|26337749|dbj|BAC32560.1| unnamed protein product [Mus musculus]
gi|148688511|gb|EDL20458.1| programmed cell death 2, isoform CRA_b [Mus musculus]
gi|151555611|gb|AAI48379.1| Programmed cell death 2 [synthetic construct]
Length = 343
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 27/54 (50%), Gaps = 7/54 (12%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G LC GC P T CS C +YCS+ Q LDW+L HK C +++
Sbjct: 130 GAHLCRVCGCLAPMT-------CSRCKQAHYCSKEHQTLDWRLGHKQACTQSDK 176
>gi|328860497|gb|EGG09603.1| hypothetical protein MELLADRAFT_61078 [Melampsora larici-populina
98AG31]
Length = 190
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 6/56 (10%)
Query: 324 GPGLRLCSHVGCGR-----PETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
P +CSH+ C + ++ E + C+ C V YC+ ACQ DWK RHK C
Sbjct: 129 APNEMVCSHLECPKMINAAKLSKYVELQHCARCKCVFYCNAACQKKDWK-RHKKYC 183
>gi|310792622|gb|EFQ28149.1| MYND finger [Glomerella graminicola M1.001]
Length = 627
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 575 GIRQCAYYQCGKWEEFTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 620
>gi|299749011|ref|XP_001838436.2| hypothetical protein CC1G_09064 [Coprinopsis cinerea okayama7#130]
gi|298408241|gb|EAU83370.2| hypothetical protein CC1G_09064 [Coprinopsis cinerea okayama7#130]
Length = 619
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 5/53 (9%)
Query: 325 PGLRLCS---HVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
P L LC HV + +R ++C C +V YCS ACQ DW+ RH+ +C
Sbjct: 415 PTLNLCDNELHVHADGMDRKRS--KKCGRCHSVVYCSSACQHEDWERRHQREC 465
>gi|220904871|ref|YP_002480183.1| Sel1 domain-containing protein [Desulfovibrio desulfuricans subsp.
desulfuricans str. ATCC 27774]
gi|219869170|gb|ACL49505.1| Sel1 domain protein repeat-containing protein [Desulfovibrio
desulfuricans subsp. desulfuricans str. ATCC 27774]
Length = 222
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 63/128 (49%), Gaps = 9/128 (7%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCL-----QNRGSGASLMAKAAISSHAQALYSL 174
E+A ++ +L A+ N EA L Y L ++ + LM AA + +A+AL++L
Sbjct: 83 ETAAKWFRLAAEQNNSEAQVLLAYQ--YELGQGVPKDDAAVVRLMTSAANAGNAEALFNL 140
Query: 175 AVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFL 234
A+ ++G G K+ L AA G+ A R +G C + G GV +N E + +
Sbjct: 141 AL--YHGQGKYGFAKNPAESFRLAKMAADQGNAQAQRYVGACYEYGVGVPENATEAQVWY 198
Query: 235 VQANAREL 242
+A A+ L
Sbjct: 199 GKAKAQGL 206
>gi|429852531|gb|ELA27663.1| mynd domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 629
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 577 GIRQCAYYQCGKWEEFTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 622
>gi|358366622|dbj|GAA83242.1| MYND domain protein [Aspergillus kawachii IFO 4308]
Length = 605
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 554 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKDCQKAAW 594
>gi|254568086|ref|XP_002491153.1| Activator of Chs3p (chitin synthase III), recruits Chs3p to the bud
neck via interaction with Bni4p [Komagataella pastoris
GS115]
gi|238030950|emb|CAY68873.1| Activator of Chs3p (chitin synthase III), recruits Chs3p to the bud
neck via interaction with Bni4p [Komagataella pastoris
GS115]
gi|328352324|emb|CCA38723.1| Protein SKT5 [Komagataella pastoris CBS 7435]
Length = 575
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 54/121 (44%), Gaps = 17/121 (14%)
Query: 119 SESAHRFLKLCADAGNVEACYTLGMIRFYC-------LQNRGSGASLMAKAAISS---HA 168
S A FLK A + A Y +G+ FY + + +G +++A + + A
Sbjct: 340 SRKAVEFLKFAASRNHAAAMYKMGLYSFYAKMGLPNNITTKKAGIQWLSRATMRATELTA 399
Query: 169 QALYSLAVIQFNGSGGSKND---KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQ 225
A Y LA I ++G D D + + L +AA LGH+ + LGH + G + Q
Sbjct: 400 AAPYELAKIHYHGF----EDIVIPDRKYALELYVKAASLGHVKSAAILGHHYEVGDVIPQ 455
Query: 226 N 226
+
Sbjct: 456 D 456
>gi|113972081|ref|YP_735874.1| Sel1 domain-containing protein [Shewanella sp. MR-4]
gi|113886765|gb|ABI40817.1| Sel1 domain protein repeat-containing protein [Shewanella sp. MR-4]
Length = 485
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 54/115 (46%), Gaps = 5/115 (4%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFN 180
A +L+L A+ EA LG++ +Y ++ + KAA + + A Y LA++ N
Sbjct: 287 QAKHYLQLAAEGEQTEAMANLGVL-YYQAKDLTQAYHFINKAAQAGYPHAQYHLALMLAN 345
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDA-LRELGHCLQDGYGVRQNIAEGRRFL 234
G G +D A AA G +DA L H L D G+ ++++ R+L
Sbjct: 346 GDGCQ---RDPIVSEYWMAEAAEQGQLDAMLTRAQHMLNDESGLGGDVSQAERYL 397
>gi|295670449|ref|XP_002795772.1| MYND domain protein (SamB) [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284857|gb|EEH40423.1| MYND domain protein (SamB) [Paracoccidioides sp. 'lutzii' Pb01]
Length = 617
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 1/52 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RHKADCAP 376
G+R C++ CG+ E + +F +C C YCS+ CQ W RH P
Sbjct: 559 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKECQKNAWAFHRHWCHITP 610
>gi|226294126|gb|EEH49546.1| SamB protein [Paracoccidioides brasiliensis Pb18]
Length = 618
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E + +F +C C YCS+ CQ W RH
Sbjct: 559 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKECQKNAWAFHRH 604
>gi|224056573|ref|XP_002298917.1| predicted protein [Populus trichocarpa]
gi|222846175|gb|EEE83722.1| predicted protein [Populus trichocarpa]
Length = 665
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 7/58 (12%)
Query: 321 GTPGPGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAE 378
T G+++C+ C P T R CS C +V YCS CQ + W+ HK +C E
Sbjct: 67 STSKNGIQVCAR--CFSPATTR-----CSRCKSVRYCSGKCQIIHWRQAHKEECQLLE 117
>gi|157104842|ref|XP_001648596.1| hypothetical protein AaeL_AAEL004183 [Aedes aegypti]
gi|108880244|gb|EAT44469.1| AAEL004183-PA [Aedes aegypti]
Length = 422
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 25/49 (51%), Gaps = 7/49 (14%)
Query: 327 LRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+ LCS G +P+ ++CS C V YC R CQ L W HK CA
Sbjct: 346 ISLCSSCGEEKPD------KKCSKCKEVQYCDRECQRLHW-FMHKKVCA 387
>gi|167390305|ref|XP_001739292.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897066|gb|EDR24334.1| hypothetical protein EDI_070760 [Entamoeba dispar SAW760]
Length = 773
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 49/108 (45%), Gaps = 9/108 (8%)
Query: 120 ESAHRFLKLCADAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQF 179
E A +K DAG A G RF N+ G L+ KA + +ALY+LA+
Sbjct: 642 EEAVSLIKEAVDAGYWVAESWYG--RFIYANNQREGLKLIHKACLMEDKRALYALAI--- 696
Query: 180 NGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNI 227
S ++D+ + + ++DA+ +G C G GV++++
Sbjct: 697 ----HSLKNQDVNYAIKILLNLCKSSYVDAIVTVGVCFMKGVGVKKDV 740
>gi|145254566|ref|XP_001398665.1| MYND-type zinc finger protein samB [Aspergillus niger CBS 513.88]
gi|294956657|sp|A2RA63.1|MUB1_ASPNC RecName: Full=MYND-type zinc finger protein samB; AltName:
Full=Suppressor of anucleate metulae protein B
gi|134084246|emb|CAK47278.1| unnamed protein product [Aspergillus niger]
gi|350630518|gb|EHA18890.1| hypothetical protein ASPNIDRAFT_211829 [Aspergillus niger ATCC
1015]
Length = 605
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 554 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKDCQKAAW 594
>gi|70997359|ref|XP_753428.1| MYND domain protein (SamB) [Aspergillus fumigatus Af293]
gi|74673523|sp|Q4WVI6.1|MUB1_ASPFU RecName: Full=MYND-type zinc finger protein samB; AltName:
Full=Suppressor of anucleate metulae protein B
gi|66851064|gb|EAL91390.1| MYND domain protein (SamB), putative [Aspergillus fumigatus Af293]
Length = 605
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 554 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKDCQKAAW 594
>gi|426355243|ref|XP_004045038.1| PREDICTED: programmed cell death protein 2 isoform 2 [Gorilla
gorilla gorilla]
Length = 298
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 25/50 (50%), Gaps = 7/50 (14%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
G LC GC P+T CS C YCS+ Q LDW+L HK CA
Sbjct: 131 GAHLCRVCGCLGPKT-------CSRCHKAYYCSKEHQTLDWRLGHKQACA 173
>gi|393227325|gb|EJD35009.1| hypothetical protein AURDEDRAFT_175918 [Auricularia delicata
TFB-10046 SS5]
Length = 177
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 27/56 (48%), Gaps = 6/56 (10%)
Query: 325 PGLRLCSHVGCGRPETRRHE---FRRCSVCGAVNYCSRACQALDWK---LRHKADC 374
P LR C GCG+ + F RC+ C + YCSR CQ DW+ HK C
Sbjct: 24 PRLRDCDGPGCGKTVHEKETPGAFARCANCRLMQYCSRDCQKADWRDSGFPHKEIC 79
>gi|225684470|gb|EEH22754.1| MYND domain protein (SamB) [Paracoccidioides brasiliensis Pb03]
Length = 618
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E + +F +C C YCS+ CQ W RH
Sbjct: 559 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKECQKNAWAFHRH 604
>gi|294956655|sp|B0Y1D1.1|MUB1_ASPFC RecName: Full=MYND-type zinc finger protein samB; AltName:
Full=Suppressor of anucleate metulae protein B
gi|159126845|gb|EDP51961.1| MYND domain protein (SamB), putative [Aspergillus fumigatus A1163]
Length = 605
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 554 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKDCQKAAW 594
>gi|168011115|ref|XP_001758249.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690705|gb|EDQ77071.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 430
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 18/29 (62%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADC 374
RRCS C AV YCSR CQ W HK DC
Sbjct: 10 RRCSRCKAVFYCSRECQERHWHGGHKNDC 38
>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
Length = 544
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 344 EFRRCSVCGAVNYCSRACQALDWKLRHKADC 374
E + C+ C V YC++ CQA DWK H +C
Sbjct: 120 ELKACTGCRVVKYCNKTCQAKDWKFVHSFEC 150
>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
Length = 520
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 7/57 (12%)
Query: 329 LCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAERWLDEEG 385
+CSH ++ E R CS C AV+YC ACQA +W H +C R + E+G
Sbjct: 48 ICSHC------FKQAEVRACSRCHAVSYCDAACQAANWTAVHSKECK-VLRKVTEQG 97
>gi|395332614|gb|EJF64993.1| hypothetical protein DICSQDRAFT_167141 [Dichomitus squalens
LYAD-421 SS1]
Length = 552
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 26/56 (46%), Gaps = 13/56 (23%)
Query: 333 VGCGRPETRRHE-------------FRRCSVCGAVNYCSRACQALDWKLRHKADCA 375
+ G+ E RR + + CS C A YC ACQ D+K RHK +CA
Sbjct: 19 INQGKEEERRRQAKCTYCQNESGDALKNCSRCKAARYCDEACQLADFKARHKRECA 74
>gi|239908141|ref|YP_002954882.1| hypothetical protein DMR_35050 [Desulfovibrio magneticus RS-1]
gi|239798007|dbj|BAH76996.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length = 705
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 62/118 (52%), Gaps = 6/118 (5%)
Query: 141 LGMIRFYCLQNRGSGASLMAKAAISSHAQALYSLAVIQFNGSGGSKNDKDLRAGVALCAR 200
LGM+R + +GA+L+++AA + QA+Y++A++ +G G S N + CA+
Sbjct: 377 LGMVRGRLPGDPTAGAALLSQAAWAGQPQAMYAMALLHLSGQGVSGNPAE---AFRWCAK 433
Query: 201 AAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQ-ANARELAA--VLSSAACQGIS 255
AA G +A L ++ + N+A+ + + ANA +L A L S +G++
Sbjct: 434 AAEAGVPEAKGLLAALSEEEFPSAANLAKAVTYYREAANAGDLEAGFALGSLLSKGLA 491
>gi|402081216|gb|EJT76361.1| SamB protein [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 630
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 578 GIRQCAYYQCGKWEEFTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 623
>gi|197117516|ref|YP_002137943.1| SEL1 repeat-containing lipoprotein [Geobacter bemidjiensis Bem]
gi|197086876|gb|ACH38147.1| SEL1 repeat-containing protein [Geobacter bemidjiensis Bem]
Length = 272
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 53/117 (45%), Gaps = 6/117 (5%)
Query: 124 RFLKLCADAGNVEACYTLGMIRFYCLQ---NRGSGASLMAKAAISSHAQALYSLAVIQFN 180
++ + A+ G V A LG+I ++ ++ A +KAA+ AQA Y ++
Sbjct: 142 KWYRRDAERGMVRAQVNLGLIYYFGRDLGADKREAALWFSKAAVQGDAQAQYYYGLMLSE 201
Query: 181 GSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
G G D D G L AA GH DA ++ C G GV +N E +L +A
Sbjct: 202 GDG---IDPDAVEGARLLLSAAKAGHADAQYQISKCYLHGKGVARNADEAVAWLRKA 255
>gi|145519479|ref|XP_001445606.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413061|emb|CAK78209.1| unnamed protein product [Paramecium tetraurelia]
Length = 411
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 13/33 (39%), Positives = 19/33 (57%)
Query: 345 FRRCSVCGAVNYCSRACQALDWKLRHKADCAPA 377
F C +C + YCS+ C+ DW + HK DC +
Sbjct: 13 FLTCPLCQKITYCSKKCRDYDWAISHKFDCKQS 45
>gi|152997375|ref|YP_001342210.1| Sel1 domain-containing protein [Marinomonas sp. MWYL1]
gi|150838299|gb|ABR72275.1| Sel1 domain protein repeat-containing protein [Marinomonas sp.
MWYL1]
Length = 685
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 58/120 (48%), Gaps = 11/120 (9%)
Query: 115 ANNWSESAHRFLKLCADAGNVEACYTLGMIRFYCL-----QNRGSGASLMAKAAISSHAQ 169
A +++E+ +LK D N EA Y LG + Y + Q+ A+ KAA +
Sbjct: 211 AQDFTEAEKWYLKAAED-DNGEAEYNLGYM--YEMGEGVAQDYEVAANWYRKAAEQDYTN 267
Query: 170 ALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAE 229
A +LA + ++G G +K + V R+A LG+ A LG+ ++G G+ QN E
Sbjct: 268 AQNALAYLYYSGQG---IEKSYQESVNWYLRSAELGNNSAQYNLGYFYEEGIGIPQNFPE 324
>gi|397643264|gb|EJK75751.1| hypothetical protein THAOC_02518 [Thalassiosira oceanica]
Length = 278
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 169 QALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIA 228
QA+ L ++G+ G + KD R L AA LG +DAL LG+ +DG V Q++A
Sbjct: 151 QAILYLGRAYYHGTFGLQ--KDTRRAFQLYTEAAELGSVDALFSLGNAYEDGVVVGQDMA 208
Query: 229 EGRRFLVQA 237
+ F +A
Sbjct: 209 KAAEFWTKA 217
>gi|397626218|gb|EJK68075.1| hypothetical protein THAOC_10788 [Thalassiosira oceanica]
Length = 307
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 28/49 (57%)
Query: 189 KDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
KD+R V L AA LG +DAL LG + GYGV+Q+ + F ++
Sbjct: 171 KDMRKAVKLWTEAAELGSLDALFNLGDSYEVGYGVKQDKEKAVEFYTES 219
>gi|119478881|ref|XP_001259474.1| MYND domain protein (SamB), putative [Neosartorya fischeri NRRL
181]
gi|294956658|sp|A1DDX0.1|MUB1_NEOFI RecName: Full=MYND-type zinc finger protein samB; AltName:
Full=Suppressor of anucleate metulae protein B
gi|119407628|gb|EAW17577.1| MYND domain protein (SamB), putative [Neosartorya fischeri NRRL
181]
Length = 606
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 555 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKDCQKAAW 595
>gi|367024287|ref|XP_003661428.1| hypothetical protein MYCTH_2300790 [Myceliophthora thermophila ATCC
42464]
gi|347008696|gb|AEO56183.1| hypothetical protein MYCTH_2300790 [Myceliophthora thermophila ATCC
42464]
Length = 620
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKL-RH 370
G+R C++ CG+ E +F +C C YCS+ CQ W RH
Sbjct: 568 GIRQCAYYQCGKWEEFTRQFAKCRRCRRTKYCSKECQKSAWAFHRH 613
>gi|326475651|gb|EGD99660.1| MYND domain-containing protein [Trichophyton tonsurans CBS 112818]
Length = 276
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDW 366
G+R C++ CG+ E + +F +C C YCS+ CQ W
Sbjct: 223 GIRQCAYYKCGKWEEFQRQFAKCRRCRRTKYCSKECQKSAW 263
>gi|320587213|gb|EFW99693.1| mynd domain containing protein [Grosmannia clavigera kw1407]
Length = 747
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 326 GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKADCAPAER 379
G+R C++ CG+ E +F +C C YCS+ CQ W H+ C A +
Sbjct: 695 GIRQCAYYQCGKWEEFTRQFAKCRRCRRTKYCSKECQKSAWAF-HRHWCVAASQ 747
>gi|393243884|gb|EJD51398.1| hypothetical protein AURDEDRAFT_57017 [Auricularia delicata
TFB-10046 SS5]
Length = 229
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 346 RRCSVCGAVNYCSRACQALDWKLRHKADCAP 376
+RC+ C V YCS+ CQ WK HK +C P
Sbjct: 17 KRCARCRLVTYCSKECQVTSWKSSHKRNCRP 47
>gi|325578342|ref|ZP_08148477.1| Sel1 repeat protein [Haemophilus parainfluenzae ATCC 33392]
gi|419803018|ref|ZP_14328196.1| Sel1 repeat protein [Haemophilus parainfluenzae HK262]
gi|419844813|ref|ZP_14368100.1| Sel1 repeat protein [Haemophilus parainfluenzae HK2019]
gi|325160078|gb|EGC72207.1| Sel1 repeat protein [Haemophilus parainfluenzae ATCC 33392]
gi|385188814|gb|EIF36287.1| Sel1 repeat protein [Haemophilus parainfluenzae HK262]
gi|386416739|gb|EIJ31231.1| Sel1 repeat protein [Haemophilus parainfluenzae HK2019]
Length = 214
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 55/120 (45%), Gaps = 6/120 (5%)
Query: 121 SAHRFLKLCADAGNVEACYTLGMIRFY---CLQNRGSGASLMAKAAISSHAQALYSLAVI 177
A ++ + A+ G+V+A + LGM+ Q+ KAA ++ YSL V+
Sbjct: 82 EAAKWFRKAAENGDVKAQHNLGMMYAKGEGVEQDYVEAVKWYRKAADQGGLRSQYSLGVM 141
Query: 178 QFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNIAEGRRFLVQA 237
+NG G + D +AA G+ A LG +DG GV+QN + +L +A
Sbjct: 142 YYNGVGVKQ---DYVEAAKWYRKAADKGYTMAQFNLGLMYRDGEGVKQNRTVAKEWLGKA 198
>gi|297742807|emb|CBI35523.3| unnamed protein product [Vitis vinifera]
Length = 723
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 14/76 (18%)
Query: 347 RCSVCGAVNYCSRACQALDWKLRHKADCAP-----AERWLDEEGNGDVDVDLN------- 394
RCS C AV YCS CQ W+ HK +C P A ++ D G G N
Sbjct: 78 RCSQCKAVRYCSGKCQIKHWRQGHKNECVPPTPTSAMQFNDGSGFGGKTTSENHDVLGSS 137
Query: 395 --DAEGGGEDNVVADS 408
D+ G D++ DS
Sbjct: 138 TPDSSKGLSDDISCDS 153
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.135 0.429
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,898,247,126
Number of Sequences: 23463169
Number of extensions: 301473882
Number of successful extensions: 681158
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 751
Number of HSP's successfully gapped in prelim test: 1522
Number of HSP's that attempted gapping in prelim test: 677190
Number of HSP's gapped (non-prelim): 4575
length of query: 408
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 263
effective length of database: 8,957,035,862
effective search space: 2355700431706
effective search space used: 2355700431706
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)