BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000132
(2109 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255554036|ref|XP_002518058.1| conserved hypothetical protein [Ricinus communis]
gi|223542654|gb|EEF44191.1| conserved hypothetical protein [Ricinus communis]
Length = 2833
Score = 2327 bits (6031), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1232/2186 (56%), Positives = 1544/2186 (70%), Gaps = 167/2186 (7%)
Query: 5 HRPRFRPVGGVGGEGTGGGQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNL------ 58
HRP FRP GG GG G PP + +PNFA QNPN +F N A L Q+L
Sbjct: 4 HRPPFRPGGGAGGGYYGQPPPPPL----NSDPNFALQNPNAYFHNLANTLYQSLQQQQNF 59
Query: 59 -TNIPF--QNPNFDIQNPNLFMQNANL-LTQNPNLP----------------PQQPPSSA 98
+ PF QN NF IQNPN Q++N L Q ++P P P +
Sbjct: 60 NSYFPFSLQNLNFPIQNPNFPFQHSNFSLPQQFSIPQTQQQQQQNRNFQIQNPNSPQQNL 119
Query: 99 CNQQRSQTQHP-------------------------------------PGASN------- 114
QQ++Q P P A N
Sbjct: 120 VQQQKAQNLPPETPKPVEEERIDQTAQHDSVQQQQQQHQQQKQPQESSPNAQNDSTQQHQ 179
Query: 115 -QQRPLTQQP---QSLPPQPQ------KLNKEFLERIDRAVVKARSELIMAGESVSSWKV 164
Q R L QQ Q L P+ K +E +E++D AV KA +L+ A ES+S+W+V
Sbjct: 180 QQSRKLLQQQPLHQMLFQNPKGKRKQLKHKQELVEKVDLAVEKAWQDLLAAAESISAWRV 239
Query: 165 SRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAIC 224
S++ L+ LQV+SW SLGF MQEVPSLH+L++TE +INAFIHCFV VR+ITSLYDLE AIC
Sbjct: 240 SQAALVTLQVESWGSLGFPMQEVPSLHRLILTEGKINAFIHCFVAVRRITSLYDLEVAIC 299
Query: 225 KSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIK 284
++EGIE+FEEL+LGP LR PL+ HYF V D V KI T D+I+ L EY T K KDI
Sbjct: 300 ENEGIEQFEELKLGPLLRHPLVLHYFSVSCDATEVLKITTEDIILTLHEYMGTFKEKDIT 359
Query: 285 VDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKR-- 342
DEFLDFI KKRS+ K LGVRIQ LGMHI FI+EA++S++ TLKK L ++ KR
Sbjct: 360 ADEFLDFIVKKRSVNGKGNLGVRIQGLGMHIKFIQEAKRSKNTTLKKCLSSMRAPSKRSG 419
Query: 343 ---RKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTY 399
K P+ SS+KK LDERFSAI +RV+SF+ +++DF KHIRF SSSSE E+S+D +
Sbjct: 420 TRCHKHPLLSSQKK-DLDERFSAISQRVESFALVHKDFQGKHIRFDSSSSEGEESNDSMH 478
Query: 400 E---CSNDISSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGE--INHQLTSAGNSSG 454
+ SN S+ L + +D+VS+CPYPS TEE+ RLGLKGE + Q + ++S
Sbjct: 479 DDTMTSNGERSHYSL-QNVNSTDKVSTCPYPSATEEMSRLGLKGESEVGSQSSRKRHTSN 537
Query: 455 QDDYIGSSK-----KKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQDDELNEAD 509
SK +KRK E T+SAP K L+ N+ K +++GD+T +E D
Sbjct: 538 PSRLRSRSKLDSWERKRKFEELSGTASAPPKLLKGNEEKHDIHSLKNGDKT-----DEVD 592
Query: 510 ISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCI 569
S SN +M FITTW++ACK +T++EV EKM QFY+P D R+ KR+K F S P +
Sbjct: 593 FSLSNNAMETFITTWRDACKEHTVTEVFEKMVQFYRPLD----GRHRKRIKWAFVSNPSV 648
Query: 570 GLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHT 629
GLLN+AV ++K GM DS+YD Q++ Q EL NT S EYESI++EP+E+ V + +
Sbjct: 649 GLLNVAVMAMKSGMLDSIYDAFQSVNQHELTNTFS----EYESIDVEPAEKHKSVVPQCS 704
Query: 630 V---QDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEF 686
+ Q + V+E++ K+++Y E D +N + + I L+KLC+CE WL +QF IKEF
Sbjct: 705 LLPTQSVTVDEIIGKITRYYELDQKFQSNDKLLLEDKFISLKKLCNCEFWLVDQFGIKEF 764
Query: 687 KSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWES 746
K LG+GEF FLEKHAS+L TELQKLF D ++ LEVS++QH L+ LVSQASNNLWES
Sbjct: 765 KFLGHGEFLMFLEKHASLLPTELQKLFAADICEKPPLEVSVLQHQLIFLVSQASNNLWES 824
Query: 747 EIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLS 806
E I+KQMIS LL +QFPLISFKI + GSME FL+TV +++N V+SKCV FS +LG
Sbjct: 825 ETISKQMISALLIKQFPLISFKIMENGSMEEFLQTVAQHKNNVLSKCVQFSAALLGEHYI 884
Query: 807 GDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFA 866
GD+ E+ +ET + T++G + ES+TS+ AIE+LLRAP L DL SWSHWD +FA
Sbjct: 885 GDML--REDHTVETAAVRTNSGQKMMAFESITSQSAIEVLLRAPMLCDLTSWSHWDLIFA 942
Query: 867 PSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSS 926
PSLGPL WLLNEVN KELLCLVT+DGKVIRID SA+VDSFLEAAL+GS FQTAVKLLS
Sbjct: 943 PSLGPLVEWLLNEVNAKELLCLVTKDGKVIRIDQSANVDSFLEAALQGSPFQTAVKLLSL 1002
Query: 927 FALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANI 986
+LAGGEK++PL LLKC+AR AF+V+FKN+ E+++V ++N +HG + N
Sbjct: 1003 LSLAGGEKHIPLSLLKCYARQAFDVIFKNHFENMDVQENRNYLLHGKAVDKAAN------ 1056
Query: 987 DNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAIL 1046
LSG+ K L + + +P ASRF LDCLGYLPSEFRSFAADVLLSG+ S KDAPSAIL
Sbjct: 1057 -TLSGQAHKNLFQINRVLPAASRFVLDCLGYLPSEFRSFAADVLLSGMHSVAKDAPSAIL 1115
Query: 1047 LECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGS- 1105
EC Q E R+MLHE+GLS+G++EWI DYH F ST ++D AAT L++GS
Sbjct: 1116 CECSQKE-RIMLHEIGLSIGLVEWIDDYHTFFSTISTDSFT-SFEPALGAATPVLSTGSR 1173
Query: 1106 ---------GCAEGSLFESVGADVHIEECGAICDTI-----CGEASDDGLGDCTTQTLPE 1151
C +G + D H EE TI G+A+ G + ++ E
Sbjct: 1174 YVQNTLDMYSCGDGKTNMHLAEDGHNEESTETSPTIQDAVVSGDATATGCAEESS----E 1229
Query: 1152 DKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
+ +DAAL++ESIRRDEFGL PNIS+ ES +LKKQHARLGRALHCLSQELYS+DSHFLL
Sbjct: 1230 SNKLKDAALVIESIRRDEFGLDPNISSTESTILKKQHARLGRALHCLSQELYSEDSHFLL 1289
Query: 1212 ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY 1271
ELVQNADDNIY +VEPTLTFILQESGIV+LNNEQGF A+NIRALCDVGNSTKK S GY
Sbjct: 1290 ELVQNADDNIYSGSVEPTLTFILQESGIVILNNEQGFLAQNIRALCDVGNSTKKASGTGY 1349
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDP 1331
IG+KGIGFKSVFRVTDAPEIHSNGFH+KFD SEGQIGFVLPT+VP ++D+F RL+S++
Sbjct: 1350 IGQKGIGFKSVFRVTDAPEIHSNGFHIKFDISEGQIGFVLPTVVPACDVDLFSRLVSRET 1409
Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
Q + K WNTCI LPFR+K SE AM MF+DLHPSLLLFLHRLQCIMFRNMLNDSL
Sbjct: 1410 GQKDKKHWNTCIVLPFRSKLSEETAMK----MFADLHPSLLLFLHRLQCIMFRNMLNDSL 1465
Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG 1451
+V+RK+I+ DGIIKVSCG+DKMTW VASQKL+A RP V+TTEIA+A TL+ES G+Y
Sbjct: 1466 LVMRKEILQDGIIKVSCGKDKMTWLVASQKLQAHASRPKVQTTEIAVAFTLEESENGDYY 1525
Query: 1452 PLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERS 1511
P L QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVD N PWN+WLL++FP LFVSAERS
Sbjct: 1526 PRLDQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDKNDPWNEWLLTKFPDLFVSAERS 1585
Query: 1512 FCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAP 1571
FC L CFR NP KA +VYMSFVPLVGEVHGFFSGLP+ I +LR ++CL+LEG+N P
Sbjct: 1586 FCALSCFRYNPGKAVAVYMSFVPLVGEVHGFFSGLPKAIALELRRTSCLLLEGDNCNMVP 1645
Query: 1572 PCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISS 1631
PC VLRGWN++A +LLPD LLQ+HLGLGFL K+I+LSDSLARALGI EYGP+IL++ ++
Sbjct: 1646 PCNVLRGWNEQARNLLPDGLLQEHLGLGFLDKNIILSDSLARALGIMEYGPEILIKFMTC 1705
Query: 1632 LCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGT 1691
L T +GL+SMGL WL+S LN LY + HSSG TDLIDNL++IPFIPLSDG
Sbjct: 1706 LSHTTSGLKSMGLGWLSSLLNTLYIMISHSSG--------PTDLIDNLRQIPFIPLSDGR 1757
Query: 1692 FSSVDEGTIWLHSD--CSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVD 1749
+SS+D GTIWLHSD + FDG LEAFP L AKLR V+PAL SAS D G + VD
Sbjct: 1758 YSSLDRGTIWLHSDILSAGFDGAQELEAFPQLYAKLRVVNPALFSASVAD----GTL-VD 1812
Query: 1750 NLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCH 1809
N MLLKIGVQQLSAH+IVKVH+LPA+S+E ++ +K LM DYLCFVM+HL+ CP+C
Sbjct: 1813 NSATMLLKIGVQQLSAHEIVKVHVLPALSNEKVSDRNKELMTDYLCFVMIHLQSSCPHCC 1872
Query: 1810 VEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITY 1869
+ER++I+SEL KAF+LTN G++RP+E P+HF K+FGNP+ +N LI+ +DI+W+E+D+TY
Sbjct: 1873 MERKYIISELHSKAFILTNFGYRRPAETPLHFSKDFGNPIDINKLINVMDIQWHEIDLTY 1932
Query: 1870 LKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM-WTKELLSPGS 1928
LKH +SLS GL+KWR FF+EIG+TDFVQV+Q++K+++D+ T KN+ +LL PGS
Sbjct: 1933 LKHSVNDSLSNGLMKWRVFFQEIGVTDFVQVIQIEKNISDLLQTVLKNVKCDADLLCPGS 1992
Query: 1929 AAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSF 1988
A DWES+EL +LS+L+ +R+C K+LLEILD +WDD +++K G++ SK + R+F
Sbjct: 1993 IARDWESSELAQILSILSKTGDRECCKYLLEILDRMWDDSFSEKATGYYNSKSSVAGRTF 2052
Query: 1989 QSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIG 2048
+S F+ I D+QW +S+MD+ELHYPKDLF+DCD VRSILG SAPY +PKV S KL+ DIG
Sbjct: 2053 KSCFLRSIHDVQWVVSTMDNELHYPKDLFNDCDVVRSILGSSAPYALPKVTSSKLLSDIG 2112
Query: 2049 LKTEVTIDDILEILKVWTRLEAPFMA 2074
KT+VT+DD L+ L+VW + E PF A
Sbjct: 2113 FKTKVTLDDALKFLRVWRKSETPFKA 2138
>gi|449433179|ref|XP_004134375.1| PREDICTED: uncharacterized protein LOC101204025 [Cucumis sativus]
Length = 2724
Score = 2142 bits (5549), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1118/2087 (53%), Positives = 1458/2087 (69%), Gaps = 70/2087 (3%)
Query: 21 GGGQPPSQPPPPHFNPNFAFQNPNI--FFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQ 78
G GQ + PP P+ + N N+ F +P QN+ N P QN F P+ F
Sbjct: 12 GSGQRQQRLPPQMQFPDAPYLNHNVPFFQTHPGFIPYQNV-NFPMQNTGFQFSPPSNFGL 70
Query: 79 NANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFL 138
+ Q+PN PG ++ QP+S +K K+ L
Sbjct: 71 H---FPQSPN---------------------PGVRLRKASTASQPES---STKKAGKD-L 102
Query: 139 ERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEA 198
+I+ AV A + L+ AGES+S+WKV +S L LQVDSW SLG++MQE+ LH++MV E
Sbjct: 103 SKINHAVATAHTNLLEAGESISAWKVVQSALSILQVDSWDSLGYRMQEISDLHRIMVREG 162
Query: 199 RINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDV 258
+IN+FIHCFVGVR+IT+LY+LEEAICK EG+ +F+EL LGPFLR PL+ HYF V S+V
Sbjct: 163 KINSFIHCFVGVRRITTLYELEEAICKYEGVGEFKELRLGPFLRHPLVLHYFSVDSEVTE 222
Query: 259 VFKIATNDVIVCLSEYTDTHKA--KDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN 316
VFKI D+I L E+ D K I V++FL+FI++K S+ E LG+R+Q LG +++
Sbjct: 223 VFKITGEDIISFLFEFMDVDACSNKKIAVEKFLEFISRKLSVEKWEMLGIRVQKLGDYVS 282
Query: 317 FIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINED 376
+++AR+S+ T K+L + H + KRP+FS +K++ LDERFSAI +RV+SFSS+++D
Sbjct: 283 VVKKARRSEADTFSKYLTKYDEKHGK-KRPLFSLQKEK-LDERFSAISQRVESFSSVHKD 340
Query: 377 FGAKHIRFVSSSSEDEDSDDCTYE---CSNDISSNVQLPSQI-KGSDRVSSCPYPSVTEE 432
F KH+RF SSSS DE S D TYE D S++++L QI K +DR SSCPYPS TEE
Sbjct: 341 FCGKHVRFDSSSSGDEGSGDSTYEKKKVDTDSSNDLKLSLQINKSADRPSSCPYPSATEE 400
Query: 433 LKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR-A 491
+ RLGLKGE++ T G+ Q IGS KRK + + K K+ +
Sbjct: 401 MMRLGLKGEVSGHATPIGS---QKHCIGSGTLKRKRRSRSRGHTTSGTSTSSKKFKEDCS 457
Query: 492 LPIESG-DQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
LP++ +++ EA+ +N S+RMF+T WK+ C++ T++EV+++M Y
Sbjct: 458 LPVKDDFGNSEEGSGFEAEYEITNNSLRMFVTMWKDGCRDMTVAEVIKRMLDHYGMN--- 514
Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
R R++ M SSYP IGLLN+AV+SIK GMWDS+YD+LQ I EL NT Y
Sbjct: 515 --LRSTVRVRSMLSSYPFIGLLNVAVSSIKNGMWDSIYDSLQIINLPELTNTNVKKQPVY 572
Query: 611 ESIEIEPSEQVAVV-NTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRK 669
+ I++ PS + A++ + + DI VE+++ KV ++++F+ I ++ + + I +L++
Sbjct: 573 DCIDVGPSAEGALIKHVPKSTHDITVEDILNKVGQHIKFEQEIRSDGRLLMENRIQILKQ 632
Query: 670 LCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQ 729
LCSCE WL EQ+ K+FKSLG+G+FF+FLEKHASML EL K + +RS LE ++Q
Sbjct: 633 LCSCEFWLVEQYSAKDFKSLGHGDFFSFLEKHASMLPDELHKFLVPEISERSPLEACMLQ 692
Query: 730 HLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEV 789
LV LVSQA NNL E+EII+KQMI LL +QFPLI FK+ + G +E F+E VG+ +N V
Sbjct: 693 RQLVTLVSQACNNLSENEIISKQMIYNLLTQQFPLIDFKLTENGPLEDFMELVGQQKNSV 752
Query: 790 MSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIEILLRA 849
+SKCV FS ++LG D +N L TS G +SVTS+DAI++L+RA
Sbjct: 753 VSKCVTFSVSLLGGSHLRDSLATMDNDSLGATSFSAEAGQGLGAIKSVTSEDAIKVLVRA 812
Query: 850 PFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLE 909
P L DLN WSHWD LFAP+LGPL WL EVN + +C+VT++GKVIRIDH+A+ DSFLE
Sbjct: 813 PMLLDLNLWSHWDLLFAPALGPLVPWLQKEVNTENFMCMVTKEGKVIRIDHTATADSFLE 872
Query: 910 AALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCR 969
AAL+GS F TAVKLLS FAL GGEK V L LLK HA AFEV+ KN+VE+IE+ +N
Sbjct: 873 AALQGSPFHTAVKLLSIFALLGGEKYVLLSLLKHHASRAFEVIMKNSVENIEMF--ENWG 930
Query: 970 MHGNVLCGRQNF-DVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAAD 1028
+ QNF + NLS E++K++ KA+ + SRFF+DCLGY+P EFR AA+
Sbjct: 931 QGLEKVAFHQNFIEQVAAGNLSLELKKKIDMRNKAISLLSRFFVDCLGYIPVEFRYLAAN 990
Query: 1029 VLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMP 1088
+LLSG+ S +KDA SAIL EC + E RLMLHE+GLSLG+ EWI DYH S+ +SDL
Sbjct: 991 ILLSGITSVVKDAASAILHECWKPEQRLMLHEIGLSLGVPEWIQDYHTVSSSASSDLFTD 1050
Query: 1089 CVVT-----CTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEASDDGLGD 1143
+ N GL + +E + S+ +V E+ + S+D G
Sbjct: 1051 ACLNDRSEINRNVHRDGLLTKYSTSEQNASFSIEENVFNEKLSVSSANCTAKTSNDANGL 1110
Query: 1144 CTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELY 1203
E +DA I++ IRRDEFGL ++ E+ ML+KQHARLGRALHCLSQELY
Sbjct: 1111 SCMSLASEPDGNKDAVEIIQCIRRDEFGLDLDLPISETGMLRKQHARLGRALHCLSQELY 1170
Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
SQDSHFLLELVQNADDNIYP++VEPTL FI +ESGIVVLNNE+GFSA+NIRALCDVGNST
Sbjct: 1171 SQDSHFLLELVQNADDNIYPQSVEPTLAFIFEESGIVVLNNEEGFSAKNIRALCDVGNST 1230
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMF 1323
KKGS+AGYIG+KGIGFKSVFR+TDAPEIHSNGFHVKFD SEGQIGFVLPT++ P N++++
Sbjct: 1231 KKGSNAGYIGKKGIGFKSVFRITDAPEIHSNGFHVKFDISEGQIGFVLPTIISPCNVNLY 1290
Query: 1324 CRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAM-NNIVDMFSDLHPSLLLFLHRLQCIM 1382
+L + ++ WNTCI LPFR+K S G+ + NNIV+MF+DLHPSLLLFLHRLQCI
Sbjct: 1291 GKLATSASDHEDTNIWNTCIVLPFRSKLSGGVNLINNIVNMFADLHPSLLLFLHRLQCIK 1350
Query: 1383 FRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL 1442
RN++++SL+V+RK+IVG+GII+VS GE+KMTW V SQKL+A VIR DV++TEI++A TL
Sbjct: 1351 IRNLIDNSLIVMRKEIVGNGIIRVSHGEEKMTWLVVSQKLKADVIRHDVQSTEISIAFTL 1410
Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
E G PLL+QQPVFAFLPLR YGLKFI+QGDFVLPSSREEVDG+SPWNQWLLSEFP
Sbjct: 1411 HEEENGVISPLLHQQPVFAFLPLRMYGLKFIIQGDFVLPSSREEVDGDSPWNQWLLSEFP 1470
Query: 1503 ALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
LFVSA SFC LPCF P KA S YMS++PL+GEVHGFFS LPR+I+SKLRMSNCL+L
Sbjct: 1471 GLFVSAVESFCSLPCFESCPGKAISAYMSYIPLIGEVHGFFSSLPRLIISKLRMSNCLLL 1530
Query: 1563 EGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGP 1622
EG N+WAPPCKVLRGWN++A +LLPD LL+++LGLGFL KDI+LSDSLARALGIEEYGP
Sbjct: 1531 EGKENEWAPPCKVLRGWNEQALTLLPDNLLREYLGLGFLHKDIILSDSLARALGIEEYGP 1590
Query: 1623 KILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRI 1682
KIL+Q +SSLC+ N L+SMGL WL S L+ L+ + SSGQ++L+ DLI +LQ++
Sbjct: 1591 KILVQFMSSLCQKYNSLKSMGLFWLGSCLSVLHNMLLQSSGQTTLELERNADLIRSLQKV 1650
Query: 1683 PFIPLSDGTFSSVDEGTIWLHSDCS--VFDGGFGLEAFPNLCAKLRTVSPALLSASAVDK 1740
P IPLSDGT+SSV EGTIWLHSD S DG +GLEAFP L +K+R V PA LS +VD
Sbjct: 1651 PLIPLSDGTYSSVAEGTIWLHSDSSNATVDGKYGLEAFPYLNSKIRVVCPAFLSLFSVDN 1710
Query: 1741 SSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMH 1800
S + V SV N++ ML +IGVQ+LSAH+I+K HI+PAI++E+ NG+K LM +Y+CFVM H
Sbjct: 1711 SQIDVPSVGNISWMLYRIGVQRLSAHEIIKEHIIPAITNESNLNGNKILMTEYVCFVMTH 1770
Query: 1801 LEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDI 1860
L CP CH++R FI+SELR KAF+LTNHG+KR E+P+HF KE+GNP+ +N L+ +++
Sbjct: 1771 LLSSCPECHIDRGFIISELRTKAFILTNHGYKRLVEVPVHFSKEYGNPIDLNKLL-SVEM 1829
Query: 1861 KWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM-W 1919
W+EV TYLKHP SLSCGL KWR FF+EIGI DFV VV+V++ +A++ H N W
Sbjct: 1830 NWHEVADTYLKHPVTNSLSCGLTKWRNFFQEIGINDFVHVVEVNRSIANMPHDIMVNRKW 1889
Query: 1920 TKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKS 1979
E++ G+ DWES EL HLL++L T+ N++ K+LLE+LDTLW+D +DK++G S
Sbjct: 1890 DPEIIFSGAMVKDWESPELTHLLTMLATHGNKESCKYLLEVLDTLWNDHLSDKVVGCCIS 1949
Query: 1980 KPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVK 2039
K + FQS+F+N ICD QW +SS+D + HYPKDL++DCDAVRSILG SAPY +PKV+
Sbjct: 1950 KSGDSSKQFQSAFMNSICDAQWVVSSVDKKGHYPKDLYYDCDAVRSILGASAPYALPKVQ 2009
Query: 2040 SEKLVCDIGLKTEVTIDDILEILKVWTRLEAPF------MARYYLLV 2080
S KLV DIG KT V++DD ILKVW R E PF M ++Y +
Sbjct: 2010 STKLVRDIGFKTRVSLDDTFNILKVW-RTEKPFKTSISQMCKFYTFL 2055
>gi|356528304|ref|XP_003532744.1| PREDICTED: uncharacterized protein LOC100780184 [Glycine max]
Length = 2785
Score = 2123 bits (5502), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1111/2029 (54%), Positives = 1423/2029 (70%), Gaps = 110/2029 (5%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
+E DRA+ A L+ AG+SVS+W VS++ LL LQVDSW++LG +MQ+VPSLH+LM+TE
Sbjct: 81 IEHADRAIANACRALLAAGDSVSAWTVSQNALLTLQVDSWNTLGIKMQQVPSLHRLMMTE 140
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++NAF+HCFVGV++ITSLYDLE AICK+EG++ FEEL LGPFLR PL+ HYF ++SDV
Sbjct: 141 GKVNAFVHCFVGVQRITSLYDLEVAICKNEGVDDFEELGLGPFLRHPLVIHYFSLRSDVT 200
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKD-IKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN 316
V+KI T ++I L E+ D ++ + IKV++FLDFIA KR + KE LG+RIQNLGMHI
Sbjct: 201 QVYKITTEEIIQLLIEFLDASRSNEFIKVEQFLDFIANKRLVECKEWLGIRIQNLGMHIY 260
Query: 317 FIREARKSQDVTLKKFLK----------ELQPHHKRRKRPIFSSEKKRQLDERFSAICE- 365
IREAR S+ +++ K +L +R R + S ++ + SAI E
Sbjct: 261 AIREARNSEQSAMREVRKLGQSARREARKLGQSARREARKLGQSARREARNSEQSAIQEA 320
Query: 366 --------------------------------RVKSFSS-----INEDFGA--------- 379
R ++ S ++E F A
Sbjct: 321 RNSEQSAVQEASNSEQSAYEKCLESFLKNGKFRYRTIPSSQKKQLDERFNAITQRVESFS 380
Query: 380 --------KHIRFVSSSSEDEDSDDCTYECSNDISSNVQL-PSQ--IKGSDRVSSCPYPS 428
KH RF+SS+SEDEDSD T E SN+I Q PS + S+RVSSCPYPS
Sbjct: 381 PVKKSFCGKHKRFMSSASEDEDSDSSTDEQSNNIIKGSQSNPSSQFTRSSERVSSCPYPS 440
Query: 429 VTEELKRLGLKGEI-NHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKA 487
TEE RLG++ ++ H L NS+ + + +KKRK EN+ T SAP K +RNK
Sbjct: 441 ATEEKARLGVRSDMAGHSLV---NSNLKKGFSEQPRKKRKFENATSTRSAPYKLRKRNKL 497
Query: 488 KQRALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPT 547
PI +G++T+ + D+S SN+S++MF+TTWK AC + ++EVLE M QF
Sbjct: 498 GV-VTPINTGNKTKVSTNIDEDLSISNDSLQMFVTTWKMACSEHKVAEVLEMMLQF---- 552
Query: 548 DPKKAARYFKR-MKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGN 606
K R+ KR +K +FSSYP IGLLN AV+SIK GM +++YDT QAI L N+ +
Sbjct: 553 --SKVNRFQKRKIKNLFSSYPFIGLLNAAVSSIKSGMRNNIYDTFQAIIDNGLGNSPTKG 610
Query: 607 SLEYESIEIEPSEQVAVVNTKHTVQD---IQVEEVMKKVSKYLEFDNSILNNAQSPVTKI 663
S EY++I++ P ++ V TK ++ I ++V++K+ Y + N I N+ + +
Sbjct: 611 S-EYDTIDVGPGQENVPVITKDNTENTKCISSDDVIRKIGTYFDHGNDINRNSNDSLVQY 669
Query: 664 -IILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSS 722
I+L RK CSCE W+ EQF +K+F SLGYG+F +FLEKH ++L EL KL DT + SS
Sbjct: 670 RIMLWRKFCSCENWVAEQFGMKKFDSLGYGDFLSFLEKHPNLLPHELLKLLVGDTCENSS 729
Query: 723 LEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETV 782
+ + L+ LVSQA + LWE+E ITKQMIS LL RQFP I+F++ + GS+ L+TV
Sbjct: 730 FRACMSSNQLIALVSQALSGLWENETITKQMISMLLMRQFPSINFELVENGSLVDLLDTV 789
Query: 783 GKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDA 842
+ + V SKCV+FS TI+ +GD S +N+ E + + T + K +E+V +K+A
Sbjct: 790 KGHTSSVTSKCVVFSATIIEKNYNGDSSSDRDNNWSEIPTDRSETSHK-KSTETVIAKNA 848
Query: 843 IEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSA 902
IE+LL+AP LSDL+ WSHWD FAP LGP WLLN+VN KEL CLVTRDGKVIRIDHSA
Sbjct: 849 IEVLLKAPMLSDLSKWSHWDLRFAPFLGPFISWLLNDVNTKELFCLVTRDGKVIRIDHSA 908
Query: 903 SVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEV 962
++DSFLEAA++GSSFQTAV LLS +L GGEK VPL LLKCH+ HAFEVMF+N+VED+EV
Sbjct: 909 TLDSFLEAAVQGSSFQTAVHLLSLISLVGGEKYVPLSLLKCHSCHAFEVMFRNSVEDVEV 968
Query: 963 INSQNCRMHGNV--LCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
N N +H +V L + + + K + K K + SRF LDCLG LP+
Sbjct: 969 SNDGNA-LHQSVEALSKTKFLTEISTAKMRSVFSKHMHKVSKVASILSRFVLDCLGNLPA 1027
Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
EF SFA+DVLLSG+QS KDA S IL EC E RLMLHE+GLSLGI EWI+DYHA S
Sbjct: 1028 EFHSFASDVLLSGMQSVFKDAASTILCECSNMEQRLMLHEIGLSLGISEWINDYHALISN 1087
Query: 1081 GTSDLLMPCV-VTCTNAATSGLNSG----------SGCAEGSLFESVGADVHIEECGAIC 1129
+SD+ C V+C AT+ +N+ S E ++ S+ IE C I
Sbjct: 1088 NSSDI--HCARVSCLKDATTDINTSLKLDQVTLDKSPIPEANMVTSLVPHRLIEGCTEII 1145
Query: 1130 DTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHA 1189
+T+ E S+D C + E DA+ ++ESIRRDEFGL ++S+++S MLKKQHA
Sbjct: 1146 ETVDPEKSNDESNTCCLGNSFQHVEDMDASRLIESIRRDEFGLDSSLSDIDSCMLKKQHA 1205
Query: 1190 RLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFS 1249
RLGRALHCLSQELYSQDSHF+LELVQNADDN YPENVEPTLTFIL++SGIVVLNNE+GFS
Sbjct: 1206 RLGRALHCLSQELYSQDSHFILELVQNADDNNYPENVEPTLTFILRDSGIVVLNNERGFS 1265
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
A+N+RALCDVGNSTKKGS+AGYIG+KGIGFKSVFRVTDAPEIHSNGFHVKFD SEGQIGF
Sbjct: 1266 AQNMRALCDVGNSTKKGSTAGYIGKKGIGFKSVFRVTDAPEIHSNGFHVKFDISEGQIGF 1325
Query: 1310 VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
VLPT+VPP +I + R+ S D + WNTCI LPFR+ SEG+AMN+++ MFSDLHP
Sbjct: 1326 VLPTVVPPCDIGVLRRMASTDTELCDDNPWNTCILLPFRSHLSEGMAMNSVLSMFSDLHP 1385
Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRP 1429
SLLLFLHRL+CI RN+LND+L V++K+I GDGIIKVS G++K+ WFV SQKL+ IR
Sbjct: 1386 SLLLFLHRLKCIKLRNLLNDTLTVMKKEISGDGIIKVSHGKEKIVWFVVSQKLQTNSIRF 1445
Query: 1430 DVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG 1489
DV+TTEI++A TLQES+ G Y P QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG
Sbjct: 1446 DVQTTEISMAFTLQESDNG-YIPCSDQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG 1504
Query: 1490 NSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRM 1549
+SPWNQWLLSE+P LFV A+R FC+LPCFR P K S +MSFVPLVGEVHGFFS LPR+
Sbjct: 1505 DSPWNQWLLSEYPNLFVRAQREFCELPCFRSEPGKGLSAFMSFVPLVGEVHGFFSSLPRL 1564
Query: 1550 ILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSD 1609
I+SKLRM NCL+++G+NN+WAPPCKVLRGW ++ +L+PD +L +HLGL +L ++IVLSD
Sbjct: 1565 IISKLRMMNCLLVDGDNNEWAPPCKVLRGWTEQVRNLIPDNMLLEHLGLIYLDRNIVLSD 1624
Query: 1610 SLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQS 1669
LARALGIEE+GP IL++++SSLC T++GL SM +SWLAS LN L F+SSG +
Sbjct: 1625 ELARALGIEEFGPNILVRVLSSLCHTKSGLISMDMSWLASCLNILSVTMFNSSGSVPINF 1684
Query: 1670 GVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSD--CSVFDGGFGLEAFPNLCAKLRT 1727
++ D+ NLQ++PFIPLSDGT+SSVDEGTIWLH + + FDG +EAFPN+CAKLRT
Sbjct: 1685 EMK-DVQKNLQKMPFIPLSDGTYSSVDEGTIWLHFNHLNTGFDGEHKIEAFPNICAKLRT 1743
Query: 1728 VSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDK 1787
VSP L SAS+ SL V +DN+ R+L IGVQQLS HD+VK+HILPA+SDET AN ++
Sbjct: 1744 VSPFLFSASS-GTPSLNVTFLDNVTRLLQSIGVQQLSVHDVVKLHILPALSDETMANKNR 1802
Query: 1788 NLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGN 1847
LM +Y+CFVM+HL C +C +ERE I+SE R K+ +LTN+GFK P+EIPIHF FGN
Sbjct: 1803 VLMIEYVCFVMLHLNSSCSDCFIEREHIISEFRCKSLLLTNYGFKCPAEIPIHFCTGFGN 1862
Query: 1848 PVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDV 1907
PV+ ML + ++W+EVDI+YL HP ES+S L+KWR FF++ GITDF QVVQVDK V
Sbjct: 1863 PVTPKMLADSVSMRWHEVDISYLSHPVNESVSSALIKWRDFFEKFGITDFAQVVQVDKSV 1922
Query: 1908 ADISHTGFKN-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWD 1966
DI FK MW + L+S S DWES+E+V L+SLL+ + N + K+LLE+LDTLWD
Sbjct: 1923 VDICDVTFKQMMWDRGLISAESIVKDWESSEIVQLVSLLSKSGNLENCKYLLEVLDTLWD 1982
Query: 1967 DCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSI 2026
CY++K G+F K GD F+S+FI +CDIQW +S+MDDELHYPKDLF+DC+ VR +
Sbjct: 1983 VCYSNKTTGYFYLKSVGDGHPFKSTFICSLCDIQWVVSTMDDELHYPKDLFYDCETVRML 2042
Query: 2027 LGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVWTR-LEAPFMA 2074
LG APY VPKVKSE+LV D G KT VT+DDI ++LK W + + PF A
Sbjct: 2043 LGDFAPYAVPKVKSERLVKDFGFKTRVTLDDIFDVLKAWRKSSKTPFKA 2091
>gi|357518761|ref|XP_003629669.1| hypothetical protein MTR_8g085280 [Medicago truncatula]
gi|355523691|gb|AET04145.1| hypothetical protein MTR_8g085280 [Medicago truncatula]
Length = 2812
Score = 2105 bits (5453), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1128/2104 (53%), Positives = 1450/2104 (68%), Gaps = 106/2104 (5%)
Query: 45 IFFPN---PALQLLQNLTNIP----FQNPNFDIQNPNLFM---QNANLLTQNPNLPPQQP 94
F+PN P+ +L N P F P NP F QN + P P
Sbjct: 22 AFYPNYYSPSHNF--HLPNSPWPHSFPPPQ---HNPYAFAPPPQNPPGVVPPPQNPYAHR 76
Query: 95 PSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFLERIDRAVVKARSELIM 154
P + + + P S + P QP S P Q K+ +++ + A KA EL+
Sbjct: 77 PQNPPPSTSTAPPNKPSGSAPRPP--NQPDSTPRQQNP--KQAIDKAENASSKACRELLA 132
Query: 155 AGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQIT 214
AG+SVS+WKVS+ LL L+VDS +SLG +MQ+VP+LH+LM+TE ++NAF+HCFVGVR+IT
Sbjct: 133 AGDSVSAWKVSQKALLTLKVDSLNSLGIKMQQVPTLHRLMITEGKVNAFVHCFVGVRRIT 192
Query: 215 SLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEY 274
SLYDLE AICK+EG++ FEEL LGPFLR PL+ HYF V+S+V V KI + ++I LSE+
Sbjct: 193 SLYDLELAICKNEGVDSFEELGLGPFLRHPLVIHYFSVRSNVAEVCKITSEEIIQFLSEF 252
Query: 275 TDTHKAKD-IKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFL 333
D KAK + V+EFL+FIAKKRS+ S E LG+RIQNLGMHI I+EARKS+ TL+K L
Sbjct: 253 LDVSKAKAVVGVEEFLEFIAKKRSVESMELLGIRIQNLGMHIAAIKEARKSEQSTLEKCL 312
Query: 334 KELQPHH-KRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDE 392
K L+ K KRPI SS+KK QLDERFS I +RV+SFSS+ + F KHIRF SSSSEDE
Sbjct: 313 KTLRSKSGKLGKRPISSSQKK-QLDERFSTIAQRVESFSSVEKSFCGKHIRFTSSSSEDE 371
Query: 393 DSDDCTYECSND--ISSNVQLPSQIKGS--DRVSSCPYPSVTEELKRLGLKGEINHQLTS 448
SD T ND I ++ PS G +R+SSCPYPS TEE+ RL +KG+ Q S
Sbjct: 372 GSDYYTDNDQNDSIIMNSWSNPSSQFGRSLERMSSCPYPSATEEMARLVVKGD--KQGGS 429
Query: 449 AGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQDDELNEA 508
N S ++++ +KKRK E P K K P ++G+ T+ + ++E
Sbjct: 430 LSNGSLKNEFTEPPRKKRKSE-LDLPDPLPFKHFNFKYKKVDPTPTKNGNATEINTMDEC 488
Query: 509 DISFSNESMRMFITTWKEAC------------------------KNNTMSEVLEKMFQFY 544
+S ++++++MF+TTWKEAC K+N + ++E M QFY
Sbjct: 489 -LSITDDALQMFVTTWKEACLEHNVGESSLPKGIRKVFTEPTHQKDNYILLMVEMMLQFY 547
Query: 545 KPTDPKKAARYFKRMKLMFSSYP-CIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTV 603
PK ++++++F YP IGLLN AV++IK GMW+S+YDT Q EL+N+
Sbjct: 548 G-VKPKGK----RKIRMLFVEYPFLIGLLNAAVSAIKSGMWNSIYDTFQVFNHSELSNSP 602
Query: 604 SGNSLEYESIEIEPS-EQVAVV--NTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPV 660
+ +S E+E+I++ PS E V +V ++ + I E+V+ K+ Y + DN + N+ V
Sbjct: 603 TKSS-EFETIDVGPSLENVPLVSKDSAENTKCISAEDVVGKIGMYFDLDNEVYRNSDWQV 661
Query: 661 TKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDR 720
I+ LRK C+CE+WL EQF +K F SLG+G+ +FLE + + L EL KL D +
Sbjct: 662 KYKIMFLRKFCNCESWLAEQFGVKNFNSLGHGDLLSFLENNVNQLPHELLKLLGGDMCEN 721
Query: 721 SSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLE 780
S+ + + + LV L+SQA +LWE+E +TKQ+IS LL RQFP I F+ + GS+E L+
Sbjct: 722 STFKACMSTNELVALLSQAICSLWENETVTKQIISMLLMRQFPSIGFEFLESGSLEDLLD 781
Query: 781 TVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSK 840
TV ++++ V SKCV+FS ++ D +N+L T+ + G + K SE++T+K
Sbjct: 782 TVREHKSNVTSKCVVFSAAMIEEHCDVDSLRDGDNNLSGITTDTSEIGHKTKSSETITAK 841
Query: 841 DAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDH 900
AIE+LL++P LSDL+ WSHWD +FAPSLG L WLLN+VN +ELLCLVTRDGKVIRIDH
Sbjct: 842 KAIEMLLKSPMLSDLSKWSHWDLMFAPSLGSLISWLLNDVNSEELLCLVTRDGKVIRIDH 901
Query: 901 SASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDI 960
SA++ SFLEAA++GSSF+TAV LLS +L GG++ VPL LLK A AFEVMF+N +EDI
Sbjct: 902 SATLKSFLEAAVQGSSFRTAVSLLSLISLVGGKRKVPLSLLKRDACSAFEVMFRNFLEDI 961
Query: 961 EVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
EV + +N R L + + +S E K L K KAV + SRF LDCLGYLP+
Sbjct: 962 EVCDDKNARQSEEALRKTKILTEVSTAKMSDEFGKHLHKVNKAVSILSRFVLDCLGYLPA 1021
Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
EF SFAADVLLSG++S KDA +AIL EC E LMLHE+GLSLGI EWI+DYHAF S
Sbjct: 1022 EFHSFAADVLLSGMRSVFKDAAAAILCECRNMEQHLMLHEIGLSLGITEWINDYHAFISN 1081
Query: 1081 GTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECG------AICDTICG 1134
TSD C+ ++GL G G + S V + CG I T+ G
Sbjct: 1082 DTSD-HASCLKDAKTEISTGLKHGQGILDNSDVPEVNMVTSLVPCGLNEICSEISQTVDG 1140
Query: 1135 EASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRA 1194
E S+D + ++ + D+ L++ESIRRDEFGL P++S+++S MLKKQHARLGRA
Sbjct: 1141 EKSNDESMTSHLEDSFQNGKDVDSTLVIESIRRDEFGLDPSLSDIDSCMLKKQHARLGRA 1200
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIR 1254
LHCLSQELYSQDSHF+LELVQNADDN YPENVEPTL FILQ+SGIVVLNNE+GFSA+N+R
Sbjct: 1201 LHCLSQELYSQDSHFILELVQNADDNNYPENVEPTLAFILQDSGIVVLNNERGFSAQNMR 1260
Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
ALCDVGNSTKKGSS GYIG+KGIGFKSVFRVTDAPEIHSNGFHVKFD SEGQIGFVLPTL
Sbjct: 1261 ALCDVGNSTKKGSSTGYIGKKGIGFKSVFRVTDAPEIHSNGFHVKFDISEGQIGFVLPTL 1320
Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
VPP +I + R+ WNTCI LPFR+ S+G MNNI+ MFSDLHPSLLLF
Sbjct: 1321 VPPCDIGLLRRMAFTGTDSYGDNPWNTCIMLPFRSHLSDGAVMNNIMTMFSDLHPSLLLF 1380
Query: 1375 LHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
LHRL+CI RN+LND+++V++K+I DGIIKVS G+++M WFV SQKL+ IR DV+TT
Sbjct: 1381 LHRLKCIKLRNLLNDTIIVMKKEISEDGIIKVSHGKERMAWFVVSQKLQTNSIRFDVQTT 1440
Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWN 1494
EI++A TLQES++G Y P L QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG+SPWN
Sbjct: 1441 EISMAFTLQESDDG-YSPCLDQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGDSPWN 1499
Query: 1495 QWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKL 1554
QWLLSE+P LFV A R FC+LPCFR P K S +MSF+PLVGEVHGFFS LPR+I+SKL
Sbjct: 1500 QWLLSEYPNLFVKAVREFCELPCFRSEPGKGLSAFMSFIPLVGEVHGFFSTLPRLIISKL 1559
Query: 1555 RMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARA 1614
RM NCL++EG+N WA PCKVLRGW ++ LLPD +L +HLGL +L K+++LSD+LARA
Sbjct: 1560 RMMNCLLVEGDNKGWASPCKVLRGWTEQVRCLLPDEILLEHLGLRYLDKNVLLSDTLARA 1619
Query: 1615 LGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETD 1674
LGIEE+GP +L++++SSLC T+N L SM +SWLAS+LN LY + F SSG S+ ++ D
Sbjct: 1620 LGIEEFGPSVLVRVMSSLCYTKNWLISMNMSWLASFLNTLYVLMFDSSGTISINFEIKDD 1679
Query: 1675 LIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDC--SVFDGGFGLEAFPNLCAKLRTVSPAL 1732
++ L++ PFIPLSDGT+SSVDEGTIWL S+ + FDG +EAFPNL AKLRTVSP+L
Sbjct: 1680 ILKRLKKTPFIPLSDGTYSSVDEGTIWLQSNTFNTGFDGEHKIEAFPNLFAKLRTVSPSL 1739
Query: 1733 LSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMAD 1792
LSA A D SSL V S+DN+ R+L IGVQQLSAHD+VK+HILP +SDE AN +K LM +
Sbjct: 1740 LSA-ASDTSSLNVTSLDNVTRLLQTIGVQQLSAHDVVKLHILPVLSDEAMANKNKMLMIE 1798
Query: 1793 YLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVN 1852
Y+CFVM++L+ C +C +RE I+SELR K+ +LT+ GFK PS+IPIHF FGNPV+
Sbjct: 1799 YICFVMLYLKSTCSDC--DREDIISELRYKSLLLTDCGFKCPSKIPIHFCPGFGNPVTPK 1856
Query: 1853 MLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISH 1912
+L ++++W+EVDI+YL+HP ES+S L+KWR+FF+EIGITDF Q+VQVDK DI
Sbjct: 1857 ILADAVNMRWHEVDISYLQHPVNESVSSSLIKWREFFEEIGITDFAQIVQVDKTAVDICD 1916
Query: 1913 TGFKN-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTD 1971
FK MW + L+S S DWES E+V L SLL+ + N+ K+ LE+LDTLWD CY+D
Sbjct: 1917 ATFKQVMWDRGLISAESIVKDWESPEIVQLGSLLSKSGNQGNCKYFLEVLDTLWDACYSD 1976
Query: 1972 KIMGFFKSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSA 2031
K G F SK GD F+S+FI+ +CDI+W +S++DDELHYPKDLFHDC+AVR LG A
Sbjct: 1977 KARGCFYSKSVGDGHPFKSTFISNLCDIRWVVSTLDDELHYPKDLFHDCEAVRQTLGTFA 2036
Query: 2032 PYIVPK----------------------------VKSEKLVCDIGLKTEVTIDDILEILK 2063
PY VPK VKSE+LV DIGLKT VT+DDIL+ILK
Sbjct: 2037 PYAVPKVSCFVHLCDTVVGNIYGLLTWVSGLLSVVKSERLVNDIGLKTRVTLDDILDILK 2096
Query: 2064 VWTR 2067
W +
Sbjct: 2097 AWRK 2100
>gi|449520493|ref|XP_004167268.1| PREDICTED: uncharacterized LOC101204025, partial [Cucumis sativus]
Length = 2143
Score = 2096 bits (5430), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1102/2087 (52%), Positives = 1436/2087 (68%), Gaps = 105/2087 (5%)
Query: 21 GGGQPPSQPPPPHFNPNFAFQNPNI--FFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQ 78
G GQ + PP P+ + N N+ F +P QN+ N P QN F P+ F
Sbjct: 12 GSGQRQQRLPPQMQFPDAPYLNHNVPFFQTHPGFIPYQNV-NFPMQNTGFQFSPPSNFGL 70
Query: 79 NANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFL 138
+ Q+PN PG ++ QP+S +K K+ L
Sbjct: 71 H---FPQSPN---------------------PGVRLRKASTASQPES---STKKAGKD-L 102
Query: 139 ERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEA 198
+I+ AV A + L+ AGES+S+WKV +S L LQVDSW SLG++MQE+ LH++MV E
Sbjct: 103 SKINHAVATAHTNLLEAGESISAWKVVQSALSILQVDSWDSLGYRMQEISDLHRIMVREG 162
Query: 199 RINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDV 258
+IN+FIHCFVGVR+IT+LY+LEEAICK EG+ +F+EL LGPFLR PL+ HYF V S+V
Sbjct: 163 KINSFIHCFVGVRRITTLYELEEAICKYEGVGEFKELRLGPFLRHPLVLHYFSVDSEVTE 222
Query: 259 VFKIATNDVIVCLSEYTDTHKA--KDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN 316
VFKI D+I L E+ D K I V++FL+FI++K S+ E LG+R+Q LG +++
Sbjct: 223 VFKITGEDIISFLFEFMDVDACSNKKIAVEKFLEFISRKLSVEKWEMLGIRVQKLGDYVS 282
Query: 317 FIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINED 376
+++AR+S+ T K+L + H +KRP+FS +K++ LDERFSAI +RV+SFSS+++D
Sbjct: 283 VVKKARRSEADTFSKYLTKYDEKHG-KKRPLFSLQKEK-LDERFSAISQRVESFSSVHKD 340
Query: 377 FGAKHIRFVSSSSEDEDSDDCTYE---CSNDISSNVQLPSQI-KGSDRVSSCPYPSVTEE 432
F KH+RF SSSS DE S D TYE D S++++L QI K +DR SSCPYPS TEE
Sbjct: 341 FCGKHVRFDSSSSGDEGSGDSTYEEKKVDTDSSNDLKLSLQINKSADRPSSCPYPSATEE 400
Query: 433 LKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR-A 491
+ RLGLKGE++ T G+ Q IGS KRK + + K K+ +
Sbjct: 401 MMRLGLKGEVSGHATPIGS---QKHCIGSGTLKRKRRSRSRGHTTSGTSTSSKKFKEDCS 457
Query: 492 LPIESG-DQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
LP++ +++ EA+ +N S+RMF+T WK+ C++ T++E
Sbjct: 458 LPVKDDFGNSEEGSGFEAEYEITNNSLRMFVTMWKDGCRDMTVAE--------------- 502
Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
V+SIK GMWDS+YD+LQ I EL NT Y
Sbjct: 503 -------------------------VSSIKNGMWDSIYDSLQIINLPELTNTNVKKQPVY 537
Query: 611 ESIEIEPSEQVAVV-NTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRK 669
+ I++ PS + A++ + + DI VE+++ KV ++++F+ I ++ + + I +L++
Sbjct: 538 DCIDVGPSAEGALIKHVPKSTHDITVEDILNKVGQHIKFEQEIRSDGRLLMENRIQILKQ 597
Query: 670 LCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQ 729
LCSCE WL EQ+ K+FKSLG+G+FF+FLEKHASML EL K + +RS LE ++Q
Sbjct: 598 LCSCEFWLVEQYSAKDFKSLGHGDFFSFLEKHASMLPDELHKFLVPEISERSPLEACMLQ 657
Query: 730 HLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEV 789
LV LVSQA NNL E+EII+KQMI LL +QFPLI FK+ + G +E F+E VG+ +N V
Sbjct: 658 RQLVTLVSQACNNLSENEIISKQMIYNLLTQQFPLIDFKLTENGPLEDFMELVGQQKNSV 717
Query: 790 MSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIEILLRA 849
+SKCV FS ++LG D +N L TS G +SVTS+DAI++L+RA
Sbjct: 718 VSKCVTFSVSLLGGSHLRDSLATMDNDSLGATSFSAEAGQGLGAIKSVTSEDAIKVLVRA 777
Query: 850 PFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLE 909
P L DLN WSHWD LFAP+LGPL WL EVN + +C+VT++GKVIRIDH+A+ DSFLE
Sbjct: 778 PMLLDLNLWSHWDLLFAPALGPLVPWLQKEVNTENFMCMVTKEGKVIRIDHTATADSFLE 837
Query: 910 AALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCR 969
AAL+GS F TAVKLLS FAL GGEK V L LLK HA AFEV+ KN+VE+IE+ +N
Sbjct: 838 AALQGSPFHTAVKLLSIFALLGGEKYVLLSLLKHHASRAFEVIMKNSVENIEMF--ENWG 895
Query: 970 MHGNVLCGRQNF-DVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAAD 1028
+ QNF + NLS E++K++ KA+ + SRFF+DCLGY+P EFR AA+
Sbjct: 896 QGLEKVAFHQNFIEQVAAGNLSLELKKKIDMRNKAISLLSRFFVDCLGYIPVEFRYLAAN 955
Query: 1029 VLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMP 1088
+LLSG+ S +KDA SAIL EC + E RLMLHE+GLSLG+ EWI DYH S+ +SDL
Sbjct: 956 ILLSGITSVVKDAASAILHECWKPEQRLMLHEIGLSLGVPEWIQDYHTVSSSASSDLFTD 1015
Query: 1089 CVVT-----CTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEASDDGLGD 1143
+ N GL + +E + S+ +V E+ + S+D G
Sbjct: 1016 ACLNDRSEINRNVHRDGLLTKYSTSEQNASFSIEENVFNEKLSVSSANCTAKTSNDANGL 1075
Query: 1144 CTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELY 1203
E +DA I++ IRRDEFGL ++ E+ ML+KQHARLGRALHCLSQELY
Sbjct: 1076 SCMSLASEPDGNKDAVEIIQCIRRDEFGLDLDLPISETGMLRKQHARLGRALHCLSQELY 1135
Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
SQDSHFLLELVQNADDNIYP++VEPTL FI +ESGIVVLNNE+GFSA+NIRALCDVGNST
Sbjct: 1136 SQDSHFLLELVQNADDNIYPQSVEPTLAFIFEESGIVVLNNEEGFSAKNIRALCDVGNST 1195
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMF 1323
KKGS+AGYIG+KGIGFKSVFR+TDAPEIHSNGFHVKFD SEGQIGFVLPT++ P N++++
Sbjct: 1196 KKGSNAGYIGKKGIGFKSVFRITDAPEIHSNGFHVKFDISEGQIGFVLPTIISPCNVNLY 1255
Query: 1324 CRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAM-NNIVDMFSDLHPSLLLFLHRLQCIM 1382
+L + ++ WNTCI LPFR+K S G+ + NNIV+MF+DLHPSLLLFLHRLQCI
Sbjct: 1256 GKLATSASDHEDTNIWNTCIVLPFRSKLSGGVNLINNIVNMFADLHPSLLLFLHRLQCIK 1315
Query: 1383 FRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL 1442
RN++++SL+V+RK+IVG+GII+VS GE+KMTW V SQKL+A VIR DV++TEI++A TL
Sbjct: 1316 IRNLIDNSLIVMRKEIVGNGIIRVSHGEEKMTWLVVSQKLKADVIRHDVQSTEISIAFTL 1375
Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
E G PLL+QQPVFAFLPLR YGLKFI+QGDFVLPSSREEVDG+SPWNQWLLSEFP
Sbjct: 1376 HEEENGVISPLLHQQPVFAFLPLRMYGLKFIIQGDFVLPSSREEVDGDSPWNQWLLSEFP 1435
Query: 1503 ALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
LFVSA SFC LPCF P KA S YMS++PL+GEVHGFFS LPR+I+SKLRMSNCL+L
Sbjct: 1436 GLFVSAVESFCSLPCFESCPGKAISAYMSYIPLIGEVHGFFSSLPRLIISKLRMSNCLLL 1495
Query: 1563 EGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGP 1622
EG N+WAPPCKVLRGWN++A +LLPD LL+++LGLGFL KDI+LSDSLARALGIEEYGP
Sbjct: 1496 EGKENEWAPPCKVLRGWNEQALTLLPDNLLREYLGLGFLHKDIILSDSLARALGIEEYGP 1555
Query: 1623 KILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRI 1682
KIL+Q +SSLC+ N L+SMGL WL S L+ L+ + SSGQ++L+ DLI +LQ++
Sbjct: 1556 KILVQFMSSLCQKYNSLKSMGLFWLGSCLSVLHNMLLQSSGQTTLELERNADLIRSLQKV 1615
Query: 1683 PFIPLSDGTFSSVDEGTIWLHSDCS--VFDGGFGLEAFPNLCAKLRTVSPALLSASAVDK 1740
P IPLSDGT+SSV EGTIWLHSD S DG +GLEAFP L +K+R V PA LS +VD
Sbjct: 1616 PLIPLSDGTYSSVAEGTIWLHSDSSNATVDGKYGLEAFPYLNSKIRVVCPAFLSLFSVDN 1675
Query: 1741 SSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMH 1800
S + V SV N++ ML +IGVQ+LSAH+I+K HI+PAI++E+ NG+K LM +Y+CFVM H
Sbjct: 1676 SQIDVPSVGNISWMLYRIGVQRLSAHEIIKEHIIPAITNESNLNGNKILMTEYVCFVMTH 1735
Query: 1801 LEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDI 1860
L CP CH++R FI+SELR KAF+LTNHG+KR E+P+HF KE+GNP+ +N L+ +++
Sbjct: 1736 LLSSCPECHIDRGFIISELRTKAFILTNHGYKRLVEVPVHFSKEYGNPIDLNKLL-SVEM 1794
Query: 1861 KWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM-W 1919
W+EV TYLKHP SLSCGL KWR FF+EIGI DFV VV+V++ +A++ H N W
Sbjct: 1795 NWHEVADTYLKHPVTNSLSCGLTKWRNFFQEIGINDFVHVVEVNRSIANMPHDIMVNRKW 1854
Query: 1920 TKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKS 1979
E++ G+ DWES EL HLL++L T+ N++ K+LLE+LDTLW+D +DK++G S
Sbjct: 1855 DPEIIFSGAMVKDWESPELTHLLTMLATHGNKESCKYLLEVLDTLWNDHLSDKVVGCCIS 1914
Query: 1980 KPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVK 2039
K + FQS+F+N ICD QW +SS+D + HYPKDL++DCDAVRSILG SAPY +PKV+
Sbjct: 1915 KSGDSSKQFQSAFMNSICDAQWVVSSVDKKGHYPKDLYYDCDAVRSILGASAPYALPKVQ 1974
Query: 2040 SEKLVCDIGLKTEVTIDDILEILKVWTRLEAPF------MARYYLLV 2080
S KLV DIG KT V++DD ILKVW R E PF M ++Y +
Sbjct: 1975 STKLVRDIGFKTRVSLDDTFNILKVW-RTEKPFKTSISQMCKFYTFL 2020
>gi|34148073|gb|AAQ62582.1| unknown [Glycine max]
Length = 2711
Score = 1928 bits (4995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1037/1985 (52%), Positives = 1353/1985 (68%), Gaps = 98/1985 (4%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
+E DRA+ A L+ AG+SVS+W VS++ LL LQVDSW++LG +MQ+VPSLH+LM+TE
Sbjct: 81 IEHADRAIANACRALLAAGDSVSAWTVSQNALLTLQVDSWNTLGIKMQQVPSLHRLMMTE 140
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++NAF+HCFVGV++ITSLYDLE AICK+EG++ FEEL LGPFLR PL+ HYF ++SDV
Sbjct: 141 GKVNAFVHCFVGVQRITSLYDLEVAICKNEGVDDFEELGLGPFLRHPLVIHYFSLRSDVT 200
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKD-IKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN 316
V+KI T ++I L E+ D ++ + IKV++FLDFIA KR + KE LG+RIQNLGMHI
Sbjct: 201 QVYKITTEEIIQLLIEFLDASRSNEFIKVEQFLDFIANKRLVECKEWLGIRIQNLGMHIY 260
Query: 317 FIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICE-RVKSFSSINE 375
IREAR S+ +++ +++L +R R + S ++ SA E R S+I E
Sbjct: 261 AIREARNSEQSAMRE-VRKLGQSARREARKLGQSARREARKLGQSARREARNSEQSAIQE 319
Query: 376 DFGAKHIRFVSSSSEDEDSDDCTYE-CSNDISSNVQL------PSQIKGSDRVSSCPYPS 428
++ S+ E +S+ YE C N + SQ K D + +
Sbjct: 320 ARNSEQ----SAVQEASNSEQSAYEKCLESFLKNGKFRYRTIPSSQKKQLDE----RFNA 371
Query: 429 VTEELKRLG-LKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKA 487
+T+ ++ +K + +S+ +D+ SS ++ + S P+ R+
Sbjct: 372 ITQRVESFSPVKKSFCGKHKRFMSSASEDEDSDSSTDEQSNNIIKGSQSNPSSQFTRSSE 431
Query: 488 KQRALPIESGDQTQDDELNEADI---SFSNESMRMFITTWKEACKNNTMSEVLEKMFQFY 544
+ + P S + + +D+ S N +++ SE K +F
Sbjct: 432 RVSSCPYPSATEEKARLGVRSDMAGHSLVNSNLK------------KGFSEQPRKKRKFE 479
Query: 545 KPTDPKKAA-RYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTV 603
T + A + KR KL VT I G + + T + + ++
Sbjct: 480 NATSTRSAPYKLRKRNKLG------------VVTPINTG------NKTKVSTNIDEDLSI 521
Query: 604 SGNSLEYESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKI 663
S +SL+ + ++A ++H V ++ V++V++K+ Y + N I N+ + +
Sbjct: 522 SNDSLQM----FVTTWKMAC--SEHKVAEVGVDDVIRKIGTYFDHGNDINRNSNDSLVQY 575
Query: 664 -IILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSS 722
I+L RK CSCE W+ EQF +K+F SLGYG+F +FLEKH ++L EL KL DT + SS
Sbjct: 576 RIMLWRKFCSCENWVAEQFGMKKFDSLGYGDFLSFLEKHPNLLPHELLKLLVGDTCENSS 635
Query: 723 LEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETV 782
+ + L+ LVSQA + LWE+E ITKQMIS LL RQFP I+F++ + GS+ L+TV
Sbjct: 636 FRACMSSNQLIALVSQALSGLWENETITKQMISMLLMRQFPSINFELVENGSLVDLLDTV 695
Query: 783 GKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDA 842
+ + V SKCV+FS TI+ +GD S +N+ E + + T + K +E+V +K+A
Sbjct: 696 KGHTSSVTSKCVVFSATIIEKNYNGDSSSDRDNNWSEIPTDRSETSHK-KSTETVIAKNA 754
Query: 843 IEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSA 902
IE+LL+AP LSDL+ WSHWD FAP LGP WLLN+VN KEL CLVTRDGKVIRIDHSA
Sbjct: 755 IEVLLKAPMLSDLSKWSHWDLRFAPFLGPFISWLLNDVNTKELFCLVTRDGKVIRIDHSA 814
Query: 903 SVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEV 962
++DSFLEAA++GSSFQTAV LLS +L GGEK VPL LLKCH+ HAFEVMF+N+VED+EV
Sbjct: 815 TLDSFLEAAVQGSSFQTAVHLLSLISLVGGEKYVPLSLLKCHSCHAFEVMFRNSVEDVEV 874
Query: 963 INSQNCRMHGNV--LCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
N N +H +V L + + + K + K K + SRF LDCLG LP+
Sbjct: 875 SNDGNA-LHQSVEALSKTKFLTEISTAKMRSVFSKHMHKVSKVASILSRFVLDCLGNLPA 933
Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
EF SFA+DVLLSG+QS KDA S IL EC E RLMLHE+GLSLGI EWI+DYHA S
Sbjct: 934 EFHSFASDVLLSGMQSVFKDAASTILCECSNMEQRLMLHEIGLSLGISEWINDYHALISN 993
Query: 1081 GTSDLLMPCV-VTCTNAATSGLNSG----------SGCAEGSLFESVGADVHIEECGAIC 1129
+SD+ C V+C AT+ +N+ S E ++ S+ IE C I
Sbjct: 994 NSSDI--HCARVSCLKDATTDINTSLKLDQVTLDKSPIPEANMVTSLVPHRLIEGCTEII 1051
Query: 1130 DTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHA 1189
+T+ E S+D C + E DA+ ++ESIRRDEFGL ++S+++S MLKKQHA
Sbjct: 1052 ETVDPEKSNDESNTCCLGNSFQHVEDMDASRLIESIRRDEFGLDSSLSDIDSCMLKKQHA 1111
Query: 1190 RLGRALHCLSQELYSQDSHFLLELV-----QNADDNIYPENVEPTLTFILQESGIVVLNN 1244
RLGRALHCLSQELYSQDSHF+LELV QNADDN YPENVEPTLTFIL++SGIVVLNN
Sbjct: 1112 RLGRALHCLSQELYSQDSHFILELVRIILVQNADDNNYPENVEPTLTFILRDSGIVVLNN 1171
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV-----FRVTDAPEIHSNGFHVK 1299
E+GFSA+N+RALCDVGNSTKKGS+AGYIG+KGIGFKSV +VTDAPEIHSNGFHVK
Sbjct: 1172 ERGFSAQNMRALCDVGNSTKKGSTAGYIGKKGIGFKSVPCLFPLQVTDAPEIHSNGFHVK 1231
Query: 1300 FDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN 1359
FD SEGQIGFVLPT+VPP +I + R+ S D + WNTCI LPFR+ SEG+AMN+
Sbjct: 1232 FDISEGQIGFVLPTVVPPCDIGVLRRMASTDTELCDDNPWNTCILLPFRSHLSEGMAMNS 1291
Query: 1360 IVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVAS 1419
++ MFSDLHPSLLLFLHRL+CI RN+LND+L V++K+I GDGIIKVS G++K+ WFV S
Sbjct: 1292 VLSMFSDLHPSLLLFLHRLKCIKLRNLLNDTLTVMKKEISGDGIIKVSHGKEKIVWFVVS 1351
Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFV 1479
QKL+ IR DV+TTEI++A TLQES+ G Y P QQPVFAFLPLRTYGLKFILQGDFV
Sbjct: 1352 QKLQTNSIRFDVQTTEISMAFTLQESDNG-YIPCSDQQPVFAFLPLRTYGLKFILQGDFV 1410
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV 1539
LPSSREEVDG+SPWNQWLLSE+P LFV A+R FC+LPCFR P K S +MSFVPLVGEV
Sbjct: 1411 LPSSREEVDGDSPWNQWLLSEYPNLFVRAQREFCELPCFRSEPGKGLSAFMSFVPLVGEV 1470
Query: 1540 HGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLG 1599
HGFFS LPR+I+SKLRM NCL+++G+NN+WAPPCKVLRGW ++ +L+PD +L +HLGL
Sbjct: 1471 HGFFSSLPRLIISKLRMMNCLLVDGDNNEWAPPCKVLRGWTEQVRNLIPDNMLLEHLGLI 1530
Query: 1600 FLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISF 1659
+L ++IVLSD LARALGIEE+GP IL++++SSLC T++GL SM +SWLAS LN L F
Sbjct: 1531 YLDRNIVLSDELARALGIEEFGPNILVRVLSSLCHTKSGLISMDMSWLASCLNILSVTMF 1590
Query: 1660 HSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSD--CSVFDGGFGLEA 1717
+SSG + ++ D+ NLQ++PFIPLSDGT+SSVDEGTIWLH + + FDG +EA
Sbjct: 1591 NSSGSVPINFEMK-DVQKNLQKMPFIPLSDGTYSSVDEGTIWLHFNHLNTGFDGEHKIEA 1649
Query: 1718 FPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAI 1777
FPN+CAKLRTVSP L SAS+ SL V +DN+ R+L IGVQQLS HD+VK+HILPA+
Sbjct: 1650 FPNICAKLRTVSPFLFSASS-GTPSLNVTFLDNVTRLLQSIGVQQLSVHDVVKLHILPAL 1708
Query: 1778 SDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEI 1837
SDET AN ++ LM +Y+CFVM+HL C +C +ERE I+SE R K+ +LTN+GFK P+EI
Sbjct: 1709 SDETMANKNRVLMIEYVCFVMLHLNSSCSDCFIEREHIISEFRCKSLLLTNYGFKCPAEI 1768
Query: 1838 PIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDF 1897
PIHF FGNPV+ ML + ++W+EVDI+YL HP ES+S L+KWR FF++ GITDF
Sbjct: 1769 PIHFCTGFGNPVTPKMLADSVSMRWHEVDISYLSHPVNESVSSALIKWRDFFEKFGITDF 1828
Query: 1898 VQVVQVDKDVADISHTGFKN-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKH 1956
QVVQVDK V DI FK MW + L+S S DWES+E+V L+SLL+ + N + K+
Sbjct: 1829 AQVVQVDKSVVDICDVTFKQMMWDRGLISAESIVKDWESSEIVQLVSLLSKSGNLENCKY 1888
Query: 1957 LLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDL 2016
LLE+LDTLWD CY++K G+F K GD F+S+FI +CDIQW +S+MDDELHYPKDL
Sbjct: 1889 LLEVLDTLWDVCYSNKTTGYFYLKSVGDGHPFKSTFICSLCDIQWVVSTMDDELHYPKDL 1948
Query: 2017 FHDCDAVRSILGPSAPYIVPK----VKSEKLVCDIGLKTEVTIDDILEILKVWTR-LEAP 2071
F+DC+ VR +LG APY VPK VKSE+LV D G KT VT+DDI ++LK W + + P
Sbjct: 1949 FYDCETVRMLLGDFAPYAVPKVSFLVKSERLVKDFGFKTRVTLDDIFDVLKAWRKSSKTP 2008
Query: 2072 FMARY 2076
F ARY
Sbjct: 2009 FKARY 2013
>gi|186511742|ref|NP_193111.2| Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Arabidopsis thaliana]
gi|332657921|gb|AEE83321.1| Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Arabidopsis thaliana]
Length = 2729
Score = 1846 bits (4782), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1028/2132 (48%), Positives = 1348/2132 (63%), Gaps = 185/2132 (8%)
Query: 26 PSQPPPPHFNPNFAFQN-PNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLT 84
P + P P F NF N P+ NP Q QNL + Q P+ N F+QN N
Sbjct: 50 PIRYPIPQFPANFYRPNFPDFSLGNPNFQPHQNLNFLHQQIPHQYGSAANHFLQNHN--- 106
Query: 85 QNP-NLPPQQPPSS--ACNQQRSQTQHPPGASNQQRPLTQQ----PQSLPPQPQKLNKEF 137
QN + PPQ P++ + +Q ++ +Q + Q P+S + N F
Sbjct: 107 QNSFSFPPQSIPNNDISISQNHGAFENSSLKRRRQEEVVQVTDVVPKSNFASGESANNSF 166
Query: 138 --------------------------------LERIDRAVVKARSELIMAGESVSSWKVS 165
+ RID+AV K R + AGESVSS +VS
Sbjct: 167 SVSLPIPIATDDSGVSRVHGEKSSGKPKRKVDVLRIDKAVNKTRKLFVAAGESVSSTRVS 226
Query: 166 RSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICK 225
R+VL +LQ DSW SLG QMQ+VPSL +LM E +INAFIHCFVG R+I +L+DLE AIC+
Sbjct: 227 RAVLEELQADSWRSLGVQMQDVPSLRQLMAIEGKINAFIHCFVGARRIVTLHDLEVAICR 286
Query: 226 SEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKV 285
+E ++ F++LELGP L+ PL+ YF S KI + ++I L Y T+ +D+K+
Sbjct: 287 NEFVDSFDDLELGPLLQHPLVLLYFPSISSSTGPVKITSEEIISFLDSYLHTYMTEDVKL 346
Query: 286 DEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRKR 345
DEFL+F+A ++S+ SKE+LGVRIQ+L M+++FI +A++ + TLK L EL H++
Sbjct: 347 DEFLNFVASQKSVTSKEKLGVRIQSLRMYVSFILDAKRQEGETLKVLLTEL---HQKYHI 403
Query: 346 PIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSNDI 405
P S KK+Q D+ + + ER SF+ ++D+ KHIRF SSSS D +D YE N
Sbjct: 404 P---SSKKQQRDKSLT-VSERADSFALHHKDYCGKHIRFDSSSS---DENDNVYEVRN-- 454
Query: 406 SSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKK 465
+ SD ++SCPYPSV EE+KRLG S KK
Sbjct: 455 ---------LNSSDHINSCPYPSVAEEMKRLG-----------------------GSNKK 482
Query: 466 RKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQDDE-------LNEADISFSNESMR 518
RK E S +K LR++ +K + + + DD ++EAD + S +R
Sbjct: 483 RKGERRNHEKSDSSKLLRKSPSKLQGHAKQEIPKLADDSEAKKVFSVDEADFTLSEGDLR 542
Query: 519 MFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTS 578
+FI+TWK+ CK ++S +EKM FY + A+ KR K M SS+P +GLLN+AVTS
Sbjct: 543 LFISTWKDTCKELSISTFVEKMLSFYNLGGSEGRAQ-IKRAKAM-SSFPFVGLLNVAVTS 600
Query: 579 IKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQDIQ---V 635
++RGMWDS+YD Q + + NT SGN + EI P E + T+H + V
Sbjct: 601 LRRGMWDSIYDNFQMTSLSDTTNTGSGNQVG----EINPIENSELSKTQHVMPPTHCNTV 656
Query: 636 EEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFF 695
EE+++++S Y E D L+ A K I + RKL +CE L EQF++++F+SLG+G FF
Sbjct: 657 EEIIRRLSLYFEHD---LSGA-----KHIGIFRKLQTCENLLAEQFQVQDFESLGWGGFF 708
Query: 696 TFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMIS 755
FLEKH +L T+LQ+ + + + LEV + ++LL +L+SQAS +++++Q ++
Sbjct: 709 AFLEKHMLLLPTQLQRFLSRELQEEFPLEVHVNENLLTLLLSQASE-FSSDKVLSRQTLA 767
Query: 756 ELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEEN 815
L+ QFP ISFK+ + S E+F E +GK ++ SKCVLFS T+LG EN
Sbjct: 768 RLVAEQFPSISFKVVGRDSEENFSEIIGKKKSS--SKCVLFSATLLGA----------EN 815
Query: 816 SLL-----ETTSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLG 870
SL E+ + T R +V SK+ +++LLR P LSDLNSW HWD +AP G
Sbjct: 816 SLTSKYLEESLTVGNDTEARSTTLNAVASKEVLDVLLRVPLLSDLNSWCHWDLRYAPQFG 875
Query: 871 PLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALA 930
PL G LNE+N +LLCLVTRDGK+IR D SA+ DSFLEAAL+GS+++TA +LLS +L
Sbjct: 876 PLMG-CLNEINSTDLLCLVTRDGKIIRADPSATADSFLEAALQGSAYRTAAQLLSLISL- 933
Query: 931 GGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQN--CRMHGNVLCGRQNFDVANIDN 988
G ++P LLKC+A+ AFEV F N E++E +N +N +MHG +
Sbjct: 934 NGRTHLPFSLLKCYAKRAFEVFFYNYSEEME-LNDRNSLVQMHGP-------------EK 979
Query: 989 LSGEVQKQLL-----KFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPS 1043
LS K ++ K K AS+F LDCLGYLP EFRS D+LL GL+S +KDAP+
Sbjct: 980 LSTSFDKVIVVGEKAKVAKRDYAASKFLLDCLGYLPGEFRSLVVDILLPGLRSVVKDAPT 1039
Query: 1044 AILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNS 1103
+L C QTE R+MLH+ GL LGI+EWI DYH FCS+ + + + A+S L+S
Sbjct: 1040 RVLSACEQTEQRIMLHDAGLLLGIVEWISDYHKFCSSCSPN------SSIVENASSNLDS 1093
Query: 1104 GSGCAEGSLFESVGADV-----------HIEECGAICDTICGEAS-DDGLGDCTTQTLPE 1151
G+G + L + V + EE C T G D +G+ TQT PE
Sbjct: 1094 GAGFVQNELEDPVQTKQRCMIVSEKSCEYKEEPHESCHTFGGSGILCDSVGEAFTQTAPE 1153
Query: 1152 DKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
++ A +++SIRRDEFGL S E +ML+KQHARLGRAL CLSQELYSQDSHF+L
Sbjct: 1154 --FYDNRASVIDSIRRDEFGLDLTSSGSEMSMLQKQHARLGRALQCLSQELYSQDSHFIL 1211
Query: 1212 ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY 1271
ELVQNADDN YPE+VEPTLTFILQ++GIVVLNNE GF ENIRALCDVG STKKGS
Sbjct: 1212 ELVQNADDNKYPEHVEPTLTFILQKTGIVVLNNECGFMPENIRALCDVGQSTKKGSGGYI 1271
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDP 1331
+ FKSVFRV+DAPEIHSNGFH KFD SEGQIG++LPT+VPP +I+ +LS
Sbjct: 1272 GKKGIG-FKSVFRVSDAPEIHSNGFHFKFDISEGQIGYILPTVVPPHDIESLSSMLSGRA 1330
Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
+ L+ WNTCI LPFR SE +N+I MFSDLHPSLLLFLHRLQCI++RN+L+DSL
Sbjct: 1331 LHLKDAGWNTCITLPFRAIDSERTTVNHIEPMFSDLHPSLLLFLHRLQCIVYRNVLDDSL 1390
Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG 1451
+V+RK++V I+KVSCGE+ MTWFVAS+KL+A +R DV+TTEI++ TL +G Y
Sbjct: 1391 LVMRKEVVSKNIVKVSCGENSMTWFVASEKLKATNLRDDVQTTEISIGFTLDMLEDGTYR 1450
Query: 1452 PLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERS 1511
+ Q+PVFAFLPLRTYGLKFI+QGDF+L SSRE+VD +SPWNQWLLSEFP LFV A RS
Sbjct: 1451 SCMIQEPVFAFLPLRTYGLKFIIQGDFILTSSREDVDEDSPWNQWLLSEFPGLFVDALRS 1510
Query: 1512 FCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAP 1571
FC LP F +N K S YM VPLVGEVHGFFS LPR I+S+LR +NCL+LEG+ +W P
Sbjct: 1511 FCSLPSFTQNLGKGVSSYMQLVPLVGEVHGFFSSLPRSIISRLRTTNCLLLEGDGEEWVP 1570
Query: 1572 PCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISS 1631
PCKVLR WN++ LL D LLQ+HL LGFL KDIVLSDSL+RALGIE+YGPK L+QI+SS
Sbjct: 1571 PCKVLRNWNEKIRVLLKDGLLQEHLALGFLDKDIVLSDSLSRALGIEDYGPKTLVQILSS 1630
Query: 1632 LCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGT 1691
L L+SMG +WL+S L ELY + F SSG +++ G++ LID+L +IPFIPLS+G
Sbjct: 1631 LSHKNGCLQSMGFTWLSSILTELYLL-FRSSGHGNVELGIDKSLIDDLHKIPFIPLSNGK 1689
Query: 1692 FSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNL 1751
F+S+DEG +WLH D + D G EAFP L LRT+ +LL AS+VD+ S SVD+L
Sbjct: 1690 FTSLDEGAVWLHHDTTGLDLGDVFEAFPVLYGNLRTIDHSLLLASSVDEKS----SVDDL 1745
Query: 1752 NRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVE 1811
ML IGVQ+LSAH+IVK HILPA +T D LM DYLCFVM HL C C E
Sbjct: 1746 VNMLCAIGVQKLSAHEIVKAHILPAFEARSTGAVD-GLMVDYLCFVMTHLRSGCHICLKE 1804
Query: 1812 REFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLK 1871
R++I+SELR KA VL+N+G K+ E IHFG+E+GN V++ L ++DI W+ VD TYLK
Sbjct: 1805 RKYIISELRSKALVLSNYGLKQLGEGSIHFGEEYGNQVNMKKLTKNLDISWHVVDGTYLK 1864
Query: 1872 HPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAI 1931
HPA + +CGL +WR+FF+EIGI DFVQVVQV+K +A+ + LLSP
Sbjct: 1865 HPASKFYACGLKEWREFFQEIGIADFVQVVQVEKSIAEFYSVSHCEKYDINLLSPDLTVK 1924
Query: 1932 DWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSS 1991
DWES ELV LLSLL + R+ K+LLE+LD LWDDCY DK + S G RS +SS
Sbjct: 1925 DWESPELVDLLSLLHKSNGRKGCKYLLEVLDRLWDDCYYDKTTVNYNSGTHGIIRSSESS 1984
Query: 1992 FINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKT 2051
F+ ICD W +SSMD +LH KDL+HDCD V+SILG +APY VP V S KL+ DIG KT
Sbjct: 1985 FMRVICDSLWIVSSMDSKLHLSKDLYHDCDDVQSILGMNAPYAVPTVTSVKLLSDIGFKT 2044
Query: 2052 EVTIDDILEILKVWTRLEAPF------MARYY 2077
+V++DD LE+L+ W F + R+Y
Sbjct: 2045 KVSLDDALEVLESWVHCGDSFKSSISQITRFY 2076
>gi|222625995|gb|EEE60127.1| hypothetical protein OsJ_13005 [Oryza sativa Japonica Group]
Length = 2821
Score = 1620 bits (4196), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 881/1987 (44%), Positives = 1245/1987 (62%), Gaps = 97/1987 (4%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
LER A K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L L + E
Sbjct: 172 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 231
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++N FIHC+V R+I S+YDLE ICK+E I +FEEL LGPFL+ PL++HYF V +D+
Sbjct: 232 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 291
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
+V K++++++I L ++ D K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 292 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 350
Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
+R+AR+++ +K ++ + + K K+ LDERFSAI R+K I
Sbjct: 351 LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 410
Query: 374 NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
N KHI F S+ ++ D D + + D S + + I D RVSSCPYPS T
Sbjct: 411 N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 464
Query: 431 EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
EE++RLGLK E + + + + G +++KRK E +G +S+ +
Sbjct: 465 EEMERLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 514
Query: 491 ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
P + + S + FITTWKEAC+ + + +VLE + +Y T +
Sbjct: 515 --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAETPKE 572
Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
K + FS YP IG LN+AV ++ G+ DS+YD + + + ++ + + E
Sbjct: 573 KKK-----IIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTEL 627
Query: 611 ESIEIEPSEQVAV----------VNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPV 660
+E PS++ + V +K + ++V+++++++ E + + S V
Sbjct: 628 MEVEPPPSKRKSKCVAKGDNDTNVGSKDPGCSVTADDVIRRITEFFESNRGVSRTDASQV 687
Query: 661 TKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDR 720
K LR L CET +T +F +F +LG+G F FL KH L +L +
Sbjct: 688 RKST-FLRTLLDCETCITAKFSANQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTH 746
Query: 721 SSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLE 780
SS+EVS++Q + VL+ QA N E ++ S+LL+RQFP ISF I S E L+
Sbjct: 747 SSVEVSVLQQQIEVLLCQAGGNWLEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLD 806
Query: 781 TVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSK 840
++ + R + ++FS ++L R SG + + + T + + ++ S +V+S+
Sbjct: 807 SIERQRKNIEINNIMFSMSLLEKRWSGIVPGNHD-----TVDGLMN-DIQQSCSVTVSSQ 860
Query: 841 DAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDH 900
+AI+ LL+AP LSDL +WSHWD LFAPSLG WLLN V++L C+VT DG+ IR+D
Sbjct: 861 EAIKCLLKAPMLSDLLTWSHWDLLFAPSLGSFMHWLLNTGPVQDLACIVTTDGRFIRVDP 920
Query: 901 SASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDI 960
SA+VD FLE ++ S FQ AVKLLS + G N P+ LLKC+A+ A ++ N + +
Sbjct: 921 SATVDQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLLKCYAQRAIGIIMDNGNDLL 980
Query: 961 EVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
N + G R N+ ++ K + V + ++F LDCLG+LPS
Sbjct: 981 ------NTKSEGKSFSAR---------NIWSDMSKDI---DDIVHLVAKFVLDCLGHLPS 1022
Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
EFRS AADVLL+GL++ K+ SAILLE +T MLH++GLSLG+ EW D C T
Sbjct: 1023 EFRSLAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDIGLSLGVAEWAEDCRRLCLT 1082
Query: 1081 GTSDLLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD---------------VHIEE 1124
D + + ++ SG C +L + D V + +
Sbjct: 1083 ---DEIHANIEMHASSRHPSTASGVAICENSNLLNATDVDIMKRSKSLPGKDNQIVAVSK 1139
Query: 1125 CGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNML 1184
+ + + + + TL E E+A L++E+IRR+EFGL ++S E+++L
Sbjct: 1140 NQNVLNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETIRREEFGLDQSLSCTENSLL 1198
Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN 1244
KKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+VEPTL FILQ++GIVVLNN
Sbjct: 1199 KKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVEDVEPTLAFILQDNGIVVLNN 1258
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
E GFSAENIRALCD+G+STKKGS+ GYIG KGIGFKSVFRVTDAPEIHSNGFHVKFD +E
Sbjct: 1259 ESGFSAENIRALCDIGSSTKKGSNQGYIGNKGIGFKSVFRVTDAPEIHSNGFHVKFDITE 1318
Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMF 1364
GQIGFVLPT V P+N D R+LS + V+ S WNTCI LPFR+KF EG M++I MF
Sbjct: 1319 GQIGFVLPTAVAPYNTDSVSRMLSVEDVKDSSSFWNTCIVLPFRSKFKEGTGMHSIASMF 1378
Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRA 1424
SDLHPSLLLFLHRL+CI F+NM+ND+L+V+R+K +G+GI+++S G D M+W V S+KL+
Sbjct: 1379 SDLHPSLLLFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVRISNGNDMMSWLVVSKKLQG 1438
Query: 1425 GVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSR 1484
++R DV +TEIA+A TLQE+ G Y P L QQPVFAFLPLR YGLKFILQGDFVLPSSR
Sbjct: 1439 TIVRNDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPLRNYGLKFILQGDFVLPSSR 1498
Query: 1485 EEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFS 1544
EEVD ++ WNQWLLSEFP+LFVSA+ SFC LPCF+ P KA + +MSFVPLVGEVHGFF
Sbjct: 1499 EEVDADNAWNQWLLSEFPSLFVSAQESFCALPCFQGCPGKAVTTFMSFVPLVGEVHGFFC 1558
Query: 1545 GLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKD 1604
LP +ILSKLR++ C++LEG++++W PC LRGW+++ L+ D LL +HLGLG+LSKD
Sbjct: 1559 QLPHLILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRILISDSLLLEHLGLGYLSKD 1618
Query: 1605 IVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELY-TISFHSSG 1663
I++SD+L+RALGI EYGPK+L+ IISS+CR + + S+GL WL +W L+ ++ HSS
Sbjct: 1619 IIISDTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLEWLCAWFISLHLSLMHHSSK 1678
Query: 1664 QSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDC--SVFDGGFGLEAFPNL 1721
L + E DL+ L++IP IPLSDG+FSS+ +G IWL D S D + FP L
Sbjct: 1679 NLPLTTSPE-DLLCALRKIPCIPLSDGSFSSIADGPIWLPYDVLNSKPDSRSSMLNFPVL 1737
Query: 1722 CAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDET 1781
+ LRT+ P LLS S +K + ++L +LLK+GV++LS HDI+K HIL ++S+ T
Sbjct: 1738 YSNLRTIKPRLLSVSCQNKYLTEEMRANDLMDILLKMGVRKLSGHDIIKNHILVSLSNST 1797
Query: 1782 TANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHF 1841
AN +M +Y+ F+M+HL+ C +C+ E+E I+SELR++ +LTNHG+K P + PIHF
Sbjct: 1798 EANVANTMMIEYVSFIMLHLQSPCASCNFEKEEIMSELRRRPILLTNHGYKCPYDEPIHF 1857
Query: 1842 GKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESL-SCGLVKWRKFFKEIGITDFVQV 1900
KE+GN V + L+ +++IKW E+D YL + +SL L KWR+FF+E+G+TDFVQV
Sbjct: 1858 SKEYGNSVDLCKLLLNVEIKWIELDSCYLMNRGSDSLPPFELKKWRQFFEEMGVTDFVQV 1917
Query: 1901 VQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEI 1960
V+V+K+++ + + DWES ELV +LS ++ R+ +LLE+
Sbjct: 1918 VKVEKNISQADSSLAGRLSQGHHSGTPCIVYDWESPELVSILSTFSSKKCRENCVYLLEV 1977
Query: 1961 LDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDC 2020
LD WD Y+ K + +G++ + +SSF+N I +W S+MD++LHY DLF++
Sbjct: 1978 LDKFWDAHYSAKARIHADATHSGENIAVESSFMNSIRTFKWIASAMDEDLHYATDLFYNT 2037
Query: 2021 DAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVWTRLEAPF------MA 2074
+ VRSILG APY VP+V S L DIG K +V+ D L ILK W + F M
Sbjct: 2038 EDVRSILGSVAPYAVPQVCSRSLGKDIGFKIKVSHSDALMILKSWIASQTSFSASMDQMC 2097
Query: 2075 RYYLLVN 2081
++Y V+
Sbjct: 2098 KFYTFVS 2104
>gi|297804940|ref|XP_002870354.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316190|gb|EFH46613.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 2136
Score = 1520 bits (3936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 802/1537 (52%), Positives = 1038/1537 (67%), Gaps = 83/1537 (5%)
Query: 571 LLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTV 630
+L++ VTS+KRGMWDS+YD LQ + + NT SGN + EI PSE + T+H +
Sbjct: 1 MLHLQVTSLKRGMWDSIYDNLQMTSLSDTTNTGSGNQVG----EINPSENSELSKTQHVM 56
Query: 631 ---QDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFK 687
VEE+++++S Y E D S K I + RKL +CE L EQF++++F+
Sbjct: 57 PPKHSNTVEEIIRRLSLYFEHDLS--------GEKHIGIFRKLQTCEVLLAEQFQVQDFE 108
Query: 688 SLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESE 747
SLG+G FFTFLEKH +L T+LQ+ + + ++ LEV + ++LL +L+SQAS
Sbjct: 109 SLGWGGFFTFLEKHMLLLPTQLQRFLSRELLEEFPLEVHVNENLLTLLLSQASE-FSGGN 167
Query: 748 IITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSG 807
+I++Q ++ LL QFP I+FK+ + S E+F E +GK ++ SKCVLFS T LSG
Sbjct: 168 VISRQTVNRLLAEQFPSINFKVVGRNSEENFTEIIGKEKS--CSKCVLFSAT-----LSG 220
Query: 808 DLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAP 867
+ L E+ + T R +V SK+ +++LLR P LSDLNSW HWD +AP
Sbjct: 221 AENSLTSKYLEESLTVGNDTEARGSTLNAVASKEVLDVLLRVPLLSDLNSWCHWDLRYAP 280
Query: 868 SLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSF 927
GPL G LNE+N K+LLCLVTRDGK+IR D SA+ DSFLEAAL+GS++ TA +LLS
Sbjct: 281 QFGPLLG-CLNEINSKDLLCLVTRDGKIIRTDPSATADSFLEAALQGSAYCTAAQLLSLI 339
Query: 928 ALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQN-CRMHGNVLCGRQNFDVANI 986
+L G ++P LLKC+A+ AFEV F N+ E++E+ N + +MH
Sbjct: 340 SL-NGRTHLPFSLLKCYAKRAFEVFFDNHSEEMELNNRNSLVQMH-------------RP 385
Query: 987 DNLSGEVQKQLL-----KFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDA 1041
+ LS + K ++ K K+ AS+F LDCLGYLP EFRS D+LL GL+S +KDA
Sbjct: 386 EQLSTSLDKMIVVGEKTKAAKSDYAASKFLLDCLGYLPGEFRSLVVDILLPGLRSVVKDA 445
Query: 1042 PSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGL 1101
P+ +L C QTE R+MLH+ GL LGI+EWI DYH FCS+ + + + +A+S L
Sbjct: 446 PTRVLSACKQTEQRIMLHDAGLLLGIVEWISDYHMFCSSCSPN------SSIVESASSNL 499
Query: 1102 NSGSGCAEGSLFESVGADVHIEECGAICDTICGEASD---------------DGLGDCTT 1146
+S +G + L + + + + C + + C + + D +G+ T
Sbjct: 500 DSVAGSVQKELEDPIQTE---QRCMIVSEKACQDKKEPHESCHTFGDSGTLCDSVGEAFT 556
Query: 1147 QTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQD 1206
QT PE + + A +++SIRRDEFGL S E +ML+KQHARLGRAL CLSQELYSQD
Sbjct: 557 QTAPEFHD--NPASVIDSIRRDEFGLDLTSSGSEMSMLQKQHARLGRALQCLSQELYSQD 614
Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG 1266
SHF+LELVQNADDN YP +VEPTLTFILQ++GIVVLNNE GF ENIRALCDVG STKKG
Sbjct: 615 SHFILELVQNADDNKYPAHVEPTLTFILQKTGIVVLNNECGFMPENIRALCDVGQSTKKG 674
Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRL 1326
S + FKSVFRV+DAPEIHSNGFH KFD SEGQIG++LPT+VPP +I+ +
Sbjct: 675 SGGYIGKKGIG-FKSVFRVSDAPEIHSNGFHFKFDISEGQIGYILPTVVPPQDIESLSSM 733
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
LS + L+ WNTCI LPFR+ SEG +N+I MFSDLHPSLLLFLHRLQCI++RN+
Sbjct: 734 LSGRALHLKDAGWNTCITLPFRSIDSEGTTVNHIEPMFSDLHPSLLLFLHRLQCIVYRNV 793
Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
L DSL+V+RK++V + I+KVSCGE+ M WFVAS+KL+A +R DV+TTEI++ TL
Sbjct: 794 LEDSLLVMRKEVVSNNIVKVSCGENSMIWFVASEKLKAANLRDDVQTTEISIGFTLDMLE 853
Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
+G Y + Q+PVFAFLPLRTYGLKFI+QGDF+L SSRE+VD +SPWNQWLLSEFP LFV
Sbjct: 854 DGTYRSCMIQEPVFAFLPLRTYGLKFIIQGDFILTSSREDVDEDSPWNQWLLSEFPGLFV 913
Query: 1507 SAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNN 1566
A RSFC LP F N KA S YM VPL GEVHGFFS LPR I+S+LR +NCL+LEG+
Sbjct: 914 DALRSFCSLPSFTHNLGKAVSSYMQLVPLAGEVHGFFSSLPRSIISRLRTTNCLLLEGDG 973
Query: 1567 NQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILL 1626
+W PPCKVLR WN++ LL D LLQ+HL LGFL KDIVLSDSL+RAL IE+YGPK L+
Sbjct: 974 EEWVPPCKVLRNWNEKIRVLLKDGLLQEHLALGFLDKDIVLSDSLSRALSIEDYGPKTLV 1033
Query: 1627 QIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIP 1686
QI+SSL L+SMG +WL+S L ELY I F SSG +++ G++ LID+L RIPFIP
Sbjct: 1034 QILSSLSHKNGCLQSMGFTWLSSILTELY-ILFRSSGHGNVELGIDKTLIDDLHRIPFIP 1092
Query: 1687 LSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVI 1746
LS+G F+++DEG +WLH D + D G EAFP L LRT+ +LL AS+VD+ S
Sbjct: 1093 LSNGKFTALDEGAVWLHHDTTGLDLGDVFEAFPVLYGNLRTIDHSLLLASSVDEKS---- 1148
Query: 1747 SVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCP 1806
SVD+L ML IGVQ+LSAH+I+KVHILPA +T + + LM DYLCFVM HL C
Sbjct: 1149 SVDDLVNMLCAIGVQKLSAHEIIKVHILPAFEARST-DTVEGLMVDYLCFVMTHLRSSCH 1207
Query: 1807 NCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVD 1866
CH ER +I+SELR KA VL+N+G K+ E IHFG+EFGN V++ L ++DI W+ VD
Sbjct: 1208 ICHNERNYIISELRSKALVLSNYGLKQLGEGSIHFGEEFGNQVNMKKLTKNLDISWHVVD 1267
Query: 1867 ITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSP 1926
TYLKHPA + +CGL +WR+FF+EIGI DFVQVVQV+K +A+ + LLSP
Sbjct: 1268 GTYLKHPASKFYACGLKEWREFFQEIGIADFVQVVQVEKTIAEFYSVSRCEKYDINLLSP 1327
Query: 1927 GSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDR 1986
DWES ELV+LLSLL + R+ K+L+E+LD LWD+CY +K + S G R
Sbjct: 1328 ELTVKDWESPELVNLLSLLHKSNGRKGCKYLMEVLDRLWDECYHEKTTVNYNSGTHGVIR 1387
Query: 1987 SFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCD 2046
S +SSF+ ICD QW +SSMD ELH K L+HDCD V+SILG +APY VPKV S KL+ D
Sbjct: 1388 SSESSFMRVICDSQWIVSSMDSELHLAKGLYHDCDDVQSILGMNAPYAVPKVTSVKLLSD 1447
Query: 2047 IGLKTEVTIDDILEILKVWTRLEAPF------MARYY 2077
IG KT+V +DD LEIL+ W F + R+Y
Sbjct: 1448 IGFKTKVCLDDALEILEAWVHCGDNFKSSISQITRFY 1484
>gi|4455304|emb|CAB36839.1| hypothetical protein [Arabidopsis thaliana]
gi|7268079|emb|CAB78417.1| hypothetical protein [Arabidopsis thaliana]
Length = 2137
Score = 1518 bits (3929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 806/1540 (52%), Positives = 1035/1540 (67%), Gaps = 89/1540 (5%)
Query: 571 LLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTV 630
+L++ VTS++RGMWDS+YD Q + + NT SGN + EI P E + T+H +
Sbjct: 1 MLHLQVTSLRRGMWDSIYDNFQMTSLSDTTNTGSGNQVG----EINPIENSELSKTQHVM 56
Query: 631 QDIQ---VEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFK 687
VEE+++++S Y E D L+ A K I + RKL +CE L EQF++++F+
Sbjct: 57 PPTHCNTVEEIIRRLSLYFEHD---LSGA-----KHIGIFRKLQTCENLLAEQFQVQDFE 108
Query: 688 SLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESE 747
SLG+G FF FLEKH +L T+LQ+ + + + LEV + ++LL +L+SQAS +
Sbjct: 109 SLGWGGFFAFLEKHMLLLPTQLQRFLSRELQEEFPLEVHVNENLLTLLLSQASE-FSSDK 167
Query: 748 IITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSG 807
++++Q ++ L+ QFP ISFK+ + S E+F E +GK ++ SKCVLFS T+LG
Sbjct: 168 VLSRQTLARLVAEQFPSISFKVVGRDSEENFSEIIGKKKSS--SKCVLFSATLLGA---- 221
Query: 808 DLSVHEENSLL-----ETTSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWD 862
ENSL E+ + T R +V SK+ +++LLR P LSDLNSW HWD
Sbjct: 222 ------ENSLTSKYLEESLTVGNDTEARSTTLNAVASKEVLDVLLRVPLLSDLNSWCHWD 275
Query: 863 FLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVK 922
+AP GPL G LNE+N +LLCLVTRDGK+IR D SA+ DSFLEAAL+GS+++TA +
Sbjct: 276 LRYAPQFGPLMG-CLNEINSTDLLCLVTRDGKIIRADPSATADSFLEAALQGSAYRTAAQ 334
Query: 923 LLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQN--CRMHGNVLCGRQN 980
LLS +L G ++P LLKC+A+ AFEV F N E++E +N +N +MHG
Sbjct: 335 LLSLISL-NGRTHLPFSLLKCYAKRAFEVFFYNYSEEME-LNDRNSLVQMHGP------- 385
Query: 981 FDVANIDNLSGEVQKQLL-----KFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQ 1035
+ LS K ++ K K AS+F LDCLGYLP EFRS D+LL GL+
Sbjct: 386 ------EKLSTSFDKVIVVGEKAKVAKRDYAASKFLLDCLGYLPGEFRSLVVDILLPGLR 439
Query: 1036 STIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTN 1095
S +KDAP+ +L C QTE R+MLH+ GL LGI+EWI DYH FCS+ + + +
Sbjct: 440 SVVKDAPTRVLSACEQTEQRIMLHDAGLLLGIVEWISDYHKFCSSCSPN------SSIVE 493
Query: 1096 AATSGLNSGSGCAEGSLFESVGADV-----------HIEECGAICDTICGEAS-DDGLGD 1143
A+S L+SG+G + L + V + EE C T G D +G+
Sbjct: 494 NASSNLDSGAGFVQNELEDPVQTKQRCMIVSEKSCEYKEEPHESCHTFGGSGILCDSVGE 553
Query: 1144 CTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELY 1203
TQT PE + + A +++SIRRDEFGL S E +ML+KQHARLGRAL CLSQELY
Sbjct: 554 AFTQTAPEFYD--NRASVIDSIRRDEFGLDLTSSGSEMSMLQKQHARLGRALQCLSQELY 611
Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
SQDSHF+LELVQNADDN YPE+VEPTLTFILQ++GIVVLNNE GF ENIRALCDVG ST
Sbjct: 612 SQDSHFILELVQNADDNKYPEHVEPTLTFILQKTGIVVLNNECGFMPENIRALCDVGQST 671
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMF 1323
KKGS + FKSVFRV+DAPEIHSNGFH KFD SEGQIG++LPT+VPP +I+
Sbjct: 672 KKGSGGYIGKKGIG-FKSVFRVSDAPEIHSNGFHFKFDISEGQIGYILPTVVPPHDIESL 730
Query: 1324 CRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF 1383
+LS + L+ WNTCI LPFR SE +N+I MFSDLHPSLLLFLHRLQCI++
Sbjct: 731 SSMLSGRALHLKDAGWNTCITLPFRAIDSERTTVNHIEPMFSDLHPSLLLFLHRLQCIVY 790
Query: 1384 RNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
RN+L+DSL+V+RK++V I+KVSCGE+ MTWFVAS+KL+A +R DV+TTEI++ TL
Sbjct: 791 RNVLDDSLLVMRKEVVSKNIVKVSCGENSMTWFVASEKLKATNLRDDVQTTEISIGFTLD 850
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
+G Y + Q+PVFAFLPLRTYGLKFI+QGDF+L SSRE+VD +SPWNQWLLSEFP
Sbjct: 851 MLEDGTYRSCMIQEPVFAFLPLRTYGLKFIIQGDFILTSSREDVDEDSPWNQWLLSEFPG 910
Query: 1504 LFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILE 1563
LFV A RSFC LP F +N K S YM VPLVGEVHGFFS LPR I+S+LR +NCL+LE
Sbjct: 911 LFVDALRSFCSLPSFTQNLGKGVSSYMQLVPLVGEVHGFFSSLPRSIISRLRTTNCLLLE 970
Query: 1564 GNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPK 1623
G+ +W PPCKVLR WN++ LL D LLQ+HL LGFL KDIVLSDSL+RALGIE+YGPK
Sbjct: 971 GDGEEWVPPCKVLRNWNEKIRVLLKDGLLQEHLALGFLDKDIVLSDSLSRALGIEDYGPK 1030
Query: 1624 ILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIP 1683
L+QI+SSL L+SMG +WL+S L ELY + F SSG +++ G++ LID+L +IP
Sbjct: 1031 TLVQILSSLSHKNGCLQSMGFTWLSSILTELYLL-FRSSGHGNVELGIDKSLIDDLHKIP 1089
Query: 1684 FIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSL 1743
FIPLS+G F+S+DEG +WLH D + D G EAFP L LRT+ +LL AS+VD+ S
Sbjct: 1090 FIPLSNGKFTSLDEGAVWLHHDTTGLDLGDVFEAFPVLYGNLRTIDHSLLLASSVDEKS- 1148
Query: 1744 GVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEY 1803
SVD+L ML IGVQ+LSAH+IVK HILPA +T D LM DYLCFVM HL
Sbjct: 1149 ---SVDDLVNMLCAIGVQKLSAHEIVKAHILPAFEARSTGAVD-GLMVDYLCFVMTHLRS 1204
Query: 1804 YCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWY 1863
C C ER++I+SELR KA VL+N+G K+ E IHFG+E+GN V++ L ++DI W+
Sbjct: 1205 GCHICLKERKYIISELRSKALVLSNYGLKQLGEGSIHFGEEYGNQVNMKKLTKNLDISWH 1264
Query: 1864 EVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKEL 1923
VD TYLKHPA + +CGL +WR+FF+EIGI DFVQVVQV+K +A+ + L
Sbjct: 1265 VVDGTYLKHPASKFYACGLKEWREFFQEIGIADFVQVVQVEKSIAEFYSVSHCEKYDINL 1324
Query: 1924 LSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTG 1983
LSP DWES ELV LLSLL + R+ K+LLE+LD LWDDCY DK + S G
Sbjct: 1325 LSPDLTVKDWESPELVDLLSLLHKSNGRKGCKYLLEVLDRLWDDCYYDKTTVNYNSGTHG 1384
Query: 1984 DDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKL 2043
RS +SSF+ ICD W +SSMD +LH KDL+HDCD V+SILG +APY VP V S KL
Sbjct: 1385 IIRSSESSFMRVICDSLWIVSSMDSKLHLSKDLYHDCDDVQSILGMNAPYAVPTVTSVKL 1444
Query: 2044 VCDIGLKTEVTIDDILEILKVWTRLEAPF------MARYY 2077
+ DIG KT+V++DD LE+L+ W F + R+Y
Sbjct: 1445 LSDIGFKTKVSLDDALEVLESWVHCGDSFKSSISQITRFY 1484
>gi|414873448|tpg|DAA52005.1| TPA: hypothetical protein ZEAMMB73_895652 [Zea mays]
Length = 2676
Score = 1474 bits (3817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 833/2010 (41%), Positives = 1185/2010 (58%), Gaps = 195/2010 (9%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
L+R A KAR EL+ G+ V++WKV+++VL+ L+ DSW SLG Q Q+VP L L + E
Sbjct: 118 LDRAQEAARKAREELVKGGDGVTAWKVAQAVLVALKADSWDSLGVQPQDVPLLRDLFLIE 177
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++NAFIHC+V R+I ++YDLE ICK+EG+ +FEEL LGPFL+ PL++HYFLV +D+
Sbjct: 178 GKVNAFIHCYVAARKIVTVYDLEVEICKNEGVVQFEELGLGPFLQHPLVAHYFLVPADLS 237
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
VV K+ C E +T L F+ K +
Sbjct: 238 VVPKL-------CSEEIINT----------LLKFVDKSK--------------------- 259
Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDF 377
+ +T++ FL +H K+ + EK L R ++ + + F+++ F
Sbjct: 260 -------KKITIEDFL-----NHLAEKKSVSGKEK---LGVRVQSLGKAIPFFTNLGR-F 303
Query: 378 GAKHIRFVSSSSEDE-----------DSDDCTYECSNDISSNVQLPSQIKGSDRVSSCPY 426
HI + + + E S + + E D+ N + K D+ +
Sbjct: 304 CRLHISLLRQAKQTEVSAAKLRANMSGSGNSSQE--KDLLKNASFHTHKKALDK----RF 357
Query: 427 PSVTEELKRL-GLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRN 485
S+T +K L G+ I+ T DD S ++ G + L +
Sbjct: 358 ISLTNRIKELPGINKHIHFDST--------DDETNSCTGSEDGKSDGNENENECSILDKK 409
Query: 486 KAKQRALPIESGDQTQD-------DELNEADISFSNESMRMFITTWKEACKNNTMSEVLE 538
+R +T++ E+N+ S N R + +VLE
Sbjct: 410 DGDKRVNSCPYPSKTEELERLGLKSEINKRQ-SLENSKPR----------DSGKKGKVLE 458
Query: 539 KMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQE 598
+ +Y T K +R++ FS YP IGLLN+AV S+ G+ DS+YD +Q + +
Sbjct: 459 MLANYYGRTISGK-----RRIRNFFSQYPGIGLLNVAVKSMGCGLLDSIYDVIQLADEND 513
Query: 599 LNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQS 658
+ ++ N+ E +EIEP + +++V++++S+Y+E ++ I +
Sbjct: 514 VASSPPPNTTT-EVMEIEPPS-------------VAIDDVIRRISEYIESNSKISGD--- 556
Query: 659 PVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTI 718
+ + +R L CETW+T QF +F +LG+G F FL+KH T L F +
Sbjct: 557 ----VALQVRALNDCETWVTTQFSANQFSALGHGTFLEFLDKHCHQFRTALSSFFKEGPS 612
Query: 719 DRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKI-EDK--GSM 775
+ SSLEVS++Q + L+ QA +N E ++ LL++QFP ISF I +DK G +
Sbjct: 613 NSSSLEVSVLQQQIEFLLCQAESNWLEDSDFSEDSFVMLLKKQFPTISFNIVQDKSDGGV 672
Query: 776 ESFLETVGKYRNEVMSKCVLFSETILGTRLSGDL-----SVHEENSLLETTSAITHTGLR 830
F+E K ++ + + FS ++L R SG L +V E +++ S HT
Sbjct: 673 SGFIEGQKK---DIRTYSLKFSISLLEKRWSGTLPSRHGNVDELGNIVAEQSCY-HT--- 725
Query: 831 PKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVT 890
+V S++AI LLRAP LSDL+ WSHWD LFAP+LG WLLN ++EL C+ T
Sbjct: 726 -----TVCSREAINCLLRAPMLSDLHLWSHWDSLFAPTLGSFVHWLLNTGPIQELACIAT 780
Query: 891 RDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFE 950
DG+ IR+D SA+VD FLEA + S Q AVKLLS + G N P+ LLKC+A+ A +
Sbjct: 781 TDGRFIRVDSSATVDQFLEAIIHRSPLQVAVKLLSLLYIYNGSMNTPMSLLKCYAQRAIK 840
Query: 951 VMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQ-------------L 997
++ NN D+ S+N + QN + + + Q+ L
Sbjct: 841 LIVDNN-NDLMNAKSENKIFMPD---EPQNLSSESSTCFADQCQESSQASPGRLIRSDSL 896
Query: 998 LKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLM 1057
V + ++F LDCL +LPSEFRS AAD+LL+G + K+ + +L E + M
Sbjct: 897 PNINNTVHLITKFVLDCLDHLPSEFRSLAADILLAGFRVVTKNWHAVMLHEATENGQLCM 956
Query: 1058 LHEVGLSLGILEWIHDYHAFCST----------GTSDLLMPCVVTCTNAATSGLNSGSGC 1107
LH++GLSLG++EW+ D H C T ++ L P V T+ S ++ S
Sbjct: 957 LHDIGLSLGVVEWVEDCHRLCLTDEVHVQTEMHSSAKLATPASVGATHE-DSNMHISSDV 1015
Query: 1108 ----AEGSLFESVGADVHIE-ECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIV 1162
LF + I+ E + + EA L + ++ E+ E+A+L++
Sbjct: 1016 NMMDKRKQLFPVINDRAGIDKEDNKMLNPAGTEADVAELHTTSRSSMMEETSLEEASLVI 1075
Query: 1163 ESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY 1222
E+IRR+EFGL +S+ E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y
Sbjct: 1076 ETIRREEFGLDQALSDTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTY 1135
Query: 1223 PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV 1282
PE+VE TL FILQE GI VLNNE+GFSAENIRALCD+GNSTKKGS+ GYIG KGIGFKSV
Sbjct: 1136 PEDVEATLAFILQEDGIAVLNNERGFSAENIRALCDIGNSTKKGSNQGYIGNKGIGFKSV 1195
Query: 1283 FRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTC 1342
FRVTDAPEIHSNGFHVKFD ++GQIGFVLPT VPP++ F R+LS + + WNTC
Sbjct: 1196 FRVTDAPEIHSNGFHVKFDITDGQIGFVLPTAVPPYSTTSFSRMLSVEDDKDAHSLWNTC 1255
Query: 1343 IRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG 1402
I LPFR+KF E +DLHPSLLLFLHRL+CI F+N+ +D+L+++R++I+GDG
Sbjct: 1256 ILLPFRSKFREC----------TDLHPSLLLFLHRLKCIKFKNLFDDTLLIMRREILGDG 1305
Query: 1403 IIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
I+++S G + M+W V S++L+ ++R DV TTEIA+A TLQ++ EG+Y P L QQPVFAF
Sbjct: 1306 IVRISHGIETMSWLVVSKRLQGTIVRHDVCTTEIAVAFTLQQTEEGDYEPYLKQQPVFAF 1365
Query: 1463 LPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENP 1522
LPLR YGLKFILQGDFVLPSSREEVD +S WNQWLLSEFP+LFVSA+ SFC LPCF+ P
Sbjct: 1366 LPLRNYGLKFILQGDFVLPSSREEVDADSAWNQWLLSEFPSLFVSAQESFCALPCFQRCP 1425
Query: 1523 AKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDR 1582
KA + ++SF+PL GEVHGFFS LP +ILSKLR++ C+ L+G+ QW PC LRGW+++
Sbjct: 1426 GKAVTAFLSFIPLAGEVHGFFSQLPHLILSKLRLTRCMFLDGSTVQWVYPCNTLRGWDEQ 1485
Query: 1583 AHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSM 1642
L + LL +HLGLG+LSK+IV+SD L+RALGI +YGP ILL ISS+CR + + S+
Sbjct: 1486 TKMLFSEGLLHEHLGLGYLSKNIVISDKLSRALGIHDYGPNILLDAISSICRIDGCIESL 1545
Query: 1643 GLSWLASWLNELY-TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIW 1701
GL WL +W LY T+ HSS S +E L+D +++IP IPLSDG+F+S+ +G IW
Sbjct: 1546 GLEWLCAWFVNLYLTLLSHSSQNVSSARNLEDILLDKVRKIPCIPLSDGSFTSISDGRIW 1605
Query: 1702 LHSD-CSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGV 1760
L D S + FP L LRTVSP LLSA +K + + +++L ML +IGV
Sbjct: 1606 LPYDVASSIPECSSIPNFPALYGNLRTVSPNLLSACCKNKYLMEEVRINDLADMLQRIGV 1665
Query: 1761 QQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELR 1820
++LS HDIVK HI+ ++ + AN ++ +Y+ F+M+HL+ C +C+ ER IVSELR
Sbjct: 1666 RKLSGHDIVKNHIMVSLRNGLDANVADIVIREYVSFIMVHLQSSCTSCNFERGEIVSELR 1725
Query: 1821 KKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSC 1880
K LTNHG+K P++ PIHF K++GN V V L+ +++I W E+D +YL+H +S S
Sbjct: 1726 KSPIFLTNHGYKSPADEPIHFSKDYGNSVDVTRLLQNVEISWIELDSSYLQHHGSQSSSF 1785
Query: 1881 GLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVH 1940
KWR+FF+E+G+TDFVQVV+V+K ++ + + ++ + DWES EL
Sbjct: 1786 AREKWRQFFEEMGVTDFVQVVKVEKTLSQVDSILAGGLSLADVSTKQCTVYDWESPELSR 1845
Query: 1941 LLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQ 2000
+LS+ ++ ++ +LLE+LD WDD Y+ K F + G++R+ +SSF+ CI +
Sbjct: 1846 ILSIFSSKRCKENCVYLLEVLDRFWDDHYSAKSRIFTDATHCGENRAVESSFMKCIQSFK 1905
Query: 2001 WTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILE 2060
W S MD++LHY DLF+DC+ VRS+ G APY VP+V S L IG KTEV+ D L
Sbjct: 1906 WIASRMDEDLHYATDLFYDCENVRSLFGSVAPYAVPQVSSSSLKKAIGFKTEVSYCDALM 1965
Query: 2061 ILKVWTRLEAPF------MARYYLLVNHDV 2084
+LK W + PF M ++Y ++ V
Sbjct: 1966 VLKSWITSKVPFRASMSQMCKFYTFLSEGV 1995
>gi|218193937|gb|EEC76364.1| hypothetical protein OsI_13954 [Oryza sativa Indica Group]
Length = 1909
Score = 1369 bits (3543), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 777/1842 (42%), Positives = 1108/1842 (60%), Gaps = 150/1842 (8%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
LER A K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L L + E
Sbjct: 179 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++N FIHC+V R+I S+YDLE ICK+E I +FEEL LGPFL+ PL++HYF V +D+
Sbjct: 239 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
+V K++++++I L ++ D K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 299 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 357
Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
+R+AR+++ +K ++ + + K K+ LDERFSAI R+K I
Sbjct: 358 LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 417
Query: 374 NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
N KHI F S+ ++ D D + + D S + + I D RVSSCPYPS T
Sbjct: 418 N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 471
Query: 431 EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
EE++RLGLK E++ + + + G +++KRK E +G +S+ +
Sbjct: 472 EEMERLGLKSEMSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 521
Query: 491 ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
P + + S + FITTWKEAC+ + + +VLE + +Y T
Sbjct: 522 --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAET--- 576
Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
+ K++ FS YP IG LN+AV ++ G+ DS+YD + + + ++ + + E
Sbjct: 577 --PKEKKKIIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTEL 634
Query: 611 ESIEIEPSEQVAV----------VNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPV 660
+E PS++ + V +K + ++V+++++++ E + + S V
Sbjct: 635 MEVEPPPSKRKSKCVAKGDNDTNVGSKDPGCSVTADDVIRRITEFFESNRGVSRTDASQV 694
Query: 661 TKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDR 720
K LR L CET +T +F +F +LG+G F FL KH L +L +
Sbjct: 695 RK-STFLRTLLDCETCITAKFSANQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTH 753
Query: 721 SSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLE 780
SS+EVS++Q + VL+ QA N E ++ S+LL+RQFP ISF I S E L+
Sbjct: 754 SSVEVSVLQQQIEVLLCQAGGNWLEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLD 813
Query: 781 TVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSK 840
++ + R + ++FS ++L R SG + + +T + + ++ S +V+S+
Sbjct: 814 SIERQRKNIEINNIMFSMSLLEKRWSGIVPGNH-----DTVDGLMND-IQQSCSVTVSSQ 867
Query: 841 DAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDH 900
+AI+ LL+AP LSDL +WSHWD LFAPSLG WLLN V++L C+VT DG+ IR+D
Sbjct: 868 EAIKCLLKAPMLSDLLTWSHWDLLFAPSLGSFMHWLLNTGPVQDLACIVTTDGRFIRVDP 927
Query: 901 SASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDI 960
SA+VD FLE ++ S FQ AVKLLS + G N P+ LLKC+A+ A ++ N + +
Sbjct: 928 SATVDQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLLKCYAQRAIGIIMDNGNDLL 987
Query: 961 EVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
N + G R N+ ++ K + V + ++F LDCLG+LPS
Sbjct: 988 ------NTKSEGKSFSAR---------NIWSDMSKDI---DDIVHLVAKFVLDCLGHLPS 1029
Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
EFRS AADVLL+GL++ K+ SAILLE +T MLH++GLSLG+ EW D C T
Sbjct: 1030 EFRSLAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDIGLSLGVAEWAEDCRRLCLT 1089
Query: 1081 GTSDLLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD---------------VHIEE 1124
D + + ++ SG C +L + D V + +
Sbjct: 1090 ---DEIHANIEMHASSRHPSTASGVAICENSNLLNATDVDIMKRSKSLPGKDNQIVAVSK 1146
Query: 1125 CGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNML 1184
+ + + + + TL E E+A L++E+IRR+EFGL ++S E+++L
Sbjct: 1147 NQNVLNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETIRREEFGLDQSLSCTENSLL 1205
Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN 1244
KKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+VEPTL FILQ++GIVVLNN
Sbjct: 1206 KKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVEDVEPTLAFILQDNGIVVLNN 1265
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
E GFSAENIRALCD+GNSTKKGS+ GYIG KGIGFKSVFRVTDAPEIHSNGFHVKFD +E
Sbjct: 1266 ESGFSAENIRALCDIGNSTKKGSNQGYIGNKGIGFKSVFRVTDAPEIHSNGFHVKFDITE 1325
Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMF 1364
GQIGFVLPT V P+N D +Q + CI + FR+
Sbjct: 1326 GQIGFVLPTAVAPYNTDSI-------KIQGGYRHALHCIHV-FRSP-------------- 1363
Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRA 1424
P FLHRL+CI F+NM+ND+L+V+R+K +G+GI+++S G D M+W V S+KL+
Sbjct: 1364 ----PISAPFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVRISNGNDMMSWLVVSKKLQG 1419
Query: 1425 GVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSR 1484
++R DV +TEIA+A TLQE+ G Y P L QQPVFAFLPLR YGLKFILQGDFVLPSSR
Sbjct: 1420 TIVRNDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPLRNYGLKFILQGDFVLPSSR 1479
Query: 1485 EEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFS 1544
EE C P KA + +MSFVPLVGEVHGFF
Sbjct: 1480 EE-----------------------------GC----PGKAVTTFMSFVPLVGEVHGFFC 1506
Query: 1545 GLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKD 1604
LP +ILSKLR++ C++LEG++++W PC LRGW+++ L+ D LL +HLGLG+LSKD
Sbjct: 1507 QLPHLILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRILISDSLLLEHLGLGYLSKD 1566
Query: 1605 IVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELY-TISFHSSG 1663
I++SD+L+RALGI EYGPK+L+ IISS+CR + + S+GL WL +W L+ ++ HSS
Sbjct: 1567 IIISDTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLEWLCAWFISLHLSLMHHSSK 1626
Query: 1664 QSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDC--SVFDGGFGLEAFPNL 1721
L + E DL+ L++IP IPLSDG+FSS+ +G IWL D S D + FP L
Sbjct: 1627 NLPLTTSPE-DLLCALRKIPCIPLSDGSFSSIADGPIWLPYDVLNSKPDSRSSMLNFPVL 1685
Query: 1722 CAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDET 1781
+ LRT+ P LLS S +K + ++L +LLK+GV++LS HDI+K HIL ++S+ T
Sbjct: 1686 YSNLRTIKPRLLSVSCQNKYLTEEMRANDLMDILLKMGVRKLSGHDIIKNHILVSLSNST 1745
Query: 1782 TANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHF 1841
AN +M +Y+ F+M+HL+ C +C+ E+E I+SELR++ +LTNHG+K P + PIHF
Sbjct: 1746 EANVANTMMIEYVSFIMLHLQSPCASCNFEKEEIMSELRRRPILLTNHGYKCPYDEPIHF 1805
Query: 1842 GKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESL-SCGLVKWRKFFKEIGITDFVQV 1900
KE+GN V + L+ +++IKW E+D YL + +SL L KWR+FF+E+G+TDFVQV
Sbjct: 1806 SKEYGNSVDLCKLLLNVEIKWIELDSCYLMNRGSDSLPPFELKKWRQFFEEMGVTDFVQV 1865
Query: 1901 VQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHLL 1942
V+V+K+++ + + DWES EL +L
Sbjct: 1866 VKVEKNISQADSSLAGRLSQGHHSGTPCIVYDWESPELFEVL 1907
>gi|242032627|ref|XP_002463708.1| hypothetical protein SORBIDRAFT_01g004640 [Sorghum bicolor]
gi|241917562|gb|EER90706.1| hypothetical protein SORBIDRAFT_01g004640 [Sorghum bicolor]
Length = 2610
Score = 1350 bits (3493), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 709/1591 (44%), Positives = 1006/1591 (63%), Gaps = 93/1591 (5%)
Query: 535 EVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAI 594
+VLE + +Y T +K K + S YP IGLLN+AV S+ G+ DS+YD +Q
Sbjct: 435 KVLEMLANYYGRTPTEK-----KGIINFCSQYPGIGLLNVAVKSMGCGLLDSIYDVIQLA 489
Query: 595 TQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILN 654
++ ++++ N+ E +EIEP + +++V++++++Y+E ++ +
Sbjct: 490 SENNVSSSPLPNTTT-EVMEIEPPS-------------VAIDDVIRRITEYIESNSKVSG 535
Query: 655 NAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFT 714
+ + + +R L CETW+T QF +F +LG+G F FL+KH T L
Sbjct: 536 D-------VTMQVRALNDCETWVTTQFSANQFSALGHGTFLEFLDKHCHQFRTALSSFLK 588
Query: 715 DDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGS 774
+ T + SSLEVS++Q + L+ QA +N E ++ LL+RQFP ISF I S
Sbjct: 589 EGTCNPSSLEVSVLQQQIEFLLYQAESNWLEDGGFSEDSFVMLLKRQFPTISFNIVQDKS 648
Query: 775 MESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMS 834
E + + ++ + + FS ++L R SG L N + E ++ +
Sbjct: 649 GEGVPGFIEGQKKDIQTNSLKFSISLLEKRWSGTLPSRHGN-VDELGKSVAE---QSYYC 704
Query: 835 ESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGK 894
+V S++AI LLRAP LSDL WSHWD LFAP+LG WL+N ++EL C+ T DG+
Sbjct: 705 GTVCSREAINCLLRAPMLSDLLLWSHWDSLFAPTLGSFVHWLMNTGPIQELACIATTDGR 764
Query: 895 VIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFK 954
IR+D SA+VD FLEA ++ S FQ AVKLLS + G + P+ LLKC+A+ A +++
Sbjct: 765 FIRVDSSATVDQFLEAIIQRSPFQVAVKLLSLLYVYNGSTSTPMSLLKCYAQRAIKLIVD 824
Query: 955 NNVEDIEVINS---------QNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVP 1005
NN + + + QN + Q + + + + + L F V
Sbjct: 825 NNNDLMNAKSENKIFMPDEPQNLISESSTCFADQCQESSQVSSARLIRSESLPNFDNTVH 884
Query: 1006 VASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSL 1065
+ ++F LDCLG+LP EFRS AAD+LL+G + K+ + +L E + MLH++GLSL
Sbjct: 885 LIAKFVLDCLGHLPLEFRSLAADILLAGFRVVTKNWHAVMLHEATENGQLCMLHDIGLSL 944
Query: 1066 GILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFES----VGADVH 1121
G++EW+ D C T + T +S +EG+ E+ + +DV+
Sbjct: 945 GVVEWVEDCRRLCLTEE-----------VHVQTEMHSSAKLASEGATHENSNGHISSDVN 993
Query: 1122 I-----EECGAICDTI--------------CGEASDDGLGDCTTQTLPEDKECEDAALIV 1162
+ + I D + G +D + ++ E+ E+A+L++
Sbjct: 994 MMGERRQLFPGINDRVGMDNEDNNNKMLNPAGTEADIAELHTSRSSMMEETNLEEASLVI 1053
Query: 1163 ESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY 1222
E+IRR+EFGL +S+ E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y
Sbjct: 1054 ETIRREEFGLDQALSDTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTY 1113
Query: 1223 PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV 1282
P++VEPTL F+LQE+GI VLNNE+GFSAENIRALCD+GNSTKKGS+ GYIG KGIGFKSV
Sbjct: 1114 PKDVEPTLAFVLQENGIAVLNNERGFSAENIRALCDIGNSTKKGSNQGYIGNKGIGFKSV 1173
Query: 1283 FRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTC 1342
FRVTDAPEIHSNGFHVKFD ++GQIGFVLPT VPP++ F R+L+ + + WNTC
Sbjct: 1174 FRVTDAPEIHSNGFHVKFDITDGQIGFVLPTAVPPYSTTSFSRMLAVEDDKDAHPLWNTC 1233
Query: 1343 IRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG 1402
I LPFR+KF EG +DLHPSLLLFLHRL+CI F+N+ +D+L+++R++++GDG
Sbjct: 1234 ILLPFRSKFREG----------TDLHPSLLLFLHRLKCIKFKNLFDDTLLIMRREVLGDG 1283
Query: 1403 IIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
I+++S G + M+W V S++L+ ++R DV TTEIA+A TLQ++ +G+Y P L QQPVFAF
Sbjct: 1284 IVRISHGIETMSWLVVSKRLQGTIVRHDVCTTEIAVAFTLQQTEKGDYEPYLKQQPVFAF 1343
Query: 1463 LPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENP 1522
LPLR YGLKFI+QGDFVLPSSREEVD +S WNQWLLSEFP+LFV A+ SFC LPCF P
Sbjct: 1344 LPLRNYGLKFIIQGDFVLPSSREEVDADSAWNQWLLSEFPSLFVCAQESFCALPCFERCP 1403
Query: 1523 AKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDR 1582
KA + ++SF+PL GEVHGFFS LP +ILSKLR++ C+ L+G+ QW PC LRGW+++
Sbjct: 1404 GKAVTAFLSFIPLAGEVHGFFSQLPHLILSKLRVTCCMFLDGSTVQWVYPCNTLRGWDEQ 1463
Query: 1583 AHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSM 1642
LL + LL +HLGLG+LSKDI++SD+L+RALGI +YGP ILL ISS+CR + + S+
Sbjct: 1464 TKMLLSEGLLHEHLGLGYLSKDIIISDNLSRALGIHDYGPNILLDTISSICRIDGCIESL 1523
Query: 1643 GLSWLASWLNELY-TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIW 1701
GL WL +W LY T+ HS SS +S +E L+D +++IP IPLSDG+FSSV +G IW
Sbjct: 1524 GLEWLCAWFVNLYLTLLSHSRNVSSARS-LEDALLDKVRKIPCIPLSDGSFSSVSDGPIW 1582
Query: 1702 LHSDC--SVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIG 1759
L D S+ + +E FP L LRTVSP LLSA +K + + +D+L ML KIG
Sbjct: 1583 LPYDVVSSIPECRSSIENFPVLYGNLRTVSPNLLSACCKNKYLMEEVRIDDLADMLQKIG 1642
Query: 1760 VQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSEL 1819
V++LS HDI+K HI+ ++ + AN ++ +Y+ F+M+HL+ C +C+ E+E IVSEL
Sbjct: 1643 VRKLSGHDIIKNHIMVSLRNGLDANVTDRMIREYVSFIMVHLQSSCTSCNFEKEGIVSEL 1702
Query: 1820 RKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLS 1879
RK LTNHG+K P++ PIHF K++GN V V+ L+ +++I W E+D +YL H ES S
Sbjct: 1703 RKSPIFLTNHGYKSPADEPIHFSKDYGNSVDVSRLLQNVEISWIELDSSYLIHHGSESSS 1762
Query: 1880 CGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELV 1939
KWR+FF+E+G+TDFVQVV+V+K ++ + + ++ + DWES EL
Sbjct: 1763 FEREKWRRFFEEMGVTDFVQVVKVEKSLSQVDSLLAGGLSLADVSAKPCTVYDWESPELS 1822
Query: 1940 HLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDI 1999
+LS+ ++ R+ +LLE+LD WDD Y+ K M + G++R+ +SSF+ CI
Sbjct: 1823 RILSIFSSKRCRENCVYLLEVLDRFWDDYYSPKSMILTDATHCGENRAVESSFMKCIQSF 1882
Query: 2000 QWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDIL 2059
+W S MD++LHYP DLF+D + VRS+ G APY VP+V S L IG KTEV+ D L
Sbjct: 1883 KWIASRMDEDLHYPTDLFYDSENVRSLFGSVAPYAVPQVSSSSLKKAIGFKTEVSYCDAL 1942
Query: 2060 EILKVWTRLEAPF------MARYYLLVNHDV 2084
+LK W + PF M ++Y ++ V
Sbjct: 1943 MVLKYWITSKVPFRASMSQMCKFYTFLSEGV 1973
Score = 241 bits (616), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 144/352 (40%), Positives = 213/352 (60%), Gaps = 39/352 (11%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
L+R A KAR EL+ GE V++WKV+++VL+ L+ DSW SLG Q Q+VP L L + E
Sbjct: 103 LDRAQEAARKAREELVKGGEGVTAWKVAQAVLVALKADSWDSLGVQPQDVPLLRDLFLIE 162
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++NAFIHC+V R+I ++YDLE ICK+EG+ +FEEL LGPFL+ PL++HYFLV D+
Sbjct: 163 GKVNAFIHCYVAARKIVTVYDLEVEICKNEGVVQFEELGLGPFLQHPLVAHYFLVPDDLS 222
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVR-----IQNL- 311
VV K+++ +VI L ++ D K K I + +FL+ +A+K+S++ KE+LGVR NL
Sbjct: 223 VVPKLSSEEVINVLLKFVDKSKKK-ITIVDFLNHLAEKKSVSGKEKLGVRKCHSTSTNLR 281
Query: 312 ---GMHINFIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAIC 364
+HI+ I++A++++ K + R++ + + K+ LD+RF ++
Sbjct: 282 RFCRLHISLIQQAKQTETSAAKLLGNMSGSGNSSREKDLLKNASFRTHKKALDKRFISLT 341
Query: 365 ERVKSFSSINEDFGAKHIRFVS--------SSSEDEDSDDCTYECSNDISSNVQLPSQIK 416
R+K IN KHI F S +SSED SDD N+ + + +
Sbjct: 342 NRIKQLPGIN-----KHIHFDSTDDETNGDTSSEDGKSDD------NENKNEYSVLDKKD 390
Query: 417 GSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKI 468
G RV+SCPYPS TEEL+RLGLK EIN + S NS ++ S+KK K+
Sbjct: 391 GDKRVNSCPYPSKTEELERLGLKSEINKR-PSLENSKPKE-----SRKKGKV 436
>gi|357115001|ref|XP_003559282.1| PREDICTED: uncharacterized protein LOC100827209 [Brachypodium
distachyon]
Length = 2540
Score = 1285 bits (3325), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 734/1673 (43%), Positives = 1001/1673 (59%), Gaps = 160/1673 (9%)
Query: 46 FFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLTQNPN----------------- 88
+ PNP L QNP F++ PNL +QN +
Sbjct: 51 YAPNPHQYL---------QNP-FNLMLPNLLLQNQAAFAAAYHHQQQQQQQQQQQQQQQQ 100
Query: 89 -LPPQQPPSSACNQQRSQTQH-PPGASNQQRPLTQQPQSLPPQPQKL-----NKEFLERI 141
PP P S QH PP + QQ P + P +PQ + LER
Sbjct: 101 YQPPALVPPGYTQNPTSNIQHRPPNPAPQQPRAASPPPAPPQRPQTQPPTRNQQAVLERA 160
Query: 142 DRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARIN 201
A KAR EL++AGE V+ WKV+++ LL L+VDSW SLG Q+ +VP L L + E ++N
Sbjct: 161 QVAARKARDELVLAGEGVTGWKVAQAALLALKVDSWGSLGVQLHDVPILRDLFLVEGKVN 220
Query: 202 AFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFK 261
FIHC+V R+I S+ DLE ICKSEG+ +FEEL LGPFL+ PLI+HYFLV SD+ V K
Sbjct: 221 TFIHCYVAARKIVSICDLEVEICKSEGVRQFEELGLGPFLQHPLIAHYFLVPSDLSTVPK 280
Query: 262 IATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREA 321
+++ ++I CL +Y K K + V+ FLD++A+++S++ KE+LG R +N + + +
Sbjct: 281 LSSEEIINCLQKYIGDSKDK-VTVENFLDYLAEQKSVSGKEKLGAR-RNEVAAVKHLGKT 338
Query: 322 RKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKH 381
S D T K L + H K+ L++RF AI R+K INE H
Sbjct: 339 SGSGDSTCGKDLPKQTDFHL----------GKQVLNKRFDAITSRIKQLPGINE-----H 383
Query: 382 IRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVTEELKRLGL 438
I FVSS E D D + + D S N S + D RVSSCPYPS +EE++RLGL
Sbjct: 384 IHFVSSDDEI-DGDTSSEGNAVDESENKNTCSILDSKDVDKRVSSCPYPSKSEEMERLGL 442
Query: 439 KGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSS---APAKFLRRNKAKQRALPIE 495
K E++ + T+ + D S +KRK E +G SS P K + K+ A P
Sbjct: 443 KSELSKKPTTESSMVKVTDKNVYSGEKRKYEGTGTPSSLCKQPKKRRKVGMHKKEASP-- 500
Query: 496 SGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARY 555
+ S + FITTWKEAC+ + + +VLE + +Y T +K
Sbjct: 501 -------------NCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYGETTHEK---- 543
Query: 556 FKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEI 615
+++ FS YP IG LN+AV S+ G+ DS+YD + ++ +L+++ N+ E +EI
Sbjct: 544 -RKIIQFFSQYPGIGFLNVAVRSMGCGLLDSIYDAIHVFSENKLSSSPIPNTTT-EIMEI 601
Query: 616 E-PSEQVAVVNTKHTV----------QDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKII 664
E PS + A K ++ ++V+++++++ E S ++ A + +
Sbjct: 602 EPPSRETAECIAKGAYDSNERNYEPGHNVTTDDVIRRITEHFE-SKSGVSKAGALQVERS 660
Query: 665 ILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLE 724
I LR L CETW+T QF ++F SLG+G F FLEK+ S +L SLE
Sbjct: 661 IFLRTLHDCETWVTTQFSARQFSSLGHGTFLEFLEKYGSHFPPKLSSFLKGGDSGSVSLE 720
Query: 725 VSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGK 784
VS+ Q + L+ QA N E + LL++QFP +SF I S E + +
Sbjct: 721 VSVPQQQIEALLCQAEGNWLEDGDFSGDSFLMLLKKQFPTMSFDITQYKSGEGLDGSTER 780
Query: 785 YRNEVMSKCVLFSETILGTRLSGDLSVHEENSL-LETTSAITHTGLRPKMSESVTSKDAI 843
R + + + FS ++L R SG LS E +++ + +A+ + SE+V+S++AI
Sbjct: 781 QRKCIQTNNISFSISLLEKRWSG-LSPGEHDTVGGQRNNAVEQS----YYSETVSSREAI 835
Query: 844 EILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSAS 903
LL+AP LSDL WSHWD LFAPSLG WLL+ +++L C+VT D + IR+D SA+
Sbjct: 836 NCLLKAPMLSDLLLWSHWDLLFAPSLGSFIRWLLSTGPIQQLACIVTTDCRFIRVDPSAT 895
Query: 904 VDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVI 963
VD FLEA ++ S FQ AVKLLS + G +N P+ LLKC+A+ A ++ NN +
Sbjct: 896 VDQFLEAIIQCSPFQVAVKLLSLIHIYNGSRNTPISLLKCYAQRAIGIIMNNNNDS---- 951
Query: 964 NSQNCRMHGNVLCGRQNFD--VANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSE 1021
G R + +IDN V ++F LDCLG LPSE
Sbjct: 952 -------EGKSFSARNAMSGILKSIDN--------------TVHFVAKFVLDCLGQLPSE 990
Query: 1022 FRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST- 1080
FRS AAD+LLSGL++ K++ SAIL E +T MLH++GLSLGI EW+ DY FC T
Sbjct: 991 FRSLAADILLSGLRTVTKNSYSAILHEATETWQLCMLHDIGLSLGIAEWVEDYRGFCLTE 1050
Query: 1081 ------------------------GTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESV 1116
S++L+P V N ++ L G + V
Sbjct: 1051 EDDAKAELHSSSGHASAASEGPTLENSNVLIPHDVDMMNDSSKSL-------PGKKDQVV 1103
Query: 1117 GADVHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNI 1176
D + + + + +A L + + E E+AAL++E+IRRDEFGL ++
Sbjct: 1104 ATDNKNQN---MLNPVEAKAETTELHTTKSPMMGE-MNLEEAALVIETIRRDEFGLDQSL 1159
Query: 1177 SNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQE 1236
S E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+VEPTL F+LQE
Sbjct: 1160 SCTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYHEDVEPTLVFVLQE 1219
Query: 1237 SGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
+GIV+LNNE GFSAENIRALCD+GNSTKKG+++GYIG KGIGFKSVFRVTDAPEIHSNGF
Sbjct: 1220 NGIVILNNEMGFSAENIRALCDIGNSTKKGANSGYIGNKGIGFKSVFRVTDAPEIHSNGF 1279
Query: 1297 HVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA 1356
HVKFD ++GQIGFVLPT VPP++ R+LS + + WNTCI LPFR+KF +G
Sbjct: 1280 HVKFDITKGQIGFVLPTAVPPYSASSLSRMLSAEDDKGACSLWNTCILLPFRSKFRDGTG 1339
Query: 1357 MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWF 1416
M +I MFSDLHPSLLLFLHRL CI F+N+LND+L+++R+K +GDGI+++S G + M+W
Sbjct: 1340 MCSIASMFSDLHPSLLLFLHRLNCIKFKNVLNDTLLIMRRKALGDGIVRISHGNEIMSWL 1399
Query: 1417 VASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
V S+KL+ ++R DV+TTEIALA TLQE+ G Y P L QQPVFAFLPLR YGLKFILQG
Sbjct: 1400 VLSKKLQGTLVRHDVRTTEIALAFTLQETENGEYEPYLKQQPVFAFLPLRNYGLKFILQG 1459
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV 1536
DFVLPSSREEVD ++ WNQWLLSEFP+LFVSA+ SFC L CF+ P KA + +MSFVPL
Sbjct: 1460 DFVLPSSREEVDADNAWNQWLLSEFPSLFVSAQESFCSLSCFQRCPGKAVTAFMSFVPLA 1519
Query: 1537 GEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHL 1596
GEVHGFF LP +I+SKLR++ C++LEG+++QW PC VL+GW+++ L D LL HL
Sbjct: 1520 GEVHGFFCKLPHLIISKLRLTRCMVLEGSSSQWVYPCNVLKGWDEQTRMLFSDGLLHDHL 1579
Query: 1597 GLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLAS 1649
GLG+LSKDI++SD+L+RALGI YGP +L+ ++SS+CRT+ + S+G+ WL +
Sbjct: 1580 GLGYLSKDIIISDTLSRALGIHVYGPNVLIDVLSSICRTDGCIESLGMEWLCA 1632
Score = 167 bits (422), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 122/206 (59%), Gaps = 6/206 (2%)
Query: 1882 LVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHL 1941
L KWR+FF E+G+TDFVQVV+V+K++ + + ++ S DWES EL +
Sbjct: 1669 LKKWREFFAEMGVTDFVQVVKVEKNIPQVDSLIAGKLSQGDISGTPSTVYDWESPELTSI 1728
Query: 1942 LSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQW 2001
LS ++ R+ +LLE+LD+ WDD Y+ K G++R+ +SSF+ CI +W
Sbjct: 1729 LSTFSSRKCRENCIYLLEVLDSFWDDYYSAKAWCLTNVTHCGENRTVESSFMKCIQSFKW 1788
Query: 2002 TISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEI 2061
SS+D +LHY DLF+D + VRS+LG APY VP+V S L DIG KT V+ D L +
Sbjct: 1789 IASSVDYDLHYATDLFYDFENVRSLLGSVAPYAVPQVSSRSLRKDIGFKTNVSHSDALMV 1848
Query: 2062 LKVWTRLEAPF------MARYYLLVN 2081
LK+W + PF M ++Y V+
Sbjct: 1849 LKLWIASQVPFNASVHQMCKFYTFVS 1874
>gi|29150358|gb|AAO72367.1| unknow protein [Oryza sativa Japonica Group]
Length = 2592
Score = 1275 bits (3300), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 693/1541 (44%), Positives = 962/1541 (62%), Gaps = 93/1541 (6%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
LER A K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L L + E
Sbjct: 179 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++N FIHC+V R+I S+YDLE ICK+E I +FEEL LGPFL+ PL++HYF V +D+
Sbjct: 239 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
+V K++++++I L ++ D K K I V+ FLD++A+++S++ KE LG R + +
Sbjct: 299 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGARRTEVSA-VKV 356
Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDF 377
S D + +K L + + H K+ LDERFSAI R+K IN
Sbjct: 357 QGNTSGSGDGSCEKDLVKNRKFHL----------SKQALDERFSAITSRIKKLPGIN--- 403
Query: 378 GAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVTEELK 434
KHI F S+ ++ D D + + D S + + I D RVSSCPYPS TEE++
Sbjct: 404 --KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKTEEME 460
Query: 435 RLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPI 494
RLGLK E + + + + G +++KRK E +G +S+ + P
Sbjct: 461 RLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR------------PK 508
Query: 495 ESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAAR 554
+ + S + FITTWKEAC+ + + +VLE + +Y T +K
Sbjct: 509 KKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAETPKEKKK- 567
Query: 555 YFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIE 614
+ FS YP IG LN+AV ++ G+ DS+YD + + + ++ + + E +E
Sbjct: 568 ----IIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTELMEVE 623
Query: 615 IEPSEQVAV----------VNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKII 664
PS++ + V +K + ++V+++++++ E + + S V K
Sbjct: 624 PPPSKRKSKCVAKGDNDTNVGSKDPGCSVTADDVIRRITEFFESNRGVSRTDASQVRKST 683
Query: 665 ILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLE 724
LR L CET +T +F +F +LG+G F FL KH L +L + SS+E
Sbjct: 684 -FLRTLLDCETCITAKFSANQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTHSSVE 742
Query: 725 VSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGK 784
VS++Q + VL+ QA N E ++ S+LL+RQFP ISF I S E L+++ +
Sbjct: 743 VSVLQQQIEVLLCQAGGNWLEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLDSIER 802
Query: 785 YRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIE 844
R + ++FS ++L R SG + + + T + + ++ S +V+S++AI+
Sbjct: 803 QRKNIEINNIMFSMSLLEKRWSGIVPGNHD-----TVDGLMN-DIQQSCSVTVSSQEAIK 856
Query: 845 ILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASV 904
LL+AP LSDL +WSHWD LFAPSLG WLLN V++L C+VT DG+ IR+D SA+V
Sbjct: 857 CLLKAPMLSDLLTWSHWDLLFAPSLGSFMHWLLNTGPVQDLACIVTTDGRFIRVDPSATV 916
Query: 905 DSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVIN 964
D FLE ++ S FQ AVKLLS + G N P+ LLKC+A+ A ++ N + +
Sbjct: 917 DQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLLKCYAQRAIGIIMDNGNDLL---- 972
Query: 965 SQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRS 1024
N + G R N+ ++ K + V + ++F LDCLG+LPSEFRS
Sbjct: 973 --NTKSEGKSFSAR---------NIWSDMSKDI---DDIVHLVAKFVLDCLGHLPSEFRS 1018
Query: 1025 FAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSD 1084
AADVLL+GL++ K+ SAILLE +T MLH++GLSLG+ EW D C T D
Sbjct: 1019 LAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDIGLSLGVAEWAEDCRRLCLT---D 1075
Query: 1085 LLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD---------------VHIEECGAI 1128
+ + ++ SG C +L + D V + + +
Sbjct: 1076 EIHANIEMHASSRHPSTASGVAICENSNLLNATDVDIMKRSKSLPGKDNQIVAVSKNQNV 1135
Query: 1129 CDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQH 1188
+ + + + TL E E+A L++E+IRR+EFGL ++S E+++LKKQH
Sbjct: 1136 LNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETIRREEFGLDQSLSCTENSLLKKQH 1194
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGF 1248
ARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+VEPTL FILQ++GIVVLNNE GF
Sbjct: 1195 ARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVEDVEPTLAFILQDNGIVVLNNESGF 1254
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
SAENIRALCD+G+STKKGS+ GYIG KGIGFKSVFRVTDAPEIHSNGFHVKFD +EGQIG
Sbjct: 1255 SAENIRALCDIGSSTKKGSNQGYIGNKGIGFKSVFRVTDAPEIHSNGFHVKFDITEGQIG 1314
Query: 1309 FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLH 1368
FVLPT V P+N D R+LS + V+ S WNTCI LPFR+KF EG M++I MFSDLH
Sbjct: 1315 FVLPTAVAPYNTDSVSRMLSVEDVKDSSSFWNTCIVLPFRSKFKEGTGMHSIASMFSDLH 1374
Query: 1369 PSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIR 1428
PSLLLFLHRL+CI F+NM+ND+L+V+R+K +G+GI+++S G D M+W V S+KL+ ++R
Sbjct: 1375 PSLLLFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVRISNGNDMMSWLVVSKKLQGTIVR 1434
Query: 1429 PDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVD 1488
DV +TEIA+A TLQE+ G Y P L QQPVFAFLPLR YGLKFILQGDFVLPSSREEVD
Sbjct: 1435 NDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPLRNYGLKFILQGDFVLPSSREEVD 1494
Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPR 1548
++ WNQWLLSEFP+LFVSA+ SFC LPCF+ P KA + +MSFVPLVGEVHGFF LP
Sbjct: 1495 ADNAWNQWLLSEFPSLFVSAQESFCALPCFQGCPGKAVTTFMSFVPLVGEVHGFFCQLPH 1554
Query: 1549 MILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLS 1608
+ILSKLR++ C++LEG++++W PC LRGW+++ L+ D LL +HLGLG+LSKDI++S
Sbjct: 1555 LILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRILISDSLLLEHLGLGYLSKDIIIS 1614
Query: 1609 DSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLAS 1649
D+L+RALGI EYGPK+L+ IISS+CR + + S+GL WL +
Sbjct: 1615 DTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLEWLCA 1655
Score = 129 bits (325), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)
Query: 1882 LVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHL 1941
L KWR+FF+E+G+TDFVQVV+V+K+++ + + DWES ELV
Sbjct: 1691 LKKWRQFFEEMGVTDFVQVVKVEKNISQADSSLAGRLSQGHHSGTPCIVYDWESPELV-- 1748
Query: 1942 LSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQW 2001
LD WD Y+ K + +G++ + +SSF+N I +W
Sbjct: 1749 -------------------LDKFWDAHYSAKARIHADATHSGENIAVESSFMNSIRTFKW 1789
Query: 2002 TISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEI 2061
S+MD++LHY DLF++ + VRSILG APY VP+V S L DIG K +V+ D L I
Sbjct: 1790 IASAMDEDLHYATDLFYNTEDVRSILGSVAPYAVPQVCSRSLGKDIGFKIKVSHSDALMI 1849
Query: 2062 LKVWTRLEAPF------MARYYLLVN 2081
LK W + F M ++Y V+
Sbjct: 1850 LKSWIASQTSFSASMDQMCKFYTFVS 1875
>gi|359477903|ref|XP_002270714.2| PREDICTED: uncharacterized protein LOC100258878 [Vitis vinifera]
Length = 1578
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 549/847 (64%), Positives = 658/847 (77%), Gaps = 50/847 (5%)
Query: 1284 RVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCI 1343
+VTDAPEIHSNGFHVKFD SEGQIGFVLPT++PP N+D+F RL S D Q ++ WNTCI
Sbjct: 50 KVTDAPEIHSNGFHVKFDISEGQIGFVLPTVIPPCNVDLFRRLASSDTDQEDTDSWNTCI 109
Query: 1344 RLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGI 1403
LPFR K S+G M+NI+ MFSDLHPSLLLFLH L+CI F+NMLNDSL+++RK+IVGDGI
Sbjct: 110 VLPFRMKLSKGTGMSNIISMFSDLHPSLLLFLHHLRCIKFKNMLNDSLIIMRKEIVGDGI 169
Query: 1404 IKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFL 1463
IKVS G +KMTWFV SQKLRA VIRPDV+TTEIA+A TLQES+ G Y P QQPVFAFL
Sbjct: 170 IKVSHGREKMTWFVISQKLRADVIRPDVQTTEIAIAFTLQESDNGEYSPHFEQQPVFAFL 229
Query: 1464 PLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPA 1523
PLRTYGLKFILQGDFVLPSSREEVDG+SPWNQWLLSEFP LFV+AERSFC LPCFRENP
Sbjct: 230 PLRTYGLKFILQGDFVLPSSREEVDGDSPWNQWLLSEFPGLFVTAERSFCALPCFRENPG 289
Query: 1524 KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRA 1583
KA + YMSFVPLVGEVHGFFS LPRMI+SKLRMSNCL+LEG+NN+W PPCKVLR WN++A
Sbjct: 290 KAVAAYMSFVPLVGEVHGFFSSLPRMIISKLRMSNCLLLEGDNNEWVPPCKVLRSWNEQA 349
Query: 1584 HSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMG 1643
SLLPD LL KHLGLGFL K+I LSD LARALGI+EYGPKILLQIISSLC TE+GL+SMG
Sbjct: 350 RSLLPDSLLCKHLGLGFLDKNIHLSDPLARALGIQEYGPKILLQIISSLCHTEDGLKSMG 409
Query: 1644 LSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLH 1703
L+WL+SWLN LYT+ H SGQSSL S +E+DLI +L++IPFIPLSDG + S+DEGTIWLH
Sbjct: 410 LAWLSSWLNALYTMPLHYSGQSSLNSNMESDLIYDLKKIPFIPLSDGNYGSLDEGTIWLH 469
Query: 1704 SD--CSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQ 1761
SD + DG GL AFPNL AKLR V+PALLSA++VD + + +N+ RMLL+IGVQ
Sbjct: 470 SDSLSTELDGEHGLGAFPNLYAKLRIVNPALLSAASVDIPCMDMTLAENVTRMLLRIGVQ 529
Query: 1762 QLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRK 1821
QLSAH+IV+VHILPA+SDE N +KNLM +YL FVM+HL+ C NC VERE+I+SE+
Sbjct: 530 QLSAHEIVQVHILPAMSDEGITNREKNLMIEYLSFVMVHLQSSCTNCRVEREYIISEICN 589
Query: 1822 KAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCG 1881
KAF+LTNHG+KRP E+PIHF KEFGN + VN I+ ++ W+ VDI YLKHP ESLSCG
Sbjct: 590 KAFILTNHGYKRPVEVPIHFSKEFGNTIDVNRFINATNMTWHVVDIAYLKHPITESLSCG 649
Query: 1882 LVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKN-MWTKELLSPGSAAIDWESNELVH 1940
L+KWR FF+ +G+TDFVQ+VQV+K+V+DISH KN MW ++L+S G+ A DWES ELV
Sbjct: 650 LMKWRGFFQALGVTDFVQIVQVEKNVSDISHMILKNEMWDRDLISHGTIAKDWESPELVQ 709
Query: 1941 LLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQ 2000
LLS+L+ +++ K+LL++LDTLWDDC++DK+ G+ K +GD + F+SS + ICD Q
Sbjct: 710 LLSILSKTGDQESCKNLLDVLDTLWDDCFSDKVSGYCNFKSSGDRKPFKSSLMTSICDFQ 769
Query: 2001 WTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPK----------------------- 2037
W SSMDDELHYPKDLF+D D V +LG SAPY +PK
Sbjct: 770 WIASSMDDELHYPKDLFYDSDEVHLVLGSSAPYALPKGTDIVLLVKNNKMKLHLEQGAVV 829
Query: 2038 ------------------VKSEKLVCDIGLKTEVTIDDILEILKVWTRLEAPF------M 2073
V+S KL CDIG KT+VT+DDIL IL+ W R E PF M
Sbjct: 830 GAFTLFLVQRISDISAEMVRSGKLACDIGFKTKVTLDDILGILQEWRRSETPFKASIAQM 889
Query: 2074 ARYYLLV 2080
+++Y +
Sbjct: 890 SKFYTFI 896
>gi|108711621|gb|ABF99416.1| expressed protein [Oryza sativa Japonica Group]
Length = 2655
Score = 956 bits (2472), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/935 (52%), Positives = 643/935 (68%), Gaps = 42/935 (4%)
Query: 882 VKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLL 941
V++L C+VT DG+ IR+D SA+VD FLE ++ S FQ AVKLLS + G N P+ LL
Sbjct: 822 VQDLACIVTTDGRFIRVDPSATVDQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLL 881
Query: 942 KCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFG 1001
KC+A+ A ++ N + + N + G R N+ ++ K +
Sbjct: 882 KCYAQRAIGIIMDNGNDLL------NTKSEGKSFSAR---------NIWSDMSKDI---D 923
Query: 1002 KAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEV 1061
V + ++F LDCLG+LPSEFRS AADVLL+GL++ K+ SAILLE +T MLH++
Sbjct: 924 DIVHLVAKFVLDCLGHLPSEFRSLAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDI 983
Query: 1062 GLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD- 1119
GLSLG+ EW D C T D + + ++ SG C +L + D
Sbjct: 984 GLSLGVAEWAEDCRRLCLT---DEIHANIEMHASSRHPSTASGVAICENSNLLNATDVDI 1040
Query: 1120 --------------VHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESI 1165
V + + + + + + + TL E E+A L++E+I
Sbjct: 1041 MKRSKSLPGKDNQIVAVSKNQNVLNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETI 1099
Query: 1166 RRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN 1225
RR+EFGL ++S E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+
Sbjct: 1100 RREEFGLDQSLSCTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVED 1159
Query: 1226 VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRV 1285
VEPTL FILQ++GIVVLNNE GFSAENIRALCD+G+STKKGS+ GYIG KGIGFKSVFRV
Sbjct: 1160 VEPTLAFILQDNGIVVLNNESGFSAENIRALCDIGSSTKKGSNQGYIGNKGIGFKSVFRV 1219
Query: 1286 TDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRL 1345
TDAPEIHSNGFHVKFD +EGQIGFVLPT V P+N D R+LS + V+ S WNTCI L
Sbjct: 1220 TDAPEIHSNGFHVKFDITEGQIGFVLPTAVAPYNTDSVSRMLSVEDVKDSSSFWNTCIVL 1279
Query: 1346 PFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIK 1405
PFR+KF EG M++I MFSDLHPSLLLFLHRL+CI F+NM+ND+L+V+R+K +G+GI++
Sbjct: 1280 PFRSKFKEGTGMHSIASMFSDLHPSLLLFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVR 1339
Query: 1406 VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
+S G D M+W V S+KL+ ++R DV +TEIA+A TLQE+ G Y P L QQPVFAFLPL
Sbjct: 1340 ISNGNDMMSWLVVSKKLQGTIVRNDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPL 1399
Query: 1466 RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKA 1525
R YGLKFILQGDFVLPSSREEVD ++ WNQWLLSEFP+LFVSA+ SFC LPCF+ P KA
Sbjct: 1400 RNYGLKFILQGDFVLPSSREEVDADNAWNQWLLSEFPSLFVSAQESFCALPCFQGCPGKA 1459
Query: 1526 ASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHS 1585
+ +MSFVPLVGEVHGFF LP +ILSKLR++ C++LEG++++W PC LRGW+++
Sbjct: 1460 VTTFMSFVPLVGEVHGFFCQLPHLILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRI 1519
Query: 1586 LLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLS 1645
L+ D LL +HLGLG+LSKDI++SD+L+RALGI EYGPK+L+ IISS+CR + + S+GL
Sbjct: 1520 LISDSLLLEHLGLGYLSKDIIISDTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLE 1579
Query: 1646 WLASWLNELY-TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHS 1704
WL +W L+ ++ HSS L + E DL+ L++IP IPLSDG+FSS+ +G IWL
Sbjct: 1580 WLCAWFISLHLSLMHHSSKNLPLTTSPE-DLLCALRKIPCIPLSDGSFSSIADGPIWLPY 1638
Query: 1705 DC--SVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQ 1762
D S D + FP L + LRT+ P LLS S +K + ++L +LLK+GV++
Sbjct: 1639 DVLNSKPDSRSSMLNFPVLYSNLRTIKPRLLSVSCQNKYLTEEMRANDLMDILLKMGVRK 1698
Query: 1763 LSAHDIVKVHILPAISDETTANGDKNLMADYLCFV 1797
LS HDI+K HIL ++S+ T AN +M +Y+ V
Sbjct: 1699 LSGHDIIKNHILVSLSNSTEANVANTMMIEYVVKV 1733
Score = 395 bits (1014), Expect = e-106, Method: Compositional matrix adjust.
Identities = 238/677 (35%), Positives = 373/677 (55%), Gaps = 46/677 (6%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
LER A K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L L + E
Sbjct: 179 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++N FIHC+V R+I S+YDLE ICK+E I +FEEL LGPFL+ PL++HYF V +D+
Sbjct: 239 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
+V K++++++I L ++ D K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 299 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 357
Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
+R+AR+++ +K ++ + + K K+ LDERFSAI R+K I
Sbjct: 358 LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 417
Query: 374 NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
N KHI F S+ ++ D D + + D S + + I D RVSSCPYPS T
Sbjct: 418 N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 471
Query: 431 EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
EE++RLGLK E + + + + G +++KRK E +G +S+ +
Sbjct: 472 EEMERLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 521
Query: 491 ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
P + + S + FITTWKEAC+ + + +VLE + +Y T +
Sbjct: 522 --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAETPKE 579
Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
K + FS YP IG LN+AV ++ G+ DS+YD + + + ++ + + E
Sbjct: 580 KKK-----IIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTEL 634
Query: 611 ESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKL 670
+E PS + ++V+++++++ E + + S V K LR L
Sbjct: 635 MEVEPPPS--------------VTADDVIRRITEFFESNRGVSRTDASQVRK-STFLRTL 679
Query: 671 CSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQH 730
CET +T +F +F +LG+G F FL KH L +L + SS+EVS++Q
Sbjct: 680 LDCETCITAKFSANQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTHSSVEVSVLQQ 739
Query: 731 LLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVM 790
+ VL+ QA N E ++ S+LL+RQFP ISF I S E L+++ + R +
Sbjct: 740 QIEVLLCQAGGNWLEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLDSIERQRKNIE 799
Query: 791 SKCVLFSETILGTRLSG 807
++FS ++L R SG
Sbjct: 800 INNIMFSMSLLEKRWSG 816
Score = 120 bits (301), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 92/156 (58%), Gaps = 6/156 (3%)
Query: 1932 DWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSS 1991
DWES ELV +LS ++ R+ +LLE+LD WD Y+ K + +G++ + +SS
Sbjct: 1762 DWESPELVSILSTFSSKKCRENCVYLLEVLDKFWDAHYSAKARIHADATHSGENIAVESS 1821
Query: 1992 FINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKT 2051
F+N I +W S+MD++LHY DLF++ + VRSILG APY VP+V S L DIG K
Sbjct: 1822 FMNSIRTFKWIASAMDEDLHYATDLFYNTEDVRSILGSVAPYAVPQVCSRSLGKDIGFKI 1881
Query: 2052 EVTIDDILEILKVWTRLEAPF------MARYYLLVN 2081
+V+ D L ILK W + F M ++Y V+
Sbjct: 1882 KVSHSDALMILKSWIASQTSFSASMDQMCKFYTFVS 1917
>gi|108711620|gb|ABF99415.1| expressed protein [Oryza sativa Japonica Group]
Length = 2702
Score = 956 bits (2470), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/935 (52%), Positives = 643/935 (68%), Gaps = 42/935 (4%)
Query: 882 VKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLL 941
V++L C+VT DG+ IR+D SA+VD FLE ++ S FQ AVKLLS + G N P+ LL
Sbjct: 869 VQDLACIVTTDGRFIRVDPSATVDQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLL 928
Query: 942 KCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFG 1001
KC+A+ A ++ N + + N + G R N+ ++ K +
Sbjct: 929 KCYAQRAIGIIMDNGNDLL------NTKSEGKSFSAR---------NIWSDMSKDI---D 970
Query: 1002 KAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEV 1061
V + ++F LDCLG+LPSEFRS AADVLL+GL++ K+ SAILLE +T MLH++
Sbjct: 971 DIVHLVAKFVLDCLGHLPSEFRSLAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDI 1030
Query: 1062 GLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD- 1119
GLSLG+ EW D C T D + + ++ SG C +L + D
Sbjct: 1031 GLSLGVAEWAEDCRRLCLT---DEIHANIEMHASSRHPSTASGVAICENSNLLNATDVDI 1087
Query: 1120 --------------VHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESI 1165
V + + + + + + + TL E E+A L++E+I
Sbjct: 1088 MKRSKSLPGKDNQIVAVSKNQNVLNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETI 1146
Query: 1166 RRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN 1225
RR+EFGL ++S E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+
Sbjct: 1147 RREEFGLDQSLSCTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVED 1206
Query: 1226 VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRV 1285
VEPTL FILQ++GIVVLNNE GFSAENIRALCD+G+STKKGS+ GYIG KGIGFKSVFRV
Sbjct: 1207 VEPTLAFILQDNGIVVLNNESGFSAENIRALCDIGSSTKKGSNQGYIGNKGIGFKSVFRV 1266
Query: 1286 TDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRL 1345
TDAPEIHSNGFHVKFD +EGQIGFVLPT V P+N D R+LS + V+ S WNTCI L
Sbjct: 1267 TDAPEIHSNGFHVKFDITEGQIGFVLPTAVAPYNTDSVSRMLSVEDVKDSSSFWNTCIVL 1326
Query: 1346 PFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIK 1405
PFR+KF EG M++I MFSDLHPSLLLFLHRL+CI F+NM+ND+L+V+R+K +G+GI++
Sbjct: 1327 PFRSKFKEGTGMHSIASMFSDLHPSLLLFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVR 1386
Query: 1406 VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
+S G D M+W V S+KL+ ++R DV +TEIA+A TLQE+ G Y P L QQPVFAFLPL
Sbjct: 1387 ISNGNDMMSWLVVSKKLQGTIVRNDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPL 1446
Query: 1466 RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKA 1525
R YGLKFILQGDFVLPSSREEVD ++ WNQWLLSEFP+LFVSA+ SFC LPCF+ P KA
Sbjct: 1447 RNYGLKFILQGDFVLPSSREEVDADNAWNQWLLSEFPSLFVSAQESFCALPCFQGCPGKA 1506
Query: 1526 ASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHS 1585
+ +MSFVPLVGEVHGFF LP +ILSKLR++ C++LEG++++W PC LRGW+++
Sbjct: 1507 VTTFMSFVPLVGEVHGFFCQLPHLILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRI 1566
Query: 1586 LLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLS 1645
L+ D LL +HLGLG+LSKDI++SD+L+RALGI EYGPK+L+ IISS+CR + + S+GL
Sbjct: 1567 LISDSLLLEHLGLGYLSKDIIISDTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLE 1626
Query: 1646 WLASWLNELY-TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHS 1704
WL +W L+ ++ HSS L + E DL+ L++IP IPLSDG+FSS+ +G IWL
Sbjct: 1627 WLCAWFISLHLSLMHHSSKNLPLTTSPE-DLLCALRKIPCIPLSDGSFSSIADGPIWLPY 1685
Query: 1705 DC--SVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQ 1762
D S D + FP L + LRT+ P LLS S +K + ++L +LLK+GV++
Sbjct: 1686 DVLNSKPDSRSSMLNFPVLYSNLRTIKPRLLSVSCQNKYLTEEMRANDLMDILLKMGVRK 1745
Query: 1763 LSAHDIVKVHILPAISDETTANGDKNLMADYLCFV 1797
LS HDI+K HIL ++S+ T AN +M +Y+ V
Sbjct: 1746 LSGHDIIKNHILVSLSNSTEANVANTMMIEYVVKV 1780
Score = 369 bits (947), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 239/724 (33%), Positives = 373/724 (51%), Gaps = 93/724 (12%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
LER A K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L L + E
Sbjct: 179 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++N FIHC+V R+I S+YDLE ICK+E I +FEEL LGPFL+ PL++HYF V +D+
Sbjct: 239 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
+V K++++++I L ++ D K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 299 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 357
Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
+R+AR+++ +K ++ + + K K+ LDERFSAI R+K I
Sbjct: 358 LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 417
Query: 374 NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
N KHI F S+ ++ D D + + D S + + I D RVSSCPYPS T
Sbjct: 418 N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 471
Query: 431 EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
EE++RLGLK E + + + + G +++KRK E +G +S+ +
Sbjct: 472 EEMERLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 521
Query: 491 ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTM--SEVLEKMFQFYKPTD 548
P + + S + FITTWKEAC+ + + EVLE + +Y T
Sbjct: 522 --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQKEVLELLANYYAETP 579
Query: 549 PKKAARYFKRMKLMFSSYPCIGLLNIA--------------------------------- 575
+K + FS YP IG LN+A
Sbjct: 580 KEKKK-----IIKFFSEYPGIGFLNVALDYFIWAWNGENIIQILAASMTSQILHAVTHVF 634
Query: 576 ------------VTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAV 623
V ++ G+ DS+YD + + + ++ + + E +E PS
Sbjct: 635 SLEIFALPFLVEVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTELMEVEPPPS----- 689
Query: 624 VNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRI 683
+ ++V+++++++ E + + S V K LR L CET +T +F
Sbjct: 690 ---------VTADDVIRRITEFFESNRGVSRTDASQVRK-STFLRTLLDCETCITAKFSA 739
Query: 684 KEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNL 743
+F +LG+G F FL KH L +L + SS+EVS++Q + VL+ QA N
Sbjct: 740 NQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTHSSVEVSVLQQQIEVLLCQAGGNW 799
Query: 744 WESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGT 803
E ++ S+LL+RQFP ISF I S E L+++ + R + ++FS ++L
Sbjct: 800 LEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLDSIERQRKNIEINNIMFSMSLLEK 859
Query: 804 RLSG 807
R SG
Sbjct: 860 RWSG 863
Score = 120 bits (301), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/156 (41%), Positives = 92/156 (58%), Gaps = 6/156 (3%)
Query: 1932 DWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSS 1991
DWES ELV +LS ++ R+ +LLE+LD WD Y+ K + +G++ + +SS
Sbjct: 1809 DWESPELVSILSTFSSKKCRENCVYLLEVLDKFWDAHYSAKARIHADATHSGENIAVESS 1868
Query: 1992 FINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKT 2051
F+N I +W S+MD++LHY DLF++ + VRSILG APY VP+V S L DIG K
Sbjct: 1869 FMNSIRTFKWIASAMDEDLHYATDLFYNTEDVRSILGSVAPYAVPQVCSRSLGKDIGFKI 1928
Query: 2052 EVTIDDILEILKVWTRLEAPF------MARYYLLVN 2081
+V+ D L ILK W + F M ++Y V+
Sbjct: 1929 KVSHSDALMILKSWIASQTSFSASMDQMCKFYTFVS 1964
>gi|298205162|emb|CBI17221.3| unnamed protein product [Vitis vinifera]
Length = 2388
Score = 953 bits (2463), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/797 (62%), Positives = 573/797 (71%), Gaps = 114/797 (14%)
Query: 876 LLNEVNVKE--LLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGE 933
LL +K+ LLCLVT+DGKV+RIDHSA++DSFLEA+L+GSSF+TAV+LLS F+L GG+
Sbjct: 832 LLGTYTIKDSSLLCLVTKDGKVMRIDHSATMDSFLEASLQGSSFRTAVQLLSLFSLFGGK 891
Query: 934 KNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEV 993
++VP LLKCHAR AFEV+ +N+VE++EV
Sbjct: 892 RHVPFSLLKCHARQAFEVILRNSVENMEV------------------------------- 920
Query: 994 QKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTE 1053
+ VPVASRF LDCLGYLPSEFRSFAAD+LLSGLQ + PSAIL EC+Q +
Sbjct: 921 -------NETVPVASRFLLDCLGYLPSEFRSFAADILLSGLQPFTINGPSAILDECNQMD 973
Query: 1054 LRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSG------- 1106
R+MLHEVGLSLG+++WI DYHAF S ++ + C AA+S L G+
Sbjct: 974 QRVMLHEVGLSLGVMQWIDDYHAFSSAAATNSFVSSGALCLQAASSELRRGTKFTQNALA 1033
Query: 1107 ---CAEGSLFESVGADVHIEECGAICDTICGEA-SDDGLGDCTTQTLPEDKECEDAALIV 1162
EG + S GA H EE IC T E S D G PE E +DA L++
Sbjct: 1034 KFPSCEGEMIISDGACGHNEEHSEICQTTGSEGVSVDRSGHGCILYAPELNEHKDATLVI 1093
Query: 1163 ESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELV-----QNA 1217
ESIRRDEFGL P +S+MES+MLKKQHARLGRALHCLSQELYSQDSHFLLELV QNA
Sbjct: 1094 ESIRRDEFGLDPTLSSMESSMLKKQHARLGRALHCLSQELYSQDSHFLLELVMVLLVQNA 1153
Query: 1218 DDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
DDNIYPENVEPTLTFILQ+ GI+VLNNEQGFSA+NIRALCDVGNSTKKGS AGYIG+KGI
Sbjct: 1154 DDNIYPENVEPTLTFILQDRGIIVLNNEQGFSAQNIRALCDVGNSTKKGSKAGYIGQKGI 1213
Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESK 1337
GFKSVFRVTDAPEIHSNGFHVKFD SEGQIGFVLPT++PP N+D+
Sbjct: 1214 GFKSVFRVTDAPEIHSNGFHVKFDISEGQIGFVLPTVIPPCNVDLM-------------- 1259
Query: 1338 CWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKK 1397
K S+G M+NI+ MFSDLHPSLLLFLH L+CI F+NMLNDSL+++RK+
Sbjct: 1260 ------------KLSKGTGMSNIISMFSDLHPSLLLFLHHLRCIKFKNMLNDSLIIMRKE 1307
Query: 1398 IVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQ 1457
IVGDGIIKVS G +KMTWFV SQKLRA VIRPDV+TTEIA+A TLQES+ G Y P QQ
Sbjct: 1308 IVGDGIIKVSHGREKMTWFVISQKLRADVIRPDVQTTEIAIAFTLQESDNGEYSPHFEQQ 1367
Query: 1458 PVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPC 1517
PVFAFLPLRTYGLKFILQGDFVLPSSREEVDG+
Sbjct: 1368 PVFAFLPLRTYGLKFILQGDFVLPSSREEVDGDR-------------------------- 1401
Query: 1518 FRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLR 1577
KA + YMSFVPLVGEVHGFFS LPRMI+SKLRMSNCL+LEG+NN+W PPCKVLR
Sbjct: 1402 ------KAVAAYMSFVPLVGEVHGFFSSLPRMIISKLRMSNCLLLEGDNNEWVPPCKVLR 1455
Query: 1578 GWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTEN 1637
WN++A SLLPD LL KHLGLGFL K+I LSD LARALGI+EYGPKILLQIISSLC TE+
Sbjct: 1456 SWNEQARSLLPDSLLCKHLGLGFLDKNIHLSDPLARALGIQEYGPKILLQIISSLCHTED 1515
Query: 1638 GLRSMGLSWLASWLNEL 1654
GL+SMGL+WL+SW L
Sbjct: 1516 GLKSMGLAWLSSWEKNL 1532
Score = 710 bits (1833), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/817 (50%), Positives = 518/817 (63%), Gaps = 79/817 (9%)
Query: 1 MYGNHRPRFRPVGGVGGEGTGGGQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTN 60
MYG++ P RP GG G GG PP Q PP PN+ QN N +F NP Q NL
Sbjct: 100 MYGHNPPGHRPTAAGGGSGRGGAVPPLQALPP--TPNYFIQNVNPYFQNPTFQ--ANLGL 155
Query: 61 IPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASNQQRPLT 120
QNP QNP L MQN NL QNPNLP Q N P + Q T
Sbjct: 156 PYLQNPTLPTQNPTLPMQNTNLPLQNPNLPMQNTSFPLQN---------PSFAIQNTNFT 206
Query: 121 QQPQSLPPQPQKLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSL 180
PQP K NKE L+R+D AVVKAR ++I GESVS+WKVS+S LL LQVDSW SL
Sbjct: 207 ----GFRPQPPKRNKEALDRVDGAVVKARRDVIATGESVSAWKVSQSALLALQVDSWESL 262
Query: 181 GFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPF 240
GF MQEVPSLH L+V E +IN+FIHCFVGVR+ITSLYDLE AICK+EG+E+FE+LELGP
Sbjct: 263 GFPMQEVPSLHSLIVIEGKINSFIHCFVGVRRITSLYDLEMAICKNEGVEQFEDLELGPL 322
Query: 241 LRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLAS 300
+R PLI HYF + SD VFKI + ++I CL E+ + + K I ++EFL++IAKKRSL
Sbjct: 323 VRHPLIMHYFSISSDASGVFKITSAEIISCLDEFMEACQDKHIIIEEFLEYIAKKRSLTG 382
Query: 301 KEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQ--PHHKRRKRPIFSSEKKRQLDE 358
+E+LGVRIQ+LGMHI+FIREARK + +TLKK L+ P K R+ P+ SSEKK+ LDE
Sbjct: 383 RERLGVRIQSLGMHISFIREARKLEHMTLKKSQGSLKQIPDKKIREHPLRSSEKKK-LDE 441
Query: 359 RFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSN---DISSNVQLPS-Q 414
RFSA+ +RVKSF+S ++DFG KH FVSS SE++ SDD YE S D SN + S
Sbjct: 442 RFSAMSQRVKSFASAHDDFGGKHTIFVSSCSEEDGSDDHKYEESEEDIDGCSNSKFSSPN 501
Query: 415 IKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCT 474
K DRVSSCPYPS EE+ RLGLKGE +++G SS + G K+KRK N CT
Sbjct: 502 SKTRDRVSSCPYPSAIEEMTRLGLKGETEGNPSASG-SSMHSENTGPFKRKRKSSNRSCT 560
Query: 475 SSAPAKFLRRNKAKQRALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMS 534
S K +RNK + EAC+ +T++
Sbjct: 561 VSKYLKLPKRNKLEL------------------------------------EACQEHTIA 584
Query: 535 EVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAI 594
EVLE+M QF+ ++ K MK M SSYP +GLLN+AVTSIK GMWDS+YDT QAI
Sbjct: 585 EVLERMLQFHGTQTKQR-----KIMKSMLSSYPFVGLLNVAVTSIKSGMWDSIYDTFQAI 639
Query: 595 TQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILN 654
+Q EL N + EYESI+ PS V V E+VM+ + + E D+ I +
Sbjct: 640 SQDELTNKLPDKHSEYESID-GPSTGVTV------------EDVMRNLVTFFELDHDISH 686
Query: 655 NAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFT 714
+ +SP+ K +L R+L +CE W+ E+F +KEFKSLG+G+FFTFLEKHAS+L EL K T
Sbjct: 687 SGKSPLEKKFLLFRQLSNCEFWVAEKFSVKEFKSLGFGDFFTFLEKHASILPNELHKCLT 746
Query: 715 DDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGS 774
DT ++S LEV ++Q LVVL+SQASN+LWE+E +TKQ IS LL+RQFP + FKI + G
Sbjct: 747 SDTYEKSPLEVCMLQKQLVVLLSQASNSLWENETLTKQKISMLLKRQFPSVGFKILENGC 806
Query: 775 MESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSV 811
M+ FL+ V + ++ V+S CVLFS T+LGT D S+
Sbjct: 807 MDDFLDIVREQKSCVVSTCVLFSSTLLGTYTIKDSSL 843
Score = 229 bits (584), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 109/209 (52%), Positives = 143/209 (68%), Gaps = 24/209 (11%)
Query: 1879 SCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKN-MWTKELLSPGSAAIDWESNE 1937
SC +WR FF+ +G+TDFVQ+VQV+K+V+DISH KN MW ++L+S G+ A DWES E
Sbjct: 1547 SCTNCRWRGFFQALGVTDFVQIVQVEKNVSDISHMILKNEMWDRDLISHGTIAKDWESPE 1606
Query: 1938 LVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCIC 1997
L +LL++LDTLWDDC++DK+ G+ K +GD + F+SS + IC
Sbjct: 1607 L-----------------NLLDVLDTLWDDCFSDKVSGYCNFKSSGDRKPFKSSLMTSIC 1649
Query: 1998 DIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDD 2057
D QW SSMDDELHYPKDLF+D D V +LG SAPY +PKV+S KL CDIG KT+VT+DD
Sbjct: 1650 DFQWIASSMDDELHYPKDLFYDSDEVHLVLGSSAPYALPKVRSGKLACDIGFKTKVTLDD 1709
Query: 2058 ILEILKVWTRLEAPF------MARYYLLV 2080
IL IL+ W R E PF M+++Y +
Sbjct: 1710 ILGILQEWRRSETPFKASIAQMSKFYTFI 1738
>gi|168050836|ref|XP_001777863.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670728|gb|EDQ57291.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1962
Score = 863 bits (2231), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 497/1288 (38%), Positives = 730/1288 (56%), Gaps = 101/1288 (7%)
Query: 838 TSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIR 897
T+ DA++ L+ AP L +L+ WS WD +FAP+LGP WL E + LVT G +++
Sbjct: 14 TNADALQALINAPMLVNLSLWSQWDTIFAPTLGPFLSWLEREGFKGNIYTLVTDAGVILK 73
Query: 898 IDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNV 957
ID +A+ D+FL + + G S AV+L S AL GG + P LLK +A A V+ ++
Sbjct: 74 IDGTATADNFLPSLISGDSKSAAVQLTSIVALYGGINHAPSALLKTYATKALTVLTRSFA 133
Query: 958 EDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGY 1017
++ + M LD L
Sbjct: 134 SEVHSEDYSRVAM---------------------------------------LLLDILCV 154
Query: 1018 LPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAF 1077
+P E + FAA +LL S ++ + S +L C E R +LH++G++ GI EWI+++ A
Sbjct: 155 VPQELQFFAAKILLPAFSSAVEYSASILLKSCKIDEHRFLLHKLGVAFGIEEWINNFKA- 213
Query: 1078 CSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEAS 1137
C +S M + +++ + S + SV + + E + + I + +
Sbjct: 214 CVFKSS---MTEGAVVQDISSNEMRSPAISGNHEQLPSVQSQSDVMETDSTVERIENDQA 270
Query: 1138 DD-----GLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLG 1192
GL +TQ+ P+ + A +VE IRR+EFG+G ++ E +L +QHAR+G
Sbjct: 271 STSAVVTGLSG-STQSAPDFDIRQRANDVVEFIRREEFGIGQDLKVQEQELLARQHARMG 329
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAEN 1252
RALH LSQ+LYSQDSHF+LELVQNADDN Y VE L F++Q ++V NNE GF++ N
Sbjct: 330 RALHRLSQDLYSQDSHFVLELVQNADDNSYAPGVEAALVFVVQSGHVMVFNNEVGFTSAN 389
Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
+RALCDVGNSTK GSS GYIG KGIGFKSVFRVTDAPEIHSNGF+VKFD +E +GF+LP
Sbjct: 390 LRALCDVGNSTKTGSSTGYIGHKGIGFKSVFRVTDAPEIHSNGFNVKFDITESNLGFILP 449
Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESK---------CWNTCIRLPFRTKFSEGIAMNNIVDM 1363
T++ C ++L K CWNT I LP ++ S G M ++
Sbjct: 450 TIIEARQ----CAKSYSGTLELVGKKVFSHEKVVCWNTRIDLPIKSSISNGKGMASLAAK 505
Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLR 1423
F D+HPSLLLFLHRL+CI +N + + ++ ++ + +G+++V + TW VA ++
Sbjct: 506 FDDIHPSLLLFLHRLRCIAVQNDVVSATKLMHREDLANGLVRVVHDKGHATWLVARHQVL 565
Query: 1424 AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSS 1483
AGV RP++ TEIALA L++ +GNY + QQ VFAFLPLR+YGL+FI+Q DF+LPSS
Sbjct: 566 AGVPRPNILETEIALAFPLEDLIDGNYKAVAEQQQVFAFLPLRSYGLRFIVQCDFLLPSS 625
Query: 1484 REEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFF 1543
REEVD +S WNQWL SE P +FV A F LP A A S Y+ FVPL GEV GFF
Sbjct: 626 REEVDSDSAWNQWLRSEIPEVFVKAAEIFKTLPTLGTRGA-AVSNYLKFVPLEGEVLGFF 684
Query: 1544 SGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSK 1603
S LPR+I ++L+ CL +EG+ + W+ PC VL W++ A L+PD LL++HLGL +L K
Sbjct: 685 SVLPRLIHTRLQAIPCLPVEGSESSWSFPCAVLMRWSETARELIPDALLKEHLGLQYLDK 744
Query: 1604 DIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTI--SFHS 1661
++ +SDSLAR+LG+++YG K L+ ++ S+ + +G+ MGL W+ WL EL+
Sbjct: 745 EVDISDSLARSLGVQKYGSKTLVALMKSVSQKSDGVGRMGLEWIRHWLIELHACLQKEQE 804
Query: 1662 SGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSD-CSVFDGGFG-LEAFP 1719
SS S E+ L L+ + FIPL+ G+F+++ +G +W + + DG L FP
Sbjct: 805 VFTSSADSTNESILFQELRAMQFIPLASGSFTAIKDGPVWFANHPAELQDGAIKPLGCFP 864
Query: 1720 NLCAKLRTVSPALL---SASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPA 1776
L A+LRTV P L SAS VD + ++ ++ ML ++GV+ +SAH ++K H++PA
Sbjct: 865 LLYAELRTVHPTLFHSTSASEVDLADDPNVTGKVIS-MLHQLGVRPISAHHVLKSHVIPA 923
Query: 1777 ISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHG-FKRPS 1835
++DE + +L+ Y+ + HL C C E I +L++ A + TN+G F+
Sbjct: 924 MADENCTAKETSLLLQYIGYAKWHLGSSCNQCTGEIALIFEQLKQNALISTNNGVFRMGE 983
Query: 1836 EIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGIT 1895
+ PIHFGK G+P ++ +KW EV +YL+ ++ + WRKF E+G+T
Sbjct: 984 KEPIHFGKSMGSPFDAQDVLSGCSMKWNEVSDSYLQLALPGQITPNVESWRKFLSELGVT 1043
Query: 1896 DFVQVVQVDKDVADISHTGFKNMWTKELLS------PGSAAIDWESNELVHLLSLLTTNV 1949
DFVQV+ V K + + + ++W E+ G DWES ELV L+S +TT
Sbjct: 1044 DFVQVIPVKKIIKNKA----ASLWKDEVWEGVNKDDTGCVLEDWESPELVDLISTVTTKT 1099
Query: 1950 N--------RQCSKHLLEILDTLWDDCYTDKIMGFFK---SKPTGDDRSFQSSFINCICD 1998
+ ++CS + ILD LW Y + F+ K +F +S++ +
Sbjct: 1100 SARKVHKKAKRCSA-IFSILDKLWQSNYASRCRATFRISDEKQAKQGLTF-ASWVLQVRK 1157
Query: 1999 IQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDI 2058
++W SS+D LH P LF C++V ILG A Y+ ++ + L +GL+TEVT+DD
Sbjct: 1158 LKWVKSSLDSRLHTPASLFQRCESVNKILGNHAAYLRTQIHNVHLTETLGLRTEVTVDDC 1217
Query: 2059 LEILKVWTR----LEAPF--MARYYLLV 2080
L +L+ W LEA MAR YL +
Sbjct: 1218 LSLLQHWGSSNQVLEASVAQMARLYLFL 1245
>gi|302811920|ref|XP_002987648.1| hypothetical protein SELMODRAFT_126522 [Selaginella moellendorffii]
gi|300144540|gb|EFJ11223.1| hypothetical protein SELMODRAFT_126522 [Selaginella moellendorffii]
Length = 1823
Score = 822 bits (2123), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/1253 (39%), Positives = 714/1253 (56%), Gaps = 122/1253 (9%)
Query: 836 SVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKV 895
S T DA L++AP L+DL WS W ++ P LGPL WL + +++L + TRDGK+
Sbjct: 72 SKTDADAEACLVKAPMLADLTLWSQWTDIYEPQLGPLLEWLEHGRCNRQVLAVATRDGKL 131
Query: 896 IRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKN 955
+R+D ASVDSFL+A + G S A +L+S L GG P LL+ HA ++ F
Sbjct: 132 LRVDGDASVDSFLDAVINGRSLDVAAQLVSLVVLYGGAARTPKALLQTHALKGLQIYF-- 189
Query: 956 NVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCL 1015
D A ++N G +KQ + A F LDCL
Sbjct: 190 --------------------------DRARVNN--GRKRKQENRHEAAC-----FILDCL 216
Query: 1016 GYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYH 1075
P +F F+A+VL +GL DA S +L C LMLH VGL+LG+ +WI+DY
Sbjct: 217 ASAPHDFIEFSAEVLYAGLSRVFPDAGSLLLQNCSSVLHHLMLHRVGLALGVPQWINDYS 276
Query: 1076 AFCSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHI---EECGAICDTI 1132
F + +D S +S S C + F+ D + + G+ ++
Sbjct: 277 TFEAASHAD-------------ESYRDSSSACE--NHFKKPRRDETVLSERQPGSQARSV 321
Query: 1133 CGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLG 1192
E +D + E+ DA +V IR+DEFG +S +L KQHAR+G
Sbjct: 322 KKEVAD----------MSEESLDSDAREVVRMIRKDEFGFEEQDEVTQSGLLAKQHARMG 371
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAEN 1252
RAL CLSQ LYS+D HF+LELVQNADDN YP N PTL FI+Q+ IVVLNNE+GF+A+N
Sbjct: 372 RALECLSQNLYSEDCHFILELVQNADDNSYPLNAHPTLAFIIQKDRIVVLNNEKGFTAKN 431
Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
IRALCDVG STKKG AGYIG KGIGFKSVFRVT++P+IHSNGFHVKFD SE IGF+LP
Sbjct: 432 IRALCDVGKSTKKGVPAGYIGHKGIGFKSVFRVTESPQIHSNGFHVKFDISESSIGFILP 491
Query: 1313 TLV-PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSL 1371
T V P + +LS+ + ++ C NT I LP R E N++ MF D+ P L
Sbjct: 492 TWVQSPLDNAYLENVLSQATGE-DTWCINTRIDLPLRASVVEA---NSMSRMFQDVQPKL 547
Query: 1372 LLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
LLFLH+L+ ++ +N +DS + +R++ +G+ IIK++ G T+ V Q L A + R V
Sbjct: 548 LLFLHKLRTVVVKNEASDSCLTMRRQDIGNNIIKLTDGRSTGTFLVVRQLLDAPMDRAGV 607
Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
TE+ALA L+ES +G+Y L QQ VFAFLP+R+YGLKF++QGDF+LPSSRE++D +S
Sbjct: 608 PDTELALAFELKESRDGHYAALAVQQQVFAFLPVRSYGLKFVVQGDFILPSSREDLDRDS 667
Query: 1492 PWNQWLLSEFPALFVSAERSFCDLPCFRENPA-----KAASVYMSFVPLVGEVHGFFSGL 1546
WNQWL +E P LF+ + L CF+ + +A S Y+S+VP+ GEV GFFS +
Sbjct: 668 AWNQWLRAEVPKLFLQS------LHCFQSSAVFGSRIEAISSYLSYVPVEGEVLGFFSSV 721
Query: 1547 PRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIV 1606
PR+ILS LR + CL +EG + W+ PC LRGW D L+PD +L+K+LGL +L ++++
Sbjct: 722 PRIILSLLRAAPCLPVEG-SELWSLPCTTLRGWGDAFRLLVPDDMLKKYLGLLYLDREVI 780
Query: 1607 LSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSS 1666
+SD LA ALG+ EYG + L+ ++ + ++ L +G+ WL L E++ F Q+
Sbjct: 781 ISDILASALGVPEYGVETLVGLLKAAAKS---LDDLGIEWLRRCLVEIHKC-FAYEKQNF 836
Query: 1667 LQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEA----FPNLC 1722
+ E + L+ +PFIPLS G F+S+++G +W F G GL+ F +L
Sbjct: 837 VHQ--EQKVAGTLRGVPFIPLSSGGFTSLEDGLVW-------FLDGEGLQGHSKDFLDLY 887
Query: 1723 AKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETT 1782
+ +RTVSP L+ ++S+ G I ML+K+GV+ ++AH+++K H+LPA++
Sbjct: 888 SDIRTVSPFLID----EESTAGTI-----KEMLVKLGVKVVTAHEVIKSHVLPAMASVDC 938
Query: 1783 ANGDKNLMADYLCFVMMHLEYY--CPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIH 1840
D +L+ YL + +H + C +C E++ ++S+LR A VLT G + +E IH
Sbjct: 939 LEKDPSLLLQYLAYCALHFDKLDNCRSCQAEKDGLLSQLRNTAIVLTKDGLRSVAEHSIH 998
Query: 1841 FGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQV 1900
FGKE GN V V L+ D + W EVD YL+ + WRKFF +G++DF+ V
Sbjct: 999 FGKELGNIVDVEKLLEDAPVPWCEVDSKYLQFNPSPAF------WRKFFMSLGVSDFINV 1052
Query: 1901 VQVDKDVADISHTGFKN-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLE 1959
+ V D S T +KN W + DW + + + L + + + L
Sbjct: 1053 LPSKVRVGDKSSTVWKNESWKHADGNTAWIIQDWGCADSFRICNALARSGRPENCRLFLT 1112
Query: 1960 ILDTLWDDCYTDKIMGFF--KSKPT-GDDRSFQSSFINCICDIQWTISSMDDELHYPKDL 2016
+LD++WD+ Y G + + P+ GDD +SSF+ + W SS+D +LH P +L
Sbjct: 1113 VLDSMWDEHYDRFSKGSYSVEGAPSIGDD--CESSFVLQLRGYTWIGSSLDHQLHKPTEL 1170
Query: 2017 FHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVWTRLE 2069
FH +V S+LG Y P+V S KL +GL TEV ++DI+ ++K+ E
Sbjct: 1171 FH--KSVESMLGSYVSYARPQVGSSKLAKALGLHTEVRLEDIINLMKLLNERE 1221
>gi|302811833|ref|XP_002987605.1| hypothetical protein SELMODRAFT_158880 [Selaginella moellendorffii]
gi|300144759|gb|EFJ11441.1| hypothetical protein SELMODRAFT_158880 [Selaginella moellendorffii]
Length = 1734
Score = 805 bits (2080), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/1237 (39%), Positives = 705/1237 (56%), Gaps = 121/1237 (9%)
Query: 851 FLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEA 910
L+DL WS W ++ P LGPL WL + +++L + TRDGK++R+D ASVDSFL+A
Sbjct: 1 MLADLTLWSQWTDIYEPQLGPLLEWLEHGRCNRQVLAVATRDGKLLRVDGDASVDSFLDA 60
Query: 911 ALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRM 970
+ G S A +L+S L GG P LL+ HA ++ F
Sbjct: 61 VVNGRSLDVAAQLVSLVVLYGGAARTPKALLQTHALKGLQIYF----------------- 103
Query: 971 HGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVL 1030
D A ++N G +KQ + A+ F LDCL P +F F+A+VL
Sbjct: 104 -----------DRARVNN--GRKRKQ-----ENHHEAACFILDCLASAPHDFIEFSAEVL 145
Query: 1031 LSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCV 1090
+GL DA S +L C LMLH VGL+LG+ +WI+DY F + +D
Sbjct: 146 YAGLSRLFPDAGSLLLQNCSSVLHHLMLHRVGLALGVPQWINDYSTFEAASHAD------ 199
Query: 1091 VTCTNAATSGLNSGSGCAEGSLFESVGADVHI---EECGAICDTICGEASDDGLGDCTTQ 1147
S +S S C + F+ D + + G+ ++ E +D
Sbjct: 200 -------ESYRDSSSACE--NHFKKPRRDETVLSERQPGSQLRSVKKEVAD--------- 241
Query: 1148 TLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDS 1207
+ E+ DA +V IR+DEFG +S +L KQHAR+GRAL CLSQ LYS+D
Sbjct: 242 -MSEESLDSDAREVVRMIRKDEFGFEEQDEVTQSGLLAKQHARMGRALECLSQNLYSEDC 300
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
HF+LELVQNADDN YP N PTL FI+Q+ IVVLNNE+GF+A+NIRALCDVG STKKG
Sbjct: 301 HFILELVQNADDNSYPLNAHPTLAFIIQKDRIVVLNNEKGFTAKNIRALCDVGKSTKKGV 360
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV-PPFNIDMFCRL 1326
AGYIG KGIGFKSVFRVT++P+IHSNGFHVKFD SE IGF+LPT V P + +
Sbjct: 361 PAGYIGHKGIGFKSVFRVTESPQIHSNGFHVKFDISESSIGFILPTWVQSPLDNAYLENV 420
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
LS+ + ++ C NT I LP R E N++ MF D+ P LLLFLH+L+ ++ +N
Sbjct: 421 LSQATGE-DTWCINTRIDLPLRASVVEA---NSMSRMFQDVQPKLLLFLHKLRTVVVKNE 476
Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
+DS + +R++ +G+ IIK++ G T+ V Q L A + R V TE+ALA L+ES
Sbjct: 477 ASDSCLTMRRQDIGNNIIKLTDGRSTGTFLVVRQLLDAPMDRAGVPDTELALAFELKESR 536
Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
+G+Y L QQ VFAFLP+R+YGLKF++QGDF+LPSSRE++D +S WNQWL +E P LF+
Sbjct: 537 DGHYAALAVQQQVFAFLPVRSYGLKFVVQGDFILPSSREDLDRDSAWNQWLRAEVPKLFL 596
Query: 1507 SAERSFCDLPCFRENPA-----KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLI 1561
+ L CF+ + +A S Y+S+VP+ GEV GFFS +PR+ILS LR + CL
Sbjct: 597 QS------LHCFQSSAVFGSRIEAISSYLSYVPVEGEVLGFFSSVPRIILSLLRAAPCLP 650
Query: 1562 LEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYG 1621
+EG + W+ PC LRGW D L+PD +L+K+LGL +L +++++SD LA ALG+ EYG
Sbjct: 651 VEG-SELWSLPCTTLRGWGDAFRLLVPDDMLKKYLGLLYLDREVIISDILASALGVPEYG 709
Query: 1622 PKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQR 1681
+ L+ ++ + ++ L +G+ WL L E++ F Q+ + E + L+
Sbjct: 710 VETLVGLLKAAAKS---LDDLGIGWLRRCLVEIHKC-FAYEKQNFVHQ--EQKVAGTLRG 763
Query: 1682 IPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEA----FPNLCAKLRTVSPALLSASA 1737
+PFIPLS G F+S+++G +W F G GL+ F +L + +RTVSP L+
Sbjct: 764 VPFIPLSSGGFTSLEDGLVW-------FLDGEGLQGRSKDFLDLYSDIRTVSPFLID--- 813
Query: 1738 VDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFV 1797
++S+ G I ML+K+GV+ ++AH+++K H+LPA++ D +L+ YL +
Sbjct: 814 -EESTAGTI-----KEMLVKLGVKVVTAHEVIKSHVLPAMASVDCLKKDPSLLLQYLAYC 867
Query: 1798 MMHLEY-YCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIH 1856
+H E C +C E++ ++S+LR A VLT G + +E IHFGKE GN V V L+
Sbjct: 868 ALHFELDNCRSCQAEKDGLLSQLRNTAIVLTKDGLRSVAEHSIHFGKELGNIVDVEKLLE 927
Query: 1857 DIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFK 1916
D + W EVD YL+ + WRKFF +G++DF+ V+ V D S T +K
Sbjct: 928 DAPVPWCEVDSKYLQFNPSPAF------WRKFFMSLGVSDFINVLPSKVRVGDKSSTVWK 981
Query: 1917 N-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMG 1975
N W + DW + + + L + + + L +LD++WD+ Y G
Sbjct: 982 NESWKHADGNTAWIIQDWGCADSFRICNALARSGRPENCRLFLTVLDSMWDEHYDRFSKG 1041
Query: 1976 FF--KSKPT-GDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAP 2032
+ + P+ G D +SSF+ + W SS+D +LH P +LFH +V S+LG
Sbjct: 1042 SYSVEGAPSIGGD--CESSFVLQLRGYAWIGSSLDHQLHKPTELFH--KSVESMLGSYVS 1097
Query: 2033 YIVPKVKSEKLVCDIGLKTEVTIDDILEILKVWTRLE 2069
Y P+V S KL +GL TEV ++DI+ ++K+ E
Sbjct: 1098 YARPQVGSSKLAKALGLHTEVRLEDIINLMKLLNERE 1134
>gi|147796512|emb|CAN68152.1| hypothetical protein VITISV_009301 [Vitis vinifera]
Length = 604
Score = 568 bits (1464), Expect = e-158, Method: Compositional matrix adjust.
Identities = 303/520 (58%), Positives = 357/520 (68%), Gaps = 53/520 (10%)
Query: 816 SLLETTSAITHTGLRPKMSESVTS-KDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPG 874
+ E I+H+G P + + V+ + A E+ AP LSDLNSWSHWD +FAPSLGPL
Sbjct: 88 TFFELDHDISHSGKSPLLGKEVSPLQAAFEL---APMLSDLNSWSHWDLIFAPSLGPLVW 144
Query: 875 WLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEK 934
WLLNEVN KELLCLVT+DGKV+RIDHSA+ DSFLEA+L+GSSF+TAV+LLS F+L GG++
Sbjct: 145 WLLNEVNTKELLCLVTKDGKVMRIDHSATXDSFLEASLQGSSFRTAVQLLSLFSLFGGKR 204
Query: 935 NVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQ 994
+VP LLKCHAR AFEV+ +N+VE++EV SQ+ MHG L R+ D+ NLS Q
Sbjct: 205 HVPFSLLKCHARQAFEVILRNSVENMEVNESQDSLMHGKPLFQREMLDMDATSNLSSGSQ 264
Query: 995 KQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTEL 1054
+ + + KAVPVASRF LDCLGYLPSEFRSFAAD+LLSGLQ + PSAIL EC+Q +
Sbjct: 265 RNMSRTSKAVPVASRFLLDCLGYLPSEFRSFAADILLSGLQPFTINGPSAILDECNQMDQ 324
Query: 1055 RLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSG-------- 1106
R+MLHEVGLSLG+++WI DYHAF S ++ + C AA+S L G
Sbjct: 325 RVMLHEVGLSLGVMQWIDDYHAFSSAAATNSFVSSGALCLQAASSELRRGXKFTQNALAK 384
Query: 1107 --CAEGSLFESVGADVHIEECGAICDTICGEA-SDDGLGDCTTQTLPEDKECEDAALIVE 1163
EG + S GA H EE IC T E S D G PE E +DA L++E
Sbjct: 385 FPSCEGEMIISDGACGHNEEHSEICQTTGSEGVSVDRSGHGCILYAPELNEHKDATLVIE 444
Query: 1164 SIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLEL---------- 1213
SIRRDEFGL P +S+MES+MLKKQHARLGRALHCLSQELYSQDSHFLLEL
Sbjct: 445 SIRRDEFGLDPTLSSMESSMLKKQHARLGRALHCLSQELYSQDSHFLLELTVGSKFNAIE 504
Query: 1214 ----------------------------VQNADDNIYPENVEPTLTFILQESGIVVLNNE 1245
VQNADDNIYPENVEPTLTFILQ+ GI+VLNNE
Sbjct: 505 IYALGGKQWRWSLVHHXRKKPWCLHIAQVQNADDNIYPENVEPTLTFILQDRGIIVLNNE 564
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRV 1285
QGFSA+NIRALCDVGNSTKKGS AGYIG+KGIGFKSVFRV
Sbjct: 565 QGFSAQNIRALCDVGNSTKKGSKAGYIGQKGIGFKSVFRV 604
Score = 80.5 bits (197), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 57/88 (64%), Gaps = 3/88 (3%)
Query: 576 VTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQ---D 632
VTSIK GMWDS+YDT QAI+Q EL N + EYESI++EPSE V T ++
Sbjct: 17 VTSIKSGMWDSIYDTFQAISQDELTNKLPDKHSEYESIDVEPSENDTVAITDCILEHRHS 76
Query: 633 IQVEEVMKKVSKYLEFDNSILNNAQSPV 660
+ VE+VM+ + + E D+ I ++ +SP+
Sbjct: 77 VTVEDVMRNLVTFFELDHDISHSGKSPL 104
>gi|359477941|ref|XP_002270965.2| PREDICTED: uncharacterized protein LOC100250350 [Vitis vinifera]
Length = 488
Score = 494 bits (1273), Expect = e-136, Method: Compositional matrix adjust.
Identities = 273/530 (51%), Positives = 351/530 (66%), Gaps = 49/530 (9%)
Query: 55 LQNLTNIPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASN 114
+QN TN+P QNPN +QN + +QN + QN N
Sbjct: 1 MQN-TNLPLQNPNLPMQNTSFPLQNPSFAIQNTNF------------------------- 34
Query: 115 QQRPLTQQPQSLPPQPQKLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQV 174
PQP K NKE L+R+D AVVKAR ++I GESVS+WKVS+S LL LQV
Sbjct: 35 ---------TGFRPQPPKRNKEALDRVDGAVVKARRDVIATGESVSAWKVSQSALLALQV 85
Query: 175 DSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEE 234
DSW SLGF MQEVPSLH L+V E +IN+FIHCFVGVR+ITSLYDLE AICK+EG+E+FE+
Sbjct: 86 DSWESLGFPMQEVPSLHSLIVIEGKINSFIHCFVGVRRITSLYDLEMAICKNEGVEQFED 145
Query: 235 LELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAK 294
LELGP +R PLI HYF + SD VFKI + ++I CL E+ + + K I ++EFL++IAK
Sbjct: 146 LELGPLVRHPLIMHYFSISSDASGVFKITSAEIISCLDEFMEACQDKHIIIEEFLEYIAK 205
Query: 295 KRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQ--PHHKRRKRPIFSSEK 352
KRSL +E+LGVRIQ+LGMHI+FIREARK + +TLKK L+ P K R+ P+ SSEK
Sbjct: 206 KRSLTGRERLGVRIQSLGMHISFIREARKLEHMTLKKSQGSLKQIPDKKIREHPLRSSEK 265
Query: 353 KRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSN---DISSNV 409
K+ LDERFSA+ +RVKSF+S ++DFG KH FVSS SE++ SDD YE S D SN
Sbjct: 266 KK-LDERFSAMSQRVKSFASAHDDFGGKHTIFVSSCSEEDGSDDHKYEESEEDIDGCSNS 324
Query: 410 QLPS-QIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKI 468
+ S K DRVSSCPYPS EE+ RLGLKGE +++G SS + G K+KRK
Sbjct: 325 KFSSPNSKTRDRVSSCPYPSAIEEMTRLGLKGETEGNPSASG-SSMHSENTGPFKRKRKS 383
Query: 469 ENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQD-DELNEADISFSNESMRMFITTWKEA 527
N CT S K +RNK + L ++ ++ ++ + LNEAD +N+SMRMFITTWKEA
Sbjct: 384 SNRSCTVSKYLKLPKRNKLELVPLSVDHDNEKKELNNLNEADFLLANDSMRMFITTWKEA 443
Query: 528 CKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVT 577
C+ +T++EVLE+M QF+ ++ K MK M SSYP +GLLN+AV+
Sbjct: 444 CQEHTIAEVLERMLQFHGTQTKQR-----KIMKSMLSSYPFVGLLNVAVS 488
>gi|405957164|gb|EKC23395.1| hypothetical protein CGI_10019612 [Crassostrea gigas]
Length = 3111
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 386/1256 (30%), Positives = 602/1256 (47%), Gaps = 175/1256 (13%)
Query: 838 TSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPL----------PGWLL---NEVNVKE 884
T + A+ L P L ++ WS W +F P LG L W L N++ +
Sbjct: 1000 TKEAALACLENCPLLENIAEWSQWTLVFEPQLGKLRDFIQKYGECQTWPLEGGNKIMKTD 1059
Query: 885 LLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCH 944
+ L + G+ +R+ S+S + F A S + L+S + G +N P LL H
Sbjct: 1060 FIALESSPGQFLRLISSSSPEKFNAALQRRSPREVGGHLMSLVVGSKGVENTPRALLASH 1119
Query: 945 ARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAV 1004
+ E +FK +HG A++D G + G+AV
Sbjct: 1120 IK---EELFK---------------LHGE----------ASVDRTPGAPAS--FQHGEAV 1149
Query: 1005 PVASRFFLDCLGYLPSEFRSFAAD-VLLSGLQSTIKDAPS-AILLECHQTELRL-MLHEV 1061
+F LDCL LP + S A+ + L L + + S +LL+ TE +L L ++
Sbjct: 1150 ---VKFVLDCLLVLPVKSCSVLANQMFLDPLGQVVGQSKSKTLLLQSCTTESQLSQLQKL 1206
Query: 1062 GLSLGILEWIHDYHAFCSTGTSDL--LMPCVV----------------TCTNAATSGLN- 1102
G LG+ EW + A C + L + P + +A+TS L+
Sbjct: 1207 GCMLGVAEWTNSIQARCRPSHASLEEVDPEIADEFFRDDPEPIDDSEDEEDSASTSYLSD 1266
Query: 1103 -------SGSGCAEGSLFESVGADVHIEECGAICDTICGEA-------SDDGLGDC-TTQ 1147
G E + ++ +EE D E S+D + D + +
Sbjct: 1267 VEVETEVKGQVEEEELIVLDERTEIEMEEGDVTGDKQKKEVTIDLTVKSEDNVIDLLSPE 1326
Query: 1148 TLPEDKECEDAAL-----IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQEL 1202
PE E A IVE IRR+EFG+G ++ +++ Q RLGR+L LS++L
Sbjct: 1327 PKPEAVSMETDAEKTCRGIVEQIRREEFGIGVELNEDGQRLMRVQQERLGRSLDRLSKDL 1386
Query: 1203 YSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRALCD 1258
YS+D+HF+LEL+QNADDN YP+++ P++ FI+Q G+VVLNNE GF+ +NIRALCD
Sbjct: 1387 YSKDTHFVLELIQNADDNSYPDSLGEGECPSVQFIMQPEGVVVLNNETGFADKNIRALCD 1446
Query: 1259 VGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPF 1318
VG STK AGYIG+KGIGFKSVFRVTD PEIHSNGFH+KFD G G++LP V
Sbjct: 1447 VGRSTKGKHKAGYIGQKGIGFKSVFRVTDEPEIHSNGFHIKFDVHSGPTGYILPQWV--- 1503
Query: 1319 NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
++ + + W T I LP + + F+D+HPSLLLFL+RL
Sbjct: 1504 ----------EEDKWKDGEGWTTKIVLPLKESIRS--QTTTLAARFNDIHPSLLLFLNRL 1551
Query: 1379 QCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIR------PDVK 1432
+ I N + + + I +K G+ +++++ + W V + L A I +V+
Sbjct: 1552 RKITIDNRVENFVQNIHRKDYGENVVEITHDDKMDRWLVVKKSLDASKISLQAKSGVEVE 1611
Query: 1433 TTEIALALTLQ---ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG 1489
+TEIALA L+ + + + P +QPVFAFLPLR+YG +FI+QGDF +PSSRE+VD
Sbjct: 1612 STEIALAFPLRPPGQKKDSKFVP--PKQPVFAFLPLRSYGFRFIVQGDFDVPSSREDVDR 1669
Query: 1490 NSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRM 1549
+S WNQWL +E LF+ A F F +A + ++ FVP+ E+ FF +
Sbjct: 1670 DSSWNQWLRNEIHILFLEALEIFKSHADFSN--MEAVTSFLQFVPVEDEIIDFFRPVANQ 1727
Query: 1550 ILSKLRMSNCLILEGNNN---QWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDI- 1605
IL KLR C+ + N+ W P +++R + ++ LL+ L L +L +D+
Sbjct: 1728 ILQKLRAKPCVPTQPNSKGQCAWKIPSQMVRVKDLLVLEVVTPDLLKDTLNLFYLHRDVA 1787
Query: 1606 -VLSDSLARALGIEEYGPKILLQIISSL----CRTENGLRSMGLSWLASWLNELY-TISF 1659
L+ SL +LGIE + LL I SL T+ + + A WL +Y +++
Sbjct: 1788 ATLNSSLTHSLGIESLNTEHLLLIGKSLSLQWAATDGRDKDNDILMTAKWLACVYRSLNE 1847
Query: 1660 HSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWL----HSDCSVFDGGFGL 1715
+ Q + D L+ + IPL+ + TI+ +++ S D
Sbjct: 1848 FTDNQP---------VYDKLKALKIIPLATYQRQCTADSTIFFPLSENTELSYND----- 1893
Query: 1716 EAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILP 1775
L L T+ P L+S++ + +S + ++LLKIGV QLS +++ HILP
Sbjct: 1894 -PMSVLQQDLNTIHPLLVSSADSEVNS-------QVQKLLLKIGVCQLSPREVISHHILP 1945
Query: 1776 AISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPS 1835
+ +N++ Y+ ++ E ++E EL A + TN G + PS
Sbjct: 1946 ILKSNKVKEKSENILISYVVYIKEQTERDPSLINME------ELGMSAMLKTNKGLQNPS 1999
Query: 1836 EIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGIT 1895
++PIHF E+ N +++ ++ D W +D YL++ + +WR FF +IG+T
Sbjct: 2000 KVPIHFTPEYKNSINLQAVLPGYD--WVLLDPVYLRNKKN---PVEIQRWRLFFSKIGLT 2054
Query: 1896 DFVQVVQVDKDVA--DISHTGFKNMWTKELLS---PGSAAIDWESNELVHLLSLLTTNVN 1950
DF+ + QV+ +++ +I T + + KELL GS DW S+ELV L+ + +
Sbjct: 2055 DFISLKQVNVEISNENIGTTPWAPL--KELLGTIPKGSYIKDWSSSELVDLIQNNSQPTS 2112
Query: 1951 R-QCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQWTISS 2005
Q + L +L WD + I ++ G + SSF + W S
Sbjct: 2113 YPQQMRTLCVLLAEEWDRNFKGYISTHMCTESGGLLKDVDSSFSLLLRTASWVPGS 2168
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/213 (21%), Positives = 116/213 (54%), Gaps = 16/213 (7%)
Query: 132 KLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGF--QMQEVPS 189
+ +++ ++ + + ++A +E A E VS +V +L + + + LG Q++P
Sbjct: 538 RFSQDQIQMVAKECIEALAE---ADEHVSVERVENLMLQRFGKRNIAELGHWRNAQQIPC 594
Query: 190 L--HKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSE---GIEKFEELELGPFLRQP 244
L H M+ ++NA++ ++ I +L++L+ +C +E + FE L++GP R P
Sbjct: 595 LFEHNRMI--CKVNAYVQAYMNTHSICTLHELK--LCMAEFAPDKDSFEALQIGPLQRLP 650
Query: 245 LISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHK--AKDIKVDEFLDFIAKKRSLASKE 302
+I ++F D+ + +I T+D+ L Y + ++ + ++++ F++++ + + +
Sbjct: 651 IIYNHFKFPHDMADIPEITTSDIFENLRNYLNKYQKWSTKLELEHFMEYLVETYHVENAY 710
Query: 303 QLGVRIQNLGMHINFIREARKSQDVTLKKFLKE 335
LG+RI++L + + ++++++ T +K ++
Sbjct: 711 VLGIRIRSLPLAMQVLKKSQRDAGGTRRKVTED 743
>gi|326672123|ref|XP_001920014.3| PREDICTED: hypothetical protein LOC335618 [Danio rerio]
Length = 2252
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 304/934 (32%), Positives = 471/934 (50%), Gaps = 90/934 (9%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
++E IR+ EFG+G ++ +++KQ ARLGR+L LS ELYS+D+HF+LEL+QNADDN
Sbjct: 580 VIEDIRKTEFGIGVELNQEGQILMEKQQARLGRSLQRLSDELYSKDTHFVLELIQNADDN 639
Query: 1221 IYPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
YP+ E P LTF++++S I +LNNE GF +NIRA+CDVG STK GYIG+KGIGF
Sbjct: 640 SYPDGGEQPALTFVVEKSCITILNNECGFEEKNIRAICDVGRSTKGKHKYGYIGQKGIGF 699
Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
KSVF+VTD PEIHSNGFH++FD + G +G++LP + R ++ ++ K W
Sbjct: 700 KSVFKVTDCPEIHSNGFHIQFDKTSGPMGYILPHWLEQE------RPVNPAAKEIAEKRW 753
Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
T I LP RT E N +F D+HPSLLLFLHRL+ I N LV + ++ +
Sbjct: 754 TTKIHLPLRT---ENYQTKN---LFHDVHPSLLLFLHRLRSITIFNEAEKRLVSMTRRDL 807
Query: 1400 GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
I++VS + W V + L I+ DV++TE+ALA + S+ + P +QPV
Sbjct: 808 SHNILEVSHTGGEERWLVLKKMLFPKKIKDDVESTELALAFQIHSSSCADVKP--QKQPV 865
Query: 1460 FAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFR 1519
FAFLPLR++G +FI+QGDF +PSSRE+VD +S WNQWL SE P LFV A F P F
Sbjct: 866 FAFLPLRSFGFRFIIQGDFDIPSSREDVDRDSSWNQWLRSEIPQLFVQAMDVFSKHPEF- 924
Query: 1520 ENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNN---QWAPPCKVL 1576
+ ++ F+P E+ FF+ + I+ LR CL + ++ ++ P ++
Sbjct: 925 -SGLSGLCYFLQFIPQPSEILDFFNPVANQIIQLLRGRACLPAQEDHEGRVEFRQPSQL- 982
Query: 1577 RGWNDRAHSLLPDILLQKHLGLGFLSKDIVL-----------SDSLARALGIEEYGPKIL 1625
++ PD L+Q+ +G LS+ + L S SL ALG+ +
Sbjct: 983 --------AVCPDRLIQQVIGGQDLSRSLSLWYLHPDLQCSVSGSLLSALGVHRLRAAEI 1034
Query: 1626 LQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETD-LIDNLQRIPF 1684
L + + R L L F +L+S E D L+ +++ +P
Sbjct: 1035 LTVTCCMARELLQHTQTHTDGELEKLARLLVCIFR-----ALESEYEVDALLQSVRDVPM 1089
Query: 1685 IPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLG 1744
IPL+DG+ S+ ++ D V L+A L L TV LL SL
Sbjct: 1090 IPLADGSVVSLSSEGVFFPLD-DVTQAHADLQA---LYQDLNTVDRRLL-------QSLD 1138
Query: 1745 VISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYY 1804
+ + +L ++ V +L ++ HI P + + ++ YL F+ +H +
Sbjct: 1139 ALGNSQVRELLRRLDVHELEPQQVLHQHIYPQLRSGDWQKKPQGIVVSYLVFIKLHAQ-- 1196
Query: 1805 CPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYE 1864
++E + L VLTN GF+ P+E IHF +++ N ++L ++W
Sbjct: 1197 ------DQE--IRSLAAHVPVLTNQGFRCPAEHRIHFSRDYKNIHLPSVL---PGVQWLL 1245
Query: 1865 VDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELL 1924
+D YL+ A S WR F +G+ D + + + +K + + L
Sbjct: 1246 LDSCYLRADADVS------GWRSVFTALGVRDLL-IFRKEKRTLTTAQLSCSPWAAESAL 1298
Query: 1925 SPGSA-----AIDWESNELVHLLSL--LTTNVNRQCSKHLLEILDTLWD--DCYTDKIMG 1975
P A +D + +EL LL+ L+ Q + L+++LD W+ + Y +
Sbjct: 1299 WPTPADGQYTIMDHQCSELHTLLTTDQLSDQEKLQQRRALIQLLDNNWNSGERYGQLVSA 1358
Query: 1976 FFKSKPTGDDRSFQSSFINCICDIQW--TISSMDDELHY--PKDLFHDCDAVRSILGPSA 2031
G R+ +SSFI + + W T M + Y P+ ++ AV +LG
Sbjct: 1359 QLLDADGGVVRNVRSSFIQMLTEFSWIPTHRPMCSTVSYLQPRSVYILSTAVNRLLGAHV 1418
Query: 2032 PYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
Y+ + + + IGL+ V ++D++ LK W
Sbjct: 1419 DYMHSEQTASEFTSAIGLRHSVCVEDMVGYLKRW 1452
Score = 90.9 bits (224), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 108/432 (25%), Positives = 167/432 (38%), Gaps = 66/432 (15%)
Query: 648 FDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLST 707
F S+ N S T ++ LR L E + E FR ++F L G F F+ K+ +L
Sbjct: 88 FQESVCN---SKSTLTLVHLRTL---EKRIAEHFRFRDFSQLQQGTFLDFIIKNKKVLQE 141
Query: 708 ELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISF 767
D D + + + Q + + I LR + +
Sbjct: 142 AAGGAVCIDNQDPGLSGFRPCRQDVFEFIKQCGDG----DDSRLAFIEAALRNHYRIRDS 197
Query: 768 KIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHT 827
+ G + SFL + + + E +S VL S +V E LL T+ + +
Sbjct: 198 RELGHGPL-SFLLNLTQRQKE-LSGHVLAS------------AVRYECPLLPITTGVCVS 243
Query: 828 GLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLC 887
+ V A+ LL AP L DL+ W+ W +F P+ GPL ++ ELL
Sbjct: 244 EPAVGLLGEVGRDQAMAALLSAPLLQDLHDWTQWQLVFLPNHGPLKDFIDKYCGKTELLA 303
Query: 888 LVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARH 947
L G ++R+ F +AA +TA L+ S A+A G N P LL H
Sbjct: 304 LEVSPGVLLRVTSETGDKHFSDAAQALDPLRTAGHLV-SMAVADGIPNTPTALLANHMES 362
Query: 948 AFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVA 1007
A V G +++ D+ G +
Sbjct: 363 ALNV-------------------------GVAQEEISPGDD------------GYSYSTV 385
Query: 1008 SRFFLDCLGYLPSEF-RSFAADVLLSGLQSTIKDAPS-AILLECHQTELRLM--LHEVGL 1063
+RF L+C+ +P+ + V L L + A S A+LLE QT R + LH++GL
Sbjct: 386 ARFVLECVTLIPTRICKELLQQVFLEPLSKVLGQAKSKAVLLEAAQTNTRHLNKLHQMGL 445
Query: 1064 SLGILEWIHDYH 1075
LGI EW D++
Sbjct: 446 LLGITEWRADFN 457
>gi|432926114|ref|XP_004080836.1| PREDICTED: uncharacterized protein LOC101171115 [Oryzias latipes]
Length = 2547
Score = 422 bits (1085), Expect = e-114, Method: Compositional matrix adjust.
Identities = 444/1663 (26%), Positives = 712/1663 (42%), Gaps = 244/1663 (14%)
Query: 526 EACKNNTMSEVLEKMFQFYKPT-DPKKAARYF--------KRMKLMFSSYPCIGLLNIAV 576
E C+ E L+ + + K +P ++ F K+ M ++ + +L+ A
Sbjct: 286 EQCREAIEKETLDHLRKMKKSVFEPLQSTDSFSTINMDVRKKYASMSAAEVVLTVLSNAS 345
Query: 577 TSIKRGMWDSVYDTLQAITQQELNNT-----VSGNSLEYESIEIEP-------SEQVAVV 624
T M L+A++ +L + G SL E+ P +EQ V
Sbjct: 346 TVFSPEMVKHCESFLKAVSDDQLAKALFQLAICGGSLAAPQ-ELVPQKKSSKNTEQSQAV 404
Query: 625 NTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIK 684
+ V + E +K EF L + S VT L + + E +T+ F+I
Sbjct: 405 SDDSDVPALPCEATLK------EFLRESLRSQNSAVT-----LAHIAALERKMTKHFQIP 453
Query: 685 EFKSLGYGEFFTFLEKHASM-------LSTELQKLFTDDTIDRSSLEVSLVQHLL-VVLV 736
+F SL G F FL K+ + S ++ + + + V L+Q L +L
Sbjct: 454 DFGSLEQGSFLEFLVKNIQIPPVLVLVRSPDIASVKLTPSPVCIPVTVQLLQESLGSLLF 513
Query: 737 SQASNNLWESEII-TKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNE------- 788
+ +NNL KQ + E +++ ++ +ES L R+
Sbjct: 514 LEDNNNLAGCGFRPCKQDVFEFIQQCGKTTPSDPDELSCVESALRAHYSVRDSRDLGYGS 573
Query: 789 --VMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRP--KMSESVTSKDAIE 844
++ V + G LS L +E + + T+ +G P ++ E V+ A+
Sbjct: 574 LPTLAGLVQRQRELSGRGLS--LICYESALVSKHTNESPKSGSEPVGRLGE-VSKGQALA 630
Query: 845 ILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASV 904
L P L DL+ WS WD +F P GPL ++ + L L R G +++I +
Sbjct: 631 SLQCCPLLEDLSDWSQWDCIFKPLHGPLKDFIERDAANTGLAALEVRPGLLLKITTNTGH 690
Query: 905 DSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAF-EVMFKNNVEDIEVI 963
F A TA L+S +A G +N P LL H + A E M K ++ E
Sbjct: 691 KHFTSAVAALDPVGTAGHLVS-MMVADGIRNAPTALLANHMQSALAEAMAKEDLSQAEED 749
Query: 964 NSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEF- 1022
S R+ ++F LDCL +P+
Sbjct: 750 ISCYSRV-------------------------------------AKFLLDCLIRIPTRTC 772
Query: 1023 RSFAADVLLSGLQSTIKDAPSA-ILLECHQTELRLM--LHEVGLSLGILEWIHDY----- 1074
++ V L + S +L+ Q++LR + LH +G G+ +WI DY
Sbjct: 773 QALLQKVFLEPFSLVVGQTKSKQVLISTAQSDLRHLNCLHRLGTLSGVTDWIRDYQNKLR 832
Query: 1075 ----HAFC------------------------STGTSDL-------LMPCVVTCTNAATS 1099
H C S DL L P + T
Sbjct: 833 PPQNHNLCKEHMEQVKCSWAVSNSSSLSALNVSEAKEDLTDCASTDLQPGLQTEEEEDEG 892
Query: 1100 GLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAA 1159
+ + CA + A+ E G GEA ++ ++T+ C A
Sbjct: 893 EMCELASCANEKTSKRSDAE---GEAGDGLLNEAGEADEEKRAGFQSETVL----CFQRA 945
Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
I+E IR+ EFG+G +S +L+ RLGR L LS ELYS+D+HF+LEL+QNADD
Sbjct: 946 -IIEDIRKSEFGIGVELSAHGQRLLEVHQERLGRGLARLSTELYSKDTHFVLELIQNADD 1004
Query: 1220 NIYP--ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
N Y V P+L F++++ I +LNNE GF +NIRA+CDVG STK GYIG+KGI
Sbjct: 1005 NSYSLDPTVVPSLAFVVEQDCITILNNETGFQEKNIRAICDVGRSTKGKHRYGYIGQKGI 1064
Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFN-IDMFCRLLSKDPVQLES 1336
GFKSVF+VTD PEIHSNGFH++FD S G +G++LP +D +L+S+
Sbjct: 1065 GFKSVFKVTDCPEIHSNGFHLRFDRSCGPMGYILPHWTEEERPLDPQLKLISQ------- 1117
Query: 1337 KCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
W T IRLP R+ G + ++F D+HPSLLLFLHRL+ I N LV + +
Sbjct: 1118 HSWTTKIRLPLRS----GSRLTQ--NLFHDVHPSLLLFLHRLRSITIYNHSEKRLVTMTR 1171
Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQ 1456
K + +++V E W V L + DV++TE+ALA L + +
Sbjct: 1172 KDLRKNVVEVEHSEGIDRWLVVKTTLYPQMGEQDVESTELALAFQLCSGTAEDLICQPQK 1231
Query: 1457 QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLP 1516
QPVFA+LPLR++G +FI+QGDF +PSSRE+VD +S WNQWL SE P LF+ A F D
Sbjct: 1232 QPVFAYLPLRSFGFRFIIQGDFHIPSSREDVDRDSSWNQWLRSEVPQLFLQAMAVFDDHS 1291
Query: 1517 CFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCK 1574
F K ++ F+PL EV FF+ + I+ L+ L ++ + P +
Sbjct: 1292 EFM--GLKGLCPFLQFIPLPDEVLDFFNPVAGQIIHLLKDKAFLPTRNSDGGVVYKLPSQ 1349
Query: 1575 VLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLS--DSLARALGIEEYGPKILLQIISSL 1632
V + ++ L+KHL L +L D+ + SL LG+ + + +++
Sbjct: 1350 VAICQDAVIRDIIGSDELEKHLSLSYLHPDLSPTPPTSLLTQLGVRYLQGSDVTTVTTAM 1409
Query: 1633 CR--TENG--LRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLS 1688
E G L+ + LA L + H G + ++ NL+ +P IPL+
Sbjct: 1410 ANELVEAGHMLKEEKIRQLARLLVCNFRAVEHGFGDAE-------SILQNLKNLPIIPLA 1462
Query: 1689 DGTFSSVD-EGTIWLHSDC----SVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSL 1743
+G +++ EG I+L + G G A L + V P+LLS L
Sbjct: 1463 NGNVVTLNKEGVIFLMEETKAKKKKAQGQTG--AVSALYKDVSVVHPSLLSC-------L 1513
Query: 1744 GVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEY 1803
G++ + +L ++GV +L ++++ +I P I + + ++ YL F+ H
Sbjct: 1514 GLLESQQVRELLKRLGVHELEPQELLEQYIYPTIKSNNWKSKSEAVVVSYLVFIKQH--- 1570
Query: 1804 YCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWY 1863
+E+ + VLT+ G + P+ +HF K + N +N+ W
Sbjct: 1571 ----SSSSQEYADIAVP----VLTSEGLQCPATDKVHFSKAYKN---INLRKKLPGCDWI 1619
Query: 1864 EVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM-WTKE 1922
V Y++ + WR+ +G+ D + + K+ ++ N W+ E
Sbjct: 1620 LVSPRYVETDG------DVDGWRELLSRLGVRDG---LVIRKERRTLTAEELANSPWSAE 1670
Query: 1923 LLSPG-SAAID-------WESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIM 1974
+ SAA D E E++ L ++ RQ + LLE+L+ WD T +
Sbjct: 1671 SATWDLSAAQDCVLEDYTCEEFEILAKAKLPGKDLLRQ-RRTLLELLEANWD---TGHLY 1726
Query: 1975 GFFKSKPTGDD-----RSFQSSFINCICDIQWTISSMDDELHY-------PKDLFHDCDA 2022
F + D R+ +SSF + +C ++W + + E P ++
Sbjct: 1727 SQFLTAQVIDSSGRPIRTTKSSFHHLLCSLEWIPAHREQEGEQQEKEYLCPGSVYLSLPE 1786
Query: 2023 VRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
V +LG Y+ + + +G++ V++D + LK W
Sbjct: 1787 VTRLLGTHIFYV--DLNPSEFTRALGIRDTVSVDLLTHYLKEW 1827
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/202 (22%), Positives = 90/202 (44%), Gaps = 21/202 (10%)
Query: 187 VPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLI 246
+P+L +L IN F+ + I +LY+L + + +EEL LGP + P++
Sbjct: 141 LPALRELQNLTREINTFVEATEALSCIFTLYELGQGLAALRNKTHYEELNLGPLCKVPVV 200
Query: 247 SHYFLV---KSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQ 303
F + D D+ +I T DV+ L + +++ + + + +F+ +A +
Sbjct: 201 HRIFKIGHNTKDKDIP-QIRTVDVLKQLYIFRKSNEEQHLTLADFMKHLADHYDCEHPNK 259
Query: 304 LGVRIQNLGMHINFIREARKSQDVTLKK----FLKELQPHHKRRKRPIFSSEKKRQLDER 359
LG+ I L + ++ I + +++ L++ KE H ++ K+ +F
Sbjct: 260 LGIVILGLHLPLSTIAKVTSTEEAVLEQCREAIEKETLDHLRKMKKSVFEP--------- 310
Query: 360 FSAICERVKSFSSINEDFGAKH 381
+ SFS+IN D K+
Sbjct: 311 ----LQSTDSFSTINMDVRKKY 328
>gi|224060363|ref|XP_002300162.1| predicted protein [Populus trichocarpa]
gi|222847420|gb|EEE84967.1| predicted protein [Populus trichocarpa]
Length = 570
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 271/570 (47%), Positives = 351/570 (61%), Gaps = 53/570 (9%)
Query: 1 MYGNHRPRFRPVGGVGGEGTGGGQPPSQPP-----PPH---FNPNFA-FQNPNIFFPNPA 51
M+G HRP +RP G GQPP + P P FNPNF Q+PN F
Sbjct: 1 MHG-HRPPYRP--------GGRGQPPQRHPLQQELPTELLTFNPNFVPLQDPNFFLHTFT 51
Query: 52 LQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQPPSSACNQQRSQTQHPPG 111
LLQ N P QNPNF IQNPN F+ + + + PP A Q Q
Sbjct: 52 NALLQQ--NFPMQNPNFPIQNPNFFLPPQQQIQYHQQQHQEPPPPGAPPPPEQQEQQQEP 109
Query: 112 ASNQQRPLTQQPQSLPPQPQKLNKEF-LERIDRAVVKARSELIMAGESVSSWKVSRSVLL 170
A+ QQ L + PQ+ + ++ NKE LERIDRAV KAR +L A E+VS+WKVS+SVL+
Sbjct: 110 ATLQQN-LPKFPQNPKKKVKQNNKELQLERIDRAVEKARQDLSDAEENVSAWKVSQSVLV 168
Query: 171 KLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIE 230
Q +SW SLGF+MQEVP+L +LMVTE++INAFIHCFVGVR+ITS+YDLE AICK+EGIE
Sbjct: 169 NFQAESWDSLGFKMQEVPALFRLMVTESKINAFIHCFVGVRRITSIYDLEVAICKNEGIE 228
Query: 231 KFEELELGPFLRQPLISHYFLVKSDVDV-VFKIATNDVIVCLSEYTDTHKAKDIKVDEFL 289
FEEL LGP +R PL+ HYF +K D VFKI + ++I+ LSE+ DT + K I VDEFL
Sbjct: 229 NFEELGLGPLMRHPLVLHYFSMKPDASAEVFKITSEEIILLLSEFMDTCQKKVIIVDEFL 288
Query: 290 DFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKK---------FLKELQPH- 339
F++K + E LGVR+Q+LG HI+FIREA+ S++ T KK LK+ Q
Sbjct: 289 HFLSKNYPVKGPEMLGVRVQSLGTHISFIREAKASENSTQKKCRETLARNGSLKKCQEAR 348
Query: 340 ------HKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFV------SS 387
+R FSSEK+R L+ERFSA+ ER+KSFS N F KHIRFV
Sbjct: 349 ASGPRVRSQRHEGRFSSEKER-LEERFSAVSERIKSFSQENYGFCGKHIRFVSSSSEDEK 407
Query: 388 SSEDEDSDDCTYECSNDISSNVQLPSQ-IKGSDRVSSCPYPSVTEELKRLGLKGEINHQL 446
S + ++ D+ T SN++ S+++ +Q I SDRVSSCPYPS TEE+ RLGLKGE Q
Sbjct: 408 SDDGKNEDEMT---SNNVGSHLRSSAQAISNSDRVSSCPYPSATEEMSRLGLKGETGSQF 464
Query: 447 TSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQD-DEL 505
+ SS + S KKRK+E++ S P+K LR NK A PI++ D+T++
Sbjct: 465 SPDCGSSRPKESNRSFFKKRKLEDASWNVSVPSKLLRSNKT--HAHPIDNFDKTEEFVTP 522
Query: 506 NEADISFSNESMRMFITTWKEACKNNTMSE 535
+E DIS S+ + FITTWKEACK+ T++E
Sbjct: 523 SEDDISLSSNDLGAFITTWKEACKDYTVAE 552
>gi|410896246|ref|XP_003961610.1| PREDICTED: uncharacterized protein LOC101079642 [Takifugu rubripes]
Length = 2475
Score = 404 bits (1038), Expect = e-109, Method: Compositional matrix adjust.
Identities = 292/942 (30%), Positives = 456/942 (48%), Gaps = 100/942 (10%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
I+E IR+ FG+G ++ +++ Q R GR+L L+ ELYS+D+HF+LEL+QNADDN
Sbjct: 799 IIEDIRKSAFGIGIELNAECQRLMQVQQERQGRSLDRLATELYSKDTHFVLELIQNADDN 858
Query: 1221 IYPE--NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
YP V P L F++++ I +LNNE GF +NIRA+CDVG STK GYIG+KGIG
Sbjct: 859 TYPSEAGVIPALAFVVEKDCITILNNEMGFQEKNIRAICDVGRSTKGKHKYGYIGQKGIG 918
Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
FKSVF+++D PEIHSNGFH++FD + G +G++LP D R L ++
Sbjct: 919 FKSVFKISDCPEIHSNGFHLRFDKNSGPMGYILPHWA---EAD---RPLDSQLEDIKQHS 972
Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
W T I LP R++ + ++F D+HPSLLLFLHRL+ I N LV + +K
Sbjct: 973 WTTKICLPLRSQ------SHQTRNLFHDVHPSLLLFLHRLRSITIYNRNEKRLVKMTRKD 1026
Query: 1399 VGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQ-- 1456
+ + +++V + W V + L I+ DV++TE+ALA L +N ++Q
Sbjct: 1027 LNNNVLEVEHTDGTERWLVVKKTLHPKKIKEDVESTELALAFQL--TNGDTESGTVWQPQ 1084
Query: 1457 -QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDL 1515
QPVFAFLPLR++G +FI+QGDF +PSSRE+VD +SPWNQWL SE P LF+ A ++ +
Sbjct: 1085 KQPVFAFLPLRSFGFRFIVQGDFDIPSSREDVDRDSPWNQWLRSEIPHLFIQAMENYPEF 1144
Query: 1516 PCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPC 1573
+Y+ F+PL EV FF + I+ L+ L ++ + + PC
Sbjct: 1145 SGL-----NGLYLYLHFIPLPDEVLDFFRPVAGQIIQLLKGKAFLPTINSDGKIVYKQPC 1199
Query: 1574 KVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLS--DSLARALGIEEY-GP---KILLQ 1627
KV + ++ L++HL L +L + + SL LG+ GP +
Sbjct: 1200 KVAVCQDAVIRDVIGGDELERHLSLSYLHPRLSSAPLTSLLTHLGVRYLQGPDVSTVTTA 1259
Query: 1628 IISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPL 1687
+ L RTE GL LA L + H G++ ++ L+ +P IPL
Sbjct: 1260 MARELTRTEGICTDGGLRQLARLLVCNFRALEHGYGETE-------SVLQTLRDLPIIPL 1312
Query: 1688 SDGTFSSVDEGTIWLHSDCSVFDGG--FGLEAFPNLCAKLRTVSPALLSASAVDKSSLGV 1745
+DG ++ E ++ + E+ L + V P+LLS L
Sbjct: 1313 ADGRVVALSEEGVFFPVETQTKKSKDLIQTESLAALYKDINVVKPSLLSC-------LEP 1365
Query: 1746 ISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYC 1805
+ + ++L +GV L ++ HI P I + ++++ YL F+ H
Sbjct: 1366 LESQQVQKLLRMLGVHDLEPQQLLDQHIYPTIQSNKWKSKPESVVVSYLVFIKEHSMSSH 1425
Query: 1806 PNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGN-PVSVNMLIHDIDIKWYE 1864
H VLTN G P+E +HF ++ N + + HD W
Sbjct: 1426 DYVHA-----------AVPVLTNRGLLCPAEEKVHFSVDYDNIDLPTQLPGHD----WVM 1470
Query: 1865 VDITYLKHPAKESLSCGLVKWRKFFKEIGITDFV-----QVVQVDKDVADISHTGFKNMW 1919
+ Y+K + WR+ F +G+ D + +V K++A + W
Sbjct: 1471 LSSCYVKTDGD------VTGWRELFSRLGVRDGLIIRKERVTLTAKELATSPWSVESRSW 1524
Query: 1920 TKELLSPG-SAAIDWESNELVHLLSL--LTTNVNRQCSKHLLEILDTLWDDCYTDKIMGF 1976
K +PG S +D E H L+ L +V Q LLE+L WD + F
Sbjct: 1525 NK---NPGESCELDDYPCEEFHALATAQLPGSVLLQQRLTLLEMLANNWDTGH--HYSQF 1579
Query: 1977 FKSKPTGDD----RSFQSSFINCICDIQWT-----ISSMDDELHY--PKDLFHDCDAVRS 2025
+++ D +S +SSF + +C ++W + E +Y P ++ + VRS
Sbjct: 1580 LRAQVVNSDGREVKSTKSSFYHYLCTLKWVPAFRPVEGEQQERNYLCPTSVYLNSPEVRS 1639
Query: 2026 ILGPSAPY--IVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
+LG Y + P S IG++ ++++ ++ LK W
Sbjct: 1640 LLGTHVCYADLTPSDFSRA----IGIRQNISVEAMINYLKEW 1677
Score = 76.6 bits (187), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 96/201 (47%), Gaps = 7/201 (3%)
Query: 156 GESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITS 215
GE V+ KV V ++Q+ S + G P+L L +N FI C V I +
Sbjct: 17 GELVTKEKVISRVCQQMQIPSLEAGGIYPGMTPALKDLQYKTREVNMFIECAEAVSSICT 76
Query: 216 LYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDV--DVVFKIATNDVIVCLSE 273
LY+L ++ + +++EEL LGP + P I F + S+ D + +I T D++ L+
Sbjct: 77 LYELGHSLAGLKDKKQYEELNLGPLCKLPRIHRIFKIDSNTKDDDIQQIETVDILKLLNN 136
Query: 274 YTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN----FIREARKSQDVTL 329
+ K + + + EF+ +A + S +LG+RI ++ + I+ I + R ++
Sbjct: 137 FRKKSKGR-VDLTEFMQHLADHYNCDSPYKLGIRINSIALCISTLNKVIHQERAFREEAQ 195
Query: 330 KKFLKELQPHHKRRKRPIFSS 350
+ +ELQ R R I S
Sbjct: 196 QLIERELQEETYERLRKIKKS 216
>gi|340371548|ref|XP_003384307.1| PREDICTED: hypothetical protein LOC100639095 [Amphimedon
queenslandica]
Length = 1639
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 334/1095 (30%), Positives = 509/1095 (46%), Gaps = 197/1095 (17%)
Query: 834 SESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDG 893
S S+TS+DA++ L AP L+DL WS+W +F PS G L +L + +V L DG
Sbjct: 629 SLSMTSQDALQYLKSAPLLADLALWSNWKLVFEPSCGSLSHFLSSHPDVH---TLCVDDG 685
Query: 894 KVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMF 953
++ID ++SV F A L S Q++ L+S +G N+P LL H + +
Sbjct: 686 VFLKIDPNSSVKDFANA-LYVSPSQSSGHLVSMVVRSGSTNNIPAQLLAQHVQTRLSGLL 744
Query: 954 KNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLD 1013
+ + + + C ++C + AV V FLD
Sbjct: 745 SDTMPPEKFVF--EC-----LVC---------------------MPVELAVTVGKEVFLD 776
Query: 1014 CLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHD 1073
L L S D LL L++T D P + R LH +GL LGI WI D
Sbjct: 777 PLKKLHG---SSKVDSLL--LKAT-DDNPLS----------RSFLHTLGLKLGIKVWIDD 820
Query: 1074 YHAFCSTGTSDLLMPC------VVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECGA 1127
Y A G +P V+ N + ++ S ES + +
Sbjct: 821 YLA---RGKQPNTIPATEPDLPVLKPVNPDLNKMDFSEESVAESFLESSFNNQSSNPSTS 877
Query: 1128 ICDTICGEASD----DGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNM 1183
+ + E S+ D L T ++E ++A IVESIR +EFG+G ++
Sbjct: 878 LNEP---EESNTIFTDVLPLSLTSPFSVEEERDNALKIVESIRSEEFGIGLEFDEKTKSV 934
Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-------ENVEPTLTFILQE 1236
+ K RL R+L LS+ELYS+D+HF+LELVQNADDN YP + P+L F L +
Sbjct: 935 MSKGKGRLDRSLQRLSEELYSKDTHFVLELVQNADDNSYPVLTCGSTSSFVPSLKFTLSK 994
Query: 1237 SGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
S IVVLNNE GFS +NIRA+CDVG STK +++GYIG+KGIGFKSVFRVT PEIHS G+
Sbjct: 995 SCIVVLNNELGFSEKNIRAICDVGRSTKDRNASGYIGQKGIGFKSVFRVTKTPEIHSRGY 1054
Query: 1297 HVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA 1356
H++FD+S IG++LP V D DP +L W T I LPF++ S
Sbjct: 1055 HIRFDSS-VPIGYILPEWVESSISDDII-----DPGELAK--WTTKIVLPFKSD-STNAG 1105
Query: 1357 MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWF 1416
++++ F D+ PSLLLFL+RL+CI+ ++ + + +++ +GII++ + W
Sbjct: 1106 SSDMMSRFHDIQPSLLLFLNRLRCIIIEDLESKVIRSMKRHDRSNGIIEIRHNDGVDKWL 1165
Query: 1417 VASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
+ + TL E+N+ L Q VFAFLPLR+YG KFI+QG
Sbjct: 1166 LVRK--------------------TLHEANKD-----LSPQQVFAFLPLRSYGFKFIVQG 1200
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASV--YMSFVP 1534
DF LPSSRE+V+ NS WNQWL E P+LF+ + + CF + ++ + F+P
Sbjct: 1201 DFDLPSSREDVNANSAWNQWLREEIPSLFMKS------IDCFESHLTAIEAIIHILKFIP 1254
Query: 1535 LVGEVHGFFSGLPRMILSKLRMSNCL-------------------------------ILE 1563
L GEVH FF + IL + CL L+
Sbjct: 1255 LEGEVHDFFQPVTSEILRIFKERKCLPTEPFPVHSSSPTDSPFDAMREITSIISEIRSLK 1314
Query: 1564 GNNN--QWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDI--VLSDSLARALGIEE 1619
G++ QW P ++L +P LL L L +L+ + LS + L I
Sbjct: 1315 GSSAGIQWKQPSQLLLIKEQSIQDCIPQSLLTASLKLYYLNSALNNHLSSQICSQLNIGS 1374
Query: 1620 YGPKILLQIISSLCRTENGLRS---MGL-------------------------SWLASWL 1651
L+ + + + + + M L W+A WL
Sbjct: 1375 INIDHLMSVAQYVLSSYHQHKEEQFMSLQSSASSAEDESDEEEVAVHPVKCLNQWIALWL 1434
Query: 1652 NELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDG 1711
+ ++ + G + E ++ L+ P IPLS+G S+D G I+ V
Sbjct: 1435 SCVHLLCEGVGGSTD-----EMSVLRKLKSFPVIPLSNGNIVSIDSGPIFF---PPVSSK 1486
Query: 1712 GFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKV 1771
+ P + +LRTV L+ + D+ + V N +LL++GV+ L+ ++++
Sbjct: 1487 DVDISYRP-VYDELRTVHTDLIDS---DEKLVAV----KANDLLLRLGVRNLNITEVIEQ 1538
Query: 1772 HILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGF 1831
HI+P E T + ++L+ YL F+ +E ++ + EL+ ++T +G
Sbjct: 1539 HIIPHFKKENTEDTSEHLLILYLSFIENAIE----RNQFSQKKAIDELKGHVVLITENGK 1594
Query: 1832 K-RPSEIPIHFGKEF 1845
+ +PS+ IHF ++
Sbjct: 1595 RLKPSDEVIHFHSKY 1609
Score = 75.5 bits (184), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 120/259 (46%), Gaps = 20/259 (7%)
Query: 90 PPQQPPSSACNQQRSQTQHPPG--ASNQQRPLTQQPQSLPPQPQKLNKEFLERIDRAVVK 147
PP P SS HPP + Q P+ + LPP P+ E ++ + R +
Sbjct: 97 PPHFPDSSV---HPRDVHHPPSFVHKDHQIPVLDR---LPPPPEFKRGENVKDVVRKI-N 149
Query: 148 ARSELIMAGESVSSWKVSRSVL---LKLQVDSWSSLGF--QMQEVPSLHKLMVTEARINA 202
+ +E+ S V SV+ LKL + G+ ++++ R
Sbjct: 150 SYAEMFTGIISSRGKHVPLSVVEDELKLLLGKAKRHGYYINLRDIDFWVNWEKLHKRTIE 209
Query: 203 FIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKI 262
FI F IT+LY+L+ + +SE + F+EL++GP L+ P I F V D++ V +I
Sbjct: 210 FIKVFCWHSPITTLYELQRILNESENVSDFKELKMGPLLKHPDIIRLFKVPEDLETVPEI 269
Query: 263 ATNDVIVCLSEYTDTHKAKDIK--VDEFLDFIAKKRSLASKEQLGVRIQN--LGMHINFI 318
DV L + D K K K +++FLDF++ S+ S + +RI + L + INF
Sbjct: 270 TAYDVHSKLFRFLDKSKNKKSKTQLEDFLDFLSHTFSIESPLHICIRITSYPLALQINF- 328
Query: 319 REARKSQDVTLKKFLKELQ 337
+ R+SQ ++ EL+
Sbjct: 329 -KFRRSQREIKERIESELE 346
>gi|348501984|ref|XP_003438549.1| PREDICTED: hypothetical protein LOC100712247 [Oreochromis niloticus]
Length = 2910
Score = 398 bits (1022), Expect = e-107, Method: Compositional matrix adjust.
Identities = 281/935 (30%), Positives = 457/935 (48%), Gaps = 82/935 (8%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
++E IR+ EFG+G ++ +++ RLGR+L LS ELYS+D+HF+LEL+QNADDN
Sbjct: 1233 VIEDIRKSEFGIGVKLTAEGQKLMQVHQERLGRSLDRLSTELYSKDTHFVLELIQNADDN 1292
Query: 1221 IYPE--NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
YP V P L F+++ I VLNNE GF +NIRA+CDVG STK GYIG+KGIG
Sbjct: 1293 SYPAEPGVVPALAFMVERDCITVLNNETGFQEKNIRAICDVGRSTKGKHKYGYIGQKGIG 1352
Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
FKSVF+VTD PEIHSNGFH++FD + G +G++LP R L +
Sbjct: 1353 FKSVFKVTDCPEIHSNGFHLRFDKNCGPMGYILPHWTED------TRPLDPQLKDINQHS 1406
Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
W T I LP RT+ + ++F D+HPSLLLFLHRL+ I N + + +K
Sbjct: 1407 WTTKISLPLRTE------SHQTRNLFHDVHPSLLLFLHRLRSITIYNQSEKRFMTMTRKD 1460
Query: 1399 VGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEG-NYGPLLYQQ 1457
+ +++V E W V L+ I+ DV++TE+ALA L + G N +Q
Sbjct: 1461 LNHNVLEVEHSEGTERWLVVKTTLKPKKIKEDVESTELALAFQLGSDSTGSNIVCQPQKQ 1520
Query: 1458 PVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPC 1517
PVFA+LPLR++G +FI+QGDF +PSSRE+VD +S WNQWL SE LF+ A F + P
Sbjct: 1521 PVFAYLPLRSFGFRFIIQGDFDIPSSREDVDRDSSWNQWLRSEIAQLFLQAMDVFTNHPE 1580
Query: 1518 FRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKV 1575
F+ K ++ F+PL EV FF + I+ L+ + L ++ + + P +V
Sbjct: 1581 FKG--LKGLCQFLQFIPLSDEVLDFFKPVAGQIIQLLKGKSFLPTLSSDGKIVYKLPSQV 1638
Query: 1576 LRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLS--DSLARALGIEEYGPKILLQIISSLC 1633
+ ++ L+KHL L +L D+ + SL LG+ + + +++
Sbjct: 1639 AVCQDPVIREVIGGGELEKHLSLSYLHPDLSPAPPTSLLTHLGVRYLRGTDVTIVTTAMA 1698
Query: 1634 R---TENGLRS-MGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSD 1689
+ T G+ S GL LA L + H G++ ++ L+ +P IPL+D
Sbjct: 1699 KELMTVGGIHSDAGLHQLARLLVCNFRALEHGYGEAD-------SILQTLRELPIIPLAD 1751
Query: 1690 GTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVS---PALLSASAVDKSSLGVI 1746
G ++ ++ + S L A + VS P+LLS L +
Sbjct: 1752 GRAVALSVEGVFFPMEESKTKKKKAQAQTGPLAALYKDVSVVHPSLLSC-------LEPL 1804
Query: 1747 SVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCP 1806
+ +L ++GV +L ++++ HI P I + + ++ YL F+ H
Sbjct: 1805 ESQQVRELLRRLGVHELEPQELLEQHIYPTIRNNKWKSKPTPVVVSYLVFIKQH------ 1858
Query: 1807 NCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVD 1866
+E+ + + VLT+ G P+ +HF +E+GN VN+ W +
Sbjct: 1859 -SSSSQEYSDTAVP----VLTSRGLLCPANERVHFSEEYGN---VNLPTKLPGYDWILLS 1910
Query: 1867 ITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLS- 1925
Y++ + WR+ F ++GI D + +++ + + + W+ E +
Sbjct: 1911 PCYVQ------TDGDVAGWRELFSKLGIKDGL-IIRKQRQTLTATELA-SSPWSVESATW 1962
Query: 1926 ---PGSAAI--DWESNELVHLLSL-LTTNVNRQCSKHLLEILDTLWD--DCYTDKIMGFF 1977
PG + D+ E L + L ++ Q LLE+L+ W+ CY+ +
Sbjct: 1963 HQVPGEGYVLDDYPCEEFHALATAELPGSLLLQQRMALLELLEANWETGHCYSQYLTAQV 2022
Query: 1978 KSKPTGDDRSFQSSFINCICDIQWTIS-------SMDDELHYPKDLFHDCDAVRSILGPS 2030
+S +SSF + +C ++W + ++ + P ++ V ++LG
Sbjct: 2023 IDSDGQPIKSTKSSFYHSLCRLEWVSAYCPQQGGQLERKYLCPSSVYLSSPEVINLLGTH 2082
Query: 2031 APYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
Y+ + S +G++ +++D ++ LK W
Sbjct: 2083 VYYV--NISSSDFSRALGMRETISVDVLINYLKEW 2115
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 117/512 (22%), Positives = 184/512 (35%), Gaps = 121/512 (23%)
Query: 603 VSGNSLEYESIEIE-------------PSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFD 649
VSGN LE ++ P ++ + K +D + + K+ +YL
Sbjct: 669 VSGNRLETALFQLAICGGSLAVPHDLVPKDKTSKPTEKTKPEDKSTKSLPTKLKQYLRDK 728
Query: 650 NSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTEL 709
+ N+A I L + S E L E F +K+F SL G F FL + +L
Sbjct: 729 LAAQNSA--------ITLAHIASLEKKLPEHFHVKDFLSLEKGSFLEFLVNNIQLL---- 776
Query: 710 QKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKI 769
Q I SS+E +A + S KQ + E +++ + S
Sbjct: 777 QDTVGSTLILGSSME-------------RAGSGFRPS----KQDVFEFIKQCGDVASTDP 819
Query: 770 EDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGL 829
++ +ES L + R+ +R G +H L++ A+ G+
Sbjct: 820 DELSYIESALRSHYGVRD---------------SRDLGHGPLHRLADLVQRQKALGGGGM 864
Query: 830 RPKMSESV----------------------TSKD-AIEILLRAPFLSDLNSWSHWDFLFA 866
P ES SKD A+ LL P L DL+ WS W+ +F
Sbjct: 865 SPICYESALFAKQGKSSAESGFESVGRLGEMSKDQALASLLSCPLLEDLSEWSQWELIFK 924
Query: 867 PSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSS 926
P G L ++ L L G + +I F A TA L+ S
Sbjct: 925 PLHGSLKEFIEKNAGNTGLAALEVTPGVLFKITTDTGDKHFSSATTALDPVGTAGHLV-S 983
Query: 927 FALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANI 986
+A G N P+ LL H + + ED+ + DV+
Sbjct: 984 MVVADGIVNAPIALLSNHMQSSLAAAVAK--EDLSLAEE----------------DVSCY 1025
Query: 987 DNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEF-RSFAADVLLSGLQSTIKDAPSA- 1044
D + ++F L+CL +P+ ++ V L + A S
Sbjct: 1026 DKV------------------AKFLLECLTRIPTRTCQALLLQVFLEPFSLVLGQAKSKR 1067
Query: 1045 ILLECHQTELRLM--LHEVGLSLGILEWIHDY 1074
+L+ Q+E R M LH +G+ LGI +WI DY
Sbjct: 1068 VLMTVAQSEPRYMNCLHRLGILLGIPDWIKDY 1099
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 108/226 (47%), Gaps = 11/226 (4%)
Query: 151 ELIMAGESVSSWKV-SRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVG 209
E+ GE V++ KV SR L +D S ++ + L L+ IN FI
Sbjct: 408 EIAGEGELVTTKKVISRVCRLVPSLDPRSVEHLPLKALKDLQYLI---REINMFIETTSA 464
Query: 210 VRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDV--DVVFKIATNDV 267
V I +LY+L +++ + +++EEL LGP + P I F + S+ D + +I T D+
Sbjct: 465 VTSICTLYELGQSLAGLKDKKRYEELNLGPLCKLPFIHRMFKIDSNTKDDDIHQIETVDI 524
Query: 268 IVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDV 327
+ L + + + EF+ +A + S +LG+RI LG+ I+ + + +++
Sbjct: 525 LKQLRVFRRKQSKVKVDLAEFMKHLADHYNCDSPYELGIRITGLGLPISTLSKVGRNEYT 584
Query: 328 TLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSI 373
L+ + +Q K + I+ E+ R+L + + SFSS+
Sbjct: 585 ILECAREVIQ---KELEEEIY--ERLRKLKKSILEPLQGAASFSSV 625
>gi|449666320|ref|XP_002160396.2| PREDICTED: uncharacterized protein LOC100210255, partial [Hydra
magnipapillata]
Length = 2259
Score = 392 bits (1006), Expect = e-105, Method: Compositional matrix adjust.
Identities = 349/1348 (25%), Positives = 591/1348 (43%), Gaps = 257/1348 (19%)
Query: 834 SESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKEL--LCLVTR 891
S+++ S A+ + P+L D++ +WD +F LG L ++ ++ N K++ L +
Sbjct: 288 SDNILSMLAVSKINALPYLEDVSKLMNWDEIFQRKLGSLKEFVKSKFNSKDVCYFFLELQ 347
Query: 892 DGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEV 951
GK +++ S++SF EA + + ++ L S
Sbjct: 348 FGKFVKVLKDCSIESFTEAFVARDAKLVSICLTS-------------------------- 381
Query: 952 MFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFF 1011
++C + + A + LS +QK +L + + + F
Sbjct: 382 ----------------------IICSNRRLEHAPLALLSNVIQKNMLTSSETSEFSEKHF 419
Query: 1012 LDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLEC------HQTELRLMLHEVGLSL 1065
L L L + + + + PS +L C R LH +G L
Sbjct: 420 LMFLTPLSTIYGNL--------------NVPSVLLKACSDCCRSRSDCCRSRLHALGFQL 465
Query: 1066 GILEWIHDYHAFCST----------------GTSDLLMPC-------VVTCTNAATSGLN 1102
I+EWI +Y S G +++ C V+ C +
Sbjct: 466 SIVEWIENYKDEFSVLKGLKSIKSDGIGLWWGYQIVILYCCNIKPLFVIICYQKNINEFM 525
Query: 1103 SG---------SGCAEGSLFESVGADVHIE-ECGAICDTICGEASDDGLGDCTTQTLPED 1152
S E + F+ +G + +IE + T+ E + + + T PE
Sbjct: 526 SKRQQQTEENEKEAIEKTKFQVIGENKNIEIKHDESISTLVNEKEELIM---ESSTFPEQ 582
Query: 1153 KECED-----------AALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQE 1201
+ ++ ++ +IRRD+FG+G E +++ R GR+L LS E
Sbjct: 583 GKVKEFFGSSSDQNNHCREVINNIRRDQFGVGIEFGEQEMKLIEVHREREGRSLQHLSNE 642
Query: 1202 LYSQDSHFLLELVQNADDNIYPE-----NVEPTLTFILQESGIVVLNNEQGFSAENIRAL 1256
LYS+DSHF+LEL+QNADDN YPE N +P + FI++ I +LNNE+GFS +I+AL
Sbjct: 643 LYSKDSHFVLELIQNADDNQYPETFFEKNEKPAVAFIVEHDKITILNNEKGFSERDIKAL 702
Query: 1257 CDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVP 1316
CDVG STK GYIG+KGIGFKSVFRVT++PEIHSNG+H+KFD S G G++LP +
Sbjct: 703 CDVGKSTKGIHCKGYIGQKGIGFKSVFRVTNSPEIHSNGYHIKFDASHGSNGYILPYWID 762
Query: 1317 PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLH 1376
+ +LL + P + ++ W T I LP + + + F D+HPSLLLFL+
Sbjct: 763 CWESQ--SKLLYQ-PSKKQNYTWITQIVLPLK----RDMQHTKLTSRFHDIHPSLLLFLN 815
Query: 1377 RLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEI 1436
RL I+ + +++ L VI ++ + + I+++S D WFV ++L V
Sbjct: 816 RLHTIIIDDKVSNVLKVIERENLDENIVRLSNNNDSALWFVVGKELNVDV---------- 865
Query: 1437 ALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQW 1496
L QPV+A+LPLR+YG KFI+Q DF +PSSRE+++ +S WNQ
Sbjct: 866 -----------------LPNQPVYAYLPLRSYGFKFIIQADFEVPSSREDINKDSAWNQC 908
Query: 1497 LLSEFPALFVSAERSFCDLPCFRENP----AKAASVYMSFVPLVGEVHGFFSGLPRMILS 1552
+L P L V A CF+++ Y+ F+ L V G F + I+
Sbjct: 909 ILQHIPKLVVEA------FHCFKKHHHFTGLDGVIKYLKFMDLEDHVLGVFKQVSHQIIK 962
Query: 1553 KLRMSNCL-ILEGNNN-------QWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKD 1604
L+ CL + + N W P ++ + ++ +LQ+ LGL +L+KD
Sbjct: 963 SLQREACLPAMSKDCNAYPQMPITWVVPSCIINP-QENVLKVISQEMLQEKLGLYYLNKD 1021
Query: 1605 I--VLSDSLARALGIEEYGPKILLQI-------ISSLCRTENGLRSMGLSWLASWLNELY 1655
+ V+S + +LG+ E ++LL+I +SL + E ++ W+ WL+ +Y
Sbjct: 1022 VNTVISKRMLISLGVNEMTLELLLEICRLIVHQFNSLPQNEQDCKAF-YQWIGQWLHTVY 1080
Query: 1656 TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGL 1715
+ Q + L+ + P+SD + S++ ++ F +
Sbjct: 1081 L-------RLQDQCDCSEKTFNILRSLYIYPMSDRSLKSLNGHVVF-----------FPV 1122
Query: 1716 EAFPNLCAKLRTVSPA------LLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIV 1769
+ ++V P LL S S L I L L +GV++++ DIV
Sbjct: 1123 SDNNKIIFNKKSVQPIHLIEQDLLILSHDFMSCLSGIEQQQLKNFLCDLGVKEITPLDIV 1182
Query: 1770 KVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNH 1829
+ I+P +NL+ Y+ + +H +Y + + F + E+R+ +LTN
Sbjct: 1183 QNCIIPTFKSGKWKEKSENLVVAYVNY--LHEQYMLDS----QIFDILEIRQCLMLLTNK 1236
Query: 1830 GFKRP--SEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSC----GLV 1883
GF+ P S PI+FG+E+ N + + D W + Y LSC V
Sbjct: 1237 GFQSPFDSTNPIYFGREYNNDIDLQDSFPSAD--WTLISSCY--------LSCLSDIEKV 1286
Query: 1884 KWRKFFKEIGITDFVQVVQVDK--DVADISHTGFKNMWTK--ELLSPGSAAIDWESNELV 1939
++F + + F+ V ++ K V+D+ + ++ + + + D E E
Sbjct: 1287 SLKRFLSSLSVHTFMVVRKITKTLHVSDLIASCWEKYFETWPQTIDDTYIIDDLECPEFN 1346
Query: 1940 HLLSLLTTNVNRQCSKHLLEILDTL--WDDCYTDKIMGFFKSKP-----TGDDRSFQSSF 1992
++ K+ EI D L +D+ ++ + F +P S+F
Sbjct: 1347 QVVD----------GKNYREIYDMLEEFDERWSSEFSKFNFIRPGCLVKNTSGEVVSSTF 1396
Query: 1993 INCICDIQ---WTISSMDDE-LHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIG 2048
+ + ++ W ++ D+ LH P+DLF D + S+LG Y+ +VKS+ + +G
Sbjct: 1397 CSFLLKMRSSAWLANNEDENSLHCPRDLFLRNDRLFSLLGNKVNYLTNRVKSKNFILTLG 1456
Query: 2049 LKTEVTIDDILEILKVWTRLEAPFMARY 2076
++TEV W R + F Y
Sbjct: 1457 VQTEVNFKRFANEFMKW-REQTVFTTNY 1483
>gi|326672130|ref|XP_003199604.1| PREDICTED: hypothetical protein LOC100151305 [Danio rerio]
Length = 868
Score = 362 bits (929), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 217/561 (38%), Positives = 316/561 (56%), Gaps = 46/561 (8%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
++E IR+ EFG+G ++ +++KQ ARLGR+L LS ELYS+D+HF+LEL+QNADDN
Sbjct: 321 VIEDIRKTEFGIGVELNQEGQILMEKQQARLGRSLQRLSDELYSKDTHFVLELIQNADDN 380
Query: 1221 IYPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
YP+ E P LTF++++S I +LNNE GF +NIRA+CDVG STK GYIG+KGIGF
Sbjct: 381 SYPDGGEQPALTFVVEKSCITILNNECGFEEKNIRAICDVGRSTKGKHKYGYIGQKGIGF 440
Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
KSVF+VTD PEIHSNGFH++FD + G +G++LP + R ++ ++ K W
Sbjct: 441 KSVFKVTDCPEIHSNGFHIQFDKTSGPMGYILPHWLEQE------RPVNPAAKEIAEKRW 494
Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
T I LP RT E N +F D+HPSLLLFLHRL+ I N LV + ++ +
Sbjct: 495 TTKIHLPLRT---ENYQTKN---LFHDVHPSLLLFLHRLRSITIFNEAEKRLVSMTRRDL 548
Query: 1400 GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
I++VS + W V + L I+ DV++TE+ALA + S+ + P +QPV
Sbjct: 549 SHNILEVSHTGGEERWLVLKKMLFPKKIKDDVESTELALAFQIHSSSCADVKP--QKQPV 606
Query: 1460 FAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFR 1519
FAFLPLR++G +FI+QGDF +PSSRE+VD +S WNQWL SE P LFV A F P F
Sbjct: 607 FAFLPLRSFGFRFIIQGDFDIPSSREDVDRDSSWNQWLRSEIPQLFVQAMDVFSKHPEF- 665
Query: 1520 ENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNN---QWAPPCKVL 1576
+ ++ F+P E+ FF+ + I+ LR CL + ++ ++ P ++
Sbjct: 666 -SGLSGLCYFLQFIPQPSEILDFFNPVANQIIQLLRGRACLPAQEDHEGRVEFRQPSQL- 723
Query: 1577 RGWNDRAHSLLPDILLQKHLGLGFLSKDIVL-----------SDSLARALGIEEYGPKIL 1625
++ PD L+Q+ +G LS+ + L S SL ALG+ +
Sbjct: 724 --------AVCPDRLIQQVIGGQDLSRSLSLWYLHPDLQCSVSGSLLSALGVHRLQAAEI 775
Query: 1626 LQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETD-LIDNLQRIPF 1684
L + + R L L F +L+S E D L+ +++ +P
Sbjct: 776 LTVTCCMARELLQHTQTHTDGELEKLARLLVCIFR-----ALESEYEVDALLQSVRDVPM 830
Query: 1685 IPLSDGTFSSVDEGTIWLHSD 1705
IPL+DG+ S+ ++ D
Sbjct: 831 IPLADGSVVSLSSEGVFFPLD 851
Score = 79.7 bits (195), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/234 (28%), Positives = 95/234 (40%), Gaps = 42/234 (17%)
Query: 846 LLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVD 905
LL AP L DL W+ W +F P+ GPL ++ ELL L G ++R+
Sbjct: 4 LLSAPLLQDLQDWTQWQLVFLPNHGPLKDFIDKYCGKTELLALEVSPGVLLRVTSETGDK 63
Query: 906 SFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINS 965
F AA +TA L+ S A+A G N P LL H A V
Sbjct: 64 HFSVAAQALDPLRTAGHLV-SMAVADGIPNTPTALLANHMESALNV-------------- 108
Query: 966 QNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEF-RS 1024
G +++ D+ G + +RF L+C+ +P +
Sbjct: 109 -----------GVAQEEISPGDD------------GYSYSTVARFVLECVTLIPIRICKE 145
Query: 1025 FAADVLLSGLQSTIKDAPS-AILLECHQTELRLM--LHEVGLSLGILEWIHDYH 1075
V L L + A S A+LLE QT R + LH++GL LGI EW D++
Sbjct: 146 LLQQVFLEPLSKVLGQAKSKAVLLEAAQTNTRHLNKLHQMGLLLGITEWRADFN 199
>gi|449686810|ref|XP_002155553.2| PREDICTED: uncharacterized protein LOC100210004, partial [Hydra
magnipapillata]
Length = 797
Score = 360 bits (923), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 242/740 (32%), Positives = 384/740 (51%), Gaps = 71/740 (9%)
Query: 1138 DDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHC 1197
++ L +C T + D C I+ IR +FG+G + E + + R GR+L
Sbjct: 101 EEVLNECVTSDIQNDY-CHK---IINDIRCKQFGIGLELGEQEKKLFEILREREGRSLQR 156
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENV---EPTLTFILQESGIVVLNNEQGFSAENIR 1254
LS ELYS DSHF+LEL+QNADDN Y EN +PT+ FI++ I++LNNE+GFS +I+
Sbjct: 157 LSNELYSNDSHFVLELIQNADDNQYIENSNNEKPTVAFIVEHDQIIILNNEKGFSESDIK 216
Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
ALCDVG STK GYIG+KGIGFKSVFRVTD PEIHSNG+H+KFD S G G++LP
Sbjct: 217 ALCDVGKSTKGIHRKGYIGQKGIGFKSVFRVTDMPEIHSNGYHIKFDASSGSNGYILPNW 276
Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
+ +N D + + P + + W T I LP + + ++ F D+H SLLLF
Sbjct: 277 IYDWNSD---QRSTYQPCESRNYGWKTQIVLPLKKEMQHS----KLISRFHDIHSSLLLF 329
Query: 1375 LHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
L+RL I+ + +++ + V+ ++ V I+++S WFV +++ IR D+K+T
Sbjct: 330 LNRLHTIIIDDKVSNIMKVMEREHVEKNIVRLSNNNYSRLWFVVEKEIPID-IREDIKST 388
Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWN 1494
IA A + P Q VFAFLPLR+YG KFI+Q DF +PSSRE++D +S WN
Sbjct: 389 NIAFAFPFNNKLKAEALP---AQQVFAFLPLRSYGFKFIIQADFEVPSSREDIDKDSAWN 445
Query: 1495 QWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKL 1554
Q++L + P L V A SF F+ + Y+ F+ L V G F + + ++ L
Sbjct: 446 QYILQQLPKLTVEAFYSFKVHQYFK--GLEGVIEYLKFMDLEDNVSGVFKQVSQQMIKSL 503
Query: 1555 RMSNCLILEGNNNQ----WAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS 1610
+ CL + N + W P ++ + + P+I LQ+ LGL +L+ DI + +
Sbjct: 504 QREACLPVIPRNIENILNWVIPKHIVFSQQEIYKVISPEI-LQEKLGLYYLNNDIDCT-A 561
Query: 1611 LAR---ALGIEEYG-------PKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFH 1660
L R +LG+ E K+++Q SS + M W+ WL+ +YT
Sbjct: 562 LKRVLTSLGVCELNFEQLIEVCKLVVQEFSSSSQQIKNPDKMQ-QWIGKWLHLVYT---- 616
Query: 1661 SSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPN 1720
S Q+S +T D ++ + P+++G+ S+ E D F + P
Sbjct: 617 -SLQNSCDCSKKT--FDIIRSLHIFPMTNGSMMSLKEN-----------DAFFPVNYIPK 662
Query: 1721 LCAK------LRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHIL 1774
K L + L S+ SL I + + +++ +GV+ ++ D++ HI+
Sbjct: 663 HGEKKEDKRMLDLIENDLNILSSNLMESLDHIEQEQVKKLMCDLGVKYITPSDLIHNHII 722
Query: 1775 PAISDETTANGDKNLMADYLCFVMMHLEYYC-PNCHVEREFIVSELRKKAFVLTNHGFKR 1833
P+ + + L+ Y+ V + +Y P+ F ++E+ + ++TN GF+
Sbjct: 723 PSFKSGLWKSKPEKLVVAYV--VYLEGQYRAQPDM-----FDIAEVSQSVMLMTNKGFRS 775
Query: 1834 P--SEIPIHFGKEFGNPVSV 1851
P PI+ ++GN +++
Sbjct: 776 PLDQSNPIYLTPQYGNEINL 795
>gi|147783924|emb|CAN74689.1| hypothetical protein VITISV_001349 [Vitis vinifera]
Length = 566
Score = 337 bits (865), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 188/362 (51%), Positives = 247/362 (68%), Gaps = 14/362 (3%)
Query: 222 AICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAK 281
AICK+EG+E+FE+LELGP +R PLI HYF + SD VFKI + ++I CL E+ + K K
Sbjct: 2 AICKNEGVEQFEDLELGPLVRHPLIMHYFSISSDASGVFKITSAEIISCLDEFMEACKDK 61
Query: 282 DIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQ--PH 339
I ++EFL++IAKKRSL +E+LGVRIQ+LGMHI+FIREARK + TLKK L+ P
Sbjct: 62 HIIIEEFLEYIAKKRSLTGRERLGVRIQSLGMHISFIREARKLEHXTLKKSQGSLKQIPD 121
Query: 340 HKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTY 399
K R+ P+ SSEKK+ LDERFSA+ +RVKSF+S ++DFG KH FVSS SE++ SDD Y
Sbjct: 122 KKIREHPLRSSEKKK-LDERFSAMSQRVKSFASAHDDFGGKHTIFVSSCSEEDGSDDHKY 180
Query: 400 ECSN---DISSNVQLPS-QIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQ 455
E S D SN + S K DRVSSCPYPS EE+ RLGLKGE +++G SS
Sbjct: 181 EESEEDIDGCSNSKFSSPNSKTRDRVSSCPYPSAIEEMTRLGLKGETEGNPSASG-SSMH 239
Query: 456 DDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQD-DELNEADISFSN 514
+ G K+KRK N CT S K +RNK + L ++ ++ ++ + LNEAD +N
Sbjct: 240 SENTGPFKRKRKSSNRSCTVSKYLKLPKRNKLELVPLSVDHXNEKKELNNLNEADFLLAN 299
Query: 515 ESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNI 574
+SMRMFITTWKEAC +T++EVLE+M QF+ ++ K MK M SSYP +GLLN+
Sbjct: 300 DSMRMFITTWKEACXEHTIAEVLERMLQFHGTQTKQR-----KIMKSMLSSYPFVGLLNV 354
Query: 575 AV 576
A+
Sbjct: 355 AI 356
>gi|4455303|emb|CAB36838.1| hypothetical protein [Arabidopsis thaliana]
gi|7268078|emb|CAB78416.1| hypothetical protein [Arabidopsis thaliana]
Length = 605
Score = 329 bits (843), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 224/599 (37%), Positives = 319/599 (53%), Gaps = 96/599 (16%)
Query: 26 PSQPPPPHFNPNFAFQN-PNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLT 84
P + P P F NF N P+ NP Q QNL + Q P+ N F+QN N
Sbjct: 50 PIRYPIPQFPANFYRPNFPDFSLGNPNFQPHQNLNFLHQQIPHQYGSAANHFLQNHN--- 106
Query: 85 QNP-NLPPQQPPSS--ACNQQRSQTQHPPGASNQQRPLTQQ----PQSLPPQPQKLNKEF 137
QN + PPQ P++ + +Q ++ +Q + Q P+S + N F
Sbjct: 107 QNSFSFPPQSIPNNDISISQNHGAFENSSLKRRRQEEVVQVTDVVPKSNFASGESANNSF 166
Query: 138 --------------------------------LERIDRAVVKARSELIMAGESVSSWKVS 165
+ RID+AV K R + AGESVSS +VS
Sbjct: 167 SVSLPIPIATDDSGVSRVHGEKSSGKPKRKVDVLRIDKAVNKTRKLFVAAGESVSSTRVS 226
Query: 166 RSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICK 225
R+VL +LQ DSW SLG QMQ+VPSL +LM E +INAFIHCFVG R+I +L+DLE AIC+
Sbjct: 227 RAVLEELQADSWRSLGVQMQDVPSLRQLMAIEGKINAFIHCFVGARRIVTLHDLEVAICR 286
Query: 226 SEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKV 285
+E ++ F++LELGP L+ PL+ YF S KI + ++I L Y T+ +D+K+
Sbjct: 287 NEFVDSFDDLELGPLLQHPLVLLYFPSISSSTGPVKITSEEIISFLDSYLHTYMTEDVKL 346
Query: 286 DEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRKR 345
DEFL+F+A ++S+ SKE+LGVRIQ+L M+++FI +A++ + TLK L EL H++
Sbjct: 347 DEFLNFVASQKSVTSKEKLGVRIQSLRMYVSFILDAKRQEGETLKVLLTEL---HQKYHI 403
Query: 346 PIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSNDI 405
P S KK+Q D+ + + ER SF+ ++D+ KHIRF SSSS D +D YE N
Sbjct: 404 P---SSKKQQRDKSLT-VSERADSFALHHKDYCGKHIRFDSSSS---DENDNVYEVRN-- 454
Query: 406 SSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKK 465
+ SD ++SCPYPSV EE+KRL G S KK
Sbjct: 455 ---------LNSSDHINSCPYPSVAEEMKRL-----------------------GGSNKK 482
Query: 466 RKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQDD-------ELNEADISFSNESMR 518
RK E S +K LR++ +K + + + DD ++EAD + S +R
Sbjct: 483 RKGERRNHEKSDSSKLLRKSPSKLQGHAKQEIPKLADDSEAKKVFSVDEADFTLSEGDLR 542
Query: 519 MFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVT 577
+FI+TWK+ CK ++S +EKM FY + A+ KR K M SS+P +GLLN+AV+
Sbjct: 543 LFISTWKDTCKELSISTFVEKMLSFYNLGGSEGRAQ-IKRAKAM-SSFPFVGLLNVAVS 599
>gi|449679290|ref|XP_002168168.2| PREDICTED: uncharacterized protein LOC100211694, partial [Hydra
magnipapillata]
Length = 366
Score = 317 bits (811), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 231/369 (62%), Gaps = 14/369 (3%)
Query: 1149 LPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSH 1208
L D + + I+ IRR++FG+G + E +++ R GR+L LS ELYS+DSH
Sbjct: 9 LTSDVQNDHCHKIINDIRREQFGIGLELGEQEKKLIEIHREREGRSLQRLSNELYSKDSH 68
Query: 1209 FLLELVQNADDNIYPENV---EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
F+LEL+QNADDN Y EN +PT+ FI++ I++LNNE+GFS NI+ALCDVG STK
Sbjct: 69 FVLELIQNADDNQYIENSNNEKPTVAFIVEHDQIIILNNEKGFSESNIKALCDVGKSTKG 128
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
GYIG+KGIGFKSVFRVTD PEIHSNG+H+KFD S G G++LP + +N D
Sbjct: 129 IHRKGYIGQKGIGFKSVFRVTDMPEIHSNGYHIKFDASSGSNGYILPNWIYDWNSDQRS- 187
Query: 1326 LLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN 1385
+ P + + W T I LP + + + + ++ F D+H SLLLFL+RL I+ +
Sbjct: 188 --TYQPCESRNYGWKTQIVLPLKKE----MQHSKLISRFHDIHSSLLLFLNRLHTIIIDD 241
Query: 1386 MLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQES 1445
+++ + V+ ++ V I+++S D WFV +++ V R DVK+T IA A
Sbjct: 242 KVSNIMKVMEREHVEKNIVRLSNNNDSRLWFVVEKEIPIDV-REDVKSTNIAFAFPFNNK 300
Query: 1446 NEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALF 1505
+ P Q VFAFLPLR+YG KFI+Q DF +PSSRE+++ +S WNQ++L + P L
Sbjct: 301 LKTEALPA---QQVFAFLPLRSYGFKFIIQADFEVPSSREDINKDSAWNQYILQQLPKLT 357
Query: 1506 VSAERSFCD 1514
V A SF D
Sbjct: 358 VEAFHSFKD 366
>gi|297804942|ref|XP_002870355.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316191|gb|EFH46614.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 608
Score = 309 bits (791), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 191/445 (42%), Positives = 270/445 (60%), Gaps = 53/445 (11%)
Query: 140 RIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEAR 199
RID+AV K R ++ AGESVSS +VSR+VL +LQ DSW SLG QMQ+VPSL +LM E +
Sbjct: 204 RIDKAVNKTRKSVVAAGESVSSTRVSRAVLEELQADSWRSLGVQMQDVPSLRQLMAIEGK 263
Query: 200 INAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVV 259
INAFIHCFVG R+I +L+DLE AIC++E ++ F++LELGP L+ PL+ YF S
Sbjct: 264 INAFIHCFVGARRIVTLHDLEVAICRNEFVDSFDDLELGPLLQHPLVLLYFPSISSSTGP 323
Query: 260 FKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIR 319
+I + ++I L Y T+ +D+K+DEFL+F+A ++S+ SKE+LGVRIQ+L M+++FI
Sbjct: 324 VQITSEEIISFLDSYLHTYMTEDVKLDEFLNFVASQKSVTSKEKLGVRIQSLRMYVSFIL 383
Query: 320 EARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGA 379
+A++ + TLK L EL +K I SS+K+RQ ++ + ER SF+ ++D+
Sbjct: 384 DAKRQEGETLKVLLTELH-----QKYQIPSSKKQRQ--DKPLTVSERADSFALHHKDYCG 436
Query: 380 KHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLK 439
KHIRF SSSS+D DS C YE N S SD ++CPYPSV EE+KRL
Sbjct: 437 KHIRFDSSSSDDNDS--C-YEVRNKNS-----------SDHFNTCPYPSVAEEMKRL--- 479
Query: 440 GEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQ 499
G S KKRK E S +K LR++ +K + + +
Sbjct: 480 --------------------GGSNKKRKGEKRNHEKSDSSKLLRKSPSKLQGHAKQEIPR 519
Query: 500 TQDD-------ELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKA 552
DD ++EAD + S +R+FI+TWK+ CK ++S +EKM FY +
Sbjct: 520 LTDDSDAKKVFSVDEADFTLSEGDLRLFISTWKDTCKELSISTFVEKMLSFYNLGGSEGR 579
Query: 553 ARYFKRMKLMFSSYPCIGLLNIAVT 577
A+ KR K M SS+P +GLLN+AV+
Sbjct: 580 AQ-IKRAKAM-SSFPFVGLLNVAVS 602
>gi|115456025|ref|NP_001051613.1| Os03g0803800 [Oryza sativa Japonica Group]
gi|113550084|dbj|BAF13527.1| Os03g0803800, partial [Oryza sativa Japonica Group]
Length = 642
Score = 307 bits (786), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 181/488 (37%), Positives = 280/488 (57%), Gaps = 31/488 (6%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
LER A K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L L + E
Sbjct: 179 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238
Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
++N FIHC+V R+I S+YDLE ICK+E I +FEEL LGPFL+ PL++HYF V +D+
Sbjct: 239 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298
Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
+V K++++++I L ++ D K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 299 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 357
Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
+R+AR+++ +K ++ + + K K+ LDERFSAI R+K I
Sbjct: 358 LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 417
Query: 374 NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
N KHI F S+ ++ D D + + D S + + I D RVSSCPYPS T
Sbjct: 418 N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 471
Query: 431 EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
EE++RLGLK E + + + + G +++KRK E +G +S+ +
Sbjct: 472 EEMERLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 521
Query: 491 ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
P + + S + FITTWKEAC+ + + +VLE + +Y T
Sbjct: 522 --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAET--- 576
Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
+ K++ FS YP IG LN+AV ++ G+ DS+YD + + + ++ + + E
Sbjct: 577 --PKEKKKIIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTEL 634
Query: 611 ESIEIEPS 618
+E PS
Sbjct: 635 MEVEPPPS 642
>gi|240254043|ref|NP_172307.5| no vein-like protein [Arabidopsis thaliana]
gi|332190150|gb|AEE28271.1| no vein-like protein [Arabidopsis thaliana]
Length = 746
Score = 287 bits (734), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 191/505 (37%), Positives = 276/505 (54%), Gaps = 82/505 (16%)
Query: 115 QQRPLT---QQPQSLP-PQPQ---KLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRS 167
QQ+P+ Q Q+L P PQ N E + RID+AV R LI AG++VSS +VS+S
Sbjct: 95 QQQPIRYNFQTQQNLNFPHPQFGGGANVELI-RIDKAVNNTRKSLIAAGDNVSSTRVSQS 153
Query: 168 VLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSE 227
VL +LQ D+W SLG MQ+VPSL +LM E +I AFIHCF+G R I +L+DLE AIC++E
Sbjct: 154 VLAQLQADTWRSLGIWMQDVPSLRQLMALEGKIIAFIHCFIGARGIVTLHDLEVAICQNE 213
Query: 228 GIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDE 287
+ F++L LGP L+ PL+ YF KI + ++I L Y +T+ D+K+DE
Sbjct: 214 FVGCFDDLGLGPLLQHPLVLLYFPSVYCSTAPVKITSEEIISLLDSYLNTYDIDDVKLDE 273
Query: 288 FLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRKRPI 347
FLDF+A+ +++ SKE+LGVRIQNL M+++FI++A++ + LK L EL H K R I
Sbjct: 274 FLDFVAEAKAVTSKEKLGVRIQNLRMYVSFIQDAKRQEGEILKIVLTEL--HQKYR---I 328
Query: 348 FSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISS 407
SS+K+RQ ++D+ KH RF S SSE+ DS D YE N
Sbjct: 329 LSSKKQRQ------------------DKDYCGKHTRFNSPSSEENDSAD--YEVEN---- 364
Query: 408 NVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRK 467
+K SD SSCPY S EE+K+L GSS KKRK
Sbjct: 365 -------VKRSDHFSSCPYSSAEEEVKQL-----------------------GSSSKKRK 394
Query: 468 IENSGCTSSAPAKFLRRNKAKQR------ALPIESGDQ-TQDDELNEADISFSNESMRMF 520
E+ S K LRR +K R +P + D Q +N+AD + S ++++F
Sbjct: 395 AESRNHEKSDSPKLLRRGPSKLRRGHVKQKIPKSADDSDAQIFSVNDADFTLSEGALKLF 454
Query: 521 ITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTSIK 580
I+TWKE CK ++S ++K+ FY + + ++ SS+P +GLL++A +
Sbjct: 455 ISTWKETCKELSISVFVKKLLSFYNLGGSEVHGQI--KIATAVSSFPFVGLLHVAKELVT 512
Query: 581 RGMWDSVYDTLQAITQQELNNTVSG 605
+ + Q T+ + N V+G
Sbjct: 513 EALEE------QIPTEITITNLVAG 531
>gi|6579199|gb|AAF18242.1|AC011438_4 T23G18.16 [Arabidopsis thaliana]
Length = 512
Score = 283 bits (723), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 187/480 (38%), Positives = 267/480 (55%), Gaps = 79/480 (16%)
Query: 115 QQRPLT---QQPQSLP-PQPQ---KLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRS 167
QQ+P+ Q Q+L P PQ N E + RID+AV R LI AG++VSS +VS+S
Sbjct: 95 QQQPIRYNFQTQQNLNFPHPQFGGGANVELI-RIDKAVNNTRKSLIAAGDNVSSTRVSQS 153
Query: 168 VLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSE 227
VL +LQ D+W SLG MQ+VPSL +LM E +I AFIHCF+G R I +L+DLE AIC++E
Sbjct: 154 VLAQLQADTWRSLGIWMQDVPSLRQLMALEGKIIAFIHCFIGARGIVTLHDLEVAICQNE 213
Query: 228 GIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDE 287
+ F++L LGP L+ PL+ YF KI + ++I L Y +T+ D+K+DE
Sbjct: 214 FVGCFDDLGLGPLLQHPLVLLYFPSVYCSTAPVKITSEEIISLLDSYLNTYDIDDVKLDE 273
Query: 288 FLDFIAKKRSLASKEQLGVRIQNL---GMHINFIREARKSQDVTLKKFLKELQPHHKRRK 344
FLDF+A+ +++ SKE+LGVRIQNL M+++FI++A++ + LK L EL H K R
Sbjct: 274 FLDFVAEAKAVTSKEKLGVRIQNLRYDQMYVSFIQDAKRQEGEILKIVLTEL--HQKYR- 330
Query: 345 RPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSND 404
I SS+K+RQ ++D+ KH RF S SSE+ DS D YE N
Sbjct: 331 --ILSSKKQRQ------------------DKDYCGKHTRFNSPSSEENDSAD--YEVEN- 367
Query: 405 ISSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKK 464
+K SD SSCPY S EE+K+L GSS K
Sbjct: 368 ----------VKRSDHFSSCPYSSAEEEVKQL-----------------------GSSSK 394
Query: 465 KRKIENSGCTSSAPAKFLRRNKAKQR------ALPIESGDQ-TQDDELNEADISFSNESM 517
KRK E+ S K LRR +K R +P + D Q +N+AD + S ++
Sbjct: 395 KRKAESRNHEKSDSPKLLRRGPSKLRRGHVKQKIPKSADDSDAQIFSVNDADFTLSEGAL 454
Query: 518 RMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVT 577
++FI+TWKE CK ++S ++K+ FY + + ++ SS+P +GLL++AV+
Sbjct: 455 KLFISTWKETCKELSISVFVKKLLSFYNLGGSEVHGQI--KIATAVSSFPFVGLLHVAVS 512
>gi|432926042|ref|XP_004080801.1| PREDICTED: uncharacterized protein LOC101162250 [Oryzias latipes]
Length = 2775
Score = 273 bits (698), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 194/316 (61%), Gaps = 13/316 (4%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
++E IR+ EFG+G +S +++ Q RLGR L LS ELYS+D+HF+LEL+QNADDN
Sbjct: 1030 VIEDIRKSEFGIGVELSAEGRRLMEVQQERLGRGLDRLSTELYSKDTHFVLELIQNADDN 1089
Query: 1221 IYP-ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
Y E P+L F++++ I +LNNE GF +N+RA+CDVG STK GYIG+KGIGF
Sbjct: 1090 SYSSETSVPSLAFVVEQDCITILNNESGFQEKNVRAICDVGRSTKGKHKYGYIGQKGIGF 1149
Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
KSVF+VTD PEIHSNGFH++FD S G +G++LP R L L W
Sbjct: 1150 KSVFKVTDCPEIHSNGFHLRFDRSCGPMGYILPHWAEDD------RPLDPQLQLLRQHGW 1203
Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
T IRLP R SE N +F D+HPSLLLFLHRL+ I N LV + +K +
Sbjct: 1204 TTKIRLPLR---SESRQTRN---LFHDVHPSLLLFLHRLRSISIYNHSEKRLVTMTRKDL 1257
Query: 1400 GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
G +++V E W V L ++ DV++TE+ALA L S + +QPV
Sbjct: 1258 GHNVLEVEHSEGIERWLVVKTTLYPKKVKQDVESTELALAFQLGSSPPEDVACQPQKQPV 1317
Query: 1460 FAFLPLRTYGLKFILQ 1475
FA+LPLR++G +FI+Q
Sbjct: 1318 FAYLPLRSFGFRFIVQ 1333
Score = 117 bits (293), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 146/621 (23%), Positives = 255/621 (41%), Gaps = 76/621 (12%)
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
GDF +PSSRE+VD +S WNQWL SE P LF+ A F D F+ K ++ F+PL
Sbjct: 1403 GDFDIPSSREDVDRDSSWNQWLRSEIPQLFLQAMEVFSDHSEFKG--LKGLCRFLQFIPL 1460
Query: 1536 VGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLRGWNDRAHSLLPDILLQ 1593
EV FF + I+ L+ L ++ + P +V + S++ L+
Sbjct: 1461 PDEVLDFFKPVAGQIIQLLKGKAFLPTRNSDGGVVYKLPSQVAICQDAVIRSVIGSDELE 1520
Query: 1594 KHLGLGFLSKDIVLS--DSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLAS-- 1649
K+L L +L D+ + SL LG+ + + +++ + L++ G +
Sbjct: 1521 KYLSLSYLHPDLSPTPPTSLLTHLGVRYLRGSDVTTVTTAMAK--ELLKAGGFHEEENIH 1578
Query: 1650 WLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVD-EGTIWLHSDCSV 1708
L L +F + G ++ NL+ +P IPL+DG+ S++ EG + +
Sbjct: 1579 QLARLLVCNFRAMEHG---YGDAESILQNLKDLPIIPLADGSVVSLNGEGVFFPVEEAKT 1635
Query: 1709 F----DGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLS 1764
G G A L + V P+LLS L + + +L ++GV +L
Sbjct: 1636 KKKKAQGQTG--AVSALYQDVSVVHPSLLSC-------LEPLESQQVRELLKRLGVHELE 1686
Query: 1765 AHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAF 1824
++++ HI P I + + ++ YL F+ H V
Sbjct: 1687 PQELLEQHIYPTIKSKKWKSKPAAVVVSYLVFIKQHSSSSKGYADV-----------AVP 1735
Query: 1825 VLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDI--DIKWYEVDITYLKHPAKESLSCGL 1882
VLT+ G P+ +HF KE+ NM +H W V Y++ +
Sbjct: 1736 VLTSRGLLCPATERVHFSKEYR-----NMDLHGTLPGCDWILVSPCYVQ------TDGDV 1784
Query: 1883 VKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAA----IDWESNEL 1938
WR+ +G+ D + + + + A + W E SAA +D E
Sbjct: 1785 DGWRELLSSLGVRDGLIIRKERR--ALTAEELANGPWAAERAMGLSAAQDCLLDDYPCEE 1842
Query: 1939 VHLLSL--LTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDD-----RSFQSS 1991
LL+ L+ + Q LLE+L+ W+ T + F + D R+ +SS
Sbjct: 1843 FRLLATAPLSGSALLQQRTALLELLEANWN---TGHLYSQFLTAQVIDSNGRPIRNTKSS 1899
Query: 1992 FINCICDIQWTISSMDDELHY-------PKDLFHDCDAVRSILGPSAPYIVPKVKSEKLV 2044
F + +C ++W + + E P ++ V +LG Y+ + S +
Sbjct: 1900 FHHLLCSLEWIPAYREQEGEQQEKKCLCPGSVYLPLPEVTRLLGTHVFYV--DLNSSEFS 1957
Query: 2045 CDIGLKTEVTIDDILEILKVW 2065
+G++ +++D ++ LK W
Sbjct: 1958 RALGIREVISVDVLIHFLKEW 1978
Score = 77.4 bits (189), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 103/439 (23%), Positives = 167/439 (38%), Gaps = 63/439 (14%)
Query: 643 SKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHA 702
+K EF L + S VT L + + E LT+ F+I F SL G F FL K+
Sbjct: 530 AKVKEFVRENLRSQNSAVT-----LAHIAALERKLTKHFQIPHFLSLERGTFLEFLVKNI 584
Query: 703 SMLSTELQKL-FTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQ 761
+L L L F ++ + + + + + Q ++ S+ + LR
Sbjct: 585 QLLQESLGSLLFLENNKNLAGCGFRPSKQDVFEFIQQC-GSMTPSDPDELCGMESALRAH 643
Query: 762 FPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETT 821
+ + + G + + V + R E+ + V L S +S H S +
Sbjct: 644 YGVRDSRDLGYGPLPTLAGLVQRQR-ELGGRGV-----SLICYESAVVSKHRNQSTESGS 697
Query: 822 SAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVN 881
++ G ++ A+ LL P L DL+ WS WDF+F P GPL ++
Sbjct: 698 ESVGRLG-------DMSKGQALASLLCCPLLEDLSEWSQWDFVFKPLHGPLKDFIERNAA 750
Query: 882 VKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLL 941
L L R G +++I + F A TA L+ S +A G N P LL
Sbjct: 751 NTGLAALEVRPGLLLKITTNTGDKHFFRAVAALDPVGTAGHLV-SMVVADGVSNAPTALL 809
Query: 942 KCHARHAFE-VMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKF 1000
H + A M K ++ E S R+
Sbjct: 810 ANHMQSALAGAMAKEDLSQAEEDISCYSRV------------------------------ 839
Query: 1001 GKAVPVASRFFLDCLGYLPSEF-RSFAADVLLSGLQSTIKDAPSA-ILLECHQTELRLM- 1057
++F LDCL +P+ ++ V L + A S +L+ Q++LR +
Sbjct: 840 -------AKFLLDCLIRIPTRTCQALLQKVFLEPFSLVLGQAKSKQVLIRTAQSDLRHLN 892
Query: 1058 -LHEVGLSLGILEWIHDYH 1075
LH +G+ G+ +WI DY
Sbjct: 893 CLHRLGILTGVTDWIRDYQ 911
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 96/203 (47%), Gaps = 7/203 (3%)
Query: 187 VPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLI 246
+P+L+ L IN FI V I +LY+L +A+ + + +EEL LGP + P+I
Sbjct: 252 LPALNSLQYLTKEINLFIEATEAVSCICTLYELGQALAALKNKKHYEELNLGPLCKLPVI 311
Query: 247 SHYFLVKSDV--DVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQL 304
F + S+ D + KI T D++ L + + K + + EF+ +A +L
Sbjct: 312 YRMFKMDSNTKDDDIPKIRTVDILKQLYYFRKSQKQPKLDLAEFMKHLADHYDCELPYKL 371
Query: 305 GVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAIC 364
G+RI + G+ I + +++ L++ + +Q ++ + E+ R++ +
Sbjct: 372 GIRISSFGLPIATVSRVARNERTMLEQCREVIQKELEQE-----ALERLRKIKDSVFKPL 426
Query: 365 ERVKSFSSINEDFGAKHIRFVSS 387
SFS+IN D K+ ++
Sbjct: 427 HSPGSFSTINMDVRKKYTSMTAA 449
>gi|297843606|ref|XP_002889684.1| hypothetical protein ARALYDRAFT_470891 [Arabidopsis lyrata subsp.
lyrata]
gi|297335526|gb|EFH65943.1| hypothetical protein ARALYDRAFT_470891 [Arabidopsis lyrata subsp.
lyrata]
Length = 456
Score = 270 bits (691), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 173/446 (38%), Positives = 247/446 (55%), Gaps = 70/446 (15%)
Query: 140 RIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDS-WSSLGFQMQEVPSLHKLMVTEA 198
RID+A+ R LI AGE+VSS +VS+SVL +LQ D W SLG QMQ+VPSL +LM E
Sbjct: 71 RIDKAINNTRKSLIAAGENVSSTRVSQSVLEQLQADDIWRSLGIQMQDVPSLRQLMALEG 130
Query: 199 RINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDV 258
+I AFIHCF+G R I +L+DLE AIC++E + F++L LGP L+ PL+ YF S
Sbjct: 131 KIIAFIHCFIGARGIVTLHDLEVAICRNEFVGCFDDLGLGPLLQHPLVLLYFPSISGSTA 190
Query: 259 VFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFI 318
+I + ++I L + T+ D+K+DEFLDF+A+++++ SK++LGVRIQ+L M+++FI
Sbjct: 191 PVQITSEELISFLDSFLKTYDMDDVKLDEFLDFVAEEKAVTSKDKLGVRIQSLRMYVSFI 250
Query: 319 REARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFG 378
+ A++ + TLK L L +K I S+K++Q ++D+
Sbjct: 251 QNAKRQEGETLKILLTGLH-----QKYHILPSKKQQQ------------------DKDYC 287
Query: 379 AKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGL 438
KH RF S SSED DS D YE N + D+ SSCPYPSV EE+K+
Sbjct: 288 GKHTRFNSLSSEDNDSAD--YEVEN-----------VNSCDQFSSCPYPSVAEEMKQ--- 331
Query: 439 KGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPI---- 494
SS KKRK E+ S +K RR +K +
Sbjct: 332 --------------------PVSSNKKRKAESRSHEKSDLSKLRRRYPSKLHRGHVKQEI 371
Query: 495 ----ESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
+ D Q NEAD + S ++R+FI+TWK+ CK +MS ++K+ FY +
Sbjct: 372 HNLADDSDAKQIFSGNEADFTLSEGALRLFISTWKDTCKELSMSMFVKKLLSFYNLGGSE 431
Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAV 576
A+ KR K M SS+P GLL++AV
Sbjct: 432 VQAQ-IKRAKAM-SSFPFFGLLHVAV 455
>gi|126179420|ref|YP_001047385.1| hypothetical protein Memar_1474 [Methanoculleus marisnigri JR1]
gi|125862214|gb|ABN57403.1| hypothetical protein Memar_1474 [Methanoculleus marisnigri JR1]
Length = 1625
Score = 260 bits (665), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 185/654 (28%), Positives = 319/654 (48%), Gaps = 65/654 (9%)
Query: 1161 IVESIRRDEFG--LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNAD 1218
IVE I + G L + N K++ L AL LS++LY + +HF++EL+QNA+
Sbjct: 10 IVEQIHNENTGHRLKGLRNRRTKNAWNKKNRDLDSALQKLSEDLYRKKTHFVMELIQNAE 69
Query: 1219 DNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK-KGSSAGYIGRKGI 1277
DN Y V+P++ F ++ +V+ NNE GF+AE++ +C VG + K K S GYIG KGI
Sbjct: 70 DNTYDPAVQPSIVFSIKPDTLVIYNNEIGFTAEDVERICSVGQTIKAKKKSEGYIGEKGI 129
Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSE-GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLES 1336
GFKSVF+++D+P+I SNGF +F S+ G +G+++P +P P
Sbjct: 130 GFKSVFKISDSPKIISNGFQFEFRRSDGGDLGYIVPYWLPSI------------PEYANP 177
Query: 1337 KCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
K T I LP R E + ++F ++ P+L+LFL +LQ I + +ND+L + K
Sbjct: 178 KV--TNIILPLRADARERL------NLFDEIDPTLILFLRKLQRIEIIDEINDNLKAVTK 229
Query: 1397 KIVGDGIIKVSCGEDKMTWFV--ASQKLRAGVI-----RPDVKTTEIALALTLQESNEGN 1449
+G++ + + W+V A+ K+ +I R D+++TEI LA + E
Sbjct: 230 S-GSEGLVTIDYQDQSQYWWVIRATFKVPQEIIDQEPRREDIRSTEIVLAFPVSEGGHPI 288
Query: 1450 YGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAE 1509
+ + +FAFLP++ G FI+Q DF+L S+RE++ +S WNQWL S+ P + + A
Sbjct: 289 NIQNVSKPDIFAFLPVKKCGFNFIVQADFLLTSNREDILQDSRWNQWLRSKIPDVLIKAI 348
Query: 1510 RSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGLPRMILSKLRMSNCLILEGNNNQ 1568
F ++ K ++ ++ + L E+ FF+ L L C++ E +
Sbjct: 349 GKF-------KHDKKLSTSFLKHLSLPEEITDEFFNPLMHQAFQALSGVECILTESGS-- 399
Query: 1569 WAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQI 1628
W PC V +D L+ + L+ + G +++ L L LGI ++ PK L +
Sbjct: 400 WKKPCDVYYAEDD-LKDLVSNNELRSNFGKEYVANTFYLKRELRDLLGIADFTPKHLEEC 458
Query: 1629 ISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLS 1688
+ W+ NE + + Q + +D++ ++P I L
Sbjct: 459 LKH------------PEWVKKHSNEWLIMLYRYCAQQLEVAKSGNLTLDDIIQLPIIKLE 506
Query: 1689 DGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISV 1748
+G + S + G I+L+ + + +G E+ +L LR + + +A ++ + +I
Sbjct: 507 NGDYCSANNGQIFLYLERT--GKKYGFES--DLEGTLRVLDADIRNAVRAEEDAKDLIE- 561
Query: 1749 DNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLE 1802
L K+ + +I+ HILP ET ++ ++ F+ H E
Sbjct: 562 -----FLKKLNIVNPDPSNIINQHILPQYHRETWKKKPPGILQGHIHFIKDHFE 610
>gi|291243341|ref|XP_002741561.1| PREDICTED: predicted protein-like, partial [Saccoglossus kowalevskii]
Length = 810
Score = 257 bits (657), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 207/345 (60%), Gaps = 30/345 (8%)
Query: 1151 EDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFL 1210
E+K + ++ IR +FG+G +S S +++ R GR+L LS ELYS+D+HF+
Sbjct: 485 EEKIQRECRAVISDIRHHQFGVGVELSEEASALMEVYKERSGRSLERLSTELYSKDTHFV 544
Query: 1211 LELVQNADDNIYPENVE---PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
LELVQNADDN Y EN++ P L F++++ + VLNNE GF ++IRALCDVG STK
Sbjct: 545 LELVQNADDNNY-ENMDDSPPALRFVIEKECVTVLNNESGFREKDIRALCDVGKSTKGKH 603
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
GYIG+KGIGFKSVFR+TD PEIHSNG+H+ FD G G++LP + +
Sbjct: 604 KYGYIGQKGIGFKSVFRITDKPEIHSNGYHMCFDAKSGPNGYILPHWIDENSC------- 656
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
+E W T I LP + + + + N F D+HPSLLLFLHRL+ I RN +
Sbjct: 657 ------VEDTEWKTRIVLPVKAETQDRRLLAN---RFHDIHPSLLLFLHRLKSISIRNKV 707
Query: 1388 NDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI-RPDVKTTEIALALTLQESN 1446
N + + V D +I+V W V ++L A + + +V++TE+ALA LQ+ +
Sbjct: 708 NGEEHSMTRTDVDDDVIEVQHTNGVDRWIVTKKQLDASNMSKENVESTELALAFLLQQPS 767
Query: 1447 EGNYGPLLY------QQPVFAFLPLRTYGLKFILQGDFVLPSSRE 1485
G+ +Y +Q VFAFLPLR+YG +F++QG+ + S+E
Sbjct: 768 SGS---AIYPQSTPDKQKVFAFLPLRSYGFRFVVQGEKNIAGSKE 809
Score = 56.6 bits (135), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 58/116 (50%), Gaps = 10/116 (8%)
Query: 836 SVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLP-------GWLLNEVNVK---EL 885
S + + A+ LL AP L DL+ WSHW +F P G + G+++ + ++
Sbjct: 78 SQSKEKAMICLLNAPLLEDLSVWSHWSLIFQPEHGDIKQFIEKNTGYMIKNNGISTAFDM 137
Query: 886 LCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLL 941
L T+ G ++++ ++ + F ++A G S QT L+S GG + PL LL
Sbjct: 138 CVLETKPGVLLKLSTDSTPELFSQSAENGESHQTVGHLVSIVIHHGGVTSSPLALL 193
>gi|348676380|gb|EGZ16198.1| hypothetical protein PHYSODRAFT_334382 [Phytophthora sojae]
Length = 2724
Score = 253 bits (647), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 244/843 (28%), Positives = 374/843 (44%), Gaps = 127/843 (15%)
Query: 768 KIEDKG--SMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAIT 825
++ED G S+E ++ K + + V F+ ++ DL V + S +T S+
Sbjct: 679 RVEDLGLPSVEELIKLADKEQAGEETSVVKFAGSL-------DL-VQGDGSFDDTVSSNE 730
Query: 826 HTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKEL 885
TG + + + ++ A+E L + P+L D + + W +AP GPL L+ + V E+
Sbjct: 731 TTG--ESVQQKLLTEQALEKLRKCPYLVDASLYMDWRERYAPLCGPL----LSFIRVHEM 784
Query: 886 LCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVP--LPLLKC 943
+ L DH+ S ++F+ V L+ L EK+ P L LL
Sbjct: 785 ILL----------DHAPSSNNFM-----------FVSCLNGTILRVNEKSTPSDLELLFA 823
Query: 944 HARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKA 1003
A+ A V V C+ G+ NF + + K A
Sbjct: 824 RAQQAKTHAAARQVAVHLVSMVVTCK-------GQANFPKQLVQAHLRAYLSSVTKDKTA 876
Query: 1004 VPVASRFFLDCLGYLPSEFRSFAADVLLSGL-QSTIKDAPSA--ILLECHQ-TELRLMLH 1059
RF L+ L P EF F +LL L Q+T ++ SA + C E + +L+
Sbjct: 877 GSYPERFVLEVLLETPVEFADFVMSLLLGVLSQTTGSESASADRVWKACRSDAERKALLY 936
Query: 1060 EVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNS----------GSGCAE 1109
SL L W + +C + L NAA S G
Sbjct: 937 VSSRSLSGL-WATESEQWCILRDTPSLSSTDDDGGNAAKENTKSSATPQPVTSTGDDGKT 995
Query: 1110 GSLFESVGADVHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDE 1169
SLF + +V + + G+ ++D + + + D C +E +RR +
Sbjct: 996 VSLFNASSQEVAL---ANFSSELSGQNTNDL---SSVEDIASDDSCRS---FIEDLRRKQ 1046
Query: 1170 FGLGPNISN-MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEP 1228
FG+G I + +++L+ Q RL RAL LS ELYS+ +HF+LEL+QNADDN Y + V P
Sbjct: 1047 FGVGLQIQDEATTSVLRIQQQRLERALKRLSDELYSESTHFVLELLQNADDNTYDDAVVP 1106
Query: 1229 TLTFIL-QESGIVVLNNEQGFSAENIRALCDVGNSTKKG-SSAGYIGRKGIGFKSVFRVT 1286
F L E IV NNE+GFS NI+A+CDVG STK+ S IG+KGIGFKSVF+V+
Sbjct: 1107 LGDFTLTAEKEIVFYNNERGFSPSNIQAICDVGASTKEAVDSEASIGKKGIGFKSVFKVS 1166
Query: 1287 DAPEIHSNGFHVKFDTSEGQ----IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTC 1342
D P++HSNGFH+ F + +G++LP + +D D Q + + T
Sbjct: 1167 DNPQVHSNGFHICFHAKNAEHGSGMGYILP-----YWLD--------DATQWKQRRGTTF 1213
Query: 1343 IRLPFRTKFSEGIAMNNIVDMFSDLHPSL--------------------LLFLHRLQCIM 1382
+ LP + + ++I PS+ L FL + + +
Sbjct: 1214 V-LPLNDSSVQRV--DDISQSLMAFEPSVLLFLRRIRELRLRDPARQHALHFLKKEKQLQ 1270
Query: 1383 FRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL 1442
+ + ++K +++ W V +KL + TEIALA L
Sbjct: 1271 TNAQIVELYSQVKKSESSVEVVQ-------QNWLVVKEKLEPPQLFGRSHPTEIALAFPL 1323
Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
E + PL Q VFA+LPLR+YG +FILQGDF +PSSRE + S WN+WL+S+FP
Sbjct: 1324 TFQGEESRPPL---QEVFAYLPLRSYGFRFILQGDFEIPSSREAITNGSEWNEWLVSKFP 1380
Query: 1503 ALFVSAERSFCDLPCFRENPA----KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
L +A S+ A +A S +S +PL EV F + ++ +LR
Sbjct: 1381 KLVRAAVGSYVSSAQASGQEADKAIEAISHLLSLLPLENEVQAPFRSIIPEVMRELRQVK 1440
Query: 1559 CLI 1561
L+
Sbjct: 1441 WLL 1443
Score = 74.3 bits (181), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 114/232 (49%), Gaps = 18/232 (7%)
Query: 166 RSVLLKLQVDSWSSL-GFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAIC 224
R++ +V+++ L G ++P+L +L R+ F+ CF+ R+I +L++ ++A
Sbjct: 103 RAICSHFRVNAFEELMGTSALQLPALRQLHTVNQRVWTFVTCFMQSRRINTLFECQQAFL 162
Query: 225 KSEGIEKFEELELG-PFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDT--HKA- 280
+ EG+ F EL+LG FL + L +S V + + T DV+ L ++ D H A
Sbjct: 163 QHEGLRAFHELKLGNSFLHTEAVQQ--LYRSPV-ALLTVTTRDVLASLKQFEDMLGHDAF 219
Query: 281 ---KDIKVDEFLDFIAKKRSLASKEQLGVRI--QNLGMHINFIREARKSQDVTLK----K 331
I ++EFL ++A++ S + +GV I G+++ +R ++ +K +
Sbjct: 220 RASSHIDLNEFLQYLAQQYRQPSAQAMGVAIDPSGFGVYVGMLRRVANNEMKEMKTLEQQ 279
Query: 332 FLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIR 383
F +E+ R + FS + ++Q E A +S +E+ GA+ R
Sbjct: 280 FQREVAEKMFRLTKEKFSDDNRKQALEELLAQTNAAQSTQQESEE-GAQKRR 330
>gi|301122067|ref|XP_002908760.1| phosphoserine aminotransferase, putative [Phytophthora infestans
T30-4]
gi|262099522|gb|EEY57574.1| phosphoserine aminotransferase, putative [Phytophthora infestans
T30-4]
Length = 3635
Score = 243 bits (619), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 290/1080 (26%), Positives = 454/1080 (42%), Gaps = 199/1080 (18%)
Query: 821 TSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGW----- 875
+ ++ TG ++ + + A+E L PFL D++ + W +AP GPL +
Sbjct: 1809 SDSVEETG---ELGQKTLTDQALEKLRTCPFLVDVSLYMDWPERYAPLCGPLLSFIRVHE 1865
Query: 876 --LLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEG--------SSFQTAVKLLS 925
LL+ + + +G ++R++ S S LE L S+ AV L+S
Sbjct: 1866 MILLDHAPSSNFMFVSCVNGAIVRVNEK-STPSDLELLLTRAQQEKTRVSASVVAVHLVS 1924
Query: 926 SFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVAN 985
G+ N P L++ H R
Sbjct: 1925 MLVTCKGQANFPKQLVQAHLR--------------------------------------- 1945
Query: 986 IDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAI 1045
L F K RF L+ L P EF F +LLS L ++ A+
Sbjct: 1946 ---------AYLSSFAKHGDYPKRFLLEILLETPVEFADFVVALLLSVLGQSVSTNEYAV 1996
Query: 1046 LLE----CHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDL--LMPCVVTCTNAATS 1099
+ C R L V W +C+ DL ++ V+ T A +S
Sbjct: 1997 ADQLWKTCTSDAERKALVFVSSHSSASLWKDQTEKWCAV-REDLTAIIDGDVSKTTAVSS 2055
Query: 1100 --GLNSGSGCAEGSLFESVGADV-------HIEECGAICDTICGEASDDGLGDCTTQTLP 1150
+SG GS+ E+V A+V +EE I + G C +
Sbjct: 2056 IDVADSGDDDVVGSVLEAVDAEVPSFSVDQEMEEVSVIETSTSGS--------CRS---- 2103
Query: 1151 EDKECEDAALIVESIRRDEFGLGPNISN-MESNMLKKQHARLGRALHCLSQELYSQDSHF 1209
+E +R+ +FG+G I + +++L Q RL RAL LS ELYS+ +HF
Sbjct: 2104 ----------FIEDLRKKQFGVGLEIQDEATTSVLLIQQQRLERALKRLSDELYSESTHF 2153
Query: 1210 LLELVQNADDNIYPENVEPTLTFIL-QESGIVVLNNEQGFSAENIRALCDVGNSTKKG-S 1267
+LEL+QNADDN Y + V P F L + IV NNEQGF+ NI+A+CDVG STK
Sbjct: 2154 VLELLQNADDNTYDDAVVPLGEFTLTNDKEIVFYNNEQGFTRANIQAICDVGASTKAAVD 2213
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
S IG+KGIGFKSVF+V D P++HSNGFH+ F G + ++P + +D
Sbjct: 2214 SEASIGKKGIGFKSVFKVADDPQVHSNGFHICFHAKNSNHGTGMGYILP-YWLD------ 2266
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLH----------- 1376
D Q + T + LP + + ++I PS+LLFL
Sbjct: 2267 --DTTQWRQRRGTTFV-LPLNDASVQRV--DDISQSLMAFEPSVLLFLRRIRELRLLDSV 2321
Query: 1377 RLQCIMF-----RNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
R + F R N LV + ++ + ++V+ + W V + L +
Sbjct: 2322 RQHSLHFLKKETRLQTNSKLVKLYSQVKSESSVEVT----QQKWLVVKETLEPPQLFHRS 2377
Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
TTEIALA + + +G+ PL Q VF +LPLR+YG +FILQGDF +PSSRE + S
Sbjct: 2378 HTTEIALAFPVDDF-KGDRPPL---QQVFCYLPLRSYGFRFILQGDFEIPSSREAITNGS 2433
Query: 1492 PWNQWLLSEFPALFVSAERSFCD-LPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMI 1550
WN+WL+S+FP L +A S+ LP A A + F S +P M
Sbjct: 2434 DWNEWLVSKFPKLMRAAVHSYVSTLPTDESRIAAIAHLLSLLPLESEVQAPFRSIIPEM- 2492
Query: 1551 LSKLRMSNCLILEGNNNQWAPP-----CKVLRGWNDRAHSLLPDI---LLQKHLGLGFLS 1602
+ +LR L +++ P C L G ++ LL + +L FL
Sbjct: 2493 MRELRQLKWLASTSLSSELYKPMELVDCSELAGTSESTVMLLEALSEDVLASTGNKRFLH 2552
Query: 1603 KDI--VLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFH 1660
+ ++ S+ L +E+ L++++ SL +N ++W L L + +
Sbjct: 2553 ATLSQAMTTSMKNQLRVEQLHASHLMRVL-SLSMEKN-----DINWTVKVLALLAKL-WR 2605
Query: 1661 SSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTI-WLH-SDCS----VFDGGFG 1714
SSL L L+ I PL + EG + W+ +DC V + G
Sbjct: 2606 KDRHSSL-------LRQELRLIKCFPLHKDSA----EGVVKWVSLADCHDSLFVSEDSAG 2654
Query: 1715 LEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLK-IGVQQLSAHDIVKVHI 1773
L + + AV+KS+ L LLK +GV + H++++ H+
Sbjct: 2655 SGTSYAFFGDLHILDDSF--TKAVNKST-------RLRTFLLKDVGVHVMEDHELIRHHV 2705
Query: 1774 LPAISDETTANGDK---NLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHG 1830
LP +++ +++ DK + +Y F++ HL C +C + +E++ V T+ G
Sbjct: 2706 LPKMTELRSSSDDKLDVGAVMEYGRFLVSHLA-TCRDCSMH-----AEVKANTAVATSSG 2759
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 129/278 (46%), Gaps = 25/278 (8%)
Query: 100 NQQRSQTQHPPGASNQQ---RPLTQQPQSLPPQPQKLNKEFLERIDRAVVKARSELIMAG 156
QQ + + ASNQQ +P Q P++ PQ Q+ F+E A +S
Sbjct: 1156 QQQHGEERGGSYASNQQGYYQPSHQHPRA-SPQVQRDVVAFIE----AFRANQSSPSAPP 1210
Query: 157 ESVSSWKVSRSVLLKLQVDSWSSL-GFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITS 215
+++ V R++ V+++ L G ++P+L +L R+ F+ CF+ R+I +
Sbjct: 1211 NALTLDVVVRAICSHFHVNAFEELMGTSALQLPALRQLHTVNQRVWTFVTCFMQSRRINT 1270
Query: 216 LYDLEEAICKSEGIEKFEELELG-PFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEY 274
L++ +A + EG+ F EL+LG FL + H + + + T DV+ L ++
Sbjct: 1271 LFECHQAFLQHEGLRSFHELKLGNSFLHTEAVQHLY---HSPRAMLTVTTRDVLTFLRQF 1327
Query: 275 TDT--HKA----KDIKVDEFLDFIAKKRSLASKEQLGVRI--QNLGMHINFIREARKSQD 326
+ H A + ++EFL F+A++ + + +GV I G+++ +R +
Sbjct: 1328 EEMLGHDAFRASSHVDLNEFLQFLAQQYRQPNAQAMGVAIDPSGFGVYVGMLRRVANYEM 1387
Query: 327 VTLK----KFLKELQPHHKRRKRPIFSSEKKRQLDERF 360
+K +F +E+ R + FS E +++ E+
Sbjct: 1388 KEMKTLEQQFQREVAEQMFRLTKEKFSDENRKRALEKL 1425
>gi|389776172|ref|ZP_10193760.1| hypothetical protein UU7_07556 [Rhodanobacter spathiphylli B39]
gi|388436844|gb|EIL93681.1| hypothetical protein UU7_07556 [Rhodanobacter spathiphylli B39]
Length = 1590
Score = 238 bits (606), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 223/739 (30%), Positives = 347/739 (46%), Gaps = 122/739 (16%)
Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENI 1253
AL LS ELYS+ SHF+LEL+QNADDN Y P LTF + + +VV+NNE GF ENI
Sbjct: 61 ALRLLSVELYSKKSHFVLELIQNADDNRYELGTTPQLTFEVTSNRLVVVNNEIGFRDENI 120
Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-TSEGQ-IGFVL 1311
A+C VG S+K +GYIG KGIGFKSVF V+DAPEIHSNGFH+KFD T EG +G+++
Sbjct: 121 EAICKVGASSKPKEKSGYIGEKGIGFKSVFTVSDAPEIHSNGFHIKFDRTVEGNLLGYIV 180
Query: 1312 PTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSL 1371
P P P ++ C T I LP +S G D + L L
Sbjct: 181 PQWCEP-------------PEEVRPNC--TTIILPASKNYSFG------ADTLAALDARL 219
Query: 1372 LLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI---- 1427
LLFL+ L+ + + ND V +++ + + +++ +K + +AS+++R +
Sbjct: 220 LLFLNTLRQLTL-DYDNDR--VTYRRVDSEEVSRLTAEREKTSGELASEEMRYVRVSSRF 276
Query: 1428 -----------RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
R + + + LA + P VFAFLP+R G KF +QG
Sbjct: 277 AMDEAHADQEKRAGIGQSTVVLAFPIDAGGAAKPEP---SSNVFAFLPIRQMGFKFSIQG 333
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV 1536
DF+L +SREE+ + PWN++L + +F SA + F+++ A S Y+ +VP
Sbjct: 334 DFILSASREEILTDRPWNRFLRDQIATVFSSA------VEVFKKSEGLALS-YLKYVPDE 386
Query: 1537 GE-VHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKH 1595
E V FF + + I++KL C L + W P ++LR +L P L+ +
Sbjct: 387 HEVVDPFFRTVRKSIIAKLSGVEC--LRSASGVWKRPGELLRA-EKTFRALFPSKLVIEL 443
Query: 1596 LGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELY 1655
++ I+ S L RALG+ + GPK+++ ++ R + ++ L+W A +
Sbjct: 444 TDFDYVDAKILGSVELLRALGVTDVGPKLVVDVL----REKAWFQAQPLAWRAKFY---- 495
Query: 1656 TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVF-----D 1710
++ + + SL + L + P +P+SD G+ + S VF
Sbjct: 496 --AYVADNRESLTAV-------GLLQAPCLPISD--------GSCVVPSQLEVFFPLSRR 538
Query: 1711 GGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVK 1770
+G E ++L V L +A K S V+ + + V+ +D++
Sbjct: 539 KKYGFE------SELVFVDNELYEEAA--KHSKHVVE------LFSAMKVRSDVPYDLIS 584
Query: 1771 VHILPA-ISDETTANGDKNLMADYLCFVMMHLEYYCPNCHV---EREFIVSELRKKAFVL 1826
HILP SD T + DK ++ +L +V L+ Y V + LR V
Sbjct: 585 SHILPTHASDAWTESEDKAIVG-HLRYVKDKLKDYLDGALVAGKSEQQAYQALRDGIRVG 643
Query: 1827 TNHG-------FKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYL----KHPAK 1875
T F+R E ++ KE+ S+ + D V +YL K P +
Sbjct: 644 TKQQEAEGLWIFRRVGE--LYLSKEYRPDFSIESFLGDQIAAEKLVSPSYLTDKPKDPIQ 701
Query: 1876 ESLSCGLVKWRKFFKEIGI 1894
E+ S WR+FF +G+
Sbjct: 702 EADS-----WRQFFTALGV 715
>gi|325185329|emb|CCA19816.1| phosphoserine aminotransferase putative [Albugo laibachii Nc14]
Length = 2536
Score = 233 bits (595), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 203/695 (29%), Positives = 319/695 (45%), Gaps = 95/695 (13%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESN---------MLKKQHARLGRALHCLSQELYSQD 1206
E ++ESIR GL + ++N + Q RLGRAL LS ELY+ +
Sbjct: 940 EACRALIESIRTSRLGLEIPQAPADTNGSLVTSTRQIFTSQQRRLGRALKRLSAELYAFN 999
Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG 1266
SHF+LEL+QNADD ++ NV PT ++ E+ + +NE GF+ N++ALCDVG STK+
Sbjct: 1000 SHFVLELIQNADDAVFQGNVSPTCKLLITENSVEFTSNEVGFTDANVQALCDVGASTKE- 1058
Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE----GQIGFVLPTLVPPFNIDM 1322
S A IG KGIGFKSVF+++D P++HSNGFH++F S G +G+++PT +
Sbjct: 1059 SDATCIGNKGIGFKSVFKISDRPQVHSNGFHIQFQASSSSDNGMLGYIVPTWI------- 1111
Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFR---TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
+DP C T I LPF ++ E I + + + + PSLLLFL R++
Sbjct: 1112 ------EDPALWRVDC-GTSIVLPFHEEARRWKEAI----VEALHATIEPSLLLFLRRIK 1160
Query: 1380 CIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV--KTTEIA 1437
+ + N I K+ + ++ C + K S+ + + + KT +
Sbjct: 1161 QLQVDH--NAQQTCIAKQ---ENWARIYCNDIKWKHISLSESSKGHNVSWLLVEKTIAVP 1215
Query: 1438 LALTLQESNE----------GNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEV 1487
+T Q S+ G P Q VFAFLP+R+YG +FI+QGDF+LPSSRE +
Sbjct: 1216 ERITRQSSSADTRLVVGLPLGTIDPPY--QKVFAFLPVRSYGFRFIVQGDFILPSSRESI 1273
Query: 1488 DGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAK-AASVYMSFVPLVGEVHGFFSGL 1546
D +S WNQWLL+ P LF LP NP + + +P+ V F +
Sbjct: 1274 DASSEWNQWLLTFLPDLFEHIATHLW-LPGSPNNPDWISREQFHRLLPVEKHVEAPFRCI 1332
Query: 1547 PRMILSKLRMSNCLILEGNNNQWAPPCKVLR------GWNDRAHSLLPDILLQKHLGLGF 1600
R IL R++ + + PPC +L ++ SL+ D ++ L
Sbjct: 1333 VREILQ--RLAKAPWIPSTQDTLLPPCTLLEVPVDVLEQDEWVESLMLDATHKRALSRET 1390
Query: 1601 LSKDIVLSDSLARALGIEE----YGPKILLQIISSLCRTENGL--RSMGLSWLASWLNEL 1654
S LS + AL IE + ++L ++ R+E + L L + L++L
Sbjct: 1391 SS---TLSKEIKCALQIESLQAIHVVRMLAVFTQAIGRSEEEMAKNEHKLVVLFNLLSKL 1447
Query: 1655 YTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDG-----TFSSVDEGTI--WLHSDCS 1707
+ I + G L L+ IP P+ + S + G W +
Sbjct: 1448 WPI---------VDRGASHALSRELEWIPCFPVDRPGERHISLKSAENGLFLPWKVKPAT 1498
Query: 1708 VFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHD 1767
+ +L K V P + VD + L + + + IG++ L AH
Sbjct: 1499 HLTHNPIVALVQSL--KYTPVKP----LAVVDGAFLLQLDGHTIEFLTRTIGLEPLDAHK 1552
Query: 1768 IVKVHILPAISDETTANGDKNLMADYLCFVMMHLE 1802
+++ +LP + A D + +L ++ HL+
Sbjct: 1553 VLEYQVLPGLKAHALAEMDHLQLEKWLFCLLDHLD 1587
>gi|326431928|gb|EGD77498.1| serine/threonine protein kinase [Salpingoeca sp. ATCC 50818]
Length = 2796
Score = 220 bits (561), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 166/539 (30%), Positives = 253/539 (46%), Gaps = 67/539 (12%)
Query: 1012 LDCLGYLPSEFRS-FAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEW 1070
L L +LP S DVL+ G + + + + +L RL+L ++G I +W
Sbjct: 1064 LAVLSHLPPPLLSPVGYDVLVEGARGVVPEMHTKLLHAAATPFHRLLLRQMGRPARIRDW 1123
Query: 1071 IHDYHAFCSTGTSDLL--MPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAI 1128
+ S + P +NAA S S +L +V + + G
Sbjct: 1124 NNAALQATDQAASRVQSSAPLSRIGSNAAPSADALASRTPSSALPPTVSGTTLLGQDG-- 1181
Query: 1129 CDTICGEA-----------------SDDGLGDCT-TQTLPEDKECEDAALIVESIRRDEF 1170
D G + +++ + D + +PED EC ++ +IR D
Sbjct: 1182 -DGTTGNSVTTATATTTNGTGGVIGNEEDVHDAVLAREMPED-ECN---AVLRAIRADHG 1236
Query: 1171 -GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT 1229
GL ++ + +++ + +++ L+ +LY+ +HF++ELVQNADDN Y NV P
Sbjct: 1237 EGLDDGMTEEKKRAVQQLRHLVQQSVEKLASDLYASTTHFVMELVQNADDNTYLTNV-PA 1295
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L+ + + NNE GFS +N+RALC +GNSTK S A +IG KGIGFKSVF+VT P
Sbjct: 1296 LSITIANDRFELRNNEVGFSEKNVRALCSIGNSTKSASEARFIGHKGIGFKSVFKVTHCP 1355
Query: 1290 EIHSNGFHVKFDTSEGQIGFVLP-TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF- 1347
+IHSN FH FD +G+VLP + PP N + T I LPF
Sbjct: 1356 QIHSNNFHFGFDVKNNPLGYVLPHAIAPPPNW----------------RARETVIVLPFD 1399
Query: 1348 ----RTKFSEGIAMNNIV------DMFSDLHPSLLLFLHRLQCIMFRN-MLNDSLVVIRK 1396
+ + G V D+ L PSLLLFLH+L+ + F++ +LN+ + R
Sbjct: 1400 EPDTNSGTNSGSGGRVAVTREFRNDLKEKLDPSLLLFLHKLKRLDFQDTVLNEKRSLQRT 1459
Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAG-VIRPDVKTTEIALALTLQESNEGNYGPLLY 1455
G+++++ D+ W L +R V TE+ +AL L+ + L
Sbjct: 1460 PRNTKGVVQITADSDRTYWLTRKLDLAPDPTLRQGVTATEMVIALPLRRYKQEER---LQ 1516
Query: 1456 QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
VFAFLP+ Y K I+Q D++L SSR + S WNQWLL + PA F +S C+
Sbjct: 1517 SYDVFAFLPVSDYRFKLIVQADWMLTSSRNAIHEGSAWNQWLLKQVPAAF----KSLCE 1571
>gi|256074692|ref|XP_002573657.1| hypothetical protein [Schistosoma mansoni]
Length = 3070
Score = 211 bits (536), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 143/422 (33%), Positives = 209/422 (49%), Gaps = 93/422 (22%)
Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
D +E IRR EFG+G +S ++++++ +L R+L LS+ELY HFLLEL+QN
Sbjct: 989 DRHQFIEEIRRREFGVGVELSEEATDLIQRVEGKLSRSLVQLSEELYGLPGHFLLELIQN 1048
Query: 1217 ADDNIYPENVEPTLTFIL----------------QESGIVVLNNEQ-GFSAENIRALCDV 1259
ADDN Y N PTL L Q+ ++V+NNE GF+ ++ ALCD+
Sbjct: 1049 ADDNTYGINDTPTLQLQLSTISTMNNECNSNINGQKFSLLVMNNESVGFTEHDMSALCDI 1108
Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPT-----L 1314
G STK IGRKGIGFKSVF +TD PE+HSNGFHV+F + T L
Sbjct: 1109 GQSTKVTQRDAKIGRKGIGFKSVFNITDTPEVHSNGFHVRFHRQSKCTKYTSQTSPSLIL 1168
Query: 1315 VPPF--------------NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI 1360
+P + I +C+ L P+ E T I R S + + +
Sbjct: 1169 IPEWCDNEQLMKSQNSCNEIPSWCKTLFILPLNSE-----TLINRSTRIVQSPALYITQL 1223
Query: 1361 VDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI--------VGDGIIKVSCGEDK 1412
+ + LHPSL+LFL RLQC+ F +M ++ +++ I +K+ C +
Sbjct: 1224 IQ--TTLHPSLMLFLRRLQCLRFSSMESNIYWNVKRTIKTLSSSSISLGISVKIQCSAEI 1281
Query: 1413 MT-------------------WFVASQKLRAGVIRPDVK--------TTEIALALTLQES 1445
+T WF + +I D K TEI+LA++L +S
Sbjct: 1282 ITVHETQCNLSSDGETNTIHKWFCFKE-----IIPVDFKETNKNLPSQTEISLAISLNDS 1336
Query: 1446 NEGNYGPLLYQQP--VFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
+ PL QP +F++LP+R+ G +F + DF L SSRE+VD S WN+WL+ + P
Sbjct: 1337 D-----PL---QPCAIFSYLPIRSVGFRFFINADFDLTSSREDVDSTSAWNRWLVGKIPN 1388
Query: 1504 LF 1505
+F
Sbjct: 1389 IF 1390
>gi|358334131|dbj|GAA52586.1| hypothetical protein CLF_108397 [Clonorchis sinensis]
Length = 2905
Score = 209 bits (533), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 149/423 (35%), Positives = 215/423 (50%), Gaps = 86/423 (20%)
Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
D + +RR EFG G ++ ++++++ +L R+L LS++LY HFLLEL+QN
Sbjct: 807 DRETFITELRRREFGCGAELTQEAADLVRRVEGKLSRSLLQLSEDLYGLPGHFLLELIQN 866
Query: 1217 ADDNIYPENVEPTLTFILQE---------SGIVVLNNEQ-GFSAENIRALCDVGNSTK-- 1264
ADDN Y + +PTL F + + + ++V+NNE GF+ ++ +LCDVG STK
Sbjct: 867 ADDNEYAASTKPTLEFCIAQQTNDTVGPAAALLVMNNEAVGFTEADMSSLCDVGISTKVQ 926
Query: 1265 -KGSSAG---YIGRKGIGFKSVFRVTDAPEIHSNGFHVKF------DTSEGQIGFVLPTL 1314
+G G + RKGIGFKSVF VTDAPE+HSNGFHV+F D + + L L
Sbjct: 927 QRGIKTGEFSLLCRKGIGFKSVFNVTDAPEVHSNGFHVRFHRQSRSDADQVDVANAL-LL 985
Query: 1315 VP--------PFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMN--NIVDM 1363
VP P++I R P+ + W T LP +K S AMN +I+ +
Sbjct: 986 VPEWCTAIATPYSIHEPSRW----PIPVSVPAWCKTLFVLPIISK-STATAMNFGSILQL 1040
Query: 1364 FSD-LHPSLLLFLHRLQCIMFRN-----------------MLNDSL------VVIRKKIV 1399
D L P L+LFL RLQC+ +L DS + +I+
Sbjct: 1041 AYDTLAPYLMLFLRRLQCLTLSYGPDCKAGQEPARRKLLVLLRDSTALPSTEIAKTAEII 1100
Query: 1400 GDGIIKVSCGEDKMT--------------WFVASQKLRA---GVIRPDVKTTEIALALTL 1442
+I+ GED+++ WF + LR + + TE+ALA++L
Sbjct: 1101 SVTMIRGGDGEDQLSGKAGESSSSTITHRWFCFKETLRVNEENLPKHAPTQTEVALAVSL 1160
Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
+S L Q PVFA+LP+R G F + DF L SSRE+VDG S WNQWL+S P
Sbjct: 1161 TDSEP------LPQCPVFAYLPVRAAGFTFYINADFDLTSSREDVDGTSVWNQWLVSRIP 1214
Query: 1503 ALF 1505
++
Sbjct: 1215 TVW 1217
>gi|393214741|gb|EJD00234.1| hypothetical protein FOMMEDRAFT_170214 [Fomitiporia mediterranea
MF3/22]
Length = 1480
Score = 207 bits (528), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 162/537 (30%), Positives = 263/537 (48%), Gaps = 81/537 (15%)
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAEN 1252
+AL LS++LYS D+HF++E VQNADDN Y + +EPTL + ++ + + NE+GF +N
Sbjct: 30 QALELLSEQLYSTDTHFIMEFVQNADDNEYEKGIEPTLRILADKTSMTLSCNEKGFYEKN 89
Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
+RA+C +G STKK ++ G+IG KGIGFKSVF+V D + S F KFD S+ ++G + P
Sbjct: 90 VRAICKIGGSTKK-NNPGFIGEKGIGFKSVFKVADIVNVASGPFTFKFDRSK-RLGMITP 147
Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLL 1372
DP + W T L G A+ +D ++ PSLL
Sbjct: 148 IW---------------DPSVVADHGW-TTFYLDLSRSPGSGTAIGVKLD---EIKPSLL 188
Query: 1373 LFLHRLQCI-------------MFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVAS 1419
LFL +L+ I + R LN LV + + + +G I +K T+FV
Sbjct: 189 LFLRKLRAIEVDIRTGSPRQVEIKRRDLNSDLVQLTR--IANGTIL-----NKNTYFVVR 241
Query: 1420 QKLRAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQ 1475
Q + + R D+K TE+ LA + + + P ++ Q V AFLP+R +G KF++
Sbjct: 242 QCVDMSSVRDDRRKDIKETEVVLAFPVDD----EHLPQIFAQDVHAFLPVRDFGFKFVIN 297
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
GDF+ SSRE+V +SPWNQ + F++A R+F + +++ F+P
Sbjct: 298 GDFLTSSSREDVLHDSPWNQNIRRAIIQTFITAVRAF-------QTQGTLQDLWIRFLP- 349
Query: 1536 VGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDI--LLQ 1593
G V FF + R +L LR N ++ ++ PC+VL + + P +
Sbjct: 350 SGIVGSFFKAVERDLLRDLREMN--VVRDSSGAVRRPCQVLSVPSTFRYDFTPLVPARFM 407
Query: 1594 KHLGLGFLSK--DIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWL 1651
+ G +LS D++ + + LG+ E LL GL++M + +
Sbjct: 408 ERSGFYYLSDNYDMLADGPILKQLGVREMTVDDLL----------TGLQAMSSNHFMAHN 457
Query: 1652 NELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSV 1708
N +H + S L + + LQ + +PL DG++++ + ++ DC +
Sbjct: 458 NR-----WHDAVCSQLLHVAKRN--KKLQGLKILPLDDGSWAATHKAGNFVF-DCDM 506
>gi|350644204|emb|CCD61038.1| hypothetical protein Smp_134120 [Schistosoma mansoni]
Length = 3064
Score = 204 bits (519), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 140/414 (33%), Positives = 203/414 (49%), Gaps = 83/414 (20%)
Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
D +E IRR EFG+G +S ++++++ +L R+L LS+ELY HFLLEL+QN
Sbjct: 989 DRHQFIEEIRRREFGVGVELSEEATDLIQRVEGKLSRSLVQLSEELYGLPGHFLLELIQN 1048
Query: 1217 ADDNIYPENVEPTLTFIL----------------QESGIVVLNNEQ-GFSAENIRALCDV 1259
ADDN Y N PTL L Q+ ++V+NNE GF+ ++ ALCD+
Sbjct: 1049 ADDNTYGINDTPTLQLQLSTISTMNNECNSNINGQKFSLLVMNNESVGFTEHDMSALCDI 1108
Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPT-----L 1314
G STK IGRKGIGFKSVF +TD PE+HSNGFHV+F + T L
Sbjct: 1109 GQSTKVTQRDAKIGRKGIGFKSVFNITDTPEVHSNGFHVRFHRQSKCTKYTSQTSPSLIL 1168
Query: 1315 VPPF--------------NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI 1360
+P + I +C+ L P+ E T I R S + + +
Sbjct: 1169 IPEWCDNEQLMKSQNSCNEIPSWCKTLFILPLNSE-----TLINRSTRIVQSPALYITQL 1223
Query: 1361 VDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI--------VGDGIIKVSCGEDK 1412
+ + LHPSL+LFL RLQC+ F +M ++ +++ I +K+ C +
Sbjct: 1224 IQ--TTLHPSLMLFLRRLQCLRFSSMESNIYWNVKRTIKTLSSSSISLGISVKIQCSAEI 1281
Query: 1413 MT-------------------WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPL 1453
+T WF + +I D K T L SN+ + PL
Sbjct: 1282 ITVHETQCNLSSDGETNTIHKWFCFKE-----IIPVDFKETNKKYHW-LFHSNDSD--PL 1333
Query: 1454 LYQQP--VFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALF 1505
QP +F++LP+R+ G +F + DF L SSRE+VD S WN+WL+ + P +F
Sbjct: 1334 ---QPCAIFSYLPIRSVGFRFFINADFDLTSSREDVDSTSAWNRWLVGKIPNIF 1384
>gi|393234598|gb|EJD42159.1| hypothetical protein AURDEDRAFT_186149 [Auricularia delicata
TFB-10046 SS5]
Length = 1966
Score = 203 bits (516), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 174/635 (27%), Positives = 288/635 (45%), Gaps = 95/635 (14%)
Query: 1158 AALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNA 1217
AA +VE + R + ++ ++ N L ++L LS ELYS+ +HF+LE +QNA
Sbjct: 403 AADLVEELARGNSSVQNTVNELKDN--------LAKSLQILSDELYSKPAHFVLETIQNA 454
Query: 1218 DDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
DDN Y P L +L+ES + + NE GFS+ N++A C +G STKK + AGYIG KGI
Sbjct: 455 DDNTYANGAAPELRVVLRESYMTIQCNEVGFSSANVKAFCGIGKSTKK-NKAGYIGEKGI 513
Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESK 1337
G K++F+V + +HS G+H D E ++G + P Q S+
Sbjct: 514 GAKAMFKVANKVHVHSLGYHFVLD-REAELGMISP--------------------QWSSE 552
Query: 1338 CWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL--------QCIMFRNMLND 1389
N R F +G + + + ++LLF+ RL + +
Sbjct: 553 VLNEPGWTLMRLDFCDGSQFSVVQAQLQHVRDTVLLFMRRLSRFKIDDGEQFSVSRSTDG 612
Query: 1390 SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRP---DVKTTEIALALTLQESN 1446
++ IR KI GD + + + S+++ A P +V TTEI LA L E
Sbjct: 613 NICTIRTKI-GDSV-------SRKPYLRVSRRVAAFPQEPKRENVDTTEIVLAFPLDE-- 662
Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
+ P++ +Q V AFLP+R+ G KFI+Q DF+ ++RE+V N WN+ L S A F
Sbjct: 663 --DLRPVIAKQMVHAFLPMRSVGFKFIIQADFLTAANREDVLSNLQWNKKLRSGIAATFC 720
Query: 1507 SAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGLPRMILSKLRMSNCLILEGN 1565
SA F +P N ++ F+P +V FFS L IL+ LR ++
Sbjct: 721 SAVTDFMAMPSLELN-------WLLFLPDARDVADPFFSELGNTILNSLRTMEIVLCADG 773
Query: 1566 NNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVL-SDSLARALGIEEYGPKI 1624
+ + + + ++P +KHL ++S + + + S RALG+E
Sbjct: 774 HYRAPASVRFPSAFAVDDEGIIP----EKHLPFHYISPEYPINTHSKLRALGVESMS--- 826
Query: 1625 LLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQR-IP 1683
+S LCR + + G WL + F + + + + D+ R +P
Sbjct: 827 FNDFLSGLCRMGSTITLRGNDWLEDVCRLIIAQGF------TWKQRIGSQPRDHRVRTLP 880
Query: 1684 FIPLSDGTFSSVD-EGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSS 1742
L+DG++++ + +++ D S F G G++ + SP A
Sbjct: 881 LAQLTDGSWTTCELSDSLFFSGDSSRFPSGLGIDVL-----HMDGTSPQRRDVLA----R 931
Query: 1743 LGVISVD---------NLNRMLLKIGVQQLSAHDI 1768
LG+ ++D +L+R + + V + HD+
Sbjct: 932 LGITTIDVNVVAKKIFDLHRGNMGLTVARTLEHDL 966
>gi|346976275|gb|EGY19727.1| hypothetical protein VDAG_01743 [Verticillium dahliae VdLs.17]
Length = 1542
Score = 191 bits (486), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 182/640 (28%), Positives = 280/640 (43%), Gaps = 116/640 (18%)
Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLE 1212
+E ED + IRRD+ GLG + +N A L AL LS +LY +HFLLE
Sbjct: 10 REAEDH---IRQIRRDK-GLGDGPGQIGNN-----GADLESALKVLSHDLYQTATHFLLE 60
Query: 1213 LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--SSAG 1270
L+QNADDN+Y + PTL + + + NE+GFS +NI A+C + STK G SAG
Sbjct: 61 LIQNADDNMYAVDA-PTLAITYRPGRVRIDCNERGFSRKNIEAICRICQSTKSGRSKSAG 119
Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKD 1330
++G KGIGFK+VF+V I S + +FD +G +G + P D
Sbjct: 120 FVGEKGIGFKAVFKVASTVWISSGYYSFRFD-RDGHLGMIAPIW---------------D 163
Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM-------- 1382
K T I L +EG I+D +L+FL RLQ +
Sbjct: 164 AFPEAPKPGFTSIYLKLARDCNEG----GIIDELRSYDAKILIFLRRLQRLEIDVQREFW 219
Query: 1383 ----FRNML----NDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
F+ +L N S ++ DG+ K + W +L RP + ++
Sbjct: 220 KSGDFKTVLSRQANTSGNPSMTTLMNDGVKK-----QYLVWRHTVNRLPNDARRPGISSS 274
Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWN 1494
E+ LA L + E PL+ +Q V+AFLP+R Y KF+LQ DF+L ++RE+V + PWN
Sbjct: 275 EVVLAFPLDKDGET---PLIERQSVYAFLPIRDYDFKFLLQADFLLSANREDVHADLPWN 331
Query: 1495 QWLLSEFPALFVSAERSFCDLPCFREN-PAKAASVYMSFVPLVGEVH-GFFSGLPRMILS 1552
AL +A+++F D N K + FV G F+ L R +L+
Sbjct: 332 L-------ALTTAAQKAFLDAVKHMSNLTNKLRYTWFRFVTGSYSAQLGIFADLQRKLLA 384
Query: 1553 KLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQ---KHLGLGFLSKDIVLSD 1609
L+ + +L+ + P K+ R +PDI + L + + D LS
Sbjct: 385 DLQKTQ--LLDSTWGRKKKPMKLTR---------VPDIFCDNDGRPFTLHYKNDDRYLSP 433
Query: 1610 SLA---------RALGIEEYGPKILLQIISSLCRTENG--LRSMGLSWLASWLNELYTIS 1658
+ RALG+++ P+ + I L + G + W A + L + S
Sbjct: 434 KYSQDEPDASSLRALGVKDITPEAFMNEIDKLLKKHRGAFFQKQTRDWHAKFSQTLTSSS 493
Query: 1659 FHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWL--HSDCSVFDGGFGLE 1716
F S + + IPL D +++S++EG ++ S+ ++ G
Sbjct: 494 FASWHYRK-------------RTMAIIPLRDKSWTSMNEGQVYFAAQSNSTLIPEGI--- 537
Query: 1717 AFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLL 1756
K+R V P +A K + V NL+R L+
Sbjct: 538 -------KIRIVDPE-AAADPARKLLFEHLGVANLSRPLV 569
>gi|47203291|emb|CAG14733.1| unnamed protein product [Tetraodon nigroviridis]
Length = 238
Score = 190 bits (482), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 87/157 (55%), Positives = 115/157 (73%), Gaps = 2/157 (1%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
I+E IR+ EFG+G ++ +++ Q RLGR+L LS ELYS+D+HF+LEL+QNADDN
Sbjct: 62 IIEDIRKHEFGIGVELNAESQKLMQTQQERLGRSLDRLSTELYSKDTHFVLELIQNADDN 121
Query: 1221 IYPE--NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
YP V P L F+++ I +LNNE GF +N+RA+CDVG STK GYIG+KGIG
Sbjct: 122 TYPSEAGVLPALAFVVENDCITILNNEMGFQEKNVRAICDVGRSTKGKHKYGYIGQKGIG 181
Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
FKSVF+VTD PEIHSNGFH++FD + G +G++LP V
Sbjct: 182 FKSVFKVTDCPEIHSNGFHLRFDKTSGPMGYILPHWV 218
>gi|260803433|ref|XP_002596594.1| hypothetical protein BRAFLDRAFT_187605 [Branchiostoma floridae]
gi|229281853|gb|EEN52606.1| hypothetical protein BRAFLDRAFT_187605 [Branchiostoma floridae]
Length = 157
Score = 187 bits (474), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 84/152 (55%), Positives = 111/152 (73%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
+VE IRR+EFG+G + +++++ R+GR L LS+ELY++D+HF+LELVQNADDN
Sbjct: 1 VVEDIRRNEFGVGLKLEKKGEDLVERHKKRIGRGLERLSKELYTKDTHFVLELVQNADDN 60
Query: 1221 IYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFK 1280
Y +V TL F+++ + VLNNE GF NIRALCDVGNSTK GYIG KGIGFK
Sbjct: 61 EYEADVSATLVFVVEREKVTVLNNESGFQEANIRALCDVGNSTKGKHKFGYIGHKGIGFK 120
Query: 1281 SVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
SVFRV+D PE+HS G+H+ FD S +G++LP
Sbjct: 121 SVFRVSDKPEVHSRGYHICFDASSDPMGYILP 152
>gi|428172047|gb|EKX40959.1| hypothetical protein GUITHDRAFT_112964 [Guillardia theta CCMP2712]
Length = 2624
Score = 186 bits (473), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 251/1006 (24%), Positives = 412/1006 (40%), Gaps = 184/1006 (18%)
Query: 1134 GEASDDGLGDCTTQTLPEDKECEDAALIVESIRR------DEFGLGPNISNMESNMLKKQ 1187
GEA+ + + T D+ LI IR D GL P + + +++Q
Sbjct: 990 GEAAQERAIEARKCTHDGDEAESKTELICRKIRNKYGIQLDTMGLDPELRKV----VEEQ 1045
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQG 1247
L +L LS ELY+++ H++ E++QN DDN Y +VEP+L L E I NNE G
Sbjct: 1046 RKMLKNSLKLLSTELYAKEIHYIHEIIQNCDDNSYTGDVEPSLIIRLTEELIEFENNEIG 1105
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF--DTSEG 1305
FS ++ ++C +G S+ K + G+IG KGIGFKSVF++ P+IHSN FH + + EG
Sbjct: 1106 FSERDVESICSIGQSS-KANRKGFIGHKGIGFKSVFKICSKPKIHSNEFHFELAAEAKEG 1164
Query: 1306 Q----IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
+ + +++P +PP P + T I LP + I
Sbjct: 1165 EESAGLEYIVPQPIPP-------------PAGWRAGS-GTKIVLPIASS-----RRREIQ 1205
Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNM-LNDSLVVIRK-----KIVGD----GIIKVSCGED 1411
DL ++LLFL +L+ I + D V IR+ + G+ + KV GE+
Sbjct: 1206 ACIKDLEETILLFLRKLRVIEIHDARAQDGSVHIRRMRREAETEGNFDLVSLTKVVDGEE 1265
Query: 1412 -KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGL 1470
W V K D ++A+ + E +Y Q+ VFA+LP+R YG
Sbjct: 1266 SSQDWLVL--KTNVAEKTQDDLCLQLAVPFMHELGAEQDY----QQRDVFAYLPVRCYGF 1319
Query: 1471 KFILQGDFVLPSSREEVDGNSPWNQWL----LSEFPALFVSAERSFCDLPCFRENPAKAA 1526
+FI+ GDF + SSRE VD N L FP L + + L ++N
Sbjct: 1320 RFIVNGDFAVTSSRESVDTMDSRNLRLRDGIARHFPTLIRELQERY--LSATKQNQEAER 1377
Query: 1527 SVY--------MSFVPLVGEVHGFFSGLPRMI------LSKLRMSNCLILEGNNNQWAPP 1572
V+ MS +P GEV+ FF+ +P I L + + ++ W P
Sbjct: 1378 DVHVGEYTRVVMSVLPTPGEVNDFFAVVPDQIRRAIDSLELVPLRTGKFVKARCAVWRP- 1436
Query: 1573 CKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGI---EEYGPKILLQII 1629
R S L ++L ++ GLGF+ + L LGI ++ P IL +
Sbjct: 1437 ----RKVESEVISFLEEMLAEE--GLGFVHDAVAKYQVLGEHLGIRRLQQVFPSILKHLE 1490
Query: 1630 SSLCRT-ENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLS 1688
L + + L L+W+ + L + ++ L + LQR IPL+
Sbjct: 1491 QRLQSSLSDDLTMECLAWIFNELRDEEDVAV---------------LREVLQR-KLIPLA 1534
Query: 1689 DGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISV 1748
G F E ++++ D VFD L+ P+ R + P L A + G
Sbjct: 1535 SGDFVRPQEKSVYVLQD--VFD--LSLDPSPS----FRILHP--LFAEKIQMYGRG---- 1580
Query: 1749 DNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNC 1808
L L ++ V +A ++ ++P + D+ + D + L + LE
Sbjct: 1581 --LKSFLKEMNVVWNNAEAFLRSQLMPVLHDQESEEKD---LVTGLILLKKVLEQQSEQV 1635
Query: 1809 HVEREFIVSELRKKAFVLTN-HGFKRPSEIPIHFGKEFGNPVSVNMLIHDI--------- 1858
R ++ ++R+K VL + G + S +H G + + DI
Sbjct: 1636 ---RRLMLLDIRQKGIVLLDEQGDRVWSCNALHQGPSYSKHAD---RVTDIVGRSSGSMS 1689
Query: 1859 DIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM 1918
+ W V YL+ KE + G WR F +GI F V +
Sbjct: 1690 PVPWKLVSSKYLE---KEQDAGG---WRAVFLGLGILTFCHVRE---------------- 1727
Query: 1919 WTKELLSPGSAAIDWESNELVHLL-SLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFF 1977
PG + S E+ L S++ + + + L++ +L D + ++
Sbjct: 1728 -------PG-----YTSEEMCSLYRSIVEADGAPEPRRRRLQLFFSLLSDVWQEEY---- 1771
Query: 1978 KSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPK 2037
+ G + +F+ + + + D +LH P+ L + V +LG A I P
Sbjct: 1772 -EQALGLPEEGEVAFLTFLRQ-EAKLPGTDGQLHPPQLLLQRTEHVYRVLG--AKGIYPD 1827
Query: 2038 VKS--EKLVCDIGLKTEVTIDDILEILKVW----TRLEAPFMARYY 2077
V + KL +G+ +++ +I + L+ W T L FM Y
Sbjct: 1828 VDNLDVKLSQRLGISSQLQPSNIFDQLEHWRSSSTPLSLSFMESVY 1873
>gi|302905878|ref|XP_003049358.1| hypothetical protein NECHADRAFT_82646 [Nectria haematococca mpVI
77-13-4]
gi|256730293|gb|EEU43645.1| hypothetical protein NECHADRAFT_82646 [Nectria haematococca mpVI
77-13-4]
Length = 2237
Score = 186 bits (473), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 177/561 (31%), Positives = 254/561 (45%), Gaps = 106/561 (18%)
Query: 1178 NMESNMLKKQHARLGRALH------CLSQELYSQDSHFLLELVQNADDNIYPENVEPTLT 1231
N + + K ++ +GR L LS+ELY + +HFLLEL+QNA+DN Y ++V P+L+
Sbjct: 23 NTVNGTITKHNSPVGRLLENALEACSLSRELYEKQTHFLLELIQNAEDNTY-DDVTPSLS 81
Query: 1232 FILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--SSAGYIGRKGIGFKSVFRVTDAP 1289
F + + V NE GF+ N+RA+C +G STK G SA Y+G KGIGFKSVF+V
Sbjct: 82 FTYKRGSLRVDCNETGFTERNVRAICKIGESTKAGLGQSARYVGEKGIGFKSVFKVASEV 141
Query: 1290 EIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT 1349
I S F KFD ++ +G + P + S+ P + F
Sbjct: 142 YISSRSFSFKFDQAQA-VGLIAP-------------IWSEFPEPIRPGY------TSFYL 181
Query: 1350 KFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV---VIRKK--IVGDG-I 1403
K SE +I L P++LLFL RL+ I + L D +R++ + D I
Sbjct: 182 KLSENTDEEHIARDIKRLDPTMLLFLRRLREINLKATLMDGSTWSQTLRRQDALTQDSHI 241
Query: 1404 IKVSCGEDKMTWFVAS--------QKLRAGVIRPDVKTTEIALAL-TLQESNEGNYGPLL 1454
++ G + V +K R GV +EI+LA L E++EG
Sbjct: 242 TQLHRGPTVSDYIVVKHLVSNLPEEKGRVGVTE-----SEISLAFPPLHETSEG------ 290
Query: 1455 YQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
+ V+AFLP+R YG KF+LQGDF+L ++RE+V N PWN+ L FV A
Sbjct: 291 HTHAVYAFLPIRDYGFKFLLQGDFILTANREDVPDN-PWNRALREACADAFVRA------ 343
Query: 1515 LPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCK 1574
+ P + A ++S++P + GFF L MIL KL S ILE + + P
Sbjct: 344 IHHLNSGPRRYA--WISYIP-GRHILGFFEPLRDMILEKL--SQEPILEACSRKMVAPAT 398
Query: 1575 VLRGWNDRAHSLLPDILLQKHLGLGFL----SKDIVLSDS-------LARALGIEEYGPK 1623
+ S +P GL +KD+ LS S L LG +
Sbjct: 399 L---------SYVPLSRFTDDDGLPLTLSPSTKDLYLSCSYPALDYTLLSRLGTQTLDSN 449
Query: 1624 ILLQIISSLCRTE-NGLRSMGLSW---LASWLNELYTISFHSSGQSSLQSGVETDLIDNL 1679
LQ +SS+ T R W LA+ L LY H D +
Sbjct: 450 SFLQHLSSMISTRPQHFRQKPREWHEKLANVLIALYKAERHR---------------DCI 494
Query: 1680 QRIPFIPLSDGTFSSVDEGTI 1700
+ + IPLS+G + S D G I
Sbjct: 495 KGLEIIPLSNGEWVSADTGPI 515
>gi|302419583|ref|XP_003007622.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261353273|gb|EEY15701.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 1505
Score = 185 bits (470), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 153/517 (29%), Positives = 233/517 (45%), Gaps = 86/517 (16%)
Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLE 1212
+E ED + IRRD+ GLG + +N A L AL LS +LY +HFLLE
Sbjct: 10 REAEDH---IRQIRRDK-GLGDGPGQIGNN-----GADLESALKVLSHDLYQTATHFLLE 60
Query: 1213 LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--SSAG 1270
L+QNADDN+Y + PTL + + + NE+GFS +NI A+C + STK G SAG
Sbjct: 61 LIQNADDNMYTADA-PTLAITYRPGRVRIDCNERGFSRKNIEAICRICQSTKSGRSKSAG 119
Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKD 1330
++G KGIGFK+VF+V I S + +FD +G +G + P D
Sbjct: 120 FVGEKGIGFKAVFKVASTVWISSGHYSFRFD-RDGHLGMIAPIW---------------D 163
Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM-------- 1382
K T I L +EG IVD +L+FL RLQ +
Sbjct: 164 AFPEAPKPGFTSIYLKLARDCNEG----GIVDELRSYDAKILIFLRRLQRLEIDVQREFW 219
Query: 1383 ----FRNMLNDSLVVI----RKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
F+ +L+ + ++ DG+ K + W +L RP + ++
Sbjct: 220 KSGDFKTVLSRQATISGNPSMTTLMNDGVKK-----QYLVWRHTVNRLPNDARRPGISSS 274
Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWN 1494
E+ +A L + E PL+ +Q V+AFLP+R Y KF+LQ DF+L ++RE+V + PWN
Sbjct: 275 EVVIAFPLDKDGET---PLIERQSVYAFLPIRDYDFKFLLQADFLLSANREDVHADLPWN 331
Query: 1495 QWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH-GFFSGLPRMILSK 1553
L + F+ A R +L K + FV H G F+ L R +L+
Sbjct: 332 LALTTAVQKAFLDAVRHMSNLSN------KLRYTWFRFVTCSYSAHLGIFADLQRKLLAD 385
Query: 1554 LRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQ---KHLGLGFLSKDIVLSDS 1610
L+ + +L+ + P K+ R +P+I + L + + D LS
Sbjct: 386 LQKTQ--LLDSTWGRKKKPTKLTR---------VPEIFCDNDGRPFTLHYKNDDRYLSPK 434
Query: 1611 LA---------RALGIEEYGPKILLQIISSLCRTENG 1638
+ RALG+++ P+ + I L + G
Sbjct: 435 YSQDEPDASSLRALGVKDITPEAFMNEIDKLLKKHRG 471
>gi|389750862|gb|EIM91935.1| hypothetical protein STEHIDRAFT_151290 [Stereum hirsutum FP-91666
SS1]
Length = 1525
Score = 183 bits (465), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 129/366 (35%), Positives = 190/366 (51%), Gaps = 42/366 (11%)
Query: 1161 IVESIRRDEFGL-GPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
+V+SIR+D GL PN + +E LGRAL L+ +LY D+HF+LE VQNA+D
Sbjct: 3 VVQSIRKD-LGLDSPNPTELER--------MLGRALEHLADQLYRTDTHFVLEFVQNAED 53
Query: 1220 NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
N Y +V PTL+ +++ + + NE+GF N+RALC++G STKK G+IG KGIGF
Sbjct: 54 NAYLSDVTPTLSLRIEDYTMTISCNERGFDESNVRALCNIGTSTKK-HLQGFIGEKGIGF 112
Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
KSVF+V + + S F KFD + ++G + P + D+ LE
Sbjct: 113 KSVFKVANVVRVTSGDFRFKFDRDQ-RLGMIAPI----WEDDILPVYPGWTEFHLE---- 163
Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND----SLVVIR 1395
LP +T + G A+++ ++ + SLLLFL +L+CI LND ++ +
Sbjct: 164 --LTSLPGQTSITPGTALSSKLEQ---IRASLLLFLKKLRCIEVSFALNDPEDPRIITLF 218
Query: 1396 KKIVGDGIIKVS-CGEDK--------MTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
+ VG I+K++ C D+ M F A R V +EI LA + E
Sbjct: 219 RHDVGPDIVKLTHCEHDRPIRSDFYLMISFSADMSAIEEPKRQGVNDSEIVLAFPVTEDI 278
Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
+ + P Q V FLPLR G F++Q DF+ SSRE++ +S WN+ LL F+
Sbjct: 279 QPVFSP----QDVHVFLPLRPVGFNFVIQADFLTSSSREDILADSYWNKTLLYGVVDAFI 334
Query: 1507 SAERSF 1512
A F
Sbjct: 335 QAIDRF 340
>gi|378727103|gb|EHY53562.1| hypothetical protein HMPREF1120_01751 [Exophiala dermatitidis
NIH/UT8656]
Length = 1492
Score = 182 bits (462), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 165/584 (28%), Positives = 251/584 (42%), Gaps = 104/584 (17%)
Query: 1150 PEDKECEDAALIVESIRRDEFGL------GPNISNMESNMLKKQHARLGRALHCLSQELY 1203
P + + A V++IR+ E GL N+ N+E+ ALH L+ ELY
Sbjct: 4 PHIDQSDRARKHVDAIRK-EAGLDNGQDSSRNVQNLEN------------ALHTLADELY 50
Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
+ +HFLLEL+QNADDN Y EP +T ++ + + NE+GFS +N+ A+C VG+ST
Sbjct: 51 HKSTHFLLELIQNADDNEY-LVAEPEITLTYKDRRLRIDCNERGFSPQNVEAICRVGSST 109
Query: 1264 KKGS--SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE--GQIGFVLPTLVPPFN 1319
K G ++ Y+G KGIGFKSVF+ D I SN + KFD G I + P
Sbjct: 110 KAGQDRASSYVGEKGIGFKSVFKAADIVWISSNSYTFKFDKRNVLGMIAPIWEAFPEP-- 167
Query: 1320 IDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
DP W + + + S G ++ P +L+FL RL+
Sbjct: 168 ---------TDP------SWTSML-----LQLSPGYEEKQLLHDLKTFDPRMLIFLRRLK 207
Query: 1380 ---CIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMT--WFVAS---QKLRAGVIRPDV 1431
++ R + +K D + +D T + V+S + L A RP
Sbjct: 208 KVKIVIHRAGPVWQTTLTKKLQRNDQETAIELTQDHKTLRYMVSSYLAENLPAEPKRPGA 267
Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
++++I LA + N + Q V+AFLP+R YG KF+LQ DF+L ++REE+
Sbjct: 268 RSSQIMLAFP-----DCNGAEAIESQKVYAFLPVRDYGFKFLLQADFLLSANREEILAYP 322
Query: 1492 PWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMIL 1551
WNQ L F FV+ + +C + + ++ ++PL FF + IL
Sbjct: 323 QWNQALRQAFVDAFVAVTKEYCQ--------GRISHLWPQYIPLRPSQTRFFESARKEIL 374
Query: 1552 SKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSK-------- 1603
R+S +L N+ PC +L +PD K G+ S
Sbjct: 375 E--RLSREPVLRCRTNELMTPCSLLD---------VPDEFCDKE-GVPMTSTPTSDRAYL 422
Query: 1604 DIVLSDSLA---RALGIEEYGPKILLQIISSLCRTENGL--RSMGLSWLASWLNELYTIS 1658
V SL LG+ + I L ++S + L W L +S
Sbjct: 423 SCVYPTSLTENLEVLGVRKMTASIFLDHLASFITQHSELFREQKPDGWHTRLAEILLKLS 482
Query: 1659 FHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWL 1702
F S+ D + +P IPLSDG + S E ++L
Sbjct: 483 FDSANH------------DRIAELPVIPLSDGRWVSAKEDYVFL 514
>gi|383762367|ref|YP_005441349.1| hypothetical protein CLDAP_14120 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381382635|dbj|BAL99451.1| hypothetical protein CLDAP_14120 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 1539
Score = 181 bits (459), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 163/563 (28%), Positives = 252/563 (44%), Gaps = 77/563 (13%)
Query: 1165 IRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE 1224
IRR++F L + + + N L K L + LS +LYS+D+HF+LEL+QNA+DN YP
Sbjct: 18 IRREKFSLDADGTLLSQNPLAKD---LHNSAERLSADLYSKDTHFVLELIQNAEDNAYPP 74
Query: 1225 NVEPTLTFIL----------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGR 1274
V+P LTF L +++LNNE G E++ ALC VG STK GYIG
Sbjct: 75 GVQPDLTFCLLPEDPTATPGAHGALLILNNECGLRPEDVDALCAVGRSTKT-KRQGYIGE 133
Query: 1275 KGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ---IGFVLPTLVPPFNIDMFCRLLSKDP 1331
KGIGFKSVF+VT P + S G+ +FD +G+++P V P
Sbjct: 134 KGIGFKSVFKVTKRPHLFSAGYQFRFDRDPDPAIGLGYIVPYWV------------DDVP 181
Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
++ S+ TCI LP EG ++ + P +LFL RL+ + + DS
Sbjct: 182 PEVASQHDKTCILLP----LDEG-QWEPVIAQLQRIAPETILFLSRLEGLTV-ELPGDSP 235
Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPD-----VKTTEIALALTLQESN 1446
+ + ++ + GE T++V +++R + D V+T +++A L +
Sbjct: 236 LRLLIDKSKAPLVDLMDGETLATYWVHQREVRCPPDQGDEKRIGVETRTVSIAFPLARAR 295
Query: 1447 EGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALF 1505
E Y VFA+LP G F++ DF+L SSRE + + PWN WL E F
Sbjct: 296 EPIYS-------VFAYLPTEMQSGYPFLINADFILTSSREAILVDRPWNVWLRDEIVPCF 348
Query: 1506 VSAERSFC-DLPCFRENPAKAASVYMSFVPLVGEVH-GFFSGLPRMILSKLRMSNCLILE 1563
V + DL +++ F+PL +V FF I KL + +
Sbjct: 349 VEGFVALVRDLRYWQQA--------YGFIPLADKVQERFFQPPVERIHQKLANLPVVWVV 400
Query: 1564 GNNNQWAPPCKVLRGWNDRA----HSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEE 1619
G APP L R LP+ L Q L + ++ + RA+G++
Sbjct: 401 GKQEPVAPPYARLAPLTFRKLLDDGRPLPEQLTQTLL----VRPELEQFEKQLRAIGVKA 456
Query: 1620 YGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNL 1679
P + + + L+ W WL +Y S QS + L
Sbjct: 457 LQP----DEVRACLQDGAWLKDRPPQWFV-WLY-MYLQKEDWHAPSPWQSESRS-----L 505
Query: 1680 QRIPFIPLSDGTFSSVDEGTIWL 1702
+ +P IP ++G ++V + T++L
Sbjct: 506 RDLPIIPTTEGRLAAVGKETVYL 528
>gi|358347408|ref|XP_003637749.1| hypothetical protein MTR_101s0015 [Medicago truncatula]
gi|355503684|gb|AES84887.1| hypothetical protein MTR_101s0015 [Medicago truncatula]
Length = 1586
Score = 181 bits (458), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 179/588 (30%), Positives = 276/588 (46%), Gaps = 110/588 (18%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR +F +G N + + + L A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 VEEIRRTKFSIGGNPNPLTED--------LHHAVKNLSAELYAKDVHFLMELIQNAEDNH 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y E VEPTL F++ I ++ NNE+GFS +NI ++C VG STKKG+ S+GY
Sbjct: 60 YNEGVEPTLEFVITSDDITATGAPATLLIFNNEKGFSPKNIESICSVGRSTKKGNRSSGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF V+ P I SNG+ ++F+ +G+++P V P +D+
Sbjct: 120 IGEKGIGFKSVFLVSAQPYIFSNGYQIRFNEMPCPHCSLGYIVPEWVEEKPTLVDI---- 175
Query: 1327 LSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN 1385
K K T I LP + +N + S +HP +LLFL +++ + R
Sbjct: 176 --KQIYDGAGKALPTTTIVLPLKPD-----KVNPVKQQLSSVHPEVLLFLTKIRHLSVRE 228
Query: 1386 MLND---SLV-----------VIRKKIVGDGII------KVSCGEDKMTWFVASQK--LR 1423
+ + S+V V RK + + + S E + ++++ QK +R
Sbjct: 229 VNENPGQSMVTAVSISSEINFVTRKNMNAESYTLYLSAEENSNNEKECSYYMWKQKFPVR 288
Query: 1424 AGVI---RPDVKTTEIALALTLQES-NEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDF 1478
+ + R DV+ + LAL QE + G P V+AFLP FI+Q DF
Sbjct: 289 SESVVERRTDVEEWVVTLALPNQERLHRGKSSP-----GVYAFLPTEMVTNFPFIIQADF 343
Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE 1538
VL SSRE + ++ WNQ +L P F+ A ++ + + P + + F+P+
Sbjct: 344 VLASSRETILLDNKWNQGILECVPIAFMDAFKTL--VVGSDQAPVSSLARMFKFLPIESS 401
Query: 1539 VHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLRGWNDRAHSLLPDI--LLQK 1594
F+ + I KL N + +E Q + P +V R LLP+ +L K
Sbjct: 402 PFEKFNYVRDKIKEKLVNENIVPIETYKEQKHFYKPGEVSR--------LLPEFWNILSK 453
Query: 1595 HLGLGFLSKDIVLSDS---LARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWL 1651
G G ++ D L+ + EY +++ L G++S+ + W A +
Sbjct: 454 ARGEGVYLLNLSSHDGRKILSSSFDKSEYD-----NVLNFL-----GVKSVTVDWYAKCI 503
Query: 1652 NELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGT 1699
S+L GV DL LQ + FI +SS+ EGT
Sbjct: 504 Q-----------SSNLVDGVSEDLY--LQLLLFIA---KNWSSIFEGT 535
>gi|302775136|ref|XP_002970985.1| hypothetical protein SELMODRAFT_411712 [Selaginella moellendorffii]
gi|300160967|gb|EFJ27583.1| hypothetical protein SELMODRAFT_411712 [Selaginella moellendorffii]
Length = 1543
Score = 179 bits (454), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 191/695 (27%), Positives = 308/695 (44%), Gaps = 139/695 (20%)
Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
+ +E +RRD FG+G + N + ML++ + LS ELY +D HFL EL+QNA+D
Sbjct: 6 MFIEHLRRDRFGIGQSGPNPLALMLQE-------TIRLLSAELYQKDIHFLSELLQNAED 58
Query: 1220 NIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SA 1269
N Y +VEP++ L ++ + ++LNNE GFS +NI +LC G STKKG S
Sbjct: 59 NTYHSSVEPSIEIYLTKTDVTGTGAAATLLLLNNEVGFSEKNIESLCGAGQSTKKGKKSQ 118
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRL 1326
GYIG KGIGFKSVF VT P I +NGF ++FD +E ++G+++P V + ++
Sbjct: 119 GYIGEKGIGFKSVFLVTKQPYIITNGFRIRFDEEPHNEAKLGYIVPEWVDRPTDAVLQKV 178
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
SK + T I LP R + ++ + + + LLFL +++ + +
Sbjct: 179 CSKTELP------ATVIILPLRQE-----KISEVAKQLTQIPAETLLFLSKIRMLSIYDQ 227
Query: 1387 LN---DSLVVIRK---KIVGDGIIKVSC----------GEDKMTWFVASQKLR----AGV 1426
+ LV+ R +G ++ S G +++ Q L+ A V
Sbjct: 228 QSRPPTELVLSRSVEMTTSKNGSVETSTTFLTASNGRNGTQHSNYYMFKQTLKVPEEAKV 287
Query: 1427 -IRPDVKTTEIALALTLQE--SNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPS 1482
R DV + I LA + + S+ + G +F+FLP + GL F++ DF+L S
Sbjct: 288 EKRKDVDSWTITLAFPVDDRISSSCSIG------DIFSFLPSDIHSGLPFLINSDFLLVS 341
Query: 1483 SREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCF--RENPAKAASVYMSFVPLVGEVH 1540
SRE + +SPWNQ + P +F +A + F + P+ A +VP+ +
Sbjct: 342 SRETLLFDSPWNQGIFRCVPEVFYNAFELLLNTTAFGLQSIPSVARMAMYKYVPVTACRN 401
Query: 1541 GFFSGLPRMILSKLRMSNCLILEGNNNQWAPP----CK---------------------- 1574
+ + ILSKL+ +N ++ + + APP CK
Sbjct: 402 SYLERVRNDILSKLKDNNVILCASDVGR-APPSGQLCKPALARKIAPSFRELLKAAPKCP 460
Query: 1575 -VLRG--WN-DRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIIS 1630
LRG W R+ S + D LQ++ G S+ R LG+ + Q +
Sbjct: 461 RFLRGSPWFLRRSSSYIVDSTLQENYG------------SVLRLLGVPDMSNDEYHQSLD 508
Query: 1631 SLCRTENGLRSMG-------LSWLASWLNELYTISF--HSSGQSSLQSGVETDLIDNLQR 1681
S E+ +RS+ L +L+S + L ++ F + LQ DL N R
Sbjct: 509 S----EDWMRSLPDDAYVKLLVFLSSKSSTLRSLRFLKYRGSDGRLQLASMDDLARNDGR 564
Query: 1682 IPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLC-AKLRT------------V 1728
+ + S T WL VF P+ C AK+R
Sbjct: 565 SMY-------WDSGHRATAWLAGWTEVFQRWLTFNFVPSACAAKIRQHSVLENWLERTPF 617
Query: 1729 SPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQL 1763
S + L+A ++ + LG ++ ++ N L + + L
Sbjct: 618 SVSELTACSLARQVLGAVTANSSNAKLAVLAMHFL 652
>gi|297819496|ref|XP_002877631.1| ATP binding protein [Arabidopsis lyrata subsp. lyrata]
gi|297323469|gb|EFH53890.1| ATP binding protein [Arabidopsis lyrata subsp. lyrata]
Length = 1902
Score = 178 bits (451), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 141/435 (32%), Positives = 212/435 (48%), Gaps = 77/435 (17%)
Query: 1146 TQTLP-EDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYS 1204
T+T P + E A VE IRR +F +G E+ + + H +A+ LS ELY+
Sbjct: 188 TKTKPVRETMAESAKQHVERIRRTKFSIG----GAENPLTEDLH----QAVKNLSAELYA 239
Query: 1205 QDSHFLLELVQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRA 1255
+D HFL+EL+QNA+DN YPE V+P+L F++ I ++ NNE+GFS +NI +
Sbjct: 240 KDVHFLMELIQNAEDNEYPEGVDPSLEFVITSDDITATGAPATLLIFNNEKGFSEKNIDS 299
Query: 1256 LCDVGNSTKKGSSA-GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVL 1311
+C VG STKKG+ GYIG KGIGFKSVF +T P I SNG+ ++F+ S +G+++
Sbjct: 300 ICSVGRSTKKGNRKRGYIGEKGIGFKSVFLITSQPYIFSNGYQIRFNEAPCSHCSLGYIV 359
Query: 1312 PTLVP--PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
P V P +D+ R+ S T I LP + + + + S++HP
Sbjct: 360 PEWVEQHPSLVDI-QRMYGSG-----SALPTTTIILPLKYD-----KVKPVKEQLSNVHP 408
Query: 1370 SLLLFLHRLQCIMFR--------------NMLNDSLVVIRKKIVGDGII-------KVSC 1408
+LLFL +++ + R +++++ V RK I + K
Sbjct: 409 EVLLFLSKIKRLSIREHCLDAKLSTVNSIGIVSETNFVTRKSIDAESYTIHLSASEKGKN 468
Query: 1409 GEDKMTWFVASQKLRAGV-----IRPDVKTTEIALALTLQES-NEGNYGPLLYQQPVFAF 1462
E + ++++ QK R +V I LA E GN P +Y AF
Sbjct: 469 SEQECSYYMWKQKFPVKHENRVDRRSEVDDWVITLAFPFGERLGRGNSSPGIY-----AF 523
Query: 1463 LPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD------- 1514
LP FI+Q DF+L SSRE + + WNQ +LS P F++A +
Sbjct: 524 LPTEMVTNFPFIIQADFILASSRETILLDDIWNQGILSCVPIAFLNAFTTLVKKTDAPVS 583
Query: 1515 --LPCFRENPAKAAS 1527
LP FR P K ++
Sbjct: 584 SVLPAFRFLPVKESN 598
>gi|125577277|gb|EAZ18499.1| hypothetical protein OsJ_34026 [Oryza sativa Japonica Group]
Length = 1674
Score = 178 bits (451), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 142/451 (31%), Positives = 221/451 (49%), Gaps = 71/451 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ F +GP E N L + + +A+ LSQE+YS+D HFL+EL+QNA+DN
Sbjct: 8 VERIRRERFFIGPG----ERNPLAED---MHQAVKYLSQEIYSKDVHFLMELIQNAEDND 60
Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP V P+L F++ S +++ NNE+GFS N+ ++C VG STKKG+ GY
Sbjct: 61 YPSGVAPSLEFLITSKDITGLGAPSTLLIFNNEKGFSPSNVESICRVGKSTKKGNRHQGY 120
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF ++ P I SNG+ +KF+ +SE IG+++P V P D+
Sbjct: 121 IGEKGIGFKSVFLISRQPHIFSNGYRIKFNEDPSSECNIGYIVPEWVESKPSLSDIQELH 180
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
S P+ T I LP +++ ++ + S +HP +LLFL +++ + R
Sbjct: 181 GSSKPLP------TTTIILPLKSE-----KVDVVKKQLSSIHPEILLFLTKIRRLSVRED 229
Query: 1385 ------------NMLNDSLVVIRKKIVGDG-IIKVSCGEDK----MTWFVASQKLRAGV- 1426
+M ++ +RK I + + +S E+K +++ QK
Sbjct: 230 NSDPKCSTINEISMESEKNYQVRKNIHAESYTLHLSAQENKEQEECGYYMWRQKFPVKPE 289
Query: 1427 ----IRPDVKTTEIALAL----TLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
+R D+ I LA L + + G V+AFLP FI+Q D
Sbjct: 290 NRVDMRADIDEYVITLAFPHGQRLSRGKQSSPG-------VYAFLPTEILTNFPFIIQAD 342
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
F+L SSRE + +S WN+ +L P+ F++A + + PA + +F+P+
Sbjct: 343 FLLASSRETILFDSMWNKGILECVPSAFLNAFVALVKSSA--DAPAMSLPSMFNFLPVHP 400
Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
F + I K+R N + E Q
Sbjct: 401 SHVPFLEPVRSAIKDKVRTENIMPCESYTLQ 431
>gi|15229022|ref|NP_190446.1| ATP/DNA binding protein [Arabidopsis thaliana]
gi|7576215|emb|CAB87905.1| hypothetical protein [Arabidopsis thaliana]
gi|332644933|gb|AEE78454.1| ATP/DNA binding protein [Arabidopsis thaliana]
Length = 1899
Score = 177 bits (450), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 137/424 (32%), Positives = 209/424 (49%), Gaps = 76/424 (17%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
E A +E IRR +F +G E+ + + H +A+ LS ELY++D HFL+EL+Q
Sbjct: 194 ESAKQHIERIRRTKFSIG----GAENPLTEDLH----QAVKNLSAELYAKDVHFLMELIQ 245
Query: 1216 NADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKG 1266
NA+DN YPE V+P+L F++ I ++ NNE+GFS +NI ++C VG STKKG
Sbjct: 246 NAEDNEYPEGVDPSLEFVITSEDITNTGAPATLLIFNNEKGFSEKNIESICSVGRSTKKG 305
Query: 1267 S-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVP--PFNI 1320
+ GYIG KGIGFKSVF +T P I SNG+ ++F+ S +G+++P V P +
Sbjct: 306 NRKCGYIGEKGIGFKSVFLITSQPYIFSNGYQIRFNEAPCSHCSLGYIVPEWVDQHPSLV 365
Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
D+ R+ S T I LP ++ + + + S++HP +LLFL +++
Sbjct: 366 DI-QRMYGSG-----SALPTTTIILPLKSD-----KVKPVKEQLSNVHPEVLLFLSKIKR 414
Query: 1381 IMFR--------------NMLNDSLVVIRKKIVGDGII-------KVSCGEDKMTWFVAS 1419
+ R +++++ V RK I + K E + ++++
Sbjct: 415 LSIREHCLDPKLSTVNSIGIVSETNFVTRKSIDAESYTIHLSASEKGKNSEQECSYYMWR 474
Query: 1420 QKLRAGV-----IRPDVKTTEIALALTLQES-NEGNYGPLLYQQPVFAFLPLR-TYGLKF 1472
QK R +V+ I LA E GN P +Y AFLP F
Sbjct: 475 QKFPVKHENRVDRRSEVEEWVITLAFPFGERLGHGNNSPGIY-----AFLPTEMVTNFPF 529
Query: 1473 ILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD---------LPCFRENPA 1523
I+Q DF+L SSRE + + WNQ +L+ P F++A S LP FR P
Sbjct: 530 IIQADFILASSREMILLDDIWNQGILNCVPLAFLNAFTSLVKTTDAPVSSLLPAFRFLPV 589
Query: 1524 KAAS 1527
K ++
Sbjct: 590 KESN 593
>gi|218186840|gb|EEC69267.1| hypothetical protein OsI_38308 [Oryza sativa Indica Group]
Length = 1815
Score = 176 bits (447), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 144/424 (33%), Positives = 211/424 (49%), Gaps = 84/424 (19%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ F +G E N L + + +A+ LSQELYS+D HFL+EL+QNA+DN
Sbjct: 16 VERIRRERFYIG----REERNPLAED---IHQAVTYLSQELYSKDLHFLMELIQNAEDNE 68
Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP +VEP L F++ S ++V NNE+GFSA N+ ++C +G STKKG+ GY
Sbjct: 69 YPSDVEPALEFVIIKKDITATGAASTLLVFNNERGFSAANVESICRIGKSTKKGNRHLGY 128
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV---PPFNIDMFCR 1325
IG KGIGFKSVF V+ P I SNG+ +KF+ + + IG+++P V P +
Sbjct: 129 IGEKGIGFKSVFLVSSQPHIFSNGYQIKFNEEPSEDCDIGYIVPKWVDEKPSIDDIHAVY 188
Query: 1326 LLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN 1385
SK K T I LP ++ + + + S HP +LLFL +++ + R
Sbjct: 189 GYSK-------KLPTTTIILPLKSD-----KILPVKNELSSTHPEILLFLSKIRQLSVRE 236
Query: 1386 MLND------SLVVIRKKI---------VGDGIIKVSCGEDKM------TWFVASQKLRA 1424
+ +D S + I ++ + ++ E+K T+++ QK
Sbjct: 237 INDDPKASKLSQISISSEVNYKTQKDINAESYTLHLAMQENKRGDKEECTYYMWKQKF-- 294
Query: 1425 GVIRP---DVKTTEIA-LALTL-----QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFIL 1474
V++P D K E+ +TL Q + G P V+AFLP L FI+
Sbjct: 295 -VVKPECRDKKRMEVGKWVITLAFPHGQRLSRGARSP-----GVYAFLPTEMVTNLPFII 348
Query: 1475 QGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----------ERSFCDLPCFRENPAK 1524
Q DF+L SSRE + +S WN+ +L P FVSA SF P FR P
Sbjct: 349 QADFLLASSRESILLDSQWNRGILDCVPYAFVSAFEALLKSSSNAPSFALPPIFRFLPVA 408
Query: 1525 AASV 1528
+S+
Sbjct: 409 TSSI 412
>gi|222617066|gb|EEE53198.1| hypothetical protein OsJ_36070 [Oryza sativa Japonica Group]
Length = 1815
Score = 176 bits (447), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 144/423 (34%), Positives = 209/423 (49%), Gaps = 82/423 (19%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ F +G E N L + + +A+ LSQELYS+D HFL+EL+QNA+DN
Sbjct: 16 VERIRRERFYIG----REERNPLAED---IHQAVTYLSQELYSKDLHFLMELIQNAEDNE 68
Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP +VEP L F++ S ++V NNE+GFSA N+ ++C +G STKKG+ GY
Sbjct: 69 YPSDVEPALEFVIIKKDITATGAASTLLVFNNERGFSAANVESICRIGKSTKKGNRHLGY 128
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV---PPFNIDMFCR 1325
IG KGIGFKSVF V+ P I SNG+ +KF+ + + IG+++P V P +
Sbjct: 129 IGEKGIGFKSVFLVSSQPHIFSNGYQIKFNEEPSEDCDIGYIVPKWVDEKPSIDDIHAVY 188
Query: 1326 LLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN 1385
SK K T I LP ++ + + + S HP +LLFL +++ + R
Sbjct: 189 GYSK-------KLPTTTIILPLKSD-----KILPVKNELSSTHPEILLFLSKIRQLSVRE 236
Query: 1386 MLND------SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG-------------- 1425
+ +D S + I ++ D + + T +A Q+ + G
Sbjct: 237 INDDPKASKLSQISISSEV--DYKTQKDINAESYTLHLAMQENKRGDKEECTYYMWKQKF 294
Query: 1426 VIRP---DVKTTEI-ALALTL-----QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQ 1475
V++P D K E+ +TL Q + G P V+AFLP L FI+Q
Sbjct: 295 VVKPECRDKKRMEVDKWVITLAFPHGQRLSRGARSP-----GVYAFLPTEMVTNLPFIIQ 349
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----------ERSFCDLPCFRENPAKA 1525
DF+L SSRE + +S WN+ +L P FVSA SF P FR P
Sbjct: 350 ADFLLASSRESILLDSQWNRGILDCVPYAFVSAFEALLKSSSNAPSFALPPIFRFLPVAT 409
Query: 1526 ASV 1528
+S+
Sbjct: 410 SSI 412
>gi|302686242|ref|XP_003032801.1| hypothetical protein SCHCODRAFT_234274 [Schizophyllum commune H4-8]
gi|300106495|gb|EFI97898.1| hypothetical protein SCHCODRAFT_234274 [Schizophyllum commune H4-8]
Length = 1818
Score = 176 bits (445), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 168/576 (29%), Positives = 259/576 (44%), Gaps = 78/576 (13%)
Query: 1157 DAALIVESIRRDEFGLG----------PNISNMESNMLKKQHARLGRALHCLSQELYSQD 1206
+A ++E IR D +G G P++ + N+ L A +S++LY++
Sbjct: 129 EAQKLIEEIR-DGYGSGRSAQAKMQQDPSLQAVIGNLTTT----LENACRNVSEDLYAKS 183
Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG 1266
HF+LE+VQNADDN Y E +PTLT +++ + +NE GF N+RA+C VG STKKG
Sbjct: 184 PHFILEMVQNADDNKYAEGTKPTLTIRIEDDAVTFSSNELGFEPANVRAICSVGKSTKKG 243
Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRL 1326
S GR +GFKS F+V D I S F KFD E ++G + P ++C
Sbjct: 244 MS----GRCYVGFKSCFKVADVVHIASRDFQFKFDKRE-KLGMITP---------IWC-- 287
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
+ PV T + FR + S G + + + S L PSLLLFL +L +
Sbjct: 288 -TDHPV--------TKGQTTFRLELSPGESGAALAEHASKLEPSLLLFLRQLNTLKVETA 338
Query: 1387 L---NDSLVVIRKKIVGD-GIIKVSCGED----KMTWFVAS---QKLRAGVIRPDVKTTE 1435
L N ++ + + D +I ++ ED + T+F+ Q + R + +TE
Sbjct: 339 LSGQNPQIIHFKYRTTNDPDVITLTRWEDHRTTRDTYFIVHHTVQGMPEEEKRRGINSTE 398
Query: 1436 IALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQ 1495
+ LA + + EG P + AFLPLR YG KFI+Q DF+ P+SRE++ PWN
Sbjct: 399 VTLAFPIDK--EG--APQARAEYAHAFLPLRKYGFKFIIQADFLTPASREDILEEKPWNI 454
Query: 1496 WLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV-HGFFSGLPRMILSKL 1554
L + A ++A + RE + F+PL +V +G+F L I L
Sbjct: 455 TLRNGLLAAVLAA----IERLKIRE---AMKFTWFRFLPLETDVSNGYFKPLVANIRHTL 507
Query: 1555 RMSNCLILEGNNNQWAPPCKVLRGW------NDRAHSLLPDILLQKHLGLGFL--SKDIV 1606
+ + + A +L W D H P I Q + +L S D
Sbjct: 508 AAKSLFLSQDGEYHTAGQLLILPDWAAYQEKEDDKHVTKPLIPEQYLPFVWYLHSSYDTS 567
Query: 1607 LSDS--LARALGIEEYGPKILLQIISSLCRTENGLRSMGLS--WLASWLNELYTISFHSS 1662
++ L L ++ Q I++L N G S W + + LY I S
Sbjct: 568 INGDGPLLEGLRLKRMNSA---QFINALQAMVNKRAHEGRSPAWFNAVYDNLYRIYDEPS 624
Query: 1663 GQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEG 1698
+ + ++ + IPLSDGT++S +G
Sbjct: 625 NPNQKRGRKSYPYQTPIEALKLIPLSDGTWASKVDG 660
>gi|115485641|ref|NP_001067964.1| Os11g0514100 [Oryza sativa Japonica Group]
gi|77551084|gb|ABA93881.1| hypothetical protein LOC_Os11g31500 [Oryza sativa Japonica Group]
gi|113645186|dbj|BAF28327.1| Os11g0514100 [Oryza sativa Japonica Group]
Length = 1674
Score = 175 bits (443), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 141/451 (31%), Positives = 221/451 (49%), Gaps = 71/451 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ F +G E N L + + +A++ LSQE+YS+D HFL+EL+QNA+DN
Sbjct: 8 VERIRRERFFIGRG----ERNPLAED---MHQAVNYLSQEIYSKDVHFLMELIQNAEDND 60
Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP V P+L F++ S +++ NNE+GFS N+ ++C VG STKKG+ GY
Sbjct: 61 YPSGVAPSLEFLITSKDITGLGAPSTLLIFNNEKGFSPSNVESICRVGKSTKKGNRHQGY 120
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF ++ P I SNG+ +KF+ +SE IG+++P V P D+
Sbjct: 121 IGEKGIGFKSVFLISRQPHIFSNGYRIKFNEDPSSECNIGYIVPEWVESKPSLSDIQELH 180
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
S P+ T I LP +++ ++ + S +HP +LLFL +++ + R
Sbjct: 181 GSSKPLP------TTTIILPLKSE-----KVDVVKKQLSSIHPEILLFLTKIRRLSVRED 229
Query: 1385 ------------NMLNDSLVVIRKKIVGDG-IIKVSCGEDK----MTWFVASQKLRAGV- 1426
+M ++ +RK I + + +S E+K +++ QK
Sbjct: 230 NSDPKCSTINEISMESEKNYQVRKNIHAESYTLHLSAQENKEQEECGYYMWRQKFPVKPE 289
Query: 1427 ----IRPDVKTTEIALAL----TLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
+R D+ I LA L + + G V+AFLP FI+Q D
Sbjct: 290 NRVDMRADIDEYVITLAFPHGQRLSRGKQSSPG-------VYAFLPTEILTNFPFIIQAD 342
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
F+L SSRE + +S WN+ +L P+ F++A + + PA + +F+P+
Sbjct: 343 FLLASSRETILFDSMWNKGILECVPSAFLNAFVALVKSSA--DAPAMSLPSMFNFLPVHP 400
Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
F + I K+R N + E Q
Sbjct: 401 SHVPFLEPVRSAIKDKVRTENIMPCESYTLQ 431
>gi|357513121|ref|XP_003626849.1| Sacsin [Medicago truncatula]
gi|355520871|gb|AET01325.1| Sacsin [Medicago truncatula]
Length = 1699
Score = 174 bits (442), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 134/394 (34%), Positives = 196/394 (49%), Gaps = 61/394 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G N + + + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEEIRRKKFSIGGNPNPLTED--------LHQAVKNLSDELYTRDVHFLMELIQNAEDNH 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y E V PTL FI+ I ++ NNE+GFS +NI ++C VG STKKG+ S+GY
Sbjct: 60 YIEGVSPTLEFIITSKDITATGAPATLLIFNNEKGFSRQNIESICSVGRSTKKGNRSSGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF VT P I SNG+ ++F + +G+++P V P +D+ ++
Sbjct: 120 IGEKGIGFKSVFLVTAQPYIFSNGYQIRFSERPCTHCSLGYIVPEWVEEKPSLLDI-KQI 178
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
L + T I LP + E + S ++P +LLFL +++ + R
Sbjct: 179 YGAGKDSLPT----TIIVLPLKLDKVEPVKQQ-----LSSVNPEVLLFLLKIRHLSVRED 229
Query: 1387 LNDSL--------------VVIRKKIVGDG-IIKVSCG-----EDKMTWFVASQKLRAGV 1426
D V RK + + I +S G E K +++ QK +
Sbjct: 230 NEDPKKNTVTSVSISSEIDFVTRKNMDAESYTINLSAGEKSSDEKKCCYYMWKQKFPVKL 289
Query: 1427 -----IRPDVKTTEIALALTLQESNEGNYGPLLYQQP--VFAFLPLR-TYGLKFILQGDF 1478
R DV+ + LA QE + +Q V+AFLP FI+Q DF
Sbjct: 290 QNVVERRKDVEECVVTLAFPRQERLQKIKKLRKHQSSPGVYAFLPTEMVTNFPFIIQADF 349
Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
VL SSRE + N WNQ +L P+ FV A ++
Sbjct: 350 VLASSRETILLNDKWNQGILEYVPSAFVDAFKTL 383
>gi|125534525|gb|EAY81073.1| hypothetical protein OsI_36253 [Oryza sativa Indica Group]
Length = 1695
Score = 174 bits (441), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 141/451 (31%), Positives = 220/451 (48%), Gaps = 71/451 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ F +G E N L + + +A++ LSQE+YS+D HFL+EL+QNA+DN
Sbjct: 8 VERIRRERFFIGRG----ERNPLAED---MHQAVNYLSQEIYSKDVHFLMELIQNAEDND 60
Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP V P+L F++ S +++ NNE GFS N+ ++C VG STKKG+ GY
Sbjct: 61 YPSGVAPSLEFLITSKDITGLGAPSTLLIFNNENGFSPSNVESICRVGKSTKKGNRHQGY 120
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF ++ P I SNG+ +KF+ +SE IG+++P V P D+
Sbjct: 121 IGEKGIGFKSVFLISRQPHIFSNGYRIKFNEDPSSECNIGYIVPEWVESKPSLSDIQELH 180
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
S P+ T I LP +++ ++ + S +HP +LLFL +++ + R
Sbjct: 181 GSSKPLP------TTTIILPLKSE-----KVDVVKKQLSSIHPEILLFLTKIRRLSVRED 229
Query: 1385 ------------NMLNDSLVVIRKKIVGDG-IIKVSCGEDK----MTWFVASQKLRAGV- 1426
+M ++ +RK I + + +S E+K +++ QK
Sbjct: 230 NSDPKCSTINEISMESEKNYQVRKNIHAESYTLHLSAQENKEQEECGYYMWRQKFPVKPE 289
Query: 1427 ----IRPDVKTTEIALAL----TLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
+R D+ I LA L + + G V+AFLP FI+Q D
Sbjct: 290 NRVDMRADIDEYVITLAFPHGQRLSRGKQSSPG-------VYAFLPTEILTNFPFIIQAD 342
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
F+L SSRE + +S WN+ +L P+ F++A + + PA + +F+P+
Sbjct: 343 FLLASSRETILFDSMWNKGILECVPSAFLNAFVALVKSSA--DAPAMSLPSMFNFLPVHP 400
Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
F + I K+R N + E Q
Sbjct: 401 SHVPFLEPVRSAIKDKVRTENIMPCESYTLQ 431
>gi|393234641|gb|EJD42202.1| hypothetical protein AURDEDRAFT_152983 [Auricularia delicata
TFB-10046 SS5]
Length = 1430
Score = 174 bits (440), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 163/597 (27%), Positives = 263/597 (44%), Gaps = 78/597 (13%)
Query: 1190 RLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFS 1249
+L ++L L ELYS+ +HF+LE++QNADDN Y + EP L +L+ S + + NE GFS
Sbjct: 41 KLAQSLQVLCAELYSKPAHFVLEVIQNADDNTYAPDTEPELRIVLRHSHMEIQCNEVGFS 100
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
EN+RA C +G STKK S GYIG KGIG KS+F+V + S G+ D E ++G
Sbjct: 101 DENVRAFCGIGQSTKKNQS-GYIGEKGIGAKSIFQVARKVFVQSRGYSFVLDRDE-ELGM 158
Query: 1310 VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTK----FSEGIAMNNIVDMFS 1365
+ P W LP T+ F + +
Sbjct: 159 ISPH-------------------------WTELAVLPGWTRMTLEFCDEKQYYIVKAQLE 193
Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
+ ++LLF+ RL + ++ + ++ +KI GD + G + SQ + A
Sbjct: 194 HVRMTVLLFMRRLAHL---HIDDGTVFAASRKIDGDVVTITVPGAPSWNFLRVSQIVPAY 250
Query: 1426 VIRPDVK---TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPS 1482
P +TE++LA L ++ + P + Q V+AFLPLR G FI+Q DF+ +
Sbjct: 251 PYEPKRAHCVSTEVSLAFPLDQAGQ----PWISWQLVYAFLPLRAAGFTFIIQADFLTAA 306
Query: 1483 SREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG- 1541
+RE+V + WN L S +F+SA F LP + ++P +H
Sbjct: 307 NREDVLWDLEWNINLRSAISKVFLSAVEKFKRLPALE-------LTWPRYLP--ANIHDP 357
Query: 1542 FFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFL 1601
FF+ IL+ LR S + E + +P + + L + ++L +
Sbjct: 358 FFAPAAHEILASLRNSPIVKCEDGKYRHSPLVRCSSWFVSPTGEAL---VAAQYLPFSYA 414
Query: 1602 SKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASW--LNELYTISF 1659
+ R+LG+E+ K L + L E L S+G WL L LY ++
Sbjct: 415 APYPEDVKPHLRSLGVEDMTFKDFLDGL--LAMGEPRLTSIGNEWLEQVCSLINLYGTTW 472
Query: 1660 HSSGQSSLQSGVETDLIDN-LQRIPFIPLSDGTFSSVDEGTIWLHSDCS--VFDGGFGLE 1716
++ Q D +D ++++P + L G ++ ++ G D G G+
Sbjct: 473 NAKDQ-------RWDCVDQRVRQLPLVHLHTGRWARINSGRSLFFGDAQEIPLPSGLGIS 525
Query: 1717 AFPNLCAKLRTVSP---ALLSASAVDKSSLGVI--SVDNLNRMLLKIGVQQLSAHDI 1768
+ T SP LL V K +G + + L+RM K+ VQ++ H +
Sbjct: 526 -----LVQTPTSSPHRTQLLERLGVRKVEVGAVVDKILELHRMSSKMSVQEMLEHAL 577
>gi|302775142|ref|XP_002970988.1| hypothetical protein SELMODRAFT_411717 [Selaginella moellendorffii]
gi|300160970|gb|EFJ27586.1| hypothetical protein SELMODRAFT_411717 [Selaginella moellendorffii]
Length = 1493
Score = 172 bits (437), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 140/453 (30%), Positives = 221/453 (48%), Gaps = 66/453 (14%)
Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
+ +E +RR+ FG+G + N + ML++ + LS ELY +D HFL EL+QNA+D
Sbjct: 6 MFIERLRRERFGIGQSGPNPLAPMLQE-------TIRLLSAELYQKDIHFLSELLQNAED 58
Query: 1220 NIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SA 1269
N Y +VEP++ L ++ + ++LNNE GFS +NI +LC G STKKG S
Sbjct: 59 NTYHSSVEPSIEIYLTKTDVTGTGAAATLLLLNNEVGFSEKNIESLCGSGQSTKKGKKSQ 118
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRL 1326
GYIG KGIGFKSVF VT P I SNGF ++FD +E ++G+++P V + ++
Sbjct: 119 GYIGEKGIGFKSVFLVTKQPYIISNGFRIRFDEEPHNEAKLGYIVPEWVDRPTDAVLQKV 178
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI-MFRN 1385
SK ++ T I LP R + ++ + + + LLFL +++ + ++
Sbjct: 179 CSK------AELPATVIILPLRQE-----KISEVAKQLTQIPAETLLFLSKIRMLSIYDQ 227
Query: 1386 MLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI------------------ 1427
++ + V K E T+ AS R G
Sbjct: 228 QSRPPTELVLSRSVEMTTSKNESVETSTTFLTASNG-RNGTQHSNYYMFKQTLKVPEEAK 286
Query: 1428 ---RPDVKTTEIALALTLQE--SNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLP 1481
R DV + I LA + + S+ + G +F+FLP + GL F++ DF+L
Sbjct: 287 VENRKDVDSWTITLAFPVDDRISSSCSIG------DIFSFLPSDIHSGLPFLINSDFLLV 340
Query: 1482 SSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCF--RENPAKAASVYMSFVPLVGEV 1539
SSRE + +SPWNQ + P +F +A + F + P+ A +VP+
Sbjct: 341 SSRETLLFDSPWNQGIFRCVPEVFYNAFELLLNTTAFGLQSIPSVARMAMYKYVPVNACR 400
Query: 1540 HGFFSGLPRMILSKLRMSNCLILEGNNNQWAPP 1572
+ + + ILSKL+ +N +IL ++ APP
Sbjct: 401 NSYLERVRNDILSKLKDNN-VILCASDVGRAPP 432
>gi|393234597|gb|EJD42158.1| hypothetical protein AURDEDRAFT_186148 [Auricularia delicata
TFB-10046 SS5]
Length = 1503
Score = 172 bits (435), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 167/601 (27%), Positives = 262/601 (43%), Gaps = 97/601 (16%)
Query: 1142 GDCTTQTLPEDKECEDA--ALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLS 1199
G Q + + C++ A +V+ RRD P + N + + +K L ++L LS
Sbjct: 3 GISAEQQVVKQIRCDNGVDADLVQQARRD-----PVVWNTVTKLKEK----LEKSLQILS 53
Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
+ELYS+ +HF+LEL+QNADDN Y P + +L+ S + + NE GF+AEN++A C +
Sbjct: 54 EELYSKPAHFVLELLQNADDNSYAPGTTPEMRIVLRPSHMEIRCNETGFTAENVKAFCGI 113
Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFN 1319
G STK+ + GYIG KGIG KS+F+V +HS G+H D + ++G + P
Sbjct: 114 GQSTKRNQT-GYIGEKGIGAKSMFKVARKVHVHSLGYHFVLD-RDAELGMISP------- 164
Query: 1320 IDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
Q N R F + + + S+LLF+ RL
Sbjct: 165 -------------QWTGSSENLSGWTLLRLDFCDDKQYAIVKSQLGQVRESVLLFMRRLA 211
Query: 1380 CIMFRNMLNDSLVVIRKKIVGDG-IIKVSCGEDKMTW-FVASQKLRAGVIRPDVK----- 1432
+ + ++ S V R+ DG + +S K W Q LR P +
Sbjct: 212 RLTVDDSVDKSFSVERQM---DGKFLTISV---KNGWSMTCRQFLRVANTIPAYRHEPKR 265
Query: 1433 ----TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVD 1488
TTEI LA L S E PL+ +Q V AFLP+R+ G KFI+Q DF+ ++RE+V
Sbjct: 266 EHSETTEIVLAFPLNASRE----PLVSRQMVHAFLPVRSVGFKFIIQADFLTAANREDVL 321
Query: 1489 GNSPWNQWLLSEFPALFVSA-ERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGL 1546
+ WN+ L S F SA F +P + ++ F+P +V FFS +
Sbjct: 322 SDLEWNKTLRSGVVEAFCSAVTVQFQSMPTLD-------TTWLHFLPDPSDVADQFFSPI 374
Query: 1547 PRMILSKLRMSNCLILEGNNNQWAPPCKV---------LRGWNDRAHSLLPDILLQKHLG 1597
R I S R+ + +++ + W P V G LP
Sbjct: 375 TRDIRS--RIESLKVIKDADGFWRTPSLVRFPSAFSETTDGAGIVPEPELPFYYASNQYT 432
Query: 1598 LGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTI 1657
G SK A+ALG+ + +S L + + + G WL + +YT
Sbjct: 433 GGLASK--------AQALGVADM---TFNDFLSGLIKMGDRITRKGNVWLERVCSLIYTH 481
Query: 1658 SFHSSGQSSLQSGVETDLID-NLQRIPFIPLSDGTFSSVD-EGTIWLHSDCSVFDGGFGL 1715
F+ + Q L D ++ +P L++G + S ++ + S F G G+
Sbjct: 482 GFNWNSQ----------LRDRRIRDLPLARLANGNWISYGISAALFFNRSSSEFPSGLGI 531
Query: 1716 E 1716
+
Sbjct: 532 D 532
>gi|302142144|emb|CBI19347.3| unnamed protein product [Vitis vinifera]
Length = 1801
Score = 171 bits (433), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 136/419 (32%), Positives = 196/419 (46%), Gaps = 70/419 (16%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
E A L +E IR +F +G N + L A+ LS ELY++D HFL+EL+Q
Sbjct: 19 EAARLHIEKIRTQKFSIGAKSPNPLTE-------DLHHAVTSLSAELYAKDVHFLMELIQ 71
Query: 1216 NADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKG 1266
NA+DN Y + VEPTL F+ I +V NNE GFS +N+ ++C VG STKKG
Sbjct: 72 NAEDNEYKQGVEPTLEFVFTTKDITGSGAPATLLVFNNEVGFSEKNMESICSVGRSTKKG 131
Query: 1267 SS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ---IGFVLPTLV--PPFNI 1320
G+IG KGIGFKSVF VT P I SNG+HV+F Q IG+++P V P+ +
Sbjct: 132 KRHLGFIGEKGIGFKSVFLVTIQPHIFSNGYHVRFSEDPDQDCGIGYIVPEWVGGKPY-L 190
Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
C + D V T I LP + + E + S+LHP LLLFL +++
Sbjct: 191 STICDIYGSDKVLP-----TTIILLPLKPEKVEAVKAQ-----LSELHPELLLFLSKIKR 240
Query: 1381 I-----------------MFRNMLNDSLVVIRKKIVGDGIIKVSCGED----KMTWFVAS 1419
+ +F + D V + K ++ +S ED +++
Sbjct: 241 LSVCGNTCNSKEDVNVSDIFISTETDH-VALSDKSADSRVLHLSVKEDGAEETCRYYIWR 299
Query: 1420 QKLRAG-----VIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFI 1473
+ R DVK I++A E + + +FAFLP F+
Sbjct: 300 ETFPVKPANEFSARNDVKKCVISIAFPFGERLKRGTSAV----GIFAFLPTAMITNFPFV 355
Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSF 1532
+Q DFVL SSRE + ++ WN +L P+ F A S C +++ SV +F
Sbjct: 356 IQADFVLASSRETILLDNKWNLGILECIPSAFFHAFIS-----CVKKDATTICSVAQAF 409
>gi|443713407|gb|ELU06277.1| hypothetical protein CAPTEDRAFT_76413, partial [Capitella teleta]
Length = 180
Score = 171 bits (432), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 81/161 (50%), Positives = 115/161 (71%), Gaps = 7/161 (4%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR+EFG+G ++ + +K ARLGR+L L+ +LYS+D+HF+LEL+QNADDN
Sbjct: 13 IERIRREEFGIGLELTCEFQAVFEKNQARLGRSLQRLAHDLYSKDTHFVLELIQNADDNS 72
Query: 1222 YPE-------NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGR 1274
Y E +V PTL+F++ + + + NNE+GF ++++A+CDVG STK GYIG+
Sbjct: 73 YAEHLLNPDSDVVPTLSFVVSDRAVKISNNEKGFLEKHVKAICDVGCSTKPKHQMGYIGQ 132
Query: 1275 KGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
KGIGFKSVFRV+D PEI S+GFH KFD + +G++LP V
Sbjct: 133 KGIGFKSVFRVSDEPEIVSSGFHFKFDKNSSDMGYILPHWV 173
>gi|147787343|emb|CAN62566.1| hypothetical protein VITISV_022763 [Vitis vinifera]
Length = 1725
Score = 171 bits (432), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 136/419 (32%), Positives = 196/419 (46%), Gaps = 70/419 (16%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
E A L +E IR +F +G N + L A+ LS ELY++D HFL+EL+Q
Sbjct: 37 EAARLHIEKIRTQKFSIGAKSPNPLTE-------DLHHAVTSLSAELYAKDVHFLMELIQ 89
Query: 1216 NADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKG 1266
NA+DN Y + VEPTL F+ I +V NNE GFS +N+ ++C VG STKKG
Sbjct: 90 NAEDNEYKQGVEPTLEFVFTTKDITGSGAPATLLVFNNEVGFSEKNMESICSVGRSTKKG 149
Query: 1267 SS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ---IGFVLPTLV--PPFNI 1320
G+IG KGIGFKSVF VT P I SNG+HV+F Q IG+++P V P+ +
Sbjct: 150 KRHLGFIGEKGIGFKSVFLVTIQPHIFSNGYHVRFSEDPDQDCGIGYIVPEWVGGKPY-L 208
Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
C + D V T I LP + + E + S+LHP LLLFL +++
Sbjct: 209 STICDIYGSDKVLP-----TTIILLPLKPEKVEAVKAQ-----LSELHPELLLFLSKIKR 258
Query: 1381 I-----------------MFRNMLNDSLVVIRKKIVGDGIIKVSCGED----KMTWFVAS 1419
+ +F + D V + K ++ +S ED +++
Sbjct: 259 LSVCGNTCNSKEDVNVSDIFISTETDH-VALSDKSADSRVLHLSVKEDGAEETCRYYIWR 317
Query: 1420 QKLRAG-----VIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFI 1473
+ R DVK I++A E + + +FAFLP F+
Sbjct: 318 ETFPVKPANEFSARNDVKKCVISIAFPFGERLKRGTSAV----GIFAFLPTAMITNFPFV 373
Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSF 1532
+Q DFVL SSRE + ++ WN +L P+ F A S C +++ SV +F
Sbjct: 374 IQADFVLASSRETILLDNKWNLGILECIPSAFFHAFIS-----CVKKDATTICSVAQAF 427
>gi|242068521|ref|XP_002449537.1| hypothetical protein SORBIDRAFT_05g018376 [Sorghum bicolor]
gi|241935380|gb|EES08525.1| hypothetical protein SORBIDRAFT_05g018376 [Sorghum bicolor]
Length = 1709
Score = 170 bits (431), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 140/459 (30%), Positives = 225/459 (49%), Gaps = 66/459 (14%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ + +G E N L + + +A++ LSQELYS+D HFL+ELVQNA+DN
Sbjct: 11 VERIRRERYYIGRG----EQNPLAED---MHQAVNYLSQELYSKDVHFLMELVQNAEDNE 63
Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP V P+L F++ S +++ NNE+GFS+ NI ++C VG STKKG+ GY
Sbjct: 64 YPSEVAPSLEFLITSKDITGCGASSTLLIFNNERGFSSTNIESICRVGKSTKKGNRHQGY 123
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNIDMFCRLLS 1328
IG KGIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V + L
Sbjct: 124 IGEKGIGFKSVFLISSQPYIFSNGYQIKFNEKPCAECNIGYIVPQWV-----ESTPSLSD 178
Query: 1329 KDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
+ + SK T I LP +++ ++ + S +HP +LLFL +++ + R
Sbjct: 179 IEAIYGCSKVLPTTTIILPLKSE-----KIDAVKSQLSSMHPEMLLFLTKIRKLSVREDK 233
Query: 1388 ND------SLVVI--------RKKIVGDGII------KVSCGEDKMTWFVASQKLRAGVI 1427
+D S + I RK + + + E + +++ QK +
Sbjct: 234 SDPNNTTVSEISISSEKNYQARKNMHAESYTLHLSSEETGKDEAECGYYMWRQKFP---V 290
Query: 1428 RPDVKTT------EIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVL 1480
+P+ + E + L + G L ++AFLP FI+Q DF+L
Sbjct: 291 KPENRVNKRAEIDEWVITLAFPHGERLSRGKQL-SPGIYAFLPTEMVTSFPFIIQADFLL 349
Query: 1481 PSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH 1540
SSRE + ++PWN+ +L P+ F++A + + PA + SF+P+ +
Sbjct: 350 GSSREAILFDNPWNKGILDCVPSAFMNAFVTLVKSKA--DAPAMSLPSMFSFLPVNSSLI 407
Query: 1541 GFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
+ I K+ N + LE +Q + P +V R
Sbjct: 408 PLLEPVRSGIKYKVLTENIVPLESYASQKIFCKPSEVAR 446
>gi|242068523|ref|XP_002449538.1| hypothetical protein SORBIDRAFT_05g018443 [Sorghum bicolor]
gi|241935381|gb|EES08526.1| hypothetical protein SORBIDRAFT_05g018443 [Sorghum bicolor]
Length = 1699
Score = 170 bits (431), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 139/459 (30%), Positives = 226/459 (49%), Gaps = 66/459 (14%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ + +G E N L + + +A++ LSQELYS+D HFL+ELVQNA+DN
Sbjct: 11 VERIRRERYYIGRG----EQNPLAED---MHQAVNYLSQELYSKDVHFLMELVQNAEDNE 63
Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP V P+L F++ S +++ NNE+GFS+ NI ++C VG STKKG+ GY
Sbjct: 64 YPSEVAPSLEFLITSKDITGCGASSTLLIFNNERGFSSTNIESICRVGKSTKKGNRHQGY 123
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNIDMFCRLLS 1328
IG KGIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V + L
Sbjct: 124 IGEKGIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECNIGYIVPQWV-----ESTPSLSD 178
Query: 1329 KDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
+ + SK T I LP +++ ++ + S +HP +LLFL +++ + R
Sbjct: 179 IEALYGCSKVLPTTTIILPLKSEKTDAVK-----SQLSSMHPEMLLFLTKIRKLSVREDK 233
Query: 1388 ND------SLVVI--------RKKIVGDGII------KVSCGEDKMTWFVASQKLRAGVI 1427
+D S + I RK + + + E + +++ QK +
Sbjct: 234 SDPNNTTVSEISISSERNYQARKNMHAESYTLHLSSEETGKDEAECGYYMWRQKFP---V 290
Query: 1428 RPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVL 1480
+P+ + E + L + G L ++AFLP FI+Q DF+L
Sbjct: 291 KPENRVDKRAEIDEWVITLAFPHGERLSRGKQL-SPGIYAFLPTEMVTSFPFIIQADFLL 349
Query: 1481 PSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH 1540
SSRE + ++PWN+ +L P+ F++A + + PA + +F+P+ +
Sbjct: 350 GSSREAILFDNPWNKGILDCVPSAFMNAFVTLVKSKA--DAPAMSLPSMFNFLPVSSSLI 407
Query: 1541 GFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
+ I +K+ N + LE +Q + P +V R
Sbjct: 408 PLLEPVRSGIKNKVLTENIVPLESYASQKIFCKPSEVAR 446
>gi|414591567|tpg|DAA42138.1| TPA: hypothetical protein ZEAMMB73_860622 [Zea mays]
gi|414591568|tpg|DAA42139.1| TPA: hypothetical protein ZEAMMB73_860622 [Zea mays]
Length = 1700
Score = 169 bits (429), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 132/395 (33%), Positives = 194/395 (49%), Gaps = 77/395 (19%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VESIRR + +G E N L + L +A + LSQELYS+D HFL+ELVQNA+DN
Sbjct: 13 VESIRRKRYYIGRG----EQNPLAED---LHQATNYLSQELYSKDVHFLMELVQNAEDNE 65
Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP V P+L F++ S +++ NNE+GFSA NI ++C VG STKKG+ GY
Sbjct: 66 YPSEVAPSLEFLITSKDITGCGASSTLLIFNNERGFSATNIESICRVGKSTKKGNRHRGY 125
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV---PPF-NIDMFC 1324
IG KGIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V P +I+
Sbjct: 126 IGEKGIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECNIGYIVPEWVESTPSLSDIETLY 185
Query: 1325 RLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR 1384
P T I LP +++ ++ + S +HP +LLFL ++ + R
Sbjct: 186 GCSKVLP--------TTTIILPLKSE-----KIDAVKTQLSSMHPEMLLFLTKITKLSVR 232
Query: 1385 --NMLNDSLVV-----------IRKKIVGD-----------GIIKVSCGEDKMTWFVASQ 1420
+ LN + V RK + + G + CG +++ Q
Sbjct: 233 ENSDLNHTTVSEISITSEKNYQARKNMHAESYTLHLSAEETGKEEAECG-----YYMWRQ 287
Query: 1421 KLRAGVIRPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
K ++P+ + E + L G L ++AFLP FI
Sbjct: 288 KFP---VKPENRVDKRAEIDEWVITLAFPHGQRLTRGKQLLPS-IYAFLPTEMVTSFPFI 343
Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
+Q DF+L SSRE + +SPWN+ +L P F+ A
Sbjct: 344 IQADFLLASSREAILFDSPWNKGILECVPKAFMDA 378
>gi|359491119|ref|XP_003634225.1| PREDICTED: uncharacterized protein LOC100252577 [Vitis vinifera]
Length = 1711
Score = 169 bits (429), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 134/461 (29%), Positives = 218/461 (47%), Gaps = 71/461 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
++ IRR +F +G ++ + + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IKEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDND 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
YPE V P+L ++ I ++ NNE+GFSA+NI ++C VG STKK + GY
Sbjct: 60 YPEGVNPSLELVITSQDITGTGASATLLIFNNEKGFSAKNIESICSVGRSTKKNNRKCGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVP--PFNIDMFCRL 1326
IG KGIGFKSVF +T P I SNG+ ++F+ +G+++P V P +D ++
Sbjct: 120 IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCPHSNLGYIVPEWVQWNP-TLDDIKQI 178
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
V T I LP + + + + S + P +LLFL +++ +
Sbjct: 179 YGSHAV-----LPTTTIILPLKPD-----KIGPVKEQLSSIQPEVLLFLSKIKQFSVKKH 228
Query: 1387 LNDSLV--------------VIRKKIVGDGII-------KVSCGEDKMTWFVASQKL--- 1422
D + V RK I D I E + ++++ QK
Sbjct: 229 NEDPRLNTVNAISISSEINFVKRKNIDADSYILHLSTDGAKDVTEKECSYYMWRQKFPVR 288
Query: 1423 RAGVIRPDVKTTEIALALTL---QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDF 1478
+ + + E+ + L Q N G P V+AFLP FI+Q DF
Sbjct: 289 QENQVERRLGVEELVITLAFPFGQRLNRGISSP-----GVYAFLPTEMVTNFPFIIQADF 343
Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE 1538
VL SSRE + ++ WNQ +L P+ F++A S + ++ P + F+P+
Sbjct: 344 VLASSRETILLDNKWNQGILDCVPSAFLNAFISL--VTTSQDVPVSTLTPMFKFLPIYSS 401
Query: 1539 VHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
+ + + +I ++L N + E ++Q + PC+V R
Sbjct: 402 PYPKLNDVRELIKAELLKKNIVPCESYSDQKIFRKPCEVGR 442
>gi|357513141|ref|XP_003626859.1| hypothetical protein MTR_8g011330 [Medicago truncatula]
gi|355520881|gb|AET01335.1| hypothetical protein MTR_8g011330 [Medicago truncatula]
Length = 1702
Score = 169 bits (428), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 137/448 (30%), Positives = 217/448 (48%), Gaps = 65/448 (14%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR++F +G + + + L A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEEIRREKFSIGGKPNPLTYD--------LHHAVKNLSAELYAKDVHFLMELIQNAEDNH 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y + PTL F++ I ++ NNE+GFS +NI ++C VG STKK + S+GY
Sbjct: 60 YIDGENPTLEFVITSDDITGTGAPATLLIFNNEKGFSPKNIESICSVGRSTKKDNRSSGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF VT P I SNG+H++F+ +IG+V+P V P +D+ ++
Sbjct: 120 IGEKGIGFKSVFLVTAQPYIFSNGYHIRFNERPCPHCRIGYVVPEWVEEKPTLVDI-QKI 178
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
L + T I LP ++ + + S +HP +LLFL +++ + R
Sbjct: 179 YGAGKDSLPT----TTIVLPLKSD-----KVKPVKQQLSSIHPEVLLFLSKIRHLSVREG 229
Query: 1385 ------------NMLNDSLVVIRKKIVGDG-IIKVSCGED-----KMTWFVASQKLRAGV 1426
++ ++ V RK + I +S GE + ++++ QK +
Sbjct: 230 NKDFNQNTATSVSISSEINFVTRKNMNAKSYTIHLSTGESSRDKKECSYYMWKQKFPVKL 289
Query: 1427 -----IRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVL 1480
R V+ + LA QE + + V+AFLP FI+Q DFVL
Sbjct: 290 ENVVERRRGVEECVVTLAFPQQERLDKDKS----LPGVYAFLPTEMVTNFPFIIQADFVL 345
Query: 1481 PSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH 1540
SSRE + + WNQ +L P F+ A ++ + E P + F+P+
Sbjct: 346 ASSRETILFDDKWNQGILEYVPLAFIDAFKTL--VTGSDEAPLSSLPYMFKFLPIHSSPF 403
Query: 1541 GFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
F+ + + I +KL + +E NQ
Sbjct: 404 EIFNRVRQKIKAKLDEEKIIPIETFINQ 431
>gi|225456431|ref|XP_002284314.1| PREDICTED: uncharacterized protein LOC100247409 [Vitis vinifera]
Length = 1712
Score = 169 bits (427), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 142/460 (30%), Positives = 219/460 (47%), Gaps = 69/460 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G E N L + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEEIRRTKFSIGG-----EQNPLTED---LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
Y E+V P+L ++ I ++ NNE+GFSA+NI ++C VG STKKG+ GY
Sbjct: 60 YGEDVNPSLELVITSKDITGTRALMTLLIFNNEKGFSAKNIESICGVGRSTKKGNRKRGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPF-NIDMFCRLL 1327
IG KGIGFKSVF +T P I SNG+ ++F+ +G+++P V ++D ++
Sbjct: 120 IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCPHSNLGYIVPEWVEQNPSLDDIKQIY 179
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR--- 1384
V T I LP + + + + S + P +LLFL +++ +
Sbjct: 180 GSHAVLP-----TTTIILPLKPD-----KIKPVKEQLSSIQPEVLLFLSKIKHFSVKEDN 229
Query: 1385 -----NMLNDSLV------VIRKKIVGDGII-------KVSCGEDKMTWFVASQKLRAGV 1426
N +N + V RK I D E + ++++ QK
Sbjct: 230 EDPRLNTVNAISISSEINFVTRKNIDADSYTLHLSADETSDVTEKECSYYMWRQKFPVRQ 289
Query: 1427 -----IRPDVKTTEIALALTL-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
R +V+ I LA L Q N G P V+AFLP FI+Q DFV
Sbjct: 290 ENQVERRLEVEEWVITLAFPLGQRLNRGMSSP-----GVYAFLPTEMVTNFPFIIQADFV 344
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV 1539
L SSRE + ++ WNQ +L P+ F++A S + + P + F+P+
Sbjct: 345 LASSRETILLDNKWNQGILDCVPSAFLNALISL--VTTSEDVPVSTLTPMFKFLPIDSSS 402
Query: 1540 HGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
+ + + I +KL N + E ++Q + PC+V R
Sbjct: 403 YPKLNVVRESIKAKLLKENIIPCESYSDQKIFRKPCEVGR 442
>gi|147775835|emb|CAN78188.1| hypothetical protein VITISV_033142 [Vitis vinifera]
Length = 1488
Score = 168 bits (426), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 137/441 (31%), Positives = 211/441 (47%), Gaps = 55/441 (12%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G ++ + + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y E V P+L ++ I ++ NNE+GFSA+NI ++C VG STKK + GY
Sbjct: 60 YGEGVNPSLELVITSQDITDTGAPATLLIFNNEKGFSAKNIESICSVGRSTKKSNRKQGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVP--PFNIDMFCRL 1326
IG KGIGFKSVF +T P I SNG+ ++F+ S +G+++P V P D+
Sbjct: 120 IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCSHSNLGYIVPEWVEQNPSLADI---- 175
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
K + T I LP + + + S + P +LLFL +++ +
Sbjct: 176 --KQIYGSHAVLPTTTIILPLKPD-----KIKPVKQQLSSIQPEVLLFLSKIKQFSVKED 228
Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGV-----IRPDVKTTEIALALT 1441
D + I I E + ++++ QK R V+ I LA
Sbjct: 229 NKDPRLNTVNAI---SISTSDVTEKECSYYMWRQKFPVRQENQVERRLGVEEWVIKLAFP 285
Query: 1442 L-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
+ Q N G P +Y AFLP FI+Q DFVL SSRE + ++ WNQ +L
Sbjct: 286 IGQRLNRGMSSPGIY-----AFLPTEMVTNFPFIIQADFVLASSRETILLDNKWNQGILD 340
Query: 1500 EFPALFVSAERSFCDLPCFREN-PAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
P+ F++A F L E+ P + F+P+ + + + I +KL N
Sbjct: 341 CVPSAFLNA---FISLVTTSEDVPVSTLTPMFKFLPINSSSYPKLNVVRESIKAKLLTEN 397
Query: 1559 CLILEGNNNQ--WAPPCKVLR 1577
+ E ++Q + PC+V R
Sbjct: 398 IIPCESYSDQKIFRKPCEVGR 418
>gi|255540111|ref|XP_002511120.1| DNA binding protein, putative [Ricinus communis]
gi|223550235|gb|EEF51722.1| DNA binding protein, putative [Ricinus communis]
Length = 2299
Score = 168 bits (425), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 138/449 (30%), Positives = 207/449 (46%), Gaps = 74/449 (16%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E +R+ F +G N ++ML K A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 626 IEHLRKTTFSIGGQ-ENPLASMLDK-------AVEYLSAELYAKDVHFLMELIQNAEDNE 677
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
YP V+P+L F++ I ++ NNE GF+ +NI ++C VGNSTKKG+ GY
Sbjct: 678 YPAGVDPSLEFVITSRDITATGASATLLIFNNEMGFTPKNIDSICSVGNSTKKGNRKRGY 737
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNIDMFCRLLS 1328
IG KGIGFKSVF V+ P I SNG+ ++F + + ++G+++P V LS
Sbjct: 738 IGEKGIGFKSVFLVSAQPCIFSNGYQIRFSEKPSPDCKLGYIVPEWVED------SPSLS 791
Query: 1329 KDPVQLESKCW--NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
S+C T I LP + M + S +HP +LLFL +++C+ R
Sbjct: 792 DIKQIYGSRCSLPTTTIVLPLKPD-----KMGPVKQQLSSIHPEVLLFLSKIKCLSVRED 846
Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
D R V +A K V R ++ L L+ +E+
Sbjct: 847 NEDP----RLNTVSA---------------IAITKETNFVTRKNIDAESYTLHLSAEENG 887
Query: 1447 EGNYGPLLYQQPVFAFLPLR----------------TYGLKFILQGDFVLPSSREEVDGN 1490
+ + Y F P+R FI+Q DF+L SSRE + +
Sbjct: 888 DSSRAECSYHIWRQKF-PVRQENRVERRMDVEDWLMVTNFPFIIQADFILASSRETILLD 946
Query: 1491 SPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMI 1550
+ WNQ +L P FV+A S + + P + F+PL G + + + I
Sbjct: 947 NTWNQGILDCVPCAFVNALISL--VKSTEDAPLSSLPRMFEFLPLSGSPYPKLNVVRESI 1004
Query: 1551 LSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
+KL N L E Q + PC+V R
Sbjct: 1005 KTKLAAENILPSESYTVQKFFHKPCEVRR 1033
>gi|297734475|emb|CBI15722.3| unnamed protein product [Vitis vinifera]
Length = 875
Score = 167 bits (424), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 142/460 (30%), Positives = 219/460 (47%), Gaps = 69/460 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G E N L + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEEIRRTKFSIGG-----EQNPLTED---LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
Y E+V P+L ++ I ++ NNE+GFSA+NI ++C VG STKKG+ GY
Sbjct: 60 YGEDVNPSLELVITSKDITGTRALMTLLIFNNEKGFSAKNIESICGVGRSTKKGNRKRGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPF-NIDMFCRLL 1327
IG KGIGFKSVF +T P I SNG+ ++F+ +G+++P V ++D ++
Sbjct: 120 IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCPHSNLGYIVPEWVEQNPSLDDIKQIY 179
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR--- 1384
V T I LP + + + + S + P +LLFL +++ +
Sbjct: 180 GSHAV-----LPTTTIILPLKPD-----KIKPVKEQLSSIQPEVLLFLSKIKHFSVKEDN 229
Query: 1385 -----NMLNDSLV------VIRKKIVGDGII-------KVSCGEDKMTWFVASQKLRAGV 1426
N +N + V RK I D E + ++++ QK
Sbjct: 230 EDPRLNTVNAISISSEINFVTRKNIDADSYTLHLSADETSDVTEKECSYYMWRQKFPVRQ 289
Query: 1427 -----IRPDVKTTEIALALTL-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
R +V+ I LA L Q N G P V+AFLP FI+Q DFV
Sbjct: 290 ENQVERRLEVEEWVITLAFPLGQRLNRGMSSP-----GVYAFLPTEMVTNFPFIIQADFV 344
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV 1539
L SSRE + ++ WNQ +L P+ F++A S + + P + F+P+
Sbjct: 345 LASSRETILLDNKWNQGILDCVPSAFLNALISL--VTTSEDVPVSTLTPMFKFLPIDSSS 402
Query: 1540 HGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
+ + + I +KL N + E ++Q + PC+V R
Sbjct: 403 YPKLNVVRESIKAKLLKENIIPCESYSDQKIFRKPCEVGR 442
>gi|357513125|ref|XP_003626851.1| Sacsin [Medicago truncatula]
gi|355520873|gb|AET01327.1| Sacsin [Medicago truncatula]
Length = 1698
Score = 166 bits (420), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 137/421 (32%), Positives = 202/421 (47%), Gaps = 78/421 (18%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR++F +G +N +++ H +A+ LS ELYS + HF+ EL+QNA+DN+
Sbjct: 8 IEEIRREKFSIGEKRAN---PLIEDLH----QAVKNLSAELYSTNVHFVKELIQNAEDNL 60
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKK-GSSAGY 1271
Y + V P+L FI+ I ++ NNE+GFS ENI +LC VG STKK S+GY
Sbjct: 61 YNDGVSPSLEFIITSKDITATGAPATLLIFNNEKGFSRENIESLCSVGRSTKKENRSSGY 120
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF VT P I SNG+ ++F+ +G+++P V P +D
Sbjct: 121 IGEKGIGFKSVFLVTAQPYIFSNGYQIRFNERPCPHCSLGYIVPEWVEEKPTLVD----- 175
Query: 1327 LSKDPVQLESKCWNTCIRLPF---RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF 1383
+ K + NT I LP R +F + S +HP +LLFL +++ +
Sbjct: 176 IQKIYGAGKDSLPNTIIVLPLKPDRVRF--------VNQQLSSIHPEVLLFLSKIRRLSV 227
Query: 1384 RNMLND------SLVVIRKKI---------VGDGIIKVSCGED-----KMTWFVASQKLR 1423
R D + V I +I I +S GE+ + ++++ QK
Sbjct: 228 RENSEDPKENTLTAVSISSEINFETSRNMDAESYTIHLSAGENNNDEQECSYYIWKQKFP 287
Query: 1424 AGV-----IRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGD 1477
+ R V+ + LA QE + VFAFLP FI+Q D
Sbjct: 288 VKLENVVERRMGVEECVVTLAFPHQERLHKDKS----LPGVFAFLPTEMVTNFPFIIQAD 343
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSA---------ERSFCDLP-CFRENPAKAAS 1527
FVL SSRE + + WN+ +L P+ F+ A E C LP F+ P ++
Sbjct: 344 FVLASSRETILLDDKWNKGILEYVPSAFIDAFKTLVTGSDEAPMCSLPYVFKFIPINSSP 403
Query: 1528 V 1528
V
Sbjct: 404 V 404
>gi|77555433|gb|ABA98229.1| expressed protein [Oryza sativa Japonica Group]
Length = 1831
Score = 166 bits (420), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 144/439 (32%), Positives = 209/439 (47%), Gaps = 98/439 (22%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ------ 1215
VE IRR+ F +G E N L + + +A+ LSQELYS+D HFL+EL+Q
Sbjct: 16 VERIRRERFYIG----REERNPLAED---IHQAVTYLSQELYSKDLHFLMELIQLRLAAF 68
Query: 1216 ----------NADDNIYPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRAL 1256
NA+DN YP +VEP L F++ S ++V NNE+GFSA N+ ++
Sbjct: 69 FTCVISSSALNAEDNEYPSDVEPALEFVIIKKDITATGAASTLLVFNNERGFSAANVESI 128
Query: 1257 CDVGNSTKKGSS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLP 1312
C +G STKKG+ GYIG KGIGFKSVF V+ P I SNG+ +KF+ + + IG+++P
Sbjct: 129 CRIGKSTKKGNRHLGYIGEKGIGFKSVFLVSSQPHIFSNGYQIKFNEEPSEDCDIGYIVP 188
Query: 1313 TLV---PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
V P + SK K T I LP ++ + + + S HP
Sbjct: 189 KWVDEKPSIDDIHAVYGYSK-------KLPTTTIILPLKSD-----KILPVKNELSSTHP 236
Query: 1370 SLLLFLHRLQCIMFRNMLND------SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLR 1423
+LLFL +++ + R + +D S + I ++ D + + T +A Q+ +
Sbjct: 237 EILLFLSKIRQLSVREINDDPKASKLSQISISSEV--DYKTQKDINAESYTLHLAMQENK 294
Query: 1424 AG--------------VIRP---DVKTTEI-ALALTL-----QESNEGNYGPLLYQQPVF 1460
G V++P D K E+ +TL Q + G P V+
Sbjct: 295 RGDKEECTYYMWKQKFVVKPECRDKKRMEVDKWVITLAFPHGQRLSRGARSP-----GVY 349
Query: 1461 AFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----------E 1509
AFLP L FI+Q DF+L SSRE + +S WN+ +L P FVSA
Sbjct: 350 AFLPTEMVTNLPFIIQADFLLASSRESILLDSQWNRGILDCVPYAFVSAFEALLKSSSNA 409
Query: 1510 RSFCDLPCFRENPAKAASV 1528
SF P FR P +S+
Sbjct: 410 PSFALPPIFRFLPVATSSI 428
>gi|225456428|ref|XP_002280614.1| PREDICTED: uncharacterized protein LOC100257713 [Vitis vinifera]
Length = 1725
Score = 166 bits (419), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 141/462 (30%), Positives = 216/462 (46%), Gaps = 73/462 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G ++ + + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y E V P+L ++ I ++ NNE+GFSA+NI ++C VG STKK + GY
Sbjct: 60 YGEGVNPSLELVITSQDITDTGAPATLLIFNNEKGFSAKNIESICSVGRSTKKSNRKQGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVP--PFNIDMFCRL 1326
IG KGIGFKSVF +T P I SNG+ ++F+ S +G+++P V P D+
Sbjct: 120 IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCSHSNLGYIVPEWVEQNPSLADI---- 175
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
K + T I LP + + + S + P +LLFL +++ +
Sbjct: 176 --KQIYGSHAVLPTTTIILPLKPD-----KIKPVKQQLSSIQPEVLLFLSKIKQFSVKED 228
Query: 1385 ------NMLNDSLV------VIRKKIVGDGII-------KVSCGEDKMTWFVASQKLRAG 1425
N +N + V RK I D E + ++++ QK
Sbjct: 229 NKDPRLNTVNAISISSEINFVTRKNIDADSYTLHLSTDEASDVTEKECSYYMWRQKFPVR 288
Query: 1426 V-----IRPDVKTTEIALALTL-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDF 1478
R V+ I LA + Q N G P +Y AFLP FI+Q DF
Sbjct: 289 QENQVERRLGVEEWVIKLAFPIGQRLNRGMSSPGIY-----AFLPTEMVTNFPFIIQADF 343
Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFREN-PAKAASVYMSFVPLVG 1537
VL SSRE + ++ WNQ +L P+ F++A F L E+ P + F+P+
Sbjct: 344 VLASSRETILLDNKWNQGILDCVPSAFLNA---FISLVTTSEDVPVSTLTPMFKFLPINS 400
Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
+ + + I +KL N + E ++Q + PC+V R
Sbjct: 401 SSYPKLNVVRESIKAKLLTENIIPCESYSDQKIFRKPCEVGR 442
>gi|297734473|emb|CBI15720.3| unnamed protein product [Vitis vinifera]
Length = 2156
Score = 165 bits (418), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 133/438 (30%), Positives = 208/438 (47%), Gaps = 80/438 (18%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G ++ + + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 686 IEEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 737
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y E V P+L ++ I ++ NNE+GFSA+NI ++C VG STKK + GY
Sbjct: 738 YGEGVNPSLELVITSQDITDTGAPATLLIFNNEKGFSAKNIESICSVGRSTKKSNRKQGY 797
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVP--PFNIDMFCRL 1326
IG KGIGFKSVF +T P I SNG+ ++F+ S +G+++P V P D+
Sbjct: 798 IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCSHSNLGYIVPEWVEQNPSLADI---- 853
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ-CIMFRN 1385
K + T I LP + + + S + P +LLFL +++ M+R
Sbjct: 854 --KQIYGSHAVLPTTTIILPLKPD-----KIKPVKQQLSSIQPEVLLFLSKIKHYYMWRQ 906
Query: 1386 M--LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
+ V R+ V + +IK++ F Q+L G+ P
Sbjct: 907 KFPVRQENQVERRLGVEEWVIKLA--------FPIGQRLNRGMSSPG------------- 945
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
++AFLP FI+Q DFVL SSRE + ++ WNQ +L P
Sbjct: 946 ---------------IYAFLPTEMVTNFPFIIQADFVLASSRETILLDNKWNQGILDCVP 990
Query: 1503 ALFVSAERSFCDLPCFREN-PAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLI 1561
+ F++A F L E+ P + F+P+ + + + I +KL N +
Sbjct: 991 SAFLNA---FISLVTTSEDVPVSTLTPMFKFLPINSSSYPKLNVVRESIKAKLLTENIIP 1047
Query: 1562 LEGNNNQ--WAPPCKVLR 1577
E ++Q + PC+V R
Sbjct: 1048 CESYSDQKIFRKPCEVGR 1065
Score = 165 bits (417), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/421 (29%), Positives = 203/421 (48%), Gaps = 51/421 (12%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
++ IRR +F +G ++ + + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IKEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDND 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
YPE V P+L ++ I ++ NNE+GFSA+NI ++C VG STKK + GY
Sbjct: 60 YPEGVNPSLELVITSQDITGTGASATLLIFNNEKGFSAKNIESICSVGRSTKKNNRKCGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVP--PFNIDMFCRL 1326
IG KGIGFKSVF +T P I SNG+ ++F+ +G+++P V P +D ++
Sbjct: 120 IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCPHSNLGYIVPEWVQWNP-TLDDIKQI 178
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
V T I LP + + + + S + P +LLFL +++ +
Sbjct: 179 YGSHAV-----LPTTTIILPLKPD-----KIGPVKEQLSSIQPEVLLFLSKIKQFSVKKH 228
Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKL---RAGVIRPDVKTTEIALALTL- 1442
D + I I E + ++++ QK + + + E+ + L
Sbjct: 229 NEDPRLNTVNAI---SISTKDVTEKECSYYMWRQKFPVRQENQVERRLGVEELVITLAFP 285
Query: 1443 --QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
Q N G P V+AFLP FI+Q DFVL SSRE + ++ WNQ +L
Sbjct: 286 FGQRLNRGISSP-----GVYAFLPTEMVTNFPFIIQADFVLASSRETILLDNKWNQGILD 340
Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNC 1559
P+ F++A S ++ P + F+P+ + + + +I ++L N
Sbjct: 341 CVPSAFLNAFISLVTTS--QDVPVSTLTPMFKFLPIYSSPYPKLNDVRELIKAELLKKNI 398
Query: 1560 L 1560
+
Sbjct: 399 V 399
>gi|449440953|ref|XP_004138248.1| PREDICTED: uncharacterized protein LOC101220895 [Cucumis sativus]
Length = 1742
Score = 165 bits (418), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 127/391 (32%), Positives = 196/391 (50%), Gaps = 71/391 (18%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G + + + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 47 IEDIRRSKFSIGGPPNPLTED--------LHQAVRNLSAELYTKDVHFLMELIQNAEDNE 98
Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y +V+P+L FI+ + +++ NNE GFS +NI ++C VG STKK + GY
Sbjct: 99 YSTSVKPSLEFIITSRDVTGTGAATTLLIFNNEIGFSFKNIDSICSVGRSTKKNNRERGY 158
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPFNIDMFCRLLS 1328
IG KGIGFKSVF +T P I SNG+ ++F+ +GFV+P V R+LS
Sbjct: 159 IGEKGIGFKSVFLITSQPYIFSNGYQIRFNEQPCPHCGVGFVVPEWVEE------NRILS 212
Query: 1329 --KDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
K+ +S T I LP + + + S++HP +LLFL +++ + R +
Sbjct: 213 NIKEIYGPQSILPTTTIVLPLKAD-----KIKPVKQQLSNIHPEVLLFLSKIKQLSVREV 267
Query: 1387 LND--------------SLVVIRKKIVGDGII-----KVSCGEDKMTWFVASQKLRAGVI 1427
D + V RK I + + S G + ++++ QK +
Sbjct: 268 NEDPKSSTVNAISISSETNFVSRKNIDAESYTLHLSSEESVGGTQCSYYMWKQKFP---V 324
Query: 1428 RPDVKTT------EIALALTL---QESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
+P+ + E+ + L Q N G+ P V+AFLP FI+Q D
Sbjct: 325 KPENRVERRMGVGELVIILAFPNGQRLNGGDKSP-----GVYAFLPTEMITDFPFIIQSD 379
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
FVL SSRE + ++ WNQ +L P+ FV+A
Sbjct: 380 FVLSSSRETILLDNEWNQGILDCVPSAFVNA 410
>gi|449501472|ref|XP_004161377.1| PREDICTED: uncharacterized LOC101220895 [Cucumis sativus]
Length = 1703
Score = 165 bits (418), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 127/391 (32%), Positives = 196/391 (50%), Gaps = 71/391 (18%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G + + + L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEDIRRSKFSIGGPPNPLTED--------LHQAVRNLSAELYTKDVHFLMELIQNAEDNE 59
Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y +V+P+L FI+ + +++ NNE GFS +NI ++C VG STKK + GY
Sbjct: 60 YSTSVKPSLEFIITSRDVTGTGAATTLLIFNNEIGFSFKNIDSICSVGRSTKKNNRERGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPFNIDMFCRLLS 1328
IG KGIGFKSVF +T P I SNG+ ++F+ +GFV+P V R+LS
Sbjct: 120 IGEKGIGFKSVFLITSQPYIFSNGYQIRFNEQPCPHCGVGFVVPEWVEE------NRILS 173
Query: 1329 --KDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
K+ +S T I LP + + + S++HP +LLFL +++ + R +
Sbjct: 174 NIKEIYGPQSILPTTTIVLPLKAD-----KIKPVKQQLSNIHPEVLLFLSKIKQLSVREV 228
Query: 1387 LND--------------SLVVIRKKIVGDGII-----KVSCGEDKMTWFVASQKLRAGVI 1427
D + V RK I + + S G + ++++ QK +
Sbjct: 229 NEDPKSSTVNAISISSETNFVSRKNIDAESYTLHLSSEESVGGTQCSYYMWKQKFP---V 285
Query: 1428 RPDVKTT------EIALALTL---QESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
+P+ + E+ + L Q N G+ P V+AFLP FI+Q D
Sbjct: 286 KPENRVERRMGVGELVIILAFPNGQRLNGGDKSP-----GVYAFLPTEMITDFPFIIQSD 340
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
FVL SSRE + ++ WNQ +L P+ FV+A
Sbjct: 341 FVLSSSRETILLDNEWNQGILDCVPSAFVNA 371
>gi|224136274|ref|XP_002322288.1| predicted protein [Populus trichocarpa]
gi|222869284|gb|EEF06415.1| predicted protein [Populus trichocarpa]
Length = 1356
Score = 164 bits (416), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 126/389 (32%), Positives = 192/389 (49%), Gaps = 64/389 (16%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IR+ F +G E N L L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEHIRKTTFSIGG-----EKNPLAPM---LDQAVKYLSAELYAKDIHFLMELIQNAEDNE 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
Y E V+P+L F++ I ++ NNE+GFSA+NI ++C+VGNSTKKG+ GY
Sbjct: 60 YLERVDPSLEFVITSRDITNTGAPATLLIFNNEKGFSAKNIESICNVGNSTKKGNRKRGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF + P I SNG+ ++F+ +G+++P V P D+
Sbjct: 120 IGEKGIGFKSVFLIAAQPYIFSNGYQIRFNEKPCPHCNLGYIVPEWVDDSPSLSDI---- 175
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
K S T + LP + +N + S +HP +LLFL +++ + R
Sbjct: 176 --KQIYGSASTLPTTTLILPLKPD-----KVNPVKQQLSSIHPEILLFLSKIKRLSVREE 228
Query: 1387 LND------SLVVI--------RKKIVGDGII-------KVSCGEDKMTWFVASQKLRAG 1425
D S V I RK + + + E + ++++ QK
Sbjct: 229 NEDPRLNTVSAVAITKETNFVQRKNMDAESYTLHLSAEENIDEFEQECSYYLWKQKFPVR 288
Query: 1426 V-----IRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
+R +V+ I LA + E + + + ++AFLP G FI+Q DF+
Sbjct: 289 EDNKVDMRMEVEDWVITLAFP---NGERLHRGMKHSPGIYAFLPTEMVTGFPFIIQADFI 345
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
L SSRE + ++ WNQ +L P F+ A
Sbjct: 346 LASSRETIRWDNIWNQGILDCVPFAFIEA 374
>gi|357513119|ref|XP_003626848.1| hypothetical protein MTR_8g011200 [Medicago truncatula]
gi|355520870|gb|AET01324.1| hypothetical protein MTR_8g011200 [Medicago truncatula]
Length = 1645
Score = 164 bits (415), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 132/420 (31%), Positives = 206/420 (49%), Gaps = 59/420 (14%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI---------VV 1241
L A+ L ELY++D HFL+EL+QNA+DN Y E V PTL F++ I ++
Sbjct: 73 LHHAVKNLFAELYAKDVHFLMELIQNAEDNHYNEGVNPTLEFVITSDDITGTGAPATLLI 132
Query: 1242 LNNEQGFSAENIRALCDVGNSTKKGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
NNE+GF ++NI ++C VG STKKG+ S+GYIG KGIGFKSVF VT P + SNG+ ++F
Sbjct: 133 FNNEKGFFSKNIDSICSVGRSTKKGNRSSGYIGEKGIGFKSVFLVTAQPYVFSNGYQIRF 192
Query: 1301 DTS---EGQIGFVLPTLV--PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGI 1355
+ +G+++P V P +D+ ++ L + T I LP ++ E +
Sbjct: 193 NEKPCPHCSLGYIVPEWVEEKPTLVDI-KKIYGAGKGSLRT----TTIVLPLKSDKVEPV 247
Query: 1356 AMNNIVDMFSDLHPSLLLFLHRLQCIMFR--------NMLNDSLV------VIRKKIVGD 1401
S HP +LLFL +++ + R N + D + V RK + +
Sbjct: 248 KQK-----LSSFHPEVLLFLTKIRHLSVREVSENPKQNTVTDVSISSEINFVTRKNMNAE 302
Query: 1402 G-IIKVSC-----GEDKMTWFVASQKL-----RAGVIRPDVKTTEIALALTLQES-NEGN 1449
I +S E + ++++ QK A R DV+ + LA QE + G
Sbjct: 303 SYTIHLSAEENSDAEKECSYYMWKQKFPVRPENAVERRTDVEEWVVTLAFPNQERLHRGK 362
Query: 1450 YGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
P V+AFLP FI+Q DFVL SSRE + ++ WNQ +L P+ F+ A
Sbjct: 363 SSP-----GVYAFLPTGMVTNFPFIIQADFVLASSRETILLDNKWNQGILDCVPSAFLDA 417
Query: 1509 ERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
++ + P + + F+P+ F+ + I +KL + + +E + Q
Sbjct: 418 FKTLVIGSV--QAPVSSLASMFKFLPIESSTFEKFNYVRDKIKAKLVEESIVPIETYSKQ 475
>gi|320033070|gb|EFW15019.1| hypothetical protein CPSG_08207 [Coccidioides posadasii str.
Silveira]
Length = 2121
Score = 164 bits (415), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 174/594 (29%), Positives = 272/594 (45%), Gaps = 91/594 (15%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
EDA +E IR ++ GLG +SN S L+ AL L+++LY ++ FL+EL+Q
Sbjct: 19 EDAQQHIERIRHEK-GLG--LSNHLSRDLEN-------ALVILAKQLYKTNTRFLMELIQ 68
Query: 1216 NADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY---I 1272
NADDN Y +V P+LT ++ + V NE GF+ E++ ++C +G S +KG SA I
Sbjct: 69 NADDNRYRRDVSPSLTLSYRKMHLRVDCNELGFAPEDVDSICRIGGS-QKGISADRMDCI 127
Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
G KGIGFKSVF+V I S + KFD + ++G + P + ++ P
Sbjct: 128 GEKGIGFKSVFKVASVVWISSGKYSFKFDNAT-RLGMIAP-------------MWAQFPA 173
Query: 1333 QLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ----CIMFRNMLN 1388
+ + + +RL ++ E + + D P +L+FL++++ CI N
Sbjct: 174 EAPASGTSMLLRLLNKSVRDELLKEMRVCD------PKILIFLNQIRELKVCIAQDNGTT 227
Query: 1389 DSLVVIRK--KIV---GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
+ RK K V G + + G D FV + + +G EI LA ++
Sbjct: 228 FERTISRKDDKDVDKPGQYVTTLRYG-DSSQQFVVIRYVPSGR---STDKAEILLAFPIK 283
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
+ E P++ Q V+ FLP+R YGLKF++QG F L +SRE+++G+S NQ L S P
Sbjct: 284 NNQE----PVIQSQCVYTFLPIRDYGLKFLVQGRFSLIASREDINGSSAENQRLRSLIPK 339
Query: 1504 LFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILE 1563
FV A F + FR ++ ++P++ E FF+ L + +S +LE
Sbjct: 340 AFVQAVDYFQESDYFR-------YIWPRYLPIIPE-ENFFTSLTEETI--FTLSQRKLLE 389
Query: 1564 GNNNQWAPPCKVLR---GWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS-LARALGIEE 1619
+ + A P + + D LP L +LS+ DS L + LG+
Sbjct: 390 SESGELATPGTLFYVPPRFRDPNGDPLP---LCSRTAFRYLSRKYFDEDSDLFQCLGVRP 446
Query: 1620 YGPKILLQIISSLCRTENGLRSMGLSW---LASWLNELYTISFHSSGQSSLQSGVETDLI 1676
+ LL I S+ W L+ LN + L++ T
Sbjct: 447 LSLRDLLVDIKSVVNQYQNHSREATEWHIRLSKALNR----------EDMLRAYRHT--- 493
Query: 1677 DNLQRIPFIPLSDGTFSSVDE--GTIWLHS---DCSVFDGGFGLEAFPNLCAKL 1725
L IP IPL DG + S E G I+ S D SV G +E P+ + +
Sbjct: 494 --LHSIPLIPLRDGRWISASESAGIIFFPSALADLSVPSGLNSVEIHPDAASNV 545
>gi|209875783|ref|XP_002139334.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209554940|gb|EEA04985.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 2286
Score = 164 bits (414), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 147/534 (27%), Positives = 237/534 (44%), Gaps = 78/534 (14%)
Query: 1161 IVESIRRDEFGLGPNISN--MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNAD 1218
IV I+ + FG+G + N+L + + R LS+ LY +HF LEL+QNAD
Sbjct: 686 IVSLIQLNRFGIGIDSRKDPQLENILLQYSKLIKRTGEKLSENLYCDRNHFQLELIQNAD 745
Query: 1219 DNIYP-----ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK-KGSSAGYI 1272
DN Y + P+L FI GI+VLNNE GF+ +I +L D+ NS+K I
Sbjct: 746 DNFYEKLKYHQTSVPSLKFITGSGGILVLNNEDGFTPLDIMSLSDLANSSKIDCGKIRRI 805
Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR-LLSKDP 1331
G+ G+GFKSVF +TD P I SNGF ++FD ++ P V ++ + LL
Sbjct: 806 GKFGLGFKSVFTITDTPYIFSNGFKIRFDAQSPYGAYIYPEWVSCEITELIPKDLLDISG 865
Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
++ WNT I LPF+ + I L P LLF ++L + + ++
Sbjct: 866 YEVNKSAWNTVIWLPFKARKLTRIH----------LKPEFLLFTNKL--VYLEHTQKNAT 913
Query: 1392 VVIRK--KIVGDG------IIKVSCGEDKMTWFVASQKLRAG---------VIRPDV--- 1431
+ I+K +I+ D I+ + +KL G ++ D+
Sbjct: 914 ISIKKCTEILDDNSYFTTIFIQEYIHSKDAERTIKKRKLLPGGRYWKRQFLLVDYDISSI 973
Query: 1432 --------------KTTEIALALTLQESNEGNYGPLLYQQ--PVFAFLPLRTYGLKFILQ 1475
+ +I + + L+ N G L +Q VFAFLP+R+YG FI+
Sbjct: 974 CKQNHWLDREIYKNRQKKIVIGVELKADESTNTGVCLGKQYCDVFAFLPIRSYGFHFIIH 1033
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
DF L SSRE + N+ WN L P + A +S+ L +N + ++S +PL
Sbjct: 1034 ADFELTSSREGIATNNEWNCLLRDCIPNALIHAIQSWRRL----DNLSMLHKSFLSVIPL 1089
Query: 1536 VGE-VHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWN-------DRAHSLL 1587
+ V FF+ + L+ C+ + ++ P VL +N D +
Sbjct: 1090 SSDYVDNFFAPIITKFHHMLKSVPCIYTQSGIFEY--PRNVLCSFNTYSGLSVDTISKIF 1147
Query: 1588 PDI-----LLQKHLGLGFLSKDI--VLSDSLARALGIEEYGPKILLQIISSLCR 1634
P++ +L KH + D+ + ++ LGI+ I++ +I + R
Sbjct: 1148 PNLDELCFVLDKHCKKKIICSDVEKYVEHTIFDDLGIDSVNLVIIITLIDGMIR 1201
>gi|218185812|gb|EEC68239.1| hypothetical protein OsI_36251 [Oryza sativa Indica Group]
Length = 2351
Score = 164 bits (414), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 124/387 (32%), Positives = 195/387 (50%), Gaps = 60/387 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ + +G E N L + + +A+ LSQE+YS+D HFL+ELVQNA+DN
Sbjct: 17 VERIRRERYYIG----REERNPLAED---MHQAVIYLSQEIYSKDVHFLMELVQNAEDNE 69
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
YP V P+L F++ + I ++ NNE+GFS NI ++ VG STKKG+ GY
Sbjct: 70 YPSGVAPSLEFLVTSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTKKGNRDKGY 129
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRLLS 1328
IG KGIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V R +
Sbjct: 130 IGEKGIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECGIGYIVPEWVESRPSLSDIRTIY 189
Query: 1329 KDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-L 1387
L + T I LP +++ ++ + S +HP +LLFL +++ + + L
Sbjct: 190 GSSKVLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQLSVKEENL 240
Query: 1388 NDSLVVI-------------RKKIVGDGI------IKVSCGEDKMTWFVASQKLRAGVIR 1428
N + RK + + ++ GE++ +++ QK ++
Sbjct: 241 NPKCSTVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQKFP---VK 297
Query: 1429 PD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLP 1481
P+ + E + L + G L V+AFLP FI+Q DF+L
Sbjct: 298 PENRVDKRAEIDEWVIILAFPYGQRLSRGKQL-SPGVYAFLPTEMVTNFPFIIQADFLLA 356
Query: 1482 SSREEVDGNSPWNQWLLSEFPALFVSA 1508
SSRE + +SPWN+ +L P+ F++A
Sbjct: 357 SSREAILFDSPWNKGILECVPSAFLNA 383
Score = 160 bits (404), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/387 (31%), Positives = 195/387 (50%), Gaps = 60/387 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ + +G E N L + + +A+ LSQE+YS+D HFL+ELVQNA+DN
Sbjct: 761 VERIRRERYYIG----REERNPLAED---MHQAVTYLSQEIYSKDVHFLMELVQNAEDNE 813
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
YP V P+L F++ + I ++ NNE+GFS NI ++ VG STKKG+ GY
Sbjct: 814 YPSGVAPSLEFLVSSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTKKGNRDKGY 873
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRLLS 1328
IG KGIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V R +
Sbjct: 874 IGEKGIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECGIGYIVPEWVESRPSLSDIRTIY 933
Query: 1329 KDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-L 1387
L + T I +P +++ ++++ S +HP +LLFL +++ + + +
Sbjct: 934 GSSKVLPT----TTIIIPLKSE-----KVDSVKKQLSSIHPEMLLFLSKIRQLSVKEENV 984
Query: 1388 NDSLVVI-------------RKKIVGDGI------IKVSCGEDKMTWFVASQKLRAGVIR 1428
N + RK + + ++ GE++ +++ QK ++
Sbjct: 985 NPKCSTVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQKFP---VK 1041
Query: 1429 PD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLP 1481
P+ + E + L + G + V+AFLP FI+Q DF+L
Sbjct: 1042 PENRVDKRAEIDEWVITLAFPHGQRLSRGKQM-SPGVYAFLPTEMVTNFPFIIQADFLLA 1100
Query: 1482 SSREEVDGNSPWNQWLLSEFPALFVSA 1508
SSRE + +S WN+ +L P+ F++A
Sbjct: 1101 SSREAILFDSTWNKGILECVPSAFLNA 1127
Score = 77.4 bits (189), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 69/260 (26%), Positives = 121/260 (46%), Gaps = 42/260 (16%)
Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
GIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V R +
Sbjct: 479 GIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECGIGYIVPEWVESRPSLSDIRTIYGSSK 538
Query: 1333 QLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV 1392
L + T I LP +++ ++ + S +HP +LLFL +++ +
Sbjct: 539 VLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQLS---------- 579
Query: 1393 VIRKKIVGDGIIKVSC----GEDKMTWFVASQKLRAGVIRPD------VKTTEIALALTL 1442
++++ V V+ GE++ +++ QK ++P+ + E + L
Sbjct: 580 -VKEENVNHKCSPVTLENGKGEEECGYYMWRQKFP---VKPENRVDKRAEIDEWVITLAF 635
Query: 1443 QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEF 1501
+ G + V+AFLP FI+Q DF+L SSRE + +SPWN+ +L
Sbjct: 636 PYGQRLSRGKQM-SPGVYAFLPTEMVTNFPFIIQADFLLASSREAILFDSPWNKGILECV 694
Query: 1502 PALFVSA----ERSFCDLPC 1517
P+ F++A +S D+P
Sbjct: 695 PSAFLNAFVALVKSGADVPA 714
>gi|392866750|gb|EAS30073.2| hypothetical protein CIMG_08498 [Coccidioides immitis RS]
Length = 2143
Score = 163 bits (413), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 173/594 (29%), Positives = 273/594 (45%), Gaps = 91/594 (15%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
EDA +E IR ++ GLG +SN S L+ AL L+++LY ++ FL+EL+Q
Sbjct: 19 EDAQQHIERIRHEK-GLG--LSNHLSRDLEN-------ALVILAKQLYKTNTRFLMELIQ 68
Query: 1216 NADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY---I 1272
NADDN Y +V P+LT ++ + V NE GF+ E++ ++C +G S +KG SA I
Sbjct: 69 NADDNRYRRDVSPSLTLSYRKMHLRVDCNELGFAPEDVDSICRIGGS-QKGISADRMDCI 127
Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
G KGIGFKSVF+V I S + KFD + ++G + P + ++ P
Sbjct: 128 GEKGIGFKSVFKVASVVWISSGKYSFKFDNAT-RLGMIAP-------------MWAQFPA 173
Query: 1333 QLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ----CIMFRNMLN 1388
+ + + +RL ++ E + + D P +L+FL++++ CI N
Sbjct: 174 EAPASGTSMLLRLLNKSVRDELLKEMRVCD------PKILIFLNQIRELKVCIAQDNGTT 227
Query: 1389 DSLVVIRK--KIV---GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
+ RK K V G + + G D FV + + +G EI LA ++
Sbjct: 228 FERTISRKDDKDVDKPGQYVTTLRYG-DSSQQFVVIRYVPSGR---STDKAEILLAFPIK 283
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
+ E P++ Q V+ FLP+R YGLKF++QG F L +SRE+++G+S NQ L S P
Sbjct: 284 NNQE----PVIQSQCVYTFLPIRDYGLKFLVQGRFSLIASREDINGSSAENQRLRSLIPK 339
Query: 1504 LFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILE 1563
FV A F + FR ++ ++P++ E FF+ L + +S +LE
Sbjct: 340 AFVQAVDYFQESDYFR-------YIWPRYLPIIPE-ENFFTSLTEETI--FTLSQRKLLE 389
Query: 1564 GNNNQWAPPCKVLR---GWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS-LARALGIEE 1619
+ + A P + + D LP L +LS+ DS L + LG+
Sbjct: 390 SESGELATPGTLFYVPPRFRDPNGDPLP---LCSRTAFRYLSRKYFDEDSDLFQCLGVRH 446
Query: 1620 YGPKILLQIISSLCRTENGLRSMGLSW---LASWLNELYTISFHSSGQSSLQSGVETDLI 1676
+ LL + S+ W L+ LN + L++ T
Sbjct: 447 LSLRDLLVDVKSVVNQYQNHSREATEWHIRLSKALNR----------EDMLRAYRHT--- 493
Query: 1677 DNLQRIPFIPLSDGTFSSVDE--GTIWLHS---DCSVFDGGFGLEAFPNLCAKL 1725
L+ IP IPL DG + S E G I+ S D SV G +E P+ + +
Sbjct: 494 --LRSIPLIPLRDGRWISASESAGIIFFPSALADLSVPSGLDSVEIHPDAASNV 545
>gi|436670130|ref|YP_007317869.1| hypothetical protein Cylst_6579 [Cylindrospermum stagnale PCC 7417]
gi|428262402|gb|AFZ28351.1| hypothetical protein Cylst_6579 [Cylindrospermum stagnale PCC 7417]
Length = 1205
Score = 163 bits (413), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 160/661 (24%), Positives = 296/661 (44%), Gaps = 118/661 (17%)
Query: 1428 RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEV 1487
R +VK TE+ LA +LQE +G+ L +Q VFAFLP R+YG KF++Q DF++P++RE++
Sbjct: 79 RENVKYTELILAFSLQE--DGSADTRL-EQKVFAFLPTRSYGFKFLIQADFLVPANREDI 135
Query: 1488 DGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGL 1546
++ WN+W+ F+ A F + + Y +++PL E+ FF +
Sbjct: 136 HKDTQWNKWIRDNIATTFLLAVEKF-------KQDSNLQKTYYNYIPLNSEIKDEFFIPV 188
Query: 1547 PRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIV 1606
I +KL+ S C++ E +W P V++ +++ L+ + LQK L ++ ++
Sbjct: 189 VSEIHNKLKTSECILTES--GKWEIPSHVVQV-DEQIRKLISNADLQKLLNKEYIHPEVK 245
Query: 1607 LSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLA----SWLNELYTISFHSS 1662
S+ +LG++++ LLQ C N WL +W ELY
Sbjct: 246 AKTSILESLGVKKFSFDDLLQ-----CLQNN-------EWLQKQSDNWFVELYIY----- 288
Query: 1663 GQSSLQSGVETDLID--NLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPN 1720
L+S D D ++++ IPL + +S+ E +I+ + G + E
Sbjct: 289 ----LKSCNLNDYSDLPKIKKLKIIPLENNQLASIAETSIFFPFNNV---GEYSFE---- 337
Query: 1721 LCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDE 1780
L+ + LL ++ S V L K+GVQ S ++I++ HILP +
Sbjct: 338 ----LQFIKKTLLESN----SKFAV------TEFLKKLGVQNASCYEIIENHILPLYKGD 383
Query: 1781 TTANGDKNLMADYLCFVMMHLEYY--------CPNCH-VEREFIVSELRKKAFVLTN-HG 1830
+ N + +Y+C++ +L Y P + +E + L++ + T+ +G
Sbjct: 384 -SWKSKANQLLEYICYIKDNLSEYEKEFNKIKNPYSYSYSKEDPLKSLKELLLIKTDKNG 442
Query: 1831 FKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKW-----YEVDITYLKHPAKESLSCGLVKW 1885
+ RP ++ +GNP + L+ I W Y D+ +K+P+++ + + +W
Sbjct: 443 YSRPGN--VYLPASYGNPNELETLLSGIQDVWFVSPEYIQDLMQIKNPSEK--AEKIREW 498
Query: 1886 RKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELV----HL 1941
++FF ++G+ ++ DV + G N ID+ + H+
Sbjct: 499 KEFFIKLGVHTVPKI-----DVKIQTARGRSNQ-----------KIDYYKEYPIYSSPHI 542
Query: 1942 LSLLTTNVNRQCSKHLLEILDTLWD--------DCYTDKIMGFFKSKPTGDDRSFQSSFI 1993
+ +L T N + ++ L + LD+ WD Y GF+ P D +
Sbjct: 543 IRILETK-NVEKNQKLAKFLDSNWDYYKQYKSWQNYAYANGGFYSYNPNDAD------WF 595
Query: 1994 NCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEV 2053
I W + + +L P ++F D +ILG S PY+ + + + D+G+K ++
Sbjct: 596 TKIKTAAW-LPTTKGKLANPSEVFLDKTETGTILGDSVPYLAISLNNPDFIKDLGIKEKI 654
Query: 2054 T 2054
T
Sbjct: 655 T 655
>gi|224136270|ref|XP_002322287.1| predicted protein [Populus trichocarpa]
gi|222869283|gb|EEF06414.1| predicted protein [Populus trichocarpa]
Length = 1467
Score = 163 bits (412), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 128/389 (32%), Positives = 195/389 (50%), Gaps = 64/389 (16%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IR+ F +G E N L L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEHIRKTTFSIGG-----EKNPLAPM---LDQAVKYLSAELYAKDVHFLMELIQNAEDNE 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
Y E V+P+L F++ I ++ NNE+GFSA+NI ++C+VGNSTKKG+ GY
Sbjct: 60 YLERVDPSLEFVITSRDITNTGAPATLLMFNNEKGFSAKNIESICNVGNSTKKGNRKRGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF + P I SNG+ ++F+ +G+++P V P D+
Sbjct: 120 IGEKGIGFKSVFLIAAQPYIFSNGYQIRFNEKPCPHCNLGYIVPEWVDDSPSLSDI---- 175
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
K S T + LP + +N + S +HP +LLFL +++ + R
Sbjct: 176 --KQIYGSASTLPTTTLILPLKPD-----KVNPVKQQLSSIHPEILLFLSKIKRLSVREE 228
Query: 1387 LND------SLVVI--------RKKIVGDG-IIKVSCGED------KMTWFVASQKLRAG 1425
D S V I RK + + + +S E+ + ++++ QK
Sbjct: 229 NADPRLNTVSAVAITKETNFMQRKNMDAESYTLHLSADENSDEFEKECSYYLWKQKFP-- 286
Query: 1426 VIRP----DVKTTEIALALTLQESN-EGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
+RP D++ +TL N E + + Y ++AFLP FI+Q DF+
Sbjct: 287 -VRPENRVDMRMGVDDWVITLAFPNGERLHRGMKYSPGIYAFLPTEMVTDFPFIIQADFI 345
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
L SSRE + ++ WNQ +L P F+ A
Sbjct: 346 LASSRETIRWDNIWNQGILDCVPFAFIEA 374
>gi|449295343|gb|EMC91365.1| hypothetical protein BAUCODRAFT_152628 [Baudoinia compniacensis UAMH
10762]
Length = 1509
Score = 161 bits (407), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 155/530 (29%), Positives = 236/530 (44%), Gaps = 73/530 (13%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-NEQGFS 1249
L AL LS+ELYS+ +HF+LEL+QNADDN YPE+V P LT + +E G + + NE GF+
Sbjct: 33 LTNALRILSEELYSKPTHFILELIQNADDNNYPEDVRPKLTLLYREDGYLWIGCNEIGFT 92
Query: 1250 AENIRALCDVGNSTKK--GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
A N+RALC +G+STKK S GYIG KGIGFKS F+V + S KFD + +
Sbjct: 93 AANVRALCGIGDSTKKVENSQKGYIGEKGIGFKSAFKVAAEIWVKSGAISFKFDKHK-PL 151
Query: 1308 GFVLPTLVPPF------NIDMFCRLL---SKDPVQLESKCWNTCIRLPFRTKFSEGIAMN 1358
G + P F MFC + + VQ + L R +
Sbjct: 152 GMIAPMWTDFFENAHIKERTMFCFHILPEHQGTVQENLLELKPELLLFLRKLH---VLQV 208
Query: 1359 NIVDMFSDLHPSLLLFLHRLQCIMFRNM-LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFV 1417
I +D+ S L + R++ L + R + + + D+M +
Sbjct: 209 KIQKTSADVKHSFSLSKVEGETAGIRSITLRHQMFRPRNQTYTEAFLVF----DQMIENM 264
Query: 1418 ASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGD 1477
++K R P+V ++I LA + + N+ P + + F FLP+R+YGL F+LQ D
Sbjct: 265 PTEKKR-----PNVMKSQIVLAFPVNDQNQ----PAMRSRLTFNFLPVRSYGLPFVLQAD 315
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
F+L +SREE+ + WN L++ +FV+ F R + VP
Sbjct: 316 FMLSASREEILLGNRWNAVLVAATIDVFVACVERFNSTNTLRYTWPRFTKPLG--VPYGS 373
Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPC-------KVLRGWNDRAHSLLPDI 1590
+ +F L R R+ +LE + +A P + G N + + PD
Sbjct: 374 IFNSYFKELWR------RLMQEPVLEAQDGTFAKPALLQSVPLEFTDGGNPPSPLIYPD- 426
Query: 1591 LLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGL-RSMGLSWLAS 1649
H + +LS +SD L + + P ++I+ TE L R+ W
Sbjct: 427 ----HGPVAYLSTAYDVSD--LTELSVVQQTPDTFSKLITEYASTEPELFRAQPTPW--- 477
Query: 1650 WLNELYTISFHSS-GQSSLQSGVETDLIDNLQRIPFIPLSDGTF-SSVDE 1697
HS + L+SG + + + IP+ DG + SS DE
Sbjct: 478 ----------HSRLADAILRSGTQ-----RFRSVAIIPMRDGEWLSSKDE 512
>gi|224122118|ref|XP_002318757.1| predicted protein [Populus trichocarpa]
gi|222859430|gb|EEE96977.1| predicted protein [Populus trichocarpa]
Length = 1713
Score = 160 bits (404), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 130/416 (31%), Positives = 200/416 (48%), Gaps = 66/416 (15%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IR F +G E N L L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEHIRETTFSIG-----REKNPLAPM---LDQAVKYLSAELYAKDVHFLMELIQNAEDNE 59
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
Y E V+P+L F++ I ++ NNE+GFSA+NI ++C VGNSTKKG+ GY
Sbjct: 60 YLEGVDPSLEFVITSRDITNTGAPATLLMFNNEKGFSAKNIDSICSVGNSTKKGNRKRGY 119
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF +T P I SNG+ ++F+ + +G+++P V P D+
Sbjct: 120 IGEKGIGFKSVFLITAQPYIFSNGYQIRFNENPCPHCNLGYIVPEWVHESPSLSDI---- 175
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN- 1385
K S T + LP + + + S +HP +LLFL +++ + R
Sbjct: 176 --KQIYGSTSMLPTTTLILPLKPD-----KVTAVKQQLSSVHPEVLLFLSKIKRLSVRED 228
Query: 1386 -------------MLNDSLVVIRKKIVGDG-IIKVSCGEDK------MTWFVASQKLRAG 1425
+ ++ V RK I + + +S E+ ++++ QK
Sbjct: 229 NEDPSLNTVSAIAITKETNFVTRKNIDAESYTLHLSAEENDDEFAKGCSYYLWKQKFPVR 288
Query: 1426 V-----IRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG-LKFILQGDFV 1479
R +V+ I LA + E + Y ++AFLP FI+Q DF+
Sbjct: 289 QENRVDRRMEVEDWVITLAFP---NGERLLRGMKYSPGIYAFLPTEMVSNFPFIIQADFI 345
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
L SSRE + ++ WNQ +L P FV+A S + + P + F+P+
Sbjct: 346 LASSRETIQWDNIWNQGILDCVPFAFVNALVSL--IKTVDDAPVSSLPPMFKFLPV 399
>gi|302913717|ref|XP_003050987.1| hypothetical protein NECHADRAFT_80792 [Nectria haematococca mpVI
77-13-4]
gi|256731925|gb|EEU45274.1| hypothetical protein NECHADRAFT_80792 [Nectria haematococca mpVI
77-13-4]
Length = 2071
Score = 160 bits (404), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 127/399 (31%), Positives = 190/399 (47%), Gaps = 48/399 (12%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
+ A+ LS +LY + SHF+LE +QNADD Y + EPTL L+++ + NE GF+
Sbjct: 40 MDNAIGILSDDLYQKPSHFILEFLQNADDGHY-DTPEPTLAITLEDNTVRFDANEVGFTR 98
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
N+ ALC +G+S+K S G KGIGFKSVF+V D I S + KFD E ++G V
Sbjct: 99 TNVDALCSLGSSSKVQSDQA-TGHKGIGFKSVFKVADKVWIKSGHYSFKFDARE-RLGTV 156
Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPS 1370
P + +K P + + T IRL K + ++ + D L
Sbjct: 157 DP-------------IWTKFPD--DEREGYTSIRLRLLPKLN----LSALADELFKLDAQ 197
Query: 1371 LLLFLHRLQCIMFRNMLNDSLVVIRKKI--------VGDGIIKVSCGEDKMTWFVASQ-- 1420
LLLFL ++QCI + +++ + DG+ + + D + F+ S
Sbjct: 198 LLLFLRKIQCIEVHVIPEKGQRARPRRVLKRSTDSRIPDGLQRFTLMSDYSSTFLVSSHR 257
Query: 1421 --KLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDF 1478
KL R K ++I LA P Q V++FLP+R YG KFI+Q DF
Sbjct: 258 TIKLGHDNRRKGRKHSDILLAFPEMSFRH----PPTATQNVYSFLPIRNYGFKFIIQADF 313
Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE 1538
+L +SRE++D +S WN LL E P ++A + F N + ++ ++
Sbjct: 314 LLTASREDIDNSSDWNIKLLDELPEAMLTAIKRF--------NSRRLYDAWLPYLIFDHV 365
Query: 1539 VHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLR 1577
GFF L + ILS L S IL+ PP ++ R
Sbjct: 366 QPGFFENLGQDILSFL--SKQPILKSITGDLKPPSELTR 402
>gi|66357810|ref|XP_626083.1| superfamily I helicase [Cryptosporidium parvum Iowa II]
gi|46227162|gb|EAK88112.1| superfamily I helicase [Cryptosporidium parvum Iowa II]
Length = 2023
Score = 159 bits (403), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 143/575 (24%), Positives = 266/575 (46%), Gaps = 93/575 (16%)
Query: 1125 CGAICDTICGEASDD-GLGDCTTQTLPEDK------ECE-----------DAALIVESIR 1166
C ++ TI E D +C + ED+ CE D + SI+
Sbjct: 589 CNSLNLTISLEVEDSLEENECLVKMSTEDEFIFEELRCEYCSNYKSINIFDCKSFILSIQ 648
Query: 1167 RDEFGLGPNISNMESNMLKKQHARL-GRALHCLSQELYSQDSHFLLELVQNADDNIYPEN 1225
+ +FG+ + + N + + + + ++ LS +LY++ +HF+ EL+QNADDN Y
Sbjct: 649 KQKFGISEKALDKDYNQIIQNYTSIINQSCKNLSIKLYTKINHFIFELIQNADDNQYCSC 708
Query: 1226 VE--PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVF 1283
+ P++ F+ ++G++V+NNE GF+ +I ++CD+GNS+K + IG GIGFKSVF
Sbjct: 709 IGKIPSIIFVFHKNGVLVINNEIGFTEGDISSICDIGNSSKVFNEKK-IGCFGIGFKSVF 767
Query: 1284 RVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCI 1343
+T+ P I SNG+ KF+ + ++ P V ++ + ++E + T
Sbjct: 768 LITNTPFIFSNGYCFKFNLNSKHGSYIFPEWVDEELYNLIPNHKFHNEYEIEVSTYKTKF 827
Query: 1344 RLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRK--KIVGD 1401
LP++ E I ++ L+ +++LF ++L+ I + + ND + VI + K++
Sbjct: 828 WLPYK----EDIIFEDL-----KLNDNIILFTNKLKRI--KLITNDRVTVITRSEKLISK 876
Query: 1402 GIIKVSCGEDKM------------------TWFVASQKLR----AGVIRPDVKTTEIALA 1439
+I V+ + M ++ + + + + K +A+
Sbjct: 877 DLILVNIFKHIMCDPTNKKRKLSSNRNITKSFLIVNYNFEIPKNISNLVKNKKLNNLAIG 936
Query: 1440 LTLQESNEGNYGPLLY-QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
+ + ++ G Y + VF+FLP+R+YGLKFILQ DF L SSRE + +S WN ++
Sbjct: 937 IEINNESDTFDGECNYDNKEVFSFLPIRSYGLKFILQADFELTSSRESISIDSNWNIYIR 996
Query: 1499 SEFP---ALFVSAERSFCDLPCFREN-----PAKAASVYMSFVPLVGEVHGFFSGLPRMI 1550
P +S R +++ P KA ++ F+P++ +++ + +
Sbjct: 997 EIIPNAIIYLISKLRETNGFHLLKKSFLGILPTKADNIDEFFIPIIPKIN-------KAL 1049
Query: 1551 LSKLRMSNC--LILEGNNNQWAPPCKVLRGWNDRAHSLLPDIL--------LQKHLGLGF 1600
+++ + C + ++ P V + R +L DI L K F
Sbjct: 1050 INEKCIYTCEKIFIQ-------PSNSVFINNDSRTFKILLDIFPNINEFSYLLKKYSNKF 1102
Query: 1601 LSKDIVLSDS---LARALGIEEYGPKILLQIISSL 1632
L + +++ L R LGI E+ IL+ II +
Sbjct: 1103 LINNAFANNTENILLRDLGITEFNIDILVDIIKGI 1137
>gi|342886436|gb|EGU86266.1| hypothetical protein FOXB_03201 [Fusarium oxysporum Fo5176]
Length = 1581
Score = 159 bits (403), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 155/523 (29%), Positives = 237/523 (45%), Gaps = 58/523 (11%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN---NEQG 1247
L +L +S +LY +HF+LEL+QNADDNIYP V P+LT L + L NE G
Sbjct: 38 LQHSLDIVSHDLYQTPTHFILELIQNADDNIYPPEVTPSLTLTLSGAESTTLRIDCNEAG 97
Query: 1248 FSAENIRALCDVGNSTKKGSS---AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
F+ E I AL +G STK SS YIG KGIGFKSVF+ D +I S + +FD E
Sbjct: 98 FTFEQIDALAGIGASTKTASSHRQRRYIGEKGIGFKSVFKAADVVKIASGYYEFRFDRRE 157
Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMF 1364
+G VLP + PF ++ ++ + L+ K + IR R + +
Sbjct: 158 S-LGMVLP-IASPFPVEHRLSNHTQFLLLLKDKQTHQRIRADLRV-IEPHLLIFLRRLRR 214
Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFV---ASQK 1421
+ + ++Q M L + + + + VG C K + V + +
Sbjct: 215 LRIQTDDAQRVFQVQYDMSNTTLGEIVTITTSEGVG-------CSSSKTDYIVVRSSCDQ 267
Query: 1422 LRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLP 1481
+ R V+TTEI +A + +G+ P + +Q FAFLP+ +G KF++ DF+L
Sbjct: 268 MPDNEKRKGVQTTEITIAFPI----DGSERPKIGKQKAFAFLPVDDFGFKFLIHADFLLV 323
Query: 1482 SSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG 1541
+SRE +D + WN+ L F+ A + F +P P ++ + H
Sbjct: 324 ASREGLDYDCQWNETLHKAIRRAFLLAVKRFASVPASEPGPG-LRYLWPKYTKHHRASHN 382
Query: 1542 FFSGLPRMILSKLR----MSNCLILEGNNNQWAPPCK---VLRGWNDRAHSLLPDILLQK 1594
F++ L + IL LR + +C + E + P + V + H+L L+K
Sbjct: 383 FWNQLHQDILHDLRYEPILESCDLSECHRR----PTELRFVPHDFRYGGHALFDSPSLRK 438
Query: 1595 HLGLGFLSKDIVLSDSLARALGIEEYGPKILLQI--ISSLCRTE-NGLRSMGLSWLASWL 1651
L F D+ IEE P + LQ I LC + +R +G+S LA+
Sbjct: 439 -THLSFQYDDV-----------IEELSP-LGLQTSSIQDLCNDFCSWIREVGVSVLATKT 485
Query: 1652 NELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSS 1694
E + SL G L D L +P IPL DG+++S
Sbjct: 486 PEWH------RKVVSLFCG-HKKLKDQLLSLPIIPLRDGSWTS 521
>gi|242085496|ref|XP_002443173.1| hypothetical protein SORBIDRAFT_08g014300 [Sorghum bicolor]
gi|241943866|gb|EES17011.1| hypothetical protein SORBIDRAFT_08g014300 [Sorghum bicolor]
Length = 1948
Score = 159 bits (401), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 157/517 (30%), Positives = 240/517 (46%), Gaps = 106/517 (20%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ------ 1215
VE IRR+ F +G E N L + + +A+ LS+ELYS+D HFL+EL+Q
Sbjct: 14 VERIRRERFYIGRG----ERNPLAED---IHQAVGYLSEELYSKDVHFLMELIQIILVIA 66
Query: 1216 ----------------------NADDNIYPENVEPTLTFILQE---------SGIVVLNN 1244
+A+DN YP +V P L F++ E S +VV NN
Sbjct: 67 VENENDCNQVANCMIVCANVKQHAEDNDYPLDVSPALEFVITEKDITATGARSTLVVFNN 126
Query: 1245 EQGFSAENIRALCDVGNSTKKGSS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-- 1301
E+GFSA NI ++C +G STKKG+ GYIG KGIGFKSVF V+ P I SNG+ +KF+
Sbjct: 127 ERGFSAANIDSICRIGRSTKKGNRHLGYIGEKGIGFKSVFLVSGQPHIFSNGYQIKFNEK 186
Query: 1302 -TSEGQIGFVLPTLV--PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMN 1358
+++ IG+++P V P NID R + L + T I LP +T +
Sbjct: 187 PSADCDIGYIVPEWVDGKP-NIDDI-RTVYGPSRSLPT----TVIILPLKTD-----KIL 235
Query: 1359 NIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVI--------------RKKIVGD--- 1401
+ + HP +LLFL +++ + R M +D RK I +
Sbjct: 236 AVKKELNSTHPEILLFLSKIRQLSVREMNHDPKASKISQISISSEVDYRSRKDIDAESYT 295
Query: 1402 ---GIIKVSCGE-DKMTWFVASQKLRAGVIRPDVKTTE--------IALALTL-QESNEG 1448
+ + S GE ++ T+++ QK ++PD + + + LA Q + G
Sbjct: 296 LHLAMQENSKGETEECTYYMWKQKF---AVQPDCRIQKRMEVDQWVVTLAFPHGQRLSRG 352
Query: 1449 NYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
P V+AFLP L FI+Q DF+L SSRE + +S WN +L P+ FV+
Sbjct: 353 AKSP-----GVYAFLPTEMVTNLPFIIQADFLLASSRESILFDSKWNHGILECVPSAFVN 407
Query: 1508 AERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNN 1567
A + L + P A + F+P+ F + I +K+ + + E
Sbjct: 408 AFGAL--LKSSGDAPHFALAPIFRFLPIQDPSISLFDNVRLSIKTKVEAEDIMPCESCTT 465
Query: 1568 Q--WAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLS 1602
Q + P +V R D A + ++ ++ +GL LS
Sbjct: 466 QKVFCKPTEVSRL--DSAFWRILNLAQKQGIGLQNLS 500
>gi|167521057|ref|XP_001744867.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776481|gb|EDQ90100.1| predicted protein [Monosiga brevicollis MX1]
Length = 3466
Score = 158 bits (400), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/338 (30%), Positives = 174/338 (51%), Gaps = 50/338 (14%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
L +A ++ ++Y D+HFL+EL+QNA D Y V P L+ I++ + V NE GF
Sbjct: 1326 LHKATILVATDMYGSDTHFLMELIQNAADCTY-TGVTPCLSIIVKRDCVAVEYNEDGFGR 1384
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
E++ A+C + +STK G G+IG+KGIGFKSVF+V PE+HS G+H +F+ ++G++
Sbjct: 1385 EDVMAICSICSSTKAGGKQGFIGQKGIGFKSVFKVATQPEVHSRGYHFRFNAENPKLGYL 1444
Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI------VDMF 1364
+P + +K P E + T I LP + G + +I +
Sbjct: 1445 VPERI------------AKPPDFSEER--GTRIVLPRSAVRNSGTSFASIDLQDISQKLL 1490
Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLN---------DSLVV-----------IRKKIVGDGII 1404
+ P +LLFL +L+ + N+++ D+ V +R +
Sbjct: 1491 TAFQPDMLLFLPKLRTV---NIVDGDCAFDSKPDTRSVHVELHGSQWRELRTTLYRPAND 1547
Query: 1405 KVSCGEDKMTWFVASQKLRAGVIRPDVKTTE--IALALTLQESNEGNYGPLL--YQQPVF 1460
VS + W + + K A +R E +AL + L E+ G+ + YQ+ VF
Sbjct: 1548 TVSETRSEF-WLMHTVKREAVNLRQHAAEAELTVALPVVLDEALLGHEASDMPGYQR-VF 1605
Query: 1461 AFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
AFLP++ + L+F + D+ +P+SRE++D SPWN++L+
Sbjct: 1606 AFLPMKEFNLRFAVHADWHVPASREDIDSTSPWNKFLI 1643
>gi|392310012|ref|ZP_10272546.1| hypothetical protein PcitN1_15279 [Pseudoalteromonas citrea NCIMB
1889]
Length = 1648
Score = 156 bits (394), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 137/505 (27%), Positives = 234/505 (46%), Gaps = 65/505 (12%)
Query: 1152 DKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
D+ + A ++ +R ++ + E N+ + + AL LS++LYS+D HF+
Sbjct: 2 DQAVKTAKSQIDEFKRQKYHF--DTETGECNLPEDAKSDYDNALKILSEQLYSKDIHFIF 59
Query: 1212 ELVQNADDNIYPENVEPTLTFILQESG----------IVVLNNEQGFSAENIRALCDVGN 1261
EL+QNA+DN Y + P L+F L E +V+ NNE GF NI+A+ +G
Sbjct: 60 ELIQNAEDNHYGKIATPELSFELLEYDPTNTPGCTGCLVIKNNENGFDINNIKAISSIGK 119
Query: 1262 STKKG-SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNI 1320
STK AGYIG KGIGFKSVF V+ AP I SNGF +KF + + G L +VP +
Sbjct: 120 STKANQKDAGYIGEKGIGFKSVFVVSPAPHILSNGFQIKFLKDDPKTG--LGYIVPYWLE 177
Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTK-FSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
D + + + T + LP + K SE + + S L P L+LFL +L+
Sbjct: 178 DDLSKFYNGN---------GTTLLLPLQNKQGSESRMFDKVQHELSKLSPELVLFLRKLK 228
Query: 1380 CIMFR--NMLNDSLVVIRKKIVGDGI---IKVSCGEDKMTWFVASQKLRAGV-----IRP 1429
+ + N D + GD + + S + + + SQ + +R
Sbjct: 229 KLSIKTPNYYADYHLTKD----GDYVELNTQSSSQNTSVKYLLKSQTISVPADAVNELRE 284
Query: 1430 DVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT-YGLKFILQGDFVLPSSREEVD 1488
+V +I++A + N+ L + P++ +LP + GL F++ DF+L +SRE +
Sbjct: 285 NVTKRDISIAFPV------NFE--LEESPLYCYLPTESETGLPFLVNADFILSASRESIR 336
Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGLP 1547
+ WN W+ E A ++ + + + ++ A +VP+ G +++ +
Sbjct: 337 QDLTWNNWMRGEVANF---AAKTISKILSQKGDISRWA-----WVPVFGSNRVIYWNNVR 388
Query: 1548 RMILSKLRMSNCL-ILEGNNNQWAPPCKVLRGWNDRA--HSLLPDILLQKHLGLGFLSKD 1604
+ L++S C+ L G Q+ P KVL N HS ++ + D
Sbjct: 389 ERVEEALKVSKCIPTLSG---QYELPSKVLSLGNTYGFLHSWPLSVI--ERFPFDLFWDD 443
Query: 1605 IVLSDSLARALGIEEYGPKILLQII 1629
L ++ A+G++ +G K + +
Sbjct: 444 SSLISRVSSAIGLKSFGQKDFVSFL 468
>gi|356504434|ref|XP_003521001.1| PREDICTED: uncharacterized protein LOC100800765 [Glycine max]
Length = 1675
Score = 156 bits (394), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 154/522 (29%), Positives = 229/522 (43%), Gaps = 103/522 (19%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IRR +F +G E N L++ L +A LS ELY++D HFL+ELVQNA+DN
Sbjct: 9 IEKIRRTKFSIGG-----EPNPLRED---LHQATKNLSTELYAKDVHFLMELVQNAEDNK 60
Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
Y E V P+L FI+ I ++ NNE+GFS ENI ++C VG STKKG+ S+GY
Sbjct: 61 YAEGVSPSLEFIITSKDITATGASATLLIFNNEKGFSRENIESICSVGRSTKKGNRSSGY 120
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
IG KGIGFKSVF +T P I SNG+ ++F IG+++P V P D+ ++
Sbjct: 121 IGEKGIGFKSVFLLTAYPYIFSNGYQIRFSEKPCPHCDIGYIVPEWVEQKPTLHDI-KQI 179
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
L + T I LP + + + S++HP +LLFL +++ + R
Sbjct: 180 YGAGAGSLPT----TTIILPLKPD-----KVKPVKHQLSNIHPEVLLFLSKIRHLSVRED 230
Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
D K+ + +S + T R ++ L L+ E N
Sbjct: 231 NEDP------KLNSVNAVSISSEINFCT-------------RKNMNAESYTLHLSAAE-N 270
Query: 1447 EGNYGPLLYQQPVFAFLPLR----------------TYGLKFILQGDFVLPSSREEVDGN 1490
G+ Y F P+R FI+Q DFVL SSRE + +
Sbjct: 271 GGSEKECRYYMWKQKF-PVRFENVVDRREDVEELIMVTNFPFIIQADFVLASSRETILLD 329
Query: 1491 SPWNQWLLSEFPALFVSAERSF----------CDLPCFRENPAKAASVYMSFVPLVGEVH 1540
S WNQ +L P+ F+ A ++ C P FR F+P+
Sbjct: 330 SKWNQGILECVPSAFMDAFKTLVLGSDQAPISCLPPVFR------------FLPIDSSPF 377
Query: 1541 GFFSGLPRMILSKLRMSNCLILE--GNNNQWAPPCKVLR----GWNDRAHSLLPDILLQK 1594
+ + I K+ + + +E G + PC+V R W+ A + + L
Sbjct: 378 EKLNYVREKIKEKMLGESIIPIETYGEQKHFYKPCEVSRLLPEFWDILAEAQAERVYLHN 437
Query: 1595 ---HLGLGFLSK--DIVLSDSLARALGIEEYGPKILLQIISS 1631
H G LS D + D + LGI++ + I S
Sbjct: 438 LSSHNGRKILSSSFDKIEYDDILSFLGIQQVNTDWYAKCIQS 479
>gi|436670129|ref|YP_007317868.1| hypothetical protein Cylst_6577 [Cylindrospermum stagnale PCC 7417]
gi|428262401|gb|AFZ28350.1| hypothetical protein Cylst_6577 [Cylindrospermum stagnale PCC 7417]
Length = 268
Score = 155 bits (392), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 129/231 (55%), Gaps = 23/231 (9%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
++E IR++ +G+G ++ L AL LS +LYS+++HF+LEL+QNADDN
Sbjct: 7 LIEDIRKNTYGIGLQTDQAAQGVIDNFRRSLNSALERLSVDLYSKETHFVLELIQNADDN 66
Query: 1221 IYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFK 1280
Y V PTL ++ I+V NNE GFS +N+RA+C+VG STK GYIG KGIGFK
Sbjct: 67 QYQSGVIPTLNLTIEPQKIIVQNNELGFSEDNVRAICNVGRSTKT-KVEGYIGEKGIGFK 125
Query: 1281 SVFRVTDAPEIHSNG--FHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
SVFR++D P+I SNG FH K +E Q+GFV P + P ++S
Sbjct: 126 SVFRISDEPQIFSNGFQFHFKRQDNENQLGFVTPYWIEAI------------PNYVDSTL 173
Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND 1389
N I LP R + ++ ++ +L+ FL +L+ + N +N+
Sbjct: 174 TN--IILPLRESAKDELS------KLGEIEHTLIFFLRQLKTVTIENRVNN 216
>gi|350630573|gb|EHA18945.1| hypothetical protein ASPNIDRAFT_187731 [Aspergillus niger ATCC 1015]
Length = 1659
Score = 154 bits (390), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 168/621 (27%), Positives = 265/621 (42%), Gaps = 112/621 (18%)
Query: 1156 EDAALIVESIRRDEFGLGPNI---------SNMESNMLKKQHARLGRALHCLSQELYSQD 1206
E A +VE I + LG ++ +E ML+K +G ++ LS+ LY+
Sbjct: 9 ERARKLVEGISKGHGYLGEDVLSQMSDEVRRQVEEAMLRKDEM-IGSSVVTLSKNLYNSS 67
Query: 1207 SHFLLELVQNADDNIYPEN----VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
+ F+ ELVQNADDN Y P+L F + IVV NE GF+ EN+ A+C+VG S
Sbjct: 68 ARFVFELVQNADDNHYTTARGGATVPSLAFHVYPRRIVVDCNEDGFTHENLVAICNVGKS 127
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
+K G+ GYIG KGIGFKSVF V I S F F G G + M
Sbjct: 128 SKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSGDFSFSFRHKPGGSG-----------MGM 175
Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF---LHRLQ 1379
+ QL+ + L + + F +L +LLLF L RL+
Sbjct: 176 ISPIWEDVEEQLQRPMTRMTLYL-----HDDAAPRETAMQQFHELQDTLLLFTKNLQRLE 230
Query: 1380 CIMFRN-------------MLNDSLVVIRKKIVG-DGIIKVSCGEDKMTWFVA------- 1418
M+ + N++L V+ K ++ DG + +T +VA
Sbjct: 231 VSMYDDDEEKLSSTVFAISSQNENLKVLTKTLLARDGTSEELTRNYHLTKYVAHDLPKSE 290
Query: 1419 -------SQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
Q++RA +I LA + + + P++ Q VFAFLP+R G
Sbjct: 291 NRNYTDKEQEIRA------YSRADIILAFPVTKDS----VPVIEPQDVFAFLPIRNMGFT 340
Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
F++ DFV +SR+++ +S N+ L +E + F +A C P R +M
Sbjct: 341 FLIHSDFVTDASRQDIVRSSARNRKLRTEIASAFNAAVLEMCKHPSLR-------YTWMR 393
Query: 1532 FVPLVGEVHG--FFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWND-----RAH 1584
++P H F+ GL + K R+++ IL+ + P + +R D
Sbjct: 394 YLPQRDGHHWDPFWMGLLDEL--KSRLTSSPILQSRTSGLWFPIERMRCLQDFLLDKSGE 451
Query: 1585 SLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGL----- 1639
L PD+ +K+L +L +DI AR + EYG + + + S+ R L
Sbjct: 452 PLFPDLPREKYLSAHYLMEDI------AR---LHEYG-LLFMSMRESIKRARMDLELSHS 501
Query: 1640 RSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGT 1699
+ WL+ + S + G+ T I+ ++ +P IPL G + S+ +G
Sbjct: 502 KMKNTDTSEDWLSRAAMVL-----SKSFRQGL-TRRIEEIKSMPLIPLITGQWVSIQKGA 555
Query: 1700 IWLHSDCSV---FDGGFGLEA 1717
++ S + D G GL A
Sbjct: 556 VYFSSINGIEIPSDLGIGLVA 576
>gi|224136262|ref|XP_002322285.1| predicted protein [Populus trichocarpa]
gi|222869281|gb|EEF06412.1| predicted protein [Populus trichocarpa]
Length = 1692
Score = 154 bits (389), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 129/403 (32%), Positives = 197/403 (48%), Gaps = 60/403 (14%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E IR+ F +G E N L L +A+ LS ELY++D HFL+EL+QNA+DN
Sbjct: 8 IEHIRKTTFSIGG-----EKNPLAPM---LDQAVKYLSAELYTKDVHFLMELIQNAEDND 59
Query: 1222 YPENVE------PTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKG 1266
Y E V+ P+L F++ I ++ N+E+GFSA+NI ++C VGNSTKKG
Sbjct: 60 YLEGVDYLEGVDPSLEFVITSRDITATGAPATLLIFNSEKGFSAKNIESICSVGNSTKKG 119
Query: 1267 SSA-GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE---GQIGFVLPTLVP--PFNI 1320
+ GYIG KGIGFKSVF +T P I SNG+ ++F+ +G+V+P V P
Sbjct: 120 NRKRGYIGEKGIGFKSVFLITPQPFIFSNGYQIRFNEKPCPLCNLGYVVPEWVEENPSLS 179
Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
D+ K S T I LP + + + S +HP +LLFL +++
Sbjct: 180 DI------KQIYGSNSTLPTTTIVLPLKPN-----KVKPVKQQLSSIHPEVLLFLSKIKS 228
Query: 1381 IMFRNMLND------SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
+ R D S + I K+ + + S + T +++++ D +
Sbjct: 229 LSVREQNEDPRLNTVSAIAITKRT--NFRARESMDAESYTLHLSAEENNT-----DEQDR 281
Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPW 1493
E + ++ Q+ G L ++AFLP L FI+Q DF+L SSRE + + W
Sbjct: 282 ECSYSVWKQKFPRLRRGRSL--PGIYAFLPTEMVTNLPFIIQADFILASSRETILLDDNW 339
Query: 1494 NQWLLSEFPALFVSAERSFCDLPCFREN-PAKAASVYMSFVPL 1535
NQ +L P F++A S + RE+ P F+P+
Sbjct: 340 NQGILDCVPLAFINALVSLVKM---REDAPVSRLPRLFQFLPI 379
>gi|291243339|ref|XP_002741560.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 1441
Score = 154 bits (389), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 129/518 (24%), Positives = 239/518 (46%), Gaps = 46/518 (8%)
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
GDF +PSSRE+VD +S WNQWL E P LF+ A ++F + P F E +A ++ +VPL
Sbjct: 15 GDFDVPSSREDVDRDSAWNQWLRGELPTLFIDALQTFKNRPEFSE--LEAICKFLQYVPL 72
Query: 1536 VGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ----WAPPCKVLRGWNDRAHSLLPDIL 1591
E+ GFF + + I+ +L+ S CL + + ++ W P ++ + ++ L
Sbjct: 73 EDEILGFFKPVAKQIIQQLKASPCLPADVSTDEDLYAWKLPSELAVTRDPLLREVITSDL 132
Query: 1592 LQKHLGLGFLSKDIVLSDS--LARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLAS 1649
L HL ++ I+ S S L LG++ L+ I ++ + G ++ + ++A
Sbjct: 133 LHHHLNRYYVHPKILSSISMPLINMLGVQTLSINHLIDIGKAISASIEG-NNITIGFVAK 191
Query: 1650 WLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVF 1709
WL L+ + S + E D+I + I IPLS+G F S+ E +++
Sbjct: 192 WLACLHRL------MESEFNIPEDDIIKQVTNIRIIPLSNGGFVSLSEFSVFFPV----- 240
Query: 1710 DGGFGLE----AFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSA 1765
G G + F + L+ + L + + L + ++ +L +GV ++
Sbjct: 241 -SGDGRQQSKSQFITKQSHLKYLESDLNTVPMELLTCLDEVGNSQVHSVLKILGVCEMKP 299
Query: 1766 HDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFV 1825
+++ HI+P + + D+ L+ YL ++ E C ++ EL+ A +
Sbjct: 300 REVIHNHIIPVLRSDKWQKKDRELLMQYLIYIKEEHEKTPSVCDLD------ELKTIARI 353
Query: 1826 LTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKW 1885
+TNHG K P IP+HF ++ N ++ + D W +D YL+ + + + W
Sbjct: 354 ITNHGLKNPHSIPVHFTTDYANKLNFQKDLPGYD--WVLLDSCYLQ---RNNSQQNVSSW 408
Query: 1886 RKFFKEIGITDFV-----QVVQVDKDVADISHTGFKNMWTKELLSPGSAAI-DWESNELV 1939
+ FF ++G+ + + +VV + ++A W + G I D+ +E
Sbjct: 409 KSFFIKLGVLNSLAVKKEKVVLQESEIAHSPWAAVYKFWPRS--QDGCFYIEDYICDEFH 466
Query: 1940 HLLSL--LTTNVNRQCSKHLLEILDTLWDDCYTDKIMG 1975
L++ +TT S+ LL +L+ W Y+ + G
Sbjct: 467 SLVTAENITTEQRLHQSRILLRLLEEEWQSTYSKYLNG 504
>gi|317037598|ref|XP_001398736.2| hypothetical protein ANI_1_1220164 [Aspergillus niger CBS 513.88]
Length = 1671
Score = 154 bits (389), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 207/828 (25%), Positives = 338/828 (40%), Gaps = 145/828 (17%)
Query: 1156 EDAALIVESIRRDEFGLGPNI---------SNMESNMLKKQHARLGRALHCLSQELYSQD 1206
E A +VE I + LG ++ +E ML+K +G ++ LS+ LY+
Sbjct: 9 ERARKLVEGISKRHGYLGEDVLSQMSDEVRRQVEEAMLRKDEM-IGSSVVTLSKNLYNSS 67
Query: 1207 SHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
+ F+ ELVQNADDN Y E P+L F + IVV NE GF+ ENI A+C+VG S
Sbjct: 68 ARFVFELVQNADDNHYTTARERATVPSLAFHVYPRRIVVDCNEDGFTHENIVAICNVGKS 127
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
+K G+ GYIG KGIGFKSVF V I S F F G G + M
Sbjct: 128 SKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSGDFSFSFRHKPGGSG-----------MGM 175
Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF---LHRLQ 1379
+ QL+ + L T E AM F +L +LLLF L RL
Sbjct: 176 ISPIWEDVEEQLQRPMTRMTLYLHEDTAPRE-TAMQQ----FRELQDTLLLFTKNLQRLG 230
Query: 1380 CIMFRN-------------MLNDSLVVIRKKIVG-DGIIKVSCGEDKMTWFVA------- 1418
M+ + N++L V+ K ++ DG + +T +VA
Sbjct: 231 VFMYDDDEKQLSSTVFAISSQNENLKVLTKTLLAPDGTSEELTRNYHLTKYVAHDLPGSE 290
Query: 1419 -------SQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
++ RA +I LA + + N P++ Q VFAFLP+R G
Sbjct: 291 NRTYTDKERETRA------YSRADIILAFPVTK----NSVPVIEPQDVFAFLPIRNMGFT 340
Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
F++ DFV +SR+++ +S N+ L +E F +A P R +M
Sbjct: 341 FLIHSDFVTDASRQDIVRSSARNRKLRTEISIAFGTAVLEMRKHPTLRYR-------WMR 393
Query: 1532 FVPLVGEVHG--FFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGW----NDRAHS 1585
++P H F+ GL I S+L +S ++ + W P ++ R W +
Sbjct: 394 YLPKRDGHHWDPFWVGLLGEIKSRL-ISLPILQSRTSGSWFPIEQMERLWEHWFDKSGDP 452
Query: 1586 LLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLS 1645
L PD+ +++L + +L KDI + EYG + + + N M L
Sbjct: 453 LFPDLPREQYLSVHYLEKDIT---------RLMEYG--LTFMAMPGMIARAN----MDLE 497
Query: 1646 WLASWL-NELYTISFHSSGQSSLQSGVE---TDLIDNLQRIPFIPLSDGTFSSVDEGTIW 1701
+ S + N+ + +HS ++L + I+ ++ +P IPL G + S+ G ++
Sbjct: 498 FPHSKMKNKETSEDWHSRAATALSHSFRLGYDNCIEEVKAMPLIPLKTGQWVSIKSGAVY 557
Query: 1702 LHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQ 1761
F G++ +L L T G I+ ++ IGVQ
Sbjct: 558 -------FPSIHGIDIPSDLGVGLVTS---------------GAINNPQRKQLFKDIGVQ 595
Query: 1762 QLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRK 1821
+ S + K+ I D +A G + ++L C V
Sbjct: 596 EASVDFVRKL-----IKDNYSAYGIFFIPVSREHLTFLYLT-ECLASSSSTALFVHLFNG 649
Query: 1822 KAFVLTNHGF--KRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPA-KESL 1878
K ++H + ++P ++F E +P L+ I +D++++ ++
Sbjct: 650 KGLAFSDHKYQIRQPKIHTVYFPNE--DPYGAKALLEPIQGGAPGLDVSFINEDYLRDPP 707
Query: 1879 SCGLVKWRKFFKEIGITDFVQVVQVDKDVA--DISHTGFKNMWTKELL 1924
SC + RK+ + + QV V ++ D+ H+ F+ +E L
Sbjct: 708 SCPPKEQRKWIEWLQ-----QVFHVRGTLSLIDLRHSPFRARLGQECL 750
>gi|413934111|gb|AFW68662.1| hypothetical protein ZEAMMB73_452932, partial [Zea mays]
Length = 1387
Score = 153 bits (387), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 150/489 (30%), Positives = 231/489 (47%), Gaps = 87/489 (17%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ F +G E N L + + +A+ LS+ELYS+D HFL+EL+QNA+DN
Sbjct: 12 VERIRRERFYIGRE----ERNPLAED---IHQAVSYLSEELYSKDVHFLMELIQNAEDNE 64
Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP +V P L F++ E S +VV NNE+GFSA NI ++C +G STKKG+ GY
Sbjct: 65 YPLDVSPALEFVITEKDITGTGADSTLVVFNNERGFSAANIESICRIGRSTKKGNRHLGY 124
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
IG K V+ P I SNG+ +KF ++ IG+++P V P NID R
Sbjct: 125 IGEK---------VSSQPHIFSNGYQIKFSEKPSAVCDIGYIVPEWVDGKP-NIDDI-RT 173
Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
+ L + T I LP +T + + S HP +LLFL +++ + + M
Sbjct: 174 VYGPSRSLPT----TVIILPLKTD-----KILAVKKELSSTHPEILLFLSKIRQLSVKEM 224
Query: 1387 LNDSLVVI--------------RKKIVGD------GIIKVSCGE-DKMTWFVASQKLRAG 1425
ND K I + + + S GE ++ T+++ QK
Sbjct: 225 NNDPKASKISQISISSEVDYWSSKDIDAESYTLHLAMQENSKGETEECTYYMWRQKF--- 281
Query: 1426 VIRPDVKTTE--------IALALTL-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQ 1475
++PD + + + LA Q + G P V+AFLP L FI+Q
Sbjct: 282 AVQPDCRIQKRMEVDQWVVTLAFPHGQRLSRGAKSP-----GVYAFLPTEMVTNLPFIIQ 336
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
DF+L SSRE + +S WN+ +L P+ F++A + L R+ P A F+P+
Sbjct: 337 ADFLLASSRESILFDSQWNRGILECVPSAFINAFGAL--LKSSRDAPHFALPPIFRFLPI 394
Query: 1536 VGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLRGWNDRAHSLLPDILLQ 1593
F + I +K++ + + E Q + P +V R D A + ++ +
Sbjct: 395 QESSISLFDSIRLSIKTKVQAEDIMPCEPCTAQKVFCKPAEVSR--LDSAFCRILNLAQK 452
Query: 1594 KHLGLGFLS 1602
+ +GL LS
Sbjct: 453 QGIGLQNLS 461
>gi|302775138|ref|XP_002970986.1| hypothetical protein SELMODRAFT_411714 [Selaginella moellendorffii]
gi|300160968|gb|EFJ27584.1| hypothetical protein SELMODRAFT_411714 [Selaginella moellendorffii]
Length = 1435
Score = 152 bits (385), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 139/444 (31%), Positives = 221/444 (49%), Gaps = 54/444 (12%)
Query: 1154 ECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLEL 1213
E + +E +RR+ FG+G + N + ML++ + LS ELY +D HFL EL
Sbjct: 163 EMSSQRMFIERLRRELFGIGQSGPNPLALMLQE-------TIRLLSAELYQKDIHFLSEL 215
Query: 1214 VQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTK 1264
+QNA+DN Y +VEP++ L ++ + ++LNNE GFS +NI +LC G STK
Sbjct: 216 LQNAEDNTYHSSVEPSIEIYLTKTDVTGTGAAATLLLLNNEVGFSEKNIESLCAAGQSTK 275
Query: 1265 KGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVP-PFN 1319
KG S GYIG KGIGFKSVF VT P I +NGF ++F+ +E ++G+++P V P +
Sbjct: 276 KGKKSQGYIGEKGIGFKSVFLVTKQPYIINNGFRIRFNEEPHNEAKLGYIVPEWVDCPTD 335
Query: 1320 IDMFCRLLSKDPVQLESKCWNTCIRLPFRT-KFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
+L K+ + S+ ++P T F I M +I D S P+ L+ +
Sbjct: 336 A-----VLQKEKI---SEVAKQLTQIPAETLLFLSKIRMLSIYDQQSR-PPTELVLSRSV 386
Query: 1379 QCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLR----AGV-IRPDVKT 1433
+ +N ++ G G +++ Q L+ A V R DV +
Sbjct: 387 EMTTSKNGSVETSTTFLTASNGRN------GTQHSNYYMFKQTLKVPEEAKVEKRKDVDS 440
Query: 1434 TEIALALTLQE--SNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPSSREEVDGN 1490
I LA + + S+ + G +F+FLP + GL F++ DF+L SSR + +
Sbjct: 441 WMITLAFPVDDRISSSCSIG------DIFSFLPSDIHSGLPFLINSDFLLVSSRATLLFD 494
Query: 1491 SPWNQWLLSEFPALFVSAERSFCDLPCFR-ENPAKAASVYM-SFVPLVGEVHGFFSGLPR 1548
S WNQ + P +F +A + F ++ A + M +VP+ + + +
Sbjct: 495 SSWNQGIFRCVPEVFYNAFELLLNTTAFGLQSILSVARMAMYKYVPVNVCRNSYLERVRN 554
Query: 1549 MILSKLRMSNCLILEGNNNQWAPP 1572
ILSKL+ +N +IL ++ APP
Sbjct: 555 DILSKLKDNN-VILCASDVGRAPP 577
>gi|242071023|ref|XP_002450788.1| hypothetical protein SORBIDRAFT_05g018310 [Sorghum bicolor]
gi|241936631|gb|EES09776.1| hypothetical protein SORBIDRAFT_05g018310 [Sorghum bicolor]
Length = 1755
Score = 152 bits (384), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 134/437 (30%), Positives = 202/437 (46%), Gaps = 92/437 (21%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN----- 1216
VE IR + + +G E N L + + +A++ LSQELYS+D HFL+EL+QN
Sbjct: 17 VERIRWERYFIGRE----ERNPLAED---MHQAVNYLSQELYSKDVHFLMELIQNDVLNY 69
Query: 1217 ------------ADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRA 1255
A+DN YP V P L F++ I +V NNE GF+ NI +
Sbjct: 70 ANLNLFITIDMNAEDNDYPSGVLPALEFVITSKDITRTGATATLLVFNNENGFTPANIES 129
Query: 1256 LCDVGNSTKKGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVL 1311
+C +G STKK + +GYIG KGIGFKSVF V+ P I SNG+ +KF +++ IG+++
Sbjct: 130 ICRIGRSTKKSNRGSGYIGEKGIGFKSVFLVSKNPHIFSNGYQIKFSEDPSADCGIGYIV 189
Query: 1312 PTLVP--PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
P V P D+ + + T I LP ++ ++ + S HP
Sbjct: 190 PEWVEDCPSIADIM------NIYGCQKSLPTTTIILPLKSD-----KIDAVKKELSGTHP 238
Query: 1370 SLLLFLHRLQCIMFRNMLNDSL---------------VVIRKKIVGDGI------IKVSC 1408
+LLFL +++ I R +ND L + RK I + ++
Sbjct: 239 EVLLFLSKIRQISVRE-INDDLNATSLSQISISSEADALTRKDISAESYTLHLSALENKI 297
Query: 1409 GEDKMTWFVASQKLRAGVIRPDVKTTE--------IALALTL-QESNEGNYGPLLYQQPV 1459
GE + ++++ Q ++P+ + + I LA Q ++G P V
Sbjct: 298 GERQCSYYIWKQHFP---VKPECRVQKREGIDQWVIMLAFPHGQRLSQGMGSP-----GV 349
Query: 1460 FAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCF 1518
+AFLP FI+Q DF+L SSRE + +S WNQ +L + FV+A L
Sbjct: 350 YAFLPTEMATNFPFIIQADFLLSSSRESILLDSQWNQGILECVSSAFVNA--FLVLLKSI 407
Query: 1519 RENPAKAASVYMSFVPL 1535
PA A F+PL
Sbjct: 408 ESAPAFALPPIFKFLPL 424
>gi|429328793|gb|AFZ80553.1| hypothetical protein BEWA_034100 [Babesia equi]
Length = 2424
Score = 152 bits (384), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 145/554 (26%), Positives = 243/554 (43%), Gaps = 102/554 (18%)
Query: 1161 IVESIRRDEFGLG---PNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNA 1217
++E IR EFG+G NI ++L KQ R+ RA+ LS++LY+ + LEL+QNA
Sbjct: 680 LIEEIRVKEFGIGLVESNIDERLKSVLNKQRDRISRAVKRLSEDLYNSRTQLQLELIQNA 739
Query: 1218 DDNIYPEN--VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
DDN Y N EP ++F L +V+LNNE GF ++IR++CD+G+S+K + G G+
Sbjct: 740 DDNCYSVNGATEPFISFFLDYESVVMLNNEDGFMEKDIRSICDIGSSSKVLKT-GKTGKF 798
Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
GIGFKSVF +T P I SNG+ F + I ++LP V L P+
Sbjct: 799 GIGFKSVFILTSNPHIFSNGYSFMFSNDPSFKDNIEYILPHWV---------ELSDSSPI 849
Query: 1333 QLESKCWNTCIRLPFRTKFSEGIA-----------------------MNNIVDMFSDLHP 1369
+ RLPF T ++ +A +F L
Sbjct: 850 DYVKYKLGSESRLPFTTYSNKILAKSRGKRLTLMYFPLKDEYITNEYYKKYYQIFQMLDL 909
Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGII-KVSCGEDKMTWFVASQKLRAGVIR 1428
+LFL + + F N ++ ++ KK++G ++ K C D + + L I
Sbjct: 910 ESILFLRNISTVHFYNNVDGVETLMNKKLMGTFLVDKSDCEIDIDNFKLVHLNLERKNIH 969
Query: 1429 P---------DVKTTEIALALTLQESN------EGNYGPLL-------------YQQPVF 1460
+V + LT +ES GN L+ + ++
Sbjct: 970 SLDIVDSVHLEVLVVNYSFKLTAEESTMLSTSKAGNNSSLISIALPVSPEIPENTRYKLY 1029
Query: 1461 AFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRE 1520
LP+ YGL F++ DFVL ++R+ + PWN+ + S+ ++V + +
Sbjct: 1030 NSLPIGEYGLNFMVNADFVLSANRQNLFTTDPWNEIIASKMCDVYVGI------IEIIKY 1083
Query: 1521 NPAKAASVYMSFV---PLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVL- 1576
N +K + +Y FV P + F I SKL ++ +L +++ P K +
Sbjct: 1084 NYSKMSILYNGFVYSIPRQDDGISLFKYSCNRIFSKL-LNIKWVLVDEVDEFVEPFKAIF 1142
Query: 1577 -------------RGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPK 1623
++ +++P +L+KH +++ I+ +L L EYG K
Sbjct: 1143 VCSEFPDDFSNTEVMFSKLLRTVVPSNILRKHCDFCYINS-ILCRPNLKSKLM--EYGIK 1199
Query: 1624 IL-----LQIISSL 1632
I+ ++I+SS+
Sbjct: 1200 IITFDLVVEIVSSI 1213
Score = 42.4 bits (98), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 7/92 (7%)
Query: 842 AIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVK---ELLCLVTRDGKVIRI 898
A E++ AP L +L ++ W++++ LGPLP +LLNE +K L D V++I
Sbjct: 416 ATELIENAPLLLNLYDYTMWNYVYEDCLGPLPNFLLNE-KIKFFGSYSFLYIHDRSVVKI 474
Query: 899 ---DHSASVDSFLEAALEGSSFQTAVKLLSSF 927
D S ++ +G A+K+LS+F
Sbjct: 475 KSPDCYISDLVIVQEITDGFDNFDAIKVLSAF 506
>gi|302881512|ref|XP_003039668.1| hypothetical protein NECHADRAFT_88029 [Nectria haematococca mpVI
77-13-4]
gi|256720532|gb|EEU33955.1| hypothetical protein NECHADRAFT_88029 [Nectria haematococca mpVI
77-13-4]
Length = 1482
Score = 151 bits (381), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 189/404 (46%), Gaps = 58/404 (14%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+ IRR E GLG + +N A L AL +S +LY +HFLLEL+QNADDN
Sbjct: 16 IRQIRR-EKGLGDGPDQIGNN-----GADLESALKIISHDLYQTSTHFLLELIQNADDNS 69
Query: 1222 YPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA--GYIGRKGIG 1278
Y VE PT++ + + NE+GFS +NI A+C +G STK G S G+IG KGIG
Sbjct: 70 Y--TVETPTMSISYSPGKLRIDCNERGFSKQNIEAICRIGKSTKGGESKSEGFIGEKGIG 127
Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
FK+VF+V I S + +FD +G +G + P + PF P Q + +C
Sbjct: 128 FKAVFKVASTVWISSGYYSFRFD-RDGHLGMIAP-ITCPF------------PEQPKQEC 173
Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFS--------------DLHPSLLLFLHRLQCIMFR 1384
T I L EG+ + + ++HP +RL F
Sbjct: 174 --TSILLKLDKDCDEGLIVEEMKSCDEKILLFLRRLRKLEINIHPGA---ENRLFKRTFT 228
Query: 1385 NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQE 1444
+ ++ +G+ K W ++ L + RP + T+EI LA
Sbjct: 229 RGAQPAENPSMMTLMNNGVEK-----HYFVWRHTAKNLPSEARRPGITTSEIVLAFPNTG 283
Query: 1445 SN-EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
N + P++ Q V++FLP+R YG F+LQ DF+LP++RE+V +S WN L +E
Sbjct: 284 VNKDAPVLPVIEPQNVYSFLPIRNYGFSFLLQADFLLPANREDVHVDSNWNIHLRTE--- 340
Query: 1504 LFVSAERSFCDLPCFRENPAKA-ASVYMSFVPLVGEVHGFFSGL 1546
A ++F + C +N + ++ + P+ FF L
Sbjct: 341 ----ARKAFMEAMCHIDNLDNSLRHSWVRYAPITPSQSEFFEEL 380
>gi|389750864|gb|EIM91937.1| hypothetical protein STEHIDRAFT_136037 [Stereum hirsutum FP-91666
SS1]
Length = 1448
Score = 150 bits (380), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 159/536 (29%), Positives = 253/536 (47%), Gaps = 88/536 (16%)
Query: 1161 IVESIRRDEFGL-GPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
++ SIR+D GL PN + +E LGRAL L+ +LY D+HF+LE VQNA+D
Sbjct: 3 VIRSIRQD-LGLDAPNPTEVER--------MLGRALEHLADQLYRTDTHFVLEFVQNAED 53
Query: 1220 NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
N Y +V PTL+ + + + + NE+GF N+RA+C++G STKK G+ IGF
Sbjct: 54 NSYHPDVTPTLSLRIDDYSMRISCNERGFDESNVRAICNIGTSTKK-HLQGF-----IGF 107
Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
KSVF+V + + S F KFD E ++G + P M+ L P +
Sbjct: 108 KSVFKVANVVRVASGDFRFKFDRDE-RLGMIAP---------MWEDDLPAQPGWTDFHLE 157
Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND----SLVVIR 1395
T +R ++ G+A++ +D ++ SLLLFL RL+ I L+D ++ +
Sbjct: 158 LTSLR---QSSMPPGMALSFKLD---EVKASLLLFLQRLRFIELDFALSDPDRQRIITLF 211
Query: 1396 KKIVGDGIIKVSCGE------DKMTWFV-------ASQKLRAGVIRPDVKTTEIALALTL 1442
+ +G GI+K++ E D V A+Q+ + RP + +EI LA +
Sbjct: 212 RYDIGPGIVKLTHCENDCPLRDDFYLMVNHFNDMSATQEPK----RPGLDRSEIVLAFPI 267
Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
E ++ P+ Q V FLPLR +G F++Q DF+ SSRE++ +S WN+ L
Sbjct: 268 AE----DFQPVFSPQDVHVFLPLRPFGFNFVIQADFLTSSSREDILADSRWNKLLRDGIV 323
Query: 1503 ALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
FV A F R+ + ++P + FFS + +L L SN I+
Sbjct: 324 TAFVQAIERFHSQDELRDT-------WFGYLPH-AVLDPFFSEVEPHLLRVL--SNLPII 373
Query: 1563 EGNNNQWAPPCKVL----RGWNDRAHSLLP-DILLQKHLGLGFLSKDI-----VLSDSLA 1612
N + P +VL R + L+P D L + H F+S D VL SL
Sbjct: 374 RNANGVYRRPREVLVIPTRFRDKIGIPLIPGDHLPKPHT---FISPDYDLSVRVLRPSLK 430
Query: 1613 RALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELY------TISFHSS 1662
R LG+++ + ++ + ++ + + W + + LY + FH S
Sbjct: 431 R-LGVQDMDDDVFVKSLENMS-AGGAISHQSIEWHETVCDLLYRFPKTRKMQFHPS 484
>gi|429847683|gb|ELA23260.1| heterokaryon incompatibility, partial [Colletotrichum gloeosporioides
Nara gc5]
Length = 1350
Score = 149 bits (376), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 146/519 (28%), Positives = 228/519 (43%), Gaps = 81/519 (15%)
Query: 1142 GDCTTQTLP------EDKECEDAALIV-----ESIRRDEF---GLGPNISNMESNMLKKQ 1187
G+ TT LP E++ DA L E +++ E G + +N + Q
Sbjct: 3 GEVTTAVLPGWKARPENRAQADAQLRKLLKPRERLKQRELLRQQTGTDANNQSDDFEIDQ 62
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQES-GIVVLN--- 1243
L +L LS++LY++D+HF+LEL QNADDN Y + P+ L ++ G
Sbjct: 63 ---LDGSLTVLSEQLYNKDTHFILELTQNADDNAYGASTIPSFGMRLDKNDGDCTFQTGC 119
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSS---AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
NE GF+ + I ALC + +STKK GYIG KGIGFK+VF+V D I S + K
Sbjct: 120 NEVGFTVDQIDALCRIASSTKKNEKDVRTGYIGEKGIGFKAVFKVADVVYIKSGFYSFKL 179
Query: 1301 DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR-TKFSEGIAM-N 1358
D G +G +LP P + ++ R R T +GI +
Sbjct: 180 DKRNGLVGTLLPL-----------------PADFPASKYHEHDRSATRMTLLLKGIEQYD 222
Query: 1359 NIVDMFSDLHPSLLLFLHRLQCI-MFRNMLNDSLVVIR----KKIVGDGIIKVSC---GE 1410
I + + P +LLFL RL+ I M++ + + +K+ G+ + C GE
Sbjct: 223 KISKELTKIKPEILLFLRRLRSISMYKGQETTVYLAMHNEHEEKLQGE-TKTIDCQKQGE 281
Query: 1411 D----------KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVF 1460
M + +++ RAG + +E+ LA L + + + P Q VF
Sbjct: 282 SFRNKKFLITRTMVNDMPNEEKRAG-----ISASEVTLAFPLNKYGQPEFHP----QHVF 332
Query: 1461 AFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLP-CFR 1519
+FL + +G F++QGDF+L +SRE + + WN L PA F+ A F + C R
Sbjct: 333 SFLAVACHGFPFLIQGDFILLASREGLSAGNAWNNKLRDAIPAAFLQAVDHFNKIEGCLR 392
Query: 1520 ENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILE-GNNNQWAPPCKVLRG 1578
+ + +S+ P F+ L + LR L G+ + A + +
Sbjct: 393 YSWLRYLEYNVSYDP-------FWKRLDDQLTPHLRRHPILRNRLGHTRKPADLIFIPKE 445
Query: 1579 WNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGI 1617
+ LL D ++HL +LS D S R LG+
Sbjct: 446 YRLEGEPLL-DFCSERHLAFEYLSHDATCLTSSLRRLGV 483
>gi|399218232|emb|CCF75119.1| unnamed protein product [Babesia microti strain RI]
Length = 1605
Score = 149 bits (376), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/401 (30%), Positives = 191/401 (47%), Gaps = 42/401 (10%)
Query: 1112 LFESVGADVHIEECGAICDTICGEASDD---GLGDCTTQTLPEDKECEDAALIVESIRRD 1168
L +SV A EC + I + + D + CT + D +DA +E IRRD
Sbjct: 444 LEKSVSASTLSIECIKPNEPITPKHNADLNTSVNSCTGDNIIID--AKDARSHIEYIRRD 501
Query: 1169 EFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEP 1228
++G+G + +++ KQ+ RL R L LS LYS +H L EL+QNADDN Y ++ P
Sbjct: 502 KYGIGAD--QCKNDYTIKQNDRLSRILDKLSTNLYSSTNHILYELIQNADDNCYSNDIIP 559
Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
L + E + + NNE GFS ++ A+CD+ NS+K S G+ GIGF+SVF +T+
Sbjct: 560 ELVIVQTEGSVTLFNNEIGFSKTDVEAICDLANSSKIYRSLC-TGKFGIGFRSVFTITNE 618
Query: 1289 PEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR 1348
P I SNG+ KF TS G IG ++P + N+D+ + +++++C+NT I LP
Sbjct: 619 PHIFSNGYQFKF-TSNG-IGKLIPQWID--NLDLLNGI--SKSTKIDTRCYNTIIYLPLI 672
Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVV-IRKKIVGDGIIKVS 1407
T I N L+L L +L+ + F + D + I K + I
Sbjct: 673 TPVQFDITYN------------LILHLRKLRRLYFIQITADGIRSEINLKATNESITDTP 720
Query: 1408 CGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT 1467
K+ + K + T + LT N G Q V++ LP+
Sbjct: 721 LFIKKIFYNDKDFKYIQYI------DTSSNIQLTFPIDNIG--------QFVYSTLPVGN 766
Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
+GL F + +F L R + + N+ ++ P +F++A
Sbjct: 767 FGLNFAINSNFNLTLDRSNL-LDDQHNRTIVERIPIIFLNA 806
>gi|70993156|ref|XP_751426.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66849060|gb|EAL89388.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 1516
Score = 149 bits (375), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 131/429 (30%), Positives = 196/429 (45%), Gaps = 62/429 (14%)
Query: 1156 EDAALIVESIRRDEFGLGPN----ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
E A IV+ I+ GL P +S K GR++ ++++LY D+ FL
Sbjct: 7 EQAQAIVDDIQATNGGLPPEEKQALSQRSLRAYKNLQTIAGRSIVHVAEDLYDTDTRFLF 66
Query: 1212 ELVQNADDNIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
EL+QNA+DN Y V EP L L E + V +NE GF+ ++RA+C + S+KK
Sbjct: 67 ELIQNAEDNCYGHAVSQAEEPFLDLTLHEDHLTVDSNEDGFTENDVRAICSIHQSSKK-Q 125
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
+ GYIG KGIGFKSVF+V I S F F+ +G G ++ PFN
Sbjct: 126 TGGYIGHKGIGFKSVFKVAYKVSIQSGPFSFYFEHHQGDSGL---GMITPFN-------- 174
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP---SLLLFLHRLQ----- 1379
++P +L NT I L F I++++ S+L ++LLFL +LQ
Sbjct: 175 -EEPQELPPSV-NTRITLFF-------ISISDFEARASELREIPDTMLLFLRKLQRLTVN 225
Query: 1380 ------CIMFRNMLNDS--LVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
I F + S L+ + K+ G+ + K E K T Q L RP
Sbjct: 226 IPSLQSQISFTRFEDKSKHLITLTKETNGEQVTKFYHLE-KTTLSNLPQHLS----RPGQ 280
Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
++ LA +QE +Y PL+ Q V++FLP+R G F++Q DF+ +SR+ V
Sbjct: 281 PEVDLILAFPVQE----DYSPLIQAQYVYSFLPMRDEGFNFLIQSDFITQASRQGVH-RC 335
Query: 1492 PWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMIL 1551
N + LF+ A FC R ++ ++P F++ L MI
Sbjct: 336 DRNYAIREGISHLFLQAVAYFCKHTSLRYE-------WLQYLPGTHIRDPFWADLREMIF 388
Query: 1552 SKLRMSNCL 1560
L+ S L
Sbjct: 389 DCLKGSKIL 397
>gi|159125661|gb|EDP50778.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 989
Score = 149 bits (375), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 131/429 (30%), Positives = 196/429 (45%), Gaps = 62/429 (14%)
Query: 1156 EDAALIVESIRRDEFGLGPN----ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
E A IV+ I+ GL P +S K GR++ ++++LY D+ FL
Sbjct: 7 EQAQAIVDDIQATNGGLPPEEKQALSQRSLRAYKNLQTIAGRSIVHVAEDLYDTDTRFLF 66
Query: 1212 ELVQNADDNIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
EL+QNA+DN Y V EP L L E + V +NE GF+ ++RA+C + S+KK
Sbjct: 67 ELIQNAEDNCYGHAVSQAEEPFLDLTLHEDHLTVDSNEDGFTENDVRAICSIHQSSKK-Q 125
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
+ GYIG KGIGFKSVF+V I S F F+ +G G ++ PFN
Sbjct: 126 TGGYIGHKGIGFKSVFKVAYKVSIQSGPFSFYFEHHQGDSGL---GMITPFN-------- 174
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP---SLLLFLHRLQ----- 1379
++P +L NT I L F I++++ S+L ++LLFL +LQ
Sbjct: 175 -EEPQELPPSV-NTRITLFF-------ISISDFEARASELREIPDTMLLFLRKLQRLTVN 225
Query: 1380 ------CIMFRNMLNDS--LVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
I F + S L+ + K+ G+ + K E K T Q L RP
Sbjct: 226 IPSLQSQISFTRFEDKSKHLITLTKETNGEQVTKFYHLE-KTTLSNLPQHLS----RPGQ 280
Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
++ LA +QE +Y PL+ Q V++FLP+R G F++Q DF+ +SR+ V
Sbjct: 281 PEVDLILAFPVQE----DYSPLIQAQYVYSFLPMRDEGFNFLIQSDFITQASRQGVH-RC 335
Query: 1492 PWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMIL 1551
N + LF+ A FC R ++ ++P F++ L MI
Sbjct: 336 DRNYAIREGISHLFLQAVAYFCKHTSLRYE-------WLQYLPGTHIRDPFWADLREMIF 388
Query: 1552 SKLRMSNCL 1560
L+ S L
Sbjct: 389 DCLKGSKIL 397
>gi|346976436|gb|EGY19888.1| hypothetical protein VDAG_01904 [Verticillium dahliae VdLs.17]
Length = 1402
Score = 148 bits (374), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 158/559 (28%), Positives = 236/559 (42%), Gaps = 104/559 (18%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG----IVVLNNEQ 1246
LG AL L+ +L S+ +HF+LEL+QNADDN YP V+PT + +L ++G + NE
Sbjct: 42 LGSALDILASQLNSKTTHFILELIQNADDNAYPSGVQPTFSLVLDKNGSNFTLRTDCNET 101
Query: 1247 GFSAENIRALCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
GFS +NI A+C +G STK G G+IG KGIGFKSVF+V +I SN + + D +
Sbjct: 102 GFSLKNIDAICSIGKSTKTGIINGQKGFIGEKGIGFKSVFKVAHQVDILSNFYEFRIDRT 161
Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDM 1363
+G + P P R L +
Sbjct: 162 -AHLGMLDPK---PHRFPDTQRRLQQ---------------------------------- 183
Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIR-----KKIVGDGIIKVSCGEDK------ 1412
+ P LLLFL +L + + + D V R + DG VS ED+
Sbjct: 184 ---IKPELLLFLRQLSVLDIK--VEDGHCVYRSIRQNRDPRYDGET-VSLNEDRTGSPFD 237
Query: 1413 ---------MTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFL 1463
+ Q + R V +TEI LAL + G L FAFL
Sbjct: 238 EPVKTTKLHLIVRYEVQDMPVDARRNSVSSTEITLALPV--------GSELETCDAFAFL 289
Query: 1464 PLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPA 1523
P+ G FI+Q DF+L ++R+ + PWNQ +L P FV A R F N A
Sbjct: 290 PIDQSGFNFIVQADFLLVANRQALQHALPWNQRILEAIPDAFVQAVRHF--------NHA 341
Query: 1524 KAASVY-MSFV-PLVGEVHGFFSGLPRMILS--KLRMSNCLILEGNNNQWAP----PCKV 1575
A Y + + PL E +G S + K R+ + +LE + ++ PC
Sbjct: 342 DATHGYSLKYKWPLFLEYNGPKSSTWTEVHQGIKRRLCDEDVLEARDGRFRAVRPVPCPA 401
Query: 1576 LRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKI-LLQIISSLCR 1634
AH L I + GFL D ++ A EY P + L+ +
Sbjct: 402 -GTQQTAAHKPLIIIPTEYRFDNGFLI-DATINHGHHLAF---EYDPVLEELKALDVKEM 456
Query: 1635 TENGLRSMGLSWLASWLN---ELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGT 1691
T GLR+ WL + + +H++ ++ + D LQ +P + L+DG+
Sbjct: 457 TGAGLRNEFSDWLERAGHAGLAAKSDKWHAALAAAFMWKGRSKQKDKLQELPLVHLADGS 516
Query: 1692 FSSVDEGTIWLHSDCSVFD 1710
+++ G I+ S +V +
Sbjct: 517 WAACAGGDIFFPSTGAVLN 535
>gi|406866769|gb|EKD19808.1| Heterokaryon incompatibility protein, putative [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 1617
Score = 147 bits (371), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 120/397 (30%), Positives = 180/397 (45%), Gaps = 82/397 (20%)
Query: 1157 DAALIVESIRRDEFGL------GPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFL 1210
DA + IR D+ GL GPNIS++E+ AL LS++LY +HFL
Sbjct: 11 DAKAHISQIRHDK-GLVDGKPSGPNISDLEN------------ALTTLSEQLYQSSTHFL 57
Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA- 1269
LE++QNADDN Y V P+L F ++ G+ V NE GFS N+ A+C VG STKKG A
Sbjct: 58 LEIIQNADDNQYSAAV-PSLEFTYRKGGLRVDCNEIGFSPRNVEAICRVGQSTKKGMKAE 116
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE--GQIGFV---LPTLVPPFNIDMFC 1324
GY+G KGIGFKSVF+ D + S + KFD + G I + P V P ++
Sbjct: 117 GYVGEKGIGFKSVFKAADVVWVQSGHYSFKFDKRKPLGMISPIWDDFPMPVRPGYTSLYL 176
Query: 1325 RL--------LSKDPVQLES-KCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFL 1375
+L L ++ L+S + S+G N + FS L
Sbjct: 177 KLSAGYNTSGLLEELKALDSRLLIFLRRLRSVSVEISQGGVFGNFKNSFSRQDRGKTL-- 234
Query: 1376 HRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTE 1435
+ +N ++ ++ R +++G + + R G ++E
Sbjct: 235 ----VRLTQNKVDCDYIIKRHRVLG----------------MPQDRRRDGTT-----SSE 269
Query: 1436 IALALTLQES-NEGNYGPLLYQQPVFAFLPLRTYGL-------------------KFILQ 1475
I L + S +E + P Q V+AFLP+R YG +F++Q
Sbjct: 270 IVLGFPIMMSEDEKSAVPKRESQQVYAFLPIRDYGFEVNTSLTCALKLAHSANVQQFLIQ 329
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
DF+L +SRE++D ++ WN + E F+ R F
Sbjct: 330 ADFLLIASREDIDSSAEWNHRIQRELTIAFIDTMREF 366
>gi|402076366|gb|EJT71789.1| hypothetical protein GGTG_11043 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 2200
Score = 147 bits (370), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 149/547 (27%), Positives = 230/547 (42%), Gaps = 96/547 (17%)
Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNEQGFSAEN 1252
AL +S ELY + FLLE+ QNADDN Y +V P L L + ++ NE GF +
Sbjct: 61 ALDVISNELYPNSTDFLLEIFQNADDNAYSSSVTPKLWITYLDDDTLMFDTNEIGFRRAD 120
Query: 1253 IRALCDVGNSTKKG-----SSAG--YIGRKGIGFKSVFRVTDAPEIHSNGFHVKF----- 1300
+ +C VG S+K+ + AG IG KGIGFK+VF+V D + S + KF
Sbjct: 121 VEGICGVGRSSKQENLAEVTRAGERRIGEKGIGFKAVFKVADEVYVKSGFYSFKFTDKAV 180
Query: 1301 -DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN 1359
E +G V P + P Q + T I L R +
Sbjct: 181 PGAPEASLGRVAPEW-------------ASFPAQARRRG-GTSILLKLRRYIDRQKLAGD 226
Query: 1360 IVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSC---GEDKMTWF 1416
I+ L LL+FL+R++ + L + VV+R++ D + C D + +
Sbjct: 227 IMK----LDAKLLMFLNRVKEVEI-EALREPRVVLRREDRTDSSTGIPCRVLKPDVSSPY 281
Query: 1417 VASQKLRAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKF 1472
+ +G+ R + +E+ LA+ + +G L V++FLP+R YG KF
Sbjct: 282 ILFHHPVSGLPAEKKRKGLAQSEMVLAIPYDGTRDG--AALTQTHNVYSFLPIRDYGFKF 339
Query: 1473 ILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSF 1532
+LQ DFVL ++R+E+D S WN+ L ++ P + L P S
Sbjct: 340 VLQADFVLIANRKEIDSGSEWNRALRAQLPTALLGFVAQLNRLGLQYRWP--------SL 391
Query: 1533 VPLVGEVHGFFSGLPRMILSKLRMSNCL-----ILEGNNNQWAPPCKVLRGWNDRAHSLL 1587
P+ E HGFF G+ +L + S + +LE + W P H L
Sbjct: 392 FPIHDEGHGFFQGINAKVLEEFSRSAIIESIGGMLESASQLWLVP-----------HEL- 439
Query: 1588 PDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWL 1647
DI D +A A I + S T +GL ++G+ L
Sbjct: 440 ---------------ADIPTGDGVAPAPMIPPDFSQFRYVSSSYSHATWDGLSNLGVRTL 484
Query: 1648 --ASWLNEL-YTISFHS-----------SGQSSLQSGVETDLIDNLQRIPFIPLSDGTFS 1693
A++L L + IS H S S++ + + R+ F+PL DGTF
Sbjct: 485 SGANFLEALGHFISHHPRQFYAMPNRWHSRLSTILDQLTAKHEGTITRLRFVPLRDGTFV 544
Query: 1694 SVDEGTI 1700
++D+G +
Sbjct: 545 ALDDGPV 551
>gi|342872578|gb|EGU74933.1| hypothetical protein FOXB_14550 [Fusarium oxysporum Fo5176]
Length = 1502
Score = 146 bits (368), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 152/549 (27%), Positives = 236/549 (42%), Gaps = 78/549 (14%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG----IVVLNNEQGFSAENI 1253
+S +LY +HF+LEL+QNADDN Y + P+L L +G NE GF+ + +
Sbjct: 70 VSDQLYKASTHFILELIQNADDNQYDSEIVPSLRLALYNNGGHRYFRSDCNEVGFTFKQL 129
Query: 1254 RALCDVGNSTKKGS---SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
AL VG STK + S IG KGIGFKSVF+V D + S + KFD + +IG +
Sbjct: 130 DALTRVGQSTKAATIDNSKSCIGEKGIGFKSVFKVADVVHVASGFYEFKFDRN-ARIGMI 188
Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPS 1370
LP + R + D V ++ F + + I + + + P
Sbjct: 189 LP---------ITSRFPTADRVADHTQ---------FLLELKSQRDYDVIKEELNSIEPD 230
Query: 1371 LLLFLHRL-QCIMFRNMLNDSLVVIRKKIV------GDGIIKVSCGEDKMT---WFVAS- 1419
LLLFL L Q + LN R+KI +K+S D +T + V
Sbjct: 231 LLLFLRNLDQVHISTRGLNKR---CRRKITRFDPRYKGETVKISVHGDAVTSKEYIVHRR 287
Query: 1420 --QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGD 1477
+KL R V ++E+ +A T+ + P+ Q VFAFLP+ +G +F++ D
Sbjct: 288 IVEKLPPEPQREGVVSSEVVIAFTV----DSEATPVFTTQNVFAFLPVDDFGFRFLIHAD 343
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
F+L +SRE +D +S WN L E FV A L R+ + ++P
Sbjct: 344 FILVASREGLDESSLWNWSLRDEIQTAFVDAIHRLVALSPIRDGEGLCYK-WPKYLPRYS 402
Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPP-CKVLRGWNDRAHSLLPDI--LLQK 1594
E GF+ GL + +++ LR + L P + G + L D+ LL
Sbjct: 403 ETSGFWYGLHQNMMNALRDTPLLKSRAGGALRKPTDLYYVPGDYRFENGTLFDLPSLLHS 462
Query: 1595 HLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNE- 1653
HL + S LS +G++ I+ L R + W+NE
Sbjct: 463 HLSFEYDSVKSELS-----LIGVDTLH-------INDLWRE-----------FSHWINEV 499
Query: 1654 ----LYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVF 1709
L T S + SL + L + L+ +P +PL DG+++ + ++ S +
Sbjct: 500 GVDGLKTQSIEWHQKVSLIFCDQKTLREKLRNLPIVPLRDGSWAKARQDRVFFTSTHTEE 559
Query: 1710 DGGFGLEAF 1718
G+E F
Sbjct: 560 HVPTGIELF 568
>gi|408393884|gb|EKJ73142.1| hypothetical protein FPSE_06755 [Fusarium pseudograminearum CS3096]
Length = 2336
Score = 146 bits (368), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 159/529 (30%), Positives = 225/529 (42%), Gaps = 73/529 (13%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
L ALH LS ELY Q +HF+ EL+QNADDN + E PTL F + V NE+GF
Sbjct: 38 LTNALHILSTELYDQSTHFIQELLQNADDNHF-ETSTPTLIFSYSPGRLRVDCNEKGFLP 96
Query: 1251 ENIRALCDVGNSTKKGSS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
+I ALC + STK A Y G KGIGFKSVFR+ D I SN + KFD + Q G
Sbjct: 97 HHIDALCTIRGSTKTDKDQASYTGEKGIGFKSVFRMADVVWISSNSYQFKFDKHK-QFGT 155
Query: 1310 VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
V P V F PV S+ F + M +V+ D
Sbjct: 156 VAPEWVEDF------------PVHSSSEY------TTFYLELCPQGDMEELVEKLVDFDA 197
Query: 1370 SLLLFLHRLQC--IMFRNMLNDSLVVIRKKIVGDG----IIKVSCGEDKMTWFVAS---Q 1420
LLFL RL+ I D +IR+ DG I + G K + V
Sbjct: 198 DQLLFLRRLRQVKINIEREYQDLSHMIRRIDETDGCGNLITTLDLGTSKKRFMVRKFQVT 257
Query: 1421 KLRAGVIRPDVKTTEIALALTLQES-NEGNYGPLLYQQPVFAFLPLRT-YGLKFILQGDF 1478
L R + +E+ LA L E +G +++ +FA LP+ GLKF+LQGDF
Sbjct: 258 NLPEEPRRRNQTCSEMVLAFPLMEELPDGGTAMPVFEAKLFAGLPVGADGGLKFLLQGDF 317
Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE 1538
+L +SR +D + WNQ L F+ + +F N ++ F+ +
Sbjct: 318 ILTASRLHLDTSLRWNQGLRDGLVNAFLESVLAF--------NAGIHRYIWPFFLSYDKK 369
Query: 1539 VHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGL 1598
V FF + I LR+ +CL E P S L + L+K
Sbjct: 370 VSSFFQSATKSICEILRLMDCL--ESMEGIMVKP------------SALTYVDLEK---F 412
Query: 1599 GFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMG----LSWLASWLNEL 1654
G SKD + + L + P + I SL G+R + L + L+E
Sbjct: 413 GDSSKDFLTLSAKTSPLYLSNKYPAWTIPGILSL-----GVRRLSATEFLEHFRTLLSEE 467
Query: 1655 YTI------SFHSSGQSSLQSGVETDLI-DNLQRIPFIPLSDGTFSSVD 1696
++ S+H L S + + I +L+ +P IPL++G + S +
Sbjct: 468 DSVFIKKSPSWHGDVAKVLLSLISDERIRKDLRNLPIIPLTEGRWVSAN 516
>gi|212534372|ref|XP_002147342.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210069741|gb|EEA23831.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 1645
Score = 145 bits (366), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 142/570 (24%), Positives = 247/570 (43%), Gaps = 64/570 (11%)
Query: 1156 EDAALIVESIRRDEFGL-GPNISNMESNMLK---KQHARLGRALHCLSQELYSQDSHFLL 1211
E A I++ IR + G+ + N N+L+ +LG + L+++LY+ +S F+
Sbjct: 7 ERARYIIDEIRVNNGGITAEDRRNTSENVLRALENLRRQLGSSTRALARDLYTSESRFVY 66
Query: 1212 ELVQNADDNIY--PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
EL+QNA+DN Y +N + + F L +VV NNE GF+ ++ ++C VG+ST K +
Sbjct: 67 ELIQNAEDNSYSPAQNGDCYIKFTLTPENLVVENNELGFNEADVLSICKVGDST-KSNRI 125
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSK 1329
G+IG KGIGFKSVF+V I S F F+ G ++ P N
Sbjct: 126 GFIGEKGIGFKSVFQVAQKVRIQSGPFDFYFEHVPSDTGM---GMITPIN---------- 172
Query: 1330 DPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF---LHRLQCIMFRNM 1386
+ E++ + R+ T D+ + +L+LF L R++ ++ R
Sbjct: 173 --GEEETEIYKEGTRMTL-TLLQPHDFSKRSTDLLG-IPDALILFLPKLQRIEVLIQRQN 228
Query: 1387 LNDSLVVIRKKIVGDG-----IIKVSCGEDKMTWFVASQKLR---AGVIRPDVKTTEIAL 1438
+ S V K+ G + +V + + V + R + R + E+ L
Sbjct: 229 GDLSTTVHEKQDDGQNSTVRLVKRVDDTVSEKFFHVERRMFRDLPSNDARKNRDRAEVVL 288
Query: 1439 ALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
A + N P++ Q F+FLP+R G KF++Q DF+ +SR++V WN LL
Sbjct: 289 AFPIA----SNSNPIVEPQLTFSFLPIRNLGFKFLIQSDFITAASRQDV-RQCDWNDALL 343
Query: 1499 SEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
+FV A FC P + ++SF+P H F++GL +I+++L
Sbjct: 344 QHVSDVFVDAVNKFCSRPVLQYG-------WLSFLPGTSVPHEFWNGLHGIIINQLAQHP 396
Query: 1559 CLILEGNNNQWAPPCKVL--RGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALG 1616
L+ Q + L R + L D+ + +L + + + LG
Sbjct: 397 ILLTRRGVRQLPSRLQRLSERHCDKDGEPLFEDLQQEVYLSSDYRPE----HHEYLQQLG 452
Query: 1617 IEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGV-ETDL 1675
+ ++ I+ + E +L+ +H+ L G+ + D+
Sbjct: 453 VTPLSWNNIIARITPYLQDET----------PRFLHPDRDDDWHTRVADLLIRGLRDRDI 502
Query: 1676 IDNLQRIPFIPLSDGTFSSVDEGTIWLHSD 1705
++ +P IPL DG+ TI+ +D
Sbjct: 503 ESKIKELPLIPLRDGSIHHNRSKTIYFPTD 532
>gi|119483844|ref|XP_001261825.1| hypothetical protein NFIA_095480 [Neosartorya fischeri NRRL 181]
gi|119409981|gb|EAW19928.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 1634
Score = 145 bits (365), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 180/722 (24%), Positives = 302/722 (41%), Gaps = 131/722 (18%)
Query: 1141 LGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISN-MESNMLKKQHARLGRALHCLS 1199
+G T + L + + L E + R + P++ +E MLKK A +G ++ L+
Sbjct: 8 MGSSTARALVKQISKQHGYLGEEVLSR----MDPDVRREVEEAMLKKD-AMIGSSVITLA 62
Query: 1200 QELYSQDSHFLLELVQNADDNIY----PENVEPTLTFILQESGIVVLNNEQGFSAENIRA 1255
+ LY+ + F+ EL+QNADDN Y + +P ++F + I+V NE GF+ EN+ A
Sbjct: 63 KNLYNSTARFVFELLQNADDNSYNVAKSTSTDPFVSFCVYNRRIIVECNEDGFTHENLVA 122
Query: 1256 LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
+C+VG S+K G+ GYIG KGIGFKSVF V I S F + G G + + V
Sbjct: 123 ICNVGKSSKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSRDLSFSFRHNVGDSGMGMISPV 181
Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMN--NIVDMFSDLHPSLLL 1373
+ ++ L++ + L +E +A + F +L + LL
Sbjct: 182 WEESEEVLPHPLTRITLFLHE------------NGSAEDLARQRETTLQQFRELKAAFLL 229
Query: 1374 FLHRLQCIMFRNMLN----------------DSLVVIRKKIVGDGIIKVSCGEDKMTWFV 1417
F+ L+ I R N + V ++++IV DG ++ T
Sbjct: 230 FMKNLRRIEVRICDNLDTEISQTTFSMRCEDEGRVALKQEIVEDGELQECTQYYHTTKLT 289
Query: 1418 ASQKLRA--------GVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG 1469
AS R+ ++ T I LA L + P++ + VFAFLP+R G
Sbjct: 290 ASNIPRSENREYTDVELLTKAYSETSIILAFPLTHDSV----PIIEKHDVFAFLPIRNMG 345
Query: 1470 LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVY 1529
L F++Q DFV +SR+++ +S N LL F+ A + FC A +
Sbjct: 346 LPFLIQADFVTDASRQDIVRSSARNIKLLPAIAQAFIQAVKQFC-------THTSLAYQW 398
Query: 1530 MSFVPLV--GEVHGFFSGLPRMILSKLRMSNCLILEGNNN-------QWAPPCKVLRGWN 1580
M ++P + G GF+ LP I +L++++ + + N + P +
Sbjct: 399 MRYLPNIEWGSQDGFWKALPGEIRRQLQLTSTMWTRSHRNMRCIGDMRLVPSSMC----D 454
Query: 1581 DRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLR 1640
LLPD+ +++L + KD+ +++YG K + L R +
Sbjct: 455 KYGKPLLPDLDTEQYLSSHYALKDLT---------SLKDYGLK-RMNRFKFLDRLRQDIN 504
Query: 1641 SMGLSWLASWLNELYTISFHSSGQSSL--QSGVET-DLIDNLQRIPFIPLSDGTF---SS 1694
+ G S + N +HS+ L SG E + +++I IPL G++ S+
Sbjct: 505 NGGSSIMR---NPDTDDDWHSAVAKYLISVSGPENPNTTAKIKKIQLIPLLGGSWKCPSA 561
Query: 1695 VDEGTIWL---HSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNL 1751
+D ++ H DG F +L A +PA
Sbjct: 562 LDRYPVYFTQAHGYAVPTDGIF------DLIAPDAENNPA-------------------R 596
Query: 1752 NRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVE 1811
++ ++GVQ+ S DI + I D+ T A L HL++ H++
Sbjct: 597 KQLFARLGVQEASVRDIRR-----KIIDQDTR------FAPGLATSRSHLQFLYLTAHLD 645
Query: 1812 RE 1813
RE
Sbjct: 646 RE 647
>gi|242068525|ref|XP_002449539.1| hypothetical protein SORBIDRAFT_05g018510 [Sorghum bicolor]
gi|241935382|gb|EES08527.1| hypothetical protein SORBIDRAFT_05g018510 [Sorghum bicolor]
Length = 1620
Score = 144 bits (363), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 196/433 (45%), Gaps = 110/433 (25%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
VE IRR+ + +G E N L + + +A++ LSQELYS+D HFL+ELVQNA+DN
Sbjct: 11 VERIRRERYYIGRG----EQNPLAED---MHQAVNYLSQELYSKDVHFLMELVQNAEDNE 63
Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
YP V P+L F++ S +++ NNE+GFS+ NI ++C VG STKKG+ GY
Sbjct: 64 YPSEVAPSLEFLITSKDITGCGASSTLLIFNNERGFSSTNIESICRVGKSTKKGNRHQGY 123
Query: 1272 IGRK------------------------------------------GIGFKSVFRVTDAP 1289
IG K GIGFKSVF ++ P
Sbjct: 124 IGEKVLNMMLINYIMGKYSVESWKPPKMDMSNIWIYYDVLSFESYYGIGFKSVFLISSQP 183
Query: 1290 EIHSNGFHVKFD---TSEGQIGFVLPTLV---PPF-NIDMFCRLLSKDPVQLESKCWNTC 1342
I SN + +KF+ ++ IG+++P V P F +I+ P T
Sbjct: 184 HIFSNSYQIKFNEKPCAQCNIGYIVPQWVESTPSFSDIEALYGCSKVLP--------TTT 235
Query: 1343 IRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR------NMLNDSLVVI-- 1394
I LP +++ ++ + S + P +LLFL +++ + R N+ S + I
Sbjct: 236 IILPLKSE-----KIDAVKRQLSSMQPEMLLFLTKIRKLSVREDKSDPNITTVSEISISS 290
Query: 1395 ------RKKIVGDGII------KVSCGEDKMTWFVASQKLRAGVIRPD------VKTTEI 1436
RK + + + E + +++ QK ++P+ + E
Sbjct: 291 EKNYQARKDMHAESYTLHLSSEETGKDEAECGYYMWRQKFP---VKPENRVDKRAEIDEW 347
Query: 1437 ALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQ 1495
+ L + G L ++AFLP FI+Q DF+L SSRE + +SPWN+
Sbjct: 348 VITLAFPHGERLSRGKQL-SPGIYAFLPTEMVTSFPFIIQADFLLASSREAILFDSPWNK 406
Query: 1496 WLLSEFPALFVSA 1508
+L P+ F+ A
Sbjct: 407 GILECIPSTFMDA 419
>gi|302924848|ref|XP_003053981.1| hypothetical protein NECHADRAFT_32128 [Nectria haematococca mpVI
77-13-4]
gi|256734922|gb|EEU48268.1| hypothetical protein NECHADRAFT_32128 [Nectria haematococca mpVI
77-13-4]
Length = 1738
Score = 144 bits (362), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 154/623 (24%), Positives = 273/623 (43%), Gaps = 76/623 (12%)
Query: 1171 GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV---- 1226
+ PN+ L ++ +G A+ L++ LY+ ++ F+ EL+QNADDN Y +
Sbjct: 39 AINPNLRREVEEALLRKDEMIGSAVLTLARNLYTSNARFVFELLQNADDNNYSTAISQGE 98
Query: 1227 EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVT 1286
+P ++F + + + NE GF+ EN+RA+C +G S+K G+ AGYIG KGIGFKSVF
Sbjct: 99 DPYVSFKIWPDKVSIECNENGFTHENLRAICAIGKSSKVGA-AGYIGEKGIGFKSVFMAA 157
Query: 1287 DAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
I SN F F +G G + T V ++ L++ + L +
Sbjct: 158 WKVHIQSNDFSFSFTHRKGDSGLGMVTPVWEETDEVLGETLTRITLFLHTS--------- 208
Query: 1347 FRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG---- 1402
+ + I F DL ++LLFL +L+ + D + G
Sbjct: 209 -DDPEEDARQLETIKLQFQDLQHTILLFLRKLRRVQVSFFDEDDSLTSATTYSLHGSNPA 267
Query: 1403 -IIKVSCGEDKMTWFVASQKLRAGVIRPDVKT-----------TEIALALTLQESNEGNY 1450
+ K S + + ++ + + + + +T E+ LA L ES
Sbjct: 268 TVRKTSSEGVEERRYHVTKHMAENIPKSENRTYSEERDRADSSAEVVLAFPLTESAT--- 324
Query: 1451 GPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAER 1510
P++ Q VFAFLP+R G KF++ DFV +SR+ + S N+ LL+ F+ A +
Sbjct: 325 -PVVESQDVFAFLPMRPMGFKFLIHTDFVTEASRQGLVVTSLRNRALLTGIADCFIKAIQ 383
Query: 1511 SFCDLPCFRENPAKAASVYMSFVPLVGEV--HGFFSGLPRMILSKLRMSNCL--ILEGNN 1566
FCD P + +M ++P F+SGL I +++ L + G
Sbjct: 384 EFCDHPTLQYQ-------WMRWLPQRDSYPWDSFWSGLLDRIEERIQEVKILRTLGTGKL 436
Query: 1567 NQWAPPCKVLRGW--NDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKI 1624
+ C+ L+ W + + L D+ + +L + ++D+ L ++ YG
Sbjct: 437 DYIHKLCR-LQPWLRDKQGDPLFMDMDEEIYLSKEYKTEDLAL---------LKPYG--- 483
Query: 1625 LLQIISS---LCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLI----D 1677
L+ +S+ + R E L+ + + N +HS +L +T + D
Sbjct: 484 -LENVSAWDMIVRVEKDLQKQPEA--SRMKNPDTDNEWHSLAARALVLLKDTATLKTSQD 540
Query: 1678 NLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASA 1737
++ + FIPL +G + S G ++ +S C G + NL +R + + +
Sbjct: 541 RIKDLKFIPLENGEWVSAGSGPLY-YSHCP---GQLAIPDDLNLSLDVRGLILKKPAVAV 596
Query: 1738 VDKSSLGVISVDNLNRMLLKIGV 1760
D+++L S+ +L + L G+
Sbjct: 597 ADEAALSA-SISHLKFLYLSEGL 618
>gi|303320881|ref|XP_003070435.1| Heterokaryon incompatibility protein, putative [Coccidioides
posadasii C735 delta SOWgp]
gi|240110131|gb|EER28290.1| Heterokaryon incompatibility protein, putative [Coccidioides
posadasii C735 delta SOWgp]
Length = 2074
Score = 143 bits (361), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 153/539 (28%), Positives = 242/539 (44%), Gaps = 81/539 (15%)
Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAG 1270
+EL+QNADDN Y +V P+LT ++ + V NE GF+ E++ ++C +G S +KG SA
Sbjct: 1 MELIQNADDNRYRRDVSPSLTLSYRKMHLRVDCNELGFAPEDVDSICRIGGS-QKGISAD 59
Query: 1271 Y---IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
IG KGIGFKSVF+V I S + KFD + ++G + P +
Sbjct: 60 RMDCIGEKGIGFKSVFKVASVVWISSGKYSFKFDNAT-RLGMIAP-------------MW 105
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ----CIMF 1383
++ P + + + +RL ++ E + + D P +L+FL++++ CI
Sbjct: 106 AQFPAEAPASGTSMLLRLLNKSVRDELLKEMRVCD------PKILIFLNQIRELKVCIAQ 159
Query: 1384 RNMLNDSLVVIRK--KIV---GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIAL 1438
N + RK K V G + + G D FV + + +G EI L
Sbjct: 160 DNGTTFERTISRKDDKDVDKPGQYVTTLRYG-DSSQQFVVIRYVPSGR---STDKAEILL 215
Query: 1439 ALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
A ++ + E P++ Q V+ FLP+R YGLKF++QG F L +SRE+++G+S NQ L
Sbjct: 216 AFPIKNNQE----PVIQSQCVYTFLPIRDYGLKFLVQGRFSLIASREDINGSSAENQRLR 271
Query: 1499 SEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
S P FV A F + FR ++ ++P++ E FF+ L + +S
Sbjct: 272 SLIPKAFVQAVDYFQESDYFR-------YIWPRYLPIIPE-ENFFTSLTEETI--FTLSQ 321
Query: 1559 CLILEGNNNQWAPPCKVLR---GWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS-LARA 1614
+LE + + A P + + D LP L +LS+ DS L +
Sbjct: 322 RKLLESESGELATPGTLFYVPPRFRDPNGDPLP---LCSRTAFRYLSRKYFDEDSDLFQC 378
Query: 1615 LGIEEYGPKILLQIISSLCRTENGLRSMGLSW---LASWLNELYTISFHSSGQSSLQSGV 1671
LG+ + LL + S+ W L+ LN + L++
Sbjct: 379 LGVRPLSLRDLLVDVKSVVNQYQNHSREATEWHIRLSKALNR----------EDMLRAYR 428
Query: 1672 ETDLIDNLQRIPFIPLSDGTFSSVDE--GTIWLHS---DCSVFDGGFGLEAFPNLCAKL 1725
T L IP IPL DG + S E G I+ S D SV G +E P+ + +
Sbjct: 429 HT-----LHSIPLIPLRDGRWISASESAGIIFFPSALADLSVPSGLNSVEIHPDAASNV 482
>gi|302419259|ref|XP_003007460.1| hypothetical protein VDBG_01648 [Verticillium albo-atrum VaMs.102]
gi|261353111|gb|EEY15539.1| hypothetical protein VDBG_01648 [Verticillium albo-atrum VaMs.102]
Length = 1457
Score = 142 bits (358), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 156/571 (27%), Positives = 238/571 (41%), Gaps = 102/571 (17%)
Query: 1186 KQHARLGR----ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG--- 1238
+QH + R AL L+ +L S+ +HF+LEL+QNADDN YP V PT + +L ++G
Sbjct: 33 EQHGFVARNLESALDILASQLNSKPTHFILELIQNADDNAYPRGVRPTFSLVLGKNGSNY 92
Query: 1239 -IVVLNNEQGFSAENIRALCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
+ NE GFS +NI A+C +G STK G G+IG KGIGFKSVF+V +I SN
Sbjct: 93 TLRTDCNETGFSLKNIDAICSIGKSTKTGIINGQKGFIGEKGIGFKSVFKVAHQVDILSN 152
Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEG 1354
+ + D + +G + P P P + + R K
Sbjct: 153 FYEFRIDRT-AHLGMLDPK---PHRF----------PDTQRREGHTQMLLHLLRDKDHR- 197
Query: 1355 IAMNNIVDMFSDLHPSLLLFLHRLQCI---------MFRNMLND----------SLVVIR 1395
+++I + P LLLFL +L + ++R+ D SL+ R
Sbjct: 198 --ISDITKGLQQIKPELLLFLRQLSVLNIKAENGHCVYRSARQDRDPRYDGETVSLIEDR 255
Query: 1396 KKIVGDGIIKVSCGEDKMTWFVASQ--KLRAGVIRPDVKTTEIALALTLQESNEGNYGPL 1453
D +K + K+ V + + R V +TEI LA + G
Sbjct: 256 TASPLDEPVKTT----KLYRIVRYELHDMPVDARRNSVSSTEITLAFPV--------GSE 303
Query: 1454 LYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF- 1512
L FAFLP+ G FI+Q DF+L ++R+ + PWNQ +L P FV A F
Sbjct: 304 LETCDAFAFLPIDQSGFNFIVQADFLLVANRQALQHALPWNQRILEAVPDAFVQAVHHFN 363
Query: 1513 -----------CDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLI 1561
+ P F E +S + EVH G+ K R+ + +
Sbjct: 364 HPGATHGYSLKYEWPLFLEYTGPKSSPW-------AEVH---QGI------KKRLCDEDV 407
Query: 1562 LEGNNNQWAP----PCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGI 1617
LE + + PC AH L I + GFL D ++ A
Sbjct: 408 LEARDGSFRAVRPIPCPA-GTQQTAAHKPLIVIPTEYRFDNGFLI-DATINHGHHLAF-- 463
Query: 1618 EEYGPKI-LLQIISSLCRTENGLRSMGLSWLASWLN---ELYTISFHSSGQSSLQSGVET 1673
EY P + L+ + T GLR+ WL + + +H + ++ +
Sbjct: 464 -EYDPVLEKLKTLDMKEMTSAGLRNEFSDWLERAGHAGLAAKSDKWHVALAAAFMWKGRS 522
Query: 1674 DLIDNLQRIPFIPLSDGTFSSVDEGTIWLHS 1704
D LQ +P + L+DG++++ G I+ S
Sbjct: 523 KQKDKLQELPLVHLADGSWAACASGDIFFPS 553
>gi|347976163|ref|XP_003437411.1| unnamed protein product [Podospora anserina S mat+]
gi|170940269|emb|CAP65496.1| unnamed protein product [Podospora anserina S mat+]
Length = 1640
Score = 140 bits (353), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 145/573 (25%), Positives = 247/573 (43%), Gaps = 98/573 (17%)
Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFI 1233
+E +L K +G+++ L++ELYS+D F+ EL+QNADDN + EP + F+
Sbjct: 37 KVERALLAKDQL-IGQSVITLARELYSKDVRFIFELLQNADDNHFNHAKSSGQEPYVAFL 95
Query: 1234 LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ IV+ NE GF+ EN+RA+C VG S+K G+ GYIG KGIGFKSVF+V + S
Sbjct: 96 IYHDKIVIDCNEDGFTEENLRAICSVGKSSKHGAQ-GYIGEKGIGFKSVFKVAWKVHVQS 154
Query: 1294 NGFHVKFDTSEGQ--IGFVLP-------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIR 1344
F F G +G + P + PP I L + L+ K
Sbjct: 155 GPFSFCFIHRPGDSGMGMISPEWHQSEDSTRPPTGITRMTFTLHNN---LDQKL------ 205
Query: 1345 LPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR------------NMLNDSLV 1392
+ +IV+ F DL P++LLFL +L+ I M + V
Sbjct: 206 --------QAAQRQSIVNEFKDLQPAMLLFLKKLRRIEVHFFDVEGTKESHSAMSLSAPV 257
Query: 1393 VIRKKIVGDGIIKVSCGEDKM--TWFVASQKLRAGVIRPDVKT-------------TEIA 1437
I + V E K + ++ G+ R + + T++
Sbjct: 258 DISNRAVLTTTKTTHNSEPKTFSHHYHVTKLWATGLARNENRKYSPAEESSKAYSRTKVV 317
Query: 1438 LALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWL 1497
LA L + + P++ Q +FAFLP+R G F++ DFV ++RE++ +S N+ L
Sbjct: 318 LAFPLTDQS----IPIIEPQDIFAFLPIRKIGFNFLIHTDFVTMANREDIVTSSQRNKGL 373
Query: 1498 LSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH--GFFSGLPRMILSKLR 1555
FV+A C+ P + +M ++P + H F+S ++ ++
Sbjct: 374 RRYLAQAFVTAAEQMCEHPQLQFQ-------WMRYLPRLSGNHWDSFWSEFSGILQEEIT 426
Query: 1556 MSNCLILEGNNNQWAPPCKVLRGWNDRAHSL-------LPDILLQK--HLGLGFLSKDIV 1606
+ +I + P L+ W + L D+ +K ++ L + D+
Sbjct: 427 KVDVII--PFTSMTPRPLTALKQWPSKISWLDHLDRPVFEDLPKEKAAYVSLEYQDSDLS 484
Query: 1607 LSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSS 1666
+ + +YG K L + + R + LRS + + +HS
Sbjct: 485 I---------LRDYGMK-YLSWVDLISRAKADLRSRSSKMKSPQTD----AKWHSLAAKL 530
Query: 1667 LQSGVET-DLIDNLQRIPFIPLSDGTFSSVDEG 1698
L+ G+ + ++ + +PLSDG+++SV +G
Sbjct: 531 LKRGLRNLKYEEEIKALNLLPLSDGSWTSVTKG 563
>gi|159125686|gb|EDP50803.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 1608
Score = 140 bits (352), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/471 (26%), Positives = 218/471 (46%), Gaps = 63/471 (13%)
Query: 1172 LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVE 1227
+ P+I L K+ A +G ++ L++ LY+ + F+ EL+QN DDN Y + +
Sbjct: 35 MDPDIRREVEEALLKKDAMIGSSVITLAKNLYNSTARFVFELLQNVDDNSYNVARSRSAD 94
Query: 1228 PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
P ++F + I+V NE GF+ EN+ A+C+VG S+K G+ GYIG KGIGFKSVF V
Sbjct: 95 PFVSFCVDNRRIIVECNEDGFTHENLVAICNVGKSSKTGAQ-GYIGEKGIGFKSVFMVAW 153
Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
I S F + G G + + + + ++ R L++ + L
Sbjct: 154 KVHIQSRDLSFSFRHNVGDSGMGMISPIWEESEEVLPRPLTRITLYLHE----------- 202
Query: 1348 RTKFSEGIAMN--NIVDMFSDLHPSLLLFLHRLQCIMFR----------------NMLND 1389
T + +A + F +L + LLF+ L+ I + ++
Sbjct: 203 -TGSAADLATQRETTLQQFRELKAAFLLFMKNLRRIEVKLCGSLDKEISRTTFSMRCEDE 261
Query: 1390 SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI--RPDVKTTEIALAL-TLQESN 1446
V ++++IV DG ++ + T + + KL A I + + T++ LA E+N
Sbjct: 262 GRVALKQEIVQDGDLQ------ECTQYYHTTKLTASNIPRSENREYTDVELATKAYSETN 315
Query: 1447 -------EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
+ P++ + VFAFLP+R GL F++Q DFV +SR+++ +S N LL+
Sbjct: 316 IILAFPLTRDSVPIIEKHEVFAFLPIRNMGLPFLIQADFVTDASRQDIVRSSARNLKLLT 375
Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV--GEVHGFFSGLPRMILSKLRMS 1557
F+ A + FC A +M ++P G ++ LP I +L+++
Sbjct: 376 AIAQAFIQAVKQFC-------THTSLAYQWMRYLPDFEWGSQDRYWKALPGEIRRQLQLT 428
Query: 1558 NCLILEGN-NNQWAPPCKVLRG--WNDRAHSLLPDILLQKHLGLGFLSKDI 1605
+ + + N + ++L + LLPD+ +++L + KD+
Sbjct: 429 DTMWTRSHLNMRCIADVRLLPSSMCDKYGKPLLPDLDTEQYLSPHYALKDL 479
>gi|146323927|ref|XP_751403.2| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|129557466|gb|EAL89365.2| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 1608
Score = 140 bits (352), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/471 (26%), Positives = 218/471 (46%), Gaps = 63/471 (13%)
Query: 1172 LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVE 1227
+ P+I L K+ A +G ++ L++ LY+ + F+ EL+QN DDN Y + +
Sbjct: 35 MDPDIRREVEEALLKKDAMIGSSVITLAKNLYNSTARFVFELLQNVDDNSYNVARSRSAD 94
Query: 1228 PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
P ++F + I+V NE GF+ EN+ A+C+VG S+K G+ GYIG KGIGFKSVF V
Sbjct: 95 PFVSFCVDNRRIIVECNEDGFTHENLVAICNVGKSSKTGAQ-GYIGEKGIGFKSVFMVAW 153
Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
I S F + G G + + + + ++ R L++ + L
Sbjct: 154 KVHIQSRDLSFSFRHNVGDSGMGMISPIWEESEEVLPRPLTRITLYLHE----------- 202
Query: 1348 RTKFSEGIAMN--NIVDMFSDLHPSLLLFLHRLQCIMFR----------------NMLND 1389
T + +A + F +L + LLF+ L+ I + ++
Sbjct: 203 -TGSAADLATQRETTLQQFRELKAAFLLFMKNLRRIEVKLCGSLDKEISRTTFSMRCEDE 261
Query: 1390 SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI--RPDVKTTEIALAL-TLQESN 1446
V ++++IV DG ++ + T + + KL A I + + T++ LA E+N
Sbjct: 262 GRVALKQEIVQDGDLQ------ECTQYYHTTKLTASNIPRSENREYTDVELATKAYSETN 315
Query: 1447 -------EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
+ P++ + VFAFLP+R GL F++Q DFV +SR+++ +S N LL+
Sbjct: 316 IILAFPLTRDSVPIIEKHEVFAFLPIRNMGLPFLIQADFVTDASRQDIVRSSARNLKLLT 375
Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV--GEVHGFFSGLPRMILSKLRMS 1557
F+ A + FC A +M ++P G ++ LP I +L+++
Sbjct: 376 AIAQAFIQAVKQFC-------THTSLAYQWMRYLPDFEWGSQDRYWKALPGEIRRQLQLT 428
Query: 1558 NCLILEGN-NNQWAPPCKVLRG--WNDRAHSLLPDILLQKHLGLGFLSKDI 1605
+ + + N + ++L + LLPD+ +++L + KD+
Sbjct: 429 DTMWTRSHLNMRCIADVRLLPSSMCDKYGKPLLPDLDTEQYLSPHYALKDL 479
>gi|402084208|gb|EJT79226.1| hypothetical protein GGTG_04312 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 2143
Score = 139 bits (350), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 171/378 (45%), Gaps = 49/378 (12%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALC 1257
L+ L ++ + FLLEL+QNADDN Y ++V P L + + NE GF ++++++C
Sbjct: 52 LANTLSTKPAQFLLELLQNADDNAYADSVNPELVITFTDETLQFDTNELGFRRQDVQSIC 111
Query: 1258 DVGNSTKK------GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL 1311
+G S+KK G +A IG KG+GFK+VFRV + IHS + +FD S ++G +
Sbjct: 112 SMGQSSKKEPKKQEGKAASCIGEKGVGFKAVFRVAEQVFIHSGHYSFQFD-SRRELGCIK 170
Query: 1312 PTL-VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPS 1370
P V P M W T I L K + +++ +
Sbjct: 171 PEWAVFPGKPRM---------------GW-TSIHL----KLKPDLNREQLLEEMGKFDAT 210
Query: 1371 LLLFLHRLQCIMFR----------NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVAS- 1419
LLLFL R++ I + LN ++ + G+ +S +V +
Sbjct: 211 LLLFLKRIRVIRVQIKPLYSWLSYKSLNRTIRRLDDASGLGGLAMLSLTPSTFPGYVVNH 270
Query: 1420 ---QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
+ + RP +EI LA + + V+AFLP+R+YG KF +Q
Sbjct: 271 YTVRNMPPAPERPSCAESEIVLAFPKAPQQQTKAAGFPGEHKVYAFLPIRSYGFKFTIQA 330
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV 1536
DF+LP+SRE++ +S WN+ L P V +P R + F P+
Sbjct: 331 DFLLPASREDIMLDSDWNRALREAIPPAVVDFIPKLRAVPELRYQ-------WPRFFPVR 383
Query: 1537 GEVHGFFSGLPRMILSKL 1554
++ FF GL +I +++
Sbjct: 384 HDMDDFFHGLSTIIANQV 401
>gi|342874594|gb|EGU76590.1| hypothetical protein FOXB_12889 [Fusarium oxysporum Fo5176]
Length = 2179
Score = 137 bits (344), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 158/611 (25%), Positives = 253/611 (41%), Gaps = 89/611 (14%)
Query: 1140 GLGDCTTQTLPEDKECEDAALIVESIRRDEFG-LGPNISNMESNMLKKQHARLGRALHCL 1198
G G T Q L ++ E L E + R G + P + L ++ +G A+ L
Sbjct: 9 GPGQRTAQDLVKEIAKERGYLGEEQLAR--IGEINPELRREVEEALLRKDEMIGSAVLTL 66
Query: 1199 SQELYSQDSHFLLELVQNADDNIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIR 1254
++ LY+ ++ F+ EL+QNADDN Y + +P ++F ++ + + NE GF+ EN++
Sbjct: 67 ARNLYTSNARFVFELLQNADDNNYSTALSSGQDPYVSFEVRPDKVSIECNENGFTHENLK 126
Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
A+C +G S+K G+ AGYIG KGIGFKSVF I SN F F +G G + T
Sbjct: 127 AICAIGKSSKVGA-AGYIGEKGIGFKSVFMAAWKVHIQSNDFSFSFTHRKGDSGLGMVTP 185
Query: 1315 V----------PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMF 1364
V I +F S DP E IRL F
Sbjct: 186 VWEDADESLGGSSTRITLFLH-TSDDPE--EDARQRETIRL-----------------QF 225
Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKM-----TWFVAS 1419
DL ++LLFL +L+ + D G V+ ++ + +
Sbjct: 226 QDLQHTILLFLRKLRKVQVSFFDEDDTQTSSTTYSLRGSNPVTVKKETFEGVEERQYHVT 285
Query: 1420 QKLRAGVIRPDVKT-----------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTY 1468
+ + + + + +T TE+ LA L + P++ Q VFAFLP+R
Sbjct: 286 KHVAENIPKSENRTYSEEQDRADSSTEVVLAFPLADLGT----PIVENQDVFAFLPMRPM 341
Query: 1469 GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASV 1528
G KF++ DFV +SR+ + S N+ LL+ F+ A FC P +
Sbjct: 342 GFKFLIHTDFVTEASRQGIVTTSLRNRGLLNGIADCFIKAIEEFCQHPTLQYQ------- 394
Query: 1529 YMSFVPLVGEV--HGFFSGLPRMILSKLRMSNCLILEGNNN-QWAPPCKVLRGW--NDRA 1583
+M ++P F+SGL I S+++ L G + L+ W +
Sbjct: 395 WMRWLPQRNSYPWDSFWSGLLDRIESRIQEVKILRTLGTGKLDYIHKLCRLQPWLRDKNG 454
Query: 1584 HSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGL-RSM 1642
L D+ + +L + D+ L G+E P ++ R E L R +
Sbjct: 455 DPLFTDLDEEIYLAKEYTRDDL----KLLETYGLESMSPS------DAITRVEKDLERDI 504
Query: 1643 GLSWLASWLNELYTISFHSSGQSSL----QSGVETDLIDNLQRIPFIPLSDGTF-SSVDE 1697
S + S + +HS +L + D L+++ FIPL +GT+ SS
Sbjct: 505 KDSRMKSPATD---NEWHSLAARALILLKNTATVGPAQDRLKQLRFIPLENGTWVSSSSR 561
Query: 1698 GTIWLHSDCSV 1708
+ HS S+
Sbjct: 562 PLYYSHSSGSL 572
>gi|323444857|gb|EGB01787.1| hypothetical protein AURANDRAFT_69496 [Aureococcus anophagefferens]
Length = 360
Score = 135 bits (341), Expect = 2e-28, Method: Composition-based stats.
Identities = 68/149 (45%), Positives = 96/149 (64%), Gaps = 2/149 (1%)
Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLE 1212
++ E ++ SIR FGL + + ++ + Q L AL +++++YS DSHF+LE
Sbjct: 155 RDDEPPGAVLASIRAG-FGLETDDARIDVSQ-SGQLQMLDNALKVIARDIYSSDSHFILE 212
Query: 1213 LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYI 1272
L+QNADDN Y + PT+ F +VV NNE+GF A ++RALC VG S+K +GYI
Sbjct: 213 LLQNADDNDYAPSAAPTIVFDASPEAVVVRNNERGFRARHVRALCHVGASSKGAGGSGYI 272
Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
G+KGIGFKSVFRV+ PE+HSNG+ D
Sbjct: 273 GQKGIGFKSVFRVSSRPEVHSNGYAFALD 301
>gi|347826655|emb|CCD42352.1| hypothetical protein [Botryotinia fuckeliana]
Length = 1627
Score = 135 bits (341), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 183/379 (48%), Gaps = 40/379 (10%)
Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQ-------HARLGRALHCLSQELYSQ 1205
+E +DA ++ +R G+ +L K ++G + LS++LY +
Sbjct: 7 EELQDARAFIDDLRSANGGITKEEEEELDRLLPKVKEALENLQNKVGISTKILSEQLYQK 66
Query: 1206 DSHFLLELVQNADDNIYPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
+ F+ E++QNA+DN Y + + P L+F L +++ +NE GF+ ENI+A+C +G ST
Sbjct: 67 STRFIYEIIQNAEDNEYKGSKKLPFLSFTLYSDRLIIDSNEDGFTKENIKAICSIGEST- 125
Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-----IGFVLPTLVPPFN 1319
K S GYIG KGIGFKSVF V I S + F+ S+ +G V P P
Sbjct: 126 KASIQGYIGEKGIGFKSVFTVAHKVHIQSGCYSFAFEYSKNDDGKSGLGMVTPMNETPHE 185
Query: 1320 I--DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLL---F 1374
+ D+ R++ L C L ++ + + + L + L
Sbjct: 186 LPGDVQTRMI------LYLHAGEDCGAL-YQELLNLPDTLLLFLKKLRQLTVNFELPDRN 238
Query: 1375 LHRLQCIMFR---NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQ--KLRAGVIRP 1429
+H++Q + R + L++++V + I G V+ K W L A R
Sbjct: 239 VHQVQFSLTRPNNDRLSENIVQTKTAITG----AVNSVTHKSFWVKKRPVFNLPADSARG 294
Query: 1430 DVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG 1489
++ E+ LA + E ++ P++ +Q FAFLP++ G KF++Q DF+ +SRE++
Sbjct: 295 KIREAEVILAFPIDE----DHVPVIEEQYAFAFLPMKKTGYKFLIQCDFITKASREDI-V 349
Query: 1490 NSPWNQWLLSEFPALFVSA 1508
+S WN+ LL E A F SA
Sbjct: 350 DSAWNRQLLVEVVATFRSA 368
>gi|119499764|ref|XP_001266639.1| hypothetical protein NFIA_102260 [Neosartorya fischeri NRRL 181]
gi|119414804|gb|EAW24742.1| hypothetical protein NFIA_102260 [Neosartorya fischeri NRRL 181]
Length = 1609
Score = 135 bits (341), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 123/423 (29%), Positives = 188/423 (44%), Gaps = 70/423 (16%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
E A IV+ I+ GL P + L QELY D+ FL EL+Q
Sbjct: 7 EQAQAIVDDIQVTNGGLPPEVQK------------------ALPQELYDTDTRFLFELIQ 48
Query: 1216 NADDNIY----PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY 1271
NA+DN Y + EP L L + I V +NE GF+ ++RA+C + S+KK + GY
Sbjct: 49 NAEDNCYGHAASQAEEPFLHLTLHKDYITVDSNEDGFTENDVRAICSIHQSSKK-QTGGY 107
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDP 1331
IG KGIGFKSVF+V I S F F+ +G G ++ PFN ++P
Sbjct: 108 IGHKGIGFKSVFKVAYKVSIQSGPFSFFFEHHQGGSGL---GMITPFN---------EEP 155
Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ-----------C 1380
+L NT I L F T S+ S++ ++LLFL +LQ
Sbjct: 156 QELPPGV-NTRITL-FLTNISD---FEACASELSEIPDTMLLFLRKLQRLTIEIPPLQSQ 210
Query: 1381 IMFRNM--LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIAL 1438
I F+ ++ L+ + K+ G+ + K E + R G +P+V ++ L
Sbjct: 211 ISFKRHEDMSKHLITLTKETNGEQLKKSYHLEKTTLSNLPQHPSRPG--QPEV---DLIL 265
Query: 1439 ALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
A +QE + PL+ Q V++FLP+ G F++Q DF+ +SR+ + N +
Sbjct: 266 AFPVQE----DCSPLIQAQYVYSFLPMHDEGFNFLIQSDFITQASRQGIH-RCDRNYAIR 320
Query: 1499 SEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
LF+ A FC R ++ ++P F++ L MI +L+ S
Sbjct: 321 EGISHLFLQAVAYFCKHTSLRYE-------WLQYLPGTHIRDPFWACLREMIFDRLKGSK 373
Query: 1559 CLI 1561
L
Sbjct: 374 ILF 376
>gi|358380666|gb|EHK18343.1| hypothetical protein TRIVIDRAFT_47475 [Trichoderma virens Gv29-8]
Length = 1650
Score = 133 bits (335), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 149/589 (25%), Positives = 250/589 (42%), Gaps = 102/589 (17%)
Query: 1156 EDAALIVESI-------RRDEFGLGPN-ISNMESNMLKKQHARLGRALHCLSQELYSQDS 1207
+DA +V+SI D++ L P+ + L+ A+ L++ LY+ D+
Sbjct: 9 KDACKLVKSIAEDHGYVHEDDWNLMPSAVRERVQKALRNLEGIAAVAVSTLAKNLYTSDA 68
Query: 1208 HFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
F+ EL+QNA+DN + + +E P ++F L I++ NE GF +N++A+C VG S+
Sbjct: 69 RFVFELLQNAEDNSFEKAIEANLTPYVSFELHPDHIIIECNEDGFEPKNLKAICAVGQSS 128
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ--IGFVLPTLVPPFNID 1321
K G GY+G KGIGFKSVF I S F F G +G + P P N
Sbjct: 129 KSGVQKGYVGEKGIGFKSVFMAAWKVHIKSGHFSFFFQHKRGDQGLGMISPVWEEPENDQ 188
Query: 1322 MFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI 1381
Q++ + + +P I+D FS L+ S+LLF+ L I
Sbjct: 189 ------PHPGTQMKLELGDGYTPIP-------SSQYQIILDQFSSLNESILLFMRNLGEI 235
Query: 1382 ---MFRN-------------MLNDSLVVIRK--KIVGD--------GIIKVSCG----ED 1411
+F N +N +L + K K + I K + G +
Sbjct: 236 RISIFDNEGNLEKSNTFSKERINANLTRLTKVSKTANNVDSSFKDYYIFKHTVGNLDKNE 295
Query: 1412 KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
T+ +K +A T EI L L E + P++ Q +FAFLP++ G
Sbjct: 296 NRTYSEYEEKTKA------YATAEIVLGFPLTEES----IPIVEYQDIFAFLPMKEVGFT 345
Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
F++ DFV ++R+++ S N + F+ A FC+ P K +M
Sbjct: 346 FLINSDFVTQANRQDIVTTSSRNIGIRQGIADAFIQAILEFCNHP-------KLQYTWMR 398
Query: 1532 FVPLVGE-VHGFFSGLPRMILSKLRMSNCLILEGNNN-------QWAPPCKVLRGWNDRA 1583
F+P + F+S L ++ KL+ + L +++ ++ R
Sbjct: 399 FLPNKDQHFDAFWSQLVTLLDDKLKETAVLRPRSETTFRLISSLRYSVDSQLDR----HG 454
Query: 1584 HSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEY-GPKILLQIISSLCRTENGLRS- 1641
L D + H+ + +D+ + L G+ EY +ILL I L +++ ++S
Sbjct: 455 EPLFRDNNPEMHISKHYAKEDVNI---LRNFYGLTEYTASEILLAISLDLGMSDSRMKSE 511
Query: 1642 -MGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSD 1689
M W N L + + +G +S++S L+ +PF+PL D
Sbjct: 512 DMDDDWHTRSANYLSYV--YLTGDASIRS--------KLRSLPFLPLQD 550
>gi|302841821|ref|XP_002952455.1| hypothetical protein VOLCADRAFT_93075 [Volvox carteri f. nagariensis]
gi|300262391|gb|EFJ46598.1| hypothetical protein VOLCADRAFT_93075 [Volvox carteri f. nagariensis]
Length = 4091
Score = 133 bits (334), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/148 (49%), Positives = 86/148 (58%), Gaps = 22/148 (14%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLE--------- 1212
VESIRR+EFG G + + + + AR GRAL L+ ELYS DSHF++E
Sbjct: 1609 VESIRREEFGEGLELGGEAAALSGRLAARTGRALQRLAAELYSADSHFVMELVQLPLLAL 1668
Query: 1213 ------------LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVG 1260
+QNADDN YP V P L F+LQ I NNE GFS N+RALCD G
Sbjct: 1669 LLLPSHSVPCRTFLQNADDNSYPAGVTPCLEFVLQRDVITAANNEVGFSPANLRALCDAG 1728
Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
STK GYIG+KGIGFKSVFRV+ A
Sbjct: 1729 ASTKS-RVTGYIGQKGIGFKSVFRVSSA 1755
Score = 58.2 bits (139), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 56/135 (41%), Gaps = 38/135 (28%)
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLP-------------------- 1516
D+V+PSSRE +D +SPWNQ L + P LF+ A F LP
Sbjct: 1882 DWVVPSSREALDVDSPWNQELRARLPGLFLRALAVFKQLPHPYVAASGAAAAAGTTTACG 1941
Query: 1517 ------CFRENPAKAASV----------YMSFVPLVGEVHGFFSGLPRMILSKLRMSNCL 1560
C + + ++ VPL GE GFF+GLP I + LR + C+
Sbjct: 1942 GGGGMQCIGQQHQQHQQRRAAELFWLDQWLRCVPLEGEAQGFFAGLPHSIATLLRAAPCI 2001
Query: 1561 ILEGNNNQWAPPCKV 1575
+ WA P +
Sbjct: 2002 PT--ADGGWATPSET 2014
>gi|134084320|emb|CAK48660.1| unnamed protein product [Aspergillus niger]
Length = 1689
Score = 133 bits (334), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 202/828 (24%), Positives = 331/828 (39%), Gaps = 149/828 (17%)
Query: 1156 EDAALIVESIRRDEFGLGPNI---------SNMESNMLKKQHARLGRALHCLSQELYSQD 1206
E A +VE I + LG ++ +E ML+K +G ++ LS+ LY+
Sbjct: 9 ERARKLVEGISKRHGYLGEDVLSQMSDEVRRQVEEAMLRKDEM-IGSSVVTLSKNLYNSS 67
Query: 1207 SHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
+ F+ ELVQNADDN Y E P+L F + IVV NE GF+ ENI A+C+VG S
Sbjct: 68 ARFVFELVQNADDNHYTTARERATVPSLAFHVYPRRIVVDCNEDGFTHENIVAICNVGKS 127
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
+K G+ GYIG KGIGFKSVF V I S F F G G + M
Sbjct: 128 SKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSGDFSFSFRHKPGGSG-----------MGM 175
Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF---LHRLQ 1379
+ QL+ + L T E AM F +L +LLLF L RL
Sbjct: 176 ISPIWEDVEEQLQRPMTRMTLYLHEDTAPRE-TAMQQ----FRELQDTLLLFTKNLQRLG 230
Query: 1380 CIMFRN-------------MLNDSLVVIRKKIVG-DGIIKVSCGEDKMTWFVA------- 1418
M+ + N++L V+ K ++ DG + +T +VA
Sbjct: 231 VFMYDDDEKQLSSTVFAISSQNENLKVLTKTLLAPDGTSEELTRNYHLTKYVAHDLPGSE 290
Query: 1419 -------SQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
++ RA +I LA + + N P++ Q VFAFLP+R G
Sbjct: 291 NRTYTDKERETRA------YSRADIILAFPVTK----NSVPVIEPQDVFAFLPIRNMGFT 340
Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
F + +++ +S N+ L +E F +A P R +M
Sbjct: 341 V----GFSSCKTLQDIVRSSARNRKLRTEISIAFGTAVLEMRKHPTLRYR-------WMR 389
Query: 1532 FVPLVGEVHG--FFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGW----NDRAHS 1585
++P H F+ GL I S+L +S ++ + W P ++ R W +
Sbjct: 390 YLPKRDGHHWDPFWVGLLGEIKSRL-ISLPILQSRTSGSWFPIEQMERLWEHWFDKSGDP 448
Query: 1586 LLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLS 1645
L PD+ +++L + +L KDI + EYG + + + N M L
Sbjct: 449 LFPDLPREQYLSVHYLEKDIT---------RLMEYG--LTFMAMPGMIARAN----MDLE 493
Query: 1646 WLASWL-NELYTISFHSSGQSSLQSGVE---TDLIDNLQRIPFIPLSDGTFSSVDEGTIW 1701
+ S + N+ + +HS ++L + I+ ++ +P IPL G + S+ G ++
Sbjct: 494 FPHSKMKNKETSEDWHSRAATALSHSFRLGYDNCIEEVKAMPLIPLKTGQWVSIKSGAVY 553
Query: 1702 LHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQ 1761
F G++ +L L T G I+ ++ IGVQ
Sbjct: 554 -------FPSIHGIDIPSDLGVGLVTS---------------GAINNPQRKQLFKDIGVQ 591
Query: 1762 QLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRK 1821
+ S + K+ I D +A G + ++L C V
Sbjct: 592 EASVDFVRKL-----IKDNYSAYGIFFIPVSREHLTFLYLT-ECLASSSSTALFVHLFNG 645
Query: 1822 KAFVLTNHGF--KRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPA-KESL 1878
K ++H + ++P ++F E +P L+ I +D++++ ++
Sbjct: 646 KGLAFSDHKYQIRQPKIHTVYFPNE--DPYGAKALLEPIQGGAPGLDVSFINEDYLRDPP 703
Query: 1879 SCGLVKWRKFFKEIGITDFVQVVQVDKDVA--DISHTGFKNMWTKELL 1924
SC + RK+ + + QV V ++ D+ H+ F+ +E L
Sbjct: 704 SCPPKEQRKWIEWLQ-----QVFHVRGTLSLIDLRHSPFRARLGQECL 746
>gi|429851713|gb|ELA26885.1| ino80 chromatin remodeling complex protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 1640
Score = 132 bits (332), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 178/392 (45%), Gaps = 55/392 (14%)
Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRA 1255
+ LYS + F+ EL+QNADDN Y + P ++F + S IV+ NE GF+ EN++A
Sbjct: 40 KNLYSSKTRFIFELLQNADDNQYSKASALGDTPYISFRIYPSRIVIHCNEDGFTHENMKA 99
Query: 1256 LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
+C VG S+K G+ GYIG KGIGFKSVF V I SN F V F +G G
Sbjct: 100 ICAVGESSKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSNAFSVSFKHKKGDSG------- 151
Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN--IVDMFSDLHPSLLL 1373
+ M + + L+S+ + L T+ S G+A I F +L ++LL
Sbjct: 152 ----MGMITPVWEETDEVLQSQLTRITLHL-LNTETSNGVAETREMIRAQFDELQETVLL 206
Query: 1374 FLHRLQCIMFRNMLND----SLVVIRKKIVGDGIIKVSC-------GEDKMTWFVASQKL 1422
FL ++ I +D + V + G +K+S ++ +F ++
Sbjct: 207 FLKNIRKISVFFYGDDGRRTTAVSHLRTQPQPGQVKLSSVYATNGRQREQTKYFHVTRGK 266
Query: 1423 RAGVIRPDVKT-------------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG 1469
+ + D +T +E+ +A L E+ P++ Q VFAFLP+R G
Sbjct: 267 ATNLAKNDNRTYTEAEDAAKAYSASEVVVAFPLSET----LVPIVEPQDVFAFLPVRPAG 322
Query: 1470 LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVY 1529
F++Q DFV ++R+++ +S N LL FV A F R +
Sbjct: 323 FNFVIQADFVTNANRQDILKDSRRNIGLLHAVGEAFVIAAIDFQAHETLRYR-------W 375
Query: 1530 MSFVP-LVGEVHGFFSGLPRMILSKLRMSNCL 1560
+ F+P G GF+ GL I LR L
Sbjct: 376 VQFLPDRNGSWDGFWLGLVEKIAMLLRFEPVL 407
>gi|302889371|ref|XP_003043571.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256724488|gb|EEU37858.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 2274
Score = 129 bits (324), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 184/409 (44%), Gaps = 54/409 (13%)
Query: 1192 GRALHCLSQE----LYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG------IVV 1241
RAL+ + +E LY+ HFLLEL+QNADD Y V PT+ +E G + +
Sbjct: 60 ARALNAVLKETAGKLYADRMHFLLELLQNADDCQYAPGVTPTVNLTYKEHGDGRATTLRI 119
Query: 1242 LNNEQGFSAENIRALCDVGNSTKKGSSAG--YIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
NE GF +EN+RA+C S+K + G KG+GFKSVF V D I S + K
Sbjct: 120 DCNEIGFHSENVRAICGFCESSKPAAKDQNRTTGEKGMGFKSVFEVADVVWISSGFYSFK 179
Query: 1300 F-DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMN 1358
F + G + + P L D +L +T + L K +GI +
Sbjct: 180 FVNNDSGMLSMITPHWAK----------LPDDERRLRG---HTSMLLELSAKCDKGILLE 226
Query: 1359 NIVDMFSDLHPSLLLFLHRLQCIMF------RNMLNDSLVVIRKKIVGDGIIK--VSCGE 1410
N M D +LFL L+ I + ++ + DG+ + G+
Sbjct: 227 NFKSMSLDT----VLFLRNLKRIKISTIEAGKKRWETTIKRLDDGPTKDGMFSRTIQIGD 282
Query: 1411 DKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPL----LYQQPVFAFLPLR 1466
++ V + +R D + ++ + L ++ + L L Q V AFLP+
Sbjct: 283 TLASYHVFKHAVNERDLRYDDRRRDVKDSQILLAFSKADDLSLPRQDLKAQKVHAFLPID 342
Query: 1467 TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAA 1526
YG KF +Q DF+L SSRE+V+ + PWN L+S+ P F+ A ++F F
Sbjct: 343 HYGFKFAIQADFILVSSREDVE-DYPWNTSLVSQIPTAFLGAIKAFQSQDVFS------- 394
Query: 1527 SVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKV 1575
++S+VPL F +P I++ L S +L+ + +W P K+
Sbjct: 395 --WISYVPLDHYADTLFQDVPAAIVNVL--SENPVLQSDQREWLLPSKL 439
>gi|46137929|ref|XP_390655.1| hypothetical protein FG10479.1 [Gibberella zeae PH-1]
Length = 1889
Score = 129 bits (323), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 145/582 (24%), Positives = 246/582 (42%), Gaps = 109/582 (18%)
Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV----EPTLTFIL 1234
+E +L+K +G A+ L++ LY+ ++ F+ EL+QNADDN Y V +P ++F +
Sbjct: 49 VEEALLRKDEM-IGSAVLTLARNLYTSNARFVFELLQNADDNDYSTAVSQGQDPYVSFKV 107
Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
+ + NE GF+ EN++A+C +G S+K G +AGYIG KGIGFKSVF I SN
Sbjct: 108 HPDKVTIECNENGFTHENLKAICAIGKSSKVG-AAGYIGEKGIGFKSVFMAAWKVHIQSN 166
Query: 1295 GFHVKFDTSEGQIGFVLPTLV----PPFNIDMFCRL-----LSKDPVQLESKCWNTCIRL 1345
F F +G G + T V D R+ S+DP + E + IRL
Sbjct: 167 KFSFSFTHRKGDSGLGMVTPVWEETDEATGDSSTRITLLLHTSEDPEEDERR--RDTIRL 224
Query: 1346 PFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL-------------- 1391
F DL ++LLFL +L+ + ND +
Sbjct: 225 -----------------QFQDLQHTILLFLQKLRKVQVSFYDNDDVQTATTVYSLHGSNP 267
Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKT-----------TEIALAL 1440
IRK+ DG+ E++ + ++ + + R + +T TE+ LA
Sbjct: 268 ATIRKE-TSDGV------EERQ--YHVTKHVAENIPRSENRTYSDERDRADSSTEVVLAF 318
Query: 1441 TLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSE 1500
L ++ P++ + VFAFLP+ F++ DFV +SR+ + S N+ LL
Sbjct: 319 PLTDTGT----PIVENKDVFAFLPI------FLIHTDFVTEASRQGIVVTSLRNRALLKG 368
Query: 1501 FPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV--HGFFSGLPRMILSKLRMSN 1558
V A FC P + +M ++P F+S L I ++
Sbjct: 369 IADCLVKAIEEFCRHPTLQYQ-------WMRWLPQRDSYPWDSFWSSLLDRIKDRIHQVK 421
Query: 1559 CLILEGNNN-QWAPPCKVLRGW--NDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARAL 1615
L G+ + + L+ W +++ L DI + +L + D+ L +
Sbjct: 422 VLRTLGSRQLGYMYQLRNLQSWLRDEKGDPLFTDINNEIYLAKEYAKDDL----DLLKPY 477
Query: 1616 GIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDL 1675
G++ P + + R EN L+ + + + + HSS +L ++
Sbjct: 478 GLQGASP------MEVILRIENDLQKPPEASIMK--RSMISNDGHSSAARALMLLLKNIT 529
Query: 1676 IDN-------LQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFD 1710
++N L+++ FIPL +G + S ++ D
Sbjct: 530 LNNPGPIQARLEQLRFIPLDNGAWVSASRRPLYYSHSSGQLD 571
>gi|325093511|gb|EGC46821.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 1631
Score = 128 bits (322), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 119/422 (28%), Positives = 193/422 (45%), Gaps = 46/422 (10%)
Query: 1156 EDAALIVESIRRDEFGLGPNIS-NMESNMLK---KQHARLGRALHCLSQELYSQDSHFLL 1211
E A IVE IR + GL + ++ + L+ + G ++ ++++LY D+ F+
Sbjct: 63 EKAREIVEEIRINNGGLREEVQRSLHAEALRSIRNLQSLAGASIRHVAEDLYDADTRFIF 122
Query: 1212 ELVQNADDNIYP----ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
ELVQNA+D+ Y + EP + F L I V NE GF+ ++RA+C + NS+K+
Sbjct: 123 ELVQNAEDSRYSHAKSQEEEPFIYFTLYPDRITVDTNEDGFTENDVRAICSIHNSSKR-Q 181
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
GYIG KGIGFKSVF+V I S F F +G+ G ++ PF
Sbjct: 182 IGGYIGHKGIGFKSVFKVAHKVHIQSGPFCFSFTHRKGESGM---GMITPF--------- 229
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
S++P L N R E + ++ +LLLFL RL I +
Sbjct: 230 SREPEDLPP---NVKTRFTLYFIHPEDFQIRE--GELKEIPDTLLLFLRRLCKIGVKIEP 284
Query: 1388 NDSLVVIRKKIVG-DGIIKV--SCGE--DKMTWFVASQKLRAGVIRPDVK---TTEIALA 1439
+ ++ +++ G +G I + G+ K + V + D T E+ LA
Sbjct: 285 SGFRLLFKREQTGPNGRITLVKEAGDVVSKRIYHVEGAEFENLPEHGDQSAQTTAEVILA 344
Query: 1440 LTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
+ +S+ P++ Q V++FLP+R G KF++Q DF+ ++R+ V P N +
Sbjct: 345 FPVDDSHR----PIIEPQHVYSFLPMRQEGFKFLIQSDFITTANRQGVH-LCPRNYAIRE 399
Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNC 1559
+FV A +FC A ++ ++P F++ L MIL L S
Sbjct: 400 RISLVFVQAVYNFC-------KNATLKYEWLQYLPGPSIPDPFWATLRDMILESLSESKI 452
Query: 1560 LI 1561
L+
Sbjct: 453 LL 454
>gi|240277430|gb|EER40938.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
Length = 1609
Score = 128 bits (321), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 119/422 (28%), Positives = 193/422 (45%), Gaps = 46/422 (10%)
Query: 1156 EDAALIVESIRRDEFGLGPNIS-NMESNMLK---KQHARLGRALHCLSQELYSQDSHFLL 1211
E A IVE IR + GL + ++ + L+ + G ++ ++++LY D+ F+
Sbjct: 41 EKAREIVEEIRINNGGLREEVQRSLHAEALRSIRNLQSLAGASIRHVAEDLYDADTRFIF 100
Query: 1212 ELVQNADDNIYP----ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
ELVQNA+D+ Y + EP + F L I V NE GF+ ++RA+C + NS+K+
Sbjct: 101 ELVQNAEDSRYSHAKSQEEEPFIYFTLYPDRITVDTNEDGFTENDVRAICSIHNSSKR-Q 159
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
GYIG KGIGFKSVF+V I S F F +G+ G ++ PF
Sbjct: 160 IGGYIGHKGIGFKSVFKVAHKVHIQSGPFCFSFTHRKGESGM---GMITPF--------- 207
Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
S++P L N R E + ++ +LLLFL RL I +
Sbjct: 208 SREPEDLPP---NVKTRFTLYFIHPEDFQIRE--GELKEIPDTLLLFLRRLCKIGVKIEP 262
Query: 1388 NDSLVVIRKKIVG-DGIIKV--SCGE--DKMTWFVASQKLRAGVIRPDVK---TTEIALA 1439
+ ++ +++ G +G I + G+ K + V + D T E+ LA
Sbjct: 263 SGFRLLFKREQTGPNGRITLVKEAGDVVSKRIYHVEGAEFENLPEHGDQSAQTTAEVILA 322
Query: 1440 LTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
+ +S+ P++ Q V++FLP+R G KF++Q DF+ ++R+ V P N +
Sbjct: 323 FPVDDSHR----PIIEPQHVYSFLPMRQEGFKFLIQSDFITTANRQGVH-LCPRNYAIRE 377
Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNC 1559
+FV A +FC A ++ ++P F++ L MIL L S
Sbjct: 378 RISLVFVQAVYNFC-------KNATLKYEWLQYLPGPSIPDPFWATLRDMILESLSESKI 430
Query: 1560 LI 1561
L+
Sbjct: 431 LL 432
>gi|77551082|gb|ABA93879.1| expressed protein [Oryza sativa Japonica Group]
Length = 1707
Score = 127 bits (320), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 168/348 (48%), Gaps = 57/348 (16%)
Query: 1214 VQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTK 1264
V NA+DN YP V P+L F++ + I ++ NNE+GFS NI ++ VG STK
Sbjct: 62 VPNAEDNEYPSGVAPSLDFLVTSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTK 121
Query: 1265 KGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNI 1320
KG+ GYIG KGIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V
Sbjct: 122 KGNRDKGYIGEKGIGFKSVFLISSQPHIFSNGYQIKFNGKPCAECGIGYIVPEWVESRPS 181
Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
R + L + T I LP +++ ++ + S +HP +LLFL +++
Sbjct: 182 LSDIRTIYGSSKVLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQ 232
Query: 1381 IMFR--------------NMLNDSLVVIRKKIVGDGI------IKVSCGEDKMTWFVASQ 1420
+ + ++ ++ RK + + ++ GE++ +++ Q
Sbjct: 233 LSVKEENVNHKCSPVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQ 292
Query: 1421 KLRAGVIRPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
K ++P+ + E + L + G + V+AFLP FI
Sbjct: 293 KFP---VKPENRVDKRAEIDEWVITLAFPYGQRLSRGKQM-SPGVYAFLPTEMVTNFPFI 348
Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----ERSFCDLPC 1517
+Q DF+L SSRE + +SPWN+ +L P+ F++A +S D+P
Sbjct: 349 IQADFLLASSREAILFDSPWNKGILECVPSAFLNAFVALVKSGADVPA 396
>gi|222616035|gb|EEE52167.1| hypothetical protein OsJ_34022 [Oryza sativa Japonica Group]
Length = 1690
Score = 127 bits (320), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 168/348 (48%), Gaps = 57/348 (16%)
Query: 1214 VQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTK 1264
V NA+DN YP V P+L F++ + I ++ NNE+GFS NI ++ VG STK
Sbjct: 45 VPNAEDNEYPSGVAPSLDFLVTSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTK 104
Query: 1265 KGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNI 1320
KG+ GYIG KGIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V
Sbjct: 105 KGNRDKGYIGEKGIGFKSVFLISSQPHIFSNGYQIKFNGKPCAECGIGYIVPEWVESRPS 164
Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
R + L + T I LP +++ ++ + S +HP +LLFL +++
Sbjct: 165 LSDIRTIYGSSKVLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQ 215
Query: 1381 IMFR--------------NMLNDSLVVIRKKIVGDGI------IKVSCGEDKMTWFVASQ 1420
+ + ++ ++ RK + + ++ GE++ +++ Q
Sbjct: 216 LSVKEENVNHKCSPVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQ 275
Query: 1421 KLRAGVIRPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
K ++P+ + E + L + G + V+AFLP FI
Sbjct: 276 KFP---VKPENRVDKRAEIDEWVITLAFPYGQRLSRGKQM-SPGVYAFLPTEMVTNFPFI 331
Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----ERSFCDLPC 1517
+Q DF+L SSRE + +SPWN+ +L P+ F++A +S D+P
Sbjct: 332 IQADFLLASSREAILFDSPWNKGILECVPSAFLNAFVALVKSGADVPA 379
>gi|156082529|ref|XP_001608749.1| hypothetical protein [Babesia bovis T2Bo]
gi|154795998|gb|EDO05181.1| conserved hypothetical protein [Babesia bovis]
Length = 2581
Score = 127 bits (320), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 109/178 (61%), Gaps = 6/178 (3%)
Query: 1143 DCTTQTLPEDKECEDAALIVESIRRDEFGLG--PNISNMESNMLKKQHARLGRALHCLSQ 1200
D ++ T + + ++++IR FG G +++ S ++ Q +L R++ LS+
Sbjct: 673 DTSSNTAVSSMQLDQGKALIDAIRLYTFGYGILEHLNPEASAIVNNQRKQLERSVRRLSE 732
Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVG 1260
+LYS H LEL+QNADDN Y + P + F+L SGIVV+NNE GF+ +++ALCD+G
Sbjct: 733 DLYSSRCHLHLELIQNADDNKYNCPL-PEIGFVLDRSGIVVINNECGFTDSDVKALCDIG 791
Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV 1315
+S+K SS IG+ G+GFKSVF VT P + SNGFH F ++ + + +++P V
Sbjct: 792 SSSKTNSSRDLIGQFGLGFKSVFLVTHTPCVFSNGFHFNFSSNPNLQNNVMYLMPHWV 849
>gi|115485639|ref|NP_001067963.1| Os11g0514000 [Oryza sativa Japonica Group]
gi|113645185|dbj|BAF28326.1| Os11g0514000 [Oryza sativa Japonica Group]
Length = 1757
Score = 127 bits (320), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 168/348 (48%), Gaps = 57/348 (16%)
Query: 1214 VQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTK 1264
V NA+DN YP V P+L F++ + I ++ NNE+GFS NI ++ VG STK
Sbjct: 62 VPNAEDNEYPSGVAPSLDFLVTSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTK 121
Query: 1265 KGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNI 1320
KG+ GYIG KGIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V
Sbjct: 122 KGNRDKGYIGEKGIGFKSVFLISSQPHIFSNGYQIKFNGKPCAECGIGYIVPEWVESRPS 181
Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
R + L + T I LP +++ ++ + S +HP +LLFL +++
Sbjct: 182 LSDIRTIYGSSKVLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQ 232
Query: 1381 IMFR--------------NMLNDSLVVIRKKIVGDGI------IKVSCGEDKMTWFVASQ 1420
+ + ++ ++ RK + + ++ GE++ +++ Q
Sbjct: 233 LSVKEENVNHKCSPVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQ 292
Query: 1421 KLRAGVIRPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
K ++P+ + E + L + G + V+AFLP FI
Sbjct: 293 KFP---VKPENRVDKRAEIDEWVITLAFPYGQRLSRGKQM-SPGVYAFLPTEMVTNFPFI 348
Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----ERSFCDLPC 1517
+Q DF+L SSRE + +SPWN+ +L P+ F++A +S D+P
Sbjct: 349 IQADFLLASSREAILFDSPWNKGILECVPSAFLNAFVALVKSGADVPA 396
>gi|147783923|emb|CAN74688.1| hypothetical protein VITISV_001348 [Vitis vinifera]
Length = 288
Score = 127 bits (319), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 84/170 (49%), Positives = 98/170 (57%), Gaps = 29/170 (17%)
Query: 37 NFAFQNPNIFFPNPALQLLQNLTNIPF-QNPNFDIQNPNLFMQNANLLTQNPNLP----- 90
N+ QN N +F NP Q NL +P+ QNP QNP L MQN NL +NPNLP
Sbjct: 35 NYFIQNVNPYFQNPTFQA--NL-GLPYLQNPTLPTQNPTLPMQNTNLPLKNPNLPMQNXS 91
Query: 91 -PQQPPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFLERIDRAVVKAR 149
P Q PS A Q + T PQP K NKE L+R+D AV KAR
Sbjct: 92 FPLQNPSFAI-QNTNFT------------------GFRPQPPKRNKEALDRVDGAVXKAR 132
Query: 150 SELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEAR 199
++I GESVS+WKVS+S LL LQVDSW SLGF MQEVPSLH L+V E +
Sbjct: 133 RDVIAXGESVSAWKVSQSALLALQVDSWESLGFPMQEVPSLHSLIVIEGK 182
>gi|302825725|ref|XP_002994453.1| hypothetical protein SELMODRAFT_432374 [Selaginella moellendorffii]
gi|300137602|gb|EFJ04480.1| hypothetical protein SELMODRAFT_432374 [Selaginella moellendorffii]
Length = 1445
Score = 126 bits (316), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 125/444 (28%), Positives = 195/444 (43%), Gaps = 91/444 (20%)
Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
+ +E +RR+ FG+G + N + ML++ + LS ELY +D HFL ELVQ
Sbjct: 6 MFIERLRRELFGIGQSGPNPLAPMLQE-------TIRLLSAELYQKDIHFLSELVQET-- 56
Query: 1220 NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-SAGYIGRKGIG 1278
NI +LC G STKKG S GYIG KGIG
Sbjct: 57 --------------------------------NIESLCGSGQSTKKGKKSQGYIGEKGIG 84
Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLE 1335
FKSVF VT P I SNGF ++FD +E ++G+++P V + ++ SK
Sbjct: 85 FKSVFLVTKQPYIISNGFRIRFDEEPHNEAKLGYIVPEWVDRPTDAVLQKVCSK------ 138
Query: 1336 SKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI-MFRNMLNDSLVVI 1394
++ T I LP R + ++ + + + LLFL +++ + ++ ++
Sbjct: 139 AELPATVIILPLRQE-----KISEVAKQLTQIPAETLLFLSKIRMLSIYDQQSRPPTELV 193
Query: 1395 RKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI---------------------RPDVKT 1433
+ V K E T+ AS R G R DV +
Sbjct: 194 LSRSVEMTTSKNESVETSTTFLTASNG-RNGTQHSNYYMFKQTLKVPEEAKVENRKDVDS 252
Query: 1434 TEIALALTLQE--SNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPSSREEVDGN 1490
I LA + + S+ + G +F+FLP + GL F++ DF+L SSRE + +
Sbjct: 253 WTITLAFPVDDRISSSCSIG------DIFSFLPSDIHSGLPFLINSDFLLVSSRETLLFD 306
Query: 1491 SPWNQWLLSEFPALFVSAERSFCDLPCF--RENPAKAASVYMSFVPLVGEVHGFFSGLPR 1548
SPWNQ + P +F +A + F + P+ A +VP+ + + +
Sbjct: 307 SPWNQGIFRCVPEVFYNAFELLLNTTAFGLQSIPSVARMAMYKYVPVNVCRNSYLERVRN 366
Query: 1549 MILSKLRMSNCLILEGNNNQWAPP 1572
ILSKL+ +N +IL ++ APP
Sbjct: 367 DILSKLKDNN-VILCASDVGRAPP 389
>gi|398410674|ref|XP_003856685.1| hypothetical protein MYCGRDRAFT_34426 [Zymoseptoria tritici IPO323]
gi|339476570|gb|EGP91661.1| hypothetical protein MYCGRDRAFT_34426 [Zymoseptoria tritici IPO323]
Length = 1453
Score = 126 bits (316), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 161/366 (43%), Gaps = 46/366 (12%)
Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE----NVEPTLTFIL 1234
+E ++L K G A+ L+Q +Y D+ F+ EL+QNADDN + + P ++F +
Sbjct: 37 VEESLLAKDKI-AGHAIITLAQNIYGSDARFVFELLQNADDNCFEKATAAGAAPFVSFHV 95
Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
IVV NE GF+ +N+ A+C VG S+K ++ GYIG KGIGFKSVF + S
Sbjct: 96 YSDRIVVECNEDGFTLQNLTAICAVGQSSK-STTHGYIGAKGIGFKSVFIAAWKVHVQSG 154
Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEG 1354
F F +G G + M + + L S + L TK +E
Sbjct: 155 HFSFDFKHKKGDSG-----------LGMVVPIWHETSETLPSPMTRMTLYL-HETKGAEH 202
Query: 1355 I--AMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDK 1412
+ + + S L + LLFL +L+ I N S + + G D+
Sbjct: 203 LNDIRKTVFEQLSSLQETSLLFLRKLKKI------NVSFYDANDHQASSKLFQCDKGNDQ 256
Query: 1413 MTWFVASQKLRAGVIRPDVK----TTEIALALTLQESNEG--------------NYGPLL 1454
+G + D K TT +A L+ ES + PLL
Sbjct: 257 NFRLEIVHTTHSGATKKDTKRFHVTTHLATGLSASESRGAAAEAEVILAFPLTHDLKPLL 316
Query: 1455 YQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
+Q +FAFLP+R +FI+ DF +SR+++ S N+ LL+ F +A FC
Sbjct: 317 EKQELFAFLPVRKSEFEFIIHSDFDTTASRQDIMTTSRRNRDLLNAIADAFKTAVTEFCG 376
Query: 1515 LP--CF 1518
P C+
Sbjct: 377 HPELCY 382
>gi|70986517|ref|XP_748750.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66846380|gb|EAL86712.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 1452
Score = 125 bits (314), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 118/410 (28%), Positives = 175/410 (42%), Gaps = 78/410 (19%)
Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFILQESGI 1239
L+ + ++G ++ L++ LY+ ++ F+ EL+QNADDN + P ++F + I
Sbjct: 44 LRAKDEKIGHSIKTLAKNLYNSNARFVFELLQNADDNRFMLARKHKELPFISFKVYPDRI 103
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
VV NE GFS E++ A+C VG STK +S GYIG KGIGFKSVF I S F
Sbjct: 104 VVECNEDGFSIEDLSAICSVGESTK-AASHGYIGAKGIGFKSVFIAAWKVHIQSGNFSFH 162
Query: 1300 FDTSEGQIGF--VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA- 1356
F G +G VLP P E +C + R+ +EG
Sbjct: 163 FKHRRGDLGLGMVLPVWEDP-----------------EYECPDRLTRMTLYLH-TEGEPH 204
Query: 1357 -----MNNIVDMFSDLHPSLLLFLHRLQCI---------------MFRNMLNDSLVVIRK 1396
I SDL + LLFL ++ I FR V
Sbjct: 205 ALEHLRKTIFQQLSDLQQTSLLFLRNIKQIGVFFDDGNGGMRRSKTFRVRDAPGHSVFLD 264
Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKT-------------TEIALALTLQ 1443
V DG + E + + +++ + D + E+ LA L
Sbjct: 265 ATVVDGDENSTTVEQR---YHVTRRTATELPTSDNRNLLSTLETNTAFSEAEVVLAFPLT 321
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
+ PLL +Q +FAFLP+R KF++Q DF ++R+++ S N LL +
Sbjct: 322 VDSR----PLLEKQEIFAFLPVRESRFKFLIQSDFDTSANRQDISTTSRRNLCLLPHIAS 377
Query: 1504 LFVSAERSFCDLP--CFRENPAKAASVYMSFVPLVGEVHG-FFSGLPRMI 1550
FV A FC+ P C+ + SF+P + + G F+S L R I
Sbjct: 378 AFVMAVLEFCEDPELCY---------AWPSFLPSLDDDMGDFWSPLARSI 418
>gi|159123450|gb|EDP48569.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 1590
Score = 125 bits (314), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 118/410 (28%), Positives = 175/410 (42%), Gaps = 78/410 (19%)
Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFILQESGI 1239
L+ + ++G ++ L++ LY+ ++ F+ EL+QNADDN + P ++F + I
Sbjct: 44 LRAKDEKIGHSIKTLAKNLYNSNARFVFELLQNADDNRFTLARKHKELPFISFKVYPDRI 103
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
VV NE GFS E++ A+C VG STK +S GYIG KGIGFKSVF I S F
Sbjct: 104 VVECNEDGFSIEDLSAICSVGESTK-AASHGYIGAKGIGFKSVFIAAWKVHIQSGNFSFH 162
Query: 1300 FDTSEGQIGF--VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA- 1356
F G +G VLP P E +C + R+ +EG
Sbjct: 163 FKHRRGDLGLGMVLPVWEDP-----------------EYECPDRLTRMTLYLH-TEGEPH 204
Query: 1357 -----MNNIVDMFSDLHPSLLLFLHRLQCI---------------MFRNMLNDSLVVIRK 1396
I SDL + LLFL ++ I FR V
Sbjct: 205 ALEHLRKTIFQQLSDLQQTSLLFLRNIKQIGVFFYDGNGGMRRSKTFRVRDAPGHSVFLD 264
Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKT-------------TEIALALTLQ 1443
V DG + E + + +++ + D + E+ LA L
Sbjct: 265 ATVVDGDENSTTVEQR---YHVTRRTATELPTSDNRNLLSTLETNTAFSEAEVVLAFPLT 321
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
+ PLL +Q +FAFLP+R KF++Q DF ++R+++ S N LL +
Sbjct: 322 VDSR----PLLEKQEIFAFLPVRESRFKFLIQSDFDTSANRQDISTTSRRNLCLLPHIAS 377
Query: 1504 LFVSAERSFCDLP--CFRENPAKAASVYMSFVPLVGEVHG-FFSGLPRMI 1550
FV A FC+ P C+ + SF+P + + G F+S L R I
Sbjct: 378 AFVMAVLEFCEDPELCY---------AWPSFLPSLDDDMGDFWSPLARSI 418
>gi|407922036|gb|EKG15164.1| hypothetical protein MPH_07610 [Macrophomina phaseolina MS6]
Length = 379
Score = 125 bits (314), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/339 (32%), Positives = 158/339 (46%), Gaps = 44/339 (12%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESN-------MLKKQHARLGRALHCLSQELYSQDSH 1208
E+A ++ +R++ G+ P + L+ +LG A L+ LYS+D+
Sbjct: 12 EEAQRLIAQVRKENGGISPEHEALLKENCPDVLEALQNVRRKLGSATKTLATNLYSKDTR 71
Query: 1209 FLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
FL EL+QNA+DN Y P L FIL IV+ +NE GF++ N +A+C G STK
Sbjct: 72 FLYELIQNAEDNQYCVWTTPFLNFILYPDKIVIDSNENGFTSLNFKAICSTGESTKTNVQ 131
Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF----DTSEGQIGFVLPTLVPPFNIDMFC 1324
GY+G KGIGFKSVF+V I S F F D+ + +G V P L P+ D+
Sbjct: 132 -GYVGEKGIGFKSVFKVAKKVHIQSGVFSFSFVYTRDSDDDGLGMVTP-LSEPYE-DL-- 186
Query: 1325 RLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR 1384
P + ++ T +R E M D+ S+L +LLLFL +++ I R
Sbjct: 187 ------PANVRTRMTLTLLR-------PEDF-MQRAADL-SNLPDTLLLFLSKIRSINVR 231
Query: 1385 --NMLNDSLVVIRKKIVG----DGIIKVSCGEDKMT------WFVASQKLRAGVIRPDVK 1432
+ N V K + G + I+K D T + V + +R P K
Sbjct: 232 ICSSANTITTVHYKHVRGSDNLEKILKSIKTYDGTTSIQTKLFLVTRRVIRNLPFDPARK 291
Query: 1433 TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
T A + L + N P+L Q VFA+LPLR G K
Sbjct: 292 HTNQA-TIILAFPVDVNDTPILENQHVFAYLPLRRVGFK 329
>gi|302917474|ref|XP_003052445.1| hypothetical protein NECHADRAFT_77414 [Nectria haematococca mpVI
77-13-4]
gi|256733385|gb|EEU46732.1| hypothetical protein NECHADRAFT_77414 [Nectria haematococca mpVI
77-13-4]
Length = 1747
Score = 125 bits (313), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 139/563 (24%), Positives = 240/563 (42%), Gaps = 112/563 (19%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDN--IYPENVEPTLTFILQESGIVVLNNE 1245
++L A+ L+Q LYS + F+ EL+QNA+DN ++ E P ++F L + +V+ NE
Sbjct: 52 QSKLEPAIKALAQSLYSSTARFVFELLQNAEDNSFVHAEG-RPYVSFHLSKDRLVIECNE 110
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN--GFHVKFDTS 1303
GF+ N+ A+C +G S+K ++ GYIG KGIGFKSVF I S F+ K S
Sbjct: 111 DGFTTANLEAICSIGKSSKL-ATKGYIGEKGIGFKSVFMAAWKVHIQSGPYSFYFKHLPS 169
Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDM 1363
+ +G + P P + R +++ + L ++ I L R ++I
Sbjct: 170 DSGMGMITPVWQEP--TEELPRYMTRMTLDLHTEGDPQSI-LAQR---------HSIRQQ 217
Query: 1364 FSDLHPSLLLFLHRLQ--------------CIMFRNMLNDSLVVIRK------------- 1396
L+ ++LLF+ +L+ + ++ +DS V I K
Sbjct: 218 LRKLNGNILLFMRKLEEIRIIIDENEAETSTVFTKSETDDSKVKILKTVTQDDGDSSESS 277
Query: 1397 ----KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGP 1452
I + +S E++ T+ L+ + T E+ LA L Y P
Sbjct: 278 STLYHITKHKVYNLSKNENR-TYSEEEDLLK------EYSTAEVVLAFPLTP----EYEP 326
Query: 1453 LLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
++ Q VFAFLP++T G F++Q DF+ +SRE + + N L F+ A F
Sbjct: 327 VIESQEVFAFLPVQTAGFSFLIQSDFMTNASREHIVTTAARNIGLRDGICMAFIQAALEF 386
Query: 1513 CDLPCFREN-----PAKAASVYMSF-VPLVGEVHGFF---------SGLPRMILSKLRMS 1557
C+ + P K V+ F LV ++ SG P +++ LR
Sbjct: 387 CNHETLQYTWMQFLPDKENKVHSDFWSALVTDIEAKVRETPLIRPDSGGPLRLITSLRNP 446
Query: 1558 NCLILEGNNN----QWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLAR 1613
L ++ NN P + R + + ++L ++ L +GF +DI+
Sbjct: 447 RPLFVDKENNLLLRDLTPELSISRRYGSSSLAILENLGL-----VGFYRQDII------- 494
Query: 1614 ALGIEEYGPKILLQIISSLCRTENGLRSMGL-SWLASWLNELYTISFHSSGQSSLQSGVE 1672
I L ++ L+S ++L S++ I++ + + +S +E
Sbjct: 495 ------------RMIDQDLQSPDSWLKSRAADAFLQSFVACFLEIAYKDTHNRAARSMIE 542
Query: 1673 TDLIDNLQRIPFIPLSDGTFSSV 1695
++P IPL DG + +V
Sbjct: 543 --------KLPIIPLQDGRWLAV 557
>gi|302413305|ref|XP_003004485.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261357061|gb|EEY19489.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 1583
Score = 124 bits (312), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 108/375 (28%), Positives = 163/375 (43%), Gaps = 58/375 (15%)
Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE----NVEPTLTFI 1233
+E +ML K ++ L++ +Y D+ F+ EL+QNADDN + V P+++F
Sbjct: 39 TIEESMLAKDKL-AAHSIKTLAKNIYGSDARFVFELLQNADDNRFTRARANGVLPSISFH 97
Query: 1234 LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ IVV NE GF+ +++RA+C VG ST K +S GYIG KGIGFKSVF I S
Sbjct: 98 VHPHRIVVECNEDGFTEKDLRAICSVGEST-KSASRGYIGAKGIGFKSVFIAAWKVAIQS 156
Query: 1294 NGFHVKFDTSEGQIGF--VLP------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRL 1345
F F +G +G VLP +P M L + Q T R
Sbjct: 157 GHFSFYFKHEKGDLGLGMVLPVWENQAAALPGPLTRMTLHLHEQGEAQELEHLRQTIFR- 215
Query: 1346 PFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI-VGD--- 1401
S+L + LLFL L+ I D + K VGD
Sbjct: 216 -----------------QLSELQHTSLLFLRNLRQISVAFYDEDGGLKSSKNFRVGDERD 258
Query: 1402 -------GIIKVSCGEDKMT-----WFVASQKLRAGVIRPDVKTT------EIALALTLQ 1443
+ + G T + ++ + + + D + + E+ LA L
Sbjct: 259 HTVLLETTSTRSASGNGSQTSTENKQYHVTRHMATNLSKSDNRDSANNTEAEVVLAFPLT 318
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
++ PL+ QQ +FAFLP+R KF++ DF +SR+++ S N+ L+ A
Sbjct: 319 SDSQ----PLIEQQEIFAFLPIRESNFKFLIHSDFDTSASRQDIVSTSLRNKSLIDGIAA 374
Query: 1504 LFVSAERSFCDLPCF 1518
F+ A FC+ P
Sbjct: 375 AFIKAVLQFCEHPSL 389
>gi|46127439|ref|XP_388273.1| hypothetical protein FG08097.1 [Gibberella zeae PH-1]
Length = 2219
Score = 124 bits (311), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 97/332 (29%), Positives = 157/332 (47%), Gaps = 45/332 (13%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALC 1257
LS++ YS + FL EL+QNADDN+Y + VEP + L+ + + +NE GF+ N+ ++C
Sbjct: 124 LSEDQYSDKTRFLFELLQNADDNLYAD-VEPLVKIHLKNQYLTIQSNEVGFTKANVDSIC 182
Query: 1258 DVGNSTKKGSSA-----GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
+ S+K A IG KGIGFK+VF V I S + +FD +G +G V P
Sbjct: 183 RMTVSSKSNKDADVDKENQIGEKGIGFKAVFLVATKVWIKSGHYSFRFD-RDGPLGRVYP 241
Query: 1313 TLVPPFNIDMFCRLLSKDPVQLE-SKCWNTCIRLPFRTKFSEGIAMNN------IVDMFS 1365
V F K V +E ++ + + L + ++ +N IV++
Sbjct: 242 NWVDHF---------PKGLVGIEDTRMTSIVLELAGTSGITDNFLSSNPDSGVDIVNLLR 292
Query: 1366 DLHPSLLLFLHRLQCI---------MFRNMLNDSLVVIRKKIVGD--------GIIKVSC 1408
+L P LLFL +++ I N ++R K + + +
Sbjct: 293 NLSPDQLLFLSKVRRIDIEIDRKNGTITGRANKYNAILRAKPIPELRQTLHVMDTFNGNA 352
Query: 1409 GEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTY 1468
+T+ L + RP+ + +EI LA + S E ++ V+AFLP++ Y
Sbjct: 353 WASYVTYRHPVSNLPKEINRPNRRNSEIILAFRVPVSEENSFS----ANNVYAFLPIKDY 408
Query: 1469 GLKFILQGDFVLPSSREEVDGNSPWNQWLLSE 1500
G KF++ DF+L S+RE++ N WN LL E
Sbjct: 409 GFKFLIHSDFILCSNREDIQTNQ-WNDRLLEE 439
>gi|310791111|gb|EFQ26640.1| hypothetical protein GLRG_02460 [Glomerella graminicola M1.001]
Length = 483
Score = 124 bits (310), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 169/358 (47%), Gaps = 47/358 (13%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE----PTLTFIL-------- 1234
+H + R+L LSQ+LYS ++ F+ E++QNADDN Y ++ P+ + L
Sbjct: 33 KHGIIHRSLQILSQQLYSTETRFIHEVIQNADDNAYGKSTAHGSLPSFSLRLKSKDTSND 92
Query: 1235 -QESGIVVLNNEQGFSAENIRALCDVGNSTK---KGSSAGYIGRKGIGFKSVFRVTDAPE 1290
++ + NE GFS I AL D+G STK K G+ G KG+GFK+VFRV DA
Sbjct: 93 TEKYTLETSCNEDGFSIAQIAALTDIGASTKTKCKDVHGGFTGEKGVGFKAVFRVADAVY 152
Query: 1291 IHSNGFHVKFDTSEGQIGFVLP---TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
I S ++ K D S+ IG + P + P + R++ ++L+S+ I
Sbjct: 153 ITSGHYNFKLDNSQDMIGALRPIPCSYPAPKGTLNYTRMM----IELKSQGDFESIERDL 208
Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC-IMFRNMLNDSLVVIRKKIVGDGIIKV 1406
E + + S +H C F ND + ++ +
Sbjct: 209 EKVQPEILLFLRRLRRIS---------VHTATCNTTFTVAHNDDDEEYQGEMKTVTVQST 259
Query: 1407 SCGEDKMTWFVASQKLRAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
+ K T ++ +++ + R ++ T+EI LA L N+G+ +L Q VFAF
Sbjct: 260 TEKRRKKTRYIFVRRIVDDMPVEPRRENIVTSEIVLAFPLH--NDGSL--ILEPQHVFAF 315
Query: 1463 LPLRTYGLK-----FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDL 1515
LP+ YG + F++QGDFVL +SRE + + WN+ L P+ F +A F ++
Sbjct: 316 LPINQYGFRASPCPFLIQGDFVLTASRESL-APTAWNEKLQEAIPSAFEAAVHRFNNI 372
>gi|320587287|gb|EFW99767.1| hypothetical protein CMQ_85 [Grosmannia clavigera kw1407]
Length = 1655
Score = 123 bits (309), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 168/392 (42%), Gaps = 63/392 (16%)
Query: 1149 LPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSH 1208
LPE+K L+ +RRD +E LKK +G ++ L++ LY+ +
Sbjct: 373 LPEEK----LRLLPADLRRD----------IEEAFLKKD-LMIGSSVITLAKNLYTSKAR 417
Query: 1209 FLLELVQNADDNIY----PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
F+ EL+QNADDN Y +V+P ++F + +V+ NE GF++ N++A+C +G S+K
Sbjct: 418 FIFELLQNADDNSYTHAAASHVDPYVSFRVHSDRVVIECNEDGFNSANLKAICSIGQSSK 477
Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFC 1324
G+ GYIG KGIGFKSVF I S F F G G + M
Sbjct: 478 MGAQ-GYIGEKGIGFKSVFMAASKVHIQSGAFSFSFTHRPGDSG-----------MGMIS 525
Query: 1325 RLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN-IVDMFSDLHPSLLLFLHRLQ---- 1379
+ + +L+ + L + + + I F +L + LLF+ L+
Sbjct: 526 PVWEEPGEELQPHLTRITLFLHGGGDGEKHAQIQDAIATQFQELQETFLLFMRNLRSIRV 585
Query: 1380 ------------CIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKL----- 1422
C + ++++ G + S K + V Q
Sbjct: 586 AFFDDGGAKTESCTYLIKRPQTDMAILKRTWATHGSKETSEEISKHFYVVTHQATDLTMN 645
Query: 1423 ------RAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
A T+++ LA L + P+L +Q +F FLP+R YG FI+Q
Sbjct: 646 ENRTYSEAEKASGAYTTSQVVLAFPLSAEST----PILEEQDLFVFLPVRKYGFNFIIQA 701
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
DFV ++R+++ +S N L++ F A
Sbjct: 702 DFVTDANRQDIVTDSVRNARLMNSVAEGFCKA 733
>gi|116201367|ref|XP_001226495.1| hypothetical protein CHGG_08568 [Chaetomium globosum CBS 148.51]
gi|88177086|gb|EAQ84554.1| hypothetical protein CHGG_08568 [Chaetomium globosum CBS 148.51]
Length = 1525
Score = 123 bits (309), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/399 (26%), Positives = 182/399 (45%), Gaps = 59/399 (14%)
Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQ--HARL------GRALHCLSQELYSQDSH 1208
+A +V+SI D + + + S+ ++Q HA L G ++ L+Q LY+ ++
Sbjct: 11 EAQRLVKSIADDRRPIPDEVLDRMSDADRRQVEHAILMKDQVNGPSVLTLAQNLYTSNAR 70
Query: 1209 FLLELVQNADDNIYPE----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
F+LEL+QNA+DN Y + EP ++F + S +V+ NE GF+ EN+RA+C VG S+K
Sbjct: 71 FVLELLQNAEDNKYTKAQAFGDEPYVSFSVYPSQLVLECNENGFTEENLRAICSVGQSSK 130
Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ--IGFVLPTLVPPFNIDM 1322
G+ GYIG KGIGFKSVF + S F +F +G +G + PT F
Sbjct: 131 IGAR-GYIGEKGIGFKSVFMAAYQVHVQSGHFSFRFQHRQGDSGLGMITPTWKDAFYFSG 189
Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM 1382
++ D +Q + T R +F + LH ++LLF+ +++ I
Sbjct: 190 PFTRITLDLLQGGTPEEQTGRNQTIRQQFEQ-------------LHDAILLFMKKIKKIK 236
Query: 1383 F------RNMLNDSLVVIRKKIVGDGIIKVSC--------GEDKMTWFVASQKLRAGVIR 1428
R + I I I V + + ++ + + G+ +
Sbjct: 237 VAFYDHERGEPGEPTRTITHSIDRQSPITVCTKHTSQKGHKQTQHRYYYVKRGVATGLSK 296
Query: 1429 PDVKT-------------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQ 1475
+ + E+ LA L + P+L Q VF+FLP+R G KF++
Sbjct: 297 VEGRIYSQSEEASGAYSRGEVVLAFPLT----ADSVPILENQWVFSFLPIRQIGFKFLVH 352
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
DFV+ + R+++ + N+ L ++ F+ A C+
Sbjct: 353 ADFVMQADRQDICLLAARNRELAAKITKGFIRAVLELCE 391
>gi|258568150|ref|XP_002584819.1| predicted protein [Uncinocarpus reesii 1704]
gi|237906265|gb|EEP80666.1| predicted protein [Uncinocarpus reesii 1704]
Length = 1609
Score = 123 bits (308), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 173/391 (44%), Gaps = 75/391 (19%)
Query: 1156 EDAALIVESIRRDEFGLGPNISNMESN------MLKKQHARLGRALHCLSQELYSQDSHF 1209
E+A I+++IR G+ S+ E +L +LG A+ L+ LYS D+ F
Sbjct: 7 EEARAIIDNIREQNGGI--TQSDREQTPQSVLRVLNNLQRKLGAAIKILASNLYSTDARF 64
Query: 1210 LLELVQNADDNIYPE----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
+ EL+QNA+D Y N P + F + +++ +NE GFS +++RA+C G STK
Sbjct: 65 VFELIQNAEDASYRVIQACNELPFIKFTVCHDRVIIDSNEDGFSVQDVRAICSTGESTKT 124
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ--IGFVLPTLVPPFNIDMF 1323
+ GYIG KGIGFKSVF+V I S F F+ + G+ +G V P P ++ +
Sbjct: 125 -NIQGYIGEKGIGFKSVFKVASKVHIQSGPFSFYFEHNRGEDGLGMVTPISEEPMDLPVD 183
Query: 1324 CR----LLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
R L+ DP + +NTC++ D+ +L LFL +L+
Sbjct: 184 VRTRITLILSDPSK-----FNTCMK------------------EIQDIPETLTLFLSKLK 220
Query: 1380 CIMFR-NMLNDSLVVIRKKIVGD------GIIKVSCGEDKMTWFVASQKLRAGVIRPDVK 1432
I LN+ + D + + + + T++ +++ + + R +
Sbjct: 221 IISLSIQPLNEDPWEVSYSCAYDTSSSSMTLTRTTNNSSRQTFYRVVKRILSPLPRDPAR 280
Query: 1433 T----TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVD 1488
E+ LA + + + LK +LQ DF+ +SRE++
Sbjct: 281 KGRHRAEVVLAFPVSD---------------------HSAELKPVLQSDFITQASREDIH 319
Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDLPCFR 1519
+SPWN +L F A FCD P R
Sbjct: 320 -HSPWNDAILDGVADTFRDAVIQFCDHPSLR 349
>gi|255951655|ref|XP_002566594.1| Pc23g00270 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211904215|emb|CAP79521.1| Pc23g00270 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1376
Score = 118 bits (296), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 175/382 (45%), Gaps = 65/382 (17%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALC 1257
L+ LY++ +HFLLEL+QNADDN Y + PT+ F ++ +++ NE GFS++++ A+C
Sbjct: 40 LAGGLYTKPTHFLLELIQNADDNNYATST-PTINFHIESDILLISCNEVGFSSKDVDAIC 98
Query: 1258 DVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
+VG STK S +AGYIG KGIGFKS F++ D I S + KF S GQ+G + P +
Sbjct: 99 NVGQSTKSASRKAAGYIGEKGIGFKSGFKLADTIRISSGAYSFKFVKS-GQLGMITP-IW 156
Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFL 1375
F D L++ ++ +C R + ++ L PS+L+FL
Sbjct: 157 EDFPSDYLQAGLTQFLFEI------SCPR-----------NVQSVKTDLCSLQPSILIFL 199
Query: 1376 HRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTE 1435
RL+ I N+ +SL + I Q + A + VK
Sbjct: 200 RRLRRI---NLEGESLYKEIQSI---------------------QYVFADLAGAAVKIRG 235
Query: 1436 IALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQ 1495
I++ + E Q+ LP +F++Q DF+L ++RE+VD WN
Sbjct: 236 ISVDGIISED-------YFVQRHKVGALPGDPKRPEFLIQADFLLTANREDVDERPLWNL 288
Query: 1496 WLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG--FFSGLPRMILSK 1553
LL F A F N + S++ + G F L + ++++
Sbjct: 289 SLLDGIVQAFALAVHRF--------NKTILKYSWPSYLQCLSSSMGSSLFDSLRKKMVAR 340
Query: 1554 LRMSNCLILEGNNNQWAPPCKV 1575
L+ ILE ++ + P +V
Sbjct: 341 LKSER--ILESKSSSFRTPAEV 360
>gi|225556893|gb|EEH05180.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 1578
Score = 118 bits (296), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 114/426 (26%), Positives = 187/426 (43%), Gaps = 54/426 (12%)
Query: 1156 EDAALIVESIRRDEFGLGPNIS-NMESNMLK---KQHARLGRALHCLSQELYSQDSHFLL 1211
E A IVE IR + GL + ++ + L+ + G ++ ++++LY D+ F+
Sbjct: 80 EKAREIVEEIRINNGGLREEVQRSLHAEALRSIRNLQSLAGASIRHVAEDLYDADTRFIF 139
Query: 1212 ELVQNADDNIYP----ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
ELVQNA+D+ Y + EP + F L I V NE GF+ ++RA+C + S+K+
Sbjct: 140 ELVQNAEDSRYSHAKSQEEEPFIYFTLYPDRITVDTNEDGFTENDVRAICSIHKSSKR-Q 198
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
GYIG KGIGFKSVF+V I S F F +G+ G ++ PF
Sbjct: 199 IGGYIGHKGIGFKSVFKVAHKVHIQSGPFCFSFTHRKGESGM---GMITPF--------- 246
Query: 1328 SKDPVQLESKC---WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR 1384
S++P L + P + EG + I D L +++ FR
Sbjct: 247 SQEPEDLPPNVKTRFTLYFIHPEDFQIREG-ELKEIPDTLLLFLRRLCRIGVKIEPSGFR 305
Query: 1385 NML-------NDSLVVIRKK--IVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTE 1435
+ N + ++++ +V I V E + Q + T E
Sbjct: 306 LLFKREQTEPNGRITLVKEAGDVVSKRIYHVEGAEFENLPEHGDQSAQT--------TAE 357
Query: 1436 IALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQ 1495
+ LA + +S+ P++ Q V++FLP+R G KF++Q DF+ ++R+ V P N
Sbjct: 358 VILAFPVDDSHR----PIIEPQHVYSFLPMRQEGFKFLIQSDFITTANRQGVH-LCPRNY 412
Query: 1496 WLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLR 1555
+ +FV A +FC A ++ ++P F++ L MIL L
Sbjct: 413 AIRERISLIFVQAVYNFC-------KNATLKYEWLQYLPGPSIPDPFWATLREMILESLS 465
Query: 1556 MSNCLI 1561
S L+
Sbjct: 466 ESKILL 471
>gi|408390643|gb|EKJ70034.1| hypothetical protein FPSE_09771 [Fusarium pseudograminearum CS3096]
Length = 1687
Score = 117 bits (294), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 166/373 (44%), Gaps = 69/373 (18%)
Query: 1143 DCTTQTLPEDKECEDAALIVESIRRDEFGLG-----------PNISNMESNMLKKQHARL 1191
D T P+ + +D +V+ I ++ LG P + L ++ +
Sbjct: 4 DGTQTAEPDKRTAQD---LVKEIAKERGYLGEEQLARIGEINPELRREVEEALLRKEEMI 60
Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENV----EPTLTFILQESGIVVLNNEQG 1247
G A+ L++ +Y+ ++ F+ EL+QNADDN Y V +P ++F + + + NE G
Sbjct: 61 GSAVLTLARNIYTSNARFVFELLQNADDNDYSTAVSQGQDPYVSFKVHPDKVTIECNESG 120
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ- 1306
F+ EN++A+C +G S+K G+ AGYIG KGIGFKSVF I S+ F F +G
Sbjct: 121 FTHENLKAICAIGKSSKVGA-AGYIGEKGIGFKSVFMAAWKVHIQSHKFSFSFTHRKGDS 179
Query: 1307 -IGFVLP-------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMN 1358
+G V P T V R S+DP + E + IRL
Sbjct: 180 GLGMVTPVWEETDETTVDSSTRITLLRHTSEDPEEDERR--RDAIRL------------- 224
Query: 1359 NIVDMFSDLHPSLLLFLHRLQCIMF----RNMLNDSLVVIRKKIVGDGIIK--VSCGEDK 1412
F DL ++LLFL +L+ + ++ + + V I+ S G D+
Sbjct: 225 ----QFQDLQHTILLFLRKLRKVQVSFYDKDDVQTATTVYSLHGSNPATIRKETSDGVDE 280
Query: 1413 MTWFVASQKLRAGVIRPDVKT-----------TEIALALTLQESNEGNYGPLLYQQPVFA 1461
+ V ++ + + R + +T TE+ LA L ++ P++ Q VFA
Sbjct: 281 RQYHV-TKHVAENIPRSENRTYSDERDRADSSTEVVLAFPLTDTGT----PIVENQDVFA 335
Query: 1462 FLPLRTYGLKFIL 1474
FLP+R K L
Sbjct: 336 FLPMRPMSFKITL 348
>gi|221501417|gb|EEE27194.1| SWIM zinc finger protein domain-containing protein, putative
[Toxoplasma gondii VEG]
Length = 6038
Score = 117 bits (292), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/246 (32%), Positives = 111/246 (45%), Gaps = 83/246 (33%)
Query: 1161 IVESIRRDEFGLGPNISNMES-------------------------------------NM 1183
I+E+IRR+EFG+G +S + ++
Sbjct: 1916 IIEAIRREEFGVGLQLSGAAAGSIGDTRHSDCASEGAHLSPEEREARRKERETNALVEDV 1975
Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE------------------- 1224
L +Q RL RAL LS+ LY+ DSH +EL+QNADDN Y E
Sbjct: 1976 LLRQRQRLTRALDRLSRGLYTADSHLQMELLQNADDNNYSELQTQSKLKANGDLDAPSAP 2035
Query: 1225 -----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK----------GSSA 1269
+EP L F + G+ NNE GF+ +++RALCDV STK+ G+ A
Sbjct: 2036 GRPLTRLEPWLHFEITPQGLAAFNNENGFTEKDVRALCDVARSTKEKREGARKAEMGADA 2095
Query: 1270 GY-----------IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-IGFVLPTLVPP 1317
G IGR G+GFKSVF ++D P + S GF KF+ S+ +GFVLP + P
Sbjct: 2096 GAEDDGEKENVRKIGRFGLGFKSVFSISDRPHLFSQGFAFKFEASDPTGLGFVLPHWLEP 2155
Query: 1318 FNIDMF 1323
+
Sbjct: 2156 PEATFY 2161
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 6/111 (5%)
Query: 1453 LLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
L +PVF +LP+R++GL F+L F L +SRE + +S +N L + P F++A + F
Sbjct: 2535 LPESRPVFCYLPVRSFGLPFVLHAPFDLTASRESLVVSSFFNLQLRNALPETFLAALK-F 2593
Query: 1513 CDLPCFRENPAKAASVYMSFVP-LVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
C A ++ F+P + + FF R IL LR CL++
Sbjct: 2594 CKHSALFSLQAS----FLRFLPFFAAKENDFFGPAARDILRLLRRDPCLMV 2640
>gi|357496873|ref|XP_003618725.1| Serine/threonine protein phosphatase [Medicago truncatula]
gi|355493740|gb|AES74943.1| Serine/threonine protein phosphatase [Medicago truncatula]
Length = 1547
Score = 115 bits (288), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 175/390 (44%), Gaps = 67/390 (17%)
Query: 1216 NADDNIYPENVEPTLTFILQESGIV---------VLNNEQGFSAENIRALCDVGNSTKKG 1266
NA+DN Y V PTL FI+ + I + NNE+GFS +NI ++C VG STKKG
Sbjct: 17 NAEDNQYLVRVNPTLEFIITSTDIAATGAAATLLIFNNEKGFSRQNIESICSVGQSTKKG 76
Query: 1267 S-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPFNIDM 1322
+ S+GYIG KGIGFKSVF VT P I SNG+ + F+ + + +G+++P V +
Sbjct: 77 NRSSGYIGEKGIGFKSVFLVTAQPYIFSNGYQISFNENPCPQCSLGYIVPEWVE----EK 132
Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM 1382
RLL + + T + LP + +N++ S +HP +L FL +++
Sbjct: 133 PTRLLDIKQIYGKDSLPTTTLVLPLKAD-----KINSVKQQLSSIHPEILSFLSKIRDFS 187
Query: 1383 FR------NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQ---KLRAGV-IRPDVK 1432
R M + + V I +I + K + + T ++ + KL V R V+
Sbjct: 188 VREDNEDPKMESVTTVSISSEI--NSAAKKNLNAESYTIHLSQKFPVKLENVVERRMGVE 245
Query: 1433 TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSP 1492
+ L QE N LP Y + IL ++
Sbjct: 246 DCVVTLVFPHQERLHKNKS-----------LP-GVYAWETILL--------------DNK 279
Query: 1493 WNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILS 1552
WNQ +L P+ F+ A R+ + E P + F+P+ HG F+ + I +
Sbjct: 280 WNQGILECVPSAFMEAFRTL--VAGSYEAPVSSLIYLFKFIPIHSSRHGNFNHVREEIKA 337
Query: 1553 KLRMSNCLILEGNNNQ---WAPP--CKVLR 1577
KL + +E +NQ + P C++LR
Sbjct: 338 KLAEEKIIPIETFSNQKHFYKPREVCRLLR 367
>gi|342890662|gb|EGU89430.1| hypothetical protein FOXB_00054 [Fusarium oxysporum Fo5176]
Length = 1666
Score = 115 bits (287), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 118/488 (24%), Positives = 215/488 (44%), Gaps = 71/488 (14%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
+E+ + D + P++ + LK A + ++L + + EL+QNADDN
Sbjct: 27 IEAAKLD--AMPPDVRRVVEEALKSNSKSHADATELI-RKLAAGNIDVAFELLQNADDNQ 83
Query: 1222 YPEN----VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
Y P FI+ + +V+ +NE GF+ NI+ALCD G STK G+S GYIG+ +
Sbjct: 84 YTRAKAACANPGAQFIVTQDMVVIGSNEDGFTESNIKALCDFGKSTKAGTS-GYIGQPDV 142
Query: 1278 GFKSVFRVTDAPEIHSN--GFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLE 1335
GFKSVF + +I S F+++ + + +G V P + P + + R E
Sbjct: 143 GFKSVFTIASKVQIESGPFSFYLQHEDGDSGVGMVRPHWIEPPVLPDYVR---------E 193
Query: 1336 SKCWN------TCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND 1389
+ + T I L + + + + +N ++ D S+LLF+ L+ + + D
Sbjct: 194 HEAYGGLFGNRTRIILFLKDQNRDNL-RSNFINQLWDTPESVLLFMRNLRSLRITVLGED 252
Query: 1390 SL------VVIRKKIVGDG---IIKVSCGEDKM-TW--FVASQKLRAGVI-------RPD 1430
+ V++ + + G G K G ++ TW ++ + G+ PD
Sbjct: 253 NKVQRSKDVLLDRDLFGSGTRITTKKKLGPSQLTTWNRYLVYKSTATGLPANPNRRHAPD 312
Query: 1431 VKT----TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREE 1486
++ LA + E E P++ Q VFAF+P+ GL F++ DFV ++R++
Sbjct: 313 ASANSSNADVVLAFPVDEKGE----PIIEIQKVFAFMPVSETGLSFLIHSDFVANATRQK 368
Query: 1487 VDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE--VHGFFS 1544
+ S N + V+A + C P +M F+P + + F+
Sbjct: 369 LVTVSERNVRIKQAVENAVVTAVQDLCKKPGTEHK-------WMRFLPQINRNGIDPFWL 421
Query: 1545 GLPRMILSKLRMSNCLILEGNNNQW-----APPCKVLRG--WNDRAHSLLPDILLQKHLG 1597
G ++ +L++S +L + ++ A K L + LLPD+ + +L
Sbjct: 422 GFNNVL--RLKISIIPLLRPRSQKFGNLSKATSLKRLSADQLDQYGAPLLPDLEQEMYLS 479
Query: 1598 LGFLSKDI 1605
G+ +D+
Sbjct: 480 SGYEPQDL 487
>gi|237845199|ref|XP_002371897.1| SWIM zinc finger domain-containing protein [Toxoplasma gondii ME49]
gi|211969561|gb|EEB04757.1| SWIM zinc finger domain-containing protein [Toxoplasma gondii ME49]
gi|221480764|gb|EEE19195.1| SWIM zinc finger protein domain-containing protein, putative
[Toxoplasma gondii GT1]
Length = 6036
Score = 114 bits (285), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 81/241 (33%), Positives = 110/241 (45%), Gaps = 84/241 (34%)
Query: 1161 IVESIRRDEFGLGPNISNMES-------------------------------------NM 1183
I+E+IRR+EFG+G +S + ++
Sbjct: 1918 IIEAIRREEFGVGLQLSGAAAGSIGDTRHSDCASEGAHLSPEEREARRKERETNALVEDV 1977
Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE------------------- 1224
L +Q RL RAL LS+ LY+ DSH +EL+QNADDN Y E
Sbjct: 1978 LLRQRQRLTRALDRLSRGLYTADSHLQMELLQNADDNNYSELQTQSKLKANGDLDAPSAP 2037
Query: 1225 -----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK----------GSSA 1269
+EP L F + G+ NNE GF+ +++RALCDV STK+ + A
Sbjct: 2038 GRPLTRLEPWLHFEITPQGLAAFNNENGFTEKDVRALCDVARSTKEKREGARKAEMSAEA 2097
Query: 1270 GY-----------IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-IGFVLPT-LVP 1316
G IGR G+GFKSVF ++D P + S GF KF+ S+ +GFVLP L P
Sbjct: 2098 GAEDDGEKENVRKIGRFGLGFKSVFSISDRPHLFSQGFAFKFEASDPTGLGFVLPHWLEP 2157
Query: 1317 P 1317
P
Sbjct: 2158 P 2158
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 56/111 (50%), Gaps = 6/111 (5%)
Query: 1453 LLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
L +PVF +LP+R++GL F+L F L +SRE + +S +N L + P F++A + F
Sbjct: 2537 LPESRPVFCYLPVRSFGLPFVLHAPFDLTASRESLVVSSFFNLQLRNALPETFLAALK-F 2595
Query: 1513 CDLPCFRENPAKAASVYMSFVP-LVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
C ++ F+P + + FF R IL LR CL++
Sbjct: 2596 CKHSAL----FSLQVSFLRFLPFFAAKENDFFGPAARDILRLLRRDPCLMV 2642
>gi|358397613|gb|EHK46981.1| conserved hypothetical protein [Trichoderma atroviride IMI 206040]
Length = 1578
Score = 114 bits (284), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 137/567 (24%), Positives = 225/567 (39%), Gaps = 114/567 (20%)
Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE-PTLTFILQESGIVVLNNEQGFSA 1250
++ L++++Y + F+ EL+QNA+DN + + E P ++F + E+ IVV NE GF+
Sbjct: 52 AHSIKTLAKQIYGSGARFVFELLQNAEDNNFSKTTEDPYISFEVYENRIVVECNEDGFTE 111
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF--DTSEGQIG 1308
++ A+C VG ST K +S GYIG KGIGFKSVF I S F +F + +G
Sbjct: 112 RDLVAICAVGQST-KSTSYGYIGAKGIGFKSVFIAAWKVHIQSGNFSFEFRHKRDDPGLG 170
Query: 1309 FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV-DMFSDL 1367
V P +P +L+ + E + +I+ DL
Sbjct: 171 MVRPIWIPTNE-------------KLKGPLTRMTLYFHDEGDKDELQRLKSIIFKQLDDL 217
Query: 1368 HPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED----------KMTWFV 1417
+ LLFL +LQ I + V K+ + + K D K +
Sbjct: 218 QQTSLLFLKKLQRIKMVFYDKSNRVEKSKEFLKRQVDKYRVALDTDSAKDNETRKESQIY 277
Query: 1418 ASQKLRA-GVIRPDVK-------------TTEIALALTLQESNEGNYGPLL---YQQPVF 1460
L A G+ R D + + E+ LA L + P++ +Q +F
Sbjct: 278 HITTLTAHGLARSDNRDLPNTDDTQTISTSAEVVLAFPLT----AEFKPIVNTKKKQEIF 333
Query: 1461 AFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFC------- 1513
AFLP+R KF++ DF ++R+++ S N+ LL F++A + FC
Sbjct: 334 AFLPVRESDYKFLIHSDFDTSANRQDIITTSTRNENLLDWIAKAFLTAIKQFCEHSSLCY 393
Query: 1514 DLPCFRENPAKAASVYMS--------FVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGN 1565
+ P F N A + + V + PR+I ++ EG
Sbjct: 394 EWPLFLPNTEGALDTFWAELNIKIRRIVTTSPTLKSRHRRDPRVITDISILTKDATYEGG 453
Query: 1566 NNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKIL 1625
+ P D+ L P SK + S+ + G++ P +
Sbjct: 454 VPLFDDP--------DKDPFLSPK-----------YSKPV---RSVLKGYGLKNISPNAI 491
Query: 1626 LQIISSLCRTEN----GLRSMG------LSWLASWLNELYTISFHSSGQSSLQSGVETDL 1675
L+++ S R N G ++ G S L SW + Y+
Sbjct: 492 LEMLESDLRNPNSKMHGKKTDGAWYSAVASLLCSWFDRNYS------------------F 533
Query: 1676 IDNLQRIPFIPLSDGTFSSVDEGTIWL 1702
++ L+++P IPL DG + S G I+
Sbjct: 534 LNRLRQLPLIPLRDGKWISAALGPIYF 560
>gi|402080239|gb|EJT75384.1| hypothetical protein GGTG_05321 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1699
Score = 114 bits (284), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 159/357 (44%), Gaps = 45/357 (12%)
Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQG 1247
G + L++ +YS D+ F+ EL+QNADD+ + P++ F + IVV NE G
Sbjct: 52 GHMIKTLAKNIYSSDARFVFELLQNADDSRFQRARVRGDLPSIAFHVHPGRIVVECNEDG 111
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
F+ ++ A+C VG STK GS GYIG KGIGFKSVF I S F F +G +
Sbjct: 112 FTKRDLSAICAVGQSTKSGSH-GYIGAKGIGFKSVFSAAWKVYIQSGYFSFYFKHRKGDL 170
Query: 1308 GF--VLPTLVPPFNI-DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV-DM 1363
G +LP V + D L S PV + L + E ++NI+
Sbjct: 171 GLGMILPIWVDGDDDNDTQNELPSPSPVT------RMTLHLHDKGDPEELEHLHNIILKQ 224
Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI--VGDGIIKVSCGEDKMT------- 1414
L P+ LLFL L+ I + + ++ +GD E +T
Sbjct: 225 LDGLQPTCLLFLRNLRRIQVISYDDGGRTTKSREFRQIGDAENHRVTLESTVTDQGGEAT 284
Query: 1415 ----WFVASQKLRAGVIR--------PDVKT-----TEIALALTLQESNEGNYGPLLYQQ 1457
F ++ + + + R P+ K+ E+ LA L G P++ +Q
Sbjct: 285 TEERHFHVTRHMASNLSRSQNRELSGPEEKSRAFSEAEVVLAFPLT----GCSAPVVGRQ 340
Query: 1458 PVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
+FAFLP+ KF++ DF +SR+ + S N LL A F++A FC+
Sbjct: 341 EIFAFLPVEDSTFKFLINSDFDTTASRQRIVTTSRRNIDLLDGIAAAFLTAVLQFCE 397
>gi|408391323|gb|EKJ70703.1| hypothetical protein FPSE_09213 [Fusarium pseudograminearum CS3096]
Length = 1714
Score = 112 bits (280), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 164/371 (44%), Gaps = 50/371 (13%)
Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFIL 1234
++ N L + A + ++ L++++Y + F+ EL+QNA+DN++ N P ++F +
Sbjct: 40 IDENWLAMESA-VSHSVKILAKQIYGSGARFVFELLQNAEDNMFNKARASNALPFISFKI 98
Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
IVV NE GF+ ++ A+C VG STK +S GYIG KGIGFKSVF I S
Sbjct: 99 YLDKIVVECNEDGFTEPDLDAICAVGKSTK-SASHGYIGAKGIGFKSVFIAASRVHIQSG 157
Query: 1295 GFHVKF--DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
F +F + ++ IG V P V K + S + + +
Sbjct: 158 NFSFEFRHNKTDPGIGMVRPIWV-------------KATETIPSPMTRMTLYIHDQGDPD 204
Query: 1353 EGIAMNNIVDM-FSDLHPSLLLFLHRLQCI---------------MFRNMLNDSLVVIRK 1396
E + I+ M F DL + LLFLH+L I FR D V +
Sbjct: 205 EIRHLKRIISMQFDDLQETCLLFLHKLGQIDLEFYDDNGELERSRCFRKYDIDKYRVSLE 264
Query: 1397 KIV---GDGIIK--VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG 1451
I GD + + + +A R + K E+ LA + E +E
Sbjct: 265 TISDSHGDQTTRNQIYHITRQTATNLAPSDNRELSNTDEGKRAEVVLAFPITEDSE---- 320
Query: 1452 PLLYQ--QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAE 1509
PL+ Q Q +FAFLPLR KF++ DF ++R++V S N L F A
Sbjct: 321 PLITQRNQDLFAFLPLRRSEYKFLIHSDFDTNANRQDVITTSRRNLDLRDWIATAFYQAV 380
Query: 1510 RSFCDLP--CF 1518
FC+ P C+
Sbjct: 381 LQFCEHPQLCY 391
>gi|46110190|ref|XP_382153.1| hypothetical protein FG01977.1 [Gibberella zeae PH-1]
Length = 1612
Score = 111 bits (278), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 164/371 (44%), Gaps = 52/371 (14%)
Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFIL 1234
++ N L + A L ++ L++++Y + F+ EL+QNA+DN++ N P ++F +
Sbjct: 40 IDENWLAMESA-LSHSVKILAKQIYGSGARFVFELLQNAEDNMFNKARASNALPFISFKI 98
Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
IVV NE GF+ ++ A+C VG S K S GYIG KGIGFKSVF V I S
Sbjct: 99 YPDKIVVECNEDGFTEPDLDAICAVGKSNKLASH-GYIGAKGIGFKSVFIVASRVHIQSG 157
Query: 1295 GFHVKF--DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
F +F + ++ IG V P V P + L++ + + + IR
Sbjct: 158 NFSFEFRHNKTDPGIGMVRPIWVKP--TETIPSPLTRMTLYIHDQGDPDEIR-------- 207
Query: 1353 EGIAMNNIVDM-FSDLHPSLLLFLHRLQCIMF----RNMLNDSLVVIRKKIVGDGIIKVS 1407
+ I+ M F DL + LLFLH+L I N + RK V + +
Sbjct: 208 ---HLKRIISMQFDDLQETCLLFLHKLGQIGLEFYDENGELERSRCFRKYNVDKYRVSLE 264
Query: 1408 CGED--------KMTWFVASQKLRAGVIRP----------DVKTTEIALALTLQESNEGN 1449
D + + Q A + P + K E+ LA + E ++
Sbjct: 265 TTSDSHGNRTTRNQIYHITRQT--ATNLAPSDNRELSNTDEGKRAEVVLAFPITEDSK-- 320
Query: 1450 YGPLLYQ--QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
PL+ Q Q +FAFLPLR KF++ DF ++R+++ S N L F
Sbjct: 321 --PLVTQRNQDLFAFLPLRRSEYKFLIHSDFDTNANRQDIVTTSRRNLDLRGCIATAFYQ 378
Query: 1508 AERSFCDLPCF 1518
A FC+ P
Sbjct: 379 AVLQFCEHPTL 389
>gi|317154996|ref|YP_004123044.1| hypothetical protein Daes_3310 [Desulfovibrio aespoeensis Aspo-2]
gi|316945247|gb|ADU64298.1| hypothetical protein Daes_3310 [Desulfovibrio aespoeensis Aspo-2]
Length = 1118
Score = 111 bits (277), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 169/687 (24%), Positives = 271/687 (39%), Gaps = 134/687 (19%)
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV---EPTLTFILQESGIVVLNNE 1245
AR GR L + Y +HF+ EL+QNA+D + +TF L + + +
Sbjct: 43 ARYGRILLA---DRYDDRTHFIFELLQNAEDALGRRGTWDGTRGITFDLTSESLTLSHFG 99
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG 1305
F+ ++R +C + STK SS +GR GIGFKSV+ TD PEIHS G
Sbjct: 100 ASFNEADVRGVCGIAESTKDRSS---VGRFGIGFKSVYTFTDRPEIHS-----------G 145
Query: 1306 QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFS 1365
F + V P D +LE K T I LP + + + A++ I + F
Sbjct: 146 LEDFTVENYVQPRRAD-----------RLELKPEETRIILPLKPE--DTTALHEITEGFK 192
Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKK--IVGDGIIKVSC-GEDKMTWFVASQKL 1422
L PS LLFL + I + S + +R + + + K++ G++ V +
Sbjct: 193 RLGPSSLLFLRHINEISWNAPGGASGIYLRSQPTFLAPDVQKITVIGQESGQADVDQNWI 252
Query: 1423 RAGVIRPDV--------KTTEIALALTLQESNEGNYGPL-LYQQPVFAFLP-LRTYGLKF 1472
V R DV + E+A ++ G + + P+ F P + + L F
Sbjct: 253 ---VFRRDVFLADGAMGGSVEVAFSVEADGDEPGQWSVRPVPTSPLVVFFPTVVSTNLGF 309
Query: 1473 ILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA-----ERSFCDLPCFRENPAKAAS 1527
++QG ++ SR+ + PWNQ L+ E AL V A E+S D R P
Sbjct: 310 LVQGPYLTTPSRDNIRQGEPWNQHLVEETAALVVDAIRWFREKSMLDTSTLRCLPLDRQG 369
Query: 1528 VYMS-FVPLVGEVHGFFSGLPRMILSKL---RMSNCLILEGNNNQWAPPCKVLRGWNDRA 1583
S F P FF + + L + R + G + LR
Sbjct: 370 FKDSMFAP-------FFEAVRQAFLDEALLPRFGGGYVAAGQAK--LARTQELRELFSPG 420
Query: 1584 H-SLLPDILLQKHLGLGFLSKDIV------LSDSLARALGIEEYGPKILLQIISSLCRTE 1636
H ++L DI G +LS DI + L R L IEE P +I+S L R
Sbjct: 421 HVAVLFDI-----EGAAWLSGDITEDKAPEIRKYLMRELEIEEVTPA---KIVSRLSR-- 470
Query: 1637 NGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVD 1696
L + W+ + F S +++L+ ++T IP + L+DG+
Sbjct: 471 EFLEAQADEWI------VLLYQFLSEQETALRRYLDT--------IPLLRLADGSHIVAR 516
Query: 1697 EGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLL 1756
E + + F L + T P + +A V S LL
Sbjct: 517 E-----NGKPTAF-----------LPSGSETGFPTIRNA---------VCSTMESRNFLL 551
Query: 1757 KIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIV 1816
+G+ + D + ++ P GD +Y+ + L + + +R+ ++
Sbjct: 552 SLGITEPDLVDDIIWNVKPKYDQSEVVVGD----TEYVTDIERILAAFETDSKAQRQKLL 607
Query: 1817 SELRKKAFV-------LTNHGFKRPSE 1836
S LR+ AFV + + G+ +PSE
Sbjct: 608 SGLRETAFVKIVEVGSVRSEGYGKPSE 634
>gi|67900586|ref|XP_680549.1| hypothetical protein AN7280.2 [Aspergillus nidulans FGSC A4]
gi|40741961|gb|EAA61151.1| hypothetical protein AN7280.2 [Aspergillus nidulans FGSC A4]
gi|259483371|tpe|CBF78706.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 1882
Score = 110 bits (276), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 152/341 (44%), Gaps = 41/341 (12%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFILQESGIVVLNNEQ 1246
LG + + ++ D+ F+ E++QNA+D Y +N P+L F + IVV +NE
Sbjct: 37 LGAPVREAAARAWTSDTRFVYEMIQNAEDCSYNSAKSKNAIPSLKFYVFLDRIVVDSNED 96
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
GF ++RA+C G K+G +G KGIGFKSVF++ I S F +G+
Sbjct: 97 GFEESHVRAICSAGKRKKEGK----LGEKGIGFKSVFKIASKARIQSGPFCFSLRHRDGE 152
Query: 1307 IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSD 1366
G ++ P N +P+ +R F T FS I D
Sbjct: 153 NGL---GMIAPVN-------EIHEPLP-------DGVRTRF-TLFSRPEKCREIQDELLR 194
Query: 1367 LHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMT--WFVASQKLRA 1424
L +++ FL RL + F + + ++ + + ++ V + ++ ++ A
Sbjct: 195 LPHTIIAFLSRLHMLSFYLSRSSTSLMFSRPTIEPSLVCVETRHSTVMVHRYIPFRRKVA 254
Query: 1425 GV----IRPDVKTTEIALALTLQESNEGNYGPLLYQQP-VFAFLPLRTYGLKFILQGDFV 1479
G+ RP ++ E+ LA G L QP ++ PLR GL F++Q DFV
Sbjct: 255 GLPPRESRPLIEDVEVVLAFPQD-------GRALTTQPFAHSYRPLRGAGLNFLIQSDFV 307
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRE 1520
+ E++ PWN +L P +F +A + C +P F +
Sbjct: 308 TQDNGEDLVA-CPWNDHILDSLPGVFFAALQEICSIPEFEK 347
>gi|378732103|gb|EHY58562.1| hypothetical protein HMPREF1120_06571 [Exophiala dermatitidis
NIH/UT8656]
Length = 1671
Score = 108 bits (270), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 141/556 (25%), Positives = 241/556 (43%), Gaps = 112/556 (20%)
Query: 1186 KQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNE 1245
+ A R ++++LYS + EL+QN DD+ Y VEP++TF ++ S +++ +NE
Sbjct: 57 EHRALYARYTRTVAEQLYSSRFRLIYELLQNGDDSCYANAVEPSITFRIKPSALIIESNE 116
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG 1305
GF+ EN+ ++CD G S+ K S+A G KG+GFKSVF + + I S + +F+ +G
Sbjct: 117 LGFTLENVESICDTGKSS-KASNADTTGEKGLGFKSVFGIAEYVHIQSGLWSFRFEHRQG 175
Query: 1306 QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFS 1365
+ G ++ P D+ L S N RL R I +V F
Sbjct: 176 EEGV---GMITPVWTDVTSLLPS-----------NVGTRLTLRYYDQRDIFSTRLVSEFE 221
Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
L +++ L +L+ ++ IV DG+ G F LR+
Sbjct: 222 KLPRTIMFALRKLRKLV---------------IVVDGV----AGRSDRIAFEKDGDLRSD 262
Query: 1426 VIRPDV-----------KTTEIAL---------ALTLQESNEGN---------YG-PLLY 1455
IR + TT + L + T + S+E + YG P++
Sbjct: 263 QIRINTTVTGNFGAHTSATTWLRLFQNTVSDLPSDTHRTSSESDVTVAFEVDSYGLPVIP 322
Query: 1456 Q--QPVFAFLPL-RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
Q VFA+LP+ R L F++ +F+L ++RE V ++ WN L + FV
Sbjct: 323 ANGQHVFAYLPVERIIQLPFLINANFILTANRESVP-DTDWNNALREGVSSAFV----GL 377
Query: 1513 CDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPP 1572
D +P + +M ++P + GF+ L + + N L+ G
Sbjct: 378 VDKVVTSGDPLEFK--WMRYIPQ-RPMQGFWEYL------WVEIQNELVNRG-------- 420
Query: 1573 CKVLRGWNDRAHSL-----LPDILLQKHLGL-GFLSKDIVLSDSLARA-LGI-EEYGPKI 1624
+LR HS+ LPD + + + +DI LSD + +GI +++G
Sbjct: 421 TPILRSRRGELHSIEHMATLPDWFVHDNEPIVADTDEDIYLSDGYESSDVGILKDFG--- 477
Query: 1625 LLQIISSLCRTENGLR-SMGLSWLASWLNELYTI----SFHSSGQSSLQSGV-ETDLIDN 1678
++ + TE R +GLS A +Y I S+H++ + +Q + E + +
Sbjct: 478 ----VAPISATEILERIKLGLSQGAE--APIYHIPLDDSWHTAFAALIQRLLGERSIKKD 531
Query: 1679 LQRIPFIPLSDGTFSS 1694
++R+ IPL D + S
Sbjct: 532 VERLRIIPLDDNRWVS 547
>gi|401403116|ref|XP_003881414.1| hypothetical protein NCLIV_044430 [Neospora caninum Liverpool]
gi|325115826|emb|CBZ51381.1| hypothetical protein NCLIV_044430 [Neospora caninum Liverpool]
Length = 6000
Score = 107 bits (267), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 92/186 (49%), Gaps = 50/186 (26%)
Query: 1182 NMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP------------EN---- 1225
++L +Q RL RAL LS+ LY+ DSH +EL+QNADDN Y EN
Sbjct: 1962 DVLLRQRQRLTRALDRLSRGLYTADSHLQMELLQNADDNNYSALEALRSQVRMRENASLS 2021
Query: 1226 --------VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK------------- 1264
EP+L F + G+ NNE GF+ +++RALCDV STK
Sbjct: 2022 ADGLSLALAEPSLHFEVTPEGLAAFNNENGFTEKDVRALCDVARSTKVKREVRATQANGA 2081
Query: 1265 -----------KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-IGFVLP 1312
K IG+ G+GFKSVF ++D P + S GF KF+ S+ +GFVLP
Sbjct: 2082 GSIPEEGKDREKAGEVKKIGKFGLGFKSVFAISDRPHLFSKGFAFKFEASDPTGLGFVLP 2141
Query: 1313 T-LVPP 1317
L PP
Sbjct: 2142 HWLEPP 2147
Score = 52.0 bits (123), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 58/107 (54%), Gaps = 6/107 (5%)
Query: 1457 QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLP 1516
+PVF +LP+R++GL F+L F L +SRE + +S +N L + P F++A FC
Sbjct: 2534 RPVFCYLPIRSFGLPFVLHAPFELTASRESLVVSSFFNLQLRAALPETFLAA-LHFCK-- 2590
Query: 1517 CFRENPAKAASVYMSFVP-LVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
R + + ++ F+P + + FF+ R IL LR C+++
Sbjct: 2591 --RSDLFPLQTSFLRFLPFFAAKDNDFFAPAAREILRLLRQEPCVMV 2635
>gi|309791644|ref|ZP_07686136.1| hypothetical protein OSCT_2087 [Oscillochloris trichoides DG-6]
gi|308226266|gb|EFO80002.1| hypothetical protein OSCT_2087 [Oscillochloris trichoides DG6]
Length = 1068
Score = 106 bits (265), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 149/591 (25%), Positives = 239/591 (40%), Gaps = 112/591 (18%)
Query: 1202 LYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGN 1261
LY+ +HF+ EL+QNA+D + L F+L + + V +N + F+ ++R +C VG
Sbjct: 29 LYADRTHFVFELLQNAEDAGATQ-----LKFVLHKDRLDVWHNGRIFNEHDVRGICGVGE 83
Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNID 1321
TK G IG+ GIGFKSV+ T P I S G F I+
Sbjct: 84 GTK-AEDMGKIGKFGIGFKSVYAYTRRPSIFSGGER--------------------FAIE 122
Query: 1322 MFCRLLSKDPVQLESKCWNTCIRLPFRTK-FSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
+ R + + + L K W T I LPF + +A + I+ F L S +LFL L
Sbjct: 123 HYVRPVPLNSIAL-PKPWTTLISLPFSPEALHPNLAYHQILHRFRSLKASTILFLRHLTQ 181
Query: 1381 IMF------RNMLNDSLVVI---RKKI---VGDGIIKVSCGEDKMTWFV---------AS 1419
+ + + ML+ + R+K+ V DG E++ TW V S
Sbjct: 182 VEWQVEGGEQGMLSYDFTKLGAGRRKMALTVRDG------KEEEETWLVFERPVSLPKGS 235
Query: 1420 QKLRAGVIRPDVKT------------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT 1467
++L G +T EIA L + G V F +
Sbjct: 236 RRLPNGEYEVFGRTLALPSNHKEALKVEIAFRLDQDKRKGDQIGTAQVSHLVVFFPTHKE 295
Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAAS 1527
L F++QG + +R+ + N PWN++L+SE L V A L +
Sbjct: 296 TELGFLMQGLYTTTPARDNIRINDPWNEYLISETAKLLVDALEELKSLGLL-------SV 348
Query: 1528 VYMSFVPLVGEVHGF---FSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAH 1584
++ +PL G F + + + LR L + A K+ RG
Sbjct: 349 QAINTLPLRPSKFGHDNPFRPIFDAVRTALRDRPLLPTHDGSFTAARDAKLARGAG--LI 406
Query: 1585 SLLPDILLQKHLG----LGFLSKDIVLSDS--LARAL-GIEEYG--PKILLQIISSL-CR 1634
LLP LL K G + +LS +I L + L R L G E P+ L ++ + R
Sbjct: 407 ELLPPTLLGKLFGSSKPIQWLSSEISLDRTPDLYRYLVGRFERAEDPERLGALVERIEIR 466
Query: 1635 TENGLRSMGLSWLAS----WLNELYTISFHSSGQSSLQSGVETDLIDNLQ---------- 1680
E L + +WL + W+ + Y++ F+ G++ ++ + T I L
Sbjct: 467 PETVLAQLTPAWLKTQPDAWMIQFYSL-FNGIGEADIRKLLRTKEIIRLSNGEHVAAYGA 525
Query: 1681 ------RIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKL 1725
+P ++D +F +V + TI H D F GL + PN+ ++
Sbjct: 526 GSTPRVHLPSSVMADRSFPTV-KATIAAHPDALEFLKSLGL-SVPNVATEV 574
>gi|134080334|emb|CAK46256.1| unnamed protein product [Aspergillus niger]
Length = 1917
Score = 105 bits (263), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 133/293 (45%), Gaps = 41/293 (13%)
Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENI 1253
AL +++YS +HFLLEL+QNADDN Y N+ P L + + NE GF+ +
Sbjct: 39 ALQFFGEDIYSTAAHFLLELIQNADDNEYAANITPALVLTYRNRVLRTDCNELGFTPGQV 98
Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPT 1313
A+C++G STKK G KG+GFKSVF V + I S + KFD S Q+G ++P
Sbjct: 99 EAICNLGYSTKKDKVEA-TGEKGVGFKSVFSVAASVWIASGPYTFKFDAS-NQMGMLIPI 156
Query: 1314 LVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLL 1373
P + + NT + F + E I +++ + +L
Sbjct: 157 WESPPDATRYA---------------NTSMYARFAPRADERIVFHDL----DVVGAKILT 197
Query: 1374 FLHRLQCIMFRNMLNDS----LVVIRKKIVG-DGIIKVS---CGE-----DKMTWFVASQ 1420
FL +L+ + L V R+ DG+ GE + T+ V
Sbjct: 198 FLRKLRELNLHIELQSGDRICYNVARQDTTSPDGLCMTQLTFTGEMFPVAQRKTYVVCRY 257
Query: 1421 KLR---AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGL 1470
+++ RP +TTE+ +A + + + P+L Q V+A LP+R+YG
Sbjct: 258 EVKEMPQERKRPGRQTTELRIAFPVDDFGQ----PVLESQDVYAVLPVRSYGF 306
>gi|323455043|gb|EGB10912.1| hypothetical protein AURANDRAFT_62372 [Aureococcus anophagefferens]
Length = 957
Score = 103 bits (257), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 121/471 (25%), Positives = 188/471 (39%), Gaps = 48/471 (10%)
Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
T I LP +F + + +++ P LLLFL+R++ I + D R
Sbjct: 226 GTTIVLPLTDEFRGDAKRAELRERLAEVDPLLLLFLNRVRRIDVVDRGADGETTARSMRR 285
Query: 1400 GDG--------IIKVSCGEDKM-----TWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
D ++ ++ D TW V ++ P T LA + S
Sbjct: 286 ADAPGGAANASVVTLADAVDGAEPAASTWLVVRREAAVVPWAPRGDATRTVLAAAFEASE 345
Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
G Q PVFAFLP+ LKF+L D++L SSRE V +SPWNQWL E L+V
Sbjct: 346 APRDGDA--QLPVFAFLPIAPANLKFVLHADWILASSRETVREDSPWNQWLRDEAAGLYV 403
Query: 1507 SAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNN 1566
A + + + +P+ EV GFF + I ++R C + +
Sbjct: 404 DALAAVRAG--AGAGAGFDVARVLEALPVEREVVGFFRPVAAEIARRVRGVECCL--AAD 459
Query: 1567 NQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSL----ARALGIEEYGP 1622
WA P + L +R P +L + LG D D L ALG+E
Sbjct: 460 GTWALPSRAL-AVPERLRREAPGLLARAPALLGRSCLDQAFQDRLRPETRAALGVEALDA 518
Query: 1623 KILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRI 1682
L++ ++L ++ + L W A LY + + G S+ D L+ +
Sbjct: 519 SHALRLAAALADADDLGAPVFLEWAAL----LYRAAAPAGGVSAADR-------DRLRSL 567
Query: 1683 PFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSS 1742
+PL+ G + I++ +G LE F + A L V PA+L+ S
Sbjct: 568 AVVPLAGGGRGAPGAAPIYVDRG----EGAEPLEGFAAVRADLAVVDPAVLAPSLARPM- 622
Query: 1743 LGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADY 1793
V+SV L GV+ +A + H+LPA++ K L A +
Sbjct: 623 --VVSV------LRSAGVKDATAEAVCLEHVLPALAAADAGEDRKRLNARW 665
>gi|302891159|ref|XP_003044462.1| hypothetical protein NECHADRAFT_83213 [Nectria haematococca mpVI
77-13-4]
gi|256725385|gb|EEU38749.1| hypothetical protein NECHADRAFT_83213 [Nectria haematococca mpVI
77-13-4]
Length = 2106
Score = 103 bits (257), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 123/275 (44%), Gaps = 34/275 (12%)
Query: 1182 NMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVV 1241
++ K Q RL L C ELY +HF+ EL+QNADDN Y + PT+ F+ + + V
Sbjct: 5 SLAKAQIERLHEDLSC---ELYDTSTHFIQELLQNADDNKY-DCETPTMRFVYKPGSLRV 60
Query: 1242 LNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
NE+GF A N+ A+C + STK G GY G KGIGFKSVFR D I S + KFD
Sbjct: 61 DCNERGFDASNVEAICTIRQSTKAGRR-GYTGEKGIGFKSVFRAADVVWISSGAYSFKFD 119
Query: 1302 TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
+ G V P L ++ P ++ T +F E +V
Sbjct: 120 KHR-RFGLVAP-------------LWAEFPEPIQEGY--TSFFFQLSPQFDE----KELV 159
Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV----VIRKKIVGDG--IIKVSCGEDKMTW 1415
M + + +LFL RL+ + +D V + R DG + + G ++ +
Sbjct: 160 QMLLNFEATHILFLRRLRHVELEVTSSDGKVWTKSIHRTDTTEDGNAVSYLESGNIRLKY 219
Query: 1416 FVASQK---LRAGVIRPDVKTTEIALALTLQESNE 1447
+ + L RP +++ LA + +E
Sbjct: 220 LMYKHQVTNLPEETKRPGCTDSDLVLAFPFLDRDE 254
>gi|159486654|ref|XP_001701353.1| hypothetical protein CHLREDRAFT_153826 [Chlamydomonas reinhardtii]
gi|158271748|gb|EDO97561.1| predicted protein [Chlamydomonas reinhardtii]
Length = 2248
Score = 102 bits (255), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 101/209 (48%), Gaps = 50/209 (23%)
Query: 1406 VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL------QES-------------- 1445
V C +T + Q RAGV+ V T ++LA L +ES
Sbjct: 3 VCCRWAVVTSALQPQTSRAGVV---VAQTRVSLAFCLGPPVAAEESAAVRSGGPPPPPLA 59
Query: 1446 ----NEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEF 1501
EG P QQPVFAFLPLR YGL+F++Q D+VLPSSRE +D +SPWNQ L +
Sbjct: 60 APHWREGGRPP---QQPVFAFLPLRGYGLRFVVQADWVLPSSREALDADSPWNQELRQQL 116
Query: 1502 PALFVSAERSFCDLPCFRENPAKAASV------------------YMSFVPLVGEVHGFF 1543
P LF+ A F LP + + A+S ++ VPL GE GFF
Sbjct: 117 PGLFLRALDVFRSLPPPATSTSTASSTSTASSTPTAASCLFWTDQWLRCVPLEGEAQGFF 176
Query: 1544 SGLPRMILSKLRMSNCLILEGNNNQWAPP 1572
+GLP I + LR + C+ + G WA P
Sbjct: 177 AGLPHSIAALLRAAPCIPVAGGG--WATP 203
>gi|121609568|ref|YP_997375.1| hypothetical protein Veis_2613 [Verminephrobacter eiseniae EF01-2]
gi|121554208|gb|ABM58357.1| conserved hypothetical protein [Verminephrobacter eiseniae EF01-2]
Length = 412
Score = 102 bits (253), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 154/382 (40%), Gaps = 48/382 (12%)
Query: 1172 LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI--YPENVEP- 1228
+ N + ++ +GR L + Y +HF+ EL+QNA+D + ++V P
Sbjct: 1 MASNYEAISDENRRRYGTDIGRIGPMLLADRYDDRTHFIFELLQNAEDALGRRGDSVGPR 60
Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
+TF L + + + + F ++R +C + STK S IGR GIGFKSV+ TD
Sbjct: 61 KVTFELTSTRLTRSHFGRPFDETDVRGVCGIAESTKDKFS---IGRFGIGFKSVYTFTDR 117
Query: 1289 PEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR 1348
PEIHS G F + V P D + + K T I LP
Sbjct: 118 PEIHS-----------GDEDFAIANYVQPQRAD-----------RTDRKAGETQIVLPL- 154
Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI-----------MFRNMLNDSLVVIRK- 1396
K + A I F L P LLFL ++ I N+ + +R
Sbjct: 155 -KADDDTAQQEISAGFRHLGPGALLFLRHIEEINWSVDGGASGLYLHNVTKEFGENVRLI 213
Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG-PLLY 1455
++G KV E W V +++ + V EIA +L + G + +
Sbjct: 214 TVIGQETGKV---EVDQNWLVFHREVFTEA-KEKVGRVEIAFSLVAIKDAPGRWSVQPVA 269
Query: 1456 QQPVFAFLP-LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
P+ F P + L F++QG + SR+ + PWNQ L+ E +L V A R D
Sbjct: 270 ASPLVVFFPTVVPTNLGFLVQGPYRTTPSRDNIPRGEPWNQHLVKETSSLLVEAMRWMRD 329
Query: 1515 LPCFRENPAKAASVYMSFVPLV 1536
C+ P A P+
Sbjct: 330 KGCWTRPPCAACRSTARSSPMA 351
>gi|342883116|gb|EGU83674.1| hypothetical protein FOXB_05809 [Fusarium oxysporum Fo5176]
Length = 1626
Score = 102 bits (253), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 95/325 (29%), Positives = 149/325 (45%), Gaps = 50/325 (15%)
Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFIL 1234
++ N K ++A ++ L++ +Y + F+ EL+QNA+DN + +N P ++F +
Sbjct: 40 IDENWFKMENA-ASHSIKVLARNIYGSGARFVFELLQNAEDNSFRKADGKNDPPFISFQI 98
Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
IVV NE GF++ +++A+C VG STK + GY+G KGIGFKSVF I S
Sbjct: 99 HPKHIVVECNEDGFTSLDLKAICSVGESTKT-AKHGYVGAKGIGFKSVFIAASRVHIQSG 157
Query: 1295 GFHVKF--DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
F +F + ++ +G V P V P + F L++ + L + I
Sbjct: 158 NFSFEFRHNRNDPGLGMVRPIWVRP--TETFPSPLTRTTLYLHDQGDEDEIE-----HLK 210
Query: 1353 EGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF-----RNMLNDSLVVIRKKIVGDGI---I 1404
IAM F DL + LLFL +L I L S +KK+ +
Sbjct: 211 TVIAMQ-----FDDLQETCLLFLRKLSKISVAFYDEEGNLQRSKQFTKKKVDDYRVSLET 265
Query: 1405 KVSCGEDKMT---WFVASQKLRAGV-------------IRPDVKTTEIALALTLQESNEG 1448
V G++ T + +++L G+ R + + E+ LA LQ
Sbjct: 266 TVVSGDESATTSQMYHITKQLATGLAQSESRDAATTEEARKSLTSAEVVLAFPLQ----S 321
Query: 1449 NYGPLL--YQQPVFAFLPLRTYGLK 1471
NY P + +Q +FAFLPLRT K
Sbjct: 322 NYQPQISTRRQELFAFLPLRTSDYK 346
>gi|66359780|ref|XP_627068.1| sacsin like HSP90 chaperone domain, likely plant origin
[Cryptosporidium parvum Iowa II]
gi|46228502|gb|EAK89372.1| sacsin like HSP90 chaperone domain, likely plant origin
[Cryptosporidium parvum Iowa II]
Length = 2120
Score = 102 bits (253), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/139 (39%), Positives = 85/139 (61%), Gaps = 8/139 (5%)
Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIV 1240
K Q R R++ LS ELYS LLEL+QNADD+ + N P + +L+ +V
Sbjct: 22 KSQRDRQDRSIGILSNELYSSFGRVLLELLQNADDSNFNFNEIGEGIPQVYILLRWDCLV 81
Query: 1241 VLNNEQGFSAENIRALCDVGNSTKKGSSAGYI-GRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
NE+G + ENI ++CD+GNS+KK S + Y G KGIG++S+F++TD P I S G+ +K
Sbjct: 82 FHINEKGMTRENIESVCDIGNSSKKNSESRYFTGEKGIGYRSIFKITDNPCIFSGGYSIK 141
Query: 1300 FDTS---EGQIGFVLPTLV 1315
F + E ++ +++P +
Sbjct: 142 FSSKPDEEIELSYIVPKWI 160
>gi|154291063|ref|XP_001546118.1| hypothetical protein BC1G_15419 [Botryotinia fuckeliana B05.10]
Length = 1440
Score = 101 bits (252), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 157/343 (45%), Gaps = 44/343 (12%)
Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQE--------LYS 1204
+E +DA ++ +R G I+ E L + ++ AL L + LY
Sbjct: 7 EELQDARAFIDDLRSANGG----ITKEEEEELDRLLPKVKEALENLQNKVGISTKMFLYQ 62
Query: 1205 QDSHFLLELVQNADDNIYPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
+ + F+ E++QNA+DN Y + + P L+F L +++ +NE GF+ ENI+A+C +G ST
Sbjct: 63 KSTRFIYEIIQNAEDNEYKGSKKLPFLSFTLYSDRLIIDSNEDGFTKENIKAICSIGEST 122
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-----IGFVLPTLVPPF 1318
K S GYIG KGIGFKSVF V I S + F+ S+ +G V P P
Sbjct: 123 -KASIQGYIGEKGIGFKSVFTVAHKVHIQSGCYSFAFEYSKNDDGKSGLGMVTPMNETPH 181
Query: 1319 NI--DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLL--- 1373
+ D+ R++ L C L ++ + + + L + L
Sbjct: 182 ELPGDVQTRMI------LYLHAGEDCGAL-YQELLNLPDTLLLFLKKLRQLTVNFELPDR 234
Query: 1374 FLHRLQCIMFR---NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQ--KLRAGVIR 1428
+H++Q + R + L++++V + I G V+ K W L A R
Sbjct: 235 NVHQVQFSLTRPNNDRLSENIVQTKTAITG----AVNSVTHKSFWVKKRPVFNLPADSAR 290
Query: 1429 PDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
++ E+ LA + E ++ P++ +Q FAFLP++ G K
Sbjct: 291 GKIREAEVILAFPIDE----DHVPVIEEQYAFAFLPMKKTGYK 329
>gi|209876910|ref|XP_002139897.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555503|gb|EEA05548.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 2423
Score = 99.8 bits (247), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/131 (39%), Positives = 83/131 (63%), Gaps = 8/131 (6%)
Query: 1190 RLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNE 1245
RL R++ L+ ELYS LLEL+QNADD+ + E + PT+ +L+ ++V NE
Sbjct: 25 RLERSISILTTELYSSLGRALLELLQNADDSNFSECYDKDSIPTVYILLRWDCLIVHINE 84
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAG-YIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
G + ENI ++CD+GNS+KK SS Y G KGIG++SVF++++ P I+S + + F
Sbjct: 85 LGMTPENIESICDIGNSSKKNSSVRFYTGEKGIGYRSVFQISNNPSIYSGYYSISFSNEP 144
Query: 1305 GQ---IGFVLP 1312
+ +G+++P
Sbjct: 145 DKDVGLGYIVP 155
>gi|154500304|ref|ZP_02038342.1| hypothetical protein BACCAP_03971 [Bacteroides capillosus ATCC 29799]
gi|150271036|gb|EDM98310.1| hypothetical protein BACCAP_03971 [Pseudoflavonifractor capillosus
ATCC 29799]
Length = 1069
Score = 99.0 bits (245), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/342 (26%), Positives = 153/342 (44%), Gaps = 45/342 (13%)
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNNEQGFSAE 1251
R+ L LYS+++HF+ E +QNA+D E + LTFI Q+ ++VL++ + F+
Sbjct: 26 RSFEKLLTGLYSENNHFIYEFLQNAED------AEASFLTFICQKKQLLVLHDGRPFNEA 79
Query: 1252 NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL 1311
++ +C V TK+ A IG G+GFKSVF+ T+ PEI+SN + +L
Sbjct: 80 DVWGICSVMEGTKRREDAQKIGHFGVGFKSVFKYTERPEIYSNEETFAIEN------ILL 133
Query: 1312 PTLVPPFNIDMFCRL----LSKDPVQLESKCWNTCIRLPFRTK---FSEGIAMNNIVDMF 1364
P +P + C + P + + C T LPFR + GI ++I
Sbjct: 134 PVAIPRGDFPTDCTYQLDGQTVHPFKEKENC--TKFVLPFRDREELSRSGIDPDDIPRKL 191
Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIR----KKIVGDGIIK---VSCGEDKM---T 1414
+L P +LLFL ++ + + + + + K K VS GE+++
Sbjct: 192 QELEPEILLFLRHVRTLTWVDETTGAYGTYQNIPSKTDTNTHTCKKRCVSAGEEELEAQK 251
Query: 1415 WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
+ V + G ++ A+ ++ + V+ F P GL+F+
Sbjct: 252 YLVFEDRFDCGEMKNACVKAAFAMDRAIKPVKDTR---------VWVFFPTTDESGLRFL 302
Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDL 1515
L G + P SRE++ +S +N+ L L V RS DL
Sbjct: 303 LHGTYQTPISREKIISDSSFNKMLHERAEELVV---RSMEDL 341
>gi|325859904|ref|ZP_08173033.1| conserved domain protein [Prevotella denticola CRIS 18C-A]
gi|325482586|gb|EGC85590.1| conserved domain protein [Prevotella denticola CRIS 18C-A]
Length = 648
Score = 99.0 bits (245), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 163/371 (43%), Gaps = 78/371 (21%)
Query: 1194 ALHCLSQELYSQ-DSHFLLELVQNADDNIYPENVEPTLTFILQ--ESGIVVLNNEQGFSA 1250
ALH + +LYS D F+ EL+QNADD PE+ +P ++ ILQ + ++ ++N + F
Sbjct: 160 ALHNIMDDLYSNADDRFVYELLQNADDQ--PEDGQP-VSVILQLLKEHLLFMHNGRVFDD 216
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
++ ++C +G+STK+ IG KGIGFKSVF +D I+S + FD G
Sbjct: 217 NDVDSICSIGDSTKRKDKEK-IGYKGIGFKSVFTGSDTVIINSGNYSFAFDKYSPVYG-- 273
Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSE-------GIAMN----- 1358
++DM + P QL+ W R P + +E GI++
Sbjct: 274 --------DLDM-----NNIPWQLKP-IWQEKYRYPKEVRENEIFWKKRVGISLEIEEKD 319
Query: 1359 ------NIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDK 1412
+I +FS HP LLFL + + F + V I K VGD + G
Sbjct: 320 LAEYRMSIAKIFS--HPIFLLFLKNVTNLEFDE--GELHVRISKSNVGDILRIEKDGVVD 375
Query: 1413 MTWFVAS--------------------QKLRAGVIRPDVKTTEIALALTLQESNEGNYGP 1452
+W V +KL+ G + T+I+ A + +
Sbjct: 376 SSWIVKDYPITIPQEVRDALQDDRNVPEKLKKGTM------TQISFAAKVDDGKVVK--- 426
Query: 1453 LLYQQPVFAFLP--LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAER 1510
+ ++A+LP + +G FI+ DF+L ++RE++ WNQ+L E L V
Sbjct: 427 -MDNSVLYAYLPTSVNDFGFNFIVNADFLLAANREQLHVKKRWNQFLFGEIGKLLVDWVA 485
Query: 1511 SFCD-LPCFRE 1520
S +P + E
Sbjct: 486 SLAKVIPSYLE 496
>gi|390441275|ref|ZP_10229407.1| hypothetical protein MICAI_370004 [Microcystis sp. T1-4]
gi|389835403|emb|CCI33533.1| hypothetical protein MICAI_370004 [Microcystis sp. T1-4]
Length = 1545
Score = 98.2 bits (243), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 123/505 (24%), Positives = 220/505 (43%), Gaps = 68/505 (13%)
Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI 1239
E+ ++ L AL L+ +LY++ F+ EL+QNADD I E + + F++
Sbjct: 14 ETGQSERDAENLANALTRLTGDLYTETERFIFELLQNADD-IPNETGQVNVLFVILREHF 72
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
++L+N + F + N+ A+ + STK +S G KGIGFKSVF ++ I+S F K
Sbjct: 73 LILHNGKPFDSTNVDAISSISKSTKVNNSEQ-TGYKGIGFKSVFADSECVYINSGNFSFK 131
Query: 1300 FDTSEGQIGFV--LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA- 1356
FD ++ + + P + P ID K +E + + P T G A
Sbjct: 132 FDRNDKRHKNLEKTPWQIKPIWID-------KKEYPVEIQQYQQFFTFPVATAIYVGQAK 184
Query: 1357 ----MNNIVDMFSDLHPSLLLFLHRLQCI--MFRNMLNDSLVVIRKKIVGDGIIKVSCGE 1410
+ +F D P L+LFL +Q I + N +N V I K+ G G K+ C
Sbjct: 185 IKEYKTKLEKLFQD--PRLILFLRHVQNINVIGLNNINIVDVSITKEKQG-GKYKI-CRN 240
Query: 1411 DKMT-WFV----------ASQKLRAGVIRPD----VKTTEIALALTLQES-----NEGNY 1450
+++T W V K+ + PD ++ ++I A L ++ NE +
Sbjct: 241 NQITYWLVQDFDFNVPKQVKDKMESDKKIPDKLKLIERSKITFAAQLIDNKLVAVNEED- 299
Query: 1451 GPLLYQQPVFAFLP--LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
+F +LP ++ Y L F++ DF+ ++R+E+ ++PWN +L + ++
Sbjct: 300 ------SVLFTYLPTNVKEYKLPFLVNADFLTTANRQEIHHDNPWNIFLFANIAYYSLTW 353
Query: 1509 ERSFCDLPCFREN-----PAK------AASVYMSFVPLVGEVHGFFSGLPRMILSKLRMS 1557
+R P+K + ++SF + + +P +KL
Sbjct: 354 IAEVAKKTEYRNQVTNILPSKFSLNLAKSETFISFNQEFDQALELIAFIPSEQDNKLLTV 413
Query: 1558 NCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSK--DIVLSDSLARAL 1615
N IL+ P + +R D + + LL K F K D+ S +++
Sbjct: 414 NNSILDETEITKIIPVETVRTSFDSKRYFISNSLLNKTKLKNFGVKIFDVKSLGSFLKSI 473
Query: 1616 ---GIEEYGPKILLQIISSLCRTEN 1637
I + P+++ Q+++ L +T+N
Sbjct: 474 HYQNIVKSNPQLIFQVLAHL-KTKN 497
>gi|440754017|ref|ZP_20933219.1| putative transcriptional regulator domain protein [Microcystis
aeruginosa TAIHU98]
gi|440174223|gb|ELP53592.1| putative transcriptional regulator domain protein [Microcystis
aeruginosa TAIHU98]
Length = 1545
Score = 98.2 bits (243), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 127/506 (25%), Positives = 227/506 (44%), Gaps = 70/506 (13%)
Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI 1239
E+ ++ L AL L+ +LY++ F+ EL+QNADD I E + + F++
Sbjct: 14 ETGQSERDAENLANALTRLTGDLYTETERFIFELLQNADD-IPNETGQVNVLFVILREHF 72
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
++L+N + F + N+ A+ + STK +S G KGIGFKSVF ++ I+S F K
Sbjct: 73 LILHNGKPFDSTNVDAISSISKSTKVNNSEQ-TGYKGIGFKSVFADSECVYINSGNFSFK 131
Query: 1300 FDTSEGQIGFV--LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA- 1356
FD ++ + + P + P ID + PV+++ + P T G A
Sbjct: 132 FDRNDKRHKNLEKTPWQIKPIWIDK-----KEYPVEIQQ--YQQFFTFPVATAIYVGQAK 184
Query: 1357 ----MNNIVDMFSDLHPSLLLFLHRLQCI--MFRNMLNDSLVVIRKKIVGDGIIKVSCGE 1410
+ +F D P L+LFL +Q I + N +N V I K+ G G K+ C
Sbjct: 185 IKEYKTKLEKLFQD--PRLILFLRHVQNINVIGLNNINIVDVSITKEKQG-GKYKI-CRN 240
Query: 1411 DKMT-WFV----------ASQKLRAGVIRPD----VKTTEIALALTLQES-----NEGNY 1450
+++T W V K+ + PD ++ ++I A L ++ NE +
Sbjct: 241 NQITYWLVQDFDFNVPKQVKDKMESDKKIPDKLKLIERSKITFAAQLIDNKLVAVNEED- 299
Query: 1451 GPLLYQQPVFAFLP--LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALF--- 1505
+F +LP ++ Y L F++ DF+ ++R+E+ ++PWN +L +
Sbjct: 300 ------SVLFTYLPTNVKEYKLPFLVNADFLTTANRQEIHHDNPWNIFLFANIAYYSLTW 353
Query: 1506 ---VSAERSFCD-----LPC-FRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRM 1556
V+ + + + LP F N AK+ + ++SF + + +P +KL
Sbjct: 354 IAEVAKKTEYRNQVTNILPSKFSLNLAKSET-FISFNQEFDQALELIAFIPSEQDNKLLT 412
Query: 1557 SNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSK--DIVLSDSLARA 1614
N IL+ P + +R D + + LL K F K D+ S ++
Sbjct: 413 VNNSILDETEITKIIPVETVRTSFDSKRYFISNSLLNKTKLKNFGVKIFDVKSLGSFLKS 472
Query: 1615 L---GIEEYGPKILLQIISSLCRTEN 1637
+ I + P+++ Q+++ L +T+N
Sbjct: 473 IHYQNIVKSNPQLIFQVLAHL-KTKN 497
>gi|171679363|ref|XP_001904628.1| hypothetical protein [Podospora anserina S mat+]
gi|170939307|emb|CAP64535.1| unnamed protein product [Podospora anserina S mat+]
Length = 1843
Score = 95.9 bits (237), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 155/356 (43%), Gaps = 66/356 (18%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTL--TFILQESGIVVLNNEQGFSAENIRA 1255
LS+ L + S F+ EL+QNADD Y P L T+ ++ ++V +NE GF +N+RA
Sbjct: 1 LSENLTTTTSRFIYELLQNADDLDY-GGASPQLHFTYDAEKRHLLVESNEIGFKKKNVRA 59
Query: 1256 LCDVGNSTKKGSSAGY------IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
+C S+ K + GY IG KG+GFKSVF+V + I S + +FD +G++G
Sbjct: 60 ICSAAESS-KNTQRGYGIERRLIGEKGLGFKSVFKVASSVWIRSGHYSFRFD-RDGELGR 117
Query: 1310 VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT-----KFSEGIAMNNIVDMF 1364
+ P F + + L+ T I L + + M+ ++
Sbjct: 118 IRPIWDDDFPAEEYVAGLT------------TAILLKIASPTHDRQTGTSDPMDLVLQGL 165
Query: 1365 SDLHPSLLLFLHRLQCI---------MFRNMLNDSLVVIRKKIV---------GDGIIKV 1406
+L P LLFL + + + R+ + S + V + ++ V
Sbjct: 166 ENLKPGHLLFLSKAKEVKVGKVVGSGALRSFIRGSTATAKPTAVFKADHIPPHRNTLLPV 225
Query: 1407 SC-GEDKMTWFVASQKL---------RAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQ 1456
S D + VA + L R G + +V T L + L E E
Sbjct: 226 SLYHNDVINQQVAFRHLVTDLPDTPERPGCLESEV-TLVFPLKIPLDEPPES-------- 276
Query: 1457 QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
Q V+AFLP+R YG K + GDF+L SRE + N WN+ L+ V+A F
Sbjct: 277 QMVYAFLPIRDYGFKLSIHGDFLLVPSRESIIDNV-WNRALVEGVCRALVNAIPGF 331
>gi|288553765|ref|YP_003425700.1| putative transcriptional regulator [Bacillus pseudofirmus OF4]
gi|288544925|gb|ADC48808.1| putative transcriptional regulator [Bacillus pseudofirmus OF4]
Length = 1676
Score = 94.7 bits (234), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 160/370 (43%), Gaps = 62/370 (16%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESGIVVLNNE 1245
Q +L+ LS +LY+ + F+ EL+QNADD++ N VE + I QE ++ ++
Sbjct: 22 QAVNQASSLNALSGDLYTDANRFIYELLQNADDSLEKGNPVEVWVKIIGQE--LIFGHSG 79
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---- 1301
+ F ++ +C+V N TKK + G KGIGFKSVF +D +I+SNG + +FD
Sbjct: 80 KAFDKRDLMGICNVNNGTKKNDISK-TGYKGIGFKSVFGQSDKVKIYSNGEYFRFDRDYD 138
Query: 1302 ------------TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT 1349
+E F P + P +D V L IR ++
Sbjct: 139 FRWQWEYSREEWETESGRKFQYPWQIIPI-------YTKEDEVSL-------SIREFIKS 184
Query: 1350 K---FSEGIAMNNIVDM-----FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGD 1401
K + I ++N+ D+ + + ++ LFL + I F + S+++ R +
Sbjct: 185 KDINVATIITLHNLADVEEAINYLTKNTNMFLFLKNISKINFELPQSSSILIDRS---SE 241
Query: 1402 GIIKVSCGEDKMTWFVASQKLRA-----GVIRPD-------VKTTEIALALTLQESNEGN 1449
IIK+ W + S L+ ++ D TEI + L + +
Sbjct: 242 NIIKLENKHSNSEWIIKSTTLKVPESMRAILNEDRNIPEKLAAATEIEITLAAKIIDNEI 301
Query: 1450 YGPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
+ ++++LP Y L ++ F+ ++RE + +S WNQW+ + P V
Sbjct: 302 VKLSKEENLLYSYLPTGESKYELPVLVNTSFLTAANRESLHSDSKWNQWIFNSLPVEIV- 360
Query: 1508 AERSFCDLPC 1517
R +L C
Sbjct: 361 --RWIAELVC 368
>gi|300727815|ref|ZP_07061196.1| hypothetical protein PBR_0692 [Prevotella bryantii B14]
gi|299774914|gb|EFI71525.1| hypothetical protein PBR_0692 [Prevotella bryantii B14]
Length = 1264
Score = 94.4 bits (233), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 161/378 (42%), Gaps = 66/378 (17%)
Query: 1177 SNMESNMLKK--------QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEP 1228
SN SN+ KK Q +L LS +Y+++ F+ EL+QNA D E+
Sbjct: 3 SNFISNLQKKRCNYNHPDQATSQANSLILLSSGIYTEEERFVFELLQNAVDAHNEESEIL 62
Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
+ I+++ + L+N + F+ +I LCDVGN K IG KGIGFKSVF +
Sbjct: 63 DVKMIIKDGYFIFLHNGEAFTERDIEGLCDVGNGNKM-KDVKKIGYKGIGFKSVFMCSTN 121
Query: 1289 PEIHSNGFHVKFDTS-----------EGQIG-------FVLPTLVPPFNIDMFCRLLSKD 1330
+ S G+ KFD S + G +++P + P + +
Sbjct: 122 VTVQSGGYCFKFDKSHWDNYWNEHWNDNDFGTRDRDKKYLMPWQIIP--------IETTP 173
Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDS 1390
P+ L + +N + S I+ + S LLFL M +
Sbjct: 174 PILLSNYEYNVATYIKINGTDS---LEQKILKLLSS--SQFLLFLMSDNIRMTFESNGKT 228
Query: 1391 LVVIRKKIVGDGIIKVSCGEDKMTWFVASQK-----------LRAGVIRPD----VKTTE 1435
I K+ + ++ + G+++ W + + K + A + PD V T +
Sbjct: 229 KCWIEKQKQNNQVVLSTNGKEESRWLIHTNKNVQVPPELKETINADINTPDKLKNVDTFD 288
Query: 1436 IALALTLQESNEGNYGPL--LYQQPVFAFLP----LRTYGLKFILQGDFVLPSSREEVDG 1489
++ A+ L E G L L Q V+ +LP + G F++ +F+ + R+++
Sbjct: 289 LSFAVALDEK-----GKLKRLEQAVVYTYLPTSFCFGSEGFPFLVNANFITDAGRQQLHK 343
Query: 1490 NSPWNQWLLSEFPALFVS 1507
+S WN+ + S+ P+ +++
Sbjct: 344 DSEWNKLIFSKIPSEYLT 361
>gi|347535876|ref|YP_004843301.1| hypothetical protein FBFL15_0964 [Flavobacterium branchiophilum
FL-15]
gi|345529034|emb|CCB69064.1| Hypothetical protein FBFL15_0964 [Flavobacterium branchiophilum
FL-15]
Length = 1505
Score = 93.6 bits (231), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 162/366 (44%), Gaps = 81/366 (22%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ 1246
Q L ++ L+ ++YS+D F+ EL+QNADD E+ E L+ + + +V+ +N
Sbjct: 141 QVVNLANSVESLATDIYSEDKRFIYELIQNADDAALDEDSE--LSIDILDKYVVISHNGA 198
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE----IHSNGFHVKFDT 1302
F++ +IR LC +G T K + A G KGIGFKSVF PE + + +FD
Sbjct: 199 PFNSRDIRGLCSIGLGT-KTNDATKTGYKGIGFKSVF---GQPEGIVYVKTENTLFRFDK 254
Query: 1303 SEGQI-GF-----------------------VLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
++ G+ ++P L N + +L+ ++
Sbjct: 255 EYARLKGWNSKWGNKRDWEEKNGVTFQCPWQMMPILSETANDNETDLILNNKIFSVK--- 311
Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
T I++ K E NI F D LLFL R+ + + N + +RKK
Sbjct: 312 --TAIKIDDSKKLYE-----NINSFFGD--AKFLLFLRRISRVELK--FNSHKLELRKKK 360
Query: 1399 ---VGDGIIKVSCGEDKMTWFVAS----------QKLRAGVIRP----DVKTTEIALALT 1441
+G+ + S E+ W+V + ++L++ P ++ T I+ A+
Sbjct: 361 NSEIGNIVTLHSNKENISNWYVKNWIHNIPLEIQKELKSDPKTPKKIQSMEKTVISFAIN 420
Query: 1442 LQES-------NEGNYGPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSP 1492
+ E EGN P++++LP + Y + FI+ +F+L +SRE++ N
Sbjct: 421 INEKFDEIKLLEEGN-------SPLYSYLPTTEKEYNIPFIVNCNFLLDASREKIHKNRK 473
Query: 1493 WNQWLL 1498
WN+WL
Sbjct: 474 WNEWLF 479
>gi|323452221|gb|EGB08096.1| hypothetical protein AURANDRAFT_64375 [Aureococcus anophagefferens]
Length = 1507
Score = 92.8 bits (229), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 3/124 (2%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ-GFS 1249
L A LS +LY HFL EL QNADD + P L L S +N++ GF
Sbjct: 350 LAAAAERLSVDLYEHRDHFLHELFQNADDARHDGVDAPALEVGLSGSWFFASSNQRPGFD 409
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS-EGQIG 1308
A ++RA+CDV STK G +A G KGIGFK+VF + D P + SNGF FD + +G +G
Sbjct: 410 AADVRAICDVNASTKAGDAA-TTGHKGIGFKAVFAMCDRPCVLSNGFAFYFDAARDGALG 468
Query: 1309 FVLP 1312
V P
Sbjct: 469 LVAP 472
>gi|332671971|ref|YP_004454979.1| ATP-binding domain-containing protein [Cellulomonas fimi ATCC 484]
gi|332341009|gb|AEE47592.1| ATP-binding region ATPase domain protein [Cellulomonas fimi ATCC 484]
Length = 1453
Score = 92.8 bits (229), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 150/331 (45%), Gaps = 40/331 (12%)
Query: 1196 HCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRA 1255
H L + Y+ + EL+QNA D N + F L + ++V +N GFS + IR+
Sbjct: 45 HLLVRNQYT--GRYPFELLQNAADAAADTNHAGHVRFHLSDQALLVADNGIGFSPDAIRS 102
Query: 1256 LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE--GQIGFV--- 1310
+C +G STK S IG KG+GF+SV +TD P++ + FD + Q+ +
Sbjct: 103 ICRLGRSTK--SPTKSIGYKGLGFRSVGEITDRPQVITGAVRFGFDENRVFEQVAALTGG 160
Query: 1311 ------LPTLVPPFNI---DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI- 1360
LPT PF + D+ + D +L + + T +RLP RT G+ +++
Sbjct: 161 LPADQKLPTYAFPFTLSEADLGEDIALVD--ELLTGGYTTVMRLPLRT----GVDRSHVE 214
Query: 1361 VDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV-VIRKKIVGDGIIKVSCGEDKMTWFVAS 1419
+ ++ L+LFL L + D + V R+ G + + G+D W V +
Sbjct: 215 AQLLRNVDAQLMLFLGSLDRLEVSGTARDFVAEVARQDTDGAQRVLMQVGDDLTQWLVYA 274
Query: 1420 QKLRAG--VIRP-----DVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLK 1471
++L ++ P +V+T A+AL L E + P+ + P G
Sbjct: 275 RQLPVARELVAPMDGWQEVETVTAAVALPLAEDGRPSTS---THYPMHVYFPTEEKTGFT 331
Query: 1472 FILQGDFVLPSSREEVDGN---SPWNQWLLS 1499
FI D+ L R ++ G +P+N+++ +
Sbjct: 332 FIAHADWALHLDRRQLSGTPDATPYNEFMTA 362
>gi|407917132|gb|EKG10453.1| hypothetical protein MPH_12311 [Macrophomina phaseolina MS6]
Length = 207
Score = 91.7 bits (226), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/100 (52%), Positives = 68/100 (68%), Gaps = 3/100 (3%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-NEQGFSAENIRAL 1256
+S++L S HFLLEL+QNADDN Y ++++PTL E L+ NE+GFS E++ A+
Sbjct: 50 ISEDLNSTSFHFLLELLQNADDNKY-DDIKPTLALAYTEDKHFRLHCNEKGFSPEDVEAI 108
Query: 1257 CDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
CD+GNSTK SSA G KG+GFKSVFRV A I S +
Sbjct: 109 CDIGNSTKL-SSASTTGEKGVGFKSVFRVARAVWIVSGEY 147
>gi|159479534|ref|XP_001697845.1| hypothetical protein CHLREDRAFT_176946 [Chlamydomonas reinhardtii]
gi|158273943|gb|EDO99728.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1218
Score = 90.9 bits (224), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 38/155 (24%)
Query: 1158 AALIVESIRRDEFGLGPNISNMES-NMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
A +VE IR +G+ S+ ++ ++ AR+G A+ LS +LYS+D HF++EL+QN
Sbjct: 254 GAEVVERIRHT-YGVDKCYSDPDAAETIRCNTARVGNAVEQLSAQLYSKDLHFVMELLQN 312
Query: 1217 ADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKG 1276
A+DN YP V P L F+L +VVLNNE+G
Sbjct: 313 AEDNSYPAGVVPCLEFVLTPDSLVVLNNEEG----------------------------- 343
Query: 1277 IGFKSVFRVTDAPEIHSNGFHVKFD-TSEGQIGFV 1310
F+V+DAPE+HS GFHV F T+ G +
Sbjct: 344 ------FKVSDAPEVHSGGFHVSFSRTAHGNLALA 372
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/182 (29%), Positives = 79/182 (43%), Gaps = 21/182 (11%)
Query: 1462 FLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFREN 1521
FL YGL+F++ DF++ +SRE VD S WN L + P LF S C
Sbjct: 381 FLKKLRYGLRFMVNADFIVTTSREAVDNGSSWNHMLAARIPQLFAS---------CLPLP 431
Query: 1522 PAKAASVYMSFVPLVGEVHGF----FSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLR 1577
V + L G V GF F+ LP+ I + R S + +P LR
Sbjct: 432 DEPHVGVRVGVALLCGFVCGFVGDLFAPLPQRIAAVCR-SAVRLPTTCGAALSPATHQLR 490
Query: 1578 GWND-RAHSLLPDILLQKHL---GLGFLSKDIV---LSDSLARALGIEEYGPKILLQIIS 1630
+D R LL +Q L GL F+S+ + S L LG++++ + L+ ++
Sbjct: 491 VCHDPRVRELLAGPGMQHALRAAGLYFVSEAVAPLHASSRLRALLGVQDFHAQDLVALLE 550
Query: 1631 SL 1632
L
Sbjct: 551 ML 552
>gi|171686084|ref|XP_001907983.1| hypothetical protein [Podospora anserina S mat+]
gi|170943003|emb|CAP68656.1| unnamed protein product [Podospora anserina S mat+]
Length = 420
Score = 89.7 bits (221), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 74/111 (66%), Gaps = 2/111 (1%)
Query: 1202 LYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGN 1261
+Y + SHFL EL+QN DDN Y + +PTLT + + +NE GF+ E++ A+C +GN
Sbjct: 1 MYRRPSHFLWELIQNVDDNNYVAH-DPTLTITYKIRVLRFDSNETGFTMEDVEAICSMGN 59
Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
S+K SS+ IGRKGIGFKSVF+V ++ I S + +FD+ E +G ++P
Sbjct: 60 SSKTRSSSTRIGRKGIGFKSVFKVAESVYIASGCYSFRFDSRES-LGMMIP 109
Score = 49.3 bits (116), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 54/121 (44%), Gaps = 15/121 (12%)
Query: 1459 VFAFLPLRTYGLK----FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
FAFLP+R YG + F+ DF L ++RE+VD +S NQ L E P F+ A +
Sbjct: 244 TFAFLPIRDYGFRVRQAFVCHADFDLVANREDVDMSSARNQVFLHELPKAFLRAVHAL-- 301
Query: 1515 LPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCK 1574
N + ++P FF LP I++ L S +LE + P K
Sbjct: 302 ------NSGHLRYTWPYYLPF-PTTDSFFEPLPGDIITLL--SRTPMLESDAGHLMAPEK 352
Query: 1575 V 1575
+
Sbjct: 353 L 353
>gi|326672132|ref|XP_003199605.1| PREDICTED: hypothetical protein LOC100537266 [Danio rerio]
Length = 1057
Score = 88.6 bits (218), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 99/189 (52%), Gaps = 3/189 (1%)
Query: 152 LIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQ-MQEVPSLHKLMVTEARINAFIHCFVGV 210
L GE V+ +V V LQV S + + +P+L L T IN +I V
Sbjct: 585 LATEGEHVTIERVIGRVCSLLQVCSLRDMNIDPRRHLPALQDLQRTLKEINLYIESVEAV 644
Query: 211 RQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDV--DVVFKIATNDVI 268
+ +LY+L +A+ + ++FEEL LGP + PLI F + ++ D V +I+T D++
Sbjct: 645 SSVCTLYELGQALANLKNKKRFEELHLGPLCKLPLIHRMFRIDANTKDDDVHQISTVDIL 704
Query: 269 VCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVT 328
L Y +K + + EFL +A + S S +LG+RIQ+LG+ I+ + +A S +
Sbjct: 705 RSLRMYRRVNKVPRVDLAEFLKHLADQYSCESPYELGIRIQSLGLPISTLNKASASDYLL 764
Query: 329 LKKFLKELQ 337
++K + +Q
Sbjct: 765 MEKAKETIQ 773
>gi|386783697|gb|AFJ24743.1| hypothetical protein, partial [Schmidtea mediterranea]
Length = 714
Score = 87.4 bits (215), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 122/524 (23%), Positives = 226/524 (43%), Gaps = 79/524 (15%)
Query: 1414 TWFVASQKLRAGVIRPDV--------KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
+WF+ +Q++ ++ P++ TE+++A+ L E L P FAFLP+
Sbjct: 41 SWFLRTQQI--PILEPEILRISPRNLSVTELSVAIPLAEQE-------LEDCPTFAFLPI 91
Query: 1466 RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKA 1525
R+YG +FI+ DF +P+SRE++ PWN+WL+ + P +F C L + +
Sbjct: 92 RSYGFRFIINADFSIPASREDISNECPWNRWLIEQVPLVFTE-----CLLEFTSRDHVQN 146
Query: 1526 ASVYMSFVPLVGEV---HGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDR 1582
+ +PL GEV F +P++ + ++ L N + PP +++ D
Sbjct: 147 TIRAIKCIPLPGEVPTDSSFHVLVPKI---RAQLVQLAWLPDRNGKLHPPSQLIVESVD- 202
Query: 1583 AHSLLPDILLQKHLGLGFLSKDIVLS--DSLARALGIEEYGPKILLQIISSLCRTENGLR 1640
S L L + L L L I S + ++RALG G + L + L R L+
Sbjct: 203 -DSSLTSALCNR-LSLTLLHPVIANSSDERVSRALG--WLGCQTL--SLQHLGRLLTALQ 256
Query: 1641 SMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPL-----SDGTFSSV 1695
++ L + I+ HS + LI L +P P+ + F S+
Sbjct: 257 PEDIA--DPVLMKSIIIAIHSGLERDPSIATCNRLIAQLLSMPIWPVLSRSGAPDRFVSI 314
Query: 1696 DEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSA--SAVDKSSLGVISVDNLNR 1753
++ + + S + + + C +L +SPA +++ ++ +K + V S
Sbjct: 315 NQSALPIFIPPSHWPNTHNNDLYS--CLRL-MISPAFIASFKTSANKKHINVAS-----S 366
Query: 1754 MLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVERE 1813
+L ++GV +L+++ ++ I+P +S+E + +N + F++ H P+ + +R
Sbjct: 367 VLQRLGVHELNSNTMIDHAIIPRLSNECQS---RNWLIAAAQFLVDHCH---PSQYRDR- 419
Query: 1814 FIVSELRKKAFVLTNHGFKRPSEIPIHFGKEF---------GNPVSVNMLIHDIDIKWYE 1864
V L A V NH + + H+ + +V LIH+ I +
Sbjct: 420 --VIPLLSFATVSYNHSNGKMNCTITHYSDLYKVHLPNQLEKTMTTVLPLIHN-RIDYTV 476
Query: 1865 VDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVA 1908
+D YL P + +WR+FF G ++ V D A
Sbjct: 477 LDDAYLDCPIQPE------QWREFFCAAGTSETFYRVAHDNGNA 514
>gi|110669161|ref|YP_658972.1| hypothetical protein HQ3280A [Haloquadratum walsbyi DSM 16790]
gi|109626908|emb|CAJ53377.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
Length = 916
Score = 87.0 bits (214), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 141/342 (41%), Gaps = 72/342 (21%)
Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESG-IVVLNNEQGFSAENIRALCDVGNSTKK 1265
+HF LEL QNADD E L + +E G + V ++ GF + ++A+C G S K
Sbjct: 49 THFPLELTQNADD----EGATNLLFEVDEEKGELYVFDDGAGFDPDGLKAVCQQGQSPKD 104
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
IG GIGFKS+F ++D IHSNG+H F + L + P +
Sbjct: 105 PRQQ--IGFMGIGFKSMFEISDNVRIHSNGYHFGFSIDDEDEDDGLDAVTPHW------- 155
Query: 1326 LLSKDPVQ--LESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF 1383
+ + P Q ++ K + T I + + +G+ +L PS+ LFL L I
Sbjct: 156 VPEEPPEQRTVKQKEYTTRIVASIKNESDDGVEQIKQALEEDNLSPSVFLFLDSLSEIEI 215
Query: 1384 RN--------------------------------------------MLNDSLVVIRKKIV 1399
R+ + +D LV IR+
Sbjct: 216 RSRQDTSFSRSLYGGHVDTFDEPEGVSEARELYKKYAEGDPDPENKLSDDDLVQIRRTGN 275
Query: 1400 GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNY-----GPLL 1454
DG + D W + + LR +R +++T+++ +A + S +G G +
Sbjct: 276 SDGEEQWVVFRD--IWDI-DKSLRRPRLRKNLETSDLFIAFRI--SQDGRLIELVDGGSV 330
Query: 1455 YQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEVDGNSPWN 1494
PV ++LPL + F++ DF L SRE + NS WN
Sbjct: 331 RISPVHSYLPLSELDVDIDFLIHADFDLNPSREGIRSNSEWN 372
>gi|150389397|ref|YP_001319446.1| putative transcriptional regulator [Alkaliphilus metalliredigens
QYMF]
gi|149949259|gb|ABR47787.1| putative transcriptional regulator [Alkaliphilus metalliredigens
QYMF]
Length = 1675
Score = 86.7 bits (213), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/365 (24%), Positives = 163/365 (44%), Gaps = 61/365 (16%)
Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAE 1251
+++ LS +LY+ F+ EL+QNADD+ ++ I E +VV ++ + F++
Sbjct: 27 ASSINALSSDLYTDSKRFIYELLQNADDSYLDDHDVKVWIKIFGEK-LVVAHSGKPFNSR 85
Query: 1252 NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS-------- 1303
++R +C++ N TKK S G KGIGFKSVF +D I S + +FD+S
Sbjct: 86 DLRGICNIANGTKK-SDITKTGYKGIGFKSVFGQSDKVTIFSKKEYFRFDSSYPFKWIWD 144
Query: 1304 EGQIG--------FVLPTLVPPFNIDM------FCRLLSKDPVQLESKCWNTCIRLPFRT 1349
+ ++ F P + P + D+ + L + V + T I++
Sbjct: 145 DSKVNWENANDREFQFPWQIIPIHTDVNEIDESINKFLEDNQVNVA-----TIIQMKNVN 199
Query: 1350 KFSEGI-AMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSC 1408
+ + + ++ ++MF LFL ++ I F M DS+ + + D I +
Sbjct: 200 ETRKAVQELSQNLNMF--------LFLKKISEINFDVM--DSVTIKINRTEKDKITLKNG 249
Query: 1409 GEDKMTWFVAS----------QKLRAGVIRPD--VKTTEIALALTLQESNEGNYGPLLYQ 1456
+ K W + + ++L++ P+ + T I L L + ++G +
Sbjct: 250 RDLKFVWLINTISLTVPTALKKELQSERNIPEKLLGTDIIELTLAAKVGSDGITRLSEQE 309
Query: 1457 QPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS-------EFPALFVS 1507
++++LP R Y L ++ F+ S+RE + +S WNQWL + + V+
Sbjct: 310 TMLYSYLPTDERNYSLPILINTSFLTTSNRESLHSDSKWNQWLFKTIAIEIFHWISTLVT 369
Query: 1508 AERSF 1512
E SF
Sbjct: 370 TEYSF 374
>gi|358367094|dbj|GAA83713.1| hypothetical protein AKAW_01828 [Aspergillus kawachii IFO 4308]
Length = 1359
Score = 84.7 bits (208), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 90/172 (52%), Gaps = 17/172 (9%)
Query: 1156 EDAALIVESIRRDEFGLGPNI---------SNMESNMLKKQHARLGRALHCLSQELYSQD 1206
E A +V SI R L I +E MLKK +G ++ L++++Y+
Sbjct: 5 ESARELVRSIARGRGYLSEEILERMDQDTRREVEEAMLKKDEM-IGSSVLTLAKDVYNSS 63
Query: 1207 SHFLLELVQNADDNIYPE----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
+ F+ EL+QNADDN Y + + P ++F + IV NE GF+ E++ A+C++G S
Sbjct: 64 ARFVFELLQNADDNSYSKAKSRSEAPYVSFRVYPRQIVFECNEDGFTPEDLIAICNIGKS 123
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN--GFHVKFDTSEGQIGFVLP 1312
+K S GYIG K IGFKSVF V I S F+++ + +G + P
Sbjct: 124 SKT-SVQGYIGEKDIGFKSVFMVAWKVLIQSGELSFYLQHRIGDSGMGMISP 174
>gi|113971542|ref|YP_735335.1| putative transcriptional regulator [Shewanella sp. MR-4]
gi|113886226|gb|ABI40278.1| putative transcriptional regulator [Shewanella sp. MR-4]
Length = 1687
Score = 84.3 bits (207), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 148/342 (43%), Gaps = 39/342 (11%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ 1246
Q +L LS +LY+ F+ EL+QNADD+ P + + S +VV ++ +
Sbjct: 22 QATNQASSLVSLSTDLYTDSKRFIYELLQNADDSAEPGHSVKVWIKVFGNS-LVVAHSGR 80
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS--- 1303
FS ++R +C+V TKK S+ G KGIGFKSVF +D I + + +FD S
Sbjct: 81 EFSPRDVRGICNVNYGTKKEDSSK-TGYKGIGFKSVFGQSDKVTIFTANEYFRFDASYPF 139
Query: 1304 -------------EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTK 1350
+ FV P + P D+ +L P++L K N +
Sbjct: 140 NWIWDGTKEEWEAKSDREFVNPWQIIPIYTDVREGILK--PIELYLKTINANVATIIELN 197
Query: 1351 FSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE 1410
+G + + ++ D++ + LFL + I F ++ N + I + ++K E
Sbjct: 198 -DKGDVLLALKELSEDVN--MYLFLKSISSINF-DIDNRKHIEIDRSESNRIVLKDGQSE 253
Query: 1411 DKMTWFVAS------QKLRAGVIR----PD--VKTTEIALALTLQESNEGNYGPLLYQQP 1458
K W +S L+ + PD + I L L + +G ++
Sbjct: 254 QK-KWLTSSITLNVPSDLKVALQDERNIPDKLLNAETIELTLAAKTGADGIIKLSQNERR 312
Query: 1459 VFAFLPLR--TYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
++++LP Y L ++ F+ ++RE + +S WNQWL
Sbjct: 313 IYSYLPTEETKYSLPVLVNTSFLTNANRENLHADSKWNQWLF 354
>gi|159466910|ref|XP_001691641.1| hypothetical protein CHLREDRAFT_170531 [Chlamydomonas reinhardtii]
gi|158278987|gb|EDP04749.1| hypothetical protein CHLREDRAFT_170531 [Chlamydomonas reinhardtii]
Length = 1333
Score = 84.0 bits (206), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 57/101 (56%), Gaps = 22/101 (21%)
Query: 1213 LVQNADDNIYPENV---------EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
L+QNADD YPE+ PT++ L +G+ V NE+GFS ++RALCD G
Sbjct: 513 LLQNADDCSYPESELAALPGGGEPPTMSLRLSGAGLAVATNERGFSEADVRALCDTGE-- 570
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
KGIGFKSVF ++DAP I+SNGF FD +
Sbjct: 571 -----------KGIGFKSVFALSDAPAIYSNGFAFGFDAQD 600
>gi|340353805|ref|ZP_08676610.1| hypothetical protein HMPREF9144_2422, partial [Prevotella pallens
ATCC 700821]
gi|339608492|gb|EGQ13386.1| hypothetical protein HMPREF9144_2422 [Prevotella pallens ATCC 700821]
Length = 1310
Score = 82.8 bits (203), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/367 (25%), Positives = 170/367 (46%), Gaps = 45/367 (12%)
Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTLTFILQESGIVVLNNEQGFSA 1250
++L+ LS +Y+++ F+ EL+QNA D E + ++++ ++ ++N F+
Sbjct: 27 AQSLNLLSSGIYTEEERFVYELLQNAIDAFVDVEGESLKIRVEIRDNYLLFMHNGAAFTE 86
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
+I L DVGN +K ++ +G KGIGFKSVF + + S F KFD E I +
Sbjct: 87 ADIEGLSDVGNGSK-AANVKKVGYKGIGFKSVFMKSTQVFVKSGPFCFKFD-KEDSIDY- 143
Query: 1311 LPTLVPPFNI---DMFCRLLSKD---PVQLESKCWN--TCIRLPFRTKFSEGIAMNNIVD 1362
+P+ +P + D+ +++ + P ++E + +N T IRL + + +
Sbjct: 144 MPSDMPDGKLSPEDIPWQIIPIEAIVPFEVEDEAFNVITYIRLT-----DTASLVPKVDE 198
Query: 1363 MFSDLHPSLLLFLHRLQC-IMFRNMLNDSLV--VIRK------------KIVGDGIIKVS 1407
+ L+ LLFL+ I F N +D+LV V RK K+V +IK S
Sbjct: 199 LLKGLN--FLLFLNAQDVSIAFYN--SDTLVRKVARKENNGEISLLIDDKVVSRWLIKSS 254
Query: 1408 C--GEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
++ ++A D E++ A+++ + E + + + FLP
Sbjct: 255 TISVTKEVKQYIAQNPASVPQKLKDASEIEVSFAISVNDKGELH---KVENAVAYTFLPT 311
Query: 1466 R--TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS--AERSFCDLPCFREN 1521
G F++ +F+ + R+++ +S WN+ + E P L++ AE S R
Sbjct: 312 SYGNLGAPFLINANFITDAGRQQLQQDSEWNKLIFKEAPTLYLKWIAELSHTYKTYHRVL 371
Query: 1522 PAKAASV 1528
P+K A V
Sbjct: 372 PSKVAKV 378
>gi|408492908|ref|YP_006869277.1| ATP-binding protein, putative [Psychroflexus torquis ATCC 700755]
gi|408470183|gb|AFU70527.1| ATP-binding protein, putative [Psychroflexus torquis ATCC 700755]
Length = 1254
Score = 82.8 bits (203), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 152/351 (43%), Gaps = 60/351 (17%)
Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAE 1251
+L LS ++Y+ F+ EL+QNADD + E + + ++ +++ + + FS
Sbjct: 27 ANSLESLSSDIYTDSKRFIYELIQNADD-ASSNSRELDILLKVVDNYLIISHQGEKFSEI 85
Query: 1252 NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL 1311
+I ++C VG+ KKG + G KGIGFKSVF +D +I+S + KFD S Q
Sbjct: 86 DIESICSVGDGNKKGDT-NKTGFKGIGFKSVFSHSDYVKINSGNYCFKFDKSHWQ----- 139
Query: 1312 PTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT-----------KFSEGIAMNNI 1360
+N + + +D + + K T I++P++ F + ++ I
Sbjct: 140 ----GHWNSEWQNQKEWRDGREKKEK--ETTIKMPWQIIPIWSEIEQLFSFVKSYNVSTI 193
Query: 1361 VDM--FSDLHPSLLLFLHRLQCIMFRNMLNDSLVV------IRKKIVGDGIIKVSCGEDK 1412
+ S L LL Q ++F N + V + +KI + +K+
Sbjct: 194 IKFEDTSKLEKELLELFSNTQILLFLRCQNVKITVEGNKPFVIEKIKENNTVKLKQNNKI 253
Query: 1413 MT-WFVAS-----QKLRAGVIRPDVK---------TTEIALALTLQES-----NEGNYGP 1452
++ W + S K +I+ D + TEI+LA+ + + N+ N
Sbjct: 254 LSEWLLKSFTFIVDKSTTELIKNDARIPKKLRLATKTEISLAVQIDKGKIKTINKDN--- 310
Query: 1453 LLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
+ +F +LP Y F++ F+ + R+ + + WN WL + P
Sbjct: 311 ----RLIFTYLPTSVNYDFPFLVNASFITDAGRQHIHEDLQWNIWLFKQIP 357
>gi|323446263|gb|EGB02492.1| hypothetical protein AURANDRAFT_68829 [Aureococcus anophagefferens]
Length = 1632
Score = 81.3 bits (199), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 101/218 (46%), Gaps = 8/218 (3%)
Query: 127 PPQP-QKLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQ 185
P P Q+ + + RI + K R+ GE V ++ V V S LG +
Sbjct: 15 PVDPAQRADAAAVSRIAK---KLRNRACERGERVRLLELVDDVCSAHGVRSLRDLGLSVA 71
Query: 186 EVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPL 245
V L L E + I FV R I +L+DL +C + G+ FE+L LGP L+QPL
Sbjct: 72 AVDELRSLRQLEGLLEIQICAFVESRFIATLHDLSHNLCAAHGVRAFEDLRLGPILKQPL 131
Query: 246 ISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLG 305
+ F + V + DV+ L ++ K + + ++EFL+++A KR +AS L
Sbjct: 132 VQRKFYADEALTAVPALEVADVVRELEKF--RFKKRRVDLEEFLNWLAAKRGVASPRSLC 189
Query: 306 VRIQN--LGMHINFIREARKSQDVTLKKFLKELQPHHK 341
+ + +G I + +A + + + LK+L+ +
Sbjct: 190 LTMSKYAIGTTIKYASQAARDEKDSRVAALKQLRDARR 227
>gi|153870496|ref|ZP_01999887.1| conserved hypothetical protein [Beggiatoa sp. PS]
gi|152073038|gb|EDN70113.1| conserved hypothetical protein [Beggiatoa sp. PS]
Length = 1020
Score = 80.9 bits (198), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 142/341 (41%), Gaps = 62/341 (18%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNN--------------- 1244
E YS ++HF+ EL+QNADD V+ T FIL +SGI +N
Sbjct: 40 EKYSDNAHFIYELLQNADD------VKATKARFILNQSGIYFAHNGTIHFSITAPDCEEI 93
Query: 1245 -EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
Q +I A+ +GNS K + IG+ G+GFK+VF+ T PEI+ F K D
Sbjct: 94 DTQNQCLGHINAITSIGNSNKYEAQ---IGKFGVGFKAVFQYTQTPEIYDPPFCFKIDR- 149
Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGIAMNNIVD 1362
F++P L L ++ P E + T PF + + A IV+
Sbjct: 150 -----FIVPIL-----------LTTEHP---ERQPDETLFYFPFDNPQQTPHQAFQAIVN 190
Query: 1363 MFSDLHPSLLLFLHRLQCIMFRNMLNDS--LVVIRKKIVGDG--IIKVSCGEDKMTW--- 1415
++L LL + + + N N L I+ V + I++ G W
Sbjct: 191 KLTNLSYPLLFLRNLTEITWYCNNENQGKYLKTIKPLEVNNTHYHIELISGSYAPAWEPN 250
Query: 1416 -----FVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQ--PVFAFLPLR-T 1467
S+ V+ + ++A + S G +L++Q P + F P + T
Sbjct: 251 PDAPASTFSKNNHFLVLTDNTNNHPFSIAYLVNLSEHGQPIQILHEQTFPAYCFFPTKET 310
Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
L FI+Q F+L SRE + + WNQ L+ L ++
Sbjct: 311 THLHFIVQAPFLLTDSREGIKQTNSWNQQLVKALAQLIANS 351
>gi|258514802|ref|YP_003191024.1| putative transcriptional regulator [Desulfotomaculum acetoxidans DSM
771]
gi|257778507|gb|ACV62401.1| putative transcriptional regulator [Desulfotomaculum acetoxidans DSM
771]
Length = 1672
Score = 80.5 bits (197), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/345 (23%), Positives = 149/345 (43%), Gaps = 46/345 (13%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNE 1245
Q +L+ LS +LY+ F+ EL+QNADD+ +N + +I + + +V+ ++
Sbjct: 22 QAVNQASSLNALSADLYTDSKRFIYELLQNADDS--SQNNDTVKVWIKIFNNNLVIAHSG 79
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS-- 1303
F+ +++ +C+V N TKK S G KGIGFKSVF +D I++ + +FD S
Sbjct: 80 SPFTVRDLQGICNVNNGTKK-SDLTKTGYKGIGFKSVFGQSDKVTIYTKNEYFRFDASYS 138
Query: 1304 --------------EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQ--LESKCWNTCIRLPF 1347
F P + P + + +S +P+ L N +
Sbjct: 139 FEWKWAESKTIWENNNDRKFQFPWQIIPIYTE--AKEVS-EPINQFLNQNNANVATIIQM 195
Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVS 1407
+ +A N+ + ++ LFL + I F + +S+ V + D I
Sbjct: 196 KNVNETSLAAQNLSQ-----NLNMFLFLKNISEINFD--IAESVTVKIDRTKKDRITLQK 248
Query: 1408 CGEDKMTWFVASQKLRA-----GVIRPD-------VKTTEIALALTLQESNEGNYGPLLY 1455
K+ W + + L V++ + + T I L+L + ++G
Sbjct: 249 GNSSKVDWLINTVSLAVPNDVKAVLQDERNIPEKLLNTDSIELSLAAKVGSDGITKLSNQ 308
Query: 1456 QQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
++ ++++LP Y L ++ F+ ++RE + +S WNQWL
Sbjct: 309 EKLLYSYLPTDETKYSLPVLVNTSFLTAANRESLHADSKWNQWLF 353
>gi|359415302|ref|ZP_09207767.1| hypothetical protein CDLVIII_5898 [Clostridium sp. DL-VIII]
gi|357174186|gb|EHJ02361.1| hypothetical protein CDLVIII_5898 [Clostridium sp. DL-VIII]
Length = 1470
Score = 80.5 bits (197), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 150/370 (40%), Gaps = 46/370 (12%)
Query: 1176 ISNMESNMLKKQHARLGRALHCLSQELYSQDSH---FLLELVQNADDNIYPENVE----- 1227
ISN M+K R E +D + +LEL+QNADD + E
Sbjct: 10 ISNFRETMIKTYTYDPKRIREDFGSETRVKDDYKGRVILELLQNADDAQVTNDTENMRIG 69
Query: 1228 -PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVT 1286
+ L++ +N + + A+C + +S KK YIG KG+GFKSV T
Sbjct: 70 DKNVILYLKKDAFYCVNGGYPVTQNGLEAICRLSHSPKKDRKL-YIGEKGLGFKSVLCFT 128
Query: 1287 DAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
D PEIHS H KF+ ++ + N D L + P L + W ++
Sbjct: 129 DRPEIHSGDLHCKFNRNDAMKFLYESNINGIDNFDSDNIALFRFPQYLSQEIW---MQDK 185
Query: 1347 FRTKFSE--------GIAMNNIV---DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIR 1395
T+F + I NN + + ++ ++LLFL+ L I R +DS ++ +
Sbjct: 186 ILTEFVKENATVIKLKITDNNYIVLKEKLLEIKSTILLFLNSLNKISIRIEEDDSSLIEK 245
Query: 1396 KKIVG-------DGIIKVSCGEDKMTWFVASQKLRAGV---------IRPDVKTTEIALA 1439
I+ D + G +K W V ++ + D+ +++ A
Sbjct: 246 DFIIHRSESQELDNQCTLEDGNNKSIWQVYNETFEIEKNFIGDTDEQVFQDINACKVSFA 305
Query: 1440 LTLQESNEGNYGPL--LYQQPVFAFLPL-RTYGLKFILQGDFVLPSSREEVDGNSPWNQW 1496
L + E NY PL L + + + P + L F+ + SSR+ ++ +N+
Sbjct: 306 L---QKEELNYKPLDNLDENYIRVYFPTCEEFSLPFLFHASYYTDSSRKNINLEHEYNRN 362
Query: 1497 LLSEFPALFV 1506
L + +F+
Sbjct: 363 LTKKAVDIFI 372
>gi|290990058|ref|XP_002677654.1| predicted protein [Naegleria gruberi]
gi|284091262|gb|EFC44910.1| predicted protein [Naegleria gruberi]
Length = 1534
Score = 80.1 bits (196), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 101/411 (24%), Positives = 170/411 (41%), Gaps = 70/411 (17%)
Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIV--V 1241
++K + +G A SQ + HF+LE +QNADD+ Y + L F ES V +
Sbjct: 249 IEKMQSEIGSARSLFSQGFNTNLFHFILEFIQNADDSEYLQG--GILEFDFDESRKVLKI 306
Query: 1242 LNNEQGFSAENIRALCDVGNSTK--KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
+NE GF+ +N+ +L + +S K K IG KGIGFK++F V + SNGF
Sbjct: 307 KSNECGFTYDNLYSLSGIADSDKVSKKLETLAIGEKGIGFKTIFSVCKKVHLVSNGF--- 363
Query: 1300 FDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN 1359
D+F L S D + E++ +N S+G
Sbjct: 364 ---------------------DVF--LTSTDLTKKENQEFN----------LSDGTLYEI 390
Query: 1360 IVDMF--SDLHPSLLLFLHRLQCIMFRNMLND---SLVVIRKKIVGDGIIKVSCGEDKMT 1414
++ F S LL L ++ I+F++ LN SL + + K +K
Sbjct: 391 EIEKFDHSIFDKYTLLTLKNIKTIIFKDGLNTTKYSLTCKDNEYLLSTTFKSKTTTEKFY 450
Query: 1415 WFVASQKLRAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT-YG 1469
F + I RP+ ++ + + + Y + +FLP++ Y
Sbjct: 451 GFSIENGINMKNILCEKRPNFHRADLMVFFPEEWKVDKRY-------KLHSFLPVKEDYN 503
Query: 1470 LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVY 1529
L ++Q DF+L +RE + +S WN ++ P L V A + ++ ++ Y
Sbjct: 504 LPLLIQSDFILSQNREALLESSDWNISIIDHIPQLLVKAFNTMKNINNWK---------Y 554
Query: 1530 MSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQW--APPCKVLRG 1578
F L E FS + ++ L+ +C+ L G ++ P +L G
Sbjct: 555 DFFKVLQVESKLHFSNVQFKTINLLQNIDCIPLLGGQTKYFEKPSNIILMG 605
>gi|448528709|ref|ZP_21620217.1| restriction endonuclease-like protein [Halorubrum hochstenium ATCC
700873]
gi|445710449|gb|ELZ62263.1| restriction endonuclease-like protein [Halorubrum hochstenium ATCC
700873]
Length = 1561
Score = 79.3 bits (194), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 91/370 (24%), Positives = 153/370 (41%), Gaps = 54/370 (14%)
Query: 1155 CEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELV 1214
E+A I+ IR E + ++ + ++Q + ++ H Q + L EL+
Sbjct: 16 VEEAYEILRDIREKELKV------VDDDFERRQ--KRTQSFHDYIQRQLDHPAQVLPELI 67
Query: 1215 QNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGR 1274
QNADD VE LT + + + N+ + +E + LC G STK+ YIG
Sbjct: 68 QNADDIKSCNTVEIELT----DDTLRIRNDGRPMKSEEVDTLCAAGESTKQDPE--YIGH 121
Query: 1275 KGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQL 1334
G+GFKSVF ++D+P + S FH +FD+ RL + +
Sbjct: 122 FGLGFKSVFSISDSPRVRSGYFHFEFDSD---------------------RLTVPEIHDV 160
Query: 1335 ESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI---------MFRN 1385
+ T I LP + S + + ++H LL +L + I ++R
Sbjct: 161 STPVKGTEIELPLKEDLS-AERREQLQERLDEVH-RLLTYLRNISQIEITQEGETTVYRR 218
Query: 1386 MLNDSLVVIRKKIVGDGIIK---VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL 1442
+DS R D +++ V +DK + Q I + E +++TL
Sbjct: 219 E-SDSESPERIIYKDDEVLERRLVFTTDDKPSGEAFEQLAEKRQIDDEETVRESPVSVTL 277
Query: 1443 QESNEGNYGPLLYQQP--VFAFLPL-RTYGLKFILQGDFVL-PSSREEVDGNSPWNQWLL 1498
+ ++ P + +F FLP G F + DF+L P+ RE + P+NQWLL
Sbjct: 278 SFEVDDSHRPCSKAKDCYLFNFLPTDERSGFPFDVHADFLLEPNRRELAWKDGPYNQWLL 337
Query: 1499 SEFPALFVSA 1508
+ + A
Sbjct: 338 NRVGETYRKA 347
>gi|359415220|ref|ZP_09207685.1| peptidase C14 caspase catalytic subunit p20 [Clostridium sp. DL-VIII]
gi|357174104|gb|EHJ02279.1| peptidase C14 caspase catalytic subunit p20 [Clostridium sp. DL-VIII]
Length = 1679
Score = 79.0 bits (193), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 149/350 (42%), Gaps = 56/350 (16%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNE 1245
Q +L+ LS +LY+ F+ EL+QNADD+ +N E +I + +VV ++
Sbjct: 22 QAVNQASSLNALSGDLYTDSKRFIYELLQNADDS--SQNSEAVKVWIKIFGDNLVVAHSG 79
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS-- 1303
+ F+ +I+ +C+V N TKK S G KGIGFKSVF +D I + + +FD+S
Sbjct: 80 RPFTTRDIQGICNVNNGTKK-SDLTKTGYKGIGFKSVFGQSDKVTIFTKDEYFRFDSSYS 138
Query: 1304 --------------EGQIGFVLPTLVPPFNI---DMFCRLLSKDPVQLESKCWNTCIRLP 1346
F P + P D+F +P+ K + +
Sbjct: 139 FDWKWDVSKKIWEQNNDREFQFPWQIIPIYTEVKDVF------EPINQFLKDIDASVATI 192
Query: 1347 FRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKV 1406
+ K +N L +L +FL L+ I N LV + K + I +
Sbjct: 193 IQMK-----NVNETSQAAQSLSQNLNMFLF-LKNINEINFDITELVSVEIKRTENDRITL 246
Query: 1407 SCGED-KMTWFVASQKLRAGVIRPDVKTT---------------EIALALTLQESNEGNY 1450
G K+ W + + +L+ + +VKT I L+L + N+G
Sbjct: 247 KRGNGLKVNWLLNTIRLK---VPSEVKTVLQDERNIPEKLLNADSIELSLAAKVENDGIT 303
Query: 1451 GPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
++ ++++LP Y L ++ F+ ++RE + +S WNQWL
Sbjct: 304 KLSKQEKLLYSYLPTGETKYSLPVLVNTSFLTTANRESLHADSKWNQWLF 353
>gi|291460280|ref|ZP_06599670.1| conserved hypothetical protein [Oribacterium sp. oral taxon 078 str.
F0262]
gi|291417035|gb|EFE90754.1| conserved hypothetical protein [Oribacterium sp. oral taxon 078 str.
F0262]
Length = 1239
Score = 79.0 bits (193), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 150/331 (45%), Gaps = 46/331 (13%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQ--ESGIVVLNNEQGFSAENIRALCDVG 1260
YS +HF+ EL+QNA+D E +++ + +V +N F +++R + +
Sbjct: 35 YSDSNHFVYELLQNAED-------EKATKVVIEFYKDQLVFYHNGDPFDEDDVRGVSSML 87
Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN--GFHVKF------DTSEGQIGFVLP 1312
TK + A IGR G+GFKSVF+ T PEI+S+ F +K + S +
Sbjct: 88 MGTKDRNDAQTIGRFGMGFKSVFKYTYQPEIYSDEEAFIIKNYLLPVENDSHWDFQGLKS 147
Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEG----IAMNNIVDMFSDLH 1368
TL P + D L + L T I +PF+ + +G ++ +++D + L
Sbjct: 148 TLAYP-DGDGTKYLPFANTQHL------TKIVIPFKKRNEKGELVAVSGKDVLDKLNSLS 200
Query: 1369 PSLLLFLHRLQCIMFRNMLNDSLVVIR-KKIVGDG-------IIKVSCGEDKMTWFVASQ 1420
+LLFL ++ + + N N V I GD I S G+++++ ++ +
Sbjct: 201 GEILLFLTYIRNLYWINKENGKYVHITLSHANGDDQLLTCRIIGSESDGKEEISRYLRYK 260
Query: 1421 KLRAGVIRPDVKTTEIALA--LTLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
K+ PD+K+ E+A+A L +Q N + PV+ + P R L F++ G
Sbjct: 261 KV---FDHPDMKSAEVAVAYRLNVQARNINE----MEHTPVWVYFPTRDMTDLPFLIHGS 313
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
F SRE++ S +N L L +
Sbjct: 314 FETAVSREKLMTPSTFNDSLFGRLGDLIAES 344
>gi|428213778|ref|YP_007086922.1| hypothetical protein Oscil6304_3433 [Oscillatoria acuminata PCC 6304]
gi|428002159|gb|AFY83002.1| hypothetical protein Oscil6304_3433 [Oscillatoria acuminata PCC 6304]
Length = 1585
Score = 78.2 bits (191), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 149/333 (44%), Gaps = 32/333 (9%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV-EPTLTFILQESGIVVLNNEQGFS 1249
+ + + L+ + YS+ F+ EL+QNADD P + + + ++ E+ ++VL+N + F
Sbjct: 29 IAKFVTDLTGDSYSEQERFIFELLQNADD--LPTIIGQVHVKLVILENHLLVLHNGKPFD 86
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD--TSEGQI 1307
EN+RA+C+ STK +S G KGIGFKSVF + I+S F KFD +++ Q
Sbjct: 87 DENLRAICNRSESTKSKNS-DQTGYKGIGFKSVFHDSQCVYINSGDFSFKFDRNSTKHQN 145
Query: 1308 GFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGI--AMNNIVDMFS 1365
P + P ++ R D +Q S + + + + E I I +FS
Sbjct: 146 PAQTPWQIKPIWVE---RDEYPDEIQAYSDFFESPVATALKVS-PEKINDYKEKIKKLFS 201
Query: 1366 DLHPSLLLFLHRLQCIMFR--NMLNDSLVVIRKKIVGDGIIKVSCGEDKMT----WFV-- 1417
D P L+LFL + I N L+ + VI+ K I+ + + T W +
Sbjct: 202 D--PRLILFLRHISSIEVYGLNDLSLDIKVIKLKQQDIYTIQYTALPSEQTIETQWIIKN 259
Query: 1418 ----ASQKLRAGVIR----PDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLP--LRT 1467
SQ +R + PD + LT N + +F +LP ++
Sbjct: 260 FELPVSQNIREAMKNDEKVPDKLKSIQNSKLTFAVEIVNNQIHPVDNSVLFNYLPTNVKA 319
Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSE 1500
F++ DF+ + RE + + WN +L +
Sbjct: 320 DQFPFLINADFLTTAGRETIHVDHTWNCFLFEQ 352
>gi|435849813|ref|YP_007301754.1| molecular chaperone of HSP90 family [Methanomethylovorans hollandica
DSM 15978]
gi|433663301|gb|AGB50726.1| molecular chaperone of HSP90 family [Methanomethylovorans hollandica
DSM 15978]
Length = 1113
Score = 77.4 bits (189), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 81/323 (25%), Positives = 135/323 (41%), Gaps = 43/323 (13%)
Query: 1197 CLSQELYSQDSHFLLELVQNADDNIYPENVEPT----LTFILQESGIVVLNNEQGFSAEN 1252
L ELYS+ +HF+ EL+QNA+D N E T + F L + + +N F E+
Sbjct: 27 TLLTELYSEKTHFIYELLQNAEDAYERLNKEDTRIKSVHFELYNDRLEIRHNGIPFDEED 86
Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
I+ +C + + TKK IG+ GIGFKSV+ T +PEI+S G F +
Sbjct: 87 IKGICRLASGTKK-QDISQIGKFGIGFKSVYAYTKSPEIYS-----------GHYCFCIK 134
Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT-KFSEGIAMNNIVDMFSDLHPSL 1371
V P++I+ ++ T +P K ++ + I L
Sbjct: 135 NYVHPYSINQRNDIIKGQ----------TLFVIPLNNDKINQKDSYLEIKSKLIGLGART 184
Query: 1372 LLFLHRLQCIMFR---------NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKL 1422
LLFL ++ I ++ N R + + + V E+ W S+ +
Sbjct: 185 LLFLKNIEEITWKIDGTYGKYYKKSNHETTYSRAILYSESNVSVISIEN---WLKFSRTI 241
Query: 1423 RAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLP 1481
+ + EIA + E+++ + F P + GL F++QG+F
Sbjct: 242 SNEC--NNKSSVEIAYLVQNDENSKKEQIISADNTQLVVFFPTDKKTGLSFLIQGNFQTT 299
Query: 1482 SSREEVDGNSPWNQWLLSEFPAL 1504
+SR+ + N WN+ L+ E L
Sbjct: 300 TSRDNIFSND-WNKLLIEEIAIL 321
>gi|404484308|ref|ZP_11019521.1| hypothetical protein HMPREF1135_02581 [Clostridiales bacterium
OBRC5-5]
gi|404342625|gb|EJZ68996.1| hypothetical protein HMPREF1135_02581 [Clostridiales bacterium
OBRC5-5]
Length = 1656
Score = 77.4 bits (189), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 75/343 (21%), Positives = 149/343 (43%), Gaps = 59/343 (17%)
Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENI 1253
+L LS +LY+ + F+ EL+QNADD+ + +N I ++ +VV ++ + F +I
Sbjct: 32 SLTALSVDLYTDANRFIYELLQNADDSAFHDNGVKVWIKIFNDN-LVVAHSGKVFDTRDI 90
Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---------- 1303
R +C++ N TKK S G KGIGFKSVF +D I+S+G +FD +
Sbjct: 91 RGICNINNGTKK-SDLKKTGYKGIGFKSVFGQSDYVVIYSDGEFFRFDKNYDFKWRWEKS 149
Query: 1304 ------EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAM 1357
+ F P + P + + ++ + + ++ K + + +
Sbjct: 150 KKDWEKDNNRKFEYPWQIIP---------IYTENTEISEQIRDYIFKVG--AKVATILKI 198
Query: 1358 NNIVDMFSDL-----HPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDK 1412
N I + + + ++ LFL + I ++ D ++ I + I I+ K
Sbjct: 199 NYIQETIDAIRVLTENSNMFLFLKHITEI----IIEDKVITINRNIKNRLILSDDTNSQK 254
Query: 1413 MTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV------------- 1459
W + + KL + D++ + + Q+ E N +++ +
Sbjct: 255 -EWIIKNIKLN---VSNDLRKSLLEERNIPQKLLEANNIDMIFAAQIDKNRIVELNKEDR 310
Query: 1460 --FAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
+++LP Y + ++ F++ ++RE + +S WN+WL
Sbjct: 311 LLYSYLPTGETQYAIPVLVNTSFLVSANRESIHKDSKWNKWLF 353
>gi|315652131|ref|ZP_07905129.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
3986]
gi|315485623|gb|EFU76007.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
3986]
Length = 1286
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 145/332 (43%), Gaps = 49/332 (14%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQE-SGIVVLNNEQGFSAENIRALCDVGN 1261
YS +HF+ EL+QNA+D V + I +V +N + F +++R + +
Sbjct: 35 YSDSNHFVYELLQNAED------VRASKVVIEYYCDKLVFYHNGKPFDEDDVRGVSSMLM 88
Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNI- 1320
TK A IG+ G+GFKSVF+ T PEI+S+ + GF++ + P +
Sbjct: 89 GTKSRDDASTIGKFGMGFKSVFKYTYQPEIYSD-----------EEGFIIQNYLLPKEVN 137
Query: 1321 ---------DMFCRLLSKDPVQLESKCWN-TCIRLPFRTKFSEG----IAMNNIVDMFSD 1366
+ F V + + C + T I +PF S+G ++ N+++D
Sbjct: 138 NNWDFRKIKEEFNYSDGSKKVYVFANCEHLTKIIIPFNKIDSKGKLSYVSGNDVLDKLEG 197
Query: 1367 LHPSLLLFLHRLQCIMFRNMLNDSL--VVIRKKIVGDGII--KVSCGE-DKMTWFVASQK 1421
L +LLFL +Q + + N + + K + +I ++S D+ K
Sbjct: 198 LSGEILLFLTCIQDLYWINKEAGKYAHITLSKDSKDENLITCRISGNNSDRKEDISKYLK 257
Query: 1422 LRAGVIRPDVKTTEIALALTL----QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQG 1476
+ PD+K E+++A + + NE N P++ + P R L F++QG
Sbjct: 258 YKDVFDHPDMKNAEVSVAYKVNSLAKNVNEIN------SSPIWVYFPTRDNTRLPFLIQG 311
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
F SRE++ S +N L + AL +
Sbjct: 312 SFETAVSREKLMTPSDFNNVLFNRLGALIAES 343
>gi|421077861|ref|ZP_15538823.1| putative transcriptional regulator [Pelosinus fermentans JBW45]
gi|392524114|gb|EIW47278.1| putative transcriptional regulator [Pelosinus fermentans JBW45]
Length = 347
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 146/341 (42%), Gaps = 52/341 (15%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ 1246
Q +L+ LS +LY+ F+ EL+QNADD+ V I + +VV ++ +
Sbjct: 22 QAVNQASSLNALSGDLYTDSKRFIYELLQNADDSSQNNEVVKVWIKIFNDK-LVVAHSGK 80
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS--- 1303
F+ +++ +C+V N TKK S G KGIGFKSVF +D I SN + +FD+S
Sbjct: 81 PFTTRDLQGICNVNNGTKK-SDLTKTGYKGIGFKSVFGQSDKVTIFSNMEYFRFDSSYPF 139
Query: 1304 -----EGQI--------GFVLPTLVPPFNIDMFCRLLSKD-PVQ--LESKCWNTCIRLPF 1347
E +I F P + P ++ + D P+ LE+ N +
Sbjct: 140 EWNWDEAKIIWEKKNDREFQFPWQIIP----IYTKAEEIDEPINQFLENIEANVATIIQM 195
Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVS 1407
+ +A+ + + ++ LFL + I F M + S+ + R + D II +
Sbjct: 196 KNVNETSLAVGKLSQ-----NLNMFLFLKNISEINFDVMESVSIEINRSE--KDRIILKN 248
Query: 1408 CGEDKMTWFVASQKLRAGVIRPDVKTT---------------EIALALTLQESNEGNYGP 1452
K W + + L + DVKT I L L + N+G
Sbjct: 249 GNASKSDWLINTISL---TVPNDVKTALQDERNIPDKLLNTDSIELTLAAKVGNDGITKI 305
Query: 1453 LLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNS 1491
+ ++++LP R Y L ++ F+ ++RE + +S
Sbjct: 306 SEQETLLYSYLPTDERKYSLPVLVNTSFLTTANRESLHADS 346
>gi|313669368|ref|YP_004049795.1| hypothetical protein Sulku_2587 [Sulfuricurvum kujiense DSM 16994]
gi|313156565|gb|ADR35242.1| hypothetical protein Sulku_2587 [Sulfuricurvum kujiense DSM 16994]
Length = 1006
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 150/361 (41%), Gaps = 68/361 (18%)
Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNNEQGFSA 1250
G H + LYS +HFL EL+QNA D E T ++F L + +N + FS
Sbjct: 21 GDNSHRIIANLYSDSTHFLYELLQNAQD------AEATSVSFELYSDRLETSHNGRAFSF 74
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
++ ++ +G+STK + IG+ G GFKSVF +TD P I+S ++ K ++
Sbjct: 75 TDVESITTIGSSTK-SNEPNKIGKFGAGFKSVFSITDTPHIYSGTYNFKIS------DYI 127
Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGIAMNNIVDMFSDLHP 1369
+P + DP++ +S T + LPF R + ++I +
Sbjct: 128 IPETI--------------DPLENQSDE-RTSVILPFDRAEIDAKTLYDSIAKRLRSIGK 172
Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGD--GIIKVSCGEDKMTWFVASQKL----- 1422
LLFL L I ++ +GD G +K + E T + + +
Sbjct: 173 EELLFLSNLCRIGWK--------------IGDEAGYVKKATDEASETRSLITLETHNKTE 218
Query: 1423 ------RAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLP-LRTYGLKFILQ 1475
R I + +IA AL + +G Y + P++ F P + L+F++
Sbjct: 219 RFELYRRGFTIENKLLECKIAFAL----NADGRYVSIG-TNPLYVFFPTVVPTNLQFLVH 273
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVS-----AERSFCDLPCFRENPAKAASVYM 1530
+ +RE ++ + P NQ + L+ S AER D+ P + S +
Sbjct: 274 APYKTTPNRETMNFSDPQNQTITDNIIELYKSVLKDMAERKVLDVETMGLFPLSSQSGEL 333
Query: 1531 S 1531
S
Sbjct: 334 S 334
>gi|73748050|ref|YP_307289.1| hypothetical protein cbdb_A109 [Dehalococcoides sp. CBDB1]
gi|73659766|emb|CAI82373.1| hypothetical protein cbdbA109 [Dehalococcoides sp. CBDB1]
Length = 1023
Score = 75.5 bits (184), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 80/353 (22%), Positives = 155/353 (43%), Gaps = 48/353 (13%)
Query: 1172 LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI---YPEN--- 1225
+ + + S+ ++++ +S++ YS +HF+ EL+QNA+D + + +N
Sbjct: 1 MASDYQKITSDNIRRRGEEFDDIGEWISEQFYSDRTHFIYELLQNAEDALSRRFRDNSDS 60
Query: 1226 -VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFR 1284
V + F L + + + + F+ +++RA+ D+ TK IG+ GIGFKSV+
Sbjct: 61 KVPRRVQFRLYSNRLEFRHFGKLFTEDDVRAISDILKGTK-AIDQNQIGKFGIGFKSVYA 119
Query: 1285 VTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIR 1344
T PE+HS H F + + P N D +L ++ T
Sbjct: 120 FTSTPEVHSGDEH-----------FFIERYIRPKNADQIPKLEDQE----------TLFV 158
Query: 1345 LPF-RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDS----LVVIRKKIV 1399
PF + + ++ +I + +++ LLFL + I + + NDS L +R++
Sbjct: 159 FPFNHDRVTPEDSLRSISNKLNNIGVRTLLFLKHINEIEWL-VGNDSKGTYLKEVRRR-- 215
Query: 1400 GDGIIKVSCGEDKMT-----WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLL 1454
GD G++ T W + + + VK + +A ++ + N ++
Sbjct: 216 GDSRQVTVVGQNNGTEESEDWLMFERPVHHPGSTESVK---VEIAFRVEVDGKTNKEKIV 272
Query: 1455 YQQ--PVFAFLPL-RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPAL 1504
+ P+ F P + F++QG F+ SR+ + +S WN+ L+ E L
Sbjct: 273 KSKASPLVVFFPTEQETRFGFLVQGPFLPTPSRDNILKDSEWNKLLIREVAQL 325
>gi|119179507|ref|XP_001241335.1| hypothetical protein CIMG_08498 [Coccidioides immitis RS]
Length = 2079
Score = 75.5 bits (184), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 125/506 (24%), Positives = 203/506 (40%), Gaps = 107/506 (21%)
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGY---IGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
NE GF+ E++ ++C +G S +KG SA IG KGIGFKSVF+V I S + KF
Sbjct: 59 NELGFAPEDVDSICRIGGS-QKGISADRMDCIGEKGIGFKSVFKVASVVWISSGKYSFKF 117
Query: 1301 DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI 1360
D + ++G + P + ++ P + + + +RL ++ E + +
Sbjct: 118 DNAT-RLGMIAP-------------MWAQFPAEAPASGTSMLLRLLNKSVRDELLKEMRV 163
Query: 1361 VDMFSDLHPSLLLFLHRLQ----CIMFRNMLNDSLVVIRK--KIV---GDGIIKVSCGED 1411
D P +L+FL++++ CI N + RK K V G + + G D
Sbjct: 164 CD------PKILIFLNQIRELKVCIAQDNGTTFERTISRKDDKDVDKPGQYVTTLRYG-D 216
Query: 1412 KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
FV + + +G EI LA ++ + Q+P
Sbjct: 217 SSQQFVVIRYVPSGR---STDKAEILLAFPIKNN----------QEP------------- 250
Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
V+ S RE+++G+S NQ L S P FV A F + FR ++
Sbjct: 251 -------VIQSHREDINGSSAENQRLRSLIPKAFVQAVDYFQESDYFR-------YIWPR 296
Query: 1532 FVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLR---GWNDRAHSLLP 1588
++P++ E FF+ L + +S +LE + + A P + + D LP
Sbjct: 297 YLPIIPE-ENFFTSLTEETI--FTLSQRKLLESESGELATPGTLFYVPPRFRDPNGDPLP 353
Query: 1589 DILLQKHLGLGFLSKDIVLSDS-LARALGIEEYGPKILLQIISSLCRTENGLRSMGLSW- 1646
L +LS+ DS L + LG+ + LL + S+ W
Sbjct: 354 ---LCSRTAFRYLSRKYFDEDSDLFQCLGVRHLSLRDLLVDVKSVVNQYQNHSREATEWH 410
Query: 1647 --LASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDE--GTIWL 1702
L+ LN + L++ T L+ IP IPL DG + S E G I+
Sbjct: 411 IRLSKALNR----------EDMLRAYRHT-----LRSIPLIPLRDGRWISASESAGIIFF 455
Query: 1703 HS---DCSVFDGGFGLEAFPNLCAKL 1725
S D SV G +E P+ + +
Sbjct: 456 PSALADLSVPSGLDSVEIHPDAASNV 481
>gi|227115501|ref|ZP_03829157.1| putative transcriptional regulator [Pectobacterium carotovorum subsp.
brasiliensis PBR1692]
Length = 1675
Score = 74.7 bits (182), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 90/400 (22%), Positives = 168/400 (42%), Gaps = 85/400 (21%)
Query: 1175 NISNMESNMLK-----------KQHARLGRALHCLSQELYSQDSHFLLELVQNADDN-IY 1222
NI N+++ ++K Q +L LS++LY+ F+ EL+QNADD+ +
Sbjct: 8 NIINLKAKVVKTFQDNINYSNPNQAVNQADSLDALSKDLYADSKRFIYELLQNADDSAVD 67
Query: 1223 PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV 1282
++V + I E+ ++V ++ + F +I+ +C++ N TKK S G KGIGFKSV
Sbjct: 68 GKSVNVWIKII--ENNLIVAHSGKPFDERDIQGICNINNGTKK-SDITKTGYKGIGFKSV 124
Query: 1283 FRVTDAPEIHSNGFHVKFDTS-----------------EGQIGFVLPTLVPPFNIDMFCR 1325
F + + +N + KFD + E F P +
Sbjct: 125 FGQSQKVIVFTNNEYFKFDVNYKHNWQWEESSQDKWEKENDRKFQYPWQII--------- 175
Query: 1326 LLSKDPVQLESKCWNTCIRLPFRT---KFSEGIAMNN---IVDMFSDL--HPSLLLFLHR 1377
P+ L+S T I R+ + I + N I+D +L ++ LFL
Sbjct: 176 -----PIYLQSNEIETNIHEYLRSIKATVATVIELKNDKDIIDSLQELSQKTNMYLFLKN 230
Query: 1378 LQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED-KMTWFVASQKLRAGVIRPDVKTT-- 1434
+ I+F + ++ + R +G + + C + + TW + ++ + D+K +
Sbjct: 231 ISNIIFDTERSVNVSIERN---ANGKVHLGCNNNCQSTWLSKTFEIS---LPEDIKNSLS 284
Query: 1435 -------------EIALALTLQESNEGNYGPLLYQQPVFAFLPL--RTYGLKFILQGDFV 1479
++ L L+ Q + + ++++LP Y ++ F+
Sbjct: 285 YDQNIPDKFKYIEKVDLTLSAQVKDGSIVKLHRTDKLLYSYLPTDETKYDFPVLVNTSFL 344
Query: 1480 LPSSREEVDGNSPWNQWLLS-------EFPALFVSAERSF 1512
++RE + +S WNQWL + E+ A V +E F
Sbjct: 345 TTANRESLHASSVWNQWLFTQIGTKLFEWIAELVQSEYQF 384
>gi|320585933|gb|EFW98612.1| hypothetical protein CMQ_4464 [Grosmannia clavigera kw1407]
Length = 1083
Score = 74.3 bits (181), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 43/103 (41%), Positives = 64/103 (62%), Gaps = 4/103 (3%)
Query: 1203 YSQDSH-FLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-NEQGFSAENIRALCDVG 1260
+S +SH F+LEL+ NAD+N Y ++ +PTLT G L+ NE+GFSA+++ A+CD
Sbjct: 47 WSTNSHRFVLELLHNADENTY-DDADPTLTMTHTADGHFRLHCNERGFSAKDVDAVCD-N 104
Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
NS+ K + G KG+GFK+VFRV + S+ + K S
Sbjct: 105 NSSAKAALRTATGEKGVGFKAVFRVAHTVWVVSDPYRFKIRRS 147
>gi|328957430|ref|YP_004374816.1| putative transcriptional regulator [Carnobacterium sp. 17-4]
gi|328673754|gb|AEB29800.1| putative transcriptional regulator [Carnobacterium sp. 17-4]
Length = 1675
Score = 73.9 bits (180), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 68/117 (58%), Gaps = 2/117 (1%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ 1246
Q +L LS +LY+ + F+ EL+QNADD+ N+ I E +V+ ++
Sbjct: 22 QAVNQASSLDALSGDLYTDSTRFIYELLQNADDSSSNNNLIKVWIKIFDEY-LVIAHSGT 80
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
F+ ++IR +C+V N TKK S G KGIGFKSVF ++ I++N +++FD+S
Sbjct: 81 PFNIQDIRGICNVNNGTKK-SDITKTGYKGIGFKSVFGQSNYVTIYTNNEYLRFDSS 136
>gi|404319253|ref|ZP_10967186.1| hypothetical protein OantC_13717 [Ochrobactrum anthropi CTS-325]
Length = 1062
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 141/640 (22%), Positives = 246/640 (38%), Gaps = 118/640 (18%)
Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
++LY +HF+ EL+QNA+D + ++FIL E + +N + F ++I + D+
Sbjct: 29 EDLYPDSAHFIYELLQNAEDREASQ-----VSFILFEDKLTFTHNGESFRPKDIYGITDI 83
Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFN 1319
G TK IGR G+GFK+VF T++P I S F F + LV P
Sbjct: 84 GEGTK-ADDEDRIGRFGVGFKAVFAFTESPHIWSPTF-----------SFKITELVLPSE 131
Query: 1320 IDMFCRLLSKDPVQLESKCWNTCIRLPFRT-KFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
ID L ++ T PF K S A I +++ S LLFL +
Sbjct: 132 IDSLPGLGTQ-----------TRFDFPFNNPKKSREDAHREIKAGLNEIAESTLLFLPNI 180
Query: 1379 QCIMFRNMLNDSL-------------VVIRKKIVGDGIIKVSCGE-DKMTWFVASQKLRA 1424
+ I ++ + DS+ V + K++ G + D+ + Q++
Sbjct: 181 ESIKWQ--IGDSVSGEVLRIEHTRKHVEVLKQVNGQTTTSAHFLKFDQAVKGLEKQRVAI 238
Query: 1425 GV---IRPDVK--TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFV 1479
P+VK +T+ ALA L+ P Q VF + GL+F L FV
Sbjct: 239 AFALDFAPNVKSLSTDKALAQQLK------IVPAPGQVCVFFPANKESSGLRFHLHAPFV 292
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV 1539
SR + +P N L + AL SA DL + ++ +P +
Sbjct: 293 PELSRASIK-ETPANHPLYDQLAALCASALHEIRDLGLL-------TTEFLGVLPNAQDA 344
Query: 1540 HGFFSGLPRMILSKLR-MSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGL 1598
G G ++ S +R M + + AP +++ +A + L ++L + +
Sbjct: 345 LGKGYGYEQIRDSIIRAMKTEPLFPTHGKGHAPADRLV-----QAKASLKELLGEDDIKY 399
Query: 1599 GFLSKD---------IVLSDSLAR---ALGIEEYGPKILLQII--SSLCRTENGLRSMGL 1644
D + ++ R AL I ++ + + + + + G+ L
Sbjct: 400 LIEYDDEPPLWAAPRALQGTNVERFMTALEIRDWDIEDFVDCLDEKTTVGSWKGVDKEFL 459
Query: 1645 SWLA----SWLNELYT-ISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGT 1699
+WL W +LY+ + S QS+L L+R I LSDG++++ +
Sbjct: 460 TWLGHKTVEWHQQLYSLLDRESEAQSALY---------RLKRCQLIRLSDGSYATGNACH 510
Query: 1700 IWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIG 1759
D S +R V PA+ + S +N R L +G
Sbjct: 511 FADERDLS--------------AEGVRCVDPAVYT------SGKSKAQQENARRFLESLG 550
Query: 1760 VQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMM 1799
V + +V+ + + + + +AD F+ +
Sbjct: 551 VTGVGERQLVEALLKSKYTGDNREFNQRTYLADLRRFIKL 590
>gi|47206634|emb|CAF91966.1| unnamed protein product [Tetraodon nigroviridis]
Length = 911
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 97/453 (21%), Positives = 181/453 (39%), Gaps = 73/453 (16%)
Query: 1644 LSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLH 1703
+ W W F +S + S+ + ++ +L+ +P IPL+DG ++ E ++
Sbjct: 45 IDWPGCW--------FATSERLSMATAERESVLQSLRELPIIPLADGRVVALGEEGVFFP 96
Query: 1704 SDCSVFDGGFGLE--AFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQ 1761
+ ++ + L + V P+LLS L + + ++L +GV
Sbjct: 97 METQTKKSKDLIQTGSLAALYKDISVVEPSLLSC-------LEPLESQQVLKLLRMLGVH 149
Query: 1762 QLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRK 1821
+L ++ HI P I + ++++ YL F+ H ++ ++L
Sbjct: 150 ELEPQQLLDQHIYPTIQSNKWKSKPESVVVSYLVFIKQH-------STSSHDYANADLP- 201
Query: 1822 KAFVLTNHGFKRPSEIPIHFGKEFGN-PVSVNMLIHDIDIKWYEVDITYLKHPAKESLSC 1880
VLTN G P+E +HF ++ N + + HD W V Y+K
Sbjct: 202 ---VLTNRGLLCPTEEKVHFSVDYDNIDLPERLPGHD----WILVSSCYVKTDG------ 248
Query: 1881 GLVKWRKFFKEIGITDFV-----QVVQVDKDVADISHTGF-------KNMWTKELLSPGS 1928
+ WR+ +G+ D + +V +++ H F KN T ++ G
Sbjct: 249 DVAGWRELLSRLGVRDGLIIRKERVALTAEELVRALHMCFSAVCRLHKNAITCYQMNQGK 308
Query: 1929 AA-IDWESNELVHLLSL--LTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDD 1985
+D E H L+ L +V Q LLE+L WD + + + +++ D
Sbjct: 309 GCELDDYPCEEFHALATAQLPDSVLLQQRLTLLEMLANNWDTGH--RYSQYLRAQLVDGD 366
Query: 1986 ----RSFQSSFINCICDIQWT-----ISSMDDELHY--PKDLFHDCDAVRSILGPSAPY- 2033
RS +SSF + +C ++W + E Y P ++ V S+LG Y
Sbjct: 367 GREVRSTRSSFHHYMCSLEWVPAFRPLEGERQERKYLRPTSVYLSSPEVNSLLGTHVCYA 426
Query: 2034 -IVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
I P S IG++ ++++ ++ LK W
Sbjct: 427 EITPSDFSRA----IGMRHNISVEAMINYLKEW 455
>gi|338213695|ref|YP_004657750.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336307516|gb|AEI50618.1| hypothetical protein Runsl_4277 [Runella slithyformis DSM 19594]
Length = 1581
Score = 72.0 bits (175), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 123/527 (23%), Positives = 226/527 (42%), Gaps = 70/527 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT---LTFILQESGIVVLNN 1244
+ +L +AL+ + +E+ F+ EL+QNA D YP+ ++F + ++ ++ ++
Sbjct: 140 NQQLAKALNSIQREINKSPETFIFELLQNASD--YPDRARENKVYVSFKISDNYLLFCHS 197
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
F N+ ++C V N + K S IG KGIGFKSVF+ + I S G+ +FD+S+
Sbjct: 198 GAVFKVNNVHSICSV-NESDKSDSIDKIGFKGIGFKSVFKDCNFAYIKSGGYSFRFDSSK 256
Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIV-- 1361
FV T P I ++ D +E+ + N+ + + + + E + +++
Sbjct: 257 ----FV--TEKPYQIIPIWSEPSEMDSEIVETTIFKNSTVSIALKPRRREKLEEYHVILN 310
Query: 1362 DMFSDLHPSLLLFLHRLQCIMF----------RNMLNDSLVVIRKKIVGDGIIKVSCGED 1411
+F+D +LLFL ++ +N+L D + + ++ D + + E
Sbjct: 311 RLFTD--DRILLFLPNIENAKIITPTGFKESKKNLL-DYWMGKEQVVIPDNVREWINKEI 367
Query: 1412 KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT-YGL 1470
V +K D++++ I A S ++ ++ +LP + G
Sbjct: 368 DDNESVVPEKYY------DIQSSIIQFA----TSRSSDHILKTTNAVLYNYLPTKVNLGF 417
Query: 1471 KFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPA--KAASV 1528
F++ GDF+ P E N WN +L+ + LF+ RS L N K
Sbjct: 418 DFLINGDFI-PDGTREFFHNIKWNDFLIEQSGFLFLKWVRS-IGLKKDERNAGNFKFNRN 475
Query: 1529 YMSFVP--------LVGEVH-----GFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKV 1575
Y+S +P ++ E + F G +L + ++ + + Q P +
Sbjct: 476 YLSIIPDFTIIENQIIDERNKHLLLSFKKGFETGLLGTVEVAPIAFIPAQSGQLEPLNNI 535
Query: 1576 LRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS---LARALG-IEEYGPKILLQIISS 1631
L A LLPD GL LS+ ++ SDS L++ G I+ Y L ++
Sbjct: 536 LIDKTGIA-VLLPD----DFKGLTGLSQKLISSDSGEGLSKVEGLIKAYSTVGKLYLVDD 590
Query: 1632 LCRTENGLRSMGLS-WLASWLNELYTIS-FHSSGQSSLQSGVETDLI 1676
L + L+ WL + +N I F++S SL+ + T+ I
Sbjct: 591 L---KADLKKEAFQIWLKNPVNNYKLIKHFYTSTDPSLKLLLTTESI 634
>gi|262198949|ref|YP_003270158.1| transglutaminase [Haliangium ochraceum DSM 14365]
gi|262082296|gb|ACY18265.1| transglutaminase domain protein [Haliangium ochraceum DSM 14365]
Length = 2543
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/336 (22%), Positives = 142/336 (42%), Gaps = 53/336 (15%)
Query: 1194 ALHCLSQELYSQDSHFLLELVQNADD---NIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
A LS+ +Y+ HFL+EL+QNA+D + +++ + V ++ F A
Sbjct: 230 AEELLSRRVYTDPGHFLVELLQNAEDAGARCWRADID--------AREVSVWHDGVPFDA 281
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
+++ + +G +TK G+ G+GFKSV+ + + P+++S F +F+ +
Sbjct: 282 KDVVGVLSIGQTTKHKEQIGFF---GVGFKSVYEICERPQVYSGPF--RFEIA------- 329
Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPS 1370
++ + RL ++ E T + LP R + ++ P
Sbjct: 330 --------DVSLPRRLAARPEGYPEH---GTLLVLPLREPEDPARTPERLYQRAREVPPE 378
Query: 1371 LLLFLHRLQCIMFRNMLNDSLVVIRKKIV-GDG------------IIKVSCGEDKMTWFV 1417
LL L ++ + + GD I V D T ++
Sbjct: 379 TLLTLRNIREMRIAQPAQSRTIRAEAGTAPGDASSDGAEPNSAQRIDLVHLERDTRTRYI 438
Query: 1418 ASQKLRA---GVIRPDVKTTEIALALTLQESNEGNYGPLLY-QQPVFAFLPLRTYG-LKF 1472
+++ RA G R +++ + + L+ + +G PL + V+++LP R L+F
Sbjct: 439 IARE-RATWEGQAREGSRSSSTEVLVALRIAPDGVPVPLSEGEATVYSYLPTRERSRLRF 497
Query: 1473 ILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
++ F LP RE +D +SP+N+WLL L A
Sbjct: 498 LVHAHFDLPVDRERLDLDSPYNRWLLVNAGTLLAGA 533
>gi|410614948|ref|ZP_11325982.1| hypothetical protein GPSY_4260 [Glaciecola psychrophila 170]
gi|410165514|dbj|GAC39871.1| hypothetical protein GPSY_4260 [Glaciecola psychrophila 170]
Length = 1204
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 139/327 (42%), Gaps = 54/327 (16%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN-EQGFSAENIRAL 1256
L ELY ++HF+ EL+QNA+D E + F L ES ++V +N ++ F+ E++ ++
Sbjct: 34 LLTELYPDNAHFIYELLQNAEDAEASE-----IHFSLSESCLIVKHNAKRQFTFEDVNSI 88
Query: 1257 CDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVP 1316
+GN+TKK IG+ G+GFK+VF T +P + SN Q F + LV
Sbjct: 89 TGIGNTTKKDDVVQ-IGKFGVGFKAVFSYTTSPRVFSN-----------QYSFEIRDLVI 136
Query: 1317 PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDM---FSDLHPSLLL 1373
P I+ E + T PF + S + + +++ L +LL
Sbjct: 137 PHKINN----------NYEGH-YETIFEFPFNS--SNKLPKDAFIEIKKGLDTLSVKILL 183
Query: 1374 FLHRLQCIMFRNMLNDSLVVIRKKIVGDGI---IKVSCGEDKMTWFVASQKLRAGVIRPD 1430
FL ++ I + + + + + +GD I K G +T ++ + G I
Sbjct: 184 FLQNIKKITWEIESHKTATIRKVNEIGDYISIEKKSRDGLITLTHWLRFNRWLQGYIGLC 243
Query: 1431 VKTTEIALALTLQESNEGNYGP-----------LLYQQPVFAFLPL--RTYGLKFILQGD 1477
V IA L + N + P + Q V + P T LKF +
Sbjct: 244 VS---IAYKLDYKTKNITEFNPKKGLDEQFKIVIDLQSNVSVYFPAEKETSNLKFNINAP 300
Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPAL 1504
F +R+ + N P N+ L+ E L
Sbjct: 301 FASTVARDSIQ-NRPENEILIEEIAKL 326
>gi|254516519|ref|ZP_05128578.1| conserved hypothetical protein [gamma proteobacterium NOR5-3]
gi|219674942|gb|EED31309.1| conserved hypothetical protein [gamma proteobacterium NOR5-3]
Length = 1083
Score = 71.6 bits (174), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 92/206 (44%), Gaps = 31/206 (15%)
Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN-EQGFSAEN 1252
L L ELY ++HF+ E++QNA+D T+ F+L+ +VV ++ E+ FS +
Sbjct: 28 GLRNLLSELYPDNAHFIYEVLQNAEDA-----RATTVDFMLESDRLVVTHDGERPFSLND 82
Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
I ++ +G STKK IG+ G+GFK+VF T PE+ S G+ FV+
Sbjct: 83 IESITSIGQSTKKDDETS-IGKFGVGFKAVFAYTTRPEVRS-----------GKFNFVIE 130
Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGIAMNNIVDMFSDLHPSL 1371
L P D + T PF R + +A+ + +L
Sbjct: 131 DLFVPRLTDG------------SAPTGKTSFTFPFDRPEKESSVAVAEVQRGLQELDEKT 178
Query: 1372 LLFLHRLQCIMFRNMLNDSLVVIRKK 1397
LLFL + I + +V+R++
Sbjct: 179 LLFLSHISTINYSLPDGTDAIVMREE 204
>gi|159486652|ref|XP_001701352.1| hypothetical protein CHLREDRAFT_153825 [Chlamydomonas reinhardtii]
gi|158271747|gb|EDO97560.1| predicted protein [Chlamydomonas reinhardtii]
Length = 91
Score = 71.2 bits (173), Expect = 6e-09, Method: Composition-based stats.
Identities = 48/85 (56%), Positives = 54/85 (63%), Gaps = 10/85 (11%)
Query: 1216 NADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK---------- 1265
NADDN YP V P L F+L I NNE GFSA ++RALCDVG STK
Sbjct: 1 NADDNAYPAGVTPCLEFVLGRGHITAANNELGFSAAHLRALCDVGASTKAVGGGAGGGGG 60
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPE 1290
G GYIG+KGIGFKSVFRV+ AP+
Sbjct: 61 GGGGGYIGQKGIGFKSVFRVSAAPQ 85
>gi|320594061|gb|EFX06464.1| hypothetical protein CMQ_6785 [Grosmannia clavigera kw1407]
Length = 353
Score = 70.9 bits (172), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 72/151 (47%), Gaps = 12/151 (7%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNNEQGFSAENI 1253
L+ S+ S+ +HF+ EL+QNADD Y NV + + ++ +NE GF+ ++
Sbjct: 65 LNLASKGQNSKSTHFIYELLQNADDASYGGNVPRVCIRYDTTNRTFLIESNEIGFTKADV 124
Query: 1254 RALCDVGNSTKK---------GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD--T 1302
++C STK ++ IG KG+GFKSVF D I S + +FD T
Sbjct: 125 ESICSAAESTKSKKMPAASATATTERSIGEKGLGFKSVFLAADKVWISSGHYSFRFDGET 184
Query: 1303 SEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQ 1333
GQI + T P D F L P++
Sbjct: 185 PVGQIQPIWDTQFPGLKTDGFTAKLLHIPIR 215
>gi|448300737|ref|ZP_21490736.1| type III restriction protein res subunit [Natronorubrum tibetense
GA33]
gi|445585556|gb|ELY39851.1| type III restriction protein res subunit [Natronorubrum tibetense
GA33]
Length = 1611
Score = 70.9 bits (172), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 116/513 (22%), Positives = 201/513 (39%), Gaps = 66/513 (12%)
Query: 1213 LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYI 1272
++QNADD I E +T L E +V N+E+ S +N+ AL STK+G I
Sbjct: 1 MLQNADD-IGGNCSE--VTIQLTEDSLVFQNHEEPMSVKNVEALGAYTRSTKQGD-LDSI 56
Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
G GIGFK+VF + + +HS F ++ T E T+ P D
Sbjct: 57 GHFGIGFKTVFSLANVVYVHSGFFSFRYTTDER-------TVPQPVG--------QNDQP 101
Query: 1333 QLESKCWN-TCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
+ +C++ T I LP + I++ D SLL FL+ + +
Sbjct: 102 ATDDECYDGTTITLPLTDDAKKN--RREILEEQLDSIGSLLPFLNNIDTVHVSAF--GER 157
Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPD-----------------VKTT 1434
R + DG +V C ++ +L +G PD ++
Sbjct: 158 KTFRASPIDDGGFEVICEQNGEPVRSERMRLFSGTFEPDRELVNYLAEKRNLNTDALREQ 217
Query: 1435 EIALALTLQESNEGNYGPL-LYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSP 1492
+ L + + + + P+ L + +F + P L F +Q DF L R+ +
Sbjct: 218 TLKLDIEIAIPIDTDGAPMALDESHLFCYFPTEPDTNLPFHIQADFSLKPDRKHIIWPDE 277
Query: 1493 WNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVP---LVGEVHGFFSGLPRM 1549
+N+ LL+ +F E +F L +P++ + VP L E+ + L
Sbjct: 278 FNEQLLNCAVDVF---ETAFVQLHSELVDPSQI----LELVPDPNLEREMDPYIDPLVTD 330
Query: 1550 ILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSD 1609
I+S +R +C + +G +N + P V + D L + + L +SD
Sbjct: 331 IVSFVRSESC-VPDGRDNLFQPSEVVF--FADPFRDLFTEAEVSDVLDRPVKYPSEQISD 387
Query: 1610 SLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWL--------ASWLNELYTISFHS 1661
+ L ++ L+ C + S +WL W +E F
Sbjct: 388 TARERLQAVVPDSQLQLETFLDHCTDSSLFESKSDNWLIRFLAGIMRHWSSEYEVSRFGR 447
Query: 1662 SGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSS 1694
+ SS + + + D+L+ + F+PL DG+ +S
Sbjct: 448 T--SSEEKQAQKEFTDSLEEVSFLPLEDGSVTS 478
>gi|148550774|ref|YP_001260213.1| hypothetical protein Swit_5338 [Sphingomonas wittichii RW1]
gi|148503193|gb|ABQ71446.1| hypothetical protein Swit_5338 [Sphingomonas wittichii RW1]
Length = 1934
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 91/349 (26%), Positives = 151/349 (43%), Gaps = 68/349 (19%)
Query: 1196 HCLSQELYSQDSH--FLLELVQNADDNIYPENVEPTLTFILQ----ESG-IVVLNNEQGF 1248
LSQ + + D H FL+EL+QN D + + +L +SG + V N Q F
Sbjct: 128 QSLSQHIIA-DYHGRFLIELIQNGHDAHDKARRDGEIAVLLAGDEGDSGTLYVANRGQPF 186
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS-----------NGFH 1297
++ N+ ALC++G S+K A +G KG+GF+SV V+D P+I+S +G+
Sbjct: 187 ASRNVDALCEMGLSSKPPGEA--VGNKGLGFRSVRHVSDIPQIYSRLAEDSPGDLFDGYR 244
Query: 1298 VKFDTSEGQIGFVLP---------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR 1348
F ++ +LP T +P F I + + K +++ IRLP R
Sbjct: 245 FTFARGS-ELDPLLPDEATRALARTDLPAFYIPRWLDQTPDTVREFARKGFSSVIRLPLR 303
Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC----IMFRNMLNDS--LVVIRKKIVGD- 1401
+ + + I+ + P +LLFL R+ I + D+ L + R + D
Sbjct: 304 NQAALDVVRAEILSLAKAEAP-MLLFLRRIATLRAEIQGDRIKEDALDLKLTRSETPLDD 362
Query: 1402 --GIIKVSCGEDKMTWFVASQKLRAGVIRPDVK-----------------TTEIALALTL 1442
G V GE+ W VA ++ G ++ V + E+A+A+ +
Sbjct: 363 AQGPALVDLGENGSFW-VARAEIPEGDMQAAVAEGLESGALPESWNGWKGSGEVAVAVPI 421
Query: 1443 QESNEGNYGPLLYQQPVFAFLPL---RTYGLKFILQGDFVLPSSREEVD 1488
E+ ++ P LY FLP+ T + L G F S+R +D
Sbjct: 422 GETMP-DFHPRLY-----TFLPMGRGATSPFRGHLHGSFFPSSNRTSLD 464
>gi|402825950|ref|ZP_10875192.1| hypothetical protein LH128_22929 [Sphingomonas sp. LH128]
gi|402260506|gb|EJU10627.1| hypothetical protein LH128_22929 [Sphingomonas sp. LH128]
Length = 1846
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 88/348 (25%), Positives = 149/348 (42%), Gaps = 66/348 (18%)
Query: 1196 HCLSQELYSQDSH--FLLELVQNADDNIYPENVEPTLTFILQ----ESG-IVVLNNEQGF 1248
LSQ + + D H FL+EL+QN D + + +L +SG + V N Q F
Sbjct: 40 QSLSQHIIA-DYHGRFLIELIQNGHDAHDRARRDGEIAVLLAGDEGDSGTLYVANRGQPF 98
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS-----------NGFH 1297
++ N+ ALC++G S+K A +G KG+GF+SV V+D P+I+S +G+
Sbjct: 99 ASRNVDALCEMGLSSKPPGEA--VGNKGLGFRSVRHVSDIPQIYSRLTEDSPGDLFDGYR 156
Query: 1298 VKFDTSEGQIGFVLP---------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR 1348
F ++ +LP T +P F I + + K +++ IRLP R
Sbjct: 157 FTF-ARGSELDPLLPDEATRALARTDLPAFYIPRWLDQTPDTVKEFARKGFSSVIRLPLR 215
Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI--------MFRNMLNDSLVVIRKKIVG 1400
+ + + I+ + P +LLFL R+ + + + L+ L +
Sbjct: 216 NQAALDVVRAEILGLAKAEAP-MLLFLRRIATLRAEIQGDRIKEDALDLKLSRSETPLED 274
Query: 1401 D-GIIKVSCGEDKMTWF------------VASQKLRAGVIRPDVK----TTEIALALTLQ 1443
D G V GE+ W ++ L +G + K + E+A+A+ +
Sbjct: 275 DQGPALVDLGENDSFWVARAEIPESDMQAAVAEGLESGALPESWKGWKGSGEVAVAVPIG 334
Query: 1444 ESNEGNYGPLLYQQPVFAFLPL---RTYGLKFILQGDFVLPSSREEVD 1488
E+ ++ P LY FLP+ T + L G F S+R +D
Sbjct: 335 ETMP-DFHPRLY-----TFLPMGRGATSPFRGHLHGSFFPSSNRTSLD 376
>gi|195996079|ref|XP_002107908.1| predicted protein [Trichoplax adhaerens]
gi|190588684|gb|EDV28706.1| predicted protein [Trichoplax adhaerens]
Length = 1467
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 146/356 (41%), Gaps = 50/356 (14%)
Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
Q +++ F++EL+QNADD + + F + +N + F E++ A+CD
Sbjct: 39 QTVFNSPEKFVMELLQNADD-ASCGDTSVDVRFRFYNGYLAFSHNGKHFDEEDVEAICDN 97
Query: 1260 G--NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF----------DTSEGQI 1307
N K + G KGIGFKSVF + D I S + +F D+ E +
Sbjct: 98 AQQNLHSKRLNINKTGYKGIGFKSVFIIADCTYIISRNYCFRFDRDYTNWKESDSDENRA 157
Query: 1308 GF----VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDM 1363
G ++P P ++ ++L +P + FR + + + + + +++
Sbjct: 158 GRYPWQIIPIWTEPCDVP---KILQLNPAD---------VNFLFRLRSNNDVIV-SALNI 204
Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED-KMTWFVASQKL 1422
S+ P +LFL +Q I+ N + + + R ++ + K +W V + +
Sbjct: 205 LSNT-PRTMLFLRHIQNIVIDNHGSITCINRRNELTSQQRYDLYLNNVLKSSWHVMAYSV 263
Query: 1423 --------RAGVIRPD--------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLP-L 1465
+ + P K EI A+ + L +F +LP L
Sbjct: 264 NIPKTVLAQLQAMGPQECPEKLKKAKNIEIKFAINIDPVENTKLNALDKNANLFCYLPTL 323
Query: 1466 RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFREN 1521
GL F++ DF+L + R ++ N+ WN +L L +++ P FR++
Sbjct: 324 VNCGLPFLVNTDFLLTADRMQLLSNA-WNNFLFESIGKLHITSLAEMVQFPGFRDD 378
>gi|339896689|ref|XP_001463180.2| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321398943|emb|CAM65531.2| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 1660
Score = 68.2 bits (165), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 131/281 (46%), Gaps = 28/281 (9%)
Query: 170 LKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGI 229
L++ + S +SL F +++VP+L + + R+ + + + R I ++++LEE +C EG+
Sbjct: 85 LEMLISSTNSLPFALRDVPALANVAALQLRLTSTVMAAISTRAIVTVHELEEWVCAQEGV 144
Query: 230 EKFEELELGPFLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKV--- 285
E F EL LG L+ P++ YF ++++ VVF + DVI L DT A+D+ +
Sbjct: 145 EHFAELGLGVGLQVLPVVQEYFQLRAN-SVVFPVRARDVIAFL--LNDT-TARDMLLYGG 200
Query: 286 ----DEFLDFIA--KKRSL--ASKEQLGVRIQNLGMHINFIREARKSQDVT--LKKFLKE 335
D F A ++ L AS ++ G +N + QD T L +E
Sbjct: 201 GDTRDLLYRFAAFYERHVLRNASPSSASATVRGQGRLLNVRQLGIHVQDYTALLAALTQE 260
Query: 336 LQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSD 395
L H+ + + +R ++ +A V +S + G + S++ ED
Sbjct: 261 LASAHR-----LEQQQYQRWIESCVTAETA-VTQTASTRDSAGVAAAKD-SAAGEDALVR 313
Query: 396 DCTYECSNDISSNVQLPSQIKGS-DRVSSCPYPSVTEELKR 435
D ++E + N L ++ S D S+C + + E ++R
Sbjct: 314 DASFEVAARRERNRALYERVLHSFD--SACAHAELVEGMRR 352
>gi|398010417|ref|XP_003858406.1| hypothetical protein, conserved [Leishmania donovani]
gi|322496613|emb|CBZ31683.1| hypothetical protein, conserved [Leishmania donovani]
Length = 1656
Score = 68.2 bits (165), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 62/103 (60%), Gaps = 2/103 (1%)
Query: 170 LKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGI 229
L++ + S +SL F +++VP+L + + R+ + + + R I ++++LEE +C EG+
Sbjct: 85 LEMLISSTNSLPFALRDVPALANVAALQLRLTSTVMAAISTRAIVTVHELEEWVCAQEGV 144
Query: 230 EKFEELELGPFLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCL 271
E F EL LG L+ P++ YF ++++ VVF + DVI L
Sbjct: 145 EHFAELGLGVGLQVLPVVQEYFQLRAN-SVVFPVRARDVIAFL 186
>gi|195997659|ref|XP_002108698.1| predicted protein [Trichoplax adhaerens]
gi|190589474|gb|EDV29496.1| predicted protein [Trichoplax adhaerens]
Length = 1419
Score = 68.2 bits (165), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 144/351 (41%), Gaps = 57/351 (16%)
Query: 1186 KQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNE 1245
K L L Q +++ F LEL+QNADD +N +TF + + + +
Sbjct: 19 KNSTTLANTLDKDIQTVFNSPEKFALELLQNADDAAI-DNQPIHVTFQINQRYLAFSHTG 77
Query: 1246 QGFSAENIRALCDVGNS--TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
+ F +++ A+CD T+K + G KGIGFKSVF ++ I S + +FD
Sbjct: 78 RHFDYQDVEAICDNAQQQFTRKKTDLSKTGFKGIGFKSVFVISHRVYIISRNYCFRFDRD 137
Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP--FRTKFS--------- 1352
+ + D F R P Q+ + W +LP F S
Sbjct: 138 YAKWR--------NNDSDRFRRY----PWQI-TPIWTQQNQLPKVFELNKSHVTFLFQLV 184
Query: 1353 -EGIAMNNIVDMFSDLH-----PSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKV 1406
E ++NN+ +++ L P+ +LFL +Q I R + +RK ++
Sbjct: 185 DETRSVNNVEKVYNALKILMQTPNTILFLRNIQEITIRQHNRIDSIQLRKLFDTSNQYEL 244
Query: 1407 SCGED-KMTWFVAS----------QKLRAGVIRPD-------VKTTEIALALTLQESNEG 1448
+ K W + S QKL+ PD + T + L EG
Sbjct: 245 YVNDTLKSKWLIRSDEQQIPNEIHQKLQR---LPDQECPTKLKQATSVNLKFAFLFGEEG 301
Query: 1449 NYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
++ +F +LP + Y GL F++ GDF+L + R ++ N WN+++
Sbjct: 302 TLKA-HHKAVLFCYLPTQMYSGLPFLVNGDFLLSADRTQLLTN-IWNEYMF 350
>gi|300787661|ref|YP_003767952.1| hypothetical protein AMED_5803 [Amycolatopsis mediterranei U32]
gi|384151062|ref|YP_005533878.1| hypothetical protein RAM_29730 [Amycolatopsis mediterranei S699]
gi|399539544|ref|YP_006552206.1| hypothetical protein AMES_5725 [Amycolatopsis mediterranei S699]
gi|299797175|gb|ADJ47550.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340529216|gb|AEK44421.1| hypothetical protein RAM_29730 [Amycolatopsis mediterranei S699]
gi|398320314|gb|AFO79261.1| hypothetical protein AMES_5725 [Amycolatopsis mediterranei S699]
Length = 1728
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 147/366 (40%), Gaps = 73/366 (19%)
Query: 1206 DSHFLLELVQNADDNIYPENVEPTLTFILQES----GIV-VLNNEQGFSAENIRALCDVG 1260
D ++EL+QN D + L F+L E+ G++ N + EN R++C +
Sbjct: 54 DGRTVVELLQNGHDAHDSGRSDGKLEFLLDENEGAHGVLYAANGGRPLDQENFRSMCLIA 113
Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFHVKF---DTSEG--- 1305
S+K+ IG KG+GFKSV +++D PE++S +GF +F D +G
Sbjct: 114 RSSKRPDEG--IGNKGVGFKSVLQLSDCPEVYSARHEGAGRFDGFRFRFGRPDDFDGLAA 171
Query: 1306 ------------------QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
+ +P P ++ F R + ++T +RLP
Sbjct: 172 RSKENWPGLADELRENVSTLKITVPLFETPTVVEDFAR-----------RGYSTVVRLPL 220
Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDS-----LVVIRKKIVGDG 1402
R+ + A + + ++ D P LFL R+ I+ R+ ++ L + G
Sbjct: 221 RSASTRDHARDQLEELAHDEVP-FHLFLTRVAEIVVRSQAGETESRRRLTRSTRARRGTS 279
Query: 1403 IIKV---SCGEDK----MTWFVASQKLRAGVIRPDVKTTEIALALT-LQESNEGNY---- 1450
+ V GE + + VA +RA IR + I Q + E +
Sbjct: 280 TLSVEHIDLGEQREYVILKRVVAETGIRA-AIRASLDDNAIGSGWAGWQGAAEVSVAVPL 338
Query: 1451 GPLLYQQPVFAFLPLRTYG---LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
G L Q V+ FLPL + ++ G F R +V + P N +L +E AL
Sbjct: 339 GAALSQGRVYTFLPLGPGSPCPIAALVNGPFFTRLDRRDVHESIPLNDFLFAEIAALCAD 398
Query: 1508 AERSFC 1513
A + C
Sbjct: 399 AVVAAC 404
>gi|195997657|ref|XP_002108697.1| hypothetical protein TRIADDRAFT_51895 [Trichoplax adhaerens]
gi|190589473|gb|EDV29495.1| hypothetical protein TRIADDRAFT_51895 [Trichoplax adhaerens]
Length = 558
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 141/337 (41%), Gaps = 57/337 (16%)
Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
Q +++ F LEL+QNADD +N +TF + + + + + F +++ A+CD
Sbjct: 33 QTVFNSPEKFALELLQNADDAAI-DNQPIHVTFQINQRYLAFSHTGRHFDYQDVEAICDN 91
Query: 1260 GNS--TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPP 1317
T+K + G KGIGFKSVF ++ I S + +FD +
Sbjct: 92 AQQQFTRKKTDLSKTGFKGIGFKSVFVISYRVYIISRNYCFRFDRDYAKWR--------N 143
Query: 1318 FNIDMFCRLLSKDPVQLESKCWNTCIRLP--FRTKFS----------EGIAMNNIVDMFS 1365
+ D F R P Q+ + W +LP F S E ++NN+ +++
Sbjct: 144 NDSDRFRRY----PWQI-TPIWTQQNQLPKVFELNKSHVTFLFQLVDETRSVNNVEKVYN 198
Query: 1366 DLH-----PSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED-KMTWFVAS 1419
L P+ +LFL +Q I R + +RK ++ + K W + S
Sbjct: 199 ALKILMQTPNTILFLRNIQEITIRQHNRIDSIQLRKLFDTSNQYELYVNDTLKSKWLIRS 258
Query: 1420 ----------QKLRAGVIRPD-------VKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
QKL+ PD + T + L EG ++ +F +
Sbjct: 259 DEQQIPNEIHQKLQR---LPDQECPTKLKQATSVNLKFAFLFGEEGTLKA-HHKAVLFCY 314
Query: 1463 LPLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
LP + Y GL F++ GDF+L + R ++ N WN+++
Sbjct: 315 LPTQMYSGLPFLVNGDFLLSADRTQLLTN-IWNEYMF 350
>gi|242071029|ref|XP_002450791.1| hypothetical protein SORBIDRAFT_05g018540 [Sorghum bicolor]
gi|241936634|gb|EES09779.1| hypothetical protein SORBIDRAFT_05g018540 [Sorghum bicolor]
Length = 1582
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 119/265 (44%), Gaps = 47/265 (17%)
Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVP--PFNIDMFCRLLSKD 1330
GIGFKSVF ++ P I SNG+ +KF+ +E IG+++P V P D+ K
Sbjct: 25 GIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECNIGYIVPEWVESGPTLSDI------KA 78
Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND- 1389
T I LP +++ ++ + S LHP +LLFL +++ + + +D
Sbjct: 79 IYGCSKILPTTTIILPLKSE-----KVDAVKKQLSSLHPEMLLFLSKIRKLSVQEHNSDP 133
Query: 1390 -SLVV------------IRKKIVGDGII------KVSCGEDKMTWFVASQKLRAGVIRPD 1430
S+ V RK + + + GE++ +++ QK ++P+
Sbjct: 134 KSITVSEISISSEKNYQARKNMHAESYTLHLSSEESGKGEEECGYYMWKQKFP---VKPE 190
Query: 1431 VKT------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSS 1483
+ E + L + G L V+AFLP FI+Q DF+L SS
Sbjct: 191 YRVDKRAEIDEWVITLAFPHGQRLSRGKQL-SPGVYAFLPTEMVTNFPFIIQADFLLASS 249
Query: 1484 REEVDGNSPWNQWLLSEFPALFVSA 1508
RE + +S WN+ +L P+ F++A
Sbjct: 250 REAILFDSLWNKGILECVPSAFMNA 274
>gi|455650231|gb|EMF29014.1| hypothetical protein H114_11886 [Streptomyces gancidicus BKS 13-15]
Length = 1765
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 92/427 (21%), Positives = 163/427 (38%), Gaps = 97/427 (22%)
Query: 1206 DSHFLLELVQNADDNIYPENVEPTLTFILQE-----SGIVVLNNEQGFSAENIRALCDVG 1260
+ L+EL+QNA D + + L E + V N + F +N + LC +
Sbjct: 51 EGRVLIELLQNAHDAHPARATDGRIEIRLDEDEGEHGTLYVANCGKPFGGKNFKDLCSIA 110
Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFHVKF----------- 1300
S+K+ + IG KG+GFKSV + DAPE++S +GF +F
Sbjct: 111 LSSKRADAG--IGHKGVGFKSVLHLCDAPEVYSVAREGSRLLDGFTFRFARLDDYDALAR 168
Query: 1301 DTSEGQIGFV-----------LPTLV--PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
+ + + GF +P + PP + F R + + T IRLP
Sbjct: 169 EVAPEREGFADYLKENLLTLKVPVFLEEPPKTVQGFAR-----------RGFVTVIRLPL 217
Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF--------RNMLNDSLVVIRKKIV 1399
++ + A + ++ D P LFL RL+ + R D V +
Sbjct: 218 KSADARSAAGAQVRELMDDNAP-FELFLDRLERVRLEWRAAGETRRRTYDRQVEVLHHAR 276
Query: 1400 GDGIIKVS----------CGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGN 1449
G + +++ CG K + A + L A + + ++ +A +
Sbjct: 277 GLKVQQITLRRRMRLIVVCG--KADPYRAREALAASIEKSGMRRDWMAWEKKAEVRVAVP 334
Query: 1450 YGPLLYQQPVFAFLPLR---TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP---- 1502
G L ++AFLP+ + ++ + F +R + PWN LL
Sbjct: 335 VGDPLRDGRLYAFLPMGEQISAPVRGYVHAPFFAEMNRRSFNEAVPWNGLLLDTVAEVCA 394
Query: 1503 -ALFVSAE-------RSFCDLPCFRENP-AKAASVY---------MSFVPLVGEVHGFFS 1544
A+ +A+ + DL C++ A+ S + + F+P++G G +
Sbjct: 395 RAVLAAADGRASIQPGALVDLMCWQPKALARLTSAFQRLDNGIAHVPFIPVIGTTPGART 454
Query: 1545 GLPRMIL 1551
L R +L
Sbjct: 455 SLHRALL 461
>gi|157864316|ref|XP_001680868.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68124160|emb|CAJ02143.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 1657
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 131/289 (45%), Gaps = 44/289 (15%)
Query: 170 LKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGI 229
+++ S +SL F +++VP+L + + R+ + + + R I ++++LEE +C EG+
Sbjct: 85 MEVLASSTNSLPFALRDVPALANVAALQLRLTSTVMAAISTRAIVTVHELEEWVCAQEGV 144
Query: 230 EKFEELELGPFLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDI----- 283
E F EL LG L+ P++ YF ++++ VVF + DVI L +DT A+D+
Sbjct: 145 EHFAELGLGVGLQVLPVVQEYFQLRAN-SVVFPVRARDVIAFL--LSDT-TARDMLLYGG 200
Query: 284 --------KVDEFLDFIAKKRSLASKEQLGVR-------IQNLGMHINFIREARKSQDVT 328
+ F + + + S VR ++ LG+H+ QD T
Sbjct: 201 GDTRDLLYRFAAFYERHVLRNASPSSAPAAVRGQGRLLNVRQLGIHV---------QDYT 251
Query: 329 --LKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVS 386
L +EL H+ ++ + +R ++ +A E ++ S+ D + S
Sbjct: 252 ALLAALTQELASAHQLEQQ-----QYQRWIESCATAETEVTQTAST--RDSAGVAVAKDS 304
Query: 387 SSSEDEDSDDCTYECSNDISSNVQLPSQIKGSDRVSSCPYPSVTEELKR 435
++ ED ++E + N L ++ S S+C + + E ++R
Sbjct: 305 AAGEDALVRAASFEVAARRERNRALYERVLHSFD-SACAHAELVEGMRR 352
>gi|384099073|ref|ZP_10000177.1| putative transcriptional regulator [Imtechella halotolerans K1]
gi|383834205|gb|EID73649.1| putative transcriptional regulator [Imtechella halotolerans K1]
Length = 1884
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 93/413 (22%), Positives = 165/413 (39%), Gaps = 112/413 (27%)
Query: 1176 ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQ 1235
++N E+N + + + + + +YS + EL+QNADD E VE + F
Sbjct: 212 LNNPEANKM------IANLMREIGKGMYSSKQRMIFELLQNADDAPGKERVEFHIDF--N 263
Query: 1236 ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI--HS 1293
V+++ F+ +++ A+ STK+G S G KGIGFKSVF TD+ E+ S
Sbjct: 264 GDYFFVMHDGAPFNKDDVEAITSAAESTKRGDSKK-TGYKGIGFKSVF--TDSTEVWLKS 320
Query: 1294 NGFHVKFDTSEGQIGFVLPTLVPPF-NIDMFC----------RLLSKDPVQLESK--CWN 1340
G+ Q F+ T P F N D F LL +D ++ ++ +N
Sbjct: 321 GGY---------QFAFLRNT--PLFENFDNFYFSSERYQKYPELLDEDKLKYRNQRLRFN 369
Query: 1341 TCIRLPFR------------------------TKFSEGIAMNNIVDMFSDL-----HPSL 1371
+P++ +F+ + NNI D + + P
Sbjct: 370 GSTDIPWQVIPIWQNRLPSEFTDSNFNNFNNPVQFALKLGENNIEDYKTAIDNITKRPQF 429
Query: 1372 LLFLHRLQ--------CIMFRNMLNDSLVVIRKK-------------------IVGDGII 1404
LLFL + RN N+ + +I+ K + +
Sbjct: 430 LLFLRNTSKFRSPKNGVTVLRNDANNVIEIIKSKRAEPDQVFHYTKQTFENVEVSDEAFE 489
Query: 1405 KVSCGEDKMT----------WFVASQKLRAGVIRPDVKT---TEIALALTLQE---SNEG 1448
++ G K + +F + I P + + TEI+ ++L + S E
Sbjct: 490 QLDVGLKKQSKINDYNEVTYYFTDLEGREIETIPPKLASATQTEISFGISLVDNKISTEK 549
Query: 1449 NYGPLLYQ-QPVFAFLPLRT--YGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
Y L + +F +LP+ + L F++ DFV S R+ + G++ WN++++
Sbjct: 550 EYTKGLPKYSSLFTYLPMEDTRFQLPFLVNADFVPSSDRQRIQGDNLWNKYIM 602
>gi|381150652|ref|ZP_09862521.1| UvrD/REP helicase [Methylomicrobium album BG8]
gi|380882624|gb|EIC28501.1| UvrD/REP helicase [Methylomicrobium album BG8]
Length = 1718
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 71/344 (20%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIV---------VLNNEQGFSAE-- 1251
YS +HF+ EL+QNADD T TF L+ESG+ ++N Q A+
Sbjct: 39 YSDQAHFIYELLQNADDV-----KATTATFRLEESGLYFTHNGSVRFTISNPQNEEADTN 93
Query: 1252 -----NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
+I A+ + NS K +S IG+ G+GFK+VF+ T P I+ K +
Sbjct: 94 NGTLGHINAITSIANSNKTEAS---IGKFGVGFKAVFQYTQTPHIYDPEIRFKIER---- 146
Query: 1307 IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSE-GIAMNNIVDMFS 1365
F++P ++ + D+ R SK T PF K + + + I++
Sbjct: 147 --FIVPHML---DADLDWRNSSK-----------TVFFFPFDHKKKQPQESHDEILEKLK 190
Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
L +LFL L+ + F + K K+ G W + KL
Sbjct: 191 ALEFP-VLFLSSLKSVSFEAE------KVSGKYTKKVTRKIEEGGITAQWITLALKLDGK 243
Query: 1426 --------VIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQG 1476
R + ++ L ES GN P+ + F F P + L FIL
Sbjct: 244 RTSQRLVMFTRNNASGHPCSIGYALGES--GNLVPI--DRTAFCFFPTKEATNLNFILHA 299
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRE 1520
F+L SRE + N L+ + AE + LP R+
Sbjct: 300 PFLLTDSREGIKAGEKHNNELIEQL------AELAANSLPILRD 337
>gi|72391776|ref|XP_846182.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62358393|gb|AAX78857.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70802718|gb|AAZ12623.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 1645
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 76/164 (46%), Gaps = 11/164 (6%)
Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
L F ++++PS+ ++ + R+ A + V R + +L++LE +C +EG+ F EL LG
Sbjct: 80 LPFALKQIPSIARIAELQLRVTALVQSAVATRPVVTLHELELEVCSAEGVSIFAELGLGD 139
Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTH-------KAKDIKVDEFLDF 291
L P + H F ++S VV I ++D + L + D A D V F F
Sbjct: 140 SLTALPFVQHAFNLRSSTGVVAPITSDDFLQFLLFHADAEIMLTSGGDAGD-AVRAFAQF 198
Query: 292 IAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKE 335
K + QLG+ IQ+ + +R+ + + + E
Sbjct: 199 Y--KNGYYTPLQLGIHIQHFSWLLRLLRQETGRTSAFISQIVAE 240
>gi|448389606|ref|ZP_21565745.1| hypothetical protein C477_05907 [Haloterrigena salina JCM 13891]
gi|445668380|gb|ELZ21009.1| hypothetical protein C477_05907 [Haloterrigena salina JCM 13891]
Length = 1620
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 56/106 (52%), Gaps = 4/106 (3%)
Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
Q+ D F+ EL+QNA D + ++ EP + L E + V NN + F +++ AL +
Sbjct: 44 QDRAGYDGRFIFELLQNAADEMA-DSDEPAIHIELSEETLYVANNGRAFDYKDLYALTMI 102
Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN-GFHVKFDTSE 1304
+TK G IG KG GF SV VTD P + S+ G KFD E
Sbjct: 103 AETTKAGERT--IGHKGRGFTSVLGVTDQPSVFSSEGVSAKFDREE 146
>gi|313126065|ref|YP_004036335.1| hypothetical protein Hbor_13070 [Halogeometricum borinquense DSM
11551]
gi|448285904|ref|ZP_21477143.1| hypothetical protein C499_03988 [Halogeometricum borinquense DSM
11551]
gi|312292430|gb|ADQ66890.1| hypothetical protein Hbor_13070 [Halogeometricum borinquense DSM
11551]
gi|445575934|gb|ELY30397.1| hypothetical protein C499_03988 [Halogeometricum borinquense DSM
11551]
Length = 1674
Score = 64.7 bits (156), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 3/102 (2%)
Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
Q+ + F+ EL+QNA D + + EP + F L E ++V NN FS E++ AL
Sbjct: 38 QDRADYEGRFVFELLQNAVDEM-DDTGEPRVRFELTEDSLLVANNGAAFSFEDLYALTLT 96
Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
+TK G + IG KG GF SV VTD P ++S +F+
Sbjct: 97 TRTTKAGETT--IGHKGRGFTSVLGVTDNPSVYSENVQAQFN 136
>gi|261329772|emb|CBH12754.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 1683
Score = 64.3 bits (155), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 75/164 (45%), Gaps = 11/164 (6%)
Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
L F ++++PS+ ++ + R A + V R + +L++LE +C +EG+ F EL LG
Sbjct: 118 LPFALKQIPSIARIAELQLRATALVQSAVATRPVVTLHELELEVCSAEGVSIFAELGLGD 177
Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTH-------KAKDIKVDEFLDF 291
L P + H F ++S VV I ++D + L + D A D V F F
Sbjct: 178 SLTALPFVQHAFNLRSSTGVVAPITSDDFLQFLLFHADAEIMLTSGGDAGD-AVRAFAQF 236
Query: 292 IAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKE 335
K + QLG+ IQ+ + +R+ + + + E
Sbjct: 237 Y--KNGYYTPLQLGIHIQHFSWLLRLLRQETGRTSAFISQIVAE 278
>gi|336316858|ref|ZP_08571747.1| hypothetical protein Rhein_3170 [Rheinheimera sp. A13L]
gi|335879023|gb|EGM76933.1| hypothetical protein Rhein_3170 [Rheinheimera sp. A13L]
Length = 1411
Score = 64.3 bits (155), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 105/460 (22%), Positives = 181/460 (39%), Gaps = 69/460 (15%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
+LEL+QNA D ENV + L E G+++ NN + FS +++L + S K+ +
Sbjct: 54 ILELLQNAADQAKAENVSGKVIIELTEVGLLIANNGKAFSVGGVKSLQNAHLSPKRLNQR 113
Query: 1270 GYIGRKGIGFKSVFRVTDAP-------------EIHSNGFHVKFDTS------------E 1304
+IG KG+GF+S+ T P E+ +N D S +
Sbjct: 114 QFIGCKGLGFRSILNWTRNPIILSGELAIYYSDEVSANKLQELMDESSELADLVQRERDK 173
Query: 1305 GQIGFVLPTL-VPPFNI-DMFCRLLSKDPVQLESKC-------WNTCIRLPFRTKFSEGI 1355
+LP L P F I +S+ L +C + T I +PF S
Sbjct: 174 NNNDIILPKLPFPGFPIPTAMSGSVSQTTSFLFERCQYWKSLGYTTVIGIPFEKAESFSK 233
Query: 1356 AMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTW 1415
AM I L P +LLF+ L + F + + L V K+ D + + GE W
Sbjct: 234 AMQQI----ESLRPEMLLFVPYLDEVRFL-LPENKLSVWSKEGDDDASLVMQDGEPIGIW 288
Query: 1416 FVASQKLRAGVIRPDVKTT----EIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGL 1470
+ + + + T EI +A+ +S L P+F + P L
Sbjct: 289 QLFRTQGNVDKLYLEDSTAAVLYEIVIAVPEVKSRSE-----LKPSPLFTYFPTEINLPL 343
Query: 1471 KFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS-AERSFCDLPC-----FR----- 1519
+ F L SR ++ N ++ ++ + AER P FR
Sbjct: 344 SVVCHATFELDQSRNQISKLKS-NDYVFTQLASFLAEVAERRAAKYPVGANAGFRVAMPL 402
Query: 1520 -ENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRG 1578
E P + F L+ E + +P + + ++++ L+G N+ W P + G
Sbjct: 403 GEYPESLKKI--GFPKLLNEACKTRTIVPTIAGNVSKVADVFSLKGANSDWLP----ISG 456
Query: 1579 WNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIE 1618
+ + + ++ + LG+ LS + L ++L +GI+
Sbjct: 457 FKNIVPNDDAELYWFRELGISELSHE-ALYEALDSLVGID 495
>gi|401415226|ref|XP_003872109.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488331|emb|CBZ23578.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 1662
Score = 64.3 bits (155), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 57/97 (58%), Gaps = 2/97 (2%)
Query: 176 SWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEEL 235
S SSL F +++VP+L + + R+ + + + R I ++++LEE +C EG+E F EL
Sbjct: 92 STSSLPFALRDVPALANVAALQLRLTSTVMAAISTRAIVTVHELEEWVCAQEGVEHFAEL 151
Query: 236 ELGPFLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCL 271
LG L+ P++ YF ++++ VF + DV+ L
Sbjct: 152 GLGIGLQVLPVVQEYFQLRAN-SAVFPVRARDVVAFL 187
>gi|340367655|ref|XP_003382369.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 4323
Score = 64.3 bits (155), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 154/392 (39%), Gaps = 70/392 (17%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + +E Y + E++QNADD ++ + +L+
Sbjct: 1306 QHEPLTTRLSNILRE-YKDGLTIIKEMIQNADDAGATEVNILHDTRTHSSHNLVLEGMAE 1364
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ G+GF SV+ +TD P S
Sbjct: 1365 SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 1423
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQL-----ESKCWN-TCI 1343
+ FD + + G V P I + L+K P Q S +N T
Sbjct: 1424 EWLYIFDPTLKHLKGVVQNESRPGKKIKFRSKFLAKSQQLAPYQGLFGFDSSSSYNGTIF 1483
Query: 1344 RLPFRT---KFSEGIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
RLPFRT + S I N++V M DL + LLLFL ++ I F + +V
Sbjct: 1484 RLPFRTHPSQISSTIYNNHLVQMIKKDLESNGSKLLLFLQNVKRITFSSRQEGRPIVEHL 1543
Query: 1397 KIVGDGI----IKV----SCGEDKMTWFVASQKLRA----GVIRPDVKTTEIALALT--- 1441
I + +KV S + W +++Q+ R G R V + L
Sbjct: 1544 SIECSNVDTENVKVCVTQSSTKSTEYWLLSNQEQRLKSQDGAYRQAVASVACKLVKKQSS 1603
Query: 1442 -LQESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------D 1488
L E+ EGN F FLPL GL + +F + S+R + D
Sbjct: 1604 FLCEAIEGN---------AFCFLPLSVPATGLPVHVTANFAVMSNRSGIWTGASSAMASD 1654
Query: 1489 GNSPWNQWLLSE-FPALFVSAERSFCDLPCFR 1519
WNQ L++ P + +S + C R
Sbjct: 1655 EREYWNQQLMTTVIPEAYCKLLKSLQAMHCLR 1686
>gi|402311178|ref|ZP_10830127.1| GHKL domain protein [Eubacterium sp. AS15]
gi|400365796|gb|EJP18844.1| GHKL domain protein [Eubacterium sp. AS15]
Length = 841
Score = 63.9 bits (154), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 60/109 (55%), Gaps = 1/109 (0%)
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGF 1248
+ L L L+ + + ++ EL+QNADD Y ++++ + +++ + + E+G
Sbjct: 137 SNLSEQLVKLASNINNDSYRYIYELIQNADDCEYEDDIDINMQIDVEDGKLTISYPEKGM 196
Query: 1249 SAENIRALCDVGNSTK-KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
+A +I A+ +G S K K + IG KG GFK++F D +I+SN F
Sbjct: 197 TASDIIAITTIGQSNKLKKKKSHIIGEKGKGFKTIFAACDNVQIYSNNF 245
>gi|340373769|ref|XP_003385412.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 3123
Score = 63.9 bits (154), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 90/384 (23%), Positives = 150/384 (39%), Gaps = 67/384 (17%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + +E Y D + E++QNADD +Y T + +
Sbjct: 218 QHEPLTTRLCNILKE-YKDDITIIKEMIQNADDAGATKVNILYDSRTYSTQKLLFKGMAE 276
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N+ F+ E+ + + +TK IG+ G+GF SV+ +TD P S
Sbjct: 277 SHGPALIVHNDSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 335
Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMF--CRLLS--KDPVQLE--------SKCWNTC 1342
+ FD + + V+ P + R+ S +D V E S T
Sbjct: 336 EWLYIFDPTLKYLKGVVHDQSKPGKKIKYQQSRIFSNCQDLVPYEGLFGFKSSSNYKGTM 395
Query: 1343 IRLPFRTKFSE---GIAMNNIVDMFS----DLHPSLLLFLHRLQCIMFRNMLNDSLVV-- 1393
R PFRT S+ + +IV+ D LLLFL ++ I F + ++ +
Sbjct: 396 FRFPFRTSMSQISSKVYDEHIVEHMKRDLVDCGSKLLLFLQHMKHITFSSRRDNEEITHV 455
Query: 1394 ------IRKKIVGDGIIKVSCGEDKMT-WFVASQK---LRAGVIRPDVKTTEIALALTLQ 1443
+IK C E W +ASQ+ + +P V + +A L
Sbjct: 456 SIDISSSNSNDFQKVVIKSPCIESSTEHWLIASQEGQWQHQNITKPAVAS----VACQLD 511
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------DGNS 1491
E N G Y + F FLPL + GL + +F + S+R + D
Sbjct: 512 EGNLGFYKCKEVEGAAFCFLPLAMPSTGLPVHVSANFAVMSNRSGIWTEVSSDTVTDSRV 571
Query: 1492 PWNQWLLSEFPALFVSAERSFCDL 1515
WN+ L+ + + +++C L
Sbjct: 572 QWNKNLMEK------TIPKAYCKL 589
Score = 43.5 bits (101), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 59/123 (47%), Gaps = 17/123 (13%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADD------------NIYPENVEPTLTF-- 1232
Q L R + + QE Y D+ FL EL+QNADD + P+ T+
Sbjct: 1171 QKEELTRRIRNILQE-YPLDATFLKELLQNADDAKATKIRIILDKRQHGNETVPSETWGK 1229
Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
LQ ++V N+++ FS ++ + +G +K+ IG+ GIGF V+ +TD P
Sbjct: 1230 ELQGPALLVWNDKE-FSDVDLEGIQKLGLGSKRDDDES-IGQFGIGFNVVYHLTDCPSFI 1287
Query: 1293 SNG 1295
+ G
Sbjct: 1288 TRG 1290
>gi|218891094|ref|YP_002439960.1| hypothetical protein PLES_23571 [Pseudomonas aeruginosa LESB58]
gi|218771319|emb|CAW27084.1| hypothetical protein PLES_23571 [Pseudomonas aeruginosa LESB58]
Length = 1603
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 85/322 (26%), Positives = 129/322 (40%), Gaps = 60/322 (18%)
Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ-----------GFSAE-----NIR 1254
LEL+QNADD ++ F + + G+VV N+ Q F AE +
Sbjct: 40 LELIQNADDA-----KAESVVFDVTDQGLVVFNSGQFTYCGGLDRPCAFLAEGNYKCDYH 94
Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
+ DVG S K + IGR GIGF S ++VTD PEI S G + +GQ + + L
Sbjct: 95 RITDVG-SGGKLTRGENIGRFGIGFVSTYQVTDHPEIRSAGIKLTLHPEKGQ--WSVEPL 151
Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
P F + DP NT RL ++ +VD F + LLF
Sbjct: 152 QQPEGTTFFLP-WADDP--------NTEARLALGVSHVSTAHIDQLVDDFQKVLRKSLLF 202
Query: 1375 LHRLQCIMFRNMLNDSLVVIRKKIVGDG---IIKVSCGEDKMTWFV-----ASQKLRAGV 1426
L ++ R N L++ GDG I+ + W + A R
Sbjct: 203 LRHVRTAELRR--NGKLLLACDLERGDGSDLIVSFRPSGEVEQWHILQADAAESAARLYA 260
Query: 1427 IRPDVKT----TEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLP 1481
P +++ T+I++ L + L + ++AFLP ++ GL + DF
Sbjct: 261 SHPRLESLGRGTKISIGLRIDPEP-------LAEGLLYAFLPTEQSTGLPLHINADFFPE 313
Query: 1482 SSREEV-----DGNSPWNQWLL 1498
S R+ V WN+ L+
Sbjct: 314 SDRKAVIFAGHQHEQAWNEMLI 335
>gi|390569708|ref|ZP_10249993.1| hypothetical protein WQE_15316 [Burkholderia terrae BS001]
gi|389938568|gb|EIN00412.1| hypothetical protein WQE_15316 [Burkholderia terrae BS001]
Length = 1042
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 146/366 (39%), Gaps = 70/366 (19%)
Query: 1155 CEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELV 1214
ED L +E++RR+ K LG + ++ Y +HF+ EL+
Sbjct: 8 AEDRRLFIEALRRN-----------------KGDINLG-----IFEDFYPDRAHFIYELL 45
Query: 1215 QNADDNIYPENVEPTLTFILQESGIVVLNNE-QGFSAENIRALCDVGNSTKKGSSAGYIG 1273
QNA+D E +F L G ++N + F+ +IR + + NSTKK + IG
Sbjct: 46 QNAEDAGATE-----ASFELMRDGCTFIHNGLRHFNEPDIRGITGIFNSTKK-DNLDKIG 99
Query: 1274 RKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQ 1333
+ G+GFKSVF TD+P I+S + + VLP +P +D+
Sbjct: 100 KFGVGFKSVFVYTDSPVIYSRDYSFRISQ------LVLPEPIPQ-RVDLGA--------- 143
Query: 1334 LESKCWNTCIRLPFRT-KFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV 1392
T PF + K A + + + LL+L+ L+ I ++
Sbjct: 144 ------TTRFEFPFNSPKKDAAQAFSEVRAGLDAVSERTLLYLNNLRYISWK-CEGSQAA 196
Query: 1393 VIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL---QESNEGN 1449
V R + + + E++ V+S LR ++ ++A+A L + E +
Sbjct: 197 VFRVEHTPSHVEVLKEAENRK--LVSSHWLRFTAPVTKLEHQKVAVAYELGFVGDVTEFD 254
Query: 1450 YGPLLYQQ---------PVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
+ QQ V F P T GL+F L FV SR + N P N L
Sbjct: 255 PSKSIAQQMKIIAAVRGQVSVFFPAEKETSGLRFHLHAPFVPELSRASIK-NCPENATLF 313
Query: 1499 SEFPAL 1504
+ AL
Sbjct: 314 EQLAAL 319
>gi|423314391|ref|ZP_17292325.1| hypothetical protein HMPREF1058_02937 [Bacteroides vulgatus
CL09T03C04]
gi|392683161|gb|EIY76499.1| hypothetical protein HMPREF1058_02937 [Bacteroides vulgatus
CL09T03C04]
Length = 1864
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 93/388 (23%), Positives = 151/388 (38%), Gaps = 88/388 (22%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQG 1247
+ R L + + +YS EL+QNADD P+ ++ +V ++
Sbjct: 479 NRRDAHLLKEIGKNMYSSKERMFFELIQNADDAA-PQKDSVSINSFTTGDYLVFCHDGFS 537
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH--SNGFHVKFDTSEG 1305
F + A+ TKK + G KGIGFKSVF TD+ +++ +NG+H KFD +
Sbjct: 538 FDKYDFEAITSAAVGTKKANENK-TGYKGIGFKSVF--TDSKQVYICTNGYHFKFDKEDE 594
Query: 1306 QI----GFVL--PTLVPPFNIDMFCRLLSK----------DPVQLE-----------SKC 1338
+ GF L P L F + S P QLE K
Sbjct: 595 RFRTFDGFYLNNPFLDTEKAKKRFFSMYSSVRGNFDGVNDIPWQLEPIHVNEFPPEFGKS 654
Query: 1339 W---NTCIRLPFRTKFSEGI-----AMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM---- 1386
+ N I L K +G A+++IVD P +LFL + I F
Sbjct: 655 FMKSNVSIALRLGEKNIQGTGGYESAIDSIVD-----DPKFMLFLRKTDRIRFNTRTISR 709
Query: 1387 -LNDSLVVIRKKIVGDGI-------------------------IKVSCGEDK----MTWF 1416
+N+ ++ I+ + + IK+ ED F
Sbjct: 710 EINEGIITIKNSFAQNKVERYERFDFAAGVSNEIFEEENLPIRIKIEEREDSGNILKALF 769
Query: 1417 VASQKLRAGVIRPDV---KTTEIALALTLQESNEGNYGPLLYQQ-PVFAFLP--LRTYGL 1470
V + + I + +TE++ A+ + E +G+ L +FAFLP ++ +
Sbjct: 770 VDLEGEQIENIPMKIAISSSTELSFAIPVNE--DGSISVLENNTISIFAFLPTLVKDFVF 827
Query: 1471 KFILQGDFVLPSSREEVDGNSPWNQWLL 1498
F + +F+L R+ V G+S WN +L
Sbjct: 828 PFYINANFILDPPRQRVLGDSAWNLYLF 855
>gi|189466556|ref|ZP_03015341.1| hypothetical protein BACINT_02931 [Bacteroides intestinalis DSM
17393]
gi|189434820|gb|EDV03805.1| hypothetical protein BACINT_02931 [Bacteroides intestinalis DSM
17393]
Length = 1554
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/367 (23%), Positives = 152/367 (41%), Gaps = 55/367 (14%)
Query: 1193 RALHCLSQ-ELYSQDSH---FLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGF 1248
R L + + E Y Q ++ + EL+QN D + E + L+++ + + N F
Sbjct: 20 RGLKTICEAEKYLQQAYEGRYFFELIQNVRDANKEASQEGDIYIELKDNVLSIANTGAPF 79
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT------ 1302
S E I ++ +G S K+ S +IG KGIGFKS V+D P I + + FD
Sbjct: 80 SEEGIASITSIGQSPKQ--SLDFIGFKGIGFKSTLEVSDTPRIVTTEGAIYFDRKLTMEV 137
Query: 1303 -SEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
S+ V L F R L+ + K T I LP + F+E I
Sbjct: 138 LSQYNSNLVENQLPLFFFPHYRSRALTSEE---REKGIVTKIELPLKESFAE----EKIC 190
Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE--DKMTWFVAS 1419
F ++ L+ L +Q + F ++ I+K I + I+ + E ++ ++V +
Sbjct: 191 ADFGEIKAKQLVLLGNIQKVDFTTPTQRTIYNIQKDIRKNLIVVDNGKEILNRFRYYVPN 250
Query: 1420 QKL------------RAGVIRPDVKTTEIALALTLQESN----EGNYGPLLYQQPVFAFL 1463
+K + I + + +I + L L E+N + N ++ F
Sbjct: 251 EKNFLPDDILNSLEGKEKEIYSNSTSIDIHIVLELDENNLFVSDAN-------AKLYLFY 303
Query: 1464 PLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLL--------SEFPALFVSAERSFCD 1514
PL G +F++ F++ R + SP N++LL S+ L + + +
Sbjct: 304 PLNIRSGFRFMIHSYFLVNPERTRL-RKSPLNRYLLRKIGEYIGSDMLKLLKRGKYNTNE 362
Query: 1515 LPCFREN 1521
+ CF+ N
Sbjct: 363 ILCFKRN 369
>gi|389599951|ref|XP_001562029.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504277|emb|CAM37053.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 1654
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 68/132 (51%), Gaps = 3/132 (2%)
Query: 142 DRAVVKARSELIMAGESVSSWKVSRSV-LLKLQVDSWSSLGFQMQEVPSLHKLMVTEARI 200
D A V L G +S +++ L S +SL F +++VP+L + + R+
Sbjct: 66 DDADVHGYGRLSGVGHPRTSARIASGAHTLAAITSSINSLPFALRDVPALAHVAALQLRL 125
Query: 201 NAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQ-PLISHYFLVKSDVDVV 259
+ + + R I ++++LEE +C EG+ F EL LG L+ P++ YF ++++ VV
Sbjct: 126 TSSVMAAISTRAIVTVHELEEWVCAQEGVAHFAELGLGVGLQVLPVVQEYFQLRAN-SVV 184
Query: 260 FKIATNDVIVCL 271
F + DVI L
Sbjct: 185 FPVRARDVIAFL 196
>gi|374812444|ref|ZP_09716181.1| hypothetical protein TpriZ_01095 [Treponema primitia ZAS-1]
Length = 1490
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/313 (21%), Positives = 125/313 (39%), Gaps = 65/313 (20%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGI-VVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
LLEL+QN DD +++ + ++ I V NN F+AE I LC S K +
Sbjct: 41 LLELLQNVDDAAVKADIQNADVLLEYKNNIFTVANNGTQFTAETIERLCHGAASNK---T 97
Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF------------------- 1309
+IG KG GF+S+ EIHS FH+ F + F
Sbjct: 98 DDFIGNKGTGFRSLLNWGRQIEIHSGEFHIGFSQNYANSEFEKILSGCAILQEQKKNVPN 157
Query: 1310 -VLPTLVPPFNI-------DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
+P L PF + D ++++ D +Q + + NI+
Sbjct: 158 LSMPVLHCPFEVNSLNNKYDTTIQIITDDSLQNDEQ---------------------NIL 196
Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE--DKMTWFVAS 1419
LLFL + I+ + ++ +IRK + I++ E +K +F
Sbjct: 197 KQLQRFDHKTLLFLPNIMQIIIK--TDEYTKIIRKAPKENEEIEIYVDETIEKYLYFKNP 254
Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG--LKFILQGD 1477
+ + + + + A+A+ L E + + + ++ F P+R++ L ++
Sbjct: 255 DEPQIDTGKNEKRKIRTAIAIALDEWDYSD-------ETMYCFFPIRSFSTPLNMLMHAT 307
Query: 1478 FVLPSSREEVDGN 1490
F L ++R+++ N
Sbjct: 308 FELNANRDDIPPN 320
>gi|340383695|ref|XP_003390352.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 2691
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 95/387 (24%), Positives = 157/387 (40%), Gaps = 75/387 (19%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + +E Y + E++QNADD ++ +L +L+
Sbjct: 486 QHEPLTTRLSNILRE-YKDGLTIIKEMIQNADDAGATEVNILHDTRTHSSLNLVLEGMAE 544
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ G+GF SV+ +TD P S
Sbjct: 545 SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 603
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQL-----ESKCWN-TCI 1343
+ FD + + G V P + + L+K P Q S +N T
Sbjct: 604 EWLYIFDPTLKHLKGVVQNENRPGKKLKFQSKFLAKSQQLAPYQGLFGFDSSASYNGTIF 663
Query: 1344 RLPFRT---KFSEGIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
RLPFRT + S I N+++ M DL + LLLFL ++ I F + D ++
Sbjct: 664 RLPFRTHPSQISSTIYNNHLIQMIKKDLESNGSKLLLFLQNVKRITFSSRQEDRPIIKHL 723
Query: 1397 KI--VGDGI--IKV----SCGEDKMTWFVASQKLRA----GVIRPDVKTTEIALALT--- 1441
I + D +KV S + W +++Q+ R G + V + L
Sbjct: 724 SIECINDETENVKVCVTQSSTKSTEYWLLSNQEQRLKSQDGAYKQAVASVACKLVKKQSS 783
Query: 1442 -LQESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------D 1488
L E+ EGN F FLPL GL + +F + S+R + D
Sbjct: 784 FLCEAIEGN---------AFCFLPLSVPATGLPVHVTANFAVMSNRSGIWAGASSAMASD 834
Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDL 1515
WN+ L++ A +++C+L
Sbjct: 835 EREYWNKQLMTTIIA------QAYCNL 855
>gi|254390316|ref|ZP_05005534.1| hypothetical protein SSCG_02861 [Streptomyces clavuligerus ATCC
27064]
gi|294811274|ref|ZP_06769917.1| Hypothetical protein SCLAV_0440 [Streptomyces clavuligerus ATCC
27064]
gi|326439778|ref|ZP_08214512.1| hypothetical protein SclaA2_01880 [Streptomyces clavuligerus ATCC
27064]
gi|197704021|gb|EDY49833.1| hypothetical protein SSCG_02861 [Streptomyces clavuligerus ATCC
27064]
gi|294323873|gb|EFG05516.1| Hypothetical protein SCLAV_0440 [Streptomyces clavuligerus ATCC
27064]
Length = 1801
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 89/201 (44%), Gaps = 27/201 (13%)
Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQE-----SGIVVLNNEQGFSAENIRALCDVGN 1261
FLLEL+QN D + ++ +L E + V N F+ +I+A+C +
Sbjct: 54 GRFLLELLQNGHDAHRGGRNDGSVHVLLDEDEGTHGTLYVANGGTPFTWASIQAVCKLAR 113
Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS-----------NGFHVKFDTSE------ 1304
S K IG KG+GF+S+ +TDAPE++S +G+ +F +
Sbjct: 114 SEKTVGEG--IGNKGVGFRSILEITDAPEVYSASTAGEGLAVLDGYCFRFAVRDDLRVLL 171
Query: 1305 GQIGFV--LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVD 1362
G V +PPF + + +L + T IR+P R + S A+ + +
Sbjct: 172 GDEDLVHKAEEELPPFQVPFPLSDVPPTCKKLAADGHVTVIRIPLRNEASRRAAVRRLDE 231
Query: 1363 MFSDLHPSLLLFLHRLQCIMF 1383
+ + P ++LFL RL ++
Sbjct: 232 LGAAKAP-VMLFLDRLNRLLI 251
>gi|449680062|ref|XP_002155856.2| PREDICTED: uncharacterized protein LOC100205182, partial [Hydra
magnipapillata]
Length = 1503
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/219 (20%), Positives = 99/219 (45%), Gaps = 34/219 (15%)
Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLG-------FQMQEVPSL 190
L+ I++ V + R+++ E V ++ + + + V + L Q ++ ++
Sbjct: 937 LDEINQKVSQIRNKICRRNEVVKVPEIVKELCVFYNVSCVADLCPFESKKIRQETDITAI 996
Query: 191 HKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYF 250
+ ++ + +INAFI F+ R I +L++L + ICK E F +L+LGP +Q LI F
Sbjct: 997 NDIIRLQGKINAFIQSFMTFRNICTLFELNQEICKMESKSSFLDLKLGPLQKQLLIHDNF 1056
Query: 251 LVKSDVDVVFKIATNDVIVCLSEYTDTHKAKD---------------------------I 283
+ + + I T+D+ + + +K K
Sbjct: 1057 KFPPNQNDIPHITTSDIFRLIWKMHIDYKKKQCTDEELSENSRTGLKSKKQPRERENVVF 1116
Query: 284 KVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREAR 322
+ EF++F+ K+ + GV+I ++ + ++ +++A+
Sbjct: 1117 SLSEFMEFLIKQHGFTQPYESGVKISSIALAVSVVKKAK 1155
>gi|254481776|ref|ZP_05095019.1| hypothetical protein GPB2148_1466 [marine gamma proteobacterium
HTCC2148]
gi|214037905|gb|EEB78569.1| hypothetical protein GPB2148_1466 [marine gamma proteobacterium
HTCC2148]
Length = 1032
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 83/196 (42%), Gaps = 42/196 (21%)
Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQE-SG---IVVLNNEQGFS 1249
L L ELY +HF+ EL+QNA+D T++ E SG N + F+
Sbjct: 31 GLRKLMAELYPDQAHFVYELLQNAED--------AKATYVRFELSGQELRFAHNGARQFN 82
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
+++ ++ +G STKK IG+ G+GFK+VF D PEIHS G F
Sbjct: 83 HKDVESITGIGTSTKK-DDVNKIGKFGVGFKAVFSYCDTPEIHS-----------GDYSF 130
Query: 1310 VLPTLVPPFNIDMFCRLLSKD----PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFS 1365
+ L P I R S+ P+ K C I+ +FS
Sbjct: 131 KIEHLFCPIQITPVSRKDSETLFVFPLGSSEKTPEQC--------------SEEILKVFS 176
Query: 1366 DLHPSLLLFLHRLQCI 1381
L S +LFL+ ++ I
Sbjct: 177 RLDHSAVLFLNSIKSI 192
>gi|336399055|ref|ZP_08579855.1| hypothetical protein Premu_2030 [Prevotella multisaccharivorax DSM
17128]
gi|336068791|gb|EGN57425.1| hypothetical protein Premu_2030 [Prevotella multisaccharivorax DSM
17128]
Length = 2327
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 72/130 (55%), Gaps = 7/130 (5%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTLTFILQESGIVVLNNEQGFSAENI 1253
L LS++L +++S F+ EL+QNA+D YP E + F + ++ ++ +++ F+ NI
Sbjct: 250 LDTLSKQLSAKESTFVYELLQNAND--YPVEGKMVDVEFHITDNYLLFMHSGDKFNVRNI 307
Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFV 1310
+C + N +K ++ IG KGIGFK+VF + + + +FD T E ++G
Sbjct: 308 SGICGI-NEKEKIANKHTIGYKGIGFKTVFLHNHYVYLQTGDYSFRFDEGETPEKKVGGK 366
Query: 1311 LPTLVPPFNI 1320
+ L PF I
Sbjct: 367 IKRLGAPFQI 376
>gi|340373207|ref|XP_003385133.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 4326
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 162/387 (41%), Gaps = 75/387 (19%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + +E Y + E++QNADD ++ + +L+
Sbjct: 1358 QHEPLTTRLSNILRE-YKDGLTIIKEMMQNADDAGATEVNILHDMRTHSSRNLVLEGMAE 1416
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK S IG+ G+GF SV+ +TD P S
Sbjct: 1417 SHGPALIVHNNSTFTKEDFENITKLAAATKANQSLK-IGKFGVGFCSVYHITDVPSFVSG 1475
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQ-----LESKCWN-TCI 1343
+ FD + + G V P I+ + L+K P Q S +N T
Sbjct: 1476 DWLYIFDPTLKHLKGVVQNENRPGKKINFKSKFLAKSHQLAPYQDLFGFDPSSTYNGTIF 1535
Query: 1344 RLPFRTKFSE---GIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
RLPFRT S+ I ++++ M +DL + LLLFL ++ I F + S VV +
Sbjct: 1536 RLPFRTHPSQICPTIYNDHLIQMIKNDLESNGSKLLLFLQNVKRITFSSQQEGSQVVEQL 1595
Query: 1397 KI--VGDGIIKV-SCGEDKMT-----WFVASQKL----RAGVIRPDVKTTEIAL----AL 1440
I D I V C T W +++QK + G+ +P V + L +L
Sbjct: 1596 SIECSKDDIKNVKKCITKSNTTIIEYWLLSNQKQQLKSQEGICKPAVASVACKLVKEQSL 1655
Query: 1441 TLQESNEGNYGPLLYQQPVFAFLPLRT--YGLKFILQGDFVLPSSREEV----------D 1488
L E+ EGN F FLPL GL + +F + ++R + +
Sbjct: 1656 FLPEALEGN---------AFCFLPLSIPDTGLPVHVNANFAVMNNRSGIWTGVSSAIASE 1706
Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDL 1515
WN+ L++ + +++C+L
Sbjct: 1707 EREYWNKQLMTS------AIPQAYCNL 1727
>gi|449544899|gb|EMD35871.1| hypothetical protein CERSUDRAFT_138618, partial [Ceriporiopsis
subvermispora B]
Length = 2647
Score = 61.6 bits (148), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 98/206 (47%), Gaps = 32/206 (15%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y L E++QN+DD + PT I LQES ++ N+ F+
Sbjct: 22 YPYSVGLLREILQNSDDAKATKQIFVLDHRTYPTEHLIHEGLADLQESALLAFNDSV-FT 80
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG----FHVKFDTSEG 1305
++ +AL ++ S+K+ ++ IG+ GIGF++++ VTD P+I S+ F + D SEG
Sbjct: 81 EQDWKALQNISQSSKRLDTSK-IGKYGIGFRALYHVTDNPQILSHDSLVVFDPQHDYSEG 139
Query: 1306 --QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTK----FSEGIAMNN 1359
+IG P + F L K V S T IRLP R SE + ++
Sbjct: 140 GLRIGLAECQQEYPDQLSPFTHFLHKGGV---STLEGTAIRLPLRRTESRISSEVHSTDD 196
Query: 1360 IVDMFSD--LHP--SLLLFLHRLQCI 1381
+ + D +H ++LFL+ L+C+
Sbjct: 197 LRRLLDDFVVHELQDVMLFLNHLKCV 222
>gi|338534009|ref|YP_004667343.1| hypothetical protein LILAB_21825 [Myxococcus fulvus HW-1]
gi|337260105|gb|AEI66265.1| hypothetical protein LILAB_21825 [Myxococcus fulvus HW-1]
Length = 1809
Score = 61.2 bits (147), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 60/108 (55%), Gaps = 7/108 (6%)
Query: 1186 KQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV-EPTLTFILQESGIVVLNN 1244
++H + A + Q+ + + FL+EL+QNA+D V + T+ + E + V N
Sbjct: 30 REHREIAHAAALIRQDYHDR---FLVELLQNANDQALRGEVHDSTVVVVRSERLLAVSNG 86
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
Q + +N+ L + +S K G +G KG+GFK+V++VT APEI+
Sbjct: 87 GQVLTPQNLERLSSLADSDKTGM---LVGNKGVGFKAVYQVTGAPEIY 131
>gi|373459999|ref|ZP_09551764.1| hypothetical protein HMPREF9944_00028 [Prevotella maculosa OT 289]
gi|371957325|gb|EHO75091.1| hypothetical protein HMPREF9944_00028 [Prevotella maculosa OT 289]
Length = 2449
Score = 61.2 bits (147), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 72/139 (51%), Gaps = 7/139 (5%)
Query: 1186 KQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTLTFILQESGIVVLNN 1244
K L LS++L +++S F+ EL+QNA+D YP E + F + ++ ++ L++
Sbjct: 239 KDFGSAANFLDTLSKQLSAKESTFVYELLQNAND--YPVEGQMVDVEFHITDNYLLFLHS 296
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD--- 1301
F+ NI +C + N +K ++ IG KGIGFK+VF I + + +FD
Sbjct: 297 GDRFNVRNISGICGI-NEKEKVANKKTIGYKGIGFKTVFLNNHYVYIRTGNYSFRFDQGE 355
Query: 1302 TSEGQIGFVLPTLVPPFNI 1320
T E ++G + PF I
Sbjct: 356 TPEKKLGGKIKRQDAPFQI 374
>gi|322710458|gb|EFZ02032.1| Heterokaryon incompatibility protein, putative [Metarhizium
anisopliae ARSEF 23]
Length = 1549
Score = 61.2 bits (147), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 84/184 (45%), Gaps = 40/184 (21%)
Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSC 1408
T FS A+ N V+ + P LLLFL +++ I R + S DG+ +S
Sbjct: 32 TDFS--TALQNDVEA---VDPGLLLFLRQIERIHLRLFTSSS----------DGVPAISK 76
Query: 1409 GEDKMTW-----FVASQKLRAGVIR------------------PDVKTTEIALALTLQES 1445
++ W FV+ + A R + T+I LA ++ +
Sbjct: 77 RFHRVNWTPNSGFVSLKDEDADTRRNLYKHLFTIDCDGTETQRRGITNTDIVLAFPVK-N 135
Query: 1446 NEGNYGPLLYQQP-VFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPAL 1504
G Y PL+ + VFA+LPL +G +F++Q DF+ S+R+ VD +S WN+ + P
Sbjct: 136 KSGRYVPLIQKTNFVFAYLPLGNFGFRFVIQADFLTTSNRQSVDEDSSWNKNIAHAIPRA 195
Query: 1505 FVSA 1508
F +A
Sbjct: 196 FEAA 199
>gi|340384255|ref|XP_003390629.1| PREDICTED: sacsin-like, partial [Amphimedon queenslandica]
Length = 1922
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 146/371 (39%), Gaps = 69/371 (18%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + +E Y + E++QNADD ++ P+ +L+
Sbjct: 278 QHKPLTTRLSNILRE-YKDGLTIIKEMIQNADDAGATEVNILHDTRTHPSHNLVLEGMAE 336
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ G+GF SV+ +TD P S
Sbjct: 337 SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 395
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQL-----ESKCWN-TCI 1343
+ FD + + G V P + L+K P Q S +N T
Sbjct: 396 EWLYIFDPTLKHLKGVVQNESRPGKKLKFQSTFLAKSQQLAPYQGLFDFDSSSSYNGTIF 455
Query: 1344 RLPFRT---KFSEGIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
RLPFRT + S I N+++ M DL + LLLFL ++ I F + ++
Sbjct: 456 RLPFRTHSSQISSTIYNNHLIHMIKKDLESNGSKLLLFLQNVKRITFSSRREGKPIIEEL 515
Query: 1397 KIVGDGI----IKVSCGEDKMT----WFVASQKLRA----GVIRPDVKTTEIALALT--- 1441
I +KV + T W +++Q+ R G + V + L
Sbjct: 516 SIECSNFDTENVKVCVTQSNATSTEHWLLSNQEQRLKSQDGAYKQAVASVACKLVKKQFS 575
Query: 1442 -LQESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------D 1488
L E+ EGN F FLPL GL + +F + S+R + D
Sbjct: 576 FLCEAIEGN---------AFCFLPLSVPATGLPVHVTANFAVMSNRSGIWTGASSAMASD 626
Query: 1489 GNSPWNQWLLS 1499
WNQ L++
Sbjct: 627 EREYWNQQLMT 637
>gi|343476855|emb|CCD12172.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1661
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 71/161 (44%), Gaps = 5/161 (3%)
Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
L F ++++P L K+ + RI A + V R + ++Y+LE +C +E I + EL LG
Sbjct: 76 LPFSIRQIPHLAKVAQRQVRITAMVQSAVATRPVVTIYELELEVCSNENISTYAELGLGD 135
Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVI--VCLSEYTDTHKAKDIKVDEFLDFIAK-- 294
L P + H F + S + + +ND + + Y + + A+
Sbjct: 136 SLAMLPFVQHAFKLPSSGTEIAPVTSNDFLQFLLFDAYAKIVLTSGGDAGDAVRAFARSY 195
Query: 295 KRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKE 335
K + QLG+ IQ+ + F+R+ L + + E
Sbjct: 196 KGDRYNPAQLGIHIQHFSWLLQFLRQEIARTSALLSQAVTE 236
>gi|305663140|ref|YP_003859428.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
gi|304377709|gb|ADM27548.1| hypothetical protein Igag_0718 [Ignisphaera aggregans DSM 17230]
Length = 948
Score = 59.7 bits (143), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/349 (22%), Positives = 146/349 (41%), Gaps = 62/349 (17%)
Query: 1206 DSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
+ F+LE +QNA D +NV+ L + I+V N+ F+ ++ + C + S +K
Sbjct: 43 ERQFILEYLQNAIDA-GSKNVKIVLDY--DNVRILVYNDGTEFNHDDFESFCKIARS-RK 98
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
+ IG GIG KS F + E+HS +HV F+ S + L +N+
Sbjct: 99 DPNKMLIGYIGIGAKSAFAIARKLELHSGEYHVTFEESN-----IPEELSKNYNLKYLWM 153
Query: 1326 LLSKDPVQLESKCWNTCIRL-----PFRTKF----------SEGIAMNNIVD----MFSD 1366
++ + + +C+++ R FRT F + I ++D F D
Sbjct: 154 IVPRT----KDECYSSSCRYLMNTASFRTVFVLENLTKDKETLEIIHRTLLDYNYEYFLD 209
Query: 1367 LHPSLLLFLH--RLQCIMFRNMLNDSLVVIRK---KIVGDGI-------IK----VSCGE 1410
L + +H +++ +F+N + L I K K + GI +K +S E
Sbjct: 210 ARTLLFIPVHNIKVEICIFKNGVEKCLRRIEKFHEKEIQSGIPHLRKVVVKLREILSDRE 269
Query: 1411 DKMTWFVASQKLR-----------AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
+ W + +++ A R +V + +A E+++ + + +
Sbjct: 270 ETEDWLIILKEVEVPENIRRDPITAIFKRENVVRRLVGIAFK-TENDKLIPTKGVVKFSI 328
Query: 1460 FAFLPLRTY--GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
F+++P+R G ++L DF+ R + N WN+WL +E V
Sbjct: 329 FSYMPIREIESGFNYLLHADFLTDPGRGGIIENIAWNEWLRNELVKFLV 377
>gi|53805067|ref|YP_113238.1| hypothetical protein MCA0731 [Methylococcus capsulatus str. Bath]
gi|53758828|gb|AAU93119.1| hypothetical protein MCA0731 [Methylococcus capsulatus str. Bath]
Length = 583
Score = 59.3 bits (142), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 130/332 (39%), Gaps = 70/332 (21%)
Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNN----------------EQGFSAENIR 1254
LEL+QNADD E V F + ++G++V N+ QG +
Sbjct: 36 LELIQNADDAKAEEVV-----FDITDNGLLVRNSGQFTYCGDLNKPCAFLSQGNYKCDYH 90
Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
+ DVG S K S IGR GIGF S ++VTD PEI S+G + GQ + +
Sbjct: 91 RITDVG-SGGKLSRGENIGRFGIGFVSTYQVTDHPEIRSSGIKLALYPERGQ--WFIEAF 147
Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
P F S DP NT RL ++ + + F + LLF
Sbjct: 148 EEPDGTSFFLPWAS-DP--------NTEARLALGISHVSPTHIDQLAEDFQAVLRKSLLF 198
Query: 1375 LHR-------------LQCIMFRNMLNDSLVVIRKKIVGD--GIIKVSCGEDKMTWFVAS 1419
L L C + R+ +D +V R + I++ E A+
Sbjct: 199 LRHVRKAEIRRNGQLLLACDLDRSEGSDLIVSFRPSGAVEQWHILRADAAE-------AA 251
Query: 1420 QKLRAGVIRPDV--KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQG 1476
Q+L A R + ++T I++ L + L + ++AFLP ++ GL L
Sbjct: 252 QRLYATHPRLETLHRSTNISIGLRIDPEP-------LTEGLLYAFLPTEQSTGLPLHLNA 304
Query: 1477 DFVLPSSREEV-----DGNSPWNQWLLSEFPA 1503
DF S R+ V WN+ L+ A
Sbjct: 305 DFFPESDRKAVIFAGHQHEQAWNEMLIDAAAA 336
>gi|407859405|gb|EKG07006.1| hypothetical protein TCSYLVIO_001871 [Trypanosoma cruzi]
Length = 1541
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 75/177 (42%), Gaps = 20/177 (11%)
Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
L F +++VPSL V ++RI A I V R + +LY+LE +C +E +E + EL LG
Sbjct: 58 LPFSLKQVPSLASTAVLQSRITAMIQSAVATRPVVTLYELEAEVCLTENVEMYAELGLGC 117
Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYT-----DTHKAKDIK-----VDEF 288
L P + H F V + TN V +E+ DT+ + D
Sbjct: 118 SLAALPCVRHLFGV--------SVGTNTAPVTSTEFMHFLLFDTNAQALLSGGGDAGDAV 169
Query: 289 LDFI-AKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRK 344
F K + QLG+ IQ+ + F+R+ L E +KR K
Sbjct: 170 RAFACCYKGGKYTSMQLGIHIQHFPWLLRFVRQEVGRTSTFFSDLLNENAWCYKRNK 226
>gi|71665269|ref|XP_819606.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70884915|gb|EAN97755.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1541
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 75/177 (42%), Gaps = 20/177 (11%)
Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
L F +++VPSL V ++RI A I V R + +LY+LE +C +E +E + EL LG
Sbjct: 58 LPFSLKQVPSLASTAVLQSRITAMIQSAVATRPVVTLYELEAEVCLTENVEMYAELGLGC 117
Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYT-----DTHKAKDIK-----VDEF 288
L P + H F V + TN V +E+ DT+ + D
Sbjct: 118 SLAALPCVRHLFGV--------SVGTNTAPVTSTEFMHFLLFDTNAQALLSGGGDAGDAV 169
Query: 289 LDFI-AKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRK 344
F K + QLG+ IQ+ + F+R+ L E +KR K
Sbjct: 170 RAFACCYKGGKYTSMQLGIHIQHFPWLLRFVRQEVGRTSTFFSDLLNENAWCYKRNK 226
>gi|301105279|ref|XP_002901723.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099061|gb|EEY57113.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 4807
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 94/213 (44%), Gaps = 39/213 (18%)
Query: 1203 YSQDSHFLLELVQNADD------NI---YPENVEPTLTF----ILQESGIVVLNNEQGFS 1249
Y + + E+VQNADD N+ Y ++ L + Q ++V NN FS
Sbjct: 29 YPEGTSIFKEMVQNADDAGATDVNLCLDYRQHAATGLAYEKLASFQGPSLLVHNNAT-FS 87
Query: 1250 AENIRALCDVGNSTKKGSSAGY-IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
A + +++ +G+S KK +S G+ GR G+GF SV+ VTD P S V FD +
Sbjct: 88 AADFQSIQRIGDSLKKDNSKGWKTGRFGVGFNSVYHVTDLPTFVSGSQLVFFDPQACHLP 147
Query: 1309 FVLPTLVPPFNIDMFCR--LLSKDPVQLES-KCW---------NTCIRLPFRT------- 1349
V P+ P ID L+ P Q++ +C+ T RL RT
Sbjct: 148 NVNPS-NPGKMIDYIAHPDLVDNFPAQIDPFRCFGCNFSESFAGTIFRLALRTEEQAQQS 206
Query: 1350 ----KFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
+ M ++ F+D P++LLFL +
Sbjct: 207 KLSNRAQTPAKMAEMLKEFADSLPTVLLFLRNV 239
>gi|260803431|ref|XP_002596593.1| hypothetical protein BRAFLDRAFT_78509 [Branchiostoma floridae]
gi|229281852|gb|EEN52605.1| hypothetical protein BRAFLDRAFT_78509 [Branchiostoma floridae]
Length = 282
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/268 (23%), Positives = 114/268 (42%), Gaps = 30/268 (11%)
Query: 1524 KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLRGWND 1581
+A ++ FVP+ E+ FF + I+ ++ + ++ + Q W P + + +
Sbjct: 25 EAVCRFLQFVPMPDEILDFFKPVAMQIVQHIKAQAFMPVQAEDGQISWKLPSETILSHDP 84
Query: 1582 RAHSLLPDILLQKHLGLGFLSKDIVLS----DSLARALGIEEYGPKILLQIISSLCRT-- 1635
++ LL L L ++ D+ +L+ LGI++ L+++ ++ T
Sbjct: 85 LVKDVIDSQLLSNTLNLFYIHGDVAAVLERYPALSTVLGIQKLSTNHLIEVGKAMATTFM 144
Query: 1636 ---ENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTF 1692
E G ++ +A WL H + + S E ++ LQ I +PLS G+F
Sbjct: 145 QQPEAGESLFMVAKVAKWLA-----CVHRAQEQEYNSSDE--MVKQLQDICILPLSCGSF 197
Query: 1693 SSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLN 1752
+S + TI+ G + L L TV P LLS L + +
Sbjct: 198 ASANGPTIFFPLSEGQVKGVTSV-----LARDLNTVHPGLLSL-------LDDVGNSHAR 245
Query: 1753 RMLLKIGVQQLSAHDIVKVHILPAISDE 1780
ML +GV+++S D++ HILP + E
Sbjct: 246 LMLNTLGVKKISPIDVINHHILPILESE 273
>gi|407424690|gb|EKF39095.1| hypothetical protein MOQ_000682 [Trypanosoma cruzi marinkellei]
Length = 1536
Score = 58.9 bits (141), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 69/153 (45%), Gaps = 16/153 (10%)
Query: 178 SSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELEL 237
+ L F +++VPSL + V ++RI A I V R + +LY+LE +C +E +E + EL L
Sbjct: 56 NGLPFSLKQVPSLASIAVLQSRITAMIQSAVATRPVVTLYELEAEVCLTENVEMYAELGL 115
Query: 238 GPFLRQ-PLISHYFLVKSDVDVV---------FKIATNDVIVCLSEYTDTHKAKDIKVDE 287
G L P + H F V ++ F + + LS D A V
Sbjct: 116 GCSLAALPCVRHLFGVGVGTNIAPVTSTEFMHFLLFDTNAQALLSGGGDAGDA----VRA 171
Query: 288 FLDFIAKKRSLASKEQLGVRIQNLGMHINFIRE 320
F K + QLG+ IQ+ + F+R+
Sbjct: 172 FA--CCYKGGRYTSMQLGIHIQHFPWLLRFVRQ 202
>gi|379736889|ref|YP_005330395.1| hypothetical protein BLASA_3503 [Blastococcus saxobsidens DD2]
gi|378784696|emb|CCG04365.1| conserved protein of unknown function [Blastococcus saxobsidens DD2]
Length = 1569
Score = 58.9 bits (141), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 83/189 (43%), Gaps = 24/189 (12%)
Query: 1210 LLELVQNADDNIYPENV---EPTLTFIL--QESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
LLELVQNA D + E + +L + + + N+ + FS I A+ S K
Sbjct: 58 LLELVQNAADALAGATDLADEARVEIVLDPRTATLYCANSGRPFSKSGITAVTMAYLSGK 117
Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF--------VLPTLVP 1316
+G IGR G+GFKSV VTDAP++ S +F+++E + LP L
Sbjct: 118 RGDE---IGRFGLGFKSVLAVTDAPQVFSRSVSFEFNSAEAKAALRRIAPAARRLPVLRT 174
Query: 1317 PFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFL 1375
ID +DPV + W +T ++LP T + + + LLF+
Sbjct: 175 ATLIDAAAE-FDRDPVLADLATWASTVVKLPRVTD------IQRLEKEIQEFRSEFLLFV 227
Query: 1376 HRLQCIMFR 1384
++ + R
Sbjct: 228 SAVREVRLR 236
>gi|340384390|ref|XP_003390695.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 4028
Score = 58.5 bits (140), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 91/376 (24%), Positives = 155/376 (41%), Gaps = 57/376 (15%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + +E Y + E++QNADD +Y + T + +
Sbjct: 172 QHEPLTTRLGNILRE-YKDGLTIIKEMIQNADDAGATEVNILYDDRTHSTQKLLFKGMAE 230
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ G+GF SV+ +TD P S
Sbjct: 231 SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 289
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQ-----LESKCWN-TCI 1343
+ FD + + G V P + + L++ P + + +N T
Sbjct: 290 KWLYIFDPTLKYLKGVVRNESRPGKKVQYQSKFLAQSQQMAPYENLFGFKSTADYNGTIF 349
Query: 1344 RLPFRT---KFSEGIAMNNIVD-MFSDL---HPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
RLPFRT + S I ++V M +DL LLLFL ++ I F +M D V I
Sbjct: 350 RLPFRTSASQISSTIYNKHLVQKMKNDLIENGSKLLLFLQNVKRITFHSMQEDE-VCISS 408
Query: 1397 KIVGDGI---IKVSCGEDKMT--WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG 1451
+GI I S + ++ W V+ Q+ + D + ++A L + N ++
Sbjct: 409 INEENGIKYCITNSPANNSVSEYWLVSCQEQKLQAQGGDYQPGTASIACQLVKDN-SSFE 467
Query: 1452 PLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSR----------EEVDGNSPWNQWLLS 1499
+ VF FLPL GL + +F + S+R E D WN+ L++
Sbjct: 468 CKAIEGSVFCFLPLSLPCTGLPVHVNANFAVMSNRSGIWTGASSEEPSDSREYWNKKLMT 527
Query: 1500 EFPALFVSAERSFCDL 1515
+++C+L
Sbjct: 528 TI------IPKAYCNL 537
>gi|340375790|ref|XP_003386417.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 4207
Score = 58.2 bits (139), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 88/389 (22%), Positives = 148/389 (38%), Gaps = 78/389 (20%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH + L + +E Y + E++QNADD +Y T I +
Sbjct: 1372 QHEPITTRLRNILKE-YKDGLTIIKEMIQNADDAGATEVNILYDNRTHSTQNLIFKGMAD 1430
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ GIGF SV+ +TD P S
Sbjct: 1431 SHGPALIVHNNGSFTKEDFENITKLAGATKANQPLK-IGKFGIGFCSVYHITDVPSFVSG 1489
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKDP----------VQLESKCWNTCI 1343
+ FD + + G V+ P + ++ +K + S T
Sbjct: 1490 EWLYIFDPTLKYLKGIVINESRPGKKMKYQSKIFAKSQDLTPYEGLFGFKSSSDYEGTMF 1549
Query: 1344 RLPFRT---KFSEGIAMNNIV-DMFSDL---HPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
R PFRT + S I ++V D+ DL LLLFL ++ I F ++ +K
Sbjct: 1550 RFPFRTSASQISSTIYNEHLVEDIKRDLIECGSKLLLFLQHVKQITFSSLKG------KK 1603
Query: 1397 KIVGDGIIKVSCGEDKM---------------TWFVASQK---LRAGVIRPDVKTTEIAL 1438
KI+ I + D + W +ASQ+ +I+P V + +
Sbjct: 1604 KIIEVSIDASNSSNDDIHKVVIKSPHNKSSTEHWLIASQEEQWRHQNIIKPAVAS----I 1659
Query: 1439 ALTLQESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV--------- 1487
A N Y + F FLPL + GL + +F + S+R +
Sbjct: 1660 ACQFDRINSSCYKCKEIEGNAFCFLPLALPSTGLPVHVSANFAVMSNRSGIWTEASSGIV 1719
Query: 1488 -DGNSPWNQWLLSEFPALFVSAERSFCDL 1515
D WN+ L+ ++ ++C L
Sbjct: 1720 TDSREQWNKNLME------ITIPEAYCKL 1742
Score = 42.4 bits (98), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 17/123 (13%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNI------------YPENVEPTLTF-- 1232
Q L R + + QE Y D+ FL EL+QNADD + + P+ +
Sbjct: 2327 QKEELTRRIRNILQE-YPLDATFLKELLQNADDTKATKIRVILDKRQHKKKKVPSEEWGK 2385
Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
LQ ++V N+++ FS ++ + +G +K+ IG+ GIGF V+ +TD P
Sbjct: 2386 ELQGPALLVWNDKE-FSDADLEGIQKLGLGSKRDDDES-IGQFGIGFNVVYHLTDCPSFI 2443
Query: 1293 SNG 1295
+ G
Sbjct: 2444 TRG 2446
>gi|423229230|ref|ZP_17215635.1| hypothetical protein HMPREF1063_01455 [Bacteroides dorei CL02T00C15]
gi|423245069|ref|ZP_17226143.1| hypothetical protein HMPREF1064_02349 [Bacteroides dorei CL02T12C06]
gi|392634508|gb|EIY28428.1| hypothetical protein HMPREF1063_01455 [Bacteroides dorei CL02T00C15]
gi|392640534|gb|EIY34334.1| hypothetical protein HMPREF1064_02349 [Bacteroides dorei CL02T12C06]
Length = 1804
Score = 58.2 bits (139), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 68/128 (53%), Gaps = 13/128 (10%)
Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPE------NVEP---TLTFILQESGI 1239
++ R+L L ++L S F+ EL+QNA+D YP ++P + F + E+ +
Sbjct: 223 KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRTKINGKIQPLPVNVEFHITENYL 280
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
+ + F+ +NI A+CD+ + K ++ IG KGIGFK+VF D +++ + +
Sbjct: 281 TFEHTGEYFNPKNIAAICDINDGEKSDNTEA-IGYKGIGFKTVFLDNDYVLLNTGNYTFR 339
Query: 1300 FDTSEGQI 1307
FD S +
Sbjct: 340 FDKSATDV 347
>gi|85859233|ref|YP_461435.1| cytoplasmic protein [Syntrophus aciditrophicus SB]
gi|85722324|gb|ABC77267.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
Length = 1210
Score = 58.2 bits (139), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 101/232 (43%), Gaps = 32/232 (13%)
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI-----GFVLPTLVPPFNIDMFCRL 1326
IG KG+GFKSV VT PEIHS FH+ F + + GF P F I
Sbjct: 139 IGAKGLGFKSVMEVTTEPEIHSGPFHLHFSLDKTLVLLKKHGFSGIKQPPVFRIP----- 193
Query: 1327 LSKDP---VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF 1383
K+P ++ +C+ T +RLPF E + + +L +LLF ++
Sbjct: 194 FKKEPDEGIKELLECYATVLRLPFANHKREQVCQS-----LRELDAHVLLFCQHIE--EL 246
Query: 1384 RNMLNDSLVVI------RKKIVGDGIIKVSCGEDKMTWFVASQKLRAGV---IRPDVKTT 1434
R +L+++L I + + D I++ + F +++ R ++ K
Sbjct: 247 RIILDENLKRIWTAKRETDRPLTDCRIRIMFENEPEADFSSTEYQRWAKTWNVKGQSKLH 306
Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLPSSRE 1485
+++ L L E+ + P+ YQQP+ F P + I+ F L +R+
Sbjct: 307 SVSICLPLDENRKPI--PMKYQQPLHVFFPTEEELPFRSIIHSSFDLSQNRK 356
>gi|336405902|ref|ZP_08586569.1| hypothetical protein HMPREF0127_03882 [Bacteroides sp. 1_1_30]
gi|335936369|gb|EGM98300.1| hypothetical protein HMPREF0127_03882 [Bacteroides sp. 1_1_30]
Length = 2302
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 68/128 (53%), Gaps = 13/128 (10%)
Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPE------NVEP---TLTFILQESGI 1239
++ R+L L ++L S F+ EL+QNA+D YP ++P + F + E+ +
Sbjct: 223 KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRTKINGKIQPLPVNVEFHITENYL 280
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
+ + F+ +NI A+CD+ + K ++ IG KGIGFK+VF D +++ + +
Sbjct: 281 TFEHTGEYFNPKNIAAICDINDGEKSDNTEA-IGYKGIGFKTVFLDNDYVLLNTGNYTFR 339
Query: 1300 FDTSEGQI 1307
FD S +
Sbjct: 340 FDKSATDV 347
>gi|449486139|ref|XP_002186744.2| PREDICTED: sacsin-like [Taeniopygia guttata]
Length = 5089
Score = 57.8 bits (138), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 156/396 (39%), Gaps = 101/396 (25%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGIVVLN-------NEQGF 1248
Y ++ E+VQNA+D + T T L + G+ + N F
Sbjct: 1306 YGEEGDLFTEMVQNAEDASATVCRFLLDLRSHRTATSGLLDPGMAACHGPALWAYNNALF 1365
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH-- 1297
S ++ + + +G +TK+G AG IGR G+GF SV+RVTD P + S NG H
Sbjct: 1366 SEDDFQNITRIGAATKEGQ-AGRIGRFGLGFSSVYRVTDVPAVLSGETLLIFDPNGTHLS 1424
Query: 1298 ----------VKFD-TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
++ D +S +I V P+N C L P + RLP
Sbjct: 1425 KHIPRAGSPGIRLDFSSRPRILRVFAEQFQPYNGIFGCCLPEPGPFP------GSLFRLP 1478
Query: 1347 FRTK--------FSEGIAMNNIVDMFSDLHPS---LLLFLHRLQCIMFRNMLNDS----- 1390
FRT+ SE I + + S LLLFL ++ + ML D+
Sbjct: 1479 FRTEEEAVTSQICSEAFGAERIQSLGTAFLGSNRLLLLFLKNVRELSL-EMLPDTATSTE 1537
Query: 1391 -----LVVIRKKIVGDG----------IIKVSCGED--KMTWF--------------VAS 1419
++ RK+I G G I ++S E+ + W +
Sbjct: 1538 DTMPLAMLQRKEIRGLGSPGDPPSWAAIEQLSACEEESRTVWHYLVLVCQGDGELLELFH 1597
Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQP--VFAFLPLRTY-GLKFILQG 1476
Q +AG+ P +AL L + +G + P L + VF LP+ GL + G
Sbjct: 1598 QNTQAGLHPP---LPMAGVALPLAPAKDGKWVPRLDAEKGQVFCHLPMPVISGLPIHVHG 1654
Query: 1477 DFVLPSSRE---EVDGNSPWNQWLL-SEFPALFVSA 1508
F + S+R+ + WN+ LL + PA ++ A
Sbjct: 1655 AFSILSNRKGLWDTAERGKWNRVLLHNAVPAAWLRA 1690
>gi|423347365|ref|ZP_17325052.1| hypothetical protein HMPREF1060_02724 [Parabacteroides merdae
CL03T12C32]
gi|409217824|gb|EKN10798.1| hypothetical protein HMPREF1060_02724 [Parabacteroides merdae
CL03T12C32]
Length = 2302
Score = 57.8 bits (138), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 68/128 (53%), Gaps = 13/128 (10%)
Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPE------NVEP---TLTFILQESGI 1239
++ R+L L ++L S F+ EL+QNA+D YP ++P + F + E+ +
Sbjct: 223 KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRTKIDGKIQPLPVDVEFHITENYL 280
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
+ + F+ +NI A+CD+ + K ++ IG KGIGFK+VF D +++ + +
Sbjct: 281 TFEHTGEYFNPKNIAAICDINDGEKSDNTEA-IGYKGIGFKTVFLDNDYVLLNTGNYTFR 339
Query: 1300 FDTSEGQI 1307
FD S +
Sbjct: 340 FDKSATDV 347
>gi|85819302|gb|EAQ40461.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
Length = 1500
Score = 57.8 bits (138), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 137/335 (40%), Gaps = 56/335 (16%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNN--EQGFSAENIRALCDVGNSTKKGS 1267
LLEL+QN+DD E + S + +NN + FS + R+L + N + K S
Sbjct: 42 LLELLQNSDD----EGATQVAIKLDTASKTISINNNGDTSFSKKGYRSLF-IANLSSKTS 96
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF-----------------V 1310
YIG KG+GF+S+ ++A EI SN +++ + + + +
Sbjct: 97 KRKYIGNKGLGFRSIINWSNAIEIQSNDLSLQYSETNTKARYESLFTTEQRATIAKEENL 156
Query: 1311 LPTLVP-PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
T++P PF C + K+ Q + + T I + ++ K A +I + P
Sbjct: 157 SSTVIPMPF----LCLPVIKEITQ---ENFKTSIVISYKEK-----AFKDIYKQVKSITP 204
Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED----KMTWFVASQKLRAG 1425
LLFL+ ++ I F + + + S E +W + ++
Sbjct: 205 ETLLFLNHIKQIDFIGFEDKPQIKCNRSAHIKSSTTFSPQEAIQFPNNSWSIFEEEDELP 264
Query: 1426 VIRPDVKTTE-----IALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
+ DV TE I +A+ E N P LY +F P ++L F
Sbjct: 265 LEFQDVNKTEKEFYQIKIAV---EDNFEKTSPYLY-----SFFPTNIKLNQPYVLHATFD 316
Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
L ++R +++ +S N+++L + A R F D
Sbjct: 317 LDATRNQLN-DSEKNRFILEKVVAFTTKVARYFTD 350
>gi|449486128|ref|XP_002186552.2| PREDICTED: sacsin-like [Taeniopygia guttata]
Length = 3423
Score = 57.8 bits (138), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 155/396 (39%), Gaps = 101/396 (25%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGIVVLN-------NEQGF 1248
Y ++ E+VQNA+D + T T L + G+ + N F
Sbjct: 200 YGEEGDLFTEMVQNAEDASATVCRFLLDLRSHRTATSGLLDPGMAACHGPALWAYNNALF 259
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH-- 1297
S ++ + + +G +TK+G AG IGR G+GF SV+RVTD P + S NG H
Sbjct: 260 SEDDFQNITRIGAATKEGQ-AGRIGRFGLGFSSVYRVTDVPAVLSGETLLIFDPNGTHLS 318
Query: 1298 ----------VKFD-TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
++ D +S +I V P+N C L P + RLP
Sbjct: 319 KHIPRAGSPGIRLDFSSRPRILRVFAEQFQPYNGIFGCCLPEPGPFP------GSLFRLP 372
Query: 1347 FRTK--------FSEGIAMNNIVDMFSDLHPS---LLLFLHRLQCIMFRNMLNDS----- 1390
FRT+ SE I + + S LLLFL ++ + ML D+
Sbjct: 373 FRTEEEAVTSQICSEAFGAERIQSLGTAFLGSNRLLLLFLKNVRELSL-EMLPDTATSTE 431
Query: 1391 -----LVVIRKKIVGDG----------IIKVSCGED--KMTWF--------------VAS 1419
++ RK+I G G I ++S E+ + W +
Sbjct: 432 DTMPLAMLQRKEIRGLGSPGDPPSWAAIEQLSACEEESRTVWHYLVLVCQGDGELLELFH 491
Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQP--VFAFLPLRTY-GLKFILQG 1476
Q +AG+ P +AL L + +G + P L + VF LP+ GL + G
Sbjct: 492 QNTQAGLHPP---LPMAGVALPLAPAKDGKWVPRLDAEKGQVFCHLPMPVISGLPIHVHG 548
Query: 1477 DFVLPSSRE---EVDGNSPWNQWLL-SEFPALFVSA 1508
F + S R+ + WN+ LL + PA ++ A
Sbjct: 549 AFSILSHRKGLWDTAERGKWNRVLLHNAVPAAWLRA 584
>gi|345851153|ref|ZP_08804135.1| helicase domain-containing protein [Streptomyces zinciresistens K42]
gi|345637382|gb|EGX58907.1| helicase domain-containing protein [Streptomyces zinciresistens K42]
Length = 1573
Score = 57.4 bits (137), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 95/228 (41%), Gaps = 26/228 (11%)
Query: 1210 LLELVQNADDNI--YPENVEPT----LTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
LLELVQNA D + E+ E + L + N + FS + L S
Sbjct: 56 LLELVQNAADAMSGAAEHEEGAGRVEIVLDLDRQTLYCANAGRPFSRSGLTTLAHAHLSG 115
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV--------LPTLV 1315
K+G IGR G+GFKSV VTDAP++ S +F++ + + LP L
Sbjct: 116 KRGDE---IGRFGLGFKSVLAVTDAPQVFSRSIAFEFNSPKAKAALAAIAPASKRLPVLR 172
Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFL 1375
ID ++DP+ E W I + + N+ + LLF+
Sbjct: 173 TLTRIDADVE-FAEDPILAELAEWAVTI-----VRLRRASRLENLRKEIEEFRSEFLLFV 226
Query: 1376 HRLQCIMFRNMLNDSLVVIR--KKIVGDGIIKVSCGE-DKMTWFVASQ 1420
+ ++ + R + D+ V + +GDG ++ + D W+V ++
Sbjct: 227 NSVREVRLRVVGADADFVTSHVSRDLGDGRFRIERPDGDHEEWYVENR 274
>gi|336322039|ref|YP_004602007.1| type III restriction protein res subunit [[Cellvibrio] gilvus ATCC
13127]
gi|336105620|gb|AEI13439.1| type III restriction protein res subunit [[Cellvibrio] gilvus ATCC
13127]
Length = 1547
Score = 57.0 bits (136), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 91/222 (40%), Gaps = 28/222 (12%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
L ELVQNA D + + L + V N+ + F+A+ + AL S+K
Sbjct: 52 LHELVQNAVDAA--RDGGGRIEVRLARDTLYVANDGEPFTAQGVEALMASDLSSKADDR- 108
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPP-FNIDMFCRL-- 1326
IGR GIGFKSV VT P ++S FD + + L P + + RL
Sbjct: 109 --IGRFGIGFKSVLAVTAEPRVYSRSVSFAFDRAWSESTIRAAGLTAPRYPVMRVARLED 166
Query: 1327 ----LSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDL---HPSLLLFLHRL 1378
+DPV E W T + LP A+ + + DL LLLF +
Sbjct: 167 PRAAAERDPVLAELMSWATTVVVLP---------ALTQLGSLKRDLLAYRGELLLFSRHI 217
Query: 1379 QCIMFR-NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVAS 1419
Q + R + + D V ++ G++ + E TW V S
Sbjct: 218 QDLRLRIDGMADQRVTANER--AGGLVDLVSEEGVSTWKVLS 257
>gi|301621342|ref|XP_002940013.1| PREDICTED: sacsin-like, partial [Xenopus (Silurana) tropicalis]
Length = 4152
Score = 57.0 bits (136), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 101/449 (22%), Positives = 167/449 (37%), Gaps = 116/449 (25%)
Query: 1203 YSQDSHFLLELVQNADDNIYP---------ENVEPTLTFILQESGI-----VVLNNEQGF 1248
YS+ EL+QNADD EN + I + I + N + F
Sbjct: 1254 YSESVDIFKELIQNADDAEATVCAFLIDMRENATSRQSLIDPDMAICHGPALWSYNNRTF 1313
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD------- 1301
+ + R + VG +TK+ + IG+ G+GF SV+ +TD P I S + FD
Sbjct: 1314 TDNDFRNITRVGAATKE-TDIKKIGKFGLGFNSVYLITDIPLIMSRSHVLIFDPNVNHIR 1372
Query: 1302 ---------------TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
++ +I + P PF+ C L K P + T IRLP
Sbjct: 1373 KHIPNDCNPGIKLNLQTKPEILQIFPDQFQPFSSVFGCEL--KQPFNYDG----TLIRLP 1426
Query: 1347 FRTK------------FSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL--- 1391
FRT+ FS + ++ F + +LL+FL +Q + + D
Sbjct: 1427 FRTEKEAKNSQICQEAFSNK-QIKTLIMRFEESTDTLLVFLRNVQVLTISFLGKDLCPGD 1485
Query: 1392 VVIRKKIVGDGIIKVSCGEDKM--TWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGN 1449
++R + D +++ +D + SQ L + D+ + I + +T+Q++ + N
Sbjct: 1486 QLLRASVQKDNVLRQDVPQDIILQQQITTSQLLEIQMNDLDISVSNI-IKITVQQTTKKN 1544
Query: 1450 ----------------------------------------------YGPLL--YQQPVFA 1461
+ P L + VF
Sbjct: 1545 EKYYMVQSSLGIKASIHMFSQKKSFICPLPFAGVALPLKKHNGTEKWAPDLENFNGMVFC 1604
Query: 1462 FLPLR-TYGLKFILQGDFVLPSSREEVDGNS---PWNQWLLSEFPALFVSAERSFCDLPC 1517
FLPL + GL F L G F + S+R+ + G + WNQ LL++ A V+ + L
Sbjct: 1605 FLPLPISTGLPFHLNGCFSVMSNRKGLWGTTEKGEWNQKLLTD--AALVALITALSQLQI 1662
Query: 1518 FRENPAKAASVYMSFVPLVGEVHGFFSGL 1546
Y +F P V +V+ + L
Sbjct: 1663 LNHQGDLQDYYYYTFWPDVTKVNTQYKEL 1691
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 111/545 (20%), Positives = 200/545 (36%), Gaps = 136/545 (24%)
Query: 1238 GIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI------ 1291
++V NNE FS + + GNS K +GR G+GF SV+ +TD P I
Sbjct: 2 ALLVYNNE-VFSKNDWEGIQKPGNSIK-SKDPNTVGRFGLGFNSVYHITDYPAIFSGKNI 59
Query: 1292 ----------HSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQ-----LES 1336
H G + + I + +P FCR+L + L +
Sbjct: 60 GILDPQESVLHKGGLLWNLEKNMDSIKNLTDQFLP------FCRVLEDTGLSTWNEILNA 113
Query: 1337 KCWN-TCIRLPFRTK---FSEGI-AMNNIVDMFSDLHPSL---LLFLHRLQCIMFRNMLN 1388
C+ T R P R K SE I + + ++F LLFL + + + + +
Sbjct: 114 GCFQGTLFRFPLRVKPSEISENIYSSEKVEELFESFIKDAGISLLFLRHVNTVSLKRIGS 173
Query: 1389 DSLVV--IRKKIVGDGIIKVSCG------------------EDKMTWFVASQKLRAGVIR 1428
D LV+ + + I G E + TW V + + +
Sbjct: 174 DGLVIHLLTVSVSTQTIASTKSGNITTESHLKMTSIKDFDEEKEQTWLVQTSTAHGSLFQ 233
Query: 1429 PDVKTTE---------IALALTLQESNEGNYGPLLYQQPVFAFLPL-----RTYGLKFIL 1474
++ +E +A L+ Q+++ L+ + LPL GL ++
Sbjct: 234 DLIELSEKLCNNPSLDLAYPLSKQDTD-------LFAGRLSCVLPLPDKEENQTGLPVVI 286
Query: 1475 QGDFVLPSSR---------EEVDGNSPWNQWLLSE-FPALFVSAERSFCDLPCFRENPAK 1524
G F L R ++ D + WN L+ + P +++ A +
Sbjct: 287 NGCFDLTDDRRSMKWLETDQQHDNAAKWNHILVHKLLPLVYICAVKD------------A 334
Query: 1525 AASVYMSFV-----------PLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPC 1573
AA V MS + P + + + + I ++L + ++ +N+ W
Sbjct: 335 AALVKMSKITTDMAYGIWPDPEKTKFKDKWYSITKQI-AQLLLCESVLQTADNSDWITAS 393
Query: 1574 KVLRGWNDRAHSL---LPDILLQKHLGLGFLSKDIV-LSDSLARALGIEEYGPKILLQII 1629
K + +N H L L ++L+ +L + +V + D + + L + E +L +
Sbjct: 394 KAIFVYNIEDHDLQNCLEEVLI-------YLKQPLVKIPDHVQKTLKLAEAKTGMLNSVT 446
Query: 1630 SSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSD 1689
S R L+S + ++ L S GQ D+L + +PL++
Sbjct: 447 PSFIRM--VLKSGNWNTISKEQKMLLLRYIVSDGQ-----------YDDLLDLQILPLAN 493
Query: 1690 GTFSS 1694
GTF+S
Sbjct: 494 GTFTS 498
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 93/215 (43%), Gaps = 39/215 (18%)
Query: 1203 YSQDSHFLLELVQNADDNIYPE--------NVEPTLTF-----ILQESGIVVLNNEQGFS 1249
Y L EL+QNADD+ E + T TF L + + NN Q F
Sbjct: 2249 YLAQKDILKELIQNADDSEATEIHFVLDSRTHQATSTFGSEWNPLHGPALCIYNN-QKFD 2307
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFDTSEGQIG 1308
++I + +G K+G + IGR G GF SV+ +TD P + + FD + +
Sbjct: 2308 PKDIEGIQLLGKGGKRGQGSK-IGRFGTGFNSVYHITDCPSFLTGDAIMCVFDPN---LA 2363
Query: 1309 FVLPTLVPP----FNI-DMFCRLLSKDPVQLESKCWN----TCIRLPFR-------TKFS 1352
F+ + V F++ D F + +N T RLP R +K S
Sbjct: 2364 FLPSSTVGSPGGMFSVNDRFKKTFKDVYNTFLPSVFNLQEGTLFRLPLRMAETVAMSKIS 2423
Query: 1353 -EGIAMNNIVDMFSDLHP---SLLLFLHRLQCIMF 1383
+ +++ NI +M +L S++LFL+ ++ I F
Sbjct: 2424 NQPVSLENIRNMCRELEEDADSMILFLNNIKKITF 2458
>gi|340385242|ref|XP_003391119.1| PREDICTED: sacsin-like, partial [Amphimedon queenslandica]
Length = 2392
Score = 57.0 bits (136), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 149/366 (40%), Gaps = 55/366 (15%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH + L + +E Y + E++QNADD +Y T IL+
Sbjct: 339 QHEPITTRLRNILKE-YKDGLTIIKEMIQNADDAGATEVNILYDNRTHSTKNLILKGMAD 397
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ GIGF SV+ +TD P S
Sbjct: 398 SHGPALIVHNNGSFTKEDFENITKLAGATKANQPLK-IGKFGIGFCSVYHITDVPSFVSG 456
Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMF--CRLLS--KDPVQLE--------SKCWNTC 1342
+ FD + + V+ P + R+ S +D V E S T
Sbjct: 457 KWLYIFDPTLQFLKGVVHDQSKPGKKIKYQQSRIFSNCQDLVPYEGLFGFSSLSDYEGTM 516
Query: 1343 IRLPFRT---KFSEGIAMNNIV-DMFSDL---HPSLLLFLHRLQCIMFRNML-NDSLVVI 1394
R PFRT + S + ++V D+ DL LLLFL ++ I F ++ N+ ++ +
Sbjct: 517 FRFPFRTSSSQISSTVYNEHLVEDIKRDLIECGSKLLLFLQHVKRITFSSLRGNEKIIEV 576
Query: 1395 RKKI-------VGDGIIKV-SCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
+ + + +IK G W +ASQ+ + +++ ++A L +N
Sbjct: 577 SIDVSSSSNDDIHEVVIKSPHNGSGTEHWLIASQE-EQWRYQNNIEPAVASVACQLDRTN 635
Query: 1447 EGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------DGNSPWN 1494
Y + F FLPL + GL + +F + S+R + D WN
Sbjct: 636 TNCYNCKETEGNAFCFLPLALPSTGLPVHVSANFAVMSNRSGIWTEVSSGTVTDSKERWN 695
Query: 1495 QWLLSE 1500
+ L+ +
Sbjct: 696 KNLMEK 701
>gi|429768006|ref|ZP_19300182.1| hypothetical protein HMPREF0185_00450 [Brevundimonas diminuta 470-4]
gi|429189571|gb|EKY30398.1| hypothetical protein HMPREF0185_00450 [Brevundimonas diminuta 470-4]
Length = 1743
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 82/351 (23%), Positives = 141/351 (40%), Gaps = 63/351 (17%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQ--ESGIVVL---NNEQGFSAENIRALCDVGNSTK 1264
+LEL+QNA D P + + F+ E VL N+ +GF+ N +ALC +TK
Sbjct: 56 VLELLQNAHDAHDPSDRNGRIRFVYDAGEGDFGVLYAANDGRGFTRANFKALCSPTRTTK 115
Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI-----GFVLPTLVPPFN 1319
+ A IG KG+GF SVF++ PE++S K +G L + P
Sbjct: 116 AVNEA--IGNKGVGFLSVFQICAHPEVYSKPTSRKSGVFDGYCFTFADNATLQAFLAPCG 173
Query: 1320 IDMFCRLLS-------------KDPVQLES---KCWNTCIRLPFRTKFSEGIAMNNIVDM 1363
+ R ++ + P +ES + T +RLP + + +A+ +
Sbjct: 174 LADRARRVAATMPQLYLATPSIQPPPAVESLGEDGYATVLRLPLKNAEAR-LAVETQLTE 232
Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG----IIKVSCGEDKMTWFVAS 1419
++ P + LFL R+ + ++ + + +G + V+CGE + + VA
Sbjct: 233 LAEGQPDVQLFLERISELRIEFGGTTHVLGRAPETLHEGSDLVLQNVTCGERR--YIVAR 290
Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNY----------------GPLLYQQPVFAFL 1463
+ L P+ E+ A L E+ + GP + + ++ FL
Sbjct: 291 RTL------PETAVREVINADVLSEALPERWNDWIGDAVISIAVTADGPPIDGR-LYTFL 343
Query: 1464 PLRTYGLKFILQGDFVLP----SSREEVDGNSPWNQWLLSEFPALFVSAER 1510
P+ G G P R++++ +N WLL E L A R
Sbjct: 344 PM-GKGAPSPFAGHMDAPFFATIERKDLEEGGTFNPWLLDECRRLAFDAAR 393
>gi|116620801|ref|YP_822957.1| superfamily I DNA/RNA helicase [Candidatus Solibacter usitatus
Ellin6076]
gi|116223963|gb|ABJ82672.1| Superfamily I DNA and RNA helicases and helicase subunits-like
protein [Candidatus Solibacter usitatus Ellin6076]
Length = 1586
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 81/351 (23%), Positives = 137/351 (39%), Gaps = 86/351 (24%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN------------EQGFSA 1250
YS +HF+ EL+QNADD + +F+L G+V +N E+ SA
Sbjct: 39 YSDQAHFIYELLQNADDA-----RATSASFVLTRDGLVFKHNGETRFSISDPATEEADSA 93
Query: 1251 E----NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
++ A+ + NS K +S IG+ G+GFKSVF+ T P I+ F + +
Sbjct: 94 SGALGHVNAITSIANSNKTAAS---IGKFGVGFKSVFQYTKTPHIYDPAFCFRIER---- 146
Query: 1307 IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSD 1366
F++P + + D R + T LPF + ++ +
Sbjct: 147 --FIVPHAL---DSDFAGR-----------QDHETVFWLPFDDDRKRPVECHDEI----- 185
Query: 1367 LHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG------IIKVSCGEDKMTWFVASQ 1420
+L+ ++F + L + ++ G+ +I+ D V
Sbjct: 186 --------AEKLKSLIFPTLFLSKLGSVSWEVNGEAGGYSKQVIRTVQRHDLTARLV--- 234
Query: 1421 KLRAGVIRPDVKTTEIALALTLQESNEG---------NYGPL-LYQQPVFAFLPLRT-YG 1469
+L + V R +TTE + L + EG + G L QP F F P +
Sbjct: 235 RLASTVGR---QTTEQDIWLFSPATVEGLPFCVGFSVSDGKLRATSQPAFCFFPTKEPTN 291
Query: 1470 LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRE 1520
L F++ F+L SRE + NQ ++ + L + LP R+
Sbjct: 292 LSFMVHAPFLLTDSREGIKAGEKHNQDMVEQLARLAADS------LPVLRD 336
>gi|392843829|gb|AFM84518.1| sacsin, partial [Polypterus senegalus]
Length = 735
Score = 56.6 bits (135), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD +Y PT LQ + V NN Q F+
Sbjct: 416 YPSEKEMLKELLQNADDAKATEIYFVYDPRDHPTDRIFDDKWIPLQGPSLCVYNN-QPFT 474
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++IR + ++G TK+G+ G G+ GIGF SV+ +TD P SN
Sbjct: 475 EDDIRGIQNLGRGTKEGN-PGKTGQYGIGFNSVYHITDCPSFISN 518
>gi|82913421|ref|XP_728637.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23485089|gb|EAA20202.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 285
Score = 56.6 bits (135), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 123/309 (39%), Gaps = 65/309 (21%)
Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNN----------------EQGFSAENIR 1254
LEL+QNADD E V F + ++G++V N+ QG +
Sbjct: 3 LELIQNADDAKAEEVV-----FDITDNGLLVRNSGQFTYCGDLNKPCAFLSQGNYKCDYH 57
Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
+ DVG S K S IGR GIGF S ++VTD PEI S+G + GQ + +
Sbjct: 58 RITDVG-SGGKLSRGENIGRFGIGFVSTYQVTDHPEIRSSGIKLALYPERGQ--WFIEAF 114
Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
P F S DP NT RL ++ + + F + LLF
Sbjct: 115 EEPDGTSFFLPWAS-DP--------NTEARLALGISHVSPTHIDQLAEDFQAVLRKSLLF 165
Query: 1375 LHR-------------LQCIMFRNMLNDSLVVIRKKIVGD--GIIKVSCGEDKMTWFVAS 1419
L L C + R+ +D +V R + I++ E A+
Sbjct: 166 LRHVRKAEIRRNGQLLLACDLDRSEGSDLIVSFRPSGAVEQWHILRADAAE-------AA 218
Query: 1420 QKLRAGVIRPDV--KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQG 1476
Q+L A R + ++T I++ L + L + ++AFLP ++ GL L
Sbjct: 219 QRLYATHPRLETLHRSTNISIGLRIDPEP-------LTEGLLYAFLPTEQSTGLPLHLNA 271
Query: 1477 DFVLPSSRE 1485
DF S R+
Sbjct: 272 DFFPESDRK 280
>gi|260592925|ref|ZP_05858383.1| hypothetical protein HMPREF0973_02386 [Prevotella veroralis F0319]
gi|260535125|gb|EEX17742.1| hypothetical protein HMPREF0973_02386 [Prevotella veroralis F0319]
Length = 2288
Score = 56.6 bits (135), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 99/218 (45%), Gaps = 40/218 (18%)
Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPENVEPT---------LTFILQESGI 1239
++ R+L L ++L S F+ EL+QNA+D YP + + F + ++ +
Sbjct: 213 KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRAKINGEYQALPVDVEFHITKNYL 270
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
+ + F+ +NI A+CD+ N +K + IG KGIGFK+VF D +++ + +
Sbjct: 271 TFQHTGEYFNPKNIAAICDI-NDGEKADNTKAIGYKGIGFKTVFLDNDYVLLNTGNYTFR 329
Query: 1300 FDTSEGQI----GFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKF---- 1351
FD S + +LP ID + + + E FR KF
Sbjct: 330 FDKSATDVINTPWQILPIWTENNEIDNDIKFVFRQHPNDE-----------FRVKFALQP 378
Query: 1352 ------SEGIAMNNIVDMFSDLHPS--LLLFLHRLQCI 1381
++ +N +D+F+D+ S ++LF+ ++ +
Sbjct: 379 RDKEILTDEDRDDNYIDLFTDVFESERIILFIPNIKKV 416
>gi|260803427|ref|XP_002596591.1| hypothetical protein BRAFLDRAFT_78511 [Branchiostoma floridae]
gi|229281850|gb|EEN52603.1| hypothetical protein BRAFLDRAFT_78511 [Branchiostoma floridae]
Length = 1237
Score = 56.6 bits (135), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 71/307 (23%), Positives = 122/307 (39%), Gaps = 43/307 (14%)
Query: 1787 KNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFG 1846
+ L+ YL F+ + + +++R L+ A V+ N G PS P+HF +G
Sbjct: 73 RELLIGYLVFIKSQWDLQPDHVNLDR------LKPVAQVVNNKGLMTPSSQPVHFTPAYG 126
Query: 1847 NPVSVNMLIHDIDIKWYEVDITYL--KHPAKESLSCGLVKWRKFFKEIGITDFVQVVQ-- 1902
N + + + + W +D YL PA+ ++ WR FF ++G+ DF+ V +
Sbjct: 127 NRLDLQKKLPGNE--WTLLDSCYLGTSTPAQHTVQT----WRHFFVKLGVLDFLAVKKER 180
Query: 1903 ---VDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHL-LSLLTTNVNRQCSKHLL 1958
+ + + + W K G D+ E L ++ LT +V + L
Sbjct: 181 VKLSTEQLGKSPWSTLSSFWPKS--DEGYEVEDFTCAEFYSLVMADLTADVRTPQMQILF 238
Query: 1959 EILDTLWDDCYTDKIMGFFKSKPTGDDR-SFQSSFINCICDIQW-----------TISSM 2006
E++D WD Y K P+G +SSF+ I I W T
Sbjct: 239 EMIDKEWDGSYR-KYSTAQVFDPSGVKLCDAESSFLIHIKTIPWVPGIKTSTLPNTAPDG 297
Query: 2007 DDELHY------PKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCD--IGLKTEVTIDDI 2058
+L Y +L+ +L PY+ V++E + L+ +T D +
Sbjct: 298 SAQLFYNNVFMRGSELYEKTREHIGLLAHHVPYLTAAVQTETSSFGQVLQLQQSLTTDRL 357
Query: 2059 LEILKVW 2065
L I+K W
Sbjct: 358 LAIVKGW 364
>gi|126667620|ref|ZP_01738589.1| hypothetical protein MELB17_00250 [Marinobacter sp. ELB17]
gi|126627889|gb|EAZ98517.1| hypothetical protein MELB17_00250 [Marinobacter sp. ELB17]
Length = 1516
Score = 56.6 bits (135), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 68/146 (46%), Gaps = 27/146 (18%)
Query: 1211 LELVQNADD--------NIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCD 1258
LEL+QNADD +I + + T+ + S + L E +S + R + D
Sbjct: 43 LELIQNADDAKAHGVAFDITDDALVVWNSGAFTYCGEMSPVCRLMQEDNYSCDFHR-IAD 101
Query: 1259 VGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG---------- 1308
VG+ K S IGR GIGF S ++VTD PEIHS+G ++ E Q
Sbjct: 102 VGSGGKLAKSEN-IGRFGIGFVSAYQVTDHPEIHSSGIKLQLIPEEAQYACSNIPLETGT 160
Query: 1309 -FVLPTLVPPFNIDMFCRLLSKDPVQ 1333
FVLP + P +I R + PV+
Sbjct: 161 KFVLPWAMNPHSIGR--RAIGATPVR 184
>gi|374996799|ref|YP_004972298.1| ATPase [Desulfosporosinus orientis DSM 765]
gi|357215165|gb|AET69783.1| Putative ATPase (DUF699) [Desulfosporosinus orientis DSM 765]
Length = 1586
Score = 56.2 bits (134), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 138/339 (40%), Gaps = 63/339 (18%)
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIV-VLNNEQGFSAE 1251
R + E YS +HF+ EL+QNADD ++ F L+ + ++ V N + FS
Sbjct: 29 RGVKTSVVEKYSDQAHFIYELLQNADDA-----GATSVRFDLRYNKLIFVHNGTRRFSIS 83
Query: 1252 N---------------IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
N I A+ + NS K +S IG+ G+GFK+VF+ T P I+ F
Sbjct: 84 NPETEEADTNNGKLGDINAITSIANSNKTAAS---IGKFGVGFKAVFQYTQTPSIYDPEF 140
Query: 1297 HVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTK-FSEGI 1355
+ + F++P + D F + P + T PF + S
Sbjct: 141 RFRIER------FIVPVKID----DDFS---GRRPEE-------TLFVFPFDHRDRSREE 180
Query: 1356 AMNNIVDMFSDL-HPSLLLFLHRL---QCIMFRNMLNDSLVVIRKKIVGDGI---IKVSC 1408
A +I D L +P L L R+ +C S V ++ G+ + I ++
Sbjct: 181 AYEDISDKLRSLVYPVLFLTNLRMISVECAEKFGFYTKS--VDKEMAFGNTVARLITLTQ 238
Query: 1409 GEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-T 1467
D + KL R + + ++ L+L + EG + L P F F P +
Sbjct: 239 NNDDDENELIKDKLWM-FTRANERGFTYSVCLSL--NTEGAF--LRSNFPAFCFFPTKEV 293
Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQ---WLLSEFPA 1503
GL FIL F+L SRE + N+ LL+E A
Sbjct: 294 TGLNFILHAPFLLTDSREGIKAGEKHNKDMIKLLAELAA 332
>gi|429199883|ref|ZP_19191618.1| hypothetical protein STRIP9103_07746 [Streptomyces ipomoeae 91-03]
gi|428664454|gb|EKX63742.1| hypothetical protein STRIP9103_07746 [Streptomyces ipomoeae 91-03]
Length = 1527
Score = 56.2 bits (134), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 92/385 (23%), Positives = 155/385 (40%), Gaps = 69/385 (17%)
Query: 1202 LYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGN 1261
L S L E+VQNA+D + +V + +E ++V +N ++ A+ +
Sbjct: 76 LSSDRLQGLSEIVQNAED-LGATDVR----ILTRERELLVAHNGSPVRLPDVLAMA-MPW 129
Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNID 1321
T K A GR GIG ++ ++ E+H H +F + I P D
Sbjct: 130 LTSKAEQAESTGRFGIGLMTLQSLSPYLEVHCG--HYRFRVGDPDISVAEAFTAP----D 183
Query: 1322 MFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI 1381
F W T +R+P + + + + ++ S LLFL R++ +
Sbjct: 184 WFA-----------GDTW-TVLRVPLELGV---LGADEVDEWLAEWGDSALLFLGRVRSV 228
Query: 1382 M---------FRNMLNDSLVVIRKKIVGDGIIKVSCGE----DKMTWFVASQKL--RAGV 1426
R LN + + +VG + V GE D W V + +G+
Sbjct: 229 THLDEQGGVRHRLALNREVGEPFEAVVGGQSVDVETGEACAGDGSRWLVCRTTVPSPSGI 288
Query: 1427 IRPDVK---TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSS 1483
R TT IA+AL L + G ++A LP+ L + F +S
Sbjct: 289 TRSHKAVEPTTAIAVALPLGPDDAGR---------IYAGLPVADTTLALCVSAQFDPIAS 339
Query: 1484 REEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL---VGEVH 1540
R+ +D N+PWN+ L L+ +A L FR +PA+A VPL + +
Sbjct: 340 RQALD-NTPWNRALNELVGDLWTAAV-----LWQFRRDPARA----WRAVPLPDDMRDAR 389
Query: 1541 GFFSGLPRMILSKLR--MSNCLILE 1563
+ L R++L + R +S+ L+L+
Sbjct: 390 DPVAALERLLLDRARAGVSHGLVLD 414
>gi|47215794|emb|CAG02848.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4658
Score = 56.2 bits (134), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD IY PT +Q + V NN Q F+
Sbjct: 2618 YPSEKEMLKELLQNADDAKATEIYFIYDPRTHPTDRVFDDKWIPMQGPALCVYNN-QPFT 2676
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
+++R + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 2677 EDDVRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 2720
Score = 42.7 bits (99), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 77/189 (40%), Gaps = 45/189 (23%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPE-------NVEPTLTFILQESGIVVLN-------NEQ 1246
E Y + EL+QNADD E + L + G++ + N
Sbjct: 1547 EEYPSVADIFKELLQNADDASATECNFLIDMRKNNDIRENLLDPGMIACHGPSLWSFNNA 1606
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
FS + + +G S K+ S A +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1607 VFSDTDFLNITRLGGSVKR-SEADKVGKFGLGFNSVYHITDIPIIMSREFMIIFDPNINH 1665
Query: 1307 IG----------------------FVLPTLVPPFNIDMF-CRL-LSKD-PVQLESKCWNT 1341
I P PF I++F C+L L++D P + T
Sbjct: 1666 ISKHIRDRSNPGIKINWSKQQKRLRKFPNQFKPF-INVFGCQLPLAQDSPYKFN----GT 1720
Query: 1342 CIRLPFRTK 1350
RLPFRT+
Sbjct: 1721 LFRLPFRTE 1729
>gi|218670910|ref|ZP_03520581.1| hypothetical protein RetlG_04268 [Rhizobium etli GR56]
Length = 260
Score = 56.2 bits (134), Expect = 2e-04, Method: Composition-based stats.
Identities = 73/277 (26%), Positives = 116/277 (41%), Gaps = 42/277 (15%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
+TF L + V + FSA ++R +C + STK ++ IGR GIGFKSV+ T+ P
Sbjct: 2 VTFHLSRQSLRVSHFGAPFSAGDVRGVCGINASTKDYTA---IGRFGIGFKSVYAFTEKP 58
Query: 1290 EIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT 1349
EIHS S +V P PP +++P + T + LP
Sbjct: 59 EIHSG------SESFSISSYVFPEEAPP---------CAREPEE-------TVVLLPLAA 96
Query: 1350 KFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKV--- 1406
+ A + I + +DL LLFL ++ I + + R+ V +G ++
Sbjct: 97 --DDENAFDEIAEGLADLGAEALLFLKHVEEISW-TIEGRGSGRYRRDAVAEGFMQAVTL 153
Query: 1407 ----SCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
G + T+ + S+ + V EIA L + N P+ + V F
Sbjct: 154 LSSGDAGGQEETYLLFSRAVEND--GKYVGLAEIAF-LVADQDNRWVVQPIDDARLV-VF 209
Query: 1463 LP--LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWL 1497
P L TY F++QG + SR+ + + WN L
Sbjct: 210 FPTVLPTY-TGFLIQGPYQTTPSRDNIPTDRAWNAKL 245
>gi|449678125|ref|XP_002155302.2| PREDICTED: uncharacterized protein LOC100207584 [Hydra
magnipapillata]
Length = 238
Score = 56.2 bits (134), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 53/96 (55%), Gaps = 2/96 (2%)
Query: 200 INAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVV 259
INAFI F+ R I +L++L + ICK E F +L+LGP +Q LI F + + +
Sbjct: 144 INAFIQSFMTFRNICTLFELNQEICKMESKSSFLDLKLGPLQKQLLIYDNFKFPPNQNDI 203
Query: 260 FKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKK 295
I T+DV + + +K + + EFL+F+ KK
Sbjct: 204 PPITTSDVFRLI--WKMLNKKHNFLMIEFLEFLIKK 237
>gi|340383697|ref|XP_003390353.1| PREDICTED: sacsin-like, partial [Amphimedon queenslandica]
Length = 1275
Score = 56.2 bits (134), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 88/376 (23%), Positives = 148/376 (39%), Gaps = 79/376 (21%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + +E Y + E++QNADD ++ + + +L+
Sbjct: 909 QHEPLTTRLSNILRE-YKDGLTIIKEMIQNADDAGATEVNILHDTRMHSSHNLVLEGMAE 967
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ G+GF SV+ +TD P S
Sbjct: 968 SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 1026
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQ-----LESKCWN-TCI 1343
+ FD + + G V P I + L+K P Q S +N T
Sbjct: 1027 EWLYIFDPTLKHLKGVVQNESRPGKKIKFRSKFLAKSQQLAPYQDLFGFDSSSSYNGTIF 1086
Query: 1344 RLPFRT---KFSEGIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
RLPFRT + S I N+++ M DL + LLLFL ++ I F + ++
Sbjct: 1087 RLPFRTHPSQISSTIYNNHLIQMIKKDLESNGSKLLLFLQNVKRITFSSQQEGRPIIEH- 1145
Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQE---SNEGNYG-- 1451
+ + C D ++ ++ V + K+TE L ++ S +G Y
Sbjct: 1146 -------LSIECSNDD------TENVKVCVTQSSTKSTEYWLLSNKEQRLKSQDGAYKQA 1192
Query: 1452 ------PLLYQQP----------VFAFLPLR--TYGLKFILQGDFVLPSSREEV------ 1487
L+ +Q F FLPL GL + +F + S+R +
Sbjct: 1193 VASVACKLVKKQSSFLCEAIEGNTFCFLPLSVPATGLPVHVTANFAVMSNRSGIWTGASS 1252
Query: 1488 ----DGNSPWNQWLLS 1499
D WNQ L++
Sbjct: 1253 AMASDEREYWNQQLMT 1268
>gi|340383029|ref|XP_003390020.1| PREDICTED: sacsin-like, partial [Amphimedon queenslandica]
Length = 5665
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 63/371 (16%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ-----ESGIVVLNNEQGFSA 1250
Y L E++QNADD +Y T I + ++++N F+
Sbjct: 1579 YKDGLSILKEMIQNADDAGATEVNILYDNRKHSTKNLIFKGMADSHGPALIVHNNSTFAK 1638
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
E+ + + +TK IG+ GIGF SV+ VTD P S + FD + + +
Sbjct: 1639 EDFENITKLAGTTKANQPLK-IGQFGIGFCSVYHVTDVPSFVSAEWLYIFDPTLNYLKDI 1697
Query: 1311 LPTLVPP-----------FNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT---KFSEGIA 1356
+ P N D + S T RLPFRT + S I
Sbjct: 1698 VQNQSRPGKKIKYQSKILANTDHLVPYDGLFGFKSSSNYNGTLFRLPFRTSASQISSTIY 1757
Query: 1357 MNNIV-DMFSDL---HPSLLLFLHRLQCIMF------RNMLNDSLVVIRKKIVGDGIIKV 1406
+++V DM DL LLLFL ++ I F + ML S+ R + ++
Sbjct: 1758 NDHLVEDMKRDLVNWGSKLLLFLQHVKRITFSDFKGTKKMLKVSIDRSRIADIRHFVVTF 1817
Query: 1407 SCGEDKMT--WFVA-----SQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
S + +T W +A Q +G+ + ++A L + N +Y +
Sbjct: 1818 SPQQKCITEHWLIADAEEQQQSQNSGI------SAVASVACQLDKVNLSSYKCKKIKGSA 1871
Query: 1460 FAFLPLR--TYGLKFILQGDFVLPSSREEV----------DGNSPWNQWLL-SEFPALFV 1506
F FLPL + GL + +F + S+R + D WN+ L+ + PA +
Sbjct: 1872 FCFLPLALPSTGLPVHISANFAIMSNRTGIWTKPSLSTVSDFREQWNKKLMETTIPAAYC 1931
Query: 1507 SAERSFCDLPC 1517
+ ++ C
Sbjct: 1932 KLLQKLQEMHC 1942
>gi|379005862|ref|YP_005255313.1| superfamily I DNA/RNA helicase [Sulfobacillus acidophilus DSM 10332]
gi|361052124|gb|AEW03641.1| superfamily I DNA/RNA helicase [Sulfobacillus acidophilus DSM 10332]
Length = 1580
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 63/137 (45%), Gaps = 27/137 (19%)
Query: 1181 SNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIV 1240
+NML+K R + E YS +HF+ EL+QNADD + FIL + ++
Sbjct: 20 ANMLEKPSMR---GVRTSVVEKYSDQAHFIYELLQNADDA-----GATSARFILHDQELI 71
Query: 1241 VLNN----------------EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFR 1284
+N + + +I A+ + NS K +S IG+ G+GFK+VF+
Sbjct: 72 FAHNGIRRFSISDPDTEEEDSRNGTLGDINAITSIANSNKTTAS---IGKFGVGFKAVFQ 128
Query: 1285 VTDAPEIHSNGFHVKFD 1301
T P I+ F K +
Sbjct: 129 YTQTPRIYDPNFRFKIE 145
>gi|288926515|ref|ZP_06420434.1| hypothetical protein HMPREF0649_01945 [Prevotella buccae D17]
gi|288336727|gb|EFC75094.1| hypothetical protein HMPREF0649_01945 [Prevotella buccae D17]
Length = 2291
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 66/128 (51%), Gaps = 13/128 (10%)
Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPENVEPT---------LTFILQESGI 1239
++ R+L L ++L S F+ EL+QNA+D YP + + F + ++ +
Sbjct: 213 KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRAKINGEYQALPVDVEFHITKNYL 270
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
+ + F+ +NI A+CD+ N +K + IG KGIGFK+VF D +++ + +
Sbjct: 271 TFQHTGEYFNPKNIAAICDI-NDGEKADNTKAIGYKGIGFKTVFLDNDYVLLNTGNYTFR 329
Query: 1300 FDTSEGQI 1307
FD S +
Sbjct: 330 FDKSATDV 337
>gi|284032160|ref|YP_003382091.1| hypothetical protein Kfla_4256 [Kribbella flavida DSM 17836]
gi|283811453|gb|ADB33292.1| hypothetical protein Kfla_4256 [Kribbella flavida DSM 17836]
Length = 1234
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 46/93 (49%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
++EL QNA D V + F L++ +VV N SA + +L + S K+
Sbjct: 43 RVVVELAQNAADAAVRAGVPGRVRFSLRDGTLVVANTGTPLSAAGVESLATLRASAKRDD 102
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
+ +GR G+GF +V VTD P I S V+F
Sbjct: 103 AESAVGRFGVGFAAVLAVTDEPVILSRTGGVRF 135
>gi|392843735|gb|AFM84471.1| sacsin, partial [Lepisosteus oculatus]
Length = 736
Score = 55.8 bits (133), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ PT +Q + V NN Q F+
Sbjct: 417 YPSEKEMLKELLQNADDAKATEIYFVFDPRSHPTDRIFDDKWVPMQGPALCVYNN-QPFT 475
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++IR + ++G TK+G+ G G+ GIGF SV+ +TD P SN
Sbjct: 476 EDDIRGIQNLGRGTKEGN-PGKTGQYGIGFNSVYHITDCPSFISN 519
>gi|410909740|ref|XP_003968348.1| PREDICTED: sacsin-like [Takifugu rubripes]
Length = 4585
Score = 55.8 bits (133), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 15/106 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD IY PT +Q + V NN Q F+
Sbjct: 2547 YPSEKEMLKELLQNADDAKATEIFFIYDPRTHPTDRVFDDKWIPMQGPALCVYNN-QPFT 2605
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
+++R + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 2606 EDDVRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYYITDCPSFISNS 2650
Score = 48.5 bits (114), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 95/236 (40%), Gaps = 56/236 (23%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPE-------NVEPTLTFILQESGIVVLN-------NEQ 1246
E Y + EL+QNADD E + L + G++V + N
Sbjct: 1476 EEYPSVADIFKELLQNADDASATECNFLIDMRKNNEIRENLLDPGMIVCHGPSLWSFNNA 1535
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
FS + + +G S K+ S A +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1536 VFSDTDFLNITRLGGSVKR-SEADKVGKFGLGFNSVYHITDIPIIMSREFMIIFDPNINH 1594
Query: 1307 I----------GFVL------------PTLVPPFNIDMF-CRLLSKDPVQLES--KCWNT 1341
I G + P PF I++F C L P+ +S K T
Sbjct: 1595 IRKHIRDRSNPGIKINWSKQQKRLRKFPNQFKPF-INVFGCEL----PLAQDSPYKYNGT 1649
Query: 1342 CIRLPFRTK-----------FSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
RLPFRT+ + + ++VD FS +LF + C++ + +
Sbjct: 1650 LFRLPFRTEQEASQSEISSLYYNTTDIYSLVDEFSICGHRFILFTQHVGCMLLKYL 1705
>gi|301617805|ref|XP_002938327.1| PREDICTED: sacsin-like [Xenopus (Silurana) tropicalis]
Length = 3379
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 101/437 (23%), Positives = 169/437 (38%), Gaps = 115/437 (26%)
Query: 1203 YSQDSHFLLELVQNADDN---------IYPENVEPTLTFILQE----SGIVVLN-NEQGF 1248
YS+ EL+QNADD +N E + I + G + + N++ F
Sbjct: 707 YSEQVELFKELIQNADDAEATICQFLVDMRQNSESRQSLIDPDMANCHGAALWSYNDRKF 766
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD------- 1301
+ + R + +G +TK+ + IG+ G+GF +V+ +TD P I S + FD
Sbjct: 767 TDNDFRNITRIGAATKE-TEVKKIGKFGLGFNTVYHLTDVPSIMSGSNVLIFDPNINHIK 825
Query: 1302 ---TSEGQIGFVLPTLVPPFNIDMF------------CRLLSKDPVQLESKCWNTCIRLP 1346
++ G G L + P +++F C L P + T IRLP
Sbjct: 826 KHISNSGNPGIKLNLQMKPEILNVFLDQFQPFSGVFGCEL--NQPFYYDG----TLIRLP 879
Query: 1347 FRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHR-----LQCI--------- 1381
FRT+ + +++ + F + +LL+FL L C+
Sbjct: 880 FRTEKEAKDSQICQQAFNKEQIHSFIHNFEEAASTLLIFLKNIEKVTLNCLEKDLCPQAD 939
Query: 1382 ---------------MFRNML-----NDSLVVIRKK-----IVGDGIIKVSCGEDKMT-- 1414
M RNM+ +SL V+R + G IIK++ + + T
Sbjct: 940 ESRVMVQRDKVQTLDMPRNMILQQEQMNSLNVLRMNVNNLDVTGSHIIKITVRQTETTEE 999
Query: 1415 -WFVASQKL----------RAGVIRPDVKTTEIALALTLQESNEGNYGPLL--YQQPVFA 1461
+++A L + + + +AL L + + G + P L + VF
Sbjct: 1000 MYYLAQSSLGIKESIQMFSQNKKLNTYLPVAGVALPLK-KHCDTGKWAPNLSSFSGMVFC 1058
Query: 1462 FLPLR-TYGLKFILQGDFVLPSSRE---EVDGNSPWNQWLLSEFPALFVSAERSFCDLPC 1517
FLPL + GL F L G F + S+R+ + WN+ LL + A V+ + L
Sbjct: 1059 FLPLPVSSGLPFHLNGTFAVMSNRKSLWDTTEKGEWNRTLLGD--AAVVALITALAQLQI 1116
Query: 1518 FRENPAKAASVYMSFVP 1534
N Y +F P
Sbjct: 1117 LSNNGDIQDYSYHTFWP 1133
>gi|300726908|ref|ZP_07060335.1| hypothetical protein PBR_1023 [Prevotella bryantii B14]
gi|299775811|gb|EFI72394.1| hypothetical protein PBR_1023 [Prevotella bryantii B14]
Length = 2150
Score = 55.5 bits (132), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 126/317 (39%), Gaps = 65/317 (20%)
Query: 1209 FLLELVQNADDNIYPEN--VEPT-LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
F+ EL+QNA+D Y N +E + F + +V +++ F+ NI A+C + N +K
Sbjct: 17 FIYELLQNANDYPYKANNTIEKVNVEFHITRDSLVFMHSGAVFNERNIAAICSI-NDKEK 75
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
+ IG KGIGFK+VF + + + F +FD E + P + P
Sbjct: 76 TDNKETIGYKGIGFKTVFLDNNYVYLQTGDFSFRFDREESRDIVDTPWQILPI------- 128
Query: 1326 LLSKDPVQLESKCWNTCIRLPFRTKFS------EGI--AMNNIVDMFSDL--HPSLLLFL 1375
+ + + K T FR KFS E + + N V MF D+ + ++LF+
Sbjct: 129 WTNYNELTPSEKYVFTNADKQFRVKFSLRPTNIETLRGSGQNYVQMFKDVFKNERVILFI 188
Query: 1376 HRLQCI-MFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
L + +F N ++ LV C D W V + + P++ T
Sbjct: 189 PNLSSVKVFFNESDEPLV---------------CNCDNEHWQVNH---FSDDVSPEI-TK 229
Query: 1435 EIALALTLQESNEGNYGPLLY-----------------------QQPVFAFLPLRT-YGL 1470
I L + QE P Y V+ +LP + +G
Sbjct: 230 SINLDIDKQEDVGALKIPTKYYDFTKTKVSFACEIDGVQLKEVEDTLVYCYLPTKAKWGF 289
Query: 1471 KFILQGDFVLPSSREEV 1487
KF++ D + R++V
Sbjct: 290 KFLMNTDMIPTGPRDDV 306
>gi|302813581|ref|XP_002988476.1| hypothetical protein SELMODRAFT_159510 [Selaginella moellendorffii]
gi|300143878|gb|EFJ10566.1| hypothetical protein SELMODRAFT_159510 [Selaginella moellendorffii]
Length = 4500
Score = 55.5 bits (132), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 89/220 (40%), Gaps = 46/220 (20%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI-----------------VVLN 1243
E+Y+ L EL+QNADD E ++F+L +S +
Sbjct: 1363 EMYADGPGTLFELLQNADDAGATE-----VSFLLDKSSFGTNSVLSRQMADWQGPALYCY 1417
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
N+ F E+ A+ +G ++K + IGR G+GF SV+ TD P S V D
Sbjct: 1418 NDSVFKPEDFYAISRIGQNSKLDKPSA-IGRFGLGFNSVYHFTDVPSFVSGSTIVLLDPH 1476
Query: 1304 EGQIGFVLPTLVPPFNIDMFCR-LLSKDPVQLES----KC------WNTCIRLPFRTKFS 1352
G + + P P I+ + LL + P Q ES C T R P RT +
Sbjct: 1477 AGNLPGISPA-QPGLKINFVGKELLKQFPDQFESFLLFGCDLQKPFLGTLFRFPLRTTVT 1535
Query: 1353 EGIAM--------NNIVDMFSDLHPS---LLLFLHRLQCI 1381
+ ++V +FS L S +LLFL + I
Sbjct: 1536 AARSEIKQDPYQPEDVVTLFSALKKSAGEVLLFLRHVTSI 1575
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 92/401 (22%), Positives = 148/401 (36%), Gaps = 104/401 (25%)
Query: 1203 YSQDSHFLLELVQNADD--------------NIYPENVEPTLTFILQESGIVVLN-NEQG 1247
Y + + L E++QNADD + V P L Q G +L N
Sbjct: 21 YPEGTTILKEMIQNADDAGASQVCFCVDRRRHAVDSLVSPPLA---QWQGPALLAYNNAA 77
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
F+ E+ +++ +G+S K+G A GR GIGF SV+ ++D P S G HV F + Q
Sbjct: 78 FTEEDFQSISRIGDSKKRGQ-AWKTGRFGIGFNSVYHLSDLPSFVS-GSHVVFFDPQCQF 135
Query: 1308 GFVLPTLVPPFNIDMFCRL-LSKDPVQLESKCWNTC----------IRLPFR-------T 1349
+ +L P ID L P Q C C R P R +
Sbjct: 136 LPSVSSLNPGKRIDFVASTALENFPDQFSPYCVFGCDMRRPFGGTLFRFPLRSAEQAATS 195
Query: 1350 KFS-EGIAMNNIVDMFSDLHPSL---LLFLHRLQCIMF---------------------- 1383
K S + +++++ DL+ + +LFL ++ + F
Sbjct: 196 KLSKQSYTSTDMLELLRDLYHEIVQVMLFLKNVERVEFYEWSAGSSSPTLLYSCAVRLPT 255
Query: 1384 ------RNML------------NDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
R + ++ V R ++V + G++ ++ F+ SQ + +
Sbjct: 256 SEIRSHRQLFLRLSKGKDAAASEETKDVFRLELVTNAYTGADSGKESLSSFLISQAMGSS 315
Query: 1426 VIRPDVKTT----EIALALTLQESNEGNYGP-------LLYQQPVFAFLPLRT-YGLKFI 1473
R T L L S + P LL + F FLPL G+ F
Sbjct: 316 ASRIGALATNAFKSYGLRLLPWASVAASILPGKVSIYCLLVEGRAFCFLPLPIKTGMPFH 375
Query: 1474 LQGDFVLPSSREEV----------DGNSPWNQWLLSEFPAL 1504
+ G F L ++R ++ S WNQ +L + AL
Sbjct: 376 VNGYFELSANRRDIWHGSDMDRMGKLRSDWNQCVLEDVAAL 416
>gi|291238887|ref|XP_002739357.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 4368
Score = 55.5 bits (132), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 57/115 (49%), Gaps = 15/115 (13%)
Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGIVVLN-------NEQ 1246
E+Y + EL+QNADD + + + L + G+ + N+
Sbjct: 1403 EVYKEGVDIFKELIQNADDAGATDIRFLIDQRQNTDVQLKLLDPGMKHCHGPALWSYNDA 1462
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
FS+E+IR +CD+ +TKK + IGR G+GF SV+ VTD P + S + FD
Sbjct: 1463 MFSSEDIRNICDIAAATKK-NKLDKIGRFGLGFTSVYHVTDVPSVVSGSHVMIFD 1516
Score = 49.3 bits (116), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 16/97 (16%)
Query: 1212 ELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCD 1258
EL+QNADD IY PT + +QES + + N++ FSA +I +
Sbjct: 2445 ELLQNADDAMATELHIIYDPRSHPTGKIVSDLWKPIQESPAICVFNDRVFSAGDIEGIQQ 2504
Query: 1259 VGNSTKKGSSAGYI-GRKGIGFKSVFRVTDAPEIHSN 1294
VG K S+ GY G+ GIGF SV+ +TD P S+
Sbjct: 2505 VGIGGK--SADGYTTGQYGIGFNSVYHLTDCPSFLSD 2539
>gi|323448598|gb|EGB04495.1| hypothetical protein AURANDRAFT_67157 [Aureococcus anophagefferens]
Length = 4616
Score = 55.1 bits (131), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 112/280 (40%), Gaps = 61/280 (21%)
Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE------------- 1224
N+E+ ++ + R H L +LY + S L ELVQNADD E
Sbjct: 1449 NVETEAFGQRESLTRRLAHIL--QLYPEGSGILNELVQNADDAGASEVRVLLDARSAATR 1506
Query: 1225 NVEPTLTFILQ--ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV 1282
+ L+ L + + ++N+ GF+ + R L VG +K + GR G+GF SV
Sbjct: 1507 GAQSLLSAGLAAWQGPALCVSNDAGFTESDWRNLVRVGQGSKL-ENFKTTGRFGLGFNSV 1565
Query: 1283 FRVTDAPEIHSNGFHVKF--DTSEGQI----------------GFVLPTLVPPFNIDMFC 1324
+ +TD P++ S G HV F ++G + G LP P D F
Sbjct: 1566 YHLTDVPQVAS-GDHVLFLDPHADGLVPGATPQRPGLRVKFTGGADLPAAFP----DQFA 1620
Query: 1325 RLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLL 1373
LE T RLP R++ + + +++ F S+LL
Sbjct: 1621 SWADCG-CDLEKPLEGTLFRLPLRSRAAAATSEISKQHYGVEEAEALLEQFKAQATSVLL 1679
Query: 1374 FLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKM 1413
FL ++ + ++ V+R + + + + ED +
Sbjct: 1680 FLRHVRAV--------TVTVLRDGGESETVFRAAAREDPL 1711
>gi|170757814|ref|YP_001782495.1| transcriptional regulator [Clostridium botulinum B1 str. Okra]
gi|429246935|ref|ZP_19210220.1| transcriptional regulator [Clostridium botulinum CFSAN001628]
gi|169123026|gb|ACA46862.1| putative transcriptional regulator [Clostridium botulinum B1 str.
Okra]
gi|428756042|gb|EKX78628.1| transcriptional regulator [Clostridium botulinum CFSAN001628]
Length = 125
Score = 55.1 bits (131), Expect = 5e-04, Method: Composition-based stats.
Identities = 33/95 (34%), Positives = 54/95 (56%), Gaps = 4/95 (4%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNE 1245
Q +L+ LS +LY+ F+ E++QNADD+ +N E +I + +VV ++
Sbjct: 22 QAVNQASSLNALSGDLYTDSKRFIYEILQNADDS--SQNSEAVKVWIKILGDNLVVAHSG 79
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFK 1280
+ F+ +++ +C+V N TKK S G KGIGFK
Sbjct: 80 KPFTTRDLQGICNVNNGTKK-SDLTKTGYKGIGFK 113
>gi|302866235|ref|YP_003834872.1| helicase domain-containing protein [Micromonospora aurantiaca ATCC
27029]
gi|302569094|gb|ADL45296.1| helicase domain protein [Micromonospora aurantiaca ATCC 27029]
Length = 1568
Score = 55.1 bits (131), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 64/252 (25%), Positives = 101/252 (40%), Gaps = 37/252 (14%)
Query: 1210 LLELVQNADD----NIYPENVEPTLTFILQESG--IVVLNNEQGFSAENIRALCDVGNST 1263
LLELVQNA D N + + +L G + N + FS + A+ S
Sbjct: 54 LLELVQNAADAMAGNTDSDENTGRVEIVLDSVGGALYCANGGRPFSKRGLTAITHAHLSG 113
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV--------LPTLV 1315
K+G IGR G+GFKSV V++ P++ S +F++ E + P L
Sbjct: 114 KRGDE---IGRFGLGFKSVLAVSNTPQVFSRSVSFEFNSPEARSAIAEVGSGAKRYPALR 170
Query: 1316 PPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
ID DPV W T ++LP + I++ S LLF
Sbjct: 171 TATTIDAMA-AFGHDPVLRGLAEWAATIVKLPNVSSIPR--LRKEIIEFSS----HFLLF 223
Query: 1375 LHRLQCIMFRNMLNDSLVVIR-KKIVGDGIIKVS-CGEDKMTWFV----------ASQKL 1422
+ ++ + R + D + +GDG+ K+ G + W+V A +++
Sbjct: 224 VGAVREVRLRILGADHFEATHTSRNLGDGLFKIERSGGEYEEWYVEDRMHAPSAEARRQV 283
Query: 1423 RAGVIRPDVKTT 1434
V R +VK T
Sbjct: 284 GEAVSRSEVKVT 295
>gi|296271243|ref|YP_003653875.1| hypothetical protein Tbis_3292 [Thermobispora bispora DSM 43833]
gi|296094030|gb|ADG89982.1| hypothetical protein Tbis_3292 [Thermobispora bispora DSM 43833]
Length = 985
Score = 54.7 bits (130), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 79/180 (43%), Gaps = 10/180 (5%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
++EL QNA D V L L++ + N A + +LC + S KK
Sbjct: 40 RLVVELAQNAADAALRAGVPGRLRLTLRDGVLYAANVGAPLDAAGVESLCTLRASAKK-D 98
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGF-HVKFDTSE-GQIGFVLPTLVPPF-NIDMFC 1324
G GR G+GF +V VTD P I S V++ +E ++ P L D
Sbjct: 99 DIGAAGRFGVGFAAVVSVTDTPAIMSRATGGVRWSRAETAELVAARPELAEELARRDGHV 158
Query: 1325 RLLSKDPVQLE----SKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
LL + P +E + ++T +RLP R + +E A+ ++D S P L L R++
Sbjct: 159 PLL-RLPFPVEDLEVPEGFDTVVRLPLRDEAAEQ-AVRRMLDEASPALPIALPALERIEI 216
>gi|392843741|gb|AFM84474.1| sacsin, partial [Acipenser sinensis]
Length = 386
Score = 54.7 bits (130), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 54/105 (51%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADD----NIY----PENVEPTLTF-----ILQESGIVVLNNEQGFS 1249
Y + L EL+QNADD IY P + F LQ + V NN Q F+
Sbjct: 67 YPSEKEMLKELLQNADDAKATEIYFVFDPREHQADRIFDDKWIPLQGPALCVYNN-QPFT 125
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
+++R + ++G TK+G+ G G+ GIGF SV+ +TD P SN
Sbjct: 126 EDDVRGIQNLGRGTKEGN-PGKTGQYGIGFNSVYHITDCPSFISN 169
>gi|373500743|ref|ZP_09591116.1| hypothetical protein HMPREF9140_01234 [Prevotella micans F0438]
gi|371951701|gb|EHO69544.1| hypothetical protein HMPREF9140_01234 [Prevotella micans F0438]
Length = 1860
Score = 54.7 bits (130), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 11/125 (8%)
Query: 1186 KQHAR------LGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESG 1238
KQH + L + Q LY+ EL+QNADD PEN V+ L L ++
Sbjct: 464 KQHTSPNTNTSVANLLEEVGQNLYTSKKRMFFELLQNADDAA-PENGVKVKLQ--LSDNY 520
Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
V+ ++ F+ + ++ STK + G KGIGFKSVF +++ I S G++
Sbjct: 521 FVLTHDGFAFNKHDFESITSAAKSTKSANKKK-TGYKGIGFKSVFTNSESVLIKSAGYNF 579
Query: 1299 KFDTS 1303
FD S
Sbjct: 580 SFDKS 584
Score = 43.1 bits (100), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 41/78 (52%), Gaps = 3/78 (3%)
Query: 1424 AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQ-PVFAFLPLRTYGLKF--ILQGDFVL 1480
AG+ TE +++ ++ EG+ P+ + ++A+LP+ + KF L DF+
Sbjct: 781 AGIPDRIASATETSISFAIRLDEEGHVMPIGKDELSLYAYLPMNEHRFKFPFYLNADFIP 840
Query: 1481 PSSREEVDGNSPWNQWLL 1498
S RE + ++PWN +L
Sbjct: 841 KSDREGIQSDNPWNYFLF 858
>gi|348508402|ref|XP_003441743.1| PREDICTED: sacsin-like [Oreochromis niloticus]
Length = 4656
Score = 54.7 bits (130), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 53/106 (50%), Gaps = 15/106 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ PT +Q + V NN Q F+
Sbjct: 2618 YPSEKEMLKELLQNADDAKATEIYFVFDPRTHPTDRIFDDKWIPMQGPALCVYNN-QPFT 2676
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
++IR + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 2677 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISNN 2721
Score = 45.4 bits (106), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 89/231 (38%), Gaps = 46/231 (19%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPE-------NVEPTLTFILQESGIVVLN-------NEQ 1246
E Y + EL+QNADD E + L + G++V + N
Sbjct: 1547 EEYPSVADIFKELLQNADDASATECSFLIDMRKNTDIRENLLDPGMIVCHGPALWSFNNA 1606
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
FS + + +G S K+ A +G+ G+GF SV+ VTD P I S F + FD +
Sbjct: 1607 VFSDTDFLNITRLGGSVKR-CEADKVGKFGLGFNSVYHVTDIPIIMSREFMIMFDPNINH 1665
Query: 1307 IGFVLPTLVPP---FNIDMFCRLLSKDPVQLES-----------------KCWNTCIRLP 1346
I + P N + R L K P Q + K T RLP
Sbjct: 1666 ISKHIRDRSNPGIKINWNKQQRRLRKFPNQFKPFINVFNCQLPLAQDSPYKYNGTLFRLP 1725
Query: 1347 FRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
FRT+ ++ + ++VD FS +LF + ++ + +
Sbjct: 1726 FRTEQEASVSEISSLYYNTTDIYSLVDEFSICGHRFILFTQHVGSMVLKYL 1776
>gi|168698281|ref|ZP_02730558.1| hypothetical protein GobsU_02097 [Gemmata obscuriglobus UQM 2246]
Length = 1863
Score = 54.7 bits (130), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 59/107 (55%), Gaps = 6/107 (5%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNNEQ 1246
+ ++ A +S E + + FL+EL+QNA+D + ++ T +T E + + N+ Q
Sbjct: 42 YVQIANAAEHISAEYHGR---FLIELLQNANDQAVRQGLKDTAVTIHWTERLLAIGNSGQ 98
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
F + A+ + S K+ +G KGIGFK+VF+V D+ EI+S
Sbjct: 99 PFDEAKVDAITSIFKSDKQADKC--LGNKGIGFKAVFQVADSAEIYS 143
>gi|432894177|ref|XP_004075943.1| PREDICTED: sacsin-like [Oryzias latipes]
Length = 4579
Score = 54.7 bits (130), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ PT +Q + V NN Q F+
Sbjct: 2541 YPSEKEMLKELLQNADDAKATEIFFVFDSRTHPTDRIFDDKWVPMQGPALCVYNN-QPFT 2599
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
+++R + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 2600 EDDVRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFVSN 2643
Score = 48.1 bits (113), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 97/235 (41%), Gaps = 54/235 (22%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPE-----NVEPTLTFI--LQESGIVVLNNEQGFSAEN- 1252
E Y + EL+QNADD E ++ L L + G++V + +S N
Sbjct: 1470 EEYPSVADIFKELLQNADDASATECSFLIDMRKNLDIRENLLDPGMIVCHGPSLWSFNNA 1529
Query: 1253 IRALCDVGNSTKKGSS-----AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
+ D N T+ G S A +G+ G+GF SV+ +TD P I S F + FD + I
Sbjct: 1530 VFTDTDFLNITRLGGSIKRCEADKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNINHI 1589
Query: 1308 G----------------------FVLPTLVPPFNIDMF-CRL-LSKD-PVQLESKCWNTC 1342
P PF I++F C+L L++D P + T
Sbjct: 1590 SKHIRDRSNPGIKINWSKQQKRLRKFPNQFKPF-INVFNCQLPLAQDSPYKFN----GTL 1644
Query: 1343 IRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
RLPFRT+ ++ + ++VD FS +LF + C++ + +
Sbjct: 1645 FRLPFRTEQEAAVSEISSLYYNTTDIYSLVDEFSICGHRFILFTQHVGCMILKYL 1699
Score = 42.7 bits (99), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 15/119 (12%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPEN---VEPTLTFIL---QESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E VE + + Q + + V NN F+
Sbjct: 109 YPEGGQILKELIQNAEDAGATEVRFMYDETEYGVESLWSPDMAQHQGAAMYVYNNAV-FT 167
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
E+ + ++ S K+ +GR GIGF SV+ +TD P I S D + G
Sbjct: 168 GEDWNGIQEIARSRKRDDPLK-VGRFGIGFNSVYHITDVPSIFSGNQIAMLDPHQTLFG 225
>gi|345885641|ref|ZP_08837002.1| hypothetical protein HMPREF0666_03178 [Prevotella sp. C561]
gi|345045034|gb|EGW48983.1| hypothetical protein HMPREF0666_03178 [Prevotella sp. C561]
Length = 352
Score = 54.7 bits (130), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 11/125 (8%)
Query: 1186 KQHAR------LGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESG 1238
KQH + L + Q LY+ EL+QNADD PEN V+ L L ++
Sbjct: 110 KQHTSPNTNTSVANLLEEVGQNLYTSKKRMFFELLQNADDAA-PENGVKVKLQ--LSDNY 166
Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
V+ ++ F+ + ++ STK + G KGIGFKSVF +++ I S G++
Sbjct: 167 FVLTHDGFAFNKHDFESITSAAKSTKSANKKK-TGYKGIGFKSVFTNSESVLIKSAGYNF 225
Query: 1299 KFDTS 1303
FD S
Sbjct: 226 SFDKS 230
>gi|390340927|ref|XP_798327.3| PREDICTED: sacsin-like, partial [Strongylocentrotus purpuratus]
Length = 4131
Score = 54.3 bits (129), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 69/257 (26%), Positives = 112/257 (43%), Gaps = 44/257 (17%)
Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
I+ES GLGP+ ++ + Q L + + + Y D L ELVQNADD+
Sbjct: 2640 IIESYSVPVPGLGPHTLEAQNLLQFGQREPLTERIKNILKG-YPCDDTILKELVQNADDS 2698
Query: 1221 IYPE-------NVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
E P LQ + V NN + F+ ++I + +G +K+
Sbjct: 2699 CATEVHLVLDVRTHPKDRLFRDEMSHLQGPALCVYNN-KSFTDDDIEGIQRLGVGSKRNL 2757
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGF-------HVKF--DTSEGQIGFVL-PT--LV 1315
S+ +GR G+GF +V+ +TD P ++NG +VK+ ++ GF+ PT L
Sbjct: 2758 SSK-VGRFGVGFNAVYHLTDCPSFYTNGDTLGMFDPNVKYVPKATKAYPGFMAKPTGKLK 2816
Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGI-----------AMNNIVDMF 1364
F C L P E + +T RLP RTK + ++ ++ F
Sbjct: 2817 DDFPNVFNCYL----PEFFEGQ-GSTMFRLPLRTKSMAKVSHISSEAHDVKSVKGLLRKF 2871
Query: 1365 SDLHPSLLLFLHRLQCI 1381
+ P LLLFL+ ++ +
Sbjct: 2872 KEGLPDLLLFLNHVKKV 2888
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 58/120 (48%), Gaps = 15/120 (12%)
Query: 1212 ELVQNADD----NIY-----PENVEPTLTFILQ-----ESGIVVLNNEQGFSAENIRALC 1257
E++QNA+D +Y EN E + + + + + N+ F+ ++ + +
Sbjct: 1682 EMIQNAEDAGATKVYFMLDLRENSEARSSLFDKGMVDCQGAALWVYNDASFTPDDFQNIT 1741
Query: 1258 DVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPP 1317
+G +TK+ ++ IGR GIGF SV+ +TD P S + FD +G +L + P
Sbjct: 1742 KLGGATKELDTSK-IGRFGIGFNSVYHLTDVPSFLSREYLQVFDPHATHLGSMLKSKDSP 1800
>gi|392843739|gb|AFM84473.1| sacsin, partial [Sphyrna lewini]
Length = 735
Score = 54.3 bits (129), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 7/87 (8%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
+F+ + Q+ D I+ E P LQ I V NN Q F+ ++IR + ++G TK+G+
Sbjct: 439 YFIFDPRQHPGDRIFDEKWAP-----LQGPAICVYNN-QPFTEDDIRGIQNLGRGTKEGN 492
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
G G+ GIGF SV+ +TD P S+
Sbjct: 493 -PGKTGQYGIGFNSVYHITDCPAFISD 518
>gi|297194170|ref|ZP_06911568.1| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
25486]
gi|297152141|gb|EDY65904.2| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
25486]
Length = 1584
Score = 54.3 bits (129), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 153/366 (41%), Gaps = 59/366 (16%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
L ELVQNA D I ++ + +L ++ + N + E + + S K+G
Sbjct: 50 LFELVQNAADEIA-DHPGGKIQVVLTDANLYCANEGNPVTPEGAETILRMSMSKKRG--- 105
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--SEGQIGFVLPTLV----PPFNIDMF 1323
G IGR G+G KSV VTDAPE S FD S QI V P + P F+ +
Sbjct: 106 GQIGRFGVGVKSVLVVTDAPEFFSRTGSFGFDRAWSYEQIKAV-PGVTDRYGPTFDAPVL 164
Query: 1324 --CRLL------SKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFS----DLHPS 1370
R L + DP+ E W T +RLP K ++G ++ S + P
Sbjct: 165 RTARPLDDAAERAADPILDELLGWATTVVRLPL-LKGADGRLGKDMHGGGSSDREEFPPG 223
Query: 1371 LLLFLHRLQCIMF-----RNMLNDSLVVIRKKIVGD--GIIKVSCGEDKMT--WFV---- 1417
LF + ++ R M +L + VGD + +V G+ + W V
Sbjct: 224 FQLFSPHVAEVVLEDRRPRPMARRTLTT---RQVGDVRTVWEVRTGKAASSEEWRVFTVT 280
Query: 1418 ----ASQKLRAGVIRPDVKTTEIALALTLQESN--EGNYG-PLLYQQPVF-AFLPLR-TY 1468
+ + AG + D T +++ A+ + E + G YG P + + F +F P +
Sbjct: 281 HEPGQTARADAGELH-DRVTLDVSWAVPVLERDPKSGLYGSPRIRGRGRFWSFFPTKYEM 339
Query: 1469 GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASV 1528
L +L G + R+ + +SP+NQ ++ AL V + R P++ +
Sbjct: 340 SLSGMLNGAWKTNEDRQNLLDSSPFNQEMIKVSAALVVDSLRHLA--------PSEDPAA 391
Query: 1529 YMSFVP 1534
Y+ +P
Sbjct: 392 YLRLLP 397
>gi|392843857|gb|AFM84532.1| sacsin, partial [Hypostomus plecostomus]
Length = 736
Score = 54.3 bits (129), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ PT +Q + V NN Q F+
Sbjct: 417 YPSEKEMLKELLQNADDAKATEIYFVFDPRTHPTDRIFDDKWVPMQGPALCVYNN-QPFT 475
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++IR + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 476 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519
>gi|392843853|gb|AFM84530.1| sacsin, partial [Pelophylax nigromaculatus]
Length = 736
Score = 54.3 bits (129), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRHHPSDRIFDEKWTP-----LQGPALCVFNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G G+ GIGF SV+ +TD P + SN FD
Sbjct: 472 -QPFTDDDIRGIQNLGRGTKEGN-PGKTGQYGIGFNSVYHITDCPSFVSSNDIICIFD 527
>gi|392843839|gb|AFM84523.1| sacsin, partial [Pangasianodon hypophthalmus]
Length = 736
Score = 53.9 bits (128), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ PT +Q + V NN Q F+
Sbjct: 417 YPSEKEMLKELLQNADDAKATEIYFVFDPRTHPTDRIFDDKWVPMQGPSLCVYNN-QPFT 475
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++IR + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 476 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519
>gi|47178510|emb|CAG14162.1| unnamed protein product [Tetraodon nigroviridis]
Length = 122
Score = 53.9 bits (128), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/33 (69%), Positives = 27/33 (81%)
Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
GDF +PSSRE+VD +SPWNQWL SE P LF+ A
Sbjct: 75 GDFDIPSSREDVDRDSPWNQWLRSEIPQLFLQA 107
>gi|392843865|gb|AFM84536.1| sacsin, partial [Scincella reevesii]
Length = 736
Score = 53.9 bits (128), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 55/118 (46%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEVCFVFDPRQHPSDRIFDEKWSP-----LQGPALCVFNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGN-PGKTGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|348684143|gb|EGZ23958.1| hypothetical protein PHYSODRAFT_479380 [Phytophthora sojae]
Length = 4858
Score = 53.9 bits (128), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 93/213 (43%), Gaps = 39/213 (18%)
Query: 1203 YSQDSHFLLELVQNADD------NI---YPENVEPTLTF----ILQESGIVVLNNEQGFS 1249
Y + + E+VQNADD N+ Y ++ L + Q ++V NN F+
Sbjct: 29 YPEGTSIFKEMVQNADDAGATEVNLCLDYRQHASTGLAYEKLGSFQGPSLLVHNNAT-FT 87
Query: 1250 AENIRALCDVGNSTKKGSSAGY-IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
+ +++ +G+S KK +S G+ GR G+GF SV+ VTD P S V FD +
Sbjct: 88 DADFQSIQRIGDSLKKDNSKGWKTGRFGVGFNSVYHVTDLPTFVSGSQLVFFDPQACHLP 147
Query: 1309 FVLPTLVPPFNIDMFCR--LLSKDPVQLES-KCW---------NTCIRLPFRT------- 1349
V P+ P ID L++ P Q+ +C+ T RL RT
Sbjct: 148 NVNPS-NPGKMIDYIAHPDLVNNFPDQIAPFRCFGCNFSEPFSGTIFRLALRTEDQAQRS 206
Query: 1350 ----KFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
+ M ++ F+D P++LLFL +
Sbjct: 207 KLSNRAQTPAKMAEMLKEFADSLPTVLLFLRNV 239
>gi|423480469|ref|ZP_17457159.1| hypothetical protein IEQ_00247 [Bacillus cereus BAG6X1-2]
gi|401147405|gb|EJQ54907.1| hypothetical protein IEQ_00247 [Bacillus cereus BAG6X1-2]
Length = 702
Score = 53.9 bits (128), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 71/139 (51%), Gaps = 15/139 (10%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESG-IVVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
+E++QNADD + T ++ + +G + + NN + F +++R++C G S KK
Sbjct: 46 FIEMLQNADD------AQSTAFYVCEVNGHVFIANNGKPFDDKDVRSICRSGASEKKRGE 99
Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS--EGQIGFV----LPTLVPPFNIDM 1322
IG +G+GFKS ++ IHSN F S ++G + +PT+ PF ++
Sbjct: 100 T--IGYRGVGFKSTTHLSKNIYIHSNECTFTFSKSLVAERLGEINENKVPTIRIPFLLES 157
Query: 1323 FCRLLSKDPVQLESKCWNT 1341
++K QL+ K + T
Sbjct: 158 LDADIAKTLNQLKQKGFTT 176
>gi|408375861|ref|ZP_11173500.1| hypothetical protein A11A3_17045, partial [Alcanivorax hongdengensis
A-11-3]
gi|407764271|gb|EKF72769.1| hypothetical protein A11A3_17045, partial [Alcanivorax hongdengensis
A-11-3]
Length = 498
Score = 53.9 bits (128), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 86/322 (26%), Positives = 127/322 (39%), Gaps = 60/322 (18%)
Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALC------------- 1257
LEL+QNADD E + F + ++G+ V N+ Q ++ C
Sbjct: 32 LELIQNADDAKAEE-----VIFDITDAGLYVRNSGQFTYCGDLERFCSFTTDGNYKCDYH 86
Query: 1258 ---DVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
DVG+ K S IGR GIGF S ++VTD PEI S G + G+ + +
Sbjct: 87 RITDVGSGGKLSRSEN-IGRFGIGFVSTYQVTDHPEIRSAGIKLALYPEHGK--WSIEEF 143
Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
P F S DP ES+ RL ++ + D F ++ LLF
Sbjct: 144 EEPAGTSFFLPWAS-DP---ESEG-----RLALGISHISSDHIDQLTDDFQEVLRKSLLF 194
Query: 1375 LHRLQCIMFRNMLNDSLVV---IRKKIVGDGIIKVSCGEDKMTWFV-------ASQKLRA 1424
L ++ R N L++ + + D II W + A+Q L A
Sbjct: 195 LRHIRKAEVRR--NGKLLMACDLDRSEETDLIISFRPSGTVEQWRILRANAEEAAQPLYA 252
Query: 1425 GVIRPDV--KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLP 1481
R + ++TEI++ L + E LLY AFLP + GL + DF
Sbjct: 253 THPRLESLHRSTEISIGLRVDP--EPITDGLLY-----AFLPTEQATGLPLHINADFFPE 305
Query: 1482 SSREEV-----DGNSPWNQWLL 1498
S R+ V WN+ L+
Sbjct: 306 SDRKAVIFAGHQHEQAWNEMLI 327
>gi|291234401|ref|XP_002737137.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 4311
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 58/117 (49%), Gaps = 19/117 (16%)
Query: 1201 ELYSQDSHFLLELVQNADDN---------IYPENVEPTLTFILQESGIVVLN-------N 1244
E+Y + EL+QNADD + +N + L + + G+ + N
Sbjct: 1377 EMYKEGVDIFKELIQNADDAGASEIRFLIDHRQNTDVQLKLL--DPGMKHCHGPALWSYN 1434
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
+ FS+E+IR +CD+ ++KK IGR G+GF SV+ VTD P + S + FD
Sbjct: 1435 DAMFSSEDIRNICDIAAASKK-DKLDKIGRFGLGFTSVYHVTDVPSVVSGSHIMIFD 1490
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 94/219 (42%), Gaps = 38/219 (17%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y +S EL+QNADD IY T + +Q+S + + N Q FS
Sbjct: 2405 YPGESEIFKELLQNADDAKATELHIIYDPRSHSTDKIVSESWKPIQQSPAICVYNNQIFS 2464
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF-------HVKF-- 1300
++I + VG K + A G+ GIGF SV+ +TD P S+ H+K+
Sbjct: 2465 DDDIAGVQQVGIGAK-SADAHTTGQYGIGFNSVYHLTDCPSFLSDDHQLGVFDPHLKYVP 2523
Query: 1301 DTSEGQIGFVLPTLVPPFNI--DMFCRLLSKDPVQLESKCWNTCIRLPFR-TKFSEGIA- 1356
D + G L P NI + F +LS + S T RLP R +S I+
Sbjct: 2524 DATHDYPGI----LYKPVNILKENFGDMLSGFLGEYFSLKGGTMFRLPLRQNAYSSKISS 2579
Query: 1357 -------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLN 1388
+ ++++ F + LLFL+ ++ + +LN
Sbjct: 2580 KTYSDKDIRHMLNRFENEACDCLLFLNNVEVVTLSEILN 2618
>gi|260809550|ref|XP_002599568.1| hypothetical protein BRAFLDRAFT_77670 [Branchiostoma floridae]
gi|229284848|gb|EEN55580.1| hypothetical protein BRAFLDRAFT_77670 [Branchiostoma floridae]
Length = 4931
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/189 (29%), Positives = 84/189 (44%), Gaps = 37/189 (19%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTF--------ILQESGIVVLN-------NEQG 1247
Y + S EL+QNADD E V+ + + L SG+ N +
Sbjct: 1350 YKEGSGIFYELIQNADDAGATE-VKFAVDWRQNSGSRKSLFSSGMAAWQGPALWAYNNKV 1408
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
F+ E+I+ + ++ +TKK ++ IGR G+GF SV+ +TD P S + V FD + +
Sbjct: 1409 FTDEDIQNINEISGATKKRDTSK-IGRFGLGFNSVYHITDMPSFISRHYFVCFDPHKTHL 1467
Query: 1308 GFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTC-IRLPFRTKFS--EGIAMNNIVDMF 1364
G P N DP + WN IR +R +F+ GI NI D
Sbjct: 1468 G-------PVVN--------DNDPAEGVQVEWNRDEIRNIYRDQFAPFNGIFDCNIFD-- 1510
Query: 1365 SDLHPSLLL 1373
SD +P+ L
Sbjct: 1511 SDTYPATLF 1519
Score = 43.5 bits (101), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 29/109 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVL 1242
E YS EL+QNADD I+ + +P LQ ++V
Sbjct: 2323 ESYSTGMDIFKELLQNADDAGATEVKFIYDTRGHSDERIFGIDWKP-----LQGPALLVY 2377
Query: 1243 NNEQGFSAENIRALCDVG--NSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
NN + FS +I A+ VG N +KGS+ IG+ G+GF +V+ +TD P
Sbjct: 2378 NN-KTFSQSDIEAIQKVGKGNKQEKGST---IGQFGVGFNAVYHLTDCP 2422
>gi|392843837|gb|AFM84522.1| sacsin, partial [Carassius auratus]
Length = 736
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ PT +Q + V NN Q F+
Sbjct: 417 YPSEKEMLKELLQNADDAKATEIYFVFDPRSHPTERIFDDKWVPMQGPSLCVYNN-QPFT 475
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++IR + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 476 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519
>gi|329938003|ref|ZP_08287485.1| hypothetical protein SGM_2977 [Streptomyces griseoaurantiacus M045]
gi|329302960|gb|EGG46849.1| hypothetical protein SGM_2977 [Streptomyces griseoaurantiacus M045]
Length = 1589
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 65/156 (41%), Gaps = 20/156 (12%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
L ELVQNA D I E + +L ES + N + E + + S K+G
Sbjct: 50 LFELVQNAADEIANEP-GGRVHVVLTESRLYCANEGSPVTPEGAETILRMSMSKKRG--- 105
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--SEGQIG------------FVLPTLV 1315
G IGR G+G KSV VTDAPE S FD S QI F P L
Sbjct: 106 GQIGRFGVGVKSVLVVTDAPEFFSRTGSFGFDRAWSYQQIKAVPGVTERYGPEFDAPVLR 165
Query: 1316 PPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTK 1350
+DM + DPV + W T +RLP K
Sbjct: 166 MARPLDMRAE-RAADPVLDKLLTWATTVVRLPLLKK 200
>gi|326402571|ref|YP_004282652.1| hypothetical protein ACMV_04230 [Acidiphilium multivorum AIU301]
gi|325049432|dbj|BAJ79770.1| hypothetical protein ACMV_04230 [Acidiphilium multivorum AIU301]
Length = 1764
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 91/215 (42%), Gaps = 29/215 (13%)
Query: 1208 HFLLELVQNADDN---IYPENVEPTLTFILQESG-IVVLNNEQGFSAENIRALCDVGNST 1263
L EL+QNA D P + LT + G ++V N + F+ N+ AL ++G S
Sbjct: 64 RVLFELIQNAHDAHPPGEPGEIAVRLTIEADDRGELLVANRGRPFTTSNLNALRNIGTSD 123
Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG---------QIGFVLPTL 1314
K+ IG KG+GF+SV +TD ++S + EG +I L L
Sbjct: 124 KQIGEG--IGNKGLGFRSVEALTDDVRVYSAAVAGPAERFEGYCFRFARLDEIAAGLAGL 181
Query: 1315 -------------VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
+P + I + S+ ++L + T I LP R+ + +A V
Sbjct: 182 GASSDMAESVAGNIPRYLIPVAVPDQSEGIMELARGGYATVIALPLRSAHTVALARAQ-V 240
Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
+ D +LLFL R+ + +++V R+
Sbjct: 241 EAVIDAGAPVLLFLDRIGSLEVSVRTVSTIIVERR 275
>gi|219666700|ref|YP_002457135.1| superfamily I DNA/RNA helicase [Desulfitobacterium hafniense DCB-2]
gi|219536960|gb|ACL18699.1| superfamily I DNA/RNA helicase [Desulfitobacterium hafniense DCB-2]
Length = 1584
Score = 53.1 bits (126), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 129/337 (38%), Gaps = 77/337 (22%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN-EQGFSAEN------- 1252
E YS +HF+ EL+QNADD + F L ++ ++ +N + FS N
Sbjct: 37 EKYSDQAHFIYELLQNADDA-----GATSARFKLCDNELIFAHNGTRRFSISNPETEGPD 91
Query: 1253 --------IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
I A+ + NS K +S IG+ G+GFK+VF+ T P I+ F K +
Sbjct: 92 TCNGKLGDINAITSIANSNKTAAS---IGKFGVGFKAVFQYTQTPSIYDPEFRFKIER-- 146
Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGIAMNNIVDM 1363
F++P + ++ + P + T PF A NI D
Sbjct: 147 ----FIVP-------VKIYNDFTGRRPEE-------TLFVFPFDHADRRHEEAYENISDK 188
Query: 1364 FSDL-HPSLLLFLHRLQCIMFRNMLNDSLVV--IRKKIVGDGIIKVSCG----------- 1409
L +P +LFL L+ I F L + ++I D +
Sbjct: 189 LRSLVYP--VLFLTNLREISFECSKKFGLYTKSVDEEIAFDNTVARLITLTQNDGDENNP 246
Query: 1410 -EDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-T 1467
E+K+ F R D ++ L +NEG Y F F P +
Sbjct: 247 IENKLWLFT----------RMDEHDFTYSVGFLL--NNEGKLRGSNYS--AFCFFPTKEE 292
Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPAL 1504
GL FI+ F+L SRE + N+ +++ AL
Sbjct: 293 TGLNFIVHAPFLLTDSREGIKAGEEHNKDMINLLAAL 329
>gi|302794218|ref|XP_002978873.1| hypothetical protein SELMODRAFT_152901 [Selaginella moellendorffii]
gi|300153191|gb|EFJ19830.1| hypothetical protein SELMODRAFT_152901 [Selaginella moellendorffii]
Length = 4493
Score = 53.1 bits (126), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 71/162 (43%), Gaps = 22/162 (13%)
Query: 1203 YSQDSHFLLELVQNADD--------------NIYPENVEPTLTFILQESGIVVLN-NEQG 1247
Y + + L E++QNADD + V P L Q G +L N
Sbjct: 21 YPEGTTILKEMIQNADDAGASQVCFCVDRRRHAVDSLVSPPLA---QWQGPALLAYNNAA 77
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
F+ E+ +++ +G+S K+G A GR GIGF SV+ ++D P S G HV F + Q
Sbjct: 78 FTEEDFQSISRIGDSKKRGQ-AWKTGRFGIGFNSVYHLSDLPSFVS-GSHVVFFDPQCQF 135
Query: 1308 GFVLPTLVPPFNIDMFCRL-LSKDPVQLESKCWNTC-IRLPF 1347
+ +L P ID L P Q C C +R PF
Sbjct: 136 LPSISSLNPGKRIDFVTSTALENFPDQFSPYCAFGCDMRRPF 177
Score = 52.4 bits (124), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 89/220 (40%), Gaps = 46/220 (20%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI-----------------VVLN 1243
E+Y+ L EL+QNADD E ++F+L +S +
Sbjct: 1363 EMYADGPGTLFELLQNADDAGATE-----VSFLLDKSSFGTNSVLSRQMADWQGPALYCY 1417
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
N+ F E+ A+ +G ++K + IGR G+GF SV+ TD P S V D
Sbjct: 1418 NDSVFKPEDFYAISRIGQNSKLDKPSA-IGRFGLGFNSVYHFTDVPSFVSGSTIVLLDPH 1476
Query: 1304 EGQIGFVLPTLVPPFNIDMFCR-LLSKDPVQLES----KC------WNTCIRLPFRTKFS 1352
+ + P P I+ + LL++ P Q ES C T R P RT +
Sbjct: 1477 ADNLPGISPA-QPGLKINFVGKELLNQFPDQFESFLLFGCDLQKPFLGTLFRFPLRTTVT 1535
Query: 1353 EGIAM--------NNIVDMFSDLHPS---LLLFLHRLQCI 1381
+ ++V +FS L S +LLFL + I
Sbjct: 1536 AARSEIKQDPYQPEDVVTLFSALKKSAGEVLLFLRHVTSI 1575
>gi|326674209|ref|XP_003200093.1| PREDICTED: sacsin-like [Danio rerio]
Length = 4580
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 15/106 (14%)
Query: 1203 YSQDSHFLLELVQNADD----NIY----PENVEPTLTF-----ILQESGIVVLNNEQGFS 1249
Y + L EL+QNADD IY P F LQ + V NN Q F+
Sbjct: 2543 YPSEKEMLKELLQNADDAKATEIYFVFDPRTHSTDRIFDDKWTPLQGPSLCVYNN-QPFT 2601
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
++IR + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 2602 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISNN 2646
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 66/238 (27%), Positives = 99/238 (41%), Gaps = 60/238 (25%)
Query: 1201 ELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESGIVVLN-------N 1244
E Y + EL+QNADD E N+E + L + G+++ + N
Sbjct: 1472 EEYPSVADIFKELLQNADDANATECSFLIDMRKNLE--IRENLLDPGMIICHGPALWSFN 1529
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
FS + + +G S K+ A +G+ G+GF SV+ VTD P I S F + FD +
Sbjct: 1530 NSVFSDTDFLNITRLGGSMKR-CEADKVGKFGLGFNSVYHVTDIPIIMSREFMIMFDPNI 1588
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
IG P PF I++F C+L P+ LE+ K
Sbjct: 1589 NHIGKHIRDKSNPGIKINWSKQQKRLRKFPNQFRPF-INVFNCQL----PLSLEAPYKYD 1643
Query: 1340 NTCIRLPFRTKFS------EGIAMN-----NIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
T RLPFRT+ GI N ++VD FS L+LF + ++ + +
Sbjct: 1644 GTLFRLPFRTEQEASASEISGIYYNTTDIYSLVDEFSICGHRLILFTQHVGTMVLKYL 1701
Score = 42.4 bits (98), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 13/103 (12%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPEN---VEPTLTF-ILQESGIVV-LNNEQGFSA 1250
Y + L EL+QNA+D +Y E VE + + Q G + + N+ F+
Sbjct: 111 YPEGGQILKELIQNAEDAGATEVKFLYDETEYGVESLWSHDMAQYQGTALYVYNDAVFTT 170
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S K+ +GR GIGF SV+ +TD P I S
Sbjct: 171 EDWNGIQEIARSRKREDPLK-VGRFGIGFNSVYHITDVPSIFS 212
>gi|301061109|ref|ZP_07201903.1| conserved hypothetical protein [delta proteobacterium NaphS2]
gi|300444778|gb|EFK08749.1| conserved hypothetical protein [delta proteobacterium NaphS2]
Length = 509
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 82/166 (49%), Gaps = 18/166 (10%)
Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGI--IKVSCGEDKMTWFVASQKL- 1422
++ P +LFL +L+ I ++ VI+ I + G D++ F+ +++
Sbjct: 13 EIEPETMLFLSKLKEIKIVTGHSEGFTVIKDDTSEPNIQLLTNKNGHDEIFEFLLFKRVF 72
Query: 1423 -RAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQG 1476
+ I R +V+ E+++A + + + G L FA+LP+R G FI+
Sbjct: 73 NKPEEINHEKRNNVEEREVSIAFPVGD-KPNSAGKL------FAYLPVRYDTGFPFIINA 125
Query: 1477 DFVLPSSREEVDGNSPWN-QWLLSEFPALFVSAERSFCDLPCFREN 1521
DF+LPSSRE++ ++PWN +WL+ L S+ LP +EN
Sbjct: 126 DFILPSSREDIQ-DTPWNREWLMPCVGELVYSSLPILKSLPILKEN 170
>gi|115665280|ref|XP_001200196.1| PREDICTED: sacsin-like [Strongylocentrotus purpuratus]
Length = 3846
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 63/137 (45%), Gaps = 26/137 (18%)
Query: 1175 NISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVE 1227
+ S E +K R+ R L + Y D L EL+QNADD +Y
Sbjct: 1867 DFSGEEFGQHEKLTTRIKRIL-----DSYPSDETILKELLQNADDAGATEIHFVYDPRSH 1921
Query: 1228 PTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY---IGRKGIG 1278
+ LQ + V NN Q F+ E+IR + ++G KGS A + IGR G+G
Sbjct: 1922 SENKLLADSMKALQGPALCVFNN-QSFTDEDIRGIQNLG----KGSKADHMEKIGRYGVG 1976
Query: 1279 FKSVFRVTDAPEIHSNG 1295
F +V+++TD P S G
Sbjct: 1977 FNTVYQLTDCPSFISKG 1993
Score = 46.6 bits (109), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
F+ ++ + + +G TK+ A IG+ G GF SV+R+TD P + S + FD +
Sbjct: 4 FTDQDFKNILRLGGRTKE-KDAEKIGKFGTGFNSVYRITDVPSLVSRDYMQFFDPHTTHL 62
Query: 1308 GFVLPTLVPP 1317
G VLP P
Sbjct: 63 GNVLPNKSQP 72
Score = 46.2 bits (108), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
N+ F+ ++ + +G TK+ A IG+ G GF SV+R+TD P + S + FD
Sbjct: 955 NDAVFTDQDFENILCLGGRTKE-KDAEKIGKFGTGFNSVYRITDVPSLVSRNYMQCFDPH 1013
Query: 1304 EGQIGFVLPTLVPP 1317
+G VLP P
Sbjct: 1014 TTHLGNVLPNKSQP 1027
>gi|260907582|ref|ZP_05915904.1| hypothetical protein BlinB_19757 [Brevibacterium linens BL2]
Length = 1609
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/223 (24%), Positives = 89/223 (39%), Gaps = 27/223 (12%)
Query: 1212 ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY 1271
ELVQNA D + + + E + V N+ + F + I+ L STK+
Sbjct: 62 ELVQNAADAVREGGSR--IHVVATEKALYVANDGKAFDSNGIKTLLHSNMSTKRDEQ--- 116
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL--------PTLVPPFNIDMF 1323
IGR G+GFKSV + P+I S+ FD L P L PF ++
Sbjct: 117 IGRFGLGFKSVLEICPQPQIFSSTVSFGFDAERSVQALSLINRGLTRYPVLRLPFLLNES 176
Query: 1324 CRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM 1382
S ++ E + W +T +RLP S + D + P LLF + +
Sbjct: 177 SESSSDSILR-ELRSWASTVVRLPLAQPLSW------LKDSVREFRPEFLLFSPTVSSLT 229
Query: 1383 FRNMLNDSLVVI----RKKI--VGDGIIKVSCGEDKMTWFVAS 1419
+ + R K+ VG +++S + + W V++
Sbjct: 230 LEVRGDGESRTVWTSERTKLSTVGVEAVRLSSADSNIDWVVST 272
>gi|379010681|ref|YP_005268493.1| DNA/RNA helicase superfamily I [Acetobacterium woodii DSM 1030]
gi|375301470|gb|AFA47604.1| DNA/RNA helicase superfamily I [Acetobacterium woodii DSM 1030]
Length = 1587
Score = 52.4 bits (124), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 79/334 (23%), Positives = 130/334 (38%), Gaps = 64/334 (19%)
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN-EQGFSAE 1251
R + E YS +HF+ EL+QNADD ++ F L + ++ +N + FS
Sbjct: 29 RGVKTSVVEKYSDQAHFIYELLQNADDA-----GATSVRFELHKDELIFAHNGTRRFSIS 83
Query: 1252 N---------------IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
N I A+ + NS K +S IG+ G+GFK+VF+ T P ++ F
Sbjct: 84 NPDTEEADSDNGKLGDINAITSIANSNKTMAS---IGKFGVGFKAVFQYTQTPSVYDPDF 140
Query: 1297 HVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGI 1355
+ + F++P + D K T PF S+
Sbjct: 141 QFRIER------FIVPVKI--------------DNDYASRKSGETLFVFPFDHNDRSQEE 180
Query: 1356 AMNNIVDMFSDL-HPSLLLFLHRLQCIMFRNMLNDSLVV--IRKKIVGDGI------IKV 1406
A +I + L +P +LFL L I + ++ + +KI+ + +
Sbjct: 181 AYEDISHKLNTLVYP--ILFLTNLNEICYESLEQSGKYTRSVCEKIIFENTEACLFDMSQ 238
Query: 1407 SCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR 1466
C D+ V KL R D + ++ L + EG F F P +
Sbjct: 239 ICENDERK--VIGDKLWL-FSRTDEQGLRYSVGYELNK--EGKLTA--SHHAAFCFFPTK 291
Query: 1467 -TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
GL F++Q F+L SRE + NQ +++
Sbjct: 292 ENTGLNFMIQAPFLLTDSREGIKAGEKHNQDMIA 325
>gi|340375792|ref|XP_003386418.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 4104
Score = 52.4 bits (124), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 147/369 (39%), Gaps = 61/369 (16%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH + L + +E Y + E++QNADD +Y T IL+
Sbjct: 1372 QHEPITTRLRNILKE-YKDGLTIIKEMIQNADDAGATEVNILYDNRTHSTQNLILKGMAD 1430
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ GIGF SV+ +TD P S
Sbjct: 1431 SHGPALIVHNNGTFTKEDFENITKLAGATKANQPLK-IGKFGIGFCSVYHITDVPSFVSG 1489
Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMF--CRLLS--KDPVQLE--------SKCWNTC 1342
+ FD + + V+ P + ++ S +D V E S T
Sbjct: 1490 EWLYIFDPTLQFLKGVVHDQSKPGKKIKYQQSKIFSNCQDLVPYEGLFGFSSLSDYEGTM 1549
Query: 1343 IRLPFRT---KFSEGIAMNNIV-DMFSDL---HPSLLLFLHRLQCIMFRNML-NDSLVVI 1394
R PFRT + S + ++V D+ DL LLLFL ++ I F ++ N+ ++ +
Sbjct: 1550 FRFPFRTSASQISSTVYNEHLVEDIKRDLIECGSKLLLFLQHVKRITFSSLRGNEKIIEV 1609
Query: 1395 RKKI---VGDGIIKVSC-----GEDKMTWFVASQKLR---AGVIRPDVKTTEIALALTLQ 1443
+ D I +V G W +ASQ+ + I P V + +A L
Sbjct: 1610 SIDVSSSSNDDIHEVVINSPHNGSGTEHWLIASQEEQWRHQNNIEPAVAS----VACQLD 1665
Query: 1444 ESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------DGNS 1491
N Y + F FLPL + GL + +F + S+R + D
Sbjct: 1666 RINSNCYKCKEIEGNAFCFLPLALPSTGLPVHVSANFAVMSNRSGIWTEVLSGTVTDSKE 1725
Query: 1492 PWNQWLLSE 1500
WN+ L+ +
Sbjct: 1726 RWNKNLMEK 1734
Score = 42.0 bits (97), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 17/123 (13%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADD------------NIYPENVEPTLTF-- 1232
Q L R + + QE Y D+ FL EL+QNADD + + P+ +
Sbjct: 2323 QKEELTRRIRNILQE-YPLDATFLKELLQNADDAKATKIRVILDKRQHKKEKVPSEEWGK 2381
Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
LQ ++V N+++ FS ++ + +G +K+ IG+ GIGF V+ +TD P
Sbjct: 2382 ELQGPALLVWNDKE-FSDADLEGIQKLGLGSKRDDDES-IGQFGIGFNVVYHLTDCPSFI 2439
Query: 1293 SNG 1295
+ G
Sbjct: 2440 TRG 2442
>gi|392843833|gb|AFM84520.1| sacsin, partial [Monopterus albus]
Length = 736
Score = 52.4 bits (124), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P +Q + V NN Q F+
Sbjct: 417 YPSEKEMLKELLQNADDAKATEIYFVFDPRTHPIDRIFDDKWIPMQGPALCVFNN-QPFT 475
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++IR + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 476 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519
>gi|47190447|emb|CAF92480.1| unnamed protein product [Tetraodon nigroviridis]
Length = 145
Score = 52.4 bits (124), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 2/115 (1%)
Query: 156 GESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITS 215
GE V+ KV V ++Q+ S + G +L L +N FI C V I +
Sbjct: 8 GELVTKEKVISRVCRQMQIPSLEAGGIYPGMTSALKDLQYKIREVNMFIECAESVSSICT 67
Query: 216 LYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDV--DVVFKIATNDVI 268
LY++ ++ +G + +EEL LGP + P I F + S+ D + +I T D++
Sbjct: 68 LYEMGCSLAGLKGKKHYEELNLGPLCKLPRIHRIFKIDSNTKDDDIQQIETVDIL 122
>gi|340369659|ref|XP_003383365.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 4979
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 138/346 (39%), Gaps = 54/346 (15%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + ++ Y + E++QNADD +Y + I++
Sbjct: 1453 QHESLTTRLRNILKD-YKDGLTIIKEMIQNADDAGATELNILYDKRTHSKEKLIIKGMAD 1511
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N + FS ++ + + +TK IG+ GIGF SV+ +TD P S
Sbjct: 1512 SHGPALIVHNNRAFSKDDFVNITKLAGATKADKPLK-IGKFGIGFCSVYHITDVPSFVSG 1570
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKDPVQLES-----------KCWNTC 1342
+ FD + + G V P ++ + L++ QL K T
Sbjct: 1571 EWLYIFDPTLKYLKGAVCNDSSPGKKMEYRSKFLARSD-QLAPYHGLFGFDSSLKYNGTI 1629
Query: 1343 IRLPFRTKFSEGIA-------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNML--NDSLVV 1393
RLPFRT S+ + + I D D LLLFL ++ I F + +SL
Sbjct: 1630 FRLPFRTNPSQISSTLYSDELVQQIKDDLQDNASKLLLFLQNVKSITFSTVQENGESLCE 1689
Query: 1394 IRKK---IVGDGIIK-------VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
I K I GI K + C E W ++S K + ++ +A L
Sbjct: 1690 ISIKCNAIESYGIEKRVITVTSLKCKEKIEHWLMSSVKEN---LEGELAVASVACELL-- 1744
Query: 1444 ESNEGNYGPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEV 1487
+ + Y + VF FLPL + GL + +F + S+R +
Sbjct: 1745 -NTDNVYQCQSIKGNVFCFLPLAVSSTGLPVHVSANFAVMSNRSGI 1789
>gi|340377463|ref|XP_003387249.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 4220
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 84/355 (23%), Positives = 140/355 (39%), Gaps = 70/355 (19%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG- 1238
QH L L + +E Y + E++QNADD +Y + I++ G
Sbjct: 1457 QHEPLTTRLGNILRE-YKDGLTIIKEMIQNADDAGATEVNIMYDKRTHSAKKLIIKGIGD 1515
Query: 1239 ----IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ G+GF SV+ +TD P S
Sbjct: 1516 SHGPALIVHNNSTFTEEDFENITKLAAATKANQPLK-IGKFGVGFCSVYHITDVPSFISG 1574
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLS--------KDPVQLES--KCWNTCI 1343
+ FD + + G V P I+ + L+ +D S K T
Sbjct: 1575 KWLYIFDPTLKYLKGVVHNQSSPGKKIEYRSKFLAHSDQLAPYQDLFGFRSSIKYNGTIF 1634
Query: 1344 RLPFRTKFSEGIA-------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVV--- 1393
RLPFRT S+ + + I D D LLLFL ++ I F + ++ +
Sbjct: 1635 RLPFRTNPSQISSTLYDDELVQQIKDDLEDNASKLLLFLQNVKSITFSTVQDNGDRLHEI 1694
Query: 1394 -----------IRKKIVGDGIIKVSCGEDKMTWFVA--SQKLRAGVIRPDVKTTEIALAL 1440
I KK++ + ++ E+ W V+ ++ L D+ +A L
Sbjct: 1695 SIKCNAIESYGIEKKVI--TVTSLNYEEEIEHWLVSTVTENLEG---YNDLAVASVACGL 1749
Query: 1441 TLQ------ESNEGNYGPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEV 1487
+S EGN VF FLPL + GL + +F + S+R +
Sbjct: 1750 RKSDNIYHCQSIEGN---------VFCFLPLAVSSTGLPVHVSANFAVMSNRSGI 1795
>gi|340368021|ref|XP_003382551.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 4293
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 80/342 (23%), Positives = 132/342 (38%), Gaps = 67/342 (19%)
Query: 1212 ELVQNADDN-------IYPENVEPTLTFILQ-----ESGIVVLNNEQGFSAENIRALCDV 1259
E++QNADD +Y + T + + ++++N F E+ + +
Sbjct: 1361 EMIQNADDAEATEVNILYDDRTHSTEKLLFKGMAGSHGPALIVHNNSVFKKEDFENITKL 1420
Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL-------- 1311
+TKK IG+ G+GF SV+ +TD P S + FD + + V+
Sbjct: 1421 AGATKKDQPLK-IGKFGVGFCSVYHITDVPSFVSGEWFYIFDPTLKHLKGVVHNESRPGK 1479
Query: 1312 ------------PTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR---TKFSEGIA 1356
+ P NI F + D T RLPFR ++ S I
Sbjct: 1480 KVKYQSKFMAQSQQMAPYENIFEFSSSANYD---------GTIFRLPFRENASQISSTIY 1530
Query: 1357 MNNIVDMFS-DLHPS---LLLFLHRLQCIMFRNMLNDSLVV---IRKKIVGDGIIKVSC- 1408
+V DL + LL+FL ++ I FR++ D LV+ I +G+ K +
Sbjct: 1531 NEYLVQKLKEDLKENGSKLLIFLQNVKRITFRSLDRDELVIDISITCSNEANGVKKCTTI 1590
Query: 1409 --GEDKMT-WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
E ++ W VA QK + + D IA + Y + +F FLPL
Sbjct: 1591 CFNESEIEYWLVAHQK--GNLKQDDDDKPGIASVACQLIKKDLCYKSQEIKGNLFCFLPL 1648
Query: 1466 --RTYGLKFILQGDFVLPSSREEV-------DGNSPWNQWLL 1498
+ GL + +F + ++R + D WN L+
Sbjct: 1649 SSSSTGLPVHVNANFAVLNNRTGIFIEETPSDSRELWNMQLM 1690
>gi|405966416|gb|EKC31703.1| Sacsin [Crassostrea gigas]
Length = 2134
Score = 52.0 bits (123), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 109/276 (39%), Gaps = 59/276 (21%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN- 1220
V S+ R E G I ++E +Q + R + L E YS D E++QNADD
Sbjct: 485 VPSLIRREMENG-GIEDLEYEQWGQQESLTTRIKNIL--EDYS-DGPLFKEMLQNADDAG 540
Query: 1221 ------IYPENVEPTLTFILQESGI-------VVLNNEQGFSAENIRALCDVGNSTKKGS 1267
+Y E L + G+ + N+ F ++ R + +G TK G+
Sbjct: 541 ATCMKILYDERRNEDARTGLIDKGMEECQGPALWFYNDAKFEEKDFRNIIKLGGGTK-GT 599
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL-----PTL-VP----- 1316
IG+ G+GF SV+ VTD P I S + V FD +G + P L +P
Sbjct: 600 EHTKIGKFGLGFCSVYSVTDVPSIVSGKYIVFFDPRTIHLGKAIKDKSKPGLRIPISKIV 659
Query: 1317 --------PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS-----EGIAMN----- 1358
P++ C +D + T RLP RT GI+
Sbjct: 660 KKFRKQFKPYDKIFHCDFSEEDEYE------GTLFRLPLRTANQASADPNGISTKQYNRE 713
Query: 1359 ---NIVDMFSDLHPSLLLFLHRLQCI--MFRNMLND 1389
+IV F D +LLLF ++ I F N ND
Sbjct: 714 EVLSIVKKFMDSAGNLLLFTQNVKKIELHFLNQTND 749
Score = 46.6 bits (109), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 51/116 (43%), Gaps = 15/116 (12%)
Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQG 1247
E Y D L ELVQNADD +Y + LQ +VV NN
Sbjct: 1502 EDYPCDVSILNELVQNADDAQATEIHFVYDKRRHSEKELFDDSMKCLQGPALVVYNN-SC 1560
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
FS+E+I + +G +K S G+ GIGF +V+ +TD P + G D S
Sbjct: 1561 FSSEDINGISKLGEGSK-SSDPHQTGQFGIGFNAVYHITDVPSFLTKGTKTPNDGS 1615
>gi|159486690|ref|XP_001701371.1| hypothetical protein CHLREDRAFT_194490 [Chlamydomonas reinhardtii]
gi|158271766|gb|EDO97579.1| predicted protein [Chlamydomonas reinhardtii]
Length = 785
Score = 52.0 bits (123), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 33/62 (53%)
Query: 191 HKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYF 250
+L + R+ A + + R + +L++LE +C SEG+E F EL LGP P++ F
Sbjct: 278 QRLRQLDRRVGATMEAYCAARCVVTLWELERHVCASEGVEAFGELRLGPLTAHPVVRRTF 337
Query: 251 LV 252
Sbjct: 338 FA 339
>gi|260822114|ref|XP_002606448.1| hypothetical protein BRAFLDRAFT_118535 [Branchiostoma floridae]
gi|229291789|gb|EEN62458.1| hypothetical protein BRAFLDRAFT_118535 [Branchiostoma floridae]
Length = 4895
Score = 52.0 bits (123), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 92/388 (23%), Positives = 150/388 (38%), Gaps = 74/388 (19%)
Query: 1003 AVPVASRF--FLDCLGY---LPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLM 1057
AV ++S F LD Y LP EFR F +G++ ++ ++ L Q+ +L
Sbjct: 1605 AVTLSSPFTPILDLAPYRFNLPEEFRDFKQMFRAAGVKESLDESDLIQTLHEIQSTPKLS 1664
Query: 1058 L-HEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESV 1116
++ LSL I++WI + D+L+P +S L C E
Sbjct: 1665 TERDLKLSLSIIDWIVSHPQILEEVRDDILVPIAT----KDSSRLTLAPVC------ECT 1714
Query: 1117 GADVHIEECGAICDTICGEASDDGLGDCTTQTLPEDKE--CEDAALIVESIRRDEFGLGP 1174
DV + A+ D +D + +P D E +L + +EFG
Sbjct: 1715 YNDVSWSDAQALLD------DEDEEFKPLHKLVPLDTAQLLEVPSLSQRVVNAEEFGFDQ 1768
Query: 1175 NISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFIL 1234
QH ++ + L + + Y + + EL+QNADD E + F+L
Sbjct: 1769 C----------GQHEKVTQRLKNILND-YPEGAGIFKELIQNADDAKATE-----VKFLL 1812
Query: 1235 Q------------ESGIVVLN-------NEQGFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
+ G+ V + N FS E+ + + ++G TK + +GR
Sbjct: 1813 DWRTNEDSREDLIDQGMAVCHGPALWAFNNATFSEEDFKNIQNLGGQTKL-EALEKVGRF 1871
Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPP---FNIDMFCRLLSKDPV 1332
G+GF SV+ VTD P S + FD + + P ++ RL+S+
Sbjct: 1872 GVGFNSVYHVTDVPSFVSGSRVLFFDPHATHLSKHIKDKSRPGIGLDLQKNTRLVSRFKD 1931
Query: 1333 QLE----------SKCWN-TCIRLPFRT 1349
Q + S +N T RLP RT
Sbjct: 1932 QFKPFQGVFGYSTSTSYNGTLFRLPLRT 1959
>gi|304320769|ref|YP_003854412.1| ATPase [Parvularcula bermudensis HTCC2503]
gi|303299671|gb|ADM09270.1| putative ATPase [Parvularcula bermudensis HTCC2503]
Length = 1418
Score = 52.0 bits (123), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 1175 NISNMESNMLKK--QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTF 1232
+I +NM ++ + L H L +++ S H + EL+QNADD E +
Sbjct: 10 SIREHSANMWEQLDGNPELAAPWHQLFKQVQSP-RHIISELLQNADDAHATE-----ASV 63
Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
+Q+ + +N + F+ E+ +LC G S K+ IG +GIGFKS F + + E++
Sbjct: 64 DIQDGSFLFSHNGEDFTEEHFASLCKFGYSNKRALHT--IGFRGIGFKSTFSLGERVELY 121
Query: 1293 SNGFHVKFD 1301
+ V FD
Sbjct: 122 TPTLSVGFD 130
>gi|395774006|ref|ZP_10454521.1| hypothetical protein Saci8_29739 [Streptomyces acidiscabies 84-104]
Length = 1579
Score = 52.0 bits (123), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 71/164 (43%), Gaps = 20/164 (12%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
L ELVQNA D I E T +L ++ + N + E + + S K+G
Sbjct: 43 LFELVQNAADEIADEP-GGTTHVVLTDTHLYCANEGTPVTPEGAETILRMSMSRKRG--- 98
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--SEGQIGFVLPTLVPPFNIDMFCRLL 1327
G IGR G+G KSV VTDAP+ S FD S QI V P + F+ + +L
Sbjct: 99 GQIGRFGVGVKSVLVVTDAPQFFSRTGCFGFDREWSHDQIRSV-PEVRARFDAEFESPVL 157
Query: 1328 ------------SKDPVQLESKCW-NTCIRLPFRTKFSEGIAMN 1358
+ DPV E W T +RLP + +E + +
Sbjct: 158 RMARPLDESAERAVDPVLDELLGWATTVVRLPLLSDAAERLGQD 201
>gi|340054935|emb|CCC49243.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 1705
Score = 52.0 bits (123), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/150 (22%), Positives = 67/150 (44%), Gaps = 13/150 (8%)
Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
L F ++++PSL + ++++ + V R + ++Y+L+ +C++ + + EL LG
Sbjct: 132 LPFALKQIPSLAGVAAMQSQLTQMVQGAVATRPVVTVYELQLEVCRAMNVASYAELGLGA 191
Query: 240 FLR-QPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSL 298
L P + H F K V + + D + L D I + D R+
Sbjct: 192 SLACMPFVQHAFGFKPGPAAVAPVTSADFMRFLLFDADAR----IVLTSGGDAGDAVRAF 247
Query: 299 ASK--------EQLGVRIQNLGMHINFIRE 320
A + QLG+ IQ+ ++F+R+
Sbjct: 248 AKRYEGGRYAPRQLGIHIQHFSWLLHFVRQ 277
>gi|392843841|gb|AFM84524.1| sacsin, partial [Misgurnus anguillicaudatus]
Length = 736
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADD----NIY----PENVEPTLTF-----ILQESGIVVLNNEQGFS 1249
Y + L EL+QNADD IY P F +Q + V NN Q F+
Sbjct: 417 YPSEKEMLKELLQNADDAKATEIYFVFDPRTHATDRIFDDKWMPMQGPSLCVYNN-QPFT 475
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++IR + ++G TK+ ++ G G+ GIGF SV+ +TD P SN
Sbjct: 476 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519
>gi|390348274|ref|XP_003726975.1| PREDICTED: sacsin-like [Strongylocentrotus purpuratus]
Length = 3428
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 15/106 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y D L EL+QNADD +Y + LQ + V NN Q F+
Sbjct: 1974 YQSDDTILKELLQNADDAGATEIHFVYDPRSHDGKKLLGDSMKGLQGPALCVFNN-QSFT 2032
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
E+IR + ++G +K G IGR G+GF +V+++TD S G
Sbjct: 2033 DEDIRGIQNLGEGSK-ADHMGKIGRYGVGFNAVYQLTDCSSFISKG 2077
Score = 47.8 bits (112), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
N+ F+ ++ + + +G TK+ A IG+ G GF SV+R+TD P + S + FD
Sbjct: 205 NDAVFTDQDFKNILRLGGRTKE-KDAEKIGKFGTGFNSVYRITDVPSLVSRDYMQFFDPH 263
Query: 1304 EGQIGFVLPTLVPP 1317
+G VLP P
Sbjct: 264 TTHLGNVLPNKSQP 277
Score = 45.4 bits (106), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 1/74 (1%)
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
N+ F+ ++ + +G TK+ A IG+ G GF SV+R+TD P S + FD
Sbjct: 1040 NDAVFTDQDFENILRLGGRTKE-KDAEKIGKFGTGFNSVYRITDVPSFVSMNYMQFFDPH 1098
Query: 1304 EGQIGFVLPTLVPP 1317
+G VLP P
Sbjct: 1099 TTHLGNVLPNKSQP 1112
>gi|392843817|gb|AFM84512.1| sacsin, partial [Alligator sinensis]
Length = 736
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRHHPVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNNILCIFD 527
>gi|392843867|gb|AFM84537.1| sacsin, partial [Naja atra]
Length = 736
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEVCFVFDPRQHPVDRIFDEKWSP-----LQGPALCVFNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|168026372|ref|XP_001765706.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683132|gb|EDQ69545.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 4642
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 25/173 (14%)
Query: 1201 ELYSQDSHFLLELVQNADD------NIYPENVEPTLTFILQES------GIVVLNNEQGF 1248
E+YS L ELVQNADD N + E + +L S + N+ F
Sbjct: 1387 EMYSDGPGILCELVQNADDAGAREVNFLLDRSEFGTSSVLSPSMADWQGPALYCQNDSVF 1446
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
++ ++ A+ +G +K + IGR G+GF SV+ TD P S V FD +
Sbjct: 1447 TSRDLYAISRIGQDSKLERPSA-IGRFGLGFNSVYHFTDIPGFISGSNLVMFDPHACNLP 1505
Query: 1309 FVLPTLVPPFNIDMFCR-LLSKDPVQ----------LESKCWNTCIRLPFRTK 1350
V P+ P I R LL + P Q L+ T R P RT+
Sbjct: 1506 GVTPSH-PGLKISFVGRGLLEQFPDQLRPYLIFGCDLQRSYPGTLFRFPLRTE 1557
>gi|298715228|emb|CBJ27900.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 191
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 56/112 (50%), Gaps = 13/112 (11%)
Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVE--PTLTFILQESGIVVLNNEQGFSA 1250
Y + + L ELVQNADD ++ E P+L + +++ N+ F+
Sbjct: 29 YPEGTSILKELVQNADDAGAREVRLCLDLRQHGTEGLPSLALAQFQGPALLVFNDGVFTD 88
Query: 1251 ENIRALCDVGNSTKKGSS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
+ R++ +G+S KK +S G GR GIGF SV+ +T+ P S G FD
Sbjct: 89 TDFRSIQRIGDSLKKDTSKGGKTGRFGIGFNSVYHLTELPAFVSAGKWCSFD 140
>gi|443695138|gb|ELT96105.1| hypothetical protein CAPTEDRAFT_195081 [Capitella teleta]
Length = 4612
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 112/470 (23%), Positives = 169/470 (35%), Gaps = 112/470 (23%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGIVVLN-------NEQGF 1248
Y DS ELVQNA+D +Y T+ L G+ N+ F
Sbjct: 34 YPDDSQIFKELVQNAEDAGAREVKLLYDRKTYGTMGRYLHHPGMAKYQGPALYAYNDATF 93
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF------HVKF-- 1300
+ ++ R + + S K+ +G G+GFKSVF TD P I S H +F
Sbjct: 94 NDDDWRGIRMLSQSVKQNDPLK-VGYFGMGFKSVFHFTDLPFIVSGDHAGVIEPHQEFFP 152
Query: 1301 ------DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS-- 1352
S+ + +LP PF+ C ++D + + T R P RTK S
Sbjct: 153 SSSHGWTLSKDNLLELLPHQFEPFSGIFTC---TRDTFK-DGYFPGTLFRFPLRTKPSAL 208
Query: 1353 EGIAMNN-----IVDMFSDLHPSLLLFLHRLQCI--------------MFRNMLN----D 1389
+ NN ++D F + LLLFL ++ + +FR L+ +
Sbjct: 209 SDVIYNNDKVQALLDAFQEDAHLLLLFLRHVESVEVHTKDAHTLTPKLLFRVELSEDCRE 268
Query: 1390 SLVVIRKKIVG-------DGIIKVS----------CGEDKM----TWFV--------ASQ 1420
+ R+ V DG + +S G D++ TW V S
Sbjct: 269 DVRQKRQDFVARATAPRRDGPVMMSYPVTVVCTCVTGRDQIQRSYTWLVTNYFVGGQTSA 328
Query: 1421 KLRAGVIRPDVKTTE-IALALTLQESNEGNYGPLLYQQPVFAFLPLRT-----YGLKFIL 1474
R + PD+ + A+ L +G + +F LPL + GL +
Sbjct: 329 TFRKLQMDPDLHYPPWVGAAMPLSSPEDGQAS---VRGHIFCALPLPSDQSMDTGLPVEI 385
Query: 1475 QGDFVLPSSREEV-------------DGNSPWNQWLLSE-FPALFVSAERSFCDLPCFRE 1520
G F L +R+ + D WNQ +L E P + S R+ L
Sbjct: 386 NGYFSLEQNRKHIKWPTSYRTREDMMDKRLLWNQCMLKEALPKAYASLLRAAIQLHRQGS 445
Query: 1521 NPA-KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQW 1569
PA S+Y +F L + S LP M S L + E N W
Sbjct: 446 LPALTLDSIYAAFPDLTKVDRKWDSILPTM-YSDLFKDPVVFTEANGGNW 494
Score = 46.6 bits (109), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 75/316 (23%), Positives = 122/316 (38%), Gaps = 83/316 (26%)
Query: 1203 YSQDSHFLLELVQNADDN------------------IYPENVEPTLTFILQESGIVVLNN 1244
Y D L E++QNADD+ ++ ++ +P LQ G+ V NN
Sbjct: 2446 YPCDKEILKEMLQNADDSGATEIQFIKDPRQHGRDRVFEDSWKP-----LQGPGLCVFNN 2500
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS +I + +G +K GS G+ G+GF V+ +TDAP F T
Sbjct: 2501 -RPFSESDIEGIQRLGEGSK-GSDPNKTGQYGVGFNCVYHLTDAP---------SFLTRS 2549
Query: 1305 GQIGFVLPTLVP---------------PFNIDMFCRLLSKD--PVQLESKCW---NTCIR 1344
IG L P ++ R + D P LE K T R
Sbjct: 2550 ADIGETLCVFDPHGAFVPGASSEEPGRRYDEVSELRGIFTDVFPCYLEDKFALDEATMFR 2609
Query: 1345 LPFR-------TKFSEGIAMNNIVD-MFSDLHPSLLLFLHRLQCIMFRNMLNDSLVV--- 1393
P R ++ SE + ++D +FS C++F N + DSL +
Sbjct: 2610 FPLRNEEMAAKSELSEQVIGLEVIDVLFSKFKAE------AFDCLLFLNNV-DSLTLCEV 2662
Query: 1394 --IRKKIVGDG--IIKVSCGEDKMTWFVASQKLRAGVIRP------DVKTTEIALALTLQ 1443
K++ G ++V GE++ F K AG ++ D+ E+ +L +
Sbjct: 2663 DSFTKRLTGTYKVTVQVERGEEEKKRFSEQVKKIAGQVKSGRLSVWDIPVHEVTYSLNIA 2722
Query: 1444 ESNEGNYGPLLYQQPV 1459
+ N+G + L Q +
Sbjct: 2723 D-NQGKWEKWLVSQQI 2737
Score = 44.7 bits (104), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 17/151 (11%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGI 1239
Q L L+ L QE Y+ ELVQNADD +Y E L + G+
Sbjct: 1380 QADSLTHRLNVLLQE-YTDGFAVPKELVQNADDAGATEVKFLYDERQNEDCMTCLIDEGM 1438
Query: 1240 -------VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
+ N FS E+ + + +TK+ + IGR G+GF +V+ +TD P
Sbjct: 1439 RECQGPALWAYNNAAFSDEDFENITKLSGATKENQT-DKIGRFGLGFNAVYNITDVPSFV 1497
Query: 1293 SNGFHVKFDTSEGQIGFVLPTLVPP-FNIDM 1322
S V FD +G + P IDM
Sbjct: 1498 SRHNIVIFDPHTTHLGKSIKNKAKPGIKIDM 1528
>gi|393244468|gb|EJD51980.1| hypothetical protein AURDEDRAFT_181547 [Auricularia delicata
TFB-10046 SS5]
Length = 2646
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 74/167 (44%), Gaps = 19/167 (11%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ-----ESGIVVLNNEQGFSA 1250
Y + L EL+QN+DD I E PT + ++ + ++ N+ F
Sbjct: 21 YPASNAILRELIQNSDDAGSRRQIFILDERQHPTRSLLVPGLACCQGPALLAVNDSIFRP 80
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQI 1307
E+ A+ + +S+K + IG+ G+GF+S + VTD P+I S+ V FD T
Sbjct: 81 EDWEAIKTIHHSSKADDNT-QIGKYGLGFRSSYHVTDFPQILSSKTLVVFDPHMTVLNDG 139
Query: 1308 GFVLPTLVPP--FNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
G T +P ++D S P E T IRLP RT S
Sbjct: 140 GRRHSTFLPAESTHVDQLAAFASVVPADCEDYN-GTAIRLPLRTNSS 185
>gi|392843823|gb|AFM84515.1| sacsin, partial [Crocodylus siamensis]
Length = 736
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRHHPVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|392843869|gb|AFM84538.1| sacsin, partial [Pelodiscus sinensis]
Length = 736
Score = 51.6 bits (122), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDSRQHPVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|359146086|ref|ZP_09179729.1| hypothetical protein StrS4_09674 [Streptomyces sp. S4]
Length = 1601
Score = 51.2 bits (121), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 67/157 (42%), Gaps = 26/157 (16%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
LLELVQNA D I L +L E + N + E + + S K+G
Sbjct: 56 LLELVQNAADEIA-NAPGGKLHVVLTEDHLYCANEGTPVTPEGAETILRMSVSRKRG--- 111
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSN----GFHVKFDTSEGQIGFVL------------PT 1313
G IGR G+G KSV V+D+P+ S+ GF D S +I L P
Sbjct: 112 GQIGRFGVGVKSVLSVSDSPQFFSDHGAFGFGFDRDWSAAEIAEALGLDGAADLSMETPV 171
Query: 1314 L--VPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPF 1347
L P ++D + S DPV + W T +RLP
Sbjct: 172 LRMARPLDVD---QERSDDPVLDKLLTWATTVVRLPL 205
>gi|392843819|gb|AFM84513.1| sacsin, partial [Ichthyophis bannanicus]
Length = 736
Score = 51.2 bits (121), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 52/110 (47%), Gaps = 25/110 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEILKELLQNADDAKATEICFVFGPRHHPLDRIFDERWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P S+
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFMSS 519
>gi|392843849|gb|AFM84528.1| sacsin, partial [Carettochelys insculpta]
Length = 736
Score = 51.2 bits (121), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|392843855|gb|AFM84531.1| sacsin, partial [Trachemys scripta]
Length = 736
Score = 51.2 bits (121), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDSRQHPVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|456384943|gb|EMF50521.1| hypothetical protein SBD_8085 [Streptomyces bottropensis ATCC 25435]
Length = 1614
Score = 51.2 bits (121), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 73/185 (39%), Gaps = 27/185 (14%)
Query: 1194 ALHCLSQELYSQDSHF---LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
A H S+ +Q + L ELVQNA D I E + +L + + N +
Sbjct: 58 AEHANSERRITQGGYGDRQLFELVQNAADEIA-EEPGGKVHVVLTGTHLYCANEGNPVTP 116
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD--------- 1301
E + + S K+G G IGR G+G KSV VTD PE S+ FD
Sbjct: 117 EGAETILRMSMSKKRG---GQIGRFGVGVKSVLVVTDTPEFFSSTGSFGFDRAWSYEQIK 173
Query: 1302 -----TSEGQIGFVLPTL--VPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSE 1353
T+ F P L P N+ + DPV E W T +RLP K
Sbjct: 174 SVPGVTARYGAEFDAPVLRMARPLNVQAE---RAADPVLHELLSWATTVVRLPLLPKADR 230
Query: 1354 GIAMN 1358
+ ++
Sbjct: 231 RLGLD 235
>gi|294629556|ref|ZP_06708116.1| conserved hypothetical protein [Streptomyces sp. e14]
gi|292832889|gb|EFF91238.1| conserved hypothetical protein [Streptomyces sp. e14]
Length = 1587
Score = 51.2 bits (121), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 68/164 (41%), Gaps = 20/164 (12%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
L ELVQNA D I E + +L E+ + N + E + + S K+G
Sbjct: 50 LFELVQNAADEIAGEPGG-RIHVVLTETHLYCANEGNPVTPEGAETILRMSMSKKRG--- 105
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--SEGQIG------------FVLPTLV 1315
G IGR G+G KSV VT+APE S FD S QI F P L
Sbjct: 106 GQIGRFGVGVKSVLVVTNAPEFFSRTGSFGFDRAWSYEQIKAVPGVKERYGPEFDAPVLR 165
Query: 1316 PPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMN 1358
+DM S DPV + W T +RLP K + + +
Sbjct: 166 MARPLDMQAERAS-DPVLDKLLTWATTVVRLPLLPKADDRLGKD 208
>gi|392843821|gb|AFM84514.1| sacsin, partial [Calotes versicolor]
Length = 736
Score = 51.2 bits (121), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEVCFVFDPRQHPSDRIFDEKWAP-----LQGPALCVFNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|392843859|gb|AFM84533.1| sacsin, partial [Coturnix coturnix]
Length = 736
Score = 50.8 bits (120), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ PT LQ + V NN Q F+
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPTDRIFDEKWAPLQGPALCVYNN-QPFT 475
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
++IR + ++G TK G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 476 EDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|392843827|gb|AFM84517.1| sacsin, partial [Dibamus bourreti]
Length = 736
Score = 50.8 bits (120), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEVCFVFDSRHHSVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|392843835|gb|AFM84521.1| sacsin, partial [Hemidactylus bowringii]
Length = 736
Score = 50.8 bits (120), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEVCFVFDPRQHPVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|424862194|ref|ZP_18286140.1| hypothetical protein OPAG_00219 [Rhodococcus opacus PD630]
gi|356660666|gb|EHI41030.1| hypothetical protein OPAG_00219 [Rhodococcus opacus PD630]
Length = 1565
Score = 50.8 bits (120), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 45/92 (48%), Gaps = 5/92 (5%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
L ELVQNA D + + + +L + N Q + I L S K+
Sbjct: 81 LHELVQNAADALV--GNQGRIALVLANRKLYCANEGQPLTKSGIETLMASHLSRKRDEE- 137
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
IGR G+GFKSV V+DAPEI S G ++FD
Sbjct: 138 --IGRFGLGFKSVLGVSDAPEIISRGVSIRFD 167
>gi|449436349|ref|XP_004135955.1| PREDICTED: uncharacterized protein LOC101212447 [Cucumis sativus]
Length = 4709
Score = 50.8 bits (120), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 94/218 (43%), Gaps = 41/218 (18%)
Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL---QESGIVVLNNEQGFS 1249
Y + + L ELVQNADD + E L+ L Q ++ NN F+
Sbjct: 33 YPEGTTVLKELVQNADDAGATKVCLCLDRRVHGRESLLSASLAPFQGPALLAYNNAV-FT 91
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-------- 1301
E+ ++ +G S K G A GR G+GF SV+ +T+ P S + V FD
Sbjct: 92 EEDFVSISRIGGSNKHGQ-AWKTGRFGVGFNSVYHLTELPSFVSGKYVVMFDPQGIYLPK 150
Query: 1302 ---TSEGQ-IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-------RTK 1350
++ G+ I F+ + + + D F + D +ES T R P R+K
Sbjct: 151 VSASNPGKRIDFIRSSAISQYR-DQFLPYCAFD-CTMESSFAGTLFRFPLRNTDQAARSK 208
Query: 1351 FS-EGIAMNNIVDMFSDLHPS---LLLFLHRLQCI-MF 1383
S + +I MF++L+ LLFL + CI MF
Sbjct: 209 ISRQAYTEEDISSMFAELYEEGVLTLLFLKSVLCIEMF 246
>gi|182438463|ref|YP_001826182.1| hypothetical protein SGR_4670 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178466979|dbj|BAG21499.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350]
Length = 1589
Score = 50.8 bits (120), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 145/358 (40%), Gaps = 49/358 (13%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
L ELVQNA D I + + +L ++ + N + +AE + + S K+G
Sbjct: 57 LFELVQNAADEIA-HSPGGKVHVVLTDTHLYCANEGKEVTAEGAETILRMSVSKKRG--- 112
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---------TSEGQIG---FVLPTLVPP 1317
G IGR G+G KSV VTDAP+ S FD + G+ F P L
Sbjct: 113 GQIGRFGVGVKSVLAVTDAPQFFSTSGAFGFDRAWSFEEIRKAGGRAAGGDFEAPVLRMA 172
Query: 1318 FNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSE--GIAMNNIV--DMFSDLHPSLL 1372
++ S DPV E W T +RLP ++ G ++ + + +
Sbjct: 173 RPLEAEAE-RSADPVLDELLGWATTVVRLPLLPGAADRLGHDIHTVAGRQIQREFPARFQ 231
Query: 1373 LFLHRLQCIMFRNMLNDSLVVIRKKI------VGDGIIKVSCG--EDKMTWFV------A 1418
LF + ++ N ++ IR++I + I +V G E + TW V
Sbjct: 232 LFSSHVGTVILEN--RRAMPQIRREITIEREDIAHTIHEVRTGKPESRATWKVFTCAHRP 289
Query: 1419 SQKLRAGVIR-PDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQG 1476
+++ RA D T +I+ A+ + P + ++F P + L IL G
Sbjct: 290 TERARASAGEMHDRGTIDISWAVPEYTGDTVLTVPKGRGE-FWSFFPTKYPMTLAGILNG 348
Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVP 1534
+ R+ + +SP+N+ +L L V + LP R PA+ Y+ +P
Sbjct: 349 AWKTNEDRQNLLDSSPFNEEILQVAARLVVES------LP--RLAPAEDPGAYLPLLP 398
>gi|363729189|ref|XP_417138.3| PREDICTED: sacsin [Gallus gallus]
Length = 4612
Score = 50.4 bits (119), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 2572 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 2626
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2627 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2682
Score = 44.7 bits (104), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 54/124 (43%), Gaps = 19/124 (15%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1500 EEYPSISDIFKELLQNADDANASECNFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1558
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+ S +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1559 SE-FSDSDFLNITRLGESLKR-SEVDKVGKFGLGFNSVYHITDVPIILSREFMIMFDPNV 1616
Query: 1305 GQIG 1308
IG
Sbjct: 1617 NHIG 1620
Score = 43.9 bits (102), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 13/103 (12%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTF----ILQESGIVVLN-NEQGFSA 1250
Y + L EL+QNA+D +Y E T T + Q G N+ F+
Sbjct: 140 YPEGGQILKELIQNAEDAGATEVRFLYDETQYGTETLWSKDMAQYQGPAFYAYNDAIFTP 199
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 200 EDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPSIFS 241
>gi|326914333|ref|XP_003203480.1| PREDICTED: sacsin-like [Meleagris gallopavo]
Length = 4699
Score = 50.4 bits (119), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 2658 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 2712
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2713 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2768
Score = 45.1 bits (105), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 78/195 (40%), Gaps = 29/195 (14%)
Query: 1123 EECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISN---- 1178
++C +C + D G T++ P + +D L+ + D + S
Sbjct: 138 QQCPPLCPILVSSVPD---GQRTSRERPVEGNKDDEGLVASPLSDDTKIVDLQKSQNQIF 194
Query: 1179 --MESNMLKKQHARLGRALHCLSQEL------YSQDSHFLLELVQNADDN-------IYP 1223
++S LK R G+ L L Y + L EL+QNA+D +Y
Sbjct: 195 LKLQSEALKGG-GRFGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVRFLYD 253
Query: 1224 ENVEPTLTF----ILQESGIVVLN-NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
E T T + Q G N+ F+ E+ + ++ S KK +GR GIG
Sbjct: 254 ETQYGTETLWSKDMAQYQGPAFYAYNDAVFTPEDWHGIQEIARSRKKDDPLK-VGRFGIG 312
Query: 1279 FKSVFRVTDAPEIHS 1293
F SV+ +TD P I S
Sbjct: 313 FNSVYHITDVPSIFS 327
Score = 42.4 bits (98), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 19/124 (15%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1586 EEYPSISDIFKELLQNADDANATECNFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1644
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+ S +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1645 SE-FSDSDFLNITRLGESLKR-SEVDKVGKFGLGFNSVYHITDVPIILSREFMIMFDPNV 1702
Query: 1305 GQIG 1308
I
Sbjct: 1703 NHIS 1706
>gi|449489127|ref|XP_004158223.1| PREDICTED: LOW QUALITY PROTEIN: sacsin-like [Cucumis sativus]
Length = 3586
Score = 50.4 bits (119), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 94/218 (43%), Gaps = 41/218 (18%)
Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL---QESGIVVLNNEQGFS 1249
Y + + L ELVQNADD + E L+ L Q ++ NN F+
Sbjct: 33 YPEGTTVLKELVQNADDAGATKVCLCLDRRVHGRESLLSASLAPFQGPALLAYNNAV-FT 91
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-------- 1301
E+ ++ +G S K G A GR G+GF SV+ +T+ P S + V FD
Sbjct: 92 EEDFVSISRIGGSNKHGQ-AWKTGRFGVGFNSVYHLTELPSFVSGKYVVMFDPQGIYLPK 150
Query: 1302 ---TSEGQ-IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-------RTK 1350
++ G+ I F+ + + + D F + D +ES T R P R+K
Sbjct: 151 VSASNPGKRIDFIRSSAISQYR-DQFLPYCAFD-CTMESSFAGTLFRFPLRNTDQAARSK 208
Query: 1351 FS-EGIAMNNIVDMFSDLHPS---LLLFLHRLQCI-MF 1383
S + +I MF++L+ LLFL + CI MF
Sbjct: 209 ISRQAYTEEDISSMFAELYEEGVLTLLFLKSVLCIEMF 246
>gi|392843863|gb|AFM84535.1| sacsin, partial [Podocnemis unifilis]
Length = 736
Score = 50.4 bits (119), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDSRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 475
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
++IR + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 476 EDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|327311582|ref|YP_004338479.1| hypothetical protein TUZN_1702 [Thermoproteus uzoniensis 768-20]
gi|326948061|gb|AEA13167.1| hypothetical protein TUZN_1702 [Thermoproteus uzoniensis 768-20]
Length = 849
Score = 50.4 bits (119), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 1428 RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL---RTYGLKFILQGDFVLPSSR 1484
R + + E+A+A + E+ Q +++FLPL RT GLKF++ GDF++ + R
Sbjct: 89 RNNARKREVAIAFQVDEAKSSLVPVEAGLQFLYSFLPLTEVRT-GLKFLIHGDFIVQAGR 147
Query: 1485 EEVDGNSPWNQWLLSEFPALFVSA 1508
++ + WN+W++ E L A
Sbjct: 148 RTLNPEAKWNEWMVREIANLLNRA 171
>gi|328712264|ref|XP_003244766.1| PREDICTED: mediator of RNA polymerase II transcription subunit
14-like [Acyrthosiphon pisum]
Length = 1572
Score = 50.4 bits (119), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 44/94 (46%), Gaps = 15/94 (15%)
Query: 4 NHRPRFRPVGGVGGEGTGG---GQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTN 60
N P+ P G V G Q P+ P NPN NPN+ PNP N
Sbjct: 1462 NMHPQMTPGGPVPSPLMGQHMMNQNPNMVGP---NPNMGGPNPNMVGPNP---------N 1509
Query: 61 IPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQP 94
+ QNPN +QNPN+ N N++ QNPN+ Q P
Sbjct: 1510 MVGQNPNMVVQNPNMVGPNPNMVGQNPNMVGQNP 1543
>gi|345867558|ref|ZP_08819568.1| hypothetical protein BZARG_2748 [Bizionia argentinensis JUB59]
gi|344048054|gb|EGV43668.1| hypothetical protein BZARG_2748 [Bizionia argentinensis JUB59]
Length = 1534
Score = 50.4 bits (119), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 69/302 (22%), Positives = 126/302 (41%), Gaps = 43/302 (14%)
Query: 1206 DSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
D LLE++QN DD ++ + + ++ +E + + N + FS E I +L +S K
Sbjct: 40 DGRQLLEMLQNVDD---TKSKKVRIEWVKKEKKLAISNYGEAFSFEGIESLMRSHSSPK- 95
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---------------TSEGQIGFV 1310
+ YIG KG+GF+S+ + +I++N + F T++ Q+
Sbjct: 96 -TKEDYIGNKGLGFRSLLTWAERIDIYANDCKISFSDEIAHSIFDTKLKLTTNQKQVVKT 154
Query: 1311 LPTLVP---PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDL 1367
+V PF + +LL+ W T I + + + + +++D S+
Sbjct: 155 QAKVVNGAIPFPVLAVPQLLTFK----RQDDWQTVIEIQYTEEEEVEEKIKDVLDQISE- 209
Query: 1368 HPSLLLFLHRLQCIMFRNMLNDSLVVIRK--KIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
LLLFL+ ++ I ++ D +V I K KI + ++S F +
Sbjct: 210 --ELLLFLNHIEEI---ELVKDGVVSIFKSTKIEFEHWTEISINNKVWRVFTKDGLIPQK 264
Query: 1426 VIRPDVKT-TEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSS 1483
+ +T + L L Q Y L F F P + + L ++ G F L +S
Sbjct: 265 AVNGGEETFKKYMLKLAFQNDLSDTYYKL------FNFFPTKISVSLPCVIHGTFELNAS 318
Query: 1484 RE 1485
R+
Sbjct: 319 RD 320
>gi|407472961|ref|YP_006787361.1| hypothetical protein Curi_c04390 [Clostridium acidurici 9a]
gi|407049469|gb|AFS77514.1| hypothetical protein Curi_c04390 [Clostridium acidurici 9a]
Length = 1533
Score = 50.4 bits (119), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 12/99 (12%)
Query: 1210 LLELVQNADDNIYPENV--------EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGN 1261
+LEL+QNADD Y +++ E L + + + V N F E I+A+ N
Sbjct: 57 ILELLQNADD-AYQKSIDMGEKPECELELLIEYKNNILTVSNTGTFFDKEGIKAIVQGNN 115
Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
S K G YIG KG GF+S+ D +I+S F+++F
Sbjct: 116 SPKSGK---YIGNKGTGFRSILNWADKVKIYSGDFNIEF 151
>gi|260788479|ref|XP_002589277.1| hypothetical protein BRAFLDRAFT_102519 [Branchiostoma floridae]
gi|229274453|gb|EEN45288.1| hypothetical protein BRAFLDRAFT_102519 [Branchiostoma floridae]
Length = 4368
Score = 50.4 bits (119), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 85/215 (39%), Gaps = 49/215 (22%)
Query: 1171 GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN---------I 1221
LGP + L Q + R L + E Y + EL+QNADD
Sbjct: 1319 ALGPGMQ------LFGQKEPITRRLKTIIDE-YKEGPGIFYELIQNADDAGATEVKFAVD 1371
Query: 1222 YPENVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
+ +N + + + Q + NN + F+ E+I+ + ++ +TKK + IGR
Sbjct: 1372 WRQNTQVRKSLLSPGMAACQGPALWAYNN-KVFTDEDIQNITEISGATKK-TDTNKIGRF 1429
Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNI------------DMF 1323
G+GF SV+ +TD P S + V FD + ++ L P I D F
Sbjct: 1430 GLGFNSVYHMTDVPSFVSRHYFVCFDPHTTHLAPMVNRLEPGMKIEWNREEIQNIYTDQF 1489
Query: 1324 --------CRLLSKDPVQLESKCWNTCIRLPFRTK 1350
C + + D + T R PFRTK
Sbjct: 1490 KPYDGIFGCNIFNSD-----TPYPATLFRFPFRTK 1519
Score = 42.0 bits (97), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 19/104 (18%)
Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQG 1247
E YS EL+QNADD IY LQ ++V NN +
Sbjct: 2317 ESYSTGIDIFKELLQNADDAGATEVKFIYDTRCHSDKKIFDKEWKYLQGPSLLVYNN-RT 2375
Query: 1248 FSAENIRALCDVG--NSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
FS +I A+ +G N +KGS+ IG+ G+GF +V+ +TD P
Sbjct: 2376 FSQSDIDAIQKIGRGNKQEKGST---IGQFGVGFNAVYHLTDCP 2416
>gi|392843845|gb|AFM84526.1| sacsin, partial [Struthio camelus]
Length = 736
Score = 50.4 bits (119), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|397771742|ref|YP_006543591.1| hypothetical protein NJ7G_4300 [Natrinema sp. J7-2]
gi|397688955|gb|AFO59515.1| hypothetical protein NJ7G_4300 [Natrinema sp. J7-2]
Length = 1637
Score = 50.4 bits (119), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 1208 HFLLELVQNADDNIYPENVEP-TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK-- 1264
F+ EL+QNADD + ++ +P + F L+ ++V NN + F A ++ A+ + +TK
Sbjct: 48 RFVFELLQNADDEMPEDSDKPRKVRFELRGDCLLVANNGRPFDAADLDAITTLTQTTKGD 107
Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHS-NG 1295
+G A IG KG GF S+ +T P I S NG
Sbjct: 108 EGDEA-TIGHKGRGFTSILDITSNPAIFSQNG 138
>gi|356576915|ref|XP_003556575.1| PREDICTED: sacsin-like [Glycine max]
Length = 4756
Score = 50.4 bits (119), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 53/111 (47%), Gaps = 13/111 (11%)
Query: 1203 YSQDSHFLLELVQNADD------------NIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
Y + + L EL+QNADD +P + + + ++ N+ F+
Sbjct: 30 YPEGTTVLKELIQNADDAGATTVSLCLDRRSHPAGSLLSDSLAQWQGPALLAFNDAVFTE 89
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
E+ ++ +G STK G A GR G+GF SV+ +TD P S+ + V FD
Sbjct: 90 EDFVSISKIGGSTKHGQ-ASKTGRFGVGFNSVYHLTDLPSFVSHKYVVLFD 139
>gi|448495168|ref|ZP_21609788.1| hypothetical protein C463_14400 [Halorubrum californiensis DSM 19288]
gi|445688533|gb|ELZ40790.1| hypothetical protein C463_14400 [Halorubrum californiensis DSM 19288]
Length = 1637
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 53/92 (57%), Gaps = 5/92 (5%)
Query: 1208 HFLLELVQNADDNIYPENVEP-TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK-- 1264
F+ EL+QNADD + ++ +P + F L+ ++V NN + F A ++ A+ + +TK
Sbjct: 48 RFVFELLQNADDEMPEDSDKPRKVRFELRGDCLLVANNGRPFDAADLDAITTLTQTTKGD 107
Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHS-NG 1295
+G A IG KG GF S+ +T P I S NG
Sbjct: 108 EGDEA-TIGHKGRGFTSILDITSNPAIFSQNG 138
>gi|449483857|ref|XP_004186247.1| PREDICTED: LOW QUALITY PROTEIN: sacsin [Taeniopygia guttata]
Length = 4432
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 50/109 (45%), Gaps = 25/109 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 2392 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 2446
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
Q F+ ++IR + ++G TK G+ G+ GIGF SV+ +TD P S
Sbjct: 2447 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFLS 2493
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 96/246 (39%), Gaps = 56/246 (22%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1320 EEYPSISDIFKELLQNADDANATECNFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1378
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+ S +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1379 SE-FSDSDFLNITRLGESLKR-SEVDKVGKFGLGFNSVYHITDVPIILSREFMIMFDPNV 1436
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMFCRLLSKDPVQLES--KCWN 1340
I P PF C+L P+ +ES
Sbjct: 1437 NHISKHIRDKSNPGIKINWSKQQKRLRKFPNQFKPFIGVFGCQL----PLTVESPYSYKG 1492
Query: 1341 TCIRLPFRT----KFSE-GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIR 1395
T RL FRT K SE A N D++S L + HRL I+F +N S+V+
Sbjct: 1493 TLFRLSFRTQQEAKVSEVSSASYNTADIYS-LVDEFSICGHRL--IIFAQSVN-SMVLKY 1548
Query: 1396 KKIVGD 1401
KI D
Sbjct: 1549 LKIEAD 1554
>gi|154284896|ref|XP_001543243.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150406884|gb|EDN02425.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 953
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 46/81 (56%), Gaps = 5/81 (6%)
Query: 1433 TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSP 1492
T E+ LA + +S+ P++ QQ V++FLP+R G KF++Q DF+ ++R+ V P
Sbjct: 94 TAEVILAFPVDDSHR----PIIEQQHVYSFLPMRQEGFKFLIQSDFITTANRQGVH-LYP 148
Query: 1493 WNQWLLSEFPALFVSAERSFC 1513
N + +F+ A +FC
Sbjct: 149 RNYTIRERISLVFIQAVYNFC 169
>gi|149030215|gb|EDL85271.1| rCG52205 [Rattus norvegicus]
Length = 3830
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 1789 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1847
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 1848 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1899
Score = 44.3 bits (103), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 719 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 777
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 778 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 835
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 836 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 890
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 891 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 950
Query: 1387 ------LNDSLVVIRKK 1397
L ++I+KK
Sbjct: 951 EETNPTLAQDTIIIKKK 967
>gi|395848271|ref|XP_003796777.1| PREDICTED: sacsin [Otolemur garnettii]
Length = 4604
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2563 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2621
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2622 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2673
Score = 46.2 bits (108), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + V NN F+
Sbjct: 122 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 180
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 181 SEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 223
Score = 42.7 bits (99), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 99/258 (38%), Gaps = 68/258 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1493 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1551
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+ +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1552 SE-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHLTDIPIIMSREFMIMFDPNI 1609
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
I P PF ID+F C+L P+ +E+
Sbjct: 1610 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDIFGCQL----PLTVEAPYSYH 1664
Query: 1340 NTCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1665 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1724
Query: 1387 ------LNDSLVVIRKKI 1398
L V+I+KK+
Sbjct: 1725 EETNPSLAQDTVIIKKKV 1742
>gi|148704160|gb|EDL36107.1| mCG7227 [Mus musculus]
Length = 3830
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 1789 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1847
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 1848 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1899
Score = 45.4 bits (106), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 102/258 (39%), Gaps = 68/258 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 719 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 777
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 778 SE-FSDSDFLNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 835
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 836 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 890
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 891 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 950
Query: 1387 ------LNDSLVVIRKKI 1398
L ++I+KK+
Sbjct: 951 EETNPSLAQDTIIIKKKV 968
>gi|260822048|ref|XP_002606415.1| hypothetical protein BRAFLDRAFT_67666 [Branchiostoma floridae]
gi|229291756|gb|EEN62425.1| hypothetical protein BRAFLDRAFT_67666 [Branchiostoma floridae]
Length = 4521
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 146/379 (38%), Gaps = 65/379 (17%)
Query: 1018 LPSEFRSFAADVLLSGLQSTIKDAP-SAILLECHQTELRLMLHEVGLSLGILEWIHDYHA 1076
LP EFR F G++ ++ + L E T ++ LSLGI++WI +
Sbjct: 1288 LPDEFRDFKQMFRAVGVKESLDELDLIQTLHEIRSTPKLSTERDLKLSLGIIDWIVSHPD 1347
Query: 1077 FCSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEA 1136
D+L+P + +++ L S C DV + A+ D
Sbjct: 1348 ILQKVRDDILVP--IATKDSSRLTLAPVSECTYN--------DVSWSDAQALLD------ 1391
Query: 1137 SDDGLGDCTTQTLPEDKE--CEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRA 1194
+D + +P D E +L + +EFG QH ++ +
Sbjct: 1392 DEDEEFKPLHKQVPLDTAQLLEVPSLSQRVVDAEEFGFDQC----------GQHEKVTQR 1441
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESGIVVLN-- 1243
L + + Y + + EL+QNADD E N + I Q G+ V +
Sbjct: 1442 LKNILND-YPEGAGIFKELIQNADDAGATEVKFLVDWRTNEDSREDLIDQ--GMAVCHGP 1498
Query: 1244 -----NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
N FS E+ + + ++G TK + +GR G+GF SV+ VTD P S +
Sbjct: 1499 ALWAFNNATFSEEDFKNIQNLGGQTKL-DALEKVGRFGVGFNSVYHVTDVPSFVSGSRVI 1557
Query: 1299 KFDTSEGQIGFVLPTLVPP---FNIDMFCRLLSKDPVQLE----------SKCWN-TCIR 1344
FD + + P ++ RL+S+ Q + S +N T R
Sbjct: 1558 FFDPHATHLSKHIKDKSKPGIGLDLQKNTRLVSRFKDQFKPFQGVFGYSTSTSYNGTLFR 1617
Query: 1345 LPFRTKFSEGIAMNNIVDM 1363
LP RT +G + + I D+
Sbjct: 1618 LPLRT--PQGASDSEISDV 1634
>gi|410947206|ref|XP_003980343.1| PREDICTED: sacsin [Felis catus]
Length = 4653
Score = 50.1 bits (118), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2612 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2670
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2671 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2722
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 182 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGPALYVYNNAV-FT 240
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 241 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 283
Score = 43.5 bits (101), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 63/234 (26%), Positives = 91/234 (38%), Gaps = 57/234 (24%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1542 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1600
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1601 SQ-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1658
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESKCW-- 1339
I P PF ID+F CRL P+ +E+
Sbjct: 1659 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCRL----PLTVEAPYSYD 1713
Query: 1340 NTCIRLPFRTKFSEGIA-----MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLN 1388
T RL FRT+ ++ N D++S L L HRL I+F +N
Sbjct: 1714 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYS-LVDEFSLCGHRL--IIFTQSVN 1764
>gi|392843831|gb|AFM84519.1| sacsin, partial [Anas platyrhynchos]
Length = 736
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|344284593|ref|XP_003414050.1| PREDICTED: sacsin [Loxodonta africana]
Length = 4805
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2764 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2822
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2823 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2874
Score = 47.0 bits (110), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 101/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1694 EEYPSVSDIFKELLQNADDAGATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1752
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1753 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1810
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
I P PF ID+F C+L P+ LE+
Sbjct: 1811 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTLEAPYSYK 1865
Query: 1340 NTCIRLPFRT----KFSE--GIAMN-----NIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT K SE G N ++VD FS L++F + + + +
Sbjct: 1866 GTLFRLSFRTQQEAKVSEVSGTCYNTADIYSLVDEFSLCGHRLIVFTQSVNSMYLKYLKI 1925
Query: 1387 ------LNDSLVVIRKK 1397
L +++I+KK
Sbjct: 1926 EETNPNLAQDIIIIKKK 1942
Score = 44.3 bits (103), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 60/153 (39%), Gaps = 42/153 (27%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + NN F+
Sbjct: 333 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYAYNNAI-FT 391
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
E+ + ++ S KK +GR GIGF SV+ +TD P I S QIG
Sbjct: 392 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFSG----------DQIGM 440
Query: 1310 VLP--TLVPPFNIDMFCRLLSKDPVQLESKCWN 1340
+ P TL P Q +CWN
Sbjct: 441 LDPHQTLFGP---------------QESGQCWN 458
>gi|392843851|gb|AFM84529.1| sacsin, partial [Columba livia]
Length = 736
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
Q F+ ++IR + ++G TK G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527
>gi|344248514|gb|EGW04618.1| Sacsin [Cricetulus griseus]
Length = 4371
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2330 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2388
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2389 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2440
Score = 44.3 bits (103), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1259 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1317
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1318 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1375
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1376 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1430
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1431 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1490
Query: 1387 ------LNDSLVVIRKK 1397
L ++I+KK
Sbjct: 1491 EESNPSLAQDTIIIKKK 1507
>gi|348583059|ref|XP_003477292.1| PREDICTED: LOW QUALITY PROTEIN: sacsin-like [Cavia porcellus]
Length = 4573
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2532 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2590
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2591 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2642
>gi|345790413|ref|XP_534533.3| PREDICTED: LOW QUALITY PROTEIN: sacsin [Canis lupus familiaris]
Length = 4575
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2534 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2592
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2593 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2644
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 104 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGPALYVYNNAV-FT 162
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 163 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 205
Score = 43.1 bits (100), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 98/257 (38%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1464 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1522
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1523 SQ-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1580
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
I P PF ID+F C+L P+ +E+
Sbjct: 1581 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYS 1635
Query: 1340 NTCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1636 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVTSMYLKYLKI 1695
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1696 EETNPSLAQDTVIIKKK 1712
>gi|194221780|ref|XP_001490776.2| PREDICTED: sacsin [Equus caballus]
Length = 4612
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2571 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2629
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2630 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2681
Score = 45.4 bits (106), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + V NN F+
Sbjct: 142 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 200
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 201 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 243
>gi|444706237|gb|ELW47586.1| Sacsin [Tupaia chinensis]
Length = 4519
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2478 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2536
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2537 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2588
Score = 43.9 bits (102), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 100/257 (38%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1408 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1466
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1467 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1524
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C L P+ +E+ +N
Sbjct: 1525 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCHL----PLTVEAPYSYN 1579
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1580 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1639
Query: 1387 ------LNDSLVVIRKK 1397
L ++I+KK
Sbjct: 1640 EETNPSLAQDTIIIKKK 1656
>gi|281346562|gb|EFB22146.1| hypothetical protein PANDA_009473 [Ailuropoda melanoleuca]
Length = 4582
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2541 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2599
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2600 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2651
Score = 44.3 bits (103), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 100/257 (38%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1471 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1529
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1530 SQ-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1587
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1588 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1642
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1643 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1702
Query: 1387 ------LNDSLVVIRKK 1397
L VVI+KK
Sbjct: 1703 EETNPRLAQDTVVIKKK 1719
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 112 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGPALYVYNNAV-FT 170
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 171 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 213
>gi|351695481|gb|EHA98399.1| Sacsin [Heterocephalus glaber]
Length = 4576
Score = 50.1 bits (118), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2590 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2648
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2649 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2700
>gi|428185144|gb|EKX53997.1| hypothetical protein GUITHDRAFT_100248 [Guillardia theta CCMP2712]
Length = 4444
Score = 50.1 bits (118), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 60/134 (44%), Gaps = 15/134 (11%)
Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD------NIYPENVEPTLTFI 1233
+S + + GR H L ELY + + EL+QNADD +I + + +
Sbjct: 1382 DSESFGQSESLTGRLKHIL--ELYPEGPGIMNELIQNADDAQATEFSIIVNKCTYSTSSL 1439
Query: 1234 LQ------ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
L + + +N FS ++ + +C +G +K IGR G+GF +V+ TD
Sbjct: 1440 LSPRMASWQGPAIYFHNNSIFSPDDFKNICKIGQGSKL-DFLNKIGRFGLGFNAVYHFTD 1498
Query: 1288 APEIHSNGFHVKFD 1301
P S + V FD
Sbjct: 1499 VPSFVSGDYLVYFD 1512
Score = 42.4 bits (98), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 71/172 (41%), Gaps = 29/172 (16%)
Query: 1203 YSQDSHFLLELVQNADD------------NIYPEN--VEPTLTFILQESGIVVLNNEQGF 1248
Y + L E +QNADD +P + +P L Q S +V N+ F
Sbjct: 40 YPPGATILKEFIQNADDAGAQQIKICLDERSFPSSSLADPKLGQFQQSSLLVY--NDAVF 97
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
S E+ ++ +G S+K+ GR GIGF S + +T+ P S V FD +
Sbjct: 98 SDEDFDSIQRIGQSSKQAHPTK-TGRFGIGFNSCYHLTELPTFLSRSSIVMFDPQAKYLP 156
Query: 1309 FVLPTLVPPFNIDMFCRLLSKD-PVQL---------ESKCWN-TCIRLPFRT 1349
V P P I+ + ++ P Q+ K +N T RLP RT
Sbjct: 157 NVNPA-NPGKRINFLSEEIQRNYPDQIAPYFNFGARSDKAFNGTLFRLPLRT 207
>gi|301770575|ref|XP_002920712.1| PREDICTED: sacsin-like [Ailuropoda melanoleuca]
Length = 4608
Score = 50.1 bits (118), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2567 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2625
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2626 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2677
Score = 44.3 bits (103), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 100/257 (38%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1497 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1555
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1556 SQ-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1613
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1614 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1668
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1669 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1728
Query: 1387 ------LNDSLVVIRKK 1397
L VVI+KK
Sbjct: 1729 EETNPRLAQDTVVIKKK 1745
Score = 43.9 bits (102), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 138 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGPALYVYNNAV-FT 196
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 197 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 239
>gi|293342144|ref|XP_001063390.2| PREDICTED: sacsin isoform 1 [Rattus norvegicus]
gi|293353889|ref|XP_224256.5| PREDICTED: sacsin isoform 2 [Rattus norvegicus]
Length = 4581
Score = 50.1 bits (118), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2540 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2598
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2599 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2650
Score = 45.4 bits (106), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + V NN F+
Sbjct: 106 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 164
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 165 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 207
Score = 44.3 bits (103), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1470 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1528
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1529 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1586
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1587 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1641
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1642 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1701
Query: 1387 ------LNDSLVVIRKK 1397
L ++I+KK
Sbjct: 1702 EETNPTLAQDTIIIKKK 1718
>gi|392333336|ref|XP_003752865.1| PREDICTED: sacsin isoform 2 [Rattus norvegicus]
gi|392353567|ref|XP_003751541.1| PREDICTED: sacsin isoform 1 [Rattus norvegicus]
Length = 4578
Score = 50.1 bits (118), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2537 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2595
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2596 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2647
Score = 45.4 bits (106), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + V NN F+
Sbjct: 103 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 161
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 162 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 204
Score = 44.3 bits (103), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1467 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1525
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1526 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1583
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1584 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1638
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1639 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1698
Query: 1387 ------LNDSLVVIRKK 1397
L ++I+KK
Sbjct: 1699 EETNPTLAQDTIIIKKK 1715
>gi|354476692|ref|XP_003500557.1| PREDICTED: sacsin [Cricetulus griseus]
Length = 4610
Score = 50.1 bits (118), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2569 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2627
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2628 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2679
Score = 45.4 bits (106), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + V NN F+
Sbjct: 133 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 191
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 192 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 234
Score = 44.3 bits (103), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1498 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1556
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1557 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1614
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1615 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1669
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1670 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1729
Query: 1387 ------LNDSLVVIRKK 1397
L ++I+KK
Sbjct: 1730 EESNPSLAQDTIIIKKK 1746
>gi|334330536|ref|XP_003341374.1| PREDICTED: LOW QUALITY PROTEIN: sacsin-like [Monodelphis domestica]
Length = 4680
Score = 50.1 bits (118), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2640 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2698
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2699 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2750
Score = 44.3 bits (103), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 101/266 (37%), Gaps = 68/266 (25%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1571 EEYPSVSDIFKELLQNADDANATECNFMIDMRRNMDIRENLLDPGMA-ACHGPALWSYNN 1629
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1630 SE-FSDSDFINITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1687
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
+ P PF ID+F C+L P+ +E+
Sbjct: 1688 NHLSKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYK 1742
Query: 1340 NTCIRLPFRT----KFSE-------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT K SE + ++VD FS L+LF + + + +
Sbjct: 1743 GTLFRLSFRTQQEAKLSEVSSTCYNTADIYSLVDEFSLCGHRLILFTQNVNTMFLKYLKI 1802
Query: 1387 ------LNDSLVVIRKKIVGDGIIKV 1406
L +VI+K I + V
Sbjct: 1803 EETNPGLAQDTIVIKKNICSSKALNV 1828
Score = 43.5 bits (101), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T Q S + V NN F+
Sbjct: 210 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGKETLWSKDMEQYQGSALYVYNNAV-FT 268
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 269 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 311
>gi|395520793|ref|XP_003764508.1| PREDICTED: sacsin [Sarcophilus harrisii]
Length = 4804
Score = 50.1 bits (118), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2765 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2823
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2824 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2875
Score = 45.1 bits (105), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 65/266 (24%), Positives = 103/266 (38%), Gaps = 68/266 (25%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1696 EEYPSVSDIFKELLQNADDADATECNFMIDMRRNMDIRENLLDPGMA-ACHGPALWSYNN 1754
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1755 SE-FSDSDFINITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1812
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
I P PF ID+F C+L P+ +E+
Sbjct: 1813 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYK 1867
Query: 1340 NTCIRLPFRT----KFSE-------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT K SE + ++VD FS L+LF + + + +
Sbjct: 1868 GTLFRLSFRTQQEAKQSEISSTCYNTADIYSLVDEFSLCGHRLILFTQNVNTMFLKYLKI 1927
Query: 1387 ------LNDSLVVIRKKIVGDGIIKV 1406
L +++I+K + ++ V
Sbjct: 1928 EETNPGLAQDIIIIKKNMCSSKVLNV 1953
Score = 43.5 bits (101), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T Q S + V NN F+
Sbjct: 335 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGKETLWSKDMEQYQGSALYVYNNAV-FT 393
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 394 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 436
>gi|453052088|gb|EME99578.1| hypothetical protein H340_15726 [Streptomyces mobaraensis NBRC 13819
= DSM 40847]
Length = 1585
Score = 49.7 bits (117), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 64/154 (41%), Gaps = 22/154 (14%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
L ELVQNA D I E + +L ++ + N + E + + S K+G
Sbjct: 50 LFELVQNAADEIRDEP-GGRIHVVLTDTHLYCANQGNPVTPEGAETILRMSMSRKRG--- 105
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---------------EGQIGFVLPTL 1314
G IGR G+G KSV VTD PE S+ FD + G+ F P L
Sbjct: 106 GQIGRFGVGVKSVLVVTDTPEFFSSTGSFGFDRAWSYDLIRNVPGVRERHGE-DFEAPVL 164
Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPF 1347
+D L+ DPV E W T +RLP
Sbjct: 165 RMARELDSDAERLA-DPVLDELLGWATTVVRLPL 197
>gi|6907042|gb|AAF31262.1|AF193556_1 sacsin [Homo sapiens]
gi|119628727|gb|EAX08322.1| hCG1774880 [Homo sapiens]
Length = 3829
Score = 49.7 bits (117), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 1789 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1847
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 1848 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1899
Score = 47.0 bits (110), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 719 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 777
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 778 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 835
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 836 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 890
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 891 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 950
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 951 EETNPSLAQDTVIIKKK 967
>gi|390358686|ref|XP_003729316.1| PREDICTED: sacsin-like [Strongylocentrotus purpuratus]
Length = 2828
Score = 49.7 bits (117), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 105/251 (41%), Gaps = 52/251 (20%)
Query: 1171 GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE------ 1224
GLGP+ ++ + Q L + + + Y D L ELVQNADD+ E
Sbjct: 813 GLGPHTLEAQNLLPFGQRQPLTERIKNILKG-YPCDDTILKELVQNADDSCATEVHLVLD 871
Query: 1225 -NVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
P LQ + V NN + F+ +I + +G +K+ S+ +GR G+
Sbjct: 872 VRTHPKDRLFRDEMSSLQGPALCVYNN-RSFTDADIEGIQRLGVGSKRNLSSK-VGRFGV 929
Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL-VPPFNIDMFCRLLSKDPVQLES 1336
GF +V+ +TD P ++NG DT +G + P + P + ++K +L+
Sbjct: 930 GFNAVYHLTDCPSFYTNG-----DT----LGMLDPNVKYAPNATLAYPGFMAKPTGKLKD 980
Query: 1337 ------KCW---------NTCIRLPFRT-----------KFSEGIAMNNIVDMFSDLHPS 1370
C+ +T RLP RT K + ++ ++ F + P
Sbjct: 981 GFPNVFNCYLPEFFEAQGSTMFRLPLRTESMARTSDISSKAHDVDSVKRLMRKFKEGLPD 1040
Query: 1371 LLLFLHRLQCI 1381
LLLFL ++ +
Sbjct: 1041 LLLFLKHVKKV 1051
>gi|281211462|gb|EFA85624.1| putative ATP binding protein [Polysphondylium pallidum PN500]
Length = 2733
Score = 49.7 bits (117), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 93/417 (22%), Positives = 154/417 (36%), Gaps = 96/417 (23%)
Query: 1145 TTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYS 1204
T L D DA + D G I E +K RL E Y
Sbjct: 1256 TYSILHRDLSLNDARKLKIKTSLDTIKRGKTILGSEFGQFEKLLTRLKNI-----NEEYK 1310
Query: 1205 QDSHFLLELVQNADDN--------------IYPENVEPTL--TFILQESGIVVLNNEQGF 1248
+S L E+VQNA+D+ ++++P L F +V+ N+ F
Sbjct: 1311 VES-LLKEVVQNAEDSKATVLEICLDKRRYTITDDLDPELPKNFGNYLEPSLVIFNDSKF 1369
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI----------------- 1291
+ +I+ + +G+S KK + + IG GIGF V+ +TD P I
Sbjct: 1370 TDLDIQNIQSLGDSYKKNNYSK-IGHYGIGFNCVYNITDVPSIITRDSLFIFDPLKSHFS 1428
Query: 1292 --HSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT 1349
HS G +F + + P P F + QL+ + +T IRLP R
Sbjct: 1429 VEHSAGRSYEFSIRDQLVDTFKPLNYPGFGL------------QLDKEFDHTIIRLPLRK 1476
Query: 1350 KFSE-------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND------SLVVIRK 1396
+S+ ++N ++ F + LLF L+ I F + +D +L + +
Sbjct: 1477 SYSDLSSNVWTMDSINKLLKEFQSSAENCLLFQRSLKKITFSVIEDDNSNQAKTLYSVER 1536
Query: 1397 KIVGD-GIIKVSCGEDKMTW-------FVASQKLRAGVIRPDVKTTEIALALTLQESNEG 1448
++ G I +S + W + V+ +I Q SN
Sbjct: 1537 RVTGGYDPINLSSDIEAYNWKNFNPKSYFMDIITNGNVVNKWFVGWKIGAE---QSSNLF 1593
Query: 1449 NYGPLLYQQPV-----------------FAFLPL-RTYGLKFILQGDFVLPSSREEV 1487
L+Q+ + F +LPL ++ L ++ G F+L SSR++V
Sbjct: 1594 KIYEKLHQKSIPLGSIAICMDKEIIGVPFTYLPLPKSTELPVMVNGSFILNSSRQDV 1650
>gi|45382637|ref|NP_990050.1| chondroitin sulfate proteoglycan 5 precursor [Gallus gallus]
gi|2065050|emb|CAA70459.1| epidermal growth factor [Gallus gallus]
Length = 551
Score = 49.7 bits (117), Expect = 0.018, Method: Composition-based stats.
Identities = 44/169 (26%), Positives = 65/169 (38%), Gaps = 37/169 (21%)
Query: 6 RPRFRPVGGVG--GEGTGGGQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTN--- 60
RP P G+G G G P P NPN +PN+ PNP+L L
Sbjct: 134 RPPGLPSPGLGLSSPGPNLGLPSLDLP----NPNLGLPDPNLGLPNPSLGLPSPGPTPDR 189
Query: 61 -IPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQ--------------------------Q 93
IP NP+ D+ +P L +Q NL NPN+P Q
Sbjct: 190 PIPNPNPSLDLPDPGLAIQTPNLGLSNPNIPLPSPSPGPGTEPDLLPVAEDSEVSMELPQ 249
Query: 94 PPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFLERID 142
P SS QR++ + P++ P + P P+ ++ ++ + D
Sbjct: 250 PSSSPAPAQRARGRTDRTWLGAPEPISAAPGTAEP-PEIIDVDYYDVFD 297
>gi|297274100|ref|XP_001089964.2| PREDICTED: sacsin-like isoform 2 [Macaca mulatta]
Length = 4517
Score = 49.7 bits (117), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 132/345 (38%), Gaps = 67/345 (19%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2536 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2594
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFDT----SE 1304
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD +
Sbjct: 2595 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFDPHARYAP 2653
Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLE-------SKCWNTCIRLPFR----TKFSE 1353
G + + D R D + L C T R P R K SE
Sbjct: 2654 GATSISPGRMFRDLDADF--RTQFSDVLDLYLGTHFKLDNC--TMFRFPLRNAEMAKVSE 2709
Query: 1354 -------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKV 1406
+ N++D LL+FL+ ++ I S+ I K ++
Sbjct: 2710 ISSVPASDRMVQNLLDKLRSDGAELLMFLNHMEKI--------SICEIDKSTGALNVLYS 2761
Query: 1407 SCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR 1466
G+ + ++ A VI K ++ + +Q+ + + F FLPL
Sbjct: 2762 VKGKITDGDRLKRKQFHASVIDSVTKKRQLK-DIPVQQXXKPHRA--------FCFLPLS 2812
Query: 1467 -TYGLKFILQGDFVLPSSREEV--DGN-----SPWNQWLLSEFPA 1503
GL F + G F L S+R + D N S WN L++ A
Sbjct: 2813 LETGLPFHVNGHFALDSARRNLWRDDNGVGVRSDWNNSLMTALIA 2857
Score = 45.1 bits (105), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1466 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1524
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1525 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1582
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1583 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1637
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1638 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1697
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1698 EETNPSLAQDTVIIKKK 1714
>gi|426236795|ref|XP_004012352.1| PREDICTED: sacsin [Ovis aries]
Length = 4691
Score = 49.7 bits (117), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2650 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDDKWAPLQGPALCVYNN-QPFT 2708
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ G+GF SV+ +TD P I N FD
Sbjct: 2709 EDDVRGIQNLGRGTKEGNPCK-TGQYGVGFNSVYHITDCPSFISGNDILCIFD 2760
Score = 44.3 bits (103), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 216 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTQTLWSKDMAQYQGPALYVYNNAV-FT 274
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 275 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 317
Score = 43.1 bits (100), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 19/123 (15%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1580 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1638
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1639 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPVIMSREFMIMFDPNI 1696
Query: 1305 GQI 1307
I
Sbjct: 1697 NHI 1699
>gi|410047646|ref|XP_001174272.3| PREDICTED: sacsin [Pan troglodytes]
Length = 4722
Score = 49.7 bits (117), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2682 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2740
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2741 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2792
Score = 47.0 bits (110), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1612 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1670
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1671 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1728
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1729 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1783
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1784 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1843
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1844 EETNPSLAQDTVIIKKK 1860
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 250 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 308
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 309 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 351
>gi|355700867|gb|EHH28888.1| DnaJ-like protein subfamily C member 29, partial [Macaca mulatta]
Length = 4523
Score = 49.7 bits (117), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2483 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2541
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2542 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2593
Score = 45.1 bits (105), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1413 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1472 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1529
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1530 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1584
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1585 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1644
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1645 EETNPSLAQDTVIIKKK 1661
>gi|440911870|gb|ELR61498.1| Sacsin, partial [Bos grunniens mutus]
Length = 4032
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 1991 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2049
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ G+GF SV+ +TD P I N FD
Sbjct: 2050 EDDVRGIQNLGRGTKEGNPCK-TGQYGVGFNSVYHITDCPSFISGNDILCIFD 2101
Score = 43.1 bits (100), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 19/124 (15%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 921 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 979
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 980 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPVIMSREFMIMFDPNI 1037
Query: 1305 GQIG 1308
I
Sbjct: 1038 NHIS 1041
>gi|431903116|gb|ELK09292.1| Sacsin [Pteropus alecto]
Length = 3445
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 1520 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRVFDDKWAPLQGPALCVYNN-QPFT 1578
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 1579 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1630
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 59/208 (28%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 772 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 830
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 831 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 888
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESKCWNT 1341
I P PF ID+F CRL P+ +E+
Sbjct: 889 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCRL----PLTVEA----- 938
Query: 1342 CIRLPFRTKFSEGIAMNN---IVDMFSD 1366
P+ +F E IA + +FSD
Sbjct: 939 ----PYSYRFYEDIAHGKGKELTRVFSD 962
>gi|410350047|gb|JAA41627.1| spastic ataxia of Charlevoix-Saguenay (sacsin) [Pan troglodytes]
Length = 4579
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649
Score = 47.0 bits (110), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 107 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 166 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208
>gi|441613500|ref|XP_004088146.1| PREDICTED: sacsin [Nomascus leucogenys]
Length = 4527
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2487 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2545
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2546 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2597
Score = 47.0 bits (110), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1417 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1475
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1476 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1533
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1534 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1588
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1589 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1648
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1649 EETNPSLAQDTVIIKKK 1665
>gi|410220792|gb|JAA07615.1| spastic ataxia of Charlevoix-Saguenay (sacsin) [Pan troglodytes]
gi|410306884|gb|JAA32042.1| spastic ataxia of Charlevoix-Saguenay (sacsin) [Pan troglodytes]
Length = 4579
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649
Score = 47.0 bits (110), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 107 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 166 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208
>gi|380798317|gb|AFE71034.1| sacsin, partial [Macaca mulatta]
Length = 3313
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 1273 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1331
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 1332 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1383
Score = 44.7 bits (104), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 203 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 261
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 262 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 319
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 320 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 374
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 375 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 434
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 435 EETNPSLAQDTVIIKKK 451
>gi|402901560|ref|XP_003913715.1| PREDICTED: sacsin [Papio anubis]
Length = 4579
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649
Score = 45.1 bits (105), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSNTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717
>gi|392843825|gb|AFM84516.1| sacsin, partial [Zosterops japonicus]
Length = 736
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 50/109 (45%), Gaps = 25/109 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
Q F+ ++IR + ++G TK G+ G+ GIGF SV+ +TD P S
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFLS 518
>gi|355761304|gb|EHH61786.1| DnaJ-like protein subfamily C member 29, partial [Macaca
fascicularis]
Length = 4523
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2483 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2541
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2542 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2593
Score = 45.1 bits (105), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1413 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1472 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1529
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1530 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1584
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1585 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1644
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1645 EETNPSLAQDTVIIKKK 1661
>gi|397526330|ref|XP_003833084.1| PREDICTED: LOW QUALITY PROTEIN: sacsin [Pan paniscus]
Length = 4579
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649
Score = 45.8 bits (107), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700
Query: 1387 ------LNDSLVVIRKK 1397
L V+I KK
Sbjct: 1701 EETNPSLAQDTVIIXKK 1717
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 107 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 166 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208
>gi|426374909|ref|XP_004054297.1| PREDICTED: sacsin [Gorilla gorilla gorilla]
Length = 4579
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649
Score = 47.0 bits (110), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717
Score = 43.9 bits (102), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 107 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 166 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208
>gi|76364133|sp|Q9DF69.1|CSPG5_CHICK RecName: Full=Chondroitin sulfate proteoglycan 5; AltName:
Full=Acidic leucine-rich EGF-like domain-containing
brain protein; Contains: RecName: Full=Chondroitin
sulfate proteoglycan 5, 38 kDa form; Contains: RecName:
Full=Chondroitin sulfate proteoglycan 5, 80 kDa form;
Flags: Precursor
gi|11093919|gb|AAG29499.1|AF292101_1 CALEB isoform b [Gallus gallus]
Length = 594
Score = 49.7 bits (117), Expect = 0.019, Method: Composition-based stats.
Identities = 44/169 (26%), Positives = 65/169 (38%), Gaps = 37/169 (21%)
Query: 6 RPRFRPVGGVG--GEGTGGGQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTN--- 60
RP P G+G G G P P NPN +PN+ PNP+L L
Sbjct: 134 RPPGLPSPGLGLSSPGPNLGLPSLDLP----NPNLGLPDPNLGLPNPSLGLPSPGPTPDR 189
Query: 61 -IPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQ--------------------------Q 93
IP NP+ D+ +P L +Q NL NPN+P Q
Sbjct: 190 PIPNPNPSLDLPDPGLAIQTPNLGLSNPNIPLPSPSPGPGTEPDLLPVAEDSEVSMELPQ 249
Query: 94 PPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFLERID 142
P SS QR++ + P++ P + P P+ ++ ++ + D
Sbjct: 250 PSSSPAPAQRARGRTDRTWLGAPEPISAAPGTAEP-PEIIDVDYYDVFD 297
>gi|345319042|ref|XP_001519222.2| PREDICTED: sacsin [Ornithorhynchus anatinus]
Length = 4639
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2598 YPSEKEMLKELLQNADDAKATEICFVFDPRHHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2656
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2657 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2708
Score = 45.8 bits (107), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQE------SGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T +E S + V NN F+
Sbjct: 167 YPEGGQILKELIQNAEDAGATEVKFLYDETKYGTETLWSKEMEQYQGSALYVYNNAV-FT 225
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 226 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 268
Score = 43.1 bits (100), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 100/258 (38%), Gaps = 68/258 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1528 EEYPSVSDIFKELLQNADDANATECNFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1586
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1587 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1644
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
+ P PF ID+F C+L P+ +E+
Sbjct: 1645 NHLSKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYK 1699
Query: 1340 NTCIRLPFRT----KFSE--GIAMN-----NIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT K SE G N ++VD FS L++F + + + +
Sbjct: 1700 GTLFRLSFRTQQEAKLSEVSGTCYNTADIYSLVDEFSLCGHRLIIFTQNVNSMHLKYLKI 1759
Query: 1387 ------LNDSLVVIRKKI 1398
L V+I+K +
Sbjct: 1760 EETNPALAQDTVIIKKTL 1777
>gi|297693647|ref|XP_002824122.1| PREDICTED: LOW QUALITY PROTEIN: sacsin [Pongo abelii]
Length = 4576
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2536 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2594
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2595 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2646
Score = 47.0 bits (110), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1466 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1524
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1525 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1582
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1583 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1637
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1638 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1697
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1698 EETNPSLAQDTVIIKKK 1714
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 104 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 162
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 163 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 205
>gi|163659918|ref|NP_055178.3| sacsin [Homo sapiens]
gi|122066060|sp|Q9NZJ4.2|SACS_HUMAN RecName: Full=Sacsin; AltName: Full=DnaJ homolog subfamily C member
29; Short=DNAJC29
Length = 4579
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649
Score = 47.0 bits (110), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717
Score = 43.9 bits (102), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 107 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 166 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208
>gi|271970078|ref|YP_003344274.1| hypothetical protein [Streptosporangium roseum DSM 43021]
gi|270513253|gb|ACZ91531.1| hypothetical protein Sros_8899 [Streptosporangium roseum DSM 43021]
Length = 1043
Score = 49.7 bits (117), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 64/153 (41%), Gaps = 9/153 (5%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
++EL QNA D V L L+E + N A I L + S K+
Sbjct: 43 RLIVELAQNAADAALRAGVPGRLRLSLREGVLSAANTGAPVDAAGIEGLSTLRVSGKRDE 102
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL--PTLVPPF-NIDMFC 1324
S G GR G+GF +V VTD P I S G V + + V P L + D
Sbjct: 103 S-GMAGRFGVGFAAVVSVTDEPVIGSLGSGVVRWSRDQTTALVAAEPALARELTDRDRHV 161
Query: 1325 RLL----SKDPVQLESKCWNTCIRLPFRTKFSE 1353
LL PV++ ++T +RLP R + +E
Sbjct: 162 PLLRLPFPAGPVEVPG-GFDTLVRLPLRDRAAE 193
>gi|390463960|ref|XP_002806928.2| PREDICTED: LOW QUALITY PROTEIN: sacsin [Callithrix jacchus]
Length = 5004
Score = 49.7 bits (117), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2964 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 3022
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 3023 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 3074
Score = 44.3 bits (103), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 53/123 (43%), Gaps = 19/123 (15%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1952 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 2010
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 2011 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 2068
Query: 1305 GQI 1307
I
Sbjct: 2069 NHI 2071
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 590 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 648
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 649 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 691
>gi|348519944|ref|XP_003447489.1| PREDICTED: sacsin-like [Oreochromis niloticus]
Length = 301
Score = 49.7 bits (117), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 92/236 (38%), Gaps = 47/236 (19%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG-----IVVLNNEQGFSA 1250
Y L EL+QNADD I+ E T + +E G + N F+
Sbjct: 31 YPDGGQILKELIQNADDARATEVVFIHDERSYGTESIWTEELGKYQGPALYAYNNAAFTV 90
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI------------------H 1292
E+ + + G S K+ + +GR GIGF SV+ +TD P I
Sbjct: 91 EDWQGIQMAGRSVKRDDPSK-VGRFGIGFNSVYHITDVPSILSSEHLGMMDPQEKIFGER 149
Query: 1293 SNGFHVKFDTSEGQ-IGFVLPTLVPPFN--IDMFCRLLSKDPVQLESKCWN-TCIRLPFR 1348
S GF D SE Q + + PF + + C V +E + +N T R P R
Sbjct: 150 SGGFRWSLDDSEDQEVLLNMHHQFQPFRDIVSLVCE-HGWSKVVMEDQHFNGTIFRFPLR 208
Query: 1349 TKFSEGIAMNNIV--DMFSDLHPSL-------LLFLHRLQCIMFRNMLNDSLVVIR 1395
+ SE +N+ D ++L S LLFL + + ++ D V R
Sbjct: 209 NEASE--ISDNLYDSDKVTELFDSFIADADLSLLFLKNVTSVSLLHISEDGAVNTR 262
>gi|403255390|ref|XP_003920419.1| PREDICTED: sacsin [Saimiri boliviensis boliviensis]
Length = 4696
Score = 49.7 bits (117), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2656 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2714
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 2715 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2766
Score = 46.2 bits (108), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1586 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1644
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1645 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1702
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1703 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1757
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1758 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1817
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1818 EESNPSLAQDTVIIKKK 1834
Score = 43.5 bits (101), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 224 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAI-FT 282
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 283 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 325
>gi|76631591|ref|XP_586753.2| PREDICTED: sacsin [Bos taurus]
gi|297481151|ref|XP_002691909.1| PREDICTED: sacsin [Bos taurus]
gi|296481777|tpg|DAA23892.1| TPA: hypothetical protein BOS_12530 [Bos taurus]
Length = 4575
Score = 49.7 bits (117), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2534 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2592
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ G+GF SV+ +TD P I N FD
Sbjct: 2593 EDDVRGIQNLGRGTKEGNPCK-TGQYGVGFNSVYHITDCPSFISGNDILCIFD 2644
Score = 44.3 bits (103), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 104 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTQTLWSKDMAQYQGPALYVYNNAV-FT 162
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 163 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 205
Score = 42.7 bits (99), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 19/123 (15%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1464 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1522
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1523 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPVIMSREFMIMFDPNI 1580
Query: 1305 GQI 1307
I
Sbjct: 1581 NHI 1583
>gi|448307818|ref|ZP_21497706.1| hypothetical protein C494_08772 [Natronorubrum bangense JCM 10635]
gi|445594953|gb|ELY49079.1| hypothetical protein C494_08772 [Natronorubrum bangense JCM 10635]
Length = 863
Score = 49.7 bits (117), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 89/209 (42%), Gaps = 38/209 (18%)
Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESG---IVVLNNEQGFSAENIRA-------- 1255
++F LE +QNADD E + Q G + +LNN + F+ +
Sbjct: 6 TYFPLEFIQNADD----EGASAVRFQVRQIDGDWCLEILNNGRKFTDSALEGQDRDTDDV 61
Query: 1256 ------LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI-G 1308
LC G S K +IG G+GFKS+F ++ E+HS FH FD + G
Sbjct: 62 KDDVTGLCAAGVSPKHPRD--HIGFIGVGFKSIFEISQCVEVHSGQFHFSFDRERAEKHG 119
Query: 1309 FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNT--CIRLPFRTKFSEGIAMNNIVDMFSD 1366
+P V P+ + ++ P +++ + T +RL R G + + ++F
Sbjct: 120 DEIPWRVVPWECNATKE--TEVPAEIDGTEYTTRFVVRLDER-----GRELLSKDELFEP 172
Query: 1367 LHPS-----LLLFLHRLQCIMFRNMLNDS 1390
LH + LFL ++ + + +D+
Sbjct: 173 LHSENIDRRVFLFLKDVRELSIEDHYSDN 201
>gi|374321716|ref|YP_005074845.1| HSP90 family protein [Paenibacillus terrae HPL-003]
gi|357200725|gb|AET58622.1| HSP90 family protein [Paenibacillus terrae HPL-003]
Length = 615
Score = 49.7 bits (117), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 58/118 (49%), Gaps = 21/118 (17%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-----------ENVEPTLTFILQESG- 1238
LG + L+ LYS F+ E++QNA D I ++ E +T L SG
Sbjct: 13 LGGMIDILANHLYSSPRVFIREVLQNATDAITARKELGETAAESKSFEGKVTVELSGSGE 72
Query: 1239 ---IVVLNNEQGFSAENI-RALCDVGNSTKKG-----SSAGYIGRKGIGFKSVFRVTD 1287
+++ +N G + E+I R L +G S+KKG S +IGR GIG S F V+D
Sbjct: 73 QQVLMIQDNGIGLTEEDIHRFLSIIGQSSKKGTDLLASETSFIGRFGIGLLSCFMVSD 130
>gi|383625857|ref|ZP_09950263.1| hypothetical protein HlacAJ_21134 [Halobiforma lacisalsi AJ5]
gi|448701043|ref|ZP_21699657.1| hypothetical protein C445_16934 [Halobiforma lacisalsi AJ5]
gi|445778437|gb|EMA29381.1| hypothetical protein C445_16934 [Halobiforma lacisalsi AJ5]
Length = 862
Score = 49.3 bits (116), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 52/209 (24%), Positives = 89/209 (42%), Gaps = 38/209 (18%)
Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESG---IVVLNNEQGFSAENIRA-------- 1255
++F LE +QNADD E + Q G + +LNN + F+ +
Sbjct: 6 TYFPLEFIQNADD----EGASAVRFQVRQIGGDWCLEILNNGRKFTDSALEGQDRDTDDV 61
Query: 1256 ------LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI-G 1308
LC G S K +IG G+GFKS+F ++ E+HS FH FD + G
Sbjct: 62 KDDVTGLCAAGVSPKHPRD--HIGFIGVGFKSIFEISQCVEVHSGQFHFSFDRERAEKHG 119
Query: 1309 FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNT--CIRLPFRTKFSEGIAMNNIVDMFSD 1366
+P V P+ + ++ P +++ + T +RL R G + + ++F
Sbjct: 120 DEIPWRVVPWECNATKE--TEVPTEIDGTEYTTRFVVRLDER-----GQELLSKDELFEP 172
Query: 1367 LHPS-----LLLFLHRLQCIMFRNMLNDS 1390
LH + LFL ++ + + +D+
Sbjct: 173 LHSENIDRRVFLFLKDVRELSIEDHYSDN 201
>gi|6907044|gb|AAF31263.1| sacsin [Mus musculus]
Length = 3830
Score = 49.3 bits (116), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 15/100 (15%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 1789 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1847
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
+++R + ++G TK+G+ G GIGF SV+ +TD P
Sbjct: 1848 EDDVRGIQNLGKGTKEGNPCK-TGHYGIGFNSVYHITDCP 1886
Score = 45.8 bits (107), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 102/258 (39%), Gaps = 68/258 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 719 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 777
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 778 SE-FSDSDFLNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 835
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 836 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 890
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 891 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 950
Query: 1387 ------LNDSLVVIRKKI 1398
L ++I+KK+
Sbjct: 951 EETNPSLAQDTIIIKKKV 968
>gi|224148547|ref|XP_002336671.1| predicted protein [Populus trichocarpa]
gi|222836495|gb|EEE74902.1| predicted protein [Populus trichocarpa]
Length = 1060
Score = 49.3 bits (116), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 55/126 (43%), Gaps = 26/126 (20%)
Query: 1428 RPDVKTTEIALALTLQESN-EGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSRE 1485
R D++ L +TL N E + + Y ++AFLP FI+Q DF+L SSRE
Sbjct: 26 RVDMRMEVEDLVITLAFPNGERLHRGMKYSPGIYAFLPTEMVTDFPFIIQADFILASSRE 85
Query: 1486 EVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGF-FS 1544
+ ++ WNQ +L P F+ A FV LV VHG S
Sbjct: 86 TIRWDNIWNQGILDCVPFAFIEA-----------------------FVSLVKTVHGAPAS 122
Query: 1545 GLPRMI 1550
LPRM
Sbjct: 123 SLPRMF 128
>gi|432115436|gb|ELK36851.1| Sacsin [Myotis davidii]
Length = 2206
Score = 49.3 bits (116), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 800 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 858
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
+++R + ++G TK+G+ G+ GIGF SV+ +TD P I N FD
Sbjct: 859 DDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 910
>gi|170120064|ref|XP_001891101.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164633702|gb|EDQ98250.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 208
Score = 49.3 bits (116), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 74/153 (48%), Gaps = 18/153 (11%)
Query: 1209 FLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
F+L+ +A D+I+ ++PT Q ++ N+ F E+ AL ++ S+KK +
Sbjct: 46 FVLDHRTHATDHIHHPRLKPT-----QGPALLAFNDAM-FQEEDWEALQNIHRSSKKMDT 99
Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-----TSEGQIGFVLPTLVPPF--NID 1321
+ IG+ GIGF+SV+ +TD P+I S + D G + L + ++
Sbjct: 100 SK-IGKYGIGFRSVYHITDYPQIMSGPYLAILDPQHNFARSGGVKIELSANPSTYQGHLT 158
Query: 1322 MFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSE 1353
F LL P + K + + +RLP R + SE
Sbjct: 159 TFDSLL---PSDWQGKEFTGSVVRLPLRNRESE 188
>gi|297563109|ref|YP_003682083.1| ATP-binding region ATPase domain-containing protein [Nocardiopsis
dassonvillei subsp. dassonvillei DSM 43111]
gi|296847557|gb|ADH69577.1| ATP-binding region ATPase domain protein [Nocardiopsis dassonvillei
subsp. dassonvillei DSM 43111]
Length = 1084
Score = 49.3 bits (116), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 43/93 (46%), Gaps = 1/93 (1%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
++EL QNA D V L L + N +AE + +L + STK+G
Sbjct: 51 RVVVELAQNAADAARRAGVPGRLRLALTGHRLTAANTGAPLTAEGVESLATLRASTKRGE 110
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
G GR G+GF +V ++D + S G V+F
Sbjct: 111 Q-GSAGRFGVGFSAVAALSDDVTVDSGGSAVRF 142
>gi|327269040|ref|XP_003219303.1| PREDICTED: sacsin-like [Anolis carolinensis]
Length = 4592
Score = 49.3 bits (116), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 50/105 (47%), Gaps = 25/105 (23%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ + P LQ + V NN
Sbjct: 2545 YPSEKEMLKELLQNADDAKATEICFVFDPRQHSTDRIFDDKWAP-----LQGPALCVYNN 2599
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
Q F+ ++IR + ++G TK+G+ G+ GIGF SV+ +TD P
Sbjct: 2600 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHLTDCP 2642
Score = 45.1 bits (105), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 68/178 (38%), Gaps = 51/178 (28%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E + Q S + V NN F+
Sbjct: 114 YPEGGQILKELIQNAEDAGATEVRFLYDETQYGNESLWSKDMAQYQGSALYVYNNAI-FT 172
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
E+ + ++ S KK +GR GIGF SV+ +TD P I S QIG
Sbjct: 173 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPSIFSG----------DQIGM 221
Query: 1310 VLP--TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFS 1365
+ P TL P +CWN + F E MN + D F+
Sbjct: 222 LDPHQTLFGPHE---------------SGQCWN------LKDDFKE---MNELTDQFA 255
Score = 40.8 bits (94), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 49/116 (42%), Gaps = 17/116 (14%)
Query: 1201 ELYSQDSHFLLELVQNADD---------------NIYPENVEPTLTFILQESGIVVLNNE 1245
E Y S EL+QNADD N EN+ + NN
Sbjct: 1474 EEYPSVSDIFKELLQNADDANATDCNFLIDMRRNNDIRENLLDPGMAASHGPALWSFNNS 1533
Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
+ FS + + +G S K+ + +G+ G+GF SV+ +TD P I S F + FD
Sbjct: 1534 E-FSDNDFLNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFD 1587
>gi|122066080|sp|Q9JLC8.2|SACS_MOUSE RecName: Full=Sacsin; AltName: Full=DnaJ homolog subfamily C member
29; Short=DNAJC29
Length = 4582
Score = 49.3 bits (116), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 15/100 (15%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ P LQ + V NN Q F+
Sbjct: 2541 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2599
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
+++R + ++G TK+G+ G GIGF SV+ +TD P
Sbjct: 2600 EDDVRGIQNLGKGTKEGNPCK-TGHYGIGFNSVYHITDCP 2638
Score = 45.8 bits (107), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 102/258 (39%), Gaps = 68/258 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1471 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1529
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1530 SE-FSDSDFLNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1587
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1588 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1642
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F + + + +
Sbjct: 1643 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1702
Query: 1387 ------LNDSLVVIRKKI 1398
L ++I+KK+
Sbjct: 1703 EETNPSLAQDTIIIKKKV 1720
Score = 45.4 bits (106), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + V NN F+
Sbjct: 107 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 165
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 166 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208
>gi|392843861|gb|AFM84534.1| sacsin, partial [Batrachuperus yenyuanensis]
Length = 736
Score = 49.3 bits (116), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 50/109 (45%), Gaps = 25/109 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL+QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELLQNADDAKATEICFVFDPRYHPVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
Q F+ ++IR + ++G TK+ + G+ GIGF SV+ +TD P S
Sbjct: 472 -QPFTEDDIRGIQNLGRGTKEANPCK-TGQYGIGFNSVYHITDCPSFMS 518
>gi|311747181|ref|ZP_07720966.1| hypothetical protein ALPR1_12575 [Algoriphagus sp. PR1]
gi|126578890|gb|EAZ83054.1| hypothetical protein ALPR1_12575 [Algoriphagus sp. PR1]
Length = 1880
Score = 48.9 bits (115), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 3/110 (2%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
L + + + +YS + EL+QNADD + V + F + ++++ FS
Sbjct: 222 LATTMTEIGKGMYSSKQRMIFELLQNADDAPGRDKVGFHIDF--KGDYFFIMHDGAPFSK 279
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
++ A+ G STK+ G KGIGFKSVF + I S G+ F
Sbjct: 280 SDVEAITRSGESTKRDKRK-VTGYKGIGFKSVFTDSSEVWIKSGGYQFAF 328
Score = 43.9 bits (102), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 62/126 (49%), Gaps = 12/126 (9%)
Query: 1400 GDGIIKVSCGED--KMTWFVAS-QKLRAGVIRPDVKT---TEIALALTLQES---NEGNY 1450
G G++K S D ++T+F + + I P + + TE++ +Q +E Y
Sbjct: 492 GIGLLKDSQVNDYNEITYFFTDLEGNKITTIPPKIASVSETEVSFGFLIQNGKLVSEPGY 551
Query: 1451 GPLLYQ-QPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
Q +F +LP+ + + L F++ GDFV S REE+ G++ WN++++++ V
Sbjct: 552 ATSNSQYSSLFTYLPMEDKRFKLPFLVNGDFVPDSRREEIQGDNLWNKYIMTKVAEKHVE 611
Query: 1508 AERSFC 1513
F
Sbjct: 612 TLEKFA 617
>gi|134097209|ref|YP_001102870.1| HSP90 family heat shock protein [Saccharopolyspora erythraea NRRL
2338]
gi|291005334|ref|ZP_06563307.1| HSP90 family heat shock protein [Saccharopolyspora erythraea NRRL
2338]
gi|133909832|emb|CAL99944.1| putative heat shock protein, HSP90-family [Saccharopolyspora
erythraea NRRL 2338]
Length = 984
Score = 48.9 bits (115), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 77/183 (42%), Gaps = 23/183 (12%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
L+EL QNA D + V L L + + V N + +AE + L + S K+
Sbjct: 45 RLLVELAQNAADAAGSKGV---LRVRLADGELRVANTGEPLTAEGVAGLASLRASAKREG 101
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG-QIGFVLPTLVPPFNIDMFCRL 1326
S +GR G+GF +V V++ P + S+ V F Q LP +
Sbjct: 102 S--TVGRYGVGFAAVLAVSEEPRVVSSSGSVTFSAERTRQEAERLPG-----PAEELAAR 154
Query: 1327 LSKDPV--------QLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
+ PV + + + T +RLP R+ G+ + +++ F++ LLL L L
Sbjct: 155 AGEVPVLRLVWPLEDVPPEGFATEVRLPLRS----GVDADALLESFAEQASGLLLALPAL 210
Query: 1379 QCI 1381
I
Sbjct: 211 TEI 213
>gi|340384534|ref|XP_003390766.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 3558
Score = 48.9 bits (115), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 55/362 (15%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
QH L L + +E Y + E++QNADD +Y + T + +
Sbjct: 589 QHEPLTTRLGNILRE-YKDGLTIIKEMIQNADDAGATEVNILYDDRTHSTHKLLFKGMTE 647
Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
++++N F+ E+ + + +TK IG+ G+GF SV+ +TD P S
Sbjct: 648 SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 706
Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQ-----LESKCWN-TCI 1343
+ FD + + G V P + + L++ P + + +N T
Sbjct: 707 EWLYIFDPTLKYLKGVVHNESRPGKKVQYQSKFLAQSQQMAPYEGLFGFTSTANYNGTIF 766
Query: 1344 RLPFRT---KFSEGIAMNNIVD-MFSDL---HPSLLLFLHRLQCIMFRNMLNDSLVVIR- 1395
RLPFRT + S I +++ M +DL LLLFL ++ I F ++ D L +
Sbjct: 767 RLPFRTSPSQISSTIYNEHLIQKMKNDLINNGSKLLLFLQHVKRITFSSVQGDVLSINSI 826
Query: 1396 ------KKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGN 1449
K + + S E W V+ Q+ + D + ++A L + +
Sbjct: 827 NEVNGIKHCITNFSTSNSVSE---YWLVSCQEEKLQEQGGDYQPGTASVACQLFKEG-SS 882
Query: 1450 YGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSR----------EEVDGNSPWNQWL 1497
+ + VF FLPL GL + +F + S+R E D WN+ L
Sbjct: 883 FECKAVEGCVFCFLPLSLPCTGLPVHVNANFAVMSNRSGIWTEASSGEPSDSREYWNEKL 942
Query: 1498 LS 1499
++
Sbjct: 943 MT 944
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 100/443 (22%), Positives = 157/443 (35%), Gaps = 122/443 (27%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENV----------EPTLT---FILQESGIVVLNNEQGFS 1249
Y D F+ EL+QNADD E + E L+ LQ ++V N ++ FS
Sbjct: 1563 YPFDETFVKELIQNADDAKANEMIIILDRRQHGTEKVLSENWGELQGPALLVWN-DKNFS 1621
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
+++ + +G +K+ IG+ GIGF V+ VTD P + G KF + +
Sbjct: 1622 DQDLEGIQKLGLGSKRDDDES-IGQFGIGFNVVYHVTDCPSFITRG--EKFCVFDPHCKY 1678
Query: 1310 V---------------------LPTLVPPFNIDMFC-RLLSKDPVQLESKCWNTCIRLPF 1347
V +P L F D F +LL D R P
Sbjct: 1679 VPGADAISPGRQYNIDKTFWNDMPDLSSAFLQDNFTSKLLEIDK--------GVLFRFPL 1730
Query: 1348 RTK---FSEGIAMNNIVD--MFSDLHPS-----------LLLFLHRLQCIMFRNMLNDSL 1391
R+ + +N I D + +D+ S LLLFL+ L + ++N+S
Sbjct: 1731 RSTQKLVEQSKIINEINDTVVTADVMESYLRNWVSVIKDLLLFLNHLTTFEYY-VINESG 1789
Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLR--AGVIRPDVKTTEIALALTLQESNEGN 1449
+ + I + ED + + L +++P + T + L ++ S E N
Sbjct: 1790 SIALQTKYNIHITE----EDHLKRIHLKESLSNFKSILQPCLVTYSLTLEASVNSSKESN 1845
Query: 1450 YGPLLYQQPV------------------------------------FAFLPLRTY-GLKF 1472
L QQ V F FLPL Y L
Sbjct: 1846 REVWLIQQGVGDMLDPLKDWKFVPKILPQHGIATTLIRSQNFKGQVFCFLPLPIYTNLPV 1905
Query: 1473 ILQGDFVLPSSR--------EEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAK 1524
+ G F+L S R E++D S WN +L+ + S+ ++
Sbjct: 1906 HINGKFILNSDRRSLWASTSEKLDDRSTWNN-------SLYKAIASSYVHFLKIAQSSYV 1958
Query: 1525 AASVYMSFVPLVGEVHGFFSGLP 1547
AAS Y L V ++ P
Sbjct: 1959 AASGYSKLEELFSAVDVYYGLFP 1981
>gi|392843847|gb|AFM84527.1| sacsin, partial [Melopsittacus undulatus]
Length = 736
Score = 48.9 bits (115), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 49/109 (44%), Gaps = 25/109 (22%)
Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
Y + L EL QNADD I+ E P LQ + V NN
Sbjct: 417 YPSEKEMLKELPQNADDAKATEICFVFDPRQHPVDRIFDEKWAP-----LQGPALCVYNN 471
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
Q F+ ++IR + ++G TK G+ G+ GIGF SV+ VTD P S
Sbjct: 472 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHVTDCPSFLS 518
>gi|356530237|ref|XP_003533689.1| PREDICTED: sacsin-like [Glycine max]
Length = 4760
Score = 48.9 bits (115), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 87/220 (39%), Gaps = 50/220 (22%)
Query: 1203 YSQDSHFLLELVQNADD------NIYPENVEPTLTFILQES-----GIVVLN-NEQGFSA 1250
Y + + L EL+QNADD ++ + +L +S G +L N+ F+
Sbjct: 30 YPEGTTVLKELIQNADDAGATTVSLCLDRRSHAAGSLLSDSLAQWQGPALLAFNDAVFTE 89
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
E+ ++ +G STK G A GR G+GF SV+ +TD P S + V FD G
Sbjct: 90 EDFVSISKIGGSTKHGQ-ASKTGRFGVGFNSVYHLTDLPSFVSGKYVVLFDPQ----GVY 144
Query: 1311 LPTLV---PPFNIDM---------------FCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
LP + P ID +C ++S T R P R +
Sbjct: 145 LPRVSAANPGKRIDFTGSSALSFYRDQFSPYCAF----GCDMQSPFSGTLFRFPLRNAYQ 200
Query: 1353 --------EGIAMNNIVDMFSDLHPS---LLLFLHRLQCI 1381
+ + +I MF L+ LLFL + CI
Sbjct: 201 AATSKLSRQAYSPEDISSMFVQLYEEGVLTLLFLKSVLCI 240
Score = 44.3 bits (103), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 93/238 (39%), Gaps = 55/238 (23%)
Query: 1187 QHARLG-RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI------ 1239
QH L R H L E+Y+ L E+VQNA+D E + F+L +S
Sbjct: 1383 QHEALTTRLKHIL--EMYADGPGSLFEMVQNAEDAGASE-----VIFLLDKSHYGTSSIL 1435
Query: 1240 -----------VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
+ N+ FS +++ A+ +G +K A IGR G+GF V+ TD
Sbjct: 1436 SPEMADWQGPALYCYNDSVFSPQDLYAISRIGQESKL-EKAFAIGRFGLGFNCVYHFTDI 1494
Query: 1289 PEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR-LLSKDPVQ----------LESK 1337
P S V FD + + P+ P I R +L + P Q L+
Sbjct: 1495 PMFVSGENVVMFDPHASNLPGISPSH-PGLRIKFVGRQILEQFPDQFSPLLHFGCDLQHP 1553
Query: 1338 CWNTCIRLPFRTKFSEGIA--------------MNNIVDMFSDLHPSLLLFLHRLQCI 1381
T R P RT G+A + ++ FS++ LLFL ++ I
Sbjct: 1554 FPGTLFRFPLRTA---GVASRSQIKKEAYTPEDVRSLFAAFSEVVSETLLFLRNVKSI 1608
>gi|328854067|gb|EGG03202.1| hypothetical protein MELLADRAFT_90363 [Melampsora larici-populina
98AG31]
Length = 295
Score = 48.9 bits (115), Expect = 0.036, Method: Composition-based stats.
Identities = 42/127 (33%), Positives = 54/127 (42%), Gaps = 9/127 (7%)
Query: 9 FRPVGGVGGEGTGGGQPPSQPPPPHFNPNFAFQ-NPN-IFFPNPALQLLQNLTNIPFQNP 66
+ P G + + + G P + P PH P A+ NP+ FP+ Q Q NIP Q P
Sbjct: 49 YPPPGILYPQFSQGAHNPHETPNPH-GPQHAYPVNPSHPLFPH--YQHQQPTNNIPHQQP 105
Query: 67 NFDIQNPNLFMQNANLLTQNPNLPPQQ---PPSSACNQQRSQTQHPPGASNQQRPLTQQP 123
Q PN Q+ Q P+ P QQ P Q + Q PPG +N Q
Sbjct: 106 PNSYQQPNHPYQHTTSHYQQPHHPYQQHHNPYQQPFQQPFNPYQQPPG-NNFIHSQPSQN 164
Query: 124 QSLPPQP 130
LPPQP
Sbjct: 165 HFLPPQP 171
>gi|432934233|ref|XP_004081920.1| PREDICTED: sacsin-like [Oryzias latipes]
Length = 2838
Score = 48.5 bits (114), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 107/276 (38%), Gaps = 60/276 (21%)
Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQHARLG-RALHCLSQELYSQDSHFLLELVQ 1215
+A + ESI D +I + + + K+ L R + +S YS L EL+Q
Sbjct: 1203 NAMFLHESIEED-----TDIGALAAQICKEHREELAVRIKNIISA--YSSKKDILKELIQ 1255
Query: 1216 NADDNIYPENVEPTLTFI------------------LQESGIVVLNNEQGFSAENIRALC 1257
NADD E + F+ LQ + V NN + FS E+++ +
Sbjct: 1256 NADDAEATE-----IHFVWDKRQHGKEKTFGERWNHLQGPALCVFNN-KVFSDEDLKGIQ 1309
Query: 1258 DVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG------------FHVKFDTSEG 1305
+G K+ + G G+ G+GF SV+ +TD P I + + + +
Sbjct: 1310 QLGEGGKR-ETLGKTGKFGVGFNSVYHLTDCPSILTGDKFLCISDPNMKYIETQPNNHKL 1368
Query: 1306 QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA--------- 1356
IG+ L D++ L + + E T RLP RT++S +
Sbjct: 1369 GIGYKLADTFKEMYDDVYRSFLPEQFLLEEG----TMFRLPLRTEYSAKNSRISCQEVTE 1424
Query: 1357 --MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDS 1390
M + SD L+LFL + I + N S
Sbjct: 1425 HDMRELCSALSDDPEGLILFLKNICKITVHEINNQS 1460
Score = 46.6 bits (109), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 60/274 (21%), Positives = 109/274 (39%), Gaps = 68/274 (24%)
Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
L ES+ ++FG I+ N+LK+ Y + + EL+QNA+D
Sbjct: 699 LAPESLGIEQFGQREPITTRIKNILKE----------------YDEPNDIFKELIQNAED 742
Query: 1220 N---------IYPENVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
+ E+ +P + I Q + NN+Q F+ E+ + VG+++K
Sbjct: 743 AGADTCKFMIDFREHRDPPESLIDPGMSLCQGPCLWAFNNKQ-FTTEDWENIVKVGSASK 801
Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEI---------------------HSNGFHVKFDTS 1303
+ + IG+ G+GF +V+ VTD P I H ++ D S
Sbjct: 802 E-TKVEKIGKFGLGFNTVYHVTDVPSILSGSRLLILDPNVTHLKKHIQHQTNPGIRLDLS 860
Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-------RTKFSEGIA 1356
+ +I P + C + P + + T I+LPF R++ S+ +
Sbjct: 861 QKRILDCFPGQFAQYEHIFDCNFSRQSPPKPYT---GTLIKLPFRSEKEVLRSEISKKVF 917
Query: 1357 MNNIV----DMFSDLHPSLLLFLHRLQCIMFRNM 1386
+ + D F + S LLFL + + +++
Sbjct: 918 YKHDILALRDYFRENSKSYLLFLKNINTLSLQSI 951
Score = 43.9 bits (102), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 49/118 (41%), Gaps = 13/118 (11%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG-----IVVLNNEQGFSA 1250
Y L EL+QNADD I+ E T + + G + N F+
Sbjct: 30 YPDGGQILKELIQNADDGQATEVVFIHDERRYGTESLWTNKLGKYQGPALYAYNNTAFTD 89
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
E+ + G S K + IGR GIGF SV+ +TD P I S+G D E G
Sbjct: 90 EDWERIQMAGRSGK-VNDPNKIGRFGIGFNSVYHITDVPIIFSSGHLGVMDPQENIFG 146
>gi|410897104|ref|XP_003962039.1| PREDICTED: sacsin-like [Takifugu rubripes]
Length = 3639
Score = 48.5 bits (114), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 101/240 (42%), Gaps = 47/240 (19%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG-----IVVLNNEQGFSA 1250
Y L EL+QNADD I+ + T + QE G + + N+ F+
Sbjct: 31 YPDGGQILKELIQNADDAQATEVVFIHDDRSYGTESLWTQELGQYQGPALYVYNDASFTE 90
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI--- 1307
E+ + G S K+ + +GR GIGF SV+ +TD P I S G H+ F + QI
Sbjct: 91 EDWEGIQLAGRSFKR-NDPNKVGRFGIGFNSVYHITDLPCILS-GSHLGFMDPQEQIFGE 148
Query: 1308 ---GF-----------VLPTLVPPFN-IDMFCRLLSK---DPVQLESKCWN-TCIRLPFR 1348
GF L T+ F I L+S + E + +N T R P R
Sbjct: 149 REGGFRWYMDDPQHQEYLTTMHDQFQPIRDIVSLISNKKWSSIVTEHQYFNGTLFRFPLR 208
Query: 1349 TKFSEGIAMN-----NIVDMF----SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
+ SE I+ N + D+F +D S LLFL + + +++ D V R ++
Sbjct: 209 NRVSE-ISDNLYNSGKVADLFESFITDAELS-LLFLKNVASVSLKHVTADGEVSTRLQVT 266
Score = 46.2 bits (108), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 48/102 (47%), Gaps = 15/102 (14%)
Query: 1203 YSQDSHFLLELVQNADDNIYPE----------NVEPTLT---FILQESGIVVLNNEQGFS 1249
Y L EL+QNADD E +VE T LQ + V NN + FS
Sbjct: 2163 YPSKKDILKELIQNADDAEATEIHFVWDKRQHDVEKTFGENWNQLQGPALCVFNN-KVFS 2221
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI 1291
+++ + +G K S+ G GR G+GF SV+ +TD P I
Sbjct: 2222 DADLKGIQRLGEGGKH-STPGKTGRYGVGFNSVYHLTDCPSI 2262
Score = 45.1 bits (105), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 74/184 (40%), Gaps = 44/184 (23%)
Query: 1203 YSQDSHFLLELVQNADD---------------NIYPENVEPTLTFILQESGIVVLNNEQG 1247
Y ++S EL+QNA+D PE + + Q + NNE
Sbjct: 1172 YDEESDIFKELIQNAEDAGAEQCKFLVDFRAHKYSPEELIDRDMSLCQGPCLWAFNNET- 1230
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH- 1297
F ++ + VG+++K+ + IG+ G+GF +V+ VTD P + S N H
Sbjct: 1231 FKDDDWANIVRVGSASKE-NQVEKIGKFGLGFNTVYHVTDIPSVLSGTSLLILDPNVTHL 1289
Query: 1298 -----------VKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
+K + S+ ++ P P+ C ++P T IRLP
Sbjct: 1290 KTHINNKSNPGIKLNLSQRELFKCFPGQFGPYERIFDCDFTQQNPYA------GTLIRLP 1343
Query: 1347 FRTK 1350
FRT+
Sbjct: 1344 FRTE 1347
>gi|340367671|ref|XP_003382377.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 3279
Score = 48.5 bits (114), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 47/111 (42%), Gaps = 17/111 (15%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENV----------------EPTLTFILQESGIVVLNNEQ 1246
Y D+ E +QNADD E V E T LQ + + N +
Sbjct: 1337 YPADTSIFREYIQNADDAKATEIVFVLDHRSKHPQEKLLTEGTEWASLQRVPALCIYNNR 1396
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFH 1297
F E+I+ +C +G K G +A IGR GIGF + +TD P S H
Sbjct: 1397 QFMEEDIKGICKLGRGGK-GDTADTIGRFGIGFNVSYHLTDCPTFVSFDSH 1446
>gi|402311146|ref|ZP_10830095.1| hypothetical protein HMPREF1142_1759 [Eubacterium sp. AS15]
gi|400365764|gb|EJP18812.1| hypothetical protein HMPREF1142_1759 [Eubacterium sp. AS15]
Length = 1235
Score = 48.5 bits (114), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 48/83 (57%), Gaps = 5/83 (6%)
Query: 1212 ELVQNADDNIYPENVEPTLTFILQESGIVVLNNE-QGFSAENIRALCDVGNSTKKGSSAG 1270
EL+QNA+D+I E +E ++ +S + + E +GF+ ++ A+ GNS + S
Sbjct: 190 ELLQNANDHISEEIMEVSI----DDSKLCLEYAENEGFTVKDFYAIVTEGNSGNESSLEN 245
Query: 1271 YIGRKGIGFKSVFRVTDAPEIHS 1293
GRKG GFKSV++ D EI S
Sbjct: 246 REGRKGTGFKSVYKYFDKVEISS 268
>gi|380490481|emb|CCF35984.1| hypothetical protein CH063_07653 [Colletotrichum higginsianum]
Length = 151
Score = 48.5 bits (114), Expect = 0.042, Method: Composition-based stats.
Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 20/92 (21%)
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPEN----VEPTLTFILQ--ESGIVVLNN-- 1244
R+L L Q+LYS ++ F+ E++Q ADDN Y ++ + PT + L+ +G +N
Sbjct: 53 RSLEILLQQLYSTETRFIHEIIQTADDNAYGKSATNGIPPTFSLSLKVGRTGDTNTDNAN 112
Query: 1245 ------------EQGFSAENIRALCDVGNSTK 1264
E GFS + AL D+G STK
Sbjct: 113 PGKGYILETACSEDGFSMSQMTALTDIGESTK 144
>gi|365842393|ref|ZP_09383406.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Flavonifractor plautii ATCC 29863]
gi|364575731|gb|EHM53107.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Flavonifractor plautii ATCC 29863]
Length = 605
Score = 48.5 bits (114), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 54/117 (46%), Gaps = 13/117 (11%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNI------YPENVEPTLTFILQESG-IVVLN 1243
LG L LS LY FL EL+QN D + P+ LT ++ ++ +
Sbjct: 31 LGGMLDVLSNHLYKSPDVFLRELLQNGVDAVTLRQKSQPDWNRGKLTITVEPGRRLIFRD 90
Query: 1244 NEQGFSAENI-RALCDVGNSTKKGSSAG-----YIGRKGIGFKSVFRVTDAPEIHSN 1294
N G S E I R L +G S+K G YIGR GIG S F V+D+ +H+
Sbjct: 91 NGAGLSEEGIHRFLAVIGQSSKTELVNGQIPEDYIGRFGIGLLSCFMVSDSIVVHTR 147
>gi|357400009|ref|YP_004911934.1| hypothetical protein SCAT_2418 [Streptomyces cattleya NRRL 8057 = DSM
46488]
gi|386356049|ref|YP_006054295.1| hypothetical protein SCATT_24020 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
gi|337766418|emb|CCB75129.1| conserved protein of unknown function [Streptomyces cattleya NRRL
8057 = DSM 46488]
gi|365806557|gb|AEW94773.1| hypothetical protein SCATT_24020 [Streptomyces cattleya NRRL 8057 =
DSM 46488]
Length = 1038
Score = 48.5 bits (114), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 48/212 (22%), Positives = 86/212 (40%), Gaps = 13/212 (6%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
++EL QNA D L L+E + N A + +L + S K+
Sbjct: 54 RLVVELAQNAADAAVRAGTTGRLRLTLREGVLAAANTGAPLDAAGVESLSTLRASAKRDD 113
Query: 1268 SA-GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRL 1326
+A +GR G+GF +V V+D P + V++ +E + P D R
Sbjct: 114 AAEAAVGRFGVGFSAVLAVSDEPAVIGRTGGVRWSLAEARAAAEETAADAPELADELRRR 173
Query: 1327 -----LSKDPVQLESKC---WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
L + P E ++T + LP R +E +A + + + +LLL L L
Sbjct: 174 DGHVPLLRLPFPAEGAAPEGYDTVVVLPLRDGAAEDLAHR----LLAGVDDALLLTLPGL 229
Query: 1379 QCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE 1410
++ ++ R+ + GD +++ S G+
Sbjct: 230 AEVVVETADTVRILTRRQLLEGDVVVEDSAGQ 261
>gi|347839221|emb|CCD53793.1| hypothetical protein [Botryotinia fuckeliana]
Length = 2724
Score = 48.5 bits (114), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 99/243 (40%), Gaps = 47/243 (19%)
Query: 1203 YSQDSHFLLELVQNADD------------NIY-PENVEPTLTFILQESGIVVLNNEQGFS 1249
Y L EL+QNADD N Y EN+ P L Q ++ NN + F
Sbjct: 26 YPAGGSVLRELLQNADDAQASEVKFFLDENSYGTENLPPKLA-QYQGPALLAYNNAE-FK 83
Query: 1250 AENIRALCDVGNSTK--KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF-----DT 1302
E+ ++L VGNS K G++ G GR GF SV+ TD+P I S G V+
Sbjct: 84 KEHFQSLSQVGNSLKMEDGATTGKFGR---GFNSVYNWTDSPSIVS-GHRVQILDPHHSW 139
Query: 1303 SEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT---KFSEGIA--- 1356
S G G+ ++ + + T IRLP RT GI+
Sbjct: 140 SSGGNGYNFVEHSENSSLKAHMTVYQSFIKNAKQPFPGTIIRLPLRTIEQAKESGISDRK 199
Query: 1357 -----MNNIVDMFS-DLHPSLLLFL-HRLQCIMFRNM--------LNDSLVVIRKKIVGD 1401
M N+++ F+ D LLF+ H ++ + N+ +N V + K V D
Sbjct: 200 ITAQEMANVLEKFTEDFGKEGLLFMKHVIKISVISNITGLIEIEVMNKKAVQLNKTKVND 259
Query: 1402 GII 1404
I+
Sbjct: 260 AIV 262
>gi|408827281|ref|ZP_11212171.1| hypothetical protein SsomD4_08878 [Streptomyces somaliensis DSM
40738]
Length = 1588
Score = 48.1 bits (113), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 62/155 (40%), Gaps = 24/155 (15%)
Query: 1210 LLELVQNADDNIYPENVEP--TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
L ELVQNA D + EP + +L E+ + N + E + + S K+G
Sbjct: 50 LFELVQNAADEM---RDEPGCRIHVVLTETHLYCANQGNPVTPEGAETILRMSMSRKRG- 105
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS--------------EGQIGFVLPT 1313
G IGR G+G KSV VTD PE S FD + F P
Sbjct: 106 --GQIGRFGVGVKSVLVVTDTPEFFSRTGSFGFDRTWSYESIRNVPGVRERHGDDFEAPV 163
Query: 1314 LVPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPF 1347
L +DM L +D V E W T +RLP
Sbjct: 164 LRMARELDMTAERL-RDRVLDELLEWATTVVRLPL 197
>gi|379732169|ref|YP_005324365.1| hypothetical protein SGRA_4066 [Saprospira grandis str. Lewin]
gi|378577780|gb|AFC26781.1| hypothetical protein SGRA_4066 [Saprospira grandis str. Lewin]
Length = 1842
Score = 48.1 bits (113), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 9/86 (10%)
Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
+QD + E +QNA D+ + I E +VLNN + F+ ++++AL + T
Sbjct: 62 AQDEQAIYEFLQNAADSQASD-----CAVIYDEQYFMVLNNGRAFTDKDLKALLNSFQGT 116
Query: 1264 K----KGSSAGYIGRKGIGFKSVFRV 1285
K K + G IGR GIGFK +R+
Sbjct: 117 KADKSKAENCGKIGRYGIGFKLAYRL 142
>gi|152967525|ref|YP_001363309.1| hypothetical protein Krad_3582 [Kineococcus radiotolerans SRS30216]
gi|151362042|gb|ABS05045.1| conserved hypothetical protein [Kineococcus radiotolerans SRS30216]
Length = 985
Score = 48.1 bits (113), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSA--GYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
++ N AE ++ L + S K G + G +GR G+GF +V VTDAP I S+
Sbjct: 35 LLAANTGAPLDAEGVQGLATLRASAKAGPAGAPGVVGRFGVGFAAVLAVTDAPAIRSSTG 94
Query: 1297 HVKFDTSE 1304
V+F +E
Sbjct: 95 AVRFSRTE 102
>gi|424843792|ref|ZP_18268417.1| hypothetical protein SapgrDRAFT_3267 [Saprospira grandis DSM 2844]
gi|395321990|gb|EJF54911.1| hypothetical protein SapgrDRAFT_3267 [Saprospira grandis DSM 2844]
Length = 1840
Score = 48.1 bits (113), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 9/86 (10%)
Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
+QD + E +QNA D+ + I E +VLNN + F+ ++++AL + T
Sbjct: 62 AQDEQAIYEFLQNAADSQASD-----CAVIYDEQYFMVLNNGRAFTDKDLKALLNSFQGT 116
Query: 1264 K----KGSSAGYIGRKGIGFKSVFRV 1285
K K + G IGR GIGFK +R+
Sbjct: 117 KADKSKAENCGKIGRYGIGFKLAYRL 142
>gi|224083922|ref|XP_002307173.1| predicted protein [Populus trichocarpa]
gi|222856622|gb|EEE94169.1| predicted protein [Populus trichocarpa]
Length = 2870
Score = 48.1 bits (113), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 57/117 (48%), Gaps = 13/117 (11%)
Query: 1203 YSQDSHFLLELVQNADD------NIYPENVEPTLTFILQES-----GIVVLN-NEQGFSA 1250
Y + + L EL+QNADD + + + T +L S G +L+ N+ F+
Sbjct: 35 YPEGTTVLKELIQNADDAGATVVRLCLDRRHHSTTSLLSPSLSAFQGPALLSYNDAVFTE 94
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
E+ ++ +G S K + A GR G+GF SV+ +TD P S + V FD G +
Sbjct: 95 EDFTSISRIGGSVKH-NQATKTGRFGVGFNSVYHLTDLPSFVSGNYIVMFDPQGGYL 150
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 68/248 (27%), Positives = 96/248 (38%), Gaps = 59/248 (23%)
Query: 1187 QHARLG-RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFIL----------- 1234
QH L R H L E+Y+ L ELVQNA+D E + F+L
Sbjct: 1398 QHEALTTRLKHIL--EMYADGPGILFELVQNAEDAGASE-----VIFLLDKTQYGTSSVL 1450
Query: 1235 -------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
Q + NN FS++++ A+ +G +K IGR G+GF V+ TD
Sbjct: 1451 SPEMADWQGPALYCFNNSV-FSSQDLYAISRIGQESKLEKPFA-IGRFGLGFNCVYHFTD 1508
Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR-LLSKDPVQ----------LES 1336
P S V FD + + P+ P I R +L + P Q L+
Sbjct: 1509 IPTFVSGENVVMFDPHACNLPGISPSH-PGLRIKFTGRKILEQFPDQFSPFLHFGCDLQH 1567
Query: 1337 KCWNTCIRLPFRTKF--------SEGIAMNNIVDM---FSDLHPSLLLFLHRLQCIMFRN 1385
T R P R+ EG A +++ + FS + LLFL RN
Sbjct: 1568 PFPGTLFRFPLRSSTVARRSLIKKEGYAPEDVMSLFTSFSGVVSDALLFL--------RN 1619
Query: 1386 MLNDSLVV 1393
+ N SL V
Sbjct: 1620 VKNISLFV 1627
>gi|156402329|ref|XP_001639543.1| predicted protein [Nematostella vectensis]
gi|156226672|gb|EDO47480.1| predicted protein [Nematostella vectensis]
Length = 4754
Score = 47.8 bits (112), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 18/137 (13%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENV-EPTLTFILQES- 1237
Q+ L R L L +E Y+ EL+QNADD +Y E E +T ++ E
Sbjct: 1267 QYEPLTRRLQRLLEE-YTDGFSVPKELIQNADDAGATEVRFLYDERANEDAMTCLISEGM 1325
Query: 1238 ------GIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI 1291
+ V NN F+ + + + + +TK+ S G IGR G+GF +V+ +TD P +
Sbjct: 1326 KECQGPALWVYNNAV-FTDADFQNITKLNGATKEEDS-GKIGRFGLGFNAVYNITDVPSL 1383
Query: 1292 HSNGFHVKFDTSEGQIG 1308
S V FD +G
Sbjct: 1384 LSRNNLVIFDPHTTHLG 1400
Score = 44.3 bits (103), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 55/129 (42%), Gaps = 30/129 (23%)
Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADD------------------NIYPENV 1226
+K RL R L E Y D L EL+QNADD I+ +
Sbjct: 2307 EKLTTRLKRIL-----EGYPCDGAILKELLQNADDAKASKICFIKDLRRHPSERIFEHSW 2361
Query: 1227 EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVT 1286
EP LQ + V N+ F+ ++I + +G +K G G+ G+GF +V+ +T
Sbjct: 2362 EP-----LQGPALCVYNDSP-FTTQDIEGIQRLGEGSK-GDDPNKTGQYGVGFNAVYHMT 2414
Query: 1287 DAPEIHSNG 1295
D P S G
Sbjct: 2415 DCPSFVSKG 2423
>gi|345850264|ref|ZP_08803264.1| hypothetical protein SZN_11043 [Streptomyces zinciresistens K42]
gi|345638310|gb|EGX59817.1| hypothetical protein SZN_11043 [Streptomyces zinciresistens K42]
Length = 1040
Score = 47.8 bits (112), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 49/208 (23%), Positives = 82/208 (39%), Gaps = 36/208 (17%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
++EL QNA D V L L+E + N A + +L + S K+ +
Sbjct: 53 RLVVELAQNAADAAGRAGVPGRLRLTLREGVLFAANTGAPLDAAGVESLSTLRASAKRDT 112
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF----------DTSEGQIGFV------- 1310
+ +GR G+GF SV VTD P + V++ +T+ G G
Sbjct: 113 DS--VGRFGVGFASVLAVTDEPAVVGRHGGVRWALAEARDLALETAGGSAGLGEEIRRRD 170
Query: 1311 --LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLH 1368
+P L PF + + DP ++T + LP R + G+A + +
Sbjct: 171 GHVPLLRLPFAAEGT----APDP-------YDTVVILPLRDTAAAGLAER----LLRAVD 215
Query: 1369 PSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
+LLL L L+ ++ + V R+
Sbjct: 216 DALLLALPGLEEVVIEVGDGEPRTVRRR 243
>gi|326430063|gb|EGD75633.1| hypothetical protein PTSG_06697 [Salpingoeca sp. ATCC 50818]
Length = 5949
Score = 47.8 bits (112), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 86/204 (42%), Gaps = 30/204 (14%)
Query: 1203 YSQDSHFLLELVQNADD-------------NIYPENVEPTLTFILQESGIVVLNNEQGFS 1249
Y + + + EL+QNADD E + T Q ++ N+ FS
Sbjct: 30 YPEGTSVIKELIQNADDAEATCVRIALDKRQFGTEKLRFPDTGKFQGPALLCYNDSV-FS 88
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG- 1308
E+ ++ ++ S+K+ + +GR G+GF SV+ +TD P S ++V FD + +G
Sbjct: 89 DEDFTSIKNICRSSKQADTT-KVGRFGLGFNSVYHITDLPSFVSRNYYVVFDPHKSSLGT 147
Query: 1309 ------FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGI------- 1355
F D + + + + + T RLP RT SE
Sbjct: 148 EGRRVKFTSKHERDGKYEDTWATMRAFG-CTMTADFKGTLFRLPLRTAASELSSSCHTID 206
Query: 1356 AMNNIVDMFSDLHPSLLLFLHRLQ 1379
+M +++D + P +LLFL ++
Sbjct: 207 SMLSLLDTLAAEAPHMLLFLRHVR 230
>gi|340367673|ref|XP_003382378.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
Length = 992
Score = 47.8 bits (112), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 51/105 (48%), Gaps = 17/105 (16%)
Query: 1203 YSQDSHFLLELVQNADD------------NIYPEN--VEPTLTFILQESGIVVLNNEQGF 1248
Y L EL+QNADD YPEN + PTL + Q ++ NN F
Sbjct: 31 YPDGGQILKELIQNADDAGATVRKFCLDTKQYPENDLMFPTLN-LFQGPALLAYNNAI-F 88
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + NS K+ ++A +GR G+GF SVF +TD P + S
Sbjct: 89 QEEDWEHFGRIENSGKE-TAALKVGRFGVGFLSVFHITDLPMVIS 132
>gi|195997653|ref|XP_002108695.1| predicted protein [Trichoplax adhaerens]
gi|190589471|gb|EDV29493.1| predicted protein [Trichoplax adhaerens]
Length = 1421
Score = 47.8 bits (112), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 89/194 (45%), Gaps = 41/194 (21%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS----- 1262
+ EL++NA+D+ ++ + ++E + V + + F N+ A+C GN+
Sbjct: 42 QIIFELLRNANDH---SDISLSFKIEIKERYLSVSHTGKAFDLGNVEAIC--GNTQSKSN 96
Query: 1263 --TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--------SEGQIGFVLP 1312
++ YIG +GFKSVF +DA I S G+ +FD+ S+ + P
Sbjct: 97 SKKNNSNNNDYIG---MGFKSVFVFSDAAFIKSGGYTFRFDSSYQKWNKPSQSDLASDYP 153
Query: 1313 TLVPPF---NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
V P N D+ R+ + P +NT + F+ K E I D+F DL
Sbjct: 154 WQVIPIWTENKDLPHRIANVPP------GYNT--QFLFKLKKDEIIE-----DLFQDLKK 200
Query: 1370 SLL--LFLHRLQCI 1381
SLL L+L +++ I
Sbjct: 201 SLLFMLYLRKIKMI 214
>gi|302844699|ref|XP_002953889.1| hypothetical protein VOLCADRAFT_118510 [Volvox carteri f.
nagariensis]
gi|300260701|gb|EFJ44918.1| hypothetical protein VOLCADRAFT_118510 [Volvox carteri f.
nagariensis]
Length = 5796
Score = 47.8 bits (112), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 50/111 (45%), Gaps = 13/111 (11%)
Query: 1203 YSQDSHFLLELVQNADD------------NIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
Y + L+EL+QNADD YP + + + + +++ N+ FS
Sbjct: 1759 YPEGPGVLMELLQNADDAGATSLELLLDDTTYPASSILSPAMAVWQGPALLVANDAVFSP 1818
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
+ + +G +K S IGR G+GF +V+ TD P S + V FD
Sbjct: 1819 ADFANISRIGQDSKA-SRPTSIGRFGLGFNAVYHFTDLPCFVSGDYLVMFD 1868
>gi|390366753|ref|XP_003731107.1| PREDICTED: sacsin-like [Strongylocentrotus purpuratus]
Length = 292
Score = 47.8 bits (112), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 38/62 (61%), Gaps = 2/62 (3%)
Query: 1234 LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
LQ + V NN + F+ ++IR + ++G +K G IGR G+GF +V+++TD P S
Sbjct: 4 LQGPALCVFNN-KSFTDDDIRGIQNLGEGSK-ADHMGKIGRYGVGFNAVYQLTDCPSFIS 61
Query: 1294 NG 1295
NG
Sbjct: 62 NG 63
>gi|254443594|ref|ZP_05057070.1| hypothetical protein VDG1235_1830 [Verrucomicrobiae bacterium DG1235]
gi|198257902|gb|EDY82210.1| hypothetical protein VDG1235_1830 [Verrucomicrobiae bacterium DG1235]
Length = 1483
Score = 47.8 bits (112), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 9/118 (7%)
Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLN 1243
+ + A+ ++ L + + S+ H + EL+QNADD T I ++E V +
Sbjct: 23 QSESAQYRNSVRQLFETVASRVQHVVSELLQNADD------AGATCARIEIEEDNFVFRH 76
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
+ F E + ALC S+K + G +GIGFKS F + D+ + + ++FD
Sbjct: 77 DGADFEPEQLEALCSFAVSSKH--TIKTTGFRGIGFKSAFSIGDSVYFITPSYQIRFD 132
>gi|297737937|emb|CBI27138.3| unnamed protein product [Vitis vinifera]
Length = 3960
Score = 47.4 bits (111), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 57/125 (45%), Gaps = 19/125 (15%)
Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL---QESGIVVLNNEQGFS 1249
Y + + L EL+QNADD + E L+ L Q ++ NN + F+
Sbjct: 37 YPEGTTVLKELIQNADDAGATKVCLCLDRRVHGSESLLSEKLAQWQGPALLAYNNAE-FT 95
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
E+ ++ +G S K G A GR G+GF SV+ +TD P S + V FD G
Sbjct: 96 EEDFVSISRIGGSNKHGQ-AWKTGRFGVGFNSVYHLTDLPSFVSGKYVVLFDPQ----GV 150
Query: 1310 VLPTL 1314
LP +
Sbjct: 151 YLPNV 155
>gi|124001984|ref|ZP_01686838.1| hypothetical protein M23134_01824 [Microscilla marina ATCC 23134]
gi|123992450|gb|EAY31795.1| hypothetical protein M23134_01824 [Microscilla marina ATCC 23134]
Length = 1914
Score = 47.4 bits (111), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 24/115 (20%)
Query: 1171 GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTL 1230
G P+I+N S LK +QD + E +QNA D
Sbjct: 47 GAVPDIANFLSTPLK-----------------IAQDDQAIYEFIQNAADA-----QSSLF 84
Query: 1231 TFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRV 1285
ES V +NN + FS +++ +L ++ STKK S IGR GIGFK +R+
Sbjct: 85 NIFYDESYFVAINNGKPFSRDDLTSLLNIAASTKK--SCEKIGRFGIGFKLAYRL 137
>gi|303284333|ref|XP_003061457.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456787|gb|EEH54087.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 5511
Score = 47.4 bits (111), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 54/112 (48%), Gaps = 18/112 (16%)
Query: 1203 YSQDSHFLLELVQNADD--------------NIYPENVEPTLTFILQESGIVVLNNEQGF 1248
Y + + L ELVQNADD + V P L + ++++N+ F
Sbjct: 32 YPEGTSILKELVQNADDAGATVVKVCLDARSHGTSTLVSPALAPF--QGPALLVHNDSVF 89
Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
+ + ++ VG+S K+ + G GR G+GF SV+ +TD P S G HV F
Sbjct: 90 AEADFVSISRVGDSVKR-TQVGKTGRFGVGFNSVYHLTDVPSFVS-GRHVVF 139
>gi|357166714|ref|XP_003580814.1| PREDICTED: sacsin-like [Brachypodium distachyon]
Length = 4693
Score = 47.4 bits (111), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 13/111 (11%)
Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL-QESGIVVL-NNEQGFSA 1250
Y + + L EL+QNADD + L L Q G +L +N+ F+
Sbjct: 32 YPEGTTALRELIQNADDAGASAVRLCLDRRSHGARSLLAPALAQWQGPALLAHNDAVFTD 91
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
E+ ++ +G+S KK S A GR G+GF SV+ +TD P S + V FD
Sbjct: 92 EDFASISRIGDS-KKVSQAWKTGRFGVGFNSVYHLTDLPSFVSGKYVVMFD 141
>gi|154299794|ref|XP_001550315.1| hypothetical protein BC1G_11523 [Botryotinia fuckeliana B05.10]
Length = 2580
Score = 47.4 bits (111), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 51/106 (48%), Gaps = 20/106 (18%)
Query: 1203 YSQDSHFLLELVQNADD------------NIY-PENVEPTLTFILQESGIVVLNNEQGFS 1249
Y L EL+QNADD N Y EN+ P L Q ++ NN + F
Sbjct: 26 YPAGGSVLRELLQNADDAQASEVKFFLDENSYGTENLPPKLA-QYQGPALLAYNNAE-FK 83
Query: 1250 AENIRALCDVGNSTK--KGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ ++L VGNS K G++ G GR GF SV+ TD+P I S
Sbjct: 84 KEHFQSLSQVGNSLKMEDGATTGKFGR---GFNSVYNWTDSPSIVS 126
>gi|255572407|ref|XP_002527141.1| protein binding protein, putative [Ricinus communis]
gi|223533501|gb|EEF35243.1| protein binding protein, putative [Ricinus communis]
Length = 4704
Score = 47.4 bits (111), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 52/112 (46%), Gaps = 15/112 (13%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVV---LNNEQG----------FS 1249
Y + + L EL+QNADD V L L S ++ L+ QG F+
Sbjct: 34 YPEGTTVLKELIQNADD-AGATKVRLCLDRRLHRSDSIISTSLSQWQGPALLAYNDAVFT 92
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
E+ ++ +G S K G A GR G+GF SV+ +TD P S + V FD
Sbjct: 93 EEDFVSISRIGGSAKHGQ-AWKTGRFGVGFNSVYHLTDLPSFVSGKYIVLFD 143
Score = 43.9 bits (102), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 67/261 (25%), Positives = 101/261 (38%), Gaps = 53/261 (20%)
Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
L+ E+ FGL + + + A R H L E+Y+ L ELVQNA+D
Sbjct: 1368 LLAENADSMNFGL-----SGAAEAFGQHEALTTRLKHIL--EMYADGPGILFELVQNAED 1420
Query: 1220 NIYPENVEPTLTFIL---QESGIVVLN--------------NEQGFSAENIRALCDVGNS 1262
E + F+L Q VL+ N+ FS +++ A+ +G
Sbjct: 1421 AGASE-----VIFLLDKTQYGTSSVLSPEMADWQGPALYCFNDSVFSPQDLYAISRIGQE 1475
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
+K IGR G+GF V+ TD P S V FD + + P+ P I
Sbjct: 1476 SKLEKPLA-IGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSH-PGLRIKF 1533
Query: 1323 FCR-LLSKDPVQ----------LESKCWNTCIRLPFRTKF--------SEGIAMNNIVDM 1363
R +L + P Q L+ T R P R+ EG A +++ +
Sbjct: 1534 AGRKILEQFPDQFSPFLHFGCDLQHSFPGTLFRFPLRSSAIALRSQIKKEGYAPEDVMSL 1593
Query: 1364 ---FSDLHPSLLLFLHRLQCI 1381
FS + LLFL ++ I
Sbjct: 1594 FESFSGIVSDALLFLRNVKSI 1614
>gi|395328407|gb|EJF60799.1| hypothetical protein DICSQDRAFT_181063 [Dichomitus squalens LYAD-421
SS1]
Length = 2845
Score = 47.0 bits (110), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 70/160 (43%), Gaps = 21/160 (13%)
Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
N+ FS + A+ V S+K G + IG+ G+G ++ + +TD P+ S G+HV FD
Sbjct: 264 NDAVFSDADWEAIRTVSESSKTGDTEK-IGKHGLGSRAYYHLTDNPQYLSGGYHVVFDPH 322
Query: 1304 E---GQIGF--VLPTLVP--PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKF----- 1351
GQ G+ L + P ++ F LL P + T +RL R
Sbjct: 323 RWAFGQGGWKQSLEEIARDYPDQLEPFSSLL---PHGDRTSLPGTIVRLALRPATAASRI 379
Query: 1352 -SEGIAMNNIVDMFSDLHPS----LLLFLHRLQCIMFRNM 1386
S+ + + I D+ D LLFL L CI R +
Sbjct: 380 SSKKVTADEIHDLLLDFVRKELDVALLFLSHLTCIEVREI 419
>gi|51476565|emb|CAH18265.1| hypothetical protein [Homo sapiens]
Length = 2015
Score = 47.0 bits (110), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1046 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1104
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
Q FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1105 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1162
Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
I P PF ID+F C+L P+ +E+ +N
Sbjct: 1163 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1217
Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
T RL FRT+ ++ + ++VD FS L++F ++ + + +
Sbjct: 1218 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1277
Query: 1387 ------LNDSLVVIRKK 1397
L V+I+KK
Sbjct: 1278 EETNPSLAQDTVIIKKK 1294
>gi|156392678|ref|XP_001636175.1| predicted protein [Nematostella vectensis]
gi|156223275|gb|EDO44112.1| predicted protein [Nematostella vectensis]
Length = 4477
Score = 47.0 bits (110), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 108/474 (22%), Positives = 173/474 (36%), Gaps = 115/474 (24%)
Query: 1203 YSQDSHFLLELVQNADDNIYPE----------NVEPTLT---FILQESGIVVLNNEQGFS 1249
Y +D L E++Q+ADD E N + L+ LQ I V N+ F+
Sbjct: 2489 YPRDEGILKEMLQSADDAKATEFHVIYDPRQHNKKRVLSEGWKDLQGPAICVYND-CPFT 2547
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF--------- 1300
E + + +G + K S K GF +++ +TD P ++ +
Sbjct: 2548 EERMESFQQIGRTAKSEESD-----KAFGFNTIYHLTDCPSFITDNSTICILDPHATHAP 2602
Query: 1301 DTSEGQIGFVLPTLVPPFN---IDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS----- 1352
D + G +L L F +D+F L +D LE T +R P RTK +
Sbjct: 2603 DVTRETPGRILQPLDDEFRENFVDVFKGYL-EDFFNLEGA---TMLRFPLRTKINAEKSE 2658
Query: 1353 ------EGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV-------------- 1392
E + ++N++ F LLFL+ ++ + + N LV
Sbjct: 2659 ISKLSFEDVDVDNLLKTFEHDVKDSLLFLNSVRKVTISRIENGRLVKKYQASATLPDESE 2718
Query: 1393 ---------VIRKKIVGDGIIK--------VSCGEDKMT--WFV----ASQKLRAGVIRP 1429
VIR K V I V ++ T W V +++ A + P
Sbjct: 2719 EQLQTLSSEVIRSKGVPTADISYFGVTYKLVISDSERSTEEWLVHQCLGAKEYVAAPLVP 2778
Query: 1430 DVKT------TEIALALTLQESNEGNYGPLLYQQP-------VFAFLPLRT-YGLKFILQ 1475
D + +A +++QE G + Q+P F+ LPL L +
Sbjct: 2779 DGRALGFLPRAGVAAKVSVQEDVLGQHVLRALQEPFTDKTCRAFSSLPLHVETRLPVQVS 2838
Query: 1476 GDFVLPSSR------EEVDGNSPWNQWLLSEFPA-----LFVSAERSFCDL----PCF-- 1518
G F L +R E+ D S WN+++ +E A L +SA + P F
Sbjct: 2839 GQFALDVARRKLCDDEKKDCRSRWNEFMKTEVIAPAYVHLLMSARHKITGIQEGNPWFFN 2898
Query: 1519 -RENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAP 1571
R S Y S P VH + L + L +NC ++ + Q AP
Sbjct: 2899 TRSGVKSGLSWYHSLFPDPHSVHSDWKELSLAVFRCLENANCPVIPVIHGQEAP 2952
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 68/165 (41%), Gaps = 43/165 (26%)
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
ENI+ L ++T GS+ GR GIGF SVF +TD P S F V FD +G V
Sbjct: 1538 ENIKML---ASTTTTGSAKR--GRPGIGFCSVFNITDIPSFISREFIVMFDPQTTYLGNV 1592
Query: 1311 L-----PTL-----------------VPPFNIDMFCRLLSKDPVQLESKCWN-TCIRLPF 1347
+ P + P+N C +L Q + +N T RLPF
Sbjct: 1593 IKDKTRPGIRLNLKENSRAVAAFSDQFSPYNKIFGCSILE----QGKEFFYNGTLFRLPF 1648
Query: 1348 RT--KFSEGIAMNNIVD---------MFSDLHPSLLLFLHRLQCI 1381
RT + + N D +F + SLLLFL ++ I
Sbjct: 1649 RTLSQAQTSVISGNCYDESNISKLLGIFREHASSLLLFLRNVKRI 1693
>gi|320591625|gb|EFX04064.1| ino80 chromatin remodeling complex protein [Grosmannia clavigera
kw1407]
Length = 1728
Score = 47.0 bits (110), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 4/76 (5%)
Query: 1433 TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSP 1492
T+E+ +A + +S P++ +F FLP+R G F++Q DFV +SR++V +SP
Sbjct: 43 TSEVVVAFPMTDSGV----PIVKPLDLFVFLPVRHVGFNFLIQADFVTDASRQDVVADSP 98
Query: 1493 WNQWLLSEFPALFVSA 1508
N L+ FV
Sbjct: 99 RNIGLIDGIAKAFVQG 114
>gi|377562530|ref|ZP_09791921.1| hypothetical protein GOOTI_256_00540 [Gordonia otitidis NBRC 100426]
gi|377520356|dbj|GAB37086.1| hypothetical protein GOOTI_256_00540 [Gordonia otitidis NBRC 100426]
Length = 892
Score = 46.6 bits (109), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 35/66 (53%), Gaps = 2/66 (3%)
Query: 1241 VLNNEQGFSAENIRALCDVGNSTKKGSSAG--YIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
V N SA +R+L + S K G SAG IGR G+GF + V D+ EI S +
Sbjct: 28 VANTGAPLSANGVRSLMALRASAKSGDSAGSGVIGRYGVGFTATATVADSVEIRSTTGSI 87
Query: 1299 KFDTSE 1304
+FD ++
Sbjct: 88 RFDRAD 93
>gi|393781484|ref|ZP_10369679.1| hypothetical protein HMPREF1071_00547 [Bacteroides salyersiae
CL02T12C01]
gi|392676547|gb|EIY69979.1| hypothetical protein HMPREF1071_00547 [Bacteroides salyersiae
CL02T12C01]
Length = 587
Score = 46.6 bits (109), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + L E G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYAGRIDVYLNEDGSLVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + YIGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDYIGRFGIGLLSCFVVTNEITVES 124
>gi|359793721|ref|ZP_09296461.1| hypothetical protein MAXJ12_29405 [Mesorhizobium alhagi CCNWXJ12-2]
gi|359250101|gb|EHK53640.1| hypothetical protein MAXJ12_29405 [Mesorhizobium alhagi CCNWXJ12-2]
Length = 1574
Score = 46.6 bits (109), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 119/313 (38%), Gaps = 38/313 (12%)
Query: 1210 LLELVQNADDNIYPENVEPT--LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
L ELVQNA D I E E + + +V N + I AL + +S K+G
Sbjct: 62 LFELVQNAADAIL-EGGEGAGRIHVKVGRDALVAANTGAPLDQDGIVALLNARSSPKRG- 119
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS------EGQIGFVLPTLVPPFNID 1321
G IGR GIGFKS+ ++ ++ S ++FD +G P +
Sbjct: 120 --GQIGRFGIGFKSLLKLGGRVDLVSRSIGLRFDPEACRVRIRTHLGLDSSARAPGMRLA 177
Query: 1322 MFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI 1381
+ +P LE W T + SE + + F+ +LFL
Sbjct: 178 EV--IDGPNPKGLE---WATTV---VSATISEAATFQRLAEEFTKFPSEFVLFLP-ANIH 228
Query: 1382 MFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTW--FVASQKLRAGVIRPDV----KTTE 1435
+ ++ + I + + G+ ++ V G ++ W F + +R R D E
Sbjct: 229 LTLDVEGGARREITRTVDGENVV-VRDGSNETRWKLFETTAAIRDPGAREDATHIQARDE 287
Query: 1436 IALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGL-KFILQGDFVLPSSREE-VDGNSPW 1493
+ LA + S P+ +AF P T L IL + L S R V G W
Sbjct: 288 VPLAWAVPLS------PIPRAGRFWAFFPTETATLTSGILNAPWKLNSDRTNLVRGE--W 339
Query: 1494 NQWLLSEFPALFV 1506
N L+SE L
Sbjct: 340 NTTLMSEAARLIA 352
>gi|255086673|ref|XP_002509303.1| predicted protein [Micromonas sp. RCC299]
gi|226524581|gb|ACO70561.1| predicted protein [Micromonas sp. RCC299]
Length = 5379
Score = 46.6 bits (109), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 50/112 (44%), Gaps = 15/112 (13%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + + L ELVQNADD E T T Q ++V N+ F
Sbjct: 30 YPEGTSILKELVQNADDAGARTVKVCLDERSHGTETLAYDGLAQFQGPSLLVFNDST-FK 88
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
+ ++ +G+S K+ + G GR G+GF SV+ +TD P S V FD
Sbjct: 89 ESDFESISRIGDSVKR-TQVGKTGRFGVGFNSVYHLTDMPSFVSGRHAVFFD 139
>gi|336319563|ref|YP_004599531.1| ATP-binding region ATPase domain protein [[Cellvibrio] gilvus ATCC
13127]
gi|336103144|gb|AEI10963.1| ATP-binding region ATPase domain protein [[Cellvibrio] gilvus ATCC
13127]
Length = 1056
Score = 46.6 bits (109), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 5/102 (4%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFIL----QESGIVVLNNEQGFSAENIRALCD 1258
Y +D ++EL QNA D V L L + + +V N +AE + AL
Sbjct: 40 YYRD-RVVVELAQNAADAAVRGGVPGRLLLRLARTDEGTTLVAANTGAPLTAEGVAALAS 98
Query: 1259 VGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
+ S K+ G +GR G+GF +V V+D + S V+F
Sbjct: 99 MRASAKRSDQPGVVGRFGVGFAAVRAVSDELSVVSTTGGVRF 140
>gi|441509455|ref|ZP_20991373.1| putative helicase [Gordonia aichiensis NBRC 108223]
gi|441446553|dbj|GAC49334.1| putative helicase [Gordonia aichiensis NBRC 108223]
Length = 1602
Score = 46.6 bits (109), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 74/184 (40%), Gaps = 28/184 (15%)
Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY-PENVEPTLTFILQESGIVVL 1242
L ++HA R +H Q Y L EL+QNA D + P + +L ++ +
Sbjct: 31 LVEEHANGERRIH---QGGYGDRQ--LFELIQNAADELRQPAFYGGRIEAVLTDTHLYCA 85
Query: 1243 NNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT 1302
N + + +G S K+G G IGR G+G KSV VT +P+ S FD
Sbjct: 86 NEGSPITPAGAETILRMGVSRKRG---GQIGRFGVGVKSVLSVTKSPQFFSTSGAFGFDA 142
Query: 1303 -----------SEGQIGFVLPTL-VPPFNIDMFCRLL------SKDPVQLESKCW-NTCI 1343
+E Q VLP L + + R L + DP+ W T +
Sbjct: 143 DWSAHKIRTAVNEAQSKQVLPVLDLTDTPVLRLARQLDPASERAADPILDALLAWATTVV 202
Query: 1344 RLPF 1347
RLP
Sbjct: 203 RLPL 206
>gi|47226310|emb|CAG09278.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3392
Score = 46.2 bits (108), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 97/236 (41%), Gaps = 57/236 (24%)
Query: 1203 YSQDSHFLLELVQNADD---------------NIYPEN-VEPTLTFILQESGIVVLNNEQ 1246
Y ++ EL+QNA+D PE ++P ++ + Q + NNE
Sbjct: 675 YDEEGDIFKELIQNAEDAGAQQCSFLVDFRTHKYNPEELIDPDMS-LCQGPCLWAFNNE- 732
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH 1297
F ++ + VG+++K+ + IG+ G+GF +V+ VTD P I S N H
Sbjct: 733 TFKDDDWTNIVRVGSASKE-TKLEKIGKFGLGFNTVYHVTDIPSILSGNRLLILDPNVTH 791
Query: 1298 VK------------FDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRL 1345
+K D S+ ++ P P++ C + P + T IRL
Sbjct: 792 LKTHIKQKSNPGIRLDLSQLRLFKCFPGQFGPYDRIFDCDFAKQKPYE------GTLIRL 845
Query: 1346 PFRT-------KFSEGIAMNNIVDMFSDL----HPSLLLFLHRLQCIMFRNMLNDS 1390
PFRT + S+ + + + F + S LLFL + + +N+ D+
Sbjct: 846 PFRTEEEALTSEISKRVYYKSDIVSFQECLAENSKSHLLFLRSISSLALQNIPKDA 901
Score = 46.2 bits (108), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 73/188 (38%), Gaps = 34/188 (18%)
Query: 1203 YSQDSHFLLELVQNADDNIYPE----------NVEPTLT---FILQESGIVVLNNEQGFS 1249
Y L EL+QNADD E VE T LQ + V NN + FS
Sbjct: 1665 YPSKKDILKELLQNADDAEATEIHFVWDKRQHGVEKTFGERWNQLQGPALCVFNN-KVFS 1723
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI------------HSNGFH 1297
+++ + +G K S+ G IG+ G+GF SV+ +TD P I +
Sbjct: 1724 NADLKGIQQLGEGGKH-STRGKIGKYGVGFNSVYHLTDCPSILTGDSLLCISDPNQKYIE 1782
Query: 1298 VKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKF---SEG 1354
D + IG+ L D++ L D L T RLP RT F S
Sbjct: 1783 CYSDKAPAGIGYKLDDKFKNMYADVYQSFLP-DRFPLHE---GTMFRLPLRTFFMATSSN 1838
Query: 1355 IAMNNIVD 1362
I+ + D
Sbjct: 1839 ISDKEVTD 1846
>gi|403739234|ref|ZP_10951766.1| putative heat shock protein [Austwickia chelonae NBRC 105200]
gi|403190895|dbj|GAB78536.1| putative heat shock protein [Austwickia chelonae NBRC 105200]
Length = 605
Score = 46.2 bits (108), Expect = 0.19, Method: Composition-based stats.
Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 32/119 (26%)
Query: 1198 LSQELYSQDSHFLLELVQNADD--------------------NIYPENVEPTLTFILQES 1237
LS LYS +L EL+QNA D I P +V P F L ++
Sbjct: 16 LSHHLYSGPRVYLRELIQNAADALTARLLLAPATAASAPPRITITPADVSPDGRFHLDDN 75
Query: 1238 GIVVLNNEQGFSAENI-RALCDVGNSTKKGSSAG----YIGRKGIGFKSVFRVTDAPEI 1291
GI G + E+I R L +G+S+K G++A +IG GIG + F V++ E+
Sbjct: 76 GI-------GLTREDIDRFLVGIGSSSKPGAAASTQETFIGHFGIGLLACFLVSETIEL 127
>gi|297808305|ref|XP_002872036.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317873|gb|EFH48295.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
Length = 4711
Score = 46.2 bits (108), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 52/112 (46%), Gaps = 15/112 (13%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-------------NEQGFS 1249
Y + + L EL+QNADD V L + SG ++ + N+ F+
Sbjct: 26 YPEGTTVLKELIQNADD-AGATKVRLCLDRRVHGSGSLLSDSLAQWQGPSLLAYNDAVFT 84
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
E+ ++ +G S K G A GR G+GF SV+ +TD P S + V FD
Sbjct: 85 EEDFVSISRIGGSGKHGQ-AWKTGRFGVGFNSVYHLTDIPSFVSGKYVVLFD 135
Score = 44.7 bits (104), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 61/146 (41%), Gaps = 28/146 (19%)
Query: 1187 QHARLG-RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFIL----------- 1234
QH L R H L E+Y+ L ELVQNA+D E +TF+L
Sbjct: 1383 QHEALTTRLKHIL--EMYADGPGILFELVQNAEDAGASE-----VTFLLDKTHYGTSSLL 1435
Query: 1235 -------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
Q + NN F+ +++ A+ +G ++K IGR G+GF V+ TD
Sbjct: 1436 SPEMADWQGPALYCFNNSV-FTQQDMYAISRIGQASKLEKPFA-IGRFGLGFNCVYHFTD 1493
Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPT 1313
P S V FD + + PT
Sbjct: 1494 IPGFVSGENIVMFDPHANHLPGISPT 1519
>gi|156359807|ref|XP_001624956.1| predicted protein [Nematostella vectensis]
gi|156211764|gb|EDO32856.1| predicted protein [Nematostella vectensis]
Length = 250
Score = 46.2 bits (108), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 18/137 (13%)
Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENV-EPTLTFILQES- 1237
Q+ L R L L +E Y+ EL+QNADD +Y E E +T ++ E
Sbjct: 107 QYEPLTRRLQRLLEE-YTDGFSVPKELIQNADDAGATEVRFLYDERANEDAMTCLISEGM 165
Query: 1238 ------GIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI 1291
+ V NN F+ + + + + +TK+ S G IGR G+GF +V+ +TD P +
Sbjct: 166 KECQGPALWVYNNAV-FTDADFQNITKLNGATKEEDS-GKIGRFGLGFNAVYNITDVPSL 223
Query: 1292 HSNGFHVKFDTSEGQIG 1308
S V FD +G
Sbjct: 224 LSRNNLVIFDPHTTHLG 240
>gi|296128608|ref|YP_003635858.1| hypothetical protein Cfla_0748 [Cellulomonas flavigena DSM 20109]
gi|296020423|gb|ADG73659.1| conserved hypothetical protein [Cellulomonas flavigena DSM 20109]
Length = 1026
Score = 46.2 bits (108), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 6/107 (5%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQ--ESGIVVL---NNEQGFSAENIRALC 1257
Y +D L+EL QNA D V L L + G VL N A +R+L
Sbjct: 40 YYRD-RVLVELAQNAADAAVRAGVAGRLLLRLATTDDGTAVLVAANTGAPLDAAGVRSLA 98
Query: 1258 DVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ S K+ + G +GR G+GF +V V D + S V+F ++
Sbjct: 99 TLRASAKRDGAPGTVGRFGVGFAAVRAVADEVTVCSTSGAVRFSLAD 145
>gi|15237223|ref|NP_197702.1| C3HC4-type RING finger domain-containing protein [Arabidopsis
thaliana]
gi|9759369|dbj|BAB09828.1| unnamed protein product [Arabidopsis thaliana]
gi|332005740|gb|AED93123.1| C3HC4-type RING finger domain-containing protein [Arabidopsis
thaliana]
Length = 4706
Score = 46.2 bits (108), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 52/112 (46%), Gaps = 15/112 (13%)
Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-------------NEQGFS 1249
Y + + L EL+QNADD V L + SG ++ + N+ F+
Sbjct: 26 YPEGTTVLKELIQNADD-AGATKVRLCLDRRVHGSGSLLSDSLAQWQGPSLLAYNDAVFT 84
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
E+ ++ +G S K G A GR G+GF SV+ +TD P S + V FD
Sbjct: 85 EEDFVSISRIGGSGKHGQ-AWKTGRFGVGFNSVYHLTDIPSFVSGKYVVLFD 135
Score = 44.7 bits (104), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 61/146 (41%), Gaps = 28/146 (19%)
Query: 1187 QHARLG-RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFIL----------- 1234
QH L R H L E+Y+ L ELVQNA+D E +TF+L
Sbjct: 1383 QHEALTTRLKHIL--EMYADGPGILFELVQNAEDAGASE-----VTFLLDKTHYGTSSLL 1435
Query: 1235 -------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
Q + NN F+ +++ A+ +G ++K IGR G+GF V+ TD
Sbjct: 1436 SPEMADWQGPALYCFNNSV-FTQQDMYAISRIGQASKLEKPFA-IGRFGLGFNCVYHFTD 1493
Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPT 1313
P S V FD + + PT
Sbjct: 1494 IPGFVSGENIVMFDPHANHLPGISPT 1519
>gi|329941389|ref|ZP_08290668.1| hypothetical protein SGM_6160 [Streptomyces griseoaurantiacus M045]
gi|329299920|gb|EGG43819.1| hypothetical protein SGM_6160 [Streptomyces griseoaurantiacus M045]
Length = 1067
Score = 46.2 bits (108), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 39/157 (24%), Positives = 65/157 (41%), Gaps = 12/157 (7%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L L++ + N A + +L + S K+ + G +GR G+GF +V V+D P
Sbjct: 75 LRLTLRDGVLAAANTGAPLDAAGVESLATLRASAKRDGAPGAVGRFGVGFAAVLAVSDEP 134
Query: 1290 EIHSNGFHVKFDTSE-----GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC---WNT 1341
+ V++ +E G P L L + P E + ++T
Sbjct: 135 AVLGRHGGVRWSLAEARDLAGDTARHSPGLGDEIRRRDGHVPLLRLPFAAEGRAPEPYDT 194
Query: 1342 CIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
+ LP R +EG+A ++D D +LLL L L
Sbjct: 195 VVVLPLRDPAAEGLA-ERLLDAVDD---ALLLALPGL 227
>gi|190894939|ref|YP_001985232.1| heat shock protein 90 [Rhizobium etli CIAT 652]
gi|190700600|gb|ACE94682.1| heat shock chaperone protein, Hsp90 family [Rhizobium etli CIAT 652]
Length = 628
Score = 46.2 bits (108), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 63/144 (43%), Gaps = 30/144 (20%)
Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD---NIYPENVE-PTLTFI 1233
N+ES++ + ARL LH + +YS FL EL+ NA D + E +E P L
Sbjct: 8 NVESHVFEADVARL---LHMMVHSVYSDKDVFLRELISNAADACEKLRYEAIEAPALLGS 64
Query: 1234 LQESGI-----------VVLNNEQGFSAENI------------RALCDVGNSTKKGSSAG 1270
+ ES I VV +N G + + RA D ++K G A
Sbjct: 65 VAESRITLSLDDESRQLVVEDNGIGMQRDEMIEALGTIARSGTRAFMDRIAASKAGEGAQ 124
Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSN 1294
IG+ G+GF S F V D ++ S
Sbjct: 125 LIGQFGVGFYSCFMVADRVDVISR 148
>gi|323356692|ref|YP_004223088.1| hypothetical protein MTES_0244 [Microbacterium testaceum StLB037]
gi|323273063|dbj|BAJ73208.1| hypothetical protein MTES_0244 [Microbacterium testaceum StLB037]
Length = 715
Score = 46.2 bits (108), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 62/130 (47%), Gaps = 21/130 (16%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNA---DDNIYPENVEPTLTFILQESGIVVLNNEQGFSAE 1251
L L LYS + + ELV NA D N+ VE ++ + + ++ +G + E
Sbjct: 29 LEALGINLYSNAAAVVSELVANAWDADANL----VEINWNAGGDDAFVEISDDGRGMTRE 84
Query: 1252 NI-RALCDVGNS---TKKGSSAGY----IGRKGIGFKSVFRVTDAPEIHS------NGFH 1297
+ R VG S T+ G S GY +GRKGIG SVF + D E+HS N F
Sbjct: 85 ELKRKFLTVGYSKRTTEGGLSDGYKRPFMGRKGIGKLSVFSIADTVEVHSARNGERNAFR 144
Query: 1298 VKFDTSEGQI 1307
++ D + +I
Sbjct: 145 IEVDELKKRI 154
>gi|448690608|ref|ZP_21695769.1| hypothetical protein C444_18702 [Haloarcula japonica DSM 6131]
gi|445776570|gb|EMA27547.1| hypothetical protein C444_18702 [Haloarcula japonica DSM 6131]
Length = 1399
Score = 45.8 bits (107), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
LLEL+QNADD + + + L E G+ V N+ + FS E I +L NS K+
Sbjct: 38 LLELIQNADDAGWESSGSVDVLIELTEEGLFVANSGEPFSREGISSLMLSDNSPKQFREE 97
Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
IG KG+GF+S+ + I S + F
Sbjct: 98 -CIGYKGLGFRSILNWSSDVLIQSGEVSIGF 127
>gi|432115437|gb|ELK36852.1| Sacsin [Myotis davidii]
Length = 1679
Score = 45.8 bits (107), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + V NN F+
Sbjct: 147 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 205
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 206 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 248
Score = 43.5 bits (101), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 19/124 (15%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1508 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1566
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1567 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1624
Query: 1305 GQIG 1308
I
Sbjct: 1625 NHIS 1628
>gi|153809618|ref|ZP_01962286.1| hypothetical protein BACCAC_03936 [Bacteroides caccae ATCC 43185]
gi|149127743|gb|EDM18968.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein, partial
[Bacteroides caccae ATCC 43185]
Length = 488
Score = 45.8 bits (107), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + L + V+ +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYAGRIDVFLNDDKTVIFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VTD ++ S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTDEIKVES 124
>gi|187951065|gb|AAI38484.1| Sacs protein [Mus musculus]
gi|223460292|gb|AAI38483.1| Sacs protein [Mus musculus]
Length = 744
Score = 45.8 bits (107), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 59/136 (43%), Gaps = 22/136 (16%)
Query: 1178 NMESNMLKKQHARLGRALHCLSQEL------YSQDSHFLLELVQNADDN-------IYPE 1224
N++S LK R G+ L L Y + L EL+QNA+D +Y E
Sbjct: 77 NLQSKGLKG-GGRFGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVKFLYDE 135
Query: 1225 NVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
T T Q S + V NN F+ E+ + ++ S KK +GR GIG
Sbjct: 136 TQYGTETLWSKDMAQYQGSALYVYNNAV-FTPEDWHGIQEIARSRKKDDPLK-VGRFGIG 193
Query: 1279 FKSVFRVTDAPEIHSN 1294
F SV+ +TD P I S
Sbjct: 194 FNSVYHITDVPCIFSG 209
>gi|219521373|gb|AAI71956.1| Sacs protein [Mus musculus]
Length = 744
Score = 45.8 bits (107), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 59/136 (43%), Gaps = 22/136 (16%)
Query: 1178 NMESNMLKKQHARLGRALHCLSQEL------YSQDSHFLLELVQNADDN-------IYPE 1224
N++S LK R G+ L L Y + L EL+QNA+D +Y E
Sbjct: 77 NLQSKGLKG-GGRFGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVKFLYDE 135
Query: 1225 NVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
T T Q S + V NN F+ E+ + ++ S KK +GR GIG
Sbjct: 136 TQYGTETLWSKDMAQYQGSALYVYNNAV-FTPEDWHGIQEIARSRKKDDPLK-VGRFGIG 193
Query: 1279 FKSVFRVTDAPEIHSN 1294
F SV+ +TD P I S
Sbjct: 194 FNSVYHITDVPCIFSG 209
>gi|405975168|gb|EKC39753.1| Sacsin [Crassostrea gigas]
Length = 629
Score = 45.4 bits (106), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 98/242 (40%), Gaps = 41/242 (16%)
Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPT-----LTFILQESGIVVLNNEQG 1247
Y D L E++QNA+D + P EP+ + +VV NN +
Sbjct: 34 YPDDGQILKEIIQNAEDAGASEMKILYDARPAVQEPSTKKAPFRKYFKGPALVVHNNAE- 92
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
F+ ++ + + + +S K+ +GR G+GFKSVF +TD P I S + D +
Sbjct: 93 FTEDDWKGIKMLYSSIKEFDKTK-VGRFGLGFKSVFHITDYPVIISGDQLLVLDPHQDS- 150
Query: 1308 GFVLPTL-------VPPFNID----MFCRLLSKDPVQLESKCW-NTCIRLPFR---TKFS 1352
V T+ NI+ F + D LES + T R P R T S
Sbjct: 151 SKVCQTMKLKKLHRYKKMNIEDCLKTFSGVFGFDQNTLESGHFRGTIFRFPLRQEETDLS 210
Query: 1353 EGIAMNNIVD----MFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSC 1408
+ I + VD F D P LLFL L+ I N+S + +G+ I V
Sbjct: 211 DNIYDKSKVDDLFMSFKDEAPVSLLFLKCLESITLLREENES----KTGDIGEKIFSVHI 266
Query: 1409 GE 1410
E
Sbjct: 267 DE 268
>gi|392843737|gb|AFM84472.1| sacsin, partial [Capra hircus]
Length = 737
Score = 45.4 bits (106), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 59/125 (47%), Gaps = 20/125 (16%)
Query: 1180 ESNMLKK--QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQES 1237
E MLK+ Q+A +A D F+ + Q+ D I+ + P L
Sbjct: 421 EKEMLKELLQNANYAKA----------TDICFVFDPRQHPVDRIFDDKWAP-----LHGP 465
Query: 1238 GIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGF 1296
+ V NN Q F+ +++R + ++G TK+G+ G+ G+GF SV+ +TD P I N
Sbjct: 466 ALCVYNN-QPFTEDDVRGIQNLGRGTKEGNPCK-TGQYGVGFNSVYHITDCPSFISGNDI 523
Query: 1297 HVKFD 1301
FD
Sbjct: 524 LCIFD 528
>gi|313147938|ref|ZP_07810131.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|423281212|ref|ZP_17260123.1| hypothetical protein HMPREF1203_04340 [Bacteroides fragilis HMW 610]
gi|424665851|ref|ZP_18102887.1| hypothetical protein HMPREF1205_01726 [Bacteroides fragilis HMW 616]
gi|313136705|gb|EFR54065.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|404574104|gb|EKA78855.1| hypothetical protein HMPREF1205_01726 [Bacteroides fragilis HMW 616]
gi|404583376|gb|EKA88057.1| hypothetical protein HMPREF1203_04340 [Bacteroides fragilis HMW 610]
Length = 588
Score = 45.4 bits (106), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I E+ + L+E G VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALHNIDESYAGRIDVFLKEDGSVVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVHRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|383111062|ref|ZP_09931880.1| hypothetical protein BSGG_2167 [Bacteroides sp. D2]
gi|313694632|gb|EFS31467.1| hypothetical protein BSGG_2167 [Bacteroides sp. D2]
Length = 588
Score = 45.4 bits (106), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E+ VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNENKTVVFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|92119166|ref|YP_578895.1| hypothetical protein Nham_3709 [Nitrobacter hamburgensis X14]
gi|91802060|gb|ABE64435.1| hypothetical protein Nham_3709 [Nitrobacter hamburgensis X14]
Length = 1306
Score = 45.4 bits (106), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 71/309 (22%), Positives = 122/309 (39%), Gaps = 39/309 (12%)
Query: 1210 LLELVQNADDNIYPENVEP--TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
L ELVQNA D I E EP + L + N I AL +V +S K+G+
Sbjct: 57 LYELVQNAADAIL-EAEEPHGRICVRLTPDRLEAANTGAALDEGGIVALLNVRSSPKRGN 115
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS------EGQIGFVLPTLVPPFNID 1321
IGR GIGFKS+ R+ ++ S ++FD +G P +
Sbjct: 116 Q---IGRFGIGFKSLLRLGGRVDLTSKSLGLRFDPGACRNRIRNHLGLTSDARAPGMRLA 172
Query: 1322 MFCRLLSK-DPVQLESKC-WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
+K P+ + + W T + + ++ + + ++ LLFL
Sbjct: 173 EVLDPNAKGSPLSGQGRWNWATTV---VSAEIADKSTFDRLKKELAEFPAEFLLFLPSDI 229
Query: 1380 CIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALA 1439
+ + ++ ++ G+ + VS G ++ W V ++R V P+ ++ A
Sbjct: 230 SLEIEVEGAEPRLIAKRFEEGNAV--VSDGANESIWKVFGTRVR--VENPEARSD----A 281
Query: 1440 LTLQESNEGNYGPLLYQQPV---------FAFLPLRTYG-LKFILQGDFVLPSSREEVDG 1489
+Q ++ PL + PV +AF P T IL + L S R +
Sbjct: 282 THIQARDQ---VPLAWAAPVGGREQAGRFWAFFPTETQSRTSGILNAPWKLNSDRTNLI- 337
Query: 1490 NSPWNQWLL 1498
PWN+ ++
Sbjct: 338 RGPWNEAIM 346
>gi|423294100|ref|ZP_17272227.1| hypothetical protein HMPREF1070_00892 [Bacteroides ovatus CL03T12C18]
gi|392676596|gb|EIY70027.1| hypothetical protein HMPREF1070_00892 [Bacteroides ovatus CL03T12C18]
Length = 588
Score = 45.4 bits (106), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E+ VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNENKTVVFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|423220459|ref|ZP_17206954.1| hypothetical protein HMPREF1061_03727 [Bacteroides caccae CL03T12C61]
gi|392623536|gb|EIY17639.1| hypothetical protein HMPREF1061_03727 [Bacteroides caccae CL03T12C61]
Length = 588
Score = 45.4 bits (106), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + L + V+ +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYAGRIDVFLNDDKTVIFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VTD ++ S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTDEIKVES 124
>gi|336404446|ref|ZP_08585141.1| hypothetical protein HMPREF0127_02454 [Bacteroides sp. 1_1_30]
gi|335942243|gb|EGN04090.1| hypothetical protein HMPREF0127_02454 [Bacteroides sp. 1_1_30]
Length = 588
Score = 45.4 bits (106), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGVGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|85677482|ref|NP_766397.2| sacsin [Mus musculus]
gi|74144304|dbj|BAE36019.1| unnamed protein product [Mus musculus]
Length = 768
Score = 45.4 bits (106), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q S + V NN F+
Sbjct: 107 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 165
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 166 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFSG 209
>gi|300764737|ref|ZP_07074728.1| hypothetical protein LMHG_11046 [Listeria monocytogenes FSL N1-017]
gi|404281267|ref|YP_006682165.1| hypothetical protein LMOSLCC2755_1715 [Listeria monocytogenes
SLCC2755]
gi|300514623|gb|EFK41679.1| hypothetical protein LMHG_11046 [Listeria monocytogenes FSL N1-017]
gi|404227902|emb|CBY49307.1| hypothetical protein LMOSLCC2755_1715 [Listeria monocytogenes
SLCC2755]
Length = 1507
Score = 45.4 bits (106), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 75/317 (23%), Positives = 139/317 (43%), Gaps = 58/317 (18%)
Query: 1210 LLELVQNADD---NIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
+LEL+QN DD +I EN E + + + + V N FS + I LC +G +
Sbjct: 46 ILELLQNVDDAYGDIKSENDASNDEVKVQITYKNNILEVGNTGTSFSKDTIERLC-LGRA 104
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
+ K S IG KG GF+S+ + E++S F ++F SE + V + D
Sbjct: 105 SNKSSQN--IGNKGTGFRSLLNDAEWVELYSGEFAIRF--SEEFTKKLFEQYVSR-DSDK 159
Query: 1323 FCRLLSK--------------------DPVQLESKCWNTCIRLPFR-TKFSEGIAMNNIV 1361
F L+ + P+ ++T IR+ + T S+G +++
Sbjct: 160 FSELIFEQKKNWKKEDYDLCFPIMNCPQPINKIDSGFDTLIRVKLKETNLSKGTSVSK-- 217
Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV-GDGII----KVSCGEDKMTWF 1416
+ + S LLFL + I+ ++ I +KIV G+ ++ V+ +F
Sbjct: 218 QLQQPFYKS-LLFLPNITKIVIETTDDEK---ISEKIVDGNDVLIEKSDVTGQSPSEEYF 273
Query: 1417 VASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG--LKFIL 1474
V ++ G K +IALA+ S + +Y ++ ++ + P+R +G + ++
Sbjct: 274 VFDKESMIG-----DKVAKIALAV----SKDSDYD--YSKEKLYCYFPVRNFGTPIHALI 322
Query: 1475 QGDFVLPSSREEVDGNS 1491
F+ +SR++V +S
Sbjct: 323 NAPFITNNSRDDVPDDS 339
>gi|160887425|ref|ZP_02068428.1| hypothetical protein BACOVA_05444 [Bacteroides ovatus ATCC 8483]
gi|156107836|gb|EDO09581.1| S1 RNA binding domain protein [Bacteroides ovatus ATCC 8483]
Length = 1862
Score = 45.4 bits (106), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 61/123 (49%), Gaps = 11/123 (8%)
Query: 1186 KQHAR------LGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESG 1238
KQH + L + Q LY+ EL+QNADD PEN V+ L L ++
Sbjct: 464 KQHTSPNTNTSVANLLEEVGQNLYTSKKRMFFELLQNADDAA-PENGVKVKLQ--LSDNY 520
Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
VV ++ F+ + ++ STK SS G KGIGFKSVF +++ I S G++
Sbjct: 521 FVVTHDGFAFNKHDFESITSAAKSTKSASSKK-TGYKGIGFKSVFTNSESVLIKSAGYNF 579
Query: 1299 KFD 1301
FD
Sbjct: 580 SFD 582
>gi|298480533|ref|ZP_06998730.1| heat shock protein HtpG [Bacteroides sp. D22]
gi|295086232|emb|CBK67755.1| Molecular chaperone, HSP90 family [Bacteroides xylanisolvens XB1A]
gi|298273354|gb|EFI14918.1| heat shock protein HtpG [Bacteroides sp. D22]
Length = 588
Score = 45.4 bits (106), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGVGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|262405471|ref|ZP_06082021.1| heat shock protein HtpG [Bacteroides sp. 2_1_22]
gi|294645217|ref|ZP_06722936.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Bacteroides ovatus SD CC 2a]
gi|294809664|ref|ZP_06768354.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Bacteroides xylanisolvens SD CC 1b]
gi|345510656|ref|ZP_08790220.1| heat shock protein HtpG [Bacteroides sp. D1]
gi|229442678|gb|EEO48469.1| heat shock protein HtpG [Bacteroides sp. D1]
gi|262356346|gb|EEZ05436.1| heat shock protein HtpG [Bacteroides sp. 2_1_22]
gi|292639460|gb|EFF57759.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Bacteroides ovatus SD CC 2a]
gi|294443149|gb|EFG11926.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Bacteroides xylanisolvens SD CC 1b]
Length = 588
Score = 45.4 bits (106), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGVGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|397642533|gb|EJK75294.1| hypothetical protein THAOC_02985 [Thalassiosira oceanica]
Length = 2093
Score = 45.4 bits (106), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAEN 1252
R + ++ +++ +L EL+QNA D+ T+ F +Q++ + V +N + F+ +
Sbjct: 159 RGFLSVLEKQFARGDLYLFELLQNAVDD-----GADTVRFAIQDNTLQVSHNGRRFTPLD 213
Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFR 1284
+ L VG STK IG G+GFK+V++
Sbjct: 214 VLGLSSVGLSTKSRQGKRTIGFMGVGFKAVYK 245
>gi|315641083|ref|ZP_07896162.1| phage minor structural protein [Enterococcus italicus DSM 15952]
gi|315483091|gb|EFU73608.1| phage minor structural protein [Enterococcus italicus DSM 15952]
Length = 1507
Score = 45.4 bits (106), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 75/317 (23%), Positives = 135/317 (42%), Gaps = 58/317 (18%)
Query: 1210 LLELVQNADD---NIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
+LEL+QN DD +I EN E + + + + V N FS + I LC +G +
Sbjct: 46 ILELLQNVDDAYGDIKSENDSNNDEVKVRITYKNNILEVGNTGTSFSKDTIERLC-LGRA 104
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
+ K S IG KG GF+S+ + E++S F ++F SE + V D
Sbjct: 105 SNKSSQN--IGNKGTGFRSLLNDAEWVELYSGEFAIRF--SEEFTKRLFEQYVSR-ESDK 159
Query: 1323 FCRLLSK--------------------DPVQLESKCWNTCIRLPFR-TKFSEGIAMNNIV 1361
F L+ + P+ ++T IR+ + T S+G + +
Sbjct: 160 FSELIFEQQKNWKKEDYDLCFPIMNCPQPINKVDSDFDTLIRVKLKETNLSKGTS---VA 216
Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVG--DGIIK---VSCGEDKMTWF 1416
LLFL + I+ ++ I +KIV D +I+ V+ +F
Sbjct: 217 KQLQQPFYKSLLFLPNITKIVIETSDDEK---ISEKIVDGHDVLIEKSDVTGQAPSEEYF 273
Query: 1417 VASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG--LKFIL 1474
V ++ R G K +IA+A+ S + +Y ++ ++ + P+R +G + ++
Sbjct: 274 VFDKESRIG-----DKVAKIAVAV----SKDSDYD--YSKEKLYCYFPVRNFGTPIHALI 322
Query: 1475 QGDFVLPSSREEVDGNS 1491
F+ +SR++V +S
Sbjct: 323 NAPFITNNSRDDVPDDS 339
>gi|336417388|ref|ZP_08597712.1| hypothetical protein HMPREF1017_04820 [Bacteroides ovatus 3_8_47FAA]
gi|335936134|gb|EGM98074.1| hypothetical protein HMPREF1017_04820 [Bacteroides ovatus 3_8_47FAA]
Length = 588
Score = 45.4 bits (106), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|299147086|ref|ZP_07040153.1| heat shock protein HtpG [Bacteroides sp. 3_1_23]
gi|298514971|gb|EFI38853.1| heat shock protein HtpG [Bacteroides sp. 3_1_23]
Length = 588
Score = 45.4 bits (106), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|237720427|ref|ZP_04550908.1| heat shock protein HtpG [Bacteroides sp. 2_2_4]
gi|293370430|ref|ZP_06616983.1| histidine kinase domain family protein [Bacteroides ovatus SD CMC 3f]
gi|229450178|gb|EEO55969.1| heat shock protein HtpG [Bacteroides sp. 2_2_4]
gi|292634422|gb|EFF52958.1| histidine kinase domain family protein [Bacteroides ovatus SD CMC 3f]
Length = 588
Score = 45.4 bits (106), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|39935288|ref|NP_947564.1| ATPase [Rhodopseudomonas palustris CGA009]
gi|39649140|emb|CAE27660.1| Possible ATPase [Rhodopseudomonas palustris CGA009]
Length = 1413
Score = 45.1 bits (105), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 52/111 (46%), Gaps = 8/111 (7%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
L H L +++ S H L EL+QNADD E ++ + +N + F
Sbjct: 38 LAGPWHQLFKQVQSP-RHILSELLQNADDAGASEA-----RVSIENDRFIFQHNGEDFIE 91
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
+ ++C G S K+ IG +GIGFKS F + D E+++ V F+
Sbjct: 92 AHFASICRFGYSNKRALHT--IGFRGIGFKSTFSLGDRVELYTPTLAVAFE 140
>gi|160883939|ref|ZP_02064942.1| hypothetical protein BACOVA_01913 [Bacteroides ovatus ATCC 8483]
gi|423290634|ref|ZP_17269483.1| hypothetical protein HMPREF1069_04526 [Bacteroides ovatus CL02T12C04]
gi|156110669|gb|EDO12414.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Bacteroides ovatus ATCC 8483]
gi|392665287|gb|EIY58815.1| hypothetical protein HMPREF1069_04526 [Bacteroides ovatus CL02T12C04]
Length = 588
Score = 45.1 bits (105), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + + L E VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|424878843|ref|ZP_18302481.1| molecular chaperone of HSP90 family [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392520353|gb|EIW45083.1| molecular chaperone of HSP90 family [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 630
Score = 45.1 bits (105), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 59/139 (42%), Gaps = 30/139 (21%)
Query: 1186 KQHA---RLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
+QHA + R LH + +YS FL EL+ NA D + + ++P
Sbjct: 10 EQHAFEADVSRLLHMMVHSVYSDKDVFLRELISNAADACEKLRYDSISQPQLSADGIQPG 69
Query: 1230 LTFILQESG--IVVLNNEQGFS-AENIRALCDVGNS-----------TKKGSSAGYIGRK 1275
+ L E +VV +N G S +E I AL + S K G A IG+
Sbjct: 70 ILVTLNEEALTLVVEDNGIGMSRSEMIDALGTIARSGTKAFMERLEAAKAGEKAELIGQF 129
Query: 1276 GIGFKSVFRVTDAPEIHSN 1294
G+GF S F V D ++ S
Sbjct: 130 GVGFYSSFMVADKVDVISR 148
>gi|374849494|dbj|BAL52508.1| ATPase [uncultured Bacteroidetes bacterium]
Length = 1361
Score = 45.1 bits (105), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 53/108 (49%), Gaps = 11/108 (10%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
H L EL+QNADD ++ ++++ + ++ FS + +LC G S K+
Sbjct: 45 HVLSELLQNADDAGARH-----VSVRVEDNVFIFEHDGHDFSEDEFASLCRFGFSNKR-- 97
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
+ IG +GIGFKS F + ++ E+ + V+F F LP +
Sbjct: 98 TLYTIGFRGIGFKSTFSLGESVELLTQTLAVRFKKKR----FTLPIWI 141
>gi|315281648|ref|ZP_07870227.1| histidine kinase domain-containing protein [Listeria marthii FSL
S4-120]
gi|313614704|gb|EFR88269.1| histidine kinase domain-containing protein [Listeria marthii FSL
S4-120]
Length = 601
Score = 45.1 bits (105), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 56/123 (45%), Gaps = 30/123 (24%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------T 1231
L + LS LY + ++ EL+QNA D I + ++PTL T
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDPTLEGKIHASLIGDINEKT 68
Query: 1232 FILQESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFR 1284
IL+++GI G + + + A L + NS+K SS +IGR GIG S F
Sbjct: 69 LILEDNGI-------GLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFI 121
Query: 1285 VTD 1287
V+D
Sbjct: 122 VSD 124
>gi|148553124|ref|YP_001260706.1| hypothetical protein Swit_0197 [Sphingomonas wittichii RW1]
gi|148498314|gb|ABQ66568.1| hypothetical protein Swit_0197 [Sphingomonas wittichii RW1]
Length = 1414
Score = 45.1 bits (105), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 7/96 (7%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
H L EL+QNADD E ++ V +N + F + ++C G S K+
Sbjct: 54 HILSELLQNADDAGASEA-----HVSIEGERFVFEHNGEDFIEAHFASICRFGYSNKRAL 108
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
IG +GIGFKS F + D E+++ V F+ +
Sbjct: 109 HT--IGFRGIGFKSTFSLGDRVELYTPSLAVAFEKA 142
>gi|258654836|ref|YP_003203992.1| hypothetical protein Namu_4727 [Nakamurella multipartita DSM 44233]
gi|258558061|gb|ACV81003.1| hypothetical protein Namu_4727 [Nakamurella multipartita DSM 44233]
Length = 1012
Score = 44.7 bits (104), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 42/96 (43%), Gaps = 1/96 (1%)
Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG 1266
L+EL N D +V L L + + V N +A + L + S K+
Sbjct: 50 GRALVELAANGVDAARDADVPARLRIRLIDGELRVANTGAPLTAAGVAGLASLRASAKR- 108
Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT 1302
SAG IG G+GF +V T AP + S ++FD
Sbjct: 109 DSAGSIGFFGVGFTAVLSWTQAPRVISTTGGIRFDA 144
>gi|318062490|ref|ZP_07981211.1| hypothetical protein SSA3_31430 [Streptomyces sp. SA3_actG]
Length = 1051
Score = 44.7 bits (104), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 74/167 (44%), Gaps = 14/167 (8%)
Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
+ V N+ A + +L + S K+ G +GR G+GF +V V+D P + V
Sbjct: 94 LAVANSGAPLDASGVASLATLRASAKREDGTGTVGRFGVGFAAVLAVSDEPAVLGRTGGV 153
Query: 1299 KFDTSEGQIGFVLPTLVPPFNIDMFCR----LLSKDPVQLESKC---WNTCIRLPFRTKF 1351
++ +E + V VP + ++ R L + P E ++T + LP R
Sbjct: 154 RWSLTEAREA-VAALRVPALDEEVRRREGHVPLLRLPFPAEGSAPEGYDTVVVLPLR--- 209
Query: 1352 SEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
+ +A + + + + + +LLL L L I+ + V+R+++
Sbjct: 210 -DAVAEDLVERLLTGIDDALLLSLPGLAEIVVET--PEGTRVLRRRV 253
>gi|326670429|ref|XP_682866.5| PREDICTED: sacsin [Danio rerio]
Length = 2235
Score = 44.7 bits (104), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 82/186 (44%), Gaps = 45/186 (24%)
Query: 1203 YSQDSHFLLELVQNADDN---------IYPENVEPTLTFILQESGIVVLN-------NEQ 1246
Y ++S EL+QNA+D ++ +P + I + G+ + N N +
Sbjct: 842 YDEESDIFKELLQNAEDAGATTCKFLLDLRKHRDPPESLI--DDGMALCNGPCLWIFNNE 899
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH 1297
FS E+ R + VG+++K+ + IG+ G+GF +V+ V+D P I S N H
Sbjct: 900 LFSEEDWRNILKVGSASKE-NKVEMIGKFGLGFNAVYHVSDIPSILSGKKLLILDPNVTH 958
Query: 1298 ------------VKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWN-TCIR 1344
+K D + ++ P P+ C L V+ + +N T I+
Sbjct: 959 LEKHIGSKGNPGIKLDPFQERLCKRFPGQFKPYEGIFDCDL----SVRNSKRAYNGTLIK 1014
Query: 1345 LPFRTK 1350
LPFRT+
Sbjct: 1015 LPFRTQ 1020
Score = 44.7 bits (104), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 15/104 (14%)
Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTF------ILQESGIVVLNNEQG 1247
+ Y L EL+QNADD ++ + T +LQ + V NN +
Sbjct: 1833 DAYPSKKDILKELIQNADDAQATEIHFVWDKRKHQTKKIFGKKWELLQGPALCVYNN-RT 1891
Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI 1291
FS +++ + +G K G+ G G+ G+GF SV+ +TD P I
Sbjct: 1892 FSDADLQGIQQLGEGGKHGT-LGRTGKYGLGFNSVYHLTDCPSI 1934
Score = 44.7 bits (104), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 49/111 (44%), Gaps = 13/111 (11%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQE-----SGIVVLNNEQGFSA 1250
Y L EL+QNADD I+ E T + +E + N+ F+
Sbjct: 31 YPDGGQILKELIQNADDAGASTVVFIHDERHYETHSLWTKELRKYQGPALYAFNDAAFTE 90
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
E+ + VG S K+ +GR GIGF SV+ +TD P + S+ FD
Sbjct: 91 EDWEGIQRVGRSIKQDDPTK-VGRFGIGFNSVYHITDLPCVFSSEHLAIFD 140
>gi|241258650|ref|YP_002978534.1| heat shock protein 90 [Rhizobium leguminosarum bv. trifolii WSM1325]
gi|240863120|gb|ACS60783.1| heat shock protein Hsp90 [Rhizobium leguminosarum bv. trifolii
WSM1325]
Length = 630
Score = 44.7 bits (104), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 56/132 (42%), Gaps = 30/132 (22%)
Query: 1186 KQHA---RLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
+QHA + R LH + +YS FL EL+ NA D + + ++P
Sbjct: 10 EQHAFEADVSRLLHMMVHSVYSDKDIFLRELISNAADACEKLRYDSISQPQLSADGIQPG 69
Query: 1230 LTFILQESG--IVVLNNEQGFS-AENIRALCDVGNS-----------TKKGSSAGYIGRK 1275
+ L E +VV +N G S +E I AL + S K G A IG+
Sbjct: 70 ILVTLNEEALTLVVEDNGIGMSRSEMIDALGTIARSGTKAFMERLDAAKAGEKAELIGQF 129
Query: 1276 GIGFKSVFRVTD 1287
G+GF S F V D
Sbjct: 130 GVGFYSSFMVAD 141
>gi|424876975|ref|ZP_18300634.1| molecular chaperone of HSP90 family [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393164578|gb|EJC64631.1| molecular chaperone of HSP90 family [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 630
Score = 44.7 bits (104), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 59/139 (42%), Gaps = 30/139 (21%)
Query: 1186 KQHA---RLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
+QHA + R LH + +YS FL EL+ NA D + + ++P
Sbjct: 10 EQHAFEADVSRLLHMMVHSVYSDKDVFLRELISNAADACEKLRYDSISEPRLSADGIQPG 69
Query: 1230 LTFILQESG--IVVLNNEQGFSA-ENIRALCDVGNS-----------TKKGSSAGYIGRK 1275
+ L E +VV +N G S E I AL + S +K G A IG+
Sbjct: 70 ILVTLNEEALMLVVEDNGIGMSRNEMIDALGTIARSGTKAFMERLEASKAGEKAELIGQF 129
Query: 1276 GIGFKSVFRVTDAPEIHSN 1294
G+GF S F V D ++ S
Sbjct: 130 GVGFYSSFMVADKVDVISR 148
>gi|402771382|ref|YP_006590919.1| Chaperone protein HtpG [Methylocystis sp. SC2]
gi|401773402|emb|CCJ06268.1| Chaperone protein HtpG [Methylocystis sp. SC2]
Length = 673
Score = 44.7 bits (104), Expect = 0.60, Method: Composition-based stats.
Identities = 37/127 (29%), Positives = 56/127 (44%), Gaps = 24/127 (18%)
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLTFIL- 1234
A + R L ++ +YS+ FL ELV NA D+ + + P +T L
Sbjct: 65 ADVARLLELMTHSVYSERDVFLRELVSNAADACEKLRYEALSDEKLAAQAAAPLVTISLD 124
Query: 1235 -QESGIVVLNNEQGFSAEN-IRALCDVGNSTK--------KGSSAGYIGRKGIGFKSVFR 1284
++ + + +N G S E I AL + NS K AG+IG+ G+GF S F
Sbjct: 125 KEKRTLTIADNGVGMSREELIGALGTIANSGTRAFLDKLGKEKDAGFIGQFGVGFYSAFM 184
Query: 1285 VTDAPEI 1291
V E+
Sbjct: 185 VAGRVEV 191
>gi|451820496|ref|YP_007456697.1| hypothetical protein DUF3883 [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
gi|451786475|gb|AGF57443.1| hypothetical protein DUF3883 [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
Length = 1614
Score = 44.7 bits (104), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 133/330 (40%), Gaps = 50/330 (15%)
Query: 1210 LLELVQNADDN--IYPENVEPT--LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
LLEL+QNADD I E E + L+++ ++V NN + FS E I++L N + K
Sbjct: 43 LLELIQNADDACVIKKEQTEKKRIIHIRLKDNKLIVANNGEPFSREGIKSLM-YSNLSPK 101
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
IG KG GF+S+ T + I S ++ F + E F++ + NI F
Sbjct: 102 AKKKNKIGNKGTGFRSILSWTKSIFIKSGDLYLNF-SKESAEKFLIDLMDKNQNISEFIE 160
Query: 1326 LLS--KDPVQLESKCWNTCIRLPFRTKFSEGI--------------AMNNIVDMFSDLHP 1369
S K P+ + +P F EG I S+L
Sbjct: 161 EYSDFKYPI--------ATLIVPEMVDFREGYDEYDTYIELELIENVKEEIEKQISELDK 212
Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE------DKMTWFVASQKLR 1423
+LFL+ L+ + + + + KKI+ I+K+ + + TW V K R
Sbjct: 213 ESILFLNNLETLKIES--ENISIEFTKKIITSKIVKIVESDLLNGTFNNKTWNV---KKR 267
Query: 1424 AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSS 1483
G + D K EI++A T + N ++ V P I+ G F L +
Sbjct: 268 EGQV--DGKNFEISIAYTDNLDDNKNILYSYFKTDVKFPFPA-------IIHGTFNLTGN 318
Query: 1484 REEVDGNSPWNQWLLSEFPALFVSAERSFC 1513
R ++ ++ N+ LL L + +
Sbjct: 319 RNQLVKDNEENKKLLQFLAELLIETAKEIA 348
>gi|423214844|ref|ZP_17201372.1| hypothetical protein HMPREF1074_02904 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692107|gb|EIY85345.1| hypothetical protein HMPREF1074_02904 [Bacteroides xylanisolvens
CL03T12C04]
Length = 588
Score = 44.7 bits (104), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + L E VV +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYNGRIDVFLNEDKTVVFRDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124
>gi|326526959|dbj|BAK00868.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 161
Score = 44.7 bits (104), Expect = 0.61, Method: Composition-based stats.
Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 13/111 (11%)
Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL-QESGIVVL-NNEQGFSA 1250
Y + + L EL+QNADD ++ L L Q G +L +N+ F+
Sbjct: 35 YPEGTTALRELIQNADDAGASRVRLCLDLRAHGASSLLAPALAQWQGPALLAHNDAAFTD 94
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
++ ++ +G S KK S A R G+GF SV+ +T+ P S + V FD
Sbjct: 95 DDFASISRIGGS-KKASQAWKTSRFGVGFNSVYHLTNLPSFVSGKYVVMFD 144
>gi|229489297|ref|ZP_04383161.1| hypothetical protein RHOER0001_0584 [Rhodococcus erythropolis SK121]
gi|229323790|gb|EEN89547.1| hypothetical protein RHOER0001_0584 [Rhodococcus erythropolis SK121]
Length = 1181
Score = 44.7 bits (104), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 4/94 (4%)
Query: 1210 LLELVQNADDNIYPENVEP-TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
L ELVQNA D + ++ + +L E + N + E + +G S K+G
Sbjct: 51 LFELVQNAADELRTDDHRGGKIHAVLTERYLYCANEGSPMTPEGADTILRMGVSKKRG-- 108
Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT 1302
G IGR G+G KSV VT P+ S FD
Sbjct: 109 -GQIGRFGVGVKSVLSVTRTPQFFSKTGSFGFDA 141
>gi|149175056|ref|ZP_01853679.1| Molecular chaperone, HSP90 family protein [Planctomyces maris DSM
8797]
gi|148846034|gb|EDL60374.1| Molecular chaperone, HSP90 family protein [Planctomyces maris DSM
8797]
Length = 861
Score = 44.7 bits (104), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 54/114 (47%), Gaps = 18/114 (15%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP---------------ENVEPTLTFILQESGI 1239
+ L++ LYS+ F+ EL+QNA D I E+ L FI++++G+
Sbjct: 12 IQLLAKNLYSEKRVFIRELIQNAHDGILRRQSRESDAFSPRIDIESRPDELQFIIRDNGL 71
Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ N+ G E + + +KG G +G+ GIGF S F V + E+ +
Sbjct: 72 GMDLNDIG---EYLAVIGRGATRLEKGDVTGLVGQFGIGFLSAFIVAERVEVET 122
>gi|405755020|ref|YP_006678484.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2540]
gi|404224220|emb|CBY75582.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2540]
Length = 601
Score = 44.7 bits (104), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNAFIGRFGIGLLSCFIVSD 124
>gi|405968312|gb|EKC33393.1| Sacsin [Crassostrea gigas]
Length = 1679
Score = 44.3 bits (103), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 12/102 (11%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGI----VVLNNEQGFSAE 1251
Y S + ELVQNA+D Y E + + + S + ++N+ F+AE
Sbjct: 168 YPDGSQIIRELVQNAEDAGARRFEIFYKEGDDKPASQLSYRSYFRTPALCVHNDGVFTAE 227
Query: 1252 NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ + + + S K+ A +GR G+GFKS+F +TD+P I S
Sbjct: 228 DWKGIKSIYTSVKE-KDALKVGRFGLGFKSIFHITDSPIIIS 268
>gi|431903117|gb|ELK09293.1| Sacsin [Pteropus alecto]
Length = 653
Score = 44.3 bits (103), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 48/105 (45%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + + V NN F+
Sbjct: 103 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGAALYVYNNAV-FT 161
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 162 PEDWHGIQEIARSRKKDDPL-KVGRFGIGFNSVYHITDVPCIFSG 205
>gi|409078002|gb|EKM78366.1| hypothetical protein AGABI1DRAFT_114668 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 483
Score = 44.3 bits (103), Expect = 0.73, Method: Composition-based stats.
Identities = 33/111 (29%), Positives = 42/111 (37%), Gaps = 13/111 (11%)
Query: 26 PSQPPPPHFNPNFAFQNPN----------IFFPNPALQLLQNLTNIPFQNPNFDIQNPNL 75
PS P PP F P F +PN P+P+ QL + N FQ P F N
Sbjct: 351 PSSPMPPQFQPQFVSSSPNPQYLAPSPSAQLMPSPSPQLQPTMPNPSFQ-PAFGNSTANG 409
Query: 76 FMQNANLLTQNPNLP--PQQPPSSACNQQRSQTQHPPGASNQQRPLTQQPQ 124
F + P + P Q S Q T +P QQ+P+T Q
Sbjct: 410 FAPGYMTPSPQPMMAAMPGQTQMSGMGMQPMGTFNPGMMQPQQQPMTMNGQ 460
>gi|422418435|ref|ZP_16495390.1| histidine kinase domain-containing protein [Listeria seeligeri FSL
N1-067]
gi|313634043|gb|EFS00725.1| histidine kinase domain-containing protein [Listeria seeligeri FSL
N1-067]
Length = 600
Score = 44.3 bits (103), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 30/120 (25%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
L + LS LY + ++ EL+QNA D I + +EP++ T IL
Sbjct: 11 LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLEPSIEGEIHVSLSGKADEKTLIL 70
Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK-----KGSSAG-YIGRKGIGFKSVFRVTD 1287
+++GI G + E + A L + NS+K +G A +IGR GIG S F V+D
Sbjct: 71 EDNGI-------GLTEEEVHAFLATIANSSKGEKKFEGQEANDFIGRFGIGLLSCFIVSD 123
>gi|295838149|ref|ZP_06825082.1| hsp90-like protein [Streptomyces sp. SPB74]
gi|295826894|gb|EDY43651.2| hsp90-like protein [Streptomyces sp. SPB74]
Length = 1079
Score = 44.3 bits (103), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 74/167 (44%), Gaps = 14/167 (8%)
Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
+ V N+ A + +L + S K+ G +GR G+GF +V V+D P + V
Sbjct: 122 LAVANSGAPLDAGGVASLATLRASAKREDGTGTVGRFGVGFAAVLAVSDEPAVLGRTGGV 181
Query: 1299 KFDTSEGQIGFVLPTLVPPFNIDMFCR----LLSKDPVQLESKC---WNTCIRLPFRTKF 1351
++ +E + V VP + ++ R L + P+ E ++T + LP R
Sbjct: 182 RWSLAEAREA-VAALGVPALDEEVRRREGHVPLLRLPLPAEGSAPEGYDTVVVLPLRDSV 240
Query: 1352 SEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
+E + + + + + +LLL L L I+ + V+R+++
Sbjct: 241 AEDL----VERLLAGIDDALLLSLPGLAEIVVET--PEGTRVLRRRV 281
>gi|432952627|ref|XP_004085167.1| PREDICTED: sacsin-like [Oryzias latipes]
Length = 664
Score = 44.3 bits (103), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 49/118 (41%), Gaps = 13/118 (11%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG-----IVVLNNEQGFSA 1250
Y L EL+QNADD I+ E T + + G + N F+
Sbjct: 82 YPDGGQILKELIQNADDGQATEVVFIHDERRYGTESLWTNKLGKYQGPALYAYNNAAFTD 141
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
E+ + G S K + IGR GIGF SV+ +TD P I S+G D E G
Sbjct: 142 EDWERIQMAGRSGKV-NDPNKIGRFGIGFNSVYHITDVPIIFSSGHLGVMDPQENIFG 198
>gi|289434206|ref|YP_003464078.1| HSP90 domain family protein [Listeria seeligeri serovar 1/2b str.
SLCC3954]
gi|289170450|emb|CBH26990.1| HSP90 domain family protein [Listeria seeligeri serovar 1/2b str.
SLCC3954]
Length = 600
Score = 44.3 bits (103), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 30/120 (25%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
L + LS LY + ++ EL+QNA D I + +EP++ T IL
Sbjct: 11 LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLEPSIEGKIHVSLSGKADEKTLIL 70
Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK-----KGSSAG-YIGRKGIGFKSVFRVTD 1287
+++GI G + E + A L + NS+K +G A +IGR GIG S F V+D
Sbjct: 71 EDNGI-------GLTEEEVHAFLATIANSSKGEKKFEGQEANDFIGRFGIGLLSCFIVSD 123
>gi|154246920|ref|YP_001417878.1| putative ATPase [Xanthobacter autotrophicus Py2]
gi|154161005|gb|ABS68221.1| putative ATPase [Xanthobacter autotrophicus Py2]
Length = 1406
Score = 44.3 bits (103), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 7/96 (7%)
Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
H L EL+QNADD E ++ V +N + F + ++C G S K+
Sbjct: 54 HILSELLQNADDAGASEA-----RVSIENDRFVFEHNGEDFIEAHFASICRFGYSNKRAL 108
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
IG +GIGFKS F + D E+ + V F+ +
Sbjct: 109 HT--IGFRGIGFKSTFSLGDRVELFTPTLAVAFERT 142
>gi|429730804|ref|ZP_19265450.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Corynebacterium durum F0235]
gi|429147242|gb|EKX90272.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
[Corynebacterium durum F0235]
Length = 623
Score = 43.9 bits (102), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 18/127 (14%)
Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLT 1231
+K LG + LS+ LYS +L EL+QN D+ Y + +
Sbjct: 5 EKFRVDLGGIVELLSRNLYSGSDVYLRELLQNGVDAITANAQSGKDDEGTYSSDTSGRIR 64
Query: 1232 FILQESGIVVLNNEQGFSAENIR-ALCDVGNSTKKG----SSAGYIGRKGIGFKSVFRVT 1286
F++ + V +N G + + R L +G S+K+ + Y+G+ GIG S F V+
Sbjct: 65 FVVDGHTLRVTDNGVGLNIDEARNLLATIGGSSKRDEFGLGRSDYLGQFGIGLLSCFMVS 124
Query: 1287 DAPEIHS 1293
D ++S
Sbjct: 125 DNIVVYS 131
>gi|423100037|ref|ZP_17087744.1| conserved hypothetical protein TIGR03982 [Listeria innocua ATCC
33091]
gi|370793038|gb|EHN60876.1| conserved hypothetical protein TIGR03982 [Listeria innocua ATCC
33091]
Length = 601
Score = 43.9 bits (102), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 30/120 (25%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
L + LS LY + ++ EL+QNA D I + +EPTL T IL
Sbjct: 12 LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIEPTLNGKIHVSLSGEKNNKTLIL 71
Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
+++GI G + + + A L + NS+K + +IGR GIG S F V+D
Sbjct: 72 EDNGI-------GLTEDEVHAFLATIANSSKGEKNFDGQDTNDFIGRFGIGLLSCFIVSD 124
>gi|348174622|ref|ZP_08881516.1| HSP90 family protein [Saccharopolyspora spinosa NRRL 18395]
Length = 622
Score = 43.9 bits (102), Expect = 0.94, Method: Composition-based stats.
Identities = 37/146 (25%), Positives = 60/146 (41%), Gaps = 30/146 (20%)
Query: 1176 ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD---------------- 1219
+ + E+++++ LG + LS+ +YS +L EL+QN D
Sbjct: 1 MDSAENSLVRPFQVELGGVVDLLSRHIYSGAQIYLRELLQNGRDAVAARAKLPGQPDGGS 60
Query: 1220 -NIYPENVEPTL--TFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-----SAGY 1271
IYP P F+ +++G+ + E G L VG S+K+ Y
Sbjct: 61 LEIYPVGTAPDAPEAFVFRDAGLGLSAAEAG------ELLATVGKSSKRDDVLSLRDENY 114
Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFH 1297
+G+ GIG S F V D + S H
Sbjct: 115 LGQFGIGLLSCFMVCDRIVVRSRSVH 140
>gi|422412394|ref|ZP_16489353.1| histidine kinase domain-containing protein [Listeria innocua FSL
S4-378]
gi|313619696|gb|EFR91318.1| histidine kinase domain-containing protein [Listeria innocua FSL
S4-378]
Length = 601
Score = 43.9 bits (102), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 30/120 (25%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
L + LS LY + ++ EL+QNA D I + +EPTL T IL
Sbjct: 12 LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIEPTLNGKIHVSLSGEKNNKTLIL 71
Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
+++GI G + + + A L + NS+K + +IGR GIG S F V+D
Sbjct: 72 EDNGI-------GLTEDEVHAFLATIANSSKGEKNFDGQDTNDFIGRFGIGLLSCFIVSD 124
>gi|404413028|ref|YP_006698615.1| chaperone/heat shock protein [Listeria monocytogenes SLCC7179]
gi|404238727|emb|CBY60128.1| chaperone/heat shock protein [Listeria monocytogenes SLCC7179]
Length = 601
Score = 43.9 bits (102), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|405968314|gb|EKC33395.1| Sacsin [Crassostrea gigas]
Length = 1493
Score = 43.9 bits (102), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 15/106 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPT-LTFI-----LQESGIVVLNNEQGFS 1249
Y + L EL+QNADD ++ + PT L F Q +VV NN FS
Sbjct: 1315 YPCNVSILNELIQNADDAGATEIHIVHDQRFHPTELIFDEKLADTQGPSLVVYNN-SCFS 1373
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
+I + +G +K GS G+ GIGF +V+ +TD P + G
Sbjct: 1374 KSDIEGISKLGEGSK-GSDPMTTGQFGIGFNAVYHITDVPSFLTIG 1418
Score = 41.6 bits (96), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 53/125 (42%), Gaps = 15/125 (12%)
Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGI-------VVLNNEQ 1246
E Y+ L E++QNADD +Y E L + G+ + + N+
Sbjct: 314 EEYADGVSVLKEMLQNADDAGATCLRILYDERTNENAKIGLIDKGMAECQGPAIWVYNDA 373
Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
F ++ + + +G TK IG+ G+GF SV+ +TD P I S V D
Sbjct: 374 KFKEDDFQNIIKLGAGTKMADLTK-IGKFGLGFCSVYNITDVPSILSGDSLVILDPRMIH 432
Query: 1307 IGFVL 1311
+G L
Sbjct: 433 LGKAL 437
>gi|46907174|ref|YP_013563.1| HSP90 family protein [Listeria monocytogenes serotype 4b str. F2365]
gi|405752162|ref|YP_006675627.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2378]
gi|417315010|ref|ZP_12101698.1| HSP90 family protein [Listeria monocytogenes J1816]
gi|424713821|ref|YP_007014536.1| Histidine kinase domain protein [Listeria monocytogenes serotype 4b
str. LL195]
gi|46880441|gb|AAT03740.1| histidine kinase domain protein [Listeria monocytogenes serotype 4b
str. F2365]
gi|328466963|gb|EGF38066.1| HSP90 family protein [Listeria monocytogenes J1816]
gi|404221362|emb|CBY72725.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2378]
gi|424013005|emb|CCO63545.1| Histidine kinase domain protein [Listeria monocytogenes serotype 4b
str. LL195]
Length = 601
Score = 43.9 bits (102), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|333026262|ref|ZP_08454326.1| putative hsp90-like protein [Streptomyces sp. Tu6071]
gi|332746114|gb|EGJ76555.1| putative hsp90-like protein [Streptomyces sp. Tu6071]
Length = 1051
Score = 43.9 bits (102), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 42/183 (22%), Positives = 76/183 (41%), Gaps = 25/183 (13%)
Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
+ V N+ A + +L + S K+ G +GR G+GF +V V+D P + V
Sbjct: 94 LAVANSGAPLDASGVASLATLRASAKREDGTGTVGRFGVGFAAVLAVSDEPAVLGRTGGV 153
Query: 1299 KFDTSEGQIGFVLPTLVPPFNIDMFCR----LLSKDPVQLESKC---WNTCIRLPFRTKF 1351
++ +E + V VP + ++ R L + P E ++T + LP R
Sbjct: 154 RWSLTEAREA-VAALGVPALDEEVRRREGHVPLLRLPFPAEGSAPEGYDTVVVLPLR--- 209
Query: 1352 SEGIAMNNIVDMFSDLHPSLLLFLHRLQCI---------MFRNMLNDSLVVIRKKIVGDG 1402
+ +A + + + + + +LLL L L I + R + D VI DG
Sbjct: 210 -DAVAEDLVERLLTGIDDALLLSLPGLAEIVVETPEGTRLLRRRVEDPYTVIED----DG 264
Query: 1403 IIK 1405
++
Sbjct: 265 AVR 267
>gi|284801273|ref|YP_003413138.1| hypothetical protein LM5578_1023 [Listeria monocytogenes 08-5578]
gi|284994415|ref|YP_003416183.1| hypothetical protein LM5923_0977 [Listeria monocytogenes 08-5923]
gi|284056835|gb|ADB67776.1| hypothetical protein LM5578_1023 [Listeria monocytogenes 08-5578]
gi|284059882|gb|ADB70821.1| hypothetical protein LM5923_0977 [Listeria monocytogenes 08-5923]
Length = 601
Score = 43.9 bits (102), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|93006822|ref|YP_581259.1| heat shock protein 90 [Psychrobacter cryohalolentis K5]
gi|118575198|sp|Q1Q978.1|HTPG_PSYCK RecName: Full=Chaperone protein HtpG; AltName: Full=Heat shock
protein HtpG; AltName: Full=High temperature protein G
gi|92394500|gb|ABE75775.1| heat shock protein Hsp90 [Psychrobacter cryohalolentis K5]
Length = 666
Score = 43.9 bits (102), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 53/234 (22%), Positives = 90/234 (38%), Gaps = 61/234 (26%)
Query: 1184 LKKQ--HARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEP 1228
LKK A + + LH ++ LYS F+ ELV NA DD++Y ++ E
Sbjct: 24 LKKHTFEAEVAQLLHLVTHSLYSNSDIFVRELVSNASDACDKLRFEATNDDSLYEDDGEL 83
Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRK 1275
+ + E + + G A+ ++G K G+ A IG+
Sbjct: 84 KIRIAVDEDAKTITFTDNGIGMNEADAIENLGTIAKSGTKAFLDKLSESQKQDGQLIGQF 143
Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFD-----TSEGQIGFVLPTLVPPFNIDMFCRLLSKD 1330
G+GF S F V D + + D S+G F + +
Sbjct: 144 GVGFYSGFIVADTISVETRKAGEPADKGVRWVSDGTGSFTV------------------E 185
Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGI-------AMNNIVDMFSDLHPSLLLFLHR 1377
P+ E++ T I L + ++SEG + +V+ +SD H SL + + +
Sbjct: 186 PITKETR--GTAITLHLKEQYSEGEESYLDRGKIKQLVNKYSD-HISLPIQMRK 236
>gi|254993169|ref|ZP_05275359.1| HSP90 family protein [Listeria monocytogenes FSL J2-064]
Length = 601
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|254823788|ref|ZP_05228789.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL J1-194]
gi|293593010|gb|EFG00771.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL J1-194]
Length = 601
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|255026807|ref|ZP_05298793.1| HSP90 family protein [Listeria monocytogenes FSL J2-003]
Length = 601
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|226223560|ref|YP_002757667.1| heat shock protein HtpG [Listeria monocytogenes serotype 4b str. CLIP
80459]
gi|254852435|ref|ZP_05241783.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL R2-503]
gi|254931325|ref|ZP_05264684.1| histidine kinase domain-containing protein [Listeria monocytogenes
HPB2262]
gi|300764151|ref|ZP_07074146.1| histidine kinase [Listeria monocytogenes FSL N1-017]
gi|386731696|ref|YP_006205192.1| HSP90 family protein [Listeria monocytogenes 07PF0776]
gi|404280495|ref|YP_006681393.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2755]
gi|404286354|ref|YP_006692940.1| chaperone/heat shock protein [Listeria monocytogenes serotype 7 str.
SLCC2482]
gi|405749298|ref|YP_006672764.1| chaperone/heat shock protein [Listeria monocytogenes ATCC 19117]
gi|406703717|ref|YP_006754071.1| chaperone/heat shock protein [Listeria monocytogenes L312]
gi|417317168|ref|ZP_12103791.1| HSP90 family protein [Listeria monocytogenes J1-220]
gi|424822669|ref|ZP_18247682.1| Histidine kinase domain protein [Listeria monocytogenes str. Scott A]
gi|225876022|emb|CAS04728.1| Putative heat shock protein HtpG [Listeria monocytogenes serotype 4b
str. CLIP 80459]
gi|258605743|gb|EEW18351.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL R2-503]
gi|293582875|gb|EFF94907.1| histidine kinase domain-containing protein [Listeria monocytogenes
HPB2262]
gi|300515141|gb|EFK42193.1| histidine kinase [Listeria monocytogenes FSL N1-017]
gi|328475415|gb|EGF46184.1| HSP90 family protein [Listeria monocytogenes J1-220]
gi|332311349|gb|EGJ24444.1| Histidine kinase domain protein [Listeria monocytogenes str. Scott A]
gi|384390454|gb|AFH79524.1| HSP90 family protein [Listeria monocytogenes 07PF0776]
gi|404218498|emb|CBY69862.1| chaperone/heat shock protein [Listeria monocytogenes ATCC 19117]
gi|404227130|emb|CBY48535.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2755]
gi|404245283|emb|CBY03508.1| chaperone/heat shock protein [Listeria monocytogenes serotype 7 str.
SLCC2482]
gi|406360747|emb|CBY67020.1| chaperone/heat shock protein [Listeria monocytogenes L312]
Length = 601
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|34365253|emb|CAE45964.1| hypothetical protein [Homo sapiens]
Length = 731
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 6 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 64
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 65 PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 107
>gi|418475524|ref|ZP_13044916.1| hypothetical protein SMCF_7960 [Streptomyces coelicoflavus ZG0656]
gi|371543860|gb|EHN72628.1| hypothetical protein SMCF_7960 [Streptomyces coelicoflavus ZG0656]
Length = 1042
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L L++ +V N A + +L + S K+ + G +GR G+GF +V VTD P
Sbjct: 75 LRLTLRDGVLVAANTGAPLDATGVESLSTLRASAKRDAGDGSVGRFGVGFAAVLAVTDEP 134
Query: 1290 EIHSNGFHVKFDTSEGQ 1306
+ V++ +E +
Sbjct: 135 AVVGRHGGVRWSLAEAR 151
>gi|404410184|ref|YP_006695772.1| chaperone/heat shock protein [Listeria monocytogenes SLCC5850]
gi|404230010|emb|CBY51414.1| chaperone/heat shock protein [Listeria monocytogenes SLCC5850]
Length = 601
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|254827887|ref|ZP_05232574.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL N3-165]
gi|258600268|gb|EEW13593.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL N3-165]
Length = 601
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|386043267|ref|YP_005962072.1| hypothetical protein [Listeria monocytogenes 10403S]
gi|345536501|gb|AEO05941.1| hypothetical protein LMRG_02040 [Listeria monocytogenes 10403S]
Length = 601
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|47096228|ref|ZP_00233826.1| histidine kinase domain protein [Listeria monocytogenes str. 1/2a
F6854]
gi|254911626|ref|ZP_05261638.1| histidine kinase domain-containing protein [Listeria monocytogenes
J2818]
gi|254935952|ref|ZP_05267649.1| histidine kinase domain-containing protein [Listeria monocytogenes
F6900]
gi|386046604|ref|YP_005964936.1| histidine kinase domain-containing protein [Listeria monocytogenes
J0161]
gi|47015373|gb|EAL06308.1| histidine kinase domain protein [Listeria monocytogenes str. 1/2a
F6854]
gi|258608540|gb|EEW21148.1| histidine kinase domain-containing protein [Listeria monocytogenes
F6900]
gi|293589573|gb|EFF97907.1| histidine kinase domain-containing protein [Listeria monocytogenes
J2818]
gi|345533595|gb|AEO03036.1| histidine kinase domain-containing protein [Listeria monocytogenes
J0161]
Length = 601
Score = 43.9 bits (102), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|386053211|ref|YP_005970769.1| histidine kinase domain-containing protein [Listeria monocytogenes
Finland 1998]
gi|346645862|gb|AEO38487.1| histidine kinase domain-containing protein [Listeria monocytogenes
Finland 1998]
Length = 601
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|47093853|ref|ZP_00231596.1| histidine kinase domain protein [Listeria monocytogenes str. 4b
H7858]
gi|47017762|gb|EAL08552.1| histidine kinase domain protein [Listeria monocytogenes str. 4b
H7858]
Length = 601
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRTRKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
++V +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|16800010|ref|NP_470278.1| HSP90 family protein [Listeria innocua Clip11262]
gi|16413400|emb|CAC96172.1| lin0941 [Listeria innocua Clip11262]
Length = 564
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 30/120 (25%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
L + LS LY + ++ EL+QNA D I + +EPTL T IL
Sbjct: 12 LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIEPTLNGKIHVSLSGEKNNKTLIL 71
Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
+++GI G + + + A L + NS+K + +IGR GIG S F V+D
Sbjct: 72 EDNGI-------GLTEDEVHAFLATIANSSKGEKNFDGQDTNDFIGRFGIGLLSCFIVSD 124
>gi|386039035|ref|YP_005957989.1| chaperone protein HtpG [Paenibacillus polymyxa M1]
gi|343095073|emb|CCC83282.1| chaperone protein HtpG [Paenibacillus polymyxa M1]
Length = 634
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 60/134 (44%), Gaps = 37/134 (27%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPE-----------------NVEPTLT-- 1231
LG + L+ LYS F+ E++QNA D I +E T++
Sbjct: 16 LGGMIDILANHLYSSPRVFIREILQNATDAITARKELRKTEAESRSLKEDNTLEDTVSES 75
Query: 1232 ------FILQESG------IVVLNNEQGFSAENI-RALCDVGNSTKKG-----SSAGYIG 1273
I++ SG +++ +N G S E+I R L +G S+KKG S +IG
Sbjct: 76 NDYEGKVIVELSGAGDQQVLMIQDNGIGLSEEDIHRFLSIIGQSSKKGADLLTSETSFIG 135
Query: 1274 RKGIGFKSVFRVTD 1287
R GIG S F V+D
Sbjct: 136 RFGIGLLSCFMVSD 149
>gi|393786767|ref|ZP_10374899.1| hypothetical protein HMPREF1068_01179 [Bacteroides nordii CL02T12C05]
gi|392658002|gb|EIY51632.1| hypothetical protein HMPREF1068_01179 [Bacteroides nordii CL02T12C05]
Length = 588
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN D I EN + L ++G + +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYSGRIDVYLNDNGSLTFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + YIGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRETPDADDYIGRFGIGLLSCFVVTNEITVES 124
>gi|423248197|ref|ZP_17229213.1| hypothetical protein HMPREF1066_00223 [Bacteroides fragilis
CL03T00C08]
gi|423253146|ref|ZP_17234077.1| hypothetical protein HMPREF1067_00721 [Bacteroides fragilis
CL03T12C07]
gi|392657046|gb|EIY50683.1| hypothetical protein HMPREF1067_00721 [Bacteroides fragilis
CL03T12C07]
gi|392660304|gb|EIY53918.1| hypothetical protein HMPREF1066_00223 [Bacteroides fragilis
CL03T00C08]
Length = 588
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|310639839|ref|YP_003944597.1| hypothetical protein [Paenibacillus polymyxa SC2]
gi|309244789|gb|ADO54356.1| ATPase, histidine kinase-, DNA gyrase B-, and HSP90-like domain
protein [Paenibacillus polymyxa SC2]
Length = 631
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 60/134 (44%), Gaps = 37/134 (27%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPE-----------------NVEPTLT-- 1231
LG + L+ LYS F+ E++QNA D I +E T++
Sbjct: 13 LGGMIDILANHLYSSPRVFIREILQNATDAITARKELRKTEAESRSLKEDNTLEDTVSES 72
Query: 1232 ------FILQESG------IVVLNNEQGFSAENI-RALCDVGNSTKKG-----SSAGYIG 1273
I++ SG +++ +N G S E+I R L +G S+KKG S +IG
Sbjct: 73 NDYEGKVIVELSGAGDQQVLMIQDNGIGLSEEDIHRFLSIIGQSSKKGADLLTSETSFIG 132
Query: 1274 RKGIGFKSVFRVTD 1287
R GIG S F V+D
Sbjct: 133 RFGIGLLSCFMVSD 146
>gi|53711528|ref|YP_097520.1| heat shock protein 90 [Bacteroides fragilis YCH46]
gi|52214393|dbj|BAD46986.1| heat shock protein HtpG [Bacteroides fragilis YCH46]
Length = 588
Score = 43.9 bits (102), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|375356574|ref|YP_005109345.1| putative heat-shock protein [Bacteroides fragilis 638R]
gi|383116538|ref|ZP_09937286.1| hypothetical protein BSHG_1389 [Bacteroides sp. 3_2_5]
gi|251948188|gb|EES88470.1| hypothetical protein BSHG_1389 [Bacteroides sp. 3_2_5]
gi|301161254|emb|CBW20792.1| putative heat-shock protein [Bacteroides fragilis 638R]
Length = 588
Score = 43.9 bits (102), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|60679789|ref|YP_209933.1| HSP90 family protein [Bacteroides fragilis NCTC 9343]
gi|60491223|emb|CAH05971.1| putative heat-shock protein [Bacteroides fragilis NCTC 9343]
Length = 588
Score = 43.5 bits (101), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|297200503|ref|ZP_06917900.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|297147668|gb|EFH28702.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length = 1083
Score = 43.5 bits (101), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 46/217 (21%), Positives = 85/217 (39%), Gaps = 32/217 (14%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L L+E + N A + +L + S K+ ++ G +GR G+GF +V VTD P
Sbjct: 75 LRLTLREGVLFAANTGAPLDAAGVESLSTLRASAKRDNADGSVGRFGVGFAAVLAVTDEP 134
Query: 1290 EIHSNGFHVKFDTSE-----GQIGFVLPTL----------VPPFNIDMFCRLLSKDPVQL 1334
+ V++ +E + P L VP + + DP
Sbjct: 135 AVVGRHGGVRWSLAEARDHAAETARHSPGLGDEMRRRDGHVPLLRLPFAAEGTAPDP--- 191
Query: 1335 ESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVI 1394
++T + LP R + +A +++ D +LLL L L+ ++ D+ I
Sbjct: 192 ----YDTVVILPLRDTAAADLA-ERLLNAVDD---ALLLALPGLEEVVIE---TDTTRTI 240
Query: 1395 RKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
++ G+ + + W A+ G + PD+
Sbjct: 241 SRRTEGELTVVEDSRDGTTRWRTAAAH---GQLTPDL 274
>gi|265764922|ref|ZP_06093197.1| heat shock protein HtpG [Bacteroides sp. 2_1_16]
gi|263254306|gb|EEZ25740.1| heat shock protein HtpG [Bacteroides sp. 2_1_16]
Length = 588
Score = 43.5 bits (101), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|423269904|ref|ZP_17248876.1| hypothetical protein HMPREF1079_01958 [Bacteroides fragilis
CL05T00C42]
gi|423272641|ref|ZP_17251588.1| hypothetical protein HMPREF1080_00241 [Bacteroides fragilis
CL05T12C13]
gi|392700750|gb|EIY93912.1| hypothetical protein HMPREF1079_01958 [Bacteroides fragilis
CL05T00C42]
gi|392708718|gb|EIZ01823.1| hypothetical protein HMPREF1080_00241 [Bacteroides fragilis
CL05T12C13]
Length = 588
Score = 43.5 bits (101), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|336407723|ref|ZP_08588219.1| hypothetical protein HMPREF1018_00234 [Bacteroides sp. 2_1_56FAA]
gi|335944802|gb|EGN06619.1| hypothetical protein HMPREF1018_00234 [Bacteroides sp. 2_1_56FAA]
Length = 588
Score = 43.5 bits (101), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|390166348|ref|ZP_10218611.1| heat shock protein 90 [Sphingobium indicum B90A]
gi|389590745|gb|EIM68730.1| heat shock protein 90 [Sphingobium indicum B90A]
Length = 626
Score = 43.5 bits (101), Expect = 1.2, Method: Composition-based stats.
Identities = 35/132 (26%), Positives = 55/132 (41%), Gaps = 27/132 (20%)
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLTFILQ 1235
A + R LH + +YS FL EL+ NA D + ++ +P +T L
Sbjct: 16 ADVARLLHMMVHSVYSDKDVFLRELISNAADACEKLRYELLSDPALAGDDGQPRITVTLD 75
Query: 1236 ESG--IVVLNNEQGFSAENI------------RALCDVGNSTKKGSSAGYIGRKGIGFKS 1281
++V +N G S + RA + + K+G A IG+ G+GF S
Sbjct: 76 PEARRLIVEDNGIGMSEAELAEALGTIARSGTRAFMERVTAAKEGEGAQLIGQFGVGFYS 135
Query: 1282 VFRVTDAPEIHS 1293
F V D ++ S
Sbjct: 136 AFMVADKVDVFS 147
>gi|294012641|ref|YP_003546101.1| molecular chaperone HtpG [Sphingobium japonicum UT26S]
gi|292675971|dbj|BAI97489.1| molecular chaperone HtpG [Sphingobium japonicum UT26S]
Length = 626
Score = 43.5 bits (101), Expect = 1.2, Method: Composition-based stats.
Identities = 35/132 (26%), Positives = 55/132 (41%), Gaps = 27/132 (20%)
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLTFILQ 1235
A + R LH + +YS FL EL+ NA D + ++ +P +T L
Sbjct: 16 ADVARLLHMMVHSVYSDKDVFLRELISNAADACEKLRYELLSDPALAGDDGQPRITVTLD 75
Query: 1236 ESG--IVVLNNEQGFSAENI------------RALCDVGNSTKKGSSAGYIGRKGIGFKS 1281
++V +N G S + RA + + K+G A IG+ G+GF S
Sbjct: 76 PEARRLIVEDNGIGMSEAELAEALGTIARSGTRAFMERVTAAKEGEGAQLIGQFGVGFYS 135
Query: 1282 VFRVTDAPEIHS 1293
F V D ++ S
Sbjct: 136 AFMVADKVDVFS 147
>gi|227495919|ref|ZP_03926230.1| HSP90 family heat shock protein [Actinomyces urogenitalis DSM 15434]
gi|226834535|gb|EEH66918.1| HSP90 family heat shock protein [Actinomyces urogenitalis DSM 15434]
Length = 609
Score = 43.5 bits (101), Expect = 1.2, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 60/132 (45%), Gaps = 18/132 (13%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI-----YPENVEPTLTFILQ-- 1235
M + LG + LS+ LYS +L EL+QNA D + PE + + L+
Sbjct: 1 MADRFQVDLGGMVDLLSRHLYSGPQVYLRELLQNAVDAMTARGQLPEPGQEPMRVRLRTG 60
Query: 1236 -----ESGIVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAG-----YIGRKGIGFKSVFR 1284
+ + V + G +A+ R L +G S+K+ S G YIG+ GIG + F
Sbjct: 61 TAPDGQPTLSVTDTGIGLTADQARELLATIGRSSKRDSLLGEGRREYIGQFGIGMLAAFM 120
Query: 1285 VTDAPEIHSNGF 1296
V D E+ S +
Sbjct: 121 VADEIEVISRSY 132
>gi|302825729|ref|XP_002994455.1| hypothetical protein SELMODRAFT_432376 [Selaginella moellendorffii]
gi|300137604|gb|EFJ04482.1| hypothetical protein SELMODRAFT_432376 [Selaginella moellendorffii]
Length = 884
Score = 43.5 bits (101), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 142/346 (41%), Gaps = 58/346 (16%)
Query: 1459 VFAFLPLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPC 1517
+F+FLP + GL F++ DF+L SSR + +S WNQ + P +F +A +
Sbjct: 161 IFSFLPSDIHSGLPFLINSDFLLVSSRATLLFDSSWNQGIFRCVPEVFYNAFELLLNTTA 220
Query: 1518 FR-ENPAKAASVYM-SFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPP--- 1572
F ++ A + M +VP+ + + + ILSKL+ +N ++ + + APP
Sbjct: 221 FGLQSILSVARMAMYKYVPVNVCRNSYLERVRNDILSKLKDNNVILCASDVGR-APPSGQ 279
Query: 1573 -CKVLRGWNDRAHSLLP---DILLQKHLGLGFL--SKDIVLSDSLARALGIEEYGPKILL 1626
CK A + P ++L FL S ++ +L E YG + L
Sbjct: 280 LCK-----PALARKIAPSFRELLKAAPKCPRFLRGSPSYIVDSTLQ-----ENYGSVLRL 329
Query: 1627 QIISSLCRTENGLRSMGLSWLASWLNELYT--ISFHSSGQSSLQS-------GVETDL-- 1675
+ + E W+ S ++ Y + F SS S+L+S G + L
Sbjct: 330 LGLPDMSNDEYHQSLDSEDWMRSLPDDAYVKLLVFLSSKSSTLRSLKFLKYRGSDGRLQL 389
Query: 1676 --IDNLQRIPFIPLSDGTFSSVDEG---TIWLHSDCSVFDGGFGLEAFPNLC-AKLRT-- 1727
+D+L R +DG D G T WL VF P+ C AK+R
Sbjct: 390 ASMDDLAR------NDGRSMYWDSGHRATAWLAGWTEVFQRWLTFNFVPSACAAKIRQHS 443
Query: 1728 ----------VSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQL 1763
S + L+A ++ + LG ++ ++ N L + + L
Sbjct: 444 VLENWLERTPFSVSELTACSLARQVLGAVTANSSNAKLAVLAMHFL 489
>gi|149277937|ref|ZP_01884076.1| putative heat-shock protein [Pedobacter sp. BAL39]
gi|149231135|gb|EDM36515.1| putative heat-shock protein [Pedobacter sp. BAL39]
Length = 594
Score = 43.5 bits (101), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 57/120 (47%), Gaps = 12/120 (10%)
Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFI 1233
ME N Q L + LS+ LYS + F+ EL+QN D I E+ E L
Sbjct: 1 MEDNQYLFQ-VNLKGMISLLSEHLYSNPNTFVRELLQNGIDAITALKTLDEHHEGALNVH 59
Query: 1234 LQESG---IVVLNNEQGFSAENI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTD 1287
L E G +V +N G + R L +G S+K+ + + YIG+ GIG S F V+D
Sbjct: 60 LPEEGQPELVFEDNGIGLKEDEFHRFLAVIGESSKRDAFEAKDYIGKFGIGLLSCFVVSD 119
>gi|405955306|gb|EKC22471.1| Sacsin [Crassostrea gigas]
Length = 3200
Score = 43.5 bits (101), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 83/214 (38%), Gaps = 39/214 (18%)
Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPT-----LTFILQESGIVVLNNEQG 1247
Y D L EL+QNA+D + P EP+ + +VV NN +
Sbjct: 177 YPDDGQILKELIQNAEDAGASEMKILYDDRPAVQEPSTKRAPFRKYFKGPALVVYNNAEF 236
Query: 1248 FSAE--NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG 1305
A+ I+ L S+ K +GR G+GFKSVF +TD P I S + D +
Sbjct: 237 IEADWTGIKMLY----SSIKEFDKTKVGRFGLGFKSVFHITDHPVIISGDQLLVLDPHQD 292
Query: 1306 QIGFVLPTLVPPFN----------IDMFCRLLSKDPVQLESKCWN-TCIRLPFR---TKF 1351
+ + + F + D LE +N T R P R T+
Sbjct: 293 SSKVCQTMKLKKLHRYKKMKVEDCLKAFSGVFGFDQNTLECGHFNGTIFRFPLRQEETEL 352
Query: 1352 SEGIAMNNIVD----MFSDLHPSLLLFLHRLQCI 1381
S+ I + VD F D P LLFL L+ I
Sbjct: 353 SDNIYDKSKVDDLFMSFKDEAPVSLLFLKCLESI 386
>gi|441470598|emb|CCQ20353.1| Chaperone protein htpG [Listeria monocytogenes]
gi|441473731|emb|CCQ23485.1| Chaperone protein htpG [Listeria monocytogenes N53-1]
Length = 433
Score = 43.5 bits (101), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 54/122 (44%), Gaps = 16/122 (13%)
Query: 1182 NMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTF 1232
N + L + LS LY + ++ EL+QNA D I + +LT
Sbjct: 3 NYSHRFQVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTG 62
Query: 1233 ILQESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRV 1285
E ++V +N G + + + A L + NS+K SS +IGR GIG S F V
Sbjct: 63 DNNEKTLIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIV 122
Query: 1286 TD 1287
+D
Sbjct: 123 SD 124
>gi|167034745|ref|YP_001669976.1| heat shock protein 90 [Pseudomonas putida GB-1]
gi|166861233|gb|ABY99640.1| heat shock protein Hsp90 [Pseudomonas putida GB-1]
Length = 634
Score = 43.5 bits (101), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 90/222 (40%), Gaps = 59/222 (26%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDN---------IYPENVEPTLTFIL-----QESGIV 1240
LH + LYS FL EL+ NA D PE +E + +++G V
Sbjct: 20 LHLMIHSLYSNKEIFLRELISNASDAADKLRFEALAKPELLEGDADLKIRLSFDKDAGTV 79
Query: 1241 VL-NNEQGFSAENIRALCDVGNSTKKGSS-------------AGYIGRKGIGFKSVFRVT 1286
L +N G S E++ + +G K G++ + IG+ G+GF S F V
Sbjct: 80 TLEDNGIGMSREDV--IAHLGTIAKSGTADFMKNLTGDQKKDSHLIGQFGVGFYSAFIVA 137
Query: 1287 DAPEIHSN--------GFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
D +++S G H +S+G+ F + T+ P L KD ++
Sbjct: 138 DKVDVYSRRAGQPAAEGVHW---SSKGEGEFEVATIDKPQRGTRIVLHLKKDELE----- 189
Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
F++G + N+V +SD H +L + L + Q
Sbjct: 190 ------------FADGWRLRNVVKKYSD-HIALPIQLPKEQA 218
>gi|398378376|ref|ZP_10536539.1| molecular chaperone of HSP90 family [Rhizobium sp. AP16]
gi|397724975|gb|EJK85433.1| molecular chaperone of HSP90 family [Rhizobium sp. AP16]
Length = 629
Score = 43.5 bits (101), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 61/142 (42%), Gaps = 30/142 (21%)
Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENV 1226
ES++ + ARL LH + +YS FL EL+ NA D ++ +
Sbjct: 11 ESHVFEADVARL---LHMMVHSVYSDKDVFLRELISNAADACEKLRYEAIATPSLLASDP 67
Query: 1227 EPTLTFILQESG--IVVLNNEQGFSA-ENIRALCDVGNS-----------TKKGSSAGYI 1272
+ + L E G +VV +N G S E I AL + S K+G A I
Sbjct: 68 DSRILLTLDEEGSRLVVEDNGIGMSRDEMIEALGTIARSGTRAFMERIEANKQGEGAQLI 127
Query: 1273 GRKGIGFKSVFRVTDAPEIHSN 1294
G+ G+GF S F V D ++ S
Sbjct: 128 GQFGVGFYSCFMVADRVDVVSR 149
>gi|16802982|ref|NP_464467.1| HSP90 family protein [Listeria monocytogenes EGD-e]
gi|386049870|ref|YP_005967861.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL R2-561]
gi|404283385|ref|YP_006684282.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2372]
gi|405757941|ref|YP_006687217.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2479]
gi|16410344|emb|CAC99020.1| lmo0942 [Listeria monocytogenes EGD-e]
gi|346423716|gb|AEO25241.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL R2-561]
gi|404232887|emb|CBY54290.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2372]
gi|404235823|emb|CBY57225.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2479]
Length = 601
Score = 43.5 bits (101), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
L + LS LY + ++ EL+QNA D I + +LT E
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68
Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
+++ +N G + + + A L + NS+K SS +IGR GIG S F V+D
Sbjct: 69 LIIEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124
>gi|119628728|gb|EAX08323.1| spastic ataxia of Charlevoix-Saguenay (sacsin) [Homo sapiens]
Length = 1313
Score = 43.5 bits (101), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 47/105 (44%), Gaps = 15/105 (14%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
Y + L EL+QNA+D +Y E T T Q + V NN F+
Sbjct: 658 YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 716
Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
E+ + ++ S KK +GR GIGF SV+ +TD P I S
Sbjct: 717 PEDWHGIQEIARSRKKDDPL-KVGRFGIGFNSVYHITDVPCIFSG 760
>gi|222082149|ref|YP_002541514.1| heat shock protein 90 [Agrobacterium radiobacter K84]
gi|221726828|gb|ACM29917.1| heat shock chaperone protein [Agrobacterium radiobacter K84]
Length = 629
Score = 43.5 bits (101), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 60/142 (42%), Gaps = 30/142 (21%)
Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI----YPENVEPTLT---- 1231
ES++ + ARL LH + +YS FL EL+ NA D Y P+L
Sbjct: 11 ESHVFEADVARL---LHMMVHSVYSDKDVFLRELISNAADACEKLRYEAIATPSLLASDP 67
Query: 1232 -----FILQESG--IVVLNNEQGFSA-ENIRALCDVGNS-----------TKKGSSAGYI 1272
L E G +VV +N G S E I AL + S K+G A I
Sbjct: 68 DSRILLTLDEEGSRLVVEDNGIGMSRDEMIEALGTIARSGTRAFMERIEANKQGEGAQLI 127
Query: 1273 GRKGIGFKSVFRVTDAPEIHSN 1294
G+ G+GF S F V D ++ S
Sbjct: 128 GQFGVGFYSCFMVADRVDVVSR 149
>gi|315302374|ref|ZP_07873248.1| histidine kinase domain-containing protein [Listeria ivanovii FSL
F6-596]
gi|313629261|gb|EFR97520.1| histidine kinase domain-containing protein [Listeria ivanovii FSL
F6-596]
Length = 600
Score = 43.5 bits (101), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 56/120 (46%), Gaps = 30/120 (25%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
L + LS LY + ++ EL+QNA D I + +EP + T IL
Sbjct: 11 LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLEPEIEGEIHVSLSGKPDEKTLIL 70
Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK-----KGSSAG-YIGRKGIGFKSVFRVTD 1287
+++GI G + E + A L + NS+K +G A +IGR GIG S F V+D
Sbjct: 71 EDNGI-------GLTEEEVHAFLATIANSSKGEKKFEGQEANDFIGRFGIGLLSCFIVSD 123
>gi|416356103|ref|ZP_11681929.1| hypothetical protein CBCST_12012 [Clostridium botulinum C str.
Stockholm]
gi|338195095|gb|EGO87427.1| hypothetical protein CBCST_12012 [Clostridium botulinum C str.
Stockholm]
Length = 515
Score = 43.1 bits (100), Expect = 1.5, Method: Composition-based stats.
Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 9/87 (10%)
Query: 1212 ELVQNADDNIYPENVEPTLTFILQESG----IVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
E+VQNA+DN +N L IL + G + + + GFS ENI + TK+ +
Sbjct: 94 EIVQNANDNTTSDN----LDIILSKKGNIYEMTLQYEDIGFSVENIVGFFNTEIHTKRDN 149
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
+ G+ G+G KS+F D +I SN
Sbjct: 150 FSA-TGKHGVGIKSLFYFVDYMKIESN 175
>gi|350589712|ref|XP_003357851.2| PREDICTED: sacsin [Sus scrofa]
Length = 1980
Score = 43.1 bits (100), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 19/123 (15%)
Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
E Y S EL+QNADD +I ++P + + NN
Sbjct: 1336 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNLDIRESLLDPGMA-ACHGPALWSFNN 1394
Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
+ FS + + +G S K+G +G+ G+GF SV+ +TD P I S F + FD +
Sbjct: 1395 SE-FSDADFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1452
Query: 1305 GQI 1307
I
Sbjct: 1453 NHI 1455
>gi|456822983|gb|EMF71453.1| GHKL domain protein [Leptospira interrogans serovar Canicola str.
LT1962]
Length = 603
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|290893499|ref|ZP_06556483.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL J2-071]
gi|404407395|ref|YP_006690110.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2376]
gi|290557000|gb|EFD90530.1| histidine kinase domain-containing protein [Listeria monocytogenes
FSL J2-071]
gi|404241544|emb|CBY62944.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2376]
Length = 601
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 30/123 (24%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------T 1231
L + LS LY + ++ EL+QNA D I + + PTL T
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLTPTLEGKILVSLTGDTNEKT 68
Query: 1232 FILQESGIVVLNNEQGFSAENIRA-LCDVGNSTKKGS------SAGYIGRKGIGFKSVFR 1284
IL+++GI G + E + A L + NS+K S +IGR GIG S F
Sbjct: 69 LILEDNGI-------GLTEEEVHAFLATIANSSKGEKDFDGERSNDFIGRFGIGLLSCFI 121
Query: 1285 VTD 1287
V+D
Sbjct: 122 VSD 124
>gi|148745189|gb|AAI42917.1| LOC555303 protein [Danio rerio]
Length = 304
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 49/111 (44%), Gaps = 13/111 (11%)
Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQE-----SGIVVLNNEQGFSA 1250
Y L EL+QNADD I+ E T + +E + N+ F+
Sbjct: 31 YPDGGQILKELIQNADDAGASTVVFIHDERHYETHSLWTKELRKYQGPALYAFNDAAFTE 90
Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
E+ + VG S K+ +GR GIGF SV+ +TD P + S+ FD
Sbjct: 91 EDWEGIQRVGRSIKQDDPTK-VGRFGIGFNSVYHITDLPCVFSSEHLAIFD 140
>gi|24213931|ref|NP_711412.1| HSP90 family protein [Leptospira interrogans serovar Lai str. 56601]
gi|45658311|ref|YP_002397.1| HSP90 family protein [Leptospira interrogans serovar Copenhageni str.
Fiocruz L1-130]
gi|386073466|ref|YP_005987783.1| protein HtpG [Leptospira interrogans serovar Lai str. IPAV]
gi|421084773|ref|ZP_15545629.1| GHKL domain protein [Leptospira santarosai str. HAI1594]
gi|421103352|ref|ZP_15563952.1| GHKL domain protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|24194785|gb|AAN48430.1| HtpG [Leptospira interrogans serovar Lai str. 56601]
gi|45601553|gb|AAS71034.1| HtpG [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130]
gi|353457255|gb|AER01800.1| HtpG [Leptospira interrogans serovar Lai str. IPAV]
gi|410367098|gb|EKP22486.1| GHKL domain protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410432723|gb|EKP77078.1| GHKL domain protein [Leptospira santarosai str. HAI1594]
gi|455790481|gb|EMF42343.1| GHKL domain protein [Leptospira interrogans serovar Lora str. TE
1992]
Length = 603
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|418709533|ref|ZP_13270319.1| GHKL domain protein [Leptospira interrogans serovar Grippotyphosa
str. UI 08368]
gi|418732663|ref|ZP_13290390.1| GHKL domain protein [Leptospira interrogans str. UI 12758]
gi|410769768|gb|EKR44995.1| GHKL domain protein [Leptospira interrogans serovar Grippotyphosa
str. UI 08368]
gi|410773443|gb|EKR53471.1| GHKL domain protein [Leptospira interrogans str. UI 12758]
gi|456973160|gb|EMG13410.1| GHKL domain protein [Leptospira interrogans serovar Grippotyphosa
str. LT2186]
Length = 603
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|417763687|ref|ZP_12411664.1| GHKL domain protein [Leptospira interrogans str. 2002000624]
gi|417773990|ref|ZP_12421865.1| GHKL domain protein [Leptospira interrogans str. 2002000621]
gi|418675150|ref|ZP_13236442.1| GHKL domain protein [Leptospira interrogans str. 2002000623]
gi|409940506|gb|EKN86146.1| GHKL domain protein [Leptospira interrogans str. 2002000624]
gi|410576461|gb|EKQ39468.1| GHKL domain protein [Leptospira interrogans str. 2002000621]
gi|410577722|gb|EKQ45591.1| GHKL domain protein [Leptospira interrogans str. 2002000623]
Length = 603
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|217964958|ref|YP_002350636.1| HSP90 family protein [Listeria monocytogenes HCC23]
gi|386007672|ref|YP_005925950.1| chaperone/heat shock protein [Listeria monocytogenes L99]
gi|386026265|ref|YP_005947041.1| putative chaperonin (heat shock protein Hsp90) [Listeria
monocytogenes M7]
gi|217334228|gb|ACK40022.1| histidine kinase domain protein [Listeria monocytogenes HCC23]
gi|307570482|emb|CAR83661.1| chaperone/heat shock protein [Listeria monocytogenes L99]
gi|336022846|gb|AEH91983.1| putative chaperonin (heat shock protein Hsp90) [Listeria
monocytogenes M7]
Length = 601
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 30/123 (24%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------T 1231
L + LS LY + ++ EL+QNA D I + + PTL T
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLTPTLEGKILVSLTGDTNEKT 68
Query: 1232 FILQESGIVVLNNEQGFSAENIRA-LCDVGNSTKKGS------SAGYIGRKGIGFKSVFR 1284
IL+++GI G + E + A L + NS+K S +IGR GIG S F
Sbjct: 69 LILEDNGI-------GLTEEEVHAFLATIANSSKGEKDFDGERSNDFIGRFGIGLLSCFI 121
Query: 1285 VTD 1287
V+D
Sbjct: 122 VSD 124
>gi|421122696|ref|ZP_15582979.1| GHKL domain protein [Leptospira interrogans str. Brem 329]
gi|410344596|gb|EKO95762.1| GHKL domain protein [Leptospira interrogans str. Brem 329]
Length = 603
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|71066274|ref|YP_265001.1| heat shock protein 90 [Psychrobacter arcticus 273-4]
gi|88909149|sp|Q4FQZ1.1|HTPG_PSYA2 RecName: Full=Chaperone protein HtpG; AltName: Full=Heat shock
protein HtpG; AltName: Full=High temperature protein G
gi|71039259|gb|AAZ19567.1| probable chaperone protein htpG (Hsp90) [Psychrobacter arcticus
273-4]
Length = 656
Score = 43.1 bits (100), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 56/138 (40%), Gaps = 28/138 (20%)
Query: 1184 LKKQ--HARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEP 1228
LKK A + + LH ++ LYS F+ ELV NA DD++Y ++ E
Sbjct: 14 LKKHTFEAEVAQLLHLVTHSLYSNSDIFVRELVSNASDACDKLRFEATNDDSLYEDDGEL 73
Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRK 1275
+ + E + + G A+ ++G K G+ A IG+
Sbjct: 74 RIRIAVDEDAKTITFTDNGIGMNEADAIENLGTIAKSGTKAFLDKLSDSQKQDGQLIGQF 133
Query: 1276 GIGFKSVFRVTDAPEIHS 1293
G+GF S F V D + +
Sbjct: 134 GVGFYSGFIVADTISVET 151
>gi|423259412|ref|ZP_17240335.1| hypothetical protein HMPREF1055_02612 [Bacteroides fragilis
CL07T00C01]
gi|423263614|ref|ZP_17242617.1| hypothetical protein HMPREF1056_00304 [Bacteroides fragilis
CL07T12C05]
gi|387776992|gb|EIK39092.1| hypothetical protein HMPREF1055_02612 [Bacteroides fragilis
CL07T00C01]
gi|392707036|gb|EIZ00156.1| hypothetical protein HMPREF1056_00304 [Bacteroides fragilis
CL07T12C05]
Length = 588
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G +V +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR G+G S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGVGLLSCFVVTNEIRVES 124
>gi|418667447|ref|ZP_13228858.1| GHKL domain protein [Leptospira interrogans serovar Pyrogenes str.
2006006960]
gi|410756618|gb|EKR18237.1| GHKL domain protein [Leptospira interrogans serovar Pyrogenes str.
2006006960]
Length = 603
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|422809027|ref|ZP_16857438.1| heat shock protein HtpG [Listeria monocytogenes FSL J1-208]
gi|378752641|gb|EHY63226.1| heat shock protein HtpG [Listeria monocytogenes FSL J1-208]
Length = 601
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 30/123 (24%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------T 1231
L + LS LY + ++ EL+QNA D I + + PTL T
Sbjct: 9 QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLTPTLEGKILVSLTGDTNEKT 68
Query: 1232 FILQESGIVVLNNEQGFSAENIRA-LCDVGNSTKKGS------SAGYIGRKGIGFKSVFR 1284
IL+++GI G + E + A L + NS+K S +IGR GIG S F
Sbjct: 69 LILEDNGI-------GLTEEEVHAFLATIANSSKGEKDFDGEHSNDFIGRFGIGLLSCFI 121
Query: 1285 VTD 1287
V+D
Sbjct: 122 VSD 124
>gi|299822814|ref|ZP_07054700.1| histidine kinase domain protein [Listeria grayi DSM 20601]
gi|299816343|gb|EFI83581.1| histidine kinase domain protein [Listeria grayi DSM 20601]
Length = 597
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 56/125 (44%), Gaps = 18/125 (14%)
Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTLTFIL--- 1234
ME NM K LG + LS LY ++ EL+QNA D I + E +LT +
Sbjct: 1 MEENMRFK--VNLGGMIDILSNHLYDSKDVYIRELLQNATDAIRARKKNESSLTGEIAVS 58
Query: 1235 ----QESGIVVL--NNEQGFSAENIRALCDVGNSTKKGSSA------GYIGRKGIGFKSV 1282
+ G V++ +N G + E + S+ KGS YIGR GIG S
Sbjct: 59 LNTSDQGGSVIIFEDNGIGLTTEEVHEFLATIASSSKGSKRFQIEENDYIGRFGIGLLSC 118
Query: 1283 FRVTD 1287
F V+D
Sbjct: 119 FIVSD 123
>gi|414083827|ref|YP_006992535.1| chaperone/heat shock protein [Carnobacterium maltaromaticum LMA28]
gi|412997411|emb|CCO11220.1| chaperone/heat shock protein [Carnobacterium maltaromaticum LMA28]
Length = 497
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 51/126 (40%), Gaps = 19/126 (15%)
Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV--------EPT 1229
N E N K LG + LS LYSQ ++ EL+QNA D I + E
Sbjct: 2 NPEENRFK---VNLGGMIEILSNHLYSQKDVYIRELLQNATDAIRGRKLISEDNFVGEIH 58
Query: 1230 LTFILQESGIVVL---NNEQGFSAENIRALCDVGNSTKKG-----SSAGYIGRKGIGFKS 1281
+ + SG L +N G + E I S+ KG +IGR GIG S
Sbjct: 59 VDLTIHRSGSSTLFFEDNGIGLTEEEIHQFLATIASSSKGEKNFEEKQDFIGRFGIGLLS 118
Query: 1282 VFRVTD 1287
F V D
Sbjct: 119 CFIVAD 124
>gi|418689769|ref|ZP_13250888.1| GHKL domain protein [Leptospira interrogans str. FPW2026]
gi|400360958|gb|EJP16927.1| GHKL domain protein [Leptospira interrogans str. FPW2026]
Length = 603
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|345885642|ref|ZP_08837003.1| hypothetical protein HMPREF0666_03179 [Prevotella sp. C561]
gi|345045035|gb|EGW48984.1| hypothetical protein HMPREF0666_03179 [Prevotella sp. C561]
Length = 320
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 5/75 (6%)
Query: 1429 PD--VKTTEIALALTLQESNEGNYGPLLYQQ-PVFAFLPLRTYGLKF--ILQGDFVLPSS 1483
PD TE A++ ++ + EG+ P+ + ++A+LP+ + KF L DF+ S
Sbjct: 85 PDRIASATETAISFAIRLNEEGHVMPIGKDELSLYAYLPMNEHRFKFPFYLNADFIPKSD 144
Query: 1484 REEVDGNSPWNQWLL 1498
RE + +PWN +L
Sbjct: 145 REGIQSENPWNYFLF 159
>gi|21222723|ref|NP_628502.1| hypothetical protein SCO4331 [Streptomyces coelicolor A3(2)]
gi|8249984|emb|CAB93405.1| hypothetical protein SCD12A.14 [Streptomyces coelicolor A3(2)]
Length = 1042
Score = 43.1 bits (100), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L L++ +V N A + +L + S K+ + G +GR G+GF +V VTD P
Sbjct: 75 LRLTLRDGVLVAANTGAPLDATGVESLSTLRASAKRDAVDGSVGRFGVGFAAVLAVTDEP 134
Query: 1290 EIHSNGFHVKFDTSEGQ 1306
+ V++ +E +
Sbjct: 135 AVVGRHGGVRWSLAEAR 151
>gi|334343983|ref|YP_004552535.1| chaperone protein htpG [Sphingobium chlorophenolicum L-1]
gi|334100605|gb|AEG48029.1| Chaperone protein htpG [Sphingobium chlorophenolicum L-1]
Length = 626
Score = 43.1 bits (100), Expect = 1.9, Method: Composition-based stats.
Identities = 38/132 (28%), Positives = 57/132 (43%), Gaps = 27/132 (20%)
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLTFILQ 1235
A + R LH + +YS FL EL+ NA D + E+ +P +T L
Sbjct: 16 ADVARLLHMMVHSVYSDKDVFLRELISNAADACEKLRYELLSDPALAGEDGQPRITVTLD 75
Query: 1236 ESG--IVVLNNEQGFS----AENI--------RALCDVGNSTKKGSSAGYIGRKGIGFKS 1281
++V +N G S AE + RA + + K+G A IG+ G+GF S
Sbjct: 76 PDTRQLIVEDNGIGMSEVELAEALGTIARSGTRAFMERVAAAKEGEGAQLIGQFGVGFYS 135
Query: 1282 VFRVTDAPEIHS 1293
F V D ++ S
Sbjct: 136 AFMVADRVDVFS 147
>gi|289770094|ref|ZP_06529472.1| conserved hypothetical protein [Streptomyces lividans TK24]
gi|289700293|gb|EFD67722.1| conserved hypothetical protein [Streptomyces lividans TK24]
Length = 1047
Score = 43.1 bits (100), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 37/77 (48%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L L++ +V N A + +L + S K+ + G +GR G+GF +V VTD P
Sbjct: 80 LRLTLRDGVLVAANTGAPLDATGVESLSTLRASAKRDAVDGSVGRFGVGFAAVLAVTDEP 139
Query: 1290 EIHSNGFHVKFDTSEGQ 1306
+ V++ +E +
Sbjct: 140 AVVGRHGGVRWSLAEAR 156
>gi|400288856|ref|ZP_10790888.1| heat shock protein 90 [Psychrobacter sp. PAMC 21119]
Length = 661
Score = 43.1 bits (100), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 55/138 (39%), Gaps = 28/138 (20%)
Query: 1184 LKKQ--HARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEP 1228
LKK A + + LH ++ LYS F+ ELV NA DD++Y + E
Sbjct: 14 LKKHTFEAEVAQLLHLVTHSLYSNADIFVRELVSNASDACDKLRFEGTNDDSLYENDGEL 73
Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRK 1275
+ + E + + G A+ ++G K G+ A IG+
Sbjct: 74 KIRIAVDEDAKTITFTDNGIGMNEADAIENLGTIAKSGTKAFLDKLSESQKQEGQLIGQF 133
Query: 1276 GIGFKSVFRVTDAPEIHS 1293
G+GF S F V D + S
Sbjct: 134 GVGFYSGFIVADTISVES 151
>gi|418724257|ref|ZP_13283077.1| GHKL domain protein [Leptospira interrogans str. UI 12621]
gi|409962206|gb|EKO25945.1| GHKL domain protein [Leptospira interrogans str. UI 12621]
Length = 603
Score = 43.1 bits (100), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|417785783|ref|ZP_12433485.1| GHKL domain protein [Leptospira interrogans str. C10069]
gi|409951124|gb|EKO05641.1| GHKL domain protein [Leptospira interrogans str. C10069]
Length = 603
Score = 43.1 bits (100), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L I
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G S G+IG+ G+G S F V+D
Sbjct: 61 KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120
>gi|443626565|ref|ZP_21110981.1| hypothetical protein STVIR_4886 [Streptomyces viridochromogenes
Tue57]
gi|443339870|gb|ELS54096.1| hypothetical protein STVIR_4886 [Streptomyces viridochromogenes
Tue57]
Length = 1044
Score = 42.7 bits (99), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/77 (27%), Positives = 36/77 (46%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L L+E + N A + +L + S K+ + G +GR G+GF +V VTD P
Sbjct: 75 LRLTLREGVLFAANTGAPLDAAGVESLATLRASAKRENQQGAVGRFGVGFAAVLSVTDEP 134
Query: 1290 EIHSNGFHVKFDTSEGQ 1306
+ V++ +E +
Sbjct: 135 AVVGRHGGVRWSLAEAR 151
>gi|261415453|ref|YP_003249136.1| hypothetical protein Fisuc_1049 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261371909|gb|ACX74654.1| hypothetical protein Fisuc_1049 [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 2408
Score = 42.7 bits (99), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 56/121 (46%), Gaps = 20/121 (16%)
Query: 1177 SNMESNMLKKQHARLGRALHCLSQELYSQDSHF--LLELVQNADDNIYPENVEPTLTFI- 1233
SN E+N+L L Y S F L EL+QNA D + V + T +
Sbjct: 8 SNYETNILSTIQLNLDN---------YKYGSGFTILKELIQNASDANATKIVIRSFTAVP 58
Query: 1234 ------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
L+ +GIVV NN + F ++ + + G KK + A IGR G G KS++ + D
Sbjct: 59 NAHCPLLKRAGIVVYNNGK-FDEDDAKNIAMFGGDNKK-NDATTIGRYGFGLKSIYHLCD 116
Query: 1288 A 1288
A
Sbjct: 117 A 117
>gi|403415159|emb|CCM01859.1| predicted protein [Fibroporia radiculosa]
Length = 2665
Score = 42.7 bits (99), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 50/188 (26%), Positives = 77/188 (40%), Gaps = 23/188 (12%)
Query: 1218 DDNIYPEN--VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
D I+P +EP F + + N+ FS ++ AL ++ S+KK S IG+
Sbjct: 56 DSRIHPSEHLIEPK--FAKLQGPAFIAYNDATFSDDDWDALQNISRSSKK-SDTSKIGKY 112
Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFD-----TSEGQIGFVLPTLVPPFNIDMFCRLLSKD 1330
GIG +S + +TD E+ S V FD EG + L + F D C LS
Sbjct: 113 GIGIRSCYHLTDNLEVLSGNSFVIFDPHRKLIDEGGRKWSLADVKDTFG-DQLCPFLSFS 171
Query: 1331 PVQLESKCWNTCIRLPFR-----TKFSEGIAMNNIVDMFSDLHPS-----LLLFLHRLQC 1380
S IRLP R ++ S + + + ++LFL ++C
Sbjct: 172 --NAGSTFDGAAIRLPLRQEGHQSRLSSKVPQPDEIRKLLLEFVEEELGVVMLFLSSIKC 229
Query: 1381 IMFRNMLN 1388
I R + N
Sbjct: 230 IEIREIDN 237
>gi|384156150|ref|YP_005538965.1| heat shock protein 90 [Arcobacter butzleri ED-1]
gi|345469704|dbj|BAK71155.1| heat shock protein 90 [Arcobacter butzleri ED-1]
Length = 636
Score = 42.7 bits (99), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 33/142 (23%)
Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI--------YPENV------- 1226
M K Q +G+ LH ++ LYS F+ ELV NA D I EN+
Sbjct: 1 MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLRLTDENLKDKYADW 60
Query: 1227 --EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AGY 1271
E ++F ++ + +++N G + ++ + +G K G+ +
Sbjct: 61 KGEINISFDEKDKSLSIIDNGIGMNEADL--IASIGTIAKSGTKSFVEALTGDAKKDSNL 118
Query: 1272 IGRKGIGFKSVFRVTDAPEIHS 1293
IG+ G+GF SVF V D ++ S
Sbjct: 119 IGQFGVGFYSVFMVADKVDVIS 140
>gi|326672128|ref|XP_003199603.1| PREDICTED: hypothetical protein LOC100537175 [Danio rerio]
Length = 547
Score = 42.7 bits (99), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 66/143 (46%), Gaps = 8/143 (5%)
Query: 1931 IDWESNELVHLLSL--LTTNVNRQCSKHLLEILDTLWD--DCYTDKIMGFFKSKPTGDDR 1986
+D + +EL LL+ L+ Q + L+++LD W+ + Y + G R
Sbjct: 1 MDHQCSELHTLLTTDQLSDQEKLQQRRTLIQLLDNNWNSGERYGQLVSAQLLDADGGVVR 60
Query: 1987 SFQSSFINCICDIQW--TISSMDDELHY--PKDLFHDCDAVRSILGPSAPYIVPKVKSEK 2042
+ +SSFI + + W T M + Y P+ ++ AV +LG Y+ + + +
Sbjct: 61 NVRSSFIQMLTEFSWIPTHRPMCSTVSYLQPRSIYIFSTAVNRLLGAHVDYMHSEQTASE 120
Query: 2043 LVCDIGLKTEVTIDDILEILKVW 2065
IGL+ V ++D++ LK W
Sbjct: 121 FTSAIGLRHSVCVEDMVGYLKRW 143
>gi|313235804|emb|CBY19788.1| unnamed protein product [Oikopleura dioica]
Length = 664
Score = 42.7 bits (99), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 44/182 (24%), Positives = 78/182 (42%), Gaps = 40/182 (21%)
Query: 1177 SNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------------PE 1224
S+ +SN + A + L +++ LYS+ F+ EL+ N+ D + P
Sbjct: 32 SDADSNPTYEFQAETKKLLDIVAKSLYSEVEIFVRELISNSSDALNKRKFEEMAAGGDPP 91
Query: 1225 NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--------------SSAG 1270
+E L F + +++ +N G S E A+ ++G K G S+
Sbjct: 92 ALEIKLHFDKSNNKLIIEDNGIGMSQE--EAMTNLGTIAKSGSQDFMTAVKDENSASAQS 149
Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSN------GFHVKFDTSEGQIGFVLPTLVPPFNIDMFC 1324
IG+ G+GF SVF V D ++ + G H + S G G+ T+ P +++
Sbjct: 150 IIGQFGVGFYSVFMVADGVDVFTRRWNEDVGLHWR---STGDSGY---TIEPKEGLNVGT 203
Query: 1325 RL 1326
R+
Sbjct: 204 RI 205
>gi|157737782|ref|YP_001490466.1| heat shock protein 90 [Arcobacter butzleri RM4018]
gi|166918431|sp|A8EV23.1|HTPG_ARCB4 RecName: Full=Chaperone protein HtpG; AltName: Full=Heat shock
protein HtpG; AltName: Full=High temperature protein G
gi|157699636|gb|ABV67796.1| heat shock protein 90 HtpG [Arcobacter butzleri RM4018]
Length = 636
Score = 42.7 bits (99), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 33/142 (23%)
Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI--------YPENV------- 1226
M K Q +G+ LH ++ LYS F+ ELV NA D I EN+
Sbjct: 1 MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLRLTDENLKDKYAQW 60
Query: 1227 --EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AGY 1271
E ++F ++ + +++N G + ++ + +G K G+ +
Sbjct: 61 KGEINISFDEKDKSLSIIDNGIGMNEADL--IASIGTIAKSGTKSFVEALTGDAKKDSNL 118
Query: 1272 IGRKGIGFKSVFRVTDAPEIHS 1293
IG+ G+GF SVF V D ++ S
Sbjct: 119 IGQFGVGFYSVFMVADKVDVIS 140
>gi|291229801|ref|XP_002734859.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 2190
Score = 42.7 bits (99), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 87/209 (41%), Gaps = 38/209 (18%)
Query: 1206 DSHFLLELVQNADD------NIYPENVEPTLTFILQES------GIVVLNNEQGFSAENI 1253
D L EL+QNADD + E T +L S + N+ F+ +
Sbjct: 42 DGQILKELIQNADDAGATEVKFLLDTTEYGTTSLLTPSIATYQGAALYCYNDSVFNDADW 101
Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT-----SEGQIG 1308
+++ V S KK GR G+GF SV+ +TD P I S V FD EG G
Sbjct: 102 KSIQSVQQSVKKKDPLR-TGRFGLGFVSVYHITDMPSIMSADKIVFFDPFENRFREGYPG 160
Query: 1309 FVLP---TLVPPFNIDM--FCRLLSKDPVQLESKCWN------TCIRLPFRTKFSEG--- 1354
+ +P +L+ ++ F ++ S+ + + +N T +R P R + S+
Sbjct: 161 WQVPLDKSLLTAYHHQFLPFTKIHSQ--IGFGDQPFNNTFFNGTLMRFPLRKEISKMSSN 218
Query: 1355 -IAMNNIVDMFSDLHPSL---LLFLHRLQ 1379
NI DMF L LLFL ++
Sbjct: 219 VFKPENIDDMFESFKEELDVILLFLKSVE 247
>gi|400405574|ref|YP_006588433.1| molecular chaperone of HSP90 family [secondary endosymbiont of
Ctenarytaina eucalypti]
gi|400363937|gb|AFP85005.1| molecular chaperone of HSP90 family [secondary endosymbiont of
Ctenarytaina eucalypti]
Length = 630
Score = 42.7 bits (99), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 64/165 (38%), Gaps = 34/165 (20%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPTLTFILQESGIVV 1241
LH + +YS FL ELV NA D +Y + E L L + +
Sbjct: 19 LHLMINSIYSNKEIFLRELVSNASDAADKLRFRALSQPTLYETDSELRLRIALNKEKRTI 78
Query: 1242 LNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRKGIGFKSVFRVTDA 1288
+ ++ G + ++G K G+ A IG+ G+GF SVF V D
Sbjct: 79 ILSDNGIGMRRAEVIENLGTIAKSGTKAFLDSMKSDHLKEHPLIGQFGVGFYSVFIVADK 138
Query: 1289 PEIHSN--GFHVK---FDTSEGQIGFVLPTLVPP---FNIDMFCR 1325
+ + G H F S G+ + + ++ P +I ++ R
Sbjct: 139 VTVRTRAAGAHADEGVFWESAGEGDYTIADIIKPDRGTDITLYLR 183
>gi|313213012|emb|CBY36894.1| unnamed protein product [Oikopleura dioica]
Length = 664
Score = 42.7 bits (99), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 44/182 (24%), Positives = 78/182 (42%), Gaps = 40/182 (21%)
Query: 1177 SNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------------PE 1224
S+ +SN + A + L +++ LYS+ F+ EL+ N+ D + P
Sbjct: 32 SDADSNPTYEFQAETKKLLDIVAKSLYSEVEIFVRELISNSSDALNKRKFEEMAAGGDPP 91
Query: 1225 NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--------------SSAG 1270
+E L F + +++ +N G S E A+ ++G K G S+
Sbjct: 92 ALEIKLHFDKSNNKLIIEDNGIGMSQE--EAMTNLGTIAKSGSQDFMTAVKDENSASAQS 149
Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSN------GFHVKFDTSEGQIGFVLPTLVPPFNIDMFC 1324
IG+ G+GF SVF V D ++ + G H + S G G+ T+ P +++
Sbjct: 150 IIGQFGVGFYSVFMVADGVDVFTRRWNEDVGLHWR---STGDSGY---TIEPKEGLNVGT 203
Query: 1325 RL 1326
R+
Sbjct: 204 RI 205
>gi|374599692|ref|ZP_09672694.1| HSP90 family protein [Myroides odoratus DSM 2801]
gi|423324846|ref|ZP_17302687.1| hypothetical protein HMPREF9716_02044 [Myroides odoratimimus CIP
103059]
gi|373911162|gb|EHQ43011.1| HSP90 family protein [Myroides odoratus DSM 2801]
gi|404607820|gb|EKB07321.1| hypothetical protein HMPREF9716_02044 [Myroides odoratimimus CIP
103059]
Length = 592
Score = 42.7 bits (99), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 52/111 (46%), Gaps = 8/111 (7%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNI-----YPENVEPTLTFILQESGIVVLNNEQGFS 1249
+ LS+ +YS + F+ EL+QN D + E+ + +T ++ +N G S
Sbjct: 17 IELLSEHIYSSPTVFIRELLQNGVDAVTVRKGIDESFQGKITLFFDNDQLIFQDNGVGLS 76
Query: 1250 AENIRALCDV-GNSTKKGSSA--GYIGRKGIGFKSVFRVTDAPEIHSNGFH 1297
E++ V G S+K+G A IGR GIG S V++ + S +
Sbjct: 77 LEDMHQFLSVIGQSSKRGELAEKDLIGRFGIGLLSCLVVSEEIVVESRSLY 127
>gi|325168888|ref|YP_004280678.1| heat shock chaperone protein [Agrobacterium sp. H13-3]
gi|325064611|gb|ADY68300.1| heat shock chaperone protein [Agrobacterium sp. H13-3]
Length = 630
Score = 42.4 bits (98), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 58/139 (41%), Gaps = 30/139 (21%)
Query: 1186 KQH---ARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
+QH A + R LH + +YS FL EL+ NA D + + ++P
Sbjct: 10 EQHVFEADVNRLLHMMVHSVYSDKDVFLRELISNAADACEKLRYESISKPQLSADGIQPG 69
Query: 1230 L---------TFILQESGIVVLNNEQ-----GFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
+ T ++++SGI + +E + +A D + K G A IG+
Sbjct: 70 ILVTLNEEAQTLVVEDSGIGMDRDEMIDALGTIARSGTKAFMDRLEAAKAGEKAQLIGQF 129
Query: 1276 GIGFKSVFRVTDAPEIHSN 1294
G+GF S F V E+ S
Sbjct: 130 GVGFYSSFMVAHRVEVISR 148
>gi|300771857|ref|ZP_07081728.1| chaperone HtpG [Sphingobacterium spiritivorum ATCC 33861]
gi|300761243|gb|EFK58068.1| chaperone HtpG [Sphingobacterium spiritivorum ATCC 33861]
Length = 591
Score = 42.4 bits (98), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVLNNEQGFS 1249
+ LS+ LYS + F+ EL+QNA D I E+ + +T L++ ++ +N G +
Sbjct: 17 ISLLSEHLYSNPNTFVRELLQNAVDAITAFQNLDESHQGKITVRLEDKKMIFEDNGIGLT 76
Query: 1250 -AENIRALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTD 1287
AE + + +G S+K+ S + +IG+ GIG S F V++
Sbjct: 77 EAEIHKFIAVIGESSKRDSIDAQDFIGKFGIGLLSCFVVSN 117
>gi|156359821|ref|XP_001624963.1| predicted protein [Nematostella vectensis]
gi|156211771|gb|EDO32863.1| predicted protein [Nematostella vectensis]
Length = 2047
Score = 42.4 bits (98), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 55/129 (42%), Gaps = 30/129 (23%)
Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADD------------------NIYPENV 1226
+K RL R L E Y D L EL+QNADD I+ +
Sbjct: 45 EKLTTRLKRIL-----EGYPCDGAILKELLQNADDAKASKICFIKDLRRHPSERIFEHSW 99
Query: 1227 EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVT 1286
+P LQ + V N+ F+ ++I + +G +K G G+ G+GF +V+ +T
Sbjct: 100 KP-----LQGPALCVYNDSP-FTTQDIEGIQRLGEGSK-GDDPNKTGQYGVGFNAVYHMT 152
Query: 1287 DAPEIHSNG 1295
D P S G
Sbjct: 153 DCPSFVSKG 161
>gi|227540135|ref|ZP_03970184.1| heat-shock protein [Sphingobacterium spiritivorum ATCC 33300]
gi|227239996|gb|EEI90011.1| heat-shock protein [Sphingobacterium spiritivorum ATCC 33300]
Length = 591
Score = 42.4 bits (98), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 56/101 (55%), Gaps = 8/101 (7%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVLNNEQGFS 1249
+ LS+ LYS + F+ EL+QNA D I E+ + +T L++ ++ +N G +
Sbjct: 17 ISLLSEHLYSNPNTFVRELLQNAVDAITAFQNLDESHQGKITVRLEDKKMIFEDNGIGLT 76
Query: 1250 -AENIRALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTD 1287
AE + + +G S+K+ S + +IG+ GIG S F V++
Sbjct: 77 EAEIHKFIAVIGESSKRDSIDAQDFIGKFGIGLLSCFVVSN 117
>gi|418409404|ref|ZP_12982716.1| heat shock protein 90 [Agrobacterium tumefaciens 5A]
gi|358004043|gb|EHJ96372.1| heat shock protein 90 [Agrobacterium tumefaciens 5A]
Length = 630
Score = 42.4 bits (98), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 58/139 (41%), Gaps = 30/139 (21%)
Query: 1186 KQH---ARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
+QH A + R LH + +YS FL EL+ NA D + + ++P
Sbjct: 10 EQHVFEADVSRLLHMMVHSVYSDKDVFLRELISNAADACEKLRYESISNPQLSADGIQPG 69
Query: 1230 L---------TFILQESGIVVLNNEQ-----GFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
+ T ++++SGI + +E + +A D + K G A IG+
Sbjct: 70 ILVTLNEEAQTLVVEDSGIGMDRDEMIDALGTIARSGTKAFMDRLEAAKAGEKAQLIGQF 129
Query: 1276 GIGFKSVFRVTDAPEIHSN 1294
G+GF S F V E+ S
Sbjct: 130 GVGFYSSFMVAHRVEVISR 148
>gi|242280887|ref|YP_002993016.1| heat shock protein 90 [Desulfovibrio salexigens DSM 2638]
gi|242123781|gb|ACS81477.1| heat shock protein Hsp90 [Desulfovibrio salexigens DSM 2638]
Length = 632
Score = 42.4 bits (98), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 25/133 (18%)
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADD---------NIYPE---NVEPTLTFILQE 1236
A + + L L LY+ FL ELV NA D N PE VEP + E
Sbjct: 12 AEVNQLLDILVHSLYTNREIFLRELVSNASDALDKMRFAINSNPELEDEVEPEILIAYDE 71
Query: 1237 SGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRKGIGFKSVF 1283
V + G + ++G GS+ IGR G+GF S+F
Sbjct: 72 EKKTVTVTDTGIGMTREEVMTNIGTIAHSGSAEFVKQAAESKESLDSLIGRFGVGFYSIF 131
Query: 1284 RVTDAPEIHSNGF 1296
V+D + + +
Sbjct: 132 MVSDHVVVRTKSY 144
>gi|421088019|ref|ZP_15548848.1| GHKL domain protein [Leptospira kirschneri str. 200802841]
gi|410003275|gb|EKO53720.1| GHKL domain protein [Leptospira kirschneri str. 200802841]
Length = 600
Score = 42.4 bits (98), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 12/117 (10%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L +
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARAYLEPENKGEIHLEVVPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G +IG+ G+G S F V+D
Sbjct: 61 KDGGSPTLIFTDNGIGLVESEIHEFLATIGQSSKRGLQKDFIGQFGVGLLSCFIVSD 117
>gi|347532187|ref|YP_004838950.1| Electron transfer flavoprotein subunit alpha [Roseburia hominis
A2-183]
gi|345502335|gb|AEN97018.1| Electron transfer flavoprotein alpha subunit [Roseburia hominis
A2-183]
Length = 302
Score = 42.4 bits (98), Expect = 3.0, Method: Composition-based stats.
Identities = 23/80 (28%), Positives = 42/80 (52%)
Query: 764 LISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSA 823
+++ +E+ GS E F + + + + + CVL TILG L+G LSV+ ++ T S+
Sbjct: 46 ILTCPLEESGSPEDFAKAIAEEILKSDAACVLVGNTILGRALAGKLSVYLNTAVFATISS 105
Query: 824 ITHTGLRPKMSESVTSKDAI 843
+T +P +V A+
Sbjct: 106 LTSEDGKPCFERTVFGGTAV 125
>gi|256833671|ref|YP_003162398.1| ATPase, histidine kinase-, DNA gyrase B-, and HSP90-like
domain-containing protein [Jonesia denitrificans DSM
20603]
gi|256687202|gb|ACV10095.1| ATPase, histidine kinase-, DNA gyrase B-, and HSP90-like domain
protein [Jonesia denitrificans DSM 20603]
Length = 621
Score = 42.4 bits (98), Expect = 3.0, Method: Composition-based stats.
Identities = 36/116 (31%), Positives = 50/116 (43%), Gaps = 13/116 (11%)
Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-----VEPTLTFILQESGIVVLNNE 1245
L + LS+ +YS +L EL+QN D I + P +T G L ++
Sbjct: 9 LSGVIDLLSRHIYSSPRVYLRELIQNGIDAISAAHDTTPITNPQITITPARHGEPFLFHD 68
Query: 1246 QGF---SAENIRALCDVGNSTKKGSSAG-----YIGRKGIGFKSVFRVTDAPEIHS 1293
G + E L VG S+K+ G Y+G+ GIG S F VTD I S
Sbjct: 69 NGIGLTATEAAELLATVGRSSKRDPELGFRRDNYLGQFGIGLLSCFMVTDNIRIIS 124
>gi|418697545|ref|ZP_13258536.1| GHKL domain protein [Leptospira kirschneri str. H1]
gi|409954557|gb|EKO13507.1| GHKL domain protein [Leptospira kirschneri str. H1]
Length = 600
Score = 42.4 bits (98), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 12/117 (10%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L +
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARAYLEPENKGEIHLEVVPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G +IG+ G+G S F V+D
Sbjct: 61 KDGGSPTLIFTDNGIGLVESEIHEFLATIGQSSKRGLQKDFIGQFGVGLLSCFIVSD 117
>gi|398338615|ref|ZP_10523318.1| heat shock protein 90 [Leptospira kirschneri serovar Bim str. 1051]
gi|418675785|ref|ZP_13237071.1| GHKL domain protein [Leptospira kirschneri serovar Grippotyphosa str.
RM52]
gi|418686140|ref|ZP_13247309.1| GHKL domain protein [Leptospira kirschneri serovar Grippotyphosa str.
Moskva]
gi|418739204|ref|ZP_13295592.1| GHKL domain protein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
gi|400323550|gb|EJO71398.1| GHKL domain protein [Leptospira kirschneri serovar Grippotyphosa str.
RM52]
gi|410739094|gb|EKQ83823.1| GHKL domain protein [Leptospira kirschneri serovar Grippotyphosa str.
Moskva]
gi|410753456|gb|EKR10421.1| GHKL domain protein [Leptospira kirschneri serovar Valbuzzi str.
200702274]
Length = 600
Score = 42.4 bits (98), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 12/117 (10%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L +
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARAYLEPENKGEIHLEVVPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G +IG+ G+G S F V+D
Sbjct: 61 KDGGSPTLIFTDNGIGLVESEIHEFLATIGQSSKRGLQKDFIGQFGVGLLSCFIVSD 117
>gi|421131131|ref|ZP_15591316.1| GHKL domain protein [Leptospira kirschneri str. 2008720114]
gi|410357497|gb|EKP04747.1| GHKL domain protein [Leptospira kirschneri str. 2008720114]
Length = 600
Score = 42.4 bits (98), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 12/117 (10%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
M K L ++ LS+ LYS F+ EL+QN D I PEN E L +
Sbjct: 1 MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARAYLEPENKGEIHLEVVPG 60
Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
+ G ++ +N G I L +G S+K+G +IG+ G+G S F V+D
Sbjct: 61 KDGGSPTLIFTDNGIGLVESEIHEFLATIGQSSKRGLQKDFIGQFGVGLLSCFIVSD 117
>gi|322786202|gb|EFZ12807.1| hypothetical protein SINV_11884 [Solenopsis invicta]
Length = 633
Score = 42.4 bits (98), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 14/116 (12%)
Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIY------PENVEPT----LTFILQESG--I 1239
R + + LYS FL EL+ N+ D + P+ ++ T + +L E +
Sbjct: 10 SRFMSRVMSSLYSNSDIFLRELISNSSDALDKFSHKPPKEIDNTEELCIRIVLNEKNRTL 69
Query: 1240 VVLNNEQGFSAENIRALCDV--GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+++N G + I + + TKK + YIG+ GIGF SVF V D + S
Sbjct: 70 TIIDNGIGMNRSQITNYLGIMARSGTKKAENKNYIGQFGIGFYSVFLVADKVTVTS 125
>gi|296138562|ref|YP_003645805.1| hypothetical protein [Tsukamurella paurometabola DSM 20162]
gi|296026696|gb|ADG77466.1| conserved hypothetical protein [Tsukamurella paurometabola DSM 20162]
Length = 847
Score = 42.4 bits (98), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L+ G+ V N SA +R+L + S K +A +GR G+GF +V VTD
Sbjct: 70 LSVWAAADGVHVANTGAPLSAAGVRSLAALRVSPKSDGAA--VGRFGVGFAAVLTVTDHV 127
Query: 1290 EIHSNGFHVKFDTS 1303
+I S G ++F +
Sbjct: 128 QIRSAGGGIEFSAA 141
>gi|423282496|ref|ZP_17261381.1| hypothetical protein HMPREF1204_00919 [Bacteroides fragilis HMW 615]
gi|404582064|gb|EKA86759.1| hypothetical protein HMPREF1204_00919 [Bacteroides fragilis HMW 615]
Length = 587
Score = 42.4 bits (98), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 9/105 (8%)
Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
LS+ +YS + F+ EL+QN+ D I EN + L G + +N G E
Sbjct: 20 LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMAFQDNGIGLKEE 79
Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
+ R L +G S+K+ + + +IGR GIG S F VT+ + S
Sbjct: 80 EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124
>gi|241766434|ref|ZP_04764308.1| heat shock protein Hsp90 [Acidovorax delafieldii 2AN]
gi|241363376|gb|EER58887.1| heat shock protein Hsp90 [Acidovorax delafieldii 2AN]
Length = 657
Score = 42.0 bits (97), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 66/163 (40%), Gaps = 35/163 (21%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPE--NVEPTLTF 1232
A + + LH ++ LYS FL EL+ NA D +Y + N+E ++F
Sbjct: 9 QAEVAQLLHLVTHSLYSNQEIFLRELISNASDACDKLRFEALNNNALYEDAPNLEVRVSF 68
Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-------------SAGYIGRKGIGF 1279
+ + +N G SA+ A+ +G K G+ A IG+ G+GF
Sbjct: 69 DKDARTLTIQDNGIGMSAQE--AIDHLGTIAKSGTKDFMGKLTGDQKQDAQLIGQFGVGF 126
Query: 1280 KSVFRVTDAPEIHSNGFHVKFD-----TSEGQIGFVLPTLVPP 1317
S F V D + S +K D S G F + T+ P
Sbjct: 127 YSGFIVADKITVESRRAGLKADEGVRWVSGGAGDFEVETITRP 169
>gi|386840873|ref|YP_006245931.1| hypothetical protein SHJG_4789 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374101174|gb|AEY90058.1| hypothetical protein SHJG_4789 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451794167|gb|AGF64216.1| hypothetical protein SHJGH_4552 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 1042
Score = 42.0 bits (97), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA--GYIGRKGIGFKSVFRVTD 1287
L L++ +V N A + +L + S K+ + A G +GR G+GF +V VTD
Sbjct: 75 LRLTLRDGVLVAANTGAPLDAAGVESLSTLRASAKRDAEATPGAVGRFGVGFAAVLAVTD 134
Query: 1288 APEIHSNGFHVKFDTSEGQ 1306
P I V++ +E +
Sbjct: 135 EPAIVGRHGGVRWSLAEAR 153
>gi|315637563|ref|ZP_07892770.1| chaperone HtpG [Arcobacter butzleri JV22]
gi|315478154|gb|EFU68880.1| chaperone HtpG [Arcobacter butzleri JV22]
Length = 527
Score = 42.0 bits (97), Expect = 3.6, Method: Composition-based stats.
Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 33/142 (23%)
Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI--------YPENV------- 1226
M K Q +G+ LH ++ LYS F+ ELV NA D I EN+
Sbjct: 1 MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLRLTDENLKDKYADW 60
Query: 1227 --EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AGY 1271
E ++F ++ + +++N G + ++ + +G K G+ +
Sbjct: 61 KGEINISFDEKDKSLSIIDNGIGMNEADL--IASIGTIAKSGTKSFVEALTGDAKKDSNL 118
Query: 1272 IGRKGIGFKSVFRVTDAPEIHS 1293
IG+ G+GF SVF V D ++ S
Sbjct: 119 IGQFGVGFYSVFMVADKVDVIS 140
>gi|76802578|ref|YP_327586.1| hypothetical protein NP3886A [Natronomonas pharaonis DSM 2160]
gi|76558443|emb|CAI50034.1| hypothetical protein NP_3886A [Natronomonas pharaonis DSM 2160]
Length = 1999
Score = 42.0 bits (97), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 3/85 (3%)
Query: 1210 LLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNEQGFS--AENIRALCDVGNSTKKG 1266
+LEL+QNA D I + E ++ + + G++V N F E + + T KG
Sbjct: 59 ILELLQNARDAIRQGDAEQGRVYVGVYDEGVLVANTGSRFDLFDEQVEDAVTMIGETGKG 118
Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEI 1291
IG KG+G KS+ D+ EI
Sbjct: 119 DDDQSIGHKGVGLKSILATGDSFEI 143
>gi|296447419|ref|ZP_06889344.1| Heat shock protein Hsp90-like protein [Methylosinus trichosporium
OB3b]
gi|296255039|gb|EFH02141.1| Heat shock protein Hsp90-like protein [Methylosinus trichosporium
OB3b]
Length = 655
Score = 42.0 bits (97), Expect = 3.7, Method: Composition-based stats.
Identities = 40/131 (30%), Positives = 57/131 (43%), Gaps = 27/131 (20%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNI-------------YPENVEPTLTFIL 1234
A + R L ++ +YS+ FL ELV NA D I + P +T +
Sbjct: 38 QADVARLLELMTHSVYSERDIFLRELVSNAADAIEKLRHDALSDEALAAKAAAPLVTISI 97
Query: 1235 QESG--IVVLNNEQGFSAEN-IRALCDVGNS---------TKKG--SSAGYIGRKGIGFK 1280
+ + V +N G S E+ + AL + +S KKG SA IGR GIGF
Sbjct: 98 DKDARTLTVADNGVGMSREDLVAALGTIASSGTRAFLDDLEKKGGEKSASLIGRFGIGFY 157
Query: 1281 SVFRVTDAPEI 1291
S F V + E+
Sbjct: 158 SAFMVAEKVEV 168
>gi|313674537|ref|YP_004052533.1| hypothetical protein Ftrac_0419 [Marivirga tractuosa DSM 4126]
gi|312941235|gb|ADR20425.1| hypothetical protein Ftrac_0419 [Marivirga tractuosa DSM 4126]
Length = 468
Score = 42.0 bits (97), Expect = 3.9, Method: Composition-based stats.
Identities = 50/219 (22%), Positives = 92/219 (42%), Gaps = 11/219 (5%)
Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESGIVVLNNEQGFSAE 1251
+ LH L + + + + EL+QNA D++ EN + ++ L + IV +N + F+
Sbjct: 23 KRLHDLEKTVETNHGRWAWELLQNAKDSVATENDRKVSVQIKLYDGKIVFKHNGRRFTET 82
Query: 1252 NIRALCDVGNST--KKGSSAGYIGRKGIGFKSVFRVTDAPEIH-----SNGFHVKFDTSE 1304
++R + + +S ++ GR G GF + ++ I NG KF
Sbjct: 83 DVRGIINQISSKEIEEDEETKQTGRFGTGFLTTHILSKVITIQGILETENGDFYKFKFPL 142
Query: 1305 GQIGFVLPTLVPPFN--IDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVD 1362
+ G LVP F ++K + +NT ++F + IA I +
Sbjct: 143 DREGKTTTQLVPKIEKAWSSFHESVTKITQDYDRNDFNTSFTYVLDSEFQQAIAEKGIKE 202
Query: 1363 MFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGD 1401
FS L P +L F+ ++ + N + ++ V K D
Sbjct: 203 -FSQLIPYVLTFIPAIEDVTIINKVKETEVTYNTKSHAD 240
>gi|260785782|ref|XP_002587939.1| hypothetical protein BRAFLDRAFT_87327 [Branchiostoma floridae]
gi|229273094|gb|EEN43950.1| hypothetical protein BRAFLDRAFT_87327 [Branchiostoma floridae]
Length = 857
Score = 42.0 bits (97), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 4/72 (5%)
Query: 35 NPNFAFQNPNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQP 94
NP+ +NP++ NP LQ ++ +NP +NP+L QN++L QNP L P P
Sbjct: 122 NPDLQPENPDLQPENPDLQPEN--PDLQPENPGLQPENPDLQPQNSDLQPQNPELQPGDP 179
Query: 95 P--SSACNQQRS 104
S N +R+
Sbjct: 180 EIQSGQSNAERA 191
>gi|156359811|ref|XP_001624958.1| predicted protein [Nematostella vectensis]
gi|156211766|gb|EDO32858.1| predicted protein [Nematostella vectensis]
Length = 216
Score = 42.0 bits (97), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 21/113 (18%)
Query: 1198 LSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNN 1244
+S+ + H + EL+QNA+D +Y E+ T + Q + NN
Sbjct: 1 MSKHGVNTQCHCVQELIQNAEDAEAREVKVLYDEHSYGTHSLFKPELAKFQGPALYAYNN 60
Query: 1245 EQGFSAEN---IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
+ F ++ +R LCD + K +GR G+GFKSVF +TD P + SN
Sbjct: 61 AR-FKRDDWNGLRMLCD----SIKVKDPMKVGRFGLGFKSVFHITDLPSVISN 108
>gi|440223583|ref|YP_007336979.1| chaperone protein HtpG [Rhizobium tropici CIAT 899]
gi|440042455|gb|AGB74433.1| chaperone protein HtpG [Rhizobium tropici CIAT 899]
Length = 629
Score = 42.0 bits (97), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 59/149 (39%), Gaps = 30/149 (20%)
Query: 1176 ISNMESNMLKKQH---ARLGRALHCLSQELYSQDSHFLLELVQNADDNI----YPENVEP 1228
++ + M + H A + R LH + +YS FL EL+ NA D Y +P
Sbjct: 1 MTTTAAEMTAESHVFEADVARLLHMMVHSVYSDKDVFLRELISNAADACEKLRYEAIAQP 60
Query: 1229 TL-----------TFILQESGIVVLNNEQGFSAENI------------RALCDVGNSTKK 1265
L T +VV +N G S + + RA + + K+
Sbjct: 61 GLLANDPDSRILLTLDADGGKLVVEDNGIGMSRDEMIEALGTIARSGTRAFMERIEANKQ 120
Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
G A IG+ G+GF S F V D ++ S
Sbjct: 121 GEGAQLIGQFGVGFYSCFMVADRVDVISR 149
>gi|242071031|ref|XP_002450792.1| hypothetical protein SORBIDRAFT_05g018550 [Sorghum bicolor]
gi|241936635|gb|EES09780.1| hypothetical protein SORBIDRAFT_05g018550 [Sorghum bicolor]
Length = 68
Score = 42.0 bits (97), Expect = 4.4, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 7/55 (12%)
Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
VE IRR+ + +G E N L + + +A++ LSQELYS+D HFL+ELVQ+
Sbjct: 17 VERIRRERYYIGRG----EQNPLAED---MHQAVNYLSQELYSKDVHFLMELVQS 64
>gi|436841092|ref|YP_007325470.1| Chaperone protein htpG [Desulfovibrio hydrothermalis AM13 = DSM
14728]
gi|432169998|emb|CCO23369.1| Chaperone protein htpG [Desulfovibrio hydrothermalis AM13 = DSM
14728]
Length = 632
Score = 41.6 bits (96), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 60/155 (38%), Gaps = 29/155 (18%)
Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADDNIY---------PE---NVEPTLTFILQE 1236
A + + L L LY+ FL EL+ NA D + PE +VEP + E
Sbjct: 12 AEVNQLLDILIHSLYTNREIFLRELISNASDALDKMRFAISCDPELNDDVEPEILIAYDE 71
Query: 1237 SGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRKGIGFKSVF 1283
+ + G + ++G GS+ IGR G+GF S+F
Sbjct: 72 EKKTITVTDTGIGMTRDEVVANIGTIAHSGSAEFIKQAAESKESLDSLIGRFGVGFYSIF 131
Query: 1284 RVTDAPEIHSNGF----HVKFDTSEGQIGFVLPTL 1314
V+D + + + H S+G+ + L T+
Sbjct: 132 MVSDHVVVRTKSYKDADHATQWISDGKNSYELSTI 166
>gi|290959492|ref|YP_003490674.1| hypothetical protein SCAB_50881 [Streptomyces scabiei 87.22]
gi|260649018|emb|CBG72132.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length = 1047
Score = 41.6 bits (96), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 42/201 (20%), Positives = 82/201 (40%), Gaps = 15/201 (7%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA--GYIGRKGIGFKSVFRVTD 1287
L L++ +V N A + +L + S K+ + G +GR G+GF +V VTD
Sbjct: 75 LRLTLRDGVLVAANTGAPLDAGGVESLSTLRASAKRDAQGHDGAVGRFGVGFAAVLAVTD 134
Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRL-----LSKDPVQLES---KCW 1339
P + +++ +E ++ P D R L + P E + +
Sbjct: 135 EPAVVGRHGGIRWSLAEARLLAAETARHSPGLGDEIRRRDGHVPLLRLPFAAEGTAPESY 194
Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
+T + LP R + +A ++ D +LLL L L ++ + D + +R+ +
Sbjct: 195 DTVVILPLRDTAAADLA-ERLLHAVDD---ALLLALPGLDEVVI-EIGADEVRTVRRSVE 249
Query: 1400 GDGIIKVSCGEDKMTWFVASQ 1420
G ++ + W A++
Sbjct: 250 GAHVVVEDSRDGVTRWRTAAE 270
>gi|45443521|ref|NP_995060.1| hypothetical protein YP_3789 [Yersinia pestis biovar Microtus str.
91001]
gi|145600493|ref|YP_001164569.1| hypothetical protein YPDSF_3239 [Yersinia pestis Pestoides F]
gi|162420877|ref|YP_001605784.1| hypothetical protein YpAngola_A1243 [Yersinia pestis Angola]
gi|229837034|ref|ZP_04457199.1| DNA mismatch repair enzyme [Yersinia pestis Pestoides A]
gi|45438390|gb|AAS63937.1| DNA mismatch repair enzyme (predicted ATPase) [Yersinia pestis biovar
Microtus str. 91001]
gi|145212189|gb|ABP41596.1| hypothetical protein YPDSF_3239 [Yersinia pestis Pestoides F]
gi|162353692|gb|ABX87640.1| DNA mismatch repair enzyme [Yersinia pestis Angola]
gi|229705977|gb|EEO91986.1| DNA mismatch repair enzyme [Yersinia pestis Pestoides A]
Length = 240
Score = 41.6 bits (96), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 14/108 (12%)
Query: 1192 GRALHCLSQELYSQDSHFLL-ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
G+ + LSQ++ S S F L ELV+N+ D P+ T+T I E I++ + G S
Sbjct: 10 GKIIEELSQKIPS--SLFALNELVKNSYDAFSPD---VTITVIPSELKIIISDYGNGMSV 64
Query: 1251 ENIRALCDVGNSTKK--------GSSAGYIGRKGIGFKSVFRVTDAPE 1290
+ I +L + STKK G G KG+GF S F+ D E
Sbjct: 65 DEIHSLFHISKSTKKYGCEVSQNGIKRIVQGSKGLGFLSAFKFGDKVE 112
>gi|392531652|ref|ZP_10278789.1| HSP90 family protein [Carnobacterium maltaromaticum ATCC 35586]
Length = 605
Score = 41.6 bits (96), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 51/126 (40%), Gaps = 19/126 (15%)
Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV--------EPT 1229
N E N K LG + LS LYSQ ++ EL+QNA D I + E
Sbjct: 2 NPEENRFK---VNLGGMIEILSNHLYSQKDVYIRELLQNATDAIRGRKLISEDNFVGEIH 58
Query: 1230 LTFILQESGIVVL---NNEQGFSAENIRALCDVGNSTKKG-----SSAGYIGRKGIGFKS 1281
+ + SG L +N G + + I S+ KG +IGR GIG S
Sbjct: 59 VDLTIHRSGSSTLFFEDNGIGLTEDEIHQFLATIASSSKGEKNFEEKQDFIGRFGIGLLS 118
Query: 1282 VFRVTD 1287
F V D
Sbjct: 119 CFIVAD 124
>gi|22127662|ref|NP_671085.1| hypothetical protein y3791 [Yersinia pestis KIM10+]
gi|108809882|ref|YP_653798.1| hypothetical protein YPA_3892 [Yersinia pestis Antiqua]
gi|108810429|ref|YP_646196.1| hypothetical protein YPN_0263 [Yersinia pestis Nepal516]
gi|150260565|ref|ZP_01917293.1| hypothetical protein YPE_2877 [Yersinia pestis CA88-4125]
gi|165927293|ref|ZP_02223125.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
F1991016]
gi|165936424|ref|ZP_02224992.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
IP275]
gi|166008913|ref|ZP_02229811.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
E1979001]
gi|166211933|ref|ZP_02237968.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
B42003004]
gi|167400522|ref|ZP_02306031.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
UG05-0454]
gi|167419396|ref|ZP_02311149.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
MG05-1020]
gi|167426097|ref|ZP_02317850.1| DNA mismatch repair enzyme [Yersinia pestis biovar Mediaevalis str.
K1973002]
gi|167467846|ref|ZP_02332550.1| hypothetical protein YpesF_08094 [Yersinia pestis FV-1]
gi|218927594|ref|YP_002345469.1| hypothetical protein YPO0392 [Yersinia pestis CO92]
gi|229840268|ref|ZP_04460427.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
PEXU2]
gi|229842346|ref|ZP_04462501.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
India 195]
gi|229900611|ref|ZP_04515735.1| DNA mismatch repair enzyme [Yersinia pestis Nepal516]
gi|270488028|ref|ZP_06205102.1| conserved hypothetical protein [Yersinia pestis KIM D27]
gi|294502497|ref|YP_003566559.1| hypothetical protein YPZ3_0387 [Yersinia pestis Z176003]
gi|384120932|ref|YP_005503552.1| hypothetical protein YPD4_0340 [Yersinia pestis D106004]
gi|384124810|ref|YP_005507424.1| hypothetical protein YPD8_0341 [Yersinia pestis D182038]
gi|384137625|ref|YP_005520327.1| hypothetical protein A1122_03155 [Yersinia pestis A1122]
gi|384416547|ref|YP_005625909.1| DNA mismatch repair protein [Yersinia pestis biovar Medievalis str.
Harbin 35]
gi|420544862|ref|ZP_15043047.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-01]
gi|420549649|ref|ZP_15047333.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-02]
gi|420550157|ref|ZP_15047782.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-02]
gi|420555603|ref|ZP_15052630.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-03]
gi|420561253|ref|ZP_15057546.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-04]
gi|420566294|ref|ZP_15062094.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-05]
gi|420571920|ref|ZP_15067207.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-06]
gi|420577270|ref|ZP_15072041.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-07]
gi|420582606|ref|ZP_15076902.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-08]
gi|420587744|ref|ZP_15081542.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-09]
gi|420593089|ref|ZP_15086352.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-10]
gi|420598742|ref|ZP_15091417.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-11]
gi|420604273|ref|ZP_15096345.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-12]
gi|420609640|ref|ZP_15101223.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-13]
gi|420620323|ref|ZP_15110632.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-15]
gi|420625371|ref|ZP_15115209.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-16]
gi|420635658|ref|ZP_15124476.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-25]
gi|420641260|ref|ZP_15129525.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-29]
gi|420646412|ref|ZP_15134254.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-32]
gi|420656895|ref|ZP_15143684.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-36]
gi|420657506|ref|ZP_15144230.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-36]
gi|420662841|ref|ZP_15148990.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-42]
gi|420667836|ref|ZP_15153506.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-45]
gi|420673135|ref|ZP_15158327.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-46]
gi|420678638|ref|ZP_15163336.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-47]
gi|420683871|ref|ZP_15168038.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-48]
gi|420689038|ref|ZP_15172631.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-52]
gi|420706270|ref|ZP_15187190.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-55]
gi|420711557|ref|ZP_15191989.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-56]
gi|420728212|ref|ZP_15206564.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-60]
gi|420733305|ref|ZP_15211152.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-61]
gi|420744021|ref|ZP_15220787.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-64]
gi|420749890|ref|ZP_15225718.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-65]
gi|420761062|ref|ZP_15235105.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-71]
gi|420766235|ref|ZP_15239787.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-72]
gi|420771280|ref|ZP_15244302.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-76]
gi|420776585|ref|ZP_15249089.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-88]
gi|420782081|ref|ZP_15253912.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-89]
gi|420787526|ref|ZP_15258683.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-90]
gi|420792972|ref|ZP_15263591.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-91]
gi|420798144|ref|ZP_15268235.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-92]
gi|420803491|ref|ZP_15273048.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-93]
gi|420808679|ref|ZP_15277747.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-94]
gi|420814471|ref|ZP_15282927.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-95]
gi|420819594|ref|ZP_15287579.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-96]
gi|420824682|ref|ZP_15292126.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-98]
gi|420830514|ref|ZP_15297394.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-99]
gi|420840446|ref|ZP_15306368.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-101]
gi|420846057|ref|ZP_15311448.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-102]
gi|420856993|ref|ZP_15320924.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-113]
gi|421761820|ref|ZP_16198620.1| DNA mismatch repair protein [Yersinia pestis INS]
gi|21960777|gb|AAM87336.1|AE013982_3 hypothetical [Yersinia pestis KIM10+]
gi|108774077|gb|ABG16596.1| hypothetical protein YPN_0263 [Yersinia pestis Nepal516]
gi|108781795|gb|ABG15853.1| hypothetical protein YPA_3892 [Yersinia pestis Antiqua]
gi|115346205|emb|CAL19073.1| hypothetical protein YPO0392 [Yersinia pestis CO92]
gi|149289973|gb|EDM40050.1| hypothetical protein YPE_2877 [Yersinia pestis CA88-4125]
gi|165915540|gb|EDR34149.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
IP275]
gi|165920755|gb|EDR38003.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
F1991016]
gi|165992252|gb|EDR44553.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
E1979001]
gi|166206679|gb|EDR51159.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
B42003004]
gi|166963390|gb|EDR59411.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
MG05-1020]
gi|167049890|gb|EDR61298.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
UG05-0454]
gi|167055020|gb|EDR64820.1| DNA mismatch repair enzyme [Yersinia pestis biovar Mediaevalis str.
K1973002]
gi|229681950|gb|EEO78042.1| DNA mismatch repair enzyme [Yersinia pestis Nepal516]
gi|229690656|gb|EEO82710.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
India 195]
gi|229696634|gb|EEO86681.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
PEXU2]
gi|262360528|gb|ACY57249.1| hypothetical protein YPD4_0340 [Yersinia pestis D106004]
gi|262364474|gb|ACY61031.1| hypothetical protein YPD8_0341 [Yersinia pestis D182038]
gi|270336532|gb|EFA47309.1| conserved hypothetical protein [Yersinia pestis KIM D27]
gi|294352956|gb|ADE63297.1| hypothetical protein YPZ3_0387 [Yersinia pestis Z176003]
gi|320017051|gb|ADW00623.1| DNA mismatch repair enzyme [Yersinia pestis biovar Medievalis str.
Harbin 35]
gi|342852754|gb|AEL71307.1| hypothetical protein A1122_03155 [Yersinia pestis A1122]
gi|391432509|gb|EIQ93945.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-01]
gi|391433640|gb|EIQ94949.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-02]
gi|391434921|gb|EIQ96049.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-02]
gi|391436310|gb|EIQ97282.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-03]
gi|391448550|gb|EIR08355.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-04]
gi|391449295|gb|EIR09033.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-05]
gi|391451745|gb|EIR11215.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-06]
gi|391464572|gb|EIR22844.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-07]
gi|391466198|gb|EIR24293.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-08]
gi|391468290|gb|EIR26180.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-09]
gi|391481652|gb|EIR38165.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-10]
gi|391482627|gb|EIR39063.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-12]
gi|391482741|gb|EIR39165.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-11]
gi|391496834|gb|EIR51747.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-13]
gi|391497530|gb|EIR52385.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-15]
gi|391512473|gb|EIR65787.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-16]
gi|391516004|gb|EIR68942.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-25]
gi|391528008|gb|EIR79870.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-29]
gi|391532165|gb|EIR83588.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-32]
gi|391545078|gb|EIR95212.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-36]
gi|391546722|gb|EIR96684.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-42]
gi|391546995|gb|EIR96935.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-36]
gi|391547579|gb|EIR97467.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-45]
gi|391561250|gb|EIS09801.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-46]
gi|391562376|gb|EIS10794.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-47]
gi|391564446|gb|EIS12648.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-48]
gi|391576616|gb|EIS23147.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-52]
gi|391588721|gb|EIS33710.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-55]
gi|391592635|gb|EIS37029.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-56]
gi|391605684|gb|EIS48533.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-60]
gi|391620114|gb|EIS61303.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-61]
gi|391629489|gb|EIS69416.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-64]
gi|391631622|gb|EIS71233.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-65]
gi|391642888|gb|EIS81113.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-71]
gi|391645647|gb|EIS83504.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-72]
gi|391655398|gb|EIS92146.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-76]
gi|391662291|gb|EIS98244.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-88]
gi|391667285|gb|EIT02641.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-89]
gi|391668917|gb|EIT04107.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-90]
gi|391672856|gb|EIT07631.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-91]
gi|391686576|gb|EIT19984.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-93]
gi|391688090|gb|EIT21342.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-92]
gi|391689398|gb|EIT22536.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-94]
gi|391700675|gb|EIT32752.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-95]
gi|391703966|gb|EIT35667.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-96]
gi|391704757|gb|EIT36393.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-98]
gi|391715204|gb|EIT45773.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-99]
gi|391720819|gb|EIT50813.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-101]
gi|391731418|gb|EIT60129.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-102]
gi|391737317|gb|EIT65218.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Yersinia pestis PY-113]
gi|411178142|gb|EKS48154.1| DNA mismatch repair protein [Yersinia pestis INS]
Length = 240
Score = 41.6 bits (96), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 14/108 (12%)
Query: 1192 GRALHCLSQELYSQDSHFLL-ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
G+ + LSQ++ S S F L ELV+N+ D P+ T+T I E I++ + G S
Sbjct: 10 GKIIEELSQKIPS--SLFALNELVKNSYDAFSPD---VTITVIPSELKIIISDYGNGMSV 64
Query: 1251 ENIRALCDVGNSTKK--------GSSAGYIGRKGIGFKSVFRVTDAPE 1290
+ I +L + STKK G G KG+GF S F+ D E
Sbjct: 65 DEIHSLFHISKSTKKYGCEVSQNGIKRIVQGSKGLGFLSAFKFGDKVE 112
>gi|387761650|ref|YP_006068627.1| hypothetical protein Ssal_01492 [Streptococcus salivarius 57.I]
gi|339292417|gb|AEJ53764.1| conserved hypothetical protein [Streptococcus salivarius 57.I]
Length = 1506
Score = 41.6 bits (96), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 1210 LLELVQNADD-------NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
+LEL+QN DD + N E + + + + V N FS + I LC +G +
Sbjct: 46 ILELLQNVDDAYGDIKSDNDASNDEVKVQITYKNNILEVGNTGTSFSKDTIERLC-LGRA 104
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
+ K S IG KG GF+S+ + E++S F ++F
Sbjct: 105 SNKSSQN--IGNKGTGFRSLLNDAEWVELYSGEFAIRF 140
>gi|340399189|ref|YP_004728214.1| hypothetical protein SALIVB_1412 [Streptococcus salivarius CCHSS3]
gi|338743182|emb|CCB93690.1| hypothetical protein SALIVB_1412 [Streptococcus salivarius CCHSS3]
Length = 1506
Score = 41.6 bits (96), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 1210 LLELVQNADD-------NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
+LEL+QN DD + N E + + + + V N FS + I LC +G +
Sbjct: 46 ILELLQNVDDAYGDIKSDNDASNDEVKVQITYKNNILEVGNTGTSFSKDTIERLC-LGRA 104
Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
+ K S IG KG GF+S+ + E++S F ++F
Sbjct: 105 SNKSSQN--IGNKGTGFRSLLNDAEWVELYSGEFAIRF 140
>gi|403359361|gb|EJY79339.1| hypothetical protein OXYTRI_23390 [Oxytricha trifallax]
Length = 338
Score = 41.6 bits (96), Expect = 5.0, Method: Composition-based stats.
Identities = 39/110 (35%), Positives = 48/110 (43%), Gaps = 18/110 (16%)
Query: 29 PPPPHFNPNFAFQNPNI-FFPNPALQLL-------QNLTNIPF-QNPNFD----IQNPNL 75
PPP FN + + NI PN Q Q +P QNP + IQNPN
Sbjct: 147 PPPNAFNGG-QYPSANIQGMPNGTQQYQPGMMMDNQPAMGMPVSQNPQYSNLPPIQNPNS 205
Query: 76 FMQNANLLTQNPN---LPPQQPPSSACNQQRSQTQHPP-GASNQQRPLTQ 121
+MQN + L + N LP Q+ SS N QR + H G Q LTQ
Sbjct: 206 YMQNPSGLPNSQNLMPLPQQRQESSNLNSQRDRLDHSQMGLLQNQTQLTQ 255
>gi|260782174|ref|XP_002586166.1| hypothetical protein BRAFLDRAFT_109861 [Branchiostoma floridae]
gi|229271259|gb|EEN42177.1| hypothetical protein BRAFLDRAFT_109861 [Branchiostoma floridae]
Length = 649
Score = 41.6 bits (96), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 11/71 (15%)
Query: 24 QPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLL 83
QPP + P NP +NP++ NP L +NP+ +NP+L +N +L
Sbjct: 138 QPPDREIP--ANPELQPENPDLQPENPDLHP---------ENPDLQPENPDLHPENPDLQ 186
Query: 84 TQNPNLPPQQP 94
+NP+L P+ P
Sbjct: 187 PENPDLQPENP 197
>gi|296272791|ref|YP_003655422.1| Heat shock protein Hsp90 [Arcobacter nitrofigilis DSM 7299]
gi|296096965|gb|ADG92915.1| Heat shock protein Hsp90 [Arcobacter nitrofigilis DSM 7299]
Length = 637
Score = 41.6 bits (96), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 58/136 (42%), Gaps = 34/136 (25%)
Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI----------------YPEN 1225
M K Q +G+ LH ++ LYS F+ ELV NA D I PE+
Sbjct: 1 MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLKLTDDKIKAALPED 60
Query: 1226 --VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AG 1270
E ++F + + +++N G + E++ + +G K G+ +
Sbjct: 61 WSGEINVSFDEADKSLTIVDNGIGMNEEDL--IASIGTIAKSGTKSFIEAMTGDAKKDSN 118
Query: 1271 YIGRKGIGFKSVFRVT 1286
IG+ G+GF SVF V
Sbjct: 119 LIGQFGVGFYSVFMVA 134
>gi|258627089|ref|ZP_05721885.1| heat shock protein HtpG [Vibrio mimicus VM603]
gi|258580607|gb|EEW05560.1| heat shock protein HtpG [Vibrio mimicus VM603]
Length = 635
Score = 41.2 bits (95), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 65/148 (43%), Gaps = 33/148 (22%)
Query: 1176 ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIY 1222
I+N E+ + + +L LH + LYS FL EL+ NA D ++Y
Sbjct: 6 ITNKETRGFQSEVKQL---LHLMIHSLYSNKEIFLRELISNASDAADKLRFQALSNPDLY 62
Query: 1223 PENVE--PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS------------- 1267
+ E L+F + + + + +N G S E++ + ++G K G+
Sbjct: 63 QGDAELGVKLSFDQENNTLTISDNGIGMSREDV--IENLGTIAKSGTAEFFSKLSQEQSK 120
Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
S+ IG+ G+GF S F V DA + +
Sbjct: 121 SSQLIGQFGVGFYSAFIVADAVTVRTRA 148
>gi|404371673|ref|ZP_10976977.1| hypothetical protein CSBG_01028 [Clostridium sp. 7_2_43FAA]
gi|226912201|gb|EEH97402.1| hypothetical protein CSBG_01028 [Clostridium sp. 7_2_43FAA]
Length = 649
Score = 41.2 bits (95), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 54/110 (49%), Gaps = 8/110 (7%)
Query: 1197 CLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRAL 1256
L ++ DS + EL++N+ D + + V T+ F ++ I + +N G S E I+
Sbjct: 20 ALGSKIIELDSIVIAELIKNSKD-AHAKKV--TINFSDYKNSITIWDNGDGMSIEEIKDK 76
Query: 1257 CDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI-----HSNGFHVKFD 1301
V S K + +G KGIG S+F++ ++ I HSN + FD
Sbjct: 77 WGVVASNNKSNEVDTLGGKGIGRFSIFKLCNSFTIITKKEHSNEYKFTFD 126
>gi|91978860|ref|YP_571519.1| heat shock protein 90 [Rhodopseudomonas palustris BisB5]
gi|123735061|sp|Q12ZX1.1|HTPG_RHOPS RecName: Full=Chaperone protein HtpG; AltName: Full=Heat shock
protein HtpG; AltName: Full=High temperature protein G
gi|91685316|gb|ABE41618.1| heat shock protein Hsp90 [Rhodopseudomonas palustris BisB5]
Length = 628
Score = 41.2 bits (95), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 81/209 (38%), Gaps = 44/209 (21%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG--------- 1238
A + L+ + +YS+ FL EL+ NA D + E T L E+G
Sbjct: 14 QAEVAELLNLMVHSVYSETDIFLRELISNASDALDKLRYESIATPALMEAGGAPKIQIVP 73
Query: 1239 ------IVVLNNEQGFSAENIRALCDVGNSTKKGS------------SAGYIGRKGIGFK 1280
+ V++N G + + + + ++G K G+ AG IG+ G+GF
Sbjct: 74 RKAPDTLTVIDNGIGMNRQEL--IDNLGTIAKSGTKSFLTKLTEAKDGAGLIGQFGVGFY 131
Query: 1281 SVFRVTDAPEIHS---NGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESK 1337
+ F V D + S V +S G GF + P + + R++ + L K
Sbjct: 132 AAFMVADNIVVTSRRAGSGEVWTWSSSGGAGFE----IAPASEEAAARVVRGTEIVLHLK 187
Query: 1338 CWNTCIRLPFRTKFSEGIAMNNIVDMFSD 1366
P K+ E + IV +SD
Sbjct: 188 --------PDAAKYLEAYQIERIVSEYSD 208
>gi|420694858|ref|ZP_15177717.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein, partial [Yersinia pestis PY-53]
gi|391577417|gb|EIS23851.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein, partial [Yersinia pestis PY-53]
Length = 132
Score = 41.2 bits (95), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 14/108 (12%)
Query: 1192 GRALHCLSQELYSQDSHFLL-ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
G+ + LSQ++ S S F L ELV+N+ D P+ T+T I E I++ + G S
Sbjct: 10 GKIIEELSQKIPS--SLFALNELVKNSYDAFSPD---VTITVIPSELKIIISDYGNGMSV 64
Query: 1251 ENIRALCDVGNSTKK--------GSSAGYIGRKGIGFKSVFRVTDAPE 1290
+ I +L + STKK G G KG+GF S F+ D E
Sbjct: 65 DEIHSLFHISKSTKKYGCEVSQNGIKRIVQGSKGLGFLSAFKFGDKVE 112
>gi|408373681|ref|ZP_11171375.1| heat shock protein 90 [Alcanivorax hongdengensis A-11-3]
gi|407766385|gb|EKF74828.1| heat shock protein 90 [Alcanivorax hongdengensis A-11-3]
Length = 617
Score = 41.2 bits (95), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 33/133 (24%), Positives = 49/133 (36%), Gaps = 26/133 (19%)
Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPTLTFIL 1234
A + R LH + LYS FL EL+ NA D + E+ EP + L
Sbjct: 11 QAEVSRLLHLMIHSLYSNREIFLRELISNASDACDKLRFEALDTPALMEEDPEPAIRLTL 70
Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-------------SAGYIGRKGIGFKS 1281
+S + + G + +G + G+ A IG+ G+GF S
Sbjct: 71 DKSAGTLTLADNGIGMSEAEIIDHLGTIARSGTEKFLASLSGDQKKDAQLIGQFGVGFYS 130
Query: 1282 VFRVTDAPEIHSN 1294
F V D + S
Sbjct: 131 AFIVADKVTVESR 143
>gi|260785720|ref|XP_002587908.1| hypothetical protein BRAFLDRAFT_87296 [Branchiostoma floridae]
gi|229273063|gb|EEN43919.1| hypothetical protein BRAFLDRAFT_87296 [Branchiostoma floridae]
Length = 1403
Score = 41.2 bits (95), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 4/71 (5%)
Query: 24 QPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLL 83
QPP + P NP+ +NP++ NP L ++ +NP+ +NP+ N +L
Sbjct: 533 QPPDREIPE--NPDLQPENPDLQPGNPELH--PGNPDLQLENPDLQPENPDQQPDNPDLQ 588
Query: 84 TQNPNLPPQQP 94
+NP+L PQ P
Sbjct: 589 PENPDLQPQNP 599
>gi|119484754|ref|ZP_01619236.1| hypothetical protein L8106_14815 [Lyngbya sp. PCC 8106]
gi|119457572|gb|EAW38696.1| hypothetical protein L8106_14815 [Lyngbya sp. PCC 8106]
Length = 864
Score = 41.2 bits (95), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 88/199 (44%), Gaps = 11/199 (5%)
Query: 1190 RLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEP-TLTFILQESGIVVLNNEQGF 1248
++ + LH L + + + + EL+QNA D+I ++ ++ L + + +N F
Sbjct: 23 KIIKRLHDLEKTVENNQGRWAWELLQNAKDSIAEDDDRTVSIQIELDRNSVTFRHNGNHF 82
Query: 1249 SAENIRALCDVGNS--TKKGSSAGYIGRKGIGFKSVFRVTDAPEIH-----SNGFHVKFD 1301
+++++ L D +S ++G GR G GF + ++ +I ++G KF+
Sbjct: 83 TSQDVIGLIDQISSKEVEEGQETKKTGRFGTGFLTTHLLSRVIKIKGIIGSADGSFHKFE 142
Query: 1302 TSEGQIGFVLPTLVPPF--NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN 1359
+ G L+P + F + K + C+NT T+ + IA
Sbjct: 143 FLLNRQGRTTAQLMPRITNSWKEFDESIEKISPSYDKNCFNTSFCYQLDTEEQKKIAQIG 202
Query: 1360 IVDMFSDLHPSLLLFLHRL 1378
VD FS L P +L F+ ++
Sbjct: 203 -VDEFSRLIPFVLAFIPKI 220
>gi|87311767|ref|ZP_01093882.1| HtpG [Blastopirellula marina DSM 3645]
gi|87285551|gb|EAQ77470.1| HtpG [Blastopirellula marina DSM 3645]
Length = 595
Score = 41.2 bits (95), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 52/108 (48%), Gaps = 15/108 (13%)
Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEP---------TLTFILQESGIVVLNN 1244
+ LSQ LYS F+ EL+QN D I +EP +T + I+ +N
Sbjct: 7 IELLSQHLYSGPHVFIRELLQNGVDAIQARRQIEPKHEGAIEIEVVTSEESDPTIIFQDN 66
Query: 1245 EQGFS-AENIRALCDVGNSTKKGSSAG----YIGRKGIGFKSVFRVTD 1287
G + AE + L +G S+K+G + ++G+ GIG S F V+D
Sbjct: 67 GVGLTEAEVQQFLATIGQSSKRGEATSRPDDFLGQFGIGLLSCFTVSD 114
>gi|386859793|ref|YP_006272499.1| chaperone protein htpG [Borrelia crocidurae str. Achema]
gi|384934674|gb|AFI31347.1| Chaperone protein htpG [Borrelia crocidurae str. Achema]
Length = 615
Score = 40.8 bits (94), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 56/135 (41%), Gaps = 28/135 (20%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD--------NIYPE-----NVEPT 1229
M K+ + L+ + LYS FL EL+ NA D NI E ++P
Sbjct: 1 MKKQFDTEVNDLLYLIIHSLYSHKEIFLRELISNASDAIDKLKFLNITDEKFKDIKLDPR 60
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-------------SAGYIGRKG 1276
+ E I + +N G + E++ + +G K G+ +A IG+ G
Sbjct: 61 IDINFDEKSITIKDNGIGMNREDL--ISHLGTIAKSGTKEFINNLKKDEKKAASLIGQFG 118
Query: 1277 IGFKSVFRVTDAPEI 1291
+GF S F V D E+
Sbjct: 119 VGFYSAFIVADKVEV 133
>gi|66827323|ref|XP_647016.1| hypothetical protein DDB_G0268852 [Dictyostelium discoideum AX4]
gi|60475079|gb|EAL73015.1| hypothetical protein DDB_G0268852 [Dictyostelium discoideum AX4]
Length = 2673
Score = 40.8 bits (94), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 110/258 (42%), Gaps = 48/258 (18%)
Query: 1183 MLKKQHARLGRALHCLSQELYSQ-DSHFLLELVQNADDNI------------YP------ 1223
M K + RL + H L L + +S + EL+QNA+D + YP
Sbjct: 1 MPKTKDQRL-KVSHSLRNNLLNYPESKIVRELIQNAEDALADTFIIKVDEGSYPNNGLLI 59
Query: 1224 --ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKS 1281
+N E + L I++ NN F ++ +C++ +KK + +G G+G+ S
Sbjct: 60 SNDNQEIESSKELLGPSILIYNN-ATFHEQDWEGICEINQGSKKENLKS-VGNFGLGWNS 117
Query: 1282 VFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQL-ESKC-- 1338
V+ +TD P ++S G +V F +IG L + + C + +Q ES C
Sbjct: 118 VYHITDNPIVYS-GNNVWFADPNERIGGGLFFDLDDEDDYNKCFSFLEPFIQFKESNCNP 176
Query: 1339 ---WN-TCIRLPFRTKFSE-------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
+N T IRLP R SE A+ I++ FS +L+FL L I
Sbjct: 177 KEYFNGTIIRLPLRIYPSEIKKAVLTMDAIKKILNEFSKDINEILIFLKSLASIT----- 231
Query: 1388 NDSLVVIRKKIVGDGIIK 1405
+++ +IV IK
Sbjct: 232 ----IIVNNQIVNSVKIK 245
>gi|384172841|ref|YP_005554218.1| heat shock protein 90 [Arcobacter sp. L]
gi|345472451|dbj|BAK73901.1| heat shock protein 90 [Arcobacter sp. L]
Length = 636
Score = 40.8 bits (94), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 58/134 (43%), Gaps = 33/134 (24%)
Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI--------YPENV------- 1226
M K Q +G+ LH ++ LYS F+ ELV NA D I EN+
Sbjct: 1 MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLRLTDENLKDKYAQW 60
Query: 1227 --EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AGY 1271
E ++F ++ + + +N G + E++ + +G K G+ +
Sbjct: 61 KGEINISFDEKDKSLTISDNGIGMNDEDL--IASIGTIAKSGTKSFIDNLTGDAKKDSNL 118
Query: 1272 IGRKGIGFKSVFRV 1285
IG+ G+GF SVF V
Sbjct: 119 IGQFGVGFYSVFMV 132
>gi|226509636|ref|NP_001148622.1| ubiquilin-1 [Zea mays]
gi|195620874|gb|ACG32267.1| ubiquilin-1 [Zea mays]
Length = 576
Score = 40.8 bits (94), Expect = 9.1, Method: Composition-based stats.
Identities = 39/166 (23%), Positives = 62/166 (37%), Gaps = 18/166 (10%)
Query: 7 PRFRPVGGVGGEGTGGGQPPSQPPPPHFNPNFA---------FQNPNIFFPNPALQLLQN 57
P P GG+GG G P + P F P F QNPN+ + +QN
Sbjct: 118 PSSGPAGGLGGLFPGVDAPGNDRPSGIFRPGFPELEQVEQHLSQNPNLMREIMNMPAMQN 177
Query: 58 LTNIPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASNQQR 117
L N P N + NP Q +++ +NP+L S Q ++P R
Sbjct: 178 LMNNPDLIRNMIMSNP----QMRDIMDRNPDLAHVLNDPSVLRQTLEAARNPEIMREMMR 233
Query: 118 PLTQQPQSLPPQPQKLN--KEFLERIDRAVVKARSELIMAGESVSS 161
+ ++ P+ N + E + + A + M GE ++
Sbjct: 234 NTDRAMSNIESSPEGFNMLRRMYETVQEPFLNATT---MGGEGSTA 276
>gi|291438926|ref|ZP_06578316.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
gi|291341821|gb|EFE68777.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
Length = 1048
Score = 40.8 bits (94), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 20/77 (25%), Positives = 36/77 (46%)
Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
L L++ +V N A + +L + S K+ + +GR G+GF +V VTD P
Sbjct: 80 LRLTLRDGVLVAANTGAPLDAAGVESLSTLRASAKRDAPERSVGRFGVGFAAVLAVTDEP 139
Query: 1290 EIHSNGFHVKFDTSEGQ 1306
+ V++ +E +
Sbjct: 140 AVVGRHGGVRWSLAEAR 156
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.136 0.405
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 32,939,753,213
Number of Sequences: 23463169
Number of extensions: 1414801664
Number of successful extensions: 4032289
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 271
Number of HSP's successfully gapped in prelim test: 1519
Number of HSP's that attempted gapping in prelim test: 4011808
Number of HSP's gapped (non-prelim): 14956
length of query: 2109
length of database: 8,064,228,071
effective HSP length: 159
effective length of query: 1950
effective length of database: 8,628,551,496
effective search space: 16825675417200
effective search space used: 16825675417200
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 85 (37.4 bits)