BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000132
         (2109 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255554036|ref|XP_002518058.1| conserved hypothetical protein [Ricinus communis]
 gi|223542654|gb|EEF44191.1| conserved hypothetical protein [Ricinus communis]
          Length = 2833

 Score = 2327 bits (6031), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1232/2186 (56%), Positives = 1544/2186 (70%), Gaps = 167/2186 (7%)

Query: 5    HRPRFRPVGGVGGEGTGGGQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNL------ 58
            HRP FRP GG GG   G   PP      + +PNFA QNPN +F N A  L Q+L      
Sbjct: 4    HRPPFRPGGGAGGGYYGQPPPPPL----NSDPNFALQNPNAYFHNLANTLYQSLQQQQNF 59

Query: 59   -TNIPF--QNPNFDIQNPNLFMQNANL-LTQNPNLP----------------PQQPPSSA 98
             +  PF  QN NF IQNPN   Q++N  L Q  ++P                P  P  + 
Sbjct: 60   NSYFPFSLQNLNFPIQNPNFPFQHSNFSLPQQFSIPQTQQQQQQNRNFQIQNPNSPQQNL 119

Query: 99   CNQQRSQTQHP-------------------------------------PGASN------- 114
              QQ++Q   P                                     P A N       
Sbjct: 120  VQQQKAQNLPPETPKPVEEERIDQTAQHDSVQQQQQQHQQQKQPQESSPNAQNDSTQQHQ 179

Query: 115  -QQRPLTQQP---QSLPPQPQ------KLNKEFLERIDRAVVKARSELIMAGESVSSWKV 164
             Q R L QQ    Q L   P+      K  +E +E++D AV KA  +L+ A ES+S+W+V
Sbjct: 180  QQSRKLLQQQPLHQMLFQNPKGKRKQLKHKQELVEKVDLAVEKAWQDLLAAAESISAWRV 239

Query: 165  SRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAIC 224
            S++ L+ LQV+SW SLGF MQEVPSLH+L++TE +INAFIHCFV VR+ITSLYDLE AIC
Sbjct: 240  SQAALVTLQVESWGSLGFPMQEVPSLHRLILTEGKINAFIHCFVAVRRITSLYDLEVAIC 299

Query: 225  KSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIK 284
            ++EGIE+FEEL+LGP LR PL+ HYF V  D   V KI T D+I+ L EY  T K KDI 
Sbjct: 300  ENEGIEQFEELKLGPLLRHPLVLHYFSVSCDATEVLKITTEDIILTLHEYMGTFKEKDIT 359

Query: 285  VDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKR-- 342
             DEFLDFI KKRS+  K  LGVRIQ LGMHI FI+EA++S++ TLKK L  ++   KR  
Sbjct: 360  ADEFLDFIVKKRSVNGKGNLGVRIQGLGMHIKFIQEAKRSKNTTLKKCLSSMRAPSKRSG 419

Query: 343  ---RKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTY 399
                K P+ SS+KK  LDERFSAI +RV+SF+ +++DF  KHIRF SSSSE E+S+D  +
Sbjct: 420  TRCHKHPLLSSQKK-DLDERFSAISQRVESFALVHKDFQGKHIRFDSSSSEGEESNDSMH 478

Query: 400  E---CSNDISSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGE--INHQLTSAGNSSG 454
            +    SN   S+  L   +  +D+VS+CPYPS TEE+ RLGLKGE  +  Q +   ++S 
Sbjct: 479  DDTMTSNGERSHYSL-QNVNSTDKVSTCPYPSATEEMSRLGLKGESEVGSQSSRKRHTSN 537

Query: 455  QDDYIGSSK-----KKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQDDELNEAD 509
                   SK     +KRK E    T+SAP K L+ N+ K     +++GD+T     +E D
Sbjct: 538  PSRLRSRSKLDSWERKRKFEELSGTASAPPKLLKGNEEKHDIHSLKNGDKT-----DEVD 592

Query: 510  ISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCI 569
             S SN +M  FITTW++ACK +T++EV EKM QFY+P D     R+ KR+K  F S P +
Sbjct: 593  FSLSNNAMETFITTWRDACKEHTVTEVFEKMVQFYRPLD----GRHRKRIKWAFVSNPSV 648

Query: 570  GLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHT 629
            GLLN+AV ++K GM DS+YD  Q++ Q EL NT S    EYESI++EP+E+   V  + +
Sbjct: 649  GLLNVAVMAMKSGMLDSIYDAFQSVNQHELTNTFS----EYESIDVEPAEKHKSVVPQCS 704

Query: 630  V---QDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEF 686
            +   Q + V+E++ K+++Y E D    +N +  +    I L+KLC+CE WL +QF IKEF
Sbjct: 705  LLPTQSVTVDEIIGKITRYYELDQKFQSNDKLLLEDKFISLKKLCNCEFWLVDQFGIKEF 764

Query: 687  KSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWES 746
            K LG+GEF  FLEKHAS+L TELQKLF  D  ++  LEVS++QH L+ LVSQASNNLWES
Sbjct: 765  KFLGHGEFLMFLEKHASLLPTELQKLFAADICEKPPLEVSVLQHQLIFLVSQASNNLWES 824

Query: 747  EIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLS 806
            E I+KQMIS LL +QFPLISFKI + GSME FL+TV +++N V+SKCV FS  +LG    
Sbjct: 825  ETISKQMISALLIKQFPLISFKIMENGSMEEFLQTVAQHKNNVLSKCVQFSAALLGEHYI 884

Query: 807  GDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFA 866
            GD+    E+  +ET +  T++G +    ES+TS+ AIE+LLRAP L DL SWSHWD +FA
Sbjct: 885  GDML--REDHTVETAAVRTNSGQKMMAFESITSQSAIEVLLRAPMLCDLTSWSHWDLIFA 942

Query: 867  PSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSS 926
            PSLGPL  WLLNEVN KELLCLVT+DGKVIRID SA+VDSFLEAAL+GS FQTAVKLLS 
Sbjct: 943  PSLGPLVEWLLNEVNAKELLCLVTKDGKVIRIDQSANVDSFLEAALQGSPFQTAVKLLSL 1002

Query: 927  FALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANI 986
             +LAGGEK++PL LLKC+AR AF+V+FKN+ E+++V  ++N  +HG  +    N      
Sbjct: 1003 LSLAGGEKHIPLSLLKCYARQAFDVIFKNHFENMDVQENRNYLLHGKAVDKAAN------ 1056

Query: 987  DNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAIL 1046
              LSG+  K L +  + +P ASRF LDCLGYLPSEFRSFAADVLLSG+ S  KDAPSAIL
Sbjct: 1057 -TLSGQAHKNLFQINRVLPAASRFVLDCLGYLPSEFRSFAADVLLSGMHSVAKDAPSAIL 1115

Query: 1047 LECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGS- 1105
             EC Q E R+MLHE+GLS+G++EWI DYH F ST ++D           AAT  L++GS 
Sbjct: 1116 CECSQKE-RIMLHEIGLSIGLVEWIDDYHTFFSTISTDSFT-SFEPALGAATPVLSTGSR 1173

Query: 1106 ---------GCAEGSLFESVGADVHIEECGAICDTI-----CGEASDDGLGDCTTQTLPE 1151
                      C +G     +  D H EE      TI      G+A+  G  + ++    E
Sbjct: 1174 YVQNTLDMYSCGDGKTNMHLAEDGHNEESTETSPTIQDAVVSGDATATGCAEESS----E 1229

Query: 1152 DKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
              + +DAAL++ESIRRDEFGL PNIS+ ES +LKKQHARLGRALHCLSQELYS+DSHFLL
Sbjct: 1230 SNKLKDAALVIESIRRDEFGLDPNISSTESTILKKQHARLGRALHCLSQELYSEDSHFLL 1289

Query: 1212 ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY 1271
            ELVQNADDNIY  +VEPTLTFILQESGIV+LNNEQGF A+NIRALCDVGNSTKK S  GY
Sbjct: 1290 ELVQNADDNIYSGSVEPTLTFILQESGIVILNNEQGFLAQNIRALCDVGNSTKKASGTGY 1349

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDP 1331
            IG+KGIGFKSVFRVTDAPEIHSNGFH+KFD SEGQIGFVLPT+VP  ++D+F RL+S++ 
Sbjct: 1350 IGQKGIGFKSVFRVTDAPEIHSNGFHIKFDISEGQIGFVLPTVVPACDVDLFSRLVSRET 1409

Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
             Q + K WNTCI LPFR+K SE  AM     MF+DLHPSLLLFLHRLQCIMFRNMLNDSL
Sbjct: 1410 GQKDKKHWNTCIVLPFRSKLSEETAMK----MFADLHPSLLLFLHRLQCIMFRNMLNDSL 1465

Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG 1451
            +V+RK+I+ DGIIKVSCG+DKMTW VASQKL+A   RP V+TTEIA+A TL+ES  G+Y 
Sbjct: 1466 LVMRKEILQDGIIKVSCGKDKMTWLVASQKLQAHASRPKVQTTEIAVAFTLEESENGDYY 1525

Query: 1452 PLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERS 1511
            P L QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVD N PWN+WLL++FP LFVSAERS
Sbjct: 1526 PRLDQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDKNDPWNEWLLTKFPDLFVSAERS 1585

Query: 1512 FCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAP 1571
            FC L CFR NP KA +VYMSFVPLVGEVHGFFSGLP+ I  +LR ++CL+LEG+N    P
Sbjct: 1586 FCALSCFRYNPGKAVAVYMSFVPLVGEVHGFFSGLPKAIALELRRTSCLLLEGDNCNMVP 1645

Query: 1572 PCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISS 1631
            PC VLRGWN++A +LLPD LLQ+HLGLGFL K+I+LSDSLARALGI EYGP+IL++ ++ 
Sbjct: 1646 PCNVLRGWNEQARNLLPDGLLQEHLGLGFLDKNIILSDSLARALGIMEYGPEILIKFMTC 1705

Query: 1632 LCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGT 1691
            L  T +GL+SMGL WL+S LN LY +  HSSG         TDLIDNL++IPFIPLSDG 
Sbjct: 1706 LSHTTSGLKSMGLGWLSSLLNTLYIMISHSSG--------PTDLIDNLRQIPFIPLSDGR 1757

Query: 1692 FSSVDEGTIWLHSD--CSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVD 1749
            +SS+D GTIWLHSD   + FDG   LEAFP L AKLR V+PAL SAS  D    G + VD
Sbjct: 1758 YSSLDRGTIWLHSDILSAGFDGAQELEAFPQLYAKLRVVNPALFSASVAD----GTL-VD 1812

Query: 1750 NLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCH 1809
            N   MLLKIGVQQLSAH+IVKVH+LPA+S+E  ++ +K LM DYLCFVM+HL+  CP+C 
Sbjct: 1813 NSATMLLKIGVQQLSAHEIVKVHVLPALSNEKVSDRNKELMTDYLCFVMIHLQSSCPHCC 1872

Query: 1810 VEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITY 1869
            +ER++I+SEL  KAF+LTN G++RP+E P+HF K+FGNP+ +N LI+ +DI+W+E+D+TY
Sbjct: 1873 MERKYIISELHSKAFILTNFGYRRPAETPLHFSKDFGNPIDINKLINVMDIQWHEIDLTY 1932

Query: 1870 LKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM-WTKELLSPGS 1928
            LKH   +SLS GL+KWR FF+EIG+TDFVQV+Q++K+++D+  T  KN+    +LL PGS
Sbjct: 1933 LKHSVNDSLSNGLMKWRVFFQEIGVTDFVQVIQIEKNISDLLQTVLKNVKCDADLLCPGS 1992

Query: 1929 AAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSF 1988
             A DWES+EL  +LS+L+   +R+C K+LLEILD +WDD +++K  G++ SK +   R+F
Sbjct: 1993 IARDWESSELAQILSILSKTGDRECCKYLLEILDRMWDDSFSEKATGYYNSKSSVAGRTF 2052

Query: 1989 QSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIG 2048
            +S F+  I D+QW +S+MD+ELHYPKDLF+DCD VRSILG SAPY +PKV S KL+ DIG
Sbjct: 2053 KSCFLRSIHDVQWVVSTMDNELHYPKDLFNDCDVVRSILGSSAPYALPKVTSSKLLSDIG 2112

Query: 2049 LKTEVTIDDILEILKVWTRLEAPFMA 2074
             KT+VT+DD L+ L+VW + E PF A
Sbjct: 2113 FKTKVTLDDALKFLRVWRKSETPFKA 2138


>gi|449433179|ref|XP_004134375.1| PREDICTED: uncharacterized protein LOC101204025 [Cucumis sativus]
          Length = 2724

 Score = 2142 bits (5549), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1118/2087 (53%), Positives = 1458/2087 (69%), Gaps = 70/2087 (3%)

Query: 21   GGGQPPSQPPPPHFNPNFAFQNPNI--FFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQ 78
            G GQ   + PP    P+  + N N+  F  +P     QN+ N P QN  F    P+ F  
Sbjct: 12   GSGQRQQRLPPQMQFPDAPYLNHNVPFFQTHPGFIPYQNV-NFPMQNTGFQFSPPSNFGL 70

Query: 79   NANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFL 138
            +     Q+PN                     PG   ++     QP+S     +K  K+ L
Sbjct: 71   H---FPQSPN---------------------PGVRLRKASTASQPES---STKKAGKD-L 102

Query: 139  ERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEA 198
             +I+ AV  A + L+ AGES+S+WKV +S L  LQVDSW SLG++MQE+  LH++MV E 
Sbjct: 103  SKINHAVATAHTNLLEAGESISAWKVVQSALSILQVDSWDSLGYRMQEISDLHRIMVREG 162

Query: 199  RINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDV 258
            +IN+FIHCFVGVR+IT+LY+LEEAICK EG+ +F+EL LGPFLR PL+ HYF V S+V  
Sbjct: 163  KINSFIHCFVGVRRITTLYELEEAICKYEGVGEFKELRLGPFLRHPLVLHYFSVDSEVTE 222

Query: 259  VFKIATNDVIVCLSEYTDTHKA--KDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN 316
            VFKI   D+I  L E+ D      K I V++FL+FI++K S+   E LG+R+Q LG +++
Sbjct: 223  VFKITGEDIISFLFEFMDVDACSNKKIAVEKFLEFISRKLSVEKWEMLGIRVQKLGDYVS 282

Query: 317  FIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINED 376
             +++AR+S+  T  K+L +    H + KRP+FS +K++ LDERFSAI +RV+SFSS+++D
Sbjct: 283  VVKKARRSEADTFSKYLTKYDEKHGK-KRPLFSLQKEK-LDERFSAISQRVESFSSVHKD 340

Query: 377  FGAKHIRFVSSSSEDEDSDDCTYE---CSNDISSNVQLPSQI-KGSDRVSSCPYPSVTEE 432
            F  KH+RF SSSS DE S D TYE      D S++++L  QI K +DR SSCPYPS TEE
Sbjct: 341  FCGKHVRFDSSSSGDEGSGDSTYEKKKVDTDSSNDLKLSLQINKSADRPSSCPYPSATEE 400

Query: 433  LKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR-A 491
            + RLGLKGE++   T  G+   Q   IGS   KRK  +     +         K K+  +
Sbjct: 401  MMRLGLKGEVSGHATPIGS---QKHCIGSGTLKRKRRSRSRGHTTSGTSTSSKKFKEDCS 457

Query: 492  LPIESG-DQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
            LP++     +++    EA+   +N S+RMF+T WK+ C++ T++EV+++M   Y      
Sbjct: 458  LPVKDDFGNSEEGSGFEAEYEITNNSLRMFVTMWKDGCRDMTVAEVIKRMLDHYGMN--- 514

Query: 551  KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
               R   R++ M SSYP IGLLN+AV+SIK GMWDS+YD+LQ I   EL NT       Y
Sbjct: 515  --LRSTVRVRSMLSSYPFIGLLNVAVSSIKNGMWDSIYDSLQIINLPELTNTNVKKQPVY 572

Query: 611  ESIEIEPSEQVAVV-NTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRK 669
            + I++ PS + A++ +   +  DI VE+++ KV ++++F+  I ++ +  +   I +L++
Sbjct: 573  DCIDVGPSAEGALIKHVPKSTHDITVEDILNKVGQHIKFEQEIRSDGRLLMENRIQILKQ 632

Query: 670  LCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQ 729
            LCSCE WL EQ+  K+FKSLG+G+FF+FLEKHASML  EL K    +  +RS LE  ++Q
Sbjct: 633  LCSCEFWLVEQYSAKDFKSLGHGDFFSFLEKHASMLPDELHKFLVPEISERSPLEACMLQ 692

Query: 730  HLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEV 789
              LV LVSQA NNL E+EII+KQMI  LL +QFPLI FK+ + G +E F+E VG+ +N V
Sbjct: 693  RQLVTLVSQACNNLSENEIISKQMIYNLLTQQFPLIDFKLTENGPLEDFMELVGQQKNSV 752

Query: 790  MSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIEILLRA 849
            +SKCV FS ++LG     D     +N  L  TS     G      +SVTS+DAI++L+RA
Sbjct: 753  VSKCVTFSVSLLGGSHLRDSLATMDNDSLGATSFSAEAGQGLGAIKSVTSEDAIKVLVRA 812

Query: 850  PFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLE 909
            P L DLN WSHWD LFAP+LGPL  WL  EVN +  +C+VT++GKVIRIDH+A+ DSFLE
Sbjct: 813  PMLLDLNLWSHWDLLFAPALGPLVPWLQKEVNTENFMCMVTKEGKVIRIDHTATADSFLE 872

Query: 910  AALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCR 969
            AAL+GS F TAVKLLS FAL GGEK V L LLK HA  AFEV+ KN+VE+IE+   +N  
Sbjct: 873  AALQGSPFHTAVKLLSIFALLGGEKYVLLSLLKHHASRAFEVIMKNSVENIEMF--ENWG 930

Query: 970  MHGNVLCGRQNF-DVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAAD 1028
                 +   QNF +     NLS E++K++    KA+ + SRFF+DCLGY+P EFR  AA+
Sbjct: 931  QGLEKVAFHQNFIEQVAAGNLSLELKKKIDMRNKAISLLSRFFVDCLGYIPVEFRYLAAN 990

Query: 1029 VLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMP 1088
            +LLSG+ S +KDA SAIL EC + E RLMLHE+GLSLG+ EWI DYH   S+ +SDL   
Sbjct: 991  ILLSGITSVVKDAASAILHECWKPEQRLMLHEIGLSLGVPEWIQDYHTVSSSASSDLFTD 1050

Query: 1089 CVVT-----CTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEASDDGLGD 1143
              +        N    GL +    +E +   S+  +V  E+          + S+D  G 
Sbjct: 1051 ACLNDRSEINRNVHRDGLLTKYSTSEQNASFSIEENVFNEKLSVSSANCTAKTSNDANGL 1110

Query: 1144 CTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELY 1203
                   E    +DA  I++ IRRDEFGL  ++   E+ ML+KQHARLGRALHCLSQELY
Sbjct: 1111 SCMSLASEPDGNKDAVEIIQCIRRDEFGLDLDLPISETGMLRKQHARLGRALHCLSQELY 1170

Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
            SQDSHFLLELVQNADDNIYP++VEPTL FI +ESGIVVLNNE+GFSA+NIRALCDVGNST
Sbjct: 1171 SQDSHFLLELVQNADDNIYPQSVEPTLAFIFEESGIVVLNNEEGFSAKNIRALCDVGNST 1230

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMF 1323
            KKGS+AGYIG+KGIGFKSVFR+TDAPEIHSNGFHVKFD SEGQIGFVLPT++ P N++++
Sbjct: 1231 KKGSNAGYIGKKGIGFKSVFRITDAPEIHSNGFHVKFDISEGQIGFVLPTIISPCNVNLY 1290

Query: 1324 CRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAM-NNIVDMFSDLHPSLLLFLHRLQCIM 1382
             +L +      ++  WNTCI LPFR+K S G+ + NNIV+MF+DLHPSLLLFLHRLQCI 
Sbjct: 1291 GKLATSASDHEDTNIWNTCIVLPFRSKLSGGVNLINNIVNMFADLHPSLLLFLHRLQCIK 1350

Query: 1383 FRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL 1442
             RN++++SL+V+RK+IVG+GII+VS GE+KMTW V SQKL+A VIR DV++TEI++A TL
Sbjct: 1351 IRNLIDNSLIVMRKEIVGNGIIRVSHGEEKMTWLVVSQKLKADVIRHDVQSTEISIAFTL 1410

Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
             E   G   PLL+QQPVFAFLPLR YGLKFI+QGDFVLPSSREEVDG+SPWNQWLLSEFP
Sbjct: 1411 HEEENGVISPLLHQQPVFAFLPLRMYGLKFIIQGDFVLPSSREEVDGDSPWNQWLLSEFP 1470

Query: 1503 ALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
             LFVSA  SFC LPCF   P KA S YMS++PL+GEVHGFFS LPR+I+SKLRMSNCL+L
Sbjct: 1471 GLFVSAVESFCSLPCFESCPGKAISAYMSYIPLIGEVHGFFSSLPRLIISKLRMSNCLLL 1530

Query: 1563 EGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGP 1622
            EG  N+WAPPCKVLRGWN++A +LLPD LL+++LGLGFL KDI+LSDSLARALGIEEYGP
Sbjct: 1531 EGKENEWAPPCKVLRGWNEQALTLLPDNLLREYLGLGFLHKDIILSDSLARALGIEEYGP 1590

Query: 1623 KILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRI 1682
            KIL+Q +SSLC+  N L+SMGL WL S L+ L+ +   SSGQ++L+     DLI +LQ++
Sbjct: 1591 KILVQFMSSLCQKYNSLKSMGLFWLGSCLSVLHNMLLQSSGQTTLELERNADLIRSLQKV 1650

Query: 1683 PFIPLSDGTFSSVDEGTIWLHSDCS--VFDGGFGLEAFPNLCAKLRTVSPALLSASAVDK 1740
            P IPLSDGT+SSV EGTIWLHSD S    DG +GLEAFP L +K+R V PA LS  +VD 
Sbjct: 1651 PLIPLSDGTYSSVAEGTIWLHSDSSNATVDGKYGLEAFPYLNSKIRVVCPAFLSLFSVDN 1710

Query: 1741 SSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMH 1800
            S + V SV N++ ML +IGVQ+LSAH+I+K HI+PAI++E+  NG+K LM +Y+CFVM H
Sbjct: 1711 SQIDVPSVGNISWMLYRIGVQRLSAHEIIKEHIIPAITNESNLNGNKILMTEYVCFVMTH 1770

Query: 1801 LEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDI 1860
            L   CP CH++R FI+SELR KAF+LTNHG+KR  E+P+HF KE+GNP+ +N L+  +++
Sbjct: 1771 LLSSCPECHIDRGFIISELRTKAFILTNHGYKRLVEVPVHFSKEYGNPIDLNKLL-SVEM 1829

Query: 1861 KWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM-W 1919
             W+EV  TYLKHP   SLSCGL KWR FF+EIGI DFV VV+V++ +A++ H    N  W
Sbjct: 1830 NWHEVADTYLKHPVTNSLSCGLTKWRNFFQEIGINDFVHVVEVNRSIANMPHDIMVNRKW 1889

Query: 1920 TKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKS 1979
              E++  G+   DWES EL HLL++L T+ N++  K+LLE+LDTLW+D  +DK++G   S
Sbjct: 1890 DPEIIFSGAMVKDWESPELTHLLTMLATHGNKESCKYLLEVLDTLWNDHLSDKVVGCCIS 1949

Query: 1980 KPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVK 2039
            K     + FQS+F+N ICD QW +SS+D + HYPKDL++DCDAVRSILG SAPY +PKV+
Sbjct: 1950 KSGDSSKQFQSAFMNSICDAQWVVSSVDKKGHYPKDLYYDCDAVRSILGASAPYALPKVQ 2009

Query: 2040 SEKLVCDIGLKTEVTIDDILEILKVWTRLEAPF------MARYYLLV 2080
            S KLV DIG KT V++DD   ILKVW R E PF      M ++Y  +
Sbjct: 2010 STKLVRDIGFKTRVSLDDTFNILKVW-RTEKPFKTSISQMCKFYTFL 2055


>gi|356528304|ref|XP_003532744.1| PREDICTED: uncharacterized protein LOC100780184 [Glycine max]
          Length = 2785

 Score = 2123 bits (5502), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1111/2029 (54%), Positives = 1423/2029 (70%), Gaps = 110/2029 (5%)

Query: 138  LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
            +E  DRA+  A   L+ AG+SVS+W VS++ LL LQVDSW++LG +MQ+VPSLH+LM+TE
Sbjct: 81   IEHADRAIANACRALLAAGDSVSAWTVSQNALLTLQVDSWNTLGIKMQQVPSLHRLMMTE 140

Query: 198  ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
             ++NAF+HCFVGV++ITSLYDLE AICK+EG++ FEEL LGPFLR PL+ HYF ++SDV 
Sbjct: 141  GKVNAFVHCFVGVQRITSLYDLEVAICKNEGVDDFEELGLGPFLRHPLVIHYFSLRSDVT 200

Query: 258  VVFKIATNDVIVCLSEYTDTHKAKD-IKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN 316
             V+KI T ++I  L E+ D  ++ + IKV++FLDFIA KR +  KE LG+RIQNLGMHI 
Sbjct: 201  QVYKITTEEIIQLLIEFLDASRSNEFIKVEQFLDFIANKRLVECKEWLGIRIQNLGMHIY 260

Query: 317  FIREARKSQDVTLKKFLK----------ELQPHHKRRKRPIFSSEKKRQLDERFSAICE- 365
             IREAR S+   +++  K          +L    +R  R +  S ++   +   SAI E 
Sbjct: 261  AIREARNSEQSAMREVRKLGQSARREARKLGQSARREARKLGQSARREARNSEQSAIQEA 320

Query: 366  --------------------------------RVKSFSS-----INEDFGA--------- 379
                                            R ++  S     ++E F A         
Sbjct: 321  RNSEQSAVQEASNSEQSAYEKCLESFLKNGKFRYRTIPSSQKKQLDERFNAITQRVESFS 380

Query: 380  --------KHIRFVSSSSEDEDSDDCTYECSNDISSNVQL-PSQ--IKGSDRVSSCPYPS 428
                    KH RF+SS+SEDEDSD  T E SN+I    Q  PS    + S+RVSSCPYPS
Sbjct: 381  PVKKSFCGKHKRFMSSASEDEDSDSSTDEQSNNIIKGSQSNPSSQFTRSSERVSSCPYPS 440

Query: 429  VTEELKRLGLKGEI-NHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKA 487
             TEE  RLG++ ++  H L    NS+ +  +    +KKRK EN+  T SAP K  +RNK 
Sbjct: 441  ATEEKARLGVRSDMAGHSLV---NSNLKKGFSEQPRKKRKFENATSTRSAPYKLRKRNKL 497

Query: 488  KQRALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPT 547
                 PI +G++T+     + D+S SN+S++MF+TTWK AC  + ++EVLE M QF    
Sbjct: 498  GV-VTPINTGNKTKVSTNIDEDLSISNDSLQMFVTTWKMACSEHKVAEVLEMMLQF---- 552

Query: 548  DPKKAARYFKR-MKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGN 606
               K  R+ KR +K +FSSYP IGLLN AV+SIK GM +++YDT QAI    L N+ +  
Sbjct: 553  --SKVNRFQKRKIKNLFSSYPFIGLLNAAVSSIKSGMRNNIYDTFQAIIDNGLGNSPTKG 610

Query: 607  SLEYESIEIEPSEQVAVVNTKHTVQD---IQVEEVMKKVSKYLEFDNSILNNAQSPVTKI 663
            S EY++I++ P ++   V TK   ++   I  ++V++K+  Y +  N I  N+   + + 
Sbjct: 611  S-EYDTIDVGPGQENVPVITKDNTENTKCISSDDVIRKIGTYFDHGNDINRNSNDSLVQY 669

Query: 664  -IILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSS 722
             I+L RK CSCE W+ EQF +K+F SLGYG+F +FLEKH ++L  EL KL   DT + SS
Sbjct: 670  RIMLWRKFCSCENWVAEQFGMKKFDSLGYGDFLSFLEKHPNLLPHELLKLLVGDTCENSS 729

Query: 723  LEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETV 782
                +  + L+ LVSQA + LWE+E ITKQMIS LL RQFP I+F++ + GS+   L+TV
Sbjct: 730  FRACMSSNQLIALVSQALSGLWENETITKQMISMLLMRQFPSINFELVENGSLVDLLDTV 789

Query: 783  GKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDA 842
              + + V SKCV+FS TI+    +GD S   +N+  E  +  + T  + K +E+V +K+A
Sbjct: 790  KGHTSSVTSKCVVFSATIIEKNYNGDSSSDRDNNWSEIPTDRSETSHK-KSTETVIAKNA 848

Query: 843  IEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSA 902
            IE+LL+AP LSDL+ WSHWD  FAP LGP   WLLN+VN KEL CLVTRDGKVIRIDHSA
Sbjct: 849  IEVLLKAPMLSDLSKWSHWDLRFAPFLGPFISWLLNDVNTKELFCLVTRDGKVIRIDHSA 908

Query: 903  SVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEV 962
            ++DSFLEAA++GSSFQTAV LLS  +L GGEK VPL LLKCH+ HAFEVMF+N+VED+EV
Sbjct: 909  TLDSFLEAAVQGSSFQTAVHLLSLISLVGGEKYVPLSLLKCHSCHAFEVMFRNSVEDVEV 968

Query: 963  INSQNCRMHGNV--LCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
             N  N  +H +V  L   +     +   +     K + K  K   + SRF LDCLG LP+
Sbjct: 969  SNDGNA-LHQSVEALSKTKFLTEISTAKMRSVFSKHMHKVSKVASILSRFVLDCLGNLPA 1027

Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
            EF SFA+DVLLSG+QS  KDA S IL EC   E RLMLHE+GLSLGI EWI+DYHA  S 
Sbjct: 1028 EFHSFASDVLLSGMQSVFKDAASTILCECSNMEQRLMLHEIGLSLGISEWINDYHALISN 1087

Query: 1081 GTSDLLMPCV-VTCTNAATSGLNSG----------SGCAEGSLFESVGADVHIEECGAIC 1129
             +SD+   C  V+C   AT+ +N+           S   E ++  S+     IE C  I 
Sbjct: 1088 NSSDI--HCARVSCLKDATTDINTSLKLDQVTLDKSPIPEANMVTSLVPHRLIEGCTEII 1145

Query: 1130 DTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHA 1189
            +T+  E S+D    C      +  E  DA+ ++ESIRRDEFGL  ++S+++S MLKKQHA
Sbjct: 1146 ETVDPEKSNDESNTCCLGNSFQHVEDMDASRLIESIRRDEFGLDSSLSDIDSCMLKKQHA 1205

Query: 1190 RLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFS 1249
            RLGRALHCLSQELYSQDSHF+LELVQNADDN YPENVEPTLTFIL++SGIVVLNNE+GFS
Sbjct: 1206 RLGRALHCLSQELYSQDSHFILELVQNADDNNYPENVEPTLTFILRDSGIVVLNNERGFS 1265

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
            A+N+RALCDVGNSTKKGS+AGYIG+KGIGFKSVFRVTDAPEIHSNGFHVKFD SEGQIGF
Sbjct: 1266 AQNMRALCDVGNSTKKGSTAGYIGKKGIGFKSVFRVTDAPEIHSNGFHVKFDISEGQIGF 1325

Query: 1310 VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
            VLPT+VPP +I +  R+ S D    +   WNTCI LPFR+  SEG+AMN+++ MFSDLHP
Sbjct: 1326 VLPTVVPPCDIGVLRRMASTDTELCDDNPWNTCILLPFRSHLSEGMAMNSVLSMFSDLHP 1385

Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRP 1429
            SLLLFLHRL+CI  RN+LND+L V++K+I GDGIIKVS G++K+ WFV SQKL+   IR 
Sbjct: 1386 SLLLFLHRLKCIKLRNLLNDTLTVMKKEISGDGIIKVSHGKEKIVWFVVSQKLQTNSIRF 1445

Query: 1430 DVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG 1489
            DV+TTEI++A TLQES+ G Y P   QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG
Sbjct: 1446 DVQTTEISMAFTLQESDNG-YIPCSDQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG 1504

Query: 1490 NSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRM 1549
            +SPWNQWLLSE+P LFV A+R FC+LPCFR  P K  S +MSFVPLVGEVHGFFS LPR+
Sbjct: 1505 DSPWNQWLLSEYPNLFVRAQREFCELPCFRSEPGKGLSAFMSFVPLVGEVHGFFSSLPRL 1564

Query: 1550 ILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSD 1609
            I+SKLRM NCL+++G+NN+WAPPCKVLRGW ++  +L+PD +L +HLGL +L ++IVLSD
Sbjct: 1565 IISKLRMMNCLLVDGDNNEWAPPCKVLRGWTEQVRNLIPDNMLLEHLGLIYLDRNIVLSD 1624

Query: 1610 SLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQS 1669
             LARALGIEE+GP IL++++SSLC T++GL SM +SWLAS LN L    F+SSG   +  
Sbjct: 1625 ELARALGIEEFGPNILVRVLSSLCHTKSGLISMDMSWLASCLNILSVTMFNSSGSVPINF 1684

Query: 1670 GVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSD--CSVFDGGFGLEAFPNLCAKLRT 1727
             ++ D+  NLQ++PFIPLSDGT+SSVDEGTIWLH +   + FDG   +EAFPN+CAKLRT
Sbjct: 1685 EMK-DVQKNLQKMPFIPLSDGTYSSVDEGTIWLHFNHLNTGFDGEHKIEAFPNICAKLRT 1743

Query: 1728 VSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDK 1787
            VSP L SAS+    SL V  +DN+ R+L  IGVQQLS HD+VK+HILPA+SDET AN ++
Sbjct: 1744 VSPFLFSASS-GTPSLNVTFLDNVTRLLQSIGVQQLSVHDVVKLHILPALSDETMANKNR 1802

Query: 1788 NLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGN 1847
             LM +Y+CFVM+HL   C +C +ERE I+SE R K+ +LTN+GFK P+EIPIHF   FGN
Sbjct: 1803 VLMIEYVCFVMLHLNSSCSDCFIEREHIISEFRCKSLLLTNYGFKCPAEIPIHFCTGFGN 1862

Query: 1848 PVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDV 1907
            PV+  ML   + ++W+EVDI+YL HP  ES+S  L+KWR FF++ GITDF QVVQVDK V
Sbjct: 1863 PVTPKMLADSVSMRWHEVDISYLSHPVNESVSSALIKWRDFFEKFGITDFAQVVQVDKSV 1922

Query: 1908 ADISHTGFKN-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWD 1966
             DI    FK  MW + L+S  S   DWES+E+V L+SLL+ + N +  K+LLE+LDTLWD
Sbjct: 1923 VDICDVTFKQMMWDRGLISAESIVKDWESSEIVQLVSLLSKSGNLENCKYLLEVLDTLWD 1982

Query: 1967 DCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSI 2026
             CY++K  G+F  K  GD   F+S+FI  +CDIQW +S+MDDELHYPKDLF+DC+ VR +
Sbjct: 1983 VCYSNKTTGYFYLKSVGDGHPFKSTFICSLCDIQWVVSTMDDELHYPKDLFYDCETVRML 2042

Query: 2027 LGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVWTR-LEAPFMA 2074
            LG  APY VPKVKSE+LV D G KT VT+DDI ++LK W +  + PF A
Sbjct: 2043 LGDFAPYAVPKVKSERLVKDFGFKTRVTLDDIFDVLKAWRKSSKTPFKA 2091


>gi|357518761|ref|XP_003629669.1| hypothetical protein MTR_8g085280 [Medicago truncatula]
 gi|355523691|gb|AET04145.1| hypothetical protein MTR_8g085280 [Medicago truncatula]
          Length = 2812

 Score = 2105 bits (5453), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1128/2104 (53%), Positives = 1450/2104 (68%), Gaps = 106/2104 (5%)

Query: 45   IFFPN---PALQLLQNLTNIP----FQNPNFDIQNPNLFM---QNANLLTQNPNLPPQQP 94
             F+PN   P+     +L N P    F  P     NP  F    QN   +   P  P    
Sbjct: 22   AFYPNYYSPSHNF--HLPNSPWPHSFPPPQ---HNPYAFAPPPQNPPGVVPPPQNPYAHR 76

Query: 95   PSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFLERIDRAVVKARSELIM 154
            P +      +   + P  S  + P   QP S P Q     K+ +++ + A  KA  EL+ 
Sbjct: 77   PQNPPPSTSTAPPNKPSGSAPRPP--NQPDSTPRQQNP--KQAIDKAENASSKACRELLA 132

Query: 155  AGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQIT 214
            AG+SVS+WKVS+  LL L+VDS +SLG +MQ+VP+LH+LM+TE ++NAF+HCFVGVR+IT
Sbjct: 133  AGDSVSAWKVSQKALLTLKVDSLNSLGIKMQQVPTLHRLMITEGKVNAFVHCFVGVRRIT 192

Query: 215  SLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEY 274
            SLYDLE AICK+EG++ FEEL LGPFLR PL+ HYF V+S+V  V KI + ++I  LSE+
Sbjct: 193  SLYDLELAICKNEGVDSFEELGLGPFLRHPLVIHYFSVRSNVAEVCKITSEEIIQFLSEF 252

Query: 275  TDTHKAKD-IKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFL 333
             D  KAK  + V+EFL+FIAKKRS+ S E LG+RIQNLGMHI  I+EARKS+  TL+K L
Sbjct: 253  LDVSKAKAVVGVEEFLEFIAKKRSVESMELLGIRIQNLGMHIAAIKEARKSEQSTLEKCL 312

Query: 334  KELQPHH-KRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDE 392
            K L+    K  KRPI SS+KK QLDERFS I +RV+SFSS+ + F  KHIRF SSSSEDE
Sbjct: 313  KTLRSKSGKLGKRPISSSQKK-QLDERFSTIAQRVESFSSVEKSFCGKHIRFTSSSSEDE 371

Query: 393  DSDDCTYECSND--ISSNVQLPSQIKGS--DRVSSCPYPSVTEELKRLGLKGEINHQLTS 448
             SD  T    ND  I ++   PS   G   +R+SSCPYPS TEE+ RL +KG+   Q  S
Sbjct: 372  GSDYYTDNDQNDSIIMNSWSNPSSQFGRSLERMSSCPYPSATEEMARLVVKGD--KQGGS 429

Query: 449  AGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQDDELNEA 508
              N S ++++    +KKRK E        P K       K    P ++G+ T+ + ++E 
Sbjct: 430  LSNGSLKNEFTEPPRKKRKSE-LDLPDPLPFKHFNFKYKKVDPTPTKNGNATEINTMDEC 488

Query: 509  DISFSNESMRMFITTWKEAC------------------------KNNTMSEVLEKMFQFY 544
             +S ++++++MF+TTWKEAC                        K+N +  ++E M QFY
Sbjct: 489  -LSITDDALQMFVTTWKEACLEHNVGESSLPKGIRKVFTEPTHQKDNYILLMVEMMLQFY 547

Query: 545  KPTDPKKAARYFKRMKLMFSSYP-CIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTV 603
                PK      ++++++F  YP  IGLLN AV++IK GMW+S+YDT Q     EL+N+ 
Sbjct: 548  G-VKPKGK----RKIRMLFVEYPFLIGLLNAAVSAIKSGMWNSIYDTFQVFNHSELSNSP 602

Query: 604  SGNSLEYESIEIEPS-EQVAVV--NTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPV 660
            + +S E+E+I++ PS E V +V  ++    + I  E+V+ K+  Y + DN +  N+   V
Sbjct: 603  TKSS-EFETIDVGPSLENVPLVSKDSAENTKCISAEDVVGKIGMYFDLDNEVYRNSDWQV 661

Query: 661  TKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDR 720
               I+ LRK C+CE+WL EQF +K F SLG+G+  +FLE + + L  EL KL   D  + 
Sbjct: 662  KYKIMFLRKFCNCESWLAEQFGVKNFNSLGHGDLLSFLENNVNQLPHELLKLLGGDMCEN 721

Query: 721  SSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLE 780
            S+ +  +  + LV L+SQA  +LWE+E +TKQ+IS LL RQFP I F+  + GS+E  L+
Sbjct: 722  STFKACMSTNELVALLSQAICSLWENETVTKQIISMLLMRQFPSIGFEFLESGSLEDLLD 781

Query: 781  TVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSK 840
            TV ++++ V SKCV+FS  ++      D     +N+L   T+  +  G + K SE++T+K
Sbjct: 782  TVREHKSNVTSKCVVFSAAMIEEHCDVDSLRDGDNNLSGITTDTSEIGHKTKSSETITAK 841

Query: 841  DAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDH 900
             AIE+LL++P LSDL+ WSHWD +FAPSLG L  WLLN+VN +ELLCLVTRDGKVIRIDH
Sbjct: 842  KAIEMLLKSPMLSDLSKWSHWDLMFAPSLGSLISWLLNDVNSEELLCLVTRDGKVIRIDH 901

Query: 901  SASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDI 960
            SA++ SFLEAA++GSSF+TAV LLS  +L GG++ VPL LLK  A  AFEVMF+N +EDI
Sbjct: 902  SATLKSFLEAAVQGSSFRTAVSLLSLISLVGGKRKVPLSLLKRDACSAFEVMFRNFLEDI 961

Query: 961  EVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
            EV + +N R     L   +     +   +S E  K L K  KAV + SRF LDCLGYLP+
Sbjct: 962  EVCDDKNARQSEEALRKTKILTEVSTAKMSDEFGKHLHKVNKAVSILSRFVLDCLGYLPA 1021

Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
            EF SFAADVLLSG++S  KDA +AIL EC   E  LMLHE+GLSLGI EWI+DYHAF S 
Sbjct: 1022 EFHSFAADVLLSGMRSVFKDAAAAILCECRNMEQHLMLHEIGLSLGITEWINDYHAFISN 1081

Query: 1081 GTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECG------AICDTICG 1134
             TSD    C+       ++GL  G G  + S    V     +  CG       I  T+ G
Sbjct: 1082 DTSD-HASCLKDAKTEISTGLKHGQGILDNSDVPEVNMVTSLVPCGLNEICSEISQTVDG 1140

Query: 1135 EASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRA 1194
            E S+D       +   ++ +  D+ L++ESIRRDEFGL P++S+++S MLKKQHARLGRA
Sbjct: 1141 EKSNDESMTSHLEDSFQNGKDVDSTLVIESIRRDEFGLDPSLSDIDSCMLKKQHARLGRA 1200

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIR 1254
            LHCLSQELYSQDSHF+LELVQNADDN YPENVEPTL FILQ+SGIVVLNNE+GFSA+N+R
Sbjct: 1201 LHCLSQELYSQDSHFILELVQNADDNNYPENVEPTLAFILQDSGIVVLNNERGFSAQNMR 1260

Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
            ALCDVGNSTKKGSS GYIG+KGIGFKSVFRVTDAPEIHSNGFHVKFD SEGQIGFVLPTL
Sbjct: 1261 ALCDVGNSTKKGSSTGYIGKKGIGFKSVFRVTDAPEIHSNGFHVKFDISEGQIGFVLPTL 1320

Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
            VPP +I +  R+            WNTCI LPFR+  S+G  MNNI+ MFSDLHPSLLLF
Sbjct: 1321 VPPCDIGLLRRMAFTGTDSYGDNPWNTCIMLPFRSHLSDGAVMNNIMTMFSDLHPSLLLF 1380

Query: 1375 LHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
            LHRL+CI  RN+LND+++V++K+I  DGIIKVS G+++M WFV SQKL+   IR DV+TT
Sbjct: 1381 LHRLKCIKLRNLLNDTIIVMKKEISEDGIIKVSHGKERMAWFVVSQKLQTNSIRFDVQTT 1440

Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWN 1494
            EI++A TLQES++G Y P L QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG+SPWN
Sbjct: 1441 EISMAFTLQESDDG-YSPCLDQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGDSPWN 1499

Query: 1495 QWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKL 1554
            QWLLSE+P LFV A R FC+LPCFR  P K  S +MSF+PLVGEVHGFFS LPR+I+SKL
Sbjct: 1500 QWLLSEYPNLFVKAVREFCELPCFRSEPGKGLSAFMSFIPLVGEVHGFFSTLPRLIISKL 1559

Query: 1555 RMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARA 1614
            RM NCL++EG+N  WA PCKVLRGW ++   LLPD +L +HLGL +L K+++LSD+LARA
Sbjct: 1560 RMMNCLLVEGDNKGWASPCKVLRGWTEQVRCLLPDEILLEHLGLRYLDKNVLLSDTLARA 1619

Query: 1615 LGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETD 1674
            LGIEE+GP +L++++SSLC T+N L SM +SWLAS+LN LY + F SSG  S+   ++ D
Sbjct: 1620 LGIEEFGPSVLVRVMSSLCYTKNWLISMNMSWLASFLNTLYVLMFDSSGTISINFEIKDD 1679

Query: 1675 LIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDC--SVFDGGFGLEAFPNLCAKLRTVSPAL 1732
            ++  L++ PFIPLSDGT+SSVDEGTIWL S+   + FDG   +EAFPNL AKLRTVSP+L
Sbjct: 1680 ILKRLKKTPFIPLSDGTYSSVDEGTIWLQSNTFNTGFDGEHKIEAFPNLFAKLRTVSPSL 1739

Query: 1733 LSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMAD 1792
            LSA A D SSL V S+DN+ R+L  IGVQQLSAHD+VK+HILP +SDE  AN +K LM +
Sbjct: 1740 LSA-ASDTSSLNVTSLDNVTRLLQTIGVQQLSAHDVVKLHILPVLSDEAMANKNKMLMIE 1798

Query: 1793 YLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVN 1852
            Y+CFVM++L+  C +C  +RE I+SELR K+ +LT+ GFK PS+IPIHF   FGNPV+  
Sbjct: 1799 YICFVMLYLKSTCSDC--DREDIISELRYKSLLLTDCGFKCPSKIPIHFCPGFGNPVTPK 1856

Query: 1853 MLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISH 1912
            +L   ++++W+EVDI+YL+HP  ES+S  L+KWR+FF+EIGITDF Q+VQVDK   DI  
Sbjct: 1857 ILADAVNMRWHEVDISYLQHPVNESVSSSLIKWREFFEEIGITDFAQIVQVDKTAVDICD 1916

Query: 1913 TGFKN-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTD 1971
              FK  MW + L+S  S   DWES E+V L SLL+ + N+   K+ LE+LDTLWD CY+D
Sbjct: 1917 ATFKQVMWDRGLISAESIVKDWESPEIVQLGSLLSKSGNQGNCKYFLEVLDTLWDACYSD 1976

Query: 1972 KIMGFFKSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSA 2031
            K  G F SK  GD   F+S+FI+ +CDI+W +S++DDELHYPKDLFHDC+AVR  LG  A
Sbjct: 1977 KARGCFYSKSVGDGHPFKSTFISNLCDIRWVVSTLDDELHYPKDLFHDCEAVRQTLGTFA 2036

Query: 2032 PYIVPK----------------------------VKSEKLVCDIGLKTEVTIDDILEILK 2063
            PY VPK                            VKSE+LV DIGLKT VT+DDIL+ILK
Sbjct: 2037 PYAVPKVSCFVHLCDTVVGNIYGLLTWVSGLLSVVKSERLVNDIGLKTRVTLDDILDILK 2096

Query: 2064 VWTR 2067
             W +
Sbjct: 2097 AWRK 2100


>gi|449520493|ref|XP_004167268.1| PREDICTED: uncharacterized LOC101204025, partial [Cucumis sativus]
          Length = 2143

 Score = 2096 bits (5430), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1102/2087 (52%), Positives = 1436/2087 (68%), Gaps = 105/2087 (5%)

Query: 21   GGGQPPSQPPPPHFNPNFAFQNPNI--FFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQ 78
            G GQ   + PP    P+  + N N+  F  +P     QN+ N P QN  F    P+ F  
Sbjct: 12   GSGQRQQRLPPQMQFPDAPYLNHNVPFFQTHPGFIPYQNV-NFPMQNTGFQFSPPSNFGL 70

Query: 79   NANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFL 138
            +     Q+PN                     PG   ++     QP+S     +K  K+ L
Sbjct: 71   H---FPQSPN---------------------PGVRLRKASTASQPES---STKKAGKD-L 102

Query: 139  ERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEA 198
             +I+ AV  A + L+ AGES+S+WKV +S L  LQVDSW SLG++MQE+  LH++MV E 
Sbjct: 103  SKINHAVATAHTNLLEAGESISAWKVVQSALSILQVDSWDSLGYRMQEISDLHRIMVREG 162

Query: 199  RINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDV 258
            +IN+FIHCFVGVR+IT+LY+LEEAICK EG+ +F+EL LGPFLR PL+ HYF V S+V  
Sbjct: 163  KINSFIHCFVGVRRITTLYELEEAICKYEGVGEFKELRLGPFLRHPLVLHYFSVDSEVTE 222

Query: 259  VFKIATNDVIVCLSEYTDTHKA--KDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN 316
            VFKI   D+I  L E+ D      K I V++FL+FI++K S+   E LG+R+Q LG +++
Sbjct: 223  VFKITGEDIISFLFEFMDVDACSNKKIAVEKFLEFISRKLSVEKWEMLGIRVQKLGDYVS 282

Query: 317  FIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINED 376
             +++AR+S+  T  K+L +    H  +KRP+FS +K++ LDERFSAI +RV+SFSS+++D
Sbjct: 283  VVKKARRSEADTFSKYLTKYDEKHG-KKRPLFSLQKEK-LDERFSAISQRVESFSSVHKD 340

Query: 377  FGAKHIRFVSSSSEDEDSDDCTYE---CSNDISSNVQLPSQI-KGSDRVSSCPYPSVTEE 432
            F  KH+RF SSSS DE S D TYE      D S++++L  QI K +DR SSCPYPS TEE
Sbjct: 341  FCGKHVRFDSSSSGDEGSGDSTYEEKKVDTDSSNDLKLSLQINKSADRPSSCPYPSATEE 400

Query: 433  LKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR-A 491
            + RLGLKGE++   T  G+   Q   IGS   KRK  +     +         K K+  +
Sbjct: 401  MMRLGLKGEVSGHATPIGS---QKHCIGSGTLKRKRRSRSRGHTTSGTSTSSKKFKEDCS 457

Query: 492  LPIESG-DQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
            LP++     +++    EA+   +N S+RMF+T WK+ C++ T++E               
Sbjct: 458  LPVKDDFGNSEEGSGFEAEYEITNNSLRMFVTMWKDGCRDMTVAE--------------- 502

Query: 551  KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
                                     V+SIK GMWDS+YD+LQ I   EL NT       Y
Sbjct: 503  -------------------------VSSIKNGMWDSIYDSLQIINLPELTNTNVKKQPVY 537

Query: 611  ESIEIEPSEQVAVV-NTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRK 669
            + I++ PS + A++ +   +  DI VE+++ KV ++++F+  I ++ +  +   I +L++
Sbjct: 538  DCIDVGPSAEGALIKHVPKSTHDITVEDILNKVGQHIKFEQEIRSDGRLLMENRIQILKQ 597

Query: 670  LCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQ 729
            LCSCE WL EQ+  K+FKSLG+G+FF+FLEKHASML  EL K    +  +RS LE  ++Q
Sbjct: 598  LCSCEFWLVEQYSAKDFKSLGHGDFFSFLEKHASMLPDELHKFLVPEISERSPLEACMLQ 657

Query: 730  HLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEV 789
              LV LVSQA NNL E+EII+KQMI  LL +QFPLI FK+ + G +E F+E VG+ +N V
Sbjct: 658  RQLVTLVSQACNNLSENEIISKQMIYNLLTQQFPLIDFKLTENGPLEDFMELVGQQKNSV 717

Query: 790  MSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIEILLRA 849
            +SKCV FS ++LG     D     +N  L  TS     G      +SVTS+DAI++L+RA
Sbjct: 718  VSKCVTFSVSLLGGSHLRDSLATMDNDSLGATSFSAEAGQGLGAIKSVTSEDAIKVLVRA 777

Query: 850  PFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLE 909
            P L DLN WSHWD LFAP+LGPL  WL  EVN +  +C+VT++GKVIRIDH+A+ DSFLE
Sbjct: 778  PMLLDLNLWSHWDLLFAPALGPLVPWLQKEVNTENFMCMVTKEGKVIRIDHTATADSFLE 837

Query: 910  AALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCR 969
            AAL+GS F TAVKLLS FAL GGEK V L LLK HA  AFEV+ KN+VE+IE+   +N  
Sbjct: 838  AALQGSPFHTAVKLLSIFALLGGEKYVLLSLLKHHASRAFEVIMKNSVENIEMF--ENWG 895

Query: 970  MHGNVLCGRQNF-DVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAAD 1028
                 +   QNF +     NLS E++K++    KA+ + SRFF+DCLGY+P EFR  AA+
Sbjct: 896  QGLEKVAFHQNFIEQVAAGNLSLELKKKIDMRNKAISLLSRFFVDCLGYIPVEFRYLAAN 955

Query: 1029 VLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMP 1088
            +LLSG+ S +KDA SAIL EC + E RLMLHE+GLSLG+ EWI DYH   S+ +SDL   
Sbjct: 956  ILLSGITSVVKDAASAILHECWKPEQRLMLHEIGLSLGVPEWIQDYHTVSSSASSDLFTD 1015

Query: 1089 CVVT-----CTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEASDDGLGD 1143
              +        N    GL +    +E +   S+  +V  E+          + S+D  G 
Sbjct: 1016 ACLNDRSEINRNVHRDGLLTKYSTSEQNASFSIEENVFNEKLSVSSANCTAKTSNDANGL 1075

Query: 1144 CTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELY 1203
                   E    +DA  I++ IRRDEFGL  ++   E+ ML+KQHARLGRALHCLSQELY
Sbjct: 1076 SCMSLASEPDGNKDAVEIIQCIRRDEFGLDLDLPISETGMLRKQHARLGRALHCLSQELY 1135

Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
            SQDSHFLLELVQNADDNIYP++VEPTL FI +ESGIVVLNNE+GFSA+NIRALCDVGNST
Sbjct: 1136 SQDSHFLLELVQNADDNIYPQSVEPTLAFIFEESGIVVLNNEEGFSAKNIRALCDVGNST 1195

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMF 1323
            KKGS+AGYIG+KGIGFKSVFR+TDAPEIHSNGFHVKFD SEGQIGFVLPT++ P N++++
Sbjct: 1196 KKGSNAGYIGKKGIGFKSVFRITDAPEIHSNGFHVKFDISEGQIGFVLPTIISPCNVNLY 1255

Query: 1324 CRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAM-NNIVDMFSDLHPSLLLFLHRLQCIM 1382
             +L +      ++  WNTCI LPFR+K S G+ + NNIV+MF+DLHPSLLLFLHRLQCI 
Sbjct: 1256 GKLATSASDHEDTNIWNTCIVLPFRSKLSGGVNLINNIVNMFADLHPSLLLFLHRLQCIK 1315

Query: 1383 FRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL 1442
             RN++++SL+V+RK+IVG+GII+VS GE+KMTW V SQKL+A VIR DV++TEI++A TL
Sbjct: 1316 IRNLIDNSLIVMRKEIVGNGIIRVSHGEEKMTWLVVSQKLKADVIRHDVQSTEISIAFTL 1375

Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
             E   G   PLL+QQPVFAFLPLR YGLKFI+QGDFVLPSSREEVDG+SPWNQWLLSEFP
Sbjct: 1376 HEEENGVISPLLHQQPVFAFLPLRMYGLKFIIQGDFVLPSSREEVDGDSPWNQWLLSEFP 1435

Query: 1503 ALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
             LFVSA  SFC LPCF   P KA S YMS++PL+GEVHGFFS LPR+I+SKLRMSNCL+L
Sbjct: 1436 GLFVSAVESFCSLPCFESCPGKAISAYMSYIPLIGEVHGFFSSLPRLIISKLRMSNCLLL 1495

Query: 1563 EGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGP 1622
            EG  N+WAPPCKVLRGWN++A +LLPD LL+++LGLGFL KDI+LSDSLARALGIEEYGP
Sbjct: 1496 EGKENEWAPPCKVLRGWNEQALTLLPDNLLREYLGLGFLHKDIILSDSLARALGIEEYGP 1555

Query: 1623 KILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRI 1682
            KIL+Q +SSLC+  N L+SMGL WL S L+ L+ +   SSGQ++L+     DLI +LQ++
Sbjct: 1556 KILVQFMSSLCQKYNSLKSMGLFWLGSCLSVLHNMLLQSSGQTTLELERNADLIRSLQKV 1615

Query: 1683 PFIPLSDGTFSSVDEGTIWLHSDCS--VFDGGFGLEAFPNLCAKLRTVSPALLSASAVDK 1740
            P IPLSDGT+SSV EGTIWLHSD S    DG +GLEAFP L +K+R V PA LS  +VD 
Sbjct: 1616 PLIPLSDGTYSSVAEGTIWLHSDSSNATVDGKYGLEAFPYLNSKIRVVCPAFLSLFSVDN 1675

Query: 1741 SSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMH 1800
            S + V SV N++ ML +IGVQ+LSAH+I+K HI+PAI++E+  NG+K LM +Y+CFVM H
Sbjct: 1676 SQIDVPSVGNISWMLYRIGVQRLSAHEIIKEHIIPAITNESNLNGNKILMTEYVCFVMTH 1735

Query: 1801 LEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDI 1860
            L   CP CH++R FI+SELR KAF+LTNHG+KR  E+P+HF KE+GNP+ +N L+  +++
Sbjct: 1736 LLSSCPECHIDRGFIISELRTKAFILTNHGYKRLVEVPVHFSKEYGNPIDLNKLL-SVEM 1794

Query: 1861 KWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM-W 1919
             W+EV  TYLKHP   SLSCGL KWR FF+EIGI DFV VV+V++ +A++ H    N  W
Sbjct: 1795 NWHEVADTYLKHPVTNSLSCGLTKWRNFFQEIGINDFVHVVEVNRSIANMPHDIMVNRKW 1854

Query: 1920 TKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKS 1979
              E++  G+   DWES EL HLL++L T+ N++  K+LLE+LDTLW+D  +DK++G   S
Sbjct: 1855 DPEIIFSGAMVKDWESPELTHLLTMLATHGNKESCKYLLEVLDTLWNDHLSDKVVGCCIS 1914

Query: 1980 KPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVK 2039
            K     + FQS+F+N ICD QW +SS+D + HYPKDL++DCDAVRSILG SAPY +PKV+
Sbjct: 1915 KSGDSSKQFQSAFMNSICDAQWVVSSVDKKGHYPKDLYYDCDAVRSILGASAPYALPKVQ 1974

Query: 2040 SEKLVCDIGLKTEVTIDDILEILKVWTRLEAPF------MARYYLLV 2080
            S KLV DIG KT V++DD   ILKVW R E PF      M ++Y  +
Sbjct: 1975 STKLVRDIGFKTRVSLDDTFNILKVW-RTEKPFKTSISQMCKFYTFL 2020


>gi|34148073|gb|AAQ62582.1| unknown [Glycine max]
          Length = 2711

 Score = 1928 bits (4995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1037/1985 (52%), Positives = 1353/1985 (68%), Gaps = 98/1985 (4%)

Query: 138  LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
            +E  DRA+  A   L+ AG+SVS+W VS++ LL LQVDSW++LG +MQ+VPSLH+LM+TE
Sbjct: 81   IEHADRAIANACRALLAAGDSVSAWTVSQNALLTLQVDSWNTLGIKMQQVPSLHRLMMTE 140

Query: 198  ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
             ++NAF+HCFVGV++ITSLYDLE AICK+EG++ FEEL LGPFLR PL+ HYF ++SDV 
Sbjct: 141  GKVNAFVHCFVGVQRITSLYDLEVAICKNEGVDDFEELGLGPFLRHPLVIHYFSLRSDVT 200

Query: 258  VVFKIATNDVIVCLSEYTDTHKAKD-IKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN 316
             V+KI T ++I  L E+ D  ++ + IKV++FLDFIA KR +  KE LG+RIQNLGMHI 
Sbjct: 201  QVYKITTEEIIQLLIEFLDASRSNEFIKVEQFLDFIANKRLVECKEWLGIRIQNLGMHIY 260

Query: 317  FIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICE-RVKSFSSINE 375
             IREAR S+   +++ +++L    +R  R +  S ++       SA  E R    S+I E
Sbjct: 261  AIREARNSEQSAMRE-VRKLGQSARREARKLGQSARREARKLGQSARREARNSEQSAIQE 319

Query: 376  DFGAKHIRFVSSSSEDEDSDDCTYE-CSNDISSNVQL------PSQIKGSDRVSSCPYPS 428
               ++     S+  E  +S+   YE C      N +        SQ K  D      + +
Sbjct: 320  ARNSEQ----SAVQEASNSEQSAYEKCLESFLKNGKFRYRTIPSSQKKQLDE----RFNA 371

Query: 429  VTEELKRLG-LKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKA 487
            +T+ ++    +K     +     +S+ +D+   SS  ++       + S P+    R+  
Sbjct: 372  ITQRVESFSPVKKSFCGKHKRFMSSASEDEDSDSSTDEQSNNIIKGSQSNPSSQFTRSSE 431

Query: 488  KQRALPIESGDQTQDDELNEADI---SFSNESMRMFITTWKEACKNNTMSEVLEKMFQFY 544
            +  + P  S  + +      +D+   S  N +++               SE   K  +F 
Sbjct: 432  RVSSCPYPSATEEKARLGVRSDMAGHSLVNSNLK------------KGFSEQPRKKRKFE 479

Query: 545  KPTDPKKAA-RYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTV 603
              T  + A  +  KR KL              VT I  G      +  +  T  + + ++
Sbjct: 480  NATSTRSAPYKLRKRNKLG------------VVTPINTG------NKTKVSTNIDEDLSI 521

Query: 604  SGNSLEYESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKI 663
            S +SL+        + ++A   ++H V ++ V++V++K+  Y +  N I  N+   + + 
Sbjct: 522  SNDSLQM----FVTTWKMAC--SEHKVAEVGVDDVIRKIGTYFDHGNDINRNSNDSLVQY 575

Query: 664  -IILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSS 722
             I+L RK CSCE W+ EQF +K+F SLGYG+F +FLEKH ++L  EL KL   DT + SS
Sbjct: 576  RIMLWRKFCSCENWVAEQFGMKKFDSLGYGDFLSFLEKHPNLLPHELLKLLVGDTCENSS 635

Query: 723  LEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETV 782
                +  + L+ LVSQA + LWE+E ITKQMIS LL RQFP I+F++ + GS+   L+TV
Sbjct: 636  FRACMSSNQLIALVSQALSGLWENETITKQMISMLLMRQFPSINFELVENGSLVDLLDTV 695

Query: 783  GKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDA 842
              + + V SKCV+FS TI+    +GD S   +N+  E  +  + T  + K +E+V +K+A
Sbjct: 696  KGHTSSVTSKCVVFSATIIEKNYNGDSSSDRDNNWSEIPTDRSETSHK-KSTETVIAKNA 754

Query: 843  IEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSA 902
            IE+LL+AP LSDL+ WSHWD  FAP LGP   WLLN+VN KEL CLVTRDGKVIRIDHSA
Sbjct: 755  IEVLLKAPMLSDLSKWSHWDLRFAPFLGPFISWLLNDVNTKELFCLVTRDGKVIRIDHSA 814

Query: 903  SVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEV 962
            ++DSFLEAA++GSSFQTAV LLS  +L GGEK VPL LLKCH+ HAFEVMF+N+VED+EV
Sbjct: 815  TLDSFLEAAVQGSSFQTAVHLLSLISLVGGEKYVPLSLLKCHSCHAFEVMFRNSVEDVEV 874

Query: 963  INSQNCRMHGNV--LCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
             N  N  +H +V  L   +     +   +     K + K  K   + SRF LDCLG LP+
Sbjct: 875  SNDGNA-LHQSVEALSKTKFLTEISTAKMRSVFSKHMHKVSKVASILSRFVLDCLGNLPA 933

Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
            EF SFA+DVLLSG+QS  KDA S IL EC   E RLMLHE+GLSLGI EWI+DYHA  S 
Sbjct: 934  EFHSFASDVLLSGMQSVFKDAASTILCECSNMEQRLMLHEIGLSLGISEWINDYHALISN 993

Query: 1081 GTSDLLMPCV-VTCTNAATSGLNSG----------SGCAEGSLFESVGADVHIEECGAIC 1129
             +SD+   C  V+C   AT+ +N+           S   E ++  S+     IE C  I 
Sbjct: 994  NSSDI--HCARVSCLKDATTDINTSLKLDQVTLDKSPIPEANMVTSLVPHRLIEGCTEII 1051

Query: 1130 DTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHA 1189
            +T+  E S+D    C      +  E  DA+ ++ESIRRDEFGL  ++S+++S MLKKQHA
Sbjct: 1052 ETVDPEKSNDESNTCCLGNSFQHVEDMDASRLIESIRRDEFGLDSSLSDIDSCMLKKQHA 1111

Query: 1190 RLGRALHCLSQELYSQDSHFLLELV-----QNADDNIYPENVEPTLTFILQESGIVVLNN 1244
            RLGRALHCLSQELYSQDSHF+LELV     QNADDN YPENVEPTLTFIL++SGIVVLNN
Sbjct: 1112 RLGRALHCLSQELYSQDSHFILELVRIILVQNADDNNYPENVEPTLTFILRDSGIVVLNN 1171

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV-----FRVTDAPEIHSNGFHVK 1299
            E+GFSA+N+RALCDVGNSTKKGS+AGYIG+KGIGFKSV      +VTDAPEIHSNGFHVK
Sbjct: 1172 ERGFSAQNMRALCDVGNSTKKGSTAGYIGKKGIGFKSVPCLFPLQVTDAPEIHSNGFHVK 1231

Query: 1300 FDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN 1359
            FD SEGQIGFVLPT+VPP +I +  R+ S D    +   WNTCI LPFR+  SEG+AMN+
Sbjct: 1232 FDISEGQIGFVLPTVVPPCDIGVLRRMASTDTELCDDNPWNTCILLPFRSHLSEGMAMNS 1291

Query: 1360 IVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVAS 1419
            ++ MFSDLHPSLLLFLHRL+CI  RN+LND+L V++K+I GDGIIKVS G++K+ WFV S
Sbjct: 1292 VLSMFSDLHPSLLLFLHRLKCIKLRNLLNDTLTVMKKEISGDGIIKVSHGKEKIVWFVVS 1351

Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFV 1479
            QKL+   IR DV+TTEI++A TLQES+ G Y P   QQPVFAFLPLRTYGLKFILQGDFV
Sbjct: 1352 QKLQTNSIRFDVQTTEISMAFTLQESDNG-YIPCSDQQPVFAFLPLRTYGLKFILQGDFV 1410

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV 1539
            LPSSREEVDG+SPWNQWLLSE+P LFV A+R FC+LPCFR  P K  S +MSFVPLVGEV
Sbjct: 1411 LPSSREEVDGDSPWNQWLLSEYPNLFVRAQREFCELPCFRSEPGKGLSAFMSFVPLVGEV 1470

Query: 1540 HGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLG 1599
            HGFFS LPR+I+SKLRM NCL+++G+NN+WAPPCKVLRGW ++  +L+PD +L +HLGL 
Sbjct: 1471 HGFFSSLPRLIISKLRMMNCLLVDGDNNEWAPPCKVLRGWTEQVRNLIPDNMLLEHLGLI 1530

Query: 1600 FLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISF 1659
            +L ++IVLSD LARALGIEE+GP IL++++SSLC T++GL SM +SWLAS LN L    F
Sbjct: 1531 YLDRNIVLSDELARALGIEEFGPNILVRVLSSLCHTKSGLISMDMSWLASCLNILSVTMF 1590

Query: 1660 HSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSD--CSVFDGGFGLEA 1717
            +SSG   +   ++ D+  NLQ++PFIPLSDGT+SSVDEGTIWLH +   + FDG   +EA
Sbjct: 1591 NSSGSVPINFEMK-DVQKNLQKMPFIPLSDGTYSSVDEGTIWLHFNHLNTGFDGEHKIEA 1649

Query: 1718 FPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAI 1777
            FPN+CAKLRTVSP L SAS+    SL V  +DN+ R+L  IGVQQLS HD+VK+HILPA+
Sbjct: 1650 FPNICAKLRTVSPFLFSASS-GTPSLNVTFLDNVTRLLQSIGVQQLSVHDVVKLHILPAL 1708

Query: 1778 SDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEI 1837
            SDET AN ++ LM +Y+CFVM+HL   C +C +ERE I+SE R K+ +LTN+GFK P+EI
Sbjct: 1709 SDETMANKNRVLMIEYVCFVMLHLNSSCSDCFIEREHIISEFRCKSLLLTNYGFKCPAEI 1768

Query: 1838 PIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDF 1897
            PIHF   FGNPV+  ML   + ++W+EVDI+YL HP  ES+S  L+KWR FF++ GITDF
Sbjct: 1769 PIHFCTGFGNPVTPKMLADSVSMRWHEVDISYLSHPVNESVSSALIKWRDFFEKFGITDF 1828

Query: 1898 VQVVQVDKDVADISHTGFKN-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKH 1956
             QVVQVDK V DI    FK  MW + L+S  S   DWES+E+V L+SLL+ + N +  K+
Sbjct: 1829 AQVVQVDKSVVDICDVTFKQMMWDRGLISAESIVKDWESSEIVQLVSLLSKSGNLENCKY 1888

Query: 1957 LLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDL 2016
            LLE+LDTLWD CY++K  G+F  K  GD   F+S+FI  +CDIQW +S+MDDELHYPKDL
Sbjct: 1889 LLEVLDTLWDVCYSNKTTGYFYLKSVGDGHPFKSTFICSLCDIQWVVSTMDDELHYPKDL 1948

Query: 2017 FHDCDAVRSILGPSAPYIVPK----VKSEKLVCDIGLKTEVTIDDILEILKVWTR-LEAP 2071
            F+DC+ VR +LG  APY VPK    VKSE+LV D G KT VT+DDI ++LK W +  + P
Sbjct: 1949 FYDCETVRMLLGDFAPYAVPKVSFLVKSERLVKDFGFKTRVTLDDIFDVLKAWRKSSKTP 2008

Query: 2072 FMARY 2076
            F ARY
Sbjct: 2009 FKARY 2013


>gi|186511742|ref|NP_193111.2| Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Arabidopsis thaliana]
 gi|332657921|gb|AEE83321.1| Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Arabidopsis thaliana]
          Length = 2729

 Score = 1846 bits (4782), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1028/2132 (48%), Positives = 1348/2132 (63%), Gaps = 185/2132 (8%)

Query: 26   PSQPPPPHFNPNFAFQN-PNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLT 84
            P + P P F  NF   N P+    NP  Q  QNL  +  Q P+      N F+QN N   
Sbjct: 50   PIRYPIPQFPANFYRPNFPDFSLGNPNFQPHQNLNFLHQQIPHQYGSAANHFLQNHN--- 106

Query: 85   QNP-NLPPQQPPSS--ACNQQRSQTQHPPGASNQQRPLTQQ----PQSLPPQPQKLNKEF 137
            QN  + PPQ  P++  + +Q     ++      +Q  + Q     P+S     +  N  F
Sbjct: 107  QNSFSFPPQSIPNNDISISQNHGAFENSSLKRRRQEEVVQVTDVVPKSNFASGESANNSF 166

Query: 138  --------------------------------LERIDRAVVKARSELIMAGESVSSWKVS 165
                                            + RID+AV K R   + AGESVSS +VS
Sbjct: 167  SVSLPIPIATDDSGVSRVHGEKSSGKPKRKVDVLRIDKAVNKTRKLFVAAGESVSSTRVS 226

Query: 166  RSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICK 225
            R+VL +LQ DSW SLG QMQ+VPSL +LM  E +INAFIHCFVG R+I +L+DLE AIC+
Sbjct: 227  RAVLEELQADSWRSLGVQMQDVPSLRQLMAIEGKINAFIHCFVGARRIVTLHDLEVAICR 286

Query: 226  SEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKV 285
            +E ++ F++LELGP L+ PL+  YF   S      KI + ++I  L  Y  T+  +D+K+
Sbjct: 287  NEFVDSFDDLELGPLLQHPLVLLYFPSISSSTGPVKITSEEIISFLDSYLHTYMTEDVKL 346

Query: 286  DEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRKR 345
            DEFL+F+A ++S+ SKE+LGVRIQ+L M+++FI +A++ +  TLK  L EL   H++   
Sbjct: 347  DEFLNFVASQKSVTSKEKLGVRIQSLRMYVSFILDAKRQEGETLKVLLTEL---HQKYHI 403

Query: 346  PIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSNDI 405
            P   S KK+Q D+  + + ER  SF+  ++D+  KHIRF SSSS   D +D  YE  N  
Sbjct: 404  P---SSKKQQRDKSLT-VSERADSFALHHKDYCGKHIRFDSSSS---DENDNVYEVRN-- 454

Query: 406  SSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKK 465
                     +  SD ++SCPYPSV EE+KRLG                        S KK
Sbjct: 455  ---------LNSSDHINSCPYPSVAEEMKRLG-----------------------GSNKK 482

Query: 466  RKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQDDE-------LNEADISFSNESMR 518
            RK E      S  +K LR++ +K +    +   +  DD        ++EAD + S   +R
Sbjct: 483  RKGERRNHEKSDSSKLLRKSPSKLQGHAKQEIPKLADDSEAKKVFSVDEADFTLSEGDLR 542

Query: 519  MFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTS 578
            +FI+TWK+ CK  ++S  +EKM  FY     +  A+  KR K M SS+P +GLLN+AVTS
Sbjct: 543  LFISTWKDTCKELSISTFVEKMLSFYNLGGSEGRAQ-IKRAKAM-SSFPFVGLLNVAVTS 600

Query: 579  IKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQDIQ---V 635
            ++RGMWDS+YD  Q  +  +  NT SGN +     EI P E   +  T+H +       V
Sbjct: 601  LRRGMWDSIYDNFQMTSLSDTTNTGSGNQVG----EINPIENSELSKTQHVMPPTHCNTV 656

Query: 636  EEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFF 695
            EE+++++S Y E D   L+ A     K I + RKL +CE  L EQF++++F+SLG+G FF
Sbjct: 657  EEIIRRLSLYFEHD---LSGA-----KHIGIFRKLQTCENLLAEQFQVQDFESLGWGGFF 708

Query: 696  TFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMIS 755
             FLEKH  +L T+LQ+  + +  +   LEV + ++LL +L+SQAS      +++++Q ++
Sbjct: 709  AFLEKHMLLLPTQLQRFLSRELQEEFPLEVHVNENLLTLLLSQASE-FSSDKVLSRQTLA 767

Query: 756  ELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEEN 815
             L+  QFP ISFK+  + S E+F E +GK ++   SKCVLFS T+LG           EN
Sbjct: 768  RLVAEQFPSISFKVVGRDSEENFSEIIGKKKSS--SKCVLFSATLLGA----------EN 815

Query: 816  SLL-----ETTSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLG 870
            SL      E+ +    T  R     +V SK+ +++LLR P LSDLNSW HWD  +AP  G
Sbjct: 816  SLTSKYLEESLTVGNDTEARSTTLNAVASKEVLDVLLRVPLLSDLNSWCHWDLRYAPQFG 875

Query: 871  PLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALA 930
            PL G  LNE+N  +LLCLVTRDGK+IR D SA+ DSFLEAAL+GS+++TA +LLS  +L 
Sbjct: 876  PLMG-CLNEINSTDLLCLVTRDGKIIRADPSATADSFLEAALQGSAYRTAAQLLSLISL- 933

Query: 931  GGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQN--CRMHGNVLCGRQNFDVANIDN 988
             G  ++P  LLKC+A+ AFEV F N  E++E +N +N   +MHG              + 
Sbjct: 934  NGRTHLPFSLLKCYAKRAFEVFFYNYSEEME-LNDRNSLVQMHGP-------------EK 979

Query: 989  LSGEVQKQLL-----KFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPS 1043
            LS    K ++     K  K    AS+F LDCLGYLP EFRS   D+LL GL+S +KDAP+
Sbjct: 980  LSTSFDKVIVVGEKAKVAKRDYAASKFLLDCLGYLPGEFRSLVVDILLPGLRSVVKDAPT 1039

Query: 1044 AILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNS 1103
             +L  C QTE R+MLH+ GL LGI+EWI DYH FCS+ + +       +    A+S L+S
Sbjct: 1040 RVLSACEQTEQRIMLHDAGLLLGIVEWISDYHKFCSSCSPN------SSIVENASSNLDS 1093

Query: 1104 GSGCAEGSLFESVGADV-----------HIEECGAICDTICGEAS-DDGLGDCTTQTLPE 1151
            G+G  +  L + V               + EE    C T  G     D +G+  TQT PE
Sbjct: 1094 GAGFVQNELEDPVQTKQRCMIVSEKSCEYKEEPHESCHTFGGSGILCDSVGEAFTQTAPE 1153

Query: 1152 DKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
                ++ A +++SIRRDEFGL    S  E +ML+KQHARLGRAL CLSQELYSQDSHF+L
Sbjct: 1154 --FYDNRASVIDSIRRDEFGLDLTSSGSEMSMLQKQHARLGRALQCLSQELYSQDSHFIL 1211

Query: 1212 ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY 1271
            ELVQNADDN YPE+VEPTLTFILQ++GIVVLNNE GF  ENIRALCDVG STKKGS    
Sbjct: 1212 ELVQNADDNKYPEHVEPTLTFILQKTGIVVLNNECGFMPENIRALCDVGQSTKKGSGGYI 1271

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDP 1331
              +    FKSVFRV+DAPEIHSNGFH KFD SEGQIG++LPT+VPP +I+    +LS   
Sbjct: 1272 GKKGIG-FKSVFRVSDAPEIHSNGFHFKFDISEGQIGYILPTVVPPHDIESLSSMLSGRA 1330

Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
            + L+   WNTCI LPFR   SE   +N+I  MFSDLHPSLLLFLHRLQCI++RN+L+DSL
Sbjct: 1331 LHLKDAGWNTCITLPFRAIDSERTTVNHIEPMFSDLHPSLLLFLHRLQCIVYRNVLDDSL 1390

Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG 1451
            +V+RK++V   I+KVSCGE+ MTWFVAS+KL+A  +R DV+TTEI++  TL    +G Y 
Sbjct: 1391 LVMRKEVVSKNIVKVSCGENSMTWFVASEKLKATNLRDDVQTTEISIGFTLDMLEDGTYR 1450

Query: 1452 PLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERS 1511
              + Q+PVFAFLPLRTYGLKFI+QGDF+L SSRE+VD +SPWNQWLLSEFP LFV A RS
Sbjct: 1451 SCMIQEPVFAFLPLRTYGLKFIIQGDFILTSSREDVDEDSPWNQWLLSEFPGLFVDALRS 1510

Query: 1512 FCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAP 1571
            FC LP F +N  K  S YM  VPLVGEVHGFFS LPR I+S+LR +NCL+LEG+  +W P
Sbjct: 1511 FCSLPSFTQNLGKGVSSYMQLVPLVGEVHGFFSSLPRSIISRLRTTNCLLLEGDGEEWVP 1570

Query: 1572 PCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISS 1631
            PCKVLR WN++   LL D LLQ+HL LGFL KDIVLSDSL+RALGIE+YGPK L+QI+SS
Sbjct: 1571 PCKVLRNWNEKIRVLLKDGLLQEHLALGFLDKDIVLSDSLSRALGIEDYGPKTLVQILSS 1630

Query: 1632 LCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGT 1691
            L      L+SMG +WL+S L ELY + F SSG  +++ G++  LID+L +IPFIPLS+G 
Sbjct: 1631 LSHKNGCLQSMGFTWLSSILTELYLL-FRSSGHGNVELGIDKSLIDDLHKIPFIPLSNGK 1689

Query: 1692 FSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNL 1751
            F+S+DEG +WLH D +  D G   EAFP L   LRT+  +LL AS+VD+ S    SVD+L
Sbjct: 1690 FTSLDEGAVWLHHDTTGLDLGDVFEAFPVLYGNLRTIDHSLLLASSVDEKS----SVDDL 1745

Query: 1752 NRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVE 1811
              ML  IGVQ+LSAH+IVK HILPA    +T   D  LM DYLCFVM HL   C  C  E
Sbjct: 1746 VNMLCAIGVQKLSAHEIVKAHILPAFEARSTGAVD-GLMVDYLCFVMTHLRSGCHICLKE 1804

Query: 1812 REFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLK 1871
            R++I+SELR KA VL+N+G K+  E  IHFG+E+GN V++  L  ++DI W+ VD TYLK
Sbjct: 1805 RKYIISELRSKALVLSNYGLKQLGEGSIHFGEEYGNQVNMKKLTKNLDISWHVVDGTYLK 1864

Query: 1872 HPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAI 1931
            HPA +  +CGL +WR+FF+EIGI DFVQVVQV+K +A+         +   LLSP     
Sbjct: 1865 HPASKFYACGLKEWREFFQEIGIADFVQVVQVEKSIAEFYSVSHCEKYDINLLSPDLTVK 1924

Query: 1932 DWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSS 1991
            DWES ELV LLSLL  +  R+  K+LLE+LD LWDDCY DK    + S   G  RS +SS
Sbjct: 1925 DWESPELVDLLSLLHKSNGRKGCKYLLEVLDRLWDDCYYDKTTVNYNSGTHGIIRSSESS 1984

Query: 1992 FINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKT 2051
            F+  ICD  W +SSMD +LH  KDL+HDCD V+SILG +APY VP V S KL+ DIG KT
Sbjct: 1985 FMRVICDSLWIVSSMDSKLHLSKDLYHDCDDVQSILGMNAPYAVPTVTSVKLLSDIGFKT 2044

Query: 2052 EVTIDDILEILKVWTRLEAPF------MARYY 2077
            +V++DD LE+L+ W      F      + R+Y
Sbjct: 2045 KVSLDDALEVLESWVHCGDSFKSSISQITRFY 2076


>gi|222625995|gb|EEE60127.1| hypothetical protein OsJ_13005 [Oryza sativa Japonica Group]
          Length = 2821

 Score = 1620 bits (4196), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 881/1987 (44%), Positives = 1245/1987 (62%), Gaps = 97/1987 (4%)

Query: 138  LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
            LER   A  K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L  L + E
Sbjct: 172  LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 231

Query: 198  ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
             ++N FIHC+V  R+I S+YDLE  ICK+E I +FEEL LGPFL+ PL++HYF V +D+ 
Sbjct: 232  GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 291

Query: 258  VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
            +V K++++++I  L ++ D  K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 292  LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 350

Query: 318  IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
            +R+AR+++   +K             ++ +  + K    K+ LDERFSAI  R+K    I
Sbjct: 351  LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 410

Query: 374  NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
            N     KHI F  S+ ++ D D  +   + D S +    + I   D   RVSSCPYPS T
Sbjct: 411  N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 464

Query: 431  EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
            EE++RLGLK E + +     +   +    G +++KRK E +G  +S+  +          
Sbjct: 465  EEMERLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 514

Query: 491  ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
              P +             +   S   +  FITTWKEAC+ + + +VLE +  +Y  T  +
Sbjct: 515  --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAETPKE 572

Query: 551  KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
            K       +   FS YP IG LN+AV ++  G+ DS+YD +    + + ++ +   + E 
Sbjct: 573  KKK-----IIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTEL 627

Query: 611  ESIEIEPSEQVAV----------VNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPV 660
              +E  PS++ +           V +K     +  ++V+++++++ E +  +     S V
Sbjct: 628  MEVEPPPSKRKSKCVAKGDNDTNVGSKDPGCSVTADDVIRRITEFFESNRGVSRTDASQV 687

Query: 661  TKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDR 720
             K    LR L  CET +T +F   +F +LG+G F  FL KH   L  +L        +  
Sbjct: 688  RKST-FLRTLLDCETCITAKFSANQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTH 746

Query: 721  SSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLE 780
            SS+EVS++Q  + VL+ QA  N  E    ++   S+LL+RQFP ISF I    S E  L+
Sbjct: 747  SSVEVSVLQQQIEVLLCQAGGNWLEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLD 806

Query: 781  TVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSK 840
            ++ + R  +    ++FS ++L  R SG +  + +     T   + +  ++   S +V+S+
Sbjct: 807  SIERQRKNIEINNIMFSMSLLEKRWSGIVPGNHD-----TVDGLMN-DIQQSCSVTVSSQ 860

Query: 841  DAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDH 900
            +AI+ LL+AP LSDL +WSHWD LFAPSLG    WLLN   V++L C+VT DG+ IR+D 
Sbjct: 861  EAIKCLLKAPMLSDLLTWSHWDLLFAPSLGSFMHWLLNTGPVQDLACIVTTDGRFIRVDP 920

Query: 901  SASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDI 960
            SA+VD FLE  ++ S FQ AVKLLS   +  G  N P+ LLKC+A+ A  ++  N  + +
Sbjct: 921  SATVDQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLLKCYAQRAIGIIMDNGNDLL 980

Query: 961  EVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
                  N +  G     R         N+  ++ K +      V + ++F LDCLG+LPS
Sbjct: 981  ------NTKSEGKSFSAR---------NIWSDMSKDI---DDIVHLVAKFVLDCLGHLPS 1022

Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
            EFRS AADVLL+GL++  K+  SAILLE  +T    MLH++GLSLG+ EW  D    C T
Sbjct: 1023 EFRSLAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDIGLSLGVAEWAEDCRRLCLT 1082

Query: 1081 GTSDLLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD---------------VHIEE 1124
               D +   +    ++      SG   C   +L  +   D               V + +
Sbjct: 1083 ---DEIHANIEMHASSRHPSTASGVAICENSNLLNATDVDIMKRSKSLPGKDNQIVAVSK 1139

Query: 1125 CGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNML 1184
               + + +  +          + TL E    E+A L++E+IRR+EFGL  ++S  E+++L
Sbjct: 1140 NQNVLNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETIRREEFGLDQSLSCTENSLL 1198

Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN 1244
            KKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+VEPTL FILQ++GIVVLNN
Sbjct: 1199 KKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVEDVEPTLAFILQDNGIVVLNN 1258

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
            E GFSAENIRALCD+G+STKKGS+ GYIG KGIGFKSVFRVTDAPEIHSNGFHVKFD +E
Sbjct: 1259 ESGFSAENIRALCDIGSSTKKGSNQGYIGNKGIGFKSVFRVTDAPEIHSNGFHVKFDITE 1318

Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMF 1364
            GQIGFVLPT V P+N D   R+LS + V+  S  WNTCI LPFR+KF EG  M++I  MF
Sbjct: 1319 GQIGFVLPTAVAPYNTDSVSRMLSVEDVKDSSSFWNTCIVLPFRSKFKEGTGMHSIASMF 1378

Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRA 1424
            SDLHPSLLLFLHRL+CI F+NM+ND+L+V+R+K +G+GI+++S G D M+W V S+KL+ 
Sbjct: 1379 SDLHPSLLLFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVRISNGNDMMSWLVVSKKLQG 1438

Query: 1425 GVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSR 1484
             ++R DV +TEIA+A TLQE+  G Y P L QQPVFAFLPLR YGLKFILQGDFVLPSSR
Sbjct: 1439 TIVRNDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPLRNYGLKFILQGDFVLPSSR 1498

Query: 1485 EEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFS 1544
            EEVD ++ WNQWLLSEFP+LFVSA+ SFC LPCF+  P KA + +MSFVPLVGEVHGFF 
Sbjct: 1499 EEVDADNAWNQWLLSEFPSLFVSAQESFCALPCFQGCPGKAVTTFMSFVPLVGEVHGFFC 1558

Query: 1545 GLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKD 1604
             LP +ILSKLR++ C++LEG++++W  PC  LRGW+++   L+ D LL +HLGLG+LSKD
Sbjct: 1559 QLPHLILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRILISDSLLLEHLGLGYLSKD 1618

Query: 1605 IVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELY-TISFHSSG 1663
            I++SD+L+RALGI EYGPK+L+ IISS+CR +  + S+GL WL +W   L+ ++  HSS 
Sbjct: 1619 IIISDTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLEWLCAWFISLHLSLMHHSSK 1678

Query: 1664 QSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDC--SVFDGGFGLEAFPNL 1721
               L +  E DL+  L++IP IPLSDG+FSS+ +G IWL  D   S  D    +  FP L
Sbjct: 1679 NLPLTTSPE-DLLCALRKIPCIPLSDGSFSSIADGPIWLPYDVLNSKPDSRSSMLNFPVL 1737

Query: 1722 CAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDET 1781
             + LRT+ P LLS S  +K     +  ++L  +LLK+GV++LS HDI+K HIL ++S+ T
Sbjct: 1738 YSNLRTIKPRLLSVSCQNKYLTEEMRANDLMDILLKMGVRKLSGHDIIKNHILVSLSNST 1797

Query: 1782 TANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHF 1841
             AN    +M +Y+ F+M+HL+  C +C+ E+E I+SELR++  +LTNHG+K P + PIHF
Sbjct: 1798 EANVANTMMIEYVSFIMLHLQSPCASCNFEKEEIMSELRRRPILLTNHGYKCPYDEPIHF 1857

Query: 1842 GKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESL-SCGLVKWRKFFKEIGITDFVQV 1900
             KE+GN V +  L+ +++IKW E+D  YL +   +SL    L KWR+FF+E+G+TDFVQV
Sbjct: 1858 SKEYGNSVDLCKLLLNVEIKWIELDSCYLMNRGSDSLPPFELKKWRQFFEEMGVTDFVQV 1917

Query: 1901 VQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEI 1960
            V+V+K+++    +    +             DWES ELV +LS  ++   R+   +LLE+
Sbjct: 1918 VKVEKNISQADSSLAGRLSQGHHSGTPCIVYDWESPELVSILSTFSSKKCRENCVYLLEV 1977

Query: 1961 LDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDC 2020
            LD  WD  Y+ K      +  +G++ + +SSF+N I   +W  S+MD++LHY  DLF++ 
Sbjct: 1978 LDKFWDAHYSAKARIHADATHSGENIAVESSFMNSIRTFKWIASAMDEDLHYATDLFYNT 2037

Query: 2021 DAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVWTRLEAPF------MA 2074
            + VRSILG  APY VP+V S  L  DIG K +V+  D L ILK W   +  F      M 
Sbjct: 2038 EDVRSILGSVAPYAVPQVCSRSLGKDIGFKIKVSHSDALMILKSWIASQTSFSASMDQMC 2097

Query: 2075 RYYLLVN 2081
            ++Y  V+
Sbjct: 2098 KFYTFVS 2104


>gi|297804940|ref|XP_002870354.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316190|gb|EFH46613.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 2136

 Score = 1520 bits (3936), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 802/1537 (52%), Positives = 1038/1537 (67%), Gaps = 83/1537 (5%)

Query: 571  LLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTV 630
            +L++ VTS+KRGMWDS+YD LQ  +  +  NT SGN +     EI PSE   +  T+H +
Sbjct: 1    MLHLQVTSLKRGMWDSIYDNLQMTSLSDTTNTGSGNQVG----EINPSENSELSKTQHVM 56

Query: 631  ---QDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFK 687
                   VEE+++++S Y E D S          K I + RKL +CE  L EQF++++F+
Sbjct: 57   PPKHSNTVEEIIRRLSLYFEHDLS--------GEKHIGIFRKLQTCEVLLAEQFQVQDFE 108

Query: 688  SLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESE 747
            SLG+G FFTFLEKH  +L T+LQ+  + + ++   LEV + ++LL +L+SQAS       
Sbjct: 109  SLGWGGFFTFLEKHMLLLPTQLQRFLSRELLEEFPLEVHVNENLLTLLLSQASE-FSGGN 167

Query: 748  IITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSG 807
            +I++Q ++ LL  QFP I+FK+  + S E+F E +GK ++   SKCVLFS T     LSG
Sbjct: 168  VISRQTVNRLLAEQFPSINFKVVGRNSEENFTEIIGKEKS--CSKCVLFSAT-----LSG 220

Query: 808  DLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAP 867
              +      L E+ +    T  R     +V SK+ +++LLR P LSDLNSW HWD  +AP
Sbjct: 221  AENSLTSKYLEESLTVGNDTEARGSTLNAVASKEVLDVLLRVPLLSDLNSWCHWDLRYAP 280

Query: 868  SLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSF 927
              GPL G  LNE+N K+LLCLVTRDGK+IR D SA+ DSFLEAAL+GS++ TA +LLS  
Sbjct: 281  QFGPLLG-CLNEINSKDLLCLVTRDGKIIRTDPSATADSFLEAALQGSAYCTAAQLLSLI 339

Query: 928  ALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQN-CRMHGNVLCGRQNFDVANI 986
            +L  G  ++P  LLKC+A+ AFEV F N+ E++E+ N  +  +MH               
Sbjct: 340  SL-NGRTHLPFSLLKCYAKRAFEVFFDNHSEEMELNNRNSLVQMH-------------RP 385

Query: 987  DNLSGEVQKQLL-----KFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDA 1041
            + LS  + K ++     K  K+   AS+F LDCLGYLP EFRS   D+LL GL+S +KDA
Sbjct: 386  EQLSTSLDKMIVVGEKTKAAKSDYAASKFLLDCLGYLPGEFRSLVVDILLPGLRSVVKDA 445

Query: 1042 PSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGL 1101
            P+ +L  C QTE R+MLH+ GL LGI+EWI DYH FCS+ + +       +   +A+S L
Sbjct: 446  PTRVLSACKQTEQRIMLHDAGLLLGIVEWISDYHMFCSSCSPN------SSIVESASSNL 499

Query: 1102 NSGSGCAEGSLFESVGADVHIEECGAICDTICGEASD---------------DGLGDCTT 1146
            +S +G  +  L + +  +   + C  + +  C +  +               D +G+  T
Sbjct: 500  DSVAGSVQKELEDPIQTE---QRCMIVSEKACQDKKEPHESCHTFGDSGTLCDSVGEAFT 556

Query: 1147 QTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQD 1206
            QT PE  +  + A +++SIRRDEFGL    S  E +ML+KQHARLGRAL CLSQELYSQD
Sbjct: 557  QTAPEFHD--NPASVIDSIRRDEFGLDLTSSGSEMSMLQKQHARLGRALQCLSQELYSQD 614

Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG 1266
            SHF+LELVQNADDN YP +VEPTLTFILQ++GIVVLNNE GF  ENIRALCDVG STKKG
Sbjct: 615  SHFILELVQNADDNKYPAHVEPTLTFILQKTGIVVLNNECGFMPENIRALCDVGQSTKKG 674

Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRL 1326
            S      +    FKSVFRV+DAPEIHSNGFH KFD SEGQIG++LPT+VPP +I+    +
Sbjct: 675  SGGYIGKKGIG-FKSVFRVSDAPEIHSNGFHFKFDISEGQIGYILPTVVPPQDIESLSSM 733

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
            LS   + L+   WNTCI LPFR+  SEG  +N+I  MFSDLHPSLLLFLHRLQCI++RN+
Sbjct: 734  LSGRALHLKDAGWNTCITLPFRSIDSEGTTVNHIEPMFSDLHPSLLLFLHRLQCIVYRNV 793

Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
            L DSL+V+RK++V + I+KVSCGE+ M WFVAS+KL+A  +R DV+TTEI++  TL    
Sbjct: 794  LEDSLLVMRKEVVSNNIVKVSCGENSMIWFVASEKLKAANLRDDVQTTEISIGFTLDMLE 853

Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
            +G Y   + Q+PVFAFLPLRTYGLKFI+QGDF+L SSRE+VD +SPWNQWLLSEFP LFV
Sbjct: 854  DGTYRSCMIQEPVFAFLPLRTYGLKFIIQGDFILTSSREDVDEDSPWNQWLLSEFPGLFV 913

Query: 1507 SAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNN 1566
             A RSFC LP F  N  KA S YM  VPL GEVHGFFS LPR I+S+LR +NCL+LEG+ 
Sbjct: 914  DALRSFCSLPSFTHNLGKAVSSYMQLVPLAGEVHGFFSSLPRSIISRLRTTNCLLLEGDG 973

Query: 1567 NQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILL 1626
             +W PPCKVLR WN++   LL D LLQ+HL LGFL KDIVLSDSL+RAL IE+YGPK L+
Sbjct: 974  EEWVPPCKVLRNWNEKIRVLLKDGLLQEHLALGFLDKDIVLSDSLSRALSIEDYGPKTLV 1033

Query: 1627 QIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIP 1686
            QI+SSL      L+SMG +WL+S L ELY I F SSG  +++ G++  LID+L RIPFIP
Sbjct: 1034 QILSSLSHKNGCLQSMGFTWLSSILTELY-ILFRSSGHGNVELGIDKTLIDDLHRIPFIP 1092

Query: 1687 LSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVI 1746
            LS+G F+++DEG +WLH D +  D G   EAFP L   LRT+  +LL AS+VD+ S    
Sbjct: 1093 LSNGKFTALDEGAVWLHHDTTGLDLGDVFEAFPVLYGNLRTIDHSLLLASSVDEKS---- 1148

Query: 1747 SVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCP 1806
            SVD+L  ML  IGVQ+LSAH+I+KVHILPA    +T +  + LM DYLCFVM HL   C 
Sbjct: 1149 SVDDLVNMLCAIGVQKLSAHEIIKVHILPAFEARST-DTVEGLMVDYLCFVMTHLRSSCH 1207

Query: 1807 NCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVD 1866
             CH ER +I+SELR KA VL+N+G K+  E  IHFG+EFGN V++  L  ++DI W+ VD
Sbjct: 1208 ICHNERNYIISELRSKALVLSNYGLKQLGEGSIHFGEEFGNQVNMKKLTKNLDISWHVVD 1267

Query: 1867 ITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSP 1926
             TYLKHPA +  +CGL +WR+FF+EIGI DFVQVVQV+K +A+         +   LLSP
Sbjct: 1268 GTYLKHPASKFYACGLKEWREFFQEIGIADFVQVVQVEKTIAEFYSVSRCEKYDINLLSP 1327

Query: 1927 GSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDR 1986
                 DWES ELV+LLSLL  +  R+  K+L+E+LD LWD+CY +K    + S   G  R
Sbjct: 1328 ELTVKDWESPELVNLLSLLHKSNGRKGCKYLMEVLDRLWDECYHEKTTVNYNSGTHGVIR 1387

Query: 1987 SFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCD 2046
            S +SSF+  ICD QW +SSMD ELH  K L+HDCD V+SILG +APY VPKV S KL+ D
Sbjct: 1388 SSESSFMRVICDSQWIVSSMDSELHLAKGLYHDCDDVQSILGMNAPYAVPKVTSVKLLSD 1447

Query: 2047 IGLKTEVTIDDILEILKVWTRLEAPF------MARYY 2077
            IG KT+V +DD LEIL+ W      F      + R+Y
Sbjct: 1448 IGFKTKVCLDDALEILEAWVHCGDNFKSSISQITRFY 1484


>gi|4455304|emb|CAB36839.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268079|emb|CAB78417.1| hypothetical protein [Arabidopsis thaliana]
          Length = 2137

 Score = 1518 bits (3929), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 806/1540 (52%), Positives = 1035/1540 (67%), Gaps = 89/1540 (5%)

Query: 571  LLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTV 630
            +L++ VTS++RGMWDS+YD  Q  +  +  NT SGN +     EI P E   +  T+H +
Sbjct: 1    MLHLQVTSLRRGMWDSIYDNFQMTSLSDTTNTGSGNQVG----EINPIENSELSKTQHVM 56

Query: 631  QDIQ---VEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFK 687
                   VEE+++++S Y E D   L+ A     K I + RKL +CE  L EQF++++F+
Sbjct: 57   PPTHCNTVEEIIRRLSLYFEHD---LSGA-----KHIGIFRKLQTCENLLAEQFQVQDFE 108

Query: 688  SLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESE 747
            SLG+G FF FLEKH  +L T+LQ+  + +  +   LEV + ++LL +L+SQAS      +
Sbjct: 109  SLGWGGFFAFLEKHMLLLPTQLQRFLSRELQEEFPLEVHVNENLLTLLLSQASE-FSSDK 167

Query: 748  IITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSG 807
            ++++Q ++ L+  QFP ISFK+  + S E+F E +GK ++   SKCVLFS T+LG     
Sbjct: 168  VLSRQTLARLVAEQFPSISFKVVGRDSEENFSEIIGKKKSS--SKCVLFSATLLGA---- 221

Query: 808  DLSVHEENSLL-----ETTSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWD 862
                  ENSL      E+ +    T  R     +V SK+ +++LLR P LSDLNSW HWD
Sbjct: 222  ------ENSLTSKYLEESLTVGNDTEARSTTLNAVASKEVLDVLLRVPLLSDLNSWCHWD 275

Query: 863  FLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVK 922
              +AP  GPL G  LNE+N  +LLCLVTRDGK+IR D SA+ DSFLEAAL+GS+++TA +
Sbjct: 276  LRYAPQFGPLMG-CLNEINSTDLLCLVTRDGKIIRADPSATADSFLEAALQGSAYRTAAQ 334

Query: 923  LLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQN--CRMHGNVLCGRQN 980
            LLS  +L  G  ++P  LLKC+A+ AFEV F N  E++E +N +N   +MHG        
Sbjct: 335  LLSLISL-NGRTHLPFSLLKCYAKRAFEVFFYNYSEEME-LNDRNSLVQMHGP------- 385

Query: 981  FDVANIDNLSGEVQKQLL-----KFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQ 1035
                  + LS    K ++     K  K    AS+F LDCLGYLP EFRS   D+LL GL+
Sbjct: 386  ------EKLSTSFDKVIVVGEKAKVAKRDYAASKFLLDCLGYLPGEFRSLVVDILLPGLR 439

Query: 1036 STIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTN 1095
            S +KDAP+ +L  C QTE R+MLH+ GL LGI+EWI DYH FCS+ + +       +   
Sbjct: 440  SVVKDAPTRVLSACEQTEQRIMLHDAGLLLGIVEWISDYHKFCSSCSPN------SSIVE 493

Query: 1096 AATSGLNSGSGCAEGSLFESVGADV-----------HIEECGAICDTICGEAS-DDGLGD 1143
             A+S L+SG+G  +  L + V               + EE    C T  G     D +G+
Sbjct: 494  NASSNLDSGAGFVQNELEDPVQTKQRCMIVSEKSCEYKEEPHESCHTFGGSGILCDSVGE 553

Query: 1144 CTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELY 1203
              TQT PE  +  + A +++SIRRDEFGL    S  E +ML+KQHARLGRAL CLSQELY
Sbjct: 554  AFTQTAPEFYD--NRASVIDSIRRDEFGLDLTSSGSEMSMLQKQHARLGRALQCLSQELY 611

Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
            SQDSHF+LELVQNADDN YPE+VEPTLTFILQ++GIVVLNNE GF  ENIRALCDVG ST
Sbjct: 612  SQDSHFILELVQNADDNKYPEHVEPTLTFILQKTGIVVLNNECGFMPENIRALCDVGQST 671

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMF 1323
            KKGS      +    FKSVFRV+DAPEIHSNGFH KFD SEGQIG++LPT+VPP +I+  
Sbjct: 672  KKGSGGYIGKKGIG-FKSVFRVSDAPEIHSNGFHFKFDISEGQIGYILPTVVPPHDIESL 730

Query: 1324 CRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF 1383
              +LS   + L+   WNTCI LPFR   SE   +N+I  MFSDLHPSLLLFLHRLQCI++
Sbjct: 731  SSMLSGRALHLKDAGWNTCITLPFRAIDSERTTVNHIEPMFSDLHPSLLLFLHRLQCIVY 790

Query: 1384 RNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
            RN+L+DSL+V+RK++V   I+KVSCGE+ MTWFVAS+KL+A  +R DV+TTEI++  TL 
Sbjct: 791  RNVLDDSLLVMRKEVVSKNIVKVSCGENSMTWFVASEKLKATNLRDDVQTTEISIGFTLD 850

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
               +G Y   + Q+PVFAFLPLRTYGLKFI+QGDF+L SSRE+VD +SPWNQWLLSEFP 
Sbjct: 851  MLEDGTYRSCMIQEPVFAFLPLRTYGLKFIIQGDFILTSSREDVDEDSPWNQWLLSEFPG 910

Query: 1504 LFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILE 1563
            LFV A RSFC LP F +N  K  S YM  VPLVGEVHGFFS LPR I+S+LR +NCL+LE
Sbjct: 911  LFVDALRSFCSLPSFTQNLGKGVSSYMQLVPLVGEVHGFFSSLPRSIISRLRTTNCLLLE 970

Query: 1564 GNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPK 1623
            G+  +W PPCKVLR WN++   LL D LLQ+HL LGFL KDIVLSDSL+RALGIE+YGPK
Sbjct: 971  GDGEEWVPPCKVLRNWNEKIRVLLKDGLLQEHLALGFLDKDIVLSDSLSRALGIEDYGPK 1030

Query: 1624 ILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIP 1683
             L+QI+SSL      L+SMG +WL+S L ELY + F SSG  +++ G++  LID+L +IP
Sbjct: 1031 TLVQILSSLSHKNGCLQSMGFTWLSSILTELYLL-FRSSGHGNVELGIDKSLIDDLHKIP 1089

Query: 1684 FIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSL 1743
            FIPLS+G F+S+DEG +WLH D +  D G   EAFP L   LRT+  +LL AS+VD+ S 
Sbjct: 1090 FIPLSNGKFTSLDEGAVWLHHDTTGLDLGDVFEAFPVLYGNLRTIDHSLLLASSVDEKS- 1148

Query: 1744 GVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEY 1803
               SVD+L  ML  IGVQ+LSAH+IVK HILPA    +T   D  LM DYLCFVM HL  
Sbjct: 1149 ---SVDDLVNMLCAIGVQKLSAHEIVKAHILPAFEARSTGAVD-GLMVDYLCFVMTHLRS 1204

Query: 1804 YCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWY 1863
             C  C  ER++I+SELR KA VL+N+G K+  E  IHFG+E+GN V++  L  ++DI W+
Sbjct: 1205 GCHICLKERKYIISELRSKALVLSNYGLKQLGEGSIHFGEEYGNQVNMKKLTKNLDISWH 1264

Query: 1864 EVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKEL 1923
             VD TYLKHPA +  +CGL +WR+FF+EIGI DFVQVVQV+K +A+         +   L
Sbjct: 1265 VVDGTYLKHPASKFYACGLKEWREFFQEIGIADFVQVVQVEKSIAEFYSVSHCEKYDINL 1324

Query: 1924 LSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTG 1983
            LSP     DWES ELV LLSLL  +  R+  K+LLE+LD LWDDCY DK    + S   G
Sbjct: 1325 LSPDLTVKDWESPELVDLLSLLHKSNGRKGCKYLLEVLDRLWDDCYYDKTTVNYNSGTHG 1384

Query: 1984 DDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKL 2043
              RS +SSF+  ICD  W +SSMD +LH  KDL+HDCD V+SILG +APY VP V S KL
Sbjct: 1385 IIRSSESSFMRVICDSLWIVSSMDSKLHLSKDLYHDCDDVQSILGMNAPYAVPTVTSVKL 1444

Query: 2044 VCDIGLKTEVTIDDILEILKVWTRLEAPF------MARYY 2077
            + DIG KT+V++DD LE+L+ W      F      + R+Y
Sbjct: 1445 LSDIGFKTKVSLDDALEVLESWVHCGDSFKSSISQITRFY 1484


>gi|414873448|tpg|DAA52005.1| TPA: hypothetical protein ZEAMMB73_895652 [Zea mays]
          Length = 2676

 Score = 1474 bits (3817), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 833/2010 (41%), Positives = 1185/2010 (58%), Gaps = 195/2010 (9%)

Query: 138  LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
            L+R   A  KAR EL+  G+ V++WKV+++VL+ L+ DSW SLG Q Q+VP L  L + E
Sbjct: 118  LDRAQEAARKAREELVKGGDGVTAWKVAQAVLVALKADSWDSLGVQPQDVPLLRDLFLIE 177

Query: 198  ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
             ++NAFIHC+V  R+I ++YDLE  ICK+EG+ +FEEL LGPFL+ PL++HYFLV +D+ 
Sbjct: 178  GKVNAFIHCYVAARKIVTVYDLEVEICKNEGVVQFEELGLGPFLQHPLVAHYFLVPADLS 237

Query: 258  VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
            VV K+       C  E  +T           L F+ K +                     
Sbjct: 238  VVPKL-------CSEEIINT----------LLKFVDKSK--------------------- 259

Query: 318  IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDF 377
                   + +T++ FL     +H   K+ +   EK   L  R  ++ + +  F+++   F
Sbjct: 260  -------KKITIEDFL-----NHLAEKKSVSGKEK---LGVRVQSLGKAIPFFTNLGR-F 303

Query: 378  GAKHIRFVSSSSEDE-----------DSDDCTYECSNDISSNVQLPSQIKGSDRVSSCPY 426
               HI  +  + + E            S + + E   D+  N    +  K  D+     +
Sbjct: 304  CRLHISLLRQAKQTEVSAAKLRANMSGSGNSSQE--KDLLKNASFHTHKKALDK----RF 357

Query: 427  PSVTEELKRL-GLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRN 485
             S+T  +K L G+   I+   T        DD   S       ++ G  +      L + 
Sbjct: 358  ISLTNRIKELPGINKHIHFDST--------DDETNSCTGSEDGKSDGNENENECSILDKK 409

Query: 486  KAKQRALPIESGDQTQD-------DELNEADISFSNESMRMFITTWKEACKNNTMSEVLE 538
               +R        +T++        E+N+   S  N   R           +    +VLE
Sbjct: 410  DGDKRVNSCPYPSKTEELERLGLKSEINKRQ-SLENSKPR----------DSGKKGKVLE 458

Query: 539  KMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQE 598
             +  +Y  T   K     +R++  FS YP IGLLN+AV S+  G+ DS+YD +Q   + +
Sbjct: 459  MLANYYGRTISGK-----RRIRNFFSQYPGIGLLNVAVKSMGCGLLDSIYDVIQLADEND 513

Query: 599  LNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQS 658
            + ++   N+   E +EIEP               + +++V++++S+Y+E ++ I  +   
Sbjct: 514  VASSPPPNTTT-EVMEIEPPS-------------VAIDDVIRRISEYIESNSKISGD--- 556

Query: 659  PVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTI 718
                + + +R L  CETW+T QF   +F +LG+G F  FL+KH     T L   F +   
Sbjct: 557  ----VALQVRALNDCETWVTTQFSANQFSALGHGTFLEFLDKHCHQFRTALSSFFKEGPS 612

Query: 719  DRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKI-EDK--GSM 775
            + SSLEVS++Q  +  L+ QA +N  E    ++     LL++QFP ISF I +DK  G +
Sbjct: 613  NSSSLEVSVLQQQIEFLLCQAESNWLEDSDFSEDSFVMLLKKQFPTISFNIVQDKSDGGV 672

Query: 776  ESFLETVGKYRNEVMSKCVLFSETILGTRLSGDL-----SVHEENSLLETTSAITHTGLR 830
              F+E   K   ++ +  + FS ++L  R SG L     +V E  +++   S   HT   
Sbjct: 673  SGFIEGQKK---DIRTYSLKFSISLLEKRWSGTLPSRHGNVDELGNIVAEQSCY-HT--- 725

Query: 831  PKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVT 890
                 +V S++AI  LLRAP LSDL+ WSHWD LFAP+LG    WLLN   ++EL C+ T
Sbjct: 726  -----TVCSREAINCLLRAPMLSDLHLWSHWDSLFAPTLGSFVHWLLNTGPIQELACIAT 780

Query: 891  RDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFE 950
             DG+ IR+D SA+VD FLEA +  S  Q AVKLLS   +  G  N P+ LLKC+A+ A +
Sbjct: 781  TDGRFIRVDSSATVDQFLEAIIHRSPLQVAVKLLSLLYIYNGSMNTPMSLLKCYAQRAIK 840

Query: 951  VMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQ-------------L 997
            ++  NN  D+    S+N     +     QN    +    + + Q+              L
Sbjct: 841  LIVDNN-NDLMNAKSENKIFMPD---EPQNLSSESSTCFADQCQESSQASPGRLIRSDSL 896

Query: 998  LKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLM 1057
                  V + ++F LDCL +LPSEFRS AAD+LL+G +   K+  + +L E  +     M
Sbjct: 897  PNINNTVHLITKFVLDCLDHLPSEFRSLAADILLAGFRVVTKNWHAVMLHEATENGQLCM 956

Query: 1058 LHEVGLSLGILEWIHDYHAFCST----------GTSDLLMPCVVTCTNAATSGLNSGSGC 1107
            LH++GLSLG++EW+ D H  C T           ++ L  P  V  T+   S ++  S  
Sbjct: 957  LHDIGLSLGVVEWVEDCHRLCLTDEVHVQTEMHSSAKLATPASVGATHE-DSNMHISSDV 1015

Query: 1108 ----AEGSLFESVGADVHIE-ECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIV 1162
                    LF  +     I+ E   + +    EA    L   +  ++ E+   E+A+L++
Sbjct: 1016 NMMDKRKQLFPVINDRAGIDKEDNKMLNPAGTEADVAELHTTSRSSMMEETSLEEASLVI 1075

Query: 1163 ESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY 1222
            E+IRR+EFGL   +S+ E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y
Sbjct: 1076 ETIRREEFGLDQALSDTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTY 1135

Query: 1223 PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV 1282
            PE+VE TL FILQE GI VLNNE+GFSAENIRALCD+GNSTKKGS+ GYIG KGIGFKSV
Sbjct: 1136 PEDVEATLAFILQEDGIAVLNNERGFSAENIRALCDIGNSTKKGSNQGYIGNKGIGFKSV 1195

Query: 1283 FRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTC 1342
            FRVTDAPEIHSNGFHVKFD ++GQIGFVLPT VPP++   F R+LS +  +     WNTC
Sbjct: 1196 FRVTDAPEIHSNGFHVKFDITDGQIGFVLPTAVPPYSTTSFSRMLSVEDDKDAHSLWNTC 1255

Query: 1343 IRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG 1402
            I LPFR+KF E           +DLHPSLLLFLHRL+CI F+N+ +D+L+++R++I+GDG
Sbjct: 1256 ILLPFRSKFREC----------TDLHPSLLLFLHRLKCIKFKNLFDDTLLIMRREILGDG 1305

Query: 1403 IIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
            I+++S G + M+W V S++L+  ++R DV TTEIA+A TLQ++ EG+Y P L QQPVFAF
Sbjct: 1306 IVRISHGIETMSWLVVSKRLQGTIVRHDVCTTEIAVAFTLQQTEEGDYEPYLKQQPVFAF 1365

Query: 1463 LPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENP 1522
            LPLR YGLKFILQGDFVLPSSREEVD +S WNQWLLSEFP+LFVSA+ SFC LPCF+  P
Sbjct: 1366 LPLRNYGLKFILQGDFVLPSSREEVDADSAWNQWLLSEFPSLFVSAQESFCALPCFQRCP 1425

Query: 1523 AKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDR 1582
             KA + ++SF+PL GEVHGFFS LP +ILSKLR++ C+ L+G+  QW  PC  LRGW+++
Sbjct: 1426 GKAVTAFLSFIPLAGEVHGFFSQLPHLILSKLRLTRCMFLDGSTVQWVYPCNTLRGWDEQ 1485

Query: 1583 AHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSM 1642
               L  + LL +HLGLG+LSK+IV+SD L+RALGI +YGP ILL  ISS+CR +  + S+
Sbjct: 1486 TKMLFSEGLLHEHLGLGYLSKNIVISDKLSRALGIHDYGPNILLDAISSICRIDGCIESL 1545

Query: 1643 GLSWLASWLNELY-TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIW 1701
            GL WL +W   LY T+  HSS   S    +E  L+D +++IP IPLSDG+F+S+ +G IW
Sbjct: 1546 GLEWLCAWFVNLYLTLLSHSSQNVSSARNLEDILLDKVRKIPCIPLSDGSFTSISDGRIW 1605

Query: 1702 LHSD-CSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGV 1760
            L  D  S       +  FP L   LRTVSP LLSA   +K  +  + +++L  ML +IGV
Sbjct: 1606 LPYDVASSIPECSSIPNFPALYGNLRTVSPNLLSACCKNKYLMEEVRINDLADMLQRIGV 1665

Query: 1761 QQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELR 1820
            ++LS HDIVK HI+ ++ +   AN    ++ +Y+ F+M+HL+  C +C+ ER  IVSELR
Sbjct: 1666 RKLSGHDIVKNHIMVSLRNGLDANVADIVIREYVSFIMVHLQSSCTSCNFERGEIVSELR 1725

Query: 1821 KKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSC 1880
            K    LTNHG+K P++ PIHF K++GN V V  L+ +++I W E+D +YL+H   +S S 
Sbjct: 1726 KSPIFLTNHGYKSPADEPIHFSKDYGNSVDVTRLLQNVEISWIELDSSYLQHHGSQSSSF 1785

Query: 1881 GLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVH 1940
               KWR+FF+E+G+TDFVQVV+V+K ++ +       +   ++ +      DWES EL  
Sbjct: 1786 AREKWRQFFEEMGVTDFVQVVKVEKTLSQVDSILAGGLSLADVSTKQCTVYDWESPELSR 1845

Query: 1941 LLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQ 2000
            +LS+ ++   ++   +LLE+LD  WDD Y+ K   F  +   G++R+ +SSF+ CI   +
Sbjct: 1846 ILSIFSSKRCKENCVYLLEVLDRFWDDHYSAKSRIFTDATHCGENRAVESSFMKCIQSFK 1905

Query: 2001 WTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILE 2060
            W  S MD++LHY  DLF+DC+ VRS+ G  APY VP+V S  L   IG KTEV+  D L 
Sbjct: 1906 WIASRMDEDLHYATDLFYDCENVRSLFGSVAPYAVPQVSSSSLKKAIGFKTEVSYCDALM 1965

Query: 2061 ILKVWTRLEAPF------MARYYLLVNHDV 2084
            +LK W   + PF      M ++Y  ++  V
Sbjct: 1966 VLKSWITSKVPFRASMSQMCKFYTFLSEGV 1995


>gi|218193937|gb|EEC76364.1| hypothetical protein OsI_13954 [Oryza sativa Indica Group]
          Length = 1909

 Score = 1369 bits (3543), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 777/1842 (42%), Positives = 1108/1842 (60%), Gaps = 150/1842 (8%)

Query: 138  LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
            LER   A  K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L  L + E
Sbjct: 179  LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238

Query: 198  ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
             ++N FIHC+V  R+I S+YDLE  ICK+E I +FEEL LGPFL+ PL++HYF V +D+ 
Sbjct: 239  GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298

Query: 258  VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
            +V K++++++I  L ++ D  K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 299  LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 357

Query: 318  IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
            +R+AR+++   +K             ++ +  + K    K+ LDERFSAI  R+K    I
Sbjct: 358  LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 417

Query: 374  NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
            N     KHI F  S+ ++ D D  +   + D S +    + I   D   RVSSCPYPS T
Sbjct: 418  N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 471

Query: 431  EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
            EE++RLGLK E++ +     +   +    G +++KRK E +G  +S+  +          
Sbjct: 472  EEMERLGLKSEMSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 521

Query: 491  ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
              P +             +   S   +  FITTWKEAC+ + + +VLE +  +Y  T   
Sbjct: 522  --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAET--- 576

Query: 551  KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
               +  K++   FS YP IG LN+AV ++  G+ DS+YD +    + + ++ +   + E 
Sbjct: 577  --PKEKKKIIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTEL 634

Query: 611  ESIEIEPSEQVAV----------VNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPV 660
              +E  PS++ +           V +K     +  ++V+++++++ E +  +     S V
Sbjct: 635  MEVEPPPSKRKSKCVAKGDNDTNVGSKDPGCSVTADDVIRRITEFFESNRGVSRTDASQV 694

Query: 661  TKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDR 720
             K    LR L  CET +T +F   +F +LG+G F  FL KH   L  +L        +  
Sbjct: 695  RK-STFLRTLLDCETCITAKFSANQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTH 753

Query: 721  SSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLE 780
            SS+EVS++Q  + VL+ QA  N  E    ++   S+LL+RQFP ISF I    S E  L+
Sbjct: 754  SSVEVSVLQQQIEVLLCQAGGNWLEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLD 813

Query: 781  TVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSK 840
            ++ + R  +    ++FS ++L  R SG +  +      +T   + +  ++   S +V+S+
Sbjct: 814  SIERQRKNIEINNIMFSMSLLEKRWSGIVPGNH-----DTVDGLMND-IQQSCSVTVSSQ 867

Query: 841  DAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDH 900
            +AI+ LL+AP LSDL +WSHWD LFAPSLG    WLLN   V++L C+VT DG+ IR+D 
Sbjct: 868  EAIKCLLKAPMLSDLLTWSHWDLLFAPSLGSFMHWLLNTGPVQDLACIVTTDGRFIRVDP 927

Query: 901  SASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDI 960
            SA+VD FLE  ++ S FQ AVKLLS   +  G  N P+ LLKC+A+ A  ++  N  + +
Sbjct: 928  SATVDQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLLKCYAQRAIGIIMDNGNDLL 987

Query: 961  EVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPS 1020
                  N +  G     R         N+  ++ K +      V + ++F LDCLG+LPS
Sbjct: 988  ------NTKSEGKSFSAR---------NIWSDMSKDI---DDIVHLVAKFVLDCLGHLPS 1029

Query: 1021 EFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST 1080
            EFRS AADVLL+GL++  K+  SAILLE  +T    MLH++GLSLG+ EW  D    C T
Sbjct: 1030 EFRSLAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDIGLSLGVAEWAEDCRRLCLT 1089

Query: 1081 GTSDLLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD---------------VHIEE 1124
               D +   +    ++      SG   C   +L  +   D               V + +
Sbjct: 1090 ---DEIHANIEMHASSRHPSTASGVAICENSNLLNATDVDIMKRSKSLPGKDNQIVAVSK 1146

Query: 1125 CGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNML 1184
               + + +  +          + TL E    E+A L++E+IRR+EFGL  ++S  E+++L
Sbjct: 1147 NQNVLNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETIRREEFGLDQSLSCTENSLL 1205

Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN 1244
            KKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+VEPTL FILQ++GIVVLNN
Sbjct: 1206 KKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVEDVEPTLAFILQDNGIVVLNN 1265

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
            E GFSAENIRALCD+GNSTKKGS+ GYIG KGIGFKSVFRVTDAPEIHSNGFHVKFD +E
Sbjct: 1266 ESGFSAENIRALCDIGNSTKKGSNQGYIGNKGIGFKSVFRVTDAPEIHSNGFHVKFDITE 1325

Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMF 1364
            GQIGFVLPT V P+N D          +Q   +    CI + FR+               
Sbjct: 1326 GQIGFVLPTAVAPYNTDSI-------KIQGGYRHALHCIHV-FRSP-------------- 1363

Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRA 1424
                P    FLHRL+CI F+NM+ND+L+V+R+K +G+GI+++S G D M+W V S+KL+ 
Sbjct: 1364 ----PISAPFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVRISNGNDMMSWLVVSKKLQG 1419

Query: 1425 GVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSR 1484
             ++R DV +TEIA+A TLQE+  G Y P L QQPVFAFLPLR YGLKFILQGDFVLPSSR
Sbjct: 1420 TIVRNDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPLRNYGLKFILQGDFVLPSSR 1479

Query: 1485 EEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFS 1544
            EE                              C    P KA + +MSFVPLVGEVHGFF 
Sbjct: 1480 EE-----------------------------GC----PGKAVTTFMSFVPLVGEVHGFFC 1506

Query: 1545 GLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKD 1604
             LP +ILSKLR++ C++LEG++++W  PC  LRGW+++   L+ D LL +HLGLG+LSKD
Sbjct: 1507 QLPHLILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRILISDSLLLEHLGLGYLSKD 1566

Query: 1605 IVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELY-TISFHSSG 1663
            I++SD+L+RALGI EYGPK+L+ IISS+CR +  + S+GL WL +W   L+ ++  HSS 
Sbjct: 1567 IIISDTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLEWLCAWFISLHLSLMHHSSK 1626

Query: 1664 QSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDC--SVFDGGFGLEAFPNL 1721
               L +  E DL+  L++IP IPLSDG+FSS+ +G IWL  D   S  D    +  FP L
Sbjct: 1627 NLPLTTSPE-DLLCALRKIPCIPLSDGSFSSIADGPIWLPYDVLNSKPDSRSSMLNFPVL 1685

Query: 1722 CAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDET 1781
             + LRT+ P LLS S  +K     +  ++L  +LLK+GV++LS HDI+K HIL ++S+ T
Sbjct: 1686 YSNLRTIKPRLLSVSCQNKYLTEEMRANDLMDILLKMGVRKLSGHDIIKNHILVSLSNST 1745

Query: 1782 TANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHF 1841
             AN    +M +Y+ F+M+HL+  C +C+ E+E I+SELR++  +LTNHG+K P + PIHF
Sbjct: 1746 EANVANTMMIEYVSFIMLHLQSPCASCNFEKEEIMSELRRRPILLTNHGYKCPYDEPIHF 1805

Query: 1842 GKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESL-SCGLVKWRKFFKEIGITDFVQV 1900
             KE+GN V +  L+ +++IKW E+D  YL +   +SL    L KWR+FF+E+G+TDFVQV
Sbjct: 1806 SKEYGNSVDLCKLLLNVEIKWIELDSCYLMNRGSDSLPPFELKKWRQFFEEMGVTDFVQV 1865

Query: 1901 VQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHLL 1942
            V+V+K+++    +    +             DWES EL  +L
Sbjct: 1866 VKVEKNISQADSSLAGRLSQGHHSGTPCIVYDWESPELFEVL 1907


>gi|242032627|ref|XP_002463708.1| hypothetical protein SORBIDRAFT_01g004640 [Sorghum bicolor]
 gi|241917562|gb|EER90706.1| hypothetical protein SORBIDRAFT_01g004640 [Sorghum bicolor]
          Length = 2610

 Score = 1350 bits (3493), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 709/1591 (44%), Positives = 1006/1591 (63%), Gaps = 93/1591 (5%)

Query: 535  EVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAI 594
            +VLE +  +Y  T  +K     K +    S YP IGLLN+AV S+  G+ DS+YD +Q  
Sbjct: 435  KVLEMLANYYGRTPTEK-----KGIINFCSQYPGIGLLNVAVKSMGCGLLDSIYDVIQLA 489

Query: 595  TQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILN 654
            ++  ++++   N+   E +EIEP               + +++V++++++Y+E ++ +  
Sbjct: 490  SENNVSSSPLPNTTT-EVMEIEPPS-------------VAIDDVIRRITEYIESNSKVSG 535

Query: 655  NAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFT 714
            +       + + +R L  CETW+T QF   +F +LG+G F  FL+KH     T L     
Sbjct: 536  D-------VTMQVRALNDCETWVTTQFSANQFSALGHGTFLEFLDKHCHQFRTALSSFLK 588

Query: 715  DDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGS 774
            + T + SSLEVS++Q  +  L+ QA +N  E    ++     LL+RQFP ISF I    S
Sbjct: 589  EGTCNPSSLEVSVLQQQIEFLLYQAESNWLEDGGFSEDSFVMLLKRQFPTISFNIVQDKS 648

Query: 775  MESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMS 834
             E     +   + ++ +  + FS ++L  R SG L     N + E   ++     +    
Sbjct: 649  GEGVPGFIEGQKKDIQTNSLKFSISLLEKRWSGTLPSRHGN-VDELGKSVAE---QSYYC 704

Query: 835  ESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGK 894
             +V S++AI  LLRAP LSDL  WSHWD LFAP+LG    WL+N   ++EL C+ T DG+
Sbjct: 705  GTVCSREAINCLLRAPMLSDLLLWSHWDSLFAPTLGSFVHWLMNTGPIQELACIATTDGR 764

Query: 895  VIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFK 954
             IR+D SA+VD FLEA ++ S FQ AVKLLS   +  G  + P+ LLKC+A+ A +++  
Sbjct: 765  FIRVDSSATVDQFLEAIIQRSPFQVAVKLLSLLYVYNGSTSTPMSLLKCYAQRAIKLIVD 824

Query: 955  NNVEDIEVINS---------QNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVP 1005
            NN + +   +          QN     +     Q  + + + +      + L  F   V 
Sbjct: 825  NNNDLMNAKSENKIFMPDEPQNLISESSTCFADQCQESSQVSSARLIRSESLPNFDNTVH 884

Query: 1006 VASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSL 1065
            + ++F LDCLG+LP EFRS AAD+LL+G +   K+  + +L E  +     MLH++GLSL
Sbjct: 885  LIAKFVLDCLGHLPLEFRSLAADILLAGFRVVTKNWHAVMLHEATENGQLCMLHDIGLSL 944

Query: 1066 GILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFES----VGADVH 1121
            G++EW+ D    C T              +  T   +S    +EG+  E+    + +DV+
Sbjct: 945  GVVEWVEDCRRLCLTEE-----------VHVQTEMHSSAKLASEGATHENSNGHISSDVN 993

Query: 1122 I-----EECGAICDTI--------------CGEASDDGLGDCTTQTLPEDKECEDAALIV 1162
            +     +    I D +               G  +D      +  ++ E+   E+A+L++
Sbjct: 994  MMGERRQLFPGINDRVGMDNEDNNNKMLNPAGTEADIAELHTSRSSMMEETNLEEASLVI 1053

Query: 1163 ESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY 1222
            E+IRR+EFGL   +S+ E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y
Sbjct: 1054 ETIRREEFGLDQALSDTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTY 1113

Query: 1223 PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV 1282
            P++VEPTL F+LQE+GI VLNNE+GFSAENIRALCD+GNSTKKGS+ GYIG KGIGFKSV
Sbjct: 1114 PKDVEPTLAFVLQENGIAVLNNERGFSAENIRALCDIGNSTKKGSNQGYIGNKGIGFKSV 1173

Query: 1283 FRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTC 1342
            FRVTDAPEIHSNGFHVKFD ++GQIGFVLPT VPP++   F R+L+ +  +     WNTC
Sbjct: 1174 FRVTDAPEIHSNGFHVKFDITDGQIGFVLPTAVPPYSTTSFSRMLAVEDDKDAHPLWNTC 1233

Query: 1343 IRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG 1402
            I LPFR+KF EG          +DLHPSLLLFLHRL+CI F+N+ +D+L+++R++++GDG
Sbjct: 1234 ILLPFRSKFREG----------TDLHPSLLLFLHRLKCIKFKNLFDDTLLIMRREVLGDG 1283

Query: 1403 IIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
            I+++S G + M+W V S++L+  ++R DV TTEIA+A TLQ++ +G+Y P L QQPVFAF
Sbjct: 1284 IVRISHGIETMSWLVVSKRLQGTIVRHDVCTTEIAVAFTLQQTEKGDYEPYLKQQPVFAF 1343

Query: 1463 LPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENP 1522
            LPLR YGLKFI+QGDFVLPSSREEVD +S WNQWLLSEFP+LFV A+ SFC LPCF   P
Sbjct: 1344 LPLRNYGLKFIIQGDFVLPSSREEVDADSAWNQWLLSEFPSLFVCAQESFCALPCFERCP 1403

Query: 1523 AKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDR 1582
             KA + ++SF+PL GEVHGFFS LP +ILSKLR++ C+ L+G+  QW  PC  LRGW+++
Sbjct: 1404 GKAVTAFLSFIPLAGEVHGFFSQLPHLILSKLRVTCCMFLDGSTVQWVYPCNTLRGWDEQ 1463

Query: 1583 AHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSM 1642
               LL + LL +HLGLG+LSKDI++SD+L+RALGI +YGP ILL  ISS+CR +  + S+
Sbjct: 1464 TKMLLSEGLLHEHLGLGYLSKDIIISDNLSRALGIHDYGPNILLDTISSICRIDGCIESL 1523

Query: 1643 GLSWLASWLNELY-TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIW 1701
            GL WL +W   LY T+  HS   SS +S +E  L+D +++IP IPLSDG+FSSV +G IW
Sbjct: 1524 GLEWLCAWFVNLYLTLLSHSRNVSSARS-LEDALLDKVRKIPCIPLSDGSFSSVSDGPIW 1582

Query: 1702 LHSDC--SVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIG 1759
            L  D   S+ +    +E FP L   LRTVSP LLSA   +K  +  + +D+L  ML KIG
Sbjct: 1583 LPYDVVSSIPECRSSIENFPVLYGNLRTVSPNLLSACCKNKYLMEEVRIDDLADMLQKIG 1642

Query: 1760 VQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSEL 1819
            V++LS HDI+K HI+ ++ +   AN    ++ +Y+ F+M+HL+  C +C+ E+E IVSEL
Sbjct: 1643 VRKLSGHDIIKNHIMVSLRNGLDANVTDRMIREYVSFIMVHLQSSCTSCNFEKEGIVSEL 1702

Query: 1820 RKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLS 1879
            RK    LTNHG+K P++ PIHF K++GN V V+ L+ +++I W E+D +YL H   ES S
Sbjct: 1703 RKSPIFLTNHGYKSPADEPIHFSKDYGNSVDVSRLLQNVEISWIELDSSYLIHHGSESSS 1762

Query: 1880 CGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELV 1939
                KWR+FF+E+G+TDFVQVV+V+K ++ +       +   ++ +      DWES EL 
Sbjct: 1763 FEREKWRRFFEEMGVTDFVQVVKVEKSLSQVDSLLAGGLSLADVSAKPCTVYDWESPELS 1822

Query: 1940 HLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDI 1999
             +LS+ ++   R+   +LLE+LD  WDD Y+ K M    +   G++R+ +SSF+ CI   
Sbjct: 1823 RILSIFSSKRCRENCVYLLEVLDRFWDDYYSPKSMILTDATHCGENRAVESSFMKCIQSF 1882

Query: 2000 QWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDIL 2059
            +W  S MD++LHYP DLF+D + VRS+ G  APY VP+V S  L   IG KTEV+  D L
Sbjct: 1883 KWIASRMDEDLHYPTDLFYDSENVRSLFGSVAPYAVPQVSSSSLKKAIGFKTEVSYCDAL 1942

Query: 2060 EILKVWTRLEAPF------MARYYLLVNHDV 2084
             +LK W   + PF      M ++Y  ++  V
Sbjct: 1943 MVLKYWITSKVPFRASMSQMCKFYTFLSEGV 1973



 Score =  241 bits (616), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 144/352 (40%), Positives = 213/352 (60%), Gaps = 39/352 (11%)

Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
           L+R   A  KAR EL+  GE V++WKV+++VL+ L+ DSW SLG Q Q+VP L  L + E
Sbjct: 103 LDRAQEAARKAREELVKGGEGVTAWKVAQAVLVALKADSWDSLGVQPQDVPLLRDLFLIE 162

Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
            ++NAFIHC+V  R+I ++YDLE  ICK+EG+ +FEEL LGPFL+ PL++HYFLV  D+ 
Sbjct: 163 GKVNAFIHCYVAARKIVTVYDLEVEICKNEGVVQFEELGLGPFLQHPLVAHYFLVPDDLS 222

Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVR-----IQNL- 311
           VV K+++ +VI  L ++ D  K K I + +FL+ +A+K+S++ KE+LGVR       NL 
Sbjct: 223 VVPKLSSEEVINVLLKFVDKSKKK-ITIVDFLNHLAEKKSVSGKEKLGVRKCHSTSTNLR 281

Query: 312 ---GMHINFIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAIC 364
               +HI+ I++A++++    K         +  R++ +  +      K+ LD+RF ++ 
Sbjct: 282 RFCRLHISLIQQAKQTETSAAKLLGNMSGSGNSSREKDLLKNASFRTHKKALDKRFISLT 341

Query: 365 ERVKSFSSINEDFGAKHIRFVS--------SSSEDEDSDDCTYECSNDISSNVQLPSQIK 416
            R+K    IN     KHI F S        +SSED  SDD      N+  +   +  +  
Sbjct: 342 NRIKQLPGIN-----KHIHFDSTDDETNGDTSSEDGKSDD------NENKNEYSVLDKKD 390

Query: 417 GSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKI 468
           G  RV+SCPYPS TEEL+RLGLK EIN +  S  NS  ++     S+KK K+
Sbjct: 391 GDKRVNSCPYPSKTEELERLGLKSEINKR-PSLENSKPKE-----SRKKGKV 436


>gi|357115001|ref|XP_003559282.1| PREDICTED: uncharacterized protein LOC100827209 [Brachypodium
            distachyon]
          Length = 2540

 Score = 1285 bits (3325), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 734/1673 (43%), Positives = 1001/1673 (59%), Gaps = 160/1673 (9%)

Query: 46   FFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLTQNPN----------------- 88
            + PNP   L         QNP F++  PNL +QN        +                 
Sbjct: 51   YAPNPHQYL---------QNP-FNLMLPNLLLQNQAAFAAAYHHQQQQQQQQQQQQQQQQ 100

Query: 89   -LPPQQPPSSACNQQRSQTQH-PPGASNQQRPLTQQPQSLPPQPQKL-----NKEFLERI 141
              PP   P        S  QH PP  + QQ      P + P +PQ        +  LER 
Sbjct: 101  YQPPALVPPGYTQNPTSNIQHRPPNPAPQQPRAASPPPAPPQRPQTQPPTRNQQAVLERA 160

Query: 142  DRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARIN 201
              A  KAR EL++AGE V+ WKV+++ LL L+VDSW SLG Q+ +VP L  L + E ++N
Sbjct: 161  QVAARKARDELVLAGEGVTGWKVAQAALLALKVDSWGSLGVQLHDVPILRDLFLVEGKVN 220

Query: 202  AFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFK 261
             FIHC+V  R+I S+ DLE  ICKSEG+ +FEEL LGPFL+ PLI+HYFLV SD+  V K
Sbjct: 221  TFIHCYVAARKIVSICDLEVEICKSEGVRQFEELGLGPFLQHPLIAHYFLVPSDLSTVPK 280

Query: 262  IATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREA 321
            +++ ++I CL +Y    K K + V+ FLD++A+++S++ KE+LG R +N    +  + + 
Sbjct: 281  LSSEEIINCLQKYIGDSKDK-VTVENFLDYLAEQKSVSGKEKLGAR-RNEVAAVKHLGKT 338

Query: 322  RKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKH 381
              S D T  K L +    H            K+ L++RF AI  R+K    INE     H
Sbjct: 339  SGSGDSTCGKDLPKQTDFHL----------GKQVLNKRFDAITSRIKQLPGINE-----H 383

Query: 382  IRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVTEELKRLGL 438
            I FVSS  E  D D  +   + D S N    S +   D   RVSSCPYPS +EE++RLGL
Sbjct: 384  IHFVSSDDEI-DGDTSSEGNAVDESENKNTCSILDSKDVDKRVSSCPYPSKSEEMERLGL 442

Query: 439  KGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSS---APAKFLRRNKAKQRALPIE 495
            K E++ + T+  +     D    S +KRK E +G  SS    P K  +    K+ A P  
Sbjct: 443  KSELSKKPTTESSMVKVTDKNVYSGEKRKYEGTGTPSSLCKQPKKRRKVGMHKKEASP-- 500

Query: 496  SGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARY 555
                         +   S   +  FITTWKEAC+ + + +VLE +  +Y  T  +K    
Sbjct: 501  -------------NCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYGETTHEK---- 543

Query: 556  FKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEI 615
             +++   FS YP IG LN+AV S+  G+ DS+YD +   ++ +L+++   N+   E +EI
Sbjct: 544  -RKIIQFFSQYPGIGFLNVAVRSMGCGLLDSIYDAIHVFSENKLSSSPIPNTTT-EIMEI 601

Query: 616  E-PSEQVAVVNTKHTV----------QDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKII 664
            E PS + A    K              ++  ++V+++++++ E   S ++ A +   +  
Sbjct: 602  EPPSRETAECIAKGAYDSNERNYEPGHNVTTDDVIRRITEHFE-SKSGVSKAGALQVERS 660

Query: 665  ILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLE 724
            I LR L  CETW+T QF  ++F SLG+G F  FLEK+ S    +L            SLE
Sbjct: 661  IFLRTLHDCETWVTTQFSARQFSSLGHGTFLEFLEKYGSHFPPKLSSFLKGGDSGSVSLE 720

Query: 725  VSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGK 784
            VS+ Q  +  L+ QA  N  E    +      LL++QFP +SF I    S E    +  +
Sbjct: 721  VSVPQQQIEALLCQAEGNWLEDGDFSGDSFLMLLKKQFPTMSFDITQYKSGEGLDGSTER 780

Query: 785  YRNEVMSKCVLFSETILGTRLSGDLSVHEENSL-LETTSAITHTGLRPKMSESVTSKDAI 843
             R  + +  + FS ++L  R SG LS  E +++  +  +A+  +      SE+V+S++AI
Sbjct: 781  QRKCIQTNNISFSISLLEKRWSG-LSPGEHDTVGGQRNNAVEQS----YYSETVSSREAI 835

Query: 844  EILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSAS 903
              LL+AP LSDL  WSHWD LFAPSLG    WLL+   +++L C+VT D + IR+D SA+
Sbjct: 836  NCLLKAPMLSDLLLWSHWDLLFAPSLGSFIRWLLSTGPIQQLACIVTTDCRFIRVDPSAT 895

Query: 904  VDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVI 963
            VD FLEA ++ S FQ AVKLLS   +  G +N P+ LLKC+A+ A  ++  NN +     
Sbjct: 896  VDQFLEAIIQCSPFQVAVKLLSLIHIYNGSRNTPISLLKCYAQRAIGIIMNNNNDS---- 951

Query: 964  NSQNCRMHGNVLCGRQNFD--VANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSE 1021
                    G     R      + +IDN               V   ++F LDCLG LPSE
Sbjct: 952  -------EGKSFSARNAMSGILKSIDN--------------TVHFVAKFVLDCLGQLPSE 990

Query: 1022 FRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCST- 1080
            FRS AAD+LLSGL++  K++ SAIL E  +T    MLH++GLSLGI EW+ DY  FC T 
Sbjct: 991  FRSLAADILLSGLRTVTKNSYSAILHEATETWQLCMLHDIGLSLGIAEWVEDYRGFCLTE 1050

Query: 1081 ------------------------GTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESV 1116
                                      S++L+P  V   N ++  L        G   + V
Sbjct: 1051 EDDAKAELHSSSGHASAASEGPTLENSNVLIPHDVDMMNDSSKSL-------PGKKDQVV 1103

Query: 1117 GADVHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNI 1176
              D   +    + + +  +A    L    +  + E    E+AAL++E+IRRDEFGL  ++
Sbjct: 1104 ATDNKNQN---MLNPVEAKAETTELHTTKSPMMGE-MNLEEAALVIETIRRDEFGLDQSL 1159

Query: 1177 SNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQE 1236
            S  E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+VEPTL F+LQE
Sbjct: 1160 SCTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYHEDVEPTLVFVLQE 1219

Query: 1237 SGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
            +GIV+LNNE GFSAENIRALCD+GNSTKKG+++GYIG KGIGFKSVFRVTDAPEIHSNGF
Sbjct: 1220 NGIVILNNEMGFSAENIRALCDIGNSTKKGANSGYIGNKGIGFKSVFRVTDAPEIHSNGF 1279

Query: 1297 HVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA 1356
            HVKFD ++GQIGFVLPT VPP++     R+LS +  +     WNTCI LPFR+KF +G  
Sbjct: 1280 HVKFDITKGQIGFVLPTAVPPYSASSLSRMLSAEDDKGACSLWNTCILLPFRSKFRDGTG 1339

Query: 1357 MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWF 1416
            M +I  MFSDLHPSLLLFLHRL CI F+N+LND+L+++R+K +GDGI+++S G + M+W 
Sbjct: 1340 MCSIASMFSDLHPSLLLFLHRLNCIKFKNVLNDTLLIMRRKALGDGIVRISHGNEIMSWL 1399

Query: 1417 VASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
            V S+KL+  ++R DV+TTEIALA TLQE+  G Y P L QQPVFAFLPLR YGLKFILQG
Sbjct: 1400 VLSKKLQGTLVRHDVRTTEIALAFTLQETENGEYEPYLKQQPVFAFLPLRNYGLKFILQG 1459

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV 1536
            DFVLPSSREEVD ++ WNQWLLSEFP+LFVSA+ SFC L CF+  P KA + +MSFVPL 
Sbjct: 1460 DFVLPSSREEVDADNAWNQWLLSEFPSLFVSAQESFCSLSCFQRCPGKAVTAFMSFVPLA 1519

Query: 1537 GEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHL 1596
            GEVHGFF  LP +I+SKLR++ C++LEG+++QW  PC VL+GW+++   L  D LL  HL
Sbjct: 1520 GEVHGFFCKLPHLIISKLRLTRCMVLEGSSSQWVYPCNVLKGWDEQTRMLFSDGLLHDHL 1579

Query: 1597 GLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLAS 1649
            GLG+LSKDI++SD+L+RALGI  YGP +L+ ++SS+CRT+  + S+G+ WL +
Sbjct: 1580 GLGYLSKDIIISDTLSRALGIHVYGPNVLIDVLSSICRTDGCIESLGMEWLCA 1632



 Score =  167 bits (422), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 84/206 (40%), Positives = 122/206 (59%), Gaps = 6/206 (2%)

Query: 1882 LVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHL 1941
            L KWR+FF E+G+TDFVQVV+V+K++  +       +   ++    S   DWES EL  +
Sbjct: 1669 LKKWREFFAEMGVTDFVQVVKVEKNIPQVDSLIAGKLSQGDISGTPSTVYDWESPELTSI 1728

Query: 1942 LSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQW 2001
            LS  ++   R+   +LLE+LD+ WDD Y+ K          G++R+ +SSF+ CI   +W
Sbjct: 1729 LSTFSSRKCRENCIYLLEVLDSFWDDYYSAKAWCLTNVTHCGENRTVESSFMKCIQSFKW 1788

Query: 2002 TISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEI 2061
              SS+D +LHY  DLF+D + VRS+LG  APY VP+V S  L  DIG KT V+  D L +
Sbjct: 1789 IASSVDYDLHYATDLFYDFENVRSLLGSVAPYAVPQVSSRSLRKDIGFKTNVSHSDALMV 1848

Query: 2062 LKVWTRLEAPF------MARYYLLVN 2081
            LK+W   + PF      M ++Y  V+
Sbjct: 1849 LKLWIASQVPFNASVHQMCKFYTFVS 1874


>gi|29150358|gb|AAO72367.1| unknow protein [Oryza sativa Japonica Group]
          Length = 2592

 Score = 1275 bits (3300), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 693/1541 (44%), Positives = 962/1541 (62%), Gaps = 93/1541 (6%)

Query: 138  LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
            LER   A  K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L  L + E
Sbjct: 179  LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238

Query: 198  ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
             ++N FIHC+V  R+I S+YDLE  ICK+E I +FEEL LGPFL+ PL++HYF V +D+ 
Sbjct: 239  GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298

Query: 258  VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
            +V K++++++I  L ++ D  K K I V+ FLD++A+++S++ KE LG R   +   +  
Sbjct: 299  LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGARRTEVSA-VKV 356

Query: 318  IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDF 377
                  S D + +K L + +  H            K+ LDERFSAI  R+K    IN   
Sbjct: 357  QGNTSGSGDGSCEKDLVKNRKFHL----------SKQALDERFSAITSRIKKLPGIN--- 403

Query: 378  GAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVTEELK 434
              KHI F  S+ ++ D D  +   + D S +    + I   D   RVSSCPYPS TEE++
Sbjct: 404  --KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKTEEME 460

Query: 435  RLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPI 494
            RLGLK E + +     +   +    G +++KRK E +G  +S+  +            P 
Sbjct: 461  RLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR------------PK 508

Query: 495  ESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAAR 554
            +             +   S   +  FITTWKEAC+ + + +VLE +  +Y  T  +K   
Sbjct: 509  KKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAETPKEKKK- 567

Query: 555  YFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIE 614
                +   FS YP IG LN+AV ++  G+ DS+YD +    + + ++ +   + E   +E
Sbjct: 568  ----IIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTELMEVE 623

Query: 615  IEPSEQVAV----------VNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKII 664
              PS++ +           V +K     +  ++V+++++++ E +  +     S V K  
Sbjct: 624  PPPSKRKSKCVAKGDNDTNVGSKDPGCSVTADDVIRRITEFFESNRGVSRTDASQVRKST 683

Query: 665  ILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLE 724
              LR L  CET +T +F   +F +LG+G F  FL KH   L  +L        +  SS+E
Sbjct: 684  -FLRTLLDCETCITAKFSANQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTHSSVE 742

Query: 725  VSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGK 784
            VS++Q  + VL+ QA  N  E    ++   S+LL+RQFP ISF I    S E  L+++ +
Sbjct: 743  VSVLQQQIEVLLCQAGGNWLEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLDSIER 802

Query: 785  YRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRPKMSESVTSKDAIE 844
             R  +    ++FS ++L  R SG +  + +     T   + +  ++   S +V+S++AI+
Sbjct: 803  QRKNIEINNIMFSMSLLEKRWSGIVPGNHD-----TVDGLMN-DIQQSCSVTVSSQEAIK 856

Query: 845  ILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASV 904
             LL+AP LSDL +WSHWD LFAPSLG    WLLN   V++L C+VT DG+ IR+D SA+V
Sbjct: 857  CLLKAPMLSDLLTWSHWDLLFAPSLGSFMHWLLNTGPVQDLACIVTTDGRFIRVDPSATV 916

Query: 905  DSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVIN 964
            D FLE  ++ S FQ AVKLLS   +  G  N P+ LLKC+A+ A  ++  N  + +    
Sbjct: 917  DQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLLKCYAQRAIGIIMDNGNDLL---- 972

Query: 965  SQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRS 1024
              N +  G     R         N+  ++ K +      V + ++F LDCLG+LPSEFRS
Sbjct: 973  --NTKSEGKSFSAR---------NIWSDMSKDI---DDIVHLVAKFVLDCLGHLPSEFRS 1018

Query: 1025 FAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSD 1084
             AADVLL+GL++  K+  SAILLE  +T    MLH++GLSLG+ EW  D    C T   D
Sbjct: 1019 LAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDIGLSLGVAEWAEDCRRLCLT---D 1075

Query: 1085 LLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD---------------VHIEECGAI 1128
             +   +    ++      SG   C   +L  +   D               V + +   +
Sbjct: 1076 EIHANIEMHASSRHPSTASGVAICENSNLLNATDVDIMKRSKSLPGKDNQIVAVSKNQNV 1135

Query: 1129 CDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQH 1188
             + +  +          + TL E    E+A L++E+IRR+EFGL  ++S  E+++LKKQH
Sbjct: 1136 LNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETIRREEFGLDQSLSCTENSLLKKQH 1194

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGF 1248
            ARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+VEPTL FILQ++GIVVLNNE GF
Sbjct: 1195 ARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVEDVEPTLAFILQDNGIVVLNNESGF 1254

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
            SAENIRALCD+G+STKKGS+ GYIG KGIGFKSVFRVTDAPEIHSNGFHVKFD +EGQIG
Sbjct: 1255 SAENIRALCDIGSSTKKGSNQGYIGNKGIGFKSVFRVTDAPEIHSNGFHVKFDITEGQIG 1314

Query: 1309 FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLH 1368
            FVLPT V P+N D   R+LS + V+  S  WNTCI LPFR+KF EG  M++I  MFSDLH
Sbjct: 1315 FVLPTAVAPYNTDSVSRMLSVEDVKDSSSFWNTCIVLPFRSKFKEGTGMHSIASMFSDLH 1374

Query: 1369 PSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIR 1428
            PSLLLFLHRL+CI F+NM+ND+L+V+R+K +G+GI+++S G D M+W V S+KL+  ++R
Sbjct: 1375 PSLLLFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVRISNGNDMMSWLVVSKKLQGTIVR 1434

Query: 1429 PDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVD 1488
             DV +TEIA+A TLQE+  G Y P L QQPVFAFLPLR YGLKFILQGDFVLPSSREEVD
Sbjct: 1435 NDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPLRNYGLKFILQGDFVLPSSREEVD 1494

Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPR 1548
             ++ WNQWLLSEFP+LFVSA+ SFC LPCF+  P KA + +MSFVPLVGEVHGFF  LP 
Sbjct: 1495 ADNAWNQWLLSEFPSLFVSAQESFCALPCFQGCPGKAVTTFMSFVPLVGEVHGFFCQLPH 1554

Query: 1549 MILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLS 1608
            +ILSKLR++ C++LEG++++W  PC  LRGW+++   L+ D LL +HLGLG+LSKDI++S
Sbjct: 1555 LILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRILISDSLLLEHLGLGYLSKDIIIS 1614

Query: 1609 DSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLAS 1649
            D+L+RALGI EYGPK+L+ IISS+CR +  + S+GL WL +
Sbjct: 1615 DTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLEWLCA 1655



 Score =  129 bits (325), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 27/206 (13%)

Query: 1882 LVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHL 1941
            L KWR+FF+E+G+TDFVQVV+V+K+++    +    +             DWES ELV  
Sbjct: 1691 LKKWRQFFEEMGVTDFVQVVKVEKNISQADSSLAGRLSQGHHSGTPCIVYDWESPELV-- 1748

Query: 1942 LSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQW 2001
                               LD  WD  Y+ K      +  +G++ + +SSF+N I   +W
Sbjct: 1749 -------------------LDKFWDAHYSAKARIHADATHSGENIAVESSFMNSIRTFKW 1789

Query: 2002 TISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEI 2061
              S+MD++LHY  DLF++ + VRSILG  APY VP+V S  L  DIG K +V+  D L I
Sbjct: 1790 IASAMDEDLHYATDLFYNTEDVRSILGSVAPYAVPQVCSRSLGKDIGFKIKVSHSDALMI 1849

Query: 2062 LKVWTRLEAPF------MARYYLLVN 2081
            LK W   +  F      M ++Y  V+
Sbjct: 1850 LKSWIASQTSFSASMDQMCKFYTFVS 1875


>gi|359477903|ref|XP_002270714.2| PREDICTED: uncharacterized protein LOC100258878 [Vitis vinifera]
          Length = 1578

 Score = 1150 bits (2974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 549/847 (64%), Positives = 658/847 (77%), Gaps = 50/847 (5%)

Query: 1284 RVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCI 1343
            +VTDAPEIHSNGFHVKFD SEGQIGFVLPT++PP N+D+F RL S D  Q ++  WNTCI
Sbjct: 50   KVTDAPEIHSNGFHVKFDISEGQIGFVLPTVIPPCNVDLFRRLASSDTDQEDTDSWNTCI 109

Query: 1344 RLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGI 1403
             LPFR K S+G  M+NI+ MFSDLHPSLLLFLH L+CI F+NMLNDSL+++RK+IVGDGI
Sbjct: 110  VLPFRMKLSKGTGMSNIISMFSDLHPSLLLFLHHLRCIKFKNMLNDSLIIMRKEIVGDGI 169

Query: 1404 IKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFL 1463
            IKVS G +KMTWFV SQKLRA VIRPDV+TTEIA+A TLQES+ G Y P   QQPVFAFL
Sbjct: 170  IKVSHGREKMTWFVISQKLRADVIRPDVQTTEIAIAFTLQESDNGEYSPHFEQQPVFAFL 229

Query: 1464 PLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPA 1523
            PLRTYGLKFILQGDFVLPSSREEVDG+SPWNQWLLSEFP LFV+AERSFC LPCFRENP 
Sbjct: 230  PLRTYGLKFILQGDFVLPSSREEVDGDSPWNQWLLSEFPGLFVTAERSFCALPCFRENPG 289

Query: 1524 KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRA 1583
            KA + YMSFVPLVGEVHGFFS LPRMI+SKLRMSNCL+LEG+NN+W PPCKVLR WN++A
Sbjct: 290  KAVAAYMSFVPLVGEVHGFFSSLPRMIISKLRMSNCLLLEGDNNEWVPPCKVLRSWNEQA 349

Query: 1584 HSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMG 1643
             SLLPD LL KHLGLGFL K+I LSD LARALGI+EYGPKILLQIISSLC TE+GL+SMG
Sbjct: 350  RSLLPDSLLCKHLGLGFLDKNIHLSDPLARALGIQEYGPKILLQIISSLCHTEDGLKSMG 409

Query: 1644 LSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLH 1703
            L+WL+SWLN LYT+  H SGQSSL S +E+DLI +L++IPFIPLSDG + S+DEGTIWLH
Sbjct: 410  LAWLSSWLNALYTMPLHYSGQSSLNSNMESDLIYDLKKIPFIPLSDGNYGSLDEGTIWLH 469

Query: 1704 SD--CSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQ 1761
            SD   +  DG  GL AFPNL AKLR V+PALLSA++VD   + +   +N+ RMLL+IGVQ
Sbjct: 470  SDSLSTELDGEHGLGAFPNLYAKLRIVNPALLSAASVDIPCMDMTLAENVTRMLLRIGVQ 529

Query: 1762 QLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRK 1821
            QLSAH+IV+VHILPA+SDE   N +KNLM +YL FVM+HL+  C NC VERE+I+SE+  
Sbjct: 530  QLSAHEIVQVHILPAMSDEGITNREKNLMIEYLSFVMVHLQSSCTNCRVEREYIISEICN 589

Query: 1822 KAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCG 1881
            KAF+LTNHG+KRP E+PIHF KEFGN + VN  I+  ++ W+ VDI YLKHP  ESLSCG
Sbjct: 590  KAFILTNHGYKRPVEVPIHFSKEFGNTIDVNRFINATNMTWHVVDIAYLKHPITESLSCG 649

Query: 1882 LVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKN-MWTKELLSPGSAAIDWESNELVH 1940
            L+KWR FF+ +G+TDFVQ+VQV+K+V+DISH   KN MW ++L+S G+ A DWES ELV 
Sbjct: 650  LMKWRGFFQALGVTDFVQIVQVEKNVSDISHMILKNEMWDRDLISHGTIAKDWESPELVQ 709

Query: 1941 LLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQ 2000
            LLS+L+   +++  K+LL++LDTLWDDC++DK+ G+   K +GD + F+SS +  ICD Q
Sbjct: 710  LLSILSKTGDQESCKNLLDVLDTLWDDCFSDKVSGYCNFKSSGDRKPFKSSLMTSICDFQ 769

Query: 2001 WTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPK----------------------- 2037
            W  SSMDDELHYPKDLF+D D V  +LG SAPY +PK                       
Sbjct: 770  WIASSMDDELHYPKDLFYDSDEVHLVLGSSAPYALPKGTDIVLLVKNNKMKLHLEQGAVV 829

Query: 2038 ------------------VKSEKLVCDIGLKTEVTIDDILEILKVWTRLEAPF------M 2073
                              V+S KL CDIG KT+VT+DDIL IL+ W R E PF      M
Sbjct: 830  GAFTLFLVQRISDISAEMVRSGKLACDIGFKTKVTLDDILGILQEWRRSETPFKASIAQM 889

Query: 2074 ARYYLLV 2080
            +++Y  +
Sbjct: 890  SKFYTFI 896


>gi|108711621|gb|ABF99416.1| expressed protein [Oryza sativa Japonica Group]
          Length = 2655

 Score =  956 bits (2472), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/935 (52%), Positives = 643/935 (68%), Gaps = 42/935 (4%)

Query: 882  VKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLL 941
            V++L C+VT DG+ IR+D SA+VD FLE  ++ S FQ AVKLLS   +  G  N P+ LL
Sbjct: 822  VQDLACIVTTDGRFIRVDPSATVDQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLL 881

Query: 942  KCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFG 1001
            KC+A+ A  ++  N  + +      N +  G     R         N+  ++ K +    
Sbjct: 882  KCYAQRAIGIIMDNGNDLL------NTKSEGKSFSAR---------NIWSDMSKDI---D 923

Query: 1002 KAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEV 1061
              V + ++F LDCLG+LPSEFRS AADVLL+GL++  K+  SAILLE  +T    MLH++
Sbjct: 924  DIVHLVAKFVLDCLGHLPSEFRSLAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDI 983

Query: 1062 GLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD- 1119
            GLSLG+ EW  D    C T   D +   +    ++      SG   C   +L  +   D 
Sbjct: 984  GLSLGVAEWAEDCRRLCLT---DEIHANIEMHASSRHPSTASGVAICENSNLLNATDVDI 1040

Query: 1120 --------------VHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESI 1165
                          V + +   + + +  +          + TL E    E+A L++E+I
Sbjct: 1041 MKRSKSLPGKDNQIVAVSKNQNVLNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETI 1099

Query: 1166 RRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN 1225
            RR+EFGL  ++S  E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+
Sbjct: 1100 RREEFGLDQSLSCTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVED 1159

Query: 1226 VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRV 1285
            VEPTL FILQ++GIVVLNNE GFSAENIRALCD+G+STKKGS+ GYIG KGIGFKSVFRV
Sbjct: 1160 VEPTLAFILQDNGIVVLNNESGFSAENIRALCDIGSSTKKGSNQGYIGNKGIGFKSVFRV 1219

Query: 1286 TDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRL 1345
            TDAPEIHSNGFHVKFD +EGQIGFVLPT V P+N D   R+LS + V+  S  WNTCI L
Sbjct: 1220 TDAPEIHSNGFHVKFDITEGQIGFVLPTAVAPYNTDSVSRMLSVEDVKDSSSFWNTCIVL 1279

Query: 1346 PFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIK 1405
            PFR+KF EG  M++I  MFSDLHPSLLLFLHRL+CI F+NM+ND+L+V+R+K +G+GI++
Sbjct: 1280 PFRSKFKEGTGMHSIASMFSDLHPSLLLFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVR 1339

Query: 1406 VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
            +S G D M+W V S+KL+  ++R DV +TEIA+A TLQE+  G Y P L QQPVFAFLPL
Sbjct: 1340 ISNGNDMMSWLVVSKKLQGTIVRNDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPL 1399

Query: 1466 RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKA 1525
            R YGLKFILQGDFVLPSSREEVD ++ WNQWLLSEFP+LFVSA+ SFC LPCF+  P KA
Sbjct: 1400 RNYGLKFILQGDFVLPSSREEVDADNAWNQWLLSEFPSLFVSAQESFCALPCFQGCPGKA 1459

Query: 1526 ASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHS 1585
             + +MSFVPLVGEVHGFF  LP +ILSKLR++ C++LEG++++W  PC  LRGW+++   
Sbjct: 1460 VTTFMSFVPLVGEVHGFFCQLPHLILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRI 1519

Query: 1586 LLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLS 1645
            L+ D LL +HLGLG+LSKDI++SD+L+RALGI EYGPK+L+ IISS+CR +  + S+GL 
Sbjct: 1520 LISDSLLLEHLGLGYLSKDIIISDTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLE 1579

Query: 1646 WLASWLNELY-TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHS 1704
            WL +W   L+ ++  HSS    L +  E DL+  L++IP IPLSDG+FSS+ +G IWL  
Sbjct: 1580 WLCAWFISLHLSLMHHSSKNLPLTTSPE-DLLCALRKIPCIPLSDGSFSSIADGPIWLPY 1638

Query: 1705 DC--SVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQ 1762
            D   S  D    +  FP L + LRT+ P LLS S  +K     +  ++L  +LLK+GV++
Sbjct: 1639 DVLNSKPDSRSSMLNFPVLYSNLRTIKPRLLSVSCQNKYLTEEMRANDLMDILLKMGVRK 1698

Query: 1763 LSAHDIVKVHILPAISDETTANGDKNLMADYLCFV 1797
            LS HDI+K HIL ++S+ T AN    +M +Y+  V
Sbjct: 1699 LSGHDIIKNHILVSLSNSTEANVANTMMIEYVVKV 1733



 Score =  395 bits (1014), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 238/677 (35%), Positives = 373/677 (55%), Gaps = 46/677 (6%)

Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
           LER   A  K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L  L + E
Sbjct: 179 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238

Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
            ++N FIHC+V  R+I S+YDLE  ICK+E I +FEEL LGPFL+ PL++HYF V +D+ 
Sbjct: 239 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298

Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
           +V K++++++I  L ++ D  K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 299 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 357

Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
           +R+AR+++   +K             ++ +  + K    K+ LDERFSAI  R+K    I
Sbjct: 358 LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 417

Query: 374 NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
           N     KHI F  S+ ++ D D  +   + D S +    + I   D   RVSSCPYPS T
Sbjct: 418 N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 471

Query: 431 EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
           EE++RLGLK E + +     +   +    G +++KRK E +G  +S+  +          
Sbjct: 472 EEMERLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 521

Query: 491 ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
             P +             +   S   +  FITTWKEAC+ + + +VLE +  +Y  T  +
Sbjct: 522 --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAETPKE 579

Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
           K       +   FS YP IG LN+AV ++  G+ DS+YD +    + + ++ +   + E 
Sbjct: 580 KKK-----IIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTEL 634

Query: 611 ESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKL 670
             +E  PS              +  ++V+++++++ E +  +     S V K    LR L
Sbjct: 635 MEVEPPPS--------------VTADDVIRRITEFFESNRGVSRTDASQVRK-STFLRTL 679

Query: 671 CSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQH 730
             CET +T +F   +F +LG+G F  FL KH   L  +L        +  SS+EVS++Q 
Sbjct: 680 LDCETCITAKFSANQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTHSSVEVSVLQQ 739

Query: 731 LLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVM 790
            + VL+ QA  N  E    ++   S+LL+RQFP ISF I    S E  L+++ + R  + 
Sbjct: 740 QIEVLLCQAGGNWLEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLDSIERQRKNIE 799

Query: 791 SKCVLFSETILGTRLSG 807
              ++FS ++L  R SG
Sbjct: 800 INNIMFSMSLLEKRWSG 816



 Score =  120 bits (301), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 92/156 (58%), Gaps = 6/156 (3%)

Query: 1932 DWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSS 1991
            DWES ELV +LS  ++   R+   +LLE+LD  WD  Y+ K      +  +G++ + +SS
Sbjct: 1762 DWESPELVSILSTFSSKKCRENCVYLLEVLDKFWDAHYSAKARIHADATHSGENIAVESS 1821

Query: 1992 FINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKT 2051
            F+N I   +W  S+MD++LHY  DLF++ + VRSILG  APY VP+V S  L  DIG K 
Sbjct: 1822 FMNSIRTFKWIASAMDEDLHYATDLFYNTEDVRSILGSVAPYAVPQVCSRSLGKDIGFKI 1881

Query: 2052 EVTIDDILEILKVWTRLEAPF------MARYYLLVN 2081
            +V+  D L ILK W   +  F      M ++Y  V+
Sbjct: 1882 KVSHSDALMILKSWIASQTSFSASMDQMCKFYTFVS 1917


>gi|108711620|gb|ABF99415.1| expressed protein [Oryza sativa Japonica Group]
          Length = 2702

 Score =  956 bits (2470), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/935 (52%), Positives = 643/935 (68%), Gaps = 42/935 (4%)

Query: 882  VKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLL 941
            V++L C+VT DG+ IR+D SA+VD FLE  ++ S FQ AVKLLS   +  G  N P+ LL
Sbjct: 869  VQDLACIVTTDGRFIRVDPSATVDQFLEGIIQCSPFQVAVKLLSLLHVYNGSTNTPISLL 928

Query: 942  KCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFG 1001
            KC+A+ A  ++  N  + +      N +  G     R         N+  ++ K +    
Sbjct: 929  KCYAQRAIGIIMDNGNDLL------NTKSEGKSFSAR---------NIWSDMSKDI---D 970

Query: 1002 KAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEV 1061
              V + ++F LDCLG+LPSEFRS AADVLL+GL++  K+  SAILLE  +T    MLH++
Sbjct: 971  DIVHLVAKFVLDCLGHLPSEFRSLAADVLLAGLRTITKNCYSAILLEATETGQLCMLHDI 1030

Query: 1062 GLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSG-CAEGSLFESVGAD- 1119
            GLSLG+ EW  D    C T   D +   +    ++      SG   C   +L  +   D 
Sbjct: 1031 GLSLGVAEWAEDCRRLCLT---DEIHANIEMHASSRHPSTASGVAICENSNLLNATDVDI 1087

Query: 1120 --------------VHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESI 1165
                          V + +   + + +  +          + TL E    E+A L++E+I
Sbjct: 1088 MKRSKSLPGKDNQIVAVSKNQNVLNIVTAKLDTAEFITNKSPTLGEVNP-EEATLVIETI 1146

Query: 1166 RRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN 1225
            RR+EFGL  ++S  E+++LKKQHARLGRALHCLSQELYSQDSH LLELVQNADDN Y E+
Sbjct: 1147 RREEFGLDQSLSCTENSLLKKQHARLGRALHCLSQELYSQDSHLLLELVQNADDNTYVED 1206

Query: 1226 VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRV 1285
            VEPTL FILQ++GIVVLNNE GFSAENIRALCD+G+STKKGS+ GYIG KGIGFKSVFRV
Sbjct: 1207 VEPTLAFILQDNGIVVLNNESGFSAENIRALCDIGSSTKKGSNQGYIGNKGIGFKSVFRV 1266

Query: 1286 TDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRL 1345
            TDAPEIHSNGFHVKFD +EGQIGFVLPT V P+N D   R+LS + V+  S  WNTCI L
Sbjct: 1267 TDAPEIHSNGFHVKFDITEGQIGFVLPTAVAPYNTDSVSRMLSVEDVKDSSSFWNTCIVL 1326

Query: 1346 PFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIK 1405
            PFR+KF EG  M++I  MFSDLHPSLLLFLHRL+CI F+NM+ND+L+V+R+K +G+GI++
Sbjct: 1327 PFRSKFKEGTGMHSIASMFSDLHPSLLLFLHRLKCIKFKNMMNDTLLVMRRKALGNGIVR 1386

Query: 1406 VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
            +S G D M+W V S+KL+  ++R DV +TEIA+A TLQE+  G Y P L QQPVFAFLPL
Sbjct: 1387 ISNGNDMMSWLVVSKKLQGTIVRNDVCSTEIAVAFTLQETQMGEYEPYLKQQPVFAFLPL 1446

Query: 1466 RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKA 1525
            R YGLKFILQGDFVLPSSREEVD ++ WNQWLLSEFP+LFVSA+ SFC LPCF+  P KA
Sbjct: 1447 RNYGLKFILQGDFVLPSSREEVDADNAWNQWLLSEFPSLFVSAQESFCALPCFQGCPGKA 1506

Query: 1526 ASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHS 1585
             + +MSFVPLVGEVHGFF  LP +ILSKLR++ C++LEG++++W  PC  LRGW+++   
Sbjct: 1507 VTTFMSFVPLVGEVHGFFCQLPHLILSKLRLTRCMVLEGSSSRWVYPCNTLRGWDEQTRI 1566

Query: 1586 LLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLS 1645
            L+ D LL +HLGLG+LSKDI++SD+L+RALGI EYGPK+L+ IISS+CR +  + S+GL 
Sbjct: 1567 LISDSLLLEHLGLGYLSKDIIISDTLSRALGIHEYGPKVLIDIISSICRVDGCIESLGLE 1626

Query: 1646 WLASWLNELY-TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHS 1704
            WL +W   L+ ++  HSS    L +  E DL+  L++IP IPLSDG+FSS+ +G IWL  
Sbjct: 1627 WLCAWFISLHLSLMHHSSKNLPLTTSPE-DLLCALRKIPCIPLSDGSFSSIADGPIWLPY 1685

Query: 1705 DC--SVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQ 1762
            D   S  D    +  FP L + LRT+ P LLS S  +K     +  ++L  +LLK+GV++
Sbjct: 1686 DVLNSKPDSRSSMLNFPVLYSNLRTIKPRLLSVSCQNKYLTEEMRANDLMDILLKMGVRK 1745

Query: 1763 LSAHDIVKVHILPAISDETTANGDKNLMADYLCFV 1797
            LS HDI+K HIL ++S+ T AN    +M +Y+  V
Sbjct: 1746 LSGHDIIKNHILVSLSNSTEANVANTMMIEYVVKV 1780



 Score =  369 bits (947), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 239/724 (33%), Positives = 373/724 (51%), Gaps = 93/724 (12%)

Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
           LER   A  K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L  L + E
Sbjct: 179 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238

Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
            ++N FIHC+V  R+I S+YDLE  ICK+E I +FEEL LGPFL+ PL++HYF V +D+ 
Sbjct: 239 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298

Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
           +V K++++++I  L ++ D  K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 299 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 357

Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
           +R+AR+++   +K             ++ +  + K    K+ LDERFSAI  R+K    I
Sbjct: 358 LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 417

Query: 374 NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
           N     KHI F  S+ ++ D D  +   + D S +    + I   D   RVSSCPYPS T
Sbjct: 418 N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 471

Query: 431 EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
           EE++RLGLK E + +     +   +    G +++KRK E +G  +S+  +          
Sbjct: 472 EEMERLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 521

Query: 491 ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTM--SEVLEKMFQFYKPTD 548
             P +             +   S   +  FITTWKEAC+ + +   EVLE +  +Y  T 
Sbjct: 522 --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQKEVLELLANYYAETP 579

Query: 549 PKKAARYFKRMKLMFSSYPCIGLLNIA--------------------------------- 575
            +K       +   FS YP IG LN+A                                 
Sbjct: 580 KEKKK-----IIKFFSEYPGIGFLNVALDYFIWAWNGENIIQILAASMTSQILHAVTHVF 634

Query: 576 ------------VTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAV 623
                       V ++  G+ DS+YD +    + + ++ +   + E   +E  PS     
Sbjct: 635 SLEIFALPFLVEVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTELMEVEPPPS----- 689

Query: 624 VNTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRI 683
                    +  ++V+++++++ E +  +     S V K    LR L  CET +T +F  
Sbjct: 690 ---------VTADDVIRRITEFFESNRGVSRTDASQVRK-STFLRTLLDCETCITAKFSA 739

Query: 684 KEFKSLGYGEFFTFLEKHASMLSTELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNL 743
            +F +LG+G F  FL KH   L  +L        +  SS+EVS++Q  + VL+ QA  N 
Sbjct: 740 NQFSALGHGTFLEFLGKHEQHLPPKLSSFLKVGKLTHSSVEVSVLQQQIEVLLCQAGGNW 799

Query: 744 WESEIITKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGT 803
            E    ++   S+LL+RQFP ISF I    S E  L+++ + R  +    ++FS ++L  
Sbjct: 800 LEDGEFSEDSFSKLLKRQFPTISFDIVQDKSGEGLLDSIERQRKNIEINNIMFSMSLLEK 859

Query: 804 RLSG 807
           R SG
Sbjct: 860 RWSG 863



 Score =  120 bits (301), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 92/156 (58%), Gaps = 6/156 (3%)

Query: 1932 DWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSS 1991
            DWES ELV +LS  ++   R+   +LLE+LD  WD  Y+ K      +  +G++ + +SS
Sbjct: 1809 DWESPELVSILSTFSSKKCRENCVYLLEVLDKFWDAHYSAKARIHADATHSGENIAVESS 1868

Query: 1992 FINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKT 2051
            F+N I   +W  S+MD++LHY  DLF++ + VRSILG  APY VP+V S  L  DIG K 
Sbjct: 1869 FMNSIRTFKWIASAMDEDLHYATDLFYNTEDVRSILGSVAPYAVPQVCSRSLGKDIGFKI 1928

Query: 2052 EVTIDDILEILKVWTRLEAPF------MARYYLLVN 2081
            +V+  D L ILK W   +  F      M ++Y  V+
Sbjct: 1929 KVSHSDALMILKSWIASQTSFSASMDQMCKFYTFVS 1964


>gi|298205162|emb|CBI17221.3| unnamed protein product [Vitis vinifera]
          Length = 2388

 Score =  953 bits (2463), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 498/797 (62%), Positives = 573/797 (71%), Gaps = 114/797 (14%)

Query: 876  LLNEVNVKE--LLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGE 933
            LL    +K+  LLCLVT+DGKV+RIDHSA++DSFLEA+L+GSSF+TAV+LLS F+L GG+
Sbjct: 832  LLGTYTIKDSSLLCLVTKDGKVMRIDHSATMDSFLEASLQGSSFRTAVQLLSLFSLFGGK 891

Query: 934  KNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEV 993
            ++VP  LLKCHAR AFEV+ +N+VE++EV                               
Sbjct: 892  RHVPFSLLKCHARQAFEVILRNSVENMEV------------------------------- 920

Query: 994  QKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTE 1053
                    + VPVASRF LDCLGYLPSEFRSFAAD+LLSGLQ    + PSAIL EC+Q +
Sbjct: 921  -------NETVPVASRFLLDCLGYLPSEFRSFAADILLSGLQPFTINGPSAILDECNQMD 973

Query: 1054 LRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSG------- 1106
             R+MLHEVGLSLG+++WI DYHAF S   ++  +     C  AA+S L  G+        
Sbjct: 974  QRVMLHEVGLSLGVMQWIDDYHAFSSAAATNSFVSSGALCLQAASSELRRGTKFTQNALA 1033

Query: 1107 ---CAEGSLFESVGADVHIEECGAICDTICGEA-SDDGLGDCTTQTLPEDKECEDAALIV 1162
                 EG +  S GA  H EE   IC T   E  S D  G       PE  E +DA L++
Sbjct: 1034 KFPSCEGEMIISDGACGHNEEHSEICQTTGSEGVSVDRSGHGCILYAPELNEHKDATLVI 1093

Query: 1163 ESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELV-----QNA 1217
            ESIRRDEFGL P +S+MES+MLKKQHARLGRALHCLSQELYSQDSHFLLELV     QNA
Sbjct: 1094 ESIRRDEFGLDPTLSSMESSMLKKQHARLGRALHCLSQELYSQDSHFLLELVMVLLVQNA 1153

Query: 1218 DDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
            DDNIYPENVEPTLTFILQ+ GI+VLNNEQGFSA+NIRALCDVGNSTKKGS AGYIG+KGI
Sbjct: 1154 DDNIYPENVEPTLTFILQDRGIIVLNNEQGFSAQNIRALCDVGNSTKKGSKAGYIGQKGI 1213

Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESK 1337
            GFKSVFRVTDAPEIHSNGFHVKFD SEGQIGFVLPT++PP N+D+               
Sbjct: 1214 GFKSVFRVTDAPEIHSNGFHVKFDISEGQIGFVLPTVIPPCNVDLM-------------- 1259

Query: 1338 CWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKK 1397
                        K S+G  M+NI+ MFSDLHPSLLLFLH L+CI F+NMLNDSL+++RK+
Sbjct: 1260 ------------KLSKGTGMSNIISMFSDLHPSLLLFLHHLRCIKFKNMLNDSLIIMRKE 1307

Query: 1398 IVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQ 1457
            IVGDGIIKVS G +KMTWFV SQKLRA VIRPDV+TTEIA+A TLQES+ G Y P   QQ
Sbjct: 1308 IVGDGIIKVSHGREKMTWFVISQKLRADVIRPDVQTTEIAIAFTLQESDNGEYSPHFEQQ 1367

Query: 1458 PVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPC 1517
            PVFAFLPLRTYGLKFILQGDFVLPSSREEVDG+                           
Sbjct: 1368 PVFAFLPLRTYGLKFILQGDFVLPSSREEVDGDR-------------------------- 1401

Query: 1518 FRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLR 1577
                  KA + YMSFVPLVGEVHGFFS LPRMI+SKLRMSNCL+LEG+NN+W PPCKVLR
Sbjct: 1402 ------KAVAAYMSFVPLVGEVHGFFSSLPRMIISKLRMSNCLLLEGDNNEWVPPCKVLR 1455

Query: 1578 GWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTEN 1637
             WN++A SLLPD LL KHLGLGFL K+I LSD LARALGI+EYGPKILLQIISSLC TE+
Sbjct: 1456 SWNEQARSLLPDSLLCKHLGLGFLDKNIHLSDPLARALGIQEYGPKILLQIISSLCHTED 1515

Query: 1638 GLRSMGLSWLASWLNEL 1654
            GL+SMGL+WL+SW   L
Sbjct: 1516 GLKSMGLAWLSSWEKNL 1532



 Score =  710 bits (1833), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/817 (50%), Positives = 518/817 (63%), Gaps = 79/817 (9%)

Query: 1   MYGNHRPRFRPVGGVGGEGTGGGQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTN 60
           MYG++ P  RP    GG G GG  PP Q  PP   PN+  QN N +F NP  Q   NL  
Sbjct: 100 MYGHNPPGHRPTAAGGGSGRGGAVPPLQALPP--TPNYFIQNVNPYFQNPTFQ--ANLGL 155

Query: 61  IPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASNQQRPLT 120
              QNP    QNP L MQN NL  QNPNLP Q       N         P  + Q    T
Sbjct: 156 PYLQNPTLPTQNPTLPMQNTNLPLQNPNLPMQNTSFPLQN---------PSFAIQNTNFT 206

Query: 121 QQPQSLPPQPQKLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSL 180
                  PQP K NKE L+R+D AVVKAR ++I  GESVS+WKVS+S LL LQVDSW SL
Sbjct: 207 ----GFRPQPPKRNKEALDRVDGAVVKARRDVIATGESVSAWKVSQSALLALQVDSWESL 262

Query: 181 GFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPF 240
           GF MQEVPSLH L+V E +IN+FIHCFVGVR+ITSLYDLE AICK+EG+E+FE+LELGP 
Sbjct: 263 GFPMQEVPSLHSLIVIEGKINSFIHCFVGVRRITSLYDLEMAICKNEGVEQFEDLELGPL 322

Query: 241 LRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLAS 300
           +R PLI HYF + SD   VFKI + ++I CL E+ +  + K I ++EFL++IAKKRSL  
Sbjct: 323 VRHPLIMHYFSISSDASGVFKITSAEIISCLDEFMEACQDKHIIIEEFLEYIAKKRSLTG 382

Query: 301 KEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQ--PHHKRRKRPIFSSEKKRQLDE 358
           +E+LGVRIQ+LGMHI+FIREARK + +TLKK    L+  P  K R+ P+ SSEKK+ LDE
Sbjct: 383 RERLGVRIQSLGMHISFIREARKLEHMTLKKSQGSLKQIPDKKIREHPLRSSEKKK-LDE 441

Query: 359 RFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSN---DISSNVQLPS-Q 414
           RFSA+ +RVKSF+S ++DFG KH  FVSS SE++ SDD  YE S    D  SN +  S  
Sbjct: 442 RFSAMSQRVKSFASAHDDFGGKHTIFVSSCSEEDGSDDHKYEESEEDIDGCSNSKFSSPN 501

Query: 415 IKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCT 474
            K  DRVSSCPYPS  EE+ RLGLKGE     +++G SS   +  G  K+KRK  N  CT
Sbjct: 502 SKTRDRVSSCPYPSAIEEMTRLGLKGETEGNPSASG-SSMHSENTGPFKRKRKSSNRSCT 560

Query: 475 SSAPAKFLRRNKAKQRALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMS 534
            S   K  +RNK +                                     EAC+ +T++
Sbjct: 561 VSKYLKLPKRNKLEL------------------------------------EACQEHTIA 584

Query: 535 EVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAI 594
           EVLE+M QF+     ++     K MK M SSYP +GLLN+AVTSIK GMWDS+YDT QAI
Sbjct: 585 EVLERMLQFHGTQTKQR-----KIMKSMLSSYPFVGLLNVAVTSIKSGMWDSIYDTFQAI 639

Query: 595 TQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFDNSILN 654
           +Q EL N +     EYESI+  PS  V V            E+VM+ +  + E D+ I +
Sbjct: 640 SQDELTNKLPDKHSEYESID-GPSTGVTV------------EDVMRNLVTFFELDHDISH 686

Query: 655 NAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTELQKLFT 714
           + +SP+ K  +L R+L +CE W+ E+F +KEFKSLG+G+FFTFLEKHAS+L  EL K  T
Sbjct: 687 SGKSPLEKKFLLFRQLSNCEFWVAEKFSVKEFKSLGFGDFFTFLEKHASILPNELHKCLT 746

Query: 715 DDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKIEDKGS 774
            DT ++S LEV ++Q  LVVL+SQASN+LWE+E +TKQ IS LL+RQFP + FKI + G 
Sbjct: 747 SDTYEKSPLEVCMLQKQLVVLLSQASNSLWENETLTKQKISMLLKRQFPSVGFKILENGC 806

Query: 775 MESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSV 811
           M+ FL+ V + ++ V+S CVLFS T+LGT    D S+
Sbjct: 807 MDDFLDIVREQKSCVVSTCVLFSSTLLGTYTIKDSSL 843



 Score =  229 bits (584), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 109/209 (52%), Positives = 143/209 (68%), Gaps = 24/209 (11%)

Query: 1879 SCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKN-MWTKELLSPGSAAIDWESNE 1937
            SC   +WR FF+ +G+TDFVQ+VQV+K+V+DISH   KN MW ++L+S G+ A DWES E
Sbjct: 1547 SCTNCRWRGFFQALGVTDFVQIVQVEKNVSDISHMILKNEMWDRDLISHGTIAKDWESPE 1606

Query: 1938 LVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCIC 1997
            L                 +LL++LDTLWDDC++DK+ G+   K +GD + F+SS +  IC
Sbjct: 1607 L-----------------NLLDVLDTLWDDCFSDKVSGYCNFKSSGDRKPFKSSLMTSIC 1649

Query: 1998 DIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDD 2057
            D QW  SSMDDELHYPKDLF+D D V  +LG SAPY +PKV+S KL CDIG KT+VT+DD
Sbjct: 1650 DFQWIASSMDDELHYPKDLFYDSDEVHLVLGSSAPYALPKVRSGKLACDIGFKTKVTLDD 1709

Query: 2058 ILEILKVWTRLEAPF------MARYYLLV 2080
            IL IL+ W R E PF      M+++Y  +
Sbjct: 1710 ILGILQEWRRSETPFKASIAQMSKFYTFI 1738


>gi|168050836|ref|XP_001777863.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670728|gb|EDQ57291.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1962

 Score =  863 bits (2231), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 497/1288 (38%), Positives = 730/1288 (56%), Gaps = 101/1288 (7%)

Query: 838  TSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIR 897
            T+ DA++ L+ AP L +L+ WS WD +FAP+LGP   WL  E     +  LVT  G +++
Sbjct: 14   TNADALQALINAPMLVNLSLWSQWDTIFAPTLGPFLSWLEREGFKGNIYTLVTDAGVILK 73

Query: 898  IDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNV 957
            ID +A+ D+FL + + G S   AV+L S  AL GG  + P  LLK +A  A  V+ ++  
Sbjct: 74   IDGTATADNFLPSLISGDSKSAAVQLTSIVALYGGINHAPSALLKTYATKALTVLTRSFA 133

Query: 958  EDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGY 1017
             ++   +     M                                         LD L  
Sbjct: 134  SEVHSEDYSRVAM---------------------------------------LLLDILCV 154

Query: 1018 LPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAF 1077
            +P E + FAA +LL    S ++ + S +L  C   E R +LH++G++ GI EWI+++ A 
Sbjct: 155  VPQELQFFAAKILLPAFSSAVEYSASILLKSCKIDEHRFLLHKLGVAFGIEEWINNFKA- 213

Query: 1078 CSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEAS 1137
            C   +S   M       + +++ + S +         SV +   + E  +  + I  + +
Sbjct: 214  CVFKSS---MTEGAVVQDISSNEMRSPAISGNHEQLPSVQSQSDVMETDSTVERIENDQA 270

Query: 1138 DD-----GLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLG 1192
                   GL   +TQ+ P+    + A  +VE IRR+EFG+G ++   E  +L +QHAR+G
Sbjct: 271  STSAVVTGLSG-STQSAPDFDIRQRANDVVEFIRREEFGIGQDLKVQEQELLARQHARMG 329

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAEN 1252
            RALH LSQ+LYSQDSHF+LELVQNADDN Y   VE  L F++Q   ++V NNE GF++ N
Sbjct: 330  RALHRLSQDLYSQDSHFVLELVQNADDNSYAPGVEAALVFVVQSGHVMVFNNEVGFTSAN 389

Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
            +RALCDVGNSTK GSS GYIG KGIGFKSVFRVTDAPEIHSNGF+VKFD +E  +GF+LP
Sbjct: 390  LRALCDVGNSTKTGSSTGYIGHKGIGFKSVFRVTDAPEIHSNGFNVKFDITESNLGFILP 449

Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESK---------CWNTCIRLPFRTKFSEGIAMNNIVDM 1363
            T++        C       ++L  K         CWNT I LP ++  S G  M ++   
Sbjct: 450  TIIEARQ----CAKSYSGTLELVGKKVFSHEKVVCWNTRIDLPIKSSISNGKGMASLAAK 505

Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLR 1423
            F D+HPSLLLFLHRL+CI  +N +  +  ++ ++ + +G+++V   +   TW VA  ++ 
Sbjct: 506  FDDIHPSLLLFLHRLRCIAVQNDVVSATKLMHREDLANGLVRVVHDKGHATWLVARHQVL 565

Query: 1424 AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSS 1483
            AGV RP++  TEIALA  L++  +GNY  +  QQ VFAFLPLR+YGL+FI+Q DF+LPSS
Sbjct: 566  AGVPRPNILETEIALAFPLEDLIDGNYKAVAEQQQVFAFLPLRSYGLRFIVQCDFLLPSS 625

Query: 1484 REEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFF 1543
            REEVD +S WNQWL SE P +FV A   F  LP      A A S Y+ FVPL GEV GFF
Sbjct: 626  REEVDSDSAWNQWLRSEIPEVFVKAAEIFKTLPTLGTRGA-AVSNYLKFVPLEGEVLGFF 684

Query: 1544 SGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSK 1603
            S LPR+I ++L+   CL +EG+ + W+ PC VL  W++ A  L+PD LL++HLGL +L K
Sbjct: 685  SVLPRLIHTRLQAIPCLPVEGSESSWSFPCAVLMRWSETARELIPDALLKEHLGLQYLDK 744

Query: 1604 DIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTI--SFHS 1661
            ++ +SDSLAR+LG+++YG K L+ ++ S+ +  +G+  MGL W+  WL EL+        
Sbjct: 745  EVDISDSLARSLGVQKYGSKTLVALMKSVSQKSDGVGRMGLEWIRHWLIELHACLQKEQE 804

Query: 1662 SGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSD-CSVFDGGFG-LEAFP 1719
               SS  S  E+ L   L+ + FIPL+ G+F+++ +G +W  +    + DG    L  FP
Sbjct: 805  VFTSSADSTNESILFQELRAMQFIPLASGSFTAIKDGPVWFANHPAELQDGAIKPLGCFP 864

Query: 1720 NLCAKLRTVSPALL---SASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPA 1776
             L A+LRTV P L    SAS VD +    ++   ++ ML ++GV+ +SAH ++K H++PA
Sbjct: 865  LLYAELRTVHPTLFHSTSASEVDLADDPNVTGKVIS-MLHQLGVRPISAHHVLKSHVIPA 923

Query: 1777 ISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHG-FKRPS 1835
            ++DE     + +L+  Y+ +   HL   C  C  E   I  +L++ A + TN+G F+   
Sbjct: 924  MADENCTAKETSLLLQYIGYAKWHLGSSCNQCTGEIALIFEQLKQNALISTNNGVFRMGE 983

Query: 1836 EIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGIT 1895
            + PIHFGK  G+P     ++    +KW EV  +YL+      ++  +  WRKF  E+G+T
Sbjct: 984  KEPIHFGKSMGSPFDAQDVLSGCSMKWNEVSDSYLQLALPGQITPNVESWRKFLSELGVT 1043

Query: 1896 DFVQVVQVDKDVADISHTGFKNMWTKELLS------PGSAAIDWESNELVHLLSLLTTNV 1949
            DFVQV+ V K + + +     ++W  E+         G    DWES ELV L+S +TT  
Sbjct: 1044 DFVQVIPVKKIIKNKA----ASLWKDEVWEGVNKDDTGCVLEDWESPELVDLISTVTTKT 1099

Query: 1950 N--------RQCSKHLLEILDTLWDDCYTDKIMGFFK---SKPTGDDRSFQSSFINCICD 1998
            +        ++CS  +  ILD LW   Y  +    F+    K      +F +S++  +  
Sbjct: 1100 SARKVHKKAKRCSA-IFSILDKLWQSNYASRCRATFRISDEKQAKQGLTF-ASWVLQVRK 1157

Query: 1999 IQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDI 2058
            ++W  SS+D  LH P  LF  C++V  ILG  A Y+  ++ +  L   +GL+TEVT+DD 
Sbjct: 1158 LKWVKSSLDSRLHTPASLFQRCESVNKILGNHAAYLRTQIHNVHLTETLGLRTEVTVDDC 1217

Query: 2059 LEILKVWTR----LEAPF--MARYYLLV 2080
            L +L+ W      LEA    MAR YL +
Sbjct: 1218 LSLLQHWGSSNQVLEASVAQMARLYLFL 1245


>gi|302811920|ref|XP_002987648.1| hypothetical protein SELMODRAFT_126522 [Selaginella moellendorffii]
 gi|300144540|gb|EFJ11223.1| hypothetical protein SELMODRAFT_126522 [Selaginella moellendorffii]
          Length = 1823

 Score =  822 bits (2123), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 496/1253 (39%), Positives = 714/1253 (56%), Gaps = 122/1253 (9%)

Query: 836  SVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKV 895
            S T  DA   L++AP L+DL  WS W  ++ P LGPL  WL +    +++L + TRDGK+
Sbjct: 72   SKTDADAEACLVKAPMLADLTLWSQWTDIYEPQLGPLLEWLEHGRCNRQVLAVATRDGKL 131

Query: 896  IRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKN 955
            +R+D  ASVDSFL+A + G S   A +L+S   L GG    P  LL+ HA    ++ F  
Sbjct: 132  LRVDGDASVDSFLDAVINGRSLDVAAQLVSLVVLYGGAARTPKALLQTHALKGLQIYF-- 189

Query: 956  NVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCL 1015
                                      D A ++N  G  +KQ  +   A      F LDCL
Sbjct: 190  --------------------------DRARVNN--GRKRKQENRHEAAC-----FILDCL 216

Query: 1016 GYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYH 1075
               P +F  F+A+VL +GL     DA S +L  C      LMLH VGL+LG+ +WI+DY 
Sbjct: 217  ASAPHDFIEFSAEVLYAGLSRVFPDAGSLLLQNCSSVLHHLMLHRVGLALGVPQWINDYS 276

Query: 1076 AFCSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHI---EECGAICDTI 1132
             F +   +D              S  +S S C   + F+    D  +    + G+   ++
Sbjct: 277  TFEAASHAD-------------ESYRDSSSACE--NHFKKPRRDETVLSERQPGSQARSV 321

Query: 1133 CGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLG 1192
              E +D          + E+    DA  +V  IR+DEFG        +S +L KQHAR+G
Sbjct: 322  KKEVAD----------MSEESLDSDAREVVRMIRKDEFGFEEQDEVTQSGLLAKQHARMG 371

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAEN 1252
            RAL CLSQ LYS+D HF+LELVQNADDN YP N  PTL FI+Q+  IVVLNNE+GF+A+N
Sbjct: 372  RALECLSQNLYSEDCHFILELVQNADDNSYPLNAHPTLAFIIQKDRIVVLNNEKGFTAKN 431

Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
            IRALCDVG STKKG  AGYIG KGIGFKSVFRVT++P+IHSNGFHVKFD SE  IGF+LP
Sbjct: 432  IRALCDVGKSTKKGVPAGYIGHKGIGFKSVFRVTESPQIHSNGFHVKFDISESSIGFILP 491

Query: 1313 TLV-PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSL 1371
            T V  P +      +LS+   + ++ C NT I LP R    E    N++  MF D+ P L
Sbjct: 492  TWVQSPLDNAYLENVLSQATGE-DTWCINTRIDLPLRASVVEA---NSMSRMFQDVQPKL 547

Query: 1372 LLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
            LLFLH+L+ ++ +N  +DS + +R++ +G+ IIK++ G    T+ V  Q L A + R  V
Sbjct: 548  LLFLHKLRTVVVKNEASDSCLTMRRQDIGNNIIKLTDGRSTGTFLVVRQLLDAPMDRAGV 607

Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
              TE+ALA  L+ES +G+Y  L  QQ VFAFLP+R+YGLKF++QGDF+LPSSRE++D +S
Sbjct: 608  PDTELALAFELKESRDGHYAALAVQQQVFAFLPVRSYGLKFVVQGDFILPSSREDLDRDS 667

Query: 1492 PWNQWLLSEFPALFVSAERSFCDLPCFRENPA-----KAASVYMSFVPLVGEVHGFFSGL 1546
             WNQWL +E P LF+ +      L CF+ +       +A S Y+S+VP+ GEV GFFS +
Sbjct: 668  AWNQWLRAEVPKLFLQS------LHCFQSSAVFGSRIEAISSYLSYVPVEGEVLGFFSSV 721

Query: 1547 PRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIV 1606
            PR+ILS LR + CL +EG +  W+ PC  LRGW D    L+PD +L+K+LGL +L ++++
Sbjct: 722  PRIILSLLRAAPCLPVEG-SELWSLPCTTLRGWGDAFRLLVPDDMLKKYLGLLYLDREVI 780

Query: 1607 LSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSS 1666
            +SD LA ALG+ EYG + L+ ++ +  ++   L  +G+ WL   L E++   F    Q+ 
Sbjct: 781  ISDILASALGVPEYGVETLVGLLKAAAKS---LDDLGIEWLRRCLVEIHKC-FAYEKQNF 836

Query: 1667 LQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEA----FPNLC 1722
            +    E  +   L+ +PFIPLS G F+S+++G +W       F  G GL+     F +L 
Sbjct: 837  VHQ--EQKVAGTLRGVPFIPLSSGGFTSLEDGLVW-------FLDGEGLQGHSKDFLDLY 887

Query: 1723 AKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETT 1782
            + +RTVSP L+     ++S+ G I       ML+K+GV+ ++AH+++K H+LPA++    
Sbjct: 888  SDIRTVSPFLID----EESTAGTI-----KEMLVKLGVKVVTAHEVIKSHVLPAMASVDC 938

Query: 1783 ANGDKNLMADYLCFVMMHLEYY--CPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIH 1840
               D +L+  YL +  +H +    C +C  E++ ++S+LR  A VLT  G +  +E  IH
Sbjct: 939  LEKDPSLLLQYLAYCALHFDKLDNCRSCQAEKDGLLSQLRNTAIVLTKDGLRSVAEHSIH 998

Query: 1841 FGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQV 1900
            FGKE GN V V  L+ D  + W EVD  YL+     +       WRKFF  +G++DF+ V
Sbjct: 999  FGKELGNIVDVEKLLEDAPVPWCEVDSKYLQFNPSPAF------WRKFFMSLGVSDFINV 1052

Query: 1901 VQVDKDVADISHTGFKN-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLE 1959
            +     V D S T +KN  W     +      DW   +   + + L  +   +  +  L 
Sbjct: 1053 LPSKVRVGDKSSTVWKNESWKHADGNTAWIIQDWGCADSFRICNALARSGRPENCRLFLT 1112

Query: 1960 ILDTLWDDCYTDKIMGFF--KSKPT-GDDRSFQSSFINCICDIQWTISSMDDELHYPKDL 2016
            +LD++WD+ Y     G +  +  P+ GDD   +SSF+  +    W  SS+D +LH P +L
Sbjct: 1113 VLDSMWDEHYDRFSKGSYSVEGAPSIGDD--CESSFVLQLRGYTWIGSSLDHQLHKPTEL 1170

Query: 2017 FHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVWTRLE 2069
            FH   +V S+LG    Y  P+V S KL   +GL TEV ++DI+ ++K+    E
Sbjct: 1171 FH--KSVESMLGSYVSYARPQVGSSKLAKALGLHTEVRLEDIINLMKLLNERE 1221


>gi|302811833|ref|XP_002987605.1| hypothetical protein SELMODRAFT_158880 [Selaginella moellendorffii]
 gi|300144759|gb|EFJ11441.1| hypothetical protein SELMODRAFT_158880 [Selaginella moellendorffii]
          Length = 1734

 Score =  805 bits (2080), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/1237 (39%), Positives = 705/1237 (56%), Gaps = 121/1237 (9%)

Query: 851  FLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEA 910
             L+DL  WS W  ++ P LGPL  WL +    +++L + TRDGK++R+D  ASVDSFL+A
Sbjct: 1    MLADLTLWSQWTDIYEPQLGPLLEWLEHGRCNRQVLAVATRDGKLLRVDGDASVDSFLDA 60

Query: 911  ALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRM 970
             + G S   A +L+S   L GG    P  LL+ HA    ++ F                 
Sbjct: 61   VVNGRSLDVAAQLVSLVVLYGGAARTPKALLQTHALKGLQIYF----------------- 103

Query: 971  HGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVL 1030
                       D A ++N  G  +KQ     +    A+ F LDCL   P +F  F+A+VL
Sbjct: 104  -----------DRARVNN--GRKRKQ-----ENHHEAACFILDCLASAPHDFIEFSAEVL 145

Query: 1031 LSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCV 1090
             +GL     DA S +L  C      LMLH VGL+LG+ +WI+DY  F +   +D      
Sbjct: 146  YAGLSRLFPDAGSLLLQNCSSVLHHLMLHRVGLALGVPQWINDYSTFEAASHAD------ 199

Query: 1091 VTCTNAATSGLNSGSGCAEGSLFESVGADVHI---EECGAICDTICGEASDDGLGDCTTQ 1147
                    S  +S S C   + F+    D  +    + G+   ++  E +D         
Sbjct: 200  -------ESYRDSSSACE--NHFKKPRRDETVLSERQPGSQLRSVKKEVAD--------- 241

Query: 1148 TLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDS 1207
             + E+    DA  +V  IR+DEFG        +S +L KQHAR+GRAL CLSQ LYS+D 
Sbjct: 242  -MSEESLDSDAREVVRMIRKDEFGFEEQDEVTQSGLLAKQHARMGRALECLSQNLYSEDC 300

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            HF+LELVQNADDN YP N  PTL FI+Q+  IVVLNNE+GF+A+NIRALCDVG STKKG 
Sbjct: 301  HFILELVQNADDNSYPLNAHPTLAFIIQKDRIVVLNNEKGFTAKNIRALCDVGKSTKKGV 360

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV-PPFNIDMFCRL 1326
             AGYIG KGIGFKSVFRVT++P+IHSNGFHVKFD SE  IGF+LPT V  P +      +
Sbjct: 361  PAGYIGHKGIGFKSVFRVTESPQIHSNGFHVKFDISESSIGFILPTWVQSPLDNAYLENV 420

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
            LS+   + ++ C NT I LP R    E    N++  MF D+ P LLLFLH+L+ ++ +N 
Sbjct: 421  LSQATGE-DTWCINTRIDLPLRASVVEA---NSMSRMFQDVQPKLLLFLHKLRTVVVKNE 476

Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
             +DS + +R++ +G+ IIK++ G    T+ V  Q L A + R  V  TE+ALA  L+ES 
Sbjct: 477  ASDSCLTMRRQDIGNNIIKLTDGRSTGTFLVVRQLLDAPMDRAGVPDTELALAFELKESR 536

Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
            +G+Y  L  QQ VFAFLP+R+YGLKF++QGDF+LPSSRE++D +S WNQWL +E P LF+
Sbjct: 537  DGHYAALAVQQQVFAFLPVRSYGLKFVVQGDFILPSSREDLDRDSAWNQWLRAEVPKLFL 596

Query: 1507 SAERSFCDLPCFRENPA-----KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLI 1561
             +      L CF+ +       +A S Y+S+VP+ GEV GFFS +PR+ILS LR + CL 
Sbjct: 597  QS------LHCFQSSAVFGSRIEAISSYLSYVPVEGEVLGFFSSVPRIILSLLRAAPCLP 650

Query: 1562 LEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYG 1621
            +EG +  W+ PC  LRGW D    L+PD +L+K+LGL +L +++++SD LA ALG+ EYG
Sbjct: 651  VEG-SELWSLPCTTLRGWGDAFRLLVPDDMLKKYLGLLYLDREVIISDILASALGVPEYG 709

Query: 1622 PKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQR 1681
             + L+ ++ +  ++   L  +G+ WL   L E++   F    Q+ +    E  +   L+ 
Sbjct: 710  VETLVGLLKAAAKS---LDDLGIGWLRRCLVEIHKC-FAYEKQNFVHQ--EQKVAGTLRG 763

Query: 1682 IPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEA----FPNLCAKLRTVSPALLSASA 1737
            +PFIPLS G F+S+++G +W       F  G GL+     F +L + +RTVSP L+    
Sbjct: 764  VPFIPLSSGGFTSLEDGLVW-------FLDGEGLQGRSKDFLDLYSDIRTVSPFLID--- 813

Query: 1738 VDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFV 1797
             ++S+ G I       ML+K+GV+ ++AH+++K H+LPA++       D +L+  YL + 
Sbjct: 814  -EESTAGTI-----KEMLVKLGVKVVTAHEVIKSHVLPAMASVDCLKKDPSLLLQYLAYC 867

Query: 1798 MMHLEY-YCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIH 1856
             +H E   C +C  E++ ++S+LR  A VLT  G +  +E  IHFGKE GN V V  L+ 
Sbjct: 868  ALHFELDNCRSCQAEKDGLLSQLRNTAIVLTKDGLRSVAEHSIHFGKELGNIVDVEKLLE 927

Query: 1857 DIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFK 1916
            D  + W EVD  YL+     +       WRKFF  +G++DF+ V+     V D S T +K
Sbjct: 928  DAPVPWCEVDSKYLQFNPSPAF------WRKFFMSLGVSDFINVLPSKVRVGDKSSTVWK 981

Query: 1917 N-MWTKELLSPGSAAIDWESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMG 1975
            N  W     +      DW   +   + + L  +   +  +  L +LD++WD+ Y     G
Sbjct: 982  NESWKHADGNTAWIIQDWGCADSFRICNALARSGRPENCRLFLTVLDSMWDEHYDRFSKG 1041

Query: 1976 FF--KSKPT-GDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAP 2032
             +  +  P+ G D   +SSF+  +    W  SS+D +LH P +LFH   +V S+LG    
Sbjct: 1042 SYSVEGAPSIGGD--CESSFVLQLRGYAWIGSSLDHQLHKPTELFH--KSVESMLGSYVS 1097

Query: 2033 YIVPKVKSEKLVCDIGLKTEVTIDDILEILKVWTRLE 2069
            Y  P+V S KL   +GL TEV ++DI+ ++K+    E
Sbjct: 1098 YARPQVGSSKLAKALGLHTEVRLEDIINLMKLLNERE 1134


>gi|147796512|emb|CAN68152.1| hypothetical protein VITISV_009301 [Vitis vinifera]
          Length = 604

 Score =  568 bits (1464), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 303/520 (58%), Positives = 357/520 (68%), Gaps = 53/520 (10%)

Query: 816  SLLETTSAITHTGLRPKMSESVTS-KDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPG 874
            +  E    I+H+G  P + + V+  + A E+   AP LSDLNSWSHWD +FAPSLGPL  
Sbjct: 88   TFFELDHDISHSGKSPLLGKEVSPLQAAFEL---APMLSDLNSWSHWDLIFAPSLGPLVW 144

Query: 875  WLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEK 934
            WLLNEVN KELLCLVT+DGKV+RIDHSA+ DSFLEA+L+GSSF+TAV+LLS F+L GG++
Sbjct: 145  WLLNEVNTKELLCLVTKDGKVMRIDHSATXDSFLEASLQGSSFRTAVQLLSLFSLFGGKR 204

Query: 935  NVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQ 994
            +VP  LLKCHAR AFEV+ +N+VE++EV  SQ+  MHG  L  R+  D+    NLS   Q
Sbjct: 205  HVPFSLLKCHARQAFEVILRNSVENMEVNESQDSLMHGKPLFQREMLDMDATSNLSSGSQ 264

Query: 995  KQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTEL 1054
            + + +  KAVPVASRF LDCLGYLPSEFRSFAAD+LLSGLQ    + PSAIL EC+Q + 
Sbjct: 265  RNMSRTSKAVPVASRFLLDCLGYLPSEFRSFAADILLSGLQPFTINGPSAILDECNQMDQ 324

Query: 1055 RLMLHEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSG-------- 1106
            R+MLHEVGLSLG+++WI DYHAF S   ++  +     C  AA+S L  G          
Sbjct: 325  RVMLHEVGLSLGVMQWIDDYHAFSSAAATNSFVSSGALCLQAASSELRRGXKFTQNALAK 384

Query: 1107 --CAEGSLFESVGADVHIEECGAICDTICGEA-SDDGLGDCTTQTLPEDKECEDAALIVE 1163
                EG +  S GA  H EE   IC T   E  S D  G       PE  E +DA L++E
Sbjct: 385  FPSCEGEMIISDGACGHNEEHSEICQTTGSEGVSVDRSGHGCILYAPELNEHKDATLVIE 444

Query: 1164 SIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLEL---------- 1213
            SIRRDEFGL P +S+MES+MLKKQHARLGRALHCLSQELYSQDSHFLLEL          
Sbjct: 445  SIRRDEFGLDPTLSSMESSMLKKQHARLGRALHCLSQELYSQDSHFLLELTVGSKFNAIE 504

Query: 1214 ----------------------------VQNADDNIYPENVEPTLTFILQESGIVVLNNE 1245
                                        VQNADDNIYPENVEPTLTFILQ+ GI+VLNNE
Sbjct: 505  IYALGGKQWRWSLVHHXRKKPWCLHIAQVQNADDNIYPENVEPTLTFILQDRGIIVLNNE 564

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRV 1285
            QGFSA+NIRALCDVGNSTKKGS AGYIG+KGIGFKSVFRV
Sbjct: 565  QGFSAQNIRALCDVGNSTKKGSKAGYIGQKGIGFKSVFRV 604



 Score = 80.5 bits (197), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 40/88 (45%), Positives = 57/88 (64%), Gaps = 3/88 (3%)

Query: 576 VTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEYESIEIEPSEQVAVVNTKHTVQ---D 632
           VTSIK GMWDS+YDT QAI+Q EL N +     EYESI++EPSE   V  T   ++    
Sbjct: 17  VTSIKSGMWDSIYDTFQAISQDELTNKLPDKHSEYESIDVEPSENDTVAITDCILEHRHS 76

Query: 633 IQVEEVMKKVSKYLEFDNSILNNAQSPV 660
           + VE+VM+ +  + E D+ I ++ +SP+
Sbjct: 77  VTVEDVMRNLVTFFELDHDISHSGKSPL 104


>gi|359477941|ref|XP_002270965.2| PREDICTED: uncharacterized protein LOC100250350 [Vitis vinifera]
          Length = 488

 Score =  494 bits (1273), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 273/530 (51%), Positives = 351/530 (66%), Gaps = 49/530 (9%)

Query: 55  LQNLTNIPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASN 114
           +QN TN+P QNPN  +QN +  +QN +   QN N                          
Sbjct: 1   MQN-TNLPLQNPNLPMQNTSFPLQNPSFAIQNTNF------------------------- 34

Query: 115 QQRPLTQQPQSLPPQPQKLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQV 174
                        PQP K NKE L+R+D AVVKAR ++I  GESVS+WKVS+S LL LQV
Sbjct: 35  ---------TGFRPQPPKRNKEALDRVDGAVVKARRDVIATGESVSAWKVSQSALLALQV 85

Query: 175 DSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEE 234
           DSW SLGF MQEVPSLH L+V E +IN+FIHCFVGVR+ITSLYDLE AICK+EG+E+FE+
Sbjct: 86  DSWESLGFPMQEVPSLHSLIVIEGKINSFIHCFVGVRRITSLYDLEMAICKNEGVEQFED 145

Query: 235 LELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAK 294
           LELGP +R PLI HYF + SD   VFKI + ++I CL E+ +  + K I ++EFL++IAK
Sbjct: 146 LELGPLVRHPLIMHYFSISSDASGVFKITSAEIISCLDEFMEACQDKHIIIEEFLEYIAK 205

Query: 295 KRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQ--PHHKRRKRPIFSSEK 352
           KRSL  +E+LGVRIQ+LGMHI+FIREARK + +TLKK    L+  P  K R+ P+ SSEK
Sbjct: 206 KRSLTGRERLGVRIQSLGMHISFIREARKLEHMTLKKSQGSLKQIPDKKIREHPLRSSEK 265

Query: 353 KRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSN---DISSNV 409
           K+ LDERFSA+ +RVKSF+S ++DFG KH  FVSS SE++ SDD  YE S    D  SN 
Sbjct: 266 KK-LDERFSAMSQRVKSFASAHDDFGGKHTIFVSSCSEEDGSDDHKYEESEEDIDGCSNS 324

Query: 410 QLPS-QIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKI 468
           +  S   K  DRVSSCPYPS  EE+ RLGLKGE     +++G SS   +  G  K+KRK 
Sbjct: 325 KFSSPNSKTRDRVSSCPYPSAIEEMTRLGLKGETEGNPSASG-SSMHSENTGPFKRKRKS 383

Query: 469 ENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQD-DELNEADISFSNESMRMFITTWKEA 527
            N  CT S   K  +RNK +   L ++  ++ ++ + LNEAD   +N+SMRMFITTWKEA
Sbjct: 384 SNRSCTVSKYLKLPKRNKLELVPLSVDHDNEKKELNNLNEADFLLANDSMRMFITTWKEA 443

Query: 528 CKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVT 577
           C+ +T++EVLE+M QF+     ++     K MK M SSYP +GLLN+AV+
Sbjct: 444 CQEHTIAEVLERMLQFHGTQTKQR-----KIMKSMLSSYPFVGLLNVAVS 488


>gi|405957164|gb|EKC23395.1| hypothetical protein CGI_10019612 [Crassostrea gigas]
          Length = 3111

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 386/1256 (30%), Positives = 602/1256 (47%), Gaps = 175/1256 (13%)

Query: 838  TSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPL----------PGWLL---NEVNVKE 884
            T + A+  L   P L ++  WS W  +F P LG L            W L   N++   +
Sbjct: 1000 TKEAALACLENCPLLENIAEWSQWTLVFEPQLGKLRDFIQKYGECQTWPLEGGNKIMKTD 1059

Query: 885  LLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCH 944
             + L +  G+ +R+  S+S + F  A    S  +    L+S    + G +N P  LL  H
Sbjct: 1060 FIALESSPGQFLRLISSSSPEKFNAALQRRSPREVGGHLMSLVVGSKGVENTPRALLASH 1119

Query: 945  ARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAV 1004
             +   E +FK               +HG           A++D   G       + G+AV
Sbjct: 1120 IK---EELFK---------------LHGE----------ASVDRTPGAPAS--FQHGEAV 1149

Query: 1005 PVASRFFLDCLGYLPSEFRSFAAD-VLLSGLQSTIKDAPS-AILLECHQTELRL-MLHEV 1061
                +F LDCL  LP +  S  A+ + L  L   +  + S  +LL+   TE +L  L ++
Sbjct: 1150 ---VKFVLDCLLVLPVKSCSVLANQMFLDPLGQVVGQSKSKTLLLQSCTTESQLSQLQKL 1206

Query: 1062 GLSLGILEWIHDYHAFCSTGTSDL--LMPCVV----------------TCTNAATSGLN- 1102
            G  LG+ EW +   A C    + L  + P +                    +A+TS L+ 
Sbjct: 1207 GCMLGVAEWTNSIQARCRPSHASLEEVDPEIADEFFRDDPEPIDDSEDEEDSASTSYLSD 1266

Query: 1103 -------SGSGCAEGSLFESVGADVHIEECGAICDTICGEA-------SDDGLGDC-TTQ 1147
                    G    E  +      ++ +EE     D    E        S+D + D  + +
Sbjct: 1267 VEVETEVKGQVEEEELIVLDERTEIEMEEGDVTGDKQKKEVTIDLTVKSEDNVIDLLSPE 1326

Query: 1148 TLPEDKECEDAAL-----IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQEL 1202
              PE    E  A      IVE IRR+EFG+G  ++     +++ Q  RLGR+L  LS++L
Sbjct: 1327 PKPEAVSMETDAEKTCRGIVEQIRREEFGIGVELNEDGQRLMRVQQERLGRSLDRLSKDL 1386

Query: 1203 YSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRALCD 1258
            YS+D+HF+LEL+QNADDN YP+++     P++ FI+Q  G+VVLNNE GF+ +NIRALCD
Sbjct: 1387 YSKDTHFVLELIQNADDNSYPDSLGEGECPSVQFIMQPEGVVVLNNETGFADKNIRALCD 1446

Query: 1259 VGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPF 1318
            VG STK    AGYIG+KGIGFKSVFRVTD PEIHSNGFH+KFD   G  G++LP  V   
Sbjct: 1447 VGRSTKGKHKAGYIGQKGIGFKSVFRVTDEPEIHSNGFHIKFDVHSGPTGYILPQWV--- 1503

Query: 1319 NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
                      ++    + + W T I LP +           +   F+D+HPSLLLFL+RL
Sbjct: 1504 ----------EEDKWKDGEGWTTKIVLPLKESIRS--QTTTLAARFNDIHPSLLLFLNRL 1551

Query: 1379 QCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIR------PDVK 1432
            + I   N + + +  I +K  G+ +++++  +    W V  + L A  I        +V+
Sbjct: 1552 RKITIDNRVENFVQNIHRKDYGENVVEITHDDKMDRWLVVKKSLDASKISLQAKSGVEVE 1611

Query: 1433 TTEIALALTLQ---ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG 1489
            +TEIALA  L+   +  +  + P   +QPVFAFLPLR+YG +FI+QGDF +PSSRE+VD 
Sbjct: 1612 STEIALAFPLRPPGQKKDSKFVP--PKQPVFAFLPLRSYGFRFIVQGDFDVPSSREDVDR 1669

Query: 1490 NSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRM 1549
            +S WNQWL +E   LF+ A   F     F     +A + ++ FVP+  E+  FF  +   
Sbjct: 1670 DSSWNQWLRNEIHILFLEALEIFKSHADFSN--MEAVTSFLQFVPVEDEIIDFFRPVANQ 1727

Query: 1550 ILSKLRMSNCLILEGNNN---QWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDI- 1605
            IL KLR   C+  + N+     W  P +++R  +     ++   LL+  L L +L +D+ 
Sbjct: 1728 ILQKLRAKPCVPTQPNSKGQCAWKIPSQMVRVKDLLVLEVVTPDLLKDTLNLFYLHRDVA 1787

Query: 1606 -VLSDSLARALGIEEYGPKILLQIISSL----CRTENGLRSMGLSWLASWLNELY-TISF 1659
              L+ SL  +LGIE    + LL I  SL      T+   +   +   A WL  +Y +++ 
Sbjct: 1788 ATLNSSLTHSLGIESLNTEHLLLIGKSLSLQWAATDGRDKDNDILMTAKWLACVYRSLNE 1847

Query: 1660 HSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWL----HSDCSVFDGGFGL 1715
             +  Q          + D L+ +  IPL+        + TI+     +++ S  D     
Sbjct: 1848 FTDNQP---------VYDKLKALKIIPLATYQRQCTADSTIFFPLSENTELSYND----- 1893

Query: 1716 EAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILP 1775
                 L   L T+ P L+S++  + +S        + ++LLKIGV QLS  +++  HILP
Sbjct: 1894 -PMSVLQQDLNTIHPLLVSSADSEVNS-------QVQKLLLKIGVCQLSPREVISHHILP 1945

Query: 1776 AISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPS 1835
             +         +N++  Y+ ++    E      ++E      EL   A + TN G + PS
Sbjct: 1946 ILKSNKVKEKSENILISYVVYIKEQTERDPSLINME------ELGMSAMLKTNKGLQNPS 1999

Query: 1836 EIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGIT 1895
            ++PIHF  E+ N +++  ++   D  W  +D  YL++         + +WR FF +IG+T
Sbjct: 2000 KVPIHFTPEYKNSINLQAVLPGYD--WVLLDPVYLRNKKN---PVEIQRWRLFFSKIGLT 2054

Query: 1896 DFVQVVQVDKDVA--DISHTGFKNMWTKELLS---PGSAAIDWESNELVHLLSLLTTNVN 1950
            DF+ + QV+ +++  +I  T +  +  KELL     GS   DW S+ELV L+   +   +
Sbjct: 2055 DFISLKQVNVEISNENIGTTPWAPL--KELLGTIPKGSYIKDWSSSELVDLIQNNSQPTS 2112

Query: 1951 R-QCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDDRSFQSSFINCICDIQWTISS 2005
              Q  + L  +L   WD  +   I     ++  G  +   SSF   +    W   S
Sbjct: 2113 YPQQMRTLCVLLAEEWDRNFKGYISTHMCTESGGLLKDVDSSFSLLLRTASWVPGS 2168



 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/213 (21%), Positives = 116/213 (54%), Gaps = 16/213 (7%)

Query: 132 KLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGF--QMQEVPS 189
           + +++ ++ + +  ++A +E   A E VS  +V   +L +    + + LG     Q++P 
Sbjct: 538 RFSQDQIQMVAKECIEALAE---ADEHVSVERVENLMLQRFGKRNIAELGHWRNAQQIPC 594

Query: 190 L--HKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSE---GIEKFEELELGPFLRQP 244
           L  H  M+   ++NA++  ++    I +L++L+  +C +E     + FE L++GP  R P
Sbjct: 595 LFEHNRMI--CKVNAYVQAYMNTHSICTLHELK--LCMAEFAPDKDSFEALQIGPLQRLP 650

Query: 245 LISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHK--AKDIKVDEFLDFIAKKRSLASKE 302
           +I ++F    D+  + +I T+D+   L  Y + ++  +  ++++ F++++ +   + +  
Sbjct: 651 IIYNHFKFPHDMADIPEITTSDIFENLRNYLNKYQKWSTKLELEHFMEYLVETYHVENAY 710

Query: 303 QLGVRIQNLGMHINFIREARKSQDVTLKKFLKE 335
            LG+RI++L + +  ++++++    T +K  ++
Sbjct: 711 VLGIRIRSLPLAMQVLKKSQRDAGGTRRKVTED 743


>gi|326672123|ref|XP_001920014.3| PREDICTED: hypothetical protein LOC335618 [Danio rerio]
          Length = 2252

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 304/934 (32%), Positives = 471/934 (50%), Gaps = 90/934 (9%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            ++E IR+ EFG+G  ++     +++KQ ARLGR+L  LS ELYS+D+HF+LEL+QNADDN
Sbjct: 580  VIEDIRKTEFGIGVELNQEGQILMEKQQARLGRSLQRLSDELYSKDTHFVLELIQNADDN 639

Query: 1221 IYPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
             YP+  E P LTF++++S I +LNNE GF  +NIRA+CDVG STK     GYIG+KGIGF
Sbjct: 640  SYPDGGEQPALTFVVEKSCITILNNECGFEEKNIRAICDVGRSTKGKHKYGYIGQKGIGF 699

Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
            KSVF+VTD PEIHSNGFH++FD + G +G++LP  +         R ++    ++  K W
Sbjct: 700  KSVFKVTDCPEIHSNGFHIQFDKTSGPMGYILPHWLEQE------RPVNPAAKEIAEKRW 753

Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
             T I LP RT   E     N   +F D+HPSLLLFLHRL+ I   N     LV + ++ +
Sbjct: 754  TTKIHLPLRT---ENYQTKN---LFHDVHPSLLLFLHRLRSITIFNEAEKRLVSMTRRDL 807

Query: 1400 GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
               I++VS    +  W V  + L    I+ DV++TE+ALA  +  S+  +  P   +QPV
Sbjct: 808  SHNILEVSHTGGEERWLVLKKMLFPKKIKDDVESTELALAFQIHSSSCADVKP--QKQPV 865

Query: 1460 FAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFR 1519
            FAFLPLR++G +FI+QGDF +PSSRE+VD +S WNQWL SE P LFV A   F   P F 
Sbjct: 866  FAFLPLRSFGFRFIIQGDFDIPSSREDVDRDSSWNQWLRSEIPQLFVQAMDVFSKHPEF- 924

Query: 1520 ENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNN---QWAPPCKVL 1576
             +       ++ F+P   E+  FF+ +   I+  LR   CL  + ++    ++  P ++ 
Sbjct: 925  -SGLSGLCYFLQFIPQPSEILDFFNPVANQIIQLLRGRACLPAQEDHEGRVEFRQPSQL- 982

Query: 1577 RGWNDRAHSLLPDILLQKHLGLGFLSKDIVL-----------SDSLARALGIEEYGPKIL 1625
                    ++ PD L+Q+ +G   LS+ + L           S SL  ALG+       +
Sbjct: 983  --------AVCPDRLIQQVIGGQDLSRSLSLWYLHPDLQCSVSGSLLSALGVHRLRAAEI 1034

Query: 1626 LQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETD-LIDNLQRIPF 1684
            L +   + R                L  L    F      +L+S  E D L+ +++ +P 
Sbjct: 1035 LTVTCCMARELLQHTQTHTDGELEKLARLLVCIFR-----ALESEYEVDALLQSVRDVPM 1089

Query: 1685 IPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLG 1744
            IPL+DG+  S+    ++   D  V      L+A   L   L TV   LL        SL 
Sbjct: 1090 IPLADGSVVSLSSEGVFFPLD-DVTQAHADLQA---LYQDLNTVDRRLL-------QSLD 1138

Query: 1745 VISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYY 1804
             +    +  +L ++ V +L    ++  HI P +         + ++  YL F+ +H +  
Sbjct: 1139 ALGNSQVRELLRRLDVHELEPQQVLHQHIYPQLRSGDWQKKPQGIVVSYLVFIKLHAQ-- 1196

Query: 1805 CPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYE 1864
                  ++E  +  L     VLTN GF+ P+E  IHF +++ N    ++L     ++W  
Sbjct: 1197 ------DQE--IRSLAAHVPVLTNQGFRCPAEHRIHFSRDYKNIHLPSVL---PGVQWLL 1245

Query: 1865 VDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELL 1924
            +D  YL+  A  S       WR  F  +G+ D + + + +K     +         +  L
Sbjct: 1246 LDSCYLRADADVS------GWRSVFTALGVRDLL-IFRKEKRTLTTAQLSCSPWAAESAL 1298

Query: 1925 SPGSA-----AIDWESNELVHLLSL--LTTNVNRQCSKHLLEILDTLWD--DCYTDKIMG 1975
             P  A      +D + +EL  LL+   L+     Q  + L+++LD  W+  + Y   +  
Sbjct: 1299 WPTPADGQYTIMDHQCSELHTLLTTDQLSDQEKLQQRRALIQLLDNNWNSGERYGQLVSA 1358

Query: 1976 FFKSKPTGDDRSFQSSFINCICDIQW--TISSMDDELHY--PKDLFHDCDAVRSILGPSA 2031
                   G  R+ +SSFI  + +  W  T   M   + Y  P+ ++    AV  +LG   
Sbjct: 1359 QLLDADGGVVRNVRSSFIQMLTEFSWIPTHRPMCSTVSYLQPRSVYILSTAVNRLLGAHV 1418

Query: 2032 PYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
             Y+  +  + +    IGL+  V ++D++  LK W
Sbjct: 1419 DYMHSEQTASEFTSAIGLRHSVCVEDMVGYLKRW 1452



 Score = 90.9 bits (224), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 108/432 (25%), Positives = 167/432 (38%), Gaps = 66/432 (15%)

Query: 648  FDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLST 707
            F  S+ N   S  T  ++ LR L   E  + E FR ++F  L  G F  F+ K+  +L  
Sbjct: 88   FQESVCN---SKSTLTLVHLRTL---EKRIAEHFRFRDFSQLQQGTFLDFIIKNKKVLQE 141

Query: 708  ELQKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISF 767
                    D  D         +  +   + Q  +     +      I   LR  + +   
Sbjct: 142  AAGGAVCIDNQDPGLSGFRPCRQDVFEFIKQCGDG----DDSRLAFIEAALRNHYRIRDS 197

Query: 768  KIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHT 827
            +    G + SFL  + + + E +S  VL S            +V  E  LL  T+ +  +
Sbjct: 198  RELGHGPL-SFLLNLTQRQKE-LSGHVLAS------------AVRYECPLLPITTGVCVS 243

Query: 828  GLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLC 887
                 +   V    A+  LL AP L DL+ W+ W  +F P+ GPL  ++       ELL 
Sbjct: 244  EPAVGLLGEVGRDQAMAALLSAPLLQDLHDWTQWQLVFLPNHGPLKDFIDKYCGKTELLA 303

Query: 888  LVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARH 947
            L    G ++R+        F +AA      +TA  L+ S A+A G  N P  LL  H   
Sbjct: 304  LEVSPGVLLRVTSETGDKHFSDAAQALDPLRTAGHLV-SMAVADGIPNTPTALLANHMES 362

Query: 948  AFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVA 1007
            A  V                         G    +++  D+            G +    
Sbjct: 363  ALNV-------------------------GVAQEEISPGDD------------GYSYSTV 385

Query: 1008 SRFFLDCLGYLPSEF-RSFAADVLLSGLQSTIKDAPS-AILLECHQTELRLM--LHEVGL 1063
            +RF L+C+  +P+   +     V L  L   +  A S A+LLE  QT  R +  LH++GL
Sbjct: 386  ARFVLECVTLIPTRICKELLQQVFLEPLSKVLGQAKSKAVLLEAAQTNTRHLNKLHQMGL 445

Query: 1064 SLGILEWIHDYH 1075
             LGI EW  D++
Sbjct: 446  LLGITEWRADFN 457


>gi|432926114|ref|XP_004080836.1| PREDICTED: uncharacterized protein LOC101171115 [Oryzias latipes]
          Length = 2547

 Score =  422 bits (1085), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 444/1663 (26%), Positives = 712/1663 (42%), Gaps = 244/1663 (14%)

Query: 526  EACKNNTMSEVLEKMFQFYKPT-DPKKAARYF--------KRMKLMFSSYPCIGLLNIAV 576
            E C+     E L+ + +  K   +P ++   F        K+   M ++   + +L+ A 
Sbjct: 286  EQCREAIEKETLDHLRKMKKSVFEPLQSTDSFSTINMDVRKKYASMSAAEVVLTVLSNAS 345

Query: 577  TSIKRGMWDSVYDTLQAITQQELNNT-----VSGNSLEYESIEIEP-------SEQVAVV 624
            T     M       L+A++  +L        + G SL     E+ P       +EQ   V
Sbjct: 346  TVFSPEMVKHCESFLKAVSDDQLAKALFQLAICGGSLAAPQ-ELVPQKKSSKNTEQSQAV 404

Query: 625  NTKHTVQDIQVEEVMKKVSKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIK 684
            +    V  +  E  +K      EF    L +  S VT     L  + + E  +T+ F+I 
Sbjct: 405  SDDSDVPALPCEATLK------EFLRESLRSQNSAVT-----LAHIAALERKMTKHFQIP 453

Query: 685  EFKSLGYGEFFTFLEKHASM-------LSTELQKLFTDDTIDRSSLEVSLVQHLL-VVLV 736
            +F SL  G F  FL K+  +        S ++  +    +     + V L+Q  L  +L 
Sbjct: 454  DFGSLEQGSFLEFLVKNIQIPPVLVLVRSPDIASVKLTPSPVCIPVTVQLLQESLGSLLF 513

Query: 737  SQASNNLWESEII-TKQMISELLRRQFPLISFKIEDKGSMESFLETVGKYRNE------- 788
             + +NNL        KQ + E +++         ++   +ES L      R+        
Sbjct: 514  LEDNNNLAGCGFRPCKQDVFEFIQQCGKTTPSDPDELSCVESALRAHYSVRDSRDLGYGS 573

Query: 789  --VMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGLRP--KMSESVTSKDAIE 844
               ++  V     + G  LS  L  +E   + + T+    +G  P  ++ E V+   A+ 
Sbjct: 574  LPTLAGLVQRQRELSGRGLS--LICYESALVSKHTNESPKSGSEPVGRLGE-VSKGQALA 630

Query: 845  ILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASV 904
             L   P L DL+ WS WD +F P  GPL  ++  +     L  L  R G +++I  +   
Sbjct: 631  SLQCCPLLEDLSDWSQWDCIFKPLHGPLKDFIERDAANTGLAALEVRPGLLLKITTNTGH 690

Query: 905  DSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAF-EVMFKNNVEDIEVI 963
              F  A        TA  L+S   +A G +N P  LL  H + A  E M K ++   E  
Sbjct: 691  KHFTSAVAALDPVGTAGHLVS-MMVADGIRNAPTALLANHMQSALAEAMAKEDLSQAEED 749

Query: 964  NSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEF- 1022
             S   R+                                     ++F LDCL  +P+   
Sbjct: 750  ISCYSRV-------------------------------------AKFLLDCLIRIPTRTC 772

Query: 1023 RSFAADVLLSGLQSTIKDAPSA-ILLECHQTELRLM--LHEVGLSLGILEWIHDY----- 1074
            ++    V L      +    S  +L+   Q++LR +  LH +G   G+ +WI DY     
Sbjct: 773  QALLQKVFLEPFSLVVGQTKSKQVLISTAQSDLRHLNCLHRLGTLSGVTDWIRDYQNKLR 832

Query: 1075 ----HAFC------------------------STGTSDL-------LMPCVVTCTNAATS 1099
                H  C                        S    DL       L P + T       
Sbjct: 833  PPQNHNLCKEHMEQVKCSWAVSNSSSLSALNVSEAKEDLTDCASTDLQPGLQTEEEEDEG 892

Query: 1100 GLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAA 1159
             +   + CA     +   A+    E G       GEA ++      ++T+     C   A
Sbjct: 893  EMCELASCANEKTSKRSDAE---GEAGDGLLNEAGEADEEKRAGFQSETVL----CFQRA 945

Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
             I+E IR+ EFG+G  +S     +L+    RLGR L  LS ELYS+D+HF+LEL+QNADD
Sbjct: 946  -IIEDIRKSEFGIGVELSAHGQRLLEVHQERLGRGLARLSTELYSKDTHFVLELIQNADD 1004

Query: 1220 NIYP--ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
            N Y     V P+L F++++  I +LNNE GF  +NIRA+CDVG STK     GYIG+KGI
Sbjct: 1005 NSYSLDPTVVPSLAFVVEQDCITILNNETGFQEKNIRAICDVGRSTKGKHRYGYIGQKGI 1064

Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFN-IDMFCRLLSKDPVQLES 1336
            GFKSVF+VTD PEIHSNGFH++FD S G +G++LP        +D   +L+S+       
Sbjct: 1065 GFKSVFKVTDCPEIHSNGFHLRFDRSCGPMGYILPHWTEEERPLDPQLKLISQ------- 1117

Query: 1337 KCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
              W T IRLP R+    G  +    ++F D+HPSLLLFLHRL+ I   N     LV + +
Sbjct: 1118 HSWTTKIRLPLRS----GSRLTQ--NLFHDVHPSLLLFLHRLRSITIYNHSEKRLVTMTR 1171

Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQ 1456
            K +   +++V   E    W V    L   +   DV++TE+ALA  L      +      +
Sbjct: 1172 KDLRKNVVEVEHSEGIDRWLVVKTTLYPQMGEQDVESTELALAFQLCSGTAEDLICQPQK 1231

Query: 1457 QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLP 1516
            QPVFA+LPLR++G +FI+QGDF +PSSRE+VD +S WNQWL SE P LF+ A   F D  
Sbjct: 1232 QPVFAYLPLRSFGFRFIIQGDFHIPSSREDVDRDSSWNQWLRSEVPQLFLQAMAVFDDHS 1291

Query: 1517 CFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCK 1574
             F     K    ++ F+PL  EV  FF+ +   I+  L+    L    ++    +  P +
Sbjct: 1292 EFM--GLKGLCPFLQFIPLPDEVLDFFNPVAGQIIHLLKDKAFLPTRNSDGGVVYKLPSQ 1349

Query: 1575 VLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLS--DSLARALGIEEYGPKILLQIISSL 1632
            V    +     ++    L+KHL L +L  D+  +   SL   LG+       +  + +++
Sbjct: 1350 VAICQDAVIRDIIGSDELEKHLSLSYLHPDLSPTPPTSLLTQLGVRYLQGSDVTTVTTAM 1409

Query: 1633 CR--TENG--LRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLS 1688
                 E G  L+   +  LA  L   +    H  G +         ++ NL+ +P IPL+
Sbjct: 1410 ANELVEAGHMLKEEKIRQLARLLVCNFRAVEHGFGDAE-------SILQNLKNLPIIPLA 1462

Query: 1689 DGTFSSVD-EGTIWLHSDC----SVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSL 1743
            +G   +++ EG I+L  +         G  G  A   L   +  V P+LLS        L
Sbjct: 1463 NGNVVTLNKEGVIFLMEETKAKKKKAQGQTG--AVSALYKDVSVVHPSLLSC-------L 1513

Query: 1744 GVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEY 1803
            G++    +  +L ++GV +L   ++++ +I P I      +  + ++  YL F+  H   
Sbjct: 1514 GLLESQQVRELLKRLGVHELEPQELLEQYIYPTIKSNNWKSKSEAVVVSYLVFIKQH--- 1570

Query: 1804 YCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWY 1863
                    +E+    +     VLT+ G + P+   +HF K + N   +N+        W 
Sbjct: 1571 ----SSSSQEYADIAVP----VLTSEGLQCPATDKVHFSKAYKN---INLRKKLPGCDWI 1619

Query: 1864 EVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM-WTKE 1922
             V   Y++          +  WR+    +G+ D    + + K+   ++     N  W+ E
Sbjct: 1620 LVSPRYVETDG------DVDGWRELLSRLGVRDG---LVIRKERRTLTAEELANSPWSAE 1670

Query: 1923 LLSPG-SAAID-------WESNELVHLLSLLTTNVNRQCSKHLLEILDTLWDDCYTDKIM 1974
              +   SAA D        E  E++    L   ++ RQ  + LLE+L+  WD   T  + 
Sbjct: 1671 SATWDLSAAQDCVLEDYTCEEFEILAKAKLPGKDLLRQ-RRTLLELLEANWD---TGHLY 1726

Query: 1975 GFFKSKPTGDD-----RSFQSSFINCICDIQWTISSMDDELHY-------PKDLFHDCDA 2022
              F +    D      R+ +SSF + +C ++W  +  + E          P  ++     
Sbjct: 1727 SQFLTAQVIDSSGRPIRTTKSSFHHLLCSLEWIPAHREQEGEQQEKEYLCPGSVYLSLPE 1786

Query: 2023 VRSILGPSAPYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
            V  +LG    Y+   +   +    +G++  V++D +   LK W
Sbjct: 1787 VTRLLGTHIFYV--DLNPSEFTRALGIRDTVSVDLLTHYLKEW 1827



 Score = 55.5 bits (132), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/202 (22%), Positives = 90/202 (44%), Gaps = 21/202 (10%)

Query: 187 VPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLI 246
           +P+L +L      IN F+     +  I +LY+L + +        +EEL LGP  + P++
Sbjct: 141 LPALRELQNLTREINTFVEATEALSCIFTLYELGQGLAALRNKTHYEELNLGPLCKVPVV 200

Query: 247 SHYFLV---KSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQ 303
              F +     D D+  +I T DV+  L  +  +++ + + + +F+  +A         +
Sbjct: 201 HRIFKIGHNTKDKDIP-QIRTVDVLKQLYIFRKSNEEQHLTLADFMKHLADHYDCEHPNK 259

Query: 304 LGVRIQNLGMHINFIREARKSQDVTLKK----FLKELQPHHKRRKRPIFSSEKKRQLDER 359
           LG+ I  L + ++ I +   +++  L++      KE   H ++ K+ +F           
Sbjct: 260 LGIVILGLHLPLSTIAKVTSTEEAVLEQCREAIEKETLDHLRKMKKSVFEP--------- 310

Query: 360 FSAICERVKSFSSINEDFGAKH 381
                +   SFS+IN D   K+
Sbjct: 311 ----LQSTDSFSTINMDVRKKY 328


>gi|224060363|ref|XP_002300162.1| predicted protein [Populus trichocarpa]
 gi|222847420|gb|EEE84967.1| predicted protein [Populus trichocarpa]
          Length = 570

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 271/570 (47%), Positives = 351/570 (61%), Gaps = 53/570 (9%)

Query: 1   MYGNHRPRFRPVGGVGGEGTGGGQPPSQPP-----PPH---FNPNFA-FQNPNIFFPNPA 51
           M+G HRP +RP         G GQPP + P     P     FNPNF   Q+PN F     
Sbjct: 1   MHG-HRPPYRP--------GGRGQPPQRHPLQQELPTELLTFNPNFVPLQDPNFFLHTFT 51

Query: 52  LQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQPPSSACNQQRSQTQHPPG 111
             LLQ   N P QNPNF IQNPN F+     +  +     + PP  A      Q Q    
Sbjct: 52  NALLQQ--NFPMQNPNFPIQNPNFFLPPQQQIQYHQQQHQEPPPPGAPPPPEQQEQQQEP 109

Query: 112 ASNQQRPLTQQPQSLPPQPQKLNKEF-LERIDRAVVKARSELIMAGESVSSWKVSRSVLL 170
           A+ QQ  L + PQ+   + ++ NKE  LERIDRAV KAR +L  A E+VS+WKVS+SVL+
Sbjct: 110 ATLQQN-LPKFPQNPKKKVKQNNKELQLERIDRAVEKARQDLSDAEENVSAWKVSQSVLV 168

Query: 171 KLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIE 230
             Q +SW SLGF+MQEVP+L +LMVTE++INAFIHCFVGVR+ITS+YDLE AICK+EGIE
Sbjct: 169 NFQAESWDSLGFKMQEVPALFRLMVTESKINAFIHCFVGVRRITSIYDLEVAICKNEGIE 228

Query: 231 KFEELELGPFLRQPLISHYFLVKSDVDV-VFKIATNDVIVCLSEYTDTHKAKDIKVDEFL 289
            FEEL LGP +R PL+ HYF +K D    VFKI + ++I+ LSE+ DT + K I VDEFL
Sbjct: 229 NFEELGLGPLMRHPLVLHYFSMKPDASAEVFKITSEEIILLLSEFMDTCQKKVIIVDEFL 288

Query: 290 DFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKK---------FLKELQPH- 339
            F++K   +   E LGVR+Q+LG HI+FIREA+ S++ T KK          LK+ Q   
Sbjct: 289 HFLSKNYPVKGPEMLGVRVQSLGTHISFIREAKASENSTQKKCRETLARNGSLKKCQEAR 348

Query: 340 ------HKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFV------SS 387
                   +R    FSSEK+R L+ERFSA+ ER+KSFS  N  F  KHIRFV        
Sbjct: 349 ASGPRVRSQRHEGRFSSEKER-LEERFSAVSERIKSFSQENYGFCGKHIRFVSSSSEDEK 407

Query: 388 SSEDEDSDDCTYECSNDISSNVQLPSQ-IKGSDRVSSCPYPSVTEELKRLGLKGEINHQL 446
           S + ++ D+ T   SN++ S+++  +Q I  SDRVSSCPYPS TEE+ RLGLKGE   Q 
Sbjct: 408 SDDGKNEDEMT---SNNVGSHLRSSAQAISNSDRVSSCPYPSATEEMSRLGLKGETGSQF 464

Query: 447 TSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQD-DEL 505
           +    SS   +   S  KKRK+E++    S P+K LR NK    A PI++ D+T++    
Sbjct: 465 SPDCGSSRPKESNRSFFKKRKLEDASWNVSVPSKLLRSNKT--HAHPIDNFDKTEEFVTP 522

Query: 506 NEADISFSNESMRMFITTWKEACKNNTMSE 535
           +E DIS S+  +  FITTWKEACK+ T++E
Sbjct: 523 SEDDISLSSNDLGAFITTWKEACKDYTVAE 552


>gi|410896246|ref|XP_003961610.1| PREDICTED: uncharacterized protein LOC101079642 [Takifugu rubripes]
          Length = 2475

 Score =  404 bits (1038), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 292/942 (30%), Positives = 456/942 (48%), Gaps = 100/942 (10%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            I+E IR+  FG+G  ++     +++ Q  R GR+L  L+ ELYS+D+HF+LEL+QNADDN
Sbjct: 799  IIEDIRKSAFGIGIELNAECQRLMQVQQERQGRSLDRLATELYSKDTHFVLELIQNADDN 858

Query: 1221 IYPE--NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
             YP    V P L F++++  I +LNNE GF  +NIRA+CDVG STK     GYIG+KGIG
Sbjct: 859  TYPSEAGVIPALAFVVEKDCITILNNEMGFQEKNIRAICDVGRSTKGKHKYGYIGQKGIG 918

Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
            FKSVF+++D PEIHSNGFH++FD + G +G++LP        D   R L      ++   
Sbjct: 919  FKSVFKISDCPEIHSNGFHLRFDKNSGPMGYILPHWA---EAD---RPLDSQLEDIKQHS 972

Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
            W T I LP R++       +   ++F D+HPSLLLFLHRL+ I   N     LV + +K 
Sbjct: 973  WTTKICLPLRSQ------SHQTRNLFHDVHPSLLLFLHRLRSITIYNRNEKRLVKMTRKD 1026

Query: 1399 VGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQ-- 1456
            + + +++V   +    W V  + L    I+ DV++TE+ALA  L  +N       ++Q  
Sbjct: 1027 LNNNVLEVEHTDGTERWLVVKKTLHPKKIKEDVESTELALAFQL--TNGDTESGTVWQPQ 1084

Query: 1457 -QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDL 1515
             QPVFAFLPLR++G +FI+QGDF +PSSRE+VD +SPWNQWL SE P LF+ A  ++ + 
Sbjct: 1085 KQPVFAFLPLRSFGFRFIVQGDFDIPSSREDVDRDSPWNQWLRSEIPHLFIQAMENYPEF 1144

Query: 1516 PCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPC 1573
                        +Y+ F+PL  EV  FF  +   I+  L+    L    ++ +  +  PC
Sbjct: 1145 SGL-----NGLYLYLHFIPLPDEVLDFFRPVAGQIIQLLKGKAFLPTINSDGKIVYKQPC 1199

Query: 1574 KVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLS--DSLARALGIEEY-GP---KILLQ 1627
            KV    +     ++    L++HL L +L   +  +   SL   LG+    GP    +   
Sbjct: 1200 KVAVCQDAVIRDVIGGDELERHLSLSYLHPRLSSAPLTSLLTHLGVRYLQGPDVSTVTTA 1259

Query: 1628 IISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPL 1687
            +   L RTE      GL  LA  L   +    H  G++         ++  L+ +P IPL
Sbjct: 1260 MARELTRTEGICTDGGLRQLARLLVCNFRALEHGYGETE-------SVLQTLRDLPIIPL 1312

Query: 1688 SDGTFSSVDEGTIWLHSDCSVFDGG--FGLEAFPNLCAKLRTVSPALLSASAVDKSSLGV 1745
            +DG   ++ E  ++   +            E+   L   +  V P+LLS        L  
Sbjct: 1313 ADGRVVALSEEGVFFPVETQTKKSKDLIQTESLAALYKDINVVKPSLLSC-------LEP 1365

Query: 1746 ISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYC 1805
            +    + ++L  +GV  L    ++  HI P I      +  ++++  YL F+  H     
Sbjct: 1366 LESQQVQKLLRMLGVHDLEPQQLLDQHIYPTIQSNKWKSKPESVVVSYLVFIKEHSMSSH 1425

Query: 1806 PNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGN-PVSVNMLIHDIDIKWYE 1864
               H               VLTN G   P+E  +HF  ++ N  +   +  HD    W  
Sbjct: 1426 DYVHA-----------AVPVLTNRGLLCPAEEKVHFSVDYDNIDLPTQLPGHD----WVM 1470

Query: 1865 VDITYLKHPAKESLSCGLVKWRKFFKEIGITDFV-----QVVQVDKDVADISHTGFKNMW 1919
            +   Y+K          +  WR+ F  +G+ D +     +V    K++A    +     W
Sbjct: 1471 LSSCYVKTDGD------VTGWRELFSRLGVRDGLIIRKERVTLTAKELATSPWSVESRSW 1524

Query: 1920 TKELLSPG-SAAIDWESNELVHLLSL--LTTNVNRQCSKHLLEILDTLWDDCYTDKIMGF 1976
             K   +PG S  +D    E  H L+   L  +V  Q    LLE+L   WD  +      F
Sbjct: 1525 NK---NPGESCELDDYPCEEFHALATAQLPGSVLLQQRLTLLEMLANNWDTGH--HYSQF 1579

Query: 1977 FKSKPTGDD----RSFQSSFINCICDIQWT-----ISSMDDELHY--PKDLFHDCDAVRS 2025
             +++    D    +S +SSF + +C ++W      +     E +Y  P  ++ +   VRS
Sbjct: 1580 LRAQVVNSDGREVKSTKSSFYHYLCTLKWVPAFRPVEGEQQERNYLCPTSVYLNSPEVRS 1639

Query: 2026 ILGPSAPY--IVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
            +LG    Y  + P   S      IG++  ++++ ++  LK W
Sbjct: 1640 LLGTHVCYADLTPSDFSRA----IGIRQNISVEAMINYLKEW 1677



 Score = 76.6 bits (187), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 96/201 (47%), Gaps = 7/201 (3%)

Query: 156 GESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITS 215
           GE V+  KV   V  ++Q+ S  + G      P+L  L      +N FI C   V  I +
Sbjct: 17  GELVTKEKVISRVCQQMQIPSLEAGGIYPGMTPALKDLQYKTREVNMFIECAEAVSSICT 76

Query: 216 LYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDV--DVVFKIATNDVIVCLSE 273
           LY+L  ++   +  +++EEL LGP  + P I   F + S+   D + +I T D++  L+ 
Sbjct: 77  LYELGHSLAGLKDKKQYEELNLGPLCKLPRIHRIFKIDSNTKDDDIQQIETVDILKLLNN 136

Query: 274 YTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHIN----FIREARKSQDVTL 329
           +    K + + + EF+  +A   +  S  +LG+RI ++ + I+     I + R  ++   
Sbjct: 137 FRKKSKGR-VDLTEFMQHLADHYNCDSPYKLGIRINSIALCISTLNKVIHQERAFREEAQ 195

Query: 330 KKFLKELQPHHKRRKRPIFSS 350
           +   +ELQ     R R I  S
Sbjct: 196 QLIERELQEETYERLRKIKKS 216


>gi|340371548|ref|XP_003384307.1| PREDICTED: hypothetical protein LOC100639095 [Amphimedon
            queenslandica]
          Length = 1639

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 334/1095 (30%), Positives = 509/1095 (46%), Gaps = 197/1095 (17%)

Query: 834  SESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDG 893
            S S+TS+DA++ L  AP L+DL  WS+W  +F PS G L  +L +  +V     L   DG
Sbjct: 629  SLSMTSQDALQYLKSAPLLADLALWSNWKLVFEPSCGSLSHFLSSHPDVH---TLCVDDG 685

Query: 894  KVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMF 953
              ++ID ++SV  F  A L  S  Q++  L+S    +G   N+P  LL  H +     + 
Sbjct: 686  VFLKIDPNSSVKDFANA-LYVSPSQSSGHLVSMVVRSGSTNNIPAQLLAQHVQTRLSGLL 744

Query: 954  KNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLD 1013
             + +   + +    C     ++C                     +    AV V    FLD
Sbjct: 745  SDTMPPEKFVF--EC-----LVC---------------------MPVELAVTVGKEVFLD 776

Query: 1014 CLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEWIHD 1073
             L  L     S   D LL  L++T  D P +          R  LH +GL LGI  WI D
Sbjct: 777  PLKKLHG---SSKVDSLL--LKAT-DDNPLS----------RSFLHTLGLKLGIKVWIDD 820

Query: 1074 YHAFCSTGTSDLLMPC------VVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECGA 1127
            Y A    G     +P       V+   N   + ++        S  ES   +       +
Sbjct: 821  YLA---RGKQPNTIPATEPDLPVLKPVNPDLNKMDFSEESVAESFLESSFNNQSSNPSTS 877

Query: 1128 ICDTICGEASD----DGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNM 1183
            + +    E S+    D L    T     ++E ++A  IVESIR +EFG+G        ++
Sbjct: 878  LNEP---EESNTIFTDVLPLSLTSPFSVEEERDNALKIVESIRSEEFGIGLEFDEKTKSV 934

Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-------ENVEPTLTFILQE 1236
            + K   RL R+L  LS+ELYS+D+HF+LELVQNADDN YP        +  P+L F L +
Sbjct: 935  MSKGKGRLDRSLQRLSEELYSKDTHFVLELVQNADDNSYPVLTCGSTSSFVPSLKFTLSK 994

Query: 1237 SGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
            S IVVLNNE GFS +NIRA+CDVG STK  +++GYIG+KGIGFKSVFRVT  PEIHS G+
Sbjct: 995  SCIVVLNNELGFSEKNIRAICDVGRSTKDRNASGYIGQKGIGFKSVFRVTKTPEIHSRGY 1054

Query: 1297 HVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA 1356
            H++FD+S   IG++LP  V     D        DP +L    W T I LPF++  S    
Sbjct: 1055 HIRFDSS-VPIGYILPEWVESSISDDII-----DPGELAK--WTTKIVLPFKSD-STNAG 1105

Query: 1357 MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWF 1416
             ++++  F D+ PSLLLFL+RL+CI+  ++ +  +  +++    +GII++   +    W 
Sbjct: 1106 SSDMMSRFHDIQPSLLLFLNRLRCIIIEDLESKVIRSMKRHDRSNGIIEIRHNDGVDKWL 1165

Query: 1417 VASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
            +  +                    TL E+N+      L  Q VFAFLPLR+YG KFI+QG
Sbjct: 1166 LVRK--------------------TLHEANKD-----LSPQQVFAFLPLRSYGFKFIVQG 1200

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASV--YMSFVP 1534
            DF LPSSRE+V+ NS WNQWL  E P+LF+ +      + CF  +     ++   + F+P
Sbjct: 1201 DFDLPSSREDVNANSAWNQWLREEIPSLFMKS------IDCFESHLTAIEAIIHILKFIP 1254

Query: 1535 LVGEVHGFFSGLPRMILSKLRMSNCL-------------------------------ILE 1563
            L GEVH FF  +   IL   +   CL                                L+
Sbjct: 1255 LEGEVHDFFQPVTSEILRIFKERKCLPTEPFPVHSSSPTDSPFDAMREITSIISEIRSLK 1314

Query: 1564 GNNN--QWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDI--VLSDSLARALGIEE 1619
            G++   QW  P ++L          +P  LL   L L +L+  +   LS  +   L I  
Sbjct: 1315 GSSAGIQWKQPSQLLLIKEQSIQDCIPQSLLTASLKLYYLNSALNNHLSSQICSQLNIGS 1374

Query: 1620 YGPKILLQIISSLCRTENGLRS---MGL-------------------------SWLASWL 1651
                 L+ +   +  + +  +    M L                          W+A WL
Sbjct: 1375 INIDHLMSVAQYVLSSYHQHKEEQFMSLQSSASSAEDESDEEEVAVHPVKCLNQWIALWL 1434

Query: 1652 NELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDG 1711
            + ++ +     G +      E  ++  L+  P IPLS+G   S+D G I+      V   
Sbjct: 1435 SCVHLLCEGVGGSTD-----EMSVLRKLKSFPVIPLSNGNIVSIDSGPIFF---PPVSSK 1486

Query: 1712 GFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKV 1771
               +   P +  +LRTV   L+ +   D+  + V      N +LL++GV+ L+  ++++ 
Sbjct: 1487 DVDISYRP-VYDELRTVHTDLIDS---DEKLVAV----KANDLLLRLGVRNLNITEVIEQ 1538

Query: 1772 HILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGF 1831
            HI+P    E T +  ++L+  YL F+   +E         ++  + EL+    ++T +G 
Sbjct: 1539 HIIPHFKKENTEDTSEHLLILYLSFIENAIE----RNQFSQKKAIDELKGHVVLITENGK 1594

Query: 1832 K-RPSEIPIHFGKEF 1845
            + +PS+  IHF  ++
Sbjct: 1595 RLKPSDEVIHFHSKY 1609



 Score = 75.5 bits (184), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 73/259 (28%), Positives = 120/259 (46%), Gaps = 20/259 (7%)

Query: 90  PPQQPPSSACNQQRSQTQHPPG--ASNQQRPLTQQPQSLPPQPQKLNKEFLERIDRAVVK 147
           PP  P SS          HPP     + Q P+  +   LPP P+    E ++ + R +  
Sbjct: 97  PPHFPDSSV---HPRDVHHPPSFVHKDHQIPVLDR---LPPPPEFKRGENVKDVVRKI-N 149

Query: 148 ARSELIMAGESVSSWKVSRSVL---LKLQVDSWSSLGF--QMQEVPSLHKLMVTEARINA 202
           + +E+     S     V  SV+   LKL +      G+   ++++           R   
Sbjct: 150 SYAEMFTGIISSRGKHVPLSVVEDELKLLLGKAKRHGYYINLRDIDFWVNWEKLHKRTIE 209

Query: 203 FIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKI 262
           FI  F     IT+LY+L+  + +SE +  F+EL++GP L+ P I   F V  D++ V +I
Sbjct: 210 FIKVFCWHSPITTLYELQRILNESENVSDFKELKMGPLLKHPDIIRLFKVPEDLETVPEI 269

Query: 263 ATNDVIVCLSEYTDTHKAKDIK--VDEFLDFIAKKRSLASKEQLGVRIQN--LGMHINFI 318
              DV   L  + D  K K  K  +++FLDF++   S+ S   + +RI +  L + INF 
Sbjct: 270 TAYDVHSKLFRFLDKSKNKKSKTQLEDFLDFLSHTFSIESPLHICIRITSYPLALQINF- 328

Query: 319 REARKSQDVTLKKFLKELQ 337
            + R+SQ    ++   EL+
Sbjct: 329 -KFRRSQREIKERIESELE 346


>gi|348501984|ref|XP_003438549.1| PREDICTED: hypothetical protein LOC100712247 [Oreochromis niloticus]
          Length = 2910

 Score =  398 bits (1022), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 281/935 (30%), Positives = 457/935 (48%), Gaps = 82/935 (8%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            ++E IR+ EFG+G  ++     +++    RLGR+L  LS ELYS+D+HF+LEL+QNADDN
Sbjct: 1233 VIEDIRKSEFGIGVKLTAEGQKLMQVHQERLGRSLDRLSTELYSKDTHFVLELIQNADDN 1292

Query: 1221 IYPE--NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
             YP    V P L F+++   I VLNNE GF  +NIRA+CDVG STK     GYIG+KGIG
Sbjct: 1293 SYPAEPGVVPALAFMVERDCITVLNNETGFQEKNIRAICDVGRSTKGKHKYGYIGQKGIG 1352

Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
            FKSVF+VTD PEIHSNGFH++FD + G +G++LP            R L      +    
Sbjct: 1353 FKSVFKVTDCPEIHSNGFHLRFDKNCGPMGYILPHWTED------TRPLDPQLKDINQHS 1406

Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
            W T I LP RT+       +   ++F D+HPSLLLFLHRL+ I   N      + + +K 
Sbjct: 1407 WTTKISLPLRTE------SHQTRNLFHDVHPSLLLFLHRLRSITIYNQSEKRFMTMTRKD 1460

Query: 1399 VGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEG-NYGPLLYQQ 1457
            +   +++V   E    W V    L+   I+ DV++TE+ALA  L   + G N      +Q
Sbjct: 1461 LNHNVLEVEHSEGTERWLVVKTTLKPKKIKEDVESTELALAFQLGSDSTGSNIVCQPQKQ 1520

Query: 1458 PVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPC 1517
            PVFA+LPLR++G +FI+QGDF +PSSRE+VD +S WNQWL SE   LF+ A   F + P 
Sbjct: 1521 PVFAYLPLRSFGFRFIIQGDFDIPSSREDVDRDSSWNQWLRSEIAQLFLQAMDVFTNHPE 1580

Query: 1518 FRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKV 1575
            F+    K    ++ F+PL  EV  FF  +   I+  L+  + L    ++ +  +  P +V
Sbjct: 1581 FKG--LKGLCQFLQFIPLSDEVLDFFKPVAGQIIQLLKGKSFLPTLSSDGKIVYKLPSQV 1638

Query: 1576 LRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLS--DSLARALGIEEYGPKILLQIISSLC 1633
                +     ++    L+KHL L +L  D+  +   SL   LG+       +  + +++ 
Sbjct: 1639 AVCQDPVIREVIGGGELEKHLSLSYLHPDLSPAPPTSLLTHLGVRYLRGTDVTIVTTAMA 1698

Query: 1634 R---TENGLRS-MGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSD 1689
            +   T  G+ S  GL  LA  L   +    H  G++         ++  L+ +P IPL+D
Sbjct: 1699 KELMTVGGIHSDAGLHQLARLLVCNFRALEHGYGEAD-------SILQTLRELPIIPLAD 1751

Query: 1690 GTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVS---PALLSASAVDKSSLGVI 1746
            G   ++    ++   + S             L A  + VS   P+LLS        L  +
Sbjct: 1752 GRAVALSVEGVFFPMEESKTKKKKAQAQTGPLAALYKDVSVVHPSLLSC-------LEPL 1804

Query: 1747 SVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCP 1806
                +  +L ++GV +L   ++++ HI P I +    +    ++  YL F+  H      
Sbjct: 1805 ESQQVRELLRRLGVHELEPQELLEQHIYPTIRNNKWKSKPTPVVVSYLVFIKQH------ 1858

Query: 1807 NCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVD 1866
                 +E+  + +     VLT+ G   P+   +HF +E+GN   VN+        W  + 
Sbjct: 1859 -SSSSQEYSDTAVP----VLTSRGLLCPANERVHFSEEYGN---VNLPTKLPGYDWILLS 1910

Query: 1867 ITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLS- 1925
              Y++          +  WR+ F ++GI D + +++  +     +     + W+ E  + 
Sbjct: 1911 PCYVQ------TDGDVAGWRELFSKLGIKDGL-IIRKQRQTLTATELA-SSPWSVESATW 1962

Query: 1926 ---PGSAAI--DWESNELVHLLSL-LTTNVNRQCSKHLLEILDTLWD--DCYTDKIMGFF 1977
               PG   +  D+   E   L +  L  ++  Q    LLE+L+  W+   CY+  +    
Sbjct: 1963 HQVPGEGYVLDDYPCEEFHALATAELPGSLLLQQRMALLELLEANWETGHCYSQYLTAQV 2022

Query: 1978 KSKPTGDDRSFQSSFINCICDIQWTIS-------SMDDELHYPKDLFHDCDAVRSILGPS 2030
                    +S +SSF + +C ++W  +        ++ +   P  ++     V ++LG  
Sbjct: 2023 IDSDGQPIKSTKSSFYHSLCRLEWVSAYCPQQGGQLERKYLCPSSVYLSSPEVINLLGTH 2082

Query: 2031 APYIVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
              Y+   + S      +G++  +++D ++  LK W
Sbjct: 2083 VYYV--NISSSDFSRALGMRETISVDVLINYLKEW 2115



 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 117/512 (22%), Positives = 184/512 (35%), Gaps = 121/512 (23%)

Query: 603  VSGNSLEYESIEIE-------------PSEQVAVVNTKHTVQDIQVEEVMKKVSKYLEFD 649
            VSGN LE    ++              P ++ +    K   +D   + +  K+ +YL   
Sbjct: 669  VSGNRLETALFQLAICGGSLAVPHDLVPKDKTSKPTEKTKPEDKSTKSLPTKLKQYLRDK 728

Query: 650  NSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHASMLSTEL 709
             +  N+A        I L  + S E  L E F +K+F SL  G F  FL  +  +L    
Sbjct: 729  LAAQNSA--------ITLAHIASLEKKLPEHFHVKDFLSLEKGSFLEFLVNNIQLL---- 776

Query: 710  QKLFTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQFPLISFKI 769
            Q       I  SS+E             +A +    S    KQ + E +++   + S   
Sbjct: 777  QDTVGSTLILGSSME-------------RAGSGFRPS----KQDVFEFIKQCGDVASTDP 819

Query: 770  EDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAITHTGL 829
            ++   +ES L +    R+               +R  G   +H    L++   A+   G+
Sbjct: 820  DELSYIESALRSHYGVRD---------------SRDLGHGPLHRLADLVQRQKALGGGGM 864

Query: 830  RPKMSESV----------------------TSKD-AIEILLRAPFLSDLNSWSHWDFLFA 866
             P   ES                        SKD A+  LL  P L DL+ WS W+ +F 
Sbjct: 865  SPICYESALFAKQGKSSAESGFESVGRLGEMSKDQALASLLSCPLLEDLSEWSQWELIFK 924

Query: 867  PSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSS 926
            P  G L  ++        L  L    G + +I        F  A        TA  L+ S
Sbjct: 925  PLHGSLKEFIEKNAGNTGLAALEVTPGVLFKITTDTGDKHFSSATTALDPVGTAGHLV-S 983

Query: 927  FALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANI 986
              +A G  N P+ LL  H + +         ED+ +                   DV+  
Sbjct: 984  MVVADGIVNAPIALLSNHMQSSLAAAVAK--EDLSLAEE----------------DVSCY 1025

Query: 987  DNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEF-RSFAADVLLSGLQSTIKDAPSA- 1044
            D +                  ++F L+CL  +P+   ++    V L      +  A S  
Sbjct: 1026 DKV------------------AKFLLECLTRIPTRTCQALLLQVFLEPFSLVLGQAKSKR 1067

Query: 1045 ILLECHQTELRLM--LHEVGLSLGILEWIHDY 1074
            +L+   Q+E R M  LH +G+ LGI +WI DY
Sbjct: 1068 VLMTVAQSEPRYMNCLHRLGILLGIPDWIKDY 1099



 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 59/226 (26%), Positives = 108/226 (47%), Gaps = 11/226 (4%)

Query: 151 ELIMAGESVSSWKV-SRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVG 209
           E+   GE V++ KV SR   L   +D  S     ++ +  L  L+     IN FI     
Sbjct: 408 EIAGEGELVTTKKVISRVCRLVPSLDPRSVEHLPLKALKDLQYLI---REINMFIETTSA 464

Query: 210 VRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDV--DVVFKIATNDV 267
           V  I +LY+L +++   +  +++EEL LGP  + P I   F + S+   D + +I T D+
Sbjct: 465 VTSICTLYELGQSLAGLKDKKRYEELNLGPLCKLPFIHRMFKIDSNTKDDDIHQIETVDI 524

Query: 268 IVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDV 327
           +  L  +        + + EF+  +A   +  S  +LG+RI  LG+ I+ + +  +++  
Sbjct: 525 LKQLRVFRRKQSKVKVDLAEFMKHLADHYNCDSPYELGIRITGLGLPISTLSKVGRNEYT 584

Query: 328 TLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSI 373
            L+   + +Q   K  +  I+  E+ R+L +      +   SFSS+
Sbjct: 585 ILECAREVIQ---KELEEEIY--ERLRKLKKSILEPLQGAASFSSV 625


>gi|449666320|ref|XP_002160396.2| PREDICTED: uncharacterized protein LOC100210255, partial [Hydra
            magnipapillata]
          Length = 2259

 Score =  392 bits (1006), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 349/1348 (25%), Positives = 591/1348 (43%), Gaps = 257/1348 (19%)

Query: 834  SESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKEL--LCLVTR 891
            S+++ S  A+  +   P+L D++   +WD +F   LG L  ++ ++ N K++    L  +
Sbjct: 288  SDNILSMLAVSKINALPYLEDVSKLMNWDEIFQRKLGSLKEFVKSKFNSKDVCYFFLELQ 347

Query: 892  DGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEV 951
             GK +++    S++SF EA +   +   ++ L S                          
Sbjct: 348  FGKFVKVLKDCSIESFTEAFVARDAKLVSICLTS-------------------------- 381

Query: 952  MFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFF 1011
                                  ++C  +  + A +  LS  +QK +L   +    + + F
Sbjct: 382  ----------------------IICSNRRLEHAPLALLSNVIQKNMLTSSETSEFSEKHF 419

Query: 1012 LDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAILLEC------HQTELRLMLHEVGLSL 1065
            L  L  L + + +               + PS +L  C           R  LH +G  L
Sbjct: 420  LMFLTPLSTIYGNL--------------NVPSVLLKACSDCCRSRSDCCRSRLHALGFQL 465

Query: 1066 GILEWIHDYHAFCST----------------GTSDLLMPC-------VVTCTNAATSGLN 1102
             I+EWI +Y    S                 G   +++ C       V+ C     +   
Sbjct: 466  SIVEWIENYKDEFSVLKGLKSIKSDGIGLWWGYQIVILYCCNIKPLFVIICYQKNINEFM 525

Query: 1103 SG---------SGCAEGSLFESVGADVHIE-ECGAICDTICGEASDDGLGDCTTQTLPED 1152
            S              E + F+ +G + +IE +      T+  E  +  +    + T PE 
Sbjct: 526  SKRQQQTEENEKEAIEKTKFQVIGENKNIEIKHDESISTLVNEKEELIM---ESSTFPEQ 582

Query: 1153 KECED-----------AALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQE 1201
             + ++              ++ +IRRD+FG+G      E  +++    R GR+L  LS E
Sbjct: 583  GKVKEFFGSSSDQNNHCREVINNIRRDQFGVGIEFGEQEMKLIEVHREREGRSLQHLSNE 642

Query: 1202 LYSQDSHFLLELVQNADDNIYPE-----NVEPTLTFILQESGIVVLNNEQGFSAENIRAL 1256
            LYS+DSHF+LEL+QNADDN YPE     N +P + FI++   I +LNNE+GFS  +I+AL
Sbjct: 643  LYSKDSHFVLELIQNADDNQYPETFFEKNEKPAVAFIVEHDKITILNNEKGFSERDIKAL 702

Query: 1257 CDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVP 1316
            CDVG STK     GYIG+KGIGFKSVFRVT++PEIHSNG+H+KFD S G  G++LP  + 
Sbjct: 703  CDVGKSTKGIHCKGYIGQKGIGFKSVFRVTNSPEIHSNGYHIKFDASHGSNGYILPYWID 762

Query: 1317 PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLH 1376
             +      +LL + P + ++  W T I LP +      +    +   F D+HPSLLLFL+
Sbjct: 763  CWESQ--SKLLYQ-PSKKQNYTWITQIVLPLK----RDMQHTKLTSRFHDIHPSLLLFLN 815

Query: 1377 RLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEI 1436
            RL  I+  + +++ L VI ++ + + I+++S   D   WFV  ++L   V          
Sbjct: 816  RLHTIIIDDKVSNVLKVIERENLDENIVRLSNNNDSALWFVVGKELNVDV---------- 865

Query: 1437 ALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQW 1496
                             L  QPV+A+LPLR+YG KFI+Q DF +PSSRE+++ +S WNQ 
Sbjct: 866  -----------------LPNQPVYAYLPLRSYGFKFIIQADFEVPSSREDINKDSAWNQC 908

Query: 1497 LLSEFPALFVSAERSFCDLPCFRENP----AKAASVYMSFVPLVGEVHGFFSGLPRMILS 1552
            +L   P L V A        CF+++           Y+ F+ L   V G F  +   I+ 
Sbjct: 909  ILQHIPKLVVEA------FHCFKKHHHFTGLDGVIKYLKFMDLEDHVLGVFKQVSHQIIK 962

Query: 1553 KLRMSNCL-ILEGNNN-------QWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKD 1604
             L+   CL  +  + N        W  P  ++    +    ++   +LQ+ LGL +L+KD
Sbjct: 963  SLQREACLPAMSKDCNAYPQMPITWVVPSCIINP-QENVLKVISQEMLQEKLGLYYLNKD 1021

Query: 1605 I--VLSDSLARALGIEEYGPKILLQI-------ISSLCRTENGLRSMGLSWLASWLNELY 1655
            +  V+S  +  +LG+ E   ++LL+I        +SL + E   ++    W+  WL+ +Y
Sbjct: 1022 VNTVISKRMLISLGVNEMTLELLLEICRLIVHQFNSLPQNEQDCKAF-YQWIGQWLHTVY 1080

Query: 1656 TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGL 1715
                    +   Q        + L+ +   P+SD +  S++   ++           F +
Sbjct: 1081 L-------RLQDQCDCSEKTFNILRSLYIYPMSDRSLKSLNGHVVF-----------FPV 1122

Query: 1716 EAFPNLCAKLRTVSPA------LLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIV 1769
                 +    ++V P       LL  S    S L  I    L   L  +GV++++  DIV
Sbjct: 1123 SDNNKIIFNKKSVQPIHLIEQDLLILSHDFMSCLSGIEQQQLKNFLCDLGVKEITPLDIV 1182

Query: 1770 KVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNH 1829
            +  I+P           +NL+  Y+ +  +H +Y   +    + F + E+R+   +LTN 
Sbjct: 1183 QNCIIPTFKSGKWKEKSENLVVAYVNY--LHEQYMLDS----QIFDILEIRQCLMLLTNK 1236

Query: 1830 GFKRP--SEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSC----GLV 1883
            GF+ P  S  PI+FG+E+ N + +       D  W  +   Y        LSC      V
Sbjct: 1237 GFQSPFDSTNPIYFGREYNNDIDLQDSFPSAD--WTLISSCY--------LSCLSDIEKV 1286

Query: 1884 KWRKFFKEIGITDFVQVVQVDK--DVADISHTGFKNMWTK--ELLSPGSAAIDWESNELV 1939
              ++F   + +  F+ V ++ K   V+D+  + ++  +    + +       D E  E  
Sbjct: 1287 SLKRFLSSLSVHTFMVVRKITKTLHVSDLIASCWEKYFETWPQTIDDTYIIDDLECPEFN 1346

Query: 1940 HLLSLLTTNVNRQCSKHLLEILDTL--WDDCYTDKIMGFFKSKP-----TGDDRSFQSSF 1992
             ++            K+  EI D L  +D+ ++ +   F   +P             S+F
Sbjct: 1347 QVVD----------GKNYREIYDMLEEFDERWSSEFSKFNFIRPGCLVKNTSGEVVSSTF 1396

Query: 1993 INCICDIQ---WTISSMDDE-LHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIG 2048
             + +  ++   W  ++ D+  LH P+DLF   D + S+LG    Y+  +VKS+  +  +G
Sbjct: 1397 CSFLLKMRSSAWLANNEDENSLHCPRDLFLRNDRLFSLLGNKVNYLTNRVKSKNFILTLG 1456

Query: 2049 LKTEVTIDDILEILKVWTRLEAPFMARY 2076
            ++TEV           W R +  F   Y
Sbjct: 1457 VQTEVNFKRFANEFMKW-REQTVFTTNY 1483


>gi|326672130|ref|XP_003199604.1| PREDICTED: hypothetical protein LOC100151305 [Danio rerio]
          Length = 868

 Score =  362 bits (929), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 217/561 (38%), Positives = 316/561 (56%), Gaps = 46/561 (8%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            ++E IR+ EFG+G  ++     +++KQ ARLGR+L  LS ELYS+D+HF+LEL+QNADDN
Sbjct: 321  VIEDIRKTEFGIGVELNQEGQILMEKQQARLGRSLQRLSDELYSKDTHFVLELIQNADDN 380

Query: 1221 IYPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
             YP+  E P LTF++++S I +LNNE GF  +NIRA+CDVG STK     GYIG+KGIGF
Sbjct: 381  SYPDGGEQPALTFVVEKSCITILNNECGFEEKNIRAICDVGRSTKGKHKYGYIGQKGIGF 440

Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
            KSVF+VTD PEIHSNGFH++FD + G +G++LP  +         R ++    ++  K W
Sbjct: 441  KSVFKVTDCPEIHSNGFHIQFDKTSGPMGYILPHWLEQE------RPVNPAAKEIAEKRW 494

Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
             T I LP RT   E     N   +F D+HPSLLLFLHRL+ I   N     LV + ++ +
Sbjct: 495  TTKIHLPLRT---ENYQTKN---LFHDVHPSLLLFLHRLRSITIFNEAEKRLVSMTRRDL 548

Query: 1400 GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
               I++VS    +  W V  + L    I+ DV++TE+ALA  +  S+  +  P   +QPV
Sbjct: 549  SHNILEVSHTGGEERWLVLKKMLFPKKIKDDVESTELALAFQIHSSSCADVKP--QKQPV 606

Query: 1460 FAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFR 1519
            FAFLPLR++G +FI+QGDF +PSSRE+VD +S WNQWL SE P LFV A   F   P F 
Sbjct: 607  FAFLPLRSFGFRFIIQGDFDIPSSREDVDRDSSWNQWLRSEIPQLFVQAMDVFSKHPEF- 665

Query: 1520 ENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNN---QWAPPCKVL 1576
             +       ++ F+P   E+  FF+ +   I+  LR   CL  + ++    ++  P ++ 
Sbjct: 666  -SGLSGLCYFLQFIPQPSEILDFFNPVANQIIQLLRGRACLPAQEDHEGRVEFRQPSQL- 723

Query: 1577 RGWNDRAHSLLPDILLQKHLGLGFLSKDIVL-----------SDSLARALGIEEYGPKIL 1625
                    ++ PD L+Q+ +G   LS+ + L           S SL  ALG+       +
Sbjct: 724  --------AVCPDRLIQQVIGGQDLSRSLSLWYLHPDLQCSVSGSLLSALGVHRLQAAEI 775

Query: 1626 LQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETD-LIDNLQRIPF 1684
            L +   + R                L  L    F      +L+S  E D L+ +++ +P 
Sbjct: 776  LTVTCCMARELLQHTQTHTDGELEKLARLLVCIFR-----ALESEYEVDALLQSVRDVPM 830

Query: 1685 IPLSDGTFSSVDEGTIWLHSD 1705
            IPL+DG+  S+    ++   D
Sbjct: 831  IPLADGSVVSLSSEGVFFPLD 851



 Score = 79.7 bits (195), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/234 (28%), Positives = 95/234 (40%), Gaps = 42/234 (17%)

Query: 846  LLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKELLCLVTRDGKVIRIDHSASVD 905
            LL AP L DL  W+ W  +F P+ GPL  ++       ELL L    G ++R+       
Sbjct: 4    LLSAPLLQDLQDWTQWQLVFLPNHGPLKDFIDKYCGKTELLALEVSPGVLLRVTSETGDK 63

Query: 906  SFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINS 965
             F  AA      +TA  L+ S A+A G  N P  LL  H   A  V              
Sbjct: 64   HFSVAAQALDPLRTAGHLV-SMAVADGIPNTPTALLANHMESALNV-------------- 108

Query: 966  QNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEF-RS 1024
                       G    +++  D+            G +    +RF L+C+  +P    + 
Sbjct: 109  -----------GVAQEEISPGDD------------GYSYSTVARFVLECVTLIPIRICKE 145

Query: 1025 FAADVLLSGLQSTIKDAPS-AILLECHQTELRLM--LHEVGLSLGILEWIHDYH 1075
                V L  L   +  A S A+LLE  QT  R +  LH++GL LGI EW  D++
Sbjct: 146  LLQQVFLEPLSKVLGQAKSKAVLLEAAQTNTRHLNKLHQMGLLLGITEWRADFN 199


>gi|449686810|ref|XP_002155553.2| PREDICTED: uncharacterized protein LOC100210004, partial [Hydra
            magnipapillata]
          Length = 797

 Score =  360 bits (923), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 242/740 (32%), Positives = 384/740 (51%), Gaps = 71/740 (9%)

Query: 1138 DDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHC 1197
            ++ L +C T  +  D  C     I+  IR  +FG+G  +   E  + +    R GR+L  
Sbjct: 101  EEVLNECVTSDIQNDY-CHK---IINDIRCKQFGIGLELGEQEKKLFEILREREGRSLQR 156

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENV---EPTLTFILQESGIVVLNNEQGFSAENIR 1254
            LS ELYS DSHF+LEL+QNADDN Y EN    +PT+ FI++   I++LNNE+GFS  +I+
Sbjct: 157  LSNELYSNDSHFVLELIQNADDNQYIENSNNEKPTVAFIVEHDQIIILNNEKGFSESDIK 216

Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
            ALCDVG STK     GYIG+KGIGFKSVFRVTD PEIHSNG+H+KFD S G  G++LP  
Sbjct: 217  ALCDVGKSTKGIHRKGYIGQKGIGFKSVFRVTDMPEIHSNGYHIKFDASSGSNGYILPNW 276

Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
            +  +N D   +  +  P +  +  W T I LP + +         ++  F D+H SLLLF
Sbjct: 277  IYDWNSD---QRSTYQPCESRNYGWKTQIVLPLKKEMQHS----KLISRFHDIHSSLLLF 329

Query: 1375 LHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
            L+RL  I+  + +++ + V+ ++ V   I+++S       WFV  +++    IR D+K+T
Sbjct: 330  LNRLHTIIIDDKVSNIMKVMEREHVEKNIVRLSNNNYSRLWFVVEKEIPID-IREDIKST 388

Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWN 1494
             IA A       +    P    Q VFAFLPLR+YG KFI+Q DF +PSSRE++D +S WN
Sbjct: 389  NIAFAFPFNNKLKAEALP---AQQVFAFLPLRSYGFKFIIQADFEVPSSREDIDKDSAWN 445

Query: 1495 QWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKL 1554
            Q++L + P L V A  SF     F+    +    Y+ F+ L   V G F  + + ++  L
Sbjct: 446  QYILQQLPKLTVEAFYSFKVHQYFK--GLEGVIEYLKFMDLEDNVSGVFKQVSQQMIKSL 503

Query: 1555 RMSNCLILEGNNNQ----WAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS 1610
            +   CL +   N +    W  P  ++    +    + P+I LQ+ LGL +L+ DI  + +
Sbjct: 504  QREACLPVIPRNIENILNWVIPKHIVFSQQEIYKVISPEI-LQEKLGLYYLNNDIDCT-A 561

Query: 1611 LAR---ALGIEEYG-------PKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFH 1660
            L R   +LG+ E          K+++Q  SS  +       M   W+  WL+ +YT    
Sbjct: 562  LKRVLTSLGVCELNFEQLIEVCKLVVQEFSSSSQQIKNPDKMQ-QWIGKWLHLVYT---- 616

Query: 1661 SSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPN 1720
             S Q+S     +T   D ++ +   P+++G+  S+ E            D  F +   P 
Sbjct: 617  -SLQNSCDCSKKT--FDIIRSLHIFPMTNGSMMSLKEN-----------DAFFPVNYIPK 662

Query: 1721 LCAK------LRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHIL 1774
               K      L  +   L   S+    SL  I  + + +++  +GV+ ++  D++  HI+
Sbjct: 663  HGEKKEDKRMLDLIENDLNILSSNLMESLDHIEQEQVKKLMCDLGVKYITPSDLIHNHII 722

Query: 1775 PAISDETTANGDKNLMADYLCFVMMHLEYYC-PNCHVEREFIVSELRKKAFVLTNHGFKR 1833
            P+       +  + L+  Y+  V +  +Y   P+      F ++E+ +   ++TN GF+ 
Sbjct: 723  PSFKSGLWKSKPEKLVVAYV--VYLEGQYRAQPDM-----FDIAEVSQSVMLMTNKGFRS 775

Query: 1834 P--SEIPIHFGKEFGNPVSV 1851
            P     PI+   ++GN +++
Sbjct: 776  PLDQSNPIYLTPQYGNEINL 795


>gi|147783924|emb|CAN74689.1| hypothetical protein VITISV_001349 [Vitis vinifera]
          Length = 566

 Score =  337 bits (865), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 188/362 (51%), Positives = 247/362 (68%), Gaps = 14/362 (3%)

Query: 222 AICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAK 281
           AICK+EG+E+FE+LELGP +R PLI HYF + SD   VFKI + ++I CL E+ +  K K
Sbjct: 2   AICKNEGVEQFEDLELGPLVRHPLIMHYFSISSDASGVFKITSAEIISCLDEFMEACKDK 61

Query: 282 DIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQ--PH 339
            I ++EFL++IAKKRSL  +E+LGVRIQ+LGMHI+FIREARK +  TLKK    L+  P 
Sbjct: 62  HIIIEEFLEYIAKKRSLTGRERLGVRIQSLGMHISFIREARKLEHXTLKKSQGSLKQIPD 121

Query: 340 HKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTY 399
            K R+ P+ SSEKK+ LDERFSA+ +RVKSF+S ++DFG KH  FVSS SE++ SDD  Y
Sbjct: 122 KKIREHPLRSSEKKK-LDERFSAMSQRVKSFASAHDDFGGKHTIFVSSCSEEDGSDDHKY 180

Query: 400 ECSN---DISSNVQLPS-QIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQ 455
           E S    D  SN +  S   K  DRVSSCPYPS  EE+ RLGLKGE     +++G SS  
Sbjct: 181 EESEEDIDGCSNSKFSSPNSKTRDRVSSCPYPSAIEEMTRLGLKGETEGNPSASG-SSMH 239

Query: 456 DDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQD-DELNEADISFSN 514
            +  G  K+KRK  N  CT S   K  +RNK +   L ++  ++ ++ + LNEAD   +N
Sbjct: 240 SENTGPFKRKRKSSNRSCTVSKYLKLPKRNKLELVPLSVDHXNEKKELNNLNEADFLLAN 299

Query: 515 ESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNI 574
           +SMRMFITTWKEAC  +T++EVLE+M QF+     ++     K MK M SSYP +GLLN+
Sbjct: 300 DSMRMFITTWKEACXEHTIAEVLERMLQFHGTQTKQR-----KIMKSMLSSYPFVGLLNV 354

Query: 575 AV 576
           A+
Sbjct: 355 AI 356


>gi|4455303|emb|CAB36838.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268078|emb|CAB78416.1| hypothetical protein [Arabidopsis thaliana]
          Length = 605

 Score =  329 bits (843), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 224/599 (37%), Positives = 319/599 (53%), Gaps = 96/599 (16%)

Query: 26  PSQPPPPHFNPNFAFQN-PNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLT 84
           P + P P F  NF   N P+    NP  Q  QNL  +  Q P+      N F+QN N   
Sbjct: 50  PIRYPIPQFPANFYRPNFPDFSLGNPNFQPHQNLNFLHQQIPHQYGSAANHFLQNHN--- 106

Query: 85  QNP-NLPPQQPPSS--ACNQQRSQTQHPPGASNQQRPLTQQ----PQSLPPQPQKLNKEF 137
           QN  + PPQ  P++  + +Q     ++      +Q  + Q     P+S     +  N  F
Sbjct: 107 QNSFSFPPQSIPNNDISISQNHGAFENSSLKRRRQEEVVQVTDVVPKSNFASGESANNSF 166

Query: 138 --------------------------------LERIDRAVVKARSELIMAGESVSSWKVS 165
                                           + RID+AV K R   + AGESVSS +VS
Sbjct: 167 SVSLPIPIATDDSGVSRVHGEKSSGKPKRKVDVLRIDKAVNKTRKLFVAAGESVSSTRVS 226

Query: 166 RSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICK 225
           R+VL +LQ DSW SLG QMQ+VPSL +LM  E +INAFIHCFVG R+I +L+DLE AIC+
Sbjct: 227 RAVLEELQADSWRSLGVQMQDVPSLRQLMAIEGKINAFIHCFVGARRIVTLHDLEVAICR 286

Query: 226 SEGIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKV 285
           +E ++ F++LELGP L+ PL+  YF   S      KI + ++I  L  Y  T+  +D+K+
Sbjct: 287 NEFVDSFDDLELGPLLQHPLVLLYFPSISSSTGPVKITSEEIISFLDSYLHTYMTEDVKL 346

Query: 286 DEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRKR 345
           DEFL+F+A ++S+ SKE+LGVRIQ+L M+++FI +A++ +  TLK  L EL   H++   
Sbjct: 347 DEFLNFVASQKSVTSKEKLGVRIQSLRMYVSFILDAKRQEGETLKVLLTEL---HQKYHI 403

Query: 346 PIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSNDI 405
           P   S KK+Q D+  + + ER  SF+  ++D+  KHIRF SSSS   D +D  YE  N  
Sbjct: 404 P---SSKKQQRDKSLT-VSERADSFALHHKDYCGKHIRFDSSSS---DENDNVYEVRN-- 454

Query: 406 SSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKK 465
                    +  SD ++SCPYPSV EE+KRL                       G S KK
Sbjct: 455 ---------LNSSDHINSCPYPSVAEEMKRL-----------------------GGSNKK 482

Query: 466 RKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQTQDD-------ELNEADISFSNESMR 518
           RK E      S  +K LR++ +K +    +   +  DD        ++EAD + S   +R
Sbjct: 483 RKGERRNHEKSDSSKLLRKSPSKLQGHAKQEIPKLADDSEAKKVFSVDEADFTLSEGDLR 542

Query: 519 MFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVT 577
           +FI+TWK+ CK  ++S  +EKM  FY     +  A+  KR K M SS+P +GLLN+AV+
Sbjct: 543 LFISTWKDTCKELSISTFVEKMLSFYNLGGSEGRAQ-IKRAKAM-SSFPFVGLLNVAVS 599


>gi|449679290|ref|XP_002168168.2| PREDICTED: uncharacterized protein LOC100211694, partial [Hydra
            magnipapillata]
          Length = 366

 Score =  317 bits (811), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 231/369 (62%), Gaps = 14/369 (3%)

Query: 1149 LPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSH 1208
            L  D + +    I+  IRR++FG+G  +   E  +++    R GR+L  LS ELYS+DSH
Sbjct: 9    LTSDVQNDHCHKIINDIRREQFGIGLELGEQEKKLIEIHREREGRSLQRLSNELYSKDSH 68

Query: 1209 FLLELVQNADDNIYPENV---EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
            F+LEL+QNADDN Y EN    +PT+ FI++   I++LNNE+GFS  NI+ALCDVG STK 
Sbjct: 69   FVLELIQNADDNQYIENSNNEKPTVAFIVEHDQIIILNNEKGFSESNIKALCDVGKSTKG 128

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
                GYIG+KGIGFKSVFRVTD PEIHSNG+H+KFD S G  G++LP  +  +N D    
Sbjct: 129  IHRKGYIGQKGIGFKSVFRVTDMPEIHSNGYHIKFDASSGSNGYILPNWIYDWNSDQRS- 187

Query: 1326 LLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN 1385
              +  P +  +  W T I LP + +    +  + ++  F D+H SLLLFL+RL  I+  +
Sbjct: 188  --TYQPCESRNYGWKTQIVLPLKKE----MQHSKLISRFHDIHSSLLLFLNRLHTIIIDD 241

Query: 1386 MLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQES 1445
             +++ + V+ ++ V   I+++S   D   WFV  +++   V R DVK+T IA A      
Sbjct: 242  KVSNIMKVMEREHVEKNIVRLSNNNDSRLWFVVEKEIPIDV-REDVKSTNIAFAFPFNNK 300

Query: 1446 NEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALF 1505
             +    P    Q VFAFLPLR+YG KFI+Q DF +PSSRE+++ +S WNQ++L + P L 
Sbjct: 301  LKTEALPA---QQVFAFLPLRSYGFKFIIQADFEVPSSREDINKDSAWNQYILQQLPKLT 357

Query: 1506 VSAERSFCD 1514
            V A  SF D
Sbjct: 358  VEAFHSFKD 366


>gi|297804942|ref|XP_002870355.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316191|gb|EFH46614.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 608

 Score =  309 bits (791), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 191/445 (42%), Positives = 270/445 (60%), Gaps = 53/445 (11%)

Query: 140 RIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEAR 199
           RID+AV K R  ++ AGESVSS +VSR+VL +LQ DSW SLG QMQ+VPSL +LM  E +
Sbjct: 204 RIDKAVNKTRKSVVAAGESVSSTRVSRAVLEELQADSWRSLGVQMQDVPSLRQLMAIEGK 263

Query: 200 INAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVV 259
           INAFIHCFVG R+I +L+DLE AIC++E ++ F++LELGP L+ PL+  YF   S     
Sbjct: 264 INAFIHCFVGARRIVTLHDLEVAICRNEFVDSFDDLELGPLLQHPLVLLYFPSISSSTGP 323

Query: 260 FKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIR 319
            +I + ++I  L  Y  T+  +D+K+DEFL+F+A ++S+ SKE+LGVRIQ+L M+++FI 
Sbjct: 324 VQITSEEIISFLDSYLHTYMTEDVKLDEFLNFVASQKSVTSKEKLGVRIQSLRMYVSFIL 383

Query: 320 EARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGA 379
           +A++ +  TLK  L EL      +K  I SS+K+RQ  ++   + ER  SF+  ++D+  
Sbjct: 384 DAKRQEGETLKVLLTELH-----QKYQIPSSKKQRQ--DKPLTVSERADSFALHHKDYCG 436

Query: 380 KHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLK 439
           KHIRF SSSS+D DS  C YE  N  S           SD  ++CPYPSV EE+KRL   
Sbjct: 437 KHIRFDSSSSDDNDS--C-YEVRNKNS-----------SDHFNTCPYPSVAEEMKRL--- 479

Query: 440 GEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPIESGDQ 499
                               G S KKRK E      S  +K LR++ +K +    +   +
Sbjct: 480 --------------------GGSNKKRKGEKRNHEKSDSSKLLRKSPSKLQGHAKQEIPR 519

Query: 500 TQDD-------ELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKA 552
             DD        ++EAD + S   +R+FI+TWK+ CK  ++S  +EKM  FY     +  
Sbjct: 520 LTDDSDAKKVFSVDEADFTLSEGDLRLFISTWKDTCKELSISTFVEKMLSFYNLGGSEGR 579

Query: 553 ARYFKRMKLMFSSYPCIGLLNIAVT 577
           A+  KR K M SS+P +GLLN+AV+
Sbjct: 580 AQ-IKRAKAM-SSFPFVGLLNVAVS 602


>gi|115456025|ref|NP_001051613.1| Os03g0803800 [Oryza sativa Japonica Group]
 gi|113550084|dbj|BAF13527.1| Os03g0803800, partial [Oryza sativa Japonica Group]
          Length = 642

 Score =  307 bits (786), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 181/488 (37%), Positives = 280/488 (57%), Gaps = 31/488 (6%)

Query: 138 LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTE 197
           LER   A  K R E+I AGE V+ WK++++VL+ L+VDSW SLG Q+ EVP L  L + E
Sbjct: 179 LERAQAAARKVRDEIIKAGEGVTGWKIAQAVLVALKVDSWGSLGVQLHEVPLLRDLFLVE 238

Query: 198 ARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVD 257
            ++N FIHC+V  R+I S+YDLE  ICK+E I +FEEL LGPFL+ PL++HYF V +D+ 
Sbjct: 239 GKVNTFIHCYVAARKIVSVYDLESEICKNESIGQFEELGLGPFLQHPLVAHYFSVPADLS 298

Query: 258 VVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINF 317
           +V K++++++I  L ++ D  K K I V+ FLD++A+++S++ KE LGVRIQ+L +HI+F
Sbjct: 299 LVPKLSSDEIINWLQKFMDNSKKK-ITVENFLDYLAEQKSVSGKENLGVRIQSLRLHISF 357

Query: 318 IREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEK----KRQLDERFSAICERVKSFSSI 373
           +R+AR+++   +K             ++ +  + K    K+ LDERFSAI  R+K    I
Sbjct: 358 LRQARRTEVSAVKVQGNTSGSGDGSCEKDLVKNRKFHLSKQALDERFSAITSRIKKLPGI 417

Query: 374 NEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSD---RVSSCPYPSVT 430
           N     KHI F  S+ ++ D D  +   + D S +    + I   D   RVSSCPYPS T
Sbjct: 418 N-----KHIHF-DSTDDETDGDSSSEGDAVDNSESKTGSAAIDNKDVDKRVSSCPYPSKT 471

Query: 431 EELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQR 490
           EE++RLGLK E + +     +   +    G +++KRK E +G  +S+  +          
Sbjct: 472 EEMERLGLKSETSKKPPLDSSKVKESSKKGYTREKRKSEENGSPTSSCKR---------- 521

Query: 491 ALPIESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
             P +             +   S   +  FITTWKEAC+ + + +VLE +  +Y  T   
Sbjct: 522 --PKKKQKVQMQKHELSPNCFLSIGKLEKFITTWKEACREHPVQQVLELLANYYAET--- 576

Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAVTSIKRGMWDSVYDTLQAITQQELNNTVSGNSLEY 610
              +  K++   FS YP IG LN+AV ++  G+ DS+YD +    + + ++ +   + E 
Sbjct: 577 --PKEKKKIIKFFSEYPGIGFLNVAVRAMGCGLLDSLYDAINVFNENKSSSNIPDTTTEL 634

Query: 611 ESIEIEPS 618
             +E  PS
Sbjct: 635 MEVEPPPS 642


>gi|240254043|ref|NP_172307.5| no vein-like protein [Arabidopsis thaliana]
 gi|332190150|gb|AEE28271.1| no vein-like protein [Arabidopsis thaliana]
          Length = 746

 Score =  287 bits (734), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 191/505 (37%), Positives = 276/505 (54%), Gaps = 82/505 (16%)

Query: 115 QQRPLT---QQPQSLP-PQPQ---KLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRS 167
           QQ+P+    Q  Q+L  P PQ     N E + RID+AV   R  LI AG++VSS +VS+S
Sbjct: 95  QQQPIRYNFQTQQNLNFPHPQFGGGANVELI-RIDKAVNNTRKSLIAAGDNVSSTRVSQS 153

Query: 168 VLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSE 227
           VL +LQ D+W SLG  MQ+VPSL +LM  E +I AFIHCF+G R I +L+DLE AIC++E
Sbjct: 154 VLAQLQADTWRSLGIWMQDVPSLRQLMALEGKIIAFIHCFIGARGIVTLHDLEVAICQNE 213

Query: 228 GIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDE 287
            +  F++L LGP L+ PL+  YF          KI + ++I  L  Y +T+   D+K+DE
Sbjct: 214 FVGCFDDLGLGPLLQHPLVLLYFPSVYCSTAPVKITSEEIISLLDSYLNTYDIDDVKLDE 273

Query: 288 FLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRKRPI 347
           FLDF+A+ +++ SKE+LGVRIQNL M+++FI++A++ +   LK  L EL  H K R   I
Sbjct: 274 FLDFVAEAKAVTSKEKLGVRIQNLRMYVSFIQDAKRQEGEILKIVLTEL--HQKYR---I 328

Query: 348 FSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSNDISS 407
            SS+K+RQ                  ++D+  KH RF S SSE+ DS D  YE  N    
Sbjct: 329 LSSKKQRQ------------------DKDYCGKHTRFNSPSSEENDSAD--YEVEN---- 364

Query: 408 NVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKKKRK 467
                  +K SD  SSCPY S  EE+K+L                       GSS KKRK
Sbjct: 365 -------VKRSDHFSSCPYSSAEEEVKQL-----------------------GSSSKKRK 394

Query: 468 IENSGCTSSAPAKFLRRNKAKQR------ALPIESGDQ-TQDDELNEADISFSNESMRMF 520
            E+     S   K LRR  +K R       +P  + D   Q   +N+AD + S  ++++F
Sbjct: 395 AESRNHEKSDSPKLLRRGPSKLRRGHVKQKIPKSADDSDAQIFSVNDADFTLSEGALKLF 454

Query: 521 ITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVTSIK 580
           I+TWKE CK  ++S  ++K+  FY     +   +   ++    SS+P +GLL++A   + 
Sbjct: 455 ISTWKETCKELSISVFVKKLLSFYNLGGSEVHGQI--KIATAVSSFPFVGLLHVAKELVT 512

Query: 581 RGMWDSVYDTLQAITQQELNNTVSG 605
             + +      Q  T+  + N V+G
Sbjct: 513 EALEE------QIPTEITITNLVAG 531


>gi|6579199|gb|AAF18242.1|AC011438_4 T23G18.16 [Arabidopsis thaliana]
          Length = 512

 Score =  283 bits (723), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 187/480 (38%), Positives = 267/480 (55%), Gaps = 79/480 (16%)

Query: 115 QQRPLT---QQPQSLP-PQPQ---KLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRS 167
           QQ+P+    Q  Q+L  P PQ     N E + RID+AV   R  LI AG++VSS +VS+S
Sbjct: 95  QQQPIRYNFQTQQNLNFPHPQFGGGANVELI-RIDKAVNNTRKSLIAAGDNVSSTRVSQS 153

Query: 168 VLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSE 227
           VL +LQ D+W SLG  MQ+VPSL +LM  E +I AFIHCF+G R I +L+DLE AIC++E
Sbjct: 154 VLAQLQADTWRSLGIWMQDVPSLRQLMALEGKIIAFIHCFIGARGIVTLHDLEVAICQNE 213

Query: 228 GIEKFEELELGPFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDE 287
            +  F++L LGP L+ PL+  YF          KI + ++I  L  Y +T+   D+K+DE
Sbjct: 214 FVGCFDDLGLGPLLQHPLVLLYFPSVYCSTAPVKITSEEIISLLDSYLNTYDIDDVKLDE 273

Query: 288 FLDFIAKKRSLASKEQLGVRIQNL---GMHINFIREARKSQDVTLKKFLKELQPHHKRRK 344
           FLDF+A+ +++ SKE+LGVRIQNL    M+++FI++A++ +   LK  L EL  H K R 
Sbjct: 274 FLDFVAEAKAVTSKEKLGVRIQNLRYDQMYVSFIQDAKRQEGEILKIVLTEL--HQKYR- 330

Query: 345 RPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSDDCTYECSND 404
             I SS+K+RQ                  ++D+  KH RF S SSE+ DS D  YE  N 
Sbjct: 331 --ILSSKKQRQ------------------DKDYCGKHTRFNSPSSEENDSAD--YEVEN- 367

Query: 405 ISSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGLKGEINHQLTSAGNSSGQDDYIGSSKK 464
                     +K SD  SSCPY S  EE+K+L                       GSS K
Sbjct: 368 ----------VKRSDHFSSCPYSSAEEEVKQL-----------------------GSSSK 394

Query: 465 KRKIENSGCTSSAPAKFLRRNKAKQR------ALPIESGDQ-TQDDELNEADISFSNESM 517
           KRK E+     S   K LRR  +K R       +P  + D   Q   +N+AD + S  ++
Sbjct: 395 KRKAESRNHEKSDSPKLLRRGPSKLRRGHVKQKIPKSADDSDAQIFSVNDADFTLSEGAL 454

Query: 518 RMFITTWKEACKNNTMSEVLEKMFQFYKPTDPKKAARYFKRMKLMFSSYPCIGLLNIAVT 577
           ++FI+TWKE CK  ++S  ++K+  FY     +   +   ++    SS+P +GLL++AV+
Sbjct: 455 KLFISTWKETCKELSISVFVKKLLSFYNLGGSEVHGQI--KIATAVSSFPFVGLLHVAVS 512


>gi|432926042|ref|XP_004080801.1| PREDICTED: uncharacterized protein LOC101162250 [Oryzias latipes]
          Length = 2775

 Score =  273 bits (698), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 145/316 (45%), Positives = 194/316 (61%), Gaps = 13/316 (4%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            ++E IR+ EFG+G  +S     +++ Q  RLGR L  LS ELYS+D+HF+LEL+QNADDN
Sbjct: 1030 VIEDIRKSEFGIGVELSAEGRRLMEVQQERLGRGLDRLSTELYSKDTHFVLELIQNADDN 1089

Query: 1221 IYP-ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
             Y  E   P+L F++++  I +LNNE GF  +N+RA+CDVG STK     GYIG+KGIGF
Sbjct: 1090 SYSSETSVPSLAFVVEQDCITILNNESGFQEKNVRAICDVGRSTKGKHKYGYIGQKGIGF 1149

Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
            KSVF+VTD PEIHSNGFH++FD S G +G++LP            R L      L    W
Sbjct: 1150 KSVFKVTDCPEIHSNGFHLRFDRSCGPMGYILPHWAEDD------RPLDPQLQLLRQHGW 1203

Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
             T IRLP R   SE     N   +F D+HPSLLLFLHRL+ I   N     LV + +K +
Sbjct: 1204 TTKIRLPLR---SESRQTRN---LFHDVHPSLLLFLHRLRSISIYNHSEKRLVTMTRKDL 1257

Query: 1400 GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
            G  +++V   E    W V    L    ++ DV++TE+ALA  L  S   +      +QPV
Sbjct: 1258 GHNVLEVEHSEGIERWLVVKTTLYPKKVKQDVESTELALAFQLGSSPPEDVACQPQKQPV 1317

Query: 1460 FAFLPLRTYGLKFILQ 1475
            FA+LPLR++G +FI+Q
Sbjct: 1318 FAYLPLRSFGFRFIVQ 1333



 Score =  117 bits (293), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 146/621 (23%), Positives = 255/621 (41%), Gaps = 76/621 (12%)

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
            GDF +PSSRE+VD +S WNQWL SE P LF+ A   F D   F+    K    ++ F+PL
Sbjct: 1403 GDFDIPSSREDVDRDSSWNQWLRSEIPQLFLQAMEVFSDHSEFKG--LKGLCRFLQFIPL 1460

Query: 1536 VGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLRGWNDRAHSLLPDILLQ 1593
              EV  FF  +   I+  L+    L    ++    +  P +V    +    S++    L+
Sbjct: 1461 PDEVLDFFKPVAGQIIQLLKGKAFLPTRNSDGGVVYKLPSQVAICQDAVIRSVIGSDELE 1520

Query: 1594 KHLGLGFLSKDIVLS--DSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLAS-- 1649
            K+L L +L  D+  +   SL   LG+       +  + +++ +    L++ G     +  
Sbjct: 1521 KYLSLSYLHPDLSPTPPTSLLTHLGVRYLRGSDVTTVTTAMAK--ELLKAGGFHEEENIH 1578

Query: 1650 WLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVD-EGTIWLHSDCSV 1708
             L  L   +F +        G    ++ NL+ +P IPL+DG+  S++ EG  +   +   
Sbjct: 1579 QLARLLVCNFRAMEHG---YGDAESILQNLKDLPIIPLADGSVVSLNGEGVFFPVEEAKT 1635

Query: 1709 F----DGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLS 1764
                  G  G  A   L   +  V P+LLS        L  +    +  +L ++GV +L 
Sbjct: 1636 KKKKAQGQTG--AVSALYQDVSVVHPSLLSC-------LEPLESQQVRELLKRLGVHELE 1686

Query: 1765 AHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAF 1824
              ++++ HI P I  +   +    ++  YL F+  H         V              
Sbjct: 1687 PQELLEQHIYPTIKSKKWKSKPAAVVVSYLVFIKQHSSSSKGYADV-----------AVP 1735

Query: 1825 VLTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDI--DIKWYEVDITYLKHPAKESLSCGL 1882
            VLT+ G   P+   +HF KE+      NM +H       W  V   Y++          +
Sbjct: 1736 VLTSRGLLCPATERVHFSKEYR-----NMDLHGTLPGCDWILVSPCYVQ------TDGDV 1784

Query: 1883 VKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAA----IDWESNEL 1938
              WR+    +G+ D + + +  +  A  +       W  E     SAA    +D    E 
Sbjct: 1785 DGWRELLSSLGVRDGLIIRKERR--ALTAEELANGPWAAERAMGLSAAQDCLLDDYPCEE 1842

Query: 1939 VHLLSL--LTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDD-----RSFQSS 1991
              LL+   L+ +   Q    LLE+L+  W+   T  +   F +    D      R+ +SS
Sbjct: 1843 FRLLATAPLSGSALLQQRTALLELLEANWN---TGHLYSQFLTAQVIDSNGRPIRNTKSS 1899

Query: 1992 FINCICDIQWTISSMDDELHY-------PKDLFHDCDAVRSILGPSAPYIVPKVKSEKLV 2044
            F + +C ++W  +  + E          P  ++     V  +LG    Y+   + S +  
Sbjct: 1900 FHHLLCSLEWIPAYREQEGEQQEKKCLCPGSVYLPLPEVTRLLGTHVFYV--DLNSSEFS 1957

Query: 2045 CDIGLKTEVTIDDILEILKVW 2065
              +G++  +++D ++  LK W
Sbjct: 1958 RALGIREVISVDVLIHFLKEW 1978



 Score = 77.4 bits (189), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 103/439 (23%), Positives = 167/439 (38%), Gaps = 63/439 (14%)

Query: 643  SKYLEFDNSILNNAQSPVTKIIILLRKLCSCETWLTEQFRIKEFKSLGYGEFFTFLEKHA 702
            +K  EF    L +  S VT     L  + + E  LT+ F+I  F SL  G F  FL K+ 
Sbjct: 530  AKVKEFVRENLRSQNSAVT-----LAHIAALERKLTKHFQIPHFLSLERGTFLEFLVKNI 584

Query: 703  SMLSTELQKL-FTDDTIDRSSLEVSLVQHLLVVLVSQASNNLWESEIITKQMISELLRRQ 761
             +L   L  L F ++  + +       +  +   + Q   ++  S+      +   LR  
Sbjct: 585  QLLQESLGSLLFLENNKNLAGCGFRPSKQDVFEFIQQC-GSMTPSDPDELCGMESALRAH 643

Query: 762  FPLISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETT 821
            + +   +    G + +    V + R E+  + V      L    S  +S H   S    +
Sbjct: 644  YGVRDSRDLGYGPLPTLAGLVQRQR-ELGGRGV-----SLICYESAVVSKHRNQSTESGS 697

Query: 822  SAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVN 881
             ++   G        ++   A+  LL  P L DL+ WS WDF+F P  GPL  ++     
Sbjct: 698  ESVGRLG-------DMSKGQALASLLCCPLLEDLSEWSQWDFVFKPLHGPLKDFIERNAA 750

Query: 882  VKELLCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLL 941
               L  L  R G +++I  +     F  A        TA  L+ S  +A G  N P  LL
Sbjct: 751  NTGLAALEVRPGLLLKITTNTGDKHFFRAVAALDPVGTAGHLV-SMVVADGVSNAPTALL 809

Query: 942  KCHARHAFE-VMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKF 1000
              H + A    M K ++   E   S   R+                              
Sbjct: 810  ANHMQSALAGAMAKEDLSQAEEDISCYSRV------------------------------ 839

Query: 1001 GKAVPVASRFFLDCLGYLPSEF-RSFAADVLLSGLQSTIKDAPSA-ILLECHQTELRLM- 1057
                   ++F LDCL  +P+   ++    V L      +  A S  +L+   Q++LR + 
Sbjct: 840  -------AKFLLDCLIRIPTRTCQALLQKVFLEPFSLVLGQAKSKQVLIRTAQSDLRHLN 892

Query: 1058 -LHEVGLSLGILEWIHDYH 1075
             LH +G+  G+ +WI DY 
Sbjct: 893  CLHRLGILTGVTDWIRDYQ 911



 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 96/203 (47%), Gaps = 7/203 (3%)

Query: 187 VPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLI 246
           +P+L+ L      IN FI     V  I +LY+L +A+   +  + +EEL LGP  + P+I
Sbjct: 252 LPALNSLQYLTKEINLFIEATEAVSCICTLYELGQALAALKNKKHYEELNLGPLCKLPVI 311

Query: 247 SHYFLVKSDV--DVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQL 304
              F + S+   D + KI T D++  L  +  + K   + + EF+  +A         +L
Sbjct: 312 YRMFKMDSNTKDDDIPKIRTVDILKQLYYFRKSQKQPKLDLAEFMKHLADHYDCELPYKL 371

Query: 305 GVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAIC 364
           G+RI + G+ I  +    +++   L++  + +Q   ++      + E+ R++ +      
Sbjct: 372 GIRISSFGLPIATVSRVARNERTMLEQCREVIQKELEQE-----ALERLRKIKDSVFKPL 426

Query: 365 ERVKSFSSINEDFGAKHIRFVSS 387
               SFS+IN D   K+    ++
Sbjct: 427 HSPGSFSTINMDVRKKYTSMTAA 449


>gi|297843606|ref|XP_002889684.1| hypothetical protein ARALYDRAFT_470891 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335526|gb|EFH65943.1| hypothetical protein ARALYDRAFT_470891 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 456

 Score =  270 bits (691), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 173/446 (38%), Positives = 247/446 (55%), Gaps = 70/446 (15%)

Query: 140 RIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDS-WSSLGFQMQEVPSLHKLMVTEA 198
           RID+A+   R  LI AGE+VSS +VS+SVL +LQ D  W SLG QMQ+VPSL +LM  E 
Sbjct: 71  RIDKAINNTRKSLIAAGENVSSTRVSQSVLEQLQADDIWRSLGIQMQDVPSLRQLMALEG 130

Query: 199 RINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDV 258
           +I AFIHCF+G R I +L+DLE AIC++E +  F++L LGP L+ PL+  YF   S    
Sbjct: 131 KIIAFIHCFIGARGIVTLHDLEVAICRNEFVGCFDDLGLGPLLQHPLVLLYFPSISGSTA 190

Query: 259 VFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFI 318
             +I + ++I  L  +  T+   D+K+DEFLDF+A+++++ SK++LGVRIQ+L M+++FI
Sbjct: 191 PVQITSEELISFLDSFLKTYDMDDVKLDEFLDFVAEEKAVTSKDKLGVRIQSLRMYVSFI 250

Query: 319 REARKSQDVTLKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFG 378
           + A++ +  TLK  L  L      +K  I  S+K++Q                  ++D+ 
Sbjct: 251 QNAKRQEGETLKILLTGLH-----QKYHILPSKKQQQ------------------DKDYC 287

Query: 379 AKHIRFVSSSSEDEDSDDCTYECSNDISSNVQLPSQIKGSDRVSSCPYPSVTEELKRLGL 438
            KH RF S SSED DS D  YE  N           +   D+ SSCPYPSV EE+K+   
Sbjct: 288 GKHTRFNSLSSEDNDSAD--YEVEN-----------VNSCDQFSSCPYPSVAEEMKQ--- 331

Query: 439 KGEINHQLTSAGNSSGQDDYIGSSKKKRKIENSGCTSSAPAKFLRRNKAKQRALPI---- 494
                                 SS KKRK E+     S  +K  RR  +K     +    
Sbjct: 332 --------------------PVSSNKKRKAESRSHEKSDLSKLRRRYPSKLHRGHVKQEI 371

Query: 495 ----ESGDQTQDDELNEADISFSNESMRMFITTWKEACKNNTMSEVLEKMFQFYKPTDPK 550
               +  D  Q    NEAD + S  ++R+FI+TWK+ CK  +MS  ++K+  FY     +
Sbjct: 372 HNLADDSDAKQIFSGNEADFTLSEGALRLFISTWKDTCKELSMSMFVKKLLSFYNLGGSE 431

Query: 551 KAARYFKRMKLMFSSYPCIGLLNIAV 576
             A+  KR K M SS+P  GLL++AV
Sbjct: 432 VQAQ-IKRAKAM-SSFPFFGLLHVAV 455


>gi|126179420|ref|YP_001047385.1| hypothetical protein Memar_1474 [Methanoculleus marisnigri JR1]
 gi|125862214|gb|ABN57403.1| hypothetical protein Memar_1474 [Methanoculleus marisnigri JR1]
          Length = 1625

 Score =  260 bits (665), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 185/654 (28%), Positives = 319/654 (48%), Gaps = 65/654 (9%)

Query: 1161 IVESIRRDEFG--LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNAD 1218
            IVE I  +  G  L    +    N   K++  L  AL  LS++LY + +HF++EL+QNA+
Sbjct: 10   IVEQIHNENTGHRLKGLRNRRTKNAWNKKNRDLDSALQKLSEDLYRKKTHFVMELIQNAE 69

Query: 1219 DNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK-KGSSAGYIGRKGI 1277
            DN Y   V+P++ F ++   +V+ NNE GF+AE++  +C VG + K K  S GYIG KGI
Sbjct: 70   DNTYDPAVQPSIVFSIKPDTLVIYNNEIGFTAEDVERICSVGQTIKAKKKSEGYIGEKGI 129

Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSE-GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLES 1336
            GFKSVF+++D+P+I SNGF  +F  S+ G +G+++P  +P              P     
Sbjct: 130  GFKSVFKISDSPKIISNGFQFEFRRSDGGDLGYIVPYWLPSI------------PEYANP 177

Query: 1337 KCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            K   T I LP R    E +      ++F ++ P+L+LFL +LQ I   + +ND+L  + K
Sbjct: 178  KV--TNIILPLRADARERL------NLFDEIDPTLILFLRKLQRIEIIDEINDNLKAVTK 229

Query: 1397 KIVGDGIIKVSCGEDKMTWFV--ASQKLRAGVI-----RPDVKTTEIALALTLQESNEGN 1449
                +G++ +   +    W+V  A+ K+   +I     R D+++TEI LA  + E     
Sbjct: 230  S-GSEGLVTIDYQDQSQYWWVIRATFKVPQEIIDQEPRREDIRSTEIVLAFPVSEGGHPI 288

Query: 1450 YGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAE 1509
                + +  +FAFLP++  G  FI+Q DF+L S+RE++  +S WNQWL S+ P + + A 
Sbjct: 289  NIQNVSKPDIFAFLPVKKCGFNFIVQADFLLTSNREDILQDSRWNQWLRSKIPDVLIKAI 348

Query: 1510 RSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGLPRMILSKLRMSNCLILEGNNNQ 1568
              F       ++  K ++ ++  + L  E+   FF+ L       L    C++ E  +  
Sbjct: 349  GKF-------KHDKKLSTSFLKHLSLPEEITDEFFNPLMHQAFQALSGVECILTESGS-- 399

Query: 1569 WAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQI 1628
            W  PC V    +D    L+ +  L+ + G  +++    L   L   LGI ++ PK L + 
Sbjct: 400  WKKPCDVYYAEDD-LKDLVSNNELRSNFGKEYVANTFYLKRELRDLLGIADFTPKHLEEC 458

Query: 1629 ISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLS 1688
            +                W+    NE   + +    Q    +      +D++ ++P I L 
Sbjct: 459  LKH------------PEWVKKHSNEWLIMLYRYCAQQLEVAKSGNLTLDDIIQLPIIKLE 506

Query: 1689 DGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISV 1748
            +G + S + G I+L+ + +     +G E+  +L   LR +   + +A   ++ +  +I  
Sbjct: 507  NGDYCSANNGQIFLYLERT--GKKYGFES--DLEGTLRVLDADIRNAVRAEEDAKDLIE- 561

Query: 1749 DNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLE 1802
                  L K+ +      +I+  HILP    ET       ++  ++ F+  H E
Sbjct: 562  -----FLKKLNIVNPDPSNIINQHILPQYHRETWKKKPPGILQGHIHFIKDHFE 610


>gi|291243341|ref|XP_002741561.1| PREDICTED: predicted protein-like, partial [Saccoglossus kowalevskii]
          Length = 810

 Score =  257 bits (657), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 207/345 (60%), Gaps = 30/345 (8%)

Query: 1151 EDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFL 1210
            E+K   +   ++  IR  +FG+G  +S   S +++    R GR+L  LS ELYS+D+HF+
Sbjct: 485  EEKIQRECRAVISDIRHHQFGVGVELSEEASALMEVYKERSGRSLERLSTELYSKDTHFV 544

Query: 1211 LELVQNADDNIYPENVE---PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            LELVQNADDN Y EN++   P L F++++  + VLNNE GF  ++IRALCDVG STK   
Sbjct: 545  LELVQNADDNNY-ENMDDSPPALRFVIEKECVTVLNNESGFREKDIRALCDVGKSTKGKH 603

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
              GYIG+KGIGFKSVFR+TD PEIHSNG+H+ FD   G  G++LP  +   +        
Sbjct: 604  KYGYIGQKGIGFKSVFRITDKPEIHSNGYHMCFDAKSGPNGYILPHWIDENSC------- 656

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
                  +E   W T I LP + +  +   + N    F D+HPSLLLFLHRL+ I  RN +
Sbjct: 657  ------VEDTEWKTRIVLPVKAETQDRRLLAN---RFHDIHPSLLLFLHRLKSISIRNKV 707

Query: 1388 NDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI-RPDVKTTEIALALTLQESN 1446
            N     + +  V D +I+V        W V  ++L A  + + +V++TE+ALA  LQ+ +
Sbjct: 708  NGEEHSMTRTDVDDDVIEVQHTNGVDRWIVTKKQLDASNMSKENVESTELALAFLLQQPS 767

Query: 1447 EGNYGPLLY------QQPVFAFLPLRTYGLKFILQGDFVLPSSRE 1485
             G+    +Y      +Q VFAFLPLR+YG +F++QG+  +  S+E
Sbjct: 768  SGS---AIYPQSTPDKQKVFAFLPLRSYGFRFVVQGEKNIAGSKE 809



 Score = 56.6 bits (135), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 58/116 (50%), Gaps = 10/116 (8%)

Query: 836 SVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLP-------GWLLNEVNVK---EL 885
           S + + A+  LL AP L DL+ WSHW  +F P  G +        G+++    +    ++
Sbjct: 78  SQSKEKAMICLLNAPLLEDLSVWSHWSLIFQPEHGDIKQFIEKNTGYMIKNNGISTAFDM 137

Query: 886 LCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVPLPLL 941
             L T+ G ++++   ++ + F ++A  G S QT   L+S     GG  + PL LL
Sbjct: 138 CVLETKPGVLLKLSTDSTPELFSQSAENGESHQTVGHLVSIVIHHGGVTSSPLALL 193


>gi|348676380|gb|EGZ16198.1| hypothetical protein PHYSODRAFT_334382 [Phytophthora sojae]
          Length = 2724

 Score =  253 bits (647), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 244/843 (28%), Positives = 374/843 (44%), Gaps = 127/843 (15%)

Query: 768  KIEDKG--SMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSAIT 825
            ++ED G  S+E  ++   K +    +  V F+ ++       DL V  + S  +T S+  
Sbjct: 679  RVEDLGLPSVEELIKLADKEQAGEETSVVKFAGSL-------DL-VQGDGSFDDTVSSNE 730

Query: 826  HTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVKEL 885
             TG    + + + ++ A+E L + P+L D + +  W   +AP  GPL    L+ + V E+
Sbjct: 731  TTG--ESVQQKLLTEQALEKLRKCPYLVDASLYMDWRERYAPLCGPL----LSFIRVHEM 784

Query: 886  LCLVTRDGKVIRIDHSASVDSFLEAALEGSSFQTAVKLLSSFALAGGEKNVP--LPLLKC 943
            + L          DH+ S ++F+            V  L+   L   EK+ P  L LL  
Sbjct: 785  ILL----------DHAPSSNNFM-----------FVSCLNGTILRVNEKSTPSDLELLFA 823

Query: 944  HARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVANIDNLSGEVQKQLLKFGKA 1003
             A+ A        V    V     C+       G+ NF    +          + K   A
Sbjct: 824  RAQQAKTHAAARQVAVHLVSMVVTCK-------GQANFPKQLVQAHLRAYLSSVTKDKTA 876

Query: 1004 VPVASRFFLDCLGYLPSEFRSFAADVLLSGL-QSTIKDAPSA--ILLECHQ-TELRLMLH 1059
                 RF L+ L   P EF  F   +LL  L Q+T  ++ SA  +   C    E + +L+
Sbjct: 877  GSYPERFVLEVLLETPVEFADFVMSLLLGVLSQTTGSESASADRVWKACRSDAERKALLY 936

Query: 1060 EVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNS----------GSGCAE 1109
                SL  L W  +   +C    +  L        NAA     S          G     
Sbjct: 937  VSSRSLSGL-WATESEQWCILRDTPSLSSTDDDGGNAAKENTKSSATPQPVTSTGDDGKT 995

Query: 1110 GSLFESVGADVHIEECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDE 1169
             SLF +   +V +         + G+ ++D     + + +  D  C      +E +RR +
Sbjct: 996  VSLFNASSQEVAL---ANFSSELSGQNTNDL---SSVEDIASDDSCRS---FIEDLRRKQ 1046

Query: 1170 FGLGPNISN-MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEP 1228
            FG+G  I +   +++L+ Q  RL RAL  LS ELYS+ +HF+LEL+QNADDN Y + V P
Sbjct: 1047 FGVGLQIQDEATTSVLRIQQQRLERALKRLSDELYSESTHFVLELLQNADDNTYDDAVVP 1106

Query: 1229 TLTFIL-QESGIVVLNNEQGFSAENIRALCDVGNSTKKG-SSAGYIGRKGIGFKSVFRVT 1286
               F L  E  IV  NNE+GFS  NI+A+CDVG STK+   S   IG+KGIGFKSVF+V+
Sbjct: 1107 LGDFTLTAEKEIVFYNNERGFSPSNIQAICDVGASTKEAVDSEASIGKKGIGFKSVFKVS 1166

Query: 1287 DAPEIHSNGFHVKFDTSEGQ----IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTC 1342
            D P++HSNGFH+ F     +    +G++LP     + +D        D  Q + +   T 
Sbjct: 1167 DNPQVHSNGFHICFHAKNAEHGSGMGYILP-----YWLD--------DATQWKQRRGTTF 1213

Query: 1343 IRLPFRTKFSEGIAMNNIVDMFSDLHPSL--------------------LLFLHRLQCIM 1382
            + LP      + +  ++I        PS+                    L FL + + + 
Sbjct: 1214 V-LPLNDSSVQRV--DDISQSLMAFEPSVLLFLRRIRELRLRDPARQHALHFLKKEKQLQ 1270

Query: 1383 FRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL 1442
                + +    ++K      +++         W V  +KL    +      TEIALA  L
Sbjct: 1271 TNAQIVELYSQVKKSESSVEVVQ-------QNWLVVKEKLEPPQLFGRSHPTEIALAFPL 1323

Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
                E +  PL   Q VFA+LPLR+YG +FILQGDF +PSSRE +   S WN+WL+S+FP
Sbjct: 1324 TFQGEESRPPL---QEVFAYLPLRSYGFRFILQGDFEIPSSREAITNGSEWNEWLVSKFP 1380

Query: 1503 ALFVSAERSFCDLPCFRENPA----KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
             L  +A  S+          A    +A S  +S +PL  EV   F  +   ++ +LR   
Sbjct: 1381 KLVRAAVGSYVSSAQASGQEADKAIEAISHLLSLLPLENEVQAPFRSIIPEVMRELRQVK 1440

Query: 1559 CLI 1561
             L+
Sbjct: 1441 WLL 1443



 Score = 74.3 bits (181), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 59/232 (25%), Positives = 114/232 (49%), Gaps = 18/232 (7%)

Query: 166 RSVLLKLQVDSWSSL-GFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAIC 224
           R++    +V+++  L G    ++P+L +L     R+  F+ CF+  R+I +L++ ++A  
Sbjct: 103 RAICSHFRVNAFEELMGTSALQLPALRQLHTVNQRVWTFVTCFMQSRRINTLFECQQAFL 162

Query: 225 KSEGIEKFEELELG-PFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDT--HKA- 280
           + EG+  F EL+LG  FL    +    L +S V  +  + T DV+  L ++ D   H A 
Sbjct: 163 QHEGLRAFHELKLGNSFLHTEAVQQ--LYRSPV-ALLTVTTRDVLASLKQFEDMLGHDAF 219

Query: 281 ---KDIKVDEFLDFIAKKRSLASKEQLGVRI--QNLGMHINFIREARKSQDVTLK----K 331
                I ++EFL ++A++    S + +GV I     G+++  +R    ++   +K    +
Sbjct: 220 RASSHIDLNEFLQYLAQQYRQPSAQAMGVAIDPSGFGVYVGMLRRVANNEMKEMKTLEQQ 279

Query: 332 FLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIR 383
           F +E+     R  +  FS + ++Q  E   A     +S    +E+ GA+  R
Sbjct: 280 FQREVAEKMFRLTKEKFSDDNRKQALEELLAQTNAAQSTQQESEE-GAQKRR 330


>gi|301122067|ref|XP_002908760.1| phosphoserine aminotransferase, putative [Phytophthora infestans
            T30-4]
 gi|262099522|gb|EEY57574.1| phosphoserine aminotransferase, putative [Phytophthora infestans
            T30-4]
          Length = 3635

 Score =  243 bits (619), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 290/1080 (26%), Positives = 454/1080 (42%), Gaps = 199/1080 (18%)

Query: 821  TSAITHTGLRPKMSESVTSKDAIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGW----- 875
            + ++  TG   ++ +   +  A+E L   PFL D++ +  W   +AP  GPL  +     
Sbjct: 1809 SDSVEETG---ELGQKTLTDQALEKLRTCPFLVDVSLYMDWPERYAPLCGPLLSFIRVHE 1865

Query: 876  --LLNEVNVKELLCLVTRDGKVIRIDHSASVDSFLEAALEG--------SSFQTAVKLLS 925
              LL+       + +   +G ++R++   S  S LE  L          S+   AV L+S
Sbjct: 1866 MILLDHAPSSNFMFVSCVNGAIVRVNEK-STPSDLELLLTRAQQEKTRVSASVVAVHLVS 1924

Query: 926  SFALAGGEKNVPLPLLKCHARHAFEVMFKNNVEDIEVINSQNCRMHGNVLCGRQNFDVAN 985
                  G+ N P  L++ H R                                       
Sbjct: 1925 MLVTCKGQANFPKQLVQAHLR--------------------------------------- 1945

Query: 986  IDNLSGEVQKQLLKFGKAVPVASRFFLDCLGYLPSEFRSFAADVLLSGLQSTIKDAPSAI 1045
                       L  F K      RF L+ L   P EF  F   +LLS L  ++     A+
Sbjct: 1946 ---------AYLSSFAKHGDYPKRFLLEILLETPVEFADFVVALLLSVLGQSVSTNEYAV 1996

Query: 1046 LLE----CHQTELRLMLHEVGLSLGILEWIHDYHAFCSTGTSDL--LMPCVVTCTNAATS 1099
              +    C     R  L  V        W      +C+    DL  ++   V+ T A +S
Sbjct: 1997 ADQLWKTCTSDAERKALVFVSSHSSASLWKDQTEKWCAV-REDLTAIIDGDVSKTTAVSS 2055

Query: 1100 --GLNSGSGCAEGSLFESVGADV-------HIEECGAICDTICGEASDDGLGDCTTQTLP 1150
                +SG     GS+ E+V A+V        +EE   I  +  G         C +    
Sbjct: 2056 IDVADSGDDDVVGSVLEAVDAEVPSFSVDQEMEEVSVIETSTSGS--------CRS---- 2103

Query: 1151 EDKECEDAALIVESIRRDEFGLGPNISN-MESNMLKKQHARLGRALHCLSQELYSQDSHF 1209
                       +E +R+ +FG+G  I +   +++L  Q  RL RAL  LS ELYS+ +HF
Sbjct: 2104 ----------FIEDLRKKQFGVGLEIQDEATTSVLLIQQQRLERALKRLSDELYSESTHF 2153

Query: 1210 LLELVQNADDNIYPENVEPTLTFIL-QESGIVVLNNEQGFSAENIRALCDVGNSTKKG-S 1267
            +LEL+QNADDN Y + V P   F L  +  IV  NNEQGF+  NI+A+CDVG STK    
Sbjct: 2154 VLELLQNADDNTYDDAVVPLGEFTLTNDKEIVFYNNEQGFTRANIQAICDVGASTKAAVD 2213

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
            S   IG+KGIGFKSVF+V D P++HSNGFH+ F       G  +  ++P + +D      
Sbjct: 2214 SEASIGKKGIGFKSVFKVADDPQVHSNGFHICFHAKNSNHGTGMGYILP-YWLD------ 2266

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLH----------- 1376
              D  Q   +   T + LP      + +  ++I        PS+LLFL            
Sbjct: 2267 --DTTQWRQRRGTTFV-LPLNDASVQRV--DDISQSLMAFEPSVLLFLRRIRELRLLDSV 2321

Query: 1377 RLQCIMF-----RNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
            R   + F     R   N  LV +  ++  +  ++V+    +  W V  + L    +    
Sbjct: 2322 RQHSLHFLKKETRLQTNSKLVKLYSQVKSESSVEVT----QQKWLVVKETLEPPQLFHRS 2377

Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
             TTEIALA  + +  +G+  PL   Q VF +LPLR+YG +FILQGDF +PSSRE +   S
Sbjct: 2378 HTTEIALAFPVDDF-KGDRPPL---QQVFCYLPLRSYGFRFILQGDFEIPSSREAITNGS 2433

Query: 1492 PWNQWLLSEFPALFVSAERSFCD-LPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMI 1550
             WN+WL+S+FP L  +A  S+   LP      A  A +             F S +P M 
Sbjct: 2434 DWNEWLVSKFPKLMRAAVHSYVSTLPTDESRIAAIAHLLSLLPLESEVQAPFRSIIPEM- 2492

Query: 1551 LSKLRMSNCLILEGNNNQWAPP-----CKVLRGWNDRAHSLLPDI---LLQKHLGLGFLS 1602
            + +LR    L     +++   P     C  L G ++    LL  +   +L       FL 
Sbjct: 2493 MRELRQLKWLASTSLSSELYKPMELVDCSELAGTSESTVMLLEALSEDVLASTGNKRFLH 2552

Query: 1603 KDI--VLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFH 1660
              +   ++ S+   L +E+     L++++ SL   +N      ++W    L  L  + + 
Sbjct: 2553 ATLSQAMTTSMKNQLRVEQLHASHLMRVL-SLSMEKN-----DINWTVKVLALLAKL-WR 2605

Query: 1661 SSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTI-WLH-SDCS----VFDGGFG 1714
                SSL       L   L+ I   PL   +     EG + W+  +DC     V +   G
Sbjct: 2606 KDRHSSL-------LRQELRLIKCFPLHKDSA----EGVVKWVSLADCHDSLFVSEDSAG 2654

Query: 1715 LEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLK-IGVQQLSAHDIVKVHI 1773
                      L  +  +     AV+KS+        L   LLK +GV  +  H++++ H+
Sbjct: 2655 SGTSYAFFGDLHILDDSF--TKAVNKST-------RLRTFLLKDVGVHVMEDHELIRHHV 2705

Query: 1774 LPAISDETTANGDK---NLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHG 1830
            LP +++  +++ DK     + +Y  F++ HL   C +C +      +E++    V T+ G
Sbjct: 2706 LPKMTELRSSSDDKLDVGAVMEYGRFLVSHLA-TCRDCSMH-----AEVKANTAVATSSG 2759



 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 67/278 (24%), Positives = 129/278 (46%), Gaps = 25/278 (8%)

Query: 100  NQQRSQTQHPPGASNQQ---RPLTQQPQSLPPQPQKLNKEFLERIDRAVVKARSELIMAG 156
             QQ  + +    ASNQQ   +P  Q P++  PQ Q+    F+E    A    +S      
Sbjct: 1156 QQQHGEERGGSYASNQQGYYQPSHQHPRA-SPQVQRDVVAFIE----AFRANQSSPSAPP 1210

Query: 157  ESVSSWKVSRSVLLKLQVDSWSSL-GFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITS 215
             +++   V R++     V+++  L G    ++P+L +L     R+  F+ CF+  R+I +
Sbjct: 1211 NALTLDVVVRAICSHFHVNAFEELMGTSALQLPALRQLHTVNQRVWTFVTCFMQSRRINT 1270

Query: 216  LYDLEEAICKSEGIEKFEELELG-PFLRQPLISHYFLVKSDVDVVFKIATNDVIVCLSEY 274
            L++  +A  + EG+  F EL+LG  FL    + H +        +  + T DV+  L ++
Sbjct: 1271 LFECHQAFLQHEGLRSFHELKLGNSFLHTEAVQHLY---HSPRAMLTVTTRDVLTFLRQF 1327

Query: 275  TDT--HKA----KDIKVDEFLDFIAKKRSLASKEQLGVRI--QNLGMHINFIREARKSQD 326
             +   H A      + ++EFL F+A++    + + +GV I     G+++  +R     + 
Sbjct: 1328 EEMLGHDAFRASSHVDLNEFLQFLAQQYRQPNAQAMGVAIDPSGFGVYVGMLRRVANYEM 1387

Query: 327  VTLK----KFLKELQPHHKRRKRPIFSSEKKRQLDERF 360
              +K    +F +E+     R  +  FS E +++  E+ 
Sbjct: 1388 KEMKTLEQQFQREVAEQMFRLTKEKFSDENRKRALEKL 1425


>gi|389776172|ref|ZP_10193760.1| hypothetical protein UU7_07556 [Rhodanobacter spathiphylli B39]
 gi|388436844|gb|EIL93681.1| hypothetical protein UU7_07556 [Rhodanobacter spathiphylli B39]
          Length = 1590

 Score =  238 bits (606), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 223/739 (30%), Positives = 347/739 (46%), Gaps = 122/739 (16%)

Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENI 1253
            AL  LS ELYS+ SHF+LEL+QNADDN Y     P LTF +  + +VV+NNE GF  ENI
Sbjct: 61   ALRLLSVELYSKKSHFVLELIQNADDNRYELGTTPQLTFEVTSNRLVVVNNEIGFRDENI 120

Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-TSEGQ-IGFVL 1311
             A+C VG S+K    +GYIG KGIGFKSVF V+DAPEIHSNGFH+KFD T EG  +G+++
Sbjct: 121  EAICKVGASSKPKEKSGYIGEKGIGFKSVFTVSDAPEIHSNGFHIKFDRTVEGNLLGYIV 180

Query: 1312 PTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSL 1371
            P    P             P ++   C  T I LP    +S G       D  + L   L
Sbjct: 181  PQWCEP-------------PEEVRPNC--TTIILPASKNYSFG------ADTLAALDARL 219

Query: 1372 LLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI---- 1427
            LLFL+ L+ +   +  ND   V  +++  + + +++   +K +  +AS+++R   +    
Sbjct: 220  LLFLNTLRQLTL-DYDNDR--VTYRRVDSEEVSRLTAEREKTSGELASEEMRYVRVSSRF 276

Query: 1428 -----------RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
                       R  +  + + LA  +         P      VFAFLP+R  G KF +QG
Sbjct: 277  AMDEAHADQEKRAGIGQSTVVLAFPIDAGGAAKPEP---SSNVFAFLPIRQMGFKFSIQG 333

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV 1536
            DF+L +SREE+  + PWN++L  +   +F SA      +  F+++   A S Y+ +VP  
Sbjct: 334  DFILSASREEILTDRPWNRFLRDQIATVFSSA------VEVFKKSEGLALS-YLKYVPDE 386

Query: 1537 GE-VHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKH 1595
             E V  FF  + + I++KL    C  L   +  W  P ++LR       +L P  L+ + 
Sbjct: 387  HEVVDPFFRTVRKSIIAKLSGVEC--LRSASGVWKRPGELLRA-EKTFRALFPSKLVIEL 443

Query: 1596 LGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELY 1655
                ++   I+ S  L RALG+ + GPK+++ ++    R +   ++  L+W A +     
Sbjct: 444  TDFDYVDAKILGSVELLRALGVTDVGPKLVVDVL----REKAWFQAQPLAWRAKFY---- 495

Query: 1656 TISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVF-----D 1710
              ++ +  + SL +         L + P +P+SD        G+  + S   VF      
Sbjct: 496  --AYVADNRESLTAV-------GLLQAPCLPISD--------GSCVVPSQLEVFFPLSRR 538

Query: 1711 GGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVK 1770
              +G E      ++L  V   L   +A  K S  V+       +   + V+    +D++ 
Sbjct: 539  KKYGFE------SELVFVDNELYEEAA--KHSKHVVE------LFSAMKVRSDVPYDLIS 584

Query: 1771 VHILPA-ISDETTANGDKNLMADYLCFVMMHLEYYCPNCHV---EREFIVSELRKKAFVL 1826
             HILP   SD  T + DK ++  +L +V   L+ Y     V     +     LR    V 
Sbjct: 585  SHILPTHASDAWTESEDKAIVG-HLRYVKDKLKDYLDGALVAGKSEQQAYQALRDGIRVG 643

Query: 1827 TNHG-------FKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYL----KHPAK 1875
            T          F+R  E  ++  KE+    S+   + D       V  +YL    K P +
Sbjct: 644  TKQQEAEGLWIFRRVGE--LYLSKEYRPDFSIESFLGDQIAAEKLVSPSYLTDKPKDPIQ 701

Query: 1876 ESLSCGLVKWRKFFKEIGI 1894
            E+ S     WR+FF  +G+
Sbjct: 702  EADS-----WRQFFTALGV 715


>gi|325185329|emb|CCA19816.1| phosphoserine aminotransferase putative [Albugo laibachii Nc14]
          Length = 2536

 Score =  233 bits (595), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 203/695 (29%), Positives = 319/695 (45%), Gaps = 95/695 (13%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESN---------MLKKQHARLGRALHCLSQELYSQD 1206
            E    ++ESIR    GL    +  ++N         +   Q  RLGRAL  LS ELY+ +
Sbjct: 940  EACRALIESIRTSRLGLEIPQAPADTNGSLVTSTRQIFTSQQRRLGRALKRLSAELYAFN 999

Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG 1266
            SHF+LEL+QNADD ++  NV PT   ++ E+ +   +NE GF+  N++ALCDVG STK+ 
Sbjct: 1000 SHFVLELIQNADDAVFQGNVSPTCKLLITENSVEFTSNEVGFTDANVQALCDVGASTKE- 1058

Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE----GQIGFVLPTLVPPFNIDM 1322
            S A  IG KGIGFKSVF+++D P++HSNGFH++F  S     G +G+++PT +       
Sbjct: 1059 SDATCIGNKGIGFKSVFKISDRPQVHSNGFHIQFQASSSSDNGMLGYIVPTWI------- 1111

Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFR---TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
                  +DP      C  T I LPF     ++ E I    +  + + + PSLLLFL R++
Sbjct: 1112 ------EDPALWRVDC-GTSIVLPFHEEARRWKEAI----VEALHATIEPSLLLFLRRIK 1160

Query: 1380 CIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV--KTTEIA 1437
             +   +  N     I K+   +   ++ C + K      S+  +   +   +  KT  + 
Sbjct: 1161 QLQVDH--NAQQTCIAKQ---ENWARIYCNDIKWKHISLSESSKGHNVSWLLVEKTIAVP 1215

Query: 1438 LALTLQESNE----------GNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEV 1487
              +T Q S+           G   P    Q VFAFLP+R+YG +FI+QGDF+LPSSRE +
Sbjct: 1216 ERITRQSSSADTRLVVGLPLGTIDPPY--QKVFAFLPVRSYGFRFIVQGDFILPSSRESI 1273

Query: 1488 DGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAK-AASVYMSFVPLVGEVHGFFSGL 1546
            D +S WNQWLL+  P LF         LP    NP   +   +   +P+   V   F  +
Sbjct: 1274 DASSEWNQWLLTFLPDLFEHIATHLW-LPGSPNNPDWISREQFHRLLPVEKHVEAPFRCI 1332

Query: 1547 PRMILSKLRMSNCLILEGNNNQWAPPCKVLR------GWNDRAHSLLPDILLQKHLGLGF 1600
             R IL   R++    +    +   PPC +L         ++   SL+ D   ++ L    
Sbjct: 1333 VREILQ--RLAKAPWIPSTQDTLLPPCTLLEVPVDVLEQDEWVESLMLDATHKRALSRET 1390

Query: 1601 LSKDIVLSDSLARALGIEE----YGPKILLQIISSLCRTENGL--RSMGLSWLASWLNEL 1654
             S    LS  +  AL IE     +  ++L     ++ R+E  +      L  L + L++L
Sbjct: 1391 SS---TLSKEIKCALQIESLQAIHVVRMLAVFTQAIGRSEEEMAKNEHKLVVLFNLLSKL 1447

Query: 1655 YTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDG-----TFSSVDEGTI--WLHSDCS 1707
            + I         +  G    L   L+ IP  P+        +  S + G    W     +
Sbjct: 1448 WPI---------VDRGASHALSRELEWIPCFPVDRPGERHISLKSAENGLFLPWKVKPAT 1498

Query: 1708 VFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHD 1767
                   +    +L  K   V P     + VD + L  +    +  +   IG++ L AH 
Sbjct: 1499 HLTHNPIVALVQSL--KYTPVKP----LAVVDGAFLLQLDGHTIEFLTRTIGLEPLDAHK 1552

Query: 1768 IVKVHILPAISDETTANGDKNLMADYLCFVMMHLE 1802
            +++  +LP +     A  D   +  +L  ++ HL+
Sbjct: 1553 VLEYQVLPGLKAHALAEMDHLQLEKWLFCLLDHLD 1587


>gi|326431928|gb|EGD77498.1| serine/threonine protein kinase [Salpingoeca sp. ATCC 50818]
          Length = 2796

 Score =  220 bits (561), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 166/539 (30%), Positives = 253/539 (46%), Gaps = 67/539 (12%)

Query: 1012 LDCLGYLPSEFRS-FAADVLLSGLQSTIKDAPSAILLECHQTELRLMLHEVGLSLGILEW 1070
            L  L +LP    S    DVL+ G +  + +  + +L        RL+L ++G    I +W
Sbjct: 1064 LAVLSHLPPPLLSPVGYDVLVEGARGVVPEMHTKLLHAAATPFHRLLLRQMGRPARIRDW 1123

Query: 1071 IHDYHAFCSTGTSDLL--MPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAI 1128
             +          S +    P     +NAA S     S     +L  +V     + + G  
Sbjct: 1124 NNAALQATDQAASRVQSSAPLSRIGSNAAPSADALASRTPSSALPPTVSGTTLLGQDG-- 1181

Query: 1129 CDTICGEA-----------------SDDGLGDCT-TQTLPEDKECEDAALIVESIRRDEF 1170
             D   G +                 +++ + D    + +PED EC     ++ +IR D  
Sbjct: 1182 -DGTTGNSVTTATATTTNGTGGVIGNEEDVHDAVLAREMPED-ECN---AVLRAIRADHG 1236

Query: 1171 -GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT 1229
             GL   ++  +   +++    + +++  L+ +LY+  +HF++ELVQNADDN Y  NV P 
Sbjct: 1237 EGLDDGMTEEKKRAVQQLRHLVQQSVEKLASDLYASTTHFVMELVQNADDNTYLTNV-PA 1295

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L+  +      + NNE GFS +N+RALC +GNSTK  S A +IG KGIGFKSVF+VT  P
Sbjct: 1296 LSITIANDRFELRNNEVGFSEKNVRALCSIGNSTKSASEARFIGHKGIGFKSVFKVTHCP 1355

Query: 1290 EIHSNGFHVKFDTSEGQIGFVLP-TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF- 1347
            +IHSN FH  FD     +G+VLP  + PP N                 +   T I LPF 
Sbjct: 1356 QIHSNNFHFGFDVKNNPLGYVLPHAIAPPPNW----------------RARETVIVLPFD 1399

Query: 1348 ----RTKFSEGIAMNNIV------DMFSDLHPSLLLFLHRLQCIMFRN-MLNDSLVVIRK 1396
                 +  + G      V      D+   L PSLLLFLH+L+ + F++ +LN+   + R 
Sbjct: 1400 EPDTNSGTNSGSGGRVAVTREFRNDLKEKLDPSLLLFLHKLKRLDFQDTVLNEKRSLQRT 1459

Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAG-VIRPDVKTTEIALALTLQESNEGNYGPLLY 1455
                 G+++++   D+  W      L     +R  V  TE+ +AL L+   +      L 
Sbjct: 1460 PRNTKGVVQITADSDRTYWLTRKLDLAPDPTLRQGVTATEMVIALPLRRYKQEER---LQ 1516

Query: 1456 QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
               VFAFLP+  Y  K I+Q D++L SSR  +   S WNQWLL + PA F    +S C+
Sbjct: 1517 SYDVFAFLPVSDYRFKLIVQADWMLTSSRNAIHEGSAWNQWLLKQVPAAF----KSLCE 1571


>gi|256074692|ref|XP_002573657.1| hypothetical protein [Schistosoma mansoni]
          Length = 3070

 Score =  211 bits (536), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 143/422 (33%), Positives = 209/422 (49%), Gaps = 93/422 (22%)

Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
            D    +E IRR EFG+G  +S   ++++++   +L R+L  LS+ELY    HFLLEL+QN
Sbjct: 989  DRHQFIEEIRRREFGVGVELSEEATDLIQRVEGKLSRSLVQLSEELYGLPGHFLLELIQN 1048

Query: 1217 ADDNIYPENVEPTLTFIL----------------QESGIVVLNNEQ-GFSAENIRALCDV 1259
            ADDN Y  N  PTL   L                Q+  ++V+NNE  GF+  ++ ALCD+
Sbjct: 1049 ADDNTYGINDTPTLQLQLSTISTMNNECNSNINGQKFSLLVMNNESVGFTEHDMSALCDI 1108

Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPT-----L 1314
            G STK       IGRKGIGFKSVF +TD PE+HSNGFHV+F        +   T     L
Sbjct: 1109 GQSTKVTQRDAKIGRKGIGFKSVFNITDTPEVHSNGFHVRFHRQSKCTKYTSQTSPSLIL 1168

Query: 1315 VPPF--------------NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI 1360
            +P +               I  +C+ L   P+  E     T I    R   S  + +  +
Sbjct: 1169 IPEWCDNEQLMKSQNSCNEIPSWCKTLFILPLNSE-----TLINRSTRIVQSPALYITQL 1223

Query: 1361 VDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI--------VGDGIIKVSCGEDK 1412
            +   + LHPSL+LFL RLQC+ F +M ++    +++ I             +K+ C  + 
Sbjct: 1224 IQ--TTLHPSLMLFLRRLQCLRFSSMESNIYWNVKRTIKTLSSSSISLGISVKIQCSAEI 1281

Query: 1413 MT-------------------WFVASQKLRAGVIRPDVK--------TTEIALALTLQES 1445
            +T                   WF   +     +I  D K         TEI+LA++L +S
Sbjct: 1282 ITVHETQCNLSSDGETNTIHKWFCFKE-----IIPVDFKETNKNLPSQTEISLAISLNDS 1336

Query: 1446 NEGNYGPLLYQQP--VFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
            +     PL   QP  +F++LP+R+ G +F +  DF L SSRE+VD  S WN+WL+ + P 
Sbjct: 1337 D-----PL---QPCAIFSYLPIRSVGFRFFINADFDLTSSREDVDSTSAWNRWLVGKIPN 1388

Query: 1504 LF 1505
            +F
Sbjct: 1389 IF 1390


>gi|358334131|dbj|GAA52586.1| hypothetical protein CLF_108397 [Clonorchis sinensis]
          Length = 2905

 Score =  209 bits (533), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 149/423 (35%), Positives = 215/423 (50%), Gaps = 86/423 (20%)

Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
            D    +  +RR EFG G  ++   ++++++   +L R+L  LS++LY    HFLLEL+QN
Sbjct: 807  DRETFITELRRREFGCGAELTQEAADLVRRVEGKLSRSLLQLSEDLYGLPGHFLLELIQN 866

Query: 1217 ADDNIYPENVEPTLTFILQE---------SGIVVLNNEQ-GFSAENIRALCDVGNSTK-- 1264
            ADDN Y  + +PTL F + +         + ++V+NNE  GF+  ++ +LCDVG STK  
Sbjct: 867  ADDNEYAASTKPTLEFCIAQQTNDTVGPAAALLVMNNEAVGFTEADMSSLCDVGISTKVQ 926

Query: 1265 -KGSSAG---YIGRKGIGFKSVFRVTDAPEIHSNGFHVKF------DTSEGQIGFVLPTL 1314
             +G   G    + RKGIGFKSVF VTDAPE+HSNGFHV+F      D  +  +   L  L
Sbjct: 927  QRGIKTGEFSLLCRKGIGFKSVFNVTDAPEVHSNGFHVRFHRQSRSDADQVDVANAL-LL 985

Query: 1315 VP--------PFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMN--NIVDM 1363
            VP        P++I    R     P+ +    W  T   LP  +K S   AMN  +I+ +
Sbjct: 986  VPEWCTAIATPYSIHEPSRW----PIPVSVPAWCKTLFVLPIISK-STATAMNFGSILQL 1040

Query: 1364 FSD-LHPSLLLFLHRLQCIMFRN-----------------MLNDSL------VVIRKKIV 1399
              D L P L+LFL RLQC+                     +L DS       +    +I+
Sbjct: 1041 AYDTLAPYLMLFLRRLQCLTLSYGPDCKAGQEPARRKLLVLLRDSTALPSTEIAKTAEII 1100

Query: 1400 GDGIIKVSCGEDKMT--------------WFVASQKLRA---GVIRPDVKTTEIALALTL 1442
               +I+   GED+++              WF   + LR     + +     TE+ALA++L
Sbjct: 1101 SVTMIRGGDGEDQLSGKAGESSSSTITHRWFCFKETLRVNEENLPKHAPTQTEVALAVSL 1160

Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
             +S        L Q PVFA+LP+R  G  F +  DF L SSRE+VDG S WNQWL+S  P
Sbjct: 1161 TDSEP------LPQCPVFAYLPVRAAGFTFYINADFDLTSSREDVDGTSVWNQWLVSRIP 1214

Query: 1503 ALF 1505
             ++
Sbjct: 1215 TVW 1217


>gi|393214741|gb|EJD00234.1| hypothetical protein FOMMEDRAFT_170214 [Fomitiporia mediterranea
            MF3/22]
          Length = 1480

 Score =  207 bits (528), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 162/537 (30%), Positives = 263/537 (48%), Gaps = 81/537 (15%)

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAEN 1252
            +AL  LS++LYS D+HF++E VQNADDN Y + +EPTL  +  ++ + +  NE+GF  +N
Sbjct: 30   QALELLSEQLYSTDTHFIMEFVQNADDNEYEKGIEPTLRILADKTSMTLSCNEKGFYEKN 89

Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
            +RA+C +G STKK ++ G+IG KGIGFKSVF+V D   + S  F  KFD S+ ++G + P
Sbjct: 90   VRAICKIGGSTKK-NNPGFIGEKGIGFKSVFKVADIVNVASGPFTFKFDRSK-RLGMITP 147

Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLL 1372
                             DP  +    W T   L        G A+   +D   ++ PSLL
Sbjct: 148  IW---------------DPSVVADHGW-TTFYLDLSRSPGSGTAIGVKLD---EIKPSLL 188

Query: 1373 LFLHRLQCI-------------MFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVAS 1419
            LFL +L+ I             + R  LN  LV + +  + +G I      +K T+FV  
Sbjct: 189  LFLRKLRAIEVDIRTGSPRQVEIKRRDLNSDLVQLTR--IANGTIL-----NKNTYFVVR 241

Query: 1420 QKLRAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQ 1475
            Q +    +    R D+K TE+ LA  + +     + P ++ Q V AFLP+R +G KF++ 
Sbjct: 242  QCVDMSSVRDDRRKDIKETEVVLAFPVDD----EHLPQIFAQDVHAFLPVRDFGFKFVIN 297

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
            GDF+  SSRE+V  +SPWNQ +       F++A R+F       +       +++ F+P 
Sbjct: 298  GDFLTSSSREDVLHDSPWNQNIRRAIIQTFITAVRAF-------QTQGTLQDLWIRFLP- 349

Query: 1536 VGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDI--LLQ 1593
             G V  FF  + R +L  LR  N  ++  ++     PC+VL   +   +   P +     
Sbjct: 350  SGIVGSFFKAVERDLLRDLREMN--VVRDSSGAVRRPCQVLSVPSTFRYDFTPLVPARFM 407

Query: 1594 KHLGLGFLSK--DIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWL 1651
            +  G  +LS   D++    + + LG+ E     LL           GL++M  +   +  
Sbjct: 408  ERSGFYYLSDNYDMLADGPILKQLGVREMTVDDLL----------TGLQAMSSNHFMAHN 457

Query: 1652 NELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSV 1708
            N      +H +  S L    + +    LQ +  +PL DG++++  +   ++  DC +
Sbjct: 458  NR-----WHDAVCSQLLHVAKRN--KKLQGLKILPLDDGSWAATHKAGNFVF-DCDM 506


>gi|350644204|emb|CCD61038.1| hypothetical protein Smp_134120 [Schistosoma mansoni]
          Length = 3064

 Score =  204 bits (519), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 140/414 (33%), Positives = 203/414 (49%), Gaps = 83/414 (20%)

Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
            D    +E IRR EFG+G  +S   ++++++   +L R+L  LS+ELY    HFLLEL+QN
Sbjct: 989  DRHQFIEEIRRREFGVGVELSEEATDLIQRVEGKLSRSLVQLSEELYGLPGHFLLELIQN 1048

Query: 1217 ADDNIYPENVEPTLTFIL----------------QESGIVVLNNEQ-GFSAENIRALCDV 1259
            ADDN Y  N  PTL   L                Q+  ++V+NNE  GF+  ++ ALCD+
Sbjct: 1049 ADDNTYGINDTPTLQLQLSTISTMNNECNSNINGQKFSLLVMNNESVGFTEHDMSALCDI 1108

Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPT-----L 1314
            G STK       IGRKGIGFKSVF +TD PE+HSNGFHV+F        +   T     L
Sbjct: 1109 GQSTKVTQRDAKIGRKGIGFKSVFNITDTPEVHSNGFHVRFHRQSKCTKYTSQTSPSLIL 1168

Query: 1315 VPPF--------------NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI 1360
            +P +               I  +C+ L   P+  E     T I    R   S  + +  +
Sbjct: 1169 IPEWCDNEQLMKSQNSCNEIPSWCKTLFILPLNSE-----TLINRSTRIVQSPALYITQL 1223

Query: 1361 VDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI--------VGDGIIKVSCGEDK 1412
            +   + LHPSL+LFL RLQC+ F +M ++    +++ I             +K+ C  + 
Sbjct: 1224 IQ--TTLHPSLMLFLRRLQCLRFSSMESNIYWNVKRTIKTLSSSSISLGISVKIQCSAEI 1281

Query: 1413 MT-------------------WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPL 1453
            +T                   WF   +     +I  D K T       L  SN+ +  PL
Sbjct: 1282 ITVHETQCNLSSDGETNTIHKWFCFKE-----IIPVDFKETNKKYHW-LFHSNDSD--PL 1333

Query: 1454 LYQQP--VFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALF 1505
               QP  +F++LP+R+ G +F +  DF L SSRE+VD  S WN+WL+ + P +F
Sbjct: 1334 ---QPCAIFSYLPIRSVGFRFFINADFDLTSSREDVDSTSAWNRWLVGKIPNIF 1384


>gi|393234598|gb|EJD42159.1| hypothetical protein AURDEDRAFT_186149 [Auricularia delicata
            TFB-10046 SS5]
          Length = 1966

 Score =  203 bits (516), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 174/635 (27%), Positives = 288/635 (45%), Gaps = 95/635 (14%)

Query: 1158 AALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNA 1217
            AA +VE + R    +   ++ ++ N        L ++L  LS ELYS+ +HF+LE +QNA
Sbjct: 403  AADLVEELARGNSSVQNTVNELKDN--------LAKSLQILSDELYSKPAHFVLETIQNA 454

Query: 1218 DDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
            DDN Y     P L  +L+ES + +  NE GFS+ N++A C +G STKK + AGYIG KGI
Sbjct: 455  DDNTYANGAAPELRVVLRESYMTIQCNEVGFSSANVKAFCGIGKSTKK-NKAGYIGEKGI 513

Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESK 1337
            G K++F+V +   +HS G+H   D  E ++G + P                    Q  S+
Sbjct: 514  GAKAMFKVANKVHVHSLGYHFVLD-REAELGMISP--------------------QWSSE 552

Query: 1338 CWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL--------QCIMFRNMLND 1389
              N       R  F +G   + +      +  ++LLF+ RL        +        + 
Sbjct: 553  VLNEPGWTLMRLDFCDGSQFSVVQAQLQHVRDTVLLFMRRLSRFKIDDGEQFSVSRSTDG 612

Query: 1390 SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRP---DVKTTEIALALTLQESN 1446
            ++  IR KI GD +        +  +   S+++ A    P   +V TTEI LA  L E  
Sbjct: 613  NICTIRTKI-GDSV-------SRKPYLRVSRRVAAFPQEPKRENVDTTEIVLAFPLDE-- 662

Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
              +  P++ +Q V AFLP+R+ G KFI+Q DF+  ++RE+V  N  WN+ L S   A F 
Sbjct: 663  --DLRPVIAKQMVHAFLPMRSVGFKFIIQADFLTAANREDVLSNLQWNKKLRSGIAATFC 720

Query: 1507 SAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGLPRMILSKLRMSNCLILEGN 1565
            SA   F  +P    N       ++ F+P   +V   FFS L   IL+ LR    ++    
Sbjct: 721  SAVTDFMAMPSLELN-------WLLFLPDARDVADPFFSELGNTILNSLRTMEIVLCADG 773

Query: 1566 NNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVL-SDSLARALGIEEYGPKI 1624
            + +     +    +      ++P    +KHL   ++S +  + + S  RALG+E      
Sbjct: 774  HYRAPASVRFPSAFAVDDEGIIP----EKHLPFHYISPEYPINTHSKLRALGVESMS--- 826

Query: 1625 LLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQR-IP 1683
                +S LCR  + +   G  WL      +    F      + +  + +   D+  R +P
Sbjct: 827  FNDFLSGLCRMGSTITLRGNDWLEDVCRLIIAQGF------TWKQRIGSQPRDHRVRTLP 880

Query: 1684 FIPLSDGTFSSVD-EGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSS 1742
               L+DG++++ +   +++   D S F  G G++        +   SP      A     
Sbjct: 881  LAQLTDGSWTTCELSDSLFFSGDSSRFPSGLGIDVL-----HMDGTSPQRRDVLA----R 931

Query: 1743 LGVISVD---------NLNRMLLKIGVQQLSAHDI 1768
            LG+ ++D         +L+R  + + V +   HD+
Sbjct: 932  LGITTIDVNVVAKKIFDLHRGNMGLTVARTLEHDL 966


>gi|346976275|gb|EGY19727.1| hypothetical protein VDAG_01743 [Verticillium dahliae VdLs.17]
          Length = 1542

 Score =  191 bits (486), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 182/640 (28%), Positives = 280/640 (43%), Gaps = 116/640 (18%)

Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLE 1212
            +E ED    +  IRRD+ GLG     + +N      A L  AL  LS +LY   +HFLLE
Sbjct: 10   REAEDH---IRQIRRDK-GLGDGPGQIGNN-----GADLESALKVLSHDLYQTATHFLLE 60

Query: 1213 LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--SSAG 1270
            L+QNADDN+Y  +  PTL    +   + +  NE+GFS +NI A+C +  STK G   SAG
Sbjct: 61   LIQNADDNMYAVDA-PTLAITYRPGRVRIDCNERGFSRKNIEAICRICQSTKSGRSKSAG 119

Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKD 1330
            ++G KGIGFK+VF+V     I S  +  +FD  +G +G + P                 D
Sbjct: 120  FVGEKGIGFKAVFKVASTVWISSGYYSFRFD-RDGHLGMIAPIW---------------D 163

Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM-------- 1382
                  K   T I L      +EG     I+D        +L+FL RLQ +         
Sbjct: 164  AFPEAPKPGFTSIYLKLARDCNEG----GIIDELRSYDAKILIFLRRLQRLEIDVQREFW 219

Query: 1383 ----FRNML----NDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
                F+ +L    N S       ++ DG+ K       + W     +L     RP + ++
Sbjct: 220  KSGDFKTVLSRQANTSGNPSMTTLMNDGVKK-----QYLVWRHTVNRLPNDARRPGISSS 274

Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWN 1494
            E+ LA  L +  E    PL+ +Q V+AFLP+R Y  KF+LQ DF+L ++RE+V  + PWN
Sbjct: 275  EVVLAFPLDKDGET---PLIERQSVYAFLPIRDYDFKFLLQADFLLSANREDVHADLPWN 331

Query: 1495 QWLLSEFPALFVSAERSFCDLPCFREN-PAKAASVYMSFVPLVGEVH-GFFSGLPRMILS 1552
                    AL  +A+++F D      N   K    +  FV        G F+ L R +L+
Sbjct: 332  L-------ALTTAAQKAFLDAVKHMSNLTNKLRYTWFRFVTGSYSAQLGIFADLQRKLLA 384

Query: 1553 KLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQ---KHLGLGFLSKDIVLSD 1609
             L+ +   +L+    +   P K+ R         +PDI      +   L + + D  LS 
Sbjct: 385  DLQKTQ--LLDSTWGRKKKPMKLTR---------VPDIFCDNDGRPFTLHYKNDDRYLSP 433

Query: 1610 SLA---------RALGIEEYGPKILLQIISSLCRTENG--LRSMGLSWLASWLNELYTIS 1658
              +         RALG+++  P+  +  I  L +   G   +     W A +   L + S
Sbjct: 434  KYSQDEPDASSLRALGVKDITPEAFMNEIDKLLKKHRGAFFQKQTRDWHAKFSQTLTSSS 493

Query: 1659 FHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWL--HSDCSVFDGGFGLE 1716
            F S                  + +  IPL D +++S++EG ++    S+ ++   G    
Sbjct: 494  FASWHYRK-------------RTMAIIPLRDKSWTSMNEGQVYFAAQSNSTLIPEGI--- 537

Query: 1717 AFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLL 1756
                   K+R V P   +A    K     + V NL+R L+
Sbjct: 538  -------KIRIVDPE-AAADPARKLLFEHLGVANLSRPLV 569


>gi|47203291|emb|CAG14733.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 238

 Score =  190 bits (482), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 87/157 (55%), Positives = 115/157 (73%), Gaps = 2/157 (1%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            I+E IR+ EFG+G  ++     +++ Q  RLGR+L  LS ELYS+D+HF+LEL+QNADDN
Sbjct: 62   IIEDIRKHEFGIGVELNAESQKLMQTQQERLGRSLDRLSTELYSKDTHFVLELIQNADDN 121

Query: 1221 IYPE--NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
             YP    V P L F+++   I +LNNE GF  +N+RA+CDVG STK     GYIG+KGIG
Sbjct: 122  TYPSEAGVLPALAFVVENDCITILNNEMGFQEKNVRAICDVGRSTKGKHKYGYIGQKGIG 181

Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
            FKSVF+VTD PEIHSNGFH++FD + G +G++LP  V
Sbjct: 182  FKSVFKVTDCPEIHSNGFHLRFDKTSGPMGYILPHWV 218


>gi|260803433|ref|XP_002596594.1| hypothetical protein BRAFLDRAFT_187605 [Branchiostoma floridae]
 gi|229281853|gb|EEN52606.1| hypothetical protein BRAFLDRAFT_187605 [Branchiostoma floridae]
          Length = 157

 Score =  187 bits (474), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 84/152 (55%), Positives = 111/152 (73%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            +VE IRR+EFG+G  +     +++++   R+GR L  LS+ELY++D+HF+LELVQNADDN
Sbjct: 1    VVEDIRRNEFGVGLKLEKKGEDLVERHKKRIGRGLERLSKELYTKDTHFVLELVQNADDN 60

Query: 1221 IYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFK 1280
             Y  +V  TL F+++   + VLNNE GF   NIRALCDVGNSTK     GYIG KGIGFK
Sbjct: 61   EYEADVSATLVFVVEREKVTVLNNESGFQEANIRALCDVGNSTKGKHKFGYIGHKGIGFK 120

Query: 1281 SVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
            SVFRV+D PE+HS G+H+ FD S   +G++LP
Sbjct: 121  SVFRVSDKPEVHSRGYHICFDASSDPMGYILP 152


>gi|428172047|gb|EKX40959.1| hypothetical protein GUITHDRAFT_112964 [Guillardia theta CCMP2712]
          Length = 2624

 Score =  186 bits (473), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 251/1006 (24%), Positives = 412/1006 (40%), Gaps = 184/1006 (18%)

Query: 1134 GEASDDGLGDCTTQTLPEDKECEDAALIVESIRR------DEFGLGPNISNMESNMLKKQ 1187
            GEA+ +   +    T   D+      LI   IR       D  GL P +  +    +++Q
Sbjct: 990  GEAAQERAIEARKCTHDGDEAESKTELICRKIRNKYGIQLDTMGLDPELRKV----VEEQ 1045

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQG 1247
               L  +L  LS ELY+++ H++ E++QN DDN Y  +VEP+L   L E  I   NNE G
Sbjct: 1046 RKMLKNSLKLLSTELYAKEIHYIHEIIQNCDDNSYTGDVEPSLIIRLTEELIEFENNEIG 1105

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF--DTSEG 1305
            FS  ++ ++C +G S+ K +  G+IG KGIGFKSVF++   P+IHSN FH +   +  EG
Sbjct: 1106 FSERDVESICSIGQSS-KANRKGFIGHKGIGFKSVFKICSKPKIHSNEFHFELAAEAKEG 1164

Query: 1306 Q----IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
            +    + +++P  +PP             P    +    T I LP  +          I 
Sbjct: 1165 EESAGLEYIVPQPIPP-------------PAGWRAGS-GTKIVLPIASS-----RRREIQ 1205

Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNM-LNDSLVVIRK-----KIVGD----GIIKVSCGED 1411
                DL  ++LLFL +L+ I   +    D  V IR+     +  G+     + KV  GE+
Sbjct: 1206 ACIKDLEETILLFLRKLRVIEIHDARAQDGSVHIRRMRREAETEGNFDLVSLTKVVDGEE 1265

Query: 1412 -KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGL 1470
                W V   K        D    ++A+    +   E +Y     Q+ VFA+LP+R YG 
Sbjct: 1266 SSQDWLVL--KTNVAEKTQDDLCLQLAVPFMHELGAEQDY----QQRDVFAYLPVRCYGF 1319

Query: 1471 KFILQGDFVLPSSREEVDGNSPWNQWL----LSEFPALFVSAERSFCDLPCFRENPAKAA 1526
            +FI+ GDF + SSRE VD     N  L       FP L    +  +  L   ++N     
Sbjct: 1320 RFIVNGDFAVTSSRESVDTMDSRNLRLRDGIARHFPTLIRELQERY--LSATKQNQEAER 1377

Query: 1527 SVY--------MSFVPLVGEVHGFFSGLPRMI------LSKLRMSNCLILEGNNNQWAPP 1572
             V+        MS +P  GEV+ FF+ +P  I      L  + +     ++     W P 
Sbjct: 1378 DVHVGEYTRVVMSVLPTPGEVNDFFAVVPDQIRRAIDSLELVPLRTGKFVKARCAVWRP- 1436

Query: 1573 CKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGI---EEYGPKILLQII 1629
                R       S L ++L ++  GLGF+   +     L   LGI   ++  P IL  + 
Sbjct: 1437 ----RKVESEVISFLEEMLAEE--GLGFVHDAVAKYQVLGEHLGIRRLQQVFPSILKHLE 1490

Query: 1630 SSLCRT-ENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLS 1688
              L  +  + L    L+W+ + L +   ++                L + LQR   IPL+
Sbjct: 1491 QRLQSSLSDDLTMECLAWIFNELRDEEDVAV---------------LREVLQR-KLIPLA 1534

Query: 1689 DGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISV 1748
             G F    E ++++  D  VFD    L+  P+     R + P  L A  +     G    
Sbjct: 1535 SGDFVRPQEKSVYVLQD--VFD--LSLDPSPS----FRILHP--LFAEKIQMYGRG---- 1580

Query: 1749 DNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNC 1808
              L   L ++ V   +A   ++  ++P + D+ +   D   +   L  +   LE      
Sbjct: 1581 --LKSFLKEMNVVWNNAEAFLRSQLMPVLHDQESEEKD---LVTGLILLKKVLEQQSEQV 1635

Query: 1809 HVEREFIVSELRKKAFVLTN-HGFKRPSEIPIHFGKEFGNPVSVNMLIHDI--------- 1858
               R  ++ ++R+K  VL +  G +  S   +H G  +         + DI         
Sbjct: 1636 ---RRLMLLDIRQKGIVLLDEQGDRVWSCNALHQGPSYSKHAD---RVTDIVGRSSGSMS 1689

Query: 1859 DIKWYEVDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVADISHTGFKNM 1918
             + W  V   YL+   KE  + G   WR  F  +GI  F  V +                
Sbjct: 1690 PVPWKLVSSKYLE---KEQDAGG---WRAVFLGLGILTFCHVRE---------------- 1727

Query: 1919 WTKELLSPGSAAIDWESNELVHLL-SLLTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFF 1977
                   PG     + S E+  L  S++  +   +  +  L++  +L  D + ++     
Sbjct: 1728 -------PG-----YTSEEMCSLYRSIVEADGAPEPRRRRLQLFFSLLSDVWQEEY---- 1771

Query: 1978 KSKPTGDDRSFQSSFINCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPK 2037
              +  G     + +F+  +   +  +   D +LH P+ L    + V  +LG  A  I P 
Sbjct: 1772 -EQALGLPEEGEVAFLTFLRQ-EAKLPGTDGQLHPPQLLLQRTEHVYRVLG--AKGIYPD 1827

Query: 2038 VKS--EKLVCDIGLKTEVTIDDILEILKVW----TRLEAPFMARYY 2077
            V +   KL   +G+ +++   +I + L+ W    T L   FM   Y
Sbjct: 1828 VDNLDVKLSQRLGISSQLQPSNIFDQLEHWRSSSTPLSLSFMESVY 1873


>gi|302905878|ref|XP_003049358.1| hypothetical protein NECHADRAFT_82646 [Nectria haematococca mpVI
            77-13-4]
 gi|256730293|gb|EEU43645.1| hypothetical protein NECHADRAFT_82646 [Nectria haematococca mpVI
            77-13-4]
          Length = 2237

 Score =  186 bits (473), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 177/561 (31%), Positives = 254/561 (45%), Gaps = 106/561 (18%)

Query: 1178 NMESNMLKKQHARLGRALH------CLSQELYSQDSHFLLELVQNADDNIYPENVEPTLT 1231
            N  +  + K ++ +GR L        LS+ELY + +HFLLEL+QNA+DN Y ++V P+L+
Sbjct: 23   NTVNGTITKHNSPVGRLLENALEACSLSRELYEKQTHFLLELIQNAEDNTY-DDVTPSLS 81

Query: 1232 FILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--SSAGYIGRKGIGFKSVFRVTDAP 1289
            F  +   + V  NE GF+  N+RA+C +G STK G   SA Y+G KGIGFKSVF+V    
Sbjct: 82   FTYKRGSLRVDCNETGFTERNVRAICKIGESTKAGLGQSARYVGEKGIGFKSVFKVASEV 141

Query: 1290 EIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT 1349
             I S  F  KFD ++  +G + P             + S+ P  +            F  
Sbjct: 142  YISSRSFSFKFDQAQA-VGLIAP-------------IWSEFPEPIRPGY------TSFYL 181

Query: 1350 KFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV---VIRKK--IVGDG-I 1403
            K SE     +I      L P++LLFL RL+ I  +  L D       +R++  +  D  I
Sbjct: 182  KLSENTDEEHIARDIKRLDPTMLLFLRRLREINLKATLMDGSTWSQTLRRQDALTQDSHI 241

Query: 1404 IKVSCGEDKMTWFVAS--------QKLRAGVIRPDVKTTEIALAL-TLQESNEGNYGPLL 1454
             ++  G     + V          +K R GV       +EI+LA   L E++EG      
Sbjct: 242  TQLHRGPTVSDYIVVKHLVSNLPEEKGRVGVTE-----SEISLAFPPLHETSEG------ 290

Query: 1455 YQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
            +   V+AFLP+R YG KF+LQGDF+L ++RE+V  N PWN+ L       FV A      
Sbjct: 291  HTHAVYAFLPIRDYGFKFLLQGDFILTANREDVPDN-PWNRALREACADAFVRA------ 343

Query: 1515 LPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCK 1574
            +      P + A  ++S++P    + GFF  L  MIL KL  S   ILE  + +   P  
Sbjct: 344  IHHLNSGPRRYA--WISYIP-GRHILGFFEPLRDMILEKL--SQEPILEACSRKMVAPAT 398

Query: 1575 VLRGWNDRAHSLLPDILLQKHLGLGFL----SKDIVLSDS-------LARALGIEEYGPK 1623
            +         S +P        GL       +KD+ LS S       L   LG +     
Sbjct: 399  L---------SYVPLSRFTDDDGLPLTLSPSTKDLYLSCSYPALDYTLLSRLGTQTLDSN 449

Query: 1624 ILLQIISSLCRTE-NGLRSMGLSW---LASWLNELYTISFHSSGQSSLQSGVETDLIDNL 1679
              LQ +SS+  T     R     W   LA+ L  LY    H                D +
Sbjct: 450  SFLQHLSSMISTRPQHFRQKPREWHEKLANVLIALYKAERHR---------------DCI 494

Query: 1680 QRIPFIPLSDGTFSSVDEGTI 1700
            + +  IPLS+G + S D G I
Sbjct: 495  KGLEIIPLSNGEWVSADTGPI 515


>gi|302419583|ref|XP_003007622.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261353273|gb|EEY15701.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 1505

 Score =  185 bits (470), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 153/517 (29%), Positives = 233/517 (45%), Gaps = 86/517 (16%)

Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLE 1212
            +E ED    +  IRRD+ GLG     + +N      A L  AL  LS +LY   +HFLLE
Sbjct: 10   REAEDH---IRQIRRDK-GLGDGPGQIGNN-----GADLESALKVLSHDLYQTATHFLLE 60

Query: 1213 LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--SSAG 1270
            L+QNADDN+Y  +  PTL    +   + +  NE+GFS +NI A+C +  STK G   SAG
Sbjct: 61   LIQNADDNMYTADA-PTLAITYRPGRVRIDCNERGFSRKNIEAICRICQSTKSGRSKSAG 119

Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKD 1330
            ++G KGIGFK+VF+V     I S  +  +FD  +G +G + P                 D
Sbjct: 120  FVGEKGIGFKAVFKVASTVWISSGHYSFRFD-RDGHLGMIAPIW---------------D 163

Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM-------- 1382
                  K   T I L      +EG     IVD        +L+FL RLQ +         
Sbjct: 164  AFPEAPKPGFTSIYLKLARDCNEG----GIVDELRSYDAKILIFLRRLQRLEIDVQREFW 219

Query: 1383 ----FRNMLNDSLVVI----RKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
                F+ +L+    +        ++ DG+ K       + W     +L     RP + ++
Sbjct: 220  KSGDFKTVLSRQATISGNPSMTTLMNDGVKK-----QYLVWRHTVNRLPNDARRPGISSS 274

Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWN 1494
            E+ +A  L +  E    PL+ +Q V+AFLP+R Y  KF+LQ DF+L ++RE+V  + PWN
Sbjct: 275  EVVIAFPLDKDGET---PLIERQSVYAFLPIRDYDFKFLLQADFLLSANREDVHADLPWN 331

Query: 1495 QWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH-GFFSGLPRMILSK 1553
              L +     F+ A R   +L        K    +  FV      H G F+ L R +L+ 
Sbjct: 332  LALTTAVQKAFLDAVRHMSNLSN------KLRYTWFRFVTCSYSAHLGIFADLQRKLLAD 385

Query: 1554 LRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQ---KHLGLGFLSKDIVLSDS 1610
            L+ +   +L+    +   P K+ R         +P+I      +   L + + D  LS  
Sbjct: 386  LQKTQ--LLDSTWGRKKKPTKLTR---------VPEIFCDNDGRPFTLHYKNDDRYLSPK 434

Query: 1611 LA---------RALGIEEYGPKILLQIISSLCRTENG 1638
             +         RALG+++  P+  +  I  L +   G
Sbjct: 435  YSQDEPDASSLRALGVKDITPEAFMNEIDKLLKKHRG 471


>gi|389750862|gb|EIM91935.1| hypothetical protein STEHIDRAFT_151290 [Stereum hirsutum FP-91666
            SS1]
          Length = 1525

 Score =  183 bits (465), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 129/366 (35%), Positives = 190/366 (51%), Gaps = 42/366 (11%)

Query: 1161 IVESIRRDEFGL-GPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
            +V+SIR+D  GL  PN + +E          LGRAL  L+ +LY  D+HF+LE VQNA+D
Sbjct: 3    VVQSIRKD-LGLDSPNPTELER--------MLGRALEHLADQLYRTDTHFVLEFVQNAED 53

Query: 1220 NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
            N Y  +V PTL+  +++  + +  NE+GF   N+RALC++G STKK    G+IG KGIGF
Sbjct: 54   NAYLSDVTPTLSLRIEDYTMTISCNERGFDESNVRALCNIGTSTKK-HLQGFIGEKGIGF 112

Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
            KSVF+V +   + S  F  KFD  + ++G + P     +  D+           LE    
Sbjct: 113  KSVFKVANVVRVTSGDFRFKFDRDQ-RLGMIAPI----WEDDILPVYPGWTEFHLE---- 163

Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND----SLVVIR 1395
                 LP +T  + G A+++ ++    +  SLLLFL +L+CI     LND     ++ + 
Sbjct: 164  --LTSLPGQTSITPGTALSSKLEQ---IRASLLLFLKKLRCIEVSFALNDPEDPRIITLF 218

Query: 1396 KKIVGDGIIKVS-CGEDK--------MTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
            +  VG  I+K++ C  D+        M  F A         R  V  +EI LA  + E  
Sbjct: 219  RHDVGPDIVKLTHCEHDRPIRSDFYLMISFSADMSAIEEPKRQGVNDSEIVLAFPVTEDI 278

Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
            +  + P    Q V  FLPLR  G  F++Q DF+  SSRE++  +S WN+ LL      F+
Sbjct: 279  QPVFSP----QDVHVFLPLRPVGFNFVIQADFLTSSSREDILADSYWNKTLLYGVVDAFI 334

Query: 1507 SAERSF 1512
             A   F
Sbjct: 335  QAIDRF 340


>gi|378727103|gb|EHY53562.1| hypothetical protein HMPREF1120_01751 [Exophiala dermatitidis
            NIH/UT8656]
          Length = 1492

 Score =  182 bits (462), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 165/584 (28%), Positives = 251/584 (42%), Gaps = 104/584 (17%)

Query: 1150 PEDKECEDAALIVESIRRDEFGL------GPNISNMESNMLKKQHARLGRALHCLSQELY 1203
            P   + + A   V++IR+ E GL        N+ N+E+            ALH L+ ELY
Sbjct: 4    PHIDQSDRARKHVDAIRK-EAGLDNGQDSSRNVQNLEN------------ALHTLADELY 50

Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
             + +HFLLEL+QNADDN Y    EP +T   ++  + +  NE+GFS +N+ A+C VG+ST
Sbjct: 51   HKSTHFLLELIQNADDNEY-LVAEPEITLTYKDRRLRIDCNERGFSPQNVEAICRVGSST 109

Query: 1264 KKGS--SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE--GQIGFVLPTLVPPFN 1319
            K G   ++ Y+G KGIGFKSVF+  D   I SN +  KFD     G I  +      P  
Sbjct: 110  KAGQDRASSYVGEKGIGFKSVFKAADIVWISSNSYTFKFDKRNVLGMIAPIWEAFPEP-- 167

Query: 1320 IDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
                      DP       W + +      + S G     ++       P +L+FL RL+
Sbjct: 168  ---------TDP------SWTSML-----LQLSPGYEEKQLLHDLKTFDPRMLIFLRRLK 207

Query: 1380 ---CIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMT--WFVAS---QKLRAGVIRPDV 1431
                ++ R        + +K    D    +   +D  T  + V+S   + L A   RP  
Sbjct: 208  KVKIVIHRAGPVWQTTLTKKLQRNDQETAIELTQDHKTLRYMVSSYLAENLPAEPKRPGA 267

Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
            ++++I LA       + N    +  Q V+AFLP+R YG KF+LQ DF+L ++REE+    
Sbjct: 268  RSSQIMLAFP-----DCNGAEAIESQKVYAFLPVRDYGFKFLLQADFLLSANREEILAYP 322

Query: 1492 PWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMIL 1551
             WNQ L   F   FV+  + +C          + + ++  ++PL      FF    + IL
Sbjct: 323  QWNQALRQAFVDAFVAVTKEYCQ--------GRISHLWPQYIPLRPSQTRFFESARKEIL 374

Query: 1552 SKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSK-------- 1603
               R+S   +L    N+   PC +L          +PD    K  G+   S         
Sbjct: 375  E--RLSREPVLRCRTNELMTPCSLLD---------VPDEFCDKE-GVPMTSTPTSDRAYL 422

Query: 1604 DIVLSDSLA---RALGIEEYGPKILLQIISSLCRTENGL--RSMGLSWLASWLNELYTIS 1658
              V   SL      LG+ +    I L  ++S     + L        W       L  +S
Sbjct: 423  SCVYPTSLTENLEVLGVRKMTASIFLDHLASFITQHSELFREQKPDGWHTRLAEILLKLS 482

Query: 1659 FHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWL 1702
            F S+              D +  +P IPLSDG + S  E  ++L
Sbjct: 483  FDSANH------------DRIAELPVIPLSDGRWVSAKEDYVFL 514


>gi|383762367|ref|YP_005441349.1| hypothetical protein CLDAP_14120 [Caldilinea aerophila DSM 14535 =
            NBRC 104270]
 gi|381382635|dbj|BAL99451.1| hypothetical protein CLDAP_14120 [Caldilinea aerophila DSM 14535 =
            NBRC 104270]
          Length = 1539

 Score =  181 bits (459), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 163/563 (28%), Positives = 252/563 (44%), Gaps = 77/563 (13%)

Query: 1165 IRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE 1224
            IRR++F L  + + +  N L K    L  +   LS +LYS+D+HF+LEL+QNA+DN YP 
Sbjct: 18   IRREKFSLDADGTLLSQNPLAKD---LHNSAERLSADLYSKDTHFVLELIQNAEDNAYPP 74

Query: 1225 NVEPTLTFIL----------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGR 1274
             V+P LTF L              +++LNNE G   E++ ALC VG STK     GYIG 
Sbjct: 75   GVQPDLTFCLLPEDPTATPGAHGALLILNNECGLRPEDVDALCAVGRSTKT-KRQGYIGE 133

Query: 1275 KGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ---IGFVLPTLVPPFNIDMFCRLLSKDP 1331
            KGIGFKSVF+VT  P + S G+  +FD        +G+++P  V               P
Sbjct: 134  KGIGFKSVFKVTKRPHLFSAGYQFRFDRDPDPAIGLGYIVPYWV------------DDVP 181

Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
             ++ S+   TCI LP      EG     ++     + P  +LFL RL+ +    +  DS 
Sbjct: 182  PEVASQHDKTCILLP----LDEG-QWEPVIAQLQRIAPETILFLSRLEGLTV-ELPGDSP 235

Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPD-----VKTTEIALALTLQESN 1446
            + +        ++ +  GE   T++V  +++R    + D     V+T  +++A  L  + 
Sbjct: 236  LRLLIDKSKAPLVDLMDGETLATYWVHQREVRCPPDQGDEKRIGVETRTVSIAFPLARAR 295

Query: 1447 EGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALF 1505
            E  Y        VFA+LP     G  F++  DF+L SSRE +  + PWN WL  E    F
Sbjct: 296  EPIYS-------VFAYLPTEMQSGYPFLINADFILTSSREAILVDRPWNVWLRDEIVPCF 348

Query: 1506 VSAERSFC-DLPCFRENPAKAASVYMSFVPLVGEVH-GFFSGLPRMILSKLRMSNCLILE 1563
            V    +   DL  +++           F+PL  +V   FF      I  KL     + + 
Sbjct: 349  VEGFVALVRDLRYWQQA--------YGFIPLADKVQERFFQPPVERIHQKLANLPVVWVV 400

Query: 1564 GNNNQWAPPCKVLRGWNDRA----HSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEE 1619
            G     APP   L     R        LP+ L Q  L    +  ++   +   RA+G++ 
Sbjct: 401  GKQEPVAPPYARLAPLTFRKLLDDGRPLPEQLTQTLL----VRPELEQFEKQLRAIGVKA 456

Query: 1620 YGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNL 1679
              P      + +  +    L+     W   WL  +Y         S  QS   +     L
Sbjct: 457  LQP----DEVRACLQDGAWLKDRPPQWFV-WLY-MYLQKEDWHAPSPWQSESRS-----L 505

Query: 1680 QRIPFIPLSDGTFSSVDEGTIWL 1702
            + +P IP ++G  ++V + T++L
Sbjct: 506  RDLPIIPTTEGRLAAVGKETVYL 528


>gi|358347408|ref|XP_003637749.1| hypothetical protein MTR_101s0015 [Medicago truncatula]
 gi|355503684|gb|AES84887.1| hypothetical protein MTR_101s0015 [Medicago truncatula]
          Length = 1586

 Score =  181 bits (458), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 179/588 (30%), Positives = 276/588 (46%), Gaps = 110/588 (18%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR +F +G N + +  +        L  A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    VEEIRRTKFSIGGNPNPLTED--------LHHAVKNLSAELYAKDVHFLMELIQNAEDNH 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y E VEPTL F++    I         ++ NNE+GFS +NI ++C VG STKKG+ S+GY
Sbjct: 60   YNEGVEPTLEFVITSDDITATGAPATLLIFNNEKGFSPKNIESICSVGRSTKKGNRSSGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF V+  P I SNG+ ++F+        +G+++P  V   P  +D+    
Sbjct: 120  IGEKGIGFKSVFLVSAQPYIFSNGYQIRFNEMPCPHCSLGYIVPEWVEEKPTLVDI---- 175

Query: 1327 LSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN 1385
              K       K    T I LP +        +N +    S +HP +LLFL +++ +  R 
Sbjct: 176  --KQIYDGAGKALPTTTIVLPLKPD-----KVNPVKQQLSSVHPEVLLFLTKIRHLSVRE 228

Query: 1386 MLND---SLV-----------VIRKKIVGDGII------KVSCGEDKMTWFVASQK--LR 1423
            +  +   S+V           V RK +  +         + S  E + ++++  QK  +R
Sbjct: 229  VNENPGQSMVTAVSISSEINFVTRKNMNAESYTLYLSAEENSNNEKECSYYMWKQKFPVR 288

Query: 1424 AGVI---RPDVKTTEIALALTLQES-NEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDF 1478
            +  +   R DV+   + LAL  QE  + G   P      V+AFLP        FI+Q DF
Sbjct: 289  SESVVERRTDVEEWVVTLALPNQERLHRGKSSP-----GVYAFLPTEMVTNFPFIIQADF 343

Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE 1538
            VL SSRE +  ++ WNQ +L   P  F+ A ++   +    + P  + +    F+P+   
Sbjct: 344  VLASSRETILLDNKWNQGILECVPIAFMDAFKTL--VVGSDQAPVSSLARMFKFLPIESS 401

Query: 1539 VHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLRGWNDRAHSLLPDI--LLQK 1594
                F+ +   I  KL   N + +E    Q  +  P +V R        LLP+   +L K
Sbjct: 402  PFEKFNYVRDKIKEKLVNENIVPIETYKEQKHFYKPGEVSR--------LLPEFWNILSK 453

Query: 1595 HLGLGFLSKDIVLSDS---LARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWL 1651
              G G    ++   D    L+ +    EY       +++ L     G++S+ + W A  +
Sbjct: 454  ARGEGVYLLNLSSHDGRKILSSSFDKSEYD-----NVLNFL-----GVKSVTVDWYAKCI 503

Query: 1652 NELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGT 1699
                         S+L  GV  DL   LQ + FI      +SS+ EGT
Sbjct: 504  Q-----------SSNLVDGVSEDLY--LQLLLFIA---KNWSSIFEGT 535


>gi|302775136|ref|XP_002970985.1| hypothetical protein SELMODRAFT_411712 [Selaginella moellendorffii]
 gi|300160967|gb|EFJ27583.1| hypothetical protein SELMODRAFT_411712 [Selaginella moellendorffii]
          Length = 1543

 Score =  179 bits (454), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 191/695 (27%), Positives = 308/695 (44%), Gaps = 139/695 (20%)

Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
            + +E +RRD FG+G +  N  + ML++        +  LS ELY +D HFL EL+QNA+D
Sbjct: 6    MFIEHLRRDRFGIGQSGPNPLALMLQE-------TIRLLSAELYQKDIHFLSELLQNAED 58

Query: 1220 NIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SA 1269
            N Y  +VEP++   L ++ +         ++LNNE GFS +NI +LC  G STKKG  S 
Sbjct: 59   NTYHSSVEPSIEIYLTKTDVTGTGAAATLLLLNNEVGFSEKNIESLCGAGQSTKKGKKSQ 118

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRL 1326
            GYIG KGIGFKSVF VT  P I +NGF ++FD    +E ++G+++P  V      +  ++
Sbjct: 119  GYIGEKGIGFKSVFLVTKQPYIITNGFRIRFDEEPHNEAKLGYIVPEWVDRPTDAVLQKV 178

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
             SK  +        T I LP R +      ++ +    + +    LLFL +++ +   + 
Sbjct: 179  CSKTELP------ATVIILPLRQE-----KISEVAKQLTQIPAETLLFLSKIRMLSIYDQ 227

Query: 1387 LN---DSLVVIRK---KIVGDGIIKVSC----------GEDKMTWFVASQKLR----AGV 1426
             +     LV+ R        +G ++ S           G     +++  Q L+    A V
Sbjct: 228  QSRPPTELVLSRSVEMTTSKNGSVETSTTFLTASNGRNGTQHSNYYMFKQTLKVPEEAKV 287

Query: 1427 -IRPDVKTTEIALALTLQE--SNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPS 1482
              R DV +  I LA  + +  S+  + G       +F+FLP   + GL F++  DF+L S
Sbjct: 288  EKRKDVDSWTITLAFPVDDRISSSCSIG------DIFSFLPSDIHSGLPFLINSDFLLVS 341

Query: 1483 SREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCF--RENPAKAASVYMSFVPLVGEVH 1540
            SRE +  +SPWNQ +    P +F +A     +   F  +  P+ A      +VP+    +
Sbjct: 342  SRETLLFDSPWNQGIFRCVPEVFYNAFELLLNTTAFGLQSIPSVARMAMYKYVPVTACRN 401

Query: 1541 GFFSGLPRMILSKLRMSNCLILEGNNNQWAPP----CK---------------------- 1574
             +   +   ILSKL+ +N ++   +  + APP    CK                      
Sbjct: 402  SYLERVRNDILSKLKDNNVILCASDVGR-APPSGQLCKPALARKIAPSFRELLKAAPKCP 460

Query: 1575 -VLRG--WN-DRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIIS 1630
              LRG  W   R+ S + D  LQ++ G            S+ R LG+ +       Q + 
Sbjct: 461  RFLRGSPWFLRRSSSYIVDSTLQENYG------------SVLRLLGVPDMSNDEYHQSLD 508

Query: 1631 SLCRTENGLRSMG-------LSWLASWLNELYTISF--HSSGQSSLQSGVETDLIDNLQR 1681
            S    E+ +RS+        L +L+S  + L ++ F  +      LQ     DL  N  R
Sbjct: 509  S----EDWMRSLPDDAYVKLLVFLSSKSSTLRSLRFLKYRGSDGRLQLASMDDLARNDGR 564

Query: 1682 IPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLC-AKLRT------------V 1728
              +       + S    T WL     VF         P+ C AK+R              
Sbjct: 565  SMY-------WDSGHRATAWLAGWTEVFQRWLTFNFVPSACAAKIRQHSVLENWLERTPF 617

Query: 1729 SPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQL 1763
            S + L+A ++ +  LG ++ ++ N  L  + +  L
Sbjct: 618  SVSELTACSLARQVLGAVTANSSNAKLAVLAMHFL 652


>gi|297819496|ref|XP_002877631.1| ATP binding protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323469|gb|EFH53890.1| ATP binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1902

 Score =  178 bits (451), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 141/435 (32%), Positives = 212/435 (48%), Gaps = 77/435 (17%)

Query: 1146 TQTLP-EDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYS 1204
            T+T P  +   E A   VE IRR +F +G      E+ + +  H    +A+  LS ELY+
Sbjct: 188  TKTKPVRETMAESAKQHVERIRRTKFSIG----GAENPLTEDLH----QAVKNLSAELYA 239

Query: 1205 QDSHFLLELVQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRA 1255
            +D HFL+EL+QNA+DN YPE V+P+L F++    I         ++ NNE+GFS +NI +
Sbjct: 240  KDVHFLMELIQNAEDNEYPEGVDPSLEFVITSDDITATGAPATLLIFNNEKGFSEKNIDS 299

Query: 1256 LCDVGNSTKKGSSA-GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVL 1311
            +C VG STKKG+   GYIG KGIGFKSVF +T  P I SNG+ ++F+    S   +G+++
Sbjct: 300  ICSVGRSTKKGNRKRGYIGEKGIGFKSVFLITSQPYIFSNGYQIRFNEAPCSHCSLGYIV 359

Query: 1312 PTLVP--PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
            P  V   P  +D+  R+         S    T I LP +        +  + +  S++HP
Sbjct: 360  PEWVEQHPSLVDI-QRMYGSG-----SALPTTTIILPLKYD-----KVKPVKEQLSNVHP 408

Query: 1370 SLLLFLHRLQCIMFR--------------NMLNDSLVVIRKKIVGDGII-------KVSC 1408
             +LLFL +++ +  R               +++++  V RK I  +          K   
Sbjct: 409  EVLLFLSKIKRLSIREHCLDAKLSTVNSIGIVSETNFVTRKSIDAESYTIHLSASEKGKN 468

Query: 1409 GEDKMTWFVASQKLRAGV-----IRPDVKTTEIALALTLQES-NEGNYGPLLYQQPVFAF 1462
             E + ++++  QK           R +V    I LA    E    GN  P +Y     AF
Sbjct: 469  SEQECSYYMWKQKFPVKHENRVDRRSEVDDWVITLAFPFGERLGRGNSSPGIY-----AF 523

Query: 1463 LPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD------- 1514
            LP        FI+Q DF+L SSRE +  +  WNQ +LS  P  F++A  +          
Sbjct: 524  LPTEMVTNFPFIIQADFILASSRETILLDDIWNQGILSCVPIAFLNAFTTLVKKTDAPVS 583

Query: 1515 --LPCFRENPAKAAS 1527
              LP FR  P K ++
Sbjct: 584  SVLPAFRFLPVKESN 598


>gi|125577277|gb|EAZ18499.1| hypothetical protein OsJ_34026 [Oryza sativa Japonica Group]
          Length = 1674

 Score =  178 bits (451), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 142/451 (31%), Positives = 221/451 (49%), Gaps = 71/451 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ F +GP     E N L +    + +A+  LSQE+YS+D HFL+EL+QNA+DN 
Sbjct: 8    VERIRRERFFIGPG----ERNPLAED---MHQAVKYLSQEIYSKDVHFLMELIQNAEDND 60

Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP  V P+L F++           S +++ NNE+GFS  N+ ++C VG STKKG+   GY
Sbjct: 61   YPSGVAPSLEFLITSKDITGLGAPSTLLIFNNEKGFSPSNVESICRVGKSTKKGNRHQGY 120

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF ++  P I SNG+ +KF+   +SE  IG+++P  V   P   D+    
Sbjct: 121  IGEKGIGFKSVFLISRQPHIFSNGYRIKFNEDPSSECNIGYIVPEWVESKPSLSDIQELH 180

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
             S  P+        T I LP +++      ++ +    S +HP +LLFL +++ +  R  
Sbjct: 181  GSSKPLP------TTTIILPLKSE-----KVDVVKKQLSSIHPEILLFLTKIRRLSVRED 229

Query: 1385 ------------NMLNDSLVVIRKKIVGDG-IIKVSCGEDK----MTWFVASQKLRAGV- 1426
                        +M ++    +RK I  +   + +S  E+K      +++  QK      
Sbjct: 230  NSDPKCSTINEISMESEKNYQVRKNIHAESYTLHLSAQENKEQEECGYYMWRQKFPVKPE 289

Query: 1427 ----IRPDVKTTEIALAL----TLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
                +R D+    I LA      L    + + G       V+AFLP        FI+Q D
Sbjct: 290  NRVDMRADIDEYVITLAFPHGQRLSRGKQSSPG-------VYAFLPTEILTNFPFIIQAD 342

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
            F+L SSRE +  +S WN+ +L   P+ F++A  +        + PA +     +F+P+  
Sbjct: 343  FLLASSRETILFDSMWNKGILECVPSAFLNAFVALVKSSA--DAPAMSLPSMFNFLPVHP 400

Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
                F   +   I  K+R  N +  E    Q
Sbjct: 401  SHVPFLEPVRSAIKDKVRTENIMPCESYTLQ 431


>gi|15229022|ref|NP_190446.1| ATP/DNA binding protein [Arabidopsis thaliana]
 gi|7576215|emb|CAB87905.1| hypothetical protein [Arabidopsis thaliana]
 gi|332644933|gb|AEE78454.1| ATP/DNA binding protein [Arabidopsis thaliana]
          Length = 1899

 Score =  177 bits (450), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 137/424 (32%), Positives = 209/424 (49%), Gaps = 76/424 (17%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
            E A   +E IRR +F +G      E+ + +  H    +A+  LS ELY++D HFL+EL+Q
Sbjct: 194  ESAKQHIERIRRTKFSIG----GAENPLTEDLH----QAVKNLSAELYAKDVHFLMELIQ 245

Query: 1216 NADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKG 1266
            NA+DN YPE V+P+L F++    I         ++ NNE+GFS +NI ++C VG STKKG
Sbjct: 246  NAEDNEYPEGVDPSLEFVITSEDITNTGAPATLLIFNNEKGFSEKNIESICSVGRSTKKG 305

Query: 1267 S-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVP--PFNI 1320
            +   GYIG KGIGFKSVF +T  P I SNG+ ++F+    S   +G+++P  V   P  +
Sbjct: 306  NRKCGYIGEKGIGFKSVFLITSQPYIFSNGYQIRFNEAPCSHCSLGYIVPEWVDQHPSLV 365

Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
            D+  R+         S    T I LP ++       +  + +  S++HP +LLFL +++ 
Sbjct: 366  DI-QRMYGSG-----SALPTTTIILPLKSD-----KVKPVKEQLSNVHPEVLLFLSKIKR 414

Query: 1381 IMFR--------------NMLNDSLVVIRKKIVGDGII-------KVSCGEDKMTWFVAS 1419
            +  R               +++++  V RK I  +          K    E + ++++  
Sbjct: 415  LSIREHCLDPKLSTVNSIGIVSETNFVTRKSIDAESYTIHLSASEKGKNSEQECSYYMWR 474

Query: 1420 QKLRAGV-----IRPDVKTTEIALALTLQES-NEGNYGPLLYQQPVFAFLPLR-TYGLKF 1472
            QK           R +V+   I LA    E    GN  P +Y     AFLP        F
Sbjct: 475  QKFPVKHENRVDRRSEVEEWVITLAFPFGERLGHGNNSPGIY-----AFLPTEMVTNFPF 529

Query: 1473 ILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD---------LPCFRENPA 1523
            I+Q DF+L SSRE +  +  WNQ +L+  P  F++A  S            LP FR  P 
Sbjct: 530  IIQADFILASSREMILLDDIWNQGILNCVPLAFLNAFTSLVKTTDAPVSSLLPAFRFLPV 589

Query: 1524 KAAS 1527
            K ++
Sbjct: 590  KESN 593


>gi|218186840|gb|EEC69267.1| hypothetical protein OsI_38308 [Oryza sativa Indica Group]
          Length = 1815

 Score =  176 bits (447), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 144/424 (33%), Positives = 211/424 (49%), Gaps = 84/424 (19%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ F +G      E N L +    + +A+  LSQELYS+D HFL+EL+QNA+DN 
Sbjct: 16   VERIRRERFYIG----REERNPLAED---IHQAVTYLSQELYSKDLHFLMELIQNAEDNE 68

Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP +VEP L F++           S ++V NNE+GFSA N+ ++C +G STKKG+   GY
Sbjct: 69   YPSDVEPALEFVIIKKDITATGAASTLLVFNNERGFSAANVESICRIGKSTKKGNRHLGY 128

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV---PPFNIDMFCR 1325
            IG KGIGFKSVF V+  P I SNG+ +KF+   + +  IG+++P  V   P  +      
Sbjct: 129  IGEKGIGFKSVFLVSSQPHIFSNGYQIKFNEEPSEDCDIGYIVPKWVDEKPSIDDIHAVY 188

Query: 1326 LLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN 1385
              SK       K   T I LP ++       +  + +  S  HP +LLFL +++ +  R 
Sbjct: 189  GYSK-------KLPTTTIILPLKSD-----KILPVKNELSSTHPEILLFLSKIRQLSVRE 236

Query: 1386 MLND------SLVVIRKKI---------VGDGIIKVSCGEDKM------TWFVASQKLRA 1424
            + +D      S + I  ++              + ++  E+K       T+++  QK   
Sbjct: 237  INDDPKASKLSQISISSEVNYKTQKDINAESYTLHLAMQENKRGDKEECTYYMWKQKF-- 294

Query: 1425 GVIRP---DVKTTEIA-LALTL-----QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFIL 1474
             V++P   D K  E+    +TL     Q  + G   P      V+AFLP      L FI+
Sbjct: 295  -VVKPECRDKKRMEVGKWVITLAFPHGQRLSRGARSP-----GVYAFLPTEMVTNLPFII 348

Query: 1475 QGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----------ERSFCDLPCFRENPAK 1524
            Q DF+L SSRE +  +S WN+ +L   P  FVSA            SF   P FR  P  
Sbjct: 349  QADFLLASSRESILLDSQWNRGILDCVPYAFVSAFEALLKSSSNAPSFALPPIFRFLPVA 408

Query: 1525 AASV 1528
             +S+
Sbjct: 409  TSSI 412


>gi|222617066|gb|EEE53198.1| hypothetical protein OsJ_36070 [Oryza sativa Japonica Group]
          Length = 1815

 Score =  176 bits (447), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 144/423 (34%), Positives = 209/423 (49%), Gaps = 82/423 (19%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ F +G      E N L +    + +A+  LSQELYS+D HFL+EL+QNA+DN 
Sbjct: 16   VERIRRERFYIG----REERNPLAED---IHQAVTYLSQELYSKDLHFLMELIQNAEDNE 68

Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP +VEP L F++           S ++V NNE+GFSA N+ ++C +G STKKG+   GY
Sbjct: 69   YPSDVEPALEFVIIKKDITATGAASTLLVFNNERGFSAANVESICRIGKSTKKGNRHLGY 128

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV---PPFNIDMFCR 1325
            IG KGIGFKSVF V+  P I SNG+ +KF+   + +  IG+++P  V   P  +      
Sbjct: 129  IGEKGIGFKSVFLVSSQPHIFSNGYQIKFNEEPSEDCDIGYIVPKWVDEKPSIDDIHAVY 188

Query: 1326 LLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN 1385
              SK       K   T I LP ++       +  + +  S  HP +LLFL +++ +  R 
Sbjct: 189  GYSK-------KLPTTTIILPLKSD-----KILPVKNELSSTHPEILLFLSKIRQLSVRE 236

Query: 1386 MLND------SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG-------------- 1425
            + +D      S + I  ++  D   +     +  T  +A Q+ + G              
Sbjct: 237  INDDPKASKLSQISISSEV--DYKTQKDINAESYTLHLAMQENKRGDKEECTYYMWKQKF 294

Query: 1426 VIRP---DVKTTEI-ALALTL-----QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQ 1475
            V++P   D K  E+    +TL     Q  + G   P      V+AFLP      L FI+Q
Sbjct: 295  VVKPECRDKKRMEVDKWVITLAFPHGQRLSRGARSP-----GVYAFLPTEMVTNLPFIIQ 349

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----------ERSFCDLPCFRENPAKA 1525
             DF+L SSRE +  +S WN+ +L   P  FVSA            SF   P FR  P   
Sbjct: 350  ADFLLASSRESILLDSQWNRGILDCVPYAFVSAFEALLKSSSNAPSFALPPIFRFLPVAT 409

Query: 1526 ASV 1528
            +S+
Sbjct: 410  SSI 412


>gi|302686242|ref|XP_003032801.1| hypothetical protein SCHCODRAFT_234274 [Schizophyllum commune H4-8]
 gi|300106495|gb|EFI97898.1| hypothetical protein SCHCODRAFT_234274 [Schizophyllum commune H4-8]
          Length = 1818

 Score =  176 bits (445), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 168/576 (29%), Positives = 259/576 (44%), Gaps = 78/576 (13%)

Query: 1157 DAALIVESIRRDEFGLG----------PNISNMESNMLKKQHARLGRALHCLSQELYSQD 1206
            +A  ++E IR D +G G          P++  +  N+       L  A   +S++LY++ 
Sbjct: 129  EAQKLIEEIR-DGYGSGRSAQAKMQQDPSLQAVIGNLTTT----LENACRNVSEDLYAKS 183

Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG 1266
             HF+LE+VQNADDN Y E  +PTLT  +++  +   +NE GF   N+RA+C VG STKKG
Sbjct: 184  PHFILEMVQNADDNKYAEGTKPTLTIRIEDDAVTFSSNELGFEPANVRAICSVGKSTKKG 243

Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRL 1326
             S    GR  +GFKS F+V D   I S  F  KFD  E ++G + P         ++C  
Sbjct: 244  MS----GRCYVGFKSCFKVADVVHIASRDFQFKFDKRE-KLGMITP---------IWC-- 287

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
             +  PV        T  +  FR + S G +   + +  S L PSLLLFL +L  +     
Sbjct: 288  -TDHPV--------TKGQTTFRLELSPGESGAALAEHASKLEPSLLLFLRQLNTLKVETA 338

Query: 1387 L---NDSLVVIRKKIVGD-GIIKVSCGED----KMTWFVAS---QKLRAGVIRPDVKTTE 1435
            L   N  ++  + +   D  +I ++  ED    + T+F+     Q +     R  + +TE
Sbjct: 339  LSGQNPQIIHFKYRTTNDPDVITLTRWEDHRTTRDTYFIVHHTVQGMPEEEKRRGINSTE 398

Query: 1436 IALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQ 1495
            + LA  + +  EG   P    +   AFLPLR YG KFI+Q DF+ P+SRE++    PWN 
Sbjct: 399  VTLAFPIDK--EG--APQARAEYAHAFLPLRKYGFKFIIQADFLTPASREDILEEKPWNI 454

Query: 1496 WLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV-HGFFSGLPRMILSKL 1554
             L +   A  ++A     +    RE        +  F+PL  +V +G+F  L   I   L
Sbjct: 455  TLRNGLLAAVLAA----IERLKIRE---AMKFTWFRFLPLETDVSNGYFKPLVANIRHTL 507

Query: 1555 RMSNCLILEGNNNQWAPPCKVLRGW------NDRAHSLLPDILLQKHLGLGFL--SKDIV 1606
               +  + +      A    +L  W       D  H   P I  Q    + +L  S D  
Sbjct: 508  AAKSLFLSQDGEYHTAGQLLILPDWAAYQEKEDDKHVTKPLIPEQYLPFVWYLHSSYDTS 567

Query: 1607 LSDS--LARALGIEEYGPKILLQIISSLCRTENGLRSMGLS--WLASWLNELYTISFHSS 1662
            ++    L   L ++        Q I++L    N     G S  W  +  + LY I    S
Sbjct: 568  INGDGPLLEGLRLKRMNSA---QFINALQAMVNKRAHEGRSPAWFNAVYDNLYRIYDEPS 624

Query: 1663 GQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEG 1698
              +  +          ++ +  IPLSDGT++S  +G
Sbjct: 625  NPNQKRGRKSYPYQTPIEALKLIPLSDGTWASKVDG 660


>gi|115485641|ref|NP_001067964.1| Os11g0514100 [Oryza sativa Japonica Group]
 gi|77551084|gb|ABA93881.1| hypothetical protein LOC_Os11g31500 [Oryza sativa Japonica Group]
 gi|113645186|dbj|BAF28327.1| Os11g0514100 [Oryza sativa Japonica Group]
          Length = 1674

 Score =  175 bits (443), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 141/451 (31%), Positives = 221/451 (49%), Gaps = 71/451 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ F +G      E N L +    + +A++ LSQE+YS+D HFL+EL+QNA+DN 
Sbjct: 8    VERIRRERFFIGRG----ERNPLAED---MHQAVNYLSQEIYSKDVHFLMELIQNAEDND 60

Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP  V P+L F++           S +++ NNE+GFS  N+ ++C VG STKKG+   GY
Sbjct: 61   YPSGVAPSLEFLITSKDITGLGAPSTLLIFNNEKGFSPSNVESICRVGKSTKKGNRHQGY 120

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF ++  P I SNG+ +KF+   +SE  IG+++P  V   P   D+    
Sbjct: 121  IGEKGIGFKSVFLISRQPHIFSNGYRIKFNEDPSSECNIGYIVPEWVESKPSLSDIQELH 180

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
             S  P+        T I LP +++      ++ +    S +HP +LLFL +++ +  R  
Sbjct: 181  GSSKPLP------TTTIILPLKSE-----KVDVVKKQLSSIHPEILLFLTKIRRLSVRED 229

Query: 1385 ------------NMLNDSLVVIRKKIVGDG-IIKVSCGEDK----MTWFVASQKLRAGV- 1426
                        +M ++    +RK I  +   + +S  E+K      +++  QK      
Sbjct: 230  NSDPKCSTINEISMESEKNYQVRKNIHAESYTLHLSAQENKEQEECGYYMWRQKFPVKPE 289

Query: 1427 ----IRPDVKTTEIALAL----TLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
                +R D+    I LA      L    + + G       V+AFLP        FI+Q D
Sbjct: 290  NRVDMRADIDEYVITLAFPHGQRLSRGKQSSPG-------VYAFLPTEILTNFPFIIQAD 342

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
            F+L SSRE +  +S WN+ +L   P+ F++A  +        + PA +     +F+P+  
Sbjct: 343  FLLASSRETILFDSMWNKGILECVPSAFLNAFVALVKSSA--DAPAMSLPSMFNFLPVHP 400

Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
                F   +   I  K+R  N +  E    Q
Sbjct: 401  SHVPFLEPVRSAIKDKVRTENIMPCESYTLQ 431


>gi|357513121|ref|XP_003626849.1| Sacsin [Medicago truncatula]
 gi|355520871|gb|AET01325.1| Sacsin [Medicago truncatula]
          Length = 1699

 Score =  174 bits (442), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 134/394 (34%), Positives = 196/394 (49%), Gaps = 61/394 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G N + +  +        L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEEIRRKKFSIGGNPNPLTED--------LHQAVKNLSDELYTRDVHFLMELIQNAEDNH 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y E V PTL FI+    I         ++ NNE+GFS +NI ++C VG STKKG+ S+GY
Sbjct: 60   YIEGVSPTLEFIITSKDITATGAPATLLIFNNEKGFSRQNIESICSVGRSTKKGNRSSGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF VT  P I SNG+ ++F     +   +G+++P  V   P  +D+  ++
Sbjct: 120  IGEKGIGFKSVFLVTAQPYIFSNGYQIRFSERPCTHCSLGYIVPEWVEEKPSLLDI-KQI 178

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
                   L +    T I LP +    E +         S ++P +LLFL +++ +  R  
Sbjct: 179  YGAGKDSLPT----TIIVLPLKLDKVEPVKQQ-----LSSVNPEVLLFLLKIRHLSVRED 229

Query: 1387 LNDSL--------------VVIRKKIVGDG-IIKVSCG-----EDKMTWFVASQKLRAGV 1426
              D                 V RK +  +   I +S G     E K  +++  QK    +
Sbjct: 230  NEDPKKNTVTSVSISSEIDFVTRKNMDAESYTINLSAGEKSSDEKKCCYYMWKQKFPVKL 289

Query: 1427 -----IRPDVKTTEIALALTLQESNEGNYGPLLYQQP--VFAFLPLR-TYGLKFILQGDF 1478
                  R DV+   + LA   QE  +       +Q    V+AFLP        FI+Q DF
Sbjct: 290  QNVVERRKDVEECVVTLAFPRQERLQKIKKLRKHQSSPGVYAFLPTEMVTNFPFIIQADF 349

Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
            VL SSRE +  N  WNQ +L   P+ FV A ++ 
Sbjct: 350  VLASSRETILLNDKWNQGILEYVPSAFVDAFKTL 383


>gi|125534525|gb|EAY81073.1| hypothetical protein OsI_36253 [Oryza sativa Indica Group]
          Length = 1695

 Score =  174 bits (441), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 141/451 (31%), Positives = 220/451 (48%), Gaps = 71/451 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ F +G      E N L +    + +A++ LSQE+YS+D HFL+EL+QNA+DN 
Sbjct: 8    VERIRRERFFIGRG----ERNPLAED---MHQAVNYLSQEIYSKDVHFLMELIQNAEDND 60

Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP  V P+L F++           S +++ NNE GFS  N+ ++C VG STKKG+   GY
Sbjct: 61   YPSGVAPSLEFLITSKDITGLGAPSTLLIFNNENGFSPSNVESICRVGKSTKKGNRHQGY 120

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF ++  P I SNG+ +KF+   +SE  IG+++P  V   P   D+    
Sbjct: 121  IGEKGIGFKSVFLISRQPHIFSNGYRIKFNEDPSSECNIGYIVPEWVESKPSLSDIQELH 180

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
             S  P+        T I LP +++      ++ +    S +HP +LLFL +++ +  R  
Sbjct: 181  GSSKPLP------TTTIILPLKSE-----KVDVVKKQLSSIHPEILLFLTKIRRLSVRED 229

Query: 1385 ------------NMLNDSLVVIRKKIVGDG-IIKVSCGEDK----MTWFVASQKLRAGV- 1426
                        +M ++    +RK I  +   + +S  E+K      +++  QK      
Sbjct: 230  NSDPKCSTINEISMESEKNYQVRKNIHAESYTLHLSAQENKEQEECGYYMWRQKFPVKPE 289

Query: 1427 ----IRPDVKTTEIALAL----TLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
                +R D+    I LA      L    + + G       V+AFLP        FI+Q D
Sbjct: 290  NRVDMRADIDEYVITLAFPHGQRLSRGKQSSPG-------VYAFLPTEILTNFPFIIQAD 342

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
            F+L SSRE +  +S WN+ +L   P+ F++A  +        + PA +     +F+P+  
Sbjct: 343  FLLASSRETILFDSMWNKGILECVPSAFLNAFVALVKSSA--DAPAMSLPSMFNFLPVHP 400

Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
                F   +   I  K+R  N +  E    Q
Sbjct: 401  SHVPFLEPVRSAIKDKVRTENIMPCESYTLQ 431


>gi|393234641|gb|EJD42202.1| hypothetical protein AURDEDRAFT_152983 [Auricularia delicata
            TFB-10046 SS5]
          Length = 1430

 Score =  174 bits (440), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 163/597 (27%), Positives = 263/597 (44%), Gaps = 78/597 (13%)

Query: 1190 RLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFS 1249
            +L ++L  L  ELYS+ +HF+LE++QNADDN Y  + EP L  +L+ S + +  NE GFS
Sbjct: 41   KLAQSLQVLCAELYSKPAHFVLEVIQNADDNTYAPDTEPELRIVLRHSHMEIQCNEVGFS 100

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
             EN+RA C +G STKK  S GYIG KGIG KS+F+V     + S G+    D  E ++G 
Sbjct: 101  DENVRAFCGIGQSTKKNQS-GYIGEKGIGAKSIFQVARKVFVQSRGYSFVLDRDE-ELGM 158

Query: 1310 VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTK----FSEGIAMNNIVDMFS 1365
            + P                          W     LP  T+    F +      +     
Sbjct: 159  ISPH-------------------------WTELAVLPGWTRMTLEFCDEKQYYIVKAQLE 193

Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
             +  ++LLF+ RL  +   ++ + ++    +KI GD +     G     +   SQ + A 
Sbjct: 194  HVRMTVLLFMRRLAHL---HIDDGTVFAASRKIDGDVVTITVPGAPSWNFLRVSQIVPAY 250

Query: 1426 VIRPDVK---TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPS 1482
               P      +TE++LA  L ++ +    P +  Q V+AFLPLR  G  FI+Q DF+  +
Sbjct: 251  PYEPKRAHCVSTEVSLAFPLDQAGQ----PWISWQLVYAFLPLRAAGFTFIIQADFLTAA 306

Query: 1483 SREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG- 1541
            +RE+V  +  WN  L S    +F+SA   F  LP            +  ++P    +H  
Sbjct: 307  NREDVLWDLEWNINLRSAISKVFLSAVEKFKRLPALE-------LTWPRYLP--ANIHDP 357

Query: 1542 FFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFL 1601
            FF+     IL+ LR S  +  E    + +P  +    +       L   +  ++L   + 
Sbjct: 358  FFAPAAHEILASLRNSPIVKCEDGKYRHSPLVRCSSWFVSPTGEAL---VAAQYLPFSYA 414

Query: 1602 SKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASW--LNELYTISF 1659
            +          R+LG+E+   K  L  +  L   E  L S+G  WL     L  LY  ++
Sbjct: 415  APYPEDVKPHLRSLGVEDMTFKDFLDGL--LAMGEPRLTSIGNEWLEQVCSLINLYGTTW 472

Query: 1660 HSSGQSSLQSGVETDLIDN-LQRIPFIPLSDGTFSSVDEGTIWLHSDCS--VFDGGFGLE 1716
            ++  Q         D +D  ++++P + L  G ++ ++ G      D        G G+ 
Sbjct: 473  NAKDQ-------RWDCVDQRVRQLPLVHLHTGRWARINSGRSLFFGDAQEIPLPSGLGIS 525

Query: 1717 AFPNLCAKLRTVSP---ALLSASAVDKSSLGVI--SVDNLNRMLLKIGVQQLSAHDI 1768
                   +  T SP    LL    V K  +G +   +  L+RM  K+ VQ++  H +
Sbjct: 526  -----LVQTPTSSPHRTQLLERLGVRKVEVGAVVDKILELHRMSSKMSVQEMLEHAL 577


>gi|302775142|ref|XP_002970988.1| hypothetical protein SELMODRAFT_411717 [Selaginella moellendorffii]
 gi|300160970|gb|EFJ27586.1| hypothetical protein SELMODRAFT_411717 [Selaginella moellendorffii]
          Length = 1493

 Score =  172 bits (437), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 140/453 (30%), Positives = 221/453 (48%), Gaps = 66/453 (14%)

Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
            + +E +RR+ FG+G +  N  + ML++        +  LS ELY +D HFL EL+QNA+D
Sbjct: 6    MFIERLRRERFGIGQSGPNPLAPMLQE-------TIRLLSAELYQKDIHFLSELLQNAED 58

Query: 1220 NIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SA 1269
            N Y  +VEP++   L ++ +         ++LNNE GFS +NI +LC  G STKKG  S 
Sbjct: 59   NTYHSSVEPSIEIYLTKTDVTGTGAAATLLLLNNEVGFSEKNIESLCGSGQSTKKGKKSQ 118

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRL 1326
            GYIG KGIGFKSVF VT  P I SNGF ++FD    +E ++G+++P  V      +  ++
Sbjct: 119  GYIGEKGIGFKSVFLVTKQPYIISNGFRIRFDEEPHNEAKLGYIVPEWVDRPTDAVLQKV 178

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI-MFRN 1385
             SK      ++   T I LP R +      ++ +    + +    LLFL +++ + ++  
Sbjct: 179  CSK------AELPATVIILPLRQE-----KISEVAKQLTQIPAETLLFLSKIRMLSIYDQ 227

Query: 1386 MLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI------------------ 1427
                   ++  + V     K    E   T+  AS   R G                    
Sbjct: 228  QSRPPTELVLSRSVEMTTSKNESVETSTTFLTASNG-RNGTQHSNYYMFKQTLKVPEEAK 286

Query: 1428 ---RPDVKTTEIALALTLQE--SNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLP 1481
               R DV +  I LA  + +  S+  + G       +F+FLP   + GL F++  DF+L 
Sbjct: 287  VENRKDVDSWTITLAFPVDDRISSSCSIG------DIFSFLPSDIHSGLPFLINSDFLLV 340

Query: 1482 SSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCF--RENPAKAASVYMSFVPLVGEV 1539
            SSRE +  +SPWNQ +    P +F +A     +   F  +  P+ A      +VP+    
Sbjct: 341  SSRETLLFDSPWNQGIFRCVPEVFYNAFELLLNTTAFGLQSIPSVARMAMYKYVPVNACR 400

Query: 1540 HGFFSGLPRMILSKLRMSNCLILEGNNNQWAPP 1572
            + +   +   ILSKL+ +N +IL  ++   APP
Sbjct: 401  NSYLERVRNDILSKLKDNN-VILCASDVGRAPP 432


>gi|393234597|gb|EJD42158.1| hypothetical protein AURDEDRAFT_186148 [Auricularia delicata
            TFB-10046 SS5]
          Length = 1503

 Score =  172 bits (435), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 167/601 (27%), Positives = 262/601 (43%), Gaps = 97/601 (16%)

Query: 1142 GDCTTQTLPEDKECEDA--ALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLS 1199
            G    Q + +   C++   A +V+  RRD     P + N  + + +K    L ++L  LS
Sbjct: 3    GISAEQQVVKQIRCDNGVDADLVQQARRD-----PVVWNTVTKLKEK----LEKSLQILS 53

Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
            +ELYS+ +HF+LEL+QNADDN Y     P +  +L+ S + +  NE GF+AEN++A C +
Sbjct: 54   EELYSKPAHFVLELLQNADDNSYAPGTTPEMRIVLRPSHMEIRCNETGFTAENVKAFCGI 113

Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFN 1319
            G STK+  + GYIG KGIG KS+F+V     +HS G+H   D  + ++G + P       
Sbjct: 114  GQSTKRNQT-GYIGEKGIGAKSMFKVARKVHVHSLGYHFVLD-RDAELGMISP------- 164

Query: 1320 IDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
                         Q      N       R  F +      +      +  S+LLF+ RL 
Sbjct: 165  -------------QWTGSSENLSGWTLLRLDFCDDKQYAIVKSQLGQVRESVLLFMRRLA 211

Query: 1380 CIMFRNMLNDSLVVIRKKIVGDG-IIKVSCGEDKMTW-FVASQKLRAGVIRPDVK----- 1432
             +   + ++ S  V R+    DG  + +S    K  W     Q LR     P  +     
Sbjct: 212  RLTVDDSVDKSFSVERQM---DGKFLTISV---KNGWSMTCRQFLRVANTIPAYRHEPKR 265

Query: 1433 ----TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVD 1488
                TTEI LA  L  S E    PL+ +Q V AFLP+R+ G KFI+Q DF+  ++RE+V 
Sbjct: 266  EHSETTEIVLAFPLNASRE----PLVSRQMVHAFLPVRSVGFKFIIQADFLTAANREDVL 321

Query: 1489 GNSPWNQWLLSEFPALFVSA-ERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGL 1546
             +  WN+ L S     F SA    F  +P          + ++ F+P   +V   FFS +
Sbjct: 322  SDLEWNKTLRSGVVEAFCSAVTVQFQSMPTLD-------TTWLHFLPDPSDVADQFFSPI 374

Query: 1547 PRMILSKLRMSNCLILEGNNNQWAPPCKV---------LRGWNDRAHSLLPDILLQKHLG 1597
             R I S  R+ +  +++  +  W  P  V           G        LP         
Sbjct: 375  TRDIRS--RIESLKVIKDADGFWRTPSLVRFPSAFSETTDGAGIVPEPELPFYYASNQYT 432

Query: 1598 LGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTI 1657
             G  SK        A+ALG+ +         +S L +  + +   G  WL    + +YT 
Sbjct: 433  GGLASK--------AQALGVADM---TFNDFLSGLIKMGDRITRKGNVWLERVCSLIYTH 481

Query: 1658 SFHSSGQSSLQSGVETDLID-NLQRIPFIPLSDGTFSSVD-EGTIWLHSDCSVFDGGFGL 1715
             F+ + Q          L D  ++ +P   L++G + S      ++ +   S F  G G+
Sbjct: 482  GFNWNSQ----------LRDRRIRDLPLARLANGNWISYGISAALFFNRSSSEFPSGLGI 531

Query: 1716 E 1716
            +
Sbjct: 532  D 532


>gi|302142144|emb|CBI19347.3| unnamed protein product [Vitis vinifera]
          Length = 1801

 Score =  171 bits (433), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 136/419 (32%), Positives = 196/419 (46%), Gaps = 70/419 (16%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
            E A L +E IR  +F +G    N  +         L  A+  LS ELY++D HFL+EL+Q
Sbjct: 19   EAARLHIEKIRTQKFSIGAKSPNPLTE-------DLHHAVTSLSAELYAKDVHFLMELIQ 71

Query: 1216 NADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKG 1266
            NA+DN Y + VEPTL F+     I         +V NNE GFS +N+ ++C VG STKKG
Sbjct: 72   NAEDNEYKQGVEPTLEFVFTTKDITGSGAPATLLVFNNEVGFSEKNMESICSVGRSTKKG 131

Query: 1267 SS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ---IGFVLPTLV--PPFNI 1320
                G+IG KGIGFKSVF VT  P I SNG+HV+F     Q   IG+++P  V   P+ +
Sbjct: 132  KRHLGFIGEKGIGFKSVFLVTIQPHIFSNGYHVRFSEDPDQDCGIGYIVPEWVGGKPY-L 190

Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
               C +   D V        T I LP + +  E +         S+LHP LLLFL +++ 
Sbjct: 191  STICDIYGSDKVLP-----TTIILLPLKPEKVEAVKAQ-----LSELHPELLLFLSKIKR 240

Query: 1381 I-----------------MFRNMLNDSLVVIRKKIVGDGIIKVSCGED----KMTWFVAS 1419
            +                 +F +   D  V +  K     ++ +S  ED       +++  
Sbjct: 241  LSVCGNTCNSKEDVNVSDIFISTETDH-VALSDKSADSRVLHLSVKEDGAEETCRYYIWR 299

Query: 1420 QKLRAG-----VIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFI 1473
            +            R DVK   I++A    E  +     +     +FAFLP        F+
Sbjct: 300  ETFPVKPANEFSARNDVKKCVISIAFPFGERLKRGTSAV----GIFAFLPTAMITNFPFV 355

Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSF 1532
            +Q DFVL SSRE +  ++ WN  +L   P+ F  A  S     C +++     SV  +F
Sbjct: 356  IQADFVLASSRETILLDNKWNLGILECIPSAFFHAFIS-----CVKKDATTICSVAQAF 409


>gi|443713407|gb|ELU06277.1| hypothetical protein CAPTEDRAFT_76413, partial [Capitella teleta]
          Length = 180

 Score =  171 bits (432), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 81/161 (50%), Positives = 115/161 (71%), Gaps = 7/161 (4%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR+EFG+G  ++     + +K  ARLGR+L  L+ +LYS+D+HF+LEL+QNADDN 
Sbjct: 13   IERIRREEFGIGLELTCEFQAVFEKNQARLGRSLQRLAHDLYSKDTHFVLELIQNADDNS 72

Query: 1222 YPE-------NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGR 1274
            Y E       +V PTL+F++ +  + + NNE+GF  ++++A+CDVG STK     GYIG+
Sbjct: 73   YAEHLLNPDSDVVPTLSFVVSDRAVKISNNEKGFLEKHVKAICDVGCSTKPKHQMGYIGQ 132

Query: 1275 KGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
            KGIGFKSVFRV+D PEI S+GFH KFD +   +G++LP  V
Sbjct: 133  KGIGFKSVFRVSDEPEIVSSGFHFKFDKNSSDMGYILPHWV 173


>gi|147787343|emb|CAN62566.1| hypothetical protein VITISV_022763 [Vitis vinifera]
          Length = 1725

 Score =  171 bits (432), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 136/419 (32%), Positives = 196/419 (46%), Gaps = 70/419 (16%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
            E A L +E IR  +F +G    N  +         L  A+  LS ELY++D HFL+EL+Q
Sbjct: 37   EAARLHIEKIRTQKFSIGAKSPNPLTE-------DLHHAVTSLSAELYAKDVHFLMELIQ 89

Query: 1216 NADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKG 1266
            NA+DN Y + VEPTL F+     I         +V NNE GFS +N+ ++C VG STKKG
Sbjct: 90   NAEDNEYKQGVEPTLEFVFTTKDITGSGAPATLLVFNNEVGFSEKNMESICSVGRSTKKG 149

Query: 1267 SS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ---IGFVLPTLV--PPFNI 1320
                G+IG KGIGFKSVF VT  P I SNG+HV+F     Q   IG+++P  V   P+ +
Sbjct: 150  KRHLGFIGEKGIGFKSVFLVTIQPHIFSNGYHVRFSEDPDQDCGIGYIVPEWVGGKPY-L 208

Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
               C +   D V        T I LP + +  E +         S+LHP LLLFL +++ 
Sbjct: 209  STICDIYGSDKVLP-----TTIILLPLKPEKVEAVKAQ-----LSELHPELLLFLSKIKR 258

Query: 1381 I-----------------MFRNMLNDSLVVIRKKIVGDGIIKVSCGED----KMTWFVAS 1419
            +                 +F +   D  V +  K     ++ +S  ED       +++  
Sbjct: 259  LSVCGNTCNSKEDVNVSDIFISTETDH-VALSDKSADSRVLHLSVKEDGAEETCRYYIWR 317

Query: 1420 QKLRAG-----VIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFI 1473
            +            R DVK   I++A    E  +     +     +FAFLP        F+
Sbjct: 318  ETFPVKPANEFSARNDVKKCVISIAFPFGERLKRGTSAV----GIFAFLPTAMITNFPFV 373

Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSF 1532
            +Q DFVL SSRE +  ++ WN  +L   P+ F  A  S     C +++     SV  +F
Sbjct: 374  IQADFVLASSRETILLDNKWNLGILECIPSAFFHAFIS-----CVKKDATTICSVAQAF 427


>gi|242068521|ref|XP_002449537.1| hypothetical protein SORBIDRAFT_05g018376 [Sorghum bicolor]
 gi|241935380|gb|EES08525.1| hypothetical protein SORBIDRAFT_05g018376 [Sorghum bicolor]
          Length = 1709

 Score =  170 bits (431), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 140/459 (30%), Positives = 225/459 (49%), Gaps = 66/459 (14%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ + +G      E N L +    + +A++ LSQELYS+D HFL+ELVQNA+DN 
Sbjct: 11   VERIRRERYYIGRG----EQNPLAED---MHQAVNYLSQELYSKDVHFLMELVQNAEDNE 63

Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP  V P+L F++           S +++ NNE+GFS+ NI ++C VG STKKG+   GY
Sbjct: 64   YPSEVAPSLEFLITSKDITGCGASSTLLIFNNERGFSSTNIESICRVGKSTKKGNRHQGY 123

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNIDMFCRLLS 1328
            IG KGIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V     +    L  
Sbjct: 124  IGEKGIGFKSVFLISSQPYIFSNGYQIKFNEKPCAECNIGYIVPQWV-----ESTPSLSD 178

Query: 1329 KDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
             + +   SK    T I LP +++      ++ +    S +HP +LLFL +++ +  R   
Sbjct: 179  IEAIYGCSKVLPTTTIILPLKSE-----KIDAVKSQLSSMHPEMLLFLTKIRKLSVREDK 233

Query: 1388 ND------SLVVI--------RKKIVGDGII------KVSCGEDKMTWFVASQKLRAGVI 1427
            +D      S + I        RK +  +         +    E +  +++  QK     +
Sbjct: 234  SDPNNTTVSEISISSEKNYQARKNMHAESYTLHLSSEETGKDEAECGYYMWRQKFP---V 290

Query: 1428 RPDVKTT------EIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVL 1480
            +P+ +        E  + L        + G  L    ++AFLP        FI+Q DF+L
Sbjct: 291  KPENRVNKRAEIDEWVITLAFPHGERLSRGKQL-SPGIYAFLPTEMVTSFPFIIQADFLL 349

Query: 1481 PSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH 1540
             SSRE +  ++PWN+ +L   P+ F++A  +        + PA +     SF+P+   + 
Sbjct: 350  GSSREAILFDNPWNKGILDCVPSAFMNAFVTLVKSKA--DAPAMSLPSMFSFLPVNSSLI 407

Query: 1541 GFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
                 +   I  K+   N + LE   +Q  +  P +V R
Sbjct: 408  PLLEPVRSGIKYKVLTENIVPLESYASQKIFCKPSEVAR 446


>gi|242068523|ref|XP_002449538.1| hypothetical protein SORBIDRAFT_05g018443 [Sorghum bicolor]
 gi|241935381|gb|EES08526.1| hypothetical protein SORBIDRAFT_05g018443 [Sorghum bicolor]
          Length = 1699

 Score =  170 bits (431), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 139/459 (30%), Positives = 226/459 (49%), Gaps = 66/459 (14%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ + +G      E N L +    + +A++ LSQELYS+D HFL+ELVQNA+DN 
Sbjct: 11   VERIRRERYYIGRG----EQNPLAED---MHQAVNYLSQELYSKDVHFLMELVQNAEDNE 63

Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP  V P+L F++           S +++ NNE+GFS+ NI ++C VG STKKG+   GY
Sbjct: 64   YPSEVAPSLEFLITSKDITGCGASSTLLIFNNERGFSSTNIESICRVGKSTKKGNRHQGY 123

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNIDMFCRLLS 1328
            IG KGIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V     +    L  
Sbjct: 124  IGEKGIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECNIGYIVPQWV-----ESTPSLSD 178

Query: 1329 KDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
             + +   SK    T I LP +++ ++ +         S +HP +LLFL +++ +  R   
Sbjct: 179  IEALYGCSKVLPTTTIILPLKSEKTDAVK-----SQLSSMHPEMLLFLTKIRKLSVREDK 233

Query: 1388 ND------SLVVI--------RKKIVGDGII------KVSCGEDKMTWFVASQKLRAGVI 1427
            +D      S + I        RK +  +         +    E +  +++  QK     +
Sbjct: 234  SDPNNTTVSEISISSERNYQARKNMHAESYTLHLSSEETGKDEAECGYYMWRQKFP---V 290

Query: 1428 RPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVL 1480
            +P+       +  E  + L        + G  L    ++AFLP        FI+Q DF+L
Sbjct: 291  KPENRVDKRAEIDEWVITLAFPHGERLSRGKQL-SPGIYAFLPTEMVTSFPFIIQADFLL 349

Query: 1481 PSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH 1540
             SSRE +  ++PWN+ +L   P+ F++A  +        + PA +     +F+P+   + 
Sbjct: 350  GSSREAILFDNPWNKGILDCVPSAFMNAFVTLVKSKA--DAPAMSLPSMFNFLPVSSSLI 407

Query: 1541 GFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
                 +   I +K+   N + LE   +Q  +  P +V R
Sbjct: 408  PLLEPVRSGIKNKVLTENIVPLESYASQKIFCKPSEVAR 446


>gi|414591567|tpg|DAA42138.1| TPA: hypothetical protein ZEAMMB73_860622 [Zea mays]
 gi|414591568|tpg|DAA42139.1| TPA: hypothetical protein ZEAMMB73_860622 [Zea mays]
          Length = 1700

 Score =  169 bits (429), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 132/395 (33%), Positives = 194/395 (49%), Gaps = 77/395 (19%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VESIRR  + +G      E N L +    L +A + LSQELYS+D HFL+ELVQNA+DN 
Sbjct: 13   VESIRRKRYYIGRG----EQNPLAED---LHQATNYLSQELYSKDVHFLMELVQNAEDNE 65

Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP  V P+L F++           S +++ NNE+GFSA NI ++C VG STKKG+   GY
Sbjct: 66   YPSEVAPSLEFLITSKDITGCGASSTLLIFNNERGFSATNIESICRVGKSTKKGNRHRGY 125

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV---PPF-NIDMFC 1324
            IG KGIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V   P   +I+   
Sbjct: 126  IGEKGIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECNIGYIVPEWVESTPSLSDIETLY 185

Query: 1325 RLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR 1384
                  P         T I LP +++      ++ +    S +HP +LLFL ++  +  R
Sbjct: 186  GCSKVLP--------TTTIILPLKSE-----KIDAVKTQLSSMHPEMLLFLTKITKLSVR 232

Query: 1385 --NMLNDSLVV-----------IRKKIVGD-----------GIIKVSCGEDKMTWFVASQ 1420
              + LN + V             RK +  +           G  +  CG     +++  Q
Sbjct: 233  ENSDLNHTTVSEISITSEKNYQARKNMHAESYTLHLSAEETGKEEAECG-----YYMWRQ 287

Query: 1421 KLRAGVIRPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
            K     ++P+       +  E  + L          G  L    ++AFLP        FI
Sbjct: 288  KFP---VKPENRVDKRAEIDEWVITLAFPHGQRLTRGKQLLPS-IYAFLPTEMVTSFPFI 343

Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            +Q DF+L SSRE +  +SPWN+ +L   P  F+ A
Sbjct: 344  IQADFLLASSREAILFDSPWNKGILECVPKAFMDA 378


>gi|359491119|ref|XP_003634225.1| PREDICTED: uncharacterized protein LOC100252577 [Vitis vinifera]
          Length = 1711

 Score =  169 bits (429), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 134/461 (29%), Positives = 218/461 (47%), Gaps = 71/461 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            ++ IRR +F +G  ++ +  +        L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IKEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDND 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            YPE V P+L  ++    I         ++ NNE+GFSA+NI ++C VG STKK +   GY
Sbjct: 60   YPEGVNPSLELVITSQDITGTGASATLLIFNNEKGFSAKNIESICSVGRSTKKNNRKCGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVP--PFNIDMFCRL 1326
            IG KGIGFKSVF +T  P I SNG+ ++F+        +G+++P  V   P  +D   ++
Sbjct: 120  IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCPHSNLGYIVPEWVQWNP-TLDDIKQI 178

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
                 V        T I LP +        +  + +  S + P +LLFL +++    +  
Sbjct: 179  YGSHAV-----LPTTTIILPLKPD-----KIGPVKEQLSSIQPEVLLFLSKIKQFSVKKH 228

Query: 1387 LNDSLV--------------VIRKKIVGDGII-------KVSCGEDKMTWFVASQKL--- 1422
              D  +              V RK I  D  I            E + ++++  QK    
Sbjct: 229  NEDPRLNTVNAISISSEINFVKRKNIDADSYILHLSTDGAKDVTEKECSYYMWRQKFPVR 288

Query: 1423 RAGVIRPDVKTTEIALALTL---QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDF 1478
            +   +   +   E+ + L     Q  N G   P      V+AFLP        FI+Q DF
Sbjct: 289  QENQVERRLGVEELVITLAFPFGQRLNRGISSP-----GVYAFLPTEMVTNFPFIIQADF 343

Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE 1538
            VL SSRE +  ++ WNQ +L   P+ F++A  S   +   ++ P    +    F+P+   
Sbjct: 344  VLASSRETILLDNKWNQGILDCVPSAFLNAFISL--VTTSQDVPVSTLTPMFKFLPIYSS 401

Query: 1539 VHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
             +   + +  +I ++L   N +  E  ++Q  +  PC+V R
Sbjct: 402  PYPKLNDVRELIKAELLKKNIVPCESYSDQKIFRKPCEVGR 442


>gi|357513141|ref|XP_003626859.1| hypothetical protein MTR_8g011330 [Medicago truncatula]
 gi|355520881|gb|AET01335.1| hypothetical protein MTR_8g011330 [Medicago truncatula]
          Length = 1702

 Score =  169 bits (428), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 137/448 (30%), Positives = 217/448 (48%), Gaps = 65/448 (14%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR++F +G   + +  +        L  A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEEIRREKFSIGGKPNPLTYD--------LHHAVKNLSAELYAKDVHFLMELIQNAEDNH 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y +   PTL F++    I         ++ NNE+GFS +NI ++C VG STKK + S+GY
Sbjct: 60   YIDGENPTLEFVITSDDITGTGAPATLLIFNNEKGFSPKNIESICSVGRSTKKDNRSSGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF VT  P I SNG+H++F+       +IG+V+P  V   P  +D+  ++
Sbjct: 120  IGEKGIGFKSVFLVTAQPYIFSNGYHIRFNERPCPHCRIGYVVPEWVEEKPTLVDI-QKI 178

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
                   L +    T I LP ++       +  +    S +HP +LLFL +++ +  R  
Sbjct: 179  YGAGKDSLPT----TTIVLPLKSD-----KVKPVKQQLSSIHPEVLLFLSKIRHLSVREG 229

Query: 1385 ------------NMLNDSLVVIRKKIVGDG-IIKVSCGED-----KMTWFVASQKLRAGV 1426
                        ++ ++   V RK +      I +S GE      + ++++  QK    +
Sbjct: 230  NKDFNQNTATSVSISSEINFVTRKNMNAKSYTIHLSTGESSRDKKECSYYMWKQKFPVKL 289

Query: 1427 -----IRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVL 1480
                  R  V+   + LA   QE  + +         V+AFLP        FI+Q DFVL
Sbjct: 290  ENVVERRRGVEECVVTLAFPQQERLDKDKS----LPGVYAFLPTEMVTNFPFIIQADFVL 345

Query: 1481 PSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH 1540
             SSRE +  +  WNQ +L   P  F+ A ++   +    E P  +      F+P+     
Sbjct: 346  ASSRETILFDDKWNQGILEYVPLAFIDAFKTL--VTGSDEAPLSSLPYMFKFLPIHSSPF 403

Query: 1541 GFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
              F+ + + I +KL     + +E   NQ
Sbjct: 404  EIFNRVRQKIKAKLDEEKIIPIETFINQ 431


>gi|225456431|ref|XP_002284314.1| PREDICTED: uncharacterized protein LOC100247409 [Vitis vinifera]
          Length = 1712

 Score =  169 bits (427), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 142/460 (30%), Positives = 219/460 (47%), Gaps = 69/460 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G      E N L +    L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEEIRRTKFSIGG-----EQNPLTED---LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
            Y E+V P+L  ++    I         ++ NNE+GFSA+NI ++C VG STKKG+   GY
Sbjct: 60   YGEDVNPSLELVITSKDITGTRALMTLLIFNNEKGFSAKNIESICGVGRSTKKGNRKRGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPF-NIDMFCRLL 1327
            IG KGIGFKSVF +T  P I SNG+ ++F+        +G+++P  V    ++D   ++ 
Sbjct: 120  IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCPHSNLGYIVPEWVEQNPSLDDIKQIY 179

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR--- 1384
                V        T I LP +        +  + +  S + P +LLFL +++    +   
Sbjct: 180  GSHAVLP-----TTTIILPLKPD-----KIKPVKEQLSSIQPEVLLFLSKIKHFSVKEDN 229

Query: 1385 -----NMLNDSLV------VIRKKIVGDGII-------KVSCGEDKMTWFVASQKLRAGV 1426
                 N +N   +      V RK I  D               E + ++++  QK     
Sbjct: 230  EDPRLNTVNAISISSEINFVTRKNIDADSYTLHLSADETSDVTEKECSYYMWRQKFPVRQ 289

Query: 1427 -----IRPDVKTTEIALALTL-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
                  R +V+   I LA  L Q  N G   P      V+AFLP        FI+Q DFV
Sbjct: 290  ENQVERRLEVEEWVITLAFPLGQRLNRGMSSP-----GVYAFLPTEMVTNFPFIIQADFV 344

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV 1539
            L SSRE +  ++ WNQ +L   P+ F++A  S   +    + P    +    F+P+    
Sbjct: 345  LASSRETILLDNKWNQGILDCVPSAFLNALISL--VTTSEDVPVSTLTPMFKFLPIDSSS 402

Query: 1540 HGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
            +   + +   I +KL   N +  E  ++Q  +  PC+V R
Sbjct: 403  YPKLNVVRESIKAKLLKENIIPCESYSDQKIFRKPCEVGR 442


>gi|147775835|emb|CAN78188.1| hypothetical protein VITISV_033142 [Vitis vinifera]
          Length = 1488

 Score =  168 bits (426), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 137/441 (31%), Positives = 211/441 (47%), Gaps = 55/441 (12%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G  ++ +  +        L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y E V P+L  ++    I         ++ NNE+GFSA+NI ++C VG STKK +   GY
Sbjct: 60   YGEGVNPSLELVITSQDITDTGAPATLLIFNNEKGFSAKNIESICSVGRSTKKSNRKQGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVP--PFNIDMFCRL 1326
            IG KGIGFKSVF +T  P I SNG+ ++F+    S   +G+++P  V   P   D+    
Sbjct: 120  IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCSHSNLGYIVPEWVEQNPSLADI---- 175

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
              K      +    T I LP +        +  +    S + P +LLFL +++    +  
Sbjct: 176  --KQIYGSHAVLPTTTIILPLKPD-----KIKPVKQQLSSIQPEVLLFLSKIKQFSVKED 228

Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGV-----IRPDVKTTEIALALT 1441
              D  +     I    I      E + ++++  QK           R  V+   I LA  
Sbjct: 229  NKDPRLNTVNAI---SISTSDVTEKECSYYMWRQKFPVRQENQVERRLGVEEWVIKLAFP 285

Query: 1442 L-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
            + Q  N G   P +Y     AFLP        FI+Q DFVL SSRE +  ++ WNQ +L 
Sbjct: 286  IGQRLNRGMSSPGIY-----AFLPTEMVTNFPFIIQADFVLASSRETILLDNKWNQGILD 340

Query: 1500 EFPALFVSAERSFCDLPCFREN-PAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
              P+ F++A   F  L    E+ P    +    F+P+    +   + +   I +KL   N
Sbjct: 341  CVPSAFLNA---FISLVTTSEDVPVSTLTPMFKFLPINSSSYPKLNVVRESIKAKLLTEN 397

Query: 1559 CLILEGNNNQ--WAPPCKVLR 1577
             +  E  ++Q  +  PC+V R
Sbjct: 398  IIPCESYSDQKIFRKPCEVGR 418


>gi|255540111|ref|XP_002511120.1| DNA binding protein, putative [Ricinus communis]
 gi|223550235|gb|EEF51722.1| DNA binding protein, putative [Ricinus communis]
          Length = 2299

 Score =  168 bits (425), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 138/449 (30%), Positives = 207/449 (46%), Gaps = 74/449 (16%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E +R+  F +G    N  ++ML K       A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 626  IEHLRKTTFSIGGQ-ENPLASMLDK-------AVEYLSAELYAKDVHFLMELIQNAEDNE 677

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
            YP  V+P+L F++    I         ++ NNE GF+ +NI ++C VGNSTKKG+   GY
Sbjct: 678  YPAGVDPSLEFVITSRDITATGASATLLIFNNEMGFTPKNIDSICSVGNSTKKGNRKRGY 737

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNIDMFCRLLS 1328
            IG KGIGFKSVF V+  P I SNG+ ++F    + + ++G+++P  V           LS
Sbjct: 738  IGEKGIGFKSVFLVSAQPCIFSNGYQIRFSEKPSPDCKLGYIVPEWVED------SPSLS 791

Query: 1329 KDPVQLESKCW--NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
                   S+C    T I LP +        M  +    S +HP +LLFL +++C+  R  
Sbjct: 792  DIKQIYGSRCSLPTTTIVLPLKPD-----KMGPVKQQLSSIHPEVLLFLSKIKCLSVRED 846

Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
              D     R   V                 +A  K    V R ++      L L+ +E+ 
Sbjct: 847  NEDP----RLNTVSA---------------IAITKETNFVTRKNIDAESYTLHLSAEENG 887

Query: 1447 EGNYGPLLYQQPVFAFLPLR----------------TYGLKFILQGDFVLPSSREEVDGN 1490
            + +     Y      F P+R                     FI+Q DF+L SSRE +  +
Sbjct: 888  DSSRAECSYHIWRQKF-PVRQENRVERRMDVEDWLMVTNFPFIIQADFILASSRETILLD 946

Query: 1491 SPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMI 1550
            + WNQ +L   P  FV+A  S   +    + P  +      F+PL G  +   + +   I
Sbjct: 947  NTWNQGILDCVPCAFVNALISL--VKSTEDAPLSSLPRMFEFLPLSGSPYPKLNVVRESI 1004

Query: 1551 LSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
             +KL   N L  E    Q  +  PC+V R
Sbjct: 1005 KTKLAAENILPSESYTVQKFFHKPCEVRR 1033


>gi|297734475|emb|CBI15722.3| unnamed protein product [Vitis vinifera]
          Length = 875

 Score =  167 bits (424), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 142/460 (30%), Positives = 219/460 (47%), Gaps = 69/460 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G      E N L +    L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEEIRRTKFSIGG-----EQNPLTED---LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
            Y E+V P+L  ++    I         ++ NNE+GFSA+NI ++C VG STKKG+   GY
Sbjct: 60   YGEDVNPSLELVITSKDITGTRALMTLLIFNNEKGFSAKNIESICGVGRSTKKGNRKRGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPF-NIDMFCRLL 1327
            IG KGIGFKSVF +T  P I SNG+ ++F+        +G+++P  V    ++D   ++ 
Sbjct: 120  IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCPHSNLGYIVPEWVEQNPSLDDIKQIY 179

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR--- 1384
                V        T I LP +        +  + +  S + P +LLFL +++    +   
Sbjct: 180  GSHAV-----LPTTTIILPLKPD-----KIKPVKEQLSSIQPEVLLFLSKIKHFSVKEDN 229

Query: 1385 -----NMLNDSLV------VIRKKIVGDGII-------KVSCGEDKMTWFVASQKLRAGV 1426
                 N +N   +      V RK I  D               E + ++++  QK     
Sbjct: 230  EDPRLNTVNAISISSEINFVTRKNIDADSYTLHLSADETSDVTEKECSYYMWRQKFPVRQ 289

Query: 1427 -----IRPDVKTTEIALALTL-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
                  R +V+   I LA  L Q  N G   P      V+AFLP        FI+Q DFV
Sbjct: 290  ENQVERRLEVEEWVITLAFPLGQRLNRGMSSP-----GVYAFLPTEMVTNFPFIIQADFV 344

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV 1539
            L SSRE +  ++ WNQ +L   P+ F++A  S   +    + P    +    F+P+    
Sbjct: 345  LASSRETILLDNKWNQGILDCVPSAFLNALISL--VTTSEDVPVSTLTPMFKFLPIDSSS 402

Query: 1540 HGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
            +   + +   I +KL   N +  E  ++Q  +  PC+V R
Sbjct: 403  YPKLNVVRESIKAKLLKENIIPCESYSDQKIFRKPCEVGR 442


>gi|357513125|ref|XP_003626851.1| Sacsin [Medicago truncatula]
 gi|355520873|gb|AET01327.1| Sacsin [Medicago truncatula]
          Length = 1698

 Score =  166 bits (420), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 137/421 (32%), Positives = 202/421 (47%), Gaps = 78/421 (18%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR++F +G   +N    +++  H    +A+  LS ELYS + HF+ EL+QNA+DN+
Sbjct: 8    IEEIRREKFSIGEKRAN---PLIEDLH----QAVKNLSAELYSTNVHFVKELIQNAEDNL 60

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKK-GSSAGY 1271
            Y + V P+L FI+    I         ++ NNE+GFS ENI +LC VG STKK   S+GY
Sbjct: 61   YNDGVSPSLEFIITSKDITATGAPATLLIFNNEKGFSRENIESLCSVGRSTKKENRSSGY 120

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF VT  P I SNG+ ++F+        +G+++P  V   P  +D     
Sbjct: 121  IGEKGIGFKSVFLVTAQPYIFSNGYQIRFNERPCPHCSLGYIVPEWVEEKPTLVD----- 175

Query: 1327 LSKDPVQLESKCWNTCIRLPF---RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF 1383
            + K     +    NT I LP    R +F        +    S +HP +LLFL +++ +  
Sbjct: 176  IQKIYGAGKDSLPNTIIVLPLKPDRVRF--------VNQQLSSIHPEVLLFLSKIRRLSV 227

Query: 1384 RNMLND------SLVVIRKKI---------VGDGIIKVSCGED-----KMTWFVASQKLR 1423
            R    D      + V I  +I              I +S GE+     + ++++  QK  
Sbjct: 228  RENSEDPKENTLTAVSISSEINFETSRNMDAESYTIHLSAGENNNDEQECSYYIWKQKFP 287

Query: 1424 AGV-----IRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGD 1477
              +      R  V+   + LA   QE    +         VFAFLP        FI+Q D
Sbjct: 288  VKLENVVERRMGVEECVVTLAFPHQERLHKDKS----LPGVFAFLPTEMVTNFPFIIQAD 343

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSA---------ERSFCDLP-CFRENPAKAAS 1527
            FVL SSRE +  +  WN+ +L   P+ F+ A         E   C LP  F+  P  ++ 
Sbjct: 344  FVLASSRETILLDDKWNKGILEYVPSAFIDAFKTLVTGSDEAPMCSLPYVFKFIPINSSP 403

Query: 1528 V 1528
            V
Sbjct: 404  V 404


>gi|77555433|gb|ABA98229.1| expressed protein [Oryza sativa Japonica Group]
          Length = 1831

 Score =  166 bits (420), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 144/439 (32%), Positives = 209/439 (47%), Gaps = 98/439 (22%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ------ 1215
            VE IRR+ F +G      E N L +    + +A+  LSQELYS+D HFL+EL+Q      
Sbjct: 16   VERIRRERFYIG----REERNPLAED---IHQAVTYLSQELYSKDLHFLMELIQLRLAAF 68

Query: 1216 ----------NADDNIYPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRAL 1256
                      NA+DN YP +VEP L F++           S ++V NNE+GFSA N+ ++
Sbjct: 69   FTCVISSSALNAEDNEYPSDVEPALEFVIIKKDITATGAASTLLVFNNERGFSAANVESI 128

Query: 1257 CDVGNSTKKGSS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLP 1312
            C +G STKKG+   GYIG KGIGFKSVF V+  P I SNG+ +KF+   + +  IG+++P
Sbjct: 129  CRIGKSTKKGNRHLGYIGEKGIGFKSVFLVSSQPHIFSNGYQIKFNEEPSEDCDIGYIVP 188

Query: 1313 TLV---PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
              V   P  +        SK       K   T I LP ++       +  + +  S  HP
Sbjct: 189  KWVDEKPSIDDIHAVYGYSK-------KLPTTTIILPLKSD-----KILPVKNELSSTHP 236

Query: 1370 SLLLFLHRLQCIMFRNMLND------SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLR 1423
             +LLFL +++ +  R + +D      S + I  ++  D   +     +  T  +A Q+ +
Sbjct: 237  EILLFLSKIRQLSVREINDDPKASKLSQISISSEV--DYKTQKDINAESYTLHLAMQENK 294

Query: 1424 AG--------------VIRP---DVKTTEI-ALALTL-----QESNEGNYGPLLYQQPVF 1460
             G              V++P   D K  E+    +TL     Q  + G   P      V+
Sbjct: 295  RGDKEECTYYMWKQKFVVKPECRDKKRMEVDKWVITLAFPHGQRLSRGARSP-----GVY 349

Query: 1461 AFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----------E 1509
            AFLP      L FI+Q DF+L SSRE +  +S WN+ +L   P  FVSA           
Sbjct: 350  AFLPTEMVTNLPFIIQADFLLASSRESILLDSQWNRGILDCVPYAFVSAFEALLKSSSNA 409

Query: 1510 RSFCDLPCFRENPAKAASV 1528
             SF   P FR  P   +S+
Sbjct: 410  PSFALPPIFRFLPVATSSI 428


>gi|225456428|ref|XP_002280614.1| PREDICTED: uncharacterized protein LOC100257713 [Vitis vinifera]
          Length = 1725

 Score =  166 bits (419), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 141/462 (30%), Positives = 216/462 (46%), Gaps = 73/462 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G  ++ +  +        L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y E V P+L  ++    I         ++ NNE+GFSA+NI ++C VG STKK +   GY
Sbjct: 60   YGEGVNPSLELVITSQDITDTGAPATLLIFNNEKGFSAKNIESICSVGRSTKKSNRKQGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVP--PFNIDMFCRL 1326
            IG KGIGFKSVF +T  P I SNG+ ++F+    S   +G+++P  V   P   D+    
Sbjct: 120  IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCSHSNLGYIVPEWVEQNPSLADI---- 175

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR-- 1384
              K      +    T I LP +        +  +    S + P +LLFL +++    +  
Sbjct: 176  --KQIYGSHAVLPTTTIILPLKPD-----KIKPVKQQLSSIQPEVLLFLSKIKQFSVKED 228

Query: 1385 ------NMLNDSLV------VIRKKIVGDGII-------KVSCGEDKMTWFVASQKLRAG 1425
                  N +N   +      V RK I  D               E + ++++  QK    
Sbjct: 229  NKDPRLNTVNAISISSEINFVTRKNIDADSYTLHLSTDEASDVTEKECSYYMWRQKFPVR 288

Query: 1426 V-----IRPDVKTTEIALALTL-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDF 1478
                   R  V+   I LA  + Q  N G   P +Y     AFLP        FI+Q DF
Sbjct: 289  QENQVERRLGVEEWVIKLAFPIGQRLNRGMSSPGIY-----AFLPTEMVTNFPFIIQADF 343

Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFREN-PAKAASVYMSFVPLVG 1537
            VL SSRE +  ++ WNQ +L   P+ F++A   F  L    E+ P    +    F+P+  
Sbjct: 344  VLASSRETILLDNKWNQGILDCVPSAFLNA---FISLVTTSEDVPVSTLTPMFKFLPINS 400

Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLR 1577
              +   + +   I +KL   N +  E  ++Q  +  PC+V R
Sbjct: 401  SSYPKLNVVRESIKAKLLTENIIPCESYSDQKIFRKPCEVGR 442


>gi|297734473|emb|CBI15720.3| unnamed protein product [Vitis vinifera]
          Length = 2156

 Score =  165 bits (418), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 133/438 (30%), Positives = 208/438 (47%), Gaps = 80/438 (18%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G  ++ +  +        L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 686  IEEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDNE 737

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y E V P+L  ++    I         ++ NNE+GFSA+NI ++C VG STKK +   GY
Sbjct: 738  YGEGVNPSLELVITSQDITDTGAPATLLIFNNEKGFSAKNIESICSVGRSTKKSNRKQGY 797

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVP--PFNIDMFCRL 1326
            IG KGIGFKSVF +T  P I SNG+ ++F+    S   +G+++P  V   P   D+    
Sbjct: 798  IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCSHSNLGYIVPEWVEQNPSLADI---- 853

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ-CIMFRN 1385
              K      +    T I LP +        +  +    S + P +LLFL +++   M+R 
Sbjct: 854  --KQIYGSHAVLPTTTIILPLKPD-----KIKPVKQQLSSIQPEVLLFLSKIKHYYMWRQ 906

Query: 1386 M--LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
               +     V R+  V + +IK++        F   Q+L  G+  P              
Sbjct: 907  KFPVRQENQVERRLGVEEWVIKLA--------FPIGQRLNRGMSSPG------------- 945

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
                           ++AFLP        FI+Q DFVL SSRE +  ++ WNQ +L   P
Sbjct: 946  ---------------IYAFLPTEMVTNFPFIIQADFVLASSRETILLDNKWNQGILDCVP 990

Query: 1503 ALFVSAERSFCDLPCFREN-PAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLI 1561
            + F++A   F  L    E+ P    +    F+P+    +   + +   I +KL   N + 
Sbjct: 991  SAFLNA---FISLVTTSEDVPVSTLTPMFKFLPINSSSYPKLNVVRESIKAKLLTENIIP 1047

Query: 1562 LEGNNNQ--WAPPCKVLR 1577
             E  ++Q  +  PC+V R
Sbjct: 1048 CESYSDQKIFRKPCEVGR 1065



 Score =  165 bits (417), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 124/421 (29%), Positives = 203/421 (48%), Gaps = 51/421 (12%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            ++ IRR +F +G  ++ +  +        L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IKEIRRTKFSIGGELNPLTED--------LHQAVKNLSAELYAKDVHFLMELIQNAEDND 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            YPE V P+L  ++    I         ++ NNE+GFSA+NI ++C VG STKK +   GY
Sbjct: 60   YPEGVNPSLELVITSQDITGTGASATLLIFNNEKGFSAKNIESICSVGRSTKKNNRKCGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVP--PFNIDMFCRL 1326
            IG KGIGFKSVF +T  P I SNG+ ++F+        +G+++P  V   P  +D   ++
Sbjct: 120  IGEKGIGFKSVFLITAQPYIFSNGYQIRFNEEPCPHSNLGYIVPEWVQWNP-TLDDIKQI 178

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
                 V        T I LP +        +  + +  S + P +LLFL +++    +  
Sbjct: 179  YGSHAV-----LPTTTIILPLKPD-----KIGPVKEQLSSIQPEVLLFLSKIKQFSVKKH 228

Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKL---RAGVIRPDVKTTEIALALTL- 1442
              D  +     I    I      E + ++++  QK    +   +   +   E+ + L   
Sbjct: 229  NEDPRLNTVNAI---SISTKDVTEKECSYYMWRQKFPVRQENQVERRLGVEELVITLAFP 285

Query: 1443 --QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
              Q  N G   P      V+AFLP        FI+Q DFVL SSRE +  ++ WNQ +L 
Sbjct: 286  FGQRLNRGISSP-----GVYAFLPTEMVTNFPFIIQADFVLASSRETILLDNKWNQGILD 340

Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNC 1559
              P+ F++A  S       ++ P    +    F+P+    +   + +  +I ++L   N 
Sbjct: 341  CVPSAFLNAFISLVTTS--QDVPVSTLTPMFKFLPIYSSPYPKLNDVRELIKAELLKKNI 398

Query: 1560 L 1560
            +
Sbjct: 399  V 399


>gi|449440953|ref|XP_004138248.1| PREDICTED: uncharacterized protein LOC101220895 [Cucumis sativus]
          Length = 1742

 Score =  165 bits (418), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 127/391 (32%), Positives = 196/391 (50%), Gaps = 71/391 (18%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G   + +  +        L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 47   IEDIRRSKFSIGGPPNPLTED--------LHQAVRNLSAELYTKDVHFLMELIQNAEDNE 98

Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y  +V+P+L FI+           + +++ NNE GFS +NI ++C VG STKK +   GY
Sbjct: 99   YSTSVKPSLEFIITSRDVTGTGAATTLLIFNNEIGFSFKNIDSICSVGRSTKKNNRERGY 158

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPFNIDMFCRLLS 1328
            IG KGIGFKSVF +T  P I SNG+ ++F+        +GFV+P  V         R+LS
Sbjct: 159  IGEKGIGFKSVFLITSQPYIFSNGYQIRFNEQPCPHCGVGFVVPEWVEE------NRILS 212

Query: 1329 --KDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
              K+    +S    T I LP +        +  +    S++HP +LLFL +++ +  R +
Sbjct: 213  NIKEIYGPQSILPTTTIVLPLKAD-----KIKPVKQQLSNIHPEVLLFLSKIKQLSVREV 267

Query: 1387 LND--------------SLVVIRKKIVGDGII-----KVSCGEDKMTWFVASQKLRAGVI 1427
              D              +  V RK I  +        + S G  + ++++  QK     +
Sbjct: 268  NEDPKSSTVNAISISSETNFVSRKNIDAESYTLHLSSEESVGGTQCSYYMWKQKFP---V 324

Query: 1428 RPDVKTT------EIALALTL---QESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
            +P+ +        E+ + L     Q  N G+  P      V+AFLP        FI+Q D
Sbjct: 325  KPENRVERRMGVGELVIILAFPNGQRLNGGDKSP-----GVYAFLPTEMITDFPFIIQSD 379

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            FVL SSRE +  ++ WNQ +L   P+ FV+A
Sbjct: 380  FVLSSSRETILLDNEWNQGILDCVPSAFVNA 410


>gi|449501472|ref|XP_004161377.1| PREDICTED: uncharacterized LOC101220895 [Cucumis sativus]
          Length = 1703

 Score =  165 bits (418), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 127/391 (32%), Positives = 196/391 (50%), Gaps = 71/391 (18%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G   + +  +        L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEDIRRSKFSIGGPPNPLTED--------LHQAVRNLSAELYTKDVHFLMELIQNAEDNE 59

Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y  +V+P+L FI+           + +++ NNE GFS +NI ++C VG STKK +   GY
Sbjct: 60   YSTSVKPSLEFIITSRDVTGTGAATTLLIFNNEIGFSFKNIDSICSVGRSTKKNNRERGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPFNIDMFCRLLS 1328
            IG KGIGFKSVF +T  P I SNG+ ++F+        +GFV+P  V         R+LS
Sbjct: 120  IGEKGIGFKSVFLITSQPYIFSNGYQIRFNEQPCPHCGVGFVVPEWVEE------NRILS 173

Query: 1329 --KDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
              K+    +S    T I LP +        +  +    S++HP +LLFL +++ +  R +
Sbjct: 174  NIKEIYGPQSILPTTTIVLPLKAD-----KIKPVKQQLSNIHPEVLLFLSKIKQLSVREV 228

Query: 1387 LND--------------SLVVIRKKIVGDGII-----KVSCGEDKMTWFVASQKLRAGVI 1427
              D              +  V RK I  +        + S G  + ++++  QK     +
Sbjct: 229  NEDPKSSTVNAISISSETNFVSRKNIDAESYTLHLSSEESVGGTQCSYYMWKQKFP---V 285

Query: 1428 RPDVKTT------EIALALTL---QESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
            +P+ +        E+ + L     Q  N G+  P      V+AFLP        FI+Q D
Sbjct: 286  KPENRVERRMGVGELVIILAFPNGQRLNGGDKSP-----GVYAFLPTEMITDFPFIIQSD 340

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            FVL SSRE +  ++ WNQ +L   P+ FV+A
Sbjct: 341  FVLSSSRETILLDNEWNQGILDCVPSAFVNA 371


>gi|224136274|ref|XP_002322288.1| predicted protein [Populus trichocarpa]
 gi|222869284|gb|EEF06415.1| predicted protein [Populus trichocarpa]
          Length = 1356

 Score =  164 bits (416), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 126/389 (32%), Positives = 192/389 (49%), Gaps = 64/389 (16%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IR+  F +G      E N L      L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEHIRKTTFSIGG-----EKNPLAPM---LDQAVKYLSAELYAKDIHFLMELIQNAEDNE 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
            Y E V+P+L F++    I         ++ NNE+GFSA+NI ++C+VGNSTKKG+   GY
Sbjct: 60   YLERVDPSLEFVITSRDITNTGAPATLLIFNNEKGFSAKNIESICNVGNSTKKGNRKRGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF +   P I SNG+ ++F+        +G+++P  V   P   D+    
Sbjct: 120  IGEKGIGFKSVFLIAAQPYIFSNGYQIRFNEKPCPHCNLGYIVPEWVDDSPSLSDI---- 175

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
              K      S    T + LP +        +N +    S +HP +LLFL +++ +  R  
Sbjct: 176  --KQIYGSASTLPTTTLILPLKPD-----KVNPVKQQLSSIHPEILLFLSKIKRLSVREE 228

Query: 1387 LND------SLVVI--------RKKIVGDGII-------KVSCGEDKMTWFVASQKLRAG 1425
              D      S V I        RK +  +           +   E + ++++  QK    
Sbjct: 229  NEDPRLNTVSAVAITKETNFVQRKNMDAESYTLHLSAEENIDEFEQECSYYLWKQKFPVR 288

Query: 1426 V-----IRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
                  +R +V+   I LA     + E  +  + +   ++AFLP     G  FI+Q DF+
Sbjct: 289  EDNKVDMRMEVEDWVITLAFP---NGERLHRGMKHSPGIYAFLPTEMVTGFPFIIQADFI 345

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            L SSRE +  ++ WNQ +L   P  F+ A
Sbjct: 346  LASSRETIRWDNIWNQGILDCVPFAFIEA 374


>gi|357513119|ref|XP_003626848.1| hypothetical protein MTR_8g011200 [Medicago truncatula]
 gi|355520870|gb|AET01324.1| hypothetical protein MTR_8g011200 [Medicago truncatula]
          Length = 1645

 Score =  164 bits (415), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 132/420 (31%), Positives = 206/420 (49%), Gaps = 59/420 (14%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI---------VV 1241
            L  A+  L  ELY++D HFL+EL+QNA+DN Y E V PTL F++    I         ++
Sbjct: 73   LHHAVKNLFAELYAKDVHFLMELIQNAEDNHYNEGVNPTLEFVITSDDITGTGAPATLLI 132

Query: 1242 LNNEQGFSAENIRALCDVGNSTKKGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
             NNE+GF ++NI ++C VG STKKG+ S+GYIG KGIGFKSVF VT  P + SNG+ ++F
Sbjct: 133  FNNEKGFFSKNIDSICSVGRSTKKGNRSSGYIGEKGIGFKSVFLVTAQPYVFSNGYQIRF 192

Query: 1301 DTS---EGQIGFVLPTLV--PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGI 1355
            +        +G+++P  V   P  +D+  ++       L +    T I LP ++   E +
Sbjct: 193  NEKPCPHCSLGYIVPEWVEEKPTLVDI-KKIYGAGKGSLRT----TTIVLPLKSDKVEPV 247

Query: 1356 AMNNIVDMFSDLHPSLLLFLHRLQCIMFR--------NMLNDSLV------VIRKKIVGD 1401
                     S  HP +LLFL +++ +  R        N + D  +      V RK +  +
Sbjct: 248  KQK-----LSSFHPEVLLFLTKIRHLSVREVSENPKQNTVTDVSISSEINFVTRKNMNAE 302

Query: 1402 G-IIKVSC-----GEDKMTWFVASQKL-----RAGVIRPDVKTTEIALALTLQES-NEGN 1449
               I +S       E + ++++  QK       A   R DV+   + LA   QE  + G 
Sbjct: 303  SYTIHLSAEENSDAEKECSYYMWKQKFPVRPENAVERRTDVEEWVVTLAFPNQERLHRGK 362

Query: 1450 YGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
              P      V+AFLP        FI+Q DFVL SSRE +  ++ WNQ +L   P+ F+ A
Sbjct: 363  SSP-----GVYAFLPTGMVTNFPFIIQADFVLASSRETILLDNKWNQGILDCVPSAFLDA 417

Query: 1509 ERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ 1568
             ++        + P  + +    F+P+       F+ +   I +KL   + + +E  + Q
Sbjct: 418  FKTLVIGSV--QAPVSSLASMFKFLPIESSTFEKFNYVRDKIKAKLVEESIVPIETYSKQ 475


>gi|320033070|gb|EFW15019.1| hypothetical protein CPSG_08207 [Coccidioides posadasii str.
            Silveira]
          Length = 2121

 Score =  164 bits (415), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 174/594 (29%), Positives = 272/594 (45%), Gaps = 91/594 (15%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
            EDA   +E IR ++ GLG  +SN  S  L+        AL  L+++LY  ++ FL+EL+Q
Sbjct: 19   EDAQQHIERIRHEK-GLG--LSNHLSRDLEN-------ALVILAKQLYKTNTRFLMELIQ 68

Query: 1216 NADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY---I 1272
            NADDN Y  +V P+LT   ++  + V  NE GF+ E++ ++C +G S +KG SA     I
Sbjct: 69   NADDNRYRRDVSPSLTLSYRKMHLRVDCNELGFAPEDVDSICRIGGS-QKGISADRMDCI 127

Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
            G KGIGFKSVF+V     I S  +  KFD +  ++G + P             + ++ P 
Sbjct: 128  GEKGIGFKSVFKVASVVWISSGKYSFKFDNAT-RLGMIAP-------------MWAQFPA 173

Query: 1333 QLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ----CIMFRNMLN 1388
            +  +   +  +RL  ++   E +    + D      P +L+FL++++    CI   N   
Sbjct: 174  EAPASGTSMLLRLLNKSVRDELLKEMRVCD------PKILIFLNQIRELKVCIAQDNGTT 227

Query: 1389 DSLVVIRK--KIV---GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
                + RK  K V   G  +  +  G D    FV  + + +G         EI LA  ++
Sbjct: 228  FERTISRKDDKDVDKPGQYVTTLRYG-DSSQQFVVIRYVPSGR---STDKAEILLAFPIK 283

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
             + E    P++  Q V+ FLP+R YGLKF++QG F L +SRE+++G+S  NQ L S  P 
Sbjct: 284  NNQE----PVIQSQCVYTFLPIRDYGLKFLVQGRFSLIASREDINGSSAENQRLRSLIPK 339

Query: 1504 LFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILE 1563
             FV A   F +   FR        ++  ++P++ E   FF+ L    +    +S   +LE
Sbjct: 340  AFVQAVDYFQESDYFR-------YIWPRYLPIIPE-ENFFTSLTEETI--FTLSQRKLLE 389

Query: 1564 GNNNQWAPPCKVLR---GWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS-LARALGIEE 1619
              + + A P  +      + D     LP   L       +LS+     DS L + LG+  
Sbjct: 390  SESGELATPGTLFYVPPRFRDPNGDPLP---LCSRTAFRYLSRKYFDEDSDLFQCLGVRP 446

Query: 1620 YGPKILLQIISSLCRTENGLRSMGLSW---LASWLNELYTISFHSSGQSSLQSGVETDLI 1676
               + LL  I S+             W   L+  LN           +  L++   T   
Sbjct: 447  LSLRDLLVDIKSVVNQYQNHSREATEWHIRLSKALNR----------EDMLRAYRHT--- 493

Query: 1677 DNLQRIPFIPLSDGTFSSVDE--GTIWLHS---DCSVFDGGFGLEAFPNLCAKL 1725
              L  IP IPL DG + S  E  G I+  S   D SV  G   +E  P+  + +
Sbjct: 494  --LHSIPLIPLRDGRWISASESAGIIFFPSALADLSVPSGLNSVEIHPDAASNV 545


>gi|209875783|ref|XP_002139334.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209554940|gb|EEA04985.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 2286

 Score =  164 bits (414), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 147/534 (27%), Positives = 237/534 (44%), Gaps = 78/534 (14%)

Query: 1161 IVESIRRDEFGLGPNISN--MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNAD 1218
            IV  I+ + FG+G +        N+L +    + R    LS+ LY   +HF LEL+QNAD
Sbjct: 686  IVSLIQLNRFGIGIDSRKDPQLENILLQYSKLIKRTGEKLSENLYCDRNHFQLELIQNAD 745

Query: 1219 DNIYP-----ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK-KGSSAGYI 1272
            DN Y      +   P+L FI    GI+VLNNE GF+  +I +L D+ NS+K        I
Sbjct: 746  DNFYEKLKYHQTSVPSLKFITGSGGILVLNNEDGFTPLDIMSLSDLANSSKIDCGKIRRI 805

Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR-LLSKDP 1331
            G+ G+GFKSVF +TD P I SNGF ++FD       ++ P  V     ++  + LL    
Sbjct: 806  GKFGLGFKSVFTITDTPYIFSNGFKIRFDAQSPYGAYIYPEWVSCEITELIPKDLLDISG 865

Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
             ++    WNT I LPF+ +    I           L P  LLF ++L  +   +   ++ 
Sbjct: 866  YEVNKSAWNTVIWLPFKARKLTRIH----------LKPEFLLFTNKL--VYLEHTQKNAT 913

Query: 1392 VVIRK--KIVGDG------IIKVSCGEDKMTWFVASQKLRAG---------VIRPDV--- 1431
            + I+K  +I+ D        I+           +  +KL  G         ++  D+   
Sbjct: 914  ISIKKCTEILDDNSYFTTIFIQEYIHSKDAERTIKKRKLLPGGRYWKRQFLLVDYDISSI 973

Query: 1432 --------------KTTEIALALTLQESNEGNYGPLLYQQ--PVFAFLPLRTYGLKFILQ 1475
                          +  +I + + L+     N G  L +Q   VFAFLP+R+YG  FI+ 
Sbjct: 974  CKQNHWLDREIYKNRQKKIVIGVELKADESTNTGVCLGKQYCDVFAFLPIRSYGFHFIIH 1033

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
             DF L SSRE +  N+ WN  L    P   + A +S+  L    +N +     ++S +PL
Sbjct: 1034 ADFELTSSREGIATNNEWNCLLRDCIPNALIHAIQSWRRL----DNLSMLHKSFLSVIPL 1089

Query: 1536 VGE-VHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWN-------DRAHSLL 1587
              + V  FF+ +       L+   C+  +    ++  P  VL  +N       D    + 
Sbjct: 1090 SSDYVDNFFAPIITKFHHMLKSVPCIYTQSGIFEY--PRNVLCSFNTYSGLSVDTISKIF 1147

Query: 1588 PDI-----LLQKHLGLGFLSKDI--VLSDSLARALGIEEYGPKILLQIISSLCR 1634
            P++     +L KH     +  D+   +  ++   LGI+     I++ +I  + R
Sbjct: 1148 PNLDELCFVLDKHCKKKIICSDVEKYVEHTIFDDLGIDSVNLVIIITLIDGMIR 1201


>gi|218185812|gb|EEC68239.1| hypothetical protein OsI_36251 [Oryza sativa Indica Group]
          Length = 2351

 Score =  164 bits (414), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 124/387 (32%), Positives = 195/387 (50%), Gaps = 60/387 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ + +G      E N L +    + +A+  LSQE+YS+D HFL+ELVQNA+DN 
Sbjct: 17   VERIRRERYYIG----REERNPLAED---MHQAVIYLSQEIYSKDVHFLMELVQNAEDNE 69

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            YP  V P+L F++  + I         ++ NNE+GFS  NI ++  VG STKKG+   GY
Sbjct: 70   YPSGVAPSLEFLVTSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTKKGNRDKGY 129

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRLLS 1328
            IG KGIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V         R + 
Sbjct: 130  IGEKGIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECGIGYIVPEWVESRPSLSDIRTIY 189

Query: 1329 KDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-L 1387
                 L +    T I LP +++      ++ +    S +HP +LLFL +++ +  +   L
Sbjct: 190  GSSKVLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQLSVKEENL 240

Query: 1388 NDSLVVI-------------RKKIVGDGI------IKVSCGEDKMTWFVASQKLRAGVIR 1428
            N     +             RK +  +        ++   GE++  +++  QK     ++
Sbjct: 241  NPKCSTVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQKFP---VK 297

Query: 1429 PD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLP 1481
            P+       +  E  + L        + G  L    V+AFLP        FI+Q DF+L 
Sbjct: 298  PENRVDKRAEIDEWVIILAFPYGQRLSRGKQL-SPGVYAFLPTEMVTNFPFIIQADFLLA 356

Query: 1482 SSREEVDGNSPWNQWLLSEFPALFVSA 1508
            SSRE +  +SPWN+ +L   P+ F++A
Sbjct: 357  SSREAILFDSPWNKGILECVPSAFLNA 383



 Score =  160 bits (404), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/387 (31%), Positives = 195/387 (50%), Gaps = 60/387 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ + +G      E N L +    + +A+  LSQE+YS+D HFL+ELVQNA+DN 
Sbjct: 761  VERIRRERYYIG----REERNPLAED---MHQAVTYLSQEIYSKDVHFLMELVQNAEDNE 813

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            YP  V P+L F++  + I         ++ NNE+GFS  NI ++  VG STKKG+   GY
Sbjct: 814  YPSGVAPSLEFLVSSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTKKGNRDKGY 873

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRLLS 1328
            IG KGIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V         R + 
Sbjct: 874  IGEKGIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECGIGYIVPEWVESRPSLSDIRTIY 933

Query: 1329 KDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-L 1387
                 L +    T I +P +++      ++++    S +HP +LLFL +++ +  +   +
Sbjct: 934  GSSKVLPT----TTIIIPLKSE-----KVDSVKKQLSSIHPEMLLFLSKIRQLSVKEENV 984

Query: 1388 NDSLVVI-------------RKKIVGDGI------IKVSCGEDKMTWFVASQKLRAGVIR 1428
            N     +             RK +  +        ++   GE++  +++  QK     ++
Sbjct: 985  NPKCSTVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQKFP---VK 1041

Query: 1429 PD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLP 1481
            P+       +  E  + L        + G  +    V+AFLP        FI+Q DF+L 
Sbjct: 1042 PENRVDKRAEIDEWVITLAFPHGQRLSRGKQM-SPGVYAFLPTEMVTNFPFIIQADFLLA 1100

Query: 1482 SSREEVDGNSPWNQWLLSEFPALFVSA 1508
            SSRE +  +S WN+ +L   P+ F++A
Sbjct: 1101 SSREAILFDSTWNKGILECVPSAFLNA 1127



 Score = 77.4 bits (189), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 69/260 (26%), Positives = 121/260 (46%), Gaps = 42/260 (16%)

Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
            GIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V         R +     
Sbjct: 479  GIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECGIGYIVPEWVESRPSLSDIRTIYGSSK 538

Query: 1333 QLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV 1392
             L +    T I LP +++      ++ +    S +HP +LLFL +++ +           
Sbjct: 539  VLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQLS---------- 579

Query: 1393 VIRKKIVGDGIIKVSC----GEDKMTWFVASQKLRAGVIRPD------VKTTEIALALTL 1442
             ++++ V      V+     GE++  +++  QK     ++P+       +  E  + L  
Sbjct: 580  -VKEENVNHKCSPVTLENGKGEEECGYYMWRQKFP---VKPENRVDKRAEIDEWVITLAF 635

Query: 1443 QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEF 1501
                  + G  +    V+AFLP        FI+Q DF+L SSRE +  +SPWN+ +L   
Sbjct: 636  PYGQRLSRGKQM-SPGVYAFLPTEMVTNFPFIIQADFLLASSREAILFDSPWNKGILECV 694

Query: 1502 PALFVSA----ERSFCDLPC 1517
            P+ F++A     +S  D+P 
Sbjct: 695  PSAFLNAFVALVKSGADVPA 714


>gi|392866750|gb|EAS30073.2| hypothetical protein CIMG_08498 [Coccidioides immitis RS]
          Length = 2143

 Score =  163 bits (413), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 173/594 (29%), Positives = 273/594 (45%), Gaps = 91/594 (15%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
            EDA   +E IR ++ GLG  +SN  S  L+        AL  L+++LY  ++ FL+EL+Q
Sbjct: 19   EDAQQHIERIRHEK-GLG--LSNHLSRDLEN-------ALVILAKQLYKTNTRFLMELIQ 68

Query: 1216 NADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY---I 1272
            NADDN Y  +V P+LT   ++  + V  NE GF+ E++ ++C +G S +KG SA     I
Sbjct: 69   NADDNRYRRDVSPSLTLSYRKMHLRVDCNELGFAPEDVDSICRIGGS-QKGISADRMDCI 127

Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
            G KGIGFKSVF+V     I S  +  KFD +  ++G + P             + ++ P 
Sbjct: 128  GEKGIGFKSVFKVASVVWISSGKYSFKFDNAT-RLGMIAP-------------MWAQFPA 173

Query: 1333 QLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ----CIMFRNMLN 1388
            +  +   +  +RL  ++   E +    + D      P +L+FL++++    CI   N   
Sbjct: 174  EAPASGTSMLLRLLNKSVRDELLKEMRVCD------PKILIFLNQIRELKVCIAQDNGTT 227

Query: 1389 DSLVVIRK--KIV---GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
                + RK  K V   G  +  +  G D    FV  + + +G         EI LA  ++
Sbjct: 228  FERTISRKDDKDVDKPGQYVTTLRYG-DSSQQFVVIRYVPSGR---STDKAEILLAFPIK 283

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
             + E    P++  Q V+ FLP+R YGLKF++QG F L +SRE+++G+S  NQ L S  P 
Sbjct: 284  NNQE----PVIQSQCVYTFLPIRDYGLKFLVQGRFSLIASREDINGSSAENQRLRSLIPK 339

Query: 1504 LFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILE 1563
             FV A   F +   FR        ++  ++P++ E   FF+ L    +    +S   +LE
Sbjct: 340  AFVQAVDYFQESDYFR-------YIWPRYLPIIPE-ENFFTSLTEETI--FTLSQRKLLE 389

Query: 1564 GNNNQWAPPCKVLR---GWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS-LARALGIEE 1619
              + + A P  +      + D     LP   L       +LS+     DS L + LG+  
Sbjct: 390  SESGELATPGTLFYVPPRFRDPNGDPLP---LCSRTAFRYLSRKYFDEDSDLFQCLGVRH 446

Query: 1620 YGPKILLQIISSLCRTENGLRSMGLSW---LASWLNELYTISFHSSGQSSLQSGVETDLI 1676
               + LL  + S+             W   L+  LN           +  L++   T   
Sbjct: 447  LSLRDLLVDVKSVVNQYQNHSREATEWHIRLSKALNR----------EDMLRAYRHT--- 493

Query: 1677 DNLQRIPFIPLSDGTFSSVDE--GTIWLHS---DCSVFDGGFGLEAFPNLCAKL 1725
              L+ IP IPL DG + S  E  G I+  S   D SV  G   +E  P+  + +
Sbjct: 494  --LRSIPLIPLRDGRWISASESAGIIFFPSALADLSVPSGLDSVEIHPDAASNV 545


>gi|436670130|ref|YP_007317869.1| hypothetical protein Cylst_6579 [Cylindrospermum stagnale PCC 7417]
 gi|428262402|gb|AFZ28351.1| hypothetical protein Cylst_6579 [Cylindrospermum stagnale PCC 7417]
          Length = 1205

 Score =  163 bits (413), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 160/661 (24%), Positives = 296/661 (44%), Gaps = 118/661 (17%)

Query: 1428 RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEV 1487
            R +VK TE+ LA +LQE  +G+    L +Q VFAFLP R+YG KF++Q DF++P++RE++
Sbjct: 79   RENVKYTELILAFSLQE--DGSADTRL-EQKVFAFLPTRSYGFKFLIQADFLVPANREDI 135

Query: 1488 DGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGL 1546
              ++ WN+W+       F+ A   F       +  +     Y +++PL  E+   FF  +
Sbjct: 136  HKDTQWNKWIRDNIATTFLLAVEKF-------KQDSNLQKTYYNYIPLNSEIKDEFFIPV 188

Query: 1547 PRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIV 1606
               I +KL+ S C++ E    +W  P  V++  +++   L+ +  LQK L   ++  ++ 
Sbjct: 189  VSEIHNKLKTSECILTES--GKWEIPSHVVQV-DEQIRKLISNADLQKLLNKEYIHPEVK 245

Query: 1607 LSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLA----SWLNELYTISFHSS 1662
               S+  +LG++++    LLQ     C   N        WL     +W  ELY       
Sbjct: 246  AKTSILESLGVKKFSFDDLLQ-----CLQNN-------EWLQKQSDNWFVELYIY----- 288

Query: 1663 GQSSLQSGVETDLID--NLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPN 1720
                L+S    D  D   ++++  IPL +   +S+ E +I+   +     G +  E    
Sbjct: 289  ----LKSCNLNDYSDLPKIKKLKIIPLENNQLASIAETSIFFPFNNV---GEYSFE---- 337

Query: 1721 LCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDE 1780
                L+ +   LL ++    S   V         L K+GVQ  S ++I++ HILP    +
Sbjct: 338  ----LQFIKKTLLESN----SKFAV------TEFLKKLGVQNASCYEIIENHILPLYKGD 383

Query: 1781 TTANGDKNLMADYLCFVMMHLEYY--------CPNCH-VEREFIVSELRKKAFVLTN-HG 1830
             +     N + +Y+C++  +L  Y         P  +   +E  +  L++   + T+ +G
Sbjct: 384  -SWKSKANQLLEYICYIKDNLSEYEKEFNKIKNPYSYSYSKEDPLKSLKELLLIKTDKNG 442

Query: 1831 FKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKW-----YEVDITYLKHPAKESLSCGLVKW 1885
            + RP    ++    +GNP  +  L+  I   W     Y  D+  +K+P+++  +  + +W
Sbjct: 443  YSRPGN--VYLPASYGNPNELETLLSGIQDVWFVSPEYIQDLMQIKNPSEK--AEKIREW 498

Query: 1886 RKFFKEIGITDFVQVVQVDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELV----HL 1941
            ++FF ++G+    ++     DV   +  G  N             ID+     +    H+
Sbjct: 499  KEFFIKLGVHTVPKI-----DVKIQTARGRSNQ-----------KIDYYKEYPIYSSPHI 542

Query: 1942 LSLLTTNVNRQCSKHLLEILDTLWD--------DCYTDKIMGFFKSKPTGDDRSFQSSFI 1993
            + +L T  N + ++ L + LD+ WD          Y     GF+   P   D      + 
Sbjct: 543  IRILETK-NVEKNQKLAKFLDSNWDYYKQYKSWQNYAYANGGFYSYNPNDAD------WF 595

Query: 1994 NCICDIQWTISSMDDELHYPKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCDIGLKTEV 2053
              I    W + +   +L  P ++F D     +ILG S PY+   + +   + D+G+K ++
Sbjct: 596  TKIKTAAW-LPTTKGKLANPSEVFLDKTETGTILGDSVPYLAISLNNPDFIKDLGIKEKI 654

Query: 2054 T 2054
            T
Sbjct: 655  T 655


>gi|224136270|ref|XP_002322287.1| predicted protein [Populus trichocarpa]
 gi|222869283|gb|EEF06414.1| predicted protein [Populus trichocarpa]
          Length = 1467

 Score =  163 bits (412), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 128/389 (32%), Positives = 195/389 (50%), Gaps = 64/389 (16%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IR+  F +G      E N L      L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEHIRKTTFSIGG-----EKNPLAPM---LDQAVKYLSAELYAKDVHFLMELIQNAEDNE 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
            Y E V+P+L F++    I         ++ NNE+GFSA+NI ++C+VGNSTKKG+   GY
Sbjct: 60   YLERVDPSLEFVITSRDITNTGAPATLLMFNNEKGFSAKNIESICNVGNSTKKGNRKRGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF +   P I SNG+ ++F+        +G+++P  V   P   D+    
Sbjct: 120  IGEKGIGFKSVFLIAAQPYIFSNGYQIRFNEKPCPHCNLGYIVPEWVDDSPSLSDI---- 175

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
              K      S    T + LP +        +N +    S +HP +LLFL +++ +  R  
Sbjct: 176  --KQIYGSASTLPTTTLILPLKPD-----KVNPVKQQLSSIHPEILLFLSKIKRLSVREE 228

Query: 1387 LND------SLVVI--------RKKIVGDG-IIKVSCGED------KMTWFVASQKLRAG 1425
              D      S V I        RK +  +   + +S  E+      + ++++  QK    
Sbjct: 229  NADPRLNTVSAVAITKETNFMQRKNMDAESYTLHLSADENSDEFEKECSYYLWKQKFP-- 286

Query: 1426 VIRP----DVKTTEIALALTLQESN-EGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
             +RP    D++       +TL   N E  +  + Y   ++AFLP        FI+Q DF+
Sbjct: 287  -VRPENRVDMRMGVDDWVITLAFPNGERLHRGMKYSPGIYAFLPTEMVTDFPFIIQADFI 345

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            L SSRE +  ++ WNQ +L   P  F+ A
Sbjct: 346  LASSRETIRWDNIWNQGILDCVPFAFIEA 374


>gi|449295343|gb|EMC91365.1| hypothetical protein BAUCODRAFT_152628 [Baudoinia compniacensis UAMH
            10762]
          Length = 1509

 Score =  161 bits (407), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 155/530 (29%), Positives = 236/530 (44%), Gaps = 73/530 (13%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-NEQGFS 1249
            L  AL  LS+ELYS+ +HF+LEL+QNADDN YPE+V P LT + +E G + +  NE GF+
Sbjct: 33   LTNALRILSEELYSKPTHFILELIQNADDNNYPEDVRPKLTLLYREDGYLWIGCNEIGFT 92

Query: 1250 AENIRALCDVGNSTKK--GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            A N+RALC +G+STKK   S  GYIG KGIGFKS F+V     + S     KFD  +  +
Sbjct: 93   AANVRALCGIGDSTKKVENSQKGYIGEKGIGFKSAFKVAAEIWVKSGAISFKFDKHK-PL 151

Query: 1308 GFVLPTLVPPF------NIDMFCRLL---SKDPVQLESKCWNTCIRLPFRTKFSEGIAMN 1358
            G + P     F         MFC  +    +  VQ         + L  R      +   
Sbjct: 152  GMIAPMWTDFFENAHIKERTMFCFHILPEHQGTVQENLLELKPELLLFLRKLH---VLQV 208

Query: 1359 NIVDMFSDLHPSLLLFLHRLQCIMFRNM-LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFV 1417
             I    +D+  S  L     +    R++ L   +   R +   +  +      D+M   +
Sbjct: 209  KIQKTSADVKHSFSLSKVEGETAGIRSITLRHQMFRPRNQTYTEAFLVF----DQMIENM 264

Query: 1418 ASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGD 1477
             ++K R     P+V  ++I LA  + + N+    P +  +  F FLP+R+YGL F+LQ D
Sbjct: 265  PTEKKR-----PNVMKSQIVLAFPVNDQNQ----PAMRSRLTFNFLPVRSYGLPFVLQAD 315

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
            F+L +SREE+   + WN  L++    +FV+    F      R    +        VP   
Sbjct: 316  FMLSASREEILLGNRWNAVLVAATIDVFVACVERFNSTNTLRYTWPRFTKPLG--VPYGS 373

Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPC-------KVLRGWNDRAHSLLPDI 1590
              + +F  L R      R+    +LE  +  +A P        +   G N  +  + PD 
Sbjct: 374  IFNSYFKELWR------RLMQEPVLEAQDGTFAKPALLQSVPLEFTDGGNPPSPLIYPD- 426

Query: 1591 LLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGL-RSMGLSWLAS 1649
                H  + +LS    +SD     L + +  P    ++I+    TE  L R+    W   
Sbjct: 427  ----HGPVAYLSTAYDVSD--LTELSVVQQTPDTFSKLITEYASTEPELFRAQPTPW--- 477

Query: 1650 WLNELYTISFHSS-GQSSLQSGVETDLIDNLQRIPFIPLSDGTF-SSVDE 1697
                      HS    + L+SG +       + +  IP+ DG + SS DE
Sbjct: 478  ----------HSRLADAILRSGTQ-----RFRSVAIIPMRDGEWLSSKDE 512


>gi|224122118|ref|XP_002318757.1| predicted protein [Populus trichocarpa]
 gi|222859430|gb|EEE96977.1| predicted protein [Populus trichocarpa]
          Length = 1713

 Score =  160 bits (404), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 130/416 (31%), Positives = 200/416 (48%), Gaps = 66/416 (15%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IR   F +G      E N L      L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEHIRETTFSIG-----REKNPLAPM---LDQAVKYLSAELYAKDVHFLMELIQNAEDNE 59

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGSSA-GY 1271
            Y E V+P+L F++    I         ++ NNE+GFSA+NI ++C VGNSTKKG+   GY
Sbjct: 60   YLEGVDPSLEFVITSRDITNTGAPATLLMFNNEKGFSAKNIDSICSVGNSTKKGNRKRGY 119

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF +T  P I SNG+ ++F+ +      +G+++P  V   P   D+    
Sbjct: 120  IGEKGIGFKSVFLITAQPYIFSNGYQIRFNENPCPHCNLGYIVPEWVHESPSLSDI---- 175

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRN- 1385
              K      S    T + LP +        +  +    S +HP +LLFL +++ +  R  
Sbjct: 176  --KQIYGSTSMLPTTTLILPLKPD-----KVTAVKQQLSSVHPEVLLFLSKIKRLSVRED 228

Query: 1386 -------------MLNDSLVVIRKKIVGDG-IIKVSCGEDK------MTWFVASQKLRAG 1425
                         +  ++  V RK I  +   + +S  E+        ++++  QK    
Sbjct: 229  NEDPSLNTVSAIAITKETNFVTRKNIDAESYTLHLSAEENDDEFAKGCSYYLWKQKFPVR 288

Query: 1426 V-----IRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG-LKFILQGDFV 1479
                   R +V+   I LA     + E     + Y   ++AFLP        FI+Q DF+
Sbjct: 289  QENRVDRRMEVEDWVITLAFP---NGERLLRGMKYSPGIYAFLPTEMVSNFPFIIQADFI 345

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
            L SSRE +  ++ WNQ +L   P  FV+A  S   +    + P  +      F+P+
Sbjct: 346  LASSRETIQWDNIWNQGILDCVPFAFVNALVSL--IKTVDDAPVSSLPPMFKFLPV 399


>gi|302913717|ref|XP_003050987.1| hypothetical protein NECHADRAFT_80792 [Nectria haematococca mpVI
            77-13-4]
 gi|256731925|gb|EEU45274.1| hypothetical protein NECHADRAFT_80792 [Nectria haematococca mpVI
            77-13-4]
          Length = 2071

 Score =  160 bits (404), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 127/399 (31%), Positives = 190/399 (47%), Gaps = 48/399 (12%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            +  A+  LS +LY + SHF+LE +QNADD  Y +  EPTL   L+++ +    NE GF+ 
Sbjct: 40   MDNAIGILSDDLYQKPSHFILEFLQNADDGHY-DTPEPTLAITLEDNTVRFDANEVGFTR 98

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
             N+ ALC +G+S+K  S     G KGIGFKSVF+V D   I S  +  KFD  E ++G V
Sbjct: 99   TNVDALCSLGSSSKVQSDQA-TGHKGIGFKSVFKVADKVWIKSGHYSFKFDARE-RLGTV 156

Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPS 1370
             P             + +K P   + +   T IRL    K +    ++ + D    L   
Sbjct: 157  DP-------------IWTKFPD--DEREGYTSIRLRLLPKLN----LSALADELFKLDAQ 197

Query: 1371 LLLFLHRLQCIMFRNMLNDSLVVIRKKI--------VGDGIIKVSCGEDKMTWFVASQ-- 1420
            LLLFL ++QCI    +         +++        + DG+ + +   D  + F+ S   
Sbjct: 198  LLLFLRKIQCIEVHVIPEKGQRARPRRVLKRSTDSRIPDGLQRFTLMSDYSSTFLVSSHR 257

Query: 1421 --KLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDF 1478
              KL     R   K ++I LA            P    Q V++FLP+R YG KFI+Q DF
Sbjct: 258  TIKLGHDNRRKGRKHSDILLAFPEMSFRH----PPTATQNVYSFLPIRNYGFKFIIQADF 313

Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE 1538
            +L +SRE++D +S WN  LL E P   ++A + F        N  +    ++ ++     
Sbjct: 314  LLTASREDIDNSSDWNIKLLDELPEAMLTAIKRF--------NSRRLYDAWLPYLIFDHV 365

Query: 1539 VHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLR 1577
              GFF  L + ILS L  S   IL+       PP ++ R
Sbjct: 366  QPGFFENLGQDILSFL--SKQPILKSITGDLKPPSELTR 402


>gi|66357810|ref|XP_626083.1| superfamily I helicase [Cryptosporidium parvum Iowa II]
 gi|46227162|gb|EAK88112.1| superfamily I helicase [Cryptosporidium parvum Iowa II]
          Length = 2023

 Score =  159 bits (403), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 143/575 (24%), Positives = 266/575 (46%), Gaps = 93/575 (16%)

Query: 1125 CGAICDTICGEASDD-GLGDCTTQTLPEDK------ECE-----------DAALIVESIR 1166
            C ++  TI  E  D     +C  +   ED+       CE           D    + SI+
Sbjct: 589  CNSLNLTISLEVEDSLEENECLVKMSTEDEFIFEELRCEYCSNYKSINIFDCKSFILSIQ 648

Query: 1167 RDEFGLGPNISNMESNMLKKQHARL-GRALHCLSQELYSQDSHFLLELVQNADDNIYPEN 1225
            + +FG+     + + N + + +  +  ++   LS +LY++ +HF+ EL+QNADDN Y   
Sbjct: 649  KQKFGISEKALDKDYNQIIQNYTSIINQSCKNLSIKLYTKINHFIFELIQNADDNQYCSC 708

Query: 1226 VE--PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVF 1283
            +   P++ F+  ++G++V+NNE GF+  +I ++CD+GNS+K  +    IG  GIGFKSVF
Sbjct: 709  IGKIPSIIFVFHKNGVLVINNEIGFTEGDISSICDIGNSSKVFNEKK-IGCFGIGFKSVF 767

Query: 1284 RVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCI 1343
             +T+ P I SNG+  KF+ +     ++ P  V     ++       +  ++E   + T  
Sbjct: 768  LITNTPFIFSNGYCFKFNLNSKHGSYIFPEWVDEELYNLIPNHKFHNEYEIEVSTYKTKF 827

Query: 1344 RLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRK--KIVGD 1401
             LP++    E I   ++      L+ +++LF ++L+ I  + + ND + VI +  K++  
Sbjct: 828  WLPYK----EDIIFEDL-----KLNDNIILFTNKLKRI--KLITNDRVTVITRSEKLISK 876

Query: 1402 GIIKVSCGEDKM------------------TWFVASQKLR----AGVIRPDVKTTEIALA 1439
             +I V+  +  M                  ++ + +           +  + K   +A+ 
Sbjct: 877  DLILVNIFKHIMCDPTNKKRKLSSNRNITKSFLIVNYNFEIPKNISNLVKNKKLNNLAIG 936

Query: 1440 LTLQESNEGNYGPLLY-QQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
            + +   ++   G   Y  + VF+FLP+R+YGLKFILQ DF L SSRE +  +S WN ++ 
Sbjct: 937  IEINNESDTFDGECNYDNKEVFSFLPIRSYGLKFILQADFELTSSRESISIDSNWNIYIR 996

Query: 1499 SEFP---ALFVSAERSFCDLPCFREN-----PAKAASVYMSFVPLVGEVHGFFSGLPRMI 1550
               P      +S  R        +++     P KA ++   F+P++ +++       + +
Sbjct: 997  EIIPNAIIYLISKLRETNGFHLLKKSFLGILPTKADNIDEFFIPIIPKIN-------KAL 1049

Query: 1551 LSKLRMSNC--LILEGNNNQWAPPCKVLRGWNDRAHSLLPDIL--------LQKHLGLGF 1600
            +++  +  C  + ++       P   V    + R   +L DI         L K     F
Sbjct: 1050 INEKCIYTCEKIFIQ-------PSNSVFINNDSRTFKILLDIFPNINEFSYLLKKYSNKF 1102

Query: 1601 LSKDIVLSDS---LARALGIEEYGPKILLQIISSL 1632
            L  +   +++   L R LGI E+   IL+ II  +
Sbjct: 1103 LINNAFANNTENILLRDLGITEFNIDILVDIIKGI 1137


>gi|342886436|gb|EGU86266.1| hypothetical protein FOXB_03201 [Fusarium oxysporum Fo5176]
          Length = 1581

 Score =  159 bits (403), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 155/523 (29%), Positives = 237/523 (45%), Gaps = 58/523 (11%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN---NEQG 1247
            L  +L  +S +LY   +HF+LEL+QNADDNIYP  V P+LT  L  +    L    NE G
Sbjct: 38   LQHSLDIVSHDLYQTPTHFILELIQNADDNIYPPEVTPSLTLTLSGAESTTLRIDCNEAG 97

Query: 1248 FSAENIRALCDVGNSTKKGSS---AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
            F+ E I AL  +G STK  SS     YIG KGIGFKSVF+  D  +I S  +  +FD  E
Sbjct: 98   FTFEQIDALAGIGASTKTASSHRQRRYIGEKGIGFKSVFKAADVVKIASGYYEFRFDRRE 157

Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMF 1364
              +G VLP +  PF ++      ++  + L+ K  +  IR   R      + +       
Sbjct: 158  S-LGMVLP-IASPFPVEHRLSNHTQFLLLLKDKQTHQRIRADLRV-IEPHLLIFLRRLRR 214

Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFV---ASQK 1421
              +       + ++Q  M    L + + +   + VG       C   K  + V   +  +
Sbjct: 215  LRIQTDDAQRVFQVQYDMSNTTLGEIVTITTSEGVG-------CSSSKTDYIVVRSSCDQ 267

Query: 1422 LRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLP 1481
            +     R  V+TTEI +A  +    +G+  P + +Q  FAFLP+  +G KF++  DF+L 
Sbjct: 268  MPDNEKRKGVQTTEITIAFPI----DGSERPKIGKQKAFAFLPVDDFGFKFLIHADFLLV 323

Query: 1482 SSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG 1541
            +SRE +D +  WN+ L       F+ A + F  +P     P     ++  +       H 
Sbjct: 324  ASREGLDYDCQWNETLHKAIRRAFLLAVKRFASVPASEPGPG-LRYLWPKYTKHHRASHN 382

Query: 1542 FFSGLPRMILSKLR----MSNCLILEGNNNQWAPPCK---VLRGWNDRAHSLLPDILLQK 1594
            F++ L + IL  LR    + +C + E +      P +   V   +    H+L     L+K
Sbjct: 383  FWNQLHQDILHDLRYEPILESCDLSECHRR----PTELRFVPHDFRYGGHALFDSPSLRK 438

Query: 1595 HLGLGFLSKDIVLSDSLARALGIEEYGPKILLQI--ISSLCRTE-NGLRSMGLSWLASWL 1651
               L F   D+           IEE  P + LQ   I  LC    + +R +G+S LA+  
Sbjct: 439  -THLSFQYDDV-----------IEELSP-LGLQTSSIQDLCNDFCSWIREVGVSVLATKT 485

Query: 1652 NELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSS 1694
             E +          SL  G    L D L  +P IPL DG+++S
Sbjct: 486  PEWH------RKVVSLFCG-HKKLKDQLLSLPIIPLRDGSWTS 521


>gi|242085496|ref|XP_002443173.1| hypothetical protein SORBIDRAFT_08g014300 [Sorghum bicolor]
 gi|241943866|gb|EES17011.1| hypothetical protein SORBIDRAFT_08g014300 [Sorghum bicolor]
          Length = 1948

 Score =  159 bits (401), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 157/517 (30%), Positives = 240/517 (46%), Gaps = 106/517 (20%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ------ 1215
            VE IRR+ F +G      E N L +    + +A+  LS+ELYS+D HFL+EL+Q      
Sbjct: 14   VERIRRERFYIGRG----ERNPLAED---IHQAVGYLSEELYSKDVHFLMELIQIILVIA 66

Query: 1216 ----------------------NADDNIYPENVEPTLTFILQE---------SGIVVLNN 1244
                                  +A+DN YP +V P L F++ E         S +VV NN
Sbjct: 67   VENENDCNQVANCMIVCANVKQHAEDNDYPLDVSPALEFVITEKDITATGARSTLVVFNN 126

Query: 1245 EQGFSAENIRALCDVGNSTKKGSS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-- 1301
            E+GFSA NI ++C +G STKKG+   GYIG KGIGFKSVF V+  P I SNG+ +KF+  
Sbjct: 127  ERGFSAANIDSICRIGRSTKKGNRHLGYIGEKGIGFKSVFLVSGQPHIFSNGYQIKFNEK 186

Query: 1302 -TSEGQIGFVLPTLV--PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMN 1358
             +++  IG+++P  V   P NID   R +      L +    T I LP +T       + 
Sbjct: 187  PSADCDIGYIVPEWVDGKP-NIDDI-RTVYGPSRSLPT----TVIILPLKTD-----KIL 235

Query: 1359 NIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVI--------------RKKIVGD--- 1401
             +    +  HP +LLFL +++ +  R M +D                   RK I  +   
Sbjct: 236  AVKKELNSTHPEILLFLSKIRQLSVREMNHDPKASKISQISISSEVDYRSRKDIDAESYT 295

Query: 1402 ---GIIKVSCGE-DKMTWFVASQKLRAGVIRPDVKTTE--------IALALTL-QESNEG 1448
                + + S GE ++ T+++  QK     ++PD +  +        + LA    Q  + G
Sbjct: 296  LHLAMQENSKGETEECTYYMWKQKF---AVQPDCRIQKRMEVDQWVVTLAFPHGQRLSRG 352

Query: 1449 NYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
               P      V+AFLP      L FI+Q DF+L SSRE +  +S WN  +L   P+ FV+
Sbjct: 353  AKSP-----GVYAFLPTEMVTNLPFIIQADFLLASSRESILFDSKWNHGILECVPSAFVN 407

Query: 1508 AERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNN 1567
            A  +   L    + P  A +    F+P+       F  +   I +K+   + +  E    
Sbjct: 408  AFGAL--LKSSGDAPHFALAPIFRFLPIQDPSISLFDNVRLSIKTKVEAEDIMPCESCTT 465

Query: 1568 Q--WAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLS 1602
            Q  +  P +V R   D A   + ++  ++ +GL  LS
Sbjct: 466  QKVFCKPTEVSRL--DSAFWRILNLAQKQGIGLQNLS 500


>gi|167521057|ref|XP_001744867.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776481|gb|EDQ90100.1| predicted protein [Monosiga brevicollis MX1]
          Length = 3466

 Score =  158 bits (400), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/338 (30%), Positives = 174/338 (51%), Gaps = 50/338 (14%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            L +A   ++ ++Y  D+HFL+EL+QNA D  Y   V P L+ I++   + V  NE GF  
Sbjct: 1326 LHKATILVATDMYGSDTHFLMELIQNAADCTY-TGVTPCLSIIVKRDCVAVEYNEDGFGR 1384

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
            E++ A+C + +STK G   G+IG+KGIGFKSVF+V   PE+HS G+H +F+    ++G++
Sbjct: 1385 EDVMAICSICSSTKAGGKQGFIGQKGIGFKSVFKVATQPEVHSRGYHFRFNAENPKLGYL 1444

Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI------VDMF 1364
            +P  +            +K P   E +   T I LP     + G +  +I        + 
Sbjct: 1445 VPERI------------AKPPDFSEER--GTRIVLPRSAVRNSGTSFASIDLQDISQKLL 1490

Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLN---------DSLVV-----------IRKKIVGDGII 1404
            +   P +LLFL +L+ +   N+++         D+  V           +R  +      
Sbjct: 1491 TAFQPDMLLFLPKLRTV---NIVDGDCAFDSKPDTRSVHVELHGSQWRELRTTLYRPAND 1547

Query: 1405 KVSCGEDKMTWFVASQKLRAGVIRPDVKTTE--IALALTLQESNEGNYGPLL--YQQPVF 1460
             VS    +  W + + K  A  +R      E  +AL + L E+  G+    +  YQ+ VF
Sbjct: 1548 TVSETRSEF-WLMHTVKREAVNLRQHAAEAELTVALPVVLDEALLGHEASDMPGYQR-VF 1605

Query: 1461 AFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
            AFLP++ + L+F +  D+ +P+SRE++D  SPWN++L+
Sbjct: 1606 AFLPMKEFNLRFAVHADWHVPASREDIDSTSPWNKFLI 1643


>gi|392310012|ref|ZP_10272546.1| hypothetical protein PcitN1_15279 [Pseudoalteromonas citrea NCIMB
            1889]
          Length = 1648

 Score =  156 bits (394), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 137/505 (27%), Positives = 234/505 (46%), Gaps = 65/505 (12%)

Query: 1152 DKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
            D+  + A   ++  +R ++    +    E N+ +   +    AL  LS++LYS+D HF+ 
Sbjct: 2    DQAVKTAKSQIDEFKRQKYHF--DTETGECNLPEDAKSDYDNALKILSEQLYSKDIHFIF 59

Query: 1212 ELVQNADDNIYPENVEPTLTFILQESG----------IVVLNNEQGFSAENIRALCDVGN 1261
            EL+QNA+DN Y +   P L+F L E            +V+ NNE GF   NI+A+  +G 
Sbjct: 60   ELIQNAEDNHYGKIATPELSFELLEYDPTNTPGCTGCLVIKNNENGFDINNIKAISSIGK 119

Query: 1262 STKKG-SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNI 1320
            STK     AGYIG KGIGFKSVF V+ AP I SNGF +KF   + + G  L  +VP +  
Sbjct: 120  STKANQKDAGYIGEKGIGFKSVFVVSPAPHILSNGFQIKFLKDDPKTG--LGYIVPYWLE 177

Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTK-FSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
            D   +  + +          T + LP + K  SE    + +    S L P L+LFL +L+
Sbjct: 178  DDLSKFYNGN---------GTTLLLPLQNKQGSESRMFDKVQHELSKLSPELVLFLRKLK 228

Query: 1380 CIMFR--NMLNDSLVVIRKKIVGDGI---IKVSCGEDKMTWFVASQKLRAGV-----IRP 1429
             +  +  N   D  +       GD +    + S     + + + SQ +         +R 
Sbjct: 229  KLSIKTPNYYADYHLTKD----GDYVELNTQSSSQNTSVKYLLKSQTISVPADAVNELRE 284

Query: 1430 DVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT-YGLKFILQGDFVLPSSREEVD 1488
            +V   +I++A  +      N+   L + P++ +LP  +  GL F++  DF+L +SRE + 
Sbjct: 285  NVTKRDISIAFPV------NFE--LEESPLYCYLPTESETGLPFLVNADFILSASRESIR 336

Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG-FFSGLP 1547
             +  WN W+  E       A ++   +   + + ++ A     +VP+ G     +++ + 
Sbjct: 337  QDLTWNNWMRGEVANF---AAKTISKILSQKGDISRWA-----WVPVFGSNRVIYWNNVR 388

Query: 1548 RMILSKLRMSNCL-ILEGNNNQWAPPCKVLRGWNDRA--HSLLPDILLQKHLGLGFLSKD 1604
              +   L++S C+  L G   Q+  P KVL   N     HS    ++  +         D
Sbjct: 389  ERVEEALKVSKCIPTLSG---QYELPSKVLSLGNTYGFLHSWPLSVI--ERFPFDLFWDD 443

Query: 1605 IVLSDSLARALGIEEYGPKILLQII 1629
              L   ++ A+G++ +G K  +  +
Sbjct: 444  SSLISRVSSAIGLKSFGQKDFVSFL 468


>gi|356504434|ref|XP_003521001.1| PREDICTED: uncharacterized protein LOC100800765 [Glycine max]
          Length = 1675

 Score =  156 bits (394), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 154/522 (29%), Positives = 229/522 (43%), Gaps = 103/522 (19%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IRR +F +G      E N L++    L +A   LS ELY++D HFL+ELVQNA+DN 
Sbjct: 9    IEKIRRTKFSIGG-----EPNPLRED---LHQATKNLSTELYAKDVHFLMELVQNAEDNK 60

Query: 1222 YPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKGS-SAGY 1271
            Y E V P+L FI+    I         ++ NNE+GFS ENI ++C VG STKKG+ S+GY
Sbjct: 61   YAEGVSPSLEFIITSKDITATGASATLLIFNNEKGFSRENIESICSVGRSTKKGNRSSGY 120

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG KGIGFKSVF +T  P I SNG+ ++F         IG+++P  V   P   D+  ++
Sbjct: 121  IGEKGIGFKSVFLLTAYPYIFSNGYQIRFSEKPCPHCDIGYIVPEWVEQKPTLHDI-KQI 179

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
                   L +    T I LP +        +  +    S++HP +LLFL +++ +  R  
Sbjct: 180  YGAGAGSLPT----TTIILPLKPD-----KVKPVKHQLSNIHPEVLLFLSKIRHLSVRED 230

Query: 1387 LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
              D       K+     + +S   +  T             R ++      L L+  E N
Sbjct: 231  NEDP------KLNSVNAVSISSEINFCT-------------RKNMNAESYTLHLSAAE-N 270

Query: 1447 EGNYGPLLYQQPVFAFLPLR----------------TYGLKFILQGDFVLPSSREEVDGN 1490
             G+     Y      F P+R                     FI+Q DFVL SSRE +  +
Sbjct: 271  GGSEKECRYYMWKQKF-PVRFENVVDRREDVEELIMVTNFPFIIQADFVLASSRETILLD 329

Query: 1491 SPWNQWLLSEFPALFVSAERSF----------CDLPCFRENPAKAASVYMSFVPLVGEVH 1540
            S WNQ +L   P+ F+ A ++           C  P FR            F+P+     
Sbjct: 330  SKWNQGILECVPSAFMDAFKTLVLGSDQAPISCLPPVFR------------FLPIDSSPF 377

Query: 1541 GFFSGLPRMILSKLRMSNCLILE--GNNNQWAPPCKVLR----GWNDRAHSLLPDILLQK 1594
               + +   I  K+   + + +E  G    +  PC+V R     W+  A +    + L  
Sbjct: 378  EKLNYVREKIKEKMLGESIIPIETYGEQKHFYKPCEVSRLLPEFWDILAEAQAERVYLHN 437

Query: 1595 ---HLGLGFLSK--DIVLSDSLARALGIEEYGPKILLQIISS 1631
               H G   LS   D +  D +   LGI++       + I S
Sbjct: 438  LSSHNGRKILSSSFDKIEYDDILSFLGIQQVNTDWYAKCIQS 479


>gi|436670129|ref|YP_007317868.1| hypothetical protein Cylst_6577 [Cylindrospermum stagnale PCC 7417]
 gi|428262401|gb|AFZ28350.1| hypothetical protein Cylst_6577 [Cylindrospermum stagnale PCC 7417]
          Length = 268

 Score =  155 bits (392), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/231 (38%), Positives = 129/231 (55%), Gaps = 23/231 (9%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            ++E IR++ +G+G         ++      L  AL  LS +LYS+++HF+LEL+QNADDN
Sbjct: 7    LIEDIRKNTYGIGLQTDQAAQGVIDNFRRSLNSALERLSVDLYSKETHFVLELIQNADDN 66

Query: 1221 IYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFK 1280
             Y   V PTL   ++   I+V NNE GFS +N+RA+C+VG STK     GYIG KGIGFK
Sbjct: 67   QYQSGVIPTLNLTIEPQKIIVQNNELGFSEDNVRAICNVGRSTKT-KVEGYIGEKGIGFK 125

Query: 1281 SVFRVTDAPEIHSNG--FHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
            SVFR++D P+I SNG  FH K   +E Q+GFV P  +               P  ++S  
Sbjct: 126  SVFRISDEPQIFSNGFQFHFKRQDNENQLGFVTPYWIEAI------------PNYVDSTL 173

Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND 1389
             N  I LP R    + ++         ++  +L+ FL +L+ +   N +N+
Sbjct: 174  TN--IILPLRESAKDELS------KLGEIEHTLIFFLRQLKTVTIENRVNN 216


>gi|350630573|gb|EHA18945.1| hypothetical protein ASPNIDRAFT_187731 [Aspergillus niger ATCC 1015]
          Length = 1659

 Score =  154 bits (390), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 168/621 (27%), Positives = 265/621 (42%), Gaps = 112/621 (18%)

Query: 1156 EDAALIVESIRRDEFGLGPNI---------SNMESNMLKKQHARLGRALHCLSQELYSQD 1206
            E A  +VE I +    LG ++           +E  ML+K    +G ++  LS+ LY+  
Sbjct: 9    ERARKLVEGISKGHGYLGEDVLSQMSDEVRRQVEEAMLRKDEM-IGSSVVTLSKNLYNSS 67

Query: 1207 SHFLLELVQNADDNIYPEN----VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
            + F+ ELVQNADDN Y         P+L F +    IVV  NE GF+ EN+ A+C+VG S
Sbjct: 68   ARFVFELVQNADDNHYTTARGGATVPSLAFHVYPRRIVVDCNEDGFTHENLVAICNVGKS 127

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
            +K G+  GYIG KGIGFKSVF V     I S  F   F    G  G           + M
Sbjct: 128  SKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSGDFSFSFRHKPGGSG-----------MGM 175

Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF---LHRLQ 1379
               +      QL+       + L       +       +  F +L  +LLLF   L RL+
Sbjct: 176  ISPIWEDVEEQLQRPMTRMTLYL-----HDDAAPRETAMQQFHELQDTLLLFTKNLQRLE 230

Query: 1380 CIMFRN-------------MLNDSLVVIRKKIVG-DGIIKVSCGEDKMTWFVA------- 1418
              M+ +               N++L V+ K ++  DG  +       +T +VA       
Sbjct: 231  VSMYDDDEEKLSSTVFAISSQNENLKVLTKTLLARDGTSEELTRNYHLTKYVAHDLPKSE 290

Query: 1419 -------SQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
                    Q++RA          +I LA  + + +     P++  Q VFAFLP+R  G  
Sbjct: 291  NRNYTDKEQEIRA------YSRADIILAFPVTKDS----VPVIEPQDVFAFLPIRNMGFT 340

Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
            F++  DFV  +SR+++  +S  N+ L +E  + F +A    C  P  R         +M 
Sbjct: 341  FLIHSDFVTDASRQDIVRSSARNRKLRTEIASAFNAAVLEMCKHPSLR-------YTWMR 393

Query: 1532 FVPLVGEVHG--FFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWND-----RAH 1584
            ++P     H   F+ GL   +  K R+++  IL+   +    P + +R   D        
Sbjct: 394  YLPQRDGHHWDPFWMGLLDEL--KSRLTSSPILQSRTSGLWFPIERMRCLQDFLLDKSGE 451

Query: 1585 SLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGL----- 1639
             L PD+  +K+L   +L +DI      AR   + EYG  + + +  S+ R    L     
Sbjct: 452  PLFPDLPREKYLSAHYLMEDI------AR---LHEYG-LLFMSMRESIKRARMDLELSHS 501

Query: 1640 RSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGT 1699
            +         WL+    +        S + G+ T  I+ ++ +P IPL  G + S+ +G 
Sbjct: 502  KMKNTDTSEDWLSRAAMVL-----SKSFRQGL-TRRIEEIKSMPLIPLITGQWVSIQKGA 555

Query: 1700 IWLHSDCSV---FDGGFGLEA 1717
            ++  S   +    D G GL A
Sbjct: 556  VYFSSINGIEIPSDLGIGLVA 576


>gi|224136262|ref|XP_002322285.1| predicted protein [Populus trichocarpa]
 gi|222869281|gb|EEF06412.1| predicted protein [Populus trichocarpa]
          Length = 1692

 Score =  154 bits (389), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 129/403 (32%), Positives = 197/403 (48%), Gaps = 60/403 (14%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E IR+  F +G      E N L      L +A+  LS ELY++D HFL+EL+QNA+DN 
Sbjct: 8    IEHIRKTTFSIGG-----EKNPLAPM---LDQAVKYLSAELYTKDVHFLMELIQNAEDND 59

Query: 1222 YPENVE------PTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTKKG 1266
            Y E V+      P+L F++    I         ++ N+E+GFSA+NI ++C VGNSTKKG
Sbjct: 60   YLEGVDYLEGVDPSLEFVITSRDITATGAPATLLIFNSEKGFSAKNIESICSVGNSTKKG 119

Query: 1267 SSA-GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE---GQIGFVLPTLVP--PFNI 1320
            +   GYIG KGIGFKSVF +T  P I SNG+ ++F+        +G+V+P  V   P   
Sbjct: 120  NRKRGYIGEKGIGFKSVFLITPQPFIFSNGYQIRFNEKPCPLCNLGYVVPEWVEENPSLS 179

Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
            D+      K      S    T I LP +        +  +    S +HP +LLFL +++ 
Sbjct: 180  DI------KQIYGSNSTLPTTTIVLPLKPN-----KVKPVKQQLSSIHPEVLLFLSKIKS 228

Query: 1381 IMFRNMLND------SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
            +  R    D      S + I K+   +   + S   +  T  +++++        D +  
Sbjct: 229  LSVREQNEDPRLNTVSAIAITKRT--NFRARESMDAESYTLHLSAEENNT-----DEQDR 281

Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPW 1493
            E + ++  Q+      G  L    ++AFLP      L FI+Q DF+L SSRE +  +  W
Sbjct: 282  ECSYSVWKQKFPRLRRGRSL--PGIYAFLPTEMVTNLPFIIQADFILASSRETILLDDNW 339

Query: 1494 NQWLLSEFPALFVSAERSFCDLPCFREN-PAKAASVYMSFVPL 1535
            NQ +L   P  F++A  S   +   RE+ P         F+P+
Sbjct: 340  NQGILDCVPLAFINALVSLVKM---REDAPVSRLPRLFQFLPI 379


>gi|291243339|ref|XP_002741560.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 1441

 Score =  154 bits (389), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 129/518 (24%), Positives = 239/518 (46%), Gaps = 46/518 (8%)

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
            GDF +PSSRE+VD +S WNQWL  E P LF+ A ++F + P F E   +A   ++ +VPL
Sbjct: 15   GDFDVPSSREDVDRDSAWNQWLRGELPTLFIDALQTFKNRPEFSE--LEAICKFLQYVPL 72

Query: 1536 VGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ----WAPPCKVLRGWNDRAHSLLPDIL 1591
              E+ GFF  + + I+ +L+ S CL  + + ++    W  P ++    +     ++   L
Sbjct: 73   EDEILGFFKPVAKQIIQQLKASPCLPADVSTDEDLYAWKLPSELAVTRDPLLREVITSDL 132

Query: 1592 LQKHLGLGFLSKDIVLSDS--LARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLAS 1649
            L  HL   ++   I+ S S  L   LG++      L+ I  ++  +  G  ++ + ++A 
Sbjct: 133  LHHHLNRYYVHPKILSSISMPLINMLGVQTLSINHLIDIGKAISASIEG-NNITIGFVAK 191

Query: 1650 WLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVF 1709
            WL  L+ +        S  +  E D+I  +  I  IPLS+G F S+ E +++        
Sbjct: 192  WLACLHRL------MESEFNIPEDDIIKQVTNIRIIPLSNGGFVSLSEFSVFFPV----- 240

Query: 1710 DGGFGLE----AFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQLSA 1765
              G G +     F    + L+ +   L +      + L  +    ++ +L  +GV ++  
Sbjct: 241  -SGDGRQQSKSQFITKQSHLKYLESDLNTVPMELLTCLDEVGNSQVHSVLKILGVCEMKP 299

Query: 1766 HDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFV 1825
             +++  HI+P +  +     D+ L+  YL ++    E     C ++      EL+  A +
Sbjct: 300  REVIHNHIIPVLRSDKWQKKDRELLMQYLIYIKEEHEKTPSVCDLD------ELKTIARI 353

Query: 1826 LTNHGFKRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPAKESLSCGLVKW 1885
            +TNHG K P  IP+HF  ++ N ++    +   D  W  +D  YL+   + +    +  W
Sbjct: 354  ITNHGLKNPHSIPVHFTTDYANKLNFQKDLPGYD--WVLLDSCYLQ---RNNSQQNVSSW 408

Query: 1886 RKFFKEIGITDFV-----QVVQVDKDVADISHTGFKNMWTKELLSPGSAAI-DWESNELV 1939
            + FF ++G+ + +     +VV  + ++A          W +     G   I D+  +E  
Sbjct: 409  KSFFIKLGVLNSLAVKKEKVVLQESEIAHSPWAAVYKFWPRS--QDGCFYIEDYICDEFH 466

Query: 1940 HLLSL--LTTNVNRQCSKHLLEILDTLWDDCYTDKIMG 1975
             L++   +TT      S+ LL +L+  W   Y+  + G
Sbjct: 467  SLVTAENITTEQRLHQSRILLRLLEEEWQSTYSKYLNG 504


>gi|317037598|ref|XP_001398736.2| hypothetical protein ANI_1_1220164 [Aspergillus niger CBS 513.88]
          Length = 1671

 Score =  154 bits (389), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 207/828 (25%), Positives = 338/828 (40%), Gaps = 145/828 (17%)

Query: 1156 EDAALIVESIRRDEFGLGPNI---------SNMESNMLKKQHARLGRALHCLSQELYSQD 1206
            E A  +VE I +    LG ++           +E  ML+K    +G ++  LS+ LY+  
Sbjct: 9    ERARKLVEGISKRHGYLGEDVLSQMSDEVRRQVEEAMLRKDEM-IGSSVVTLSKNLYNSS 67

Query: 1207 SHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
            + F+ ELVQNADDN Y    E    P+L F +    IVV  NE GF+ ENI A+C+VG S
Sbjct: 68   ARFVFELVQNADDNHYTTARERATVPSLAFHVYPRRIVVDCNEDGFTHENIVAICNVGKS 127

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
            +K G+  GYIG KGIGFKSVF V     I S  F   F    G  G           + M
Sbjct: 128  SKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSGDFSFSFRHKPGGSG-----------MGM 175

Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF---LHRLQ 1379
               +      QL+       + L   T   E  AM      F +L  +LLLF   L RL 
Sbjct: 176  ISPIWEDVEEQLQRPMTRMTLYLHEDTAPRE-TAMQQ----FRELQDTLLLFTKNLQRLG 230

Query: 1380 CIMFRN-------------MLNDSLVVIRKKIVG-DGIIKVSCGEDKMTWFVA------- 1418
              M+ +               N++L V+ K ++  DG  +       +T +VA       
Sbjct: 231  VFMYDDDEKQLSSTVFAISSQNENLKVLTKTLLAPDGTSEELTRNYHLTKYVAHDLPGSE 290

Query: 1419 -------SQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
                    ++ RA          +I LA  + +    N  P++  Q VFAFLP+R  G  
Sbjct: 291  NRTYTDKERETRA------YSRADIILAFPVTK----NSVPVIEPQDVFAFLPIRNMGFT 340

Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
            F++  DFV  +SR+++  +S  N+ L +E    F +A       P  R         +M 
Sbjct: 341  FLIHSDFVTDASRQDIVRSSARNRKLRTEISIAFGTAVLEMRKHPTLRYR-------WMR 393

Query: 1532 FVPLVGEVHG--FFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGW----NDRAHS 1585
            ++P     H   F+ GL   I S+L +S  ++    +  W P  ++ R W    +     
Sbjct: 394  YLPKRDGHHWDPFWVGLLGEIKSRL-ISLPILQSRTSGSWFPIEQMERLWEHWFDKSGDP 452

Query: 1586 LLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLS 1645
            L PD+  +++L + +L KDI           + EYG  +    +  +    N    M L 
Sbjct: 453  LFPDLPREQYLSVHYLEKDIT---------RLMEYG--LTFMAMPGMIARAN----MDLE 497

Query: 1646 WLASWL-NELYTISFHSSGQSSLQSGVE---TDLIDNLQRIPFIPLSDGTFSSVDEGTIW 1701
            +  S + N+  +  +HS   ++L         + I+ ++ +P IPL  G + S+  G ++
Sbjct: 498  FPHSKMKNKETSEDWHSRAATALSHSFRLGYDNCIEEVKAMPLIPLKTGQWVSIKSGAVY 557

Query: 1702 LHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQ 1761
                   F    G++   +L   L T                G I+     ++   IGVQ
Sbjct: 558  -------FPSIHGIDIPSDLGVGLVTS---------------GAINNPQRKQLFKDIGVQ 595

Query: 1762 QLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRK 1821
            + S   + K+     I D  +A G   +         ++L   C          V     
Sbjct: 596  EASVDFVRKL-----IKDNYSAYGIFFIPVSREHLTFLYLT-ECLASSSSTALFVHLFNG 649

Query: 1822 KAFVLTNHGF--KRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPA-KESL 1878
            K    ++H +  ++P    ++F  E  +P     L+  I      +D++++     ++  
Sbjct: 650  KGLAFSDHKYQIRQPKIHTVYFPNE--DPYGAKALLEPIQGGAPGLDVSFINEDYLRDPP 707

Query: 1879 SCGLVKWRKFFKEIGITDFVQVVQVDKDVA--DISHTGFKNMWTKELL 1924
            SC   + RK+ + +      QV  V   ++  D+ H+ F+    +E L
Sbjct: 708  SCPPKEQRKWIEWLQ-----QVFHVRGTLSLIDLRHSPFRARLGQECL 750


>gi|413934111|gb|AFW68662.1| hypothetical protein ZEAMMB73_452932, partial [Zea mays]
          Length = 1387

 Score =  153 bits (387), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 150/489 (30%), Positives = 231/489 (47%), Gaps = 87/489 (17%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ F +G      E N L +    + +A+  LS+ELYS+D HFL+EL+QNA+DN 
Sbjct: 12   VERIRRERFYIGRE----ERNPLAED---IHQAVSYLSEELYSKDVHFLMELIQNAEDNE 64

Query: 1222 YPENVEPTLTFILQE---------SGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP +V P L F++ E         S +VV NNE+GFSA NI ++C +G STKKG+   GY
Sbjct: 65   YPLDVSPALEFVITEKDITGTGADSTLVVFNNERGFSAANIESICRIGRSTKKGNRHLGY 124

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLV--PPFNIDMFCRL 1326
            IG K         V+  P I SNG+ +KF    ++   IG+++P  V   P NID   R 
Sbjct: 125  IGEK---------VSSQPHIFSNGYQIKFSEKPSAVCDIGYIVPEWVDGKP-NIDDI-RT 173

Query: 1327 LSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
            +      L +    T I LP +T       +  +    S  HP +LLFL +++ +  + M
Sbjct: 174  VYGPSRSLPT----TVIILPLKTD-----KILAVKKELSSTHPEILLFLSKIRQLSVKEM 224

Query: 1387 LNDSLVVI--------------RKKIVGD------GIIKVSCGE-DKMTWFVASQKLRAG 1425
             ND                    K I  +       + + S GE ++ T+++  QK    
Sbjct: 225  NNDPKASKISQISISSEVDYWSSKDIDAESYTLHLAMQENSKGETEECTYYMWRQKF--- 281

Query: 1426 VIRPDVKTTE--------IALALTL-QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQ 1475
             ++PD +  +        + LA    Q  + G   P      V+AFLP      L FI+Q
Sbjct: 282  AVQPDCRIQKRMEVDQWVVTLAFPHGQRLSRGAKSP-----GVYAFLPTEMVTNLPFIIQ 336

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL 1535
             DF+L SSRE +  +S WN+ +L   P+ F++A  +   L   R+ P  A      F+P+
Sbjct: 337  ADFLLASSRESILFDSQWNRGILECVPSAFINAFGAL--LKSSRDAPHFALPPIFRFLPI 394

Query: 1536 VGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLRGWNDRAHSLLPDILLQ 1593
                   F  +   I +K++  + +  E    Q  +  P +V R   D A   + ++  +
Sbjct: 395  QESSISLFDSIRLSIKTKVQAEDIMPCEPCTAQKVFCKPAEVSR--LDSAFCRILNLAQK 452

Query: 1594 KHLGLGFLS 1602
            + +GL  LS
Sbjct: 453  QGIGLQNLS 461


>gi|302775138|ref|XP_002970986.1| hypothetical protein SELMODRAFT_411714 [Selaginella moellendorffii]
 gi|300160968|gb|EFJ27584.1| hypothetical protein SELMODRAFT_411714 [Selaginella moellendorffii]
          Length = 1435

 Score =  152 bits (385), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 139/444 (31%), Positives = 221/444 (49%), Gaps = 54/444 (12%)

Query: 1154 ECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLEL 1213
            E     + +E +RR+ FG+G +  N  + ML++        +  LS ELY +D HFL EL
Sbjct: 163  EMSSQRMFIERLRRELFGIGQSGPNPLALMLQE-------TIRLLSAELYQKDIHFLSEL 215

Query: 1214 VQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTK 1264
            +QNA+DN Y  +VEP++   L ++ +         ++LNNE GFS +NI +LC  G STK
Sbjct: 216  LQNAEDNTYHSSVEPSIEIYLTKTDVTGTGAAATLLLLNNEVGFSEKNIESLCAAGQSTK 275

Query: 1265 KGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVP-PFN 1319
            KG  S GYIG KGIGFKSVF VT  P I +NGF ++F+    +E ++G+++P  V  P +
Sbjct: 276  KGKKSQGYIGEKGIGFKSVFLVTKQPYIINNGFRIRFNEEPHNEAKLGYIVPEWVDCPTD 335

Query: 1320 IDMFCRLLSKDPVQLESKCWNTCIRLPFRT-KFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
                  +L K+ +   S+      ++P  T  F   I M +I D  S   P+ L+    +
Sbjct: 336  A-----VLQKEKI---SEVAKQLTQIPAETLLFLSKIRMLSIYDQQSR-PPTELVLSRSV 386

Query: 1379 QCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLR----AGV-IRPDVKT 1433
            +    +N   ++         G        G     +++  Q L+    A V  R DV +
Sbjct: 387  EMTTSKNGSVETSTTFLTASNGRN------GTQHSNYYMFKQTLKVPEEAKVEKRKDVDS 440

Query: 1434 TEIALALTLQE--SNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPSSREEVDGN 1490
              I LA  + +  S+  + G       +F+FLP   + GL F++  DF+L SSR  +  +
Sbjct: 441  WMITLAFPVDDRISSSCSIG------DIFSFLPSDIHSGLPFLINSDFLLVSSRATLLFD 494

Query: 1491 SPWNQWLLSEFPALFVSAERSFCDLPCFR-ENPAKAASVYM-SFVPLVGEVHGFFSGLPR 1548
            S WNQ +    P +F +A     +   F  ++    A + M  +VP+    + +   +  
Sbjct: 495  SSWNQGIFRCVPEVFYNAFELLLNTTAFGLQSILSVARMAMYKYVPVNVCRNSYLERVRN 554

Query: 1549 MILSKLRMSNCLILEGNNNQWAPP 1572
             ILSKL+ +N +IL  ++   APP
Sbjct: 555  DILSKLKDNN-VILCASDVGRAPP 577


>gi|242071023|ref|XP_002450788.1| hypothetical protein SORBIDRAFT_05g018310 [Sorghum bicolor]
 gi|241936631|gb|EES09776.1| hypothetical protein SORBIDRAFT_05g018310 [Sorghum bicolor]
          Length = 1755

 Score =  152 bits (384), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 134/437 (30%), Positives = 202/437 (46%), Gaps = 92/437 (21%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN----- 1216
            VE IR + + +G      E N L +    + +A++ LSQELYS+D HFL+EL+QN     
Sbjct: 17   VERIRWERYFIGRE----ERNPLAED---MHQAVNYLSQELYSKDVHFLMELIQNDVLNY 69

Query: 1217 ------------ADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRA 1255
                        A+DN YP  V P L F++    I         +V NNE GF+  NI +
Sbjct: 70   ANLNLFITIDMNAEDNDYPSGVLPALEFVITSKDITRTGATATLLVFNNENGFTPANIES 129

Query: 1256 LCDVGNSTKKGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVL 1311
            +C +G STKK +  +GYIG KGIGFKSVF V+  P I SNG+ +KF    +++  IG+++
Sbjct: 130  ICRIGRSTKKSNRGSGYIGEKGIGFKSVFLVSKNPHIFSNGYQIKFSEDPSADCGIGYIV 189

Query: 1312 PTLVP--PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
            P  V   P   D+       +    +     T I LP ++       ++ +    S  HP
Sbjct: 190  PEWVEDCPSIADIM------NIYGCQKSLPTTTIILPLKSD-----KIDAVKKELSGTHP 238

Query: 1370 SLLLFLHRLQCIMFRNMLNDSL---------------VVIRKKIVGDGI------IKVSC 1408
             +LLFL +++ I  R  +ND L                + RK I  +        ++   
Sbjct: 239  EVLLFLSKIRQISVRE-INDDLNATSLSQISISSEADALTRKDISAESYTLHLSALENKI 297

Query: 1409 GEDKMTWFVASQKLRAGVIRPDVKTTE--------IALALTL-QESNEGNYGPLLYQQPV 1459
            GE + ++++  Q      ++P+ +  +        I LA    Q  ++G   P      V
Sbjct: 298  GERQCSYYIWKQHFP---VKPECRVQKREGIDQWVIMLAFPHGQRLSQGMGSP-----GV 349

Query: 1460 FAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCF 1518
            +AFLP        FI+Q DF+L SSRE +  +S WNQ +L    + FV+A      L   
Sbjct: 350  YAFLPTEMATNFPFIIQADFLLSSSRESILLDSQWNQGILECVSSAFVNA--FLVLLKSI 407

Query: 1519 RENPAKAASVYMSFVPL 1535
               PA A      F+PL
Sbjct: 408  ESAPAFALPPIFKFLPL 424


>gi|429328793|gb|AFZ80553.1| hypothetical protein BEWA_034100 [Babesia equi]
          Length = 2424

 Score =  152 bits (384), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 145/554 (26%), Positives = 243/554 (43%), Gaps = 102/554 (18%)

Query: 1161 IVESIRRDEFGLG---PNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNA 1217
            ++E IR  EFG+G    NI     ++L KQ  R+ RA+  LS++LY+  +   LEL+QNA
Sbjct: 680  LIEEIRVKEFGIGLVESNIDERLKSVLNKQRDRISRAVKRLSEDLYNSRTQLQLELIQNA 739

Query: 1218 DDNIYPEN--VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
            DDN Y  N   EP ++F L    +V+LNNE GF  ++IR++CD+G+S+K   + G  G+ 
Sbjct: 740  DDNCYSVNGATEPFISFFLDYESVVMLNNEDGFMEKDIRSICDIGSSSKVLKT-GKTGKF 798

Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
            GIGFKSVF +T  P I SNG+   F      +  I ++LP  V          L    P+
Sbjct: 799  GIGFKSVFILTSNPHIFSNGYSFMFSNDPSFKDNIEYILPHWV---------ELSDSSPI 849

Query: 1333 QLESKCWNTCIRLPFRTKFSEGIA-----------------------MNNIVDMFSDLHP 1369
                    +  RLPF T  ++ +A                             +F  L  
Sbjct: 850  DYVKYKLGSESRLPFTTYSNKILAKSRGKRLTLMYFPLKDEYITNEYYKKYYQIFQMLDL 909

Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGII-KVSCGEDKMTWFVASQKLRAGVIR 1428
              +LFL  +  + F N ++    ++ KK++G  ++ K  C  D   + +    L    I 
Sbjct: 910  ESILFLRNISTVHFYNNVDGVETLMNKKLMGTFLVDKSDCEIDIDNFKLVHLNLERKNIH 969

Query: 1429 P---------DVKTTEIALALTLQESN------EGNYGPLL-------------YQQPVF 1460
                      +V     +  LT +ES        GN   L+              +  ++
Sbjct: 970  SLDIVDSVHLEVLVVNYSFKLTAEESTMLSTSKAGNNSSLISIALPVSPEIPENTRYKLY 1029

Query: 1461 AFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRE 1520
              LP+  YGL F++  DFVL ++R+ +    PWN+ + S+   ++V        +   + 
Sbjct: 1030 NSLPIGEYGLNFMVNADFVLSANRQNLFTTDPWNEIIASKMCDVYVGI------IEIIKY 1083

Query: 1521 NPAKAASVYMSFV---PLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVL- 1576
            N +K + +Y  FV   P   +    F      I SKL ++   +L    +++  P K + 
Sbjct: 1084 NYSKMSILYNGFVYSIPRQDDGISLFKYSCNRIFSKL-LNIKWVLVDEVDEFVEPFKAIF 1142

Query: 1577 -------------RGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPK 1623
                           ++    +++P  +L+KH    +++  I+   +L   L   EYG K
Sbjct: 1143 VCSEFPDDFSNTEVMFSKLLRTVVPSNILRKHCDFCYINS-ILCRPNLKSKLM--EYGIK 1199

Query: 1624 IL-----LQIISSL 1632
            I+     ++I+SS+
Sbjct: 1200 IITFDLVVEIVSSI 1213



 Score = 42.4 bits (98), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 7/92 (7%)

Query: 842 AIEILLRAPFLSDLNSWSHWDFLFAPSLGPLPGWLLNEVNVK---ELLCLVTRDGKVIRI 898
           A E++  AP L +L  ++ W++++   LGPLP +LLNE  +K       L   D  V++I
Sbjct: 416 ATELIENAPLLLNLYDYTMWNYVYEDCLGPLPNFLLNE-KIKFFGSYSFLYIHDRSVVKI 474

Query: 899 ---DHSASVDSFLEAALEGSSFQTAVKLLSSF 927
              D   S    ++   +G     A+K+LS+F
Sbjct: 475 KSPDCYISDLVIVQEITDGFDNFDAIKVLSAF 506


>gi|302881512|ref|XP_003039668.1| hypothetical protein NECHADRAFT_88029 [Nectria haematococca mpVI
            77-13-4]
 gi|256720532|gb|EEU33955.1| hypothetical protein NECHADRAFT_88029 [Nectria haematococca mpVI
            77-13-4]
          Length = 1482

 Score =  151 bits (381), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 189/404 (46%), Gaps = 58/404 (14%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +  IRR E GLG     + +N      A L  AL  +S +LY   +HFLLEL+QNADDN 
Sbjct: 16   IRQIRR-EKGLGDGPDQIGNN-----GADLESALKIISHDLYQTSTHFLLELIQNADDNS 69

Query: 1222 YPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA--GYIGRKGIG 1278
            Y   VE PT++       + +  NE+GFS +NI A+C +G STK G S   G+IG KGIG
Sbjct: 70   Y--TVETPTMSISYSPGKLRIDCNERGFSKQNIEAICRIGKSTKGGESKSEGFIGEKGIG 127

Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
            FK+VF+V     I S  +  +FD  +G +G + P +  PF            P Q + +C
Sbjct: 128  FKAVFKVASTVWISSGYYSFRFD-RDGHLGMIAP-ITCPF------------PEQPKQEC 173

Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFS--------------DLHPSLLLFLHRLQCIMFR 1384
              T I L       EG+ +  +                   ++HP      +RL    F 
Sbjct: 174  --TSILLKLDKDCDEGLIVEEMKSCDEKILLFLRRLRKLEINIHPGA---ENRLFKRTFT 228

Query: 1385 NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQE 1444
                 +       ++ +G+ K         W   ++ L +   RP + T+EI LA     
Sbjct: 229  RGAQPAENPSMMTLMNNGVEK-----HYFVWRHTAKNLPSEARRPGITTSEIVLAFPNTG 283

Query: 1445 SN-EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
             N +    P++  Q V++FLP+R YG  F+LQ DF+LP++RE+V  +S WN  L +E   
Sbjct: 284  VNKDAPVLPVIEPQNVYSFLPIRNYGFSFLLQADFLLPANREDVHVDSNWNIHLRTE--- 340

Query: 1504 LFVSAERSFCDLPCFRENPAKA-ASVYMSFVPLVGEVHGFFSGL 1546
                A ++F +  C  +N   +    ++ + P+      FF  L
Sbjct: 341  ----ARKAFMEAMCHIDNLDNSLRHSWVRYAPITPSQSEFFEEL 380


>gi|389750864|gb|EIM91937.1| hypothetical protein STEHIDRAFT_136037 [Stereum hirsutum FP-91666
            SS1]
          Length = 1448

 Score =  150 bits (380), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 159/536 (29%), Positives = 253/536 (47%), Gaps = 88/536 (16%)

Query: 1161 IVESIRRDEFGL-GPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
            ++ SIR+D  GL  PN + +E          LGRAL  L+ +LY  D+HF+LE VQNA+D
Sbjct: 3    VIRSIRQD-LGLDAPNPTEVER--------MLGRALEHLADQLYRTDTHFVLEFVQNAED 53

Query: 1220 NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGF 1279
            N Y  +V PTL+  + +  + +  NE+GF   N+RA+C++G STKK    G+     IGF
Sbjct: 54   NSYHPDVTPTLSLRIDDYSMRISCNERGFDESNVRAICNIGTSTKK-HLQGF-----IGF 107

Query: 1280 KSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW 1339
            KSVF+V +   + S  F  KFD  E ++G + P         M+   L   P   +    
Sbjct: 108  KSVFKVANVVRVASGDFRFKFDRDE-RLGMIAP---------MWEDDLPAQPGWTDFHLE 157

Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND----SLVVIR 1395
             T +R   ++    G+A++  +D   ++  SLLLFL RL+ I     L+D     ++ + 
Sbjct: 158  LTSLR---QSSMPPGMALSFKLD---EVKASLLLFLQRLRFIELDFALSDPDRQRIITLF 211

Query: 1396 KKIVGDGIIKVSCGE------DKMTWFV-------ASQKLRAGVIRPDVKTTEIALALTL 1442
            +  +G GI+K++  E      D     V       A+Q+ +    RP +  +EI LA  +
Sbjct: 212  RYDIGPGIVKLTHCENDCPLRDDFYLMVNHFNDMSATQEPK----RPGLDRSEIVLAFPI 267

Query: 1443 QESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
             E    ++ P+   Q V  FLPLR +G  F++Q DF+  SSRE++  +S WN+ L     
Sbjct: 268  AE----DFQPVFSPQDVHVFLPLRPFGFNFVIQADFLTSSSREDILADSRWNKLLRDGIV 323

Query: 1503 ALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
              FV A   F      R+        +  ++P    +  FFS +   +L  L  SN  I+
Sbjct: 324  TAFVQAIERFHSQDELRDT-------WFGYLPH-AVLDPFFSEVEPHLLRVL--SNLPII 373

Query: 1563 EGNNNQWAPPCKVL----RGWNDRAHSLLP-DILLQKHLGLGFLSKDI-----VLSDSLA 1612
               N  +  P +VL    R  +     L+P D L + H    F+S D      VL  SL 
Sbjct: 374  RNANGVYRRPREVLVIPTRFRDKIGIPLIPGDHLPKPHT---FISPDYDLSVRVLRPSLK 430

Query: 1613 RALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELY------TISFHSS 1662
            R LG+++    + ++ + ++      +    + W  +  + LY       + FH S
Sbjct: 431  R-LGVQDMDDDVFVKSLENMS-AGGAISHQSIEWHETVCDLLYRFPKTRKMQFHPS 484


>gi|429847683|gb|ELA23260.1| heterokaryon incompatibility, partial [Colletotrichum gloeosporioides
            Nara gc5]
          Length = 1350

 Score =  149 bits (376), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 146/519 (28%), Positives = 228/519 (43%), Gaps = 81/519 (15%)

Query: 1142 GDCTTQTLP------EDKECEDAALIV-----ESIRRDEF---GLGPNISNMESNMLKKQ 1187
            G+ TT  LP      E++   DA L       E +++ E      G + +N   +    Q
Sbjct: 3    GEVTTAVLPGWKARPENRAQADAQLRKLLKPRERLKQRELLRQQTGTDANNQSDDFEIDQ 62

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQES-GIVVLN--- 1243
               L  +L  LS++LY++D+HF+LEL QNADDN Y  +  P+    L ++ G        
Sbjct: 63   ---LDGSLTVLSEQLYNKDTHFILELTQNADDNAYGASTIPSFGMRLDKNDGDCTFQTGC 119

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSS---AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
            NE GF+ + I ALC + +STKK       GYIG KGIGFK+VF+V D   I S  +  K 
Sbjct: 120  NEVGFTVDQIDALCRIASSTKKNEKDVRTGYIGEKGIGFKAVFKVADVVYIKSGFYSFKL 179

Query: 1301 DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR-TKFSEGIAM-N 1358
            D   G +G +LP                  P    +  ++   R   R T   +GI   +
Sbjct: 180  DKRNGLVGTLLPL-----------------PADFPASKYHEHDRSATRMTLLLKGIEQYD 222

Query: 1359 NIVDMFSDLHPSLLLFLHRLQCI-MFRNMLNDSLVVIR----KKIVGDGIIKVSC---GE 1410
             I    + + P +LLFL RL+ I M++       + +     +K+ G+    + C   GE
Sbjct: 223  KISKELTKIKPEILLFLRRLRSISMYKGQETTVYLAMHNEHEEKLQGE-TKTIDCQKQGE 281

Query: 1411 D----------KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVF 1460
                        M   + +++ RAG     +  +E+ LA  L +  +  + P    Q VF
Sbjct: 282  SFRNKKFLITRTMVNDMPNEEKRAG-----ISASEVTLAFPLNKYGQPEFHP----QHVF 332

Query: 1461 AFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLP-CFR 1519
            +FL +  +G  F++QGDF+L +SRE +   + WN  L    PA F+ A   F  +  C R
Sbjct: 333  SFLAVACHGFPFLIQGDFILLASREGLSAGNAWNNKLRDAIPAAFLQAVDHFNKIEGCLR 392

Query: 1520 ENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILE-GNNNQWAPPCKVLRG 1578
             +  +     +S+ P       F+  L   +   LR    L    G+  + A    + + 
Sbjct: 393  YSWLRYLEYNVSYDP-------FWKRLDDQLTPHLRRHPILRNRLGHTRKPADLIFIPKE 445

Query: 1579 WNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGI 1617
            +      LL D   ++HL   +LS D     S  R LG+
Sbjct: 446  YRLEGEPLL-DFCSERHLAFEYLSHDATCLTSSLRRLGV 483


>gi|399218232|emb|CCF75119.1| unnamed protein product [Babesia microti strain RI]
          Length = 1605

 Score =  149 bits (376), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 121/401 (30%), Positives = 191/401 (47%), Gaps = 42/401 (10%)

Query: 1112 LFESVGADVHIEECGAICDTICGEASDD---GLGDCTTQTLPEDKECEDAALIVESIRRD 1168
            L +SV A     EC    + I  + + D    +  CT   +  D   +DA   +E IRRD
Sbjct: 444  LEKSVSASTLSIECIKPNEPITPKHNADLNTSVNSCTGDNIIID--AKDARSHIEYIRRD 501

Query: 1169 EFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEP 1228
            ++G+G +    +++   KQ+ RL R L  LS  LYS  +H L EL+QNADDN Y  ++ P
Sbjct: 502  KYGIGAD--QCKNDYTIKQNDRLSRILDKLSTNLYSSTNHILYELIQNADDNCYSNDIIP 559

Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
             L  +  E  + + NNE GFS  ++ A+CD+ NS+K   S    G+ GIGF+SVF +T+ 
Sbjct: 560  ELVIVQTEGSVTLFNNEIGFSKTDVEAICDLANSSKIYRSLC-TGKFGIGFRSVFTITNE 618

Query: 1289 PEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR 1348
            P I SNG+  KF TS G IG ++P  +   N+D+   +      +++++C+NT I LP  
Sbjct: 619  PHIFSNGYQFKF-TSNG-IGKLIPQWID--NLDLLNGI--SKSTKIDTRCYNTIIYLPLI 672

Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVV-IRKKIVGDGIIKVS 1407
            T     I  N            L+L L +L+ + F  +  D +   I  K   + I    
Sbjct: 673  TPVQFDITYN------------LILHLRKLRRLYFIQITADGIRSEINLKATNESITDTP 720

Query: 1408 CGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT 1467
                K+ +     K    +       T   + LT    N G        Q V++ LP+  
Sbjct: 721  LFIKKIFYNDKDFKYIQYI------DTSSNIQLTFPIDNIG--------QFVYSTLPVGN 766

Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            +GL F +  +F L   R  +  +   N+ ++   P +F++A
Sbjct: 767  FGLNFAINSNFNLTLDRSNL-LDDQHNRTIVERIPIIFLNA 806


>gi|70993156|ref|XP_751426.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
 gi|66849060|gb|EAL89388.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
          Length = 1516

 Score =  149 bits (375), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 131/429 (30%), Positives = 196/429 (45%), Gaps = 62/429 (14%)

Query: 1156 EDAALIVESIRRDEFGLGPN----ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
            E A  IV+ I+    GL P     +S       K      GR++  ++++LY  D+ FL 
Sbjct: 7    EQAQAIVDDIQATNGGLPPEEKQALSQRSLRAYKNLQTIAGRSIVHVAEDLYDTDTRFLF 66

Query: 1212 ELVQNADDNIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            EL+QNA+DN Y   V    EP L   L E  + V +NE GF+  ++RA+C +  S+KK  
Sbjct: 67   ELIQNAEDNCYGHAVSQAEEPFLDLTLHEDHLTVDSNEDGFTENDVRAICSIHQSSKK-Q 125

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
            + GYIG KGIGFKSVF+V     I S  F   F+  +G  G     ++ PFN        
Sbjct: 126  TGGYIGHKGIGFKSVFKVAYKVSIQSGPFSFYFEHHQGDSGL---GMITPFN-------- 174

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP---SLLLFLHRLQ----- 1379
             ++P +L     NT I L F       I++++     S+L     ++LLFL +LQ     
Sbjct: 175  -EEPQELPPSV-NTRITLFF-------ISISDFEARASELREIPDTMLLFLRKLQRLTVN 225

Query: 1380 ------CIMFRNMLNDS--LVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
                   I F    + S  L+ + K+  G+ + K    E K T     Q L     RP  
Sbjct: 226  IPSLQSQISFTRFEDKSKHLITLTKETNGEQVTKFYHLE-KTTLSNLPQHLS----RPGQ 280

Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
               ++ LA  +QE    +Y PL+  Q V++FLP+R  G  F++Q DF+  +SR+ V    
Sbjct: 281  PEVDLILAFPVQE----DYSPLIQAQYVYSFLPMRDEGFNFLIQSDFITQASRQGVH-RC 335

Query: 1492 PWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMIL 1551
              N  +      LF+ A   FC     R         ++ ++P       F++ L  MI 
Sbjct: 336  DRNYAIREGISHLFLQAVAYFCKHTSLRYE-------WLQYLPGTHIRDPFWADLREMIF 388

Query: 1552 SKLRMSNCL 1560
              L+ S  L
Sbjct: 389  DCLKGSKIL 397


>gi|159125661|gb|EDP50778.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
          Length = 989

 Score =  149 bits (375), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 131/429 (30%), Positives = 196/429 (45%), Gaps = 62/429 (14%)

Query: 1156 EDAALIVESIRRDEFGLGPN----ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLL 1211
            E A  IV+ I+    GL P     +S       K      GR++  ++++LY  D+ FL 
Sbjct: 7    EQAQAIVDDIQATNGGLPPEEKQALSQRSLRAYKNLQTIAGRSIVHVAEDLYDTDTRFLF 66

Query: 1212 ELVQNADDNIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            EL+QNA+DN Y   V    EP L   L E  + V +NE GF+  ++RA+C +  S+KK  
Sbjct: 67   ELIQNAEDNCYGHAVSQAEEPFLDLTLHEDHLTVDSNEDGFTENDVRAICSIHQSSKK-Q 125

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
            + GYIG KGIGFKSVF+V     I S  F   F+  +G  G     ++ PFN        
Sbjct: 126  TGGYIGHKGIGFKSVFKVAYKVSIQSGPFSFYFEHHQGDSGL---GMITPFN-------- 174

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP---SLLLFLHRLQ----- 1379
             ++P +L     NT I L F       I++++     S+L     ++LLFL +LQ     
Sbjct: 175  -EEPQELPPSV-NTRITLFF-------ISISDFEARASELREIPDTMLLFLRKLQRLTVN 225

Query: 1380 ------CIMFRNMLNDS--LVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
                   I F    + S  L+ + K+  G+ + K    E K T     Q L     RP  
Sbjct: 226  IPSLQSQISFTRFEDKSKHLITLTKETNGEQVTKFYHLE-KTTLSNLPQHLS----RPGQ 280

Query: 1432 KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNS 1491
               ++ LA  +QE    +Y PL+  Q V++FLP+R  G  F++Q DF+  +SR+ V    
Sbjct: 281  PEVDLILAFPVQE----DYSPLIQAQYVYSFLPMRDEGFNFLIQSDFITQASRQGVH-RC 335

Query: 1492 PWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMIL 1551
              N  +      LF+ A   FC     R         ++ ++P       F++ L  MI 
Sbjct: 336  DRNYAIREGISHLFLQAVAYFCKHTSLRYE-------WLQYLPGTHIRDPFWADLREMIF 388

Query: 1552 SKLRMSNCL 1560
              L+ S  L
Sbjct: 389  DCLKGSKIL 397


>gi|346976436|gb|EGY19888.1| hypothetical protein VDAG_01904 [Verticillium dahliae VdLs.17]
          Length = 1402

 Score =  148 bits (374), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 158/559 (28%), Positives = 236/559 (42%), Gaps = 104/559 (18%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG----IVVLNNEQ 1246
            LG AL  L+ +L S+ +HF+LEL+QNADDN YP  V+PT + +L ++G    +    NE 
Sbjct: 42   LGSALDILASQLNSKTTHFILELIQNADDNAYPSGVQPTFSLVLDKNGSNFTLRTDCNET 101

Query: 1247 GFSAENIRALCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            GFS +NI A+C +G STK G      G+IG KGIGFKSVF+V    +I SN +  + D +
Sbjct: 102  GFSLKNIDAICSIGKSTKTGIINGQKGFIGEKGIGFKSVFKVAHQVDILSNFYEFRIDRT 161

Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDM 1363
               +G + P    P       R L +                                  
Sbjct: 162  -AHLGMLDPK---PHRFPDTQRRLQQ---------------------------------- 183

Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIR-----KKIVGDGIIKVSCGEDK------ 1412
               + P LLLFL +L  +  +  + D   V R     +    DG   VS  ED+      
Sbjct: 184  ---IKPELLLFLRQLSVLDIK--VEDGHCVYRSIRQNRDPRYDGET-VSLNEDRTGSPFD 237

Query: 1413 ---------MTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFL 1463
                     +      Q +     R  V +TEI LAL +        G  L     FAFL
Sbjct: 238  EPVKTTKLHLIVRYEVQDMPVDARRNSVSSTEITLALPV--------GSELETCDAFAFL 289

Query: 1464 PLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPA 1523
            P+   G  FI+Q DF+L ++R+ +    PWNQ +L   P  FV A R F        N A
Sbjct: 290  PIDQSGFNFIVQADFLLVANRQALQHALPWNQRILEAIPDAFVQAVRHF--------NHA 341

Query: 1524 KAASVY-MSFV-PLVGEVHGFFSGLPRMILS--KLRMSNCLILEGNNNQWAP----PCKV 1575
             A   Y + +  PL  E +G  S     +    K R+ +  +LE  + ++      PC  
Sbjct: 342  DATHGYSLKYKWPLFLEYNGPKSSTWTEVHQGIKRRLCDEDVLEARDGRFRAVRPVPCPA 401

Query: 1576 LRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKI-LLQIISSLCR 1634
                   AH  L  I  +     GFL  D  ++     A    EY P +  L+ +     
Sbjct: 402  -GTQQTAAHKPLIIIPTEYRFDNGFLI-DATINHGHHLAF---EYDPVLEELKALDVKEM 456

Query: 1635 TENGLRSMGLSWLASWLN---ELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGT 1691
            T  GLR+    WL    +      +  +H++  ++      +   D LQ +P + L+DG+
Sbjct: 457  TGAGLRNEFSDWLERAGHAGLAAKSDKWHAALAAAFMWKGRSKQKDKLQELPLVHLADGS 516

Query: 1692 FSSVDEGTIWLHSDCSVFD 1710
            +++   G I+  S  +V +
Sbjct: 517  WAACAGGDIFFPSTGAVLN 535


>gi|406866769|gb|EKD19808.1| Heterokaryon incompatibility protein, putative [Marssonina brunnea f.
            sp. 'multigermtubi' MB_m1]
          Length = 1617

 Score =  147 bits (371), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 120/397 (30%), Positives = 180/397 (45%), Gaps = 82/397 (20%)

Query: 1157 DAALIVESIRRDEFGL------GPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFL 1210
            DA   +  IR D+ GL      GPNIS++E+            AL  LS++LY   +HFL
Sbjct: 11   DAKAHISQIRHDK-GLVDGKPSGPNISDLEN------------ALTTLSEQLYQSSTHFL 57

Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA- 1269
            LE++QNADDN Y   V P+L F  ++ G+ V  NE GFS  N+ A+C VG STKKG  A 
Sbjct: 58   LEIIQNADDNQYSAAV-PSLEFTYRKGGLRVDCNEIGFSPRNVEAICRVGQSTKKGMKAE 116

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE--GQIGFV---LPTLVPPFNIDMFC 1324
            GY+G KGIGFKSVF+  D   + S  +  KFD  +  G I  +    P  V P    ++ 
Sbjct: 117  GYVGEKGIGFKSVFKAADVVWVQSGHYSFKFDKRKPLGMISPIWDDFPMPVRPGYTSLYL 176

Query: 1325 RL--------LSKDPVQLES-KCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFL 1375
            +L        L ++   L+S              + S+G    N  + FS       L  
Sbjct: 177  KLSAGYNTSGLLEELKALDSRLLIFLRRLRSVSVEISQGGVFGNFKNSFSRQDRGKTL-- 234

Query: 1376 HRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTE 1435
                  + +N ++   ++ R +++G                +   + R G       ++E
Sbjct: 235  ----VRLTQNKVDCDYIIKRHRVLG----------------MPQDRRRDGTT-----SSE 269

Query: 1436 IALALTLQES-NEGNYGPLLYQQPVFAFLPLRTYGL-------------------KFILQ 1475
            I L   +  S +E +  P    Q V+AFLP+R YG                    +F++Q
Sbjct: 270  IVLGFPIMMSEDEKSAVPKRESQQVYAFLPIRDYGFEVNTSLTCALKLAHSANVQQFLIQ 329

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
             DF+L +SRE++D ++ WN  +  E    F+   R F
Sbjct: 330  ADFLLIASREDIDSSAEWNHRIQRELTIAFIDTMREF 366


>gi|402076366|gb|EJT71789.1| hypothetical protein GGTG_11043 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 2200

 Score =  147 bits (370), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 149/547 (27%), Positives = 230/547 (42%), Gaps = 96/547 (17%)

Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNEQGFSAEN 1252
            AL  +S ELY   + FLLE+ QNADDN Y  +V P L    L +  ++   NE GF   +
Sbjct: 61   ALDVISNELYPNSTDFLLEIFQNADDNAYSSSVTPKLWITYLDDDTLMFDTNEIGFRRAD 120

Query: 1253 IRALCDVGNSTKKG-----SSAG--YIGRKGIGFKSVFRVTDAPEIHSNGFHVKF----- 1300
            +  +C VG S+K+      + AG   IG KGIGFK+VF+V D   + S  +  KF     
Sbjct: 121  VEGICGVGRSSKQENLAEVTRAGERRIGEKGIGFKAVFKVADEVYVKSGFYSFKFTDKAV 180

Query: 1301 -DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN 1359
                E  +G V P               +  P Q   +   T I L  R          +
Sbjct: 181  PGAPEASLGRVAPEW-------------ASFPAQARRRG-GTSILLKLRRYIDRQKLAGD 226

Query: 1360 IVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSC---GEDKMTWF 1416
            I+     L   LL+FL+R++ +     L +  VV+R++   D    + C     D  + +
Sbjct: 227  IMK----LDAKLLMFLNRVKEVEI-EALREPRVVLRREDRTDSSTGIPCRVLKPDVSSPY 281

Query: 1417 VASQKLRAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKF 1472
            +      +G+     R  +  +E+ LA+    + +G    L     V++FLP+R YG KF
Sbjct: 282  ILFHHPVSGLPAEKKRKGLAQSEMVLAIPYDGTRDG--AALTQTHNVYSFLPIRDYGFKF 339

Query: 1473 ILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSF 1532
            +LQ DFVL ++R+E+D  S WN+ L ++ P   +        L      P        S 
Sbjct: 340  VLQADFVLIANRKEIDSGSEWNRALRAQLPTALLGFVAQLNRLGLQYRWP--------SL 391

Query: 1533 VPLVGEVHGFFSGLPRMILSKLRMSNCL-----ILEGNNNQWAPPCKVLRGWNDRAHSLL 1587
             P+  E HGFF G+   +L +   S  +     +LE  +  W  P           H L 
Sbjct: 392  FPIHDEGHGFFQGINAKVLEEFSRSAIIESIGGMLESASQLWLVP-----------HEL- 439

Query: 1588 PDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWL 1647
                            DI   D +A A  I     +      S    T +GL ++G+  L
Sbjct: 440  ---------------ADIPTGDGVAPAPMIPPDFSQFRYVSSSYSHATWDGLSNLGVRTL 484

Query: 1648 --ASWLNEL-YTISFHS-----------SGQSSLQSGVETDLIDNLQRIPFIPLSDGTFS 1693
              A++L  L + IS H            S  S++   +       + R+ F+PL DGTF 
Sbjct: 485  SGANFLEALGHFISHHPRQFYAMPNRWHSRLSTILDQLTAKHEGTITRLRFVPLRDGTFV 544

Query: 1694 SVDEGTI 1700
            ++D+G +
Sbjct: 545  ALDDGPV 551


>gi|342872578|gb|EGU74933.1| hypothetical protein FOXB_14550 [Fusarium oxysporum Fo5176]
          Length = 1502

 Score =  146 bits (368), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 152/549 (27%), Positives = 236/549 (42%), Gaps = 78/549 (14%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG----IVVLNNEQGFSAENI 1253
            +S +LY   +HF+LEL+QNADDN Y   + P+L   L  +G         NE GF+ + +
Sbjct: 70   VSDQLYKASTHFILELIQNADDNQYDSEIVPSLRLALYNNGGHRYFRSDCNEVGFTFKQL 129

Query: 1254 RALCDVGNSTKKGS---SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
             AL  VG STK  +   S   IG KGIGFKSVF+V D   + S  +  KFD +  +IG +
Sbjct: 130  DALTRVGQSTKAATIDNSKSCIGEKGIGFKSVFKVADVVHVASGFYEFKFDRN-ARIGMI 188

Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPS 1370
            LP         +  R  + D V   ++         F  +       + I +  + + P 
Sbjct: 189  LP---------ITSRFPTADRVADHTQ---------FLLELKSQRDYDVIKEELNSIEPD 230

Query: 1371 LLLFLHRL-QCIMFRNMLNDSLVVIRKKIV------GDGIIKVSCGEDKMT---WFVAS- 1419
            LLLFL  L Q  +    LN      R+KI           +K+S   D +T   + V   
Sbjct: 231  LLLFLRNLDQVHISTRGLNKR---CRRKITRFDPRYKGETVKISVHGDAVTSKEYIVHRR 287

Query: 1420 --QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGD 1477
              +KL     R  V ++E+ +A T+    +    P+   Q VFAFLP+  +G +F++  D
Sbjct: 288  IVEKLPPEPQREGVVSSEVVIAFTV----DSEATPVFTTQNVFAFLPVDDFGFRFLIHAD 343

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVG 1537
            F+L +SRE +D +S WN  L  E    FV A      L   R+        +  ++P   
Sbjct: 344  FILVASREGLDESSLWNWSLRDEIQTAFVDAIHRLVALSPIRDGEGLCYK-WPKYLPRYS 402

Query: 1538 EVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPP-CKVLRGWNDRAHSLLPDI--LLQK 1594
            E  GF+ GL + +++ LR +  L          P     + G     +  L D+  LL  
Sbjct: 403  ETSGFWYGLHQNMMNALRDTPLLKSRAGGALRKPTDLYYVPGDYRFENGTLFDLPSLLHS 462

Query: 1595 HLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNE- 1653
            HL   + S    LS      +G++          I+ L R             + W+NE 
Sbjct: 463  HLSFEYDSVKSELS-----LIGVDTLH-------INDLWRE-----------FSHWINEV 499

Query: 1654 ----LYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVF 1709
                L T S     + SL    +  L + L+ +P +PL DG+++   +  ++  S  +  
Sbjct: 500  GVDGLKTQSIEWHQKVSLIFCDQKTLREKLRNLPIVPLRDGSWAKARQDRVFFTSTHTEE 559

Query: 1710 DGGFGLEAF 1718
                G+E F
Sbjct: 560  HVPTGIELF 568


>gi|408393884|gb|EKJ73142.1| hypothetical protein FPSE_06755 [Fusarium pseudograminearum CS3096]
          Length = 2336

 Score =  146 bits (368), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 159/529 (30%), Positives = 225/529 (42%), Gaps = 73/529 (13%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            L  ALH LS ELY Q +HF+ EL+QNADDN + E   PTL F      + V  NE+GF  
Sbjct: 38   LTNALHILSTELYDQSTHFIQELLQNADDNHF-ETSTPTLIFSYSPGRLRVDCNEKGFLP 96

Query: 1251 ENIRALCDVGNSTKKGSS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
             +I ALC +  STK     A Y G KGIGFKSVFR+ D   I SN +  KFD  + Q G 
Sbjct: 97   HHIDALCTIRGSTKTDKDQASYTGEKGIGFKSVFRMADVVWISSNSYQFKFDKHK-QFGT 155

Query: 1310 VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
            V P  V  F            PV   S+         F  +      M  +V+   D   
Sbjct: 156  VAPEWVEDF------------PVHSSSEY------TTFYLELCPQGDMEELVEKLVDFDA 197

Query: 1370 SLLLFLHRLQC--IMFRNMLNDSLVVIRKKIVGDG----IIKVSCGEDKMTWFVAS---Q 1420
              LLFL RL+   I       D   +IR+    DG    I  +  G  K  + V      
Sbjct: 198  DQLLFLRRLRQVKINIEREYQDLSHMIRRIDETDGCGNLITTLDLGTSKKRFMVRKFQVT 257

Query: 1421 KLRAGVIRPDVKTTEIALALTLQES-NEGNYGPLLYQQPVFAFLPLRT-YGLKFILQGDF 1478
             L     R +   +E+ LA  L E   +G     +++  +FA LP+    GLKF+LQGDF
Sbjct: 258  NLPEEPRRRNQTCSEMVLAFPLMEELPDGGTAMPVFEAKLFAGLPVGADGGLKFLLQGDF 317

Query: 1479 VLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE 1538
            +L +SR  +D +  WNQ L       F+ +  +F        N      ++  F+    +
Sbjct: 318  ILTASRLHLDTSLRWNQGLRDGLVNAFLESVLAF--------NAGIHRYIWPFFLSYDKK 369

Query: 1539 VHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGL 1598
            V  FF    + I   LR+ +CL  E        P            S L  + L+K    
Sbjct: 370  VSSFFQSATKSICEILRLMDCL--ESMEGIMVKP------------SALTYVDLEK---F 412

Query: 1599 GFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMG----LSWLASWLNEL 1654
            G  SKD +   +    L +    P   +  I SL     G+R +     L    + L+E 
Sbjct: 413  GDSSKDFLTLSAKTSPLYLSNKYPAWTIPGILSL-----GVRRLSATEFLEHFRTLLSEE 467

Query: 1655 YTI------SFHSSGQSSLQSGVETDLI-DNLQRIPFIPLSDGTFSSVD 1696
             ++      S+H      L S +  + I  +L+ +P IPL++G + S +
Sbjct: 468  DSVFIKKSPSWHGDVAKVLLSLISDERIRKDLRNLPIIPLTEGRWVSAN 516


>gi|212534372|ref|XP_002147342.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
 gi|210069741|gb|EEA23831.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
          Length = 1645

 Score =  145 bits (366), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 142/570 (24%), Positives = 247/570 (43%), Gaps = 64/570 (11%)

Query: 1156 EDAALIVESIRRDEFGL-GPNISNMESNMLK---KQHARLGRALHCLSQELYSQDSHFLL 1211
            E A  I++ IR +  G+   +  N   N+L+       +LG +   L+++LY+ +S F+ 
Sbjct: 7    ERARYIIDEIRVNNGGITAEDRRNTSENVLRALENLRRQLGSSTRALARDLYTSESRFVY 66

Query: 1212 ELVQNADDNIY--PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            EL+QNA+DN Y   +N +  + F L    +VV NNE GF+  ++ ++C VG+ST K +  
Sbjct: 67   ELIQNAEDNSYSPAQNGDCYIKFTLTPENLVVENNELGFNEADVLSICKVGDST-KSNRI 125

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSK 1329
            G+IG KGIGFKSVF+V     I S  F   F+      G     ++ P N          
Sbjct: 126  GFIGEKGIGFKSVFQVAQKVRIQSGPFDFYFEHVPSDTGM---GMITPIN---------- 172

Query: 1330 DPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF---LHRLQCIMFRNM 1386
               + E++ +    R+   T            D+   +  +L+LF   L R++ ++ R  
Sbjct: 173  --GEEETEIYKEGTRMTL-TLLQPHDFSKRSTDLLG-IPDALILFLPKLQRIEVLIQRQN 228

Query: 1387 LNDSLVVIRKKIVGDG-----IIKVSCGEDKMTWFVASQKLR---AGVIRPDVKTTEIAL 1438
             + S  V  K+  G       + +V     +  + V  +  R   +   R +    E+ L
Sbjct: 229  GDLSTTVHEKQDDGQNSTVRLVKRVDDTVSEKFFHVERRMFRDLPSNDARKNRDRAEVVL 288

Query: 1439 ALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
            A  +      N  P++  Q  F+FLP+R  G KF++Q DF+  +SR++V     WN  LL
Sbjct: 289  AFPIA----SNSNPIVEPQLTFSFLPIRNLGFKFLIQSDFITAASRQDV-RQCDWNDALL 343

Query: 1499 SEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
                 +FV A   FC  P  +         ++SF+P     H F++GL  +I+++L    
Sbjct: 344  QHVSDVFVDAVNKFCSRPVLQYG-------WLSFLPGTSVPHEFWNGLHGIIINQLAQHP 396

Query: 1559 CLILEGNNNQWAPPCKVL--RGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALG 1616
             L+      Q     + L  R  +     L  D+  + +L   +  +         + LG
Sbjct: 397  ILLTRRGVRQLPSRLQRLSERHCDKDGEPLFEDLQQEVYLSSDYRPE----HHEYLQQLG 452

Query: 1617 IEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGV-ETDL 1675
            +       ++  I+   + E             +L+      +H+     L  G+ + D+
Sbjct: 453  VTPLSWNNIIARITPYLQDET----------PRFLHPDRDDDWHTRVADLLIRGLRDRDI 502

Query: 1676 IDNLQRIPFIPLSDGTFSSVDEGTIWLHSD 1705
               ++ +P IPL DG+       TI+  +D
Sbjct: 503  ESKIKELPLIPLRDGSIHHNRSKTIYFPTD 532


>gi|119483844|ref|XP_001261825.1| hypothetical protein NFIA_095480 [Neosartorya fischeri NRRL 181]
 gi|119409981|gb|EAW19928.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 1634

 Score =  145 bits (365), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 180/722 (24%), Positives = 302/722 (41%), Gaps = 131/722 (18%)

Query: 1141 LGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISN-MESNMLKKQHARLGRALHCLS 1199
            +G  T + L +    +   L  E + R    + P++   +E  MLKK  A +G ++  L+
Sbjct: 8    MGSSTARALVKQISKQHGYLGEEVLSR----MDPDVRREVEEAMLKKD-AMIGSSVITLA 62

Query: 1200 QELYSQDSHFLLELVQNADDNIY----PENVEPTLTFILQESGIVVLNNEQGFSAENIRA 1255
            + LY+  + F+ EL+QNADDN Y      + +P ++F +    I+V  NE GF+ EN+ A
Sbjct: 63   KNLYNSTARFVFELLQNADDNSYNVAKSTSTDPFVSFCVYNRRIIVECNEDGFTHENLVA 122

Query: 1256 LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
            +C+VG S+K G+  GYIG KGIGFKSVF V     I S      F  + G  G  + + V
Sbjct: 123  ICNVGKSSKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSRDLSFSFRHNVGDSGMGMISPV 181

Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMN--NIVDMFSDLHPSLLL 1373
               + ++    L++  + L                 +E +A      +  F +L  + LL
Sbjct: 182  WEESEEVLPHPLTRITLFLHE------------NGSAEDLARQRETTLQQFRELKAAFLL 229

Query: 1374 FLHRLQCIMFRNMLN----------------DSLVVIRKKIVGDGIIKVSCGEDKMTWFV 1417
            F+  L+ I  R   N                +  V ++++IV DG ++        T   
Sbjct: 230  FMKNLRRIEVRICDNLDTEISQTTFSMRCEDEGRVALKQEIVEDGELQECTQYYHTTKLT 289

Query: 1418 ASQKLRA--------GVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG 1469
            AS   R+         ++      T I LA  L   +     P++ +  VFAFLP+R  G
Sbjct: 290  ASNIPRSENREYTDVELLTKAYSETSIILAFPLTHDSV----PIIEKHDVFAFLPIRNMG 345

Query: 1470 LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVY 1529
            L F++Q DFV  +SR+++  +S  N  LL      F+ A + FC            A  +
Sbjct: 346  LPFLIQADFVTDASRQDIVRSSARNIKLLPAIAQAFIQAVKQFC-------THTSLAYQW 398

Query: 1530 MSFVPLV--GEVHGFFSGLPRMILSKLRMSNCLILEGNNN-------QWAPPCKVLRGWN 1580
            M ++P +  G   GF+  LP  I  +L++++ +    + N       +  P        +
Sbjct: 399  MRYLPNIEWGSQDGFWKALPGEIRRQLQLTSTMWTRSHRNMRCIGDMRLVPSSMC----D 454

Query: 1581 DRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLR 1640
                 LLPD+  +++L   +  KD+           +++YG K  +     L R    + 
Sbjct: 455  KYGKPLLPDLDTEQYLSSHYALKDLT---------SLKDYGLK-RMNRFKFLDRLRQDIN 504

Query: 1641 SMGLSWLASWLNELYTISFHSSGQSSL--QSGVET-DLIDNLQRIPFIPLSDGTF---SS 1694
            + G S +    N      +HS+    L   SG E  +    +++I  IPL  G++   S+
Sbjct: 505  NGGSSIMR---NPDTDDDWHSAVAKYLISVSGPENPNTTAKIKKIQLIPLLGGSWKCPSA 561

Query: 1695 VDEGTIWL---HSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNL 1751
            +D   ++    H      DG F      +L A     +PA                    
Sbjct: 562  LDRYPVYFTQAHGYAVPTDGIF------DLIAPDAENNPA-------------------R 596

Query: 1752 NRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVE 1811
             ++  ++GVQ+ S  DI +      I D+ T        A  L     HL++     H++
Sbjct: 597  KQLFARLGVQEASVRDIRR-----KIIDQDTR------FAPGLATSRSHLQFLYLTAHLD 645

Query: 1812 RE 1813
            RE
Sbjct: 646  RE 647


>gi|242068525|ref|XP_002449539.1| hypothetical protein SORBIDRAFT_05g018510 [Sorghum bicolor]
 gi|241935382|gb|EES08527.1| hypothetical protein SORBIDRAFT_05g018510 [Sorghum bicolor]
          Length = 1620

 Score =  144 bits (363), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 125/433 (28%), Positives = 196/433 (45%), Gaps = 110/433 (25%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            VE IRR+ + +G      E N L +    + +A++ LSQELYS+D HFL+ELVQNA+DN 
Sbjct: 11   VERIRRERYYIGRG----EQNPLAED---MHQAVNYLSQELYSKDVHFLMELVQNAEDNE 63

Query: 1222 YPENVEPTLTFIL---------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-AGY 1271
            YP  V P+L F++           S +++ NNE+GFS+ NI ++C VG STKKG+   GY
Sbjct: 64   YPSEVAPSLEFLITSKDITGCGASSTLLIFNNERGFSSTNIESICRVGKSTKKGNRHQGY 123

Query: 1272 IGRK------------------------------------------GIGFKSVFRVTDAP 1289
            IG K                                          GIGFKSVF ++  P
Sbjct: 124  IGEKVLNMMLINYIMGKYSVESWKPPKMDMSNIWIYYDVLSFESYYGIGFKSVFLISSQP 183

Query: 1290 EIHSNGFHVKFD---TSEGQIGFVLPTLV---PPF-NIDMFCRLLSKDPVQLESKCWNTC 1342
             I SN + +KF+    ++  IG+++P  V   P F +I+         P         T 
Sbjct: 184  HIFSNSYQIKFNEKPCAQCNIGYIVPQWVESTPSFSDIEALYGCSKVLP--------TTT 235

Query: 1343 IRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR------NMLNDSLVVI-- 1394
            I LP +++      ++ +    S + P +LLFL +++ +  R      N+   S + I  
Sbjct: 236  IILPLKSE-----KIDAVKRQLSSMQPEMLLFLTKIRKLSVREDKSDPNITTVSEISISS 290

Query: 1395 ------RKKIVGDGII------KVSCGEDKMTWFVASQKLRAGVIRPD------VKTTEI 1436
                  RK +  +         +    E +  +++  QK     ++P+       +  E 
Sbjct: 291  EKNYQARKDMHAESYTLHLSSEETGKDEAECGYYMWRQKFP---VKPENRVDKRAEIDEW 347

Query: 1437 ALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQ 1495
             + L        + G  L    ++AFLP        FI+Q DF+L SSRE +  +SPWN+
Sbjct: 348  VITLAFPHGERLSRGKQL-SPGIYAFLPTEMVTSFPFIIQADFLLASSREAILFDSPWNK 406

Query: 1496 WLLSEFPALFVSA 1508
             +L   P+ F+ A
Sbjct: 407  GILECIPSTFMDA 419


>gi|302924848|ref|XP_003053981.1| hypothetical protein NECHADRAFT_32128 [Nectria haematococca mpVI
            77-13-4]
 gi|256734922|gb|EEU48268.1| hypothetical protein NECHADRAFT_32128 [Nectria haematococca mpVI
            77-13-4]
          Length = 1738

 Score =  144 bits (362), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 154/623 (24%), Positives = 273/623 (43%), Gaps = 76/623 (12%)

Query: 1171 GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV---- 1226
             + PN+       L ++   +G A+  L++ LY+ ++ F+ EL+QNADDN Y   +    
Sbjct: 39   AINPNLRREVEEALLRKDEMIGSAVLTLARNLYTSNARFVFELLQNADDNNYSTAISQGE 98

Query: 1227 EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVT 1286
            +P ++F +    + +  NE GF+ EN+RA+C +G S+K G+ AGYIG KGIGFKSVF   
Sbjct: 99   DPYVSFKIWPDKVSIECNENGFTHENLRAICAIGKSSKVGA-AGYIGEKGIGFKSVFMAA 157

Query: 1287 DAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
                I SN F   F   +G  G  + T V     ++    L++  + L +          
Sbjct: 158  WKVHIQSNDFSFSFTHRKGDSGLGMVTPVWEETDEVLGETLTRITLFLHTS--------- 208

Query: 1347 FRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG---- 1402
                  +   +  I   F DL  ++LLFL +L+ +       D  +         G    
Sbjct: 209  -DDPEEDARQLETIKLQFQDLQHTILLFLRKLRRVQVSFFDEDDSLTSATTYSLHGSNPA 267

Query: 1403 -IIKVSCGEDKMTWFVASQKLRAGVIRPDVKT-----------TEIALALTLQESNEGNY 1450
             + K S    +   +  ++ +   + + + +T            E+ LA  L ES     
Sbjct: 268  TVRKTSSEGVEERRYHVTKHMAENIPKSENRTYSEERDRADSSAEVVLAFPLTESAT--- 324

Query: 1451 GPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAER 1510
             P++  Q VFAFLP+R  G KF++  DFV  +SR+ +   S  N+ LL+     F+ A +
Sbjct: 325  -PVVESQDVFAFLPMRPMGFKFLIHTDFVTEASRQGLVVTSLRNRALLTGIADCFIKAIQ 383

Query: 1511 SFCDLPCFRENPAKAASVYMSFVPLVGEV--HGFFSGLPRMILSKLRMSNCL--ILEGNN 1566
             FCD P  +         +M ++P         F+SGL   I  +++    L  +  G  
Sbjct: 384  EFCDHPTLQYQ-------WMRWLPQRDSYPWDSFWSGLLDRIEERIQEVKILRTLGTGKL 436

Query: 1567 NQWAPPCKVLRGW--NDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKI 1624
            +     C+ L+ W  + +   L  D+  + +L   + ++D+ L         ++ YG   
Sbjct: 437  DYIHKLCR-LQPWLRDKQGDPLFMDMDEEIYLSKEYKTEDLAL---------LKPYG--- 483

Query: 1625 LLQIISS---LCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLI----D 1677
             L+ +S+   + R E  L+    +  +   N      +HS    +L    +T  +    D
Sbjct: 484  -LENVSAWDMIVRVEKDLQKQPEA--SRMKNPDTDNEWHSLAARALVLLKDTATLKTSQD 540

Query: 1678 NLQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASA 1737
             ++ + FIPL +G + S   G ++ +S C    G   +    NL   +R +     + + 
Sbjct: 541  RIKDLKFIPLENGEWVSAGSGPLY-YSHCP---GQLAIPDDLNLSLDVRGLILKKPAVAV 596

Query: 1738 VDKSSLGVISVDNLNRMLLKIGV 1760
             D+++L   S+ +L  + L  G+
Sbjct: 597  ADEAALSA-SISHLKFLYLSEGL 618


>gi|303320881|ref|XP_003070435.1| Heterokaryon incompatibility protein, putative [Coccidioides
            posadasii C735 delta SOWgp]
 gi|240110131|gb|EER28290.1| Heterokaryon incompatibility protein, putative [Coccidioides
            posadasii C735 delta SOWgp]
          Length = 2074

 Score =  143 bits (361), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 153/539 (28%), Positives = 242/539 (44%), Gaps = 81/539 (15%)

Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAG 1270
            +EL+QNADDN Y  +V P+LT   ++  + V  NE GF+ E++ ++C +G S +KG SA 
Sbjct: 1    MELIQNADDNRYRRDVSPSLTLSYRKMHLRVDCNELGFAPEDVDSICRIGGS-QKGISAD 59

Query: 1271 Y---IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
                IG KGIGFKSVF+V     I S  +  KFD +  ++G + P             + 
Sbjct: 60   RMDCIGEKGIGFKSVFKVASVVWISSGKYSFKFDNAT-RLGMIAP-------------MW 105

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ----CIMF 1383
            ++ P +  +   +  +RL  ++   E +    + D      P +L+FL++++    CI  
Sbjct: 106  AQFPAEAPASGTSMLLRLLNKSVRDELLKEMRVCD------PKILIFLNQIRELKVCIAQ 159

Query: 1384 RNMLNDSLVVIRK--KIV---GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIAL 1438
             N       + RK  K V   G  +  +  G D    FV  + + +G         EI L
Sbjct: 160  DNGTTFERTISRKDDKDVDKPGQYVTTLRYG-DSSQQFVVIRYVPSGR---STDKAEILL 215

Query: 1439 ALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
            A  ++ + E    P++  Q V+ FLP+R YGLKF++QG F L +SRE+++G+S  NQ L 
Sbjct: 216  AFPIKNNQE----PVIQSQCVYTFLPIRDYGLKFLVQGRFSLIASREDINGSSAENQRLR 271

Query: 1499 SEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
            S  P  FV A   F +   FR        ++  ++P++ E   FF+ L    +    +S 
Sbjct: 272  SLIPKAFVQAVDYFQESDYFR-------YIWPRYLPIIPE-ENFFTSLTEETI--FTLSQ 321

Query: 1559 CLILEGNNNQWAPPCKVLR---GWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS-LARA 1614
              +LE  + + A P  +      + D     LP   L       +LS+     DS L + 
Sbjct: 322  RKLLESESGELATPGTLFYVPPRFRDPNGDPLP---LCSRTAFRYLSRKYFDEDSDLFQC 378

Query: 1615 LGIEEYGPKILLQIISSLCRTENGLRSMGLSW---LASWLNELYTISFHSSGQSSLQSGV 1671
            LG+     + LL  + S+             W   L+  LN           +  L++  
Sbjct: 379  LGVRPLSLRDLLVDVKSVVNQYQNHSREATEWHIRLSKALNR----------EDMLRAYR 428

Query: 1672 ETDLIDNLQRIPFIPLSDGTFSSVDE--GTIWLHS---DCSVFDGGFGLEAFPNLCAKL 1725
             T     L  IP IPL DG + S  E  G I+  S   D SV  G   +E  P+  + +
Sbjct: 429  HT-----LHSIPLIPLRDGRWISASESAGIIFFPSALADLSVPSGLNSVEIHPDAASNV 482


>gi|302419259|ref|XP_003007460.1| hypothetical protein VDBG_01648 [Verticillium albo-atrum VaMs.102]
 gi|261353111|gb|EEY15539.1| hypothetical protein VDBG_01648 [Verticillium albo-atrum VaMs.102]
          Length = 1457

 Score =  142 bits (358), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 156/571 (27%), Positives = 238/571 (41%), Gaps = 102/571 (17%)

Query: 1186 KQHARLGR----ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG--- 1238
            +QH  + R    AL  L+ +L S+ +HF+LEL+QNADDN YP  V PT + +L ++G   
Sbjct: 33   EQHGFVARNLESALDILASQLNSKPTHFILELIQNADDNAYPRGVRPTFSLVLGKNGSNY 92

Query: 1239 -IVVLNNEQGFSAENIRALCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             +    NE GFS +NI A+C +G STK G      G+IG KGIGFKSVF+V    +I SN
Sbjct: 93   TLRTDCNETGFSLKNIDAICSIGKSTKTGIINGQKGFIGEKGIGFKSVFKVAHQVDILSN 152

Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEG 1354
             +  + D +   +G + P    P             P     +     +    R K    
Sbjct: 153  FYEFRIDRT-AHLGMLDPK---PHRF----------PDTQRREGHTQMLLHLLRDKDHR- 197

Query: 1355 IAMNNIVDMFSDLHPSLLLFLHRLQCI---------MFRNMLND----------SLVVIR 1395
              +++I      + P LLLFL +L  +         ++R+   D          SL+  R
Sbjct: 198  --ISDITKGLQQIKPELLLFLRQLSVLNIKAENGHCVYRSARQDRDPRYDGETVSLIEDR 255

Query: 1396 KKIVGDGIIKVSCGEDKMTWFVASQ--KLRAGVIRPDVKTTEIALALTLQESNEGNYGPL 1453
                 D  +K +    K+   V  +   +     R  V +TEI LA  +        G  
Sbjct: 256  TASPLDEPVKTT----KLYRIVRYELHDMPVDARRNSVSSTEITLAFPV--------GSE 303

Query: 1454 LYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF- 1512
            L     FAFLP+   G  FI+Q DF+L ++R+ +    PWNQ +L   P  FV A   F 
Sbjct: 304  LETCDAFAFLPIDQSGFNFIVQADFLLVANRQALQHALPWNQRILEAVPDAFVQAVHHFN 363

Query: 1513 -----------CDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLI 1561
                        + P F E     +S +        EVH    G+      K R+ +  +
Sbjct: 364  HPGATHGYSLKYEWPLFLEYTGPKSSPW-------AEVH---QGI------KKRLCDEDV 407

Query: 1562 LEGNNNQWAP----PCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGI 1617
            LE  +  +      PC         AH  L  I  +     GFL  D  ++     A   
Sbjct: 408  LEARDGSFRAVRPIPCPA-GTQQTAAHKPLIVIPTEYRFDNGFLI-DATINHGHHLAF-- 463

Query: 1618 EEYGPKI-LLQIISSLCRTENGLRSMGLSWLASWLN---ELYTISFHSSGQSSLQSGVET 1673
             EY P +  L+ +     T  GLR+    WL    +      +  +H +  ++      +
Sbjct: 464  -EYDPVLEKLKTLDMKEMTSAGLRNEFSDWLERAGHAGLAAKSDKWHVALAAAFMWKGRS 522

Query: 1674 DLIDNLQRIPFIPLSDGTFSSVDEGTIWLHS 1704
               D LQ +P + L+DG++++   G I+  S
Sbjct: 523  KQKDKLQELPLVHLADGSWAACASGDIFFPS 553


>gi|347976163|ref|XP_003437411.1| unnamed protein product [Podospora anserina S mat+]
 gi|170940269|emb|CAP65496.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1640

 Score =  140 bits (353), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 145/573 (25%), Positives = 247/573 (43%), Gaps = 98/573 (17%)

Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFI 1233
             +E  +L K    +G+++  L++ELYS+D  F+ EL+QNADDN +        EP + F+
Sbjct: 37   KVERALLAKDQL-IGQSVITLARELYSKDVRFIFELLQNADDNHFNHAKSSGQEPYVAFL 95

Query: 1234 LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
            +    IV+  NE GF+ EN+RA+C VG S+K G+  GYIG KGIGFKSVF+V     + S
Sbjct: 96   IYHDKIVIDCNEDGFTEENLRAICSVGKSSKHGAQ-GYIGEKGIGFKSVFKVAWKVHVQS 154

Query: 1294 NGFHVKFDTSEGQ--IGFVLP-------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIR 1344
              F   F    G   +G + P       +  PP  I      L  +   L+ K       
Sbjct: 155  GPFSFCFIHRPGDSGMGMISPEWHQSEDSTRPPTGITRMTFTLHNN---LDQKL------ 205

Query: 1345 LPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR------------NMLNDSLV 1392
                    +     +IV+ F DL P++LLFL +L+ I                M   + V
Sbjct: 206  --------QAAQRQSIVNEFKDLQPAMLLFLKKLRRIEVHFFDVEGTKESHSAMSLSAPV 257

Query: 1393 VIRKKIVGDGIIKVSCGEDKM--TWFVASQKLRAGVIRPDVKT-------------TEIA 1437
             I  + V          E K     +  ++    G+ R + +              T++ 
Sbjct: 258  DISNRAVLTTTKTTHNSEPKTFSHHYHVTKLWATGLARNENRKYSPAEESSKAYSRTKVV 317

Query: 1438 LALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWL 1497
            LA  L + +     P++  Q +FAFLP+R  G  F++  DFV  ++RE++  +S  N+ L
Sbjct: 318  LAFPLTDQS----IPIIEPQDIFAFLPIRKIGFNFLIHTDFVTMANREDIVTSSQRNKGL 373

Query: 1498 LSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVH--GFFSGLPRMILSKLR 1555
                   FV+A    C+ P  +         +M ++P +   H   F+S    ++  ++ 
Sbjct: 374  RRYLAQAFVTAAEQMCEHPQLQFQ-------WMRYLPRLSGNHWDSFWSEFSGILQEEIT 426

Query: 1556 MSNCLILEGNNNQWAPPCKVLRGWNDRAHSL-------LPDILLQK--HLGLGFLSKDIV 1606
              + +I     +    P   L+ W  +   L         D+  +K  ++ L +   D+ 
Sbjct: 427  KVDVII--PFTSMTPRPLTALKQWPSKISWLDHLDRPVFEDLPKEKAAYVSLEYQDSDLS 484

Query: 1607 LSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSS 1666
            +         + +YG K  L  +  + R +  LRS      +   +      +HS     
Sbjct: 485  I---------LRDYGMK-YLSWVDLISRAKADLRSRSSKMKSPQTD----AKWHSLAAKL 530

Query: 1667 LQSGVET-DLIDNLQRIPFIPLSDGTFSSVDEG 1698
            L+ G+      + ++ +  +PLSDG+++SV +G
Sbjct: 531  LKRGLRNLKYEEEIKALNLLPLSDGSWTSVTKG 563


>gi|159125686|gb|EDP50803.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
          Length = 1608

 Score =  140 bits (352), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 123/471 (26%), Positives = 218/471 (46%), Gaps = 63/471 (13%)

Query: 1172 LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVE 1227
            + P+I       L K+ A +G ++  L++ LY+  + F+ EL+QN DDN Y      + +
Sbjct: 35   MDPDIRREVEEALLKKDAMIGSSVITLAKNLYNSTARFVFELLQNVDDNSYNVARSRSAD 94

Query: 1228 PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
            P ++F +    I+V  NE GF+ EN+ A+C+VG S+K G+  GYIG KGIGFKSVF V  
Sbjct: 95   PFVSFCVDNRRIIVECNEDGFTHENLVAICNVGKSSKTGAQ-GYIGEKGIGFKSVFMVAW 153

Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
               I S      F  + G  G  + + +   + ++  R L++  + L             
Sbjct: 154  KVHIQSRDLSFSFRHNVGDSGMGMISPIWEESEEVLPRPLTRITLYLHE----------- 202

Query: 1348 RTKFSEGIAMN--NIVDMFSDLHPSLLLFLHRLQCIMFR----------------NMLND 1389
             T  +  +A      +  F +L  + LLF+  L+ I  +                   ++
Sbjct: 203  -TGSAADLATQRETTLQQFRELKAAFLLFMKNLRRIEVKLCGSLDKEISRTTFSMRCEDE 261

Query: 1390 SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI--RPDVKTTEIALAL-TLQESN 1446
              V ++++IV DG ++      + T +  + KL A  I    + + T++ LA     E+N
Sbjct: 262  GRVALKQEIVQDGDLQ------ECTQYYHTTKLTASNIPRSENREYTDVELATKAYSETN 315

Query: 1447 -------EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
                     +  P++ +  VFAFLP+R  GL F++Q DFV  +SR+++  +S  N  LL+
Sbjct: 316  IILAFPLTRDSVPIIEKHEVFAFLPIRNMGLPFLIQADFVTDASRQDIVRSSARNLKLLT 375

Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV--GEVHGFFSGLPRMILSKLRMS 1557
                 F+ A + FC            A  +M ++P    G    ++  LP  I  +L+++
Sbjct: 376  AIAQAFIQAVKQFC-------THTSLAYQWMRYLPDFEWGSQDRYWKALPGEIRRQLQLT 428

Query: 1558 NCLILEGN-NNQWAPPCKVLRG--WNDRAHSLLPDILLQKHLGLGFLSKDI 1605
            + +    + N +     ++L     +     LLPD+  +++L   +  KD+
Sbjct: 429  DTMWTRSHLNMRCIADVRLLPSSMCDKYGKPLLPDLDTEQYLSPHYALKDL 479


>gi|146323927|ref|XP_751403.2| conserved hypothetical protein [Aspergillus fumigatus Af293]
 gi|129557466|gb|EAL89365.2| conserved hypothetical protein [Aspergillus fumigatus Af293]
          Length = 1608

 Score =  140 bits (352), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 123/471 (26%), Positives = 218/471 (46%), Gaps = 63/471 (13%)

Query: 1172 LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVE 1227
            + P+I       L K+ A +G ++  L++ LY+  + F+ EL+QN DDN Y      + +
Sbjct: 35   MDPDIRREVEEALLKKDAMIGSSVITLAKNLYNSTARFVFELLQNVDDNSYNVARSRSAD 94

Query: 1228 PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
            P ++F +    I+V  NE GF+ EN+ A+C+VG S+K G+  GYIG KGIGFKSVF V  
Sbjct: 95   PFVSFCVDNRRIIVECNEDGFTHENLVAICNVGKSSKTGAQ-GYIGEKGIGFKSVFMVAW 153

Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
               I S      F  + G  G  + + +   + ++  R L++  + L             
Sbjct: 154  KVHIQSRDLSFSFRHNVGDSGMGMISPIWEESEEVLPRPLTRITLYLHE----------- 202

Query: 1348 RTKFSEGIAMN--NIVDMFSDLHPSLLLFLHRLQCIMFR----------------NMLND 1389
             T  +  +A      +  F +L  + LLF+  L+ I  +                   ++
Sbjct: 203  -TGSAADLATQRETTLQQFRELKAAFLLFMKNLRRIEVKLCGSLDKEISRTTFSMRCEDE 261

Query: 1390 SLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI--RPDVKTTEIALAL-TLQESN 1446
              V ++++IV DG ++      + T +  + KL A  I    + + T++ LA     E+N
Sbjct: 262  GRVALKQEIVQDGDLQ------ECTQYYHTTKLTASNIPRSENREYTDVELATKAYSETN 315

Query: 1447 -------EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
                     +  P++ +  VFAFLP+R  GL F++Q DFV  +SR+++  +S  N  LL+
Sbjct: 316  IILAFPLTRDSVPIIEKHEVFAFLPIRNMGLPFLIQADFVTDASRQDIVRSSARNLKLLT 375

Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV--GEVHGFFSGLPRMILSKLRMS 1557
                 F+ A + FC            A  +M ++P    G    ++  LP  I  +L+++
Sbjct: 376  AIAQAFIQAVKQFC-------THTSLAYQWMRYLPDFEWGSQDRYWKALPGEIRRQLQLT 428

Query: 1558 NCLILEGN-NNQWAPPCKVLRG--WNDRAHSLLPDILLQKHLGLGFLSKDI 1605
            + +    + N +     ++L     +     LLPD+  +++L   +  KD+
Sbjct: 429  DTMWTRSHLNMRCIADVRLLPSSMCDKYGKPLLPDLDTEQYLSPHYALKDL 479


>gi|402084208|gb|EJT79226.1| hypothetical protein GGTG_04312 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 2143

 Score =  139 bits (350), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 171/378 (45%), Gaps = 49/378 (12%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALC 1257
            L+  L ++ + FLLEL+QNADDN Y ++V P L     +  +    NE GF  ++++++C
Sbjct: 52   LANTLSTKPAQFLLELLQNADDNAYADSVNPELVITFTDETLQFDTNELGFRRQDVQSIC 111

Query: 1258 DVGNSTKK------GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL 1311
             +G S+KK      G +A  IG KG+GFK+VFRV +   IHS  +  +FD S  ++G + 
Sbjct: 112  SMGQSSKKEPKKQEGKAASCIGEKGVGFKAVFRVAEQVFIHSGHYSFQFD-SRRELGCIK 170

Query: 1312 PTL-VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPS 1370
            P   V P    M                W T I L    K    +    +++       +
Sbjct: 171  PEWAVFPGKPRM---------------GW-TSIHL----KLKPDLNREQLLEEMGKFDAT 210

Query: 1371 LLLFLHRLQCIMFR----------NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVAS- 1419
            LLLFL R++ I  +            LN ++  +       G+  +S        +V + 
Sbjct: 211  LLLFLKRIRVIRVQIKPLYSWLSYKSLNRTIRRLDDASGLGGLAMLSLTPSTFPGYVVNH 270

Query: 1420 ---QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
               + +     RP    +EI LA       +        +  V+AFLP+R+YG KF +Q 
Sbjct: 271  YTVRNMPPAPERPSCAESEIVLAFPKAPQQQTKAAGFPGEHKVYAFLPIRSYGFKFTIQA 330

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLV 1536
            DF+LP+SRE++  +S WN+ L    P   V        +P  R         +  F P+ 
Sbjct: 331  DFLLPASREDIMLDSDWNRALREAIPPAVVDFIPKLRAVPELRYQ-------WPRFFPVR 383

Query: 1537 GEVHGFFSGLPRMILSKL 1554
             ++  FF GL  +I +++
Sbjct: 384  HDMDDFFHGLSTIIANQV 401


>gi|342874594|gb|EGU76590.1| hypothetical protein FOXB_12889 [Fusarium oxysporum Fo5176]
          Length = 2179

 Score =  137 bits (344), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 158/611 (25%), Positives = 253/611 (41%), Gaps = 89/611 (14%)

Query: 1140 GLGDCTTQTLPEDKECEDAALIVESIRRDEFG-LGPNISNMESNMLKKQHARLGRALHCL 1198
            G G  T Q L ++   E   L  E + R   G + P +       L ++   +G A+  L
Sbjct: 9    GPGQRTAQDLVKEIAKERGYLGEEQLAR--IGEINPELRREVEEALLRKDEMIGSAVLTL 66

Query: 1199 SQELYSQDSHFLLELVQNADDNIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIR 1254
            ++ LY+ ++ F+ EL+QNADDN Y   +    +P ++F ++   + +  NE GF+ EN++
Sbjct: 67   ARNLYTSNARFVFELLQNADDNNYSTALSSGQDPYVSFEVRPDKVSIECNENGFTHENLK 126

Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
            A+C +G S+K G+ AGYIG KGIGFKSVF       I SN F   F   +G  G  + T 
Sbjct: 127  AICAIGKSSKVGA-AGYIGEKGIGFKSVFMAAWKVHIQSNDFSFSFTHRKGDSGLGMVTP 185

Query: 1315 V----------PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMF 1364
            V              I +F    S DP   E       IRL                  F
Sbjct: 186  VWEDADESLGGSSTRITLFLH-TSDDPE--EDARQRETIRL-----------------QF 225

Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKM-----TWFVAS 1419
             DL  ++LLFL +L+ +       D            G   V+  ++         +  +
Sbjct: 226  QDLQHTILLFLRKLRKVQVSFFDEDDTQTSSTTYSLRGSNPVTVKKETFEGVEERQYHVT 285

Query: 1420 QKLRAGVIRPDVKT-----------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTY 1468
            + +   + + + +T           TE+ LA  L +       P++  Q VFAFLP+R  
Sbjct: 286  KHVAENIPKSENRTYSEEQDRADSSTEVVLAFPLADLGT----PIVENQDVFAFLPMRPM 341

Query: 1469 GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASV 1528
            G KF++  DFV  +SR+ +   S  N+ LL+     F+ A   FC  P  +         
Sbjct: 342  GFKFLIHTDFVTEASRQGIVTTSLRNRGLLNGIADCFIKAIEEFCQHPTLQYQ------- 394

Query: 1529 YMSFVPLVGEV--HGFFSGLPRMILSKLRMSNCLILEGNNN-QWAPPCKVLRGW--NDRA 1583
            +M ++P         F+SGL   I S+++    L   G     +      L+ W  +   
Sbjct: 395  WMRWLPQRNSYPWDSFWSGLLDRIESRIQEVKILRTLGTGKLDYIHKLCRLQPWLRDKNG 454

Query: 1584 HSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGL-RSM 1642
              L  D+  + +L   +   D+     L    G+E   P        ++ R E  L R +
Sbjct: 455  DPLFTDLDEEIYLAKEYTRDDL----KLLETYGLESMSPS------DAITRVEKDLERDI 504

Query: 1643 GLSWLASWLNELYTISFHSSGQSSL----QSGVETDLIDNLQRIPFIPLSDGTF-SSVDE 1697
              S + S   +     +HS    +L     +       D L+++ FIPL +GT+ SS   
Sbjct: 505  KDSRMKSPATD---NEWHSLAARALILLKNTATVGPAQDRLKQLRFIPLENGTWVSSSSR 561

Query: 1698 GTIWLHSDCSV 1708
               + HS  S+
Sbjct: 562  PLYYSHSSGSL 572


>gi|323444857|gb|EGB01787.1| hypothetical protein AURANDRAFT_69496 [Aureococcus anophagefferens]
          Length = 360

 Score =  135 bits (341), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 68/149 (45%), Positives = 96/149 (64%), Gaps = 2/149 (1%)

Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLE 1212
            ++ E    ++ SIR   FGL  + + ++ +    Q   L  AL  +++++YS DSHF+LE
Sbjct: 155  RDDEPPGAVLASIRAG-FGLETDDARIDVSQ-SGQLQMLDNALKVIARDIYSSDSHFILE 212

Query: 1213 LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYI 1272
            L+QNADDN Y  +  PT+ F      +VV NNE+GF A ++RALC VG S+K    +GYI
Sbjct: 213  LLQNADDNDYAPSAAPTIVFDASPEAVVVRNNERGFRARHVRALCHVGASSKGAGGSGYI 272

Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            G+KGIGFKSVFRV+  PE+HSNG+    D
Sbjct: 273  GQKGIGFKSVFRVSSRPEVHSNGYAFALD 301


>gi|347826655|emb|CCD42352.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 1627

 Score =  135 bits (341), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 183/379 (48%), Gaps = 40/379 (10%)

Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQ-------HARLGRALHCLSQELYSQ 1205
            +E +DA   ++ +R    G+          +L K          ++G +   LS++LY +
Sbjct: 7    EELQDARAFIDDLRSANGGITKEEEEELDRLLPKVKEALENLQNKVGISTKILSEQLYQK 66

Query: 1206 DSHFLLELVQNADDNIYPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
             + F+ E++QNA+DN Y  + + P L+F L    +++ +NE GF+ ENI+A+C +G ST 
Sbjct: 67   STRFIYEIIQNAEDNEYKGSKKLPFLSFTLYSDRLIIDSNEDGFTKENIKAICSIGEST- 125

Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-----IGFVLPTLVPPFN 1319
            K S  GYIG KGIGFKSVF V     I S  +   F+ S+       +G V P    P  
Sbjct: 126  KASIQGYIGEKGIGFKSVFTVAHKVHIQSGCYSFAFEYSKNDDGKSGLGMVTPMNETPHE 185

Query: 1320 I--DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLL---F 1374
            +  D+  R++      L       C  L ++   +    +   +     L  +  L    
Sbjct: 186  LPGDVQTRMI------LYLHAGEDCGAL-YQELLNLPDTLLLFLKKLRQLTVNFELPDRN 238

Query: 1375 LHRLQCIMFR---NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQ--KLRAGVIRP 1429
            +H++Q  + R   + L++++V  +  I G     V+    K  W        L A   R 
Sbjct: 239  VHQVQFSLTRPNNDRLSENIVQTKTAITG----AVNSVTHKSFWVKKRPVFNLPADSARG 294

Query: 1430 DVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDG 1489
             ++  E+ LA  + E    ++ P++ +Q  FAFLP++  G KF++Q DF+  +SRE++  
Sbjct: 295  KIREAEVILAFPIDE----DHVPVIEEQYAFAFLPMKKTGYKFLIQCDFITKASREDI-V 349

Query: 1490 NSPWNQWLLSEFPALFVSA 1508
            +S WN+ LL E  A F SA
Sbjct: 350  DSAWNRQLLVEVVATFRSA 368


>gi|119499764|ref|XP_001266639.1| hypothetical protein NFIA_102260 [Neosartorya fischeri NRRL 181]
 gi|119414804|gb|EAW24742.1| hypothetical protein NFIA_102260 [Neosartorya fischeri NRRL 181]
          Length = 1609

 Score =  135 bits (341), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 123/423 (29%), Positives = 188/423 (44%), Gaps = 70/423 (16%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQ 1215
            E A  IV+ I+    GL P +                     L QELY  D+ FL EL+Q
Sbjct: 7    EQAQAIVDDIQVTNGGLPPEVQK------------------ALPQELYDTDTRFLFELIQ 48

Query: 1216 NADDNIY----PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY 1271
            NA+DN Y     +  EP L   L +  I V +NE GF+  ++RA+C +  S+KK  + GY
Sbjct: 49   NAEDNCYGHAASQAEEPFLHLTLHKDYITVDSNEDGFTENDVRAICSIHQSSKK-QTGGY 107

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDP 1331
            IG KGIGFKSVF+V     I S  F   F+  +G  G     ++ PFN         ++P
Sbjct: 108  IGHKGIGFKSVFKVAYKVSIQSGPFSFFFEHHQGGSGL---GMITPFN---------EEP 155

Query: 1332 VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ-----------C 1380
             +L     NT I L F T  S+           S++  ++LLFL +LQ            
Sbjct: 156  QELPPGV-NTRITL-FLTNISD---FEACASELSEIPDTMLLFLRKLQRLTIEIPPLQSQ 210

Query: 1381 IMFRNM--LNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIAL 1438
            I F+    ++  L+ + K+  G+ + K    E      +     R G  +P+V   ++ L
Sbjct: 211  ISFKRHEDMSKHLITLTKETNGEQLKKSYHLEKTTLSNLPQHPSRPG--QPEV---DLIL 265

Query: 1439 ALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
            A  +QE    +  PL+  Q V++FLP+   G  F++Q DF+  +SR+ +      N  + 
Sbjct: 266  AFPVQE----DCSPLIQAQYVYSFLPMHDEGFNFLIQSDFITQASRQGIH-RCDRNYAIR 320

Query: 1499 SEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSN 1558
                 LF+ A   FC     R         ++ ++P       F++ L  MI  +L+ S 
Sbjct: 321  EGISHLFLQAVAYFCKHTSLRYE-------WLQYLPGTHIRDPFWACLREMIFDRLKGSK 373

Query: 1559 CLI 1561
             L 
Sbjct: 374  ILF 376


>gi|358380666|gb|EHK18343.1| hypothetical protein TRIVIDRAFT_47475 [Trichoderma virens Gv29-8]
          Length = 1650

 Score =  133 bits (335), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 149/589 (25%), Positives = 250/589 (42%), Gaps = 102/589 (17%)

Query: 1156 EDAALIVESI-------RRDEFGLGPN-ISNMESNMLKKQHARLGRALHCLSQELYSQDS 1207
            +DA  +V+SI         D++ L P+ +       L+        A+  L++ LY+ D+
Sbjct: 9    KDACKLVKSIAEDHGYVHEDDWNLMPSAVRERVQKALRNLEGIAAVAVSTLAKNLYTSDA 68

Query: 1208 HFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
             F+ EL+QNA+DN + + +E    P ++F L    I++  NE GF  +N++A+C VG S+
Sbjct: 69   RFVFELLQNAEDNSFEKAIEANLTPYVSFELHPDHIIIECNEDGFEPKNLKAICAVGQSS 128

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ--IGFVLPTLVPPFNID 1321
            K G   GY+G KGIGFKSVF       I S  F   F    G   +G + P    P N  
Sbjct: 129  KSGVQKGYVGEKGIGFKSVFMAAWKVHIKSGHFSFFFQHKRGDQGLGMISPVWEEPENDQ 188

Query: 1322 MFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI 1381
                       Q++ +  +    +P             I+D FS L+ S+LLF+  L  I
Sbjct: 189  ------PHPGTQMKLELGDGYTPIP-------SSQYQIILDQFSSLNESILLFMRNLGEI 235

Query: 1382 ---MFRN-------------MLNDSLVVIRK--KIVGD--------GIIKVSCG----ED 1411
               +F N              +N +L  + K  K   +         I K + G     +
Sbjct: 236  RISIFDNEGNLEKSNTFSKERINANLTRLTKVSKTANNVDSSFKDYYIFKHTVGNLDKNE 295

Query: 1412 KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
              T+    +K +A        T EI L   L E +     P++  Q +FAFLP++  G  
Sbjct: 296  NRTYSEYEEKTKA------YATAEIVLGFPLTEES----IPIVEYQDIFAFLPMKEVGFT 345

Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
            F++  DFV  ++R+++   S  N  +       F+ A   FC+ P       K    +M 
Sbjct: 346  FLINSDFVTQANRQDIVTTSSRNIGIRQGIADAFIQAILEFCNHP-------KLQYTWMR 398

Query: 1532 FVPLVGE-VHGFFSGLPRMILSKLRMSNCLILEGNNN-------QWAPPCKVLRGWNDRA 1583
            F+P   +    F+S L  ++  KL+ +  L              +++   ++ R      
Sbjct: 399  FLPNKDQHFDAFWSQLVTLLDDKLKETAVLRPRSETTFRLISSLRYSVDSQLDR----HG 454

Query: 1584 HSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEY-GPKILLQIISSLCRTENGLRS- 1641
              L  D   + H+   +  +D+ +   L    G+ EY   +ILL I   L  +++ ++S 
Sbjct: 455  EPLFRDNNPEMHISKHYAKEDVNI---LRNFYGLTEYTASEILLAISLDLGMSDSRMKSE 511

Query: 1642 -MGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSD 1689
             M   W     N L  +  + +G +S++S         L+ +PF+PL D
Sbjct: 512  DMDDDWHTRSANYLSYV--YLTGDASIRS--------KLRSLPFLPLQD 550


>gi|302841821|ref|XP_002952455.1| hypothetical protein VOLCADRAFT_93075 [Volvox carteri f. nagariensis]
 gi|300262391|gb|EFJ46598.1| hypothetical protein VOLCADRAFT_93075 [Volvox carteri f. nagariensis]
          Length = 4091

 Score =  133 bits (334), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 73/148 (49%), Positives = 86/148 (58%), Gaps = 22/148 (14%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLE--------- 1212
            VESIRR+EFG G  +    + +  +  AR GRAL  L+ ELYS DSHF++E         
Sbjct: 1609 VESIRREEFGEGLELGGEAAALSGRLAARTGRALQRLAAELYSADSHFVMELVQLPLLAL 1668

Query: 1213 ------------LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVG 1260
                         +QNADDN YP  V P L F+LQ   I   NNE GFS  N+RALCD G
Sbjct: 1669 LLLPSHSVPCRTFLQNADDNSYPAGVTPCLEFVLQRDVITAANNEVGFSPANLRALCDAG 1728

Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
             STK     GYIG+KGIGFKSVFRV+ A
Sbjct: 1729 ASTKS-RVTGYIGQKGIGFKSVFRVSSA 1755



 Score = 58.2 bits (139), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 56/135 (41%), Gaps = 38/135 (28%)

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLP-------------------- 1516
            D+V+PSSRE +D +SPWNQ L +  P LF+ A   F  LP                    
Sbjct: 1882 DWVVPSSREALDVDSPWNQELRARLPGLFLRALAVFKQLPHPYVAASGAAAAAGTTTACG 1941

Query: 1517 ------CFRENPAKAASV----------YMSFVPLVGEVHGFFSGLPRMILSKLRMSNCL 1560
                  C  +   +              ++  VPL GE  GFF+GLP  I + LR + C+
Sbjct: 1942 GGGGMQCIGQQHQQHQQRRAAELFWLDQWLRCVPLEGEAQGFFAGLPHSIATLLRAAPCI 2001

Query: 1561 ILEGNNNQWAPPCKV 1575
                 +  WA P + 
Sbjct: 2002 PT--ADGGWATPSET 2014


>gi|134084320|emb|CAK48660.1| unnamed protein product [Aspergillus niger]
          Length = 1689

 Score =  133 bits (334), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 202/828 (24%), Positives = 331/828 (39%), Gaps = 149/828 (17%)

Query: 1156 EDAALIVESIRRDEFGLGPNI---------SNMESNMLKKQHARLGRALHCLSQELYSQD 1206
            E A  +VE I +    LG ++           +E  ML+K    +G ++  LS+ LY+  
Sbjct: 9    ERARKLVEGISKRHGYLGEDVLSQMSDEVRRQVEEAMLRKDEM-IGSSVVTLSKNLYNSS 67

Query: 1207 SHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
            + F+ ELVQNADDN Y    E    P+L F +    IVV  NE GF+ ENI A+C+VG S
Sbjct: 68   ARFVFELVQNADDNHYTTARERATVPSLAFHVYPRRIVVDCNEDGFTHENIVAICNVGKS 127

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
            +K G+  GYIG KGIGFKSVF V     I S  F   F    G  G           + M
Sbjct: 128  SKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSGDFSFSFRHKPGGSG-----------MGM 175

Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF---LHRLQ 1379
               +      QL+       + L   T   E  AM      F +L  +LLLF   L RL 
Sbjct: 176  ISPIWEDVEEQLQRPMTRMTLYLHEDTAPRE-TAMQQ----FRELQDTLLLFTKNLQRLG 230

Query: 1380 CIMFRN-------------MLNDSLVVIRKKIVG-DGIIKVSCGEDKMTWFVA------- 1418
              M+ +               N++L V+ K ++  DG  +       +T +VA       
Sbjct: 231  VFMYDDDEKQLSSTVFAISSQNENLKVLTKTLLAPDGTSEELTRNYHLTKYVAHDLPGSE 290

Query: 1419 -------SQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
                    ++ RA          +I LA  + +    N  P++  Q VFAFLP+R  G  
Sbjct: 291  NRTYTDKERETRA------YSRADIILAFPVTK----NSVPVIEPQDVFAFLPIRNMGFT 340

Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
                  F    + +++  +S  N+ L +E    F +A       P  R         +M 
Sbjct: 341  V----GFSSCKTLQDIVRSSARNRKLRTEISIAFGTAVLEMRKHPTLRYR-------WMR 389

Query: 1532 FVPLVGEVHG--FFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGW----NDRAHS 1585
            ++P     H   F+ GL   I S+L +S  ++    +  W P  ++ R W    +     
Sbjct: 390  YLPKRDGHHWDPFWVGLLGEIKSRL-ISLPILQSRTSGSWFPIEQMERLWEHWFDKSGDP 448

Query: 1586 LLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKILLQIISSLCRTENGLRSMGLS 1645
            L PD+  +++L + +L KDI           + EYG  +    +  +    N    M L 
Sbjct: 449  LFPDLPREQYLSVHYLEKDIT---------RLMEYG--LTFMAMPGMIARAN----MDLE 493

Query: 1646 WLASWL-NELYTISFHSSGQSSLQSGVE---TDLIDNLQRIPFIPLSDGTFSSVDEGTIW 1701
            +  S + N+  +  +HS   ++L         + I+ ++ +P IPL  G + S+  G ++
Sbjct: 494  FPHSKMKNKETSEDWHSRAATALSHSFRLGYDNCIEEVKAMPLIPLKTGQWVSIKSGAVY 553

Query: 1702 LHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQ 1761
                   F    G++   +L   L T                G I+     ++   IGVQ
Sbjct: 554  -------FPSIHGIDIPSDLGVGLVTS---------------GAINNPQRKQLFKDIGVQ 591

Query: 1762 QLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRK 1821
            + S   + K+     I D  +A G   +         ++L   C          V     
Sbjct: 592  EASVDFVRKL-----IKDNYSAYGIFFIPVSREHLTFLYLT-ECLASSSSTALFVHLFNG 645

Query: 1822 KAFVLTNHGF--KRPSEIPIHFGKEFGNPVSVNMLIHDIDIKWYEVDITYLKHPA-KESL 1878
            K    ++H +  ++P    ++F  E  +P     L+  I      +D++++     ++  
Sbjct: 646  KGLAFSDHKYQIRQPKIHTVYFPNE--DPYGAKALLEPIQGGAPGLDVSFINEDYLRDPP 703

Query: 1879 SCGLVKWRKFFKEIGITDFVQVVQVDKDVA--DISHTGFKNMWTKELL 1924
            SC   + RK+ + +      QV  V   ++  D+ H+ F+    +E L
Sbjct: 704  SCPPKEQRKWIEWLQ-----QVFHVRGTLSLIDLRHSPFRARLGQECL 746


>gi|429851713|gb|ELA26885.1| ino80 chromatin remodeling complex protein [Colletotrichum
            gloeosporioides Nara gc5]
          Length = 1640

 Score =  132 bits (332), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 117/392 (29%), Positives = 178/392 (45%), Gaps = 55/392 (14%)

Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQGFSAENIRA 1255
            + LYS  + F+ EL+QNADDN Y +       P ++F +  S IV+  NE GF+ EN++A
Sbjct: 40   KNLYSSKTRFIFELLQNADDNQYSKASALGDTPYISFRIYPSRIVIHCNEDGFTHENMKA 99

Query: 1256 LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
            +C VG S+K G+  GYIG KGIGFKSVF V     I SN F V F   +G  G       
Sbjct: 100  ICAVGESSKTGAQ-GYIGEKGIGFKSVFMVAWKVHIQSNAFSVSFKHKKGDSG------- 151

Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN--IVDMFSDLHPSLLL 1373
                + M   +  +    L+S+     + L   T+ S G+A     I   F +L  ++LL
Sbjct: 152  ----MGMITPVWEETDEVLQSQLTRITLHL-LNTETSNGVAETREMIRAQFDELQETVLL 206

Query: 1374 FLHRLQCIMFRNMLND----SLVVIRKKIVGDGIIKVSC-------GEDKMTWFVASQKL 1422
            FL  ++ I      +D    + V   +     G +K+S          ++  +F  ++  
Sbjct: 207  FLKNIRKISVFFYGDDGRRTTAVSHLRTQPQPGQVKLSSVYATNGRQREQTKYFHVTRGK 266

Query: 1423 RAGVIRPDVKT-------------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG 1469
               + + D +T             +E+ +A  L E+      P++  Q VFAFLP+R  G
Sbjct: 267  ATNLAKNDNRTYTEAEDAAKAYSASEVVVAFPLSET----LVPIVEPQDVFAFLPVRPAG 322

Query: 1470 LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVY 1529
              F++Q DFV  ++R+++  +S  N  LL      FV A   F      R         +
Sbjct: 323  FNFVIQADFVTNANRQDILKDSRRNIGLLHAVGEAFVIAAIDFQAHETLRYR-------W 375

Query: 1530 MSFVP-LVGEVHGFFSGLPRMILSKLRMSNCL 1560
            + F+P   G   GF+ GL   I   LR    L
Sbjct: 376  VQFLPDRNGSWDGFWLGLVEKIAMLLRFEPVL 407


>gi|302889371|ref|XP_003043571.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256724488|gb|EEU37858.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 2274

 Score =  129 bits (324), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 184/409 (44%), Gaps = 54/409 (13%)

Query: 1192 GRALHCLSQE----LYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG------IVV 1241
             RAL+ + +E    LY+   HFLLEL+QNADD  Y   V PT+    +E G      + +
Sbjct: 60   ARALNAVLKETAGKLYADRMHFLLELLQNADDCQYAPGVTPTVNLTYKEHGDGRATTLRI 119

Query: 1242 LNNEQGFSAENIRALCDVGNSTKKGSSAG--YIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
              NE GF +EN+RA+C    S+K  +       G KG+GFKSVF V D   I S  +  K
Sbjct: 120  DCNEIGFHSENVRAICGFCESSKPAAKDQNRTTGEKGMGFKSVFEVADVVWISSGFYSFK 179

Query: 1300 F-DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMN 1358
            F +   G +  + P              L  D  +L     +T + L    K  +GI + 
Sbjct: 180  FVNNDSGMLSMITPHWAK----------LPDDERRLRG---HTSMLLELSAKCDKGILLE 226

Query: 1359 NIVDMFSDLHPSLLLFLHRLQCIMF------RNMLNDSLVVIRKKIVGDGIIK--VSCGE 1410
            N   M  D     +LFL  L+ I        +     ++  +      DG+    +  G+
Sbjct: 227  NFKSMSLDT----VLFLRNLKRIKISTIEAGKKRWETTIKRLDDGPTKDGMFSRTIQIGD 282

Query: 1411 DKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPL----LYQQPVFAFLPLR 1466
               ++ V    +    +R D +  ++  +  L   ++ +   L    L  Q V AFLP+ 
Sbjct: 283  TLASYHVFKHAVNERDLRYDDRRRDVKDSQILLAFSKADDLSLPRQDLKAQKVHAFLPID 342

Query: 1467 TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAA 1526
             YG KF +Q DF+L SSRE+V+ + PWN  L+S+ P  F+ A ++F     F        
Sbjct: 343  HYGFKFAIQADFILVSSREDVE-DYPWNTSLVSQIPTAFLGAIKAFQSQDVFS------- 394

Query: 1527 SVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKV 1575
              ++S+VPL       F  +P  I++ L  S   +L+ +  +W  P K+
Sbjct: 395  --WISYVPLDHYADTLFQDVPAAIVNVL--SENPVLQSDQREWLLPSKL 439


>gi|46137929|ref|XP_390655.1| hypothetical protein FG10479.1 [Gibberella zeae PH-1]
          Length = 1889

 Score =  129 bits (323), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 145/582 (24%), Positives = 246/582 (42%), Gaps = 109/582 (18%)

Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV----EPTLTFIL 1234
            +E  +L+K    +G A+  L++ LY+ ++ F+ EL+QNADDN Y   V    +P ++F +
Sbjct: 49   VEEALLRKDEM-IGSAVLTLARNLYTSNARFVFELLQNADDNDYSTAVSQGQDPYVSFKV 107

Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                + +  NE GF+ EN++A+C +G S+K G +AGYIG KGIGFKSVF       I SN
Sbjct: 108  HPDKVTIECNENGFTHENLKAICAIGKSSKVG-AAGYIGEKGIGFKSVFMAAWKVHIQSN 166

Query: 1295 GFHVKFDTSEGQIGFVLPTLV----PPFNIDMFCRL-----LSKDPVQLESKCWNTCIRL 1345
             F   F   +G  G  + T V         D   R+      S+DP + E +     IRL
Sbjct: 167  KFSFSFTHRKGDSGLGMVTPVWEETDEATGDSSTRITLLLHTSEDPEEDERR--RDTIRL 224

Query: 1346 PFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL-------------- 1391
                              F DL  ++LLFL +L+ +      ND +              
Sbjct: 225  -----------------QFQDLQHTILLFLQKLRKVQVSFYDNDDVQTATTVYSLHGSNP 267

Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKT-----------TEIALAL 1440
              IRK+   DG+      E++   +  ++ +   + R + +T           TE+ LA 
Sbjct: 268  ATIRKE-TSDGV------EERQ--YHVTKHVAENIPRSENRTYSDERDRADSSTEVVLAF 318

Query: 1441 TLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSE 1500
             L ++      P++  + VFAFLP+      F++  DFV  +SR+ +   S  N+ LL  
Sbjct: 319  PLTDTGT----PIVENKDVFAFLPI------FLIHTDFVTEASRQGIVVTSLRNRALLKG 368

Query: 1501 FPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV--HGFFSGLPRMILSKLRMSN 1558
                 V A   FC  P  +         +M ++P         F+S L   I  ++    
Sbjct: 369  IADCLVKAIEEFCRHPTLQYQ-------WMRWLPQRDSYPWDSFWSSLLDRIKDRIHQVK 421

Query: 1559 CLILEGNNN-QWAPPCKVLRGW--NDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARAL 1615
             L   G+    +    + L+ W  +++   L  DI  + +L   +   D+     L +  
Sbjct: 422  VLRTLGSRQLGYMYQLRNLQSWLRDEKGDPLFTDINNEIYLAKEYAKDDL----DLLKPY 477

Query: 1616 GIEEYGPKILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDL 1675
            G++   P      +  + R EN L+    + +      + +   HSS   +L   ++   
Sbjct: 478  GLQGASP------MEVILRIENDLQKPPEASIMK--RSMISNDGHSSAARALMLLLKNIT 529

Query: 1676 IDN-------LQRIPFIPLSDGTFSSVDEGTIWLHSDCSVFD 1710
            ++N       L+++ FIPL +G + S     ++        D
Sbjct: 530  LNNPGPIQARLEQLRFIPLDNGAWVSASRRPLYYSHSSGQLD 571


>gi|325093511|gb|EGC46821.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 1631

 Score =  128 bits (322), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 119/422 (28%), Positives = 193/422 (45%), Gaps = 46/422 (10%)

Query: 1156 EDAALIVESIRRDEFGLGPNIS-NMESNMLK---KQHARLGRALHCLSQELYSQDSHFLL 1211
            E A  IVE IR +  GL   +  ++ +  L+      +  G ++  ++++LY  D+ F+ 
Sbjct: 63   EKAREIVEEIRINNGGLREEVQRSLHAEALRSIRNLQSLAGASIRHVAEDLYDADTRFIF 122

Query: 1212 ELVQNADDNIYP----ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            ELVQNA+D+ Y     +  EP + F L    I V  NE GF+  ++RA+C + NS+K+  
Sbjct: 123  ELVQNAEDSRYSHAKSQEEEPFIYFTLYPDRITVDTNEDGFTENDVRAICSIHNSSKR-Q 181

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
              GYIG KGIGFKSVF+V     I S  F   F   +G+ G     ++ PF         
Sbjct: 182  IGGYIGHKGIGFKSVFKVAHKVHIQSGPFCFSFTHRKGESGM---GMITPF--------- 229

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
            S++P  L     N   R        E   +        ++  +LLLFL RL  I  +   
Sbjct: 230  SREPEDLPP---NVKTRFTLYFIHPEDFQIRE--GELKEIPDTLLLFLRRLCKIGVKIEP 284

Query: 1388 NDSLVVIRKKIVG-DGIIKV--SCGE--DKMTWFVASQKLRAGVIRPDVK---TTEIALA 1439
            +   ++ +++  G +G I +    G+   K  + V   +        D     T E+ LA
Sbjct: 285  SGFRLLFKREQTGPNGRITLVKEAGDVVSKRIYHVEGAEFENLPEHGDQSAQTTAEVILA 344

Query: 1440 LTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
              + +S+     P++  Q V++FLP+R  G KF++Q DF+  ++R+ V    P N  +  
Sbjct: 345  FPVDDSHR----PIIEPQHVYSFLPMRQEGFKFLIQSDFITTANRQGVH-LCPRNYAIRE 399

Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNC 1559
                +FV A  +FC         A     ++ ++P       F++ L  MIL  L  S  
Sbjct: 400  RISLVFVQAVYNFC-------KNATLKYEWLQYLPGPSIPDPFWATLRDMILESLSESKI 452

Query: 1560 LI 1561
            L+
Sbjct: 453  LL 454


>gi|240277430|gb|EER40938.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
          Length = 1609

 Score =  128 bits (321), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 119/422 (28%), Positives = 193/422 (45%), Gaps = 46/422 (10%)

Query: 1156 EDAALIVESIRRDEFGLGPNIS-NMESNMLK---KQHARLGRALHCLSQELYSQDSHFLL 1211
            E A  IVE IR +  GL   +  ++ +  L+      +  G ++  ++++LY  D+ F+ 
Sbjct: 41   EKAREIVEEIRINNGGLREEVQRSLHAEALRSIRNLQSLAGASIRHVAEDLYDADTRFIF 100

Query: 1212 ELVQNADDNIYP----ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            ELVQNA+D+ Y     +  EP + F L    I V  NE GF+  ++RA+C + NS+K+  
Sbjct: 101  ELVQNAEDSRYSHAKSQEEEPFIYFTLYPDRITVDTNEDGFTENDVRAICSIHNSSKR-Q 159

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
              GYIG KGIGFKSVF+V     I S  F   F   +G+ G     ++ PF         
Sbjct: 160  IGGYIGHKGIGFKSVFKVAHKVHIQSGPFCFSFTHRKGESGM---GMITPF--------- 207

Query: 1328 SKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
            S++P  L     N   R        E   +        ++  +LLLFL RL  I  +   
Sbjct: 208  SREPEDLPP---NVKTRFTLYFIHPEDFQIRE--GELKEIPDTLLLFLRRLCKIGVKIEP 262

Query: 1388 NDSLVVIRKKIVG-DGIIKV--SCGE--DKMTWFVASQKLRAGVIRPDVK---TTEIALA 1439
            +   ++ +++  G +G I +    G+   K  + V   +        D     T E+ LA
Sbjct: 263  SGFRLLFKREQTGPNGRITLVKEAGDVVSKRIYHVEGAEFENLPEHGDQSAQTTAEVILA 322

Query: 1440 LTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
              + +S+     P++  Q V++FLP+R  G KF++Q DF+  ++R+ V    P N  +  
Sbjct: 323  FPVDDSHR----PIIEPQHVYSFLPMRQEGFKFLIQSDFITTANRQGVH-LCPRNYAIRE 377

Query: 1500 EFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNC 1559
                +FV A  +FC         A     ++ ++P       F++ L  MIL  L  S  
Sbjct: 378  RISLVFVQAVYNFC-------KNATLKYEWLQYLPGPSIPDPFWATLRDMILESLSESKI 430

Query: 1560 LI 1561
            L+
Sbjct: 431  LL 432


>gi|77551082|gb|ABA93879.1| expressed protein [Oryza sativa Japonica Group]
          Length = 1707

 Score =  127 bits (320), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 168/348 (48%), Gaps = 57/348 (16%)

Query: 1214 VQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTK 1264
            V NA+DN YP  V P+L F++  + I         ++ NNE+GFS  NI ++  VG STK
Sbjct: 62   VPNAEDNEYPSGVAPSLDFLVTSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTK 121

Query: 1265 KGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNI 1320
            KG+   GYIG KGIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V     
Sbjct: 122  KGNRDKGYIGEKGIGFKSVFLISSQPHIFSNGYQIKFNGKPCAECGIGYIVPEWVESRPS 181

Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
                R +      L +    T I LP +++      ++ +    S +HP +LLFL +++ 
Sbjct: 182  LSDIRTIYGSSKVLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQ 232

Query: 1381 IMFR--------------NMLNDSLVVIRKKIVGDGI------IKVSCGEDKMTWFVASQ 1420
            +  +              ++ ++     RK +  +        ++   GE++  +++  Q
Sbjct: 233  LSVKEENVNHKCSPVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQ 292

Query: 1421 KLRAGVIRPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
            K     ++P+       +  E  + L        + G  +    V+AFLP        FI
Sbjct: 293  KFP---VKPENRVDKRAEIDEWVITLAFPYGQRLSRGKQM-SPGVYAFLPTEMVTNFPFI 348

Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----ERSFCDLPC 1517
            +Q DF+L SSRE +  +SPWN+ +L   P+ F++A     +S  D+P 
Sbjct: 349  IQADFLLASSREAILFDSPWNKGILECVPSAFLNAFVALVKSGADVPA 396


>gi|222616035|gb|EEE52167.1| hypothetical protein OsJ_34022 [Oryza sativa Japonica Group]
          Length = 1690

 Score =  127 bits (320), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 168/348 (48%), Gaps = 57/348 (16%)

Query: 1214 VQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTK 1264
            V NA+DN YP  V P+L F++  + I         ++ NNE+GFS  NI ++  VG STK
Sbjct: 45   VPNAEDNEYPSGVAPSLDFLVTSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTK 104

Query: 1265 KGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNI 1320
            KG+   GYIG KGIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V     
Sbjct: 105  KGNRDKGYIGEKGIGFKSVFLISSQPHIFSNGYQIKFNGKPCAECGIGYIVPEWVESRPS 164

Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
                R +      L +    T I LP +++      ++ +    S +HP +LLFL +++ 
Sbjct: 165  LSDIRTIYGSSKVLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQ 215

Query: 1381 IMFR--------------NMLNDSLVVIRKKIVGDGI------IKVSCGEDKMTWFVASQ 1420
            +  +              ++ ++     RK +  +        ++   GE++  +++  Q
Sbjct: 216  LSVKEENVNHKCSPVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQ 275

Query: 1421 KLRAGVIRPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
            K     ++P+       +  E  + L        + G  +    V+AFLP        FI
Sbjct: 276  KFP---VKPENRVDKRAEIDEWVITLAFPYGQRLSRGKQM-SPGVYAFLPTEMVTNFPFI 331

Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----ERSFCDLPC 1517
            +Q DF+L SSRE +  +SPWN+ +L   P+ F++A     +S  D+P 
Sbjct: 332  IQADFLLASSREAILFDSPWNKGILECVPSAFLNAFVALVKSGADVPA 379


>gi|156082529|ref|XP_001608749.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154795998|gb|EDO05181.1| conserved hypothetical protein [Babesia bovis]
          Length = 2581

 Score =  127 bits (320), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 69/178 (38%), Positives = 109/178 (61%), Gaps = 6/178 (3%)

Query: 1143 DCTTQTLPEDKECEDAALIVESIRRDEFGLG--PNISNMESNMLKKQHARLGRALHCLSQ 1200
            D ++ T     + +    ++++IR   FG G   +++   S ++  Q  +L R++  LS+
Sbjct: 673  DTSSNTAVSSMQLDQGKALIDAIRLYTFGYGILEHLNPEASAIVNNQRKQLERSVRRLSE 732

Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVG 1260
            +LYS   H  LEL+QNADDN Y   + P + F+L  SGIVV+NNE GF+  +++ALCD+G
Sbjct: 733  DLYSSRCHLHLELIQNADDNKYNCPL-PEIGFVLDRSGIVVINNECGFTDSDVKALCDIG 791

Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLV 1315
            +S+K  SS   IG+ G+GFKSVF VT  P + SNGFH  F ++   +  + +++P  V
Sbjct: 792  SSSKTNSSRDLIGQFGLGFKSVFLVTHTPCVFSNGFHFNFSSNPNLQNNVMYLMPHWV 849


>gi|115485639|ref|NP_001067963.1| Os11g0514000 [Oryza sativa Japonica Group]
 gi|113645185|dbj|BAF28326.1| Os11g0514000 [Oryza sativa Japonica Group]
          Length = 1757

 Score =  127 bits (320), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 168/348 (48%), Gaps = 57/348 (16%)

Query: 1214 VQNADDNIYPENVEPTLTFILQESGI---------VVLNNEQGFSAENIRALCDVGNSTK 1264
            V NA+DN YP  V P+L F++  + I         ++ NNE+GFS  NI ++  VG STK
Sbjct: 62   VPNAEDNEYPSGVAPSLDFLVTSNDITGSGASATLLIFNNEKGFSPANIESIIRVGKSTK 121

Query: 1265 KGS-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVPPFNI 1320
            KG+   GYIG KGIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V     
Sbjct: 122  KGNRDKGYIGEKGIGFKSVFLISSQPHIFSNGYQIKFNGKPCAECGIGYIVPEWVESRPS 181

Query: 1321 DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
                R +      L +    T I LP +++      ++ +    S +HP +LLFL +++ 
Sbjct: 182  LSDIRTIYGSSKVLPT----TTIILPLKSE-----KVDAVKKQLSSMHPEMLLFLSKIRQ 232

Query: 1381 IMFR--------------NMLNDSLVVIRKKIVGDGI------IKVSCGEDKMTWFVASQ 1420
            +  +              ++ ++     RK +  +        ++   GE++  +++  Q
Sbjct: 233  LSVKEENVNHKCSPVSEISISSEKNFQERKNMHAESYTLHLSALENGKGEEECGYYMWRQ 292

Query: 1421 KLRAGVIRPD------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
            K     ++P+       +  E  + L        + G  +    V+AFLP        FI
Sbjct: 293  KFP---VKPENRVDKRAEIDEWVITLAFPYGQRLSRGKQM-SPGVYAFLPTEMVTNFPFI 348

Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA----ERSFCDLPC 1517
            +Q DF+L SSRE +  +SPWN+ +L   P+ F++A     +S  D+P 
Sbjct: 349  IQADFLLASSREAILFDSPWNKGILECVPSAFLNAFVALVKSGADVPA 396


>gi|147783923|emb|CAN74688.1| hypothetical protein VITISV_001348 [Vitis vinifera]
          Length = 288

 Score =  127 bits (319), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 84/170 (49%), Positives = 98/170 (57%), Gaps = 29/170 (17%)

Query: 37  NFAFQNPNIFFPNPALQLLQNLTNIPF-QNPNFDIQNPNLFMQNANLLTQNPNLP----- 90
           N+  QN N +F NP  Q   NL  +P+ QNP    QNP L MQN NL  +NPNLP     
Sbjct: 35  NYFIQNVNPYFQNPTFQA--NL-GLPYLQNPTLPTQNPTLPMQNTNLPLKNPNLPMQNXS 91

Query: 91  -PQQPPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFLERIDRAVVKAR 149
            P Q PS A  Q  + T                     PQP K NKE L+R+D AV KAR
Sbjct: 92  FPLQNPSFAI-QNTNFT------------------GFRPQPPKRNKEALDRVDGAVXKAR 132

Query: 150 SELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEAR 199
            ++I  GESVS+WKVS+S LL LQVDSW SLGF MQEVPSLH L+V E +
Sbjct: 133 RDVIAXGESVSAWKVSQSALLALQVDSWESLGFPMQEVPSLHSLIVIEGK 182


>gi|302825725|ref|XP_002994453.1| hypothetical protein SELMODRAFT_432374 [Selaginella moellendorffii]
 gi|300137602|gb|EFJ04480.1| hypothetical protein SELMODRAFT_432374 [Selaginella moellendorffii]
          Length = 1445

 Score =  126 bits (316), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 125/444 (28%), Positives = 195/444 (43%), Gaps = 91/444 (20%)

Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
            + +E +RR+ FG+G +  N  + ML++        +  LS ELY +D HFL ELVQ    
Sbjct: 6    MFIERLRRELFGIGQSGPNPLAPMLQE-------TIRLLSAELYQKDIHFLSELVQET-- 56

Query: 1220 NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-SAGYIGRKGIG 1278
                                            NI +LC  G STKKG  S GYIG KGIG
Sbjct: 57   --------------------------------NIESLCGSGQSTKKGKKSQGYIGEKGIG 84

Query: 1279 FKSVFRVTDAPEIHSNGFHVKFDT---SEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLE 1335
            FKSVF VT  P I SNGF ++FD    +E ++G+++P  V      +  ++ SK      
Sbjct: 85   FKSVFLVTKQPYIISNGFRIRFDEEPHNEAKLGYIVPEWVDRPTDAVLQKVCSK------ 138

Query: 1336 SKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI-MFRNMLNDSLVVI 1394
            ++   T I LP R +      ++ +    + +    LLFL +++ + ++         ++
Sbjct: 139  AELPATVIILPLRQE-----KISEVAKQLTQIPAETLLFLSKIRMLSIYDQQSRPPTELV 193

Query: 1395 RKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVI---------------------RPDVKT 1433
              + V     K    E   T+  AS   R G                       R DV +
Sbjct: 194  LSRSVEMTTSKNESVETSTTFLTASNG-RNGTQHSNYYMFKQTLKVPEEAKVENRKDVDS 252

Query: 1434 TEIALALTLQE--SNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPSSREEVDGN 1490
              I LA  + +  S+  + G       +F+FLP   + GL F++  DF+L SSRE +  +
Sbjct: 253  WTITLAFPVDDRISSSCSIG------DIFSFLPSDIHSGLPFLINSDFLLVSSRETLLFD 306

Query: 1491 SPWNQWLLSEFPALFVSAERSFCDLPCF--RENPAKAASVYMSFVPLVGEVHGFFSGLPR 1548
            SPWNQ +    P +F +A     +   F  +  P+ A      +VP+    + +   +  
Sbjct: 307  SPWNQGIFRCVPEVFYNAFELLLNTTAFGLQSIPSVARMAMYKYVPVNVCRNSYLERVRN 366

Query: 1549 MILSKLRMSNCLILEGNNNQWAPP 1572
             ILSKL+ +N +IL  ++   APP
Sbjct: 367  DILSKLKDNN-VILCASDVGRAPP 389


>gi|398410674|ref|XP_003856685.1| hypothetical protein MYCGRDRAFT_34426 [Zymoseptoria tritici IPO323]
 gi|339476570|gb|EGP91661.1| hypothetical protein MYCGRDRAFT_34426 [Zymoseptoria tritici IPO323]
          Length = 1453

 Score =  126 bits (316), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 161/366 (43%), Gaps = 46/366 (12%)

Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE----NVEPTLTFIL 1234
            +E ++L K     G A+  L+Q +Y  D+ F+ EL+QNADDN + +       P ++F +
Sbjct: 37   VEESLLAKDKI-AGHAIITLAQNIYGSDARFVFELLQNADDNCFEKATAAGAAPFVSFHV 95

Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                IVV  NE GF+ +N+ A+C VG S+K  ++ GYIG KGIGFKSVF       + S 
Sbjct: 96   YSDRIVVECNEDGFTLQNLTAICAVGQSSK-STTHGYIGAKGIGFKSVFIAAWKVHVQSG 154

Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEG 1354
             F   F   +G  G           + M   +  +    L S      + L   TK +E 
Sbjct: 155  HFSFDFKHKKGDSG-----------LGMVVPIWHETSETLPSPMTRMTLYL-HETKGAEH 202

Query: 1355 I--AMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDK 1412
            +      + +  S L  + LLFL +L+ I      N S            + +   G D+
Sbjct: 203  LNDIRKTVFEQLSSLQETSLLFLRKLKKI------NVSFYDANDHQASSKLFQCDKGNDQ 256

Query: 1413 MTWFVASQKLRAGVIRPDVK----TTEIALALTLQESNEG--------------NYGPLL 1454
                       +G  + D K    TT +A  L+  ES                 +  PLL
Sbjct: 257  NFRLEIVHTTHSGATKKDTKRFHVTTHLATGLSASESRGAAAEAEVILAFPLTHDLKPLL 316

Query: 1455 YQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
             +Q +FAFLP+R    +FI+  DF   +SR+++   S  N+ LL+     F +A   FC 
Sbjct: 317  EKQELFAFLPVRKSEFEFIIHSDFDTTASRQDIMTTSRRNRDLLNAIADAFKTAVTEFCG 376

Query: 1515 LP--CF 1518
             P  C+
Sbjct: 377  HPELCY 382


>gi|70986517|ref|XP_748750.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
 gi|66846380|gb|EAL86712.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
          Length = 1452

 Score =  125 bits (314), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 118/410 (28%), Positives = 175/410 (42%), Gaps = 78/410 (19%)

Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFILQESGI 1239
            L+ +  ++G ++  L++ LY+ ++ F+ EL+QNADDN +         P ++F +    I
Sbjct: 44   LRAKDEKIGHSIKTLAKNLYNSNARFVFELLQNADDNRFMLARKHKELPFISFKVYPDRI 103

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
            VV  NE GFS E++ A+C VG STK  +S GYIG KGIGFKSVF       I S  F   
Sbjct: 104  VVECNEDGFSIEDLSAICSVGESTK-AASHGYIGAKGIGFKSVFIAAWKVHIQSGNFSFH 162

Query: 1300 FDTSEGQIGF--VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA- 1356
            F    G +G   VLP    P                 E +C +   R+      +EG   
Sbjct: 163  FKHRRGDLGLGMVLPVWEDP-----------------EYECPDRLTRMTLYLH-TEGEPH 204

Query: 1357 -----MNNIVDMFSDLHPSLLLFLHRLQCI---------------MFRNMLNDSLVVIRK 1396
                    I    SDL  + LLFL  ++ I                FR        V   
Sbjct: 205  ALEHLRKTIFQQLSDLQQTSLLFLRNIKQIGVFFDDGNGGMRRSKTFRVRDAPGHSVFLD 264

Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKT-------------TEIALALTLQ 1443
              V DG    +  E +   +  +++    +   D +               E+ LA  L 
Sbjct: 265  ATVVDGDENSTTVEQR---YHVTRRTATELPTSDNRNLLSTLETNTAFSEAEVVLAFPLT 321

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
              +     PLL +Q +FAFLP+R    KF++Q DF   ++R+++   S  N  LL    +
Sbjct: 322  VDSR----PLLEKQEIFAFLPVRESRFKFLIQSDFDTSANRQDISTTSRRNLCLLPHIAS 377

Query: 1504 LFVSAERSFCDLP--CFRENPAKAASVYMSFVPLVGEVHG-FFSGLPRMI 1550
             FV A   FC+ P  C+          + SF+P + +  G F+S L R I
Sbjct: 378  AFVMAVLEFCEDPELCY---------AWPSFLPSLDDDMGDFWSPLARSI 418


>gi|159123450|gb|EDP48569.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
          Length = 1590

 Score =  125 bits (314), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 118/410 (28%), Positives = 175/410 (42%), Gaps = 78/410 (19%)

Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFILQESGI 1239
            L+ +  ++G ++  L++ LY+ ++ F+ EL+QNADDN +         P ++F +    I
Sbjct: 44   LRAKDEKIGHSIKTLAKNLYNSNARFVFELLQNADDNRFTLARKHKELPFISFKVYPDRI 103

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
            VV  NE GFS E++ A+C VG STK  +S GYIG KGIGFKSVF       I S  F   
Sbjct: 104  VVECNEDGFSIEDLSAICSVGESTK-AASHGYIGAKGIGFKSVFIAAWKVHIQSGNFSFH 162

Query: 1300 FDTSEGQIGF--VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA- 1356
            F    G +G   VLP    P                 E +C +   R+      +EG   
Sbjct: 163  FKHRRGDLGLGMVLPVWEDP-----------------EYECPDRLTRMTLYLH-TEGEPH 204

Query: 1357 -----MNNIVDMFSDLHPSLLLFLHRLQCI---------------MFRNMLNDSLVVIRK 1396
                    I    SDL  + LLFL  ++ I                FR        V   
Sbjct: 205  ALEHLRKTIFQQLSDLQQTSLLFLRNIKQIGVFFYDGNGGMRRSKTFRVRDAPGHSVFLD 264

Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKT-------------TEIALALTLQ 1443
              V DG    +  E +   +  +++    +   D +               E+ LA  L 
Sbjct: 265  ATVVDGDENSTTVEQR---YHVTRRTATELPTSDNRNLLSTLETNTAFSEAEVVLAFPLT 321

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
              +     PLL +Q +FAFLP+R    KF++Q DF   ++R+++   S  N  LL    +
Sbjct: 322  VDSR----PLLEKQEIFAFLPVRESRFKFLIQSDFDTSANRQDISTTSRRNLCLLPHIAS 377

Query: 1504 LFVSAERSFCDLP--CFRENPAKAASVYMSFVPLVGEVHG-FFSGLPRMI 1550
             FV A   FC+ P  C+          + SF+P + +  G F+S L R I
Sbjct: 378  AFVMAVLEFCEDPELCY---------AWPSFLPSLDDDMGDFWSPLARSI 418


>gi|407922036|gb|EKG15164.1| hypothetical protein MPH_07610 [Macrophomina phaseolina MS6]
          Length = 379

 Score =  125 bits (314), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 109/339 (32%), Positives = 158/339 (46%), Gaps = 44/339 (12%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESN-------MLKKQHARLGRALHCLSQELYSQDSH 1208
            E+A  ++  +R++  G+ P    +           L+    +LG A   L+  LYS+D+ 
Sbjct: 12   EEAQRLIAQVRKENGGISPEHEALLKENCPDVLEALQNVRRKLGSATKTLATNLYSKDTR 71

Query: 1209 FLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
            FL EL+QNA+DN Y     P L FIL    IV+ +NE GF++ N +A+C  G STK    
Sbjct: 72   FLYELIQNAEDNQYCVWTTPFLNFILYPDKIVIDSNENGFTSLNFKAICSTGESTKTNVQ 131

Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF----DTSEGQIGFVLPTLVPPFNIDMFC 1324
             GY+G KGIGFKSVF+V     I S  F   F    D+ +  +G V P L  P+  D+  
Sbjct: 132  -GYVGEKGIGFKSVFKVAKKVHIQSGVFSFSFVYTRDSDDDGLGMVTP-LSEPYE-DL-- 186

Query: 1325 RLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR 1384
                  P  + ++   T +R        E   M    D+ S+L  +LLLFL +++ I  R
Sbjct: 187  ------PANVRTRMTLTLLR-------PEDF-MQRAADL-SNLPDTLLLFLSKIRSINVR 231

Query: 1385 --NMLNDSLVVIRKKIVG----DGIIKVSCGEDKMT------WFVASQKLRAGVIRPDVK 1432
              +  N    V  K + G    + I+K     D  T      + V  + +R     P  K
Sbjct: 232  ICSSANTITTVHYKHVRGSDNLEKILKSIKTYDGTTSIQTKLFLVTRRVIRNLPFDPARK 291

Query: 1433 TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
             T  A  + L    + N  P+L  Q VFA+LPLR  G K
Sbjct: 292  HTNQA-TIILAFPVDVNDTPILENQHVFAYLPLRRVGFK 329


>gi|302917474|ref|XP_003052445.1| hypothetical protein NECHADRAFT_77414 [Nectria haematococca mpVI
            77-13-4]
 gi|256733385|gb|EEU46732.1| hypothetical protein NECHADRAFT_77414 [Nectria haematococca mpVI
            77-13-4]
          Length = 1747

 Score =  125 bits (313), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 139/563 (24%), Positives = 240/563 (42%), Gaps = 112/563 (19%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDN--IYPENVEPTLTFILQESGIVVLNNE 1245
             ++L  A+  L+Q LYS  + F+ EL+QNA+DN  ++ E   P ++F L +  +V+  NE
Sbjct: 52   QSKLEPAIKALAQSLYSSTARFVFELLQNAEDNSFVHAEG-RPYVSFHLSKDRLVIECNE 110

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN--GFHVKFDTS 1303
             GF+  N+ A+C +G S+K  ++ GYIG KGIGFKSVF       I S    F+ K   S
Sbjct: 111  DGFTTANLEAICSIGKSSKL-ATKGYIGEKGIGFKSVFMAAWKVHIQSGPYSFYFKHLPS 169

Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDM 1363
            +  +G + P    P   +   R +++  + L ++     I L  R         ++I   
Sbjct: 170  DSGMGMITPVWQEP--TEELPRYMTRMTLDLHTEGDPQSI-LAQR---------HSIRQQ 217

Query: 1364 FSDLHPSLLLFLHRLQ--------------CIMFRNMLNDSLVVIRK------------- 1396
               L+ ++LLF+ +L+               +  ++  +DS V I K             
Sbjct: 218  LRKLNGNILLFMRKLEEIRIIIDENEAETSTVFTKSETDDSKVKILKTVTQDDGDSSESS 277

Query: 1397 ----KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGP 1452
                 I    +  +S  E++ T+      L+      +  T E+ LA  L       Y P
Sbjct: 278  STLYHITKHKVYNLSKNENR-TYSEEEDLLK------EYSTAEVVLAFPLTP----EYEP 326

Query: 1453 LLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
            ++  Q VFAFLP++T G  F++Q DF+  +SRE +   +  N  L       F+ A   F
Sbjct: 327  VIESQEVFAFLPVQTAGFSFLIQSDFMTNASREHIVTTAARNIGLRDGICMAFIQAALEF 386

Query: 1513 CDLPCFREN-----PAKAASVYMSF-VPLVGEVHGFF---------SGLPRMILSKLRMS 1557
            C+    +       P K   V+  F   LV ++             SG P  +++ LR  
Sbjct: 387  CNHETLQYTWMQFLPDKENKVHSDFWSALVTDIEAKVRETPLIRPDSGGPLRLITSLRNP 446

Query: 1558 NCLILEGNNN----QWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLAR 1613
              L ++  NN       P   + R +   + ++L ++ L     +GF  +DI+       
Sbjct: 447  RPLFVDKENNLLLRDLTPELSISRRYGSSSLAILENLGL-----VGFYRQDII------- 494

Query: 1614 ALGIEEYGPKILLQIISSLCRTENGLRSMGL-SWLASWLNELYTISFHSSGQSSLQSGVE 1672
                          I   L   ++ L+S    ++L S++     I++  +   + +S +E
Sbjct: 495  ------------RMIDQDLQSPDSWLKSRAADAFLQSFVACFLEIAYKDTHNRAARSMIE 542

Query: 1673 TDLIDNLQRIPFIPLSDGTFSSV 1695
                    ++P IPL DG + +V
Sbjct: 543  --------KLPIIPLQDGRWLAV 557


>gi|302413305|ref|XP_003004485.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261357061|gb|EEY19489.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 1583

 Score =  124 bits (312), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 163/375 (43%), Gaps = 58/375 (15%)

Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE----NVEPTLTFI 1233
             +E +ML K       ++  L++ +Y  D+ F+ EL+QNADDN +       V P+++F 
Sbjct: 39   TIEESMLAKDKL-AAHSIKTLAKNIYGSDARFVFELLQNADDNRFTRARANGVLPSISFH 97

Query: 1234 LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
            +    IVV  NE GF+ +++RA+C VG ST K +S GYIG KGIGFKSVF       I S
Sbjct: 98   VHPHRIVVECNEDGFTEKDLRAICSVGEST-KSASRGYIGAKGIGFKSVFIAAWKVAIQS 156

Query: 1294 NGFHVKFDTSEGQIGF--VLP------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRL 1345
              F   F   +G +G   VLP        +P     M   L  +   Q       T  R 
Sbjct: 157  GHFSFYFKHEKGDLGLGMVLPVWENQAAALPGPLTRMTLHLHEQGEAQELEHLRQTIFR- 215

Query: 1346 PFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI-VGD--- 1401
                               S+L  + LLFL  L+ I       D  +   K   VGD   
Sbjct: 216  -----------------QLSELQHTSLLFLRNLRQISVAFYDEDGGLKSSKNFRVGDERD 258

Query: 1402 -------GIIKVSCGEDKMT-----WFVASQKLRAGVIRPDVKTT------EIALALTLQ 1443
                      + + G    T      +  ++ +   + + D + +      E+ LA  L 
Sbjct: 259  HTVLLETTSTRSASGNGSQTSTENKQYHVTRHMATNLSKSDNRDSANNTEAEVVLAFPLT 318

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPA 1503
              ++    PL+ QQ +FAFLP+R    KF++  DF   +SR+++   S  N+ L+    A
Sbjct: 319  SDSQ----PLIEQQEIFAFLPIRESNFKFLIHSDFDTSASRQDIVSTSLRNKSLIDGIAA 374

Query: 1504 LFVSAERSFCDLPCF 1518
             F+ A   FC+ P  
Sbjct: 375  AFIKAVLQFCEHPSL 389


>gi|46127439|ref|XP_388273.1| hypothetical protein FG08097.1 [Gibberella zeae PH-1]
          Length = 2219

 Score =  124 bits (311), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 97/332 (29%), Positives = 157/332 (47%), Gaps = 45/332 (13%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALC 1257
            LS++ YS  + FL EL+QNADDN+Y + VEP +   L+   + + +NE GF+  N+ ++C
Sbjct: 124  LSEDQYSDKTRFLFELLQNADDNLYAD-VEPLVKIHLKNQYLTIQSNEVGFTKANVDSIC 182

Query: 1258 DVGNSTKKGSSA-----GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
             +  S+K    A       IG KGIGFK+VF V     I S  +  +FD  +G +G V P
Sbjct: 183  RMTVSSKSNKDADVDKENQIGEKGIGFKAVFLVATKVWIKSGHYSFRFD-RDGPLGRVYP 241

Query: 1313 TLVPPFNIDMFCRLLSKDPVQLE-SKCWNTCIRLPFRTKFSEGIAMNN------IVDMFS 1365
              V  F          K  V +E ++  +  + L   +  ++    +N      IV++  
Sbjct: 242  NWVDHF---------PKGLVGIEDTRMTSIVLELAGTSGITDNFLSSNPDSGVDIVNLLR 292

Query: 1366 DLHPSLLLFLHRLQCI---------MFRNMLNDSLVVIRKKIVGD--------GIIKVSC 1408
            +L P  LLFL +++ I               N    ++R K + +             + 
Sbjct: 293  NLSPDQLLFLSKVRRIDIEIDRKNGTITGRANKYNAILRAKPIPELRQTLHVMDTFNGNA 352

Query: 1409 GEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTY 1468
                +T+      L   + RP+ + +EI LA  +  S E ++        V+AFLP++ Y
Sbjct: 353  WASYVTYRHPVSNLPKEINRPNRRNSEIILAFRVPVSEENSFS----ANNVYAFLPIKDY 408

Query: 1469 GLKFILQGDFVLPSSREEVDGNSPWNQWLLSE 1500
            G KF++  DF+L S+RE++  N  WN  LL E
Sbjct: 409  GFKFLIHSDFILCSNREDIQTNQ-WNDRLLEE 439


>gi|310791111|gb|EFQ26640.1| hypothetical protein GLRG_02460 [Glomerella graminicola M1.001]
          Length = 483

 Score =  124 bits (310), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 169/358 (47%), Gaps = 47/358 (13%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE----PTLTFIL-------- 1234
            +H  + R+L  LSQ+LYS ++ F+ E++QNADDN Y ++      P+ +  L        
Sbjct: 33   KHGIIHRSLQILSQQLYSTETRFIHEVIQNADDNAYGKSTAHGSLPSFSLRLKSKDTSND 92

Query: 1235 -QESGIVVLNNEQGFSAENIRALCDVGNSTK---KGSSAGYIGRKGIGFKSVFRVTDAPE 1290
             ++  +    NE GFS   I AL D+G STK   K    G+ G KG+GFK+VFRV DA  
Sbjct: 93   TEKYTLETSCNEDGFSIAQIAALTDIGASTKTKCKDVHGGFTGEKGVGFKAVFRVADAVY 152

Query: 1291 IHSNGFHVKFDTSEGQIGFVLP---TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
            I S  ++ K D S+  IG + P   +   P     + R++    ++L+S+     I    
Sbjct: 153  ITSGHYNFKLDNSQDMIGALRPIPCSYPAPKGTLNYTRMM----IELKSQGDFESIERDL 208

Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC-IMFRNMLNDSLVVIRKKIVGDGIIKV 1406
                 E +     +   S         +H   C   F    ND     + ++    +   
Sbjct: 209  EKVQPEILLFLRRLRRIS---------VHTATCNTTFTVAHNDDDEEYQGEMKTVTVQST 259

Query: 1407 SCGEDKMTWFVASQKLRAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
            +    K T ++  +++   +     R ++ T+EI LA  L   N+G+   +L  Q VFAF
Sbjct: 260  TEKRRKKTRYIFVRRIVDDMPVEPRRENIVTSEIVLAFPLH--NDGSL--ILEPQHVFAF 315

Query: 1463 LPLRTYGLK-----FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDL 1515
            LP+  YG +     F++QGDFVL +SRE +   + WN+ L    P+ F +A   F ++
Sbjct: 316  LPINQYGFRASPCPFLIQGDFVLTASRESL-APTAWNEKLQEAIPSAFEAAVHRFNNI 372


>gi|320587287|gb|EFW99767.1| hypothetical protein CMQ_85 [Grosmannia clavigera kw1407]
          Length = 1655

 Score =  123 bits (309), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 103/392 (26%), Positives = 168/392 (42%), Gaps = 63/392 (16%)

Query: 1149 LPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSH 1208
            LPE+K      L+   +RRD          +E   LKK    +G ++  L++ LY+  + 
Sbjct: 373  LPEEK----LRLLPADLRRD----------IEEAFLKKD-LMIGSSVITLAKNLYTSKAR 417

Query: 1209 FLLELVQNADDNIY----PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
            F+ EL+QNADDN Y      +V+P ++F +    +V+  NE GF++ N++A+C +G S+K
Sbjct: 418  FIFELLQNADDNSYTHAAASHVDPYVSFRVHSDRVVIECNEDGFNSANLKAICSIGQSSK 477

Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFC 1324
             G+  GYIG KGIGFKSVF       I S  F   F    G  G           + M  
Sbjct: 478  MGAQ-GYIGEKGIGFKSVFMAASKVHIQSGAFSFSFTHRPGDSG-----------MGMIS 525

Query: 1325 RLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN-IVDMFSDLHPSLLLFLHRLQ---- 1379
             +  +   +L+       + L       +   + + I   F +L  + LLF+  L+    
Sbjct: 526  PVWEEPGEELQPHLTRITLFLHGGGDGEKHAQIQDAIATQFQELQETFLLFMRNLRSIRV 585

Query: 1380 ------------CIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKL----- 1422
                        C          + ++++     G  + S    K  + V  Q       
Sbjct: 586  AFFDDGGAKTESCTYLIKRPQTDMAILKRTWATHGSKETSEEISKHFYVVTHQATDLTMN 645

Query: 1423 ------RAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQG 1476
                   A        T+++ LA  L   +     P+L +Q +F FLP+R YG  FI+Q 
Sbjct: 646  ENRTYSEAEKASGAYTTSQVVLAFPLSAEST----PILEEQDLFVFLPVRKYGFNFIIQA 701

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            DFV  ++R+++  +S  N  L++     F  A
Sbjct: 702  DFVTDANRQDIVTDSVRNARLMNSVAEGFCKA 733


>gi|116201367|ref|XP_001226495.1| hypothetical protein CHGG_08568 [Chaetomium globosum CBS 148.51]
 gi|88177086|gb|EAQ84554.1| hypothetical protein CHGG_08568 [Chaetomium globosum CBS 148.51]
          Length = 1525

 Score =  123 bits (309), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 107/399 (26%), Positives = 182/399 (45%), Gaps = 59/399 (14%)

Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQ--HARL------GRALHCLSQELYSQDSH 1208
            +A  +V+SI  D   +   + +  S+  ++Q  HA L      G ++  L+Q LY+ ++ 
Sbjct: 11   EAQRLVKSIADDRRPIPDEVLDRMSDADRRQVEHAILMKDQVNGPSVLTLAQNLYTSNAR 70

Query: 1209 FLLELVQNADDNIYPE----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
            F+LEL+QNA+DN Y +      EP ++F +  S +V+  NE GF+ EN+RA+C VG S+K
Sbjct: 71   FVLELLQNAEDNKYTKAQAFGDEPYVSFSVYPSQLVLECNENGFTEENLRAICSVGQSSK 130

Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ--IGFVLPTLVPPFNIDM 1322
             G+  GYIG KGIGFKSVF       + S  F  +F   +G   +G + PT    F    
Sbjct: 131  IGAR-GYIGEKGIGFKSVFMAAYQVHVQSGHFSFRFQHRQGDSGLGMITPTWKDAFYFSG 189

Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM 1382
                ++ D +Q  +    T      R +F +             LH ++LLF+ +++ I 
Sbjct: 190  PFTRITLDLLQGGTPEEQTGRNQTIRQQFEQ-------------LHDAILLFMKKIKKIK 236

Query: 1383 F------RNMLNDSLVVIRKKIVGDGIIKVSC--------GEDKMTWFVASQKLRAGVIR 1428
                   R    +    I   I     I V           + +  ++   + +  G+ +
Sbjct: 237  VAFYDHERGEPGEPTRTITHSIDRQSPITVCTKHTSQKGHKQTQHRYYYVKRGVATGLSK 296

Query: 1429 PDVKT-------------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQ 1475
             + +               E+ LA  L      +  P+L  Q VF+FLP+R  G KF++ 
Sbjct: 297  VEGRIYSQSEEASGAYSRGEVVLAFPLT----ADSVPILENQWVFSFLPIRQIGFKFLVH 352

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
             DFV+ + R+++   +  N+ L ++    F+ A    C+
Sbjct: 353  ADFVMQADRQDICLLAARNRELAAKITKGFIRAVLELCE 391


>gi|258568150|ref|XP_002584819.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237906265|gb|EEP80666.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 1609

 Score =  123 bits (308), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 173/391 (44%), Gaps = 75/391 (19%)

Query: 1156 EDAALIVESIRRDEFGLGPNISNMESN------MLKKQHARLGRALHCLSQELYSQDSHF 1209
            E+A  I+++IR    G+    S+ E        +L     +LG A+  L+  LYS D+ F
Sbjct: 7    EEARAIIDNIREQNGGI--TQSDREQTPQSVLRVLNNLQRKLGAAIKILASNLYSTDARF 64

Query: 1210 LLELVQNADDNIYPE----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
            + EL+QNA+D  Y      N  P + F +    +++ +NE GFS +++RA+C  G STK 
Sbjct: 65   VFELIQNAEDASYRVIQACNELPFIKFTVCHDRVIIDSNEDGFSVQDVRAICSTGESTKT 124

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ--IGFVLPTLVPPFNIDMF 1323
             +  GYIG KGIGFKSVF+V     I S  F   F+ + G+  +G V P    P ++ + 
Sbjct: 125  -NIQGYIGEKGIGFKSVFKVASKVHIQSGPFSFYFEHNRGEDGLGMVTPISEEPMDLPVD 183

Query: 1324 CR----LLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
             R    L+  DP +     +NTC++                     D+  +L LFL +L+
Sbjct: 184  VRTRITLILSDPSK-----FNTCMK------------------EIQDIPETLTLFLSKLK 220

Query: 1380 CIMFR-NMLNDSLVVIRKKIVGD------GIIKVSCGEDKMTWFVASQKLRAGVIRPDVK 1432
             I      LN+    +      D       + + +    + T++   +++ + + R   +
Sbjct: 221  IISLSIQPLNEDPWEVSYSCAYDTSSSSMTLTRTTNNSSRQTFYRVVKRILSPLPRDPAR 280

Query: 1433 T----TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVD 1488
                  E+ LA  + +                      +  LK +LQ DF+  +SRE++ 
Sbjct: 281  KGRHRAEVVLAFPVSD---------------------HSAELKPVLQSDFITQASREDIH 319

Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDLPCFR 1519
             +SPWN  +L      F  A   FCD P  R
Sbjct: 320  -HSPWNDAILDGVADTFRDAVIQFCDHPSLR 349


>gi|255951655|ref|XP_002566594.1| Pc23g00270 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211904215|emb|CAP79521.1| Pc23g00270 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1376

 Score =  118 bits (296), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 109/382 (28%), Positives = 175/382 (45%), Gaps = 65/382 (17%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALC 1257
            L+  LY++ +HFLLEL+QNADDN Y  +  PT+ F ++   +++  NE GFS++++ A+C
Sbjct: 40   LAGGLYTKPTHFLLELIQNADDNNYATST-PTINFHIESDILLISCNEVGFSSKDVDAIC 98

Query: 1258 DVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
            +VG STK  S  +AGYIG KGIGFKS F++ D   I S  +  KF  S GQ+G + P + 
Sbjct: 99   NVGQSTKSASRKAAGYIGEKGIGFKSGFKLADTIRISSGAYSFKFVKS-GQLGMITP-IW 156

Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFL 1375
              F  D     L++   ++      +C R            + ++      L PS+L+FL
Sbjct: 157  EDFPSDYLQAGLTQFLFEI------SCPR-----------NVQSVKTDLCSLQPSILIFL 199

Query: 1376 HRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTE 1435
             RL+ I   N+  +SL    + I                     Q + A +    VK   
Sbjct: 200  RRLRRI---NLEGESLYKEIQSI---------------------QYVFADLAGAAVKIRG 235

Query: 1436 IALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQ 1495
            I++   + E           Q+     LP      +F++Q DF+L ++RE+VD    WN 
Sbjct: 236  ISVDGIISED-------YFVQRHKVGALPGDPKRPEFLIQADFLLTANREDVDERPLWNL 288

Query: 1496 WLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHG--FFSGLPRMILSK 1553
             LL      F  A   F        N       + S++  +    G   F  L + ++++
Sbjct: 289  SLLDGIVQAFALAVHRF--------NKTILKYSWPSYLQCLSSSMGSSLFDSLRKKMVAR 340

Query: 1554 LRMSNCLILEGNNNQWAPPCKV 1575
            L+     ILE  ++ +  P +V
Sbjct: 341  LKSER--ILESKSSSFRTPAEV 360


>gi|225556893|gb|EEH05180.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 1578

 Score =  118 bits (296), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 114/426 (26%), Positives = 187/426 (43%), Gaps = 54/426 (12%)

Query: 1156 EDAALIVESIRRDEFGLGPNIS-NMESNMLK---KQHARLGRALHCLSQELYSQDSHFLL 1211
            E A  IVE IR +  GL   +  ++ +  L+      +  G ++  ++++LY  D+ F+ 
Sbjct: 80   EKAREIVEEIRINNGGLREEVQRSLHAEALRSIRNLQSLAGASIRHVAEDLYDADTRFIF 139

Query: 1212 ELVQNADDNIYP----ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            ELVQNA+D+ Y     +  EP + F L    I V  NE GF+  ++RA+C +  S+K+  
Sbjct: 140  ELVQNAEDSRYSHAKSQEEEPFIYFTLYPDRITVDTNEDGFTENDVRAICSIHKSSKR-Q 198

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLL 1327
              GYIG KGIGFKSVF+V     I S  F   F   +G+ G     ++ PF         
Sbjct: 199  IGGYIGHKGIGFKSVFKVAHKVHIQSGPFCFSFTHRKGESGM---GMITPF--------- 246

Query: 1328 SKDPVQLESKC---WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFR 1384
            S++P  L       +      P   +  EG  +  I D        L     +++   FR
Sbjct: 247  SQEPEDLPPNVKTRFTLYFIHPEDFQIREG-ELKEIPDTLLLFLRRLCRIGVKIEPSGFR 305

Query: 1385 NML-------NDSLVVIRKK--IVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTE 1435
             +        N  + ++++   +V   I  V   E +       Q  +         T E
Sbjct: 306  LLFKREQTEPNGRITLVKEAGDVVSKRIYHVEGAEFENLPEHGDQSAQT--------TAE 357

Query: 1436 IALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQ 1495
            + LA  + +S+     P++  Q V++FLP+R  G KF++Q DF+  ++R+ V    P N 
Sbjct: 358  VILAFPVDDSHR----PIIEPQHVYSFLPMRQEGFKFLIQSDFITTANRQGVH-LCPRNY 412

Query: 1496 WLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLR 1555
             +      +FV A  +FC         A     ++ ++P       F++ L  MIL  L 
Sbjct: 413  AIRERISLIFVQAVYNFC-------KNATLKYEWLQYLPGPSIPDPFWATLREMILESLS 465

Query: 1556 MSNCLI 1561
             S  L+
Sbjct: 466  ESKILL 471


>gi|408390643|gb|EKJ70034.1| hypothetical protein FPSE_09771 [Fusarium pseudograminearum CS3096]
          Length = 1687

 Score =  117 bits (294), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 103/373 (27%), Positives = 166/373 (44%), Gaps = 69/373 (18%)

Query: 1143 DCTTQTLPEDKECEDAALIVESIRRDEFGLG-----------PNISNMESNMLKKQHARL 1191
            D T    P+ +  +D   +V+ I ++   LG           P +       L ++   +
Sbjct: 4    DGTQTAEPDKRTAQD---LVKEIAKERGYLGEEQLARIGEINPELRREVEEALLRKEEMI 60

Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENV----EPTLTFILQESGIVVLNNEQG 1247
            G A+  L++ +Y+ ++ F+ EL+QNADDN Y   V    +P ++F +    + +  NE G
Sbjct: 61   GSAVLTLARNIYTSNARFVFELLQNADDNDYSTAVSQGQDPYVSFKVHPDKVTIECNESG 120

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ- 1306
            F+ EN++A+C +G S+K G+ AGYIG KGIGFKSVF       I S+ F   F   +G  
Sbjct: 121  FTHENLKAICAIGKSSKVGA-AGYIGEKGIGFKSVFMAAWKVHIQSHKFSFSFTHRKGDS 179

Query: 1307 -IGFVLP-------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMN 1358
             +G V P       T V         R  S+DP + E +     IRL             
Sbjct: 180  GLGMVTPVWEETDETTVDSSTRITLLRHTSEDPEEDERR--RDAIRL------------- 224

Query: 1359 NIVDMFSDLHPSLLLFLHRLQCIMF----RNMLNDSLVVIRKKIVGDGIIK--VSCGEDK 1412
                 F DL  ++LLFL +L+ +      ++ +  +  V          I+   S G D+
Sbjct: 225  ----QFQDLQHTILLFLRKLRKVQVSFYDKDDVQTATTVYSLHGSNPATIRKETSDGVDE 280

Query: 1413 MTWFVASQKLRAGVIRPDVKT-----------TEIALALTLQESNEGNYGPLLYQQPVFA 1461
              + V ++ +   + R + +T           TE+ LA  L ++      P++  Q VFA
Sbjct: 281  RQYHV-TKHVAENIPRSENRTYSDERDRADSSTEVVLAFPLTDTGT----PIVENQDVFA 335

Query: 1462 FLPLRTYGLKFIL 1474
            FLP+R    K  L
Sbjct: 336  FLPMRPMSFKITL 348


>gi|221501417|gb|EEE27194.1| SWIM zinc finger protein domain-containing protein, putative
            [Toxoplasma gondii VEG]
          Length = 6038

 Score =  117 bits (292), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 80/246 (32%), Positives = 111/246 (45%), Gaps = 83/246 (33%)

Query: 1161 IVESIRRDEFGLGPNISNMES-------------------------------------NM 1183
            I+E+IRR+EFG+G  +S   +                                     ++
Sbjct: 1916 IIEAIRREEFGVGLQLSGAAAGSIGDTRHSDCASEGAHLSPEEREARRKERETNALVEDV 1975

Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE------------------- 1224
            L +Q  RL RAL  LS+ LY+ DSH  +EL+QNADDN Y E                   
Sbjct: 1976 LLRQRQRLTRALDRLSRGLYTADSHLQMELLQNADDNNYSELQTQSKLKANGDLDAPSAP 2035

Query: 1225 -----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK----------GSSA 1269
                  +EP L F +   G+   NNE GF+ +++RALCDV  STK+          G+ A
Sbjct: 2036 GRPLTRLEPWLHFEITPQGLAAFNNENGFTEKDVRALCDVARSTKEKREGARKAEMGADA 2095

Query: 1270 GY-----------IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-IGFVLPTLVPP 1317
            G            IGR G+GFKSVF ++D P + S GF  KF+ S+   +GFVLP  + P
Sbjct: 2096 GAEDDGEKENVRKIGRFGLGFKSVFSISDRPHLFSQGFAFKFEASDPTGLGFVLPHWLEP 2155

Query: 1318 FNIDMF 1323
                 +
Sbjct: 2156 PEATFY 2161



 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 6/111 (5%)

Query: 1453 LLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
            L   +PVF +LP+R++GL F+L   F L +SRE +  +S +N  L +  P  F++A + F
Sbjct: 2535 LPESRPVFCYLPVRSFGLPFVLHAPFDLTASRESLVVSSFFNLQLRNALPETFLAALK-F 2593

Query: 1513 CDLPCFRENPAKAASVYMSFVP-LVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
            C         A     ++ F+P    + + FF    R IL  LR   CL++
Sbjct: 2594 CKHSALFSLQAS----FLRFLPFFAAKENDFFGPAARDILRLLRRDPCLMV 2640


>gi|357496873|ref|XP_003618725.1| Serine/threonine protein phosphatase [Medicago truncatula]
 gi|355493740|gb|AES74943.1| Serine/threonine protein phosphatase [Medicago truncatula]
          Length = 1547

 Score =  115 bits (288), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 112/390 (28%), Positives = 175/390 (44%), Gaps = 67/390 (17%)

Query: 1216 NADDNIYPENVEPTLTFILQESGIV---------VLNNEQGFSAENIRALCDVGNSTKKG 1266
            NA+DN Y   V PTL FI+  + I          + NNE+GFS +NI ++C VG STKKG
Sbjct: 17   NAEDNQYLVRVNPTLEFIITSTDIAATGAAATLLIFNNEKGFSRQNIESICSVGQSTKKG 76

Query: 1267 S-SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---EGQIGFVLPTLVPPFNIDM 1322
            + S+GYIG KGIGFKSVF VT  P I SNG+ + F+ +   +  +G+++P  V     + 
Sbjct: 77   NRSSGYIGEKGIGFKSVFLVTAQPYIFSNGYQISFNENPCPQCSLGYIVPEWVE----EK 132

Query: 1323 FCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM 1382
              RLL    +  +     T + LP +        +N++    S +HP +L FL +++   
Sbjct: 133  PTRLLDIKQIYGKDSLPTTTLVLPLKAD-----KINSVKQQLSSIHPEILSFLSKIRDFS 187

Query: 1383 FR------NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQ---KLRAGV-IRPDVK 1432
             R       M + + V I  +I  +   K +   +  T  ++ +   KL   V  R  V+
Sbjct: 188  VREDNEDPKMESVTTVSISSEI--NSAAKKNLNAESYTIHLSQKFPVKLENVVERRMGVE 245

Query: 1433 TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSP 1492
               + L    QE    N             LP   Y  + IL               ++ 
Sbjct: 246  DCVVTLVFPHQERLHKNKS-----------LP-GVYAWETILL--------------DNK 279

Query: 1493 WNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILS 1552
            WNQ +L   P+ F+ A R+   +    E P  +      F+P+    HG F+ +   I +
Sbjct: 280  WNQGILECVPSAFMEAFRTL--VAGSYEAPVSSLIYLFKFIPIHSSRHGNFNHVREEIKA 337

Query: 1553 KLRMSNCLILEGNNNQ---WAPP--CKVLR 1577
            KL     + +E  +NQ   + P   C++LR
Sbjct: 338  KLAEEKIIPIETFSNQKHFYKPREVCRLLR 367


>gi|342890662|gb|EGU89430.1| hypothetical protein FOXB_00054 [Fusarium oxysporum Fo5176]
          Length = 1666

 Score =  115 bits (287), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 118/488 (24%), Positives = 215/488 (44%), Gaps = 71/488 (14%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI 1221
            +E+ + D   + P++  +    LK        A   + ++L + +     EL+QNADDN 
Sbjct: 27   IEAAKLD--AMPPDVRRVVEEALKSNSKSHADATELI-RKLAAGNIDVAFELLQNADDNQ 83

Query: 1222 YPEN----VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
            Y         P   FI+ +  +V+ +NE GF+  NI+ALCD G STK G+S GYIG+  +
Sbjct: 84   YTRAKAACANPGAQFIVTQDMVVIGSNEDGFTESNIKALCDFGKSTKAGTS-GYIGQPDV 142

Query: 1278 GFKSVFRVTDAPEIHSN--GFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLE 1335
            GFKSVF +    +I S    F+++ +  +  +G V P  + P  +  + R         E
Sbjct: 143  GFKSVFTIASKVQIESGPFSFYLQHEDGDSGVGMVRPHWIEPPVLPDYVR---------E 193

Query: 1336 SKCWN------TCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND 1389
             + +       T I L  + +  + +  +N ++   D   S+LLF+  L+ +    +  D
Sbjct: 194  HEAYGGLFGNRTRIILFLKDQNRDNL-RSNFINQLWDTPESVLLFMRNLRSLRITVLGED 252

Query: 1390 SL------VVIRKKIVGDG---IIKVSCGEDKM-TW--FVASQKLRAGVI-------RPD 1430
            +       V++ + + G G     K   G  ++ TW  ++  +    G+         PD
Sbjct: 253  NKVQRSKDVLLDRDLFGSGTRITTKKKLGPSQLTTWNRYLVYKSTATGLPANPNRRHAPD 312

Query: 1431 VKT----TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREE 1486
                    ++ LA  + E  E    P++  Q VFAF+P+   GL F++  DFV  ++R++
Sbjct: 313  ASANSSNADVVLAFPVDEKGE----PIIEIQKVFAFMPVSETGLSFLIHSDFVANATRQK 368

Query: 1487 VDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGE--VHGFFS 1544
            +   S  N  +        V+A +  C  P            +M F+P +    +  F+ 
Sbjct: 369  LVTVSERNVRIKQAVENAVVTAVQDLCKKPGTEHK-------WMRFLPQINRNGIDPFWL 421

Query: 1545 GLPRMILSKLRMSNCLILEGNNNQW-----APPCKVLRG--WNDRAHSLLPDILLQKHLG 1597
            G   ++  +L++S   +L   + ++     A   K L     +     LLPD+  + +L 
Sbjct: 422  GFNNVL--RLKISIIPLLRPRSQKFGNLSKATSLKRLSADQLDQYGAPLLPDLEQEMYLS 479

Query: 1598 LGFLSKDI 1605
             G+  +D+
Sbjct: 480  SGYEPQDL 487


>gi|237845199|ref|XP_002371897.1| SWIM zinc finger domain-containing protein [Toxoplasma gondii ME49]
 gi|211969561|gb|EEB04757.1| SWIM zinc finger domain-containing protein [Toxoplasma gondii ME49]
 gi|221480764|gb|EEE19195.1| SWIM zinc finger protein domain-containing protein, putative
            [Toxoplasma gondii GT1]
          Length = 6036

 Score =  114 bits (285), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 81/241 (33%), Positives = 110/241 (45%), Gaps = 84/241 (34%)

Query: 1161 IVESIRRDEFGLGPNISNMES-------------------------------------NM 1183
            I+E+IRR+EFG+G  +S   +                                     ++
Sbjct: 1918 IIEAIRREEFGVGLQLSGAAAGSIGDTRHSDCASEGAHLSPEEREARRKERETNALVEDV 1977

Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE------------------- 1224
            L +Q  RL RAL  LS+ LY+ DSH  +EL+QNADDN Y E                   
Sbjct: 1978 LLRQRQRLTRALDRLSRGLYTADSHLQMELLQNADDNNYSELQTQSKLKANGDLDAPSAP 2037

Query: 1225 -----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK----------GSSA 1269
                  +EP L F +   G+   NNE GF+ +++RALCDV  STK+           + A
Sbjct: 2038 GRPLTRLEPWLHFEITPQGLAAFNNENGFTEKDVRALCDVARSTKEKREGARKAEMSAEA 2097

Query: 1270 GY-----------IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-IGFVLPT-LVP 1316
            G            IGR G+GFKSVF ++D P + S GF  KF+ S+   +GFVLP  L P
Sbjct: 2098 GAEDDGEKENVRKIGRFGLGFKSVFSISDRPHLFSQGFAFKFEASDPTGLGFVLPHWLEP 2157

Query: 1317 P 1317
            P
Sbjct: 2158 P 2158



 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 56/111 (50%), Gaps = 6/111 (5%)

Query: 1453 LLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
            L   +PVF +LP+R++GL F+L   F L +SRE +  +S +N  L +  P  F++A + F
Sbjct: 2537 LPESRPVFCYLPVRSFGLPFVLHAPFDLTASRESLVVSSFFNLQLRNALPETFLAALK-F 2595

Query: 1513 CDLPCFRENPAKAASVYMSFVP-LVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
            C               ++ F+P    + + FF    R IL  LR   CL++
Sbjct: 2596 CKHSAL----FSLQVSFLRFLPFFAAKENDFFGPAARDILRLLRRDPCLMV 2642


>gi|358397613|gb|EHK46981.1| conserved hypothetical protein [Trichoderma atroviride IMI 206040]
          Length = 1578

 Score =  114 bits (284), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 137/567 (24%), Positives = 225/567 (39%), Gaps = 114/567 (20%)

Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE-PTLTFILQESGIVVLNNEQGFSA 1250
              ++  L++++Y   + F+ EL+QNA+DN + +  E P ++F + E+ IVV  NE GF+ 
Sbjct: 52   AHSIKTLAKQIYGSGARFVFELLQNAEDNNFSKTTEDPYISFEVYENRIVVECNEDGFTE 111

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF--DTSEGQIG 1308
             ++ A+C VG ST K +S GYIG KGIGFKSVF       I S  F  +F     +  +G
Sbjct: 112  RDLVAICAVGQST-KSTSYGYIGAKGIGFKSVFIAAWKVHIQSGNFSFEFRHKRDDPGLG 170

Query: 1309 FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV-DMFSDL 1367
             V P  +P                +L+       +         E   + +I+     DL
Sbjct: 171  MVRPIWIPTNE-------------KLKGPLTRMTLYFHDEGDKDELQRLKSIIFKQLDDL 217

Query: 1368 HPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED----------KMTWFV 1417
              + LLFL +LQ I        + V   K+ +   + K     D          K +   
Sbjct: 218  QQTSLLFLKKLQRIKMVFYDKSNRVEKSKEFLKRQVDKYRVALDTDSAKDNETRKESQIY 277

Query: 1418 ASQKLRA-GVIRPDVK-------------TTEIALALTLQESNEGNYGPLL---YQQPVF 1460
                L A G+ R D +             + E+ LA  L       + P++    +Q +F
Sbjct: 278  HITTLTAHGLARSDNRDLPNTDDTQTISTSAEVVLAFPLT----AEFKPIVNTKKKQEIF 333

Query: 1461 AFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFC------- 1513
            AFLP+R    KF++  DF   ++R+++   S  N+ LL      F++A + FC       
Sbjct: 334  AFLPVRESDYKFLIHSDFDTSANRQDIITTSTRNENLLDWIAKAFLTAIKQFCEHSSLCY 393

Query: 1514 DLPCFRENPAKAASVYMS--------FVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGN 1565
            + P F  N   A   + +         V     +       PR+I     ++     EG 
Sbjct: 394  EWPLFLPNTEGALDTFWAELNIKIRRIVTTSPTLKSRHRRDPRVITDISILTKDATYEGG 453

Query: 1566 NNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIEEYGPKIL 1625
               +  P        D+   L P             SK +    S+ +  G++   P  +
Sbjct: 454  VPLFDDP--------DKDPFLSPK-----------YSKPV---RSVLKGYGLKNISPNAI 491

Query: 1626 LQIISSLCRTEN----GLRSMG------LSWLASWLNELYTISFHSSGQSSLQSGVETDL 1675
            L+++ S  R  N    G ++ G       S L SW +  Y+                   
Sbjct: 492  LEMLESDLRNPNSKMHGKKTDGAWYSAVASLLCSWFDRNYS------------------F 533

Query: 1676 IDNLQRIPFIPLSDGTFSSVDEGTIWL 1702
            ++ L+++P IPL DG + S   G I+ 
Sbjct: 534  LNRLRQLPLIPLRDGKWISAALGPIYF 560


>gi|402080239|gb|EJT75384.1| hypothetical protein GGTG_05321 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 1699

 Score =  114 bits (284), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 105/357 (29%), Positives = 159/357 (44%), Gaps = 45/357 (12%)

Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNEQG 1247
            G  +  L++ +YS D+ F+ EL+QNADD+ +         P++ F +    IVV  NE G
Sbjct: 52   GHMIKTLAKNIYSSDARFVFELLQNADDSRFQRARVRGDLPSIAFHVHPGRIVVECNEDG 111

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            F+  ++ A+C VG STK GS  GYIG KGIGFKSVF       I S  F   F   +G +
Sbjct: 112  FTKRDLSAICAVGQSTKSGSH-GYIGAKGIGFKSVFSAAWKVYIQSGYFSFYFKHRKGDL 170

Query: 1308 GF--VLPTLVPPFNI-DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV-DM 1363
            G   +LP  V   +  D    L S  PV          + L  +    E   ++NI+   
Sbjct: 171  GLGMILPIWVDGDDDNDTQNELPSPSPVT------RMTLHLHDKGDPEELEHLHNIILKQ 224

Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI--VGDGIIKVSCGEDKMT------- 1414
               L P+ LLFL  L+ I   +  +       ++   +GD        E  +T       
Sbjct: 225  LDGLQPTCLLFLRNLRRIQVISYDDGGRTTKSREFRQIGDAENHRVTLESTVTDQGGEAT 284

Query: 1415 ----WFVASQKLRAGVIR--------PDVKT-----TEIALALTLQESNEGNYGPLLYQQ 1457
                 F  ++ + + + R        P+ K+      E+ LA  L     G   P++ +Q
Sbjct: 285  TEERHFHVTRHMASNLSRSQNRELSGPEEKSRAFSEAEVVLAFPLT----GCSAPVVGRQ 340

Query: 1458 PVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
             +FAFLP+     KF++  DF   +SR+ +   S  N  LL    A F++A   FC+
Sbjct: 341  EIFAFLPVEDSTFKFLINSDFDTTASRQRIVTTSRRNIDLLDGIAAAFLTAVLQFCE 397


>gi|408391323|gb|EKJ70703.1| hypothetical protein FPSE_09213 [Fusarium pseudograminearum CS3096]
          Length = 1714

 Score =  112 bits (280), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 109/371 (29%), Positives = 164/371 (44%), Gaps = 50/371 (13%)

Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFIL 1234
            ++ N L  + A +  ++  L++++Y   + F+ EL+QNA+DN++      N  P ++F +
Sbjct: 40   IDENWLAMESA-VSHSVKILAKQIYGSGARFVFELLQNAEDNMFNKARASNALPFISFKI 98

Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                IVV  NE GF+  ++ A+C VG STK  +S GYIG KGIGFKSVF       I S 
Sbjct: 99   YLDKIVVECNEDGFTEPDLDAICAVGKSTK-SASHGYIGAKGIGFKSVFIAASRVHIQSG 157

Query: 1295 GFHVKF--DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
             F  +F  + ++  IG V P  V             K    + S      + +  +    
Sbjct: 158  NFSFEFRHNKTDPGIGMVRPIWV-------------KATETIPSPMTRMTLYIHDQGDPD 204

Query: 1353 EGIAMNNIVDM-FSDLHPSLLLFLHRLQCI---------------MFRNMLNDSLVVIRK 1396
            E   +  I+ M F DL  + LLFLH+L  I                FR    D   V  +
Sbjct: 205  EIRHLKRIISMQFDDLQETCLLFLHKLGQIDLEFYDDNGELERSRCFRKYDIDKYRVSLE 264

Query: 1397 KIV---GDGIIK--VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG 1451
             I    GD   +  +     +    +A    R      + K  E+ LA  + E +E    
Sbjct: 265  TISDSHGDQTTRNQIYHITRQTATNLAPSDNRELSNTDEGKRAEVVLAFPITEDSE---- 320

Query: 1452 PLLYQ--QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAE 1509
            PL+ Q  Q +FAFLPLR    KF++  DF   ++R++V   S  N  L       F  A 
Sbjct: 321  PLITQRNQDLFAFLPLRRSEYKFLIHSDFDTNANRQDVITTSRRNLDLRDWIATAFYQAV 380

Query: 1510 RSFCDLP--CF 1518
              FC+ P  C+
Sbjct: 381  LQFCEHPQLCY 391


>gi|46110190|ref|XP_382153.1| hypothetical protein FG01977.1 [Gibberella zeae PH-1]
          Length = 1612

 Score =  111 bits (278), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 164/371 (44%), Gaps = 52/371 (14%)

Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFIL 1234
            ++ N L  + A L  ++  L++++Y   + F+ EL+QNA+DN++      N  P ++F +
Sbjct: 40   IDENWLAMESA-LSHSVKILAKQIYGSGARFVFELLQNAEDNMFNKARASNALPFISFKI 98

Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                IVV  NE GF+  ++ A+C VG S K  S  GYIG KGIGFKSVF V     I S 
Sbjct: 99   YPDKIVVECNEDGFTEPDLDAICAVGKSNKLASH-GYIGAKGIGFKSVFIVASRVHIQSG 157

Query: 1295 GFHVKF--DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
             F  +F  + ++  IG V P  V P   +     L++  + +  +     IR        
Sbjct: 158  NFSFEFRHNKTDPGIGMVRPIWVKP--TETIPSPLTRMTLYIHDQGDPDEIR-------- 207

Query: 1353 EGIAMNNIVDM-FSDLHPSLLLFLHRLQCIMF----RNMLNDSLVVIRKKIVGDGIIKVS 1407
                +  I+ M F DL  + LLFLH+L  I       N   +     RK  V    + + 
Sbjct: 208  ---HLKRIISMQFDDLQETCLLFLHKLGQIGLEFYDENGELERSRCFRKYNVDKYRVSLE 264

Query: 1408 CGED--------KMTWFVASQKLRAGVIRP----------DVKTTEIALALTLQESNEGN 1449
               D           + +  Q   A  + P          + K  E+ LA  + E ++  
Sbjct: 265  TTSDSHGNRTTRNQIYHITRQT--ATNLAPSDNRELSNTDEGKRAEVVLAFPITEDSK-- 320

Query: 1450 YGPLLYQ--QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
              PL+ Q  Q +FAFLPLR    KF++  DF   ++R+++   S  N  L       F  
Sbjct: 321  --PLVTQRNQDLFAFLPLRRSEYKFLIHSDFDTNANRQDIVTTSRRNLDLRGCIATAFYQ 378

Query: 1508 AERSFCDLPCF 1518
            A   FC+ P  
Sbjct: 379  AVLQFCEHPTL 389


>gi|317154996|ref|YP_004123044.1| hypothetical protein Daes_3310 [Desulfovibrio aespoeensis Aspo-2]
 gi|316945247|gb|ADU64298.1| hypothetical protein Daes_3310 [Desulfovibrio aespoeensis Aspo-2]
          Length = 1118

 Score =  111 bits (277), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 169/687 (24%), Positives = 271/687 (39%), Gaps = 134/687 (19%)

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV---EPTLTFILQESGIVVLNNE 1245
            AR GR L     + Y   +HF+ EL+QNA+D +           +TF L    + + +  
Sbjct: 43   ARYGRILLA---DRYDDRTHFIFELLQNAEDALGRRGTWDGTRGITFDLTSESLTLSHFG 99

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG 1305
              F+  ++R +C +  STK  SS   +GR GIGFKSV+  TD PEIHS           G
Sbjct: 100  ASFNEADVRGVCGIAESTKDRSS---VGRFGIGFKSVYTFTDRPEIHS-----------G 145

Query: 1306 QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFS 1365
               F +   V P   D           +LE K   T I LP + +  +  A++ I + F 
Sbjct: 146  LEDFTVENYVQPRRAD-----------RLELKPEETRIILPLKPE--DTTALHEITEGFK 192

Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKK--IVGDGIIKVSC-GEDKMTWFVASQKL 1422
             L PS LLFL  +  I +      S + +R +   +   + K++  G++     V    +
Sbjct: 193  RLGPSSLLFLRHINEISWNAPGGASGIYLRSQPTFLAPDVQKITVIGQESGQADVDQNWI 252

Query: 1423 RAGVIRPDV--------KTTEIALALTLQESNEGNYGPL-LYQQPVFAFLP-LRTYGLKF 1472
               V R DV         + E+A ++       G +    +   P+  F P + +  L F
Sbjct: 253  ---VFRRDVFLADGAMGGSVEVAFSVEADGDEPGQWSVRPVPTSPLVVFFPTVVSTNLGF 309

Query: 1473 ILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA-----ERSFCDLPCFRENPAKAAS 1527
            ++QG ++   SR+ +    PWNQ L+ E  AL V A     E+S  D    R  P     
Sbjct: 310  LVQGPYLTTPSRDNIRQGEPWNQHLVEETAALVVDAIRWFREKSMLDTSTLRCLPLDRQG 369

Query: 1528 VYMS-FVPLVGEVHGFFSGLPRMILSKL---RMSNCLILEGNNNQWAPPCKVLRGWNDRA 1583
               S F P       FF  + +  L +    R     +  G         + LR      
Sbjct: 370  FKDSMFAP-------FFEAVRQAFLDEALLPRFGGGYVAAGQAK--LARTQELRELFSPG 420

Query: 1584 H-SLLPDILLQKHLGLGFLSKDIV------LSDSLARALGIEEYGPKILLQIISSLCRTE 1636
            H ++L DI      G  +LS DI       +   L R L IEE  P    +I+S L R  
Sbjct: 421  HVAVLFDI-----EGAAWLSGDITEDKAPEIRKYLMRELEIEEVTPA---KIVSRLSR-- 470

Query: 1637 NGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVD 1696
              L +    W+      +    F S  +++L+  ++T        IP + L+DG+     
Sbjct: 471  EFLEAQADEWI------VLLYQFLSEQETALRRYLDT--------IPLLRLADGSHIVAR 516

Query: 1697 EGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLL 1756
            E     +   + F           L +   T  P + +A         V S       LL
Sbjct: 517  E-----NGKPTAF-----------LPSGSETGFPTIRNA---------VCSTMESRNFLL 551

Query: 1757 KIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIV 1816
             +G+ +    D +  ++ P         GD     +Y+  +   L  +  +   +R+ ++
Sbjct: 552  SLGITEPDLVDDIIWNVKPKYDQSEVVVGD----TEYVTDIERILAAFETDSKAQRQKLL 607

Query: 1817 SELRKKAFV-------LTNHGFKRPSE 1836
            S LR+ AFV       + + G+ +PSE
Sbjct: 608  SGLRETAFVKIVEVGSVRSEGYGKPSE 634


>gi|67900586|ref|XP_680549.1| hypothetical protein AN7280.2 [Aspergillus nidulans FGSC A4]
 gi|40741961|gb|EAA61151.1| hypothetical protein AN7280.2 [Aspergillus nidulans FGSC A4]
 gi|259483371|tpe|CBF78706.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
          Length = 1882

 Score =  110 bits (276), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 91/341 (26%), Positives = 152/341 (44%), Gaps = 41/341 (12%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFILQESGIVVLNNEQ 1246
            LG  +   +   ++ D+ F+ E++QNA+D  Y     +N  P+L F +    IVV +NE 
Sbjct: 37   LGAPVREAAARAWTSDTRFVYEMIQNAEDCSYNSAKSKNAIPSLKFYVFLDRIVVDSNED 96

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
            GF   ++RA+C  G   K+G     +G KGIGFKSVF++     I S  F       +G+
Sbjct: 97   GFEESHVRAICSAGKRKKEGK----LGEKGIGFKSVFKIASKARIQSGPFCFSLRHRDGE 152

Query: 1307 IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSD 1366
             G     ++ P N          +P+          +R  F T FS       I D    
Sbjct: 153  NGL---GMIAPVN-------EIHEPLP-------DGVRTRF-TLFSRPEKCREIQDELLR 194

Query: 1367 LHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMT--WFVASQKLRA 1424
            L  +++ FL RL  + F    + + ++  +  +   ++ V      +    ++  ++  A
Sbjct: 195  LPHTIIAFLSRLHMLSFYLSRSSTSLMFSRPTIEPSLVCVETRHSTVMVHRYIPFRRKVA 254

Query: 1425 GV----IRPDVKTTEIALALTLQESNEGNYGPLLYQQP-VFAFLPLRTYGLKFILQGDFV 1479
            G+     RP ++  E+ LA           G  L  QP   ++ PLR  GL F++Q DFV
Sbjct: 255  GLPPRESRPLIEDVEVVLAFPQD-------GRALTTQPFAHSYRPLRGAGLNFLIQSDFV 307

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRE 1520
               + E++    PWN  +L   P +F +A +  C +P F +
Sbjct: 308  TQDNGEDLVA-CPWNDHILDSLPGVFFAALQEICSIPEFEK 347


>gi|378732103|gb|EHY58562.1| hypothetical protein HMPREF1120_06571 [Exophiala dermatitidis
            NIH/UT8656]
          Length = 1671

 Score =  108 bits (270), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 141/556 (25%), Positives = 241/556 (43%), Gaps = 112/556 (20%)

Query: 1186 KQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNE 1245
            +  A   R    ++++LYS     + EL+QN DD+ Y   VEP++TF ++ S +++ +NE
Sbjct: 57   EHRALYARYTRTVAEQLYSSRFRLIYELLQNGDDSCYANAVEPSITFRIKPSALIIESNE 116

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG 1305
             GF+ EN+ ++CD G S+ K S+A   G KG+GFKSVF + +   I S  +  +F+  +G
Sbjct: 117  LGFTLENVESICDTGKSS-KASNADTTGEKGLGFKSVFGIAEYVHIQSGLWSFRFEHRQG 175

Query: 1306 QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFS 1365
            + G     ++ P   D+   L S           N   RL  R      I    +V  F 
Sbjct: 176  EEGV---GMITPVWTDVTSLLPS-----------NVGTRLTLRYYDQRDIFSTRLVSEFE 221

Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
             L  +++  L +L+ ++               IV DG+     G      F     LR+ 
Sbjct: 222  KLPRTIMFALRKLRKLV---------------IVVDGV----AGRSDRIAFEKDGDLRSD 262

Query: 1426 VIRPDV-----------KTTEIAL---------ALTLQESNEGN---------YG-PLLY 1455
             IR +             TT + L         + T + S+E +         YG P++ 
Sbjct: 263  QIRINTTVTGNFGAHTSATTWLRLFQNTVSDLPSDTHRTSSESDVTVAFEVDSYGLPVIP 322

Query: 1456 Q--QPVFAFLPL-RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
               Q VFA+LP+ R   L F++  +F+L ++RE V  ++ WN  L     + FV      
Sbjct: 323  ANGQHVFAYLPVERIIQLPFLINANFILTANRESVP-DTDWNNALREGVSSAFV----GL 377

Query: 1513 CDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPP 1572
             D      +P +    +M ++P    + GF+  L       + + N L+  G        
Sbjct: 378  VDKVVTSGDPLEFK--WMRYIPQ-RPMQGFWEYL------WVEIQNELVNRG-------- 420

Query: 1573 CKVLRGWNDRAHSL-----LPDILLQKHLGL-GFLSKDIVLSDSLARA-LGI-EEYGPKI 1624
              +LR      HS+     LPD  +  +  +     +DI LSD    + +GI +++G   
Sbjct: 421  TPILRSRRGELHSIEHMATLPDWFVHDNEPIVADTDEDIYLSDGYESSDVGILKDFG--- 477

Query: 1625 LLQIISSLCRTENGLR-SMGLSWLASWLNELYTI----SFHSSGQSSLQSGV-ETDLIDN 1678
                ++ +  TE   R  +GLS  A     +Y I    S+H++  + +Q  + E  +  +
Sbjct: 478  ----VAPISATEILERIKLGLSQGAE--APIYHIPLDDSWHTAFAALIQRLLGERSIKKD 531

Query: 1679 LQRIPFIPLSDGTFSS 1694
            ++R+  IPL D  + S
Sbjct: 532  VERLRIIPLDDNRWVS 547


>gi|401403116|ref|XP_003881414.1| hypothetical protein NCLIV_044430 [Neospora caninum Liverpool]
 gi|325115826|emb|CBZ51381.1| hypothetical protein NCLIV_044430 [Neospora caninum Liverpool]
          Length = 6000

 Score =  107 bits (267), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 70/186 (37%), Positives = 92/186 (49%), Gaps = 50/186 (26%)

Query: 1182 NMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP------------EN---- 1225
            ++L +Q  RL RAL  LS+ LY+ DSH  +EL+QNADDN Y             EN    
Sbjct: 1962 DVLLRQRQRLTRALDRLSRGLYTADSHLQMELLQNADDNNYSALEALRSQVRMRENASLS 2021

Query: 1226 --------VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK------------- 1264
                     EP+L F +   G+   NNE GF+ +++RALCDV  STK             
Sbjct: 2022 ADGLSLALAEPSLHFEVTPEGLAAFNNENGFTEKDVRALCDVARSTKVKREVRATQANGA 2081

Query: 1265 -----------KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-IGFVLP 1312
                       K      IG+ G+GFKSVF ++D P + S GF  KF+ S+   +GFVLP
Sbjct: 2082 GSIPEEGKDREKAGEVKKIGKFGLGFKSVFAISDRPHLFSKGFAFKFEASDPTGLGFVLP 2141

Query: 1313 T-LVPP 1317
              L PP
Sbjct: 2142 HWLEPP 2147



 Score = 52.0 bits (123), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 58/107 (54%), Gaps = 6/107 (5%)

Query: 1457 QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLP 1516
            +PVF +LP+R++GL F+L   F L +SRE +  +S +N  L +  P  F++A   FC   
Sbjct: 2534 RPVFCYLPIRSFGLPFVLHAPFELTASRESLVVSSFFNLQLRAALPETFLAA-LHFCK-- 2590

Query: 1517 CFRENPAKAASVYMSFVP-LVGEVHGFFSGLPRMILSKLRMSNCLIL 1562
              R +     + ++ F+P    + + FF+   R IL  LR   C+++
Sbjct: 2591 --RSDLFPLQTSFLRFLPFFAAKDNDFFAPAAREILRLLRQEPCVMV 2635


>gi|309791644|ref|ZP_07686136.1| hypothetical protein OSCT_2087 [Oscillochloris trichoides DG-6]
 gi|308226266|gb|EFO80002.1| hypothetical protein OSCT_2087 [Oscillochloris trichoides DG6]
          Length = 1068

 Score =  106 bits (265), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 149/591 (25%), Positives = 239/591 (40%), Gaps = 112/591 (18%)

Query: 1202 LYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGN 1261
            LY+  +HF+ EL+QNA+D    +     L F+L +  + V +N + F+  ++R +C VG 
Sbjct: 29   LYADRTHFVFELLQNAEDAGATQ-----LKFVLHKDRLDVWHNGRIFNEHDVRGICGVGE 83

Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNID 1321
             TK     G IG+ GIGFKSV+  T  P I S G                      F I+
Sbjct: 84   GTK-AEDMGKIGKFGIGFKSVYAYTRRPSIFSGGER--------------------FAIE 122

Query: 1322 MFCRLLSKDPVQLESKCWNTCIRLPFRTK-FSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
             + R +  + + L  K W T I LPF  +     +A + I+  F  L  S +LFL  L  
Sbjct: 123  HYVRPVPLNSIAL-PKPWTTLISLPFSPEALHPNLAYHQILHRFRSLKASTILFLRHLTQ 181

Query: 1381 IMF------RNMLNDSLVVI---RKKI---VGDGIIKVSCGEDKMTWFV---------AS 1419
            + +      + ML+     +   R+K+   V DG       E++ TW V          S
Sbjct: 182  VEWQVEGGEQGMLSYDFTKLGAGRRKMALTVRDG------KEEEETWLVFERPVSLPKGS 235

Query: 1420 QKLRAGVIRPDVKT------------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT 1467
            ++L  G      +T             EIA  L   +      G       V  F   + 
Sbjct: 236  RRLPNGEYEVFGRTLALPSNHKEALKVEIAFRLDQDKRKGDQIGTAQVSHLVVFFPTHKE 295

Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAAS 1527
              L F++QG +    +R+ +  N PWN++L+SE   L V A      L          + 
Sbjct: 296  TELGFLMQGLYTTTPARDNIRINDPWNEYLISETAKLLVDALEELKSLGLL-------SV 348

Query: 1528 VYMSFVPLVGEVHGF---FSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAH 1584
              ++ +PL     G    F  +   + + LR    L     +   A   K+ RG      
Sbjct: 349  QAINTLPLRPSKFGHDNPFRPIFDAVRTALRDRPLLPTHDGSFTAARDAKLARGAG--LI 406

Query: 1585 SLLPDILLQKHLG----LGFLSKDIVLSDS--LARAL-GIEEYG--PKILLQIISSL-CR 1634
             LLP  LL K  G    + +LS +I L  +  L R L G  E    P+ L  ++  +  R
Sbjct: 407  ELLPPTLLGKLFGSSKPIQWLSSEISLDRTPDLYRYLVGRFERAEDPERLGALVERIEIR 466

Query: 1635 TENGLRSMGLSWLAS----WLNELYTISFHSSGQSSLQSGVETDLIDNLQ---------- 1680
             E  L  +  +WL +    W+ + Y++ F+  G++ ++  + T  I  L           
Sbjct: 467  PETVLAQLTPAWLKTQPDAWMIQFYSL-FNGIGEADIRKLLRTKEIIRLSNGEHVAAYGA 525

Query: 1681 ------RIPFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKL 1725
                   +P   ++D +F +V + TI  H D   F    GL + PN+  ++
Sbjct: 526  GSTPRVHLPSSVMADRSFPTV-KATIAAHPDALEFLKSLGL-SVPNVATEV 574


>gi|134080334|emb|CAK46256.1| unnamed protein product [Aspergillus niger]
          Length = 1917

 Score =  105 bits (263), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 85/293 (29%), Positives = 133/293 (45%), Gaps = 41/293 (13%)

Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENI 1253
            AL    +++YS  +HFLLEL+QNADDN Y  N+ P L    +   +    NE GF+   +
Sbjct: 39   ALQFFGEDIYSTAAHFLLELIQNADDNEYAANITPALVLTYRNRVLRTDCNELGFTPGQV 98

Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPT 1313
             A+C++G STKK       G KG+GFKSVF V  +  I S  +  KFD S  Q+G ++P 
Sbjct: 99   EAICNLGYSTKKDKVEA-TGEKGVGFKSVFSVAASVWIASGPYTFKFDAS-NQMGMLIPI 156

Query: 1314 LVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLL 1373
               P +   +                NT +   F  +  E I  +++      +   +L 
Sbjct: 157  WESPPDATRYA---------------NTSMYARFAPRADERIVFHDL----DVVGAKILT 197

Query: 1374 FLHRLQCIMFRNMLNDS----LVVIRKKIVG-DGIIKVS---CGE-----DKMTWFVASQ 1420
            FL +L+ +     L         V R+     DG+        GE      + T+ V   
Sbjct: 198  FLRKLRELNLHIELQSGDRICYNVARQDTTSPDGLCMTQLTFTGEMFPVAQRKTYVVCRY 257

Query: 1421 KLR---AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGL 1470
            +++       RP  +TTE+ +A  + +  +    P+L  Q V+A LP+R+YG 
Sbjct: 258  EVKEMPQERKRPGRQTTELRIAFPVDDFGQ----PVLESQDVYAVLPVRSYGF 306


>gi|323455043|gb|EGB10912.1| hypothetical protein AURANDRAFT_62372 [Aureococcus anophagefferens]
          Length = 957

 Score =  103 bits (257), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 121/471 (25%), Positives = 188/471 (39%), Gaps = 48/471 (10%)

Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
             T I LP   +F        + +  +++ P LLLFL+R++ I   +   D     R    
Sbjct: 226  GTTIVLPLTDEFRGDAKRAELRERLAEVDPLLLLFLNRVRRIDVVDRGADGETTARSMRR 285

Query: 1400 GDG--------IIKVSCGEDKM-----TWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
             D         ++ ++   D       TW V  ++       P    T   LA   + S 
Sbjct: 286  ADAPGGAANASVVTLADAVDGAEPAASTWLVVRREAAVVPWAPRGDATRTVLAAAFEASE 345

Query: 1447 EGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
                G    Q PVFAFLP+    LKF+L  D++L SSRE V  +SPWNQWL  E   L+V
Sbjct: 346  APRDGDA--QLPVFAFLPIAPANLKFVLHADWILASSRETVREDSPWNQWLRDEAAGLYV 403

Query: 1507 SAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNN 1566
             A  +               +  +  +P+  EV GFF  +   I  ++R   C +    +
Sbjct: 404  DALAAVRAG--AGAGAGFDVARVLEALPVEREVVGFFRPVAAEIARRVRGVECCL--AAD 459

Query: 1567 NQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSL----ARALGIEEYGP 1622
              WA P + L    +R     P +L +    LG    D    D L      ALG+E    
Sbjct: 460  GTWALPSRAL-AVPERLRREAPGLLARAPALLGRSCLDQAFQDRLRPETRAALGVEALDA 518

Query: 1623 KILLQIISSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRI 1682
               L++ ++L   ++    + L W A     LY  +  + G S+          D L+ +
Sbjct: 519  SHALRLAAALADADDLGAPVFLEWAAL----LYRAAAPAGGVSAADR-------DRLRSL 567

Query: 1683 PFIPLSDGTFSSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSS 1742
              +PL+ G   +     I++       +G   LE F  + A L  V PA+L+ S      
Sbjct: 568  AVVPLAGGGRGAPGAAPIYVDRG----EGAEPLEGFAAVRADLAVVDPAVLAPSLARPM- 622

Query: 1743 LGVISVDNLNRMLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADY 1793
              V+SV      L   GV+  +A  +   H+LPA++        K L A +
Sbjct: 623  --VVSV------LRSAGVKDATAEAVCLEHVLPALAAADAGEDRKRLNARW 665


>gi|302891159|ref|XP_003044462.1| hypothetical protein NECHADRAFT_83213 [Nectria haematococca mpVI
            77-13-4]
 gi|256725385|gb|EEU38749.1| hypothetical protein NECHADRAFT_83213 [Nectria haematococca mpVI
            77-13-4]
          Length = 2106

 Score =  103 bits (257), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 82/275 (29%), Positives = 123/275 (44%), Gaps = 34/275 (12%)

Query: 1182 NMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVV 1241
            ++ K Q  RL   L C   ELY   +HF+ EL+QNADDN Y +   PT+ F+ +   + V
Sbjct: 5    SLAKAQIERLHEDLSC---ELYDTSTHFIQELLQNADDNKY-DCETPTMRFVYKPGSLRV 60

Query: 1242 LNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
              NE+GF A N+ A+C +  STK G   GY G KGIGFKSVFR  D   I S  +  KFD
Sbjct: 61   DCNERGFDASNVEAICTIRQSTKAGRR-GYTGEKGIGFKSVFRAADVVWISSGAYSFKFD 119

Query: 1302 TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
                + G V P             L ++ P  ++     T        +F E      +V
Sbjct: 120  KHR-RFGLVAP-------------LWAEFPEPIQEGY--TSFFFQLSPQFDE----KELV 159

Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV----VIRKKIVGDG--IIKVSCGEDKMTW 1415
             M  +   + +LFL RL+ +      +D  V    + R     DG  +  +  G  ++ +
Sbjct: 160  QMLLNFEATHILFLRRLRHVELEVTSSDGKVWTKSIHRTDTTEDGNAVSYLESGNIRLKY 219

Query: 1416 FVASQK---LRAGVIRPDVKTTEIALALTLQESNE 1447
             +   +   L     RP    +++ LA    + +E
Sbjct: 220  LMYKHQVTNLPEETKRPGCTDSDLVLAFPFLDRDE 254


>gi|159486654|ref|XP_001701353.1| hypothetical protein CHLREDRAFT_153826 [Chlamydomonas reinhardtii]
 gi|158271748|gb|EDO97561.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 2248

 Score =  102 bits (255), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 77/209 (36%), Positives = 101/209 (48%), Gaps = 50/209 (23%)

Query: 1406 VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL------QES-------------- 1445
            V C    +T  +  Q  RAGV+   V  T ++LA  L      +ES              
Sbjct: 3    VCCRWAVVTSALQPQTSRAGVV---VAQTRVSLAFCLGPPVAAEESAAVRSGGPPPPPLA 59

Query: 1446 ----NEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEF 1501
                 EG   P   QQPVFAFLPLR YGL+F++Q D+VLPSSRE +D +SPWNQ L  + 
Sbjct: 60   APHWREGGRPP---QQPVFAFLPLRGYGLRFVVQADWVLPSSREALDADSPWNQELRQQL 116

Query: 1502 PALFVSAERSFCDLPCFRENPAKAASV------------------YMSFVPLVGEVHGFF 1543
            P LF+ A   F  LP    + + A+S                   ++  VPL GE  GFF
Sbjct: 117  PGLFLRALDVFRSLPPPATSTSTASSTSTASSTPTAASCLFWTDQWLRCVPLEGEAQGFF 176

Query: 1544 SGLPRMILSKLRMSNCLILEGNNNQWAPP 1572
            +GLP  I + LR + C+ + G    WA P
Sbjct: 177  AGLPHSIAALLRAAPCIPVAGGG--WATP 203


>gi|121609568|ref|YP_997375.1| hypothetical protein Veis_2613 [Verminephrobacter eiseniae EF01-2]
 gi|121554208|gb|ABM58357.1| conserved hypothetical protein [Verminephrobacter eiseniae EF01-2]
          Length = 412

 Score =  102 bits (253), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 101/382 (26%), Positives = 154/382 (40%), Gaps = 48/382 (12%)

Query: 1172 LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI--YPENVEP- 1228
            +  N   +     ++    +GR    L  + Y   +HF+ EL+QNA+D +    ++V P 
Sbjct: 1    MASNYEAISDENRRRYGTDIGRIGPMLLADRYDDRTHFIFELLQNAEDALGRRGDSVGPR 60

Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
             +TF L  + +   +  + F   ++R +C +  STK   S   IGR GIGFKSV+  TD 
Sbjct: 61   KVTFELTSTRLTRSHFGRPFDETDVRGVCGIAESTKDKFS---IGRFGIGFKSVYTFTDR 117

Query: 1289 PEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR 1348
            PEIHS           G   F +   V P   D           + + K   T I LP  
Sbjct: 118  PEIHS-----------GDEDFAIANYVQPQRAD-----------RTDRKAGETQIVLPL- 154

Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI-----------MFRNMLNDSLVVIRK- 1396
             K  +  A   I   F  L P  LLFL  ++ I              N+  +    +R  
Sbjct: 155  -KADDDTAQQEISAGFRHLGPGALLFLRHIEEINWSVDGGASGLYLHNVTKEFGENVRLI 213

Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG-PLLY 1455
             ++G    KV   E    W V  +++     +  V   EIA +L   +   G +    + 
Sbjct: 214  TVIGQETGKV---EVDQNWLVFHREVFTEA-KEKVGRVEIAFSLVAIKDAPGRWSVQPVA 269

Query: 1456 QQPVFAFLP-LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
              P+  F P +    L F++QG +    SR+ +    PWNQ L+ E  +L V A R   D
Sbjct: 270  ASPLVVFFPTVVPTNLGFLVQGPYRTTPSRDNIPRGEPWNQHLVKETSSLLVEAMRWMRD 329

Query: 1515 LPCFRENPAKAASVYMSFVPLV 1536
              C+   P  A        P+ 
Sbjct: 330  KGCWTRPPCAACRSTARSSPMA 351


>gi|342883116|gb|EGU83674.1| hypothetical protein FOXB_05809 [Fusarium oxysporum Fo5176]
          Length = 1626

 Score =  102 bits (253), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 95/325 (29%), Positives = 149/325 (45%), Gaps = 50/325 (15%)

Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY----PENVEPTLTFIL 1234
            ++ N  K ++A    ++  L++ +Y   + F+ EL+QNA+DN +     +N  P ++F +
Sbjct: 40   IDENWFKMENA-ASHSIKVLARNIYGSGARFVFELLQNAEDNSFRKADGKNDPPFISFQI 98

Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                IVV  NE GF++ +++A+C VG STK  +  GY+G KGIGFKSVF       I S 
Sbjct: 99   HPKHIVVECNEDGFTSLDLKAICSVGESTKT-AKHGYVGAKGIGFKSVFIAASRVHIQSG 157

Query: 1295 GFHVKF--DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
             F  +F  + ++  +G V P  V P   + F   L++  + L  +     I         
Sbjct: 158  NFSFEFRHNRNDPGLGMVRPIWVRP--TETFPSPLTRTTLYLHDQGDEDEIE-----HLK 210

Query: 1353 EGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF-----RNMLNDSLVVIRKKIVGDGI---I 1404
              IAM      F DL  + LLFL +L  I          L  S    +KK+    +    
Sbjct: 211  TVIAMQ-----FDDLQETCLLFLRKLSKISVAFYDEEGNLQRSKQFTKKKVDDYRVSLET 265

Query: 1405 KVSCGEDKMT---WFVASQKLRAGV-------------IRPDVKTTEIALALTLQESNEG 1448
             V  G++  T    +  +++L  G+              R  + + E+ LA  LQ     
Sbjct: 266  TVVSGDESATTSQMYHITKQLATGLAQSESRDAATTEEARKSLTSAEVVLAFPLQ----S 321

Query: 1449 NYGPLL--YQQPVFAFLPLRTYGLK 1471
            NY P +   +Q +FAFLPLRT   K
Sbjct: 322  NYQPQISTRRQELFAFLPLRTSDYK 346


>gi|66359780|ref|XP_627068.1| sacsin like HSP90 chaperone domain, likely plant origin
            [Cryptosporidium parvum Iowa II]
 gi|46228502|gb|EAK89372.1| sacsin like HSP90 chaperone domain, likely plant origin
            [Cryptosporidium parvum Iowa II]
          Length = 2120

 Score =  102 bits (253), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 55/139 (39%), Positives = 85/139 (61%), Gaps = 8/139 (5%)

Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIV 1240
            K Q  R  R++  LS ELYS     LLEL+QNADD+ +  N      P +  +L+   +V
Sbjct: 22   KSQRDRQDRSIGILSNELYSSFGRVLLELLQNADDSNFNFNEIGEGIPQVYILLRWDCLV 81

Query: 1241 VLNNEQGFSAENIRALCDVGNSTKKGSSAGYI-GRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
               NE+G + ENI ++CD+GNS+KK S + Y  G KGIG++S+F++TD P I S G+ +K
Sbjct: 82   FHINEKGMTRENIESVCDIGNSSKKNSESRYFTGEKGIGYRSIFKITDNPCIFSGGYSIK 141

Query: 1300 FDTS---EGQIGFVLPTLV 1315
            F +    E ++ +++P  +
Sbjct: 142  FSSKPDEEIELSYIVPKWI 160


>gi|154291063|ref|XP_001546118.1| hypothetical protein BC1G_15419 [Botryotinia fuckeliana B05.10]
          Length = 1440

 Score =  101 bits (252), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 157/343 (45%), Gaps = 44/343 (12%)

Query: 1153 KECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQE--------LYS 1204
            +E +DA   ++ +R    G    I+  E   L +   ++  AL  L  +        LY 
Sbjct: 7    EELQDARAFIDDLRSANGG----ITKEEEEELDRLLPKVKEALENLQNKVGISTKMFLYQ 62

Query: 1205 QDSHFLLELVQNADDNIYPENVE-PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
            + + F+ E++QNA+DN Y  + + P L+F L    +++ +NE GF+ ENI+A+C +G ST
Sbjct: 63   KSTRFIYEIIQNAEDNEYKGSKKLPFLSFTLYSDRLIIDSNEDGFTKENIKAICSIGEST 122

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ-----IGFVLPTLVPPF 1318
             K S  GYIG KGIGFKSVF V     I S  +   F+ S+       +G V P    P 
Sbjct: 123  -KASIQGYIGEKGIGFKSVFTVAHKVHIQSGCYSFAFEYSKNDDGKSGLGMVTPMNETPH 181

Query: 1319 NI--DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLL--- 1373
             +  D+  R++      L       C  L ++   +    +   +     L  +  L   
Sbjct: 182  ELPGDVQTRMI------LYLHAGEDCGAL-YQELLNLPDTLLLFLKKLRQLTVNFELPDR 234

Query: 1374 FLHRLQCIMFR---NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQ--KLRAGVIR 1428
             +H++Q  + R   + L++++V  +  I G     V+    K  W        L A   R
Sbjct: 235  NVHQVQFSLTRPNNDRLSENIVQTKTAITG----AVNSVTHKSFWVKKRPVFNLPADSAR 290

Query: 1429 PDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
              ++  E+ LA  + E    ++ P++ +Q  FAFLP++  G K
Sbjct: 291  GKIREAEVILAFPIDE----DHVPVIEEQYAFAFLPMKKTGYK 329


>gi|209876910|ref|XP_002139897.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555503|gb|EEA05548.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 2423

 Score = 99.8 bits (247), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 52/131 (39%), Positives = 83/131 (63%), Gaps = 8/131 (6%)

Query: 1190 RLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVE----PTLTFILQESGIVVLNNE 1245
            RL R++  L+ ELYS     LLEL+QNADD+ + E  +    PT+  +L+   ++V  NE
Sbjct: 25   RLERSISILTTELYSSLGRALLELLQNADDSNFSECYDKDSIPTVYILLRWDCLIVHINE 84

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAG-YIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             G + ENI ++CD+GNS+KK SS   Y G KGIG++SVF++++ P I+S  + + F    
Sbjct: 85   LGMTPENIESICDIGNSSKKNSSVRFYTGEKGIGYRSVFQISNNPSIYSGYYSISFSNEP 144

Query: 1305 GQ---IGFVLP 1312
             +   +G+++P
Sbjct: 145  DKDVGLGYIVP 155


>gi|154500304|ref|ZP_02038342.1| hypothetical protein BACCAP_03971 [Bacteroides capillosus ATCC 29799]
 gi|150271036|gb|EDM98310.1| hypothetical protein BACCAP_03971 [Pseudoflavonifractor capillosus
            ATCC 29799]
          Length = 1069

 Score = 99.0 bits (245), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 91/342 (26%), Positives = 153/342 (44%), Gaps = 45/342 (13%)

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNNEQGFSAE 1251
            R+   L   LYS+++HF+ E +QNA+D       E + LTFI Q+  ++VL++ + F+  
Sbjct: 26   RSFEKLLTGLYSENNHFIYEFLQNAED------AEASFLTFICQKKQLLVLHDGRPFNEA 79

Query: 1252 NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL 1311
            ++  +C V   TK+   A  IG  G+GFKSVF+ T+ PEI+SN      +        +L
Sbjct: 80   DVWGICSVMEGTKRREDAQKIGHFGVGFKSVFKYTERPEIYSNEETFAIEN------ILL 133

Query: 1312 PTLVPPFNIDMFCRL----LSKDPVQLESKCWNTCIRLPFRTK---FSEGIAMNNIVDMF 1364
            P  +P  +    C       +  P + +  C  T   LPFR +      GI  ++I    
Sbjct: 134  PVAIPRGDFPTDCTYQLDGQTVHPFKEKENC--TKFVLPFRDREELSRSGIDPDDIPRKL 191

Query: 1365 SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIR----KKIVGDGIIK---VSCGEDKM---T 1414
             +L P +LLFL  ++ + + +    +    +    K        K   VS GE+++    
Sbjct: 192  QELEPEILLFLRHVRTLTWVDETTGAYGTYQNIPSKTDTNTHTCKKRCVSAGEEELEAQK 251

Query: 1415 WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFI 1473
            + V   +   G ++        A+   ++   +           V+ F P     GL+F+
Sbjct: 252  YLVFEDRFDCGEMKNACVKAAFAMDRAIKPVKDTR---------VWVFFPTTDESGLRFL 302

Query: 1474 LQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDL 1515
            L G +  P SRE++  +S +N+ L      L V   RS  DL
Sbjct: 303  LHGTYQTPISREKIISDSSFNKMLHERAEELVV---RSMEDL 341


>gi|325859904|ref|ZP_08173033.1| conserved domain protein [Prevotella denticola CRIS 18C-A]
 gi|325482586|gb|EGC85590.1| conserved domain protein [Prevotella denticola CRIS 18C-A]
          Length = 648

 Score = 99.0 bits (245), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 163/371 (43%), Gaps = 78/371 (21%)

Query: 1194 ALHCLSQELYSQ-DSHFLLELVQNADDNIYPENVEPTLTFILQ--ESGIVVLNNEQGFSA 1250
            ALH +  +LYS  D  F+ EL+QNADD   PE+ +P ++ ILQ  +  ++ ++N + F  
Sbjct: 160  ALHNIMDDLYSNADDRFVYELLQNADDQ--PEDGQP-VSVILQLLKEHLLFMHNGRVFDD 216

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
             ++ ++C +G+STK+      IG KGIGFKSVF  +D   I+S  +   FD      G  
Sbjct: 217  NDVDSICSIGDSTKRKDKEK-IGYKGIGFKSVFTGSDTVIINSGNYSFAFDKYSPVYG-- 273

Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSE-------GIAMN----- 1358
                    ++DM     +  P QL+   W    R P   + +E       GI++      
Sbjct: 274  --------DLDM-----NNIPWQLKP-IWQEKYRYPKEVRENEIFWKKRVGISLEIEEKD 319

Query: 1359 ------NIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDK 1412
                  +I  +FS  HP  LLFL  +  + F     +  V I K  VGD +     G   
Sbjct: 320  LAEYRMSIAKIFS--HPIFLLFLKNVTNLEFDE--GELHVRISKSNVGDILRIEKDGVVD 375

Query: 1413 MTWFVAS--------------------QKLRAGVIRPDVKTTEIALALTLQESNEGNYGP 1452
             +W V                      +KL+ G +      T+I+ A  + +        
Sbjct: 376  SSWIVKDYPITIPQEVRDALQDDRNVPEKLKKGTM------TQISFAAKVDDGKVVK--- 426

Query: 1453 LLYQQPVFAFLP--LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAER 1510
             +    ++A+LP  +  +G  FI+  DF+L ++RE++     WNQ+L  E   L V    
Sbjct: 427  -MDNSVLYAYLPTSVNDFGFNFIVNADFLLAANREQLHVKKRWNQFLFGEIGKLLVDWVA 485

Query: 1511 SFCD-LPCFRE 1520
            S    +P + E
Sbjct: 486  SLAKVIPSYLE 496


>gi|390441275|ref|ZP_10229407.1| hypothetical protein MICAI_370004 [Microcystis sp. T1-4]
 gi|389835403|emb|CCI33533.1| hypothetical protein MICAI_370004 [Microcystis sp. T1-4]
          Length = 1545

 Score = 98.2 bits (243), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 123/505 (24%), Positives = 220/505 (43%), Gaps = 68/505 (13%)

Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI 1239
            E+   ++    L  AL  L+ +LY++   F+ EL+QNADD I  E  +  + F++     
Sbjct: 14   ETGQSERDAENLANALTRLTGDLYTETERFIFELLQNADD-IPNETGQVNVLFVILREHF 72

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
            ++L+N + F + N+ A+  +  STK  +S    G KGIGFKSVF  ++   I+S  F  K
Sbjct: 73   LILHNGKPFDSTNVDAISSISKSTKVNNSEQ-TGYKGIGFKSVFADSECVYINSGNFSFK 131

Query: 1300 FDTSEGQIGFV--LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA- 1356
            FD ++ +   +   P  + P  ID       K    +E + +      P  T    G A 
Sbjct: 132  FDRNDKRHKNLEKTPWQIKPIWID-------KKEYPVEIQQYQQFFTFPVATAIYVGQAK 184

Query: 1357 ----MNNIVDMFSDLHPSLLLFLHRLQCI--MFRNMLNDSLVVIRKKIVGDGIIKVSCGE 1410
                   +  +F D  P L+LFL  +Q I  +  N +N   V I K+  G G  K+ C  
Sbjct: 185  IKEYKTKLEKLFQD--PRLILFLRHVQNINVIGLNNINIVDVSITKEKQG-GKYKI-CRN 240

Query: 1411 DKMT-WFV----------ASQKLRAGVIRPD----VKTTEIALALTLQES-----NEGNY 1450
            +++T W V             K+ +    PD    ++ ++I  A  L ++     NE + 
Sbjct: 241  NQITYWLVQDFDFNVPKQVKDKMESDKKIPDKLKLIERSKITFAAQLIDNKLVAVNEED- 299

Query: 1451 GPLLYQQPVFAFLP--LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
                    +F +LP  ++ Y L F++  DF+  ++R+E+  ++PWN +L +      ++ 
Sbjct: 300  ------SVLFTYLPTNVKEYKLPFLVNADFLTTANRQEIHHDNPWNIFLFANIAYYSLTW 353

Query: 1509 ERSFCDLPCFREN-----PAK------AASVYMSFVPLVGEVHGFFSGLPRMILSKLRMS 1557
                     +R       P+K       +  ++SF     +     + +P    +KL   
Sbjct: 354  IAEVAKKTEYRNQVTNILPSKFSLNLAKSETFISFNQEFDQALELIAFIPSEQDNKLLTV 413

Query: 1558 NCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSK--DIVLSDSLARAL 1615
            N  IL+        P + +R   D     + + LL K     F  K  D+    S  +++
Sbjct: 414  NNSILDETEITKIIPVETVRTSFDSKRYFISNSLLNKTKLKNFGVKIFDVKSLGSFLKSI 473

Query: 1616 ---GIEEYGPKILLQIISSLCRTEN 1637
                I +  P+++ Q+++ L +T+N
Sbjct: 474  HYQNIVKSNPQLIFQVLAHL-KTKN 497


>gi|440754017|ref|ZP_20933219.1| putative transcriptional regulator domain protein [Microcystis
            aeruginosa TAIHU98]
 gi|440174223|gb|ELP53592.1| putative transcriptional regulator domain protein [Microcystis
            aeruginosa TAIHU98]
          Length = 1545

 Score = 98.2 bits (243), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 127/506 (25%), Positives = 227/506 (44%), Gaps = 70/506 (13%)

Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI 1239
            E+   ++    L  AL  L+ +LY++   F+ EL+QNADD I  E  +  + F++     
Sbjct: 14   ETGQSERDAENLANALTRLTGDLYTETERFIFELLQNADD-IPNETGQVNVLFVILREHF 72

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
            ++L+N + F + N+ A+  +  STK  +S    G KGIGFKSVF  ++   I+S  F  K
Sbjct: 73   LILHNGKPFDSTNVDAISSISKSTKVNNSEQ-TGYKGIGFKSVFADSECVYINSGNFSFK 131

Query: 1300 FDTSEGQIGFV--LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA- 1356
            FD ++ +   +   P  + P  ID       + PV+++   +      P  T    G A 
Sbjct: 132  FDRNDKRHKNLEKTPWQIKPIWIDK-----KEYPVEIQQ--YQQFFTFPVATAIYVGQAK 184

Query: 1357 ----MNNIVDMFSDLHPSLLLFLHRLQCI--MFRNMLNDSLVVIRKKIVGDGIIKVSCGE 1410
                   +  +F D  P L+LFL  +Q I  +  N +N   V I K+  G G  K+ C  
Sbjct: 185  IKEYKTKLEKLFQD--PRLILFLRHVQNINVIGLNNINIVDVSITKEKQG-GKYKI-CRN 240

Query: 1411 DKMT-WFV----------ASQKLRAGVIRPD----VKTTEIALALTLQES-----NEGNY 1450
            +++T W V             K+ +    PD    ++ ++I  A  L ++     NE + 
Sbjct: 241  NQITYWLVQDFDFNVPKQVKDKMESDKKIPDKLKLIERSKITFAAQLIDNKLVAVNEED- 299

Query: 1451 GPLLYQQPVFAFLP--LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALF--- 1505
                    +F +LP  ++ Y L F++  DF+  ++R+E+  ++PWN +L +         
Sbjct: 300  ------SVLFTYLPTNVKEYKLPFLVNADFLTTANRQEIHHDNPWNIFLFANIAYYSLTW 353

Query: 1506 ---VSAERSFCD-----LPC-FRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRM 1556
               V+ +  + +     LP  F  N AK+ + ++SF     +     + +P    +KL  
Sbjct: 354  IAEVAKKTEYRNQVTNILPSKFSLNLAKSET-FISFNQEFDQALELIAFIPSEQDNKLLT 412

Query: 1557 SNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSK--DIVLSDSLARA 1614
             N  IL+        P + +R   D     + + LL K     F  K  D+    S  ++
Sbjct: 413  VNNSILDETEITKIIPVETVRTSFDSKRYFISNSLLNKTKLKNFGVKIFDVKSLGSFLKS 472

Query: 1615 L---GIEEYGPKILLQIISSLCRTEN 1637
            +    I +  P+++ Q+++ L +T+N
Sbjct: 473  IHYQNIVKSNPQLIFQVLAHL-KTKN 497


>gi|171679363|ref|XP_001904628.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939307|emb|CAP64535.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1843

 Score = 95.9 bits (237), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 155/356 (43%), Gaps = 66/356 (18%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTL--TFILQESGIVVLNNEQGFSAENIRA 1255
            LS+ L +  S F+ EL+QNADD  Y     P L  T+  ++  ++V +NE GF  +N+RA
Sbjct: 1    LSENLTTTTSRFIYELLQNADDLDY-GGASPQLHFTYDAEKRHLLVESNEIGFKKKNVRA 59

Query: 1256 LCDVGNSTKKGSSAGY------IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
            +C    S+ K +  GY      IG KG+GFKSVF+V  +  I S  +  +FD  +G++G 
Sbjct: 60   ICSAAESS-KNTQRGYGIERRLIGEKGLGFKSVFKVASSVWIRSGHYSFRFD-RDGELGR 117

Query: 1310 VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT-----KFSEGIAMNNIVDMF 1364
            + P     F  + +   L+            T I L   +     +      M+ ++   
Sbjct: 118  IRPIWDDDFPAEEYVAGLT------------TAILLKIASPTHDRQTGTSDPMDLVLQGL 165

Query: 1365 SDLHPSLLLFLHRLQCI---------MFRNMLNDSLVVIRKKIV---------GDGIIKV 1406
             +L P  LLFL + + +           R+ +  S    +   V          + ++ V
Sbjct: 166  ENLKPGHLLFLSKAKEVKVGKVVGSGALRSFIRGSTATAKPTAVFKADHIPPHRNTLLPV 225

Query: 1407 SC-GEDKMTWFVASQKL---------RAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQ 1456
            S    D +   VA + L         R G +  +V T    L + L E  E         
Sbjct: 226  SLYHNDVINQQVAFRHLVTDLPDTPERPGCLESEV-TLVFPLKIPLDEPPES-------- 276

Query: 1457 QPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSF 1512
            Q V+AFLP+R YG K  + GDF+L  SRE +  N  WN+ L+       V+A   F
Sbjct: 277  QMVYAFLPIRDYGFKLSIHGDFLLVPSRESIIDNV-WNRALVEGVCRALVNAIPGF 331


>gi|288553765|ref|YP_003425700.1| putative transcriptional regulator [Bacillus pseudofirmus OF4]
 gi|288544925|gb|ADC48808.1| putative transcriptional regulator [Bacillus pseudofirmus OF4]
          Length = 1676

 Score = 94.7 bits (234), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 92/370 (24%), Positives = 160/370 (43%), Gaps = 62/370 (16%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESGIVVLNNE 1245
            Q      +L+ LS +LY+  + F+ EL+QNADD++   N VE  +  I QE  ++  ++ 
Sbjct: 22   QAVNQASSLNALSGDLYTDANRFIYELLQNADDSLEKGNPVEVWVKIIGQE--LIFGHSG 79

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---- 1301
            + F   ++  +C+V N TKK   +   G KGIGFKSVF  +D  +I+SNG + +FD    
Sbjct: 80   KAFDKRDLMGICNVNNGTKKNDISK-TGYKGIGFKSVFGQSDKVKIYSNGEYFRFDRDYD 138

Query: 1302 ------------TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT 1349
                         +E    F  P  + P           +D V L        IR   ++
Sbjct: 139  FRWQWEYSREEWETESGRKFQYPWQIIPI-------YTKEDEVSL-------SIREFIKS 184

Query: 1350 K---FSEGIAMNNIVDM-----FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGD 1401
            K    +  I ++N+ D+     +   + ++ LFL  +  I F    + S+++ R     +
Sbjct: 185  KDINVATIITLHNLADVEEAINYLTKNTNMFLFLKNISKINFELPQSSSILIDRS---SE 241

Query: 1402 GIIKVSCGEDKMTWFVASQKLRA-----GVIRPD-------VKTTEIALALTLQESNEGN 1449
             IIK+        W + S  L+       ++  D          TEI + L  +  +   
Sbjct: 242  NIIKLENKHSNSEWIIKSTTLKVPESMRAILNEDRNIPEKLAAATEIEITLAAKIIDNEI 301

Query: 1450 YGPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
                  +  ++++LP     Y L  ++   F+  ++RE +  +S WNQW+ +  P   V 
Sbjct: 302  VKLSKEENLLYSYLPTGESKYELPVLVNTSFLTAANRESLHSDSKWNQWIFNSLPVEIV- 360

Query: 1508 AERSFCDLPC 1517
              R   +L C
Sbjct: 361  --RWIAELVC 368


>gi|300727815|ref|ZP_07061196.1| hypothetical protein PBR_0692 [Prevotella bryantii B14]
 gi|299774914|gb|EFI71525.1| hypothetical protein PBR_0692 [Prevotella bryantii B14]
          Length = 1264

 Score = 94.4 bits (233), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 161/378 (42%), Gaps = 66/378 (17%)

Query: 1177 SNMESNMLKK--------QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEP 1228
            SN  SN+ KK        Q      +L  LS  +Y+++  F+ EL+QNA D    E+   
Sbjct: 3    SNFISNLQKKRCNYNHPDQATSQANSLILLSSGIYTEEERFVFELLQNAVDAHNEESEIL 62

Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
             +  I+++   + L+N + F+  +I  LCDVGN  K       IG KGIGFKSVF  +  
Sbjct: 63   DVKMIIKDGYFIFLHNGEAFTERDIEGLCDVGNGNKM-KDVKKIGYKGIGFKSVFMCSTN 121

Query: 1289 PEIHSNGFHVKFDTS-----------EGQIG-------FVLPTLVPPFNIDMFCRLLSKD 1330
              + S G+  KFD S           +   G       +++P  + P        + +  
Sbjct: 122  VTVQSGGYCFKFDKSHWDNYWNEHWNDNDFGTRDRDKKYLMPWQIIP--------IETTP 173

Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDS 1390
            P+ L +  +N    +      S       I+ + S      LLFL      M       +
Sbjct: 174  PILLSNYEYNVATYIKINGTDS---LEQKILKLLSS--SQFLLFLMSDNIRMTFESNGKT 228

Query: 1391 LVVIRKKIVGDGIIKVSCGEDKMTWFVASQK-----------LRAGVIRPD----VKTTE 1435
               I K+   + ++  + G+++  W + + K           + A +  PD    V T +
Sbjct: 229  KCWIEKQKQNNQVVLSTNGKEESRWLIHTNKNVQVPPELKETINADINTPDKLKNVDTFD 288

Query: 1436 IALALTLQESNEGNYGPL--LYQQPVFAFLP----LRTYGLKFILQGDFVLPSSREEVDG 1489
            ++ A+ L E      G L  L Q  V+ +LP      + G  F++  +F+  + R+++  
Sbjct: 289  LSFAVALDEK-----GKLKRLEQAVVYTYLPTSFCFGSEGFPFLVNANFITDAGRQQLHK 343

Query: 1490 NSPWNQWLLSEFPALFVS 1507
            +S WN+ + S+ P+ +++
Sbjct: 344  DSEWNKLIFSKIPSEYLT 361


>gi|347535876|ref|YP_004843301.1| hypothetical protein FBFL15_0964 [Flavobacterium branchiophilum
            FL-15]
 gi|345529034|emb|CCB69064.1| Hypothetical protein FBFL15_0964 [Flavobacterium branchiophilum
            FL-15]
          Length = 1505

 Score = 93.6 bits (231), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/366 (25%), Positives = 162/366 (44%), Gaps = 81/366 (22%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ 1246
            Q   L  ++  L+ ++YS+D  F+ EL+QNADD    E+ E  L+  + +  +V+ +N  
Sbjct: 141  QVVNLANSVESLATDIYSEDKRFIYELIQNADDAALDEDSE--LSIDILDKYVVISHNGA 198

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE----IHSNGFHVKFDT 1302
             F++ +IR LC +G  T K + A   G KGIGFKSVF     PE    + +     +FD 
Sbjct: 199  PFNSRDIRGLCSIGLGT-KTNDATKTGYKGIGFKSVF---GQPEGIVYVKTENTLFRFDK 254

Query: 1303 SEGQI-GF-----------------------VLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
               ++ G+                       ++P L    N +    +L+     ++   
Sbjct: 255  EYARLKGWNSKWGNKRDWEEKNGVTFQCPWQMMPILSETANDNETDLILNNKIFSVK--- 311

Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
              T I++    K  E     NI   F D     LLFL R+  +  +   N   + +RKK 
Sbjct: 312  --TAIKIDDSKKLYE-----NINSFFGD--AKFLLFLRRISRVELK--FNSHKLELRKKK 360

Query: 1399 ---VGDGIIKVSCGEDKMTWFVAS----------QKLRAGVIRP----DVKTTEIALALT 1441
               +G+ +   S  E+   W+V +          ++L++    P     ++ T I+ A+ 
Sbjct: 361  NSEIGNIVTLHSNKENISNWYVKNWIHNIPLEIQKELKSDPKTPKKIQSMEKTVISFAIN 420

Query: 1442 LQES-------NEGNYGPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSP 1492
            + E         EGN        P++++LP   + Y + FI+  +F+L +SRE++  N  
Sbjct: 421  INEKFDEIKLLEEGN-------SPLYSYLPTTEKEYNIPFIVNCNFLLDASREKIHKNRK 473

Query: 1493 WNQWLL 1498
            WN+WL 
Sbjct: 474  WNEWLF 479


>gi|323452221|gb|EGB08096.1| hypothetical protein AURANDRAFT_64375 [Aureococcus anophagefferens]
          Length = 1507

 Score = 92.8 bits (229), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 3/124 (2%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ-GFS 1249
            L  A   LS +LY    HFL EL QNADD  +     P L   L  S     +N++ GF 
Sbjct: 350  LAAAAERLSVDLYEHRDHFLHELFQNADDARHDGVDAPALEVGLSGSWFFASSNQRPGFD 409

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS-EGQIG 1308
            A ++RA+CDV  STK G +A   G KGIGFK+VF + D P + SNGF   FD + +G +G
Sbjct: 410  AADVRAICDVNASTKAGDAA-TTGHKGIGFKAVFAMCDRPCVLSNGFAFYFDAARDGALG 468

Query: 1309 FVLP 1312
             V P
Sbjct: 469  LVAP 472


>gi|332671971|ref|YP_004454979.1| ATP-binding domain-containing protein [Cellulomonas fimi ATCC 484]
 gi|332341009|gb|AEE47592.1| ATP-binding region ATPase domain protein [Cellulomonas fimi ATCC 484]
          Length = 1453

 Score = 92.8 bits (229), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 87/331 (26%), Positives = 150/331 (45%), Gaps = 40/331 (12%)

Query: 1196 HCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRA 1255
            H L +  Y+    +  EL+QNA D     N    + F L +  ++V +N  GFS + IR+
Sbjct: 45   HLLVRNQYT--GRYPFELLQNAADAAADTNHAGHVRFHLSDQALLVADNGIGFSPDAIRS 102

Query: 1256 LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE--GQIGFV--- 1310
            +C +G STK  S    IG KG+GF+SV  +TD P++ +      FD +    Q+  +   
Sbjct: 103  ICRLGRSTK--SPTKSIGYKGLGFRSVGEITDRPQVITGAVRFGFDENRVFEQVAALTGG 160

Query: 1311 ------LPTLVPPFNI---DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI- 1360
                  LPT   PF +   D+   +   D  +L +  + T +RLP RT    G+  +++ 
Sbjct: 161  LPADQKLPTYAFPFTLSEADLGEDIALVD--ELLTGGYTTVMRLPLRT----GVDRSHVE 214

Query: 1361 VDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV-VIRKKIVGDGIIKVSCGEDKMTWFVAS 1419
              +  ++   L+LFL  L  +       D +  V R+   G   + +  G+D   W V +
Sbjct: 215  AQLLRNVDAQLMLFLGSLDRLEVSGTARDFVAEVARQDTDGAQRVLMQVGDDLTQWLVYA 274

Query: 1420 QKLRAG--VIRP-----DVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLK 1471
            ++L     ++ P     +V+T   A+AL L E    +        P+  + P     G  
Sbjct: 275  RQLPVARELVAPMDGWQEVETVTAAVALPLAEDGRPSTS---THYPMHVYFPTEEKTGFT 331

Query: 1472 FILQGDFVLPSSREEVDGN---SPWNQWLLS 1499
            FI   D+ L   R ++ G    +P+N+++ +
Sbjct: 332  FIAHADWALHLDRRQLSGTPDATPYNEFMTA 362


>gi|407917132|gb|EKG10453.1| hypothetical protein MPH_12311 [Macrophomina phaseolina MS6]
          Length = 207

 Score = 91.7 bits (226), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 52/100 (52%), Positives = 68/100 (68%), Gaps = 3/100 (3%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-NEQGFSAENIRAL 1256
            +S++L S   HFLLEL+QNADDN Y ++++PTL     E     L+ NE+GFS E++ A+
Sbjct: 50   ISEDLNSTSFHFLLELLQNADDNKY-DDIKPTLALAYTEDKHFRLHCNEKGFSPEDVEAI 108

Query: 1257 CDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
            CD+GNSTK  SSA   G KG+GFKSVFRV  A  I S  +
Sbjct: 109  CDIGNSTKL-SSASTTGEKGVGFKSVFRVARAVWIVSGEY 147


>gi|159479534|ref|XP_001697845.1| hypothetical protein CHLREDRAFT_176946 [Chlamydomonas reinhardtii]
 gi|158273943|gb|EDO99728.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1218

 Score = 90.9 bits (224), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 38/155 (24%)

Query: 1158 AALIVESIRRDEFGLGPNISNMES-NMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
             A +VE IR   +G+    S+ ++   ++   AR+G A+  LS +LYS+D HF++EL+QN
Sbjct: 254  GAEVVERIRHT-YGVDKCYSDPDAAETIRCNTARVGNAVEQLSAQLYSKDLHFVMELLQN 312

Query: 1217 ADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKG 1276
            A+DN YP  V P L F+L    +VVLNNE+G                             
Sbjct: 313  AEDNSYPAGVVPCLEFVLTPDSLVVLNNEEG----------------------------- 343

Query: 1277 IGFKSVFRVTDAPEIHSNGFHVKFD-TSEGQIGFV 1310
                  F+V+DAPE+HS GFHV F  T+ G +   
Sbjct: 344  ------FKVSDAPEVHSGGFHVSFSRTAHGNLALA 372



 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 53/182 (29%), Positives = 79/182 (43%), Gaps = 21/182 (11%)

Query: 1462 FLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFREN 1521
            FL    YGL+F++  DF++ +SRE VD  S WN  L +  P LF S         C    
Sbjct: 381  FLKKLRYGLRFMVNADFIVTTSREAVDNGSSWNHMLAARIPQLFAS---------CLPLP 431

Query: 1522 PAKAASVYMSFVPLVGEVHGF----FSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLR 1577
                  V +    L G V GF    F+ LP+ I +  R S   +        +P    LR
Sbjct: 432  DEPHVGVRVGVALLCGFVCGFVGDLFAPLPQRIAAVCR-SAVRLPTTCGAALSPATHQLR 490

Query: 1578 GWND-RAHSLLPDILLQKHL---GLGFLSKDIV---LSDSLARALGIEEYGPKILLQIIS 1630
              +D R   LL    +Q  L   GL F+S+ +     S  L   LG++++  + L+ ++ 
Sbjct: 491  VCHDPRVRELLAGPGMQHALRAAGLYFVSEAVAPLHASSRLRALLGVQDFHAQDLVALLE 550

Query: 1631 SL 1632
             L
Sbjct: 551  ML 552


>gi|171686084|ref|XP_001907983.1| hypothetical protein [Podospora anserina S mat+]
 gi|170943003|emb|CAP68656.1| unnamed protein product [Podospora anserina S mat+]
          Length = 420

 Score = 89.7 bits (221), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/111 (45%), Positives = 74/111 (66%), Gaps = 2/111 (1%)

Query: 1202 LYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGN 1261
            +Y + SHFL EL+QN DDN Y  + +PTLT   +   +   +NE GF+ E++ A+C +GN
Sbjct: 1    MYRRPSHFLWELIQNVDDNNYVAH-DPTLTITYKIRVLRFDSNETGFTMEDVEAICSMGN 59

Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
            S+K  SS+  IGRKGIGFKSVF+V ++  I S  +  +FD+ E  +G ++P
Sbjct: 60   SSKTRSSSTRIGRKGIGFKSVFKVAESVYIASGCYSFRFDSRES-LGMMIP 109



 Score = 49.3 bits (116), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 54/121 (44%), Gaps = 15/121 (12%)

Query: 1459 VFAFLPLRTYGLK----FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
             FAFLP+R YG +    F+   DF L ++RE+VD +S  NQ  L E P  F+ A  +   
Sbjct: 244  TFAFLPIRDYGFRVRQAFVCHADFDLVANREDVDMSSARNQVFLHELPKAFLRAVHAL-- 301

Query: 1515 LPCFRENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCK 1574
                  N       +  ++P       FF  LP  I++ L  S   +LE +      P K
Sbjct: 302  ------NSGHLRYTWPYYLPF-PTTDSFFEPLPGDIITLL--SRTPMLESDAGHLMAPEK 352

Query: 1575 V 1575
            +
Sbjct: 353  L 353


>gi|326672132|ref|XP_003199605.1| PREDICTED: hypothetical protein LOC100537266 [Danio rerio]
          Length = 1057

 Score = 88.6 bits (218), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/189 (31%), Positives = 99/189 (52%), Gaps = 3/189 (1%)

Query: 152 LIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQ-MQEVPSLHKLMVTEARINAFIHCFVGV 210
           L   GE V+  +V   V   LQV S   +     + +P+L  L  T   IN +I     V
Sbjct: 585 LATEGEHVTIERVIGRVCSLLQVCSLRDMNIDPRRHLPALQDLQRTLKEINLYIESVEAV 644

Query: 211 RQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDV--DVVFKIATNDVI 268
             + +LY+L +A+   +  ++FEEL LGP  + PLI   F + ++   D V +I+T D++
Sbjct: 645 SSVCTLYELGQALANLKNKKRFEELHLGPLCKLPLIHRMFRIDANTKDDDVHQISTVDIL 704

Query: 269 VCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVT 328
             L  Y   +K   + + EFL  +A + S  S  +LG+RIQ+LG+ I+ + +A  S  + 
Sbjct: 705 RSLRMYRRVNKVPRVDLAEFLKHLADQYSCESPYELGIRIQSLGLPISTLNKASASDYLL 764

Query: 329 LKKFLKELQ 337
           ++K  + +Q
Sbjct: 765 MEKAKETIQ 773


>gi|386783697|gb|AFJ24743.1| hypothetical protein, partial [Schmidtea mediterranea]
          Length = 714

 Score = 87.4 bits (215), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 122/524 (23%), Positives = 226/524 (43%), Gaps = 79/524 (15%)

Query: 1414 TWFVASQKLRAGVIRPDV--------KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
            +WF+ +Q++   ++ P++          TE+++A+ L E         L   P FAFLP+
Sbjct: 41   SWFLRTQQI--PILEPEILRISPRNLSVTELSVAIPLAEQE-------LEDCPTFAFLPI 91

Query: 1466 RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKA 1525
            R+YG +FI+  DF +P+SRE++    PWN+WL+ + P +F       C L     +  + 
Sbjct: 92   RSYGFRFIINADFSIPASREDISNECPWNRWLIEQVPLVFTE-----CLLEFTSRDHVQN 146

Query: 1526 ASVYMSFVPLVGEV---HGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDR 1582
                +  +PL GEV     F   +P++   + ++     L   N +  PP +++    D 
Sbjct: 147  TIRAIKCIPLPGEVPTDSSFHVLVPKI---RAQLVQLAWLPDRNGKLHPPSQLIVESVD- 202

Query: 1583 AHSLLPDILLQKHLGLGFLSKDIVLS--DSLARALGIEEYGPKILLQIISSLCRTENGLR 1640
              S L   L  + L L  L   I  S  + ++RALG    G + L   +  L R    L+
Sbjct: 203  -DSSLTSALCNR-LSLTLLHPVIANSSDERVSRALG--WLGCQTL--SLQHLGRLLTALQ 256

Query: 1641 SMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPL-----SDGTFSSV 1695
               ++     L +   I+ HS  +          LI  L  +P  P+     +   F S+
Sbjct: 257  PEDIA--DPVLMKSIIIAIHSGLERDPSIATCNRLIAQLLSMPIWPVLSRSGAPDRFVSI 314

Query: 1696 DEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSA--SAVDKSSLGVISVDNLNR 1753
            ++  + +    S +      + +   C +L  +SPA +++  ++ +K  + V S      
Sbjct: 315  NQSALPIFIPPSHWPNTHNNDLYS--CLRL-MISPAFIASFKTSANKKHINVAS-----S 366

Query: 1754 MLLKIGVQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVERE 1813
            +L ++GV +L+++ ++   I+P +S+E  +   +N +     F++ H     P+ + +R 
Sbjct: 367  VLQRLGVHELNSNTMIDHAIIPRLSNECQS---RNWLIAAAQFLVDHCH---PSQYRDR- 419

Query: 1814 FIVSELRKKAFVLTNHGFKRPSEIPIHFGKEF---------GNPVSVNMLIHDIDIKWYE 1864
              V  L   A V  NH   + +    H+   +             +V  LIH+  I +  
Sbjct: 420  --VIPLLSFATVSYNHSNGKMNCTITHYSDLYKVHLPNQLEKTMTTVLPLIHN-RIDYTV 476

Query: 1865 VDITYLKHPAKESLSCGLVKWRKFFKEIGITDFVQVVQVDKDVA 1908
            +D  YL  P +        +WR+FF   G ++    V  D   A
Sbjct: 477  LDDAYLDCPIQPE------QWREFFCAAGTSETFYRVAHDNGNA 514


>gi|110669161|ref|YP_658972.1| hypothetical protein HQ3280A [Haloquadratum walsbyi DSM 16790]
 gi|109626908|emb|CAJ53377.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
          Length = 916

 Score = 87.0 bits (214), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 89/342 (26%), Positives = 141/342 (41%), Gaps = 72/342 (21%)

Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESG-IVVLNNEQGFSAENIRALCDVGNSTKK 1265
            +HF LEL QNADD    E     L  + +E G + V ++  GF  + ++A+C  G S K 
Sbjct: 49   THFPLELTQNADD----EGATNLLFEVDEEKGELYVFDDGAGFDPDGLKAVCQQGQSPKD 104

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
                  IG  GIGFKS+F ++D   IHSNG+H  F   +      L  + P +       
Sbjct: 105  PRQQ--IGFMGIGFKSMFEISDNVRIHSNGYHFGFSIDDEDEDDGLDAVTPHW------- 155

Query: 1326 LLSKDPVQ--LESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF 1383
            +  + P Q  ++ K + T I    + +  +G+          +L PS+ LFL  L  I  
Sbjct: 156  VPEEPPEQRTVKQKEYTTRIVASIKNESDDGVEQIKQALEEDNLSPSVFLFLDSLSEIEI 215

Query: 1384 RN--------------------------------------------MLNDSLVVIRKKIV 1399
            R+                                            + +D LV IR+   
Sbjct: 216  RSRQDTSFSRSLYGGHVDTFDEPEGVSEARELYKKYAEGDPDPENKLSDDDLVQIRRTGN 275

Query: 1400 GDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNY-----GPLL 1454
             DG  +     D   W +  + LR   +R +++T+++ +A  +  S +G       G  +
Sbjct: 276  SDGEEQWVVFRD--IWDI-DKSLRRPRLRKNLETSDLFIAFRI--SQDGRLIELVDGGSV 330

Query: 1455 YQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEVDGNSPWN 1494
               PV ++LPL      + F++  DF L  SRE +  NS WN
Sbjct: 331  RISPVHSYLPLSELDVDIDFLIHADFDLNPSREGIRSNSEWN 372


>gi|150389397|ref|YP_001319446.1| putative transcriptional regulator [Alkaliphilus metalliredigens
            QYMF]
 gi|149949259|gb|ABR47787.1| putative transcriptional regulator [Alkaliphilus metalliredigens
            QYMF]
          Length = 1675

 Score = 86.7 bits (213), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 88/365 (24%), Positives = 163/365 (44%), Gaps = 61/365 (16%)

Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAE 1251
              +++ LS +LY+    F+ EL+QNADD+   ++       I  E  +VV ++ + F++ 
Sbjct: 27   ASSINALSSDLYTDSKRFIYELLQNADDSYLDDHDVKVWIKIFGEK-LVVAHSGKPFNSR 85

Query: 1252 NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS-------- 1303
            ++R +C++ N TKK S     G KGIGFKSVF  +D   I S   + +FD+S        
Sbjct: 86   DLRGICNIANGTKK-SDITKTGYKGIGFKSVFGQSDKVTIFSKKEYFRFDSSYPFKWIWD 144

Query: 1304 EGQIG--------FVLPTLVPPFNIDM------FCRLLSKDPVQLESKCWNTCIRLPFRT 1349
            + ++         F  P  + P + D+        + L  + V +      T I++    
Sbjct: 145  DSKVNWENANDREFQFPWQIIPIHTDVNEIDESINKFLEDNQVNVA-----TIIQMKNVN 199

Query: 1350 KFSEGI-AMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSC 1408
            +  + +  ++  ++MF        LFL ++  I F  M  DS+ +   +   D I   + 
Sbjct: 200  ETRKAVQELSQNLNMF--------LFLKKISEINFDVM--DSVTIKINRTEKDKITLKNG 249

Query: 1409 GEDKMTWFVAS----------QKLRAGVIRPD--VKTTEIALALTLQESNEGNYGPLLYQ 1456
             + K  W + +          ++L++    P+  + T  I L L  +  ++G       +
Sbjct: 250  RDLKFVWLINTISLTVPTALKKELQSERNIPEKLLGTDIIELTLAAKVGSDGITRLSEQE 309

Query: 1457 QPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS-------EFPALFVS 1507
              ++++LP   R Y L  ++   F+  S+RE +  +S WNQWL          + +  V+
Sbjct: 310  TMLYSYLPTDERNYSLPILINTSFLTTSNRESLHSDSKWNQWLFKTIAIEIFHWISTLVT 369

Query: 1508 AERSF 1512
             E SF
Sbjct: 370  TEYSF 374


>gi|358367094|dbj|GAA83713.1| hypothetical protein AKAW_01828 [Aspergillus kawachii IFO 4308]
          Length = 1359

 Score = 84.7 bits (208), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 90/172 (52%), Gaps = 17/172 (9%)

Query: 1156 EDAALIVESIRRDEFGLGPNI---------SNMESNMLKKQHARLGRALHCLSQELYSQD 1206
            E A  +V SI R    L   I           +E  MLKK    +G ++  L++++Y+  
Sbjct: 5    ESARELVRSIARGRGYLSEEILERMDQDTRREVEEAMLKKDEM-IGSSVLTLAKDVYNSS 63

Query: 1207 SHFLLELVQNADDNIYPE----NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
            + F+ EL+QNADDN Y +    +  P ++F +    IV   NE GF+ E++ A+C++G S
Sbjct: 64   ARFVFELLQNADDNSYSKAKSRSEAPYVSFRVYPRQIVFECNEDGFTPEDLIAICNIGKS 123

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN--GFHVKFDTSEGQIGFVLP 1312
            +K  S  GYIG K IGFKSVF V     I S    F+++    +  +G + P
Sbjct: 124  SKT-SVQGYIGEKDIGFKSVFMVAWKVLIQSGELSFYLQHRIGDSGMGMISP 174


>gi|113971542|ref|YP_735335.1| putative transcriptional regulator [Shewanella sp. MR-4]
 gi|113886226|gb|ABI40278.1| putative transcriptional regulator [Shewanella sp. MR-4]
          Length = 1687

 Score = 84.3 bits (207), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 148/342 (43%), Gaps = 39/342 (11%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ 1246
            Q      +L  LS +LY+    F+ EL+QNADD+  P +       +   S +VV ++ +
Sbjct: 22   QATNQASSLVSLSTDLYTDSKRFIYELLQNADDSAEPGHSVKVWIKVFGNS-LVVAHSGR 80

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS--- 1303
             FS  ++R +C+V   TKK  S+   G KGIGFKSVF  +D   I +   + +FD S   
Sbjct: 81   EFSPRDVRGICNVNYGTKKEDSSK-TGYKGIGFKSVFGQSDKVTIFTANEYFRFDASYPF 139

Query: 1304 -------------EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTK 1350
                         +    FV P  + P   D+   +L   P++L  K  N  +       
Sbjct: 140  NWIWDGTKEEWEAKSDREFVNPWQIIPIYTDVREGILK--PIELYLKTINANVATIIELN 197

Query: 1351 FSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE 1410
              +G  +  + ++  D++  + LFL  +  I F ++ N   + I +      ++K    E
Sbjct: 198  -DKGDVLLALKELSEDVN--MYLFLKSISSINF-DIDNRKHIEIDRSESNRIVLKDGQSE 253

Query: 1411 DKMTWFVAS------QKLRAGVIR----PD--VKTTEIALALTLQESNEGNYGPLLYQQP 1458
             K  W  +S        L+  +      PD  +    I L L  +   +G       ++ 
Sbjct: 254  QK-KWLTSSITLNVPSDLKVALQDERNIPDKLLNAETIELTLAAKTGADGIIKLSQNERR 312

Query: 1459 VFAFLPLR--TYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
            ++++LP     Y L  ++   F+  ++RE +  +S WNQWL 
Sbjct: 313  IYSYLPTEETKYSLPVLVNTSFLTNANRENLHADSKWNQWLF 354


>gi|159466910|ref|XP_001691641.1| hypothetical protein CHLREDRAFT_170531 [Chlamydomonas reinhardtii]
 gi|158278987|gb|EDP04749.1| hypothetical protein CHLREDRAFT_170531 [Chlamydomonas reinhardtii]
          Length = 1333

 Score = 84.0 bits (206), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 57/101 (56%), Gaps = 22/101 (21%)

Query: 1213 LVQNADDNIYPENV---------EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
            L+QNADD  YPE+           PT++  L  +G+ V  NE+GFS  ++RALCD G   
Sbjct: 513  LLQNADDCSYPESELAALPGGGEPPTMSLRLSGAGLAVATNERGFSEADVRALCDTGE-- 570

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
                       KGIGFKSVF ++DAP I+SNGF   FD  +
Sbjct: 571  -----------KGIGFKSVFALSDAPAIYSNGFAFGFDAQD 600


>gi|340353805|ref|ZP_08676610.1| hypothetical protein HMPREF9144_2422, partial [Prevotella pallens
            ATCC 700821]
 gi|339608492|gb|EGQ13386.1| hypothetical protein HMPREF9144_2422 [Prevotella pallens ATCC 700821]
          Length = 1310

 Score = 82.8 bits (203), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 93/367 (25%), Positives = 170/367 (46%), Gaps = 45/367 (12%)

Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTLTFILQESGIVVLNNEQGFSA 1250
             ++L+ LS  +Y+++  F+ EL+QNA D     E     +   ++++ ++ ++N   F+ 
Sbjct: 27   AQSLNLLSSGIYTEEERFVYELLQNAIDAFVDVEGESLKIRVEIRDNYLLFMHNGAAFTE 86

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
             +I  L DVGN +K  ++   +G KGIGFKSVF  +    + S  F  KFD  E  I + 
Sbjct: 87   ADIEGLSDVGNGSK-AANVKKVGYKGIGFKSVFMKSTQVFVKSGPFCFKFD-KEDSIDY- 143

Query: 1311 LPTLVPPFNI---DMFCRLLSKD---PVQLESKCWN--TCIRLPFRTKFSEGIAMNNIVD 1362
            +P+ +P   +   D+  +++  +   P ++E + +N  T IRL           +  + +
Sbjct: 144  MPSDMPDGKLSPEDIPWQIIPIEAIVPFEVEDEAFNVITYIRLT-----DTASLVPKVDE 198

Query: 1363 MFSDLHPSLLLFLHRLQC-IMFRNMLNDSLV--VIRK------------KIVGDGIIKVS 1407
            +   L+   LLFL+     I F N  +D+LV  V RK            K+V   +IK S
Sbjct: 199  LLKGLN--FLLFLNAQDVSIAFYN--SDTLVRKVARKENNGEISLLIDDKVVSRWLIKSS 254

Query: 1408 C--GEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
                  ++  ++A           D    E++ A+++ +  E +    +     + FLP 
Sbjct: 255  TISVTKEVKQYIAQNPASVPQKLKDASEIEVSFAISVNDKGELH---KVENAVAYTFLPT 311

Query: 1466 R--TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS--AERSFCDLPCFREN 1521
                 G  F++  +F+  + R+++  +S WN+ +  E P L++   AE S       R  
Sbjct: 312  SYGNLGAPFLINANFITDAGRQQLQQDSEWNKLIFKEAPTLYLKWIAELSHTYKTYHRVL 371

Query: 1522 PAKAASV 1528
            P+K A V
Sbjct: 372  PSKVAKV 378


>gi|408492908|ref|YP_006869277.1| ATP-binding protein, putative [Psychroflexus torquis ATCC 700755]
 gi|408470183|gb|AFU70527.1| ATP-binding protein, putative [Psychroflexus torquis ATCC 700755]
          Length = 1254

 Score = 82.8 bits (203), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 83/351 (23%), Positives = 152/351 (43%), Gaps = 60/351 (17%)

Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAE 1251
              +L  LS ++Y+    F+ EL+QNADD     + E  +   + ++ +++ +  + FS  
Sbjct: 27   ANSLESLSSDIYTDSKRFIYELIQNADD-ASSNSRELDILLKVVDNYLIISHQGEKFSEI 85

Query: 1252 NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL 1311
            +I ++C VG+  KKG +    G KGIGFKSVF  +D  +I+S  +  KFD S  Q     
Sbjct: 86   DIESICSVGDGNKKGDT-NKTGFKGIGFKSVFSHSDYVKINSGNYCFKFDKSHWQ----- 139

Query: 1312 PTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT-----------KFSEGIAMNNI 1360
                  +N +   +   +D  + + K   T I++P++             F +   ++ I
Sbjct: 140  ----GHWNSEWQNQKEWRDGREKKEK--ETTIKMPWQIIPIWSEIEQLFSFVKSYNVSTI 193

Query: 1361 VDM--FSDLHPSLLLFLHRLQCIMFRNMLNDSLVV------IRKKIVGDGIIKVSCGEDK 1412
            +     S L   LL      Q ++F    N  + V      + +KI  +  +K+      
Sbjct: 194  IKFEDTSKLEKELLELFSNTQILLFLRCQNVKITVEGNKPFVIEKIKENNTVKLKQNNKI 253

Query: 1413 MT-WFVAS-----QKLRAGVIRPDVK---------TTEIALALTLQES-----NEGNYGP 1452
            ++ W + S      K    +I+ D +          TEI+LA+ + +      N+ N   
Sbjct: 254  LSEWLLKSFTFIVDKSTTELIKNDARIPKKLRLATKTEISLAVQIDKGKIKTINKDN--- 310

Query: 1453 LLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP 1502
                + +F +LP    Y   F++   F+  + R+ +  +  WN WL  + P
Sbjct: 311  ----RLIFTYLPTSVNYDFPFLVNASFITDAGRQHIHEDLQWNIWLFKQIP 357


>gi|323446263|gb|EGB02492.1| hypothetical protein AURANDRAFT_68829 [Aureococcus anophagefferens]
          Length = 1632

 Score = 81.3 bits (199), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 101/218 (46%), Gaps = 8/218 (3%)

Query: 127 PPQP-QKLNKEFLERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLGFQMQ 185
           P  P Q+ +   + RI +   K R+     GE V   ++   V     V S   LG  + 
Sbjct: 15  PVDPAQRADAAAVSRIAK---KLRNRACERGERVRLLELVDDVCSAHGVRSLRDLGLSVA 71

Query: 186 EVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPL 245
            V  L  L   E  +   I  FV  R I +L+DL   +C + G+  FE+L LGP L+QPL
Sbjct: 72  AVDELRSLRQLEGLLEIQICAFVESRFIATLHDLSHNLCAAHGVRAFEDLRLGPILKQPL 131

Query: 246 ISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSLASKEQLG 305
           +   F     +  V  +   DV+  L ++    K + + ++EFL+++A KR +AS   L 
Sbjct: 132 VQRKFYADEALTAVPALEVADVVRELEKF--RFKKRRVDLEEFLNWLAAKRGVASPRSLC 189

Query: 306 VRIQN--LGMHINFIREARKSQDVTLKKFLKELQPHHK 341
           + +    +G  I +  +A + +  +    LK+L+   +
Sbjct: 190 LTMSKYAIGTTIKYASQAARDEKDSRVAALKQLRDARR 227


>gi|153870496|ref|ZP_01999887.1| conserved hypothetical protein [Beggiatoa sp. PS]
 gi|152073038|gb|EDN70113.1| conserved hypothetical protein [Beggiatoa sp. PS]
          Length = 1020

 Score = 80.9 bits (198), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 91/341 (26%), Positives = 142/341 (41%), Gaps = 62/341 (18%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNN--------------- 1244
            E YS ++HF+ EL+QNADD      V+ T   FIL +SGI   +N               
Sbjct: 40   EKYSDNAHFIYELLQNADD------VKATKARFILNQSGIYFAHNGTIHFSITAPDCEEI 93

Query: 1245 -EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
              Q     +I A+  +GNS K  +    IG+ G+GFK+VF+ T  PEI+   F  K D  
Sbjct: 94   DTQNQCLGHINAITSIGNSNKYEAQ---IGKFGVGFKAVFQYTQTPEIYDPPFCFKIDR- 149

Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGIAMNNIVD 1362
                 F++P L           L ++ P   E +   T    PF   + +   A   IV+
Sbjct: 150  -----FIVPIL-----------LTTEHP---ERQPDETLFYFPFDNPQQTPHQAFQAIVN 190

Query: 1363 MFSDLHPSLLLFLHRLQCIMFRNMLNDS--LVVIRKKIVGDG--IIKVSCGEDKMTW--- 1415
              ++L   LL   +  +   + N  N    L  I+   V +    I++  G     W   
Sbjct: 191  KLTNLSYPLLFLRNLTEITWYCNNENQGKYLKTIKPLEVNNTHYHIELISGSYAPAWEPN 250

Query: 1416 -----FVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQ--PVFAFLPLR-T 1467
                    S+     V+  +      ++A  +  S  G    +L++Q  P + F P + T
Sbjct: 251  PDAPASTFSKNNHFLVLTDNTNNHPFSIAYLVNLSEHGQPIQILHEQTFPAYCFFPTKET 310

Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
              L FI+Q  F+L  SRE +   + WNQ L+     L  ++
Sbjct: 311  THLHFIVQAPFLLTDSREGIKQTNSWNQQLVKALAQLIANS 351


>gi|258514802|ref|YP_003191024.1| putative transcriptional regulator [Desulfotomaculum acetoxidans DSM
            771]
 gi|257778507|gb|ACV62401.1| putative transcriptional regulator [Desulfotomaculum acetoxidans DSM
            771]
          Length = 1672

 Score = 80.5 bits (197), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/345 (23%), Positives = 149/345 (43%), Gaps = 46/345 (13%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNE 1245
            Q      +L+ LS +LY+    F+ EL+QNADD+   +N +    +I +  + +V+ ++ 
Sbjct: 22   QAVNQASSLNALSADLYTDSKRFIYELLQNADDS--SQNNDTVKVWIKIFNNNLVIAHSG 79

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS-- 1303
              F+  +++ +C+V N TKK S     G KGIGFKSVF  +D   I++   + +FD S  
Sbjct: 80   SPFTVRDLQGICNVNNGTKK-SDLTKTGYKGIGFKSVFGQSDKVTIYTKNEYFRFDASYS 138

Query: 1304 --------------EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQ--LESKCWNTCIRLPF 1347
                               F  P  + P   +   + +S +P+   L     N    +  
Sbjct: 139  FEWKWAESKTIWENNNDRKFQFPWQIIPIYTE--AKEVS-EPINQFLNQNNANVATIIQM 195

Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVS 1407
            +      +A  N+       + ++ LFL  +  I F   + +S+ V   +   D I    
Sbjct: 196  KNVNETSLAAQNLSQ-----NLNMFLFLKNISEINFD--IAESVTVKIDRTKKDRITLQK 248

Query: 1408 CGEDKMTWFVASQKLRA-----GVIRPD-------VKTTEIALALTLQESNEGNYGPLLY 1455
                K+ W + +  L        V++ +       + T  I L+L  +  ++G       
Sbjct: 249  GNSSKVDWLINTVSLAVPNDVKAVLQDERNIPEKLLNTDSIELSLAAKVGSDGITKLSNQ 308

Query: 1456 QQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
            ++ ++++LP     Y L  ++   F+  ++RE +  +S WNQWL 
Sbjct: 309  EKLLYSYLPTDETKYSLPVLVNTSFLTAANRESLHADSKWNQWLF 353


>gi|359415302|ref|ZP_09207767.1| hypothetical protein CDLVIII_5898 [Clostridium sp. DL-VIII]
 gi|357174186|gb|EHJ02361.1| hypothetical protein CDLVIII_5898 [Clostridium sp. DL-VIII]
          Length = 1470

 Score = 80.5 bits (197), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/370 (23%), Positives = 150/370 (40%), Gaps = 46/370 (12%)

Query: 1176 ISNMESNMLKKQHARLGRALHCLSQELYSQDSH---FLLELVQNADDNIYPENVE----- 1227
            ISN    M+K       R       E   +D +    +LEL+QNADD     + E     
Sbjct: 10   ISNFRETMIKTYTYDPKRIREDFGSETRVKDDYKGRVILELLQNADDAQVTNDTENMRIG 69

Query: 1228 -PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVT 1286
               +   L++     +N     +   + A+C + +S KK     YIG KG+GFKSV   T
Sbjct: 70   DKNVILYLKKDAFYCVNGGYPVTQNGLEAICRLSHSPKKDRKL-YIGEKGLGFKSVLCFT 128

Query: 1287 DAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
            D PEIHS   H KF+ ++         +    N D     L + P  L  + W   ++  
Sbjct: 129  DRPEIHSGDLHCKFNRNDAMKFLYESNINGIDNFDSDNIALFRFPQYLSQEIW---MQDK 185

Query: 1347 FRTKFSE--------GIAMNNIV---DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIR 1395
              T+F +         I  NN +   +   ++  ++LLFL+ L  I  R   +DS ++ +
Sbjct: 186  ILTEFVKENATVIKLKITDNNYIVLKEKLLEIKSTILLFLNSLNKISIRIEEDDSSLIEK 245

Query: 1396 KKIVG-------DGIIKVSCGEDKMTWFVASQKLRAGV---------IRPDVKTTEIALA 1439
              I+        D    +  G +K  W V ++               +  D+   +++ A
Sbjct: 246  DFIIHRSESQELDNQCTLEDGNNKSIWQVYNETFEIEKNFIGDTDEQVFQDINACKVSFA 305

Query: 1440 LTLQESNEGNYGPL--LYQQPVFAFLPL-RTYGLKFILQGDFVLPSSREEVDGNSPWNQW 1496
            L   +  E NY PL  L +  +  + P    + L F+    +   SSR+ ++    +N+ 
Sbjct: 306  L---QKEELNYKPLDNLDENYIRVYFPTCEEFSLPFLFHASYYTDSSRKNINLEHEYNRN 362

Query: 1497 LLSEFPALFV 1506
            L  +   +F+
Sbjct: 363  LTKKAVDIFI 372


>gi|290990058|ref|XP_002677654.1| predicted protein [Naegleria gruberi]
 gi|284091262|gb|EFC44910.1| predicted protein [Naegleria gruberi]
          Length = 1534

 Score = 80.1 bits (196), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 101/411 (24%), Positives = 170/411 (41%), Gaps = 70/411 (17%)

Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIV--V 1241
            ++K  + +G A    SQ   +   HF+LE +QNADD+ Y +     L F   ES  V  +
Sbjct: 249  IEKMQSEIGSARSLFSQGFNTNLFHFILEFIQNADDSEYLQG--GILEFDFDESRKVLKI 306

Query: 1242 LNNEQGFSAENIRALCDVGNSTK--KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
             +NE GF+ +N+ +L  + +S K  K      IG KGIGFK++F V     + SNGF   
Sbjct: 307  KSNECGFTYDNLYSLSGIADSDKVSKKLETLAIGEKGIGFKTIFSVCKKVHLVSNGF--- 363

Query: 1300 FDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN 1359
                                 D+F  L S D  + E++ +N           S+G     
Sbjct: 364  ---------------------DVF--LTSTDLTKKENQEFN----------LSDGTLYEI 390

Query: 1360 IVDMF--SDLHPSLLLFLHRLQCIMFRNMLND---SLVVIRKKIVGDGIIKVSCGEDKMT 1414
             ++ F  S      LL L  ++ I+F++ LN    SL     + +     K     +K  
Sbjct: 391  EIEKFDHSIFDKYTLLTLKNIKTIIFKDGLNTTKYSLTCKDNEYLLSTTFKSKTTTEKFY 450

Query: 1415 WFVASQKLRAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT-YG 1469
             F     +    I    RP+    ++ +    +   +  Y        + +FLP++  Y 
Sbjct: 451  GFSIENGINMKNILCEKRPNFHRADLMVFFPEEWKVDKRY-------KLHSFLPVKEDYN 503

Query: 1470 LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVY 1529
            L  ++Q DF+L  +RE +  +S WN  ++   P L V A  +  ++  ++         Y
Sbjct: 504  LPLLIQSDFILSQNREALLESSDWNISIIDHIPQLLVKAFNTMKNINNWK---------Y 554

Query: 1530 MSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQW--APPCKVLRG 1578
              F  L  E    FS +    ++ L+  +C+ L G   ++   P   +L G
Sbjct: 555  DFFKVLQVESKLHFSNVQFKTINLLQNIDCIPLLGGQTKYFEKPSNIILMG 605


>gi|448528709|ref|ZP_21620217.1| restriction endonuclease-like protein [Halorubrum hochstenium ATCC
            700873]
 gi|445710449|gb|ELZ62263.1| restriction endonuclease-like protein [Halorubrum hochstenium ATCC
            700873]
          Length = 1561

 Score = 79.3 bits (194), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 91/370 (24%), Positives = 153/370 (41%), Gaps = 54/370 (14%)

Query: 1155 CEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELV 1214
             E+A  I+  IR  E  +      ++ +  ++Q  +  ++ H   Q      +  L EL+
Sbjct: 16   VEEAYEILRDIREKELKV------VDDDFERRQ--KRTQSFHDYIQRQLDHPAQVLPELI 67

Query: 1215 QNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGR 1274
            QNADD      VE  LT    +  + + N+ +   +E +  LC  G STK+     YIG 
Sbjct: 68   QNADDIKSCNTVEIELT----DDTLRIRNDGRPMKSEEVDTLCAAGESTKQDPE--YIGH 121

Query: 1275 KGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQL 1334
             G+GFKSVF ++D+P + S  FH +FD+                      RL   +   +
Sbjct: 122  FGLGFKSVFSISDSPRVRSGYFHFEFDSD---------------------RLTVPEIHDV 160

Query: 1335 ESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI---------MFRN 1385
             +    T I LP +   S       + +   ++H  LL +L  +  I         ++R 
Sbjct: 161  STPVKGTEIELPLKEDLS-AERREQLQERLDEVH-RLLTYLRNISQIEITQEGETTVYRR 218

Query: 1386 MLNDSLVVIRKKIVGDGIIK---VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL 1442
              +DS    R     D +++   V   +DK +     Q      I  +    E  +++TL
Sbjct: 219  E-SDSESPERIIYKDDEVLERRLVFTTDDKPSGEAFEQLAEKRQIDDEETVRESPVSVTL 277

Query: 1443 QESNEGNYGPLLYQQP--VFAFLPL-RTYGLKFILQGDFVL-PSSREEVDGNSPWNQWLL 1498
                + ++ P    +   +F FLP     G  F +  DF+L P+ RE    + P+NQWLL
Sbjct: 278  SFEVDDSHRPCSKAKDCYLFNFLPTDERSGFPFDVHADFLLEPNRRELAWKDGPYNQWLL 337

Query: 1499 SEFPALFVSA 1508
            +     +  A
Sbjct: 338  NRVGETYRKA 347


>gi|359415220|ref|ZP_09207685.1| peptidase C14 caspase catalytic subunit p20 [Clostridium sp. DL-VIII]
 gi|357174104|gb|EHJ02279.1| peptidase C14 caspase catalytic subunit p20 [Clostridium sp. DL-VIII]
          Length = 1679

 Score = 79.0 bits (193), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 149/350 (42%), Gaps = 56/350 (16%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNE 1245
            Q      +L+ LS +LY+    F+ EL+QNADD+   +N E    +I +    +VV ++ 
Sbjct: 22   QAVNQASSLNALSGDLYTDSKRFIYELLQNADDS--SQNSEAVKVWIKIFGDNLVVAHSG 79

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS-- 1303
            + F+  +I+ +C+V N TKK S     G KGIGFKSVF  +D   I +   + +FD+S  
Sbjct: 80   RPFTTRDIQGICNVNNGTKK-SDLTKTGYKGIGFKSVFGQSDKVTIFTKDEYFRFDSSYS 138

Query: 1304 --------------EGQIGFVLPTLVPPFNI---DMFCRLLSKDPVQLESKCWNTCIRLP 1346
                               F  P  + P      D+F      +P+    K  +  +   
Sbjct: 139  FDWKWDVSKKIWEQNNDREFQFPWQIIPIYTEVKDVF------EPINQFLKDIDASVATI 192

Query: 1347 FRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKV 1406
             + K      +N        L  +L +FL  L+ I   N     LV +  K   +  I +
Sbjct: 193  IQMK-----NVNETSQAAQSLSQNLNMFLF-LKNINEINFDITELVSVEIKRTENDRITL 246

Query: 1407 SCGED-KMTWFVASQKLRAGVIRPDVKTT---------------EIALALTLQESNEGNY 1450
              G   K+ W + + +L+   +  +VKT                 I L+L  +  N+G  
Sbjct: 247  KRGNGLKVNWLLNTIRLK---VPSEVKTVLQDERNIPEKLLNADSIELSLAAKVENDGIT 303

Query: 1451 GPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
                 ++ ++++LP     Y L  ++   F+  ++RE +  +S WNQWL 
Sbjct: 304  KLSKQEKLLYSYLPTGETKYSLPVLVNTSFLTTANRESLHADSKWNQWLF 353


>gi|291460280|ref|ZP_06599670.1| conserved hypothetical protein [Oribacterium sp. oral taxon 078 str.
            F0262]
 gi|291417035|gb|EFE90754.1| conserved hypothetical protein [Oribacterium sp. oral taxon 078 str.
            F0262]
          Length = 1239

 Score = 79.0 bits (193), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/331 (26%), Positives = 150/331 (45%), Gaps = 46/331 (13%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQ--ESGIVVLNNEQGFSAENIRALCDVG 1260
            YS  +HF+ EL+QNA+D       E     +++  +  +V  +N   F  +++R +  + 
Sbjct: 35   YSDSNHFVYELLQNAED-------EKATKVVIEFYKDQLVFYHNGDPFDEDDVRGVSSML 87

Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN--GFHVKF------DTSEGQIGFVLP 1312
              TK  + A  IGR G+GFKSVF+ T  PEI+S+   F +K       + S      +  
Sbjct: 88   MGTKDRNDAQTIGRFGMGFKSVFKYTYQPEIYSDEEAFIIKNYLLPVENDSHWDFQGLKS 147

Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEG----IAMNNIVDMFSDLH 1368
            TL  P + D    L   +   L      T I +PF+ +  +G    ++  +++D  + L 
Sbjct: 148  TLAYP-DGDGTKYLPFANTQHL------TKIVIPFKKRNEKGELVAVSGKDVLDKLNSLS 200

Query: 1369 PSLLLFLHRLQCIMFRNMLNDSLVVIR-KKIVGDG-------IIKVSCGEDKMTWFVASQ 1420
              +LLFL  ++ + + N  N   V I      GD        I   S G+++++ ++  +
Sbjct: 201  GEILLFLTYIRNLYWINKENGKYVHITLSHANGDDQLLTCRIIGSESDGKEEISRYLRYK 260

Query: 1421 KLRAGVIRPDVKTTEIALA--LTLQESNEGNYGPLLYQQPVFAFLPLRTY-GLKFILQGD 1477
            K+      PD+K+ E+A+A  L +Q  N       +   PV+ + P R    L F++ G 
Sbjct: 261  KV---FDHPDMKSAEVAVAYRLNVQARNINE----MEHTPVWVYFPTRDMTDLPFLIHGS 313

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            F    SRE++   S +N  L      L   +
Sbjct: 314  FETAVSREKLMTPSTFNDSLFGRLGDLIAES 344


>gi|428213778|ref|YP_007086922.1| hypothetical protein Oscil6304_3433 [Oscillatoria acuminata PCC 6304]
 gi|428002159|gb|AFY83002.1| hypothetical protein Oscil6304_3433 [Oscillatoria acuminata PCC 6304]
          Length = 1585

 Score = 78.2 bits (191), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 149/333 (44%), Gaps = 32/333 (9%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV-EPTLTFILQESGIVVLNNEQGFS 1249
            + + +  L+ + YS+   F+ EL+QNADD   P  + +  +  ++ E+ ++VL+N + F 
Sbjct: 29   IAKFVTDLTGDSYSEQERFIFELLQNADD--LPTIIGQVHVKLVILENHLLVLHNGKPFD 86

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD--TSEGQI 1307
             EN+RA+C+   STK  +S    G KGIGFKSVF  +    I+S  F  KFD  +++ Q 
Sbjct: 87   DENLRAICNRSESTKSKNS-DQTGYKGIGFKSVFHDSQCVYINSGDFSFKFDRNSTKHQN 145

Query: 1308 GFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGI--AMNNIVDMFS 1365
                P  + P  ++   R    D +Q  S  + + +    +    E I      I  +FS
Sbjct: 146  PAQTPWQIKPIWVE---RDEYPDEIQAYSDFFESPVATALKVS-PEKINDYKEKIKKLFS 201

Query: 1366 DLHPSLLLFLHRLQCIMFR--NMLNDSLVVIRKKIVGDGIIKVSCGEDKMT----WFV-- 1417
            D  P L+LFL  +  I     N L+  + VI+ K      I+ +    + T    W +  
Sbjct: 202  D--PRLILFLRHISSIEVYGLNDLSLDIKVIKLKQQDIYTIQYTALPSEQTIETQWIIKN 259

Query: 1418 ----ASQKLRAGVIR----PDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLP--LRT 1467
                 SQ +R  +      PD   +     LT       N    +    +F +LP  ++ 
Sbjct: 260  FELPVSQNIREAMKNDEKVPDKLKSIQNSKLTFAVEIVNNQIHPVDNSVLFNYLPTNVKA 319

Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSE 1500
                F++  DF+  + RE +  +  WN +L  +
Sbjct: 320  DQFPFLINADFLTTAGRETIHVDHTWNCFLFEQ 352


>gi|435849813|ref|YP_007301754.1| molecular chaperone of HSP90 family [Methanomethylovorans hollandica
            DSM 15978]
 gi|433663301|gb|AGB50726.1| molecular chaperone of HSP90 family [Methanomethylovorans hollandica
            DSM 15978]
          Length = 1113

 Score = 77.4 bits (189), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 81/323 (25%), Positives = 135/323 (41%), Gaps = 43/323 (13%)

Query: 1197 CLSQELYSQDSHFLLELVQNADDNIYPENVEPT----LTFILQESGIVVLNNEQGFSAEN 1252
             L  ELYS+ +HF+ EL+QNA+D     N E T    + F L    + + +N   F  E+
Sbjct: 27   TLLTELYSEKTHFIYELLQNAEDAYERLNKEDTRIKSVHFELYNDRLEIRHNGIPFDEED 86

Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
            I+ +C + + TKK      IG+ GIGFKSV+  T +PEI+S           G   F + 
Sbjct: 87   IKGICRLASGTKK-QDISQIGKFGIGFKSVYAYTKSPEIYS-----------GHYCFCIK 134

Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT-KFSEGIAMNNIVDMFSDLHPSL 1371
              V P++I+    ++             T   +P    K ++  +   I      L    
Sbjct: 135  NYVHPYSINQRNDIIKGQ----------TLFVIPLNNDKINQKDSYLEIKSKLIGLGART 184

Query: 1372 LLFLHRLQCIMFR---------NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKL 1422
            LLFL  ++ I ++            N      R  +  +  + V   E+   W   S+ +
Sbjct: 185  LLFLKNIEEITWKIDGTYGKYYKKSNHETTYSRAILYSESNVSVISIEN---WLKFSRTI 241

Query: 1423 RAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLP 1481
                   +  + EIA  +   E+++           +  F P  +  GL F++QG+F   
Sbjct: 242  SNEC--NNKSSVEIAYLVQNDENSKKEQIISADNTQLVVFFPTDKKTGLSFLIQGNFQTT 299

Query: 1482 SSREEVDGNSPWNQWLLSEFPAL 1504
            +SR+ +  N  WN+ L+ E   L
Sbjct: 300  TSRDNIFSND-WNKLLIEEIAIL 321


>gi|404484308|ref|ZP_11019521.1| hypothetical protein HMPREF1135_02581 [Clostridiales bacterium
            OBRC5-5]
 gi|404342625|gb|EJZ68996.1| hypothetical protein HMPREF1135_02581 [Clostridiales bacterium
            OBRC5-5]
          Length = 1656

 Score = 77.4 bits (189), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 75/343 (21%), Positives = 149/343 (43%), Gaps = 59/343 (17%)

Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENI 1253
            +L  LS +LY+  + F+ EL+QNADD+ + +N       I  ++ +VV ++ + F   +I
Sbjct: 32   SLTALSVDLYTDANRFIYELLQNADDSAFHDNGVKVWIKIFNDN-LVVAHSGKVFDTRDI 90

Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---------- 1303
            R +C++ N TKK S     G KGIGFKSVF  +D   I+S+G   +FD +          
Sbjct: 91   RGICNINNGTKK-SDLKKTGYKGIGFKSVFGQSDYVVIYSDGEFFRFDKNYDFKWRWEKS 149

Query: 1304 ------EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAM 1357
                  +    F  P  + P         +  +  ++  +  +   ++    K +  + +
Sbjct: 150  KKDWEKDNNRKFEYPWQIIP---------IYTENTEISEQIRDYIFKVG--AKVATILKI 198

Query: 1358 NNIVDMFSDL-----HPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDK 1412
            N I +    +     + ++ LFL  +  I    ++ D ++ I + I    I+       K
Sbjct: 199  NYIQETIDAIRVLTENSNMFLFLKHITEI----IIEDKVITINRNIKNRLILSDDTNSQK 254

Query: 1413 MTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV------------- 1459
              W + + KL    +  D++ + +      Q+  E N   +++   +             
Sbjct: 255  -EWIIKNIKLN---VSNDLRKSLLEERNIPQKLLEANNIDMIFAAQIDKNRIVELNKEDR 310

Query: 1460 --FAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
              +++LP     Y +  ++   F++ ++RE +  +S WN+WL 
Sbjct: 311  LLYSYLPTGETQYAIPVLVNTSFLVSANRESIHKDSKWNKWLF 353


>gi|315652131|ref|ZP_07905129.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
            3986]
 gi|315485623|gb|EFU76007.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
            3986]
          Length = 1286

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 83/332 (25%), Positives = 145/332 (43%), Gaps = 49/332 (14%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQE-SGIVVLNNEQGFSAENIRALCDVGN 1261
            YS  +HF+ EL+QNA+D      V  +   I      +V  +N + F  +++R +  +  
Sbjct: 35   YSDSNHFVYELLQNAED------VRASKVVIEYYCDKLVFYHNGKPFDEDDVRGVSSMLM 88

Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNI- 1320
             TK    A  IG+ G+GFKSVF+ T  PEI+S+           + GF++   + P  + 
Sbjct: 89   GTKSRDDASTIGKFGMGFKSVFKYTYQPEIYSD-----------EEGFIIQNYLLPKEVN 137

Query: 1321 ---------DMFCRLLSKDPVQLESKCWN-TCIRLPFRTKFSEG----IAMNNIVDMFSD 1366
                     + F        V + + C + T I +PF    S+G    ++ N+++D    
Sbjct: 138  NNWDFRKIKEEFNYSDGSKKVYVFANCEHLTKIIIPFNKIDSKGKLSYVSGNDVLDKLEG 197

Query: 1367 LHPSLLLFLHRLQCIMFRNMLNDSL--VVIRKKIVGDGII--KVSCGE-DKMTWFVASQK 1421
            L   +LLFL  +Q + + N        + + K    + +I  ++S    D+        K
Sbjct: 198  LSGEILLFLTCIQDLYWINKEAGKYAHITLSKDSKDENLITCRISGNNSDRKEDISKYLK 257

Query: 1422 LRAGVIRPDVKTTEIALALTL----QESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQG 1476
             +     PD+K  E+++A  +    +  NE N        P++ + P R    L F++QG
Sbjct: 258  YKDVFDHPDMKNAEVSVAYKVNSLAKNVNEIN------SSPIWVYFPTRDNTRLPFLIQG 311

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
             F    SRE++   S +N  L +   AL   +
Sbjct: 312  SFETAVSREKLMTPSDFNNVLFNRLGALIAES 343


>gi|421077861|ref|ZP_15538823.1| putative transcriptional regulator [Pelosinus fermentans JBW45]
 gi|392524114|gb|EIW47278.1| putative transcriptional regulator [Pelosinus fermentans JBW45]
          Length = 347

 Score = 75.9 bits (185), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 86/341 (25%), Positives = 146/341 (42%), Gaps = 52/341 (15%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ 1246
            Q      +L+ LS +LY+    F+ EL+QNADD+     V      I  +  +VV ++ +
Sbjct: 22   QAVNQASSLNALSGDLYTDSKRFIYELLQNADDSSQNNEVVKVWIKIFNDK-LVVAHSGK 80

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS--- 1303
             F+  +++ +C+V N TKK S     G KGIGFKSVF  +D   I SN  + +FD+S   
Sbjct: 81   PFTTRDLQGICNVNNGTKK-SDLTKTGYKGIGFKSVFGQSDKVTIFSNMEYFRFDSSYPF 139

Query: 1304 -----EGQI--------GFVLPTLVPPFNIDMFCRLLSKD-PVQ--LESKCWNTCIRLPF 1347
                 E +I         F  P  + P    ++ +    D P+   LE+   N    +  
Sbjct: 140  EWNWDEAKIIWEKKNDREFQFPWQIIP----IYTKAEEIDEPINQFLENIEANVATIIQM 195

Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVS 1407
            +      +A+  +       + ++ LFL  +  I F  M + S+ + R +   D II  +
Sbjct: 196  KNVNETSLAVGKLSQ-----NLNMFLFLKNISEINFDVMESVSIEINRSE--KDRIILKN 248

Query: 1408 CGEDKMTWFVASQKLRAGVIRPDVKTT---------------EIALALTLQESNEGNYGP 1452
                K  W + +  L    +  DVKT                 I L L  +  N+G    
Sbjct: 249  GNASKSDWLINTISL---TVPNDVKTALQDERNIPDKLLNTDSIELTLAAKVGNDGITKI 305

Query: 1453 LLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNS 1491
               +  ++++LP   R Y L  ++   F+  ++RE +  +S
Sbjct: 306  SEQETLLYSYLPTDERKYSLPVLVNTSFLTTANRESLHADS 346


>gi|313669368|ref|YP_004049795.1| hypothetical protein Sulku_2587 [Sulfuricurvum kujiense DSM 16994]
 gi|313156565|gb|ADR35242.1| hypothetical protein Sulku_2587 [Sulfuricurvum kujiense DSM 16994]
          Length = 1006

 Score = 75.9 bits (185), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 150/361 (41%), Gaps = 68/361 (18%)

Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNNEQGFSA 1250
            G   H +   LYS  +HFL EL+QNA D       E T ++F L    +   +N + FS 
Sbjct: 21   GDNSHRIIANLYSDSTHFLYELLQNAQD------AEATSVSFELYSDRLETSHNGRAFSF 74

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
             ++ ++  +G+STK  +    IG+ G GFKSVF +TD P I+S  ++ K         ++
Sbjct: 75   TDVESITTIGSSTK-SNEPNKIGKFGAGFKSVFSITDTPHIYSGTYNFKIS------DYI 127

Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGIAMNNIVDMFSDLHP 1369
            +P  +              DP++ +S    T + LPF R +       ++I      +  
Sbjct: 128  IPETI--------------DPLENQSDE-RTSVILPFDRAEIDAKTLYDSIAKRLRSIGK 172

Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGD--GIIKVSCGEDKMTWFVASQKL----- 1422
              LLFL  L  I ++              +GD  G +K +  E   T  + + +      
Sbjct: 173  EELLFLSNLCRIGWK--------------IGDEAGYVKKATDEASETRSLITLETHNKTE 218

Query: 1423 ------RAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLP-LRTYGLKFILQ 1475
                  R   I   +   +IA AL    + +G Y  +    P++ F P +    L+F++ 
Sbjct: 219  RFELYRRGFTIENKLLECKIAFAL----NADGRYVSIG-TNPLYVFFPTVVPTNLQFLVH 273

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVS-----AERSFCDLPCFRENPAKAASVYM 1530
              +    +RE ++ + P NQ +      L+ S     AER   D+      P  + S  +
Sbjct: 274  APYKTTPNRETMNFSDPQNQTITDNIIELYKSVLKDMAERKVLDVETMGLFPLSSQSGEL 333

Query: 1531 S 1531
            S
Sbjct: 334  S 334


>gi|73748050|ref|YP_307289.1| hypothetical protein cbdb_A109 [Dehalococcoides sp. CBDB1]
 gi|73659766|emb|CAI82373.1| hypothetical protein cbdbA109 [Dehalococcoides sp. CBDB1]
          Length = 1023

 Score = 75.5 bits (184), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 80/353 (22%), Positives = 155/353 (43%), Gaps = 48/353 (13%)

Query: 1172 LGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI---YPEN--- 1225
            +  +   + S+ ++++          +S++ YS  +HF+ EL+QNA+D +   + +N   
Sbjct: 1    MASDYQKITSDNIRRRGEEFDDIGEWISEQFYSDRTHFIYELLQNAEDALSRRFRDNSDS 60

Query: 1226 -VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFR 1284
             V   + F L  + +   +  + F+ +++RA+ D+   TK       IG+ GIGFKSV+ 
Sbjct: 61   KVPRRVQFRLYSNRLEFRHFGKLFTEDDVRAISDILKGTK-AIDQNQIGKFGIGFKSVYA 119

Query: 1285 VTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIR 1344
             T  PE+HS   H           F +   + P N D   +L  ++          T   
Sbjct: 120  FTSTPEVHSGDEH-----------FFIERYIRPKNADQIPKLEDQE----------TLFV 158

Query: 1345 LPF-RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDS----LVVIRKKIV 1399
             PF   + +   ++ +I +  +++    LLFL  +  I +  + NDS    L  +R++  
Sbjct: 159  FPFNHDRVTPEDSLRSISNKLNNIGVRTLLFLKHINEIEWL-VGNDSKGTYLKEVRRR-- 215

Query: 1400 GDGIIKVSCGEDKMT-----WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLL 1454
            GD       G++  T     W +  + +        VK   + +A  ++   + N   ++
Sbjct: 216  GDSRQVTVVGQNNGTEESEDWLMFERPVHHPGSTESVK---VEIAFRVEVDGKTNKEKIV 272

Query: 1455 YQQ--PVFAFLPL-RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPAL 1504
              +  P+  F P  +     F++QG F+   SR+ +  +S WN+ L+ E   L
Sbjct: 273  KSKASPLVVFFPTEQETRFGFLVQGPFLPTPSRDNILKDSEWNKLLIREVAQL 325


>gi|119179507|ref|XP_001241335.1| hypothetical protein CIMG_08498 [Coccidioides immitis RS]
          Length = 2079

 Score = 75.5 bits (184), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 125/506 (24%), Positives = 203/506 (40%), Gaps = 107/506 (21%)

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGY---IGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
            NE GF+ E++ ++C +G S +KG SA     IG KGIGFKSVF+V     I S  +  KF
Sbjct: 59   NELGFAPEDVDSICRIGGS-QKGISADRMDCIGEKGIGFKSVFKVASVVWISSGKYSFKF 117

Query: 1301 DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNI 1360
            D +  ++G + P             + ++ P +  +   +  +RL  ++   E +    +
Sbjct: 118  DNAT-RLGMIAP-------------MWAQFPAEAPASGTSMLLRLLNKSVRDELLKEMRV 163

Query: 1361 VDMFSDLHPSLLLFLHRLQ----CIMFRNMLNDSLVVIRK--KIV---GDGIIKVSCGED 1411
             D      P +L+FL++++    CI   N       + RK  K V   G  +  +  G D
Sbjct: 164  CD------PKILIFLNQIRELKVCIAQDNGTTFERTISRKDDKDVDKPGQYVTTLRYG-D 216

Query: 1412 KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLK 1471
                FV  + + +G         EI LA  ++ +          Q+P             
Sbjct: 217  SSQQFVVIRYVPSGR---STDKAEILLAFPIKNN----------QEP------------- 250

Query: 1472 FILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMS 1531
                   V+ S RE+++G+S  NQ L S  P  FV A   F +   FR        ++  
Sbjct: 251  -------VIQSHREDINGSSAENQRLRSLIPKAFVQAVDYFQESDYFR-------YIWPR 296

Query: 1532 FVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLR---GWNDRAHSLLP 1588
            ++P++ E   FF+ L    +    +S   +LE  + + A P  +      + D     LP
Sbjct: 297  YLPIIPE-ENFFTSLTEETI--FTLSQRKLLESESGELATPGTLFYVPPRFRDPNGDPLP 353

Query: 1589 DILLQKHLGLGFLSKDIVLSDS-LARALGIEEYGPKILLQIISSLCRTENGLRSMGLSW- 1646
               L       +LS+     DS L + LG+     + LL  + S+             W 
Sbjct: 354  ---LCSRTAFRYLSRKYFDEDSDLFQCLGVRHLSLRDLLVDVKSVVNQYQNHSREATEWH 410

Query: 1647 --LASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDE--GTIWL 1702
              L+  LN           +  L++   T     L+ IP IPL DG + S  E  G I+ 
Sbjct: 411  IRLSKALNR----------EDMLRAYRHT-----LRSIPLIPLRDGRWISASESAGIIFF 455

Query: 1703 HS---DCSVFDGGFGLEAFPNLCAKL 1725
             S   D SV  G   +E  P+  + +
Sbjct: 456  PSALADLSVPSGLDSVEIHPDAASNV 481


>gi|227115501|ref|ZP_03829157.1| putative transcriptional regulator [Pectobacterium carotovorum subsp.
            brasiliensis PBR1692]
          Length = 1675

 Score = 74.7 bits (182), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 90/400 (22%), Positives = 168/400 (42%), Gaps = 85/400 (21%)

Query: 1175 NISNMESNMLK-----------KQHARLGRALHCLSQELYSQDSHFLLELVQNADDN-IY 1222
            NI N+++ ++K            Q      +L  LS++LY+    F+ EL+QNADD+ + 
Sbjct: 8    NIINLKAKVVKTFQDNINYSNPNQAVNQADSLDALSKDLYADSKRFIYELLQNADDSAVD 67

Query: 1223 PENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV 1282
             ++V   +  I  E+ ++V ++ + F   +I+ +C++ N TKK S     G KGIGFKSV
Sbjct: 68   GKSVNVWIKII--ENNLIVAHSGKPFDERDIQGICNINNGTKK-SDITKTGYKGIGFKSV 124

Query: 1283 FRVTDAPEIHSNGFHVKFDTS-----------------EGQIGFVLPTLVPPFNIDMFCR 1325
            F  +    + +N  + KFD +                 E    F  P  +          
Sbjct: 125  FGQSQKVIVFTNNEYFKFDVNYKHNWQWEESSQDKWEKENDRKFQYPWQII--------- 175

Query: 1326 LLSKDPVQLESKCWNTCIRLPFRT---KFSEGIAMNN---IVDMFSDL--HPSLLLFLHR 1377
                 P+ L+S    T I    R+     +  I + N   I+D   +L    ++ LFL  
Sbjct: 176  -----PIYLQSNEIETNIHEYLRSIKATVATVIELKNDKDIIDSLQELSQKTNMYLFLKN 230

Query: 1378 LQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED-KMTWFVASQKLRAGVIRPDVKTT-- 1434
            +  I+F    + ++ + R     +G + + C  + + TW   + ++    +  D+K +  
Sbjct: 231  ISNIIFDTERSVNVSIERN---ANGKVHLGCNNNCQSTWLSKTFEIS---LPEDIKNSLS 284

Query: 1435 -------------EIALALTLQESNEGNYGPLLYQQPVFAFLPL--RTYGLKFILQGDFV 1479
                         ++ L L+ Q  +          + ++++LP     Y    ++   F+
Sbjct: 285  YDQNIPDKFKYIEKVDLTLSAQVKDGSIVKLHRTDKLLYSYLPTDETKYDFPVLVNTSFL 344

Query: 1480 LPSSREEVDGNSPWNQWLLS-------EFPALFVSAERSF 1512
              ++RE +  +S WNQWL +       E+ A  V +E  F
Sbjct: 345  TTANRESLHASSVWNQWLFTQIGTKLFEWIAELVQSEYQF 384


>gi|320585933|gb|EFW98612.1| hypothetical protein CMQ_4464 [Grosmannia clavigera kw1407]
          Length = 1083

 Score = 74.3 bits (181), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 64/103 (62%), Gaps = 4/103 (3%)

Query: 1203 YSQDSH-FLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-NEQGFSAENIRALCDVG 1260
            +S +SH F+LEL+ NAD+N Y ++ +PTLT      G   L+ NE+GFSA+++ A+CD  
Sbjct: 47   WSTNSHRFVLELLHNADENTY-DDADPTLTMTHTADGHFRLHCNERGFSAKDVDAVCD-N 104

Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            NS+ K +     G KG+GFK+VFRV     + S+ +  K   S
Sbjct: 105  NSSAKAALRTATGEKGVGFKAVFRVAHTVWVVSDPYRFKIRRS 147


>gi|328957430|ref|YP_004374816.1| putative transcriptional regulator [Carnobacterium sp. 17-4]
 gi|328673754|gb|AEB29800.1| putative transcriptional regulator [Carnobacterium sp. 17-4]
          Length = 1675

 Score = 73.9 bits (180), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 68/117 (58%), Gaps = 2/117 (1%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ 1246
            Q      +L  LS +LY+  + F+ EL+QNADD+    N+      I  E  +V+ ++  
Sbjct: 22   QAVNQASSLDALSGDLYTDSTRFIYELLQNADDSSSNNNLIKVWIKIFDEY-LVIAHSGT 80

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
             F+ ++IR +C+V N TKK S     G KGIGFKSVF  ++   I++N  +++FD+S
Sbjct: 81   PFNIQDIRGICNVNNGTKK-SDITKTGYKGIGFKSVFGQSNYVTIYTNNEYLRFDSS 136


>gi|404319253|ref|ZP_10967186.1| hypothetical protein OantC_13717 [Ochrobactrum anthropi CTS-325]
          Length = 1062

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 141/640 (22%), Positives = 246/640 (38%), Gaps = 118/640 (18%)

Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
            ++LY   +HF+ EL+QNA+D    +     ++FIL E  +   +N + F  ++I  + D+
Sbjct: 29   EDLYPDSAHFIYELLQNAEDREASQ-----VSFILFEDKLTFTHNGESFRPKDIYGITDI 83

Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFN 1319
            G  TK       IGR G+GFK+VF  T++P I S  F            F +  LV P  
Sbjct: 84   GEGTK-ADDEDRIGRFGVGFKAVFAFTESPHIWSPTF-----------SFKITELVLPSE 131

Query: 1320 IDMFCRLLSKDPVQLESKCWNTCIRLPFRT-KFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
            ID    L ++           T    PF   K S   A   I    +++  S LLFL  +
Sbjct: 132  IDSLPGLGTQ-----------TRFDFPFNNPKKSREDAHREIKAGLNEIAESTLLFLPNI 180

Query: 1379 QCIMFRNMLNDSL-------------VVIRKKIVGDGIIKVSCGE-DKMTWFVASQKLRA 1424
            + I ++  + DS+             V + K++ G         + D+    +  Q++  
Sbjct: 181  ESIKWQ--IGDSVSGEVLRIEHTRKHVEVLKQVNGQTTTSAHFLKFDQAVKGLEKQRVAI 238

Query: 1425 GV---IRPDVK--TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFV 1479
                   P+VK  +T+ ALA  L+        P   Q  VF      + GL+F L   FV
Sbjct: 239  AFALDFAPNVKSLSTDKALAQQLK------IVPAPGQVCVFFPANKESSGLRFHLHAPFV 292

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEV 1539
               SR  +   +P N  L  +  AL  SA     DL           + ++  +P   + 
Sbjct: 293  PELSRASIK-ETPANHPLYDQLAALCASALHEIRDLGLL-------TTEFLGVLPNAQDA 344

Query: 1540 HGFFSGLPRMILSKLR-MSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGL 1598
             G   G  ++  S +R M    +   +    AP  +++     +A + L ++L +  +  
Sbjct: 345  LGKGYGYEQIRDSIIRAMKTEPLFPTHGKGHAPADRLV-----QAKASLKELLGEDDIKY 399

Query: 1599 GFLSKD---------IVLSDSLAR---ALGIEEYGPKILLQII--SSLCRTENGLRSMGL 1644
                 D          +   ++ R   AL I ++  +  +  +   +   +  G+    L
Sbjct: 400  LIEYDDEPPLWAAPRALQGTNVERFMTALEIRDWDIEDFVDCLDEKTTVGSWKGVDKEFL 459

Query: 1645 SWLA----SWLNELYT-ISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGT 1699
            +WL      W  +LY+ +   S  QS+L           L+R   I LSDG++++ +   
Sbjct: 460  TWLGHKTVEWHQQLYSLLDRESEAQSALY---------RLKRCQLIRLSDGSYATGNACH 510

Query: 1700 IWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIG 1759
                 D S                 +R V PA+ +      S       +N  R L  +G
Sbjct: 511  FADERDLS--------------AEGVRCVDPAVYT------SGKSKAQQENARRFLESLG 550

Query: 1760 VQQLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMM 1799
            V  +    +V+  +    + +      +  +AD   F+ +
Sbjct: 551  VTGVGERQLVEALLKSKYTGDNREFNQRTYLADLRRFIKL 590


>gi|47206634|emb|CAF91966.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 911

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 97/453 (21%), Positives = 181/453 (39%), Gaps = 73/453 (16%)

Query: 1644 LSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSSVDEGTIWLH 1703
            + W   W        F +S + S+ +     ++ +L+ +P IPL+DG   ++ E  ++  
Sbjct: 45   IDWPGCW--------FATSERLSMATAERESVLQSLRELPIIPLADGRVVALGEEGVFFP 96

Query: 1704 SDCSVFDGGFGLE--AFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQ 1761
             +         ++  +   L   +  V P+LLS        L  +    + ++L  +GV 
Sbjct: 97   METQTKKSKDLIQTGSLAALYKDISVVEPSLLSC-------LEPLESQQVLKLLRMLGVH 149

Query: 1762 QLSAHDIVKVHILPAISDETTANGDKNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRK 1821
            +L    ++  HI P I      +  ++++  YL F+  H            ++  ++L  
Sbjct: 150  ELEPQQLLDQHIYPTIQSNKWKSKPESVVVSYLVFIKQH-------STSSHDYANADLP- 201

Query: 1822 KAFVLTNHGFKRPSEIPIHFGKEFGN-PVSVNMLIHDIDIKWYEVDITYLKHPAKESLSC 1880
               VLTN G   P+E  +HF  ++ N  +   +  HD    W  V   Y+K         
Sbjct: 202  ---VLTNRGLLCPTEEKVHFSVDYDNIDLPERLPGHD----WILVSSCYVKTDG------ 248

Query: 1881 GLVKWRKFFKEIGITDFV-----QVVQVDKDVADISHTGF-------KNMWTKELLSPGS 1928
             +  WR+    +G+ D +     +V    +++    H  F       KN  T   ++ G 
Sbjct: 249  DVAGWRELLSRLGVRDGLIIRKERVALTAEELVRALHMCFSAVCRLHKNAITCYQMNQGK 308

Query: 1929 AA-IDWESNELVHLLSL--LTTNVNRQCSKHLLEILDTLWDDCYTDKIMGFFKSKPTGDD 1985
               +D    E  H L+   L  +V  Q    LLE+L   WD  +  +   + +++    D
Sbjct: 309  GCELDDYPCEEFHALATAQLPDSVLLQQRLTLLEMLANNWDTGH--RYSQYLRAQLVDGD 366

Query: 1986 ----RSFQSSFINCICDIQWT-----ISSMDDELHY--PKDLFHDCDAVRSILGPSAPY- 2033
                RS +SSF + +C ++W      +     E  Y  P  ++     V S+LG    Y 
Sbjct: 367  GREVRSTRSSFHHYMCSLEWVPAFRPLEGERQERKYLRPTSVYLSSPEVNSLLGTHVCYA 426

Query: 2034 -IVPKVKSEKLVCDIGLKTEVTIDDILEILKVW 2065
             I P   S      IG++  ++++ ++  LK W
Sbjct: 427  EITPSDFSRA----IGMRHNISVEAMINYLKEW 455


>gi|338213695|ref|YP_004657750.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336307516|gb|AEI50618.1| hypothetical protein Runsl_4277 [Runella slithyformis DSM 19594]
          Length = 1581

 Score = 72.0 bits (175), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 123/527 (23%), Positives = 226/527 (42%), Gaps = 70/527 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT---LTFILQESGIVVLNN 1244
            + +L +AL+ + +E+      F+ EL+QNA D  YP+        ++F + ++ ++  ++
Sbjct: 140  NQQLAKALNSIQREINKSPETFIFELLQNASD--YPDRARENKVYVSFKISDNYLLFCHS 197

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
               F   N+ ++C V N + K  S   IG KGIGFKSVF+  +   I S G+  +FD+S+
Sbjct: 198  GAVFKVNNVHSICSV-NESDKSDSIDKIGFKGIGFKSVFKDCNFAYIKSGGYSFRFDSSK 256

Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIV-- 1361
                FV  T  P   I ++      D   +E+  + N+ + +  + +  E +   +++  
Sbjct: 257  ----FV--TEKPYQIIPIWSEPSEMDSEIVETTIFKNSTVSIALKPRRREKLEEYHVILN 310

Query: 1362 DMFSDLHPSLLLFLHRLQCIMF----------RNMLNDSLVVIRKKIVGDGIIKVSCGED 1411
             +F+D    +LLFL  ++              +N+L D  +   + ++ D + +    E 
Sbjct: 311  RLFTD--DRILLFLPNIENAKIITPTGFKESKKNLL-DYWMGKEQVVIPDNVREWINKEI 367

Query: 1412 KMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRT-YGL 1470
                 V  +K        D++++ I  A     S   ++        ++ +LP +   G 
Sbjct: 368  DDNESVVPEKYY------DIQSSIIQFA----TSRSSDHILKTTNAVLYNYLPTKVNLGF 417

Query: 1471 KFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPA--KAASV 1528
             F++ GDF+ P    E   N  WN +L+ +   LF+   RS   L     N    K    
Sbjct: 418  DFLINGDFI-PDGTREFFHNIKWNDFLIEQSGFLFLKWVRS-IGLKKDERNAGNFKFNRN 475

Query: 1529 YMSFVP--------LVGEVH-----GFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKV 1575
            Y+S +P        ++ E +      F  G    +L  + ++    +   + Q  P   +
Sbjct: 476  YLSIIPDFTIIENQIIDERNKHLLLSFKKGFETGLLGTVEVAPIAFIPAQSGQLEPLNNI 535

Query: 1576 LRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSDS---LARALG-IEEYGPKILLQIISS 1631
            L      A  LLPD       GL  LS+ ++ SDS   L++  G I+ Y     L ++  
Sbjct: 536  LIDKTGIA-VLLPD----DFKGLTGLSQKLISSDSGEGLSKVEGLIKAYSTVGKLYLVDD 590

Query: 1632 LCRTENGLRSMGLS-WLASWLNELYTIS-FHSSGQSSLQSGVETDLI 1676
            L   +  L+      WL + +N    I  F++S   SL+  + T+ I
Sbjct: 591  L---KADLKKEAFQIWLKNPVNNYKLIKHFYTSTDPSLKLLLTTESI 634


>gi|262198949|ref|YP_003270158.1| transglutaminase [Haliangium ochraceum DSM 14365]
 gi|262082296|gb|ACY18265.1| transglutaminase domain protein [Haliangium ochraceum DSM 14365]
          Length = 2543

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 75/336 (22%), Positives = 142/336 (42%), Gaps = 53/336 (15%)

Query: 1194 ALHCLSQELYSQDSHFLLELVQNADD---NIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            A   LS+ +Y+   HFL+EL+QNA+D     +  +++           + V ++   F A
Sbjct: 230  AEELLSRRVYTDPGHFLVELLQNAEDAGARCWRADID--------AREVSVWHDGVPFDA 281

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
            +++  +  +G +TK     G+    G+GFKSV+ + + P+++S  F  +F+ +       
Sbjct: 282  KDVVGVLSIGQTTKHKEQIGFF---GVGFKSVYEICERPQVYSGPF--RFEIA------- 329

Query: 1311 LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPS 1370
                    ++ +  RL ++     E     T + LP R           +     ++ P 
Sbjct: 330  --------DVSLPRRLAARPEGYPEH---GTLLVLPLREPEDPARTPERLYQRAREVPPE 378

Query: 1371 LLLFLHRLQCIMFRNMLNDSLVVIRKKIV-GDG------------IIKVSCGEDKMTWFV 1417
             LL L  ++ +          +        GD             I  V    D  T ++
Sbjct: 379  TLLTLRNIREMRIAQPAQSRTIRAEAGTAPGDASSDGAEPNSAQRIDLVHLERDTRTRYI 438

Query: 1418 ASQKLRA---GVIRPDVKTTEIALALTLQESNEGNYGPLLY-QQPVFAFLPLRTYG-LKF 1472
             +++ RA   G  R   +++   + + L+ + +G   PL   +  V+++LP R    L+F
Sbjct: 439  IARE-RATWEGQAREGSRSSSTEVLVALRIAPDGVPVPLSEGEATVYSYLPTRERSRLRF 497

Query: 1473 ILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            ++   F LP  RE +D +SP+N+WLL     L   A
Sbjct: 498  LVHAHFDLPVDRERLDLDSPYNRWLLVNAGTLLAGA 533


>gi|410614948|ref|ZP_11325982.1| hypothetical protein GPSY_4260 [Glaciecola psychrophila 170]
 gi|410165514|dbj|GAC39871.1| hypothetical protein GPSY_4260 [Glaciecola psychrophila 170]
          Length = 1204

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 84/327 (25%), Positives = 139/327 (42%), Gaps = 54/327 (16%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN-EQGFSAENIRAL 1256
            L  ELY  ++HF+ EL+QNA+D    E     + F L ES ++V +N ++ F+ E++ ++
Sbjct: 34   LLTELYPDNAHFIYELLQNAEDAEASE-----IHFSLSESCLIVKHNAKRQFTFEDVNSI 88

Query: 1257 CDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVP 1316
              +GN+TKK      IG+ G+GFK+VF  T +P + SN           Q  F +  LV 
Sbjct: 89   TGIGNTTKKDDVVQ-IGKFGVGFKAVFSYTTSPRVFSN-----------QYSFEIRDLVI 136

Query: 1317 PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDM---FSDLHPSLLL 1373
            P  I+             E   + T    PF +  S  +  +  +++      L   +LL
Sbjct: 137  PHKINN----------NYEGH-YETIFEFPFNS--SNKLPKDAFIEIKKGLDTLSVKILL 183

Query: 1374 FLHRLQCIMFRNMLNDSLVVIRKKIVGDGI---IKVSCGEDKMTWFVASQKLRAGVIRPD 1430
            FL  ++ I +    + +  + +   +GD I    K   G   +T ++   +   G I   
Sbjct: 184  FLQNIKKITWEIESHKTATIRKVNEIGDYISIEKKSRDGLITLTHWLRFNRWLQGYIGLC 243

Query: 1431 VKTTEIALALTLQESNEGNYGP-----------LLYQQPVFAFLPL--RTYGLKFILQGD 1477
            V    IA  L  +  N   + P           +  Q  V  + P    T  LKF +   
Sbjct: 244  VS---IAYKLDYKTKNITEFNPKKGLDEQFKIVIDLQSNVSVYFPAEKETSNLKFNINAP 300

Query: 1478 FVLPSSREEVDGNSPWNQWLLSEFPAL 1504
            F    +R+ +  N P N+ L+ E   L
Sbjct: 301  FASTVARDSIQ-NRPENEILIEEIAKL 326


>gi|254516519|ref|ZP_05128578.1| conserved hypothetical protein [gamma proteobacterium NOR5-3]
 gi|219674942|gb|EED31309.1| conserved hypothetical protein [gamma proteobacterium NOR5-3]
          Length = 1083

 Score = 71.6 bits (174), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 57/206 (27%), Positives = 92/206 (44%), Gaps = 31/206 (15%)

Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN-EQGFSAEN 1252
             L  L  ELY  ++HF+ E++QNA+D         T+ F+L+   +VV ++ E+ FS  +
Sbjct: 28   GLRNLLSELYPDNAHFIYEVLQNAEDA-----RATTVDFMLESDRLVVTHDGERPFSLND 82

Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLP 1312
            I ++  +G STKK      IG+ G+GFK+VF  T  PE+ S           G+  FV+ 
Sbjct: 83   IESITSIGQSTKKDDETS-IGKFGVGFKAVFAYTTRPEVRS-----------GKFNFVIE 130

Query: 1313 TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGIAMNNIVDMFSDLHPSL 1371
             L  P   D              +    T    PF R +    +A+  +     +L    
Sbjct: 131  DLFVPRLTDG------------SAPTGKTSFTFPFDRPEKESSVAVAEVQRGLQELDEKT 178

Query: 1372 LLFLHRLQCIMFRNMLNDSLVVIRKK 1397
            LLFL  +  I +        +V+R++
Sbjct: 179  LLFLSHISTINYSLPDGTDAIVMREE 204


>gi|159486652|ref|XP_001701352.1| hypothetical protein CHLREDRAFT_153825 [Chlamydomonas reinhardtii]
 gi|158271747|gb|EDO97560.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 91

 Score = 71.2 bits (173), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 48/85 (56%), Positives = 54/85 (63%), Gaps = 10/85 (11%)

Query: 1216 NADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK---------- 1265
            NADDN YP  V P L F+L    I   NNE GFSA ++RALCDVG STK           
Sbjct: 1    NADDNAYPAGVTPCLEFVLGRGHITAANNELGFSAAHLRALCDVGASTKAVGGGAGGGGG 60

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPE 1290
            G   GYIG+KGIGFKSVFRV+ AP+
Sbjct: 61   GGGGGYIGQKGIGFKSVFRVSAAPQ 85


>gi|320594061|gb|EFX06464.1| hypothetical protein CMQ_6785 [Grosmannia clavigera kw1407]
          Length = 353

 Score = 70.9 bits (172), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/151 (31%), Positives = 72/151 (47%), Gaps = 12/151 (7%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNNEQGFSAENI 1253
            L+  S+   S+ +HF+ EL+QNADD  Y  NV    + +       ++ +NE GF+  ++
Sbjct: 65   LNLASKGQNSKSTHFIYELLQNADDASYGGNVPRVCIRYDTTNRTFLIESNEIGFTKADV 124

Query: 1254 RALCDVGNSTKK---------GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD--T 1302
             ++C    STK           ++   IG KG+GFKSVF   D   I S  +  +FD  T
Sbjct: 125  ESICSAAESTKSKKMPAASATATTERSIGEKGLGFKSVFLAADKVWISSGHYSFRFDGET 184

Query: 1303 SEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQ 1333
              GQI  +  T  P    D F   L   P++
Sbjct: 185  PVGQIQPIWDTQFPGLKTDGFTAKLLHIPIR 215


>gi|448300737|ref|ZP_21490736.1| type III restriction protein res subunit [Natronorubrum tibetense
            GA33]
 gi|445585556|gb|ELY39851.1| type III restriction protein res subunit [Natronorubrum tibetense
            GA33]
          Length = 1611

 Score = 70.9 bits (172), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 116/513 (22%), Positives = 201/513 (39%), Gaps = 66/513 (12%)

Query: 1213 LVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYI 1272
            ++QNADD I     E  +T  L E  +V  N+E+  S +N+ AL     STK+G     I
Sbjct: 1    MLQNADD-IGGNCSE--VTIQLTEDSLVFQNHEEPMSVKNVEALGAYTRSTKQGD-LDSI 56

Query: 1273 GRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPV 1332
            G  GIGFK+VF + +   +HS  F  ++ T E        T+  P            D  
Sbjct: 57   GHFGIGFKTVFSLANVVYVHSGFFSFRYTTDER-------TVPQPVG--------QNDQP 101

Query: 1333 QLESKCWN-TCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL 1391
              + +C++ T I LP      +      I++   D   SLL FL+ +  +          
Sbjct: 102  ATDDECYDGTTITLPLTDDAKKN--RREILEEQLDSIGSLLPFLNNIDTVHVSAF--GER 157

Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPD-----------------VKTT 1434
               R   + DG  +V C ++         +L +G   PD                 ++  
Sbjct: 158  KTFRASPIDDGGFEVICEQNGEPVRSERMRLFSGTFEPDRELVNYLAEKRNLNTDALREQ 217

Query: 1435 EIALALTLQESNEGNYGPL-LYQQPVFAFLPLR-TYGLKFILQGDFVLPSSREEVDGNSP 1492
             + L + +    + +  P+ L +  +F + P      L F +Q DF L   R+ +     
Sbjct: 218  TLKLDIEIAIPIDTDGAPMALDESHLFCYFPTEPDTNLPFHIQADFSLKPDRKHIIWPDE 277

Query: 1493 WNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVP---LVGEVHGFFSGLPRM 1549
            +N+ LL+    +F   E +F  L     +P++     +  VP   L  E+  +   L   
Sbjct: 278  FNEQLLNCAVDVF---ETAFVQLHSELVDPSQI----LELVPDPNLEREMDPYIDPLVTD 330

Query: 1550 ILSKLRMSNCLILEGNNNQWAPPCKVLRGWNDRAHSLLPDILLQKHLGLGFLSKDIVLSD 1609
            I+S +R  +C + +G +N + P   V   + D    L  +  +   L          +SD
Sbjct: 331  IVSFVRSESC-VPDGRDNLFQPSEVVF--FADPFRDLFTEAEVSDVLDRPVKYPSEQISD 387

Query: 1610 SLARALGIEEYGPKILLQIISSLCRTENGLRSMGLSWL--------ASWLNELYTISFHS 1661
            +    L       ++ L+     C   +   S   +WL          W +E     F  
Sbjct: 388  TARERLQAVVPDSQLQLETFLDHCTDSSLFESKSDNWLIRFLAGIMRHWSSEYEVSRFGR 447

Query: 1662 SGQSSLQSGVETDLIDNLQRIPFIPLSDGTFSS 1694
            +  SS +   + +  D+L+ + F+PL DG+ +S
Sbjct: 448  T--SSEEKQAQKEFTDSLEEVSFLPLEDGSVTS 478


>gi|148550774|ref|YP_001260213.1| hypothetical protein Swit_5338 [Sphingomonas wittichii RW1]
 gi|148503193|gb|ABQ71446.1| hypothetical protein Swit_5338 [Sphingomonas wittichii RW1]
          Length = 1934

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 91/349 (26%), Positives = 151/349 (43%), Gaps = 68/349 (19%)

Query: 1196 HCLSQELYSQDSH--FLLELVQNADDNIYPENVEPTLTFILQ----ESG-IVVLNNEQGF 1248
              LSQ + + D H  FL+EL+QN  D       +  +  +L     +SG + V N  Q F
Sbjct: 128  QSLSQHIIA-DYHGRFLIELIQNGHDAHDKARRDGEIAVLLAGDEGDSGTLYVANRGQPF 186

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS-----------NGFH 1297
            ++ N+ ALC++G S+K    A  +G KG+GF+SV  V+D P+I+S           +G+ 
Sbjct: 187  ASRNVDALCEMGLSSKPPGEA--VGNKGLGFRSVRHVSDIPQIYSRLAEDSPGDLFDGYR 244

Query: 1298 VKFDTSEGQIGFVLP---------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR 1348
              F     ++  +LP         T +P F I  +         +   K +++ IRLP R
Sbjct: 245  FTFARGS-ELDPLLPDEATRALARTDLPAFYIPRWLDQTPDTVREFARKGFSSVIRLPLR 303

Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC----IMFRNMLNDS--LVVIRKKIVGD- 1401
             + +  +    I+ +     P +LLFL R+      I    +  D+  L + R +   D 
Sbjct: 304  NQAALDVVRAEILSLAKAEAP-MLLFLRRIATLRAEIQGDRIKEDALDLKLTRSETPLDD 362

Query: 1402 --GIIKVSCGEDKMTWFVASQKLRAGVIRPDVK-----------------TTEIALALTL 1442
              G   V  GE+   W VA  ++  G ++  V                  + E+A+A+ +
Sbjct: 363  AQGPALVDLGENGSFW-VARAEIPEGDMQAAVAEGLESGALPESWNGWKGSGEVAVAVPI 421

Query: 1443 QESNEGNYGPLLYQQPVFAFLPL---RTYGLKFILQGDFVLPSSREEVD 1488
             E+   ++ P LY      FLP+    T   +  L G F   S+R  +D
Sbjct: 422  GETMP-DFHPRLY-----TFLPMGRGATSPFRGHLHGSFFPSSNRTSLD 464


>gi|402825950|ref|ZP_10875192.1| hypothetical protein LH128_22929 [Sphingomonas sp. LH128]
 gi|402260506|gb|EJU10627.1| hypothetical protein LH128_22929 [Sphingomonas sp. LH128]
          Length = 1846

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 88/348 (25%), Positives = 149/348 (42%), Gaps = 66/348 (18%)

Query: 1196 HCLSQELYSQDSH--FLLELVQNADDNIYPENVEPTLTFILQ----ESG-IVVLNNEQGF 1248
              LSQ + + D H  FL+EL+QN  D       +  +  +L     +SG + V N  Q F
Sbjct: 40   QSLSQHIIA-DYHGRFLIELIQNGHDAHDRARRDGEIAVLLAGDEGDSGTLYVANRGQPF 98

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS-----------NGFH 1297
            ++ N+ ALC++G S+K    A  +G KG+GF+SV  V+D P+I+S           +G+ 
Sbjct: 99   ASRNVDALCEMGLSSKPPGEA--VGNKGLGFRSVRHVSDIPQIYSRLTEDSPGDLFDGYR 156

Query: 1298 VKFDTSEGQIGFVLP---------TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR 1348
              F     ++  +LP         T +P F I  +         +   K +++ IRLP R
Sbjct: 157  FTF-ARGSELDPLLPDEATRALARTDLPAFYIPRWLDQTPDTVKEFARKGFSSVIRLPLR 215

Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI--------MFRNMLNDSLVVIRKKIVG 1400
             + +  +    I+ +     P +LLFL R+  +        +  + L+  L      +  
Sbjct: 216  NQAALDVVRAEILGLAKAEAP-MLLFLRRIATLRAEIQGDRIKEDALDLKLSRSETPLED 274

Query: 1401 D-GIIKVSCGEDKMTWF------------VASQKLRAGVIRPDVK----TTEIALALTLQ 1443
            D G   V  GE+   W               ++ L +G +    K    + E+A+A+ + 
Sbjct: 275  DQGPALVDLGENDSFWVARAEIPESDMQAAVAEGLESGALPESWKGWKGSGEVAVAVPIG 334

Query: 1444 ESNEGNYGPLLYQQPVFAFLPL---RTYGLKFILQGDFVLPSSREEVD 1488
            E+   ++ P LY      FLP+    T   +  L G F   S+R  +D
Sbjct: 335  ETMP-DFHPRLY-----TFLPMGRGATSPFRGHLHGSFFPSSNRTSLD 376


>gi|195996079|ref|XP_002107908.1| predicted protein [Trichoplax adhaerens]
 gi|190588684|gb|EDV28706.1| predicted protein [Trichoplax adhaerens]
          Length = 1467

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 146/356 (41%), Gaps = 50/356 (14%)

Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
            Q +++    F++EL+QNADD     +    + F      +   +N + F  E++ A+CD 
Sbjct: 39   QTVFNSPEKFVMELLQNADD-ASCGDTSVDVRFRFYNGYLAFSHNGKHFDEEDVEAICDN 97

Query: 1260 G--NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF----------DTSEGQI 1307
               N   K  +    G KGIGFKSVF + D   I S  +  +F          D+ E + 
Sbjct: 98   AQQNLHSKRLNINKTGYKGIGFKSVFIIADCTYIISRNYCFRFDRDYTNWKESDSDENRA 157

Query: 1308 GF----VLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDM 1363
            G     ++P    P ++    ++L  +P           +   FR + +  + + + +++
Sbjct: 158  GRYPWQIIPIWTEPCDVP---KILQLNPAD---------VNFLFRLRSNNDVIV-SALNI 204

Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED-KMTWFVASQKL 1422
             S+  P  +LFL  +Q I+  N  + + +  R ++       +      K +W V +  +
Sbjct: 205  LSNT-PRTMLFLRHIQNIVIDNHGSITCINRRNELTSQQRYDLYLNNVLKSSWHVMAYSV 263

Query: 1423 --------RAGVIRPD--------VKTTEIALALTLQESNEGNYGPLLYQQPVFAFLP-L 1465
                    +   + P          K  EI  A+ +          L     +F +LP L
Sbjct: 264  NIPKTVLAQLQAMGPQECPEKLKKAKNIEIKFAINIDPVENTKLNALDKNANLFCYLPTL 323

Query: 1466 RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFREN 1521
               GL F++  DF+L + R ++  N+ WN +L      L +++       P FR++
Sbjct: 324  VNCGLPFLVNTDFLLTADRMQLLSNA-WNNFLFESIGKLHITSLAEMVQFPGFRDD 378


>gi|339896689|ref|XP_001463180.2| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321398943|emb|CAM65531.2| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 1660

 Score = 68.2 bits (165), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 71/281 (25%), Positives = 131/281 (46%), Gaps = 28/281 (9%)

Query: 170 LKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGI 229
           L++ + S +SL F +++VP+L  +   + R+ + +   +  R I ++++LEE +C  EG+
Sbjct: 85  LEMLISSTNSLPFALRDVPALANVAALQLRLTSTVMAAISTRAIVTVHELEEWVCAQEGV 144

Query: 230 EKFEELELGPFLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKV--- 285
           E F EL LG  L+  P++  YF ++++  VVF +   DVI  L    DT  A+D+ +   
Sbjct: 145 EHFAELGLGVGLQVLPVVQEYFQLRAN-SVVFPVRARDVIAFL--LNDT-TARDMLLYGG 200

Query: 286 ----DEFLDFIA--KKRSL--ASKEQLGVRIQNLGMHINFIREARKSQDVT--LKKFLKE 335
               D    F A  ++  L  AS       ++  G  +N  +     QD T  L    +E
Sbjct: 201 GDTRDLLYRFAAFYERHVLRNASPSSASATVRGQGRLLNVRQLGIHVQDYTALLAALTQE 260

Query: 336 LQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVSSSSEDEDSD 395
           L   H+     +   + +R ++   +A    V   +S  +  G    +  S++ ED    
Sbjct: 261 LASAHR-----LEQQQYQRWIESCVTAETA-VTQTASTRDSAGVAAAKD-SAAGEDALVR 313

Query: 396 DCTYECSNDISSNVQLPSQIKGS-DRVSSCPYPSVTEELKR 435
           D ++E +     N  L  ++  S D  S+C +  + E ++R
Sbjct: 314 DASFEVAARRERNRALYERVLHSFD--SACAHAELVEGMRR 352


>gi|398010417|ref|XP_003858406.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322496613|emb|CBZ31683.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 1656

 Score = 68.2 bits (165), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 62/103 (60%), Gaps = 2/103 (1%)

Query: 170 LKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGI 229
           L++ + S +SL F +++VP+L  +   + R+ + +   +  R I ++++LEE +C  EG+
Sbjct: 85  LEMLISSTNSLPFALRDVPALANVAALQLRLTSTVMAAISTRAIVTVHELEEWVCAQEGV 144

Query: 230 EKFEELELGPFLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCL 271
           E F EL LG  L+  P++  YF ++++  VVF +   DVI  L
Sbjct: 145 EHFAELGLGVGLQVLPVVQEYFQLRAN-SVVFPVRARDVIAFL 186


>gi|195997659|ref|XP_002108698.1| predicted protein [Trichoplax adhaerens]
 gi|190589474|gb|EDV29496.1| predicted protein [Trichoplax adhaerens]
          Length = 1419

 Score = 68.2 bits (165), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 86/351 (24%), Positives = 144/351 (41%), Gaps = 57/351 (16%)

Query: 1186 KQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNE 1245
            K    L   L    Q +++    F LEL+QNADD    +N    +TF + +  +   +  
Sbjct: 19   KNSTTLANTLDKDIQTVFNSPEKFALELLQNADDAAI-DNQPIHVTFQINQRYLAFSHTG 77

Query: 1246 QGFSAENIRALCDVGNS--TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            + F  +++ A+CD      T+K +     G KGIGFKSVF ++    I S  +  +FD  
Sbjct: 78   RHFDYQDVEAICDNAQQQFTRKKTDLSKTGFKGIGFKSVFVISHRVYIISRNYCFRFDRD 137

Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP--FRTKFS--------- 1352
              +            + D F R     P Q+ +  W    +LP  F    S         
Sbjct: 138  YAKWR--------NNDSDRFRRY----PWQI-TPIWTQQNQLPKVFELNKSHVTFLFQLV 184

Query: 1353 -EGIAMNNIVDMFSDLH-----PSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKV 1406
             E  ++NN+  +++ L      P+ +LFL  +Q I  R       + +RK        ++
Sbjct: 185  DETRSVNNVEKVYNALKILMQTPNTILFLRNIQEITIRQHNRIDSIQLRKLFDTSNQYEL 244

Query: 1407 SCGED-KMTWFVAS----------QKLRAGVIRPD-------VKTTEIALALTLQESNEG 1448
               +  K  W + S          QKL+     PD        + T + L        EG
Sbjct: 245  YVNDTLKSKWLIRSDEQQIPNEIHQKLQR---LPDQECPTKLKQATSVNLKFAFLFGEEG 301

Query: 1449 NYGPLLYQQPVFAFLPLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
                  ++  +F +LP + Y GL F++ GDF+L + R ++  N  WN+++ 
Sbjct: 302  TLKA-HHKAVLFCYLPTQMYSGLPFLVNGDFLLSADRTQLLTN-IWNEYMF 350


>gi|300787661|ref|YP_003767952.1| hypothetical protein AMED_5803 [Amycolatopsis mediterranei U32]
 gi|384151062|ref|YP_005533878.1| hypothetical protein RAM_29730 [Amycolatopsis mediterranei S699]
 gi|399539544|ref|YP_006552206.1| hypothetical protein AMES_5725 [Amycolatopsis mediterranei S699]
 gi|299797175|gb|ADJ47550.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
 gi|340529216|gb|AEK44421.1| hypothetical protein RAM_29730 [Amycolatopsis mediterranei S699]
 gi|398320314|gb|AFO79261.1| hypothetical protein AMES_5725 [Amycolatopsis mediterranei S699]
          Length = 1728

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 88/366 (24%), Positives = 147/366 (40%), Gaps = 73/366 (19%)

Query: 1206 DSHFLLELVQNADDNIYPENVEPTLTFILQES----GIV-VLNNEQGFSAENIRALCDVG 1260
            D   ++EL+QN  D       +  L F+L E+    G++   N  +    EN R++C + 
Sbjct: 54   DGRTVVELLQNGHDAHDSGRSDGKLEFLLDENEGAHGVLYAANGGRPLDQENFRSMCLIA 113

Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFHVKF---DTSEG--- 1305
             S+K+      IG KG+GFKSV +++D PE++S         +GF  +F   D  +G   
Sbjct: 114  RSSKRPDEG--IGNKGVGFKSVLQLSDCPEVYSARHEGAGRFDGFRFRFGRPDDFDGLAA 171

Query: 1306 ------------------QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
                               +   +P    P  ++ F R           + ++T +RLP 
Sbjct: 172  RSKENWPGLADELRENVSTLKITVPLFETPTVVEDFAR-----------RGYSTVVRLPL 220

Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDS-----LVVIRKKIVGDG 1402
            R+  +   A + + ++  D  P   LFL R+  I+ R+   ++     L    +   G  
Sbjct: 221  RSASTRDHARDQLEELAHDEVP-FHLFLTRVAEIVVRSQAGETESRRRLTRSTRARRGTS 279

Query: 1403 IIKV---SCGEDK----MTWFVASQKLRAGVIRPDVKTTEIALALT-LQESNEGNY---- 1450
             + V     GE +    +   VA   +RA  IR  +    I       Q + E +     
Sbjct: 280  TLSVEHIDLGEQREYVILKRVVAETGIRA-AIRASLDDNAIGSGWAGWQGAAEVSVAVPL 338

Query: 1451 GPLLYQQPVFAFLPLRTYG---LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
            G  L Q  V+ FLPL       +  ++ G F     R +V  + P N +L +E  AL   
Sbjct: 339  GAALSQGRVYTFLPLGPGSPCPIAALVNGPFFTRLDRRDVHESIPLNDFLFAEIAALCAD 398

Query: 1508 AERSFC 1513
            A  + C
Sbjct: 399  AVVAAC 404


>gi|195997657|ref|XP_002108697.1| hypothetical protein TRIADDRAFT_51895 [Trichoplax adhaerens]
 gi|190589473|gb|EDV29495.1| hypothetical protein TRIADDRAFT_51895 [Trichoplax adhaerens]
          Length = 558

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 83/337 (24%), Positives = 141/337 (41%), Gaps = 57/337 (16%)

Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
            Q +++    F LEL+QNADD    +N    +TF + +  +   +  + F  +++ A+CD 
Sbjct: 33   QTVFNSPEKFALELLQNADDAAI-DNQPIHVTFQINQRYLAFSHTGRHFDYQDVEAICDN 91

Query: 1260 GNS--TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPP 1317
                 T+K +     G KGIGFKSVF ++    I S  +  +FD    +           
Sbjct: 92   AQQQFTRKKTDLSKTGFKGIGFKSVFVISYRVYIISRNYCFRFDRDYAKWR--------N 143

Query: 1318 FNIDMFCRLLSKDPVQLESKCWNTCIRLP--FRTKFS----------EGIAMNNIVDMFS 1365
             + D F R     P Q+ +  W    +LP  F    S          E  ++NN+  +++
Sbjct: 144  NDSDRFRRY----PWQI-TPIWTQQNQLPKVFELNKSHVTFLFQLVDETRSVNNVEKVYN 198

Query: 1366 DLH-----PSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED-KMTWFVAS 1419
             L      P+ +LFL  +Q I  R       + +RK        ++   +  K  W + S
Sbjct: 199  ALKILMQTPNTILFLRNIQEITIRQHNRIDSIQLRKLFDTSNQYELYVNDTLKSKWLIRS 258

Query: 1420 ----------QKLRAGVIRPD-------VKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
                      QKL+     PD        + T + L        EG      ++  +F +
Sbjct: 259  DEQQIPNEIHQKLQR---LPDQECPTKLKQATSVNLKFAFLFGEEGTLKA-HHKAVLFCY 314

Query: 1463 LPLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
            LP + Y GL F++ GDF+L + R ++  N  WN+++ 
Sbjct: 315  LPTQMYSGLPFLVNGDFLLSADRTQLLTN-IWNEYMF 350


>gi|242071029|ref|XP_002450791.1| hypothetical protein SORBIDRAFT_05g018540 [Sorghum bicolor]
 gi|241936634|gb|EES09779.1| hypothetical protein SORBIDRAFT_05g018540 [Sorghum bicolor]
          Length = 1582

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 71/265 (26%), Positives = 119/265 (44%), Gaps = 47/265 (17%)

Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFVLPTLVP--PFNIDMFCRLLSKD 1330
            GIGFKSVF ++  P I SNG+ +KF+    +E  IG+++P  V   P   D+      K 
Sbjct: 25   GIGFKSVFLISSQPHIFSNGYQIKFNEKPCAECNIGYIVPEWVESGPTLSDI------KA 78

Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND- 1389
                      T I LP +++      ++ +    S LHP +LLFL +++ +  +   +D 
Sbjct: 79   IYGCSKILPTTTIILPLKSE-----KVDAVKKQLSSLHPEMLLFLSKIRKLSVQEHNSDP 133

Query: 1390 -SLVV------------IRKKIVGDGII------KVSCGEDKMTWFVASQKLRAGVIRPD 1430
             S+ V             RK +  +         +   GE++  +++  QK     ++P+
Sbjct: 134  KSITVSEISISSEKNYQARKNMHAESYTLHLSSEESGKGEEECGYYMWKQKFP---VKPE 190

Query: 1431 VKT------TEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSS 1483
             +        E  + L        + G  L    V+AFLP        FI+Q DF+L SS
Sbjct: 191  YRVDKRAEIDEWVITLAFPHGQRLSRGKQL-SPGVYAFLPTEMVTNFPFIIQADFLLASS 249

Query: 1484 REEVDGNSPWNQWLLSEFPALFVSA 1508
            RE +  +S WN+ +L   P+ F++A
Sbjct: 250  REAILFDSLWNKGILECVPSAFMNA 274


>gi|455650231|gb|EMF29014.1| hypothetical protein H114_11886 [Streptomyces gancidicus BKS 13-15]
          Length = 1765

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 92/427 (21%), Positives = 163/427 (38%), Gaps = 97/427 (22%)

Query: 1206 DSHFLLELVQNADDNIYPENVEPTLTFILQE-----SGIVVLNNEQGFSAENIRALCDVG 1260
            +   L+EL+QNA D       +  +   L E       + V N  + F  +N + LC + 
Sbjct: 51   EGRVLIELLQNAHDAHPARATDGRIEIRLDEDEGEHGTLYVANCGKPFGGKNFKDLCSIA 110

Query: 1261 NSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFHVKF----------- 1300
             S+K+  +   IG KG+GFKSV  + DAPE++S         +GF  +F           
Sbjct: 111  LSSKRADAG--IGHKGVGFKSVLHLCDAPEVYSVAREGSRLLDGFTFRFARLDDYDALAR 168

Query: 1301 DTSEGQIGFV-----------LPTLV--PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF 1347
            + +  + GF            +P  +  PP  +  F R           + + T IRLP 
Sbjct: 169  EVAPEREGFADYLKENLLTLKVPVFLEEPPKTVQGFAR-----------RGFVTVIRLPL 217

Query: 1348 RTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF--------RNMLNDSLVVIRKKIV 1399
            ++  +   A   + ++  D  P   LFL RL+ +          R    D  V +     
Sbjct: 218  KSADARSAAGAQVRELMDDNAP-FELFLDRLERVRLEWRAAGETRRRTYDRQVEVLHHAR 276

Query: 1400 GDGIIKVS----------CGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGN 1449
            G  + +++          CG  K   + A + L A + +  ++   +A     +      
Sbjct: 277  GLKVQQITLRRRMRLIVVCG--KADPYRAREALAASIEKSGMRRDWMAWEKKAEVRVAVP 334

Query: 1450 YGPLLYQQPVFAFLPLR---TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFP---- 1502
             G  L    ++AFLP+    +  ++  +   F    +R   +   PWN  LL        
Sbjct: 335  VGDPLRDGRLYAFLPMGEQISAPVRGYVHAPFFAEMNRRSFNEAVPWNGLLLDTVAEVCA 394

Query: 1503 -ALFVSAE-------RSFCDLPCFRENP-AKAASVY---------MSFVPLVGEVHGFFS 1544
             A+  +A+        +  DL C++    A+  S +         + F+P++G   G  +
Sbjct: 395  RAVLAAADGRASIQPGALVDLMCWQPKALARLTSAFQRLDNGIAHVPFIPVIGTTPGART 454

Query: 1545 GLPRMIL 1551
             L R +L
Sbjct: 455  SLHRALL 461


>gi|157864316|ref|XP_001680868.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68124160|emb|CAJ02143.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 1657

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 67/289 (23%), Positives = 131/289 (45%), Gaps = 44/289 (15%)

Query: 170 LKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGI 229
           +++   S +SL F +++VP+L  +   + R+ + +   +  R I ++++LEE +C  EG+
Sbjct: 85  MEVLASSTNSLPFALRDVPALANVAALQLRLTSTVMAAISTRAIVTVHELEEWVCAQEGV 144

Query: 230 EKFEELELGPFLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDI----- 283
           E F EL LG  L+  P++  YF ++++  VVF +   DVI  L   +DT  A+D+     
Sbjct: 145 EHFAELGLGVGLQVLPVVQEYFQLRAN-SVVFPVRARDVIAFL--LSDT-TARDMLLYGG 200

Query: 284 --------KVDEFLDFIAKKRSLASKEQLGVR-------IQNLGMHINFIREARKSQDVT 328
                   +   F +    + +  S     VR       ++ LG+H+         QD T
Sbjct: 201 GDTRDLLYRFAAFYERHVLRNASPSSAPAAVRGQGRLLNVRQLGIHV---------QDYT 251

Query: 329 --LKKFLKELQPHHKRRKRPIFSSEKKRQLDERFSAICERVKSFSSINEDFGAKHIRFVS 386
             L    +EL   H+  ++     + +R ++   +A  E  ++ S+   D     +   S
Sbjct: 252 ALLAALTQELASAHQLEQQ-----QYQRWIESCATAETEVTQTAST--RDSAGVAVAKDS 304

Query: 387 SSSEDEDSDDCTYECSNDISSNVQLPSQIKGSDRVSSCPYPSVTEELKR 435
           ++ ED      ++E +     N  L  ++  S   S+C +  + E ++R
Sbjct: 305 AAGEDALVRAASFEVAARRERNRALYERVLHSFD-SACAHAELVEGMRR 352


>gi|384099073|ref|ZP_10000177.1| putative transcriptional regulator [Imtechella halotolerans K1]
 gi|383834205|gb|EID73649.1| putative transcriptional regulator [Imtechella halotolerans K1]
          Length = 1884

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 93/413 (22%), Positives = 165/413 (39%), Gaps = 112/413 (27%)

Query: 1176 ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQ 1235
            ++N E+N +      +   +  + + +YS     + EL+QNADD    E VE  + F   
Sbjct: 212  LNNPEANKM------IANLMREIGKGMYSSKQRMIFELLQNADDAPGKERVEFHIDF--N 263

Query: 1236 ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI--HS 1293
                 V+++   F+ +++ A+     STK+G S    G KGIGFKSVF  TD+ E+   S
Sbjct: 264  GDYFFVMHDGAPFNKDDVEAITSAAESTKRGDSKK-TGYKGIGFKSVF--TDSTEVWLKS 320

Query: 1294 NGFHVKFDTSEGQIGFVLPTLVPPF-NIDMFC----------RLLSKDPVQLESK--CWN 1340
             G+         Q  F+  T  P F N D F            LL +D ++  ++   +N
Sbjct: 321  GGY---------QFAFLRNT--PLFENFDNFYFSSERYQKYPELLDEDKLKYRNQRLRFN 369

Query: 1341 TCIRLPFR------------------------TKFSEGIAMNNIVDMFSDL-----HPSL 1371
                +P++                         +F+  +  NNI D  + +      P  
Sbjct: 370  GSTDIPWQVIPIWQNRLPSEFTDSNFNNFNNPVQFALKLGENNIEDYKTAIDNITKRPQF 429

Query: 1372 LLFLHRLQ--------CIMFRNMLNDSLVVIRKK-------------------IVGDGII 1404
            LLFL              + RN  N+ + +I+ K                   +  +   
Sbjct: 430  LLFLRNTSKFRSPKNGVTVLRNDANNVIEIIKSKRAEPDQVFHYTKQTFENVEVSDEAFE 489

Query: 1405 KVSCGEDKMT----------WFVASQKLRAGVIRPDVKT---TEIALALTLQE---SNEG 1448
            ++  G  K +          +F   +      I P + +   TEI+  ++L +   S E 
Sbjct: 490  QLDVGLKKQSKINDYNEVTYYFTDLEGREIETIPPKLASATQTEISFGISLVDNKISTEK 549

Query: 1449 NYGPLLYQ-QPVFAFLPLRT--YGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
             Y   L +   +F +LP+    + L F++  DFV  S R+ + G++ WN++++
Sbjct: 550  EYTKGLPKYSSLFTYLPMEDTRFQLPFLVNADFVPSSDRQRIQGDNLWNKYIM 602


>gi|381150652|ref|ZP_09862521.1| UvrD/REP helicase [Methylomicrobium album BG8]
 gi|380882624|gb|EIC28501.1| UvrD/REP helicase [Methylomicrobium album BG8]
          Length = 1718

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 88/344 (25%), Positives = 135/344 (39%), Gaps = 71/344 (20%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIV---------VLNNEQGFSAE-- 1251
            YS  +HF+ EL+QNADD         T TF L+ESG+           ++N Q   A+  
Sbjct: 39   YSDQAHFIYELLQNADDV-----KATTATFRLEESGLYFTHNGSVRFTISNPQNEEADTN 93

Query: 1252 -----NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
                 +I A+  + NS K  +S   IG+ G+GFK+VF+ T  P I+      K +     
Sbjct: 94   NGTLGHINAITSIANSNKTEAS---IGKFGVGFKAVFQYTQTPHIYDPEIRFKIER---- 146

Query: 1307 IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSE-GIAMNNIVDMFS 1365
              F++P ++   + D+  R  SK           T    PF  K  +   + + I++   
Sbjct: 147  --FIVPHML---DADLDWRNSSK-----------TVFFFPFDHKKKQPQESHDEILEKLK 190

Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
             L    +LFL  L+ + F          +  K       K+  G     W   + KL   
Sbjct: 191  ALEFP-VLFLSSLKSVSFEAE------KVSGKYTKKVTRKIEEGGITAQWITLALKLDGK 243

Query: 1426 --------VIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQG 1476
                      R +      ++   L ES  GN  P+   +  F F P +    L FIL  
Sbjct: 244  RTSQRLVMFTRNNASGHPCSIGYALGES--GNLVPI--DRTAFCFFPTKEATNLNFILHA 299

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRE 1520
             F+L  SRE +      N  L+ +       AE +   LP  R+
Sbjct: 300  PFLLTDSREGIKAGEKHNNELIEQL------AELAANSLPILRD 337


>gi|72391776|ref|XP_846182.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62358393|gb|AAX78857.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70802718|gb|AAZ12623.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1645

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 76/164 (46%), Gaps = 11/164 (6%)

Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
           L F ++++PS+ ++   + R+ A +   V  R + +L++LE  +C +EG+  F EL LG 
Sbjct: 80  LPFALKQIPSIARIAELQLRVTALVQSAVATRPVVTLHELELEVCSAEGVSIFAELGLGD 139

Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTH-------KAKDIKVDEFLDF 291
            L   P + H F ++S   VV  I ++D +  L  + D          A D  V  F  F
Sbjct: 140 SLTALPFVQHAFNLRSSTGVVAPITSDDFLQFLLFHADAEIMLTSGGDAGD-AVRAFAQF 198

Query: 292 IAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKE 335
              K    +  QLG+ IQ+    +  +R+        + + + E
Sbjct: 199 Y--KNGYYTPLQLGIHIQHFSWLLRLLRQETGRTSAFISQIVAE 240


>gi|448389606|ref|ZP_21565745.1| hypothetical protein C477_05907 [Haloterrigena salina JCM 13891]
 gi|445668380|gb|ELZ21009.1| hypothetical protein C477_05907 [Haloterrigena salina JCM 13891]
          Length = 1620

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 56/106 (52%), Gaps = 4/106 (3%)

Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
            Q+    D  F+ EL+QNA D +  ++ EP +   L E  + V NN + F  +++ AL  +
Sbjct: 44   QDRAGYDGRFIFELLQNAADEMA-DSDEPAIHIELSEETLYVANNGRAFDYKDLYALTMI 102

Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN-GFHVKFDTSE 1304
              +TK G     IG KG GF SV  VTD P + S+ G   KFD  E
Sbjct: 103  AETTKAGERT--IGHKGRGFTSVLGVTDQPSVFSSEGVSAKFDREE 146


>gi|313126065|ref|YP_004036335.1| hypothetical protein Hbor_13070 [Halogeometricum borinquense DSM
            11551]
 gi|448285904|ref|ZP_21477143.1| hypothetical protein C499_03988 [Halogeometricum borinquense DSM
            11551]
 gi|312292430|gb|ADQ66890.1| hypothetical protein Hbor_13070 [Halogeometricum borinquense DSM
            11551]
 gi|445575934|gb|ELY30397.1| hypothetical protein C499_03988 [Halogeometricum borinquense DSM
            11551]
          Length = 1674

 Score = 64.7 bits (156), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 3/102 (2%)

Query: 1200 QELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDV 1259
            Q+    +  F+ EL+QNA D +  +  EP + F L E  ++V NN   FS E++ AL   
Sbjct: 38   QDRADYEGRFVFELLQNAVDEM-DDTGEPRVRFELTEDSLLVANNGAAFSFEDLYALTLT 96

Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
              +TK G +   IG KG GF SV  VTD P ++S     +F+
Sbjct: 97   TRTTKAGETT--IGHKGRGFTSVLGVTDNPSVYSENVQAQFN 136


>gi|261329772|emb|CBH12754.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1683

 Score = 64.3 bits (155), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 75/164 (45%), Gaps = 11/164 (6%)

Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
           L F ++++PS+ ++   + R  A +   V  R + +L++LE  +C +EG+  F EL LG 
Sbjct: 118 LPFALKQIPSIARIAELQLRATALVQSAVATRPVVTLHELELEVCSAEGVSIFAELGLGD 177

Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTH-------KAKDIKVDEFLDF 291
            L   P + H F ++S   VV  I ++D +  L  + D          A D  V  F  F
Sbjct: 178 SLTALPFVQHAFNLRSSTGVVAPITSDDFLQFLLFHADAEIMLTSGGDAGD-AVRAFAQF 236

Query: 292 IAKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKE 335
              K    +  QLG+ IQ+    +  +R+        + + + E
Sbjct: 237 Y--KNGYYTPLQLGIHIQHFSWLLRLLRQETGRTSAFISQIVAE 278


>gi|336316858|ref|ZP_08571747.1| hypothetical protein Rhein_3170 [Rheinheimera sp. A13L]
 gi|335879023|gb|EGM76933.1| hypothetical protein Rhein_3170 [Rheinheimera sp. A13L]
          Length = 1411

 Score = 64.3 bits (155), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 105/460 (22%), Positives = 181/460 (39%), Gaps = 69/460 (15%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            +LEL+QNA D    ENV   +   L E G+++ NN + FS   +++L +   S K+ +  
Sbjct: 54   ILELLQNAADQAKAENVSGKVIIELTEVGLLIANNGKAFSVGGVKSLQNAHLSPKRLNQR 113

Query: 1270 GYIGRKGIGFKSVFRVTDAP-------------EIHSNGFHVKFDTS------------E 1304
             +IG KG+GF+S+   T  P             E+ +N      D S            +
Sbjct: 114  QFIGCKGLGFRSILNWTRNPIILSGELAIYYSDEVSANKLQELMDESSELADLVQRERDK 173

Query: 1305 GQIGFVLPTL-VPPFNI-DMFCRLLSKDPVQLESKC-------WNTCIRLPFRTKFSEGI 1355
                 +LP L  P F I       +S+    L  +C       + T I +PF    S   
Sbjct: 174  NNNDIILPKLPFPGFPIPTAMSGSVSQTTSFLFERCQYWKSLGYTTVIGIPFEKAESFSK 233

Query: 1356 AMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTW 1415
            AM  I      L P +LLF+  L  + F  +  + L V  K+   D  + +  GE    W
Sbjct: 234  AMQQI----ESLRPEMLLFVPYLDEVRFL-LPENKLSVWSKEGDDDASLVMQDGEPIGIW 288

Query: 1416 FVASQKLRAGVIRPDVKTT----EIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGL 1470
             +   +     +  +  T     EI +A+   +S        L   P+F + P      L
Sbjct: 289  QLFRTQGNVDKLYLEDSTAAVLYEIVIAVPEVKSRSE-----LKPSPLFTYFPTEINLPL 343

Query: 1471 KFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS-AERSFCDLPC-----FR----- 1519
              +    F L  SR ++      N ++ ++  +     AER     P      FR     
Sbjct: 344  SVVCHATFELDQSRNQISKLKS-NDYVFTQLASFLAEVAERRAAKYPVGANAGFRVAMPL 402

Query: 1520 -ENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPCKVLRG 1578
             E P     +   F  L+ E     + +P +  +  ++++   L+G N+ W P    + G
Sbjct: 403  GEYPESLKKI--GFPKLLNEACKTRTIVPTIAGNVSKVADVFSLKGANSDWLP----ISG 456

Query: 1579 WNDRAHSLLPDILLQKHLGLGFLSKDIVLSDSLARALGIE 1618
            + +   +   ++   + LG+  LS +  L ++L   +GI+
Sbjct: 457  FKNIVPNDDAELYWFRELGISELSHE-ALYEALDSLVGID 495


>gi|401415226|ref|XP_003872109.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488331|emb|CBZ23578.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 1662

 Score = 64.3 bits (155), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 57/97 (58%), Gaps = 2/97 (2%)

Query: 176 SWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEEL 235
           S SSL F +++VP+L  +   + R+ + +   +  R I ++++LEE +C  EG+E F EL
Sbjct: 92  STSSLPFALRDVPALANVAALQLRLTSTVMAAISTRAIVTVHELEEWVCAQEGVEHFAEL 151

Query: 236 ELGPFLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCL 271
            LG  L+  P++  YF ++++   VF +   DV+  L
Sbjct: 152 GLGIGLQVLPVVQEYFQLRAN-SAVFPVRARDVVAFL 187


>gi|340367655|ref|XP_003382369.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 4323

 Score = 64.3 bits (155), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 154/392 (39%), Gaps = 70/392 (17%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + +E Y      + E++QNADD        ++      +   +L+    
Sbjct: 1306 QHEPLTTRLSNILRE-YKDGLTIIKEMIQNADDAGATEVNILHDTRTHSSHNLVLEGMAE 1364

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ G+GF SV+ +TD P   S 
Sbjct: 1365 SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 1423

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQL-----ESKCWN-TCI 1343
             +   FD +   + G V     P   I    + L+K     P Q       S  +N T  
Sbjct: 1424 EWLYIFDPTLKHLKGVVQNESRPGKKIKFRSKFLAKSQQLAPYQGLFGFDSSSSYNGTIF 1483

Query: 1344 RLPFRT---KFSEGIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            RLPFRT   + S  I  N++V M   DL  +   LLLFL  ++ I F +      +V   
Sbjct: 1484 RLPFRTHPSQISSTIYNNHLVQMIKKDLESNGSKLLLFLQNVKRITFSSRQEGRPIVEHL 1543

Query: 1397 KIVGDGI----IKV----SCGEDKMTWFVASQKLRA----GVIRPDVKTTEIALALT--- 1441
             I    +    +KV    S  +    W +++Q+ R     G  R  V +    L      
Sbjct: 1544 SIECSNVDTENVKVCVTQSSTKSTEYWLLSNQEQRLKSQDGAYRQAVASVACKLVKKQSS 1603

Query: 1442 -LQESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------D 1488
             L E+ EGN          F FLPL     GL   +  +F + S+R  +          D
Sbjct: 1604 FLCEAIEGN---------AFCFLPLSVPATGLPVHVTANFAVMSNRSGIWTGASSAMASD 1654

Query: 1489 GNSPWNQWLLSE-FPALFVSAERSFCDLPCFR 1519
                WNQ L++   P  +    +S   + C R
Sbjct: 1655 EREYWNQQLMTTVIPEAYCKLLKSLQAMHCLR 1686


>gi|402311178|ref|ZP_10830127.1| GHKL domain protein [Eubacterium sp. AS15]
 gi|400365796|gb|EJP18844.1| GHKL domain protein [Eubacterium sp. AS15]
          Length = 841

 Score = 63.9 bits (154), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 60/109 (55%), Gaps = 1/109 (0%)

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGF 1248
            + L   L  L+  + +    ++ EL+QNADD  Y ++++  +   +++  + +   E+G 
Sbjct: 137  SNLSEQLVKLASNINNDSYRYIYELIQNADDCEYEDDIDINMQIDVEDGKLTISYPEKGM 196

Query: 1249 SAENIRALCDVGNSTK-KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
            +A +I A+  +G S K K   +  IG KG GFK++F   D  +I+SN F
Sbjct: 197  TASDIIAITTIGQSNKLKKKKSHIIGEKGKGFKTIFAACDNVQIYSNNF 245


>gi|340373769|ref|XP_003385412.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 3123

 Score = 63.9 bits (154), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 90/384 (23%), Positives = 150/384 (39%), Gaps = 67/384 (17%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + +E Y  D   + E++QNADD        +Y      T   + +    
Sbjct: 218  QHEPLTTRLCNILKE-YKDDITIIKEMIQNADDAGATKVNILYDSRTYSTQKLLFKGMAE 276

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N+  F+ E+   +  +  +TK       IG+ G+GF SV+ +TD P   S 
Sbjct: 277  SHGPALIVHNDSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 335

Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMF--CRLLS--KDPVQLE--------SKCWNTC 1342
             +   FD +   +  V+     P     +   R+ S  +D V  E        S    T 
Sbjct: 336  EWLYIFDPTLKYLKGVVHDQSKPGKKIKYQQSRIFSNCQDLVPYEGLFGFKSSSNYKGTM 395

Query: 1343 IRLPFRTKFSE---GIAMNNIVDMFS----DLHPSLLLFLHRLQCIMFRNMLNDSLVV-- 1393
             R PFRT  S+    +   +IV+       D    LLLFL  ++ I F +  ++  +   
Sbjct: 396  FRFPFRTSMSQISSKVYDEHIVEHMKRDLVDCGSKLLLFLQHMKHITFSSRRDNEEITHV 455

Query: 1394 ------IRKKIVGDGIIKVSCGEDKMT-WFVASQK---LRAGVIRPDVKTTEIALALTLQ 1443
                           +IK  C E     W +ASQ+       + +P V +    +A  L 
Sbjct: 456  SIDISSSNSNDFQKVVIKSPCIESSTEHWLIASQEGQWQHQNITKPAVAS----VACQLD 511

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------DGNS 1491
            E N G Y     +   F FLPL   + GL   +  +F + S+R  +          D   
Sbjct: 512  EGNLGFYKCKEVEGAAFCFLPLAMPSTGLPVHVSANFAVMSNRSGIWTEVSSDTVTDSRV 571

Query: 1492 PWNQWLLSEFPALFVSAERSFCDL 1515
             WN+ L+ +      +  +++C L
Sbjct: 572  QWNKNLMEK------TIPKAYCKL 589



 Score = 43.5 bits (101), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 59/123 (47%), Gaps = 17/123 (13%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADD------------NIYPENVEPTLTF-- 1232
            Q   L R +  + QE Y  D+ FL EL+QNADD              +     P+ T+  
Sbjct: 1171 QKEELTRRIRNILQE-YPLDATFLKELLQNADDAKATKIRIILDKRQHGNETVPSETWGK 1229

Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
             LQ   ++V N+++ FS  ++  +  +G  +K+      IG+ GIGF  V+ +TD P   
Sbjct: 1230 ELQGPALLVWNDKE-FSDVDLEGIQKLGLGSKRDDDES-IGQFGIGFNVVYHLTDCPSFI 1287

Query: 1293 SNG 1295
            + G
Sbjct: 1288 TRG 1290


>gi|218891094|ref|YP_002439960.1| hypothetical protein PLES_23571 [Pseudomonas aeruginosa LESB58]
 gi|218771319|emb|CAW27084.1| hypothetical protein PLES_23571 [Pseudomonas aeruginosa LESB58]
          Length = 1603

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 85/322 (26%), Positives = 129/322 (40%), Gaps = 60/322 (18%)

Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQ-----------GFSAE-----NIR 1254
            LEL+QNADD         ++ F + + G+VV N+ Q            F AE     +  
Sbjct: 40   LELIQNADDA-----KAESVVFDVTDQGLVVFNSGQFTYCGGLDRPCAFLAEGNYKCDYH 94

Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
             + DVG S  K +    IGR GIGF S ++VTD PEI S G  +     +GQ  + +  L
Sbjct: 95   RITDVG-SGGKLTRGENIGRFGIGFVSTYQVTDHPEIRSAGIKLTLHPEKGQ--WSVEPL 151

Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
              P     F    + DP        NT  RL           ++ +VD F  +    LLF
Sbjct: 152  QQPEGTTFFLP-WADDP--------NTEARLALGVSHVSTAHIDQLVDDFQKVLRKSLLF 202

Query: 1375 LHRLQCIMFRNMLNDSLVVIRKKIVGDG---IIKVSCGEDKMTWFV-----ASQKLRAGV 1426
            L  ++    R   N  L++      GDG   I+      +   W +     A    R   
Sbjct: 203  LRHVRTAELRR--NGKLLLACDLERGDGSDLIVSFRPSGEVEQWHILQADAAESAARLYA 260

Query: 1427 IRPDVKT----TEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLP 1481
              P +++    T+I++ L +           L +  ++AFLP  ++ GL   +  DF   
Sbjct: 261  SHPRLESLGRGTKISIGLRIDPEP-------LAEGLLYAFLPTEQSTGLPLHINADFFPE 313

Query: 1482 SSREEV-----DGNSPWNQWLL 1498
            S R+ V          WN+ L+
Sbjct: 314  SDRKAVIFAGHQHEQAWNEMLI 335


>gi|390569708|ref|ZP_10249993.1| hypothetical protein WQE_15316 [Burkholderia terrae BS001]
 gi|389938568|gb|EIN00412.1| hypothetical protein WQE_15316 [Burkholderia terrae BS001]
          Length = 1042

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 91/366 (24%), Positives = 146/366 (39%), Gaps = 70/366 (19%)

Query: 1155 CEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELV 1214
             ED  L +E++RR+                 K    LG     + ++ Y   +HF+ EL+
Sbjct: 8    AEDRRLFIEALRRN-----------------KGDINLG-----IFEDFYPDRAHFIYELL 45

Query: 1215 QNADDNIYPENVEPTLTFILQESGIVVLNNE-QGFSAENIRALCDVGNSTKKGSSAGYIG 1273
            QNA+D    E      +F L   G   ++N  + F+  +IR +  + NSTKK  +   IG
Sbjct: 46   QNAEDAGATE-----ASFELMRDGCTFIHNGLRHFNEPDIRGITGIFNSTKK-DNLDKIG 99

Query: 1274 RKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQ 1333
            + G+GFKSVF  TD+P I+S  +  +          VLP  +P   +D+           
Sbjct: 100  KFGVGFKSVFVYTDSPVIYSRDYSFRISQ------LVLPEPIPQ-RVDLGA--------- 143

Query: 1334 LESKCWNTCIRLPFRT-KFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV 1392
                   T    PF + K     A + +      +    LL+L+ L+ I ++        
Sbjct: 144  ------TTRFEFPFNSPKKDAAQAFSEVRAGLDAVSERTLLYLNNLRYISWK-CEGSQAA 196

Query: 1393 VIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTL---QESNEGN 1449
            V R +     +  +   E++    V+S  LR       ++  ++A+A  L    +  E +
Sbjct: 197  VFRVEHTPSHVEVLKEAENRK--LVSSHWLRFTAPVTKLEHQKVAVAYELGFVGDVTEFD 254

Query: 1450 YGPLLYQQ---------PVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLL 1498
                + QQ          V  F P    T GL+F L   FV   SR  +  N P N  L 
Sbjct: 255  PSKSIAQQMKIIAAVRGQVSVFFPAEKETSGLRFHLHAPFVPELSRASIK-NCPENATLF 313

Query: 1499 SEFPAL 1504
             +  AL
Sbjct: 314  EQLAAL 319


>gi|423314391|ref|ZP_17292325.1| hypothetical protein HMPREF1058_02937 [Bacteroides vulgatus
            CL09T03C04]
 gi|392683161|gb|EIY76499.1| hypothetical protein HMPREF1058_02937 [Bacteroides vulgatus
            CL09T03C04]
          Length = 1864

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 93/388 (23%), Positives = 151/388 (38%), Gaps = 88/388 (22%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQG 1247
            + R    L  + + +YS       EL+QNADD   P+    ++        +V  ++   
Sbjct: 479  NRRDAHLLKEIGKNMYSSKERMFFELIQNADDAA-PQKDSVSINSFTTGDYLVFCHDGFS 537

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH--SNGFHVKFDTSEG 1305
            F   +  A+      TKK +     G KGIGFKSVF  TD+ +++  +NG+H KFD  + 
Sbjct: 538  FDKYDFEAITSAAVGTKKANENK-TGYKGIGFKSVF--TDSKQVYICTNGYHFKFDKEDE 594

Query: 1306 QI----GFVL--PTLVPPFNIDMFCRLLSK----------DPVQLE-----------SKC 1338
            +     GF L  P L        F  + S            P QLE            K 
Sbjct: 595  RFRTFDGFYLNNPFLDTEKAKKRFFSMYSSVRGNFDGVNDIPWQLEPIHVNEFPPEFGKS 654

Query: 1339 W---NTCIRLPFRTKFSEGI-----AMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM---- 1386
            +   N  I L    K  +G      A+++IVD      P  +LFL +   I F       
Sbjct: 655  FMKSNVSIALRLGEKNIQGTGGYESAIDSIVD-----DPKFMLFLRKTDRIRFNTRTISR 709

Query: 1387 -LNDSLVVIRKKIVGDGI-------------------------IKVSCGEDK----MTWF 1416
             +N+ ++ I+     + +                         IK+   ED        F
Sbjct: 710  EINEGIITIKNSFAQNKVERYERFDFAAGVSNEIFEEENLPIRIKIEEREDSGNILKALF 769

Query: 1417 VASQKLRAGVIRPDV---KTTEIALALTLQESNEGNYGPLLYQQ-PVFAFLP--LRTYGL 1470
            V  +  +   I   +    +TE++ A+ + E  +G+   L      +FAFLP  ++ +  
Sbjct: 770  VDLEGEQIENIPMKIAISSSTELSFAIPVNE--DGSISVLENNTISIFAFLPTLVKDFVF 827

Query: 1471 KFILQGDFVLPSSREEVDGNSPWNQWLL 1498
             F +  +F+L   R+ V G+S WN +L 
Sbjct: 828  PFYINANFILDPPRQRVLGDSAWNLYLF 855


>gi|189466556|ref|ZP_03015341.1| hypothetical protein BACINT_02931 [Bacteroides intestinalis DSM
            17393]
 gi|189434820|gb|EDV03805.1| hypothetical protein BACINT_02931 [Bacteroides intestinalis DSM
            17393]
          Length = 1554

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 86/367 (23%), Positives = 152/367 (41%), Gaps = 55/367 (14%)

Query: 1193 RALHCLSQ-ELYSQDSH---FLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGF 1248
            R L  + + E Y Q ++   +  EL+QN  D     + E  +   L+++ + + N    F
Sbjct: 20   RGLKTICEAEKYLQQAYEGRYFFELIQNVRDANKEASQEGDIYIELKDNVLSIANTGAPF 79

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT------ 1302
            S E I ++  +G S K+  S  +IG KGIGFKS   V+D P I +    + FD       
Sbjct: 80   SEEGIASITSIGQSPKQ--SLDFIGFKGIGFKSTLEVSDTPRIVTTEGAIYFDRKLTMEV 137

Query: 1303 -SEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
             S+     V   L   F      R L+ +      K   T I LP +  F+E      I 
Sbjct: 138  LSQYNSNLVENQLPLFFFPHYRSRALTSEE---REKGIVTKIELPLKESFAE----EKIC 190

Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE--DKMTWFVAS 1419
              F ++    L+ L  +Q + F      ++  I+K I  + I+  +  E  ++  ++V +
Sbjct: 191  ADFGEIKAKQLVLLGNIQKVDFTTPTQRTIYNIQKDIRKNLIVVDNGKEILNRFRYYVPN 250

Query: 1420 QKL------------RAGVIRPDVKTTEIALALTLQESN----EGNYGPLLYQQPVFAFL 1463
            +K             +   I  +  + +I + L L E+N    + N         ++ F 
Sbjct: 251  EKNFLPDDILNSLEGKEKEIYSNSTSIDIHIVLELDENNLFVSDAN-------AKLYLFY 303

Query: 1464 PLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLL--------SEFPALFVSAERSFCD 1514
            PL    G +F++   F++   R  +   SP N++LL        S+   L    + +  +
Sbjct: 304  PLNIRSGFRFMIHSYFLVNPERTRL-RKSPLNRYLLRKIGEYIGSDMLKLLKRGKYNTNE 362

Query: 1515 LPCFREN 1521
            + CF+ N
Sbjct: 363  ILCFKRN 369


>gi|389599951|ref|XP_001562029.2| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322504277|emb|CAM37053.2| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 1654

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 68/132 (51%), Gaps = 3/132 (2%)

Query: 142 DRAVVKARSELIMAGESVSSWKVSRSV-LLKLQVDSWSSLGFQMQEVPSLHKLMVTEARI 200
           D A V     L   G   +S +++     L     S +SL F +++VP+L  +   + R+
Sbjct: 66  DDADVHGYGRLSGVGHPRTSARIASGAHTLAAITSSINSLPFALRDVPALAHVAALQLRL 125

Query: 201 NAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQ-PLISHYFLVKSDVDVV 259
            + +   +  R I ++++LEE +C  EG+  F EL LG  L+  P++  YF ++++  VV
Sbjct: 126 TSSVMAAISTRAIVTVHELEEWVCAQEGVAHFAELGLGVGLQVLPVVQEYFQLRAN-SVV 184

Query: 260 FKIATNDVIVCL 271
           F +   DVI  L
Sbjct: 185 FPVRARDVIAFL 196


>gi|374812444|ref|ZP_09716181.1| hypothetical protein TpriZ_01095 [Treponema primitia ZAS-1]
          Length = 1490

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 68/313 (21%), Positives = 125/313 (39%), Gaps = 65/313 (20%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGI-VVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
            LLEL+QN DD     +++     +  ++ I  V NN   F+AE I  LC    S K   +
Sbjct: 41   LLELLQNVDDAAVKADIQNADVLLEYKNNIFTVANNGTQFTAETIERLCHGAASNK---T 97

Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF------------------- 1309
              +IG KG GF+S+       EIHS  FH+ F  +     F                   
Sbjct: 98   DDFIGNKGTGFRSLLNWGRQIEIHSGEFHIGFSQNYANSEFEKILSGCAILQEQKKNVPN 157

Query: 1310 -VLPTLVPPFNI-------DMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
              +P L  PF +       D   ++++ D +Q + +                     NI+
Sbjct: 158  LSMPVLHCPFEVNSLNNKYDTTIQIITDDSLQNDEQ---------------------NIL 196

Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE--DKMTWFVAS 1419
                      LLFL  +  I+ +   ++   +IRK    +  I++   E  +K  +F   
Sbjct: 197  KQLQRFDHKTLLFLPNIMQIIIK--TDEYTKIIRKAPKENEEIEIYVDETIEKYLYFKNP 254

Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG--LKFILQGD 1477
             + +    + + +    A+A+ L E +  +       + ++ F P+R++   L  ++   
Sbjct: 255  DEPQIDTGKNEKRKIRTAIAIALDEWDYSD-------ETMYCFFPIRSFSTPLNMLMHAT 307

Query: 1478 FVLPSSREEVDGN 1490
            F L ++R+++  N
Sbjct: 308  FELNANRDDIPPN 320


>gi|340383695|ref|XP_003390352.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 2691

 Score = 62.0 bits (149), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 95/387 (24%), Positives = 157/387 (40%), Gaps = 75/387 (19%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + +E Y      + E++QNADD        ++      +L  +L+    
Sbjct: 486  QHEPLTTRLSNILRE-YKDGLTIIKEMIQNADDAGATEVNILHDTRTHSSLNLVLEGMAE 544

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ G+GF SV+ +TD P   S 
Sbjct: 545  SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 603

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQL-----ESKCWN-TCI 1343
             +   FD +   + G V     P   +    + L+K     P Q       S  +N T  
Sbjct: 604  EWLYIFDPTLKHLKGVVQNENRPGKKLKFQSKFLAKSQQLAPYQGLFGFDSSASYNGTIF 663

Query: 1344 RLPFRT---KFSEGIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            RLPFRT   + S  I  N+++ M   DL  +   LLLFL  ++ I F +   D  ++   
Sbjct: 664  RLPFRTHPSQISSTIYNNHLIQMIKKDLESNGSKLLLFLQNVKRITFSSRQEDRPIIKHL 723

Query: 1397 KI--VGDGI--IKV----SCGEDKMTWFVASQKLRA----GVIRPDVKTTEIALALT--- 1441
             I  + D    +KV    S  +    W +++Q+ R     G  +  V +    L      
Sbjct: 724  SIECINDETENVKVCVTQSSTKSTEYWLLSNQEQRLKSQDGAYKQAVASVACKLVKKQSS 783

Query: 1442 -LQESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------D 1488
             L E+ EGN          F FLPL     GL   +  +F + S+R  +          D
Sbjct: 784  FLCEAIEGN---------AFCFLPLSVPATGLPVHVTANFAVMSNRSGIWAGASSAMASD 834

Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDL 1515
                WN+ L++   A      +++C+L
Sbjct: 835  EREYWNKQLMTTIIA------QAYCNL 855


>gi|254390316|ref|ZP_05005534.1| hypothetical protein SSCG_02861 [Streptomyces clavuligerus ATCC
            27064]
 gi|294811274|ref|ZP_06769917.1| Hypothetical protein SCLAV_0440 [Streptomyces clavuligerus ATCC
            27064]
 gi|326439778|ref|ZP_08214512.1| hypothetical protein SclaA2_01880 [Streptomyces clavuligerus ATCC
            27064]
 gi|197704021|gb|EDY49833.1| hypothetical protein SSCG_02861 [Streptomyces clavuligerus ATCC
            27064]
 gi|294323873|gb|EFG05516.1| Hypothetical protein SCLAV_0440 [Streptomyces clavuligerus ATCC
            27064]
          Length = 1801

 Score = 62.0 bits (149), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 52/201 (25%), Positives = 89/201 (44%), Gaps = 27/201 (13%)

Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQE-----SGIVVLNNEQGFSAENIRALCDVGN 1261
              FLLEL+QN  D       + ++  +L E       + V N    F+  +I+A+C +  
Sbjct: 54   GRFLLELLQNGHDAHRGGRNDGSVHVLLDEDEGTHGTLYVANGGTPFTWASIQAVCKLAR 113

Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS-----------NGFHVKFDTSE------ 1304
            S K       IG KG+GF+S+  +TDAPE++S           +G+  +F   +      
Sbjct: 114  SEKTVGEG--IGNKGVGFRSILEITDAPEVYSASTAGEGLAVLDGYCFRFAVRDDLRVLL 171

Query: 1305 GQIGFV--LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVD 1362
            G    V      +PPF +      +     +L +    T IR+P R + S   A+  + +
Sbjct: 172  GDEDLVHKAEEELPPFQVPFPLSDVPPTCKKLAADGHVTVIRIPLRNEASRRAAVRRLDE 231

Query: 1363 MFSDLHPSLLLFLHRLQCIMF 1383
            + +   P ++LFL RL  ++ 
Sbjct: 232  LGAAKAP-VMLFLDRLNRLLI 251


>gi|449680062|ref|XP_002155856.2| PREDICTED: uncharacterized protein LOC100205182, partial [Hydra
            magnipapillata]
          Length = 1503

 Score = 62.0 bits (149), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/219 (20%), Positives = 99/219 (45%), Gaps = 34/219 (15%)

Query: 138  LERIDRAVVKARSELIMAGESVSSWKVSRSVLLKLQVDSWSSLG-------FQMQEVPSL 190
            L+ I++ V + R+++    E V   ++ + + +   V   + L         Q  ++ ++
Sbjct: 937  LDEINQKVSQIRNKICRRNEVVKVPEIVKELCVFYNVSCVADLCPFESKKIRQETDITAI 996

Query: 191  HKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYF 250
            + ++  + +INAFI  F+  R I +L++L + ICK E    F +L+LGP  +Q LI   F
Sbjct: 997  NDIIRLQGKINAFIQSFMTFRNICTLFELNQEICKMESKSSFLDLKLGPLQKQLLIHDNF 1056

Query: 251  LVKSDVDVVFKIATNDVIVCLSEYTDTHKAKD---------------------------I 283
                + + +  I T+D+   + +    +K K                             
Sbjct: 1057 KFPPNQNDIPHITTSDIFRLIWKMHIDYKKKQCTDEELSENSRTGLKSKKQPRERENVVF 1116

Query: 284  KVDEFLDFIAKKRSLASKEQLGVRIQNLGMHINFIREAR 322
             + EF++F+ K+       + GV+I ++ + ++ +++A+
Sbjct: 1117 SLSEFMEFLIKQHGFTQPYESGVKISSIALAVSVVKKAK 1155


>gi|254481776|ref|ZP_05095019.1| hypothetical protein GPB2148_1466 [marine gamma proteobacterium
            HTCC2148]
 gi|214037905|gb|EEB78569.1| hypothetical protein GPB2148_1466 [marine gamma proteobacterium
            HTCC2148]
          Length = 1032

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 83/196 (42%), Gaps = 42/196 (21%)

Query: 1194 ALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQE-SG---IVVLNNEQGFS 1249
             L  L  ELY   +HF+ EL+QNA+D           T++  E SG       N  + F+
Sbjct: 31   GLRKLMAELYPDQAHFVYELLQNAED--------AKATYVRFELSGQELRFAHNGARQFN 82

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
             +++ ++  +G STKK      IG+ G+GFK+VF   D PEIHS           G   F
Sbjct: 83   HKDVESITGIGTSTKK-DDVNKIGKFGVGFKAVFSYCDTPEIHS-----------GDYSF 130

Query: 1310 VLPTLVPPFNIDMFCRLLSKD----PVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFS 1365
             +  L  P  I    R  S+     P+    K    C                 I+ +FS
Sbjct: 131  KIEHLFCPIQITPVSRKDSETLFVFPLGSSEKTPEQC--------------SEEILKVFS 176

Query: 1366 DLHPSLLLFLHRLQCI 1381
             L  S +LFL+ ++ I
Sbjct: 177  RLDHSAVLFLNSIKSI 192


>gi|336399055|ref|ZP_08579855.1| hypothetical protein Premu_2030 [Prevotella multisaccharivorax DSM
            17128]
 gi|336068791|gb|EGN57425.1| hypothetical protein Premu_2030 [Prevotella multisaccharivorax DSM
            17128]
          Length = 2327

 Score = 61.6 bits (148), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/130 (31%), Positives = 72/130 (55%), Gaps = 7/130 (5%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTLTFILQESGIVVLNNEQGFSAENI 1253
            L  LS++L +++S F+ EL+QNA+D  YP E     + F + ++ ++ +++   F+  NI
Sbjct: 250  LDTLSKQLSAKESTFVYELLQNAND--YPVEGKMVDVEFHITDNYLLFMHSGDKFNVRNI 307

Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQIGFV 1310
              +C + N  +K ++   IG KGIGFK+VF       + +  +  +FD   T E ++G  
Sbjct: 308  SGICGI-NEKEKIANKHTIGYKGIGFKTVFLHNHYVYLQTGDYSFRFDEGETPEKKVGGK 366

Query: 1311 LPTLVPPFNI 1320
            +  L  PF I
Sbjct: 367  IKRLGAPFQI 376


>gi|340373207|ref|XP_003385133.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 4326

 Score = 61.6 bits (148), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 162/387 (41%), Gaps = 75/387 (19%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + +E Y      + E++QNADD        ++      +   +L+    
Sbjct: 1358 QHEPLTTRLSNILRE-YKDGLTIIKEMMQNADDAGATEVNILHDMRTHSSRNLVLEGMAE 1416

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK   S   IG+ G+GF SV+ +TD P   S 
Sbjct: 1417 SHGPALIVHNNSTFTKEDFENITKLAAATKANQSLK-IGKFGVGFCSVYHITDVPSFVSG 1475

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQ-----LESKCWN-TCI 1343
             +   FD +   + G V     P   I+   + L+K     P Q       S  +N T  
Sbjct: 1476 DWLYIFDPTLKHLKGVVQNENRPGKKINFKSKFLAKSHQLAPYQDLFGFDPSSTYNGTIF 1535

Query: 1344 RLPFRTKFSE---GIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            RLPFRT  S+    I  ++++ M  +DL  +   LLLFL  ++ I F +    S VV + 
Sbjct: 1536 RLPFRTHPSQICPTIYNDHLIQMIKNDLESNGSKLLLFLQNVKRITFSSQQEGSQVVEQL 1595

Query: 1397 KI--VGDGIIKV-SCGEDKMT-----WFVASQKL----RAGVIRPDVKTTEIAL----AL 1440
             I    D I  V  C     T     W +++QK     + G+ +P V +    L    +L
Sbjct: 1596 SIECSKDDIKNVKKCITKSNTTIIEYWLLSNQKQQLKSQEGICKPAVASVACKLVKEQSL 1655

Query: 1441 TLQESNEGNYGPLLYQQPVFAFLPLRT--YGLKFILQGDFVLPSSREEV----------D 1488
             L E+ EGN          F FLPL     GL   +  +F + ++R  +          +
Sbjct: 1656 FLPEALEGN---------AFCFLPLSIPDTGLPVHVNANFAVMNNRSGIWTGVSSAIASE 1706

Query: 1489 GNSPWNQWLLSEFPALFVSAERSFCDL 1515
                WN+ L++       +  +++C+L
Sbjct: 1707 EREYWNKQLMTS------AIPQAYCNL 1727


>gi|449544899|gb|EMD35871.1| hypothetical protein CERSUDRAFT_138618, partial [Ceriporiopsis
            subvermispora B]
          Length = 2647

 Score = 61.6 bits (148), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 98/206 (47%), Gaps = 32/206 (15%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y      L E++QN+DD        +      PT   I      LQES ++  N+   F+
Sbjct: 22   YPYSVGLLREILQNSDDAKATKQIFVLDHRTYPTEHLIHEGLADLQESALLAFNDSV-FT 80

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG----FHVKFDTSEG 1305
             ++ +AL ++  S+K+  ++  IG+ GIGF++++ VTD P+I S+     F  + D SEG
Sbjct: 81   EQDWKALQNISQSSKRLDTSK-IGKYGIGFRALYHVTDNPQILSHDSLVVFDPQHDYSEG 139

Query: 1306 --QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTK----FSEGIAMNN 1359
              +IG        P  +  F   L K  V   S    T IRLP R       SE  + ++
Sbjct: 140  GLRIGLAECQQEYPDQLSPFTHFLHKGGV---STLEGTAIRLPLRRTESRISSEVHSTDD 196

Query: 1360 IVDMFSD--LHP--SLLLFLHRLQCI 1381
            +  +  D  +H    ++LFL+ L+C+
Sbjct: 197  LRRLLDDFVVHELQDVMLFLNHLKCV 222


>gi|338534009|ref|YP_004667343.1| hypothetical protein LILAB_21825 [Myxococcus fulvus HW-1]
 gi|337260105|gb|AEI66265.1| hypothetical protein LILAB_21825 [Myxococcus fulvus HW-1]
          Length = 1809

 Score = 61.2 bits (147), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 60/108 (55%), Gaps = 7/108 (6%)

Query: 1186 KQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV-EPTLTFILQESGIVVLNN 1244
            ++H  +  A   + Q+ + +   FL+EL+QNA+D      V + T+  +  E  + V N 
Sbjct: 30   REHREIAHAAALIRQDYHDR---FLVELLQNANDQALRGEVHDSTVVVVRSERLLAVSNG 86

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
             Q  + +N+  L  + +S K G     +G KG+GFK+V++VT APEI+
Sbjct: 87   GQVLTPQNLERLSSLADSDKTGM---LVGNKGVGFKAVYQVTGAPEIY 131


>gi|373459999|ref|ZP_09551764.1| hypothetical protein HMPREF9944_00028 [Prevotella maculosa OT 289]
 gi|371957325|gb|EHO75091.1| hypothetical protein HMPREF9944_00028 [Prevotella maculosa OT 289]
          Length = 2449

 Score = 61.2 bits (147), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 72/139 (51%), Gaps = 7/139 (5%)

Query: 1186 KQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTLTFILQESGIVVLNN 1244
            K        L  LS++L +++S F+ EL+QNA+D  YP E     + F + ++ ++ L++
Sbjct: 239  KDFGSAANFLDTLSKQLSAKESTFVYELLQNAND--YPVEGQMVDVEFHITDNYLLFLHS 296

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD--- 1301
               F+  NI  +C + N  +K ++   IG KGIGFK+VF       I +  +  +FD   
Sbjct: 297  GDRFNVRNISGICGI-NEKEKVANKKTIGYKGIGFKTVFLNNHYVYIRTGNYSFRFDQGE 355

Query: 1302 TSEGQIGFVLPTLVPPFNI 1320
            T E ++G  +     PF I
Sbjct: 356  TPEKKLGGKIKRQDAPFQI 374


>gi|322710458|gb|EFZ02032.1| Heterokaryon incompatibility protein, putative [Metarhizium
            anisopliae ARSEF 23]
          Length = 1549

 Score = 61.2 bits (147), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 84/184 (45%), Gaps = 40/184 (21%)

Query: 1349 TKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSC 1408
            T FS   A+ N V+    + P LLLFL +++ I  R   + S          DG+  +S 
Sbjct: 32   TDFS--TALQNDVEA---VDPGLLLFLRQIERIHLRLFTSSS----------DGVPAISK 76

Query: 1409 GEDKMTW-----FVASQKLRAGVIR------------------PDVKTTEIALALTLQES 1445
               ++ W     FV+ +   A   R                    +  T+I LA  ++ +
Sbjct: 77   RFHRVNWTPNSGFVSLKDEDADTRRNLYKHLFTIDCDGTETQRRGITNTDIVLAFPVK-N 135

Query: 1446 NEGNYGPLLYQQP-VFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPAL 1504
              G Y PL+ +   VFA+LPL  +G +F++Q DF+  S+R+ VD +S WN+ +    P  
Sbjct: 136  KSGRYVPLIQKTNFVFAYLPLGNFGFRFVIQADFLTTSNRQSVDEDSSWNKNIAHAIPRA 195

Query: 1505 FVSA 1508
            F +A
Sbjct: 196  FEAA 199


>gi|340384255|ref|XP_003390629.1| PREDICTED: sacsin-like, partial [Amphimedon queenslandica]
          Length = 1922

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 146/371 (39%), Gaps = 69/371 (18%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + +E Y      + E++QNADD        ++     P+   +L+    
Sbjct: 278  QHKPLTTRLSNILRE-YKDGLTIIKEMIQNADDAGATEVNILHDTRTHPSHNLVLEGMAE 336

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ G+GF SV+ +TD P   S 
Sbjct: 337  SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 395

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQL-----ESKCWN-TCI 1343
             +   FD +   + G V     P   +      L+K     P Q       S  +N T  
Sbjct: 396  EWLYIFDPTLKHLKGVVQNESRPGKKLKFQSTFLAKSQQLAPYQGLFDFDSSSSYNGTIF 455

Query: 1344 RLPFRT---KFSEGIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            RLPFRT   + S  I  N+++ M   DL  +   LLLFL  ++ I F +      ++   
Sbjct: 456  RLPFRTHSSQISSTIYNNHLIHMIKKDLESNGSKLLLFLQNVKRITFSSRREGKPIIEEL 515

Query: 1397 KIVGDGI----IKVSCGEDKMT----WFVASQKLRA----GVIRPDVKTTEIALALT--- 1441
             I         +KV   +   T    W +++Q+ R     G  +  V +    L      
Sbjct: 516  SIECSNFDTENVKVCVTQSNATSTEHWLLSNQEQRLKSQDGAYKQAVASVACKLVKKQFS 575

Query: 1442 -LQESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------D 1488
             L E+ EGN          F FLPL     GL   +  +F + S+R  +          D
Sbjct: 576  FLCEAIEGN---------AFCFLPLSVPATGLPVHVTANFAVMSNRSGIWTGASSAMASD 626

Query: 1489 GNSPWNQWLLS 1499
                WNQ L++
Sbjct: 627  EREYWNQQLMT 637


>gi|343476855|emb|CCD12172.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1661

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 71/161 (44%), Gaps = 5/161 (3%)

Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
           L F ++++P L K+   + RI A +   V  R + ++Y+LE  +C +E I  + EL LG 
Sbjct: 76  LPFSIRQIPHLAKVAQRQVRITAMVQSAVATRPVVTIYELELEVCSNENISTYAELGLGD 135

Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVI--VCLSEYTDTHKAKDIKVDEFLDFIAK-- 294
            L   P + H F + S    +  + +ND +  +    Y            + +   A+  
Sbjct: 136 SLAMLPFVQHAFKLPSSGTEIAPVTSNDFLQFLLFDAYAKIVLTSGGDAGDAVRAFARSY 195

Query: 295 KRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKE 335
           K    +  QLG+ IQ+    + F+R+        L + + E
Sbjct: 196 KGDRYNPAQLGIHIQHFSWLLQFLRQEIARTSALLSQAVTE 236


>gi|305663140|ref|YP_003859428.1| hypothetical protein [Ignisphaera aggregans DSM 17230]
 gi|304377709|gb|ADM27548.1| hypothetical protein Igag_0718 [Ignisphaera aggregans DSM 17230]
          Length = 948

 Score = 59.7 bits (143), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 78/349 (22%), Positives = 146/349 (41%), Gaps = 62/349 (17%)

Query: 1206 DSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
            +  F+LE +QNA D    +NV+  L +      I+V N+   F+ ++  + C +  S +K
Sbjct: 43   ERQFILEYLQNAIDA-GSKNVKIVLDY--DNVRILVYNDGTEFNHDDFESFCKIARS-RK 98

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
              +   IG  GIG KS F +    E+HS  +HV F+ S      +   L   +N+     
Sbjct: 99   DPNKMLIGYIGIGAKSAFAIARKLELHSGEYHVTFEESN-----IPEELSKNYNLKYLWM 153

Query: 1326 LLSKDPVQLESKCWNTCIRL-----PFRTKF----------SEGIAMNNIVD----MFSD 1366
            ++ +     + +C+++  R       FRT F          +  I    ++D     F D
Sbjct: 154  IVPRT----KDECYSSSCRYLMNTASFRTVFVLENLTKDKETLEIIHRTLLDYNYEYFLD 209

Query: 1367 LHPSLLLFLH--RLQCIMFRNMLNDSLVVIRK---KIVGDGI-------IK----VSCGE 1410
                L + +H  +++  +F+N +   L  I K   K +  GI       +K    +S  E
Sbjct: 210  ARTLLFIPVHNIKVEICIFKNGVEKCLRRIEKFHEKEIQSGIPHLRKVVVKLREILSDRE 269

Query: 1411 DKMTWFVASQKLR-----------AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
            +   W +  +++            A   R +V    + +A    E+++      + +  +
Sbjct: 270  ETEDWLIILKEVEVPENIRRDPITAIFKRENVVRRLVGIAFK-TENDKLIPTKGVVKFSI 328

Query: 1460 FAFLPLRTY--GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFV 1506
            F+++P+R    G  ++L  DF+    R  +  N  WN+WL +E     V
Sbjct: 329  FSYMPIREIESGFNYLLHADFLTDPGRGGIIENIAWNEWLRNELVKFLV 377


>gi|53805067|ref|YP_113238.1| hypothetical protein MCA0731 [Methylococcus capsulatus str. Bath]
 gi|53758828|gb|AAU93119.1| hypothetical protein MCA0731 [Methylococcus capsulatus str. Bath]
          Length = 583

 Score = 59.3 bits (142), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 87/332 (26%), Positives = 130/332 (39%), Gaps = 70/332 (21%)

Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNN----------------EQGFSAENIR 1254
            LEL+QNADD    E V     F + ++G++V N+                 QG    +  
Sbjct: 36   LELIQNADDAKAEEVV-----FDITDNGLLVRNSGQFTYCGDLNKPCAFLSQGNYKCDYH 90

Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
             + DVG S  K S    IGR GIGF S ++VTD PEI S+G  +      GQ  + +   
Sbjct: 91   RITDVG-SGGKLSRGENIGRFGIGFVSTYQVTDHPEIRSSGIKLALYPERGQ--WFIEAF 147

Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
              P     F    S DP        NT  RL           ++ + + F  +    LLF
Sbjct: 148  EEPDGTSFFLPWAS-DP--------NTEARLALGISHVSPTHIDQLAEDFQAVLRKSLLF 198

Query: 1375 LHR-------------LQCIMFRNMLNDSLVVIRKKIVGD--GIIKVSCGEDKMTWFVAS 1419
            L               L C + R+  +D +V  R     +   I++    E       A+
Sbjct: 199  LRHVRKAEIRRNGQLLLACDLDRSEGSDLIVSFRPSGAVEQWHILRADAAE-------AA 251

Query: 1420 QKLRAGVIRPDV--KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQG 1476
            Q+L A   R +   ++T I++ L +           L +  ++AFLP  ++ GL   L  
Sbjct: 252  QRLYATHPRLETLHRSTNISIGLRIDPEP-------LTEGLLYAFLPTEQSTGLPLHLNA 304

Query: 1477 DFVLPSSREEV-----DGNSPWNQWLLSEFPA 1503
            DF   S R+ V          WN+ L+    A
Sbjct: 305  DFFPESDRKAVIFAGHQHEQAWNEMLIDAAAA 336


>gi|407859405|gb|EKG07006.1| hypothetical protein TCSYLVIO_001871 [Trypanosoma cruzi]
          Length = 1541

 Score = 58.9 bits (141), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 75/177 (42%), Gaps = 20/177 (11%)

Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
           L F +++VPSL    V ++RI A I   V  R + +LY+LE  +C +E +E + EL LG 
Sbjct: 58  LPFSLKQVPSLASTAVLQSRITAMIQSAVATRPVVTLYELEAEVCLTENVEMYAELGLGC 117

Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYT-----DTHKAKDIK-----VDEF 288
            L   P + H F V         + TN   V  +E+      DT+    +       D  
Sbjct: 118 SLAALPCVRHLFGV--------SVGTNTAPVTSTEFMHFLLFDTNAQALLSGGGDAGDAV 169

Query: 289 LDFI-AKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRK 344
             F    K    +  QLG+ IQ+    + F+R+            L E    +KR K
Sbjct: 170 RAFACCYKGGKYTSMQLGIHIQHFPWLLRFVRQEVGRTSTFFSDLLNENAWCYKRNK 226


>gi|71665269|ref|XP_819606.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70884915|gb|EAN97755.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1541

 Score = 58.9 bits (141), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 75/177 (42%), Gaps = 20/177 (11%)

Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
           L F +++VPSL    V ++RI A I   V  R + +LY+LE  +C +E +E + EL LG 
Sbjct: 58  LPFSLKQVPSLASTAVLQSRITAMIQSAVATRPVVTLYELEAEVCLTENVEMYAELGLGC 117

Query: 240 FLRQ-PLISHYFLVKSDVDVVFKIATNDVIVCLSEYT-----DTHKAKDIK-----VDEF 288
            L   P + H F V         + TN   V  +E+      DT+    +       D  
Sbjct: 118 SLAALPCVRHLFGV--------SVGTNTAPVTSTEFMHFLLFDTNAQALLSGGGDAGDAV 169

Query: 289 LDFI-AKKRSLASKEQLGVRIQNLGMHINFIREARKSQDVTLKKFLKELQPHHKRRK 344
             F    K    +  QLG+ IQ+    + F+R+            L E    +KR K
Sbjct: 170 RAFACCYKGGKYTSMQLGIHIQHFPWLLRFVRQEVGRTSTFFSDLLNENAWCYKRNK 226


>gi|301105279|ref|XP_002901723.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099061|gb|EEY57113.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 4807

 Score = 58.9 bits (141), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 94/213 (44%), Gaps = 39/213 (18%)

Query: 1203 YSQDSHFLLELVQNADD------NI---YPENVEPTLTF----ILQESGIVVLNNEQGFS 1249
            Y + +    E+VQNADD      N+   Y ++    L +      Q   ++V NN   FS
Sbjct: 29   YPEGTSIFKEMVQNADDAGATDVNLCLDYRQHAATGLAYEKLASFQGPSLLVHNNAT-FS 87

Query: 1250 AENIRALCDVGNSTKKGSSAGY-IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
            A + +++  +G+S KK +S G+  GR G+GF SV+ VTD P   S    V FD     + 
Sbjct: 88   AADFQSIQRIGDSLKKDNSKGWKTGRFGVGFNSVYHVTDLPTFVSGSQLVFFDPQACHLP 147

Query: 1309 FVLPTLVPPFNIDMFCR--LLSKDPVQLES-KCW---------NTCIRLPFRT------- 1349
             V P+  P   ID      L+   P Q++  +C+          T  RL  RT       
Sbjct: 148  NVNPS-NPGKMIDYIAHPDLVDNFPAQIDPFRCFGCNFSESFAGTIFRLALRTEEQAQQS 206

Query: 1350 ----KFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
                +      M  ++  F+D  P++LLFL  +
Sbjct: 207  KLSNRAQTPAKMAEMLKEFADSLPTVLLFLRNV 239


>gi|260803431|ref|XP_002596593.1| hypothetical protein BRAFLDRAFT_78509 [Branchiostoma floridae]
 gi|229281852|gb|EEN52605.1| hypothetical protein BRAFLDRAFT_78509 [Branchiostoma floridae]
          Length = 282

 Score = 58.9 bits (141), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 62/268 (23%), Positives = 114/268 (42%), Gaps = 30/268 (11%)

Query: 1524 KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQ--WAPPCKVLRGWND 1581
            +A   ++ FVP+  E+  FF  +   I+  ++    + ++  + Q  W  P + +   + 
Sbjct: 25   EAVCRFLQFVPMPDEILDFFKPVAMQIVQHIKAQAFMPVQAEDGQISWKLPSETILSHDP 84

Query: 1582 RAHSLLPDILLQKHLGLGFLSKDIVLS----DSLARALGIEEYGPKILLQIISSLCRT-- 1635
                ++   LL   L L ++  D+        +L+  LGI++     L+++  ++  T  
Sbjct: 85   LVKDVIDSQLLSNTLNLFYIHGDVAAVLERYPALSTVLGIQKLSTNHLIEVGKAMATTFM 144

Query: 1636 ---ENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSDGTF 1692
               E G     ++ +A WL        H + +    S  E  ++  LQ I  +PLS G+F
Sbjct: 145  QQPEAGESLFMVAKVAKWLA-----CVHRAQEQEYNSSDE--MVKQLQDICILPLSCGSF 197

Query: 1693 SSVDEGTIWLHSDCSVFDGGFGLEAFPNLCAKLRTVSPALLSASAVDKSSLGVISVDNLN 1752
            +S +  TI+         G   +     L   L TV P LLS        L  +   +  
Sbjct: 198  ASANGPTIFFPLSEGQVKGVTSV-----LARDLNTVHPGLLSL-------LDDVGNSHAR 245

Query: 1753 RMLLKIGVQQLSAHDIVKVHILPAISDE 1780
             ML  +GV+++S  D++  HILP +  E
Sbjct: 246  LMLNTLGVKKISPIDVINHHILPILESE 273


>gi|407424690|gb|EKF39095.1| hypothetical protein MOQ_000682 [Trypanosoma cruzi marinkellei]
          Length = 1536

 Score = 58.9 bits (141), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 69/153 (45%), Gaps = 16/153 (10%)

Query: 178 SSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELEL 237
           + L F +++VPSL  + V ++RI A I   V  R + +LY+LE  +C +E +E + EL L
Sbjct: 56  NGLPFSLKQVPSLASIAVLQSRITAMIQSAVATRPVVTLYELEAEVCLTENVEMYAELGL 115

Query: 238 GPFLRQ-PLISHYFLVKSDVDVV---------FKIATNDVIVCLSEYTDTHKAKDIKVDE 287
           G  L   P + H F V    ++          F +   +    LS   D   A    V  
Sbjct: 116 GCSLAALPCVRHLFGVGVGTNIAPVTSTEFMHFLLFDTNAQALLSGGGDAGDA----VRA 171

Query: 288 FLDFIAKKRSLASKEQLGVRIQNLGMHINFIRE 320
           F      K    +  QLG+ IQ+    + F+R+
Sbjct: 172 FA--CCYKGGRYTSMQLGIHIQHFPWLLRFVRQ 202


>gi|379736889|ref|YP_005330395.1| hypothetical protein BLASA_3503 [Blastococcus saxobsidens DD2]
 gi|378784696|emb|CCG04365.1| conserved protein of unknown function [Blastococcus saxobsidens DD2]
          Length = 1569

 Score = 58.9 bits (141), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 53/189 (28%), Positives = 83/189 (43%), Gaps = 24/189 (12%)

Query: 1210 LLELVQNADDNIYPENV---EPTLTFIL--QESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
            LLELVQNA D +        E  +  +L  + + +   N+ + FS   I A+     S K
Sbjct: 58   LLELVQNAADALAGATDLADEARVEIVLDPRTATLYCANSGRPFSKSGITAVTMAYLSGK 117

Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF--------VLPTLVP 1316
            +G     IGR G+GFKSV  VTDAP++ S     +F+++E +            LP L  
Sbjct: 118  RGDE---IGRFGLGFKSVLAVTDAPQVFSRSVSFEFNSAEAKAALRRIAPAARRLPVLRT 174

Query: 1317 PFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFL 1375
               ID       +DPV  +   W +T ++LP  T       +  +     +     LLF+
Sbjct: 175  ATLIDAAAE-FDRDPVLADLATWASTVVKLPRVTD------IQRLEKEIQEFRSEFLLFV 227

Query: 1376 HRLQCIMFR 1384
              ++ +  R
Sbjct: 228  SAVREVRLR 236


>gi|340384390|ref|XP_003390695.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 4028

 Score = 58.5 bits (140), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 91/376 (24%), Positives = 155/376 (41%), Gaps = 57/376 (15%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + +E Y      + E++QNADD        +Y +    T   + +    
Sbjct: 172  QHEPLTTRLGNILRE-YKDGLTIIKEMIQNADDAGATEVNILYDDRTHSTQKLLFKGMAE 230

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ G+GF SV+ +TD P   S 
Sbjct: 231  SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 289

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQ-----LESKCWN-TCI 1343
             +   FD +   + G V     P   +    + L++     P +       +  +N T  
Sbjct: 290  KWLYIFDPTLKYLKGVVRNESRPGKKVQYQSKFLAQSQQMAPYENLFGFKSTADYNGTIF 349

Query: 1344 RLPFRT---KFSEGIAMNNIVD-MFSDL---HPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            RLPFRT   + S  I   ++V  M +DL      LLLFL  ++ I F +M  D  V I  
Sbjct: 350  RLPFRTSASQISSTIYNKHLVQKMKNDLIENGSKLLLFLQNVKRITFHSMQEDE-VCISS 408

Query: 1397 KIVGDGI---IKVSCGEDKMT--WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYG 1451
                +GI   I  S   + ++  W V+ Q+ +      D +    ++A  L + N  ++ 
Sbjct: 409  INEENGIKYCITNSPANNSVSEYWLVSCQEQKLQAQGGDYQPGTASIACQLVKDN-SSFE 467

Query: 1452 PLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSR----------EEVDGNSPWNQWLLS 1499
                +  VF FLPL     GL   +  +F + S+R          E  D    WN+ L++
Sbjct: 468  CKAIEGSVFCFLPLSLPCTGLPVHVNANFAVMSNRSGIWTGASSEEPSDSREYWNKKLMT 527

Query: 1500 EFPALFVSAERSFCDL 1515
                      +++C+L
Sbjct: 528  TI------IPKAYCNL 537


>gi|340375790|ref|XP_003386417.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 4207

 Score = 58.2 bits (139), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 88/389 (22%), Positives = 148/389 (38%), Gaps = 78/389 (20%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  +   L  + +E Y      + E++QNADD        +Y      T   I +    
Sbjct: 1372 QHEPITTRLRNILKE-YKDGLTIIKEMIQNADDAGATEVNILYDNRTHSTQNLIFKGMAD 1430

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ GIGF SV+ +TD P   S 
Sbjct: 1431 SHGPALIVHNNGSFTKEDFENITKLAGATKANQPLK-IGKFGIGFCSVYHITDVPSFVSG 1489

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKDP----------VQLESKCWNTCI 1343
             +   FD +   + G V+    P   +    ++ +K             +  S    T  
Sbjct: 1490 EWLYIFDPTLKYLKGIVINESRPGKKMKYQSKIFAKSQDLTPYEGLFGFKSSSDYEGTMF 1549

Query: 1344 RLPFRT---KFSEGIAMNNIV-DMFSDL---HPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            R PFRT   + S  I   ++V D+  DL      LLLFL  ++ I F ++        +K
Sbjct: 1550 RFPFRTSASQISSTIYNEHLVEDIKRDLIECGSKLLLFLQHVKQITFSSLKG------KK 1603

Query: 1397 KIVGDGIIKVSCGEDKM---------------TWFVASQK---LRAGVIRPDVKTTEIAL 1438
            KI+   I   +   D +                W +ASQ+       +I+P V +    +
Sbjct: 1604 KIIEVSIDASNSSNDDIHKVVIKSPHNKSSTEHWLIASQEEQWRHQNIIKPAVAS----I 1659

Query: 1439 ALTLQESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV--------- 1487
            A      N   Y     +   F FLPL   + GL   +  +F + S+R  +         
Sbjct: 1660 ACQFDRINSSCYKCKEIEGNAFCFLPLALPSTGLPVHVSANFAVMSNRSGIWTEASSGIV 1719

Query: 1488 -DGNSPWNQWLLSEFPALFVSAERSFCDL 1515
             D    WN+ L+       ++   ++C L
Sbjct: 1720 TDSREQWNKNLME------ITIPEAYCKL 1742



 Score = 42.4 bits (98), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 17/123 (13%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNI------------YPENVEPTLTF-- 1232
            Q   L R +  + QE Y  D+ FL EL+QNADD              + +   P+  +  
Sbjct: 2327 QKEELTRRIRNILQE-YPLDATFLKELLQNADDTKATKIRVILDKRQHKKKKVPSEEWGK 2385

Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
             LQ   ++V N+++ FS  ++  +  +G  +K+      IG+ GIGF  V+ +TD P   
Sbjct: 2386 ELQGPALLVWNDKE-FSDADLEGIQKLGLGSKRDDDES-IGQFGIGFNVVYHLTDCPSFI 2443

Query: 1293 SNG 1295
            + G
Sbjct: 2444 TRG 2446


>gi|423229230|ref|ZP_17215635.1| hypothetical protein HMPREF1063_01455 [Bacteroides dorei CL02T00C15]
 gi|423245069|ref|ZP_17226143.1| hypothetical protein HMPREF1064_02349 [Bacteroides dorei CL02T12C06]
 gi|392634508|gb|EIY28428.1| hypothetical protein HMPREF1063_01455 [Bacteroides dorei CL02T00C15]
 gi|392640534|gb|EIY34334.1| hypothetical protein HMPREF1064_02349 [Bacteroides dorei CL02T12C06]
          Length = 1804

 Score = 58.2 bits (139), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 68/128 (53%), Gaps = 13/128 (10%)

Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPE------NVEP---TLTFILQESGI 1239
            ++ R+L  L ++L  S    F+ EL+QNA+D  YP        ++P    + F + E+ +
Sbjct: 223  KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRTKINGKIQPLPVNVEFHITENYL 280

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
               +  + F+ +NI A+CD+ +  K  ++   IG KGIGFK+VF   D   +++  +  +
Sbjct: 281  TFEHTGEYFNPKNIAAICDINDGEKSDNTEA-IGYKGIGFKTVFLDNDYVLLNTGNYTFR 339

Query: 1300 FDTSEGQI 1307
            FD S   +
Sbjct: 340  FDKSATDV 347


>gi|85859233|ref|YP_461435.1| cytoplasmic protein [Syntrophus aciditrophicus SB]
 gi|85722324|gb|ABC77267.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
          Length = 1210

 Score = 58.2 bits (139), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 60/232 (25%), Positives = 101/232 (43%), Gaps = 32/232 (13%)

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI-----GFVLPTLVPPFNIDMFCRL 1326
            IG KG+GFKSV  VT  PEIHS  FH+ F   +  +     GF      P F I      
Sbjct: 139  IGAKGLGFKSVMEVTTEPEIHSGPFHLHFSLDKTLVLLKKHGFSGIKQPPVFRIP----- 193

Query: 1327 LSKDP---VQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMF 1383
              K+P   ++   +C+ T +RLPF     E +  +       +L   +LLF   ++    
Sbjct: 194  FKKEPDEGIKELLECYATVLRLPFANHKREQVCQS-----LRELDAHVLLFCQHIE--EL 246

Query: 1384 RNMLNDSLVVI------RKKIVGDGIIKVSCGEDKMTWFVASQKLRAGV---IRPDVKTT 1434
            R +L+++L  I        + + D  I++    +    F +++  R      ++   K  
Sbjct: 247  RIILDENLKRIWTAKRETDRPLTDCRIRIMFENEPEADFSSTEYQRWAKTWNVKGQSKLH 306

Query: 1435 EIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLPSSRE 1485
             +++ L L E+ +    P+ YQQP+  F P       + I+   F L  +R+
Sbjct: 307  SVSICLPLDENRKPI--PMKYQQPLHVFFPTEEELPFRSIIHSSFDLSQNRK 356


>gi|336405902|ref|ZP_08586569.1| hypothetical protein HMPREF0127_03882 [Bacteroides sp. 1_1_30]
 gi|335936369|gb|EGM98300.1| hypothetical protein HMPREF0127_03882 [Bacteroides sp. 1_1_30]
          Length = 2302

 Score = 57.8 bits (138), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 68/128 (53%), Gaps = 13/128 (10%)

Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPE------NVEP---TLTFILQESGI 1239
            ++ R+L  L ++L  S    F+ EL+QNA+D  YP        ++P    + F + E+ +
Sbjct: 223  KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRTKINGKIQPLPVNVEFHITENYL 280

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
               +  + F+ +NI A+CD+ +  K  ++   IG KGIGFK+VF   D   +++  +  +
Sbjct: 281  TFEHTGEYFNPKNIAAICDINDGEKSDNTEA-IGYKGIGFKTVFLDNDYVLLNTGNYTFR 339

Query: 1300 FDTSEGQI 1307
            FD S   +
Sbjct: 340  FDKSATDV 347


>gi|449486139|ref|XP_002186744.2| PREDICTED: sacsin-like [Taeniopygia guttata]
          Length = 5089

 Score = 57.8 bits (138), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 156/396 (39%), Gaps = 101/396 (25%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGIVVLN-------NEQGF 1248
            Y ++     E+VQNA+D        +       T T  L + G+   +       N   F
Sbjct: 1306 YGEEGDLFTEMVQNAEDASATVCRFLLDLRSHRTATSGLLDPGMAACHGPALWAYNNALF 1365

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH-- 1297
            S ++ + +  +G +TK+G  AG IGR G+GF SV+RVTD P + S         NG H  
Sbjct: 1366 SEDDFQNITRIGAATKEGQ-AGRIGRFGLGFSSVYRVTDVPAVLSGETLLIFDPNGTHLS 1424

Query: 1298 ----------VKFD-TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
                      ++ D +S  +I  V      P+N    C L    P         +  RLP
Sbjct: 1425 KHIPRAGSPGIRLDFSSRPRILRVFAEQFQPYNGIFGCCLPEPGPFP------GSLFRLP 1478

Query: 1347 FRTK--------FSEGIAMNNIVDMFSDLHPS---LLLFLHRLQCIMFRNMLNDS----- 1390
            FRT+         SE      I  + +    S   LLLFL  ++ +    ML D+     
Sbjct: 1479 FRTEEEAVTSQICSEAFGAERIQSLGTAFLGSNRLLLLFLKNVRELSL-EMLPDTATSTE 1537

Query: 1391 -----LVVIRKKIVGDG----------IIKVSCGED--KMTWF--------------VAS 1419
                  ++ RK+I G G          I ++S  E+  +  W               +  
Sbjct: 1538 DTMPLAMLQRKEIRGLGSPGDPPSWAAIEQLSACEEESRTVWHYLVLVCQGDGELLELFH 1597

Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQP--VFAFLPLRTY-GLKFILQG 1476
            Q  +AG+  P        +AL L  + +G + P L  +   VF  LP+    GL   + G
Sbjct: 1598 QNTQAGLHPP---LPMAGVALPLAPAKDGKWVPRLDAEKGQVFCHLPMPVISGLPIHVHG 1654

Query: 1477 DFVLPSSRE---EVDGNSPWNQWLL-SEFPALFVSA 1508
             F + S+R+   +      WN+ LL +  PA ++ A
Sbjct: 1655 AFSILSNRKGLWDTAERGKWNRVLLHNAVPAAWLRA 1690


>gi|423347365|ref|ZP_17325052.1| hypothetical protein HMPREF1060_02724 [Parabacteroides merdae
            CL03T12C32]
 gi|409217824|gb|EKN10798.1| hypothetical protein HMPREF1060_02724 [Parabacteroides merdae
            CL03T12C32]
          Length = 2302

 Score = 57.8 bits (138), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 68/128 (53%), Gaps = 13/128 (10%)

Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPE------NVEP---TLTFILQESGI 1239
            ++ R+L  L ++L  S    F+ EL+QNA+D  YP        ++P    + F + E+ +
Sbjct: 223  KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRTKIDGKIQPLPVDVEFHITENYL 280

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
               +  + F+ +NI A+CD+ +  K  ++   IG KGIGFK+VF   D   +++  +  +
Sbjct: 281  TFEHTGEYFNPKNIAAICDINDGEKSDNTEA-IGYKGIGFKTVFLDNDYVLLNTGNYTFR 339

Query: 1300 FDTSEGQI 1307
            FD S   +
Sbjct: 340  FDKSATDV 347


>gi|85819302|gb|EAQ40461.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
          Length = 1500

 Score = 57.8 bits (138), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 78/335 (23%), Positives = 137/335 (40%), Gaps = 56/335 (16%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNN--EQGFSAENIRALCDVGNSTKKGS 1267
            LLEL+QN+DD    E        +   S  + +NN  +  FS +  R+L  + N + K S
Sbjct: 42   LLELLQNSDD----EGATQVAIKLDTASKTISINNNGDTSFSKKGYRSLF-IANLSSKTS 96

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF-----------------V 1310
               YIG KG+GF+S+   ++A EI SN   +++  +  +  +                 +
Sbjct: 97   KRKYIGNKGLGFRSIINWSNAIEIQSNDLSLQYSETNTKARYESLFTTEQRATIAKEENL 156

Query: 1311 LPTLVP-PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
              T++P PF     C  + K+  Q   + + T I + ++ K     A  +I      + P
Sbjct: 157  SSTVIPMPF----LCLPVIKEITQ---ENFKTSIVISYKEK-----AFKDIYKQVKSITP 204

Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGED----KMTWFVASQKLRAG 1425
              LLFL+ ++ I F    +   +   +          S  E       +W +  ++    
Sbjct: 205  ETLLFLNHIKQIDFIGFEDKPQIKCNRSAHIKSSTTFSPQEAIQFPNNSWSIFEEEDELP 264

Query: 1426 VIRPDVKTTE-----IALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFV 1479
            +   DV  TE     I +A+   E N     P LY     +F P        ++L   F 
Sbjct: 265  LEFQDVNKTEKEFYQIKIAV---EDNFEKTSPYLY-----SFFPTNIKLNQPYVLHATFD 316

Query: 1480 LPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCD 1514
            L ++R +++ +S  N+++L +  A      R F D
Sbjct: 317  LDATRNQLN-DSEKNRFILEKVVAFTTKVARYFTD 350


>gi|449486128|ref|XP_002186552.2| PREDICTED: sacsin-like [Taeniopygia guttata]
          Length = 3423

 Score = 57.8 bits (138), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 155/396 (39%), Gaps = 101/396 (25%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGIVVLN-------NEQGF 1248
            Y ++     E+VQNA+D        +       T T  L + G+   +       N   F
Sbjct: 200  YGEEGDLFTEMVQNAEDASATVCRFLLDLRSHRTATSGLLDPGMAACHGPALWAYNNALF 259

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH-- 1297
            S ++ + +  +G +TK+G  AG IGR G+GF SV+RVTD P + S         NG H  
Sbjct: 260  SEDDFQNITRIGAATKEGQ-AGRIGRFGLGFSSVYRVTDVPAVLSGETLLIFDPNGTHLS 318

Query: 1298 ----------VKFD-TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
                      ++ D +S  +I  V      P+N    C L    P         +  RLP
Sbjct: 319  KHIPRAGSPGIRLDFSSRPRILRVFAEQFQPYNGIFGCCLPEPGPFP------GSLFRLP 372

Query: 1347 FRTK--------FSEGIAMNNIVDMFSDLHPS---LLLFLHRLQCIMFRNMLNDS----- 1390
            FRT+         SE      I  + +    S   LLLFL  ++ +    ML D+     
Sbjct: 373  FRTEEEAVTSQICSEAFGAERIQSLGTAFLGSNRLLLLFLKNVRELSL-EMLPDTATSTE 431

Query: 1391 -----LVVIRKKIVGDG----------IIKVSCGED--KMTWF--------------VAS 1419
                  ++ RK+I G G          I ++S  E+  +  W               +  
Sbjct: 432  DTMPLAMLQRKEIRGLGSPGDPPSWAAIEQLSACEEESRTVWHYLVLVCQGDGELLELFH 491

Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQP--VFAFLPLRTY-GLKFILQG 1476
            Q  +AG+  P        +AL L  + +G + P L  +   VF  LP+    GL   + G
Sbjct: 492  QNTQAGLHPP---LPMAGVALPLAPAKDGKWVPRLDAEKGQVFCHLPMPVISGLPIHVHG 548

Query: 1477 DFVLPSSRE---EVDGNSPWNQWLL-SEFPALFVSA 1508
             F + S R+   +      WN+ LL +  PA ++ A
Sbjct: 549  AFSILSHRKGLWDTAERGKWNRVLLHNAVPAAWLRA 584


>gi|345851153|ref|ZP_08804135.1| helicase domain-containing protein [Streptomyces zinciresistens K42]
 gi|345637382|gb|EGX58907.1| helicase domain-containing protein [Streptomyces zinciresistens K42]
          Length = 1573

 Score = 57.4 bits (137), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 95/228 (41%), Gaps = 26/228 (11%)

Query: 1210 LLELVQNADDNI--YPENVEPT----LTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
            LLELVQNA D +    E+ E      +   L    +   N  + FS   +  L     S 
Sbjct: 56   LLELVQNAADAMSGAAEHEEGAGRVEIVLDLDRQTLYCANAGRPFSRSGLTTLAHAHLSG 115

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV--------LPTLV 1315
            K+G     IGR G+GFKSV  VTDAP++ S     +F++ + +            LP L 
Sbjct: 116  KRGDE---IGRFGLGFKSVLAVTDAPQVFSRSIAFEFNSPKAKAALAAIAPASKRLPVLR 172

Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFL 1375
                ID      ++DP+  E   W   I      +      + N+     +     LLF+
Sbjct: 173  TLTRIDADVE-FAEDPILAELAEWAVTI-----VRLRRASRLENLRKEIEEFRSEFLLFV 226

Query: 1376 HRLQCIMFRNMLNDSLVVIR--KKIVGDGIIKVSCGE-DKMTWFVASQ 1420
            + ++ +  R +  D+  V     + +GDG  ++   + D   W+V ++
Sbjct: 227  NSVREVRLRVVGADADFVTSHVSRDLGDGRFRIERPDGDHEEWYVENR 274


>gi|336322039|ref|YP_004602007.1| type III restriction protein res subunit [[Cellvibrio] gilvus ATCC
            13127]
 gi|336105620|gb|AEI13439.1| type III restriction protein res subunit [[Cellvibrio] gilvus ATCC
            13127]
          Length = 1547

 Score = 57.0 bits (136), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/222 (27%), Positives = 91/222 (40%), Gaps = 28/222 (12%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            L ELVQNA D     +    +   L    + V N+ + F+A+ + AL     S+K     
Sbjct: 52   LHELVQNAVDAA--RDGGGRIEVRLARDTLYVANDGEPFTAQGVEALMASDLSSKADDR- 108

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPP-FNIDMFCRL-- 1326
              IGR GIGFKSV  VT  P ++S      FD +  +       L  P + +    RL  
Sbjct: 109  --IGRFGIGFKSVLAVTAEPRVYSRSVSFAFDRAWSESTIRAAGLTAPRYPVMRVARLED 166

Query: 1327 ----LSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDL---HPSLLLFLHRL 1378
                  +DPV  E   W  T + LP         A+  +  +  DL      LLLF   +
Sbjct: 167  PRAAAERDPVLAELMSWATTVVVLP---------ALTQLGSLKRDLLAYRGELLLFSRHI 217

Query: 1379 QCIMFR-NMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVAS 1419
            Q +  R + + D  V   ++    G++ +   E   TW V S
Sbjct: 218  QDLRLRIDGMADQRVTANER--AGGLVDLVSEEGVSTWKVLS 257


>gi|301621342|ref|XP_002940013.1| PREDICTED: sacsin-like, partial [Xenopus (Silurana) tropicalis]
          Length = 4152

 Score = 57.0 bits (136), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 101/449 (22%), Positives = 167/449 (37%), Gaps = 116/449 (25%)

Query: 1203 YSQDSHFLLELVQNADDNIYP---------ENVEPTLTFILQESGI-----VVLNNEQGF 1248
            YS+      EL+QNADD             EN     + I  +  I     +   N + F
Sbjct: 1254 YSESVDIFKELIQNADDAEATVCAFLIDMRENATSRQSLIDPDMAICHGPALWSYNNRTF 1313

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD------- 1301
            +  + R +  VG +TK+ +    IG+ G+GF SV+ +TD P I S    + FD       
Sbjct: 1314 TDNDFRNITRVGAATKE-TDIKKIGKFGLGFNSVYLITDIPLIMSRSHVLIFDPNVNHIR 1372

Query: 1302 ---------------TSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
                            ++ +I  + P    PF+    C L  K P   +     T IRLP
Sbjct: 1373 KHIPNDCNPGIKLNLQTKPEILQIFPDQFQPFSSVFGCEL--KQPFNYDG----TLIRLP 1426

Query: 1347 FRTK------------FSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSL--- 1391
            FRT+            FS    +  ++  F +   +LL+FL  +Q +    +  D     
Sbjct: 1427 FRTEKEAKNSQICQEAFSNK-QIKTLIMRFEESTDTLLVFLRNVQVLTISFLGKDLCPGD 1485

Query: 1392 VVIRKKIVGDGIIKVSCGEDKM--TWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGN 1449
             ++R  +  D +++    +D +       SQ L   +   D+  + I + +T+Q++ + N
Sbjct: 1486 QLLRASVQKDNVLRQDVPQDIILQQQITTSQLLEIQMNDLDISVSNI-IKITVQQTTKKN 1544

Query: 1450 ----------------------------------------------YGPLL--YQQPVFA 1461
                                                          + P L  +   VF 
Sbjct: 1545 EKYYMVQSSLGIKASIHMFSQKKSFICPLPFAGVALPLKKHNGTEKWAPDLENFNGMVFC 1604

Query: 1462 FLPLR-TYGLKFILQGDFVLPSSREEVDGNS---PWNQWLLSEFPALFVSAERSFCDLPC 1517
            FLPL  + GL F L G F + S+R+ + G +    WNQ LL++  A  V+   +   L  
Sbjct: 1605 FLPLPISTGLPFHLNGCFSVMSNRKGLWGTTEKGEWNQKLLTD--AALVALITALSQLQI 1662

Query: 1518 FRENPAKAASVYMSFVPLVGEVHGFFSGL 1546
                       Y +F P V +V+  +  L
Sbjct: 1663 LNHQGDLQDYYYYTFWPDVTKVNTQYKEL 1691



 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 111/545 (20%), Positives = 200/545 (36%), Gaps = 136/545 (24%)

Query: 1238 GIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI------ 1291
             ++V NNE  FS  +   +   GNS K       +GR G+GF SV+ +TD P I      
Sbjct: 2    ALLVYNNE-VFSKNDWEGIQKPGNSIK-SKDPNTVGRFGLGFNSVYHITDYPAIFSGKNI 59

Query: 1292 ----------HSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQ-----LES 1336
                      H  G     + +   I  +    +P      FCR+L    +      L +
Sbjct: 60   GILDPQESVLHKGGLLWNLEKNMDSIKNLTDQFLP------FCRVLEDTGLSTWNEILNA 113

Query: 1337 KCWN-TCIRLPFRTK---FSEGI-AMNNIVDMFSDLHPSL---LLFLHRLQCIMFRNMLN 1388
             C+  T  R P R K    SE I +   + ++F          LLFL  +  +  + + +
Sbjct: 114  GCFQGTLFRFPLRVKPSEISENIYSSEKVEELFESFIKDAGISLLFLRHVNTVSLKRIGS 173

Query: 1389 DSLVV--IRKKIVGDGIIKVSCG------------------EDKMTWFVASQKLRAGVIR 1428
            D LV+  +   +    I     G                  E + TW V +      + +
Sbjct: 174  DGLVIHLLTVSVSTQTIASTKSGNITTESHLKMTSIKDFDEEKEQTWLVQTSTAHGSLFQ 233

Query: 1429 PDVKTTE---------IALALTLQESNEGNYGPLLYQQPVFAFLPL-----RTYGLKFIL 1474
              ++ +E         +A  L+ Q+++       L+   +   LPL        GL  ++
Sbjct: 234  DLIELSEKLCNNPSLDLAYPLSKQDTD-------LFAGRLSCVLPLPDKEENQTGLPVVI 286

Query: 1475 QGDFVLPSSR---------EEVDGNSPWNQWLLSE-FPALFVSAERSFCDLPCFRENPAK 1524
             G F L   R         ++ D  + WN  L+ +  P +++ A +              
Sbjct: 287  NGCFDLTDDRRSMKWLETDQQHDNAAKWNHILVHKLLPLVYICAVKD------------A 334

Query: 1525 AASVYMSFV-----------PLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPPC 1573
            AA V MS +           P   +    +  + + I ++L +   ++   +N+ W    
Sbjct: 335  AALVKMSKITTDMAYGIWPDPEKTKFKDKWYSITKQI-AQLLLCESVLQTADNSDWITAS 393

Query: 1574 KVLRGWNDRAHSL---LPDILLQKHLGLGFLSKDIV-LSDSLARALGIEEYGPKILLQII 1629
            K +  +N   H L   L ++L+       +L + +V + D + + L + E    +L  + 
Sbjct: 394  KAIFVYNIEDHDLQNCLEEVLI-------YLKQPLVKIPDHVQKTLKLAEAKTGMLNSVT 446

Query: 1630 SSLCRTENGLRSMGLSWLASWLNELYTISFHSSGQSSLQSGVETDLIDNLQRIPFIPLSD 1689
             S  R    L+S   + ++     L      S GQ            D+L  +  +PL++
Sbjct: 447  PSFIRM--VLKSGNWNTISKEQKMLLLRYIVSDGQ-----------YDDLLDLQILPLAN 493

Query: 1690 GTFSS 1694
            GTF+S
Sbjct: 494  GTFTS 498



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 58/215 (26%), Positives = 93/215 (43%), Gaps = 39/215 (18%)

Query: 1203 YSQDSHFLLELVQNADDNIYPE--------NVEPTLTF-----ILQESGIVVLNNEQGFS 1249
            Y      L EL+QNADD+   E          + T TF      L    + + NN Q F 
Sbjct: 2249 YLAQKDILKELIQNADDSEATEIHFVLDSRTHQATSTFGSEWNPLHGPALCIYNN-QKFD 2307

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFDTSEGQIG 1308
             ++I  +  +G   K+G  +  IGR G GF SV+ +TD P  +  +     FD +   + 
Sbjct: 2308 PKDIEGIQLLGKGGKRGQGSK-IGRFGTGFNSVYHITDCPSFLTGDAIMCVFDPN---LA 2363

Query: 1309 FVLPTLVPP----FNI-DMFCRLLSKDPVQLESKCWN----TCIRLPFR-------TKFS 1352
            F+  + V      F++ D F +             +N    T  RLP R       +K S
Sbjct: 2364 FLPSSTVGSPGGMFSVNDRFKKTFKDVYNTFLPSVFNLQEGTLFRLPLRMAETVAMSKIS 2423

Query: 1353 -EGIAMNNIVDMFSDLHP---SLLLFLHRLQCIMF 1383
             + +++ NI +M  +L     S++LFL+ ++ I F
Sbjct: 2424 NQPVSLENIRNMCRELEEDADSMILFLNNIKKITF 2458


>gi|340385242|ref|XP_003391119.1| PREDICTED: sacsin-like, partial [Amphimedon queenslandica]
          Length = 2392

 Score = 57.0 bits (136), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 86/366 (23%), Positives = 149/366 (40%), Gaps = 55/366 (15%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  +   L  + +E Y      + E++QNADD        +Y      T   IL+    
Sbjct: 339  QHEPITTRLRNILKE-YKDGLTIIKEMIQNADDAGATEVNILYDNRTHSTKNLILKGMAD 397

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ GIGF SV+ +TD P   S 
Sbjct: 398  SHGPALIVHNNGSFTKEDFENITKLAGATKANQPLK-IGKFGIGFCSVYHITDVPSFVSG 456

Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMF--CRLLS--KDPVQLE--------SKCWNTC 1342
             +   FD +   +  V+     P     +   R+ S  +D V  E        S    T 
Sbjct: 457  KWLYIFDPTLQFLKGVVHDQSKPGKKIKYQQSRIFSNCQDLVPYEGLFGFSSLSDYEGTM 516

Query: 1343 IRLPFRT---KFSEGIAMNNIV-DMFSDL---HPSLLLFLHRLQCIMFRNML-NDSLVVI 1394
             R PFRT   + S  +   ++V D+  DL      LLLFL  ++ I F ++  N+ ++ +
Sbjct: 517  FRFPFRTSSSQISSTVYNEHLVEDIKRDLIECGSKLLLFLQHVKRITFSSLRGNEKIIEV 576

Query: 1395 RKKI-------VGDGIIKV-SCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESN 1446
               +       + + +IK    G     W +ASQ+      + +++    ++A  L  +N
Sbjct: 577  SIDVSSSSNDDIHEVVIKSPHNGSGTEHWLIASQE-EQWRYQNNIEPAVASVACQLDRTN 635

Query: 1447 EGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------DGNSPWN 1494
               Y     +   F FLPL   + GL   +  +F + S+R  +          D    WN
Sbjct: 636  TNCYNCKETEGNAFCFLPLALPSTGLPVHVSANFAVMSNRSGIWTEVSSGTVTDSKERWN 695

Query: 1495 QWLLSE 1500
            + L+ +
Sbjct: 696  KNLMEK 701


>gi|429768006|ref|ZP_19300182.1| hypothetical protein HMPREF0185_00450 [Brevundimonas diminuta 470-4]
 gi|429189571|gb|EKY30398.1| hypothetical protein HMPREF0185_00450 [Brevundimonas diminuta 470-4]
          Length = 1743

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 82/351 (23%), Positives = 141/351 (40%), Gaps = 63/351 (17%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQ--ESGIVVL---NNEQGFSAENIRALCDVGNSTK 1264
            +LEL+QNA D   P +    + F+    E    VL   N+ +GF+  N +ALC    +TK
Sbjct: 56   VLELLQNAHDAHDPSDRNGRIRFVYDAGEGDFGVLYAANDGRGFTRANFKALCSPTRTTK 115

Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI-----GFVLPTLVPPFN 1319
              + A  IG KG+GF SVF++   PE++S     K    +G          L   + P  
Sbjct: 116  AVNEA--IGNKGVGFLSVFQICAHPEVYSKPTSRKSGVFDGYCFTFADNATLQAFLAPCG 173

Query: 1320 IDMFCRLLS-------------KDPVQLES---KCWNTCIRLPFRTKFSEGIAMNNIVDM 1363
            +    R ++             + P  +ES     + T +RLP +   +  +A+   +  
Sbjct: 174  LADRARRVAATMPQLYLATPSIQPPPAVESLGEDGYATVLRLPLKNAEAR-LAVETQLTE 232

Query: 1364 FSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG----IIKVSCGEDKMTWFVAS 1419
             ++  P + LFL R+  +         ++    + + +G    +  V+CGE +  + VA 
Sbjct: 233  LAEGQPDVQLFLERISELRIEFGGTTHVLGRAPETLHEGSDLVLQNVTCGERR--YIVAR 290

Query: 1420 QKLRAGVIRPDVKTTEIALALTLQESNEGNY----------------GPLLYQQPVFAFL 1463
            + L      P+    E+  A  L E+    +                GP +  + ++ FL
Sbjct: 291  RTL------PETAVREVINADVLSEALPERWNDWIGDAVISIAVTADGPPIDGR-LYTFL 343

Query: 1464 PLRTYGLKFILQGDFVLP----SSREEVDGNSPWNQWLLSEFPALFVSAER 1510
            P+   G      G    P      R++++    +N WLL E   L   A R
Sbjct: 344  PM-GKGAPSPFAGHMDAPFFATIERKDLEEGGTFNPWLLDECRRLAFDAAR 393


>gi|116620801|ref|YP_822957.1| superfamily I DNA/RNA helicase [Candidatus Solibacter usitatus
            Ellin6076]
 gi|116223963|gb|ABJ82672.1| Superfamily I DNA and RNA helicases and helicase subunits-like
            protein [Candidatus Solibacter usitatus Ellin6076]
          Length = 1586

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 81/351 (23%), Positives = 137/351 (39%), Gaps = 86/351 (24%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN------------EQGFSA 1250
            YS  +HF+ EL+QNADD         + +F+L   G+V  +N            E+  SA
Sbjct: 39   YSDQAHFIYELLQNADDA-----RATSASFVLTRDGLVFKHNGETRFSISDPATEEADSA 93

Query: 1251 E----NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
                 ++ A+  + NS K  +S   IG+ G+GFKSVF+ T  P I+   F  + +     
Sbjct: 94   SGALGHVNAITSIANSNKTAAS---IGKFGVGFKSVFQYTKTPHIYDPAFCFRIER---- 146

Query: 1307 IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSD 1366
              F++P  +   + D   R           +   T   LPF       +  ++ +     
Sbjct: 147  --FIVPHAL---DSDFAGR-----------QDHETVFWLPFDDDRKRPVECHDEI----- 185

Query: 1367 LHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDG------IIKVSCGEDKMTWFVASQ 1420
                      +L+ ++F  +    L  +  ++ G+       +I+     D     V   
Sbjct: 186  --------AEKLKSLIFPTLFLSKLGSVSWEVNGEAGGYSKQVIRTVQRHDLTARLV--- 234

Query: 1421 KLRAGVIRPDVKTTEIALALTLQESNEG---------NYGPL-LYQQPVFAFLPLRT-YG 1469
            +L + V R   +TTE  + L    + EG         + G L    QP F F P +    
Sbjct: 235  RLASTVGR---QTTEQDIWLFSPATVEGLPFCVGFSVSDGKLRATSQPAFCFFPTKEPTN 291

Query: 1470 LKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRE 1520
            L F++   F+L  SRE +      NQ ++ +   L   +      LP  R+
Sbjct: 292  LSFMVHAPFLLTDSREGIKAGEKHNQDMVEQLARLAADS------LPVLRD 336


>gi|392843829|gb|AFM84518.1| sacsin, partial [Polypterus senegalus]
          Length = 735

 Score = 56.6 bits (135), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 53/105 (50%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        +Y     PT          LQ   + V NN Q F+
Sbjct: 416  YPSEKEMLKELLQNADDAKATEIYFVYDPRDHPTDRIFDDKWIPLQGPSLCVYNN-QPFT 474

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             ++IR + ++G  TK+G+  G  G+ GIGF SV+ +TD P   SN
Sbjct: 475  EDDIRGIQNLGRGTKEGN-PGKTGQYGIGFNSVYHITDCPSFISN 518


>gi|82913421|ref|XP_728637.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23485089|gb|EAA20202.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 285

 Score = 56.6 bits (135), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 82/309 (26%), Positives = 123/309 (39%), Gaps = 65/309 (21%)

Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNN----------------EQGFSAENIR 1254
            LEL+QNADD    E V     F + ++G++V N+                 QG    +  
Sbjct: 3    LELIQNADDAKAEEVV-----FDITDNGLLVRNSGQFTYCGDLNKPCAFLSQGNYKCDYH 57

Query: 1255 ALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
             + DVG S  K S    IGR GIGF S ++VTD PEI S+G  +      GQ  + +   
Sbjct: 58   RITDVG-SGGKLSRGENIGRFGIGFVSTYQVTDHPEIRSSGIKLALYPERGQ--WFIEAF 114

Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
              P     F    S DP        NT  RL           ++ + + F  +    LLF
Sbjct: 115  EEPDGTSFFLPWAS-DP--------NTEARLALGISHVSPTHIDQLAEDFQAVLRKSLLF 165

Query: 1375 LHR-------------LQCIMFRNMLNDSLVVIRKKIVGD--GIIKVSCGEDKMTWFVAS 1419
            L               L C + R+  +D +V  R     +   I++    E       A+
Sbjct: 166  LRHVRKAEIRRNGQLLLACDLDRSEGSDLIVSFRPSGAVEQWHILRADAAE-------AA 218

Query: 1420 QKLRAGVIRPDV--KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQG 1476
            Q+L A   R +   ++T I++ L +           L +  ++AFLP  ++ GL   L  
Sbjct: 219  QRLYATHPRLETLHRSTNISIGLRIDPEP-------LTEGLLYAFLPTEQSTGLPLHLNA 271

Query: 1477 DFVLPSSRE 1485
            DF   S R+
Sbjct: 272  DFFPESDRK 280


>gi|260592925|ref|ZP_05858383.1| hypothetical protein HMPREF0973_02386 [Prevotella veroralis F0319]
 gi|260535125|gb|EEX17742.1| hypothetical protein HMPREF0973_02386 [Prevotella veroralis F0319]
          Length = 2288

 Score = 56.6 bits (135), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/218 (24%), Positives = 99/218 (45%), Gaps = 40/218 (18%)

Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPENVEPT---------LTFILQESGI 1239
            ++ R+L  L ++L  S    F+ EL+QNA+D  YP   +           + F + ++ +
Sbjct: 213  KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRAKINGEYQALPVDVEFHITKNYL 270

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
               +  + F+ +NI A+CD+ N  +K  +   IG KGIGFK+VF   D   +++  +  +
Sbjct: 271  TFQHTGEYFNPKNIAAICDI-NDGEKADNTKAIGYKGIGFKTVFLDNDYVLLNTGNYTFR 329

Query: 1300 FDTSEGQI----GFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKF---- 1351
            FD S   +      +LP       ID   + + +     E           FR KF    
Sbjct: 330  FDKSATDVINTPWQILPIWTENNEIDNDIKFVFRQHPNDE-----------FRVKFALQP 378

Query: 1352 ------SEGIAMNNIVDMFSDLHPS--LLLFLHRLQCI 1381
                  ++    +N +D+F+D+  S  ++LF+  ++ +
Sbjct: 379  RDKEILTDEDRDDNYIDLFTDVFESERIILFIPNIKKV 416


>gi|260803427|ref|XP_002596591.1| hypothetical protein BRAFLDRAFT_78511 [Branchiostoma floridae]
 gi|229281850|gb|EEN52603.1| hypothetical protein BRAFLDRAFT_78511 [Branchiostoma floridae]
          Length = 1237

 Score = 56.6 bits (135), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 71/307 (23%), Positives = 122/307 (39%), Gaps = 43/307 (14%)

Query: 1787 KNLMADYLCFVMMHLEYYCPNCHVEREFIVSELRKKAFVLTNHGFKRPSEIPIHFGKEFG 1846
            + L+  YL F+    +    + +++R      L+  A V+ N G   PS  P+HF   +G
Sbjct: 73   RELLIGYLVFIKSQWDLQPDHVNLDR------LKPVAQVVNNKGLMTPSSQPVHFTPAYG 126

Query: 1847 NPVSVNMLIHDIDIKWYEVDITYL--KHPAKESLSCGLVKWRKFFKEIGITDFVQVVQ-- 1902
            N + +   +   +  W  +D  YL    PA+ ++      WR FF ++G+ DF+ V +  
Sbjct: 127  NRLDLQKKLPGNE--WTLLDSCYLGTSTPAQHTVQT----WRHFFVKLGVLDFLAVKKER 180

Query: 1903 ---VDKDVADISHTGFKNMWTKELLSPGSAAIDWESNELVHL-LSLLTTNVNRQCSKHLL 1958
                 + +     +   + W K     G    D+   E   L ++ LT +V     + L 
Sbjct: 181  VKLSTEQLGKSPWSTLSSFWPKS--DEGYEVEDFTCAEFYSLVMADLTADVRTPQMQILF 238

Query: 1959 EILDTLWDDCYTDKIMGFFKSKPTGDDR-SFQSSFINCICDIQW-----------TISSM 2006
            E++D  WD  Y  K        P+G      +SSF+  I  I W           T    
Sbjct: 239  EMIDKEWDGSYR-KYSTAQVFDPSGVKLCDAESSFLIHIKTIPWVPGIKTSTLPNTAPDG 297

Query: 2007 DDELHY------PKDLFHDCDAVRSILGPSAPYIVPKVKSEKLVCD--IGLKTEVTIDDI 2058
              +L Y        +L+        +L    PY+   V++E       + L+  +T D +
Sbjct: 298  SAQLFYNNVFMRGSELYEKTREHIGLLAHHVPYLTAAVQTETSSFGQVLQLQQSLTTDRL 357

Query: 2059 LEILKVW 2065
            L I+K W
Sbjct: 358  LAIVKGW 364


>gi|126667620|ref|ZP_01738589.1| hypothetical protein MELB17_00250 [Marinobacter sp. ELB17]
 gi|126627889|gb|EAZ98517.1| hypothetical protein MELB17_00250 [Marinobacter sp. ELB17]
          Length = 1516

 Score = 56.6 bits (135), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 68/146 (46%), Gaps = 27/146 (18%)

Query: 1211 LELVQNADD--------NIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCD 1258
            LEL+QNADD        +I  + +        T+  + S +  L  E  +S +  R + D
Sbjct: 43   LELIQNADDAKAHGVAFDITDDALVVWNSGAFTYCGEMSPVCRLMQEDNYSCDFHR-IAD 101

Query: 1259 VGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG---------- 1308
            VG+  K   S   IGR GIGF S ++VTD PEIHS+G  ++    E Q            
Sbjct: 102  VGSGGKLAKSEN-IGRFGIGFVSAYQVTDHPEIHSSGIKLQLIPEEAQYACSNIPLETGT 160

Query: 1309 -FVLPTLVPPFNIDMFCRLLSKDPVQ 1333
             FVLP  + P +I    R +   PV+
Sbjct: 161  KFVLPWAMNPHSIGR--RAIGATPVR 184


>gi|374996799|ref|YP_004972298.1| ATPase [Desulfosporosinus orientis DSM 765]
 gi|357215165|gb|AET69783.1| Putative ATPase (DUF699) [Desulfosporosinus orientis DSM 765]
          Length = 1586

 Score = 56.2 bits (134), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 138/339 (40%), Gaps = 63/339 (18%)

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIV-VLNNEQGFSAE 1251
            R +     E YS  +HF+ EL+QNADD         ++ F L+ + ++ V N  + FS  
Sbjct: 29   RGVKTSVVEKYSDQAHFIYELLQNADDA-----GATSVRFDLRYNKLIFVHNGTRRFSIS 83

Query: 1252 N---------------IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
            N               I A+  + NS K  +S   IG+ G+GFK+VF+ T  P I+   F
Sbjct: 84   NPETEEADTNNGKLGDINAITSIANSNKTAAS---IGKFGVGFKAVFQYTQTPSIYDPEF 140

Query: 1297 HVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTK-FSEGI 1355
              + +       F++P  +     D F     + P +       T    PF  +  S   
Sbjct: 141  RFRIER------FIVPVKID----DDFS---GRRPEE-------TLFVFPFDHRDRSREE 180

Query: 1356 AMNNIVDMFSDL-HPSLLLFLHRL---QCIMFRNMLNDSLVVIRKKIVGDGI---IKVSC 1408
            A  +I D    L +P L L   R+   +C         S  V ++   G+ +   I ++ 
Sbjct: 181  AYEDISDKLRSLVYPVLFLTNLRMISVECAEKFGFYTKS--VDKEMAFGNTVARLITLTQ 238

Query: 1409 GEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-T 1467
              D     +   KL     R + +    ++ L+L  + EG +  L    P F F P +  
Sbjct: 239  NNDDDENELIKDKLWM-FTRANERGFTYSVCLSL--NTEGAF--LRSNFPAFCFFPTKEV 293

Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQ---WLLSEFPA 1503
             GL FIL   F+L  SRE +      N+    LL+E  A
Sbjct: 294  TGLNFILHAPFLLTDSREGIKAGEKHNKDMIKLLAELAA 332


>gi|429199883|ref|ZP_19191618.1| hypothetical protein STRIP9103_07746 [Streptomyces ipomoeae 91-03]
 gi|428664454|gb|EKX63742.1| hypothetical protein STRIP9103_07746 [Streptomyces ipomoeae 91-03]
          Length = 1527

 Score = 56.2 bits (134), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 92/385 (23%), Positives = 155/385 (40%), Gaps = 69/385 (17%)

Query: 1202 LYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGN 1261
            L S     L E+VQNA+D +   +V      + +E  ++V +N       ++ A+  +  
Sbjct: 76   LSSDRLQGLSEIVQNAED-LGATDVR----ILTRERELLVAHNGSPVRLPDVLAMA-MPW 129

Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNID 1321
             T K   A   GR GIG  ++  ++   E+H    H +F   +  I        P    D
Sbjct: 130  LTSKAEQAESTGRFGIGLMTLQSLSPYLEVHCG--HYRFRVGDPDISVAEAFTAP----D 183

Query: 1322 MFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI 1381
             F               W T +R+P        +  + + +  ++   S LLFL R++ +
Sbjct: 184  WFA-----------GDTW-TVLRVPLELGV---LGADEVDEWLAEWGDSALLFLGRVRSV 228

Query: 1382 M---------FRNMLNDSLVVIRKKIVGDGIIKVSCGE----DKMTWFVASQKL--RAGV 1426
                       R  LN  +    + +VG   + V  GE    D   W V    +   +G+
Sbjct: 229  THLDEQGGVRHRLALNREVGEPFEAVVGGQSVDVETGEACAGDGSRWLVCRTTVPSPSGI 288

Query: 1427 IRPDVK---TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSS 1483
             R       TT IA+AL L   + G          ++A LP+    L   +   F   +S
Sbjct: 289  TRSHKAVEPTTAIAVALPLGPDDAGR---------IYAGLPVADTTLALCVSAQFDPIAS 339

Query: 1484 REEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPL---VGEVH 1540
            R+ +D N+PWN+ L      L+ +A      L  FR +PA+A       VPL   + +  
Sbjct: 340  RQALD-NTPWNRALNELVGDLWTAAV-----LWQFRRDPARA----WRAVPLPDDMRDAR 389

Query: 1541 GFFSGLPRMILSKLR--MSNCLILE 1563
               + L R++L + R  +S+ L+L+
Sbjct: 390  DPVAALERLLLDRARAGVSHGLVLD 414


>gi|47215794|emb|CAG02848.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 4658

 Score = 56.2 bits (134), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        IY     PT          +Q   + V NN Q F+
Sbjct: 2618 YPSEKEMLKELLQNADDAKATEIYFIYDPRTHPTDRVFDDKWIPMQGPALCVYNN-QPFT 2676

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             +++R + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN
Sbjct: 2677 EDDVRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 2720



 Score = 42.7 bits (99), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 77/189 (40%), Gaps = 45/189 (23%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPE-------NVEPTLTFILQESGIVVLN-------NEQ 1246
            E Y   +    EL+QNADD    E            +   L + G++  +       N  
Sbjct: 1547 EEYPSVADIFKELLQNADDASATECNFLIDMRKNNDIRENLLDPGMIACHGPSLWSFNNA 1606

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
             FS  +   +  +G S K+ S A  +G+ G+GF SV+ +TD P I S  F + FD +   
Sbjct: 1607 VFSDTDFLNITRLGGSVKR-SEADKVGKFGLGFNSVYHITDIPIIMSREFMIIFDPNINH 1665

Query: 1307 IG----------------------FVLPTLVPPFNIDMF-CRL-LSKD-PVQLESKCWNT 1341
            I                          P    PF I++F C+L L++D P +       T
Sbjct: 1666 ISKHIRDRSNPGIKINWSKQQKRLRKFPNQFKPF-INVFGCQLPLAQDSPYKFN----GT 1720

Query: 1342 CIRLPFRTK 1350
              RLPFRT+
Sbjct: 1721 LFRLPFRTE 1729


>gi|218670910|ref|ZP_03520581.1| hypothetical protein RetlG_04268 [Rhizobium etli GR56]
          Length = 260

 Score = 56.2 bits (134), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 73/277 (26%), Positives = 116/277 (41%), Gaps = 42/277 (15%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            +TF L    + V +    FSA ++R +C +  STK  ++   IGR GIGFKSV+  T+ P
Sbjct: 2    VTFHLSRQSLRVSHFGAPFSAGDVRGVCGINASTKDYTA---IGRFGIGFKSVYAFTEKP 58

Query: 1290 EIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT 1349
            EIHS         S     +V P   PP          +++P +       T + LP   
Sbjct: 59   EIHSG------SESFSISSYVFPEEAPP---------CAREPEE-------TVVLLPLAA 96

Query: 1350 KFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKV--- 1406
               +  A + I +  +DL    LLFL  ++ I +  +        R+  V +G ++    
Sbjct: 97   --DDENAFDEIAEGLADLGAEALLFLKHVEEISW-TIEGRGSGRYRRDAVAEGFMQAVTL 153

Query: 1407 ----SCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAF 1462
                  G  + T+ + S+ +        V   EIA  L   + N     P+   + V  F
Sbjct: 154  LSSGDAGGQEETYLLFSRAVEND--GKYVGLAEIAF-LVADQDNRWVVQPIDDARLV-VF 209

Query: 1463 LP--LRTYGLKFILQGDFVLPSSREEVDGNSPWNQWL 1497
             P  L TY   F++QG +    SR+ +  +  WN  L
Sbjct: 210  FPTVLPTY-TGFLIQGPYQTTPSRDNIPTDRAWNAKL 245


>gi|449678125|ref|XP_002155302.2| PREDICTED: uncharacterized protein LOC100207584 [Hydra
           magnipapillata]
          Length = 238

 Score = 56.2 bits (134), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 53/96 (55%), Gaps = 2/96 (2%)

Query: 200 INAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDVDVV 259
           INAFI  F+  R I +L++L + ICK E    F +L+LGP  +Q LI   F    + + +
Sbjct: 144 INAFIQSFMTFRNICTLFELNQEICKMESKSSFLDLKLGPLQKQLLIYDNFKFPPNQNDI 203

Query: 260 FKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKK 295
             I T+DV   +  +   +K  +  + EFL+F+ KK
Sbjct: 204 PPITTSDVFRLI--WKMLNKKHNFLMIEFLEFLIKK 237


>gi|340383697|ref|XP_003390353.1| PREDICTED: sacsin-like, partial [Amphimedon queenslandica]
          Length = 1275

 Score = 56.2 bits (134), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 88/376 (23%), Positives = 148/376 (39%), Gaps = 79/376 (21%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + +E Y      + E++QNADD        ++   +  +   +L+    
Sbjct: 909  QHEPLTTRLSNILRE-YKDGLTIIKEMIQNADDAGATEVNILHDTRMHSSHNLVLEGMAE 967

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ G+GF SV+ +TD P   S 
Sbjct: 968  SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 1026

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQ-----LESKCWN-TCI 1343
             +   FD +   + G V     P   I    + L+K     P Q       S  +N T  
Sbjct: 1027 EWLYIFDPTLKHLKGVVQNESRPGKKIKFRSKFLAKSQQLAPYQDLFGFDSSSSYNGTIF 1086

Query: 1344 RLPFRT---KFSEGIAMNNIVDMF-SDLHPS---LLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            RLPFRT   + S  I  N+++ M   DL  +   LLLFL  ++ I F +      ++   
Sbjct: 1087 RLPFRTHPSQISSTIYNNHLIQMIKKDLESNGSKLLLFLQNVKRITFSSQQEGRPIIEH- 1145

Query: 1397 KIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQE---SNEGNYG-- 1451
                   + + C  D       ++ ++  V +   K+TE  L    ++   S +G Y   
Sbjct: 1146 -------LSIECSNDD------TENVKVCVTQSSTKSTEYWLLSNKEQRLKSQDGAYKQA 1192

Query: 1452 ------PLLYQQP----------VFAFLPLR--TYGLKFILQGDFVLPSSREEV------ 1487
                   L+ +Q            F FLPL     GL   +  +F + S+R  +      
Sbjct: 1193 VASVACKLVKKQSSFLCEAIEGNTFCFLPLSVPATGLPVHVTANFAVMSNRSGIWTGASS 1252

Query: 1488 ----DGNSPWNQWLLS 1499
                D    WNQ L++
Sbjct: 1253 AMASDEREYWNQQLMT 1268


>gi|340383029|ref|XP_003390020.1| PREDICTED: sacsin-like, partial [Amphimedon queenslandica]
          Length = 5665

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 63/371 (16%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ-----ESGIVVLNNEQGFSA 1250
            Y      L E++QNADD        +Y      T   I +         ++++N   F+ 
Sbjct: 1579 YKDGLSILKEMIQNADDAGATEVNILYDNRKHSTKNLIFKGMADSHGPALIVHNNSTFAK 1638

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
            E+   +  +  +TK       IG+ GIGF SV+ VTD P   S  +   FD +   +  +
Sbjct: 1639 EDFENITKLAGTTKANQPLK-IGQFGIGFCSVYHVTDVPSFVSAEWLYIFDPTLNYLKDI 1697

Query: 1311 LPTLVPP-----------FNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT---KFSEGIA 1356
            +     P            N D           +  S    T  RLPFRT   + S  I 
Sbjct: 1698 VQNQSRPGKKIKYQSKILANTDHLVPYDGLFGFKSSSNYNGTLFRLPFRTSASQISSTIY 1757

Query: 1357 MNNIV-DMFSDL---HPSLLLFLHRLQCIMF------RNMLNDSLVVIRKKIVGDGIIKV 1406
             +++V DM  DL      LLLFL  ++ I F      + ML  S+   R   +   ++  
Sbjct: 1758 NDHLVEDMKRDLVNWGSKLLLFLQHVKRITFSDFKGTKKMLKVSIDRSRIADIRHFVVTF 1817

Query: 1407 SCGEDKMT--WFVA-----SQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPV 1459
            S  +  +T  W +A      Q   +G+      +   ++A  L + N  +Y     +   
Sbjct: 1818 SPQQKCITEHWLIADAEEQQQSQNSGI------SAVASVACQLDKVNLSSYKCKKIKGSA 1871

Query: 1460 FAFLPLR--TYGLKFILQGDFVLPSSREEV----------DGNSPWNQWLL-SEFPALFV 1506
            F FLPL   + GL   +  +F + S+R  +          D    WN+ L+ +  PA + 
Sbjct: 1872 FCFLPLALPSTGLPVHISANFAIMSNRTGIWTKPSLSTVSDFREQWNKKLMETTIPAAYC 1931

Query: 1507 SAERSFCDLPC 1517
               +   ++ C
Sbjct: 1932 KLLQKLQEMHC 1942


>gi|379005862|ref|YP_005255313.1| superfamily I DNA/RNA helicase [Sulfobacillus acidophilus DSM 10332]
 gi|361052124|gb|AEW03641.1| superfamily I DNA/RNA helicase [Sulfobacillus acidophilus DSM 10332]
          Length = 1580

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 63/137 (45%), Gaps = 27/137 (19%)

Query: 1181 SNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIV 1240
            +NML+K   R    +     E YS  +HF+ EL+QNADD         +  FIL +  ++
Sbjct: 20   ANMLEKPSMR---GVRTSVVEKYSDQAHFIYELLQNADDA-----GATSARFILHDQELI 71

Query: 1241 VLNN----------------EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFR 1284
              +N                 +  +  +I A+  + NS K  +S   IG+ G+GFK+VF+
Sbjct: 72   FAHNGIRRFSISDPDTEEEDSRNGTLGDINAITSIANSNKTTAS---IGKFGVGFKAVFQ 128

Query: 1285 VTDAPEIHSNGFHVKFD 1301
             T  P I+   F  K +
Sbjct: 129  YTQTPRIYDPNFRFKIE 145


>gi|288926515|ref|ZP_06420434.1| hypothetical protein HMPREF0649_01945 [Prevotella buccae D17]
 gi|288336727|gb|EFC75094.1| hypothetical protein HMPREF0649_01945 [Prevotella buccae D17]
          Length = 2291

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 66/128 (51%), Gaps = 13/128 (10%)

Query: 1190 RLGRALHCLSQEL-YSQDSHFLLELVQNADDNIYPENVEPT---------LTFILQESGI 1239
            ++ R+L  L ++L  S    F+ EL+QNA+D  YP   +           + F + ++ +
Sbjct: 213  KIARSLDTLKKQLTQSGKEVFIYELLQNAND--YPRRAKINGEYQALPVDVEFHITKNYL 270

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVK 1299
               +  + F+ +NI A+CD+ N  +K  +   IG KGIGFK+VF   D   +++  +  +
Sbjct: 271  TFQHTGEYFNPKNIAAICDI-NDGEKADNTKAIGYKGIGFKTVFLDNDYVLLNTGNYTFR 329

Query: 1300 FDTSEGQI 1307
            FD S   +
Sbjct: 330  FDKSATDV 337


>gi|284032160|ref|YP_003382091.1| hypothetical protein Kfla_4256 [Kribbella flavida DSM 17836]
 gi|283811453|gb|ADB33292.1| hypothetical protein Kfla_4256 [Kribbella flavida DSM 17836]
          Length = 1234

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 46/93 (49%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
              ++EL QNA D      V   + F L++  +VV N     SA  + +L  +  S K+  
Sbjct: 43   RVVVELAQNAADAAVRAGVPGRVRFSLRDGTLVVANTGTPLSAAGVESLATLRASAKRDD 102

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
            +   +GR G+GF +V  VTD P I S    V+F
Sbjct: 103  AESAVGRFGVGFAAVLAVTDEPVILSRTGGVRF 135


>gi|392843735|gb|AFM84471.1| sacsin, partial [Lepisosteus oculatus]
          Length = 736

 Score = 55.8 bits (133), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     PT          +Q   + V NN Q F+
Sbjct: 417  YPSEKEMLKELLQNADDAKATEIYFVFDPRSHPTDRIFDDKWVPMQGPALCVYNN-QPFT 475

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             ++IR + ++G  TK+G+  G  G+ GIGF SV+ +TD P   SN
Sbjct: 476  EDDIRGIQNLGRGTKEGN-PGKTGQYGIGFNSVYHITDCPSFISN 519


>gi|410909740|ref|XP_003968348.1| PREDICTED: sacsin-like [Takifugu rubripes]
          Length = 4585

 Score = 55.8 bits (133), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 53/106 (50%), Gaps = 15/106 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        IY     PT          +Q   + V NN Q F+
Sbjct: 2547 YPSEKEMLKELLQNADDAKATEIFFIYDPRTHPTDRVFDDKWIPMQGPALCVYNN-QPFT 2605

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
             +++R + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN 
Sbjct: 2606 EDDVRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYYITDCPSFISNS 2650



 Score = 48.5 bits (114), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 95/236 (40%), Gaps = 56/236 (23%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPE-------NVEPTLTFILQESGIVVLN-------NEQ 1246
            E Y   +    EL+QNADD    E            +   L + G++V +       N  
Sbjct: 1476 EEYPSVADIFKELLQNADDASATECNFLIDMRKNNEIRENLLDPGMIVCHGPSLWSFNNA 1535

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
             FS  +   +  +G S K+ S A  +G+ G+GF SV+ +TD P I S  F + FD +   
Sbjct: 1536 VFSDTDFLNITRLGGSVKR-SEADKVGKFGLGFNSVYHITDIPIIMSREFMIIFDPNINH 1594

Query: 1307 I----------GFVL------------PTLVPPFNIDMF-CRLLSKDPVQLES--KCWNT 1341
            I          G  +            P    PF I++F C L    P+  +S  K   T
Sbjct: 1595 IRKHIRDRSNPGIKINWSKQQKRLRKFPNQFKPF-INVFGCEL----PLAQDSPYKYNGT 1649

Query: 1342 CIRLPFRTK-----------FSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
              RLPFRT+           +     + ++VD FS      +LF   + C++ + +
Sbjct: 1650 LFRLPFRTEQEASQSEISSLYYNTTDIYSLVDEFSICGHRFILFTQHVGCMLLKYL 1705


>gi|301617805|ref|XP_002938327.1| PREDICTED: sacsin-like [Xenopus (Silurana) tropicalis]
          Length = 3379

 Score = 55.5 bits (132), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 101/437 (23%), Positives = 169/437 (38%), Gaps = 115/437 (26%)

Query: 1203 YSQDSHFLLELVQNADDN---------IYPENVEPTLTFILQE----SGIVVLN-NEQGF 1248
            YS+      EL+QNADD             +N E   + I  +     G  + + N++ F
Sbjct: 707  YSEQVELFKELIQNADDAEATICQFLVDMRQNSESRQSLIDPDMANCHGAALWSYNDRKF 766

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD------- 1301
            +  + R +  +G +TK+ +    IG+ G+GF +V+ +TD P I S    + FD       
Sbjct: 767  TDNDFRNITRIGAATKE-TEVKKIGKFGLGFNTVYHLTDVPSIMSGSNVLIFDPNINHIK 825

Query: 1302 ---TSEGQIGFVLPTLVPPFNIDMF------------CRLLSKDPVQLESKCWNTCIRLP 1346
               ++ G  G  L   + P  +++F            C L    P   +     T IRLP
Sbjct: 826  KHISNSGNPGIKLNLQMKPEILNVFLDQFQPFSGVFGCEL--NQPFYYDG----TLIRLP 879

Query: 1347 FRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHR-----LQCI--------- 1381
            FRT+     +           +++ +  F +   +LL+FL       L C+         
Sbjct: 880  FRTEKEAKDSQICQQAFNKEQIHSFIHNFEEAASTLLIFLKNIEKVTLNCLEKDLCPQAD 939

Query: 1382 ---------------MFRNML-----NDSLVVIRKK-----IVGDGIIKVSCGEDKMT-- 1414
                           M RNM+      +SL V+R       + G  IIK++  + + T  
Sbjct: 940  ESRVMVQRDKVQTLDMPRNMILQQEQMNSLNVLRMNVNNLDVTGSHIIKITVRQTETTEE 999

Query: 1415 -WFVASQKL----------RAGVIRPDVKTTEIALALTLQESNEGNYGPLL--YQQPVFA 1461
             +++A   L          +   +   +    +AL L  +  + G + P L  +   VF 
Sbjct: 1000 MYYLAQSSLGIKESIQMFSQNKKLNTYLPVAGVALPLK-KHCDTGKWAPNLSSFSGMVFC 1058

Query: 1462 FLPLR-TYGLKFILQGDFVLPSSRE---EVDGNSPWNQWLLSEFPALFVSAERSFCDLPC 1517
            FLPL  + GL F L G F + S+R+   +      WN+ LL +  A  V+   +   L  
Sbjct: 1059 FLPLPVSSGLPFHLNGTFAVMSNRKSLWDTTEKGEWNRTLLGD--AAVVALITALAQLQI 1116

Query: 1518 FRENPAKAASVYMSFVP 1534
               N       Y +F P
Sbjct: 1117 LSNNGDIQDYSYHTFWP 1133


>gi|300726908|ref|ZP_07060335.1| hypothetical protein PBR_1023 [Prevotella bryantii B14]
 gi|299775811|gb|EFI72394.1| hypothetical protein PBR_1023 [Prevotella bryantii B14]
          Length = 2150

 Score = 55.5 bits (132), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 76/317 (23%), Positives = 126/317 (39%), Gaps = 65/317 (20%)

Query: 1209 FLLELVQNADDNIYPEN--VEPT-LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
            F+ EL+QNA+D  Y  N  +E   + F +    +V +++   F+  NI A+C + N  +K
Sbjct: 17   FIYELLQNANDYPYKANNTIEKVNVEFHITRDSLVFMHSGAVFNERNIAAICSI-NDKEK 75

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
              +   IG KGIGFK+VF   +   + +  F  +FD  E +     P  + P        
Sbjct: 76   TDNKETIGYKGIGFKTVFLDNNYVYLQTGDFSFRFDREESRDIVDTPWQILPI------- 128

Query: 1326 LLSKDPVQLESKCWNTCIRLPFRTKFS------EGI--AMNNIVDMFSDL--HPSLLLFL 1375
              + + +    K   T     FR KFS      E +  +  N V MF D+  +  ++LF+
Sbjct: 129  WTNYNELTPSEKYVFTNADKQFRVKFSLRPTNIETLRGSGQNYVQMFKDVFKNERVILFI 188

Query: 1376 HRLQCI-MFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTT 1434
              L  + +F N  ++ LV               C  D   W V      +  + P++ T 
Sbjct: 189  PNLSSVKVFFNESDEPLV---------------CNCDNEHWQVNH---FSDDVSPEI-TK 229

Query: 1435 EIALALTLQESNEGNYGPLLY-----------------------QQPVFAFLPLRT-YGL 1470
             I L +  QE       P  Y                          V+ +LP +  +G 
Sbjct: 230  SINLDIDKQEDVGALKIPTKYYDFTKTKVSFACEIDGVQLKEVEDTLVYCYLPTKAKWGF 289

Query: 1471 KFILQGDFVLPSSREEV 1487
            KF++  D +    R++V
Sbjct: 290  KFLMNTDMIPTGPRDDV 306


>gi|302813581|ref|XP_002988476.1| hypothetical protein SELMODRAFT_159510 [Selaginella moellendorffii]
 gi|300143878|gb|EFJ10566.1| hypothetical protein SELMODRAFT_159510 [Selaginella moellendorffii]
          Length = 4500

 Score = 55.5 bits (132), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 89/220 (40%), Gaps = 46/220 (20%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI-----------------VVLN 1243
            E+Y+     L EL+QNADD    E     ++F+L +S                   +   
Sbjct: 1363 EMYADGPGTLFELLQNADDAGATE-----VSFLLDKSSFGTNSVLSRQMADWQGPALYCY 1417

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            N+  F  E+  A+  +G ++K    +  IGR G+GF SV+  TD P   S    V  D  
Sbjct: 1418 NDSVFKPEDFYAISRIGQNSKLDKPSA-IGRFGLGFNSVYHFTDVPSFVSGSTIVLLDPH 1476

Query: 1304 EGQIGFVLPTLVPPFNIDMFCR-LLSKDPVQLES----KC------WNTCIRLPFRTKFS 1352
             G +  + P   P   I+   + LL + P Q ES     C        T  R P RT  +
Sbjct: 1477 AGNLPGISPA-QPGLKINFVGKELLKQFPDQFESFLLFGCDLQKPFLGTLFRFPLRTTVT 1535

Query: 1353 EGIAM--------NNIVDMFSDLHPS---LLLFLHRLQCI 1381
               +          ++V +FS L  S   +LLFL  +  I
Sbjct: 1536 AARSEIKQDPYQPEDVVTLFSALKKSAGEVLLFLRHVTSI 1575



 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 92/401 (22%), Positives = 148/401 (36%), Gaps = 104/401 (25%)

Query: 1203 YSQDSHFLLELVQNADD--------------NIYPENVEPTLTFILQESGIVVLN-NEQG 1247
            Y + +  L E++QNADD              +     V P L    Q  G  +L  N   
Sbjct: 21   YPEGTTILKEMIQNADDAGASQVCFCVDRRRHAVDSLVSPPLA---QWQGPALLAYNNAA 77

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            F+ E+ +++  +G+S K+G  A   GR GIGF SV+ ++D P   S G HV F   + Q 
Sbjct: 78   FTEEDFQSISRIGDSKKRGQ-AWKTGRFGIGFNSVYHLSDLPSFVS-GSHVVFFDPQCQF 135

Query: 1308 GFVLPTLVPPFNIDMFCRL-LSKDPVQLESKCWNTC----------IRLPFR-------T 1349
               + +L P   ID      L   P Q    C   C           R P R       +
Sbjct: 136  LPSVSSLNPGKRIDFVASTALENFPDQFSPYCVFGCDMRRPFGGTLFRFPLRSAEQAATS 195

Query: 1350 KFS-EGIAMNNIVDMFSDLHPSL---LLFLHRLQCIMF---------------------- 1383
            K S +     +++++  DL+  +   +LFL  ++ + F                      
Sbjct: 196  KLSKQSYTSTDMLELLRDLYHEIVQVMLFLKNVERVEFYEWSAGSSSPTLLYSCAVRLPT 255

Query: 1384 ------RNML------------NDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
                  R +              ++  V R ++V +       G++ ++ F+ SQ + + 
Sbjct: 256  SEIRSHRQLFLRLSKGKDAAASEETKDVFRLELVTNAYTGADSGKESLSSFLISQAMGSS 315

Query: 1426 VIRPDVKTT----EIALALTLQESNEGNYGP-------LLYQQPVFAFLPLRT-YGLKFI 1473
              R     T       L L    S   +  P       LL +   F FLPL    G+ F 
Sbjct: 316  ASRIGALATNAFKSYGLRLLPWASVAASILPGKVSIYCLLVEGRAFCFLPLPIKTGMPFH 375

Query: 1474 LQGDFVLPSSREEV----------DGNSPWNQWLLSEFPAL 1504
            + G F L ++R ++             S WNQ +L +  AL
Sbjct: 376  VNGYFELSANRRDIWHGSDMDRMGKLRSDWNQCVLEDVAAL 416


>gi|291238887|ref|XP_002739357.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 4368

 Score = 55.5 bits (132), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 57/115 (49%), Gaps = 15/115 (13%)

Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGIVVLN-------NEQ 1246
            E+Y +      EL+QNADD        +  +     +   L + G+   +       N+ 
Sbjct: 1403 EVYKEGVDIFKELIQNADDAGATDIRFLIDQRQNTDVQLKLLDPGMKHCHGPALWSYNDA 1462

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
             FS+E+IR +CD+  +TKK +    IGR G+GF SV+ VTD P + S    + FD
Sbjct: 1463 MFSSEDIRNICDIAAATKK-NKLDKIGRFGLGFTSVYHVTDVPSVVSGSHVMIFD 1516



 Score = 49.3 bits (116), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 16/97 (16%)

Query: 1212 ELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCD 1258
            EL+QNADD        IY     PT   +      +QES  + + N++ FSA +I  +  
Sbjct: 2445 ELLQNADDAMATELHIIYDPRSHPTGKIVSDLWKPIQESPAICVFNDRVFSAGDIEGIQQ 2504

Query: 1259 VGNSTKKGSSAGYI-GRKGIGFKSVFRVTDAPEIHSN 1294
            VG   K  S+ GY  G+ GIGF SV+ +TD P   S+
Sbjct: 2505 VGIGGK--SADGYTTGQYGIGFNSVYHLTDCPSFLSD 2539


>gi|323448598|gb|EGB04495.1| hypothetical protein AURANDRAFT_67157 [Aureococcus anophagefferens]
          Length = 4616

 Score = 55.1 bits (131), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 67/280 (23%), Positives = 112/280 (40%), Gaps = 61/280 (21%)

Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE------------- 1224
            N+E+    ++ +   R  H L  +LY + S  L ELVQNADD    E             
Sbjct: 1449 NVETEAFGQRESLTRRLAHIL--QLYPEGSGILNELVQNADDAGASEVRVLLDARSAATR 1506

Query: 1225 NVEPTLTFILQ--ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSV 1282
              +  L+  L   +   + ++N+ GF+  + R L  VG  +K   +    GR G+GF SV
Sbjct: 1507 GAQSLLSAGLAAWQGPALCVSNDAGFTESDWRNLVRVGQGSKL-ENFKTTGRFGLGFNSV 1565

Query: 1283 FRVTDAPEIHSNGFHVKF--DTSEGQI----------------GFVLPTLVPPFNIDMFC 1324
            + +TD P++ S G HV F    ++G +                G  LP   P    D F 
Sbjct: 1566 YHLTDVPQVAS-GDHVLFLDPHADGLVPGATPQRPGLRVKFTGGADLPAAFP----DQFA 1620

Query: 1325 RLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLL 1373
                     LE     T  RLP R++ +   +              +++ F     S+LL
Sbjct: 1621 SWADCG-CDLEKPLEGTLFRLPLRSRAAAATSEISKQHYGVEEAEALLEQFKAQATSVLL 1679

Query: 1374 FLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKM 1413
            FL  ++ +        ++ V+R     + + + +  ED +
Sbjct: 1680 FLRHVRAV--------TVTVLRDGGESETVFRAAAREDPL 1711


>gi|170757814|ref|YP_001782495.1| transcriptional regulator [Clostridium botulinum B1 str. Okra]
 gi|429246935|ref|ZP_19210220.1| transcriptional regulator [Clostridium botulinum CFSAN001628]
 gi|169123026|gb|ACA46862.1| putative transcriptional regulator [Clostridium botulinum B1 str.
            Okra]
 gi|428756042|gb|EKX78628.1| transcriptional regulator [Clostridium botulinum CFSAN001628]
          Length = 125

 Score = 55.1 bits (131), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 33/95 (34%), Positives = 54/95 (56%), Gaps = 4/95 (4%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNE 1245
            Q      +L+ LS +LY+    F+ E++QNADD+   +N E    +I +    +VV ++ 
Sbjct: 22   QAVNQASSLNALSGDLYTDSKRFIYEILQNADDS--SQNSEAVKVWIKILGDNLVVAHSG 79

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFK 1280
            + F+  +++ +C+V N TKK S     G KGIGFK
Sbjct: 80   KPFTTRDLQGICNVNNGTKK-SDLTKTGYKGIGFK 113


>gi|302866235|ref|YP_003834872.1| helicase domain-containing protein [Micromonospora aurantiaca ATCC
            27029]
 gi|302569094|gb|ADL45296.1| helicase domain protein [Micromonospora aurantiaca ATCC 27029]
          Length = 1568

 Score = 55.1 bits (131), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 64/252 (25%), Positives = 101/252 (40%), Gaps = 37/252 (14%)

Query: 1210 LLELVQNADD----NIYPENVEPTLTFILQESG--IVVLNNEQGFSAENIRALCDVGNST 1263
            LLELVQNA D    N   +     +  +L   G  +   N  + FS   + A+     S 
Sbjct: 54   LLELVQNAADAMAGNTDSDENTGRVEIVLDSVGGALYCANGGRPFSKRGLTAITHAHLSG 113

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV--------LPTLV 1315
            K+G     IGR G+GFKSV  V++ P++ S     +F++ E +             P L 
Sbjct: 114  KRGDE---IGRFGLGFKSVLAVSNTPQVFSRSVSFEFNSPEARSAIAEVGSGAKRYPALR 170

Query: 1316 PPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
                ID        DPV      W  T ++LP  +          I++  S      LLF
Sbjct: 171  TATTIDAMA-AFGHDPVLRGLAEWAATIVKLPNVSSIPR--LRKEIIEFSS----HFLLF 223

Query: 1375 LHRLQCIMFRNMLNDSLVVIR-KKIVGDGIIKVS-CGEDKMTWFV----------ASQKL 1422
            +  ++ +  R +  D        + +GDG+ K+   G +   W+V          A +++
Sbjct: 224  VGAVREVRLRILGADHFEATHTSRNLGDGLFKIERSGGEYEEWYVEDRMHAPSAEARRQV 283

Query: 1423 RAGVIRPDVKTT 1434
               V R +VK T
Sbjct: 284  GEAVSRSEVKVT 295


>gi|296271243|ref|YP_003653875.1| hypothetical protein Tbis_3292 [Thermobispora bispora DSM 43833]
 gi|296094030|gb|ADG89982.1| hypothetical protein Tbis_3292 [Thermobispora bispora DSM 43833]
          Length = 985

 Score = 54.7 bits (130), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 79/180 (43%), Gaps = 10/180 (5%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
              ++EL QNA D      V   L   L++  +   N      A  + +LC +  S KK  
Sbjct: 40   RLVVELAQNAADAALRAGVPGRLRLTLRDGVLYAANVGAPLDAAGVESLCTLRASAKK-D 98

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGF-HVKFDTSE-GQIGFVLPTLVPPF-NIDMFC 1324
              G  GR G+GF +V  VTD P I S     V++  +E  ++    P L       D   
Sbjct: 99   DIGAAGRFGVGFAAVVSVTDTPAIMSRATGGVRWSRAETAELVAARPELAEELARRDGHV 158

Query: 1325 RLLSKDPVQLE----SKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
             LL + P  +E     + ++T +RLP R + +E  A+  ++D  S   P  L  L R++ 
Sbjct: 159  PLL-RLPFPVEDLEVPEGFDTVVRLPLRDEAAEQ-AVRRMLDEASPALPIALPALERIEI 216


>gi|392843741|gb|AFM84474.1| sacsin, partial [Acipenser sinensis]
          Length = 386

 Score = 54.7 bits (130), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 54/105 (51%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADD----NIY----PENVEPTLTF-----ILQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD     IY    P   +    F      LQ   + V NN Q F+
Sbjct: 67   YPSEKEMLKELLQNADDAKATEIYFVFDPREHQADRIFDDKWIPLQGPALCVYNN-QPFT 125

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             +++R + ++G  TK+G+  G  G+ GIGF SV+ +TD P   SN
Sbjct: 126  EDDVRGIQNLGRGTKEGN-PGKTGQYGIGFNSVYHITDCPSFISN 169


>gi|373500743|ref|ZP_09591116.1| hypothetical protein HMPREF9140_01234 [Prevotella micans F0438]
 gi|371951701|gb|EHO69544.1| hypothetical protein HMPREF9140_01234 [Prevotella micans F0438]
          Length = 1860

 Score = 54.7 bits (130), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 11/125 (8%)

Query: 1186 KQHAR------LGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESG 1238
            KQH        +   L  + Q LY+       EL+QNADD   PEN V+  L   L ++ 
Sbjct: 464  KQHTSPNTNTSVANLLEEVGQNLYTSKKRMFFELLQNADDAA-PENGVKVKLQ--LSDNY 520

Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
             V+ ++   F+  +  ++     STK  +     G KGIGFKSVF  +++  I S G++ 
Sbjct: 521  FVLTHDGFAFNKHDFESITSAAKSTKSANKKK-TGYKGIGFKSVFTNSESVLIKSAGYNF 579

Query: 1299 KFDTS 1303
             FD S
Sbjct: 580  SFDKS 584



 Score = 43.1 bits (100), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 1424 AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQ-PVFAFLPLRTYGLKF--ILQGDFVL 1480
            AG+       TE +++  ++   EG+  P+   +  ++A+LP+  +  KF   L  DF+ 
Sbjct: 781  AGIPDRIASATETSISFAIRLDEEGHVMPIGKDELSLYAYLPMNEHRFKFPFYLNADFIP 840

Query: 1481 PSSREEVDGNSPWNQWLL 1498
             S RE +  ++PWN +L 
Sbjct: 841  KSDREGIQSDNPWNYFLF 858


>gi|348508402|ref|XP_003441743.1| PREDICTED: sacsin-like [Oreochromis niloticus]
          Length = 4656

 Score = 54.7 bits (130), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 53/106 (50%), Gaps = 15/106 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     PT          +Q   + V NN Q F+
Sbjct: 2618 YPSEKEMLKELLQNADDAKATEIYFVFDPRTHPTDRIFDDKWIPMQGPALCVYNN-QPFT 2676

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
             ++IR + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN 
Sbjct: 2677 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISNN 2721



 Score = 45.4 bits (106), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 57/231 (24%), Positives = 89/231 (38%), Gaps = 46/231 (19%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPE-------NVEPTLTFILQESGIVVLN-------NEQ 1246
            E Y   +    EL+QNADD    E            +   L + G++V +       N  
Sbjct: 1547 EEYPSVADIFKELLQNADDASATECSFLIDMRKNTDIRENLLDPGMIVCHGPALWSFNNA 1606

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
             FS  +   +  +G S K+   A  +G+ G+GF SV+ VTD P I S  F + FD +   
Sbjct: 1607 VFSDTDFLNITRLGGSVKR-CEADKVGKFGLGFNSVYHVTDIPIIMSREFMIMFDPNINH 1665

Query: 1307 IGFVLPTLVPP---FNIDMFCRLLSKDPVQLES-----------------KCWNTCIRLP 1346
            I   +     P    N +   R L K P Q +                  K   T  RLP
Sbjct: 1666 ISKHIRDRSNPGIKINWNKQQRRLRKFPNQFKPFINVFNCQLPLAQDSPYKYNGTLFRLP 1725

Query: 1347 FRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
            FRT+    ++           + ++VD FS      +LF   +  ++ + +
Sbjct: 1726 FRTEQEASVSEISSLYYNTTDIYSLVDEFSICGHRFILFTQHVGSMVLKYL 1776


>gi|168698281|ref|ZP_02730558.1| hypothetical protein GobsU_02097 [Gemmata obscuriglobus UQM 2246]
          Length = 1863

 Score = 54.7 bits (130), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 59/107 (55%), Gaps = 6/107 (5%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPT-LTFILQESGIVVLNNEQ 1246
            + ++  A   +S E + +   FL+EL+QNA+D    + ++ T +T    E  + + N+ Q
Sbjct: 42   YVQIANAAEHISAEYHGR---FLIELLQNANDQAVRQGLKDTAVTIHWTERLLAIGNSGQ 98

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             F    + A+  +  S K+      +G KGIGFK+VF+V D+ EI+S
Sbjct: 99   PFDEAKVDAITSIFKSDKQADKC--LGNKGIGFKAVFQVADSAEIYS 143


>gi|432894177|ref|XP_004075943.1| PREDICTED: sacsin-like [Oryzias latipes]
          Length = 4579

 Score = 54.7 bits (130), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     PT          +Q   + V NN Q F+
Sbjct: 2541 YPSEKEMLKELLQNADDAKATEIFFVFDSRTHPTDRIFDDKWVPMQGPALCVYNN-QPFT 2599

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             +++R + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN
Sbjct: 2600 EDDVRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFVSN 2643



 Score = 48.1 bits (113), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 97/235 (41%), Gaps = 54/235 (22%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPE-----NVEPTLTFI--LQESGIVVLNNEQGFSAEN- 1252
            E Y   +    EL+QNADD    E     ++   L     L + G++V +    +S  N 
Sbjct: 1470 EEYPSVADIFKELLQNADDASATECSFLIDMRKNLDIRENLLDPGMIVCHGPSLWSFNNA 1529

Query: 1253 IRALCDVGNSTKKGSS-----AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            +    D  N T+ G S     A  +G+ G+GF SV+ +TD P I S  F + FD +   I
Sbjct: 1530 VFTDTDFLNITRLGGSIKRCEADKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNINHI 1589

Query: 1308 G----------------------FVLPTLVPPFNIDMF-CRL-LSKD-PVQLESKCWNTC 1342
                                      P    PF I++F C+L L++D P +       T 
Sbjct: 1590 SKHIRDRSNPGIKINWSKQQKRLRKFPNQFKPF-INVFNCQLPLAQDSPYKFN----GTL 1644

Query: 1343 IRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
             RLPFRT+    ++           + ++VD FS      +LF   + C++ + +
Sbjct: 1645 FRLPFRTEQEAAVSEISSLYYNTTDIYSLVDEFSICGHRFILFTQHVGCMILKYL 1699



 Score = 42.7 bits (99), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 15/119 (12%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPEN---VEPTLTFIL---QESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    VE   +  +   Q + + V NN   F+
Sbjct: 109  YPEGGQILKELIQNAEDAGATEVRFMYDETEYGVESLWSPDMAQHQGAAMYVYNNAV-FT 167

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
             E+   + ++  S K+      +GR GIGF SV+ +TD P I S       D  +   G
Sbjct: 168  GEDWNGIQEIARSRKRDDPLK-VGRFGIGFNSVYHITDVPSIFSGNQIAMLDPHQTLFG 225


>gi|345885641|ref|ZP_08837002.1| hypothetical protein HMPREF0666_03178 [Prevotella sp. C561]
 gi|345045034|gb|EGW48983.1| hypothetical protein HMPREF0666_03178 [Prevotella sp. C561]
          Length = 352

 Score = 54.7 bits (130), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 11/125 (8%)

Query: 1186 KQHAR------LGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESG 1238
            KQH        +   L  + Q LY+       EL+QNADD   PEN V+  L   L ++ 
Sbjct: 110  KQHTSPNTNTSVANLLEEVGQNLYTSKKRMFFELLQNADDAA-PENGVKVKLQ--LSDNY 166

Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
             V+ ++   F+  +  ++     STK  +     G KGIGFKSVF  +++  I S G++ 
Sbjct: 167  FVLTHDGFAFNKHDFESITSAAKSTKSANKKK-TGYKGIGFKSVFTNSESVLIKSAGYNF 225

Query: 1299 KFDTS 1303
             FD S
Sbjct: 226  SFDKS 230


>gi|390340927|ref|XP_798327.3| PREDICTED: sacsin-like, partial [Strongylocentrotus purpuratus]
          Length = 4131

 Score = 54.3 bits (129), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 69/257 (26%), Positives = 112/257 (43%), Gaps = 44/257 (17%)

Query: 1161 IVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN 1220
            I+ES      GLGP+    ++ +   Q   L   +  + +  Y  D   L ELVQNADD+
Sbjct: 2640 IIESYSVPVPGLGPHTLEAQNLLQFGQREPLTERIKNILKG-YPCDDTILKELVQNADDS 2698

Query: 1221 IYPE-------NVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
               E          P           LQ   + V NN + F+ ++I  +  +G  +K+  
Sbjct: 2699 CATEVHLVLDVRTHPKDRLFRDEMSHLQGPALCVYNN-KSFTDDDIEGIQRLGVGSKRNL 2757

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGF-------HVKF--DTSEGQIGFVL-PT--LV 1315
            S+  +GR G+GF +V+ +TD P  ++NG        +VK+    ++   GF+  PT  L 
Sbjct: 2758 SSK-VGRFGVGFNAVYHLTDCPSFYTNGDTLGMFDPNVKYVPKATKAYPGFMAKPTGKLK 2816

Query: 1316 PPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGI-----------AMNNIVDMF 1364
              F     C L    P   E +  +T  RLP RTK    +           ++  ++  F
Sbjct: 2817 DDFPNVFNCYL----PEFFEGQ-GSTMFRLPLRTKSMAKVSHISSEAHDVKSVKGLLRKF 2871

Query: 1365 SDLHPSLLLFLHRLQCI 1381
             +  P LLLFL+ ++ +
Sbjct: 2872 KEGLPDLLLFLNHVKKV 2888



 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 58/120 (48%), Gaps = 15/120 (12%)

Query: 1212 ELVQNADD----NIY-----PENVEPTLTFILQ-----ESGIVVLNNEQGFSAENIRALC 1257
            E++QNA+D     +Y      EN E   +   +     +   + + N+  F+ ++ + + 
Sbjct: 1682 EMIQNAEDAGATKVYFMLDLRENSEARSSLFDKGMVDCQGAALWVYNDASFTPDDFQNIT 1741

Query: 1258 DVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPP 1317
             +G +TK+  ++  IGR GIGF SV+ +TD P   S  +   FD     +G +L +   P
Sbjct: 1742 KLGGATKELDTSK-IGRFGIGFNSVYHLTDVPSFLSREYLQVFDPHATHLGSMLKSKDSP 1800


>gi|392843739|gb|AFM84473.1| sacsin, partial [Sphyrna lewini]
          Length = 735

 Score = 54.3 bits (129), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 49/87 (56%), Gaps = 7/87 (8%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            +F+ +  Q+  D I+ E   P     LQ   I V NN Q F+ ++IR + ++G  TK+G+
Sbjct: 439  YFIFDPRQHPGDRIFDEKWAP-----LQGPAICVYNN-QPFTEDDIRGIQNLGRGTKEGN 492

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
              G  G+ GIGF SV+ +TD P   S+
Sbjct: 493  -PGKTGQYGIGFNSVYHITDCPAFISD 518


>gi|297194170|ref|ZP_06911568.1| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
            25486]
 gi|297152141|gb|EDY65904.2| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC
            25486]
          Length = 1584

 Score = 54.3 bits (129), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 94/366 (25%), Positives = 153/366 (41%), Gaps = 59/366 (16%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            L ELVQNA D I  ++    +  +L ++ +   N     + E    +  +  S K+G   
Sbjct: 50   LFELVQNAADEIA-DHPGGKIQVVLTDANLYCANEGNPVTPEGAETILRMSMSKKRG--- 105

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--SEGQIGFVLPTLV----PPFNIDMF 1323
            G IGR G+G KSV  VTDAPE  S      FD   S  QI  V P +     P F+  + 
Sbjct: 106  GQIGRFGVGVKSVLVVTDAPEFFSRTGSFGFDRAWSYEQIKAV-PGVTDRYGPTFDAPVL 164

Query: 1324 --CRLL------SKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFS----DLHPS 1370
               R L      + DP+  E   W  T +RLP   K ++G    ++    S    +  P 
Sbjct: 165  RTARPLDDAAERAADPILDELLGWATTVVRLPL-LKGADGRLGKDMHGGGSSDREEFPPG 223

Query: 1371 LLLFLHRLQCIMF-----RNMLNDSLVVIRKKIVGD--GIIKVSCGEDKMT--WFV---- 1417
              LF   +  ++      R M   +L     + VGD   + +V  G+   +  W V    
Sbjct: 224  FQLFSPHVAEVVLEDRRPRPMARRTLTT---RQVGDVRTVWEVRTGKAASSEEWRVFTVT 280

Query: 1418 ----ASQKLRAGVIRPDVKTTEIALALTLQESN--EGNYG-PLLYQQPVF-AFLPLR-TY 1468
                 + +  AG +  D  T +++ A+ + E +   G YG P +  +  F +F P +   
Sbjct: 281  HEPGQTARADAGELH-DRVTLDVSWAVPVLERDPKSGLYGSPRIRGRGRFWSFFPTKYEM 339

Query: 1469 GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASV 1528
             L  +L G +     R+ +  +SP+NQ ++    AL V + R           P++  + 
Sbjct: 340  SLSGMLNGAWKTNEDRQNLLDSSPFNQEMIKVSAALVVDSLRHLA--------PSEDPAA 391

Query: 1529 YMSFVP 1534
            Y+  +P
Sbjct: 392  YLRLLP 397


>gi|392843857|gb|AFM84532.1| sacsin, partial [Hypostomus plecostomus]
          Length = 736

 Score = 54.3 bits (129), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     PT          +Q   + V NN Q F+
Sbjct: 417  YPSEKEMLKELLQNADDAKATEIYFVFDPRTHPTDRIFDDKWVPMQGPALCVYNN-QPFT 475

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             ++IR + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN
Sbjct: 476  EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519


>gi|392843853|gb|AFM84530.1| sacsin, partial [Pelophylax nigromaculatus]
          Length = 736

 Score = 54.3 bits (129), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRHHPSDRIFDEKWTP-----LQGPALCVFNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+  G  G+ GIGF SV+ +TD P  + SN     FD
Sbjct: 472  -QPFTDDDIRGIQNLGRGTKEGN-PGKTGQYGIGFNSVYHITDCPSFVSSNDIICIFD 527


>gi|392843839|gb|AFM84523.1| sacsin, partial [Pangasianodon hypophthalmus]
          Length = 736

 Score = 53.9 bits (128), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     PT          +Q   + V NN Q F+
Sbjct: 417  YPSEKEMLKELLQNADDAKATEIYFVFDPRTHPTDRIFDDKWVPMQGPSLCVYNN-QPFT 475

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             ++IR + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN
Sbjct: 476  EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519


>gi|47178510|emb|CAG14162.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 122

 Score = 53.9 bits (128), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/33 (69%), Positives = 27/33 (81%)

Query: 1476 GDFVLPSSREEVDGNSPWNQWLLSEFPALFVSA 1508
            GDF +PSSRE+VD +SPWNQWL SE P LF+ A
Sbjct: 75   GDFDIPSSREDVDRDSPWNQWLRSEIPQLFLQA 107


>gi|392843865|gb|AFM84536.1| sacsin, partial [Scincella reevesii]
          Length = 736

 Score = 53.9 bits (128), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 55/118 (46%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEVCFVFDPRQHPSDRIFDEKWSP-----LQGPALCVFNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+  G  G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGN-PGKTGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|348684143|gb|EGZ23958.1| hypothetical protein PHYSODRAFT_479380 [Phytophthora sojae]
          Length = 4858

 Score = 53.9 bits (128), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 93/213 (43%), Gaps = 39/213 (18%)

Query: 1203 YSQDSHFLLELVQNADD------NI---YPENVEPTLTF----ILQESGIVVLNNEQGFS 1249
            Y + +    E+VQNADD      N+   Y ++    L +      Q   ++V NN   F+
Sbjct: 29   YPEGTSIFKEMVQNADDAGATEVNLCLDYRQHASTGLAYEKLGSFQGPSLLVHNNAT-FT 87

Query: 1250 AENIRALCDVGNSTKKGSSAGY-IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
              + +++  +G+S KK +S G+  GR G+GF SV+ VTD P   S    V FD     + 
Sbjct: 88   DADFQSIQRIGDSLKKDNSKGWKTGRFGVGFNSVYHVTDLPTFVSGSQLVFFDPQACHLP 147

Query: 1309 FVLPTLVPPFNIDMFCR--LLSKDPVQLES-KCW---------NTCIRLPFRT------- 1349
             V P+  P   ID      L++  P Q+   +C+          T  RL  RT       
Sbjct: 148  NVNPS-NPGKMIDYIAHPDLVNNFPDQIAPFRCFGCNFSEPFSGTIFRLALRTEDQAQRS 206

Query: 1350 ----KFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
                +      M  ++  F+D  P++LLFL  +
Sbjct: 207  KLSNRAQTPAKMAEMLKEFADSLPTVLLFLRNV 239


>gi|423480469|ref|ZP_17457159.1| hypothetical protein IEQ_00247 [Bacillus cereus BAG6X1-2]
 gi|401147405|gb|EJQ54907.1| hypothetical protein IEQ_00247 [Bacillus cereus BAG6X1-2]
          Length = 702

 Score = 53.9 bits (128), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 71/139 (51%), Gaps = 15/139 (10%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESG-IVVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
             +E++QNADD       + T  ++ + +G + + NN + F  +++R++C  G S KK   
Sbjct: 46   FIEMLQNADD------AQSTAFYVCEVNGHVFIANNGKPFDDKDVRSICRSGASEKKRGE 99

Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS--EGQIGFV----LPTLVPPFNIDM 1322
               IG +G+GFKS   ++    IHSN     F  S    ++G +    +PT+  PF ++ 
Sbjct: 100  T--IGYRGVGFKSTTHLSKNIYIHSNECTFTFSKSLVAERLGEINENKVPTIRIPFLLES 157

Query: 1323 FCRLLSKDPVQLESKCWNT 1341
                ++K   QL+ K + T
Sbjct: 158  LDADIAKTLNQLKQKGFTT 176


>gi|408375861|ref|ZP_11173500.1| hypothetical protein A11A3_17045, partial [Alcanivorax hongdengensis
            A-11-3]
 gi|407764271|gb|EKF72769.1| hypothetical protein A11A3_17045, partial [Alcanivorax hongdengensis
            A-11-3]
          Length = 498

 Score = 53.9 bits (128), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 86/322 (26%), Positives = 127/322 (39%), Gaps = 60/322 (18%)

Query: 1211 LELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALC------------- 1257
            LEL+QNADD    E     + F + ++G+ V N+ Q     ++   C             
Sbjct: 32   LELIQNADDAKAEE-----VIFDITDAGLYVRNSGQFTYCGDLERFCSFTTDGNYKCDYH 86

Query: 1258 ---DVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL 1314
               DVG+  K   S   IGR GIGF S ++VTD PEI S G  +      G+  + +   
Sbjct: 87   RITDVGSGGKLSRSEN-IGRFGIGFVSTYQVTDHPEIRSAGIKLALYPEHGK--WSIEEF 143

Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLF 1374
              P     F    S DP   ES+      RL           ++ + D F ++    LLF
Sbjct: 144  EEPAGTSFFLPWAS-DP---ESEG-----RLALGISHISSDHIDQLTDDFQEVLRKSLLF 194

Query: 1375 LHRLQCIMFRNMLNDSLVV---IRKKIVGDGIIKVSCGEDKMTWFV-------ASQKLRA 1424
            L  ++    R   N  L++   + +    D II          W +       A+Q L A
Sbjct: 195  LRHIRKAEVRR--NGKLLMACDLDRSEETDLIISFRPSGTVEQWRILRANAEEAAQPLYA 252

Query: 1425 GVIRPDV--KTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL-RTYGLKFILQGDFVLP 1481
               R +   ++TEI++ L +    E     LLY     AFLP  +  GL   +  DF   
Sbjct: 253  THPRLESLHRSTEISIGLRVDP--EPITDGLLY-----AFLPTEQATGLPLHINADFFPE 305

Query: 1482 SSREEV-----DGNSPWNQWLL 1498
            S R+ V          WN+ L+
Sbjct: 306  SDRKAVIFAGHQHEQAWNEMLI 327


>gi|291234401|ref|XP_002737137.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 4311

 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 58/117 (49%), Gaps = 19/117 (16%)

Query: 1201 ELYSQDSHFLLELVQNADDN---------IYPENVEPTLTFILQESGIVVLN-------N 1244
            E+Y +      EL+QNADD           + +N +  L  +  + G+   +       N
Sbjct: 1377 EMYKEGVDIFKELIQNADDAGASEIRFLIDHRQNTDVQLKLL--DPGMKHCHGPALWSYN 1434

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            +  FS+E+IR +CD+  ++KK      IGR G+GF SV+ VTD P + S    + FD
Sbjct: 1435 DAMFSSEDIRNICDIAAASKK-DKLDKIGRFGLGFTSVYHVTDVPSVVSGSHIMIFD 1490



 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/219 (27%), Positives = 94/219 (42%), Gaps = 38/219 (17%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +S    EL+QNADD        IY      T   +      +Q+S  + + N Q FS
Sbjct: 2405 YPGESEIFKELLQNADDAKATELHIIYDPRSHSTDKIVSESWKPIQQSPAICVYNNQIFS 2464

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF-------HVKF-- 1300
             ++I  +  VG   K  + A   G+ GIGF SV+ +TD P   S+         H+K+  
Sbjct: 2465 DDDIAGVQQVGIGAK-SADAHTTGQYGIGFNSVYHLTDCPSFLSDDHQLGVFDPHLKYVP 2523

Query: 1301 DTSEGQIGFVLPTLVPPFNI--DMFCRLLSKDPVQLESKCWNTCIRLPFR-TKFSEGIA- 1356
            D +    G     L  P NI  + F  +LS    +  S    T  RLP R   +S  I+ 
Sbjct: 2524 DATHDYPGI----LYKPVNILKENFGDMLSGFLGEYFSLKGGTMFRLPLRQNAYSSKISS 2579

Query: 1357 -------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLN 1388
                   + ++++ F +     LLFL+ ++ +    +LN
Sbjct: 2580 KTYSDKDIRHMLNRFENEACDCLLFLNNVEVVTLSEILN 2618


>gi|260809550|ref|XP_002599568.1| hypothetical protein BRAFLDRAFT_77670 [Branchiostoma floridae]
 gi|229284848|gb|EEN55580.1| hypothetical protein BRAFLDRAFT_77670 [Branchiostoma floridae]
          Length = 4931

 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/189 (29%), Positives = 84/189 (44%), Gaps = 37/189 (19%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTF--------ILQESGIVVLN-------NEQG 1247
            Y + S    EL+QNADD    E V+  + +         L  SG+           N + 
Sbjct: 1350 YKEGSGIFYELIQNADDAGATE-VKFAVDWRQNSGSRKSLFSSGMAAWQGPALWAYNNKV 1408

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            F+ E+I+ + ++  +TKK  ++  IGR G+GF SV+ +TD P   S  + V FD  +  +
Sbjct: 1409 FTDEDIQNINEISGATKKRDTSK-IGRFGLGFNSVYHITDMPSFISRHYFVCFDPHKTHL 1467

Query: 1308 GFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTC-IRLPFRTKFS--EGIAMNNIVDMF 1364
            G       P  N          DP +     WN   IR  +R +F+   GI   NI D  
Sbjct: 1468 G-------PVVN--------DNDPAEGVQVEWNRDEIRNIYRDQFAPFNGIFDCNIFD-- 1510

Query: 1365 SDLHPSLLL 1373
            SD +P+ L 
Sbjct: 1511 SDTYPATLF 1519



 Score = 43.5 bits (101), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 29/109 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVL 1242
            E YS       EL+QNADD                   I+  + +P     LQ   ++V 
Sbjct: 2323 ESYSTGMDIFKELLQNADDAGATEVKFIYDTRGHSDERIFGIDWKP-----LQGPALLVY 2377

Query: 1243 NNEQGFSAENIRALCDVG--NSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            NN + FS  +I A+  VG  N  +KGS+   IG+ G+GF +V+ +TD P
Sbjct: 2378 NN-KTFSQSDIEAIQKVGKGNKQEKGST---IGQFGVGFNAVYHLTDCP 2422


>gi|392843837|gb|AFM84522.1| sacsin, partial [Carassius auratus]
          Length = 736

 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     PT          +Q   + V NN Q F+
Sbjct: 417  YPSEKEMLKELLQNADDAKATEIYFVFDPRSHPTERIFDDKWVPMQGPSLCVYNN-QPFT 475

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             ++IR + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN
Sbjct: 476  EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519


>gi|329938003|ref|ZP_08287485.1| hypothetical protein SGM_2977 [Streptomyces griseoaurantiacus M045]
 gi|329302960|gb|EGG46849.1| hypothetical protein SGM_2977 [Streptomyces griseoaurantiacus M045]
          Length = 1589

 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 65/156 (41%), Gaps = 20/156 (12%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            L ELVQNA D I  E     +  +L ES +   N     + E    +  +  S K+G   
Sbjct: 50   LFELVQNAADEIANEP-GGRVHVVLTESRLYCANEGSPVTPEGAETILRMSMSKKRG--- 105

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--SEGQIG------------FVLPTLV 1315
            G IGR G+G KSV  VTDAPE  S      FD   S  QI             F  P L 
Sbjct: 106  GQIGRFGVGVKSVLVVTDAPEFFSRTGSFGFDRAWSYQQIKAVPGVTERYGPEFDAPVLR 165

Query: 1316 PPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTK 1350
                +DM     + DPV  +   W  T +RLP   K
Sbjct: 166  MARPLDMRAE-RAADPVLDKLLTWATTVVRLPLLKK 200


>gi|326402571|ref|YP_004282652.1| hypothetical protein ACMV_04230 [Acidiphilium multivorum AIU301]
 gi|325049432|dbj|BAJ79770.1| hypothetical protein ACMV_04230 [Acidiphilium multivorum AIU301]
          Length = 1764

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 91/215 (42%), Gaps = 29/215 (13%)

Query: 1208 HFLLELVQNADDN---IYPENVEPTLTFILQESG-IVVLNNEQGFSAENIRALCDVGNST 1263
              L EL+QNA D      P  +   LT    + G ++V N  + F+  N+ AL ++G S 
Sbjct: 64   RVLFELIQNAHDAHPPGEPGEIAVRLTIEADDRGELLVANRGRPFTTSNLNALRNIGTSD 123

Query: 1264 KKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG---------QIGFVLPTL 1314
            K+      IG KG+GF+SV  +TD   ++S       +  EG         +I   L  L
Sbjct: 124  KQIGEG--IGNKGLGFRSVEALTDDVRVYSAAVAGPAERFEGYCFRFARLDEIAAGLAGL 181

Query: 1315 -------------VPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIV 1361
                         +P + I +     S+  ++L    + T I LP R+  +  +A    V
Sbjct: 182  GASSDMAESVAGNIPRYLIPVAVPDQSEGIMELARGGYATVIALPLRSAHTVALARAQ-V 240

Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
            +   D    +LLFL R+  +        +++V R+
Sbjct: 241  EAVIDAGAPVLLFLDRIGSLEVSVRTVSTIIVERR 275


>gi|219666700|ref|YP_002457135.1| superfamily I DNA/RNA helicase [Desulfitobacterium hafniense DCB-2]
 gi|219536960|gb|ACL18699.1| superfamily I DNA/RNA helicase [Desulfitobacterium hafniense DCB-2]
          Length = 1584

 Score = 53.1 bits (126), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 83/337 (24%), Positives = 129/337 (38%), Gaps = 77/337 (22%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN-EQGFSAEN------- 1252
            E YS  +HF+ EL+QNADD         +  F L ++ ++  +N  + FS  N       
Sbjct: 37   EKYSDQAHFIYELLQNADDA-----GATSARFKLCDNELIFAHNGTRRFSISNPETEGPD 91

Query: 1253 --------IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
                    I A+  + NS K  +S   IG+ G+GFK+VF+ T  P I+   F  K +   
Sbjct: 92   TCNGKLGDINAITSIANSNKTAAS---IGKFGVGFKAVFQYTQTPSIYDPEFRFKIER-- 146

Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGIAMNNIVDM 1363
                F++P       + ++     + P +       T    PF         A  NI D 
Sbjct: 147  ----FIVP-------VKIYNDFTGRRPEE-------TLFVFPFDHADRRHEEAYENISDK 188

Query: 1364 FSDL-HPSLLLFLHRLQCIMFRNMLNDSLVV--IRKKIVGDGIIKVSCG----------- 1409
               L +P  +LFL  L+ I F       L    + ++I  D  +                
Sbjct: 189  LRSLVYP--VLFLTNLREISFECSKKFGLYTKSVDEEIAFDNTVARLITLTQNDGDENNP 246

Query: 1410 -EDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-T 1467
             E+K+  F           R D      ++   L  +NEG      Y    F F P +  
Sbjct: 247  IENKLWLFT----------RMDEHDFTYSVGFLL--NNEGKLRGSNYS--AFCFFPTKEE 292

Query: 1468 YGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPAL 1504
             GL FI+   F+L  SRE +      N+ +++   AL
Sbjct: 293  TGLNFIVHAPFLLTDSREGIKAGEEHNKDMINLLAAL 329


>gi|302794218|ref|XP_002978873.1| hypothetical protein SELMODRAFT_152901 [Selaginella moellendorffii]
 gi|300153191|gb|EFJ19830.1| hypothetical protein SELMODRAFT_152901 [Selaginella moellendorffii]
          Length = 4493

 Score = 53.1 bits (126), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 71/162 (43%), Gaps = 22/162 (13%)

Query: 1203 YSQDSHFLLELVQNADD--------------NIYPENVEPTLTFILQESGIVVLN-NEQG 1247
            Y + +  L E++QNADD              +     V P L    Q  G  +L  N   
Sbjct: 21   YPEGTTILKEMIQNADDAGASQVCFCVDRRRHAVDSLVSPPLA---QWQGPALLAYNNAA 77

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            F+ E+ +++  +G+S K+G  A   GR GIGF SV+ ++D P   S G HV F   + Q 
Sbjct: 78   FTEEDFQSISRIGDSKKRGQ-AWKTGRFGIGFNSVYHLSDLPSFVS-GSHVVFFDPQCQF 135

Query: 1308 GFVLPTLVPPFNIDMFCRL-LSKDPVQLESKCWNTC-IRLPF 1347
               + +L P   ID      L   P Q    C   C +R PF
Sbjct: 136  LPSISSLNPGKRIDFVTSTALENFPDQFSPYCAFGCDMRRPF 177



 Score = 52.4 bits (124), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 89/220 (40%), Gaps = 46/220 (20%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI-----------------VVLN 1243
            E+Y+     L EL+QNADD    E     ++F+L +S                   +   
Sbjct: 1363 EMYADGPGTLFELLQNADDAGATE-----VSFLLDKSSFGTNSVLSRQMADWQGPALYCY 1417

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            N+  F  E+  A+  +G ++K    +  IGR G+GF SV+  TD P   S    V  D  
Sbjct: 1418 NDSVFKPEDFYAISRIGQNSKLDKPSA-IGRFGLGFNSVYHFTDVPSFVSGSTIVLLDPH 1476

Query: 1304 EGQIGFVLPTLVPPFNIDMFCR-LLSKDPVQLES----KC------WNTCIRLPFRTKFS 1352
               +  + P   P   I+   + LL++ P Q ES     C        T  R P RT  +
Sbjct: 1477 ADNLPGISPA-QPGLKINFVGKELLNQFPDQFESFLLFGCDLQKPFLGTLFRFPLRTTVT 1535

Query: 1353 EGIAM--------NNIVDMFSDLHPS---LLLFLHRLQCI 1381
               +          ++V +FS L  S   +LLFL  +  I
Sbjct: 1536 AARSEIKQDPYQPEDVVTLFSALKKSAGEVLLFLRHVTSI 1575


>gi|326674209|ref|XP_003200093.1| PREDICTED: sacsin-like [Danio rerio]
          Length = 4580

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 15/106 (14%)

Query: 1203 YSQDSHFLLELVQNADD----NIY----PENVEPTLTF-----ILQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD     IY    P        F      LQ   + V NN Q F+
Sbjct: 2543 YPSEKEMLKELLQNADDAKATEIYFVFDPRTHSTDRIFDDKWTPLQGPSLCVYNN-QPFT 2601

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
             ++IR + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN 
Sbjct: 2602 EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISNN 2646



 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 66/238 (27%), Positives = 99/238 (41%), Gaps = 60/238 (25%)

Query: 1201 ELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESGIVVLN-------N 1244
            E Y   +    EL+QNADD    E         N+E  +   L + G+++ +       N
Sbjct: 1472 EEYPSVADIFKELLQNADDANATECSFLIDMRKNLE--IRENLLDPGMIICHGPALWSFN 1529

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
               FS  +   +  +G S K+   A  +G+ G+GF SV+ VTD P I S  F + FD + 
Sbjct: 1530 NSVFSDTDFLNITRLGGSMKR-CEADKVGKFGLGFNSVYHVTDIPIIMSREFMIMFDPNI 1588

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
              IG                         P    PF I++F C+L    P+ LE+  K  
Sbjct: 1589 NHIGKHIRDKSNPGIKINWSKQQKRLRKFPNQFRPF-INVFNCQL----PLSLEAPYKYD 1643

Query: 1340 NTCIRLPFRTKFS------EGIAMN-----NIVDMFSDLHPSLLLFLHRLQCIMFRNM 1386
             T  RLPFRT+         GI  N     ++VD FS     L+LF   +  ++ + +
Sbjct: 1644 GTLFRLPFRTEQEASASEISGIYYNTTDIYSLVDEFSICGHRLILFTQHVGTMVLKYL 1701



 Score = 42.4 bits (98), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 13/103 (12%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPEN---VEPTLTF-ILQESGIVV-LNNEQGFSA 1250
            Y +    L EL+QNA+D        +Y E    VE   +  + Q  G  + + N+  F+ 
Sbjct: 111  YPEGGQILKELIQNAEDAGATEVKFLYDETEYGVESLWSHDMAQYQGTALYVYNDAVFTT 170

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
            E+   + ++  S K+      +GR GIGF SV+ +TD P I S
Sbjct: 171  EDWNGIQEIARSRKREDPLK-VGRFGIGFNSVYHITDVPSIFS 212


>gi|301061109|ref|ZP_07201903.1| conserved hypothetical protein [delta proteobacterium NaphS2]
 gi|300444778|gb|EFK08749.1| conserved hypothetical protein [delta proteobacterium NaphS2]
          Length = 509

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 82/166 (49%), Gaps = 18/166 (10%)

Query: 1366 DLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGI--IKVSCGEDKMTWFVASQKL- 1422
            ++ P  +LFL +L+ I      ++   VI+       I  +    G D++  F+  +++ 
Sbjct: 13   EIEPETMLFLSKLKEIKIVTGHSEGFTVIKDDTSEPNIQLLTNKNGHDEIFEFLLFKRVF 72

Query: 1423 -RAGVI----RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQG 1476
             +   I    R +V+  E+++A  + +    + G L      FA+LP+R   G  FI+  
Sbjct: 73   NKPEEINHEKRNNVEEREVSIAFPVGD-KPNSAGKL------FAYLPVRYDTGFPFIINA 125

Query: 1477 DFVLPSSREEVDGNSPWN-QWLLSEFPALFVSAERSFCDLPCFREN 1521
            DF+LPSSRE++  ++PWN +WL+     L  S+      LP  +EN
Sbjct: 126  DFILPSSREDIQ-DTPWNREWLMPCVGELVYSSLPILKSLPILKEN 170


>gi|115665280|ref|XP_001200196.1| PREDICTED: sacsin-like [Strongylocentrotus purpuratus]
          Length = 3846

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 63/137 (45%), Gaps = 26/137 (18%)

Query: 1175 NISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVE 1227
            + S  E    +K   R+ R L     + Y  D   L EL+QNADD        +Y     
Sbjct: 1867 DFSGEEFGQHEKLTTRIKRIL-----DSYPSDETILKELLQNADDAGATEIHFVYDPRSH 1921

Query: 1228 PTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY---IGRKGIG 1278
                 +      LQ   + V NN Q F+ E+IR + ++G    KGS A +   IGR G+G
Sbjct: 1922 SENKLLADSMKALQGPALCVFNN-QSFTDEDIRGIQNLG----KGSKADHMEKIGRYGVG 1976

Query: 1279 FKSVFRVTDAPEIHSNG 1295
            F +V+++TD P   S G
Sbjct: 1977 FNTVYQLTDCPSFISKG 1993



 Score = 46.6 bits (109), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            F+ ++ + +  +G  TK+   A  IG+ G GF SV+R+TD P + S  +   FD     +
Sbjct: 4    FTDQDFKNILRLGGRTKE-KDAEKIGKFGTGFNSVYRITDVPSLVSRDYMQFFDPHTTHL 62

Query: 1308 GFVLPTLVPP 1317
            G VLP    P
Sbjct: 63   GNVLPNKSQP 72



 Score = 46.2 bits (108), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            N+  F+ ++   +  +G  TK+   A  IG+ G GF SV+R+TD P + S  +   FD  
Sbjct: 955  NDAVFTDQDFENILCLGGRTKE-KDAEKIGKFGTGFNSVYRITDVPSLVSRNYMQCFDPH 1013

Query: 1304 EGQIGFVLPTLVPP 1317
               +G VLP    P
Sbjct: 1014 TTHLGNVLPNKSQP 1027


>gi|260907582|ref|ZP_05915904.1| hypothetical protein BlinB_19757 [Brevibacterium linens BL2]
          Length = 1609

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 89/223 (39%), Gaps = 27/223 (12%)

Query: 1212 ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGY 1271
            ELVQNA D +        +  +  E  + V N+ + F +  I+ L     STK+      
Sbjct: 62   ELVQNAADAVREGGSR--IHVVATEKALYVANDGKAFDSNGIKTLLHSNMSTKRDEQ--- 116

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL--------PTLVPPFNIDMF 1323
            IGR G+GFKSV  +   P+I S+     FD         L        P L  PF ++  
Sbjct: 117  IGRFGLGFKSVLEICPQPQIFSSTVSFGFDAERSVQALSLINRGLTRYPVLRLPFLLNES 176

Query: 1324 CRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIM 1382
                S   ++ E + W +T +RLP     S       + D   +  P  LLF   +  + 
Sbjct: 177  SESSSDSILR-ELRSWASTVVRLPLAQPLSW------LKDSVREFRPEFLLFSPTVSSLT 229

Query: 1383 FRNMLNDSLVVI----RKKI--VGDGIIKVSCGEDKMTWFVAS 1419
                 +     +    R K+  VG   +++S  +  + W V++
Sbjct: 230  LEVRGDGESRTVWTSERTKLSTVGVEAVRLSSADSNIDWVVST 272


>gi|379010681|ref|YP_005268493.1| DNA/RNA helicase superfamily I [Acetobacterium woodii DSM 1030]
 gi|375301470|gb|AFA47604.1| DNA/RNA helicase superfamily I [Acetobacterium woodii DSM 1030]
          Length = 1587

 Score = 52.4 bits (124), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 79/334 (23%), Positives = 130/334 (38%), Gaps = 64/334 (19%)

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNN-EQGFSAE 1251
            R +     E YS  +HF+ EL+QNADD         ++ F L +  ++  +N  + FS  
Sbjct: 29   RGVKTSVVEKYSDQAHFIYELLQNADDA-----GATSVRFELHKDELIFAHNGTRRFSIS 83

Query: 1252 N---------------IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
            N               I A+  + NS K  +S   IG+ G+GFK+VF+ T  P ++   F
Sbjct: 84   NPDTEEADSDNGKLGDINAITSIANSNKTMAS---IGKFGVGFKAVFQYTQTPSVYDPDF 140

Query: 1297 HVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-RTKFSEGI 1355
              + +       F++P  +              D      K   T    PF     S+  
Sbjct: 141  QFRIER------FIVPVKI--------------DNDYASRKSGETLFVFPFDHNDRSQEE 180

Query: 1356 AMNNIVDMFSDL-HPSLLLFLHRLQCIMFRNMLNDSLVV--IRKKIVGDGI------IKV 1406
            A  +I    + L +P  +LFL  L  I + ++         + +KI+ +        +  
Sbjct: 181  AYEDISHKLNTLVYP--ILFLTNLNEICYESLEQSGKYTRSVCEKIIFENTEACLFDMSQ 238

Query: 1407 SCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR 1466
             C  D+    V   KL     R D +    ++   L +  EG           F F P +
Sbjct: 239  ICENDERK--VIGDKLWL-FSRTDEQGLRYSVGYELNK--EGKLTA--SHHAAFCFFPTK 291

Query: 1467 -TYGLKFILQGDFVLPSSREEVDGNSPWNQWLLS 1499
               GL F++Q  F+L  SRE +      NQ +++
Sbjct: 292  ENTGLNFMIQAPFLLTDSREGIKAGEKHNQDMIA 325


>gi|340375792|ref|XP_003386418.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 4104

 Score = 52.4 bits (124), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 89/369 (24%), Positives = 147/369 (39%), Gaps = 61/369 (16%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  +   L  + +E Y      + E++QNADD        +Y      T   IL+    
Sbjct: 1372 QHEPITTRLRNILKE-YKDGLTIIKEMIQNADDAGATEVNILYDNRTHSTQNLILKGMAD 1430

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ GIGF SV+ +TD P   S 
Sbjct: 1431 SHGPALIVHNNGTFTKEDFENITKLAGATKANQPLK-IGKFGIGFCSVYHITDVPSFVSG 1489

Query: 1295 GFHVKFDTSEGQIGFVLPTLVPPFNIDMF--CRLLS--KDPVQLE--------SKCWNTC 1342
             +   FD +   +  V+     P     +   ++ S  +D V  E        S    T 
Sbjct: 1490 EWLYIFDPTLQFLKGVVHDQSKPGKKIKYQQSKIFSNCQDLVPYEGLFGFSSLSDYEGTM 1549

Query: 1343 IRLPFRT---KFSEGIAMNNIV-DMFSDL---HPSLLLFLHRLQCIMFRNML-NDSLVVI 1394
             R PFRT   + S  +   ++V D+  DL      LLLFL  ++ I F ++  N+ ++ +
Sbjct: 1550 FRFPFRTSASQISSTVYNEHLVEDIKRDLIECGSKLLLFLQHVKRITFSSLRGNEKIIEV 1609

Query: 1395 RKKI---VGDGIIKVSC-----GEDKMTWFVASQKLR---AGVIRPDVKTTEIALALTLQ 1443
               +     D I +V       G     W +ASQ+ +      I P V +    +A  L 
Sbjct: 1610 SIDVSSSSNDDIHEVVINSPHNGSGTEHWLIASQEEQWRHQNNIEPAVAS----VACQLD 1665

Query: 1444 ESNEGNYGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSREEV----------DGNS 1491
              N   Y     +   F FLPL   + GL   +  +F + S+R  +          D   
Sbjct: 1666 RINSNCYKCKEIEGNAFCFLPLALPSTGLPVHVSANFAVMSNRSGIWTEVLSGTVTDSKE 1725

Query: 1492 PWNQWLLSE 1500
             WN+ L+ +
Sbjct: 1726 RWNKNLMEK 1734



 Score = 42.0 bits (97), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 17/123 (13%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADD------------NIYPENVEPTLTF-- 1232
            Q   L R +  + QE Y  D+ FL EL+QNADD              + +   P+  +  
Sbjct: 2323 QKEELTRRIRNILQE-YPLDATFLKELLQNADDAKATKIRVILDKRQHKKEKVPSEEWGK 2381

Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
             LQ   ++V N+++ FS  ++  +  +G  +K+      IG+ GIGF  V+ +TD P   
Sbjct: 2382 ELQGPALLVWNDKE-FSDADLEGIQKLGLGSKRDDDES-IGQFGIGFNVVYHLTDCPSFI 2439

Query: 1293 SNG 1295
            + G
Sbjct: 2440 TRG 2442


>gi|392843833|gb|AFM84520.1| sacsin, partial [Monopterus albus]
          Length = 736

 Score = 52.4 bits (124), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           +Q   + V NN Q F+
Sbjct: 417  YPSEKEMLKELLQNADDAKATEIYFVFDPRTHPIDRIFDDKWIPMQGPALCVFNN-QPFT 475

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             ++IR + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN
Sbjct: 476  EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519


>gi|47190447|emb|CAF92480.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 145

 Score = 52.4 bits (124), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 2/115 (1%)

Query: 156 GESVSSWKVSRSVLLKLQVDSWSSLGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITS 215
           GE V+  KV   V  ++Q+ S  + G       +L  L      +N FI C   V  I +
Sbjct: 8   GELVTKEKVISRVCRQMQIPSLEAGGIYPGMTSALKDLQYKIREVNMFIECAESVSSICT 67

Query: 216 LYDLEEAICKSEGIEKFEELELGPFLRQPLISHYFLVKSDV--DVVFKIATNDVI 268
           LY++  ++   +G + +EEL LGP  + P I   F + S+   D + +I T D++
Sbjct: 68  LYEMGCSLAGLKGKKHYEELNLGPLCKLPRIHRIFKIDSNTKDDDIQQIETVDIL 122


>gi|340369659|ref|XP_003383365.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 4979

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 84/346 (24%), Positives = 138/346 (39%), Gaps = 54/346 (15%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + ++ Y      + E++QNADD        +Y +        I++    
Sbjct: 1453 QHESLTTRLRNILKD-YKDGLTIIKEMIQNADDAGATELNILYDKRTHSKEKLIIKGMAD 1511

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N + FS ++   +  +  +TK       IG+ GIGF SV+ +TD P   S 
Sbjct: 1512 SHGPALIVHNNRAFSKDDFVNITKLAGATKADKPLK-IGKFGIGFCSVYHITDVPSFVSG 1570

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKDPVQLES-----------KCWNTC 1342
             +   FD +   + G V     P   ++   + L++   QL             K   T 
Sbjct: 1571 EWLYIFDPTLKYLKGAVCNDSSPGKKMEYRSKFLARSD-QLAPYHGLFGFDSSLKYNGTI 1629

Query: 1343 IRLPFRTKFSEGIA-------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNML--NDSLVV 1393
             RLPFRT  S+  +       +  I D   D    LLLFL  ++ I F  +    +SL  
Sbjct: 1630 FRLPFRTNPSQISSTLYSDELVQQIKDDLQDNASKLLLFLQNVKSITFSTVQENGESLCE 1689

Query: 1394 IRKK---IVGDGIIK-------VSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQ 1443
            I  K   I   GI K       + C E    W ++S K     +  ++    +A  L   
Sbjct: 1690 ISIKCNAIESYGIEKRVITVTSLKCKEKIEHWLMSSVKEN---LEGELAVASVACELL-- 1744

Query: 1444 ESNEGNYGPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEV 1487
             + +  Y     +  VF FLPL   + GL   +  +F + S+R  +
Sbjct: 1745 -NTDNVYQCQSIKGNVFCFLPLAVSSTGLPVHVSANFAVMSNRSGI 1789


>gi|340377463|ref|XP_003387249.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 4220

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 84/355 (23%), Positives = 140/355 (39%), Gaps = 70/355 (19%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG- 1238
            QH  L   L  + +E Y      + E++QNADD        +Y +        I++  G 
Sbjct: 1457 QHEPLTTRLGNILRE-YKDGLTIIKEMIQNADDAGATEVNIMYDKRTHSAKKLIIKGIGD 1515

Query: 1239 ----IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ G+GF SV+ +TD P   S 
Sbjct: 1516 SHGPALIVHNNSTFTEEDFENITKLAAATKANQPLK-IGKFGVGFCSVYHITDVPSFISG 1574

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLS--------KDPVQLES--KCWNTCI 1343
             +   FD +   + G V     P   I+   + L+        +D     S  K   T  
Sbjct: 1575 KWLYIFDPTLKYLKGVVHNQSSPGKKIEYRSKFLAHSDQLAPYQDLFGFRSSIKYNGTIF 1634

Query: 1344 RLPFRTKFSEGIA-------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVV--- 1393
            RLPFRT  S+  +       +  I D   D    LLLFL  ++ I F  + ++   +   
Sbjct: 1635 RLPFRTNPSQISSTLYDDELVQQIKDDLEDNASKLLLFLQNVKSITFSTVQDNGDRLHEI 1694

Query: 1394 -----------IRKKIVGDGIIKVSCGEDKMTWFVA--SQKLRAGVIRPDVKTTEIALAL 1440
                       I KK++   +  ++  E+   W V+  ++ L       D+    +A  L
Sbjct: 1695 SIKCNAIESYGIEKKVI--TVTSLNYEEEIEHWLVSTVTENLEG---YNDLAVASVACGL 1749

Query: 1441 TLQ------ESNEGNYGPLLYQQPVFAFLPL--RTYGLKFILQGDFVLPSSREEV 1487
                     +S EGN         VF FLPL   + GL   +  +F + S+R  +
Sbjct: 1750 RKSDNIYHCQSIEGN---------VFCFLPLAVSSTGLPVHVSANFAVMSNRSGI 1795


>gi|340368021|ref|XP_003382551.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 4293

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 80/342 (23%), Positives = 132/342 (38%), Gaps = 67/342 (19%)

Query: 1212 ELVQNADDN-------IYPENVEPTLTFILQ-----ESGIVVLNNEQGFSAENIRALCDV 1259
            E++QNADD        +Y +    T   + +         ++++N   F  E+   +  +
Sbjct: 1361 EMIQNADDAEATEVNILYDDRTHSTEKLLFKGMAGSHGPALIVHNNSVFKKEDFENITKL 1420

Query: 1260 GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL-------- 1311
              +TKK      IG+ G+GF SV+ +TD P   S  +   FD +   +  V+        
Sbjct: 1421 AGATKKDQPLK-IGKFGVGFCSVYHITDVPSFVSGEWFYIFDPTLKHLKGVVHNESRPGK 1479

Query: 1312 ------------PTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFR---TKFSEGIA 1356
                          + P  NI  F    + D          T  RLPFR   ++ S  I 
Sbjct: 1480 KVKYQSKFMAQSQQMAPYENIFEFSSSANYD---------GTIFRLPFRENASQISSTIY 1530

Query: 1357 MNNIVDMFS-DLHPS---LLLFLHRLQCIMFRNMLNDSLVV---IRKKIVGDGIIKVSC- 1408
               +V     DL  +   LL+FL  ++ I FR++  D LV+   I      +G+ K +  
Sbjct: 1531 NEYLVQKLKEDLKENGSKLLIFLQNVKRITFRSLDRDELVIDISITCSNEANGVKKCTTI 1590

Query: 1409 --GEDKMT-WFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL 1465
               E ++  W VA QK    + + D     IA         +  Y     +  +F FLPL
Sbjct: 1591 CFNESEIEYWLVAHQK--GNLKQDDDDKPGIASVACQLIKKDLCYKSQEIKGNLFCFLPL 1648

Query: 1466 --RTYGLKFILQGDFVLPSSREEV-------DGNSPWNQWLL 1498
               + GL   +  +F + ++R  +       D    WN  L+
Sbjct: 1649 SSSSTGLPVHVNANFAVLNNRTGIFIEETPSDSRELWNMQLM 1690


>gi|405966416|gb|EKC31703.1| Sacsin [Crassostrea gigas]
          Length = 2134

 Score = 52.0 bits (123), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 75/276 (27%), Positives = 109/276 (39%), Gaps = 59/276 (21%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN- 1220
            V S+ R E   G  I ++E     +Q +   R  + L  E YS D     E++QNADD  
Sbjct: 485  VPSLIRREMENG-GIEDLEYEQWGQQESLTTRIKNIL--EDYS-DGPLFKEMLQNADDAG 540

Query: 1221 ------IYPENVEPTLTFILQESGI-------VVLNNEQGFSAENIRALCDVGNSTKKGS 1267
                  +Y E         L + G+       +   N+  F  ++ R +  +G  TK G+
Sbjct: 541  ATCMKILYDERRNEDARTGLIDKGMEECQGPALWFYNDAKFEEKDFRNIIKLGGGTK-GT 599

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL-----PTL-VP----- 1316
                IG+ G+GF SV+ VTD P I S  + V FD     +G  +     P L +P     
Sbjct: 600  EHTKIGKFGLGFCSVYSVTDVPSIVSGKYIVFFDPRTIHLGKAIKDKSKPGLRIPISKIV 659

Query: 1317 --------PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS-----EGIAMN----- 1358
                    P++    C    +D  +       T  RLP RT         GI+       
Sbjct: 660  KKFRKQFKPYDKIFHCDFSEEDEYE------GTLFRLPLRTANQASADPNGISTKQYNRE 713

Query: 1359 ---NIVDMFSDLHPSLLLFLHRLQCI--MFRNMLND 1389
               +IV  F D   +LLLF   ++ I   F N  ND
Sbjct: 714  EVLSIVKKFMDSAGNLLLFTQNVKKIELHFLNQTND 749



 Score = 46.6 bits (109), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 51/116 (43%), Gaps = 15/116 (12%)

Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQG 1247
            E Y  D   L ELVQNADD        +Y +               LQ   +VV NN   
Sbjct: 1502 EDYPCDVSILNELVQNADDAQATEIHFVYDKRRHSEKELFDDSMKCLQGPALVVYNN-SC 1560

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            FS+E+I  +  +G  +K  S     G+ GIGF +V+ +TD P   + G     D S
Sbjct: 1561 FSSEDINGISKLGEGSK-SSDPHQTGQFGIGFNAVYHITDVPSFLTKGTKTPNDGS 1615


>gi|159486690|ref|XP_001701371.1| hypothetical protein CHLREDRAFT_194490 [Chlamydomonas reinhardtii]
 gi|158271766|gb|EDO97579.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 785

 Score = 52.0 bits (123), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 33/62 (53%)

Query: 191 HKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGPFLRQPLISHYF 250
            +L   + R+ A +  +   R + +L++LE  +C SEG+E F EL LGP    P++   F
Sbjct: 278 QRLRQLDRRVGATMEAYCAARCVVTLWELERHVCASEGVEAFGELRLGPLTAHPVVRRTF 337

Query: 251 LV 252
             
Sbjct: 338 FA 339


>gi|260822114|ref|XP_002606448.1| hypothetical protein BRAFLDRAFT_118535 [Branchiostoma floridae]
 gi|229291789|gb|EEN62458.1| hypothetical protein BRAFLDRAFT_118535 [Branchiostoma floridae]
          Length = 4895

 Score = 52.0 bits (123), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 92/388 (23%), Positives = 150/388 (38%), Gaps = 74/388 (19%)

Query: 1003 AVPVASRF--FLDCLGY---LPSEFRSFAADVLLSGLQSTIKDAPSAILLECHQTELRLM 1057
            AV ++S F   LD   Y   LP EFR F      +G++ ++ ++     L   Q+  +L 
Sbjct: 1605 AVTLSSPFTPILDLAPYRFNLPEEFRDFKQMFRAAGVKESLDESDLIQTLHEIQSTPKLS 1664

Query: 1058 L-HEVGLSLGILEWIHDYHAFCSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESV 1116
               ++ LSL I++WI  +         D+L+P         +S L     C      E  
Sbjct: 1665 TERDLKLSLSIIDWIVSHPQILEEVRDDILVPIAT----KDSSRLTLAPVC------ECT 1714

Query: 1117 GADVHIEECGAICDTICGEASDDGLGDCTTQTLPEDKE--CEDAALIVESIRRDEFGLGP 1174
              DV   +  A+ D       +D       + +P D     E  +L    +  +EFG   
Sbjct: 1715 YNDVSWSDAQALLD------DEDEEFKPLHKLVPLDTAQLLEVPSLSQRVVNAEEFGFDQ 1768

Query: 1175 NISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFIL 1234
                        QH ++ + L  +  + Y + +    EL+QNADD    E     + F+L
Sbjct: 1769 C----------GQHEKVTQRLKNILND-YPEGAGIFKELIQNADDAKATE-----VKFLL 1812

Query: 1235 Q------------ESGIVVLN-------NEQGFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
                         + G+ V +       N   FS E+ + + ++G  TK   +   +GR 
Sbjct: 1813 DWRTNEDSREDLIDQGMAVCHGPALWAFNNATFSEEDFKNIQNLGGQTKL-EALEKVGRF 1871

Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPP---FNIDMFCRLLSKDPV 1332
            G+GF SV+ VTD P   S    + FD     +   +     P    ++    RL+S+   
Sbjct: 1872 GVGFNSVYHVTDVPSFVSGSRVLFFDPHATHLSKHIKDKSRPGIGLDLQKNTRLVSRFKD 1931

Query: 1333 QLE----------SKCWN-TCIRLPFRT 1349
            Q +          S  +N T  RLP RT
Sbjct: 1932 QFKPFQGVFGYSTSTSYNGTLFRLPLRT 1959


>gi|304320769|ref|YP_003854412.1| ATPase [Parvularcula bermudensis HTCC2503]
 gi|303299671|gb|ADM09270.1| putative ATPase [Parvularcula bermudensis HTCC2503]
          Length = 1418

 Score = 52.0 bits (123), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 64/129 (49%), Gaps = 10/129 (7%)

Query: 1175 NISNMESNMLKK--QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTF 1232
            +I    +NM ++   +  L    H L +++ S   H + EL+QNADD    E      + 
Sbjct: 10   SIREHSANMWEQLDGNPELAAPWHQLFKQVQSP-RHIISELLQNADDAHATE-----ASV 63

Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
             +Q+   +  +N + F+ E+  +LC  G S K+      IG +GIGFKS F + +  E++
Sbjct: 64   DIQDGSFLFSHNGEDFTEEHFASLCKFGYSNKRALHT--IGFRGIGFKSTFSLGERVELY 121

Query: 1293 SNGFHVKFD 1301
            +    V FD
Sbjct: 122  TPTLSVGFD 130


>gi|395774006|ref|ZP_10454521.1| hypothetical protein Saci8_29739 [Streptomyces acidiscabies 84-104]
          Length = 1579

 Score = 52.0 bits (123), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 71/164 (43%), Gaps = 20/164 (12%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            L ELVQNA D I  E    T   +L ++ +   N     + E    +  +  S K+G   
Sbjct: 43   LFELVQNAADEIADEP-GGTTHVVLTDTHLYCANEGTPVTPEGAETILRMSMSRKRG--- 98

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--SEGQIGFVLPTLVPPFNIDMFCRLL 1327
            G IGR G+G KSV  VTDAP+  S      FD   S  QI  V P +   F+ +    +L
Sbjct: 99   GQIGRFGVGVKSVLVVTDAPQFFSRTGCFGFDREWSHDQIRSV-PEVRARFDAEFESPVL 157

Query: 1328 ------------SKDPVQLESKCW-NTCIRLPFRTKFSEGIAMN 1358
                        + DPV  E   W  T +RLP  +  +E +  +
Sbjct: 158  RMARPLDESAERAVDPVLDELLGWATTVVRLPLLSDAAERLGQD 201


>gi|340054935|emb|CCC49243.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 1705

 Score = 52.0 bits (123), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/150 (22%), Positives = 67/150 (44%), Gaps = 13/150 (8%)

Query: 180 LGFQMQEVPSLHKLMVTEARINAFIHCFVGVRQITSLYDLEEAICKSEGIEKFEELELGP 239
           L F ++++PSL  +   ++++   +   V  R + ++Y+L+  +C++  +  + EL LG 
Sbjct: 132 LPFALKQIPSLAGVAAMQSQLTQMVQGAVATRPVVTVYELQLEVCRAMNVASYAELGLGA 191

Query: 240 FLR-QPLISHYFLVKSDVDVVFKIATNDVIVCLSEYTDTHKAKDIKVDEFLDFIAKKRSL 298
            L   P + H F  K     V  + + D +  L    D      I +    D     R+ 
Sbjct: 192 SLACMPFVQHAFGFKPGPAAVAPVTSADFMRFLLFDADAR----IVLTSGGDAGDAVRAF 247

Query: 299 ASK--------EQLGVRIQNLGMHINFIRE 320
           A +         QLG+ IQ+    ++F+R+
Sbjct: 248 AKRYEGGRYAPRQLGIHIQHFSWLLHFVRQ 277


>gi|392843841|gb|AFM84524.1| sacsin, partial [Misgurnus anguillicaudatus]
          Length = 736

 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADD----NIY----PENVEPTLTF-----ILQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD     IY    P        F      +Q   + V NN Q F+
Sbjct: 417  YPSEKEMLKELLQNADDAKATEIYFVFDPRTHATDRIFDDKWMPMQGPSLCVYNN-QPFT 475

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             ++IR + ++G  TK+ ++ G  G+ GIGF SV+ +TD P   SN
Sbjct: 476  EDDIRGIQNLGRGTKE-ANPGKTGQYGIGFNSVYHITDCPSFISN 519


>gi|390348274|ref|XP_003726975.1| PREDICTED: sacsin-like [Strongylocentrotus purpuratus]
          Length = 3428

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 15/106 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  D   L EL+QNADD        +Y          +      LQ   + V NN Q F+
Sbjct: 1974 YQSDDTILKELLQNADDAGATEIHFVYDPRSHDGKKLLGDSMKGLQGPALCVFNN-QSFT 2032

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
             E+IR + ++G  +K     G IGR G+GF +V+++TD     S G
Sbjct: 2033 DEDIRGIQNLGEGSK-ADHMGKIGRYGVGFNAVYQLTDCSSFISKG 2077



 Score = 47.8 bits (112), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            N+  F+ ++ + +  +G  TK+   A  IG+ G GF SV+R+TD P + S  +   FD  
Sbjct: 205  NDAVFTDQDFKNILRLGGRTKE-KDAEKIGKFGTGFNSVYRITDVPSLVSRDYMQFFDPH 263

Query: 1304 EGQIGFVLPTLVPP 1317
               +G VLP    P
Sbjct: 264  TTHLGNVLPNKSQP 277



 Score = 45.4 bits (106), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 1/74 (1%)

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            N+  F+ ++   +  +G  TK+   A  IG+ G GF SV+R+TD P   S  +   FD  
Sbjct: 1040 NDAVFTDQDFENILRLGGRTKE-KDAEKIGKFGTGFNSVYRITDVPSFVSMNYMQFFDPH 1098

Query: 1304 EGQIGFVLPTLVPP 1317
               +G VLP    P
Sbjct: 1099 TTHLGNVLPNKSQP 1112


>gi|392843817|gb|AFM84512.1| sacsin, partial [Alligator sinensis]
          Length = 736

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRHHPVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNNILCIFD 527


>gi|392843867|gb|AFM84537.1| sacsin, partial [Naja atra]
          Length = 736

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEVCFVFDPRQHPVDRIFDEKWSP-----LQGPALCVFNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|168026372|ref|XP_001765706.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683132|gb|EDQ69545.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 4642

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 25/173 (14%)

Query: 1201 ELYSQDSHFLLELVQNADD------NIYPENVEPTLTFILQES------GIVVLNNEQGF 1248
            E+YS     L ELVQNADD      N   +  E   + +L  S        +   N+  F
Sbjct: 1387 EMYSDGPGILCELVQNADDAGAREVNFLLDRSEFGTSSVLSPSMADWQGPALYCQNDSVF 1446

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
            ++ ++ A+  +G  +K    +  IGR G+GF SV+  TD P   S    V FD     + 
Sbjct: 1447 TSRDLYAISRIGQDSKLERPSA-IGRFGLGFNSVYHFTDIPGFISGSNLVMFDPHACNLP 1505

Query: 1309 FVLPTLVPPFNIDMFCR-LLSKDPVQ----------LESKCWNTCIRLPFRTK 1350
             V P+  P   I    R LL + P Q          L+     T  R P RT+
Sbjct: 1506 GVTPSH-PGLKISFVGRGLLEQFPDQLRPYLIFGCDLQRSYPGTLFRFPLRTE 1557


>gi|298715228|emb|CBJ27900.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 191

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 56/112 (50%), Gaps = 13/112 (11%)

Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVE--PTLTFILQESGIVVLNNEQGFSA 1250
            Y + +  L ELVQNADD          ++     E  P+L     +   +++ N+  F+ 
Sbjct: 29   YPEGTSILKELVQNADDAGAREVRLCLDLRQHGTEGLPSLALAQFQGPALLVFNDGVFTD 88

Query: 1251 ENIRALCDVGNSTKKGSS-AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
             + R++  +G+S KK +S  G  GR GIGF SV+ +T+ P   S G    FD
Sbjct: 89   TDFRSIQRIGDSLKKDTSKGGKTGRFGIGFNSVYHLTELPAFVSAGKWCSFD 140


>gi|443695138|gb|ELT96105.1| hypothetical protein CAPTEDRAFT_195081 [Capitella teleta]
          Length = 4612

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 112/470 (23%), Positives = 169/470 (35%), Gaps = 112/470 (23%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGIVVLN-------NEQGF 1248
            Y  DS    ELVQNA+D        +Y      T+   L   G+           N+  F
Sbjct: 34   YPDDSQIFKELVQNAEDAGAREVKLLYDRKTYGTMGRYLHHPGMAKYQGPALYAYNDATF 93

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGF------HVKF-- 1300
            + ++ R +  +  S K+      +G  G+GFKSVF  TD P I S         H +F  
Sbjct: 94   NDDDWRGIRMLSQSVKQNDPLK-VGYFGMGFKSVFHFTDLPFIVSGDHAGVIEPHQEFFP 152

Query: 1301 ------DTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS-- 1352
                    S+  +  +LP    PF+    C   ++D  + +     T  R P RTK S  
Sbjct: 153  SSSHGWTLSKDNLLELLPHQFEPFSGIFTC---TRDTFK-DGYFPGTLFRFPLRTKPSAL 208

Query: 1353 EGIAMNN-----IVDMFSDLHPSLLLFLHRLQCI--------------MFRNMLN----D 1389
              +  NN     ++D F +    LLLFL  ++ +              +FR  L+    +
Sbjct: 209  SDVIYNNDKVQALLDAFQEDAHLLLLFLRHVESVEVHTKDAHTLTPKLLFRVELSEDCRE 268

Query: 1390 SLVVIRKKIVG-------DGIIKVS----------CGEDKM----TWFV--------ASQ 1420
             +   R+  V        DG + +S           G D++    TW V         S 
Sbjct: 269  DVRQKRQDFVARATAPRRDGPVMMSYPVTVVCTCVTGRDQIQRSYTWLVTNYFVGGQTSA 328

Query: 1421 KLRAGVIRPDVKTTE-IALALTLQESNEGNYGPLLYQQPVFAFLPLRT-----YGLKFIL 1474
              R   + PD+     +  A+ L    +G       +  +F  LPL +      GL   +
Sbjct: 329  TFRKLQMDPDLHYPPWVGAAMPLSSPEDGQAS---VRGHIFCALPLPSDQSMDTGLPVEI 385

Query: 1475 QGDFVLPSSREEV-------------DGNSPWNQWLLSE-FPALFVSAERSFCDLPCFRE 1520
             G F L  +R+ +             D    WNQ +L E  P  + S  R+   L     
Sbjct: 386  NGYFSLEQNRKHIKWPTSYRTREDMMDKRLLWNQCMLKEALPKAYASLLRAAIQLHRQGS 445

Query: 1521 NPA-KAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQW 1569
             PA    S+Y +F  L      + S LP M  S L     +  E N   W
Sbjct: 446  LPALTLDSIYAAFPDLTKVDRKWDSILPTM-YSDLFKDPVVFTEANGGNW 494



 Score = 46.6 bits (109), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 75/316 (23%), Positives = 122/316 (38%), Gaps = 83/316 (26%)

Query: 1203 YSQDSHFLLELVQNADDN------------------IYPENVEPTLTFILQESGIVVLNN 1244
            Y  D   L E++QNADD+                  ++ ++ +P     LQ  G+ V NN
Sbjct: 2446 YPCDKEILKEMLQNADDSGATEIQFIKDPRQHGRDRVFEDSWKP-----LQGPGLCVFNN 2500

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +I  +  +G  +K GS     G+ G+GF  V+ +TDAP          F T  
Sbjct: 2501 -RPFSESDIEGIQRLGEGSK-GSDPNKTGQYGVGFNCVYHLTDAP---------SFLTRS 2549

Query: 1305 GQIGFVLPTLVP---------------PFNIDMFCRLLSKD--PVQLESKCW---NTCIR 1344
              IG  L    P                ++     R +  D  P  LE K      T  R
Sbjct: 2550 ADIGETLCVFDPHGAFVPGASSEEPGRRYDEVSELRGIFTDVFPCYLEDKFALDEATMFR 2609

Query: 1345 LPFR-------TKFSEGIAMNNIVD-MFSDLHPSLLLFLHRLQCIMFRNMLNDSLVV--- 1393
             P R       ++ SE +    ++D +FS              C++F N + DSL +   
Sbjct: 2610 FPLRNEEMAAKSELSEQVIGLEVIDVLFSKFKAE------AFDCLLFLNNV-DSLTLCEV 2662

Query: 1394 --IRKKIVGDG--IIKVSCGEDKMTWFVASQKLRAGVIRP------DVKTTEIALALTLQ 1443
                K++ G     ++V  GE++   F    K  AG ++       D+   E+  +L + 
Sbjct: 2663 DSFTKRLTGTYKVTVQVERGEEEKKRFSEQVKKIAGQVKSGRLSVWDIPVHEVTYSLNIA 2722

Query: 1444 ESNEGNYGPLLYQQPV 1459
            + N+G +   L  Q +
Sbjct: 2723 D-NQGKWEKWLVSQQI 2737



 Score = 44.7 bits (104), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 62/151 (41%), Gaps = 17/151 (11%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGI 1239
            Q   L   L+ L QE Y+       ELVQNADD        +Y E         L + G+
Sbjct: 1380 QADSLTHRLNVLLQE-YTDGFAVPKELVQNADDAGATEVKFLYDERQNEDCMTCLIDEGM 1438

Query: 1240 -------VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIH 1292
                   +   N   FS E+   +  +  +TK+  +   IGR G+GF +V+ +TD P   
Sbjct: 1439 RECQGPALWAYNNAAFSDEDFENITKLSGATKENQT-DKIGRFGLGFNAVYNITDVPSFV 1497

Query: 1293 SNGFHVKFDTSEGQIGFVLPTLVPP-FNIDM 1322
            S    V FD     +G  +     P   IDM
Sbjct: 1498 SRHNIVIFDPHTTHLGKSIKNKAKPGIKIDM 1528


>gi|393244468|gb|EJD51980.1| hypothetical protein AURDEDRAFT_181547 [Auricularia delicata
            TFB-10046 SS5]
          Length = 2646

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 74/167 (44%), Gaps = 19/167 (11%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ-----ESGIVVLNNEQGFSA 1250
            Y   +  L EL+QN+DD        I  E   PT + ++      +   ++  N+  F  
Sbjct: 21   YPASNAILRELIQNSDDAGSRRQIFILDERQHPTRSLLVPGLACCQGPALLAVNDSIFRP 80

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---TSEGQI 1307
            E+  A+  + +S+K   +   IG+ G+GF+S + VTD P+I S+   V FD   T     
Sbjct: 81   EDWEAIKTIHHSSKADDNT-QIGKYGLGFRSSYHVTDFPQILSSKTLVVFDPHMTVLNDG 139

Query: 1308 GFVLPTLVPP--FNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
            G    T +P    ++D      S  P   E     T IRLP RT  S
Sbjct: 140  GRRHSTFLPAESTHVDQLAAFASVVPADCEDYN-GTAIRLPLRTNSS 185


>gi|392843823|gb|AFM84515.1| sacsin, partial [Crocodylus siamensis]
          Length = 736

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRHHPVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|392843869|gb|AFM84538.1| sacsin, partial [Pelodiscus sinensis]
          Length = 736

 Score = 51.6 bits (122), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDSRQHPVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|359146086|ref|ZP_09179729.1| hypothetical protein StrS4_09674 [Streptomyces sp. S4]
          Length = 1601

 Score = 51.2 bits (121), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 67/157 (42%), Gaps = 26/157 (16%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            LLELVQNA D I        L  +L E  +   N     + E    +  +  S K+G   
Sbjct: 56   LLELVQNAADEIA-NAPGGKLHVVLTEDHLYCANEGTPVTPEGAETILRMSVSRKRG--- 111

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSN----GFHVKFDTSEGQIGFVL------------PT 1313
            G IGR G+G KSV  V+D+P+  S+    GF    D S  +I   L            P 
Sbjct: 112  GQIGRFGVGVKSVLSVSDSPQFFSDHGAFGFGFDRDWSAAEIAEALGLDGAADLSMETPV 171

Query: 1314 L--VPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPF 1347
            L    P ++D   +  S DPV  +   W  T +RLP 
Sbjct: 172  LRMARPLDVD---QERSDDPVLDKLLTWATTVVRLPL 205


>gi|392843819|gb|AFM84513.1| sacsin, partial [Ichthyophis bannanicus]
          Length = 736

 Score = 51.2 bits (121), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 52/110 (47%), Gaps = 25/110 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEILKELLQNADDAKATEICFVFGPRHHPLDRIFDERWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P   S+
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFMSS 519


>gi|392843849|gb|AFM84528.1| sacsin, partial [Carettochelys insculpta]
          Length = 736

 Score = 51.2 bits (121), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|392843855|gb|AFM84531.1| sacsin, partial [Trachemys scripta]
          Length = 736

 Score = 51.2 bits (121), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDSRQHPVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|456384943|gb|EMF50521.1| hypothetical protein SBD_8085 [Streptomyces bottropensis ATCC 25435]
          Length = 1614

 Score = 51.2 bits (121), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 73/185 (39%), Gaps = 27/185 (14%)

Query: 1194 ALHCLSQELYSQDSHF---LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            A H  S+   +Q  +    L ELVQNA D I  E     +  +L  + +   N     + 
Sbjct: 58   AEHANSERRITQGGYGDRQLFELVQNAADEIA-EEPGGKVHVVLTGTHLYCANEGNPVTP 116

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD--------- 1301
            E    +  +  S K+G   G IGR G+G KSV  VTD PE  S+     FD         
Sbjct: 117  EGAETILRMSMSKKRG---GQIGRFGVGVKSVLVVTDTPEFFSSTGSFGFDRAWSYEQIK 173

Query: 1302 -----TSEGQIGFVLPTL--VPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSE 1353
                 T+     F  P L    P N+       + DPV  E   W  T +RLP   K   
Sbjct: 174  SVPGVTARYGAEFDAPVLRMARPLNVQAE---RAADPVLHELLSWATTVVRLPLLPKADR 230

Query: 1354 GIAMN 1358
             + ++
Sbjct: 231  RLGLD 235


>gi|294629556|ref|ZP_06708116.1| conserved hypothetical protein [Streptomyces sp. e14]
 gi|292832889|gb|EFF91238.1| conserved hypothetical protein [Streptomyces sp. e14]
          Length = 1587

 Score = 51.2 bits (121), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 68/164 (41%), Gaps = 20/164 (12%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            L ELVQNA D I  E     +  +L E+ +   N     + E    +  +  S K+G   
Sbjct: 50   LFELVQNAADEIAGEPGG-RIHVVLTETHLYCANEGNPVTPEGAETILRMSMSKKRG--- 105

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--SEGQIG------------FVLPTLV 1315
            G IGR G+G KSV  VT+APE  S      FD   S  QI             F  P L 
Sbjct: 106  GQIGRFGVGVKSVLVVTNAPEFFSRTGSFGFDRAWSYEQIKAVPGVKERYGPEFDAPVLR 165

Query: 1316 PPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSEGIAMN 1358
                +DM     S DPV  +   W  T +RLP   K  + +  +
Sbjct: 166  MARPLDMQAERAS-DPVLDKLLTWATTVVRLPLLPKADDRLGKD 208


>gi|392843821|gb|AFM84514.1| sacsin, partial [Calotes versicolor]
          Length = 736

 Score = 51.2 bits (121), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEVCFVFDPRQHPSDRIFDEKWAP-----LQGPALCVFNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|392843859|gb|AFM84533.1| sacsin, partial [Coturnix coturnix]
          Length = 736

 Score = 50.8 bits (120), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     PT          LQ   + V NN Q F+
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRQHPTDRIFDEKWAPLQGPALCVYNN-QPFT 475

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             ++IR + ++G  TK G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 476  EDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|392843827|gb|AFM84517.1| sacsin, partial [Dibamus bourreti]
          Length = 736

 Score = 50.8 bits (120), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEVCFVFDSRHHSVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|392843835|gb|AFM84521.1| sacsin, partial [Hemidactylus bowringii]
          Length = 736

 Score = 50.8 bits (120), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEVCFVFDPRQHPVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|424862194|ref|ZP_18286140.1| hypothetical protein OPAG_00219 [Rhodococcus opacus PD630]
 gi|356660666|gb|EHI41030.1| hypothetical protein OPAG_00219 [Rhodococcus opacus PD630]
          Length = 1565

 Score = 50.8 bits (120), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 45/92 (48%), Gaps = 5/92 (5%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            L ELVQNA D +     +  +  +L    +   N  Q  +   I  L     S K+    
Sbjct: 81   LHELVQNAADALV--GNQGRIALVLANRKLYCANEGQPLTKSGIETLMASHLSRKRDEE- 137

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
              IGR G+GFKSV  V+DAPEI S G  ++FD
Sbjct: 138  --IGRFGLGFKSVLGVSDAPEIISRGVSIRFD 167


>gi|449436349|ref|XP_004135955.1| PREDICTED: uncharacterized protein LOC101212447 [Cucumis sativus]
          Length = 4709

 Score = 50.8 bits (120), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 94/218 (43%), Gaps = 41/218 (18%)

Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL---QESGIVVLNNEQGFS 1249
            Y + +  L ELVQNADD          +      E  L+  L   Q   ++  NN   F+
Sbjct: 33   YPEGTTVLKELVQNADDAGATKVCLCLDRRVHGRESLLSASLAPFQGPALLAYNNAV-FT 91

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-------- 1301
             E+  ++  +G S K G  A   GR G+GF SV+ +T+ P   S  + V FD        
Sbjct: 92   EEDFVSISRIGGSNKHGQ-AWKTGRFGVGFNSVYHLTELPSFVSGKYVVMFDPQGIYLPK 150

Query: 1302 ---TSEGQ-IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-------RTK 1350
               ++ G+ I F+  + +  +  D F    + D   +ES    T  R P        R+K
Sbjct: 151  VSASNPGKRIDFIRSSAISQYR-DQFLPYCAFD-CTMESSFAGTLFRFPLRNTDQAARSK 208

Query: 1351 FS-EGIAMNNIVDMFSDLHPS---LLLFLHRLQCI-MF 1383
             S +     +I  MF++L+      LLFL  + CI MF
Sbjct: 209  ISRQAYTEEDISSMFAELYEEGVLTLLFLKSVLCIEMF 246


>gi|182438463|ref|YP_001826182.1| hypothetical protein SGR_4670 [Streptomyces griseus subsp. griseus
            NBRC 13350]
 gi|178466979|dbj|BAG21499.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350]
          Length = 1589

 Score = 50.8 bits (120), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 88/358 (24%), Positives = 145/358 (40%), Gaps = 49/358 (13%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            L ELVQNA D I   +    +  +L ++ +   N  +  +AE    +  +  S K+G   
Sbjct: 57   LFELVQNAADEIA-HSPGGKVHVVLTDTHLYCANEGKEVTAEGAETILRMSVSKKRG--- 112

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---------TSEGQIG---FVLPTLVPP 1317
            G IGR G+G KSV  VTDAP+  S      FD          + G+     F  P L   
Sbjct: 113  GQIGRFGVGVKSVLAVTDAPQFFSTSGAFGFDRAWSFEEIRKAGGRAAGGDFEAPVLRMA 172

Query: 1318 FNIDMFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSE--GIAMNNIV--DMFSDLHPSLL 1372
              ++      S DPV  E   W  T +RLP     ++  G  ++ +    +  +      
Sbjct: 173  RPLEAEAE-RSADPVLDELLGWATTVVRLPLLPGAADRLGHDIHTVAGRQIQREFPARFQ 231

Query: 1373 LFLHRLQCIMFRNMLNDSLVVIRKKI------VGDGIIKVSCG--EDKMTWFV------A 1418
            LF   +  ++  N    ++  IR++I      +   I +V  G  E + TW V       
Sbjct: 232  LFSSHVGTVILEN--RRAMPQIRREITIEREDIAHTIHEVRTGKPESRATWKVFTCAHRP 289

Query: 1419 SQKLRAGVIR-PDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQG 1476
            +++ RA      D  T +I+ A+     +     P    +  ++F P +    L  IL G
Sbjct: 290  TERARASAGEMHDRGTIDISWAVPEYTGDTVLTVPKGRGE-FWSFFPTKYPMTLAGILNG 348

Query: 1477 DFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVP 1534
             +     R+ +  +SP+N+ +L     L V +      LP  R  PA+    Y+  +P
Sbjct: 349  AWKTNEDRQNLLDSSPFNEEILQVAARLVVES------LP--RLAPAEDPGAYLPLLP 398


>gi|363729189|ref|XP_417138.3| PREDICTED: sacsin [Gallus gallus]
          Length = 4612

 Score = 50.4 bits (119), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 2572 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 2626

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2627 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2682



 Score = 44.7 bits (104), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 54/124 (43%), Gaps = 19/124 (15%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1500 EEYPSISDIFKELLQNADDANASECNFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1558

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+ S    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1559 SE-FSDSDFLNITRLGESLKR-SEVDKVGKFGLGFNSVYHITDVPIILSREFMIMFDPNV 1616

Query: 1305 GQIG 1308
              IG
Sbjct: 1617 NHIG 1620



 Score = 43.9 bits (102), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 47/103 (45%), Gaps = 13/103 (12%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTF----ILQESGIVVLN-NEQGFSA 1250
            Y +    L EL+QNA+D        +Y E    T T     + Q  G      N+  F+ 
Sbjct: 140  YPEGGQILKELIQNAEDAGATEVRFLYDETQYGTETLWSKDMAQYQGPAFYAYNDAIFTP 199

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
            E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 200  EDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPSIFS 241


>gi|326914333|ref|XP_003203480.1| PREDICTED: sacsin-like [Meleagris gallopavo]
          Length = 4699

 Score = 50.4 bits (119), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 2658 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 2712

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2713 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2768



 Score = 45.1 bits (105), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 50/195 (25%), Positives = 78/195 (40%), Gaps = 29/195 (14%)

Query: 1123 EECGAICDTICGEASDDGLGDCTTQTLPEDKECEDAALIVESIRRDEFGLGPNISN---- 1178
            ++C  +C  +     D   G  T++  P +   +D  L+   +  D   +    S     
Sbjct: 138  QQCPPLCPILVSSVPD---GQRTSRERPVEGNKDDEGLVASPLSDDTKIVDLQKSQNQIF 194

Query: 1179 --MESNMLKKQHARLGRALHCLSQEL------YSQDSHFLLELVQNADDN-------IYP 1223
              ++S  LK    R G+    L   L      Y +    L EL+QNA+D        +Y 
Sbjct: 195  LKLQSEALKGG-GRFGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVRFLYD 253

Query: 1224 ENVEPTLTF----ILQESGIVVLN-NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
            E    T T     + Q  G      N+  F+ E+   + ++  S KK      +GR GIG
Sbjct: 254  ETQYGTETLWSKDMAQYQGPAFYAYNDAVFTPEDWHGIQEIARSRKKDDPLK-VGRFGIG 312

Query: 1279 FKSVFRVTDAPEIHS 1293
            F SV+ +TD P I S
Sbjct: 313  FNSVYHITDVPSIFS 327



 Score = 42.4 bits (98), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 19/124 (15%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1586 EEYPSISDIFKELLQNADDANATECNFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1644

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+ S    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1645 SE-FSDSDFLNITRLGESLKR-SEVDKVGKFGLGFNSVYHITDVPIILSREFMIMFDPNV 1702

Query: 1305 GQIG 1308
              I 
Sbjct: 1703 NHIS 1706


>gi|449489127|ref|XP_004158223.1| PREDICTED: LOW QUALITY PROTEIN: sacsin-like [Cucumis sativus]
          Length = 3586

 Score = 50.4 bits (119), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 94/218 (43%), Gaps = 41/218 (18%)

Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL---QESGIVVLNNEQGFS 1249
            Y + +  L ELVQNADD          +      E  L+  L   Q   ++  NN   F+
Sbjct: 33   YPEGTTVLKELVQNADDAGATKVCLCLDRRVHGRESLLSASLAPFQGPALLAYNNAV-FT 91

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-------- 1301
             E+  ++  +G S K G  A   GR G+GF SV+ +T+ P   S  + V FD        
Sbjct: 92   EEDFVSISRIGGSNKHGQ-AWKTGRFGVGFNSVYHLTELPSFVSGKYVVMFDPQGIYLPK 150

Query: 1302 ---TSEGQ-IGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-------RTK 1350
               ++ G+ I F+  + +  +  D F    + D   +ES    T  R P        R+K
Sbjct: 151  VSASNPGKRIDFIRSSAISQYR-DQFLPYCAFD-CTMESSFAGTLFRFPLRNTDQAARSK 208

Query: 1351 FS-EGIAMNNIVDMFSDLHPS---LLLFLHRLQCI-MF 1383
             S +     +I  MF++L+      LLFL  + CI MF
Sbjct: 209  ISRQAYTEEDISSMFAELYEEGVLTLLFLKSVLCIEMF 246


>gi|392843863|gb|AFM84535.1| sacsin, partial [Podocnemis unifilis]
          Length = 736

 Score = 50.4 bits (119), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDSRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 475

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 476  EDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|327311582|ref|YP_004338479.1| hypothetical protein TUZN_1702 [Thermoproteus uzoniensis 768-20]
 gi|326948061|gb|AEA13167.1| hypothetical protein TUZN_1702 [Thermoproteus uzoniensis 768-20]
          Length = 849

 Score = 50.4 bits (119), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 46/84 (54%), Gaps = 4/84 (4%)

Query: 1428 RPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPL---RTYGLKFILQGDFVLPSSR 1484
            R + +  E+A+A  + E+           Q +++FLPL   RT GLKF++ GDF++ + R
Sbjct: 89   RNNARKREVAIAFQVDEAKSSLVPVEAGLQFLYSFLPLTEVRT-GLKFLIHGDFIVQAGR 147

Query: 1485 EEVDGNSPWNQWLLSEFPALFVSA 1508
              ++  + WN+W++ E   L   A
Sbjct: 148  RTLNPEAKWNEWMVREIANLLNRA 171


>gi|328712264|ref|XP_003244766.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            14-like [Acyrthosiphon pisum]
          Length = 1572

 Score = 50.4 bits (119), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 44/94 (46%), Gaps = 15/94 (15%)

Query: 4    NHRPRFRPVGGVGGEGTGG---GQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTN 60
            N  P+  P G V     G     Q P+   P   NPN    NPN+  PNP         N
Sbjct: 1462 NMHPQMTPGGPVPSPLMGQHMMNQNPNMVGP---NPNMGGPNPNMVGPNP---------N 1509

Query: 61   IPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQP 94
            +  QNPN  +QNPN+   N N++ QNPN+  Q P
Sbjct: 1510 MVGQNPNMVVQNPNMVGPNPNMVGQNPNMVGQNP 1543


>gi|345867558|ref|ZP_08819568.1| hypothetical protein BZARG_2748 [Bizionia argentinensis JUB59]
 gi|344048054|gb|EGV43668.1| hypothetical protein BZARG_2748 [Bizionia argentinensis JUB59]
          Length = 1534

 Score = 50.4 bits (119), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 69/302 (22%), Positives = 126/302 (41%), Gaps = 43/302 (14%)

Query: 1206 DSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
            D   LLE++QN DD    ++ +  + ++ +E  + + N  + FS E I +L    +S K 
Sbjct: 40   DGRQLLEMLQNVDD---TKSKKVRIEWVKKEKKLAISNYGEAFSFEGIESLMRSHSSPK- 95

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD---------------TSEGQIGFV 1310
             +   YIG KG+GF+S+    +  +I++N   + F                T++ Q+   
Sbjct: 96   -TKEDYIGNKGLGFRSLLTWAERIDIYANDCKISFSDEIAHSIFDTKLKLTTNQKQVVKT 154

Query: 1311 LPTLVP---PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDL 1367
               +V    PF +    +LL+          W T I + +  +      + +++D  S+ 
Sbjct: 155  QAKVVNGAIPFPVLAVPQLLTFK----RQDDWQTVIEIQYTEEEEVEEKIKDVLDQISE- 209

Query: 1368 HPSLLLFLHRLQCIMFRNMLNDSLVVIRK--KIVGDGIIKVSCGEDKMTWFVASQKLRAG 1425
               LLLFL+ ++ I    ++ D +V I K  KI  +   ++S        F     +   
Sbjct: 210  --ELLLFLNHIEEI---ELVKDGVVSIFKSTKIEFEHWTEISINNKVWRVFTKDGLIPQK 264

Query: 1426 VIRPDVKT-TEIALALTLQESNEGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSS 1483
             +    +T  +  L L  Q      Y  L      F F P + +  L  ++ G F L +S
Sbjct: 265  AVNGGEETFKKYMLKLAFQNDLSDTYYKL------FNFFPTKISVSLPCVIHGTFELNAS 318

Query: 1484 RE 1485
            R+
Sbjct: 319  RD 320


>gi|407472961|ref|YP_006787361.1| hypothetical protein Curi_c04390 [Clostridium acidurici 9a]
 gi|407049469|gb|AFS77514.1| hypothetical protein Curi_c04390 [Clostridium acidurici 9a]
          Length = 1533

 Score = 50.4 bits (119), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 12/99 (12%)

Query: 1210 LLELVQNADDNIYPENV--------EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGN 1261
            +LEL+QNADD  Y +++        E  L    + + + V N    F  E I+A+    N
Sbjct: 57   ILELLQNADD-AYQKSIDMGEKPECELELLIEYKNNILTVSNTGTFFDKEGIKAIVQGNN 115

Query: 1262 STKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
            S K G    YIG KG GF+S+    D  +I+S  F+++F
Sbjct: 116  SPKSGK---YIGNKGTGFRSILNWADKVKIYSGDFNIEF 151


>gi|260788479|ref|XP_002589277.1| hypothetical protein BRAFLDRAFT_102519 [Branchiostoma floridae]
 gi|229274453|gb|EEN45288.1| hypothetical protein BRAFLDRAFT_102519 [Branchiostoma floridae]
          Length = 4368

 Score = 50.4 bits (119), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 85/215 (39%), Gaps = 49/215 (22%)

Query: 1171 GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDN---------I 1221
             LGP +       L  Q   + R L  +  E Y +      EL+QNADD           
Sbjct: 1319 ALGPGMQ------LFGQKEPITRRLKTIIDE-YKEGPGIFYELIQNADDAGATEVKFAVD 1371

Query: 1222 YPENVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
            + +N +   + +       Q   +   NN + F+ E+I+ + ++  +TKK +    IGR 
Sbjct: 1372 WRQNTQVRKSLLSPGMAACQGPALWAYNN-KVFTDEDIQNITEISGATKK-TDTNKIGRF 1429

Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNI------------DMF 1323
            G+GF SV+ +TD P   S  + V FD     +  ++  L P   I            D F
Sbjct: 1430 GLGFNSVYHMTDVPSFVSRHYFVCFDPHTTHLAPMVNRLEPGMKIEWNREEIQNIYTDQF 1489

Query: 1324 --------CRLLSKDPVQLESKCWNTCIRLPFRTK 1350
                    C + + D     +    T  R PFRTK
Sbjct: 1490 KPYDGIFGCNIFNSD-----TPYPATLFRFPFRTK 1519



 Score = 42.0 bits (97), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 19/104 (18%)

Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQG 1247
            E YS       EL+QNADD        IY                 LQ   ++V NN + 
Sbjct: 2317 ESYSTGIDIFKELLQNADDAGATEVKFIYDTRCHSDKKIFDKEWKYLQGPSLLVYNN-RT 2375

Query: 1248 FSAENIRALCDVG--NSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            FS  +I A+  +G  N  +KGS+   IG+ G+GF +V+ +TD P
Sbjct: 2376 FSQSDIDAIQKIGRGNKQEKGST---IGQFGVGFNAVYHLTDCP 2416


>gi|392843845|gb|AFM84526.1| sacsin, partial [Struthio camelus]
          Length = 736

 Score = 50.4 bits (119), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|397771742|ref|YP_006543591.1| hypothetical protein NJ7G_4300 [Natrinema sp. J7-2]
 gi|397688955|gb|AFO59515.1| hypothetical protein NJ7G_4300 [Natrinema sp. J7-2]
          Length = 1637

 Score = 50.4 bits (119), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 53/92 (57%), Gaps = 5/92 (5%)

Query: 1208 HFLLELVQNADDNIYPENVEP-TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK-- 1264
             F+ EL+QNADD +  ++ +P  + F L+   ++V NN + F A ++ A+  +  +TK  
Sbjct: 48   RFVFELLQNADDEMPEDSDKPRKVRFELRGDCLLVANNGRPFDAADLDAITTLTQTTKGD 107

Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHS-NG 1295
            +G  A  IG KG GF S+  +T  P I S NG
Sbjct: 108  EGDEA-TIGHKGRGFTSILDITSNPAIFSQNG 138


>gi|356576915|ref|XP_003556575.1| PREDICTED: sacsin-like [Glycine max]
          Length = 4756

 Score = 50.4 bits (119), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 53/111 (47%), Gaps = 13/111 (11%)

Query: 1203 YSQDSHFLLELVQNADD------------NIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            Y + +  L EL+QNADD              +P     + +    +   ++  N+  F+ 
Sbjct: 30   YPEGTTVLKELIQNADDAGATTVSLCLDRRSHPAGSLLSDSLAQWQGPALLAFNDAVFTE 89

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            E+  ++  +G STK G  A   GR G+GF SV+ +TD P   S+ + V FD
Sbjct: 90   EDFVSISKIGGSTKHGQ-ASKTGRFGVGFNSVYHLTDLPSFVSHKYVVLFD 139


>gi|448495168|ref|ZP_21609788.1| hypothetical protein C463_14400 [Halorubrum californiensis DSM 19288]
 gi|445688533|gb|ELZ40790.1| hypothetical protein C463_14400 [Halorubrum californiensis DSM 19288]
          Length = 1637

 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 53/92 (57%), Gaps = 5/92 (5%)

Query: 1208 HFLLELVQNADDNIYPENVEP-TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTK-- 1264
             F+ EL+QNADD +  ++ +P  + F L+   ++V NN + F A ++ A+  +  +TK  
Sbjct: 48   RFVFELLQNADDEMPEDSDKPRKVRFELRGDCLLVANNGRPFDAADLDAITTLTQTTKGD 107

Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEIHS-NG 1295
            +G  A  IG KG GF S+  +T  P I S NG
Sbjct: 108  EGDEA-TIGHKGRGFTSILDITSNPAIFSQNG 138


>gi|449483857|ref|XP_004186247.1| PREDICTED: LOW QUALITY PROTEIN: sacsin [Taeniopygia guttata]
          Length = 4432

 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 50/109 (45%), Gaps = 25/109 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 2392 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 2446

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             Q F+ ++IR + ++G  TK G+     G+ GIGF SV+ +TD P   S
Sbjct: 2447 -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFLS 2493



 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 96/246 (39%), Gaps = 56/246 (22%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1320 EEYPSISDIFKELLQNADDANATECNFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1378

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+ S    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1379 SE-FSDSDFLNITRLGESLKR-SEVDKVGKFGLGFNSVYHITDVPIILSREFMIMFDPNV 1436

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMFCRLLSKDPVQLES--KCWN 1340
              I                          P    PF     C+L    P+ +ES      
Sbjct: 1437 NHISKHIRDKSNPGIKINWSKQQKRLRKFPNQFKPFIGVFGCQL----PLTVESPYSYKG 1492

Query: 1341 TCIRLPFRT----KFSE-GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIR 1395
            T  RL FRT    K SE   A  N  D++S L     +  HRL  I+F   +N S+V+  
Sbjct: 1493 TLFRLSFRTQQEAKVSEVSSASYNTADIYS-LVDEFSICGHRL--IIFAQSVN-SMVLKY 1548

Query: 1396 KKIVGD 1401
             KI  D
Sbjct: 1549 LKIEAD 1554


>gi|154284896|ref|XP_001543243.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150406884|gb|EDN02425.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 953

 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 46/81 (56%), Gaps = 5/81 (6%)

Query: 1433 TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSP 1492
            T E+ LA  + +S+     P++ QQ V++FLP+R  G KF++Q DF+  ++R+ V    P
Sbjct: 94   TAEVILAFPVDDSHR----PIIEQQHVYSFLPMRQEGFKFLIQSDFITTANRQGVH-LYP 148

Query: 1493 WNQWLLSEFPALFVSAERSFC 1513
             N  +      +F+ A  +FC
Sbjct: 149  RNYTIRERISLVFIQAVYNFC 169


>gi|149030215|gb|EDL85271.1| rCG52205 [Rattus norvegicus]
          Length = 3830

 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 1789 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1847

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 1848 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1899



 Score = 44.3 bits (103), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 719  EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 777

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 778  SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 835

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 836  NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 890

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 891  GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 950

Query: 1387 ------LNDSLVVIRKK 1397
                  L    ++I+KK
Sbjct: 951  EETNPTLAQDTIIIKKK 967


>gi|395848271|ref|XP_003796777.1| PREDICTED: sacsin [Otolemur garnettii]
          Length = 4604

 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2563 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2621

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2622 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2673



 Score = 46.2 bits (108), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S + V NN   F+
Sbjct: 122  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 180

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
            +E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 181  SEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 223



 Score = 42.7 bits (99), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 61/258 (23%), Positives = 99/258 (38%), Gaps = 68/258 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1493 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1551

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+      +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1552 SE-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHLTDIPIIMSREFMIMFDPNI 1609

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
              I                          P    PF ID+F C+L    P+ +E+     
Sbjct: 1610 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDIFGCQL----PLTVEAPYSYH 1664

Query: 1340 NTCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1665 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1724

Query: 1387 ------LNDSLVVIRKKI 1398
                  L    V+I+KK+
Sbjct: 1725 EETNPSLAQDTVIIKKKV 1742


>gi|148704160|gb|EDL36107.1| mCG7227 [Mus musculus]
          Length = 3830

 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 1789 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1847

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 1848 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1899



 Score = 45.4 bits (106), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 62/258 (24%), Positives = 102/258 (39%), Gaps = 68/258 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 719  EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 777

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 778  SE-FSDSDFLNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 835

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 836  NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 890

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 891  GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 950

Query: 1387 ------LNDSLVVIRKKI 1398
                  L    ++I+KK+
Sbjct: 951  EETNPSLAQDTIIIKKKV 968


>gi|260822048|ref|XP_002606415.1| hypothetical protein BRAFLDRAFT_67666 [Branchiostoma floridae]
 gi|229291756|gb|EEN62425.1| hypothetical protein BRAFLDRAFT_67666 [Branchiostoma floridae]
          Length = 4521

 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 89/379 (23%), Positives = 146/379 (38%), Gaps = 65/379 (17%)

Query: 1018 LPSEFRSFAADVLLSGLQSTIKDAP-SAILLECHQTELRLMLHEVGLSLGILEWIHDYHA 1076
            LP EFR F       G++ ++ +      L E   T       ++ LSLGI++WI  +  
Sbjct: 1288 LPDEFRDFKQMFRAVGVKESLDELDLIQTLHEIRSTPKLSTERDLKLSLGIIDWIVSHPD 1347

Query: 1077 FCSTGTSDLLMPCVVTCTNAATSGLNSGSGCAEGSLFESVGADVHIEECGAICDTICGEA 1136
                   D+L+P  +   +++   L   S C           DV   +  A+ D      
Sbjct: 1348 ILQKVRDDILVP--IATKDSSRLTLAPVSECTYN--------DVSWSDAQALLD------ 1391

Query: 1137 SDDGLGDCTTQTLPEDKE--CEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRA 1194
             +D       + +P D     E  +L    +  +EFG               QH ++ + 
Sbjct: 1392 DEDEEFKPLHKQVPLDTAQLLEVPSLSQRVVDAEEFGFDQC----------GQHEKVTQR 1441

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESGIVVLN-- 1243
            L  +  + Y + +    EL+QNADD    E         N +     I Q  G+ V +  
Sbjct: 1442 LKNILND-YPEGAGIFKELIQNADDAGATEVKFLVDWRTNEDSREDLIDQ--GMAVCHGP 1498

Query: 1244 -----NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
                 N   FS E+ + + ++G  TK   +   +GR G+GF SV+ VTD P   S    +
Sbjct: 1499 ALWAFNNATFSEEDFKNIQNLGGQTKL-DALEKVGRFGVGFNSVYHVTDVPSFVSGSRVI 1557

Query: 1299 KFDTSEGQIGFVLPTLVPP---FNIDMFCRLLSKDPVQLE----------SKCWN-TCIR 1344
             FD     +   +     P    ++    RL+S+   Q +          S  +N T  R
Sbjct: 1558 FFDPHATHLSKHIKDKSKPGIGLDLQKNTRLVSRFKDQFKPFQGVFGYSTSTSYNGTLFR 1617

Query: 1345 LPFRTKFSEGIAMNNIVDM 1363
            LP RT   +G + + I D+
Sbjct: 1618 LPLRT--PQGASDSEISDV 1634


>gi|410947206|ref|XP_003980343.1| PREDICTED: sacsin [Felis catus]
          Length = 4653

 Score = 50.1 bits (118), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2612 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2670

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2671 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2722



 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 182  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGPALYVYNNAV-FT 240

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 241  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 283



 Score = 43.5 bits (101), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 63/234 (26%), Positives = 91/234 (38%), Gaps = 57/234 (24%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1542 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1600

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+      +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1601 SQ-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1658

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESKCW-- 1339
              I                          P    PF ID+F CRL    P+ +E+     
Sbjct: 1659 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCRL----PLTVEAPYSYD 1713

Query: 1340 NTCIRLPFRTKFSEGIA-----MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLN 1388
             T  RL FRT+    ++       N  D++S L     L  HRL  I+F   +N
Sbjct: 1714 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYS-LVDEFSLCGHRL--IIFTQSVN 1764


>gi|392843831|gb|AFM84519.1| sacsin, partial [Anas platyrhynchos]
          Length = 736

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|344284593|ref|XP_003414050.1| PREDICTED: sacsin [Loxodonta africana]
          Length = 4805

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2764 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2822

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2823 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2874



 Score = 47.0 bits (110), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 67/257 (26%), Positives = 101/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1694 EEYPSVSDIFKELLQNADDAGATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1752

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1753 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1810

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
              I                          P    PF ID+F C+L    P+ LE+     
Sbjct: 1811 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTLEAPYSYK 1865

Query: 1340 NTCIRLPFRT----KFSE--GIAMN-----NIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT    K SE  G   N     ++VD FS     L++F   +  +  + +  
Sbjct: 1866 GTLFRLSFRTQQEAKVSEVSGTCYNTADIYSLVDEFSLCGHRLIVFTQSVNSMYLKYLKI 1925

Query: 1387 ------LNDSLVVIRKK 1397
                  L   +++I+KK
Sbjct: 1926 EETNPNLAQDIIIIKKK 1942



 Score = 44.3 bits (103), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 60/153 (39%), Gaps = 42/153 (27%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S +   NN   F+
Sbjct: 333  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYAYNNAI-FT 391

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S            QIG 
Sbjct: 392  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFSG----------DQIGM 440

Query: 1310 VLP--TLVPPFNIDMFCRLLSKDPVQLESKCWN 1340
            + P  TL  P               Q   +CWN
Sbjct: 441  LDPHQTLFGP---------------QESGQCWN 458


>gi|392843851|gb|AFM84529.1| sacsin, partial [Columba livia]
          Length = 736

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 26/118 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             Q F+ ++IR + ++G  TK G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 527


>gi|344248514|gb|EGW04618.1| Sacsin [Cricetulus griseus]
          Length = 4371

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2330 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2388

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2389 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2440



 Score = 44.3 bits (103), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1259 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1317

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1318 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1375

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1376 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1430

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1431 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1490

Query: 1387 ------LNDSLVVIRKK 1397
                  L    ++I+KK
Sbjct: 1491 EESNPSLAQDTIIIKKK 1507


>gi|348583059|ref|XP_003477292.1| PREDICTED: LOW QUALITY PROTEIN: sacsin-like [Cavia porcellus]
          Length = 4573

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2532 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2590

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2591 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2642


>gi|345790413|ref|XP_534533.3| PREDICTED: LOW QUALITY PROTEIN: sacsin [Canis lupus familiaris]
          Length = 4575

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2534 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2592

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2593 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2644



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 104  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGPALYVYNNAV-FT 162

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 163  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 205



 Score = 43.1 bits (100), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 98/257 (38%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1464 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1522

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+      +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1523 SQ-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1580

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
              I                          P    PF ID+F C+L    P+ +E+     
Sbjct: 1581 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYS 1635

Query: 1340 NTCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1636 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVTSMYLKYLKI 1695

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1696 EETNPSLAQDTVIIKKK 1712


>gi|194221780|ref|XP_001490776.2| PREDICTED: sacsin [Equus caballus]
          Length = 4612

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2571 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2629

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2630 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2681



 Score = 45.4 bits (106), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S + V NN   F+
Sbjct: 142  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 200

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 201  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 243


>gi|444706237|gb|ELW47586.1| Sacsin [Tupaia chinensis]
          Length = 4519

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2478 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2536

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2537 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2588



 Score = 43.9 bits (102), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 100/257 (38%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1408 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1466

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1467 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1524

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C L    P+ +E+   +N
Sbjct: 1525 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCHL----PLTVEAPYSYN 1579

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1580 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1639

Query: 1387 ------LNDSLVVIRKK 1397
                  L    ++I+KK
Sbjct: 1640 EETNPSLAQDTIIIKKK 1656


>gi|281346562|gb|EFB22146.1| hypothetical protein PANDA_009473 [Ailuropoda melanoleuca]
          Length = 4582

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2541 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2599

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2600 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2651



 Score = 44.3 bits (103), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 100/257 (38%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1471 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1529

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+      +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1530 SQ-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1587

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1588 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1642

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1643 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1702

Query: 1387 ------LNDSLVVIRKK 1397
                  L    VVI+KK
Sbjct: 1703 EETNPRLAQDTVVIKKK 1719



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 112  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGPALYVYNNAV-FT 170

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 171  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 213


>gi|351695481|gb|EHA98399.1| Sacsin [Heterocephalus glaber]
          Length = 4576

 Score = 50.1 bits (118), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2590 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2648

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2649 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2700


>gi|428185144|gb|EKX53997.1| hypothetical protein GUITHDRAFT_100248 [Guillardia theta CCMP2712]
          Length = 4444

 Score = 50.1 bits (118), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 60/134 (44%), Gaps = 15/134 (11%)

Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD------NIYPENVEPTLTFI 1233
            +S    +  +  GR  H L  ELY +    + EL+QNADD      +I       + + +
Sbjct: 1382 DSESFGQSESLTGRLKHIL--ELYPEGPGIMNELIQNADDAQATEFSIIVNKCTYSTSSL 1439

Query: 1234 LQ------ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
            L       +   +  +N   FS ++ + +C +G  +K       IGR G+GF +V+  TD
Sbjct: 1440 LSPRMASWQGPAIYFHNNSIFSPDDFKNICKIGQGSKL-DFLNKIGRFGLGFNAVYHFTD 1498

Query: 1288 APEIHSNGFHVKFD 1301
             P   S  + V FD
Sbjct: 1499 VPSFVSGDYLVYFD 1512



 Score = 42.4 bits (98), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 71/172 (41%), Gaps = 29/172 (16%)

Query: 1203 YSQDSHFLLELVQNADD------------NIYPEN--VEPTLTFILQESGIVVLNNEQGF 1248
            Y   +  L E +QNADD              +P +   +P L    Q S +V   N+  F
Sbjct: 40   YPPGATILKEFIQNADDAGAQQIKICLDERSFPSSSLADPKLGQFQQSSLLVY--NDAVF 97

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
            S E+  ++  +G S+K+       GR GIGF S + +T+ P   S    V FD     + 
Sbjct: 98   SDEDFDSIQRIGQSSKQAHPTK-TGRFGIGFNSCYHLTELPTFLSRSSIVMFDPQAKYLP 156

Query: 1309 FVLPTLVPPFNIDMFCRLLSKD-PVQL---------ESKCWN-TCIRLPFRT 1349
             V P   P   I+     + ++ P Q+           K +N T  RLP RT
Sbjct: 157  NVNPA-NPGKRINFLSEEIQRNYPDQIAPYFNFGARSDKAFNGTLFRLPLRT 207


>gi|301770575|ref|XP_002920712.1| PREDICTED: sacsin-like [Ailuropoda melanoleuca]
          Length = 4608

 Score = 50.1 bits (118), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2567 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2625

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2626 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2677



 Score = 44.3 bits (103), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 100/257 (38%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1497 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1555

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+      +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1556 SQ-FSDSDFVNITRLGESLKR-EEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1613

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1614 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1668

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1669 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1728

Query: 1387 ------LNDSLVVIRKK 1397
                  L    VVI+KK
Sbjct: 1729 EETNPRLAQDTVVIKKK 1745



 Score = 43.9 bits (102), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 138  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGPALYVYNNAV-FT 196

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 197  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 239


>gi|293342144|ref|XP_001063390.2| PREDICTED: sacsin isoform 1 [Rattus norvegicus]
 gi|293353889|ref|XP_224256.5| PREDICTED: sacsin isoform 2 [Rattus norvegicus]
          Length = 4581

 Score = 50.1 bits (118), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2540 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2598

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2599 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2650



 Score = 45.4 bits (106), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S + V NN   F+
Sbjct: 106  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 164

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 165  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 207



 Score = 44.3 bits (103), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1470 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1528

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1529 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1586

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1587 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1641

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1642 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1701

Query: 1387 ------LNDSLVVIRKK 1397
                  L    ++I+KK
Sbjct: 1702 EETNPTLAQDTIIIKKK 1718


>gi|392333336|ref|XP_003752865.1| PREDICTED: sacsin isoform 2 [Rattus norvegicus]
 gi|392353567|ref|XP_003751541.1| PREDICTED: sacsin isoform 1 [Rattus norvegicus]
          Length = 4578

 Score = 50.1 bits (118), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2537 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2595

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2596 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2647



 Score = 45.4 bits (106), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S + V NN   F+
Sbjct: 103  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 161

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 162  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 204



 Score = 44.3 bits (103), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1467 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1525

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1526 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1583

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1584 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1638

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1639 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1698

Query: 1387 ------LNDSLVVIRKK 1397
                  L    ++I+KK
Sbjct: 1699 EETNPTLAQDTIIIKKK 1715


>gi|354476692|ref|XP_003500557.1| PREDICTED: sacsin [Cricetulus griseus]
          Length = 4610

 Score = 50.1 bits (118), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2569 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2627

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2628 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2679



 Score = 45.4 bits (106), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S + V NN   F+
Sbjct: 133  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 191

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 192  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 234



 Score = 44.3 bits (103), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1498 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1556

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1557 SE-FSDSDFINITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1614

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1615 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1669

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1670 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1729

Query: 1387 ------LNDSLVVIRKK 1397
                  L    ++I+KK
Sbjct: 1730 EESNPSLAQDTIIIKKK 1746


>gi|334330536|ref|XP_003341374.1| PREDICTED: LOW QUALITY PROTEIN: sacsin-like [Monodelphis domestica]
          Length = 4680

 Score = 50.1 bits (118), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2640 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2698

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2699 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2750



 Score = 44.3 bits (103), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 66/266 (24%), Positives = 101/266 (37%), Gaps = 68/266 (25%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1571 EEYPSVSDIFKELLQNADDANATECNFMIDMRRNMDIRENLLDPGMA-ACHGPALWSYNN 1629

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1630 SE-FSDSDFINITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1687

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
              +                          P    PF ID+F C+L    P+ +E+     
Sbjct: 1688 NHLSKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYK 1742

Query: 1340 NTCIRLPFRT----KFSE-------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT    K SE          + ++VD FS     L+LF   +  +  + +  
Sbjct: 1743 GTLFRLSFRTQQEAKLSEVSSTCYNTADIYSLVDEFSLCGHRLILFTQNVNTMFLKYLKI 1802

Query: 1387 ------LNDSLVVIRKKIVGDGIIKV 1406
                  L    +VI+K I     + V
Sbjct: 1803 EETNPGLAQDTIVIKKNICSSKALNV 1828



 Score = 43.5 bits (101), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E      T         Q S + V NN   F+
Sbjct: 210  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGKETLWSKDMEQYQGSALYVYNNAV-FT 268

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 269  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 311


>gi|395520793|ref|XP_003764508.1| PREDICTED: sacsin [Sarcophilus harrisii]
          Length = 4804

 Score = 50.1 bits (118), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2765 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2823

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2824 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2875



 Score = 45.1 bits (105), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 65/266 (24%), Positives = 103/266 (38%), Gaps = 68/266 (25%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1696 EEYPSVSDIFKELLQNADDADATECNFMIDMRRNMDIRENLLDPGMA-ACHGPALWSYNN 1754

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1755 SE-FSDSDFINITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1812

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
              I                          P    PF ID+F C+L    P+ +E+     
Sbjct: 1813 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYK 1867

Query: 1340 NTCIRLPFRT----KFSE-------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT    K SE          + ++VD FS     L+LF   +  +  + +  
Sbjct: 1868 GTLFRLSFRTQQEAKQSEISSTCYNTADIYSLVDEFSLCGHRLILFTQNVNTMFLKYLKI 1927

Query: 1387 ------LNDSLVVIRKKIVGDGIIKV 1406
                  L   +++I+K +    ++ V
Sbjct: 1928 EETNPGLAQDIIIIKKNMCSSKVLNV 1953



 Score = 43.5 bits (101), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E      T         Q S + V NN   F+
Sbjct: 335  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGKETLWSKDMEQYQGSALYVYNNAV-FT 393

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 394  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 436


>gi|453052088|gb|EME99578.1| hypothetical protein H340_15726 [Streptomyces mobaraensis NBRC 13819
            = DSM 40847]
          Length = 1585

 Score = 49.7 bits (117), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 48/154 (31%), Positives = 64/154 (41%), Gaps = 22/154 (14%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            L ELVQNA D I  E     +  +L ++ +   N     + E    +  +  S K+G   
Sbjct: 50   LFELVQNAADEIRDEP-GGRIHVVLTDTHLYCANQGNPVTPEGAETILRMSMSRKRG--- 105

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS---------------EGQIGFVLPTL 1314
            G IGR G+G KSV  VTD PE  S+     FD +                G+  F  P L
Sbjct: 106  GQIGRFGVGVKSVLVVTDTPEFFSSTGSFGFDRAWSYDLIRNVPGVRERHGE-DFEAPVL 164

Query: 1315 VPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPF 1347
                 +D     L+ DPV  E   W  T +RLP 
Sbjct: 165  RMARELDSDAERLA-DPVLDELLGWATTVVRLPL 197


>gi|6907042|gb|AAF31262.1|AF193556_1 sacsin [Homo sapiens]
 gi|119628727|gb|EAX08322.1| hCG1774880 [Homo sapiens]
          Length = 3829

 Score = 49.7 bits (117), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 1789 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1847

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 1848 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1899



 Score = 47.0 bits (110), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 719  EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 777

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 778  SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 835

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 836  NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 890

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 891  GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 950

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 951  EETNPSLAQDTVIIKKK 967


>gi|390358686|ref|XP_003729316.1| PREDICTED: sacsin-like [Strongylocentrotus purpuratus]
          Length = 2828

 Score = 49.7 bits (117), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 62/251 (24%), Positives = 105/251 (41%), Gaps = 52/251 (20%)

Query: 1171 GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE------ 1224
            GLGP+    ++ +   Q   L   +  + +  Y  D   L ELVQNADD+   E      
Sbjct: 813  GLGPHTLEAQNLLPFGQRQPLTERIKNILKG-YPCDDTILKELVQNADDSCATEVHLVLD 871

Query: 1225 -NVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGI 1277
                P           LQ   + V NN + F+  +I  +  +G  +K+  S+  +GR G+
Sbjct: 872  VRTHPKDRLFRDEMSSLQGPALCVYNN-RSFTDADIEGIQRLGVGSKRNLSSK-VGRFGV 929

Query: 1278 GFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTL-VPPFNIDMFCRLLSKDPVQLES 1336
            GF +V+ +TD P  ++NG     DT    +G + P +   P     +   ++K   +L+ 
Sbjct: 930  GFNAVYHLTDCPSFYTNG-----DT----LGMLDPNVKYAPNATLAYPGFMAKPTGKLKD 980

Query: 1337 ------KCW---------NTCIRLPFRT-----------KFSEGIAMNNIVDMFSDLHPS 1370
                   C+         +T  RLP RT           K  +  ++  ++  F +  P 
Sbjct: 981  GFPNVFNCYLPEFFEAQGSTMFRLPLRTESMARTSDISSKAHDVDSVKRLMRKFKEGLPD 1040

Query: 1371 LLLFLHRLQCI 1381
            LLLFL  ++ +
Sbjct: 1041 LLLFLKHVKKV 1051


>gi|281211462|gb|EFA85624.1| putative ATP binding protein [Polysphondylium pallidum PN500]
          Length = 2733

 Score = 49.7 bits (117), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 93/417 (22%), Positives = 154/417 (36%), Gaps = 96/417 (23%)

Query: 1145 TTQTLPEDKECEDAALIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYS 1204
            T   L  D    DA  +      D    G  I   E    +K   RL         E Y 
Sbjct: 1256 TYSILHRDLSLNDARKLKIKTSLDTIKRGKTILGSEFGQFEKLLTRLKNI-----NEEYK 1310

Query: 1205 QDSHFLLELVQNADDN--------------IYPENVEPTL--TFILQESGIVVLNNEQGF 1248
             +S  L E+VQNA+D+                 ++++P L   F       +V+ N+  F
Sbjct: 1311 VES-LLKEVVQNAEDSKATVLEICLDKRRYTITDDLDPELPKNFGNYLEPSLVIFNDSKF 1369

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI----------------- 1291
            +  +I+ +  +G+S KK + +  IG  GIGF  V+ +TD P I                 
Sbjct: 1370 TDLDIQNIQSLGDSYKKNNYSK-IGHYGIGFNCVYNITDVPSIITRDSLFIFDPLKSHFS 1428

Query: 1292 --HSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT 1349
              HS G   +F   +  +    P   P F +            QL+ +  +T IRLP R 
Sbjct: 1429 VEHSAGRSYEFSIRDQLVDTFKPLNYPGFGL------------QLDKEFDHTIIRLPLRK 1476

Query: 1350 KFSE-------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLND------SLVVIRK 1396
             +S+         ++N ++  F     + LLF   L+ I F  + +D      +L  + +
Sbjct: 1477 SYSDLSSNVWTMDSINKLLKEFQSSAENCLLFQRSLKKITFSVIEDDNSNQAKTLYSVER 1536

Query: 1397 KIVGD-GIIKVSCGEDKMTW-------FVASQKLRAGVIRPDVKTTEIALALTLQESNEG 1448
            ++ G    I +S   +   W       +         V+       +I      Q SN  
Sbjct: 1537 RVTGGYDPINLSSDIEAYNWKNFNPKSYFMDIITNGNVVNKWFVGWKIGAE---QSSNLF 1593

Query: 1449 NYGPLLYQQPV-----------------FAFLPL-RTYGLKFILQGDFVLPSSREEV 1487
                 L+Q+ +                 F +LPL ++  L  ++ G F+L SSR++V
Sbjct: 1594 KIYEKLHQKSIPLGSIAICMDKEIIGVPFTYLPLPKSTELPVMVNGSFILNSSRQDV 1650


>gi|45382637|ref|NP_990050.1| chondroitin sulfate proteoglycan 5 precursor [Gallus gallus]
 gi|2065050|emb|CAA70459.1| epidermal growth factor [Gallus gallus]
          Length = 551

 Score = 49.7 bits (117), Expect = 0.018,   Method: Composition-based stats.
 Identities = 44/169 (26%), Positives = 65/169 (38%), Gaps = 37/169 (21%)

Query: 6   RPRFRPVGGVG--GEGTGGGQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTN--- 60
           RP   P  G+G    G   G P    P    NPN    +PN+  PNP+L L         
Sbjct: 134 RPPGLPSPGLGLSSPGPNLGLPSLDLP----NPNLGLPDPNLGLPNPSLGLPSPGPTPDR 189

Query: 61  -IPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQ--------------------------Q 93
            IP  NP+ D+ +P L +Q  NL   NPN+P                            Q
Sbjct: 190 PIPNPNPSLDLPDPGLAIQTPNLGLSNPNIPLPSPSPGPGTEPDLLPVAEDSEVSMELPQ 249

Query: 94  PPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFLERID 142
           P SS    QR++ +          P++  P +  P P+ ++ ++ +  D
Sbjct: 250 PSSSPAPAQRARGRTDRTWLGAPEPISAAPGTAEP-PEIIDVDYYDVFD 297


>gi|297274100|ref|XP_001089964.2| PREDICTED: sacsin-like isoform 2 [Macaca mulatta]
          Length = 4517

 Score = 49.7 bits (117), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 85/345 (24%), Positives = 132/345 (38%), Gaps = 67/345 (19%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2536 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2594

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFDT----SE 1304
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD     + 
Sbjct: 2595 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFDPHARYAP 2653

Query: 1305 GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLE-------SKCWNTCIRLPFR----TKFSE 1353
            G        +    + D   R    D + L          C  T  R P R     K SE
Sbjct: 2654 GATSISPGRMFRDLDADF--RTQFSDVLDLYLGTHFKLDNC--TMFRFPLRNAEMAKVSE 2709

Query: 1354 -------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKV 1406
                      + N++D        LL+FL+ ++ I        S+  I K      ++  
Sbjct: 2710 ISSVPASDRMVQNLLDKLRSDGAELLMFLNHMEKI--------SICEIDKSTGALNVLYS 2761

Query: 1407 SCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLR 1466
              G+      +  ++  A VI    K  ++   + +Q+  + +          F FLPL 
Sbjct: 2762 VKGKITDGDRLKRKQFHASVIDSVTKKRQLK-DIPVQQXXKPHRA--------FCFLPLS 2812

Query: 1467 -TYGLKFILQGDFVLPSSREEV--DGN-----SPWNQWLLSEFPA 1503
               GL F + G F L S+R  +  D N     S WN  L++   A
Sbjct: 2813 LETGLPFHVNGHFALDSARRNLWRDDNGVGVRSDWNNSLMTALIA 2857



 Score = 45.1 bits (105), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1466 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1524

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1525 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1582

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1583 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1637

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1638 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1697

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1698 EETNPSLAQDTVIIKKK 1714


>gi|426236795|ref|XP_004012352.1| PREDICTED: sacsin [Ovis aries]
          Length = 4691

 Score = 49.7 bits (117), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2650 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDDKWAPLQGPALCVYNN-QPFT 2708

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ G+GF SV+ +TD P  I  N     FD
Sbjct: 2709 EDDVRGIQNLGRGTKEGNPCK-TGQYGVGFNSVYHITDCPSFISGNDILCIFD 2760



 Score = 44.3 bits (103), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 216  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTQTLWSKDMAQYQGPALYVYNNAV-FT 274

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 275  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 317



 Score = 43.1 bits (100), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 19/123 (15%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1580 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1638

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1639 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPVIMSREFMIMFDPNI 1696

Query: 1305 GQI 1307
              I
Sbjct: 1697 NHI 1699


>gi|410047646|ref|XP_001174272.3| PREDICTED: sacsin [Pan troglodytes]
          Length = 4722

 Score = 49.7 bits (117), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2682 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2740

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2741 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2792



 Score = 47.0 bits (110), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1612 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1670

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1671 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1728

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1729 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1783

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1784 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1843

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1844 EETNPSLAQDTVIIKKK 1860



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 250  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 308

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 309  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 351


>gi|355700867|gb|EHH28888.1| DnaJ-like protein subfamily C member 29, partial [Macaca mulatta]
          Length = 4523

 Score = 49.7 bits (117), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2483 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2541

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2542 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2593



 Score = 45.1 bits (105), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1413 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1472 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1529

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1530 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1584

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1585 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1644

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1645 EETNPSLAQDTVIIKKK 1661


>gi|440911870|gb|ELR61498.1| Sacsin, partial [Bos grunniens mutus]
          Length = 4032

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 1991 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2049

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ G+GF SV+ +TD P  I  N     FD
Sbjct: 2050 EDDVRGIQNLGRGTKEGNPCK-TGQYGVGFNSVYHITDCPSFISGNDILCIFD 2101



 Score = 43.1 bits (100), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 19/124 (15%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 921  EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 979

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 980  SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPVIMSREFMIMFDPNI 1037

Query: 1305 GQIG 1308
              I 
Sbjct: 1038 NHIS 1041


>gi|431903116|gb|ELK09292.1| Sacsin [Pteropus alecto]
          Length = 3445

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 1520 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRVFDDKWAPLQGPALCVYNN-QPFT 1578

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 1579 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1630



 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 59/208 (28%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 772  EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 830

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 831  SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 888

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESKCWNT 1341
              I                          P    PF ID+F CRL    P+ +E+     
Sbjct: 889  NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCRL----PLTVEA----- 938

Query: 1342 CIRLPFRTKFSEGIAMNN---IVDMFSD 1366
                P+  +F E IA      +  +FSD
Sbjct: 939  ----PYSYRFYEDIAHGKGKELTRVFSD 962


>gi|410350047|gb|JAA41627.1| spastic ataxia of Charlevoix-Saguenay (sacsin) [Pan troglodytes]
          Length = 4579

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649



 Score = 47.0 bits (110), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 107  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 166  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208


>gi|441613500|ref|XP_004088146.1| PREDICTED: sacsin [Nomascus leucogenys]
          Length = 4527

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2487 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2545

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2546 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2597



 Score = 47.0 bits (110), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1417 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1475

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1476 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1533

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1534 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1588

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1589 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1648

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1649 EETNPSLAQDTVIIKKK 1665


>gi|410220792|gb|JAA07615.1| spastic ataxia of Charlevoix-Saguenay (sacsin) [Pan troglodytes]
 gi|410306884|gb|JAA32042.1| spastic ataxia of Charlevoix-Saguenay (sacsin) [Pan troglodytes]
          Length = 4579

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649



 Score = 47.0 bits (110), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 107  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 166  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208


>gi|380798317|gb|AFE71034.1| sacsin, partial [Macaca mulatta]
          Length = 3313

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 1273 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1331

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 1332 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 1383



 Score = 44.7 bits (104), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 203  EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 261

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 262  SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 319

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 320  NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 374

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 375  GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 434

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 435  EETNPSLAQDTVIIKKK 451


>gi|402901560|ref|XP_003913715.1| PREDICTED: sacsin [Papio anubis]
          Length = 4579

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649



 Score = 45.1 bits (105), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSNTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717


>gi|392843825|gb|AFM84516.1| sacsin, partial [Zosterops japonicus]
          Length = 736

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 50/109 (45%), Gaps = 25/109 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRQHPADRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             Q F+ ++IR + ++G  TK G+     G+ GIGF SV+ +TD P   S
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHITDCPSFLS 518


>gi|355761304|gb|EHH61786.1| DnaJ-like protein subfamily C member 29, partial [Macaca
            fascicularis]
          Length = 4523

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2483 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2541

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2542 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2593



 Score = 45.1 bits (105), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 63/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1413 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1472 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1529

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1530 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1584

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1585 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1644

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1645 EETNPSLAQDTVIIKKK 1661


>gi|397526330|ref|XP_003833084.1| PREDICTED: LOW QUALITY PROTEIN: sacsin [Pan paniscus]
          Length = 4579

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649



 Score = 45.8 bits (107), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I KK
Sbjct: 1701 EETNPSLAQDTVIIXKK 1717



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 107  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 166  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208


>gi|426374909|ref|XP_004054297.1| PREDICTED: sacsin [Gorilla gorilla gorilla]
          Length = 4579

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649



 Score = 47.0 bits (110), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717



 Score = 43.9 bits (102), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 107  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 166  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208


>gi|76364133|sp|Q9DF69.1|CSPG5_CHICK RecName: Full=Chondroitin sulfate proteoglycan 5; AltName:
           Full=Acidic leucine-rich EGF-like domain-containing
           brain protein; Contains: RecName: Full=Chondroitin
           sulfate proteoglycan 5, 38 kDa form; Contains: RecName:
           Full=Chondroitin sulfate proteoglycan 5, 80 kDa form;
           Flags: Precursor
 gi|11093919|gb|AAG29499.1|AF292101_1 CALEB isoform b [Gallus gallus]
          Length = 594

 Score = 49.7 bits (117), Expect = 0.019,   Method: Composition-based stats.
 Identities = 44/169 (26%), Positives = 65/169 (38%), Gaps = 37/169 (21%)

Query: 6   RPRFRPVGGVG--GEGTGGGQPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTN--- 60
           RP   P  G+G    G   G P    P    NPN    +PN+  PNP+L L         
Sbjct: 134 RPPGLPSPGLGLSSPGPNLGLPSLDLP----NPNLGLPDPNLGLPNPSLGLPSPGPTPDR 189

Query: 61  -IPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQ--------------------------Q 93
            IP  NP+ D+ +P L +Q  NL   NPN+P                            Q
Sbjct: 190 PIPNPNPSLDLPDPGLAIQTPNLGLSNPNIPLPSPSPGPGTEPDLLPVAEDSEVSMELPQ 249

Query: 94  PPSSACNQQRSQTQHPPGASNQQRPLTQQPQSLPPQPQKLNKEFLERID 142
           P SS    QR++ +          P++  P +  P P+ ++ ++ +  D
Sbjct: 250 PSSSPAPAQRARGRTDRTWLGAPEPISAAPGTAEP-PEIIDVDYYDVFD 297


>gi|345319042|ref|XP_001519222.2| PREDICTED: sacsin [Ornithorhynchus anatinus]
          Length = 4639

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2598 YPSEKEMLKELLQNADDAKATEICFVFDPRHHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2656

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2657 EDDVRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2708



 Score = 45.8 bits (107), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 49/104 (47%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQE------SGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T   +E      S + V NN   F+
Sbjct: 167  YPEGGQILKELIQNAEDAGATEVKFLYDETKYGTETLWSKEMEQYQGSALYVYNNAV-FT 225

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 226  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 268



 Score = 43.1 bits (100), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 65/258 (25%), Positives = 100/258 (38%), Gaps = 68/258 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1528 EEYPSVSDIFKELLQNADDANATECNFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1586

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1587 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1644

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLES--KCW 1339
              +                          P    PF ID+F C+L    P+ +E+     
Sbjct: 1645 NHLSKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYK 1699

Query: 1340 NTCIRLPFRT----KFSE--GIAMN-----NIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT    K SE  G   N     ++VD FS     L++F   +  +  + +  
Sbjct: 1700 GTLFRLSFRTQQEAKLSEVSGTCYNTADIYSLVDEFSLCGHRLIIFTQNVNSMHLKYLKI 1759

Query: 1387 ------LNDSLVVIRKKI 1398
                  L    V+I+K +
Sbjct: 1760 EETNPALAQDTVIIKKTL 1777


>gi|297693647|ref|XP_002824122.1| PREDICTED: LOW QUALITY PROTEIN: sacsin [Pongo abelii]
          Length = 4576

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2536 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2594

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2595 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2646



 Score = 47.0 bits (110), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1466 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1524

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1525 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1582

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1583 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1637

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1638 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1697

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1698 EETNPSLAQDTVIIKKK 1714



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 104  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 162

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 163  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 205


>gi|163659918|ref|NP_055178.3| sacsin [Homo sapiens]
 gi|122066060|sp|Q9NZJ4.2|SACS_HUMAN RecName: Full=Sacsin; AltName: Full=DnaJ homolog subfamily C member
            29; Short=DNAJC29
          Length = 4579

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2539 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2597

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2598 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2649



 Score = 47.0 bits (110), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1469 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1527

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1528 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1585

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1586 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1640

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1641 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1700

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1701 EETNPSLAQDTVIIKKK 1717



 Score = 43.9 bits (102), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 107  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 165

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 166  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208


>gi|271970078|ref|YP_003344274.1| hypothetical protein [Streptosporangium roseum DSM 43021]
 gi|270513253|gb|ACZ91531.1| hypothetical protein Sros_8899 [Streptosporangium roseum DSM 43021]
          Length = 1043

 Score = 49.7 bits (117), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 64/153 (41%), Gaps = 9/153 (5%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
              ++EL QNA D      V   L   L+E  +   N      A  I  L  +  S K+  
Sbjct: 43   RLIVELAQNAADAALRAGVPGRLRLSLREGVLSAANTGAPVDAAGIEGLSTLRVSGKRDE 102

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVL--PTLVPPF-NIDMFC 1324
            S G  GR G+GF +V  VTD P I S G  V   + +     V   P L     + D   
Sbjct: 103  S-GMAGRFGVGFAAVVSVTDEPVIGSLGSGVVRWSRDQTTALVAAEPALARELTDRDRHV 161

Query: 1325 RLL----SKDPVQLESKCWNTCIRLPFRTKFSE 1353
             LL       PV++    ++T +RLP R + +E
Sbjct: 162  PLLRLPFPAGPVEVPG-GFDTLVRLPLRDRAAE 193


>gi|390463960|ref|XP_002806928.2| PREDICTED: LOW QUALITY PROTEIN: sacsin [Callithrix jacchus]
          Length = 5004

 Score = 49.7 bits (117), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2964 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 3022

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 3023 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 3074



 Score = 44.3 bits (103), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 53/123 (43%), Gaps = 19/123 (15%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1952 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 2010

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 2011 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 2068

Query: 1305 GQI 1307
              I
Sbjct: 2069 NHI 2071



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 590  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 648

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 649  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 691


>gi|348519944|ref|XP_003447489.1| PREDICTED: sacsin-like [Oreochromis niloticus]
          Length = 301

 Score = 49.7 bits (117), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 62/236 (26%), Positives = 92/236 (38%), Gaps = 47/236 (19%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG-----IVVLNNEQGFSA 1250
            Y      L EL+QNADD        I+ E    T +   +E G      +   N   F+ 
Sbjct: 31   YPDGGQILKELIQNADDARATEVVFIHDERSYGTESIWTEELGKYQGPALYAYNNAAFTV 90

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI------------------H 1292
            E+ + +   G S K+   +  +GR GIGF SV+ +TD P I                   
Sbjct: 91   EDWQGIQMAGRSVKRDDPSK-VGRFGIGFNSVYHITDVPSILSSEHLGMMDPQEKIFGER 149

Query: 1293 SNGFHVKFDTSEGQ-IGFVLPTLVPPFN--IDMFCRLLSKDPVQLESKCWN-TCIRLPFR 1348
            S GF    D SE Q +   +     PF   + + C       V +E + +N T  R P R
Sbjct: 150  SGGFRWSLDDSEDQEVLLNMHHQFQPFRDIVSLVCE-HGWSKVVMEDQHFNGTIFRFPLR 208

Query: 1349 TKFSEGIAMNNIV--DMFSDLHPSL-------LLFLHRLQCIMFRNMLNDSLVVIR 1395
             + SE    +N+   D  ++L  S        LLFL  +  +   ++  D  V  R
Sbjct: 209  NEASE--ISDNLYDSDKVTELFDSFIADADLSLLFLKNVTSVSLLHISEDGAVNTR 262


>gi|403255390|ref|XP_003920419.1| PREDICTED: sacsin [Saimiri boliviensis boliviensis]
          Length = 4696

 Score = 49.7 bits (117), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2656 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2714

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 2715 EDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 2766



 Score = 46.2 bits (108), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 101/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1586 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1644

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1645 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1702

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1703 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1757

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1758 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1817

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1818 EESNPSLAQDTVIIKKK 1834



 Score = 43.5 bits (101), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 224  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAI-FT 282

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 283  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 325


>gi|76631591|ref|XP_586753.2| PREDICTED: sacsin [Bos taurus]
 gi|297481151|ref|XP_002691909.1| PREDICTED: sacsin [Bos taurus]
 gi|296481777|tpg|DAA23892.1| TPA: hypothetical protein BOS_12530 [Bos taurus]
          Length = 4575

 Score = 49.7 bits (117), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2534 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2592

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ G+GF SV+ +TD P  I  N     FD
Sbjct: 2593 EDDVRGIQNLGRGTKEGNPCK-TGQYGVGFNSVYHITDCPSFISGNDILCIFD 2644



 Score = 44.3 bits (103), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 104  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTQTLWSKDMAQYQGPALYVYNNAV-FT 162

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 163  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 205



 Score = 42.7 bits (99), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 19/123 (15%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1464 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1522

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1523 SQ-FSDSDFVNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPVIMSREFMIMFDPNI 1580

Query: 1305 GQI 1307
              I
Sbjct: 1581 NHI 1583


>gi|448307818|ref|ZP_21497706.1| hypothetical protein C494_08772 [Natronorubrum bangense JCM 10635]
 gi|445594953|gb|ELY49079.1| hypothetical protein C494_08772 [Natronorubrum bangense JCM 10635]
          Length = 863

 Score = 49.7 bits (117), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 52/209 (24%), Positives = 89/209 (42%), Gaps = 38/209 (18%)

Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESG---IVVLNNEQGFSAENIRA-------- 1255
            ++F LE +QNADD    E        + Q  G   + +LNN + F+   +          
Sbjct: 6    TYFPLEFIQNADD----EGASAVRFQVRQIDGDWCLEILNNGRKFTDSALEGQDRDTDDV 61

Query: 1256 ------LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI-G 1308
                  LC  G S K      +IG  G+GFKS+F ++   E+HS  FH  FD    +  G
Sbjct: 62   KDDVTGLCAAGVSPKHPRD--HIGFIGVGFKSIFEISQCVEVHSGQFHFSFDRERAEKHG 119

Query: 1309 FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNT--CIRLPFRTKFSEGIAMNNIVDMFSD 1366
              +P  V P+  +      ++ P +++   + T   +RL  R     G  + +  ++F  
Sbjct: 120  DEIPWRVVPWECNATKE--TEVPAEIDGTEYTTRFVVRLDER-----GRELLSKDELFEP 172

Query: 1367 LHPS-----LLLFLHRLQCIMFRNMLNDS 1390
            LH       + LFL  ++ +   +  +D+
Sbjct: 173  LHSENIDRRVFLFLKDVRELSIEDHYSDN 201


>gi|374321716|ref|YP_005074845.1| HSP90 family protein [Paenibacillus terrae HPL-003]
 gi|357200725|gb|AET58622.1| HSP90 family protein [Paenibacillus terrae HPL-003]
          Length = 615

 Score = 49.7 bits (117), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 58/118 (49%), Gaps = 21/118 (17%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-----------ENVEPTLTFILQESG- 1238
            LG  +  L+  LYS    F+ E++QNA D I             ++ E  +T  L  SG 
Sbjct: 13   LGGMIDILANHLYSSPRVFIREVLQNATDAITARKELGETAAESKSFEGKVTVELSGSGE 72

Query: 1239 ---IVVLNNEQGFSAENI-RALCDVGNSTKKG-----SSAGYIGRKGIGFKSVFRVTD 1287
               +++ +N  G + E+I R L  +G S+KKG     S   +IGR GIG  S F V+D
Sbjct: 73   QQVLMIQDNGIGLTEEDIHRFLSIIGQSSKKGTDLLASETSFIGRFGIGLLSCFMVSD 130


>gi|383625857|ref|ZP_09950263.1| hypothetical protein HlacAJ_21134 [Halobiforma lacisalsi AJ5]
 gi|448701043|ref|ZP_21699657.1| hypothetical protein C445_16934 [Halobiforma lacisalsi AJ5]
 gi|445778437|gb|EMA29381.1| hypothetical protein C445_16934 [Halobiforma lacisalsi AJ5]
          Length = 862

 Score = 49.3 bits (116), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 52/209 (24%), Positives = 89/209 (42%), Gaps = 38/209 (18%)

Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESG---IVVLNNEQGFSAENIRA-------- 1255
            ++F LE +QNADD    E        + Q  G   + +LNN + F+   +          
Sbjct: 6    TYFPLEFIQNADD----EGASAVRFQVRQIGGDWCLEILNNGRKFTDSALEGQDRDTDDV 61

Query: 1256 ------LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI-G 1308
                  LC  G S K      +IG  G+GFKS+F ++   E+HS  FH  FD    +  G
Sbjct: 62   KDDVTGLCAAGVSPKHPRD--HIGFIGVGFKSIFEISQCVEVHSGQFHFSFDRERAEKHG 119

Query: 1309 FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNT--CIRLPFRTKFSEGIAMNNIVDMFSD 1366
              +P  V P+  +      ++ P +++   + T   +RL  R     G  + +  ++F  
Sbjct: 120  DEIPWRVVPWECNATKE--TEVPTEIDGTEYTTRFVVRLDER-----GQELLSKDELFEP 172

Query: 1367 LHPS-----LLLFLHRLQCIMFRNMLNDS 1390
            LH       + LFL  ++ +   +  +D+
Sbjct: 173  LHSENIDRRVFLFLKDVRELSIEDHYSDN 201


>gi|6907044|gb|AAF31263.1| sacsin [Mus musculus]
          Length = 3830

 Score = 49.3 bits (116), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 15/100 (15%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 1789 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 1847

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
             +++R + ++G  TK+G+     G  GIGF SV+ +TD P
Sbjct: 1848 EDDVRGIQNLGKGTKEGNPCK-TGHYGIGFNSVYHITDCP 1886



 Score = 45.8 bits (107), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 62/258 (24%), Positives = 102/258 (39%), Gaps = 68/258 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 719  EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 777

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 778  SE-FSDSDFLNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 835

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 836  NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 890

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 891  GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 950

Query: 1387 ------LNDSLVVIRKKI 1398
                  L    ++I+KK+
Sbjct: 951  EETNPSLAQDTIIIKKKV 968


>gi|224148547|ref|XP_002336671.1| predicted protein [Populus trichocarpa]
 gi|222836495|gb|EEE74902.1| predicted protein [Populus trichocarpa]
          Length = 1060

 Score = 49.3 bits (116), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 41/126 (32%), Positives = 55/126 (43%), Gaps = 26/126 (20%)

Query: 1428 RPDVKTTEIALALTLQESN-EGNYGPLLYQQPVFAFLPLR-TYGLKFILQGDFVLPSSRE 1485
            R D++     L +TL   N E  +  + Y   ++AFLP        FI+Q DF+L SSRE
Sbjct: 26   RVDMRMEVEDLVITLAFPNGERLHRGMKYSPGIYAFLPTEMVTDFPFIIQADFILASSRE 85

Query: 1486 EVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAKAASVYMSFVPLVGEVHGF-FS 1544
             +  ++ WNQ +L   P  F+ A                       FV LV  VHG   S
Sbjct: 86   TIRWDNIWNQGILDCVPFAFIEA-----------------------FVSLVKTVHGAPAS 122

Query: 1545 GLPRMI 1550
             LPRM 
Sbjct: 123  SLPRMF 128


>gi|432115436|gb|ELK36851.1| Sacsin [Myotis davidii]
          Length = 2206

 Score = 49.3 bits (116), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 800  YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 858

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGFHVKFD 1301
             +++R + ++G  TK+G+     G+ GIGF SV+ +TD P  I  N     FD
Sbjct: 859  DDDVRGIQNLGKGTKEGNPYK-TGQYGIGFNSVYHITDCPSFISGNDILCIFD 910


>gi|170120064|ref|XP_001891101.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164633702|gb|EDQ98250.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 208

 Score = 49.3 bits (116), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 74/153 (48%), Gaps = 18/153 (11%)

Query: 1209 FLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
            F+L+   +A D+I+   ++PT     Q   ++  N+   F  E+  AL ++  S+KK  +
Sbjct: 46   FVLDHRTHATDHIHHPRLKPT-----QGPALLAFNDAM-FQEEDWEALQNIHRSSKKMDT 99

Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD-----TSEGQIGFVLPTLVPPF--NID 1321
            +  IG+ GIGF+SV+ +TD P+I S  +    D        G +   L      +  ++ 
Sbjct: 100  SK-IGKYGIGFRSVYHITDYPQIMSGPYLAILDPQHNFARSGGVKIELSANPSTYQGHLT 158

Query: 1322 MFCRLLSKDPVQLESKCW-NTCIRLPFRTKFSE 1353
             F  LL   P   + K +  + +RLP R + SE
Sbjct: 159  TFDSLL---PSDWQGKEFTGSVVRLPLRNRESE 188


>gi|297563109|ref|YP_003682083.1| ATP-binding region ATPase domain-containing protein [Nocardiopsis
            dassonvillei subsp. dassonvillei DSM 43111]
 gi|296847557|gb|ADH69577.1| ATP-binding region ATPase domain protein [Nocardiopsis dassonvillei
            subsp. dassonvillei DSM 43111]
          Length = 1084

 Score = 49.3 bits (116), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 43/93 (46%), Gaps = 1/93 (1%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
              ++EL QNA D      V   L   L    +   N     +AE + +L  +  STK+G 
Sbjct: 51   RVVVELAQNAADAARRAGVPGRLRLALTGHRLTAANTGAPLTAEGVESLATLRASTKRGE 110

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
              G  GR G+GF +V  ++D   + S G  V+F
Sbjct: 111  Q-GSAGRFGVGFSAVAALSDDVTVDSGGSAVRF 142


>gi|327269040|ref|XP_003219303.1| PREDICTED: sacsin-like [Anolis carolinensis]
          Length = 4592

 Score = 49.3 bits (116), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 50/105 (47%), Gaps = 25/105 (23%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ +   P     LQ   + V NN
Sbjct: 2545 YPSEKEMLKELLQNADDAKATEICFVFDPRQHSTDRIFDDKWAP-----LQGPALCVYNN 2599

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
             Q F+ ++IR + ++G  TK+G+     G+ GIGF SV+ +TD P
Sbjct: 2600 -QPFTEDDIRGIQNLGKGTKEGNPCK-TGQYGIGFNSVYHLTDCP 2642



 Score = 45.1 bits (105), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 68/178 (38%), Gaps = 51/178 (28%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E      +         Q S + V NN   F+
Sbjct: 114  YPEGGQILKELIQNAEDAGATEVRFLYDETQYGNESLWSKDMAQYQGSALYVYNNAI-FT 172

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S            QIG 
Sbjct: 173  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPSIFSG----------DQIGM 221

Query: 1310 VLP--TLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFS 1365
            + P  TL  P                   +CWN       +  F E   MN + D F+
Sbjct: 222  LDPHQTLFGPHE---------------SGQCWN------LKDDFKE---MNELTDQFA 255



 Score = 40.8 bits (94), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 49/116 (42%), Gaps = 17/116 (14%)

Query: 1201 ELYSQDSHFLLELVQNADD---------------NIYPENVEPTLTFILQESGIVVLNNE 1245
            E Y   S    EL+QNADD               N   EN+            +   NN 
Sbjct: 1474 EEYPSVSDIFKELLQNADDANATDCNFLIDMRRNNDIRENLLDPGMAASHGPALWSFNNS 1533

Query: 1246 QGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            + FS  +   +  +G S K+ +    +G+ G+GF SV+ +TD P I S  F + FD
Sbjct: 1534 E-FSDNDFLNITRLGESLKR-AEVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFD 1587


>gi|122066080|sp|Q9JLC8.2|SACS_MOUSE RecName: Full=Sacsin; AltName: Full=DnaJ homolog subfamily C member
            29; Short=DNAJC29
          Length = 4582

 Score = 49.3 bits (116), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 48/100 (48%), Gaps = 15/100 (15%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++     P           LQ   + V NN Q F+
Sbjct: 2541 YPSEKEMLKELLQNADDAKATEICFVFDPRQHPVDRIFDDKWAPLQGPALCVYNN-QPFT 2599

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
             +++R + ++G  TK+G+     G  GIGF SV+ +TD P
Sbjct: 2600 EDDVRGIQNLGKGTKEGNPCK-TGHYGIGFNSVYHITDCP 2638



 Score = 45.8 bits (107), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 62/258 (24%), Positives = 102/258 (39%), Gaps = 68/258 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1471 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1529

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1530 SE-FSDSDFLNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1587

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1588 NHISKHIKDRSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLAVEAPYSYN 1642

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   +  +  + +  
Sbjct: 1643 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVNSMYLKYLKI 1702

Query: 1387 ------LNDSLVVIRKKI 1398
                  L    ++I+KK+
Sbjct: 1703 EETNPSLAQDTIIIKKKV 1720



 Score = 45.4 bits (106), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S + V NN   F+
Sbjct: 107  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 165

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 166  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 208


>gi|392843861|gb|AFM84534.1| sacsin, partial [Batrachuperus yenyuanensis]
          Length = 736

 Score = 49.3 bits (116), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 50/109 (45%), Gaps = 25/109 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL+QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELLQNADDAKATEICFVFDPRYHPVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             Q F+ ++IR + ++G  TK+ +     G+ GIGF SV+ +TD P   S
Sbjct: 472  -QPFTEDDIRGIQNLGRGTKEANPCK-TGQYGIGFNSVYHITDCPSFMS 518


>gi|311747181|ref|ZP_07720966.1| hypothetical protein ALPR1_12575 [Algoriphagus sp. PR1]
 gi|126578890|gb|EAZ83054.1| hypothetical protein ALPR1_12575 [Algoriphagus sp. PR1]
          Length = 1880

 Score = 48.9 bits (115), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 3/110 (2%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            L   +  + + +YS     + EL+QNADD    + V   + F  +     ++++   FS 
Sbjct: 222  LATTMTEIGKGMYSSKQRMIFELLQNADDAPGRDKVGFHIDF--KGDYFFIMHDGAPFSK 279

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
             ++ A+   G STK+       G KGIGFKSVF  +    I S G+   F
Sbjct: 280  SDVEAITRSGESTKRDKRK-VTGYKGIGFKSVFTDSSEVWIKSGGYQFAF 328



 Score = 43.9 bits (102), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 62/126 (49%), Gaps = 12/126 (9%)

Query: 1400 GDGIIKVSCGED--KMTWFVAS-QKLRAGVIRPDVKT---TEIALALTLQES---NEGNY 1450
            G G++K S   D  ++T+F    +  +   I P + +   TE++    +Q     +E  Y
Sbjct: 492  GIGLLKDSQVNDYNEITYFFTDLEGNKITTIPPKIASVSETEVSFGFLIQNGKLVSEPGY 551

Query: 1451 GPLLYQ-QPVFAFLPL--RTYGLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVS 1507
                 Q   +F +LP+  + + L F++ GDFV  S REE+ G++ WN++++++     V 
Sbjct: 552  ATSNSQYSSLFTYLPMEDKRFKLPFLVNGDFVPDSRREEIQGDNLWNKYIMTKVAEKHVE 611

Query: 1508 AERSFC 1513
                F 
Sbjct: 612  TLEKFA 617


>gi|134097209|ref|YP_001102870.1| HSP90 family heat shock protein [Saccharopolyspora erythraea NRRL
            2338]
 gi|291005334|ref|ZP_06563307.1| HSP90 family heat shock protein [Saccharopolyspora erythraea NRRL
            2338]
 gi|133909832|emb|CAL99944.1| putative heat shock protein, HSP90-family [Saccharopolyspora
            erythraea NRRL 2338]
          Length = 984

 Score = 48.9 bits (115), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 47/183 (25%), Positives = 77/183 (42%), Gaps = 23/183 (12%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
              L+EL QNA D    + V   L   L +  + V N  +  +AE +  L  +  S K+  
Sbjct: 45   RLLVELAQNAADAAGSKGV---LRVRLADGELRVANTGEPLTAEGVAGLASLRASAKREG 101

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG-QIGFVLPTLVPPFNIDMFCRL 1326
            S   +GR G+GF +V  V++ P + S+   V F      Q    LP        +     
Sbjct: 102  S--TVGRYGVGFAAVLAVSEEPRVVSSSGSVTFSAERTRQEAERLPG-----PAEELAAR 154

Query: 1327 LSKDPV--------QLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
              + PV         +  + + T +RLP R+    G+  + +++ F++    LLL L  L
Sbjct: 155  AGEVPVLRLVWPLEDVPPEGFATEVRLPLRS----GVDADALLESFAEQASGLLLALPAL 210

Query: 1379 QCI 1381
              I
Sbjct: 211  TEI 213


>gi|340384534|ref|XP_003390766.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 3558

 Score = 48.9 bits (115), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 55/362 (15%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQ---- 1235
            QH  L   L  + +E Y      + E++QNADD        +Y +    T   + +    
Sbjct: 589  QHEPLTTRLGNILRE-YKDGLTIIKEMIQNADDAGATEVNILYDDRTHSTHKLLFKGMTE 647

Query: 1236 -ESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
                 ++++N   F+ E+   +  +  +TK       IG+ G+GF SV+ +TD P   S 
Sbjct: 648  SHGPALIVHNNSTFTKEDFENITKLAGATKANQPLK-IGKFGVGFCSVYHITDVPSFVSG 706

Query: 1295 GFHVKFDTSEGQI-GFVLPTLVPPFNIDMFCRLLSKD----PVQ-----LESKCWN-TCI 1343
             +   FD +   + G V     P   +    + L++     P +       +  +N T  
Sbjct: 707  EWLYIFDPTLKYLKGVVHNESRPGKKVQYQSKFLAQSQQMAPYEGLFGFTSTANYNGTIF 766

Query: 1344 RLPFRT---KFSEGIAMNNIVD-MFSDL---HPSLLLFLHRLQCIMFRNMLNDSLVVIR- 1395
            RLPFRT   + S  I   +++  M +DL      LLLFL  ++ I F ++  D L +   
Sbjct: 767  RLPFRTSPSQISSTIYNEHLIQKMKNDLINNGSKLLLFLQHVKRITFSSVQGDVLSINSI 826

Query: 1396 ------KKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALALTLQESNEGN 1449
                  K  + +     S  E    W V+ Q+ +      D +    ++A  L +    +
Sbjct: 827  NEVNGIKHCITNFSTSNSVSE---YWLVSCQEEKLQEQGGDYQPGTASVACQLFKEG-SS 882

Query: 1450 YGPLLYQQPVFAFLPLR--TYGLKFILQGDFVLPSSR----------EEVDGNSPWNQWL 1497
            +     +  VF FLPL     GL   +  +F + S+R          E  D    WN+ L
Sbjct: 883  FECKAVEGCVFCFLPLSLPCTGLPVHVNANFAVMSNRSGIWTEASSGEPSDSREYWNEKL 942

Query: 1498 LS 1499
            ++
Sbjct: 943  MT 944



 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 100/443 (22%), Positives = 157/443 (35%), Gaps = 122/443 (27%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENV----------EPTLT---FILQESGIVVLNNEQGFS 1249
            Y  D  F+ EL+QNADD    E +          E  L+     LQ   ++V N ++ FS
Sbjct: 1563 YPFDETFVKELIQNADDAKANEMIIILDRRQHGTEKVLSENWGELQGPALLVWN-DKNFS 1621

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
             +++  +  +G  +K+      IG+ GIGF  V+ VTD P   + G   KF   +    +
Sbjct: 1622 DQDLEGIQKLGLGSKRDDDES-IGQFGIGFNVVYHVTDCPSFITRG--EKFCVFDPHCKY 1678

Query: 1310 V---------------------LPTLVPPFNIDMFC-RLLSKDPVQLESKCWNTCIRLPF 1347
            V                     +P L   F  D F  +LL  D             R P 
Sbjct: 1679 VPGADAISPGRQYNIDKTFWNDMPDLSSAFLQDNFTSKLLEIDK--------GVLFRFPL 1730

Query: 1348 RTK---FSEGIAMNNIVD--MFSDLHPS-----------LLLFLHRLQCIMFRNMLNDSL 1391
            R+      +   +N I D  + +D+  S           LLLFL+ L    +  ++N+S 
Sbjct: 1731 RSTQKLVEQSKIINEINDTVVTADVMESYLRNWVSVIKDLLLFLNHLTTFEYY-VINESG 1789

Query: 1392 VVIRKKIVGDGIIKVSCGEDKMTWFVASQKLR--AGVIRPDVKTTEIALALTLQESNEGN 1449
             +  +      I +    ED +      + L     +++P + T  + L  ++  S E N
Sbjct: 1790 SIALQTKYNIHITE----EDHLKRIHLKESLSNFKSILQPCLVTYSLTLEASVNSSKESN 1845

Query: 1450 YGPLLYQQPV------------------------------------FAFLPLRTY-GLKF 1472
                L QQ V                                    F FLPL  Y  L  
Sbjct: 1846 REVWLIQQGVGDMLDPLKDWKFVPKILPQHGIATTLIRSQNFKGQVFCFLPLPIYTNLPV 1905

Query: 1473 ILQGDFVLPSSR--------EEVDGNSPWNQWLLSEFPALFVSAERSFCDLPCFRENPAK 1524
             + G F+L S R        E++D  S WN        +L+ +   S+       ++   
Sbjct: 1906 HINGKFILNSDRRSLWASTSEKLDDRSTWNN-------SLYKAIASSYVHFLKIAQSSYV 1958

Query: 1525 AASVYMSFVPLVGEVHGFFSGLP 1547
            AAS Y     L   V  ++   P
Sbjct: 1959 AASGYSKLEELFSAVDVYYGLFP 1981


>gi|392843847|gb|AFM84527.1| sacsin, partial [Melopsittacus undulatus]
          Length = 736

 Score = 48.9 bits (115), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 49/109 (44%), Gaps = 25/109 (22%)

Query: 1203 YSQDSHFLLELVQNADD------------------NIYPENVEPTLTFILQESGIVVLNN 1244
            Y  +   L EL QNADD                   I+ E   P     LQ   + V NN
Sbjct: 417  YPSEKEMLKELPQNADDAKATEICFVFDPRQHPVDRIFDEKWAP-----LQGPALCVYNN 471

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             Q F+ ++IR + ++G  TK G+     G+ GIGF SV+ VTD P   S
Sbjct: 472  -QPFTEDDIRGIQNLGKGTKVGNPCK-TGQYGIGFNSVYHVTDCPSFLS 518


>gi|356530237|ref|XP_003533689.1| PREDICTED: sacsin-like [Glycine max]
          Length = 4760

 Score = 48.9 bits (115), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 87/220 (39%), Gaps = 50/220 (22%)

Query: 1203 YSQDSHFLLELVQNADD------NIYPENVEPTLTFILQES-----GIVVLN-NEQGFSA 1250
            Y + +  L EL+QNADD      ++  +        +L +S     G  +L  N+  F+ 
Sbjct: 30   YPEGTTVLKELIQNADDAGATTVSLCLDRRSHAAGSLLSDSLAQWQGPALLAFNDAVFTE 89

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
            E+  ++  +G STK G  A   GR G+GF SV+ +TD P   S  + V FD      G  
Sbjct: 90   EDFVSISKIGGSTKHGQ-ASKTGRFGVGFNSVYHLTDLPSFVSGKYVVLFDPQ----GVY 144

Query: 1311 LPTLV---PPFNIDM---------------FCRLLSKDPVQLESKCWNTCIRLPFRTKFS 1352
            LP +    P   ID                +C         ++S    T  R P R  + 
Sbjct: 145  LPRVSAANPGKRIDFTGSSALSFYRDQFSPYCAF----GCDMQSPFSGTLFRFPLRNAYQ 200

Query: 1353 --------EGIAMNNIVDMFSDLHPS---LLLFLHRLQCI 1381
                    +  +  +I  MF  L+      LLFL  + CI
Sbjct: 201  AATSKLSRQAYSPEDISSMFVQLYEEGVLTLLFLKSVLCI 240



 Score = 44.3 bits (103), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 62/238 (26%), Positives = 93/238 (39%), Gaps = 55/238 (23%)

Query: 1187 QHARLG-RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGI------ 1239
            QH  L  R  H L  E+Y+     L E+VQNA+D    E     + F+L +S        
Sbjct: 1383 QHEALTTRLKHIL--EMYADGPGSLFEMVQNAEDAGASE-----VIFLLDKSHYGTSSIL 1435

Query: 1240 -----------VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDA 1288
                       +   N+  FS +++ A+  +G  +K    A  IGR G+GF  V+  TD 
Sbjct: 1436 SPEMADWQGPALYCYNDSVFSPQDLYAISRIGQESKL-EKAFAIGRFGLGFNCVYHFTDI 1494

Query: 1289 PEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR-LLSKDPVQ----------LESK 1337
            P   S    V FD     +  + P+  P   I    R +L + P Q          L+  
Sbjct: 1495 PMFVSGENVVMFDPHASNLPGISPSH-PGLRIKFVGRQILEQFPDQFSPLLHFGCDLQHP 1553

Query: 1338 CWNTCIRLPFRTKFSEGIA--------------MNNIVDMFSDLHPSLLLFLHRLQCI 1381
               T  R P RT    G+A              + ++   FS++    LLFL  ++ I
Sbjct: 1554 FPGTLFRFPLRTA---GVASRSQIKKEAYTPEDVRSLFAAFSEVVSETLLFLRNVKSI 1608


>gi|328854067|gb|EGG03202.1| hypothetical protein MELLADRAFT_90363 [Melampsora larici-populina
           98AG31]
          Length = 295

 Score = 48.9 bits (115), Expect = 0.036,   Method: Composition-based stats.
 Identities = 42/127 (33%), Positives = 54/127 (42%), Gaps = 9/127 (7%)

Query: 9   FRPVGGVGGEGTGGGQPPSQPPPPHFNPNFAFQ-NPN-IFFPNPALQLLQNLTNIPFQNP 66
           + P G +  + + G   P + P PH  P  A+  NP+   FP+   Q  Q   NIP Q P
Sbjct: 49  YPPPGILYPQFSQGAHNPHETPNPH-GPQHAYPVNPSHPLFPH--YQHQQPTNNIPHQQP 105

Query: 67  NFDIQNPNLFMQNANLLTQNPNLPPQQ---PPSSACNQQRSQTQHPPGASNQQRPLTQQP 123
               Q PN   Q+     Q P+ P QQ   P      Q  +  Q PPG +N       Q 
Sbjct: 106 PNSYQQPNHPYQHTTSHYQQPHHPYQQHHNPYQQPFQQPFNPYQQPPG-NNFIHSQPSQN 164

Query: 124 QSLPPQP 130
             LPPQP
Sbjct: 165 HFLPPQP 171


>gi|432934233|ref|XP_004081920.1| PREDICTED: sacsin-like [Oryzias latipes]
          Length = 2838

 Score = 48.5 bits (114), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 66/276 (23%), Positives = 107/276 (38%), Gaps = 60/276 (21%)

Query: 1157 DAALIVESIRRDEFGLGPNISNMESNMLKKQHARLG-RALHCLSQELYSQDSHFLLELVQ 1215
            +A  + ESI  D      +I  + + + K+    L  R  + +S   YS     L EL+Q
Sbjct: 1203 NAMFLHESIEED-----TDIGALAAQICKEHREELAVRIKNIISA--YSSKKDILKELIQ 1255

Query: 1216 NADDNIYPENVEPTLTFI------------------LQESGIVVLNNEQGFSAENIRALC 1257
            NADD    E     + F+                  LQ   + V NN + FS E+++ + 
Sbjct: 1256 NADDAEATE-----IHFVWDKRQHGKEKTFGERWNHLQGPALCVFNN-KVFSDEDLKGIQ 1309

Query: 1258 DVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG------------FHVKFDTSEG 1305
             +G   K+  + G  G+ G+GF SV+ +TD P I +                 + +  + 
Sbjct: 1310 QLGEGGKR-ETLGKTGKFGVGFNSVYHLTDCPSILTGDKFLCISDPNMKYIETQPNNHKL 1368

Query: 1306 QIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIA--------- 1356
             IG+ L         D++   L +  +  E     T  RLP RT++S   +         
Sbjct: 1369 GIGYKLADTFKEMYDDVYRSFLPEQFLLEEG----TMFRLPLRTEYSAKNSRISCQEVTE 1424

Query: 1357 --MNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDS 1390
              M  +    SD    L+LFL  +  I    + N S
Sbjct: 1425 HDMRELCSALSDDPEGLILFLKNICKITVHEINNQS 1460



 Score = 46.6 bits (109), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 60/274 (21%), Positives = 109/274 (39%), Gaps = 68/274 (24%)

Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
            L  ES+  ++FG    I+    N+LK+                Y + +    EL+QNA+D
Sbjct: 699  LAPESLGIEQFGQREPITTRIKNILKE----------------YDEPNDIFKELIQNAED 742

Query: 1220 N---------IYPENVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTK 1264
                       + E+ +P  + I       Q   +   NN+Q F+ E+   +  VG+++K
Sbjct: 743  AGADTCKFMIDFREHRDPPESLIDPGMSLCQGPCLWAFNNKQ-FTTEDWENIVKVGSASK 801

Query: 1265 KGSSAGYIGRKGIGFKSVFRVTDAPEI---------------------HSNGFHVKFDTS 1303
            + +    IG+ G+GF +V+ VTD P I                     H     ++ D S
Sbjct: 802  E-TKVEKIGKFGLGFNTVYHVTDVPSILSGSRLLILDPNVTHLKKHIQHQTNPGIRLDLS 860

Query: 1304 EGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPF-------RTKFSEGIA 1356
            + +I    P     +     C    + P +  +    T I+LPF       R++ S+ + 
Sbjct: 861  QKRILDCFPGQFAQYEHIFDCNFSRQSPPKPYT---GTLIKLPFRSEKEVLRSEISKKVF 917

Query: 1357 MNNIV----DMFSDLHPSLLLFLHRLQCIMFRNM 1386
              + +    D F +   S LLFL  +  +  +++
Sbjct: 918  YKHDILALRDYFRENSKSYLLFLKNINTLSLQSI 951



 Score = 43.9 bits (102), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 49/118 (41%), Gaps = 13/118 (11%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG-----IVVLNNEQGFSA 1250
            Y      L EL+QNADD        I+ E    T +    + G      +   N   F+ 
Sbjct: 30   YPDGGQILKELIQNADDGQATEVVFIHDERRYGTESLWTNKLGKYQGPALYAYNNTAFTD 89

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
            E+   +   G S K  +    IGR GIGF SV+ +TD P I S+G     D  E   G
Sbjct: 90   EDWERIQMAGRSGK-VNDPNKIGRFGIGFNSVYHITDVPIIFSSGHLGVMDPQENIFG 146


>gi|410897104|ref|XP_003962039.1| PREDICTED: sacsin-like [Takifugu rubripes]
          Length = 3639

 Score = 48.5 bits (114), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 67/240 (27%), Positives = 101/240 (42%), Gaps = 47/240 (19%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG-----IVVLNNEQGFSA 1250
            Y      L EL+QNADD        I+ +    T +   QE G      + + N+  F+ 
Sbjct: 31   YPDGGQILKELIQNADDAQATEVVFIHDDRSYGTESLWTQELGQYQGPALYVYNDASFTE 90

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI--- 1307
            E+   +   G S K+ +    +GR GIGF SV+ +TD P I S G H+ F   + QI   
Sbjct: 91   EDWEGIQLAGRSFKR-NDPNKVGRFGIGFNSVYHITDLPCILS-GSHLGFMDPQEQIFGE 148

Query: 1308 ---GF-----------VLPTLVPPFN-IDMFCRLLSK---DPVQLESKCWN-TCIRLPFR 1348
               GF            L T+   F  I     L+S      +  E + +N T  R P R
Sbjct: 149  REGGFRWYMDDPQHQEYLTTMHDQFQPIRDIVSLISNKKWSSIVTEHQYFNGTLFRFPLR 208

Query: 1349 TKFSEGIAMN-----NIVDMF----SDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
             + SE I+ N      + D+F    +D   S LLFL  +  +  +++  D  V  R ++ 
Sbjct: 209  NRVSE-ISDNLYNSGKVADLFESFITDAELS-LLFLKNVASVSLKHVTADGEVSTRLQVT 266



 Score = 46.2 bits (108), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 48/102 (47%), Gaps = 15/102 (14%)

Query: 1203 YSQDSHFLLELVQNADDNIYPE----------NVEPTLT---FILQESGIVVLNNEQGFS 1249
            Y      L EL+QNADD    E          +VE T       LQ   + V NN + FS
Sbjct: 2163 YPSKKDILKELIQNADDAEATEIHFVWDKRQHDVEKTFGENWNQLQGPALCVFNN-KVFS 2221

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI 1291
              +++ +  +G   K  S+ G  GR G+GF SV+ +TD P I
Sbjct: 2222 DADLKGIQRLGEGGKH-STPGKTGRYGVGFNSVYHLTDCPSI 2262



 Score = 45.1 bits (105), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 74/184 (40%), Gaps = 44/184 (23%)

Query: 1203 YSQDSHFLLELVQNADD---------------NIYPENVEPTLTFILQESGIVVLNNEQG 1247
            Y ++S    EL+QNA+D                  PE +      + Q   +   NNE  
Sbjct: 1172 YDEESDIFKELIQNAEDAGAEQCKFLVDFRAHKYSPEELIDRDMSLCQGPCLWAFNNET- 1230

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH- 1297
            F  ++   +  VG+++K+ +    IG+ G+GF +V+ VTD P + S         N  H 
Sbjct: 1231 FKDDDWANIVRVGSASKE-NQVEKIGKFGLGFNTVYHVTDIPSVLSGTSLLILDPNVTHL 1289

Query: 1298 -----------VKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLP 1346
                       +K + S+ ++    P    P+     C    ++P         T IRLP
Sbjct: 1290 KTHINNKSNPGIKLNLSQRELFKCFPGQFGPYERIFDCDFTQQNPYA------GTLIRLP 1343

Query: 1347 FRTK 1350
            FRT+
Sbjct: 1344 FRTE 1347


>gi|340367671|ref|XP_003382377.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 3279

 Score = 48.5 bits (114), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 47/111 (42%), Gaps = 17/111 (15%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENV----------------EPTLTFILQESGIVVLNNEQ 1246
            Y  D+    E +QNADD    E V                E T    LQ    + + N +
Sbjct: 1337 YPADTSIFREYIQNADDAKATEIVFVLDHRSKHPQEKLLTEGTEWASLQRVPALCIYNNR 1396

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFH 1297
             F  E+I+ +C +G   K G +A  IGR GIGF   + +TD P   S   H
Sbjct: 1397 QFMEEDIKGICKLGRGGK-GDTADTIGRFGIGFNVSYHLTDCPTFVSFDSH 1446


>gi|402311146|ref|ZP_10830095.1| hypothetical protein HMPREF1142_1759 [Eubacterium sp. AS15]
 gi|400365764|gb|EJP18812.1| hypothetical protein HMPREF1142_1759 [Eubacterium sp. AS15]
          Length = 1235

 Score = 48.5 bits (114), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 48/83 (57%), Gaps = 5/83 (6%)

Query: 1212 ELVQNADDNIYPENVEPTLTFILQESGIVVLNNE-QGFSAENIRALCDVGNSTKKGSSAG 1270
            EL+QNA+D+I  E +E ++     +S + +   E +GF+ ++  A+   GNS  + S   
Sbjct: 190  ELLQNANDHISEEIMEVSI----DDSKLCLEYAENEGFTVKDFYAIVTEGNSGNESSLEN 245

Query: 1271 YIGRKGIGFKSVFRVTDAPEIHS 1293
              GRKG GFKSV++  D  EI S
Sbjct: 246  REGRKGTGFKSVYKYFDKVEISS 268


>gi|380490481|emb|CCF35984.1| hypothetical protein CH063_07653 [Colletotrichum higginsianum]
          Length = 151

 Score = 48.5 bits (114), Expect = 0.042,   Method: Composition-based stats.
 Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 20/92 (21%)

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPEN----VEPTLTFILQ--ESGIVVLNN-- 1244
            R+L  L Q+LYS ++ F+ E++Q ADDN Y ++    + PT +  L+   +G    +N  
Sbjct: 53   RSLEILLQQLYSTETRFIHEIIQTADDNAYGKSATNGIPPTFSLSLKVGRTGDTNTDNAN 112

Query: 1245 ------------EQGFSAENIRALCDVGNSTK 1264
                        E GFS   + AL D+G STK
Sbjct: 113  PGKGYILETACSEDGFSMSQMTALTDIGESTK 144


>gi|365842393|ref|ZP_09383406.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Flavonifractor plautii ATCC 29863]
 gi|364575731|gb|EHM53107.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Flavonifractor plautii ATCC 29863]
          Length = 605

 Score = 48.5 bits (114), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 54/117 (46%), Gaps = 13/117 (11%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNI------YPENVEPTLTFILQESG-IVVLN 1243
            LG  L  LS  LY     FL EL+QN  D +       P+     LT  ++    ++  +
Sbjct: 31   LGGMLDVLSNHLYKSPDVFLRELLQNGVDAVTLRQKSQPDWNRGKLTITVEPGRRLIFRD 90

Query: 1244 NEQGFSAENI-RALCDVGNSTKKGSSAG-----YIGRKGIGFKSVFRVTDAPEIHSN 1294
            N  G S E I R L  +G S+K     G     YIGR GIG  S F V+D+  +H+ 
Sbjct: 91   NGAGLSEEGIHRFLAVIGQSSKTELVNGQIPEDYIGRFGIGLLSCFMVSDSIVVHTR 147


>gi|357400009|ref|YP_004911934.1| hypothetical protein SCAT_2418 [Streptomyces cattleya NRRL 8057 = DSM
            46488]
 gi|386356049|ref|YP_006054295.1| hypothetical protein SCATT_24020 [Streptomyces cattleya NRRL 8057 =
            DSM 46488]
 gi|337766418|emb|CCB75129.1| conserved protein of unknown function [Streptomyces cattleya NRRL
            8057 = DSM 46488]
 gi|365806557|gb|AEW94773.1| hypothetical protein SCATT_24020 [Streptomyces cattleya NRRL 8057 =
            DSM 46488]
          Length = 1038

 Score = 48.5 bits (114), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 48/212 (22%), Positives = 86/212 (40%), Gaps = 13/212 (6%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
              ++EL QNA D          L   L+E  +   N      A  + +L  +  S K+  
Sbjct: 54   RLVVELAQNAADAAVRAGTTGRLRLTLREGVLAAANTGAPLDAAGVESLSTLRASAKRDD 113

Query: 1268 SA-GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRL 1326
            +A   +GR G+GF +V  V+D P +      V++  +E +          P   D   R 
Sbjct: 114  AAEAAVGRFGVGFSAVLAVSDEPAVIGRTGGVRWSLAEARAAAEETAADAPELADELRRR 173

Query: 1327 -----LSKDPVQLESKC---WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
                 L + P   E      ++T + LP R   +E +A      + + +  +LLL L  L
Sbjct: 174  DGHVPLLRLPFPAEGAAPEGYDTVVVLPLRDGAAEDLAHR----LLAGVDDALLLTLPGL 229

Query: 1379 QCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE 1410
              ++        ++  R+ + GD +++ S G+
Sbjct: 230  AEVVVETADTVRILTRRQLLEGDVVVEDSAGQ 261


>gi|347839221|emb|CCD53793.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 2724

 Score = 48.5 bits (114), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 99/243 (40%), Gaps = 47/243 (19%)

Query: 1203 YSQDSHFLLELVQNADD------------NIY-PENVEPTLTFILQESGIVVLNNEQGFS 1249
            Y      L EL+QNADD            N Y  EN+ P L    Q   ++  NN + F 
Sbjct: 26   YPAGGSVLRELLQNADDAQASEVKFFLDENSYGTENLPPKLA-QYQGPALLAYNNAE-FK 83

Query: 1250 AENIRALCDVGNSTK--KGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF-----DT 1302
             E+ ++L  VGNS K   G++ G  GR   GF SV+  TD+P I S G  V+        
Sbjct: 84   KEHFQSLSQVGNSLKMEDGATTGKFGR---GFNSVYNWTDSPSIVS-GHRVQILDPHHSW 139

Query: 1303 SEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRT---KFSEGIA--- 1356
            S G  G+         ++     +        +     T IRLP RT       GI+   
Sbjct: 140  SSGGNGYNFVEHSENSSLKAHMTVYQSFIKNAKQPFPGTIIRLPLRTIEQAKESGISDRK 199

Query: 1357 -----MNNIVDMFS-DLHPSLLLFL-HRLQCIMFRNM--------LNDSLVVIRKKIVGD 1401
                 M N+++ F+ D     LLF+ H ++  +  N+        +N   V + K  V D
Sbjct: 200  ITAQEMANVLEKFTEDFGKEGLLFMKHVIKISVISNITGLIEIEVMNKKAVQLNKTKVND 259

Query: 1402 GII 1404
             I+
Sbjct: 260  AIV 262


>gi|408827281|ref|ZP_11212171.1| hypothetical protein SsomD4_08878 [Streptomyces somaliensis DSM
            40738]
          Length = 1588

 Score = 48.1 bits (113), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 62/155 (40%), Gaps = 24/155 (15%)

Query: 1210 LLELVQNADDNIYPENVEP--TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            L ELVQNA D +     EP   +  +L E+ +   N     + E    +  +  S K+G 
Sbjct: 50   LFELVQNAADEM---RDEPGCRIHVVLTETHLYCANQGNPVTPEGAETILRMSMSRKRG- 105

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS--------------EGQIGFVLPT 1313
              G IGR G+G KSV  VTD PE  S      FD +                   F  P 
Sbjct: 106  --GQIGRFGVGVKSVLVVTDTPEFFSRTGSFGFDRTWSYESIRNVPGVRERHGDDFEAPV 163

Query: 1314 LVPPFNIDMFCRLLSKDPVQLESKCW-NTCIRLPF 1347
            L     +DM    L +D V  E   W  T +RLP 
Sbjct: 164  LRMARELDMTAERL-RDRVLDELLEWATTVVRLPL 197


>gi|379732169|ref|YP_005324365.1| hypothetical protein SGRA_4066 [Saprospira grandis str. Lewin]
 gi|378577780|gb|AFC26781.1| hypothetical protein SGRA_4066 [Saprospira grandis str. Lewin]
          Length = 1842

 Score = 48.1 bits (113), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 9/86 (10%)

Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
            +QD   + E +QNA D+   +        I  E   +VLNN + F+ ++++AL +    T
Sbjct: 62   AQDEQAIYEFLQNAADSQASD-----CAVIYDEQYFMVLNNGRAFTDKDLKALLNSFQGT 116

Query: 1264 K----KGSSAGYIGRKGIGFKSVFRV 1285
            K    K  + G IGR GIGFK  +R+
Sbjct: 117  KADKSKAENCGKIGRYGIGFKLAYRL 142


>gi|152967525|ref|YP_001363309.1| hypothetical protein Krad_3582 [Kineococcus radiotolerans SRS30216]
 gi|151362042|gb|ABS05045.1| conserved hypothetical protein [Kineococcus radiotolerans SRS30216]
          Length = 985

 Score = 48.1 bits (113), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 2/68 (2%)

Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSA--GYIGRKGIGFKSVFRVTDAPEIHSNGF 1296
            ++  N      AE ++ L  +  S K G +   G +GR G+GF +V  VTDAP I S+  
Sbjct: 35   LLAANTGAPLDAEGVQGLATLRASAKAGPAGAPGVVGRFGVGFAAVLAVTDAPAIRSSTG 94

Query: 1297 HVKFDTSE 1304
             V+F  +E
Sbjct: 95   AVRFSRTE 102


>gi|424843792|ref|ZP_18268417.1| hypothetical protein SapgrDRAFT_3267 [Saprospira grandis DSM 2844]
 gi|395321990|gb|EJF54911.1| hypothetical protein SapgrDRAFT_3267 [Saprospira grandis DSM 2844]
          Length = 1840

 Score = 48.1 bits (113), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 9/86 (10%)

Query: 1204 SQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNST 1263
            +QD   + E +QNA D+   +        I  E   +VLNN + F+ ++++AL +    T
Sbjct: 62   AQDEQAIYEFLQNAADSQASD-----CAVIYDEQYFMVLNNGRAFTDKDLKALLNSFQGT 116

Query: 1264 K----KGSSAGYIGRKGIGFKSVFRV 1285
            K    K  + G IGR GIGFK  +R+
Sbjct: 117  KADKSKAENCGKIGRYGIGFKLAYRL 142


>gi|224083922|ref|XP_002307173.1| predicted protein [Populus trichocarpa]
 gi|222856622|gb|EEE94169.1| predicted protein [Populus trichocarpa]
          Length = 2870

 Score = 48.1 bits (113), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 57/117 (48%), Gaps = 13/117 (11%)

Query: 1203 YSQDSHFLLELVQNADD------NIYPENVEPTLTFILQES-----GIVVLN-NEQGFSA 1250
            Y + +  L EL+QNADD       +  +    + T +L  S     G  +L+ N+  F+ 
Sbjct: 35   YPEGTTVLKELIQNADDAGATVVRLCLDRRHHSTTSLLSPSLSAFQGPALLSYNDAVFTE 94

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            E+  ++  +G S K  + A   GR G+GF SV+ +TD P   S  + V FD   G +
Sbjct: 95   EDFTSISRIGGSVKH-NQATKTGRFGVGFNSVYHLTDLPSFVSGNYIVMFDPQGGYL 150



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 68/248 (27%), Positives = 96/248 (38%), Gaps = 59/248 (23%)

Query: 1187 QHARLG-RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFIL----------- 1234
            QH  L  R  H L  E+Y+     L ELVQNA+D    E     + F+L           
Sbjct: 1398 QHEALTTRLKHIL--EMYADGPGILFELVQNAEDAGASE-----VIFLLDKTQYGTSSVL 1450

Query: 1235 -------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
                   Q   +   NN   FS++++ A+  +G  +K       IGR G+GF  V+  TD
Sbjct: 1451 SPEMADWQGPALYCFNNSV-FSSQDLYAISRIGQESKLEKPFA-IGRFGLGFNCVYHFTD 1508

Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR-LLSKDPVQ----------LES 1336
             P   S    V FD     +  + P+  P   I    R +L + P Q          L+ 
Sbjct: 1509 IPTFVSGENVVMFDPHACNLPGISPSH-PGLRIKFTGRKILEQFPDQFSPFLHFGCDLQH 1567

Query: 1337 KCWNTCIRLPFRTKF--------SEGIAMNNIVDM---FSDLHPSLLLFLHRLQCIMFRN 1385
                T  R P R+           EG A  +++ +   FS +    LLFL        RN
Sbjct: 1568 PFPGTLFRFPLRSSTVARRSLIKKEGYAPEDVMSLFTSFSGVVSDALLFL--------RN 1619

Query: 1386 MLNDSLVV 1393
            + N SL V
Sbjct: 1620 VKNISLFV 1627


>gi|156402329|ref|XP_001639543.1| predicted protein [Nematostella vectensis]
 gi|156226672|gb|EDO47480.1| predicted protein [Nematostella vectensis]
          Length = 4754

 Score = 47.8 bits (112), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 18/137 (13%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENV-EPTLTFILQES- 1237
            Q+  L R L  L +E Y+       EL+QNADD        +Y E   E  +T ++ E  
Sbjct: 1267 QYEPLTRRLQRLLEE-YTDGFSVPKELIQNADDAGATEVRFLYDERANEDAMTCLISEGM 1325

Query: 1238 ------GIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI 1291
                   + V NN   F+  + + +  +  +TK+  S G IGR G+GF +V+ +TD P +
Sbjct: 1326 KECQGPALWVYNNAV-FTDADFQNITKLNGATKEEDS-GKIGRFGLGFNAVYNITDVPSL 1383

Query: 1292 HSNGFHVKFDTSEGQIG 1308
             S    V FD     +G
Sbjct: 1384 LSRNNLVIFDPHTTHLG 1400



 Score = 44.3 bits (103), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 55/129 (42%), Gaps = 30/129 (23%)

Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADD------------------NIYPENV 1226
            +K   RL R L     E Y  D   L EL+QNADD                   I+  + 
Sbjct: 2307 EKLTTRLKRIL-----EGYPCDGAILKELLQNADDAKASKICFIKDLRRHPSERIFEHSW 2361

Query: 1227 EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVT 1286
            EP     LQ   + V N+   F+ ++I  +  +G  +K G      G+ G+GF +V+ +T
Sbjct: 2362 EP-----LQGPALCVYNDSP-FTTQDIEGIQRLGEGSK-GDDPNKTGQYGVGFNAVYHMT 2414

Query: 1287 DAPEIHSNG 1295
            D P   S G
Sbjct: 2415 DCPSFVSKG 2423


>gi|345850264|ref|ZP_08803264.1| hypothetical protein SZN_11043 [Streptomyces zinciresistens K42]
 gi|345638310|gb|EGX59817.1| hypothetical protein SZN_11043 [Streptomyces zinciresistens K42]
          Length = 1040

 Score = 47.8 bits (112), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 49/208 (23%), Positives = 82/208 (39%), Gaps = 36/208 (17%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
              ++EL QNA D      V   L   L+E  +   N      A  + +L  +  S K+ +
Sbjct: 53   RLVVELAQNAADAAGRAGVPGRLRLTLREGVLFAANTGAPLDAAGVESLSTLRASAKRDT 112

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF----------DTSEGQIGFV------- 1310
             +  +GR G+GF SV  VTD P +      V++          +T+ G  G         
Sbjct: 113  DS--VGRFGVGFASVLAVTDEPAVVGRHGGVRWALAEARDLALETAGGSAGLGEEIRRRD 170

Query: 1311 --LPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLH 1368
              +P L  PF  +      + DP       ++T + LP R   + G+A      +   + 
Sbjct: 171  GHVPLLRLPFAAEGT----APDP-------YDTVVILPLRDTAAAGLAER----LLRAVD 215

Query: 1369 PSLLLFLHRLQCIMFRNMLNDSLVVIRK 1396
             +LLL L  L+ ++      +   V R+
Sbjct: 216  DALLLALPGLEEVVIEVGDGEPRTVRRR 243


>gi|326430063|gb|EGD75633.1| hypothetical protein PTSG_06697 [Salpingoeca sp. ATCC 50818]
          Length = 5949

 Score = 47.8 bits (112), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 86/204 (42%), Gaps = 30/204 (14%)

Query: 1203 YSQDSHFLLELVQNADD-------------NIYPENVEPTLTFILQESGIVVLNNEQGFS 1249
            Y + +  + EL+QNADD                 E +    T   Q   ++  N+   FS
Sbjct: 30   YPEGTSVIKELIQNADDAEATCVRIALDKRQFGTEKLRFPDTGKFQGPALLCYNDSV-FS 88

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG- 1308
             E+  ++ ++  S+K+  +   +GR G+GF SV+ +TD P   S  ++V FD  +  +G 
Sbjct: 89   DEDFTSIKNICRSSKQADTT-KVGRFGLGFNSVYHITDLPSFVSRNYYVVFDPHKSSLGT 147

Query: 1309 ------FVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGI------- 1355
                  F           D +  + +     + +    T  RLP RT  SE         
Sbjct: 148  EGRRVKFTSKHERDGKYEDTWATMRAFG-CTMTADFKGTLFRLPLRTAASELSSSCHTID 206

Query: 1356 AMNNIVDMFSDLHPSLLLFLHRLQ 1379
            +M +++D  +   P +LLFL  ++
Sbjct: 207  SMLSLLDTLAAEAPHMLLFLRHVR 230


>gi|340367673|ref|XP_003382378.1| PREDICTED: sacsin-like [Amphimedon queenslandica]
          Length = 992

 Score = 47.8 bits (112), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 51/105 (48%), Gaps = 17/105 (16%)

Query: 1203 YSQDSHFLLELVQNADD------------NIYPEN--VEPTLTFILQESGIVVLNNEQGF 1248
            Y      L EL+QNADD              YPEN  + PTL  + Q   ++  NN   F
Sbjct: 31   YPDGGQILKELIQNADDAGATVRKFCLDTKQYPENDLMFPTLN-LFQGPALLAYNNAI-F 88

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
              E+      + NS K+ ++A  +GR G+GF SVF +TD P + S
Sbjct: 89   QEEDWEHFGRIENSGKE-TAALKVGRFGVGFLSVFHITDLPMVIS 132


>gi|195997653|ref|XP_002108695.1| predicted protein [Trichoplax adhaerens]
 gi|190589471|gb|EDV29493.1| predicted protein [Trichoplax adhaerens]
          Length = 1421

 Score = 47.8 bits (112), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 89/194 (45%), Gaps = 41/194 (21%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS----- 1262
              + EL++NA+D+    ++  +    ++E  + V +  + F   N+ A+C  GN+     
Sbjct: 42   QIIFELLRNANDH---SDISLSFKIEIKERYLSVSHTGKAFDLGNVEAIC--GNTQSKSN 96

Query: 1263 --TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT--------SEGQIGFVLP 1312
                  ++  YIG   +GFKSVF  +DA  I S G+  +FD+        S+  +    P
Sbjct: 97   SKKNNSNNNDYIG---MGFKSVFVFSDAAFIKSGGYTFRFDSSYQKWNKPSQSDLASDYP 153

Query: 1313 TLVPPF---NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHP 1369
              V P    N D+  R+ +  P       +NT  +  F+ K  E I      D+F DL  
Sbjct: 154  WQVIPIWTENKDLPHRIANVPP------GYNT--QFLFKLKKDEIIE-----DLFQDLKK 200

Query: 1370 SLL--LFLHRLQCI 1381
            SLL  L+L +++ I
Sbjct: 201  SLLFMLYLRKIKMI 214


>gi|302844699|ref|XP_002953889.1| hypothetical protein VOLCADRAFT_118510 [Volvox carteri f.
            nagariensis]
 gi|300260701|gb|EFJ44918.1| hypothetical protein VOLCADRAFT_118510 [Volvox carteri f.
            nagariensis]
          Length = 5796

 Score = 47.8 bits (112), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 50/111 (45%), Gaps = 13/111 (11%)

Query: 1203 YSQDSHFLLELVQNADD------------NIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            Y +    L+EL+QNADD              YP +   +    + +   +++ N+  FS 
Sbjct: 1759 YPEGPGVLMELLQNADDAGATSLELLLDDTTYPASSILSPAMAVWQGPALLVANDAVFSP 1818

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
             +   +  +G  +K  S    IGR G+GF +V+  TD P   S  + V FD
Sbjct: 1819 ADFANISRIGQDSKA-SRPTSIGRFGLGFNAVYHFTDLPCFVSGDYLVMFD 1868


>gi|390366753|ref|XP_003731107.1| PREDICTED: sacsin-like [Strongylocentrotus purpuratus]
          Length = 292

 Score = 47.8 bits (112), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 38/62 (61%), Gaps = 2/62 (3%)

Query: 1234 LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
            LQ   + V NN + F+ ++IR + ++G  +K     G IGR G+GF +V+++TD P   S
Sbjct: 4    LQGPALCVFNN-KSFTDDDIRGIQNLGEGSK-ADHMGKIGRYGVGFNAVYQLTDCPSFIS 61

Query: 1294 NG 1295
            NG
Sbjct: 62   NG 63


>gi|254443594|ref|ZP_05057070.1| hypothetical protein VDG1235_1830 [Verrucomicrobiae bacterium DG1235]
 gi|198257902|gb|EDY82210.1| hypothetical protein VDG1235_1830 [Verrucomicrobiae bacterium DG1235]
          Length = 1483

 Score = 47.8 bits (112), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 9/118 (7%)

Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFI-LQESGIVVLN 1243
            + + A+   ++  L + + S+  H + EL+QNADD         T   I ++E   V  +
Sbjct: 23   QSESAQYRNSVRQLFETVASRVQHVVSELLQNADD------AGATCARIEIEEDNFVFRH 76

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            +   F  E + ALC    S+K   +    G +GIGFKS F + D+    +  + ++FD
Sbjct: 77   DGADFEPEQLEALCSFAVSSKH--TIKTTGFRGIGFKSAFSIGDSVYFITPSYQIRFD 132


>gi|297737937|emb|CBI27138.3| unnamed protein product [Vitis vinifera]
          Length = 3960

 Score = 47.4 bits (111), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 57/125 (45%), Gaps = 19/125 (15%)

Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL---QESGIVVLNNEQGFS 1249
            Y + +  L EL+QNADD          +      E  L+  L   Q   ++  NN + F+
Sbjct: 37   YPEGTTVLKELIQNADDAGATKVCLCLDRRVHGSESLLSEKLAQWQGPALLAYNNAE-FT 95

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGF 1309
             E+  ++  +G S K G  A   GR G+GF SV+ +TD P   S  + V FD      G 
Sbjct: 96   EEDFVSISRIGGSNKHGQ-AWKTGRFGVGFNSVYHLTDLPSFVSGKYVVLFDPQ----GV 150

Query: 1310 VLPTL 1314
             LP +
Sbjct: 151  YLPNV 155


>gi|124001984|ref|ZP_01686838.1| hypothetical protein M23134_01824 [Microscilla marina ATCC 23134]
 gi|123992450|gb|EAY31795.1| hypothetical protein M23134_01824 [Microscilla marina ATCC 23134]
          Length = 1914

 Score = 47.4 bits (111), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 24/115 (20%)

Query: 1171 GLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTL 1230
            G  P+I+N  S  LK                  +QD   + E +QNA D           
Sbjct: 47   GAVPDIANFLSTPLK-----------------IAQDDQAIYEFIQNAADA-----QSSLF 84

Query: 1231 TFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRV 1285
                 ES  V +NN + FS +++ +L ++  STKK  S   IGR GIGFK  +R+
Sbjct: 85   NIFYDESYFVAINNGKPFSRDDLTSLLNIAASTKK--SCEKIGRFGIGFKLAYRL 137


>gi|303284333|ref|XP_003061457.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456787|gb|EEH54087.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 5511

 Score = 47.4 bits (111), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 54/112 (48%), Gaps = 18/112 (16%)

Query: 1203 YSQDSHFLLELVQNADD--------------NIYPENVEPTLTFILQESGIVVLNNEQGF 1248
            Y + +  L ELVQNADD              +     V P L     +   ++++N+  F
Sbjct: 32   YPEGTSILKELVQNADDAGATVVKVCLDARSHGTSTLVSPALAPF--QGPALLVHNDSVF 89

Query: 1249 SAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
            +  +  ++  VG+S K+ +  G  GR G+GF SV+ +TD P   S G HV F
Sbjct: 90   AEADFVSISRVGDSVKR-TQVGKTGRFGVGFNSVYHLTDVPSFVS-GRHVVF 139


>gi|357166714|ref|XP_003580814.1| PREDICTED: sacsin-like [Brachypodium distachyon]
          Length = 4693

 Score = 47.4 bits (111), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 13/111 (11%)

Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL-QESGIVVL-NNEQGFSA 1250
            Y + +  L EL+QNADD          +         L   L Q  G  +L +N+  F+ 
Sbjct: 32   YPEGTTALRELIQNADDAGASAVRLCLDRRSHGARSLLAPALAQWQGPALLAHNDAVFTD 91

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            E+  ++  +G+S KK S A   GR G+GF SV+ +TD P   S  + V FD
Sbjct: 92   EDFASISRIGDS-KKVSQAWKTGRFGVGFNSVYHLTDLPSFVSGKYVVMFD 141


>gi|154299794|ref|XP_001550315.1| hypothetical protein BC1G_11523 [Botryotinia fuckeliana B05.10]
          Length = 2580

 Score = 47.4 bits (111), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 51/106 (48%), Gaps = 20/106 (18%)

Query: 1203 YSQDSHFLLELVQNADD------------NIY-PENVEPTLTFILQESGIVVLNNEQGFS 1249
            Y      L EL+QNADD            N Y  EN+ P L    Q   ++  NN + F 
Sbjct: 26   YPAGGSVLRELLQNADDAQASEVKFFLDENSYGTENLPPKLA-QYQGPALLAYNNAE-FK 83

Query: 1250 AENIRALCDVGNSTK--KGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+ ++L  VGNS K   G++ G  GR   GF SV+  TD+P I S
Sbjct: 84   KEHFQSLSQVGNSLKMEDGATTGKFGR---GFNSVYNWTDSPSIVS 126


>gi|255572407|ref|XP_002527141.1| protein binding protein, putative [Ricinus communis]
 gi|223533501|gb|EEF35243.1| protein binding protein, putative [Ricinus communis]
          Length = 4704

 Score = 47.4 bits (111), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 52/112 (46%), Gaps = 15/112 (13%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVV---LNNEQG----------FS 1249
            Y + +  L EL+QNADD      V   L   L  S  ++   L+  QG          F+
Sbjct: 34   YPEGTTVLKELIQNADD-AGATKVRLCLDRRLHRSDSIISTSLSQWQGPALLAYNDAVFT 92

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
             E+  ++  +G S K G  A   GR G+GF SV+ +TD P   S  + V FD
Sbjct: 93   EEDFVSISRIGGSAKHGQ-AWKTGRFGVGFNSVYHLTDLPSFVSGKYIVLFD 143



 Score = 43.9 bits (102), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 67/261 (25%), Positives = 101/261 (38%), Gaps = 53/261 (20%)

Query: 1160 LIVESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD 1219
            L+ E+     FGL     +  +    +  A   R  H L  E+Y+     L ELVQNA+D
Sbjct: 1368 LLAENADSMNFGL-----SGAAEAFGQHEALTTRLKHIL--EMYADGPGILFELVQNAED 1420

Query: 1220 NIYPENVEPTLTFIL---QESGIVVLN--------------NEQGFSAENIRALCDVGNS 1262
                E     + F+L   Q     VL+              N+  FS +++ A+  +G  
Sbjct: 1421 AGASE-----VIFLLDKTQYGTSSVLSPEMADWQGPALYCFNDSVFSPQDLYAISRIGQE 1475

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
            +K       IGR G+GF  V+  TD P   S    V FD     +  + P+  P   I  
Sbjct: 1476 SKLEKPLA-IGRFGLGFNCVYHFTDIPTFVSGENIVMFDPHACNLPGISPSH-PGLRIKF 1533

Query: 1323 FCR-LLSKDPVQ----------LESKCWNTCIRLPFRTKF--------SEGIAMNNIVDM 1363
              R +L + P Q          L+     T  R P R+           EG A  +++ +
Sbjct: 1534 AGRKILEQFPDQFSPFLHFGCDLQHSFPGTLFRFPLRSSAIALRSQIKKEGYAPEDVMSL 1593

Query: 1364 ---FSDLHPSLLLFLHRLQCI 1381
               FS +    LLFL  ++ I
Sbjct: 1594 FESFSGIVSDALLFLRNVKSI 1614


>gi|395328407|gb|EJF60799.1| hypothetical protein DICSQDRAFT_181063 [Dichomitus squalens LYAD-421
            SS1]
          Length = 2845

 Score = 47.0 bits (110), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 70/160 (43%), Gaps = 21/160 (13%)

Query: 1244 NEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
            N+  FS  +  A+  V  S+K G +   IG+ G+G ++ + +TD P+  S G+HV FD  
Sbjct: 264  NDAVFSDADWEAIRTVSESSKTGDTEK-IGKHGLGSRAYYHLTDNPQYLSGGYHVVFDPH 322

Query: 1304 E---GQIGF--VLPTLVP--PFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKF----- 1351
                GQ G+   L  +    P  ++ F  LL   P    +    T +RL  R        
Sbjct: 323  RWAFGQGGWKQSLEEIARDYPDQLEPFSSLL---PHGDRTSLPGTIVRLALRPATAASRI 379

Query: 1352 -SEGIAMNNIVDMFSDLHPS----LLLFLHRLQCIMFRNM 1386
             S+ +  + I D+  D         LLFL  L CI  R +
Sbjct: 380  SSKKVTADEIHDLLLDFVRKELDVALLFLSHLTCIEVREI 419


>gi|51476565|emb|CAH18265.1| hypothetical protein [Homo sapiens]
          Length = 2015

 Score = 47.0 bits (110), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 102/257 (39%), Gaps = 68/257 (26%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1046 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1104

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             Q FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1105 SQ-FSDSDFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1162

Query: 1305 GQIG----------------------FVLPTLVPPFNIDMF-CRLLSKDPVQLESK-CWN 1340
              I                          P    PF ID+F C+L    P+ +E+   +N
Sbjct: 1163 NHISKHIKDKSNPGIKINWSKQQKRLRKFPNQFKPF-IDVFGCQL----PLTVEAPYSYN 1217

Query: 1341 -TCIRLPFRTKFSEGIA-----------MNNIVDMFSDLHPSLLLFLHRLQCIMFRNM-- 1386
             T  RL FRT+    ++           + ++VD FS     L++F   ++ +  + +  
Sbjct: 1218 GTLFRLSFRTQQEAKVSEVSSTCYNTADIYSLVDEFSLCGHRLIIFTQSVKSMYLKYLKI 1277

Query: 1387 ------LNDSLVVIRKK 1397
                  L    V+I+KK
Sbjct: 1278 EETNPSLAQDTVIIKKK 1294


>gi|156392678|ref|XP_001636175.1| predicted protein [Nematostella vectensis]
 gi|156223275|gb|EDO44112.1| predicted protein [Nematostella vectensis]
          Length = 4477

 Score = 47.0 bits (110), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 108/474 (22%), Positives = 173/474 (36%), Gaps = 115/474 (24%)

Query: 1203 YSQDSHFLLELVQNADDNIYPE----------NVEPTLT---FILQESGIVVLNNEQGFS 1249
            Y +D   L E++Q+ADD    E          N +  L+     LQ   I V N+   F+
Sbjct: 2489 YPRDEGILKEMLQSADDAKATEFHVIYDPRQHNKKRVLSEGWKDLQGPAICVYND-CPFT 2547

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF--------- 1300
             E + +   +G + K   S      K  GF +++ +TD P   ++   +           
Sbjct: 2548 EERMESFQQIGRTAKSEESD-----KAFGFNTIYHLTDCPSFITDNSTICILDPHATHAP 2602

Query: 1301 DTSEGQIGFVLPTLVPPFN---IDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFS----- 1352
            D +    G +L  L   F    +D+F   L +D   LE     T +R P RTK +     
Sbjct: 2603 DVTRETPGRILQPLDDEFRENFVDVFKGYL-EDFFNLEGA---TMLRFPLRTKINAEKSE 2658

Query: 1353 ------EGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLV-------------- 1392
                  E + ++N++  F       LLFL+ ++ +    + N  LV              
Sbjct: 2659 ISKLSFEDVDVDNLLKTFEHDVKDSLLFLNSVRKVTISRIENGRLVKKYQASATLPDESE 2718

Query: 1393 ---------VIRKKIVGDGIIK--------VSCGEDKMT--WFV----ASQKLRAGVIRP 1429
                     VIR K V    I         V    ++ T  W V     +++  A  + P
Sbjct: 2719 EQLQTLSSEVIRSKGVPTADISYFGVTYKLVISDSERSTEEWLVHQCLGAKEYVAAPLVP 2778

Query: 1430 DVKT------TEIALALTLQESNEGNYGPLLYQQP-------VFAFLPLRT-YGLKFILQ 1475
            D +         +A  +++QE   G +     Q+P        F+ LPL     L   + 
Sbjct: 2779 DGRALGFLPRAGVAAKVSVQEDVLGQHVLRALQEPFTDKTCRAFSSLPLHVETRLPVQVS 2838

Query: 1476 GDFVLPSSR------EEVDGNSPWNQWLLSEFPA-----LFVSAERSFCDL----PCF-- 1518
            G F L  +R      E+ D  S WN+++ +E  A     L +SA      +    P F  
Sbjct: 2839 GQFALDVARRKLCDDEKKDCRSRWNEFMKTEVIAPAYVHLLMSARHKITGIQEGNPWFFN 2898

Query: 1519 -RENPAKAASVYMSFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAP 1571
             R       S Y S  P    VH  +  L   +   L  +NC ++   + Q AP
Sbjct: 2899 TRSGVKSGLSWYHSLFPDPHSVHSDWKELSLAVFRCLENANCPVIPVIHGQEAP 2952



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 68/165 (41%), Gaps = 43/165 (26%)

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFV 1310
            ENI+ L    ++T  GS+    GR GIGF SVF +TD P   S  F V FD     +G V
Sbjct: 1538 ENIKML---ASTTTTGSAKR--GRPGIGFCSVFNITDIPSFISREFIVMFDPQTTYLGNV 1592

Query: 1311 L-----PTL-----------------VPPFNIDMFCRLLSKDPVQLESKCWN-TCIRLPF 1347
            +     P +                   P+N    C +L     Q +   +N T  RLPF
Sbjct: 1593 IKDKTRPGIRLNLKENSRAVAAFSDQFSPYNKIFGCSILE----QGKEFFYNGTLFRLPF 1648

Query: 1348 RT--KFSEGIAMNNIVD---------MFSDLHPSLLLFLHRLQCI 1381
            RT  +    +   N  D         +F +   SLLLFL  ++ I
Sbjct: 1649 RTLSQAQTSVISGNCYDESNISKLLGIFREHASSLLLFLRNVKRI 1693


>gi|320591625|gb|EFX04064.1| ino80 chromatin remodeling complex protein [Grosmannia clavigera
            kw1407]
          Length = 1728

 Score = 47.0 bits (110), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 4/76 (5%)

Query: 1433 TTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSSREEVDGNSP 1492
            T+E+ +A  + +S      P++    +F FLP+R  G  F++Q DFV  +SR++V  +SP
Sbjct: 43   TSEVVVAFPMTDSGV----PIVKPLDLFVFLPVRHVGFNFLIQADFVTDASRQDVVADSP 98

Query: 1493 WNQWLLSEFPALFVSA 1508
             N  L+      FV  
Sbjct: 99   RNIGLIDGIAKAFVQG 114


>gi|377562530|ref|ZP_09791921.1| hypothetical protein GOOTI_256_00540 [Gordonia otitidis NBRC 100426]
 gi|377520356|dbj|GAB37086.1| hypothetical protein GOOTI_256_00540 [Gordonia otitidis NBRC 100426]
          Length = 892

 Score = 46.6 bits (109), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 35/66 (53%), Gaps = 2/66 (3%)

Query: 1241 VLNNEQGFSAENIRALCDVGNSTKKGSSAG--YIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
            V N     SA  +R+L  +  S K G SAG   IGR G+GF +   V D+ EI S    +
Sbjct: 28   VANTGAPLSANGVRSLMALRASAKSGDSAGSGVIGRYGVGFTATATVADSVEIRSTTGSI 87

Query: 1299 KFDTSE 1304
            +FD ++
Sbjct: 88   RFDRAD 93


>gi|393781484|ref|ZP_10369679.1| hypothetical protein HMPREF1071_00547 [Bacteroides salyersiae
            CL02T12C01]
 gi|392676547|gb|EIY69979.1| hypothetical protein HMPREF1071_00547 [Bacteroides salyersiae
            CL02T12C01]
          Length = 587

 Score = 46.6 bits (109), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN    +   L E G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYAGRIDVYLNEDGSLVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  YIGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDYIGRFGIGLLSCFVVTNEITVES 124


>gi|359793721|ref|ZP_09296461.1| hypothetical protein MAXJ12_29405 [Mesorhizobium alhagi CCNWXJ12-2]
 gi|359250101|gb|EHK53640.1| hypothetical protein MAXJ12_29405 [Mesorhizobium alhagi CCNWXJ12-2]
          Length = 1574

 Score = 46.6 bits (109), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 119/313 (38%), Gaps = 38/313 (12%)

Query: 1210 LLELVQNADDNIYPENVEPT--LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            L ELVQNA D I  E  E    +   +    +V  N       + I AL +  +S K+G 
Sbjct: 62   LFELVQNAADAIL-EGGEGAGRIHVKVGRDALVAANTGAPLDQDGIVALLNARSSPKRG- 119

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS------EGQIGFVLPTLVPPFNID 1321
              G IGR GIGFKS+ ++    ++ S    ++FD           +G       P   + 
Sbjct: 120  --GQIGRFGIGFKSLLKLGGRVDLVSRSIGLRFDPEACRVRIRTHLGLDSSARAPGMRLA 177

Query: 1322 MFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCI 1381
                +   +P  LE   W T +        SE      + + F+      +LFL      
Sbjct: 178  EV--IDGPNPKGLE---WATTV---VSATISEAATFQRLAEEFTKFPSEFVLFLP-ANIH 228

Query: 1382 MFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTW--FVASQKLRAGVIRPDV----KTTE 1435
            +  ++   +   I + + G+ ++ V  G ++  W  F  +  +R    R D        E
Sbjct: 229  LTLDVEGGARREITRTVDGENVV-VRDGSNETRWKLFETTAAIRDPGAREDATHIQARDE 287

Query: 1436 IALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGL-KFILQGDFVLPSSREE-VDGNSPW 1493
            + LA  +  S      P+      +AF P  T  L   IL   + L S R   V G   W
Sbjct: 288  VPLAWAVPLS------PIPRAGRFWAFFPTETATLTSGILNAPWKLNSDRTNLVRGE--W 339

Query: 1494 NQWLLSEFPALFV 1506
            N  L+SE   L  
Sbjct: 340  NTTLMSEAARLIA 352


>gi|255086673|ref|XP_002509303.1| predicted protein [Micromonas sp. RCC299]
 gi|226524581|gb|ACO70561.1| predicted protein [Micromonas sp. RCC299]
          Length = 5379

 Score = 46.6 bits (109), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 50/112 (44%), Gaps = 15/112 (13%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y + +  L ELVQNADD           E    T T         Q   ++V N+   F 
Sbjct: 30   YPEGTSILKELVQNADDAGARTVKVCLDERSHGTETLAYDGLAQFQGPSLLVFNDST-FK 88

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
              +  ++  +G+S K+ +  G  GR G+GF SV+ +TD P   S    V FD
Sbjct: 89   ESDFESISRIGDSVKR-TQVGKTGRFGVGFNSVYHLTDMPSFVSGRHAVFFD 139


>gi|336319563|ref|YP_004599531.1| ATP-binding region ATPase domain protein [[Cellvibrio] gilvus ATCC
            13127]
 gi|336103144|gb|AEI10963.1| ATP-binding region ATPase domain protein [[Cellvibrio] gilvus ATCC
            13127]
          Length = 1056

 Score = 46.6 bits (109), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 47/102 (46%), Gaps = 5/102 (4%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFIL----QESGIVVLNNEQGFSAENIRALCD 1258
            Y +D   ++EL QNA D      V   L   L    + + +V  N     +AE + AL  
Sbjct: 40   YYRD-RVVVELAQNAADAAVRGGVPGRLLLRLARTDEGTTLVAANTGAPLTAEGVAALAS 98

Query: 1259 VGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
            +  S K+    G +GR G+GF +V  V+D   + S    V+F
Sbjct: 99   MRASAKRSDQPGVVGRFGVGFAAVRAVSDELSVVSTTGGVRF 140


>gi|441509455|ref|ZP_20991373.1| putative helicase [Gordonia aichiensis NBRC 108223]
 gi|441446553|dbj|GAC49334.1| putative helicase [Gordonia aichiensis NBRC 108223]
          Length = 1602

 Score = 46.6 bits (109), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 74/184 (40%), Gaps = 28/184 (15%)

Query: 1184 LKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY-PENVEPTLTFILQESGIVVL 1242
            L ++HA   R +H   Q  Y      L EL+QNA D +  P      +  +L ++ +   
Sbjct: 31   LVEEHANGERRIH---QGGYGDRQ--LFELIQNAADELRQPAFYGGRIEAVLTDTHLYCA 85

Query: 1243 NNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT 1302
            N     +      +  +G S K+G   G IGR G+G KSV  VT +P+  S      FD 
Sbjct: 86   NEGSPITPAGAETILRMGVSRKRG---GQIGRFGVGVKSVLSVTKSPQFFSTSGAFGFDA 142

Query: 1303 -----------SEGQIGFVLPTL-VPPFNIDMFCRLL------SKDPVQLESKCW-NTCI 1343
                       +E Q   VLP L +    +    R L      + DP+      W  T +
Sbjct: 143  DWSAHKIRTAVNEAQSKQVLPVLDLTDTPVLRLARQLDPASERAADPILDALLAWATTVV 202

Query: 1344 RLPF 1347
            RLP 
Sbjct: 203  RLPL 206


>gi|47226310|emb|CAG09278.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 3392

 Score = 46.2 bits (108), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 56/236 (23%), Positives = 97/236 (41%), Gaps = 57/236 (24%)

Query: 1203 YSQDSHFLLELVQNADD---------------NIYPEN-VEPTLTFILQESGIVVLNNEQ 1246
            Y ++     EL+QNA+D                  PE  ++P ++ + Q   +   NNE 
Sbjct: 675  YDEEGDIFKELIQNAEDAGAQQCSFLVDFRTHKYNPEELIDPDMS-LCQGPCLWAFNNE- 732

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH 1297
             F  ++   +  VG+++K+ +    IG+ G+GF +V+ VTD P I S         N  H
Sbjct: 733  TFKDDDWTNIVRVGSASKE-TKLEKIGKFGLGFNTVYHVTDIPSILSGNRLLILDPNVTH 791

Query: 1298 VK------------FDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRL 1345
            +K             D S+ ++    P    P++    C    + P +       T IRL
Sbjct: 792  LKTHIKQKSNPGIRLDLSQLRLFKCFPGQFGPYDRIFDCDFAKQKPYE------GTLIRL 845

Query: 1346 PFRT-------KFSEGIAMNNIVDMFSDL----HPSLLLFLHRLQCIMFRNMLNDS 1390
            PFRT       + S+ +   + +  F +       S LLFL  +  +  +N+  D+
Sbjct: 846  PFRTEEEALTSEISKRVYYKSDIVSFQECLAENSKSHLLFLRSISSLALQNIPKDA 901



 Score = 46.2 bits (108), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 73/188 (38%), Gaps = 34/188 (18%)

Query: 1203 YSQDSHFLLELVQNADDNIYPE----------NVEPTLT---FILQESGIVVLNNEQGFS 1249
            Y      L EL+QNADD    E           VE T       LQ   + V NN + FS
Sbjct: 1665 YPSKKDILKELLQNADDAEATEIHFVWDKRQHGVEKTFGERWNQLQGPALCVFNN-KVFS 1723

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI------------HSNGFH 1297
              +++ +  +G   K  S+ G IG+ G+GF SV+ +TD P I            +     
Sbjct: 1724 NADLKGIQQLGEGGKH-STRGKIGKYGVGFNSVYHLTDCPSILTGDSLLCISDPNQKYIE 1782

Query: 1298 VKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKF---SEG 1354
               D +   IG+ L         D++   L  D   L      T  RLP RT F   S  
Sbjct: 1783 CYSDKAPAGIGYKLDDKFKNMYADVYQSFLP-DRFPLHE---GTMFRLPLRTFFMATSSN 1838

Query: 1355 IAMNNIVD 1362
            I+   + D
Sbjct: 1839 ISDKEVTD 1846


>gi|403739234|ref|ZP_10951766.1| putative heat shock protein [Austwickia chelonae NBRC 105200]
 gi|403190895|dbj|GAB78536.1| putative heat shock protein [Austwickia chelonae NBRC 105200]
          Length = 605

 Score = 46.2 bits (108), Expect = 0.19,   Method: Composition-based stats.
 Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 32/119 (26%)

Query: 1198 LSQELYSQDSHFLLELVQNADD--------------------NIYPENVEPTLTFILQES 1237
            LS  LYS    +L EL+QNA D                     I P +V P   F L ++
Sbjct: 16   LSHHLYSGPRVYLRELIQNAADALTARLLLAPATAASAPPRITITPADVSPDGRFHLDDN 75

Query: 1238 GIVVLNNEQGFSAENI-RALCDVGNSTKKGSSAG----YIGRKGIGFKSVFRVTDAPEI 1291
            GI       G + E+I R L  +G+S+K G++A     +IG  GIG  + F V++  E+
Sbjct: 76   GI-------GLTREDIDRFLVGIGSSSKPGAAASTQETFIGHFGIGLLACFLVSETIEL 127


>gi|297808305|ref|XP_002872036.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317873|gb|EFH48295.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 4711

 Score = 46.2 bits (108), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 52/112 (46%), Gaps = 15/112 (13%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-------------NEQGFS 1249
            Y + +  L EL+QNADD      V   L   +  SG ++ +             N+  F+
Sbjct: 26   YPEGTTVLKELIQNADD-AGATKVRLCLDRRVHGSGSLLSDSLAQWQGPSLLAYNDAVFT 84

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
             E+  ++  +G S K G  A   GR G+GF SV+ +TD P   S  + V FD
Sbjct: 85   EEDFVSISRIGGSGKHGQ-AWKTGRFGVGFNSVYHLTDIPSFVSGKYVVLFD 135



 Score = 44.7 bits (104), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 61/146 (41%), Gaps = 28/146 (19%)

Query: 1187 QHARLG-RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFIL----------- 1234
            QH  L  R  H L  E+Y+     L ELVQNA+D    E     +TF+L           
Sbjct: 1383 QHEALTTRLKHIL--EMYADGPGILFELVQNAEDAGASE-----VTFLLDKTHYGTSSLL 1435

Query: 1235 -------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
                   Q   +   NN   F+ +++ A+  +G ++K       IGR G+GF  V+  TD
Sbjct: 1436 SPEMADWQGPALYCFNNSV-FTQQDMYAISRIGQASKLEKPFA-IGRFGLGFNCVYHFTD 1493

Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPT 1313
             P   S    V FD     +  + PT
Sbjct: 1494 IPGFVSGENIVMFDPHANHLPGISPT 1519


>gi|156359807|ref|XP_001624956.1| predicted protein [Nematostella vectensis]
 gi|156211764|gb|EDO32856.1| predicted protein [Nematostella vectensis]
          Length = 250

 Score = 46.2 bits (108), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 18/137 (13%)

Query: 1187 QHARLGRALHCLSQELYSQDSHFLLELVQNADDN-------IYPENV-EPTLTFILQES- 1237
            Q+  L R L  L +E Y+       EL+QNADD        +Y E   E  +T ++ E  
Sbjct: 107  QYEPLTRRLQRLLEE-YTDGFSVPKELIQNADDAGATEVRFLYDERANEDAMTCLISEGM 165

Query: 1238 ------GIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI 1291
                   + V NN   F+  + + +  +  +TK+  S G IGR G+GF +V+ +TD P +
Sbjct: 166  KECQGPALWVYNNAV-FTDADFQNITKLNGATKEEDS-GKIGRFGLGFNAVYNITDVPSL 223

Query: 1292 HSNGFHVKFDTSEGQIG 1308
             S    V FD     +G
Sbjct: 224  LSRNNLVIFDPHTTHLG 240


>gi|296128608|ref|YP_003635858.1| hypothetical protein Cfla_0748 [Cellulomonas flavigena DSM 20109]
 gi|296020423|gb|ADG73659.1| conserved hypothetical protein [Cellulomonas flavigena DSM 20109]
          Length = 1026

 Score = 46.2 bits (108), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 48/107 (44%), Gaps = 6/107 (5%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQ--ESGIVVL---NNEQGFSAENIRALC 1257
            Y +D   L+EL QNA D      V   L   L   + G  VL   N      A  +R+L 
Sbjct: 40   YYRD-RVLVELAQNAADAAVRAGVAGRLLLRLATTDDGTAVLVAANTGAPLDAAGVRSLA 98

Query: 1258 DVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             +  S K+  + G +GR G+GF +V  V D   + S    V+F  ++
Sbjct: 99   TLRASAKRDGAPGTVGRFGVGFAAVRAVADEVTVCSTSGAVRFSLAD 145


>gi|15237223|ref|NP_197702.1| C3HC4-type RING finger domain-containing protein [Arabidopsis
            thaliana]
 gi|9759369|dbj|BAB09828.1| unnamed protein product [Arabidopsis thaliana]
 gi|332005740|gb|AED93123.1| C3HC4-type RING finger domain-containing protein [Arabidopsis
            thaliana]
          Length = 4706

 Score = 46.2 bits (108), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 52/112 (46%), Gaps = 15/112 (13%)

Query: 1203 YSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLN-------------NEQGFS 1249
            Y + +  L EL+QNADD      V   L   +  SG ++ +             N+  F+
Sbjct: 26   YPEGTTVLKELIQNADD-AGATKVRLCLDRRVHGSGSLLSDSLAQWQGPSLLAYNDAVFT 84

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
             E+  ++  +G S K G  A   GR G+GF SV+ +TD P   S  + V FD
Sbjct: 85   EEDFVSISRIGGSGKHGQ-AWKTGRFGVGFNSVYHLTDIPSFVSGKYVVLFD 135



 Score = 44.7 bits (104), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 61/146 (41%), Gaps = 28/146 (19%)

Query: 1187 QHARLG-RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFIL----------- 1234
            QH  L  R  H L  E+Y+     L ELVQNA+D    E     +TF+L           
Sbjct: 1383 QHEALTTRLKHIL--EMYADGPGILFELVQNAEDAGASE-----VTFLLDKTHYGTSSLL 1435

Query: 1235 -------QESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
                   Q   +   NN   F+ +++ A+  +G ++K       IGR G+GF  V+  TD
Sbjct: 1436 SPEMADWQGPALYCFNNSV-FTQQDMYAISRIGQASKLEKPFA-IGRFGLGFNCVYHFTD 1493

Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPT 1313
             P   S    V FD     +  + PT
Sbjct: 1494 IPGFVSGENIVMFDPHANHLPGISPT 1519


>gi|329941389|ref|ZP_08290668.1| hypothetical protein SGM_6160 [Streptomyces griseoaurantiacus M045]
 gi|329299920|gb|EGG43819.1| hypothetical protein SGM_6160 [Streptomyces griseoaurantiacus M045]
          Length = 1067

 Score = 46.2 bits (108), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 39/157 (24%), Positives = 65/157 (41%), Gaps = 12/157 (7%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L   L++  +   N      A  + +L  +  S K+  + G +GR G+GF +V  V+D P
Sbjct: 75   LRLTLRDGVLAAANTGAPLDAAGVESLATLRASAKRDGAPGAVGRFGVGFAAVLAVSDEP 134

Query: 1290 EIHSNGFHVKFDTSE-----GQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC---WNT 1341
             +      V++  +E     G      P L            L + P   E +    ++T
Sbjct: 135  AVLGRHGGVRWSLAEARDLAGDTARHSPGLGDEIRRRDGHVPLLRLPFAAEGRAPEPYDT 194

Query: 1342 CIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRL 1378
             + LP R   +EG+A   ++D   D   +LLL L  L
Sbjct: 195  VVVLPLRDPAAEGLA-ERLLDAVDD---ALLLALPGL 227


>gi|190894939|ref|YP_001985232.1| heat shock protein 90 [Rhizobium etli CIAT 652]
 gi|190700600|gb|ACE94682.1| heat shock chaperone protein, Hsp90 family [Rhizobium etli CIAT 652]
          Length = 628

 Score = 46.2 bits (108), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 63/144 (43%), Gaps = 30/144 (20%)

Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD---NIYPENVE-PTLTFI 1233
            N+ES++ +   ARL   LH +   +YS    FL EL+ NA D    +  E +E P L   
Sbjct: 8    NVESHVFEADVARL---LHMMVHSVYSDKDVFLRELISNAADACEKLRYEAIEAPALLGS 64

Query: 1234 LQESGI-----------VVLNNEQGFSAENI------------RALCDVGNSTKKGSSAG 1270
            + ES I           VV +N  G   + +            RA  D   ++K G  A 
Sbjct: 65   VAESRITLSLDDESRQLVVEDNGIGMQRDEMIEALGTIARSGTRAFMDRIAASKAGEGAQ 124

Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSN 1294
             IG+ G+GF S F V D  ++ S 
Sbjct: 125  LIGQFGVGFYSCFMVADRVDVISR 148


>gi|323356692|ref|YP_004223088.1| hypothetical protein MTES_0244 [Microbacterium testaceum StLB037]
 gi|323273063|dbj|BAJ73208.1| hypothetical protein MTES_0244 [Microbacterium testaceum StLB037]
          Length = 715

 Score = 46.2 bits (108), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 42/130 (32%), Positives = 62/130 (47%), Gaps = 21/130 (16%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNA---DDNIYPENVEPTLTFILQESGIVVLNNEQGFSAE 1251
            L  L   LYS  +  + ELV NA   D N+    VE        ++ + + ++ +G + E
Sbjct: 29   LEALGINLYSNAAAVVSELVANAWDADANL----VEINWNAGGDDAFVEISDDGRGMTRE 84

Query: 1252 NI-RALCDVGNS---TKKGSSAGY----IGRKGIGFKSVFRVTDAPEIHS------NGFH 1297
             + R    VG S   T+ G S GY    +GRKGIG  SVF + D  E+HS      N F 
Sbjct: 85   ELKRKFLTVGYSKRTTEGGLSDGYKRPFMGRKGIGKLSVFSIADTVEVHSARNGERNAFR 144

Query: 1298 VKFDTSEGQI 1307
            ++ D  + +I
Sbjct: 145  IEVDELKKRI 154


>gi|448690608|ref|ZP_21695769.1| hypothetical protein C444_18702 [Haloarcula japonica DSM 6131]
 gi|445776570|gb|EMA27547.1| hypothetical protein C444_18702 [Haloarcula japonica DSM 6131]
          Length = 1399

 Score = 45.8 bits (107), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 46/91 (50%), Gaps = 1/91 (1%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA 1269
            LLEL+QNADD  +  +    +   L E G+ V N+ + FS E I +L    NS K+    
Sbjct: 38   LLELIQNADDAGWESSGSVDVLIELTEEGLFVANSGEPFSREGISSLMLSDNSPKQFREE 97

Query: 1270 GYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
              IG KG+GF+S+   +    I S    + F
Sbjct: 98   -CIGYKGLGFRSILNWSSDVLIQSGEVSIGF 127


>gi|432115437|gb|ELK36852.1| Sacsin [Myotis davidii]
          Length = 1679

 Score = 45.8 bits (107), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S + V NN   F+
Sbjct: 147  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 205

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 206  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 248



 Score = 43.5 bits (101), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 19/124 (15%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1508 EEYPSVSDIFKELLQNADDANATECSFMIDMRRNMDIRENLLDPGMA-ACHGPALWSFNN 1566

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1567 SE-FSDSDFVNITRLGESLKRG-EVDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1624

Query: 1305 GQIG 1308
              I 
Sbjct: 1625 NHIS 1628


>gi|153809618|ref|ZP_01962286.1| hypothetical protein BACCAC_03936 [Bacteroides caccae ATCC 43185]
 gi|149127743|gb|EDM18968.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein, partial
            [Bacteroides caccae ATCC 43185]
          Length = 488

 Score = 45.8 bits (107), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN    +   L +   V+  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYAGRIDVFLNDDKTVIFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VTD  ++ S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTDEIKVES 124


>gi|187951065|gb|AAI38484.1| Sacs protein [Mus musculus]
 gi|223460292|gb|AAI38483.1| Sacs protein [Mus musculus]
          Length = 744

 Score = 45.8 bits (107), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 59/136 (43%), Gaps = 22/136 (16%)

Query: 1178 NMESNMLKKQHARLGRALHCLSQEL------YSQDSHFLLELVQNADDN-------IYPE 1224
            N++S  LK    R G+    L   L      Y +    L EL+QNA+D        +Y E
Sbjct: 77   NLQSKGLKG-GGRFGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVKFLYDE 135

Query: 1225 NVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
                T T         Q S + V NN   F+ E+   + ++  S KK      +GR GIG
Sbjct: 136  TQYGTETLWSKDMAQYQGSALYVYNNAV-FTPEDWHGIQEIARSRKKDDPLK-VGRFGIG 193

Query: 1279 FKSVFRVTDAPEIHSN 1294
            F SV+ +TD P I S 
Sbjct: 194  FNSVYHITDVPCIFSG 209


>gi|219521373|gb|AAI71956.1| Sacs protein [Mus musculus]
          Length = 744

 Score = 45.8 bits (107), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 59/136 (43%), Gaps = 22/136 (16%)

Query: 1178 NMESNMLKKQHARLGRALHCLSQEL------YSQDSHFLLELVQNADDN-------IYPE 1224
            N++S  LK    R G+    L   L      Y +    L EL+QNA+D        +Y E
Sbjct: 77   NLQSKGLKG-GGRFGQTTPPLVDFLKDILRRYPEGGQILKELIQNAEDAGATEVKFLYDE 135

Query: 1225 NVEPTLTFI------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIG 1278
                T T         Q S + V NN   F+ E+   + ++  S KK      +GR GIG
Sbjct: 136  TQYGTETLWSKDMAQYQGSALYVYNNAV-FTPEDWHGIQEIARSRKKDDPLK-VGRFGIG 193

Query: 1279 FKSVFRVTDAPEIHSN 1294
            F SV+ +TD P I S 
Sbjct: 194  FNSVYHITDVPCIFSG 209


>gi|405975168|gb|EKC39753.1| Sacsin [Crassostrea gigas]
          Length = 629

 Score = 45.4 bits (106), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 66/242 (27%), Positives = 98/242 (40%), Gaps = 41/242 (16%)

Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPT-----LTFILQESGIVVLNNEQG 1247
            Y  D   L E++QNA+D          +  P   EP+          +   +VV NN + 
Sbjct: 34   YPDDGQILKEIIQNAEDAGASEMKILYDARPAVQEPSTKKAPFRKYFKGPALVVHNNAE- 92

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQI 1307
            F+ ++ + +  + +S K+      +GR G+GFKSVF +TD P I S    +  D  +   
Sbjct: 93   FTEDDWKGIKMLYSSIKEFDKTK-VGRFGLGFKSVFHITDYPVIISGDQLLVLDPHQDS- 150

Query: 1308 GFVLPTL-------VPPFNID----MFCRLLSKDPVQLESKCW-NTCIRLPFR---TKFS 1352
              V  T+           NI+     F  +   D   LES  +  T  R P R   T  S
Sbjct: 151  SKVCQTMKLKKLHRYKKMNIEDCLKTFSGVFGFDQNTLESGHFRGTIFRFPLRQEETDLS 210

Query: 1353 EGIAMNNIVD----MFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSC 1408
            + I   + VD     F D  P  LLFL  L+ I      N+S    +   +G+ I  V  
Sbjct: 211  DNIYDKSKVDDLFMSFKDEAPVSLLFLKCLESITLLREENES----KTGDIGEKIFSVHI 266

Query: 1409 GE 1410
             E
Sbjct: 267  DE 268


>gi|392843737|gb|AFM84472.1| sacsin, partial [Capra hircus]
          Length = 737

 Score = 45.4 bits (106), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 59/125 (47%), Gaps = 20/125 (16%)

Query: 1180 ESNMLKK--QHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQES 1237
            E  MLK+  Q+A   +A           D  F+ +  Q+  D I+ +   P     L   
Sbjct: 421  EKEMLKELLQNANYAKA----------TDICFVFDPRQHPVDRIFDDKWAP-----LHGP 465

Query: 1238 GIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPE-IHSNGF 1296
             + V NN Q F+ +++R + ++G  TK+G+     G+ G+GF SV+ +TD P  I  N  
Sbjct: 466  ALCVYNN-QPFTEDDVRGIQNLGRGTKEGNPCK-TGQYGVGFNSVYHITDCPSFISGNDI 523

Query: 1297 HVKFD 1301
               FD
Sbjct: 524  LCIFD 528


>gi|313147938|ref|ZP_07810131.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|423281212|ref|ZP_17260123.1| hypothetical protein HMPREF1203_04340 [Bacteroides fragilis HMW 610]
 gi|424665851|ref|ZP_18102887.1| hypothetical protein HMPREF1205_01726 [Bacteroides fragilis HMW 616]
 gi|313136705|gb|EFR54065.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|404574104|gb|EKA78855.1| hypothetical protein HMPREF1205_01726 [Bacteroides fragilis HMW 616]
 gi|404583376|gb|EKA88057.1| hypothetical protein HMPREF1203_04340 [Bacteroides fragilis HMW 610]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       E+    +   L+E G VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALHNIDESYAGRIDVFLKEDGSVVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVHRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|383111062|ref|ZP_09931880.1| hypothetical protein BSGG_2167 [Bacteroides sp. D2]
 gi|313694632|gb|EFS31467.1| hypothetical protein BSGG_2167 [Bacteroides sp. D2]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E+  VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNENKTVVFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|92119166|ref|YP_578895.1| hypothetical protein Nham_3709 [Nitrobacter hamburgensis X14]
 gi|91802060|gb|ABE64435.1| hypothetical protein Nham_3709 [Nitrobacter hamburgensis X14]
          Length = 1306

 Score = 45.4 bits (106), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 71/309 (22%), Positives = 122/309 (39%), Gaps = 39/309 (12%)

Query: 1210 LLELVQNADDNIYPENVEP--TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            L ELVQNA D I  E  EP   +   L    +   N         I AL +V +S K+G+
Sbjct: 57   LYELVQNAADAIL-EAEEPHGRICVRLTPDRLEAANTGAALDEGGIVALLNVRSSPKRGN 115

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS------EGQIGFVLPTLVPPFNID 1321
                IGR GIGFKS+ R+    ++ S    ++FD           +G       P   + 
Sbjct: 116  Q---IGRFGIGFKSLLRLGGRVDLTSKSLGLRFDPGACRNRIRNHLGLTSDARAPGMRLA 172

Query: 1322 MFCRLLSK-DPVQLESKC-WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQ 1379
                  +K  P+  + +  W T +      + ++    + +    ++     LLFL    
Sbjct: 173  EVLDPNAKGSPLSGQGRWNWATTV---VSAEIADKSTFDRLKKELAEFPAEFLLFLPSDI 229

Query: 1380 CIMFRNMLNDSLVVIRKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDVKTTEIALA 1439
             +       +  ++ ++   G+ +  VS G ++  W V   ++R  V  P+ ++     A
Sbjct: 230  SLEIEVEGAEPRLIAKRFEEGNAV--VSDGANESIWKVFGTRVR--VENPEARSD----A 281

Query: 1440 LTLQESNEGNYGPLLYQQPV---------FAFLPLRTYG-LKFILQGDFVLPSSREEVDG 1489
              +Q  ++    PL +  PV         +AF P  T      IL   + L S R  +  
Sbjct: 282  THIQARDQ---VPLAWAAPVGGREQAGRFWAFFPTETQSRTSGILNAPWKLNSDRTNLI- 337

Query: 1490 NSPWNQWLL 1498
              PWN+ ++
Sbjct: 338  RGPWNEAIM 346


>gi|423294100|ref|ZP_17272227.1| hypothetical protein HMPREF1070_00892 [Bacteroides ovatus CL03T12C18]
 gi|392676596|gb|EIY70027.1| hypothetical protein HMPREF1070_00892 [Bacteroides ovatus CL03T12C18]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E+  VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNENKTVVFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|423220459|ref|ZP_17206954.1| hypothetical protein HMPREF1061_03727 [Bacteroides caccae CL03T12C61]
 gi|392623536|gb|EIY17639.1| hypothetical protein HMPREF1061_03727 [Bacteroides caccae CL03T12C61]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN    +   L +   V+  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYAGRIDVFLNDDKTVIFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VTD  ++ S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTDEIKVES 124


>gi|336404446|ref|ZP_08585141.1| hypothetical protein HMPREF0127_02454 [Bacteroides sp. 1_1_30]
 gi|335942243|gb|EGN04090.1| hypothetical protein HMPREF0127_02454 [Bacteroides sp. 1_1_30]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E   VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGVGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|85677482|ref|NP_766397.2| sacsin [Mus musculus]
 gi|74144304|dbj|BAE36019.1| unnamed protein product [Mus musculus]
          Length = 768

 Score = 45.4 bits (106), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q S + V NN   F+
Sbjct: 107  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGSALYVYNNAV-FT 165

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S 
Sbjct: 166  PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFSG 209


>gi|300764737|ref|ZP_07074728.1| hypothetical protein LMHG_11046 [Listeria monocytogenes FSL N1-017]
 gi|404281267|ref|YP_006682165.1| hypothetical protein LMOSLCC2755_1715 [Listeria monocytogenes
            SLCC2755]
 gi|300514623|gb|EFK41679.1| hypothetical protein LMHG_11046 [Listeria monocytogenes FSL N1-017]
 gi|404227902|emb|CBY49307.1| hypothetical protein LMOSLCC2755_1715 [Listeria monocytogenes
            SLCC2755]
          Length = 1507

 Score = 45.4 bits (106), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 75/317 (23%), Positives = 139/317 (43%), Gaps = 58/317 (18%)

Query: 1210 LLELVQNADD---NIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
            +LEL+QN DD   +I  EN     E  +    + + + V N    FS + I  LC +G +
Sbjct: 46   ILELLQNVDDAYGDIKSENDASNDEVKVQITYKNNILEVGNTGTSFSKDTIERLC-LGRA 104

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
            + K S    IG KG GF+S+    +  E++S  F ++F  SE     +    V   + D 
Sbjct: 105  SNKSSQN--IGNKGTGFRSLLNDAEWVELYSGEFAIRF--SEEFTKKLFEQYVSR-DSDK 159

Query: 1323 FCRLLSK--------------------DPVQLESKCWNTCIRLPFR-TKFSEGIAMNNIV 1361
            F  L+ +                     P+      ++T IR+  + T  S+G +++   
Sbjct: 160  FSELIFEQKKNWKKEDYDLCFPIMNCPQPINKIDSGFDTLIRVKLKETNLSKGTSVSK-- 217

Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV-GDGII----KVSCGEDKMTWF 1416
             +    + S LLFL  +  I+     ++    I +KIV G+ ++     V+       +F
Sbjct: 218  QLQQPFYKS-LLFLPNITKIVIETTDDEK---ISEKIVDGNDVLIEKSDVTGQSPSEEYF 273

Query: 1417 VASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG--LKFIL 1474
            V  ++   G      K  +IALA+    S + +Y     ++ ++ + P+R +G  +  ++
Sbjct: 274  VFDKESMIG-----DKVAKIALAV----SKDSDYD--YSKEKLYCYFPVRNFGTPIHALI 322

Query: 1475 QGDFVLPSSREEVDGNS 1491
               F+  +SR++V  +S
Sbjct: 323  NAPFITNNSRDDVPDDS 339


>gi|160887425|ref|ZP_02068428.1| hypothetical protein BACOVA_05444 [Bacteroides ovatus ATCC 8483]
 gi|156107836|gb|EDO09581.1| S1 RNA binding domain protein [Bacteroides ovatus ATCC 8483]
          Length = 1862

 Score = 45.4 bits (106), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 61/123 (49%), Gaps = 11/123 (8%)

Query: 1186 KQHAR------LGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESG 1238
            KQH        +   L  + Q LY+       EL+QNADD   PEN V+  L   L ++ 
Sbjct: 464  KQHTSPNTNTSVANLLEEVGQNLYTSKKRMFFELLQNADDAA-PENGVKVKLQ--LSDNY 520

Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
             VV ++   F+  +  ++     STK  SS    G KGIGFKSVF  +++  I S G++ 
Sbjct: 521  FVVTHDGFAFNKHDFESITSAAKSTKSASSKK-TGYKGIGFKSVFTNSESVLIKSAGYNF 579

Query: 1299 KFD 1301
             FD
Sbjct: 580  SFD 582


>gi|298480533|ref|ZP_06998730.1| heat shock protein HtpG [Bacteroides sp. D22]
 gi|295086232|emb|CBK67755.1| Molecular chaperone, HSP90 family [Bacteroides xylanisolvens XB1A]
 gi|298273354|gb|EFI14918.1| heat shock protein HtpG [Bacteroides sp. D22]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E   VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGVGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|262405471|ref|ZP_06082021.1| heat shock protein HtpG [Bacteroides sp. 2_1_22]
 gi|294645217|ref|ZP_06722936.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Bacteroides ovatus SD CC 2a]
 gi|294809664|ref|ZP_06768354.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Bacteroides xylanisolvens SD CC 1b]
 gi|345510656|ref|ZP_08790220.1| heat shock protein HtpG [Bacteroides sp. D1]
 gi|229442678|gb|EEO48469.1| heat shock protein HtpG [Bacteroides sp. D1]
 gi|262356346|gb|EEZ05436.1| heat shock protein HtpG [Bacteroides sp. 2_1_22]
 gi|292639460|gb|EFF57759.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Bacteroides ovatus SD CC 2a]
 gi|294443149|gb|EFG11926.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Bacteroides xylanisolvens SD CC 1b]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E   VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGVGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|397642533|gb|EJK75294.1| hypothetical protein THAOC_02985 [Thalassiosira oceanica]
          Length = 2093

 Score = 45.4 bits (106), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 50/92 (54%), Gaps = 5/92 (5%)

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAEN 1252
            R    + ++ +++   +L EL+QNA D+        T+ F +Q++ + V +N + F+  +
Sbjct: 159  RGFLSVLEKQFARGDLYLFELLQNAVDD-----GADTVRFAIQDNTLQVSHNGRRFTPLD 213

Query: 1253 IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFR 1284
            +  L  VG STK       IG  G+GFK+V++
Sbjct: 214  VLGLSSVGLSTKSRQGKRTIGFMGVGFKAVYK 245


>gi|315641083|ref|ZP_07896162.1| phage minor structural protein [Enterococcus italicus DSM 15952]
 gi|315483091|gb|EFU73608.1| phage minor structural protein [Enterococcus italicus DSM 15952]
          Length = 1507

 Score = 45.4 bits (106), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 75/317 (23%), Positives = 135/317 (42%), Gaps = 58/317 (18%)

Query: 1210 LLELVQNADD---NIYPENV----EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
            +LEL+QN DD   +I  EN     E  +    + + + V N    FS + I  LC +G +
Sbjct: 46   ILELLQNVDDAYGDIKSENDSNNDEVKVRITYKNNILEVGNTGTSFSKDTIERLC-LGRA 104

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDM 1322
            + K S    IG KG GF+S+    +  E++S  F ++F  SE     +    V     D 
Sbjct: 105  SNKSSQN--IGNKGTGFRSLLNDAEWVELYSGEFAIRF--SEEFTKRLFEQYVSR-ESDK 159

Query: 1323 FCRLLSK--------------------DPVQLESKCWNTCIRLPFR-TKFSEGIAMNNIV 1361
            F  L+ +                     P+      ++T IR+  + T  S+G +   + 
Sbjct: 160  FSELIFEQQKNWKKEDYDLCFPIMNCPQPINKVDSDFDTLIRVKLKETNLSKGTS---VA 216

Query: 1362 DMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVG--DGIIK---VSCGEDKMTWF 1416
                      LLFL  +  I+     ++    I +KIV   D +I+   V+       +F
Sbjct: 217  KQLQQPFYKSLLFLPNITKIVIETSDDEK---ISEKIVDGHDVLIEKSDVTGQAPSEEYF 273

Query: 1417 VASQKLRAGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYG--LKFIL 1474
            V  ++ R G      K  +IA+A+    S + +Y     ++ ++ + P+R +G  +  ++
Sbjct: 274  VFDKESRIG-----DKVAKIAVAV----SKDSDYD--YSKEKLYCYFPVRNFGTPIHALI 322

Query: 1475 QGDFVLPSSREEVDGNS 1491
               F+  +SR++V  +S
Sbjct: 323  NAPFITNNSRDDVPDDS 339


>gi|336417388|ref|ZP_08597712.1| hypothetical protein HMPREF1017_04820 [Bacteroides ovatus 3_8_47FAA]
 gi|335936134|gb|EGM98074.1| hypothetical protein HMPREF1017_04820 [Bacteroides ovatus 3_8_47FAA]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E   VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|299147086|ref|ZP_07040153.1| heat shock protein HtpG [Bacteroides sp. 3_1_23]
 gi|298514971|gb|EFI38853.1| heat shock protein HtpG [Bacteroides sp. 3_1_23]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E   VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|237720427|ref|ZP_04550908.1| heat shock protein HtpG [Bacteroides sp. 2_2_4]
 gi|293370430|ref|ZP_06616983.1| histidine kinase domain family protein [Bacteroides ovatus SD CMC 3f]
 gi|229450178|gb|EEO55969.1| heat shock protein HtpG [Bacteroides sp. 2_2_4]
 gi|292634422|gb|EFF52958.1| histidine kinase domain family protein [Bacteroides ovatus SD CMC 3f]
          Length = 588

 Score = 45.4 bits (106), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E   VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|39935288|ref|NP_947564.1| ATPase [Rhodopseudomonas palustris CGA009]
 gi|39649140|emb|CAE27660.1| Possible ATPase [Rhodopseudomonas palustris CGA009]
          Length = 1413

 Score = 45.1 bits (105), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 52/111 (46%), Gaps = 8/111 (7%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            L    H L +++ S   H L EL+QNADD    E         ++    +  +N + F  
Sbjct: 38   LAGPWHQLFKQVQSP-RHILSELLQNADDAGASEA-----RVSIENDRFIFQHNGEDFIE 91

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
             +  ++C  G S K+      IG +GIGFKS F + D  E+++    V F+
Sbjct: 92   AHFASICRFGYSNKRALHT--IGFRGIGFKSTFSLGDRVELYTPTLAVAFE 140


>gi|160883939|ref|ZP_02064942.1| hypothetical protein BACOVA_01913 [Bacteroides ovatus ATCC 8483]
 gi|423290634|ref|ZP_17269483.1| hypothetical protein HMPREF1069_04526 [Bacteroides ovatus CL02T12C04]
 gi|156110669|gb|EDO12414.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Bacteroides ovatus ATCC 8483]
 gi|392665287|gb|EIY58815.1| hypothetical protein HMPREF1069_04526 [Bacteroides ovatus CL02T12C04]
          Length = 588

 Score = 45.1 bits (105), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN +  +   L E   VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYKGRIDVFLNEDKTVVFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|424878843|ref|ZP_18302481.1| molecular chaperone of HSP90 family [Rhizobium leguminosarum bv.
            trifolii WU95]
 gi|392520353|gb|EIW45083.1| molecular chaperone of HSP90 family [Rhizobium leguminosarum bv.
            trifolii WU95]
          Length = 630

 Score = 45.1 bits (105), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 59/139 (42%), Gaps = 30/139 (21%)

Query: 1186 KQHA---RLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
            +QHA    + R LH +   +YS    FL EL+ NA D              +  + ++P 
Sbjct: 10   EQHAFEADVSRLLHMMVHSVYSDKDVFLRELISNAADACEKLRYDSISQPQLSADGIQPG 69

Query: 1230 LTFILQESG--IVVLNNEQGFS-AENIRALCDVGNS-----------TKKGSSAGYIGRK 1275
            +   L E    +VV +N  G S +E I AL  +  S            K G  A  IG+ 
Sbjct: 70   ILVTLNEEALTLVVEDNGIGMSRSEMIDALGTIARSGTKAFMERLEAAKAGEKAELIGQF 129

Query: 1276 GIGFKSVFRVTDAPEIHSN 1294
            G+GF S F V D  ++ S 
Sbjct: 130  GVGFYSSFMVADKVDVISR 148


>gi|374849494|dbj|BAL52508.1| ATPase [uncultured Bacteroidetes bacterium]
          Length = 1361

 Score = 45.1 bits (105), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 53/108 (49%), Gaps = 11/108 (10%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            H L EL+QNADD          ++  ++++  +  ++   FS +   +LC  G S K+  
Sbjct: 45   HVLSELLQNADDAGARH-----VSVRVEDNVFIFEHDGHDFSEDEFASLCRFGFSNKR-- 97

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLV 1315
            +   IG +GIGFKS F + ++ E+ +    V+F        F LP  +
Sbjct: 98   TLYTIGFRGIGFKSTFSLGESVELLTQTLAVRFKKKR----FTLPIWI 141


>gi|315281648|ref|ZP_07870227.1| histidine kinase domain-containing protein [Listeria marthii FSL
            S4-120]
 gi|313614704|gb|EFR88269.1| histidine kinase domain-containing protein [Listeria marthii FSL
            S4-120]
          Length = 601

 Score = 45.1 bits (105), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 56/123 (45%), Gaps = 30/123 (24%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------T 1231
               L   +  LS  LY +   ++ EL+QNA D I   + ++PTL               T
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDPTLEGKIHASLIGDINEKT 68

Query: 1232 FILQESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFR 1284
             IL+++GI       G + + + A L  + NS+K        SS  +IGR GIG  S F 
Sbjct: 69   LILEDNGI-------GLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFI 121

Query: 1285 VTD 1287
            V+D
Sbjct: 122  VSD 124


>gi|148553124|ref|YP_001260706.1| hypothetical protein Swit_0197 [Sphingomonas wittichii RW1]
 gi|148498314|gb|ABQ66568.1| hypothetical protein Swit_0197 [Sphingomonas wittichii RW1]
          Length = 1414

 Score = 45.1 bits (105), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 7/96 (7%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            H L EL+QNADD    E         ++    V  +N + F   +  ++C  G S K+  
Sbjct: 54   HILSELLQNADDAGASEA-----HVSIEGERFVFEHNGEDFIEAHFASICRFGYSNKRAL 108

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
                IG +GIGFKS F + D  E+++    V F+ +
Sbjct: 109  HT--IGFRGIGFKSTFSLGDRVELYTPSLAVAFEKA 142


>gi|258654836|ref|YP_003203992.1| hypothetical protein Namu_4727 [Nakamurella multipartita DSM 44233]
 gi|258558061|gb|ACV81003.1| hypothetical protein Namu_4727 [Nakamurella multipartita DSM 44233]
          Length = 1012

 Score = 44.7 bits (104), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 42/96 (43%), Gaps = 1/96 (1%)

Query: 1207 SHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG 1266
               L+EL  N  D     +V   L   L +  + V N     +A  +  L  +  S K+ 
Sbjct: 50   GRALVELAANGVDAARDADVPARLRIRLIDGELRVANTGAPLTAAGVAGLASLRASAKR- 108

Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT 1302
             SAG IG  G+GF +V   T AP + S    ++FD 
Sbjct: 109  DSAGSIGFFGVGFTAVLSWTQAPRVISTTGGIRFDA 144


>gi|318062490|ref|ZP_07981211.1| hypothetical protein SSA3_31430 [Streptomyces sp. SA3_actG]
          Length = 1051

 Score = 44.7 bits (104), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 74/167 (44%), Gaps = 14/167 (8%)

Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
            + V N+     A  + +L  +  S K+    G +GR G+GF +V  V+D P +      V
Sbjct: 94   LAVANSGAPLDASGVASLATLRASAKREDGTGTVGRFGVGFAAVLAVSDEPAVLGRTGGV 153

Query: 1299 KFDTSEGQIGFVLPTLVPPFNIDMFCR----LLSKDPVQLESKC---WNTCIRLPFRTKF 1351
            ++  +E +   V    VP  + ++  R     L + P   E      ++T + LP R   
Sbjct: 154  RWSLTEAREA-VAALRVPALDEEVRRREGHVPLLRLPFPAEGSAPEGYDTVVVLPLR--- 209

Query: 1352 SEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
             + +A + +  + + +  +LLL L  L  I+      +   V+R+++
Sbjct: 210  -DAVAEDLVERLLTGIDDALLLSLPGLAEIVVET--PEGTRVLRRRV 253


>gi|326670429|ref|XP_682866.5| PREDICTED: sacsin [Danio rerio]
          Length = 2235

 Score = 44.7 bits (104), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 48/186 (25%), Positives = 82/186 (44%), Gaps = 45/186 (24%)

Query: 1203 YSQDSHFLLELVQNADDN---------IYPENVEPTLTFILQESGIVVLN-------NEQ 1246
            Y ++S    EL+QNA+D             ++ +P  + I  + G+ + N       N +
Sbjct: 842  YDEESDIFKELLQNAEDAGATTCKFLLDLRKHRDPPESLI--DDGMALCNGPCLWIFNNE 899

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS---------NGFH 1297
             FS E+ R +  VG+++K+ +    IG+ G+GF +V+ V+D P I S         N  H
Sbjct: 900  LFSEEDWRNILKVGSASKE-NKVEMIGKFGLGFNAVYHVSDIPSILSGKKLLILDPNVTH 958

Query: 1298 ------------VKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKCWN-TCIR 1344
                        +K D  + ++    P    P+     C L     V+   + +N T I+
Sbjct: 959  LEKHIGSKGNPGIKLDPFQERLCKRFPGQFKPYEGIFDCDL----SVRNSKRAYNGTLIK 1014

Query: 1345 LPFRTK 1350
            LPFRT+
Sbjct: 1015 LPFRTQ 1020



 Score = 44.7 bits (104), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 49/104 (47%), Gaps = 15/104 (14%)

Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTF------ILQESGIVVLNNEQG 1247
            + Y      L EL+QNADD        ++ +    T         +LQ   + V NN + 
Sbjct: 1833 DAYPSKKDILKELIQNADDAQATEIHFVWDKRKHQTKKIFGKKWELLQGPALCVYNN-RT 1891

Query: 1248 FSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI 1291
            FS  +++ +  +G   K G+  G  G+ G+GF SV+ +TD P I
Sbjct: 1892 FSDADLQGIQQLGEGGKHGT-LGRTGKYGLGFNSVYHLTDCPSI 1934



 Score = 44.7 bits (104), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 49/111 (44%), Gaps = 13/111 (11%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQE-----SGIVVLNNEQGFSA 1250
            Y      L EL+QNADD        I+ E    T +   +E        +   N+  F+ 
Sbjct: 31   YPDGGQILKELIQNADDAGASTVVFIHDERHYETHSLWTKELRKYQGPALYAFNDAAFTE 90

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            E+   +  VG S K+      +GR GIGF SV+ +TD P + S+     FD
Sbjct: 91   EDWEGIQRVGRSIKQDDPTK-VGRFGIGFNSVYHITDLPCVFSSEHLAIFD 140


>gi|241258650|ref|YP_002978534.1| heat shock protein 90 [Rhizobium leguminosarum bv. trifolii WSM1325]
 gi|240863120|gb|ACS60783.1| heat shock protein Hsp90 [Rhizobium leguminosarum bv. trifolii
            WSM1325]
          Length = 630

 Score = 44.7 bits (104), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 56/132 (42%), Gaps = 30/132 (22%)

Query: 1186 KQHA---RLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
            +QHA    + R LH +   +YS    FL EL+ NA D              +  + ++P 
Sbjct: 10   EQHAFEADVSRLLHMMVHSVYSDKDIFLRELISNAADACEKLRYDSISQPQLSADGIQPG 69

Query: 1230 LTFILQESG--IVVLNNEQGFS-AENIRALCDVGNS-----------TKKGSSAGYIGRK 1275
            +   L E    +VV +N  G S +E I AL  +  S            K G  A  IG+ 
Sbjct: 70   ILVTLNEEALTLVVEDNGIGMSRSEMIDALGTIARSGTKAFMERLDAAKAGEKAELIGQF 129

Query: 1276 GIGFKSVFRVTD 1287
            G+GF S F V D
Sbjct: 130  GVGFYSSFMVAD 141


>gi|424876975|ref|ZP_18300634.1| molecular chaperone of HSP90 family [Rhizobium leguminosarum bv.
            viciae WSM1455]
 gi|393164578|gb|EJC64631.1| molecular chaperone of HSP90 family [Rhizobium leguminosarum bv.
            viciae WSM1455]
          Length = 630

 Score = 44.7 bits (104), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 59/139 (42%), Gaps = 30/139 (21%)

Query: 1186 KQHA---RLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
            +QHA    + R LH +   +YS    FL EL+ NA D              +  + ++P 
Sbjct: 10   EQHAFEADVSRLLHMMVHSVYSDKDVFLRELISNAADACEKLRYDSISEPRLSADGIQPG 69

Query: 1230 LTFILQESG--IVVLNNEQGFSA-ENIRALCDVGNS-----------TKKGSSAGYIGRK 1275
            +   L E    +VV +N  G S  E I AL  +  S           +K G  A  IG+ 
Sbjct: 70   ILVTLNEEALMLVVEDNGIGMSRNEMIDALGTIARSGTKAFMERLEASKAGEKAELIGQF 129

Query: 1276 GIGFKSVFRVTDAPEIHSN 1294
            G+GF S F V D  ++ S 
Sbjct: 130  GVGFYSSFMVADKVDVISR 148


>gi|402771382|ref|YP_006590919.1| Chaperone protein HtpG [Methylocystis sp. SC2]
 gi|401773402|emb|CCJ06268.1| Chaperone protein HtpG [Methylocystis sp. SC2]
          Length = 673

 Score = 44.7 bits (104), Expect = 0.60,   Method: Composition-based stats.
 Identities = 37/127 (29%), Positives = 56/127 (44%), Gaps = 24/127 (18%)

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLTFIL- 1234
            A + R L  ++  +YS+   FL ELV NA             D+ +  +   P +T  L 
Sbjct: 65   ADVARLLELMTHSVYSERDVFLRELVSNAADACEKLRYEALSDEKLAAQAAAPLVTISLD 124

Query: 1235 -QESGIVVLNNEQGFSAEN-IRALCDVGNSTK--------KGSSAGYIGRKGIGFKSVFR 1284
             ++  + + +N  G S E  I AL  + NS          K   AG+IG+ G+GF S F 
Sbjct: 125  KEKRTLTIADNGVGMSREELIGALGTIANSGTRAFLDKLGKEKDAGFIGQFGVGFYSAFM 184

Query: 1285 VTDAPEI 1291
            V    E+
Sbjct: 185  VAGRVEV 191


>gi|451820496|ref|YP_007456697.1| hypothetical protein DUF3883 [Clostridium saccharoperbutylacetonicum
            N1-4(HMT)]
 gi|451786475|gb|AGF57443.1| hypothetical protein DUF3883 [Clostridium saccharoperbutylacetonicum
            N1-4(HMT)]
          Length = 1614

 Score = 44.7 bits (104), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 133/330 (40%), Gaps = 50/330 (15%)

Query: 1210 LLELVQNADDN--IYPENVEPT--LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKK 1265
            LLEL+QNADD   I  E  E    +   L+++ ++V NN + FS E I++L    N + K
Sbjct: 43   LLELIQNADDACVIKKEQTEKKRIIHIRLKDNKLIVANNGEPFSREGIKSLM-YSNLSPK 101

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCR 1325
                  IG KG GF+S+   T +  I S   ++ F + E    F++  +    NI  F  
Sbjct: 102  AKKKNKIGNKGTGFRSILSWTKSIFIKSGDLYLNF-SKESAEKFLIDLMDKNQNISEFIE 160

Query: 1326 LLS--KDPVQLESKCWNTCIRLPFRTKFSEGI--------------AMNNIVDMFSDLHP 1369
              S  K P+          + +P    F EG                   I    S+L  
Sbjct: 161  EYSDFKYPI--------ATLIVPEMVDFREGYDEYDTYIELELIENVKEEIEKQISELDK 212

Query: 1370 SLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGDGIIKVSCGE------DKMTWFVASQKLR 1423
              +LFL+ L+ +   +   +  +   KKI+   I+K+   +      +  TW V   K R
Sbjct: 213  ESILFLNNLETLKIES--ENISIEFTKKIITSKIVKIVESDLLNGTFNNKTWNV---KKR 267

Query: 1424 AGVIRPDVKTTEIALALTLQESNEGNYGPLLYQQPVFAFLPLRTYGLKFILQGDFVLPSS 1483
             G +  D K  EI++A T    +  N     ++  V    P        I+ G F L  +
Sbjct: 268  EGQV--DGKNFEISIAYTDNLDDNKNILYSYFKTDVKFPFPA-------IIHGTFNLTGN 318

Query: 1484 REEVDGNSPWNQWLLSEFPALFVSAERSFC 1513
            R ++  ++  N+ LL     L +   +   
Sbjct: 319  RNQLVKDNEENKKLLQFLAELLIETAKEIA 348


>gi|423214844|ref|ZP_17201372.1| hypothetical protein HMPREF1074_02904 [Bacteroides xylanisolvens
            CL03T12C04]
 gi|392692107|gb|EIY85345.1| hypothetical protein HMPREF1074_02904 [Bacteroides xylanisolvens
            CL03T12C04]
          Length = 588

 Score = 44.7 bits (104), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN    +   L E   VV  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYNGRIDVFLNEDKTVVFRDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEITVES 124


>gi|326526959|dbj|BAK00868.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 161

 Score = 44.7 bits (104), Expect = 0.61,   Method: Composition-based stats.
 Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 13/111 (11%)

Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPTLTFIL-QESGIVVL-NNEQGFSA 1250
            Y + +  L EL+QNADD          ++        L   L Q  G  +L +N+  F+ 
Sbjct: 35   YPEGTTALRELIQNADDAGASRVRLCLDLRAHGASSLLAPALAQWQGPALLAHNDAAFTD 94

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            ++  ++  +G S KK S A    R G+GF SV+ +T+ P   S  + V FD
Sbjct: 95   DDFASISRIGGS-KKASQAWKTSRFGVGFNSVYHLTNLPSFVSGKYVVMFD 144


>gi|229489297|ref|ZP_04383161.1| hypothetical protein RHOER0001_0584 [Rhodococcus erythropolis SK121]
 gi|229323790|gb|EEN89547.1| hypothetical protein RHOER0001_0584 [Rhodococcus erythropolis SK121]
          Length = 1181

 Score = 44.7 bits (104), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 4/94 (4%)

Query: 1210 LLELVQNADDNIYPENVEP-TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS 1268
            L ELVQNA D +  ++     +  +L E  +   N     + E    +  +G S K+G  
Sbjct: 51   LFELVQNAADELRTDDHRGGKIHAVLTERYLYCANEGSPMTPEGADTILRMGVSKKRG-- 108

Query: 1269 AGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT 1302
             G IGR G+G KSV  VT  P+  S      FD 
Sbjct: 109  -GQIGRFGVGVKSVLSVTRTPQFFSKTGSFGFDA 141


>gi|149175056|ref|ZP_01853679.1| Molecular chaperone, HSP90 family protein [Planctomyces maris DSM
            8797]
 gi|148846034|gb|EDL60374.1| Molecular chaperone, HSP90 family protein [Planctomyces maris DSM
            8797]
          Length = 861

 Score = 44.7 bits (104), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 54/114 (47%), Gaps = 18/114 (15%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP---------------ENVEPTLTFILQESGI 1239
            +  L++ LYS+   F+ EL+QNA D I                 E+    L FI++++G+
Sbjct: 12   IQLLAKNLYSEKRVFIRELIQNAHDGILRRQSRESDAFSPRIDIESRPDELQFIIRDNGL 71

Query: 1240 VVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             +  N+ G   E +  +       +KG   G +G+ GIGF S F V +  E+ +
Sbjct: 72   GMDLNDIG---EYLAVIGRGATRLEKGDVTGLVGQFGIGFLSAFIVAERVEVET 122


>gi|405755020|ref|YP_006678484.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2540]
 gi|404224220|emb|CBY75582.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2540]
          Length = 601

 Score = 44.7 bits (104), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNAFIGRFGIGLLSCFIVSD 124


>gi|405968312|gb|EKC33393.1| Sacsin [Crassostrea gigas]
          Length = 1679

 Score = 44.3 bits (103), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 12/102 (11%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGI----VVLNNEQGFSAE 1251
            Y   S  + ELVQNA+D         Y E  +   + +   S      + ++N+  F+AE
Sbjct: 168  YPDGSQIIRELVQNAEDAGARRFEIFYKEGDDKPASQLSYRSYFRTPALCVHNDGVFTAE 227

Query: 1252 NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
            + + +  +  S K+   A  +GR G+GFKS+F +TD+P I S
Sbjct: 228  DWKGIKSIYTSVKE-KDALKVGRFGLGFKSIFHITDSPIIIS 268


>gi|431903117|gb|ELK09293.1| Sacsin [Pteropus alecto]
          Length = 653

 Score = 44.3 bits (103), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 48/105 (45%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q + + V NN   F+
Sbjct: 103  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAQYQGAALYVYNNAV-FT 161

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S 
Sbjct: 162  PEDWHGIQEIARSRKKDDPL-KVGRFGIGFNSVYHITDVPCIFSG 205


>gi|409078002|gb|EKM78366.1| hypothetical protein AGABI1DRAFT_114668 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 483

 Score = 44.3 bits (103), Expect = 0.73,   Method: Composition-based stats.
 Identities = 33/111 (29%), Positives = 42/111 (37%), Gaps = 13/111 (11%)

Query: 26  PSQPPPPHFNPNFAFQNPN----------IFFPNPALQLLQNLTNIPFQNPNFDIQNPNL 75
           PS P PP F P F   +PN             P+P+ QL   + N  FQ P F     N 
Sbjct: 351 PSSPMPPQFQPQFVSSSPNPQYLAPSPSAQLMPSPSPQLQPTMPNPSFQ-PAFGNSTANG 409

Query: 76  FMQNANLLTQNPNLP--PQQPPSSACNQQRSQTQHPPGASNQQRPLTQQPQ 124
           F       +  P +   P Q   S    Q   T +P     QQ+P+T   Q
Sbjct: 410 FAPGYMTPSPQPMMAAMPGQTQMSGMGMQPMGTFNPGMMQPQQQPMTMNGQ 460


>gi|422418435|ref|ZP_16495390.1| histidine kinase domain-containing protein [Listeria seeligeri FSL
            N1-067]
 gi|313634043|gb|EFS00725.1| histidine kinase domain-containing protein [Listeria seeligeri FSL
            N1-067]
          Length = 600

 Score = 44.3 bits (103), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 30/120 (25%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
            L   +  LS  LY +   ++ EL+QNA D I   + +EP++               T IL
Sbjct: 11   LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLEPSIEGEIHVSLSGKADEKTLIL 70

Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK-----KGSSAG-YIGRKGIGFKSVFRVTD 1287
            +++GI       G + E + A L  + NS+K     +G  A  +IGR GIG  S F V+D
Sbjct: 71   EDNGI-------GLTEEEVHAFLATIANSSKGEKKFEGQEANDFIGRFGIGLLSCFIVSD 123


>gi|295838149|ref|ZP_06825082.1| hsp90-like protein [Streptomyces sp. SPB74]
 gi|295826894|gb|EDY43651.2| hsp90-like protein [Streptomyces sp. SPB74]
          Length = 1079

 Score = 44.3 bits (103), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 74/167 (44%), Gaps = 14/167 (8%)

Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
            + V N+     A  + +L  +  S K+    G +GR G+GF +V  V+D P +      V
Sbjct: 122  LAVANSGAPLDAGGVASLATLRASAKREDGTGTVGRFGVGFAAVLAVSDEPAVLGRTGGV 181

Query: 1299 KFDTSEGQIGFVLPTLVPPFNIDMFCR----LLSKDPVQLESKC---WNTCIRLPFRTKF 1351
            ++  +E +   V    VP  + ++  R     L + P+  E      ++T + LP R   
Sbjct: 182  RWSLAEAREA-VAALGVPALDEEVRRREGHVPLLRLPLPAEGSAPEGYDTVVVLPLRDSV 240

Query: 1352 SEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKI 1398
            +E +    +  + + +  +LLL L  L  I+      +   V+R+++
Sbjct: 241  AEDL----VERLLAGIDDALLLSLPGLAEIVVET--PEGTRVLRRRV 281


>gi|432952627|ref|XP_004085167.1| PREDICTED: sacsin-like [Oryzias latipes]
          Length = 664

 Score = 44.3 bits (103), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 49/118 (41%), Gaps = 13/118 (11%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESG-----IVVLNNEQGFSA 1250
            Y      L EL+QNADD        I+ E    T +    + G      +   N   F+ 
Sbjct: 82   YPDGGQILKELIQNADDGQATEVVFIHDERRYGTESLWTNKLGKYQGPALYAYNNAAFTD 141

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQIG 1308
            E+   +   G S K  +    IGR GIGF SV+ +TD P I S+G     D  E   G
Sbjct: 142  EDWERIQMAGRSGKV-NDPNKIGRFGIGFNSVYHITDVPIIFSSGHLGVMDPQENIFG 198


>gi|289434206|ref|YP_003464078.1| HSP90 domain family protein [Listeria seeligeri serovar 1/2b str.
            SLCC3954]
 gi|289170450|emb|CBH26990.1| HSP90 domain family protein [Listeria seeligeri serovar 1/2b str.
            SLCC3954]
          Length = 600

 Score = 44.3 bits (103), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 57/120 (47%), Gaps = 30/120 (25%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
            L   +  LS  LY +   ++ EL+QNA D I   + +EP++               T IL
Sbjct: 11   LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLEPSIEGKIHVSLSGKADEKTLIL 70

Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK-----KGSSAG-YIGRKGIGFKSVFRVTD 1287
            +++GI       G + E + A L  + NS+K     +G  A  +IGR GIG  S F V+D
Sbjct: 71   EDNGI-------GLTEEEVHAFLATIANSSKGEKKFEGQEANDFIGRFGIGLLSCFIVSD 123


>gi|154246920|ref|YP_001417878.1| putative ATPase [Xanthobacter autotrophicus Py2]
 gi|154161005|gb|ABS68221.1| putative ATPase [Xanthobacter autotrophicus Py2]
          Length = 1406

 Score = 44.3 bits (103), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 45/96 (46%), Gaps = 7/96 (7%)

Query: 1208 HFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            H L EL+QNADD    E         ++    V  +N + F   +  ++C  G S K+  
Sbjct: 54   HILSELLQNADDAGASEA-----RVSIENDRFVFEHNGEDFIEAHFASICRFGYSNKRAL 108

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTS 1303
                IG +GIGFKS F + D  E+ +    V F+ +
Sbjct: 109  HT--IGFRGIGFKSTFSLGDRVELFTPTLAVAFERT 142


>gi|429730804|ref|ZP_19265450.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Corynebacterium durum F0235]
 gi|429147242|gb|EKX90272.1| ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein
            [Corynebacterium durum F0235]
          Length = 623

 Score = 43.9 bits (102), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 18/127 (14%)

Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLT 1231
            +K    LG  +  LS+ LYS    +L EL+QN              D+  Y  +    + 
Sbjct: 5    EKFRVDLGGIVELLSRNLYSGSDVYLRELLQNGVDAITANAQSGKDDEGTYSSDTSGRIR 64

Query: 1232 FILQESGIVVLNNEQGFSAENIR-ALCDVGNSTKKG----SSAGYIGRKGIGFKSVFRVT 1286
            F++    + V +N  G + +  R  L  +G S+K+       + Y+G+ GIG  S F V+
Sbjct: 65   FVVDGHTLRVTDNGVGLNIDEARNLLATIGGSSKRDEFGLGRSDYLGQFGIGLLSCFMVS 124

Query: 1287 DAPEIHS 1293
            D   ++S
Sbjct: 125  DNIVVYS 131


>gi|423100037|ref|ZP_17087744.1| conserved hypothetical protein TIGR03982 [Listeria innocua ATCC
            33091]
 gi|370793038|gb|EHN60876.1| conserved hypothetical protein TIGR03982 [Listeria innocua ATCC
            33091]
          Length = 601

 Score = 43.9 bits (102), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 30/120 (25%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
            L   +  LS  LY +   ++ EL+QNA D I   + +EPTL               T IL
Sbjct: 12   LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIEPTLNGKIHVSLSGEKNNKTLIL 71

Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            +++GI       G + + + A L  + NS+K         +  +IGR GIG  S F V+D
Sbjct: 72   EDNGI-------GLTEDEVHAFLATIANSSKGEKNFDGQDTNDFIGRFGIGLLSCFIVSD 124


>gi|348174622|ref|ZP_08881516.1| HSP90 family protein [Saccharopolyspora spinosa NRRL 18395]
          Length = 622

 Score = 43.9 bits (102), Expect = 0.94,   Method: Composition-based stats.
 Identities = 37/146 (25%), Positives = 60/146 (41%), Gaps = 30/146 (20%)

Query: 1176 ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD---------------- 1219
            + + E+++++     LG  +  LS+ +YS    +L EL+QN  D                
Sbjct: 1    MDSAENSLVRPFQVELGGVVDLLSRHIYSGAQIYLRELLQNGRDAVAARAKLPGQPDGGS 60

Query: 1220 -NIYPENVEPTL--TFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-----SAGY 1271
              IYP    P     F+ +++G+ +   E G        L  VG S+K+          Y
Sbjct: 61   LEIYPVGTAPDAPEAFVFRDAGLGLSAAEAG------ELLATVGKSSKRDDVLSLRDENY 114

Query: 1272 IGRKGIGFKSVFRVTDAPEIHSNGFH 1297
            +G+ GIG  S F V D   + S   H
Sbjct: 115  LGQFGIGLLSCFMVCDRIVVRSRSVH 140


>gi|422412394|ref|ZP_16489353.1| histidine kinase domain-containing protein [Listeria innocua FSL
            S4-378]
 gi|313619696|gb|EFR91318.1| histidine kinase domain-containing protein [Listeria innocua FSL
            S4-378]
          Length = 601

 Score = 43.9 bits (102), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 30/120 (25%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
            L   +  LS  LY +   ++ EL+QNA D I   + +EPTL               T IL
Sbjct: 12   LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIEPTLNGKIHVSLSGEKNNKTLIL 71

Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            +++GI       G + + + A L  + NS+K         +  +IGR GIG  S F V+D
Sbjct: 72   EDNGI-------GLTEDEVHAFLATIANSSKGEKNFDGQDTNDFIGRFGIGLLSCFIVSD 124


>gi|404413028|ref|YP_006698615.1| chaperone/heat shock protein [Listeria monocytogenes SLCC7179]
 gi|404238727|emb|CBY60128.1| chaperone/heat shock protein [Listeria monocytogenes SLCC7179]
          Length = 601

 Score = 43.9 bits (102), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|405968314|gb|EKC33395.1| Sacsin [Crassostrea gigas]
          Length = 1493

 Score = 43.9 bits (102), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 50/106 (47%), Gaps = 15/106 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPT-LTFI-----LQESGIVVLNNEQGFS 1249
            Y  +   L EL+QNADD        ++ +   PT L F       Q   +VV NN   FS
Sbjct: 1315 YPCNVSILNELIQNADDAGATEIHIVHDQRFHPTELIFDEKLADTQGPSLVVYNN-SCFS 1373

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
              +I  +  +G  +K GS     G+ GIGF +V+ +TD P   + G
Sbjct: 1374 KSDIEGISKLGEGSK-GSDPMTTGQFGIGFNAVYHITDVPSFLTIG 1418



 Score = 41.6 bits (96), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 53/125 (42%), Gaps = 15/125 (12%)

Query: 1201 ELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQESGI-------VVLNNEQ 1246
            E Y+     L E++QNADD        +Y E         L + G+       + + N+ 
Sbjct: 314  EEYADGVSVLKEMLQNADDAGATCLRILYDERTNENAKIGLIDKGMAECQGPAIWVYNDA 373

Query: 1247 GFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEGQ 1306
             F  ++ + +  +G  TK       IG+ G+GF SV+ +TD P I S    V  D     
Sbjct: 374  KFKEDDFQNIIKLGAGTKMADLTK-IGKFGLGFCSVYNITDVPSILSGDSLVILDPRMIH 432

Query: 1307 IGFVL 1311
            +G  L
Sbjct: 433  LGKAL 437


>gi|46907174|ref|YP_013563.1| HSP90 family protein [Listeria monocytogenes serotype 4b str. F2365]
 gi|405752162|ref|YP_006675627.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2378]
 gi|417315010|ref|ZP_12101698.1| HSP90 family protein [Listeria monocytogenes J1816]
 gi|424713821|ref|YP_007014536.1| Histidine kinase domain protein [Listeria monocytogenes serotype 4b
            str. LL195]
 gi|46880441|gb|AAT03740.1| histidine kinase domain protein [Listeria monocytogenes serotype 4b
            str. F2365]
 gi|328466963|gb|EGF38066.1| HSP90 family protein [Listeria monocytogenes J1816]
 gi|404221362|emb|CBY72725.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2378]
 gi|424013005|emb|CCO63545.1| Histidine kinase domain protein [Listeria monocytogenes serotype 4b
            str. LL195]
          Length = 601

 Score = 43.9 bits (102), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|333026262|ref|ZP_08454326.1| putative hsp90-like protein [Streptomyces sp. Tu6071]
 gi|332746114|gb|EGJ76555.1| putative hsp90-like protein [Streptomyces sp. Tu6071]
          Length = 1051

 Score = 43.9 bits (102), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 42/183 (22%), Positives = 76/183 (41%), Gaps = 25/183 (13%)

Query: 1239 IVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHV 1298
            + V N+     A  + +L  +  S K+    G +GR G+GF +V  V+D P +      V
Sbjct: 94   LAVANSGAPLDASGVASLATLRASAKREDGTGTVGRFGVGFAAVLAVSDEPAVLGRTGGV 153

Query: 1299 KFDTSEGQIGFVLPTLVPPFNIDMFCR----LLSKDPVQLESKC---WNTCIRLPFRTKF 1351
            ++  +E +   V    VP  + ++  R     L + P   E      ++T + LP R   
Sbjct: 154  RWSLTEAREA-VAALGVPALDEEVRRREGHVPLLRLPFPAEGSAPEGYDTVVVLPLR--- 209

Query: 1352 SEGIAMNNIVDMFSDLHPSLLLFLHRLQCI---------MFRNMLNDSLVVIRKKIVGDG 1402
             + +A + +  + + +  +LLL L  L  I         + R  + D   VI      DG
Sbjct: 210  -DAVAEDLVERLLTGIDDALLLSLPGLAEIVVETPEGTRLLRRRVEDPYTVIED----DG 264

Query: 1403 IIK 1405
             ++
Sbjct: 265  AVR 267


>gi|284801273|ref|YP_003413138.1| hypothetical protein LM5578_1023 [Listeria monocytogenes 08-5578]
 gi|284994415|ref|YP_003416183.1| hypothetical protein LM5923_0977 [Listeria monocytogenes 08-5923]
 gi|284056835|gb|ADB67776.1| hypothetical protein LM5578_1023 [Listeria monocytogenes 08-5578]
 gi|284059882|gb|ADB70821.1| hypothetical protein LM5923_0977 [Listeria monocytogenes 08-5923]
          Length = 601

 Score = 43.9 bits (102), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|93006822|ref|YP_581259.1| heat shock protein 90 [Psychrobacter cryohalolentis K5]
 gi|118575198|sp|Q1Q978.1|HTPG_PSYCK RecName: Full=Chaperone protein HtpG; AltName: Full=Heat shock
            protein HtpG; AltName: Full=High temperature protein G
 gi|92394500|gb|ABE75775.1| heat shock protein Hsp90 [Psychrobacter cryohalolentis K5]
          Length = 666

 Score = 43.9 bits (102), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 53/234 (22%), Positives = 90/234 (38%), Gaps = 61/234 (26%)

Query: 1184 LKKQ--HARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEP 1228
            LKK    A + + LH ++  LYS    F+ ELV NA             DD++Y ++ E 
Sbjct: 24   LKKHTFEAEVAQLLHLVTHSLYSNSDIFVRELVSNASDACDKLRFEATNDDSLYEDDGEL 83

Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRK 1275
             +   + E    +   + G       A+ ++G   K G+ A               IG+ 
Sbjct: 84   KIRIAVDEDAKTITFTDNGIGMNEADAIENLGTIAKSGTKAFLDKLSESQKQDGQLIGQF 143

Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFD-----TSEGQIGFVLPTLVPPFNIDMFCRLLSKD 1330
            G+GF S F V D   + +       D      S+G   F +                  +
Sbjct: 144  GVGFYSGFIVADTISVETRKAGEPADKGVRWVSDGTGSFTV------------------E 185

Query: 1331 PVQLESKCWNTCIRLPFRTKFSEGI-------AMNNIVDMFSDLHPSLLLFLHR 1377
            P+  E++   T I L  + ++SEG         +  +V+ +SD H SL + + +
Sbjct: 186  PITKETR--GTAITLHLKEQYSEGEESYLDRGKIKQLVNKYSD-HISLPIQMRK 236


>gi|254993169|ref|ZP_05275359.1| HSP90 family protein [Listeria monocytogenes FSL J2-064]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|254823788|ref|ZP_05228789.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL J1-194]
 gi|293593010|gb|EFG00771.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL J1-194]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|255026807|ref|ZP_05298793.1| HSP90 family protein [Listeria monocytogenes FSL J2-003]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|226223560|ref|YP_002757667.1| heat shock protein HtpG [Listeria monocytogenes serotype 4b str. CLIP
            80459]
 gi|254852435|ref|ZP_05241783.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL R2-503]
 gi|254931325|ref|ZP_05264684.1| histidine kinase domain-containing protein [Listeria monocytogenes
            HPB2262]
 gi|300764151|ref|ZP_07074146.1| histidine kinase [Listeria monocytogenes FSL N1-017]
 gi|386731696|ref|YP_006205192.1| HSP90 family protein [Listeria monocytogenes 07PF0776]
 gi|404280495|ref|YP_006681393.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2755]
 gi|404286354|ref|YP_006692940.1| chaperone/heat shock protein [Listeria monocytogenes serotype 7 str.
            SLCC2482]
 gi|405749298|ref|YP_006672764.1| chaperone/heat shock protein [Listeria monocytogenes ATCC 19117]
 gi|406703717|ref|YP_006754071.1| chaperone/heat shock protein [Listeria monocytogenes L312]
 gi|417317168|ref|ZP_12103791.1| HSP90 family protein [Listeria monocytogenes J1-220]
 gi|424822669|ref|ZP_18247682.1| Histidine kinase domain protein [Listeria monocytogenes str. Scott A]
 gi|225876022|emb|CAS04728.1| Putative heat shock protein HtpG [Listeria monocytogenes serotype 4b
            str. CLIP 80459]
 gi|258605743|gb|EEW18351.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL R2-503]
 gi|293582875|gb|EFF94907.1| histidine kinase domain-containing protein [Listeria monocytogenes
            HPB2262]
 gi|300515141|gb|EFK42193.1| histidine kinase [Listeria monocytogenes FSL N1-017]
 gi|328475415|gb|EGF46184.1| HSP90 family protein [Listeria monocytogenes J1-220]
 gi|332311349|gb|EGJ24444.1| Histidine kinase domain protein [Listeria monocytogenes str. Scott A]
 gi|384390454|gb|AFH79524.1| HSP90 family protein [Listeria monocytogenes 07PF0776]
 gi|404218498|emb|CBY69862.1| chaperone/heat shock protein [Listeria monocytogenes ATCC 19117]
 gi|404227130|emb|CBY48535.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2755]
 gi|404245283|emb|CBY03508.1| chaperone/heat shock protein [Listeria monocytogenes serotype 7 str.
            SLCC2482]
 gi|406360747|emb|CBY67020.1| chaperone/heat shock protein [Listeria monocytogenes L312]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|34365253|emb|CAE45964.1| hypothetical protein [Homo sapiens]
          Length = 731

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 15/104 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 6    YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 64

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S
Sbjct: 65   PEDWHGIQEIARSRKKDDPLK-VGRFGIGFNSVYHITDVPCIFS 107


>gi|418475524|ref|ZP_13044916.1| hypothetical protein SMCF_7960 [Streptomyces coelicoflavus ZG0656]
 gi|371543860|gb|EHN72628.1| hypothetical protein SMCF_7960 [Streptomyces coelicoflavus ZG0656]
          Length = 1042

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 37/77 (48%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L   L++  +V  N      A  + +L  +  S K+ +  G +GR G+GF +V  VTD P
Sbjct: 75   LRLTLRDGVLVAANTGAPLDATGVESLSTLRASAKRDAGDGSVGRFGVGFAAVLAVTDEP 134

Query: 1290 EIHSNGFHVKFDTSEGQ 1306
             +      V++  +E +
Sbjct: 135  AVVGRHGGVRWSLAEAR 151


>gi|404410184|ref|YP_006695772.1| chaperone/heat shock protein [Listeria monocytogenes SLCC5850]
 gi|404230010|emb|CBY51414.1| chaperone/heat shock protein [Listeria monocytogenes SLCC5850]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|254827887|ref|ZP_05232574.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL N3-165]
 gi|258600268|gb|EEW13593.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL N3-165]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|386043267|ref|YP_005962072.1| hypothetical protein [Listeria monocytogenes 10403S]
 gi|345536501|gb|AEO05941.1| hypothetical protein LMRG_02040 [Listeria monocytogenes 10403S]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|47096228|ref|ZP_00233826.1| histidine kinase domain protein [Listeria monocytogenes str. 1/2a
            F6854]
 gi|254911626|ref|ZP_05261638.1| histidine kinase domain-containing protein [Listeria monocytogenes
            J2818]
 gi|254935952|ref|ZP_05267649.1| histidine kinase domain-containing protein [Listeria monocytogenes
            F6900]
 gi|386046604|ref|YP_005964936.1| histidine kinase domain-containing protein [Listeria monocytogenes
            J0161]
 gi|47015373|gb|EAL06308.1| histidine kinase domain protein [Listeria monocytogenes str. 1/2a
            F6854]
 gi|258608540|gb|EEW21148.1| histidine kinase domain-containing protein [Listeria monocytogenes
            F6900]
 gi|293589573|gb|EFF97907.1| histidine kinase domain-containing protein [Listeria monocytogenes
            J2818]
 gi|345533595|gb|AEO03036.1| histidine kinase domain-containing protein [Listeria monocytogenes
            J0161]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|386053211|ref|YP_005970769.1| histidine kinase domain-containing protein [Listeria monocytogenes
            Finland 1998]
 gi|346645862|gb|AEO38487.1| histidine kinase domain-containing protein [Listeria monocytogenes
            Finland 1998]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|47093853|ref|ZP_00231596.1| histidine kinase domain protein [Listeria monocytogenes str. 4b
            H7858]
 gi|47017762|gb|EAL08552.1| histidine kinase domain protein [Listeria monocytogenes str. 4b
            H7858]
          Length = 601

 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRTRKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|16800010|ref|NP_470278.1| HSP90 family protein [Listeria innocua Clip11262]
 gi|16413400|emb|CAC96172.1| lin0941 [Listeria innocua Clip11262]
          Length = 564

 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 55/120 (45%), Gaps = 30/120 (25%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
            L   +  LS  LY +   ++ EL+QNA D I   + +EPTL               T IL
Sbjct: 12   LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIEPTLNGKIHVSLSGEKNNKTLIL 71

Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            +++GI       G + + + A L  + NS+K         +  +IGR GIG  S F V+D
Sbjct: 72   EDNGI-------GLTEDEVHAFLATIANSSKGEKNFDGQDTNDFIGRFGIGLLSCFIVSD 124


>gi|386039035|ref|YP_005957989.1| chaperone protein HtpG [Paenibacillus polymyxa M1]
 gi|343095073|emb|CCC83282.1| chaperone protein HtpG [Paenibacillus polymyxa M1]
          Length = 634

 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 60/134 (44%), Gaps = 37/134 (27%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPE-----------------NVEPTLT-- 1231
            LG  +  L+  LYS    F+ E++QNA D I                     +E T++  
Sbjct: 16   LGGMIDILANHLYSSPRVFIREILQNATDAITARKELRKTEAESRSLKEDNTLEDTVSES 75

Query: 1232 ------FILQESG------IVVLNNEQGFSAENI-RALCDVGNSTKKG-----SSAGYIG 1273
                   I++ SG      +++ +N  G S E+I R L  +G S+KKG     S   +IG
Sbjct: 76   NDYEGKVIVELSGAGDQQVLMIQDNGIGLSEEDIHRFLSIIGQSSKKGADLLTSETSFIG 135

Query: 1274 RKGIGFKSVFRVTD 1287
            R GIG  S F V+D
Sbjct: 136  RFGIGLLSCFMVSD 149


>gi|393786767|ref|ZP_10374899.1| hypothetical protein HMPREF1068_01179 [Bacteroides nordii CL02T12C05]
 gi|392658002|gb|EIY51632.1| hypothetical protein HMPREF1068_01179 [Bacteroides nordii CL02T12C05]
          Length = 588

 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN  D I       EN    +   L ++G +   +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNCVDAITALRNIDENYSGRIDVYLNDNGSLTFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  YIGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRETPDADDYIGRFGIGLLSCFVVTNEITVES 124


>gi|423248197|ref|ZP_17229213.1| hypothetical protein HMPREF1066_00223 [Bacteroides fragilis
            CL03T00C08]
 gi|423253146|ref|ZP_17234077.1| hypothetical protein HMPREF1067_00721 [Bacteroides fragilis
            CL03T12C07]
 gi|392657046|gb|EIY50683.1| hypothetical protein HMPREF1067_00721 [Bacteroides fragilis
            CL03T12C07]
 gi|392660304|gb|EIY53918.1| hypothetical protein HMPREF1066_00223 [Bacteroides fragilis
            CL03T00C08]
          Length = 588

 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|310639839|ref|YP_003944597.1| hypothetical protein [Paenibacillus polymyxa SC2]
 gi|309244789|gb|ADO54356.1| ATPase, histidine kinase-, DNA gyrase B-, and HSP90-like domain
            protein [Paenibacillus polymyxa SC2]
          Length = 631

 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 60/134 (44%), Gaps = 37/134 (27%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPE-----------------NVEPTLT-- 1231
            LG  +  L+  LYS    F+ E++QNA D I                     +E T++  
Sbjct: 13   LGGMIDILANHLYSSPRVFIREILQNATDAITARKELRKTEAESRSLKEDNTLEDTVSES 72

Query: 1232 ------FILQESG------IVVLNNEQGFSAENI-RALCDVGNSTKKG-----SSAGYIG 1273
                   I++ SG      +++ +N  G S E+I R L  +G S+KKG     S   +IG
Sbjct: 73   NDYEGKVIVELSGAGDQQVLMIQDNGIGLSEEDIHRFLSIIGQSSKKGADLLTSETSFIG 132

Query: 1274 RKGIGFKSVFRVTD 1287
            R GIG  S F V+D
Sbjct: 133  RFGIGLLSCFMVSD 146


>gi|53711528|ref|YP_097520.1| heat shock protein 90 [Bacteroides fragilis YCH46]
 gi|52214393|dbj|BAD46986.1| heat shock protein HtpG [Bacteroides fragilis YCH46]
          Length = 588

 Score = 43.9 bits (102), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|375356574|ref|YP_005109345.1| putative heat-shock protein [Bacteroides fragilis 638R]
 gi|383116538|ref|ZP_09937286.1| hypothetical protein BSHG_1389 [Bacteroides sp. 3_2_5]
 gi|251948188|gb|EES88470.1| hypothetical protein BSHG_1389 [Bacteroides sp. 3_2_5]
 gi|301161254|emb|CBW20792.1| putative heat-shock protein [Bacteroides fragilis 638R]
          Length = 588

 Score = 43.9 bits (102), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|60679789|ref|YP_209933.1| HSP90 family protein [Bacteroides fragilis NCTC 9343]
 gi|60491223|emb|CAH05971.1| putative heat-shock protein [Bacteroides fragilis NCTC 9343]
          Length = 588

 Score = 43.5 bits (101), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|297200503|ref|ZP_06917900.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
 gi|297147668|gb|EFH28702.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
          Length = 1083

 Score = 43.5 bits (101), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 46/217 (21%), Positives = 85/217 (39%), Gaps = 32/217 (14%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L   L+E  +   N      A  + +L  +  S K+ ++ G +GR G+GF +V  VTD P
Sbjct: 75   LRLTLREGVLFAANTGAPLDAAGVESLSTLRASAKRDNADGSVGRFGVGFAAVLAVTDEP 134

Query: 1290 EIHSNGFHVKFDTSE-----GQIGFVLPTL----------VPPFNIDMFCRLLSKDPVQL 1334
             +      V++  +E      +     P L          VP   +       + DP   
Sbjct: 135  AVVGRHGGVRWSLAEARDHAAETARHSPGLGDEMRRRDGHVPLLRLPFAAEGTAPDP--- 191

Query: 1335 ESKCWNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVI 1394
                ++T + LP R   +  +A   +++   D   +LLL L  L+ ++      D+   I
Sbjct: 192  ----YDTVVILPLRDTAAADLA-ERLLNAVDD---ALLLALPGLEEVVIE---TDTTRTI 240

Query: 1395 RKKIVGDGIIKVSCGEDKMTWFVASQKLRAGVIRPDV 1431
             ++  G+  +     +    W  A+     G + PD+
Sbjct: 241  SRRTEGELTVVEDSRDGTTRWRTAAAH---GQLTPDL 274


>gi|265764922|ref|ZP_06093197.1| heat shock protein HtpG [Bacteroides sp. 2_1_16]
 gi|263254306|gb|EEZ25740.1| heat shock protein HtpG [Bacteroides sp. 2_1_16]
          Length = 588

 Score = 43.5 bits (101), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|423269904|ref|ZP_17248876.1| hypothetical protein HMPREF1079_01958 [Bacteroides fragilis
            CL05T00C42]
 gi|423272641|ref|ZP_17251588.1| hypothetical protein HMPREF1080_00241 [Bacteroides fragilis
            CL05T12C13]
 gi|392700750|gb|EIY93912.1| hypothetical protein HMPREF1079_01958 [Bacteroides fragilis
            CL05T00C42]
 gi|392708718|gb|EIZ01823.1| hypothetical protein HMPREF1080_00241 [Bacteroides fragilis
            CL05T12C13]
          Length = 588

 Score = 43.5 bits (101), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|336407723|ref|ZP_08588219.1| hypothetical protein HMPREF1018_00234 [Bacteroides sp. 2_1_56FAA]
 gi|335944802|gb|EGN06619.1| hypothetical protein HMPREF1018_00234 [Bacteroides sp. 2_1_56FAA]
          Length = 588

 Score = 43.5 bits (101), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|390166348|ref|ZP_10218611.1| heat shock protein 90 [Sphingobium indicum B90A]
 gi|389590745|gb|EIM68730.1| heat shock protein 90 [Sphingobium indicum B90A]
          Length = 626

 Score = 43.5 bits (101), Expect = 1.2,   Method: Composition-based stats.
 Identities = 35/132 (26%), Positives = 55/132 (41%), Gaps = 27/132 (20%)

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLTFILQ 1235
            A + R LH +   +YS    FL EL+ NA             D  +  ++ +P +T  L 
Sbjct: 16   ADVARLLHMMVHSVYSDKDVFLRELISNAADACEKLRYELLSDPALAGDDGQPRITVTLD 75

Query: 1236 ESG--IVVLNNEQGFSAENI------------RALCDVGNSTKKGSSAGYIGRKGIGFKS 1281
                 ++V +N  G S   +            RA  +   + K+G  A  IG+ G+GF S
Sbjct: 76   PEARRLIVEDNGIGMSEAELAEALGTIARSGTRAFMERVTAAKEGEGAQLIGQFGVGFYS 135

Query: 1282 VFRVTDAPEIHS 1293
             F V D  ++ S
Sbjct: 136  AFMVADKVDVFS 147


>gi|294012641|ref|YP_003546101.1| molecular chaperone HtpG [Sphingobium japonicum UT26S]
 gi|292675971|dbj|BAI97489.1| molecular chaperone HtpG [Sphingobium japonicum UT26S]
          Length = 626

 Score = 43.5 bits (101), Expect = 1.2,   Method: Composition-based stats.
 Identities = 35/132 (26%), Positives = 55/132 (41%), Gaps = 27/132 (20%)

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLTFILQ 1235
            A + R LH +   +YS    FL EL+ NA             D  +  ++ +P +T  L 
Sbjct: 16   ADVARLLHMMVHSVYSDKDVFLRELISNAADACEKLRYELLSDPALAGDDGQPRITVTLD 75

Query: 1236 ESG--IVVLNNEQGFSAENI------------RALCDVGNSTKKGSSAGYIGRKGIGFKS 1281
                 ++V +N  G S   +            RA  +   + K+G  A  IG+ G+GF S
Sbjct: 76   PEARRLIVEDNGIGMSEAELAEALGTIARSGTRAFMERVTAAKEGEGAQLIGQFGVGFYS 135

Query: 1282 VFRVTDAPEIHS 1293
             F V D  ++ S
Sbjct: 136  AFMVADKVDVFS 147


>gi|227495919|ref|ZP_03926230.1| HSP90 family heat shock protein [Actinomyces urogenitalis DSM 15434]
 gi|226834535|gb|EEH66918.1| HSP90 family heat shock protein [Actinomyces urogenitalis DSM 15434]
          Length = 609

 Score = 43.5 bits (101), Expect = 1.2,   Method: Composition-based stats.
 Identities = 39/132 (29%), Positives = 60/132 (45%), Gaps = 18/132 (13%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI-----YPENVEPTLTFILQ-- 1235
            M  +    LG  +  LS+ LYS    +L EL+QNA D +      PE  +  +   L+  
Sbjct: 1    MADRFQVDLGGMVDLLSRHLYSGPQVYLRELLQNAVDAMTARGQLPEPGQEPMRVRLRTG 60

Query: 1236 -----ESGIVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAG-----YIGRKGIGFKSVFR 1284
                 +  + V +   G +A+  R  L  +G S+K+ S  G     YIG+ GIG  + F 
Sbjct: 61   TAPDGQPTLSVTDTGIGLTADQARELLATIGRSSKRDSLLGEGRREYIGQFGIGMLAAFM 120

Query: 1285 VTDAPEIHSNGF 1296
            V D  E+ S  +
Sbjct: 121  VADEIEVISRSY 132


>gi|302825729|ref|XP_002994455.1| hypothetical protein SELMODRAFT_432376 [Selaginella moellendorffii]
 gi|300137604|gb|EFJ04482.1| hypothetical protein SELMODRAFT_432376 [Selaginella moellendorffii]
          Length = 884

 Score = 43.5 bits (101), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 84/346 (24%), Positives = 142/346 (41%), Gaps = 58/346 (16%)

Query: 1459 VFAFLPLRTY-GLKFILQGDFVLPSSREEVDGNSPWNQWLLSEFPALFVSAERSFCDLPC 1517
            +F+FLP   + GL F++  DF+L SSR  +  +S WNQ +    P +F +A     +   
Sbjct: 161  IFSFLPSDIHSGLPFLINSDFLLVSSRATLLFDSSWNQGIFRCVPEVFYNAFELLLNTTA 220

Query: 1518 FR-ENPAKAASVYM-SFVPLVGEVHGFFSGLPRMILSKLRMSNCLILEGNNNQWAPP--- 1572
            F  ++    A + M  +VP+    + +   +   ILSKL+ +N ++   +  + APP   
Sbjct: 221  FGLQSILSVARMAMYKYVPVNVCRNSYLERVRNDILSKLKDNNVILCASDVGR-APPSGQ 279

Query: 1573 -CKVLRGWNDRAHSLLP---DILLQKHLGLGFL--SKDIVLSDSLARALGIEEYGPKILL 1626
             CK        A  + P   ++L        FL  S   ++  +L      E YG  + L
Sbjct: 280  LCK-----PALARKIAPSFRELLKAAPKCPRFLRGSPSYIVDSTLQ-----ENYGSVLRL 329

Query: 1627 QIISSLCRTENGLRSMGLSWLASWLNELYT--ISFHSSGQSSLQS-------GVETDL-- 1675
              +  +   E         W+ S  ++ Y   + F SS  S+L+S       G +  L  
Sbjct: 330  LGLPDMSNDEYHQSLDSEDWMRSLPDDAYVKLLVFLSSKSSTLRSLKFLKYRGSDGRLQL 389

Query: 1676 --IDNLQRIPFIPLSDGTFSSVDEG---TIWLHSDCSVFDGGFGLEAFPNLC-AKLRT-- 1727
              +D+L R      +DG     D G   T WL     VF         P+ C AK+R   
Sbjct: 390  ASMDDLAR------NDGRSMYWDSGHRATAWLAGWTEVFQRWLTFNFVPSACAAKIRQHS 443

Query: 1728 ----------VSPALLSASAVDKSSLGVISVDNLNRMLLKIGVQQL 1763
                       S + L+A ++ +  LG ++ ++ N  L  + +  L
Sbjct: 444  VLENWLERTPFSVSELTACSLARQVLGAVTANSSNAKLAVLAMHFL 489


>gi|149277937|ref|ZP_01884076.1| putative heat-shock protein [Pedobacter sp. BAL39]
 gi|149231135|gb|EDM36515.1| putative heat-shock protein [Pedobacter sp. BAL39]
          Length = 594

 Score = 43.5 bits (101), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 57/120 (47%), Gaps = 12/120 (10%)

Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFI 1233
            ME N    Q   L   +  LS+ LYS  + F+ EL+QN  D I       E+ E  L   
Sbjct: 1    MEDNQYLFQ-VNLKGMISLLSEHLYSNPNTFVRELLQNGIDAITALKTLDEHHEGALNVH 59

Query: 1234 LQESG---IVVLNNEQGFSAENI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTD 1287
            L E G   +V  +N  G   +   R L  +G S+K+ +  +  YIG+ GIG  S F V+D
Sbjct: 60   LPEEGQPELVFEDNGIGLKEDEFHRFLAVIGESSKRDAFEAKDYIGKFGIGLLSCFVVSD 119


>gi|405955306|gb|EKC22471.1| Sacsin [Crassostrea gigas]
          Length = 3200

 Score = 43.5 bits (101), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 83/214 (38%), Gaps = 39/214 (18%)

Query: 1203 YSQDSHFLLELVQNADD----------NIYPENVEPT-----LTFILQESGIVVLNNEQG 1247
            Y  D   L EL+QNA+D          +  P   EP+          +   +VV NN + 
Sbjct: 177  YPDDGQILKELIQNAEDAGASEMKILYDDRPAVQEPSTKRAPFRKYFKGPALVVYNNAEF 236

Query: 1248 FSAE--NIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSEG 1305
              A+   I+ L     S+ K      +GR G+GFKSVF +TD P I S    +  D  + 
Sbjct: 237  IEADWTGIKMLY----SSIKEFDKTKVGRFGLGFKSVFHITDHPVIISGDQLLVLDPHQD 292

Query: 1306 QIGFVLPTLVPPFN----------IDMFCRLLSKDPVQLESKCWN-TCIRLPFR---TKF 1351
                     +   +          +  F  +   D   LE   +N T  R P R   T+ 
Sbjct: 293  SSKVCQTMKLKKLHRYKKMKVEDCLKAFSGVFGFDQNTLECGHFNGTIFRFPLRQEETEL 352

Query: 1352 SEGIAMNNIVD----MFSDLHPSLLLFLHRLQCI 1381
            S+ I   + VD     F D  P  LLFL  L+ I
Sbjct: 353  SDNIYDKSKVDDLFMSFKDEAPVSLLFLKCLESI 386


>gi|441470598|emb|CCQ20353.1| Chaperone protein htpG [Listeria monocytogenes]
 gi|441473731|emb|CCQ23485.1| Chaperone protein htpG [Listeria monocytogenes N53-1]
          Length = 433

 Score = 43.5 bits (101), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 54/122 (44%), Gaps = 16/122 (13%)

Query: 1182 NMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTF 1232
            N   +    L   +  LS  LY +   ++ EL+QNA D I             +  +LT 
Sbjct: 3    NYSHRFQVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTG 62

Query: 1233 ILQESGIVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRV 1285
               E  ++V +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V
Sbjct: 63   DNNEKTLIVEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIV 122

Query: 1286 TD 1287
            +D
Sbjct: 123  SD 124


>gi|167034745|ref|YP_001669976.1| heat shock protein 90 [Pseudomonas putida GB-1]
 gi|166861233|gb|ABY99640.1| heat shock protein Hsp90 [Pseudomonas putida GB-1]
          Length = 634

 Score = 43.5 bits (101), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 90/222 (40%), Gaps = 59/222 (26%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDN---------IYPENVEPTLTFIL-----QESGIV 1240
            LH +   LYS    FL EL+ NA D            PE +E      +     +++G V
Sbjct: 20   LHLMIHSLYSNKEIFLRELISNASDAADKLRFEALAKPELLEGDADLKIRLSFDKDAGTV 79

Query: 1241 VL-NNEQGFSAENIRALCDVGNSTKKGSS-------------AGYIGRKGIGFKSVFRVT 1286
             L +N  G S E++  +  +G   K G++             +  IG+ G+GF S F V 
Sbjct: 80   TLEDNGIGMSREDV--IAHLGTIAKSGTADFMKNLTGDQKKDSHLIGQFGVGFYSAFIVA 137

Query: 1287 DAPEIHSN--------GFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESKC 1338
            D  +++S         G H    +S+G+  F + T+  P         L KD ++     
Sbjct: 138  DKVDVYSRRAGQPAAEGVHW---SSKGEGEFEVATIDKPQRGTRIVLHLKKDELE----- 189

Query: 1339 WNTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQC 1380
                        F++G  + N+V  +SD H +L + L + Q 
Sbjct: 190  ------------FADGWRLRNVVKKYSD-HIALPIQLPKEQA 218


>gi|398378376|ref|ZP_10536539.1| molecular chaperone of HSP90 family [Rhizobium sp. AP16]
 gi|397724975|gb|EJK85433.1| molecular chaperone of HSP90 family [Rhizobium sp. AP16]
          Length = 629

 Score = 43.5 bits (101), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 61/142 (42%), Gaps = 30/142 (21%)

Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENV 1226
            ES++ +   ARL   LH +   +YS    FL EL+ NA D             ++   + 
Sbjct: 11   ESHVFEADVARL---LHMMVHSVYSDKDVFLRELISNAADACEKLRYEAIATPSLLASDP 67

Query: 1227 EPTLTFILQESG--IVVLNNEQGFSA-ENIRALCDVGNS-----------TKKGSSAGYI 1272
            +  +   L E G  +VV +N  G S  E I AL  +  S            K+G  A  I
Sbjct: 68   DSRILLTLDEEGSRLVVEDNGIGMSRDEMIEALGTIARSGTRAFMERIEANKQGEGAQLI 127

Query: 1273 GRKGIGFKSVFRVTDAPEIHSN 1294
            G+ G+GF S F V D  ++ S 
Sbjct: 128  GQFGVGFYSCFMVADRVDVVSR 149


>gi|16802982|ref|NP_464467.1| HSP90 family protein [Listeria monocytogenes EGD-e]
 gi|386049870|ref|YP_005967861.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL R2-561]
 gi|404283385|ref|YP_006684282.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2372]
 gi|405757941|ref|YP_006687217.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2479]
 gi|16410344|emb|CAC99020.1| lmo0942 [Listeria monocytogenes EGD-e]
 gi|346423716|gb|AEO25241.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL R2-561]
 gi|404232887|emb|CBY54290.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2372]
 gi|404235823|emb|CBY57225.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2479]
          Length = 601

 Score = 43.5 bits (101), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 52/116 (44%), Gaps = 16/116 (13%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPE---------NVEPTLTFILQESG 1238
               L   +  LS  LY +   ++ EL+QNA D I             +  +LT    E  
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKIDSTLEGKIHASLTGDNNEKT 68

Query: 1239 IVVLNNEQGFSAENIRA-LCDVGNSTK------KGSSAGYIGRKGIGFKSVFRVTD 1287
            +++ +N  G + + + A L  + NS+K        SS  +IGR GIG  S F V+D
Sbjct: 69   LIIEDNGIGLTEDEVHAFLATIANSSKGEKNFDGESSNDFIGRFGIGLLSCFIVSD 124


>gi|119628728|gb|EAX08323.1| spastic ataxia of Charlevoix-Saguenay (sacsin) [Homo sapiens]
          Length = 1313

 Score = 43.5 bits (101), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 47/105 (44%), Gaps = 15/105 (14%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNNEQGFS 1249
            Y +    L EL+QNA+D        +Y E    T T         Q   + V NN   F+
Sbjct: 658  YPEGGQILKELIQNAEDAGATEVKFLYDETQYGTETLWSKDMAPYQGPALYVYNNAV-FT 716

Query: 1250 AENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             E+   + ++  S KK      +GR GIGF SV+ +TD P I S 
Sbjct: 717  PEDWHGIQEIARSRKKDDPL-KVGRFGIGFNSVYHITDVPCIFSG 760


>gi|222082149|ref|YP_002541514.1| heat shock protein 90 [Agrobacterium radiobacter K84]
 gi|221726828|gb|ACM29917.1| heat shock chaperone protein [Agrobacterium radiobacter K84]
          Length = 629

 Score = 43.5 bits (101), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 60/142 (42%), Gaps = 30/142 (21%)

Query: 1180 ESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNI----YPENVEPTLT---- 1231
            ES++ +   ARL   LH +   +YS    FL EL+ NA D      Y     P+L     
Sbjct: 11   ESHVFEADVARL---LHMMVHSVYSDKDVFLRELISNAADACEKLRYEAIATPSLLASDP 67

Query: 1232 -----FILQESG--IVVLNNEQGFSA-ENIRALCDVGNS-----------TKKGSSAGYI 1272
                   L E G  +VV +N  G S  E I AL  +  S            K+G  A  I
Sbjct: 68   DSRILLTLDEEGSRLVVEDNGIGMSRDEMIEALGTIARSGTRAFMERIEANKQGEGAQLI 127

Query: 1273 GRKGIGFKSVFRVTDAPEIHSN 1294
            G+ G+GF S F V D  ++ S 
Sbjct: 128  GQFGVGFYSCFMVADRVDVVSR 149


>gi|315302374|ref|ZP_07873248.1| histidine kinase domain-containing protein [Listeria ivanovii FSL
            F6-596]
 gi|313629261|gb|EFR97520.1| histidine kinase domain-containing protein [Listeria ivanovii FSL
            F6-596]
          Length = 600

 Score = 43.5 bits (101), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 56/120 (46%), Gaps = 30/120 (25%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------TFIL 1234
            L   +  LS  LY +   ++ EL+QNA D I   + +EP +               T IL
Sbjct: 11   LAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLEPEIEGEIHVSLSGKPDEKTLIL 70

Query: 1235 QESGIVVLNNEQGFSAENIRA-LCDVGNSTK-----KGSSAG-YIGRKGIGFKSVFRVTD 1287
            +++GI       G + E + A L  + NS+K     +G  A  +IGR GIG  S F V+D
Sbjct: 71   EDNGI-------GLTEEEVHAFLATIANSSKGEKKFEGQEANDFIGRFGIGLLSCFIVSD 123


>gi|416356103|ref|ZP_11681929.1| hypothetical protein CBCST_12012 [Clostridium botulinum C str.
            Stockholm]
 gi|338195095|gb|EGO87427.1| hypothetical protein CBCST_12012 [Clostridium botulinum C str.
            Stockholm]
          Length = 515

 Score = 43.1 bits (100), Expect = 1.5,   Method: Composition-based stats.
 Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 9/87 (10%)

Query: 1212 ELVQNADDNIYPENVEPTLTFILQESG----IVVLNNEQGFSAENIRALCDVGNSTKKGS 1267
            E+VQNA+DN   +N    L  IL + G    + +   + GFS ENI    +    TK+ +
Sbjct: 94   EIVQNANDNTTSDN----LDIILSKKGNIYEMTLQYEDIGFSVENIVGFFNTEIHTKRDN 149

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             +   G+ G+G KS+F   D  +I SN
Sbjct: 150  FSA-TGKHGVGIKSLFYFVDYMKIESN 175


>gi|350589712|ref|XP_003357851.2| PREDICTED: sacsin [Sus scrofa]
          Length = 1980

 Score = 43.1 bits (100), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 19/123 (15%)

Query: 1201 ELYSQDSHFLLELVQNADD----------------NIYPENVEPTLTFILQESGIVVLNN 1244
            E Y   S    EL+QNADD                +I    ++P +        +   NN
Sbjct: 1336 EEYPSVSDIFKELLQNADDANATECSFLIDMRRNLDIRESLLDPGMA-ACHGPALWSFNN 1394

Query: 1245 EQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDTSE 1304
             + FS  +   +  +G S K+G     +G+ G+GF SV+ +TD P I S  F + FD + 
Sbjct: 1395 SE-FSDADFVNITRLGESLKRGE-VDKVGKFGLGFNSVYHITDIPIIMSREFMIMFDPNI 1452

Query: 1305 GQI 1307
              I
Sbjct: 1453 NHI 1455


>gi|456822983|gb|EMF71453.1| GHKL domain protein [Leptospira interrogans serovar Canicola str.
            LT1962]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|290893499|ref|ZP_06556483.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL J2-071]
 gi|404407395|ref|YP_006690110.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2376]
 gi|290557000|gb|EFD90530.1| histidine kinase domain-containing protein [Listeria monocytogenes
            FSL J2-071]
 gi|404241544|emb|CBY62944.1| chaperone/heat shock protein [Listeria monocytogenes SLCC2376]
          Length = 601

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 30/123 (24%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------T 1231
               L   +  LS  LY +   ++ EL+QNA D I   + + PTL               T
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLTPTLEGKILVSLTGDTNEKT 68

Query: 1232 FILQESGIVVLNNEQGFSAENIRA-LCDVGNSTKKGS------SAGYIGRKGIGFKSVFR 1284
             IL+++GI       G + E + A L  + NS+K         S  +IGR GIG  S F 
Sbjct: 69   LILEDNGI-------GLTEEEVHAFLATIANSSKGEKDFDGERSNDFIGRFGIGLLSCFI 121

Query: 1285 VTD 1287
            V+D
Sbjct: 122  VSD 124


>gi|148745189|gb|AAI42917.1| LOC555303 protein [Danio rerio]
          Length = 304

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 49/111 (44%), Gaps = 13/111 (11%)

Query: 1203 YSQDSHFLLELVQNADDN-------IYPENVEPTLTFILQE-----SGIVVLNNEQGFSA 1250
            Y      L EL+QNADD        I+ E    T +   +E        +   N+  F+ 
Sbjct: 31   YPDGGQILKELIQNADDAGASTVVFIHDERHYETHSLWTKELRKYQGPALYAFNDAAFTE 90

Query: 1251 ENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFD 1301
            E+   +  VG S K+      +GR GIGF SV+ +TD P + S+     FD
Sbjct: 91   EDWEGIQRVGRSIKQDDPTK-VGRFGIGFNSVYHITDLPCVFSSEHLAIFD 140


>gi|24213931|ref|NP_711412.1| HSP90 family protein [Leptospira interrogans serovar Lai str. 56601]
 gi|45658311|ref|YP_002397.1| HSP90 family protein [Leptospira interrogans serovar Copenhageni str.
            Fiocruz L1-130]
 gi|386073466|ref|YP_005987783.1| protein HtpG [Leptospira interrogans serovar Lai str. IPAV]
 gi|421084773|ref|ZP_15545629.1| GHKL domain protein [Leptospira santarosai str. HAI1594]
 gi|421103352|ref|ZP_15563952.1| GHKL domain protein [Leptospira interrogans serovar
            Icterohaemorrhagiae str. Verdun LP]
 gi|24194785|gb|AAN48430.1| HtpG [Leptospira interrogans serovar Lai str. 56601]
 gi|45601553|gb|AAS71034.1| HtpG [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130]
 gi|353457255|gb|AER01800.1| HtpG [Leptospira interrogans serovar Lai str. IPAV]
 gi|410367098|gb|EKP22486.1| GHKL domain protein [Leptospira interrogans serovar
            Icterohaemorrhagiae str. Verdun LP]
 gi|410432723|gb|EKP77078.1| GHKL domain protein [Leptospira santarosai str. HAI1594]
 gi|455790481|gb|EMF42343.1| GHKL domain protein [Leptospira interrogans serovar Lora str. TE
            1992]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|418709533|ref|ZP_13270319.1| GHKL domain protein [Leptospira interrogans serovar Grippotyphosa
            str. UI 08368]
 gi|418732663|ref|ZP_13290390.1| GHKL domain protein [Leptospira interrogans str. UI 12758]
 gi|410769768|gb|EKR44995.1| GHKL domain protein [Leptospira interrogans serovar Grippotyphosa
            str. UI 08368]
 gi|410773443|gb|EKR53471.1| GHKL domain protein [Leptospira interrogans str. UI 12758]
 gi|456973160|gb|EMG13410.1| GHKL domain protein [Leptospira interrogans serovar Grippotyphosa
            str. LT2186]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|417763687|ref|ZP_12411664.1| GHKL domain protein [Leptospira interrogans str. 2002000624]
 gi|417773990|ref|ZP_12421865.1| GHKL domain protein [Leptospira interrogans str. 2002000621]
 gi|418675150|ref|ZP_13236442.1| GHKL domain protein [Leptospira interrogans str. 2002000623]
 gi|409940506|gb|EKN86146.1| GHKL domain protein [Leptospira interrogans str. 2002000624]
 gi|410576461|gb|EKQ39468.1| GHKL domain protein [Leptospira interrogans str. 2002000621]
 gi|410577722|gb|EKQ45591.1| GHKL domain protein [Leptospira interrogans str. 2002000623]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|217964958|ref|YP_002350636.1| HSP90 family protein [Listeria monocytogenes HCC23]
 gi|386007672|ref|YP_005925950.1| chaperone/heat shock protein [Listeria monocytogenes L99]
 gi|386026265|ref|YP_005947041.1| putative chaperonin (heat shock protein Hsp90) [Listeria
            monocytogenes M7]
 gi|217334228|gb|ACK40022.1| histidine kinase domain protein [Listeria monocytogenes HCC23]
 gi|307570482|emb|CAR83661.1| chaperone/heat shock protein [Listeria monocytogenes L99]
 gi|336022846|gb|AEH91983.1| putative chaperonin (heat shock protein Hsp90) [Listeria
            monocytogenes M7]
          Length = 601

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 30/123 (24%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------T 1231
               L   +  LS  LY +   ++ EL+QNA D I   + + PTL               T
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLTPTLEGKILVSLTGDTNEKT 68

Query: 1232 FILQESGIVVLNNEQGFSAENIRA-LCDVGNSTKKGS------SAGYIGRKGIGFKSVFR 1284
             IL+++GI       G + E + A L  + NS+K         S  +IGR GIG  S F 
Sbjct: 69   LILEDNGI-------GLTEEEVHAFLATIANSSKGEKDFDGERSNDFIGRFGIGLLSCFI 121

Query: 1285 VTD 1287
            V+D
Sbjct: 122  VSD 124


>gi|421122696|ref|ZP_15582979.1| GHKL domain protein [Leptospira interrogans str. Brem 329]
 gi|410344596|gb|EKO95762.1| GHKL domain protein [Leptospira interrogans str. Brem 329]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|71066274|ref|YP_265001.1| heat shock protein 90 [Psychrobacter arcticus 273-4]
 gi|88909149|sp|Q4FQZ1.1|HTPG_PSYA2 RecName: Full=Chaperone protein HtpG; AltName: Full=Heat shock
            protein HtpG; AltName: Full=High temperature protein G
 gi|71039259|gb|AAZ19567.1| probable chaperone protein htpG (Hsp90) [Psychrobacter arcticus
            273-4]
          Length = 656

 Score = 43.1 bits (100), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 56/138 (40%), Gaps = 28/138 (20%)

Query: 1184 LKKQ--HARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEP 1228
            LKK    A + + LH ++  LYS    F+ ELV NA             DD++Y ++ E 
Sbjct: 14   LKKHTFEAEVAQLLHLVTHSLYSNSDIFVRELVSNASDACDKLRFEATNDDSLYEDDGEL 73

Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRK 1275
             +   + E    +   + G       A+ ++G   K G+ A               IG+ 
Sbjct: 74   RIRIAVDEDAKTITFTDNGIGMNEADAIENLGTIAKSGTKAFLDKLSDSQKQDGQLIGQF 133

Query: 1276 GIGFKSVFRVTDAPEIHS 1293
            G+GF S F V D   + +
Sbjct: 134  GVGFYSGFIVADTISVET 151


>gi|423259412|ref|ZP_17240335.1| hypothetical protein HMPREF1055_02612 [Bacteroides fragilis
            CL07T00C01]
 gi|423263614|ref|ZP_17242617.1| hypothetical protein HMPREF1056_00304 [Bacteroides fragilis
            CL07T12C05]
 gi|387776992|gb|EIK39092.1| hypothetical protein HMPREF1055_02612 [Bacteroides fragilis
            CL07T00C01]
 gi|392707036|gb|EIZ00156.1| hypothetical protein HMPREF1056_00304 [Bacteroides fragilis
            CL07T12C05]
          Length = 588

 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +V  +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMVFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR G+G  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGVGLLSCFVVTNEIRVES 124


>gi|418667447|ref|ZP_13228858.1| GHKL domain protein [Leptospira interrogans serovar Pyrogenes str.
            2006006960]
 gi|410756618|gb|EKR18237.1| GHKL domain protein [Leptospira interrogans serovar Pyrogenes str.
            2006006960]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|422809027|ref|ZP_16857438.1| heat shock protein HtpG [Listeria monocytogenes FSL J1-208]
 gi|378752641|gb|EHY63226.1| heat shock protein HtpG [Listeria monocytogenes FSL J1-208]
          Length = 601

 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 54/123 (43%), Gaps = 30/123 (24%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTL---------------T 1231
               L   +  LS  LY +   ++ EL+QNA D I   + + PTL               T
Sbjct: 9    QVNLAGMIDILSNHLYDEKDVYIRELLQNATDAIRARKKLTPTLEGKILVSLTGDTNEKT 68

Query: 1232 FILQESGIVVLNNEQGFSAENIRA-LCDVGNSTKKGS------SAGYIGRKGIGFKSVFR 1284
             IL+++GI       G + E + A L  + NS+K         S  +IGR GIG  S F 
Sbjct: 69   LILEDNGI-------GLTEEEVHAFLATIANSSKGEKDFDGEHSNDFIGRFGIGLLSCFI 121

Query: 1285 VTD 1287
            V+D
Sbjct: 122  VSD 124


>gi|299822814|ref|ZP_07054700.1| histidine kinase domain protein [Listeria grayi DSM 20601]
 gi|299816343|gb|EFI83581.1| histidine kinase domain protein [Listeria grayi DSM 20601]
          Length = 597

 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 56/125 (44%), Gaps = 18/125 (14%)

Query: 1179 MESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEPTLTFIL--- 1234
            ME NM  K    LG  +  LS  LY     ++ EL+QNA D I   +  E +LT  +   
Sbjct: 1    MEENMRFK--VNLGGMIDILSNHLYDSKDVYIRELLQNATDAIRARKKNESSLTGEIAVS 58

Query: 1235 ----QESGIVVL--NNEQGFSAENIRALCDVGNSTKKGSSA------GYIGRKGIGFKSV 1282
                 + G V++  +N  G + E +        S+ KGS         YIGR GIG  S 
Sbjct: 59   LNTSDQGGSVIIFEDNGIGLTTEEVHEFLATIASSSKGSKRFQIEENDYIGRFGIGLLSC 118

Query: 1283 FRVTD 1287
            F V+D
Sbjct: 119  FIVSD 123


>gi|414083827|ref|YP_006992535.1| chaperone/heat shock protein [Carnobacterium maltaromaticum LMA28]
 gi|412997411|emb|CCO11220.1| chaperone/heat shock protein [Carnobacterium maltaromaticum LMA28]
          Length = 497

 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 51/126 (40%), Gaps = 19/126 (15%)

Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV--------EPT 1229
            N E N  K     LG  +  LS  LYSQ   ++ EL+QNA D I    +        E  
Sbjct: 2    NPEENRFK---VNLGGMIEILSNHLYSQKDVYIRELLQNATDAIRGRKLISEDNFVGEIH 58

Query: 1230 LTFILQESGIVVL---NNEQGFSAENIRALCDVGNSTKKG-----SSAGYIGRKGIGFKS 1281
            +   +  SG   L   +N  G + E I        S+ KG         +IGR GIG  S
Sbjct: 59   VDLTIHRSGSSTLFFEDNGIGLTEEEIHQFLATIASSSKGEKNFEEKQDFIGRFGIGLLS 118

Query: 1282 VFRVTD 1287
             F V D
Sbjct: 119  CFIVAD 124


>gi|418689769|ref|ZP_13250888.1| GHKL domain protein [Leptospira interrogans str. FPW2026]
 gi|400360958|gb|EJP16927.1| GHKL domain protein [Leptospira interrogans str. FPW2026]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|345885642|ref|ZP_08837003.1| hypothetical protein HMPREF0666_03179 [Prevotella sp. C561]
 gi|345045035|gb|EGW48984.1| hypothetical protein HMPREF0666_03179 [Prevotella sp. C561]
          Length = 320

 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 1429 PD--VKTTEIALALTLQESNEGNYGPLLYQQ-PVFAFLPLRTYGLKF--ILQGDFVLPSS 1483
            PD     TE A++  ++ + EG+  P+   +  ++A+LP+  +  KF   L  DF+  S 
Sbjct: 85   PDRIASATETAISFAIRLNEEGHVMPIGKDELSLYAYLPMNEHRFKFPFYLNADFIPKSD 144

Query: 1484 REEVDGNSPWNQWLL 1498
            RE +   +PWN +L 
Sbjct: 145  REGIQSENPWNYFLF 159


>gi|21222723|ref|NP_628502.1| hypothetical protein SCO4331 [Streptomyces coelicolor A3(2)]
 gi|8249984|emb|CAB93405.1| hypothetical protein SCD12A.14 [Streptomyces coelicolor A3(2)]
          Length = 1042

 Score = 43.1 bits (100), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 37/77 (48%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L   L++  +V  N      A  + +L  +  S K+ +  G +GR G+GF +V  VTD P
Sbjct: 75   LRLTLRDGVLVAANTGAPLDATGVESLSTLRASAKRDAVDGSVGRFGVGFAAVLAVTDEP 134

Query: 1290 EIHSNGFHVKFDTSEGQ 1306
             +      V++  +E +
Sbjct: 135  AVVGRHGGVRWSLAEAR 151


>gi|334343983|ref|YP_004552535.1| chaperone protein htpG [Sphingobium chlorophenolicum L-1]
 gi|334100605|gb|AEG48029.1| Chaperone protein htpG [Sphingobium chlorophenolicum L-1]
          Length = 626

 Score = 43.1 bits (100), Expect = 1.9,   Method: Composition-based stats.
 Identities = 38/132 (28%), Positives = 57/132 (43%), Gaps = 27/132 (20%)

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEPTLTFILQ 1235
            A + R LH +   +YS    FL EL+ NA             D  +  E+ +P +T  L 
Sbjct: 16   ADVARLLHMMVHSVYSDKDVFLRELISNAADACEKLRYELLSDPALAGEDGQPRITVTLD 75

Query: 1236 ESG--IVVLNNEQGFS----AENI--------RALCDVGNSTKKGSSAGYIGRKGIGFKS 1281
                 ++V +N  G S    AE +        RA  +   + K+G  A  IG+ G+GF S
Sbjct: 76   PDTRQLIVEDNGIGMSEVELAEALGTIARSGTRAFMERVAAAKEGEGAQLIGQFGVGFYS 135

Query: 1282 VFRVTDAPEIHS 1293
             F V D  ++ S
Sbjct: 136  AFMVADRVDVFS 147


>gi|289770094|ref|ZP_06529472.1| conserved hypothetical protein [Streptomyces lividans TK24]
 gi|289700293|gb|EFD67722.1| conserved hypothetical protein [Streptomyces lividans TK24]
          Length = 1047

 Score = 43.1 bits (100), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 37/77 (48%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L   L++  +V  N      A  + +L  +  S K+ +  G +GR G+GF +V  VTD P
Sbjct: 80   LRLTLRDGVLVAANTGAPLDATGVESLSTLRASAKRDAVDGSVGRFGVGFAAVLAVTDEP 139

Query: 1290 EIHSNGFHVKFDTSEGQ 1306
             +      V++  +E +
Sbjct: 140  AVVGRHGGVRWSLAEAR 156


>gi|400288856|ref|ZP_10790888.1| heat shock protein 90 [Psychrobacter sp. PAMC 21119]
          Length = 661

 Score = 43.1 bits (100), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 55/138 (39%), Gaps = 28/138 (20%)

Query: 1184 LKKQ--HARLGRALHCLSQELYSQDSHFLLELVQNA-------------DDNIYPENVEP 1228
            LKK    A + + LH ++  LYS    F+ ELV NA             DD++Y  + E 
Sbjct: 14   LKKHTFEAEVAQLLHLVTHSLYSNADIFVRELVSNASDACDKLRFEGTNDDSLYENDGEL 73

Query: 1229 TLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRK 1275
             +   + E    +   + G       A+ ++G   K G+ A               IG+ 
Sbjct: 74   KIRIAVDEDAKTITFTDNGIGMNEADAIENLGTIAKSGTKAFLDKLSESQKQEGQLIGQF 133

Query: 1276 GIGFKSVFRVTDAPEIHS 1293
            G+GF S F V D   + S
Sbjct: 134  GVGFYSGFIVADTISVES 151


>gi|418724257|ref|ZP_13283077.1| GHKL domain protein [Leptospira interrogans str. UI 12621]
 gi|409962206|gb|EKO25945.1| GHKL domain protein [Leptospira interrogans str. UI 12621]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|417785783|ref|ZP_12433485.1| GHKL domain protein [Leptospira interrogans str. C10069]
 gi|409951124|gb|EKO05641.1| GHKL domain protein [Leptospira interrogans str. C10069]
          Length = 603

 Score = 43.1 bits (100), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 15/120 (12%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  I  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARSYLEPENEGEIHLEIIPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKG---SSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G   S  G+IG+ G+G  S F V+D
Sbjct: 61   KDGTPPTLIFTDNGVGLVESEIHEFLATIGQSSKRGEFQSPKGFIGQFGVGLLSCFIVSD 120


>gi|443626565|ref|ZP_21110981.1| hypothetical protein STVIR_4886 [Streptomyces viridochromogenes
            Tue57]
 gi|443339870|gb|ELS54096.1| hypothetical protein STVIR_4886 [Streptomyces viridochromogenes
            Tue57]
          Length = 1044

 Score = 42.7 bits (99), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 36/77 (46%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L   L+E  +   N      A  + +L  +  S K+ +  G +GR G+GF +V  VTD P
Sbjct: 75   LRLTLREGVLFAANTGAPLDAAGVESLATLRASAKRENQQGAVGRFGVGFAAVLSVTDEP 134

Query: 1290 EIHSNGFHVKFDTSEGQ 1306
             +      V++  +E +
Sbjct: 135  AVVGRHGGVRWSLAEAR 151


>gi|261415453|ref|YP_003249136.1| hypothetical protein Fisuc_1049 [Fibrobacter succinogenes subsp.
            succinogenes S85]
 gi|261371909|gb|ACX74654.1| hypothetical protein Fisuc_1049 [Fibrobacter succinogenes subsp.
            succinogenes S85]
          Length = 2408

 Score = 42.7 bits (99), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 56/121 (46%), Gaps = 20/121 (16%)

Query: 1177 SNMESNMLKKQHARLGRALHCLSQELYSQDSHF--LLELVQNADDNIYPENVEPTLTFI- 1233
            SN E+N+L      L           Y   S F  L EL+QNA D    + V  + T + 
Sbjct: 8    SNYETNILSTIQLNLDN---------YKYGSGFTILKELIQNASDANATKIVIRSFTAVP 58

Query: 1234 ------LQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
                  L+ +GIVV NN + F  ++ + +   G   KK + A  IGR G G KS++ + D
Sbjct: 59   NAHCPLLKRAGIVVYNNGK-FDEDDAKNIAMFGGDNKK-NDATTIGRYGFGLKSIYHLCD 116

Query: 1288 A 1288
            A
Sbjct: 117  A 117


>gi|403415159|emb|CCM01859.1| predicted protein [Fibroporia radiculosa]
          Length = 2665

 Score = 42.7 bits (99), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 50/188 (26%), Positives = 77/188 (40%), Gaps = 23/188 (12%)

Query: 1218 DDNIYPEN--VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
            D  I+P    +EP   F   +    +  N+  FS ++  AL ++  S+KK S    IG+ 
Sbjct: 56   DSRIHPSEHLIEPK--FAKLQGPAFIAYNDATFSDDDWDALQNISRSSKK-SDTSKIGKY 112

Query: 1276 GIGFKSVFRVTDAPEIHSNGFHVKFD-----TSEGQIGFVLPTLVPPFNIDMFCRLLSKD 1330
            GIG +S + +TD  E+ S    V FD       EG   + L  +   F  D  C  LS  
Sbjct: 113  GIGIRSCYHLTDNLEVLSGNSFVIFDPHRKLIDEGGRKWSLADVKDTFG-DQLCPFLSFS 171

Query: 1331 PVQLESKCWNTCIRLPFR-----TKFSEGIAMNNIVDMFSDLHPS-----LLLFLHRLQC 1380
                 S      IRLP R     ++ S  +   + +              ++LFL  ++C
Sbjct: 172  --NAGSTFDGAAIRLPLRQEGHQSRLSSKVPQPDEIRKLLLEFVEEELGVVMLFLSSIKC 229

Query: 1381 IMFRNMLN 1388
            I  R + N
Sbjct: 230  IEIREIDN 237


>gi|384156150|ref|YP_005538965.1| heat shock protein 90 [Arcobacter butzleri ED-1]
 gi|345469704|dbj|BAK71155.1| heat shock protein 90 [Arcobacter butzleri ED-1]
          Length = 636

 Score = 42.7 bits (99), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 33/142 (23%)

Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI--------YPENV------- 1226
            M K Q    +G+ LH ++  LYS    F+ ELV NA D I          EN+       
Sbjct: 1    MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLRLTDENLKDKYADW 60

Query: 1227 --EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AGY 1271
              E  ++F  ++  + +++N  G +  ++  +  +G   K G+              +  
Sbjct: 61   KGEINISFDEKDKSLSIIDNGIGMNEADL--IASIGTIAKSGTKSFVEALTGDAKKDSNL 118

Query: 1272 IGRKGIGFKSVFRVTDAPEIHS 1293
            IG+ G+GF SVF V D  ++ S
Sbjct: 119  IGQFGVGFYSVFMVADKVDVIS 140


>gi|326672128|ref|XP_003199603.1| PREDICTED: hypothetical protein LOC100537175 [Danio rerio]
          Length = 547

 Score = 42.7 bits (99), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 66/143 (46%), Gaps = 8/143 (5%)

Query: 1931 IDWESNELVHLLSL--LTTNVNRQCSKHLLEILDTLWD--DCYTDKIMGFFKSKPTGDDR 1986
            +D + +EL  LL+   L+     Q  + L+++LD  W+  + Y   +         G  R
Sbjct: 1    MDHQCSELHTLLTTDQLSDQEKLQQRRTLIQLLDNNWNSGERYGQLVSAQLLDADGGVVR 60

Query: 1987 SFQSSFINCICDIQW--TISSMDDELHY--PKDLFHDCDAVRSILGPSAPYIVPKVKSEK 2042
            + +SSFI  + +  W  T   M   + Y  P+ ++    AV  +LG    Y+  +  + +
Sbjct: 61   NVRSSFIQMLTEFSWIPTHRPMCSTVSYLQPRSIYIFSTAVNRLLGAHVDYMHSEQTASE 120

Query: 2043 LVCDIGLKTEVTIDDILEILKVW 2065
                IGL+  V ++D++  LK W
Sbjct: 121  FTSAIGLRHSVCVEDMVGYLKRW 143


>gi|313235804|emb|CBY19788.1| unnamed protein product [Oikopleura dioica]
          Length = 664

 Score = 42.7 bits (99), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 44/182 (24%), Positives = 78/182 (42%), Gaps = 40/182 (21%)

Query: 1177 SNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------------PE 1224
            S+ +SN   +  A   + L  +++ LYS+   F+ EL+ N+ D +             P 
Sbjct: 32   SDADSNPTYEFQAETKKLLDIVAKSLYSEVEIFVRELISNSSDALNKRKFEEMAAGGDPP 91

Query: 1225 NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--------------SSAG 1270
             +E  L F    + +++ +N  G S E   A+ ++G   K G              S+  
Sbjct: 92   ALEIKLHFDKSNNKLIIEDNGIGMSQE--EAMTNLGTIAKSGSQDFMTAVKDENSASAQS 149

Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSN------GFHVKFDTSEGQIGFVLPTLVPPFNIDMFC 1324
             IG+ G+GF SVF V D  ++ +       G H +   S G  G+   T+ P   +++  
Sbjct: 150  IIGQFGVGFYSVFMVADGVDVFTRRWNEDVGLHWR---STGDSGY---TIEPKEGLNVGT 203

Query: 1325 RL 1326
            R+
Sbjct: 204  RI 205


>gi|157737782|ref|YP_001490466.1| heat shock protein 90 [Arcobacter butzleri RM4018]
 gi|166918431|sp|A8EV23.1|HTPG_ARCB4 RecName: Full=Chaperone protein HtpG; AltName: Full=Heat shock
            protein HtpG; AltName: Full=High temperature protein G
 gi|157699636|gb|ABV67796.1| heat shock protein 90 HtpG [Arcobacter butzleri RM4018]
          Length = 636

 Score = 42.7 bits (99), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 33/142 (23%)

Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI--------YPENV------- 1226
            M K Q    +G+ LH ++  LYS    F+ ELV NA D I          EN+       
Sbjct: 1    MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLRLTDENLKDKYAQW 60

Query: 1227 --EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AGY 1271
              E  ++F  ++  + +++N  G +  ++  +  +G   K G+              +  
Sbjct: 61   KGEINISFDEKDKSLSIIDNGIGMNEADL--IASIGTIAKSGTKSFVEALTGDAKKDSNL 118

Query: 1272 IGRKGIGFKSVFRVTDAPEIHS 1293
            IG+ G+GF SVF V D  ++ S
Sbjct: 119  IGQFGVGFYSVFMVADKVDVIS 140


>gi|291229801|ref|XP_002734859.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 2190

 Score = 42.7 bits (99), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 87/209 (41%), Gaps = 38/209 (18%)

Query: 1206 DSHFLLELVQNADD------NIYPENVEPTLTFILQES------GIVVLNNEQGFSAENI 1253
            D   L EL+QNADD          +  E   T +L  S        +   N+  F+  + 
Sbjct: 42   DGQILKELIQNADDAGATEVKFLLDTTEYGTTSLLTPSIATYQGAALYCYNDSVFNDADW 101

Query: 1254 RALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKFDT-----SEGQIG 1308
            +++  V  S KK       GR G+GF SV+ +TD P I S    V FD       EG  G
Sbjct: 102  KSIQSVQQSVKKKDPLR-TGRFGLGFVSVYHITDMPSIMSADKIVFFDPFENRFREGYPG 160

Query: 1309 FVLP---TLVPPFNIDM--FCRLLSKDPVQLESKCWN------TCIRLPFRTKFSEG--- 1354
            + +P   +L+  ++     F ++ S+  +    + +N      T +R P R + S+    
Sbjct: 161  WQVPLDKSLLTAYHHQFLPFTKIHSQ--IGFGDQPFNNTFFNGTLMRFPLRKEISKMSSN 218

Query: 1355 -IAMNNIVDMFSDLHPSL---LLFLHRLQ 1379
                 NI DMF      L   LLFL  ++
Sbjct: 219  VFKPENIDDMFESFKEELDVILLFLKSVE 247


>gi|400405574|ref|YP_006588433.1| molecular chaperone of HSP90 family [secondary endosymbiont of
            Ctenarytaina eucalypti]
 gi|400363937|gb|AFP85005.1| molecular chaperone of HSP90 family [secondary endosymbiont of
            Ctenarytaina eucalypti]
          Length = 630

 Score = 42.7 bits (99), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 39/165 (23%), Positives = 64/165 (38%), Gaps = 34/165 (20%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPTLTFILQESGIVV 1241
            LH +   +YS    FL ELV NA D              +Y  + E  L   L +    +
Sbjct: 19   LHLMINSIYSNKEIFLRELVSNASDAADKLRFRALSQPTLYETDSELRLRIALNKEKRTI 78

Query: 1242 LNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRKGIGFKSVFRVTDA 1288
            + ++ G        + ++G   K G+ A               IG+ G+GF SVF V D 
Sbjct: 79   ILSDNGIGMRRAEVIENLGTIAKSGTKAFLDSMKSDHLKEHPLIGQFGVGFYSVFIVADK 138

Query: 1289 PEIHSN--GFHVK---FDTSEGQIGFVLPTLVPP---FNIDMFCR 1325
              + +   G H     F  S G+  + +  ++ P    +I ++ R
Sbjct: 139  VTVRTRAAGAHADEGVFWESAGEGDYTIADIIKPDRGTDITLYLR 183


>gi|313213012|emb|CBY36894.1| unnamed protein product [Oikopleura dioica]
          Length = 664

 Score = 42.7 bits (99), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 44/182 (24%), Positives = 78/182 (42%), Gaps = 40/182 (21%)

Query: 1177 SNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------------PE 1224
            S+ +SN   +  A   + L  +++ LYS+   F+ EL+ N+ D +             P 
Sbjct: 32   SDADSNPTYEFQAETKKLLDIVAKSLYSEVEIFVRELISNSSDALNKRKFEEMAAGGDPP 91

Query: 1225 NVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKG--------------SSAG 1270
             +E  L F    + +++ +N  G S E   A+ ++G   K G              S+  
Sbjct: 92   ALEIKLHFDKSNNKLIIEDNGIGMSQE--EAMTNLGTIAKSGSQDFMTAVKDENSASAQS 149

Query: 1271 YIGRKGIGFKSVFRVTDAPEIHSN------GFHVKFDTSEGQIGFVLPTLVPPFNIDMFC 1324
             IG+ G+GF SVF V D  ++ +       G H +   S G  G+   T+ P   +++  
Sbjct: 150  IIGQFGVGFYSVFMVADGVDVFTRRWNEDVGLHWR---STGDSGY---TIEPKEGLNVGT 203

Query: 1325 RL 1326
            R+
Sbjct: 204  RI 205


>gi|374599692|ref|ZP_09672694.1| HSP90 family protein [Myroides odoratus DSM 2801]
 gi|423324846|ref|ZP_17302687.1| hypothetical protein HMPREF9716_02044 [Myroides odoratimimus CIP
            103059]
 gi|373911162|gb|EHQ43011.1| HSP90 family protein [Myroides odoratus DSM 2801]
 gi|404607820|gb|EKB07321.1| hypothetical protein HMPREF9716_02044 [Myroides odoratimimus CIP
            103059]
          Length = 592

 Score = 42.7 bits (99), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 52/111 (46%), Gaps = 8/111 (7%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNI-----YPENVEPTLTFILQESGIVVLNNEQGFS 1249
            +  LS+ +YS  + F+ EL+QN  D +       E+ +  +T       ++  +N  G S
Sbjct: 17   IELLSEHIYSSPTVFIRELLQNGVDAVTVRKGIDESFQGKITLFFDNDQLIFQDNGVGLS 76

Query: 1250 AENIRALCDV-GNSTKKGSSA--GYIGRKGIGFKSVFRVTDAPEIHSNGFH 1297
             E++     V G S+K+G  A    IGR GIG  S   V++   + S   +
Sbjct: 77   LEDMHQFLSVIGQSSKRGELAEKDLIGRFGIGLLSCLVVSEEIVVESRSLY 127


>gi|325168888|ref|YP_004280678.1| heat shock chaperone protein [Agrobacterium sp. H13-3]
 gi|325064611|gb|ADY68300.1| heat shock chaperone protein [Agrobacterium sp. H13-3]
          Length = 630

 Score = 42.4 bits (98), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 58/139 (41%), Gaps = 30/139 (21%)

Query: 1186 KQH---ARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
            +QH   A + R LH +   +YS    FL EL+ NA D              +  + ++P 
Sbjct: 10   EQHVFEADVNRLLHMMVHSVYSDKDVFLRELISNAADACEKLRYESISKPQLSADGIQPG 69

Query: 1230 L---------TFILQESGIVVLNNEQ-----GFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
            +         T ++++SGI +  +E        +    +A  D   + K G  A  IG+ 
Sbjct: 70   ILVTLNEEAQTLVVEDSGIGMDRDEMIDALGTIARSGTKAFMDRLEAAKAGEKAQLIGQF 129

Query: 1276 GIGFKSVFRVTDAPEIHSN 1294
            G+GF S F V    E+ S 
Sbjct: 130  GVGFYSSFMVAHRVEVISR 148


>gi|300771857|ref|ZP_07081728.1| chaperone HtpG [Sphingobacterium spiritivorum ATCC 33861]
 gi|300761243|gb|EFK58068.1| chaperone HtpG [Sphingobacterium spiritivorum ATCC 33861]
          Length = 591

 Score = 42.4 bits (98), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVLNNEQGFS 1249
            +  LS+ LYS  + F+ EL+QNA D I       E+ +  +T  L++  ++  +N  G +
Sbjct: 17   ISLLSEHLYSNPNTFVRELLQNAVDAITAFQNLDESHQGKITVRLEDKKMIFEDNGIGLT 76

Query: 1250 -AENIRALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTD 1287
             AE  + +  +G S+K+ S  +  +IG+ GIG  S F V++
Sbjct: 77   EAEIHKFIAVIGESSKRDSIDAQDFIGKFGIGLLSCFVVSN 117


>gi|156359821|ref|XP_001624963.1| predicted protein [Nematostella vectensis]
 gi|156211771|gb|EDO32863.1| predicted protein [Nematostella vectensis]
          Length = 2047

 Score = 42.4 bits (98), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 55/129 (42%), Gaps = 30/129 (23%)

Query: 1185 KKQHARLGRALHCLSQELYSQDSHFLLELVQNADD------------------NIYPENV 1226
            +K   RL R L     E Y  D   L EL+QNADD                   I+  + 
Sbjct: 45   EKLTTRLKRIL-----EGYPCDGAILKELLQNADDAKASKICFIKDLRRHPSERIFEHSW 99

Query: 1227 EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVT 1286
            +P     LQ   + V N+   F+ ++I  +  +G  +K G      G+ G+GF +V+ +T
Sbjct: 100  KP-----LQGPALCVYNDSP-FTTQDIEGIQRLGEGSK-GDDPNKTGQYGVGFNAVYHMT 152

Query: 1287 DAPEIHSNG 1295
            D P   S G
Sbjct: 153  DCPSFVSKG 161


>gi|227540135|ref|ZP_03970184.1| heat-shock protein [Sphingobacterium spiritivorum ATCC 33300]
 gi|227239996|gb|EEI90011.1| heat-shock protein [Sphingobacterium spiritivorum ATCC 33300]
          Length = 591

 Score = 42.4 bits (98), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 56/101 (55%), Gaps = 8/101 (7%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVLNNEQGFS 1249
            +  LS+ LYS  + F+ EL+QNA D I       E+ +  +T  L++  ++  +N  G +
Sbjct: 17   ISLLSEHLYSNPNTFVRELLQNAVDAITAFQNLDESHQGKITVRLEDKKMIFEDNGIGLT 76

Query: 1250 -AENIRALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTD 1287
             AE  + +  +G S+K+ S  +  +IG+ GIG  S F V++
Sbjct: 77   EAEIHKFIAVIGESSKRDSIDAQDFIGKFGIGLLSCFVVSN 117


>gi|418409404|ref|ZP_12982716.1| heat shock protein 90 [Agrobacterium tumefaciens 5A]
 gi|358004043|gb|EHJ96372.1| heat shock protein 90 [Agrobacterium tumefaciens 5A]
          Length = 630

 Score = 42.4 bits (98), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 58/139 (41%), Gaps = 30/139 (21%)

Query: 1186 KQH---ARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPT 1229
            +QH   A + R LH +   +YS    FL EL+ NA D              +  + ++P 
Sbjct: 10   EQHVFEADVSRLLHMMVHSVYSDKDVFLRELISNAADACEKLRYESISNPQLSADGIQPG 69

Query: 1230 L---------TFILQESGIVVLNNEQ-----GFSAENIRALCDVGNSTKKGSSAGYIGRK 1275
            +         T ++++SGI +  +E        +    +A  D   + K G  A  IG+ 
Sbjct: 70   ILVTLNEEAQTLVVEDSGIGMDRDEMIDALGTIARSGTKAFMDRLEAAKAGEKAQLIGQF 129

Query: 1276 GIGFKSVFRVTDAPEIHSN 1294
            G+GF S F V    E+ S 
Sbjct: 130  GVGFYSSFMVAHRVEVISR 148


>gi|242280887|ref|YP_002993016.1| heat shock protein 90 [Desulfovibrio salexigens DSM 2638]
 gi|242123781|gb|ACS81477.1| heat shock protein Hsp90 [Desulfovibrio salexigens DSM 2638]
          Length = 632

 Score = 42.4 bits (98), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 35/133 (26%), Positives = 50/133 (37%), Gaps = 25/133 (18%)

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADD---------NIYPE---NVEPTLTFILQE 1236
            A + + L  L   LY+    FL ELV NA D         N  PE    VEP +     E
Sbjct: 12   AEVNQLLDILVHSLYTNREIFLRELVSNASDALDKMRFAINSNPELEDEVEPEILIAYDE 71

Query: 1237 SGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRKGIGFKSVF 1283
                V   + G        + ++G     GS+                IGR G+GF S+F
Sbjct: 72   EKKTVTVTDTGIGMTREEVMTNIGTIAHSGSAEFVKQAAESKESLDSLIGRFGVGFYSIF 131

Query: 1284 RVTDAPEIHSNGF 1296
             V+D   + +  +
Sbjct: 132  MVSDHVVVRTKSY 144


>gi|421088019|ref|ZP_15548848.1| GHKL domain protein [Leptospira kirschneri str. 200802841]
 gi|410003275|gb|EKO53720.1| GHKL domain protein [Leptospira kirschneri str. 200802841]
          Length = 600

 Score = 42.4 bits (98), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 12/117 (10%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  +  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARAYLEPENKGEIHLEVVPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G    +IG+ G+G  S F V+D
Sbjct: 61   KDGGSPTLIFTDNGIGLVESEIHEFLATIGQSSKRGLQKDFIGQFGVGLLSCFIVSD 117


>gi|347532187|ref|YP_004838950.1| Electron transfer flavoprotein subunit alpha [Roseburia hominis
           A2-183]
 gi|345502335|gb|AEN97018.1| Electron transfer flavoprotein alpha subunit [Roseburia hominis
           A2-183]
          Length = 302

 Score = 42.4 bits (98), Expect = 3.0,   Method: Composition-based stats.
 Identities = 23/80 (28%), Positives = 42/80 (52%)

Query: 764 LISFKIEDKGSMESFLETVGKYRNEVMSKCVLFSETILGTRLSGDLSVHEENSLLETTSA 823
           +++  +E+ GS E F + + +   +  + CVL   TILG  L+G LSV+   ++  T S+
Sbjct: 46  ILTCPLEESGSPEDFAKAIAEEILKSDAACVLVGNTILGRALAGKLSVYLNTAVFATISS 105

Query: 824 ITHTGLRPKMSESVTSKDAI 843
           +T    +P    +V    A+
Sbjct: 106 LTSEDGKPCFERTVFGGTAV 125


>gi|256833671|ref|YP_003162398.1| ATPase, histidine kinase-, DNA gyrase B-, and HSP90-like
            domain-containing protein [Jonesia denitrificans DSM
            20603]
 gi|256687202|gb|ACV10095.1| ATPase, histidine kinase-, DNA gyrase B-, and HSP90-like domain
            protein [Jonesia denitrificans DSM 20603]
          Length = 621

 Score = 42.4 bits (98), Expect = 3.0,   Method: Composition-based stats.
 Identities = 36/116 (31%), Positives = 50/116 (43%), Gaps = 13/116 (11%)

Query: 1191 LGRALHCLSQELYSQDSHFLLELVQNADDNIYPEN-----VEPTLTFILQESGIVVLNNE 1245
            L   +  LS+ +YS    +L EL+QN  D I   +       P +T      G   L ++
Sbjct: 9    LSGVIDLLSRHIYSSPRVYLRELIQNGIDAISAAHDTTPITNPQITITPARHGEPFLFHD 68

Query: 1246 QGF---SAENIRALCDVGNSTKKGSSAG-----YIGRKGIGFKSVFRVTDAPEIHS 1293
             G    + E    L  VG S+K+    G     Y+G+ GIG  S F VTD   I S
Sbjct: 69   NGIGLTATEAAELLATVGRSSKRDPELGFRRDNYLGQFGIGLLSCFMVTDNIRIIS 124


>gi|418697545|ref|ZP_13258536.1| GHKL domain protein [Leptospira kirschneri str. H1]
 gi|409954557|gb|EKO13507.1| GHKL domain protein [Leptospira kirschneri str. H1]
          Length = 600

 Score = 42.4 bits (98), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 12/117 (10%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  +  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARAYLEPENKGEIHLEVVPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G    +IG+ G+G  S F V+D
Sbjct: 61   KDGGSPTLIFTDNGIGLVESEIHEFLATIGQSSKRGLQKDFIGQFGVGLLSCFIVSD 117


>gi|398338615|ref|ZP_10523318.1| heat shock protein 90 [Leptospira kirschneri serovar Bim str. 1051]
 gi|418675785|ref|ZP_13237071.1| GHKL domain protein [Leptospira kirschneri serovar Grippotyphosa str.
            RM52]
 gi|418686140|ref|ZP_13247309.1| GHKL domain protein [Leptospira kirschneri serovar Grippotyphosa str.
            Moskva]
 gi|418739204|ref|ZP_13295592.1| GHKL domain protein [Leptospira kirschneri serovar Valbuzzi str.
            200702274]
 gi|400323550|gb|EJO71398.1| GHKL domain protein [Leptospira kirschneri serovar Grippotyphosa str.
            RM52]
 gi|410739094|gb|EKQ83823.1| GHKL domain protein [Leptospira kirschneri serovar Grippotyphosa str.
            Moskva]
 gi|410753456|gb|EKR10421.1| GHKL domain protein [Leptospira kirschneri serovar Valbuzzi str.
            200702274]
          Length = 600

 Score = 42.4 bits (98), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 12/117 (10%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  +  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARAYLEPENKGEIHLEVVPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G    +IG+ G+G  S F V+D
Sbjct: 61   KDGGSPTLIFTDNGIGLVESEIHEFLATIGQSSKRGLQKDFIGQFGVGLLSCFIVSD 117


>gi|421131131|ref|ZP_15591316.1| GHKL domain protein [Leptospira kirschneri str. 2008720114]
 gi|410357497|gb|EKP04747.1| GHKL domain protein [Leptospira kirschneri str. 2008720114]
          Length = 600

 Score = 42.4 bits (98), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 54/117 (46%), Gaps = 12/117 (10%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIY------PENV-EPTLTFILQ 1235
            M  K    L   ++ LS+ LYS    F+ EL+QN  D I       PEN  E  L  +  
Sbjct: 1    MSHKFQVNLRGIINLLSEHLYSGPQVFVRELLQNGVDAIQARAYLEPENKGEIHLEVVPG 60

Query: 1236 ESG----IVVLNNEQGFSAENIRA-LCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTD 1287
            + G    ++  +N  G     I   L  +G S+K+G    +IG+ G+G  S F V+D
Sbjct: 61   KDGGSPTLIFTDNGIGLVESEIHEFLATIGQSSKRGLQKDFIGQFGVGLLSCFIVSD 117


>gi|322786202|gb|EFZ12807.1| hypothetical protein SINV_11884 [Solenopsis invicta]
          Length = 633

 Score = 42.4 bits (98), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 14/116 (12%)

Query: 1192 GRALHCLSQELYSQDSHFLLELVQNADDNIY------PENVEPT----LTFILQESG--I 1239
             R +  +   LYS    FL EL+ N+ D +       P+ ++ T    +  +L E    +
Sbjct: 10   SRFMSRVMSSLYSNSDIFLRELISNSSDALDKFSHKPPKEIDNTEELCIRIVLNEKNRTL 69

Query: 1240 VVLNNEQGFSAENIRALCDV--GNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             +++N  G +   I     +   + TKK  +  YIG+ GIGF SVF V D   + S
Sbjct: 70   TIIDNGIGMNRSQITNYLGIMARSGTKKAENKNYIGQFGIGFYSVFLVADKVTVTS 125


>gi|296138562|ref|YP_003645805.1| hypothetical protein [Tsukamurella paurometabola DSM 20162]
 gi|296026696|gb|ADG77466.1| conserved hypothetical protein [Tsukamurella paurometabola DSM 20162]
          Length = 847

 Score = 42.4 bits (98), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 2/74 (2%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L+      G+ V N     SA  +R+L  +  S K   +A  +GR G+GF +V  VTD  
Sbjct: 70   LSVWAAADGVHVANTGAPLSAAGVRSLAALRVSPKSDGAA--VGRFGVGFAAVLTVTDHV 127

Query: 1290 EIHSNGFHVKFDTS 1303
            +I S G  ++F  +
Sbjct: 128  QIRSAGGGIEFSAA 141


>gi|423282496|ref|ZP_17261381.1| hypothetical protein HMPREF1204_00919 [Bacteroides fragilis HMW 615]
 gi|404582064|gb|EKA86759.1| hypothetical protein HMPREF1204_00919 [Bacteroides fragilis HMW 615]
          Length = 587

 Score = 42.4 bits (98), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 9/105 (8%)

Query: 1198 LSQELYSQDSHFLLELVQNADDNIYP-----ENVEPTLTFILQESGIVVL-NNEQGFSAE 1251
            LS+ +YS  + F+ EL+QN+ D I       EN    +   L   G +   +N  G   E
Sbjct: 20   LSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVFLNGDGSMAFQDNGIGLKEE 79

Query: 1252 NI-RALCDVGNSTKKGS--SAGYIGRKGIGFKSVFRVTDAPEIHS 1293
             + R L  +G S+K+ +  +  +IGR GIG  S F VT+   + S
Sbjct: 80   EVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEIRVES 124


>gi|241766434|ref|ZP_04764308.1| heat shock protein Hsp90 [Acidovorax delafieldii 2AN]
 gi|241363376|gb|EER58887.1| heat shock protein Hsp90 [Acidovorax delafieldii 2AN]
          Length = 657

 Score = 42.0 bits (97), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 43/163 (26%), Positives = 66/163 (40%), Gaps = 35/163 (21%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPE--NVEPTLTF 1232
             A + + LH ++  LYS    FL EL+ NA D              +Y +  N+E  ++F
Sbjct: 9    QAEVAQLLHLVTHSLYSNQEIFLRELISNASDACDKLRFEALNNNALYEDAPNLEVRVSF 68

Query: 1233 ILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-------------SAGYIGRKGIGF 1279
                  + + +N  G SA+   A+  +G   K G+              A  IG+ G+GF
Sbjct: 69   DKDARTLTIQDNGIGMSAQE--AIDHLGTIAKSGTKDFMGKLTGDQKQDAQLIGQFGVGF 126

Query: 1280 KSVFRVTDAPEIHSNGFHVKFD-----TSEGQIGFVLPTLVPP 1317
             S F V D   + S    +K D      S G   F + T+  P
Sbjct: 127  YSGFIVADKITVESRRAGLKADEGVRWVSGGAGDFEVETITRP 169


>gi|386840873|ref|YP_006245931.1| hypothetical protein SHJG_4789 [Streptomyces hygroscopicus subsp.
            jinggangensis 5008]
 gi|374101174|gb|AEY90058.1| hypothetical protein SHJG_4789 [Streptomyces hygroscopicus subsp.
            jinggangensis 5008]
 gi|451794167|gb|AGF64216.1| hypothetical protein SHJGH_4552 [Streptomyces hygroscopicus subsp.
            jinggangensis TL01]
          Length = 1042

 Score = 42.0 bits (97), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 2/79 (2%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA--GYIGRKGIGFKSVFRVTD 1287
            L   L++  +V  N      A  + +L  +  S K+ + A  G +GR G+GF +V  VTD
Sbjct: 75   LRLTLRDGVLVAANTGAPLDAAGVESLSTLRASAKRDAEATPGAVGRFGVGFAAVLAVTD 134

Query: 1288 APEIHSNGFHVKFDTSEGQ 1306
             P I      V++  +E +
Sbjct: 135  EPAIVGRHGGVRWSLAEAR 153


>gi|315637563|ref|ZP_07892770.1| chaperone HtpG [Arcobacter butzleri JV22]
 gi|315478154|gb|EFU68880.1| chaperone HtpG [Arcobacter butzleri JV22]
          Length = 527

 Score = 42.0 bits (97), Expect = 3.6,   Method: Composition-based stats.
 Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 33/142 (23%)

Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI--------YPENV------- 1226
            M K Q    +G+ LH ++  LYS    F+ ELV NA D I          EN+       
Sbjct: 1    MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLRLTDENLKDKYADW 60

Query: 1227 --EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AGY 1271
              E  ++F  ++  + +++N  G +  ++  +  +G   K G+              +  
Sbjct: 61   KGEINISFDEKDKSLSIIDNGIGMNEADL--IASIGTIAKSGTKSFVEALTGDAKKDSNL 118

Query: 1272 IGRKGIGFKSVFRVTDAPEIHS 1293
            IG+ G+GF SVF V D  ++ S
Sbjct: 119  IGQFGVGFYSVFMVADKVDVIS 140


>gi|76802578|ref|YP_327586.1| hypothetical protein NP3886A [Natronomonas pharaonis DSM 2160]
 gi|76558443|emb|CAI50034.1| hypothetical protein NP_3886A [Natronomonas pharaonis DSM 2160]
          Length = 1999

 Score = 42.0 bits (97), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 3/85 (3%)

Query: 1210 LLELVQNADDNIYPENVEPTLTFI-LQESGIVVLNNEQGFS--AENIRALCDVGNSTKKG 1266
            +LEL+QNA D I   + E    ++ + + G++V N    F    E +     +   T KG
Sbjct: 59   ILELLQNARDAIRQGDAEQGRVYVGVYDEGVLVANTGSRFDLFDEQVEDAVTMIGETGKG 118

Query: 1267 SSAGYIGRKGIGFKSVFRVTDAPEI 1291
                 IG KG+G KS+    D+ EI
Sbjct: 119  DDDQSIGHKGVGLKSILATGDSFEI 143


>gi|296447419|ref|ZP_06889344.1| Heat shock protein Hsp90-like protein [Methylosinus trichosporium
            OB3b]
 gi|296255039|gb|EFH02141.1| Heat shock protein Hsp90-like protein [Methylosinus trichosporium
            OB3b]
          Length = 655

 Score = 42.0 bits (97), Expect = 3.7,   Method: Composition-based stats.
 Identities = 40/131 (30%), Positives = 57/131 (43%), Gaps = 27/131 (20%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNI-------------YPENVEPTLTFIL 1234
             A + R L  ++  +YS+   FL ELV NA D I               +   P +T  +
Sbjct: 38   QADVARLLELMTHSVYSERDIFLRELVSNAADAIEKLRHDALSDEALAAKAAAPLVTISI 97

Query: 1235 QESG--IVVLNNEQGFSAEN-IRALCDVGNS---------TKKG--SSAGYIGRKGIGFK 1280
             +    + V +N  G S E+ + AL  + +S          KKG   SA  IGR GIGF 
Sbjct: 98   DKDARTLTVADNGVGMSREDLVAALGTIASSGTRAFLDDLEKKGGEKSASLIGRFGIGFY 157

Query: 1281 SVFRVTDAPEI 1291
            S F V +  E+
Sbjct: 158  SAFMVAEKVEV 168


>gi|313674537|ref|YP_004052533.1| hypothetical protein Ftrac_0419 [Marivirga tractuosa DSM 4126]
 gi|312941235|gb|ADR20425.1| hypothetical protein Ftrac_0419 [Marivirga tractuosa DSM 4126]
          Length = 468

 Score = 42.0 bits (97), Expect = 3.9,   Method: Composition-based stats.
 Identities = 50/219 (22%), Positives = 92/219 (42%), Gaps = 11/219 (5%)

Query: 1193 RALHCLSQELYSQDSHFLLELVQNADDNIYPEN-VEPTLTFILQESGIVVLNNEQGFSAE 1251
            + LH L + + +    +  EL+QNA D++  EN  + ++   L +  IV  +N + F+  
Sbjct: 23   KRLHDLEKTVETNHGRWAWELLQNAKDSVATENDRKVSVQIKLYDGKIVFKHNGRRFTET 82

Query: 1252 NIRALCDVGNST--KKGSSAGYIGRKGIGFKSVFRVTDAPEIH-----SNGFHVKFDTSE 1304
            ++R + +  +S   ++       GR G GF +   ++    I       NG   KF    
Sbjct: 83   DVRGIINQISSKEIEEDEETKQTGRFGTGFLTTHILSKVITIQGILETENGDFYKFKFPL 142

Query: 1305 GQIGFVLPTLVPPFN--IDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNNIVD 1362
             + G     LVP        F   ++K     +   +NT       ++F + IA   I +
Sbjct: 143  DREGKTTTQLVPKIEKAWSSFHESVTKITQDYDRNDFNTSFTYVLDSEFQQAIAEKGIKE 202

Query: 1363 MFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIVGD 1401
             FS L P +L F+  ++ +   N + ++ V    K   D
Sbjct: 203  -FSQLIPYVLTFIPAIEDVTIINKVKETEVTYNTKSHAD 240


>gi|260785782|ref|XP_002587939.1| hypothetical protein BRAFLDRAFT_87327 [Branchiostoma floridae]
 gi|229273094|gb|EEN43950.1| hypothetical protein BRAFLDRAFT_87327 [Branchiostoma floridae]
          Length = 857

 Score = 42.0 bits (97), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 38/72 (52%), Gaps = 4/72 (5%)

Query: 35  NPNFAFQNPNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQP 94
           NP+   +NP++   NP LQ      ++  +NP    +NP+L  QN++L  QNP L P  P
Sbjct: 122 NPDLQPENPDLQPENPDLQPEN--PDLQPENPGLQPENPDLQPQNSDLQPQNPELQPGDP 179

Query: 95  P--SSACNQQRS 104
              S   N +R+
Sbjct: 180 EIQSGQSNAERA 191


>gi|156359811|ref|XP_001624958.1| predicted protein [Nematostella vectensis]
 gi|156211766|gb|EDO32858.1| predicted protein [Nematostella vectensis]
          Length = 216

 Score = 42.0 bits (97), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 21/113 (18%)

Query: 1198 LSQELYSQDSHFLLELVQNADDN-------IYPENVEPTLTFI------LQESGIVVLNN 1244
            +S+   +   H + EL+QNA+D        +Y E+   T +         Q   +   NN
Sbjct: 1    MSKHGVNTQCHCVQELIQNAEDAEAREVKVLYDEHSYGTHSLFKPELAKFQGPALYAYNN 60

Query: 1245 EQGFSAEN---IRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
             + F  ++   +R LCD    + K      +GR G+GFKSVF +TD P + SN
Sbjct: 61   AR-FKRDDWNGLRMLCD----SIKVKDPMKVGRFGLGFKSVFHITDLPSVISN 108


>gi|440223583|ref|YP_007336979.1| chaperone protein HtpG [Rhizobium tropici CIAT 899]
 gi|440042455|gb|AGB74433.1| chaperone protein HtpG [Rhizobium tropici CIAT 899]
          Length = 629

 Score = 42.0 bits (97), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 39/149 (26%), Positives = 59/149 (39%), Gaps = 30/149 (20%)

Query: 1176 ISNMESNMLKKQH---ARLGRALHCLSQELYSQDSHFLLELVQNADDNI----YPENVEP 1228
            ++   + M  + H   A + R LH +   +YS    FL EL+ NA D      Y    +P
Sbjct: 1    MTTTAAEMTAESHVFEADVARLLHMMVHSVYSDKDVFLRELISNAADACEKLRYEAIAQP 60

Query: 1229 TL-----------TFILQESGIVVLNNEQGFSAENI------------RALCDVGNSTKK 1265
             L           T       +VV +N  G S + +            RA  +   + K+
Sbjct: 61   GLLANDPDSRILLTLDADGGKLVVEDNGIGMSRDEMIEALGTIARSGTRAFMERIEANKQ 120

Query: 1266 GSSAGYIGRKGIGFKSVFRVTDAPEIHSN 1294
            G  A  IG+ G+GF S F V D  ++ S 
Sbjct: 121  GEGAQLIGQFGVGFYSCFMVADRVDVISR 149


>gi|242071031|ref|XP_002450792.1| hypothetical protein SORBIDRAFT_05g018550 [Sorghum bicolor]
 gi|241936635|gb|EES09780.1| hypothetical protein SORBIDRAFT_05g018550 [Sorghum bicolor]
          Length = 68

 Score = 42.0 bits (97), Expect = 4.4,   Method: Composition-based stats.
 Identities = 25/55 (45%), Positives = 36/55 (65%), Gaps = 7/55 (12%)

Query: 1162 VESIRRDEFGLGPNISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQN 1216
            VE IRR+ + +G      E N L +    + +A++ LSQELYS+D HFL+ELVQ+
Sbjct: 17   VERIRRERYYIGRG----EQNPLAED---MHQAVNYLSQELYSKDVHFLMELVQS 64


>gi|436841092|ref|YP_007325470.1| Chaperone protein htpG [Desulfovibrio hydrothermalis AM13 = DSM
            14728]
 gi|432169998|emb|CCO23369.1| Chaperone protein htpG [Desulfovibrio hydrothermalis AM13 = DSM
            14728]
          Length = 632

 Score = 41.6 bits (96), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 37/155 (23%), Positives = 60/155 (38%), Gaps = 29/155 (18%)

Query: 1189 ARLGRALHCLSQELYSQDSHFLLELVQNADDNIY---------PE---NVEPTLTFILQE 1236
            A + + L  L   LY+    FL EL+ NA D +          PE   +VEP +     E
Sbjct: 12   AEVNQLLDILIHSLYTNREIFLRELISNASDALDKMRFAISCDPELNDDVEPEILIAYDE 71

Query: 1237 SGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA-------------GYIGRKGIGFKSVF 1283
                +   + G        + ++G     GS+                IGR G+GF S+F
Sbjct: 72   EKKTITVTDTGIGMTRDEVVANIGTIAHSGSAEFIKQAAESKESLDSLIGRFGVGFYSIF 131

Query: 1284 RVTDAPEIHSNGF----HVKFDTSEGQIGFVLPTL 1314
             V+D   + +  +    H     S+G+  + L T+
Sbjct: 132  MVSDHVVVRTKSYKDADHATQWISDGKNSYELSTI 166


>gi|290959492|ref|YP_003490674.1| hypothetical protein SCAB_50881 [Streptomyces scabiei 87.22]
 gi|260649018|emb|CBG72132.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
          Length = 1047

 Score = 41.6 bits (96), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 42/201 (20%), Positives = 82/201 (40%), Gaps = 15/201 (7%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSA--GYIGRKGIGFKSVFRVTD 1287
            L   L++  +V  N      A  + +L  +  S K+ +    G +GR G+GF +V  VTD
Sbjct: 75   LRLTLRDGVLVAANTGAPLDAGGVESLSTLRASAKRDAQGHDGAVGRFGVGFAAVLAVTD 134

Query: 1288 APEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRL-----LSKDPVQLES---KCW 1339
             P +      +++  +E ++         P   D   R      L + P   E    + +
Sbjct: 135  EPAVVGRHGGIRWSLAEARLLAAETARHSPGLGDEIRRRDGHVPLLRLPFAAEGTAPESY 194

Query: 1340 NTCIRLPFRTKFSEGIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNMLNDSLVVIRKKIV 1399
            +T + LP R   +  +A   ++    D   +LLL L  L  ++   +  D +  +R+ + 
Sbjct: 195  DTVVILPLRDTAAADLA-ERLLHAVDD---ALLLALPGLDEVVI-EIGADEVRTVRRSVE 249

Query: 1400 GDGIIKVSCGEDKMTWFVASQ 1420
            G  ++     +    W  A++
Sbjct: 250  GAHVVVEDSRDGVTRWRTAAE 270


>gi|45443521|ref|NP_995060.1| hypothetical protein YP_3789 [Yersinia pestis biovar Microtus str.
            91001]
 gi|145600493|ref|YP_001164569.1| hypothetical protein YPDSF_3239 [Yersinia pestis Pestoides F]
 gi|162420877|ref|YP_001605784.1| hypothetical protein YpAngola_A1243 [Yersinia pestis Angola]
 gi|229837034|ref|ZP_04457199.1| DNA mismatch repair enzyme [Yersinia pestis Pestoides A]
 gi|45438390|gb|AAS63937.1| DNA mismatch repair enzyme (predicted ATPase) [Yersinia pestis biovar
            Microtus str. 91001]
 gi|145212189|gb|ABP41596.1| hypothetical protein YPDSF_3239 [Yersinia pestis Pestoides F]
 gi|162353692|gb|ABX87640.1| DNA mismatch repair enzyme [Yersinia pestis Angola]
 gi|229705977|gb|EEO91986.1| DNA mismatch repair enzyme [Yersinia pestis Pestoides A]
          Length = 240

 Score = 41.6 bits (96), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 14/108 (12%)

Query: 1192 GRALHCLSQELYSQDSHFLL-ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            G+ +  LSQ++ S  S F L ELV+N+ D   P+    T+T I  E  I++ +   G S 
Sbjct: 10   GKIIEELSQKIPS--SLFALNELVKNSYDAFSPD---VTITVIPSELKIIISDYGNGMSV 64

Query: 1251 ENIRALCDVGNSTKK--------GSSAGYIGRKGIGFKSVFRVTDAPE 1290
            + I +L  +  STKK        G      G KG+GF S F+  D  E
Sbjct: 65   DEIHSLFHISKSTKKYGCEVSQNGIKRIVQGSKGLGFLSAFKFGDKVE 112


>gi|392531652|ref|ZP_10278789.1| HSP90 family protein [Carnobacterium maltaromaticum ATCC 35586]
          Length = 605

 Score = 41.6 bits (96), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 51/126 (40%), Gaps = 19/126 (15%)

Query: 1178 NMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENV--------EPT 1229
            N E N  K     LG  +  LS  LYSQ   ++ EL+QNA D I    +        E  
Sbjct: 2    NPEENRFK---VNLGGMIEILSNHLYSQKDVYIRELLQNATDAIRGRKLISEDNFVGEIH 58

Query: 1230 LTFILQESGIVVL---NNEQGFSAENIRALCDVGNSTKKG-----SSAGYIGRKGIGFKS 1281
            +   +  SG   L   +N  G + + I        S+ KG         +IGR GIG  S
Sbjct: 59   VDLTIHRSGSSTLFFEDNGIGLTEDEIHQFLATIASSSKGEKNFEEKQDFIGRFGIGLLS 118

Query: 1282 VFRVTD 1287
             F V D
Sbjct: 119  CFIVAD 124


>gi|22127662|ref|NP_671085.1| hypothetical protein y3791 [Yersinia pestis KIM10+]
 gi|108809882|ref|YP_653798.1| hypothetical protein YPA_3892 [Yersinia pestis Antiqua]
 gi|108810429|ref|YP_646196.1| hypothetical protein YPN_0263 [Yersinia pestis Nepal516]
 gi|150260565|ref|ZP_01917293.1| hypothetical protein YPE_2877 [Yersinia pestis CA88-4125]
 gi|165927293|ref|ZP_02223125.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            F1991016]
 gi|165936424|ref|ZP_02224992.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            IP275]
 gi|166008913|ref|ZP_02229811.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
            E1979001]
 gi|166211933|ref|ZP_02237968.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
            B42003004]
 gi|167400522|ref|ZP_02306031.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
            UG05-0454]
 gi|167419396|ref|ZP_02311149.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            MG05-1020]
 gi|167426097|ref|ZP_02317850.1| DNA mismatch repair enzyme [Yersinia pestis biovar Mediaevalis str.
            K1973002]
 gi|167467846|ref|ZP_02332550.1| hypothetical protein YpesF_08094 [Yersinia pestis FV-1]
 gi|218927594|ref|YP_002345469.1| hypothetical protein YPO0392 [Yersinia pestis CO92]
 gi|229840268|ref|ZP_04460427.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            PEXU2]
 gi|229842346|ref|ZP_04462501.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            India 195]
 gi|229900611|ref|ZP_04515735.1| DNA mismatch repair enzyme [Yersinia pestis Nepal516]
 gi|270488028|ref|ZP_06205102.1| conserved hypothetical protein [Yersinia pestis KIM D27]
 gi|294502497|ref|YP_003566559.1| hypothetical protein YPZ3_0387 [Yersinia pestis Z176003]
 gi|384120932|ref|YP_005503552.1| hypothetical protein YPD4_0340 [Yersinia pestis D106004]
 gi|384124810|ref|YP_005507424.1| hypothetical protein YPD8_0341 [Yersinia pestis D182038]
 gi|384137625|ref|YP_005520327.1| hypothetical protein A1122_03155 [Yersinia pestis A1122]
 gi|384416547|ref|YP_005625909.1| DNA mismatch repair protein [Yersinia pestis biovar Medievalis str.
            Harbin 35]
 gi|420544862|ref|ZP_15043047.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-01]
 gi|420549649|ref|ZP_15047333.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-02]
 gi|420550157|ref|ZP_15047782.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-02]
 gi|420555603|ref|ZP_15052630.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-03]
 gi|420561253|ref|ZP_15057546.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-04]
 gi|420566294|ref|ZP_15062094.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-05]
 gi|420571920|ref|ZP_15067207.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-06]
 gi|420577270|ref|ZP_15072041.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-07]
 gi|420582606|ref|ZP_15076902.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-08]
 gi|420587744|ref|ZP_15081542.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-09]
 gi|420593089|ref|ZP_15086352.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-10]
 gi|420598742|ref|ZP_15091417.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-11]
 gi|420604273|ref|ZP_15096345.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-12]
 gi|420609640|ref|ZP_15101223.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-13]
 gi|420620323|ref|ZP_15110632.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-15]
 gi|420625371|ref|ZP_15115209.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-16]
 gi|420635658|ref|ZP_15124476.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-25]
 gi|420641260|ref|ZP_15129525.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-29]
 gi|420646412|ref|ZP_15134254.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-32]
 gi|420656895|ref|ZP_15143684.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-36]
 gi|420657506|ref|ZP_15144230.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-36]
 gi|420662841|ref|ZP_15148990.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-42]
 gi|420667836|ref|ZP_15153506.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-45]
 gi|420673135|ref|ZP_15158327.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-46]
 gi|420678638|ref|ZP_15163336.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-47]
 gi|420683871|ref|ZP_15168038.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-48]
 gi|420689038|ref|ZP_15172631.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-52]
 gi|420706270|ref|ZP_15187190.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-55]
 gi|420711557|ref|ZP_15191989.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-56]
 gi|420728212|ref|ZP_15206564.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-60]
 gi|420733305|ref|ZP_15211152.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-61]
 gi|420744021|ref|ZP_15220787.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-64]
 gi|420749890|ref|ZP_15225718.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-65]
 gi|420761062|ref|ZP_15235105.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-71]
 gi|420766235|ref|ZP_15239787.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-72]
 gi|420771280|ref|ZP_15244302.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-76]
 gi|420776585|ref|ZP_15249089.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-88]
 gi|420782081|ref|ZP_15253912.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-89]
 gi|420787526|ref|ZP_15258683.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-90]
 gi|420792972|ref|ZP_15263591.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-91]
 gi|420798144|ref|ZP_15268235.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-92]
 gi|420803491|ref|ZP_15273048.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-93]
 gi|420808679|ref|ZP_15277747.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-94]
 gi|420814471|ref|ZP_15282927.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-95]
 gi|420819594|ref|ZP_15287579.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-96]
 gi|420824682|ref|ZP_15292126.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-98]
 gi|420830514|ref|ZP_15297394.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-99]
 gi|420840446|ref|ZP_15306368.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-101]
 gi|420846057|ref|ZP_15311448.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-102]
 gi|420856993|ref|ZP_15320924.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-113]
 gi|421761820|ref|ZP_16198620.1| DNA mismatch repair protein [Yersinia pestis INS]
 gi|21960777|gb|AAM87336.1|AE013982_3 hypothetical [Yersinia pestis KIM10+]
 gi|108774077|gb|ABG16596.1| hypothetical protein YPN_0263 [Yersinia pestis Nepal516]
 gi|108781795|gb|ABG15853.1| hypothetical protein YPA_3892 [Yersinia pestis Antiqua]
 gi|115346205|emb|CAL19073.1| hypothetical protein YPO0392 [Yersinia pestis CO92]
 gi|149289973|gb|EDM40050.1| hypothetical protein YPE_2877 [Yersinia pestis CA88-4125]
 gi|165915540|gb|EDR34149.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            IP275]
 gi|165920755|gb|EDR38003.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            F1991016]
 gi|165992252|gb|EDR44553.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
            E1979001]
 gi|166206679|gb|EDR51159.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
            B42003004]
 gi|166963390|gb|EDR59411.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            MG05-1020]
 gi|167049890|gb|EDR61298.1| DNA mismatch repair enzyme [Yersinia pestis biovar Antiqua str.
            UG05-0454]
 gi|167055020|gb|EDR64820.1| DNA mismatch repair enzyme [Yersinia pestis biovar Mediaevalis str.
            K1973002]
 gi|229681950|gb|EEO78042.1| DNA mismatch repair enzyme [Yersinia pestis Nepal516]
 gi|229690656|gb|EEO82710.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            India 195]
 gi|229696634|gb|EEO86681.1| DNA mismatch repair enzyme [Yersinia pestis biovar Orientalis str.
            PEXU2]
 gi|262360528|gb|ACY57249.1| hypothetical protein YPD4_0340 [Yersinia pestis D106004]
 gi|262364474|gb|ACY61031.1| hypothetical protein YPD8_0341 [Yersinia pestis D182038]
 gi|270336532|gb|EFA47309.1| conserved hypothetical protein [Yersinia pestis KIM D27]
 gi|294352956|gb|ADE63297.1| hypothetical protein YPZ3_0387 [Yersinia pestis Z176003]
 gi|320017051|gb|ADW00623.1| DNA mismatch repair enzyme [Yersinia pestis biovar Medievalis str.
            Harbin 35]
 gi|342852754|gb|AEL71307.1| hypothetical protein A1122_03155 [Yersinia pestis A1122]
 gi|391432509|gb|EIQ93945.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-01]
 gi|391433640|gb|EIQ94949.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-02]
 gi|391434921|gb|EIQ96049.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-02]
 gi|391436310|gb|EIQ97282.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-03]
 gi|391448550|gb|EIR08355.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-04]
 gi|391449295|gb|EIR09033.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-05]
 gi|391451745|gb|EIR11215.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-06]
 gi|391464572|gb|EIR22844.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-07]
 gi|391466198|gb|EIR24293.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-08]
 gi|391468290|gb|EIR26180.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-09]
 gi|391481652|gb|EIR38165.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-10]
 gi|391482627|gb|EIR39063.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-12]
 gi|391482741|gb|EIR39165.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-11]
 gi|391496834|gb|EIR51747.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-13]
 gi|391497530|gb|EIR52385.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-15]
 gi|391512473|gb|EIR65787.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-16]
 gi|391516004|gb|EIR68942.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-25]
 gi|391528008|gb|EIR79870.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-29]
 gi|391532165|gb|EIR83588.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-32]
 gi|391545078|gb|EIR95212.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-36]
 gi|391546722|gb|EIR96684.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-42]
 gi|391546995|gb|EIR96935.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-36]
 gi|391547579|gb|EIR97467.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-45]
 gi|391561250|gb|EIS09801.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-46]
 gi|391562376|gb|EIS10794.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-47]
 gi|391564446|gb|EIS12648.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-48]
 gi|391576616|gb|EIS23147.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-52]
 gi|391588721|gb|EIS33710.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-55]
 gi|391592635|gb|EIS37029.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-56]
 gi|391605684|gb|EIS48533.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-60]
 gi|391620114|gb|EIS61303.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-61]
 gi|391629489|gb|EIS69416.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-64]
 gi|391631622|gb|EIS71233.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-65]
 gi|391642888|gb|EIS81113.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-71]
 gi|391645647|gb|EIS83504.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-72]
 gi|391655398|gb|EIS92146.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-76]
 gi|391662291|gb|EIS98244.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-88]
 gi|391667285|gb|EIT02641.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-89]
 gi|391668917|gb|EIT04107.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-90]
 gi|391672856|gb|EIT07631.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-91]
 gi|391686576|gb|EIT19984.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-93]
 gi|391688090|gb|EIT21342.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-92]
 gi|391689398|gb|EIT22536.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-94]
 gi|391700675|gb|EIT32752.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-95]
 gi|391703966|gb|EIT35667.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-96]
 gi|391704757|gb|EIT36393.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-98]
 gi|391715204|gb|EIT45773.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-99]
 gi|391720819|gb|EIT50813.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-101]
 gi|391731418|gb|EIT60129.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-102]
 gi|391737317|gb|EIT65218.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein [Yersinia pestis PY-113]
 gi|411178142|gb|EKS48154.1| DNA mismatch repair protein [Yersinia pestis INS]
          Length = 240

 Score = 41.6 bits (96), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 14/108 (12%)

Query: 1192 GRALHCLSQELYSQDSHFLL-ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            G+ +  LSQ++ S  S F L ELV+N+ D   P+    T+T I  E  I++ +   G S 
Sbjct: 10   GKIIEELSQKIPS--SLFALNELVKNSYDAFSPD---VTITVIPSELKIIISDYGNGMSV 64

Query: 1251 ENIRALCDVGNSTKK--------GSSAGYIGRKGIGFKSVFRVTDAPE 1290
            + I +L  +  STKK        G      G KG+GF S F+  D  E
Sbjct: 65   DEIHSLFHISKSTKKYGCEVSQNGIKRIVQGSKGLGFLSAFKFGDKVE 112


>gi|387761650|ref|YP_006068627.1| hypothetical protein Ssal_01492 [Streptococcus salivarius 57.I]
 gi|339292417|gb|AEJ53764.1| conserved hypothetical protein [Streptococcus salivarius 57.I]
          Length = 1506

 Score = 41.6 bits (96), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 10/98 (10%)

Query: 1210 LLELVQNADD-------NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
            +LEL+QN DD       +    N E  +    + + + V N    FS + I  LC +G +
Sbjct: 46   ILELLQNVDDAYGDIKSDNDASNDEVKVQITYKNNILEVGNTGTSFSKDTIERLC-LGRA 104

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
            + K S    IG KG GF+S+    +  E++S  F ++F
Sbjct: 105  SNKSSQN--IGNKGTGFRSLLNDAEWVELYSGEFAIRF 140


>gi|340399189|ref|YP_004728214.1| hypothetical protein SALIVB_1412 [Streptococcus salivarius CCHSS3]
 gi|338743182|emb|CCB93690.1| hypothetical protein SALIVB_1412 [Streptococcus salivarius CCHSS3]
          Length = 1506

 Score = 41.6 bits (96), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 10/98 (10%)

Query: 1210 LLELVQNADD-------NIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNS 1262
            +LEL+QN DD       +    N E  +    + + + V N    FS + I  LC +G +
Sbjct: 46   ILELLQNVDDAYGDIKSDNDASNDEVKVQITYKNNILEVGNTGTSFSKDTIERLC-LGRA 104

Query: 1263 TKKGSSAGYIGRKGIGFKSVFRVTDAPEIHSNGFHVKF 1300
            + K S    IG KG GF+S+    +  E++S  F ++F
Sbjct: 105  SNKSSQN--IGNKGTGFRSLLNDAEWVELYSGEFAIRF 140


>gi|403359361|gb|EJY79339.1| hypothetical protein OXYTRI_23390 [Oxytricha trifallax]
          Length = 338

 Score = 41.6 bits (96), Expect = 5.0,   Method: Composition-based stats.
 Identities = 39/110 (35%), Positives = 48/110 (43%), Gaps = 18/110 (16%)

Query: 29  PPPPHFNPNFAFQNPNI-FFPNPALQLL-------QNLTNIPF-QNPNFD----IQNPNL 75
           PPP  FN    + + NI   PN   Q         Q    +P  QNP +     IQNPN 
Sbjct: 147 PPPNAFNGG-QYPSANIQGMPNGTQQYQPGMMMDNQPAMGMPVSQNPQYSNLPPIQNPNS 205

Query: 76  FMQNANLLTQNPN---LPPQQPPSSACNQQRSQTQHPP-GASNQQRPLTQ 121
           +MQN + L  + N   LP Q+  SS  N QR +  H   G    Q  LTQ
Sbjct: 206 YMQNPSGLPNSQNLMPLPQQRQESSNLNSQRDRLDHSQMGLLQNQTQLTQ 255


>gi|260782174|ref|XP_002586166.1| hypothetical protein BRAFLDRAFT_109861 [Branchiostoma floridae]
 gi|229271259|gb|EEN42177.1| hypothetical protein BRAFLDRAFT_109861 [Branchiostoma floridae]
          Length = 649

 Score = 41.6 bits (96), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 11/71 (15%)

Query: 24  QPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLL 83
           QPP +  P   NP    +NP++   NP L           +NP+   +NP+L  +N +L 
Sbjct: 138 QPPDREIP--ANPELQPENPDLQPENPDLHP---------ENPDLQPENPDLHPENPDLQ 186

Query: 84  TQNPNLPPQQP 94
            +NP+L P+ P
Sbjct: 187 PENPDLQPENP 197


>gi|296272791|ref|YP_003655422.1| Heat shock protein Hsp90 [Arcobacter nitrofigilis DSM 7299]
 gi|296096965|gb|ADG92915.1| Heat shock protein Hsp90 [Arcobacter nitrofigilis DSM 7299]
          Length = 637

 Score = 41.6 bits (96), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 58/136 (42%), Gaps = 34/136 (25%)

Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI----------------YPEN 1225
            M K Q    +G+ LH ++  LYS    F+ ELV NA D I                 PE+
Sbjct: 1    MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLKLTDDKIKAALPED 60

Query: 1226 --VEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AG 1270
               E  ++F   +  + +++N  G + E++  +  +G   K G+              + 
Sbjct: 61   WSGEINVSFDEADKSLTIVDNGIGMNEEDL--IASIGTIAKSGTKSFIEAMTGDAKKDSN 118

Query: 1271 YIGRKGIGFKSVFRVT 1286
             IG+ G+GF SVF V 
Sbjct: 119  LIGQFGVGFYSVFMVA 134


>gi|258627089|ref|ZP_05721885.1| heat shock protein HtpG [Vibrio mimicus VM603]
 gi|258580607|gb|EEW05560.1| heat shock protein HtpG [Vibrio mimicus VM603]
          Length = 635

 Score = 41.2 bits (95), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 65/148 (43%), Gaps = 33/148 (22%)

Query: 1176 ISNMESNMLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIY 1222
            I+N E+   + +  +L   LH +   LYS    FL EL+ NA D             ++Y
Sbjct: 6    ITNKETRGFQSEVKQL---LHLMIHSLYSNKEIFLRELISNASDAADKLRFQALSNPDLY 62

Query: 1223 PENVE--PTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS------------- 1267
              + E    L+F  + + + + +N  G S E++  + ++G   K G+             
Sbjct: 63   QGDAELGVKLSFDQENNTLTISDNGIGMSREDV--IENLGTIAKSGTAEFFSKLSQEQSK 120

Query: 1268 SAGYIGRKGIGFKSVFRVTDAPEIHSNG 1295
            S+  IG+ G+GF S F V DA  + +  
Sbjct: 121  SSQLIGQFGVGFYSAFIVADAVTVRTRA 148


>gi|404371673|ref|ZP_10976977.1| hypothetical protein CSBG_01028 [Clostridium sp. 7_2_43FAA]
 gi|226912201|gb|EEH97402.1| hypothetical protein CSBG_01028 [Clostridium sp. 7_2_43FAA]
          Length = 649

 Score = 41.2 bits (95), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 54/110 (49%), Gaps = 8/110 (7%)

Query: 1197 CLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSAENIRAL 1256
             L  ++   DS  + EL++N+ D  + + V  T+ F   ++ I + +N  G S E I+  
Sbjct: 20   ALGSKIIELDSIVIAELIKNSKD-AHAKKV--TINFSDYKNSITIWDNGDGMSIEEIKDK 76

Query: 1257 CDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAPEI-----HSNGFHVKFD 1301
              V  S  K +    +G KGIG  S+F++ ++  I     HSN +   FD
Sbjct: 77   WGVVASNNKSNEVDTLGGKGIGRFSIFKLCNSFTIITKKEHSNEYKFTFD 126


>gi|91978860|ref|YP_571519.1| heat shock protein 90 [Rhodopseudomonas palustris BisB5]
 gi|123735061|sp|Q12ZX1.1|HTPG_RHOPS RecName: Full=Chaperone protein HtpG; AltName: Full=Heat shock
            protein HtpG; AltName: Full=High temperature protein G
 gi|91685316|gb|ABE41618.1| heat shock protein Hsp90 [Rhodopseudomonas palustris BisB5]
          Length = 628

 Score = 41.2 bits (95), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 81/209 (38%), Gaps = 44/209 (21%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEPTLTFILQESG--------- 1238
             A +   L+ +   +YS+   FL EL+ NA D +     E   T  L E+G         
Sbjct: 14   QAEVAELLNLMVHSVYSETDIFLRELISNASDALDKLRYESIATPALMEAGGAPKIQIVP 73

Query: 1239 ------IVVLNNEQGFSAENIRALCDVGNSTKKGS------------SAGYIGRKGIGFK 1280
                  + V++N  G + + +  + ++G   K G+             AG IG+ G+GF 
Sbjct: 74   RKAPDTLTVIDNGIGMNRQEL--IDNLGTIAKSGTKSFLTKLTEAKDGAGLIGQFGVGFY 131

Query: 1281 SVFRVTDAPEIHS---NGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQLESK 1337
            + F V D   + S       V   +S G  GF     + P + +   R++    + L  K
Sbjct: 132  AAFMVADNIVVTSRRAGSGEVWTWSSSGGAGFE----IAPASEEAAARVVRGTEIVLHLK 187

Query: 1338 CWNTCIRLPFRTKFSEGIAMNNIVDMFSD 1366
                    P   K+ E   +  IV  +SD
Sbjct: 188  --------PDAAKYLEAYQIERIVSEYSD 208


>gi|420694858|ref|ZP_15177717.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein, partial [Yersinia pestis PY-53]
 gi|391577417|gb|EIS23851.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
            protein, partial [Yersinia pestis PY-53]
          Length = 132

 Score = 41.2 bits (95), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 14/108 (12%)

Query: 1192 GRALHCLSQELYSQDSHFLL-ELVQNADDNIYPENVEPTLTFILQESGIVVLNNEQGFSA 1250
            G+ +  LSQ++ S  S F L ELV+N+ D   P+    T+T I  E  I++ +   G S 
Sbjct: 10   GKIIEELSQKIPS--SLFALNELVKNSYDAFSPD---VTITVIPSELKIIISDYGNGMSV 64

Query: 1251 ENIRALCDVGNSTKK--------GSSAGYIGRKGIGFKSVFRVTDAPE 1290
            + I +L  +  STKK        G      G KG+GF S F+  D  E
Sbjct: 65   DEIHSLFHISKSTKKYGCEVSQNGIKRIVQGSKGLGFLSAFKFGDKVE 112


>gi|408373681|ref|ZP_11171375.1| heat shock protein 90 [Alcanivorax hongdengensis A-11-3]
 gi|407766385|gb|EKF74828.1| heat shock protein 90 [Alcanivorax hongdengensis A-11-3]
          Length = 617

 Score = 41.2 bits (95), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 33/133 (24%), Positives = 49/133 (36%), Gaps = 26/133 (19%)

Query: 1188 HARLGRALHCLSQELYSQDSHFLLELVQNADD-------------NIYPENVEPTLTFIL 1234
             A + R LH +   LYS    FL EL+ NA D              +  E+ EP +   L
Sbjct: 11   QAEVSRLLHLMIHSLYSNREIFLRELISNASDACDKLRFEALDTPALMEEDPEPAIRLTL 70

Query: 1235 QESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-------------SAGYIGRKGIGFKS 1281
             +S   +   + G        +  +G   + G+              A  IG+ G+GF S
Sbjct: 71   DKSAGTLTLADNGIGMSEAEIIDHLGTIARSGTEKFLASLSGDQKKDAQLIGQFGVGFYS 130

Query: 1282 VFRVTDAPEIHSN 1294
             F V D   + S 
Sbjct: 131  AFIVADKVTVESR 143


>gi|260785720|ref|XP_002587908.1| hypothetical protein BRAFLDRAFT_87296 [Branchiostoma floridae]
 gi|229273063|gb|EEN43919.1| hypothetical protein BRAFLDRAFT_87296 [Branchiostoma floridae]
          Length = 1403

 Score = 41.2 bits (95), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 4/71 (5%)

Query: 24  QPPSQPPPPHFNPNFAFQNPNIFFPNPALQLLQNLTNIPFQNPNFDIQNPNLFMQNANLL 83
           QPP +  P   NP+   +NP++   NP L       ++  +NP+   +NP+    N +L 
Sbjct: 533 QPPDREIPE--NPDLQPENPDLQPGNPELH--PGNPDLQLENPDLQPENPDQQPDNPDLQ 588

Query: 84  TQNPNLPPQQP 94
            +NP+L PQ P
Sbjct: 589 PENPDLQPQNP 599


>gi|119484754|ref|ZP_01619236.1| hypothetical protein L8106_14815 [Lyngbya sp. PCC 8106]
 gi|119457572|gb|EAW38696.1| hypothetical protein L8106_14815 [Lyngbya sp. PCC 8106]
          Length = 864

 Score = 41.2 bits (95), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 45/199 (22%), Positives = 88/199 (44%), Gaps = 11/199 (5%)

Query: 1190 RLGRALHCLSQELYSQDSHFLLELVQNADDNIYPENVEP-TLTFILQESGIVVLNNEQGF 1248
            ++ + LH L + + +    +  EL+QNA D+I  ++    ++   L  + +   +N   F
Sbjct: 23   KIIKRLHDLEKTVENNQGRWAWELLQNAKDSIAEDDDRTVSIQIELDRNSVTFRHNGNHF 82

Query: 1249 SAENIRALCDVGNS--TKKGSSAGYIGRKGIGFKSVFRVTDAPEIH-----SNGFHVKFD 1301
            +++++  L D  +S   ++G      GR G GF +   ++   +I      ++G   KF+
Sbjct: 83   TSQDVIGLIDQISSKEVEEGQETKKTGRFGTGFLTTHLLSRVIKIKGIIGSADGSFHKFE 142

Query: 1302 TSEGQIGFVLPTLVPPF--NIDMFCRLLSKDPVQLESKCWNTCIRLPFRTKFSEGIAMNN 1359
                + G     L+P    +   F   + K     +  C+NT       T+  + IA   
Sbjct: 143  FLLNRQGRTTAQLMPRITNSWKEFDESIEKISPSYDKNCFNTSFCYQLDTEEQKKIAQIG 202

Query: 1360 IVDMFSDLHPSLLLFLHRL 1378
             VD FS L P +L F+ ++
Sbjct: 203  -VDEFSRLIPFVLAFIPKI 220


>gi|87311767|ref|ZP_01093882.1| HtpG [Blastopirellula marina DSM 3645]
 gi|87285551|gb|EAQ77470.1| HtpG [Blastopirellula marina DSM 3645]
          Length = 595

 Score = 41.2 bits (95), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 52/108 (48%), Gaps = 15/108 (13%)

Query: 1195 LHCLSQELYSQDSHFLLELVQNADDNIYP-ENVEP---------TLTFILQESGIVVLNN 1244
            +  LSQ LYS    F+ EL+QN  D I     +EP          +T    +  I+  +N
Sbjct: 7    IELLSQHLYSGPHVFIRELLQNGVDAIQARRQIEPKHEGAIEIEVVTSEESDPTIIFQDN 66

Query: 1245 EQGFS-AENIRALCDVGNSTKKGSSAG----YIGRKGIGFKSVFRVTD 1287
              G + AE  + L  +G S+K+G +      ++G+ GIG  S F V+D
Sbjct: 67   GVGLTEAEVQQFLATIGQSSKRGEATSRPDDFLGQFGIGLLSCFTVSD 114


>gi|386859793|ref|YP_006272499.1| chaperone protein htpG [Borrelia crocidurae str. Achema]
 gi|384934674|gb|AFI31347.1| Chaperone protein htpG [Borrelia crocidurae str. Achema]
          Length = 615

 Score = 40.8 bits (94), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 36/135 (26%), Positives = 56/135 (41%), Gaps = 28/135 (20%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQDSHFLLELVQNADD--------NIYPE-----NVEPT 1229
            M K+    +   L+ +   LYS    FL EL+ NA D        NI  E      ++P 
Sbjct: 1    MKKQFDTEVNDLLYLIIHSLYSHKEIFLRELISNASDAIDKLKFLNITDEKFKDIKLDPR 60

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGS-------------SAGYIGRKG 1276
            +     E  I + +N  G + E++  +  +G   K G+             +A  IG+ G
Sbjct: 61   IDINFDEKSITIKDNGIGMNREDL--ISHLGTIAKSGTKEFINNLKKDEKKAASLIGQFG 118

Query: 1277 IGFKSVFRVTDAPEI 1291
            +GF S F V D  E+
Sbjct: 119  VGFYSAFIVADKVEV 133


>gi|66827323|ref|XP_647016.1| hypothetical protein DDB_G0268852 [Dictyostelium discoideum AX4]
 gi|60475079|gb|EAL73015.1| hypothetical protein DDB_G0268852 [Dictyostelium discoideum AX4]
          Length = 2673

 Score = 40.8 bits (94), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 68/258 (26%), Positives = 110/258 (42%), Gaps = 48/258 (18%)

Query: 1183 MLKKQHARLGRALHCLSQELYSQ-DSHFLLELVQNADDNI------------YP------ 1223
            M K +  RL +  H L   L +  +S  + EL+QNA+D +            YP      
Sbjct: 1    MPKTKDQRL-KVSHSLRNNLLNYPESKIVRELIQNAEDALADTFIIKVDEGSYPNNGLLI 59

Query: 1224 --ENVEPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKS 1281
              +N E   +  L    I++ NN   F  ++   +C++   +KK +    +G  G+G+ S
Sbjct: 60   SNDNQEIESSKELLGPSILIYNN-ATFHEQDWEGICEINQGSKKENLKS-VGNFGLGWNS 117

Query: 1282 VFRVTDAPEIHSNGFHVKFDTSEGQIGFVLPTLVPPFNIDMFCRLLSKDPVQL-ESKC-- 1338
            V+ +TD P ++S G +V F     +IG  L   +   +    C    +  +Q  ES C  
Sbjct: 118  VYHITDNPIVYS-GNNVWFADPNERIGGGLFFDLDDEDDYNKCFSFLEPFIQFKESNCNP 176

Query: 1339 ---WN-TCIRLPFRTKFSE-------GIAMNNIVDMFSDLHPSLLLFLHRLQCIMFRNML 1387
               +N T IRLP R   SE         A+  I++ FS     +L+FL  L  I      
Sbjct: 177  KEYFNGTIIRLPLRIYPSEIKKAVLTMDAIKKILNEFSKDINEILIFLKSLASIT----- 231

Query: 1388 NDSLVVIRKKIVGDGIIK 1405
                +++  +IV    IK
Sbjct: 232  ----IIVNNQIVNSVKIK 245


>gi|384172841|ref|YP_005554218.1| heat shock protein 90 [Arcobacter sp. L]
 gi|345472451|dbj|BAK73901.1| heat shock protein 90 [Arcobacter sp. L]
          Length = 636

 Score = 40.8 bits (94), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 58/134 (43%), Gaps = 33/134 (24%)

Query: 1183 MLKKQ-HARLGRALHCLSQELYSQDSHFLLELVQNADDNI--------YPENV------- 1226
            M K Q    +G+ LH ++  LYS    F+ ELV NA D I          EN+       
Sbjct: 1    MAKHQFQTEVGQLLHLMTHSLYSNKEIFIRELVSNASDAIDKLNYLRLTDENLKDKYAQW 60

Query: 1227 --EPTLTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSS-------------AGY 1271
              E  ++F  ++  + + +N  G + E++  +  +G   K G+              +  
Sbjct: 61   KGEINISFDEKDKSLTISDNGIGMNDEDL--IASIGTIAKSGTKSFIDNLTGDAKKDSNL 118

Query: 1272 IGRKGIGFKSVFRV 1285
            IG+ G+GF SVF V
Sbjct: 119  IGQFGVGFYSVFMV 132


>gi|226509636|ref|NP_001148622.1| ubiquilin-1 [Zea mays]
 gi|195620874|gb|ACG32267.1| ubiquilin-1 [Zea mays]
          Length = 576

 Score = 40.8 bits (94), Expect = 9.1,   Method: Composition-based stats.
 Identities = 39/166 (23%), Positives = 62/166 (37%), Gaps = 18/166 (10%)

Query: 7   PRFRPVGGVGGEGTGGGQPPSQPPPPHFNPNFA---------FQNPNIFFPNPALQLLQN 57
           P   P GG+GG   G   P +  P   F P F           QNPN+      +  +QN
Sbjct: 118 PSSGPAGGLGGLFPGVDAPGNDRPSGIFRPGFPELEQVEQHLSQNPNLMREIMNMPAMQN 177

Query: 58  LTNIPFQNPNFDIQNPNLFMQNANLLTQNPNLPPQQPPSSACNQQRSQTQHPPGASNQQR 117
           L N P    N  + NP    Q  +++ +NP+L       S   Q     ++P       R
Sbjct: 178 LMNNPDLIRNMIMSNP----QMRDIMDRNPDLAHVLNDPSVLRQTLEAARNPEIMREMMR 233

Query: 118 PLTQQPQSLPPQPQKLN--KEFLERIDRAVVKARSELIMAGESVSS 161
              +   ++   P+  N  +   E +    + A +   M GE  ++
Sbjct: 234 NTDRAMSNIESSPEGFNMLRRMYETVQEPFLNATT---MGGEGSTA 276


>gi|291438926|ref|ZP_06578316.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
 gi|291341821|gb|EFE68777.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672]
          Length = 1048

 Score = 40.8 bits (94), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 20/77 (25%), Positives = 36/77 (46%)

Query: 1230 LTFILQESGIVVLNNEQGFSAENIRALCDVGNSTKKGSSAGYIGRKGIGFKSVFRVTDAP 1289
            L   L++  +V  N      A  + +L  +  S K+ +    +GR G+GF +V  VTD P
Sbjct: 80   LRLTLRDGVLVAANTGAPLDAAGVESLSTLRASAKRDAPERSVGRFGVGFAAVLAVTDEP 139

Query: 1290 EIHSNGFHVKFDTSEGQ 1306
             +      V++  +E +
Sbjct: 140  AVVGRHGGVRWSLAEAR 156


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.136    0.405 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 32,939,753,213
Number of Sequences: 23463169
Number of extensions: 1414801664
Number of successful extensions: 4032289
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 271
Number of HSP's successfully gapped in prelim test: 1519
Number of HSP's that attempted gapping in prelim test: 4011808
Number of HSP's gapped (non-prelim): 14956
length of query: 2109
length of database: 8,064,228,071
effective HSP length: 159
effective length of query: 1950
effective length of database: 8,628,551,496
effective search space: 16825675417200
effective search space used: 16825675417200
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 85 (37.4 bits)