BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 002242
         (948 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224075882|ref|XP_002304811.1| predicted protein [Populus trichocarpa]
 gi|222842243|gb|EEE79790.1| predicted protein [Populus trichocarpa]
          Length = 1122

 Score =  650 bits (1676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/966 (43%), Positives = 557/966 (57%), Gaps = 64/966 (6%)

Query: 15   DANKALLVPEVNTSAGAGK----KNDS--DKVLPLQLA----------ETFACSGESGNQ 58
            D++KAL + +V TS G G+    K DS    V+P   +            F  + +SG Q
Sbjct: 119  DSSKALPLKQVETSGGVGEAIQLKMDSVGGGVIPSLTSAPISVENADPRDFTSNSDSGKQ 178

Query: 59   F---KIDNNASDMEVLSTSSYPKTEKNRIVRSLVNDQDKVGRREEKFK----KVADVNDA 111
            +   K  N +  M      S P T +N   +S  N +D+  + EEK      K AD++DA
Sbjct: 179  YEERKGQNASCMMHDSRLISIPTTAENAGPQSATNYKDRGCQHEEKCNVICIKDADISDA 238

Query: 112  VELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQARLENQEDTSHYPNKEIDET 171
            VELSISASEALVIH+       S+ L    +LEAAL +KQARLE+ ED    P+ E DE 
Sbjct: 239  VELSISASEALVIHKFVKTGSSSDALTKQAILEAALHIKQARLESSEDAFGCPSDEADEI 298

Query: 172  HLVSELDDLTMASACEDVGLPSAGSDENFVSGSESFLIKCTPLSENYYESNDNGFKHVEC 231
              +S+LDD  M  A  DVGL  +   +  +   +   ++ TP+ EN++   + G +HV+ 
Sbjct: 299  DFLSDLDDSIMEDAYLDVGLSFSAHGDEHLHDLDVSQVEETPVLENHHL--EKGSEHVQL 356

Query: 232  LAHKKKLPADPALNLNGSYMEGKIDPMLHQSTQETSHVLAAAQ-----------PEASTN 280
            L  +     D  L  N S      D +L Q  ++ S   + A+           P    N
Sbjct: 357  LPQQNNADDDSDLGSNPSDAACLGDHILTQPAEKLSESSSGAKFVFTKDGNLGLPPVDVN 416

Query: 281  SIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWTGKEVDVPIQLEQSNNKGIPKFYVNETS 340
            S +A  +  A G  E+   + DRF+SRWFGGW  +E D   +L+Q++ K IPKF+V ETS
Sbjct: 417  SFHACRAEKAEGAREVHYLIADRFKSRWFGGWALEEGDASAKLKQNSPKSIPKFFVGETS 476

Query: 341  SLSESADVAPDENSFVQKHETGSKIASQSSIPFEGVNNKADEEISFCQDVISSAQSFVDP 400
             LSESADVAPDENSFVQKHET S I SQSSIPFE +++K        Q+VISS  S VDP
Sbjct: 477  FLSESADVAPDENSFVQKHETRSNIGSQSSIPFEALHDK--------QEVISSDLSLVDP 528

Query: 401  LCSFVPCSVSAVDASSPQTINVAAAQKQ-------LLEVENLQWTSDFNAEFINRDRQDV 453
            LCS VPCS+S  +A SP   N      +          +EN Q TS   AE +  D Q V
Sbjct: 529  LCSVVPCSISLENAISPSVQNNRKVDAENCFNPNTDTGMENFQKTSHLKAEPVFMDSQTV 588

Query: 454  STFSVEGFENPVKRQLTLLKPYSTLSPINDPKLEKRSCFRNHLVPSECHEEIISLEQNIG 513
                 +    PV+R++  L+ YSTLSP  D  LE+     N    S     +++  Q +G
Sbjct: 589  PIIMGQCSNAPVRRRVASLRTYSTLSPNCDAVLEREGPCHNGRYSSGHVRNLLASHQEMG 648

Query: 514  CIWSWDKRYCKGLLSFGSASKFAAGRENEETYKTTVTGNPYAVATNCTENNDKLVSHGAG 573
            CI   D+R  KG+L F S  +   GR+NEE     V  N  A  T    ++D+       
Sbjct: 649  CIRPSDQRNSKGVLPFKSVFESTDGRDNEEN--QDVVRNLVAEITCQKRSHDQPTKDRTE 706

Query: 574  L-LEPLKESRLPPFPNRGT-CRSHASKLIGDVNNSFGKTNPDEAGVQETVVMLH------ 625
            + ++P  + R P   NR T CR  AS+L    +N  G+ +P++A  QE ++ LH      
Sbjct: 707  MKVKPSVQRRSPLILNRRTRCRPQASELF--THNLTGEISPEQAVGQENIIKLHPSKNAE 764

Query: 626  NLSEERNNYLETQVPIRKRIRLSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNAC 685
             +  +  N    + P+RKR+  SEVEVDL QN+D+ K Q+ H+  S  R  +   N N C
Sbjct: 765  KIKLKWENSFGARNPVRKRVCFSEVEVDLYQNKDLRKPQTLHRNGSTIRADKKKNNGNTC 824

Query: 686  RDSQLQYVKRCSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP- 744
             + Q Q VK   T + K  KRLIF G++FLLTGFS +KEKEI  +IQ YGG+++LDIPP 
Sbjct: 825  SEVQPQDVKSSFTCQIKDAKRLIFHGLEFLLTGFSHKKEKEIIEIIQIYGGMIVLDIPPV 884

Query: 745  PNSQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAK 804
            PNS+ KR SRS+ Q LPVVIC KKL+TTKFLYGCA+NA ILK KWLTDSVAAGS VSP K
Sbjct: 885  PNSRLKRVSRSNLQHLPVVICSKKLQTTKFLYGCAMNALILKLKWLTDSVAAGSVVSPDK 944

Query: 805  YMILSSQADLKRTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTL 864
            YMI+S+QA LK T I + +C ++ K+IF RVGI+LHGK  FCTK  VIVKH  GQVFKTL
Sbjct: 945  YMIISNQAYLKCTRIGKSVCCNHRKHIFDRVGIVLHGKHRFCTKLTVIVKHAHGQVFKTL 1004

Query: 865  HWLVLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQ 924
              LV SL+++K  +G IV ENE R  RHLR+CA E+KI M+PASWI KSLH+G LLP  +
Sbjct: 1005 QSLVESLDSEKISMGAIVTENETRELRHLRYCASERKIPMMPASWIAKSLHLGKLLPFIE 1064

Query: 925  DKHIPS 930
            D+  PS
Sbjct: 1065 DEATPS 1070


>gi|255569143|ref|XP_002525540.1| conserved hypothetical protein [Ricinus communis]
 gi|223535119|gb|EEF36799.1| conserved hypothetical protein [Ricinus communis]
          Length = 991

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 392/874 (44%), Positives = 524/874 (59%), Gaps = 50/874 (5%)

Query: 91  DQDKVGRREEKFKKV--ADVNDAVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQ 148
           D  ++G R   FK++  ADVN+AVELSI+ASEALVIHEL  +   S+ + T  +LEAALQ
Sbjct: 152 DNVEIGERIH-FKRIKDADVNEAVELSIAASEALVIHELMCSGSTSKAVPTGAILEAALQ 210

Query: 149 VKQARLENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPSAGSDENFVSGSESFL 208
           VKQARLE  ED +   + EIDE H++S+LDD  M +A  DVGL    +D+    GS+   
Sbjct: 211 VKQARLEALEDENCCSSDEIDEIHVLSDLDDSVMENAFVDVGLSFRDTDDQRACGSDVSQ 270

Query: 209 IKCTPLSENYYESNDNGFKHVECLAHKKKLPADPALNLNGSYMEGKIDPMLHQSTQETSH 268
           +K TP+  +Y   + NG KHVE + H  K+ +  AL LN + +   ++P+LHQS + + +
Sbjct: 271 VKDTPVDSHY--GHKNGSKHVELMVHHDKVYSGSALGLNNNNIASLVNPLLHQSAEASPY 328

Query: 269 VLAAAQPEASTNSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWTGKEVDVPIQLEQSNN 328
           V +  QP+   +S  A  S  + G+E++C  VVD+FRSRW GGWT KE D   +++Q N+
Sbjct: 329 VPSVVQPQVDVSSSPACMSEKSEGEEKLCSLVVDKFRSRWSGGWTVKEADTSEKVKQKNS 388

Query: 329 KGIPKFYVNETSSLSESADVAPDENSFVQKHETGSKIASQSSIPFEGVNNKADEEISFCQ 388
           KG+ KF+  ETS LSESA VA DENSFVQKHETGSKIASQSSIPFEG  ++ADE I   Q
Sbjct: 389 KGVLKFFDFETSFLSESAYVAADENSFVQKHETGSKIASQSSIPFEGGLDRADEGICSFQ 448

Query: 389 DVISSAQSFVDPLCSFVPCSVSAVDASSPQTINVAAAQ-----KQLLEVENLQWTSDFNA 443
           + I     F   + S    S+S+ +    +    A AQ     K    +EN Q TSD N 
Sbjct: 449 EYI-----FRXXISSENAGSLSSQNKDDRE----AYAQNCFSTKSEPRMENFQSTSDVNV 499

Query: 444 EFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSPINDPKLEKRSCFRNHLVPSECHE 503
           E     R+ V     E FE PV+R L  LK YS L P ++  L ++   +NH   SE   
Sbjct: 500 EPPYAGREAVPKTYGEYFEVPVRRHLASLKTYSKLLPKHEATLSEQQLNQNH--SSEYVG 557

Query: 504 EIISLEQNIGCIWSWDKRYCKGLLSFGSASKFAAGRENEETYKTTVTGNPYAVATNCTEN 563
           E+IS +         D+   K  +   S   F+ GR++ +     +  +P +       +
Sbjct: 558 ELISSKS------FSDQMNSKLFVPLRSEFHFSVGRDHGQNEDVNILRSPLS-------D 604

Query: 564 NDKLVSHGAGLLEPLKESRLPPFPNRGT-CRSHASKLIGDVNNSFGKTNPDEAGVQETVV 622
           +D+    G    +P K+ R P   NR T C    S+L+    N  G+  P+    Q TV 
Sbjct: 605 HDEPAKDGPE--QPSKQRRSPLVLNRKTRCHLQPSELL---RNLAGEKTPERIFAQGTVN 659

Query: 623 M------LHNLSEERNNYLETQVPIRKRIRLSEVEVDLRQNRDIEKLQSSHKFHSVTRTS 676
                  L  +  +  N  +T   +RKR+ L E EV L+QN+D  K Q+S +  S    +
Sbjct: 660 KALQRKNLQKIDSKLENPHKTANLVRKRVCLLEAEVGLQQNKDAPKTQTSQRNSSAVTAN 719

Query: 677 EMSKNSNACRDSQLQYVKRCSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGG 736
           + SK S  C DSQ Q +K C       +KRLIF G++FLLTGFSS+K++EI  LIQ+YGG
Sbjct: 720 KRSKYSKPCSDSQNQDMKNCFKDLVNDMKRLIFHGLQFLLTGFSSRKKREIAGLIQEYGG 779

Query: 737 LVLLDIPPP--NSQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSV 794
           LVLLD+P P  N +  R +RS+ Q+LP+VIC KKL+TTKFLYGCAVN  ILK KWL DSV
Sbjct: 780 LVLLDVPSPPSNIRTTRKARSNFQRLPIVICSKKLQTTKFLYGCAVNTLILKVKWLADSV 839

Query: 795 AAGSTVSPAKYMILSSQADLKRTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVK 854
           AAGS + P KYMI+S+Q   + T   +    D  + IF RVGIMLHGK SFC+K  +I+K
Sbjct: 840 AAGSAIPPDKYMIISNQVGQQYTRFGKLGYHDAPRCIFDRVGIMLHGKHSFCSKLEIIIK 899

Query: 855 HGGGQVFKTLHWLVLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSL 914
           HGGGQVFKTL  L  SL+T+K  VG I+AE+E   SRHLRHCA E+K+ M+PASWI +SL
Sbjct: 900 HGGGQVFKTLQRLFQSLDTEKISVGAIIAEDESTASRHLRHCALERKLPMMPASWIARSL 959

Query: 915 HIGMLLPQTQDKHIPSPTVNVPEFRISIDWSEEI 948
           H G LLP  +     +P +N PE  IS+DWS+EI
Sbjct: 960 HFGKLLPFKEKDD--APKINSPESAISLDWSQEI 991


>gi|449435496|ref|XP_004135531.1| PREDICTED: uncharacterized protein LOC101203785 [Cucumis sativus]
          Length = 1330

 Score =  427 bits (1099), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 325/935 (34%), Positives = 474/935 (50%), Gaps = 125/935 (13%)

Query: 64   NASDMEVLSTSSYPKTEKNRIVRSLVNDQDKVGRREEKFK----KVADVNDAVELSISAS 119
            NA D ++   SSY     N   +SL N +D+  R+ EK      K A+V+DA+ELS+ AS
Sbjct: 146  NAGDTDLAPHSSYKDVLHNVGCQSLTNTEDRENRQGEKLDVGCLKNAEVSDAIELSVVAS 205

Query: 120  EALVIHELSNNEPDSEDLLTTIVLEAALQVKQARLENQEDTSHYPNKEIDETHLVSELDD 179
            EALVIHEL  +E DS  +    VLEA++QVK+AR+E  E      ++E+D +  +S+LD+
Sbjct: 206  EALVIHELLKDELDSAAVSVEAVLEASIQVKKARIELLESALESIDEEVDLSDSLSDLDN 265

Query: 180  LTMASACEDVGLPSAGSDENFVSGSESFLIKCTPLSENYYE-----------------SN 222
             TM  A +DVGLPS+  + +  SG+  F ++ TP+++N +                   N
Sbjct: 266  STMRDAFDDVGLPSSILNSDH-SGTACFDVQDTPVNKNEFTHGSQCNSIDMTSQPDILGN 324

Query: 223  DNGFKHVE---CLAHKKKLPA-DPALNL-----------NGSYMEGKIDPMLHQSTQETS 267
                K +E    +     LP  D + N+           + S    K D ML   TQ  S
Sbjct: 325  GLTLKQLEENLVVTRPVGLPMEDLSCNIQHQLSNDDVLGSTSTNYCKYDSMLQHPTQNES 384

Query: 268  HVLAAAQPEAST--------------NSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
                  Q   S+              +S++  +  +A   E +     +RF+SRW GGW+
Sbjct: 385  DEFVVKQKIVSSIVNTNLCTIHAKENSSLHESSKVSAKNDELVAFFTPERFKSRWLGGWS 444

Query: 314  GKEVDVPIQLEQS-NNKGIPKFYVNETSSLSESADVAPDENSFVQKHETGSKIASQSSIP 372
            GKEVDV  QL Q  + K IP  +VNETS LSESAD+APDENS VQ+ E+  ++ASQSSI 
Sbjct: 445  GKEVDVSEQLRQDVDGKTIPLMFVNETSFLSESADIAPDENSCVQRCESKFQVASQSSIH 504

Query: 373  FEGVNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVSAVDASSPQTINVAAAQKQLLEV 432
            F  ++ K D+ +   ++++  + S VDPLCSFVPCS+S    S+ Q +N           
Sbjct: 505  FGHLDEKGDDGLLVAEEIVKCSLSLVDPLCSFVPCSISLDTDSAGQNLN----------- 553

Query: 433  ENLQWTSDFNAEFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSPINDPKLEKRSCF 492
            E    T +    F++          V G    ++RQ+T LK YST+SP +          
Sbjct: 554  EGKDCTEELLGTFVD----------VGGSRPSIRRQVTSLKNYSTISPTHATMEGGLDNS 603

Query: 493  RNHLVPSECHEEIISLEQNIGCI-WSWDKRYCKGLLSFGSASKFAAGRENEETYKTTVTG 551
              H +P   +  ++S +  + C  +S    + + L S  + S+     +  E  +T    
Sbjct: 604  YAHQLPG--NMRLLSSDSQLDCTRFSSKINFMETLPSQSTKSR---DMDTVEDSQTDARH 658

Query: 552  NPYAVATNCTENNDKLVSHGAGLL-EPLKESRLPPFPNRGTCRSHASKLIGDVNNSF--- 607
            N     T     +D++    +  L + +K+S      N     S ++     +       
Sbjct: 659  NLVEEITELKSKSDEVAGDVSEFLADTVKKSVTCDILNGSLQLSKSTMKKSSIKKDHLQS 718

Query: 608  GKTNPDEAGVQETVVMLHNLSEERNNYLETQVPIRKRIRLSEVEVDLRQNRDIEK----- 662
             KT  +   V   V M H    E  N LE  + ++KR+R  E     ++N D +K     
Sbjct: 719  SKTISNPQKVDNVVKMQH----ESKNPLEPCMLVQKRVRFLEANDQPQENLDFQKVHPPI 774

Query: 663  ----------------------LQSSHKFHSVTRTSEMSKNSNACR-----DSQLQYVKR 695
                                  L       S  RTS+  K SN C      D +     R
Sbjct: 775  NCKDWSIVGVIDMIYYVFFLRNLPFGWNADSTLRTSKRRKFSNQCLLSRHPDGKGHLKSR 834

Query: 696  CSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPNSQGKRCSRS 755
              ++R+K    LIFQGI+FL+TGFSS+KEK+I  ++   GG++L DIP P+S+G++ S+S
Sbjct: 835  YCSSRKK----LIFQGIQFLVTGFSSRKEKDINGIVCNNGGIILPDIPCPSSRGQKMSKS 890

Query: 756  HSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLK 815
              +  PV++  KKL+T KFLYGCAVN+ I+   WLTDS+AAGS V P KYMI+S+QAD  
Sbjct: 891  DCKGPPVILSSKKLQTKKFLYGCAVNSLIVNVSWLTDSIAAGSIVPPWKYMIISNQADCT 950

Query: 816  RTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQK 875
            + G    +   + +YIF  VG+MLHGK  FCTK   ++KHGGGQVFKTL WLV SL  +K
Sbjct: 951  QIG--RSVRHSSRRYIFENVGVMLHGKQGFCTKLTNVLKHGGGQVFKTLQWLVKSLNQEK 1008

Query: 876  NIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWI 910
              VGVIV E+E + SRHL+ CA E+ I ++ A  +
Sbjct: 1009 ISVGVIVVEDEHKSSRHLKQCALEQGIPLMIAKLV 1043


>gi|357456285|ref|XP_003598423.1| hypothetical protein MTR_3g013490 [Medicago truncatula]
 gi|355487471|gb|AES68674.1| hypothetical protein MTR_3g013490 [Medicago truncatula]
          Length = 1201

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 282/781 (36%), Positives = 390/781 (49%), Gaps = 122/781 (15%)

Query: 255  IDPMLHQSTQETSHVLA---AAQPEASTNSIYAWNSNN--AVGKEEICCSVV-DRFRSRW 308
            ID  L  + Q+    +A   + +P+ S NS     S N   V  E     +  +RFRSRW
Sbjct: 456  IDATLISTFQKDDLAMADFTSIKPQHSVNSSLFQTSENFKDVKNENWAAYLAPERFRSRW 515

Query: 309  FGGWTGK--------EVDVPI---------QLEQSNNKGIPKFYVNETSSLSESADVAPD 351
             GGWT K        E+D P           L  +N K IPKF V ETS L+ESAD+ PD
Sbjct: 516  LGGWTSKTYYLCLASELDSPSLPDSKKASPSLNLNNEKRIPKFLVRETSFLTESADIIPD 575

Query: 352  EN----------------------------SFVQKHETGSKIASQSSIPFEGVNNKADEE 383
            EN                            S V KH+    I+SQ S+  EG +NK DE 
Sbjct: 576  ENSRLLKHDPKCVRETSFLTESVDIVPDESSCVLKHDPKRAISSQLSMHCEGSHNKPDEG 635

Query: 384  ISFCQDVIS-SAQSFVDPLCSFVPCSVSA------VDASSPQTINVAAAQKQLLEVENLQ 436
            +   QD++  S+ S  DPLCS VPCS ++              I   A      EV+N Q
Sbjct: 636  VLHSQDMVGCSSLSLNDPLCSVVPCSFASEHDNYKTHIDKENDIEYLAPLVSDFEVDNCQ 695

Query: 437  WTSDFNAEFINRDR--------QDV---STFSVEGFENPVKR-QLTLLKPYSTLSPINDP 484
              SD N  F  RD         +D+   ST  VE     + R + T LK YS +      
Sbjct: 696  RISDKNTTFDCRDEKITSIIDGKDILLTSTEVVEQMSEKLTRVEHTCLKTYSMI------ 749

Query: 485  KLEKRSCFRNHLVPSECHEEIISLEQNIGCIWSWDKRYCKGLLSFGSASKFAAGRENEET 544
                    +N  V   C+   +S  Q+     S   R  +      S S F    EN+  
Sbjct: 750  -------LQNQDVNQNCNSTPLSTNQSDIAAASLGTRVSE------SPSAFEHSNENKNE 796

Query: 545  YKTTVTGNPYAVATNCTENNDKLVSHGAGLLEPLKESRLPPFPNRGTCRSHASKLIG--- 601
                   +  ++     + +D+L +      E   E R P   N  T      +L+G   
Sbjct: 797  VNHQHLIDQKSIIQITDDKSDELKAS-----ELTHERRSPINLNHRT----RHRLLGPKT 847

Query: 602  DVNNSFGKTNPDEAGVQETVVM------LHNLSEERNNYLETQVPIRKRIRLSEVEVDLR 655
             VN+   + N ++  V ETVV       L  +  E N  ++  V +RK++  S+   +L 
Sbjct: 848  SVNDISIEKNINQYVVAETVVQHQQNNNLIKVQVECNKSVDGHVKVRKQVHFSDKVEELH 907

Query: 656  QNRDIEKLQSSHKFHSVTR------TSEMSKNSNACRDSQLQYVKRCSTTRRKYVKRLIF 709
            Q R + KL+SS K  S  R      +  ++ ++   + S   Y   C   R ++    IF
Sbjct: 908  QKRKLSKLESSDKKCSSIRAKRQRVSKTLTTSAPHMKHSLTDY---CRGARNEF----IF 960

Query: 710  QGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIP-PPNSQGKRCSRSHSQQLPVVICPKK 768
            +G +FLLTG SSQKE+E+E LI+  GG++L DIP PPNS+GKR S     QLP+++C KK
Sbjct: 961  RGREFLLTGLSSQKERELEALIRNSGGVILYDIPSPPNSKGKRSSTLSCLQLPIILCNKK 1020

Query: 769  LKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPICRDNH 828
            L+T KFLYGCAV A IL+  WLTD +A+G+ + P KYMIL ++ D+K T   + + R N 
Sbjct: 1021 LQTAKFLYGCAVGASILRVDWLTDCLASGTILLPEKYMILPNRNDMKWTRSGKAVHRRNQ 1080

Query: 829  KYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVAENERR 888
            K+IF RVGIMLHGK SFCTKFA I+KHGGG+ FKTL WLV S + ++ ++G IV E +  
Sbjct: 1081 KHIFERVGIMLHGKHSFCTKFACIIKHGGGRAFKTLQWLVRSTDEERTLMGAIVVEEKAT 1140

Query: 889  VSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPS-PTVNVPEFRISIDWSEE 947
            +SRHL+ CA E+ I ++ +SWIIKSL+ G L+P T++    S  +V VPE R  +D SEE
Sbjct: 1141 ISRHLKSCALERDIPIMTSSWIIKSLYSGNLIPLTEENSTLSLRSVQVPEVRNDVDMSEE 1200

Query: 948  I 948
            I
Sbjct: 1201 I 1201



 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 78/189 (41%), Positives = 110/189 (58%), Gaps = 6/189 (3%)

Query: 39  KVLPLQLAETFACSGESGNQFKIDNNASDMEVLSTSSYPKTEKNRIVRSLVNDQDKVGRR 98
           + L LQ A+T    GE+ +     N+A + +V++T   P+T+K   ++SL++  D V ++
Sbjct: 160 EALSLQSAQTSVDFGENMHSM-TGNHACEQDVMATF-IPETKKKNSLKSLIHTFDSVKQQ 217

Query: 99  EEKFK----KVADVNDAVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQARL 154
           +EK K    +  D++DAVELSI+ASEALVIH+L   E  SE L T  VLE AL+VKQARL
Sbjct: 218 KEKPKDECFESNDISDAVELSIAASEALVIHDLVKMESVSETLSTEAVLEIALRVKQARL 277

Query: 155 ENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPSAGSDENFVSGSESFLIKCTPL 214
           E  ED     + E D +  +S+L+D  M  A  D+GLP   S    +S S  F  K    
Sbjct: 278 EGLEDGFFSSSMESDCSDSLSDLNDFLMEDAYADIGLPVGVSCGENLSNSNVFHAKGVSG 337

Query: 215 SENYYESND 223
            ENY E N+
Sbjct: 338 DENYNEGNN 346


>gi|296087815|emb|CBI35071.3| unnamed protein product [Vitis vinifera]
          Length = 693

 Score =  340 bits (873), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 206/443 (46%), Positives = 264/443 (59%), Gaps = 51/443 (11%)

Query: 514 CIWSWDKRYCKGLLSFGSASKFAAGRENEETYKTTVTGNPYAVATNCTENNDKLVSHGAG 573
           CI S    + + LL F S SK+A  R+ E  + T+V            + N  L+     
Sbjct: 294 CISSGRSDFTE-LLPFTSMSKYAISRDYEYNHDTSVHMP--------KKGNSPLI----- 339

Query: 574 LLEPLKESRLPPFPNRGTCRSHASKLIGDVNNSFGKTNPDEAGVQETVVMLH---NL--- 627
                          R   +  ASK    +NNS G+ N     V + +V +    NL   
Sbjct: 340 ------------LNQRMHYQLQASKHF--INNSSGEENSKGTAVPDIIVGIQQSKNLEMP 385

Query: 628 SEERNNYLETQVPIRKRIRLSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRD 687
             E +N   TQV IRKR+R SE  V   QN+++  +Q+++K       +   K       
Sbjct: 386 QPEFHNAHSTQVQIRKRVRFSEANVQHPQNKNLHNIQTAYKTRKTWILTHKVK------- 438

Query: 688 SQLQYVKRCSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPP-N 746
               Y+  C        ++LIFQG++FLLTGFSS KEKEIE LI+KYGG+VL DIP P N
Sbjct: 439 ---WYMTNCCIN-----EKLIFQGLEFLLTGFSSCKEKEIEGLIRKYGGIVLPDIPSPSN 490

Query: 747 SQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYM 806
           S+ KR S  + + LP+V+C KKL+TTKFLYGCAVN F+LK  WL +SVAAGS + P KYM
Sbjct: 491 SRRKRNSNFNQKHLPIVLCSKKLQTTKFLYGCAVNTFMLKVDWLNNSVAAGSVLPPGKYM 550

Query: 807 ILSSQADLKRTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHW 866
           ILS++ D KR  I  PI R+NHKYIF +VGIMLHGK SFC+KFA I +HGGGQVFKTL  
Sbjct: 551 ILSNEGDKKRIRIGMPIHRENHKYIFEKVGIMLHGKHSFCSKFAKIFRHGGGQVFKTLQC 610

Query: 867 LVLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDK 926
           LV SL+T+   VG IVAE+E R SRHLR CA E+KI M+PASWII SLH+G LLP T++K
Sbjct: 611 LVQSLDTESISVGAIVAEDESRASRHLRLCASERKIPMMPASWIINSLHLGRLLPLTENK 670

Query: 927 H-IPSPTVNVPEFRISIDWSEEI 948
           H  P   + V E   S+++S EI
Sbjct: 671 HSSPLTRIMVQELPTSMEFSAEI 693



 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 63/95 (66%), Positives = 75/95 (78%), Gaps = 1/95 (1%)

Query: 316 EVDVPIQLEQSNNKGIPKFYVNETSSLSESADVAPDENSFVQKHETGSKIASQSSIPFEG 375
           EV V + ++Q+N K IPK +V ETS LSESADVAPDENSFV K+ET SK+ASQSSIPFEG
Sbjct: 157 EVGVSVLMKQANPKSIPKLFVGETSFLSESADVAPDENSFVPKNETSSKLASQSSIPFEG 216

Query: 376 VNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVS 410
           + +KA+EEI     V SS+ S VDPLCS VPCS+S
Sbjct: 217 LCDKANEEILL-SQVQSSSLSLVDPLCSVVPCSIS 250



 Score =  100 bits (249), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 63/127 (49%), Positives = 81/127 (63%), Gaps = 4/127 (3%)

Query: 104 KVADVNDAVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQARLENQEDTSHY 163
           KVA+V+DAVELSI+ASEALVIHEL  +E  SE      VLE AL+VKQARLE  ED+ H+
Sbjct: 12  KVAEVDDAVELSIAASEALVIHELLKSESPSEAFPAAAVLEVALRVKQARLEMLEDSVHH 71

Query: 164 PNKEIDETHLVSELDDLTMASACEDVGLPSAGSDENFVSGSESFLIKCTPLSENYYESND 223
           P +E+D    +S+L D  M    EDVGL   GS +    GS    ++ TP+S N+   +D
Sbjct: 72  PIEEVD---FLSDLVDSAMEDVFEDVGLFDVGSHDQCSGGSIMSRVEDTPISGNHCGFDD 128

Query: 224 NGFKHVE 230
              KHV+
Sbjct: 129 KS-KHVD 134


>gi|218193963|gb|EEC76390.1| hypothetical protein OsI_14021 [Oryza sativa Indica Group]
          Length = 883

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 131/368 (35%), Positives = 200/368 (54%), Gaps = 31/368 (8%)

Query: 588 NRGTCRSHASKLIGDVNNSFGKTNPDE-AGVQETVVMLHNLSEERNNYLETQVPIRKRIR 646
           N+   R  ASK+   V N+F   N  E + V +       +SE +N+    +   RK+ +
Sbjct: 540 NKKIRRVQASKVC--VENNFEAGNLHEFSKVLKNPSYAQGVSEHQNSMKSLK---RKKAQ 594

Query: 647 LSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKYV-- 704
            SE ++  R+ ++I + Q+  +F              +  DS+L      +   R+Y+  
Sbjct: 595 FSEAKISTRKTKNIRRTQTKSRF--------------SWSDSRLID----TIEPREYIDN 636

Query: 705 KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIP--PPNSQGKRCSRSHSQQLPV 762
           K  +F G+ FLLTGF S KEKEIE LI+K+GG VL  +P  P + + K    +  + L +
Sbjct: 637 KEALFHGLDFLLTGFQSHKEKEIEPLIRKFGGYVLSRVPSCPLDKRSKLAELARCKPL-I 695

Query: 763 VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEP 822
           V+ PKK+ T KFLYGCA+N++IL   WL DS+ AG  + P KY I    A    +   + 
Sbjct: 696 VLSPKKVSTAKFLYGCAINSWILNPSWLFDSIQAGVMLPPGKYFIRQVHAMQGISMFDQS 755

Query: 823 ICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIV 882
           +    +  +F  VG ++ GK SFC+KF+ ++KHGGGQVF +L  LV SL+ + +  G+I+
Sbjct: 756 LHLRKNTLLFDGVGFLILGKISFCSKFSNVIKHGGGQVFASLQGLVQSLKDRSSSHGIIL 815

Query: 883 AENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPSPTVNVPEFR--I 940
             NE   SRHL +C  E  I   PASW+I SL+ G L+P  +D+      + +P F+   
Sbjct: 816 VANEASASRHLSYCGLEHDIKTAPASWVIGSLYSGKLIPLKKDRCASFRKIKMPSFQQPQ 875

Query: 941 SIDWSEEI 948
           + D S+EI
Sbjct: 876 AFDMSQEI 883



 Score = 98.2 bits (243), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 119/415 (28%), Positives = 176/415 (42%), Gaps = 78/415 (18%)

Query: 80  EKNRIVRSLVNDQDKVGRREEKFKKV--ADVNDAVELSISASEALVIHELSNNEPDSEDL 137
           E N+   S  + + ++  R+   +K+  ADVNDAVELSI+ASEA+VI E+   +  S D 
Sbjct: 162 EINKTCDSKGHRRQQLSGRKVDVRKLRSADVNDAVELSIAASEAMVIAEMILLDSQS-DK 220

Query: 138 LTTIVLEAALQVKQAR----LENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPS 193
           L T  LEAAL VK+AR    LE  E +      ++DET  +SELD+  M  A +DVGL  
Sbjct: 221 LATASLEAALHVKEARKQCFLEELEHSCGSSESDLDETDGLSELDETEMLDAFQDVGLS- 279

Query: 194 AGSDENFVSGSESFLIKCTPLSENYYESNDNGFKHVECLAHKKKLPADPALNLNGSYMEG 253
                         L++    S+     N +G K    ++H    P D            
Sbjct: 280 --------------LVQTACASQG---QNISGLKQE--ISHASSHPCDA----------- 309

Query: 254 KIDPMLHQSTQETSHVLAAAQPEASTNSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
                        +HVL ++ PE        WNS++A   + +  S+    +    GG  
Sbjct: 310 ------------EAHVLESS-PEKQN---IRWNSHDADTNDHVSDSLASHNKE---GGV- 349

Query: 314 GKEVDVPIQLEQSNNKGIPKFYVNETSSLSESADVA---PDENSFVQKHETGSKIAS-QS 369
                V +Q      K +   +  ETS +SES D     P  +         S  AS   
Sbjct: 350 -----VAVQTNVGTRKHVKGLFNKETSFISESMDGMDEFPSPSRIASMEMAASSRASFLH 404

Query: 370 SIPFEGVNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVS---AVDASSPQTINVAAAQ 426
            I      N+  E    C  V+ S  S VDPLCS VPCS+S      + +P+ I  +   
Sbjct: 405 KIKGSCEENQGAEAAQLCSQVVCSNLSLVDPLCSIVPCSISFNEGPPSQAPECIQ-SKGD 463

Query: 427 KQLLEVENLQWTSDFNAEFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSPI 481
           K+L+  +      D   E        VS   +       +R+ + L+P+ST++PI
Sbjct: 464 KELISTKEFPSKQDLEGE-AGPSCTPVSNILLR------RRKYSSLRPFSTIAPI 511


>gi|32129326|gb|AAP73853.1| unknown protein [Oryza sativa Japonica Group]
          Length = 850

 Score =  209 bits (531), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 128/367 (34%), Positives = 199/367 (54%), Gaps = 29/367 (7%)

Query: 588 NRGTCRSHASKLIGDVNNSFGKTNPDE-AGVQETVVMLHNLSEERNNYLETQVPIRKRIR 646
           N+   R  ASK+   V N+F   N  E + V +       +SE +N+    +   RK+ +
Sbjct: 507 NKKIRRVQASKVC--VENNFEAGNLHEFSKVLKNPSYAQGVSEHQNSMKSLK---RKKAQ 561

Query: 647 LSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKYV-- 704
            SE ++  R+ ++I + Q+  +F              +  DS+L      +   R+Y+  
Sbjct: 562 FSEAKISTRKTKNIRRTQTKSRF--------------SWSDSRLID----TIEPREYIDN 603

Query: 705 KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP-PNSQGKRCSRSHSQQLPVV 763
           K  +F G+ FLLTGF S KEKEIE LI+K+GG VL  +P  P  +  + +     + P+V
Sbjct: 604 KEALFHGLDFLLTGFQSHKEKEIEPLIRKFGGYVLSRVPSCPLDKRSKLAELARCKPPIV 663

Query: 764 ICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPI 823
           + PKK+ T KFLYGCA+N++IL   WL DS+ AG  + P KY I    +    +   + +
Sbjct: 664 LSPKKVSTAKFLYGCAINSWILNPSWLFDSIQAGVMLPPGKYFIRQVHSMQGISMFDQSL 723

Query: 824 CRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVA 883
               +  +F  VG ++ GK SFC+KF+ ++KHGGGQVF +L  LV SL+ + +  G+I+ 
Sbjct: 724 HLRKNTLLFDGVGFLILGKISFCSKFSNVIKHGGGQVFASLQGLVQSLKDRSSSHGIILV 783

Query: 884 ENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPSPTVNVPEFR--IS 941
            +E   SRHL +C  E  I   PASW+I SL+ G L+P  +D+      + +P F+   +
Sbjct: 784 ASEASASRHLSYCGLEHDIKTAPASWVIGSLYSGKLIPLKKDRCASFRKIKMPSFQQPQA 843

Query: 942 IDWSEEI 948
            D S+EI
Sbjct: 844 FDMSQEI 850



 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 118/414 (28%), Positives = 175/414 (42%), Gaps = 78/414 (18%)

Query: 80  EKNRIVRSLVNDQDKVGRREEKFKKV--ADVNDAVELSISASEALVIHELSNNEPDSEDL 137
           E N+   S  + + ++  R+   +K+  ADVNDAVELSI+ASEA+VI E+   +  S D 
Sbjct: 129 EINKTCDSKGHRRQQLSGRKVDVRKLRSADVNDAVELSIAASEAMVIAEMILLDSQS-DK 187

Query: 138 LTTIVLEAALQVKQAR----LENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPS 193
           L T  LEAAL VK+AR    LE  E +      ++DET  +SELD+  M  A +DVGL  
Sbjct: 188 LATASLEAALHVKEARKQCFLEELEHSCGSSESDLDETDRLSELDETEMLDAFQDVGLS- 246

Query: 194 AGSDENFVSGSESFLIKCTPLSENYYESNDNGFKHVECLAHKKKLPADPALNLNGSYMEG 253
                         L++    S+     N +G K    ++H    P D            
Sbjct: 247 --------------LVQTACASQG---QNISGLKQE--ISHASSHPCDA----------- 276

Query: 254 KIDPMLHQSTQETSHVLAAAQPEASTNSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
                        +HVL ++ PE        WNS++A   + +  S+    +    GG  
Sbjct: 277 ------------EAHVLESS-PEKQN---IRWNSHDADTNDHVSDSLASHNKE---GGV- 316

Query: 314 GKEVDVPIQLEQSNNKGIPKFYVNETSSLSESADVA---PDENSFVQKHETGSKIAS-QS 369
                V +Q      K +   +  ETS +SES D     P  +         S  AS   
Sbjct: 317 -----VAVQTNVGTRKHVKGLFNKETSFISESMDGMDEFPSPSRIASMEMAASSRASFLH 371

Query: 370 SIPFEGVNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVS---AVDASSPQTINVAAAQ 426
            I      N+  E    C  V+ S  S VDPLCS VPCS+S      + +P+ I  +   
Sbjct: 372 KIKGSCEENQGAEAAQLCSQVVCSNLSLVDPLCSIVPCSISFNEGPPSQAPECIQ-SKGD 430

Query: 427 KQLLEVENLQWTSDFNAEFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSP 480
           K+L+  +      D   E        VS   +       +R+ + L+P+ST++P
Sbjct: 431 KELISTKEFPSKQDLEGE-AGPSCTPVSNILLR------RRKYSSLRPFSTIAP 477


>gi|115456137|ref|NP_001051669.1| Os03g0811200 [Oryza sativa Japonica Group]
 gi|50540742|gb|AAT77898.1| expressed protein [Oryza sativa Japonica Group]
 gi|108711696|gb|ABF99491.1| BRCA1 C Terminus domain containing protein, expressed [Oryza sativa
           Japonica Group]
 gi|113550140|dbj|BAF13583.1| Os03g0811200 [Oryza sativa Japonica Group]
 gi|222626025|gb|EEE60157.1| hypothetical protein OsJ_13067 [Oryza sativa Japonica Group]
          Length = 883

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/367 (34%), Positives = 199/367 (54%), Gaps = 29/367 (7%)

Query: 588 NRGTCRSHASKLIGDVNNSFGKTNPDE-AGVQETVVMLHNLSEERNNYLETQVPIRKRIR 646
           N+   R  ASK+   V N+F   N  E + V +       +SE +N+    +   RK+ +
Sbjct: 540 NKKIRRVQASKVC--VENNFEAGNLHEFSKVLKNPSYAQGVSEHQNSMKSLK---RKKAQ 594

Query: 647 LSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKYV-- 704
            SE ++  R+ ++I + Q+  +F              +  DS+L      +   R+Y+  
Sbjct: 595 FSEAKISTRKTKNIRRTQTKSRF--------------SWSDSRLID----TIEPREYIDN 636

Query: 705 KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP-PNSQGKRCSRSHSQQLPVV 763
           K  +F G+ FLLTGF S KEKEIE LI+K+GG VL  +P  P  +  + +     + P+V
Sbjct: 637 KEALFHGLDFLLTGFQSHKEKEIEPLIRKFGGYVLSRVPSCPLDKRSKLAELARCKPPIV 696

Query: 764 ICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPI 823
           + PKK+ T KFLYGCA+N++IL   WL DS+ AG  + P KY I    +    +   + +
Sbjct: 697 LSPKKVSTAKFLYGCAINSWILNPSWLFDSIQAGVMLPPGKYFIRQVHSMQGISMFDQSL 756

Query: 824 CRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVA 883
               +  +F  VG ++ GK SFC+KF+ ++KHGGGQVF +L  LV SL+ + +  G+I+ 
Sbjct: 757 HLRKNTLLFDGVGFLILGKISFCSKFSNVIKHGGGQVFASLQGLVQSLKDRSSSHGIILV 816

Query: 884 ENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPSPTVNVPEFR--IS 941
            +E   SRHL +C  E  I   PASW+I SL+ G L+P  +D+      + +P F+   +
Sbjct: 817 ASEASASRHLSYCGLEHDIKTAPASWVIGSLYSGKLIPLKKDRCASFRKIKMPSFQQPQA 876

Query: 942 IDWSEEI 948
            D S+EI
Sbjct: 877 FDMSQEI 883



 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 118/414 (28%), Positives = 175/414 (42%), Gaps = 78/414 (18%)

Query: 80  EKNRIVRSLVNDQDKVGRREEKFKKV--ADVNDAVELSISASEALVIHELSNNEPDSEDL 137
           E N+   S  + + ++  R+   +K+  ADVNDAVELSI+ASEA+VI E+   +  S D 
Sbjct: 162 EINKTCDSKGHRRQQLSGRKVDVRKLRSADVNDAVELSIAASEAMVIAEMILLDSQS-DK 220

Query: 138 LTTIVLEAALQVKQAR----LENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPS 193
           L T  LEAAL VK+AR    LE  E +      ++DET  +SELD+  M  A +DVGL  
Sbjct: 221 LATASLEAALHVKEARKQCFLEELEHSCGSSESDLDETDRLSELDETEMLDAFQDVGLS- 279

Query: 194 AGSDENFVSGSESFLIKCTPLSENYYESNDNGFKHVECLAHKKKLPADPALNLNGSYMEG 253
                         L++    S+     N +G K    ++H    P D            
Sbjct: 280 --------------LVQTACASQG---QNISGLKQE--ISHASSHPCD------------ 308

Query: 254 KIDPMLHQSTQETSHVLAAAQPEASTNSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
                        +HVL ++ PE        WNS++A   + +  S+    +    GG  
Sbjct: 309 -----------AEAHVLESS-PEKQN---IRWNSHDADTNDHVSDSLASHNKE---GGV- 349

Query: 314 GKEVDVPIQLEQSNNKGIPKFYVNETSSLSESADVA---PDENSFVQKHETGSKIAS-QS 369
                V +Q      K +   +  ETS +SES D     P  +         S  AS   
Sbjct: 350 -----VAVQTNVGTRKHVKGLFNKETSFISESMDGMDEFPSPSRIASMEMAASSRASFLH 404

Query: 370 SIPFEGVNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVS---AVDASSPQTINVAAAQ 426
            I      N+  E    C  V+ S  S VDPLCS VPCS+S      + +P+ I  +   
Sbjct: 405 KIKGSCEENQGAEAAQLCSQVVCSNLSLVDPLCSIVPCSISFNEGPPSQAPECIQ-SKGD 463

Query: 427 KQLLEVENLQWTSDFNAEFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSP 480
           K+L+  +      D   E        VS   +       +R+ + L+P+ST++P
Sbjct: 464 KELISTKEFPSKQDLEGE-AGPSCTPVSNILLR------RRKYSSLRPFSTIAP 510


>gi|357114949|ref|XP_003559256.1| PREDICTED: uncharacterized protein LOC100844410 [Brachypodium
           distachyon]
          Length = 931

 Score =  205 bits (521), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 107/254 (42%), Positives = 151/254 (59%), Gaps = 6/254 (2%)

Query: 701 RKYV--KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPN-SQGKRCSRSHS 757
           R+Y+  K  IF G++FLLTG  S KEKEIE  I+K+GG +L  +PP    +  + +    
Sbjct: 678 REYIDKKEAIFHGLEFLLTGLQSHKEKEIESAIRKFGGCILSKVPPCTFDKRSKLAEFAR 737

Query: 758 QQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRT 817
            + PVV+  KK+ T KFLYGCA +++IL   WL DS+ AG  + P KY+I    A  +  
Sbjct: 738 WKPPVVLSSKKVSTAKFLYGCATDSWILNPNWLFDSIEAGVLLPPGKYLIRQRHAVKESL 797

Query: 818 GITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNI 877
              + +   N + +F  VG ++HGK SFC+KF+ I+KHGGGQVF +L  L+ SL+ + + 
Sbjct: 798 TFGQSVHLRNDRLVFRGVGFLIHGKISFCSKFSNIIKHGGGQVFVSLEGLIRSLKDRSSS 857

Query: 878 VGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPSPTVNVPE 937
            G+I+  NE   SRHL HC  E  I  VPASWII SL  G L P  +D+      + +P 
Sbjct: 858 HGIILVANEASASRHLSHCGMEHDIKTVPASWIISSLFSGKLNPLKKDRCASFRRIKMPS 917

Query: 938 FRI---SIDWSEEI 948
           F+    + D S+EI
Sbjct: 918 FQPRHGAFDMSQEI 931



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 60/97 (61%), Gaps = 8/97 (8%)

Query: 100 EKFKKVADVNDAVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQARL----- 154
           +K + V D NDAVELSI+ASEA++I E+  N+    D L    LE+AL VK+AR      
Sbjct: 191 QKLRNV-DANDAVELSIAASEAILIAEMIINDCQP-DKLAAAALESALHVKEARKQYCLE 248

Query: 155 ENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGL 191
           E + D   + N ++DE   ++ELD++ M  A EDVGL
Sbjct: 249 ETEHDCGSFEN-DLDERDWLAELDEVEMLDAFEDVGL 284


>gi|356539174|ref|XP_003538075.1| PREDICTED: uncharacterized protein LOC100808072 [Glycine max]
          Length = 186

 Score =  196 bits (498), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 102/185 (55%), Positives = 129/185 (69%), Gaps = 3/185 (1%)

Query: 767 KKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPICRD 826
           +KL+TTKFLYGCAV A ILK  W+TD VA+ + + P KYMIL ++ D+K T I   I   
Sbjct: 2   RKLQTTKFLYGCAVGASILKVDWITDCVASRTILQPEKYMILPNRKDMKWTRIGTTIHHR 61

Query: 827 NHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVAENE 886
           N K IF RVGI+LHGKPSFCTK A I+KHGGG VFKTL  L  S + ++ +VG IV E++
Sbjct: 62  NQKDIFERVGILLHGKPSFCTKLACIIKHGGGHVFKTLQGLEWSTDEERTLVGAIVVEDK 121

Query: 887 RRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKH---IPSPTVNVPEFRISID 943
             +SRHL+HCA E+ I ++P SWIIKSL+ G LLP T++K+   +P   V V E   S D
Sbjct: 122 ATISRHLKHCAKERNIPIMPFSWIIKSLYSGKLLPFTEEKNTLSLPFVNVKVSEVPSSSD 181

Query: 944 WSEEI 948
            SEEI
Sbjct: 182 MSEEI 186


>gi|414873573|tpg|DAA52130.1| TPA: hypothetical protein ZEAMMB73_979881 [Zea mays]
          Length = 1017

 Score =  174 bits (440), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 100/248 (40%), Positives = 139/248 (56%), Gaps = 23/248 (9%)

Query: 642 RKRIRLSEVEVDLRQNRDIEKLQSSHKF-HSVTRTSEMSKNSNACRDSQLQYVKRCSTTR 700
           RKR++ SE +   R+ ++  ++ +  +F  S +RT EM +                    
Sbjct: 672 RKRVQFSEAKRSSRRTKNNRRILTKSRFSRSDSRTGEMLET------------------- 712

Query: 701 RKYV--KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP-PNSQGKRCSRSHS 757
           R+Y   K   FQG++FLLTGF  QKEKEIE LI+K GG VL  + P P  + K  +   S
Sbjct: 713 REYTDNKEDTFQGVEFLLTGFPKQKEKEIESLIRKSGGYVLSKVSPFPLDKRKNMAEFPS 772

Query: 758 QQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRT 817
              P+V+ PKK+ T KFLYGCA++A+IL   WL DS+ AG  + P KY+I    A    +
Sbjct: 773 WNPPIVLSPKKVSTAKFLYGCAIDAWILNPIWLFDSLQAGIMLPPGKYLIRQRNAQKHNS 832

Query: 818 GITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNI 877
               P+    +  IF  VG ++HGK SFC+KF+ IVKHGGGQVF +L  LV SL+     
Sbjct: 833 SFDRPLHPKCNTLIFDGVGFLIHGKISFCSKFSNIVKHGGGQVFISLQGLVQSLKDGSTS 892

Query: 878 VGVIVAEN 885
            G+I+  N
Sbjct: 893 RGIILIAN 900



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 80/164 (48%), Gaps = 26/164 (15%)

Query: 45  LAETFACSGE-------SGNQFKIDNNASDMEVL----STSSYPKTEKNRIVRSLVNDQD 93
           L+E+    GE       S NQF+ ++   DM         S  P T ++R  R   N   
Sbjct: 93  LSESLCADGEEQEVNAVSQNQFEANDPQVDMPTKVARKEISIQPDTIRHR--RHGFNG-- 148

Query: 94  KVGRREEKFKKVADVNDAVELSISASEALVIHE--LSNNEPDSEDLLTTIVLEAALQVKQ 151
             G+ + +  +  DVNDA+ELSI+ASEA+VI E  L +++PD         +EAAL VK+
Sbjct: 149 --GKVDVRKLRNVDVNDAIELSIAASEAMVIAEMILDDSQPDKS---AAAAIEAALHVKE 203

Query: 152 AR----LENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGL 191
           AR     E  +         +DE   ++ELD+  M  A +DV L
Sbjct: 204 ARKQFYFEEPQHACGSSEGGLDEIDWLAELDESEMVDAYQDVAL 247


>gi|449527384|ref|XP_004170691.1| PREDICTED: uncharacterized LOC101203785, partial [Cucumis sativus]
          Length = 504

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 131/378 (34%), Positives = 189/378 (50%), Gaps = 70/378 (18%)

Query: 64  NASDMEVLSTSSYPKTEKNRIVRSLVNDQDKVGRREEKFK----KVADVNDAVELSISAS 119
           NA D ++   SSY     N   +SL N +D+  R+ EK      K A+V+DA+ELS+ AS
Sbjct: 146 NAGDTDLAPHSSYKDVLHNVGCQSLTNTEDRENRQGEKLDVGCLKNAEVSDAIELSVVAS 205

Query: 120 EALVIHELSNNEPDSEDLLTTIVLEAALQVKQARLENQEDTSHYPNKEIDETHLVSELDD 179
           EALVIHEL  +E DS  +    VLEA++Q                  E+D +  +S+LD+
Sbjct: 206 EALVIHELLKDELDSAAVSVEAVLEASIQ------------------EVDLSDSLSDLDN 247

Query: 180 LTMASACEDVGLPSAGSDENFVSGSESFLIKCTPLSENYYESND---------------N 224
            TM  A +DVGLPS+  + +  SG+  F ++ TP+++N +                   N
Sbjct: 248 STMRDAFDDVGLPSSILNSDH-SGTACFDVQDTPVNKNEFTHGSQCNSIDMTSQPDILGN 306

Query: 225 GF--KHVE---CLAHKKKLPA-DPALNLN---------GSYMEG--KIDPMLHQSTQETS 267
           G   K +E    +     LP  D + N+          GS      K D ML   TQ  S
Sbjct: 307 GLTLKQLEENLVVTRPVGLPMEDLSCNIQHQLSNDDVLGSTSTNYCKYDSMLQHPTQNES 366

Query: 268 HVLAAAQPEAST--------------NSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
                 Q   S+              +S++  +  +A   E +     +RF+SRW GGW+
Sbjct: 367 DEFVVKQKIVSSIVNTNLCTIHAKENSSLHESSKVSAKNDELVAFFTPERFKSRWLGGWS 426

Query: 314 GKEVDVPIQLEQS-NNKGIPKFYVNETSSLSESADVAPDENSFVQKHETGSKIASQSSIP 372
           GKEVDV  QL Q  + K IP  +VNETS LSESAD+APDENS VQ+ E+  ++ASQSSI 
Sbjct: 427 GKEVDVSEQLRQDVDGKTIPLMFVNETSFLSESADIAPDENSCVQRCESKFQVASQSSIH 486

Query: 373 FEGVNNKADEEISFCQDV 390
           F  ++ K D+ +   +++
Sbjct: 487 FGHLDEKGDDGLLVAEEI 504


>gi|242037659|ref|XP_002466224.1| hypothetical protein SORBIDRAFT_01g003850 [Sorghum bicolor]
 gi|241920078|gb|EER93222.1| hypothetical protein SORBIDRAFT_01g003850 [Sorghum bicolor]
          Length = 852

 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 83/225 (36%), Positives = 119/225 (52%), Gaps = 10/225 (4%)

Query: 584 PPFPNRGTCRSHASKLIGDVNNSFGKTN-PDEAGVQETVVMLHNLSEERNNYLETQVPIR 642
           P   N+   R  A+K    +  S G  N  D + VQ+      + SE++    E QVP  
Sbjct: 634 PIILNKNVQRVQAAKKF--IQYSAGAENLQDFSAVQKQTYYPQDDSEDQ--IRELQVP-- 687

Query: 643 KRIRLSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRK 702
           + +  S   +  +Q+   +++Q S   HS  RT    +  N  R S+       +   R+
Sbjct: 688 RDVFPSTENLVGKQHLKRKRVQFSEANHSSRRTKINRRILNKSRFSRSDSRTGQTLEARE 747

Query: 703 YV--KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP-PNSQGKRCSRSHSQQ 759
           Y   K   FQG++FLLTGF +QKEKEIE LI+K GG VL  +PP P  + K  +   S +
Sbjct: 748 YTDNKEATFQGVEFLLTGFPNQKEKEIESLIRKSGGYVLSKVPPFPLDKRKSMAEFPSWK 807

Query: 760 LPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAK 804
            P+V+ PKK+ T KFLYGCA++A+IL   WL DS+ AG  + P K
Sbjct: 808 PPIVLSPKKVSTAKFLYGCAIDAWILNPNWLFDSLQAGVMLPPGK 852



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 74/145 (51%), Gaps = 14/145 (9%)

Query: 55  SGNQFKIDNNASDM----EVLSTSSYPKTEKNRIVRSLVNDQDKVGRREEKFKKVADVND 110
           S NQF+ +N  ++M    EV       + + NR  R  V      G+ + +  + ADVND
Sbjct: 143 SQNQFEANNPQAEMLTVTEVACKEISKQLDTNRHKRHDVYG----GKVDVRKLRNADVND 198

Query: 111 AVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQAR----LENQEDTSHYPNK 166
           A+ELSI+ASEA+VI E+  +     D      +EAAL VK+AR     E  E        
Sbjct: 199 AIELSIAASEAMVIAEMILD--SQSDKSAAAAIEAALHVKEARKQFYFEEPEHACGSSED 256

Query: 167 EIDETHLVSELDDLTMASACEDVGL 191
            +DET  ++ELD+  M    +DVGL
Sbjct: 257 GLDETDWLAELDESEMVDVYQDVGL 281


>gi|147822457|emb|CAN66205.1| hypothetical protein VITISV_041149 [Vitis vinifera]
          Length = 281

 Score = 69.7 bits (169), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 36/67 (53%), Positives = 45/67 (67%)

Query: 839 LHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVAENERRVSRHLRHCAF 898
           L+ +P      ++   HGGGQVFKTL  LV SL+T+   VG IVAE+E R SRHLR CA 
Sbjct: 5   LYERPKETEWLSLSYMHGGGQVFKTLQCLVQSLDTESISVGAIVAEDESRASRHLRLCAS 64

Query: 899 EKKIAMV 905
           E+KI M+
Sbjct: 65  ERKIPMM 71


>gi|303272239|ref|XP_003055481.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463455|gb|EEH60733.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1435

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/182 (32%), Positives = 90/182 (49%), Gaps = 24/182 (13%)

Query: 700  RRKYVKRLIFQGIKFLLTGFSSQKEKE-IEVLIQKYGGLVLLDIPPPNSQ---------G 749
            RRK  K   F G+ FLL+G  S+ + E +  LI+ +GG V  ++PPP +          G
Sbjct: 1179 RRKPTK--TFAGMTFLLSGMPSKNDVEDLTELIELHGGEVEAEVPPPETPDNFTSSPEVG 1236

Query: 750  KRCSR--SHSQQLPV---VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAK 804
             R S+  +  ++  V   VI PK  +T K LY  AV A +LK  W+T S+ AG  +SPA 
Sbjct: 1237 ARMSQFAAPPRRARVRMRVITPKPGRTLKHLYAAAVGAPLLKPDWITASLDAGRALSPAS 1296

Query: 805  Y---MILSSQ--ADLKRTGITEPICRDNHKYIF-GRVGIMLHGKPSFCTKFAVIVKHGGG 858
                ++LS +  A        +    D+   +F GRV   L G   F  +F ++++H G 
Sbjct: 1297 TPPGVLLSKRHAAIASDDDDDDDDDDDDDGGLFSGRVAA-LCGDDRFTREFGLLLRHAGA 1355

Query: 859  QV 860
             V
Sbjct: 1356 DV 1357


>gi|424513323|emb|CCO65945.1| unnamed protein product [Bathycoccus prasinos]
          Length = 1147

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/269 (22%), Positives = 114/269 (42%), Gaps = 33/269 (12%)

Query: 665  SSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKYVKRL--IFQGIKFLLTGF-SS 721
            ++ K    T++ + S+  N+  DS     KR S T      +   +F G+ F++T   SS
Sbjct: 890  ATDKRRKSTQSVQQSRRRNSVEDS---AEKRKSITGGGGGGKTTALFSGLSFMITSMPSS 946

Query: 722  QKEKEIEVLIQKYGGLVLLDIPPPNSQGKRCSRSHSQQLPVVI--CPKKLKTTKFLYGCA 779
            +   E+   I + GG +   IP  +S  +  +R     LP +     K+ +T K LY  A
Sbjct: 947  RVMNEMRASITENGGRLFDSIPKFSSSKRVPARKQIVVLPTMNNNSEKRKETAKCLYARA 1006

Query: 780  VNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPICRDNHKYIFGRVGIML 839
            + A +   +W+ +S+         K  I+ +         T+P      K  F     +L
Sbjct: 1007 IGARLRSVRWVRESL--------EKNRIMENDGQ----NATQPT-----KLNFQGTSFLL 1049

Query: 840  HGKPSFCTKFAVIVKHGGGQVF--------KTLHWLVLSLETQKNIVGVIVAENERRVSR 891
             G   F  +F+ ++ H G            K L+  +   + ++N +  +V ++  ++SR
Sbjct: 1050 LGNTFFQDEFSNLMLHAGANTVMTCGKMTKKALNQALKRAQEEQNEIHAVVIQSSEKLSR 1109

Query: 892  HLRHCAFEKKIAMVPASWIIKSLHIGMLL 920
             ++    E +  +V   W+ +SL  G +L
Sbjct: 1110 EIKSALSESRKIVVRHDWLTESLQRGEIL 1138


>gi|328870861|gb|EGG19233.1| hypothetical protein DFA_02018 [Dictyostelium fasciculatum]
          Length = 1006

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/221 (24%), Positives = 97/221 (43%), Gaps = 19/221 (8%)

Query: 708 IFQGIKFLLTGFSSQKEKEIEV--LIQKYGGLVLLDIPPPNSQGKRCS-RSHSQQ-LPVV 763
           IF+G  FLLTG     + EI +  L+ KYGG +L    P +   K  S RS + Q + ++
Sbjct: 774 IFKGYNFLLTGIPEVGDPEINISTLVLKYGGKIL----PVHDFFKAYSYRSKANQSIYLI 829

Query: 764 ICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPI 823
           +  ++ KT K+L G       +  KW+ DS+     +    Y + S     K   +   +
Sbjct: 830 LASEQQKTFKYLSGIVHGIPCINKKWVLDSIDQEEPLPYKPYRLPSGYCLSKDQNV---L 886

Query: 824 CRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVA 883
            +     I     I +  +  F  ++  ++K  G +V K     ++S+  + +    IV 
Sbjct: 887 LKHGSNAILSNTRIEIFAESVFLEEWTTLLKSLGAKVVKR----IISVNDRVD----IVL 938

Query: 884 ENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQ 924
            + +++S  L       KI +V   W+I+S+     LP  Q
Sbjct: 939 TDGKKISSVLLKYTNAMKIPVVTTDWVIQSILEKTKLPYNQ 979


>gi|255080150|ref|XP_002503655.1| predicted protein [Micromonas sp. RCC299]
 gi|226518922|gb|ACO64913.1| predicted protein [Micromonas sp. RCC299]
          Length = 1388

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/235 (22%), Positives = 91/235 (38%), Gaps = 24/235 (10%)

Query: 708  IFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIP--------------PPNSQGKRCS 753
             F  + F L+GF   +   +  LI + GG+V   IP              PP    +  +
Sbjct: 1143 TFADLTFYLSGFGRDESAALAELIVRNGGVVESQIPSPGDGQTLEPFTPSPPAGAPQPVA 1202

Query: 754  RSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGS-TVSPAKY---MILS 809
               +    VV      +T K LY  A+ A ++   W+ ++ A G+ ++SP++     I+S
Sbjct: 1203 PDPADYTRVVTPTSMGRTLKCLYASAIGAPVVTPAWVREAAATGNPSLSPSRTPPGSIIS 1262

Query: 810  SQADLKRTGIT-EPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLV 868
                  R   +  P      + IF  +   L G   F  +F  +++H G  V        
Sbjct: 1263 RGGRPSRDKTSGHPGSSRVFERIFAGLAFALSGDDGFHAQFGALLEHAGASVVDGSELTD 1322

Query: 869  LSLETQKNIVGVIVAEN---ERRVSRH--LRHCAFEKKIAMVPASWIIKSLHIGM 918
             + + +     ++V +    ERR  R   LR  A    +  V   W ++SL  G 
Sbjct: 1323 DAGDDRDARCDLVVVQESPAERRGFRGSALRRAAERHGVPCVWHPWAVESLLRGF 1377


>gi|308810240|ref|XP_003082429.1| unnamed protein product [Ostreococcus tauri]
 gi|116060897|emb|CAL57375.1| unnamed protein product [Ostreococcus tauri]
          Length = 672

 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 91/229 (39%), Gaps = 28/229 (12%)

Query: 694 KRCSTTRRKYVKR--LIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPNSQGKR 751
           KR   T++  VKR   +F  + FLL+G    +    + LI+++GG VL     P+S    
Sbjct: 440 KRARDTKKATVKRSEKLFASLHFLLSGLDKPQGSTTKELIREHGGFVLT---APSSD--- 493

Query: 752 CSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQ 811
                S    +VI PK  +T K LYG A         W+   V A          ++   
Sbjct: 494 ----VSVNSVIVITPKMGRTMKCLYGVAAGTRFATPSWVEACVEA--------RELIDLD 541

Query: 812 ADLKRTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSL 871
             L   G      R +   +F  V   + G  +F   F  +++H G         L  + 
Sbjct: 542 EPLDDNGRMRAKHRSSAGKLFRDVKTAITGNDAFVKDFTSLLRHAGAT-------LEPNP 594

Query: 872 ETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLL 920
           +T +    +I+   E+  S  +R  +    +  V   W+++S+  G LL
Sbjct: 595 QTAERFDYLIIQSGEKPHSAWIR-ASKRLDVPCVRHEWLVESILAGELL 642


>gi|312071487|ref|XP_003138631.1| hypothetical protein LOAG_03046 [Loa loa]
 gi|307766206|gb|EFO25440.1| hypothetical protein LOAG_03046 [Loa loa]
          Length = 1067

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 57/240 (23%), Positives = 101/240 (42%), Gaps = 51/240 (21%)

Query: 708  IFQGIKFLLT-----------GFSSQKEKEIEVLIQKYGGLVLLDI---PPPNSQGKRCS 753
            IF GI F++T           GFS   +KEI  +I+  GG V+ D+   P    QGK   
Sbjct: 848  IFNGITFVVTSALRKNREGEQGFS---KKEIRCMIENGGGKVIDDVMKLP----QGK--- 897

Query: 754  RSHSQQLPV-VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQA 812
                   P+ +I     +T K+L   A +   +  +W++D       +   KYM+ +  +
Sbjct: 898  -------PIYLIADTHYRTHKYLTALARSIPCVSNQWISDCAKENKLLDHMKYMLPAGVS 950

Query: 813  DLKRTGITEPICRDN-----HKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWL 867
             L  TG  +P   +N      K IF     + +  P+F   +  I+ H GG V       
Sbjct: 951  LL--TGDMKPWHSNNGTLLSGKRIFISSNNVFYDMPNFSQIWTPIINHMGGTVVP----- 1003

Query: 868  VLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKS-LHIGMLLPQTQDK 926
            V+  E      G+ +   +   +  + + A  +   +V + WII++ +H  +  P+  ++
Sbjct: 1004 VIPTE------GLDILLTDASCTEEVLNIARSQGATVVSSEWIIQAIIHGSLPAPEAHER 1057


>gi|402593984|gb|EJW87911.1| hypothetical protein WUBG_01177 [Wuchereria bancrofti]
          Length = 1086

 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 58/240 (24%), Positives = 103/240 (42%), Gaps = 51/240 (21%)

Query: 708  IFQGIKFLLT-----------GFSSQKEKEIEVLIQKYGGLVLLDI---PPPNSQGKRCS 753
            IF GI F++T           GFS   +KEI  +I+  GG V+ D+   P    QGK   
Sbjct: 867  IFDGITFVVTSALRKNREGEQGFS---KKEIRCMIENGGGKVIDDVMKLP----QGK--- 916

Query: 754  RSHSQQLPV-VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQA 812
                   P+ +I     +T K+L   A +   +  +W++D       +   KYM+ +  +
Sbjct: 917  -------PIYLIADTHYRTHKYLTALARSIPCVSNQWISDCAKENKLLDHMKYMLPAGVS 969

Query: 813  DLKRTGITEPICRDNHKYIFG-RVGI----MLHGKPSFCTKFAVIVKHGGGQVFKTLHWL 867
             L  TG  +P   +N   + G RV I    + +  P+F   +  I+ H GG V       
Sbjct: 970  LL--TGDMKPWHSNNGTLLSGKRVFISSNNVFYDMPNFSQIWTPIINHMGGTVVP----- 1022

Query: 868  VLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKS-LHIGMLLPQTQDK 926
            V+  E      G+ +   +   +  + + A  +   +V + WII++ +H  +  P+  ++
Sbjct: 1023 VIPEE------GLDILLTDASCTEEILNTARSQGATVVSSEWIIQAIIHGSLPAPEAHER 1076


>gi|170580376|ref|XP_001895237.1| hypothetical protein [Brugia malayi]
 gi|158597922|gb|EDP35939.1| conserved hypothetical protein [Brugia malayi]
          Length = 983

 Score = 48.5 bits (114), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 66/284 (23%), Positives = 121/284 (42%), Gaps = 55/284 (19%)

Query: 668 KFHSV-TRTSEMSKNSNACRDSQLQYVKRCSTTRRKYVKRL---IFQGIKFLLT------ 717
           K H++ ++TS +  +       +LQ  +  S++     K+    IF GI F++T      
Sbjct: 720 KQHTIPSKTSSLKGDEALLEPEELQEQQHESSSTNGSAKKALDKIFDGITFVVTSALRKN 779

Query: 718 -----GFSSQKEKEIEVLIQKYGGLVLLDI---PPPNSQGKRCSRSHSQQLPV-VICPKK 768
                GFS   +KEI  +I+  GG V+ D+   P    QGK          P+ +I    
Sbjct: 780 REGEQGFS---KKEIRCMIENGGGKVIDDVMKLP----QGK----------PIYLIADTH 822

Query: 769 LKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPICRDNH 828
            +T K+L   A +   +  +W++D       +   KYM+ +  + L  TG   P   +N 
Sbjct: 823 YRTHKYLTALARSIPCVSNQWISDCAKENKLLDHMKYMLPAGVSLL--TGDMRPWHSNNG 880

Query: 829 KYIFG-RVGI----MLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVA 883
             + G RV I    + +  P+F   +  I+ H GG V       V+  E      G+ + 
Sbjct: 881 TLLSGKRVFISSNNVFYDMPNFSQIWTPIINHMGGTVVP-----VIPEE------GLDIL 929

Query: 884 ENERRVSRHLRHCAFEKKIAMVPASWIIKS-LHIGMLLPQTQDK 926
             +   +  + + A  +   +V + WII++ +H  +  P+  ++
Sbjct: 930 LTDASCTEEILNVARSQGATVVSSEWIIQAIIHGSLPAPEAHER 973


>gi|440801631|gb|ELR22641.1| BRCA1 C Terminus (BRCT) domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 972

 Score = 46.2 bits (108), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 50/218 (22%), Positives = 96/218 (44%), Gaps = 23/218 (10%)

Query: 708 IFQGIKFLLTGFSSQKEKE--------IEVLIQKYGGLVLLDIPPPNSQGKRCSRSHSQQ 759
           +F+G++F++TG S + +KE        I  LI + GG +L +    +   +   RS    
Sbjct: 749 LFRGLQFIVTGLSGEDDKENDDRSAERINRLIGEEGGKLLGNFANASLASQCKKRSGYPG 808

Query: 760 LP--VVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRT 817
           LP  +++  K   T K+L   A     +   WL         +  + Y++ +  +  K +
Sbjct: 809 LPSTILLSAKPRTTLKYLLSVAAGLPTVHYNWLIHCCQQRCLLPHSLYLLPAGYSVEKGS 868

Query: 818 GITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNI 877
            +     +     IF    + + G P F   + +++K GG +V + L         Q++ 
Sbjct: 869 DV---FIKKPRGDIFEGTRVEVVGTPEFKNLWKLVLKEGGAKVVERL------FTPQEDR 919

Query: 878 VGVIVAENERRVSRHLRHCAFEK-KIAMVPASWIIKSL 914
           V  IV+  +     +L+   F+K ++ +V   WII+SL
Sbjct: 920 VECIVSPPD--PDDYLK-VKFKKLRVPVVGKQWIIQSL 954


>gi|443693208|gb|ELT94638.1| hypothetical protein CAPTEDRAFT_160085 [Capitella teleta]
          Length = 199

 Score = 45.4 bits (106), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 58/130 (44%), Gaps = 22/130 (16%)

Query: 763 VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLK-RTGITE 821
           ++C +  +T KFL G +    I++  WL  S   G    P  Y +  + A+ K +  +TE
Sbjct: 31  LVCDQVRRTVKFLCGVSRGLVIVRPDWLHQSEEQGVFQDPVTYFVRDASAEKKFKFRLTE 90

Query: 822 PICRDNHKYIFGRVGIMLHGKPSFCT--------KFAVIVKHGGGQVFKTL------HWL 867
            + R N K        ML G   F T        + A I+K  GGQ+ K++      H +
Sbjct: 91  SVERANQKG-------MLKGHKCFITANVKPEPKQMADIIKCSGGQLLKSMPKTSSGHTV 143

Query: 868 VLSLETQKNI 877
           V+S E  + I
Sbjct: 144 VISCEDDEAI 153


>gi|328866094|gb|EGG14480.1| hypothetical protein DFA_12254 [Dictyostelium fasciculatum]
          Length = 890

 Score = 43.5 bits (101), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 8/112 (7%)

Query: 703 YVKRLIFQGIKFLLTG-FSSQKEKEIEVLIQKYGGLVLLDIPPPN-SQGKRCSRSHSQQL 760
           + ++ +F G+ F L G F     +EIE +I++ GG+V LD PPP  S  K   +S   + 
Sbjct: 764 FSEKRLFYGLAFYLAGEFDQPSRQEIESVIKEAGGIV-LDAPPPKPSNTKELLKS---KC 819

Query: 761 PVVICPKKLKTTKFL--YGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSS 810
            V++ P    + + L           +  KW+ DS++    +   KY+I  S
Sbjct: 820 TVIVHPSYTSSHQQLSQIHLTTKQLPISYKWIFDSISNYQLIPKDKYLIFKS 871


>gi|241602471|ref|XP_002405192.1| topbp1, putative [Ixodes scapularis]
 gi|215500576|gb|EEC10070.1| topbp1, putative [Ixodes scapularis]
          Length = 1362

 Score = 42.7 bits (99), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 29/122 (23%), Positives = 58/122 (47%), Gaps = 15/122 (12%)

Query: 687  DSQLQYVKRC--STTRRKYVKRLIFQ-GIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIP 743
            DS+ Q+V       TRR+ + RL  +   KF+L+G + +++     ++++ GG++L    
Sbjct: 1108 DSESQFVGVTWDDPTRRQEMARLAGKLSEKFMLSGLAEEQKAHYAGVVEELGGVLL---- 1163

Query: 744  PPNSQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPA 803
                     S+++  ++  ++    LK  ++L   A   F+L   +L DS  AG  +   
Sbjct: 1164 --------TSKNYDPEMTHLVLASALKNERYLAAVAAGKFVLHTAYLDDSAKAGQFLDEE 1215

Query: 804  KY 805
             Y
Sbjct: 1216 GY 1217


>gi|343477469|emb|CCD11710.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1275

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 72/152 (47%), Gaps = 22/152 (14%)

Query: 674  RTSEMSKNSNACRDSQLQYVKRCSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQK 733
            R  E+SK S       L    +   +R  +++   F G++ L+TG S    K  +V   +
Sbjct: 883  RLREVSKESRNLETPPLLCTIKAHQSRPNFME--WFSGLRVLITGGSESDVKLFDVSGNE 940

Query: 734  YGGLVLLDIPPP-----------NSQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNA 782
             G L+++ +PPP           +++GK  S S S++ P  + P+ +KTT+    CA   
Sbjct: 941  LGQLLMMRLPPPPMFDAVAGGLQDNRGKSVS-SASREPPYSL-PQDIKTTRKCDSCA--- 995

Query: 783  FILKAKWLTDSV--AAGSTVSPAKYMILSSQA 812
              L  +   DS+  +AG   SP + + LSS+ 
Sbjct: 996  --LVEERRQDSIPPSAGDDPSPMENIFLSSEG 1025


>gi|344233866|gb|EGV65736.1| DNA repair protein [Candida tenuis ATCC 10573]
          Length = 1054

 Score = 40.8 bits (94), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 35/165 (21%), Positives = 68/165 (41%), Gaps = 17/165 (10%)

Query: 646 RLSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQY--VKRCSTTRRKY 703
           +L E + D  ++ ++  +   H+F       +   +    RD   +   +KR     +  
Sbjct: 90  QLGEQDSDTSEDEEVSGIGPKHEFGQYDVYFQNKHDKQQLRDEAYRDWDLKRRGLQGQVA 149

Query: 704 VKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPNSQGKRCSRSHSQQLPVV 763
            +  IF+G    + G++     EI  L+  YGG+ L  +        + S +H      +
Sbjct: 150 YQEPIFRGCTIYVNGYTVPSINEIHRLVIIYGGIFLPHL------ANKSSATH------I 197

Query: 764 ICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMIL 808
           +C K     + ++    N  ++KAKW+ DSV  G  +   KY + 
Sbjct: 198 VCDKLTPRKQQIFK---NCKVVKAKWIVDSVEVGRLLDWKKYRLF 239


>gi|242015866|ref|XP_002428568.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212513202|gb|EEB15830.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 1258

 Score = 40.8 bits (94), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 34/71 (47%)

Query: 762  VVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITE 821
            +V+C K L+T K L G     +I+   WL DS +    +S + Y I  +  D  +  I +
Sbjct: 1093 LVVCGKPLRTIKMLKGLVAAKYIVSDTWLIDSCSQNKLLSESSYAIQQADFDGFKCNIPK 1152

Query: 822  PICRDNHKYIF 832
             +   +  Y+F
Sbjct: 1153 ILSSPDRNYLF 1163


>gi|325189629|emb|CCA24114.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 812

 Score = 40.4 bits (93), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 46/109 (42%), Gaps = 15/109 (13%)

Query: 700 RRK---YVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPNSQGKRCSRSH 756
           RRK     K++I    +FLLTG   +       +I   GG V            +  R  
Sbjct: 576 RRKSHTTAKKVITHQFRFLLTGNKDECSSN-STIIAALGGRVC-----------QSGRKF 623

Query: 757 SQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKY 805
             Q   VIC +  +T KF+ GCA   +IL+  +L  S  AG  V  A +
Sbjct: 624 DTQCTHVICSELKRTEKFVAGCAAGKWILRPSYLIASSQAGHFVDEAPH 672


>gi|348671212|gb|EGZ11033.1| hypothetical protein PHYSODRAFT_520284 [Phytophthora sojae]
          Length = 932

 Score = 40.0 bits (92), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 76/170 (44%), Gaps = 36/170 (21%)

Query: 566 KLVSHGAGLLEPLKESRLPPFPNRGTCRSHASKLIGDVNNSFGKTNPDEAGVQETV--VM 623
           +L S G G  +PL  +  PP   +G  +  A++L   + NS  K    E+ +  +V  V 
Sbjct: 188 RLRSTGEGP-KPLTAAVAPPIDTKGPSKRQAARLATRLRNSPLKPRTRESALHASVREVK 246

Query: 624 LHNLSEERNNY---LET-----------QVPIRKRI---------RLSEVEVDLRQNR-D 659
           LH L +ER  +    ET           QV + +R+         +LSE E  LR ++ D
Sbjct: 247 LHELRDERARWQRDYETMREQVIEEKARQVALFQRLEMLRRSHAKQLSEAEAALRASQAD 306

Query: 660 IEKLQSS------HKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKY 703
           +E+LQ+          H   RT+ + +NS + R    + V   + TR K+
Sbjct: 307 VERLQTRLAQARVSAAHEARRTASLVQNSRSERQ---RLVCALAETRHKF 353


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.130    0.376 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,673,934,193
Number of Sequences: 23463169
Number of extensions: 615722149
Number of successful extensions: 1099888
Number of sequences better than 100.0: 65
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 49
Number of HSP's that attempted gapping in prelim test: 1099713
Number of HSP's gapped (non-prelim): 144
length of query: 948
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 796
effective length of database: 8,792,793,679
effective search space: 6999063768484
effective search space used: 6999063768484
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 82 (36.2 bits)