BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 002242
(948 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224075882|ref|XP_002304811.1| predicted protein [Populus trichocarpa]
gi|222842243|gb|EEE79790.1| predicted protein [Populus trichocarpa]
Length = 1122
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/966 (43%), Positives = 557/966 (57%), Gaps = 64/966 (6%)
Query: 15 DANKALLVPEVNTSAGAGK----KNDS--DKVLPLQLA----------ETFACSGESGNQ 58
D++KAL + +V TS G G+ K DS V+P + F + +SG Q
Sbjct: 119 DSSKALPLKQVETSGGVGEAIQLKMDSVGGGVIPSLTSAPISVENADPRDFTSNSDSGKQ 178
Query: 59 F---KIDNNASDMEVLSTSSYPKTEKNRIVRSLVNDQDKVGRREEKFK----KVADVNDA 111
+ K N + M S P T +N +S N +D+ + EEK K AD++DA
Sbjct: 179 YEERKGQNASCMMHDSRLISIPTTAENAGPQSATNYKDRGCQHEEKCNVICIKDADISDA 238
Query: 112 VELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQARLENQEDTSHYPNKEIDET 171
VELSISASEALVIH+ S+ L +LEAAL +KQARLE+ ED P+ E DE
Sbjct: 239 VELSISASEALVIHKFVKTGSSSDALTKQAILEAALHIKQARLESSEDAFGCPSDEADEI 298
Query: 172 HLVSELDDLTMASACEDVGLPSAGSDENFVSGSESFLIKCTPLSENYYESNDNGFKHVEC 231
+S+LDD M A DVGL + + + + ++ TP+ EN++ + G +HV+
Sbjct: 299 DFLSDLDDSIMEDAYLDVGLSFSAHGDEHLHDLDVSQVEETPVLENHHL--EKGSEHVQL 356
Query: 232 LAHKKKLPADPALNLNGSYMEGKIDPMLHQSTQETSHVLAAAQ-----------PEASTN 280
L + D L N S D +L Q ++ S + A+ P N
Sbjct: 357 LPQQNNADDDSDLGSNPSDAACLGDHILTQPAEKLSESSSGAKFVFTKDGNLGLPPVDVN 416
Query: 281 SIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWTGKEVDVPIQLEQSNNKGIPKFYVNETS 340
S +A + A G E+ + DRF+SRWFGGW +E D +L+Q++ K IPKF+V ETS
Sbjct: 417 SFHACRAEKAEGAREVHYLIADRFKSRWFGGWALEEGDASAKLKQNSPKSIPKFFVGETS 476
Query: 341 SLSESADVAPDENSFVQKHETGSKIASQSSIPFEGVNNKADEEISFCQDVISSAQSFVDP 400
LSESADVAPDENSFVQKHET S I SQSSIPFE +++K Q+VISS S VDP
Sbjct: 477 FLSESADVAPDENSFVQKHETRSNIGSQSSIPFEALHDK--------QEVISSDLSLVDP 528
Query: 401 LCSFVPCSVSAVDASSPQTINVAAAQKQ-------LLEVENLQWTSDFNAEFINRDRQDV 453
LCS VPCS+S +A SP N + +EN Q TS AE + D Q V
Sbjct: 529 LCSVVPCSISLENAISPSVQNNRKVDAENCFNPNTDTGMENFQKTSHLKAEPVFMDSQTV 588
Query: 454 STFSVEGFENPVKRQLTLLKPYSTLSPINDPKLEKRSCFRNHLVPSECHEEIISLEQNIG 513
+ PV+R++ L+ YSTLSP D LE+ N S +++ Q +G
Sbjct: 589 PIIMGQCSNAPVRRRVASLRTYSTLSPNCDAVLEREGPCHNGRYSSGHVRNLLASHQEMG 648
Query: 514 CIWSWDKRYCKGLLSFGSASKFAAGRENEETYKTTVTGNPYAVATNCTENNDKLVSHGAG 573
CI D+R KG+L F S + GR+NEE V N A T ++D+
Sbjct: 649 CIRPSDQRNSKGVLPFKSVFESTDGRDNEEN--QDVVRNLVAEITCQKRSHDQPTKDRTE 706
Query: 574 L-LEPLKESRLPPFPNRGT-CRSHASKLIGDVNNSFGKTNPDEAGVQETVVMLH------ 625
+ ++P + R P NR T CR AS+L +N G+ +P++A QE ++ LH
Sbjct: 707 MKVKPSVQRRSPLILNRRTRCRPQASELF--THNLTGEISPEQAVGQENIIKLHPSKNAE 764
Query: 626 NLSEERNNYLETQVPIRKRIRLSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNAC 685
+ + N + P+RKR+ SEVEVDL QN+D+ K Q+ H+ S R + N N C
Sbjct: 765 KIKLKWENSFGARNPVRKRVCFSEVEVDLYQNKDLRKPQTLHRNGSTIRADKKKNNGNTC 824
Query: 686 RDSQLQYVKRCSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP- 744
+ Q Q VK T + K KRLIF G++FLLTGFS +KEKEI +IQ YGG+++LDIPP
Sbjct: 825 SEVQPQDVKSSFTCQIKDAKRLIFHGLEFLLTGFSHKKEKEIIEIIQIYGGMIVLDIPPV 884
Query: 745 PNSQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAK 804
PNS+ KR SRS+ Q LPVVIC KKL+TTKFLYGCA+NA ILK KWLTDSVAAGS VSP K
Sbjct: 885 PNSRLKRVSRSNLQHLPVVICSKKLQTTKFLYGCAMNALILKLKWLTDSVAAGSVVSPDK 944
Query: 805 YMILSSQADLKRTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTL 864
YMI+S+QA LK T I + +C ++ K+IF RVGI+LHGK FCTK VIVKH GQVFKTL
Sbjct: 945 YMIISNQAYLKCTRIGKSVCCNHRKHIFDRVGIVLHGKHRFCTKLTVIVKHAHGQVFKTL 1004
Query: 865 HWLVLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQ 924
LV SL+++K +G IV ENE R RHLR+CA E+KI M+PASWI KSLH+G LLP +
Sbjct: 1005 QSLVESLDSEKISMGAIVTENETRELRHLRYCASERKIPMMPASWIAKSLHLGKLLPFIE 1064
Query: 925 DKHIPS 930
D+ PS
Sbjct: 1065 DEATPS 1070
>gi|255569143|ref|XP_002525540.1| conserved hypothetical protein [Ricinus communis]
gi|223535119|gb|EEF36799.1| conserved hypothetical protein [Ricinus communis]
Length = 991
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 392/874 (44%), Positives = 524/874 (59%), Gaps = 50/874 (5%)
Query: 91 DQDKVGRREEKFKKV--ADVNDAVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQ 148
D ++G R FK++ ADVN+AVELSI+ASEALVIHEL + S+ + T +LEAALQ
Sbjct: 152 DNVEIGERIH-FKRIKDADVNEAVELSIAASEALVIHELMCSGSTSKAVPTGAILEAALQ 210
Query: 149 VKQARLENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPSAGSDENFVSGSESFL 208
VKQARLE ED + + EIDE H++S+LDD M +A DVGL +D+ GS+
Sbjct: 211 VKQARLEALEDENCCSSDEIDEIHVLSDLDDSVMENAFVDVGLSFRDTDDQRACGSDVSQ 270
Query: 209 IKCTPLSENYYESNDNGFKHVECLAHKKKLPADPALNLNGSYMEGKIDPMLHQSTQETSH 268
+K TP+ +Y + NG KHVE + H K+ + AL LN + + ++P+LHQS + + +
Sbjct: 271 VKDTPVDSHY--GHKNGSKHVELMVHHDKVYSGSALGLNNNNIASLVNPLLHQSAEASPY 328
Query: 269 VLAAAQPEASTNSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWTGKEVDVPIQLEQSNN 328
V + QP+ +S A S + G+E++C VVD+FRSRW GGWT KE D +++Q N+
Sbjct: 329 VPSVVQPQVDVSSSPACMSEKSEGEEKLCSLVVDKFRSRWSGGWTVKEADTSEKVKQKNS 388
Query: 329 KGIPKFYVNETSSLSESADVAPDENSFVQKHETGSKIASQSSIPFEGVNNKADEEISFCQ 388
KG+ KF+ ETS LSESA VA DENSFVQKHETGSKIASQSSIPFEG ++ADE I Q
Sbjct: 389 KGVLKFFDFETSFLSESAYVAADENSFVQKHETGSKIASQSSIPFEGGLDRADEGICSFQ 448
Query: 389 DVISSAQSFVDPLCSFVPCSVSAVDASSPQTINVAAAQ-----KQLLEVENLQWTSDFNA 443
+ I F + S S+S+ + + A AQ K +EN Q TSD N
Sbjct: 449 EYI-----FRXXISSENAGSLSSQNKDDRE----AYAQNCFSTKSEPRMENFQSTSDVNV 499
Query: 444 EFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSPINDPKLEKRSCFRNHLVPSECHE 503
E R+ V E FE PV+R L LK YS L P ++ L ++ +NH SE
Sbjct: 500 EPPYAGREAVPKTYGEYFEVPVRRHLASLKTYSKLLPKHEATLSEQQLNQNH--SSEYVG 557
Query: 504 EIISLEQNIGCIWSWDKRYCKGLLSFGSASKFAAGRENEETYKTTVTGNPYAVATNCTEN 563
E+IS + D+ K + S F+ GR++ + + +P + +
Sbjct: 558 ELISSKS------FSDQMNSKLFVPLRSEFHFSVGRDHGQNEDVNILRSPLS-------D 604
Query: 564 NDKLVSHGAGLLEPLKESRLPPFPNRGT-CRSHASKLIGDVNNSFGKTNPDEAGVQETVV 622
+D+ G +P K+ R P NR T C S+L+ N G+ P+ Q TV
Sbjct: 605 HDEPAKDGPE--QPSKQRRSPLVLNRKTRCHLQPSELL---RNLAGEKTPERIFAQGTVN 659
Query: 623 M------LHNLSEERNNYLETQVPIRKRIRLSEVEVDLRQNRDIEKLQSSHKFHSVTRTS 676
L + + N +T +RKR+ L E EV L+QN+D K Q+S + S +
Sbjct: 660 KALQRKNLQKIDSKLENPHKTANLVRKRVCLLEAEVGLQQNKDAPKTQTSQRNSSAVTAN 719
Query: 677 EMSKNSNACRDSQLQYVKRCSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGG 736
+ SK S C DSQ Q +K C +KRLIF G++FLLTGFSS+K++EI LIQ+YGG
Sbjct: 720 KRSKYSKPCSDSQNQDMKNCFKDLVNDMKRLIFHGLQFLLTGFSSRKKREIAGLIQEYGG 779
Query: 737 LVLLDIPPP--NSQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSV 794
LVLLD+P P N + R +RS+ Q+LP+VIC KKL+TTKFLYGCAVN ILK KWL DSV
Sbjct: 780 LVLLDVPSPPSNIRTTRKARSNFQRLPIVICSKKLQTTKFLYGCAVNTLILKVKWLADSV 839
Query: 795 AAGSTVSPAKYMILSSQADLKRTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVK 854
AAGS + P KYMI+S+Q + T + D + IF RVGIMLHGK SFC+K +I+K
Sbjct: 840 AAGSAIPPDKYMIISNQVGQQYTRFGKLGYHDAPRCIFDRVGIMLHGKHSFCSKLEIIIK 899
Query: 855 HGGGQVFKTLHWLVLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSL 914
HGGGQVFKTL L SL+T+K VG I+AE+E SRHLRHCA E+K+ M+PASWI +SL
Sbjct: 900 HGGGQVFKTLQRLFQSLDTEKISVGAIIAEDESTASRHLRHCALERKLPMMPASWIARSL 959
Query: 915 HIGMLLPQTQDKHIPSPTVNVPEFRISIDWSEEI 948
H G LLP + +P +N PE IS+DWS+EI
Sbjct: 960 HFGKLLPFKEKDD--APKINSPESAISLDWSQEI 991
>gi|449435496|ref|XP_004135531.1| PREDICTED: uncharacterized protein LOC101203785 [Cucumis sativus]
Length = 1330
Score = 427 bits (1099), Expect = e-116, Method: Compositional matrix adjust.
Identities = 325/935 (34%), Positives = 474/935 (50%), Gaps = 125/935 (13%)
Query: 64 NASDMEVLSTSSYPKTEKNRIVRSLVNDQDKVGRREEKFK----KVADVNDAVELSISAS 119
NA D ++ SSY N +SL N +D+ R+ EK K A+V+DA+ELS+ AS
Sbjct: 146 NAGDTDLAPHSSYKDVLHNVGCQSLTNTEDRENRQGEKLDVGCLKNAEVSDAIELSVVAS 205
Query: 120 EALVIHELSNNEPDSEDLLTTIVLEAALQVKQARLENQEDTSHYPNKEIDETHLVSELDD 179
EALVIHEL +E DS + VLEA++QVK+AR+E E ++E+D + +S+LD+
Sbjct: 206 EALVIHELLKDELDSAAVSVEAVLEASIQVKKARIELLESALESIDEEVDLSDSLSDLDN 265
Query: 180 LTMASACEDVGLPSAGSDENFVSGSESFLIKCTPLSENYYE-----------------SN 222
TM A +DVGLPS+ + + SG+ F ++ TP+++N + N
Sbjct: 266 STMRDAFDDVGLPSSILNSDH-SGTACFDVQDTPVNKNEFTHGSQCNSIDMTSQPDILGN 324
Query: 223 DNGFKHVE---CLAHKKKLPA-DPALNL-----------NGSYMEGKIDPMLHQSTQETS 267
K +E + LP D + N+ + S K D ML TQ S
Sbjct: 325 GLTLKQLEENLVVTRPVGLPMEDLSCNIQHQLSNDDVLGSTSTNYCKYDSMLQHPTQNES 384
Query: 268 HVLAAAQPEAST--------------NSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
Q S+ +S++ + +A E + +RF+SRW GGW+
Sbjct: 385 DEFVVKQKIVSSIVNTNLCTIHAKENSSLHESSKVSAKNDELVAFFTPERFKSRWLGGWS 444
Query: 314 GKEVDVPIQLEQS-NNKGIPKFYVNETSSLSESADVAPDENSFVQKHETGSKIASQSSIP 372
GKEVDV QL Q + K IP +VNETS LSESAD+APDENS VQ+ E+ ++ASQSSI
Sbjct: 445 GKEVDVSEQLRQDVDGKTIPLMFVNETSFLSESADIAPDENSCVQRCESKFQVASQSSIH 504
Query: 373 FEGVNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVSAVDASSPQTINVAAAQKQLLEV 432
F ++ K D+ + ++++ + S VDPLCSFVPCS+S S+ Q +N
Sbjct: 505 FGHLDEKGDDGLLVAEEIVKCSLSLVDPLCSFVPCSISLDTDSAGQNLN----------- 553
Query: 433 ENLQWTSDFNAEFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSPINDPKLEKRSCF 492
E T + F++ V G ++RQ+T LK YST+SP +
Sbjct: 554 EGKDCTEELLGTFVD----------VGGSRPSIRRQVTSLKNYSTISPTHATMEGGLDNS 603
Query: 493 RNHLVPSECHEEIISLEQNIGCI-WSWDKRYCKGLLSFGSASKFAAGRENEETYKTTVTG 551
H +P + ++S + + C +S + + L S + S+ + E +T
Sbjct: 604 YAHQLPG--NMRLLSSDSQLDCTRFSSKINFMETLPSQSTKSR---DMDTVEDSQTDARH 658
Query: 552 NPYAVATNCTENNDKLVSHGAGLL-EPLKESRLPPFPNRGTCRSHASKLIGDVNNSF--- 607
N T +D++ + L + +K+S N S ++ +
Sbjct: 659 NLVEEITELKSKSDEVAGDVSEFLADTVKKSVTCDILNGSLQLSKSTMKKSSIKKDHLQS 718
Query: 608 GKTNPDEAGVQETVVMLHNLSEERNNYLETQVPIRKRIRLSEVEVDLRQNRDIEK----- 662
KT + V V M H E N LE + ++KR+R E ++N D +K
Sbjct: 719 SKTISNPQKVDNVVKMQH----ESKNPLEPCMLVQKRVRFLEANDQPQENLDFQKVHPPI 774
Query: 663 ----------------------LQSSHKFHSVTRTSEMSKNSNACR-----DSQLQYVKR 695
L S RTS+ K SN C D + R
Sbjct: 775 NCKDWSIVGVIDMIYYVFFLRNLPFGWNADSTLRTSKRRKFSNQCLLSRHPDGKGHLKSR 834
Query: 696 CSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPNSQGKRCSRS 755
++R+K LIFQGI+FL+TGFSS+KEK+I ++ GG++L DIP P+S+G++ S+S
Sbjct: 835 YCSSRKK----LIFQGIQFLVTGFSSRKEKDINGIVCNNGGIILPDIPCPSSRGQKMSKS 890
Query: 756 HSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLK 815
+ PV++ KKL+T KFLYGCAVN+ I+ WLTDS+AAGS V P KYMI+S+QAD
Sbjct: 891 DCKGPPVILSSKKLQTKKFLYGCAVNSLIVNVSWLTDSIAAGSIVPPWKYMIISNQADCT 950
Query: 816 RTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQK 875
+ G + + +YIF VG+MLHGK FCTK ++KHGGGQVFKTL WLV SL +K
Sbjct: 951 QIG--RSVRHSSRRYIFENVGVMLHGKQGFCTKLTNVLKHGGGQVFKTLQWLVKSLNQEK 1008
Query: 876 NIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWI 910
VGVIV E+E + SRHL+ CA E+ I ++ A +
Sbjct: 1009 ISVGVIVVEDEHKSSRHLKQCALEQGIPLMIAKLV 1043
>gi|357456285|ref|XP_003598423.1| hypothetical protein MTR_3g013490 [Medicago truncatula]
gi|355487471|gb|AES68674.1| hypothetical protein MTR_3g013490 [Medicago truncatula]
Length = 1201
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 282/781 (36%), Positives = 390/781 (49%), Gaps = 122/781 (15%)
Query: 255 IDPMLHQSTQETSHVLA---AAQPEASTNSIYAWNSNN--AVGKEEICCSVV-DRFRSRW 308
ID L + Q+ +A + +P+ S NS S N V E + +RFRSRW
Sbjct: 456 IDATLISTFQKDDLAMADFTSIKPQHSVNSSLFQTSENFKDVKNENWAAYLAPERFRSRW 515
Query: 309 FGGWTGK--------EVDVPI---------QLEQSNNKGIPKFYVNETSSLSESADVAPD 351
GGWT K E+D P L +N K IPKF V ETS L+ESAD+ PD
Sbjct: 516 LGGWTSKTYYLCLASELDSPSLPDSKKASPSLNLNNEKRIPKFLVRETSFLTESADIIPD 575
Query: 352 EN----------------------------SFVQKHETGSKIASQSSIPFEGVNNKADEE 383
EN S V KH+ I+SQ S+ EG +NK DE
Sbjct: 576 ENSRLLKHDPKCVRETSFLTESVDIVPDESSCVLKHDPKRAISSQLSMHCEGSHNKPDEG 635
Query: 384 ISFCQDVIS-SAQSFVDPLCSFVPCSVSA------VDASSPQTINVAAAQKQLLEVENLQ 436
+ QD++ S+ S DPLCS VPCS ++ I A EV+N Q
Sbjct: 636 VLHSQDMVGCSSLSLNDPLCSVVPCSFASEHDNYKTHIDKENDIEYLAPLVSDFEVDNCQ 695
Query: 437 WTSDFNAEFINRDR--------QDV---STFSVEGFENPVKR-QLTLLKPYSTLSPINDP 484
SD N F RD +D+ ST VE + R + T LK YS +
Sbjct: 696 RISDKNTTFDCRDEKITSIIDGKDILLTSTEVVEQMSEKLTRVEHTCLKTYSMI------ 749
Query: 485 KLEKRSCFRNHLVPSECHEEIISLEQNIGCIWSWDKRYCKGLLSFGSASKFAAGRENEET 544
+N V C+ +S Q+ S R + S S F EN+
Sbjct: 750 -------LQNQDVNQNCNSTPLSTNQSDIAAASLGTRVSE------SPSAFEHSNENKNE 796
Query: 545 YKTTVTGNPYAVATNCTENNDKLVSHGAGLLEPLKESRLPPFPNRGTCRSHASKLIG--- 601
+ ++ + +D+L + E E R P N T +L+G
Sbjct: 797 VNHQHLIDQKSIIQITDDKSDELKAS-----ELTHERRSPINLNHRT----RHRLLGPKT 847
Query: 602 DVNNSFGKTNPDEAGVQETVVM------LHNLSEERNNYLETQVPIRKRIRLSEVEVDLR 655
VN+ + N ++ V ETVV L + E N ++ V +RK++ S+ +L
Sbjct: 848 SVNDISIEKNINQYVVAETVVQHQQNNNLIKVQVECNKSVDGHVKVRKQVHFSDKVEELH 907
Query: 656 QNRDIEKLQSSHKFHSVTR------TSEMSKNSNACRDSQLQYVKRCSTTRRKYVKRLIF 709
Q R + KL+SS K S R + ++ ++ + S Y C R ++ IF
Sbjct: 908 QKRKLSKLESSDKKCSSIRAKRQRVSKTLTTSAPHMKHSLTDY---CRGARNEF----IF 960
Query: 710 QGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIP-PPNSQGKRCSRSHSQQLPVVICPKK 768
+G +FLLTG SSQKE+E+E LI+ GG++L DIP PPNS+GKR S QLP+++C KK
Sbjct: 961 RGREFLLTGLSSQKERELEALIRNSGGVILYDIPSPPNSKGKRSSTLSCLQLPIILCNKK 1020
Query: 769 LKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPICRDNH 828
L+T KFLYGCAV A IL+ WLTD +A+G+ + P KYMIL ++ D+K T + + R N
Sbjct: 1021 LQTAKFLYGCAVGASILRVDWLTDCLASGTILLPEKYMILPNRNDMKWTRSGKAVHRRNQ 1080
Query: 829 KYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVAENERR 888
K+IF RVGIMLHGK SFCTKFA I+KHGGG+ FKTL WLV S + ++ ++G IV E +
Sbjct: 1081 KHIFERVGIMLHGKHSFCTKFACIIKHGGGRAFKTLQWLVRSTDEERTLMGAIVVEEKAT 1140
Query: 889 VSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPS-PTVNVPEFRISIDWSEE 947
+SRHL+ CA E+ I ++ +SWIIKSL+ G L+P T++ S +V VPE R +D SEE
Sbjct: 1141 ISRHLKSCALERDIPIMTSSWIIKSLYSGNLIPLTEENSTLSLRSVQVPEVRNDVDMSEE 1200
Query: 948 I 948
I
Sbjct: 1201 I 1201
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/189 (41%), Positives = 110/189 (58%), Gaps = 6/189 (3%)
Query: 39 KVLPLQLAETFACSGESGNQFKIDNNASDMEVLSTSSYPKTEKNRIVRSLVNDQDKVGRR 98
+ L LQ A+T GE+ + N+A + +V++T P+T+K ++SL++ D V ++
Sbjct: 160 EALSLQSAQTSVDFGENMHSM-TGNHACEQDVMATF-IPETKKKNSLKSLIHTFDSVKQQ 217
Query: 99 EEKFK----KVADVNDAVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQARL 154
+EK K + D++DAVELSI+ASEALVIH+L E SE L T VLE AL+VKQARL
Sbjct: 218 KEKPKDECFESNDISDAVELSIAASEALVIHDLVKMESVSETLSTEAVLEIALRVKQARL 277
Query: 155 ENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPSAGSDENFVSGSESFLIKCTPL 214
E ED + E D + +S+L+D M A D+GLP S +S S F K
Sbjct: 278 EGLEDGFFSSSMESDCSDSLSDLNDFLMEDAYADIGLPVGVSCGENLSNSNVFHAKGVSG 337
Query: 215 SENYYESND 223
ENY E N+
Sbjct: 338 DENYNEGNN 346
>gi|296087815|emb|CBI35071.3| unnamed protein product [Vitis vinifera]
Length = 693
Score = 340 bits (873), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 206/443 (46%), Positives = 264/443 (59%), Gaps = 51/443 (11%)
Query: 514 CIWSWDKRYCKGLLSFGSASKFAAGRENEETYKTTVTGNPYAVATNCTENNDKLVSHGAG 573
CI S + + LL F S SK+A R+ E + T+V + N L+
Sbjct: 294 CISSGRSDFTE-LLPFTSMSKYAISRDYEYNHDTSVHMP--------KKGNSPLI----- 339
Query: 574 LLEPLKESRLPPFPNRGTCRSHASKLIGDVNNSFGKTNPDEAGVQETVVMLH---NL--- 627
R + ASK +NNS G+ N V + +V + NL
Sbjct: 340 ------------LNQRMHYQLQASKHF--INNSSGEENSKGTAVPDIIVGIQQSKNLEMP 385
Query: 628 SEERNNYLETQVPIRKRIRLSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRD 687
E +N TQV IRKR+R SE V QN+++ +Q+++K + K
Sbjct: 386 QPEFHNAHSTQVQIRKRVRFSEANVQHPQNKNLHNIQTAYKTRKTWILTHKVK------- 438
Query: 688 SQLQYVKRCSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPP-N 746
Y+ C ++LIFQG++FLLTGFSS KEKEIE LI+KYGG+VL DIP P N
Sbjct: 439 ---WYMTNCCIN-----EKLIFQGLEFLLTGFSSCKEKEIEGLIRKYGGIVLPDIPSPSN 490
Query: 747 SQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYM 806
S+ KR S + + LP+V+C KKL+TTKFLYGCAVN F+LK WL +SVAAGS + P KYM
Sbjct: 491 SRRKRNSNFNQKHLPIVLCSKKLQTTKFLYGCAVNTFMLKVDWLNNSVAAGSVLPPGKYM 550
Query: 807 ILSSQADLKRTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHW 866
ILS++ D KR I PI R+NHKYIF +VGIMLHGK SFC+KFA I +HGGGQVFKTL
Sbjct: 551 ILSNEGDKKRIRIGMPIHRENHKYIFEKVGIMLHGKHSFCSKFAKIFRHGGGQVFKTLQC 610
Query: 867 LVLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDK 926
LV SL+T+ VG IVAE+E R SRHLR CA E+KI M+PASWII SLH+G LLP T++K
Sbjct: 611 LVQSLDTESISVGAIVAEDESRASRHLRLCASERKIPMMPASWIINSLHLGRLLPLTENK 670
Query: 927 H-IPSPTVNVPEFRISIDWSEEI 948
H P + V E S+++S EI
Sbjct: 671 HSSPLTRIMVQELPTSMEFSAEI 693
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/95 (66%), Positives = 75/95 (78%), Gaps = 1/95 (1%)
Query: 316 EVDVPIQLEQSNNKGIPKFYVNETSSLSESADVAPDENSFVQKHETGSKIASQSSIPFEG 375
EV V + ++Q+N K IPK +V ETS LSESADVAPDENSFV K+ET SK+ASQSSIPFEG
Sbjct: 157 EVGVSVLMKQANPKSIPKLFVGETSFLSESADVAPDENSFVPKNETSSKLASQSSIPFEG 216
Query: 376 VNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVS 410
+ +KA+EEI V SS+ S VDPLCS VPCS+S
Sbjct: 217 LCDKANEEILL-SQVQSSSLSLVDPLCSVVPCSIS 250
Score = 100 bits (249), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 63/127 (49%), Positives = 81/127 (63%), Gaps = 4/127 (3%)
Query: 104 KVADVNDAVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQARLENQEDTSHY 163
KVA+V+DAVELSI+ASEALVIHEL +E SE VLE AL+VKQARLE ED+ H+
Sbjct: 12 KVAEVDDAVELSIAASEALVIHELLKSESPSEAFPAAAVLEVALRVKQARLEMLEDSVHH 71
Query: 164 PNKEIDETHLVSELDDLTMASACEDVGLPSAGSDENFVSGSESFLIKCTPLSENYYESND 223
P +E+D +S+L D M EDVGL GS + GS ++ TP+S N+ +D
Sbjct: 72 PIEEVD---FLSDLVDSAMEDVFEDVGLFDVGSHDQCSGGSIMSRVEDTPISGNHCGFDD 128
Query: 224 NGFKHVE 230
KHV+
Sbjct: 129 KS-KHVD 134
>gi|218193963|gb|EEC76390.1| hypothetical protein OsI_14021 [Oryza sativa Indica Group]
Length = 883
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/368 (35%), Positives = 200/368 (54%), Gaps = 31/368 (8%)
Query: 588 NRGTCRSHASKLIGDVNNSFGKTNPDE-AGVQETVVMLHNLSEERNNYLETQVPIRKRIR 646
N+ R ASK+ V N+F N E + V + +SE +N+ + RK+ +
Sbjct: 540 NKKIRRVQASKVC--VENNFEAGNLHEFSKVLKNPSYAQGVSEHQNSMKSLK---RKKAQ 594
Query: 647 LSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKYV-- 704
SE ++ R+ ++I + Q+ +F + DS+L + R+Y+
Sbjct: 595 FSEAKISTRKTKNIRRTQTKSRF--------------SWSDSRLID----TIEPREYIDN 636
Query: 705 KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIP--PPNSQGKRCSRSHSQQLPV 762
K +F G+ FLLTGF S KEKEIE LI+K+GG VL +P P + + K + + L +
Sbjct: 637 KEALFHGLDFLLTGFQSHKEKEIEPLIRKFGGYVLSRVPSCPLDKRSKLAELARCKPL-I 695
Query: 763 VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEP 822
V+ PKK+ T KFLYGCA+N++IL WL DS+ AG + P KY I A + +
Sbjct: 696 VLSPKKVSTAKFLYGCAINSWILNPSWLFDSIQAGVMLPPGKYFIRQVHAMQGISMFDQS 755
Query: 823 ICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIV 882
+ + +F VG ++ GK SFC+KF+ ++KHGGGQVF +L LV SL+ + + G+I+
Sbjct: 756 LHLRKNTLLFDGVGFLILGKISFCSKFSNVIKHGGGQVFASLQGLVQSLKDRSSSHGIIL 815
Query: 883 AENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPSPTVNVPEFR--I 940
NE SRHL +C E I PASW+I SL+ G L+P +D+ + +P F+
Sbjct: 816 VANEASASRHLSYCGLEHDIKTAPASWVIGSLYSGKLIPLKKDRCASFRKIKMPSFQQPQ 875
Query: 941 SIDWSEEI 948
+ D S+EI
Sbjct: 876 AFDMSQEI 883
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 119/415 (28%), Positives = 176/415 (42%), Gaps = 78/415 (18%)
Query: 80 EKNRIVRSLVNDQDKVGRREEKFKKV--ADVNDAVELSISASEALVIHELSNNEPDSEDL 137
E N+ S + + ++ R+ +K+ ADVNDAVELSI+ASEA+VI E+ + S D
Sbjct: 162 EINKTCDSKGHRRQQLSGRKVDVRKLRSADVNDAVELSIAASEAMVIAEMILLDSQS-DK 220
Query: 138 LTTIVLEAALQVKQAR----LENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPS 193
L T LEAAL VK+AR LE E + ++DET +SELD+ M A +DVGL
Sbjct: 221 LATASLEAALHVKEARKQCFLEELEHSCGSSESDLDETDGLSELDETEMLDAFQDVGLS- 279
Query: 194 AGSDENFVSGSESFLIKCTPLSENYYESNDNGFKHVECLAHKKKLPADPALNLNGSYMEG 253
L++ S+ N +G K ++H P D
Sbjct: 280 --------------LVQTACASQG---QNISGLKQE--ISHASSHPCDA----------- 309
Query: 254 KIDPMLHQSTQETSHVLAAAQPEASTNSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
+HVL ++ PE WNS++A + + S+ + GG
Sbjct: 310 ------------EAHVLESS-PEKQN---IRWNSHDADTNDHVSDSLASHNKE---GGV- 349
Query: 314 GKEVDVPIQLEQSNNKGIPKFYVNETSSLSESADVA---PDENSFVQKHETGSKIAS-QS 369
V +Q K + + ETS +SES D P + S AS
Sbjct: 350 -----VAVQTNVGTRKHVKGLFNKETSFISESMDGMDEFPSPSRIASMEMAASSRASFLH 404
Query: 370 SIPFEGVNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVS---AVDASSPQTINVAAAQ 426
I N+ E C V+ S S VDPLCS VPCS+S + +P+ I +
Sbjct: 405 KIKGSCEENQGAEAAQLCSQVVCSNLSLVDPLCSIVPCSISFNEGPPSQAPECIQ-SKGD 463
Query: 427 KQLLEVENLQWTSDFNAEFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSPI 481
K+L+ + D E VS + +R+ + L+P+ST++PI
Sbjct: 464 KELISTKEFPSKQDLEGE-AGPSCTPVSNILLR------RRKYSSLRPFSTIAPI 511
>gi|32129326|gb|AAP73853.1| unknown protein [Oryza sativa Japonica Group]
Length = 850
Score = 209 bits (531), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 128/367 (34%), Positives = 199/367 (54%), Gaps = 29/367 (7%)
Query: 588 NRGTCRSHASKLIGDVNNSFGKTNPDE-AGVQETVVMLHNLSEERNNYLETQVPIRKRIR 646
N+ R ASK+ V N+F N E + V + +SE +N+ + RK+ +
Sbjct: 507 NKKIRRVQASKVC--VENNFEAGNLHEFSKVLKNPSYAQGVSEHQNSMKSLK---RKKAQ 561
Query: 647 LSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKYV-- 704
SE ++ R+ ++I + Q+ +F + DS+L + R+Y+
Sbjct: 562 FSEAKISTRKTKNIRRTQTKSRF--------------SWSDSRLID----TIEPREYIDN 603
Query: 705 KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP-PNSQGKRCSRSHSQQLPVV 763
K +F G+ FLLTGF S KEKEIE LI+K+GG VL +P P + + + + P+V
Sbjct: 604 KEALFHGLDFLLTGFQSHKEKEIEPLIRKFGGYVLSRVPSCPLDKRSKLAELARCKPPIV 663
Query: 764 ICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPI 823
+ PKK+ T KFLYGCA+N++IL WL DS+ AG + P KY I + + + +
Sbjct: 664 LSPKKVSTAKFLYGCAINSWILNPSWLFDSIQAGVMLPPGKYFIRQVHSMQGISMFDQSL 723
Query: 824 CRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVA 883
+ +F VG ++ GK SFC+KF+ ++KHGGGQVF +L LV SL+ + + G+I+
Sbjct: 724 HLRKNTLLFDGVGFLILGKISFCSKFSNVIKHGGGQVFASLQGLVQSLKDRSSSHGIILV 783
Query: 884 ENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPSPTVNVPEFR--IS 941
+E SRHL +C E I PASW+I SL+ G L+P +D+ + +P F+ +
Sbjct: 784 ASEASASRHLSYCGLEHDIKTAPASWVIGSLYSGKLIPLKKDRCASFRKIKMPSFQQPQA 843
Query: 942 IDWSEEI 948
D S+EI
Sbjct: 844 FDMSQEI 850
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 118/414 (28%), Positives = 175/414 (42%), Gaps = 78/414 (18%)
Query: 80 EKNRIVRSLVNDQDKVGRREEKFKKV--ADVNDAVELSISASEALVIHELSNNEPDSEDL 137
E N+ S + + ++ R+ +K+ ADVNDAVELSI+ASEA+VI E+ + S D
Sbjct: 129 EINKTCDSKGHRRQQLSGRKVDVRKLRSADVNDAVELSIAASEAMVIAEMILLDSQS-DK 187
Query: 138 LTTIVLEAALQVKQAR----LENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPS 193
L T LEAAL VK+AR LE E + ++DET +SELD+ M A +DVGL
Sbjct: 188 LATASLEAALHVKEARKQCFLEELEHSCGSSESDLDETDRLSELDETEMLDAFQDVGLS- 246
Query: 194 AGSDENFVSGSESFLIKCTPLSENYYESNDNGFKHVECLAHKKKLPADPALNLNGSYMEG 253
L++ S+ N +G K ++H P D
Sbjct: 247 --------------LVQTACASQG---QNISGLKQE--ISHASSHPCDA----------- 276
Query: 254 KIDPMLHQSTQETSHVLAAAQPEASTNSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
+HVL ++ PE WNS++A + + S+ + GG
Sbjct: 277 ------------EAHVLESS-PEKQN---IRWNSHDADTNDHVSDSLASHNKE---GGV- 316
Query: 314 GKEVDVPIQLEQSNNKGIPKFYVNETSSLSESADVA---PDENSFVQKHETGSKIAS-QS 369
V +Q K + + ETS +SES D P + S AS
Sbjct: 317 -----VAVQTNVGTRKHVKGLFNKETSFISESMDGMDEFPSPSRIASMEMAASSRASFLH 371
Query: 370 SIPFEGVNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVS---AVDASSPQTINVAAAQ 426
I N+ E C V+ S S VDPLCS VPCS+S + +P+ I +
Sbjct: 372 KIKGSCEENQGAEAAQLCSQVVCSNLSLVDPLCSIVPCSISFNEGPPSQAPECIQ-SKGD 430
Query: 427 KQLLEVENLQWTSDFNAEFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSP 480
K+L+ + D E VS + +R+ + L+P+ST++P
Sbjct: 431 KELISTKEFPSKQDLEGE-AGPSCTPVSNILLR------RRKYSSLRPFSTIAP 477
>gi|115456137|ref|NP_001051669.1| Os03g0811200 [Oryza sativa Japonica Group]
gi|50540742|gb|AAT77898.1| expressed protein [Oryza sativa Japonica Group]
gi|108711696|gb|ABF99491.1| BRCA1 C Terminus domain containing protein, expressed [Oryza sativa
Japonica Group]
gi|113550140|dbj|BAF13583.1| Os03g0811200 [Oryza sativa Japonica Group]
gi|222626025|gb|EEE60157.1| hypothetical protein OsJ_13067 [Oryza sativa Japonica Group]
Length = 883
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/367 (34%), Positives = 199/367 (54%), Gaps = 29/367 (7%)
Query: 588 NRGTCRSHASKLIGDVNNSFGKTNPDE-AGVQETVVMLHNLSEERNNYLETQVPIRKRIR 646
N+ R ASK+ V N+F N E + V + +SE +N+ + RK+ +
Sbjct: 540 NKKIRRVQASKVC--VENNFEAGNLHEFSKVLKNPSYAQGVSEHQNSMKSLK---RKKAQ 594
Query: 647 LSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKYV-- 704
SE ++ R+ ++I + Q+ +F + DS+L + R+Y+
Sbjct: 595 FSEAKISTRKTKNIRRTQTKSRF--------------SWSDSRLID----TIEPREYIDN 636
Query: 705 KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP-PNSQGKRCSRSHSQQLPVV 763
K +F G+ FLLTGF S KEKEIE LI+K+GG VL +P P + + + + P+V
Sbjct: 637 KEALFHGLDFLLTGFQSHKEKEIEPLIRKFGGYVLSRVPSCPLDKRSKLAELARCKPPIV 696
Query: 764 ICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPI 823
+ PKK+ T KFLYGCA+N++IL WL DS+ AG + P KY I + + + +
Sbjct: 697 LSPKKVSTAKFLYGCAINSWILNPSWLFDSIQAGVMLPPGKYFIRQVHSMQGISMFDQSL 756
Query: 824 CRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVA 883
+ +F VG ++ GK SFC+KF+ ++KHGGGQVF +L LV SL+ + + G+I+
Sbjct: 757 HLRKNTLLFDGVGFLILGKISFCSKFSNVIKHGGGQVFASLQGLVQSLKDRSSSHGIILV 816
Query: 884 ENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPSPTVNVPEFR--IS 941
+E SRHL +C E I PASW+I SL+ G L+P +D+ + +P F+ +
Sbjct: 817 ASEASASRHLSYCGLEHDIKTAPASWVIGSLYSGKLIPLKKDRCASFRKIKMPSFQQPQA 876
Query: 942 IDWSEEI 948
D S+EI
Sbjct: 877 FDMSQEI 883
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 118/414 (28%), Positives = 175/414 (42%), Gaps = 78/414 (18%)
Query: 80 EKNRIVRSLVNDQDKVGRREEKFKKV--ADVNDAVELSISASEALVIHELSNNEPDSEDL 137
E N+ S + + ++ R+ +K+ ADVNDAVELSI+ASEA+VI E+ + S D
Sbjct: 162 EINKTCDSKGHRRQQLSGRKVDVRKLRSADVNDAVELSIAASEAMVIAEMILLDSQS-DK 220
Query: 138 LTTIVLEAALQVKQAR----LENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGLPS 193
L T LEAAL VK+AR LE E + ++DET +SELD+ M A +DVGL
Sbjct: 221 LATASLEAALHVKEARKQCFLEELEHSCGSSESDLDETDRLSELDETEMLDAFQDVGLS- 279
Query: 194 AGSDENFVSGSESFLIKCTPLSENYYESNDNGFKHVECLAHKKKLPADPALNLNGSYMEG 253
L++ S+ N +G K ++H P D
Sbjct: 280 --------------LVQTACASQG---QNISGLKQE--ISHASSHPCD------------ 308
Query: 254 KIDPMLHQSTQETSHVLAAAQPEASTNSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
+HVL ++ PE WNS++A + + S+ + GG
Sbjct: 309 -----------AEAHVLESS-PEKQN---IRWNSHDADTNDHVSDSLASHNKE---GGV- 349
Query: 314 GKEVDVPIQLEQSNNKGIPKFYVNETSSLSESADVA---PDENSFVQKHETGSKIAS-QS 369
V +Q K + + ETS +SES D P + S AS
Sbjct: 350 -----VAVQTNVGTRKHVKGLFNKETSFISESMDGMDEFPSPSRIASMEMAASSRASFLH 404
Query: 370 SIPFEGVNNKADEEISFCQDVISSAQSFVDPLCSFVPCSVS---AVDASSPQTINVAAAQ 426
I N+ E C V+ S S VDPLCS VPCS+S + +P+ I +
Sbjct: 405 KIKGSCEENQGAEAAQLCSQVVCSNLSLVDPLCSIVPCSISFNEGPPSQAPECIQ-SKGD 463
Query: 427 KQLLEVENLQWTSDFNAEFINRDRQDVSTFSVEGFENPVKRQLTLLKPYSTLSP 480
K+L+ + D E VS + +R+ + L+P+ST++P
Sbjct: 464 KELISTKEFPSKQDLEGE-AGPSCTPVSNILLR------RRKYSSLRPFSTIAP 510
>gi|357114949|ref|XP_003559256.1| PREDICTED: uncharacterized protein LOC100844410 [Brachypodium
distachyon]
Length = 931
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 107/254 (42%), Positives = 151/254 (59%), Gaps = 6/254 (2%)
Query: 701 RKYV--KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPN-SQGKRCSRSHS 757
R+Y+ K IF G++FLLTG S KEKEIE I+K+GG +L +PP + + +
Sbjct: 678 REYIDKKEAIFHGLEFLLTGLQSHKEKEIESAIRKFGGCILSKVPPCTFDKRSKLAEFAR 737
Query: 758 QQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRT 817
+ PVV+ KK+ T KFLYGCA +++IL WL DS+ AG + P KY+I A +
Sbjct: 738 WKPPVVLSSKKVSTAKFLYGCATDSWILNPNWLFDSIEAGVLLPPGKYLIRQRHAVKESL 797
Query: 818 GITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNI 877
+ + N + +F VG ++HGK SFC+KF+ I+KHGGGQVF +L L+ SL+ + +
Sbjct: 798 TFGQSVHLRNDRLVFRGVGFLIHGKISFCSKFSNIIKHGGGQVFVSLEGLIRSLKDRSSS 857
Query: 878 VGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKHIPSPTVNVPE 937
G+I+ NE SRHL HC E I VPASWII SL G L P +D+ + +P
Sbjct: 858 HGIILVANEASASRHLSHCGMEHDIKTVPASWIISSLFSGKLNPLKKDRCASFRRIKMPS 917
Query: 938 FRI---SIDWSEEI 948
F+ + D S+EI
Sbjct: 918 FQPRHGAFDMSQEI 931
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 60/97 (61%), Gaps = 8/97 (8%)
Query: 100 EKFKKVADVNDAVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQARL----- 154
+K + V D NDAVELSI+ASEA++I E+ N+ D L LE+AL VK+AR
Sbjct: 191 QKLRNV-DANDAVELSIAASEAILIAEMIINDCQP-DKLAAAALESALHVKEARKQYCLE 248
Query: 155 ENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGL 191
E + D + N ++DE ++ELD++ M A EDVGL
Sbjct: 249 ETEHDCGSFEN-DLDERDWLAELDEVEMLDAFEDVGL 284
>gi|356539174|ref|XP_003538075.1| PREDICTED: uncharacterized protein LOC100808072 [Glycine max]
Length = 186
Score = 196 bits (498), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 102/185 (55%), Positives = 129/185 (69%), Gaps = 3/185 (1%)
Query: 767 KKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPICRD 826
+KL+TTKFLYGCAV A ILK W+TD VA+ + + P KYMIL ++ D+K T I I
Sbjct: 2 RKLQTTKFLYGCAVGASILKVDWITDCVASRTILQPEKYMILPNRKDMKWTRIGTTIHHR 61
Query: 827 NHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVAENE 886
N K IF RVGI+LHGKPSFCTK A I+KHGGG VFKTL L S + ++ +VG IV E++
Sbjct: 62 NQKDIFERVGILLHGKPSFCTKLACIIKHGGGHVFKTLQGLEWSTDEERTLVGAIVVEDK 121
Query: 887 RRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQDKH---IPSPTVNVPEFRISID 943
+SRHL+HCA E+ I ++P SWIIKSL+ G LLP T++K+ +P V V E S D
Sbjct: 122 ATISRHLKHCAKERNIPIMPFSWIIKSLYSGKLLPFTEEKNTLSLPFVNVKVSEVPSSSD 181
Query: 944 WSEEI 948
SEEI
Sbjct: 182 MSEEI 186
>gi|414873573|tpg|DAA52130.1| TPA: hypothetical protein ZEAMMB73_979881 [Zea mays]
Length = 1017
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 100/248 (40%), Positives = 139/248 (56%), Gaps = 23/248 (9%)
Query: 642 RKRIRLSEVEVDLRQNRDIEKLQSSHKF-HSVTRTSEMSKNSNACRDSQLQYVKRCSTTR 700
RKR++ SE + R+ ++ ++ + +F S +RT EM +
Sbjct: 672 RKRVQFSEAKRSSRRTKNNRRILTKSRFSRSDSRTGEMLET------------------- 712
Query: 701 RKYV--KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP-PNSQGKRCSRSHS 757
R+Y K FQG++FLLTGF QKEKEIE LI+K GG VL + P P + K + S
Sbjct: 713 REYTDNKEDTFQGVEFLLTGFPKQKEKEIESLIRKSGGYVLSKVSPFPLDKRKNMAEFPS 772
Query: 758 QQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRT 817
P+V+ PKK+ T KFLYGCA++A+IL WL DS+ AG + P KY+I A +
Sbjct: 773 WNPPIVLSPKKVSTAKFLYGCAIDAWILNPIWLFDSLQAGIMLPPGKYLIRQRNAQKHNS 832
Query: 818 GITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNI 877
P+ + IF VG ++HGK SFC+KF+ IVKHGGGQVF +L LV SL+
Sbjct: 833 SFDRPLHPKCNTLIFDGVGFLIHGKISFCSKFSNIVKHGGGQVFISLQGLVQSLKDGSTS 892
Query: 878 VGVIVAEN 885
G+I+ N
Sbjct: 893 RGIILIAN 900
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 80/164 (48%), Gaps = 26/164 (15%)
Query: 45 LAETFACSGE-------SGNQFKIDNNASDMEVL----STSSYPKTEKNRIVRSLVNDQD 93
L+E+ GE S NQF+ ++ DM S P T ++R R N
Sbjct: 93 LSESLCADGEEQEVNAVSQNQFEANDPQVDMPTKVARKEISIQPDTIRHR--RHGFNG-- 148
Query: 94 KVGRREEKFKKVADVNDAVELSISASEALVIHE--LSNNEPDSEDLLTTIVLEAALQVKQ 151
G+ + + + DVNDA+ELSI+ASEA+VI E L +++PD +EAAL VK+
Sbjct: 149 --GKVDVRKLRNVDVNDAIELSIAASEAMVIAEMILDDSQPDKS---AAAAIEAALHVKE 203
Query: 152 AR----LENQEDTSHYPNKEIDETHLVSELDDLTMASACEDVGL 191
AR E + +DE ++ELD+ M A +DV L
Sbjct: 204 ARKQFYFEEPQHACGSSEGGLDEIDWLAELDESEMVDAYQDVAL 247
>gi|449527384|ref|XP_004170691.1| PREDICTED: uncharacterized LOC101203785, partial [Cucumis sativus]
Length = 504
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 131/378 (34%), Positives = 189/378 (50%), Gaps = 70/378 (18%)
Query: 64 NASDMEVLSTSSYPKTEKNRIVRSLVNDQDKVGRREEKFK----KVADVNDAVELSISAS 119
NA D ++ SSY N +SL N +D+ R+ EK K A+V+DA+ELS+ AS
Sbjct: 146 NAGDTDLAPHSSYKDVLHNVGCQSLTNTEDRENRQGEKLDVGCLKNAEVSDAIELSVVAS 205
Query: 120 EALVIHELSNNEPDSEDLLTTIVLEAALQVKQARLENQEDTSHYPNKEIDETHLVSELDD 179
EALVIHEL +E DS + VLEA++Q E+D + +S+LD+
Sbjct: 206 EALVIHELLKDELDSAAVSVEAVLEASIQ------------------EVDLSDSLSDLDN 247
Query: 180 LTMASACEDVGLPSAGSDENFVSGSESFLIKCTPLSENYYESND---------------N 224
TM A +DVGLPS+ + + SG+ F ++ TP+++N + N
Sbjct: 248 STMRDAFDDVGLPSSILNSDH-SGTACFDVQDTPVNKNEFTHGSQCNSIDMTSQPDILGN 306
Query: 225 GF--KHVE---CLAHKKKLPA-DPALNLN---------GSYMEG--KIDPMLHQSTQETS 267
G K +E + LP D + N+ GS K D ML TQ S
Sbjct: 307 GLTLKQLEENLVVTRPVGLPMEDLSCNIQHQLSNDDVLGSTSTNYCKYDSMLQHPTQNES 366
Query: 268 HVLAAAQPEAST--------------NSIYAWNSNNAVGKEEICCSVVDRFRSRWFGGWT 313
Q S+ +S++ + +A E + +RF+SRW GGW+
Sbjct: 367 DEFVVKQKIVSSIVNTNLCTIHAKENSSLHESSKVSAKNDELVAFFTPERFKSRWLGGWS 426
Query: 314 GKEVDVPIQLEQS-NNKGIPKFYVNETSSLSESADVAPDENSFVQKHETGSKIASQSSIP 372
GKEVDV QL Q + K IP +VNETS LSESAD+APDENS VQ+ E+ ++ASQSSI
Sbjct: 427 GKEVDVSEQLRQDVDGKTIPLMFVNETSFLSESADIAPDENSCVQRCESKFQVASQSSIH 486
Query: 373 FEGVNNKADEEISFCQDV 390
F ++ K D+ + +++
Sbjct: 487 FGHLDEKGDDGLLVAEEI 504
>gi|242037659|ref|XP_002466224.1| hypothetical protein SORBIDRAFT_01g003850 [Sorghum bicolor]
gi|241920078|gb|EER93222.1| hypothetical protein SORBIDRAFT_01g003850 [Sorghum bicolor]
Length = 852
Score = 112 bits (280), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 83/225 (36%), Positives = 119/225 (52%), Gaps = 10/225 (4%)
Query: 584 PPFPNRGTCRSHASKLIGDVNNSFGKTN-PDEAGVQETVVMLHNLSEERNNYLETQVPIR 642
P N+ R A+K + S G N D + VQ+ + SE++ E QVP
Sbjct: 634 PIILNKNVQRVQAAKKF--IQYSAGAENLQDFSAVQKQTYYPQDDSEDQ--IRELQVP-- 687
Query: 643 KRIRLSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRK 702
+ + S + +Q+ +++Q S HS RT + N R S+ + R+
Sbjct: 688 RDVFPSTENLVGKQHLKRKRVQFSEANHSSRRTKINRRILNKSRFSRSDSRTGQTLEARE 747
Query: 703 YV--KRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPP-PNSQGKRCSRSHSQQ 759
Y K FQG++FLLTGF +QKEKEIE LI+K GG VL +PP P + K + S +
Sbjct: 748 YTDNKEATFQGVEFLLTGFPNQKEKEIESLIRKSGGYVLSKVPPFPLDKRKSMAEFPSWK 807
Query: 760 LPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAK 804
P+V+ PKK+ T KFLYGCA++A+IL WL DS+ AG + P K
Sbjct: 808 PPIVLSPKKVSTAKFLYGCAIDAWILNPNWLFDSLQAGVMLPPGK 852
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 74/145 (51%), Gaps = 14/145 (9%)
Query: 55 SGNQFKIDNNASDM----EVLSTSSYPKTEKNRIVRSLVNDQDKVGRREEKFKKVADVND 110
S NQF+ +N ++M EV + + NR R V G+ + + + ADVND
Sbjct: 143 SQNQFEANNPQAEMLTVTEVACKEISKQLDTNRHKRHDVYG----GKVDVRKLRNADVND 198
Query: 111 AVELSISASEALVIHELSNNEPDSEDLLTTIVLEAALQVKQAR----LENQEDTSHYPNK 166
A+ELSI+ASEA+VI E+ + D +EAAL VK+AR E E
Sbjct: 199 AIELSIAASEAMVIAEMILD--SQSDKSAAAAIEAALHVKEARKQFYFEEPEHACGSSED 256
Query: 167 EIDETHLVSELDDLTMASACEDVGL 191
+DET ++ELD+ M +DVGL
Sbjct: 257 GLDETDWLAELDESEMVDVYQDVGL 281
>gi|147822457|emb|CAN66205.1| hypothetical protein VITISV_041149 [Vitis vinifera]
Length = 281
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/67 (53%), Positives = 45/67 (67%)
Query: 839 LHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVAENERRVSRHLRHCAF 898
L+ +P ++ HGGGQVFKTL LV SL+T+ VG IVAE+E R SRHLR CA
Sbjct: 5 LYERPKETEWLSLSYMHGGGQVFKTLQCLVQSLDTESISVGAIVAEDESRASRHLRLCAS 64
Query: 899 EKKIAMV 905
E+KI M+
Sbjct: 65 ERKIPMM 71
>gi|303272239|ref|XP_003055481.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463455|gb|EEH60733.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1435
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 90/182 (49%), Gaps = 24/182 (13%)
Query: 700 RRKYVKRLIFQGIKFLLTGFSSQKEKE-IEVLIQKYGGLVLLDIPPPNSQ---------G 749
RRK K F G+ FLL+G S+ + E + LI+ +GG V ++PPP + G
Sbjct: 1179 RRKPTK--TFAGMTFLLSGMPSKNDVEDLTELIELHGGEVEAEVPPPETPDNFTSSPEVG 1236
Query: 750 KRCSR--SHSQQLPV---VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAK 804
R S+ + ++ V VI PK +T K LY AV A +LK W+T S+ AG +SPA
Sbjct: 1237 ARMSQFAAPPRRARVRMRVITPKPGRTLKHLYAAAVGAPLLKPDWITASLDAGRALSPAS 1296
Query: 805 Y---MILSSQ--ADLKRTGITEPICRDNHKYIF-GRVGIMLHGKPSFCTKFAVIVKHGGG 858
++LS + A + D+ +F GRV L G F +F ++++H G
Sbjct: 1297 TPPGVLLSKRHAAIASDDDDDDDDDDDDDGGLFSGRVAA-LCGDDRFTREFGLLLRHAGA 1355
Query: 859 QV 860
V
Sbjct: 1356 DV 1357
>gi|424513323|emb|CCO65945.1| unnamed protein product [Bathycoccus prasinos]
Length = 1147
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/269 (22%), Positives = 114/269 (42%), Gaps = 33/269 (12%)
Query: 665 SSHKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKYVKRL--IFQGIKFLLTGF-SS 721
++ K T++ + S+ N+ DS KR S T + +F G+ F++T SS
Sbjct: 890 ATDKRRKSTQSVQQSRRRNSVEDS---AEKRKSITGGGGGGKTTALFSGLSFMITSMPSS 946
Query: 722 QKEKEIEVLIQKYGGLVLLDIPPPNSQGKRCSRSHSQQLPVVI--CPKKLKTTKFLYGCA 779
+ E+ I + GG + IP +S + +R LP + K+ +T K LY A
Sbjct: 947 RVMNEMRASITENGGRLFDSIPKFSSSKRVPARKQIVVLPTMNNNSEKRKETAKCLYARA 1006
Query: 780 VNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPICRDNHKYIFGRVGIML 839
+ A + +W+ +S+ K I+ + T+P K F +L
Sbjct: 1007 IGARLRSVRWVRESL--------EKNRIMENDGQ----NATQPT-----KLNFQGTSFLL 1049
Query: 840 HGKPSFCTKFAVIVKHGGGQVF--------KTLHWLVLSLETQKNIVGVIVAENERRVSR 891
G F +F+ ++ H G K L+ + + ++N + +V ++ ++SR
Sbjct: 1050 LGNTFFQDEFSNLMLHAGANTVMTCGKMTKKALNQALKRAQEEQNEIHAVVIQSSEKLSR 1109
Query: 892 HLRHCAFEKKIAMVPASWIIKSLHIGMLL 920
++ E + +V W+ +SL G +L
Sbjct: 1110 EIKSALSESRKIVVRHDWLTESLQRGEIL 1138
>gi|328870861|gb|EGG19233.1| hypothetical protein DFA_02018 [Dictyostelium fasciculatum]
Length = 1006
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 97/221 (43%), Gaps = 19/221 (8%)
Query: 708 IFQGIKFLLTGFSSQKEKEIEV--LIQKYGGLVLLDIPPPNSQGKRCS-RSHSQQ-LPVV 763
IF+G FLLTG + EI + L+ KYGG +L P + K S RS + Q + ++
Sbjct: 774 IFKGYNFLLTGIPEVGDPEINISTLVLKYGGKIL----PVHDFFKAYSYRSKANQSIYLI 829
Query: 764 ICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPI 823
+ ++ KT K+L G + KW+ DS+ + Y + S K + +
Sbjct: 830 LASEQQKTFKYLSGIVHGIPCINKKWVLDSIDQEEPLPYKPYRLPSGYCLSKDQNV---L 886
Query: 824 CRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVA 883
+ I I + + F ++ ++K G +V K ++S+ + + IV
Sbjct: 887 LKHGSNAILSNTRIEIFAESVFLEEWTTLLKSLGAKVVKR----IISVNDRVD----IVL 938
Query: 884 ENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLLPQTQ 924
+ +++S L KI +V W+I+S+ LP Q
Sbjct: 939 TDGKKISSVLLKYTNAMKIPVVTTDWVIQSILEKTKLPYNQ 979
>gi|255080150|ref|XP_002503655.1| predicted protein [Micromonas sp. RCC299]
gi|226518922|gb|ACO64913.1| predicted protein [Micromonas sp. RCC299]
Length = 1388
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/235 (22%), Positives = 91/235 (38%), Gaps = 24/235 (10%)
Query: 708 IFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIP--------------PPNSQGKRCS 753
F + F L+GF + + LI + GG+V IP PP + +
Sbjct: 1143 TFADLTFYLSGFGRDESAALAELIVRNGGVVESQIPSPGDGQTLEPFTPSPPAGAPQPVA 1202
Query: 754 RSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGS-TVSPAKY---MILS 809
+ VV +T K LY A+ A ++ W+ ++ A G+ ++SP++ I+S
Sbjct: 1203 PDPADYTRVVTPTSMGRTLKCLYASAIGAPVVTPAWVREAAATGNPSLSPSRTPPGSIIS 1262
Query: 810 SQADLKRTGIT-EPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLV 868
R + P + IF + L G F +F +++H G V
Sbjct: 1263 RGGRPSRDKTSGHPGSSRVFERIFAGLAFALSGDDGFHAQFGALLEHAGASVVDGSELTD 1322
Query: 869 LSLETQKNIVGVIVAEN---ERRVSRH--LRHCAFEKKIAMVPASWIIKSLHIGM 918
+ + + ++V + ERR R LR A + V W ++SL G
Sbjct: 1323 DAGDDRDARCDLVVVQESPAERRGFRGSALRRAAERHGVPCVWHPWAVESLLRGF 1377
>gi|308810240|ref|XP_003082429.1| unnamed protein product [Ostreococcus tauri]
gi|116060897|emb|CAL57375.1| unnamed protein product [Ostreococcus tauri]
Length = 672
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 91/229 (39%), Gaps = 28/229 (12%)
Query: 694 KRCSTTRRKYVKR--LIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPNSQGKR 751
KR T++ VKR +F + FLL+G + + LI+++GG VL P+S
Sbjct: 440 KRARDTKKATVKRSEKLFASLHFLLSGLDKPQGSTTKELIREHGGFVLT---APSSD--- 493
Query: 752 CSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQ 811
S +VI PK +T K LYG A W+ V A ++
Sbjct: 494 ----VSVNSVIVITPKMGRTMKCLYGVAAGTRFATPSWVEACVEA--------RELIDLD 541
Query: 812 ADLKRTGITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSL 871
L G R + +F V + G +F F +++H G L +
Sbjct: 542 EPLDDNGRMRAKHRSSAGKLFRDVKTAITGNDAFVKDFTSLLRHAGAT-------LEPNP 594
Query: 872 ETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKSLHIGMLL 920
+T + +I+ E+ S +R + + V W+++S+ G LL
Sbjct: 595 QTAERFDYLIIQSGEKPHSAWIR-ASKRLDVPCVRHEWLVESILAGELL 642
>gi|312071487|ref|XP_003138631.1| hypothetical protein LOAG_03046 [Loa loa]
gi|307766206|gb|EFO25440.1| hypothetical protein LOAG_03046 [Loa loa]
Length = 1067
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 101/240 (42%), Gaps = 51/240 (21%)
Query: 708 IFQGIKFLLT-----------GFSSQKEKEIEVLIQKYGGLVLLDI---PPPNSQGKRCS 753
IF GI F++T GFS +KEI +I+ GG V+ D+ P QGK
Sbjct: 848 IFNGITFVVTSALRKNREGEQGFS---KKEIRCMIENGGGKVIDDVMKLP----QGK--- 897
Query: 754 RSHSQQLPV-VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQA 812
P+ +I +T K+L A + + +W++D + KYM+ + +
Sbjct: 898 -------PIYLIADTHYRTHKYLTALARSIPCVSNQWISDCAKENKLLDHMKYMLPAGVS 950
Query: 813 DLKRTGITEPICRDN-----HKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWL 867
L TG +P +N K IF + + P+F + I+ H GG V
Sbjct: 951 LL--TGDMKPWHSNNGTLLSGKRIFISSNNVFYDMPNFSQIWTPIINHMGGTVVP----- 1003
Query: 868 VLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKS-LHIGMLLPQTQDK 926
V+ E G+ + + + + + A + +V + WII++ +H + P+ ++
Sbjct: 1004 VIPTE------GLDILLTDASCTEEVLNIARSQGATVVSSEWIIQAIIHGSLPAPEAHER 1057
>gi|402593984|gb|EJW87911.1| hypothetical protein WUBG_01177 [Wuchereria bancrofti]
Length = 1086
Score = 48.5 bits (114), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 103/240 (42%), Gaps = 51/240 (21%)
Query: 708 IFQGIKFLLT-----------GFSSQKEKEIEVLIQKYGGLVLLDI---PPPNSQGKRCS 753
IF GI F++T GFS +KEI +I+ GG V+ D+ P QGK
Sbjct: 867 IFDGITFVVTSALRKNREGEQGFS---KKEIRCMIENGGGKVIDDVMKLP----QGK--- 916
Query: 754 RSHSQQLPV-VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQA 812
P+ +I +T K+L A + + +W++D + KYM+ + +
Sbjct: 917 -------PIYLIADTHYRTHKYLTALARSIPCVSNQWISDCAKENKLLDHMKYMLPAGVS 969
Query: 813 DLKRTGITEPICRDNHKYIFG-RVGI----MLHGKPSFCTKFAVIVKHGGGQVFKTLHWL 867
L TG +P +N + G RV I + + P+F + I+ H GG V
Sbjct: 970 LL--TGDMKPWHSNNGTLLSGKRVFISSNNVFYDMPNFSQIWTPIINHMGGTVVP----- 1022
Query: 868 VLSLETQKNIVGVIVAENERRVSRHLRHCAFEKKIAMVPASWIIKS-LHIGMLLPQTQDK 926
V+ E G+ + + + + + A + +V + WII++ +H + P+ ++
Sbjct: 1023 VIPEE------GLDILLTDASCTEEILNTARSQGATVVSSEWIIQAIIHGSLPAPEAHER 1076
>gi|170580376|ref|XP_001895237.1| hypothetical protein [Brugia malayi]
gi|158597922|gb|EDP35939.1| conserved hypothetical protein [Brugia malayi]
Length = 983
Score = 48.5 bits (114), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 121/284 (42%), Gaps = 55/284 (19%)
Query: 668 KFHSV-TRTSEMSKNSNACRDSQLQYVKRCSTTRRKYVKRL---IFQGIKFLLT------ 717
K H++ ++TS + + +LQ + S++ K+ IF GI F++T
Sbjct: 720 KQHTIPSKTSSLKGDEALLEPEELQEQQHESSSTNGSAKKALDKIFDGITFVVTSALRKN 779
Query: 718 -----GFSSQKEKEIEVLIQKYGGLVLLDI---PPPNSQGKRCSRSHSQQLPV-VICPKK 768
GFS +KEI +I+ GG V+ D+ P QGK P+ +I
Sbjct: 780 REGEQGFS---KKEIRCMIENGGGKVIDDVMKLP----QGK----------PIYLIADTH 822
Query: 769 LKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITEPICRDNH 828
+T K+L A + + +W++D + KYM+ + + L TG P +N
Sbjct: 823 YRTHKYLTALARSIPCVSNQWISDCAKENKLLDHMKYMLPAGVSLL--TGDMRPWHSNNG 880
Query: 829 KYIFG-RVGI----MLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNIVGVIVA 883
+ G RV I + + P+F + I+ H GG V V+ E G+ +
Sbjct: 881 TLLSGKRVFISSNNVFYDMPNFSQIWTPIINHMGGTVVP-----VIPEE------GLDIL 929
Query: 884 ENERRVSRHLRHCAFEKKIAMVPASWIIKS-LHIGMLLPQTQDK 926
+ + + + A + +V + WII++ +H + P+ ++
Sbjct: 930 LTDASCTEEILNVARSQGATVVSSEWIIQAIIHGSLPAPEAHER 973
>gi|440801631|gb|ELR22641.1| BRCA1 C Terminus (BRCT) domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 972
Score = 46.2 bits (108), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 50/218 (22%), Positives = 96/218 (44%), Gaps = 23/218 (10%)
Query: 708 IFQGIKFLLTGFSSQKEKE--------IEVLIQKYGGLVLLDIPPPNSQGKRCSRSHSQQ 759
+F+G++F++TG S + +KE I LI + GG +L + + + RS
Sbjct: 749 LFRGLQFIVTGLSGEDDKENDDRSAERINRLIGEEGGKLLGNFANASLASQCKKRSGYPG 808
Query: 760 LP--VVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRT 817
LP +++ K T K+L A + WL + + Y++ + + K +
Sbjct: 809 LPSTILLSAKPRTTLKYLLSVAAGLPTVHYNWLIHCCQQRCLLPHSLYLLPAGYSVEKGS 868
Query: 818 GITEPICRDNHKYIFGRVGIMLHGKPSFCTKFAVIVKHGGGQVFKTLHWLVLSLETQKNI 877
+ + IF + + G P F + +++K GG +V + L Q++
Sbjct: 869 DV---FIKKPRGDIFEGTRVEVVGTPEFKNLWKLVLKEGGAKVVERL------FTPQEDR 919
Query: 878 VGVIVAENERRVSRHLRHCAFEK-KIAMVPASWIIKSL 914
V IV+ + +L+ F+K ++ +V WII+SL
Sbjct: 920 VECIVSPPD--PDDYLK-VKFKKLRVPVVGKQWIIQSL 954
>gi|443693208|gb|ELT94638.1| hypothetical protein CAPTEDRAFT_160085 [Capitella teleta]
Length = 199
Score = 45.4 bits (106), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 58/130 (44%), Gaps = 22/130 (16%)
Query: 763 VICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLK-RTGITE 821
++C + +T KFL G + I++ WL S G P Y + + A+ K + +TE
Sbjct: 31 LVCDQVRRTVKFLCGVSRGLVIVRPDWLHQSEEQGVFQDPVTYFVRDASAEKKFKFRLTE 90
Query: 822 PICRDNHKYIFGRVGIMLHGKPSFCT--------KFAVIVKHGGGQVFKTL------HWL 867
+ R N K ML G F T + A I+K GGQ+ K++ H +
Sbjct: 91 SVERANQKG-------MLKGHKCFITANVKPEPKQMADIIKCSGGQLLKSMPKTSSGHTV 143
Query: 868 VLSLETQKNI 877
V+S E + I
Sbjct: 144 VISCEDDEAI 153
>gi|328866094|gb|EGG14480.1| hypothetical protein DFA_12254 [Dictyostelium fasciculatum]
Length = 890
Score = 43.5 bits (101), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 8/112 (7%)
Query: 703 YVKRLIFQGIKFLLTG-FSSQKEKEIEVLIQKYGGLVLLDIPPPN-SQGKRCSRSHSQQL 760
+ ++ +F G+ F L G F +EIE +I++ GG+V LD PPP S K +S +
Sbjct: 764 FSEKRLFYGLAFYLAGEFDQPSRQEIESVIKEAGGIV-LDAPPPKPSNTKELLKS---KC 819
Query: 761 PVVICPKKLKTTKFL--YGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSS 810
V++ P + + L + KW+ DS++ + KY+I S
Sbjct: 820 TVIVHPSYTSSHQQLSQIHLTTKQLPISYKWIFDSISNYQLIPKDKYLIFKS 871
>gi|241602471|ref|XP_002405192.1| topbp1, putative [Ixodes scapularis]
gi|215500576|gb|EEC10070.1| topbp1, putative [Ixodes scapularis]
Length = 1362
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 29/122 (23%), Positives = 58/122 (47%), Gaps = 15/122 (12%)
Query: 687 DSQLQYVKRC--STTRRKYVKRLIFQ-GIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIP 743
DS+ Q+V TRR+ + RL + KF+L+G + +++ ++++ GG++L
Sbjct: 1108 DSESQFVGVTWDDPTRRQEMARLAGKLSEKFMLSGLAEEQKAHYAGVVEELGGVLL---- 1163
Query: 744 PPNSQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPA 803
S+++ ++ ++ LK ++L A F+L +L DS AG +
Sbjct: 1164 --------TSKNYDPEMTHLVLASALKNERYLAAVAAGKFVLHTAYLDDSAKAGQFLDEE 1215
Query: 804 KY 805
Y
Sbjct: 1216 GY 1217
>gi|343477469|emb|CCD11710.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1275
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 72/152 (47%), Gaps = 22/152 (14%)
Query: 674 RTSEMSKNSNACRDSQLQYVKRCSTTRRKYVKRLIFQGIKFLLTGFSSQKEKEIEVLIQK 733
R E+SK S L + +R +++ F G++ L+TG S K +V +
Sbjct: 883 RLREVSKESRNLETPPLLCTIKAHQSRPNFME--WFSGLRVLITGGSESDVKLFDVSGNE 940
Query: 734 YGGLVLLDIPPP-----------NSQGKRCSRSHSQQLPVVICPKKLKTTKFLYGCAVNA 782
G L+++ +PPP +++GK S S S++ P + P+ +KTT+ CA
Sbjct: 941 LGQLLMMRLPPPPMFDAVAGGLQDNRGKSVS-SASREPPYSL-PQDIKTTRKCDSCA--- 995
Query: 783 FILKAKWLTDSV--AAGSTVSPAKYMILSSQA 812
L + DS+ +AG SP + + LSS+
Sbjct: 996 --LVEERRQDSIPPSAGDDPSPMENIFLSSEG 1025
>gi|344233866|gb|EGV65736.1| DNA repair protein [Candida tenuis ATCC 10573]
Length = 1054
Score = 40.8 bits (94), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 35/165 (21%), Positives = 68/165 (41%), Gaps = 17/165 (10%)
Query: 646 RLSEVEVDLRQNRDIEKLQSSHKFHSVTRTSEMSKNSNACRDSQLQY--VKRCSTTRRKY 703
+L E + D ++ ++ + H+F + + RD + +KR +
Sbjct: 90 QLGEQDSDTSEDEEVSGIGPKHEFGQYDVYFQNKHDKQQLRDEAYRDWDLKRRGLQGQVA 149
Query: 704 VKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPNSQGKRCSRSHSQQLPVV 763
+ IF+G + G++ EI L+ YGG+ L + + S +H +
Sbjct: 150 YQEPIFRGCTIYVNGYTVPSINEIHRLVIIYGGIFLPHL------ANKSSATH------I 197
Query: 764 ICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMIL 808
+C K + ++ N ++KAKW+ DSV G + KY +
Sbjct: 198 VCDKLTPRKQQIFK---NCKVVKAKWIVDSVEVGRLLDWKKYRLF 239
>gi|242015866|ref|XP_002428568.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212513202|gb|EEB15830.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 1258
Score = 40.8 bits (94), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 34/71 (47%)
Query: 762 VVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKYMILSSQADLKRTGITE 821
+V+C K L+T K L G +I+ WL DS + +S + Y I + D + I +
Sbjct: 1093 LVVCGKPLRTIKMLKGLVAAKYIVSDTWLIDSCSQNKLLSESSYAIQQADFDGFKCNIPK 1152
Query: 822 PICRDNHKYIF 832
+ + Y+F
Sbjct: 1153 ILSSPDRNYLF 1163
>gi|325189629|emb|CCA24114.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 812
Score = 40.4 bits (93), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 46/109 (42%), Gaps = 15/109 (13%)
Query: 700 RRK---YVKRLIFQGIKFLLTGFSSQKEKEIEVLIQKYGGLVLLDIPPPNSQGKRCSRSH 756
RRK K++I +FLLTG + +I GG V + R
Sbjct: 576 RRKSHTTAKKVITHQFRFLLTGNKDECSSN-STIIAALGGRVC-----------QSGRKF 623
Query: 757 SQQLPVVICPKKLKTTKFLYGCAVNAFILKAKWLTDSVAAGSTVSPAKY 805
Q VIC + +T KF+ GCA +IL+ +L S AG V A +
Sbjct: 624 DTQCTHVICSELKRTEKFVAGCAAGKWILRPSYLIASSQAGHFVDEAPH 672
>gi|348671212|gb|EGZ11033.1| hypothetical protein PHYSODRAFT_520284 [Phytophthora sojae]
Length = 932
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 76/170 (44%), Gaps = 36/170 (21%)
Query: 566 KLVSHGAGLLEPLKESRLPPFPNRGTCRSHASKLIGDVNNSFGKTNPDEAGVQETV--VM 623
+L S G G +PL + PP +G + A++L + NS K E+ + +V V
Sbjct: 188 RLRSTGEGP-KPLTAAVAPPIDTKGPSKRQAARLATRLRNSPLKPRTRESALHASVREVK 246
Query: 624 LHNLSEERNNY---LET-----------QVPIRKRI---------RLSEVEVDLRQNR-D 659
LH L +ER + ET QV + +R+ +LSE E LR ++ D
Sbjct: 247 LHELRDERARWQRDYETMREQVIEEKARQVALFQRLEMLRRSHAKQLSEAEAALRASQAD 306
Query: 660 IEKLQSS------HKFHSVTRTSEMSKNSNACRDSQLQYVKRCSTTRRKY 703
+E+LQ+ H RT+ + +NS + R + V + TR K+
Sbjct: 307 VERLQTRLAQARVSAAHEARRTASLVQNSRSERQ---RLVCALAETRHKF 353
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.130 0.376
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,673,934,193
Number of Sequences: 23463169
Number of extensions: 615722149
Number of successful extensions: 1099888
Number of sequences better than 100.0: 65
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 49
Number of HSP's that attempted gapping in prelim test: 1099713
Number of HSP's gapped (non-prelim): 144
length of query: 948
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 796
effective length of database: 8,792,793,679
effective search space: 6999063768484
effective search space used: 6999063768484
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 82 (36.2 bits)