BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018571
(354 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225460963|ref|XP_002279971.1| PREDICTED: uncharacterized protein LOC100248711 [Vitis vinifera]
gi|297737440|emb|CBI26641.3| unnamed protein product [Vitis vinifera]
Length = 347
Score = 466 bits (1199), Expect = e-129, Method: Compositional matrix adjust.
Identities = 231/349 (66%), Positives = 274/349 (78%), Gaps = 13/349 (3%)
Query: 9 KNKTTSLKKLDLYPVQNLTERNEATKNVTAFLKNLQFNSRLPHEKSKPPSLVSLCLGIVG 68
K+ TTS +KL L P+ +K+ + + +SR P EK+KPPSL SLCLG+VG
Sbjct: 7 KSLTTSFQKLHLSPI---------SKSKPSVIPPTFQSSRSPIEKTKPPSLESLCLGVVG 57
Query: 69 KHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSD 128
KH EDIIGDL EIA+NFP D K+A+A+IARRR+LL DDVI+SLA+SSWEILDISGSDVSD
Sbjct: 58 KHFEDIIGDLGEIAVNFPADTKMAMAAIARRRQLLNDDVIISLAESSWEILDISGSDVSD 117
Query: 129 FGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGI 184
FGL KVA+ CK L+AVDI ++TAAGVSE + HCHSLETLRCGG PRS+H AR+CLGI
Sbjct: 118 FGLAKVAERCKVLRAVDISRCSKVTAAGVSELVWHCHSLETLRCGGCPRSDHTARQCLGI 177
Query: 185 LKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPF 244
KPKLND+EG+SWEEL T+I HGA+SLRW VWP ID +S+E + ECPRIIVNPKPSPF
Sbjct: 178 FKPKLNDIEGESWEELDPTEIAHGAESLRWLVWPKIDNNSLESFAAECPRIIVNPKPSPF 237
Query: 245 GFRGFEVPREAFPDITLDDPFVNDIDPSAWAVPRFASMGISTSLLSPNELSMAEKFRLAF 304
GFRG +VP EA P++ LD+P V DIDP WAV F + + S S EL +AEKFRLAF
Sbjct: 238 GFRGVKVPVEALPNVALDEPIVKDIDPRTWAVSGFTARPTAPSSPSSTELPIAEKFRLAF 297
Query: 305 VERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSLSSR 353
VERD+RLAPKRAKNARQH RRAEREW+ TST AKA+ALAS+A+KSL R
Sbjct: 298 VERDSRLAPKRAKNARQHLRRAEREWVMTSTRAKALALASQASKSLHGR 346
>gi|449444052|ref|XP_004139789.1| PREDICTED: uncharacterized protein LOC101203553 [Cucumis sativus]
gi|449518907|ref|XP_004166477.1| PREDICTED: uncharacterized LOC101203553 [Cucumis sativus]
Length = 367
Score = 436 bits (1121), Expect = e-120, Method: Compositional matrix adjust.
Identities = 216/316 (68%), Positives = 258/316 (81%), Gaps = 8/316 (2%)
Query: 44 QFN-SRLPHEKSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKL 102
QF+ + L E+ KPP+LVSLC+G++GKHLEDII DLD I+ NFP D+K +IA+IARRR+L
Sbjct: 53 QFSGTGLAFERKKPPNLVSLCVGVIGKHLEDIIPDLDVISANFPSDVKQSIAAIARRREL 112
Query: 103 LADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLL 158
L DDVI+SL DSSWE LD+SGS+VSDFGL ++ + CKSL+AVDI +ITAAGVSE +
Sbjct: 113 LNDDVIISLVDSSWETLDVSGSEVSDFGLAEIGKTCKSLRAVDISRCNKITAAGVSELVQ 172
Query: 159 HCHSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWP 218
HC SLETLRCGG PRS++ ARR L I KP+L+D+EGDSWEEL +I +GAQSLRW VWP
Sbjct: 173 HCCSLETLRCGGCPRSDYTARRSLDIFKPRLDDIEGDSWEELDTAEIANGAQSLRWLVWP 232
Query: 219 NIDKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAVPR 278
+DKDS+E+ STECPRI +NPKPSPFGFRG +VP EA P+I LD+ + DIDP WAV R
Sbjct: 233 KVDKDSLEIFSTECPRITINPKPSPFGFRGKQVPGEALPNIALDEHTIVDIDPKTWAVGR 292
Query: 279 -FASMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSA 337
A IS S + +ELS+AEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWM TST A
Sbjct: 293 STARAPISPS--NTSELSLAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMTTSTRA 350
Query: 338 KAIALASKATKSLSSR 353
KA+ALAS+A+KSL SR
Sbjct: 351 KALALASQASKSLQSR 366
>gi|356553046|ref|XP_003544869.1| PREDICTED: uncharacterized protein LOC100784617 [Glycine max]
Length = 351
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 224/358 (62%), Positives = 272/358 (75%), Gaps = 14/358 (3%)
Query: 1 MERNEAATKNKTTSLKKLDLYPVQNLTERNEATKNVTAFLKNLQFNSRLPHEKSKPPSLV 60
M++ + A K TSL+ LDL P N+ ++ + + + QF LP K+KPPSLV
Sbjct: 1 MDKGKGA-KALATSLQNLDLNPPSNVKSKSSIS------ITHPQFPGLLPM-KTKPPSLV 52
Query: 61 SLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILD 120
SLC+G++G+HLEDII DL EIAIN P DIKIA+A+IARRRKLL DD++++LAD+SWEILD
Sbjct: 53 SLCIGVLGRHLEDIIADLSEIAINLPADIKIAVAAIARRRKLLNDDILIALADTSWEILD 112
Query: 121 ISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNH 176
+SGSDVSDFGL+K A++C+ +KA+DI +ITA G+SE + HCH LETLRCGG PR+++
Sbjct: 113 VSGSDVSDFGLIKAAEVCRFIKALDISRCTKITANGISELVKHCHLLETLRCGGCPRTDN 172
Query: 177 AARRCLGILKPKLND-VEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRI 235
ARRCLGI KPK +D VE DSWEEL +I GAQSLRW VWPNIDK+S+E STECPR+
Sbjct: 173 TARRCLGIFKPKFDDYVEEDSWEELDTKEIASGAQSLRWLVWPNIDKNSLEDFSTECPRV 232
Query: 236 IVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAVPRFASMGISTSLLSPNELS 295
+VNPK SPFGF+G EVPREA +I LDD V DIDP W + FA IS S S ELS
Sbjct: 233 VVNPKSSPFGFKGTEVPREALQNIILDDEVVKDIDPRTWTMHGFALKPISPSSSS-TELS 291
Query: 296 MAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSLSSR 353
+AEKFRLAFVERD RLAPKRAKNARQHQRRA RE M ST AKA+ LAS+A+KSL R
Sbjct: 292 VAEKFRLAFVERDNRLAPKRAKNARQHQRRAVRELMLMSTRAKAMVLASQASKSLHGR 349
>gi|363806726|ref|NP_001242271.1| uncharacterized protein LOC100797026 [Glycine max]
gi|255639475|gb|ACU20032.1| unknown [Glycine max]
Length = 351
Score = 427 bits (1097), Expect = e-117, Method: Compositional matrix adjust.
Identities = 222/358 (62%), Positives = 270/358 (75%), Gaps = 14/358 (3%)
Query: 1 MERNEAATKNKTTSLKKLDLYPVQNLTERNEATKNVTAFLKNLQFNSRLPHEKSKPPSLV 60
M++ + A K TSL+ LDL P N+ ++ T + + QF LP +K+KP SLV
Sbjct: 1 MDKGKGA-KALATSLQNLDLNPPSNIKSKSSIT------IAHPQFPGLLP-KKAKPLSLV 52
Query: 61 SLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILD 120
SLC+G++G+HLEDII DL EIAIN P DIKIA+A+IARRRKLL DDV+++LAD+SWEILD
Sbjct: 53 SLCVGVLGRHLEDIIADLSEIAINLPADIKIAVAAIARRRKLLNDDVLIALADTSWEILD 112
Query: 121 ISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNH 176
+SGSDVSDFGL+K A++C+ +KA+DI +ITA G+SE + HC LETLRCGG PRS++
Sbjct: 113 VSGSDVSDFGLIKAAEVCRFIKALDISRCTKITANGISELVKHCRLLETLRCGGCPRSDN 172
Query: 177 AARRCLGILKPKLND-VEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRI 235
ARRCLGI KPK +D VE DSWEEL +I GAQSL W VWPNIDK+S+E STECPR+
Sbjct: 173 TARRCLGIFKPKFDDYVEEDSWEELDTKEIASGAQSLGWLVWPNIDKNSLEDFSTECPRV 232
Query: 236 IVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAVPRFASMGISTSLLSPNELS 295
+VNPK SPFGF+G EVP+EA +I LDD V DIDP W + FA +S S LS ELS
Sbjct: 233 MVNPKSSPFGFKGTEVPQEALQNILLDDEVVKDIDPRTWTMHGFALKPMSPS-LSSTELS 291
Query: 296 MAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSLSSR 353
+AEKFRLAFVERD RLAPKRAKNARQHQRRA RE M ST AKA+ LAS+ +KSL R
Sbjct: 292 VAEKFRLAFVERDNRLAPKRAKNARQHQRRAVRELMLISTRAKAMVLASQVSKSLHGR 349
>gi|357494547|ref|XP_003617562.1| hypothetical protein MTR_5g092970 [Medicago truncatula]
gi|355518897|gb|AET00521.1| hypothetical protein MTR_5g092970 [Medicago truncatula]
gi|388520841|gb|AFK48482.1| unknown [Medicago truncatula]
Length = 355
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 207/323 (64%), Positives = 251/323 (77%), Gaps = 7/323 (2%)
Query: 35 NVTAFLKNLQFNSRLPHEKSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIA 94
++T + QF L K+KPPSLV+LC+G++G+HLEDII DLDEIAI P +IK+A+A
Sbjct: 34 SITTTITRPQFPGFL--TKTKPPSLVNLCIGLIGRHLEDIIEDLDEIAIGLPAEIKLAVA 91
Query: 95 SIARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITA 150
+IARRRK L DDV+++LAD+SWEILD+SGSDVSD GLVK A++C+S+KA+DI +ITA
Sbjct: 92 AIARRRKFLNDDVLIALADASWEILDVSGSDVSDLGLVKAAEVCRSVKALDISRCTKITA 151
Query: 151 AGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQ 210
G+SE + HCHSLETLRCGG PRS++ ARRCL I KPKL V DSWEEL ++ +GAQ
Sbjct: 152 TGISELVKHCHSLETLRCGGCPRSDNTARRCLSIFKPKLEYVAEDSWEELDTKEMANGAQ 211
Query: 211 SLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDID 270
SLRW VWPNID +S+E STECPRI+VNPKPSPFGF G +VP EAF +I LDD V DID
Sbjct: 212 SLRWLVWPNIDNNSLEDFSTECPRIVVNPKPSPFGFMGTQVPFEAFQNIILDDAVVKDID 271
Query: 271 PSAWAVPRFASMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREW 330
P W + A IS+ S ELS+AEKFRLAF ERD RLAPKRAKNARQHQRRA R+
Sbjct: 272 PKTWTMHGIAKRPISSPSSS-TELSVAEKFRLAFEERDNRLAPKRAKNARQHQRRAARDM 330
Query: 331 METSTSAKAIALASKATKSLSSR 353
+ STSAKA+ LAS+A+KSL SR
Sbjct: 331 LLMSTSAKAVVLASQASKSLHSR 353
>gi|224116614|ref|XP_002317347.1| predicted protein [Populus trichocarpa]
gi|222860412|gb|EEE97959.1| predicted protein [Populus trichocarpa]
Length = 355
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 210/352 (59%), Positives = 260/352 (73%), Gaps = 9/352 (2%)
Query: 9 KNKTTSLKKLDLYPVQNLTERNEATKNVTAFLKN---LQFNSRLPHEKSKPPSLVSLCLG 65
K TTSLKKLDL +T + + L++ L R P KSKPPSL+SLCLG
Sbjct: 5 KEITTSLKKLDLNS-NIMTSTPSKSSLFSPALQSPGLLSCRRRRPG-KSKPPSLLSLCLG 62
Query: 66 IVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDISGSD 125
+VGK+L+DII L +I+I FP IK+ IA+IARRRK+L D+VI+ LAD+SWEILDISGS+
Sbjct: 63 VVGKNLQDIIAHLSDISIAFPPHIKMTIAAIARRRKMLCDEVIIPLADTSWEILDISGSE 122
Query: 126 VSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRC 181
V+D GL++V + CK L+AVDI +ITA+ VS + HC SL+TLRCGG PRS++ AR C
Sbjct: 123 VTDSGLIEVTKTCKFLRAVDISRCNKITASSVSVLVEHCKSLQTLRCGGCPRSDYTARCC 182
Query: 182 LGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKP 241
L +LKPKL+D+ GDSWEEL +I H A+SL W VWP IDKDS+E+++TECPRI VNPK
Sbjct: 183 LTLLKPKLDDMVGDSWEELDTAEISHNAESLHWLVWPKIDKDSLEILATECPRISVNPKW 242
Query: 242 SPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAVPRFASMGISTSLLSPNELSMAEKFR 301
SPFGFRG ++P EAFPD LDD FV +I+PS WA +S L + ELS+AEKFR
Sbjct: 243 SPFGFRGKDIPVEAFPDTALDDLFVQEINPSTWAANGITLKPVSPILSNSKELSLAEKFR 302
Query: 302 LAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSLSSR 353
LAFVERDTRLAPKRAKNARQHQRR++REWM S +KAI LAS+ +KSL R
Sbjct: 303 LAFVERDTRLAPKRAKNARQHQRRSDREWMTMSAESKAIVLASQVSKSLHGR 354
>gi|30687441|ref|NP_194428.2| uncharacterized protein [Arabidopsis thaliana]
gi|63003790|gb|AAY25424.1| At4g26980 [Arabidopsis thaliana]
gi|90093278|gb|ABD85152.1| At4g26980 [Arabidopsis thaliana]
gi|110737995|dbj|BAF00933.1| hypothetical protein [Arabidopsis thaliana]
gi|332659880|gb|AEE85280.1| uncharacterized protein [Arabidopsis thaliana]
Length = 343
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 211/344 (61%), Positives = 257/344 (74%), Gaps = 18/344 (5%)
Query: 14 SLKKLDLYPVQNLTERNEATKN---VTAFLKNLQFNSRLPHEKSKPPSLVSLCLGIVGKH 70
SLK LDL T R +N V+A++ + SR+ KSKPPSLVS CLG++GKH
Sbjct: 8 SLKNLDLN-----TNRGRGPENKILVSAYVSS----SRMSPLKSKPPSLVSSCLGVIGKH 58
Query: 71 LEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFG 130
LED+I L EI++ FP DIK++IA+IARR+KLL DDVI+ LADSSWEILD+SGSDV++FG
Sbjct: 59 LEDMIRCLAEISVIFPADIKMSIAAIARRKKLLDDDVIICLADSSWEILDVSGSDVTNFG 118
Query: 131 LVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGILK 186
L KVA++CKSL+AVDI +I++ GV E + HC SLETLRCGG P S ARR L I K
Sbjct: 119 LAKVAEICKSLRAVDISRCNKISSMGVLELVQHCRSLETLRCGGCPSSESTARRSLSIFK 178
Query: 187 PKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGF 246
P L++VEG++WEE+ ++IGHG QSLRW VWP IDKDS+EM+S+ECPRI+VNPKPS +
Sbjct: 179 PNLSNVEGETWEEIDTSEIGHGGQSLRWLVWPRIDKDSLEMLSSECPRIVVNPKPSLVAY 238
Query: 247 RGFEVPREAFPDITLDDPFVNDIDPSAWAVPRFASMGISTSLLSPNELSMAEKFRLAFVE 306
R EVPREA PD+ LD+PFV DIDP W V S L NELS+AEKFRLAF E
Sbjct: 239 RADEVPREALPDVALDEPFVKDIDPKTWVVTGVVQKPTSFPL--SNELSIAEKFRLAFAE 296
Query: 307 RDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSL 350
RD R+APKRAKNARQ QRRAER+WM +S AKA+ ASKAT+SL
Sbjct: 297 RDARMAPKRAKNARQRQRRAERDWMMSSDEAKAMVFASKATRSL 340
>gi|297799294|ref|XP_002867531.1| hypothetical protein ARALYDRAFT_492098 [Arabidopsis lyrata subsp.
lyrata]
gi|297313367|gb|EFH43790.1| hypothetical protein ARALYDRAFT_492098 [Arabidopsis lyrata subsp.
lyrata]
Length = 342
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 211/344 (61%), Positives = 254/344 (73%), Gaps = 19/344 (5%)
Query: 14 SLKKLDLYPVQNLTERNEATKN---VTAFLKNLQFNSRLPHEKSKPPSLVSLCLGIVGKH 70
SLK LDL RN K V+A+L + + S L KSKPPSLVSLCLG++GKH
Sbjct: 8 SLKNLDL-------NRNRGPKKKILVSAYLSSSRIMSPL---KSKPPSLVSLCLGVIGKH 57
Query: 71 LEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFG 130
LE++I L +I++ FP DIK++IA+IARR+KLL DDVI SLADSSWEILD+SGSDV++ G
Sbjct: 58 LEEMIPCLADISVIFPADIKMSIAAIARRKKLLDDDVITSLADSSWEILDVSGSDVTNSG 117
Query: 131 LVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGILK 186
L KVA+MCKSL+AVDI +I++ GV E + HC SLETLRCGG P S ARR L K
Sbjct: 118 LAKVAEMCKSLRAVDISRCNKISSMGVLELVQHCRSLETLRCGGCPSSESTARRSLSFFK 177
Query: 187 PKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGF 246
P L++ EG++WEE+ ++IGHG QSLRW VWP IDKDS+EM+S ECPRI+VNPKPS +
Sbjct: 178 PNLSNGEGETWEEIDTSEIGHGGQSLRWLVWPRIDKDSLEMLSLECPRIVVNPKPSLVAY 237
Query: 247 RGFEVPREAFPDITLDDPFVNDIDPSAWAVPRFASMGISTSLLSPNELSMAEKFRLAFVE 306
R EVPREA PD+ +D+PFV DIDP W V TS L NEL +AEKFRLAF E
Sbjct: 238 RADEVPREALPDVAVDEPFVKDIDPKTWVVTGVVQK--PTSFLLCNELPIAEKFRLAFAE 295
Query: 307 RDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSL 350
RD RLAPKRAKNARQ QRRAER+WM +S AKA+ LASKAT+SL
Sbjct: 296 RDARLAPKRAKNARQRQRRAERDWMMSSDEAKAMVLASKATRSL 339
>gi|255583615|ref|XP_002532563.1| conserved hypothetical protein [Ricinus communis]
gi|223527718|gb|EEF29824.1| conserved hypothetical protein [Ricinus communis]
Length = 266
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 189/264 (71%), Positives = 221/264 (83%), Gaps = 4/264 (1%)
Query: 91 IAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI---- 146
+A+A+I+RRRKLL DDVI+SLADSSWEILD+SGSDV+D GL+ +A++CK L+AVDI
Sbjct: 1 MALAAISRRRKLLTDDVIISLADSSWEILDLSGSDVTDSGLMNIAEICKFLRAVDISRCH 60
Query: 147 RITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIG 206
ITAAGVSE + HC SL+TLRCGG PRS++ AR L ILKPKLND+EGDSWEEL +IG
Sbjct: 61 NITAAGVSEVIKHCKSLQTLRCGGCPRSDYCARNSLYILKPKLNDMEGDSWEELDTAEIG 120
Query: 207 HGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDPFV 266
HGA+SLRW VWP IDK+S+E+M TECPRIIVNPKPSPFGFRG EVP+EA PD+ LDDP V
Sbjct: 121 HGAESLRWLVWPKIDKNSMEIMDTECPRIIVNPKPSPFGFRGIEVPKEALPDMVLDDPIV 180
Query: 267 NDIDPSAWAVPRFASMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRA 326
DIDP WA FA +++SL S ELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRA
Sbjct: 181 KDIDPKTWATHGFAPKPVASSLSSSTELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRA 240
Query: 327 EREWMETSTSAKAIALASKATKSL 350
E+E M +T AKA+ALAS+A+KSL
Sbjct: 241 EKELMMMNTKAKALALASRASKSL 264
>gi|4455221|emb|CAB36544.1| putative protein [Arabidopsis thaliana]
gi|7269551|emb|CAB79553.1| putative protein [Arabidopsis thaliana]
Length = 403
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 199/330 (60%), Positives = 238/330 (72%), Gaps = 34/330 (10%)
Query: 53 KSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLA 112
KSKPPSLVS CLG++GKHLED+I L EI++ FP DIK++IA+IARR+KLL DDVI+ LA
Sbjct: 73 KSKPPSLVSSCLGVIGKHLEDMIRCLAEISVIFPADIKMSIAAIARRKKLLDDDVIICLA 132
Query: 113 DSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDIR------------------------- 147
DSSWEILD+SGSDV++FGL KVA++CKSL+AVDIR
Sbjct: 133 DSSWEILDVSGSDVTNFGLAKVAEICKSLRAVDIRYYSCLLRIYSGLAGSLVWGFCMLDK 192
Query: 148 -------ITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEEL 200
I++ GV E + HC SLETLRCGG P S ARR L I KP L++VEG++WEE+
Sbjct: 193 WCSRCNKISSMGVLELVQHCRSLETLRCGGCPSSESTARRSLSIFKPNLSNVEGETWEEI 252
Query: 201 VNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDIT 260
++IGHG QSLRW VWP IDKDS+EM+S+ECPRI+VNPKPS +R EVPREA PD+
Sbjct: 253 DTSEIGHGGQSLRWLVWPRIDKDSLEMLSSECPRIVVNPKPSLVAYRADEVPREALPDVA 312
Query: 261 LDDPFVNDIDPSAWAVPRFASMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNAR 320
LD+PFV DIDP W V TS NELS+AEKFRLAF ERD R+APKRAKNAR
Sbjct: 313 LDEPFVKDIDPKTWVVTGVVQK--PTSFPLSNELSIAEKFRLAFAERDARMAPKRAKNAR 370
Query: 321 QHQRRAEREWMETSTSAKAIALASKATKSL 350
Q QRRAER+WM +S AKA+ ASKAT+SL
Sbjct: 371 QRQRRAERDWMMSSDEAKAMVFASKATRSL 400
>gi|343172605|gb|AEL99006.1| hypothetical protein, partial [Silene latifolia]
Length = 314
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 198/310 (63%), Positives = 236/310 (76%), Gaps = 4/310 (1%)
Query: 45 FNSRLPHEKSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLA 104
F+ L +++K PSL+SLC+G++GKHLED+I DL E+AINFP DIKIAIA+IARR+KLL
Sbjct: 5 FSPVLQVKRTKAPSLLSLCIGVIGKHLEDLIPDLGEVAINFPADIKIAIAAIARRKKLLN 64
Query: 105 DDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHC 160
D+VI+ LAD +WEILDIS SDVSDFGL K+A+MC+SL+AVDI IT GV+ + C
Sbjct: 65 DEVILLLADDTWEILDISSSDVSDFGLAKIAEMCRSLRAVDISRCRNITPDGVARLVQLC 124
Query: 161 HSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNI 220
SLETLRCGG PRS+ ARR LG+ KP LN VE DSWEEL I G SLRW VWP I
Sbjct: 125 PSLETLRCGGCPRSDFTARRSLGLFKPALNSVEEDSWEELDTAKITDGGDSLRWIVWPKI 184
Query: 221 DKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAVPRFA 280
+ +E S+ECPRI +NP+PSPFGF+G VPREAFPDI LDDP + DIDPS WA
Sbjct: 185 GPEFLEDFSSECPRIKINPQPSPFGFKGAVVPREAFPDIVLDDPIIEDIDPSTWASFGRP 244
Query: 281 SMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAI 340
S + SP+ELSMAEKFRLAF ERDTRLAPKRAKNARQHQRRAE+EW ++ KA+
Sbjct: 245 SRILVQPFCSPDELSMAEKFRLAFEERDTRLAPKRAKNARQHQRRAEKEWRMMTSEGKAV 304
Query: 341 ALASKATKSL 350
+LAS+ATK L
Sbjct: 305 SLASQATKIL 314
>gi|343172607|gb|AEL99007.1| hypothetical protein, partial [Silene latifolia]
Length = 314
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 197/310 (63%), Positives = 235/310 (75%), Gaps = 4/310 (1%)
Query: 45 FNSRLPHEKSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLA 104
F+ L +++K PSL+SLC+G++GKHLED+I DL E+AI+FP DIKIAIA+IARR+KLL
Sbjct: 5 FSPVLQVKRTKAPSLLSLCIGVIGKHLEDLIPDLGEVAISFPADIKIAIAAIARRKKLLN 64
Query: 105 DDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHC 160
D+VI+ LAD +WEILDIS SDVSDFGL K+A+MC+SL+AVDI IT GV+ + C
Sbjct: 65 DEVILLLADDTWEILDISSSDVSDFGLAKIAEMCRSLRAVDISRCRNITPDGVARLVQLC 124
Query: 161 HSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNI 220
SLETLRCGG PRS+ ARR LG+ KP LN VE DSWEEL I G SLRW VWP I
Sbjct: 125 PSLETLRCGGCPRSDFTARRSLGLFKPALNSVEEDSWEELDTAKITDGGDSLRWIVWPKI 184
Query: 221 DKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAVPRFA 280
+ +E S+ECPRI +NP+PSPFGF+G VPREAFPDI LDDP + DIDPS WA
Sbjct: 185 GPEFLEDFSSECPRIKINPQPSPFGFKGAVVPREAFPDIVLDDPIIEDIDPSTWASFGRP 244
Query: 281 SMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAI 340
S + SP+ELSMAEKFRLAF ERDTRLAPKRAKNARQHQRRAE+EW + KA+
Sbjct: 245 SRILVQPFCSPDELSMAEKFRLAFEERDTRLAPKRAKNARQHQRRAEKEWRMMTAEGKAV 304
Query: 341 ALASKATKSL 350
+LAS+ATK L
Sbjct: 305 SLASQATKIL 314
>gi|357123446|ref|XP_003563421.1| PREDICTED: uncharacterized protein LOC100843238 [Brachypodium
distachyon]
Length = 343
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 188/352 (53%), Positives = 236/352 (67%), Gaps = 21/352 (5%)
Query: 1 MERNEAATKNKTTSLKKLDLYPVQNLTERNEATKNVTAFLKNLQFNSRLPHEKSKPPSLV 60
ME+ ++ SL + + P + T+ L F S P +K++P LV
Sbjct: 1 MEKGKSVVAELAASLSDIQVTP----------RRKPTSLLPATSFYS--PRKKARPRKLV 48
Query: 61 SLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILD 120
SLC G++G+HLEDII D+ E FP IK+AI SIARRR+LL D+V++SLADSSW+ILD
Sbjct: 49 SLCTGVLGQHLEDIISDISEFTSLFPPHIKLAIMSIARRRRLLNDEVLVSLADSSWKILD 108
Query: 121 ISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNH 176
ISGSDVSD GL VA + +L A+DI +ITAA VSE + HC SLE LRCGG PRS
Sbjct: 109 ISGSDVSDVGLTTVAHIFSNLWAIDISRCEKITAAAVSEIICHCPSLEILRCGGCPRSEF 168
Query: 177 AARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRII 236
AR C+ +LKPKLN +E DSWEEL DIG GA+SLRW VWP ID +S E+++ ECPRII
Sbjct: 169 TARACVNLLKPKLNTLEEDSWEELEAVDIGSGAESLRWLVWPKIDDNSKEILAAECPRII 228
Query: 237 VNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAV---PRFASMGISTSLLSPNE 293
VNP+PS FG G ++P EAF +I LD V DIDP WAV PR M +P E
Sbjct: 229 VNPQPSIFGHHGLKIPSEAFANIPLDHSIVEDIDPKTWAVCAAPR--RMAAPPQPNAPPE 286
Query: 294 LSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASK 345
+ +AE+FRLA+VERD RLAPKRA+ RQH+RRAER+++ AK+IALASK
Sbjct: 287 IPIAERFRLAYVERDARLAPKRARRERQHRRRAERDYLMNDIDAKSIALASK 338
>gi|218198755|gb|EEC81182.1| hypothetical protein OsI_24182 [Oryza sativa Indica Group]
gi|222636091|gb|EEE66223.1| hypothetical protein OsJ_22371 [Oryza sativa Japonica Group]
Length = 346
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 190/357 (53%), Positives = 242/357 (67%), Gaps = 21/357 (5%)
Query: 1 MERNEAATKNKTTSLKKLDLYPVQNLTERNEATKNVTAFLKNLQFNSRLPHEKSKPPSLV 60
ME+ ++ SL+ +++ P R A+ L F S P K++P LV
Sbjct: 1 MEKGKSVVAELAASLRDVEVTP-----RRKPASS-----LPAASFYS--PTNKARPRKLV 48
Query: 61 SLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILD 120
SLCLGI+G+HLEDII D+ E + FP IK+AI SIARRR+LL D+V++SLADSSWEILD
Sbjct: 49 SLCLGILGQHLEDIITDISEFSTFFPPHIKLAILSIARRRRLLNDEVLISLADSSWEILD 108
Query: 121 ISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNH 176
ISGSDVSD GL VA + +L A+DI RITAA VSE + HC SLE LRCGG P S
Sbjct: 109 ISGSDVSDIGLATVANISNNLWAIDISRCERITAAAVSEVICHCPSLEILRCGGCPGSES 168
Query: 177 AARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRII 236
ARR + +LKPKLN +E DSWEEL +IG GA+SLRW VWP ID +S E++S ECPRI
Sbjct: 169 TARRSVYLLKPKLNTLEEDSWEELDTVEIGGGAESLRWLVWPKIDDNSKEIISMECPRIT 228
Query: 237 VNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAV---PRFASMGISTSLLSPNE 293
VNP+PSPF RG +VP EA + LD + DIDP WAV PR + + T+ +P E
Sbjct: 229 VNPQPSPFDLRGHKVPAEALASVPLDHSIIADIDPKTWAVAAAPRRPT--VPTNPNAPPE 286
Query: 294 LSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSL 350
+ +AEKFRLA+VER+ RLAPKRAK RQ +RRAERE++ +AK++ALA++ +K L
Sbjct: 287 IPIAEKFRLAYVEREARLAPKRAKRERQQRRRAEREYLMNDINAKSVALAAQVSKYL 343
>gi|242096736|ref|XP_002438858.1| hypothetical protein SORBIDRAFT_10g027310 [Sorghum bicolor]
gi|241917081|gb|EER90225.1| hypothetical protein SORBIDRAFT_10g027310 [Sorghum bicolor]
Length = 348
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 186/357 (52%), Positives = 237/357 (66%), Gaps = 19/357 (5%)
Query: 1 MERNEAATKNKTTSLKKLDLYPVQNLTERNEATKNVTAFLKNLQFNSRLPHEKSKPPSLV 60
M++ ++ S + + P QN + +FL + F S +K+ P LV
Sbjct: 1 MDKGKSVVAELAASFSVVRVAPRQNPKPK--------SFLPSPSFYSF--SKKANPRKLV 50
Query: 61 SLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILD 120
SLCLG +G+HLEDII D+ E A FP IK+AI SIARRR+LL DDV+ SLA+SSWEILD
Sbjct: 51 SLCLGTLGQHLEDIITDISEFAALFPPHIKLAIMSIARRRRLLNDDVLTSLAESSWEILD 110
Query: 121 ISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNH 176
ISGSDV+D GL VA +C +L+AVDI +IT AGVSE + HC SLE LRCGG PRS
Sbjct: 111 ISGSDVTDAGLATVANVCSNLRAVDISRCDKITTAGVSEIVCHCPSLEILRCGGCPRSEV 170
Query: 177 AARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRII 236
ARRCL +LKPKLN +E DSWEEL DIG GA+SLRW VWP ID +S E ++ ECPR+
Sbjct: 171 TARRCLNLLKPKLNTLEEDSWEELDTLDIGGGAESLRWLVWPKIDDNSKETLAAECPRVT 230
Query: 237 VNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAV---PRFASMGISTSLLSPNE 293
VNP+PS F G +VP EA + LD V DIDP WAV PR + + E
Sbjct: 231 VNPQPSLFDLSGSKVPVEALASVPLDHSVVEDIDPKTWAVSAAPRRPVAPPNPNAPP--E 288
Query: 294 LSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSL 350
+ +AE+FRLA+VER+ RLAPKRAK RQ +RRAER++M A++IALA++A+++L
Sbjct: 289 IPIAERFRLAYVEREARLAPKRAKRERQQRRRAERDYMMNDIDARSIALAAQASRNL 345
>gi|226497640|ref|NP_001145222.1| uncharacterized protein LOC100278488 [Zea mays]
gi|195653143|gb|ACG46039.1| hypothetical protein [Zea mays]
Length = 338
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 174/308 (56%), Positives = 217/308 (70%), Gaps = 9/308 (2%)
Query: 52 EKSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSL 111
+K+KP LVSLCLG +G+HLEDII D+ E A FP I +AI SIARR+ LL D V+ SL
Sbjct: 28 KKAKPRKLVSLCLGTLGQHLEDIITDISEFAALFPPHITLAIMSIARRKSLLNDKVLTSL 87
Query: 112 ADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLR 167
A+SSW+ILDISGSDV+D GL VA +C +L+AVDI +IT AGVSE + HC SLE LR
Sbjct: 88 AESSWQILDISGSDVTDAGLATVANVCSNLRAVDISCCEKITTAGVSEIVCHCPSLEILR 147
Query: 168 CGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEM 227
CGG PRS ARRCL +LKPKLN +E DSWEEL DIG GA+SLRW VWP ID S
Sbjct: 148 CGGCPRSEVTARRCLNLLKPKLNALEEDSWEELDTLDIGGGAESLRWLVWPMIDDKSKVT 207
Query: 228 MSTECPRIIVNPKPSPFGFRGF-EVPREAFPDITLDDPFVNDIDPSAWAV----PRFASM 282
++ ECPR+ VNP+PS F G+ +VP EA + LD V DIDP WAV PR +
Sbjct: 208 LAVECPRVTVNPQPSLFNLSGYSKVPIEALSSVPLDHSVVEDIDPKTWAVSAAPPRPVAP 267
Query: 283 GISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIAL 342
+P E+ MAE+FRLA+VER+ RLAPKRAK RQ +RRAER++M A+++AL
Sbjct: 268 VNPNVPSAPPEIPMAERFRLAYVEREARLAPKRAKRERQRRRRAERDYMMVDIDARSVAL 327
Query: 343 ASKATKSL 350
A++A+++L
Sbjct: 328 AAQASRNL 335
>gi|413943347|gb|AFW75996.1| hypothetical protein ZEAMMB73_793752 [Zea mays]
Length = 338
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 172/308 (55%), Positives = 217/308 (70%), Gaps = 9/308 (2%)
Query: 52 EKSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSL 111
+K+KP LVSLCLG +G+HLEDII D+ E A FP I +AI SIARR+ LL D V+ SL
Sbjct: 28 KKAKPRKLVSLCLGTLGQHLEDIITDISEFAALFPPHITLAIMSIARRKSLLNDKVLTSL 87
Query: 112 ADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLR 167
A+SSW+ILDISGSDV+D GL VA +C +L+AVDI +IT AGVSE + HC SLE LR
Sbjct: 88 AESSWQILDISGSDVTDAGLATVANVCSNLRAVDISRCEKITTAGVSEIVCHCPSLEILR 147
Query: 168 CGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEM 227
CGG PRS ARRCL +LKPKLN +E DSWEEL DIG GA+SLRW VWP ID S
Sbjct: 148 CGGCPRSEVTARRCLNLLKPKLNALEEDSWEELDTLDIGGGAESLRWLVWPMIDDKSKVT 207
Query: 228 MSTECPRIIVNPKPSPFGFRGF-EVPREAFPDITLDDPFVNDIDPSAWAVPRFASMGIST 286
++ ECPR+ VNP+PS F G+ +VP EA + LD V DIDP WAV ++
Sbjct: 208 LAVECPRVTVNPQPSLFDLSGYSKVPIEALSSVPLDHSVVEDIDPKTWAVSAAPPRAVAP 267
Query: 287 SLL----SPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIAL 342
+ +P E+ MAE+FRLA+VER+ RLAPKRAK RQ +RRAER++M A+++AL
Sbjct: 268 ANPNVPSAPPEIPMAERFRLAYVEREARLAPKRAKRERQRRRRAERDYMMVDIDARSVAL 327
Query: 343 ASKATKSL 350
A++A+++L
Sbjct: 328 AAQASRNL 335
>gi|326505626|dbj|BAJ95484.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326509709|dbj|BAJ87070.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 169/302 (55%), Positives = 212/302 (70%), Gaps = 6/302 (1%)
Query: 50 PHEKSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIM 109
P +K++P LVSLC+G++G+HLEDII D+ E FP IK+AI SIARRR+LL DDV++
Sbjct: 38 PMKKARPRKLVSLCIGVLGQHLEDIINDISEFTELFPPHIKLAILSIARRRRLLNDDVLV 97
Query: 110 SLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLET 165
SL DSSW+ILDISGS+V+D GL VA C +L AVDI +IT A VSE + HC SLE
Sbjct: 98 SLVDSSWKILDISGSEVTDVGLATVAHTCSNLWAVDISRCEKITVAAVSEIICHCPSLEI 157
Query: 166 LRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSI 225
LRCGG PRS AR C+ +LKPKLN +E DSWEEL D G GAQSLRW VWP I ++S
Sbjct: 158 LRCGGCPRSEFTARGCVNLLKPKLNTLEEDSWEELEAVDFGSGAQSLRWLVWPKIGENSK 217
Query: 226 EMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAVP--RFASMG 283
E+++ ECPR+IVNPKPS G + P EA + LD V DIDP WAV ++
Sbjct: 218 EILAAECPRVIVNPKPSLLDLGGSKTPSEALASVPLDHSVVQDIDPKTWAVSAAPRRAVA 277
Query: 284 ISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALA 343
+P E+ +AE+FRLA+VERD RLAPKRA+ RQH+RRAER+++ AK++ALA
Sbjct: 278 APPRPNAPPEIPIAERFRLAYVERDARLAPKRARRERQHRRRAERDYLMNDIDAKSVALA 337
Query: 344 SK 345
SK
Sbjct: 338 SK 339
>gi|413934420|gb|AFW68971.1| hypothetical protein ZEAMMB73_292359 [Zea mays]
Length = 381
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 181/390 (46%), Positives = 236/390 (60%), Gaps = 52/390 (13%)
Query: 1 MERNEAATKNKTTSLKKLDLYPVQNLTERNEATKNVTAFLKNLQFNSRLPHEKSKPPSLV 60
M++ ++ S + + P QN +FL + F S +K KP LV
Sbjct: 1 MDKGKSVVAELAASFSDVRVTPRQN--------PKTKSFLPSPSFYSS--SKKGKPQKLV 50
Query: 61 SLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILD 120
SLCLG +G+HLEDII D+ E A FP IK+ I SIARRR+LL D+V+ SLA+SSWEILD
Sbjct: 51 SLCLGTLGQHLEDIITDISEFAALFPPHIKLTIMSIARRRRLLNDEVLTSLAESSWEILD 110
Query: 121 ISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRSNH 176
ISGSDV+D GL VA +C +L+AVDI ++T +GVSE + HC SLE LRCGG PRS
Sbjct: 111 ISGSDVTDAGLATVANVCSNLRAVDISRCEKVTTSGVSEIVCHCPSLEILRCGGCPRSEV 170
Query: 177 AARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVW------------------- 217
ARRCL +LKPKLN++E DSWEEL DIG GA+SL+W VW
Sbjct: 171 TARRCLDLLKPKLNNLEEDSWEELNTLDIGGGAESLKWLVWVCKQVHFSSYDLWCVYNSP 230
Query: 218 --------------PNIDKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDD 263
P ID +S E ++ ECPR+ VNP+PS F G +VP EA + LD
Sbjct: 231 IRISCLMNIFPVLQPKIDDNSKETLAAECPRVTVNPQPSLFDLSGSKVPVEALASVPLDH 290
Query: 264 PFVNDIDPSAWAV---PRFASMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNAR 320
V DIDP WAV PR + + E+ +AE+FRLA+VER+ RLAPKRAK R
Sbjct: 291 SIVEDIDPKTWAVSAAPRRPVAPPNPNAPP--EIPIAERFRLAYVEREARLAPKRAKRER 348
Query: 321 QHQRRAEREWMETSTSAKAIALASKATKSL 350
Q +RRAER++M A+++ALA++A+++L
Sbjct: 349 QQRRRAERDYMMNDIDARSMALAAQASRNL 378
>gi|52076580|dbj|BAD45482.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 324
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 156/269 (57%), Positives = 195/269 (72%), Gaps = 9/269 (3%)
Query: 89 IKIAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI-- 146
+ +AI SIARRR+LL D+V++SLADSSWEILDISGSDVSD GL VA + +L A+DI
Sbjct: 55 VCLAILSIARRRRLLNDEVLISLADSSWEILDISGSDVSDIGLATVANISNNLWAIDISR 114
Query: 147 --RITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTD 204
RITAA VSE + HC SLE LRCGG P S ARR + +LKPKLN +E DSWEEL +
Sbjct: 115 CERITAAAVSEVICHCPSLEILRCGGCPGSESTARRSVYLLKPKLNTLEEDSWEELDTVE 174
Query: 205 IGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDP 264
IG GA+SLRW VWP ID +S E++S ECPRI VNP+PSPF RG +VP EA + LD
Sbjct: 175 IGGGAESLRWLVWPKIDDNSKEIISMECPRITVNPQPSPFDLRGHKVPAEALASVPLDHS 234
Query: 265 FVNDIDPSAWAV---PRFASMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQ 321
+ DIDP WAV PR + + T+ +P E+ +AEKFRLA+VER+ RLAPKRAK RQ
Sbjct: 235 IIADIDPKTWAVAAAPRRPT--VPTNPNAPPEIPIAEKFRLAYVEREARLAPKRAKRERQ 292
Query: 322 HQRRAEREWMETSTSAKAIALASKATKSL 350
+RRAERE++ +AK++ALA++ +K L
Sbjct: 293 QRRRAEREYLMNDINAKSVALAAQVSKYL 321
>gi|168029101|ref|XP_001767065.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681807|gb|EDQ68231.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 333
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 153/304 (50%), Positives = 202/304 (66%), Gaps = 14/304 (4%)
Query: 58 SLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWE 117
SL+SLC+G +G+HLED+I D+D+IA FP +K + +IARRR LL D+++ +LAD SWE
Sbjct: 28 SLLSLCVGYIGEHLEDLIDDIDDIAPAFPFHVKGKLLAIARRRNLLCDELLTALADISWE 87
Query: 118 ILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPR 173
ILD+SGSDV+D L+ AQ C L+ VDI ++T A V + HC +L TLR GG+P
Sbjct: 88 ILDVSGSDVTDSSLIAAAQTCPRLQDVDISRCNKLTCAAVRALVEHCPNLRTLRYGGTPM 147
Query: 174 SNHAARRCLGILKPKLN-DVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTEC 232
S+ AAR+ + + PKLN + E DSWE L + GAQ+LRW VWP ID S+E + +EC
Sbjct: 148 SDAAARKSISYIVPKLNRNEEEDSWEVLETKAVAEGAQTLRWLVWPGIDPISMERLKSEC 207
Query: 233 PRIIVNPKPSPFGFRGFE-VPREAFPDITLDDPFVNDIDPSAWA-----VPRFASMGIST 286
PRI++NP F ++ + +P A D LD+ F+ DIDP WA PR +
Sbjct: 208 PRIVINPV---FSWKSYGIIPPAALRDAVLDESFLEDIDPKTWANKSKTPPRSEKVPPRA 264
Query: 287 SLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKA 346
+ NELS+AEKFRLAFV RD RLAPKRAKNARQ+QRRAE+ W+ + T AK+I A A
Sbjct: 265 TDEVKNELSVAEKFRLAFVARDERLAPKRAKNARQNQRRAEKAWLNSDTEAKSIVWAGIA 324
Query: 347 TKSL 350
KSL
Sbjct: 325 QKSL 328
>gi|302823372|ref|XP_002993339.1| hypothetical protein SELMODRAFT_136924 [Selaginella moellendorffii]
gi|300138848|gb|EFJ05601.1| hypothetical protein SELMODRAFT_136924 [Selaginella moellendorffii]
Length = 261
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 120/260 (46%), Positives = 162/260 (62%), Gaps = 11/260 (4%)
Query: 91 IAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI---- 146
+ + +IARRR LL D +++ LAD++WEILD+SGSDV+D G+ + A++C L+AVD+
Sbjct: 1 LQLVAIARRRGLLCDRLLVPLADTAWEILDVSGSDVTDTGIHQAAKICSRLRAVDVSRCS 60
Query: 147 RITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGILKPKL---NDVEGDSWEELVNT 203
+TA + + C +LETLRCGG+ SN AAR L + P L N VE +SWE+L
Sbjct: 61 SLTAESIRTIVECCPALETLRCGGTHLSNRAARLALPCVLPGLRVRNQVE-ESWEDLDAV 119
Query: 204 DIGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDD 263
+G GAQ LRW VWP ID +S + + ECPR++VNP + + VPREA P LD+
Sbjct: 120 AVGSGAQELRWIVWPAIDGESRQTVELECPRVVVNPGMA--SSKHARVPREALPSTVLDE 177
Query: 264 PFVNDIDPSAWAVPRFASMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQ 323
V DI PSAW V R + + L +S AE FR+AF RD RLAPKRAKN RQ +
Sbjct: 178 QLVEDISPSAWEV-RCSPSQAAALGLGEISISKAELFRMAFAARDERLAPKRAKNWRQKE 236
Query: 324 RRAEREWMETSTSAKAIALA 343
RR + W+ +S +A LA
Sbjct: 237 RRERKLWVSSSLELRARHLA 256
>gi|302801682|ref|XP_002982597.1| hypothetical protein SELMODRAFT_116472 [Selaginella moellendorffii]
gi|300149696|gb|EFJ16350.1| hypothetical protein SELMODRAFT_116472 [Selaginella moellendorffii]
Length = 261
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 120/260 (46%), Positives = 162/260 (62%), Gaps = 11/260 (4%)
Query: 91 IAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI---- 146
+ + +IARRR LL D +++ LAD++WEILD+SGSDV+D G+ + A++C L+AVD+
Sbjct: 1 LQLVAIARRRGLLCDRLLVPLADTAWEILDVSGSDVTDTGIHQAAKICSKLRAVDVSRCS 60
Query: 147 RITAAGVSEFLLHCHSLETLRCGGSPRSNHAARRCLGILKPKL---NDVEGDSWEELVNT 203
+TA + + C +LETLRCGG+ SN AAR L + P L N VE +SWE+L
Sbjct: 61 SLTAESIRTIVECCPALETLRCGGTHLSNRAARLALPCVLPGLRVRNQVE-ESWEDLDAV 119
Query: 204 DIGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDD 263
+G GAQ LRW VWP ID +S + + ECPR++VNP + + VPREA P LD+
Sbjct: 120 AVGSGAQELRWIVWPAIDGESRQTVELECPRVVVNPGMA--SSKHACVPREALPSTVLDE 177
Query: 264 PFVNDIDPSAWAVPRFASMGISTSLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQ 323
V DI PSAW V R + + L +S AE FR+AF RD RLAPKRAKN RQ +
Sbjct: 178 QLVEDISPSAWEV-RCSPSQAAALGLGEISISKAELFRMAFAARDERLAPKRAKNWRQKE 236
Query: 324 RRAEREWMETSTSAKAIALA 343
RR + W+ +S +A LA
Sbjct: 237 RRERKLWVSSSLELRARHLA 256
>gi|308080076|ref|NP_001183107.1| uncharacterized protein LOC100501472 [Zea mays]
gi|238009360|gb|ACR35715.1| unknown [Zea mays]
Length = 196
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/184 (52%), Positives = 126/184 (68%), Gaps = 5/184 (2%)
Query: 170 GSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMS 229
G PRS ARRCL +LKPKLN++E DSWEEL DIG GA+SL+W VWP ID +S E ++
Sbjct: 12 GCPRSEVTARRCLDLLKPKLNNLEEDSWEELNTLDIGGGAESLKWLVWPKIDDNSKETLA 71
Query: 230 TECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSAWAV---PRFASMGIST 286
ECPR+ VNP+PS F G +VP EA + LD V DIDP WAV PR +
Sbjct: 72 AECPRVTVNPQPSLFDLSGSKVPVEALASVPLDHSIVEDIDPKTWAVSAAPRRPVAPPNP 131
Query: 287 SLLSPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKA 346
+ E+ +AE+FRLA+VER+ RLAPKRAK RQ +RRAER++M A+++ALA++A
Sbjct: 132 NAPP--EIPIAERFRLAYVEREARLAPKRAKRERQQRRRAERDYMMNDIDARSMALAAQA 189
Query: 347 TKSL 350
+++L
Sbjct: 190 SRNL 193
>gi|255076763|ref|XP_002502051.1| predicted protein [Micromonas sp. RCC299]
gi|226517316|gb|ACO63309.1| predicted protein [Micromonas sp. RCC299]
Length = 450
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 99/202 (49%), Gaps = 10/202 (4%)
Query: 56 PPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSS 115
P L + ++ +H+++++ E + P D++ A+ ++ARRR+ L D + +L D S
Sbjct: 13 PKPLKWIVADLLARHVDELVDAGGESLRHLPCDVRDALLAVARRRRCLDDAALRALVDES 72
Query: 116 WEILDISGSD---VSDFGL--VKVAQMCKSLKAVDIR----ITAAGVSEFLLHCHSLETL 166
I+D SG V+D G+ + + +++ AVD+ +TAAG+ L TL
Sbjct: 73 TTIIDASGCGGGRVTDAGIAALAARRALRNVTAVDLSRCDGVTAAGLRILARAAPRLRTL 132
Query: 167 RCGGSPRSNHAARRCLGILKPKLNDVEG-DSWEELVNTDIGHGAQSLRWFVWPNIDKDSI 225
RCGG N A + + P+L D G D WE D A +L W VWP++ D+
Sbjct: 133 RCGGDATCNAACEAAIPGIVPRLEDGGGVDDWETRAEVDSTTRATALEWLVWPDVRPDTR 192
Query: 226 EMMSTECPRIIVNPKPSPFGFR 247
+ +CPRI + P R
Sbjct: 193 ATIVRQCPRIRIVAPPREVHLR 214
>gi|219887253|gb|ACL54001.1| unknown [Zea mays]
Length = 140
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/136 (45%), Positives = 88/136 (64%), Gaps = 5/136 (3%)
Query: 220 IDKDSIEMMSTECPRIIVNPKPSPFGFRGF-EVPREAFPDITLDDPFVNDIDPSAWAVPR 278
ID S ++ ECPR+ VNP+PS F G+ +VP EA + LD V DIDP WAV
Sbjct: 2 IDDKSKVTLAVECPRVTVNPQPSLFDLSGYSKVPIEALSSVPLDHSVVEDIDPKTWAVSA 61
Query: 279 FASMGISTSLL----SPNELSMAEKFRLAFVERDTRLAPKRAKNARQHQRRAEREWMETS 334
++ + +P E+ MAE+FRLA+VER+ RLAPKRAK RQ +RRAER++M
Sbjct: 62 APPRAVAPANPNVPSAPPEIPMAERFRLAYVEREARLAPKRAKRERQRRRRAERDYMMVD 121
Query: 335 TSAKAIALASKATKSL 350
A+++ALA++A+++L
Sbjct: 122 IDARSVALAAQASRNL 137
>gi|384254181|gb|EIE27655.1| hypothetical protein COCSUDRAFT_39259 [Coccomyxa subellipsoidea
C-169]
Length = 322
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 133/293 (45%), Gaps = 26/293 (8%)
Query: 84 NFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDIS-GSDVSDFGLVKVAQMCKSLK 142
P K + ++ARR + D+ + LAD LD++ ++ G+ L+
Sbjct: 27 QLPWTWKAPLLAVARRTGAMTDEALALLADEDLVSLDLARARALTHSGIAAAVSGLPLLR 86
Query: 143 AVDIRITA---AGVSEFLLHCHSLETLRCGG-SPRSNHAARRCLGILKPKLNDVE-GDSW 197
A+D+ TA A + C LE LR GG P+ AA L P+L + DSW
Sbjct: 87 ALDLSYTAFQPAALPTLAEACPLLEVLRLGGLGPKPARAAAAALLRCLPRLQQADVADSW 146
Query: 198 EELVNT---DIGHGAQSLRWFVWPNIDKDSIEMMSTECPRIIVNPKP-SPF--GFRGFEV 251
E+L T GHG SL++ VWP++ +++ + P++ V P P R +
Sbjct: 147 EDLAETSAVQAGHGFTSLQYVVWPDVTPEALAHLHAHQPQLKVISAPDHPLLRASRRHRL 206
Query: 252 PREAFPDITLDDPFVNDIDPSAWAVPRF-----------ASMGISTSLLSPNELSMAEKF 300
P A P + LD P+ + W S G +++ EL++AE+F
Sbjct: 207 PAAADPGVPLDQPYAEAVAQYDWEAAAAAVRARCGDEAAGSEGAKSTVAGLRELTLAERF 266
Query: 301 RLAFVERDTRLAPKRAKNARQHQRRAEREWMETSTSAKAIALASKATKSLSSR 353
RLA+ R R+A RA+NA++ QRRA R + +A+ALA + +R
Sbjct: 267 RLAYESRAARVAVVRARNAKKEQRRALR---AMGGAERALALAEHGVPLVHTR 316
>gi|297606365|ref|NP_001058370.2| Os06g0679700 [Oryza sativa Japonica Group]
gi|255677325|dbj|BAF20284.2| Os06g0679700, partial [Oryza sativa Japonica Group]
Length = 63
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 41/59 (69%), Positives = 50/59 (84%)
Query: 89 IKIAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDIR 147
I++AI SIARRR+LL D+V++SLADSSWEILDISGSDVSD GL VA + +L A+DIR
Sbjct: 5 IQLAILSIARRRRLLNDEVLISLADSSWEILDISGSDVSDIGLATVANISNNLWAIDIR 63
>gi|302833776|ref|XP_002948451.1| hypothetical protein VOLCADRAFT_120630 [Volvox carteri f.
nagariensis]
gi|300266138|gb|EFJ50326.1| hypothetical protein VOLCADRAFT_120630 [Volvox carteri f.
nagariensis]
Length = 765
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 98/207 (47%), Gaps = 21/207 (10%)
Query: 52 EKSKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSL 111
+ + P+L +L LG +G+H++ ++ L P D+K + +ARRR +L++ V++SL
Sbjct: 184 QTGRIPTLRALVLGFLGRHVDSLVAQLGPHLAALPADVKACLLCVARRRGVLSNRVLLSL 243
Query: 112 ADSSWEILDISGSD-VSDFGLVKVAQMCKSLKAVDIRITAAGVSEFLLH----CHSLETL 166
AD W +LD+ G+ V++ L + C+ L+A+ + + L + C SLE L
Sbjct: 244 ADEFWTLLDLGGAHRVTERALRQALLRCRHLRALSLSGLSCPSPALLRNLPDLCPSLEVL 303
Query: 167 RCGGSPRSNHAARRCLGILKPKL------NDVEGDSWEELVNTDIGH----------GAQ 210
G A + L L P + D DSWE++ + G G Q
Sbjct: 304 VVGDCKPQEEALLQVLPDLLPHVRVREQAQDAVHDSWEDVAVSSGGGAPLPGPGVPCGLQ 363
Query: 211 SLRWFVWPNIDKDSIEMMSTECPRIIV 237
L+ VWPN ++ + PR++V
Sbjct: 364 RLKLLVWPNPPSLVRHVVRSVNPRVVV 390
>gi|307105128|gb|EFN53379.1| hypothetical protein CHLNCDRAFT_53938 [Chlorella variabilis]
Length = 338
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 121/286 (42%), Gaps = 38/286 (13%)
Query: 66 IVGKHLEDIIGD---LDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDIS 122
++G+HLE+++ D + + P + K + ++AR R+LL D ++ LAD ILD+
Sbjct: 47 VLGEHLEELLSDPGCAEYVLPLLPAEAKACLLAVARLRRLLCDAALLLLADQGQHILDLR 106
Query: 123 GSD--VSDFGLVKVAQMCKSLKAVDIRITAAGVSEFLL---HCHSLETLRCGGSPRSNHA 177
G +SD G+ + L+ D+ G C L LR GSP ++ +
Sbjct: 107 GCGDLLSDSGIRAAIRRMPDLRLADLTSCPVGADTLRALGEACPGLHVLRL-GSPLTDAS 165
Query: 178 ARRCLGILKPKLNDVE----GDSWEELVNTDIGHGAQ-----------SLRWFVWPNIDK 222
A R L + P L DSW+ L+ + A L WPNI
Sbjct: 166 AGRGLKDVLPVLEQRHAAPAADSWDALLEVEGDEQALLAAVAGGARLMQLHCLAWPNIPH 225
Query: 223 DSIEMMSTECPRIIVNPKPSPFGFRGFEVPREAFPDITLDDPFVNDIDPSA-WAVPRFAS 281
E CP + +N PS + P + LD + ++ S W
Sbjct: 226 RLAEHCRAACPMVALN--PSEEQVVALRLMPACDPAVQLDAALLAEVAGSERWQ------ 277
Query: 282 MGISTSLLSPNE--LSMAEKFRLAFVERDTRLAPKRAKNARQHQRR 325
+ + P E + +AEKFRLA++ R R++ K +N +Q +RR
Sbjct: 278 ---AGAKEQPREPIVHIAEKFRLAYLSRAQRVSAKMERNWQQARRR 320
>gi|303276378|ref|XP_003057483.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461835|gb|EEH59128.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 496
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 96/222 (43%), Gaps = 48/222 (21%)
Query: 66 IVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADS--SWEILDISG 123
++ H++++I E + P D++ A+ ++ARRR L D +++L D LD++G
Sbjct: 35 LLAAHVDELIELAGESLLQLPSDVRCALRAVARRRGCLDDRALLALVDGHDGAPFLDVAG 94
Query: 124 S-DVSDFGLVKVAQMCKSLKAVDIR------ITAAGVSEFLLHCHSLETLRCGGSPRSNH 176
V+D G+ +A+ V I+A G++ + TLRCGG +
Sbjct: 95 CVGVTDDGIDALARRGALAAVVAADVTGCGGISARGIATLCAAAPMMHTLRCGGDDACDD 154
Query: 177 AARRCL-------GILK-------------PKLNDVEG----DSWEE------------- 199
A RR + G L P+ +D +SWE+
Sbjct: 155 AMRRAIDPGGGGRGKLTRGGRRERGVGGILPRFDDARHHHHVESWEDRAAAASPPPAASS 214
Query: 200 -LVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRI-IVNP 239
+T A SL+W VWP++D E++ ECPR+ IV P
Sbjct: 215 SATDTLTTARASSLQWLVWPDVDVAHRELIRAECPRMRIVAP 256
>gi|159464088|ref|XP_001690274.1| predicted protein [Chlamydomonas reinhardtii]
gi|158284262|gb|EDP10012.1| predicted protein [Chlamydomonas reinhardtii]
Length = 281
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 45/70 (64%), Gaps = 2/70 (2%)
Query: 57 PSLVSLCLGIVGKHLEDIIG--DLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADS 114
P+L +LCLG++G H+E + G D+ P D+K + +ARRR +L++ +++ L D
Sbjct: 77 PTLRALCLGLLGSHVEHLAGTEDMGPHLALLPPDVKACMLCVARRRGVLSNRILLGLVDP 136
Query: 115 SWEILDISGS 124
W +LD++G+
Sbjct: 137 GWAVLDLAGA 146
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 50/95 (52%), Gaps = 7/95 (7%)
Query: 261 LDDPFVNDIDPSAWAVPRFASMGISTSLLSPNE-LSMAEKFRLAFVERDTRLAPKRAKNA 319
LDD + PSAWA G + + E L +AEKFRLA+V++D RL RAK A
Sbjct: 188 LDDCLAALVAPSAWAGGGCGGPGRGGAREAEWERLHIAEKFRLAYVQQDERL---RAK-A 243
Query: 320 RQHQRRAEREWMETSTS--AKAIALASKATKSLSS 352
R+ AER+ + S A + +K +SL+S
Sbjct: 244 RREAALAERQELRASGRRLTPADYIVTKTQRSLAS 278
>gi|156391949|ref|XP_001635812.1| predicted protein [Nematostella vectensis]
gi|156222909|gb|EDO43749.1| predicted protein [Nematostella vectensis]
Length = 263
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 63/126 (50%), Gaps = 16/126 (12%)
Query: 54 SKPPSLVSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLAD 113
S SL + C+ + H+ G L ++ P ++K + + +R LL+DDV+ +
Sbjct: 9 SSLDSLQTSCIRAIADHVSQHKGGLKDL----PREVKDNLVLLFSKRGLLSDDVLPKIVS 64
Query: 114 SSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI--------RITAAGVSEFLLHCHSLET 165
LD+S SD SD GL+ + Q+CK L+ +D+ IT+ G+ CH L+T
Sbjct: 65 PLIRDLDLSESDTSDEGLMAL-QVCKKLRKLDLNAVKDRRENITSNGIITISQSCHDLQT 123
Query: 166 L---RC 168
+ RC
Sbjct: 124 VYLRRC 129
>gi|156395364|ref|XP_001637081.1| predicted protein [Nematostella vectensis]
gi|156224190|gb|EDO45018.1| predicted protein [Nematostella vectensis]
Length = 1333
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 68/125 (54%), Gaps = 9/125 (7%)
Query: 92 AIASIARRRKLLADDVIMSLADSSWE---ILDIS-GSDVSDFGLVKVAQMCKSLKAVDIR 147
A+ +++ + D ++++A+S+ E LD+S DV+D G+ +VA C +L+ + +R
Sbjct: 1087 ALQTVSLAEAAITDATLVTIAESAAERLQDLDLSWCEDVTDVGISRVATSCVNLRTLSLR 1146
Query: 148 -ITAAGVSEFLL--HCHSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTD 204
A+GVS +L +CH++ +L+ G + CL P+L D+ SW + TD
Sbjct: 1147 QCDASGVSMDMLTANCHAMTSLKLSGVTNLTDSMVSCLASYMPQL-DIIDLSWNSSL-TD 1204
Query: 205 IGHGA 209
+G A
Sbjct: 1205 VGISA 1209
>gi|198438507|ref|XP_002126400.1| PREDICTED: similar to Protein AMN1 homolog [Ciona intestinalis]
Length = 259
Score = 44.3 bits (103), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 54/103 (52%), Gaps = 15/103 (14%)
Query: 77 DLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQ 136
DLDEI PV+IK + SI ++ LL+DD I S ++ L+++ +SD GL+++++
Sbjct: 28 DLDEI----PVNIKRKLTSILAQQGLLSDDNIRSCLYANMGELNLNRCHISDKGLLEISK 83
Query: 137 MCKSLKAVDIR--------ITAAGVSEFLLHCHSLETL---RC 168
C +L +D+ IT+ GV C L + RC
Sbjct: 84 SCPNLLKIDLNALKGSQTEITSVGVQNLSKGCRKLRVVYLRRC 126
>gi|242014676|ref|XP_002428011.1| fbxl7, putative [Pediculus humanus corporis]
gi|212512530|gb|EEB15273.1| fbxl7, putative [Pediculus humanus corporis]
Length = 690
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 41/76 (53%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C++LK + +R +T
Sbjct: 583 ARGCEAVSDDAITVLARSCPRLRALDIGKCDVSDAGLRALAECCQNLKKLSLRNCDLVTD 642
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 643 RGVQCIAYYCRGLQQL 658
>gi|260801939|ref|XP_002595852.1| hypothetical protein BRAFLDRAFT_128232 [Branchiostoma floridae]
gi|229281101|gb|EEN51864.1| hypothetical protein BRAFLDRAFT_128232 [Branchiostoma floridae]
Length = 276
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 41/174 (23%), Positives = 78/174 (44%), Gaps = 25/174 (14%)
Query: 85 FPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAV 144
P +IK + + +R ++ DD I + LD+S S+VSD L +++ +C++L+ +
Sbjct: 36 LPPNIKDKLVHLMSKRGVIRDDNIGKVVHQKMRELDLSESEVSDLSLEQLS-VCRNLRKI 94
Query: 145 DIR--------ITAAGVSEFLLHCHSLETL---RC-----GG--SPRSNHAARRCLGI-- 184
D+ +T AG+ + CH L+ + RC G + N CL I
Sbjct: 95 DLNAVKGNTTTVTTAGILSLVRSCHLLQVVHLRRCIEVTDGAIVALSENCHQLMCLNIGG 154
Query: 185 ---LKPKLNDVEGDSWEELVNTDIGHGAQSLRWFVWPNIDKDSIEMMSTECPRI 235
+ + + G + L + + GH + + + D++E++ CPRI
Sbjct: 155 CAKITDRSLEALGQNSRMLRSINFGH-TKEVHMDGCQLLTDDAVEVIMESCPRI 207
>gi|195054589|ref|XP_001994207.1| GH23468 [Drosophila grimshawi]
gi|193896077|gb|EDV94943.1| GH23468 [Drosophila grimshawi]
Length = 766
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 659 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 718
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 719 RGVQCIAYYCRGLQQL 734
>gi|195328589|ref|XP_002030997.1| GM24270 [Drosophila sechellia]
gi|194119940|gb|EDW41983.1| GM24270 [Drosophila sechellia]
Length = 772
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 665 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRSCDMITD 724
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 725 RGVQCIAYYCRGLQQL 740
>gi|21357913|ref|NP_650512.1| CG4221 [Drosophila melanogaster]
gi|16198189|gb|AAL13904.1| LD38495p [Drosophila melanogaster]
gi|23171416|gb|AAF55252.2| CG4221 [Drosophila melanogaster]
gi|220947248|gb|ACL86167.1| CG4221-PA [synthetic construct]
gi|220956790|gb|ACL90938.1| CG4221-PA [synthetic construct]
Length = 772
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 665 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRSCDMITD 724
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 725 RGVQCIAYYCRGLQQL 740
>gi|195570510|ref|XP_002103250.1| GD19058 [Drosophila simulans]
gi|194199177|gb|EDX12753.1| GD19058 [Drosophila simulans]
Length = 772
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 665 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRSCDMITD 724
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 725 RGVQCIAYYCRGLQQL 740
>gi|195080846|ref|XP_001997322.1| GH23299 [Drosophila grimshawi]
gi|193905658|gb|EDW04525.1| GH23299 [Drosophila grimshawi]
Length = 746
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 639 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 698
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 699 RGVQCIAYYCRGLQQL 714
>gi|194745027|ref|XP_001954994.1| GF18550 [Drosophila ananassae]
gi|190628031|gb|EDV43555.1| GF18550 [Drosophila ananassae]
Length = 771
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 664 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRSCDMITD 723
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 724 RGVQCIAYYCRGLQQL 739
>gi|195389118|ref|XP_002053225.1| GJ23768 [Drosophila virilis]
gi|194151311|gb|EDW66745.1| GJ23768 [Drosophila virilis]
Length = 780
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 673 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 732
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 733 RGVQCIAYYCRGLQQL 748
>gi|195501137|ref|XP_002097674.1| GE24351 [Drosophila yakuba]
gi|194183775|gb|EDW97386.1| GE24351 [Drosophila yakuba]
Length = 780
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 673 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRSCDMITD 732
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 733 RGVQCIAYYCRGLQQL 748
>gi|194901162|ref|XP_001980121.1| GG16963 [Drosophila erecta]
gi|190651824|gb|EDV49079.1| GG16963 [Drosophila erecta]
Length = 778
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 671 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRSCDMITD 730
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 731 RGVQCIAYYCRGLQQL 746
>gi|195152237|ref|XP_002017043.1| GL22080 [Drosophila persimilis]
gi|194112100|gb|EDW34143.1| GL22080 [Drosophila persimilis]
Length = 789
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 682 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 741
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 742 RGVQCIAYYCRGLQQL 757
>gi|198453621|ref|XP_001359268.2| GA18044 [Drosophila pseudoobscura pseudoobscura]
gi|198132440|gb|EAL28413.2| GA18044 [Drosophila pseudoobscura pseudoobscura]
Length = 787
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 680 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 739
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 740 RGVQCIAYYCRGLQQL 755
>gi|195110169|ref|XP_001999654.1| GI22952 [Drosophila mojavensis]
gi|193916248|gb|EDW15115.1| GI22952 [Drosophila mojavensis]
Length = 782
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 675 ARGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 734
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 735 RGVQCIAYYCRGLQQL 750
>gi|170032738|ref|XP_001844237.1| f-box/leucine rich repeat protein [Culex quinquefasciatus]
gi|167873067|gb|EDS36450.1| f-box/leucine rich repeat protein [Culex quinquefasciatus]
Length = 750
Score = 41.6 bits (96), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 643 ARGCEAVSDDSINVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 702
Query: 151 AGVSEFLLHCHSLETL 166
G+ +C L+ L
Sbjct: 703 RGIQTIAYYCRGLQQL 718
>gi|440804643|gb|ELR25520.1| leucine rich repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 444
Score = 41.2 bits (95), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 38/62 (61%), Gaps = 4/62 (6%)
Query: 109 MSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLE 164
++L S E L+I+G+ VSD GL ++ C++LK +D+ R+T G+ L +C SL+
Sbjct: 324 IALGCSRIESLNINGTQVSDEGLKQLVTSCRNLKQLDVSFCKRLTVDGIRLLLTNCPSLQ 383
Query: 165 TL 166
L
Sbjct: 384 KL 385
>gi|195444896|ref|XP_002070079.1| GK11217 [Drosophila willistoni]
gi|194166164|gb|EDW81065.1| GK11217 [Drosophila willistoni]
Length = 764
Score = 41.2 bits (95), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
+R + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 657 SRGCEAVSDDSITVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 716
Query: 151 AGVSEFLLHCHSLETL 166
GV +C L+ L
Sbjct: 717 RGVQCIAYYCRGLQQL 732
>gi|347969608|ref|XP_307793.5| AGAP003285-PA [Anopheles gambiae str. PEST]
gi|333466227|gb|EAA03580.5| AGAP003285-PA [Anopheles gambiae str. PEST]
Length = 841
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 734 ARGCEAVSDDSINVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 793
Query: 151 AGVSEFLLHCHSLETL 166
G+ +C L+ L
Sbjct: 794 RGIQCIAYYCRGLQQL 809
>gi|226479182|emb|CAX73086.1| F-box and leucine-rich repeat protein 20 [Schistosoma japonicum]
Length = 517
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 14/103 (13%)
Query: 76 GDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLAD--SSWEILDISGSDVSDFGLVK 133
G D + N + +++ S +R DD + + S+ LD+S V D GL
Sbjct: 139 GTCDYLGKNCSLLTTLSLESCSR-----VDDTGLEMLSWCSNLTCLDVSWCSVGDRGLTA 193
Query: 134 VAQMCKSL---KAVDIR-ITAAGVSEFLLHCHSLETLR---CG 169
+A+ CK+L +AV + IT+ GV + HCHSL L CG
Sbjct: 194 IAKGCKNLQRFRAVGCQEITSRGVEQLARHCHSLLLLNLNYCG 236
>gi|312372308|gb|EFR20294.1| hypothetical protein AND_20333 [Anopheles darlingi]
Length = 850
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 743 ARGCEAVSDDSINVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 802
Query: 151 AGVSEFLLHCHSLETL 166
G+ +C L+ L
Sbjct: 803 RGIQCIAYYCRGLQQL 818
>gi|443699865|gb|ELT99119.1| hypothetical protein CAPTEDRAFT_20554 [Capitella teleta]
Length = 217
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 11/81 (13%)
Query: 99 RRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQMCKSLKAVDIR--------ITA 150
+R L+ D I + + +L+++ DVSD L K+A MC LK +D+ +T
Sbjct: 3 KRGLVTDANIAEVLHKNITVLNLNECDVSDIALYKIAAMCPHLKKLDVNACKANRTDVTD 62
Query: 151 AGVSEFLLHCHSLETL---RC 168
AGV E C L+ + RC
Sbjct: 63 AGVCEIAAKCRFLQVVFLRRC 83
>gi|225443116|ref|XP_002273285.1| PREDICTED: EIN3-binding F-box protein 1-like [Vitis vinifera]
Length = 575
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 50/120 (41%), Gaps = 16/120 (13%)
Query: 60 VSLCLGIVGKHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEIL 119
VS C GI G + +I+ D++ + V+ + S KL S +L
Sbjct: 425 VSFCAGITGGGIAEILKSCDDVR-HLEVNFCAGVKSFGADSKL-----------SKLGVL 472
Query: 120 DISGSDVSDFGLVKVAQMCKSLKAVDIR----ITAAGVSEFLLHCHSLETLRCGGSPRSN 175
+GS + D GLV V Q C L +D+R ++ GV E + C L + G N
Sbjct: 473 KAAGSGICDEGLVMVGQTCPWLLHLDLRGCSGVSTKGVKEIVRSCKGLREINIKGCLDVN 532
>gi|312794676|ref|YP_004027599.1| crispr-associated protein, csx11 family [Caldicellulosiruptor
kristjanssonii 177R1B]
gi|312181816|gb|ADQ41986.1| CRISPR-associated protein, Csx11 family [Caldicellulosiruptor
kristjanssonii 177R1B]
Length = 1223
Score = 40.4 bits (93), Expect = 1.4, Method: Composition-based stats.
Identities = 22/71 (30%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 194 GDSWEELVNTDIGHGAQ-SLRWFVWPNIDKDSIEMMSTECPRIIVNPKPSPFGFRG-FEV 251
GD WEE + +G+ + W N+ D I+ +ST + ++ PSP R +E
Sbjct: 681 GDRWEEFIKQKLGNKIDFKYKKIYWQNLTDDDIDFLSTLILQFLLRKNPSPARLRRIWES 740
Query: 252 PREAFPDITLD 262
RE F DI ++
Sbjct: 741 TREFFEDIKIN 751
>gi|157167214|ref|XP_001652226.1| f-box/leucine rich repeat protein [Aedes aegypti]
gi|108877352|gb|EAT41577.1| AAEL006797-PA [Aedes aegypti]
Length = 522
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD I LA S + LDI DVSD GL +A+ C +LK + +R IT
Sbjct: 415 ARGCEAVSDDSINVLARSCPRLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDMITD 474
Query: 151 AGVSEFLLHCHSLETL 166
G+ +C L+ L
Sbjct: 475 RGIQCIAYYCRGLQQL 490
>gi|91094271|ref|XP_970021.1| PREDICTED: similar to f-box/leucine rich repeat protein [Tribolium
castaneum]
gi|270016255|gb|EFA12701.1| hypothetical protein TcasGA2_TC002335 [Tribolium castaneum]
Length = 478
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 6/76 (7%)
Query: 97 ARRRKLLADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITA 150
AR + ++DD ++ LA S + LDI DVSD GL +A+ C +LK + +R +T
Sbjct: 371 ARGCEAVSDDAVIFLARSCTRLCALDIGKCDVSDAGLRALAESCPNLKKLSLRSCDLVTD 430
Query: 151 AGVSEFLLHCHSLETL 166
GV C L+ L
Sbjct: 431 RGVQCVAYFCRGLQQL 446
>gi|358335697|dbj|GAA54333.1| F-box and leucine-rich repeat protein 2/20 [Clonorchis sinensis]
Length = 699
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query: 105 DDVIMSLAD--SSWEILDISGSDVSDFGLVKVAQMCKSL---KAVDIR-ITAAGVSEFLL 158
DD M L S+ +LD+S V D GL +A+ CK L +AV R IT+ GV +
Sbjct: 354 DDTGMELLSWCSNLTVLDVSWCTVGDRGLTAIARGCKGLQRFRAVGCREITSRGVQQLAE 413
Query: 159 HCHSLETLR---CGGS 171
CH L L CG S
Sbjct: 414 RCHGLILLNLNYCGQS 429
>gi|390349919|ref|XP_003727308.1| PREDICTED: protein AMN1 homolog [Strongylocentrotus purpuratus]
Length = 264
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 69 KHLEDIIGDLDEIAINFPVDIKIAIASIARRRKLLADDVIMSLADSSWEILDISGSDVSD 128
K L + LD N P+++K + + +R LL D + + ILD+S S VSD
Sbjct: 25 KQLPCFLSQLD----NLPINVKDNLLHLMSKRGLLTDTNVAKVIHKRTRILDLSESSVSD 80
Query: 129 FGLVKVAQMCKSLKAVDI 146
L+++ +C++L+ +D+
Sbjct: 81 RALLRLG-VCRNLRKIDL 97
>gi|328715076|ref|XP_001950086.2| PREDICTED: f-box/LRR-repeat protein 7-like [Acyrthosiphon pisum]
Length = 474
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 6/70 (8%)
Query: 103 LADDVIMSLADSSWEI--LDISGSDVSDFGLVKVAQMCKSLKAVDIR----ITAAGVSEF 156
++DD I LA S + LDI DVSD GL +A+ C +LK + +R +T G+
Sbjct: 373 VSDDAITVLARSCARLRALDIGKCDVSDAGLRALAESCPNLKKLSLRNCDLVTDRGIQLI 432
Query: 157 LLHCHSLETL 166
+C L+ L
Sbjct: 433 AYYCRGLQQL 442
>gi|343172599|gb|AEL99003.1| leucine-rich repeats-ribonuclease inhibitor domain-containing
protein, partial [Silene latifolia]
Length = 592
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 71/150 (47%), Gaps = 32/150 (21%)
Query: 96 IARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQ---MCKSLKAVDIR---IT 149
I R KL DD L S+ I+D+SG + D GL+ +A+ K+++ VD IT
Sbjct: 146 IERDSKLKLDDFAREL--RSFRIVDMSGRNFGDDGLIFLAESLAYNKAVEEVDFSANGIT 203
Query: 150 AAGVSEF---LLHCHSLETLRCGGSPRSNHAARRCL--------GILKPKLNDVEGDSWE 198
AAG+ F L L+TL G+P + A +CL GI K +LN
Sbjct: 204 AAGLKAFDGILQSNFFLKTLNLSGNPIGDEGA-KCLCEILVSNTGIQKLQLN-------- 254
Query: 199 ELVNTDIG-HGAQSLRWFVWPNIDKDSIEM 227
+T +G GA+++ + N + +IE+
Sbjct: 255 ---STGLGDEGAKAIAQLLRKNSNLTAIEL 281
>gi|256088972|ref|XP_002580594.1| fbxl20 [Schistosoma mansoni]
Length = 518
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 7/63 (11%)
Query: 114 SSWEILDISGSDVSDFGLVKVAQMCKSL---KAVDIR-ITAAGVSEFLLHCHSLETLR-- 167
S+ LD+S V D GL +A+ CKSL +A+ + IT+ GV + HCH L L
Sbjct: 174 SNLTCLDVSWCSVGDRGLTAIARGCKSLQRFRAIGCQEITSRGVEQLARHCHGLLLLNLN 233
Query: 168 -CG 169
CG
Sbjct: 234 YCG 236
>gi|353232353|emb|CCD79708.1| putative fbxl20 [Schistosoma mansoni]
Length = 522
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 7/63 (11%)
Query: 114 SSWEILDISGSDVSDFGLVKVAQMCKSL---KAVDIR-ITAAGVSEFLLHCHSLETLR-- 167
S+ LD+S V D GL +A+ CKSL +A+ + IT+ GV + HCH L L
Sbjct: 178 SNLTCLDVSWCSVGDRGLTAIARGCKSLQRFRAIGCQEITSRGVEQLARHCHGLLLLNLN 237
Query: 168 -CG 169
CG
Sbjct: 238 YCG 240
>gi|256088974|ref|XP_002580595.1| fbxl20 [Schistosoma mansoni]
Length = 525
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 114 SSWEILDISGSDVSDFGLVKVAQMCKSL---KAVDIR-ITAAGVSEFLLHCHSL 163
S+ LD+S V D GL +A+ CKSL +A+ + IT+ GV + HCH L
Sbjct: 174 SNLTCLDVSWCSVGDRGLTAIARGCKSLQRFRAIGCQEITSRGVEQLARHCHGL 227
>gi|353232352|emb|CCD79707.1| putative fbxl20 [Schistosoma mansoni]
Length = 529
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 34/63 (53%), Gaps = 7/63 (11%)
Query: 114 SSWEILDISGSDVSDFGLVKVAQMCKSL---KAVDIR-ITAAGVSEFLLHCHSLETLR-- 167
S+ LD+S V D GL +A+ CKSL +A+ + IT+ GV + HCH L L
Sbjct: 178 SNLTCLDVSWCSVGDRGLTAIARGCKSLQRFRAIGCQEITSRGVEQLARHCHGLLLLNLN 237
Query: 168 -CG 169
CG
Sbjct: 238 YCG 240
>gi|170051095|ref|XP_001861609.1| f-box/lrr protein [Culex quinquefasciatus]
gi|167872486|gb|EDS35869.1| f-box/lrr protein [Culex quinquefasciatus]
Length = 608
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 7/80 (8%)
Query: 100 RKLLADDVIMSLADS--SWEILDISGS-DVSDFGLVKVAQMCKSLKAVDIR----ITAAG 152
R L+ D + S S S EI+D S S +++D + VA+ C+ L + + +T A
Sbjct: 504 RLLITDLAVQSFVQSCPSLEIIDFSESPNITDLCVELVARHCRRLTTLKLHNCPLVTDAS 563
Query: 153 VSEFLLHCHSLETLRCGGSP 172
++ + HCH L+ L G P
Sbjct: 564 LAALIKHCHELKHLNIRGCP 583
>gi|321472393|gb|EFX83363.1| hypothetical protein DAPPUDRAFT_315868 [Daphnia pulex]
Length = 276
Score = 37.7 bits (86), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 4/52 (7%)
Query: 127 SDFGLVKVAQMCKSLKAVDI----RITAAGVSEFLLHCHSLETLRCGGSPRS 174
D LVK++Q C L ++I RIT +G+ +F +CH++ TL+ G S
Sbjct: 187 QDETLVKISQFCPQLMYLNISRCCRITDSGIKKFAEYCHTVRTLKVDGCEVS 238
>gi|343172597|gb|AEL99002.1| leucine-rich repeats-ribonuclease inhibitor domain-containing
protein, partial [Silene latifolia]
Length = 592
Score = 37.7 bits (86), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 68/142 (47%), Gaps = 16/142 (11%)
Query: 96 IARRRKLLADDVIMSLADSSWEILDISGSDVSDFGLVKVAQ---MCKSLKAVDIR---IT 149
I R KL DD L S+ I+D+SG + D GL+ +A+ K+++ VD IT
Sbjct: 146 IERDSKLKLDDFAREL--RSFRIVDMSGRNFGDDGLIFLAESLAYNKAVEEVDFSANGIT 203
Query: 150 AAGVSEF---LLHCHSLETLRCGGSPRSNHAARRCLGILKPKLNDVEGDSWEELVNTDIG 206
AAG+ F L L+TL G+P + A+ IL G +L +T +G
Sbjct: 204 AAGLKAFDGILQSNFFLKTLNLSGNPIGDEGAKCLCEILVSN----TGLQKLQLNSTGLG 259
Query: 207 -HGAQSLRWFVWPNIDKDSIEM 227
GA+++ + N + +IE+
Sbjct: 260 DEGAKAIAELLRKNSNLTAIEL 281
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.132 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,237,301,512
Number of Sequences: 23463169
Number of extensions: 214931204
Number of successful extensions: 514834
Number of sequences better than 100.0: 82
Number of HSP's better than 100.0 without gapping: 31
Number of HSP's successfully gapped in prelim test: 51
Number of HSP's that attempted gapping in prelim test: 514652
Number of HSP's gapped (non-prelim): 129
length of query: 354
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 211
effective length of database: 9,003,962,200
effective search space: 1899836024200
effective search space used: 1899836024200
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)