BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018197
(359 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297738286|emb|CBI27487.3| unnamed protein product [Vitis vinifera]
Length = 456
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 288/353 (81%), Positives = 316/353 (89%), Gaps = 4/353 (1%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
D+SPIE+ARARF+QIIVDHFI+DHVIEV DS A+Y+ Q GQDKL+KRK+R+VQYEGDPRF
Sbjct: 107 DISPIEAARARFMQIIVDHFIDDHVIEVADSEADYNGQSGQDKLNKRKSREVQYEGDPRF 166
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
LPLMYVANMYETLVN+VN+RLAS NGIREKTIGVALEAAGGLYR LAKKFPKKG CT+K
Sbjct: 167 VLPLMYVANMYETLVNEVNIRLASLNGIREKTIGVALEAAGGLYRRLAKKFPKKGPCTFK 226
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATS+ETRTRFPELVIQEEKRVRFVVVNGL IV+KPN++PI+DAEWFKRLTGR+EVA
Sbjct: 227 RRELATSIETRTRFPELVIQEEKRVRFVVVNGLVIVDKPNSVPIDDAEWFKRLTGRDEVA 286
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQTP 245
+SA+DYKFYSPRHKYRRVASN VSNIP LP FPGTD S TM +AQGF T++EPQNQQ TP
Sbjct: 287 VSARDYKFYSPRHKYRRVASNPVSNIPGLPTFPGTDTSPTMASAQGFRTVNEPQNQQATP 346
Query: 246 SKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTIS 305
SKHHMQ LS QFH IH NHHQ IHQ QH AHFS +H+CGPP HLP+I H HQSPTI
Sbjct: 347 SKHHMQSLS---QFHPIHQNHHQPIHQ-TQHSAHFSHSHQCGPPSHLPEIPHGHQSPTIP 402
Query: 306 QHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLG 358
QHMACLQP+TGGHV RLHV+P SPAKFCDECGAPYLRETSKFCSECG KR G
Sbjct: 403 QHMACLQPITGGHVSGRLHVLPTSPAKFCDECGAPYLRETSKFCSECGGKRFG 455
>gi|359473523|ref|XP_003631314.1| PREDICTED: uncharacterized protein At2g02148-like [Vitis vinifera]
Length = 452
Score = 595 bits (1534), Expect = e-167, Method: Compositional matrix adjust.
Identities = 287/353 (81%), Positives = 313/353 (88%), Gaps = 7/353 (1%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
D+SPIE+ARARF+QIIVDHFI+DHVIEV DS A+Y+ Q GQDKL+KRK+R+VQYEGDPRF
Sbjct: 106 DISPIEAARARFMQIIVDHFIDDHVIEVADSEADYNGQSGQDKLNKRKSREVQYEGDPRF 165
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
LPLMYVANMYETLVN+VN+RLAS NGIREKTIGVALEAAGGLYR LAKKFPKKG CT+K
Sbjct: 166 VLPLMYVANMYETLVNEVNIRLASLNGIREKTIGVALEAAGGLYRRLAKKFPKKGPCTFK 225
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATS+ETRTRFPELVIQEEKRVRFVVVNGL IV+KPN++PI+DAEWFKRLTGR+EVA
Sbjct: 226 RRELATSIETRTRFPELVIQEEKRVRFVVVNGLVIVDKPNSVPIDDAEWFKRLTGRDEVA 285
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQTP 245
+SA+DYKFYSPRHKYRRVASN VSNIP LP FPGTD S TM +AQGF T PQNQQ TP
Sbjct: 286 VSARDYKFYSPRHKYRRVASNPVSNIPGLPTFPGTDTSPTMASAQGFRT---PQNQQATP 342
Query: 246 SKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTIS 305
SKHHMQ LS QFH IH NHHQ IHQ QH AHFS +H+CGPP HLP+I H HQSPTI
Sbjct: 343 SKHHMQSLS---QFHPIHQNHHQPIHQ-TQHSAHFSHSHQCGPPSHLPEIPHGHQSPTIP 398
Query: 306 QHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLG 358
QHMACLQP+TGGHV RLHV+P SPAKFCDECGAPYLRETSKFCSECG KR G
Sbjct: 399 QHMACLQPITGGHVSGRLHVLPTSPAKFCDECGAPYLRETSKFCSECGGKRFG 451
>gi|255555531|ref|XP_002518802.1| cgi-62, putative [Ricinus communis]
gi|223542183|gb|EEF43727.1| cgi-62, putative [Ricinus communis]
Length = 448
Score = 559 bits (1440), Expect = e-157, Method: Compositional matrix adjust.
Identities = 273/357 (76%), Positives = 305/357 (85%), Gaps = 12/357 (3%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQD-KLSKRKNRDVQYEGDPR 64
DVSPIESAR+RFLQ+IVDHFI DHVI+V D+ A+Y QD KL+KRK+ DVQYEGDP
Sbjct: 101 DVSPIESARSRFLQLIVDHFIIDHVIQVPDNEADYLSHSAQDNKLNKRKSGDVQYEGDPT 160
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTY 124
+ALPLMYVANMYE+LVNDVN+RLAS NGIR+KTIG+ALEAAGGLYR +AKKFPKKGSC +
Sbjct: 161 YALPLMYVANMYESLVNDVNVRLASLNGIRDKTIGLALEAAGGLYRRIAKKFPKKGSCVF 220
Query: 125 KRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEV 184
KRRELATSLETRTRFPELVIQE KRVRFVVVNGLD+VEKPNNM IEDAEWFKRLTGRNEV
Sbjct: 221 KRRELATSLETRTRFPELVIQEVKRVRFVVVNGLDVVEKPNNMSIEDAEWFKRLTGRNEV 280
Query: 185 AISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQT 244
A+S++DYKFYSPRHKYRRVA SN+P LP P TD+ +M +AQGF + PQ +QQ
Sbjct: 281 AVSSRDYKFYSPRHKYRRVA----SNVPGLPTLPATDSPPSMASAQGFRS---PQTEQQP 333
Query: 245 PSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTI 304
P+KHH+ +SHQPQFH IH NHH Q+HQ QH FSQNH+CG HLP+IAHA+QSP I
Sbjct: 334 PAKHHVDSMSHQPQFHPIHQNHH-QVHQS-QHSTQFSQNHQCGTTSHLPEIAHANQSPNI 391
Query: 305 SQHMAC--LQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
SQHMA LQPLTGGHVG RLH+MPASPAKFCDECGAPYLRETSKFCSECG KRLGI
Sbjct: 392 SQHMAYLQLQPLTGGHVGGRLHLMPASPAKFCDECGAPYLRETSKFCSECGTKRLGI 448
>gi|449463503|ref|XP_004149473.1| PREDICTED: uncharacterized protein At2g02148-like [Cucumis sativus]
Length = 455
Score = 546 bits (1408), Expect = e-153, Method: Compositional matrix adjust.
Identities = 264/364 (72%), Positives = 298/364 (81%), Gaps = 21/364 (5%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
D+SPIE+ARARFLQIIVDHFI+DH++EVT++ +Y Q GQDKL+KRK ++VQYE DP+F
Sbjct: 103 DISPIEAARARFLQIIVDHFIHDHILEVTETDNDYISQSGQDKLTKRKTKEVQYEADPKF 162
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
LPLMYVANMYETLVND N+RLAS +GIR+K IGVALEAAGGLYR LA+KFPKKG CTYK
Sbjct: 163 VLPLMYVANMYETLVNDANIRLASLSGIRDKNIGVALEAAGGLYRKLAQKFPKKGPCTYK 222
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATSLETRTRFPELV+QEEKRVRFVVVNGLDIVEKPN M EDAEWF+RLTGR+EVA
Sbjct: 223 RRELATSLETRTRFPELVVQEEKRVRFVVVNGLDIVEKPNRMSTEDAEWFRRLTGRSEVA 282
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQ------ 239
+SAQDYKFYSPRHKYRRVA+NSVS+I L N S +V F +I +P
Sbjct: 283 VSAQDYKFYSPRHKYRRVAANSVSSISSL-------NVSGIVI---FSSICDPSLYGNIF 332
Query: 240 ----NQQQTPSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDI 295
QQQTP KHH+Q L HQPQF IH NHHQ +HQ QH +HF+ NH CG P L DI
Sbjct: 333 CKCILQQQTPCKHHIQQLPHQPQFQSIHQNHHQSMHQS-QHTSHFAHNHHCGQPSQLQDI 391
Query: 296 AHAHQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVK 355
+H H SPT+SQHMA LQPL+GGHVG RLH+ P+SPAKFCDECGAPYLRETSKFCSECGVK
Sbjct: 392 SHTHHSPTLSQHMASLQPLSGGHVGGRLHLQPSSPAKFCDECGAPYLRETSKFCSECGVK 451
Query: 356 RLGI 359
RLGI
Sbjct: 452 RLGI 455
>gi|356512247|ref|XP_003524832.1| PREDICTED: uncharacterized protein At2g02148-like [Glycine max]
Length = 440
Score = 540 bits (1391), Expect = e-151, Method: Compositional matrix adjust.
Identities = 267/356 (75%), Positives = 300/356 (84%), Gaps = 11/356 (3%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
D+SPIESARARFLQIIVDHFI+D VIEV DS A+Y+ GQDK+SKR+ R++QYEGDP F
Sbjct: 92 DISPIESARARFLQIIVDHFIDDRVIEVPDSEADYT---GQDKMSKRRTREIQYEGDPNF 148
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
LPLMYVANMYE+LV+DVN RLAS NGIREKTIGVALEAAGGLYR LAKKFPKKG CT+K
Sbjct: 149 TLPLMYVANMYESLVSDVNNRLASLNGIREKTIGVALEAAGGLYRRLAKKFPKKGPCTFK 208
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATS+ETRTRFPELVIQEEKRVRFVVVNGL IVEKPNNMPI+DAEWFKRLTGR+EVA
Sbjct: 209 RRELATSMETRTRFPELVIQEEKRVRFVVVNGLRIVEKPNNMPIDDAEWFKRLTGRSEVA 268
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQTP 245
ISA DYKFYSPRHKYRR AS S+SNI +P +PG +NS+ + T QGFH+ PQNQQQTP
Sbjct: 269 ISASDYKFYSPRHKYRRGASISLSNIQDIPSYPGAENSTALATPQGFHS---PQNQQQTP 325
Query: 246 SKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTIS 305
KH +Q L HQPQFH + N +Q +HQ QH +S N + GPP HLP+I+HAHQ +IS
Sbjct: 326 CKHQLQSLPHQPQFHPVLQN-NQTMHQS-QHAGPYSHNPQSGPPSHLPEISHAHQPISIS 383
Query: 306 QHMACLQPLT---GGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLG 358
QHMACLQPLT G G RLH++P +PAKFCDECGAPYLRETSKFCSECG KRLG
Sbjct: 384 QHMACLQPLTGGHVGGHGGRLHMLPTTPAKFCDECGAPYLRETSKFCSECGSKRLG 439
>gi|224101991|ref|XP_002312503.1| predicted protein [Populus trichocarpa]
gi|222852323|gb|EEE89870.1| predicted protein [Populus trichocarpa]
Length = 424
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 254/321 (79%), Positives = 281/321 (87%), Gaps = 8/321 (2%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
DVSPIESARARFLQIIVDHFI+DHVIEV D+ +EY+ PGQDKL+KRK+ DVQYEGDPRF
Sbjct: 99 DVSPIESARARFLQIIVDHFISDHVIEVADNESEYAAHPGQDKLTKRKSGDVQYEGDPRF 158
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
ALPLMYVANMYETLVNDVN+RLAS NG+R+KTIGVALEAAGGLYR +AKKFPKKGSC +K
Sbjct: 159 ALPLMYVANMYETLVNDVNMRLASLNGVRDKTIGVALEAAGGLYRRMAKKFPKKGSCIFK 218
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKP++MPI DAEWF+RLTGR+EVA
Sbjct: 219 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPSSMPIVDAEWFRRLTGRSEVA 278
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQTP 245
+SAQDYKFYSPRHK+RRV +SNIP L P DNSSTM TAQGF + PQN+QQTP
Sbjct: 279 VSAQDYKFYSPRHKFRRV----LSNIPGLATLPTEDNSSTMTTAQGFRS---PQNEQQTP 331
Query: 246 SKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTIS 305
SKHH+Q LSHQPQFH I+ NHHQ++HQ QH FSQNH+CGPP H+P+IAHA SPTI
Sbjct: 332 SKHHVQLLSHQPQFHPINQNHHQEVHQS-QHATQFSQNHQCGPPSHMPEIAHATHSPTIP 390
Query: 306 QHMACLQPLTGGHVGERLHVM 326
QHM LQPLTGGHVG RLH M
Sbjct: 391 QHMVYLQPLTGGHVGGRLHSM 411
>gi|356528164|ref|XP_003532675.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
At2g02148-like [Glycine max]
Length = 423
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 252/353 (71%), Positives = 284/353 (80%), Gaps = 28/353 (7%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
DVSPIE ARARFLQI+VDHF+ D VIE+ DS A+ + GQDK+SKR+ ++QYEGDP F
Sbjct: 98 DVSPIEPARARFLQIVVDHFVEDRVIEMPDSEADCA---GQDKMSKRRTSEIQYEGDPNF 154
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
ALPLMY+ANMYE+LV+DVN RLAS NGIREKTIGVALEAAGGLYR LAKKFPKKG CT+K
Sbjct: 155 ALPLMYMANMYESLVSDVNNRLASLNGIREKTIGVALEAAGGLYRKLAKKFPKKGPCTFK 214
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATS+ETRTRFPELVIQEEKRVRFVVVNGL IVEKPNN+PI+DAEWFKRLTGR+EVA
Sbjct: 215 RRELATSMETRTRFPELVIQEEKRVRFVVVNGLRIVEKPNNVPIDDAEWFKRLTGRSEVA 274
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQTP 245
ISA DYKFYSPRHKYRR AS S+SNI +P PQNQQQTP
Sbjct: 275 ISANDYKFYSPRHKYRRGASISLSNIQDIP----------------------PQNQQQTP 312
Query: 246 SKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTIS 305
KHH+Q L HQPQ H + N +Q +HQ QH +S N + G P HLP+I+HAHQ +IS
Sbjct: 313 CKHHLQSLPHQPQLHPVLQN-NQTMHQS-QHAGPYSHNPQSG-PSHLPEISHAHQPISIS 369
Query: 306 QHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLG 358
QHM CLQPLTGG+VG RLH++P +PAKFCDECGAPYLRETSKFCSECG KRLG
Sbjct: 370 QHMTCLQPLTGGNVGGRLHMLPTTPAKFCDECGAPYLRETSKFCSECGSKRLG 422
>gi|229914885|gb|ACQ90610.1| putative PPR repeat protein [Eutrema halophilum]
Length = 1023
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 254/370 (68%), Positives = 290/370 (78%), Gaps = 29/370 (7%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGA--EYSQQPGQDKLS---KRKNRDVQYE 60
DVSPIESAR RFLQII+D+FI+ HVIEV ++ E GQD + KRK+ D +YE
Sbjct: 667 DVSPIESARGRFLQIILDYFISQHVIEVCENKRDHETGSGGGQDNNNTKVKRKSDDTKYE 726
Query: 61 GDPRFALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKG 120
GDP FALPLMY+AN+YETLV + N+RLAS GIREKTIGVALEAAGGLYR L KKFPKKG
Sbjct: 727 GDPSFALPLMYIANLYETLVGEANVRLASLTGIREKTIGVALEAAGGLYRKLTKKFPKKG 786
Query: 121 SCTYKRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTG 180
+C Y+RRELATS+ETRTRFPELVI EEKRVRFVVVNGLDIVEKP+N+PIEDAEWFKRLTG
Sbjct: 787 ACMYRRRELATSVETRTRFPELVIHEEKRVRFVVVNGLDIVEKPDNLPIEDAEWFKRLTG 846
Query: 181 RNEVAISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQN 240
RNEVA+SA+DYKFY PRHK+RRV NSVS+I LP FPG D SST+ QGF ++SE Q+
Sbjct: 847 RNEVAVSARDYKFYCPRHKHRRV-QNSVSSIHGLPTFPGMD-SSTLANTQGFRSVSEDQS 904
Query: 241 QQQ------TPSKHHMQPLSHQP-QFH---HIHNNHHQQIHQGQQHPAHF-SQNHRCGPP 289
QQQ +PSKHHM LSH P QFH H ++HHQ I+Q Q AHF QNH+C
Sbjct: 905 QQQQQQHTPSPSKHHMSSLSHHPHQFHQPIHQSHHHHQPIYQSQHAAAHFPGQNHQCD-- 962
Query: 290 PHLPDIAHAHQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFC 349
P+++H HQSP+ISQHMACLQPLTGG HVMP SPAKFCD+CGA YLR+TSKFC
Sbjct: 963 ---PELSHTHQSPSISQHMACLQPLTGG------HVMPTSPAKFCDQCGAQYLRDTSKFC 1013
Query: 350 SECGVKRLGI 359
SECG KR GI
Sbjct: 1014 SECGCKRFGI 1023
>gi|297817834|ref|XP_002876800.1| hypothetical protein ARALYDRAFT_484139 [Arabidopsis lyrata subsp.
lyrata]
gi|297322638|gb|EFH53059.1| hypothetical protein ARALYDRAFT_484139 [Arabidopsis lyrata subsp.
lyrata]
Length = 1010
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 244/362 (67%), Positives = 285/362 (78%), Gaps = 21/362 (5%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLS---KRKNRDVQYEGD 62
DVSPIE+AR RFLQII+D+FI+ HVIEV ++ ++ G S KRK+ D +YEGD
Sbjct: 662 DVSPIEAARGRFLQIILDYFISQHVIEVAENKRDHETDSGGRDNSNKVKRKSDDTRYEGD 721
Query: 63 PRFALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSC 122
P FALPLMY+AN+YETLV + N+RLAS NGIR+KTIGVALEAAGGLYR L KKFPKKG+C
Sbjct: 722 PSFALPLMYIANLYETLVGEANVRLASLNGIRDKTIGVALEAAGGLYRKLTKKFPKKGTC 781
Query: 123 TYKRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRN 182
Y+RRELATS+ETRTRFPELVI EEKRVRFVVVNGLDIVEKP+++PIEDAEWFKRLTGRN
Sbjct: 782 MYRRRELATSVETRTRFPELVIHEEKRVRFVVVNGLDIVEKPDDLPIEDAEWFKRLTGRN 841
Query: 183 EVAISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQ 242
+VAISA+DYKFY PR K+RR+ NSVS+I LP FPG D SST+ + QGF +++E Q+QQ
Sbjct: 842 DVAISARDYKFYCPRRKHRRL-QNSVSSISGLPTFPGID-SSTLASTQGFRSVNEDQSQQ 899
Query: 243 Q---TPSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPA-HF-SQNHRCGPPPHLPDIAH 297
Q +PSKHHM LSHQ ++HH Q QH A HF QNH+C P+++H
Sbjct: 900 QHTPSPSKHHMSSLSHQFHQSIHQSHHHHQSIYQNQHAATHFPGQNHQCD-----PELSH 954
Query: 298 AHQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
HQSP+ISQHMACLQPLTGG HVMP SPAKFCD+CGA YLRETSKFCSECG KRL
Sbjct: 955 THQSPSISQHMACLQPLTGG------HVMPNSPAKFCDQCGAQYLRETSKFCSECGSKRL 1008
Query: 358 GI 359
GI
Sbjct: 1009 GI 1010
>gi|312190399|gb|ADQ43199.1| unknown [Eutrema parvulum]
Length = 1128
Score = 474 bits (1219), Expect = e-131, Method: Compositional matrix adjust.
Identities = 249/365 (68%), Positives = 287/365 (78%), Gaps = 26/365 (7%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLS---KRKNRDVQYEGD 62
+VSPIESAR RFLQII+D+FI+ HVIEV ++ ++ G S KRK+ D +YEGD
Sbjct: 779 EVSPIESARGRFLQIILDYFISQHVIEVCENKRDHEADSGGKDNSNKVKRKSDDTRYEGD 838
Query: 63 PRFALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSC 122
P FALPLMY+AN+YETLV + N+R+AS NGIREKT+GVALEAAGGLYR L KKFPKK +C
Sbjct: 839 PSFALPLMYIANLYETLVGEANVRVASLNGIREKTLGVALEAAGGLYRKLTKKFPKK-TC 897
Query: 123 TYKRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRN 182
Y+RRELATS+ETRTRFPELVI EEKRVRFVVVNGLDIVEKP+N+PIEDAE F+RLTGRN
Sbjct: 898 MYRRRELATSVETRTRFPELVIHEEKRVRFVVVNGLDIVEKPDNIPIEDAE-FQRLTGRN 956
Query: 183 EVAISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQ 242
EVA+SA+DYKFY PRHK+RRV NSVS I LP FPG D SST+ QGF ++SE Q+QQ
Sbjct: 957 EVAVSARDYKFYCPRHKHRRV-QNSVSTIHGLPTFPGID-SSTLADTQGFPSVSEDQSQQ 1014
Query: 243 QTP--SKHHMQPLSHQP-QFH---HIHNNHHQQIHQGQQHPAHF-SQNHRCGPPPHLPDI 295
TP SKHHM LSH P QFH H +HHQ I+QGQ AHF S NH+C P++
Sbjct: 1015 HTPSSSKHHMSSLSHHPHQFHQSIHQSQHHHQSIYQGQHAAAHFPSPNHQCD-----PEL 1069
Query: 296 AHA-HQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGV 354
+HA HQSP+ISQHM CLQPLTGG HVMP SPAKFCD+CG+ YLRETSKFCSECG
Sbjct: 1070 SHAQHQSPSISQHMTCLQPLTGG------HVMPTSPAKFCDQCGSQYLRETSKFCSECGS 1123
Query: 355 KRLGI 359
KRLGI
Sbjct: 1124 KRLGI 1128
>gi|334184104|ref|NP_001189499.1| uncharacterized protein [Arabidopsis thaliana]
gi|75217988|sp|Q56XX3.1|Y2215_ARATH RecName: Full=Uncharacterized protein At2g02148
gi|62320448|dbj|BAD94934.1| hypothetical protein [Arabidopsis thaliana]
gi|330250458|gb|AEC05552.1| uncharacterized protein [Arabidopsis thaliana]
Length = 432
Score = 456 bits (1173), Expect = e-126, Method: Compositional matrix adjust.
Identities = 241/364 (66%), Positives = 278/364 (76%), Gaps = 36/364 (9%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYS-QQPGQDKLSK--RKNRDVQYEGD 62
DVSPIE+AR RFLQII+D+FI+ HVIEV +S ++ G+D SK RK+ D +YEGD
Sbjct: 95 DVSPIEAARGRFLQIILDYFISQHVIEVCESKRDHDVDSGGRDSNSKVKRKSDDTRYEGD 154
Query: 63 PRFALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSC 122
P FALPLMY+AN+YETLV + N+RLAS NGIR+KTIGVALEAAGGLYR L KKFPKKG+C
Sbjct: 155 PSFALPLMYIANLYETLVGEANVRLASLNGIRDKTIGVALEAAGGLYRKLTKKFPKKGTC 214
Query: 123 TYKRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRN 182
Y+RRELATS+ETRTRFPELVI EEKRVRFVVVNGLDIVEKP+++PIE+AEWFKRLTGRN
Sbjct: 215 MYRRRELATSVETRTRFPELVIHEEKRVRFVVVNGLDIVEKPSDLPIEEAEWFKRLTGRN 274
Query: 183 EVAISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQ 242
EVAISA+DYKFY PR K+RR+ NSVS+I LP FPG D SST+ QGF E Q+QQ
Sbjct: 275 EVAISARDYKFYCPRRKHRRL-QNSVSSINGLPTFPGID-SSTLANTQGFR---EDQSQQ 329
Query: 243 Q---TPSKHHMQPLSHQPQFH---HIHNNHHQQIHQGQQHPAHF-SQNHRCGPPPHLPDI 295
Q +PSKHHM LSH QFH H + HHQ I+Q Q H+ SQNH+C P++
Sbjct: 330 QHTPSPSKHHMSSLSH--QFHQSIHQSHQHHQSIYQSQHAATHYPSQNHQCD-----PEL 382
Query: 296 AHAHQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVK 355
+H MACLQPLTGG HVMP SPAKFCD+CGA YLRETSKFCSECG K
Sbjct: 383 SHT--------QMACLQPLTGG------HVMPNSPAKFCDQCGAQYLRETSKFCSECGSK 428
Query: 356 RLGI 359
RLGI
Sbjct: 429 RLGI 432
>gi|357123405|ref|XP_003563401.1| PREDICTED: uncharacterized protein At2g02148-like [Brachypodium
distachyon]
Length = 430
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 220/354 (62%), Positives = 261/354 (73%), Gaps = 6/354 (1%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
DV PIE AR+RFL IIVDHFI ++VIEV +S A Q DKL+KRK ++V+YEGDPRF
Sbjct: 83 DVVPIEMARSRFLDIIVDHFIGENVIEVAESSALDHIQ-SNDKLNKRKQQEVRYEGDPRF 141
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
ALPLMY+AN+YE LV+DVN RLAS G REKTIGVALEAAGGLYR L +KFPKKGSC+++
Sbjct: 142 ALPLMYIANLYEALVSDVNTRLASLIGSREKTIGVALEAAGGLYRKLTQKFPKKGSCSFR 201
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATS TRTRFPELV+QEEKRVRFVV+NGL I+E+PNNM +EDAEWFKRLTGRNEVA
Sbjct: 202 RRELATSHATRTRFPELVVQEEKRVRFVVINGLVIIERPNNMRMEDAEWFKRLTGRNEVA 261
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQTP 245
IS+ DYKFYSPRHKYRR + +IPV D SS +V + GF +E QNQ +
Sbjct: 262 ISSIDYKFYSPRHKYRRTPQSGF-DIPVTSVLT-EDESSPLVCSLGFRPPNEIQNQHEPS 319
Query: 246 SKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTIS 305
K H+Q + QP H + H+ I Q QH + H C HL D A HQS +S
Sbjct: 320 LKRHIQQIESQPYLHFLDQAEHETIQQ-NQHSSQLPPIHPCASASHLSDNAQQHQS-YLS 377
Query: 306 QHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
H+A +Q GH+G R++++P SPAKFCDECG+PYLR TSKFCSECG KRLGI
Sbjct: 378 PHLASMQA-GHGHLGGRVNILPTSPAKFCDECGSPYLRTTSKFCSECGTKRLGI 430
>gi|308081275|ref|NP_001183571.1| uncharacterized protein LOC100502164 [Zea mays]
gi|238013144|gb|ACR37607.1| unknown [Zea mays]
gi|413935070|gb|AFW69621.1| hypothetical protein ZEAMMB73_403549 [Zea mays]
Length = 388
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 206/355 (58%), Positives = 264/355 (74%), Gaps = 7/355 (1%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTD-SGAEYSQQPGQDKLSKRKNRDVQYEGDPR 64
DV PIE+AR+RFL ++V+ FI +HV E+ + S ++ SQ G K SKRK ++V+YEGDPR
Sbjct: 40 DVIPIETARSRFLDLVVESFICEHVTELVECSSSDCSQVDG--KSSKRKQQEVRYEGDPR 97
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTY 124
ALPLMY+AN+YETL++DVN+RLA+ G REKTIG+ALEA+GGLYR L ++FPK+G C++
Sbjct: 98 VALPLMYIANLYETLISDVNVRLAALIGFREKTIGLALEASGGLYRKLIQRFPKRGHCSF 157
Query: 125 KRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEV 184
KRRELATS TRT+FPELV+QEEKRVRFVV+NGL IVE+P NM +EDAEWFKRLTGR+EV
Sbjct: 158 KRRELATSHATRTKFPELVVQEEKRVRFVVINGLAIVERPENMRMEDAEWFKRLTGRSEV 217
Query: 185 AISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQT 244
AI ++DYKFYSPRHK RR + + +IP +NSS +V + GF + +E QNQ Q+
Sbjct: 218 AICSRDYKFYSPRHKIRR-SPQAAFDIPDTSALAEDENSS-LVCSSGFRSPNEIQNQHQS 275
Query: 245 PSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTI 304
SK ++ L +QP H H + Q QH A F H+C PHL D Q +
Sbjct: 276 TSKRPIEHLENQPFLHLFHQAEDDSMQQ-VQHCAQFPPIHQCTSTPHLSDNPQHQQQAYL 334
Query: 305 SQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
SQH++CLQ + GH+G R+H++P SPAKFCDECG+PYLR TSKFCSECG KRLG+
Sbjct: 335 SQHISCLQ-VGQGHLGGRMHIIPTSPAKFCDECGSPYLRATSKFCSECGTKRLGM 388
>gi|242097162|ref|XP_002439071.1| hypothetical protein SORBIDRAFT_10g031060 [Sorghum bicolor]
gi|241917294|gb|EER90438.1| hypothetical protein SORBIDRAFT_10g031060 [Sorghum bicolor]
Length = 388
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 203/355 (57%), Positives = 261/355 (73%), Gaps = 7/355 (1%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDS-GAEYSQQPGQDKLSKRKNRDVQYEGDPR 64
DV PIE AR+RFL ++VD+FI +HVIE+ + G++ SQ G K +KRK ++V+YEGDPR
Sbjct: 40 DVIPIEMARSRFLDLVVDYFICEHVIEMVECCGSDCSQVDG--KSNKRKQQEVRYEGDPR 97
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTY 124
ALPLMY+AN+YETLV+DVN+RL + G EKTIG+ALEA+GGLYR L ++FPK+G C++
Sbjct: 98 VALPLMYIANLYETLVSDVNVRLEALIGFHEKTIGLALEASGGLYRKLTQRFPKRGHCSF 157
Query: 125 KRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEV 184
KRRELATS TRT+FPELV+QEEKRVRFVV+NGL IVE+P NM +EDAEWFKRLTGR+EV
Sbjct: 158 KRRELATSHATRTKFPELVVQEEKRVRFVVINGLAIVERPENMRVEDAEWFKRLTGRSEV 217
Query: 185 AISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQT 244
AI ++DYKFYSPRHK RR + + +IP D +S++V + GF + +E QNQ Q+
Sbjct: 218 AICSRDYKFYSPRHKIRR-SPQAAFDIPDTSAL-AEDETSSLVCSSGFRSPNEIQNQHQS 275
Query: 245 PSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTI 304
SK ++ L +QP H H + Q QH F H+C PHL D Q +
Sbjct: 276 TSKRPIEHLENQPFLHLFHQAEDDSMQQ-VQHCTQFPPIHQCTSNPHLSDNPQHQQQAYL 334
Query: 305 SQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
SQ ++CLQ + GH+G R+H++P SPAKFCDECG+PYLR TSKFCSECG KRLG+
Sbjct: 335 SQQISCLQ-VGQGHLGGRMHIVPTSPAKFCDECGSPYLRATSKFCSECGTKRLGM 388
>gi|326519841|dbj|BAK00293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 438
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/359 (56%), Positives = 254/359 (70%), Gaps = 16/359 (4%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
DV PIE AR+RFL II+DHFI + IEV + Y ++++KRK ++V+YEGDPRF
Sbjct: 91 DVVPIEMARSRFLDIIIDHFIGQNAIEVAEPSL-YDSILASNRINKRKQQEVRYEGDPRF 149
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
ALPLMY+AN+YE+LV+DV+ RLAS G REKT+GVALEAAGGLYR L +KFPKKG+C+++
Sbjct: 150 ALPLMYIANLYESLVSDVDARLASLIGSREKTMGVALEAAGGLYRKLTQKFPKKGNCSFR 209
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATS T+T+FPELV+QEEKR+RFVV+NGL I+E+PNNM ++DAEWFKRLTGRNEVA
Sbjct: 210 RRELATSNATKTKFPELVVQEEKRIRFVVINGLVIIERPNNMRMQDAEWFKRLTGRNEVA 269
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGT-----DNSSTMVTAQGFHTISEPQN 240
IS++DYKFYS RHKYRR PV PGT D +S +V + F E QN
Sbjct: 270 ISSRDYKFYSARHKYRRTPQ------PVF-DIPGTSVLSEDETSPLVCSSEFRPPFEMQN 322
Query: 241 QQQTPSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQ 300
Q + SK H++ + QP H + H I Q QH + F H C HL D HQ
Sbjct: 323 QHELSSKRHIEQIESQPYLHFLDQGEHGAIQQ-NQHTSQFPTIHPCASSSHLSDNPQQHQ 381
Query: 301 SPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
S +S H++ +Q H G RL+++P+ PAKFCDECG+PYLR TSKFCSECG KRLGI
Sbjct: 382 S-YLSPHLSSMQA-GHSHPGGRLNILPSRPAKFCDECGSPYLRATSKFCSECGTKRLGI 438
>gi|294463740|gb|ADE77395.1| unknown [Picea sitchensis]
Length = 473
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 202/368 (54%), Positives = 254/368 (69%), Gaps = 16/368 (4%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDS-GAEYSQQPGQDKLSKRKNRDVQYEGDPR 64
DV PIESARARFLQ+I+D+F+ VIEV DS + Y+ G+DK KRK+R+ QYEGD R
Sbjct: 106 DVIPIESARARFLQLILDYFLRPRVIEVPDSPESSYNSPNGKDKQKKRKSREAQYEGDAR 165
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTY 124
F LPL YVAN+YETLVN+VN+RLA+ +GIREKT+GVALEAAGGLYR L KKFP+KG+ T+
Sbjct: 166 FVLPLAYVANLYETLVNEVNIRLATLDGIREKTMGVALEAAGGLYRRLVKKFPRKGAYTF 225
Query: 125 KRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEV 184
+RRE+A++LE RT+FPELV +EEKRVRFVVV+GL+I+E+P M IEDAEWFKRLTGR+E
Sbjct: 226 RRREMASALEARTKFPELVTREEKRVRFVVVHGLEILEQP-TMSIEDAEWFKRLTGRHEA 284
Query: 185 AISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGF------------ 232
AIS +DYKFYS RHKYRR N++S + +P +P +NSST+ G
Sbjct: 285 AISERDYKFYSARHKYRRTPQNALSTVTGMPAYPNAENSSTLGAPPGLRSPYTNDVLNLT 344
Query: 233 -HTISEPQNQQQTPSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPH 291
+ + Q Q Q + H P Q Q H + Q H +H S C H
Sbjct: 345 SRSPPQHQIQHQPQHQLHCLPHQAQHQQLHQSQHQSLHQSQQAPHLSHPSHGQHCTHTSH 404
Query: 292 LPDIAHAHQSPTISQHMA-CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCS 350
L D+ HA + + QHM LQP G +VG R+ V+P SP K+CD+CG+PYLRETSKFCS
Sbjct: 405 LDDMPHAQHTSPVPQHMPRNLQPPPGLNVGGRMLVLPTSPPKYCDQCGSPYLRETSKFCS 464
Query: 351 ECGVKRLG 358
ECG KR+G
Sbjct: 465 ECGAKRIG 472
>gi|449481248|ref|XP_004156126.1| PREDICTED: uncharacterized protein At2g02148-like [Cucumis sativus]
Length = 348
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 169/209 (80%), Positives = 191/209 (91%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQPGQDKLSKRKNRDVQYEGDPRF 65
D+SPIE+ARARFLQIIVDHFI+DH++EVT++ +Y Q GQDKL+KRK ++VQYE DP+F
Sbjct: 103 DISPIEAARARFLQIIVDHFIHDHILEVTETDNDYISQSGQDKLTKRKTKEVQYEADPKF 162
Query: 66 ALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYK 125
LPLMYVANMYETLVND N+RLAS +GIR+K IGVALEAAGGLYR LA+KFPKKG CTYK
Sbjct: 163 VLPLMYVANMYETLVNDANIRLASLSGIRDKNIGVALEAAGGLYRKLAQKFPKKGPCTYK 222
Query: 126 RRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEVA 185
RRELATSLETRTRFPELV+QEEKRVRFVVVNGLDIVEKPN M EDAEWF+RLTGR+EVA
Sbjct: 223 RRELATSLETRTRFPELVVQEEKRVRFVVVNGLDIVEKPNRMSTEDAEWFRRLTGRSEVA 282
Query: 186 ISAQDYKFYSPRHKYRRVASNSVSNIPVL 214
+SAQDYKFYSPRHKYRRVA+NSVS+I L
Sbjct: 283 VSAQDYKFYSPRHKYRRVAANSVSSISSL 311
>gi|4038037|gb|AAC97219.1| hypothetical protein [Arabidopsis thaliana]
Length = 1107
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 174/269 (64%), Positives = 208/269 (77%), Gaps = 8/269 (2%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYS-QQPGQDKLSK--RKNRDVQYEGD 62
DVSPIE+AR RFLQII+D+FI+ HVIEV +S ++ G+D SK RK+ D +YEGD
Sbjct: 804 DVSPIEAARGRFLQIILDYFISQHVIEVCESKRDHDVDSGGRDSNSKVKRKSDDTRYEGD 863
Query: 63 PRFALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSC 122
P FALPLMY+AN+YETLV + N+RLAS NGIR+KTIGVALEAAGGLYR L KKFPKKG+C
Sbjct: 864 PSFALPLMYIANLYETLVGEANVRLASLNGIRDKTIGVALEAAGGLYRKLTKKFPKKGTC 923
Query: 123 TYKRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRN 182
Y+RRELATS+ETRTRFPELVI EEKRVRFVVVNGLDIVEKP+++PIE+AEWFKRLTGRN
Sbjct: 924 MYRRRELATSVETRTRFPELVIHEEKRVRFVVVNGLDIVEKPSDLPIEEAEWFKRLTGRN 983
Query: 183 EVAISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQ 242
EVAISA+DYKFY PR K+RR+ NSVS+I LP FPG D SST+ QGF EP +
Sbjct: 984 EVAISARDYKFYCPRRKHRRL-QNSVSSINGLPTFPGID-SSTLANTQGFR---EPNSPA 1038
Query: 243 QTPSKHHMQPLSHQPQFHHIHNNHHQQIH 271
+ + Q L +F +N+ + I
Sbjct: 1039 KFCDQCGAQYLRETSKFCSENNDQIESIE 1067
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/25 (88%), Positives = 23/25 (92%)
Query: 327 PASPAKFCDECGAPYLRETSKFCSE 351
P SPAKFCD+CGA YLRETSKFCSE
Sbjct: 1034 PNSPAKFCDQCGAQYLRETSKFCSE 1058
>gi|147784212|emb|CAN72842.1| hypothetical protein VITISV_000983 [Vitis vinifera]
Length = 475
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 145/185 (78%), Positives = 155/185 (83%), Gaps = 4/185 (2%)
Query: 175 FKRLTGRNEVAISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHT 234
FKRLTGR+EVA+SA DYKFYSPRHKYRRVASN VSNIP LP FPGTD S TM +AQGF T
Sbjct: 13 FKRLTGRDEVAVSAXDYKFYSPRHKYRRVASNPVSNIPGLPTFPGTDTSPTMASAQGFRT 72
Query: 235 ISEPQNQQQTPSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPD 294
++EPQNQQ TPSKHHMQ LS QFH IH NHHQ IHQ QH AHFS +H+CGPP HLP+
Sbjct: 73 VNEPQNQQATPSKHHMQSLS---QFHPIHQNHHQPIHQ-TQHSAHFSHSHQCGPPSHLPE 128
Query: 295 IAHAHQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGV 354
I H HQSPTI QHMACLQP+TGGHV RLHV+P SPAKFCDECGAPYLRETSKFCSECG
Sbjct: 129 IPHGHQSPTIPQHMACLQPITGGHVSGRLHVLPTSPAKFCDECGAPYLRETSKFCSECGG 188
Query: 355 KRLGI 359
KR I
Sbjct: 189 KRPSI 193
>gi|242088451|ref|XP_002440058.1| hypothetical protein SORBIDRAFT_09g025240 [Sorghum bicolor]
gi|241945343|gb|EES18488.1| hypothetical protein SORBIDRAFT_09g025240 [Sorghum bicolor]
Length = 325
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 144/236 (61%), Positives = 190/236 (80%), Gaps = 5/236 (2%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTD-SGAEYSQQPGQDKLSKRKNRDVQYEGDPR 64
DV PIE+AR+RFL ++VD+FI +HVIE+ + SG++ SQ G K +KRK ++V+YEGDPR
Sbjct: 64 DVIPIETARSRFLDLVVDYFICEHVIEMVECSGSDCSQVDG--KSNKRKQQEVRYEGDPR 121
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTY 124
+LPLMY+AN+YETLV+DVN+RLA+ G EKTIG+ALEA+GGLYR L ++FPK+G C++
Sbjct: 122 VSLPLMYIANLYETLVSDVNVRLAALIGFHEKTIGLALEASGGLYRKLTQRFPKRGHCSF 181
Query: 125 KRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEV 184
KRRELATS TRT+FPELV+QEEKRVRFVV+NGL I+E+P NM +EDAEWF+RLTGR+EV
Sbjct: 182 KRRELATSHATRTKFPELVVQEEKRVRFVVINGLAIIERPENMRMEDAEWFRRLTGRSEV 241
Query: 185 AISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQN 240
AI ++DYKFYSPRHK RR + + +IP +NSS +V + GF + +EP +
Sbjct: 242 AICSRDYKFYSPRHKIRR-SPQAAFDIPDTSALAEDENSS-LVCSSGFRSPNEPTS 295
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/33 (84%), Positives = 30/33 (90%)
Query: 327 PASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
P SPAKFCDECG+PYLR TSKFCSECG KRLG+
Sbjct: 293 PTSPAKFCDECGSPYLRVTSKFCSECGTKRLGM 325
>gi|302785556|ref|XP_002974549.1| hypothetical protein SELMODRAFT_414814 [Selaginella moellendorffii]
gi|300157444|gb|EFJ24069.1| hypothetical protein SELMODRAFT_414814 [Selaginella moellendorffii]
Length = 450
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 165/356 (46%), Positives = 223/356 (62%), Gaps = 50/356 (14%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDS-GAEYSQQPGQDKLSKRKNRDVQYEGDPR 64
DV PIE+ARARF+Q+I+D FI V+DS YS ++K SKRK +V YEGDPR
Sbjct: 142 DVVPIEAARARFVQLIIDRFIQPRSYPVSDSPEGNYSNPNSKEKESKRKPSEVHYEGDPR 201
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKG-SCT 123
F LPL +VAN+YETL+++++ RL + G+ EKT+GVALEA+GGLYR L KKFPK G +
Sbjct: 202 FLLPLTFVANLYETLISEIDARLHTVEGVHEKTMGVALEASGGLYRKLVKKFPKSGHPVS 261
Query: 124 YKRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNE 183
+KRRE+A++LE +T+FP+LV +EKRVRFVVV+GL+++E+P MP EDAEWFKRLTGR E
Sbjct: 262 FKRREMASALEAKTKFPQLVRGDEKRVRFVVVHGLELIERPE-MPQEDAEWFKRLTGRQE 320
Query: 184 VAISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQ 243
A++ DY FYS R K+RR+ +V + P +P S +++T
Sbjct: 321 AAVTGSDYNFYSARVKHRRL---TVRDPPPVPQQYQAGESPSVITPH------------- 364
Query: 244 TPSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPT 303
+P++ H Q I + HQ QQ SQ H PPH
Sbjct: 365 SPTRDHPQ----------IQMQQSRVRHQVQQP----SQTHTPRTPPH------------ 398
Query: 304 ISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
+S+ + +G H G + V P SPAKFCD+CGAP+LRETS+FC++CG KR G+
Sbjct: 399 VSRDIQ-----SGIHPGMTVMVPPPSPAKFCDQCGAPFLRETSRFCTDCGNKRHGV 449
>gi|302759645|ref|XP_002963245.1| hypothetical protein SELMODRAFT_404974 [Selaginella moellendorffii]
gi|300168513|gb|EFJ35116.1| hypothetical protein SELMODRAFT_404974 [Selaginella moellendorffii]
Length = 451
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 163/356 (45%), Positives = 223/356 (62%), Gaps = 50/356 (14%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDS-GAEYSQQPGQDKLSKRKNRDVQYEGDPR 64
DV PIE+ARARF+Q+I+D FI V+DS YS ++K SKRK +V YEGDPR
Sbjct: 143 DVVPIEAARARFVQLIIDRFIQPRSYPVSDSPEGNYSNPNSKEKESKRKPSEVHYEGDPR 202
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKG-SCT 123
+ LPL +VAN+YETL+++++ RL + G+ EKT+GVALEA+GGLYR L KKFPK G +
Sbjct: 203 YLLPLTFVANLYETLISEIDARLHTVEGVHEKTMGVALEASGGLYRKLVKKFPKSGHPVS 262
Query: 124 YKRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNE 183
+KRRE+A++LE +T+FP+LV +EKRVRFVVV+GL+++E+P MP EDAEWFKRLTGR E
Sbjct: 263 FKRREMASALEAKTKFPQLVRGDEKRVRFVVVHGLELIERPE-MPQEDAEWFKRLTGRQE 321
Query: 184 VAISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQ 243
A++ DY FYS R K+RR+ +V + P +P S +++T
Sbjct: 322 AAVTGSDYNFYSARVKHRRL---TVRDPPPVPQQYQAGESPSVITPH------------- 365
Query: 244 TPSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPT 303
+P++ H Q Q + H Q+ Q SQ H PPH
Sbjct: 366 SPTRDHQQIQMQQSRVRH-------QVQQP-------SQTHTPRTPPH------------ 399
Query: 304 ISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
+S+ + +G H G + V P SPAKFCD+CGAP+LRETS+FC++CG KR G+
Sbjct: 400 VSRDIQ-----SGIHPGMTVMVPPPSPAKFCDQCGAPFLRETSRFCTDCGNKRHGV 450
>gi|168024988|ref|XP_001765017.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683826|gb|EDQ70233.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 175/361 (48%), Positives = 227/361 (62%), Gaps = 29/361 (8%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQP-GQDKLSKRKNRDVQYEGDPR 64
DV PIES RA FLQ+I+D FI HVI ++ S P G+DK KRK+RDVQYEGDPR
Sbjct: 90 DVLPIESTRAFFLQLIIDSFIRCHVIPAAEAPEGNSYSPNGKDKSKKRKSRDVQYEGDPR 149
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTY 124
+ LPL +VAN+YETL+ ++N RLA+ +G++EK+ GVALEAAGGLYR L KKFPK + T+
Sbjct: 150 YLLPLTFVANLYETLIREINQRLATVDGLQEKSFGVALEAAGGLYRRLVKKFPKSATMTF 209
Query: 125 KRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEV 184
KRRE+A++LE R +FP+LV EEKRVRFVVV+GL++VE+PN P EDAEWFKRLTGR+E
Sbjct: 210 KRREVASALEARIKFPQLVTGEEKRVRFVVVHGLELVERPNLSP-EDAEWFKRLTGRHEA 268
Query: 185 AISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQT 244
I DYKF++ R K+RR +S++++ V+ G D S M+ QGF S Q
Sbjct: 269 QIYESDYKFFAARVKHRRAPQHSLASM-VMLQVVGVDASCVMLW-QGFE--SHGMECQGY 324
Query: 245 PSKH------HMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDIAHA 298
P + H+Q ++ H Q Q Q H PPH+ A
Sbjct: 325 PGRDPGGLPAHLQHQHQLHHHQQHQHSAHVQYQQHSQPTGH--------TPPHV-----A 371
Query: 299 HQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLG 358
H+A L+P T V + P P+K+CDECG+ Y+RETSKFCSECG KR G
Sbjct: 372 RDMDASGMHVAVLRPDTFAMVQHQ----PTGPSKYCDECGSAYVRETSKFCSECGTKRAG 427
Query: 359 I 359
+
Sbjct: 428 V 428
>gi|168019203|ref|XP_001762134.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686538|gb|EDQ72926.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 411
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 168/362 (46%), Positives = 223/362 (61%), Gaps = 55/362 (15%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQP-GQDKLSKRKNRDVQYEGDPR 64
DV PIES RARFLQ+I+D+FI HV+ T++ S P G+DK KRK+RDVQYEGDPR
Sbjct: 96 DVIPIESTRARFLQLIIDYFIRYHVVPATEAPEGNSYSPNGKDKSKKRKSRDVQYEGDPR 155
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTY 124
+ LPL +VAN+YETL+ ++N RLA+ G++EKT GVALEAAGGLYR L KKFPK + T+
Sbjct: 156 YLLPLTFVANLYETLIREINQRLATIEGLQEKTFGVALEAAGGLYRRLVKKFPKSATMTF 215
Query: 125 KRRELATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEWFKRLTGRNEV 184
KRRE+A++LE RT+FP+LV EEKRVRFVVV+GL++VE+PN P EDAEWFKRLTGR+E
Sbjct: 216 KRREMASALEARTKFPQLVTGEEKRVRFVVVHGLELVERPNLSP-EDAEWFKRLTGRHEA 274
Query: 185 AISAQDYKFYSPRHKYRRVASNSVSNIPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQT 244
I DYK+++ R K+RR +++S++ +L V++ F + + Q
Sbjct: 275 QIYESDYKYFAARVKHRRAPHHTLSSMTLL-----------QVSSWAFEQLQQQHQLQHQ 323
Query: 245 PSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLP---DIAHAHQS 301
+H Q H +H PPH+P D H S
Sbjct: 324 QHQHSPHVQHQQLTQHTVHT------------------------PPHVPRDLDAPGMHGS 359
Query: 302 PT-ISQHMACLQPLTGGHVGERLHVMPAS---PAKFCDECGAPYLRETSKFCSECGVKRL 357
+ + L P ++P S P+K+CDECG+ Y+RETSKFCSECG KRL
Sbjct: 360 VVQVGLNNLWLAP-----------IIPCSATGPSKYCDECGSAYIRETSKFCSECGTKRL 408
Query: 358 GI 359
G+
Sbjct: 409 GV 410
>gi|357481731|ref|XP_003611151.1| hypothetical protein MTR_5g010860 [Medicago truncatula]
gi|355512486|gb|AES94109.1| hypothetical protein MTR_5g010860 [Medicago truncatula]
Length = 199
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 123/201 (61%), Positives = 142/201 (70%), Gaps = 23/201 (11%)
Query: 75 MYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYKRRELATSLE 134
MYE+LVN+VN+RLAS NGI +K IGVALEAA GL+R LAKKFPKKG CTYKRRELATS E
Sbjct: 1 MYESLVNEVNIRLASLNGIHDKFIGVALEAAIGLHRRLAKKFPKKGPCTYKRRELATSTE 60
Query: 135 TRTRFPELVIQEEKRVRFVVVNGLD------------IVEKPNNMPIEDAEWFKRLTGRN 182
TRTRFPELVIQEEKRVRFVVVNGL IVE+ + I+DA WFKRL RN
Sbjct: 61 TRTRFPELVIQEEKRVRFVVVNGLKPQPPPKKVKGLIIVER---LKIDDAGWFKRLICRN 117
Query: 183 EVAISAQDYKFYSPRH---KYRRVASNSVSNIPVLP----GFPGTDNSSTMVTAQGFHTI 235
EVAISA DYKFYSPRH +YRR AS S+ NIP +P + G DNS+T+ T QG+ +
Sbjct: 118 EVAISANDYKFYSPRHRYSRYRRGASISLPNIPHIPLSMVFYLGADNSTTLTTTQGYRSK 177
Query: 236 SEPQNQQQTPSKHHMQPLSHQ 256
S +K +MQ + Q
Sbjct: 178 SHT-TLITLCAKDYMQQVLQQ 197
>gi|147784213|emb|CAN72843.1| hypothetical protein VITISV_000984 [Vitis vinifera]
Length = 105
Score = 200 bits (509), Expect = 7e-49, Method: Composition-based stats.
Identities = 93/105 (88%), Positives = 101/105 (96%)
Query: 70 MYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKGSCTYKRREL 129
MYVANMYETLVN+VN+RLAS NGIREKTIGVALEAAGGLYR LAKKFPKKG CT+KRREL
Sbjct: 1 MYVANMYETLVNEVNIRLASLNGIREKTIGVALEAAGGLYRRLAKKFPKKGPCTFKRREL 60
Query: 130 ATSLETRTRFPELVIQEEKRVRFVVVNGLDIVEKPNNMPIEDAEW 174
ATS+ETRTRFPELVIQEEKRVRFVVVNGL IV+KPN++PI+DAEW
Sbjct: 61 ATSIETRTRFPELVIQEEKRVRFVVVNGLVIVDKPNSVPIDDAEW 105
>gi|168034461|ref|XP_001769731.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679080|gb|EDQ65532.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 237
Score = 144 bits (362), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 68/116 (58%), Positives = 89/116 (76%), Gaps = 1/116 (0%)
Query: 6 DVSPIESARARFLQIIVDHFINDHVIEVTDSGAEYSQQP-GQDKLSKRKNRDVQYEGDPR 64
DVSPIES RARFLQ+I+D+FI H+I ++ +S G+DKL+KRK+RDVQYEG R
Sbjct: 68 DVSPIESTRARFLQLIIDYFIKYHIISAAEAHEAHSYSSNGKDKLTKRKSRDVQYEGGHR 127
Query: 65 FALPLMYVANMYETLVNDVNLRLASFNGIREKTIGVALEAAGGLYRSLAKKFPKKG 120
+ LPL ++AN+YETL+ ++N RLA+ G++E+T G ALEAAGGLYR L KKF K G
Sbjct: 128 YLLPLTFLANLYETLIREINQRLAAVEGLQERTFGAALEAAGGLYRRLVKKFLKSG 183
>gi|449526116|ref|XP_004170060.1| PREDICTED: uncharacterized protein At2g02148-like [Cucumis sativus]
Length = 53
Score = 98.6 bits (244), Expect = 4e-18, Method: Composition-based stats.
Identities = 46/53 (86%), Positives = 49/53 (92%), Gaps = 1/53 (1%)
Query: 308 MACLQPLTGGHVGERLHV-MPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
MA LQPL+GGHVG RLH +P+SPAKFCDECGAPYLRETSKFCSECGVKRLGI
Sbjct: 1 MASLQPLSGGHVGGRLHHGLPSSPAKFCDECGAPYLRETSKFCSECGVKRLGI 53
>gi|357444493|ref|XP_003592524.1| hypothetical protein MTR_1g106960 [Medicago truncatula]
gi|355481572|gb|AES62775.1| hypothetical protein MTR_1g106960 [Medicago truncatula]
Length = 53
Score = 77.0 bits (188), Expect = 1e-11, Method: Composition-based stats.
Identities = 33/43 (76%), Positives = 39/43 (90%)
Query: 42 QQPGQDKLSKRKNRDVQYEGDPRFALPLMYVANMYETLVNDVN 84
Q+ QDKL+KRK R++QYEGDP FALPLMYVANMYE+LVN+VN
Sbjct: 11 QRSDQDKLNKRKTREIQYEGDPNFALPLMYVANMYESLVNEVN 53
>gi|148229302|ref|NP_001088799.1| zinc finger C2HC domain-containing protein 1A [Xenopus laevis]
gi|82179605|sp|Q5PPV5.1|ZC21A_XENLA RecName: Full=Zinc finger C2HC domain-containing protein 1A
gi|56269221|gb|AAH87481.1| LOC496067 protein [Xenopus laevis]
Length = 323
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 27/46 (58%)
Query: 312 QPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+PL GH+ + M KFC ECG Y E +KFC ECGV+R+
Sbjct: 276 KPLNDGHIRAADNNMSGQLTKFCHECGTKYPVEWAKFCCECGVRRM 321
>gi|224046407|ref|XP_002198154.1| PREDICTED: zinc finger C2HC domain-containing protein 1A
[Taeniopygia guttata]
Length = 351
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 31/64 (48%)
Query: 294 DIAHAHQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECG 353
DI+ T + C + GG + P +KFC ECG Y E +KFC ECG
Sbjct: 286 DISLRSNGKTGYDNGECSSSVNGGSAKSSEGISPVQLSKFCHECGTKYPVEWAKFCCECG 345
Query: 354 VKRL 357
++R+
Sbjct: 346 IRRM 349
>gi|410908969|ref|XP_003967963.1| PREDICTED: zinc finger C2HC domain-containing protein 1A-like
[Takifugu rubripes]
Length = 266
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 36/73 (49%)
Query: 287 GPPPHLPDIAHAHQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETS 346
G P +P ++ + T + P +G V E +KFC CG+ Y ET+
Sbjct: 194 GQPTGIPSSKVSYPASTRTNSPGLTSPASGEDVAENELENGGMKSKFCHGCGSKYPVETA 253
Query: 347 KFCSECGVKRLGI 359
KFC ECG+KR+ I
Sbjct: 254 KFCCECGIKRMFI 266
>gi|426359969|ref|XP_004047226.1| PREDICTED: zinc finger C2HC domain-containing protein 1A [Gorilla
gorilla gorilla]
Length = 328
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 279 CASSLNGGNIKGIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 326
>gi|332240690|ref|XP_003269519.1| PREDICTED: zinc finger C2HC domain-containing protein 1A [Nomascus
leucogenys]
Length = 328
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 279 CASSLNGGNIKGIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 326
>gi|410908967|ref|XP_003967962.1| PREDICTED: zinc finger C2HC domain-containing protein 1A-like
[Takifugu rubripes]
Length = 272
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 36/73 (49%)
Query: 287 GPPPHLPDIAHAHQSPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETS 346
G P +P ++ + T + P +G V E +KFC CG+ Y ET+
Sbjct: 200 GQPAGIPSSKVSYPASTRTNSPGLTSPASGEDVAENELGNGGMKSKFCHGCGSKYPVETA 259
Query: 347 KFCSECGVKRLGI 359
KFC ECG+KR+ I
Sbjct: 260 KFCCECGIKRMFI 272
>gi|4929593|gb|AAD34057.1|AF151820_1 CGI-62 protein [Homo sapiens]
Length = 325
Score = 45.8 bits (107), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 276 CASSLNGGNIKGIEGHSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 323
>gi|355779772|gb|EHH64248.1| hypothetical protein EGM_17421, partial [Macaca fascicularis]
Length = 309
Score = 45.8 bits (107), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C + GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 260 CASSVNGGNIKSIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 307
>gi|410292576|gb|JAA24888.1| family with sequence similarity 164, member A [Pan troglodytes]
gi|410353997|gb|JAA43602.1| family with sequence similarity 164, member A [Pan troglodytes]
Length = 324
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 275 CASSLNGGNIKGIEGHSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 322
>gi|189053958|dbj|BAG36465.1| unnamed protein product [Homo sapiens]
Length = 325
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 276 CASSLNGGNIKGIEGHSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 323
>gi|397522568|ref|XP_003831334.1| PREDICTED: zinc finger C2HC domain-containing protein 1A [Pan
paniscus]
Length = 324
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 275 CASSLNGGNIKGIEGHSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 322
>gi|410041895|ref|XP_519818.4| PREDICTED: zinc finger C2HC domain-containing protein 1A isoform 2
[Pan troglodytes]
Length = 328
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 279 CASSLNGGNIKGIEGHSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 326
>gi|223555989|ref|NP_057094.2| zinc finger C2HC domain-containing protein 1A [Homo sapiens]
gi|296434498|sp|Q96GY0.2|ZC21A_HUMAN RecName: Full=Zinc finger C2HC domain-containing protein 1A
gi|119607465|gb|EAW87059.1| chromosome 8 open reading frame 70 [Homo sapiens]
gi|312151560|gb|ADQ32292.1| chromosome 8 open reading frame 70 [synthetic construct]
Length = 325
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 276 CASSLNGGNIKGIEGHSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 323
>gi|14290590|gb|AAH09074.1| Family with sequence similarity 164, member A [Homo sapiens]
Length = 325
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 276 CASSLNGGNIKGIEGHSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 323
>gi|402878543|ref|XP_003902939.1| PREDICTED: zinc finger C2HC domain-containing protein 1A isoform 2
[Papio anubis]
Length = 363
Score = 45.4 bits (106), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C + GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 314 CASSVNGGNIKGIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 361
>gi|357481729|ref|XP_003611150.1| hypothetical protein MTR_5g010850 [Medicago truncatula]
gi|355512485|gb|AES94108.1| hypothetical protein MTR_5g010850 [Medicago truncatula]
Length = 52
Score = 45.4 bits (106), Expect = 0.042, Method: Composition-based stats.
Identities = 22/41 (53%), Positives = 29/41 (70%), Gaps = 2/41 (4%)
Query: 1 MKFEPDVS--PIESARARFLQIIVDHFINDHVIEVTDSGAE 39
+K P V PIESARA F+ I++D+FI+DH IEV DS +
Sbjct: 12 LKITPSVYDFPIESARAMFMHIVMDYFIDDHGIEVVDSDVD 52
>gi|405973613|gb|EKC38315.1| hypothetical protein CGI_10022985 [Crassostrea gigas]
Length = 351
Score = 45.4 bits (106), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%)
Query: 322 RLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
R + + AKFC ECG Y +KFC ECGV+R+G
Sbjct: 313 RKNSLGKGGAKFCHECGTKYPLSEAKFCCECGVRRMGF 350
>gi|297299637|ref|XP_001088418.2| PREDICTED: protein FAM164A-like isoform 1 [Macaca mulatta]
Length = 325
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C + GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 276 CASSVNGGNIKGIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 323
>gi|402878541|ref|XP_003902938.1| PREDICTED: zinc finger C2HC domain-containing protein 1A isoform 1
[Papio anubis]
Length = 324
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C + GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 275 CASSVNGGNIKGIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 322
>gi|380813318|gb|AFE78533.1| hypothetical protein LOC51101 [Macaca mulatta]
Length = 324
Score = 45.1 bits (105), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 27/48 (56%)
Query: 310 CLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C + GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 275 CASSVNGGNIKGIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 322
>gi|197102252|ref|NP_001124880.1| zinc finger C2HC domain-containing protein 1A [Pongo abelii]
gi|75042482|sp|Q5REC0.1|ZC21A_PONAB RecName: Full=Zinc finger C2HC domain-containing protein 1A
gi|55726230|emb|CAH89887.1| hypothetical protein [Pongo abelii]
Length = 324
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 27/44 (61%)
Query: 314 LTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
L GG++ P + +KFC ECG Y E +KFC ECG++R+
Sbjct: 279 LNGGNIKGIEGNSPGNLSKFCHECGTKYPVEWAKFCCECGIRRM 322
>gi|261330771|emb|CBH13756.1| protein kinase, putative [Trypanosoma brucei gambiense DAL972]
Length = 433
Score = 44.7 bits (104), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 2/38 (5%)
Query: 323 LHVMPASPAK--FCDECGAPYLRETSKFCSECGVKRLG 358
+ PA+ K CD CGAP+ + T+KFCS+CG K+ G
Sbjct: 1 MSCTPATLGKPPVCDHCGAPFTKSTAKFCSKCGTKKKG 38
>gi|72393421|ref|XP_847511.1| protein kinase [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|62359609|gb|AAX80042.1| protein kinase, putative [Trypanosoma brucei]
gi|70803541|gb|AAZ13445.1| protein kinase, putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 433
Score = 44.7 bits (104), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 2/38 (5%)
Query: 323 LHVMPASPAK--FCDECGAPYLRETSKFCSECGVKRLG 358
+ PA+ K CD CGAP+ + T+KFCS+CG K+ G
Sbjct: 1 MSCTPATLGKPPVCDHCGAPFTKSTAKFCSKCGTKKKG 38
>gi|357625382|gb|EHJ75846.1| hypothetical protein KGM_04049 [Danaus plexippus]
Length = 1564
Score = 44.3 bits (103), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 24/29 (82%), Gaps = 1/29 (3%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
++FC ECG+ +L +T+KFC ECGVKRL I
Sbjct: 1537 SRFCHECGSRFL-DTAKFCIECGVKRLLI 1564
>gi|403298907|ref|XP_003940243.1| PREDICTED: zinc finger C2HC domain-containing protein 1A [Saimiri
boliviensis boliviensis]
Length = 324
Score = 44.3 bits (103), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 26/44 (59%)
Query: 314 LTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 279 LNGGNIKGIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 322
>gi|296226768|ref|XP_002759063.1| PREDICTED: zinc finger C2HC domain-containing protein 1A
[Callithrix jacchus]
Length = 325
Score = 44.3 bits (103), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 26/44 (59%)
Query: 314 LTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
L GG++ P + KFC ECG Y E +KFC ECG++R+
Sbjct: 280 LNGGNIKGIEGNSPGNLPKFCHECGTKYPVEWAKFCCECGIRRM 323
>gi|41054429|ref|NP_955974.1| zinc finger C2HC domain-containing protein 1A [Danio rerio]
gi|82187663|sp|Q7SXT7.1|ZC21A_DANRE RecName: Full=Zinc finger C2HC domain-containing protein 1A
gi|32766437|gb|AAH55250.1| Family with sequence similarity 164, member A [Danio rerio]
Length = 327
Score = 44.3 bits (103), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 18/28 (64%), Positives = 21/28 (75%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRLGI 359
KFC ECG Y E++KFC ECGVKR+ I
Sbjct: 300 KFCHECGTKYPVESAKFCCECGVKRMYI 327
>gi|91088293|ref|XP_968768.1| PREDICTED: similar to CG10999 CG10999-PA [Tribolium castaneum]
Length = 926
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 21/29 (72%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC ECG Y T+KFC ECGVKRL +
Sbjct: 898 SKFCHECGNKYPLTTAKFCVECGVKRLVL 926
>gi|270011804|gb|EFA08252.1| hypothetical protein TcasGA2_TC005880 [Tribolium castaneum]
Length = 973
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 21/29 (72%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC ECG Y T+KFC ECGVKRL +
Sbjct: 945 SKFCHECGNKYPLTTAKFCVECGVKRLVL 973
>gi|410987377|ref|XP_003999981.1| PREDICTED: zinc finger C2HC domain-containing protein 1A [Felis
catus]
Length = 446
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 29/52 (55%), Gaps = 8/52 (15%)
Query: 310 CLQPLTGGHV----GERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
C+ L GG++ G +P KFC ECG Y E +KFC ECG++R+
Sbjct: 397 CVSSLNGGNIKGMEGNSTGHLP----KFCHECGTKYPVEWAKFCCECGIRRM 444
>gi|170039578|ref|XP_001847607.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863125|gb|EDS26508.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1541
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 15/29 (51%), Positives = 23/29 (79%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC ECG+ ++ T+KFC ECGV+R+ +
Sbjct: 1512 SKFCHECGSRFMISTAKFCMECGVRRIML 1540
>gi|196009271|ref|XP_002114501.1| hypothetical protein TRIADDRAFT_58389 [Trichoplax adhaerens]
gi|190583520|gb|EDV23591.1| hypothetical protein TRIADDRAFT_58389 [Trichoplax adhaerens]
Length = 353
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 18/28 (64%), Positives = 20/28 (71%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRLGI 359
KFC ECG Y + +KFC ECGVKRL I
Sbjct: 324 KFCHECGTKYPVQNAKFCCECGVKRLHI 351
>gi|312383962|gb|EFR28825.1| hypothetical protein AND_02741 [Anopheles darlingi]
Length = 1722
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 58/148 (39%), Gaps = 45/148 (30%)
Query: 211 IPVLPGFPGTDNSSTMVTAQGFHTISEPQNQQQTPSKHHMQPLSHQPQFHHIHNNHHQQI 270
+PV G+PG+ T+ G H PQ+Q T ++ +N HH
Sbjct: 1616 VPVRQGYPGSR------TSTGAHGAQPPQSQDTTD--------------NNAYNAHHT-- 1653
Query: 271 HQGQQHPAHFSQNHRCGPPPHLPDIAHAHQSPTISQHMACLQPLTGGHV-GERLHVMPAS 329
P LP + + + M L P V G ++ M
Sbjct: 1654 ------------------PMSLPPLTTSFSVAKGAHKMDGLSPKDSDSVPGLIMNPM--- 1692
Query: 330 PAKFCDECGAPYLRETSKFCSECGVKRL 357
+KFC ECGA ++ ++KFC CGV+R+
Sbjct: 1693 -SKFCHECGARFMISSAKFCMSCGVRRI 1719
>gi|358332324|dbj|GAA29975.2| protein FAM164A [Clonorchis sinensis]
Length = 533
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 21/28 (75%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRLGI 359
KFC ECG+ + T+KFC ECG++RL +
Sbjct: 506 KFCYECGSQFPNATAKFCPECGIRRLAV 533
>gi|194756622|ref|XP_001960575.1| GF13426 [Drosophila ananassae]
gi|190621873|gb|EDV37397.1| GF13426 [Drosophila ananassae]
Length = 1814
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 333 FCDECGAPYLRETSKFCSECGVKRL 357
FC ECGA ++ E +KFC ECGV+R+
Sbjct: 1788 FCHECGAKFIVEQAKFCMECGVRRM 1812
>gi|327269679|ref|XP_003219620.1| PREDICTED: LOW QUALITY PROTEIN: protein FAM164A-like [Anolis
carolinensis]
Length = 321
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 17/26 (65%), Positives = 20/26 (76%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECGA Y E +KFC ECGV+R+
Sbjct: 294 KFCYECGAKYPIEAAKFCCECGVRRM 319
>gi|195122542|ref|XP_002005770.1| GI18895 [Drosophila mojavensis]
gi|193910838|gb|EDW09705.1| GI18895 [Drosophila mojavensis]
Length = 1826
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 10/57 (17%)
Query: 301 SPTISQHMACLQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+PT SQH LQ +V ++ +KFC ECGA ++ E +KFC +CGV+R+
Sbjct: 1778 TPTTSQH--SLQS-NNSNVSVKM-------SKFCHECGAKFVLEHAKFCMDCGVRRV 1824
>gi|195150771|ref|XP_002016324.1| GL10553 [Drosophila persimilis]
gi|194110171|gb|EDW32214.1| GL10553 [Drosophila persimilis]
Length = 1789
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 21/27 (77%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRL 357
+KFC ECG ++ E +KFC ECGV+R+
Sbjct: 1761 SKFCHECGGRFIIEHAKFCMECGVRRM 1787
>gi|198432697|ref|XP_002130661.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 388
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 16/34 (47%), Positives = 24/34 (70%)
Query: 324 HVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+ +P+ +KFC ECG+ Y +KFC ECG+KR+
Sbjct: 352 YRLPSPGSKFCHECGSRYPISNAKFCCECGLKRI 385
>gi|198457563|ref|XP_002138413.1| GA24409 [Drosophila pseudoobscura pseudoobscura]
gi|198136021|gb|EDY68971.1| GA24409 [Drosophila pseudoobscura pseudoobscura]
Length = 1840
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 21/27 (77%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRL 357
+KFC ECG ++ E +KFC ECGV+R+
Sbjct: 1812 SKFCHECGGRFIIEHAKFCMECGVRRM 1838
>gi|332019679|gb|EGI60153.1| UPF0418 protein FAM164A [Acromyrmex echinatior]
Length = 682
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 22/29 (75%), Gaps = 1/29 (3%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC ECG + ET+KFC ECGVKRL +
Sbjct: 655 SKFCHECGQRFP-ETAKFCCECGVKRLAL 682
>gi|345479856|ref|XP_003424043.1| PREDICTED: hypothetical protein LOC100679241 [Nasonia vitripennis]
Length = 685
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 23/29 (79%), Gaps = 1/29 (3%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC ECGA + +++KFC ECGVKRL +
Sbjct: 658 SKFCHECGAKFP-DSAKFCCECGVKRLTL 685
>gi|344250327|gb|EGW06431.1| UPF0418 protein FAM164A [Cricetulus griseus]
Length = 241
Score = 42.0 bits (97), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 214 KFCHECGTKYPVEWAKFCCECGIRRM 239
>gi|242018380|ref|XP_002429655.1| synaptonemal complex protein, putative [Pediculus humanus corporis]
gi|212514640|gb|EEB16917.1| synaptonemal complex protein, putative [Pediculus humanus corporis]
Length = 2036
Score = 41.6 bits (96), Expect = 0.53, Method: Composition-based stats.
Identities = 18/29 (62%), Positives = 21/29 (72%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC ECG+ Y +KFC ECGVKRL I
Sbjct: 2008 SKFCHECGSKYPVPVAKFCCECGVKRLII 2036
>gi|156717642|ref|NP_001096361.1| zinc finger, C2HC-type containing 1A [Xenopus (Silurana)
tropicalis]
gi|134026138|gb|AAI35866.1| LOC100124952 protein [Xenopus (Silurana) tropicalis]
Length = 363
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 25/46 (54%)
Query: 312 QPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+P+ G + + KFC ECG Y E +KFC ECG++R+
Sbjct: 316 KPVNDGRIRAADNNTSGQLTKFCHECGTKYPVEWAKFCCECGIRRM 361
>gi|351698344|gb|EHB01263.1| UPF0418 protein FAM164A, partial [Heterocephalus glaber]
Length = 319
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 314 LTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
L GG++ + KFC ECG Y E +KFC ECG++R+
Sbjct: 274 LNGGNIKGIEGNLSGHLPKFCHECGTKYPVEWAKFCCECGIRRM 317
>gi|350583090|ref|XP_001924441.3| PREDICTED: protein FAM164A-like isoform 1 [Sus scrofa]
Length = 324
Score = 41.6 bits (96), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 28/51 (54%), Gaps = 8/51 (15%)
Query: 311 LQPLTGGHV----GERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+ L GG+V G +P KFC ECG Y E +KFC ECGV+R+
Sbjct: 276 ISSLNGGNVKGIEGNSTGHLP----KFCHECGTKYPVEWAKFCCECGVRRM 322
>gi|348512232|ref|XP_003443647.1| PREDICTED: protein FAM164A-like [Oreochromis niloticus]
Length = 311
Score = 41.6 bits (96), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 21/29 (72%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC CG Y E++KFC ECGV+R+ I
Sbjct: 283 SKFCHACGTKYPVESAKFCCECGVRRMCI 311
>gi|317419798|emb|CBN81834.1| UPF0418 protein FAM164A [Dicentrarchus labrax]
Length = 332
Score = 41.6 bits (96), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 21/29 (72%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC CG Y E++KFC ECGV+R+ I
Sbjct: 304 SKFCHACGTKYPVESAKFCCECGVRRMCI 332
>gi|354483896|ref|XP_003504128.1| PREDICTED: protein FAM164A-like [Cricetulus griseus]
Length = 374
Score = 41.6 bits (96), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 347 KFCHECGTKYPVEWAKFCCECGIRRM 372
>gi|134085724|ref|NP_001076906.1| zinc finger C2HC domain-containing protein 1A [Bos taurus]
gi|205831288|sp|A4FUE7.1|ZC21A_BOVIN RecName: Full=Zinc finger C2HC domain-containing protein 1A
gi|133777455|gb|AAI14758.1| C14H8ORF70 protein [Bos taurus]
Length = 323
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECGV+R+
Sbjct: 296 KFCHECGTKYPVEWAKFCCECGVRRM 321
>gi|426235670|ref|XP_004011803.1| PREDICTED: zinc finger C2HC domain-containing protein 1A [Ovis
aries]
Length = 323
Score = 41.2 bits (95), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECGV+R+
Sbjct: 296 KFCHECGTKYPVEWAKFCCECGVRRM 321
>gi|440903738|gb|ELR54360.1| Protein FAM164A, partial [Bos grunniens mutus]
Length = 318
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 16/26 (61%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECGV+R+
Sbjct: 291 KFCHECGTKYPVEWAKFCCECGVRRM 316
>gi|432927702|ref|XP_004081028.1| PREDICTED: zinc finger C2HC domain-containing protein 1A-like
[Oryzias latipes]
Length = 312
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 21/29 (72%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC CG Y E++KFC ECGV+R+ I
Sbjct: 283 SKFCHACGTKYPVESAKFCCECGVRRMCI 311
>gi|281347411|gb|EFB22995.1| hypothetical protein PANDA_014390 [Ailuropoda melanoleuca]
Length = 293
Score = 41.2 bits (95), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 266 KFCHECGTKYPVEWAKFCCECGIRRM 291
>gi|355735868|gb|AES11813.1| hypothetical protein [Mustela putorius furo]
Length = 323
Score = 41.2 bits (95), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 296 KFCHECGTKYPVEWAKFCCECGIRRM 321
>gi|344272922|ref|XP_003408277.1| PREDICTED: protein FAM164A-like [Loxodonta africana]
Length = 429
Score = 41.2 bits (95), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 402 KFCHECGTKYPVEWAKFCCECGIRRM 427
>gi|126321250|ref|XP_001377619.1| PREDICTED: protein FAM164A-like [Monodelphis domestica]
Length = 355
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRL 357
+KFC ECG Y E +KFC ECG++R+
Sbjct: 327 SKFCHECGTRYPVEWAKFCCECGIRRM 353
>gi|27502351|ref|NP_775273.1| zinc finger C2HC domain-containing protein 1A [Mus musculus]
gi|81896367|sp|Q8BJH1.1|ZC21A_MOUSE RecName: Full=Zinc finger C2HC domain-containing protein 1A
gi|26350877|dbj|BAC39075.1| unnamed protein product [Mus musculus]
gi|56388561|gb|AAH87731.1| Fam164a protein [Mus musculus]
gi|148673260|gb|EDL05207.1| RIKEN cDNA 3110050N22 [Mus musculus]
Length = 324
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 297 KFCHECGTKYPVEWAKFCCECGIRRM 322
>gi|159109830|ref|XP_001705178.1| Hypothetical protein GL50803_8405 [Giardia lamblia ATCC 50803]
gi|157433258|gb|EDO77504.1| hypothetical protein GL50803_8405 [Giardia lamblia ATCC 50803]
Length = 445
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKR 356
+KFC ECG+ + +T+KFC ECG KR
Sbjct: 420 SKFCSECGSKFASDTAKFCMECGSKR 445
>gi|291388161|ref|XP_002710696.1| PREDICTED: hypothetical protein isoform 2 [Oryctolagus cuniculus]
Length = 324
Score = 41.2 bits (95), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 297 KFCHECGTKYPVEWAKFCCECGIRRM 322
>gi|291388159|ref|XP_002710695.1| PREDICTED: hypothetical protein isoform 1 [Oryctolagus cuniculus]
Length = 325
Score = 41.2 bits (95), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 298 KFCHECGTKYPVEWAKFCCECGIRRM 323
>gi|348588415|ref|XP_003479962.1| PREDICTED: protein FAM164A-like [Cavia porcellus]
Length = 353
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 326 KFCHECGTKYPVEWAKFCCECGIRRM 351
>gi|338728257|ref|XP_001491920.3| PREDICTED: protein FAM164A-like [Equus caballus]
Length = 341
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 314 KFCHECGTKYPVEWAKFCCECGIRRM 339
>gi|301779095|ref|XP_002924965.1| PREDICTED: protein FAM164A-like [Ailuropoda melanoleuca]
Length = 323
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 296 KFCHECGTKYPVEWAKFCCECGIRRM 321
>gi|253743217|gb|EES99687.1| Hypothetical protein GL50581_3077 [Giardia intestinalis ATCC 50581]
Length = 446
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKR 356
+KFC ECG+ + +T+KFC ECG KR
Sbjct: 421 SKFCGECGSKFASDTAKFCMECGSKR 446
>gi|432116240|gb|ELK37284.1| Protein FAM164A, partial [Myotis davidii]
Length = 323
Score = 40.8 bits (94), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 25/47 (53%)
Query: 311 LQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+ L GG+V KFC ECG Y E +KFC ECG++R+
Sbjct: 275 VSSLNGGNVKGFEGSSSGHLPKFCHECGTKYPVEWAKFCCECGIRRI 321
>gi|389576006|ref|ZP_10166034.1| putative virion core protein (lumpy skin disease virus)
[Eubacterium cellulosolvens 6]
gi|389311491|gb|EIM56424.1| putative virion core protein (lumpy skin disease virus)
[Eubacterium cellulosolvens 6]
Length = 328
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 20/29 (68%), Gaps = 1/29 (3%)
Query: 327 PASPAKFCDECGAPYLRETSKFCSECGVK 355
PA+ AKFC CG+P L +KFCS CG K
Sbjct: 299 PATGAKFCSNCGSP-LDPGAKFCSNCGTK 326
>gi|74215412|dbj|BAE41909.1| unnamed protein product [Mus musculus]
Length = 288
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 261 KFCHECGTKYPVEWAKFCCECGIRRM 286
>gi|363730718|ref|XP_001232351.2| PREDICTED: protein FAM164A [Gallus gallus]
Length = 341
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRL 357
+KFC ECG Y E +KFC ECG++R+
Sbjct: 313 SKFCHECGTRYPVEWAKFCCECGIRRM 339
>gi|342182965|emb|CCC92445.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 442
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 20/32 (62%)
Query: 327 PASPAKFCDECGAPYLRETSKFCSECGVKRLG 358
P+ C CGA + R T+KFCS+CG K+ G
Sbjct: 7 PSGKLPLCGHCGASFTRATAKFCSKCGTKKKG 38
>gi|449275305|gb|EMC84178.1| UPF0418 protein FAM164A, partial [Columba livia]
Length = 318
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRL 357
+KFC ECG Y E +KFC ECG++R+
Sbjct: 290 SKFCHECGTRYPVEWAKFCCECGIRRM 316
>gi|395510972|ref|XP_003759739.1| PREDICTED: zinc finger C2HC domain-containing protein 1A
[Sarcophilus harrisii]
Length = 323
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y E +KFC ECG++R+
Sbjct: 296 KFCHECGTRYPVEWAKFCCECGIRRM 321
>gi|308160090|gb|EFO62597.1| Hypothetical protein GLP15_2434 [Giardia lamblia P15]
Length = 442
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKR 356
+KFC ECG+ + +T+KFC ECG KR
Sbjct: 417 SKFCSECGSRFASDTAKFCMECGSKR 442
>gi|195441973|ref|XP_002068735.1| GK17934 [Drosophila willistoni]
gi|194164820|gb|EDW79721.1| GK17934 [Drosophila willistoni]
Length = 1884
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 333 FCDECGAPYLRETSKFCSECGVKRL 357
FC ECG ++ E +KFC ECGV+R+
Sbjct: 1858 FCHECGGKFVLEHAKFCMECGVRRV 1882
>gi|195024072|ref|XP_001985805.1| GH20883 [Drosophila grimshawi]
gi|193901805|gb|EDW00672.1| GH20883 [Drosophila grimshawi]
Length = 1822
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 14/27 (51%), Positives = 21/27 (77%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRL 357
+KFC ECG ++ E +KFC +CGV+R+
Sbjct: 1794 SKFCHECGGKFVIEHAKFCMDCGVRRV 1820
>gi|195381515|ref|XP_002049494.1| GJ20718 [Drosophila virilis]
gi|194144291|gb|EDW60687.1| GJ20718 [Drosophila virilis]
Length = 1933
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 14/25 (56%), Positives = 20/25 (80%)
Query: 333 FCDECGAPYLRETSKFCSECGVKRL 357
FC ECGA ++ E +KFC +CGV+R+
Sbjct: 1907 FCHECGAKFVIEHAKFCMDCGVRRV 1931
>gi|358333427|dbj|GAA51942.1| protein FAM164A [Clonorchis sinensis]
Length = 817
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 18/24 (75%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECGA Y+ ET+KFCS CG R
Sbjct: 752 YCYECGASYVAETAKFCSHCGAVR 775
>gi|328789105|ref|XP_001120544.2| PREDICTED: hypothetical protein LOC724652 [Apis mellifera]
Length = 685
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 323 LHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+HV+ +K+C +CG + ET+KFC ECG++RL
Sbjct: 650 IHVVKCKMSKYCHQCGFKFP-ETAKFCCECGIRRL 683
>gi|380013924|ref|XP_003690994.1| PREDICTED: uncharacterized protein LOC100864583 [Apis florea]
Length = 540
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 24/35 (68%), Gaps = 1/35 (2%)
Query: 323 LHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+HV +K+C +CG + ET+KFC ECG++RL
Sbjct: 505 IHVTKCKMSKYCHQCGFKFP-ETAKFCCECGIRRL 538
>gi|407411278|gb|EKF33418.1| hypothetical protein MOQ_002715 [Trypanosoma cruzi marinkellei]
Length = 746
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 55/130 (42%), Gaps = 34/130 (26%)
Query: 236 SEPQNQQQTPSKHHMQPLSHQPQFHHIHNNHHQQIHQGQQHPAHFSQNHRCGPPPHLPDI 295
+ PQ+ +T S +S QP +QI +G++ PA FS H+ +
Sbjct: 643 TSPQHSSETASS-----ISKQPSL--------RQIPRGRKSPASFS---------HITPL 680
Query: 296 AHAHQSPTISQHMACLQPLTG---GHVGERLHVMPASPAK---FCDECGAPYLRETSKFC 349
P + + + PL G V E P SP + FC ECG +L E KFC
Sbjct: 681 ERESYRP-LGKDLPIDTPLRAVRRGPVAE-----PYSPGRLVPFCTECGKKHLSEKVKFC 734
Query: 350 SECGVKRLGI 359
+ CG KR +
Sbjct: 735 AFCGHKREAV 744
>gi|407425845|gb|EKF39526.1| protein kinase, putative,serine/threonine-protein kinase, putative
[Trypanosoma cruzi marinkellei]
Length = 444
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 14/26 (53%), Positives = 18/26 (69%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKR 356
A C CGAP+ + T+KFCS CG K+
Sbjct: 15 AMVCGHCGAPFSKATAKFCSRCGTKK 40
>gi|322795285|gb|EFZ18090.1| hypothetical protein SINV_04874 [Solenopsis invicta]
Length = 321
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 21/29 (72%), Gaps = 1/29 (3%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRLGI 359
+KFC ECG + ET+KFC ECG+KRL
Sbjct: 294 SKFCHECGQQFP-ETAKFCCECGMKRLAF 321
>gi|157817901|ref|NP_001101131.1| zinc finger, C2HC-type containing 1A [Rattus norvegicus]
gi|149048487|gb|EDM01028.1| similar to RIKEN cDNA 3110050N22 (predicted) [Rattus norvegicus]
gi|197246614|gb|AAI68976.1| Family with sequence similarity 164, member A [Rattus norvegicus]
Length = 324
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRL 357
KFC ECG Y + +KFC ECG++R+
Sbjct: 297 KFCHECGTKYPVDWAKFCCECGIRRM 322
>gi|345793151|ref|XP_860590.2| PREDICTED: protein FAM164A isoform 2 [Canis lupus familiaris]
Length = 325
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 8/51 (15%)
Query: 311 LQPLTGGHV----GERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+ L GG++ G +P KFC ECG Y E +K+C ECG++R+
Sbjct: 277 ISSLNGGNIKGTEGNSTGHLP----KFCHECGTKYPVEWAKYCCECGIRRM 323
>gi|383859373|ref|XP_003705169.1| PREDICTED: uncharacterized protein LOC100879304 [Megachile
rotundata]
Length = 673
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 22/27 (81%), Gaps = 1/27 (3%)
Query: 331 AKFCDECGAPYLRETSKFCSECGVKRL 357
+K+C ECG+ + ET+KFC ECGV+RL
Sbjct: 646 SKYCHECGSRFP-ETAKFCCECGVRRL 671
>gi|308810453|ref|XP_003082535.1| COG0149: Triosephosphate isomerase (ISS) [Ostreococcus tauri]
gi|116061004|emb|CAL56392.1| COG0149: Triosephosphate isomerase (ISS) [Ostreococcus tauri]
Length = 685
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 12/29 (41%), Positives = 21/29 (72%)
Query: 328 ASPAKFCDECGAPYLRETSKFCSECGVKR 356
A+ +KFC +CG + +T++FC+ CG +R
Sbjct: 656 AAKSKFCSDCGVAFASKTARFCANCGARR 684
>gi|45552691|ref|NP_995870.1| CG30460, isoform D [Drosophila melanogaster]
gi|45445510|gb|AAS64828.1| CG30460, isoform D [Drosophila melanogaster]
Length = 1402
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1376 YCYECGSKFIFETAKFCMDCGLRR 1399
>gi|260786781|ref|XP_002588435.1| hypothetical protein BRAFLDRAFT_63383 [Branchiostoma floridae]
gi|229273597|gb|EEN44446.1| hypothetical protein BRAFLDRAFT_63383 [Branchiostoma floridae]
Length = 2054
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 15/42 (35%), Positives = 24/42 (57%)
Query: 318 HVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRLGI 359
+G ++ + +KFC ECG Y +KFC ECG +R+ +
Sbjct: 2012 RLGSTENIAGKTASKFCHECGTRYPIPNAKFCCECGTRRIWL 2053
>gi|320544074|ref|NP_001188962.1| CG30460, isoform I [Drosophila melanogaster]
gi|318068632|gb|ADV37208.1| CG30460, isoform I [Drosophila melanogaster]
Length = 888
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 862 YCYECGSKFIFETAKFCMDCGLRR 885
>gi|256818833|gb|ACV31086.1| AT04745p [Drosophila melanogaster]
Length = 891
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 865 YCYECGSKFIFETAKFCMDCGLRR 888
>gi|45552693|ref|NP_995871.1| CG30460, isoform A [Drosophila melanogaster]
gi|45445509|gb|AAS64827.1| CG30460, isoform A [Drosophila melanogaster]
Length = 1520
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1494 YCYECGSKFIFETAKFCMDCGLRR 1517
>gi|303276893|ref|XP_003057740.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460397|gb|EEH57691.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1549
Score = 39.3 bits (90), Expect = 3.2, Method: Composition-based stats.
Identities = 14/24 (58%), Positives = 15/24 (62%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
FC +CGA Y R KFC ECG R
Sbjct: 1526 FCSDCGAAYARAGQKFCMECGAPR 1549
>gi|45550446|ref|NP_611186.2| CG30460, isoform C [Drosophila melanogaster]
gi|45445507|gb|AAF57906.3| CG30460, isoform C [Drosophila melanogaster]
Length = 1868
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1842 YCYECGSKFIFETAKFCMDCGLRR 1865
>gi|33636665|gb|AAQ23630.1| AT29286p [Drosophila melanogaster]
Length = 1482
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1456 YCYECGSKFIFETAKFCMDCGLRR 1479
>gi|431891864|gb|ELK02398.1| UPF0418 protein FAM164A, partial [Pteropus alecto]
Length = 322
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 25/47 (53%)
Query: 311 LQPLTGGHVGERLHVMPASPAKFCDECGAPYLRETSKFCSECGVKRL 357
+ L GG+V KFC ECG + + +KFC ECGV+R+
Sbjct: 274 VSALNGGNVKGLEGSSSGHLPKFCHECGTKFPVDWAKFCCECGVRRM 320
>gi|221330344|ref|NP_001137685.1| CG30460, isoform F [Drosophila melanogaster]
gi|220902255|gb|ACL83139.1| CG30460, isoform F [Drosophila melanogaster]
Length = 1799
Score = 39.3 bits (90), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1773 YCYECGSKFIFETAKFCMDCGLRR 1796
>gi|221330346|ref|NP_001137686.1| CG30460, isoform G [Drosophila melanogaster]
gi|220902256|gb|ACL83140.1| CG30460, isoform G [Drosophila melanogaster]
Length = 1480
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1454 YCYECGSKFIFETAKFCMDCGLRR 1477
>gi|195584122|ref|XP_002081864.1| GD25497 [Drosophila simulans]
gi|194193873|gb|EDX07449.1| GD25497 [Drosophila simulans]
Length = 1807
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1781 YCYECGSKFIFETAKFCMDCGLRR 1804
>gi|45552695|ref|NP_995872.1| CG30460, isoform B [Drosophila melanogaster]
gi|21428802|gb|AAM50120.1| GH03957p [Drosophila melanogaster]
gi|45445508|gb|AAS64826.1| CG30460, isoform B [Drosophila melanogaster]
Length = 1132
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1106 YCYECGSKFIFETAKFCMDCGLRR 1129
>gi|194882323|ref|XP_001975261.1| GG22220 [Drosophila erecta]
gi|190658448|gb|EDV55661.1| GG22220 [Drosophila erecta]
Length = 1805
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1779 YCYECGSKFIFETAKFCMDCGLRR 1802
>gi|195335133|ref|XP_002034230.1| GM20008 [Drosophila sechellia]
gi|194126200|gb|EDW48243.1| GM20008 [Drosophila sechellia]
Length = 1781
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1755 YCYECGSKFIFETAKFCMDCGLRR 1778
>gi|161077190|ref|NP_001097355.1| CG30460, isoform E [Drosophila melanogaster]
gi|157400381|gb|ABV53833.1| CG30460, isoform E [Drosophila melanogaster]
Length = 1711
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1685 YCYECGSKFIFETAKFCMDCGLRR 1708
>gi|281363540|ref|NP_001163172.1| CG30460, isoform H [Drosophila melanogaster]
gi|21428970|gb|AAM50204.1| GH27233p [Drosophila melanogaster]
gi|272432519|gb|ACZ94444.1| CG30460, isoform H [Drosophila melanogaster]
Length = 1006
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 980 YCYECGSKFIFETAKFCMDCGLRR 1003
>gi|195488835|ref|XP_002092481.1| GE14217 [Drosophila yakuba]
gi|194178582|gb|EDW92193.1| GE14217 [Drosophila yakuba]
Length = 1810
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
+C ECG+ ++ ET+KFC +CG++R
Sbjct: 1784 YCYECGSKFIFETAKFCMDCGLRR 1807
>gi|407860692|gb|EKG07445.1| protein kinase, putative,serine/threonine-protein kinase, putative
[Trypanosoma cruzi]
Length = 444
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 334 CDECGAPYLRETSKFCSECGVKR 356
C CGAP+ + T+KFCS CG K+
Sbjct: 18 CGHCGAPFSKVTAKFCSRCGTKK 40
>gi|71651650|ref|XP_814498.1| protein kinase [Trypanosoma cruzi strain CL Brener]
gi|70879476|gb|EAN92647.1| protein kinase, putative [Trypanosoma cruzi]
Length = 444
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 334 CDECGAPYLRETSKFCSECGVKR 356
C CGAP+ + T+KFCS CG K+
Sbjct: 18 CGHCGAPFSKVTAKFCSRCGTKK 40
>gi|71406074|ref|XP_805605.1| protein kinase [Trypanosoma cruzi strain CL Brener]
gi|70869076|gb|EAN83754.1| protein kinase, putative [Trypanosoma cruzi]
Length = 444
Score = 38.1 bits (87), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 333 FCDECGAPYLRETSKFCSECGVKR 356
C CGAP+ + T+KFCS CG K+
Sbjct: 17 VCGHCGAPFSKVTAKFCSRCGTKK 40
>gi|345321349|ref|XP_001514301.2| PREDICTED: protein FAM164A-like, partial [Ornithorhynchus anatinus]
Length = 369
Score = 37.7 bits (86), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 332 KFCDECGAPYLRETSKFCSECGVKRLGI 359
KFC ECG + + +KFC ECGV+R+ +
Sbjct: 342 KFCHECGTRHPVDWAKFCCECGVRRMAL 369
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,902,496,692
Number of Sequences: 23463169
Number of extensions: 256348670
Number of successful extensions: 950700
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 554
Number of HSP's successfully gapped in prelim test: 2143
Number of HSP's that attempted gapping in prelim test: 911119
Number of HSP's gapped (non-prelim): 22662
length of query: 359
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 216
effective length of database: 9,003,962,200
effective search space: 1944855835200
effective search space used: 1944855835200
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)