BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011479
(484 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255585261|ref|XP_002533331.1| conserved hypothetical protein [Ricinus communis]
gi|223526836|gb|EEF29052.1| conserved hypothetical protein [Ricinus communis]
Length = 502
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 241/543 (44%), Positives = 294/543 (54%), Gaps = 102/543 (18%)
Query: 1 MATQPPAPRQWFRLPSIGRPAPPVTPTPEPPAP------QPRPPMIRAPFRPMGQPPLES 54
MA QP R W RL SI RPAPP + P PPA QPR + R RP+ QP L +
Sbjct: 1 MANQPSQSRPWSRLYSIARPAPPES-IPTPPAAPESAPGQPRAVLARPSLRPVTQPTLPT 59
Query: 55 IP-----------------------------APA----AVGVSSVPTSPIRRTSVPSSPV 81
P +PA GV+SVP+SP+ R SVP++ +
Sbjct: 60 RPQEPTQAPQPPVATVPPPVAAPPPATPTQRSPAKPSGGGGVASVPSSPVLRASVPTASL 119
Query: 82 PRESPSPPSPPATTSP----------PKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPK 131
P+ SP P P+P+VS+P TSS+P+SP K
Sbjct: 120 PK---SPVQTATAVGPVTTSSSVQSSPRPRVSAP-------TSSLPSSPVL--------K 161
Query: 132 PLATTFSVPTSPA-KPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPK 190
P TFSVPTSPA KP PTT + T+ P T KP PT V T+ K
Sbjct: 162 P-TQTFSVPTSPATKPMPTTVSLPTS------PAT---------KPMPTTVSVPTSPATK 205
Query: 191 PAPTTFRVPTSP-PKPAPSTFSVPTSPAKPKPAVTAAISTTTSPPR-VPSPAPVGPRTLL 248
P PTT VPTSP PKPAP+T SVPTSPA + T ++T P V P + P
Sbjct: 206 PMPTTISVPTSPLPKPAPATSSVPTSPANRAVSTTTTATSTARVPSPVQYPRTINPVVQ- 264
Query: 249 SDGTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQPTIPAETEQKTVLVQTRSEKPKSWF 308
+PP+SP TAPPPSPL LPP + S AET+ IP EQKTV+VQ +KPK W
Sbjct: 265 ---SPPRSPKLNPTAPPPSPLILPPAKIKSDAETEAKIPLVAEQKTVVVQKTIDKPKPWL 321
Query: 309 NGTGNHFDETHKP--SSYAHRGNHEVHKEVAK-NGKVHRKQFSSDSDEVGMRVITIAGEN 365
GN + SS G E K+ + KV K+ S S++ GMRVIT+AGEN
Sbjct: 322 TNGGNSERDLVNAVKSSIGLNGKQETTKDHGETKEKVLHKKIFSGSEDGGMRVITVAGEN 381
Query: 366 KGAFMEVIRSP-LKHTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKSK-----KDK 419
KGAFMEVIRSP KH F+G+PH LH K + +SDG W S+SS+ + + KDK
Sbjct: 382 KGAFMEVIRSPNKKHVFEGSPHHLHKKDN-DPKSDGIKWISYSSNGSSSEGEGKPKMKDK 440
Query: 420 NYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLSRKPSGNGFHVKDKG 479
N+K K M PM AFMNSNVQGVNNSIV++SSC+HHDPGVHL+LSRK +G GFHVKD+
Sbjct: 441 NHKGKG-MNSLPMTAFMNSNVQGVNNSIVYNSSCTHHDPGVHLALSRKSAGGGFHVKDRS 499
Query: 480 NGH 482
NGH
Sbjct: 500 NGH 502
>gi|224127999|ref|XP_002320216.1| predicted protein [Populus trichocarpa]
gi|222860989|gb|EEE98531.1| predicted protein [Populus trichocarpa]
Length = 447
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 231/513 (45%), Positives = 279/513 (54%), Gaps = 95/513 (18%)
Query: 1 MATQPPAPRQWFRLPSIGRPAPPVTPTPEPPAPQPRPPMIRAP-FRPMGQPPLESIPAPA 59
M+ QP PR WFRLPSI RPA P P P + P FR L ++P P
Sbjct: 1 MSNQPAPPRPWFRLPSIARPAAPAPTPTPESPPPQPRPALARPSFR------LTALPQPV 54
Query: 60 --------------------AVGVSSVPTSPIRRT-----SVPSSPVPRESPSPPSPPAT 94
A G++SVPTSP + S+P+SP PR
Sbjct: 55 PTQPQEPTPAPPPSAATVPPAAGIASVPTSPAVKAAGGVASLPTSPAPR----------- 103
Query: 95 TSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPTSPAKPAPTTFPVI 154
+P+P+ SS+PTSP P T PP TTFS+P P+PT P
Sbjct: 104 ---------APAPS-----SSVPTSP-VPTSTVFPP----TTFSLP-----PSPTLKPSR 139
Query: 155 TTS--PPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTSPPKPAPSTFSV 212
T+S P PA T P + S +P+ PV TTS +P T SV
Sbjct: 140 TSSSVPTSPASTPSPSASVST---SPSTRPVSTTSSAPSSPAPKPATV--------TSSV 188
Query: 213 PTSPAKPKPAVTAAISTTTSPPRVPSPAPVGPRTLLSDGTPPQSPYKKATAPPPSPLTLP 272
P SPA T A+ TTS RVP P P + TPPQSP K TAPPPSPLT P
Sbjct: 189 PNSPA------TKAV--TTSAARVPGPTPSLRIIKPTVQTPPQSPKPKPTAPPPSPLTRP 240
Query: 273 PPQFTSKAETQPTIPAETEQKTVLVQTRSEKPKSWFNGTGNHFDETHKPSSYAHRGNHEV 332
P + S A+ +P IP EQKTVLVQ +KPK + D S A E
Sbjct: 241 PSRVKSDADLEPKIPLVAEQKTVLVQKIIDKPKEAGDSLRAFADSLS--SGIARLAKPET 298
Query: 333 HKEVAKNGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKG 392
K+ K K K+ SSDS++VGMRVITIAGENKGAFMEVIRSP KH F+G HTL+ KG
Sbjct: 299 AKDQTKE-KGSGKKISSDSEDVGMRVITIAGENKGAFMEVIRSPKKHFFEGNSHTLNKKG 357
Query: 393 SLKTQSDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSS 452
+ + S+G+ W S SSS +E SKKDKN+K ++ M P PM AFMNSNVQGVNNSIV++SS
Sbjct: 358 NPR--SEGSDWGSQSSSGEEGNSKKDKNHKGRS-MGPSPMSAFMNSNVQGVNNSIVYNSS 414
Query: 453 CSHHDPGVHLSLSRKPSGN-GFHVKDKGNGHHS 484
CSHHDPGVH++LSRKPSG+ GFHVKD+GNG+ S
Sbjct: 415 CSHHDPGVHVALSRKPSGSAGFHVKDRGNGYQS 447
>gi|224068030|ref|XP_002302653.1| predicted protein [Populus trichocarpa]
gi|222844379|gb|EEE81926.1| predicted protein [Populus trichocarpa]
Length = 454
Score = 234 bits (597), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 223/502 (44%), Positives = 270/502 (53%), Gaps = 66/502 (13%)
Query: 1 MATQPPAPRQWFRLPSIGRPAPPVTPTPEPPAPQPRPPMIRAP-FRPMG--QPPLESIPA 57
M+ QP R WFRLPSI RP P T P P P + P FRP QP
Sbjct: 1 MSNQPAPARPWFRLPSIARPTAPTTTPTPEPPPPQPRPALARPAFRPTAPPQP------- 53
Query: 58 PAAVGVSSVPTSPIRRTSVPSSPVPRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMP 117
VPT P RE+ +PPA T P V++P+P +P S +
Sbjct: 54 --------VPTQP------------RET----TPPAATGPQ--AVAAPTPEPVPPVSGVA 87
Query: 118 TSPPKPAVTTSPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPA 177
+ P P + SVPTSPA AP + TSP P P T S P
Sbjct: 88 SVPTSPVA-----RAAVGVASVPTSPASRAPAPSSSVPTSP-VPTSAVLPLTTSSLPPSP 141
Query: 178 PTVFPVITTSPP-----KPAPTTFRVPTSP-PKPAPSTFSVPTSPAKPKPAVTAAI---- 227
++S P KP TT VPTSP PKP +T SVPTS A VT+++
Sbjct: 142 TPKPAPTSSSVPTSPATKPMTTTSSVPTSPAPKPVTTTSSVPTSYASKSVTVTSSVQNSP 201
Query: 228 ---STTTSPPRVPSPAPVGPRTLL-SDGTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQ 283
+ TT+ RVPSP P RT+ + TPPQSP + TAPPPSPLTLPP Q S A+ +
Sbjct: 202 ATKAVTTNATRVPSPGP-SLRTIKPAVQTPPQSPKPRPTAPPPSPLTLPPSQVKSYADLE 260
Query: 284 PTIPAETEQKTVLVQTRSEKPKSWFNGTGNHFDETHKPSSYAHRGNHEVHKEVAKNGKVH 343
P IP EQKTVLVQ +KPK + + D S A E K+V K +
Sbjct: 261 PKIPLVAEQKTVLVQKTFDKPKVARDSQRDFADSLS--SGIARLAKQETTKDVQTKEKGN 318
Query: 344 RKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGW 403
RK+ SSDS++ GM+VITIAGENKGAFMEVIRSP K F+GT H LH KG+ K++
Sbjct: 319 RKKNSSDSEDGGMKVITIAGENKGAFMEVIRSPKKQVFEGTSHFLHKKGNPKSEG----- 373
Query: 404 HSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLS 463
SSS + KKDKN KA M P P+ +FMNSNVQGVNNSIV++SSCSHHDPGVH++
Sbjct: 374 SDSSSSEEGNSKKKDKNPNGKA-MGPSPLSSFMNSNVQGVNNSIVYNSSCSHHDPGVHVA 432
Query: 464 LSRKPSG-NGFHVKDKGNGHHS 484
LSRKP+G FHVKD GN + S
Sbjct: 433 LSRKPAGAAAFHVKDHGNDNQS 454
>gi|357512591|ref|XP_003626584.1| hypothetical protein MTR_7g117980 [Medicago truncatula]
gi|355501599|gb|AES82802.1| hypothetical protein MTR_7g117980 [Medicago truncatula]
Length = 371
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 118/264 (44%), Positives = 157/264 (59%), Gaps = 21/264 (7%)
Query: 229 TTTSPPRVPSPAPVGPRTLLSD-GTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQPTIP 287
TT + RVP P P+T+ + +P SP ++T PPSPLTLPP QF ++ E IP
Sbjct: 115 TTANTSRVPIPTQ-SPKTIKQNVRSPMHSPNARSTNSPPSPLTLPPSQFKTREEHSNKIP 173
Query: 288 AETEQKTVLVQTRSEKPKSWFNGTG-------NHFD-ETHKPS-SYAHRGNHEVHKEVAK 338
E E K VLVQ + PK W N G NH + E H+ + ++++ +H H V
Sbjct: 174 VEAEPKAVLVQKTVDVPKPWHNDKGELHRENQNHSNGEFHRQTQNHSNSSHHGKHITVKG 233
Query: 339 NGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGS-LKTQ 397
K+FS DS++ GMRVITIAGEN+GA+ME+++S KH P+ LH KG+ +K
Sbjct: 234 RESSKEKKFS-DSEDSGMRVITIAGENRGAYMELVQSQKKHQ----PNYLHKKGNTIKVD 288
Query: 398 SDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHD 457
G S SSS+DE K K PM A+MNSNVQ VNNS+++H+SCSHHD
Sbjct: 289 ----GGESESSSADEGKINKKDKNNKGRTKSSIPMAAYMNSNVQCVNNSLLYHASCSHHD 344
Query: 458 PGVHLSLSRKPSGNGFHVKDKGNG 481
PGV LSLS+KP G G+HVK+ +G
Sbjct: 345 PGVRLSLSKKPFGEGYHVKENVDG 368
>gi|297824745|ref|XP_002880255.1| hypothetical protein ARALYDRAFT_904132 [Arabidopsis lyrata subsp.
lyrata]
gi|297326094|gb|EFH56514.1| hypothetical protein ARALYDRAFT_904132 [Arabidopsis lyrata subsp.
lyrata]
Length = 385
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/143 (52%), Positives = 92/143 (64%), Gaps = 4/143 (2%)
Query: 341 KVHRKQFSSDSDEV-GMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSD 399
K+HR+ +SDS+ + RVITIAGENKGA ME++RSP + G+ T ++ S T
Sbjct: 238 KMHRQPSTSDSENIMSTRVITIAGENKGAVMEILRSPQGNKTGGSG-THSSRVSHGTGEK 296
Query: 400 GTGWHSFSSSSDEEKSKKDKNYKA-KAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDP 458
G S SSSS +E K K K PM AFMNSNVQ +NNSIV++S+ SHHDP
Sbjct: 297 GRRLQSSSSSSSDEGEGKKKTTKNPNNGNSNLPMKAFMNSNVQMINNSIVYNSTASHHDP 356
Query: 459 GVHLSLSRKP-SGNGFHVKDKGN 480
GVHL +SRKP S NGFHV D GN
Sbjct: 357 GVHLKISRKPGSDNGFHVSDYGN 379
>gi|147864007|emb|CAN78958.1| hypothetical protein VITISV_008152 [Vitis vinifera]
Length = 277
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 85/223 (38%), Positives = 121/223 (54%), Gaps = 40/223 (17%)
Query: 263 APPPSPLTLPPPQFTSKAETQPTIPAETEQKTVLVQTRSEKPKSWFNGTGNHFDETHKPS 322
+PPPS LTLPP + +K+ +P EQK + +KP F + +T KPS
Sbjct: 94 SPPPSALTLPPAR--AKSTHEPI-----EQKIL------QKPNPPFKLSNGVLGDTAKPS 140
Query: 323 SYAHRGNHEVHKEV-AKNGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTF 381
+ E+ K++ +K+ K + K+ +DS++ M VIT+AG+NKGA ME
Sbjct: 141 ISPNYEKRELTKDMESKDHKGNHKK--ADSEDASMEVITVAGDNKGAIME---------- 188
Query: 382 DGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQ 441
L S K SD +SS+ EEKSKK + K PPM AF+NSNVQ
Sbjct: 189 ------LSLASSNKLGSD-------TSSNYEEKSKKMDKTENKMGTPSPPMSAFVNSNVQ 235
Query: 442 GVNNSIVFHSSCSHHDPGVHLSLSRKPSGNGFHVKDKGNGHHS 484
+NNSI+F++S +HHDPGVH+ S G G H+KD G+G ++
Sbjct: 236 AINNSILFNTSLTHHDPGVHVFFS-GGGGRGLHLKDSGDGQNN 277
>gi|315653992|ref|ZP_07906908.1| conserved hypothetical protein [Lactobacillus iners ATCC 55195]
gi|315488688|gb|EFU78334.1| conserved hypothetical protein [Lactobacillus iners ATCC 55195]
Length = 2148
Score = 110 bits (274), Expect = 2e-21, Method: Composition-based stats.
Identities = 83/272 (30%), Positives = 121/272 (44%), Gaps = 32/272 (11%)
Query: 20 PAPPVTPT-PEPPAPQPRPPMIRAPFRPMG--QPPLESIP----APAAVGVSSVPTSPI- 71
P P TP+ PE P +P M P P+ +P + S+P P + S P P+
Sbjct: 1815 PVKPKTPSEPEVPVTPEKPKMPSVPEVPVTPEKPKMPSVPEVTVKPETPKIPSKPEVPVT 1874
Query: 72 -RRTSVPSSP-VPRESPSPPSPPATTSPPKPQVSS----------PSPTKLPSTSSMPTS 119
+ +PS P VP + +P P + P KP +S PS + P T P +
Sbjct: 1875 PEKPKIPSEPEVPVKPNTPSEPEVSVKPVKPNTTSVPEVSVKPKTPSEPETPVTPVKPKT 1934
Query: 120 PPKPAVTTSPPKP-LATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAP 178
P +P + +P KP + V P KP T+ P ++ P KP + P + +P KP
Sbjct: 1935 PSEPETSETPVKPKTPSAPEVSVKPVKPNTTSVPEVSVKPVKPKTPSVPEVPVTPVKPKT 1994
Query: 179 TVFPVITTSPPKP-APTTFRVPTSPPKP-APSTFSVPTSPAKPKPAVTAAISTTTSPPRV 236
P ++ +P KP P+ VP +P KP PS VP +P KP+ + T P+
Sbjct: 1995 PSAPEVSLTPEKPKTPSVPEVPVTPEKPKTPSVPEVPVTPVKPQTPSVPEVPVTPEKPKT 2054
Query: 237 P-----SPAPVGPRTLLSDGTPPQSPYKKATA 263
P S PV P+T + P+ P KKA A
Sbjct: 2055 PSVPEVSVTPVKPKT----PSVPEVPAKKANA 2082
Score = 95.1 bits (235), Expect = 8e-17, Method: Composition-based stats.
Identities = 89/288 (30%), Positives = 120/288 (41%), Gaps = 34/288 (11%)
Query: 23 PVTPTPEPPAPQPRPPMIRAPFRPMGQPPLESIPAPAAVGVSSVPTSPIRRTSVPSSPVP 82
PVTP +P P+ +P + S+P V P P + SVP V
Sbjct: 1812 PVTPVKPKTPSEPEVPVTPE------KPKMPSVPE-----VPVTPEKP-KMPSVPEVTVK 1859
Query: 83 RESPSPPSPPAT-TSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPT 141
E+P PS P +P KP++ S P P +P +P V+ P KP T SVP
Sbjct: 1860 PETPKIPSKPEVPVTPEKPKIPSE-----PEVPVKPNTPSEPEVSVKPVKP--NTTSVPE 1912
Query: 142 SPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTF-RVPT 200
KP + P +P KP + P + +P KP P ++ P KP T+ V
Sbjct: 1913 VSVKPKTPSEPETPVTPVKPKTPSEPETSETPVKPKTPSAPEVSVKPVKPNTTSVPEVSV 1972
Query: 201 SPPKP-APSTFSVPTSPAKPKPAVTAAISTTTSPPRVPSPAPVGPRTLLSDGTPPQSPYK 259
P KP PS VP +P KPK +S T P+ PS P P T P+ P
Sbjct: 1973 KPVKPKTPSVPEVPVTPVKPKTPSAPEVSLTPEKPKTPS-VPEVPVT-------PEKP-- 2022
Query: 260 KATAPPPSPLTLPPPQFTSKAETQPTIPAETEQKTV-LVQTRSEKPKS 306
K + P P+T PQ S E P P + + +V V KPK+
Sbjct: 2023 KTPSVPEVPVTPVKPQTPSVPEV-PVTPEKPKTPSVPEVSVTPVKPKT 2069
Score = 87.4 bits (215), Expect = 1e-14, Method: Composition-based stats.
Identities = 76/278 (27%), Positives = 112/278 (40%), Gaps = 54/278 (19%)
Query: 77 PSSPVPRESPSPPSPPAT-------------TSPPKPQVSS-------PSPTKLPSTSSM 116
P +PV ++PS P P T +P KP++ S P K+PS +
Sbjct: 1812 PVTPVKPKTPSEPEVPVTPEKPKMPSVPEVPVTPEKPKMPSVPEVTVKPETPKIPSKPEV 1871
Query: 117 PTSPPKPAVTTSPPKPLA----------------TTFSVPTSPAKPAPTTFPVITTSPPK 160
P +P KP + + P P+ T SVP KP + P +P K
Sbjct: 1872 PVTPEKPKIPSEPEVPVKPNTPSEPEVSVKPVKPNTTSVPEVSVKPKTPSEPETPVTPVK 1931
Query: 161 PAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTF-RVPTSPPKP-APSTFSVPTSPAK 218
P + P + +P KP P ++ P KP T+ V P KP PS VP +P K
Sbjct: 1932 PKTPSEPETSETPVKPKTPSAPEVSVKPVKPNTTSVPEVSVKPVKPKTPSVPEVPVTPVK 1991
Query: 219 PKPAVTAAISTTTSPPRVPS--PAPVGPRTLLSDGTP--PQSPYKKAT-APPPSPLTLPP 273
PK +S T P+ PS PV P + P P +P K T + P P+T
Sbjct: 1992 PKTPSAPEVSLTPEKPKTPSVPEVPVTPEKPKTPSVPEVPVTPVKPQTPSVPEVPVTPEK 2051
Query: 274 PQFTS-----------KAETQPTIPAETEQKTVLVQTR 300
P+ S K + P +PA+ + +V+++
Sbjct: 2052 PKTPSVPEVSVTPVKPKTPSVPEVPAKKANASTIVKSQ 2089
Score = 65.5 bits (158), Expect = 6e-08, Method: Composition-based stats.
Identities = 60/221 (27%), Positives = 86/221 (38%), Gaps = 44/221 (19%)
Query: 112 STSSMPTSPPKPAVTTSPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITR 171
S S P +P KP + P VP +P KP + P + +P KP + P +T
Sbjct: 1807 SISVKPVTPVKPKTPSEP--------EVPVTPEKPKMPSVPEVPVTPEKPKMPSVPEVTV 1858
Query: 172 SPPKPAPTVFPVITTSPPKPA-PTTFRVPTSPPKPA----------PSTFSVPTSPAKPK 220
P P P + +P KP P+ VP P P+ P+T SVP KPK
Sbjct: 1859 KPETPKIPSKPEVPVTPEKPKIPSEPEVPVKPNTPSEPEVSVKPVKPNTTSVPEVSVKPK 1918
Query: 221 PAVTAAISTTTSPPRVP-----SPAPVGPRTLLSDGTPPQSPYKKATAPPPSPLTLPPPQ 275
T P+ P S PV P+T P +P + + P P T P+
Sbjct: 1919 TPSEPETPVTPVKPKTPSEPETSETPVKPKT-------PSAP--EVSVKPVKPNTTSVPE 1969
Query: 276 FTSKAETQPTIPAETEQKTVLVQTRS----------EKPKS 306
+ K +P P+ E V+ ++ EKPK+
Sbjct: 1970 VSVKP-VKPKTPSVPEVPVTPVKPKTPSAPEVSLTPEKPKT 2009
>gi|357514507|ref|XP_003627542.1| hypothetical protein MTR_8g027290 [Medicago truncatula]
gi|355521564|gb|AET02018.1| hypothetical protein MTR_8g027290 [Medicago truncatula]
Length = 207
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 24/134 (17%)
Query: 341 KVHRKQFSSDSDEV-GMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSD 399
KV K SSDS+E G+R ITI GENKGA M++I++ K P+ LH +LK
Sbjct: 94 KVPNKNCSSDSEEYSGIRKITIGGENKGACMKIIQTRKK------PNQLHKMENLKI--- 144
Query: 400 GTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPG 459
K D++ K + PP +G +MNSNVQ VNNS++ H+SC+HHDPG
Sbjct: 145 --------------KGYDDQSKKVRTISSPPMVGVYMNSNVQCVNNSLLLHASCTHHDPG 190
Query: 460 VHLSLSRKPSGNGF 473
V L+LS+KP G G+
Sbjct: 191 VQLTLSKKPFGKGY 204
>gi|296088979|emb|CBI38682.3| unnamed protein product [Vitis vinifera]
Length = 585
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 79/178 (44%), Positives = 108/178 (60%), Gaps = 14/178 (7%)
Query: 291 EQKTVLVQTRSEKPKSWF----NGTGNHFDETHKPSSYAHRGNHEVHKEVAKNGKVHRKQ 346
EQKTV++ EKPKS F G ++ ++ KPS +E + K+ KV+ K+
Sbjct: 243 EQKTVVI----EKPKSPFRPSIGGLQSNVGDSVKPSIVP---GYEKSEPGTKDQKVNLKK 295
Query: 347 FSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGWHSF 406
+DS++ +++IT+AG+N GA ME+ + K +G + L K LKT GT S
Sbjct: 296 --ADSEDASLKIITLAGDNTGAVMELSPARKKPDPEGNSNPLVKKTDLKTWL-GTKLGSD 352
Query: 407 SSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSL 464
+SSSDEEKSKK K PPM AF NSNVQ VNNSI+F++SC+HHDPGVHLS+
Sbjct: 353 ASSSDEEKSKKKDKAHNKMGTPLPPMNAFTNSNVQAVNNSILFNTSCNHHDPGVHLSI 410
>gi|449445712|ref|XP_004140616.1| PREDICTED: uncharacterized protein LOC101216647 [Cucumis sativus]
Length = 397
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 75/213 (35%), Positives = 100/213 (46%), Gaps = 38/213 (17%)
Query: 278 SKAETQP-TIPAETEQKTVLVQTRSEKP----KSWFNGTGNH------FDETHKPSSYAH 326
S AET+ TIP E E+KTV+ Q +KP + N+ FD+ K S
Sbjct: 191 SVAETKDETIPQEVERKTVVFQKVMDKPSQAEEHHLQNITNYRTHTSEFDKNGKQESNKG 250
Query: 327 RGNHEVHKE---------VAKNGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPL 377
KE + N R F D++ RVIT+AGENKGAFME+ S
Sbjct: 251 DDGDRDEKETSSKKKGATIGGNNNYKRTAFDHDNN---TRVITMAGENKGAFMEINLSSE 307
Query: 378 KHTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKS-KKDKNYKAKAAMLPPPMGAFM 436
K+ ++ H + ++ + S D KS K K K + + PM AF
Sbjct: 308 KN------NSRHQQQQQIQDNN-----TVVSVKDSNKSINKTKGIKTRNVL---PMRAFF 353
Query: 437 NSNVQGVNNSIVFHSSCSHHDPGVHLSLSRKPS 469
NSNVQG+NNSI+ S SHHDPG+HL S P+
Sbjct: 354 NSNVQGINNSILMDSKFSHHDPGIHLVFSSLPT 386
>gi|224082878|ref|XP_002306875.1| predicted protein [Populus trichocarpa]
gi|222856324|gb|EEE93871.1| predicted protein [Populus trichocarpa]
Length = 545
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 117/421 (27%), Positives = 178/421 (42%), Gaps = 47/421 (11%)
Query: 63 VSSVPTSPIRRTSVPSSPVPRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPK 122
+SVP SP R T P R + PPSP T+ + + SP++ S + PP
Sbjct: 116 AASVPPSPSRAT-----PQSRAASVPPSPSRATTQTQAASQTQSPSRATPQSRTASVPPS 170
Query: 123 PAVTTSPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFP 182
P+ TTS P+ A P SP++ A + P T+S P SP K A V P
Sbjct: 171 PSRTTSQPRTAALAVQQPESPSRLA-SRVPGKTSSQPS-----------SPSKIATQVQP 218
Query: 183 VITTSPPKPAPTTFRVPTSPPKPAPSTFSVPTSPAKPKPAVTAAISTTTSPPRVPSPAPV 242
++ SP K + + PP P++ T + PA++ +S V
Sbjct: 219 TVSRSPSKKLQLATQETSQPP---PTSTQSATQQQETNPALSFPLSQVPQEKTEIRAENV 275
Query: 243 GPRTLLSDGTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQPTIPAE----TEQKTVLVQ 298
+ S+ + ATA P S L P + K+++ P +EQ + +
Sbjct: 276 SRQQQQSEPVQASGVVRAATATPTSVAALEIPAASQKSDSYTIGPDHPMNLSEQLKNVKE 335
Query: 299 TRSEKPKSWFNGTGNH---------FDETHKPSSYAHRGNHEVHKEVAKNGK--VHRKQF 347
E+ K+ + G E+H+ SS ++ +HKE+ ++ VH+
Sbjct: 336 DIFERKKTTVSSNGETAKSARARYVLGESHQKSSMSNGEKVPLHKEIREDISKFVHKLGM 395
Query: 348 SSDSDEVGMR---VITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGWH 404
+G + ++T+AGEN+GA M P + DG+ H H K D
Sbjct: 396 EHIKHPIGEKPVSIVTLAGENRGASMYEGSEPTRK--DGSVHIHHG---YKINPD----E 446
Query: 405 SFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSL 464
S + +D E S K +K P M A++NSN Q VNNSI+F SS + PGV L L
Sbjct: 447 SSENPTDGEGSSKGGKFKDLLTKEDPAMKAYINSNTQSVNNSILFESSLNERSPGVQLHL 506
Query: 465 S 465
S
Sbjct: 507 S 507
>gi|224066211|ref|XP_002302027.1| predicted protein [Populus trichocarpa]
gi|222843753|gb|EEE81300.1| predicted protein [Populus trichocarpa]
Length = 552
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 149/507 (29%), Positives = 207/507 (40%), Gaps = 78/507 (15%)
Query: 1 MATQPPAPRQWFRLPSIGRP-APPVTPTPEPPAPQPRP------------PMIRAPFRPM 47
MA Q P FRLP + P APP+ PT EP P+PR P+ R PFRP
Sbjct: 1 MADQRPF---RFRLPWLSAPVAPPLRPTAEPQPPRPRVEIQAPARPITTIPIQRPPFRPA 57
Query: 48 GQPPLESI----------------PAPAAV--GVSSVPTSPIRRTSVPSSPVPRESPSPP 89
G P + P AA+ V++ P S R T S P P +S P
Sbjct: 58 GITPATTPTTQTTAQEQTEPQPVPPRRAAIESQVNAQPVSQSRETRAASVP-PSQSRETP 116
Query: 90 SPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPTSPAKPAPT 149
A + PP P + P T S + P+ + P P TT + P A
Sbjct: 117 QSRAASVPPSPSRAKTQSQAAPQTQSPSRATPQSRAASVAPSPSRTT----SQPQSAAAV 172
Query: 150 TFPVITTSPPKPAPTTFPG----ITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTSPPKP 205
T P T SP + A T PG SP K A V P ++ SP K + + PP P
Sbjct: 173 TVPQ-TQSPSRLA-TQVPGRKSPQLSSPSKTATQVQPTVSQSPSKKLQLATQEISQPP-P 229
Query: 206 APSTFSVPTSPAKPKPAV--TAAISTTTSPPRVPSPAPVGPRTLLSDGTPPQSPYKKATA 263
+ KP PA A T+P P PA P +L +S + A
Sbjct: 230 KSTQSDTQQEETKPTPAAEPVQASDAVTAPTPTPEPALETPASLQ------KSDSRTIGA 283
Query: 264 PPPSPLTLPPPQFTSKAETQPTIPAETEQKTVLVQTRSEKPKSWFNGTGNHFDETHKPSS 323
P PL+ P + + +KT + E K+ T + F E+H+ +S
Sbjct: 284 DHPMPLSQPVKRVKEDIFER--------KKTTTISPNGETIKTAR--TRSAFGESHQKTS 333
Query: 324 YAHRGNHEVHKEVAKNGK--VHRKQFSSDSDEVGMR---VITIAGENKGAFMEVIRSPLK 378
+ + KE+ ++ VH +G + V+T+AGEN+GA M V +
Sbjct: 334 MSSGEKVPLQKEIREDISKFVHNLGMEHMEHPIGEKPVSVVTLAGENRGATMYVGSESSR 393
Query: 379 HTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNS 438
DG+ H +H K D S +++D E S + ++ K P A++NS
Sbjct: 394 K--DGSVH-IHR--GYKINPD----ESSEATTDGEGSSRGRSSKDLLTKEDPARKAYINS 444
Query: 439 NVQGVNNSIVFHSSCSHHDPGVHLSLS 465
N Q VNNSI+F +S S PGV LSLS
Sbjct: 445 NTQSVNNSILFETSVSERSPGVQLSLS 471
>gi|449432018|ref|XP_004133797.1| PREDICTED: uncharacterized protein LOC101205942 [Cucumis sativus]
Length = 295
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 91/291 (31%), Positives = 122/291 (41%), Gaps = 66/291 (22%)
Query: 198 VPTSPPKPAPSTFSVP----TSPAKPKPAVTAAISTTTSPPRVPSPAPVGPRTLL---SD 250
P SP + +P S P TSP + + A T SPP +G + L +
Sbjct: 46 APASPRRESPRPLSSPSKKATSPFASRVDSSPAAKATRSPP-----DSIGDKYLERRNGE 100
Query: 251 GTPPQSPYKK--ATAPPPSPLTLPPPQFTSKAET--QPTIPAETEQKTVLVQTRSEKPKS 306
TPP +P K A P SPL LP Q + T QP + E E K ++ +++
Sbjct: 101 TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADE--- 157
Query: 307 WFNGTGNHFDETHKPSSYAHRGNHEVHKEVAKNGKVHRKQFSSDSDEVGMRVITIAGENK 366
KPS + H K V H+KQ V+ + G N
Sbjct: 158 -------------KPSKSNRQSEHGSGKSV------HQKQQKPG-------VMKLKGHNV 191
Query: 367 GAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGT--GWHSFSSSSDEEKSKKDKNYKAK 424
GA MEV +S + G TL K +T+ DG G+ D++ +K K
Sbjct: 192 GAVMEVNKSSAGYRLGG--ETLKKK---ETEDDGDVDGY----GHEDKKTGRKKK----- 237
Query: 425 AAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLSRKPSGNGFHV 475
PP+ AFMNSN Q VNNS++F SSC H DPG+HLS G G V
Sbjct: 238 -----PPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVV 283
>gi|116788029|gb|ABK24731.1| unknown [Picea sitchensis]
Length = 358
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 73/133 (54%), Gaps = 24/133 (18%)
Query: 341 KVHRKQ-FSSDSD----EVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLK 395
++H +Q S+ D ++G+RVIT+AGEN+GA M++ K+ H G
Sbjct: 217 ELHGQQKLGSNGDQKMQDLGVRVITLAGENRGAIMDMGSESRKNDVLDVQRVNHKPGG-- 274
Query: 396 TQSDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSH 455
++ +EE D N++ KAA P+ AF+NSN+Q NNSI+++S+C H
Sbjct: 275 ------------ANKNEESP--DGNHRQKAA---GPVNAFVNSNIQAANNSILYNSTCPH 317
Query: 456 HDPGVHLSLSRKP 468
DPGV L+LS P
Sbjct: 318 RDPGVRLALSNNP 330
>gi|449432016|ref|XP_004133796.1| PREDICTED: uncharacterized protein LOC101205715 [Cucumis sativus]
gi|449477975|ref|XP_004155181.1| PREDICTED: uncharacterized LOC101205715 [Cucumis sativus]
Length = 308
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 63/120 (52%), Gaps = 25/120 (20%)
Query: 358 VITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKSKK 417
VI I GEN GA M + +S DG+ + K ++ S ++E+ +K
Sbjct: 211 VINIKGENVGAVMHITQSS-----DGS-EVIKKKPTVGQ----------SKENEEKTNKS 254
Query: 418 DKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLSRKPSGNGFHVKD 477
+ NY P +FMNSN QGVNNSI+++SS SH DPG+HL+ K S +G + D
Sbjct: 255 NSNY---------PGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFGSKKSKHGDSIHD 305
>gi|255538680|ref|XP_002510405.1| oxidoreductase, putative [Ricinus communis]
gi|223551106|gb|EEF52592.1| oxidoreductase, putative [Ricinus communis]
Length = 551
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 64/114 (56%), Gaps = 11/114 (9%)
Query: 352 DEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSD 411
DE + V+TIAGEN GA M V P + DG+ H +H +G D TG +++D
Sbjct: 368 DEKPVCVMTIAGENTGASMHVGAEPARK--DGSIH-IH-RGYKLNPDDSTG-----ATTD 418
Query: 412 EEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLS 465
E S+ ++ + P A++NSN Q VNNSI+ SS + DPGVHL+LS
Sbjct: 419 GEGSRDGRSKNPTKEV--PVTKAYLNSNTQSVNNSIILDSSVNEQDPGVHLALS 470
>gi|242052375|ref|XP_002455333.1| hypothetical protein SORBIDRAFT_03g008720 [Sorghum bicolor]
gi|241927308|gb|EES00453.1| hypothetical protein SORBIDRAFT_03g008720 [Sorghum bicolor]
Length = 386
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 58/114 (50%), Gaps = 19/114 (16%)
Query: 355 GMRVITIAGENKGAFMEVIRSPLKH-TFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEE 413
G +IT+AGENKGA M+V S + D +G + DG S SS +E
Sbjct: 269 GTTIITLAGENKGASMKVDSSAVADGRGDAAAGGKERRGH---KLDG----SVVGSSGKE 321
Query: 414 KSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLSRK 467
+ K + AF+NSNVQ +NNS++ SSC+ DPGVHL LS K
Sbjct: 322 QGAGGKG-----------LTAFVNSNVQVINNSLMLQSSCNGGDPGVHLKLSTK 364
>gi|195342490|ref|XP_002037833.1| GM18084 [Drosophila sechellia]
gi|194132683|gb|EDW54251.1| GM18084 [Drosophila sechellia]
Length = 14551
Score = 62.4 bits (150), Expect = 5e-07, Method: Composition-based stats.
Identities = 73/270 (27%), Positives = 98/270 (36%), Gaps = 50/270 (18%)
Query: 14 LPSIGRPAPPVT-----------PTPEPPAPQPRPPMIRAPFRPMGQPPL-------ESI 55
+PS RP PP + PTP P P + P P Q P+
Sbjct: 11206 IPSAHRPVPPTSQRPVFVTSPGNPTPTPQPGVINIPSVSQPGYPTPQSPIYDANYPTTQS 11265
Query: 56 PAPAAVGVSSVPTSPIRRTSVPSSPVPRESPSPPSPPATTSPPKPQV----SSPSPTKLP 111
P P GV ++P+ P P+ P P + P+PP+ P +P V S+P P P
Sbjct: 11266 PIPQQPGVVNIPSLP-----SPAYPAPNPPVNYPTPPSPQKPVQPGVINIPSAPHPATPP 11320
Query: 112 STSSMPTSPPKPAVTTSPPKPLATTFSV--PTSPAK-----------------PAPTTFP 152
P P P ++ PKP A + V PT P+ P P
Sbjct: 11321 QHP--PVFIPSPESPSTAPKPGAPVYDVNYPTPPSAIPHQPGVVNIPSVPPPPPPVMQRP 11378
Query: 153 VITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTSPPKPAPSTFSV 212
V SP P P PG+ P P V P SP P + V PP P ++
Sbjct: 11379 VFVPSPAHPTPGPQPGVVNIPSVVQP-VHPTY-QSPVVERPAIYDVYYPPPPSRPGVINI 11436
Query: 213 PTSPAKPKPAVTAAISTTTSPPRVPSPAPV 242
P+ P P I + +P+P PV
Sbjct: 11437 PSPPRPVYPVPQQPIYVPSPVLHIPAPRPV 11466
Score = 58.2 bits (139), Expect = 1e-05, Method: Composition-based stats.
Identities = 60/215 (27%), Positives = 81/215 (37%), Gaps = 37/215 (17%)
Query: 19 RPAPPVTP-------TPEPPAPQPRPPMIRAPFRPMGQPPLESIPAPAAVGVSSVPTSPI 71
+PA P P P+P P P+PP+ + P+ P + + P P
Sbjct: 11160 QPANPEKPGVVNIPSVPQPVYPSPQPPVYHVNY---ATTPVSQQPGVVNIPSAHRPVPPT 11216
Query: 72 RRTSV----PSSPVPRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTT 127
+ V P +P P P + P+ + P P SP T+ P P +P V
Sbjct: 11217 SQRPVFVTSPGNPTPTPQPGVINIPSVSQPGYPTPQSPIYDANYPTTQSPI-PQQPGVVN 11275
Query: 128 SPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTS 187
P P SPA PAP PV +PP P PG+ P P P +
Sbjct: 11276 IPSLP---------SPAYPAPNP-PVNYPTPPSPQKPVQPGVINIPSAPHP-------AT 11318
Query: 188 PPKPAPTTFRVPTSP---PKPAPSTFSV--PTSPA 217
PP+ P P SP PKP + V PT P+
Sbjct: 11319 PPQHPPVFIPSPESPSTAPKPGAPVYDVNYPTPPS 11353
Score = 48.5 bits (114), Expect = 0.007, Method: Composition-based stats.
Identities = 63/232 (27%), Positives = 79/232 (34%), Gaps = 32/232 (13%)
Query: 77 PSSPVPRESPSPPSPPATTSPPKPQVSSPSP----TKLPSTSSMPTSPPKPAVTTSPPKP 132
PS + P P A + V + SP T S P +P KP V P P
Sbjct: 11117 PSLACVNQKCRDPCPGACNHLAQCHVINHSPHHHNTMFNYPSPQPANPEKPGVVNIPSVP 11176
Query: 133 LATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPA 192
P P+ P T+P P + P P + PV TSP P
Sbjct: 11177 Q------PVYPSPQPPVYHVNYATTPVSQQPGVVNIPSAHRPVPPTSQRPVFVTSPGNPT 11230
Query: 193 PTTFRVPTSPPKPAPSTFSVPTSPAKPKPAVTAAISTTTSP----------PRVPSPAPV 242
PT P + S P P P A TT SP P +PSPA
Sbjct: 11231 PT-----PQPGVINIPSVSQPGYPTPQSPIYDANYPTTQSPIPQQPGVVNIPSLPSPAYP 11285
Query: 243 GPRTLLSDGTP--PQSPYKKA-----TAPPPSPLTLPPPQFTSKAETQPTIP 287
P ++ TP PQ P + +AP P+ PP F E+ T P
Sbjct: 11286 APNPPVNYPTPPSPQKPVQPGVINIPSAPHPATPPQHPPVFIPSPESPSTAP 11337
>gi|449514829|ref|XP_004164492.1| PREDICTED: uncharacterized LOC101216647 [Cucumis sativus]
Length = 334
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 55/123 (44%), Gaps = 23/123 (18%)
Query: 278 SKAETQP-TIPAETEQKTVLVQTRSEKP----KSWFNGTGNH------FDETHKPSSYAH 326
S AET+ TIP E E+KTV+ Q +KP + N+ FD+ K S
Sbjct: 191 SVAETKDETIPQEVERKTVVFQKVMDKPSQAEEHHLQNITNYRTHTSEFDKNGKQESNKG 250
Query: 327 RGNHEVHKE---------VAKNGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPL 377
KE + N R F D++ RVIT+AGENKGAFME+ S
Sbjct: 251 DDGDRDEKETSSKKKGATIGGNNNYKRTAFDHDNN---TRVITMAGENKGAFMEINLSSE 307
Query: 378 KHT 380
K T
Sbjct: 308 KTT 310
>gi|145482197|ref|XP_001427121.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394200|emb|CAK59723.1| unnamed protein product [Paramecium tetraurelia]
Length = 1734
Score = 52.8 bits (125), Expect = 4e-04, Method: Composition-based stats.
Identities = 51/211 (24%), Positives = 64/211 (30%), Gaps = 26/211 (12%)
Query: 82 PRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPT 141
P++ P P PPK Q P P K P P P P+P
Sbjct: 1026 PKQPEPPKEQPKQPEPPKEQPKQPEPLK-----EQPKQPEPPKEQPKQPEP------PKE 1074
Query: 142 SPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTS 201
P +P P PPK P P +P P PPK P P
Sbjct: 1075 QPKQPEPPKEQPKQPEPPKEQPKQPEPPKEQPKQPEPPKEQPKQPEPPKEQPKQPEPPKE 1134
Query: 202 PPKPAPSTFSVPTSPAKPKPAVTAAISTTTSPPRVPSPAPVGPRTLLSDGTPPQSPYKKA 261
PK P P PK P+ P P P+ PP+ K+
Sbjct: 1135 QPKQPEPPKEQPKQPEPPK-----------EQPKQPEP----PKEQPKQPEPPKEQPKQP 1179
Query: 262 TAPPPSPLTLPPPQFTSKAETQPTIPAETEQ 292
P P PP+ + + QP P +Q
Sbjct: 1180 EPPKEQPKQQEPPKQPEQPKEQPKQPESPKQ 1210
>gi|351714165|gb|EHB17084.1| Proteoglycan 4 [Heterocephalus glaber]
Length = 1374
Score = 43.5 bits (101), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 64/145 (44%), Positives = 84/145 (57%), Gaps = 15/145 (10%)
Query: 86 PSPPSPPATTSPPKPQV-SSPSPTKLPSTSSMPTSPPKPA-----VTTSPPKPLATTFSV 139
P+ P PA T P P P+PTK P +PT+P +PA V T+P +P T V
Sbjct: 328 PTAPKEPAPTKEPVPTTPKEPAPTKEP----VPTAPKEPAPTKEPVPTTPKEPAPTKEPV 383
Query: 140 PTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPV-ITTSPPKPAPTTFRV 198
PT+P +PAPT PV TT P +PAPT PG T +P +PAPT + T+ +PAPT
Sbjct: 384 PTAPKEPAPTKEPVPTT-PKEPAPTKEPGPT-TPKEPAPTTTKEPVPTAQKEPAPTKEPG 441
Query: 199 PTSPPKPAPSTFS--VPTSPAKPKP 221
PT+P +PAP+T VPT+ +P P
Sbjct: 442 PTTPKEPAPTTTKEPVPTALKEPAP 466
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 86/160 (53%), Gaps = 29/160 (18%)
Query: 75 SVPSSPVPRESPSPPSP--PATTSPPKPQV-SSPSPTKLPSTSSMPTSPPKPAVTTSPPK 131
+ P P P + P P +P PA T P P P+PTK P +PT+P +PA T P
Sbjct: 343 TTPKEPAPTKEPVPTAPKEPAPTKEPVPTTPKEPAPTKEP----VPTAPKEPAPTKEP-- 396
Query: 132 PLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSP-----PKPAPTVFPVITT 186
VPT+P +PAPT P TT P +PAPTT T+ P +PAPT P TT
Sbjct: 397 -------VPTTPKEPAPTKEPGPTT-PKEPAPTT----TKEPVPTAQKEPAPTKEPGPTT 444
Query: 187 SPPKPAPTTFR--VPTSPPKPAPSTFSVPTSPAKPKPAVT 224
P +PAPTT + VPT+ +PAP+ PT+P +P P T
Sbjct: 445 -PKEPAPTTTKEPVPTALKEPAPTKEPGPTTPKEPAPTTT 483
>gi|390334661|ref|XP_781315.3| PREDICTED: protein SpAN-like [Strongylocentrotus purpuratus]
Length = 931
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 93/190 (48%), Gaps = 22/190 (11%)
Query: 61 VGVSSVPTSPIRRTSVPSSPVPRESPSPPSPPATT-SPPKPQVSSPSPTKLPSTSSMPTS 119
V + PT+P+ T P++PV + +P +P TT +P P V++ +PT T+ PT+
Sbjct: 524 VVTTEAPTTPVVTTEAPTTPVVT-TKAPTTPVVTTEAPTTPVVTTKAPTTPVVTTEAPTT 582
Query: 120 PPKPAVTTSPPKPLATTFSVPTSPA--KPAPTTFPVITTSPPKPAPTTFPGITRSPPKPA 177
P VTT P T PT+P APTT PV+TT APTT T++P P
Sbjct: 583 P---VVTTKAPTTPVVTTKAPTTPVVTTEAPTT-PVVTTE----APTTPVVTTKAPTTPV 634
Query: 178 PTV----FPVITTSPPKPAPTTFRVPTSPPKPAPSTFSVPTSPAKPKPAVTAAISTTTSP 233
T PV+TT APTT V T P + T PT+P A T + TT +P
Sbjct: 635 VTTEAPTTPVVTTE----APTTSVVTTEAPTTSVVTTKAPTTPVVTTEAQTTPVVTTKAP 690
Query: 234 --PRVPSPAP 241
P V + AP
Sbjct: 691 TTPVVTTKAP 700
Score = 38.9 bits (89), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 103/217 (47%), Gaps = 28/217 (12%)
Query: 40 IRAPF---RPMGQPPLESIPAPAAVGVSS-VPTSPIRRTSVPSSPVPRESPSPPSPPATT 95
RA F +P+ + P+ + AP GV++ PT+P+ T P++PV + +P +P TT
Sbjct: 449 FRAHFTIDQPLTEEPVVTTRAPTTPGVTTEAPTTPVVTTKAPTTPVVT-TKAPTTPVVTT 507
Query: 96 -SPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPTSPA--KPAPTTFP 152
+P P V++ + T T+ PT+ P VTT P T PT+P APTT P
Sbjct: 508 KAPTTPVVTTKALTTPVVTTEAPTT---PVVTTEAPTTPVVTTKAPTTPVVTTEAPTT-P 563
Query: 153 VITTSPP------KPAPTTFPGITRSPPKPAPT----VFPVITTSPP------KPAPTTF 196
V+TT P APTT T++P P T PV+TT P APTT
Sbjct: 564 VVTTKAPTTPVVTTEAPTTPVVTTKAPTTPVVTTKAPTTPVVTTEAPTTPVVTTEAPTTP 623
Query: 197 RVPTSPPKPAPSTFSVPTSPAKPKPAVTAAISTTTSP 233
V T P T PT+P A T ++ TT +P
Sbjct: 624 VVTTKAPTTPVVTTEAPTTPVVTTEAPTTSVVTTEAP 660
>gi|332230676|ref|XP_003264519.1| PREDICTED: LOW QUALITY PROTEIN: proteoglycan 4 [Nomascus
leucogenys]
Length = 1335
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 62/143 (43%), Positives = 82/143 (57%), Gaps = 24/143 (16%)
Query: 79 SPVPRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTT-------SPPK 131
PVP + S P+ P +P P+ SP+ TK P+ PT+P +PA TT +P +
Sbjct: 364 EPVPTTTKSAPTTPKEPAPTTPKEPSPTTTKEPA----PTTPKEPAPTTPKEPSPTTPKE 419
Query: 132 PL-ATTFSVPTSPAKPAPTT--FPVITTSPPKPAPTTFPGITRSPPKPAPTV-FPVITTS 187
P TT S PT+P +PAPTT P TT P KPAPTT P +PAPT +T+
Sbjct: 420 PAPTTTKSAPTTPKEPAPTTPKEPAPTT-PKKPAPTT-------PKEPAPTTPKEPASTT 471
Query: 188 PPKPAPTTFR-VPTSPPKPAPST 209
P +PAPTT + PT+P +PAP+T
Sbjct: 472 PKEPAPTTIKSAPTTPKEPAPTT 494
>gi|162457641|ref|YP_001620008.1| cellulose synthase 1 operon protein C [Sorangium cellulosum So
ce56]
gi|161168223|emb|CAN99528.1| Cellulose synthase 1 operon protein C precursor [Sorangium
cellulosum So ce56]
Length = 2239
Score = 38.1 bits (87), Expect = 9.4, Method: Composition-based stats.
Identities = 60/249 (24%), Positives = 82/249 (32%), Gaps = 11/249 (4%)
Query: 50 PPLESIPAPAAVGVSSVPTSPIRRTS--VPSSPVPRESPSPPSPPATTSPPKPQVSSPSP 107
P S P A G P SP R + P S P+P S P + + S
Sbjct: 237 PSGSSPARPPARGPGQRPPSPNRAATQPAPGSAASAGKPAPGSQPGGAAQAIAAFDA-ST 295
Query: 108 TKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFP 167
T +P S PT+ P P PL + +SPA + + +PAP
Sbjct: 296 TPMP-ISVEPTATPLPISVEPTATPLPISVEPSSSPAPISVESIEPALAQSAEPAPVQSA 354
Query: 168 GITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTSPPKPAPSTFSVPTSPAKPKPAVTAAI 227
T P P P++ P P P++ P P
Sbjct: 355 APAPVQSAAPAPAPSATRT----PLPGGAHAPSATRTPLPGGAHAPSATRTPLPGGAPTP 410
Query: 228 STTTSPPRVPSPAPVGPRTLLSDGTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQPTIP 287
+P +P P RT + TP + AT P+P T P Q + A T P
Sbjct: 411 GAPRAPLPGGAPTPGATRTPIPGVTPSAAHAPGAT---PAPSTAAPAQSAAHAATVLHTP 467
Query: 288 AETEQKTVL 296
+ TVL
Sbjct: 468 SAAHAATVL 476
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.308 0.127 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,806,622,620
Number of Sequences: 23463169
Number of extensions: 573886114
Number of successful extensions: 11830621
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 28163
Number of HSP's successfully gapped in prelim test: 157130
Number of HSP's that attempted gapping in prelim test: 5313494
Number of HSP's gapped (non-prelim): 2241157
length of query: 484
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 337
effective length of database: 8,910,109,524
effective search space: 3002706909588
effective search space used: 3002706909588
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 79 (35.0 bits)