BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011479
         (484 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255585261|ref|XP_002533331.1| conserved hypothetical protein [Ricinus communis]
 gi|223526836|gb|EEF29052.1| conserved hypothetical protein [Ricinus communis]
          Length = 502

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 241/543 (44%), Positives = 294/543 (54%), Gaps = 102/543 (18%)

Query: 1   MATQPPAPRQWFRLPSIGRPAPPVTPTPEPPAP------QPRPPMIRAPFRPMGQPPLES 54
           MA QP   R W RL SI RPAPP +  P PPA       QPR  + R   RP+ QP L +
Sbjct: 1   MANQPSQSRPWSRLYSIARPAPPES-IPTPPAAPESAPGQPRAVLARPSLRPVTQPTLPT 59

Query: 55  IP-----------------------------APA----AVGVSSVPTSPIRRTSVPSSPV 81
            P                             +PA      GV+SVP+SP+ R SVP++ +
Sbjct: 60  RPQEPTQAPQPPVATVPPPVAAPPPATPTQRSPAKPSGGGGVASVPSSPVLRASVPTASL 119

Query: 82  PRESPSPPSPPATTSP----------PKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPK 131
           P+   SP        P          P+P+VS+P       TSS+P+SP          K
Sbjct: 120 PK---SPVQTATAVGPVTTSSSVQSSPRPRVSAP-------TSSLPSSPVL--------K 161

Query: 132 PLATTFSVPTSPA-KPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPK 190
           P   TFSVPTSPA KP PTT  + T+      P T         KP PT   V T+   K
Sbjct: 162 P-TQTFSVPTSPATKPMPTTVSLPTS------PAT---------KPMPTTVSVPTSPATK 205

Query: 191 PAPTTFRVPTSP-PKPAPSTFSVPTSPAKPKPAVTAAISTTTSPPR-VPSPAPVGPRTLL 248
           P PTT  VPTSP PKPAP+T SVPTSPA    + T   ++T   P  V  P  + P    
Sbjct: 206 PMPTTISVPTSPLPKPAPATSSVPTSPANRAVSTTTTATSTARVPSPVQYPRTINPVVQ- 264

Query: 249 SDGTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQPTIPAETEQKTVLVQTRSEKPKSWF 308
              +PP+SP    TAPPPSPL LPP +  S AET+  IP   EQKTV+VQ   +KPK W 
Sbjct: 265 ---SPPRSPKLNPTAPPPSPLILPPAKIKSDAETEAKIPLVAEQKTVVVQKTIDKPKPWL 321

Query: 309 NGTGNHFDETHKP--SSYAHRGNHEVHKEVAK-NGKVHRKQFSSDSDEVGMRVITIAGEN 365
              GN   +      SS    G  E  K+  +   KV  K+  S S++ GMRVIT+AGEN
Sbjct: 322 TNGGNSERDLVNAVKSSIGLNGKQETTKDHGETKEKVLHKKIFSGSEDGGMRVITVAGEN 381

Query: 366 KGAFMEVIRSP-LKHTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKSK-----KDK 419
           KGAFMEVIRSP  KH F+G+PH LH K +   +SDG  W S+SS+    + +     KDK
Sbjct: 382 KGAFMEVIRSPNKKHVFEGSPHHLHKKDN-DPKSDGIKWISYSSNGSSSEGEGKPKMKDK 440

Query: 420 NYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLSRKPSGNGFHVKDKG 479
           N+K K  M   PM AFMNSNVQGVNNSIV++SSC+HHDPGVHL+LSRK +G GFHVKD+ 
Sbjct: 441 NHKGKG-MNSLPMTAFMNSNVQGVNNSIVYNSSCTHHDPGVHLALSRKSAGGGFHVKDRS 499

Query: 480 NGH 482
           NGH
Sbjct: 500 NGH 502


>gi|224127999|ref|XP_002320216.1| predicted protein [Populus trichocarpa]
 gi|222860989|gb|EEE98531.1| predicted protein [Populus trichocarpa]
          Length = 447

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 231/513 (45%), Positives = 279/513 (54%), Gaps = 95/513 (18%)

Query: 1   MATQPPAPRQWFRLPSIGRPAPPVTPTPEPPAPQPRPPMIRAP-FRPMGQPPLESIPAPA 59
           M+ QP  PR WFRLPSI RPA P         P    P +  P FR      L ++P P 
Sbjct: 1   MSNQPAPPRPWFRLPSIARPAAPAPTPTPESPPPQPRPALARPSFR------LTALPQPV 54

Query: 60  --------------------AVGVSSVPTSPIRRT-----SVPSSPVPRESPSPPSPPAT 94
                               A G++SVPTSP  +      S+P+SP PR           
Sbjct: 55  PTQPQEPTPAPPPSAATVPPAAGIASVPTSPAVKAAGGVASLPTSPAPR----------- 103

Query: 95  TSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPTSPAKPAPTTFPVI 154
                    +P+P+     SS+PTSP  P  T  PP    TTFS+P     P+PT  P  
Sbjct: 104 ---------APAPS-----SSVPTSP-VPTSTVFPP----TTFSLP-----PSPTLKPSR 139

Query: 155 TTS--PPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTSPPKPAPSTFSV 212
           T+S  P  PA T  P  + S    +P+  PV TTS    +P               T SV
Sbjct: 140 TSSSVPTSPASTPSPSASVST---SPSTRPVSTTSSAPSSPAPKPATV--------TSSV 188

Query: 213 PTSPAKPKPAVTAAISTTTSPPRVPSPAPVGPRTLLSDGTPPQSPYKKATAPPPSPLTLP 272
           P SPA      T A+  TTS  RVP P P       +  TPPQSP  K TAPPPSPLT P
Sbjct: 189 PNSPA------TKAV--TTSAARVPGPTPSLRIIKPTVQTPPQSPKPKPTAPPPSPLTRP 240

Query: 273 PPQFTSKAETQPTIPAETEQKTVLVQTRSEKPKSWFNGTGNHFDETHKPSSYAHRGNHEV 332
           P +  S A+ +P IP   EQKTVLVQ   +KPK   +      D     S  A     E 
Sbjct: 241 PSRVKSDADLEPKIPLVAEQKTVLVQKIIDKPKEAGDSLRAFADSLS--SGIARLAKPET 298

Query: 333 HKEVAKNGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKG 392
            K+  K  K   K+ SSDS++VGMRVITIAGENKGAFMEVIRSP KH F+G  HTL+ KG
Sbjct: 299 AKDQTKE-KGSGKKISSDSEDVGMRVITIAGENKGAFMEVIRSPKKHFFEGNSHTLNKKG 357

Query: 393 SLKTQSDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSS 452
           + +  S+G+ W S SSS +E  SKKDKN+K ++ M P PM AFMNSNVQGVNNSIV++SS
Sbjct: 358 NPR--SEGSDWGSQSSSGEEGNSKKDKNHKGRS-MGPSPMSAFMNSNVQGVNNSIVYNSS 414

Query: 453 CSHHDPGVHLSLSRKPSGN-GFHVKDKGNGHHS 484
           CSHHDPGVH++LSRKPSG+ GFHVKD+GNG+ S
Sbjct: 415 CSHHDPGVHVALSRKPSGSAGFHVKDRGNGYQS 447


>gi|224068030|ref|XP_002302653.1| predicted protein [Populus trichocarpa]
 gi|222844379|gb|EEE81926.1| predicted protein [Populus trichocarpa]
          Length = 454

 Score =  234 bits (597), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 223/502 (44%), Positives = 270/502 (53%), Gaps = 66/502 (13%)

Query: 1   MATQPPAPRQWFRLPSIGRPAPPVTPTPEPPAPQPRPPMIRAP-FRPMG--QPPLESIPA 57
           M+ QP   R WFRLPSI RP  P T     P P    P +  P FRP    QP       
Sbjct: 1   MSNQPAPARPWFRLPSIARPTAPTTTPTPEPPPPQPRPALARPAFRPTAPPQP------- 53

Query: 58  PAAVGVSSVPTSPIRRTSVPSSPVPRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMP 117
                   VPT P            RE+    +PPA T P    V++P+P  +P  S + 
Sbjct: 54  --------VPTQP------------RET----TPPAATGPQ--AVAAPTPEPVPPVSGVA 87

Query: 118 TSPPKPAVTTSPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPA 177
           + P  P       +      SVPTSPA  AP     + TSP  P     P  T S P   
Sbjct: 88  SVPTSPVA-----RAAVGVASVPTSPASRAPAPSSSVPTSP-VPTSAVLPLTTSSLPPSP 141

Query: 178 PTVFPVITTSPP-----KPAPTTFRVPTSP-PKPAPSTFSVPTSPAKPKPAVTAAI---- 227
                  ++S P     KP  TT  VPTSP PKP  +T SVPTS A     VT+++    
Sbjct: 142 TPKPAPTSSSVPTSPATKPMTTTSSVPTSPAPKPVTTTSSVPTSYASKSVTVTSSVQNSP 201

Query: 228 ---STTTSPPRVPSPAPVGPRTLL-SDGTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQ 283
              + TT+  RVPSP P   RT+  +  TPPQSP  + TAPPPSPLTLPP Q  S A+ +
Sbjct: 202 ATKAVTTNATRVPSPGP-SLRTIKPAVQTPPQSPKPRPTAPPPSPLTLPPSQVKSYADLE 260

Query: 284 PTIPAETEQKTVLVQTRSEKPKSWFNGTGNHFDETHKPSSYAHRGNHEVHKEVAKNGKVH 343
           P IP   EQKTVLVQ   +KPK   +   +  D     S  A     E  K+V    K +
Sbjct: 261 PKIPLVAEQKTVLVQKTFDKPKVARDSQRDFADSLS--SGIARLAKQETTKDVQTKEKGN 318

Query: 344 RKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGW 403
           RK+ SSDS++ GM+VITIAGENKGAFMEVIRSP K  F+GT H LH KG+ K++      
Sbjct: 319 RKKNSSDSEDGGMKVITIAGENKGAFMEVIRSPKKQVFEGTSHFLHKKGNPKSEG----- 373

Query: 404 HSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLS 463
              SSS +    KKDKN   KA M P P+ +FMNSNVQGVNNSIV++SSCSHHDPGVH++
Sbjct: 374 SDSSSSEEGNSKKKDKNPNGKA-MGPSPLSSFMNSNVQGVNNSIVYNSSCSHHDPGVHVA 432

Query: 464 LSRKPSG-NGFHVKDKGNGHHS 484
           LSRKP+G   FHVKD GN + S
Sbjct: 433 LSRKPAGAAAFHVKDHGNDNQS 454


>gi|357512591|ref|XP_003626584.1| hypothetical protein MTR_7g117980 [Medicago truncatula]
 gi|355501599|gb|AES82802.1| hypothetical protein MTR_7g117980 [Medicago truncatula]
          Length = 371

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 118/264 (44%), Positives = 157/264 (59%), Gaps = 21/264 (7%)

Query: 229 TTTSPPRVPSPAPVGPRTLLSD-GTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQPTIP 287
           TT +  RVP P    P+T+  +  +P  SP  ++T  PPSPLTLPP QF ++ E    IP
Sbjct: 115 TTANTSRVPIPTQ-SPKTIKQNVRSPMHSPNARSTNSPPSPLTLPPSQFKTREEHSNKIP 173

Query: 288 AETEQKTVLVQTRSEKPKSWFNGTG-------NHFD-ETHKPS-SYAHRGNHEVHKEVAK 338
            E E K VLVQ   + PK W N  G       NH + E H+ + ++++  +H  H  V  
Sbjct: 174 VEAEPKAVLVQKTVDVPKPWHNDKGELHRENQNHSNGEFHRQTQNHSNSSHHGKHITVKG 233

Query: 339 NGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGS-LKTQ 397
                 K+FS DS++ GMRVITIAGEN+GA+ME+++S  KH     P+ LH KG+ +K  
Sbjct: 234 RESSKEKKFS-DSEDSGMRVITIAGENRGAYMELVQSQKKHQ----PNYLHKKGNTIKVD 288

Query: 398 SDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHD 457
               G  S SSS+DE K  K             PM A+MNSNVQ VNNS+++H+SCSHHD
Sbjct: 289 ----GGESESSSADEGKINKKDKNNKGRTKSSIPMAAYMNSNVQCVNNSLLYHASCSHHD 344

Query: 458 PGVHLSLSRKPSGNGFHVKDKGNG 481
           PGV LSLS+KP G G+HVK+  +G
Sbjct: 345 PGVRLSLSKKPFGEGYHVKENVDG 368


>gi|297824745|ref|XP_002880255.1| hypothetical protein ARALYDRAFT_904132 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326094|gb|EFH56514.1| hypothetical protein ARALYDRAFT_904132 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 385

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 75/143 (52%), Positives = 92/143 (64%), Gaps = 4/143 (2%)

Query: 341 KVHRKQFSSDSDEV-GMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSD 399
           K+HR+  +SDS+ +   RVITIAGENKGA ME++RSP  +   G+  T  ++ S  T   
Sbjct: 238 KMHRQPSTSDSENIMSTRVITIAGENKGAVMEILRSPQGNKTGGSG-THSSRVSHGTGEK 296

Query: 400 GTGWHSFSSSSDEEKSKKDKNYKA-KAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDP 458
           G    S SSSS +E   K K  K         PM AFMNSNVQ +NNSIV++S+ SHHDP
Sbjct: 297 GRRLQSSSSSSSDEGEGKKKTTKNPNNGNSNLPMKAFMNSNVQMINNSIVYNSTASHHDP 356

Query: 459 GVHLSLSRKP-SGNGFHVKDKGN 480
           GVHL +SRKP S NGFHV D GN
Sbjct: 357 GVHLKISRKPGSDNGFHVSDYGN 379


>gi|147864007|emb|CAN78958.1| hypothetical protein VITISV_008152 [Vitis vinifera]
          Length = 277

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 85/223 (38%), Positives = 121/223 (54%), Gaps = 40/223 (17%)

Query: 263 APPPSPLTLPPPQFTSKAETQPTIPAETEQKTVLVQTRSEKPKSWFNGTGNHFDETHKPS 322
           +PPPS LTLPP +  +K+  +P      EQK +      +KP   F  +     +T KPS
Sbjct: 94  SPPPSALTLPPAR--AKSTHEPI-----EQKIL------QKPNPPFKLSNGVLGDTAKPS 140

Query: 323 SYAHRGNHEVHKEV-AKNGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTF 381
              +    E+ K++ +K+ K + K+  +DS++  M VIT+AG+NKGA ME          
Sbjct: 141 ISPNYEKRELTKDMESKDHKGNHKK--ADSEDASMEVITVAGDNKGAIME---------- 188

Query: 382 DGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQ 441
                 L    S K  SD       +SS+ EEKSKK    + K     PPM AF+NSNVQ
Sbjct: 189 ------LSLASSNKLGSD-------TSSNYEEKSKKMDKTENKMGTPSPPMSAFVNSNVQ 235

Query: 442 GVNNSIVFHSSCSHHDPGVHLSLSRKPSGNGFHVKDKGNGHHS 484
            +NNSI+F++S +HHDPGVH+  S    G G H+KD G+G ++
Sbjct: 236 AINNSILFNTSLTHHDPGVHVFFS-GGGGRGLHLKDSGDGQNN 277


>gi|315653992|ref|ZP_07906908.1| conserved hypothetical protein [Lactobacillus iners ATCC 55195]
 gi|315488688|gb|EFU78334.1| conserved hypothetical protein [Lactobacillus iners ATCC 55195]
          Length = 2148

 Score =  110 bits (274), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 83/272 (30%), Positives = 121/272 (44%), Gaps = 32/272 (11%)

Query: 20   PAPPVTPT-PEPPAPQPRPPMIRAPFRPMG--QPPLESIP----APAAVGVSSVPTSPI- 71
            P  P TP+ PE P    +P M   P  P+   +P + S+P     P    + S P  P+ 
Sbjct: 1815 PVKPKTPSEPEVPVTPEKPKMPSVPEVPVTPEKPKMPSVPEVTVKPETPKIPSKPEVPVT 1874

Query: 72   -RRTSVPSSP-VPRESPSPPSPPATTSPPKPQVSS----------PSPTKLPSTSSMPTS 119
              +  +PS P VP +  +P  P  +  P KP  +S          PS  + P T   P +
Sbjct: 1875 PEKPKIPSEPEVPVKPNTPSEPEVSVKPVKPNTTSVPEVSVKPKTPSEPETPVTPVKPKT 1934

Query: 120  PPKPAVTTSPPKP-LATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAP 178
            P +P  + +P KP   +   V   P KP  T+ P ++  P KP   + P +  +P KP  
Sbjct: 1935 PSEPETSETPVKPKTPSAPEVSVKPVKPNTTSVPEVSVKPVKPKTPSVPEVPVTPVKPKT 1994

Query: 179  TVFPVITTSPPKP-APTTFRVPTSPPKP-APSTFSVPTSPAKPKPAVTAAISTTTSPPRV 236
               P ++ +P KP  P+   VP +P KP  PS   VP +P KP+      +  T   P+ 
Sbjct: 1995 PSAPEVSLTPEKPKTPSVPEVPVTPEKPKTPSVPEVPVTPVKPQTPSVPEVPVTPEKPKT 2054

Query: 237  P-----SPAPVGPRTLLSDGTPPQSPYKKATA 263
            P     S  PV P+T     + P+ P KKA A
Sbjct: 2055 PSVPEVSVTPVKPKT----PSVPEVPAKKANA 2082



 Score = 95.1 bits (235), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 89/288 (30%), Positives = 120/288 (41%), Gaps = 34/288 (11%)

Query: 23   PVTPTPEPPAPQPRPPMIRAPFRPMGQPPLESIPAPAAVGVSSVPTSPIRRTSVPSSPVP 82
            PVTP       +P  P+         +P + S+P      V   P  P +  SVP   V 
Sbjct: 1812 PVTPVKPKTPSEPEVPVTPE------KPKMPSVPE-----VPVTPEKP-KMPSVPEVTVK 1859

Query: 83   RESPSPPSPPAT-TSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPT 141
             E+P  PS P    +P KP++ S      P     P +P +P V+  P KP   T SVP 
Sbjct: 1860 PETPKIPSKPEVPVTPEKPKIPSE-----PEVPVKPNTPSEPEVSVKPVKP--NTTSVPE 1912

Query: 142  SPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTF-RVPT 200
               KP   + P    +P KP   + P  + +P KP     P ++  P KP  T+   V  
Sbjct: 1913 VSVKPKTPSEPETPVTPVKPKTPSEPETSETPVKPKTPSAPEVSVKPVKPNTTSVPEVSV 1972

Query: 201  SPPKP-APSTFSVPTSPAKPKPAVTAAISTTTSPPRVPSPAPVGPRTLLSDGTPPQSPYK 259
             P KP  PS   VP +P KPK      +S T   P+ PS  P  P T       P+ P  
Sbjct: 1973 KPVKPKTPSVPEVPVTPVKPKTPSAPEVSLTPEKPKTPS-VPEVPVT-------PEKP-- 2022

Query: 260  KATAPPPSPLTLPPPQFTSKAETQPTIPAETEQKTV-LVQTRSEKPKS 306
            K  + P  P+T   PQ  S  E  P  P + +  +V  V     KPK+
Sbjct: 2023 KTPSVPEVPVTPVKPQTPSVPEV-PVTPEKPKTPSVPEVSVTPVKPKT 2069



 Score = 87.4 bits (215), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 76/278 (27%), Positives = 112/278 (40%), Gaps = 54/278 (19%)

Query: 77   PSSPVPRESPSPPSPPAT-------------TSPPKPQVSS-------PSPTKLPSTSSM 116
            P +PV  ++PS P  P T              +P KP++ S       P   K+PS   +
Sbjct: 1812 PVTPVKPKTPSEPEVPVTPEKPKMPSVPEVPVTPEKPKMPSVPEVTVKPETPKIPSKPEV 1871

Query: 117  PTSPPKPAVTTSPPKPLA----------------TTFSVPTSPAKPAPTTFPVITTSPPK 160
            P +P KP + + P  P+                  T SVP    KP   + P    +P K
Sbjct: 1872 PVTPEKPKIPSEPEVPVKPNTPSEPEVSVKPVKPNTTSVPEVSVKPKTPSEPETPVTPVK 1931

Query: 161  PAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTF-RVPTSPPKP-APSTFSVPTSPAK 218
            P   + P  + +P KP     P ++  P KP  T+   V   P KP  PS   VP +P K
Sbjct: 1932 PKTPSEPETSETPVKPKTPSAPEVSVKPVKPNTTSVPEVSVKPVKPKTPSVPEVPVTPVK 1991

Query: 219  PKPAVTAAISTTTSPPRVPS--PAPVGPRTLLSDGTP--PQSPYKKAT-APPPSPLTLPP 273
            PK      +S T   P+ PS    PV P    +   P  P +P K  T + P  P+T   
Sbjct: 1992 PKTPSAPEVSLTPEKPKTPSVPEVPVTPEKPKTPSVPEVPVTPVKPQTPSVPEVPVTPEK 2051

Query: 274  PQFTS-----------KAETQPTIPAETEQKTVLVQTR 300
            P+  S           K  + P +PA+    + +V+++
Sbjct: 2052 PKTPSVPEVSVTPVKPKTPSVPEVPAKKANASTIVKSQ 2089



 Score = 65.5 bits (158), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 60/221 (27%), Positives = 86/221 (38%), Gaps = 44/221 (19%)

Query: 112  STSSMPTSPPKPAVTTSPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITR 171
            S S  P +P KP   + P         VP +P KP   + P +  +P KP   + P +T 
Sbjct: 1807 SISVKPVTPVKPKTPSEP--------EVPVTPEKPKMPSVPEVPVTPEKPKMPSVPEVTV 1858

Query: 172  SPPKPAPTVFPVITTSPPKPA-PTTFRVPTSPPKPA----------PSTFSVPTSPAKPK 220
             P  P     P +  +P KP  P+   VP  P  P+          P+T SVP    KPK
Sbjct: 1859 KPETPKIPSKPEVPVTPEKPKIPSEPEVPVKPNTPSEPEVSVKPVKPNTTSVPEVSVKPK 1918

Query: 221  PAVTAAISTTTSPPRVP-----SPAPVGPRTLLSDGTPPQSPYKKATAPPPSPLTLPPPQ 275
                     T   P+ P     S  PV P+T       P +P  + +  P  P T   P+
Sbjct: 1919 TPSEPETPVTPVKPKTPSEPETSETPVKPKT-------PSAP--EVSVKPVKPNTTSVPE 1969

Query: 276  FTSKAETQPTIPAETEQKTVLVQTRS----------EKPKS 306
             + K   +P  P+  E     V+ ++          EKPK+
Sbjct: 1970 VSVKP-VKPKTPSVPEVPVTPVKPKTPSAPEVSLTPEKPKT 2009


>gi|357514507|ref|XP_003627542.1| hypothetical protein MTR_8g027290 [Medicago truncatula]
 gi|355521564|gb|AET02018.1| hypothetical protein MTR_8g027290 [Medicago truncatula]
          Length = 207

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/134 (44%), Positives = 79/134 (58%), Gaps = 24/134 (17%)

Query: 341 KVHRKQFSSDSDEV-GMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSD 399
           KV  K  SSDS+E  G+R ITI GENKGA M++I++  K      P+ LH   +LK    
Sbjct: 94  KVPNKNCSSDSEEYSGIRKITIGGENKGACMKIIQTRKK------PNQLHKMENLKI--- 144

Query: 400 GTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPG 459
                         K   D++ K +    PP +G +MNSNVQ VNNS++ H+SC+HHDPG
Sbjct: 145 --------------KGYDDQSKKVRTISSPPMVGVYMNSNVQCVNNSLLLHASCTHHDPG 190

Query: 460 VHLSLSRKPSGNGF 473
           V L+LS+KP G G+
Sbjct: 191 VQLTLSKKPFGKGY 204


>gi|296088979|emb|CBI38682.3| unnamed protein product [Vitis vinifera]
          Length = 585

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 79/178 (44%), Positives = 108/178 (60%), Gaps = 14/178 (7%)

Query: 291 EQKTVLVQTRSEKPKSWF----NGTGNHFDETHKPSSYAHRGNHEVHKEVAKNGKVHRKQ 346
           EQKTV++    EKPKS F     G  ++  ++ KPS       +E  +   K+ KV+ K+
Sbjct: 243 EQKTVVI----EKPKSPFRPSIGGLQSNVGDSVKPSIVP---GYEKSEPGTKDQKVNLKK 295

Query: 347 FSSDSDEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGWHSF 406
             +DS++  +++IT+AG+N GA ME+  +  K   +G  + L  K  LKT   GT   S 
Sbjct: 296 --ADSEDASLKIITLAGDNTGAVMELSPARKKPDPEGNSNPLVKKTDLKTWL-GTKLGSD 352

Query: 407 SSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSL 464
           +SSSDEEKSKK      K     PPM AF NSNVQ VNNSI+F++SC+HHDPGVHLS+
Sbjct: 353 ASSSDEEKSKKKDKAHNKMGTPLPPMNAFTNSNVQAVNNSILFNTSCNHHDPGVHLSI 410


>gi|449445712|ref|XP_004140616.1| PREDICTED: uncharacterized protein LOC101216647 [Cucumis sativus]
          Length = 397

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 75/213 (35%), Positives = 100/213 (46%), Gaps = 38/213 (17%)

Query: 278 SKAETQP-TIPAETEQKTVLVQTRSEKP----KSWFNGTGNH------FDETHKPSSYAH 326
           S AET+  TIP E E+KTV+ Q   +KP    +       N+      FD+  K  S   
Sbjct: 191 SVAETKDETIPQEVERKTVVFQKVMDKPSQAEEHHLQNITNYRTHTSEFDKNGKQESNKG 250

Query: 327 RGNHEVHKE---------VAKNGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPL 377
                  KE         +  N    R  F  D++    RVIT+AGENKGAFME+  S  
Sbjct: 251 DDGDRDEKETSSKKKGATIGGNNNYKRTAFDHDNN---TRVITMAGENKGAFMEINLSSE 307

Query: 378 KHTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKS-KKDKNYKAKAAMLPPPMGAFM 436
           K+      ++ H +      ++     +  S  D  KS  K K  K +  +   PM AF 
Sbjct: 308 KN------NSRHQQQQQIQDNN-----TVVSVKDSNKSINKTKGIKTRNVL---PMRAFF 353

Query: 437 NSNVQGVNNSIVFHSSCSHHDPGVHLSLSRKPS 469
           NSNVQG+NNSI+  S  SHHDPG+HL  S  P+
Sbjct: 354 NSNVQGINNSILMDSKFSHHDPGIHLVFSSLPT 386


>gi|224082878|ref|XP_002306875.1| predicted protein [Populus trichocarpa]
 gi|222856324|gb|EEE93871.1| predicted protein [Populus trichocarpa]
          Length = 545

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 117/421 (27%), Positives = 178/421 (42%), Gaps = 47/421 (11%)

Query: 63  VSSVPTSPIRRTSVPSSPVPRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPK 122
            +SVP SP R T     P  R +  PPSP   T+  +    + SP++    S   + PP 
Sbjct: 116 AASVPPSPSRAT-----PQSRAASVPPSPSRATTQTQAASQTQSPSRATPQSRTASVPPS 170

Query: 123 PAVTTSPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFP 182
           P+ TTS P+  A     P SP++ A +  P  T+S P            SP K A  V P
Sbjct: 171 PSRTTSQPRTAALAVQQPESPSRLA-SRVPGKTSSQPS-----------SPSKIATQVQP 218

Query: 183 VITTSPPKPAPTTFRVPTSPPKPAPSTFSVPTSPAKPKPAVTAAISTTTSPPRVPSPAPV 242
            ++ SP K      +  + PP   P++    T   +  PA++  +S             V
Sbjct: 219 TVSRSPSKKLQLATQETSQPP---PTSTQSATQQQETNPALSFPLSQVPQEKTEIRAENV 275

Query: 243 GPRTLLSDGTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQPTIPAE----TEQKTVLVQ 298
             +   S+        + ATA P S   L  P  + K+++    P      +EQ   + +
Sbjct: 276 SRQQQQSEPVQASGVVRAATATPTSVAALEIPAASQKSDSYTIGPDHPMNLSEQLKNVKE 335

Query: 299 TRSEKPKSWFNGTGNH---------FDETHKPSSYAHRGNHEVHKEVAKNGK--VHRKQF 347
              E+ K+  +  G             E+H+ SS ++     +HKE+ ++    VH+   
Sbjct: 336 DIFERKKTTVSSNGETAKSARARYVLGESHQKSSMSNGEKVPLHKEIREDISKFVHKLGM 395

Query: 348 SSDSDEVGMR---VITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGWH 404
                 +G +   ++T+AGEN+GA M     P +   DG+ H  H     K   D     
Sbjct: 396 EHIKHPIGEKPVSIVTLAGENRGASMYEGSEPTRK--DGSVHIHHG---YKINPD----E 446

Query: 405 SFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSL 464
           S  + +D E S K   +K       P M A++NSN Q VNNSI+F SS +   PGV L L
Sbjct: 447 SSENPTDGEGSSKGGKFKDLLTKEDPAMKAYINSNTQSVNNSILFESSLNERSPGVQLHL 506

Query: 465 S 465
           S
Sbjct: 507 S 507


>gi|224066211|ref|XP_002302027.1| predicted protein [Populus trichocarpa]
 gi|222843753|gb|EEE81300.1| predicted protein [Populus trichocarpa]
          Length = 552

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 149/507 (29%), Positives = 207/507 (40%), Gaps = 78/507 (15%)

Query: 1   MATQPPAPRQWFRLPSIGRP-APPVTPTPEPPAPQPRP------------PMIRAPFRPM 47
           MA Q P     FRLP +  P APP+ PT EP  P+PR             P+ R PFRP 
Sbjct: 1   MADQRPF---RFRLPWLSAPVAPPLRPTAEPQPPRPRVEIQAPARPITTIPIQRPPFRPA 57

Query: 48  GQPPLESI----------------PAPAAV--GVSSVPTSPIRRTSVPSSPVPRESPSPP 89
           G  P  +                 P  AA+   V++ P S  R T   S P P +S   P
Sbjct: 58  GITPATTPTTQTTAQEQTEPQPVPPRRAAIESQVNAQPVSQSRETRAASVP-PSQSRETP 116

Query: 90  SPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPTSPAKPAPT 149
              A + PP P  +       P T S   + P+    +  P P  TT    + P   A  
Sbjct: 117 QSRAASVPPSPSRAKTQSQAAPQTQSPSRATPQSRAASVAPSPSRTT----SQPQSAAAV 172

Query: 150 TFPVITTSPPKPAPTTFPG----ITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTSPPKP 205
           T P  T SP + A T  PG       SP K A  V P ++ SP K      +  + PP P
Sbjct: 173 TVPQ-TQSPSRLA-TQVPGRKSPQLSSPSKTATQVQPTVSQSPSKKLQLATQEISQPP-P 229

Query: 206 APSTFSVPTSPAKPKPAV--TAAISTTTSPPRVPSPAPVGPRTLLSDGTPPQSPYKKATA 263
             +         KP PA     A    T+P   P PA   P +L       +S  +   A
Sbjct: 230 KSTQSDTQQEETKPTPAAEPVQASDAVTAPTPTPEPALETPASLQ------KSDSRTIGA 283

Query: 264 PPPSPLTLPPPQFTSKAETQPTIPAETEQKTVLVQTRSEKPKSWFNGTGNHFDETHKPSS 323
             P PL+ P  +       +        +KT  +    E  K+    T + F E+H+ +S
Sbjct: 284 DHPMPLSQPVKRVKEDIFER--------KKTTTISPNGETIKTAR--TRSAFGESHQKTS 333

Query: 324 YAHRGNHEVHKEVAKNGK--VHRKQFSSDSDEVGMR---VITIAGENKGAFMEVIRSPLK 378
            +      + KE+ ++    VH          +G +   V+T+AGEN+GA M V     +
Sbjct: 334 MSSGEKVPLQKEIREDISKFVHNLGMEHMEHPIGEKPVSVVTLAGENRGATMYVGSESSR 393

Query: 379 HTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNS 438
              DG+ H +H     K   D     S  +++D E S + ++ K       P   A++NS
Sbjct: 394 K--DGSVH-IHR--GYKINPD----ESSEATTDGEGSSRGRSSKDLLTKEDPARKAYINS 444

Query: 439 NVQGVNNSIVFHSSCSHHDPGVHLSLS 465
           N Q VNNSI+F +S S   PGV LSLS
Sbjct: 445 NTQSVNNSILFETSVSERSPGVQLSLS 471


>gi|449432018|ref|XP_004133797.1| PREDICTED: uncharacterized protein LOC101205942 [Cucumis sativus]
          Length = 295

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 91/291 (31%), Positives = 122/291 (41%), Gaps = 66/291 (22%)

Query: 198 VPTSPPKPAPSTFSVP----TSPAKPKPAVTAAISTTTSPPRVPSPAPVGPRTLL---SD 250
            P SP + +P   S P    TSP   +   + A   T SPP       +G + L     +
Sbjct: 46  APASPRRESPRPLSSPSKKATSPFASRVDSSPAAKATRSPP-----DSIGDKYLERRNGE 100

Query: 251 GTPPQSPYKK--ATAPPPSPLTLPPPQFTSKAET--QPTIPAETEQKTVLVQTRSEKPKS 306
            TPP +P K   A   P SPL LP  Q  +   T  QP +  E E K ++    +++   
Sbjct: 101 TTPPLTPAKSRPAKTSPLSPLALPRSQVITGNGTTAQPRVQPEVETKGIVYNKAADE--- 157

Query: 307 WFNGTGNHFDETHKPSSYAHRGNHEVHKEVAKNGKVHRKQFSSDSDEVGMRVITIAGENK 366
                        KPS    +  H   K V      H+KQ           V+ + G N 
Sbjct: 158 -------------KPSKSNRQSEHGSGKSV------HQKQQKPG-------VMKLKGHNV 191

Query: 367 GAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGT--GWHSFSSSSDEEKSKKDKNYKAK 424
           GA MEV +S   +   G   TL  K   +T+ DG   G+       D++  +K K     
Sbjct: 192 GAVMEVNKSSAGYRLGG--ETLKKK---ETEDDGDVDGY----GHEDKKTGRKKK----- 237

Query: 425 AAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLSRKPSGNGFHV 475
                PP+ AFMNSN Q VNNS++F SSC H DPG+HLS      G G  V
Sbjct: 238 -----PPISAFMNSNFQSVNNSLLFDSSCQHRDPGLHLSFPNAADGGGAVV 283


>gi|116788029|gb|ABK24731.1| unknown [Picea sitchensis]
          Length = 358

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/133 (36%), Positives = 73/133 (54%), Gaps = 24/133 (18%)

Query: 341 KVHRKQ-FSSDSD----EVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLK 395
           ++H +Q   S+ D    ++G+RVIT+AGEN+GA M++     K+         H  G   
Sbjct: 217 ELHGQQKLGSNGDQKMQDLGVRVITLAGENRGAIMDMGSESRKNDVLDVQRVNHKPGG-- 274

Query: 396 TQSDGTGWHSFSSSSDEEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSH 455
                       ++ +EE    D N++ KAA    P+ AF+NSN+Q  NNSI+++S+C H
Sbjct: 275 ------------ANKNEESP--DGNHRQKAA---GPVNAFVNSNIQAANNSILYNSTCPH 317

Query: 456 HDPGVHLSLSRKP 468
            DPGV L+LS  P
Sbjct: 318 RDPGVRLALSNNP 330


>gi|449432016|ref|XP_004133796.1| PREDICTED: uncharacterized protein LOC101205715 [Cucumis sativus]
 gi|449477975|ref|XP_004155181.1| PREDICTED: uncharacterized LOC101205715 [Cucumis sativus]
          Length = 308

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 63/120 (52%), Gaps = 25/120 (20%)

Query: 358 VITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEEKSKK 417
           VI I GEN GA M + +S      DG+   +  K ++            S  ++E+ +K 
Sbjct: 211 VINIKGENVGAVMHITQSS-----DGS-EVIKKKPTVGQ----------SKENEEKTNKS 254

Query: 418 DKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLSRKPSGNGFHVKD 477
           + NY         P  +FMNSN QGVNNSI+++SS SH DPG+HL+   K S +G  + D
Sbjct: 255 NSNY---------PGKSFMNSNFQGVNNSILYNSSLSHRDPGLHLAFGSKKSKHGDSIHD 305


>gi|255538680|ref|XP_002510405.1| oxidoreductase, putative [Ricinus communis]
 gi|223551106|gb|EEF52592.1| oxidoreductase, putative [Ricinus communis]
          Length = 551

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 46/114 (40%), Positives = 64/114 (56%), Gaps = 11/114 (9%)

Query: 352 DEVGMRVITIAGENKGAFMEVIRSPLKHTFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSD 411
           DE  + V+TIAGEN GA M V   P +   DG+ H +H +G      D TG     +++D
Sbjct: 368 DEKPVCVMTIAGENTGASMHVGAEPARK--DGSIH-IH-RGYKLNPDDSTG-----ATTD 418

Query: 412 EEKSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLS 465
            E S+  ++      +  P   A++NSN Q VNNSI+  SS +  DPGVHL+LS
Sbjct: 419 GEGSRDGRSKNPTKEV--PVTKAYLNSNTQSVNNSIILDSSVNEQDPGVHLALS 470


>gi|242052375|ref|XP_002455333.1| hypothetical protein SORBIDRAFT_03g008720 [Sorghum bicolor]
 gi|241927308|gb|EES00453.1| hypothetical protein SORBIDRAFT_03g008720 [Sorghum bicolor]
          Length = 386

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 58/114 (50%), Gaps = 19/114 (16%)

Query: 355 GMRVITIAGENKGAFMEVIRSPLKH-TFDGTPHTLHNKGSLKTQSDGTGWHSFSSSSDEE 413
           G  +IT+AGENKGA M+V  S +     D        +G    + DG    S   SS +E
Sbjct: 269 GTTIITLAGENKGASMKVDSSAVADGRGDAAAGGKERRGH---KLDG----SVVGSSGKE 321

Query: 414 KSKKDKNYKAKAAMLPPPMGAFMNSNVQGVNNSIVFHSSCSHHDPGVHLSLSRK 467
           +    K            + AF+NSNVQ +NNS++  SSC+  DPGVHL LS K
Sbjct: 322 QGAGGKG-----------LTAFVNSNVQVINNSLMLQSSCNGGDPGVHLKLSTK 364


>gi|195342490|ref|XP_002037833.1| GM18084 [Drosophila sechellia]
 gi|194132683|gb|EDW54251.1| GM18084 [Drosophila sechellia]
          Length = 14551

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 73/270 (27%), Positives = 98/270 (36%), Gaps = 50/270 (18%)

Query: 14    LPSIGRPAPPVT-----------PTPEPPAPQPRPPMIRAPFRPMGQPPL-------ESI 55
             +PS  RP PP +           PTP P       P +  P  P  Q P+          
Sbjct: 11206 IPSAHRPVPPTSQRPVFVTSPGNPTPTPQPGVINIPSVSQPGYPTPQSPIYDANYPTTQS 11265

Query: 56    PAPAAVGVSSVPTSPIRRTSVPSSPVPRESPSPPSPPATTSPPKPQV----SSPSPTKLP 111
             P P   GV ++P+ P      P+ P P    + P+PP+   P +P V    S+P P   P
Sbjct: 11266 PIPQQPGVVNIPSLP-----SPAYPAPNPPVNYPTPPSPQKPVQPGVINIPSAPHPATPP 11320

Query: 112   STSSMPTSPPKPAVTTSPPKPLATTFSV--PTSPAK-----------------PAPTTFP 152
                  P   P P   ++ PKP A  + V  PT P+                  P     P
Sbjct: 11321 QHP--PVFIPSPESPSTAPKPGAPVYDVNYPTPPSAIPHQPGVVNIPSVPPPPPPVMQRP 11378

Query: 153   VITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTSPPKPAPSTFSV 212
             V   SP  P P   PG+   P    P V P    SP    P  + V   PP   P   ++
Sbjct: 11379 VFVPSPAHPTPGPQPGVVNIPSVVQP-VHPTY-QSPVVERPAIYDVYYPPPPSRPGVINI 11436

Query: 213   PTSPAKPKPAVTAAISTTTSPPRVPSPAPV 242
             P+ P    P     I   +    +P+P PV
Sbjct: 11437 PSPPRPVYPVPQQPIYVPSPVLHIPAPRPV 11466



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 60/215 (27%), Positives = 81/215 (37%), Gaps = 37/215 (17%)

Query: 19    RPAPPVTP-------TPEPPAPQPRPPMIRAPFRPMGQPPLESIPAPAAVGVSSVPTSPI 71
             +PA P  P        P+P  P P+PP+    +      P+   P    +  +  P  P 
Sbjct: 11160 QPANPEKPGVVNIPSVPQPVYPSPQPPVYHVNY---ATTPVSQQPGVVNIPSAHRPVPPT 11216

Query: 72    RRTSV----PSSPVPRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTT 127
              +  V    P +P P   P   + P+ + P  P   SP       T+  P  P +P V  
Sbjct: 11217 SQRPVFVTSPGNPTPTPQPGVINIPSVSQPGYPTPQSPIYDANYPTTQSPI-PQQPGVVN 11275

Query: 128   SPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTS 187
              P  P         SPA PAP   PV   +PP P     PG+   P  P P        +
Sbjct: 11276 IPSLP---------SPAYPAPNP-PVNYPTPPSPQKPVQPGVINIPSAPHP-------AT 11318

Query: 188   PPKPAPTTFRVPTSP---PKPAPSTFSV--PTSPA 217
             PP+  P     P SP   PKP    + V  PT P+
Sbjct: 11319 PPQHPPVFIPSPESPSTAPKPGAPVYDVNYPTPPS 11353



 Score = 48.5 bits (114), Expect = 0.007,   Method: Composition-based stats.
 Identities = 63/232 (27%), Positives = 79/232 (34%), Gaps = 32/232 (13%)

Query: 77    PSSPVPRESPSPPSPPATTSPPKPQVSSPSP----TKLPSTSSMPTSPPKPAVTTSPPKP 132
             PS     +    P P A     +  V + SP    T     S  P +P KP V   P  P
Sbjct: 11117 PSLACVNQKCRDPCPGACNHLAQCHVINHSPHHHNTMFNYPSPQPANPEKPGVVNIPSVP 11176

Query: 133   LATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPA 192
                    P  P+   P       T+P    P      +   P P  +  PV  TSP  P 
Sbjct: 11177 Q------PVYPSPQPPVYHVNYATTPVSQQPGVVNIPSAHRPVPPTSQRPVFVTSPGNPT 11230

Query: 193   PTTFRVPTSPPKPAPSTFSVPTSPAKPKPAVTAAISTTTSP----------PRVPSPAPV 242
             PT       P      + S P  P    P   A   TT SP          P +PSPA  
Sbjct: 11231 PT-----PQPGVINIPSVSQPGYPTPQSPIYDANYPTTQSPIPQQPGVVNIPSLPSPAYP 11285

Query: 243   GPRTLLSDGTP--PQSPYKKA-----TAPPPSPLTLPPPQFTSKAETQPTIP 287
              P   ++  TP  PQ P +       +AP P+     PP F    E+  T P
Sbjct: 11286 APNPPVNYPTPPSPQKPVQPGVINIPSAPHPATPPQHPPVFIPSPESPSTAP 11337


>gi|449514829|ref|XP_004164492.1| PREDICTED: uncharacterized LOC101216647 [Cucumis sativus]
          Length = 334

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 55/123 (44%), Gaps = 23/123 (18%)

Query: 278 SKAETQP-TIPAETEQKTVLVQTRSEKP----KSWFNGTGNH------FDETHKPSSYAH 326
           S AET+  TIP E E+KTV+ Q   +KP    +       N+      FD+  K  S   
Sbjct: 191 SVAETKDETIPQEVERKTVVFQKVMDKPSQAEEHHLQNITNYRTHTSEFDKNGKQESNKG 250

Query: 327 RGNHEVHKE---------VAKNGKVHRKQFSSDSDEVGMRVITIAGENKGAFMEVIRSPL 377
                  KE         +  N    R  F  D++    RVIT+AGENKGAFME+  S  
Sbjct: 251 DDGDRDEKETSSKKKGATIGGNNNYKRTAFDHDNN---TRVITMAGENKGAFMEINLSSE 307

Query: 378 KHT 380
           K T
Sbjct: 308 KTT 310


>gi|145482197|ref|XP_001427121.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394200|emb|CAK59723.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1734

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 51/211 (24%), Positives = 64/211 (30%), Gaps = 26/211 (12%)

Query: 82   PRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPT 141
            P++   P   P    PPK Q   P P K       P  P  P      P+P         
Sbjct: 1026 PKQPEPPKEQPKQPEPPKEQPKQPEPLK-----EQPKQPEPPKEQPKQPEP------PKE 1074

Query: 142  SPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTS 201
             P +P P         PPK  P         P +P P         PPK  P     P  
Sbjct: 1075 QPKQPEPPKEQPKQPEPPKEQPKQPEPPKEQPKQPEPPKEQPKQPEPPKEQPKQPEPPKE 1134

Query: 202  PPKPAPSTFSVPTSPAKPKPAVTAAISTTTSPPRVPSPAPVGPRTLLSDGTPPQSPYKKA 261
             PK        P  P  PK             P+ P P    P+       PP+   K+ 
Sbjct: 1135 QPKQPEPPKEQPKQPEPPK-----------EQPKQPEP----PKEQPKQPEPPKEQPKQP 1179

Query: 262  TAPPPSPLTLPPPQFTSKAETQPTIPAETEQ 292
              P   P    PP+   + + QP  P   +Q
Sbjct: 1180 EPPKEQPKQQEPPKQPEQPKEQPKQPESPKQ 1210


>gi|351714165|gb|EHB17084.1| Proteoglycan 4 [Heterocephalus glaber]
          Length = 1374

 Score = 43.5 bits (101), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 64/145 (44%), Positives = 84/145 (57%), Gaps = 15/145 (10%)

Query: 86  PSPPSPPATTSPPKPQV-SSPSPTKLPSTSSMPTSPPKPA-----VTTSPPKPLATTFSV 139
           P+ P  PA T  P P     P+PTK P    +PT+P +PA     V T+P +P  T   V
Sbjct: 328 PTAPKEPAPTKEPVPTTPKEPAPTKEP----VPTAPKEPAPTKEPVPTTPKEPAPTKEPV 383

Query: 140 PTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSPPKPAPTVFPV-ITTSPPKPAPTTFRV 198
           PT+P +PAPT  PV TT P +PAPT  PG T +P +PAPT     + T+  +PAPT    
Sbjct: 384 PTAPKEPAPTKEPVPTT-PKEPAPTKEPGPT-TPKEPAPTTTKEPVPTAQKEPAPTKEPG 441

Query: 199 PTSPPKPAPSTFS--VPTSPAKPKP 221
           PT+P +PAP+T    VPT+  +P P
Sbjct: 442 PTTPKEPAPTTTKEPVPTALKEPAP 466



 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 67/160 (41%), Positives = 86/160 (53%), Gaps = 29/160 (18%)

Query: 75  SVPSSPVPRESPSPPSP--PATTSPPKPQV-SSPSPTKLPSTSSMPTSPPKPAVTTSPPK 131
           + P  P P + P P +P  PA T  P P     P+PTK P    +PT+P +PA T  P  
Sbjct: 343 TTPKEPAPTKEPVPTAPKEPAPTKEPVPTTPKEPAPTKEP----VPTAPKEPAPTKEP-- 396

Query: 132 PLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFPGITRSP-----PKPAPTVFPVITT 186
                  VPT+P +PAPT  P  TT P +PAPTT    T+ P      +PAPT  P  TT
Sbjct: 397 -------VPTTPKEPAPTKEPGPTT-PKEPAPTT----TKEPVPTAQKEPAPTKEPGPTT 444

Query: 187 SPPKPAPTTFR--VPTSPPKPAPSTFSVPTSPAKPKPAVT 224
            P +PAPTT +  VPT+  +PAP+    PT+P +P P  T
Sbjct: 445 -PKEPAPTTTKEPVPTALKEPAPTKEPGPTTPKEPAPTTT 483


>gi|390334661|ref|XP_781315.3| PREDICTED: protein SpAN-like [Strongylocentrotus purpuratus]
          Length = 931

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 69/190 (36%), Positives = 93/190 (48%), Gaps = 22/190 (11%)

Query: 61  VGVSSVPTSPIRRTSVPSSPVPRESPSPPSPPATT-SPPKPQVSSPSPTKLPSTSSMPTS 119
           V  +  PT+P+  T  P++PV   + +P +P  TT +P  P V++ +PT    T+  PT+
Sbjct: 524 VVTTEAPTTPVVTTEAPTTPVVT-TKAPTTPVVTTEAPTTPVVTTKAPTTPVVTTEAPTT 582

Query: 120 PPKPAVTTSPPKPLATTFSVPTSPA--KPAPTTFPVITTSPPKPAPTTFPGITRSPPKPA 177
           P    VTT  P     T   PT+P     APTT PV+TT     APTT    T++P  P 
Sbjct: 583 P---VVTTKAPTTPVVTTKAPTTPVVTTEAPTT-PVVTTE----APTTPVVTTKAPTTPV 634

Query: 178 PTV----FPVITTSPPKPAPTTFRVPTSPPKPAPSTFSVPTSPAKPKPAVTAAISTTTSP 233
            T      PV+TT     APTT  V T  P  +  T   PT+P     A T  + TT +P
Sbjct: 635 VTTEAPTTPVVTTE----APTTSVVTTEAPTTSVVTTKAPTTPVVTTEAQTTPVVTTKAP 690

Query: 234 --PRVPSPAP 241
             P V + AP
Sbjct: 691 TTPVVTTKAP 700



 Score = 38.9 bits (89), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 103/217 (47%), Gaps = 28/217 (12%)

Query: 40  IRAPF---RPMGQPPLESIPAPAAVGVSS-VPTSPIRRTSVPSSPVPRESPSPPSPPATT 95
            RA F   +P+ + P+ +  AP   GV++  PT+P+  T  P++PV   + +P +P  TT
Sbjct: 449 FRAHFTIDQPLTEEPVVTTRAPTTPGVTTEAPTTPVVTTKAPTTPVVT-TKAPTTPVVTT 507

Query: 96  -SPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPTSPA--KPAPTTFP 152
            +P  P V++ + T    T+  PT+   P VTT  P     T   PT+P     APTT P
Sbjct: 508 KAPTTPVVTTKALTTPVVTTEAPTT---PVVTTEAPTTPVVTTKAPTTPVVTTEAPTT-P 563

Query: 153 VITTSPP------KPAPTTFPGITRSPPKPAPT----VFPVITTSPP------KPAPTTF 196
           V+TT  P        APTT    T++P  P  T      PV+TT  P        APTT 
Sbjct: 564 VVTTKAPTTPVVTTEAPTTPVVTTKAPTTPVVTTKAPTTPVVTTEAPTTPVVTTEAPTTP 623

Query: 197 RVPTSPPKPAPSTFSVPTSPAKPKPAVTAAISTTTSP 233
            V T  P     T   PT+P     A T ++ TT +P
Sbjct: 624 VVTTKAPTTPVVTTEAPTTPVVTTEAPTTSVVTTEAP 660


>gi|332230676|ref|XP_003264519.1| PREDICTED: LOW QUALITY PROTEIN: proteoglycan 4 [Nomascus
           leucogenys]
          Length = 1335

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 62/143 (43%), Positives = 82/143 (57%), Gaps = 24/143 (16%)

Query: 79  SPVPRESPSPPSPPATTSPPKPQVSSPSPTKLPSTSSMPTSPPKPAVTT-------SPPK 131
            PVP  + S P+ P   +P  P+  SP+ TK P+    PT+P +PA TT       +P +
Sbjct: 364 EPVPTTTKSAPTTPKEPAPTTPKEPSPTTTKEPA----PTTPKEPAPTTPKEPSPTTPKE 419

Query: 132 PL-ATTFSVPTSPAKPAPTT--FPVITTSPPKPAPTTFPGITRSPPKPAPTV-FPVITTS 187
           P   TT S PT+P +PAPTT   P  TT P KPAPTT       P +PAPT      +T+
Sbjct: 420 PAPTTTKSAPTTPKEPAPTTPKEPAPTT-PKKPAPTT-------PKEPAPTTPKEPASTT 471

Query: 188 PPKPAPTTFR-VPTSPPKPAPST 209
           P +PAPTT +  PT+P +PAP+T
Sbjct: 472 PKEPAPTTIKSAPTTPKEPAPTT 494


>gi|162457641|ref|YP_001620008.1| cellulose synthase 1 operon protein C [Sorangium cellulosum So
           ce56]
 gi|161168223|emb|CAN99528.1| Cellulose synthase 1 operon protein C precursor [Sorangium
           cellulosum So ce56]
          Length = 2239

 Score = 38.1 bits (87), Expect = 9.4,   Method: Composition-based stats.
 Identities = 60/249 (24%), Positives = 82/249 (32%), Gaps = 11/249 (4%)

Query: 50  PPLESIPAPAAVGVSSVPTSPIRRTS--VPSSPVPRESPSPPSPPATTSPPKPQVSSPSP 107
           P   S   P A G    P SP R  +   P S      P+P S P   +       + S 
Sbjct: 237 PSGSSPARPPARGPGQRPPSPNRAATQPAPGSAASAGKPAPGSQPGGAAQAIAAFDA-ST 295

Query: 108 TKLPSTSSMPTSPPKPAVTTSPPKPLATTFSVPTSPAKPAPTTFPVITTSPPKPAPTTFP 167
           T +P  S  PT+ P P        PL  +    +SPA  +  +         +PAP    
Sbjct: 296 TPMP-ISVEPTATPLPISVEPTATPLPISVEPSSSPAPISVESIEPALAQSAEPAPVQSA 354

Query: 168 GITRSPPKPAPTVFPVITTSPPKPAPTTFRVPTSPPKPAPSTFSVPTSPAKPKPAVTAAI 227
                             T    P P     P++   P P     P++   P P      
Sbjct: 355 APAPVQSAAPAPAPSATRT----PLPGGAHAPSATRTPLPGGAHAPSATRTPLPGGAPTP 410

Query: 228 STTTSPPRVPSPAPVGPRTLLSDGTPPQSPYKKATAPPPSPLTLPPPQFTSKAETQPTIP 287
               +P    +P P   RT +   TP  +    AT   P+P T  P Q  + A T    P
Sbjct: 411 GAPRAPLPGGAPTPGATRTPIPGVTPSAAHAPGAT---PAPSTAAPAQSAAHAATVLHTP 467

Query: 288 AETEQKTVL 296
           +     TVL
Sbjct: 468 SAAHAATVL 476


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.308    0.127    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,806,622,620
Number of Sequences: 23463169
Number of extensions: 573886114
Number of successful extensions: 11830621
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 28163
Number of HSP's successfully gapped in prelim test: 157130
Number of HSP's that attempted gapping in prelim test: 5313494
Number of HSP's gapped (non-prelim): 2241157
length of query: 484
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 337
effective length of database: 8,910,109,524
effective search space: 3002706909588
effective search space used: 3002706909588
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 79 (35.0 bits)