BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy5195
         (334 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|328700243|ref|XP_003241193.1| PREDICTED: hypothetical protein LOC100574750 isoform 1
           [Acyrthosiphon pisum]
 gi|328700245|ref|XP_003241194.1| PREDICTED: hypothetical protein LOC100574750 isoform 2
           [Acyrthosiphon pisum]
          Length = 376

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 174/369 (47%), Positives = 216/369 (58%), Gaps = 47/369 (12%)

Query: 6   SKVSTSQNKSLHLTGIGLAVQAFYINLTLDTVMGGEYAELQPVGPPLINTPGVWGVNGAY 65
           +KVST+ NKSLHLTGIGLAVQAFYINLTLDTVMGG++ +L+P GPP+IN  GVW V+   
Sbjct: 2   TKVSTASNKSLHLTGIGLAVQAFYINLTLDTVMGGDFVQLKPAGPPIINPTGVWSVDQVP 61

Query: 66  TGAYNTGVNGLSNNGIQALNTYGDRGYGAGGYTDANGFYHGNLGTKNGYDVGHAYGGGKD 125
            G Y T  + L+ NG+QAL  +G     A GY D NGFY  N G  NGYDVG+ YGG  D
Sbjct: 62  QG-YATAGSNLALNGLQALQAFGGNKIDANGYRDQNGFYRDNNGALNGYDVGNTYGGVND 120

Query: 126 IGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSKDHR 185
            G+T+V+GG+YG + GRN+GHQVAGFSTSY K+ESGNKATYYDDGLGHGG IQYG+KD R
Sbjct: 121 RGVTSVEGGAYGGINGRNKGHQVAGFSTSYQKNESGNKATYYDDGLGHGGVIQYGAKDQR 180

Query: 186 SRDGVGNAFGGAYHDAHLKTNAHGNKQNFGGGQGFGTKTGYVNNVGDTKLYGGGVGQG-- 243
            RDG G AFGG YHD+ +KTNA G +  FG G G+ T  G   +  + +LYG     G  
Sbjct: 181 FRDGEGKAFGGGYHDSSVKTNAVGQQGQFGNGDGYHTAGGQTGDAVNNQLYGTRSDSGPG 240

Query: 244 -YTGT---------------FNNGFNPALPVLPPAPVPYIPPPPVI-------TPPVLPQ 280
              GT                   +    P   P    Y+PP   +         P +PQ
Sbjct: 241 IVAGTPFLVSTTDPLPPVVHLPPQYVQQSPQYIPQAPQYVPPAQYVPQAQYVPQAPYIPQ 300

Query: 281 SPVISG---------VFSPN-----PVITKN----YQPYPSH--PVGLVTRS-STKYQIP 319
           +P ++          +F  N     P+I +       PY      + L  RS S +Y +P
Sbjct: 301 APFVAQAPFAAQAPTLFQNNIPQLPPIIQQQPGSILPPYNVQLPALSLSARSPSGQYVVP 360

Query: 320 APTKLFVDK 328
            P KL+VDK
Sbjct: 361 MPPKLYVDK 369


>gi|347964393|ref|XP_559373.4| AGAP000734-PA [Anopheles gambiae str. PEST]
 gi|333467510|gb|EAL41122.4| AGAP000734-PA [Anopheles gambiae str. PEST]
          Length = 376

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 94/234 (40%), Gaps = 24/234 (10%)

Query: 104 YHGNLGTKNGYDVGHAYGGGKDIGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNK 163
           YH      +G+ V  A  G     + ++ G ++   +  NR H  +GFS SY+K ESG+K
Sbjct: 100 YHQLDNQLDGHRVVDASKGSTVDKVKHLAGENFEADKSHNRKHIKSGFSNSYHKDESGSK 159

Query: 164 ATYYDDGLGHGGSIQYGSKDHRSRDGVGNAFGGAYHDAHLKTNAHGN---KQNFGGGQ-- 218
           ++YY+D   HGG   Y ++ +   D     +     + +L+ +        +  GG Q  
Sbjct: 160 SSYYEDSDDHGGKQVYDNRHNLRNDYADKLYSQDRRNDYLRDHYDDRASGSELLGGHQYR 219

Query: 219 ---------GFGTKTGYVNNVGDTKLYGGGVGQGYTGTFNNGFNPALPVLPPAPVPYIPP 269
                     +G + GY +N    + YG     GY   +    N     L      Y P 
Sbjct: 220 RVGALDRGNAYGHRDGYAHNDHSDQHYGTQSYGGYD-HYQPAANYHYDDLYRRRSYYDPR 278

Query: 270 PPVITPPVLPQSPVISG---VFSPNPVITKNYQPYPSHPV----GLVTRSSTKY 316
           P   T P+ P    I      ++P P    +  PYP+  +     +++R  + Y
Sbjct: 279 PAFATAPLAPAPITIYEDPRDYAPYPAALAS--PYPAREIRDGSAVLSRDESSY 330


>gi|157126299|ref|XP_001654582.1| hypothetical protein AaeL_AAEL002072 [Aedes aegypti]
 gi|108882554|gb|EAT46779.1| AAEL002072-PA [Aedes aegypti]
          Length = 319

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 88/214 (41%), Gaps = 16/214 (7%)

Query: 104 YHGNLGTKNGYDVGHAYGGGKDIGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNK 163
           YH      NG+ +  A  GG    + ++ G ++   +  NR    +GF+ SY+K ESG+K
Sbjct: 80  YHALDNAVNGHRIADASKGGSLDKVNHLAGENFEADKSHNRKQVKSGFTNSYHKDESGSK 139

Query: 164 ATYYDDGLGHGGSIQYGSKDHRSRDGVGNAFGGAYHDAHLKTNAHGNK--QNFGGGQGFG 221
           +++Y+D    GG   +      +R  + N F    ++  L+ +   ++    FGG    G
Sbjct: 140 SSFYEDSDDRGGKQVFD-----NRKNLRNNFNDHLYNKELRNDQLRDRYDDRFGGHDLRG 194

Query: 222 TKTGY---VNNVGDTKLYGGGVGQGYTGTFNNGFNPALPVLPPAPVPYIPPPPVITPPVL 278
            +  +     + G+   Y  G         N+      PVLPP  VP      +      
Sbjct: 195 VRDFHHQAAADRGNLHDYRDGFRNDRDHHVNDYAQEFYPVLPPL-VPVRNYDELFQRRTF 253

Query: 279 --PQSPVISGVFSPNPVIT--KNYQ-PYPSHPVG 307
             P+    +G  SP  V    + Y   YP+H VG
Sbjct: 254 HEPRPMFKTGPSSPITVYEDPREYMYDYPAHSVG 287


>gi|189234118|ref|XP_001811130.1| PREDICTED: hypothetical protein [Tribolium castaneum]
 gi|270002525|gb|EEZ98972.1| hypothetical protein TcasGA2_TC004827 [Tribolium castaneum]
          Length = 344

 Score = 44.7 bits (104), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 31/127 (24%), Positives = 52/127 (40%), Gaps = 11/127 (8%)

Query: 53  INTPGVWGVNGAYTGAYNTGVNGLSNNGIQALNTYGDRGYGAGGYTDANGFYHGNLGTKN 112
           +N       +G     +N G  G +       N  G+  Y    + DA G        K 
Sbjct: 135 LNDEAFEKASGKKGEEFNAGKEGFTKGVAAVKNNKGESAY----FNDAKG-------AKK 183

Query: 113 GYDVGHAYGGGKDIGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNKATYYDDGLG 172
             + G AY GG+        GG + + +G  +GH + GF +S++K E+G    +YD+   
Sbjct: 184 MAEDGKAYYGGQHFDKEGKHGGQFNEKKGHKKGHVIKGFKSSHHKDETGKTEEFYDEEND 243

Query: 173 HGGSIQY 179
            G  + +
Sbjct: 244 EGAKLNF 250


>gi|312371829|gb|EFR19917.1| hypothetical protein AND_21590 [Anopheles darlingi]
          Length = 303

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 44/193 (22%), Positives = 83/193 (43%), Gaps = 32/193 (16%)

Query: 128 LTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSKDHRSR 187
           + ++ G ++   +  NR +  +GFS +Y+K E+G+K++YY+D    GG   Y ++ +   
Sbjct: 71  MKHLSGENFETDKSHNRKYIKSGFSNTYHKDENGSKSSYYEDSDDRGGKQVYDNRHNFRN 130

Query: 188 DGVGNAFGGAYHDAHLKTNAHGNKQ---NFGG------------GQGFGTKTGYVNNVGD 232
           D +   +   +   +L+ N + NK+   +F G            G  +G + GY+   GD
Sbjct: 131 DHLDKLYNQEHRKDYLRDN-YDNKRAESDFQGTHNIHRAAALNRGNTYGYRDGYMQ--GD 187

Query: 233 TKLYGGGVGQGYTGTFNNGFNPALPVLP-------PAPVPYIPPPPVITPPVLPQSPVIS 285
                 G  + Y GT++  + P     P         P P  P  P+ + P+     +  
Sbjct: 188 HNDQRRGTTRSY-GTYD--YYPHATYQPYQRQSYYYEPRPSFPLTPLQSVPI----TIYD 240

Query: 286 GVFSPNPVITKNY 298
            +    P +  NY
Sbjct: 241 DLREFKPYVASNY 253


>gi|170062847|ref|XP_001866847.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167880612|gb|EDS43995.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 318

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 42/79 (53%)

Query: 104 YHGNLGTKNGYDVGHAYGGGKDIGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNK 163
           YH      NG+ V  +  GG    + ++ G ++   +  NR    +GFS SY+K E+G+K
Sbjct: 83  YHALDNVVNGHRVADSSKGGSVDKVKHLAGENFEADKSHNRKQIKSGFSNSYHKDENGSK 142

Query: 164 ATYYDDGLGHGGSIQYGSK 182
           ++YY+D    GG   Y ++
Sbjct: 143 SSYYEDSDDRGGKQVYDNR 161


>gi|328708619|ref|XP_003243748.1| PREDICTED: hypothetical protein LOC100568636 [Acyrthosiphon pisum]
          Length = 421

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 55/127 (43%), Gaps = 2/127 (1%)

Query: 76  LSNNGIQALNTYGDRGY-GAGGYTDANGFYHGNLGTKNGYDVGHAYGGGKDIGLTNVQGG 134
           L N+G +  + + D+G+ G  G  D    Y    G  NG+     + G    G   V+G 
Sbjct: 234 LGNSGYKNQHDF-DKGHAGKYGKDDLKSHYKKASGHSNGHKNEADHYGKHKSGTHGVKGH 292

Query: 135 SYGDVRGRNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSKDHRSRDGVGNAF 194
            +G+     +GH+  GF   Y+K E   +  +YDD   HG   +YG K        G   
Sbjct: 293 KFGETEHHKKGHKTTGFHNVYHKDEYNKEQKFYDDAHKHGKFDKYGGKHKDFSQKEGGHK 352

Query: 195 GGAYHDA 201
            G +HD+
Sbjct: 353 HGTHHDS 359


>gi|357622822|gb|EHJ74201.1| hypothetical protein KGM_19919 [Danaus plexippus]
          Length = 411

 Score = 38.5 bits (88), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 24/42 (57%)

Query: 141 GRNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSK 182
           G  +GH   GF  SY+K ES NK+TY+DD    G    Y SK
Sbjct: 118 GHKKGHHKQGFQNSYHKDESSNKSTYFDDFNDEGDRSGYNSK 159


>gi|407642100|ref|YP_006805859.1| hypothetical protein O3I_004610 [Nocardia brasiliensis ATCC 700358]
 gi|407304984|gb|AFT98884.1| hypothetical protein O3I_004610 [Nocardia brasiliensis ATCC 700358]
          Length = 6502

 Score = 37.4 bits (85), Expect = 9.1,   Method: Composition-based stats.
 Identities = 58/230 (25%), Positives = 83/230 (36%), Gaps = 25/230 (10%)

Query: 40  GEYAELQPVGPPLINTPGVWGVNGAYTGA-----YNTGVNGLSNN---------GIQALN 85
           G    + PVG P + + G     G   GA     + +G NGL NN         G     
Sbjct: 323 GGGVRVDPVGVPRVGSLGTLTPTGVGAGANSPAVHPSGANGLGNNGSGANTGTTGGGTGA 382

Query: 86  TYGDRGYGAGGYTDA----NGFYHGNLGTKNGYDVGHAYGGGKDIGLTNVQGGSYGDVRG 141
                G GAGG  ++    +G  + N GT +G     + G G     +   G S     G
Sbjct: 383 GRTGDGTGAGGNNNSGRAGDGSRNTNSGTGDGSSSRTSTGDGSSSRTSTGDGASQTSGDG 442

Query: 142 RNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSKDHRSRDGVGNAFGGAYHDA 201
            NR +  +   T   +H+ G   +   DG  H      G  ++R+ DG G   GG+   A
Sbjct: 443 SNRTNPSSA-ETGAQQHDRGGSESRASDGGRH-----EGGSENRASDG-GRHEGGSESRA 495

Query: 202 HLKTNAHGNKQNFGGGQGFGTKTGYVNNVGDTKLYGGGVGQGYTGTFNNG 251
                  G  +N  G       +G  +  G T   G G  +G  G   N 
Sbjct: 496 SDGGRHEGGSENRAGAAPEQAGSGDQSRSGATDQGGSGQDRGTAGETGNA 545


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.313    0.138    0.426 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,843,621,321
Number of Sequences: 23463169
Number of extensions: 382258741
Number of successful extensions: 1611276
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 575
Number of HSP's successfully gapped in prelim test: 11595
Number of HSP's that attempted gapping in prelim test: 1440184
Number of HSP's gapped (non-prelim): 98844
length of query: 334
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 191
effective length of database: 9,003,962,200
effective search space: 1719756780200
effective search space used: 1719756780200
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 77 (34.3 bits)