BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy5195
(334 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|328700243|ref|XP_003241193.1| PREDICTED: hypothetical protein LOC100574750 isoform 1
[Acyrthosiphon pisum]
gi|328700245|ref|XP_003241194.1| PREDICTED: hypothetical protein LOC100574750 isoform 2
[Acyrthosiphon pisum]
Length = 376
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 174/369 (47%), Positives = 216/369 (58%), Gaps = 47/369 (12%)
Query: 6 SKVSTSQNKSLHLTGIGLAVQAFYINLTLDTVMGGEYAELQPVGPPLINTPGVWGVNGAY 65
+KVST+ NKSLHLTGIGLAVQAFYINLTLDTVMGG++ +L+P GPP+IN GVW V+
Sbjct: 2 TKVSTASNKSLHLTGIGLAVQAFYINLTLDTVMGGDFVQLKPAGPPIINPTGVWSVDQVP 61
Query: 66 TGAYNTGVNGLSNNGIQALNTYGDRGYGAGGYTDANGFYHGNLGTKNGYDVGHAYGGGKD 125
G Y T + L+ NG+QAL +G A GY D NGFY N G NGYDVG+ YGG D
Sbjct: 62 QG-YATAGSNLALNGLQALQAFGGNKIDANGYRDQNGFYRDNNGALNGYDVGNTYGGVND 120
Query: 126 IGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSKDHR 185
G+T+V+GG+YG + GRN+GHQVAGFSTSY K+ESGNKATYYDDGLGHGG IQYG+KD R
Sbjct: 121 RGVTSVEGGAYGGINGRNKGHQVAGFSTSYQKNESGNKATYYDDGLGHGGVIQYGAKDQR 180
Query: 186 SRDGVGNAFGGAYHDAHLKTNAHGNKQNFGGGQGFGTKTGYVNNVGDTKLYGGGVGQG-- 243
RDG G AFGG YHD+ +KTNA G + FG G G+ T G + + +LYG G
Sbjct: 181 FRDGEGKAFGGGYHDSSVKTNAVGQQGQFGNGDGYHTAGGQTGDAVNNQLYGTRSDSGPG 240
Query: 244 -YTGT---------------FNNGFNPALPVLPPAPVPYIPPPPVI-------TPPVLPQ 280
GT + P P Y+PP + P +PQ
Sbjct: 241 IVAGTPFLVSTTDPLPPVVHLPPQYVQQSPQYIPQAPQYVPPAQYVPQAQYVPQAPYIPQ 300
Query: 281 SPVISG---------VFSPN-----PVITKN----YQPYPSH--PVGLVTRS-STKYQIP 319
+P ++ +F N P+I + PY + L RS S +Y +P
Sbjct: 301 APFVAQAPFAAQAPTLFQNNIPQLPPIIQQQPGSILPPYNVQLPALSLSARSPSGQYVVP 360
Query: 320 APTKLFVDK 328
P KL+VDK
Sbjct: 361 MPPKLYVDK 369
>gi|347964393|ref|XP_559373.4| AGAP000734-PA [Anopheles gambiae str. PEST]
gi|333467510|gb|EAL41122.4| AGAP000734-PA [Anopheles gambiae str. PEST]
Length = 376
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 94/234 (40%), Gaps = 24/234 (10%)
Query: 104 YHGNLGTKNGYDVGHAYGGGKDIGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNK 163
YH +G+ V A G + ++ G ++ + NR H +GFS SY+K ESG+K
Sbjct: 100 YHQLDNQLDGHRVVDASKGSTVDKVKHLAGENFEADKSHNRKHIKSGFSNSYHKDESGSK 159
Query: 164 ATYYDDGLGHGGSIQYGSKDHRSRDGVGNAFGGAYHDAHLKTNAHGN---KQNFGGGQ-- 218
++YY+D HGG Y ++ + D + + +L+ + + GG Q
Sbjct: 160 SSYYEDSDDHGGKQVYDNRHNLRNDYADKLYSQDRRNDYLRDHYDDRASGSELLGGHQYR 219
Query: 219 ---------GFGTKTGYVNNVGDTKLYGGGVGQGYTGTFNNGFNPALPVLPPAPVPYIPP 269
+G + GY +N + YG GY + N L Y P
Sbjct: 220 RVGALDRGNAYGHRDGYAHNDHSDQHYGTQSYGGYD-HYQPAANYHYDDLYRRRSYYDPR 278
Query: 270 PPVITPPVLPQSPVISG---VFSPNPVITKNYQPYPSHPV----GLVTRSSTKY 316
P T P+ P I ++P P + PYP+ + +++R + Y
Sbjct: 279 PAFATAPLAPAPITIYEDPRDYAPYPAALAS--PYPAREIRDGSAVLSRDESSY 330
>gi|157126299|ref|XP_001654582.1| hypothetical protein AaeL_AAEL002072 [Aedes aegypti]
gi|108882554|gb|EAT46779.1| AAEL002072-PA [Aedes aegypti]
Length = 319
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 88/214 (41%), Gaps = 16/214 (7%)
Query: 104 YHGNLGTKNGYDVGHAYGGGKDIGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNK 163
YH NG+ + A GG + ++ G ++ + NR +GF+ SY+K ESG+K
Sbjct: 80 YHALDNAVNGHRIADASKGGSLDKVNHLAGENFEADKSHNRKQVKSGFTNSYHKDESGSK 139
Query: 164 ATYYDDGLGHGGSIQYGSKDHRSRDGVGNAFGGAYHDAHLKTNAHGNK--QNFGGGQGFG 221
+++Y+D GG + +R + N F ++ L+ + ++ FGG G
Sbjct: 140 SSFYEDSDDRGGKQVFD-----NRKNLRNNFNDHLYNKELRNDQLRDRYDDRFGGHDLRG 194
Query: 222 TKTGY---VNNVGDTKLYGGGVGQGYTGTFNNGFNPALPVLPPAPVPYIPPPPVITPPVL 278
+ + + G+ Y G N+ PVLPP VP +
Sbjct: 195 VRDFHHQAAADRGNLHDYRDGFRNDRDHHVNDYAQEFYPVLPPL-VPVRNYDELFQRRTF 253
Query: 279 --PQSPVISGVFSPNPVIT--KNYQ-PYPSHPVG 307
P+ +G SP V + Y YP+H VG
Sbjct: 254 HEPRPMFKTGPSSPITVYEDPREYMYDYPAHSVG 287
>gi|189234118|ref|XP_001811130.1| PREDICTED: hypothetical protein [Tribolium castaneum]
gi|270002525|gb|EEZ98972.1| hypothetical protein TcasGA2_TC004827 [Tribolium castaneum]
Length = 344
Score = 44.7 bits (104), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 31/127 (24%), Positives = 52/127 (40%), Gaps = 11/127 (8%)
Query: 53 INTPGVWGVNGAYTGAYNTGVNGLSNNGIQALNTYGDRGYGAGGYTDANGFYHGNLGTKN 112
+N +G +N G G + N G+ Y + DA G K
Sbjct: 135 LNDEAFEKASGKKGEEFNAGKEGFTKGVAAVKNNKGESAY----FNDAKG-------AKK 183
Query: 113 GYDVGHAYGGGKDIGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNKATYYDDGLG 172
+ G AY GG+ GG + + +G +GH + GF +S++K E+G +YD+
Sbjct: 184 MAEDGKAYYGGQHFDKEGKHGGQFNEKKGHKKGHVIKGFKSSHHKDETGKTEEFYDEEND 243
Query: 173 HGGSIQY 179
G + +
Sbjct: 244 EGAKLNF 250
>gi|312371829|gb|EFR19917.1| hypothetical protein AND_21590 [Anopheles darlingi]
Length = 303
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 44/193 (22%), Positives = 83/193 (43%), Gaps = 32/193 (16%)
Query: 128 LTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSKDHRSR 187
+ ++ G ++ + NR + +GFS +Y+K E+G+K++YY+D GG Y ++ +
Sbjct: 71 MKHLSGENFETDKSHNRKYIKSGFSNTYHKDENGSKSSYYEDSDDRGGKQVYDNRHNFRN 130
Query: 188 DGVGNAFGGAYHDAHLKTNAHGNKQ---NFGG------------GQGFGTKTGYVNNVGD 232
D + + + +L+ N + NK+ +F G G +G + GY+ GD
Sbjct: 131 DHLDKLYNQEHRKDYLRDN-YDNKRAESDFQGTHNIHRAAALNRGNTYGYRDGYMQ--GD 187
Query: 233 TKLYGGGVGQGYTGTFNNGFNPALPVLP-------PAPVPYIPPPPVITPPVLPQSPVIS 285
G + Y GT++ + P P P P P P+ + P+ +
Sbjct: 188 HNDQRRGTTRSY-GTYD--YYPHATYQPYQRQSYYYEPRPSFPLTPLQSVPI----TIYD 240
Query: 286 GVFSPNPVITKNY 298
+ P + NY
Sbjct: 241 DLREFKPYVASNY 253
>gi|170062847|ref|XP_001866847.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880612|gb|EDS43995.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 318
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 42/79 (53%)
Query: 104 YHGNLGTKNGYDVGHAYGGGKDIGLTNVQGGSYGDVRGRNRGHQVAGFSTSYNKHESGNK 163
YH NG+ V + GG + ++ G ++ + NR +GFS SY+K E+G+K
Sbjct: 83 YHALDNVVNGHRVADSSKGGSVDKVKHLAGENFEADKSHNRKQIKSGFSNSYHKDENGSK 142
Query: 164 ATYYDDGLGHGGSIQYGSK 182
++YY+D GG Y ++
Sbjct: 143 SSYYEDSDDRGGKQVYDNR 161
>gi|328708619|ref|XP_003243748.1| PREDICTED: hypothetical protein LOC100568636 [Acyrthosiphon pisum]
Length = 421
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 55/127 (43%), Gaps = 2/127 (1%)
Query: 76 LSNNGIQALNTYGDRGY-GAGGYTDANGFYHGNLGTKNGYDVGHAYGGGKDIGLTNVQGG 134
L N+G + + + D+G+ G G D Y G NG+ + G G V+G
Sbjct: 234 LGNSGYKNQHDF-DKGHAGKYGKDDLKSHYKKASGHSNGHKNEADHYGKHKSGTHGVKGH 292
Query: 135 SYGDVRGRNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSKDHRSRDGVGNAF 194
+G+ +GH+ GF Y+K E + +YDD HG +YG K G
Sbjct: 293 KFGETEHHKKGHKTTGFHNVYHKDEYNKEQKFYDDAHKHGKFDKYGGKHKDFSQKEGGHK 352
Query: 195 GGAYHDA 201
G +HD+
Sbjct: 353 HGTHHDS 359
>gi|357622822|gb|EHJ74201.1| hypothetical protein KGM_19919 [Danaus plexippus]
Length = 411
Score = 38.5 bits (88), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 24/42 (57%)
Query: 141 GRNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSK 182
G +GH GF SY+K ES NK+TY+DD G Y SK
Sbjct: 118 GHKKGHHKQGFQNSYHKDESSNKSTYFDDFNDEGDRSGYNSK 159
>gi|407642100|ref|YP_006805859.1| hypothetical protein O3I_004610 [Nocardia brasiliensis ATCC 700358]
gi|407304984|gb|AFT98884.1| hypothetical protein O3I_004610 [Nocardia brasiliensis ATCC 700358]
Length = 6502
Score = 37.4 bits (85), Expect = 9.1, Method: Composition-based stats.
Identities = 58/230 (25%), Positives = 83/230 (36%), Gaps = 25/230 (10%)
Query: 40 GEYAELQPVGPPLINTPGVWGVNGAYTGA-----YNTGVNGLSNN---------GIQALN 85
G + PVG P + + G G GA + +G NGL NN G
Sbjct: 323 GGGVRVDPVGVPRVGSLGTLTPTGVGAGANSPAVHPSGANGLGNNGSGANTGTTGGGTGA 382
Query: 86 TYGDRGYGAGGYTDA----NGFYHGNLGTKNGYDVGHAYGGGKDIGLTNVQGGSYGDVRG 141
G GAGG ++ +G + N GT +G + G G + G S G
Sbjct: 383 GRTGDGTGAGGNNNSGRAGDGSRNTNSGTGDGSSSRTSTGDGSSSRTSTGDGASQTSGDG 442
Query: 142 RNRGHQVAGFSTSYNKHESGNKATYYDDGLGHGGSIQYGSKDHRSRDGVGNAFGGAYHDA 201
NR + + T +H+ G + DG H G ++R+ DG G GG+ A
Sbjct: 443 SNRTNPSSA-ETGAQQHDRGGSESRASDGGRH-----EGGSENRASDG-GRHEGGSESRA 495
Query: 202 HLKTNAHGNKQNFGGGQGFGTKTGYVNNVGDTKLYGGGVGQGYTGTFNNG 251
G +N G +G + G T G G +G G N
Sbjct: 496 SDGGRHEGGSENRAGAAPEQAGSGDQSRSGATDQGGSGQDRGTAGETGNA 545
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.313 0.138 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,843,621,321
Number of Sequences: 23463169
Number of extensions: 382258741
Number of successful extensions: 1611276
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 575
Number of HSP's successfully gapped in prelim test: 11595
Number of HSP's that attempted gapping in prelim test: 1440184
Number of HSP's gapped (non-prelim): 98844
length of query: 334
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 191
effective length of database: 9,003,962,200
effective search space: 1719756780200
effective search space used: 1719756780200
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 77 (34.3 bits)