BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy8838
(319 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|289739557|gb|ADD18526.1| pyruvate dehydrogenase E1 beta subunit [Glossina morsitans
morsitans]
Length = 308
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 116/217 (53%), Gaps = 15/217 (6%)
Query: 69 GFSQCQLMAVSLCGHSLIMELVFLTISQSALGLTVLKNNPTTSAHHSMLLFSSMKCVGMD 128
GFS + A ++ G I+E + S A+ L+ ++ L KC
Sbjct: 98 GFSGVAIGA-AMAGLRPILEFMTFNFSMQAIDQASLRCCTLSTIRAIHLQTGRTKC---- 152
Query: 129 KSYSLPPLASTSTSVVLDNPVFNKNVKIDIPLMPFYIEKIDKLNE--RNNSIIEDPVIPS 186
+ + P A ++ P+ N K I + PF +D+LN+ ++ + P + +
Sbjct: 153 NNVTTPVYAEYPRNI----PIGKLNNKPQI-VDPFITNCLDELNDMLKDGNQKHAPAL-N 206
Query: 187 QAPEKHAIVMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEG 246
+ + A MI+ RRRKMKKHKL+KLR +MKFEWAK+RQRREM+KEKAFQ + T++KE
Sbjct: 207 RKDKVEAARMIVIRRRKMKKHKLRKLRRKMKFEWAKVRQRREMRKEKAFQATLITQIKEA 266
Query: 247 ERFSAEQYVENKLQLLHAEQLAESWKN--VPESFIKE 281
E F+A ++VE KLQ + L WK +P IKE
Sbjct: 267 EVFNASKFVEEKLQKANETPLQRYWKGRRLPAFIIKE 303
>gi|312379429|gb|EFR25704.1| hypothetical protein AND_08724 [Anopheles darlingi]
Length = 1821
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 65/141 (46%), Positives = 88/141 (62%), Gaps = 15/141 (10%)
Query: 156 IDIPLMPFYIEK-IDKLNERNNSIIED--PVIPSQ------APE----KHAIVMIIRRRR 202
IDIPL+ IE + + + S+I D P+IP+ P+ K A +I+ RRR
Sbjct: 1676 IDIPLVSRIIENPVSRPTAGSGSVIGDDLPIIPTLDLPTGIEPQGDNGKQAARLIVIRRR 1735
Query: 203 KMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLL 262
KM+KHKLKKLR RMKFEW K+RQRRE+KKEK FQ E+ ++KE E+FSAE YV +KL+
Sbjct: 1736 KMRKHKLKKLRKRMKFEWLKVRQRRELKKEKLFQAELLAQIKEAEKFSAEAYVADKLRRS 1795
Query: 263 HAEQLAESWKN--VPESFIKE 281
+ WK +P+ IK+
Sbjct: 1796 TEVPIPRFWKGKRLPQFLIKQ 1816
>gi|357613829|gb|EHJ68740.1| hypothetical protein KGM_12263 [Danaus plexippus]
Length = 213
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 62/94 (65%), Gaps = 2/94 (2%)
Query: 190 EKHAIVMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERF 249
EKHA+ +I+ RR+KMKKH+ KL RM+ WA++++ R +KKEK FQ E+ VKE F
Sbjct: 102 EKHAVRLIVIRRKKMKKHQRNKLWKRMRHRWARVKKNRRIKKEKIFQNELFAMVKEANEF 161
Query: 250 SAEQYVENKLQLLHAEQLAESW--KNVPESFIKE 281
SAEQYV +KL+ + L W K +PE I++
Sbjct: 162 SAEQYVTSKLERANHTPLPTRWRHKRLPEFIIRQ 195
>gi|170042172|ref|XP_001848810.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167865678|gb|EDS29061.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 248
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 121/238 (50%), Gaps = 32/238 (13%)
Query: 60 FNSQRGLMDGFSQCQLMAVSLCGHSLIMELVFLTISQSALGLTVLKNNPTTSAHHSMLLF 119
FN+ R D + +L +S SLI +++++++ TV+ T+ L
Sbjct: 22 FNTPRP-TDSTATARLAPIS----SLIRNFSSISLNETSRAATVIPLKATSCQTKPELPS 76
Query: 120 -SSMKCVGMDKSYSLPPLASTSTSVVLDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSI 178
S+ + V + + SL V NP+ + +D+PL+ IE N + I
Sbjct: 77 GSTSRLVFLPREISL------DIGQVRINPLLPRKEILDLPLISRIIE-----NPNQSRI 125
Query: 179 IEDPVIPSQAP-------------EKHAIVMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQ 225
E +P P A +I+ RRRKM+KHKL+KLR +MK+EWAK+RQ
Sbjct: 126 TEIGDLPRSLPVIDLPKSNTVDDGGIQAARLIVIRRRKMRKHKLRKLRKKMKYEWAKVRQ 185
Query: 226 RREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAEQLAESWKN--VPESFIKE 281
RRE++KEKAFQ ++ ++K+ ERFSAE YV +KL+ + WK +P+ IK+
Sbjct: 186 RRELRKEKAFQADLIAQIKDAERFSAEAYVADKLRQATDTPIPRFWKGKRLPQFLIKQ 243
>gi|94468528|gb|ABF18113.1| unknown [Aedes aegypti]
Length = 218
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/147 (41%), Positives = 91/147 (61%), Gaps = 12/147 (8%)
Query: 147 NPVFNKNVKIDIPLMPFYIE-----KIDKLNERNNSIIEDPVIPSQAPEK-----HAIVM 196
NP+ + IDIPL+ IE +I ++ +R ++ +P+ + + A +
Sbjct: 67 NPLIPRKEIIDIPLISRIIENPNQSRISEIGDRLPAVAPVVDLPTSSGKVDDNGIQAARL 126
Query: 197 IIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVE 256
I+ RRRKM+KHKLKKLR +MK+EWAK+RQRRE++KEKAFQ + ++KE ERFSAE YV
Sbjct: 127 IVIRRRKMRKHKLKKLRKKMKYEWAKVRQRRELRKEKAFQAGLINQIKEAERFSAEMYVA 186
Query: 257 NKLQLLHAEQLAESWKN--VPESFIKE 281
KL+ + + WK +P+ IK+
Sbjct: 187 EKLRQANDTPIPRFWKGKRLPQFIIKQ 213
>gi|58387862|ref|XP_315868.2| AGAP005843-PA [Anopheles gambiae str. PEST]
gi|55238658|gb|EAA11543.2| AGAP005843-PA [Anopheles gambiae str. PEST]
Length = 230
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 102/215 (47%), Gaps = 55/215 (25%)
Query: 108 PTTSAHHSMLLFSS-------MKCVGMDKSYSLP--PLASTST-------SVVLDNPVFN 151
P T+A M+ + + G + S LP PL ST + + NP+
Sbjct: 25 PATAALFPMVALKTRTPRTQQIPLKGHESSILLPIRPLELRSTVRNVEIGNSIRRNPLIP 84
Query: 152 KNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAP---------------------- 189
IDIP + + IIE+PV P P
Sbjct: 85 LKEIIDIPAV--------------SRIIENPVQPPTGPIGDNLTLIPSLDLPTDTGSKDE 130
Query: 190 -EKHAIVMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGER 248
K A +I+ RRRKM+KHKLKKLR RMKFEW K+RQRRE+KKEK FQ E+ ++KE E+
Sbjct: 131 HGKQAARLIVIRRRKMRKHKLKKLRKRMKFEWLKVRQRRELKKEKLFQAELLGQIKEAEK 190
Query: 249 FSAEQYVENKLQLLHAEQLAESWKN--VPESFIKE 281
FSAE YV +KL+ L WK +PE IK+
Sbjct: 191 FSAEAYVASKLRQATDVPLPRFWKGKRLPEFIIKQ 225
>gi|91089759|ref|XP_966664.1| PREDICTED: similar to CG11876 CG11876-PB [Tribolium castaneum]
gi|91090005|ref|XP_966665.1| PREDICTED: similar to CG11876 CG11876-PB [Tribolium castaneum]
gi|270013614|gb|EFA10062.1| hypothetical protein TcasGA2_TC012236 [Tribolium castaneum]
gi|270013689|gb|EFA10137.1| hypothetical protein TcasGA2_TC012323 [Tribolium castaneum]
Length = 193
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/119 (48%), Positives = 74/119 (62%), Gaps = 3/119 (2%)
Query: 145 LDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAIVMIIRRRRKM 204
++ P V ID+P Y+ ++ + ++ IE P I K A +I+ RRRKM
Sbjct: 53 VEAPKIKNIVHIDLPNGLRYVPPLE--DPSSDKEIELPTINGDV-TKEAARLIVIRRRKM 109
Query: 205 KKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLH 263
K+HKLKKLR RMKFEWAK RQRRE KEK FQ E+ + KEGE FSAE+YVE +L L
Sbjct: 110 KRHKLKKLRKRMKFEWAKKRQRREWNKEKLFQAELIKQCKEGEAFSAEKYVEERLAKLR 168
>gi|242247409|ref|NP_001156199.1| uncharacterized protein LOC100164847 [Acyrthosiphon pisum]
gi|239788906|dbj|BAH71108.1| ACYPI005832 [Acyrthosiphon pisum]
Length = 211
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/148 (40%), Positives = 85/148 (57%), Gaps = 17/148 (11%)
Query: 136 LASTSTSVVLDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAIV 195
L +T + + D P N + + IP + D L + +N I A
Sbjct: 50 LPTTISKKITDEP--NPCIPLRIPRPTISLPLADPLVDNDNEI-------------QAAR 94
Query: 196 MIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYV 255
+I+ RR+KMKKHKL+KLR RMK+ WAK+RQ+REM+KEKAFQ E+ +VKE E+FS YV
Sbjct: 95 LIVIRRKKMKKHKLRKLRKRMKYVWAKVRQKREMRKEKAFQAELMGQVKEAEKFSPANYV 154
Query: 256 ENKLQLLHAEQLAESW--KNVPESFIKE 281
+ KL +++ L W K +PE IK+
Sbjct: 155 QEKLNIVNMTILPNRWRGKRLPEFVIKD 182
>gi|195445019|ref|XP_002070135.1| GK11887 [Drosophila willistoni]
gi|194166220|gb|EDW81121.1| GK11887 [Drosophila willistoni]
Length = 512
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/56 (53%), Positives = 42/56 (75%)
Query: 218 FEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAEQLAESWKN 273
FEWAK+RQRREM+KEKAFQ ++ +++K+ E FSAEQYV + L+ + + L WK
Sbjct: 204 FEWAKVRQRREMRKEKAFQAQLMSQIKKAESFSAEQYVADVLKQANEKPLPRFWKG 259
>gi|194765039|ref|XP_001964635.1| GF23287 [Drosophila ananassae]
gi|190614907|gb|EDV30431.1| GF23287 [Drosophila ananassae]
Length = 509
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/119 (47%), Positives = 77/119 (64%), Gaps = 1/119 (0%)
Query: 156 IDIPLMPFYIEKIDKLN-ERNNSIIEDPVIPSQAPEKHAIVMIIRRRRKMKKHKLKKLRI 214
I+ P + E D LN +R +I P+ S+ + A +I+ RRRKMKKHKLKKLR
Sbjct: 140 IESPRSTWITEINDILNRDRVGEVIAPPLDKSKPDKMEAARLIVIRRRKMKKHKLKKLRR 199
Query: 215 RMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAEQLAESWKN 273
+MKFEWAK+RQRREM+KEKAFQ ++ ++K+ E FSAEQ+V L+ + L WK
Sbjct: 200 KMKFEWAKVRQRREMRKEKAFQAKLMAQIKQAEAFSAEQHVAEILRQANETPLPRFWKG 258
>gi|24650943|ref|NP_733266.1| CG11876, isoform B [Drosophila melanogaster]
gi|24650945|ref|NP_733267.1| CG11876, isoform C [Drosophila melanogaster]
gi|23172530|gb|AAN14150.1| CG11876, isoform B [Drosophila melanogaster]
gi|23172531|gb|AAN14151.1| CG11876, isoform C [Drosophila melanogaster]
gi|85857602|gb|ABC86336.1| IP14839p [Drosophila melanogaster]
Length = 273
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 143/291 (49%), Gaps = 35/291 (12%)
Query: 3 MLRHR-PHASNVAVVSIKVGAEIIKSEQLTINFPLGSPVWALIADNDSTAI--EE--HFG 57
MLR R A++ A + + + ++Q+T+ L S + +A +D I EE +
Sbjct: 1 MLRTRLIQAASSAQRAFSTSQKALAAKQMTVRDALNSALDDELARDDRVFILGEEVAQYD 60
Query: 58 QKFNSQRGLMDGFSQCQLMAVSLCGHSLIMELVFLTISQSALGLTVLKNNPTTSAHHSML 117
+ RGL + +++ + I E+ F I A+G + P
Sbjct: 61 GAYKVSRGLWKKYGDKRVI------DTPITEMGFAGI---AVGAAMAGLRPV-------- 103
Query: 118 LFSSMKCVGMDKSYSLPPLASTSTSVVLDNPVFNKNVKIDI----PLMPFYIEKIDKLN- 172
C M ++S+ + PV ++ + I + P + E D L+
Sbjct: 104 ------CEFMTWNFSMQAIDHAKILDCAKPPVGDRPLPISLIIESPRSTWLNEINDILHR 157
Query: 173 ERNNSIIEDPVIPSQAPEKHAIVMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKE 232
+R +I P+ S++ + A +I+ RRRKMKKHKLKKLR +MKFEWAK+RQRREM+KE
Sbjct: 158 DRVGEVIAPPLDKSKSDKMEAARLIVIRRRKMKKHKLKKLRRKMKFEWAKVRQRREMRKE 217
Query: 233 KAFQLEMSTKVKEGERFSAEQYVENKLQLLHAEQLAESWKN--VPESFIKE 281
KAFQ ++ +++K+ E FSAEQ+V L+ + L WK +P IK+
Sbjct: 218 KAFQAKLISQIKQAEAFSAEQHVAEILRQANETPLPRFWKGRRLPAFIIKQ 268
>gi|390177588|ref|XP_003736426.1| GA11252, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859107|gb|EIM52499.1| GA11252, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 275
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 32/66 (48%), Positives = 44/66 (66%), Gaps = 2/66 (3%)
Query: 218 FEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAEQLAESWKN--VP 275
FEWAK+RQRREM+KEKAFQ ++ ++K+ E FSAEQ+V L+ + L WK +P
Sbjct: 205 FEWAKVRQRREMRKEKAFQAKLMAQIKQAESFSAEQHVAEILRQANDNPLPRFWKGRRLP 264
Query: 276 ESFIKE 281
IK+
Sbjct: 265 AFIIKQ 270
>gi|307195648|gb|EFN77490.1| hypothetical protein EAI_11500 [Harpegnathos saltator]
Length = 193
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 93/162 (57%), Gaps = 20/162 (12%)
Query: 153 NVKIDIPLMPFYIEKID---------KLNERNNSIIEDPVIPSQA---P------EKHAI 194
N+ +D+P+ P + I+ ++ +N + DP +P + P EK A+
Sbjct: 25 NIDLDVPISPLTPKIIEIPVTRIPPLQVPTKNLPVQYDPPLPQTSIDLPSNGNNFEKQAV 84
Query: 195 VMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQY 254
+I+ RR+K+KKHK +KLR +MKF W K+RQRR +K K FQ EM K+KE + F A++Y
Sbjct: 85 RLIVIRRKKIKKHKRRKLRKKMKFVWMKLRQRRNYEKVKRFQNEMVAKIKEAQAFDAKEY 144
Query: 255 VENKLQLLHAEQLAESWKN--VPESFIKEWTERKERHRNEKQ 294
V+ +L +L+ E++ +++ +P IK + K+ R K+
Sbjct: 145 VKRRLTILNKERIPRTYRGEILPPEMIKRFLAEKKAKREVKR 186
>gi|340724394|ref|XP_003400567.1| PREDICTED: hypothetical protein LOC100650901 [Bombus terrestris]
Length = 233
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 86/153 (56%), Gaps = 21/153 (13%)
Query: 145 LDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAIVMIIRRRRKM 204
++ P+ + +K D+PL N+ E P EK A+ +++ RR+KM
Sbjct: 89 MEEPLIKQPIKQDLPL--------------TNTSFELPTT-ENILEKLAVRLMVIRRKKM 133
Query: 205 KKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHA 264
KKHK +KLR RMKF WAKIR R + KEK FQ E+ K+KE F+ ++YVE +L +L
Sbjct: 134 KKHKRRKLRKRMKFVWAKIRANRNITKEKVFQAELIAKIKEAHMFNPKKYVEERLAILDK 193
Query: 265 EQLAESWK------NVPESFIKEWTERKERHRN 291
E L ++++ ++ FIKE TE + R RN
Sbjct: 194 EILPKTFRGEILSEDMIRKFIKEKTEERMRKRN 226
>gi|195574655|ref|XP_002105300.1| GD21413 [Drosophila simulans]
gi|194201227|gb|EDX14803.1| GD21413 [Drosophila simulans]
Length = 448
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 35/40 (87%)
Query: 218 FEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVEN 257
FEWAK+RQRREM+KEKAFQ ++ +++K+ + FSAEQ++ N
Sbjct: 165 FEWAKVRQRREMRKEKAFQAKLISQIKQAKAFSAEQHIIN 204
>gi|350397678|ref|XP_003484953.1| PREDICTED: hypothetical protein LOC100748317 [Bombus impatiens]
Length = 235
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 84/153 (54%), Gaps = 21/153 (13%)
Query: 145 LDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAIVMIIRRRRKM 204
++ P+ + K D+PL N+ E P + EK A+ +I+ RR+KM
Sbjct: 91 MEEPLIKQPTKQDLPL--------------TNTSFELPTTENML-EKLAVRLIVIRRKKM 135
Query: 205 KKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHA 264
KKHK +KLR RMKF WAKIR R KEK FQ E+ ++KE F ++YVE +L +L
Sbjct: 136 KKHKRRKLRKRMKFVWAKIRVNRNAAKEKVFQAELIAQIKEAHMFDPKKYVEERLAILDK 195
Query: 265 EQLAESWKN--VPES----FIKEWTERKERHRN 291
E L ++K +PE FI+E E++ R RN
Sbjct: 196 EILPRTFKGEILPEDMIKKFIQEKVEKRMRKRN 228
>gi|345495229|ref|XP_003427462.1| PREDICTED: hypothetical protein LOC100679774 [Nasonia vitripennis]
Length = 233
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 97/177 (54%), Gaps = 16/177 (9%)
Query: 135 PLASTSTSVVLDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAI 194
P S +++ P+ N +ID P+ KI+K + N ++E P + EK A
Sbjct: 69 PRNGLPISKIIECPII-PNFQIDEPVK----NKIEKSDIIPNEVVETP--GTNIIEKQAA 121
Query: 195 VMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQY 254
+I+ RR+KMKKHK +K +MKF K RQ++++ +E+AFQ E+ + E+F ++Y
Sbjct: 122 RLIVIRRKKMKKHKRRKFLKKMKFVILKRRQKKKLLRERAFQAELQEIYAKAEQFDPKEY 181
Query: 255 VENKLQLLHAEQLAESWKN--VPESFIKEWTERKERHRNEKQAKESRKKALHSIYVK 309
V +L +L E+L W+ VPES I ++ + EK K++ K+ LH+ +K
Sbjct: 182 VAEQLHILKRERLPTKWRGEVVPESMIIQFMK-------EKADKKAHKQRLHNYRLK 231
>gi|307175919|gb|EFN65732.1| hypothetical protein EAG_05465 [Camponotus floridanus]
Length = 234
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 93/164 (56%), Gaps = 23/164 (14%)
Query: 151 NKNVKIDIPLMPFYIEKIDKLNERNN--------SIIEDPV-IPSQAP--EKHAIVMIIR 199
N+N+K I MP + KI L E N I+E + +P+ EK A+ +I+
Sbjct: 74 NRNLKNIIE-MP--VAKISPLQEPINRLPIQYDLPILEKSIDLPTNGKIFEKQAVRLIVI 130
Query: 200 RRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKL 259
R +KMKKHK KKLR RM F W KIR +R +K+EK FQ E+ K+K+ + F A YV+ +L
Sbjct: 131 RHKKMKKHKRKKLRKRMHFLWLKIRNKRNLKREKLFQAELIDKIKQAQAFDARVYVKERL 190
Query: 260 QLLHAEQLAESWKN--VPESFIKEWTERKERHRNEKQAKESRKK 301
+L+ E++ +++ +P IK+ R+EK+AK K+
Sbjct: 191 DILNKERIPRTFRGEILPAEMIKKL-------RDEKKAKLEAKR 227
>gi|332017820|gb|EGI58481.1| hypothetical protein G5I_13447 [Acromyrmex echinatior]
Length = 165
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 47/76 (61%), Gaps = 6/76 (7%)
Query: 222 KIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAEQLAESWKN--VPESFI 279
KIR +R KKEK F+ E++ KVKE E F A++YV KL +L+ + L +++ +P I
Sbjct: 84 KIRLKRRQKKEKLFEAELTAKVKEAEAFDAKEYVNEKLSILNKKILPRTFRGEILPAEMI 143
Query: 280 KEWTE----RKERHRN 291
K++ + RKE RN
Sbjct: 144 KQFLDEKRARKEAKRN 159
>gi|380022333|ref|XP_003695004.1| PREDICTED: uncharacterized protein LOC100872300 [Apis florea]
Length = 235
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 79/128 (61%), Gaps = 15/128 (11%)
Query: 145 LDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAIVMIIRRRRKM 204
++ P+ N+ +K D+PL I+ +L N++ EK AI MI+ RR+KM
Sbjct: 89 MEEPLINQPIKQDLPL----IDTSFQLPPTENTL-----------EKLAIRMIVIRRKKM 133
Query: 205 KKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHA 264
KKHK +KLR +MKF+WAKI+ R + KEK FQ E+ ++K+ + F ++YVE +L +L
Sbjct: 134 KKHKRRKLRKKMKFKWAKIKANRNIMKEKLFQAELVAQIKQAQNFDPKKYVEERLTILDK 193
Query: 265 EQLAESWK 272
E L ++++
Sbjct: 194 EILPKTYR 201
>gi|322800719|gb|EFZ21623.1| hypothetical protein SINV_02043 [Solenopsis invicta]
Length = 276
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 86/153 (56%), Gaps = 28/153 (18%)
Query: 146 DNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAIVMIIRRRRKMK 205
D+PV K+V D+P N SII EK A+ M+ R +KMK
Sbjct: 133 DSPVLEKSV--DLP--------------TNGSII----------EKQAVNMLRIRHKKMK 166
Query: 206 KHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAE 265
KHKLKKLR +MKF WAK+R +R+ KKEK FQ E+ KVKE + F A+QYV +L +L+ E
Sbjct: 167 KHKLKKLRKKMKFVWAKLRLKRKQKKEKLFQAELIGKVKEAQAFDAKQYVNERLNILNKE 226
Query: 266 QLAESWKN--VPESFIKEWTERKERHRNEKQAK 296
+L +++ +P IK++ + K K+ K
Sbjct: 227 RLPRTFRGELLPPEMIKQFLDEKRARYEAKRYK 259
>gi|383854866|ref|XP_003702941.1| PREDICTED: uncharacterized protein LOC100879594 [Megachile
rotundata]
Length = 234
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 83/150 (55%), Gaps = 19/150 (12%)
Query: 135 PLASTSTSVVLDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAI 194
PL+ +S+ PV ++++ +P +++K L N EK AI
Sbjct: 82 PLSKIISSI--QEPVIQQSIRDKLP----FVDKSYNLPTMEN-----------VTEKLAI 124
Query: 195 VMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQY 254
+I+ RR+KMKKHK KKLR +MK W KIR RR + K KA E+ K+K+ E F A +Y
Sbjct: 125 RLIVIRRKKMKKHKRKKLRRKMKVVWEKIRTRRNVAKAKALHAELLKKIKKAEAFDAREY 184
Query: 255 VENKLQLLHAEQLAESWKN--VPESFIKEW 282
+ K+ + E++ ++++ +P++ IKE+
Sbjct: 185 INKKIAFIKKERIPQTYRGEILPKAMIKEF 214
>gi|321476575|gb|EFX87535.1| hypothetical protein DAPPUDRAFT_306398 [Daphnia pulex]
Length = 214
Score = 44.3 bits (103), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 41/89 (46%), Positives = 60/89 (67%), Gaps = 5/89 (5%)
Query: 178 IIEDPVIPSQAP-----EKHAIVMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKE 232
I ED +P+ +P EK A MI+ RRRKMKKHKLKKLR KFE+ ++ +R+ KKE
Sbjct: 87 IKEDISLPTTSPDMPQLEKQAARMIVIRRRKMKKHKLKKLRKVRKFEYRRMALKRKTKKE 146
Query: 233 KAFQLEMSTKVKEGERFSAEQYVENKLQL 261
+ FQ+ +S +V + E+F A+ YVE +++
Sbjct: 147 REFQMRLSAQVAQAEQFDAKSYVEGIIRI 175
>gi|442749835|gb|JAA67077.1| Putative secreted protein [Ixodes ricinus]
Length = 238
Score = 44.3 bits (103), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 36/65 (55%)
Query: 216 MKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAEQLAESWKNVP 275
M F WAK R RRE KKE+ F+ E+ ++ E + F AE+YV L + E+ +
Sbjct: 157 MHFLWAKQRFRREKKKEQVFRAELLAQIHEAQAFDAEKYVTGVLDKIRNRPQPETPEERR 216
Query: 276 ESFIK 280
E F++
Sbjct: 217 EKFLE 221
>gi|242000260|ref|XP_002434773.1| conserved hypothetical protein [Ixodes scapularis]
gi|215498103|gb|EEC07597.1| conserved hypothetical protein [Ixodes scapularis]
Length = 201
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 36/65 (55%)
Query: 216 MKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAEQLAESWKNVP 275
M F WAK R RRE KKE+ F+ E+ ++ E + F AE+YV L + E+ +
Sbjct: 120 MHFLWAKQRFRREKKKEQVFRAELLAQIHEAQAFDAEKYVTGVLDKIRNRPQPETPEERR 179
Query: 276 ESFIK 280
E F++
Sbjct: 180 EKFLE 184
>gi|391326466|ref|XP_003737735.1| PREDICTED: uncharacterized protein LOC100902861 [Metaseiulus
occidentalis]
Length = 205
Score = 42.0 bits (97), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 34/52 (65%), Gaps = 3/52 (5%)
Query: 216 MKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQ---LLHA 264
M+F +AK+R RRE KE F+ E+ K++E E F AE+YV++ +Q HA
Sbjct: 123 MRFVFAKLRFRREKAKEAQFKEELLCKIREAEEFDAEKYVKSVIQQIRDRHA 174
>gi|226372406|gb|ACO51828.1| Aurora kinase A-interacting protein [Rana catesbeiana]
Length = 196
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 29/40 (72%)
Query: 200 RRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEM 239
RRRKM KHK KKL+ R+KF K+ RR +KK+K F+ ++
Sbjct: 131 RRRKMNKHKYKKLQKRVKFVRQKVLHRRTVKKQKNFEKDL 170
>gi|434398617|ref|YP_007132621.1| transposase, IS605 OrfB family [Stanieria cyanosphaera PCC 7437]
gi|428269714|gb|AFZ35655.1| transposase, IS605 OrfB family [Stanieria cyanosphaera PCC 7437]
Length = 405
Score = 40.8 bits (94), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 64/139 (46%), Gaps = 21/139 (15%)
Query: 152 KNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPEKHAIVMIIRRRRKMKKH-KLK 210
K V ID+ L FY + NN +E P +A EK + R+ RK KK+ K
Sbjct: 184 KCVAIDVGLKYFYCDS-------NNETVECPKYYRKA-EKRLNKLNRRKSRKFKKNQKQS 235
Query: 211 KLRIRMKFEWAK-----IRQRREMKKEKAFQLEMSTKVKEGERFSAEQYVENKLQLLHAE 265
K I+ + ++AK RQR E KEKA +L S + E + V NK
Sbjct: 236 KNYIKARNKYAKAHLKVTRQREEFAKEKALRLIQSNDLIAYEDLKVKNLVRNK------- 288
Query: 266 QLAESWKNVPESFIKEWTE 284
+LA+S + S +++W E
Sbjct: 289 KLAKSINDAAWSQLRKWIE 307
>gi|213511590|ref|NP_001134605.1| Aurora kinase A-interacting protein [Salmo salar]
gi|209734614|gb|ACI68176.1| Aurora kinase A-interacting protein [Salmo salar]
Length = 212
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 69/156 (44%), Gaps = 35/156 (22%)
Query: 131 YSLPPLASTSTSVVLDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPE 190
YSLPPL P+ +V +D ++P + + L E + S+ P
Sbjct: 91 YSLPPLLEAP------QPLEEGDVMVDTKVLPPFA--VPLLEEGDGSV---------TPL 133
Query: 191 KHAIVMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEMSTKVKEGERFS 250
V+ IRRR KM +HK KKL R KF ++ + R KK+K F+ ++
Sbjct: 134 SCKNVLEIRRR-KMNRHKYKKLLKRTKFLRRRVLEGRRKKKQKRFEKDL----------- 181
Query: 251 AEQYVENKLQLLHAEQLAESWKNVPESFIKEWTERK 286
Q + + L A E W N P+ FIK++ ++
Sbjct: 182 --QRIWMRAGLKKA---PEGW-NTPKIFIKQYKSKR 211
>gi|148237526|ref|NP_001090093.1| uncharacterized protein LOC735168 [Xenopus laevis]
gi|76779503|gb|AAI06337.1| MGC130871 protein [Xenopus laevis]
Length = 197
Score = 37.7 bits (86), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 27/40 (67%)
Query: 200 RRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEM 239
RRRKM KHK KKL+ RMKF KIR R +++ F+ ++
Sbjct: 133 RRRKMNKHKYKKLQKRMKFLKRKIRNGRRHRQQAQFEKDL 172
>gi|432866615|ref|XP_004070891.1| PREDICTED: aurora kinase A-interacting protein-like [Oryzias
latipes]
Length = 210
Score = 37.4 bits (85), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 53/109 (48%), Gaps = 19/109 (17%)
Query: 131 YSLPPLASTSTSVVLDNPVFNKNVKIDIPLMPFYIEKIDKLNERNNSIIEDPVIPSQAPE 190
YSLPPL +S D + N+ V L P + +D E P P +
Sbjct: 90 YSLPPLLESSQPQG-DVELLNEKV-----LPPISVPSLDD--------GESPATPMKC-- 133
Query: 191 KHAIVMIIRRRRKMKKHKLKKLRIRMKFEWAKIRQRREMKKEKAFQLEM 239
K+ + + RRRKM +HK KKL+ R KF ++ + R KK+K F+ ++
Sbjct: 134 KNVLEI---RRRKMNRHKYKKLQKRTKFLRRRVMEGRGKKKQKRFEEDL 179
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.131 0.375
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,704,714,638
Number of Sequences: 23463169
Number of extensions: 183506264
Number of successful extensions: 704955
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 152
Number of HSP's successfully gapped in prelim test: 856
Number of HSP's that attempted gapping in prelim test: 697913
Number of HSP's gapped (non-prelim): 6870
length of query: 319
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 177
effective length of database: 9,027,425,369
effective search space: 1597854290313
effective search space used: 1597854290313
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)