BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy444
(495 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|145497979|ref|XP_001434978.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402106|emb|CAK67581.1| unnamed protein product [Paramecium tetraurelia]
Length = 5605
Score = 88.6 bits (218), Expect = 7e-15, Method: Composition-based stats.
Identities = 87/471 (18%), Positives = 215/471 (45%), Gaps = 21/471 (4%)
Query: 6 NLKFKEWHTKISEAEL---LVKQIEYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKD 62
+ K K T + E+ L +V + + N+Q E EQ +E +KQ + + + + + +
Sbjct: 1061 SFKMKNEITDVVESYLKGNVVIKPQISNSQKKEQEQSTQE-IKQVNSDIKVVNQEVKQVN 1119
Query: 63 QELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKYQ------RDLNQKTETIRSLTN 116
QE++ Q+ +Q+ + + Q + + E +++ ++ + + NQ+T+ + T
Sbjct: 1120 QEIKQTNQEEKQTTQETKSVNQEIRQNNSETQQINQETKSVISETKSTNQETQQVNQETK 1179
Query: 117 KNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKEN 176
+ + Q +E + K+ +T E+ + + + EI++ ++EN++++ + V +E
Sbjct: 1180 QTNQEVKQTTQETKQINQETKQTNQETREVSQETKQVNQEIKQTTQENKQVNQE-VKQET 1238
Query: 177 QEVTELMKEVDKKDNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEIIHKDDE 236
Q+V + K+V ++ ++ VK + KQV N+E+ + E +++
Sbjct: 1239 QQVNQQTKQVSQETQQTNQETRQTTQEVKQTNQESKQV-----NQEVKQTTQETKQTNEQ 1293
Query: 237 IRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLH-- 294
+Q +IKQ +I + K++ ++ ++ ++ + P K+ T L
Sbjct: 1294 TKQTNEQIKQSNEQIKQATQETKQITQEIKQVDQKQQTAVPYDHDTIKEGYSDTPGLKPG 1353
Query: 295 KLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENE--CERAGLLTKEKRKDEEIKR 352
K+ + ++ Q + N+K LD E+ K + +++ E E A + + + D+ +
Sbjct: 1354 KIDQSEQQECQEQTNDKDTPLDRPIEK-KSVSRTVQQQEQTSEEAQAIIIDSKVDQSLSN 1412
Query: 353 LKEEIGKYKMLVQERDQLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEKIKLR 412
++ K + Q+ + N +E +Q V + + + + + ++ +
Sbjct: 1413 SEQNQEIIKKVDQKVESSQNNAQETEQVTSKVTETTSQTNSITQQTNDQSSITNKQTQQT 1472
Query: 413 DSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQR 463
+ + + ++ I E S+ NN+ +Q+ E +NN I E + +Q++ +
Sbjct: 1473 NETIQQNNKTIQETNESISQNNKTVQETNETTQQNNKTIQETNETVQQVNK 1523
>gi|157869463|ref|XP_001683283.1| hypothetical protein LMJF_22_1320 [Leishmania major strain Friedlin]
gi|68224167|emb|CAJ04669.1| hypothetical protein LMJF_22_1320 [Leishmania major strain Friedlin]
Length = 2046
Score = 62.4 bits (150), Expect = 5e-07, Method: Composition-based stats.
Identities = 79/457 (17%), Positives = 205/457 (44%), Gaps = 12/457 (2%)
Query: 15 KISEAELLVKQIEYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQ 74
++ EAE + +E + + +LRE +++ ++ ++ +D E+ DLR+Q ++
Sbjct: 1212 QLREAEERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLRE 1271
Query: 75 SVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELKVQYEE----EID 130
+ +A E E +L+E+ + + ++++ E+ E+ E
Sbjct: 1272 AEERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEH 1331
Query: 131 NRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQE-VTELMKEVDKK 189
RD ++ D D ++AE E ++ D +E ++ M+E
Sbjct: 1332 ARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEIDRVKELLSSSMREAASS 1391
Query: 190 DNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMS 249
+ L ++ E + + L +Q+ + +E L+ E+ +++R+ + + +
Sbjct: 1392 GEMLGALEEQREEAAREMRGLREQLAVAQVRRE--ALDAEVADLREQLREAEEHARDVEA 1449
Query: 250 EIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRN 309
+ + ++ + L +Q E ER VE + ++ E+L + AEEH ++ +++
Sbjct: 1450 QQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLREQLRE-AEEHARDVEAQQS 1508
Query: 310 EKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQERD- 368
++ E+ ++RE+++ EE ++ E +++ + E+++ +E + +RD
Sbjct: 1509 DRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDA 1568
Query: 369 ---QLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAE 425
L L E ++ A+DV + +D +L + E+ E+ + +++ + +A+
Sbjct: 1569 EVADLREQLREAEERARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVAD 1628
Query: 426 MKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQ 462
++ + +D+E Q ++ +A + + +++E +
Sbjct: 1629 LREQLREAEEHARDVEAQQSDRDAEVADLREQLREAE 1665
Score = 60.8 bits (146), Expect = 1e-06, Method: Composition-based stats.
Identities = 82/441 (18%), Positives = 206/441 (46%), Gaps = 28/441 (6%)
Query: 39 ELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQE 98
+LRE +++ ++ ++ +D E+ DLR+Q +++ +A E E +L+E
Sbjct: 1180 DLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLRE 1239
Query: 99 KYQRDLNQKTETIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIE 158
+ + + ++++ E+ E++ + R +++ + S+ D V A++
Sbjct: 1240 QLREAEEHARDVEAQQSDRDAEV-ADLREQLREAEERARDVEAQQSDRDAEV----ADLR 1294
Query: 159 KLSKENEKLSMDLVFKENQ------EVTELMKEVDKKDNYIETLLKENEANVKLCKELEK 212
+ +E E+ + D+ +++ ++ E ++E ++ +E + +A V +E +
Sbjct: 1295 EQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLR 1354
Query: 213 QVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNE----- 267
+ E + E + + +D EI + + M E AS E + L +Q E
Sbjct: 1355 EAEEHARDVEAQQSD-----RDAEIDRVKELLSSSMREAASSGEMLGALEEQREEAAREM 1409
Query: 268 --LVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRL 325
L E+L + R E L ++ E+L + AEEH ++ +++++ E+ ++RE+++
Sbjct: 1410 RGLREQLAVAQVRREALDAEVADLREQLRE-AEEHARDVEAQQSDRDAEVADLREQLREA 1468
Query: 326 EEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQERD----QLANMLEEYKQNA 381
EE ++ E +++ + E+++ +E + +RD L L E +++A
Sbjct: 1469 EERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEHA 1528
Query: 382 KDVVYYEDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAEMKLSMEANNRRIQDLE 441
+DV + +D +L + E+ E + +++ + +A+++ + R +D+E
Sbjct: 1529 RDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVE 1588
Query: 442 EQLAENNAWINEQDLKIQEIQ 462
Q ++ +A + + +++E +
Sbjct: 1589 AQQSDRDAEVADLREQLREAE 1609
Score = 56.2 bits (134), Expect = 4e-05, Method: Composition-based stats.
Identities = 74/447 (16%), Positives = 202/447 (45%), Gaps = 12/447 (2%)
Query: 15 KISEAELLVKQIEYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQ 74
++ EAE + +E + + +LRE +++ +++ ++ +D E+ DLR+Q ++
Sbjct: 1240 QLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLREQLRE 1299
Query: 75 SVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLE---LKVQYEEEIDN 131
+ A E E +L+E+ + + ++++ E L+ Q E ++
Sbjct: 1300 AEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEH 1359
Query: 132 RDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDN 191
+ + +EID++ + + + + E L + ++ +E M+ + ++
Sbjct: 1360 ARDVEAQQSDRDAEIDRVKELLSSSMREAASSGEMLGA--LEEQREEAAREMRGLREQLA 1417
Query: 192 YIETLLKENEANVKLCKEL--EKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMS 249
+ + +A V +E E + H ++ + + E+ +++R+ + + + +
Sbjct: 1418 VAQVRREALDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEA 1477
Query: 250 EIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRN 309
+ + ++ + L +Q E E VE + ++ E+L + AEEH ++ +++
Sbjct: 1478 QQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLRE-AEEHARDVEAQQS 1536
Query: 310 EKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQERD- 368
++ E+ ++RE+++ EE ++ E +++ + E+++ +E + +RD
Sbjct: 1537 DRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDA 1596
Query: 369 ---QLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAE 425
L L E ++ A+DV + +D +L + E+ E + +++ + +A+
Sbjct: 1597 EVADLREQLREAEERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVAD 1656
Query: 426 MKLSMEANNRRIQDLEEQLAENNAWIN 452
++ + +D+E Q ++ +A I+
Sbjct: 1657 LREQLREAEEHARDVEAQQSDRDAEID 1683
Score = 45.4 bits (106), Expect = 0.058, Method: Composition-based stats.
Identities = 68/355 (19%), Positives = 158/355 (44%), Gaps = 18/355 (5%)
Query: 3 LLENLKFKEWHTKISEAELLVKQIEYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKD 62
L E L+ E H + EA+ + E + L +RE + L L +
Sbjct: 1349 LREQLREAEEHARDVEAQQSDRDAEIDRVKEL-LSSSMREA-ASSGEMLGALEEQREEAA 1406
Query: 63 QELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELK 122
+E+R LR+Q + + + + ++++ +E +E + Q+++ + + +L+
Sbjct: 1407 REMRGLREQLAVAQVRREALDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLR 1466
Query: 123 VQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTEL 182
E E RD ++ D D ++AE E ++ D E ++ E
Sbjct: 1467 ---EAEERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRD---AEVADLREQ 1520
Query: 183 MKEVDKKDNYIETLLKENEANVKLCKEL--EKQVHELNINKELDRLNHEIIHKDDEIRQN 240
++E ++ +E + +A V +E E + H ++ + + E+ +++R+
Sbjct: 1521 LREAEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREA 1580
Query: 241 ASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEH 300
+ + + ++ + ++ + L +Q E ER VE + ++ E+L + AEEH
Sbjct: 1581 EERARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLREQLRE-AEEH 1639
Query: 301 EELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKE 355
++ +++++ E+ ++RE+++ EE ++ E +++ +D EI R+KE
Sbjct: 1640 ARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSD-------RDAEIDRVKE 1687
Score = 41.6 bits (96), Expect = 1.0, Method: Composition-based stats.
Identities = 49/241 (20%), Positives = 114/241 (47%), Gaps = 12/241 (4%)
Query: 233 KDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNE-------LVERLNSVEPRIEVLTKD 285
+D EI + + M E AS E + L +Q E L E+L + R E L +
Sbjct: 1118 RDAEIDRVKELLSSSMREAASSGEMLGALEEQREEAAREMRGLREQLAVAQVRREALDAE 1177
Query: 286 IQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKR 345
+ E+L + AEEH ++ +++++ E+ ++RE+++ EE ++ E +++ +
Sbjct: 1178 VADLREQLRE-AEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVAD 1236
Query: 346 KDEEIKRLKEEIGKYKMLVQERD----QLANMLEEYKQNAKDVVYYEDMKDKLEVELDKL 401
E+++ +E + +RD L L E ++ A+DV + +D +L +
Sbjct: 1237 LREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLREQ 1296
Query: 402 RMESAEKIKLRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEI 461
E+ E + +++ + +A+++ + +D+E Q ++ +A + + +++E
Sbjct: 1297 LREAEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREA 1356
Query: 462 Q 462
+
Sbjct: 1357 E 1357
>gi|154413468|ref|XP_001579764.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
gi|121913974|gb|EAY18778.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
Length = 3369
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 181/350 (51%), Gaps = 61/350 (17%)
Query: 53 ELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKY---QRDLNQKTE 109
ELAK KD+E ++ + Q++++ ++ + K K LQEK ++++N K E
Sbjct: 865 ELAK----KDKENKEFEELMSQAISEK---DEEISKSKNGISSLQEKLAEKEKEINSKNE 917
Query: 110 TIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSM 169
+ +N +L Q +EEI N + I EL + S D+ +S+ +++I +L +E K +
Sbjct: 918 ANTAEKEENSKLISQRDEEISNLNKSIDELRKEISTKDETISQFESKINELIEEISKKEL 977
Query: 170 DLVFKENQEVTELMKEVDKKDNYI----------ETLLKENEANV----KLCKELEKQVH 215
+ KE + + EL +++ +K+N I ET + E E+ + K ELE+ V
Sbjct: 978 TINEKETK-IAELNEQITQKENEINGLKEAEKVMETKISEIESQLTEKEKSINELEETVQ 1036
Query: 216 ----ELN---------------INKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEE 256
E+N +N+ + + + EI K++EI N SKI +L +I++KE
Sbjct: 1037 NKETEINQKNEELSERETKINELNEIISQKDSEIQQKNEEISSNNSKIDELNQQISNKEN 1096
Query: 257 SMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNE---KKL 313
S+++L + + L + + E +IE LTK + K E+ +KL +E IQT+ E K+
Sbjct: 1097 SLQELTDKVHSLETKNSEQETQIEELTKLVSEKEEENNKL----QETIQTKETEIKDKQS 1152
Query: 314 ELDEMREEI----KRLEEVLKENECERAGLLTKE-KRKDEEIKRLKEEIG 358
++DEM +EI K +EE+ ER L +E K K+ +I +KE+I
Sbjct: 1153 KVDEMNQEISDKDKSIEEI-----TERVNKLEEENKTKNSQIDEMKEQIS 1197
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 83/268 (30%), Positives = 142/268 (52%), Gaps = 48/268 (17%)
Query: 127 EEIDNRDTRIKELTSDTSEIDKIVSRKDAEI-EKLSKENE-------------KLSMDLV 172
E+I +D +K+ S+ SE+ +VS+K+ E+ EK + NE +L+ L
Sbjct: 408 EQIQTQDINLKQKDSNISELQVLVSQKETELSEKDNSINEFIHKLEEKDLQIKELNEQLN 467
Query: 173 FKENQEVTELMKEVDKKDNYIETLL------------KENEANVKL--CKELEKQVHELN 218
KE+Q + EL ++ K+N ++ + KE E N K E E +++ELN
Sbjct: 468 NKESQ-INELNAQISDKENSLQEITDKVHTLEETVQNKETEINQKNEELSERETKINELN 526
Query: 219 INKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPR 278
+ + + + EI K++EI N SKI +L +I++KE S+++L + + L + + E +
Sbjct: 527 --EIISQKDSEIQQKNEEISSNNSKIDELNQQISNKENSLQELTDKVHSLETKNSEQETQ 584
Query: 279 IEVLTKDIQIKTEKLHKLAEEHEELIQTRRNE---KKLELDEMREEI----KRLEEVLKE 331
I+ LTK + K E+ +KL +E IQT+ E K+ ++DEM +EI K +EE+
Sbjct: 585 IDELTKLVSEKEEENNKL----QETIQTKETEIKDKQSKVDEMNQEISDKDKSIEEI--- 637
Query: 332 NECERAGLLTKE-KRKDEEIKRLKEEIG 358
ER L +E K K+ +I +KE+I
Sbjct: 638 --TERVNKLEEENKTKNSQIDEMKEQIS 663
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 121/429 (28%), Positives = 206/429 (48%), Gaps = 74/429 (17%)
Query: 40 LRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKK------EF 93
L E + K+ + EL + +K+QE+ D + Q T+ N Q E+IK+ EF
Sbjct: 715 LYEEIANKNTNITELNEQISSKNQEIVDRDNKLQSLGTELN---QKNEEIKEKDSKIGEF 771
Query: 94 KELQEKYQRDLNQKTETIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRK 153
+L K ++NQ E I +++K EL EI +D I EL + ++ K
Sbjct: 772 NDLVSKKDSEINQLQEEIADISSKIEELN----NEIATKDASILELNNK-------IAEK 820
Query: 154 DAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKENEANVKLCKELEKQ 213
D +I+ L +E L KEN ++++L+ + D+K + IE + EL K+
Sbjct: 821 DLKIKSLDEEKSSLQSKPAEKEN-DISDLLVKYDEKCSEIEAV----------QSELAKK 869
Query: 214 VHELNINKELDRL-NHEIIHKDDEIRQNASKIKQLMSEIASKE--------------ESM 258
E NKE + L + I KD+EI ++ + I L ++A KE E
Sbjct: 870 DKE---NKEFEELMSQAISEKDEEISKSKNGISSLQEKLAEKEKEINSKNEANTAEKEEN 926
Query: 259 KKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEM 318
KLI Q +E + LN I+ L K+I K E + + + ELI+ ++K+L ++E
Sbjct: 927 SKLISQRDEEISNLN---KSIDELRKEISTKDETISQFESKINELIE-EISKKELTINEK 982
Query: 319 REEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQERDQLANMLEEYK 378
+I L E + + E E GL KE K E K I + + + E+++ N LEE
Sbjct: 983 ETKIAELNEQITQKENEINGL--KEAEKVMETK-----ISEIESQLTEKEKSINELEETV 1035
Query: 379 QNAKDVVYY--EDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAEMKLSMEANNRR 436
QN + + E++ ++ E ++++L E I +DSE+ +K+E I+ +NN +
Sbjct: 1036 QNKETEINQKNEELSER-ETKINELN----EIISQKDSEIQQKNEEIS-------SNNSK 1083
Query: 437 IQDLEEQLA 445
I +L +Q++
Sbjct: 1084 IDELNQQIS 1092
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 122/477 (25%), Positives = 223/477 (46%), Gaps = 92/477 (19%)
Query: 7 LKFKEWHTKISEAELLVKQIEY----KNNQLLEFEQELRETMKQKDDKLLELAKTARTKD 62
+ K+ + ISE ++LV Q E K+N + EF +L E KD ++ EL + K+
Sbjct: 415 INLKQKDSNISELQVLVSQKETELSEKDNSINEFIHKLEE----KDLQIKELNEQLNNKE 470
Query: 63 QELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELK 122
++ +L Q + E L++I + L+E Q NK E+
Sbjct: 471 SQINELNAQ-------ISDKENSLQEITDKVHTLEETVQ--------------NKETEIN 509
Query: 123 VQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTEL 182
Q EE+ R+T+I EL ++I+S+KD+EI+ ++NE++S + N ++ EL
Sbjct: 510 -QKNEELSERETKINEL-------NEIISQKDSEIQ---QKNEEISSN-----NSKIDEL 553
Query: 183 MKEVDKKDNYIETLLKENEANVKLCKELEKQVHELN-----INKELDRLNHEIIHKDDEI 237
+++ K+N ++ L + + E E Q+ EL +E ++L I K+ EI
Sbjct: 554 NQQISNKENSLQELTDKVHSLETKNSEQETQIDELTKLVSEKEEENNKLQETIQTKETEI 613
Query: 238 RQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIK--TEKLHK 295
+ SK+ ++ EI+ K++S++ E+ ER+N +E E TK+ QI E++
Sbjct: 614 KDKQSKVDEMNQEISDKDKSIE-------EITERVNKLEE--ENKTKNSQIDEMKEQISS 664
Query: 296 LAEEHEELIQ---TRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKR 352
+ E I T+ N K E+D + ++++ KE E E+A I
Sbjct: 665 ITTNEETAISTLNTQLNNKNNEIDLLHQQLQS-----KETENEKA------------INE 707
Query: 353 LKEEIGK-YKMLVQERDQLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEK--- 408
L +++ K Y+ + + + + E+ +++V ++ L EL++ E EK
Sbjct: 708 LNDKLNKLYEEIANKNTNITELNEQISSKNQEIVDRDNKLQSLGTELNQKNEEIKEKDSK 767
Query: 409 -------IKLRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKI 458
+ +DSE+++ E IA++ +E N I + + E N I E+DLKI
Sbjct: 768 IGEFNDLVSKKDSEINQLQEEIADISSKIEELNNEIATKDASILELNNKIAEKDLKI 824
>gi|154416763|ref|XP_001581403.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
gi|121915630|gb|EAY20417.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
Length = 4263
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 131/245 (53%), Gaps = 40/245 (16%)
Query: 256 ESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEE---HEELIQTRRNEKK 312
+++++L KQ E E +N +E +V ++I K +++ ++ + EE+I NE+
Sbjct: 1375 QTLEQLAKQLQEAEEDINVLEGNCQVYEQEIAEKDKQIEQMTNDIKSLEEVI----NEQS 1430
Query: 313 LELDEMR-------EEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQ 365
+D ++ EEIK+L++ + ENE L T ++KD EI++ KEEI ++K +
Sbjct: 1431 NTIDSLKQDVATKEEEIKQLKQTVSENEEVIKQLQTDIEQKDAEIQKNKEEIEQHKQTIS 1490
Query: 366 ERD----QLANMLEEYKQNAKD-----------VVYYEDMKDKLEVELDKLRMESAEKIK 410
+RD QL + +E++KQ D + E+ +L+ E+++ + AE
Sbjct: 1491 QRDETIKQLQSEIEQHKQTIADKNNEIEQLKNTISEREETIKQLQNEIEQHKQTMAE--- 1547
Query: 411 LRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQRTLLEPIE 470
RD+E+ K E I + K ++ NN I+ L++ I+E+D +I+++++T+ E E
Sbjct: 1548 -RDAEIQKNKEEIEQQKQTISNNNNEIEQLKK-------TISERDAEIEQLKKTIAERDE 1599
Query: 471 PLSEL 475
+ +L
Sbjct: 1600 SIKQL 1604
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 93/390 (23%), Positives = 188/390 (48%), Gaps = 73/390 (18%)
Query: 125 YEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTELMK 184
YE+EI +D +I+++T+D ++++++ + I+ L + D+ KE +E+ +L +
Sbjct: 1401 YEQEIAEKDKQIEQMTNDIKSLEEVINEQSNTIDSLKQ-------DVATKE-EEIKQLKQ 1452
Query: 185 EVDKKDNYIETLLKENEANVK-LCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASK 243
V ENE +K L ++E++ E+ NKE EI I Q
Sbjct: 1453 TVS-----------ENEEVIKQLQTDIEQKDAEIQKNKE------EIEQHKQTISQRDET 1495
Query: 244 IKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEEL 303
IKQL SEI E+ + + +NNE+ + N++ R E + K +Q + E+ + E +
Sbjct: 1496 IKQLQSEI---EQHKQTIADKNNEIEQLKNTISEREETI-KQLQNEIEQHKQTMAERDAE 1551
Query: 304 IQTRRNE---KKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKY 360
IQ + E +K + EI++L++ + E + E L +DE IK+L+ EI ++
Sbjct: 1552 IQKNKEEIEQQKQTISNNNNEIEQLKKTISERDAEIEQLKKTIAERDESIKQLQNEIEQH 1611
Query: 361 KMLVQERD-------------------------QLANMLEEYKQNAKD-----------V 384
K + +RD QL + +E++KQ D V
Sbjct: 1612 KQTISQRDAEIEQLKQTVQQRDQTIAEKEDLIKQLQSEIEQHKQTISDKNNEIEQLKQTV 1671
Query: 385 VYYEDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQL 444
++ +L+ E+++ + AE RD+E+ K E I + K ++ + I+ L+ ++
Sbjct: 1672 NARDEAIKQLQSEIEQHKQTIAE----RDAEIQKNKEEIEQQKQTISQRDETIKQLQNEI 1727
Query: 445 AENNAWINEQDLKIQEIQRTLLEPIEPLSE 474
++ I+++D +I+++++T+ + + ++E
Sbjct: 1728 EQHKQTISQRDAEIEQLKQTVQQSDQTIAE 1757
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 104/425 (24%), Positives = 205/425 (48%), Gaps = 76/425 (17%)
Query: 41 RETMKQKDDKLLELAKTARTKDQ---ELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQ 97
++T+ Q+D ++ +L +T + DQ E DL +Q Q + EQH + I + E+Q
Sbjct: 1731 KQTISQRDAEIEQLKQTVQQSDQTIAEKEDLIKQLQSEI------EQHKQTIAERDAEIQ 1784
Query: 98 ------EKYQRDLNQKTETIR-----------SLTNKNLELKVQYEEEIDNRDTRIKELT 140
E+ ++ ++Q+ E+I+ ++ ++ E++ Q+++ I RD IK+L
Sbjct: 1785 KNKEEIEQQKQTISQRDESIKQMQSEIEQNKQTIADREKEIE-QHKQTIAERDNSIKQLQ 1843
Query: 141 SDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKEN 200
+ + + ++ +DAEI+K +E + ++N+ + L E ++K I L N
Sbjct: 1844 EEIEQHKQTIAERDAEIQKNKEE--------IQQKNEAINALTNEGEEKRLKILELEANN 1895
Query: 201 EANVKLCKELEKQVHELNIN---------------KELDRLNHEIIHKDD----EIRQNA 241
E + KEL V +LN++ K+L++ HE+ K + +I +
Sbjct: 1896 ENLINKVKELNDSVSDLNLSTENQNSVVKQMTDEIKDLNKQIHELEVKSENQQKQIEEKD 1955
Query: 242 SKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHE 301
+I+ L + A EE +KKL E VE L + + + E K++Q + + L + ++E
Sbjct: 1956 KEIQSLTNTKAQNEELIKKL----QEEVENLTNTKNQNEETIKNLQEQVQSLTETKNQNE 2011
Query: 302 ELIQTR----------RNEKKLELDEMREEIKRL-------EEVLKENECERAGLLTKEK 344
+LI+ + +NE + + ++E+++ L EE +K+ + E L +
Sbjct: 2012 DLIKKQQEQIQSLTNTKNENEETIKNLQEQVQSLTETKATNEETIKKLQGEVQSLTETKA 2071
Query: 345 RKDEEIKRLKEEIGKYKMLVQERDQLANMLEEYKQNAKDV-VYYEDMKDKLEVELDKLRM 403
+E+IK+ +EEI E ++L L+E QN + E+ KL+ E+ L+
Sbjct: 2072 TNEEQIKKQQEEIQSLSNTKNENEELIKKLQEEIQNLTNTKTQNEEQIKKLQEEIQNLQK 2131
Query: 404 ESAEK 408
++AEK
Sbjct: 2132 QNAEK 2136
>gi|145539868|ref|XP_001455624.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423432|emb|CAK88227.1| unnamed protein product [Paramecium tetraurelia]
Length = 1003
Score = 46.6 bits (109), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 107/420 (25%), Positives = 197/420 (46%), Gaps = 77/420 (18%)
Query: 8 KFKEWHTKISEAELLVKQI-EYKNNQLLEFEQELRETMKQKDDKLLELAKTART----KD 62
+ + + + ++A+ +KQ+ +Y + E E++L++ M Q + ++ EL A+ KD
Sbjct: 266 RVQNYEGQFNDAQAKIKQLAQY----IQELEKQLQDQMNQYEKQIKELVNNAKATEDEKD 321
Query: 63 QELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKYQRDLN----------QKTETI- 111
Q + L + + T + + +++KE K+ K R++ +KT T+
Sbjct: 322 QTIDQLEKDISNKANQLETQNKQISQLQKELKDADNKKDREVKDVQRKLDAELKKTATLE 381
Query: 112 ---RSLTNKNLELKVQ---YEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENE 165
++L +KN E Q +E+D D +I EL + + + EI+ L+KE +
Sbjct: 382 KNNKTLKDKNDEQAKQINDANQELDQLDEKIAELEQKVKDQQNQIKDLEKEIKDLNKEKQ 441
Query: 166 KLSMD---LVFKENQ----------EVTELMKEVDKKDNYIETLLKENEANVKLCKELEK 212
L D L K NQ ++ + KE+ K N E L K+ + + KEL+K
Sbjct: 442 SLIQDNNNLHQKYNQAEEKALQQQKDLAKAQKELGDKHNDAEQLNKDLDEYEQENKELQK 501
Query: 213 QVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERL 272
++++LN +++LN EI K +I Q A I++L + E K+ + NN+L E+L
Sbjct: 502 EINQLN--DSINQLNKEINQKQKQIDQQAKDIQELQENL----EKQKQDNQNNNDLDEQL 555
Query: 273 NSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKEN 332
N + + + L I +K +K ++ + +Q +NE IK+L++ +KE
Sbjct: 556 NESKKQNQKLQDQINNTEQKQNKTQDQLKNQLQDAQNE-----------IKQLKDQIKEQ 604
Query: 333 ECERAGLLT--------------------KEKRKDEEIKRLKEEIGKY-KMLVQERDQLA 371
E E+ L KE++++ EIKRL +E+ K + L Q+ DQL
Sbjct: 605 EKEKKNLQNEVNNLNKECDDLDAKLQQKIKEQQENSEIKRLNDELNKAQQQLKQKEDQLT 664
>gi|145479739|ref|XP_001425892.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392964|emb|CAK58494.1| unnamed protein product [Paramecium tetraurelia]
Length = 1014
Score = 46.2 bits (108), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 105/427 (24%), Positives = 206/427 (48%), Gaps = 80/427 (18%)
Query: 8 KFKEWHTKISEAELLVKQI-EYKNNQLLEFEQELRETMKQKDDKLLELAKTAR-TKDQEL 65
+ + + + ++A+ +KQ+ +Y + E E++L++ M Q + ++ EL A+ T+D++
Sbjct: 266 RVQNYEGQYNDAQAKIKQLAQY----IQELEKQLQDQMNQYEKQIKELLNNAKATEDEKD 321
Query: 66 RDLRQQTQQSVTKANTCE---QHLEKIKKEFKELQEKYQRDLN----------QKTETI- 111
++ Q + + KAN E + + +++KE K+ K R++ +KT T+
Sbjct: 322 HNIDQLEKDNSNKANQLEAQNKQISQLQKELKDADNKRDREVKDVQRKLDAELKKTATLD 381
Query: 112 ---RSLTNKNLELKVQ---YEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENE 165
++L +KN E Q EE+D D +I +L + + + EI+ L+KE +
Sbjct: 382 KNNKTLKDKNDEQAKQINAANEELDQLDQKIADLEQKVKDQQNQIKDLEKEIKDLNKEKQ 441
Query: 166 KLSMD-------------LVFKENQEVTELMKEVDKKDNYIETLLKENEANVKLCKELEK 212
L D ++ +++ + KE++ K N E L K+ + + KEL+K
Sbjct: 442 NLIQDNNNLHQKFNQAEEKALQQQKDLVKAQKELNDKHNNAEQLNKDLDEYEQENKELQK 501
Query: 213 QVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEI-------ASKEESMKKLIKQN 265
+++ LN ++++LN EI K +I Q A I++L + SK++ K+L + N
Sbjct: 502 EINSLN--DQINQLNKEINQKQKQIDQQAKDIQKLQENLEKQKQDNQSKQQENKQLQQNN 559
Query: 266 NELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRL 325
N+L ++LN + + + L I +K +K ++ + +Q +N EIK+L
Sbjct: 560 NDLNKQLNESKKQNQKLQDQINNTEQKQNKTQDQLKNQLQDAQN-----------EIKQL 608
Query: 326 EEVLKENECERAGLLT--------------------KEKRKDEEIKRLKEEIGKY-KMLV 364
++ +KE E E+ L KE++++ EI RL +E+ K + L
Sbjct: 609 KDQIKEQEKEKKNLQNEVNNLNKECDDLDAKLQQKIKEQQENSEINRLNDELNKAQQQLK 668
Query: 365 QERDQLA 371
Q+ DQL
Sbjct: 669 QKEDQLT 675
>gi|389603539|ref|XP_003722983.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504673|emb|CBZ14502.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 3658
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 131/268 (48%), Gaps = 47/268 (17%)
Query: 155 AEIEKLSKENEKLSMDLVFKEN--QEVTELMKEVDKKDNYIETLLKENEANVKLCKELEK 212
AEI+KL +NE+L+ +L EN +E L E++K E EN KLC + E+
Sbjct: 1931 AEIDKLRGDNERLAEEL---ENVQEEAERLAGELEKAQADAEAQRAEN---GKLCGDNER 1984
Query: 213 QVHEL-NINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVER 271
V EL ++ +E +RL E+ +E + A ++++ ++ ++ KL N LVE
Sbjct: 1985 LVEELESLQEEAERLAGELEKAQEEAERLAGELEKAQADAEAQRAENGKLCGDNERLVEE 2044
Query: 272 LNSVEPRIEVLTKDIQIKTEKLHKLAEEHEEL---IQTRRNE-KKL---------ELDEM 318
L S++ E L +++ E+ +LA E E+ + +R E KL EL+ +
Sbjct: 2045 LESLQEEAERLASELEKAQEEAERLAGELEKAQADAEAQRAENGKLCGDNERLVEELESL 2104
Query: 319 REEIKRLEEVLK--ENECER-AGLLTKEK------------------RKDEEIKRLKEE- 356
+EE +RL L+ + E ER AG L K + R EE++RL+EE
Sbjct: 2105 QEEAERLASELEKAQEEAERLAGELEKAQANAEAQRAENGKLCGDNERLAEELERLQEEA 2164
Query: 357 ---IGKYKMLVQERDQLANMLEEYKQNA 381
G+ + +E ++LA LE+ + NA
Sbjct: 2165 ERLAGELEKAQEEAERLAGELEKAQANA 2192
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 90/340 (26%), Positives = 161/340 (47%), Gaps = 46/340 (13%)
Query: 156 EIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKENEANVKLCKELEKQVH 215
E+E+L +E E+L+ +L K +E L E++K E E KLC + E+ V
Sbjct: 2659 ELERLQEEAERLAGEL-EKAQEEAERLAGELEKAQADAEAQRAEI---GKLCGDNERLVE 2714
Query: 216 EL-NINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNS 274
EL ++ +E +RL E+ +E + A ++++ ++ ++ KL N L E L S
Sbjct: 2715 ELESLQEEAERLAGELEKAQEEAERLAGELEKAQADAEAQRAENGKLCGDNERLAEELES 2774
Query: 275 VEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLK--EN 332
++ E L +++ E+ +LA E E+ +Q ++ E+ ++R + +RL + L+ +
Sbjct: 2775 LQEEAERLAGELEKAQEEAERLAGELEK-VQADAEAQRAEIGKLRGDTERLADELEKLQE 2833
Query: 333 ECER-AGLLTK-----EKRKDEEIK------RLKEEIGKYKMLVQERDQLANMLEEYKQN 380
E ER AG L K E ++ E K RL EE+ K + +E ++LA LE+ ++
Sbjct: 2834 EAERLAGELEKAQADAEAQRAENGKLCGDNERLAEELEKAQ---EEAERLAGELEKAQEE 2890
Query: 381 AKDVVY----YEDMKDKLEVELDKLRM-------------ESAEKIKLRDSELDKKHEVI 423
A+ + + + L E+DKLR E AE++ EL+K E
Sbjct: 2891 AERLAGELEKAQADAEALRAEIDKLRGDNERLAEELEKAQEEAERLA---GELEKAQEEA 2947
Query: 424 AEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQR 463
+ +E + D E Q AEN + + +E++R
Sbjct: 2948 ERLAGELE---KAQADAEAQRAENGKLCGDNERLAEELER 2984
>gi|123445769|ref|XP_001311641.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
gi|121893459|gb|EAX98711.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
Length = 1513
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 163/315 (51%), Gaps = 34/315 (10%)
Query: 75 SVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELKVQYEEEIDNRDT 134
S K + E ++K+K++ K L+E +N I+ L N + Q + EI +D
Sbjct: 367 SSEKIHALESEIQKLKQDNKSLEEALSL-VNSTKSDIKELEN----VIEQLQGEIAEKDQ 421
Query: 135 RIKELTSDTSEIDKIVSRKDAEI---EKLSKENEKLSMDLVFKENQEVTELMKEVDKKDN 191
+IKEL+S + E D+I+ + +I E +SK +K + ++ KE +E EL + +D
Sbjct: 422 KIKELSS-SKENDEILQELEVQIQEKENISKSLQKKAEEIEMKE-KENKELEQVIDSLKT 479
Query: 192 YIETLLKENEANVKLCKELEKQVHELNINKE----LDRLNHEIIHKDDEIRQNASKIKQL 247
I++L KENE K C+ N++KE +D +N +I K++EI+ N SKIK+L
Sbjct: 480 EIDSLTKENEKLNKACERASDAA--TNLSKERDMIVDEMNKDINEKEEEIQNNLSKIKEL 537
Query: 248 MSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEH------- 300
+I K+ K + QNN+ E +N ++ +I+ I+ +K+ +L E +
Sbjct: 538 EQKI--KDIETDKDLTQNNKSEEIINELQNKIQNNLSKIRKLEQKIKELEEANAQLSNNK 595
Query: 301 -EELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGK 359
EE+I +NE + L ++RE LE+ +KE E + + DE I +L+ EI K
Sbjct: 596 SEEIINELQNEIQNNLSKIRE----LEQKIKELESTQLS----NNKSDETINQLEVEIAK 647
Query: 360 YKMLVQERDQLANML 374
K +++ ++ N L
Sbjct: 648 NKETIEKINKENNYL 662
>gi|154413470|ref|XP_001579765.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
gi|121913975|gb|EAY18779.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
Length = 883
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 170/339 (50%), Gaps = 53/339 (15%)
Query: 169 MDLVFKENQEV----TELMKEVDKKDNYIETLLK----ENEANV---KLCKELEKQVHEL 217
+ ++ KEN ++ +E +K+ +KD IE L++ EN N KL KE K+ EL
Sbjct: 458 IQMITKENDDLKAQNSEYVKQNQEKDRQIEELVQSLSNENNNNADIQKLYKE--KEETEL 515
Query: 218 NINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEP 277
I+ +L EI + +I++ ++ QL I KEE + + K N++L ++ +
Sbjct: 516 LIS----QLGDEITQLNTKIQEKVDEVNQLTETILDKEEVINAVTKDNSDLNNKIAELNN 571
Query: 278 RIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKE--NECE 335
I +TK+I K EK+++L + EEL + EK+ E++ +I L E + E NE
Sbjct: 572 AISEMTKEITEKEEKINELNRKIEELNNVIK-EKEEEINRFSSKISELNESINEKINEIN 630
Query: 336 RAGLLTKE-----KRKDEEIKRLKEEIGKYKMLVQERDQLANMLEE----YKQNAKDVVY 386
E K KDE+I L + + + + E ++L N +++ + + K+
Sbjct: 631 NTNTAINELNNQIKEKDEKINELNNQNQEKQNKIDELNELNNTVQQNETKFGELNKENRE 690
Query: 387 YEDMKDKLEVELDKLRMESAEKIKL----------RDSELDKKHEVIAEMKLSME----- 431
E+ ++L E++++ S+EK K +D+E+ KK E+I E++ S++
Sbjct: 691 KENRINELNKEIERINNSSSEKDKTIANLNESLLEKDNEITKKDELIKELQESVQTKETE 750
Query: 432 ---------ANNRRIQDLEEQLAENNAWINEQDLKIQEI 461
+NN +I +L +Q+ E NA I++++ ++EI
Sbjct: 751 INQKNELISSNNTKIDELNQQINELNAQISDKENSLKEI 789
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 135/254 (53%), Gaps = 29/254 (11%)
Query: 39 ELRETMKQKDDKLLELAKTARTKDQ---ELRDLRQQTQQSVTKANTCEQHLEKIKKEFKE 95
EL +K+KD+K+ EL + K EL +L QQ+ TK + + + E
Sbjct: 638 ELNNQIKEKDEKINELNNQNQEKQNKIDELNELNNTVQQNETKFGELNKENREKENRINE 697
Query: 96 LQEKYQRDLNQKTETIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDT----SEIDK--- 148
L ++ +R N +E +++ N N L ++ + EI +D IKEL +EI++
Sbjct: 698 LNKEIERINNSSSEKDKTIANLNESL-LEKDNEITKKDELIKELQESVQTKETEINQKNE 756
Query: 149 IVSRKDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLL--KENEANVKL 206
++S + +I++L+++ +L+ + KEN +KE+ K + +E + KE E N K
Sbjct: 757 LISSNNTKIDELNQQINELNAQISDKENS-----LKEITDKVHTLEETVQNKETEINQKN 811
Query: 207 --CKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQ 264
E E +++ELN + + + + EI K++EI N SKI++L +I++KE S++
Sbjct: 812 EELSERETKINELN--EIISQKDSEIQQKNEEISSNNSKIEELNQQISNKENSLQ----- 864
Query: 265 NNELVERLNSVEPR 278
EL ++++S+E +
Sbjct: 865 --ELTDKVHSLETK 876
>gi|308464124|ref|XP_003094331.1| hypothetical protein CRE_08534 [Caenorhabditis remanei]
gi|308247909|gb|EFO91861.1| hypothetical protein CRE_08534 [Caenorhabditis remanei]
Length = 709
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 145/298 (48%), Gaps = 54/298 (18%)
Query: 27 EYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVT--------- 77
E+ + L+ +EL + K ++ LE+ + R D++ LR + ++V
Sbjct: 19 EWTVEKFLDMRKELLDRQKDQEKSNLEMVEKLRLSDEKFEKLRAKFDKTVESLESEASSE 78
Query: 78 ---KANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLE-LKVQYEEEIDNRD 133
K + +EK+ K+F +Q K LNQ +T+R + + ++ L ++++ + D
Sbjct: 79 IVEKIRLSDDKIEKLSKKFDSIQWK----LNQTNKTLRPVVSSEIKKLSMEHQGKFDWIQ 134
Query: 134 TRIKELTSDT-SEIDKIVSR---KDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKK 189
+++ E+ E+ KIV + D IEKLSKE+E E M + K
Sbjct: 135 SKLNEIDKTLKPEVPKIVEKIRLSDERIEKLSKEHE---------------EKMNSIQLK 179
Query: 190 DNYIETLLKENEANVKLCKELEKQVHELN--INKELDRLNHEIIHK-DDEIRQNASKIKQ 246
N IE K ++ E++K H+L ++K ++ L E I + +IR +I++
Sbjct: 180 LNKIEKTRKREVSS-----EIDKLSHDLQSKMDKIVEALKPETISGIEQKIRLCDERIEK 234
Query: 247 LMSE-------IASKEESMKK---LIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLH 294
L +E + ++E + K L K N+E+VE++ + + RIE ++K++Q K +K+H
Sbjct: 235 LSNEQNEKFVRLDMRKEFLNKQKDLEKSNSEIVEKVRTCDERIEKVSKELQSKMDKIH 292
>gi|38347763|dbj|BAD01607.1| myosin heavy chain [Lethenteron camtschaticum]
Length = 1491
Score = 41.6 bits (96), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 107/398 (26%), Positives = 188/398 (47%), Gaps = 69/398 (17%)
Query: 85 HLEKIKKEFKELQEKYQRDLNQKTETIRSL-----TNKNLE---LKVQY-----EEEIDN 131
HLE +K+E K LQE+ Q E+ ++L + K LE L++Q E +++
Sbjct: 1050 HLETLKRENKNLQEEISDLTEQLGESGKALHEVEKSRKQLEQEKLEIQAALEEAEASLEH 1109
Query: 132 RDTRIK----ELTSDTSEIDKIVSRKDAEIEKLSKENEK------LSMDLVFKENQEVTE 181
+ +I EL ++ID+ ++ KD E+++L + N++ S+D K E
Sbjct: 1110 EEGKILRIQLELNQVKADIDRKIAEKDEELDQLKRSNQRTVESMQASLDAETKSRNEALR 1169
Query: 182 LMKEVDKKDNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNA 241
L K+++ N +E L N AN + E Q H N+ ++ L +H DD +R N
Sbjct: 1170 LKKKMEGDLNEMEIQL--NHANRQAA---ESQKHLRNVQGQIKDLQ---VHLDDTLRSN- 1220
Query: 242 SKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHE 301
L ++A E L+ + E+ L E +V +++ TE++ L ++
Sbjct: 1221 ---DDLKEQVAMIERRNNLLLAEVEEMRTALEQTERARKVAEQELLGATERVQLLHSQNT 1277
Query: 302 ELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYK 361
LI T+R KLE D I +L+ NECE A +E R +E + K+ I
Sbjct: 1278 SLINTKR---KLEGD-----IVQLQ-----NECEEA---IQESRNADE--KAKKAITDAA 1319
Query: 362 MLVQE----RDQLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEK--------I 409
M+ +E +D A+ LE K+N + V +D++++L+ E +++ M+ +K +
Sbjct: 1320 MMAEELKKEQDTSAH-LERMKKNMEQTV--KDLQNRLD-EAEQIAMKGGKKQIQKLETRV 1375
Query: 410 KLRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAEN 447
+ +SELD + IAE RR+++L Q E+
Sbjct: 1376 RELESELDSEQRRIAETIKGARKYERRVKELTYQAEED 1413
>gi|154419561|ref|XP_001582797.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
gi|121917034|gb|EAY21811.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
Length = 1553
Score = 41.6 bits (96), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 82/310 (26%), Positives = 147/310 (47%), Gaps = 60/310 (19%)
Query: 83 EQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELKVQYEE-------EIDNRDTR 135
EQ +E++KK +E EKYQ++ T+T+ + N+ LK Q E+ EI+N + +
Sbjct: 348 EQEVEELKKSQQENDEKYQKEKEDLTQTVNNQNNEISNLKKQNEDLSNSTTNEINNLNKQ 407
Query: 136 IKELTSDTSEIDK-------IVSRKDAEIEKLSKENEKLSMDLVFKEN--QEVTELMKEV 186
I++L + S+++K VS + E+ L K N++L + K N +E L V
Sbjct: 408 IQDLQNQKSDLEKQNADYNNTVSNNNDELANLKKLNQELQNE---KSNLQKETENLSNTV 464
Query: 187 DKKDNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQ 246
+ K+N IE L K+NE L +E +QN K+K+
Sbjct: 465 NDKNNEIEELKKQNED-----------------------LQNE--------KQNLQKVKE 493
Query: 247 -LMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQ 305
L + I +K++ +K L KQN +L + N +E + E L + K +L+ L ++++L +
Sbjct: 494 DLTNTITTKDDEIKDLKKQNEDLQNQNNDLEKQKEDLNNTVANKDSELNNLKNDNQQLQE 553
Query: 306 TRR------NEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGK 359
+ N K ++ +++ LE + E E GL+ K +D ++ L+ G+
Sbjct: 554 ANKKQNDDINNLKKSNQDLEDKVTDLEGKIDEMTAENEGLMENVKTRDLQLDNLQ---GE 610
Query: 360 YKMLVQERDQ 369
+ V E +Q
Sbjct: 611 HSQTVDELNQ 620
>gi|123438677|ref|XP_001310118.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
gi|121891874|gb|EAX97188.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
Length = 3977
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 114/418 (27%), Positives = 199/418 (47%), Gaps = 109/418 (26%)
Query: 31 NQLLEFEQELRETMKQKDDKLLE--------LAKTARTKDQELRDLR------------- 69
NQ+ + ++ L ET KQ DDKL+E L ++KDQ + DL
Sbjct: 3185 NQIDQLKKLLEET-KQNDDKLVEELTKEIEKLKNEQQSKDQNINDLSALNKDKSSLIQQN 3243
Query: 70 ----QQTQQSVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELKVQ- 124
++TQ+ Q +E +KK+ + LQ+ + + N+ + I LT +L +
Sbjct: 3244 DDLSKKTQEFYNSQQNQAQMIEDLKKQNESLQKNLEINNNETQQNIDQLTKDKSDLASKL 3303
Query: 125 --YEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTEL 182
YE +I++ ++ IKEL +E + I+ +K+ E + + ++ DL+ K NQ L
Sbjct: 3304 HDYEAKINDLNSLIKEL----NEKNAIIEKKNYEFSQQLE----VNNDLISKNNQ----L 3351
Query: 183 MKEVDK--KDNYIETLLKENEANVKLCKELEKQVHEL-NINKEL-DRLNH------EIIH 232
+ +D+ KD + L KQ+ +L N N E+ ++LN+ E
Sbjct: 3352 QQTIDQLNKDKTV----------------LSKQIQDLANKNNEITNQLNNKDKIILESKQ 3395
Query: 233 KDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEK 292
K DE+ Q+ S LM E+ + +K NN+ LNS + + +++Q++ EK
Sbjct: 3396 KSDELNQSLSN---LMKELHT--------LKANND---DLNSQISQSKQNEENLQLQIEK 3441
Query: 293 LHKLAEEHEELIQTRRNEKKLELDEMREEIKRL-------EEVLKENECERAGLLTKEKR 345
KL L T++N+ KL +D++ +E++ L EE++K+N + +G+L + ++
Sbjct: 3442 QKKL------LQDTKQNDNKL-VDDLSKEVETLTSEKLKNEEIIKQNNAKYSGILKQLQQ 3494
Query: 346 KDEEIKRLKEEIGKYKMLVQERDQLANMLEEYKQ-NAKDVVYYEDMKDKLEVELDKLR 402
K+EEI + E++Q + LE KQ N K V KDKL E +KL+
Sbjct: 3495 KNEEINK-------------EKEQFKHDLEGEKQKNEKLVNDLNQTKDKLSQENEKLK 3539
>gi|17536671|ref|NP_496914.1| Protein W02B8.2 [Caenorhabditis elegans]
gi|3880401|emb|CAB03458.1| Protein W02B8.2 [Caenorhabditis elegans]
Length = 1256
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 115/427 (26%), Positives = 207/427 (48%), Gaps = 58/427 (13%)
Query: 103 DLNQKTETIRSLTNKNLELKVQ----------YEEEIDNRDTRIKELTSDTSEIDKIVSR 152
DL QK IR L N+ L L+ + Y+E++ + RI+ L S +++ +S
Sbjct: 100 DLKQKESQIRILQNRCLRLETEKQKMQDTISGYQEDLKENEIRIENLNSRLHKLEDELSA 159
Query: 153 KDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKE--NEANVKLCKEL 210
K EI + +E + +M L K +Q T+L E+ ++ +E +++ E VK + L
Sbjct: 160 KTHEIFSIGEELKNKTMKLNEKNSQFQTKLA-EISSENRNLERKVQKFREELIVKDQRSL 218
Query: 211 EKQVHELNINKEL-------DRLNH---------EIIHKDDEIRQNASKIKQLMSEIASK 254
E + N K L DRL++ I +DD ++ +A I++ MSE+ K
Sbjct: 219 EVHQDQENTQKVLKEVKQLSDRLDYLTPKRKDVSRIKERDDFLQFSAKIIEETMSELKLK 278
Query: 255 EESMKKLIKQNNELV----ERLNSVEPRIEVLTKDIQIKTEKLH----KL----AEEHEE 302
+++ + + ELV E L ++ + D + T+ LH KL A+ +
Sbjct: 279 NARLERELSEKEELVKVTKEELQELQKTVTQAMGDSEQATKYLHAENMKLTRQKADIRCD 338
Query: 303 LIQTRRN-----EKKLEL----DEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRL 353
L++ RRN EK+ EL DE E+++R+ E+ K E E L + ++E+I L
Sbjct: 339 LLEARRNLKGFDEKREELEKQRDEALEDVRRITELKKNVEIELRSLKLLAEEREEQIDEL 398
Query: 354 KEEIGKYKMLVQERDQLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEKIKLRD 413
K + Y++L ++ + + N L + ++ + + M DK L+ E+AE + R
Sbjct: 399 KSRVAGYEVLRRDHEAVKNELAKAEEKLNKMGAHLVMADKQSSHFKTLK-ETAEGSRRR- 456
Query: 414 SELDKKHEVIAEMKLSMEA---NNRRI-QDLEEQLAENNAWINEQDLKIQEIQRTLLEPI 469
+++ +E++A ++ +EA N R++ Q+LE AEN + + +EIQ T L+
Sbjct: 457 -AIEQCNEMVARIR-GLEASLENQRKVEQELEMVKAENVRQAKKIEFMKEEIQETHLDYR 514
Query: 470 EPLSELV 476
E LS+L
Sbjct: 515 EELSKLA 521
>gi|26325538|dbj|BAC26523.1| unnamed protein product [Mus musculus]
Length = 524
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 166/349 (47%), Gaps = 37/349 (10%)
Query: 33 LLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKE 92
LLE ++ +++ + D +LA+ + K RDL + T Q +K + ++K E
Sbjct: 127 LLEMQESIKDLSAIRADLANKLAEEEKAKKAVFRDLSELTAQVKSKEEETATAITQLKLE 186
Query: 93 FKELQEKYQRDLNQKTETIRSLTNKNL-ELKVQYEEEIDNRDTRIKELTSDTSEIDKIVS 151
+++ ++ DL+ E+++ +N+ EL +++E ++ I+ L +++ E +
Sbjct: 187 -RDVHQRELEDLSSSLESVKLKHEQNIQELMKHFKKEKSEAESHIRMLKAESLEDKNMAK 245
Query: 152 RKDAEIEKLSKENEKLSMDLVFKENQ------EVTELMKEVDKKDNYIET------LLKE 199
++EKL + EKL+ +L EN+ + L +E+DKK+ YI T ++E
Sbjct: 246 AHLGQLEKLKSQCEKLTEELTHTENENKKLKLKYQSLKEELDKKEKYISTEEEHLRRMEE 305
Query: 200 NEANVK---LCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEE 256
+ ++K LC E E++ I KE+D R + K+K L S +
Sbjct: 306 SRLHLKDQLLCLETEQESILGVIGKEIDEACKTFS------RDSLEKLKVLTSGPQLHYD 359
Query: 257 SMKKLIKQNNEL---VERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKL 313
+ L + +L E L E R + + + ++L +L E K+
Sbjct: 360 PHRWLAESKTKLQWLCEELKERESRERSMRQQLAACRQELRELTEH-----------KES 408
Query: 314 ELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKM 362
EL + E I+R E++L+E E+ GLL + +RKDEE++ L++ + +M
Sbjct: 409 ELLCLFEHIERQEQLLEEFHQEKRGLLEETQRKDEEVETLQDRVNALQM 457
>gi|148687007|gb|EDL18954.1| RIKEN cDNA 4930534B04 [Mus musculus]
Length = 1026
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 166/349 (47%), Gaps = 37/349 (10%)
Query: 33 LLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKE 92
LLE ++ +++ + D +LA+ + K RDL + T Q +K + ++K E
Sbjct: 629 LLEMQESIKDLSAIRADLANKLAEEEKAKKAVFRDLSELTAQVKSKEEETATAITQLKLE 688
Query: 93 FKELQEKYQRDLNQKTETIRSLTNKNL-ELKVQYEEEIDNRDTRIKELTSDTSEIDKIVS 151
+++ ++ DL+ E+++ +N+ EL +++E ++ I+ L +++ E +
Sbjct: 689 -RDVHQRELEDLSSSLESVKLKHEQNIQELMKHFKKEKSEAESHIRMLKAESLEDKNMAK 747
Query: 152 RKDAEIEKLSKENEKLSMDLVFKENQ------EVTELMKEVDKKDNYIET------LLKE 199
++EKL + EKL+ +L EN+ + L +E+DKK+ YI T ++E
Sbjct: 748 AHLGQLEKLKSQCEKLTEELTHTENENKKLKLKYQSLKEELDKKEKYISTEEEHLRRMEE 807
Query: 200 NEANVK---LCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEE 256
+ ++K LC E E++ I KE+D R + K+K L S +
Sbjct: 808 SRLHLKDQLLCLETEQESILGVIGKEIDEACKTFS------RDSLEKLKVLTSGPQLHYD 861
Query: 257 SMKKLIKQNNEL---VERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKL 313
+ L + +L E L E R + + + ++L +L E K+
Sbjct: 862 PHRWLAESKTKLQWLCEELKERESRERSMRQQLAACRQELRELTEH-----------KES 910
Query: 314 ELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKM 362
EL + E I+R E++L+E E+ GLL + +RKDEE++ L++ + +M
Sbjct: 911 ELLCLFEHIERQEQLLEEFHQEKRGLLEETQRKDEEVETLQDRVNALQM 959
>gi|159877|gb|AAA29413.1| myosin-like antigen, partial [Onchocerca volvulus]
Length = 630
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 66/210 (31%), Positives = 97/210 (46%), Gaps = 39/210 (18%)
Query: 219 INKELDRLNHEIIHKDDEIRQNASK-IKQLMSEIASKEESMKKLIKQNNELVERLNSVEP 277
+N ELDRL E++ +R++A K I + +++ + +K L NNE+ +LN+ E
Sbjct: 274 LNMELDRLRDELL----SVRRDAEKEINRYNTDLQTARNEIKLLTSTNNEMKSQLNAAED 329
Query: 278 RIEVLTKDIQIKTEKLHKLAEE-HE---ELIQTRRNEKKL--ELDEMREEIKRLEEVLKE 331
+I L K I + K+ L E H EL + N L ELD RE I L E
Sbjct: 330 KINSLNKVITDQQNKIRDLTGEVHHLEGELKDAKGNVANLESELDTTRERIHLLGE---- 385
Query: 332 NECERAGLLTKEKRKDEEIKRLKEEI----GKYKMLVQER-------DQLANMLEEYKQN 380
+ A L T E+ ++K +I G+ ML + D+L L+ +N
Sbjct: 386 ---QNASLQT-------ELNKIKGDIDSLFGENDMLKTAKESNEAEIDRLKQKLQRSIEN 435
Query: 381 AKDVVYYEDMKDKLEVELDKLRMESAEKIK 410
AK Y D DKL E D+L+ EKIK
Sbjct: 436 AK---KYSDALDKLRPEYDRLQNLYREKIK 462
>gi|160333189|ref|NP_861536.3| centrosomal protein of 128 kDa [Mus musculus]
gi|166214933|sp|Q8BI22.2|CE128_MOUSE RecName: Full=Centrosomal protein of 128 kDa
gi|219518696|gb|AAI45502.1| RIKEN cDNA 4930534B04 gene [Mus musculus]
Length = 1102
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 166/349 (47%), Gaps = 37/349 (10%)
Query: 33 LLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKE 92
LLE ++ +++ + D +LA+ + K RDL + T Q +K + ++K E
Sbjct: 629 LLEMQESIKDLSAIRADLANKLAEEEKAKKAVFRDLSELTAQVKSKEEETATAITQLKLE 688
Query: 93 FKELQEKYQRDLNQKTETIRSLTNKNL-ELKVQYEEEIDNRDTRIKELTSDTSEIDKIVS 151
+++ ++ DL+ E+++ +N+ EL +++E ++ I+ L +++ E +
Sbjct: 689 -RDVHQRELEDLSSSLESVKLKHEQNIQELMKHFKKEKSEAESHIRMLKAESLEDKNMAK 747
Query: 152 RKDAEIEKLSKENEKLSMDLVFKENQ------EVTELMKEVDKKDNYIET------LLKE 199
++EKL + EKL+ +L EN+ + L +E+DKK+ YI T ++E
Sbjct: 748 AHLGQLEKLKSQCEKLTEELTHTENENKKLKLKYQSLKEELDKKEKYISTEEEHLRRMEE 807
Query: 200 NEANVK---LCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEE 256
+ ++K LC E E++ I KE+D R + K+K L S +
Sbjct: 808 SRLHLKDQLLCLETEQESILGVIGKEIDEACKTFS------RDSLEKLKVLTSGPQLHYD 861
Query: 257 SMKKLIKQNNEL---VERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKL 313
+ L + +L E L E R + + + ++L +L E K+
Sbjct: 862 PHRWLAESKTKLQWLCEELKERESRERSMRQQLAACRQELRELTEH-----------KES 910
Query: 314 ELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKM 362
EL + E I+R E++L+E E+ GLL + +RKDEE++ L++ + +M
Sbjct: 911 ELLCLFEHIERQEQLLEEFHQEKRGLLEETQRKDEEVETLQDRVNALQM 959
>gi|444792584|gb|AGE12543.1| SzM [Streptococcus equi subsp. zooepidemicus]
Length = 536
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 98/181 (54%), Gaps = 22/181 (12%)
Query: 82 CEQHLEKIKKEFKELQEKYQRDLNQKTETIR-SLTNKNLEL---KVQYEEEIDNRDTRIK 137
E+ L+++++ +E QE+ + L+QK E + L N L K +YE E+D+ +++
Sbjct: 136 AEEKLKQVEESLRE-QEEQTKSLSQKLEDKQMDLENAALGYVAEKQRYEAELDDLKKQVE 194
Query: 138 ELTSDTSEIDKIVSRKDAEIEKLSKE-NEKLSMDLVF---------KENQEVTELMKEVD 187
EL + S D+++S K +E+ KL++E N + M F K+N E+ +L +++D
Sbjct: 195 ELEVELSSADELLSSKHSELTKLTQEFNRREKMYNTFLDQAKEEISKQNDEINQLTQDLD 254
Query: 188 KKDNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEII-HKDDEIRQNASKIKQ 246
++ N E L + L KQ E+ K+LDR+N ++ + DE+ + ++K +Q
Sbjct: 255 RQINMYEAFLSQ------YKDALAKQEEEITKLKDLDRINRNLLGNAKDELDKLSAKNEQ 308
Query: 247 L 247
L
Sbjct: 309 L 309
>gi|145521570|ref|XP_001446640.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414118|emb|CAK79243.1| unnamed protein product [Paramecium tetraurelia]
Length = 537
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 58/106 (54%), Gaps = 17/106 (16%)
Query: 76 VTKANT-CEQHLE-------KIKKEFKELQEKYQRDLNQKTETIRSL--TNKNLELKVQY 125
+TK T CE+++E +I+KE+ + QEK+ + L +K +++L T KN K+
Sbjct: 35 LTKYKTECEKYIETIQKEHERIQKEYGQEQEKFNKSLKEKINQVKALVDTEKNNTKKL-- 92
Query: 126 EEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDL 171
I N ++ E+T S D +V KD EIE L KE +KL + +
Sbjct: 93 ---IANAQQKVVEITQ--SSFDSLVQVKDQEIENLKKEQQKLQIQI 133
>gi|366998906|ref|XP_003684189.1| hypothetical protein TPHA_0B00830 [Tetrapisispora phaffii CBS 4417]
gi|357522485|emb|CCE61755.1| hypothetical protein TPHA_0B00830 [Tetrapisispora phaffii CBS 4417]
Length = 2536
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 79/295 (26%), Positives = 132/295 (44%), Gaps = 64/295 (21%)
Query: 68 LRQQTQQSVTKANTCEQHLEKIKKE--------------FKELQEKYQRDLNQKTETIRS 113
L +Q +V K N E+ L+++KKE +LQEKY ++N + I
Sbjct: 1317 LYKQYNDNVEKINNYEESLKQLKKEKTTLQANLKTTEAELNDLQEKYSANVNSYNDIISE 1376
Query: 114 LTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKL-------SKENEK 166
K + EEI D +IKEL E DK+++ ++AEI+ + + ++
Sbjct: 1377 --------KDKLTEEIKLSDDKIKELELVLEEKDKVITNQNAEIKNIKLDFEEKGNDYKR 1428
Query: 167 LSMDL------VFKENQEVTELM-KEVDKKDNYIETLLKENEANVKLCKEL--------- 210
+DL + KE+ E LM K + + +ETL E+ + L E
Sbjct: 1429 TIVDLKAKLSDLTKESAESKLLMEKTIKDLEKNMETLKSESVISSNLSTERITSLELEIA 1488
Query: 211 ----EKQVHELNINKELDRLNHEIIH----KDDEIRQNASKIKQLMSEIASKEESMKKLI 262
EK++++ N N ++ L +II+ KDD +N ++ +L SEI + K+I
Sbjct: 1489 KLTEEKKINQTNFNDKISELKSDIINLEKEKDDSKEKNKTESAELKSEIEELQNKNFKMI 1548
Query: 263 KQNNEL-------VERLNSVEPRIEV----LTKDIQIKTEKLHKLAEEHEELIQT 306
+++ VERL + +V LT D++ K E LHKL E+ I +
Sbjct: 1549 EEHTITLDALKTDVERLEKAKEDFQVQNNSLTSDLETKNELLHKLKEKSANTISS 1603
>gi|145504665|ref|XP_001438299.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124405471|emb|CAK70902.1| unnamed protein product [Paramecium tetraurelia]
Length = 986
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 92/333 (27%), Positives = 171/333 (51%), Gaps = 35/333 (10%)
Query: 63 QELRDLRQQTQQSVTKANTCEQHLEK----------IKKEFKELQ-EKYQRDLNQKTETI 111
Q+ DLR+Q Q+ + N +Q LE IK + ELQ EK Q LN K +
Sbjct: 422 QQSDDLRRQNQELAQENNNLQQDLENQTQNLGQLDEIKDQLNELQDEKNQ--LNDKVSDL 479
Query: 112 RSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDL 171
++ + L Q ++E+++ R+K+L + E+D + + +++K K+N++L+ +
Sbjct: 480 QNNLKEKQRLFDQKQKELEDALKRVKDLEAKLLEMDHYIDTLEDDLQKFEKDNQQLNREA 539
Query: 172 VFKE--NQEVTELMKEVDKKDNYIETLLKEN---EANVKLCKELEKQVHELNINKELDRL 226
K+ ++E+ L +D+ N + KE + N+ ++L+ ++ + ELDR
Sbjct: 540 GQKQLADRELERLRGLLDQMKNQYDQQQKELGKLKNNLDQMRDLQDELAQ--AKSELDRA 597
Query: 227 NHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDI 286
N I + DE+ Q ++I QL+ E+ + EES +L QNN L + L + +T
Sbjct: 598 NSVIAQQQDELAQKENEISQLVREVQNLEESNNQLQDQNNNLQQTLQEQ----QAVTNGN 653
Query: 287 QIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRK 346
Q + KL ++AE+++E I ++LEL E +EE LK+++ A L+T+ +R
Sbjct: 654 QEELTKLRRIAEDYKEKI------RQLEL--KFNEYSNMEERLKQSDHRIAVLMTEIERL 705
Query: 347 DEEIKRLKEEIGKYKMLVQERDQLANMLEEYKQ 379
+ +K++ E +K Q+ ++ L EY+Q
Sbjct: 706 NSLVKQITSETEDWK---QKHSRIELALLEYRQ 735
>gi|441622736|ref|XP_003264109.2| PREDICTED: LOW QUALITY PROTEIN: CDK5 regulatory subunit-associated
protein 2 [Nomascus leucogenys]
Length = 1890
Score = 38.9 bits (89), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 159/338 (47%), Gaps = 45/338 (13%)
Query: 132 RDTRIKELTSDTSEIDKIVSRKDAEIEKLS----KENEKL----SMDLVFKENQEVTELM 183
RD I+ LT +K V ++EIEKLS K E L + + E+ E T
Sbjct: 315 RDKAIQGLTMALKSKEKKVEELNSEIEKLSAAFAKAREALRKAQTQEFQGSEDYEATLSG 374
Query: 184 KEVDKKDNYIETLLKENEANVKLCKELEKQVHELN-INKELDRLNHEI--IHK-----DD 235
KE + + L K E N +LC+ + K EL+ + +E +RL ++ H+ D
Sbjct: 375 KEALLAELRSQNLTKSTE-NHRLCRSIXKITQELSDLQQERERLEXDLEEAHREKSKGDC 433
Query: 236 EIRQNASKIKQLMSEIASKEESM----KKLIKQNNELVERLNSVEPRIEVLTKDIQIKTE 291
IR +++++L +E+ +E++M K L+ ++N+ +L++ E I+ LT+ K
Sbjct: 434 TIRDLRNEVEKLRNEVNEREKAMENRYKSLLSESNK---KLHNQEQVIKHLTESTNQKDV 490
Query: 292 KLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEE-- 349
L K E+ E+IQ E E+R E E+ + + +KEK++ E
Sbjct: 491 LLQKFNEKDLEVIQQNHYLMAAEDLELRSEGLITEKCSSQRSPGSETIFSKEKKQSSEYE 550
Query: 350 --IKRLKEEIGKYKMLV---QERDQLANMLEEY-------KQNAKDVVYYEDMKDKLEVE 397
I+ LK+E Y LV QE D + N+ E KQ +DV+ Y++++ LE +
Sbjct: 551 ELIQVLKKEQDIYTHLVKSLQESDSINNLQAELNNIFALRKQLEQDVLSYQNLRKTLEEQ 610
Query: 398 LDKLRMESAEKIKLRDSELDKKHEVIAEMKLSMEANNR 435
+ ++R E L +D+ + + + +E NNR
Sbjct: 611 IGEIRRREEESFSLY---IDQ----TSYLSICLEENNR 641
>gi|297813019|ref|XP_002874393.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297320230|gb|EFH50652.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 1191
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 61/99 (61%), Gaps = 11/99 (11%)
Query: 244 IKQLMSEIASKEESM-----KKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAE 298
I++L SEI SKEE++ K+L+KQ +L+S E R+E T + IK ++L + +
Sbjct: 464 IRKLSSEIVSKEETIQQLSEKQLVKQR-----KLDSTEKRLENTTAEFVIKEKELGSVKD 518
Query: 299 EHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERA 337
+ E +Q ++K EL ++EEIK++++ LK+ + + A
Sbjct: 519 TYRECLQNWEIKEK-ELKSLQEEIKKIQDSLKDFQSKEA 556
>gi|225619396|ref|YP_002720622.1| hypothetical protein BHWA1_00453 [Brachyspira hyodysenteriae WA1]
gi|225214215|gb|ACN82949.1| hypothetical protein BHWA1_00453 [Brachyspira hyodysenteriae WA1]
Length = 7854
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 68/123 (55%), Gaps = 8/123 (6%)
Query: 230 IIHK--DDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQ 287
II+K +DE+ + KI +S + K +S ++ NN + + +NS+ + E L+KD
Sbjct: 3658 IIYKKIEDEVSKINEKIDSNLSTVTEKFDS--NIVSLNNSIYDDINSIIEKYESLSKDFD 3715
Query: 288 IKTEKLHKLAEEHEELIQTRRNEKKL---ELDEMREEIKRL-EEVLKENECERAGLLTKE 343
E L + EL++T+ +EK+L ++DE+++E + E++L N+ L +E
Sbjct: 3716 SSLEVLKDSILDRNELLETQSSEKELLVSKVDEIKKEFESFKEDILNTNKESIPALFDEE 3775
Query: 344 KRK 346
K+K
Sbjct: 3776 KQK 3778
>gi|156839520|ref|XP_001643450.1| hypothetical protein Kpol_483p11 [Vanderwaltozyma polyspora DSM
70294]
gi|156114061|gb|EDO15592.1| hypothetical protein Kpol_483p11 [Vanderwaltozyma polyspora DSM
70294]
Length = 2546
Score = 38.9 bits (89), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 80/284 (28%), Positives = 137/284 (48%), Gaps = 49/284 (17%)
Query: 210 LEKQVHELNI-NKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNEL 268
L+K +L I N D+ N E+ ++ IR+ SKI L ++ KE + +L
Sbjct: 2198 LKKVTSDLQIANNTSDKYNDELKVANNTIREIESKIPNLQKQLDLKEIEYNDTLSSKKDL 2257
Query: 269 VERLNSVEPRIEVLTKDIQ-IKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEE 327
++L++ E+L+K+++ +K EKL L +E +LI +ELD+ + +I L
Sbjct: 2258 DKKLDNFSKESEILSKEVKSLKKEKLD-LEKEKNDLI--------VELDKYKNDIIDLNN 2308
Query: 328 VLKEN-------ECERAGLLTKEKRKDEEIKRLKEEIGK----YKMLVQERD-------- 368
LKEN E E GL K + + +L EE GK K LV ERD
Sbjct: 2309 TLKENEKNGSEVETEIIGLKDKIEFYKLNVSQLNEEKGKLESDIKTLVGERDDKLKSSQS 2368
Query: 369 ---QLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAE---KIKLRDSELDKKHEV 422
+L N LE +K + V YE+ K VE D L+ + +E ++ + +D ++
Sbjct: 2369 EIIELTNKLESFKDIS---VAYEESK----VESDGLKNKLSEIEKSLEESNENVDSANKE 2421
Query: 423 IAEMKL---SMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQR 463
I+++K+ ++++N ++D +QL + N KIQ+I +
Sbjct: 2422 ISDLKVMNDKLQSHNLELEDKFQQLKNDTTETNR---KIQDINK 2462
>gi|85074861|ref|XP_965798.1| hypothetical protein NCU00658 [Neurospora crassa OR74A]
gi|28927611|gb|EAA36562.1| predicted protein [Neurospora crassa OR74A]
Length = 4007
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 64/274 (23%), Positives = 132/274 (48%), Gaps = 23/274 (8%)
Query: 104 LNQKTETIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKE 163
L Q+ E I L N+ +EL + +D RDT + T + + + + V R E KL K+
Sbjct: 3115 LAQRMEEINGLRNEMVEL----NKALDTRDTTFLQNTDEINRLKENVRRLGDETTKLKKD 3170
Query: 164 NEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKENEANVKLCKELEKQVHELNINKEL 223
KL D E + V + E++K ++ I+ L +E + L + +++ L E+
Sbjct: 3171 TVKLKEDSKSWE-ETVKQRQTEINKLNDNIKNLQEEIKRKEALLATRQGEINALK--DEI 3227
Query: 224 DRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLT 283
L ++ KD +++ ++ + IA+KE ++++L K+ L E++ +E +
Sbjct: 3228 VGLKKDLAEKDAQLKSRDGELGKFRKSIAAKETALERLEKEKTALREKVEHLEGEVGRRR 3287
Query: 284 KDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKE 343
+ + ++ +K+ +L + +L+LD+ R K LEE GL +
Sbjct: 3288 RSLDLRADKILELTNSE--------SAARLDLDKQRARNKSLEET-------NTGLRNRA 3332
Query: 344 KRKDEEIKRLKEEIGK-YKMLVQERDQLANMLEE 376
++ + RL +E+ K + L+++ + LA++ +E
Sbjct: 3333 AKEGGSLGRLGDEVTKLSRQLLEKENDLASLRDE 3366
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.308 0.128 0.325
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,601,109,423
Number of Sequences: 23463169
Number of extensions: 279966843
Number of successful extensions: 3424240
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6120
Number of HSP's successfully gapped in prelim test: 151506
Number of HSP's that attempted gapping in prelim test: 2277348
Number of HSP's gapped (non-prelim): 607958
length of query: 495
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 348
effective length of database: 8,910,109,524
effective search space: 3100718114352
effective search space used: 3100718114352
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 79 (35.0 bits)