BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy444
         (495 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|145497979|ref|XP_001434978.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402106|emb|CAK67581.1| unnamed protein product [Paramecium tetraurelia]
          Length = 5605

 Score = 88.6 bits (218), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 87/471 (18%), Positives = 215/471 (45%), Gaps = 21/471 (4%)

Query: 6    NLKFKEWHTKISEAEL---LVKQIEYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKD 62
            + K K   T + E+ L   +V + +  N+Q  E EQ  +E +KQ +  +  + +  +  +
Sbjct: 1061 SFKMKNEITDVVESYLKGNVVIKPQISNSQKKEQEQSTQE-IKQVNSDIKVVNQEVKQVN 1119

Query: 63   QELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKYQ------RDLNQKTETIRSLTN 116
            QE++   Q+ +Q+  +  +  Q + +   E +++ ++ +      +  NQ+T+ +   T 
Sbjct: 1120 QEIKQTNQEEKQTTQETKSVNQEIRQNNSETQQINQETKSVISETKSTNQETQQVNQETK 1179

Query: 117  KNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKEN 176
            +  +   Q  +E    +   K+   +T E+ +   + + EI++ ++EN++++ + V +E 
Sbjct: 1180 QTNQEVKQTTQETKQINQETKQTNQETREVSQETKQVNQEIKQTTQENKQVNQE-VKQET 1238

Query: 177  QEVTELMKEVDKKDNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEIIHKDDE 236
            Q+V +  K+V ++        ++    VK   +  KQV     N+E+ +   E    +++
Sbjct: 1239 QQVNQQTKQVSQETQQTNQETRQTTQEVKQTNQESKQV-----NQEVKQTTQETKQTNEQ 1293

Query: 237  IRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLH-- 294
             +Q   +IKQ   +I    +  K++ ++  ++ ++  +  P      K+    T  L   
Sbjct: 1294 TKQTNEQIKQSNEQIKQATQETKQITQEIKQVDQKQQTAVPYDHDTIKEGYSDTPGLKPG 1353

Query: 295  KLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENE--CERAGLLTKEKRKDEEIKR 352
            K+ +  ++  Q + N+K   LD   E+ K +   +++ E   E A  +  + + D+ +  
Sbjct: 1354 KIDQSEQQECQEQTNDKDTPLDRPIEK-KSVSRTVQQQEQTSEEAQAIIIDSKVDQSLSN 1412

Query: 353  LKEEIGKYKMLVQERDQLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEKIKLR 412
             ++     K + Q+ +   N  +E +Q    V       + +  + +     + ++ +  
Sbjct: 1413 SEQNQEIIKKVDQKVESSQNNAQETEQVTSKVTETTSQTNSITQQTNDQSSITNKQTQQT 1472

Query: 413  DSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQR 463
            +  + + ++ I E   S+  NN+ +Q+  E   +NN  I E +  +Q++ +
Sbjct: 1473 NETIQQNNKTIQETNESISQNNKTVQETNETTQQNNKTIQETNETVQQVNK 1523


>gi|157869463|ref|XP_001683283.1| hypothetical protein LMJF_22_1320 [Leishmania major strain Friedlin]
 gi|68224167|emb|CAJ04669.1| hypothetical protein LMJF_22_1320 [Leishmania major strain Friedlin]
          Length = 2046

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 79/457 (17%), Positives = 205/457 (44%), Gaps = 12/457 (2%)

Query: 15   KISEAELLVKQIEYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQ 74
            ++ EAE   + +E + +       +LRE +++ ++   ++      +D E+ DLR+Q ++
Sbjct: 1212 QLREAEERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLRE 1271

Query: 75   SVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELKVQYEE----EID 130
            +  +A   E        E  +L+E+ +       +     ++++ E+    E+    E  
Sbjct: 1272 AEERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEH 1331

Query: 131  NRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQE-VTELMKEVDKK 189
             RD   ++   D    D     ++AE      E ++   D      +E ++  M+E    
Sbjct: 1332 ARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEIDRVKELLSSSMREAASS 1391

Query: 190  DNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMS 249
               +  L ++ E   +  + L +Q+    + +E   L+ E+    +++R+     + + +
Sbjct: 1392 GEMLGALEEQREEAAREMRGLREQLAVAQVRRE--ALDAEVADLREQLREAEEHARDVEA 1449

Query: 250  EIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRN 309
            + + ++  +  L +Q  E  ER   VE +      ++    E+L + AEEH   ++ +++
Sbjct: 1450 QQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLREQLRE-AEEHARDVEAQQS 1508

Query: 310  EKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQERD- 368
            ++  E+ ++RE+++  EE  ++ E +++    +     E+++  +E     +    +RD 
Sbjct: 1509 DRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDA 1568

Query: 369  ---QLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAE 425
                L   L E ++ A+DV   +  +D    +L +   E+ E+ +  +++   +   +A+
Sbjct: 1569 EVADLREQLREAEERARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVAD 1628

Query: 426  MKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQ 462
            ++  +       +D+E Q ++ +A + +   +++E +
Sbjct: 1629 LREQLREAEEHARDVEAQQSDRDAEVADLREQLREAE 1665



 Score = 60.8 bits (146), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 82/441 (18%), Positives = 206/441 (46%), Gaps = 28/441 (6%)

Query: 39   ELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQE 98
            +LRE +++ ++   ++      +D E+ DLR+Q +++  +A   E        E  +L+E
Sbjct: 1180 DLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLRE 1239

Query: 99   KYQRDLNQKTETIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIE 158
            + +       +     ++++ E+     E++   + R +++ +  S+ D  V    A++ 
Sbjct: 1240 QLREAEEHARDVEAQQSDRDAEV-ADLREQLREAEERARDVEAQQSDRDAEV----ADLR 1294

Query: 159  KLSKENEKLSMDLVFKENQ------EVTELMKEVDKKDNYIETLLKENEANVKLCKELEK 212
            +  +E E+ + D+  +++       ++ E ++E ++    +E    + +A V   +E  +
Sbjct: 1295 EQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLR 1354

Query: 213  QVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNE----- 267
            +  E   + E  + +     +D EI +    +   M E AS  E +  L +Q  E     
Sbjct: 1355 EAEEHARDVEAQQSD-----RDAEIDRVKELLSSSMREAASSGEMLGALEEQREEAAREM 1409

Query: 268  --LVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRL 325
              L E+L   + R E L  ++    E+L + AEEH   ++ +++++  E+ ++RE+++  
Sbjct: 1410 RGLREQLAVAQVRREALDAEVADLREQLRE-AEEHARDVEAQQSDRDAEVADLREQLREA 1468

Query: 326  EEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQERD----QLANMLEEYKQNA 381
            EE  ++ E +++    +     E+++  +E     +    +RD     L   L E +++A
Sbjct: 1469 EERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEHA 1528

Query: 382  KDVVYYEDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAEMKLSMEANNRRIQDLE 441
            +DV   +  +D    +L +   E+ E  +  +++   +   +A+++  +     R +D+E
Sbjct: 1529 RDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVE 1588

Query: 442  EQLAENNAWINEQDLKIQEIQ 462
             Q ++ +A + +   +++E +
Sbjct: 1589 AQQSDRDAEVADLREQLREAE 1609



 Score = 56.2 bits (134), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 74/447 (16%), Positives = 202/447 (45%), Gaps = 12/447 (2%)

Query: 15   KISEAELLVKQIEYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQ 74
            ++ EAE   + +E + +       +LRE +++ +++  ++      +D E+ DLR+Q ++
Sbjct: 1240 QLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLREQLRE 1299

Query: 75   SVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLE---LKVQYEEEIDN 131
            +   A   E        E  +L+E+ +       +     ++++ E   L+ Q  E  ++
Sbjct: 1300 AEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEEH 1359

Query: 132  RDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDN 191
                  + +   +EID++     + + + +   E L    + ++ +E    M+ + ++  
Sbjct: 1360 ARDVEAQQSDRDAEIDRVKELLSSSMREAASSGEMLGA--LEEQREEAAREMRGLREQLA 1417

Query: 192  YIETLLKENEANVKLCKEL--EKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMS 249
              +   +  +A V   +E   E + H  ++  +    + E+    +++R+   + + + +
Sbjct: 1418 VAQVRREALDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEA 1477

Query: 250  EIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRN 309
            + + ++  +  L +Q  E  E    VE +      ++    E+L + AEEH   ++ +++
Sbjct: 1478 QQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLRE-AEEHARDVEAQQS 1536

Query: 310  EKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQERD- 368
            ++  E+ ++RE+++  EE  ++ E +++    +     E+++  +E     +    +RD 
Sbjct: 1537 DRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDA 1596

Query: 369  ---QLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAE 425
                L   L E ++ A+DV   +  +D    +L +   E+ E  +  +++   +   +A+
Sbjct: 1597 EVADLREQLREAEERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVAD 1656

Query: 426  MKLSMEANNRRIQDLEEQLAENNAWIN 452
            ++  +       +D+E Q ++ +A I+
Sbjct: 1657 LREQLREAEEHARDVEAQQSDRDAEID 1683



 Score = 45.4 bits (106), Expect = 0.058,   Method: Composition-based stats.
 Identities = 68/355 (19%), Positives = 158/355 (44%), Gaps = 18/355 (5%)

Query: 3    LLENLKFKEWHTKISEAELLVKQIEYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKD 62
            L E L+  E H +  EA+   +  E    + L     +RE      + L  L +      
Sbjct: 1349 LREQLREAEEHARDVEAQQSDRDAEIDRVKEL-LSSSMREA-ASSGEMLGALEEQREEAA 1406

Query: 63   QELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELK 122
            +E+R LR+Q   +  +    +  +  ++++ +E +E  +    Q+++    + +   +L+
Sbjct: 1407 REMRGLREQLAVAQVRREALDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLR 1466

Query: 123  VQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTEL 182
               E E   RD   ++   D    D     ++AE      E ++   D    E  ++ E 
Sbjct: 1467 ---EAEERARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRD---AEVADLREQ 1520

Query: 183  MKEVDKKDNYIETLLKENEANVKLCKEL--EKQVHELNINKELDRLNHEIIHKDDEIRQN 240
            ++E ++    +E    + +A V   +E   E + H  ++  +    + E+    +++R+ 
Sbjct: 1521 LREAEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREA 1580

Query: 241  ASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEH 300
              + + + ++ + ++  +  L +Q  E  ER   VE +      ++    E+L + AEEH
Sbjct: 1581 EERARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLREQLRE-AEEH 1639

Query: 301  EELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKE 355
               ++ +++++  E+ ++RE+++  EE  ++ E +++        +D EI R+KE
Sbjct: 1640 ARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSD-------RDAEIDRVKE 1687



 Score = 41.6 bits (96), Expect = 1.0,   Method: Composition-based stats.
 Identities = 49/241 (20%), Positives = 114/241 (47%), Gaps = 12/241 (4%)

Query: 233  KDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNE-------LVERLNSVEPRIEVLTKD 285
            +D EI +    +   M E AS  E +  L +Q  E       L E+L   + R E L  +
Sbjct: 1118 RDAEIDRVKELLSSSMREAASSGEMLGALEEQREEAAREMRGLREQLAVAQVRREALDAE 1177

Query: 286  IQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKR 345
            +    E+L + AEEH   ++ +++++  E+ ++RE+++  EE  ++ E +++    +   
Sbjct: 1178 VADLREQLRE-AEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVAD 1236

Query: 346  KDEEIKRLKEEIGKYKMLVQERD----QLANMLEEYKQNAKDVVYYEDMKDKLEVELDKL 401
              E+++  +E     +    +RD     L   L E ++ A+DV   +  +D    +L + 
Sbjct: 1237 LREQLREAEEHARDVEAQQSDRDAEVADLREQLREAEERARDVEAQQSDRDAEVADLREQ 1296

Query: 402  RMESAEKIKLRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEI 461
              E+ E  +  +++   +   +A+++  +       +D+E Q ++ +A + +   +++E 
Sbjct: 1297 LREAEEHARDVEAQQSDRDAEVADLREQLREAEEHARDVEAQQSDRDAEVADLREQLREA 1356

Query: 462  Q 462
            +
Sbjct: 1357 E 1357


>gi|154413468|ref|XP_001579764.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
 gi|121913974|gb|EAY18778.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
          Length = 3369

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 103/350 (29%), Positives = 181/350 (51%), Gaps = 61/350 (17%)

Query: 53   ELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKY---QRDLNQKTE 109
            ELAK    KD+E ++  +   Q++++    ++ + K K     LQEK    ++++N K E
Sbjct: 865  ELAK----KDKENKEFEELMSQAISEK---DEEISKSKNGISSLQEKLAEKEKEINSKNE 917

Query: 110  TIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSM 169
               +   +N +L  Q +EEI N +  I EL  + S  D+ +S+ +++I +L +E  K  +
Sbjct: 918  ANTAEKEENSKLISQRDEEISNLNKSIDELRKEISTKDETISQFESKINELIEEISKKEL 977

Query: 170  DLVFKENQEVTELMKEVDKKDNYI----------ETLLKENEANV----KLCKELEKQVH 215
             +  KE + + EL +++ +K+N I          ET + E E+ +    K   ELE+ V 
Sbjct: 978  TINEKETK-IAELNEQITQKENEINGLKEAEKVMETKISEIESQLTEKEKSINELEETVQ 1036

Query: 216  ----ELN---------------INKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEE 256
                E+N               +N+ + + + EI  K++EI  N SKI +L  +I++KE 
Sbjct: 1037 NKETEINQKNEELSERETKINELNEIISQKDSEIQQKNEEISSNNSKIDELNQQISNKEN 1096

Query: 257  SMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNE---KKL 313
            S+++L  + + L  + +  E +IE LTK +  K E+ +KL    +E IQT+  E   K+ 
Sbjct: 1097 SLQELTDKVHSLETKNSEQETQIEELTKLVSEKEEENNKL----QETIQTKETEIKDKQS 1152

Query: 314  ELDEMREEI----KRLEEVLKENECERAGLLTKE-KRKDEEIKRLKEEIG 358
            ++DEM +EI    K +EE+      ER   L +E K K+ +I  +KE+I 
Sbjct: 1153 KVDEMNQEISDKDKSIEEI-----TERVNKLEEENKTKNSQIDEMKEQIS 1197



 Score = 46.6 bits (109), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 83/268 (30%), Positives = 142/268 (52%), Gaps = 48/268 (17%)

Query: 127 EEIDNRDTRIKELTSDTSEIDKIVSRKDAEI-EKLSKENE-------------KLSMDLV 172
           E+I  +D  +K+  S+ SE+  +VS+K+ E+ EK +  NE             +L+  L 
Sbjct: 408 EQIQTQDINLKQKDSNISELQVLVSQKETELSEKDNSINEFIHKLEEKDLQIKELNEQLN 467

Query: 173 FKENQEVTELMKEVDKKDNYIETLL------------KENEANVKL--CKELEKQVHELN 218
            KE+Q + EL  ++  K+N ++ +             KE E N K     E E +++ELN
Sbjct: 468 NKESQ-INELNAQISDKENSLQEITDKVHTLEETVQNKETEINQKNEELSERETKINELN 526

Query: 219 INKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPR 278
             + + + + EI  K++EI  N SKI +L  +I++KE S+++L  + + L  + +  E +
Sbjct: 527 --EIISQKDSEIQQKNEEISSNNSKIDELNQQISNKENSLQELTDKVHSLETKNSEQETQ 584

Query: 279 IEVLTKDIQIKTEKLHKLAEEHEELIQTRRNE---KKLELDEMREEI----KRLEEVLKE 331
           I+ LTK +  K E+ +KL    +E IQT+  E   K+ ++DEM +EI    K +EE+   
Sbjct: 585 IDELTKLVSEKEEENNKL----QETIQTKETEIKDKQSKVDEMNQEISDKDKSIEEI--- 637

Query: 332 NECERAGLLTKE-KRKDEEIKRLKEEIG 358
              ER   L +E K K+ +I  +KE+I 
Sbjct: 638 --TERVNKLEEENKTKNSQIDEMKEQIS 663



 Score = 43.9 bits (102), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 121/429 (28%), Positives = 206/429 (48%), Gaps = 74/429 (17%)

Query: 40   LRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKK------EF 93
            L E +  K+  + EL +   +K+QE+ D   + Q   T+ N   Q  E+IK+      EF
Sbjct: 715  LYEEIANKNTNITELNEQISSKNQEIVDRDNKLQSLGTELN---QKNEEIKEKDSKIGEF 771

Query: 94   KELQEKYQRDLNQKTETIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRK 153
             +L  K   ++NQ  E I  +++K  EL      EI  +D  I EL +        ++ K
Sbjct: 772  NDLVSKKDSEINQLQEEIADISSKIEELN----NEIATKDASILELNNK-------IAEK 820

Query: 154  DAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKENEANVKLCKELEKQ 213
            D +I+ L +E   L      KEN ++++L+ + D+K + IE +            EL K+
Sbjct: 821  DLKIKSLDEEKSSLQSKPAEKEN-DISDLLVKYDEKCSEIEAV----------QSELAKK 869

Query: 214  VHELNINKELDRL-NHEIIHKDDEIRQNASKIKQLMSEIASKE--------------ESM 258
              E   NKE + L +  I  KD+EI ++ + I  L  ++A KE              E  
Sbjct: 870  DKE---NKEFEELMSQAISEKDEEISKSKNGISSLQEKLAEKEKEINSKNEANTAEKEEN 926

Query: 259  KKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEM 318
             KLI Q +E +  LN     I+ L K+I  K E + +   +  ELI+   ++K+L ++E 
Sbjct: 927  SKLISQRDEEISNLN---KSIDELRKEISTKDETISQFESKINELIE-EISKKELTINEK 982

Query: 319  REEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQERDQLANMLEEYK 378
              +I  L E + + E E  GL  KE  K  E K     I + +  + E+++  N LEE  
Sbjct: 983  ETKIAELNEQITQKENEINGL--KEAEKVMETK-----ISEIESQLTEKEKSINELEETV 1035

Query: 379  QNAKDVVYY--EDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAEMKLSMEANNRR 436
            QN +  +    E++ ++ E ++++L     E I  +DSE+ +K+E I+       +NN +
Sbjct: 1036 QNKETEINQKNEELSER-ETKINELN----EIISQKDSEIQQKNEEIS-------SNNSK 1083

Query: 437  IQDLEEQLA 445
            I +L +Q++
Sbjct: 1084 IDELNQQIS 1092



 Score = 43.1 bits (100), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 122/477 (25%), Positives = 223/477 (46%), Gaps = 92/477 (19%)

Query: 7   LKFKEWHTKISEAELLVKQIEY----KNNQLLEFEQELRETMKQKDDKLLELAKTARTKD 62
           +  K+  + ISE ++LV Q E     K+N + EF  +L E    KD ++ EL +    K+
Sbjct: 415 INLKQKDSNISELQVLVSQKETELSEKDNSINEFIHKLEE----KDLQIKELNEQLNNKE 470

Query: 63  QELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELK 122
            ++ +L  Q        +  E  L++I  +   L+E  Q              NK  E+ 
Sbjct: 471 SQINELNAQ-------ISDKENSLQEITDKVHTLEETVQ--------------NKETEIN 509

Query: 123 VQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTEL 182
            Q  EE+  R+T+I EL       ++I+S+KD+EI+   ++NE++S +     N ++ EL
Sbjct: 510 -QKNEELSERETKINEL-------NEIISQKDSEIQ---QKNEEISSN-----NSKIDEL 553

Query: 183 MKEVDKKDNYIETLLKENEANVKLCKELEKQVHELN-----INKELDRLNHEIIHKDDEI 237
            +++  K+N ++ L  +  +      E E Q+ EL        +E ++L   I  K+ EI
Sbjct: 554 NQQISNKENSLQELTDKVHSLETKNSEQETQIDELTKLVSEKEEENNKLQETIQTKETEI 613

Query: 238 RQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIK--TEKLHK 295
           +   SK+ ++  EI+ K++S++       E+ ER+N +E   E  TK+ QI    E++  
Sbjct: 614 KDKQSKVDEMNQEISDKDKSIE-------EITERVNKLEE--ENKTKNSQIDEMKEQISS 664

Query: 296 LAEEHEELIQ---TRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKR 352
           +    E  I    T+ N K  E+D + ++++      KE E E+A            I  
Sbjct: 665 ITTNEETAISTLNTQLNNKNNEIDLLHQQLQS-----KETENEKA------------INE 707

Query: 353 LKEEIGK-YKMLVQERDQLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEK--- 408
           L +++ K Y+ +  +   +  + E+     +++V  ++    L  EL++   E  EK   
Sbjct: 708 LNDKLNKLYEEIANKNTNITELNEQISSKNQEIVDRDNKLQSLGTELNQKNEEIKEKDSK 767

Query: 409 -------IKLRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKI 458
                  +  +DSE+++  E IA++   +E  N  I   +  + E N  I E+DLKI
Sbjct: 768 IGEFNDLVSKKDSEINQLQEEIADISSKIEELNNEIATKDASILELNNKIAEKDLKI 824


>gi|154416763|ref|XP_001581403.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
 gi|121915630|gb|EAY20417.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
          Length = 4263

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 67/245 (27%), Positives = 131/245 (53%), Gaps = 40/245 (16%)

Query: 256  ESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEE---HEELIQTRRNEKK 312
            +++++L KQ  E  E +N +E   +V  ++I  K +++ ++  +    EE+I    NE+ 
Sbjct: 1375 QTLEQLAKQLQEAEEDINVLEGNCQVYEQEIAEKDKQIEQMTNDIKSLEEVI----NEQS 1430

Query: 313  LELDEMR-------EEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKMLVQ 365
              +D ++       EEIK+L++ + ENE     L T  ++KD EI++ KEEI ++K  + 
Sbjct: 1431 NTIDSLKQDVATKEEEIKQLKQTVSENEEVIKQLQTDIEQKDAEIQKNKEEIEQHKQTIS 1490

Query: 366  ERD----QLANMLEEYKQNAKD-----------VVYYEDMKDKLEVELDKLRMESAEKIK 410
            +RD    QL + +E++KQ   D           +   E+   +L+ E+++ +   AE   
Sbjct: 1491 QRDETIKQLQSEIEQHKQTIADKNNEIEQLKNTISEREETIKQLQNEIEQHKQTMAE--- 1547

Query: 411  LRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQRTLLEPIE 470
             RD+E+ K  E I + K ++  NN  I+ L++        I+E+D +I+++++T+ E  E
Sbjct: 1548 -RDAEIQKNKEEIEQQKQTISNNNNEIEQLKK-------TISERDAEIEQLKKTIAERDE 1599

Query: 471  PLSEL 475
             + +L
Sbjct: 1600 SIKQL 1604



 Score = 46.6 bits (109), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 93/390 (23%), Positives = 188/390 (48%), Gaps = 73/390 (18%)

Query: 125  YEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTELMK 184
            YE+EI  +D +I+++T+D   ++++++ +   I+ L +       D+  KE +E+ +L +
Sbjct: 1401 YEQEIAEKDKQIEQMTNDIKSLEEVINEQSNTIDSLKQ-------DVATKE-EEIKQLKQ 1452

Query: 185  EVDKKDNYIETLLKENEANVK-LCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASK 243
             V            ENE  +K L  ++E++  E+  NKE      EI      I Q    
Sbjct: 1453 TVS-----------ENEEVIKQLQTDIEQKDAEIQKNKE------EIEQHKQTISQRDET 1495

Query: 244  IKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEEL 303
            IKQL SEI   E+  + +  +NNE+ +  N++  R E + K +Q + E+  +   E +  
Sbjct: 1496 IKQLQSEI---EQHKQTIADKNNEIEQLKNTISEREETI-KQLQNEIEQHKQTMAERDAE 1551

Query: 304  IQTRRNE---KKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKY 360
            IQ  + E   +K  +     EI++L++ + E + E   L      +DE IK+L+ EI ++
Sbjct: 1552 IQKNKEEIEQQKQTISNNNNEIEQLKKTISERDAEIEQLKKTIAERDESIKQLQNEIEQH 1611

Query: 361  KMLVQERD-------------------------QLANMLEEYKQNAKD-----------V 384
            K  + +RD                         QL + +E++KQ   D           V
Sbjct: 1612 KQTISQRDAEIEQLKQTVQQRDQTIAEKEDLIKQLQSEIEQHKQTISDKNNEIEQLKQTV 1671

Query: 385  VYYEDMKDKLEVELDKLRMESAEKIKLRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQL 444
               ++   +L+ E+++ +   AE    RD+E+ K  E I + K ++   +  I+ L+ ++
Sbjct: 1672 NARDEAIKQLQSEIEQHKQTIAE----RDAEIQKNKEEIEQQKQTISQRDETIKQLQNEI 1727

Query: 445  AENNAWINEQDLKIQEIQRTLLEPIEPLSE 474
             ++   I+++D +I+++++T+ +  + ++E
Sbjct: 1728 EQHKQTISQRDAEIEQLKQTVQQSDQTIAE 1757



 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 104/425 (24%), Positives = 205/425 (48%), Gaps = 76/425 (17%)

Query: 41   RETMKQKDDKLLELAKTARTKDQ---ELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQ 97
            ++T+ Q+D ++ +L +T +  DQ   E  DL +Q Q  +      EQH + I +   E+Q
Sbjct: 1731 KQTISQRDAEIEQLKQTVQQSDQTIAEKEDLIKQLQSEI------EQHKQTIAERDAEIQ 1784

Query: 98   ------EKYQRDLNQKTETIR-----------SLTNKNLELKVQYEEEIDNRDTRIKELT 140
                  E+ ++ ++Q+ E+I+           ++ ++  E++ Q+++ I  RD  IK+L 
Sbjct: 1785 KNKEEIEQQKQTISQRDESIKQMQSEIEQNKQTIADREKEIE-QHKQTIAERDNSIKQLQ 1843

Query: 141  SDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKEN 200
             +  +  + ++ +DAEI+K  +E        + ++N+ +  L  E ++K   I  L   N
Sbjct: 1844 EEIEQHKQTIAERDAEIQKNKEE--------IQQKNEAINALTNEGEEKRLKILELEANN 1895

Query: 201  EANVKLCKELEKQVHELNIN---------------KELDRLNHEIIHKDD----EIRQNA 241
            E  +   KEL   V +LN++               K+L++  HE+  K +    +I +  
Sbjct: 1896 ENLINKVKELNDSVSDLNLSTENQNSVVKQMTDEIKDLNKQIHELEVKSENQQKQIEEKD 1955

Query: 242  SKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHE 301
             +I+ L +  A  EE +KKL     E VE L + + + E   K++Q + + L +   ++E
Sbjct: 1956 KEIQSLTNTKAQNEELIKKL----QEEVENLTNTKNQNEETIKNLQEQVQSLTETKNQNE 2011

Query: 302  ELIQTR----------RNEKKLELDEMREEIKRL-------EEVLKENECERAGLLTKEK 344
            +LI+ +          +NE +  +  ++E+++ L       EE +K+ + E   L   + 
Sbjct: 2012 DLIKKQQEQIQSLTNTKNENEETIKNLQEQVQSLTETKATNEETIKKLQGEVQSLTETKA 2071

Query: 345  RKDEEIKRLKEEIGKYKMLVQERDQLANMLEEYKQNAKDV-VYYEDMKDKLEVELDKLRM 403
              +E+IK+ +EEI        E ++L   L+E  QN  +     E+   KL+ E+  L+ 
Sbjct: 2072 TNEEQIKKQQEEIQSLSNTKNENEELIKKLQEEIQNLTNTKTQNEEQIKKLQEEIQNLQK 2131

Query: 404  ESAEK 408
            ++AEK
Sbjct: 2132 QNAEK 2136


>gi|145539868|ref|XP_001455624.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423432|emb|CAK88227.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1003

 Score = 46.6 bits (109), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 107/420 (25%), Positives = 197/420 (46%), Gaps = 77/420 (18%)

Query: 8   KFKEWHTKISEAELLVKQI-EYKNNQLLEFEQELRETMKQKDDKLLELAKTART----KD 62
           + + +  + ++A+  +KQ+ +Y    + E E++L++ M Q + ++ EL   A+     KD
Sbjct: 266 RVQNYEGQFNDAQAKIKQLAQY----IQELEKQLQDQMNQYEKQIKELVNNAKATEDEKD 321

Query: 63  QELRDLRQQTQQSVTKANTCEQHLEKIKKEFKELQEKYQRDLN----------QKTETI- 111
           Q +  L +       +  T  + + +++KE K+   K  R++           +KT T+ 
Sbjct: 322 QTIDQLEKDISNKANQLETQNKQISQLQKELKDADNKKDREVKDVQRKLDAELKKTATLE 381

Query: 112 ---RSLTNKNLELKVQ---YEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENE 165
              ++L +KN E   Q     +E+D  D +I EL     +    +   + EI+ L+KE +
Sbjct: 382 KNNKTLKDKNDEQAKQINDANQELDQLDEKIAELEQKVKDQQNQIKDLEKEIKDLNKEKQ 441

Query: 166 KLSMD---LVFKENQ----------EVTELMKEVDKKDNYIETLLKENEANVKLCKELEK 212
            L  D   L  K NQ          ++ +  KE+  K N  E L K+ +   +  KEL+K
Sbjct: 442 SLIQDNNNLHQKYNQAEEKALQQQKDLAKAQKELGDKHNDAEQLNKDLDEYEQENKELQK 501

Query: 213 QVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERL 272
           ++++LN    +++LN EI  K  +I Q A  I++L   +    E  K+  + NN+L E+L
Sbjct: 502 EINQLN--DSINQLNKEINQKQKQIDQQAKDIQELQENL----EKQKQDNQNNNDLDEQL 555

Query: 273 NSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKEN 332
           N  + + + L   I    +K +K  ++ +  +Q  +NE           IK+L++ +KE 
Sbjct: 556 NESKKQNQKLQDQINNTEQKQNKTQDQLKNQLQDAQNE-----------IKQLKDQIKEQ 604

Query: 333 ECERAGLLT--------------------KEKRKDEEIKRLKEEIGKY-KMLVQERDQLA 371
           E E+  L                      KE++++ EIKRL +E+ K  + L Q+ DQL 
Sbjct: 605 EKEKKNLQNEVNNLNKECDDLDAKLQQKIKEQQENSEIKRLNDELNKAQQQLKQKEDQLT 664


>gi|145479739|ref|XP_001425892.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392964|emb|CAK58494.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1014

 Score = 46.2 bits (108), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 105/427 (24%), Positives = 206/427 (48%), Gaps = 80/427 (18%)

Query: 8   KFKEWHTKISEAELLVKQI-EYKNNQLLEFEQELRETMKQKDDKLLELAKTAR-TKDQEL 65
           + + +  + ++A+  +KQ+ +Y    + E E++L++ M Q + ++ EL   A+ T+D++ 
Sbjct: 266 RVQNYEGQYNDAQAKIKQLAQY----IQELEKQLQDQMNQYEKQIKELLNNAKATEDEKD 321

Query: 66  RDLRQQTQQSVTKANTCE---QHLEKIKKEFKELQEKYQRDLN----------QKTETI- 111
            ++ Q  + +  KAN  E   + + +++KE K+   K  R++           +KT T+ 
Sbjct: 322 HNIDQLEKDNSNKANQLEAQNKQISQLQKELKDADNKRDREVKDVQRKLDAELKKTATLD 381

Query: 112 ---RSLTNKNLELKVQ---YEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENE 165
              ++L +KN E   Q     EE+D  D +I +L     +    +   + EI+ L+KE +
Sbjct: 382 KNNKTLKDKNDEQAKQINAANEELDQLDQKIADLEQKVKDQQNQIKDLEKEIKDLNKEKQ 441

Query: 166 KLSMD-------------LVFKENQEVTELMKEVDKKDNYIETLLKENEANVKLCKELEK 212
            L  D                ++ +++ +  KE++ K N  E L K+ +   +  KEL+K
Sbjct: 442 NLIQDNNNLHQKFNQAEEKALQQQKDLVKAQKELNDKHNNAEQLNKDLDEYEQENKELQK 501

Query: 213 QVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEI-------ASKEESMKKLIKQN 265
           +++ LN   ++++LN EI  K  +I Q A  I++L   +        SK++  K+L + N
Sbjct: 502 EINSLN--DQINQLNKEINQKQKQIDQQAKDIQKLQENLEKQKQDNQSKQQENKQLQQNN 559

Query: 266 NELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRL 325
           N+L ++LN  + + + L   I    +K +K  ++ +  +Q  +N           EIK+L
Sbjct: 560 NDLNKQLNESKKQNQKLQDQINNTEQKQNKTQDQLKNQLQDAQN-----------EIKQL 608

Query: 326 EEVLKENECERAGLLT--------------------KEKRKDEEIKRLKEEIGKY-KMLV 364
           ++ +KE E E+  L                      KE++++ EI RL +E+ K  + L 
Sbjct: 609 KDQIKEQEKEKKNLQNEVNNLNKECDDLDAKLQQKIKEQQENSEINRLNDELNKAQQQLK 668

Query: 365 QERDQLA 371
           Q+ DQL 
Sbjct: 669 QKEDQLT 675


>gi|389603539|ref|XP_003722983.1| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 gi|322504673|emb|CBZ14502.1| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
          Length = 3658

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 79/268 (29%), Positives = 131/268 (48%), Gaps = 47/268 (17%)

Query: 155  AEIEKLSKENEKLSMDLVFKEN--QEVTELMKEVDKKDNYIETLLKENEANVKLCKELEK 212
            AEI+KL  +NE+L+ +L   EN  +E   L  E++K     E    EN    KLC + E+
Sbjct: 1931 AEIDKLRGDNERLAEEL---ENVQEEAERLAGELEKAQADAEAQRAEN---GKLCGDNER 1984

Query: 213  QVHEL-NINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVER 271
             V EL ++ +E +RL  E+    +E  + A ++++  ++  ++     KL   N  LVE 
Sbjct: 1985 LVEELESLQEEAERLAGELEKAQEEAERLAGELEKAQADAEAQRAENGKLCGDNERLVEE 2044

Query: 272  LNSVEPRIEVLTKDIQIKTEKLHKLAEEHEEL---IQTRRNE-KKL---------ELDEM 318
            L S++   E L  +++   E+  +LA E E+     + +R E  KL         EL+ +
Sbjct: 2045 LESLQEEAERLASELEKAQEEAERLAGELEKAQADAEAQRAENGKLCGDNERLVEELESL 2104

Query: 319  REEIKRLEEVLK--ENECER-AGLLTKEK------------------RKDEEIKRLKEE- 356
            +EE +RL   L+  + E ER AG L K +                  R  EE++RL+EE 
Sbjct: 2105 QEEAERLASELEKAQEEAERLAGELEKAQANAEAQRAENGKLCGDNERLAEELERLQEEA 2164

Query: 357  ---IGKYKMLVQERDQLANMLEEYKQNA 381
                G+ +   +E ++LA  LE+ + NA
Sbjct: 2165 ERLAGELEKAQEEAERLAGELEKAQANA 2192



 Score = 40.8 bits (94), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 90/340 (26%), Positives = 161/340 (47%), Gaps = 46/340 (13%)

Query: 156  EIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKENEANVKLCKELEKQVH 215
            E+E+L +E E+L+ +L  K  +E   L  E++K     E    E     KLC + E+ V 
Sbjct: 2659 ELERLQEEAERLAGEL-EKAQEEAERLAGELEKAQADAEAQRAEI---GKLCGDNERLVE 2714

Query: 216  EL-NINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNS 274
            EL ++ +E +RL  E+    +E  + A ++++  ++  ++     KL   N  L E L S
Sbjct: 2715 ELESLQEEAERLAGELEKAQEEAERLAGELEKAQADAEAQRAENGKLCGDNERLAEELES 2774

Query: 275  VEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLK--EN 332
            ++   E L  +++   E+  +LA E E+ +Q     ++ E+ ++R + +RL + L+  + 
Sbjct: 2775 LQEEAERLAGELEKAQEEAERLAGELEK-VQADAEAQRAEIGKLRGDTERLADELEKLQE 2833

Query: 333  ECER-AGLLTK-----EKRKDEEIK------RLKEEIGKYKMLVQERDQLANMLEEYKQN 380
            E ER AG L K     E ++ E  K      RL EE+ K +   +E ++LA  LE+ ++ 
Sbjct: 2834 EAERLAGELEKAQADAEAQRAENGKLCGDNERLAEELEKAQ---EEAERLAGELEKAQEE 2890

Query: 381  AKDVVY----YEDMKDKLEVELDKLRM-------------ESAEKIKLRDSELDKKHEVI 423
            A+ +       +   + L  E+DKLR              E AE++     EL+K  E  
Sbjct: 2891 AERLAGELEKAQADAEALRAEIDKLRGDNERLAEELEKAQEEAERLA---GELEKAQEEA 2947

Query: 424  AEMKLSMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQR 463
              +   +E   +   D E Q AEN     + +   +E++R
Sbjct: 2948 ERLAGELE---KAQADAEAQRAENGKLCGDNERLAEELER 2984


>gi|123445769|ref|XP_001311641.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
 gi|121893459|gb|EAX98711.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
          Length = 1513

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 94/315 (29%), Positives = 163/315 (51%), Gaps = 34/315 (10%)

Query: 75  SVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELKVQYEEEIDNRDT 134
           S  K +  E  ++K+K++ K L+E     +N     I+ L N    +  Q + EI  +D 
Sbjct: 367 SSEKIHALESEIQKLKQDNKSLEEALSL-VNSTKSDIKELEN----VIEQLQGEIAEKDQ 421

Query: 135 RIKELTSDTSEIDKIVSRKDAEI---EKLSKENEKLSMDLVFKENQEVTELMKEVDKKDN 191
           +IKEL+S + E D+I+   + +I   E +SK  +K + ++  KE +E  EL + +D    
Sbjct: 422 KIKELSS-SKENDEILQELEVQIQEKENISKSLQKKAEEIEMKE-KENKELEQVIDSLKT 479

Query: 192 YIETLLKENEANVKLCKELEKQVHELNINKE----LDRLNHEIIHKDDEIRQNASKIKQL 247
            I++L KENE   K C+         N++KE    +D +N +I  K++EI+ N SKIK+L
Sbjct: 480 EIDSLTKENEKLNKACERASDAA--TNLSKERDMIVDEMNKDINEKEEEIQNNLSKIKEL 537

Query: 248 MSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEH------- 300
             +I  K+    K + QNN+  E +N ++ +I+     I+   +K+ +L E +       
Sbjct: 538 EQKI--KDIETDKDLTQNNKSEEIINELQNKIQNNLSKIRKLEQKIKELEEANAQLSNNK 595

Query: 301 -EELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGK 359
            EE+I   +NE +  L ++RE    LE+ +KE E  +        + DE I +L+ EI K
Sbjct: 596 SEEIINELQNEIQNNLSKIRE----LEQKIKELESTQLS----NNKSDETINQLEVEIAK 647

Query: 360 YKMLVQERDQLANML 374
            K  +++ ++  N L
Sbjct: 648 NKETIEKINKENNYL 662


>gi|154413470|ref|XP_001579765.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
 gi|121913975|gb|EAY18779.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
          Length = 883

 Score = 43.9 bits (102), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 91/339 (26%), Positives = 170/339 (50%), Gaps = 53/339 (15%)

Query: 169 MDLVFKENQEV----TELMKEVDKKDNYIETLLK----ENEANV---KLCKELEKQVHEL 217
           + ++ KEN ++    +E +K+  +KD  IE L++    EN  N    KL KE  K+  EL
Sbjct: 458 IQMITKENDDLKAQNSEYVKQNQEKDRQIEELVQSLSNENNNNADIQKLYKE--KEETEL 515

Query: 218 NINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEP 277
            I+    +L  EI   + +I++   ++ QL   I  KEE +  + K N++L  ++  +  
Sbjct: 516 LIS----QLGDEITQLNTKIQEKVDEVNQLTETILDKEEVINAVTKDNSDLNNKIAELNN 571

Query: 278 RIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKE--NECE 335
            I  +TK+I  K EK+++L  + EEL    + EK+ E++    +I  L E + E  NE  
Sbjct: 572 AISEMTKEITEKEEKINELNRKIEELNNVIK-EKEEEINRFSSKISELNESINEKINEIN 630

Query: 336 RAGLLTKE-----KRKDEEIKRLKEEIGKYKMLVQERDQLANMLEE----YKQNAKDVVY 386
                  E     K KDE+I  L  +  + +  + E ++L N +++    + +  K+   
Sbjct: 631 NTNTAINELNNQIKEKDEKINELNNQNQEKQNKIDELNELNNTVQQNETKFGELNKENRE 690

Query: 387 YEDMKDKLEVELDKLRMESAEKIKL----------RDSELDKKHEVIAEMKLSME----- 431
            E+  ++L  E++++   S+EK K           +D+E+ KK E+I E++ S++     
Sbjct: 691 KENRINELNKEIERINNSSSEKDKTIANLNESLLEKDNEITKKDELIKELQESVQTKETE 750

Query: 432 ---------ANNRRIQDLEEQLAENNAWINEQDLKIQEI 461
                    +NN +I +L +Q+ E NA I++++  ++EI
Sbjct: 751 INQKNELISSNNTKIDELNQQINELNAQISDKENSLKEI 789



 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 72/254 (28%), Positives = 135/254 (53%), Gaps = 29/254 (11%)

Query: 39  ELRETMKQKDDKLLELAKTARTKDQ---ELRDLRQQTQQSVTKANTCEQHLEKIKKEFKE 95
           EL   +K+KD+K+ EL    + K     EL +L    QQ+ TK     +   + +    E
Sbjct: 638 ELNNQIKEKDEKINELNNQNQEKQNKIDELNELNNTVQQNETKFGELNKENREKENRINE 697

Query: 96  LQEKYQRDLNQKTETIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDT----SEIDK--- 148
           L ++ +R  N  +E  +++ N N  L ++ + EI  +D  IKEL        +EI++   
Sbjct: 698 LNKEIERINNSSSEKDKTIANLNESL-LEKDNEITKKDELIKELQESVQTKETEINQKNE 756

Query: 149 IVSRKDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLL--KENEANVKL 206
           ++S  + +I++L+++  +L+  +  KEN      +KE+  K + +E  +  KE E N K 
Sbjct: 757 LISSNNTKIDELNQQINELNAQISDKENS-----LKEITDKVHTLEETVQNKETEINQKN 811

Query: 207 --CKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQ 264
               E E +++ELN  + + + + EI  K++EI  N SKI++L  +I++KE S++     
Sbjct: 812 EELSERETKINELN--EIISQKDSEIQQKNEEISSNNSKIEELNQQISNKENSLQ----- 864

Query: 265 NNELVERLNSVEPR 278
             EL ++++S+E +
Sbjct: 865 --ELTDKVHSLETK 876


>gi|308464124|ref|XP_003094331.1| hypothetical protein CRE_08534 [Caenorhabditis remanei]
 gi|308247909|gb|EFO91861.1| hypothetical protein CRE_08534 [Caenorhabditis remanei]
          Length = 709

 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 76/298 (25%), Positives = 145/298 (48%), Gaps = 54/298 (18%)

Query: 27  EYKNNQLLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVT--------- 77
           E+   + L+  +EL +  K ++   LE+ +  R  D++   LR +  ++V          
Sbjct: 19  EWTVEKFLDMRKELLDRQKDQEKSNLEMVEKLRLSDEKFEKLRAKFDKTVESLESEASSE 78

Query: 78  ---KANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLE-LKVQYEEEIDNRD 133
              K    +  +EK+ K+F  +Q K    LNQ  +T+R + +  ++ L ++++ + D   
Sbjct: 79  IVEKIRLSDDKIEKLSKKFDSIQWK----LNQTNKTLRPVVSSEIKKLSMEHQGKFDWIQ 134

Query: 134 TRIKELTSDT-SEIDKIVSR---KDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKK 189
           +++ E+      E+ KIV +    D  IEKLSKE+E               E M  +  K
Sbjct: 135 SKLNEIDKTLKPEVPKIVEKIRLSDERIEKLSKEHE---------------EKMNSIQLK 179

Query: 190 DNYIETLLKENEANVKLCKELEKQVHELN--INKELDRLNHEIIHK-DDEIRQNASKIKQ 246
            N IE   K   ++     E++K  H+L   ++K ++ L  E I   + +IR    +I++
Sbjct: 180 LNKIEKTRKREVSS-----EIDKLSHDLQSKMDKIVEALKPETISGIEQKIRLCDERIEK 234

Query: 247 LMSE-------IASKEESMKK---LIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLH 294
           L +E       +  ++E + K   L K N+E+VE++ + + RIE ++K++Q K +K+H
Sbjct: 235 LSNEQNEKFVRLDMRKEFLNKQKDLEKSNSEIVEKVRTCDERIEKVSKELQSKMDKIH 292


>gi|38347763|dbj|BAD01607.1| myosin heavy chain [Lethenteron camtschaticum]
          Length = 1491

 Score = 41.6 bits (96), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 107/398 (26%), Positives = 188/398 (47%), Gaps = 69/398 (17%)

Query: 85   HLEKIKKEFKELQEKYQRDLNQKTETIRSL-----TNKNLE---LKVQY-----EEEIDN 131
            HLE +K+E K LQE+      Q  E+ ++L     + K LE   L++Q      E  +++
Sbjct: 1050 HLETLKRENKNLQEEISDLTEQLGESGKALHEVEKSRKQLEQEKLEIQAALEEAEASLEH 1109

Query: 132  RDTRIK----ELTSDTSEIDKIVSRKDAEIEKLSKENEK------LSMDLVFKENQEVTE 181
             + +I     EL    ++ID+ ++ KD E+++L + N++       S+D   K   E   
Sbjct: 1110 EEGKILRIQLELNQVKADIDRKIAEKDEELDQLKRSNQRTVESMQASLDAETKSRNEALR 1169

Query: 182  LMKEVDKKDNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNA 241
            L K+++   N +E  L  N AN +     E Q H  N+  ++  L    +H DD +R N 
Sbjct: 1170 LKKKMEGDLNEMEIQL--NHANRQAA---ESQKHLRNVQGQIKDLQ---VHLDDTLRSN- 1220

Query: 242  SKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHE 301
                 L  ++A  E     L+ +  E+   L   E   +V  +++   TE++  L  ++ 
Sbjct: 1221 ---DDLKEQVAMIERRNNLLLAEVEEMRTALEQTERARKVAEQELLGATERVQLLHSQNT 1277

Query: 302  ELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYK 361
             LI T+R   KLE D     I +L+     NECE A    +E R  +E  + K+ I    
Sbjct: 1278 SLINTKR---KLEGD-----IVQLQ-----NECEEA---IQESRNADE--KAKKAITDAA 1319

Query: 362  MLVQE----RDQLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEK--------I 409
            M+ +E    +D  A+ LE  K+N +  V  +D++++L+ E +++ M+  +K        +
Sbjct: 1320 MMAEELKKEQDTSAH-LERMKKNMEQTV--KDLQNRLD-EAEQIAMKGGKKQIQKLETRV 1375

Query: 410  KLRDSELDKKHEVIAEMKLSMEANNRRIQDLEEQLAEN 447
            +  +SELD +   IAE         RR+++L  Q  E+
Sbjct: 1376 RELESELDSEQRRIAETIKGARKYERRVKELTYQAEED 1413


>gi|154419561|ref|XP_001582797.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
 gi|121917034|gb|EAY21811.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
          Length = 1553

 Score = 41.6 bits (96), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 82/310 (26%), Positives = 147/310 (47%), Gaps = 60/310 (19%)

Query: 83  EQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELKVQYEE-------EIDNRDTR 135
           EQ +E++KK  +E  EKYQ++    T+T+ +  N+   LK Q E+       EI+N + +
Sbjct: 348 EQEVEELKKSQQENDEKYQKEKEDLTQTVNNQNNEISNLKKQNEDLSNSTTNEINNLNKQ 407

Query: 136 IKELTSDTSEIDK-------IVSRKDAEIEKLSKENEKLSMDLVFKEN--QEVTELMKEV 186
           I++L +  S+++K        VS  + E+  L K N++L  +   K N  +E   L   V
Sbjct: 408 IQDLQNQKSDLEKQNADYNNTVSNNNDELANLKKLNQELQNE---KSNLQKETENLSNTV 464

Query: 187 DKKDNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQ 246
           + K+N IE L K+NE                        L +E        +QN  K+K+
Sbjct: 465 NDKNNEIEELKKQNED-----------------------LQNE--------KQNLQKVKE 493

Query: 247 -LMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQ 305
            L + I +K++ +K L KQN +L  + N +E + E L   +  K  +L+ L  ++++L +
Sbjct: 494 DLTNTITTKDDEIKDLKKQNEDLQNQNNDLEKQKEDLNNTVANKDSELNNLKNDNQQLQE 553

Query: 306 TRR------NEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGK 359
             +      N  K    ++ +++  LE  + E   E  GL+   K +D ++  L+   G+
Sbjct: 554 ANKKQNDDINNLKKSNQDLEDKVTDLEGKIDEMTAENEGLMENVKTRDLQLDNLQ---GE 610

Query: 360 YKMLVQERDQ 369
           +   V E +Q
Sbjct: 611 HSQTVDELNQ 620


>gi|123438677|ref|XP_001310118.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
 gi|121891874|gb|EAX97188.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
          Length = 3977

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 114/418 (27%), Positives = 199/418 (47%), Gaps = 109/418 (26%)

Query: 31   NQLLEFEQELRETMKQKDDKLLE--------LAKTARTKDQELRDLR------------- 69
            NQ+ + ++ L ET KQ DDKL+E        L    ++KDQ + DL              
Sbjct: 3185 NQIDQLKKLLEET-KQNDDKLVEELTKEIEKLKNEQQSKDQNINDLSALNKDKSSLIQQN 3243

Query: 70   ----QQTQQSVTKANTCEQHLEKIKKEFKELQEKYQRDLNQKTETIRSLTNKNLELKVQ- 124
                ++TQ+         Q +E +KK+ + LQ+  + + N+  + I  LT    +L  + 
Sbjct: 3244 DDLSKKTQEFYNSQQNQAQMIEDLKKQNESLQKNLEINNNETQQNIDQLTKDKSDLASKL 3303

Query: 125  --YEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDLVFKENQEVTEL 182
              YE +I++ ++ IKEL    +E + I+ +K+ E  +  +    ++ DL+ K NQ    L
Sbjct: 3304 HDYEAKINDLNSLIKEL----NEKNAIIEKKNYEFSQQLE----VNNDLISKNNQ----L 3351

Query: 183  MKEVDK--KDNYIETLLKENEANVKLCKELEKQVHEL-NINKEL-DRLNH------EIIH 232
             + +D+  KD  +                L KQ+ +L N N E+ ++LN+      E   
Sbjct: 3352 QQTIDQLNKDKTV----------------LSKQIQDLANKNNEITNQLNNKDKIILESKQ 3395

Query: 233  KDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEK 292
            K DE+ Q+ S    LM E+ +        +K NN+    LNS   + +   +++Q++ EK
Sbjct: 3396 KSDELNQSLSN---LMKELHT--------LKANND---DLNSQISQSKQNEENLQLQIEK 3441

Query: 293  LHKLAEEHEELIQTRRNEKKLELDEMREEIKRL-------EEVLKENECERAGLLTKEKR 345
              KL      L  T++N+ KL +D++ +E++ L       EE++K+N  + +G+L + ++
Sbjct: 3442 QKKL------LQDTKQNDNKL-VDDLSKEVETLTSEKLKNEEIIKQNNAKYSGILKQLQQ 3494

Query: 346  KDEEIKRLKEEIGKYKMLVQERDQLANMLEEYKQ-NAKDVVYYEDMKDKLEVELDKLR 402
            K+EEI +             E++Q  + LE  KQ N K V      KDKL  E +KL+
Sbjct: 3495 KNEEINK-------------EKEQFKHDLEGEKQKNEKLVNDLNQTKDKLSQENEKLK 3539


>gi|17536671|ref|NP_496914.1| Protein W02B8.2 [Caenorhabditis elegans]
 gi|3880401|emb|CAB03458.1| Protein W02B8.2 [Caenorhabditis elegans]
          Length = 1256

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 115/427 (26%), Positives = 207/427 (48%), Gaps = 58/427 (13%)

Query: 103 DLNQKTETIRSLTNKNLELKVQ----------YEEEIDNRDTRIKELTSDTSEIDKIVSR 152
           DL QK   IR L N+ L L+ +          Y+E++   + RI+ L S   +++  +S 
Sbjct: 100 DLKQKESQIRILQNRCLRLETEKQKMQDTISGYQEDLKENEIRIENLNSRLHKLEDELSA 159

Query: 153 KDAEIEKLSKENEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKE--NEANVKLCKEL 210
           K  EI  + +E +  +M L  K +Q  T+L  E+  ++  +E  +++   E  VK  + L
Sbjct: 160 KTHEIFSIGEELKNKTMKLNEKNSQFQTKLA-EISSENRNLERKVQKFREELIVKDQRSL 218

Query: 211 EKQVHELNINKEL-------DRLNH---------EIIHKDDEIRQNASKIKQLMSEIASK 254
           E    + N  K L       DRL++          I  +DD ++ +A  I++ MSE+  K
Sbjct: 219 EVHQDQENTQKVLKEVKQLSDRLDYLTPKRKDVSRIKERDDFLQFSAKIIEETMSELKLK 278

Query: 255 EESMKKLIKQNNELV----ERLNSVEPRIEVLTKDIQIKTEKLH----KL----AEEHEE 302
              +++ + +  ELV    E L  ++  +     D +  T+ LH    KL    A+   +
Sbjct: 279 NARLERELSEKEELVKVTKEELQELQKTVTQAMGDSEQATKYLHAENMKLTRQKADIRCD 338

Query: 303 LIQTRRN-----EKKLEL----DEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRL 353
           L++ RRN     EK+ EL    DE  E+++R+ E+ K  E E   L    + ++E+I  L
Sbjct: 339 LLEARRNLKGFDEKREELEKQRDEALEDVRRITELKKNVEIELRSLKLLAEEREEQIDEL 398

Query: 354 KEEIGKYKMLVQERDQLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAEKIKLRD 413
           K  +  Y++L ++ + + N L + ++    +  +  M DK       L+ E+AE  + R 
Sbjct: 399 KSRVAGYEVLRRDHEAVKNELAKAEEKLNKMGAHLVMADKQSSHFKTLK-ETAEGSRRR- 456

Query: 414 SELDKKHEVIAEMKLSMEA---NNRRI-QDLEEQLAENNAWINEQDLKIQEIQRTLLEPI 469
             +++ +E++A ++  +EA   N R++ Q+LE   AEN     + +   +EIQ T L+  
Sbjct: 457 -AIEQCNEMVARIR-GLEASLENQRKVEQELEMVKAENVRQAKKIEFMKEEIQETHLDYR 514

Query: 470 EPLSELV 476
           E LS+L 
Sbjct: 515 EELSKLA 521


>gi|26325538|dbj|BAC26523.1| unnamed protein product [Mus musculus]
          Length = 524

 Score = 40.8 bits (94), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 86/349 (24%), Positives = 166/349 (47%), Gaps = 37/349 (10%)

Query: 33  LLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKE 92
           LLE ++ +++    + D   +LA+  + K    RDL + T Q  +K       + ++K E
Sbjct: 127 LLEMQESIKDLSAIRADLANKLAEEEKAKKAVFRDLSELTAQVKSKEEETATAITQLKLE 186

Query: 93  FKELQEKYQRDLNQKTETIRSLTNKNL-ELKVQYEEEIDNRDTRIKELTSDTSEIDKIVS 151
            +++ ++   DL+   E+++    +N+ EL   +++E    ++ I+ L +++ E   +  
Sbjct: 187 -RDVHQRELEDLSSSLESVKLKHEQNIQELMKHFKKEKSEAESHIRMLKAESLEDKNMAK 245

Query: 152 RKDAEIEKLSKENEKLSMDLVFKENQ------EVTELMKEVDKKDNYIET------LLKE 199
               ++EKL  + EKL+ +L   EN+      +   L +E+DKK+ YI T       ++E
Sbjct: 246 AHLGQLEKLKSQCEKLTEELTHTENENKKLKLKYQSLKEELDKKEKYISTEEEHLRRMEE 305

Query: 200 NEANVK---LCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEE 256
           +  ++K   LC E E++     I KE+D             R +  K+K L S      +
Sbjct: 306 SRLHLKDQLLCLETEQESILGVIGKEIDEACKTFS------RDSLEKLKVLTSGPQLHYD 359

Query: 257 SMKKLIKQNNEL---VERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKL 313
             + L +   +L    E L   E R   + + +    ++L +L E            K+ 
Sbjct: 360 PHRWLAESKTKLQWLCEELKERESRERSMRQQLAACRQELRELTEH-----------KES 408

Query: 314 ELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKM 362
           EL  + E I+R E++L+E   E+ GLL + +RKDEE++ L++ +   +M
Sbjct: 409 ELLCLFEHIERQEQLLEEFHQEKRGLLEETQRKDEEVETLQDRVNALQM 457


>gi|148687007|gb|EDL18954.1| RIKEN cDNA 4930534B04 [Mus musculus]
          Length = 1026

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 86/349 (24%), Positives = 166/349 (47%), Gaps = 37/349 (10%)

Query: 33  LLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKE 92
           LLE ++ +++    + D   +LA+  + K    RDL + T Q  +K       + ++K E
Sbjct: 629 LLEMQESIKDLSAIRADLANKLAEEEKAKKAVFRDLSELTAQVKSKEEETATAITQLKLE 688

Query: 93  FKELQEKYQRDLNQKTETIRSLTNKNL-ELKVQYEEEIDNRDTRIKELTSDTSEIDKIVS 151
            +++ ++   DL+   E+++    +N+ EL   +++E    ++ I+ L +++ E   +  
Sbjct: 689 -RDVHQRELEDLSSSLESVKLKHEQNIQELMKHFKKEKSEAESHIRMLKAESLEDKNMAK 747

Query: 152 RKDAEIEKLSKENEKLSMDLVFKENQ------EVTELMKEVDKKDNYIET------LLKE 199
               ++EKL  + EKL+ +L   EN+      +   L +E+DKK+ YI T       ++E
Sbjct: 748 AHLGQLEKLKSQCEKLTEELTHTENENKKLKLKYQSLKEELDKKEKYISTEEEHLRRMEE 807

Query: 200 NEANVK---LCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEE 256
           +  ++K   LC E E++     I KE+D             R +  K+K L S      +
Sbjct: 808 SRLHLKDQLLCLETEQESILGVIGKEIDEACKTFS------RDSLEKLKVLTSGPQLHYD 861

Query: 257 SMKKLIKQNNEL---VERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKL 313
             + L +   +L    E L   E R   + + +    ++L +L E            K+ 
Sbjct: 862 PHRWLAESKTKLQWLCEELKERESRERSMRQQLAACRQELRELTEH-----------KES 910

Query: 314 ELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKM 362
           EL  + E I+R E++L+E   E+ GLL + +RKDEE++ L++ +   +M
Sbjct: 911 ELLCLFEHIERQEQLLEEFHQEKRGLLEETQRKDEEVETLQDRVNALQM 959


>gi|159877|gb|AAA29413.1| myosin-like antigen, partial [Onchocerca volvulus]
          Length = 630

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 97/210 (46%), Gaps = 39/210 (18%)

Query: 219 INKELDRLNHEIIHKDDEIRQNASK-IKQLMSEIASKEESMKKLIKQNNELVERLNSVEP 277
           +N ELDRL  E++     +R++A K I +  +++ +    +K L   NNE+  +LN+ E 
Sbjct: 274 LNMELDRLRDELL----SVRRDAEKEINRYNTDLQTARNEIKLLTSTNNEMKSQLNAAED 329

Query: 278 RIEVLTKDIQIKTEKLHKLAEE-HE---ELIQTRRNEKKL--ELDEMREEIKRLEEVLKE 331
           +I  L K I  +  K+  L  E H    EL   + N   L  ELD  RE I  L E    
Sbjct: 330 KINSLNKVITDQQNKIRDLTGEVHHLEGELKDAKGNVANLESELDTTRERIHLLGE---- 385

Query: 332 NECERAGLLTKEKRKDEEIKRLKEEI----GKYKMLVQER-------DQLANMLEEYKQN 380
              + A L T       E+ ++K +I    G+  ML   +       D+L   L+   +N
Sbjct: 386 ---QNASLQT-------ELNKIKGDIDSLFGENDMLKTAKESNEAEIDRLKQKLQRSIEN 435

Query: 381 AKDVVYYEDMKDKLEVELDKLRMESAEKIK 410
           AK    Y D  DKL  E D+L+    EKIK
Sbjct: 436 AK---KYSDALDKLRPEYDRLQNLYREKIK 462


>gi|160333189|ref|NP_861536.3| centrosomal protein of 128 kDa [Mus musculus]
 gi|166214933|sp|Q8BI22.2|CE128_MOUSE RecName: Full=Centrosomal protein of 128 kDa
 gi|219518696|gb|AAI45502.1| RIKEN cDNA 4930534B04 gene [Mus musculus]
          Length = 1102

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 86/349 (24%), Positives = 166/349 (47%), Gaps = 37/349 (10%)

Query: 33  LLEFEQELRETMKQKDDKLLELAKTARTKDQELRDLRQQTQQSVTKANTCEQHLEKIKKE 92
           LLE ++ +++    + D   +LA+  + K    RDL + T Q  +K       + ++K E
Sbjct: 629 LLEMQESIKDLSAIRADLANKLAEEEKAKKAVFRDLSELTAQVKSKEEETATAITQLKLE 688

Query: 93  FKELQEKYQRDLNQKTETIRSLTNKNL-ELKVQYEEEIDNRDTRIKELTSDTSEIDKIVS 151
            +++ ++   DL+   E+++    +N+ EL   +++E    ++ I+ L +++ E   +  
Sbjct: 689 -RDVHQRELEDLSSSLESVKLKHEQNIQELMKHFKKEKSEAESHIRMLKAESLEDKNMAK 747

Query: 152 RKDAEIEKLSKENEKLSMDLVFKENQ------EVTELMKEVDKKDNYIET------LLKE 199
               ++EKL  + EKL+ +L   EN+      +   L +E+DKK+ YI T       ++E
Sbjct: 748 AHLGQLEKLKSQCEKLTEELTHTENENKKLKLKYQSLKEELDKKEKYISTEEEHLRRMEE 807

Query: 200 NEANVK---LCKELEKQVHELNINKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEE 256
           +  ++K   LC E E++     I KE+D             R +  K+K L S      +
Sbjct: 808 SRLHLKDQLLCLETEQESILGVIGKEIDEACKTFS------RDSLEKLKVLTSGPQLHYD 861

Query: 257 SMKKLIKQNNEL---VERLNSVEPRIEVLTKDIQIKTEKLHKLAEEHEELIQTRRNEKKL 313
             + L +   +L    E L   E R   + + +    ++L +L E            K+ 
Sbjct: 862 PHRWLAESKTKLQWLCEELKERESRERSMRQQLAACRQELRELTEH-----------KES 910

Query: 314 ELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEEIKRLKEEIGKYKM 362
           EL  + E I+R E++L+E   E+ GLL + +RKDEE++ L++ +   +M
Sbjct: 911 ELLCLFEHIERQEQLLEEFHQEKRGLLEETQRKDEEVETLQDRVNALQM 959


>gi|444792584|gb|AGE12543.1| SzM [Streptococcus equi subsp. zooepidemicus]
          Length = 536

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 98/181 (54%), Gaps = 22/181 (12%)

Query: 82  CEQHLEKIKKEFKELQEKYQRDLNQKTETIR-SLTNKNLEL---KVQYEEEIDNRDTRIK 137
            E+ L+++++  +E QE+  + L+QK E  +  L N  L     K +YE E+D+   +++
Sbjct: 136 AEEKLKQVEESLRE-QEEQTKSLSQKLEDKQMDLENAALGYVAEKQRYEAELDDLKKQVE 194

Query: 138 ELTSDTSEIDKIVSRKDAEIEKLSKE-NEKLSMDLVF---------KENQEVTELMKEVD 187
           EL  + S  D+++S K +E+ KL++E N +  M   F         K+N E+ +L +++D
Sbjct: 195 ELEVELSSADELLSSKHSELTKLTQEFNRREKMYNTFLDQAKEEISKQNDEINQLTQDLD 254

Query: 188 KKDNYIETLLKENEANVKLCKELEKQVHELNINKELDRLNHEII-HKDDEIRQNASKIKQ 246
           ++ N  E  L +          L KQ  E+   K+LDR+N  ++ +  DE+ + ++K +Q
Sbjct: 255 RQINMYEAFLSQ------YKDALAKQEEEITKLKDLDRINRNLLGNAKDELDKLSAKNEQ 308

Query: 247 L 247
           L
Sbjct: 309 L 309


>gi|145521570|ref|XP_001446640.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414118|emb|CAK79243.1| unnamed protein product [Paramecium tetraurelia]
          Length = 537

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 58/106 (54%), Gaps = 17/106 (16%)

Query: 76  VTKANT-CEQHLE-------KIKKEFKELQEKYQRDLNQKTETIRSL--TNKNLELKVQY 125
           +TK  T CE+++E       +I+KE+ + QEK+ + L +K   +++L  T KN   K+  
Sbjct: 35  LTKYKTECEKYIETIQKEHERIQKEYGQEQEKFNKSLKEKINQVKALVDTEKNNTKKL-- 92

Query: 126 EEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDL 171
              I N   ++ E+T   S  D +V  KD EIE L KE +KL + +
Sbjct: 93  ---IANAQQKVVEITQ--SSFDSLVQVKDQEIENLKKEQQKLQIQI 133


>gi|366998906|ref|XP_003684189.1| hypothetical protein TPHA_0B00830 [Tetrapisispora phaffii CBS 4417]
 gi|357522485|emb|CCE61755.1| hypothetical protein TPHA_0B00830 [Tetrapisispora phaffii CBS 4417]
          Length = 2536

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 79/295 (26%), Positives = 132/295 (44%), Gaps = 64/295 (21%)

Query: 68   LRQQTQQSVTKANTCEQHLEKIKKE--------------FKELQEKYQRDLNQKTETIRS 113
            L +Q   +V K N  E+ L+++KKE                +LQEKY  ++N   + I  
Sbjct: 1317 LYKQYNDNVEKINNYEESLKQLKKEKTTLQANLKTTEAELNDLQEKYSANVNSYNDIISE 1376

Query: 114  LTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKL-------SKENEK 166
                    K +  EEI   D +IKEL     E DK+++ ++AEI+ +         + ++
Sbjct: 1377 --------KDKLTEEIKLSDDKIKELELVLEEKDKVITNQNAEIKNIKLDFEEKGNDYKR 1428

Query: 167  LSMDL------VFKENQEVTELM-KEVDKKDNYIETLLKENEANVKLCKEL--------- 210
              +DL      + KE+ E   LM K +   +  +ETL  E+  +  L  E          
Sbjct: 1429 TIVDLKAKLSDLTKESAESKLLMEKTIKDLEKNMETLKSESVISSNLSTERITSLELEIA 1488

Query: 211  ----EKQVHELNINKELDRLNHEIIH----KDDEIRQNASKIKQLMSEIASKEESMKKLI 262
                EK++++ N N ++  L  +II+    KDD   +N ++  +L SEI   +    K+I
Sbjct: 1489 KLTEEKKINQTNFNDKISELKSDIINLEKEKDDSKEKNKTESAELKSEIEELQNKNFKMI 1548

Query: 263  KQNNEL-------VERLNSVEPRIEV----LTKDIQIKTEKLHKLAEEHEELIQT 306
            +++          VERL   +   +V    LT D++ K E LHKL E+    I +
Sbjct: 1549 EEHTITLDALKTDVERLEKAKEDFQVQNNSLTSDLETKNELLHKLKEKSANTISS 1603


>gi|145504665|ref|XP_001438299.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124405471|emb|CAK70902.1| unnamed protein product [Paramecium tetraurelia]
          Length = 986

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 92/333 (27%), Positives = 171/333 (51%), Gaps = 35/333 (10%)

Query: 63  QELRDLRQQTQQSVTKANTCEQHLEK----------IKKEFKELQ-EKYQRDLNQKTETI 111
           Q+  DLR+Q Q+   + N  +Q LE           IK +  ELQ EK Q  LN K   +
Sbjct: 422 QQSDDLRRQNQELAQENNNLQQDLENQTQNLGQLDEIKDQLNELQDEKNQ--LNDKVSDL 479

Query: 112 RSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKENEKLSMDL 171
           ++   +   L  Q ++E+++   R+K+L +   E+D  +   + +++K  K+N++L+ + 
Sbjct: 480 QNNLKEKQRLFDQKQKELEDALKRVKDLEAKLLEMDHYIDTLEDDLQKFEKDNQQLNREA 539

Query: 172 VFKE--NQEVTELMKEVDKKDNYIETLLKEN---EANVKLCKELEKQVHELNINKELDRL 226
             K+  ++E+  L   +D+  N  +   KE    + N+   ++L+ ++ +     ELDR 
Sbjct: 540 GQKQLADRELERLRGLLDQMKNQYDQQQKELGKLKNNLDQMRDLQDELAQ--AKSELDRA 597

Query: 227 NHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDI 286
           N  I  + DE+ Q  ++I QL+ E+ + EES  +L  QNN L + L       + +T   
Sbjct: 598 NSVIAQQQDELAQKENEISQLVREVQNLEESNNQLQDQNNNLQQTLQEQ----QAVTNGN 653

Query: 287 QIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRK 346
           Q +  KL ++AE+++E I      ++LEL     E   +EE LK+++   A L+T+ +R 
Sbjct: 654 QEELTKLRRIAEDYKEKI------RQLEL--KFNEYSNMEERLKQSDHRIAVLMTEIERL 705

Query: 347 DEEIKRLKEEIGKYKMLVQERDQLANMLEEYKQ 379
           +  +K++  E   +K   Q+  ++   L EY+Q
Sbjct: 706 NSLVKQITSETEDWK---QKHSRIELALLEYRQ 735


>gi|441622736|ref|XP_003264109.2| PREDICTED: LOW QUALITY PROTEIN: CDK5 regulatory subunit-associated
           protein 2 [Nomascus leucogenys]
          Length = 1890

 Score = 38.9 bits (89), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 159/338 (47%), Gaps = 45/338 (13%)

Query: 132 RDTRIKELTSDTSEIDKIVSRKDAEIEKLS----KENEKL----SMDLVFKENQEVTELM 183
           RD  I+ LT      +K V   ++EIEKLS    K  E L    + +    E+ E T   
Sbjct: 315 RDKAIQGLTMALKSKEKKVEELNSEIEKLSAAFAKAREALRKAQTQEFQGSEDYEATLSG 374

Query: 184 KEVDKKDNYIETLLKENEANVKLCKELEKQVHELN-INKELDRLNHEI--IHK-----DD 235
           KE    +   + L K  E N +LC+ + K   EL+ + +E +RL  ++   H+     D 
Sbjct: 375 KEALLAELRSQNLTKSTE-NHRLCRSIXKITQELSDLQQERERLEXDLEEAHREKSKGDC 433

Query: 236 EIRQNASKIKQLMSEIASKEESM----KKLIKQNNELVERLNSVEPRIEVLTKDIQIKTE 291
            IR   +++++L +E+  +E++M    K L+ ++N+   +L++ E  I+ LT+    K  
Sbjct: 434 TIRDLRNEVEKLRNEVNEREKAMENRYKSLLSESNK---KLHNQEQVIKHLTESTNQKDV 490

Query: 292 KLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKEKRKDEE-- 349
            L K  E+  E+IQ        E  E+R E    E+   +       + +KEK++  E  
Sbjct: 491 LLQKFNEKDLEVIQQNHYLMAAEDLELRSEGLITEKCSSQRSPGSETIFSKEKKQSSEYE 550

Query: 350 --IKRLKEEIGKYKMLV---QERDQLANMLEEY-------KQNAKDVVYYEDMKDKLEVE 397
             I+ LK+E   Y  LV   QE D + N+  E        KQ  +DV+ Y++++  LE +
Sbjct: 551 ELIQVLKKEQDIYTHLVKSLQESDSINNLQAELNNIFALRKQLEQDVLSYQNLRKTLEEQ 610

Query: 398 LDKLRMESAEKIKLRDSELDKKHEVIAEMKLSMEANNR 435
           + ++R    E   L    +D+     + + + +E NNR
Sbjct: 611 IGEIRRREEESFSLY---IDQ----TSYLSICLEENNR 641


>gi|297813019|ref|XP_002874393.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297320230|gb|EFH50652.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1191

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 61/99 (61%), Gaps = 11/99 (11%)

Query: 244 IKQLMSEIASKEESM-----KKLIKQNNELVERLNSVEPRIEVLTKDIQIKTEKLHKLAE 298
           I++L SEI SKEE++     K+L+KQ      +L+S E R+E  T +  IK ++L  + +
Sbjct: 464 IRKLSSEIVSKEETIQQLSEKQLVKQR-----KLDSTEKRLENTTAEFVIKEKELGSVKD 518

Query: 299 EHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERA 337
            + E +Q    ++K EL  ++EEIK++++ LK+ + + A
Sbjct: 519 TYRECLQNWEIKEK-ELKSLQEEIKKIQDSLKDFQSKEA 556


>gi|225619396|ref|YP_002720622.1| hypothetical protein BHWA1_00453 [Brachyspira hyodysenteriae WA1]
 gi|225214215|gb|ACN82949.1| hypothetical protein BHWA1_00453 [Brachyspira hyodysenteriae WA1]
          Length = 7854

 Score = 38.9 bits (89), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 68/123 (55%), Gaps = 8/123 (6%)

Query: 230  IIHK--DDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLTKDIQ 287
            II+K  +DE+ +   KI   +S +  K +S   ++  NN + + +NS+  + E L+KD  
Sbjct: 3658 IIYKKIEDEVSKINEKIDSNLSTVTEKFDS--NIVSLNNSIYDDINSIIEKYESLSKDFD 3715

Query: 288  IKTEKLHKLAEEHEELIQTRRNEKKL---ELDEMREEIKRL-EEVLKENECERAGLLTKE 343
               E L     +  EL++T+ +EK+L   ++DE+++E +   E++L  N+     L  +E
Sbjct: 3716 SSLEVLKDSILDRNELLETQSSEKELLVSKVDEIKKEFESFKEDILNTNKESIPALFDEE 3775

Query: 344  KRK 346
            K+K
Sbjct: 3776 KQK 3778


>gi|156839520|ref|XP_001643450.1| hypothetical protein Kpol_483p11 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156114061|gb|EDO15592.1| hypothetical protein Kpol_483p11 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 2546

 Score = 38.9 bits (89), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 80/284 (28%), Positives = 137/284 (48%), Gaps = 49/284 (17%)

Query: 210  LEKQVHELNI-NKELDRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNEL 268
            L+K   +L I N   D+ N E+   ++ IR+  SKI  L  ++  KE      +    +L
Sbjct: 2198 LKKVTSDLQIANNTSDKYNDELKVANNTIREIESKIPNLQKQLDLKEIEYNDTLSSKKDL 2257

Query: 269  VERLNSVEPRIEVLTKDIQ-IKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEE 327
             ++L++     E+L+K+++ +K EKL  L +E  +LI        +ELD+ + +I  L  
Sbjct: 2258 DKKLDNFSKESEILSKEVKSLKKEKLD-LEKEKNDLI--------VELDKYKNDIIDLNN 2308

Query: 328  VLKEN-------ECERAGLLTKEKRKDEEIKRLKEEIGK----YKMLVQERD-------- 368
             LKEN       E E  GL  K +     + +L EE GK     K LV ERD        
Sbjct: 2309 TLKENEKNGSEVETEIIGLKDKIEFYKLNVSQLNEEKGKLESDIKTLVGERDDKLKSSQS 2368

Query: 369  ---QLANMLEEYKQNAKDVVYYEDMKDKLEVELDKLRMESAE---KIKLRDSELDKKHEV 422
               +L N LE +K  +   V YE+ K    VE D L+ + +E    ++  +  +D  ++ 
Sbjct: 2369 EIIELTNKLESFKDIS---VAYEESK----VESDGLKNKLSEIEKSLEESNENVDSANKE 2421

Query: 423  IAEMKL---SMEANNRRIQDLEEQLAENNAWINEQDLKIQEIQR 463
            I+++K+    ++++N  ++D  +QL  +    N    KIQ+I +
Sbjct: 2422 ISDLKVMNDKLQSHNLELEDKFQQLKNDTTETNR---KIQDINK 2462


>gi|85074861|ref|XP_965798.1| hypothetical protein NCU00658 [Neurospora crassa OR74A]
 gi|28927611|gb|EAA36562.1| predicted protein [Neurospora crassa OR74A]
          Length = 4007

 Score = 38.1 bits (87), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 132/274 (48%), Gaps = 23/274 (8%)

Query: 104  LNQKTETIRSLTNKNLELKVQYEEEIDNRDTRIKELTSDTSEIDKIVSRKDAEIEKLSKE 163
            L Q+ E I  L N+ +EL     + +D RDT   + T + + + + V R   E  KL K+
Sbjct: 3115 LAQRMEEINGLRNEMVEL----NKALDTRDTTFLQNTDEINRLKENVRRLGDETTKLKKD 3170

Query: 164  NEKLSMDLVFKENQEVTELMKEVDKKDNYIETLLKENEANVKLCKELEKQVHELNINKEL 223
              KL  D    E + V +   E++K ++ I+ L +E +    L    + +++ L    E+
Sbjct: 3171 TVKLKEDSKSWE-ETVKQRQTEINKLNDNIKNLQEEIKRKEALLATRQGEINALK--DEI 3227

Query: 224  DRLNHEIIHKDDEIRQNASKIKQLMSEIASKEESMKKLIKQNNELVERLNSVEPRIEVLT 283
              L  ++  KD +++    ++ +    IA+KE ++++L K+   L E++  +E  +    
Sbjct: 3228 VGLKKDLAEKDAQLKSRDGELGKFRKSIAAKETALERLEKEKTALREKVEHLEGEVGRRR 3287

Query: 284  KDIQIKTEKLHKLAEEHEELIQTRRNEKKLELDEMREEIKRLEEVLKENECERAGLLTKE 343
            + + ++ +K+ +L            +  +L+LD+ R   K LEE          GL  + 
Sbjct: 3288 RSLDLRADKILELTNSE--------SAARLDLDKQRARNKSLEET-------NTGLRNRA 3332

Query: 344  KRKDEEIKRLKEEIGK-YKMLVQERDQLANMLEE 376
             ++   + RL +E+ K  + L+++ + LA++ +E
Sbjct: 3333 AKEGGSLGRLGDEVTKLSRQLLEKENDLASLRDE 3366


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.308    0.128    0.325 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,601,109,423
Number of Sequences: 23463169
Number of extensions: 279966843
Number of successful extensions: 3424240
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6120
Number of HSP's successfully gapped in prelim test: 151506
Number of HSP's that attempted gapping in prelim test: 2277348
Number of HSP's gapped (non-prelim): 607958
length of query: 495
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 348
effective length of database: 8,910,109,524
effective search space: 3100718114352
effective search space used: 3100718114352
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 79 (35.0 bits)