RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4218
         (520 letters)



>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score =  250 bits (640), Expect = 2e-80
 Identities = 135/231 (58%), Positives = 165/231 (71%), Gaps = 17/231 (7%)

Query: 160 DYDSKRDRWAGYDPSNHKAIVEHYQKIEEAKRELRADKLDAGVSIDNRGSKEGDEDKDDL 219
           DYD+KRDRW GYDPS +K +VE Y+K+EE +++L+A K D             D D +D 
Sbjct: 3   DYDAKRDRWNGYDPSEYKEVVEEYEKLEELRKKLKAKKEDDSD----------DTDYED- 51

Query: 220 EEDEDKYVDEVDMPGTKVDSKQRITVRNLRIREDTAKYLRNLDPNSAYYDPKTRSMRDNP 279
            EDE+ Y+DE DM  +  DSK R TVRNLRIREDTAKYL NLD NSAYYDPK+RSMRD+P
Sbjct: 52  -EDEELYLDESDMGDS--DSKTRTTVRNLRIREDTAKYLLNLDSNSAYYDPKSRSMRDDP 108

Query: 280 YETVANPEDVDYAGENFVRFSGDTRKHATAQLFAWEAYEKGVDVHVLAEPTKLELLKKEY 339
               A+PEDV +AG+NFVR SG+  +    Q FAWEA EKG DVH+ A PTKLELL+K+ 
Sbjct: 109 L---ADPEDVLFAGDNFVRLSGEALEFEELQKFAWEAAEKGGDVHLQANPTKLELLRKKE 165

Query: 340 EEKKEEFKKEVSSSVIDKYGGAEHLQAPPKALLLAQTEDYVEYSRYGKVIK 390
           +EKKE+ K +   S+++KYGG EHL  PPK LLL Q+EDYVEY R GK  K
Sbjct: 166 KEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKK 216


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 36.2 bits (84), Expect = 0.025
 Identities = 18/58 (31%), Positives = 28/58 (48%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
           DE+E++   +++ K     K KK  +   EEK K   +KEE+  RE +     DE   
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAE 95



 Score = 33.1 bits (76), Expect = 0.25
 Identities = 13/49 (26%), Positives = 26/49 (53%)

Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
            +E++E+   +K K   K+K KK  K K  E++  +  K+E+  +   +
Sbjct: 38  DEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEE 86



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 13/50 (26%), Positives = 32/50 (64%)

Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREA 487
           E +D+D +DS  +++ + K+ +K K   +  A++ +K  ++++E+ +RE 
Sbjct: 27  EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREK 76



 Score = 28.5 bits (64), Expect = 8.6
 Identities = 18/62 (29%), Positives = 30/62 (48%), Gaps = 4/62 (6%)

Query: 441 DEDEEDSGHKKQ----KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDER 496
           +EDEE    K +     K KK+ K K E+++ A+ + +E   +E EE    D L      
Sbjct: 40  EEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLRL 99

Query: 497 KR 498
           ++
Sbjct: 100 RK 101


>gnl|CDD|130009 TIGR00934, 2a38euk, potassium uptake protein, Trk family.  The
           proteins of the Trk family are derived from
           Gram-negative and Gram-positive bacteria, yeast and
           wheat. The proteins of E. coli K12 TrkH and TrkG as well
           as several yeast proteins have been functionally
           characterized.The E. coli TrkH and TrkG proteins are
           complexed to two peripheral membrane proteins, TrkA, an
           NAD-binding protein, and TrkE, an ATP-binding protein.
           This complex forms the potassium uptake system. This
           family is specific for the eukaryotic Trk system
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 800

 Score = 36.9 bits (85), Expect = 0.030
 Identities = 20/141 (14%), Positives = 43/141 (30%), Gaps = 3/141 (2%)

Query: 206 NRGSKEGDEDKDDLEEDEDKYVDEVDMPGTKVDSKQRITVRNLRIREDTAKYLRNLDPNS 265
              ++  D ++ D  +D +K   +      +     R    NL+ R+ +     +     
Sbjct: 309 IDRAEACDLEELDRAKDFEKMTYDNWKAHHRKKKNFRPRGWNLKFRKASRFPKDSDRNYE 368

Query: 266 AYYDPKTRS---MRDNPYETVANPEDVDYAGENFVRFSGDTRKHATAQLFAWEAYEKGVD 322
              +  + S     + P  +        Y  +        ++  +T  L       +  +
Sbjct: 369 DNGNHLSASSSFGSEEPSLSSEENLYPTYNKKREDSRHTLSKTMSTNYLSWQPTIGRNSN 428

Query: 323 VHVLAEPTKLELLKKEYEEKK 343
              L +  K EL   EY   K
Sbjct: 429 FVGLTKEQKDELGGIEYRALK 449


>gnl|CDD|215323 PLN02598, PLN02598, omega-6 fatty acid desaturase.
          Length = 421

 Score = 36.0 bits (83), Expect = 0.039
 Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 10/54 (18%)

Query: 24  PKKKSREDWRKAKELEEARKAGTA----PAAVDEHGRDINPHIPQYISSA-PWY 72
           P K++RE W  A    +A+  GT     PA ++    DI+ HIP +ISS  P Y
Sbjct: 305 PFKQARE-WNAA----QAQLNGTVHCDYPAWIEFLCHDISVHIPHHISSKIPSY 353


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 35.1 bits (81), Expect = 0.049
 Identities = 17/48 (35%), Positives = 29/48 (60%)

Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
             E + E+EE    KK+K+ KK KK+KK+K++   E      KK++++
Sbjct: 144 EKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKK 191



 Score = 30.1 bits (68), Expect = 1.8
 Identities = 19/75 (25%), Positives = 40/75 (53%), Gaps = 1/75 (1%)

Query: 424 GASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
           GA  GPP  + +  E   E E  +  +K+ + ++ +KK+K+K+K  +++ KE   K+E+ 
Sbjct: 119 GAPDGPPSELGSESET-SEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKM 177

Query: 484 QREADRLLRMDERKR 498
                   +  ++K+
Sbjct: 178 VEPKGSKKKKKKKKK 192



 Score = 29.7 bits (67), Expect = 3.0
 Identities = 17/73 (23%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 428 GPPPGIMNRPEHQ-DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQRE 486
           G P G  +    + +  E+++  K +K+ +  +++KKEK+K  E K ++  KK+++E+  
Sbjct: 119 GAPDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMV 178

Query: 487 ADRLLRMDERKRS 499
             +  +  ++K+ 
Sbjct: 179 EPKGSKKKKKKKK 191


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 35.9 bits (82), Expect = 0.063
 Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 2/61 (3%)

Query: 440  QDEDEEDSGHKKQKKNKKSKKKKK--EKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
              E  +        + + +++K +  EK+K   +K  +A KK+ EE+++AD   +  E  
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403

Query: 498  R 498
            +
Sbjct: 1404 K 1404



 Score = 35.1 bits (80), Expect = 0.12
 Identities = 15/49 (30%), Positives = 28/49 (57%)

Query: 450  KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
            K  +  K   +   ++ ++AEEK + A KK+EE +++AD   +  E K+
Sbjct: 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391



 Score = 34.3 bits (78), Expect = 0.22
 Identities = 19/57 (33%), Positives = 34/57 (59%)

Query: 443  DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRS 499
            +E     + +KK +++KKK  E +K+AE K K    K+ EE ++AD   + +E K++
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKA 1536



 Score = 34.0 bits (77), Expect = 0.23
 Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDER 496
            E + + EE     + KK  +  KK  E +K AEE  K E  KK+ EE ++AD   +  E 
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481

Query: 497  KR 498
             +
Sbjct: 1482 AK 1483



 Score = 34.0 bits (77), Expect = 0.27
 Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 444  EEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEA--LKKEEEEQREADRLLRMDERKRS 499
            EE     + KK  +  KK  E +K AEE  K+A   KK  E +++AD   + +E K++
Sbjct: 1467 EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKA 1524



 Score = 34.0 bits (77), Expect = 0.28
 Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 443  DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDERKR 498
            +E+     + KK   +KKK  E +K AEEK K +  KK+ EE ++AD   +  E  +
Sbjct: 1401 EEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1457



 Score = 33.6 bits (76), Expect = 0.33
 Identities = 19/62 (30%), Positives = 32/62 (51%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
            E + + EE      + K     KKK ++ K AEE  K    K+ EE ++AD   + +E+K
Sbjct: 1487 EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546

Query: 498  RS 499
            ++
Sbjct: 1547 KA 1548



 Score = 33.6 bits (76), Expect = 0.34
 Identities = 18/57 (31%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 443  DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDERKR 498
            +E+    + +KK ++ KKK  E +K+A  K K +  KK+ EE+++AD   +  E  +
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444



 Score = 33.6 bits (76), Expect = 0.37
 Identities = 20/65 (30%), Positives = 36/65 (55%), Gaps = 4/65 (6%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKK----EKEKSAEEKMKEALKKEEEEQREADRLLRM 493
            E +  +EE+     ++  K  + KKK    +K +  E+K  EALKKE EE ++A+ L + 
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710

Query: 494  DERKR 498
            +  ++
Sbjct: 1711 EAEEK 1715



 Score = 33.2 bits (75), Expect = 0.49
 Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDER 496
            E +          + KK  + KKK  E +K AEE  K +  KK+ EE ++A+   +  E 
Sbjct: 1409 ELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEE 1468

Query: 497  KR 498
             +
Sbjct: 1469 AK 1470



 Score = 32.8 bits (74), Expect = 0.49
 Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 2/59 (3%)

Query: 442  EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK--EALKKEEEEQREADRLLRMDERKR 498
            E  E+     +KK +++KKK    +K AEEK K  EA KK EE++++AD L +    K+
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418



 Score = 32.8 bits (74), Expect = 0.52
 Identities = 23/63 (36%), Positives = 34/63 (53%), Gaps = 2/63 (3%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK--EALKKEEEEQREADRLLRMDE 495
            E + + EE    ++ KK  +  KK  E +K AEE  K  EA KK EE +++AD   +  E
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE 1507

Query: 496  RKR 498
             K+
Sbjct: 1508 AKK 1510



 Score = 32.8 bits (74), Expect = 0.56
 Identities = 21/75 (28%), Positives = 40/75 (53%), Gaps = 4/75 (5%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK----EALKKEEEEQREADRLLRM 493
            E +  +EE    ++ KK +  +KKK E+ K AEE+ K    E  KK EE++++A+   + 
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683

Query: 494  DERKRSYNSMISVKE 508
            +E ++     +  + 
Sbjct: 1684 EEDEKKAAEALKKEA 1698



 Score = 32.4 bits (73), Expect = 0.83
 Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 2/58 (3%)

Query: 443  DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEA--LKKEEEEQREADRLLRMDERKR 498
            +E        KK  + KKK  E +K AEE  K+A  LKK    +++AD   +  E K+
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKK 1431



 Score = 32.0 bits (72), Expect = 0.90
 Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 4/78 (5%)

Query: 442  EDEEDSGHKKQKKNKKSKKKKKEKEKSAEE----KMKEALKKEEEEQREADRLLRMDERK 497
            E EE    ++ KK +  +KKK E+ K AEE    K +EA K+ EE++++A+   + +E K
Sbjct: 1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEK 1756

Query: 498  RSYNSMISVKEPTMDSDR 515
            +    +   +E   +  R
Sbjct: 1757 KKIAHLKKEEEKKAEEIR 1774



 Score = 32.0 bits (72), Expect = 0.90
 Identities = 16/57 (28%), Positives = 28/57 (49%)

Query: 443  DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRS 499
            +E     + +KK +++KKK    +K AEE  K A   + E +  AD     +E+  +
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEA 1368



 Score = 32.0 bits (72), Expect = 1.1
 Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 1/62 (1%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EALKKEEEEQREADRLLRMDER 496
            E + + EE     + KK  +  KK +E +K AEE  K +  KK+ EE ++AD   +  E 
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEE 1494

Query: 497  KR 498
             +
Sbjct: 1495 AK 1496



 Score = 31.6 bits (71), Expect = 1.1
 Identities = 15/65 (23%), Positives = 38/65 (58%)

Query: 434  MNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRM 493
            + + E +++ + +   K +++NK    ++ +K +  ++K +EA K EE+E++ A+ L + 
Sbjct: 1638 LKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE 1697

Query: 494  DERKR 498
             E  +
Sbjct: 1698 AEEAK 1702



 Score = 31.6 bits (71), Expect = 1.2
 Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 3/64 (4%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEE---KMKEALKKEEEEQREADRLLRMD 494
            E +  +E+    ++ KK ++ +KK  E  K   E   K +E  KKE EE+++A+ L + +
Sbjct: 1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAE 1725

Query: 495  ERKR 498
            E  +
Sbjct: 1726 EENK 1729



 Score = 30.9 bits (69), Expect = 1.9
 Identities = 20/63 (31%), Positives = 36/63 (57%), Gaps = 1/63 (1%)

Query: 438  EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEE-KMKEALKKEEEEQREADRLLRMDER 496
            E +  DE     ++ KK  ++KKK +E +K A+E K     KK+ +E ++A+   + DE 
Sbjct: 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527

Query: 497  KRS 499
            K++
Sbjct: 1528 KKA 1530



 Score = 30.5 bits (68), Expect = 2.7
 Identities = 17/63 (26%), Positives = 35/63 (55%)

Query: 436  RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
            +   + + + D   K ++  K  + KK E+ K A+E  K   KK+ +E ++A+ L + +E
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562

Query: 496  RKR 498
            +K+
Sbjct: 1563 KKK 1565



 Score = 30.5 bits (68), Expect = 2.7
 Identities = 18/66 (27%), Positives = 31/66 (46%), Gaps = 2/66 (3%)

Query: 436  RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK--EALKKEEEEQREADRLLRM 493
            + +   + EE     + KK  +  KK  E +K AEE  K  +A KK+ EE ++A    + 
Sbjct: 1291 KADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350

Query: 494  DERKRS 499
            +    +
Sbjct: 1351 EAEAAA 1356



 Score = 30.5 bits (68), Expect = 2.8
 Identities = 17/55 (30%), Positives = 31/55 (56%)

Query: 445  EDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRS 499
             ++  K  +  K  + KK ++ K AEE  K    K+ EE+++AD L + +E K++
Sbjct: 1506 AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKA 1560



 Score = 30.1 bits (67), Expect = 3.4
 Identities = 16/63 (25%), Positives = 32/63 (50%), Gaps = 5/63 (7%)

Query: 442  EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEE-----EEQREADRLLRMDER 496
            ++ +    + +KK   +KKK +E +K+AE    EA    +     EE+ EA    + + +
Sbjct: 1318 DEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377

Query: 497  KRS 499
            K++
Sbjct: 1378 KKA 1380



 Score = 29.0 bits (64), Expect = 7.5
 Identities = 22/65 (33%), Positives = 33/65 (50%)

Query: 434  MNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRM 493
              +   +D+ + +   K +++ KK    KKE+EK AEE  KE     EEE  E D   RM
Sbjct: 1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRM 1794

Query: 494  DERKR 498
            +  K+
Sbjct: 1795 EVDKK 1799



 Score = 29.0 bits (64), Expect = 7.8
 Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 4/72 (5%)

Query: 441  DEDEEDSGHKKQKKNKKSKKKKK--EKEKSAEEKMK--EALKKEEEEQREADRLLRMDER 496
            DE ++    KK  + KK+++KKK  E +K AEE  K  EA KK EE +++AD   +  E 
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE 1340

Query: 497  KRSYNSMISVKE 508
             +        + 
Sbjct: 1341 AKKAAEAAKAEA 1352



 Score = 28.6 bits (63), Expect = 9.8
 Identities = 16/54 (29%), Positives = 29/54 (53%)

Query: 445  EDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
            E++    + K  + KKK  E +K+ E+K  +  KK+ EE ++AD   +  E  +
Sbjct: 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 34.5 bits (80), Expect = 0.064
 Identities = 14/46 (30%), Positives = 27/46 (58%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
           +++ K  + +K++ EK  +  E   EA++K EEE+R+ +     DE
Sbjct: 126 EQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADE 171


>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6.  RPL6
           contains KOW motif that has an extra ribosomal role as
           an oncogenic. KOW domain is known as an RNA-binding
           motif that is shared so far among some families of
           ribosomal proteins, the essential bacterial
           transcriptional elongation factor NusG, the eukaryotic
           chromatin elongation factor Spt5, the higher eukaryotic
           KIN17 proteins and Mtr4. .
          Length = 152

 Score = 33.7 bits (78), Expect = 0.086
 Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKE-EEEQREADRLL 491
           K+ KK KKKKKE E   E+K K  + +E +E+Q+  D  L
Sbjct: 82  KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKAVDAAL 121


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 34.2 bits (79), Expect = 0.16
 Identities = 14/59 (23%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
           ++E+E+     ++++ +++++KK+EK+K   E     L    EEQR+ +   R  + ++
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKL--SPEEQRKLEEKERKKQARK 322


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 33.0 bits (76), Expect = 0.19
 Identities = 15/46 (32%), Positives = 30/46 (65%), Gaps = 4/46 (8%)

Query: 450 KKQKKNKKSKKKKKEKEK----SAEEKMKEALKKEEEEQREADRLL 491
           KK KK KK K+K++++ +      ++++K+A ++ + E+ E DR L
Sbjct: 15  KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDREL 60


>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
          Length = 613

 Score = 33.5 bits (78), Expect = 0.25
 Identities = 13/41 (31%), Positives = 27/41 (65%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
           K+ ++ K+ K++ +E+ K   E++KEAL  + ++ R + RL
Sbjct: 478 KEDEEEKEEKEEAEEEFKPLLERLKEALGDKVKDVRLSHRL 518


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 32.7 bits (75), Expect = 0.30
 Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 448 GHKKQKKNKKSKKK-KKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSM 503
             +K K+ +K  K+ +KE  ++ E    + LKK +E+Q E       D+R+      
Sbjct: 70  DQEKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEM----MDDQRELMKMQF 122


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 33.0 bits (76), Expect = 0.31
 Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 2/58 (3%)

Query: 462 KKEKEKSAEEKMKEALKKEEEEQREADRLL--RMDERKRSYNSMISVKEPTMDSDRGN 517
            KEK   AE    EA + E+E  RE  +     M+ ++RSY   +      M+++R  
Sbjct: 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREK 258


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 33.0 bits (76), Expect = 0.37
 Identities = 19/39 (48%), Positives = 27/39 (69%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
           KK+KK K    KKKE+E+  E++ KE  K+EEEE+ E +
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465



 Score = 33.0 bits (76), Expect = 0.39
 Identities = 13/40 (32%), Positives = 26/40 (65%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
             +K  KK +++KKEK+K A    K+  ++EEE++++ + 
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454



 Score = 32.2 bits (74), Expect = 0.61
 Identities = 17/40 (42%), Positives = 25/40 (62%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
           KK KK  +  +KK+E+EK  ++K   A KK+EEE+ E   
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449



 Score = 32.2 bits (74), Expect = 0.63
 Identities = 11/46 (23%), Positives = 25/46 (54%)

Query: 443 DEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
            ++ +   K+   K  KK+++EK++  ++      K+EEEE+ +  
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450



 Score = 32.2 bits (74), Expect = 0.65
 Identities = 16/51 (31%), Positives = 31/51 (60%)

Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
             +  +EE    KK+    K K++++E+EK  +E+ KE  ++E EE++E +
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469



 Score = 31.8 bits (73), Expect = 1.0
 Identities = 17/41 (41%), Positives = 26/41 (63%)

Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
            KK+++ KK KKKK    K  EE+ +E  +K+EEE+ E + 
Sbjct: 420 EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE 460



 Score = 30.3 bits (69), Expect = 2.5
 Identities = 14/48 (29%), Positives = 29/48 (60%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
           ++++K +K KK    K+K  EE+ ++  K+EE+E+ E +     +E +
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEE 470



 Score = 29.5 bits (67), Expect = 4.8
 Identities = 17/66 (25%), Positives = 34/66 (51%), Gaps = 1/66 (1%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSMISVKEP 509
           K +K  +K++KK++E++K  ++K     KKEEEE+ E ++     E +    +    +E 
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEE-EAEEEKEEE 469

Query: 510 TMDSDR 515
                +
Sbjct: 470 EEKKKK 475



 Score = 29.5 bits (67), Expect = 4.9
 Identities = 14/50 (28%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 441 DEDEED--SGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
            E+E +  +G KK  K  K   +K EK++  E+K K+      +++ E +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 33.3 bits (76), Expect = 0.40
 Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 4/78 (5%)

Query: 432 GIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
              +  E +   EE+     ++K KK +K K+++ K  +   KEA  K + +Q      +
Sbjct: 3   RTESEAEKKILTEEEL----ERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNV 58

Query: 492 RMDERKRSYNSMISVKEP 509
                K+S    +  + P
Sbjct: 59  PKKSEKKSRKRDVEDENP 76


>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
          Length = 434

 Score = 33.0 bits (76), Expect = 0.42
 Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 4/44 (9%)

Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
           RP+ +      S  K  K +KK   K+ EK++  +EK K   + 
Sbjct: 377 RPKTK----APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRH 416



 Score = 28.8 bits (65), Expect = 8.9
 Identities = 13/41 (31%), Positives = 19/41 (46%), Gaps = 5/41 (12%)

Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
            K   + K  K  KK   K AE+K     +KE+E+ +   R
Sbjct: 380 TKAPSEKKTGKPSKKVLAKRAEKK-----EKEKEKPKVKKR 415


>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.  This
           domain is found to the N-terminus of bacterial signal
           peptidases of the S49 family (pfam01343).
          Length = 154

 Score = 31.7 bits (73), Expect = 0.42
 Identities = 12/33 (36%), Positives = 18/33 (54%), Gaps = 2/33 (6%)

Query: 449 HKKQKK--NKKSKKKKKEKEKSAEEKMKEALKK 479
            KK+ K   K  KK +K K K+ ++K K+   K
Sbjct: 64  DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEPK 96



 Score = 30.6 bits (70), Expect = 0.95
 Identities = 11/32 (34%), Positives = 19/32 (59%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEK 472
           D+ E  +  K +KK +K+K K ++K+   EE 
Sbjct: 64  DKKELKAWEKAEKKAEKAKAKAEKKKAKKEEP 95


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 32.2 bits (73), Expect = 0.46
 Identities = 20/58 (34%), Positives = 28/58 (48%), Gaps = 2/58 (3%)

Query: 429 PPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQRE 486
           P PG  + P+  D  E+D   +     KK +KKKKE EK A+   K     E + + E
Sbjct: 68  PSPG--HPPQTNDTHEKDLALQPPPGGKKKEKKKKETEKPAQGGEKPDQGPEAKGEGE 123


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 32.4 bits (74), Expect = 0.46
 Identities = 19/49 (38%), Positives = 29/49 (59%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
           K Q K +K K+KKKEKE + EE++ EA + E    +  +R    +E K+
Sbjct: 166 KIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYEEQEEEKK 214


>gnl|CDD|215456 PLN02850, PLN02850, aspartate-tRNA ligase.
          Length = 530

 Score = 32.8 bits (75), Expect = 0.55
 Identities = 15/49 (30%), Positives = 19/49 (38%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
            E  E+SG K  KK  K    K EK +        A   E+E+   A  
Sbjct: 3   QEAVEESGEKISKKAAKKAAAKAEKLRREATAKAAAASLEDEDDPLASN 51


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 31.6 bits (72), Expect = 0.56
 Identities = 18/73 (24%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
           R + ++ +EE+   +K+ + K+  ++K+E+E    E+ +E  K+EEE +   ++  +  E
Sbjct: 17  RRQQREAEEEEREERKKLEEKREGERKEEEEL---EEEREKKKEEEERKEREEQARKEQE 73

Query: 496 RKRSYNSMISVKE 508
                 S   V+E
Sbjct: 74  EYEKLKSSFVVEE 86


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 31.1 bits (71), Expect = 0.59
 Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 7/40 (17%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
           KK +K ++  +K+ EKE  A E+      K+EEE REA R
Sbjct: 79  KKNEKEERELRKRAEKE--ALEQ-----AKKEEELREAKR 111


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 32.6 bits (75), Expect = 0.61
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 5/52 (9%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEA----LKKEEEEQREAD 488
              E     KKQ+K +K K +K+E EK+A +K  EA     K  + E ++ D
Sbjct: 407 SPAERKKLRKKQRKAEK-KAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVD 457



 Score = 28.7 bits (65), Expect = 8.6
 Identities = 16/50 (32%), Positives = 31/50 (62%), Gaps = 2/50 (4%)

Query: 439 HQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
             +E+E ++G+    + KK +KK+++ EK AE+  +EA K   +++ EA 
Sbjct: 395 EGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK--EEAEKAAAKKKAEAA 442


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 32.3 bits (74), Expect = 0.78
 Identities = 15/50 (30%), Positives = 26/50 (52%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRS 499
           +K++K    ++K+  K K  EEK K+ L+K E+ +       +  E K S
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYS 605



 Score = 28.8 bits (65), Expect = 7.8
 Identities = 17/84 (20%), Positives = 41/84 (48%), Gaps = 12/84 (14%)

Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYN 501
           ED+ D     +  +K+  +++KE+++ A ++ K   K +++E+++   L ++++ K    
Sbjct: 536 EDKPDGPSVWKLDDKEELQREKEEKE-ALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPA 594

Query: 502 SMISVKE-----------PTMDSD 514
                +E           PT D+D
Sbjct: 595 EFFKRQEDKYSAFDETGLPTHDAD 618


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 31.2 bits (71), Expect = 0.79
 Identities = 17/60 (28%), Positives = 33/60 (55%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSMISVKEP 509
           +K K  KK  KKKK+K+K  ++  K+   ++++E+   D+L  + +      S +S  +P
Sbjct: 82  QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTLSELKP 141



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 16/49 (32%), Positives = 25/49 (51%)

Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
           E EE    K +KK  K KK K + +K  ++  K   K E+E + + + L
Sbjct: 77  EYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDL 125



 Score = 29.3 bits (66), Expect = 3.4
 Identities = 13/43 (30%), Positives = 23/43 (53%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
              ++    K +K +KK  K +K+ EK AE+K+++  K   E 
Sbjct: 90  KSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSET 132


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 32.3 bits (74), Expect = 0.80
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 454 KNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
           +N   KKK+ +     EEK K+  KKE+ ++ E +R+
Sbjct: 644 RNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680



 Score = 29.2 bits (66), Expect = 6.1
 Identities = 13/54 (24%), Positives = 27/54 (50%), Gaps = 3/54 (5%)

Query: 450 KKQKKNKKSKKKKKEK--EKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYN 501
           KK K+  +++++++E   E     + K   K+E+E   E D  L  +E++    
Sbjct: 224 KKLKETSETEEREEETDVEIETTSETKGT-KQEQEGSTEEDPSLFSEEKEDPDK 276


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 32.2 bits (73), Expect = 0.86
 Identities = 9/39 (23%), Positives = 18/39 (46%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
           K++ K K    K+ + E  + E   E    ++ E++  D
Sbjct: 798 KQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGID 836



 Score = 31.1 bits (70), Expect = 1.8
 Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 3/45 (6%)

Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
           K   ++K+K KEK  + +E   E      E + E    ++  E K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIE--SPSVETEGERCT-IKQREEK 833


>gnl|CDD|136027 PRK06743, PRK06743, flagellar motor protein MotP; Reviewed.
          Length = 254

 Score = 31.5 bits (71), Expect = 0.90
 Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 3/75 (4%)

Query: 287 EDVDYAGENFVRFSGDTRKHATAQLFAWEAYEKGVDVHVLAEPTKLELLKKEYEEKKEEF 346
           ED++   + FV FS  ++KH    L + E   + VD   + +  +L L   + +E KE  
Sbjct: 69  EDLEQLTDLFVDFSKKSKKHG---LLSLEVDGEQVDNPFIQKGIRLMLSGYDEDELKEVL 125

Query: 347 KKEVSSSVIDKYGGA 361
            K+V + V +   GA
Sbjct: 126 MKDVETEVYELRKGA 140


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 13/44 (29%), Positives = 27/44 (61%)

Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
           ++ ++E     K+ + K++ K KKEK++  E+K+ E L K++  
Sbjct: 93  EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKST 136



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 10/39 (25%), Positives = 20/39 (51%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
             +E +   K+  K  + K+  K K++  E+K K+  +K
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEK 129



 Score = 28.6 bits (64), Expect = 5.5
 Identities = 13/30 (43%), Positives = 20/30 (66%)

Query: 458 SKKKKKEKEKSAEEKMKEALKKEEEEQREA 487
           S+K++KE  K  EEK     KKE++E++E 
Sbjct: 95  SEKEQKEVSKETEEKEAIKAKKEKKEKKEK 124



 Score = 28.2 bits (63), Expect = 7.4
 Identities = 11/47 (23%), Positives = 26/47 (55%)

Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
           PE  ++++++   + ++K     KK+K+++K  +   K A KK  + 
Sbjct: 92  PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKT 138


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 31.5 bits (72), Expect = 1.2
 Identities = 14/45 (31%), Positives = 28/45 (62%), Gaps = 3/45 (6%)

Query: 454 KNKKSKKKKKEKEKSAEEKMKEALKKEEEE--QREADRLLRMDER 496
           K +  +++++ +E+ AEEK +EAL+K + E  ++ A R  R+   
Sbjct: 87  KAETLEERREIREE-AEEKWEEALEKGDLEEARKYAQRSSRLTSE 130


>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein. 
          Length = 198

 Score = 30.9 bits (70), Expect = 1.2
 Identities = 14/38 (36%), Positives = 20/38 (52%)

Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
            +K     K KKK+KEK  EE+ K     ++ E+ E D
Sbjct: 59  PRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENEND 96



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 17/53 (32%), Positives = 26/53 (49%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNS 502
           +K    KKSKKK KEK    E+K +    K EE + + D      + + S ++
Sbjct: 60  RKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASA 112


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 31.6 bits (72), Expect = 1.2
 Identities = 17/64 (26%), Positives = 29/64 (45%), Gaps = 4/64 (6%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALK---KEEEEQRE-ADRLLRMDERKRSYNSMIS 505
            +     +  +K + + KS EE +K A K   + EE  R+  +RL  +DE       ++ 
Sbjct: 78  GRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLE 137

Query: 506 VKEP 509
             EP
Sbjct: 138 ELEP 141


>gnl|CDD|187750 cd09233, ACE1-Sec16-like, Ancestral coatomer element 1 (ACE1) of
           COPII coat complex assembly protein Sec16.  COPII coat
           complex plays an important role in vesicular traffic of
           newly synthezised proteins from the endoplasmatic
           reticulum (ER) to the Golgi apparatus by mediating the
           formation of transport vesicles. COPII consists of an
           outer coat, made up of the scaffold proteins Sec31 and
           Sec13, and the cargo adaptor complex, Sec23 and Sec24,
           which are recruited by the small GTPase Sar1. Sec16 is
           involved in the early steps of the assembly process.
           Sec16 forms elongated heterotetramers with Sec13,
           Sec13-(Sec16)2-Sec13. It interacts with Sec13 by
           insertion of a single beta-blade to close the
           six-bladded beta propeller of Sec13. In the same way
           Sec13 interacts with Sec31 and Nup145C, a nuclear pore
           protein, all of these contain a structurally related
           ancestral coatomer element 1 (ACE1). Sec16 is believed
           to be a key component in maintaining the integrity of
           the ER exit site.
          Length = 314

 Score = 31.1 bits (71), Expect = 1.3
 Identities = 14/35 (40%), Positives = 17/35 (48%)

Query: 457 KSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
           K K KKK+  K  EEK+ E  + E     E   LL
Sbjct: 7   KGKTKKKDVLKWLEEKIAELEENEGYLDLEDKLLL 41


>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain.  This family consists
           of several ATP synthase E chain sequences which are
           components of the CF(0) subunit.
          Length = 83

 Score = 29.0 bits (65), Expect = 1.3
 Identities = 9/47 (19%), Positives = 24/47 (51%)

Query: 432 GIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALK 478
           G  ++   + ++E+   ++ Q+K  +  K    K+K A+++ K   +
Sbjct: 25  GAKHQRSLKKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKSLAE 71


>gnl|CDD|222730 pfam14392, zf-CCHC_4, Zinc knuckle.  The zinc knuckle is a zinc
           binding motif composed of the the following CX2CX4HX4C
           where X can be any amino acid. This particular family is
           found in plant proteins.
          Length = 49

 Score = 28.0 bits (63), Expect = 1.3
 Identities = 6/14 (42%), Positives = 9/14 (64%)

Query: 118 CENCGAMTHKKKDC 131
           C +CG + H  K+C
Sbjct: 34  CFHCGRLGHSDKEC 47


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 31.4 bits (71), Expect = 1.3
 Identities = 22/74 (29%), Positives = 40/74 (54%)

Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
           + + +++ +E+   +K K+  K K+  KEKEK  E+K++E   +EEE++RE  R     +
Sbjct: 117 KKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPK 176

Query: 496 RKRSYNSMISVKEP 509
           +          KEP
Sbjct: 177 KPPKKKPPNKKKEP 190



 Score = 29.1 bits (65), Expect = 6.2
 Identities = 17/73 (23%), Positives = 30/73 (41%), Gaps = 1/73 (1%)

Query: 418 ATSSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK-EA 476
           A   V       P       +    +      K++++ K+ KKKKKEK K   +  K + 
Sbjct: 76  AVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKE 135

Query: 477 LKKEEEEQREADR 489
             KE+   +E ++
Sbjct: 136 EAKEKRPPKEKEK 148


>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
           domain.  This domain is found in a number of different
           types of plant proteins including NAM-like proteins.
          Length = 147

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 22/85 (25%), Positives = 40/85 (47%), Gaps = 1/85 (1%)

Query: 414 EAGLATSSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKM 473
             G  ++               RPE + + +E     K K  K+  +K+KEKE+   + +
Sbjct: 39  SPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKKEEAEKEKEKEERFMKAL 98

Query: 474 KEALKKEEE-EQREADRLLRMDERK 497
            EA K+  E E+++A+  L  +E+K
Sbjct: 99  AEAEKERAELEKKKAEAKLMKEEKK 123



 Score = 29.3 bits (66), Expect = 2.3
 Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 429 PPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
             PG  +  E++DED+E +   K+ + +K  K+K  ++K  + K +EA K++E+E+R   
Sbjct: 38  SSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKL-KAKKEEAEKEKEKEERFMK 96

Query: 489 RLLRMDERK 497
            L   ++ +
Sbjct: 97  ALAEAEKER 105


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 31.2 bits (72), Expect = 1.4
 Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 8/50 (16%)

Query: 451 KQKKNKKSKK--KKKEKEKSAEEKMKEA--LKKE----EEEQREADRLLR 492
           + ++N  SK+  + K K + AE  + E   LK+E    E E  E +  L 
Sbjct: 48  QAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELE 97


>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1.  This contains a
           group of peptides derived from a salivary gland cDNA
           library of the tick Ixodes scapularis. Also present are
           peptides from a related tick species, Ixodes ricinus.
           They are characterized by a putative signal peptide
           indicative of secretion and conserved cysteine residues.
          Length = 120

 Score = 29.8 bits (67), Expect = 1.4
 Identities = 15/35 (42%), Positives = 21/35 (60%)

Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEE 471
           P H D+   +   K +KK KKSKK KK K+ S ++
Sbjct: 86  PSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKKD 120



 Score = 29.5 bits (66), Expect = 1.8
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 431 PGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEK 467
            G  + P+    +  +   KK+KK+KK+KK KK  +K
Sbjct: 83  SGEPSHPDDHPPEPTEKPKKKKKKSKKTKKPKKSSKK 119


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 30.9 bits (70), Expect = 1.5
 Identities = 18/45 (40%), Positives = 30/45 (66%)

Query: 452 QKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDER 496
           QK+NK   +K KEK    +E+++EAL+ E+EE+ +   LL+ +E 
Sbjct: 126 QKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEE 170


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 30.5 bits (69), Expect = 1.5
 Identities = 13/43 (30%), Positives = 26/43 (60%)

Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE 482
             ED+ED G  +++  +  ++ ++E E+S++E  KE  +K E 
Sbjct: 74  NSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTES 116


>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
          Length = 432

 Score = 30.9 bits (70), Expect = 1.6
 Identities = 30/112 (26%), Positives = 51/112 (45%), Gaps = 18/112 (16%)

Query: 413 REAGLATSSVVGASL-------GPPPGIMNRPEHQDEDE----EDSGHKKQKKNKKSKKK 461
           R+ G  TSS   AS        G   GI+  P+ Q   +    E    +K       +  
Sbjct: 54  RDDGPETSSGTPASTFADQLKRGVTDGIIGEPQQQQTFQPTPVETQQEEKAVNPFTPQPG 113

Query: 462 KKEKEKS---AEEKMKEALKKEEEEQ--READR--LLRMDERKRSYNSMISV 506
           ++E+ +    +EE+ +  LK+E+EEQ  RE  R  + R+     +Y+S ++V
Sbjct: 114 QREERRPTLESEEEWRARLKREQEEQYLRERQRQRMARLQANAAAYDSPLAV 165


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 31.1 bits (70), Expect = 1.7
 Identities = 14/68 (20%), Positives = 31/68 (45%), Gaps = 5/68 (7%)

Query: 435 NRPEHQDEDEEDSGHKKQKKNKKSKKK-----KKEKEKSAEEKMKEALKKEEEEQREADR 489
           +  +   + + D   +KQ++ K   K       KE ++ AE + +E  K + E ++  + 
Sbjct: 237 DFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEE 296

Query: 490 LLRMDERK 497
            L+  + K
Sbjct: 297 ALKAKDHK 304


>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
           factor.  Replicative DNA polymerases are capable of
           polymerising tens of thousands of nucleotides without
           dissociating from their DNA templates. The high
           processivity of these polymerases is dependent upon
           accessory proteins that bind to the catalytic subunit of
           the polymerase or to the substrate. The Epstein-Barr
           virus (EBV) BMRF1 protein is an essential component of
           the viral DNA polymerase and is absolutely required for
           lytic virus replication. BMRF1 is also a transactivator.
           This family is predicted to have a UL42 like structure.
          Length = 381

 Score = 30.8 bits (70), Expect = 1.7
 Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 4/63 (6%)

Query: 420 SSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSK----KKKKEKEKSAEEKMKE 475
                +S  PP    + P  +        H      ++      K+K+E +K  ++K  +
Sbjct: 312 HLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSK 371

Query: 476 ALK 478
            LK
Sbjct: 372 KLK 374


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 31.0 bits (71), Expect = 1.8
 Identities = 19/84 (22%), Positives = 40/84 (47%), Gaps = 7/84 (8%)

Query: 427 LGPPPGIMNRP-EHQDEDEED-----SGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKE 480
           LG P  I+    +   ED+E      +  ++ ++  + K ++ E      EK+KE L+++
Sbjct: 497 LGLPENIIEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEK 556

Query: 481 EEEQREADRLLRMDERKRSYNSMI 504
           +E+ +E +  L   E ++     I
Sbjct: 557 KEKLQEEEDKLLE-EAEKEAQQAI 579



 Score = 30.6 bits (70), Expect = 2.6
 Identities = 10/50 (20%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 449 HKKQKKNKKSKKKKKEKEKS-AEEKMKEALKKEEEEQREADRLLRMDERK 497
            ++ ++ K+  +++++K    AE++ ++A+K+ ++E  E  + LR  ++ 
Sbjct: 550 KEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKG 599



 Score = 29.8 bits (68), Expect = 4.4
 Identities = 18/58 (31%), Positives = 36/58 (62%), Gaps = 5/58 (8%)

Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKE-----EEEQREADRLLR 492
           Q  +E ++  K+ +K K+  ++KKEK +  E+K+ E  +KE     +E ++EAD +++
Sbjct: 534 QKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIK 591


>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
          Length = 292

 Score = 30.6 bits (70), Expect = 1.8
 Identities = 12/44 (27%), Positives = 29/44 (65%), Gaps = 3/44 (6%)

Query: 454 KNKKSKKKKKEKEKSAEEKMKEALKKEEEE--QREADRLLRMDE 495
           K ++ +++++ +E+ AEEK +EA ++ + E  ++ A R  R+ +
Sbjct: 40  KAEEIEERREVREE-AEEKYEEAKEEGDLEEARKYAQRSSRLTD 82


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 30.2 bits (68), Expect = 1.9
 Identities = 15/47 (31%), Positives = 23/47 (48%)

Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
            E   + E   GH+K+ K KK +  K+ K+K  E+K K+     E  
Sbjct: 128 EETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHSPEHP 174


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 29.7 bits (67), Expect = 2.0
 Identities = 9/49 (18%), Positives = 22/49 (44%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADR 489
           DE+EE       +  ++   + K + K  +++     K E+E+ +   +
Sbjct: 96  DEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144



 Score = 29.3 bits (66), Expect = 2.9
 Identities = 13/46 (28%), Positives = 25/46 (54%)

Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
           E ++ +EE +   +Q+   ++K + KEK+K    K K   +K + E
Sbjct: 97  EEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142



 Score = 28.2 bits (63), Expect = 6.8
 Identities = 14/46 (30%), Positives = 22/46 (47%), Gaps = 1/46 (2%)

Query: 435 NRPEHQDEDEEDSGHKKQKKNKKSKKK-KKEKEKSAEEKMKEALKK 479
           +  E + ED  ++  + ++K K+   K K EKEK   E  K    K
Sbjct: 105 STDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSK 150


>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
           complex component).  RNases H are enzymes that
           specifically hydrolyse RNA when annealed to a
           complementary DNA and are present in all living
           organisms. In yeast RNase H2 is composed of a complex of
           three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
           family represents the homologues of Ydr279p. It is not
           known whether non yeast proteins in this family fulfil
           the same function.
          Length = 287

 Score = 30.4 bits (69), Expect = 2.0
 Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 3/41 (7%)

Query: 434 MNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMK 474
            +R E + + +E+    K+KK K+SK  K  K+  A+   K
Sbjct: 245 ESRAEKKRKSKEE---IKKKKPKESKGVKALKKVVAKGMKK 282


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 30.8 bits (70), Expect = 2.0
 Identities = 11/54 (20%), Positives = 24/54 (44%)

Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
            E  + + E      + K K   ++ KEK +   E+  +  K+ +++  +  RL
Sbjct: 259 WEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARL 312



 Score = 30.4 bits (69), Expect = 2.3
 Identities = 11/51 (21%), Positives = 26/51 (50%)

Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
             +  + E     K  + K+  K ++ KEK  +E  +EA ++++ +++ A 
Sbjct: 258 AWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ 308


>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
           Williams-Beuren syndrome.  This domain family is found
           in eukaryotes, and is typically between 72 and 83 amino
           acids in length. The family is found in association with
           pfam08241. This family is made up of
           S-adenosylmethionine-dependent methyltransferases. The
           proteins are deleted in Williams-Beuren syndrome (WBS),
           a complex developmental disorder with multisystemic
           manifestations including supravalvular aortic stenosis
           (SVAS) and a specific cognitive phenotype.
          Length = 85

 Score = 28.4 bits (64), Expect = 2.0
 Identities = 18/61 (29%), Positives = 25/61 (40%), Gaps = 3/61 (4%)

Query: 420 SSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
              V   L    G     + +  D        Q+  +K KKKKK K+KS E  ++   KK
Sbjct: 5   GPSVPEQLPNGLGEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILR---KK 61

Query: 480 E 480
           E
Sbjct: 62  E 62


>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
           C-terminal domain.  This model describes the C-terminal
           domain, or longer subunit, of the Firmicutes type VII
           secretion protein EssC. This protein (homologous to EccC
           in Actinobacteria) and the WXG100 target proteins are
           the only homologous parts of type VII secretion between
           Firmicutes and Actinobacteria [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 1296

 Score = 31.1 bits (71), Expect = 2.0
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 6/39 (15%)

Query: 454 KNKKSKKKKKEKEKSA-----EEKMKEALKKEEEEQREA 487
           + KK  KK  EK   +     ++K KE L+   E+QR  
Sbjct: 84  REKKKYKKDVEKRNRSYRLYLDKKRKE-LQALSEKQRHV 121


>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
          Length = 135

 Score = 29.3 bits (66), Expect = 2.2
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
           K++ K KK KKKKK+K+K    K     KK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135



 Score = 27.7 bits (62), Expect = 7.3
 Identities = 14/26 (53%), Positives = 17/26 (65%)

Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMK 474
             KQKK KK KKKKK+K+ S +   K
Sbjct: 108 KAKQKKQKKKKKKKKKKKTSKKAAKK 133


>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase.  Cellulose, an
           aggregate of unbranched polymers of beta-1,4-linked
           glucose residues, is the major component of wood and
           thus paper, and is synthesised by plants, most algae,
           some bacteria and fungi, and even some animals. The
           genes that synthesise cellulose in higher plants differ
           greatly from the well-characterized genes found in
           Acetobacter and Agrobacterium sp. More correctly
           designated as 'cellulose synthase catalytic subunits',
           plant cellulose synthase (CesA) proteins are integral
           membrane proteins, approximately 1,000 amino acids in
           length. There are a number of highly conserved residues,
           including several motifs shown to be necessary for
           processive glycosyltransferase activity.
          Length = 716

 Score = 30.9 bits (70), Expect = 2.2
 Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 1/47 (2%)

Query: 452 QKKNKKSKKKKKEKEKSAEEKMKEALKKEE-EEQREADRLLRMDERK 497
           ++K  KS  K K+K+   +E        E+ +E  E ++   M +  
Sbjct: 312 RRKKPKSASKAKKKDSKRKESEAPIFNLEDIDEGDEDEKSSLMSQLS 358


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 30.8 bits (70), Expect = 2.2
 Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 442 EDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRM 493
           +++ +    KQ  ++  KK +KE +K  E K+++  + E EE  +A   LR 
Sbjct: 272 DEKFERDKIKQLASELEKKLEKELKKL-ENKLEK-QEDELEELEKAAEELRQ 321


>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1.  MAT1
           is an assembly/targeting factor for cyclin-dependent
           kinase-activating kinase (CAK), which interacts with the
           transcription factor TFIIH. The domain found to the
           N-terminal side of this domain is a C3HC4 RING finger.
          Length = 200

 Score = 30.1 bits (68), Expect = 2.3
 Identities = 13/46 (28%), Positives = 30/46 (65%)

Query: 452 QKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
           +K+NK S  + K +    +E++++AL++E+E + E    L+ +E++
Sbjct: 74  EKENKDSIMRNKRRLTREQEELEQALEEEKEMKEEKRLHLQKEEQE 119


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.4 bits (69), Expect = 2.3
 Identities = 11/51 (21%), Positives = 27/51 (52%)

Query: 440 QDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
           +     +     ++K K   ++  EK  +  ++ K+  KKEE+++++ +RL
Sbjct: 298 KMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERL 348



 Score = 28.9 bits (65), Expect = 8.3
 Identities = 14/51 (27%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 433 IMNRPEHQDEDEE-DSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE 482
           I+      D   +  S  ++  +   ++ K+K+KEK  EEK K+ +++ EE
Sbjct: 300 ILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEE 350


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 28.7 bits (65), Expect = 2.8
 Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 4/36 (11%)

Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
           K+ KK KKKK E E  AE+K K+ +     EQR+AD
Sbjct: 38  KREKKKKKKKSEGEFFAEKKEKKEVS----EQRKAD 69


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
           (DUF1754).  This is a eukaryotic protein family of
           unknown function.
          Length = 90

 Score = 28.2 bits (63), Expect = 2.8
 Identities = 17/46 (36%), Positives = 24/46 (52%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
            K+   KK KKKKK+K KS EE + E  ++E+       +    DE
Sbjct: 14  GKKIDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDE 59



 Score = 27.4 bits (61), Expect = 4.9
 Identities = 14/32 (43%), Positives = 20/32 (62%)

Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE 482
           K KK    KKKKK+K+K+  ++     K+EEE
Sbjct: 13  KGKKIDVKKKKKKKKKKNKSKEEVVTEKEEEE 44


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 29.6 bits (67), Expect = 2.9
 Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
           K++K  +K +K + + E+   +  KE LK   +  +EA   
Sbjct: 35  KREKAQEKKRKAEAQAERRELKARKEKLKTRSDWLKEAQAA 75


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 29.7 bits (68), Expect = 2.9
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
           + +K  KS  K+ ++E    EK+KE L++
Sbjct: 142 RLEKKAKSGDKEAKEELELLEKIKEHLEE 170


>gnl|CDD|107228 cd06471, ACD_LpsHSP_like, Group of bacterial proteins containing an
           alpha crystallin domain (ACD) similar to Lactobacillus
           plantarum (Lp) small heat shock proteins (sHsp) HSP
           18.5, HSP 18.55 and HSP 19.3. sHsps are molecular
           chaperones that suppress protein aggregation and protect
           against cell stress, and are generally active as large
           oligomers consisting of multiple subunits. Transcription
           of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is
           regulated by a variety of stresses including heat, cold
           and ethanol. Early growing L. plantarum cells contain
           elevated levels of these mRNAs which rapidly fall of as
           the cells enter stationary phase. Also belonging to this
           group is Bifidobacterium breve (Bb) HSP20 and Oenococcus
           oenis (syn. Leuconostoc oenos) (Oo) HSP18.
           Transcription of the gene encoding BbHSP20 is strongly
           induced following heat or osmotic shock, and that of the
           gene encoding OoHSP18 following heat, ethanol or acid
           shock. OoHSP18 is peripherally associated with the
           cytoplasmic membrane.
          Length = 93

 Score = 28.2 bits (64), Expect = 2.9
 Identities = 12/30 (40%), Positives = 20/30 (66%), Gaps = 3/30 (10%)

Query: 216 KDDLEEDEDKYVDEVDMPGTKVDSKQRITV 245
           K D++E +D+Y+ E D+PG K   K+ I +
Sbjct: 2   KTDIKETDDEYIVEADLPGFK---KEDIKL 28


>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
           [DNA replication, recombination, and repair].
          Length = 663

 Score = 30.3 bits (69), Expect = 3.1
 Identities = 17/52 (32%), Positives = 31/52 (59%)

Query: 433 IMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQ 484
           I +  + + E++E     ++K +K SKK+ ++  K  E++MKEA K  E E+
Sbjct: 593 IRDILDGEYEEDEYKAKIEKKASKMSKKELEKLIKKLEKEMKEAAKNLEFEE 644


>gnl|CDD|115108 pfam06431, Polyoma_lg_T_C, Polyomavirus large T antigen C-terminus.
           
          Length = 417

 Score = 29.9 bits (67), Expect = 3.2
 Identities = 14/57 (24%), Positives = 28/57 (49%), Gaps = 1/57 (1%)

Query: 413 REAGLATSSVVGASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSA 469
           +E  + T S +  ++    GI++    +D ++EDSGH       +S+ K +   +S 
Sbjct: 341 KEISMYTFSKMKFNVAMGIGILDWLREEDSEDEDSGHGSS-TESQSQCKSQVSHESG 396


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 30.1 bits (68), Expect = 3.3
 Identities = 16/71 (22%), Positives = 27/71 (38%), Gaps = 14/71 (19%)

Query: 438 EHQDEDEEDSGHKKQ--------------KKNKKSKKKKKEKEKSAEEKMKEALKKEEEE 483
           E    D ++S   ++              K    S+  K  +EKSA+E+M  A   EE  
Sbjct: 871 EGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELM 930

Query: 484 QREADRLLRMD 494
           +R  +   +  
Sbjct: 931 KRAKEYQDKHK 941


>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
          Length = 330

 Score = 29.8 bits (68), Expect = 3.5
 Identities = 12/27 (44%), Positives = 19/27 (70%), Gaps = 2/27 (7%)

Query: 450 KKQKK--NKKSKKKKKEKEKSAEEKMK 474
           KK+ K  +K  KKK+K++ K+A+ K K
Sbjct: 63  KKELKAWHKAQKKKEKQEAKAAKAKSK 89


>gnl|CDD|220176 pfam09317, DUF1974, Domain of unknown function (DUF1974).  Members
           of this family of functionally uncharacterized domains
           are predominantly found in various prokaryotic
           acyl-coenzyme a dehydrogenases.
          Length = 284

 Score = 29.8 bits (68), Expect = 3.6
 Identities = 14/43 (32%), Positives = 22/43 (51%)

Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDE 495
           KK KK+ KK K  +   EE ++ AL+     + EA+ L   + 
Sbjct: 231 KKLKKALKKGKLPKLRLEELLEAALEAGVITEEEAELLREAEA 273


>gnl|CDD|240292 PTZ00148, PTZ00148, 40S ribosomal protein S8; Provisional.
          Length = 205

 Score = 29.2 bits (66), Expect = 3.6
 Identities = 18/51 (35%), Positives = 27/51 (52%), Gaps = 5/51 (9%)

Query: 446 DSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALK-----KEEEEQREADRLL 491
           D G KK+ K ++  KK K+K +S   K++   K     K   EQ ++ RLL
Sbjct: 120 DLGKKKKYKKEEENKKAKKKSESLVAKLEVRKKNHKIDKALLEQFQSGRLL 170


>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3.  This family
           of proteins are membrane localised chaperones that are
           required for correct plasma membrane localisation of
           amino acid permeases (AAPs). Shr3 prevents AAPs proteins
           from aggregating and assists in their correct folding.
           In the absence of Shr3, AAPs are retained in the ER.
          Length = 196

 Score = 29.3 bits (66), Expect = 3.8
 Identities = 8/39 (20%), Positives = 18/39 (46%), Gaps = 1/39 (2%)

Query: 438 EHQDEDEEDSGHKKQKK-NKKSKKKKKEKEKSAEEKMKE 475
           E +D  +++    +++K    +  KK    +  E K K+
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 30.0 bits (68), Expect = 3.9
 Identities = 13/60 (21%), Positives = 26/60 (43%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSY 500
            + EE     + +  KK ++ +K K K  ++  K+  KK +    EA + L   +  +  
Sbjct: 105 KKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGLNYKKNFKEK 164


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.8 bits (67), Expect = 3.9
 Identities = 10/35 (28%), Positives = 19/35 (54%)

Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEK 472
           E + ++E+    + +KK K+   + KEK +  E K
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433


>gnl|CDD|153286 cd07602, BAR_RhoGAP_OPHN1-like, The Bin/Amphiphysin/Rvs (BAR)
           domain of Oligophrenin1-like Rho GTPase Activating
           Proteins.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions. This subfamily is
           composed of Rho and Rac GTPase activating proteins
           (GAPs) with similarity to oligophrenin1 (OPHN1). Members
           contain an N-terminal BAR domain, followed by a
           Pleckstrin homology (PH) domain, and a Rho GAP domain.
           Some members contain a C-terminal SH3 domain.
           Vertebrates harbor at least three Rho GAPs in this
           subfamily including OPHN1, GTPase Regulator Associated
           with Focal adhesion kinase (GRAF), GRAF2, and an
           uncharacterized protein called GAP10-like. OPHN1, GRAF
           and GRAF2 show GAP activity towards RhoA and Cdc42. In
           addition, OPHN1 is active towards Rac. BAR domains form
           dimers that bind to membranes, induce membrane bending
           and curvature, and may also be involved in
           protein-protein interactions. The BAR domains of OPHN1
           and GRAF directly interact with their Rho GAP domains
           and inhibit their activity. The autoinhibited proteins
           are able to bind membranes and tubulate liposomes,
           showing that the membrane-tubulation and GAP-inhibitory
           functions of the BAR domains can occur simultaneously.
          Length = 207

 Score = 29.2 bits (66), Expect = 4.0
 Identities = 19/52 (36%), Positives = 29/52 (55%), Gaps = 5/52 (9%)

Query: 450 KKQKKNKKSKKKKKEKE-----KSAEEKMKEALKKEEEEQREADRLLRMDER 496
           K+Q    K +KKK +KE      S E+ +  + KK+E + +EAD  L M+ R
Sbjct: 103 KEQIGGAKEEKKKFDKETEKFCSSLEKHLNLSTKKKENQLQEADAQLDMERR 154


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 30.0 bits (67), Expect = 4.0
 Identities = 13/44 (29%), Positives = 25/44 (56%), Gaps = 2/44 (4%)

Query: 187  EEAKRELRADKLDAGVSIDNRGSKEGDEDKDDLEEDEDKYVDEV 230
            E   REL +D    GV+ D+  S+  + D ++  +D D+ V+++
Sbjct: 3854 ESEARELESDM--NGVTKDSVVSENENSDSEEENQDLDEEVNDI 3895


>gnl|CDD|214787 smart00721, BAR, BAR domain. 
          Length = 239

 Score = 29.3 bits (66), Expect = 4.1
 Identities = 16/43 (37%), Positives = 25/43 (58%)

Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
           HK +K  K  +KKK EK   AEE++++A ++ EE   +    L
Sbjct: 155 HKLKKAKKSKEKKKDEKLAKAEEELRKAKQEFEESNAQLVEEL 197


>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator.  This
           family of trans-activating transcriptional regulator
           (TATR), also known as intermediate early protein 1, are
           common to the Nucleopolyhedroviruses.
          Length = 571

 Score = 29.5 bits (66), Expect = 4.6
 Identities = 18/76 (23%), Positives = 27/76 (35%), Gaps = 5/76 (6%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSY 500
           D  +   G K   K KK +K KK   +S       +L +EE    E   +   D+  + +
Sbjct: 137 DSSDSSKGKKLVNKPKKRQKYKKATIQS-----PTSLTEEENYNTEICTVAPTDQIAKYF 191

Query: 501 NSMISVKEPTMDSDRG 516
               S       SD  
Sbjct: 192 LMDFSPYLEDAKSDCQ 207


>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
           (DUF2098).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function.
          Length = 91

 Score = 27.7 bits (62), Expect = 4.6
 Identities = 15/32 (46%), Positives = 22/32 (68%), Gaps = 1/32 (3%)

Query: 457 KSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
           K K+KK+E+E+  EE + E +KKEEE   + D
Sbjct: 51  KVKEKKEEREEDKEE-LIERIKKEEETFEDVD 81


>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 881

 Score = 29.9 bits (67), Expect = 4.7
 Identities = 19/68 (27%), Positives = 35/68 (51%), Gaps = 8/68 (11%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMK--------EALKKEEEEQREADRLLRMDERKRSYN 501
           +KQ  NKK KKK++E++ S E+ M         E ++  + + +     +  +  K S +
Sbjct: 21  EKQTPNKKDKKKEEEEQLSEEDAMLKGDLELLVERIQDPDIDLQNNSLNMLKEVIKSSTS 80

Query: 502 SMISVKEP 509
           SM +V +P
Sbjct: 81  SMTAVPKP 88


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 29.8 bits (67), Expect = 4.9
 Identities = 13/49 (26%), Positives = 31/49 (63%)

Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQRE 486
           E + E E+ +  + +++ +  +++++E+EK+A E  +   K E E++RE
Sbjct: 253 EKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRRE 301


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 11/48 (22%), Positives = 23/48 (47%), Gaps = 2/48 (4%)

Query: 432 GIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKK 479
           G+ +    Q  ++E        + +++KKK++ + K  +E   E LK 
Sbjct: 64  GVKDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKE--MEELKA 109



 Score = 28.2 bits (63), Expect = 9.4
 Identities = 12/49 (24%), Positives = 26/49 (53%)

Query: 437 PEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQR 485
           PE + E    S   +Q K K+  + KK+KE    + +++ + +  +E++
Sbjct: 74  PEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQ 122


>gnl|CDD|173500 PTZ00264, PTZ00264, circumsporozoite-related antigen; Provisional.
          Length = 148

 Score = 28.4 bits (63), Expect = 5.1
 Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 10/60 (16%)

Query: 447 SGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSMISV 506
           SG  K+KK K    K  E      E + E ++KEEE       L+ + ++K  Y    +V
Sbjct: 29  SGVPKKKKPK---SKSPEPLIDVHELISEMVRKEEE-------LVNVTKKKSKYKLATTV 78


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 29.5 bits (66), Expect = 5.1
 Identities = 14/60 (23%), Positives = 27/60 (45%)

Query: 429 PPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
           P    + + E  +E EE+   ++   +KK KK KK K K       ++   ++ +  + D
Sbjct: 309 PAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDSDDSDID 368


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 28.0 bits (63), Expect = 5.1
 Identities = 18/55 (32%), Positives = 30/55 (54%)

Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKRSYNSMIS 505
           ++K+ +K  + +K  + SAEEK +  L+K E+E  E +  L   E K     M+S
Sbjct: 25  EKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKMLS 79


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 29.4 bits (66), Expect = 5.2
 Identities = 15/48 (31%), Positives = 32/48 (66%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
           +K  K  + ++KKKE++++ E + K+A ++E  +Q E +RL   +++K
Sbjct: 71  QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKK 118


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 29.0 bits (65), Expect = 5.4
 Identities = 14/36 (38%), Positives = 23/36 (63%)

Query: 452 QKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREA 487
           + K K++ + K + E  AE+K KE  KK+ EE+ +A
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKA 158


>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992).  This
           bacterial family of proteins has no known function.
           However, the cis-regulatory yjdF motif, just upstream
           from the gene encoding the proteins for this family, is
           a small non-coding RNA, Rfam:RF01764. The yjdF motif is
           found in many Firmicutes, including Bacillus subtilis.
           In most cases, it resides in potential 5' UTRs of
           homologues of the yjdF gene whose function is unknown.
           However, in Streptococcus thermophilus, a yjdF RNA motif
           is associated with an operon whose protein products
           synthesise nicotinamide adenine dinucleotide (NAD+).
           Also, the S. thermophilus yjdF RNA lacks typical yjdF
           motif consensus features downstream of and including the
           P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
           S. thermophilus RNAs might sense a distinct compound
           that structurally resembles the ligand bound by other
           yjdF RNAs. On the ohter hand, perhaps these RNAs have an
           alternative solution forming a similar binding site, as
           is observed with some SAM riboswitches.
          Length = 132

 Score = 28.0 bits (63), Expect = 5.6
 Identities = 18/54 (33%), Positives = 31/54 (57%), Gaps = 5/54 (9%)

Query: 449 HKKQKKNKKSKKK----KKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
            K+ KK   S K     K E E++ +EK K+  K+++EE++E  R L+  ++K 
Sbjct: 75  AKEVKKPGISTKAQQALKLEHERNKQEK-KKRSKEKKEEEKERKRQLKQQKKKA 127


>gnl|CDD|236928 PRK11552, PRK11552, putative DNA-binding transcriptional
          regulator; Provisional.
          Length = 225

 Score = 28.9 bits (65), Expect = 5.7
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 56 RDINPHIPQYISSAPWYFGSK 76
          RDI     Q I++  +YFGSK
Sbjct: 36 RDIAAQAGQNIAAITYYFGSK 56


>gnl|CDD|219526 pfam07701, HNOBA, Heme NO binding associated.  The HNOBA domain is
           found associated with the HNOB domain and pfam00211 in
           soluble cyclases and signalling proteins. The HNOB
           domain is predicted to function as a heme-dependent
           sensor for gaseous ligands, and transduce diverse
           downstream signals, in both bacteria and animals.
          Length = 209

 Score = 28.7 bits (65), Expect = 5.9
 Identities = 14/32 (43%), Positives = 23/32 (71%), Gaps = 3/32 (9%)

Query: 460 KKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
           KK+ EK K   +K++EA ++ EEE+++ D LL
Sbjct: 169 KKRLEKLK---DKLEEAHRELEEEKKKTDDLL 197


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 29.2 bits (66), Expect = 6.0
 Identities = 15/66 (22%), Positives = 24/66 (36%), Gaps = 21/66 (31%)

Query: 447 SGHKKQKKNKKSKKKKK-------------------EKEKSAEEKMKEALKKEEEEQREA 487
              K   K   + ++K                    ++ K+ EE  KE    E  ++ EA
Sbjct: 219 RTVKPPPKPPMTPEEKDDEYDQRVRELTFDRRAQPTDRTKTEEELAKEE--AERLKKLEA 276

Query: 488 DRLLRM 493
           +RL RM
Sbjct: 277 ERLRRM 282


>gnl|CDD|235364 PRK05201, hslU, ATP-dependent protease ATP-binding subunit HslU;
           Provisional.
          Length = 443

 Score = 29.3 bits (67), Expect = 6.1
 Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 10/44 (22%)

Query: 455 NKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
            KK KK+K         K+KEA K   EE  EA +L+ M+E K+
Sbjct: 208 PKKKKKRKL--------KVKEARKILIEE--EAAKLIDMEEIKQ 241


>gnl|CDD|221864 pfam12932, Sec16, Vesicle coat trafficking protein Sec16
           mid-region.  Sec16 is a multi-domain vesicle coat
           protein. This central region is the functional part of
           the molecules and thus is vital for the family's role in
           mediating the movement of protein-cargo between the
           organelles of the secretory pathway.
          Length = 104

 Score = 27.6 bits (62), Expect = 6.1
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query: 456 KKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLL 491
            K K KKK+  K  E K+ E     +  +R+ D+LL
Sbjct: 60  VKGKSKKKDVIKWLESKIAELENSNDSLKRKEDKLL 95


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 28.9 bits (65), Expect = 6.2
 Identities = 11/36 (30%), Positives = 22/36 (61%)

Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREAD 488
           K    S KK K+++K+ E K K   KK+++++++  
Sbjct: 48  KVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEP 83


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 28.0 bits (63), Expect = 6.2
 Identities = 11/49 (22%), Positives = 23/49 (46%)

Query: 449 HKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
            K+  + K+ KKK+K   K     ++   ++ +E   +  R  +  E+K
Sbjct: 72  EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKK 120


>gnl|CDD|237847 PRK14879, PRK14879, serine/threonine protein kinase; Provisional.
          Length = 211

 Score = 28.7 bits (65), Expect = 6.3
 Identities = 10/43 (23%), Positives = 18/43 (41%), Gaps = 5/43 (11%)

Query: 318 EKGVDVHVLAEPTKLELLKKEYEEKKEEFKKEVSSSVIDKYGG 360
           ++ VD+HVL     L  L+  + +  EE  +       +  G 
Sbjct: 149 DRAVDLHVL-----LRSLESTHPDWAEELFEAFLEGYREVMGE 186


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 29.3 bits (65), Expect = 6.4
 Identities = 13/47 (27%), Positives = 25/47 (53%)

Query: 444 EEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRL 490
           EE+  H+++ +  K +KKK+EKEK    +      + +E+   A  +
Sbjct: 85  EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMV 131


>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
          Length = 364

 Score = 28.9 bits (66), Expect = 6.4
 Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 11/65 (16%)

Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQ--READRLLRMDERK--RSYNSMISV 506
           + +K  K   K+ + E    EK+ E L   EE +  R  +  L  +E K  +S   +++ 
Sbjct: 146 RLEKKAKGGDKEAKAELELLEKLLEHL---EEGKPARTLE--LTDEEEKLLKSLQ-LLTA 199

Query: 507 KEPTM 511
           K P +
Sbjct: 200 K-PVL 203


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 28.9 bits (65), Expect = 6.5
 Identities = 13/40 (32%), Positives = 23/40 (57%), Gaps = 5/40 (12%)

Query: 453 KKNKKSKKKKKEKEKSAE-----EKMKEALKKEEEEQREA 487
           K NK+  K K  KE + E     +++K+ + ++E E +EA
Sbjct: 58  KLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEA 97


>gnl|CDD|223325 COG0247, GlpC, Fe-S oxidoreductase [Energy production and
           conversion].
          Length = 388

 Score = 29.1 bits (65), Expect = 6.6
 Identities = 16/54 (29%), Positives = 22/54 (40%)

Query: 299 FSGDTRKHATAQLFAWEAYEKGVDVHVLAEPTKLELLKKEYEEKKEEFKKEVSS 352
           F    +K A   + A++   +G D  V   P     LKKEY E   E   +V  
Sbjct: 198 FLERAKKLAKKNIEAFKKLIEGGDPIVTVCPACYGALKKEYPELLGERALKVVD 251


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 29.0 bits (65), Expect = 6.6
 Identities = 17/59 (28%), Positives = 20/59 (33%), Gaps = 9/59 (15%)

Query: 424 GASLGPPPGIMNRPEHQDEDEEDSGHKKQKKNKKSKKK---------KKEKEKSAEEKM 473
           G S     G        +ED    G K QKK +    K         KKE  KS E+  
Sbjct: 425 GKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESVKSQEDNN 483


>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
          Length = 147

 Score = 28.1 bits (64), Expect = 6.8
 Identities = 6/32 (18%), Positives = 14/32 (43%)

Query: 450 KKQKKNKKSKKKKKEKEKSAEEKMKEALKKEE 481
           K+ +  +   + K  +E +  E + E L+   
Sbjct: 45  KQLEARRAELEAKAAEELAEAEALAEKLEGLT 76


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 29.1 bits (65), Expect = 7.1
 Identities = 9/47 (19%), Positives = 22/47 (46%)

Query: 436 RPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE 482
           +     ++E+   ++   + K  +++K+EK+  A+       K EE 
Sbjct: 124 KESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNEER 170


>gnl|CDD|184712 PRK14501, PRK14501, putative bifunctional trehalose-6-phosphate
           synthase/HAD hydrolase subfamily IIB; Provisional.
          Length = 726

 Score = 29.1 bits (66), Expect = 7.3
 Identities = 15/32 (46%), Positives = 19/32 (59%), Gaps = 2/32 (6%)

Query: 470 EEKMKEALKKEEEEQREADRLLRMDERKRSYN 501
              +K AL+  EEEQRE  R+  M ER R Y+
Sbjct: 419 AAAIKRALEMPEEEQRE--RMQAMQERLRRYD 448


>gnl|CDD|189037 cd09867, PIN_FEN1, PIN domain of Flap Endonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Flap endonuclease-1 (FEN1) is
           involved in multiple DNA metabolic pathways, including
           DNA replication processes (5' flap DNA endonuclease
           activity and double stranded DNA 5'-exonuclease
           activity) and DNA repair processes (long-patch base
           excision repair) in eukaryotes and archaea. Interaction
           between FEN1 and PCNA (Proliferating cell nuclear
           antigen) is an essential prerequisite to FEN1's DNA
           replication functionality and stimulates FEN1 nuclease
           activity by 10-50 fold. FEN1 belongs to the
           FEN1-EXO1-like family of structure-specific, 5'
           nucleases. These nucleases contain a PIN (PilT N
           terminus) domain with a helical arch/clamp region (I
           domain) of variable length (approximately 45 residues in
           FEN1 PIN domains) and a H3TH (helix-3-turn-helix)
           domain, an atypical helix-hairpin-helix-2-like region.
           Both the H3TH domain (not included here) and the helical
           arch/clamp region are involved in DNA binding. Nucleases
           within this group also have a carboxylate-rich active
           site that is involved in binding essential divalent
           metal ion cofactors (Mg2+/Mn2+).  FEN1 has a C-terminal
           extension containing residues forming the consensus
           PIP-box - Qxx(M/L/I)xxF(Y/F) which serves to anchor FEN1
           to PCNA.
          Length = 261

 Score = 28.6 bits (65), Expect = 7.5
 Identities = 12/37 (32%), Positives = 25/37 (67%), Gaps = 2/37 (5%)

Query: 461 KKKEKEKSAEEKMKEALKKEEEEQRE--ADRLLRMDE 495
           K++E+ + AEEK++EA ++ + E+    A R +R+ +
Sbjct: 94  KRRERREEAEEKLEEAKEEGDAEEARKYAKRTVRVTK 130


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 27.7 bits (62), Expect = 7.7
 Identities = 12/48 (25%), Positives = 24/48 (50%)

Query: 451 KQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERKR 498
           K+    K K K ++ EK A+ +++   ++  E+ R    L    E++R
Sbjct: 28  KKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPVLPEDLEKER 75


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 29.2 bits (65), Expect = 7.8
 Identities = 11/60 (18%), Positives = 30/60 (50%)

Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMDERK 497
           E  D  +E    ++++     ++ +KE+E  A+   +   +++E++ +E +  L   E +
Sbjct: 238 ERIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEE 297


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.0 bits (66), Expect = 7.8
 Identities = 17/63 (26%), Positives = 38/63 (60%), Gaps = 1/63 (1%)

Query: 434 MNRPEHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEE-EQREADRLLR 492
           + + E +   +E++  +K +  +K +++ ++KEK  E+K +E  KKEEE E+   ++L  
Sbjct: 84  LQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143

Query: 493 MDE 495
           ++ 
Sbjct: 144 LER 146


>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
           SseC is a secreted protein that forms a complex together
           with SecB and SecD on the surface of Salmonella. All
           these proteins are secreted by the type III secretion
           system. Many mucosal pathogens use type III secretion
           systems for the injection of effector proteins into
           target cells. SecB, SseC and SecD are inserted into the
           target cell membrane. where they form a small pore or
           translocon. In addition to SseC, this family includes
           the bacterial secreted proteins PopB, PepB, YopB and
           EspD which are thought to be directly involved in pore
           formation, and type III secretion system translocon.
          Length = 303

 Score = 28.6 bits (64), Expect = 7.9
 Identities = 12/39 (30%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 450 KKQKKNKKSK-KKKKEKEKSAEEKMKEALKKEEEEQREA 487
           +KQ K+K  + ++ ++K++   E+ +E +KK  E+  EA
Sbjct: 14  EKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAEEA 52


>gnl|CDD|216853 pfam02030, Lipoprotein_8, Hypothetical lipoprotein (MG045 family). 
           This family includes hypothetical lipoproteins, the
           amino terminal part of this protein is related to
           pfam01547, a family of solute binding proteins. This
           suggests this family also has a solute binding function.
          Length = 493

 Score = 28.8 bits (64), Expect = 8.1
 Identities = 8/32 (25%), Positives = 17/32 (53%)

Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQ 484
           K   ++  K+++K++S E + K   +KE    
Sbjct: 390 KTGFEADLKRQKKQESFEIQTKYKTEKERNHI 421


>gnl|CDD|235012 PRK02201, PRK02201, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 357

 Score = 28.8 bits (65), Expect = 8.2
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 461 KKKEKEKSAEEKMKEALKKEEEEQR 485
           KKK KE++   K KEALKK+ + Q 
Sbjct: 279 KKKNKERTLNVKEKEALKKQNKTQN 303


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 28.5 bits (64), Expect = 8.5
 Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 2/57 (3%)

Query: 438 EHQDEDEEDSGHKKQKKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLRMD 494
           E +  D ED   K   +  K  K+  ++E S E++ + A ++E E ++E    L   
Sbjct: 126 EIESSDSEDEEEK--DEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATT 180


>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
           This domain is found associated with the Pfam tRNA
           synthetase class II domain (pfam00587) and represents
           the N-terminal domain of seryl-tRNA synthetase.
          Length = 108

 Score = 27.2 bits (61), Expect = 8.5
 Identities = 15/46 (32%), Positives = 26/46 (56%), Gaps = 8/46 (17%)

Query: 451 KQKKNKKSK---KKKKEKEKSAE-----EKMKEALKKEEEEQREAD 488
           + ++N+ SK   K KK+KE +       +++K+ LK  E E RE +
Sbjct: 49  QAERNELSKEIGKAKKKKEDAEALIAEVKELKDELKALEAELRELE 94


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 26.2 bits (58), Expect = 8.6
 Identities = 16/40 (40%), Positives = 27/40 (67%)

Query: 453 KKNKKSKKKKKEKEKSAEEKMKEALKKEEEEQREADRLLR 492
           +K  +  KK +EKEK+ E + +EA  +++EE+ EA + LR
Sbjct: 8   EKRAEFDKKLEEKEKALEAEKEEAEARQKEEEEEAIKQLR 47


>gnl|CDD|225995 COG3464, COG3464, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 402

 Score = 28.6 bits (64), Expect = 9.0
 Identities = 21/107 (19%), Positives = 34/107 (31%), Gaps = 17/107 (15%)

Query: 96  LDEWYKRGVDTTSQAKKFRKGACENCGAMTHKKKDCLERPRKVLAKYSNAQIACDEFVQP 155
           L+E     +     A+  RK  C  CG  T ++     R RK+        +   E   P
Sbjct: 19  LEEVEMGPIIIDLYAEMPRKHRCPECGQRTIRRHG--WRIRKIQD------LPLFEV--P 68

Query: 156 KLIQDYDSKRDRWAGYDPSNHKA----IVEHYQKIEEAKRELRADKL 198
             +      + R+         A     V+  Q+I     +  A  L
Sbjct: 69  VYLFLR---KRRYKCCRCGKRFAEKVDFVDRAQRITRRWAQAAAIAL 112


>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
          Length = 1094

 Score = 28.7 bits (64), Expect = 9.3
 Identities = 17/55 (30%), Positives = 32/55 (58%), Gaps = 6/55 (10%)

Query: 448 GHKKQKKNKKSKKKKKEKEKSAE------EKMKEALKKEEEEQREADRLLRMDER 496
           G +K+KK KKSK+KKK+K + A       E ++E ++    E+      L+++++
Sbjct: 682 GSRKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 28.5 bits (64), Expect = 9.5
 Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 2/59 (3%)

Query: 441 DEDEEDSGHKKQKKNKKSKKKKKEKEKSA--EEKMKEALKKEEEEQREADRLLRMDERK 497
             +  ++  K+QK+  K K  KK+ +KS    +     L +   E +EA+      ++K
Sbjct: 2   MNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 28.3 bits (64), Expect = 9.9
 Identities = 17/72 (23%), Positives = 38/72 (52%), Gaps = 13/72 (18%)

Query: 440 QDEDEEDSGHKKQKKNK-------------KSKKKKKEKEKSAEEKMKEALKKEEEEQRE 486
           Q+EDE ++  K++K+ K             + K+++KE+E+  E K+ E  +++ E + E
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEE 168

Query: 487 ADRLLRMDERKR 498
            +   R  + ++
Sbjct: 169 REAERRERKEEK 180


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.309    0.129    0.368 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,778,914
Number of extensions: 2672825
Number of successful extensions: 9566
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7779
Number of HSP's successfully gapped: 789
Length of query: 520
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 419
Effective length of database: 6,457,848
Effective search space: 2705838312
Effective search space used: 2705838312
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 61 (27.1 bits)