RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6666
         (175 letters)



>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score =  129 bits (325), Expect = 5e-36
 Identities = 59/188 (31%), Positives = 96/188 (51%), Gaps = 16/188 (8%)

Query: 1   MDESGEQFVAYFLP-------TAETLDKRKLDQLSNLEYTDEQEYEYKMAREYNWNVKSK 53
           ++   E+FV+Y+LP         +TL+KR  D     +Y +++EY++K  REY+  VKSK
Sbjct: 233 VELEEEEFVSYYLPDEKDSEILEDTLEKRSDDLH---DYDEDEEYKFKRVREYDMKVKSK 289

Query: 54  SCKGYEENYFLIVRD-DAVYYNELETRVRLSKRR-----QKVGAAPNNTRLVVTHRPLNA 107
           + K  E   F +  +   VYY  L +RV L +RR     + +    NN +L V  R  + 
Sbjct: 290 ATKLNELALFFVSDENGVVYYKPLRSRVELRRRRVNDVIRPLVREHNNDQLNVKLRNPST 349

Query: 108 TEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
            E +++  R   L+P + EE +EDE+EEEE+  +  + E  E   +EG     +G  E  
Sbjct: 350 KESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESS 409

Query: 168 ISECSNED 175
               S+ +
Sbjct: 410 SDVGSDSE 417


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 38.5 bits (89), Expect = 0.001
 Identities = 17/47 (36%), Positives = 32/47 (68%), Gaps = 3/47 (6%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA---ERQEGEGGE 159
           R+ F+   ++E+E+ED+++EE++ EE ++ E  +    E +E EGGE
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194



 Score = 33.1 bits (75), Expect = 0.073
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ-EGEGGEREGGGEGE 167
           E      N   +++DE+E+E++ +E    E  E E + +G   E E    GE
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194



 Score = 32.7 bits (74), Expect = 0.084
 Identities = 12/51 (23%), Positives = 30/51 (58%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           ++++E+++E++++E ++ ++ E  E  +   +  E + GGE    E S  D
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVD 204


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 38.1 bits (88), Expect = 0.002
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
           + EEEEE+EEEEEEE EE ++ E  E E +E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 37.3 bits (86), Expect = 0.003
 Identities = 17/31 (54%), Positives = 21/31 (67%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAE 151
           +   EEEEEE+EEEEEEE EE ++ E    E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 36.1 bits (83), Expect = 0.007
 Identities = 19/34 (55%), Positives = 21/34 (61%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
           E   EEEEEE+EEEEEEE EE +  E    E  E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPE 900



 Score = 35.7 bits (82), Expect = 0.008
 Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 1/70 (1%)

Query: 106 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEG-GEREGGG 164
           N +E  I   R+   E   E E +E + + E EAEE +    TEAE  E EG  E    G
Sbjct: 679 NESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEG 738

Query: 165 EGEISECSNE 174
           E    E   E
Sbjct: 739 EEVEDEGEGE 748



 Score = 35.4 bits (81), Expect = 0.014
 Identities = 18/34 (52%), Positives = 22/34 (64%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
           E   EEEEEE+EEEEEEE EE    E  E  +++
Sbjct: 872 EEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905



 Score = 34.2 bits (78), Expect = 0.026
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
           + EEE+EEEEEEE EE ++ E  E E +  E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 32.3 bits (73), Expect = 0.13
 Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGG-GEGEI 168
           EE E+E E E E + E   +G+  E E +     E +    EGEI
Sbjct: 739 EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEI 783



 Score = 31.1 bits (70), Expect = 0.34
 Identities = 17/47 (36%), Positives = 21/47 (44%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           E P E E E  EE   E  +E +     E E +     ER+G  EGE
Sbjct: 650 ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696



 Score = 30.7 bits (69), Expect = 0.36
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 129 EEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
           + +EEEEEEE EE ++ E  E E +E E  E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 30.7 bits (69), Expect = 0.38
 Identities = 22/67 (32%), Positives = 27/67 (40%), Gaps = 9/67 (13%)

Query: 116 RERFLEPPNEEEEE---EDEEEEEEEAEETQQGETTEAERQ-----EGEGGEREGGGEGE 167
           R    E  N EE     E E E E + E   +GE   AER+     EGE   +E   +GE
Sbjct: 651 RPTEAEGENGEESGGEAEQEGETETKGENESEGE-IPAERKGEQEGEGEIEAKEADHKGE 709

Query: 168 ISECSNE 174
                 E
Sbjct: 710 TEAEEVE 716



 Score = 29.6 bits (66), Expect = 0.86
 Identities = 20/48 (41%), Positives = 22/48 (45%), Gaps = 1/48 (2%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEI 168
           E    E E E E E  E+  E + GE  E    EGE GE EG  E E 
Sbjct: 711 EAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGE-GEAEGKHEVET 757



 Score = 29.6 bits (66), Expect = 1.1
 Identities = 15/31 (48%), Positives = 18/31 (58%)

Query: 131 DEEEEEEEAEETQQGETTEAERQEGEGGERE 161
           D EEEEEE EE ++ E  E E +E E    E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892



 Score = 29.6 bits (66), Expect = 1.1
 Identities = 19/47 (40%), Positives = 24/47 (51%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
             E E E E +E+E+  E Q GE  E +  EG  G+ E  GE E  E
Sbjct: 765 EHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGE 811



 Score = 29.2 bits (65), Expect = 1.2
 Identities = 19/52 (36%), Positives = 24/52 (46%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
            E+E E +  EE EE E+  +GE       E EG  +E   EGE      ED
Sbjct: 726 TEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKED 777



 Score = 29.2 bits (65), Expect = 1.3
 Identities = 16/51 (31%), Positives = 21/51 (41%), Gaps = 4/51 (7%)

Query: 121 EPPNEEE---EEEDEEEEEEEAEETQQGET-TEAERQEGEGGEREGGGEGE 167
           E   E E   E E E E   E +  Q+GE   EA+  + +G       E E
Sbjct: 668 EQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718



 Score = 29.2 bits (65), Expect = 1.3
 Identities = 14/52 (26%), Positives = 20/52 (38%)

Query: 123 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
              E E + E E E E    ++GE       E +  + +G  E E  E   E
Sbjct: 669 QEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGE 720



 Score = 29.2 bits (65), Expect = 1.3
 Identities = 14/42 (33%), Positives = 17/42 (40%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
           E   E E  EDE E E   E  +  +  E E +     E EG
Sbjct: 718 EGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759



 Score = 29.2 bits (65), Expect = 1.5
 Identities = 17/54 (31%), Positives = 24/54 (44%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
           E   E E  E+ EE E+E E   +G+       + +  E EG  E E  E  +E
Sbjct: 727 EDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780



 Score = 28.8 bits (64), Expect = 1.8
 Identities = 17/60 (28%), Positives = 26/60 (43%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           +E   E   E E +EDE+E E +A E  + +  E    + E       GE +  E  +E 
Sbjct: 762 KETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSET 821



 Score = 28.8 bits (64), Expect = 1.9
 Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 15/66 (22%)

Query: 125 EEEEEEDEEEEEEEAEETQQ------GETTEAE---------RQEGEGGEREGGGEGEIS 169
           E E E  E++E E   ETQ        ET E E         +Q+ +G +  GG +G  S
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863

Query: 170 ECSNED 175
           E   E+
Sbjct: 864 EEEEEE 869



 Score = 28.4 bits (63), Expect = 2.1
 Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEG-GEREGGGEGEI 168
           E   E E + + E E +  E   +GET EAE +E E  GE + G +GE+
Sbjct: 744 EGEGEAEGKHEVETEGDRKETEHEGET-EAEGKEDEDEGEIQAGEDGEM 791



 Score = 28.4 bits (63), Expect = 2.6
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
               +  + EEEEE EE ++ E  E E +E E  E E
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 28.0 bits (62), Expect = 3.0
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
           +     D  + EEE EE ++ E  E E +E E  E E
Sbjct: 853 DGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889



 Score = 27.7 bits (61), Expect = 4.7
 Identities = 16/43 (37%), Positives = 20/43 (46%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           E E +  E E E E E   + +  E E Q GE GE +G    E
Sbjct: 756 ETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798



 Score = 27.3 bits (60), Expect = 5.0
 Identities = 12/40 (30%), Positives = 17/40 (42%)

Query: 128 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           E   EE E     E + GE +  E ++    E +G  E E
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESE 682



 Score = 27.3 bits (60), Expect = 6.0
 Identities = 15/50 (30%), Positives = 21/50 (42%)

Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           E E E + E E E E   + +  +    E E  E +  GE E  E  +E 
Sbjct: 670 EGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEG 719



 Score = 27.3 bits (60), Expect = 6.4
 Identities = 16/48 (33%), Positives = 21/48 (43%)

Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           L   ++ +  E E   E   EE ++    E E  E  GGE E  GE E
Sbjct: 627 LGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETE 674



 Score = 26.9 bits (59), Expect = 6.5
 Identities = 19/46 (41%), Positives = 23/46 (50%), Gaps = 13/46 (28%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEIS 169
           +EEEEEE+EEEEEEE              +E E  E E   E  +S
Sbjct: 863 SEEEEEEEEEEEEEE-------------EEEEEEEEEEEENEEPLS 895


>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
          Length = 584

 Score = 35.8 bits (83), Expect = 0.008
 Identities = 14/57 (24%), Positives = 21/57 (36%), Gaps = 3/57 (5%)

Query: 96  TRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAER 152
            RLV+  R   AT             PP     E+D++  E+E E+    +    E 
Sbjct: 247 ARLVLAPR---ATRLPAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300


>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein.  This
           protein is covalently attached to the terminii of
           replicating DNA in vivo.
          Length = 548

 Score = 35.4 bits (82), Expect = 0.011
 Identities = 20/43 (46%), Positives = 24/43 (55%)

Query: 114 RLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
           R R R   PP+  E EE+EEEEEE  EE ++ E  E    E E
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 32.3 bits (74), Expect = 0.12
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query: 111 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 153
           R +R R     PP  EEEEE+EEE  EE EE ++ E    E +
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333



 Score = 31.5 bits (72), Expect = 0.22
 Identities = 20/46 (43%), Positives = 24/46 (52%)

Query: 111 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
           R+   R R   PP    E E+EEEEEEE  E ++ E  E ER   E
Sbjct: 287 RLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEE 332


>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
          Length = 249

 Score = 34.3 bits (79), Expect = 0.019
 Identities = 12/41 (29%), Positives = 17/41 (41%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 165
           + EE E++EE   E    +      AE +EGE        E
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246



 Score = 32.3 bits (74), Expect = 0.084
 Identities = 13/41 (31%), Positives = 17/41 (41%)

Query: 127 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           + EE EE+EE  AE     +   A  +  EG E       E
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 34.1 bits (78), Expect = 0.027
 Identities = 34/103 (33%), Positives = 43/103 (41%), Gaps = 14/103 (13%)

Query: 75  ELETRVRLSKRRQKV---GAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEED 131
           E E R R  + RQ      + P+         P  A++           E   EE  EE+
Sbjct: 400 EEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQ-----------ESEEEESVEEE 448

Query: 132 EEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
           EEEEEEE EE Q+ E  E E +E E       G  E  E S+E
Sbjct: 449 EEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 32.6 bits (75), Expect = 0.075
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 109 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
           E R      R LE  ++++EE DEE+EE    E  + +  + +  +    ER
Sbjct: 113 EPRYDDA-YRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRER 163



 Score = 30.0 bits (68), Expect = 0.62
 Identities = 9/52 (17%), Positives = 28/52 (53%)

Query: 110 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
               R  + + +   +++++E+ +EE+EE+ +++  E  + +  + +   RE
Sbjct: 111 QDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 32.8 bits (75), Expect = 0.085
 Identities = 15/42 (35%), Positives = 25/42 (59%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 166
           EEEE+E+EEEEE+E E   +  + + E +E +   +    +G
Sbjct: 391 EEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432



 Score = 32.0 bits (73), Expect = 0.15
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGG 164
                +   EE+EEE+EEE+E+E    +  +  E E  + E    +  G
Sbjct: 384 ANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432



 Score = 27.0 bits (60), Expect = 6.3
 Identities = 15/48 (31%), Positives = 26/48 (54%)

Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           +E  N E ++E+EE+EEEE EE +    ++    + E  E +   + E
Sbjct: 381 IEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYE 428


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 32.5 bits (74), Expect = 0.090
 Identities = 15/55 (27%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           E   ++  EED +EE+ + ++    E +EA R+E EG +     E + +E ++ +
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSD---ESEEDEAEATDGE 271



 Score = 32.1 bits (73), Expect = 0.13
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 1/52 (1%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           NE+ E   EE E  +  E  + E T+ E +EGE    E   + E S+ S+ED
Sbjct: 244 NEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSE-SDESDED 294



 Score = 30.6 bits (69), Expect = 0.42
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 4/52 (7%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           ++ +  ED E   EE+E + + E  EAE  +GEG E    GE + +E S + 
Sbjct: 239 DQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEE----GEMDAAEASEDS 286


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 32.4 bits (74), Expect = 0.097
 Identities = 18/41 (43%), Positives = 27/41 (65%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
           +E   E P+EEEEEE EE++EEE + T + E  + E ++ E
Sbjct: 29  KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69



 Score = 26.7 bits (59), Expect = 8.5
 Identities = 15/38 (39%), Positives = 22/38 (57%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
           E E+E  +EEEEEE EE ++ E    +++E    E E 
Sbjct: 30  EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK 67


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 32.1 bits (74), Expect = 0.12
 Identities = 14/21 (66%), Positives = 15/21 (71%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
               EEEEEE+EEEEEEE  E
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSE 318



 Score = 31.0 bits (71), Expect = 0.31
 Identities = 14/20 (70%), Positives = 16/20 (80%)

Query: 122 PPNEEEEEEDEEEEEEEAEE 141
              EEEEEE+EEEEEE +EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEE 319



 Score = 30.6 bits (70), Expect = 0.32
 Identities = 13/19 (68%), Positives = 14/19 (73%)

Query: 121 EPPNEEEEEEDEEEEEEEA 139
           E   EEEEEE+EE  EEEA
Sbjct: 303 EEEEEEEEEEEEEPSEEEA 321



 Score = 30.2 bits (69), Expect = 0.56
 Identities = 13/21 (61%), Positives = 14/21 (66%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
               EEEEEE+EEEEE   EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEE 320



 Score = 29.1 bits (66), Expect = 1.2
 Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 121 EPPNEEEEEEDEEEEEEEAE 140
           E   EEEEEE+E  EEE A 
Sbjct: 304 EEEEEEEEEEEEPSEEEAAA 323



 Score = 27.9 bits (63), Expect = 2.6
 Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 122 PPNEEEEEEDEEEEEEEAEE 141
                 EEE+EEEEEEE EE
Sbjct: 296 AQAAAAEEEEEEEEEEEEEE 315



 Score = 27.9 bits (63), Expect = 3.2
 Identities = 12/21 (57%), Positives = 14/21 (66%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
           +    EEEEE+EEEEEEE   
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPS 317



 Score = 27.5 bits (62), Expect = 4.3
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
                 EEEE+EEEEEEE E 
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEP 316


>gnl|CDD|220776 pfam10483, Hap2_elong, Histone acetylation protein 2.  Hap2 is one
           of three histone acetyltransferases proteins that, in
           yeasts, are found associated with elongating forms of
           RNA polymerase II (Elongator). The Haps can be isolated
           in two forms, as a six-subunit complex with Elongator
           and as a complex of the three proteins on their own. The
           role of the Hap complex in transcription is still
           speculative, being possibly to keep the HAT activity of
           free Elongator in check, allowing histone acetylation
           only in the presence of a transcribing polymerase, or
           the interaction with Haps might render Elongator
           susceptible to modifications thereby altering its
           activity.
          Length = 280

 Score = 31.5 bits (72), Expect = 0.17
 Identities = 29/116 (25%), Positives = 43/116 (37%), Gaps = 30/116 (25%)

Query: 75  ELETRVR-LSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQR---LRERFLEPPNE----- 125
           + E   R LS+    +    N   +V+        E R +    L E F+  P       
Sbjct: 165 DKEALDRSLSEPVFPLNEGLNGVGIVLE------LENRRKSGRSLTETFILSPATHEYES 218

Query: 126 -EEEEEDEEEEEEEAEETQQGETT------EAERQEGEG--------GEREGGGEG 166
            +E+ E + E  EE  E+ +  TT      E +R   E          E+ GGGEG
Sbjct: 219 LKEKPEGDAEAAEEDPESLEDLTTFNLGLSEKQRLAREQVELPFFDAQEKLGGGEG 274


>gnl|CDD|233683 TIGR02005, PTS-IIBC-alpha, PTS system, alpha-glucoside-specific
           IIBC component.  This model represents a family of fused
           PTS enzyme II B and C domains. A gene from Clostridium
           has been partially characterized as a maltose
           transporter, while genes from Fusobacterium and
           Klebsiella have been proposed to transport the five
           non-standard isomers of sucrose [Transport and binding
           proteins, Carbohydrates, organic alcohols, and acids].
          Length = 524

 Score = 31.3 bits (71), Expect = 0.20
 Identities = 15/69 (21%), Positives = 31/69 (44%), Gaps = 6/69 (8%)

Query: 110 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG-----GG 164
           FR   L+     P  E+ EEE +   + + +  ++G++ E++R      +        GG
Sbjct: 402 FRFLILKFNIPTPGREDTEEEVKLYSKADYK-AKKGDSVESKRASSGNDKAAYFLQALGG 460

Query: 165 EGEISECSN 173
           +  I + +N
Sbjct: 461 KENIKDVTN 469


>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
           Cobaltochelatase is responsible for the insertion of
           cobalt into the corrin ring of coenzyme B12 during its
           biosynthesis. Two versions have been well described.
           CbiK/CbiX is a monomeric, anaerobic version which acts
           early in the biosynthesis (pfam06180). CobNST is a
           trimeric, ATP-dependent, aerobic version which acts late
           in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
           number of genomes (actinobacteria, cyanobacteria,
           betaproteobacteria and pseudomonads) which apparently
           biosynthesize B12, encode a cobN gene but are
           demonstrably lacking cobS and cobT. These genomes do,
           however contain a homolog (modelled here) of the
           magnesium chelatase subunits BchI/BchD family. Aside
           from the cyanobacteria (which have a separate magnesium
           chelatase trimer), these species do not make chlorins,
           so do not have any use for a magnesium chelatase.
           Furthermore, in nearly all cases the members of this
           family are proximal to either CobN itself or other genes
           involved in cobalt transport or B12 biosynthesis.
          Length = 633

 Score = 31.2 bits (71), Expect = 0.24
 Identities = 18/82 (21%), Positives = 30/82 (36%), Gaps = 8/82 (9%)

Query: 97  RLVVTHR----PLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEE----AEETQQGETT 148
            LV+ HR    P    + + ++  E   E P  + E+ DE E++ E          G   
Sbjct: 302 ELVLPHRRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDD 361

Query: 149 EAERQEGEGGEREGGGEGEISE 170
           E E    E  + +G       +
Sbjct: 362 EKEPDPQEEADGQGSSTDPAGD 383


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 30.9 bits (70), Expect = 0.28
 Identities = 18/42 (42%), Positives = 21/42 (50%), Gaps = 4/42 (9%)

Query: 118 RFLE----PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 155
            FL+    P  E   EE+EEEEEEE EE  + E    E   G
Sbjct: 388 EFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPG 429



 Score = 30.2 bits (68), Expect = 0.61
 Identities = 15/34 (44%), Positives = 18/34 (52%)

Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA 150
           ER+ E   EEEEEE+EE  E EA   +     E 
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEV 432


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
           consists of several bacterial cobalamin biosynthesis
           (CobT) proteins. CobT is involved in the transformation
           of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 30.9 bits (70), Expect = 0.30
 Identities = 15/55 (27%), Positives = 27/55 (49%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
              NE+E++  E+E++++ EE + G +           E    GE E +E S +D
Sbjct: 221 SEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADD 275


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 30.8 bits (70), Expect = 0.39
 Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 155
           R  ++ P  ++EEEE E+EEEEE EE    E  E E +EG
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEE----EPEEPEPEEG 373



 Score = 26.6 bits (59), Expect = 8.1
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 127 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 166
            E E+EEEEEE  E  ++ E  E E +E E   +E  G G
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTG 207


>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D.  This
           model represents one of two ATPase subunits of the
           trimeric magnesium chelatase responsible for insertion
           of magnesium ion into protoporphyrin IX. This is an
           essential step in the biosynthesis of both chlorophyll
           and bacteriochlorophyll. This subunit is found in green
           plants, photosynthetic algae, cyanobacteria and other
           photosynthetic bacteria. Unlike subunit I (TIGR02030),
           this subunit is not found in archaea [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Chlorophyll
           and bacteriochlorphyll].
          Length = 589

 Score = 30.5 bits (69), Expect = 0.44
 Identities = 8/29 (27%), Positives = 14/29 (48%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTE 149
             P E EEE DE ++ +  +  +  +  E
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQIPE 308


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 30.3 bits (68), Expect = 0.45
 Identities = 12/18 (66%), Positives = 16/18 (88%)

Query: 121 EPPNEEEEEEDEEEEEEE 138
           +  +EE+EEE+EEEEEEE
Sbjct: 225 DSESEEDEEEEEEEEEEE 242



 Score = 29.5 bits (66), Expect = 0.88
 Identities = 13/17 (76%), Positives = 15/17 (88%)

Query: 125 EEEEEEDEEEEEEEAEE 141
           E EE+E+EEEEEEE EE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243



 Score = 29.1 bits (65), Expect = 1.1
 Identities = 12/17 (70%), Positives = 14/17 (82%)

Query: 122 PPNEEEEEEDEEEEEEE 138
              E+EEEE+EEEEEEE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243



 Score = 28.0 bits (62), Expect = 2.6
 Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 121 EPPNEEEEEEDEEEEEEEAE 140
                EE+EE+EEEEEEE E
Sbjct: 224 SDSESEEDEEEEEEEEEEEE 243



 Score = 26.4 bits (58), Expect = 7.9
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQ 153
           + E EEDEEEEEEE EE +    T  +R+
Sbjct: 225 DSESEEDEEEEEEEEEEEEIDVVTVEKRR 253


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 29.2 bits (66), Expect = 0.53
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGE 146
               EE++EE+EEEEE+E  E +   
Sbjct: 74  AAAAEEKKEEEEEEEEKEESEEEAAA 99



 Score = 26.5 bits (59), Expect = 4.5
 Identities = 10/25 (40%), Positives = 16/25 (64%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQG 145
                EE++E+EEEEEE+ E  ++ 
Sbjct: 73  AAAAAEEKKEEEEEEEEKEESEEEA 97



 Score = 26.1 bits (58), Expect = 5.6
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQG 145
               ++EEEE+EEE+EE  EE   G
Sbjct: 76  AAEEKKEEEEEEEEKEESEEEAAAG 100


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 29.3 bits (66), Expect = 0.61
 Identities = 13/35 (37%), Positives = 22/35 (62%)

Query: 127 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
            ++E+EE EEE  +ET+Q +  E + +  E  +RE
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128



 Score = 27.4 bits (61), Expect = 2.8
 Identities = 12/33 (36%), Positives = 21/33 (63%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
           +EEEE E+E  +E E E+  + +T   E+++ E
Sbjct: 96  DEEEETEEESTDETEQEDPPETKTESKEKKKRE 128



 Score = 26.6 bits (59), Expect = 5.2
 Identities = 11/32 (34%), Positives = 19/32 (59%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
           ++EEEE EEE  +E E+    ET    +++ +
Sbjct: 95  DDEEEETEEESTDETEQEDPPETKTESKEKKK 126



 Score = 26.6 bits (59), Expect = 6.1
 Identities = 12/35 (34%), Positives = 19/35 (54%)

Query: 118 RFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAER 152
           R L   ++EEEE +EE  +E  +E      TE++ 
Sbjct: 89  RELSSSDDEEEETEEESTDETEQEDPPETKTESKE 123


>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi.  This family includes
           the fifth essential DNA polymerase in yeast EC:2.7.7.7.
           Pol5p is localised exclusively to the nucleolus and
           binds near or at the enhancer region of rRNA-encoding
           DNA repeating units.
          Length = 784

 Score = 30.2 bits (68), Expect = 0.63
 Identities = 10/45 (22%), Positives = 16/45 (35%)

Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
           +      + E E E + E+ EE E+    E  E      +   R 
Sbjct: 663 DECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRA 707



 Score = 27.9 bits (62), Expect = 3.3
 Identities = 15/36 (41%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQE-GEGGE 159
           EEE+E+D EE +++ +E +  E +E+E +  GE GE
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESESDGEDGE 683


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 30.2 bits (68), Expect = 0.66
 Identities = 14/54 (25%), Positives = 22/54 (40%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
           E  N +  EE++EEE E    +   ++ + E    +  E E   E   S    E
Sbjct: 3   EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAE 56



 Score = 29.0 bits (65), Expect = 1.7
 Identities = 12/51 (23%), Positives = 20/51 (39%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           EEE+   +  EEE+ EE +    +     + +    E   E E  +   E 
Sbjct: 1   EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAES 51



 Score = 27.8 bits (62), Expect = 3.2
 Identities = 11/51 (21%), Positives = 18/51 (35%)

Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           E  +     EEE+E+E E    +  +   +       E E  E +   E  
Sbjct: 2   EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESP 52


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 29.9 bits (67), Expect = 0.70
 Identities = 14/51 (27%), Positives = 25/51 (49%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
           NE+EE+ + E E +E    Q+ E T+ E + GE    +   +    E  ++
Sbjct: 223 NEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDD 273



 Score = 29.5 bits (66), Expect = 0.91
 Identities = 14/38 (36%), Positives = 21/38 (55%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
            E E+E++ E E E +E    + +EA  +E E GE E 
Sbjct: 221 TENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEM 258



 Score = 28.8 bits (64), Expect = 1.9
 Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 7/53 (13%)

Query: 123 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           P E E+EE  E E E  E +   E+   +R+          GE E+ +   +D
Sbjct: 220 PTENEQEEQGEGEGEGQEGSAPQESEATDRESES-------GEEEMVQSDQDD 265



 Score = 26.8 bits (59), Expect = 8.3
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           EE  ++ E E+EE+ ++ Q  E  + E+ EGEG  +EG    E SE ++ +
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQE-SEATDRE 250



 Score = 26.4 bits (58), Expect = 10.0
 Identities = 12/45 (26%), Positives = 20/45 (44%)

Query: 131 DEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           +  EE  +  E++  E  + ++      E +G GEGE  E S   
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQ 242


>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
           Tim54.  Mitochondrial function depends on the import of
           hundreds of different proteins synthesised in the
           cytosol. Protein import is a multi-step pathway which
           includes the binding of precursor proteins to surface
           receptors, translocation of the precursor across one or
           both mitochondrial membranes, and folding and assembly
           of the imported protein inside the mitochondrion. Most
           precursor proteins carry amino-terminal targeting
           signals, called pre-sequences, and are imported into
           mitochondria via import complexes located in both the
           outer and the inner membrane (IM). The IM complex, TIM,
           is made up of at least two proteins which mediate
           translocation of proteins into the matrix by removing
           their signal peptide and another pair of proteins, Tim54
           and Tim22, that insert the polytopic proteins, that
           carry internal targetting information, into the inner
           membrane.
          Length = 377

 Score = 29.7 bits (67), Expect = 0.71
 Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 111 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQ--GETTEAERQEGEGG 158
           RI++LR +  E P EE   E+E  +E++  + +Q  G   +A+   G GG
Sbjct: 122 RIRKLRRKAGEEPEEELPLEEEATKEDDLADPRQVLGVYYKAKDPPGAGG 171



 Score = 28.9 bits (65), Expect = 1.4
 Identities = 13/48 (27%), Positives = 17/48 (35%)

Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           L+PP   E   DE   E E E T   E+     +E      E   +  
Sbjct: 195 LDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAP 242


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 29.7 bits (67), Expect = 0.73
 Identities = 14/29 (48%), Positives = 15/29 (51%)

Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQE 154
             EEE+EEEEEEE EE    E       E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPE 431



 Score = 29.7 bits (67), Expect = 0.76
 Identities = 13/18 (72%), Positives = 14/18 (77%)

Query: 125 EEEEEEDEEEEEEEAEET 142
             EEEE+EEEEEEE EE 
Sbjct: 403 AAEEEEEEEEEEEEEEEP 420



 Score = 29.7 bits (67), Expect = 0.92
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEA 150
            EEEEE+EEEEEEE E   +     A
Sbjct: 404 AEEEEEEEEEEEEEEEPVAEVMMMPA 429



 Score = 28.5 bits (64), Expect = 1.9
 Identities = 15/36 (41%), Positives = 18/36 (50%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
           EEEEEE+EEEEEEE             +    GG +
Sbjct: 406 EEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGGIK 441



 Score = 27.7 bits (62), Expect = 3.2
 Identities = 15/28 (53%), Positives = 17/28 (60%)

Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQ 144
           ER+     EEEEEE+EEEEE  AE    
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAEVMMM 427


>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
           binding domain.  Human SGLT1 (hSGLT1) is a
           high-affinity/low-capacity glucose transporter, which
           can also transport galactose. In the transport
           mechanism, two Na+ ions first bind to the extracellular
           side of the transporter and induce a conformational
           change in the glucose binding site. This results in an
           increased affinity for glucose. A second conformational
           change in the transporter follows, bringing the Na+ and
           glucose binding sites to the inner surface of the
           membrane. Glucose is then released, followed by the Na+
           ions. In the process, hSGLT1 is also able to transport
           water and urea and may be a major pathway for transport
           of these across the intestinal brush-border membrane.
           hSGLT1 is encoded by the SLC5A1 gene and expressed
           mostly in the intestine, but also in the trachea,
           kidney, heart, brain, testis, and prostate. The
           WHO/UNICEF oral rehydration solution (ORS) for the
           treatment of secretory diarrhea contains salt and
           glucose. The glucose, along with sodium ions, is
           transported by hSGLT1 and water is either co-transported
           along with these or follows by osmosis. Mutations in
           SGLT1 are associated with intestinal glucose galactose
           malabsorption (GGM). Up-regulation of intestinal SGLT1
           may protect against enteric infections. SGLT1 is
           expressed in colorectal, head and neck, and prostate
           tumors. Epidermal growth factor receptor (EGFR)
           functions in cell survival by stabilizing SGLT1, and
           thereby maintaining intracellular glucose levels. SGLT1
           is predicted to have 14 membrane-spanning regions. This
           subgroup belongs to the solute carrier 5
           (SLC5)transporter family.
          Length = 635

 Score = 29.5 bits (66), Expect = 0.94
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
           N  EE  D + ++   +E +    T+ ER++
Sbjct: 539 NSTEERIDLDADDWTEDEDENEMETDEERKK 569



 Score = 29.1 bits (65), Expect = 1.3
 Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 7/40 (17%)

Query: 106 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQG 145
           N+TE RI       L+  ++  E+EDE E E + E  + G
Sbjct: 539 NSTEERID------LDA-DDWTEDEDENEMETDEERKKPG 571


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 29.6 bits (66), Expect = 0.99
 Identities = 8/47 (17%), Positives = 15/47 (31%), Gaps = 1/47 (2%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
             PN++     +   E+  EE +    +E E       + E      
Sbjct: 157 PSPNQQPSSFLQPSHEDSPEEPEPPT-SEPEPDSPGPPQSETPTSSP 202



 Score = 28.0 bits (62), Expect = 3.2
 Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 3/57 (5%)

Query: 117 ERFLEPPNEEEEEEDE---EEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
             FL+P +E+  EE E    E E ++    Q ET  +        +  G  +    +
Sbjct: 164 SSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQ 220



 Score = 27.6 bits (61), Expect = 3.6
 Identities = 13/45 (28%), Positives = 18/45 (40%)

Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
           E+     E + E AEE++ GE  E  +    G   E  G    S 
Sbjct: 63  EQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSP 107



 Score = 27.6 bits (61), Expect = 3.8
 Identities = 7/47 (14%), Positives = 13/47 (27%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
             P+E    E       +   +    + E   +E E    E   +  
Sbjct: 144 PGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSP 190



 Score = 27.2 bits (60), Expect = 4.7
 Identities = 2/47 (4%), Positives = 4/47 (8%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
             P                   +            +          E
Sbjct: 126 SSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHE 172



 Score = 26.9 bits (59), Expect = 6.7
 Identities = 4/51 (7%), Positives = 6/51 (11%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
                   EE            ++                 G       E 
Sbjct: 104 TPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES 154


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 29.6 bits (66), Expect = 1.1
 Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 4/42 (9%)

Query: 125  EEEEEEDEEEEEEEA----EETQQGETTEAERQEGEGGEREG 162
            E++E+ +E+  EE      E T+ G  ++ E ++GE  E + 
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQA 4098



 Score = 27.3 bits (60), Expect = 5.7
 Identities = 8/37 (21%), Positives = 15/37 (40%)

Query: 123  PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
                E+++ +E+E+EE      G   E +    E   
Sbjct: 3942 NKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNS 3978


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 29.2 bits (65), Expect = 1.1
 Identities = 6/34 (17%), Positives = 9/34 (26%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 158
           E      EE     +E   + E    E+      
Sbjct: 392 EAASAAPEEPAALASEAHDETEPEVPEKAAPIPD 425



 Score = 28.4 bits (63), Expect = 2.4
 Identities = 3/42 (7%), Positives = 11/42 (26%)

Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           +     + + E  +   ++     +E  +    E        
Sbjct: 382 QAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPI 423


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 29.2 bits (65), Expect = 1.1
 Identities = 17/52 (32%), Positives = 21/52 (40%), Gaps = 1/52 (1%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGE-TTEAERQEGEGGEREGGGEGEISEC 171
           E    EE +  EEEE+EE      GE   E   +E E  E    GE  +   
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYE 229


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 28.4 bits (64), Expect = 1.2
 Identities = 9/24 (37%), Positives = 22/24 (91%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGET 147
           +EE+EEE+++EE+++ +E+++ E+
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEES 140



 Score = 27.7 bits (62), Expect = 2.0
 Identities = 9/25 (36%), Positives = 20/25 (80%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETT 148
           ++EE+EE+E++EE++ E+  + E +
Sbjct: 116 DDEEDEEEEDDEEDDDEDESEEEES 140



 Score = 26.9 bits (60), Expect = 4.2
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 125 EEEEEEDEEEEEEEAEETQQ 144
           EE++EED++E+E E EE+  
Sbjct: 123 EEDDEEDDDEDESEEEESPV 142


>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
           domain 1.  The small mechanosensitive channel, MscS, is
           a part of the turgor-driven solute efflux system that
           protects bacteria from lysis in the event of osmotic
           shock. The MscS protein alone is sufficient to form a
           functional mechanosensitive channel gated directly by
           tension in the lipid bilayer. The MscS proteins are
           heptamers of three transmembrane subunits with seven
           converging M3 domains, and this domain is one of the
           inner membrane domains.
          Length = 339

 Score = 28.7 bits (65), Expect = 1.3
 Identities = 11/37 (29%), Positives = 17/37 (45%), Gaps = 4/37 (10%)

Query: 111 RI--QRLRER--FLEPPNEEEEEEDEEEEEEEAEETQ 143
           R+  +R + +   +     EEEEE  E   E  EE +
Sbjct: 257 RLAYERAKAKRAEILAQRAEEEEESSEGAAETIEEPE 293


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 29.2 bits (65), Expect = 1.4
 Identities = 16/44 (36%), Positives = 24/44 (54%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
           R+  +     EEEEE E EE+EE  E + G+  + + +E E  E
Sbjct: 131 RKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
           domain family is found in eukaryotes, and is typically
           between 125 and 136 amino acids in length.
          Length = 133

 Score = 28.2 bits (63), Expect = 1.4
 Identities = 13/42 (30%), Positives = 21/42 (50%)

Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
               ++EEEEEEE E+ QQ +  + ++ +      E G E  
Sbjct: 92  GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEESA 133



 Score = 27.5 bits (61), Expect = 2.8
 Identities = 10/19 (52%), Positives = 17/19 (89%)

Query: 125 EEEEEEDEEEEEEEAEETQ 143
           EEEEEE+E E++++++E Q
Sbjct: 97  EEEEEEEENEKQQQSDEAQ 115


>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
          Length = 246

 Score = 28.4 bits (64), Expect = 1.7
 Identities = 12/47 (25%), Positives = 14/47 (29%)

Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 166
             PP       +  E EE  EE     T E      +    EG   G
Sbjct: 30  PPPPEPAAPPPEPPEPEEVPEEEAAPLTEEELEAIRQQAHEEGFAAG 76


>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
           mRNA processing and transport [Signal transduction
           mechanisms / RNA processing and modification].
          Length = 388

 Score = 28.7 bits (64), Expect = 1.8
 Identities = 10/35 (28%), Positives = 17/35 (48%)

Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
            +EE ++E E +E+   +  E    E Q  +  ER
Sbjct: 348 TDEESEDEVEIDESVIEEVAEMELLEVQVDDLAER 382


>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
          Length = 869

 Score = 28.8 bits (65), Expect = 1.8
 Identities = 17/40 (42%), Positives = 20/40 (50%), Gaps = 1/40 (2%)

Query: 120 LEP-PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 158
           LEP P EE EEE  EE EE   E  + E    +   G+ G
Sbjct: 828 LEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQLG 867


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 28.3 bits (64), Expect = 2.0
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 113 QRLRERFLEPPNEEEEEEDEEEEEEEAEE 141
           Q++RE FL+    EEEEE+EEEE     E
Sbjct: 517 QKIREEFLQKHKNEEEEEEEEEELPLIPE 545


>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
           subunit.  The alpha-subunit of the TRAP complex (TRAP
           alpha) is a single-spanning membrane protein of the
           endoplasmic reticulum (ER) which is found in proximity
           of nascent polypeptide chains translocating across the
           membrane.
          Length = 281

 Score = 28.2 bits (63), Expect = 2.1
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 122 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
             +EE E++  +E+EE+    ++ E    E +E E GE
Sbjct: 35  TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72



 Score = 27.0 bits (60), Expect = 5.2
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 119 FLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
             +   E++  +++EE+E   EE +   T E E +EGE
Sbjct: 35  TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72


>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
           [Intracellular trafficking and secretion].
          Length = 776

 Score = 28.4 bits (63), Expect = 2.1
 Identities = 15/57 (26%), Positives = 23/57 (40%)

Query: 84  KRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAE 140
           ++     A   +  L  +  PL      I    +  +E  N EE E +EE E+ E E
Sbjct: 339 EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
           family includes archaebacterial L12, eukaryotic P0, P1
           and P2.
          Length = 88

 Score = 26.8 bits (60), Expect = 2.2
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 121 EPPNEEEEEEDEEEEEEEA 139
               EEE++E+EEEEEE+ 
Sbjct: 63  AAAAEEEKKEEEEEEEEDD 81



 Score = 26.4 bits (59), Expect = 3.9
 Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 125 EEEEEEDEEEEEEEAEE 141
             EEE+ EEEEEEE ++
Sbjct: 65  AAEEEKKEEEEEEEEDD 81



 Score = 26.0 bits (58), Expect = 4.4
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query: 122 PPNEEEEEEDEEEEEEEAEE 141
                   E+E++EEEE EE
Sbjct: 59  AAAAAAAAEEEKKEEEEEEE 78



 Score = 25.7 bits (57), Expect = 6.6
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 121 EPPNEEEEEEDEEEEEEE 138
                EEE+++EEEEEEE
Sbjct: 62  AAAAAEEEKKEEEEEEEE 79


>gnl|CDD|233055 TIGR00617, rpa1, replication factor-a protein 1 (rpa1).  All
           proteins in this family for which functions are known
           are part of a multiprotein complex made up of homologs
           of RPA1, RPA2 and RPA3 that bind ssDNA and function in
           the recognition of DNA damage for nucleotide excision
           repairThis family is based on the phylogenomic analysis
           of JA Eisen (1999, Ph.D. Thesis, Stanford University)
           [DNA metabolism, DNA replication, recombination, and
           repair].
          Length = 608

 Score = 28.2 bits (63), Expect = 2.4
 Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 9/65 (13%)

Query: 2   DESGEQFVAYFLPTAETLDKRKLDQLSNLEYTDEQEYEYKMAR----EYNWNVKSKSCKG 57
           DE+G+ +V  F   AE +  +   +L  L+  D  E+E          Y + ++ K    
Sbjct: 518 DETGQLWVTAFNDQAEQILGKSAAELGELKEEDPDEFEAIFQEAQFVPYIFRLRVK---- 573

Query: 58  YEENY 62
            ++ Y
Sbjct: 574 -QDTY 577


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 28.2 bits (63), Expect = 2.4
 Identities = 16/57 (28%), Positives = 21/57 (36%), Gaps = 3/57 (5%)

Query: 107 ATEFRIQRLRERFLEPPNEEEEEED---EEEEEEEAEETQQGETTEAERQEGEGGER 160
             E  +QR  E  LE    E  EED    E      +E +  +  E  + E E   R
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449



 Score = 27.8 bits (62), Expect = 3.4
 Identities = 14/51 (27%), Positives = 25/51 (49%)

Query: 106 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
           ++ E+  +    +  E  ++ EEEE  + E EE  E +Q E  E + +  E
Sbjct: 372 DSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRE 422



 Score = 26.3 bits (58), Expect = 10.0
 Identities = 14/49 (28%), Positives = 15/49 (30%)

Query: 122 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
               EEEE  + E EE  E  Q     E  R        E   E E   
Sbjct: 389 ISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDEL 437


>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
          Length = 1560

 Score = 28.2 bits (63), Expect = 2.6
 Identities = 7/43 (16%), Positives = 16/43 (37%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 166
             ++ +E + + E    +   G   + E    +      GG+G
Sbjct: 883 ENKDSKEGDADAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDG 925



 Score = 27.1 bits (60), Expect = 6.6
 Identities = 5/35 (14%), Positives = 11/35 (31%)

Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
           + E  + +     A + +     +    EG  G  
Sbjct: 893 DAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDGAA 927


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 27.8 bits (62), Expect = 3.0
 Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 6/59 (10%)

Query: 118 RFLEPPNEEEEEEDEEEEEEEAEETQQGET------TEAERQEGEGGEREGGGEGEISE 170
           R+ E P  E+EEE+E+E++++     + E       TE      E G      + E  E
Sbjct: 130 RYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDE 188


>gnl|CDD|215518 PLN02959, PLN02959, aminoacyl-tRNA ligase.
          Length = 1084

 Score = 28.1 bits (63), Expect = 3.0
 Identities = 12/46 (26%), Positives = 22/46 (47%)

Query: 101 THRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGE 146
           T  P+ A+  ++ R  +++  PP   EE+EDE      A+   +  
Sbjct: 94  TGMPIKASADKLAREIQQYGNPPVFPEEDEDEAAAVAAAKAEAEAA 139


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 27.7 bits (61), Expect = 3.1
 Identities = 14/35 (40%), Positives = 23/35 (65%)

Query: 112 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGE 146
           + +L+E   E   EE++ ++E+EEEE  EE +Q E
Sbjct: 203 LSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 28.0 bits (63), Expect = 3.2
 Identities = 13/42 (30%), Positives = 27/42 (64%)

Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
           ++  +EEE+EEDE+ ++E+ EE ++ E  + +++  E    E
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390



 Score = 26.5 bits (59), Expect = 9.9
 Identities = 16/59 (27%), Positives = 31/59 (52%), Gaps = 1/59 (1%)

Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
           + F     EE+EEE+E+  ++E EE    +  E E ++ +  + E   E E S+  +++
Sbjct: 312 DNFGLGQGEEDEEEEEDGVDDEDEE-DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDE 369


>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
          Length = 258

 Score = 27.4 bits (62), Expect = 3.7
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQE 154
             E  E+EEEE EE EE ++ E  E    E
Sbjct: 229 LAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258



 Score = 26.7 bits (60), Expect = 7.4
 Identities = 18/34 (52%), Positives = 20/34 (58%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTE 149
           R+  L    EEEEEE EEEEEEE EE  +    E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 26.7 bits (59), Expect = 3.9
 Identities = 12/21 (57%), Positives = 16/21 (76%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
           E   E+EEE+ +EEE+EE EE
Sbjct: 76  EKAEEKEEEKKKEEEKEEEEE 96



 Score = 25.5 bits (56), Expect = 8.3
 Identities = 12/25 (48%), Positives = 17/25 (68%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQG 145
           +   +EEE++ EEE+EEE EE   G
Sbjct: 77  KAEEKEEEKKKEEEKEEEEEEALAG 101


>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
          Length = 336

 Score = 27.6 bits (62), Expect = 4.0
 Identities = 17/57 (29%), Positives = 31/57 (54%)

Query: 114 RLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
           +L+E+  +   + EE+ +EEEEEE  E  ++ E  + E ++ + G  +     EI E
Sbjct: 4   KLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKE 60


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
           represents the eukaryotic large ribosomal protein P1.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P1 is located in the L12 stalk, with proteins
           P2, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers) and bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plant species have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site located in the C-terminal
           region of P2. TCS inactivates the ribosome by
           depurinating a specific adenine in the sarcin-ricin loop
           of 28S rRNA.
          Length = 103

 Score = 26.5 bits (59), Expect = 4.1
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
               E ++EE +EEEEEE+++
Sbjct: 76  AAAAEAKKEEKKEEEEEESDD 96


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 27.0 bits (60), Expect = 4.1
 Identities = 15/48 (31%), Positives = 18/48 (37%)

Query: 107 ATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
             E R + + E+          EE    EEEEAEE    E    E  E
Sbjct: 139 VNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEESAE 186


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 27.2 bits (60), Expect = 4.1
 Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 10/57 (17%)

Query: 122 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE----------GGGEGEI 168
           PP  +++E+ ++E E+ A+  ++ +     + EGEG E E           GGEGE+
Sbjct: 89  PPGGKKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPEDPPPEDTPPPPGGEGEV 145


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
           represents the L12P protein of the large (50S) subunit
           of the archaeal ribosome.
          Length = 105

 Score = 26.5 bits (59), Expect = 4.2
 Identities = 14/26 (53%), Positives = 16/26 (61%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGE 146
               EEEEEE+EEEEEEE  E +   
Sbjct: 73  AAAEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 25.4 bits (56), Expect = 9.2
 Identities = 14/21 (66%), Positives = 15/21 (71%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
               EEEEEE+EEEEEEE E 
Sbjct: 72  AAAAEEEEEEEEEEEEEEEES 92


>gnl|CDD|202496 pfam02981, FokI_N, Restriction endonuclease FokI, recognition
           domain. 
          Length = 145

 Score = 26.8 bits (59), Expect = 4.3
 Identities = 11/42 (26%), Positives = 18/42 (42%)

Query: 76  LETRVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRE 117
           +   V     R ++ +  N T L VT++ L  T   I+   E
Sbjct: 46  IPELVTELNLRYELKSVINQTPLSVTYKQLVGTGTSIRSKAE 87


>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381).  This
           domain is functionally uncharacterized. This domain is
           found in eukaryotes. This presumed domain is typically
           between 156 to 174 amino acids in length. This domain is
           found associated with pfam07780, pfam01728.
          Length = 154

 Score = 26.9 bits (60), Expect = 4.3
 Identities = 13/37 (35%), Positives = 21/37 (56%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
            E EE DEEE+ +E  E +  +    +R+E E  ++E
Sbjct: 109 VEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145



 Score = 26.5 bits (59), Expect = 6.2
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
           E   EEE E +E +EEE+ +E  + E  + +R++    ER
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENER 141



 Score = 26.5 bits (59), Expect = 6.5
 Identities = 17/45 (37%), Positives = 24/45 (53%)

Query: 110 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
           +R +  +   L+   +EEEEE+E E EE  EE Q  E  E E  +
Sbjct: 86  WRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAK 130


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 27.3 bits (61), Expect = 4.9
 Identities = 6/21 (28%), Positives = 8/21 (38%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
                       EEEEEE ++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
           [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 26.2 bits (58), Expect = 4.9
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
           E   E +E E+EE+EEE  EE
Sbjct: 78  EAAAEADEAEEEEKEEEAEEE 98



 Score = 25.8 bits (57), Expect = 8.2
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
                E +E +EEE+EEEAEE
Sbjct: 77  AEAAAEADEAEEEEKEEEAEE 97


>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
           synthase complex, beta subunit.  Nomenclature follows
           the description for Methanosarcina thermophila. The
           CO-methylating acetyl-CoA synthase is considered the
           defining enzyme of the Wood-Ljungdahl pathway, used for
           acetate catabolism by sulfate reducing bacteria but for
           acetate biosynthesis by acetogenic bacteria such as
           oorella thermoacetica (f. Clostridium thermoaceticum)
           [Energy metabolism, Chemoautotrophy].
          Length = 458

 Score = 27.1 bits (60), Expect = 5.2
 Identities = 12/25 (48%), Positives = 14/25 (56%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTE 149
           E +EEE EEEEE    E  + E  E
Sbjct: 402 EVDEEEIEEEEEAMQPEEMEMEGFE 426


>gnl|CDD|182341 PRK10260, PRK10260, L,D-transpeptidase; Provisional.
          Length = 306

 Score = 26.9 bits (59), Expect = 5.3
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query: 78  TRVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQ 113
           TRV+      K    P+ +R +  H PL+ TE + +
Sbjct: 229 TRVQFIDEPVKATTEPDGSRYIEVHNPLSTTEAQFE 264


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 27.2 bits (61), Expect = 5.5
 Identities = 16/43 (37%), Positives = 26/43 (60%)

Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
           + F     EEEEEE++E++EEE EE ++    E E +E +  +
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475



 Score = 26.8 bits (60), Expect = 7.6
 Identities = 15/55 (27%), Positives = 26/55 (47%)

Query: 113 QRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
           ++  E   E   +    + +EEEEEE +E ++ E  E E +  E  E E   + +
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475


>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
          Length = 1634

 Score = 27.2 bits (60), Expect = 5.6
 Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 4/52 (7%)

Query: 107  ATEFRIQR----LRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
            AT   I +    L ++  + P++E+++ED +  E+E ++ QQ     AE Q 
Sbjct: 1544 ATADEIDQAAHMLMDQVGQVPDDEDDDEDGDVLEKEVDQLQQSLERLAEVQR 1595


>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
           protein.  Members of this protein family are designated
           TraM and are found in a proposed transfer region of a
           class of conjugative transposon found in the Bacteroides
           lineage [Cellular processes, DNA transformation].
          Length = 410

 Score = 27.0 bits (60), Expect = 5.9
 Identities = 18/59 (30%), Positives = 22/59 (37%), Gaps = 1/59 (1%)

Query: 103 RPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
           R   A    I R    F E P  +EE+E    E EE E     E + A   E +    E
Sbjct: 120 RSSAAAYRDINRELGSFYEYPKTDEEKE-LLREVEELESRLATEPSPAPELEEQLALME 177


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 26.8 bits (60), Expect = 6.0
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 153
           E   E+++++DE+E+E++ EE   G   E  R+
Sbjct: 50  EEDLEDDDDDDEDEDEDDEEEADLGPDPEEARE 82



 Score = 26.0 bits (58), Expect = 8.9
 Identities = 9/40 (22%), Positives = 13/40 (32%)

Query: 123 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
           PN           E E +E    +  + +  E E  E E 
Sbjct: 32  PNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71


>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
          Length = 382

 Score = 27.1 bits (61), Expect = 6.0
 Identities = 12/31 (38%), Positives = 13/31 (41%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
            EEEEEE+E EEEE          T      
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPA 255


>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
          Length = 746

 Score = 26.8 bits (59), Expect = 6.6
 Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 12/61 (19%)

Query: 95  NTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
           NT  +V  RP  +           FL P    + E DEE+ E++ +  +   T     Q+
Sbjct: 584 NTAAIVFARPYQS-----------FLPPYGLADVELDEEDTEDD-DAVELPSTPSMGTQD 631

Query: 155 G 155
           G
Sbjct: 632 G 632


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 26.5 bits (59), Expect = 6.8
 Identities = 11/35 (31%), Positives = 23/35 (65%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
           E+ EEE+EE++  E+E+  + +  E E ++ + G+
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318



 Score = 26.5 bits (59), Expect = 7.9
 Identities = 16/41 (39%), Positives = 22/41 (53%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
            E  LE   EEEEE+D  E E+E EE +  E  E + +  +
Sbjct: 279 LEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 26.9 bits (60), Expect = 7.3
 Identities = 12/44 (27%), Positives = 18/44 (40%)

Query: 122 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 165
            P      E+EE  E++A      +  +AE    E  E E G +
Sbjct: 773 APPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEEELGAK 816


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 26.2 bits (58), Expect = 7.5
 Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 2/40 (5%)

Query: 112 IQRLRERFLEPP--NEEEEEEDEEEEEEEAEETQQGETTE 149
           + R R +F  PP  +E+  + D+++E++E +E    E  +
Sbjct: 34  LSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDD 73


>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
          Length = 158

 Score = 26.3 bits (58), Expect = 7.7
 Identities = 10/33 (30%), Positives = 20/33 (60%)

Query: 119 FLEPPNEEEEEEDEEEEEEEAEETQQGETTEAE 151
           F E   +E++E  +E EE+EA + ++ +  + E
Sbjct: 91  FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKE 123


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 26.6 bits (59), Expect = 7.7
 Identities = 15/51 (29%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 111 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
            +++L  +      E++E+E EE E+   E  Q+GE   A  Q  E G + 
Sbjct: 294 ELKKLENKL-----EKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKS 339


>gnl|CDD|225962 COG3428, COG3428, Predicted membrane protein [Function unknown].
          Length = 494

 Score = 26.6 bits (59), Expect = 7.8
 Identities = 11/47 (23%), Positives = 17/47 (36%), Gaps = 1/47 (2%)

Query: 113 QRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
             +RE  ++  +  E  E +  EEE   E      +   R     GE
Sbjct: 140 ALVREARVKKLDALELAEADTPEEEV-AEVLARSQSSVLRYPMNKGE 185


>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2.  This model
           describes a tightly conserved subfamily of the larger
           family of sugar (and other) transporters described by
           PFAM model pfam00083. Members of this subfamily include
           closely related forms SV2A and SV2B of synaptic vesicle
           protein from vertebrates and a more distantly related
           homolog (below trusted cutoff) from Drosophila
           melanogaster. Members are predicted to have two sets of
           six transmembrane helices.
          Length = 742

 Score = 26.9 bits (59), Expect = 7.9
 Identities = 23/70 (32%), Positives = 31/70 (44%), Gaps = 14/70 (20%)

Query: 113 QRLRERFLE--------PPNE---EEEEEDEEEEEEEAEETQQGETTEAERQEGE--GGE 159
           QR   RF +         P +     E  D+EEE E + +  +G   + E  EGE  G  
Sbjct: 43  QRSYSRFEDEDDDDDFPAPADGYSRGEAADDEEEGEASSDATEGHDEDDEIYEGEYQGIP 102

Query: 160 R-EGGGEGEI 168
           R E GG+ EI
Sbjct: 103 RAENGGKDEI 112


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 26.7 bits (59), Expect = 8.0
 Identities = 15/42 (35%), Positives = 21/42 (50%)

Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 165
           ++E   E    E+ E EE ++GE  E E  E E  E E  G+
Sbjct: 66  DDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGD 107


>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor. 
          Length = 456

 Score = 26.5 bits (58), Expect = 8.8
 Identities = 10/38 (26%), Positives = 19/38 (50%)

Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
           +EE +E  ++++  ++  +     EA R  GEG  R  
Sbjct: 56  QEEPDEKTQDQQSLSDVERAEPEVEASRGRGEGSSRPP 93


>gnl|CDD|182388 PRK10337, PRK10337, sensor protein QseC; Provisional.
          Length = 449

 Score = 26.5 bits (59), Expect = 9.1
 Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 6/64 (9%)

Query: 64  LIVRDDAVYYNELETRVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPP 123
           L+VR      N L+  +R S +   V    N     V       T   + R+ ERF  PP
Sbjct: 355 LLVR------NLLDNAIRYSPQGSVVDVTLNARNFTVRDNGPGVTPEALARIGERFYRPP 408

Query: 124 NEEE 127
            +E 
Sbjct: 409 GQEA 412


>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
          Length = 613

 Score = 26.4 bits (59), Expect = 9.3
 Identities = 12/58 (20%), Positives = 18/58 (31%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSN 173
           + R  E  +       EEE  +  EE     +   E     G     G   ++   SN
Sbjct: 108 KRRLFEEQDSGLGNSLEEESTDVVEEEGVESSGGGEGGSETGQGGGNGLVLDLLRSSN 165


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 26.3 bits (58), Expect = 9.9
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEE 141
           + + LE  + +EE+E +EEEEEE EE
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEE 184


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.305    0.128    0.349 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,113,579
Number of extensions: 870070
Number of successful extensions: 6548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4330
Number of HSP's successfully gapped: 894
Length of query: 175
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 85
Effective length of database: 6,945,742
Effective search space: 590388070
Effective search space used: 590388070
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 56 (25.2 bits)