RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy318
         (438 letters)



>gnl|CDD|224402 COG1485, COG1485, Predicted ATPase [General function prediction
           only].
          Length = 367

 Score =  337 bits (867), Expect = e-113
 Identities = 131/387 (33%), Positives = 200/387 (51%), Gaps = 57/387 (14%)

Query: 11  PSKQYYEKLSNNEIQEDKHQEQIVKQLDNVYVSIKNYAPQSKSMFSFFQDKVKQPKGLYI 70
           P ++Y + +     Q D  Q      LD +Y  +       K++   F       +GLY+
Sbjct: 11  PVERYAQLVPAGTFQPDPAQPAAAAALDRLYDELVAPRSARKALGWLFGRDHGPVRGLYL 70

Query: 71  YGAVGGGKTMLMDIFYESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVP 130
           +G VG GKTMLMD+FYES   ++K+R+HF++FM  VH ++H ++             G  
Sbjct: 71  WGGVGRGKTMLMDLFYESLPGERKRRLHFHRFMARVHQRLHTLQ-------------GQT 117

Query: 131 NRLDVMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDIISKTWLICFDEFQ 190
           +                                     P+PP+A+++ ++T ++CFDEF+
Sbjct: 118 D-------------------------------------PLPPIADELAAETRVLCFDEFE 140

Query: 191 VTDIADAMILKRLFTELFQLGVVVVATSNRAPDDLYKNGLQRSNFLPFIDVLKTYCDVAS 250
           VTDIADAMIL RL   LF  GVV+VATSN APD+LYK+GLQR  FLP ID++K++ +V +
Sbjct: 141 VTDIADAMILGRLLEALFARGVVLVATSNTAPDNLYKDGLQRERFLPAIDLIKSHFEVVN 200

Query: 251 LNSNIDYRSLKANAEESSTKTYFVK-NFANEKKLHGIFKLLCSQENDIVRPRVITIMGRN 309
           ++  +DYR  K          Y    +   E  L  ++  L     +      + I GR 
Sbjct: 201 VDGPVDYRLRKLEQAP----VYLTPLDAEAEAALDKLWAALSDGAPE--AAANLEIKGRE 254

Query: 310 VTFNKTCGQILEATFDELCSRDLGSSDYLHICQIFHTVIIRNVPQLNIKLRSQSRRFITL 369
           +      G +L   F +LC     +SDYL + + FHTV + +VPQ++   R ++RRFI L
Sbjct: 255 IRVPAVAGGLLWFDFAQLCEAPRSASDYLALAERFHTVFLTDVPQMDPLDRDEARRFIAL 314

Query: 370 IDALYDNNIRLVISSDVPLNKLFSNEA 396
           +D LYD  ++LV S++VPLN+L+    
Sbjct: 315 VDELYDRGVKLVASAEVPLNELYQGGR 341


>gnl|CDD|217821 pfam03969, AFG1_ATPase, AFG1-like ATPase.  This family of proteins
           contains a P-loop motif and are predicted to be ATPases.
          Length = 361

 Score =  274 bits (703), Expect = 4e-89
 Identities = 118/389 (30%), Positives = 182/389 (46%), Gaps = 61/389 (15%)

Query: 11  PSKQYYEKLSNNEIQEDKHQEQIVKQLDNVY--VSIKNYAPQSKSMFSFFQDKVKQ--PK 66
           P ++Y  +L    I  D  Q   V  LD +Y  +   ++  QS +    +  K      +
Sbjct: 3   PPQRYTAQLQRGAIFPDVAQANAVPALDRLYQRLQAADFVRQSGAGGKLWGRKRSHQPVR 62

Query: 67  GLYIYGAVGGGKTMLMDIFYESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYD 126
           GLY++G VG GKT LMD F+ES   ++K+R HF++FM  VH                   
Sbjct: 63  GLYLWGGVGRGKTHLMDSFFESLPGERKRRTHFHRFMFRVHD------------------ 104

Query: 127 PGVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDIISKTWLICF 186
                 L  ++  +                           DP+P  A+   ++  L+CF
Sbjct: 105 -----ELTTLQGGD---------------------------DPLPIAADRFANEARLLCF 132

Query: 187 DEFQVTDIADAMILKRLFTELFQLGVVVVATSNRAPDDLYKNGLQRSNFLPFIDVLKTYC 246
           DEF+V DI DAMIL RLF  LF  GV +VATSN AP+ LY+NGL R  FLP ID+L+++ 
Sbjct: 133 DEFEVDDIGDAMILGRLFEALFARGVSLVATSNTAPEQLYRNGLNRQRFLPAIDLLESHF 192

Query: 247 DVASLNSNIDYRSLKANAEESSTKTYFVKNFANEKKLHGIFKLLCSQENDIVRPRV-ITI 305
           +V  ++  +DYR              +  + A    L  ++  L   E     P   + +
Sbjct: 193 EVVRVDGPVDYRLRTLEQAPLW---LYPLDGAAWAALDRLWDALTLGEPQ---PLSTLEV 246

Query: 306 MGRNVTFNKTCGQILEATFDELCSRDLGSSDYLHICQIFHTVIIRNVPQLNIKLRSQSRR 365
             R        G ++  +F +LC      SDYL + + F TV + +VP +    + ++RR
Sbjct: 247 AARLARARAAGGDLVWFSFADLCLAPRHPSDYLALAERFSTVFLTDVPPMTRCSQDEARR 306

Query: 366 FITLIDALYDNNIRLVISSDVPLNKLFSN 394
           FI L+D LYD +++LV S++  L+ L+  
Sbjct: 307 FIALVDELYDRHVKLVASAEAALDDLYRG 335



 Score = 28.2 bits (63), Expect = 9.7
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 203 LFTELFQLGVVVVATSNRAPDDLYKNGLQRSNFL 236
           L  EL+   V +VA++  A DDLY+ G     F 
Sbjct: 310 LVDELYDRHVKLVASAEAALDDLYRGGRLAFEFQ 343


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 57.0 bits (137), Expect = 1e-08
 Identities = 21/30 (70%), Positives = 26/30 (86%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
           K  EE+ERE+E+E+ERERERERE ER AK+
Sbjct: 595 KAREEREREKEKEKEREREREREAERAAKA 624



 Score = 53.5 bits (128), Expect = 1e-07
 Identities = 21/30 (70%), Positives = 27/30 (90%), Gaps = 1/30 (3%)

Query: 137 KEEEEKER-EREREREREREREREREREAK 165
           +E E+K R ERERE+E+E+EREREREREA+
Sbjct: 590 REAEQKAREEREREKEKEKEREREREREAE 619



 Score = 51.6 bits (123), Expect = 6e-07
 Identities = 19/30 (63%), Positives = 25/30 (83%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
           +EE E+E+E+E+ERERERERE ER  +A S
Sbjct: 597 REEREREKEKEKEREREREREAERAAKASS 626



 Score = 47.0 bits (111), Expect = 2e-05
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDP 169
           + E EKE+E+ERERERERE ER  +  + S++ 
Sbjct: 599 EREREKEKEKEREREREREAERAAKASSSSHES 631



 Score = 38.5 bits (89), Expect = 0.006
 Identities = 13/26 (50%), Positives = 21/26 (80%)

Query: 138 EEEEKERERERERERERERERERERE 163
           E+ ++E E++   ERERE+E+E+ERE
Sbjct: 586 EKAKREAEQKAREEREREKEKEKERE 611



 Score = 35.0 bits (80), Expect = 0.082
 Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 6/54 (11%)

Query: 127 PGVPNRLDVMKEEEEKERERERERE-RERE-RERER---EREAKSYDPIPPVAN 175
           PG+ N    ++E E +ERE  RERE RERE RER +   E +    D + P AN
Sbjct: 706 PGLYNADPTIRERELREREM-REREIRERELRERMKPGFEVKPPELDTLHPSAN 758



 Score = 33.9 bits (77), Expect = 0.21
 Identities = 14/39 (35%), Positives = 25/39 (64%)

Query: 125 YDPGVPNRLDVMKEEEEKERERERERERERERERERERE 163
           + P   ++L   +EE  ++ +RE E++   ERERE+E+E
Sbjct: 569 FTPLASSKLAKKREEAVEKAKREAEQKAREEREREKEKE 607


>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase.  Grn1 (yeast) and
           GNL3L (human) are putative GTPases which are required
           for growth and play a role in processing of nucleolar
           pre-rRNA. This family contains a potential nuclear
           localisation signal.
          Length = 80

 Score = 41.1 bits (97), Expect = 5e-05
 Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 112 EVKKILARDKAKSYDPGVPNRL----DVMKEEEEKEREREREREREREREREREREAK 165
           E KK       K  DPG+PN      ++++E EEK+R++E E+ER +E  +    EA+
Sbjct: 20  EAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAERAEAR 77


>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812).  This is
           a family of fungal proteins whose function is not known.
          Length = 126

 Score = 36.9 bits (86), Expect = 0.004
 Identities = 12/35 (34%), Positives = 21/35 (60%)

Query: 131 NRLDVMKEEEEKEREREREREREREREREREREAK 165
            R +  +  +E+++  E E +R+ E  +ERERE K
Sbjct: 91  ERAEAQRARDEEKKLDEEEAKRQHEEAKEREREKK 125



 Score = 36.5 bits (85), Expect = 0.005
 Identities = 12/28 (42%), Positives = 20/28 (71%)

Query: 137 KEEEEKEREREREREREREREREREREA 164
           ++EE+K  E E +R+ E  +ERERE++ 
Sbjct: 99  RDEEKKLDEEEAKRQHEEAKEREREKKK 126


>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein.  This entry includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 106

 Score = 35.8 bits (83), Expect = 0.008
 Identities = 13/64 (20%), Positives = 23/64 (35%), Gaps = 10/64 (15%)

Query: 112 EVKKILARDKAKSYDPGV--------PNRLDVMKEEEEKER--ERERERERERERERERE 161
           E+  +  + + +                  D+ +EE    R    E   E   E+E ERE
Sbjct: 40  EIWAMFNKGRKRYAYRDDDDDDDDMEAGGADIQEEERRSARLARLEDREEERLEKEEERE 99

Query: 162 REAK 165
           + A+
Sbjct: 100 KRAR 103


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
           There are two completely conserved L residues that may
           be functionally important.
          Length = 70

 Score = 34.2 bits (79), Expect = 0.010
 Identities = 13/26 (50%), Positives = 17/26 (65%)

Query: 136 MKEEEEKERERERERERERERERERE 161
           +K EEEK  E E  R+RE  +ERE+ 
Sbjct: 29  LKAEEEKREEEEEARKREERKEREKN 54



 Score = 31.5 bits (72), Expect = 0.12
 Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 1/37 (2%)

Query: 132 RLDVMKEE-EEKEREREREREREREREREREREAKSY 167
           +L   K+E + +E +RE E E  +  ER+   + KS+
Sbjct: 21  KLKAKKKELKAEEEKREEEEEARKREERKEREKNKSF 57



 Score = 26.9 bits (60), Expect = 4.5
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDP 169
           +EEEE+ R+RE  +ERE+ +  E        D 
Sbjct: 36  REEEEEARKREERKEREKNKSFEELLNESELDW 68


>gnl|CDD|221970 pfam13191, AAA_16, AAA ATPase domain.  This family of domains
           contain a P-loop motif that is characteristic of the AAA
           superfamily.
          Length = 154

 Score = 35.3 bits (81), Expect = 0.020
 Identities = 23/154 (14%), Positives = 49/154 (31%), Gaps = 24/154 (15%)

Query: 64  QPKGLYIYGAVGGGKTMLMDIFYESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAK 123
            P  + + G  G GKT L+                  + +L    K  + ++      ++
Sbjct: 23  GPPSVLLTGPSGTGKTSLLR--------------ELLEGLLVAAGKCDQAERNPPYAFSQ 68

Query: 124 SYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDIISKT-- 181
           +    +   L         E   E    RE           +    +  +   ++++   
Sbjct: 69  ALRELLRQLLR--------ELAAELLLLREALLAALGAELIEGLQDLVELLERLLARARP 120

Query: 182 WLICFDEFQVTDIADAMILKRLFTELFQLGVVVV 215
            ++  D+ Q  D     +L  L   L +L ++VV
Sbjct: 121 LVLVLDDLQWADEESLDLLAALLRRLERLPLLVV 154


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 36.3 bits (84), Expect = 0.029
 Identities = 9/47 (19%), Positives = 27/47 (57%)

Query: 118 ARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREA 164
           A+   +        + + +++++  E+ER ++ E+ER   +E++++A
Sbjct: 74  AKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA 120



 Score = 30.9 bits (70), Expect = 1.4
 Identities = 10/28 (35%), Positives = 19/28 (67%)

Query: 138 EEEEKEREREREREREREREREREREAK 165
           EE ++++  E+ER ++ E+ER   +E K
Sbjct: 90  EELQQKQAAEQERLKQLEKERLAAQEQK 117



 Score = 30.5 bits (69), Expect = 1.5
 Identities = 9/28 (32%), Positives = 19/28 (67%)

Query: 138 EEEEKEREREREREREREREREREREAK 165
            E+E+ ++ E+ER   +E++++ E  AK
Sbjct: 98  AEQERLKQLEKERLAAQEQKKQAEEAAK 125



 Score = 30.5 bits (69), Expect = 1.8
 Identities = 9/27 (33%), Positives = 19/27 (70%)

Query: 137 KEEEEKERERERERERERERERERERE 163
           K+  E+ER ++ E+ER   +E++++ E
Sbjct: 95  KQAAEQERLKQLEKERLAAQEQKKQAE 121


>gnl|CDD|218693 pfam05687, DUF822, Plant protein of unknown function (DUF822).
           This family consists of the N terminal regions of
           several plant proteins of unknown function.
          Length = 151

 Score = 34.8 bits (80), Expect = 0.032
 Identities = 8/21 (38%), Positives = 11/21 (52%)

Query: 144 REREREREREREREREREREA 164
             +   +ERE  + RER R A
Sbjct: 4   GRKPTWKERENNKRRERRRRA 24



 Score = 32.1 bits (73), Expect = 0.28
 Identities = 7/18 (38%), Positives = 10/18 (55%)

Query: 143 ERERERERERERERERER 160
            +   +ERE  + RER R
Sbjct: 5   RKPTWKERENNKRRERRR 22



 Score = 32.1 bits (73), Expect = 0.29
 Identities = 12/27 (44%), Positives = 14/27 (51%), Gaps = 1/27 (3%)

Query: 141 EKEREREREREREREREREREREAKSY 167
            K   +ERE  + RER R R   AK Y
Sbjct: 5   RKPTWKERENNKRRERRR-RAIAAKIY 30



 Score = 28.2 bits (63), Expect = 5.3
 Identities = 8/18 (44%), Positives = 11/18 (61%)

Query: 137 KEEEEKERERERERERER 154
           ++   KERE  + RER R
Sbjct: 5   RKPTWKERENNKRRERRR 22


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 35.1 bits (81), Expect = 0.037
 Identities = 13/34 (38%), Positives = 24/34 (70%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDPI 170
           +EE E+ERE+++E E  +ERE +  +E + Y+ +
Sbjct: 45  EEELEEEREKKKEEEERKEREEQARKEQEEYEKL 78



 Score = 31.6 bits (72), Expect = 0.58
 Identities = 14/35 (40%), Positives = 24/35 (68%), Gaps = 1/35 (2%)

Query: 132 RLDVMKEEE-EKEREREREREREREREREREREAK 165
           +L+  +E E ++E E E ERE+++E E  +ERE +
Sbjct: 33  KLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67



 Score = 30.4 bits (69), Expect = 1.4
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 136 MKEEEEKEREREREREREREREREREREAKS 166
            + EE K+ E +RE ER+ E E E ERE K 
Sbjct: 26  EEREERKKLEEKREGERKEEEELEEEREKKK 56



 Score = 30.1 bits (68), Expect = 1.6
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
           K EE++E ER+ E E E ERE+++E E + 
Sbjct: 33  KLEEKREGERKEEEELEEEREKKKEEEERK 62



 Score = 28.9 bits (65), Expect = 4.1
 Identities = 12/29 (41%), Positives = 21/29 (72%)

Query: 136 MKEEEEKEREREREREREREREREREREA 164
            +++ E++RE ER+ E E E ERE+++E 
Sbjct: 30  ERKKLEEKREGERKEEEELEEEREKKKEE 58


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 35.7 bits (83), Expect = 0.037
 Identities = 25/80 (31%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 90  ETKQKQRVH---FNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERER 146
           E  Q++R       + ML+  A+  E+++  A  +       + +R ++ ++ EEKE  R
Sbjct: 248 ERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKR---LEHRRELEQQIEEKEERR 304

Query: 147 ERERERER-EREREREREAK 165
             ERE E  E ER RE EA+
Sbjct: 305 AAEREEELEEGERLREEEAE 324



 Score = 33.3 bits (77), Expect = 0.21
 Identities = 16/31 (51%), Positives = 22/31 (70%), Gaps = 1/31 (3%)

Query: 133 LDVMKEEEEKERERERER-ERERERERERER 162
           L+  +E+ E+E ERE ER ER+ E+ERE  R
Sbjct: 156 LEYQREKAEREEEREAERRERKEEKEREVAR 186



 Score = 31.8 bits (73), Expect = 0.64
 Identities = 14/37 (37%), Positives = 17/37 (45%), Gaps = 2/37 (5%)

Query: 131 NRLDVMKEEEEKERERERERERERERER--EREREAK 165
                  E EE+  E ER RE E ER+   E ER+  
Sbjct: 300 KEERRAAEREEELEEGERLREEEAERQARIEEERQRL 336



 Score = 31.8 bits (73), Expect = 0.65
 Identities = 19/39 (48%), Positives = 22/39 (56%), Gaps = 10/39 (25%)

Query: 136 MKEEEEK---------EREREREREREREREREREREAK 165
            +EEE K         ERE ERE ER RER+ E+ERE  
Sbjct: 148 EREEELKILEYQREKAEREEEREAER-RERKEEKEREVA 185



 Score = 31.8 bits (73), Expect = 0.72
 Identities = 11/35 (31%), Positives = 17/35 (48%)

Query: 131 NRLDVMKEEEEKEREREREREREREREREREREAK 165
            + +  +EE   +   E ER +    E ERER+ K
Sbjct: 32  IKAEEKEEERRIDEMMEEERLKALAEEEERERKRK 66



 Score = 30.6 bits (70), Expect = 1.4
 Identities = 12/30 (40%), Positives = 19/30 (63%)

Query: 136 MKEEEEKEREREREREREREREREREREAK 165
            +EEE +   RER+ E+ERE  R R ++ +
Sbjct: 164 EREEEREAERRERKEEKEREVARLRAQQEE 193



 Score = 30.6 bits (70), Expect = 1.7
 Identities = 9/34 (26%), Positives = 14/34 (41%), Gaps = 5/34 (14%)

Query: 136 MKEEEEKERERERERER-----EREREREREREA 164
             E  E++ E+ERE  R     E   +   E + 
Sbjct: 170 EAERRERKEEKEREVARLRAQQEEAEDEREELDE 203



 Score = 29.9 bits (68), Expect = 2.6
 Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 17/50 (34%)

Query: 132 RLDVMKEEEEKERER----------------ERERERERERE-REREREA 164
           RL  + EEEE+ER+R                ERE+ R+ E E R +ERE 
Sbjct: 51  RLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQ 100



 Score = 29.1 bits (66), Expect = 4.3
 Identities = 7/29 (24%), Positives = 14/29 (48%)

Query: 132 RLDVMKEEEEKERERERERERERERERER 160
           + ++ +  EE+  E+E   + ER  E   
Sbjct: 232 KQELQRAREEQIEEKEERLQEERAEEEAE 260



 Score = 28.7 bits (65), Expect = 6.0
 Identities = 8/24 (33%), Positives = 18/24 (75%)

Query: 138 EEEEKERERERERERERERERERE 161
           +EE + +ER++E+E   +R R+++
Sbjct: 210 QEEYERKERQKEKEEAEKRRRQKQ 233


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 35.7 bits (82), Expect = 0.048
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYD 168
           +EEEE+E E E E E E E E E   E  S +
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897



 Score = 35.4 bits (81), Expect = 0.061
 Identities = 16/30 (53%), Positives = 18/30 (60%)

Query: 134 DVMKEEEEKERERERERERERERERERERE 163
           D  +EEEE+E E E E E E E E E E E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891



 Score = 34.6 bits (79), Expect = 0.11
 Identities = 14/33 (42%), Positives = 20/33 (60%)

Query: 138 EEEEKEREREREREREREREREREREAKSYDPI 170
           + EE+E E E E E E E E E E E ++ +P+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
           represents a subfamily of RNA splicing factors including
           the Pad-1 protein (N. crassa), CAPER (M. musculus) and
           CC1.3 (H.sapiens). These proteins are characterized by
           an N-terminal arginine-rich, low complexity domain
           followed by three (or in the case of 4 H. sapiens
           paralogs, two) RNA recognition domains (rrm: pfam00706).
           These splicing factors are closely related to the U2AF
           splicing factor family (TIGR01642). A homologous gene
           from Plasmodium falciparum was identified in the course
           of the analysis of that genome at TIGR and was included
           in the seed.
          Length = 457

 Score = 35.6 bits (82), Expect = 0.052
 Identities = 10/36 (27%), Positives = 14/36 (38%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPP 172
                  R  ++ RER R R R R+R  +  D    
Sbjct: 8   GRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYY 43



 Score = 31.4 bits (71), Expect = 0.96
 Identities = 9/28 (32%), Positives = 12/28 (42%)

Query: 139 EEEKEREREREREREREREREREREAKS 166
           E  + R   R  ++ RER R R R    
Sbjct: 6   ERGRLRNDTRRSDKGRERSRRRSRSRDR 33



 Score = 30.2 bits (68), Expect = 2.3
 Identities = 11/31 (35%), Positives = 13/31 (41%)

Query: 139 EEEKEREREREREREREREREREREAKSYDP 169
             +K RER R R R R+R R R         
Sbjct: 16  RSDKGRERSRRRSRSRDRSRRRRDRDYYRGR 46



 Score = 29.9 bits (67), Expect = 3.1
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 142 KEREREREREREREREREREREAK 165
           + R+RER R R   R  ++ RE  
Sbjct: 1   RYRDRERGRLRNDTRRSDKGRERS 24



 Score = 28.7 bits (64), Expect = 6.9
 Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 1/32 (3%)

Query: 132 RLDVMKEEEEKERERERERERERERERERERE 163
           R D  + ++ +ER R R R R+R R R R+R+
Sbjct: 11  RNDTRRSDKGRERSRRRSRSRDRSR-RRRDRD 41



 Score = 28.7 bits (64), Expect = 7.2
 Identities = 8/21 (38%), Positives = 10/21 (47%)

Query: 146 REREREREREREREREREAKS 166
           R R+RER R R   R  +   
Sbjct: 1   RYRDRERGRLRNDTRRSDKGR 21



 Score = 28.3 bits (63), Expect = 9.6
 Identities = 12/49 (24%), Positives = 18/49 (36%), Gaps = 1/49 (2%)

Query: 119 RDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKSY 167
           R+  +  D G        +  +   R R+R+  R R   R R R    Y
Sbjct: 11  RNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRG-RSRSRSPNRY 58


>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777).  This is
           a family of eukaryotic proteins of unknown function.
           Some of the proteins in this family are putative nucleic
           acid binding proteins.
          Length = 158

 Score = 34.1 bits (78), Expect = 0.061
 Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 3/56 (5%)

Query: 117 LARDKAKSYDPGVPNRLDVMKEEEEKERERERERER---EREREREREREAKSYDP 169
           + R +++S            ++  E+ RER R RER    R R R   R  +S  P
Sbjct: 1   MGRSRSRSPRRSRRRGRSRSRDRRERRRERSRSRERDRRRRSRSRSPHRSRRSRSP 56



 Score = 29.5 bits (66), Expect = 2.4
 Identities = 12/42 (28%), Positives = 19/42 (45%)

Query: 139 EEEKEREREREREREREREREREREAKSYDPIPPVANDIISK 180
                R R R R   R R+R+RER+  + +P       +I +
Sbjct: 53  SRSPRRHRSRSRSPSRRRDRKRERDKDAREPKKRERQKLIKE 94


>gnl|CDD|236485 PRK09368, PRK09368, gas vesicle synthesis-like protein; Reviewed.
          Length = 140

 Score = 33.5 bits (77), Expect = 0.067
 Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 110 IHEVKKILARDKAKSYDPGVPNRL-DVMKE--EEEKERERERERERERERERERERE 163
           + EV +  AR K+K    G      + + +    ++E ER RER R R+  R R RE
Sbjct: 84  VGEVTESGARGKSKGALTGAAETASEALGQGRGSDEEEERRRERPRPRKAPRRRRRE 140


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 35.3 bits (81), Expect = 0.069
 Identities = 13/26 (50%), Positives = 18/26 (69%)

Query: 138 EEEEKERERERERERERERERERERE 163
           EE ++ERE+ R R+R+R  ER R R 
Sbjct: 3   EEPDREREKSRGRDRDRSSERPRRRS 28



 Score = 34.5 bits (79), Expect = 0.10
 Identities = 13/39 (33%), Positives = 21/39 (53%)

Query: 130 PNRLDVMKEEEEKEREREREREREREREREREREAKSYD 168
           P+R        +++R  ER R R R+R R R+R  +S +
Sbjct: 5   PDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRE 43



 Score = 33.7 bits (77), Expect = 0.18
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 139 EEEKERERERERERERERERERERE 163
           +EE +RERE+ R R+R+R  ER R 
Sbjct: 2   DEEPDREREKSRGRDRDRSSERPRR 26



 Score = 32.9 bits (75), Expect = 0.35
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 139 EEEKEREREREREREREREREREREAKSY 167
           +   ER R R R+R R R+R R    +SY
Sbjct: 18  DRSSERPRRRSRDRSRFRDRHRRSRERSY 46



 Score = 31.0 bits (70), Expect = 1.2
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 141 EKEREREREREREREREREREREAK 165
           ++E +RERE+ R R+R+R  ER  +
Sbjct: 2   DEEPDREREKSRGRDRDRSSERPRR 26



 Score = 29.9 bits (67), Expect = 3.2
 Identities = 10/22 (45%), Positives = 16/22 (72%)

Query: 142 KERERERERERERERERERERE 163
           ++ E +RERE+ R R+R+R  E
Sbjct: 1   RDEEPDREREKSRGRDRDRSSE 22



 Score = 29.1 bits (65), Expect = 5.1
 Identities = 13/39 (33%), Positives = 18/39 (46%), Gaps = 10/39 (25%)

Query: 140 EEKERERERERERERERER----------EREREAKSYD 168
            E+ R R R+R R R+R R           R R+ + YD
Sbjct: 21  SERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYD 59


>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
           co-repressor CIR.  This is a 45 residue conserved region
           at the N-terminal end of a family of proteins referred
           to as CIRs (CBF1-interacting co-repressors). CBF1
           (centromere-binding factor 1) acts as a transcription
           factor that causes repression by binding specifically to
           GTGGGAA motifs in responsive promoters, and it requires
           CIR as a co-repressor. CIR binds to histone deacetylase
           and to SAP30 and serves as a linker between CBF1 and the
           histone deacetylase complex.
          Length = 37

 Score = 31.0 bits (71), Expect = 0.074
 Identities = 8/29 (27%), Positives = 16/29 (55%)

Query: 131 NRLDVMKEEEEKERERERERERERERERE 159
           N+  V + E++   E+++  E  +E E E
Sbjct: 9   NQEKVWEAEQKALEEQKKIEELRKEIEEE 37



 Score = 26.8 bits (60), Expect = 2.1
 Identities = 7/23 (30%), Positives = 13/23 (56%)

Query: 139 EEEKERERERERERERERERERE 161
           E E++   E+++  E  +E E E
Sbjct: 15  EAEQKALEEQKKIEELRKEIEEE 37


>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
           co-repressor CIR.  This is a 45 residue conserved region
           at the N-terminal end of a family of proteins referred
           to as CIRs (CBF1-interacting co-repressors). CBF1
           (centromere-binding factor 1) acts as a transcription
           factor that causes repression by binding specifically to
           GTGGGAA motifs in responsive promoters, and it requires
           CIR as a co-repressor. CIR binds to histone deacetylase
           and to SAP30 and serves as a linker between CBF1 and the
           histone deacetylase complex.
          Length = 37

 Score = 31.0 bits (71), Expect = 0.076
 Identities = 11/29 (37%), Positives = 19/29 (65%)

Query: 131 NRLDVMKEEEEKERERERERERERERERE 159
           N+  V K E++ E E+++  ER +E E+E
Sbjct: 9   NQKRVWKAEQKAEEEKKKIEERRKEIEKE 37



 Score = 27.1 bits (61), Expect = 2.0
 Identities = 8/24 (33%), Positives = 16/24 (66%)

Query: 138 EEEEKERERERERERERERERERE 161
            + E++ E E+++  ER +E E+E
Sbjct: 14  WKAEQKAEEEKKKIEERRKEIEKE 37



 Score = 26.3 bits (59), Expect = 3.1
 Identities = 8/24 (33%), Positives = 16/24 (66%)

Query: 140 EEKERERERERERERERERERERE 163
            + E++ E E+++  ER +E E+E
Sbjct: 14  WKAEQKAEEEKKKIEERRKEIEKE 37


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 34.9 bits (79), Expect = 0.081
 Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 7/82 (8%)

Query: 86  YESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERE 145
           Y     K K  V  NK  L  H  IH+VK     +K K            +K+E+ + + 
Sbjct: 120 YRELYVKAKGNVFRNKRNLMEH--IHKVKN----EKKKERQLAEQLAAKRLKDEQHRHKA 173

Query: 146 RERE-REREREREREREREAKS 166
           R++E R+RE++RER R  +A +
Sbjct: 174 RKQELRKREKDRERARREDAAA 195


>gnl|CDD|205235 pfam13054, DUF3915, Protein of unknown function (DUF3915).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 120 amino acids in length.
          Length = 116

 Score = 32.6 bits (74), Expect = 0.13
 Identities = 11/26 (42%), Positives = 14/26 (53%)

Query: 144 REREREREREREREREREREAKSYDP 169
           R+       + ERERERERE +   P
Sbjct: 12  RDCHHHEREDFEREREREREKEPQRP 37



 Score = 30.2 bits (68), Expect = 0.71
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 142 KEREREREREREREREREREREAK 165
           ++       + ERERERERE+E +
Sbjct: 12  RDCHHHEREDFEREREREREKEPQ 35



 Score = 29.5 bits (66), Expect = 1.4
 Identities = 13/19 (68%), Positives = 16/19 (84%)

Query: 138 EEEEKERERERERERERER 156
           E E+ ERERERERE+E +R
Sbjct: 18  EREDFEREREREREKEPQR 36



 Score = 29.1 bits (65), Expect = 2.0
 Identities = 12/19 (63%), Positives = 16/19 (84%)

Query: 140 EEKERERERERERERERER 158
           E ++ ERERERERE+E +R
Sbjct: 18  EREDFEREREREREKEPQR 36



 Score = 28.3 bits (63), Expect = 3.6
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 141 EKERERERERERERERERERERE 163
           +       + ERERERERE+E +
Sbjct: 13  DCHHHEREDFEREREREREKEPQ 35


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 34.1 bits (78), Expect = 0.14
 Identities = 13/32 (40%), Positives = 20/32 (62%)

Query: 138 EEEEKEREREREREREREREREREREAKSYDP 169
           +E+EK+ E  R+RE E++RER R +      P
Sbjct: 148 KEKEKKVEEPRDREEEKKRERVRAKSRPKKPP 179



 Score = 33.3 bits (76), Expect = 0.29
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
           ++E++ E  R+RE E++RER R + R  K 
Sbjct: 149 EKEKKVEEPRDREEEKKRERVRAKSRPKKP 178



 Score = 32.9 bits (75), Expect = 0.34
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
            +E+EKE+E++ E  R+RE E++RER    
Sbjct: 143 PKEKEKEKEKKVEEPRDREEEKKRERVRAK 172



 Score = 32.6 bits (74), Expect = 0.45
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
           ++E+EKE++ E  R+RE E++RER R    
Sbjct: 145 EKEKEKEKKVEEPRDREEEKKRERVRAKSR 174



 Score = 32.6 bits (74), Expect = 0.46
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDP 169
           +++ E+ R+RE E++RER R + R ++     P
Sbjct: 151 EKKVEEPRDREEEKKRERVRAKSRPKKPPKKKP 183



 Score = 32.2 bits (73), Expect = 0.62
 Identities = 12/30 (40%), Positives = 21/30 (70%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
           +  +EKE+E+E++ E  R+RE E++RE   
Sbjct: 141 RPPKEKEKEKEKKVEEPRDREEEKKRERVR 170



 Score = 31.0 bits (70), Expect = 1.2
 Identities = 15/60 (25%), Positives = 30/60 (50%)

Query: 107 HAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKS 166
             +  +VK+   + K K  +     +     +E+   +E+E+E+E++ E  R+RE E K 
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166



 Score = 30.2 bits (68), Expect = 2.3
 Identities = 9/36 (25%), Positives = 26/36 (72%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPP 172
           KE+E+ + E+++++E+ +E  ++R+ + ++ +  PP
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPP 143



 Score = 30.2 bits (68), Expect = 2.5
 Identities = 10/40 (25%), Positives = 25/40 (62%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPPVAND 176
           KE+ ++E+++++E+ +E  ++R+ + EAK   P      +
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKE 149



 Score = 29.1 bits (65), Expect = 5.0
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query: 137 KEEEEKEREREREREREREREREREREAK 165
             EEEK+RER R + R ++  +++    K
Sbjct: 159 DREEEKKRERVRAKSRPKKPPKKKPPNKK 187



 Score = 28.3 bits (63), Expect = 8.3
 Identities = 9/48 (18%), Positives = 25/48 (52%)

Query: 118 ARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAK 165
           A++             + +KEE++K++E+ +E  ++R+ + E + +  
Sbjct: 95  AKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRP 142


>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
           structure-specific, divalent-metal-ion dependent, 5'
           nuclease and homologs.  Exonuclease-1 (EXO1) is involved
           in multiple, eukaryotic DNA metabolic pathways,
           including DNA replication processes (5' flap DNA
           endonuclease activity and double stranded DNA
           5'-exonuclease activity), DNA repair processes (DNA
           mismatch repair (MMR) and post-replication repair
           (PRR)), recombination, and telomere integrity. EXO1
           functions in the MMS2 error-free branch of the PRR
           pathway in the maintenance and repair of stalled
           replication forks. Studies also suggest that EXO1 plays
           both structural and catalytic roles during MMR-mediated
           mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
           family of structure-specific, 5' nucleases. These
           nucleases contain a PIN (PilT N terminus) domain with a
           helical arch/clamp region (I domain) of variable length
           (approximately 43 residues in EXO1 PIN domains) and a
           H3TH (helix-3-turn-helix) domain, an atypical
           helix-hairpin-helix-2-like region. Both the H3TH domain
           (not included here) and the helical arch/clamp region
           are involved in DNA binding. Nucleases within this group
           also have a carboxylate-rich active site that is
           involved in binding essential divalent metal ion
           cofactors (Mg2+/Mn2+). EXO1 nucleases also have
           C-terminal Mlh1- and Msh2-binding domains which allow
           interaction with MMR and PRR proteins, respectively.
          Length = 210

 Score = 33.2 bits (77), Expect = 0.18
 Identities = 12/35 (34%), Positives = 16/35 (45%), Gaps = 3/35 (8%)

Query: 136 MKEEEEKERERERERERERERERERE---REAKSY 167
            K+  E+ER   RE   E+  E  RE    EA+  
Sbjct: 84  SKKGTEEERRERREENLEKALELLREGNRSEAREC 118


>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family.  This is a family of
           sequences found in both bacteria and bacteriophages.
           This region is approximately 130 residues long and in
           some cases is found as part of the PVL (Panton-Valentine
           leukocidin) group of genes, which encode a member of the
           leukocidin group of bacterial toxins that kill
           leukocytes by creation of pores in the cell membrane.
           PVL appears to be a virulence factor associated with a
           number of human diseases.
          Length = 118

 Score = 32.1 bits (73), Expect = 0.19
 Identities = 12/23 (52%), Positives = 17/23 (73%)

Query: 138 EEEEKERERERERERERERERER 160
           E+E KERE ER+R +E E  R++
Sbjct: 64  EQERKERELERKRRKEAELRRKK 86



 Score = 30.2 bits (68), Expect = 0.88
 Identities = 10/27 (37%), Positives = 18/27 (66%)

Query: 136 MKEEEEKERERERERERERERERERER 162
            +  E++ +ERE ER+R +E E  R++
Sbjct: 60  SENIEQERKERELERKRRKEAELRRKK 86



 Score = 29.8 bits (67), Expect = 1.1
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query: 134 DVMKEEEEKERERERERERERERERERER 162
           ++ K E  ++  +ERE ER+R +E E  R
Sbjct: 56  EIKKSENIEQERKERELERKRRKEAELRR 84



 Score = 29.0 bits (65), Expect = 2.3
 Identities = 13/38 (34%), Positives = 21/38 (55%)

Query: 132 RLDVMKEEEEKEREREREREREREREREREREAKSYDP 169
           RL   +E ++ E   +  +ERE ER+R +E E +   P
Sbjct: 50  RLKEYREIKKSENIEQERKERELERKRRKEAELRRKKP 87


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 33.7 bits (77), Expect = 0.21
 Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 6/41 (14%)

Query: 134 DVMKEEEEKERER------EREREREREREREREREAKSYD 168
           D + EE+ +E E+      ERER+ E +R RE E+ A   D
Sbjct: 248 DFLLEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEAD 288



 Score = 32.5 bits (74), Expect = 0.55
 Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 11/71 (15%)

Query: 100 NKFMLDVHAKI--HEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERERERER- 156
             F +++   +  H  +  L  +K +  +     +L   KEE E+ER+ E +R RE E+ 
Sbjct: 231 KDFCVEIEGGMDEHSFEDFLLEEKRRELE-----KLA--KEEAERERQAEEQRRREEEKA 283

Query: 157 ERERER-EAKS 166
             E +R +AK+
Sbjct: 284 AMEADRAQAKA 294


>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family.  Members of this family are
           coiled-coil proteins that are involved in pre-rRNA
           processing.
          Length = 105

 Score = 30.8 bits (70), Expect = 0.37
 Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 1/35 (2%)

Query: 132 RLDVMKEEEEKERERERERERER-EREREREREAK 165
           R   +K+E+E ER+R  +  +ER   + E+ER  K
Sbjct: 46  REKELKDEKEAERQRRIQAIKERRAAKEEKERYEK 80



 Score = 29.7 bits (67), Expect = 0.95
 Identities = 9/29 (31%), Positives = 19/29 (65%)

Query: 137 KEEEEKEREREREREREREREREREREAK 165
           + E+  E++  + RE+E + E+E ER+ +
Sbjct: 33  RMEKRLEQQAIKAREKELKDEKEAERQRR 61



 Score = 27.4 bits (61), Expect = 5.8
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 139 EEEKEREREREREREREREREREREAK 165
             EKE + E+E ER+R  +  +ER A 
Sbjct: 45  AREKELKDEKEAERQRRIQAIKERRAA 71


>gnl|CDD|178307 PLN02705, PLN02705, beta-amylase.
          Length = 681

 Score = 32.6 bits (74), Expect = 0.41
 Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 13/48 (27%)

Query: 145 EREREREREREREREREREA---------KSYD--PIPPVA--NDIIS 179
           +RERE+E+ER + RER R A         + Y   P+P  A  ND+++
Sbjct: 78  KREREKEKERTKLRERHRRAITSRMLAGLRQYGNFPLPARADMNDVLA 125



 Score = 31.1 bits (70), Expect = 1.5
 Identities = 11/18 (61%), Positives = 15/18 (83%)

Query: 143 ERERERERERERERERER 160
           +RERE+E+ER + RER R
Sbjct: 78  KREREKEKERTKLRERHR 95



 Score = 29.9 bits (67), Expect = 3.3
 Identities = 10/18 (55%), Positives = 15/18 (83%)

Query: 141 EKERERERERERERERER 158
           ++ERE+E+ER + RER R
Sbjct: 78  KREREKEKERTKLRERHR 95


>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein.  This family includes the
           Saccharomyces cerevisiae protein SPT2 which is a
           chromatin protein involved in transcriptional
           regulation.
          Length = 116

 Score = 31.0 bits (70), Expect = 0.41
 Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 10/64 (15%)

Query: 112 EVKKILARDKAKSYDPGV----------PNRLDVMKEEEEKERERERERERERERERERE 161
           E+  I  + + +SY                 +++ KEE    R    E ERE  RE E E
Sbjct: 48  EIWAIFGKGRKRSYYDRYDEDDALDNMEATFMEIQKEERRSARMARLEDERELAREEEEE 107

Query: 162 REAK 165
           +  K
Sbjct: 108 KRKK 111


>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824).  This is
           a repeating domain found in fungal proteins. It is
           proline-rich, and the function is not known.
          Length = 135

 Score = 31.0 bits (70), Expect = 0.47
 Identities = 12/38 (31%), Positives = 18/38 (47%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPPVA 174
           +++ E+ERER R          E   +   Y P PPV+
Sbjct: 29  RKKAERERERYRHDHDAYSDSYEEPYDPTPYPPSPPVS 66



 Score = 29.5 bits (66), Expect = 1.8
 Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 1/37 (2%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPPV 173
           K  + +ER++  ERERER R           +P  P 
Sbjct: 22  KYSQRRERKKA-ERERERYRHDHDAYSDSYEEPYDPT 57


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 30.8 bits (70), Expect = 0.60
 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 105 DVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERERE 161
            + A   E K+    ++ ++  P     L  ++  E ++RERE   ER+ ++ERE+E
Sbjct: 26  SIKAAKEEKKQAKLSERKENRKPKYIGSL--LEAAERRKREREIAEERKLQKEREKE 80



 Score = 30.5 bits (69), Expect = 0.64
 Identities = 11/27 (40%), Positives = 18/27 (66%)

Query: 142 KEREREREREREREREREREREAKSYD 168
           + R+RERE   ER+ ++ERE+E   + 
Sbjct: 59  ERRKREREIAEERKLQKEREKEGDEFA 85



 Score = 30.1 bits (68), Expect = 0.91
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 141 EKEREREREREREREREREREREA 164
           E    R+RERE   ER+ ++ERE 
Sbjct: 56  EAAERRKREREIAEERKLQKEREK 79


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 32.3 bits (74), Expect = 0.62
 Identities = 11/43 (25%), Positives = 19/43 (44%), Gaps = 1/43 (2%)

Query: 134 DVMKEEEEKEREREREREREREREREREREAKSYDPIPPVAND 176
              K   + E+++   RER+R R  E+ R+A+       V   
Sbjct: 661 VTEKARTQDEQQQAPRRERQRRRNDEK-RQAQQEAKALNVEEQ 702



 Score = 30.0 bits (68), Expect = 3.3
 Identities = 12/63 (19%), Positives = 26/63 (41%), Gaps = 5/63 (7%)

Query: 119 RDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERERER-----EAKSYDPIPPV 173
           R   +       ++   + E+   + E+++   RER+R R  E+     EAK+ +     
Sbjct: 644 RQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQS 703

Query: 174 AND 176
             +
Sbjct: 704 VQE 706


>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
           This family contains a number of defence peptides
           secreted from the skin of amphibians, including the
           opiate-like dermorphins and deltorphins, and the
           antimicrobial dermoseptins and temporins. The alignment
           for this family includes the signal peptide.
          Length = 46

 Score = 28.5 bits (64), Expect = 0.73
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 138 EEEEKERERERERERERERERERER 162
           EEE++E E E E E E E + E +R
Sbjct: 22  EEEKREDEEENEDEEEGEEQSEVKR 46



 Score = 27.7 bits (62), Expect = 1.6
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 137 KEEEEKERERERERERERERER 158
           K E+E+E E E E E + E +R
Sbjct: 25  KREDEEENEDEEEGEEQSEVKR 46



 Score = 26.5 bits (59), Expect = 3.3
 Identities = 14/27 (51%), Positives = 15/27 (55%), Gaps = 3/27 (11%)

Query: 139 EEEKEREREREREREREREREREREAK 165
           EEEK   RE E E E E E E + E K
Sbjct: 22  EEEK---REDEEENEDEEEGEEQSEVK 45


>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078).  This
           family is found from fungi to humans, but its exact
           function is not known.
          Length = 88

 Score = 29.6 bits (67), Expect = 0.75
 Identities = 11/28 (39%), Positives = 18/28 (64%)

Query: 138 EEEEKEREREREREREREREREREREAK 165
           E+  +E ERER+   ER+ +R+R  E +
Sbjct: 53  EKAREETERERKEREERKEKRKRAIEER 80



 Score = 29.2 bits (66), Expect = 0.97
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 133 LDVMKEEEEKEREREREREREREREREREREA 164
           L+  +EE E+ER+   ER+ +R+R  E  R+ 
Sbjct: 52  LEKAREETERERKEREERKEKRKRAIEERRKK 83



 Score = 29.2 bits (66), Expect = 1.1
 Identities = 12/31 (38%), Positives = 19/31 (61%), Gaps = 1/31 (3%)

Query: 136 MKEEEEKERERER-EREREREREREREREAK 165
            ++ EE E+ RE  ERER+   ER+ +R+  
Sbjct: 46  KEQMEELEKAREETERERKEREERKEKRKRA 76



 Score = 27.2 bits (61), Expect = 5.5
 Identities = 14/33 (42%), Positives = 21/33 (63%), Gaps = 4/33 (12%)

Query: 137 KEEEEKERE-RERERER---EREREREREREAK 165
           K+EEE++ +  E E+ R   ERER+   ER+ K
Sbjct: 40  KDEEERKEQMEELEKAREETERERKEREERKEK 72


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 31.3 bits (71), Expect = 0.85
 Identities = 18/75 (24%), Positives = 34/75 (45%), Gaps = 1/75 (1%)

Query: 103 MLDVHAKIHEVKKI-LARDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERERE 161
           ++D  A   +  +I   +  A   +     +L+   EE EK+R  E+ R++E E+    E
Sbjct: 42  LVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAE 101

Query: 162 REAKSYDPIPPVAND 176
           + AK  +     A +
Sbjct: 102 KAAKQAEQAAKQAEE 116


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 31.6 bits (71), Expect = 0.92
 Identities = 15/53 (28%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 127 PGVPNRLDVMK-EEEEKERER-EREREREREREREREREAKSYDPIPPVANDI 177
           PG+P     +K EEE + R+R E+++  +++RE+E+    + +        DI
Sbjct: 73  PGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDI 125



 Score = 29.6 bits (66), Expect = 3.9
 Identities = 12/68 (17%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 104 LDVHAKIHEVKKILARDKAKSYDPGVPNRLDV-MKEEEEKERERERERERERERERERER 162
           +D+   + + +K+  +    +     P + DV   E++ K+ +++ ++E+E+ER++++++
Sbjct: 161 IDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKK 220

Query: 163 EAKSYDPI 170
           E + +  +
Sbjct: 221 EVEGFKSL 228


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 30.9 bits (70), Expect = 1.0
 Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 137 KEEEEKEREREREREREREREREREREAK 165
           K ++  E ++++ERE+ERE    R+R AK
Sbjct: 103 KLQKLLEEKQKQEREKEREEAELRQRLAK 131



 Score = 30.9 bits (70), Expect = 1.2
 Identities = 7/27 (25%), Positives = 21/27 (77%)

Query: 137 KEEEEKERERERERERERERERERERE 163
            ++++++RE ER ++R++++E E  ++
Sbjct: 192 LKQQQQKREEERRKQRKKQQEEEERKQ 218



 Score = 29.7 bits (67), Expect = 3.0
 Identities = 8/27 (29%), Positives = 22/27 (81%)

Query: 136 MKEEEEKERERERERERERERERERER 162
           +K+ ++++++RE ER ++R++++E E 
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQEEEE 215



 Score = 29.3 bits (66), Expect = 3.8
 Identities = 6/30 (20%), Positives = 23/30 (76%)

Query: 138 EEEEKEREREREREREREREREREREAKSY 167
           +++++E ER ++R++++E E  +++  +++
Sbjct: 195 QQQKREEERRKQRKKQQEEEERKQKAEEAW 224



 Score = 29.3 bits (66), Expect = 4.0
 Identities = 8/30 (26%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 137 KEEEEKER-EREREREREREREREREREAK 165
           K+ + +++ ++  E ++++ERE+ERE    
Sbjct: 96  KQAQRQKKLQKLLEEKQKQEREKEREEAEL 125



 Score = 28.9 bits (65), Expect = 4.2
 Identities = 8/25 (32%), Positives = 21/25 (84%)

Query: 139 EEEKERERERERERERERERERERE 163
           E +K ++++++RE ER ++R++++E
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQE 212



 Score = 28.5 bits (64), Expect = 7.2
 Identities = 7/28 (25%), Positives = 22/28 (78%)

Query: 138 EEEEKEREREREREREREREREREREAK 165
           +E E ++ ++++++RE ER ++R+++ +
Sbjct: 185 QEWELKKLKQQQQKREEERRKQRKKQQE 212


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 31.2 bits (71), Expect = 1.0
 Identities = 13/44 (29%), Positives = 16/44 (36%), Gaps = 2/44 (4%)

Query: 138 EEEEKERERE--REREREREREREREREAKSYDPIPPVANDIIS 179
               K RE     E    R R  E +R  ++Y P  P    I S
Sbjct: 245 AAAAKAREAAAAAEAAAARARAAEAKRTGETYKPTAPEKMLISS 288


>gnl|CDD|234953 PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed.
          Length = 476

 Score = 31.2 bits (72), Expect = 1.1
 Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 10/73 (13%)

Query: 104 LDVHAKIHEVKKILARDKA-KSYDPGVPNRLDVMKEEEEKERERER--ERERERERERER 160
           LD++ +    +++L   KA   Y    P  L+ M+EE+    E  R   R R+  +E   
Sbjct: 88  LDIYKEY--AEQLLEEGKAYYCYCT--PEELEAMREEQRAAGEPPRYDGRCRDLTKEEVA 143

Query: 161 EREAKSYDPIPPV 173
            R A      PPV
Sbjct: 144 ARLAAG---EPPV 153


>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component.  This
           family is a region of the Myb-Related Cdc5p/Cef1
           proteins, in fungi, and is part of the pre-mRNA splicing
           factor complex.
          Length = 363

 Score = 31.2 bits (71), Expect = 1.1
 Identities = 10/44 (22%), Positives = 20/44 (45%), Gaps = 3/44 (6%)

Query: 138 EEEEKEREREREREREREREREREREAKSYD---PIPPVANDII 178
           EE+  +R+  +    E + + E  R ++      P P V + I+
Sbjct: 165 EEDAADRDARKRAAEEAKEQEELRRRSQVIQRNLPRPSVLDLIV 208


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 30.8 bits (70), Expect = 1.1
 Identities = 15/37 (40%), Positives = 24/37 (64%), Gaps = 1/37 (2%)

Query: 133 LDVMKEEEEKEREREREREREREREREREREAKSYDP 169
           L+ +K+E  +E+ERE E E+  E E+ RE E  + +P
Sbjct: 151 LEKIKKERAEEKERE-EEEKAAEEEKAREEEILTGNP 186



 Score = 29.3 bits (66), Expect = 3.2
 Identities = 11/30 (36%), Positives = 20/30 (66%)

Query: 136 MKEEEEKEREREREREREREREREREREAK 165
           ++E E+ ++ER  E+ERE E +   E +A+
Sbjct: 148 LRELEKIKKERAEEKEREEEEKAAEEEKAR 177


>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168).  This
           family consists of several hypothetical eukaryotic
           proteins of unknown function.
          Length = 142

 Score = 30.0 bits (68), Expect = 1.1
 Identities = 13/36 (36%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 131 NRLDVMKEEEEKERERER-EREREREREREREREAK 165
            RL++M E+ +KE E E  +++RE ++ ++ E+ AK
Sbjct: 50  ERLELMDEKWKKETEDEEFQQKREEKKRKDEEKTAK 85


>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 406

 Score = 31.1 bits (71), Expect = 1.2
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 62  VKQPKGLYIYGAVGGGKTML 81
           +  PKG+ +YG  G GKT+L
Sbjct: 182 IDPPKGVLLYGPPGTGKTLL 201


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
           +EEEEK  E ERER     + R  +   K 
Sbjct: 124 REEEEKRDEEERERLLRAAKSRSEQSRLKQ 153


>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
           hydrophilic C-term.  This domain is a hydrophilic region
           found at the C-terminus of plant and metazoan
           pre-mRNA-splicing factor 38 proteins. The function is
           not known.
          Length = 97

 Score = 29.4 bits (66), Expect = 1.2
 Identities = 13/58 (22%), Positives = 21/58 (36%), Gaps = 2/58 (3%)

Query: 112 EVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDP 169
           E  + + R   +  D G   R    +      R +   + R R R+R+R R     D 
Sbjct: 20  EDDEEIRRKAERDVDRG--RRSPRRRTRRRSRRRKRSRKRRRRRRDRDRARYRDRDDR 75



 Score = 28.2 bits (63), Expect = 2.8
 Identities = 11/35 (31%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 136 MKEEEEKERERERERERERERERE-REREAKSYDP 169
            ++   K R R R+R+R R R+R+ R+R+      
Sbjct: 50  RRKRSRKRRRRRRDRDRARYRDRDDRDRDRYDRSR 84



 Score = 27.8 bits (62), Expect = 3.9
 Identities = 10/28 (35%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 137 KEEEEKERER-ERERERERERERERERE 163
           ++ ++++R+R +R R R R R R+R R 
Sbjct: 70  RDRDDRDRDRYDRSRSRSRSRSRDRRRR 97


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 31.2 bits (71), Expect = 1.3
 Identities = 15/34 (44%), Positives = 22/34 (64%), Gaps = 7/34 (20%)

Query: 143 ERE-RER-ERE-REREREREREREAK----SYDP 169
           ERE + R ERE RE  R R+++++AK    +Y P
Sbjct: 253 EREAKARAEREAREAARVRDKQKQAKRKGSTYKP 286


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 29.9 bits (67), Expect = 1.4
 Identities = 13/56 (23%), Positives = 27/56 (48%), Gaps = 4/56 (7%)

Query: 112 EVKKILARDKAKSYDPGV----PNRLDVMKEEEEKERERERERERERERERERERE 163
           +VK +L   + +  +           + ++E EE+ + + R+ E ERE  R+  R+
Sbjct: 15  DVKGMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRKMEEEREVMRQGIRD 70


>gnl|CDD|213395 cd12152, F1-ATPase_delta, mitochondrial ATP synthase delta subunit.
            The F-ATPase is found in bacterial plasma membranes,
           mitochondrial inner membranes and in chloroplast
           thylakoid membranes. It has also been found in the
           archaea Methanosarcina barkeri. It uses a proton
           gradient to drive ATP synthesis and hydrolyzes ATP to
           build the proton gradient. The extrinisic membrane
           domain, F1, is composed of alpha, beta, gamma, delta,
           and epsilon subunits with a stoichiometry of 3:3:1:1:1.
           Alpha and beta subunit form the globular catalytic
           moiety, a hexameric ring of alternating subunits. Gamma,
           delta and epsilon subunits form a stalk, connecting F1
           to F0, the integral membrane proton translocating
           domain. In bacteria, which is lacking a eukaryotic
           epsilon subunit homolog, this subunit is called the
           epsilon subunit.
          Length = 123

 Score = 29.4 bits (67), Expect = 1.7
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query: 133 LDVMKEEEEKEREREREREREREREREREREA 164
           +DV + EE  ER  ER  + + ERE+ R   A
Sbjct: 86  IDVERAEEALERAEERLAQAKDEREKARAEAA 117


>gnl|CDD|206316 pfam14147, Spore_YhaL, Sporulation protein YhaL.  This family of
           proteins is involved in sporulation. In B. subtilis its
           expression is regulated by the early
           mother-cell-specific transcription factor sigma-E.
          Length = 52

 Score = 27.7 bits (62), Expect = 1.7
 Identities = 17/29 (58%), Positives = 19/29 (65%), Gaps = 2/29 (6%)

Query: 136 MKEEEEKERER-ERERERERER-ERERER 162
            KEE EKE+E  E+E E   ER E ERER
Sbjct: 23  AKEEREKEQEFIEKEGEVYMERLEEERER 51


>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
           This is a family of fungal proteins whose function is
           unknown.
          Length = 130

 Score = 29.2 bits (66), Expect = 2.0
 Identities = 14/59 (23%), Positives = 26/59 (44%)

Query: 107 HAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAK 165
           +AK+ +        K K   PG   R+ +    E  +   E+E+   + RE++ +R  K
Sbjct: 68  NAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFKRRQK 126


>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
          Length = 188

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 138 EEEEKEREREREREREREREREREREAKS 166
           E + +E   +RE E ERE E+ RE+E  S
Sbjct: 37  EADAEEILEDREAEAEREIEQLREQELSS 65


>gnl|CDD|237897 PRK15067, PRK15067, ethanolamine ammonia lyase large subunit;
           Provisional.
          Length = 461

 Score = 30.2 bits (69), Expect = 2.1
 Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 2/34 (5%)

Query: 384 SDVPLNKLFSNEAVIDTHSDE-HRMLMDDLNIKA 416
           +D+PL K F N  VI    DE  R+++DD +  A
Sbjct: 52  ADLPL-KDFLNNPVIPYEEDEVTRLIIDDHDAAA 84


>gnl|CDD|180935 PRK07333, PRK07333, 2-octaprenyl-6-methoxyphenyl hydroxylase;
           Provisional.
          Length = 403

 Score = 30.3 bits (69), Expect = 2.1
 Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 11/44 (25%)

Query: 354 QLNIKLRSQS-RRFITLIDALYDNNIRLVISSDVPLNKLFSNEA 396
            L++  R Q  RRF T+         R+ +++DV LN+LFSN++
Sbjct: 326 SLDVLERYQRWRRFDTV---------RMGVTTDV-LNRLFSNDS 359


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 30.3 bits (69), Expect = 2.2
 Identities = 11/47 (23%), Positives = 19/47 (40%)

Query: 118 ARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREA 164
                K+ + G     +   +  E+  E ER+  R R    +R+ EA
Sbjct: 139 RGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEA 185



 Score = 29.1 bits (66), Expect = 6.1
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query: 131 NRLDVMKEEEEKERERERERERERERERERERE 163
           +R D   ++ E    R   R R+R+R   R  +
Sbjct: 246 DRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGD 278



 Score = 29.1 bits (66), Expect = 6.3
 Identities = 8/31 (25%), Positives = 12/31 (38%)

Query: 138 EEEEKEREREREREREREREREREREAKSYD 168
           E   +    +R+ E ER     RE   +  D
Sbjct: 171 ERRRRGDREDRQAEAERGERGRREERGRDGD 201



 Score = 28.3 bits (64), Expect = 9.7
 Identities = 12/47 (25%), Positives = 16/47 (34%)

Query: 118 ARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREA 164
           A  KA                E  +E ER+  R R    +R+ E E 
Sbjct: 141 AARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAER 187


>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing.  This is a family of
           proteins that are involved in rRNA processing. In a
           localisation study they were found to localise to the
           nucleus and nucleolus. The family also includes other
           metazoa members from plants to mammals where the protein
           has been named BR22 and is associated with TTF-1,
           thyroid transcription factor 1. In the lungs, the family
           binds TTF-1 to form a complex which influences the
           expression of the key lung surfactant protein-B (SP-B)
           and -C (SP-C), the small hydrophobic surfactant proteins
           that maintain surface tension in alveoli.
          Length = 150

 Score = 29.1 bits (65), Expect = 2.3
 Identities = 20/81 (24%), Positives = 38/81 (46%), Gaps = 9/81 (11%)

Query: 100 NKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERE 159
            K  LD   +I + +K   R+K  +            KE E+ E  +++++ERER R++ 
Sbjct: 70  KKKKLDEKKEIAKQRKREQREKELAK---------RQKELEKIELSKKKQKERERRRKKL 120

Query: 160 REREAKSYDPIPPVANDIISK 180
            ++       + P  ND++ K
Sbjct: 121 TKKTKSGQPLMGPRINDLLDK 141


>gnl|CDD|224107 COG1186, PrfB, Protein chain release factor B [Translation,
           ribosomal structure and biogenesis].
          Length = 239

 Score = 29.9 bits (68), Expect = 2.5
 Identities = 7/33 (21%), Positives = 17/33 (51%)

Query: 135 VMKEEEEKEREREREREREREREREREREAKSY 167
            +    +++R +E+ RER  ++      + +SY
Sbjct: 167 KLYILAQEKRSQEKNRERALKKLIGWGNQIRSY 199


>gnl|CDD|150737 pfam10097, DUF2335, Predicted membrane protein (DUF2335).  Members
           of this family of hypothetical bacterial proteins have
           no known function.
          Length = 50

 Score = 26.9 bits (60), Expect = 2.7
 Identities = 12/40 (30%), Positives = 21/40 (52%), Gaps = 1/40 (2%)

Query: 124 SYDPGVPNRLDVMKEEEEKERERERERE-RERERERERER 162
           S  PG   R+  M E+E++ R    ER+ + + R  +R +
Sbjct: 11  SIVPGAAERIIAMAEKEQEHRHELEERQLKAQARRAKRGQ 50


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.0 bits (68), Expect = 2.9
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 132 RLDVMKEEEEKEREREREREREREREREREREAKS 166
            L+ ++ E E  RE   E ERERE   E  R+ + 
Sbjct: 252 ELETLEAEIEDLRETIAETEREREELAEEVRDLRE 286


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 29.3 bits (65), Expect = 2.9
 Identities = 14/33 (42%), Positives = 22/33 (66%)

Query: 136 MKEEEEKEREREREREREREREREREREAKSYD 168
           ++E  E ER  E+  ERE + E+ERERE ++ +
Sbjct: 138 LEERMEWERREEKIDEREDQEEQEREREEQTIE 170


>gnl|CDD|232941 TIGR00362, DnaA, chromosomal replication initiator protein DnaA.
           DnaA is involved in DNA biosynthesis; initiation of
           chromosome replication and can also be transcription
           regulator. The C-terminal of the family hits the pfam
           bacterial DnaA (bac_dnaA) domain family. For a review,
           see Kaguni (2006) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 405

 Score = 29.8 bits (68), Expect = 3.0
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query: 68  LYIYGAVGGGKTMLM 82
           L+IYG VG GKT L+
Sbjct: 139 LFIYGGVGLGKTHLL 153


>gnl|CDD|220331 pfam09660, DUF2397, Protein of unknown function (DUF2397).
           Proteins in this entry are encoded within a conserved
           gene four-gene neighborhood found sporadically in a
           phylogenetically broad range of bacteria including:
           Nocardia farcinica, Symbiobacterium thermophilum, and
           Streptomyces avermitilis (Actinobacteria), Geobacillus
           kaustophilus (Firmicutes), Azoarcus sp. EbN1 and
           Ralstonia solanacearum (Betaproteobacteria).
          Length = 478

 Score = 30.0 bits (68), Expect = 3.0
 Identities = 6/28 (21%), Positives = 11/28 (39%)

Query: 140 EEKEREREREREREREREREREREAKSY 167
            ++  E+    E+ RE + ER       
Sbjct: 379 RDRSAEKRELAEQAREEQAERAAALAEL 406


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 29.1 bits (66), Expect = 3.1
 Identities = 17/57 (29%), Positives = 28/57 (49%), Gaps = 4/57 (7%)

Query: 109 KIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAK 165
           K  + K+   +   K  D G     D +K+  E+ +  + ER+RE  R+R+ E E K
Sbjct: 19  KAKKEKRKQRKQARKGADDGD----DELKQAAEEAKAEKAERDRELNRQRQAEAEQK 71


>gnl|CDD|236353 PRK08939, PRK08939, primosomal protein DnaI; Reviewed.
          Length = 306

 Score = 29.5 bits (67), Expect = 3.1
 Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 66  KGLYIYGAVGGGKTMLM 82
           KGLY+YG  G GK+ L+
Sbjct: 157 KGLYLYGDFGVGKSYLL 173


>gnl|CDD|221041 pfam11241, DUF3043, Protein of unknown function (DUF3043).  Some
           members in this family of proteins with unknown function
           are annotated as membrane proteins. This cannot be
           confirmed.
          Length = 168

 Score = 29.1 bits (66), Expect = 3.2
 Identities = 9/25 (36%), Positives = 14/25 (56%), Gaps = 1/25 (4%)

Query: 140 EEKEREREREREREREREREREREA 164
           E+++  ++  R   RER R R R A
Sbjct: 23  EDRKAAKKAARAARRER-RARARAA 46


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 29.8 bits (67), Expect = 3.2
 Identities = 20/73 (27%), Positives = 28/73 (38%), Gaps = 6/73 (8%)

Query: 107 HAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERER----ERERERERER 162
           HA+  E+KK   R   K+        +   +   E+E  RE  +ER     R R R    
Sbjct: 326 HAENAEIKK--TRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAV 383

Query: 163 EAKSYDPIPPVAN 175
           +AK    I    N
Sbjct: 384 KAKKKGLIDASPN 396


>gnl|CDD|119354 cd02875, GH18_chitobiase, Chitobiase (also known as
           di-N-acetylchitobiase) is a lysosomal glycosidase that
           hydrolyzes the reducing-end N-acetylglucosamine from the
           chitobiose core of oligosaccharides during the ordered
           degradation of asparagine-linked glycoproteins in
           eukaryotes. Chitobiase can only do so if the asparagine
           that joins the oligosaccharide to protein is previously
           removed by a glycosylasparaginase. Chitobiase is
           therefore the final step in the lysosomal degradation of
           the protein/carbohydrate linkage component of
           asparagine-linked glycoproteins. The catalytic domain of
           chitobiase is an eight-stranded alpha/beta barrel fold
           similar to that of other family 18 glycosyl hydrolases
           such as hevamine and chitotriosidase.
          Length = 358

 Score = 29.7 bits (67), Expect = 3.2
 Identities = 14/44 (31%), Positives = 20/44 (45%), Gaps = 6/44 (13%)

Query: 376 NNIRLVISSDVPLNKLFSNEAVIDTHSDEHRML-----MDDLNI 414
             +RLV+  DVPL ++ SN         +   L     MD +NI
Sbjct: 76  KGVRLVLKGDVPLEQI-SNPTYRTQWIQQKVELAKSQFMDGINI 118


>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6.  The surfeit locus
           protein SURF-6 is shown to be a component of the
           nucleolar matrix and has a strong binding capacity for
           nucleic acids.
          Length = 206

 Score = 29.2 bits (66), Expect = 3.4
 Identities = 8/36 (22%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 137 KEEEEKEREREREREREREREREREREAKSYDPIPP 172
           K E+ K R ++++R+  +++E  ++ EA+       
Sbjct: 14  KREQRKAR-KKQKRKEAKKKEDAQKSEAEEVKNEEN 48



 Score = 28.8 bits (65), Expect = 3.8
 Identities = 7/36 (19%), Positives = 22/36 (61%)

Query: 131 NRLDVMKEEEEKEREREREREREREREREREREAKS 166
           +R  ++++   K  +R+  ++++R+  +++E   KS
Sbjct: 3   SREALLEQRRRKREQRKARKKQKRKEAKKKEDAQKS 38


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 29.5 bits (67), Expect = 3.4
 Identities = 9/43 (20%), Positives = 23/43 (53%), Gaps = 13/43 (30%)

Query: 137 KEEEEKERERERERERERER-------------EREREREAKS 166
           ++EE +E++ E+++E    +             E+ER+++A+ 
Sbjct: 280 RQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322



 Score = 29.1 bits (66), Expect = 3.9
 Identities = 9/30 (30%), Positives = 19/30 (63%)

Query: 136 MKEEEEKEREREREREREREREREREREAK 165
           +K  EE+ +E  +E++ E+++E    + AK
Sbjct: 273 LKAAEEERQEEAQEKKEEKKKEEREAKLAK 302


>gnl|CDD|192464 pfam10159, MMtag, Kinase phosphorylation protein.  This is a
           glycine-rich domain that is the most highly conserved
           region of a family of proteins that in vertebrates are
           associated with tumours in multiple myelomas. The region
           may contain phosphorylation sites for several protein
           kinases, as well as N-myristoylation sites and nuclear
           localisation signals, so it might act as a signal
           molecule in the nucleus.
          Length = 78

 Score = 27.3 bits (61), Expect = 3.5
 Identities = 7/28 (25%), Positives = 12/28 (42%)

Query: 137 KEEEEKEREREREREREREREREREREA 164
            +  E+  + + E  R +E E E    A
Sbjct: 49  DDTAEEAEKLKEEIRRIKEAEEEAMAAA 76


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 29.4 bits (66), Expect = 3.5
 Identities = 10/33 (30%), Positives = 20/33 (60%)

Query: 132 RLDVMKEEEEKEREREREREREREREREREREA 164
            L+  KEEEE+ R   ++ E E++  + + ++A
Sbjct: 150 ALEFEKEEEEQRRLLLQKEEEEQQMNKRKNKQA 182


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
           represents a conserved region approximately 60 residues
           long within the eukaryotic targeting protein for Xklp2
           (TPX2). Xklp2 is a kinesin-like protein localised on
           centrosomes throughout the cell cycle and on spindle
           pole microtubules during metaphase. In Xenopus, it has
           been shown that Xklp2 protein is required for centrosome
           separation and maintenance of spindle bi-polarity. TPX2
           is a microtubule-associated protein that mediates the
           binding of the C-terminal domain of Xklp2 to
           microtubules. It is phosphorylated during mitosis in a
           microtubule-dependent way.
          Length = 57

 Score = 27.0 bits (60), Expect = 3.8
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 140 EEKEREREREREREREREREREREA 164
           EEKE+  E E+E    R++E E EA
Sbjct: 18  EEKEKALEAEKEEAEARQKEEEEEA 42



 Score = 26.6 bits (59), Expect = 5.2
 Identities = 12/28 (42%), Positives = 16/28 (57%), Gaps = 1/28 (3%)

Query: 138 EEEEKEREREREREREREREREREREAK 165
           EE+EK  E E+E E E  ++ E E   K
Sbjct: 18  EEKEKALEAEKE-EAEARQKEEEEEAIK 44


>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper. 
          Length = 54

 Score = 26.8 bits (60), Expect = 3.9
 Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 5/34 (14%)

Query: 138 EEEEKERERERE-----REREREREREREREAKS 166
           +E    R R  E     RE++++RE E E   K 
Sbjct: 3   DEYRDRRRRNNEAARRSREKKKQREEELEERVKE 36


>gnl|CDD|234667 PRK00149, dnaA, chromosomal replication initiation protein;
           Reviewed.
          Length = 401

 Score = 29.3 bits (67), Expect = 4.0
 Identities = 10/15 (66%), Positives = 12/15 (80%)

Query: 68  LYIYGAVGGGKTMLM 82
           L+IYG VG GKT L+
Sbjct: 102 LFIYGGVGLGKTHLL 116


>gnl|CDD|197400 cd10164, ClassIIa_HDAC5_Gln-rich-N, Glutamine-rich N-terminal
           helical domain of HDAC5, a Class IIa histone
           deacetylase.  This family consists of the glutamine-rich
           domain of histone deacetylase 5 (HDAC5). It belongs to a
           superfamily that consists of the glutamine-rich
           N-terminal helical extension to certain Class IIa
           histone deacetylases (HDACs), including HDAC4, HDAC5 and
           HDCA9; it is missing from HDAC7. This domain confers
           responsiveness to calcium signals and mediates
           interactions with transcription factors and cofactors,
           and it is able to repress transcription independently of
           the HDAC C-terminal, zinc-dependent catalytic domain. It
           has many intra- and inter-helical interactions which are
           possibly involved in reversible assembly and disassembly
           of proteins. HDACs regulate diverse cellular processes
           through enzymatic deacetylation of histone as well as
           non-histone proteins, in particular deacetylating
           N(6)-acetyl-lysine residues.
          Length = 97

 Score = 27.9 bits (61), Expect = 4.0
 Identities = 9/29 (31%), Positives = 23/29 (79%)

Query: 134 DVMKEEEEKERERERERERERERERERER 162
           +++  + ++E E++R+RE++R+ E E++R
Sbjct: 64  EILAAKRQQELEQQRKREQQRQEELEKQR 92


>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
           MW N terminal.  This domain is found in eukaryotes. This
           domain is about 90 amino acids in length. This domain is
           found associated with pfam00076. This domain is part of
           U1 snRNP, which is the pre-mRNA binding protein of the
           penta-snRNP spliceosome complex. It extends over a
           distance of 180 A from its RNA binding domain, wraps
           around the core domain of U1 snRNP consisting of the
           seven Sm proteins and finally contacts U1-C, which is
           crucial for 5'-splice-site recognition.
          Length = 94

 Score = 27.6 bits (62), Expect = 4.3
 Identities = 12/31 (38%), Positives = 21/31 (67%), Gaps = 3/31 (9%)

Query: 139 EEEKEREREREREREREREREREREAKSYDP 169
            E++ERE+   RE++ + E++ E E K +DP
Sbjct: 63  LEKREREK---REKKEKLEKKLEEELKEWDP 90


>gnl|CDD|184260 PRK13704, PRK13704, plasmid SOS inhibition protein A; Provisional.
          Length = 240

 Score = 28.9 bits (65), Expect = 4.3
 Identities = 16/57 (28%), Positives = 23/57 (40%), Gaps = 16/57 (28%)

Query: 117 LARDKAKSYDPGVPNRLDVMKEEEEKERERERERER------EREREREREREAKSY 167
           L+ D A    P V  R           RER R R         R++ RE E++ ++Y
Sbjct: 98  LSGDVAARLFPSVRFR----------VRERRRHRSAIAFQRYSRQQAREAEQKWRAY 144


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 29.3 bits (66), Expect = 4.6
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 138 EEEEKEREREREREREREREREREREAKSYD 168
           +EEE+ RE+E   E ER      E+  K  +
Sbjct: 398 KEEERPREKEGTEEEERREITVYEKRIKKLE 428



 Score = 28.5 bits (64), Expect = 8.6
 Identities = 9/27 (33%), Positives = 15/27 (55%)

Query: 134 DVMKEEEEKERERERERERERERERER 160
           + + + +E+ER RE+E   E ER    
Sbjct: 392 EALSKVKEEERPREKEGTEEEERREIT 418


>gnl|CDD|215579 PLN03106, TCP2, Protein TCP2; Provisional.
          Length = 447

 Score = 29.3 bits (65), Expect = 4.6
 Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 138 EEEEKEREREREREREREREREREREAKSYDPIPPVAND 176
           E  +K RER RER   +E+E+E    A  ++   P++ +
Sbjct: 205 ELRDKARERARERT-AKEKEKEDHNHAAHHNNNNPISQN 242


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
           entry is characterized by proteins with alternating
           conserved and low-complexity regions. Bud13 together
           with Snu17p and a newly identified factor,
           Pml1p/Ylr016c, form a novel trimeric complex. called The
           RES complex, pre-mRNA retention and splicing complex.
           Subunits of this complex are not essential for viability
           of yeasts but they are required for efficient splicing
           in vitro and in vivo. Furthermore, inactivation of this
           complex causes pre-mRNA leakage from the nucleus. Bud13
           contains a unique, phylogenetically conserved C-terminal
           region of unknown function.
          Length = 141

 Score = 28.0 bits (63), Expect = 4.7
 Identities = 9/24 (37%), Positives = 17/24 (70%)

Query: 141 EKEREREREREREREREREREREA 164
           E++RE +   + E+ER+ E+E+E 
Sbjct: 15  EEKREEKEREKEEKERKEEKEKEW 38



 Score = 27.7 bits (62), Expect = 7.6
 Identities = 9/23 (39%), Positives = 17/23 (73%)

Query: 139 EEEKERERERERERERERERERE 161
           EE++E +   + E+ER+ E+E+E
Sbjct: 15  EEKREEKEREKEEKERKEEKEKE 37


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 28.7 bits (64), Expect = 4.7
 Identities = 9/25 (36%), Positives = 17/25 (68%), Gaps = 1/25 (4%)

Query: 139 EEEKERERERER-ERERERERERER 162
           ++E ERE+ RE+ E++ ++    ER
Sbjct: 5   DKEAEREKLREKYEQDEQKREATER 29


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 28.9 bits (65), Expect = 4.9
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query: 137 KEEEEKEREREREREREREREREREREAKS 166
           K++EE+ER  ER RE   E   +   + K+
Sbjct: 121 KQKEEEERRVERRRELGLEDPEQLRLKQKA 150



 Score = 28.9 bits (65), Expect = 5.4
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 137 KEEEEKEREREREREREREREREREREAK 165
           KEEEE+  ER RE   E   +   +++AK
Sbjct: 123 KEEEERRVERRRELGLEDPEQLRLKQKAK 151


>gnl|CDD|147845 pfam05914, RIB43A, RIB43A.  This family consists of several
           RIB43A-like eukaryotic proteins. Ciliary and flagellar
           microtubules contain a specialised set of
           protofilaments, termed ribbons, that are composed of
           tubulin and several associated proteins. RIB43A was
           first characterized in the unicellular biflagellate,
           Chlamydomonas reinhardtii although highly related
           sequences are present in several higher eukaryotes
           including humans. The function of this protein is
           unknown although the structure of RIB43A and its
           association with the specialised protofilament ribbons
           and with basal bodies is relevant to the proposed role
           of ribbons in forming and stabilising doublet and
           triplet microtubules and in organising their
           three-dimensional structure. Human RIB43A homologues
           could represent a structural requirement in centriole
           replication in dividing cells.
          Length = 379

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 13/35 (37%), Positives = 24/35 (68%)

Query: 134 DVMKEEEEKEREREREREREREREREREREAKSYD 168
            + K +E++ +E+ER RE E+ RE+E +R+A +  
Sbjct: 284 AIRKGQEQQLQEKERRREEEQLREKEWDRQAINQA 318



 Score = 28.9 bits (65), Expect = 5.2
 Identities = 12/26 (46%), Positives = 19/26 (73%)

Query: 140 EEKEREREREREREREREREREREAK 165
           +EKER RE E+ RE+E +R+   +A+
Sbjct: 294 QEKERRREEEQLREKEWDRQAINQAR 319


>gnl|CDD|183452 PRK12337, PRK12337, 2-phosphoglycerate kinase; Provisional.
          Length = 475

 Score = 29.0 bits (65), Expect = 5.4
 Identities = 12/44 (27%), Positives = 21/44 (47%)

Query: 135 VMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDII 178
           V   +E   R R   R+RE    R RER  + ++ I  + + ++
Sbjct: 389 VTLPDEALHRRRFELRDRETGASRPRERYLRHFEEIRLIQDHLL 432


>gnl|CDD|233480 TIGR01579, MiaB-like-C, MiaB-like tRNA modifying enzyme.  This
           clade of sequences is closely related to MiaB, a
           modifier of isopentenylated adenosine-37 of certain
           eukaryotic and bacterial tRNAs (see TIGR01574). Sequence
           alignments suggest that this equivalog performs the same
           chemical transformation as MiaB, perhaps on a different
           (or differently modified) tRNA base substrate. This
           clade is a member of a subfamily (TIGR00089) and spans
           low GC Gram positive bacteria, alpha and epsilon
           proteobacteria, Campylobacter, Porphyromonas, Aquifex,
           Thermotoga, Chlamydia, Treponema and Fusobacterium
           [Protein synthesis, tRNA and rRNA base modification].
          Length = 414

 Score = 28.9 bits (65), Expect = 5.4
 Identities = 12/58 (20%), Positives = 21/58 (36%), Gaps = 9/58 (15%)

Query: 121 KAKSYDPGVPN-----RLDVMKEEEEKERERERERERERER----EREREREAKSYDP 169
            A +    VP      R+  +KE  EK  +   ++   +E     E+E+      Y  
Sbjct: 334 PASTMKDKVPETIKKERVKRLKELAEKNYQEFLKKNIGKELEVLVEKEKAGVLTGYSE 391


>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein.  NinG or Rap is
           involved in recombination. Rap (recombination adept with
           plasmid) increases lambda-by-plasmid recombination
           catalyzed by Escherichia coli's RecBCD pathway.
          Length = 188

 Score = 28.5 bits (64), Expect = 5.6
 Identities = 9/31 (29%), Positives = 20/31 (64%)

Query: 136 MKEEEEKEREREREREREREREREREREAKS 166
           +K E+ +E++R+ E + ER   + R+ + K+
Sbjct: 34  LKREKAQEKKRKAEAQAERRELKARKEKLKT 64



 Score = 28.1 bits (63), Expect = 6.3
 Identities = 9/30 (30%), Positives = 18/30 (60%)

Query: 136 MKEEEEKEREREREREREREREREREREAK 165
           +  + EK +E++R+ E + ER   + R+ K
Sbjct: 32  LALKREKAQEKKRKAEAQAERRELKARKEK 61


>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
           proteins-interacting protein 1.  Members of this family
           of proteins act as negative regulators of G1 to S cell
           cycle phase progression by inhibiting cyclin-dependent
           kinases. Inhibitory effects are additive with GADD45
           proteins but occur also in the absence of GADD45
           proteins. Furthermore, they act as a repressor of the
           orphan nuclear receptor NR4A1 by inhibiting AB
           domain-mediated transcriptional activity.
          Length = 217

 Score = 28.6 bits (64), Expect = 5.6
 Identities = 8/33 (24%), Positives = 16/33 (48%)

Query: 128 GVPNRLDVMKEEEEKERERERERERERERERER 160
                L+ M EE  ++++ +  R + RE E  +
Sbjct: 96  EWYPSLNQMLEENREQQKEKEARRQAREAEIAK 128


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
            protein family belong to a conserved gene four-gene
            neighborhood found sporadically in a phylogenetically
            broad range of bacteria: Nocardia farcinica,
            Symbiobacterium thermophilum, and Streptomyces
            avermitilis (Actinobacteria), Geobacillus kaustophilus
            (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
            solanacearum (Betaproteobacteria). Proteins in this
            family average over 1400 amino acids in length
            [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 29.4 bits (66), Expect = 5.6
 Identities = 10/30 (33%), Positives = 13/30 (43%)

Query: 128  GVPNRLDVMKEEEEKERERERERERERERE 157
            G P     +    E+E  + RE    RERE
Sbjct: 1079 GRPVTPAGLLARLEQEIAQRRELLTARERE 1108



 Score = 29.0 bits (65), Expect = 6.0
 Identities = 13/41 (31%), Positives = 16/41 (39%), Gaps = 5/41 (12%)

Query: 133 LDVMKEEEEKERERERE-----REREREREREREREAKSYD 168
           L   ++E E  RE ERE        ERE +  R R      
Sbjct: 285 LGRARDELETAREEERELDARTEALEREADALRTRLEALQG 325


>gnl|CDD|240312 PTZ00202, PTZ00202, tuzin; Provisional.
          Length = 550

 Score = 29.0 bits (65), Expect = 5.9
 Identities = 13/32 (40%), Positives = 21/32 (65%), Gaps = 2/32 (6%)

Query: 132 RLDVMKEEEEKERERERE-RERERERERERER 162
           R+ V+ E+ E +R+  R+ RE  RER +ER+ 
Sbjct: 166 RMSVL-EKSEWQRDHHRQMRELLRERVKERDF 196


>gnl|CDD|223666 COG0593, DnaA, ATPase involved in DNA replication initiation [DNA
           replication, recombination, and repair].
          Length = 408

 Score = 28.8 bits (65), Expect = 6.1
 Identities = 32/158 (20%), Positives = 46/158 (29%), Gaps = 51/158 (32%)

Query: 68  LYIYGAVGGGKTMLMDIFYESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDP 127
           L+IYG VG GKT L+                         A  +E    LA         
Sbjct: 116 LFIYGGVGLGKTHLL------------------------QAIGNE---ALANGPNARVVY 148

Query: 128 GVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDPIPPVANDIISKTWLICFD 187
                          E       +  R+ E E+ +E  S D +      +I     +   
Sbjct: 149 ------------LTSEDFTNDFVKALRDNEMEKFKEKYSLDLL------LIDDIQFLAGK 190

Query: 188 EFQVTDIADAMILKRLFTELFQLGVVVVATSNRAPDDL 225
           E               F  L + G  +V TS+R P +L
Sbjct: 191 E------RTQEEFFHTFNALLENGKQIVLTSDRPPKEL 222


>gnl|CDD|99707 cd00009, AAA, The AAA+ (ATPases Associated with a wide variety of
          cellular Activities) superfamily represents an ancient
          group of ATPases belonging to the ASCE (for additional
          strand, catalytic E) division of the P-loop NTPase
          fold. The ASCE division also includes ABC, RecA-like,
          VirD4-like, PilT-like, and SF1/2 helicases. Members of
          the AAA+ ATPases function as molecular chaperons,
          ATPase subunits of proteases, helicases, or
          nucleic-acid stimulated ATPases. The AAA+ proteins
          contain several distinct features in addition to the
          conserved alpha-beta-alpha core domain structure and
          the Walker A and B motifs of the P-loop NTPases.
          Length = 151

 Score = 27.9 bits (62), Expect = 6.2
 Identities = 10/20 (50%), Positives = 11/20 (55%)

Query: 63 KQPKGLYIYGAVGGGKTMLM 82
            PK L +YG  G GKT L 
Sbjct: 17 PPPKNLLLYGPPGTGKTTLA 36


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 28.8 bits (65), Expect = 6.3
 Identities = 13/52 (25%), Positives = 26/52 (50%), Gaps = 6/52 (11%)

Query: 123 KSYDPGVPNRLDVMKEEEEKERERERERERE--REREREREREAKSYDPIPP 172
           K  D     R    KE  ++++   + +++E  +++E E+  +AK    IPP
Sbjct: 546 KLDDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK----IPP 593


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 28.8 bits (64), Expect = 6.5
 Identities = 13/34 (38%), Positives = 14/34 (41%)

Query: 134 DVMKEEEEKEREREREREREREREREREREAKSY 167
           D   E E  E E + E   ERE   E E   KS 
Sbjct: 103 DSGAENETVEEEEKEESREEREEVEETEGVTKSE 136


>gnl|CDD|226494 COG4006, COG4006, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 278

 Score = 28.7 bits (64), Expect = 6.7
 Identities = 24/86 (27%), Positives = 35/86 (40%), Gaps = 7/86 (8%)

Query: 285 GIFKLLCSQENDIVRPRVITIMGRNVTFNKTCGQILEATFDELCSRDLGSSDYLHICQIF 344
           G+  L+     DI         G  V  N T G   E+ F  L     G+S   +I + F
Sbjct: 132 GLSDLVREVYCDIKVRIG---RGNVVYINATGGFKPESGFLTLAGSLAGASAVYYIHEAF 188

Query: 345 HTV----IIRNVPQLNIKLRSQSRRF 366
           + V     IR  PQ+++ L   +RR 
Sbjct: 189 NDVVFIPPIRLSPQVDLHLIEDARRL 214


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 28.9 bits (65), Expect = 6.8
 Identities = 12/49 (24%), Positives = 19/49 (38%), Gaps = 4/49 (8%)

Query: 120 DKAKSYDPGVPN----RLDVMKEEEEKEREREREREREREREREREREA 164
            K K  +    N     +  M+  E +++E       E  RE E E E+
Sbjct: 359 GKLKEGEDDEENSGLLSMKFMQRAEARKKEENDAEIEELRRELEGEEES 407


>gnl|CDD|224648 COG1734, DksA, DnaK suppressor protein [Signal transduction
           mechanisms].
          Length = 120

 Score = 27.4 bits (61), Expect = 6.9
 Identities = 24/92 (26%), Positives = 43/92 (46%), Gaps = 14/92 (15%)

Query: 93  QKQRVHFNKFMLDVHAKI-HEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERE---RER 148
           ++Q  HF   +L+    +  E+++     + ++     P   D   +EEE+E E   R+R
Sbjct: 3   KEQLEHFKNKLLEWKKDLLEELEQTEEHLQDENEASPDPA--DRATQEEERELELRLRDR 60

Query: 149 ERERERERERERER-EAKSY-------DPIPP 172
           ER+  R+ E   +R E  +Y       +PIP 
Sbjct: 61  ERKLLRKIESALDRIEEGTYGICEECGEPIPE 92


>gnl|CDD|233050 TIGR00609, recB, exodeoxyribonuclease V, beta subunit.  The RecBCD
           holoenzyme is a multifunctional nuclease with potent
           ATP-dependent exodeoxyribonuclease activity. Ejection of
           RecD, as occurs at chi recombinational hotspots,
           cripples exonuclease activity in favor of recombinagenic
           helicase activity. All proteins in this family for which
           functions are known are DNA-DNA helicases that are used
           as part of an exonuclease-helicase complex (made up of
           RecBCD homologs) that function to generate substrates
           for the initiation of recombination and recombinational
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 1087

 Score = 28.9 bits (65), Expect = 7.0
 Identities = 29/101 (28%), Positives = 44/101 (43%), Gaps = 4/101 (3%)

Query: 107 HAKIHEVKKILA-RDKAKSYDPGVPNRLDVMKE-EEEKEREREREREREREREREREREA 164
           HA   E+ K+ A R+        V   L ++KE +EE ++E++R RE   +    R   A
Sbjct: 223 HAVFEEIDKLNAERNNLFCLKDRV--FLTLLKEVQEELKKEKKRRREIGFDDLLSRLETA 280

Query: 165 KSYDPIPPVANDIISKTWLICFDEFQVTDIADAMILKRLFT 205
                   +A  I  +  +   DEFQ TD     I  +LF 
Sbjct: 281 LKSAEGEKLAQAIREQYPIALIDEFQDTDPQQYRIFSKLFI 321


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 28.9 bits (64), Expect = 7.1
 Identities = 24/59 (40%), Positives = 35/59 (59%), Gaps = 5/59 (8%)

Query: 109 KIHEVKKI--LARDKAKSYDPGVPNRLDVMKEEEEK-ERER-ERER-ERERERERERER 162
           KI E K+I  L R++ +  +     R++  + E E+ ERER ER+R ER+R    ERER
Sbjct: 453 KILEKKRIERLEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERER 511


>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
          Length = 432

 Score = 28.5 bits (64), Expect = 7.3
 Identities = 20/47 (42%), Positives = 29/47 (61%), Gaps = 9/47 (19%)

Query: 138 EEEEKERER-EREREREREREREREREAK------SYDPIPPVANDI 177
           E EE+ R R +RE+E +  RER+R+R A+      +YD   P+A DI
Sbjct: 123 ESEEEWRARLKREQEEQYLRERQRQRMARLQANAAAYD--SPLAVDI 167


>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
           protein; Provisional.
          Length = 438

 Score = 28.6 bits (64), Expect = 7.5
 Identities = 11/20 (55%), Positives = 15/20 (75%)

Query: 62  VKQPKGLYIYGAVGGGKTML 81
           +K PKG+ +YG  G GKT+L
Sbjct: 214 IKPPKGVILYGPPGTGKTLL 233


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 28.8 bits (65), Expect = 7.8
 Identities = 7/35 (20%), Positives = 16/35 (45%), Gaps = 5/35 (14%)

Query: 137 KEEEEKERERERERER-----EREREREREREAKS 166
           +  E+++++ E  + R      R    +  REA+ 
Sbjct: 439 RAIEQEKKKAEEAKARFEARQARLEREKAAREARH 473


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.5 bits (64), Expect = 8.0
 Identities = 16/76 (21%), Positives = 38/76 (50%), Gaps = 5/76 (6%)

Query: 87  ESCETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERER 146
           +  + K ++ +  ++F  +   ++ E++K L+R      +  +    + +KE EEKE   
Sbjct: 286 KELKEKAEEYIKLSEFYEEYLDELREIEKRLSR-----LEEEINGIEERIKELEEKEERL 340

Query: 147 ERERERERERERERER 162
           E  +++ +E E+  E 
Sbjct: 341 EELKKKLKELEKRLEE 356


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 28.5 bits (64), Expect = 8.3
 Identities = 28/113 (24%), Positives = 42/113 (37%), Gaps = 23/113 (20%)

Query: 110 IHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREAKSYDP 169
           +H V+ IL + +    +P   +     +EEE+++ E E E E   E E E          
Sbjct: 327 VHHVQHILPQGRCTWVNPEQKD-----EEEEQEDEEEEEEEEEPEEPEPEEGPP-----L 376

Query: 170 IPPVANDIISKTWLICFDEFQVTDIADAMILKRLFTELFQLGVVVVATSNRAP 222
           + P++ D                D A      RL + L     V V  SNR P
Sbjct: 377 LTPISED----------APLPNDDPAWTF---RLSSSLSPKYAVAVLRSNRWP 416


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
           N-terminal.  This is the very acidic N-terminal region
           of the early transcription elongation factor Spt5. The
           Spt5-Spt4 complex regulates early transcription
           elongation by RNA polymerase II and has an imputed role
           in pre-mRNA processing via its physical association with
           mRNA capping enzymes. The actual function of this
           N-terminal domain is not known although it is
           dispensable for binding to Spt4.
          Length = 92

 Score = 26.6 bits (59), Expect = 8.5
 Identities = 12/47 (25%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 120 DKAKSYDPGVPNRLDVMKEEEEKERERERE-REREREREREREREAK 165
           ++ +  +  + +  D  +  +E E E +R  R  +R RE+E E +A+
Sbjct: 13  EEEEEEEDDLEDLSDEDEFIDEAEAEDDRRHRRLDRRREKEEEEDAE 59


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 28.1 bits (63), Expect = 8.8
 Identities = 13/59 (22%), Positives = 28/59 (47%)

Query: 106 VHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKEREREREREREREREREREREA 164
           + A    V  +       SY+P   +  ++++EE EKE + E++R+     E ++  + 
Sbjct: 169 LAANGKPVPAVEVPHAGASYNPSFEDHQELLQEEYEKEVKAEKKRQELERVEEKKLEKM 227


>gnl|CDD|181091 PRK07720, fliJ, flagellar biosynthesis chaperone; Validated.
          Length = 146

 Score = 27.3 bits (61), Expect = 9.1
 Identities = 18/96 (18%), Positives = 40/96 (41%), Gaps = 17/96 (17%)

Query: 87  ESCETKQKQRV----------HFNKFMLDVHAKIHEVKK--ILARDKAKSYDPGVPNRLD 134
           E  E  +++++          H+ +F+ ++   I   +   + AR++          +  
Sbjct: 47  EDLEQAKEEKLQSGLSIQEIRHYQQFVTNLERTIDHYQLLVMQAREQMNR-----KQQDL 101

Query: 135 VMKEEEEKEREREREREREREREREREREAKSYDPI 170
             K  E K+ E+ +E+++E     E+  E K  D I
Sbjct: 102 TEKNIEVKKYEKMKEKKQEMFALEEKAAEMKEMDEI 137


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
           the C subunit of DNA polymerase delta. It carries the
           essential residues for binding to the Pol1 subunit of
           polymerase alpha, from residues 293-332, which are
           characterized by the motif D--G--VT, referred to as the
           DPIM motif. The first 160 residues of the protein form
           the minimal domain for binding to the B subunit, Cdc1,
           of polymerase delta, the final 10 C-terminal residues,
           362-372, being the DNA sliding clamp, PCNA, binding
           motif.
          Length = 427

 Score = 28.3 bits (63), Expect = 9.1
 Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 2/50 (4%)

Query: 118 ARDKAKSYDPGVPNRLDVMKEEEE--KEREREREREREREREREREREAK 165
            RD     +   P  LD  ++E+E     ER    E   E+E+E+ +  K
Sbjct: 250 KRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLK 299


>gnl|CDD|233714 TIGR02082, metH, 5-methyltetrahydrofolate--homocysteine
           methyltransferase.  This family represents
           5-methyltetrahydrofolate--homocysteine methyltransferase
           (EC 2.1.1.13), one of at least three different enzymes
           able to convert homocysteine to methionine by
           transferring a methyl group on to the sulfur atom. It is
           also called the vitamin B12(or cobalamine)-dependent
           methionine synthase. Other methionine synthases include
           5-methyltetrahydropteroyltriglutamate--homocysteine
           S-methyltransferase (MetE, EC 2.1.1.14, the
           cobalamin-independent methionine synthase) and
           betaine-homocysteine methyltransferase [Amino acid
           biosynthesis, Aspartate family].
          Length = 1178

 Score = 28.6 bits (64), Expect = 9.1
 Identities = 18/114 (15%), Positives = 38/114 (33%), Gaps = 23/114 (20%)

Query: 89  CETKQKQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERER 148
                K  V +   +LD    +  +  +++  +  +           +KEE +  RE+  
Sbjct: 832 IAPIYKGPVVY---VLDASRAVTVMDTLMSAKRKATE-------NGRIKEEYDTAREKHG 881

Query: 149 ERERERE------REREREREAKSYDPIPPVANDIISKTWLICFDEFQVTDIAD 196
           E+  +R         +       S D  PP      ++         + +DIA+
Sbjct: 882 EQRSKRIAASEQAARKNVFAPDWSDDIEPPAPPFWGTQ-------IVEASDIAE 928


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 28.4 bits (63), Expect = 9.2
 Identities = 15/64 (23%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 103 MLDVHAKIHEVKKILA-RDKAKSYDPGVPNRLDVMKEEEEKERERERERERERERERERE 161
           M+D  A + +  +I + +  AK  +     + + + EE + ++  E+ER ++ E+ER + 
Sbjct: 54  MVDPGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKA 113

Query: 162 REAK 165
           +E +
Sbjct: 114 QEQQ 117


>gnl|CDD|227184 COG4847, COG4847, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 103

 Score = 26.8 bits (59), Expect = 9.2
 Identities = 14/65 (21%), Positives = 22/65 (33%)

Query: 94  KQRVHFNKFMLDVHAKIHEVKKILARDKAKSYDPGVPNRLDVMKEEEEKERERERERERE 153
           K  VH+         K    K  +A   A      +   L  +    E E  +E  R+  
Sbjct: 27  KGSVHYECLAESKRKKPGGDKDEVALLLANLLLLYLIVLLKELAVIAESEEAKESIRQVR 86

Query: 154 RERER 158
           +E E+
Sbjct: 87  QEVEK 91


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 27.8 bits (61), Expect = 9.7
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 137 KEEEEKEREREREREREREREREREREAK 165
           +  EE+ R  E  R +E ER RE+E +AK
Sbjct: 63  RAAEERLRREEEARRQEEERAREKEEKAK 91


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 27.1 bits (60), Expect = 9.7
 Identities = 13/25 (52%), Positives = 18/25 (72%), Gaps = 1/25 (4%)

Query: 141 EKEREREREREREREREREREREAK 165
            K +E+E E+ER  ERE  RE++AK
Sbjct: 39  AKAQEKEHEKER-AEREEAREKKAK 62


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.134    0.381 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,178,323
Number of extensions: 2205611
Number of successful extensions: 6938
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5734
Number of HSP's successfully gapped: 435
Length of query: 438
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 338
Effective length of database: 6,502,202
Effective search space: 2197744276
Effective search space used: 2197744276
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.7 bits)