RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16189
         (262 letters)



>gnl|CDD|238093 cd00162, RING, RING-finger (Really Interesting New Gene) domain, a
           specialized type of Zn-finger of 40 to 60 residues that
           binds two atoms of zinc; defined by the 'cross-brace'
           motif C-X2-C-X(9-39)-C-X(1-3)-
           H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved
           in mediating protein-protein interactions; identified in
           a proteins with a wide range of functions such as viral
           replication, signal transduction, and development; has
           two variants, the C3HC4-type and a C3H2C3-type (RING-H2
           finger), which have different cysteine/histidine
           pattern; a subset of RINGs are associated with B-Boxes
           (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
          Length = 45

 Score = 50.5 bits (121), Expect = 5e-09
 Identities = 15/45 (33%), Positives = 20/45 (44%), Gaps = 1/45 (2%)

Query: 207 TCPSCK-VKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKCNAAF 250
            CP C    R+  VL  C HVFC  C+    ++ +  CP C    
Sbjct: 1   ECPICLEEFREPVVLLPCGHVFCRSCIDKWLKSGKNTCPLCRTPI 45


>gnl|CDD|214546 smart00184, RING, Ring finger.  E3 ubiquitin-protein ligase
           activity is intrinsic to the RING domain of c-Cbl and is
           likely to be a general function of this domain; Various
           RING fingers exhibit binding activity towards E2
           ubiquitin-conjugating enzymes (Ubc' s).
          Length = 40

 Score = 49.0 bits (117), Expect = 2e-08
 Identities = 16/40 (40%), Positives = 19/40 (47%), Gaps = 1/40 (2%)

Query: 208 CPSCK-VKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKC 246
           CP C     KD V+  C H FC  C+R   E+    CP C
Sbjct: 1   CPICLEEYLKDPVILPCGHTFCRSCIRKWLESGNNTCPIC 40


>gnl|CDD|215715 pfam00097, zf-C3HC4, Zinc finger, C3HC4 type (RING finger).  The
           C3HC4 type zinc-finger (RING finger) is a cysteine-rich
           domain of 40 to 60 residues that coordinates two zinc
           ions, and has the consensus sequence:
           C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C
           where X is any amino acid. Many proteins containing a
           RING finger play a key role in the ubiquitination
           pathway.
          Length = 40

 Score = 47.5 bits (113), Expect = 6e-08
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 1/40 (2%)

Query: 208 CPSCKVKRKDAV-LTKCFHVFCWDCLRTRYETRQRKCPKC 246
           CP C  + KD V +  C H+FC  C+ +  E+    CP C
Sbjct: 1   CPICLEEPKDPVTILPCGHLFCSKCILSWLESGNVTCPLC 40


>gnl|CDD|206094 pfam13923, zf-C3HC4_2, Zinc finger, C3HC4 type (RING finger). 
          Length = 45

 Score = 46.0 bits (109), Expect = 2e-07
 Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 2/44 (4%)

Query: 206 LTCPSCKVKRKDA-VLTKCFHVFCWDCLRTRYETRQRKCPKCNA 248
           L CP C    +D  VLT C HVFC +C+  RY  ++ KCP C  
Sbjct: 3   LECPICLDLLRDPVVLTPCGHVFCRECIL-RYLKKKSKCPICRT 45


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 46.2 bits (110), Expect = 1e-05
 Identities = 38/208 (18%), Positives = 93/208 (44%), Gaps = 7/208 (3%)

Query: 1   MSLGQVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLH 60
           ++ G  +     A   E++   +   +++ Q  +L ++L+   +    L     +  +  
Sbjct: 652 ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711

Query: 61  KLAREEKDIL-------HEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRN 113
           +    + + L        E++  L+ +++ +   + +LEE+   LQ  L  +E+EL    
Sbjct: 712 EELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLE 771

Query: 114 QAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIA 173
           +A+   K +  E  +    L+  LE+   +L+EA++ +      LE+   + +RL++EI 
Sbjct: 772 EALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831

Query: 174 QLRRKTERMKKIEQAGTLDEVMMEEIRE 201
           +L  + E +++       +   +E+  E
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELE 859



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 39/193 (20%), Positives = 90/193 (46%), Gaps = 3/193 (1%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
            E E L   +E   +  E ++E  ++L +++ E ++    L  E  +  +  + A    D
Sbjct: 751 EELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLD 810

Query: 69  ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
            L  ++ +L+ + + +   I +LEE+   L+  L  +E+EL    + ++  K +  E   
Sbjct: 811 ALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEA 870

Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
              +L+  L++   + +E ++ + E  S L     + ++L+E + +L     +++++E  
Sbjct: 871 EKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELE---AKLERLEVE 927

Query: 189 GTLDEVMMEEIRE 201
               E  +EE  E
Sbjct: 928 LPELEEELEEEYE 940



 Score = 36.6 bits (85), Expect = 0.011
 Identities = 34/180 (18%), Positives = 80/180 (44%), Gaps = 3/180 (1%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
            + EAL  E+E      E++++  + L +   E ++    L+ E  +   L +  REE  
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE---LFEALREELA 386

Query: 69  ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
            L  ++  ++ +++ +   I  LEE+   L   L ++++EL      ++  + +  E  +
Sbjct: 387 ELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNE 446

Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
              +L+  LE+   +LKE ++ + E    L+    +   L+  + +L  +    + +   
Sbjct: 447 ELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAV 506



 Score = 34.3 bits (79), Expect = 0.057
 Identities = 23/124 (18%), Positives = 56/124 (45%)

Query: 61  KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNK 120
               EE + L E++   + +I+ + + + +L E+   LQ  L  +++E+      + L +
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308

Query: 121 RKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTE 180
            +  E      +L+  LE+   +++  ++ + E+ + LE        L+E   +L  K  
Sbjct: 309 ERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLS 368

Query: 181 RMKK 184
            + +
Sbjct: 369 ALLE 372



 Score = 32.0 bits (73), Expect = 0.31
 Identities = 31/135 (22%), Positives = 68/135 (50%), Gaps = 10/135 (7%)

Query: 73  QVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAAD 132
           ++  L+ ++  +   + KLEE+ + L+  L ++E  L    + ++  +R+  E  +  A 
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727

Query: 133 LKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLD 192
           L+  LE+  ++L+E ++ + E    LE        LQE + +L    E ++ +E+A    
Sbjct: 728 LEEELEQLQSRLEELEEELEELEEELE-------ELQERLEELE---EELESLEEALAKL 777

Query: 193 EVMMEEIREYKETLT 207
           +  +EE+ E ++ L 
Sbjct: 778 KEEIEELEEKRQALQ 792



 Score = 30.8 bits (70), Expect = 0.85
 Identities = 28/141 (19%), Positives = 65/141 (46%)

Query: 66  EKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIE 125
           E  +L  ++  L+ +++ +   + +LEE+   LQ  L   EKE+      ++  + +  E
Sbjct: 226 ELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285

Query: 126 SAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI 185
             +   +LK  +E+   ++   ++ + E  + LE    + + L+E+I  L+ + E  + +
Sbjct: 286 LQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345

Query: 186 EQAGTLDEVMMEEIREYKETL 206
            +        +EE +E  E  
Sbjct: 346 LEELEQLLAELEEAKEELEEK 366



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 21/125 (16%), Positives = 53/125 (42%), Gaps = 1/125 (0%)

Query: 8   GNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEK 67
               E L    E   +   +++ + + +  +L E       L     + ++  +  +EE 
Sbjct: 368 SALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEEL 427

Query: 68  DILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESA 127
             L  ++  L+ +++ ++  + +LEE+   L+  L  +E+EL    + +     K + S 
Sbjct: 428 KELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQ-RLEKELSSL 486

Query: 128 QSAAD 132
           ++  D
Sbjct: 487 EARLD 491


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 44.3 bits (105), Expect = 4e-05
 Identities = 40/185 (21%), Positives = 94/185 (50%), Gaps = 4/185 (2%)

Query: 26  EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMH 85
             ++E+ SR+  +LRE +    +L  E+    +  +  +E++  L EQ+ +++ +I+ ++
Sbjct: 801 SKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLN 860

Query: 86  TAIRKLEEKERYLQTVLTNVEKEL-HLRNQAMDL-NKRKAIESAQSAADLKLHLEKYH-A 142
               +LEE+   L+  L ++E  L  L+ +  +L  + + +E      + ++  ++   +
Sbjct: 861 GKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLS 920

Query: 143 QLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA-GTLDEVMMEEIRE 201
           +LK   + + E+ S +E    + + + EE   L      ++++E+    L+ V M  I+E
Sbjct: 921 ELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQE 980

Query: 202 YKETL 206
           Y+E L
Sbjct: 981 YEEVL 985



 Score = 38.1 bits (89), Expect = 0.004
 Identities = 51/203 (25%), Positives = 93/203 (45%), Gaps = 13/203 (6%)

Query: 10  EEEALLNEMEVTGQAFEDMQEQNSRLLQQL----REKDDANFKLMTERIKSNQLHKLARE 65
           ++E  L E+E   +  E +        QQL    RE++ A      ER ++    K   E
Sbjct: 171 KKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKA------ERYQALLKEKREYE 224

Query: 66  EKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAM-DLNKRKAI 124
             ++L E   AL+ Q +A+   +  LEE+   L   ++ +EK L    Q + +LNK+   
Sbjct: 225 GYELLKE-KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKD 283

Query: 125 ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
              +    +K  + +  A++   ++ + EK   LE    +  +L+ EI +L  + E +++
Sbjct: 284 LGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELER 343

Query: 185 -IEQAGTLDEVMMEEIREYKETL 206
            IE+     + + EE  E KE L
Sbjct: 344 EIEEERKRRDKLTEEYAELKEEL 366



 Score = 37.0 bits (86), Expect = 0.009
 Identities = 36/191 (18%), Positives = 78/191 (40%), Gaps = 13/191 (6%)

Query: 26  EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMH 85
           E ++ + S L  +LR  ++   +L  E   +++      +E + L ++   LK +++ + 
Sbjct: 684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELE 743

Query: 86  TAIRKLEEKERYLQTVLTNVEKEL--------HLRNQAMDLNKRKAIESAQSAADLKLHL 137
             +  LE++   +++ L  +E  +         L     DL  R +             L
Sbjct: 744 EDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKL 803

Query: 138 EKYHAQLKEAQQVVTEKTSSLEAE-AYKTKRLQEEIAQLR----RKTERMKKIEQAGTLD 192
           E+  ++++   + + +K + L  E  Y  K +QE   Q      +     K+IE      
Sbjct: 804 EEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKK 863

Query: 193 EVMMEEIREYK 203
           E + EE+ E +
Sbjct: 864 EELEEELEELE 874



 Score = 35.8 bits (83), Expect = 0.023
 Identities = 43/214 (20%), Positives = 91/214 (42%), Gaps = 17/214 (7%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERI--KSNQLHKLAREE 66
            E E L  E E   +  E+++E  S L +Q  E   +  K +  RI      LHKL    
Sbjct: 723 KEIEQLEQEEEKLKERLEELEEDLSSL-EQEIENVKSELKELEARIEELEEDLHKLEEAL 781

Query: 67  KDI----LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL--------HLRNQ 114
            D+     H ++  ++ ++  +   + ++E + R ++  L  +  E          L+ Q
Sbjct: 782 NDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQ 841

Query: 115 AMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQ 174
            +DL ++      +   +L    E+   +L+E +  + +  S L     +   L+ ++ +
Sbjct: 842 RIDLKEQIK-SIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRE 900

Query: 175 LRRKTERMK-KIEQAGTLDEVMMEEIREYKETLT 207
           L RK E ++ +IE+       +  ++   +E L+
Sbjct: 901 LERKIEELEAQIEKKRKRLSELKAKLEALEEELS 934



 Score = 28.9 bits (65), Expect = 3.6
 Identities = 31/167 (18%), Positives = 69/167 (41%), Gaps = 21/167 (12%)

Query: 22  GQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQI 81
            + + +++E+   L  +L E D    +   E        K  RE+ + L  ++N LK ++
Sbjct: 356 TEEYAELKEELEDLRAELEEVDKEFAETRDE-------LKDYREKLEKLKREINELKREL 408

Query: 82  DAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYH 141
           D      R  EE +R         E+   L      +  +   E  +   D  L ++K  
Sbjct: 409 D------RLQEELQRL-------SEELADLNAAIAGIEAKIN-ELEEEKEDKALEIKKQE 454

Query: 142 AQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
            +L++    +++    L     +  R+++E+++L+R+    +   +A
Sbjct: 455 WKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARA 501


>gnl|CDD|222279 pfam13639, zf-RING_2, Ring finger domain. 
          Length = 46

 Score = 39.7 bits (93), Expect = 5e-05
 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)

Query: 207 TCPSCK---VKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKCNA 248
            CP C       ++ V+  C HVF  +CL  ++      CP C A
Sbjct: 2   ECPICLDEFEPGEEVVVLPCGHVFHKECLD-KWLRSSNTCPLCRA 45


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 43.1 bits (102), Expect = 8e-05
 Identities = 38/201 (18%), Positives = 93/201 (46%), Gaps = 11/201 (5%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS-NQLHKLAREEK 67
            + E L  ++    +  ++++E+ + LL++L E    + + + ER+K     +    E K
Sbjct: 549 EKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELK 608

Query: 68  DILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESA 127
           D   E    L+ +   +     +L++    L      +E+   LR +  +L K+ + E  
Sbjct: 609 DAEKE----LEREEKELKKLEEELDKAFEELAETEKRLEE---LRKELEELEKKYSEEEY 661

Query: 128 QSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQ 187
           +   +  L L +  A L+   + + ++   ++      ++L+EE+ +  +  + ++K+E+
Sbjct: 662 EELREEYLELSRELAGLRAELEELEKRREEIKKTL---EKLKEELEEREKAKKELEKLEK 718

Query: 188 AGTLDEVMMEEIREYKETLTC 208
           A    E + E++++YK  L  
Sbjct: 719 ALERVEELREKVKKYKALLKE 739



 Score = 38.9 bits (91), Expect = 0.002
 Identities = 47/209 (22%), Positives = 102/209 (48%), Gaps = 10/209 (4%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
              E L   ++ T    E ++E+   L + LRE ++ + +L   R +  +L K   +E +
Sbjct: 176 RRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKE-VKELE 234

Query: 69  ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELH-LRNQAMDLNK------- 120
            L E++  L+ +++++  + RKLEEK R L+  +  ++KE+  L  +  +L +       
Sbjct: 235 ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEE 294

Query: 121 -RKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKT 179
             K  E  +   D    +EK  ++L+E    + E+   LE +  + + L++++ +L ++ 
Sbjct: 295 YIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRL 354

Query: 180 ERMKKIEQAGTLDEVMMEEIREYKETLTC 208
           E +++  +     +   EE+   K+ LT 
Sbjct: 355 EELEERHELYEEAKAKKEELERLKKRLTG 383


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
            bacterial type.  SMC (structural maintenance of
            chromosomes) proteins bind DNA and act in organizing and
            segregating chromosomes for partition. SMC proteins are
            found in bacteria, archaea, and eukaryotes. This family
            represents the SMC protein of most bacteria. The smc gene
            is often associated with scpB (TIGR00281) and scpA genes,
            where scp stands for segregation and condensation
            protein. SMC was shown (in Caulobacter crescentus) to be
            induced early in S phase but present and bound to DNA
            throughout the cell cycle [Cellular processes, Cell
            division, DNA metabolism, Chromosome-associated
            proteins].
          Length = 1179

 Score = 41.6 bits (98), Expect = 3e-04
 Identities = 41/198 (20%), Positives = 86/198 (43%), Gaps = 23/198 (11%)

Query: 9    NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
               E+L   +  T +  ED++EQ   L + +        +L     +     +    E+ 
Sbjct: 824  ERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERA 883

Query: 69   ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
             L E +  L+ +++ +   +R+LE K   L+                     R+  E  +
Sbjct: 884  SLEEALALLRSELEELSEELRELESKRSELR---------------------RELEELRE 922

Query: 129  SAADLKLHLEKYHAQLKEAQQVVTEKTS-SLEAEAYKTKRLQEEIAQLRRKTERMK-KIE 186
              A L+L LE    ++   Q+ ++E+ S +LE       +++++  + RR+ +R++ KI+
Sbjct: 923  KLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIK 982

Query: 187  QAGTLDEVMMEEIREYKE 204
            + G ++   +EE  E KE
Sbjct: 983  ELGPVNLAAIEEYEELKE 1000



 Score = 37.0 bits (86), Expect = 0.010
 Identities = 25/169 (14%), Positives = 66/169 (39%), Gaps = 9/169 (5%)

Query: 29  QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAI 88
           +E+ + L +QL E +    +L ++  +  +      E+ + L E++ +L+ +++ +   +
Sbjct: 308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAEL 367

Query: 89  RKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQ 148
            +LE +   L+  L  +  ++      +     +          L+   E+   +++E  
Sbjct: 368 EELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELL 427

Query: 149 Q---------VVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
           +         +  E     E      + L+     L    E +++ EQA
Sbjct: 428 KKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476



 Score = 37.0 bits (86), Expect = 0.010
 Identities = 29/142 (20%), Positives = 60/142 (42%), Gaps = 2/142 (1%)

Query: 64  REEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKA 123
           R E + L E++  L+ +I  +  A+ +L ++   L+  L  + KEL   ++ +   ++  
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDL 735

Query: 124 IESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMK 183
                    L+  + +   +L E +  + E    LE    +    + EI +L  + E++K
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795

Query: 184 KIEQAGTLDEVMMEEIREYKET 205
             E+   L E + E   E    
Sbjct: 796 --EELKALREALDELRAELTLL 815



 Score = 36.2 bits (84), Expect = 0.018
 Identities = 35/196 (17%), Positives = 78/196 (39%), Gaps = 15/196 (7%)

Query: 13  ALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHE 72
           AL  ++       E ++E+ ++L ++L E +    +L     ++ +    A  E + L  
Sbjct: 730 ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEA 789

Query: 73  QVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAAD 132
           Q+  LK ++ A+  A+ +L                EL L N+     + +     +  A 
Sbjct: 790 QIEQLKEELKALREALDELR--------------AELTLLNEEAANLRERLESLERRIAA 835

Query: 133 LKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQL-RRKTERMKKIEQAGTL 191
            +  LE    Q++E  + +    + +E      + L+ E+  L   +    + +    + 
Sbjct: 836 TERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895

Query: 192 DEVMMEEIREYKETLT 207
            E + EE+RE +   +
Sbjct: 896 LEELSEELRELESKRS 911



 Score = 35.0 bits (81), Expect = 0.038
 Identities = 39/174 (22%), Positives = 83/174 (47%), Gaps = 14/174 (8%)

Query: 5   QVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR 64
            +     E L  E+E   +  ++ +E+   L  +L+E ++   KL   R++ ++L +   
Sbjct: 228 ALLVLRLEELREELEELQEELKEAEEELEELTAELQELEE---KLEELRLEVSELEEEIE 284

Query: 65  EEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAI 124
           E +  L+   N            I +LE++++ L+  L N+E++L      ++  + K  
Sbjct: 285 ELQKELYALANE-----------ISRLEQQKQILRERLANLERQLEELEAQLEELESKLD 333

Query: 125 ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRK 178
           E A+  A+L+  LE+   +L+  +  + E  + LE    + + L+E++  LR K
Sbjct: 334 ELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSK 387



 Score = 35.0 bits (81), Expect = 0.040
 Identities = 23/167 (13%), Positives = 62/167 (37%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
            +   L   +    +  E+++E+  +L ++L E       L  +  +     +   E   
Sbjct: 691 EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIA 750

Query: 69  ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
            L +++  L+ +I+ +   + + EE+    +  +  +E ++    + +   +    E   
Sbjct: 751 QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRA 810

Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQL 175
               L         +L+  ++ +      LE    + + L E+I  L
Sbjct: 811 ELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESL 857



 Score = 35.0 bits (81), Expect = 0.044
 Identities = 25/138 (18%), Positives = 62/138 (44%), Gaps = 1/138 (0%)

Query: 47  FKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVE 106
           +K +   ++  +L  L    ++ L E++  L+ ++      + +L  + + L+  L  + 
Sbjct: 215 YKELKAELRELELALLVLRLEE-LREELEELQEELKEAEEELEELTAELQELEEKLEELR 273

Query: 107 KELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTK 166
            E+    + ++  +++    A   + L+   +    +L   ++ + E  + LE    K  
Sbjct: 274 LEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLD 333

Query: 167 RLQEEIAQLRRKTERMKK 184
            L EE+A+L  K E +K+
Sbjct: 334 ELAEELAELEEKLEELKE 351



 Score = 34.3 bits (79), Expect = 0.067
 Identities = 35/173 (20%), Positives = 78/173 (45%), Gaps = 5/173 (2%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
           +E    L E+E   +  E+++E+   L  +L E +    +L +   +  +  +  R +  
Sbjct: 333 DELAEELAELE---EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389

Query: 69  ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
            L  Q+ +L  +I+ +   + +LE++   LQ  +  + K+L      +   + +  E  +
Sbjct: 390 QLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE--EAELKELQAELEELEE 447

Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTER 181
              +L+  LE+    L+E ++ + E   +L+A   +  +LQ  +  L R  E 
Sbjct: 448 ELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQEN 500



 Score = 29.3 bits (66), Expect = 2.6
 Identities = 27/165 (16%), Positives = 66/165 (40%), Gaps = 18/165 (10%)

Query: 61  KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL-------HLRN 113
           +   E+   L + +  L+ +++ +   + +L ++   L   ++ + K+L           
Sbjct: 687 EELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLE 746

Query: 114 QAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEA-------QQVVTEKTSSLEAEAYKTK 166
           + +    ++  E      +L+  LE+   +L EA       +  + +    L+A      
Sbjct: 747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD 806

Query: 167 RLQEEIAQLRR----KTERMKKIEQAGTLDEVMMEEIREYKETLT 207
            L+ E+  L        ER++ +E+     E  +E++ E  E L+
Sbjct: 807 ELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELS 851



 Score = 28.9 bits (65), Expect = 3.7
 Identities = 36/188 (19%), Positives = 88/188 (46%)

Query: 10  EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDI 69
           E E L  E++   +  E+++ + S L +++ E     + L  E  +  Q  ++ RE    
Sbjct: 254 ELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLAN 313

Query: 70  LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQS 129
           L  Q+  L+ Q++ + + + +L E+   L+  L  +++EL      ++  + +  E    
Sbjct: 314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373

Query: 130 AADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAG 189
             +L+  LE   +++ + +  +    + +E    + +RL++   +L+++ E + K  +  
Sbjct: 374 LEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEA 433

Query: 190 TLDEVMME 197
            L E+  E
Sbjct: 434 ELKELQAE 441


>gnl|CDD|193335 pfam12861, zf-Apc11, Anaphase-promoting complex subunit 11 RING-H2
           finger.  Apc11 is one of the subunits of the
           anaphase-promoting complex or cyclosome. The APC
           subunits are cullin family proteins with ubiquitin
           ligase activity. Polyubiquitination marks proteins for
           degradation by the 26S proteasome and is carried out by
           a cascade of enzymes that includes ubiquitin-activating
           enzymes (E1s), ubiquitin-conjugating enzymes (E2s), and
           ubiquitin ligases (E3s). Apc11 acts as an E3 enzyme and
           is responsible for recruiting E2s to the APC and for
           mediating the subsequent transfer of ubiquitin to APC
           substrates in vivo. In Saccharomyces cerevisiae this
           RING-H2 finger protein defines the minimal ubiquitin
           ligase activity of the APC, and the integrity of the
           RING-H2 finger is essential for budding yeast cell
           viability.
          Length = 85

 Score = 37.1 bits (86), Expect = 8e-04
 Identities = 18/48 (37%), Positives = 19/48 (39%), Gaps = 4/48 (8%)

Query: 207 TCPSCKVKRKDA--VLTKCFHVFCWDCLRTRYETRQRK--CPKCNAAF 250
           TCP CK    D   V  KC H F   C+     T   K  CP C   F
Sbjct: 34  TCPQCKFPGDDCPLVWGKCKHNFHMHCILKWLATETSKGLCPMCRQTF 81


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 40.0 bits (93), Expect = 8e-04
 Identities = 33/185 (17%), Positives = 78/185 (42%), Gaps = 4/185 (2%)

Query: 26  EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMH 85
           +D++ Q   L  +L           TER  +    + AR+E++ + +++ A +  +    
Sbjct: 77  DDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQ 136

Query: 86  TAIRKLEEKERYLQTVLTNVEKELHLRNQAMD--LNKRKAIESAQSAADLK-LHLEKYHA 142
             + +L ++ + LQT L  + ++              +K ++++ +    + L L+   A
Sbjct: 137 QELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSA 196

Query: 143 QLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRK-TERMKKIEQAGTLDEVMMEEIRE 201
           Q+++  Q +  + ++ +A   +  R      Q  +   +R  +I Q         E+IRE
Sbjct: 197 QIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRE 256

Query: 202 YKETL 206
            +  L
Sbjct: 257 RERQL 261



 Score = 33.5 bits (76), Expect = 0.12
 Identities = 32/184 (17%), Positives = 72/184 (39%), Gaps = 11/184 (5%)

Query: 5   QVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR 64
           Q +  E EA+  E+    Q     Q++ +RL +Q ++       L  +R +     +  +
Sbjct: 112 QKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQ 171

Query: 65  EEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAI 124
             +  L      LK Q+  +     ++E++ + L T           R  A      +  
Sbjct: 172 ASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLAT-----------RANAAQARTEELA 220

Query: 125 ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
             A +A      +++  AQ+ +  Q +  +   +     + +RL+   A+L ++  +++ 
Sbjct: 221 RRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLETAQARLEQEVAQLEA 280

Query: 185 IEQA 188
             QA
Sbjct: 281 YYQA 284



 Score = 31.5 bits (71), Expect = 0.40
 Identities = 31/198 (15%), Positives = 75/198 (37%), Gaps = 7/198 (3%)

Query: 12  EALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILH 71
            AL  E+          + +      +L++       +  E   + Q    A++E   L 
Sbjct: 84  RALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLT 143

Query: 72  EQVNALKLQIDAMHTAIRKLEEKERYL---QTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
           +Q   L+ ++  +    R+LE + + L   Q  L     +L  +   + L   +  + AQ
Sbjct: 144 KQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQ 203

Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIA----QLRRKTERMKK 184
           + A      +    +L        +   +++    +  +  ++IA    Q+R +  ++++
Sbjct: 204 NLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQR 263

Query: 185 IEQAGTLDEVMMEEIREY 202
           +E A    E  + ++  Y
Sbjct: 264 LETAQARLEQEVAQLEAY 281


>gnl|CDD|222454 pfam13920, zf-C3HC4_3, Zinc finger, C3HC4 type (RING finger). 
          Length = 49

 Score = 35.0 bits (81), Expect = 0.002
 Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 3/46 (6%)

Query: 204 ETLTCPSCKVKRKDAVLTKCFH-VFCWDCLRTRYETRQRKCPKCNA 248
           E   C  C  + ++ V   C H   C +C +      ++KCP C  
Sbjct: 1   EDDLCVICLERPRNVVFLPCGHLCLCEECAKRLR--SKKKCPICRQ 44


>gnl|CDD|233043 TIGR00599, rad18, DNA repair protein rad18.  All proteins in this
           family for which functions are known are involved in
           nucleotide excision repair.This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 397

 Score = 38.4 bits (89), Expect = 0.003
 Identities = 20/45 (44%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 206 LTCPSCKVKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKCNAAF 250
           L C  CK      VLT C H FC  C+R R  + Q KCP C A  
Sbjct: 27  LRCHICKDFFDVPVLTSCSHTFCSLCIR-RCLSNQPKCPLCRAED 70


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 37.7 bits (88), Expect = 0.006
 Identities = 25/177 (14%), Positives = 73/177 (41%), Gaps = 24/177 (13%)

Query: 24  AFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDA 83
            ++++ EQ   L+ +     +   + +   + + + +   + + + L   + AL+  ++ 
Sbjct: 279 EYQELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILE- 337

Query: 84  MHTAIRKLEEKERYLQTVLT---------NVEKELHLRNQAMDLNKRKA--IESAQSAA- 131
               ++KLEEK +   T +          ++   +   N+ +  +  K   ++  ++ A 
Sbjct: 338 --KNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDNLKKEKNKAK 395

Query: 132 -DLKLH--------LEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKT 179
             L LH        ++ Y  + K  ++ +      ++    + K L++EI +L ++ 
Sbjct: 396 KKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKELEKQL 452


>gnl|CDD|227861 COG5574, PEX10, RING-finger-containing E3 ubiquitin ligase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 271

 Score = 36.8 bits (85), Expect = 0.007
 Identities = 25/107 (23%), Positives = 38/107 (35%), Gaps = 9/107 (8%)

Query: 147 AQQVVTEKTSSLEAEA--YKTKRLQEEIAQLRRKTERMKKI--EQAGTLDEVMMEEIREY 202
               + E  S L A A      RLQ  +          + I  E     + +    + +Y
Sbjct: 157 LAMNIPEVISDLTAVALSLDESRLQPILQPSNNLHTLFQVITKENLSKKNGLPFIPLADY 216

Query: 203 KETLTCPSCKVKRKDAVLTKCFHVFCWDCLRTRYETRQ-RKCPKCNA 248
           K    C  C  + +    T C H+FC  CL   +  ++   CP C A
Sbjct: 217 K----CFLCLEEPEVPSCTPCGHLFCLSCLLISWTKKKYEFCPLCRA 259


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 36.6 bits (85), Expect = 0.007
 Identities = 35/155 (22%), Positives = 65/155 (41%), Gaps = 13/155 (8%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRL---LQQLREKDDANFKLMTERIKSNQLHKLARE 65
            E EAL   +E      ED++ Q S+L   +Q++RE+     + ++      +L  L  E
Sbjct: 38  AELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIE 97

Query: 66  EKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIE 125
              I  E++N+L+ ++  +   I KLE++   L+  L  +EK L      ++    +  E
Sbjct: 98  I-QIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIRE 156

Query: 126 SAQSAADLKLHL---------EKYHAQLKEAQQVV 151
             Q  +  +  L          +Y    K  + V 
Sbjct: 157 EGQELSSKREELKEKLDPELLSEYERIRKNKKGVG 191


>gnl|CDD|222135 pfam13445, zf-RING_LisH, RING-type zinc-finger, LisH dimerisation
           motif.  This zinc-finger is the dimerisation motif for
           LisH proteins, and is also a typical RING-type of plant
           ubiquitin ligases.
          Length = 55

 Score = 33.1 bits (76), Expect = 0.013
 Identities = 12/54 (22%), Positives = 17/54 (31%), Gaps = 5/54 (9%)

Query: 196 MEEIREYKETLTCPSCKVKRKDA----VLTKCFHVFCWDCLRT-RYETRQRKCP 244
           +     +     CP  K    D     V+  C HV+    L        + KCP
Sbjct: 1   LPYAHHFHSIFVCPISKEVMTDEENPPVMLPCGHVYSRKALEKLAKNGGKFKCP 54


>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase.  BRE1 is an E3 ubiquitin
           ligase that has been shown to act as a transcriptional
           activator through direct activator interactions.
          Length = 96

 Score = 33.8 bits (78), Expect = 0.016
 Identities = 22/96 (22%), Positives = 46/96 (47%)

Query: 14  LLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQ 73
           L  E+    +AF+++     + + +L   +    +L  E+ K++Q +  A   KD L  +
Sbjct: 1   LQKELVSLEKAFKELSSLALKKVAELTILEQKKLRLTAEKAKADQKYFAAMRSKDALDAE 60

Query: 74  VNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL 109
              L  Q++     I +L++ E  L+  +  +EK+L
Sbjct: 61  NKKLNTQLNKSSELIEQLKDLENELRRRIKELEKQL 96


>gnl|CDD|178747 PLN03208, PLN03208, E3 ubiquitin-protein ligase RMA2; Provisional.
          Length = 193

 Score = 35.4 bits (81), Expect = 0.016
 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 15/56 (26%)

Query: 208 CPSCKVKRKDAVLTKCFHVFCWDCLR----TRYETRQR-----------KCPKCNA 248
           C  C  + +D V+T C H+FCW C+         +RQR           KCP C +
Sbjct: 21  CNICLDQVRDPVVTLCGHLFCWPCIHKWTYASNNSRQRVDQYDHKREPPKCPVCKS 76


>gnl|CDD|227547 COG5222, COG5222, Uncharacterized conserved protein, contains RING
           Zn-finger [General function prediction only].
          Length = 427

 Score = 35.9 bits (82), Expect = 0.017
 Identities = 15/43 (34%), Positives = 21/43 (48%), Gaps = 1/43 (2%)

Query: 205 TLTCPSCKVKRKDAVLTK-CFHVFCWDCLRTRYETRQRKCPKC 246
           +L CP C    ++ + T  C H FC +C+ T       KCP C
Sbjct: 274 SLKCPLCHCLLRNPMKTPCCGHTFCDECIGTALLDSDFKCPNC 316


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 35.5 bits (81), Expect = 0.030
 Identities = 50/199 (25%), Positives = 87/199 (43%), Gaps = 11/199 (5%)

Query: 11   EEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDIL 70
            EEA + E+    +  + M+ + ++  ++ + K +   K   E+ K  QL K   EEK   
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649

Query: 71   HEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSA 130
             E   A +          +K EE ++  +      +K      +A +  K++A E A+ A
Sbjct: 1650 EELKKAEEENKIKAAEEAKKAEEDKKKAE----EAKKAEEDEKKAAEALKKEA-EEAKKA 1704

Query: 131  ADLKLHLE---KYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQ 187
             +LK       K   +LK+A++    K    + EA + K+  EE    ++  E  KKI  
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEE---AKKDEEEKKKIAH 1761

Query: 188  AGTLDEVMMEEIREYKETL 206
                +E   EEIR+ KE +
Sbjct: 1762 LKKEEEKKAEEIRKEKEAV 1780



 Score = 30.1 bits (67), Expect = 1.4
 Identities = 37/177 (20%), Positives = 80/177 (45%), Gaps = 11/177 (6%)

Query: 37   QQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKER 96
            ++ ++ D+A  K   ++ K+++L K A  +K    ++      +      A +K EE ++
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKA--DEAKKKAEEKKKADEAKKKAEEAKK 1445

Query: 97   Y--LQTVLTNVEKELHLRNQAMDLNK----RKAIESAQSAADLKLHLEKYHAQLKEAQQV 150
                +      +K    + +A +  K    +K  E A+ A + K   E+   +  EA++ 
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505

Query: 151  VTEKTSSLEAEAYKTKRLQEEI--AQLRRKTERMKKIEQAGTLDEV-MMEEIREYKE 204
               K  + EA+  +  +  +E   A+  +K +  KK E+    DE+   EE+++ +E
Sbjct: 1506 AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEE 1562


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 35.4 bits (81), Expect = 0.032
 Identities = 37/184 (20%), Positives = 59/184 (32%), Gaps = 15/184 (8%)

Query: 15  LNEMEVTGQAFEDM----QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAR--EEKD 68
           L E      + ED       +      +L+E+ +A   +      S  L    R   E  
Sbjct: 621 LRETVKAMLSREDFYMIKIMRQQGEYIKLQEQANALAHIQALNFASIDLPSAQRQIAELQ 680

Query: 69  ILHEQVNA-------LKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKR 121
              E++          K  +DA  T  + LE +  Y Q V      +  L+  AM   K 
Sbjct: 681 ARLERLTHTQSDIAIAKAALDAAQTRQKVLERQ--YQQEVTECAGLKKDLKRAAMLSRKV 738

Query: 122 KAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTER 181
            +I        L+     +  Q+   Q         +E      KR+    A+LRR  E 
Sbjct: 739 HSIAKQGMTGALQALGAAHFPQVAPEQHDDIVDIERIEHRRQLQKRIDAVNARLRRLREE 798

Query: 182 MKKI 185
           +   
Sbjct: 799 IIGR 802


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 34.7 bits (80), Expect = 0.045
 Identities = 22/99 (22%), Positives = 44/99 (44%), Gaps = 15/99 (15%)

Query: 9   NEEEALLNEM-EVTGQAFEDMQEQNS-RLLQQLREKDDANFKLMTERIKSNQLHKLAREE 66
           +E + L NE+ E   +   ++ +QN  +LL+ LRE  +   + + +RI  +         
Sbjct: 105 SEFQELANEIFEELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHESA-------- 156

Query: 67  KDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNV 105
                E+ + L  +ID +   I++L ++   L   L   
Sbjct: 157 -----EERSTLLEEIDRLLGEIQQLAQEAGNLTAALKGN 190


>gnl|CDD|223447 COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion
           transport and metabolism].
          Length = 653

 Score = 34.6 bits (80), Expect = 0.046
 Identities = 33/180 (18%), Positives = 68/180 (37%), Gaps = 18/180 (10%)

Query: 30  EQNSRLLQQLREK-----------DDANFKLMTERIKSNQLHKLAREEKDILHEQVNALK 78
           E+N  L  QL E            D+A        I+ +   KL++     +   V    
Sbjct: 95  ERNLYLTLQLLELGIPMILALNMIDEAK----KRGIRIDI-EKLSKLLGVPVVPTVAKRG 149

Query: 79  LQIDAMHTAIRKLEEKE--RYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLH 136
             ++ +  AI +L E +           +E+E+       +  +  AI+  +    ++  
Sbjct: 150 EGLEELKRAIIELAESKTTPREVDYGEEIEEEIKELEALSEDPRWLAIKLLEDDELVEAV 209

Query: 137 LEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMM 196
           L++   +++E  + ++E+   L   A     L E I +   K E  +K      +D V++
Sbjct: 210 LKEPEKRVEELLEELSEEEGHLLLIADARYALIERILRSVVKQEEEEKSSLTDKIDRVLL 269


>gnl|CDD|227827 COG5540, COG5540, RING-finger-containing ubiquitin ligase
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 374

 Score = 33.8 bits (77), Expect = 0.072
 Identities = 17/73 (23%), Positives = 26/73 (35%), Gaps = 3/73 (4%)

Query: 180 ERMKKIEQAGTLDEVMMEEIREYKETLTCPSCK---VKRKDAVLTKCFHVFCWDCLRTRY 236
           E +      G+L  + +E   E  + + C  C    +K     +  C H F   C+    
Sbjct: 298 EAIPTTTTKGSLKPLSIERAVEADKGVECAICMSNFIKNDRLRVLPCDHRFHVGCVDKWL 357

Query: 237 ETRQRKCPKCNAA 249
                KCP C  A
Sbjct: 358 LGYSNKCPVCRTA 370


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 33.8 bits (78), Expect = 0.082
 Identities = 29/146 (19%), Positives = 59/146 (40%), Gaps = 17/146 (11%)

Query: 58  QLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMD 117
               L    K  L  Q+  +  QI  +   +  L+ + + L+  L  + +EL  R +   
Sbjct: 130 GQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRK--- 186

Query: 118 LNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQL-- 175
           L ++  +         +L L +   +  EAQ  +    + LE    +   LQ E  Q+  
Sbjct: 187 LKEKGLVS--------RLELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQQIEQ 238

Query: 176 RRKTERMKKIEQAGTLDEVMMEEIRE 201
             + E ++++ +A    +  + E+RE
Sbjct: 239 TFREEVLEELTEA----QARLAELRE 260


>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
          Length = 156

 Score = 32.1 bits (74), Expect = 0.12
 Identities = 17/68 (25%), Positives = 31/68 (45%), Gaps = 5/68 (7%)

Query: 122 KAIESA-QSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTE 180
             + +A ++  +L+L   KY AQL EA+     + + +  +A K      E A+   + E
Sbjct: 42  DGLAAAERAKKELELAQAKYEAQLAEARA----EAAEIIEQAKKRAAQIIEEAKAEAEAE 97

Query: 181 RMKKIEQA 188
             +   QA
Sbjct: 98  AARIKAQA 105


>gnl|CDD|227719 COG5432, RAD18, RING-finger-containing E3 ubiquitin ligase [Signal
           transduction mechanisms].
          Length = 391

 Score = 33.1 bits (75), Expect = 0.12
 Identities = 15/45 (33%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query: 206 LTCPSCKVKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPKCNAAF 250
           L C  C  +      T C H FC  C+R R+   Q  CP C    
Sbjct: 26  LRCRICDCRISIPCETTCGHTFCSLCIR-RHLGTQPFCPVCREDP 69


>gnl|CDD|222944 PHA02929, PHA02929, N1R/p28-like protein; Provisional.
          Length = 238

 Score = 32.8 bits (75), Expect = 0.14
 Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 219 VLTKCFHVFCWDCLRTRYETRQRKCPKCNAAF 250
           +L+ C HVFC +C+   ++  +  CP C   F
Sbjct: 196 ILSNCNHVFCIECI-DIWKKEKNTCPVCRTPF 226


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 32.8 bits (75), Expect = 0.20
 Identities = 50/199 (25%), Positives = 91/199 (45%), Gaps = 26/199 (13%)

Query: 3   LGQVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMT----ERIKSNQ 58
           L +     E + L   ++  + +  +Q Q    L+QL+E+   NF+L       RIK  +
Sbjct: 219 LQERLAQLEGSSLGPNQLGSKKYNLLQSQ----LEQLQEE---NFRLEAARDDYRIKCEE 271

Query: 59  LHKLARE------EKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLR 112
           L K   E      E   L  +  ALK +ID +  +  K ++ E  ++T    +E    LR
Sbjct: 272 LEKELAELQHRNDELTSLAAESQALKDEIDVLRESSDKAKKLEAQVETYKKKLEDLNDLR 331

Query: 113 NQAMDLNKRKAI---------ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAY 163
            Q   L +R A+         E  + A   +  LE Y  Q++E    ++E++   +   +
Sbjct: 332 RQVKLLEERNAMYMQNTVQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADKLEF 391

Query: 164 KTKRLQEEIAQLRRKTERM 182
           + KRL+E++  L+++ ER+
Sbjct: 392 EYKRLEEKLEALQKEKERL 410


>gnl|CDD|236292 PRK08566, PRK08566, DNA-directed RNA polymerase subunit A';
           Validated.
          Length = 882

 Score = 32.5 bits (75), Expect = 0.22
 Identities = 15/54 (27%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 169 QEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYKETLTCPSCKVKRKDAVLTK 222
           +EEI +   K ER+K+       D+++ E  +E  + + CP C  K+      K
Sbjct: 111 EEEIEEYLEKLERLKEWGSL--ADDLIKEVKKEAAKRMVCPHCGEKQYKIKFEK 162


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 32.3 bits (74), Expect = 0.22
 Identities = 19/102 (18%), Positives = 46/102 (45%), Gaps = 4/102 (3%)

Query: 61  KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNK 120
           KL R   D++ E + A  L +D     +  L+++ + L + +  ++ + +  ++ +   K
Sbjct: 5   KLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAK 64

Query: 121 RKAIESAQSAA----DLKLHLEKYHAQLKEAQQVVTEKTSSL 158
            +  +  +       +LK  L +  A LK  +  + +K  S+
Sbjct: 65  GQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSI 106


>gnl|CDD|233843 TIGR02390, RNA_pol_rpoA1, DNA-directed RNA polymerase subunit A'.
           This family consists of the archaeal A' subunit of the
           DNA-directed RNA polymerase. The example from
           Methanocaldococcus jannaschii contains an intein.
          Length = 867

 Score = 32.4 bits (74), Expect = 0.23
 Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 3/58 (5%)

Query: 169 QEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREY-KETLTCPSCKVKRKDAVLTKCFH 225
           +EEI Q   K  ++K  E+ G L   ++E+I +   + + CP C  ++K     K  +
Sbjct: 106 EEEIEQYLEKINKLK--EEGGDLASTLIEKIVKEAAKRMKCPHCGEEQKKIKFEKPTY 161


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 32.1 bits (74), Expect = 0.27
 Identities = 22/98 (22%), Positives = 45/98 (45%), Gaps = 17/98 (17%)

Query: 26  EDMQEQNSRLLQQLREK-DDANFKL-MT------------ERIKSNQLHKLAREEKDILH 71
           E+ QE+     ++L E+  +  F L  T            E +   +   L  EE++ L 
Sbjct: 122 EEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEEFEALPEEEREELE 181

Query: 72  EQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL 109
           E+++ L+   + +   +R+L E ER  +  L  +++E+
Sbjct: 182 EKIDELE---EELQEILRQLRELEREAREKLRELDREV 216


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 31.7 bits (72), Expect = 0.39
 Identities = 34/176 (19%), Positives = 77/176 (43%), Gaps = 10/176 (5%)

Query: 10  EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDI 69
           EE  LL   +   +   +  ++  + L++L E+     +L+     S   ++L   E++ 
Sbjct: 576 EELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEE- 634

Query: 70  LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQS 129
           L  ++  L LQ +        LEE E  ++ +   + +EL         N+ +  E  + 
Sbjct: 635 LESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI-----ENEEQLEEKLEE 689

Query: 130 AADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI 185
              L+  LE+   +L+E      +K   +E    + +  + E+ +L+++ E+++K 
Sbjct: 690 LEQLEEELEQLREELEELL----KKLGEIEQLIEELESRKAELEELKKELEKLEKA 741



 Score = 29.3 bits (66), Expect = 2.1
 Identities = 38/199 (19%), Positives = 79/199 (39%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
            E EAL   +    +  E ++E  +RLL+    + +A      E  +  +L +   E+ +
Sbjct: 232 QEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIE 291

Query: 69  ILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQ 128
            L E    ++   + +      LEE E  L+ + +  E+   L  +   L       + +
Sbjct: 292 RLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351

Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
                KL  E+     +  +++  E   +LE      + +QE   +L   +  +++I++ 
Sbjct: 352 KNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEE 411

Query: 189 GTLDEVMMEEIREYKETLT 207
               E  +EE+    E L 
Sbjct: 412 LEELEKELEELERELEELE 430



 Score = 28.6 bits (64), Expect = 3.8
 Identities = 27/144 (18%), Positives = 60/144 (41%), Gaps = 20/144 (13%)

Query: 63  AREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRK 122
            +E  ++   ++  L+ ++        +L E+   L+  L  +E+EL    +  +  K +
Sbjct: 472 EKELLELYELELEELEEELSREKE-EAELREEIEELEKELRELEEELIELLELEEALKEE 530

Query: 123 AIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERM 182
             E  +   +L   LE+   +L+                    ++L+EE+ QL  + + +
Sbjct: 531 LEEKLEKLENLLEELEELKEKLQ-------------------LQQLKEELRQLEDRLQEL 571

Query: 183 KKIEQAGTLDEVMMEEIREYKETL 206
           K++ +   L     EE+ E +E L
Sbjct: 572 KELLEELRLLRTRKEELEELRERL 595


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 31.5 bits (72), Expect = 0.39
 Identities = 32/146 (21%), Positives = 66/146 (45%), Gaps = 25/146 (17%)

Query: 64  REEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKA 123
           + ++++L++ +N +       ++   KL +K + L+  + +      LR  A +LN    
Sbjct: 160 KRDEELLNKDLNLI-------NSIKPKLRKKLQALKEEIAS------LRQLADELNLCDP 206

Query: 124 IESAQSAADLKLH---LEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTE 180
           +E  ++  +L+     + +   QL+E QQ + E T ++EA   K   L EEIA+  +  E
Sbjct: 207 LELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIRE 266

Query: 181 RMKKIEQAGTLDEVMMEEIREYKETL 206
             +             +EI + K  +
Sbjct: 267 ECRGWS---------AKEISKLKAKV 283


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 30.7 bits (70), Expect = 0.45
 Identities = 23/91 (25%), Positives = 41/91 (45%), Gaps = 3/91 (3%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLA---RE 65
            E   L  E+    +   ++Q++N  L Q+L   +    +L  E  +  QL   A    E
Sbjct: 73  QELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDE 132

Query: 66  EKDILHEQVNALKLQIDAMHTAIRKLEEKER 96
           E   L E++  LK + +A+     +L+E E+
Sbjct: 133 ENRELREELAELKQENEALEAENERLQENEQ 163



 Score = 30.4 bits (69), Expect = 0.68
 Identities = 18/91 (19%), Positives = 37/91 (40%), Gaps = 3/91 (3%)

Query: 12  EALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIK-SNQLHKLAREEKDI- 69
             L  E+    +   ++QEQ + L Q+ +E       L  E  +   +L ++ +   +  
Sbjct: 69  PELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI 128

Query: 70  -LHEQVNALKLQIDAMHTAIRKLEEKERYLQ 99
            L E+   L+ ++  +      LE +   LQ
Sbjct: 129 ELDEENRELREELAELKQENEALEAENERLQ 159


>gnl|CDD|220784 pfam10497, zf-4CXXC_R1, Zinc-finger domain of monoamine-oxidase A
           repressor R1.  R1 is a transcription factor repressor
           that inhibits monoamine oxidase A gene expression. This
           domain is a four-CXXC zinc finger putative DNA-binding
           domain found at the C-terminal end of R1. The domain
           carries 12 cysteines of which four pairs are of the CXXC
           type.
          Length = 105

 Score = 29.7 bits (67), Expect = 0.46
 Identities = 16/45 (35%), Positives = 18/45 (40%), Gaps = 8/45 (17%)

Query: 210 SCKVKRKDAVLTKCFHVFCWDCLRTRY-----ETRQRK---CPKC 246
           SCK K+K          FC  CLR RY     E  +     CP C
Sbjct: 23  SCKDKKKGKECHGVRGQFCGKCLRNRYGENADEAAKDPDWICPPC 67


>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
          Length = 1047

 Score = 31.3 bits (71), Expect = 0.52
 Identities = 30/127 (23%), Positives = 55/127 (43%), Gaps = 20/127 (15%)

Query: 70  LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQS 129
            H +   L+ ++ A+H  I   +++   LQ  + NV +E   RN A++  +++  E  Q 
Sbjct: 417 QHAEQRPLRQRLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQ 476

Query: 130 AADLKL---------HLEKYHAQLKEAQ---------QVVTEKTSSLEAEAYKTKR--LQ 169
            AD+K           LE   AQL+  Q             E   +LE    +++   L+
Sbjct: 477 LADVKTICEQEARIKDLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLDALE 536

Query: 170 EEIAQLR 176
           +E+ +L 
Sbjct: 537 KEVKKLG 543


>gnl|CDD|146145 pfam03357, Snf7, Snf7.  This family of proteins are involved in
           protein sorting and transport from the endosome to the
           vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
           play an important role in the degradation of both lipids
           and cellular proteins. In order to perform this
           degradative function, vacuoles/lysosomes contain
           numerous hydrolases which have been transported in the
           form of inactive precursors via the biosynthetic pathway
           and are proteolytically activated upon delivery to the
           vacuole/lysosome. The delivery of transmembrane
           proteins, such as activated cell surface receptors to
           the lumen of the vacuole/lysosome, either for
           degradation/downregulation, or in the case of
           hydrolases, for proper localisation, requires the
           formation of multivesicular bodies (MVBs). These late
           endosomal structures are formed by invaginating and
           budding of the limiting membrane into the lumen of the
           compartment. During this process, a subset of the
           endosomal membrane proteins is sorted into the forming
           vesicles. Mature MVBs fuse with the vacuole/lysosome,
           thereby releasing cargo containing vesicles into its
           hydrolytic lumen for degradation. Endosomal proteins
           that are not sorted into the intralumenal MVB vesicles
           are either recycled back to the plasma membrane or Golgi
           complex, or remain in the limiting membrane of the MVB
           and are thereby transported to the limiting membrane of
           the vacuole/lysosome as a consequence of fusion.
           Therefore, the MVB sorting pathway plays a critical role
           in the decision between recycling and degradation of
           membrane proteins. A few archaeal sequences are also
           present within this family.
          Length = 169

 Score = 30.3 bits (69), Expect = 0.53
 Identities = 27/129 (20%), Positives = 59/129 (45%), Gaps = 18/129 (13%)

Query: 81  IDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKY 140
           I ++  AIR+L++K+  L+  +  +E E  ++  A   NK  A           + L++ 
Sbjct: 3   ILSLRKAIRELDKKQESLEKKIKKLEAE--IKKLAKKGNKDAA----------LILLKQK 50

Query: 141 HAQLKEAQQVVTEKTSSLEAEAYKTKRL---QEEIAQLRRKTERMKKIEQAGTLDEV--M 195
               K+  Q+  +  ++LE      +     QE +  ++   + MK + +   +D++  +
Sbjct: 51  KRYEKQLDQLDGQL-ANLEQVRMAIENAKTNQEVLNAMKGGAKAMKAMNKNMDIDKIDDL 109

Query: 196 MEEIREYKE 204
           M+EI +  E
Sbjct: 110 MDEIEDQME 118


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 31.5 bits (72), Expect = 0.55
 Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 4/62 (6%)

Query: 136 HLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVM 195
           +L++    LKE   ++  K         K +RL EE+ +L ++ ER+KK   A  L +++
Sbjct: 709 YLQEQEDLLKEIASLL--KVP--PELLPKVERLLEELKELEKELERLKKKLAAAELADLL 764

Query: 196 ME 197
             
Sbjct: 765 SN 766


>gnl|CDD|227568 COG5243, HRD1, HRD ubiquitin ligase complex, ER membrane component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 491

 Score = 31.1 bits (70), Expect = 0.56
 Identities = 20/82 (24%), Positives = 28/82 (34%), Gaps = 7/82 (8%)

Query: 171 EIAQLRRKTERMKK------IEQAGTLDEVMMEEIREYKETLTCPSCKVKRKDAVLTKCF 224
           E A+ RR T+ +         EQ    D      + E       P  +          C 
Sbjct: 260 EHARFRRATKDLNAMYPTATEEQLTNSDRTCTICMDEMFHPDHEPLPRGLDMTPKRLPCG 319

Query: 225 HVFCWDCLRTRYETRQRKCPKC 246
           H+    CL+   E RQ+ CP C
Sbjct: 320 HILHLHCLKNWLE-RQQTCPIC 340


>gnl|CDD|218160 pfam04587, ADP_PFK_GK, ADP-specific Phosphofructokinase/Glucokinase
           conserved region.  In archaea a novel type of glycolytic
           pathway exists that is deviant from the classical
           Embden-Meyerhof pathway. This pathway utilises two novel
           proteins: an ADP-dependent Glucokinase and an
           ADP-dependent Phosphofructokinase. This conserved region
           is present at the C-terminal of both these proteins.
           Interestingly this family contains sequences from higher
           eukaryotes..
          Length = 444

 Score = 30.9 bits (70), Expect = 0.64
 Identities = 14/54 (25%), Positives = 25/54 (46%)

Query: 149 QVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREY 202
           Q++T   S  +   Y  KR++E+I  L++       +E A   D  + +EI   
Sbjct: 219 QMLTLFYSDGKDAEYYLKRVKEDIKLLKKPKGIKVHLEFASIQDRELRKEIVHN 272


>gnl|CDD|220664 pfam10267, Tmemb_cc2, Predicted transmembrane and coiled-coil 2
           protein.  This family of transmembrane coiled-coil
           containing proteins is conserved from worms to humans.
           Its function is unknown.
          Length = 387

 Score = 30.9 bits (70), Expect = 0.65
 Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 11/82 (13%)

Query: 78  KLQIDAMHTAIRKLEEKERYLQT-VLTNVEKELHLRNQAMDLNKRKAIE---------SA 127
           K  ID++   I KL+E+ R  QT    NV + L L ++A D  +   I+         SA
Sbjct: 3   KAAIDSLKQKILKLKEQLRVEQTARDENVAEYLKLVSKA-DKQQVGRIKQVFEKKNQKSA 61

Query: 128 QSAADLKLHLEKYHAQLKEAQQ 149
            + A L+  LE+YH +LKE ++
Sbjct: 62  HTIAQLQKKLEQYHQKLKELEE 83


>gnl|CDD|226419 COG3904, COG3904, Predicted periplasmic protein [Function unknown].
          Length = 245

 Score = 30.7 bits (69), Expect = 0.65
 Identities = 13/47 (27%), Positives = 20/47 (42%)

Query: 13  ALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQL 59
           A L EM +     + M       ++QL  K+    KL+T R   + L
Sbjct: 197 AYLREMGIGPGLLQMMLATPPSDIRQLGLKEMTAMKLVTSRDAVDLL 243


>gnl|CDD|238862 cd01824, Phospholipase_B_like, Phospholipase-B_like. This subgroup
           of the SGNH-family of lipolytic enzymes may have both
           esterase and phospholipase-A/lysophospholipase activity.
            It's members may be involved in the conversion of
           phosphatidylcholine to fatty acids and
           glycerophosphocholine, perhaps in the context of dietary
           lipid uptake. Members may be membrane proteins. The
           tertiary fold of the SGNH-hydrolases is substantially
           different from that of the alpha/beta hydrolase family
           and unique among all known hydrolases; Its active site
           closely resembles two of the three components of typical
           Ser-His-Asp(Glu) triad from other serine hydrolases.
          Length = 288

 Score = 30.8 bits (70), Expect = 0.68
 Identities = 26/124 (20%), Positives = 48/124 (38%), Gaps = 22/124 (17%)

Query: 2   SLGQVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANF----KLMTERIKSN 57
           S+G       ++  N +   G   ED+ +Q   L++++++    +F    KL+T  I  N
Sbjct: 72  SVGTGDETLPDSGFN-VAEPGAKSEDLPQQARLLVRRMKKDPRVDFKNDWKLITIFIGGN 130

Query: 58  ---------QLHKLAREEK------DILHEQV-NALKLQIDAMHTAI-RKLEEKERYLQT 100
                             K      DIL ++V  A    +  ++ A  R L +K    +T
Sbjct: 131 DLCSLCEDANPGSPQTFVKNLRKALDILRDEVPRAFVNLVGLLNVASLRSLTKKPLQCET 190

Query: 101 VLTN 104
           +L  
Sbjct: 191 LLAP 194


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.8 bits (70), Expect = 0.74
 Identities = 39/221 (17%), Positives = 89/221 (40%), Gaps = 32/221 (14%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
            E   L +E+E   +A ED +E+   L +++ E  +       +   +    +  REE+D
Sbjct: 363 EEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERD 422

Query: 69  ILHEQVNALKLQIDAMHTAIRK-------------------------LEEKERYLQTVLT 103
            L E+   L+  +      + +                         +EE    ++ +  
Sbjct: 423 ELREREAELEATLRTARERVEEAEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEA 482

Query: 104 NVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAY 163
            +E    L  +  ++ +R  +E A+   + +  +E+   + ++ ++++ E+  ++E +  
Sbjct: 483 ELED---LEEEVEEVEER--LERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRE 537

Query: 164 KTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYKE 204
           + + L+E  A+L    E  +K E A   +E   E   E  E
Sbjct: 538 RAEELRERAAELE--AEAEEKREAAAEAEEEAEEAREEVAE 576


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 30.8 bits (70), Expect = 0.82
 Identities = 22/117 (18%), Positives = 48/117 (41%), Gaps = 19/117 (16%)

Query: 80  QIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRK-----------AIESAQ 128
           + DA+  A +         +  L  VE++L       D+ + K           A+   +
Sbjct: 343 ERDALAAAYKAYLA----YKPKLEKVERKLPELGIWKDVERIKALVIRGYPLAEALSKVK 398

Query: 129 SAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI 185
                +   EK   + +E +++   +    + E    +RL+EE ++L+R+ E +K+ 
Sbjct: 399 EEERPR---EKEGTEEEERREITVYEKRIKKLEE-TVERLEEENSELKRELEELKRE 451


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 30.5 bits (70), Expect = 0.86
 Identities = 35/182 (19%), Positives = 78/182 (42%), Gaps = 34/182 (18%)

Query: 27  DMQEQNSRLLQQLREKDDA---NFKLMTERIK-----SNQLHKLAREEKDILHEQVNALK 78
            +    S L +++ EK  A    +K + +  +         HK   + ++ L E++ +L+
Sbjct: 208 SLAADESELPEKVLEKFKALAKQYKKLRKAQEKKVEGRLAQHKKYAKLREKLKEELKSLR 267

Query: 79  L---QIDA----MHTAIRKLEEKERYLQTVLTN--VEKELHLRN-QAMDLN---KRKAIE 125
           L   QID     +    +++  +ER L  ++    + ++  L+  Q  +L+     K I 
Sbjct: 268 LTSKQIDELVEQLRDINKRVRGQERELLRLVERLKMPRKDFLKLFQGNELDITWLEKEIA 327

Query: 126 SAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI 185
           S +  ++    L + + ++K+ QQ    +  ++E E   T      I +L+    ++ K 
Sbjct: 328 SGKPWSEF---LVRVYDEIKKLQQ----ELEAIEEETGLT------IEELKEINRQISKG 374

Query: 186 EQ 187
           E 
Sbjct: 375 EA 376


>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 239

 Score = 30.0 bits (68), Expect = 0.89
 Identities = 35/184 (19%), Positives = 74/184 (40%), Gaps = 16/184 (8%)

Query: 34  RLLQQLREK------DDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQ------- 80
           + L+ L+EK      D+  F +++ +           E++ +  EQV  LK Q       
Sbjct: 37  KTLKALKEKAAERNPDEFYFGMISSKTDDGVHILKREEKEVLSKEQVKLLKTQDLNYVRT 96

Query: 81  -IDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEK 139
              A    I KL+E+     +     +  + + ++    +   A     +   L     +
Sbjct: 97  KRQAEAKKIEKLKEELHLTDSGSAGPKHIVFVDDEEEQKSFDPAEYFDTTPELLDRRENR 156

Query: 140 YHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEI 199
                 E   +V EK     A+  K ++L +E+ + + + +++KK+EQ   L   +M++ 
Sbjct: 157 PRISQLEKTSLVDEKQKKKSAK--KKRKLYKELKERKEREKKLKKVEQRLELQRELMKKG 214

Query: 200 REYK 203
           +  K
Sbjct: 215 KGKK 218


>gnl|CDD|179749 PRK04147, PRK04147, N-acetylneuraminate lyase; Provisional.
          Length = 293

 Score = 30.3 bits (69), Expect = 0.89
 Identities = 13/44 (29%), Positives = 23/44 (52%)

Query: 91  LEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLK 134
            EEK++ L+ V    + ++ L  Q   +N  +A E A+ A +L 
Sbjct: 56  TEEKKQVLEIVAEEAKGKVKLIAQVGSVNTAEAQELAKYATELG 99


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 30.7 bits (69), Expect = 0.95
 Identities = 45/187 (24%), Positives = 89/187 (47%), Gaps = 21/187 (11%)

Query: 21  TGQAFEDMQEQNSRL---LQQLREKDD---ANFKLMT--ERIKSNQLHKLAREEKDILHE 72
           T  A E +++   +L   LQQ R+ +        L+T  ER   + L +L ++E D+L  
Sbjct: 404 TPSAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQL-KKENDMLQT 462

Query: 73  QVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIE---SAQS 129
           ++N++         +++ +E++ +       N EK+L       +  KRK  E   +A++
Sbjct: 463 KLNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLA------EEKKRKKEEEETAARA 516

Query: 130 AADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAG 189
           AA      E+    LK+A+Q +  +   LE   +  K  +EE   L ++ + ++K +++ 
Sbjct: 517 AAQAAASREECAESLKQAKQDLEMEIKKLE---HDLKLKEEECRMLEKEAQELRKYQESE 573

Query: 190 TLDEVMM 196
              EV+M
Sbjct: 574 KETEVLM 580


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 29.7 bits (67), Expect = 1.1
 Identities = 24/129 (18%), Positives = 61/129 (47%), Gaps = 3/129 (2%)

Query: 54  IKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRN 113
           +    +  +  ++ + L  +++ LK Q+  +   +  LEE+   L++ L +++++L L  
Sbjct: 61  LLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLR 120

Query: 114 QAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIA 173
           + +   + +     +S  +L    ++     ++ ++ V E    LE      +RLQE I 
Sbjct: 121 ELLKSLEERLESLEESIKEL---AKELRELRQDLREEVEELREELERLQENLQRLQEAIQ 177

Query: 174 QLRRKTERM 182
           +L+   E++
Sbjct: 178 ELQSLLEQL 186



 Score = 27.8 bits (62), Expect = 5.2
 Identities = 20/105 (19%), Positives = 47/105 (44%), Gaps = 7/105 (6%)

Query: 10  EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQ-LHKLAREEKD 68
           E +  L+E++   +  ED++E+ + L  +L +  +    L        + L  L    K+
Sbjct: 83  ELKQQLSELQ---EELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKE 139

Query: 69  I---LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELH 110
           +   L E    L+ +++ +   + +L+E  + LQ  +  ++  L 
Sbjct: 140 LAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLE 184


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 30.3 bits (68), Expect = 1.2
 Identities = 28/123 (22%), Positives = 51/123 (41%), Gaps = 20/123 (16%)

Query: 84  MHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQ 143
           + T IRKL E++  L+ +  ++E                  E  + +  LKL  +K   Q
Sbjct: 73  LQTKIRKLTEQDEGLRNIRESIE------------------ERQKESDQLKLQNQKLVNQ 114

Query: 144 LKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYK 203
           L  A+ V  +     +     ++ L+EE   L+ + + +  +++     E   E  RE  
Sbjct: 115 LGHARGVFIKTKGRYQKLEELSRHLKEENQCLQIQLDAL--VQECNEKIEENQELNRELA 172

Query: 204 ETL 206
           ETL
Sbjct: 173 ETL 175


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 30.0 bits (67), Expect = 1.2
 Identities = 35/156 (22%), Positives = 63/156 (40%), Gaps = 14/156 (8%)

Query: 27  DMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHT 86
           DM+ +  +L   LR   +   +L  +  ++    +    E++ L  Q    + +I  +  
Sbjct: 168 DMRSRIEQLETALRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQ 227

Query: 87  AIRKLEEKERYLQTVLTN-VEKELHLRNQAMDLNKRKAIESAQ------SAADLKLHLEK 139
            I+ L +K++    VL    + E  L     +L +R    + Q         DL+L  E+
Sbjct: 228 DIQTLTQKKQENDRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETERIDLQLENEQ 287

Query: 140 YHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQL 175
            H  L+  Q+        LE+   K   L EE+A L
Sbjct: 288 LHEDLRTLQER-------LESSQQKAGLLGEELASL 316


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 30.3 bits (68), Expect = 1.2
 Identities = 19/111 (17%), Positives = 50/111 (45%), Gaps = 8/111 (7%)

Query: 88  IRKLEEKERYLQTVLTNVEKELHLRNQAMDLN---KRKAIESAQSAADLKLHLEKYHAQL 144
             +++E +  ++ +   ++   + + Q  +++         +++   D++ +L+K    L
Sbjct: 635 YNEIQENKILIEKLRGKID---NYKKQIAEIDSIIPDLKEITSR-INDIEDNLKKSRKAL 690

Query: 145 KEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA-GTLDEV 194
            +A+       S++E    +   L + I  +    E MKKI++A G L  +
Sbjct: 691 DDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGDLKRL 741


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 37/206 (17%), Positives = 75/206 (36%), Gaps = 27/206 (13%)

Query: 5   QVSGNEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS-------- 56
                + + +  E+    +   + Q+Q ++L +QL+  +     L  + I++        
Sbjct: 34  AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93

Query: 57  ----------NQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEE---KERYLQTVLT 103
                     N L    RE++  L EQ+ AL+        A+    E   +   L     
Sbjct: 94  KQIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYG 153

Query: 104 NVEKEL-----HLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSL 158
            +          L+     L   +A  +A+ A    L  E+  AQ  +  Q++ E+  +L
Sbjct: 154 ALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQ-RAQQAKLAQLLEERKKTL 212

Query: 159 EAEAYKTKRLQEEIAQLRRKTERMKK 184
                +    Q+++ +LR    R+K 
Sbjct: 213 AQLNSELSADQKKLEELRANESRLKN 238


>gnl|CDD|227481 COG5152, COG5152, Uncharacterized conserved protein, contains RING
           and CCCH-type Zn-fingers [General function prediction
           only].
          Length = 259

 Score = 29.7 bits (66), Expect = 1.3
 Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 4/61 (6%)

Query: 186 EQAGTLDEVMMEEIREYKETLTCPSCKVKRKDAVLTKCFHVFCWDCLRTRYETRQRKCPK 245
           E     +E  +      K    C  CK   +  V+T+C H FC  C   +Y    +K  +
Sbjct: 177 EWNAEYEEAPVISGPGEKIPFLCGICKKDYESPVVTECGHSFCSLCAIRKY----QKGDE 232

Query: 246 C 246
           C
Sbjct: 233 C 233


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 30.0 bits (69), Expect = 1.3
 Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 10/81 (12%)

Query: 35  LLQQLREKDDANFKLMTE----RIKSNQL----HKLAREEKDI--LHEQVNALKLQIDAM 84
            + +L E D+   +L TE    + + N L     +  R+ +D   L  +V  LK +I A+
Sbjct: 26  DVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKAL 85

Query: 85  HTAIRKLEEKERYLQTVLTNV 105
              + +LE +   L   + N+
Sbjct: 86  EAELDELEAELEELLLRIPNL 106


>gnl|CDD|165237 PHA02926, PHA02926, zinc finger-like protein; Provisional.
          Length = 242

 Score = 29.6 bits (66), Expect = 1.4
 Identities = 16/41 (39%), Positives = 19/41 (46%), Gaps = 5/41 (12%)

Query: 215 RKDAVLTKCFHVFCWDCL----RTRYET-RQRKCPKCNAAF 250
           R   +L  C H+FC  C+    RTR ET     CP C   F
Sbjct: 189 RYFGLLDSCNHIFCITCINIWHRTRRETGASDNCPICRTRF 229


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 30.0 bits (68), Expect = 1.6
 Identities = 16/66 (24%), Positives = 26/66 (39%), Gaps = 6/66 (9%)

Query: 144 LKEAQQVVTEKTSSLEAE----AYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVM--ME 197
           L E +  + E    L+        K +R  EE   L+++ E +K       +DE+    E
Sbjct: 703 LNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDKAE 762

Query: 198 EIREYK 203
            I   K
Sbjct: 763 TINGVK 768



 Score = 28.5 bits (64), Expect = 4.9
 Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 1/91 (1%)

Query: 68  DILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL-HLRNQAMDLNKRKAIES 126
           + L+EQ + LK   D +     +L +K          ++KEL  L+ +  D    +  + 
Sbjct: 701 EYLNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQKIDELKDK 760

Query: 127 AQSAADLKLHLEKYHAQLKEAQQVVTEKTSS 157
           A++   +K+ +E   A+  ++ + + ++  S
Sbjct: 761 AETINGVKVLVEVVDAKDMKSLKTMADRLKS 791


>gnl|CDD|220191 pfam09344, Cas_CT1975, CT1975-like protein.  CRISPR is a term for
           Clustered, Regularly Interspaced Short Palidromic
           Repeats. A number of protein families appear only in
           association with these repeats and are designated Cas
           (CRISPR-Associated) proteins. This family is represented
           by CT1975 of Chlorobium tepidum.
          Length = 353

 Score = 29.6 bits (67), Expect = 1.7
 Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 2/54 (3%)

Query: 144 LKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMME 197
           LK A +   E    L  E  +TKRL EE+A  +R  +   + E A  L E ++ 
Sbjct: 40  LKRAIRTSEEFAQLLAEEGVRTKRLAEEVA--KRLEQAGVEEELAKALAEKILG 91


>gnl|CDD|99801 cd06204, CYPOR, NADPH cytochrome p450 reductase (CYPOR) serves as
           an electron donor in several oxygenase systems and is a
           component of nitric oxide synthases and methionine
           synthase reductases. CYPOR transfers two electrons from
           NADPH to the heme of cytochrome p450 via FAD and FMN.
           Ferredoxin-NADP+ (oxido)reductase is an FAD-containing
           enzyme that catalyzes the reversible electron transfer
           between NADP(H) and electron carrier proteins such as
           ferredoxin and flavodoxin. Isoforms of these
           flavoproteins (i.e. having a non-covalently bound FAD as
           a prosthetic group) are present in chloroplasts,
           mitochondria, and bacteria in which they participate in
           a wide variety of redox metabolic pathways. The
           C-terminal domain contains most of the NADP(H) binding
           residues and the N-terminal domain interacts
           non-covalently with the isoalloxazine rings of the
           flavin molecule which lies largely in a large gap
           betweed the two domains. Ferredoxin-NADP+ reductase
           first accepts one electron from reduced ferredoxin to
           form a flavin semiquinone intermediate. The enzyme then
           accepts a second electron to form FADH2 which then
           transfers two electrons and a proton to NADP+ to form
           NADPH.
          Length = 416

 Score = 29.5 bits (67), Expect = 2.0
 Identities = 7/45 (15%), Positives = 17/45 (37%), Gaps = 10/45 (22%)

Query: 57  NQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTV 101
             L ++  E+  +   +              ++KL+ + RY + V
Sbjct: 381 KTLLEILAEQGGMTETEAEEY----------VKKLKTRGRYQEDV 415


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 29.6 bits (66), Expect = 2.1
 Identities = 25/124 (20%), Positives = 51/124 (41%)

Query: 95  ERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEK 154
           ++ ++ +  + EK ++ R    DL +R++ E A+ A  LK  L+K      +AQQ     
Sbjct: 180 KKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFA 239

Query: 155 TSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYKETLTCPSCKVK 214
             + + +  + ++ Q+E   L +  +     E     +    E  +   E        +K
Sbjct: 240 QDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALK 299

Query: 215 RKDA 218
            KD 
Sbjct: 300 AKDH 303


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 29.7 bits (67), Expect = 2.2
 Identities = 40/216 (18%), Positives = 82/216 (37%), Gaps = 36/216 (16%)

Query: 9   NEEE--ALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREE 66
           NE E    L + E   Q+    Q+Q    L Q   + +   +   E        K AR +
Sbjct: 601 NETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEA---RTALKQARLD 657

Query: 67  KDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL------------HLRNQ 114
              L  +  +LK +++        + E+++  +T L  ++ +L             L++ 
Sbjct: 658 LQRLQNEQQSLKDKLELA------IAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDD 711

Query: 115 AMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQ----- 169
             +L   +  +      +L   L +  A ++ A+     +   L+ + Y    L      
Sbjct: 712 FRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELK-KQYDR-ELASLDVD 769

Query: 170 -EEIAQLRRKTERMK-KIEQAGTLDEVMMEEIREYK 203
              + +L+R+ E ++  IE+      V   E+REY+
Sbjct: 770 PNTVKELKRQIEELETTIERI----AVRRPEVREYR 801



 Score = 28.5 bits (64), Expect = 4.8
 Identities = 19/139 (13%), Positives = 50/139 (35%), Gaps = 10/139 (7%)

Query: 73  QVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAAD 132
            V         +   +++ EE  +         E++L   N  ++  KR   E+  +   
Sbjct: 594 DVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQ 653

Query: 133 LKLHLEKYHAQLKEAQQ-VVTEKTSSLEAEAYKTKR----LQEEIAQLRRKTERMKKIEQ 187
            +L L+    +L+  QQ +  +   ++     + +     L  ++ QL  + +   +   
Sbjct: 654 ARLDLQ----RLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLE-AL 708

Query: 188 AGTLDEVMMEEIREYKETL 206
                E+  E + +++   
Sbjct: 709 KDDFRELRTERLAKWQVVE 727


>gnl|CDD|178657 PLN03110, PLN03110, Rab GTPase; Provisional.
          Length = 216

 Score = 28.7 bits (64), Expect = 2.3
 Identities = 24/85 (28%), Positives = 41/85 (48%), Gaps = 11/85 (12%)

Query: 23  QAFEDMQEQNSRLLQQLREKDDANFKLMTERIKS--NQLHKLAREEKDILHEQVNALKLQ 80
           Q F+++Q    R L++LR+  D+N  +M    KS  N L  +A E+   L E     K  
Sbjct: 98  QTFDNVQ----RWLRELRDHADSNIVIMMAGNKSDLNHLRSVAEEDGQALAE-----KEG 148

Query: 81  IDAMHTAIRKLEEKERYLQTVLTNV 105
           +  + T+  +    E+  QT+L  +
Sbjct: 149 LSFLETSALEATNVEKAFQTILLEI 173


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 29.2 bits (66), Expect = 2.3
 Identities = 44/196 (22%), Positives = 80/196 (40%), Gaps = 29/196 (14%)

Query: 11  EEALLNEMEVTGQAF-EDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLA------ 63
           EE LL  +E    A  + ++ +  R  ++LR+K +   +   ER       KL       
Sbjct: 206 EEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQ 265

Query: 64  -----REEKDILHEQV----NALKLQIDAMHTAIRKLEEK-ERYLQTVLTNVE-KELHLR 112
                RE    + E+V    N    ++  +++ ++ LE+  +   +    N + ++L L 
Sbjct: 266 AIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKGLEKALDSRSEAEDENHKVQQLWLA 325

Query: 113 NQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEA---QQVVTEKTSSLEAEAYKTKRLQ 169
            +A+    + A++S    A     L K    LKE     ++V    +SL  EA +   L 
Sbjct: 326 VEAL----KSALKSGS--AGSPRPLVKELDALKELAKDDELVDAALASLPPEASQRGILS 379

Query: 170 EEIAQLRRKTERMKKI 185
           EE  QLR +   +   
Sbjct: 380 EE--QLRNRFNLLAPE 393


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 29.2 bits (66), Expect = 2.5
 Identities = 21/131 (16%), Positives = 49/131 (37%), Gaps = 20/131 (15%)

Query: 91  LEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIES-AQSAADLKLHLEKYHAQLKEAQQ 149
            ++K R L   +  ++ ++    Q +    +   E   ++  ++     KY   ++EA+ 
Sbjct: 172 NKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKT 231

Query: 150 VVTEKTSSLEAEAYKTKRLQEEIAQLRRK----TERMKKIEQAGTLDEVMME----EIRE 201
           +  E              L +E+  L       +  + K+  A    +  +E     I+ 
Sbjct: 232 IKAEIE-----------ELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKM 280

Query: 202 YKETLTCPSCK 212
           Y++   CP+C 
Sbjct: 281 YEKGGVCPTCT 291


>gnl|CDD|220735 pfam10408, Ufd2P_core, Ubiquitin elongating factor core.  This is
           the most conserved part of the core region of Ufd2P
           ubiquitin elongating factor or E4, running from helix
           alpha-11 to alpha-38. It consists of 31 helices of
           variable length connected by loops of variable size
           forming a compact unit; the helical packing pattern of
           the compact unit consists of five structural repeats
           that resemble tandem Armadillo (ARM) repeats. This
           domain is involved in ubiquitination as it binds Cdc48p
           and escorts ubiquitinated proteins from Cdc48p to the
           proteasome for degradation. The core is structurally
           similar to the nuclear transporter protein
           importin-alpha. The core is associated with the U-box at
           the C-terminus, pfam04564, which has ligase activity.
          Length = 625

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 1/54 (1%)

Query: 28  MQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQI 81
           + E+  RLL++L+   D   +L   R    QL +L  ++ +IL  +  +L+  +
Sbjct: 186 LIEKYKRLLRELKRLQDELEELEQSRSNWAQLKRL-EKQLEILSAEKLSLEGFL 238


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 29.1 bits (66), Expect = 2.9
 Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 21/96 (21%)

Query: 87  AIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKE 146
            IR+++E+   ++    ++ +EL               E A+   +  L L +Y     E
Sbjct: 216 LIREIKEELEEIEKERESLLEEL--------------KELAKKYLEELLALYEYLEIELE 261

Query: 147 AQQV-----VTEKTSSLEA--EAYKTKRLQEEIAQL 175
             +       T+KT ++E      + K+L+E I + 
Sbjct: 262 RAEALSKFLKTDKTFAIEGWVPEDRVKKLKELIDKA 297


>gnl|CDD|188641 cd00954, NAL, N-Acetylneuraminic acid aldolase, also called
           N-acetylneuraminate lyase (NAL).  N-Acetylneuraminic
           acid aldolase, also called N-acetylneuraminate lyase
           (NAL), which catalyses the reversible aldol reaction of
           N-acetyl-D-mannosamine and pyruvate to give
           N-acetyl-D-neuraminic acid (D-sialic acid). It has a
           widespread application as biocatalyst for the synthesis
           of sialic acid and its derivatives. This enzyme has been
           shown to be quite specific for pyruvate as the donor,
           but flexible to a variety of D- and, to some extent,
           L-hexoses and pentoses as acceptor substrates. NAL is
           member of dihydrodipicolinate synthase family that
           comprises several pyruvate-dependent class I aldolases.
          Length = 288

 Score = 28.4 bits (64), Expect = 3.2
 Identities = 10/43 (23%), Positives = 23/43 (53%)

Query: 91  LEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADL 133
           +EE+++  + V    + ++ L      LN +++ E A+ A +L
Sbjct: 53  VEERKQIAEIVAEAAKGKVTLIAHVGSLNLKESQELAKHAEEL 95


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 29.0 bits (65), Expect = 3.3
 Identities = 27/130 (20%), Positives = 55/130 (42%), Gaps = 11/130 (8%)

Query: 60  HKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELH-LRNQAMDL 118
           H +  + K    E    + + I+ +    ++LE+K  +L+ +L   EK    L  +  +L
Sbjct: 496 HFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEEL 555

Query: 119 NKRKAIESAQSAADLKLHLEKYHAQ-LKEAQQVVTEK-TSSLEAEAYKTKRLQEEIAQLR 176
            +R+         + KL LEK   + LK  ++ V        E + +K K ++     ++
Sbjct: 556 KERE--------RNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVK 607

Query: 177 RKTERMKKIE 186
            K  + K  +
Sbjct: 608 LKETKQKIPQ 617


>gnl|CDD|149785 pfam08838, DUF1811, Protein of unknown function (DUF1811).  This is
           a bacterial family of uncharacterized proteins. Some of
           the proteins are annotated as being transcriptional
           regulators. The structure of one of the proteins in this
           family has revealed a beta-barrel like structure with
           helix-turn-helix like motif.
          Length = 102

 Score = 27.3 bits (61), Expect = 3.4
 Identities = 11/27 (40%), Positives = 17/27 (62%), Gaps = 3/27 (11%)

Query: 168 LQEEIAQLRRKTERMKKIEQAGTLDEV 194
           L++EIA L    E+ +K EQ G ++E 
Sbjct: 12  LRQEIALL---KEKARKAEQLGIVNEF 35


>gnl|CDD|222465 pfam13942, Lipoprotein_20, YfhG lipoprotein.  This family includes
           the YfhG protein from E. coli. Members of this family
           have an N-terminal lipoprotein attachment site. The
           members of this family are functionally uncharacterized.
          Length = 180

 Score = 28.0 bits (63), Expect = 3.5
 Identities = 26/81 (32%), Positives = 39/81 (48%), Gaps = 11/81 (13%)

Query: 35  LLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEK 94
           LLQ  RE      +L  ER +  +L + +  E D L EQ   L+ +++   T  RKLE  
Sbjct: 100 LLQLWREGQVLQLQLAEERARYQRLQQSSDSELDRLREQQARLQYELE---TTTRKLEN- 155

Query: 95  ERYLQTVLTNVEKELHLRNQA 115
                  LT++E++L  R Q 
Sbjct: 156 -------LTDIERQLSSRKQG 169


>gnl|CDD|112645 pfam03841, SelA, L-seryl-tRNA selenium transferase. 
          Length = 367

 Score = 28.4 bits (64), Expect = 3.7
 Identities = 33/135 (24%), Positives = 55/135 (40%), Gaps = 21/135 (15%)

Query: 48  KLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEK 107
           K + ER+K N L +  R +K  L      L+L ++       KL EK   L+ +LT   K
Sbjct: 227 KELIERLKKNPLKRALRVDKLTLAALEATLRLYLNP-----EKLYEKIPTLR-LLTQPLK 280

Query: 108 ELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTE----------KTSS 157
           EL    +A  L K      A+  +  ++ +E   +Q+      V E          +  +
Sbjct: 281 ELRA--KAERLQKLL---KARLGSGAEVSVEPSFSQIGGGSLPVKELPSAAVTIRPENMA 335

Query: 158 LEAEAYKTKRLQEEI 172
           L+A   + + L   I
Sbjct: 336 LDALKLRLRLLDPPI 350


>gnl|CDD|118696 pfam10168, Nup88, Nuclear pore component.  Nup88 can be divided
           into two structural domains; the N-terminal two-thirds
           of the protein has no obvious structural motifs but is
           the region for binding to Nup98, one of the components
           of the nuclear pore. the C-terminal end is a predicted
           coiled-coil domain. Nup88 is overexpressed in tumour
           cells.
          Length = 717

 Score = 28.7 bits (64), Expect = 3.8
 Identities = 35/162 (21%), Positives = 63/162 (38%), Gaps = 31/162 (19%)

Query: 27  DMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQ---VNALKLQIDA 83
             +E+    +Q  RE+     K ++ER       KLA + ++  + Q   VN  K  + +
Sbjct: 572 LQKEKQLEDIQDCREER----KSLSERA-----EKLAEKFEEAKYNQELLVNRCKRLLQS 622

Query: 84  MHTAIRKLEEKERYLQTVLTNVEKEL-HLRNQAMDLNKRKAIESAQSAADLKLHLEKYHA 142
            ++ +  L + ER +   L  + K+L HL N    + K+K  +    A+           
Sbjct: 623 ANSQLPVLSDSERDMSKELQRINKQLQHLANGIKQVKKKKNYQRYHMASQESP------- 675

Query: 143 QLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
                      K SS      + K + E + +L    +RM K
Sbjct: 676 -----------KKSSYTLPEKQHKTITEILKELGEHIDRMIK 706


>gnl|CDD|226107 COG3579, PepC, Aminopeptidase C [Amino acid transport and
           metabolism].
          Length = 444

 Score = 28.6 bits (64), Expect = 3.9
 Identities = 15/63 (23%), Positives = 25/63 (39%)

Query: 137 LEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVMM 196
            EKY    K         ++S E  A   K L+++   LR   +     +    L E ++
Sbjct: 148 FEKYGVVPKSVYPESFSSSNSRELNALLNKLLRQDAQILRDALKEGADDDTVEALKEELL 207

Query: 197 EEI 199
           +EI
Sbjct: 208 QEI 210


>gnl|CDD|232933 TIGR00348, hsdR, type I site-specific deoxyribonuclease, HsdR
           family.  This gene is part of the type I restriction and
           modification system which is composed of three
           polypeptides R (restriction endonuclease), M
           (modification) and S (specificity). This group of
           enzymes recognize specific short DNA sequences and have
           an absolute requirement for ATP (or dATP) and
           S-adenosyl-L-methionine. They also catalyse the
           reactions of EC 2.1.1.72 and EC 2.1.1.73, with similar
           site specificity.(J. Mol. Biol. 271 (3), 342-348
           (1997)). Members of this family are assumed to differ
           from each other in DNA site specificity [DNA metabolism,
           Restriction/modification].
          Length = 667

 Score = 28.5 bits (64), Expect = 4.1
 Identities = 16/112 (14%), Positives = 36/112 (32%), Gaps = 2/112 (1%)

Query: 52  ERIKSNQLHKLAREEKDILHEQVNA--LKLQIDAMHTAIRKLEEKERYLQTVLTNVEKEL 109
           E     +      +E   L  +      K  +       +K+   E  L+++  ++ +  
Sbjct: 447 EDHLDKKKLDAFFDEIFELLPERIREITKESLKEKLQKTKKILFNEDRLESIAKDIAEHY 506

Query: 110 HLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAE 161
               +           S  +  + K  L++   +  EA  +V     S +AE
Sbjct: 507 AKFKELFKFKAMVVAISRYACVEEKNALDEELNEKFEASAIVMTGKESDDAE 558


>gnl|CDD|216560 pfam01544, CorA, CorA-like Mg2+ transporter protein.  The CorA
           transport system is the primary Mg2+ influx system of
           Salmonella typhimurium and Escherichia coli. CorA is
           virtually ubiquitous in the Bacteria and Archaea. There
           are also eukaryotic relatives of this protein. The
           family includes the MRS2 protein from yeast that is
           thought to be an RNA splicing protein. However its
           membership of this family suggests that its effect on
           splicing is due to altered magnesium levels in the cell.
          Length = 291

 Score = 28.0 bits (63), Expect = 4.2
 Identities = 26/156 (16%), Positives = 54/156 (34%), Gaps = 33/156 (21%)

Query: 39  LREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYL 98
           +R +       + +R+++  L        D+L+  ++++   +D     + KLEE+    
Sbjct: 83  VRREPLPAVDEIRKRLENLGLGPK--SPGDLLYLLLDSI---VDRYFEILEKLEEE---- 133

Query: 99  QTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQQVVT------ 152
              L  +E EL        L         +    L+  L +    L   ++V+       
Sbjct: 134 ---LDELEDELEDSTTNELL---------RELLRLRRSLVRLRRSLAPLREVLNRLLSDD 181

Query: 153 ------EKTSSLEAEAYKTKRLQEEIAQLRRKTERM 182
                 E+   L     + +RL E    LR +   +
Sbjct: 182 GPLIDEEEKEYLRDLLDELERLLEMAEILRERLRSL 217


>gnl|CDD|218186 pfam04636, PA26, PA26 p53-induced protein (sestrin).  PA26 is a
           p53-inducible protein. Its function is unknown. It has
           similarity to pfam04636 in its N-terminus.
          Length = 450

 Score = 28.1 bits (63), Expect = 4.5
 Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 5/36 (13%)

Query: 180 ERMKKIEQAGTLDEVMMEEI-----REYKETLTCPS 210
           ERMKK+++    DE   EE+     +E KE+L   S
Sbjct: 234 ERMKKLQEERDDDEASQEEMTTRFEKEKKESLLVES 269


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 28.5 bits (65), Expect = 4.5
 Identities = 26/113 (23%), Positives = 48/113 (42%), Gaps = 15/113 (13%)

Query: 20  VTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKL 79
           VTG+A  +   +   LL++L     A    + ER+++        EE   L +++  LK 
Sbjct: 687 VTGEAALEYLNEQEALLKELAALLKAKPSELPERVEALL------EELKELEKELEQLKA 740

Query: 80  QIDAMHTAIRKLEEKE-----RYLQTVLTNVEKELHLRNQAMDLNKRKAIESA 127
           ++ A   A   L + +     + L   +  V+ +  LR  A DL  +  + S 
Sbjct: 741 KL-AAAAAGDLLAQAKEVNGVKVLAAQVEGVDAK-ALRTLADDL--KDKLGSG 789



 Score = 27.7 bits (63), Expect = 8.4
 Identities = 17/86 (19%), Positives = 36/86 (41%), Gaps = 23/86 (26%)

Query: 136 HLEKYHAQLKEAQQV-------VTEKTSSLEAEAYKTKRLQEEIAQLRRK------TERM 182
           +L +  A LKE   +       + E+  +L  E    K L++E+ QL+ K       + +
Sbjct: 695 YLNEQEALLKELAALLKAKPSELPERVEALLEE---LKELEKELEQLKAKLAAAAAGDLL 751

Query: 183 KKIEQ-------AGTLDEVMMEEIRE 201
            + ++       A  ++ V  + +R 
Sbjct: 752 AQAKEVNGVKVLAAQVEGVDAKALRT 777


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 28.4 bits (64), Expect = 4.7
 Identities = 27/113 (23%), Positives = 52/113 (46%), Gaps = 17/113 (15%)

Query: 68  DILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESA 127
           D++ E    ++++ID+    + +L+   R +Q           L  +   L K K   S 
Sbjct: 387 DLIDEAAARIRMEIDSKPEELDELDR--RIIQ-----------LEIEREALKKEKDEASK 433

Query: 128 QSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTE 180
           +   DL    EK  A+L+E    + E+  + +A     ++++EEI Q+R + E
Sbjct: 434 ERLEDL----EKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIEQVRLELE 482


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 28.3 bits (64), Expect = 4.8
 Identities = 37/178 (20%), Positives = 75/178 (42%), Gaps = 29/178 (16%)

Query: 12  EALLNEMEVTGQAFEDMQE-----QNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREE 66
           E  L+E+E   + F ++ E     +   +L +L E+ DA  + M E I            
Sbjct: 167 EKQLDELEEEFEQFVELTESGDYLEAREVLLKLEEETDALEQKM-EEIP----------- 214

Query: 67  KDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIES 126
             +L E  N    Q++ +    R++ E+  +   +  ++EKEL    + +D N    +E 
Sbjct: 215 -PLLKELQNEFPDQLEELKAGYREMTEEGYHFDHL--DIEKELQDLKEQIDQNL-ALLE- 269

Query: 127 AQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
                  +L L++   + +E ++ +      LE E    K +++ I +L    E  ++
Sbjct: 270 -------ELDLDEAEEENEEIEERIDTLYDILEKEVKAKKFVEKNIDKLTDFLEHARE 320


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 28.2 bits (63), Expect = 5.3
 Identities = 32/192 (16%), Positives = 63/192 (32%), Gaps = 36/192 (18%)

Query: 61  KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERY-----LQTVLTNVEKELHLRNQA 115
           + A +E       ++  + ++ A+  A R    +E         T     ++   L    
Sbjct: 818 EEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAVERSDTYRELRKRIAALERTL 877

Query: 116 MDLNKRKAIESAQSAAD-------------LKLHLEKYHAQLKEAQQVVTEKTSSLEAEA 162
           +      ++E+  + A              L   +E+   +L E  Q V      L    
Sbjct: 878 VRAGGGLSLEALVAEAAALDPDELPARLEELARDIEELEEELNELAQEVGAAKQELARMD 937

Query: 163 YKTK------RLQEEIAQLRRKTERMKKIEQAGTLDEVMMEEIREYKETLTCPSCKVKRK 216
             +         +  +AQLR   ER  ++  A  L   + + I  Y+E         +R+
Sbjct: 938 GGSTAAELEAERESLLAQLRDLAERYLELALASRL---LRKAIERYRE---------QRQ 985

Query: 217 DAVLTKCFHVFC 228
             VL +   +F 
Sbjct: 986 PPVLNRAGEIFA 997


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 28.3 bits (63), Expect = 5.3
 Identities = 37/195 (18%), Positives = 66/195 (33%), Gaps = 19/195 (9%)

Query: 10  EEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDI 69
           +EE    E EV      + +E    L+++ +E +             N   + A+EE   
Sbjct: 301 QEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEP---QERSYFINAAQRQAQEEAKA 357

Query: 70  LHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQS 129
                 A+  Q +A     R+ EE ER  Q  L                 ++  I     
Sbjct: 358 AANIAEAIGAQAEAAVETARETEEAERAEQAALVAA--------AEAAEQEQVEIAVRAE 409

Query: 130 AADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK---IE 186
           AA  +   +   A++K   + + EK    +AEA   + L E I  L            ++
Sbjct: 410 AAKAEAEAQA--AEIKAEAEAIREKG---KAEAEAKRALAEAIQVLGDAAAAELFKALVQ 464

Query: 187 QAGTLDEVMMEEIRE 201
               + E   + ++ 
Sbjct: 465 ALPEVAEEAAQPMKN 479


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 28.0 bits (63), Expect = 5.8
 Identities = 28/137 (20%), Positives = 60/137 (43%), Gaps = 24/137 (17%)

Query: 72  EQVNALKLQIDAMHTAIR-KLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSA 130
           E+V+ L+ +++      R +L+  ER L       ++E  L  +   L+K++        
Sbjct: 58  EEVHKLRAELERELKERRNELQRLERRLL------QREETLDRKMESLDKKEE------- 104

Query: 131 ADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRR---KTERMKKIEQ 187
                +LEK   +L   ++ + EK   LE    + +   E I+ L +   K   ++++E+
Sbjct: 105 -----NLEKKEKELSNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVEE 159

Query: 188 AGTLDEVMMEEIREYKE 204
               +   +  I+E +E
Sbjct: 160 EARHEAAKL--IKEIEE 174


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 27.3 bits (61), Expect = 5.8
 Identities = 15/64 (23%), Positives = 33/64 (51%), Gaps = 2/64 (3%)

Query: 125 ESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
           E+ +   + +  L +Y  +L+EA++  +E     + EA +    +E  A+   + ER+K+
Sbjct: 48  EAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIA--EEIKAEAEEELERIKE 105

Query: 185 IEQA 188
             +A
Sbjct: 106 AAEA 109


>gnl|CDD|226306 COG3783, CybC, Soluble cytochrome b562 [Energy production and
           conversion].
          Length = 100

 Score = 26.8 bits (59), Expect = 5.8
 Identities = 21/73 (28%), Positives = 33/73 (45%)

Query: 122 KAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTER 181
           K +E A  AA +K  LEK  A   +AQ+    K      ++ + K  +     L  + ++
Sbjct: 9   KVVEKADKAAVIKAALEKMRAAALDAQKATPPKLEGKYPDSPEMKDYRHGFDILIGQIDK 68

Query: 182 MKKIEQAGTLDEV 194
             K+ Q G LDE 
Sbjct: 69  ADKLAQEGNLDEA 81


>gnl|CDD|184546 PRK14163, PRK14163, heat shock protein GrpE; Provisional.
          Length = 214

 Score = 27.8 bits (61), Expect = 5.9
 Identities = 19/58 (32%), Positives = 30/58 (51%), Gaps = 11/58 (18%)

Query: 142 AQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTER----MKKIEQAGTLDEVM 195
           AQL + +  + E+T+ L+       RLQ E    RR+ ER    +K+I  A  L E++
Sbjct: 47  AQLDQVRTALGERTADLQ-------RLQAEYQNYRRRVERDRVTVKEIAVANLLSELL 97


>gnl|CDD|113545 pfam04778, LMP, LMP repeated region.  This family consists of a
           repeated sequence element found in the LMP group of
           surface-located membrane proteins of Mycoplasma hominis.
           The the number of repeats in the protein affects the
           tendency of cells to spontaneously aggregate.
           Agglutination may be an important factor in
           colonisation. Non-agglutinating microorganisms might
           easily be distributed whereas aggregation might provide
           a better chance to avoid an antibody response since some
           of the epitopes may be buried.
          Length = 157

 Score = 27.2 bits (60), Expect = 6.0
 Identities = 16/50 (32%), Positives = 30/50 (60%)

Query: 86  TAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKL 135
           T + KL + +  +Q   T ++KE+   NQA+  N   +++SA+S+ D K+
Sbjct: 4   TLLAKLTDNDNTIQKAKTELDKEIQKANQAVASNNTASMQSAKSSLDAKV 53


>gnl|CDD|234734 PRK00349, uvrA, excinuclease ABC subunit A; Reviewed.
          Length = 943

 Score = 28.1 bits (64), Expect = 6.3
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 193 EVMMEEIREYKETLTCPSCKVKR 215
           E + EE+ +Y     CPSCK KR
Sbjct: 394 EYVREELEKYMSERPCPSCKGKR 416


>gnl|CDD|113156 pfam04375, HemX, HemX.  This family consists of several bacterial
           HemX proteins. The hemX gene is not essential for haem
           synthesis in B. subtilis. HemX is a polytopic membrane
           protein which by an unknown mechanism down-regulates the
           level of HemA.
          Length = 372

 Score = 27.6 bits (61), Expect = 7.0
 Identities = 19/85 (22%), Positives = 35/85 (41%), Gaps = 8/85 (9%)

Query: 23  QAFEDMQEQNSRL---LQQLREKDDANFKLMTERIKSNQLHKLAREEKDI--LHEQVNAL 77
           Q    +Q +   L   L QL +  +A        +  +    LA+ EK +  L + +  L
Sbjct: 53  QQVAGLQIKLEALAQELTQLAQALEAQLAQEVRVLIRDNQEALAQLEKVLGQLDKSLAEL 112

Query: 78  KLQIDAMHTAIRK---LEEKERYLQ 99
           + ++ A+  A R    L E +  L+
Sbjct: 113 QDRVAAISGANRNDWLLAEADFLLK 137


>gnl|CDD|224337 COG1419, FlhF, Flagellar GTP-binding protein [Cell motility and
           secretion].
          Length = 407

 Score = 27.7 bits (62), Expect = 7.4
 Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 3/94 (3%)

Query: 111 LRNQAMDLNKRKAIESAQSAAD-LKLHLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQ 169
           + ++      R+A +   + A  LK   EK   + K A+++     S +E +  + K   
Sbjct: 57  VDSELRKFQTREAAKVKDAEAQLLKDPAEKKREERKAAKKIERSTPSLIERKTQEVKDSG 116

Query: 170 EEIAQLRR--KTERMKKIEQAGTLDEVMMEEIRE 201
           EEIA++ R  K    +  E       +  +EI+ 
Sbjct: 117 EEIAEMMRDEKVPIRELEEIPPEFVALYKQEIQS 150


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 27.7 bits (62), Expect = 7.7
 Identities = 19/113 (16%), Positives = 36/113 (31%), Gaps = 11/113 (9%)

Query: 12  EALLNEMEVTGQAFEDMQEQNSRLLQQL-----------REKDDANFKLMTERIKSNQLH 60
           E  L + E   +  E   E+  +  + L           +    A+F    E        
Sbjct: 299 ENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADFYGNEEIKIELDKS 358

Query: 61  KLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRN 113
           K   E      ++   LK     +   + +L+E   Y ++  T +EK    + 
Sbjct: 359 KTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKA 411


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
          Provisional.
          Length = 389

 Score = 27.5 bits (62), Expect = 7.8
 Identities = 10/36 (27%), Positives = 17/36 (47%)

Query: 64 REEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQ 99
           E    L EQ+  L+L++  +     KLE +   L+
Sbjct: 7  EERNSELEEQIRQLELKLRDLEAENEKLERELERLK 42


>gnl|CDD|234162 TIGR03299, LGT_TIGR03299, phage/plasmid-like protein TIGR03299.
           Members of this uncharacterized protein family are found
           in various Mycobacterium phage genomes, in Streptomyces
           coelicolor plasmid SCP1, and in bacterial genomes near
           various markers that suggest lateral gene transfer. The
           function is unknown [Mobile and extrachromosomal element
           functions, Other].
          Length = 309

 Score = 27.4 bits (61), Expect = 8.1
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 9/60 (15%)

Query: 136 HLEKYHAQLKEAQQVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQAGTLDEVM 195
           H  K+ A+L EA+Q +       +  A + KRL E     R  ++R    + A  LD+V+
Sbjct: 177 HTAKFDARLDEAKQALGIVVKQFDQFAAEAKRLAE-----RPVSDR----QFAAFLDDVL 227


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 27.7 bits (62), Expect = 8.6
 Identities = 48/257 (18%), Positives = 98/257 (38%), Gaps = 64/257 (24%)

Query: 12  EALLNEMEVT----GQAFEDMQEQNSRLLQQLRE-----------KDDA--NFKLMTERI 54
           E  +NE+E+      +A  + Q+   +  QQ++E           ++DA     +   R 
Sbjct: 552 EGDINELEIALDHANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLAVAERRA 611

Query: 55  K---------------SNQLHKLAREEKDILHEQVNALKLQIDAMHTAIRKLEEKERYLQ 99
                           + +  K A  E     E+VN L  Q  ++    RKLE +   L 
Sbjct: 612 TALEAELEELRSALEQAERARKQAETELAEASERVNELTAQNSSLIAQKRKLEGE---LA 668

Query: 100 TVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLK------LHLEKYHAQLKEAQQVVTE 153
            + +++++ ++    A +  K+   ++A+ A +L+       HLE+   QL+   + +  
Sbjct: 669 ALQSDLDEAVNELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKELQV 728

Query: 154 KTSSLEAEAYK-----------------------TKRLQEEIAQLRRKTERMKKIEQAGT 190
           +    EA A K                        +R  E    LR+   R+K+++    
Sbjct: 729 RLDEAEAAALKGGKKMIQKLEARVRELEAELDGEQRRHAETQKNLRKMERRVKELQFQVE 788

Query: 191 LDEVMMEEIREYKETLT 207
            D+  +E +++  + L 
Sbjct: 789 EDKKNLERLQDLVDKLQ 805


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 27.4 bits (61), Expect = 8.7
 Identities = 39/146 (26%), Positives = 58/146 (39%), Gaps = 29/146 (19%)

Query: 72  EQVNALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAA 131
           E++  LK    A+    R+LEEK R +         EL +  Q     K+  I+  +S  
Sbjct: 312 EKMRMLKKARSALDACDRELEEKAREVS--------ELKMERQR----KKPQIDELESIV 359

Query: 132 DLKLHLEKYHAQLK------EA---QQVVTEKTSSLE---AEAYKTKRLQEEIAQLRRKT 179
            LK   E    QLK      EA   Q++   KT   E   A  Y   RL E   + +   
Sbjct: 360 RLK-QAEADMFQLKADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLF 418

Query: 180 ERMKKIEQAG----TLDEVMMEEIRE 201
           E +K  E++     ++   M  +IR 
Sbjct: 419 EELKLQEESHRDSDSMKMRMESKIRG 444


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 27.3 bits (61), Expect = 9.0
 Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 14/171 (8%)

Query: 25  FEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAM 84
            ED+     + + ++R  D+   KL     K     KL    KD   +       +   +
Sbjct: 16  NEDVSAFQRKFVNEVRRCDEVERKLRKLESKIK---KLGIPLKDTGGKPDVPPSKEFLDL 72

Query: 85  HTAIRKLEEKERYLQTVLTNVEKELH-LRNQAMDLNKRK-----AIESAQSAADLKLHLE 138
              I  LE + + ++  L ++EKE++ L      L++ K      +E     ++L +  +
Sbjct: 73  EEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNLDIDFK 132

Query: 139 KYHAQLKEAQQVVT-----EKTSSLEAEAYKTKRLQEEIAQLRRKTERMKK 184
                       V      EK  + E E ++  R     A++    E  KK
Sbjct: 133 YLRGAEGLKLGFVAGVINREKLEAFERELWRACRGYIRQAEIEEPLEDPKK 183


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 27.2 bits (60), Expect = 9.2
 Identities = 30/160 (18%), Positives = 60/160 (37%), Gaps = 6/160 (3%)

Query: 29  QEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKDILHEQVNALKLQIDAMHTAI 88
           Q+      QQ   K     +   E   + +L      E++ L +    L+ +        
Sbjct: 62  QQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQ----LEKERLKAQEQQ 117

Query: 89  RKLEEKERYLQTVLTNVEKELHLRNQAMDLNKRKAIESAQSAADLKLHLEKYHAQLKEAQ 148
           ++ EE E+  Q  L   ++E   R  A +  K+     A++AA+         A+ K  +
Sbjct: 118 KQAEEAEK--QAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEE 175

Query: 149 QVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKIEQA 188
                + +  +AEA   K+  E  A+   +  + +   +A
Sbjct: 176 AAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKA 215


>gnl|CDD|214763 smart00647, IBR, In Between Ring fingers.  the domains occurs
           between pairs og RING fingers.
          Length = 64

 Score = 25.1 bits (55), Expect = 9.3
 Identities = 5/13 (38%), Positives = 6/13 (46%)

Query: 222 KCFHVFCWDCLRT 234
           KC   FC+ C   
Sbjct: 45  KCGFSFCFRCKVP 57


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexins.  BAR domains are dimerization, lipid binding and
           curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 218

 Score = 26.9 bits (60), Expect = 9.3
 Identities = 28/160 (17%), Positives = 54/160 (33%), Gaps = 20/160 (12%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLREKDDANFKLMTERIKSNQLHKLAREEKD 68
                L + +   G+A   + +    +  +L E      K   E          A +E  
Sbjct: 32  KRRRELGSALGEFGKALIKLAKCEEEVGGELGEALSKLGKAAEELSSL--SEAQANQELV 89

Query: 69  ILHEQVN-------ALKLQIDAMHTAIRKLEEKERYLQTVLTNVEKELHLRNQAMDLNKR 121
            L E +        A+K  +D    A+  L+  ++ L +    +EK              
Sbjct: 90  KLLEPLKEYLRYCQAVKETLDDRADALLTLQSLKKDLASKKAQLEKLKAAPGIKPA---- 145

Query: 122 KAIESAQSAADLKLHLEKYHAQLKEAQQVVTEKTSSLEAE 161
                     +L+  LE+  + L+EA++   E +  L+ E
Sbjct: 146 -------KVEELEEELEEAESALEEARKRYEEISERLKEE 178


>gnl|CDD|185747 cd09234, V_HD-PTP_like, Protein-interacting V-domain of mammalian
           His-Domain type N23 protein tyrosine phosphatase and
           related domains.  This family contains the V-shaped (V)
           domain of mammalian His-Domain type N23 protein tyrosine
           phosphatase (HD-PTP, also known as PTPN23) and related
           domains. It belongs to the V_Alix_like superfamily which
           includes the V domains of  Bro1 and Rim20 (also known as
           PalA) from Saccharomyces cerevisiae, mammalian Alix
           (apoptosis-linked gene-2 interacting protein X/ also
           known as apoptosis-linked gene-2 interacting protein 1,
           AIP1), and related domains. HD_PTP interacts with the
           ESCRT (Endosomal Sorting Complexes Required for
           Transport) system, and participates in cell migration
           and endosomal trafficking. The related Alix V-domain
           (belonging to a different family in this superfamily)
           contains a binding site, partially conserved in the
           superfamily, for the retroviral late assembly (L) domain
           YPXnL motif. The Alix V-domain is also a dimerization
           domain. In addition to the V-domain, HD_PTP also has an
           N-terminal Bro1-like domain, a proline-rich region
           (PRR), a catalytically inactive tyrosine phosphatase
           domain, and a region containing a PEST motif. Bro1-like
           domains bind components of the ESCRT-III complex,
           specifically to CHMP4 in the case of HD-PTP. The
           Bro1-like domain of HD-PTP can also bind human
           immunodeficiency virus type 1 (HIV-1) nucleocapsid.
           HD-PTP is encoded by the PTPN23 gene, a tumor suppressor
           gene candidate frequently absent in human kidney,
           breast, lung, and cervical tumors. This family also
           contains Drosophila Myopic, which promotes epidermal
           growth factor receptor (EGFR) signaling, and
           Caenorhabditis elegans (enhancer of glp-1) EGO-2 which
           promotes Notch signaling.
          Length = 337

 Score = 27.3 bits (61), Expect = 9.4
 Identities = 16/46 (34%), Positives = 26/46 (56%), Gaps = 3/46 (6%)

Query: 9   NEEEALLNEMEVTGQAFEDMQEQNSRLLQQLRE---KDDANFKLMT 51
            E+EA+  E++       +M++Q   L QQLR+   +DD   KL+T
Sbjct: 184 PEDEAIEKELKRILNKVNEMRKQRRSLEQQLRDAIHEDDITSKLVT 229


>gnl|CDD|116585 pfam07975, C1_4, TFIIH C1-like domain.  The carboxyl-terminal
           region of TFIIH is essential for transcription activity.
           This regions binds three zinc atoms through two
           independent domain. The first contains a C4 zinc finger
           motif, whereas the second is characterized by a
           CX(2)CX(2-4)FCADCD motif. The solution structure of the
           second C-terminal domain revealed homology with the
           regulatory domain of protein kinase C (pfam00130).
          Length = 48

 Score = 24.6 bits (54), Expect = 9.5
 Identities = 11/33 (33%), Positives = 14/33 (42%), Gaps = 8/33 (24%)

Query: 207 TCPSCKVKRKDAVL--------TKCFHVFCWDC 231
            C  C+ K  D +          KC +VFC DC
Sbjct: 1   FCYGCQQKLPDGINKTSSVYRCPKCKNVFCLDC 33


>gnl|CDD|233062 TIGR00630, uvra, excinuclease ABC, A subunit.  This family is a
           member of the ABC transporter superfamily of proteins of
           which all members for which functions are known except
           the UvrA proteins are involved in the transport of
           material through membranes. UvrA orthologs are involved
           in the recognition of DNA damage as a step in nucleotide
           excision repair. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 925

 Score = 27.3 bits (61), Expect = 9.7
 Identities = 9/26 (34%), Positives = 12/26 (46%)

Query: 190 TLDEVMMEEIREYKETLTCPSCKVKR 215
           T  E M E + ++     CPSC   R
Sbjct: 390 TESESMREYLEKFMSERPCPSCGGTR 415


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 27.2 bits (61), Expect = 10.0
 Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 4/58 (6%)

Query: 137 LEKYHAQLKEAQ-QVVTEKTSSLEAEAYKTKRLQEEIAQLRRKTERMKKI---EQAGT 190
           LE    +   AQ QV       + A   +   LQ EI + + +     +    E  GT
Sbjct: 115 LEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQCEAEGT 172


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.127    0.348 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,850,290
Number of extensions: 1222711
Number of successful extensions: 3351
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2924
Number of HSP's successfully gapped: 620
Length of query: 262
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 167
Effective length of database: 6,723,972
Effective search space: 1122903324
Effective search space used: 1122903324
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 58 (26.2 bits)