RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5592
         (257 letters)



>gnl|CDD|201420 pfam00735, Septin, Septin.  Members of this family include CDC3,
           CDC10, CDC11 and CDC12/Septin. Members of this family
           bind GTP. As regards the septins, these are polypeptides
           of 30-65kDa with three characteristic GTPase motifs
           (G-1, G-3 and G-4) that are similar to those of the Ras
           family. The G-4 motif is strictly conserved with a
           unique septin consensus of AKAD. Most septins are
           thought to have at least one coiled-coil region, which
           in some cases is necessary for intermolecular
           interactions that allow septins to polymerise to form
           rod-shaped complexes. In turn, these are arranged into
           tandem arrays to form filaments. They are
           multifunctional proteins, with roles in cytokinesis,
           sporulation, germ cell development, exocytosis and
           apoptosis.
          Length = 280

 Score =  227 bits (581), Expect = 2e-74
 Identities = 88/141 (62%), Positives = 111/141 (78%)

Query: 1   MQRLHDKVNIIPVIAKADTMTPEECALFKKQILSEIAQHKIEIYQFPPGGSSEDDTSKFN 60
           M++LH+KVNIIPVIAKADT+TP+E   FKK+I +EI +  I+IY FP   S ED+  +  
Sbjct: 135 MKKLHEKVNIIPVIAKADTLTPDELQRFKKRIRAEIERQNIKIYHFPDEESDEDEEKEQT 194

Query: 61  KNLRDRVPFAVVGSNTVVEIDGKKVRGRKYPWGIAEVENLEHCDFIALRNMLIRTNLQDL 120
           + L+  +PFA+VGSNT +E DGKKVRGRKYPWG+ EVEN  HCDF+ LRNMLIRT+LQDL
Sbjct: 195 EQLKSSIPFAIVGSNTEIENDGKKVRGRKYPWGVVEVENPSHCDFLKLRNMLIRTHLQDL 254

Query: 121 KDVTSNVHYENFRCRKLAGLG 141
           K+VT  +HYEN+R  KL+ LG
Sbjct: 255 KEVTHELHYENYRSEKLSALG 275


>gnl|CDD|206649 cd01850, CDC_Septin, CDC/Septin GTPase family.  Septins are a
           conserved family of GTP-binding proteins associated with
           diverse processes in dividing and non-dividing cells.
           They were first discovered in the budding yeast S.
           cerevisiae as a set of genes (CDC3, CDC10, CDC11 and
           CDC12) required for normal bud morphology. Septins are
           also present in metazoan cells, where they are required
           for cytokinesis in some systems, and implicated in a
           variety of other processes involving organization of the
           cell cortex and exocytosis. In humans, 12 septin genes
           generate dozens of polypeptides, many of which comprise
           heterooligomeric complexes. Since septin mutants are
           commonly defective in cytokinesis and formation of the
           neck formation of the neck filaments/septin rings,
           septins have been considered to be the primary
           constituents of the neck filaments. Septins belong to
           the GTPase superfamily for their conserved GTPase motifs
           and enzymatic activities.
          Length = 275

 Score =  223 bits (571), Expect = 7e-73
 Identities = 85/141 (60%), Positives = 110/141 (78%), Gaps = 1/141 (0%)

Query: 1   MQRLHDKVNIIPVIAKADTMTPEECALFKKQILSEIAQHKIEIYQFPPGGSSEDDTSKFN 60
           M++L  KVNIIPVIAKADT+TPEE   FKK+I+ +I ++ I+IY+FP     +++  + N
Sbjct: 136 MKKLSKKVNIIPVIAKADTLTPEELTEFKKRIMEDIEENNIKIYKFP-EDEEDEEEIEEN 194

Query: 61  KNLRDRVPFAVVGSNTVVEIDGKKVRGRKYPWGIAEVENLEHCDFIALRNMLIRTNLQDL 120
           K L+  +PFA+VGSN  VE++GKKVRGRKYPWG+ EVEN EHCDF+ LRN+LIRT+LQDL
Sbjct: 195 KKLKSLIPFAIVGSNEEVEVNGKKVRGRKYPWGVVEVENEEHCDFVKLRNLLIRTHLQDL 254

Query: 121 KDVTSNVHYENFRCRKLAGLG 141
           K+ T NVHYEN+R  KL  L 
Sbjct: 255 KETTHNVHYENYRSEKLEALK 275


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score =  206 bits (526), Expect = 6e-65
 Identities = 92/227 (40%), Positives = 146/227 (64%), Gaps = 19/227 (8%)

Query: 1   MQRLHDKVNIIPVIAKADTMTPEECALFKKQILSEIAQHKIEIYQFPPGGSSEDDTSKFN 60
           M+RL  +VN+IPVIAKADT+T +E A FK++I  ++ Q+ I ++        ED++ + N
Sbjct: 156 MKRLSKRVNLIPVIAKADTLTDDELAEFKERIREDLEQYNIPVFDPYDPEDDEDESLEEN 215

Query: 61  KNLRDRVPFAVVGSNTVVEIDGKKVRGRKYPWGIAEVENLEHCDFIALRNMLIRTNLQDL 120
           ++LR  +PFA++GSNT +E  G++VRGRKYPWG+ E+++ EH DF  LRN+LIRT+LQ+L
Sbjct: 216 QDLRSLIPFAIIGSNTEIENGGEQVRGRKYPWGVVEIDDEEHSDFKKLRNLLIRTHLQEL 275

Query: 121 KDVTSNVHYENFRCRKLAGLGTDGKPRLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFE 180
           K+ T N+ YEN+R  KL+GL   G+P              K  HEA++ + E ++++ F 
Sbjct: 276 KETTENLLYENYRTEKLSGLKNSGEP------------SLKEIHEARLNEEERELKKKFT 323

Query: 181 MKVKEKKQKLKDSE-------IDMKKSLEAQIKELEEKRRGLELEIS 220
            K++EK+++L++ E        ++   LE   K+LE+  + LE   S
Sbjct: 324 EKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKLKS 370


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 36.2 bits (84), Expect = 0.014
 Identities = 30/99 (30%), Positives = 61/99 (61%), Gaps = 3/99 (3%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
           + ++EEE  E E K+ ++E ++E++ E +++E K++L++ E   K+ LE ++KELEE++ 
Sbjct: 830 IEELEEEIEELEEKLDELEEELEEL-EKELEELKEELEELE-AEKEELEDELKELEEEKE 887

Query: 214 GLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVK 252
            LE E+ + E      + E   +  ER   L+ K ++++
Sbjct: 888 ELEEELRELESEL-AELKEEIEKLRERLEELEAKLERLE 925



 Score = 28.9 bits (65), Expect = 3.2
 Identities = 20/97 (20%), Positives = 43/97 (44%)

Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLEL 217
             ++   E + +  E +       +  E  ++ ++      + LE +I+ELEEK   LE 
Sbjct: 790 ALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEE 849

Query: 218 EISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
           E+ + E+      +EL     E++   D  ++  ++K
Sbjct: 850 ELEELEKELEELKEELEELEAEKEELEDELKELEEEK 886



 Score = 28.1 bits (63), Expect = 6.0
 Identities = 19/66 (28%), Positives = 39/66 (59%), Gaps = 3/66 (4%)

Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLEL 217
           +E K E E K+++ E ++E++ E  ++E +++L+  E   +K+   Q  EL+ + R LEL
Sbjct: 171 KERKEEAERKLERTEENLERL-EDLLEELEKQLEKLERQAEKAERYQ--ELKAELRELEL 227

Query: 218 EISQWE 223
            +   +
Sbjct: 228 ALLLAK 233



 Score = 27.8 bits (62), Expect = 6.7
 Identities = 24/92 (26%), Positives = 50/92 (54%), Gaps = 7/92 (7%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEM--KVKEKKQKLKDSEIDMKKSLEAQ---IKEL 208
           L ++EEE  E E ++++++  +E++ E    ++E   KLK+   ++++  +A    ++EL
Sbjct: 739 LEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEEL 798

Query: 209 EEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
           EE+    E  +   E+   +   E RR  LE+
Sbjct: 799 EEELEEAERRLDALERE--LESLEQRRERLEQ 828



 Score = 27.8 bits (62), Expect = 7.0
 Identities = 21/109 (19%), Positives = 58/109 (53%), Gaps = 9/109 (8%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQV------FEMKVKEKKQKLKDSEIDMKKSLEAQIKE 207
           L+++EEE  E + ++++ E ++E++         +++E +++L + + +  + LE +I  
Sbjct: 248 LSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEE-IEELEGEISL 306

Query: 208 LEEKRRGLELEISQWEQSNGVSMDEL--RRRSLERDSSLDGKEKKVKKK 254
           L E+   LE E+ + E+      +++   +  LE   +L  + +++  +
Sbjct: 307 LRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAE 355



 Score = 27.8 bits (62), Expect = 8.0
 Identities = 20/104 (19%), Positives = 48/104 (46%), Gaps = 5/104 (4%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDM-----KKSLEAQIKEL 208
           ++ + E   E E +++++E  +E++ E     K++  +   +          LE   +EL
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEEL 363

Query: 209 EEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVK 252
           EEK   L  E+ +  ++    + EL     E  + L+  +++++
Sbjct: 364 EEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE 407


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 34.6 bits (80), Expect = 0.028
 Identities = 22/106 (20%), Positives = 51/106 (48%), Gaps = 4/106 (3%)

Query: 151 KNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEE 210
           +N ++Q+E E +E   ++K+ E  +  V + +             +   SLE ++ EL E
Sbjct: 58  ENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELME 117

Query: 211 KRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKKGL 256
           +   LE EI   ++     ++ L +   E ++ L+ +  +++++G 
Sbjct: 118 EIEKLEKEIEDLKE----RLERLEKNLAEAEARLEEEVAEIREEGQ 159


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 32.6 bits (75), Expect = 0.13
 Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 5/48 (10%)

Query: 194 EIDMKKS-LEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
           + +  KS LE +IK+LEE++  LE  +++ E      ++ + +R  E 
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEA----KLEAIEKREEEE 160


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 32.8 bits (74), Expect = 0.19
 Identities = 25/97 (25%), Positives = 45/97 (46%), Gaps = 4/97 (4%)

Query: 158  EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLEL 217
            EE K+  EA   + E   ++    + K +  + K  E   KK  +A  K+ EEK++  E 
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKADEA 1396

Query: 218  EISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
            +    E       DEL++ +  +  + + K+K  +KK
Sbjct: 1397 KKKAEEDKK--KADELKKAAAAKKKADEAKKKAEEKK 1431



 Score = 30.9 bits (69), Expect = 0.90
 Identities = 23/104 (22%), Positives = 50/104 (48%), Gaps = 3/104 (2%)

Query: 151  KNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEE 210
                A+ E E    EA+  + + +  +  + + K+K    K    + KK+ EA+ K  E+
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEED 1403

Query: 211  KRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
            K++  EL+ +   +      DE ++++ E+  + + K+K  + K
Sbjct: 1404 KKKADELKKAAAAKKKA---DEAKKKAEEKKKADEAKKKAEEAK 1444



 Score = 30.1 bits (67), Expect = 1.3
 Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 7/98 (7%)

Query: 158  EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQK-----LKDSEIDMKKSLEAQIKELEEKR 212
             EEK++ E   K  E +  +  E   K ++ K      K +E D KK+ EA  KE EE +
Sbjct: 1643 AEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702

Query: 213  RGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKK 250
            +  EL+  + E+      +EL++   E     +  +K+
Sbjct: 1703 KAEELKKKEAEEKK--KAEELKKAEEENKIKAEEAKKE 1738



 Score = 29.3 bits (65), Expect = 2.3
 Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 4/97 (4%)

Query: 158  EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLEL 217
            EE K++ +A  KK E + ++  E    E +    ++E   +K+  A+ K+ E K++    
Sbjct: 1325 EEAKKKADAAKKKAE-EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383

Query: 218  EISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
            +    E+      DE ++++ E     D  +K    K
Sbjct: 1384 KKKAEEKKKA---DEAKKKAEEDKKKADELKKAAAAK 1417



 Score = 27.8 bits (61), Expect = 6.8
 Identities = 25/100 (25%), Positives = 48/100 (48%), Gaps = 8/100 (8%)

Query: 159  EEKREHEAKMKKMEVDMEQVFEMKVKEKKQK----LKDSEIDMKKSLEAQIKELEEKRRG 214
            EEK++ +   KK E D ++  E+K     +K     K    + KK+ EA+ K+ EE ++ 
Sbjct: 1388 EEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK-KKAEEAKKA 1446

Query: 215  LELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
             E +    E       +E ++++ E   + + K+K  + K
Sbjct: 1447 DEAKKKAEEAKKA---EEAKKKAEEAKKADEAKKKAEEAK 1483



 Score = 27.8 bits (61), Expect = 7.0
 Identities = 23/108 (21%), Positives = 54/108 (50%), Gaps = 5/108 (4%)

Query: 147  RLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIK 206
            +L  +    + EE K+  EAK+K  E+   +    + K+K ++LK  E + KK  E ++K
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAE----EEKKKVEQLKKKEAEEKKKAE-ELK 1653

Query: 207  ELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
            + EE+ +    E ++  + +    +E ++   +   + +  +K+ ++ 
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701



 Score = 27.4 bits (60), Expect = 9.6
 Identities = 25/99 (25%), Positives = 44/99 (44%), Gaps = 6/99 (6%)

Query: 158  EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQ--KLKDSEIDMKKSLEAQIKELEEKRRGL 215
              +K+  EAK K  E       + K +E K+  + K    + KK+ EA+ K+ EE ++  
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAK-KKAEEAKKAD 1473

Query: 216  ELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
            E +    E       DE ++++ E     D  +K  + K
Sbjct: 1474 EAKKKAEEAKKA---DEAKKKAEEAKKKADEAKKAAEAK 1509


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 32.4 bits (75), Expect = 0.21
 Identities = 19/104 (18%), Positives = 53/104 (50%), Gaps = 13/104 (12%)

Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSE---IDMKKSLEAQIKELEEK 211
            + E  K+E   + K+    +   FE +++E++ +L+  E   +  +++L+ +++ LE++
Sbjct: 49  KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108

Query: 212 RRGLELEISQWEQSNGVSMDELRRRSLE-RDSSLDGKEKKVKKK 254
              LE +  + EQ         +++ LE ++  L+   ++  ++
Sbjct: 109 EEELEKKEKELEQ---------KQQELEKKEEELEELIEEQLQE 143



 Score = 28.2 bits (64), Expect = 5.1
 Identities = 17/57 (29%), Positives = 36/57 (63%), Gaps = 1/57 (1%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEE 210
           L Q EE        ++K E ++E+  E ++++K+Q+L+  E ++++ +E Q++ELE 
Sbjct: 91  LLQKEENLDRKLELLEKREEELEKK-EKELEQKQQELEKKEEELEELIEEQLQELER 146


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 32.2 bits (73), Expect = 0.29
 Identities = 23/82 (28%), Positives = 40/82 (48%), Gaps = 1/82 (1%)

Query: 140 LGTDGKPRLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKK 199
           L ++   R+  +  LA+ ++ K+E E    +         E   +  KQ  +D E+++KK
Sbjct: 485 LKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKK 544

Query: 200 SLEAQIKELEEKRRGLELEISQ 221
            LE  +K  EE+ R LE E  +
Sbjct: 545 -LEHDLKLKEEECRMLEKEAQE 565


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 31.8 bits (73), Expect = 0.33
 Identities = 17/79 (21%), Positives = 47/79 (59%), Gaps = 3/79 (3%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEM---KVKEKKQKLKDSEIDMKKSLEAQIKELEE 210
           L ++EEE +E E +++K++  ++++ +    ++  +K++  +  ++ K+ L  +++ELEE
Sbjct: 336 LKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEELEE 395

Query: 211 KRRGLELEISQWEQSNGVS 229
           + + L+ E+        +S
Sbjct: 396 ELKELKEELESLYSEGKIS 414


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 31.6 bits (72), Expect = 0.48
 Identities = 28/99 (28%), Positives = 50/99 (50%), Gaps = 12/99 (12%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
            A++ EE R+ +   + +E   +Q+ E +VKE + +L ++E    K  +  I++LE + R
Sbjct: 695 AARLAEELRQEQEHSQHLERLRKQL-ESQVKELQVRLDEAEAAALKGGKKMIQKLEARVR 753

Query: 214 GLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVK 252
            LE E           +D  +RR  E   +L   E++VK
Sbjct: 754 ELEAE-----------LDGEQRRHAETQKNLRKMERRVK 781


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 31.5 bits (71), Expect = 0.49
 Identities = 22/97 (22%), Positives = 46/97 (47%), Gaps = 7/97 (7%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKS------LEAQIKE 207
           +  + ++K+E++ ++ +   D+E   E    E KQ+LK   I  +        L+ + ++
Sbjct: 229 IQTLTQKKQEND-RVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETERIDLQLENEQ 287

Query: 208 LEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSL 244
           L E  R L+  +   +Q  G+  +EL      RD ++
Sbjct: 288 LHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTI 324


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 31.2 bits (71), Expect = 0.58
 Identities = 17/84 (20%), Positives = 39/84 (46%), Gaps = 9/84 (10%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
           + ++EEEK +   ++KK E  +EQ+     K +++            L+ +   +E++  
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYD---------LKEEYDRVEKELS 486

Query: 214 GLELEISQWEQSNGVSMDELRRRS 237
            L+ E+++ E     S + +R   
Sbjct: 487 KLQRELAEAEAQARASEERVRGGR 510



 Score = 30.0 bits (68), Expect = 1.3
 Identities = 23/117 (19%), Positives = 47/117 (40%), Gaps = 19/117 (16%)

Query: 154 LAQMEEEKREHEAKMKKMEV--DMEQVFEMKVKEKKQKLKDSE------IDMKKSLEAQI 205
             ++ +  R+     K++E     E+  + +++E ++ L   E          K LEA+I
Sbjct: 708 SQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARI 767

Query: 206 KELEEKRRGLELEI---------SQWEQSNGV--SMDELRRRSLERDSSLDGKEKKV 251
           +ELEE    LE  +         S+  +       ++E   R   R   ++ K  ++
Sbjct: 768 EELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRL 824



 Score = 27.7 bits (62), Expect = 7.2
 Identities = 25/125 (20%), Positives = 57/125 (45%), Gaps = 29/125 (23%)

Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDM----------------- 197
           A+ + +K +   +++++E ++E++ ++ + EK+Q+L+    +                  
Sbjct: 166 AEFDRKKEKALEELEEVEENIERL-DLIIDEKRQQLERLRREREKAERYQALLKEKREYE 224

Query: 198 -------KKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKK 250
                  K++LE Q + +E +   LE E+ +        + EL +R  E +  L+   KK
Sbjct: 225 GYELLKEKEALERQKEAIERQLASLEEELEKLT----EEISELEKRLEEIEQLLEELNKK 280

Query: 251 VKKKG 255
           +K  G
Sbjct: 281 IKDLG 285



 Score = 27.7 bits (62), Expect = 7.8
 Identities = 21/93 (22%), Positives = 45/93 (48%), Gaps = 11/93 (11%)

Query: 149 LNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKEL 208
             +  L   EE   + EA++ K+  ++E+  E +++E++++        +  L  +  EL
Sbjct: 312 EKERELEDAEERLAKLEAEIDKLLAEIEE-LEREIEEERKR--------RDKLTEEYAEL 362

Query: 209 EEKRRGLELEISQWEQSNGVSMDEL--RRRSLE 239
           +E+   L  E+ + ++    + DEL   R  LE
Sbjct: 363 KEELEDLRAELEEVDKEFAETRDELKDYREKLE 395


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 29.9 bits (68), Expect = 0.66
 Identities = 16/71 (22%), Positives = 45/71 (63%), Gaps = 2/71 (2%)

Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMK-KSLEAQIKELEEKRR 213
            Q+E+E ++ +A+++K E ++++  + K++++   L +     K + L+ + +EL++K++
Sbjct: 38  KQLEKEFKKLQAELQKKEKELQKEEQ-KLQKQAATLSEEARKAKQQELQQKQQELQQKQQ 96

Query: 214 GLELEISQWEQ 224
             + E+ Q +Q
Sbjct: 97  AAQQELQQKQQ 107



 Score = 26.4 bits (59), Expect = 9.2
 Identities = 16/84 (19%), Positives = 44/84 (52%), Gaps = 12/84 (14%)

Query: 173 VDMEQVFE--MKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSM 230
           VD+++V       K  +++L+     ++  L+ + KEL+++ + L        Q    ++
Sbjct: 21  VDVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKL--------QKQAATL 72

Query: 231 DELRRRSLERDSSLDGKEKKVKKK 254
            E  R++ +++  L  K++++++K
Sbjct: 73  SEEARKAKQQE--LQQKQQELQQK 94


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 30.7 bits (69), Expect = 0.73
 Identities = 19/108 (17%), Positives = 49/108 (45%), Gaps = 11/108 (10%)

Query: 152 NPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQ------I 205
           N +  M+++ +E   K++K++ ++E       +E+ + L+ +  ++ K L  Q       
Sbjct: 316 NYVNAMKQKSQEWPGKLEKLKSEIE-----LKEEEIKALQSNIDELHKQLRKQGISTEQF 370

Query: 206 KELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKK 253
           + + ++R  L  E+ +    +      ++ R LE        EK +++
Sbjct: 371 ELMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQ 418


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.8 bits (70), Expect = 0.74
 Identities = 21/69 (30%), Positives = 43/69 (62%), Gaps = 1/69 (1%)

Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVK-EKKQKLKDSEIDMKKSLEAQIKELEEKRRGL 215
           +  E  E   +++K+E +++++ E+K + E+ +K  +S    K+ LE +I+ELEE+   L
Sbjct: 212 ISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEEL 271

Query: 216 ELEISQWEQ 224
           + EI + E+
Sbjct: 272 KKEIEELEE 280


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 30.4 bits (70), Expect = 1.00
 Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 9/54 (16%)

Query: 166 AKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEI 219
            + K+   D E +   +VKE K+++K         LEA++ ELE +   L L I
Sbjct: 59  GQAKRKGEDAEALIA-EVKELKEEIKA--------LEAELDELEAELEELLLRI 103


>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869).  This
           bacterial family of proteins has no known function.
          Length = 131

 Score = 29.1 bits (66), Expect = 1.0
 Identities = 15/70 (21%), Positives = 36/70 (51%), Gaps = 4/70 (5%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
           L Q+E + ++   +++K      +  + +V++++ K  +     K  L  QI++++E   
Sbjct: 36  LQQLEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAKRLEQ----KNQLLFQIEQVQELPL 91

Query: 214 GLELEISQWE 223
           G E++  Q E
Sbjct: 92  GSEVKQGQVE 101


>gnl|CDD|193505 cd03884, M20_bAS, M20 Peptidase beta-alanine synthase, an
           amidohydrolase.  Peptidase M20 family, beta-alanine
           synthase (bAS; N-carbamoyl-beta-alanine amidohydrolase
           and beta-ureidopropionase; EC 3.5.1.6) subfamily. bAS is
           an amidohydrolase and is the final enzyme in the
           pyrimidine catabolic pathway, which is involved in the
           regulation of the cellular pyrimidine pool. The bAS
           catalyzes the irreversible hydrolysis of the
           N-carbamylated beta-amino acids to beta-alanine or
           aminoisobutyrate under the release of carbon dioxide and
           ammonia. Also included in this subfamily is allantoate
           amidohydrolase (allantoate deiminase), which catalyzes
           the conversion of allantoate to (S)-ureidoglycolate, one
           of the crucial alternate steps in purine metabolism. It
           is possible that these two enzymes arose from the same
           ancestral peptidase that evolved into two structurally
           related enzymes with distinct catalytic properties and
           biochemical roles within the cell. Yeast requires
           beta-alanine as a precursor of pantothenate and coenzyme
           A biosynthesis, but generates it mostly via degradation
           of spermine. Disorders in pyrimidine degradation and
           beta-alanine metabolism caused by beta-ureidopropionase
           deficiency (UPB1 gene) in humans are normally associated
           with neurological disorders.
          Length = 399

 Score = 30.1 bits (69), Expect = 1.2
 Identities = 13/45 (28%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 196 DMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
            +  ++ A  +E+   RRG+E+EI +   S  V  D     +LE 
Sbjct: 291 ALVAAIRAAAEEIAA-RRGVEVEIERLSDSPPVPFDPELVAALEA 334


>gnl|CDD|221482 pfam12240, Angiomotin_C, Angiomotin C terminal.  This domain family
           is found in eukaryotes, and is typically between 197 and
           211 amino acids in length. This family is the C terminal
           region of angiomotin. Angiomotin regulates the action of
           angiogenesis inhibitor angiostatin. The C terminal
           region of angiomotin appears to be involved in directing
           the protein chemotactically.
          Length = 204

 Score = 29.5 bits (66), Expect = 1.5
 Identities = 22/103 (21%), Positives = 50/103 (48%), Gaps = 14/103 (13%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSE-----IDMKKSLEAQIKEL 208
           LAQ++    + E   +++   +EQ  E++    +Q+  + +          +L+ Q++E 
Sbjct: 12  LAQLQAACEKREQLERRLRTRLEQ--ELRSLRAQQRQANRQGASTSSYNADALQEQLREK 69

Query: 209 EEKRRGLELEISQWEQSN-------GVSMDELRRRSLERDSSL 244
           EE+   LE ++++WEQ           +MD     + +RD+++
Sbjct: 70  EERILALEADMTKWEQKYLEESTMRQFAMDAAATAAAQRDTTI 112


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 29.7 bits (68), Expect = 1.5
 Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 3/51 (5%)

Query: 185 EKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRR 235
           E+ + L + E   ++ LE +I ELEE+ + +  ++ + E+     + EL R
Sbjct: 167 EEFEALPEEE---REELEEKIDELEEELQEILRQLRELEREAREKLRELDR 214


>gnl|CDD|219925 pfam08598, Sds3, Sds3-like.  Repression of gene transcription is
           mediated by histone deacetylases containing
           repressor-co-repressor complexes, which are recruited to
           promoters of target genes via interactions with
           sequence-specific transcription factors. The
           co-repressor complex contains a core of at least seven
           proteins. This family represents the conserved region
           found in Sds3, Dep1 and BRMS1-homologue p40 proteins.
          Length = 184

 Score = 29.2 bits (66), Expect = 1.6
 Identities = 27/99 (27%), Positives = 46/99 (46%), Gaps = 13/99 (13%)

Query: 156 QMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGL 215
           ++E  +RE+EA+ +  +    Q FE + +  +++L +        LE +I  LEE+RR L
Sbjct: 69  KLECIEREYEAERQAAK----QEFEKEKRLLRERLLEE-------LEEKIYRLEEERRSL 117

Query: 216 ELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
           ++  S +           R R L R  +      K KKK
Sbjct: 118 DITDSDYGLLGSHP--NRRTRKLRRRVNSPELVAKRKKK 154


>gnl|CDD|133381 cd01106, HTH_TipAL-Mta, Helix-Turn-Helix DNA binding domain of the
           transcription regulators TipAL, Mta, and SkgA.
           Helix-turn-helix (HTH) TipAL, Mta, and SkgA
           transcription regulators, and related proteins,
           N-terminal domain. TipAL regulates resistance to and
           activation by numerous cyclic thiopeptide antibiotics,
           such as thiostrepton. Mta is a global transcriptional
           regulator; the N-terminal DNA-binding domain of Mta
           interacts directly with the promoters of mta, bmr, blt,
           and ydfK, and induces transcription of these
           multidrug-efflux transport genes. SkgA has been shown to
           control stationary-phase expression of
           catalase-peroxidase in Caulobacter crescentus. These
           proteins are comprised of distinct domains that harbor
           an  N-terminal active (DNA-binding) site and a
           regulatory (effector-binding) site. The conserved
           N-terminal domain of these transcription regulators
           contains winged HTH motifs that mediate DNA binding.
           These proteins share the N-terminal DNA binding domain
           with other transcription regulators of the MerR
           superfamily that promote transcription by reconfiguring
           the spacer between the -35 and -10 promoter elements.
           Unique to this family, is a TipAL-like, lineage specific
           Bacilli subgroup, which has five conserved cysteines in
           the C-terminus of the protein.
          Length = 103

 Score = 28.2 bits (64), Expect = 1.7
 Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 179 FEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
               +KE K+ LKD   D+ ++L  Q + LEEK+  L+
Sbjct: 56  LGFSLKEIKELLKDPSEDLLEALREQKELLEEKKERLD 93


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 29.5 bits (67), Expect = 1.8
 Identities = 20/59 (33%), Positives = 31/59 (52%), Gaps = 9/59 (15%)

Query: 162 REHEAKMKKMEVDMEQVFEMKVKEKKQKL----KDSEID-----MKKSLEAQIKELEEK 211
            + E K+ K+ + +E VF   ++  K+K+     +SEID      K  LEAQ K L E+
Sbjct: 386 PKVEKKLDKLGLSIEDVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKRLREE 444


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 28.7 bits (65), Expect = 1.9
 Identities = 12/80 (15%), Positives = 45/80 (56%), Gaps = 3/80 (3%)

Query: 156 QMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGL 215
           +++    + + ++++++  + ++     K KK + +  E      L  ++K+LE++ + L
Sbjct: 66  KLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERT---ELLEELKQLEKELKKL 122

Query: 216 ELEISQWEQSNGVSMDELRR 235
           + E+ ++E+++   +++L+ 
Sbjct: 123 KAELEKYEKNDPERIEKLKE 142


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.7 bits (67), Expect = 1.9
 Identities = 18/69 (26%), Positives = 34/69 (49%)

Query: 186 KKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLD 245
              +  +  ++  KSLE ++++LEEK   LE E+ +  +        L  R  E +  L+
Sbjct: 312 ALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLE 371

Query: 246 GKEKKVKKK 254
             EK+++K 
Sbjct: 372 ELEKELEKA 380


>gnl|CDD|240411 PTZ00419, PTZ00419, valyl-tRNA synthetase-like protein;
           Provisional.
          Length = 995

 Score = 29.6 bits (67), Expect = 2.1
 Identities = 18/64 (28%), Positives = 34/64 (53%), Gaps = 4/64 (6%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQV-FEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKR 212
           LA++E++  + +  ++     +    +E KV E  +KL D +ID    L  +IK+LE+  
Sbjct: 931 LAKLEKKLAKLQKSLESYLKKISIPNYEDKVPEDVRKLNDEKID---ELNEEIKQLEQAI 987

Query: 213 RGLE 216
             L+
Sbjct: 988 EELK 991


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 29.5 bits (67), Expect = 2.1
 Identities = 21/57 (36%), Positives = 36/57 (63%), Gaps = 1/57 (1%)

Query: 168 MKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEISQWEQ 224
           + K+    E+  ++ VK  ++ +KD E +++K +E +IKELEE+   LE EI + EQ
Sbjct: 66  LPKLNPLREEKKKVSVKSLEELIKDVEEELEK-IEKEIKELEEEISELENEIKELEQ 121



 Score = 29.1 bits (66), Expect = 2.5
 Identities = 20/65 (30%), Positives = 35/65 (53%), Gaps = 5/65 (7%)

Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
           + +     E K K     +E++ +  V+E+ +K+ + EI   K LE +I ELE + + LE
Sbjct: 66  LPKLNPLREEKKKVSVKSLEELIK-DVEEELEKI-EKEI---KELEEEISELENEIKELE 120

Query: 217 LEISQ 221
            EI +
Sbjct: 121 QEIER 125


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 29.1 bits (65), Expect = 2.1
 Identities = 21/106 (19%), Positives = 41/106 (38%), Gaps = 8/106 (7%)

Query: 150 NKNPLAQMEEEKREHEAKMKKMEVDMEQV--FEMKVKEKKQKLKDSEIDMKKSLEAQIKE 207
           +   L+  E +  E E     ++   E+V   E  +   + KL   +   K+ L+A+  E
Sbjct: 193 DGISLSLEETKASEVEHLSASLKTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAE 252

Query: 208 LEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKK 253
           L +    LE   +  E        + R  ++E  + L G  + +  
Sbjct: 253 LSKALAALEAANAADED------PQDRDAAVEAAARLMGAARALVP 292


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 28.7 bits (65), Expect = 2.3
 Identities = 27/96 (28%), Positives = 50/96 (52%), Gaps = 10/96 (10%)

Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRG 214
           A+ E E +E   ++++ E  + Q  E  +  K + L+  E    +SLE + KEL  +++ 
Sbjct: 67  AEAERELKERRNELQRQEKRLLQKEET-LDRKDESLEKKE----ESLEEKEKELAARQQQ 121

Query: 215 LELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKK 250
           LE +  + E+     + E +++ LER S L  +E K
Sbjct: 122 LEEKEEELEE-----LIEEQQQELERISGLTAEEAK 152


>gnl|CDD|217877 pfam04079, DUF387, Putative transcriptional regulators (Ypuh-like).
            This family of conserved bacterial proteins are thought
           to possibly be helix-turn-helix type transcriptional
           regulators.
          Length = 159

 Score = 28.3 bits (64), Expect = 2.3
 Identities = 11/34 (32%), Positives = 22/34 (64%)

Query: 185 EKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELE 218
           E+  K+  S  +++++L+   +E E++ RGLEL 
Sbjct: 17  EELAKILGSREEVREALDELKEEYEDRDRGLELV 50


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 29.2 bits (66), Expect = 2.5
 Identities = 15/45 (33%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 196 DMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELR-RRSLE 239
              + + AQIK+LE +  GL+ ++    Q   V  +EL  RR L+
Sbjct: 144 AQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLK 188


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 28.3 bits (64), Expect = 2.7
 Identities = 10/48 (20%), Positives = 19/48 (39%)

Query: 171 MEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELE 218
           +  D+ ++     + + Q         K    ++ KELE K +  E E
Sbjct: 56  LAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAE 103


>gnl|CDD|235580 PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase
           subunit alpha; Validated.
          Length = 319

 Score = 28.6 bits (65), Expect = 3.0
 Identities = 16/55 (29%), Positives = 30/55 (54%), Gaps = 7/55 (12%)

Query: 168 MKKMEVDMEQ-VFEM--KVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLELEI 219
           M    +D E+ + E+  K++E +   +DS++D    L  +I+ LE+K   L  +I
Sbjct: 1   MMLNYLDFEKPIAELEAKIEELRAVAEDSDVD----LSEEIERLEKKLEELTKKI 51


>gnl|CDD|221059 pfam11285, DUF3086, Protein of unknown function (DUF3086).  This
           family of proteins with unknown function appears to be
           restricted to Cyanobacteria.
          Length = 283

 Score = 28.5 bits (64), Expect = 3.1
 Identities = 12/39 (30%), Positives = 23/39 (58%), Gaps = 3/39 (7%)

Query: 198 KKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRR 236
           +++LE +I++LE ++  +E E+       G S D + RR
Sbjct: 13  RQALEIEIEKLERRKEQIEQELRT--SFAGQS-DAIARR 48



 Score = 27.3 bits (61), Expect = 8.4
 Identities = 11/36 (30%), Positives = 22/36 (61%), Gaps = 9/36 (25%)

Query: 206 KELEEKRRGLELEISQWEQSNGVSMDELRRRSLERD 241
           K+LE++R+ LE+EI + E+         R+  +E++
Sbjct: 7   KDLEQRRQALEIEIEKLER---------RKEQIEQE 33


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 28.8 bits (65), Expect = 3.1
 Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 8/72 (11%)

Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKK----QKLKDS----EIDMKKSLEAQIK 206
           A  ++ + E E + +++    E+    +++ +     QKLK+      I++++    +IK
Sbjct: 219 ALEKQLRLEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIK 278

Query: 207 ELEEKRRGLELE 218
           E  E+ R   L 
Sbjct: 279 EKVEEERNGRLA 290


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 28.9 bits (65), Expect = 3.2
 Identities = 25/103 (24%), Positives = 51/103 (49%), Gaps = 11/103 (10%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQ------VFEMKVKEKKQKLKDSEIDMKKSLEAQIKE 207
           L + EEE  E  A+++++E  +E+        E +++E +++L     ++ + LE Q + 
Sbjct: 248 LKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISR-LEQQKQI 306

Query: 208 LEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKK 250
           L E+   LE ++ + E      ++EL  +  E    L   E+K
Sbjct: 307 LRERLANLERQLEELE----AQLEELESKLDELAEELAELEEK 345


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 27.2 bits (61), Expect = 3.9
 Identities = 20/79 (25%), Positives = 48/79 (60%), Gaps = 6/79 (7%)

Query: 154 LAQMEEEKREHEAKMKKME-VD-----MEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKE 207
           + Q+E + +E E  ++++E +D      + + ++ VK+ K+++K+   + K++LE +IK 
Sbjct: 21  IKQLETQLKEIELVLEELELLDEDTKVYKLIGDVLVKQDKEEVKEELEERKETLEKEIKT 80

Query: 208 LEEKRRGLELEISQWEQSN 226
           LE++   LE E+ + ++  
Sbjct: 81  LEKQLEKLEKELEELKEEL 99


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 28.5 bits (64), Expect = 3.9
 Identities = 12/62 (19%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
           ++E +       K++   + +  ++   E+++   D+  +    L+A+I  LEE    LE
Sbjct: 1   LKELREALAELAKELR-KLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELE 59

Query: 217 LE 218
            +
Sbjct: 60  AK 61


>gnl|CDD|147845 pfam05914, RIB43A, RIB43A.  This family consists of several
           RIB43A-like eukaryotic proteins. Ciliary and flagellar
           microtubules contain a specialised set of
           protofilaments, termed ribbons, that are composed of
           tubulin and several associated proteins. RIB43A was
           first characterized in the unicellular biflagellate,
           Chlamydomonas reinhardtii although highly related
           sequences are present in several higher eukaryotes
           including humans. The function of this protein is
           unknown although the structure of RIB43A and its
           association with the specialised protofilament ribbons
           and with basal bodies is relevant to the proposed role
           of ribbons in forming and stabilising doublet and
           triplet microtubules and in organising their
           three-dimensional structure. Human RIB43A homologues
           could represent a structural requirement in centriole
           replication in dividing cells.
          Length = 379

 Score = 28.5 bits (64), Expect = 4.1
 Identities = 21/86 (24%), Positives = 40/86 (46%), Gaps = 9/86 (10%)

Query: 149 LNKNPLAQMEEEKREHEAKMKKMEVDME----QVFEMKVKEKKQKLKDSEID-----MKK 199
           + +    ++E + R   AK++ + VD+E    QV E K++E  +K +D           +
Sbjct: 16  IERRRNRELERQSRIFNAKVRTIGVDLEALDLQVQEKKLREAAEKERDEAYAAQMLLNDR 75

Query: 200 SLEAQIKELEEKRRGLELEISQWEQS 225
             +   KE E+ RR L  ++  + Q 
Sbjct: 76  VAQLLQKEQEQDRRRLNKDLQDFRQQ 101


>gnl|CDD|227094 COG4752, COG4752, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 190

 Score = 27.9 bits (62), Expect = 4.3
 Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 2/49 (4%)

Query: 195 IDMKKSLEAQIKELE--EKRRGLELEISQWEQSNGVSMDELRRRSLERD 241
           + +  +LE  I+++E  E RR L +  S     N +S   LR    ERD
Sbjct: 85  VRLAYTLEEVIEDIEKEEGRRPLIVGTSARTYPNTISYSWLRNEIQERD 133


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 28.2 bits (63), Expect = 4.7
 Identities = 20/65 (30%), Positives = 38/65 (58%), Gaps = 2/65 (3%)

Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
             E K E    ++K+   +E+  E K +  ++ LK+ E  +KK LE +++EL+E+ R  +
Sbjct: 506 YGEFKEEINVLIEKLS-ALEKELEQKNEHLEKLLKEQEK-LKKELEQEMEELKERERNKK 563

Query: 217 LEISQ 221
           LE+ +
Sbjct: 564 LELEK 568


>gnl|CDD|218651 pfam05600, DUF773, Protein of unknown function (DUF773).  This
           family contains several eukaryotic sequences which are
           thought to be CDK5 activator-binding proteins, however,
           the function of this family is unknown.
          Length = 506

 Score = 28.1 bits (63), Expect = 4.9
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 18/81 (22%)

Query: 144 GKPRLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEA 203
             PR +++  L +  E+K + E K+K+    M         EK+Q+ ++     + SLE 
Sbjct: 428 ASPRYVDR--LVETLEQKLKQEDKLKEKRELM--------TEKRQEAREE----QSSLEP 473

Query: 204 QIKELEEKRRGL----ELEIS 220
           ++  L EK R L    E +IS
Sbjct: 474 KLDLLIEKTRELQKLIEADIS 494


>gnl|CDD|224073 COG1151, COG1151, 6Fe-6S prismane cluster-containing protein
           [Energy production and conversion].
          Length = 576

 Score = 28.1 bits (63), Expect = 5.0
 Identities = 10/24 (41%), Positives = 15/24 (62%)

Query: 194 EIDMKKSLEAQIKELEEKRRGLEL 217
           E D KK+ +   + +EEKR+ L L
Sbjct: 553 EEDPKKAADLLEEHIEEKRKKLGL 576


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 28.1 bits (63), Expect = 5.3
 Identities = 16/81 (19%), Positives = 39/81 (48%), Gaps = 1/81 (1%)

Query: 147 RLLNKNPLAQMEE-EKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQI 205
           R ++K     ME+ +++    K +   +    +    + + K+KLK       + L+A++
Sbjct: 269 RTVSKTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEV 328

Query: 206 KELEEKRRGLELEISQWEQSN 226
           KE +++++  E +  Q E+  
Sbjct: 329 KEKKKEKKKEEKKKKQIERLE 349


>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
          Length = 1047

 Score = 28.2 bits (63), Expect = 5.3
 Identities = 18/66 (27%), Positives = 35/66 (53%), Gaps = 9/66 (13%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSL---EAQIKELEE 210
           LAQ++   +    +  +    + ++ + + KEK Q+L D      K++   EA+IK+LE 
Sbjct: 442 LAQLQVAIQNVTQEQTQRNAALNEMRQ-RYKEKTQQLADV-----KTICEQEARIKDLEA 495

Query: 211 KRRGLE 216
           +R  L+
Sbjct: 496 QRAQLQ 501


>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e. 
          Length = 108

 Score = 26.8 bits (60), Expect = 5.4
 Identities = 11/45 (24%), Positives = 24/45 (53%), Gaps = 5/45 (11%)

Query: 161 KREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQI 205
           KRE + K KK E +         K++K+++ +     +K+++A +
Sbjct: 38  KREKKKKKKKSEGEF-----FAEKKEKKEVSEQRKADQKAVDAAL 77


>gnl|CDD|153348 cd07664, BAR_SNX2, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexin 2.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. SNX2 is a component of the retromer
           complex, a membrane coat multimeric complex required for
           endosomal retrieval of lysosomal hydrolase receptors to
           the Golgi. The retromer consists of a cargo-recognition
           subcomplex and a subcomplex formed by a dimer of sorting
           nexins (SNX1 and/or SNX2), which ensures effcient cargo
           sorting by facilitating proper membrane localization of
           the cargo-recognition subcomplex. BAR domains form
           dimers that bind to membranes, induce membrane bending
           and curvature, and may also be involved in
           protein-protein interactions.
          Length = 234

 Score = 27.7 bits (61), Expect = 5.5
 Identities = 22/73 (30%), Positives = 44/73 (60%), Gaps = 5/73 (6%)

Query: 143 DGKPRLLNK-NPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKE----KKQKLKDSEIDM 197
           + K +  NK + L Q ++E +E EAK+++ E D EQ+ +   KE    +K+++KD +  +
Sbjct: 149 EAKLQYANKPDKLQQAKDEIKEWEAKVQQGERDFEQISKTIRKEVGRFEKERVKDFKTVI 208

Query: 198 KKSLEAQIKELEE 210
            K LE+ ++  ++
Sbjct: 209 IKYLESLVQTQQQ 221


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 27.4 bits (61), Expect = 5.7
 Identities = 22/103 (21%), Positives = 52/103 (50%), Gaps = 7/103 (6%)

Query: 158 EEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEID------MKKSLEAQIKELEEK 211
           + E  E + ++ +++ ++E   E ++ E + +L+D + D      + KSLE +++ LEE 
Sbjct: 78  QGELSELKQQLSELQEELED-LEERIAELESELEDLKEDLQLLRELLKSLEERLESLEES 136

Query: 212 RRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
            + L  E+ +  Q     ++ELR        +L   ++ +++ 
Sbjct: 137 IKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQEL 179


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 27.8 bits (62), Expect = 5.8
 Identities = 17/75 (22%), Positives = 35/75 (46%), Gaps = 7/75 (9%)

Query: 144 GKPRLLNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEA 203
           G  R    + L + +EE+ E E + +           MK  E +      E+++ + LE 
Sbjct: 103 GATRNYEADKLDEEQEERVEKEREEELAGD------AMKKLENRTADSKREMEVLERLE- 155

Query: 204 QIKELEEKRRGLELE 218
           ++KEL+ +R  +++ 
Sbjct: 156 ELKELQSRRADVDVN 170


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 27.8 bits (62), Expect = 5.9
 Identities = 20/105 (19%), Positives = 48/105 (45%), Gaps = 7/105 (6%)

Query: 155 AQMEEEKREHEAKMKKMEVDMEQVFEMKVK-EKKQKLKDSEIDMKKSLEAQIKELEE--- 210
            ++  E +   +K+K++  ++  V +   + E++ K+K   +++    E  + +L+    
Sbjct: 287 EEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQALVV 346

Query: 211 --KRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKK 253
               R LEL   QWE      +DE R    +  +  D  ++++ +
Sbjct: 347 ASSERLLELA-QQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDE 390


>gnl|CDD|184042 PRK13415, PRK13415, flagella biosynthesis protein FliZ;
           Provisional.
          Length = 219

 Score = 27.4 bits (61), Expect = 6.1
 Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 2/59 (3%)

Query: 156 QMEEEKREHEAKMKKME--VDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKR 212
           ++EE   +HE +++         Q    + K K ++ + +       L+ ++KEL+EKR
Sbjct: 145 EIEEILAQHEERLESKAEWSRWGQKLRDQWKGKSKQKQTTLPSFSALLKEELKELKEKR 203


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 27.6 bits (62), Expect = 6.3
 Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 7/58 (12%)

Query: 156 QMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
           + EEEK    A+ ++ E   E+  E K +E++ KL       K S E Q K LEEK R
Sbjct: 266 EEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLA------KLSPEEQRK-LEEKER 316


>gnl|CDD|220582 pfam10117, McrBC, McrBC 5-methylcytosine restriction system
           component.  Members of this family of bacterial proteins
           modify the specificity of mcrB restriction by expanding
           the range of modified sequences restricted.
          Length = 317

 Score = 27.6 bits (62), Expect = 6.6
 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 13/64 (20%)

Query: 118 QDLKDVTSNVHYENFR-CRKLAGLGTDGKPRLLNKNPLAQMEEEKREHEAKMKKMEVDME 176
           +D +    N     +     LA L       L N++PLA       + + K   +   ME
Sbjct: 171 KDFQKWRLNRLNARYELLLPLARLI------LSNQSPLAS------QGDQKAYSLLFPME 218

Query: 177 QVFE 180
           ++FE
Sbjct: 219 KLFE 222


>gnl|CDD|236456 PRK09290, PRK09290, allantoate amidohydrolase; Reviewed.
          Length = 413

 Score = 27.8 bits (63), Expect = 6.9
 Identities = 12/45 (26%), Positives = 19/45 (42%), Gaps = 1/45 (2%)

Query: 196 DMKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
            +   L A  + +   RRG+E+EI    +   V  D     +LE 
Sbjct: 299 ALVAELRAAAEAIAA-RRGVEVEIELISRRPPVPFDPGLVAALEE 342


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 27.7 bits (62), Expect = 7.3
 Identities = 24/104 (23%), Positives = 51/104 (49%), Gaps = 13/104 (12%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
           + +  E   E EA+++ +E ++E     +V+E+ ++ +D         E +I+ LEE+R 
Sbjct: 470 IEEDRERVEELEAELEDLEEEVE-----EVEERLERAEDL-----VEAEDRIERLEERRE 519

Query: 214 GLELEISQWE---QSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
            LE  I++     +      +ELR R+ E ++  + K +   + 
Sbjct: 520 DLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEA 563


>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
           pleckstrin homology (PH) domain.  The BCR gene is one of
           the two genes in the BCR-ABL complex, which is
           associated with the Philadelphia chromosome, a product
           of a reciprocal translocation between chromosomes 22 and
           9. BCR is a GTPase-activating protein (GAP) for RAC1
           (primarily) and CDC42. The Dbl region of BCR has the
           most RhoGEF activity for Cdc42, and less activity
           towards Rac and Rho. Since BCR possesses both GAP and
           GEF activities, it may function to temporally regulate
           the activity of these GTPases. It also displays
           serine/threonine kinase activity. The BCR protein
           contains multiple domains including an N-terminal kinase
           domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
           domain, and a C-terminal RhoGAP domain. ABR, a related
           smaller protein, is structurally similar to BCR, but
           lacks the N-terminal kinase domain and has GAP activity
           for both Rac and Cdc42. PH domains have diverse
           functions, but in general are involved in targeting
           proteins to the appropriate cellular location or in the
           interaction with a binding partner. They share little
           sequence conservation, but all have a common fold, which
           is electrostatically polarized. Less than 10% of PH
           domains bind phosphoinositide phosphates (PIPs) with
           high affinity and specificity. PH domains are
           distinguished from other PIP-binding domains by their
           specific high-affinity binding to PIPs with two vicinal
           phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
           PtdIns(3,4,5)P3 which results in targeting some PH
           domain proteins to the plasma membrane. A few display
           strong specificity in lipid binding. Any specificity is
           usually determined by loop regions or insertions in the
           N-terminus of the domain, which are not conserved across
           all PH domains. PH domains are found in cellular
           signaling proteins such as serine/threonine kinase,
           tyrosine kinases, regulators of G-proteins, endocytotic
           GTPases, adaptors, as well as cytoskeletal associated
           molecules and in lipid associated enzymes.
          Length = 179

 Score = 26.9 bits (60), Expect = 7.6
 Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 4/74 (5%)

Query: 153 PLAQME-EEKREHEAKMKKMEVDMEQVFEMKVKE--KKQKLKDSEIDMKKSLEAQIKELE 209
           PL  +    K E EA    +    E++ +MK      KQK+K    + KK+  +  + +E
Sbjct: 65  PLRDLSLHPKDESEAAPDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRAIE 124

Query: 210 EKRRGL-ELEISQW 222
           + R+ L E E +  
Sbjct: 125 KLRKKLAEQEAALL 138


>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
           outer membrane].
          Length = 170

 Score = 27.0 bits (60), Expect = 7.9
 Identities = 21/101 (20%), Positives = 43/101 (42%), Gaps = 12/101 (11%)

Query: 154 LAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRR 213
            A +E E ++ + +++KM+ +++         K+ KL+D    M+   +    E E K+ 
Sbjct: 46  SADLESEFKKRQKELQKMQKELKA--------KEAKLQDD-GKMEALSDRAKAEAEIKKE 96

Query: 214 GLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKK 254
            L       +Q       +L RR  E +  L  K ++  + 
Sbjct: 97  KLVNA-FNKKQQE--YEKDLNRREAEEEQKLLEKIQRAIES 134


>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein.  All proteins in this family
           for which functions are known are used for the
           recognition of DNA damage as part of nucleotide excision
           repair. This family is based on the phylogenomic
           analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
           University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 172

 Score = 27.1 bits (60), Expect = 7.9
 Identities = 17/63 (26%), Positives = 28/63 (44%)

Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSLEAQIKELEEKRRGLE 216
            E    E E + +  E   E+ FE K+KE ++ ++ SE   KK       E   +  G+E
Sbjct: 93  EEALDEEKERREESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHEFGPETNGVE 152

Query: 217 LEI 219
            + 
Sbjct: 153 EDT 155


>gnl|CDD|235171 PRK03903, PRK03903, transaldolase; Provisional.
          Length = 274

 Score = 27.3 bits (61), Expect = 8.0
 Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 2/51 (3%)

Query: 151 KNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMKKSL 201
           K PL    EE      ++K   +D+E  ++  +K+  +  K +  D+ KSL
Sbjct: 226 KKPL--KIEEIEAFFKELKSHNIDLENTYQKLLKDGLEAFKQAFEDILKSL 274


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 27.7 bits (62), Expect = 8.5
 Identities = 14/58 (24%), Positives = 29/58 (50%)

Query: 199 KSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDGKEKKVKKKGL 256
           + L+ ++K+LEEK   LE EI + E+    + ++   ++ ++ +           KGL
Sbjct: 98  EELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGL 155


>gnl|CDD|218031 pfam04335, VirB8, VirB8 protein.  VirB8 is a bacterial virulence
           protein with cytoplasmic, transmembrane, and periplasmic
           regions. It is thought that it is a primary constituent
           of a DNA transporter. The periplasmic region interacts
           with VirB9, VirB10, and itself. This family also
           includes the conjugal transfer protein family TrbF, a
           family of proteins known to be involved in conjugal
           transfer. The TrbF protein is thought to compose part of
           the pilus required for transfer. This domain has a
           similar fold to the NTF2 protein.
          Length = 208

 Score = 26.9 bits (60), Expect = 8.5
 Identities = 11/31 (35%), Positives = 19/31 (61%)

Query: 51  SSEDDTSKFNKNLRDRVPFAVVGSNTVVEID 81
           S+    +++N   R   PFA +G NTV+E++
Sbjct: 109 STGAVAAEYNAYFRPNSPFARLGKNTVIEVE 139


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 27.0 bits (60), Expect = 8.5
 Identities = 25/87 (28%), Positives = 44/87 (50%), Gaps = 4/87 (4%)

Query: 156 QMEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQKLKDSEIDMK-KSLEAQIKELEEKRRG 214
           + E E++E E +M++ME DM      + + ++ +    E++ K K  E + + LE+K   
Sbjct: 2   EAEREQQELEERMEQMEEDMR---RAQKELEEYEETALELEEKLKQEEEEAQLLEKKADE 58

Query: 215 LELEISQWEQSNGVSMDELRRRSLERD 241
           LE E  + E+    S +E  R   E D
Sbjct: 59  LEEENRRLEEEAAASEEERERLEAEVD 85


>gnl|CDD|226799 COG4353, COG4353, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 26.7 bits (59), Expect = 9.2
 Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 4/45 (8%)

Query: 202 EAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLERDSSLDG 246
           E  I++LE  R  +E  I + E        +L R+S +R    + 
Sbjct: 151 EEGIRQLETIREEIENFIIRKED----EGIDLYRKSRDRTVYWEI 191


>gnl|CDD|237250 PRK12893, PRK12893, allantoate amidohydrolase; Reviewed.
          Length = 412

 Score = 27.2 bits (61), Expect = 9.3
 Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 1/44 (2%)

Query: 197 MKKSLEAQIKELEEKRRGLELEISQWEQSNGVSMDELRRRSLER 240
           M+ +L A   ++    RG+++ +        V  D      +E 
Sbjct: 299 MEAALRAACAKIAA-ARGVQVTVETVWDFPPVPFDPALVALVEA 341


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 27.3 bits (61), Expect = 9.4
 Identities = 15/66 (22%), Positives = 35/66 (53%), Gaps = 11/66 (16%)

Query: 155 AQMEEEKREHEAKMKKMEVDMEQ-----------VFEMKVKEKKQKLKDSEIDMKKSLEA 203
             ME ++R ++  +K++   ME            + E K++E+++ LK+      +SL+ 
Sbjct: 232 QMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFKTEAESLQK 291

Query: 204 QIKELE 209
           +I++L+
Sbjct: 292 EIQDLK 297



 Score = 27.3 bits (61), Expect = 9.7
 Identities = 15/72 (20%), Positives = 32/72 (44%), Gaps = 8/72 (11%)

Query: 158 EEEKREHEAKMKKMEVDME----QVFEMKVKEKKQKLKDSEIDMKKSLEAQ----IKELE 209
            E+++E E  M+  E   +    Q+ E    E+++ L + E  ++  L+ Q     +  +
Sbjct: 224 REKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEELLKEGFK 283

Query: 210 EKRRGLELEISQ 221
            +   L+ EI  
Sbjct: 284 TEAESLQKEIQD 295


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 27.3 bits (60), Expect = 9.8
 Identities = 18/55 (32%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 149  LNKNPLAQMEEEKREHEAKMKKMEVDMEQVFEMKVKEKK-QKLKDSEIDMKKSLE 202
            L K  L +++EEK++ E  +K ME  +E + E+K K  + +     E+D+K  +E
Sbjct: 1223 LGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEME 1277


>gnl|CDD|152255 pfam11819, DUF3338, Domain of unknown function (DUF3338).  This
           family of proteins are functionally uncharacterized.
           This family is found in eukaryotes. This presumed domain
           is about 130 amino acids in length.
          Length = 138

 Score = 26.3 bits (58), Expect = 10.0
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query: 157 MEEEKREHEAKMKKMEVDMEQVFEMKVKEKKQ 188
            EE+K+E    +KK +  +E+  E K++E K+
Sbjct: 26  SEEQKKEKLRALKKKQQALEERLEKKLEELKK 57


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.132    0.366 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,200,282
Number of extensions: 1288458
Number of successful extensions: 2473
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2202
Number of HSP's successfully gapped: 444
Length of query: 257
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 162
Effective length of database: 6,723,972
Effective search space: 1089283464
Effective search space used: 1089283464
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.2 bits)