RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14443
         (235 letters)



>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947).  Family of
           eukaryotic proteins with unknown function.
          Length = 168

 Score = 89.3 bits (222), Expect = 2e-22
 Identities = 68/169 (40%), Positives = 101/169 (59%), Gaps = 2/169 (1%)

Query: 60  KRENKNRPREISSKIPQKKFAQQSAPVIPVQKNIPRDPRFDSLCGTFKEKAFHSAYKFID 119
           KR++K+ P E+SSK P  +        +  +K   RDPRFDSL G      F   YKF+D
Sbjct: 1   KRKSKHAPMELSSKKPVSRKRDI--IEVEKKKPKVRDPRFDSLSGKLNLDKFRKNYKFLD 58

Query: 120 DIREKEISILKKKLKKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHTARIQA 179
           D REKEI  L+K LKK K+ EEKE++K  +Q M+++++  K +  + +  KE      + 
Sbjct: 59  DYREKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKEL 118

Query: 180 LKHGEKPLFRKKSEKTMLNLVQKFETLKKTGKINKHIEKKTKALKAKER 228
           +K G+KP + KKSE   L L +KF+ LKK+ +++K +EKK K    KE+
Sbjct: 119 IKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEKKRKKNAGKEK 167


>gnl|CDD|237953 PRK15378, PRK15378, inositol phosphate phosphatase SopB;
           Provisional.
          Length = 564

 Score = 31.8 bits (72), Expect = 0.29
 Identities = 19/94 (20%), Positives = 37/94 (39%), Gaps = 9/94 (9%)

Query: 135 KEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHTARIQALKH--GEKPLFRKKS 192
             + P+  E V  L +++++       +H KD  E  +   R+  L +     P +  KS
Sbjct: 408 LAQYPDNYEVVNTLARQIKDIW--KNNQHHKDGGEPYKLAQRLAMLAYEIDAVPAWNCKS 465

Query: 193 EKT---MLNLVQKFETLKK--TGKINKHIEKKTK 221
            K    M++   K E +    TG ++      + 
Sbjct: 466 GKDRTGMMDSEIKREIISLHQTGMLSAPGSLPSS 499


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 31.3 bits (72), Expect = 0.43
 Identities = 18/96 (18%), Positives = 40/96 (41%), Gaps = 3/96 (3%)

Query: 140 EEKEKVKYLVQRMENQMREAKQRHE--KDQQEKEEHTARIQALKHGEKPLFRKKSEKTML 197
              ++ + L + +E +  + ++  +   ++ EKE   A  +A K  ++ +   +  +   
Sbjct: 541 ALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGG 600

Query: 198 NLVQKFETLK-KTGKINKHIEKKTKALKAKERSQHN 232
               K   L     ++NK  EKK K  K ++  Q  
Sbjct: 601 YASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636



 Score = 29.0 bits (66), Expect = 2.7
 Identities = 19/92 (20%), Positives = 45/92 (48%), Gaps = 3/92 (3%)

Query: 140 EEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHTARIQALKHGEKPLFRKKSEKTMLNL 199
           E+KEK+  L+  +E   RE +Q+ E+ +   +E     + L+  +K   +++ +K +   
Sbjct: 513 EDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEE-KKEKLQEEEDKLLEEA 571

Query: 200 VQKFETLKKTGKINKHIEKKTKALKAKERSQH 231
            ++ +   K  K  K  ++  K L+  ++  +
Sbjct: 572 EKEAQQAIKEAK--KEADEIIKELRQLQKGGY 601


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 31.3 bits (72), Expect = 0.49
 Identities = 28/115 (24%), Positives = 53/115 (46%), Gaps = 14/115 (12%)

Query: 115 YKFIDDIREKEISILKKKLKKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHT 174
            K I + + KE     K++ +E   E +   K  +   + ++ + +   EK+ +E+    
Sbjct: 25  RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNE- 83

Query: 175 ARIQALKHGEKPLFRKKSEKTMLNLVQKFETLKKTGKINKHIEKKTKALKAKERS 229
                L+  EK L +K+      NL +K E L+K     + +EKK K L+ K++ 
Sbjct: 84  -----LQKLEKRLLQKEE-----NLDRKLELLEKR---EEELEKKEKELEQKQQE 125


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 30.4 bits (69), Expect = 0.78
 Identities = 23/78 (29%), Positives = 39/78 (50%), Gaps = 10/78 (12%)

Query: 149 VQRMENQMREAKQRHEKDQQEKEEHTARIQALKHGEKPLFRKKSEKTMLNLVQKFETLKK 208
           ++RM   ++EA+   E+D++ KEE  AR  A           ++EKT+     K    +K
Sbjct: 503 IERM---VKEAEANAEEDKKRKEEIEARNNADS------LAYQAEKTLKEAGDKLPAEEK 553

Query: 209 TGKINKHIEKKTKALKAK 226
             KI K + +  +ALK +
Sbjct: 554 E-KIEKAVAELKEALKGE 570


>gnl|CDD|169397 PRK08350, PRK08350, hypothetical protein; Provisional.
          Length = 341

 Score = 29.4 bits (66), Expect = 1.5
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query: 1   MESSDIDEERVAIREELKGLTFEQLQKLKEEMGTKL 36
           ME +D+ +    I E  K ++ E L K  E+ G +L
Sbjct: 150 MEITDVVDAVNKILENSKEVSLEGLSKASEKAGDEL 185


>gnl|CDD|148701 pfam07246, Phlebovirus_NSM, Phlebovirus nonstructural protein NS-M.
            This family consists of several Phlebovirus
           nonstructural NS-M proteins which represent the
           N-terminal region of the M polyprotein precursor. The
           function of this family is unknown.
          Length = 264

 Score = 29.5 bits (65), Expect = 1.5
 Identities = 28/111 (25%), Positives = 47/111 (42%), Gaps = 1/111 (0%)

Query: 117 FIDDIREKEISILKKKLKKEKNPEEKEKVKYL-VQRMENQMREAKQRHEKDQQEKEEHTA 175
           FI +I   E S      K +KN EEK+       + ME   R     HE  Q++ E    
Sbjct: 128 FIRNITTGEQSPRVDYEKLKKNAEEKDATIQRKTKEMEEDSRNQIAHHEIQQKKNEIQKL 187

Query: 176 RIQALKHGEKPLFRKKSEKTMLNLVQKFETLKKTGKINKHIEKKTKALKAK 226
           R    +  E    + + E++M  +    E LK+     +H + + ++L+ K
Sbjct: 188 RNDLKRGQEHRDAKLRVEESMARVKGLKEELKQLTDELQHAKSEMQSLRKK 238


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 29.1 bits (65), Expect = 2.1
 Identities = 15/80 (18%), Positives = 30/80 (37%)

Query: 115 YKFIDDIREKEISILKKKLKKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHT 174
           + F +D+ E+E    +   +       K     L   ++   R  ++  EK   +KE   
Sbjct: 754 FCFNEDLEEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKI 813

Query: 175 ARIQALKHGEKPLFRKKSEK 194
                   GE+   +++ EK
Sbjct: 814 ESPSVETEGERCTIKQREEK 833


>gnl|CDD|234592 PRK00045, hemA, glutamyl-tRNA reductase; Reviewed.
          Length = 423

 Score = 29.0 bits (66), Expect = 2.2
 Identities = 15/63 (23%), Positives = 30/63 (47%), Gaps = 7/63 (11%)

Query: 115 YKFIDDIREKEISILKKKLKKEKNPEEKEKVKYLVQRMENQM-----REAKQRHEKDQQE 169
            +  ++IRE+E+    KKL   ++ E  E ++ L + + N++        K+  E+   E
Sbjct: 349 REQAEEIREEELERALKKLGPGEDEE--EVLEKLARSLVNKLLHAPTVRLKEAAEEGDDE 406

Query: 170 KEE 172
             E
Sbjct: 407 YLE 409


>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 304

 Score = 28.5 bits (64), Expect = 2.6
 Identities = 8/38 (21%), Positives = 22/38 (57%), Gaps = 1/38 (2%)

Query: 150 QRMENQMREAKQRHEKDQQEKEEHTARIQALKHGEKPL 187
           Q ++ Q++  ++++  D+Q+      ++Q  +HG  P+
Sbjct: 62  QELQPQIKALQKKYGNDRQKMALEMQKLQK-EHGFNPI 98


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 28.7 bits (64), Expect = 2.9
 Identities = 16/60 (26%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 136 EKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKE-EHTARIQALKHGEKPLFRKKSEK 194
           E   E+K   +  ++R + +  +AK++  K  +  + E  A++QA +  +     KKSEK
Sbjct: 5   ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 28.2 bits (63), Expect = 3.1
 Identities = 12/45 (26%), Positives = 24/45 (53%)

Query: 150 QRMENQMREAKQRHEKDQQEKEEHTARIQALKHGEKPLFRKKSEK 194
           QR+ N+ ++A+QR ++  QE E +  + Q+L+        +  E 
Sbjct: 39  QRIANRWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEA 83


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 28.5 bits (64), Expect = 3.4
 Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 2/72 (2%)

Query: 107 KEKAFHSAYKFIDDIREKEISILKKKLKKEKNPEEKEKVKYLVQRME-NQMREAKQRHEK 165
           +E+  +     ++DI    +   K+KL+K K+ +EK K K  +  +  + M + K  + K
Sbjct: 239 EEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAK-KLGISIILFDDMTKNKTTNYK 297

Query: 166 DQQEKEEHTARI 177
            Q E  +    I
Sbjct: 298 IQNEDPDFITSI 309


>gnl|CDD|182528 PRK10535, PRK10535, macrolide transporter ATP-binding /permease
           protein; Provisional.
          Length = 648

 Score = 28.5 bits (64), Expect = 3.5
 Identities = 18/48 (37%), Positives = 23/48 (47%), Gaps = 8/48 (16%)

Query: 166 DQQEKEEHTARIQALKHGEKPLFR-------KKSEKTMLNLVQKFETL 206
           D  E E+   R+  L+HG+K  F        K +EKT   L Q F TL
Sbjct: 481 DSAEAEQQLTRLLTLRHGKKDFFTWNMDSVLKTAEKTTRTL-QLFLTL 527


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 28.6 bits (63), Expect = 3.5
 Identities = 27/111 (24%), Positives = 56/111 (50%), Gaps = 2/111 (1%)

Query: 122  REKEISILKKKLKKEKNPEEKEKVKYLVQRMENQMR--EAKQRHEKDQQEKEEHTARIQA 179
             E E +  K +  ++K  E K+K     ++ E + +  EAK++ E+D+++ +E      A
Sbjct: 1357 DEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAA 1416

Query: 180  LKHGEKPLFRKKSEKTMLNLVQKFETLKKTGKINKHIEKKTKALKAKERSQ 230
             K  ++   + + +K      +K E  KK  +  K  E+  KA +AK++++
Sbjct: 1417 KKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 28.5 bits (64), Expect = 4.2
 Identities = 26/169 (15%), Positives = 63/169 (37%), Gaps = 28/169 (16%)

Query: 31  EMGTKLFNK-----------KVFNSKSPSSKFKEHENKN---FKRENKNRPREISSKIPQ 76
           E GTK FN             +FN+ +        E  +      +  +RP+E+  +I +
Sbjct: 754 EEGTKKFNILLQEIAYDVIVSLFNNPNAEKILIITEILSDGINNSDINDRPQELIDQIIE 813

Query: 77  KKFAQQSAPVIPVQKNIPRDPRFDSLCGTFKEKAFHSAYKFIDDIREKEISIL------K 130
            +  +  A  I  ++ + R    + +    ++         + +I+E E  I        
Sbjct: 814 SEEERLKALRIQREEMLMRPEELELI-NEEQKNLKQEIKLELSEIQEAEEEIQNINENKN 872

Query: 131 KKLKKEKNPE-------EKEKVKYLVQRMENQMREAKQRHEKDQQEKEE 172
           + ++ + +P+        K+ +  LV   +   ++ K   +K +  ++ 
Sbjct: 873 EFVEFKNDPKKLNKLIIAKDVLIKLVISSDEIKQDEKTTKKKKKDLEKT 921


>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
           non-fungal.  The approx. 70 residue Med15 domain of the
           ARC-Mediator co-activator is a three-helix bundle with
           marked similarity to the KIX domain. The sterol
           regulatory element binding protein (SREBP) family of
           transcription activators use the ARC105 subunit to
           activate target genes in the regulation of cholesterol
           and fatty acid homeostasis. In addition, Med15 is a
           critical transducer of gene activation signals that
           control early metazoan development.
          Length = 768

 Score = 28.0 bits (62), Expect = 4.9
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 148 LVQRMENQMREAKQRHEKDQQEKEEH 173
           +V ++E  +R+  Q H K+ +E E H
Sbjct: 3   VVSKIEEALRKNGQTHTKNAREMENH 28


>gnl|CDD|233242 TIGR01035, hemA, glutamyl-tRNA reductase.  This enzyme, together
           with glutamate-1-semialdehyde-2,1-aminomutase
           (TIGR00713), leads to the production of
           delta-amino-levulinic acid from Glu-tRNA [Biosynthesis
           of cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 417

 Score = 27.7 bits (62), Expect = 5.3
 Identities = 18/67 (26%), Positives = 34/67 (50%), Gaps = 1/67 (1%)

Query: 115 YKFIDDIREKEISILKKKLKKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHT 174
               + +REKE+    KKL    + + +E ++ L +++ N++  A     K   +KEE  
Sbjct: 346 RSLAEIVREKELEKALKKL-PGLSKDVEEVLEDLARKLINKLLHAPTVRLKQLADKEESE 404

Query: 175 ARIQALK 181
             ++ALK
Sbjct: 405 VCLEALK 411


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 28.0 bits (63), Expect = 5.4
 Identities = 19/77 (24%), Positives = 41/77 (53%), Gaps = 3/77 (3%)

Query: 119 DDIREKEISILKKKLKKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHTARIQ 178
                K+I  + +K +K++  E+KEK K   +    + +E ++  EK+++E+E+     +
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKK---KAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461

Query: 179 ALKHGEKPLFRKKSEKT 195
           A +  E+   +KK + T
Sbjct: 462 AEEEKEEEEEKKKKQAT 478


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
           consists of several mammalian specific proacrosin
           binding protein sp32 sequences. sp32 is a sperm specific
           protein which is known to bind with with 55- and 53-kDa
           proacrosins and the 49-kDa acrosin intermediate. The
           exact function of sp32 is unclear, it is thought however
           that the binding of sp32 to proacrosin may be involved
           in packaging the acrosin zymogen into the acrosomal
           matrix.
          Length = 243

 Score = 27.3 bits (60), Expect = 5.8
 Identities = 23/113 (20%), Positives = 47/113 (41%), Gaps = 5/113 (4%)

Query: 60  KRENKNRPREISSKIPQKKFAQQSAPVIPVQKNIPRDPRFDSLCGTFKEKAFHSAYKFID 119
           KR   ++P  I S     K A+ SA V P    +P            + ++F    + + 
Sbjct: 119 KRVPCSQPVSILSPN-TLKEAEPSAEVQPTTMTLPIAEHPTIT----ENQSFQPWPERLH 173

Query: 120 DIREKEISILKKKLKKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEE 172
           +  E+ +          +    K K + L+ +++  ++E K   ++ Q+E+EE
Sbjct: 174 NNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQEHKTEEKQPQEEQEE 226


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 27.5 bits (62), Expect = 6.2
 Identities = 12/55 (21%), Positives = 30/55 (54%), Gaps = 2/55 (3%)

Query: 134 KKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHTARIQALKHGEKPLF 188
           + E   E   +++  ++++E ++R+ +  +EK ++E E   + ++ LK    PL 
Sbjct: 2   RLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLK--SPPLI 54


>gnl|CDD|188618 cd09898, H3TH_53EXO, H3TH domain of the 5'-3' exonuclease of Taq
           DNA polymerase I and homologs.  H3TH
           (helix-3-turn-helix) domains of the 5'-3' exonuclease
           (53EXO) of mutli-domain DNA polymerase I and single
           domain protein homologs are included in this family. Taq
           DNA polymerase I contains a polymerase domain for
           synthesizing a new DNA strand and a 53EXO domain for
           cleaving RNA primers or damaged DNA strands. Taq's 53EXO
           recognizes and endonucleolytically cleaves a
           structure-specific DNA substrate that has a bifurcated
           downstream duplex and an upstream template-primer duplex
           that overlaps the downstream duplex by 1 bp. The 53EXO
           cleaves the unpaired 5'-arm of the overlap flap DNA
           substrate. 5'-3' exonucleases are members of the
           structure-specific, 5' nuclease family that catalyzes
           hydrolysis of DNA duplex-containing nucleic acid
           structures during DNA replication, repair, and
           recombination. These nucleases contain a PIN (PilT N
           terminus) domain with a helical arch/clamp region/I
           domain (not included here) and inserted within the PIN
           domain is an atypical helix-hairpin-helix-2 (HhH2)-like
           region. This atypical HhH2 region, the H3TH domain, has
           an extended loop with at least three turns between the
           first two helices, and only three of the four helices
           appear to be conserved. Both the H3TH domain and the
           helical arch/clamp region are involved in DNA binding.
           Studies suggest that a glycine-rich loop in the H3TH
           domain contacts the phosphate backbone of the template
           strand in the downstream DNA duplex. The nucleases
           within this family have a carboxylate rich active site
           that is involved in binding essential divalent metal ion
           cofactors (i. e., Mg2+ or Mn2+ or Zn2+) required for
           nuclease activity. The first metal binding site is
           composed entirely of Asp/Glu residues from the PIN
           domain, whereas, the second metal binding site is
           composed generally of two Asp residues from the PIN
           domain and two Asp residues from the H3TH domain.
           Together with the helical arch and network of amino
           acids interacting with metal binding ions, the H3TH
           region defines a positively charged active-site
           DNA-binding groove in structure-specific 5' nucleases.
          Length = 73

 Score = 25.8 bits (58), Expect = 6.2
 Identities = 8/34 (23%), Positives = 18/34 (52%), Gaps = 3/34 (8%)

Query: 193 EKTMLNLVQKFETLKKTGKINKHIEKKTKALKAK 226
            KT   L+Q++ +L+    I  ++++    L+ K
Sbjct: 26  PKTAAKLLQEYGSLE---NILANLDELKGKLREK 56


>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 220 and
           257 amino acids in length.
          Length = 221

 Score = 27.2 bits (61), Expect = 6.2
 Identities = 14/20 (70%), Positives = 15/20 (75%)

Query: 123 EKEISILKKKLKKEKNPEEK 142
           EKEI+ LKKKLKKEK    K
Sbjct: 188 EKEIAKLKKKLKKEKQFNRK 207


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 27.5 bits (62), Expect = 7.0
 Identities = 13/40 (32%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 118 IDDIREKEISILKKKLKKEKNPEE-KEKVKYLVQRMENQM 156
            DD  + E+  L++K++K K P+E KEK +  ++++E   
Sbjct: 244 EDDDDKDEVEELREKIEKLKLPKEAKEKAEKELKKLETMS 283


>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
           component YidC; Validated.
          Length = 429

 Score = 27.5 bits (61), Expect = 7.3
 Identities = 24/107 (22%), Positives = 40/107 (37%), Gaps = 8/107 (7%)

Query: 125 EISILKKKLKKEKNPEEKEKVKYLVQRMENQMREA-KQRHEKDQQEKEEHTARIQALKHG 183
           E + +KK    EKN E K + K + Q+     RE  ++  ++         AR  A+K  
Sbjct: 328 ENAEIKKTRTAEKN-EAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAK 386

Query: 184 EKPLFRKKSEKTMLNLVQKFETLKKTGKINKHIEKKTKALKAKERSQ 230
           +K L                   +++      +E  T A   +E SQ
Sbjct: 387 KKGL------IDASPNEDTPSENEESKGSPPQVEATTTAEPNREPSQ 427


>gnl|CDD|237053 PRK12315, PRK12315, 1-deoxy-D-xylulose-5-phosphate synthase;
          Provisional.
          Length = 581

 Score = 27.3 bits (61), Expect = 7.5
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query: 9  ERVAIREELKGLTFEQLQKLKEEMGTKLFNK 39
          E++    +LK L+ ++L++L  E+ T L  K
Sbjct: 4  EKINSPADLKKLSLDELEQLASEIRTALLEK 34


>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein.  This family
           consists of proteins over 500 amino acids long in
           Caenorhabditis elegans and several bacteria (Pseudomonas
           aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
           etc.). The function is unknown.
          Length = 525

 Score = 27.4 bits (61), Expect = 8.0
 Identities = 17/80 (21%), Positives = 36/80 (45%), Gaps = 5/80 (6%)

Query: 116 KFIDDIREKEISILKKKLKKEKNPEEKEK-----VKYLVQRMENQMREAKQRHEKDQQEK 170
           KF++DIRE     +K   K+ +   ++           ++R+  + REA++R  + +++ 
Sbjct: 102 KFLEDIREGLTEPIKDSAKRNEPDLKEWFQAFDFNGSEIERLLTEDREAERRIRELEKQL 161

Query: 171 EEHTARIQALKHGEKPLFRK 190
            E    + AL  G+      
Sbjct: 162 SELQNELNALLTGKSQRSHT 181


>gnl|CDD|216312 pfam01125, G10, G10 protein. 
          Length = 145

 Score = 26.5 bits (59), Expect = 8.5
 Identities = 12/52 (23%), Positives = 26/52 (50%), Gaps = 1/52 (1%)

Query: 131 KKLKKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHTARIQALKH 182
           +  + +K PE  +K++  +   E +MR+A +    + + K+E    I  + H
Sbjct: 5   RTSRTKKPPEGFDKIEPTLDEFEAKMRDA-ENEPHEGKRKKELLWPIFRIHH 55


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 27.0 bits (59), Expect = 8.8
 Identities = 21/65 (32%), Positives = 42/65 (64%), Gaps = 8/65 (12%)

Query: 119 DDIREKEISILKKKLKKEKNPEEKEKVKYLVQRMENQMREAK----QRHEKDQQEKEEHT 174
           DDI++K+  I++++ + EK  +E+E+++   +RME + RE K    +  E+ ++E+EE T
Sbjct: 113 DDIKKKKSLIIRQE-QIEKARQEREELE---ERMEWERREEKIDEREDQEEQEREREEQT 168

Query: 175 ARIQA 179
              Q+
Sbjct: 169 IEEQS 173


>gnl|CDD|181462 PRK08537, PRK08537, 2-oxoglutarate ferredoxin oxidoreductase
           subunit gamma; Validated.
          Length = 177

 Score = 26.6 bits (59), Expect = 8.8
 Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 112 HSAY-KFIDDIREKEISILKKKLKKEKNPEEKEKVKY 147
             AY K++DD++E    I+   L   +  E ++KVK 
Sbjct: 76  QEAYDKYLDDLKEGGTVIVDPDLVPIREIEYEKKVKV 112


>gnl|CDD|218734 pfam05758, Ycf1, Ycf1.  The chloroplast genomes of most higher
           plants contain two giant open reading frames designated
           ycf1 and ycf2. Although the function of Ycf1 is unknown,
           it is known to be an essential gene.
          Length = 832

 Score = 27.3 bits (61), Expect = 9.9
 Identities = 17/81 (20%), Positives = 29/81 (35%), Gaps = 5/81 (6%)

Query: 126 ISILKKKLKKEKNPEEKEKVKYLVQRMENQMREAKQRHEKDQQEKEEHTARIQALKHGEK 185
                KKLK+    EE+E+   +     ++ +  KQ      QE           +  E 
Sbjct: 219 SPFFTKKLKETSETEEREEETDVEIETTSETKGTKQE-----QEGSTEEDPSLFSEEKED 273

Query: 186 PLFRKKSEKTMLNLVQKFETL 206
           P   +  +K  +   +K E L
Sbjct: 274 PDKTEDLDKLEILKEKKDEEL 294


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.127    0.339 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,744,732
Number of extensions: 1114036
Number of successful extensions: 2951
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2611
Number of HSP's successfully gapped: 484
Length of query: 235
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 141
Effective length of database: 6,768,326
Effective search space: 954333966
Effective search space used: 954333966
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.8 bits)