RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7416
         (381 letters)



>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 181 to 207 amino acids in length. This domain
           has two conserved sequence motifs: PIP and CEICG. The
           domain carries a zinc-finger domain of the C2H2-type.
          Length = 187

 Score =  282 bits (723), Expect = 2e-95
 Identities = 109/184 (59%), Positives = 136/184 (73%), Gaps = 9/184 (4%)

Query: 176 LEAQIYKLAEQVSEQRSATKDNVQRKQARGEGER---------GDSEESEAGDTSDEDDE 226
           LE +I KLAE + ++R  T +NV+RKQA  E ER           SE+    D+ D+++E
Sbjct: 3   LEFKIEKLAELLKKEREDTIENVERKQALTEEERQASADESSEDASEDGSDDDSDDDEEE 62

Query: 227 VPYNPKNLPLGWDGKPIPYWLYKLHGLNISYTCEICGNFVYKGPKAFQRHFAEWRHAHGM 286
             YNP NLPLGWDGKPIPYWLYKLHGL   + CEICGN  YKG KAF++HF+EWRHA+G+
Sbjct: 63  PIYNPLNLPLGWDGKPIPYWLYKLHGLGKEFKCEICGNTSYKGRKAFEKHFSEWRHAYGL 122

Query: 287 RCLGIPNTAHFANVTQIEDALALWEKLKAQKQEERWQPEQEEEFEDSLGNVVNRKTYEDL 346
           +CLGIPNT HF  +T IE+AL LWEKLK Q + +R++PE EEE ED  GNV+++KTYEDL
Sbjct: 123 KCLGIPNTRHFKEITSIEEALELWEKLKKQTKRKRFEPENEEEVEDEEGNVMSKKTYEDL 182

Query: 347 KRQG 350
           KRQG
Sbjct: 183 KRQG 186



 Score = 50.3 bits (121), Expect = 2e-07
 Identities = 21/56 (37%), Positives = 34/56 (60%)

Query: 46  LEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDDEKEAGS 101
           LE +I KLAE + ++R  T +NV+RKQA  E ER  S +  + D S++  + ++  
Sbjct: 3   LEFKIEKLAELLKKEREDTIENVERKQALTEEERQASADESSEDASEDGSDDDSDD 58


>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
           modification].
          Length = 470

 Score =  161 bits (408), Expect = 4e-45
 Identities = 75/204 (36%), Positives = 108/204 (52%), Gaps = 18/204 (8%)

Query: 165 KQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDS------------ 212
           K+ + ++E    E  +++  + + +  S T+  V R  A    ER               
Sbjct: 265 KEGQGKEEFVYSEYVLHRYLKYLGDPVSETRSLVLRSLAITAKERKAEISLLSRRKKQPA 324

Query: 213 ------EESEAGDTSDEDDEVPYNPKNLPLGWDGKPIPYWLYKLHGLNISYTCEICGNFV 266
                 E +E  D    D+ V     ++PLG DG P+P WL KLHGL+I + CEICGN+V
Sbjct: 325 TKSSEKEGAEQVDGEQRDEHVSGKSFDMPLGPDGLPMPRWLCKLHGLDIEFECEICGNYV 384

Query: 267 YKGPKAFQRHFAEWRHAHGMRCLGIPNTAHFANVTQIEDALALWEKLKAQKQEERWQPEQ 326
           Y G   F RHF E RH +G+ CLGI  +  F  +T+I +A+ LW +++      +   E 
Sbjct: 385 YYGRDRFDRHFEEDRHIYGLECLGIKPSRVFKGITRIGEAMKLWNRMEESSSSLKVPTEY 444

Query: 327 EEEFEDSLGNVVNRKTYEDLKRQG 350
            EEFED  GNV+++K YEDLKRQG
Sbjct: 445 SEEFEDEEGNVMSKKVYEDLKRQG 468


>gnl|CDD|205477 pfam13297, Telomere_Sde2_2, Telomere stability C-terminal.  This
           short C-terminal domain is found in higher eukaryotes
           further downstream from the Sde2 family, pfam13019. It
           is found in all Sde2-related proteins except those from
           fission yeast, fly, and mosquito. Its exact function in
           telomere formation and maintenance has not yet been
           established.
          Length = 60

 Score = 67.2 bits (165), Expect = 2e-14
 Identities = 33/60 (55%), Positives = 35/60 (58%), Gaps = 14/60 (23%)

Query: 110 LDLSAFSSWEELASLGLDRLK------------TLEERAQRLFSTKGKTA--LDPSLLAK 155
           LDL AFSS EEL +LGLDRLK            TL+ERA RLFS KG     LD  L AK
Sbjct: 1   LDLDAFSSAEELEALGLDRLKAALMALGLKCGGTLQERAARLFSLKGLPLEELDKKLFAK 60



 Score = 40.6 bits (96), Expect = 4e-05
 Identities = 16/27 (59%), Positives = 17/27 (62%), Gaps = 2/27 (7%)

Query: 1  TLEERAQRLFSTKGKTA--LDPSLLAK 25
          TL+ERA RLFS KG     LD  L AK
Sbjct: 34 TLQERAARLFSLKGLPLEELDKKLFAK 60


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 39.6 bits (92), Expect = 0.003
 Identities = 38/194 (19%), Positives = 72/194 (37%), Gaps = 5/194 (2%)

Query: 36  QQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDD 95
             ER+ E  G      K A+   E  +   ++    +A G  + G+ E  E G+  +++ 
Sbjct: 686 PAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEG 745

Query: 96  EKEA-GSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLLA 154
           E EA G            +      E      +     E +A      KG    +  +  
Sbjct: 746 EGEAEGKHEVETEGDRKETEHEGETEAEGKEDE--DEGEIQAGEDGEMKGDEGAEGKVEH 803

Query: 155 KSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQ--VSEQRSATKDNVQRKQARGEGERGDS 212
           + + + G K + + + +  A       +  EQ   +E +   K + +     G  + GDS
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863

Query: 213 EESEAGDTSDEDDE 226
           EE E  +  +E++E
Sbjct: 864 EEEEEEEEEEEEEE 877



 Score = 34.2 bits (78), Expect = 0.12
 Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 2/88 (2%)

Query: 13  KGKTALDPSLLAKSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQ--VSEQRSATKDNVQR 70
           KG    +  +  + + + G K + + + +  A       +  EQ   +E +   K + + 
Sbjct: 792 KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKG 851

Query: 71  KQARGEGERGDSEESEAGDTSDEDDEKE 98
               G  + GDSEE E  +  +E++E+E
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEE 879


>gnl|CDD|131925 TIGR02878, spore_ypjB, sporulation protein YpjB.  Members of this
           protein, YpjB, family are restricted to a subset of
           endospore-forming bacteria, including Bacillus species
           but not CLostridium or some others. In Bacillus
           subtilis, ypjB was found to be part of the sigma-E
           regulon, where sigma-E is a sporulation sigma factor
           that regulates expression in the mother cell
           compartment. Null mutants of ypjB show a sporulation
           defect. This protein family is not, however, a part of
           the endospore formation minimal gene set [Cellular
           processes, Sporulation and germination].
          Length = 233

 Score = 34.4 bits (79), Expect = 0.061
 Identities = 13/52 (25%), Positives = 21/52 (40%), Gaps = 1/52 (1%)

Query: 103 LTHVGAHLD-LSAFSSWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLL 153
           +  V +HL  L  F   +       ++L  +    + LF    +   DPSLL
Sbjct: 150 VQRVDSHLSYLENFRFQQRSEDEKEEQLSLMRGDLKALFDGVKEDEADPSLL 201


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 33.9 bits (78), Expect = 0.15
 Identities = 32/186 (17%), Positives = 65/186 (34%), Gaps = 21/186 (11%)

Query: 34  SKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDE 93
           S   ER++EI  LE +I +L E+++E            + R E E  + E  +     +E
Sbjct: 670 SSILERRREIEELEEKIEELEEKIAELE------KALAELRKELEELEEELEQLRKELEE 723

Query: 94  DDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLL 153
              +             DL+   +  E     + +L       +          L+  L 
Sbjct: 724 LSRQ-------ISALRKDLARLEAEVEQLEERIAQLSKELTELE-----AEIEELEERLE 771

Query: 154 AKSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQR---SATKDNVQRKQARGEGERG 210
              +     +++ +E + +I  L+ ++  L E + E R   +   +     + R E    
Sbjct: 772 EAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLER 831

Query: 211 DSEESE 216
               +E
Sbjct: 832 RIAATE 837



 Score = 32.7 bits (75), Expect = 0.39
 Identities = 38/197 (19%), Positives = 81/197 (41%), Gaps = 12/197 (6%)

Query: 30  KGLKSKQQERQKEIAGLEAQIYKLAEQVSEQR---SATKDNVQRKQARGEGE--RGDSEE 84
              +++ +E + +I  L+ ++  L E + E R   +   +     + R E    R  + E
Sbjct: 778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATE 837

Query: 85  SEAGDTSDE-DDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFSTK 143
               D  ++ ++  E    L      L+        EL +L L+   +LEE    L S  
Sbjct: 838 RRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEAL-LNERASLEEALALLRSEL 896

Query: 144 GK--TALDPSLLAKSKPDKGLKSKQQERQK---EIAGLEAQIYKLAEQVSEQRSATKDNV 198
            +    L      +S+  + L+  +++  +    + GLE +I  L E++SE+ S T +  
Sbjct: 897 EELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEA 956

Query: 199 QRKQARGEGERGDSEES 215
           +  + + E +  ++   
Sbjct: 957 EALENKIEDDEEEARRR 973



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 44/214 (20%), Positives = 92/214 (42%), Gaps = 17/214 (7%)

Query: 2   LEERAQRLFSTKGKTA-LDPSLLAKSKPDKGLKSKQQERQKEIAGLEAQIYKLAE---QV 57
           +EE  + L++   + + L+       +    L+ + +E + ++  LE+++ +LAE   ++
Sbjct: 283 IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAEL 342

Query: 58  SEQRSATKDNVQRKQARGEGERGDSEESEAG-DTSDEDDEKEAGSVLTHVGAHLDLSAFS 116
            E+    K+ ++  +A  E    + EE E+  +  +E  E     V         L+   
Sbjct: 343 EEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNN-- 400

Query: 117 SWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLLAKSKPDKGLKSKQQER---QKEI 173
              E+  L   RL+ LE+R +RL     +           +    L+  ++E    Q+E+
Sbjct: 401 ---EIERLE-ARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEEL 456

Query: 174 AGLEAQIYKLAEQVSEQRSA---TKDNVQRKQAR 204
             LE  + +L E++ E   A    +  + + QAR
Sbjct: 457 ERLEEALEELREELEEAEQALDAAERELAQLQAR 490


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 33.8 bits (78), Expect = 0.17
 Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 7/52 (13%)

Query: 29  DKGLKSKQQERQKEIAGLEAQIYKLAEQV-------SEQRSATKDNVQRKQA 73
           DK  K K +E  ++I  L+ +I  + +Q+        EQR    +N+ RKQ 
Sbjct: 169 DKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQN 220



 Score = 33.8 bits (78), Expect = 0.17
 Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 7/52 (13%)

Query: 159 DKGLKSKQQERQKEIAGLEAQIYKLAEQV-------SEQRSATKDNVQRKQA 203
           DK  K K +E  ++I  L+ +I  + +Q+        EQR    +N+ RKQ 
Sbjct: 169 DKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQN 220


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 32.7 bits (75), Expect = 0.33
 Identities = 41/201 (20%), Positives = 75/201 (37%), Gaps = 16/201 (7%)

Query: 32  LKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSE--ESEAGD 89
           L  +Q+ERQ+    L  Q+  L +Q+ E R      +    A+   +R + E  E + G 
Sbjct: 279 LAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGA 338

Query: 90  TSDEDDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFST--KGKTA 147
             D D E+             DL    S          RL  L  + Q +     + K  
Sbjct: 339 FEDADIEQ----------LQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQK 388

Query: 148 LDPSLLAK-SKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGE 206
           +   L     K ++ L + ++E+ ++ A +E  +  L  Q+ +Q  A K     ++   E
Sbjct: 389 IKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELE 448

Query: 207 GERGDSEESEAGDTSDEDDEV 227
             R    +      +   +E+
Sbjct: 449 -LRLGRLKQRLDSATATPEEL 468


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.7 bits (75), Expect = 0.40
 Identities = 35/166 (21%), Positives = 68/166 (40%), Gaps = 27/166 (16%)

Query: 37  QERQKEIAGLEAQIYKLAEQVSE---QRSATKDNVQ------------RKQARGEGERGD 81
           +ER++E+  LEA+I  L E ++E   +R    + V+            R     E    D
Sbjct: 247 EERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDD 306

Query: 82  SEESEAGDTSDEDDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFS 141
           ++        +E ++++           +   A +   E      +    LEERA+ L  
Sbjct: 307 ADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESL---REDADDLEERAEEL-- 361

Query: 142 TKGKTALDPSLLAKSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQV 187
            + + A   S L ++      +   ++R++EI  LE +I +L E+ 
Sbjct: 362 -REEAAELESELEEA------REAVEDRREEIEELEEEIEELRERF 400


>gnl|CDD|150291 pfam09577, Spore_YpjB, Sporulation protein YpjB (SpoYpjB).  These
           proteins are found in the endospore-forming bacteria
           which include Bacillus species. In Bacillus subtilis,
           ypjB was found to be part of the sigma-E regulon.
           Sigma-E is a sporulation sigma factor that regulates
           expression in the mother cell compartment. Null mutants
           of ypjB show a sporulation defect, but this gene is not,
           however, a part of the endospore formation minimal gene
           set.
          Length = 232

 Score = 31.2 bits (71), Expect = 0.72
 Identities = 18/52 (34%), Positives = 23/52 (44%), Gaps = 1/52 (1%)

Query: 103 LTHVGAHLD-LSAFSSWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLL 153
           L  V +HL  L  F   +      L+RL  LE   + LF    +   DPSLL
Sbjct: 149 LQRVDSHLSVLENFRFQQLSKKSKLERLTLLEGDLKALFDGVKEDEADPSLL 200


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 42/224 (18%), Positives = 79/224 (35%), Gaps = 26/224 (11%)

Query: 17  ALDPSLLAKSKPDKGLKSKQQERQKEIAGLEAQIYKLAE---QVSEQRSATKDNVQRKQA 73
           A D  L    K    L+ K +E+Q + A LE Q+  L      +  Q   T D++++ + 
Sbjct: 35  ADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRK 94

Query: 74  RGEGERGDSEESEAGDTSDEDDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLE 133
           +           E  +        E  + L   G +   +   S E+  +    RL    
Sbjct: 95  QIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPED--AQRSVRLAIYY 152

Query: 134 -----ERAQRLFSTKG---------------KTALDPSLLAKSKPDKGLKSKQQERQKEI 173
                 RA+R+ + K                +  L   L  +      L    +ER+K +
Sbjct: 153 GALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTL 212

Query: 174 AGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEA 217
           A L ++     ++  E+  A +  ++ + A  E     + E+ A
Sbjct: 213 AQLNSE-LSADQKKLEELRANESRLKNEIASAEAAAAKAREAAA 255


>gnl|CDD|129911 TIGR00831, a_cpa1, Na+/H+ antiporter, bacterial form.  The
           Monovalent Cation:Proton Antiporter-1 (CPA1) Family (TC
           2.A.36) The CPA1 family is a large family of proteins
           derived from Gram-positive and Gram-negative bacteria,
           blue green bacteria, yeast, plants and animals.
           Transporters from eukaryotes have been functionally
           characterized, and all of these catalyze Na+:H+
           exchange. Their primary physiological functions may be
           in (1) cytoplasmic pH regulation, extruding the H+
           generated during metabolism, and (2) salt tolerance (in
           plants), due to Na+ uptake into vacuoles. This model is
           specific for the bacterial members of this family
           [Transport and binding proteins, Cations and iron
           carrying compounds].
          Length = 525

 Score = 30.6 bits (69), Expect = 1.5
 Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 6/70 (8%)

Query: 36  QQERQKEIA-GLEAQIYKLAEQVSEQRSATKDNVQRK-----QARGEGERGDSEESEAGD 89
           ++E ++ IA  + A+  K A   + ++    + V R+      AR E  R D EE     
Sbjct: 414 ERELEEIIARYIAARSAKFALMKAVEQLRIVEPVARELLPELDARIEELRADGEEKIRSG 473

Query: 90  TSDEDDEKEA 99
             +++  + A
Sbjct: 474 MGEKNLRRRA 483


>gnl|CDD|173689 cd05598, STKc_LATS, Catalytic domain of the Protein
           Serine/Threonine Kinase, Large Tumor Suppressor.
           Serine/Threonine Kinases (STKs), Large Tumor Suppressor
           (LATS) subfamily, catalytic (c) domain. STKs catalyze
           the transfer of the gamma-phosphoryl group from ATP to
           serine/threonine residues on protein substrates. The
           LATS subfamily is part of a larger superfamily that
           includes the catalytic domains of other protein STKs,
           protein tyrosine kinases, RIO kinases, aminoglycoside
           phosphotransferase, choline kinase, and phosphoinositide
           3-kinase. LATS was originally identified in Drosophila
           using a screen for genes whose inactivation led to
           overproliferation of cells. In tetrapods, there are two
           LATS isoforms, LATS1 and LATS2. Inactivation of LATS1 in
           mice results in the development of various tumors,
           including sarcomas and ovarian cancer. LATS functions as
           a tumor suppressor and is implicated in cell cycle
           regulation.
          Length = 376

 Score = 29.8 bits (67), Expect = 2.4
 Identities = 11/32 (34%), Positives = 15/32 (46%)

Query: 107 GAHLDLSAFSSWEELASLGLDRLKTLEERAQR 138
           G H    +    EE + +   RLK LE R +R
Sbjct: 163 GDHHRQDSMEPSEEWSEIDRCRLKPLERRRKR 194


>gnl|CDD|235131 PRK03562, PRK03562, glutathione-regulated potassium-efflux system
           protein KefC; Provisional.
          Length = 621

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 13/54 (24%), Positives = 23/54 (42%), Gaps = 2/54 (3%)

Query: 48  AQIYKLAEQVSEQRSA--TKDNVQRKQARGEGERGDSEESEAGDTSDEDDEKEA 99
            +   +A+Q S   S   T+D       +  G +G  E    G+ +DE + K +
Sbjct: 567 KKRAAVAKQTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPS 620



 Score = 29.2 bits (66), Expect = 4.0
 Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 2/53 (3%)

Query: 178 AQIYKLAEQVSEQRSA--TKDNVQRKQARGEGERGDSEESEAGDTSDEDDEVP 228
            +   +A+Q S   S   T+D       +  G +G  E    G+ +DE +  P
Sbjct: 567 KKRAAVAKQTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKP 619


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.1 bits (67), Expect = 2.5
 Identities = 37/189 (19%), Positives = 74/189 (39%), Gaps = 9/189 (4%)

Query: 30   KGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGD 89
            K  ++K+ E  K +A  +A+  K AE+   +        ++K    E ++ +  + +A +
Sbjct: 1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEE 1624

Query: 90   TSDEDDEKEAGSVLTHVGAHLDLSA--FSSWEELASLGLDRLKTLEERAQRLFSTKGKTA 147
                ++EK+    L    A     A      EE   +         E  ++      K  
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684

Query: 148  LDPSLLAKSKPDKGLKSKQQERQK--EIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARG 205
             D     + K  + LK + +E +K  E+   EA+  K AE++ +     K   +  +   
Sbjct: 1685 ED-----EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEA 1739

Query: 206  EGERGDSEE 214
            E ++  +EE
Sbjct: 1740 EEDKKKAEE 1748



 Score = 29.7 bits (66), Expect = 3.2
 Identities = 33/169 (19%), Positives = 68/169 (40%), Gaps = 1/169 (0%)

Query: 35   KQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDED 94
            K +E +K++  L+ +  +  ++  E + A ++N  +     +    D +++E    ++ED
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686

Query: 95   DEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEE-RAQRLFSTKGKTALDPSLL 153
            ++K A ++                E       + LK  EE    +    K +   D    
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746

Query: 154  AKSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQ 202
             ++K D+  K K    +KE      +I K  E V E+    +D  +R +
Sbjct: 1747 EEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRME 1795


>gnl|CDD|221084 pfam11336, DUF3138, Protein of unknown function (DUF3138).  This
           family of proteins with unknown function appear to be
           restricted to Proteobacteria.
          Length = 514

 Score = 29.5 bits (66), Expect = 3.3
 Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 7/64 (10%)

Query: 42  EIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDDEKEAGS 101
           +I  L+AQ+  L +QV+E R+A      +  A G G +  S  + A      D    A +
Sbjct: 26  QIKALQAQLTALQQQVNELRAALA---AKPAAAGGGAKIQSAAAAAAAAPSSD----AAA 78

Query: 102 VLTH 105
            LT+
Sbjct: 79  ALTN 82



 Score = 28.3 bits (63), Expect = 8.1
 Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 3/53 (5%)

Query: 172 EIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDED 224
           +I  L+AQ+  L +QV+E R+A      +  A G G +  S  + A      D
Sbjct: 26  QIKALQAQLTALQQQVNELRAALA---AKPAAAGGGAKIQSAAAAAAAAPSSD 75


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 29.4 bits (66), Expect = 3.8
 Identities = 12/61 (19%), Positives = 30/61 (49%), Gaps = 1/61 (1%)

Query: 38  ERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDDEK 97
           +RQ+ +     ++    +++ E +  T  + + + A+ + E+   EE+E       D++K
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAE-AKAKEEKLKQEENEKKQKEQADEDK 424

Query: 98  E 98
           E
Sbjct: 425 E 425


>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein.  This family includes FlxA from
           E. coli. The expression of FlxA is regulated by the FliA
           sigma factor, a transcription factor specific for class
           3 flagellar operons. However FlxA is not required for
           flagellar function or formation.
          Length = 106

 Score = 27.8 bits (62), Expect = 4.0
 Identities = 15/65 (23%), Positives = 31/65 (47%), Gaps = 3/65 (4%)

Query: 25  KSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRK---QARGEGERGD 81
           +    +  + +QQ  Q++I  L+AQI +L +Q SE+    + +       Q   EG    
Sbjct: 42  EDLTAEEKQKQQQLIQQQIQMLQAQIAQLQQQQSEEAQQQQQSSIEDSPSQTSAEGVNRP 101

Query: 82  SEESE 86
           + +++
Sbjct: 102 TADNQ 106



 Score = 27.8 bits (62), Expect = 4.0
 Identities = 15/65 (23%), Positives = 31/65 (47%), Gaps = 3/65 (4%)

Query: 155 KSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRK---QARGEGERGD 211
           +    +  + +QQ  Q++I  L+AQI +L +Q SE+    + +       Q   EG    
Sbjct: 42  EDLTAEEKQKQQQLIQQQIQMLQAQIAQLQQQQSEEAQQQQQSSIEDSPSQTSAEGVNRP 101

Query: 212 SEESE 216
           + +++
Sbjct: 102 TADNQ 106


>gnl|CDD|223426 COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and
           biogenesis].
          Length = 361

 Score = 28.8 bits (65), Expect = 5.2
 Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 4/46 (8%)

Query: 307 LALWEKLKAQKQEERWQPEQEEEFEDSLGNVVNRKTYEDLKRQGWQ 352
           L L++KL  +   E      E+EF      +  R+TY+ L    W+
Sbjct: 156 LPLYDKLTEELAREGRLEWAEDEFR----LLPTRRTYKVLPEDAWR 197


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 27.7 bits (62), Expect = 5.8
 Identities = 10/45 (22%), Positives = 17/45 (37%)

Query: 190 QRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDDEVPYNPKNL 234
            R   K N   +     G+  D +E +  D   + ++   NP  L
Sbjct: 35  SRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDENPYKL 79


>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
          Length = 880

 Score = 28.5 bits (65), Expect = 7.9
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 32  LKSKQQERQKEIAGLEAQIYKLAEQ 56
           LK    E  + +A LE +IY+LA +
Sbjct: 505 LKELSAELAQRLAELEQEIYELAGE 529



 Score = 28.5 bits (65), Expect = 7.9
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 162 LKSKQQERQKEIAGLEAQIYKLAEQ 186
           LK    E  + +A LE +IY+LA +
Sbjct: 505 LKELSAELAQRLAELEQEIYELAGE 529


>gnl|CDD|234856 PRK00885, PRK00885, phosphoribosylamine--glycine ligase;
           Provisional.
          Length = 420

 Score = 28.1 bits (64), Expect = 8.3
 Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 9/47 (19%)

Query: 269 GPKAFQRHFAEWRHAHGMRCLGIPNTAHFANVTQIEDALALWEKLKA 315
           G KAF + F   R+       GIP TA +   T  E+ALA  ++  A
Sbjct: 101 GSKAFAKDFMA-RY-------GIP-TAAYETFTDAEEALAYLDEKGA 138


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 27.9 bits (62), Expect = 9.2
 Identities = 16/72 (22%), Positives = 37/72 (51%)

Query: 33  KSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSD 92
            SKQ ++++  A +E ++ +L E ++      + +++++Q   E    DS++ +      
Sbjct: 207 TSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDS 266

Query: 93  EDDEKEAGSVLT 104
           ED E +  S +T
Sbjct: 267 EDGETKPESYIT 278


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 28.2 bits (63), Expect = 9.5
 Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 1/54 (1%)

Query: 308 ALWEKLKAQKQEERWQPEQEEEFEDSLGNVVNRKTYEDLKRQGWQPEQEEEFED 361
           ALWE L  QK  +  + EQ+    D+   +       +   +  + E+EEE ED
Sbjct: 351 ALWETLNKQKWTKAKETEQDY-ILDAFSALEIEDANTERDDEEEEDEEEEEEED 403


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 28.1 bits (63), Expect = 9.7
 Identities = 27/89 (30%), Positives = 46/89 (51%), Gaps = 2/89 (2%)

Query: 130 KTLEERAQRLFSTKGKTALDPSLLAKSKPDKG-LKSKQQERQKEIAGLEAQIYKLAEQVS 188
           +T EE A +    + K +   +LL + +  +  L+  + ER+K + GLE+ + K  +Q+S
Sbjct: 177 QTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSLQKDQQQLS 236

Query: 189 EQRSATKDNVQRKQARGEGERGDSEESEA 217
           E R A +  ++   AR E E     E EA
Sbjct: 237 ELR-ANESRLRDSIARAEREAKARAEREA 264


>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
           This is a family of fungal proteins of unknown function.
          Length = 182

 Score = 27.3 bits (61), Expect = 9.7
 Identities = 15/70 (21%), Positives = 33/70 (47%), Gaps = 9/70 (12%)

Query: 30  KGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGD 89
              K K++E  +EI  ++ +  +  +   +++ + K   + K         D ++ +  D
Sbjct: 58  TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDK---------DKKDDKKDD 108

Query: 90  TSDEDDEKEA 99
            S++ DEKEA
Sbjct: 109 KSEKKDEKEA 118


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.128    0.362 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,201,493
Number of extensions: 1842098
Number of successful extensions: 2791
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2628
Number of HSP's successfully gapped: 187
Length of query: 381
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 282
Effective length of database: 6,546,556
Effective search space: 1846128792
Effective search space used: 1846128792
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 60 (26.8 bits)