RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17674
         (302 letters)



>gnl|CDD|216339 pfam01167, Tub, Tub family. 
          Length = 243

 Score =  241 bits (617), Expect = 1e-79
 Identities = 85/128 (66%), Positives = 106/128 (82%)

Query: 169 DTNVLGFKGPRKMTVLIPGMTQSYERVSITPLEISETLIECWNSKNMDNIIELHNKTPTW 228
           +TNVLGF+GPRKMTV++PG+  S+ERV I PL   E+L+  W +KN ++++ LHNK P W
Sbjct: 116 ETNVLGFRGPRKMTVIMPGIPSSHERVPIQPLNDQESLLSRWQNKNKESLLVLHNKAPRW 175

Query: 229 NEDTQSYVLNFHGRVTQASVKNFQIIHDNDPEYIVLQFGRTSEDVFTMDYRYPLCPLQAF 288
           N+  QSYVLNFHGRVTQASVKNFQI+H +DP+YIVLQFGR ++D+FTMDYRYPL   QAF
Sbjct: 176 NDQLQSYVLNFHGRVTQASVKNFQIVHASDPDYIVLQFGRVAKDMFTMDYRYPLSAFQAF 235

Query: 289 SIALSSFD 296
           +I LSSFD
Sbjct: 236 AICLSSFD 243



 Score = 66.3 bits (162), Expect = 1e-12
 Identities = 23/38 (60%), Positives = 31/38 (81%)

Query: 131 PAAQKVSYKCRITRDRKGVDRGLYPTYFLHLERDYGKK 168
           PA +  + +CRITRD+ G+D GL+PTY+LHLER+ GKK
Sbjct: 1   PAPRGGTVQCRITRDKSGMDYGLFPTYYLHLERENGKK 38


>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
           Transcription initiation factor IIA (TFIIA) is a
           heterotrimer, the three subunits being known as alpha,
           beta, and gamma, in order of molecular weight. The N and
           C-terminal domains of the gamma subunit are represented
           in pfam02268 and pfam02751, respectively. This family
           represents the precursor that yields both the alpha and
           beta subunits. The TFIIA heterotrimer is an essential
           general transcription initiation factor for the
           expression of genes transcribed by RNA polymerase II.
           Together with TFIID, TFIIA binds to the promoter region;
           this is the first step in the formation of a
           pre-initiation complex (PIC). Binding of the rest of the
           transcription machinery follows this step. After
           initiation, the PIC does not completely dissociate from
           the promoter. Some components, including TFIIA, remain
           attached and re-initiate a subsequent round of
           transcription.
          Length = 332

 Score = 33.6 bits (77), Expect = 0.12
 Identities = 13/25 (52%), Positives = 15/25 (60%)

Query: 67  PDRSSQDDEDDDDDDEDDIEQELSD 91
            D   + D  DDDDDED IE +L D
Sbjct: 247 IDSDDEGDGSDDDDDEDAIESDLDD 271


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 32.0 bits (73), Expect = 0.27
 Identities = 10/27 (37%), Positives = 12/27 (44%)

Query: 63  GSGYPDRSSQDDEDDDDDDEDDIEQEL 89
            S     S   D+D DDDD +D    L
Sbjct: 121 DSDSSSDSDSSDDDSDDDDSEDETAAL 147



 Score = 30.8 bits (70), Expect = 0.65
 Identities = 14/37 (37%), Positives = 18/37 (48%)

Query: 52  RTQASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
           R +   D  +  +   D SS  D  DDD D+DD E E
Sbjct: 107 RKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDE 143



 Score = 30.8 bits (70), Expect = 0.74
 Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 60  SDYGSGYPDRSSQDDEDDDDDDEDDIEQ---ELSDI 92
           +D      D  S DD+ DDDD ED+      EL  I
Sbjct: 119 ADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKI 154


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 31.1 bits (71), Expect = 0.36
 Identities = 8/21 (38%), Positives = 16/21 (76%)

Query: 68  DRSSQDDEDDDDDDEDDIEQE 88
           D   +++EDD++DD++D  +E
Sbjct: 117 DEEDEEEEDDEEDDDEDESEE 137



 Score = 30.4 bits (69), Expect = 0.55
 Identities = 8/31 (25%), Positives = 18/31 (58%)

Query: 63  GSGYPDRSSQDDEDDDDDDEDDIEQELSDIS 93
                D   +D+E++DD+++DD ++   + S
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEES 140



 Score = 28.4 bits (64), Expect = 2.7
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 73  DDEDDDDDDEDDIEQELSDISLH 95
           DD+++D+++EDD E +  D S  
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEE 137



 Score = 28.1 bits (63), Expect = 3.6
 Identities = 7/30 (23%), Positives = 17/30 (56%)

Query: 59  TSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
             D      +   ++D+++DDD+++  E+E
Sbjct: 110 EEDESDDDEEDEEEEDDEEDDDEDESEEEE 139


>gnl|CDD|225618 COG3076, COG3076, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 135

 Score = 30.6 bits (69), Expect = 0.51
 Identities = 11/23 (47%), Positives = 19/23 (82%)

Query: 62  YGSGYPDRSSQDDEDDDDDDEDD 84
           +G+ + D +++DD+DD+D DEDD
Sbjct: 108 WGTYFEDPNAEDDDDDEDVDEDD 130


>gnl|CDD|184209 PRK13650, cbiO, cobalt transporter ATP-binding subunit;
           Provisional.
          Length = 279

 Score = 30.9 bits (70), Expect = 0.63
 Identities = 14/29 (48%), Positives = 17/29 (58%), Gaps = 3/29 (10%)

Query: 215 MDNIIELHNKTPTWNEDTQSYVLN---FH 240
           M NIIE+ N T  + ED + Y LN   FH
Sbjct: 1   MSNIIEVKNLTFKYKEDQEKYTLNDVSFH 29


>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457).  This is
           a family of uncharacterized proteins.
          Length = 449

 Score = 31.1 bits (70), Expect = 0.72
 Identities = 16/41 (39%), Positives = 21/41 (51%)

Query: 68  DRSSQDDEDDDDDDEDDIEQELSDISLHRGSHVISDPDEIN 108
           D    DDED+DDDD+DD E +  +       H  S  D+ N
Sbjct: 59  DDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGN 99


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 31.1 bits (70), Expect = 0.74
 Identities = 12/61 (19%), Positives = 20/61 (32%)

Query: 50  FTRTQASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQELSDISLHRGSHVISDPDEINS 109
           + +  +S    D+        S DDE    +D D+ E    D +        +  DE   
Sbjct: 252 YKQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDEDCD 311

Query: 110 G 110
            
Sbjct: 312 N 312


>gnl|CDD|223507 COG0430, RCL1, RNA 3'-terminal phosphate cyclase [RNA processing
           and modification].
          Length = 341

 Score = 29.9 bits (68), Expect = 1.7
 Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 20/66 (30%)

Query: 90  SDISLHRGSHVISDPDEINSGSLPIEMQGEVVGNLERFVLEPAAQKVSYKCRITRDRKGV 149
           S I++  G+ V   P        P++          R V  P  +K+  +C +   +   
Sbjct: 114 SRITVTGGTDVPWAP--------PVD--------YIRRVTLPVLRKMGIECELEVLK--- 154

Query: 150 DRGLYP 155
            RG YP
Sbjct: 155 -RGFYP 159


>gnl|CDD|223032 PHA03283, PHA03283, envelope glycoprotein E; Provisional.
          Length = 542

 Score = 29.5 bits (66), Expect = 2.2
 Identities = 15/71 (21%), Positives = 24/71 (33%), Gaps = 3/71 (4%)

Query: 44  YNGPLQFTRTQASTDTSDYGSGYPDRSSQDDEDDDDD--DEDDIEQELSDISLHRGSHVI 101
           Y     F     S  ++D      +R + D +D  D   DE+         +   G +V 
Sbjct: 433 YEVLNPFETVYTSVPSNDPEVLVFERLASDSDDSFDSSSDEELEPPPPPGPAGQLGPYVS 492

Query: 102 SDP-DEINSGS 111
            D  D  +  S
Sbjct: 493 LDAEDAFSGRS 503


>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein.  This family consists of several
           eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
           to suppress GCR2 and is highly expressed in the muscle
           and heart. The function of this family is unknown
           although it has been speculated that SGT1 may be
           functionally analogous to the Gcr2p protein of
           Saccharomyces cerevisiae which is known to be a
           regulatory factor of glycolytic gene expression.
          Length = 557

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 61  DYGSGYPDRSSQDDEDDDDDDEDDIEQELSDISL 94
           +  +  PD S   DEDDD+DD++D E   SD +L
Sbjct: 476 EIDNDLPDDSDDADEDDDEDDDED-EDSSSDSTL 508


>gnl|CDD|238761 cd01484, E1-2_like, Ubiquitin activating enzyme (E1), repeat
           2-like. E1, a highly conserved small protein present
           universally in eukaryotic cells, is part of cascade to
           attach ubiquitin (Ub) covalently to substrate proteins.
           This cascade consists of activating (E1), conjugating
           (E2), and/or ligating (E3) enzymes and then targets them
           for degradation by the 26S proteasome. E1 activates
           ubiquitin by C-terminal adenylation, and subsequently
           forms a highly reactive thioester bond between its
           catalytic cysteine and ubiquitin's C-terminus. E1 also
           associates with E2 and promotes ubiquitin transfer to
           the E2's catalytic cysteine. A set of novel molecules
           with a structural similarity to Ub, called Ub-like
           proteins (Ubls), have similar conjugation cascades. In
           contrast to ubiquitin-E1, which is a single-chain
           protein with a weakly conserved two-fold repeat, many of
           the Ubls-E1are a heterodimer where each subunit
           corresponds to one half of a single-chain E1. This CD
           represents the family homologous to the second repeat of
           Ub-E1.
          Length = 234

 Score = 29.1 bits (65), Expect = 2.4
 Identities = 10/25 (40%), Positives = 15/25 (60%), Gaps = 2/25 (8%)

Query: 169 DTNVLGFKGPRKMTVLIPGMTQSYE 193
           ++   GFKG     V++PGMT+  E
Sbjct: 121 ESGTEGFKG--NAQVILPGMTECIE 143


>gnl|CDD|221965 pfam13186, DUF4008, Domain of unknown function (DUF4008).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and archaea, and is
           approximately 70 amino acids in length. The family is
           found in association with pfam04055.
          Length = 65

 Score = 27.0 bits (60), Expect = 2.4
 Identities = 6/23 (26%), Positives = 10/23 (43%)

Query: 202 ISETLIECWNSKNMDNIIELHNK 224
           I ++L E WNS+      +    
Sbjct: 32  IKDSLKEIWNSEKYREFRKAVLD 54


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 29.6 bits (67), Expect = 2.4
 Identities = 12/41 (29%), Positives = 21/41 (51%)

Query: 66  YPDRSSQDDEDDDDDDEDDIEQELSDISLHRGSHVISDPDE 106
             +     D++D++DD+DD+E+E  D+ L        D D 
Sbjct: 323 EEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363


>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
          Length = 251

 Score = 29.2 bits (66), Expect = 2.6
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 68 DRSSQDDEDDDDDDEDDIEQELSDI 92
          D    DDED +D+   ++ QE SD+
Sbjct: 69 DSEPPDDEDLEDEQFLEVYQEASDL 93


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 29.2 bits (66), Expect = 2.8
 Identities = 13/31 (41%), Positives = 15/31 (48%)

Query: 58  DTSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
           D            S+DDEDDDDDD+DD    
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIAT 160



 Score = 28.0 bits (63), Expect = 7.1
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query: 54  QASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQ 87
               + SD       +S  D++DDDDDD+DDI  
Sbjct: 127 DDDDEESDEEDEESSKSEDDEDDDDDDDDDDIAT 160


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The
          domain is found in the primary vegetative sigma factor.
          The function of this domain is unclear and can be
          removed without loss of function.
          Length = 211

 Score = 28.7 bits (65), Expect = 2.9
 Identities = 9/16 (56%), Positives = 15/16 (93%)

Query: 73 DDEDDDDDDEDDIEQE 88
          DD+DDD+D+++D E+E
Sbjct: 55 DDDDDDEDEDEDDEEE 70



 Score = 28.7 bits (65), Expect = 3.1
 Identities = 9/23 (39%), Positives = 16/23 (69%)

Query: 68 DRSSQDDEDDDDDDEDDIEQELS 90
          D    DD+D+D+D++D+ E +L 
Sbjct: 52 DLEDDDDDDEDEDEDDEEEADLG 74



 Score = 27.2 bits (61), Expect = 8.8
 Identities = 7/24 (29%), Positives = 16/24 (66%)

Query: 68 DRSSQDDEDDDDDDEDDIEQELSD 91
              ++D +DDDDD++D +++  +
Sbjct: 46 SELDEEDLEDDDDDDEDEDEDDEE 69


>gnl|CDD|234846 PRK00844, glgC, glucose-1-phosphate adenylyltransferase;
           Provisional.
          Length = 407

 Score = 29.0 bits (66), Expect = 3.3
 Identities = 12/30 (40%), Positives = 20/30 (66%), Gaps = 3/30 (10%)

Query: 250 NFQIIHDNDPEYIVLQFGRTSEDVFTMDYR 279
           +  +I D DP+Y+V+ FG  ++ V+ MD R
Sbjct: 108 SLNLIEDEDPDYVVV-FG--ADHVYRMDPR 134


>gnl|CDD|223009 PHA03211, PHA03211, serine/threonine kinase US3; Provisional.
          Length = 461

 Score = 28.7 bits (64), Expect = 3.7
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 66  YPDRSSQDDEDDDDDDEDDI 85
           YPD  ++ ++DDDDD  DD+
Sbjct: 84  YPDSGAEAEDDDDDDAPDDV 103


>gnl|CDD|238278 cd00498, Hsp33, Heat shock protein 33 (Hsp33):  Cytosolic protein
           that acts as a molecular chaperone under oxidative
           conditions.  In normal (reducing) cytosolic conditions,
           four conserved Cys residues are coordinated by a Zn ion.
            Under oxidative stress (such as heat shock), the Cys
           are reversibly oxidized to disulfide bonds, which causes
           the chaperone activity to be turned on.  Hsp33 is
           homodimeric in its functional form.
          Length = 275

 Score = 28.7 bits (65), Expect = 3.9
 Identities = 15/71 (21%), Positives = 26/71 (36%), Gaps = 10/71 (14%)

Query: 76  DDDDDDEDDIEQELSDISLHRGSHVISDPDEINSGSLPIEMQGEVVGNLERFVLEPAAQK 135
             D++D D  E             + +       G  P E+   +    E  +LE   Q 
Sbjct: 178 GADEEDIDAWE--------KVIKLMPTVSALELLGLSPEELLYRLFHEEEVRILEK--QP 227

Query: 136 VSYKCRITRDR 146
           V ++C  +R+R
Sbjct: 228 VRFRCDCSRER 238


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.
          The mitochondrial protein translocase family, which is
          responsible for movement of nuclear encoded
          pre-proteins into mitochondria, is very complex with at
          least 19 components. These proteins include several
          chaperone proteins, four proteins of the outer membrane
          translocase (Tom) import receptor, five proteins of the
          Tom channel complex, five proteins of the inner
          membrane translocase (Tim) and three "motor" proteins.
          This family represents the Tom22 proteins. The N
          terminal region of Tom22 has been shown to have
          chaperone-like activity, and the C terminal region
          faces the intermembrane face.
          Length = 136

 Score = 27.6 bits (62), Expect = 4.0
 Identities = 11/28 (39%), Positives = 19/28 (67%)

Query: 66 YPDRSSQDDEDDDDDDEDDIEQELSDIS 93
           P   +Q++ DDDD+D+ D + ++SD S
Sbjct: 18 APKNLAQEESDDDDEDDTDTDSDISDDS 45



 Score = 26.8 bits (60), Expect = 8.4
 Identities = 10/39 (25%), Positives = 12/39 (30%)

Query: 50 FTRTQASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
          F    A+           D     D D D  D+ D E E
Sbjct: 12 FQEKPAAPKNLAQEESDDDDEDDTDTDSDISDDSDFENE 50


>gnl|CDD|235394 PRK05297, PRK05297, phosphoribosylformylglycinamidine synthase;
           Provisional.
          Length = 1290

 Score = 29.0 bits (66), Expect = 4.3
 Identities = 16/62 (25%), Positives = 25/62 (40%), Gaps = 2/62 (3%)

Query: 78  DDDDEDDIEQELSDISLHRGSHVISDPDEINSGSLPIEMQGEVVGNLERFVLEPAAQKVS 137
              D D +E  L++  L    HVI  P+      + I   G+ V +  R  L     + S
Sbjct: 928 RAADRDAVEAILAEHGLSDCVHVIGKPNA--GDRIVITRNGKTVFSESRTELRRWWSETS 985

Query: 138 YK 139
           Y+
Sbjct: 986 YQ 987


>gnl|CDD|177868 PLN02222, PLN02222, phosphoinositide phospholipase C 2.
          Length = 581

 Score = 28.8 bits (64), Expect = 4.3
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 68  DRSSQDDEDDDDDDEDDIE 86
           + S+ DD+DDDDD ED  +
Sbjct: 291 NDSNGDDDDDDDDGEDKSK 309


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 28.9 bits (64), Expect = 4.5
 Identities = 15/42 (35%), Positives = 23/42 (54%)

Query: 47  PLQFTRTQASTDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
             +  R  A  D S   +   D   +D+++DDDD+EDD E+E
Sbjct: 133 KRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEE 174


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 28.7 bits (64), Expect = 4.6
 Identities = 13/23 (56%), Positives = 17/23 (73%)

Query: 67  PDRSSQDDEDDDDDDEDDIEQEL 89
           P+ SS +DEDD D+DE D E+E 
Sbjct: 367 PEYSSDEDEDDSDEDEVDYEKER 389


>gnl|CDD|239451 cd03335, TCP1_alpha, TCP-1 (CTT or eukaryotic type II) chaperonin
           family, alpha subunit. Chaperonins are involved in
           productive folding of proteins. They share a common
           general morphology, a double toroid of 2 stacked rings.
           In contrast to bacterial group I chaperonins (GroEL),
           each ring of the eukaryotic cytosolic chaperonin (CTT)
           consists of eight different, but homologous subunits.
           Their common function is to sequester nonnative proteins
           inside their central cavity and promote folding by using
           energy derived from ATP hydrolysis. The best studied in
           vivo substrates of CTT are actin and tubulin.
          Length = 527

 Score = 28.4 bits (64), Expect = 4.8
 Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 1/44 (2%)

Query: 93  SLHRGSHVISDPDEINSGSLPIEMQGEVVGNLERFVLEPAAQKV 136
           + H  + V  D   +    L + + G+V  NLE  VLEP   K+
Sbjct: 461 AYHAAAQVKPDKKHLKWYGLDL-INGKVRDNLEAGVLEPTVSKI 503


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The
          CAF-1 or chromatin assembly factor-1 consists of three
          subunits, and this is the first, or A. The A domain is
          uniquely required for the progression of S phase in
          mouse cells, independent of its ability to promote
          histone deposition but dependent on its ability to
          interact with HP1 - heterochromatin protein 1-rich
          heterochromatin domains next to centromeres that are
          crucial for chromosome segregation during mitosis. This
          HP1-CAF-1 interaction module functions as a built-in
          replication control for heterochromatin, which, like a
          control barrier, has an impact on S-phase progression
          in addition to DNA-based checkpoints.
          Length = 76

 Score = 26.4 bits (59), Expect = 5.0
 Identities = 9/17 (52%), Positives = 15/17 (88%)

Query: 68 DRSSQDDEDDDDDDEDD 84
          D  S+D+ED+++DD+DD
Sbjct: 56 DLESEDEEDEEEDDDDD 72


>gnl|CDD|151714 pfam11273, DUF3073, Protein of unknown function (DUF3073).  This
          family of proteins with unknown function appears to be
          restricted to Actinobacteria.
          Length = 64

 Score = 26.1 bits (58), Expect = 5.1
 Identities = 16/37 (43%), Positives = 19/37 (51%), Gaps = 7/37 (18%)

Query: 56 STDTSDYG------SGYPDRSSQDDEDDDDDDEDDIE 86
          S DT D        SG   RSS DD+D  +DD DD +
Sbjct: 20 SPDT-DLDALQRELSGGSSRSSADDDDQYEDDYDDYD 55


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 28.5 bits (64), Expect = 5.3
 Identities = 7/24 (29%), Positives = 15/24 (62%)

Query: 68  DRSSQDDEDDDDDDEDDIEQELSD 91
             S +D+++D D +E+D + +  D
Sbjct: 322 SVSEEDEDEDSDSEEEDEDDDEDD 345


>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
          Length = 487

 Score = 28.5 bits (64), Expect = 5.6
 Identities = 10/15 (66%), Positives = 11/15 (73%)

Query: 70  SSQDDEDDDDDDEDD 84
           S  D+EDDDDDDE  
Sbjct: 466 SVSDEEDDDDDDEVL 480


>gnl|CDD|220815 pfam10577, UPF0560, Uncharacterized protein family UPF0560.  This
           family of proteins has no known function.
          Length = 805

 Score = 28.3 bits (63), Expect = 5.6
 Identities = 17/61 (27%), Positives = 30/61 (49%), Gaps = 2/61 (3%)

Query: 30  LRPLSSSSSREFYCYNGPLQFTRTQASTDT--SDYGSGYPDRSSQDDEDDDDDDEDDIEQ 87
           LRPL    S+        ++ +R ++STD+      S    R  + +++ DDDD DD  +
Sbjct: 728 LRPLLDEGSKVSGEQAPTVEESRGRSSTDSSREPLSSPRRRRGRRPEDELDDDDGDDAGE 787

Query: 88  E 88
           +
Sbjct: 788 D 788


>gnl|CDD|218566 pfam05349, GATA-N, GATA-type transcription activator, N-terminal.
           GATA transcription factors mediate cell differentiation
           in a diverse range of tissues. Mutation are often
           associated with certain congenital human disorders. The
           six classical vertebrate GATA proteins, GATA-1 to
           GATA-6, are highly homologous and have two tandem zinc
           fingers. The classical GATA transcription factors
           function transcription activators. In lower metazoans
           GATA proteins carry a single canonical zinc finger. This
           family represents the N-terminal domain of the family of
           GATA transcription activators.
          Length = 177

 Score = 27.5 bits (61), Expect = 6.4
 Identities = 13/43 (30%), Positives = 17/43 (39%), Gaps = 8/43 (18%)

Query: 32  PLSSSSSREFYCYNGPL-------QFTRTQASTDTSDYGSGYP 67
           P+S+   R+   Y GPL       Q+         S Y S YP
Sbjct: 95  PMSNGGGRDAGAYQGPLLLSGGREQYGAALRPLGGS-YSSPYP 136


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid). 
          Members of this family are bacterial proteins with a
          conserved motif [KR]FYDLN, sometimes flanked by a pair
          of CXXC motifs, followed by a long region of low
          complexity sequence in which roughly half the residues
          are Asp and Glu, including multiple runs of five or
          more acidic residues. The function of members of this
          family is unknown.
          Length = 104

 Score = 26.5 bits (59), Expect = 6.6
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 73 DDEDDDDDDEDDIEQELSDIS 93
           D+DDDDDD+DD+     D  
Sbjct: 68 LDDDDDDDDDDDLPDLDDDDV 88



 Score = 26.5 bits (59), Expect = 8.6
 Identities = 13/34 (38%), Positives = 19/34 (55%)

Query: 73  DDEDDDDDDEDDIEQELSDISLHRGSHVISDPDE 106
           DD+DDDDDD  D++ +  D+       +  D DE
Sbjct: 71  DDDDDDDDDLPDLDDDDVDLDDDDDDFLEDDDDE 104


>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase
          19 kDa subunit.  This family contains several
          DNA-directed RNA polymerase 19 kDa polypeptides. The
          Poxvirus DNA-directed RNA polymerase (EC: 2.7.7.6)
          catalyzes DNA-template-directed extension of the 3'-end
          of an RNA strand by one nucleotide at a time.
          Length = 167

 Score = 27.4 bits (61), Expect = 6.9
 Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 2/38 (5%)

Query: 56 STDTSDYGSGYPDRSSQDDEDDDDDDEDDIEQELSDIS 93
          S D  DY S   D  S++ E++++D+ED    E SD+S
Sbjct: 4  SDDIIDYESD--DDDSEEYEEEEEDEEDAESLESSDVS 39


>gnl|CDD|239566 cd03484, MutL_Trans_hPMS_2_like, MutL_Trans_hPMS2_like: transducer
           domain, having a ribosomal S5 domain 2-like fold, found
           in proteins similar to human PSM2 (hPSM2). hPSM2 belongs
           to the DNA mismatch repair (MutL/MLH1/PMS2) family.
           This transducer domain is homologous to the second
           domain of the DNA gyrase B subunit, which is known to be
           important in nucleotide hydrolysis and the transduction
           of structural signals from ATP-binding site to the DNA
           breakage/reunion regions of the enzymes. Included in
           this group are proteins similar to yeast PMS1. The yeast
           MLH1-PMS1 and the human MLH1-PMS2 heterodimers play a
           role in meiosis. hMLH1-hPMS2 also participates in the
           repair of all DNA mismatch repair (MMR) substrates.
           Cells lacking hPMS2 have a strong mutator phenotype and
           display microsatellite instability (MSI). Mutation in
           hPMS2 causes predisposition to HPNCC and Turcot
           syndrome.
          Length = 142

 Score = 27.2 bits (61), Expect = 7.0
 Identities = 11/34 (32%), Positives = 15/34 (44%), Gaps = 4/34 (11%)

Query: 153 LYPTYFLHLERDYGKKDTNVLGFKGPRKMTVLIP 186
            YP + L++       D NV     P K TVL+ 
Sbjct: 95  QYPFFILNISLPTSLYDVNVT----PDKRTVLLH 124


>gnl|CDD|221392 pfam12043, DUF3527, Domain of unknown function (DUF3527).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is about 120
           amino acids in length. This domain has a conserved
           CDCGGWD sequence motif.
          Length = 277

 Score = 27.8 bits (62), Expect = 7.9
 Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 1/60 (1%)

Query: 235 YVLNFHGRVTQASVKNFQI-IHDNDPEYIVLQFGRTSEDVFTMDYRYPLCPLQAFSIALS 293
              N     TQ S + F++ +     E   L      E ++++++   L  LQAFSI ++
Sbjct: 212 ARKNKGDSPTQESSQPFELFVQGGKEEDPALSMTNVREGLYSVEFHASLSALQAFSICVA 271


>gnl|CDD|165163 PHA02811, PHA02811, putative host range protein; Provisional.
          Length = 197

 Score = 27.3 bits (60), Expect = 8.1
 Identities = 11/23 (47%), Positives = 18/23 (78%)

Query: 70  SSQDDEDDDDDDEDDIEQELSDI 92
           SS DD+D+D+ D D+ + E++DI
Sbjct: 170 SSDDDDDNDNADNDEEDDEVNDI 192


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 27.5 bits (62), Expect = 8.3
 Identities = 10/15 (66%), Positives = 11/15 (73%)

Query: 73  DDEDDDDDDEDDIEQ 87
           DD+ DDD DEDD E 
Sbjct: 369 DDDWDDDWDEDDDEG 383



 Score = 27.5 bits (62), Expect = 8.9
 Identities = 6/19 (31%), Positives = 12/19 (63%)

Query: 68  DRSSQDDEDDDDDDEDDIE 86
           +     D+D D+DD++ +E
Sbjct: 367 EDDDDWDDDWDEDDDEGVE 385


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 27.0 bits (60), Expect = 8.4
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 67  PDRSSQDDEDDDDDDEDDIEQELSDISLHRGSHV 100
            D    DD+D+DD+D+++ + E  D + ++   +
Sbjct: 49  EDGGDIDDDDEDDEDDEEADAEDDDENPYKLIRL 82


>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
          N-terminal.  This is the very acidic N-terminal region
          of the early transcription elongation factor Spt5. The
          Spt5-Spt4 complex regulates early transcription
          elongation by RNA polymerase II and has an imputed role
          in pre-mRNA processing via its physical association
          with mRNA capping enzymes. The actual function of this
          N-terminal domain is not known although it is
          dispensable for binding to Spt4.
          Length = 92

 Score = 26.3 bits (58), Expect = 8.6
 Identities = 8/19 (42%), Positives = 17/19 (89%)

Query: 73 DDEDDDDDDEDDIEQELSD 91
          DDE++++++E+D  ++LSD
Sbjct: 9  DDEEEEEEEEEDDLEDLSD 27


>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein.  This is a family of
           fungal and plant proteins and contains many hypothetical
           proteins. VID27 is a cytoplasmic protein that plays a
           potential role in vacuolar protein degradation.
          Length = 794

 Score = 27.8 bits (62), Expect = 9.2
 Identities = 7/28 (25%), Positives = 17/28 (60%)

Query: 61  DYGSGYPDRSSQDDEDDDDDDEDDIEQE 88
           D  +   D   +D+E+++++DED+   +
Sbjct: 383 DANTERDDEEEEDEEEEEEEDEDEGPSK 410


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 27.3 bits (61), Expect = 9.3
 Identities = 11/34 (32%), Positives = 13/34 (38%), Gaps = 2/34 (5%)

Query: 53  TQASTDTSDYGSG--YPDRSSQDDEDDDDDDEDD 84
             +     D GS     D    D   DDDD +DD
Sbjct: 197 DDSDAAGGDQGSNGDDDDGGFADSGYDDDDMDDD 230


>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
           subunit 1; Provisional.
          Length = 319

 Score = 27.3 bits (61), Expect = 9.4
 Identities = 6/24 (25%), Positives = 17/24 (70%)

Query: 68  DRSSQDDEDDDDDDEDDIEQELSD 91
              S+D++++D+D+E++ + +  D
Sbjct: 295 YSESEDEDEEDEDEEEEEDDDEGD 318


>gnl|CDD|234824 PRK00725, glgC, glucose-1-phosphate adenylyltransferase;
           Provisional.
          Length = 425

 Score = 27.5 bits (62), Expect = 9.4
 Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 7/36 (19%)

Query: 243 VTQASVKNFQIIHDNDPEYIVLQFGRTSEDVFTMDY 278
           V Q    N  II   DP+Y+V+  G   + ++ MDY
Sbjct: 117 VYQ----NLDIIRRYDPKYVVILAG---DHIYKMDY 145


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.133    0.387 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,066,881
Number of extensions: 1422329
Number of successful extensions: 3274
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2855
Number of HSP's successfully gapped: 134
Length of query: 302
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 206
Effective length of database: 6,679,618
Effective search space: 1376001308
Effective search space used: 1376001308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.6 bits)