RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= 046801
         (272 letters)



>gnl|CDD|107224 cd06467, p23_NUDC_like, p23_like domain of NUD (nuclear
           distribution) C and similar proteins. Aspergillus
           nidulas (An) NUDC is needed for nuclear movement. AnNUDC
           is localized at the hyphal cortex, and binds NUDF at
           spindle pole bodies (SPBs) and in the cytoplasm at
           different stages in the cell cycle. At the SPBs it is
           part of the dynein molecular motor/NUDF complex that
           regulates microtubule dynamics.  Mammalian(m) NUDC
           associates both with the dynein complex and also with an
           anti-inflammatory enzyme, platelet activating factor
           acetylhydrolase I, PAF-AH(I) complex, through binding
           mNUDF, the regulatory beta subunit of PAF-AH(I).  mNUDC
           is important for cell proliferation both in normal and
           tumor tissues.  Its expression is elevated in various
           cell types undergoing mitosis or stimulated to
           proliferate, with high expression levels observed in
           leukemic cells and tumors.  For a leukemic cell line,
           human NUDC was shown to activate the thrombopoietin
           (TPO) receptor (Mpl) by binding to its extracellular
           domain, and promoting cell proliferation and
           differentiation.  This group also includes the human
           broadly immunogenic tumor associated antigen, CML66,
           which is highly expressed in a variety of solid tumors
           and in leukemias. In normal tissues high expression of
           CML66 is limited to testis and heart.
          Length = 85

 Score =  135 bits (343), Expect = 6e-41
 Identities = 46/85 (54%), Positives = 61/85 (71%)

Query: 155 YSWTQILQEVTVSVPVPAGTKSRFVLCEIKKNHLKVGLKGQPPIIDGELHMAVKPDDCYW 214
           YSWTQ L EVTV++P+P GTKS+ V  EI   HLKVG+KG  P++DGEL+  VK D+  W
Sbjct: 1   YSWTQTLDEVTVTIPLPEGTKSKDVKVEITPKHLKVGVKGGEPLLDGELYAKVKVDESTW 60

Query: 215 SIEDQKTVSILLTKQNQMEWWKCLI 239
           ++ED K + I L K+N+ EWW  L+
Sbjct: 61  TLEDGKLLEITLEKRNEGEWWPSLV 85


>gnl|CDD|107241 cd06492, p23_mNUDC_like, p23-like NUD (nuclear distribution) C-like
           domain of mammalian(m) NUDC and similar proteins.
           Mammalian(m) NUDC associates both with the dynein
           complex and also with an anti-inflammatory enzyme,
           platelet activating factor acetylhydrolase I, PAF-AH(I)
           complex, through binding mNUDF, the regulatory beta
           subunit of PAF-AH(I).  mNUDC is important for cell
           proliferation both in normal and tumor tissues.  Its
           expression is elevated in various cell types undergoing
           mitosis or stimulated to proliferate, with high
           expression levels observed in leukemic cells and tumors.
           For a leukemic cell line, human NUDC was shown to
           activate the thrombopoietin (TPO) receptor (Mpl) by
           binding to its extracellular domain, and promoting cell
           proliferation and differentiation.
          Length = 87

 Score =  114 bits (287), Expect = 2e-32
 Identities = 45/83 (54%), Positives = 57/83 (68%), Gaps = 2/83 (2%)

Query: 155 YSWTQILQEVTVSVP--VPAGTKSRFVLCEIKKNHLKVGLKGQPPIIDGELHMAVKPDDC 212
           Y WTQ L EV + VP  V    K + V+ +I++ HLKVGLKGQPPIIDGEL+  VK ++ 
Sbjct: 1   YRWTQTLSEVELKVPFKVSFRLKGKDVVVDIQRKHLKVGLKGQPPIIDGELYNEVKVEES 60

Query: 213 YWSIEDQKTVSILLTKQNQMEWW 235
            W IED K V++ L K N+MEWW
Sbjct: 61  SWLIEDGKVVTVNLEKINKMEWW 83


>gnl|CDD|218355 pfam04969, CS, CS domain.  The CS and CHORD (pfam04968) are fused
           into a single polypeptide chain in metazoans but are
           found in separate proteins in plants; this is thought to
           be indicative of an interaction between CS and CHORD. It
           has been suggested that the CS domain is a binding
           module for HSP90, implying that CS domain-containing
           proteins are involved in recruiting heat shock proteins
           to multiprotein assemblies.
          Length = 78

 Score = 82.6 bits (205), Expect = 1e-20
 Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 153 ENYSWTQILQEVTVSVPVPAGTKSRFVLCEIKKNHLKVGLKGQ---PPIIDGELHMAVKP 209
             Y W Q L EVT+++P+   TK + V  EIK   LKV +KG      ++DGEL   + P
Sbjct: 1   PRYDWYQTLDEVTITIPLKGVTKKKDVKVEIKPRSLKVSIKGPGGKEYLLDGELFGPIDP 60

Query: 210 DDCYWSIEDQKTVSILLTK 228
           ++  W IED K V I L K
Sbjct: 61  EESSWKIEDTK-VEITLKK 78


>gnl|CDD|107220 cd06463, p23_like, Proteins containing this p23_like domain include
           p23 and its Saccharomyces cerevisiae (Sc) homolog Sba1.
           Both are co-chaperones for the heat shock protein (Hsp)
           90.  p23 binds Hsp90 and participates in the folding of
           a number of Hsp90 clients, including the progesterone
           receptor. p23 also has a passive chaperoning activity
           and in addition may participate in prostaglandin
           synthesis.  Both p23 and Sba1p can regulate telomerase
           activity. This group includes domains similar to the
           C-terminal CHORD-SGT1 (CS) domain of suppressor of G2
           allele of Skp1 (Sgt1). Sgt1 interacts with multiple
           protein complexes and has the features of a
           co-chaperone. Human (h) Sgt1 interacts with both Hsp70
           and Hsp90, and has been shown to bind Hsp90 through its
           CS domain.  Saccharomyces cerevisiae (Sc) Sgt1 is a
           subunit of both core kinetochore and SCF
           (Skp1-Cul1-F-box) ubiquitin ligase complexes. Sgt1 is
           required for pathogen resistance in plants.  This group
           also includes the p23_like domains of human
           butyrate-induced transcript 1 (hB-ind1), NUD (nuclear
           distribution) C, Melusin, and NAD(P)H cytochrome b5
           (NCB5) oxidoreductase (OR). hB-ind1 plays a role in the
           signaling pathway mediated by the small GTPase Rac1,
           NUDC is needed for nuclear movement, Melusin interacts
           with two splice variants of beta1 integrin, and NCB5OR
           plays a part in maintaining viable pancreatic beta
           cells.
          Length = 84

 Score = 79.6 bits (197), Expect = 3e-19
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 5/86 (5%)

Query: 157 WTQILQEVTVSVPVPAGTKSRFVLCEIKKNHLKVGLK---GQPPIIDGELHMAVKPDDCY 213
           W Q L EVT+++P+   TK   V  E     L V +K   G+  +++GEL   + P++  
Sbjct: 1   WYQTLDEVTITIPLKDVTKKD-VKVEFTPKSLTVSVKGGGGKEYLLEGELFGPIDPEESK 59

Query: 214 WSIEDQKTVSILLTKQNQMEWWKCLI 239
           W++ED + + I L K+   EWW  L 
Sbjct: 60  WTVED-RKIEITLKKKEPGEWWPRLE 84


>gnl|CDD|107243 cd06494, p23_NUDCD2_like, p23-like NUD (nuclear distribution)
           C-like found in human NUDC domain-containing protein 2
           (NUDCD2) and similar proteins.  Little is known about
           the function of the proteins in this subgroup.
          Length = 93

 Score = 68.3 bits (167), Expect = 7e-15
 Identities = 28/76 (36%), Positives = 49/76 (64%), Gaps = 1/76 (1%)

Query: 156 SWTQILQEVTVSVPVPAGTKSRFVLCEIKKNHLKVGLKGQPPIIDGELHMAVKPDDCYWS 215
            W Q + EV + V VP GT+++ V C++    + + +KGQ  ++ G+L  +V  D+C W+
Sbjct: 9   CWYQTMDEVFIEVNVPPGTRAKDVKCKLGSRDISLAVKGQE-VLKGKLFDSVVADECTWT 67

Query: 216 IEDQKTVSILLTKQNQ 231
           +ED+K + I+LTK N+
Sbjct: 68  LEDRKLIRIVLTKSNR 83


>gnl|CDD|107244 cd06495, p23_NUDCD3_like, p23-like NUD (nuclear distribution)
           C-like domain found in human NUDC domain-containing
           protein 3 (NUDCD3) and similar proteins.   Little is
           known about the function of the proteins in this
           subgroup.
          Length = 102

 Score = 56.0 bits (135), Expect = 3e-10
 Identities = 32/98 (32%), Positives = 58/98 (59%), Gaps = 5/98 (5%)

Query: 153 ENYSWTQILQEVTVSVPVP-AGTKSRFVLCEIKKNHLKVGLK---GQPPIIDGELHMAVK 208
           ENY+W+Q   +V V VPVP    K R V  +++ + ++V ++   G+  +++GE    + 
Sbjct: 5   ENYTWSQDYTDVEVRVPVPKDVVKGRQVSVDLQSSSIRVSVRDGGGEKVLMEGEFTHKIN 64

Query: 209 PDDCYWSIEDQKTVSILLTKQNQMEWWKCLIKGGPEID 246
            ++  WS+E  K V + L+K +++ WW  ++KG  EID
Sbjct: 65  TENSLWSLEPGKCVLLSLSKCSEV-WWNAVLKGEEEID 101


>gnl|CDD|107242 cd06493, p23_NUDCD1_like, p23_NUDCD1: p23-like NUD (nuclear
           distribution) C-like domain found in human NUD (nuclear
           distribution) C domain-containing protein 1, NUDCD1
           (also known as CML66), and similar proteins.
           NUDCD1/CML66 is a broadly immunogenic tumor associated
           antigen, which is highly expressed in a variety of solid
           tumors and in leukemias. In normal tissues high
           expression of NUDCD1/CML66 is limited to testis and
           heart.
          Length = 85

 Score = 52.5 bits (126), Expect = 3e-09
 Identities = 20/85 (23%), Positives = 49/85 (57%)

Query: 155 YSWTQILQEVTVSVPVPAGTKSRFVLCEIKKNHLKVGLKGQPPIIDGELHMAVKPDDCYW 214
           Y W Q  +++T+++ +P  T    +  +   +H+ + LK Q P+++G+L+ ++  +   W
Sbjct: 1   YYWQQTEEDLTLTIRLPEDTTKEDIRIKFLPDHISIALKDQAPLLEGKLYSSIDHESSTW 60

Query: 215 SIEDQKTVSILLTKQNQMEWWKCLI 239
            I++ K++ + L K+++   W  L+
Sbjct: 61  IIKENKSLEVSLIKKDEGPTWPELV 85


>gnl|CDD|107219 cd00298, ACD_sHsps_p23-like, This domain family includes the
           alpha-crystallin domain (ACD) of alpha-crystallin-type
           small heat shock proteins (sHsps) and a similar domain
           found in p23-like proteins.  sHsps are small stress
           induced proteins with monomeric masses between 12 -43
           kDa, whose common feature is this ACD. sHsps are
           generally active as large oligomers consisting of
           multiple subunits, and are believed to be
           ATP-independent chaperones that prevent aggregation and
           are important in refolding in combination with other
           Hsps. p23 is a cochaperone of the Hsp90 chaperoning
           pathway. It binds Hsp90 and participates in the folding
           of a number of Hsp90 clients including the progesterone
           receptor. p23 also has a passive chaperoning activity.
           p23 in addition may act as the cytosolic prostaglandin
           E2 synthase. Included in this family is the p23-like
           C-terminal CHORD-SGT1 (CS) domain of suppressor of G2
           allele of Skp1 (Sgt1) and  the p23-like domains of human
           butyrate-induced transcript 1 (hB-ind1), NUD (nuclear
           distribution) C, Melusin, and NAD(P)H cytochrome b5
           (NCB5) oxidoreductase (OR).
          Length = 80

 Score = 36.8 bits (86), Expect = 0.001
 Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 12/82 (14%)

Query: 157 WTQILQEVTVSVPVPAGTKSRFVLCEIKKNHLKVGLK----------GQPPIIDGELHMA 206
           W Q   EV V+V +P G K   +  E++ N L +  K                  EL   
Sbjct: 1   WYQTDDEVVVTVDLP-GVKKEDIKVEVEDNVLTISGKREEEEERERSYGEFERSFELPED 59

Query: 207 VKPDDCYWSIEDQKTVSILLTK 228
           V P+    S+E+   + I L K
Sbjct: 60  VDPEKSKASLEN-GVLEITLPK 80


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 38.4 bits (90), Expect = 0.003
 Identities = 23/102 (22%), Positives = 39/102 (38%), Gaps = 9/102 (8%)

Query: 50  LEKAFEFVAKESDFLTKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKMKEE 109
           L    E   +E +FLT      + +  +    EK    +K        +++  A K KEE
Sbjct: 388 LAAFLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEK-----KEKKKKAFAGKKKEE 442

Query: 110 KKEEEKFEDLPTMKVQKAPPPEPQLDANGNPARAPNKGNGLD 151
           ++EEEK E     K ++    E + +      +   K   L 
Sbjct: 443 EEEEEK-EKKEEEKEEEEEEAEEEKEEEEEKKK---KQATLF 480


>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
          Length = 222

 Score = 31.5 bits (71), Expect = 0.36
 Identities = 13/53 (24%), Positives = 27/53 (50%)

Query: 96  PHQEEVKAKKMKEEKKEEEKFEDLPTMKVQKAPPPEPQLDANGNPARAPNKGN 148
           PH  + K+K  K+E +E+++  +   ++ +K   P  Q  +   P + P + N
Sbjct: 143 PHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPN 195


>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
          Length = 1066

 Score = 32.2 bits (73), Expect = 0.36
 Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 2/63 (3%)

Query: 85  SNTKKQPQPQSPHQEEVKAKKMKEEKKEEEKFEDLPTMKVQKAPPPEPQLDANGNPARAP 144
           S T+ + + +   +EE++ KK KEEK +E++ + L     QK    + Q     +    P
Sbjct: 2   SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAA--QKEAKAKLQAQQASDGTNVP 59

Query: 145 NKG 147
            K 
Sbjct: 60  KKS 62


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 31.9 bits (73), Expect = 0.39
 Identities = 13/113 (11%), Positives = 30/113 (26%), Gaps = 2/113 (1%)

Query: 10   DENKPDPKKNPSASASSSSPSSAKTSSFSATLDPSNPLGFLEKAFEFVAKESDFLTKDAA 69
              +    +  P  +    S         S   D  +  G    +      +       AA
Sbjct: 1261 SSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAA 1320

Query: 70   EKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKA--KKMKEEKKEEEKFEDLP 120
             K+   + +    K  +  +  Q  +     +    +K K +   E+  +   
Sbjct: 1321 LKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEV 1373


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 31.4 bits (72), Expect = 0.39
 Identities = 17/68 (25%), Positives = 30/68 (44%), Gaps = 4/68 (5%)

Query: 56  FVAKES--DFLTKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSP--HQEEVKAKKMKEEKK 111
            V  E+  D L+K  A+   +AA  A K+      K+           E + ++ +EE++
Sbjct: 254 IVTPETADDILSKAVAQALALAAALADKDALDEELKEVLSAQAQAAAAEEEEEEEEEEEE 313

Query: 112 EEEKFEDL 119
           EE   E+ 
Sbjct: 314 EEPSEEEA 321


>gnl|CDD|219594 pfam07816, DUF1645, Protein of unknown function (DUF1645).  These
           sequences are derived from a number of hypothetical
           plant proteins. The region in question is approximately
           270 amino acids long. Some members of this family are
           annotated as yeast pheromone receptor proteins AR781 but
           no literature was found to support this.
          Length = 191

 Score = 30.2 bits (68), Expect = 0.83
 Identities = 20/111 (18%), Positives = 35/111 (31%), Gaps = 17/111 (15%)

Query: 9   EDENKPDPKKNPSASASSSSPSSAKTSSFSATLDPSNPLGFLEKAFEFVAKESDFL---- 64
             +        PS+   SSSP+S+   S S         G  ++      + SD L    
Sbjct: 68  STDEGELEGVPPSSYCVSSSPASSSRKSSST--------GSSKR-----WRLSDLLLFRS 114

Query: 65  TKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKMKEEKKEEEK 115
             D  +  +  A +    K S       P  P   + +     + K+  + 
Sbjct: 115 ASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPASAKSRESSASKGKRRGKT 165


>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
          Length = 333

 Score = 29.9 bits (67), Expect = 1.4
 Identities = 21/103 (20%), Positives = 42/103 (40%), Gaps = 4/103 (3%)

Query: 13  KPDPKKNPSASASSSSPSSAKTSSFSATLDPSNPLGFLEKAFEFVAKESDFLTKDAAEKQ 72
                K  S   ++ +  SAK ++ +A    +      +K  +   K++    K AA+K 
Sbjct: 219 AKSAAKKVSKKKAAKTAVSAKKAAKTA----AKAAKKAKKTAKKALKKAAKAVKKAAKKA 274

Query: 73  IVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKMKEEKKEEEK 115
             AA +AAK  +  TK + + +    ++  A    +   +  K
Sbjct: 275 AKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPK 317


>gnl|CDD|223711 COG0638, PRE1, 20S proteasome, alpha and beta subunits
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 236

 Score = 29.6 bits (67), Expect = 1.4
 Identities = 13/43 (30%), Positives = 22/43 (51%), Gaps = 6/43 (13%)

Query: 47  LGFLEKAFEFVAKESDFLTKDAAEKQIVAAMRAAKEKSSNTKK 89
            GFLEK +       + L+ + A +  V A+RAA E+ + +  
Sbjct: 168 YGFLEKEYR------EDLSLEEAIELAVKALRAAIERDAASGG 204


>gnl|CDD|240309 PTZ00189, PTZ00189, 60S ribosomal protein L21; Provisional.
          Length = 160

 Score = 28.8 bits (65), Expect = 1.7
 Identities = 12/45 (26%), Positives = 20/45 (44%)

Query: 62  DFLTKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKM 106
           DFL + AA   I A  +   E  S  ++  QP+     + K  ++
Sbjct: 104 DFLKRVAANDAIKAEAKKKGELPSTKRQPAQPRPAGFVKPKPTEV 148


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 29.7 bits (67), Expect = 1.9
 Identities = 17/55 (30%), Positives = 22/55 (40%), Gaps = 7/55 (12%)

Query: 11  ENKPDPKKNPSASASSSSPSSAKTSSFSATLDPSNPLGFLEKAFEFV--AKESDF 63
           + +   K  P AS S        +SS S+  D S     LE    F+  AKES  
Sbjct: 746 DYRLTSKPIPEASTSEEVSPVLPSSSMSSDEDQS-----LEYELSFIREAKESGV 795


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 29.5 bits (66), Expect = 2.0
 Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 4/74 (5%)

Query: 64  LTKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKMKEEKK----EEEKFEDL 119
           L+ D A K++           +   K+P+ +S  +EE + +++KEEKK    + ++    
Sbjct: 71  LSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKD 130

Query: 120 PTMKVQKAPPPEPQ 133
              K +      P+
Sbjct: 131 RKPKEEAKEKRPPK 144


>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
           membranes [Cell envelope biogenesis, outer membrane].
          Length = 244

 Score = 29.0 bits (65), Expect = 2.4
 Identities = 13/66 (19%), Positives = 24/66 (36%)

Query: 79  AAKEKSSNTKKQPQPQSPHQEEVKAKKMKEEKKEEEKFEDLPTMKVQKAPPPEPQLDANG 138
               + +  K + +P+   + +    K K + K + K +  P  K   +        A  
Sbjct: 74  ETPPEPTPPKPKEKPKPEKKPKKPKPKPKPKPKPKPKVKPQPKPKKPPSKTAAKAPAAPN 133

Query: 139 NPARAP 144
            PAR P
Sbjct: 134 QPARPP 139


>gnl|CDD|215406 PLN02761, PLN02761, lipase class 3 family protein.
          Length = 527

 Score = 29.2 bits (65), Expect = 2.9
 Identities = 20/101 (19%), Positives = 40/101 (39%), Gaps = 12/101 (11%)

Query: 24  ASSSSPSSAKTSSFSATLDPSNPLGFLEKAFEFVAKESDFLTKDAAEKQIVAAMRAAKEK 83
           AS+S P + K   F ++  P+           F  +    +     +   +    +    
Sbjct: 2   ASASLPITLKNPRFFSS-SPNKI---------FKTQPQTLILTTKFKTCSIICSSSCTSI 51

Query: 84  SSNTKKQPQPQSPHQEEVKAKKMKEEKKEEEKFEDLPTMKV 124
           SS+T +Q Q     Q  V   K +EE +EE + +++   ++
Sbjct: 52  SSSTTQQKQSNK--QTHVSDNKREEEPEEELEEKEVSLREI 90


>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.  Members
           of this family are necessary for accurate chromosome
           transmission during cell division.
          Length = 804

 Score = 29.0 bits (65), Expect = 3.1
 Identities = 29/144 (20%), Positives = 51/144 (35%), Gaps = 19/144 (13%)

Query: 8   QEDENKPDPKKNPSASASSSSPSSAKTSS--FSATLDPSNPLGFLEKAFEFVAKESDFLT 65
           + ++N P        S  S   S+       + A+ DP      L+         S + T
Sbjct: 76  RYNQNAPGAPSVGPDSDLSQKTSTFSPCQSGYEASTDPEYIP-DLQ------PDPSLWGT 128

Query: 66  KDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKMKEEKKEEEKFEDLPTMKVQ 125
               E Q   A           + QPQPQ+P Q+ +  ++++ + ++ ++   LP    Q
Sbjct: 129 APKPEPQPPQA----------PESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQ 178

Query: 126 KAPPPEPQLDANGNPARAPNKGNG 149
             P   P   A       P +  G
Sbjct: 179 VLPQGMPPRQAAFPQQGPPEQPPG 202


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 20/93 (21%), Positives = 28/93 (30%), Gaps = 5/93 (5%)

Query: 70  EKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKMKEEKK-EEEKFEDLPTMKVQKAP 128
               ++ +  A   ++ T        P   +  A K K+                   A 
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAE 420

Query: 129 PPEPQLDANGNPARAPNKGNGLDLENYSWTQIL 161
           P EP      N A AP   N  +L    W QIL
Sbjct: 421 PTEPSPTPPANAANAPPSLNLEEL----WQQIL 449


>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
          domain.  Spiders use fibroins to make silk strands.
          This family includes tubuliform silk fibroins which are
          used to protect egg cases. This domain is a structural
          domain which is found in repeats of up to 20 in many
          individuals (although this is not necessarily the
          case). RP1 makes up structural domains in the N
          terminal while RP2 makes up structural domains in the C
          terminal.
          Length = 167

 Score = 27.9 bits (62), Expect = 3.8
 Identities = 8/24 (33%), Positives = 17/24 (70%)

Query: 21 SASASSSSPSSAKTSSFSATLDPS 44
          ++ A+SS+ SS+  S+F+ +L  +
Sbjct: 3  ASQAASSASSSSSASAFAQSLSSA 26


>gnl|CDD|215167 PLN02296, PLN02296, carbonate dehydratase.
          Length = 269

 Score = 28.2 bits (63), Expect = 4.4
 Identities = 28/101 (27%), Positives = 45/101 (44%), Gaps = 17/101 (16%)

Query: 45  NPLGFLEKAFEFVAKESDFLTKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAK 104
           NP  FL K      +E  F+++ A     +A + AA+    N K          +E++ +
Sbjct: 184 NPAKFLRK---LTEEEIAFISQSATNYSNLAQVHAAE----NAKSF--------DEIEFE 228

Query: 105 KMKEEKKEEEKFEDLPTM-KVQKAPPPEPQLDANGNPARAP 144
           K+   KK   + E+  +M  V +  PPE  L  N  P +AP
Sbjct: 229 KVLR-KKFARRDEEYDSMLGVVRETPPELILPDNILPDKAP 268


>gnl|CDD|215585 PLN03115, PLN03115, ferredoxin--NADP(+) reductase; Provisional.
          Length = 367

 Score = 28.4 bits (63), Expect = 4.5
 Identities = 22/93 (23%), Positives = 33/93 (35%), Gaps = 19/93 (20%)

Query: 22  ASASSSSPSSAKTSSFSATLDPSNPLGFLEKAFEFVAKESDFLTKDAAEKQIVAAMRAAK 81
            +A+ S PSS  +S  + T   S     L+K   +         +       V ++RA  
Sbjct: 5   VNAAVSLPSSKSSSLPARTSAISPERIRLKKGPLYYRNNVSSGKR-------VVSIRA-- 55

Query: 82  EKSSNTKKQPQPQSPHQEEVKAKKMKEEKKEEE 114
                     Q  +    E  AK +K  KK EE
Sbjct: 56  ----------QVTTETTTEAPAKVVKVSKKNEE 78


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 28.4 bits (64), Expect = 5.0
 Identities = 19/78 (24%), Positives = 28/78 (35%), Gaps = 7/78 (8%)

Query: 44  SNPLGFLEKAFEF-VAKESDFLTKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVK 102
             PL    +  E      S    K   +KQ  A  +A KE++     +       + E  
Sbjct: 389 DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAK------KKAEAA 442

Query: 103 AKKMKEEKKEEEKFEDLP 120
           AKK K    E +K +  P
Sbjct: 443 AKKAKGPDGETKKVDPDP 460


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 28.2 bits (63), Expect = 5.5
 Identities = 15/82 (18%), Positives = 31/82 (37%), Gaps = 7/82 (8%)

Query: 66  KDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKMKEEKKEEEKFEDLPTMKVQ 125
           K   EK+   A++A K++  +  ++ + +  H+ +         K +E K +        
Sbjct: 563 KLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNF 622

Query: 126 KAPPPEPQLDANGNPARAPNKG 147
           +A       D  G+  R    G
Sbjct: 623 QA-------DKIGDKVRIRYFG 637


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 944

 Score = 28.2 bits (63), Expect = 5.5
 Identities = 21/145 (14%), Positives = 44/145 (30%), Gaps = 7/145 (4%)

Query: 2   AIISDFQEDENKPDPKKNPSASASSSSPSSAKTSSFSATLDPSNPLGFLEKAFEFVAKES 61
           +++SD      K    K  SA     +P S    + S +   S+P      A   +  + 
Sbjct: 621 SLLSDLDALSPKEGDGKKSSADRKPKTPPSRAPPA-SLSKPASSPDASQTSASFDLDPDF 679

Query: 62  DFLTKDAA-EKQIVAAMRAAKEKSSNTKKQP-----QPQSPHQEEVKAKKMKEEKKEEEK 115
           +  T  +  E  + +    A     +   +P        +   +           +  E 
Sbjct: 680 ELATHQSVPEAALASGSAPAPPPVPDPYDRPPWEEAPEVASANDGPNNAAEGNLSESVED 739

Query: 116 FEDLPTMKVQKAPPPEPQLDANGNP 140
             +     V++    +PQ+ A    
Sbjct: 740 ASNSELQAVEQQATHQPQVQAEAQS 764


>gnl|CDD|222651 pfam14286, DHHW, DHHW protein.  This family of proteins is found in
           bacteria. Proteins in this family are typically between
           366 and 404 amino acids in length. There is a conserved
           DHHW motif.
          Length = 378

 Score = 28.1 bits (63), Expect = 5.7
 Identities = 15/76 (19%), Positives = 27/76 (35%), Gaps = 10/76 (13%)

Query: 4   ISDFQEDENKPDPKKN-------PSASASSSSPSSAKTSS---FSATLDPSNPLGFLEKA 53
             D  E E +  P +         S + SS   SS + SS       +DP + + F +  
Sbjct: 67  KDDIPETEGETAPTQPIQTVAPEESDTVSSEDASSNQDSSAETTVIPVDPDDSVVFDDGK 126

Query: 54  FEFVAKESDFLTKDAA 69
            +   +   ++    A
Sbjct: 127 VQGEIQNGVYVYGGRA 142


>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR).  This
           family consists of several bovine specific leukaemia
           virus receptors which are thought to function as
           transmembrane proteins, although their exact function is
           unknown.
          Length = 561

 Score = 28.1 bits (62), Expect = 6.3
 Identities = 20/90 (22%), Positives = 32/90 (35%)

Query: 88  KKQPQPQSPHQEEVKAKKMKEEKKEEEKFEDLPTMKVQKAPPPEPQLDANGNPARAPNKG 147
           KK  +P+   ++E + ++ K++KKE E F+ L               +A+         G
Sbjct: 197 KKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSG 256

Query: 148 NGLDLENYSWTQILQEVTVSVPVPAGTKSR 177
              D E         E T   P     K R
Sbjct: 257 TAPDSEPDEPKDAEAEETKKSPKHKKKKQR 286


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 27.7 bits (62), Expect = 6.5
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 108 EEKKEEEKFEDLPTMKVQKAPPPEPQLDANG 138
           EE++EEE+ E+ P  +V   P PE QL A+G
Sbjct: 408 EEEEEEEEEEEEPVAEVMMMPAPEMQLPASG 438


>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic.  This model
           describes the eukaryotic 60S (cytosolic) ribosomal
           protein L7 and paralogs that may or may not also be L7.
           Human, Drosophila, and Arabidopsis all have both a
           typical L7 and an L7-related paralog. This family is
           designated subfamily rather than equivalog to reflect
           these uncharacterized paralogs. Members of this family
           average ~ 250 residues in length, somewhat longer than
           the archaeal L30P/L7E homolog (~ 155 residues) and much
           longer than the related bacterial/organellar form (~ 60
           residues).
          Length = 235

 Score = 27.3 bits (61), Expect = 6.8
 Identities = 13/62 (20%), Positives = 24/62 (38%), Gaps = 2/62 (3%)

Query: 65  TKDAAEKQIVAAMRAAKEKSSNTKKQPQP--QSPHQEEVKAKKMKEEKKEEEKFEDLPTM 122
                 KQ  A  +A K+K     K+ +   +   + E +  ++K   K+  KF      
Sbjct: 13  LAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGKFYVPAEH 72

Query: 123 KV 124
           K+
Sbjct: 73  KL 74


>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 926

 Score = 28.0 bits (62), Expect = 7.4
 Identities = 18/89 (20%), Positives = 35/89 (39%), Gaps = 9/89 (10%)

Query: 51  EKAFEFVAKESDFLTKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEE-VKAKKMKEE 109
           E+ FE+     +  +  +  K   A +    + ++  K++P+ + P+ +E        E 
Sbjct: 762 EEEFEYPRM-YEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSVET 820

Query: 110 KKE-------EEKFEDLPTMKVQKAPPPE 131
           + E       EEK  D P +   K   P 
Sbjct: 821 EGERCTIKQREEKGIDAPAILNVKKKKPY 849


>gnl|CDD|215138 PLN02248, PLN02248, cellulose synthase-like protein.
          Length = 1135

 Score = 27.7 bits (62), Expect = 7.8
 Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 3/30 (10%)

Query: 98  QEEVKAKKMKEEKKEEEKFEDLPTMKVQKA 127
           +EE+KAKK + E    +  E L   KV KA
Sbjct: 498 REEIKAKKKQRESGGGDPSEPL---KVPKA 524


>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
          Length = 186

 Score = 27.0 bits (60), Expect = 7.8
 Identities = 16/81 (19%), Positives = 27/81 (33%), Gaps = 3/81 (3%)

Query: 64  LTKDAAEKQIVAAMRAAKEKSSNTKKQPQPQSPHQEEVKAKKMKEEKKEEEKFEDLPTMK 123
            T+  AE +   A +  KE   N KK    ++  ++  K   ++ EKK  E   +    K
Sbjct: 94  FTEAQAEAKF-EAWKEEKEGKVNAKKDKLSKA--KKAAKKAALEAEKKVNEARAEAVAEK 150

Query: 124 VQKAPPPEPQLDANGNPARAP 144
                      +A        
Sbjct: 151 KAAEAAAVAAEEAAAAEEEEA 171


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.307    0.125    0.354 

Gapped
Lambda     K      H
   0.267   0.0552    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,385,481
Number of extensions: 1223943
Number of successful extensions: 2167
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1969
Number of HSP's successfully gapped: 184
Length of query: 272
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 177
Effective length of database: 6,723,972
Effective search space: 1190143044
Effective search space used: 1190143044
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 58 (26.5 bits)