RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3337
         (297 letters)



>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score = 63.5 bits (155), Expect = 1e-11
 Identities = 21/87 (24%), Positives = 37/87 (42%)

Query: 112 VFAGSESLQRVVRLSRNGDLMVTGGTDGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSISG 171
              G       V  S +G L+ TG  DG +++W     + L  ++ H+  + DV  S  G
Sbjct: 4   TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADG 63

Query: 172 EQIVSIAKDGKAFVWNSKNGSLSKELK 198
             + S + D    +W+ + G   + L 
Sbjct: 64  TYLASGSSDKTIRLWDLETGECVRTLT 90



 Score = 54.6 bits (132), Expect = 1e-08
 Identities = 18/76 (23%), Positives = 35/76 (46%)

Query: 123 VRLSRNGDLMVTGGTDGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSISGEQIVSIAKDGK 182
           V  S +G ++ +   D  +++W     K L  +  H+  ++ V FS  G  + S ++DG 
Sbjct: 99  VAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGT 158

Query: 183 AFVWNSKNGSLSKELK 198
             +W+ + G     L 
Sbjct: 159 IKLWDLRTGKCVATLT 174



 Score = 52.7 bits (127), Expect = 6e-08
 Identities = 21/76 (27%), Positives = 38/76 (50%)

Query: 123 VRLSRNGDLMVTGGTDGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSISGEQIVSIAKDGK 182
           V  S +G+ +++  +DG ++LW     K L  +  H   ++ V FS  G  + S ++DG 
Sbjct: 183 VAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGT 242

Query: 183 AFVWNSKNGSLSKELK 198
             VW+ + G   + L 
Sbjct: 243 IRVWDLRTGECVQTLS 258



 Score = 52.0 bits (125), Expect = 9e-08
 Identities = 23/88 (26%), Positives = 39/88 (44%)

Query: 110 QTVFAGSESLQRVVRLSRNGDLMVTGGTDGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSI 169
            T   G       V  S +G  + +   DG ++LW     K +  +  H+ E++ V FS 
Sbjct: 128 LTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP 187

Query: 170 SGEQIVSIAKDGKAFVWNSKNGSLSKEL 197
            GE+++S + DG   +W+   G     L
Sbjct: 188 DGEKLLSSSSDGTIKLWDLSTGKCLGTL 215



 Score = 48.5 bits (116), Expect = 1e-06
 Identities = 16/65 (24%), Positives = 34/65 (52%)

Query: 123 VRLSRNGDLMVTGGTDGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSISGEQIVSIAKDGK 182
           V  S +G L+ +G  DG +R+W     + +  +  H+  +  + +S  G+++ S + DG 
Sbjct: 225 VAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGT 284

Query: 183 AFVWN 187
             +W+
Sbjct: 285 IRIWD 289



 Score = 48.5 bits (116), Expect = 1e-06
 Identities = 20/78 (25%), Positives = 36/78 (46%)

Query: 121 RVVRLSRNGDLMVTGGTDGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSISGEQIVSIAKD 180
           R V  S +G  + +G +D  +RLW     + +  +  H+  +  V FS  G  + S ++D
Sbjct: 55  RDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRD 114

Query: 181 GKAFVWNSKNGSLSKELK 198
               VW+ + G     L+
Sbjct: 115 KTIKVWDVETGKCLTTLR 132


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score = 50.1 bits (118), Expect = 6e-07
 Identities = 39/177 (22%), Positives = 72/177 (40%), Gaps = 17/177 (9%)

Query: 103 FKTFDSVQTVFAGSESLQRVVRLSRNGDLMVTGGTDGHLRLWSFP-KMKPLLHIEAHSKE 161
             T   +++  +G      V   S +G L+ +G +DG +RLW        L  +  HS  
Sbjct: 228 LSTGKLLRSTLSGHSD-SVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSS 286

Query: 162 IDDVDFSISGEQIVSIAKDGKAFVWNSKNGSLSKELKWNTPDNIKYLFKRCRYGLVEDSP 221
           +  V FS  G+ + S + DG   +W+ + G L   L     +             +  SP
Sbjct: 287 VLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEG--------PVSSLSFSP 338

Query: 222 KRSRLFTLANPLAQNKRGISYVQQWDVDSGRLRLAREMKESLSALAVRDDGRFVAVG 278
             S L +     + +      ++ WD+ +G+     E   ++ +++   DGR V+ G
Sbjct: 339 DGSLLVS---GGSDDGT----IRLWDLRTGKPLKTLEGHSNVLSVSFSPDGRVVSSG 388



 Score = 46.6 bits (109), Expect = 7e-06
 Identities = 41/193 (21%), Positives = 75/193 (38%), Gaps = 8/193 (4%)

Query: 7   SNGKNIYVVAGQENSNQLYKITAEFTQMNGVVKSKESNN----TADTIVRKRKEKDKENA 62
           S+   I +   + +S+ L  ++   + +  V  S +       ++D  VR     D E  
Sbjct: 260 SSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLW---DLETG 316

Query: 63  KQKGGKKEKSNEKGKNKKDKIEDPPILNSIDSKEKNLRFGFKTFDSVQTVFAGSESLQRV 122
           K       K +E   +      D  +L S  S +  +R  +              S    
Sbjct: 317 KLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRL-WDLRTGKPLKTLEGHSNVLS 375

Query: 123 VRLSRNGDLMVTGGTDGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSISGEQIVSIAKDGK 182
           V  S +G ++ +G TDG +RLW       L +++ H+  +  +DFS  G+ + S + D  
Sbjct: 376 VSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNT 435

Query: 183 AFVWNSKNGSLSK 195
             +W+ K    S 
Sbjct: 436 IRLWDLKTSLKSV 448



 Score = 44.3 bits (103), Expect = 4e-05
 Identities = 34/217 (15%), Positives = 72/217 (33%), Gaps = 22/217 (10%)

Query: 78  NKKDKIEDPPILNSIDSKEKNLRFGFKTFDSVQTVFAGSESLQRVVRLSRNGDLMVTGGT 137
           N    +      + I              D    +  G E     +  S +G+L+++G +
Sbjct: 26  NSLSLLSLGSSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSS 85

Query: 138 DGHLRLWSFPKMKPLLHI--EAHSKEIDDVDF-SISGEQIVSIAK--DGKAFVWNSKNGS 192
           DG ++LW     + L+      H   +  +   S  G  I+  +   DG   +W+     
Sbjct: 86  DGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPG 145

Query: 193 LSKELKWNTPDNIKYLFKRCRYGLVEDSPKRSRLFTLANPLAQNKRGISYVQQWDVDSGR 252
                     +++           +  SP    L +              ++ WD+ +G+
Sbjct: 146 KLIRTLEGHSESVT---------SLAFSPDGKLLAS-------GSSLDGTIKLWDLRTGK 189

Query: 253 -LRLAREMKESLSALAVRDDGRFVAVGTMFTGSVFVY 288
            L       + +S+LA   DG  +       G++ ++
Sbjct: 190 PLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLW 226



 Score = 43.5 bits (101), Expect = 7e-05
 Identities = 33/159 (20%), Positives = 59/159 (37%), Gaps = 4/159 (2%)

Query: 77  KNKKDKIEDPPILNSIDSKEKNLR-FGFKTFDSVQTVFAGSESLQRVVRLSRNGDLMVTG 135
           K      +   IL +  S +  ++ +   T   +     G       +  S +G L+ +G
Sbjct: 114 KLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASG 173

Query: 136 GT-DGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSISGE-QIVSIAKDGKAFVWNSKNGS- 192
            + DG ++LW     KPL  +  H+  +  + FS  G   I S + DG   +W+   G  
Sbjct: 174 SSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKL 233

Query: 193 LSKELKWNTPDNIKYLFKRCRYGLVEDSPKRSRLFTLAN 231
           L   L  ++   +              S    RL+ L +
Sbjct: 234 LRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRS 272



 Score = 40.5 bits (93), Expect = 7e-04
 Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 2/87 (2%)

Query: 113 FAGSESLQRVVRLSRNGDLMVTGG-TDGHLRLWSFPKMKPLLHIEAHSKEIDDVDFSISG 171
             G E     +  S +G L+V+GG  DG +RLW     KPL  +E HS  +  V FS  G
Sbjct: 324 LKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHS-NVLSVSFSPDG 382

Query: 172 EQIVSIAKDGKAFVWNSKNGSLSKELK 198
             + S + DG   +W+   GSL + L 
Sbjct: 383 RVVSSGSTDGTVRLWDLSTGSLLRNLD 409


>gnl|CDD|225333 COG2706, COG2706, 3-carboxymuconate cyclase [Carbohydrate transport
           and metabolism].
          Length = 346

 Score = 40.4 bits (95), Expect = 7e-04
 Identities = 30/108 (27%), Positives = 43/108 (39%), Gaps = 15/108 (13%)

Query: 184 FVWNSKNGSLSKELKWNTPDNIKYLFKRCRYGLVEDSPKRSRLFTLANPLAQNKRGISYV 243
           F  ++K G LS         N  YL           +P +  L+ +  P  +     +Y 
Sbjct: 21  FNLDTKTGELSLLQLVAELGNPTYL---------AVNPDQRHLYVVNEPGEEG-GVAAY- 69

Query: 244 QQWDVDSGRLRLAREMKESLSA---LAVRDDGRFVAVGTMFTGSVFVY 288
            + D D GRL          S    ++V +DGRFV V    +GSV VY
Sbjct: 70  -RIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHSGSVSVY 116


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 35.4 bits (82), Expect = 0.001
 Identities = 10/40 (25%), Positives = 20/40 (50%)

Query: 148 KMKPLLHIEAHSKEIDDVDFSISGEQIVSIAKDGKAFVWN 187
             + L  ++ H+  +  V FS  G+ + S + DG   +W+
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 31.9 bits (73), Expect = 0.029
 Identities = 8/23 (34%), Positives = 13/23 (56%)

Query: 123 VRLSRNGDLMVTGGTDGHLRLWS 145
           V  S +G  + +G  DG ++LW 
Sbjct: 18  VAFSPDGKYLASGSDDGTIKLWD 40


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 35.0 bits (81), Expect = 0.002
 Identities = 12/39 (30%), Positives = 19/39 (48%)

Query: 149 MKPLLHIEAHSKEIDDVDFSISGEQIVSIAKDGKAFVWN 187
            K L  ++ H+  +  V FS  G  + S + DG   VW+
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 32.3 bits (74), Expect = 0.016
 Identities = 10/38 (26%), Positives = 17/38 (44%)

Query: 108 SVQTVFAGSESLQRVVRLSRNGDLMVTGGTDGHLRLWS 145
            +     G       V  S +G+L+ +G  DG +R+W 
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39


>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
           RPA34.5.  This is a family of proteins conserved from
           yeasts to human. Subunit A34.5 of RNA polymerase I is a
           non-essential subunit which is thought to help Pol I
           overcome topological constraints imposed on ribosomal
           DNA during the process of transcription.
          Length = 193

 Score = 33.5 bits (77), Expect = 0.076
 Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 1/60 (1%)

Query: 39  KSKESNNTADTIVRKRKEKDKENAKQKGGKKEKSNEKGKNKKDKIEDPPILNSIDSKEKN 98
           +S+ S       V K  E ++E  K+K  KKE   EK K KKDK E          K+K 
Sbjct: 131 ESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEK-KEKKDKKEKMVEPKGSKKKKKK 189


>gnl|CDD|188875 cd09769, Luminal_IRE1, The Luminal domain, a dimerization domain,
           of the Serine/Threonine protein kinase,
           Inositol-requiring protein 1.  The Luminal domain is a
           dimerization domain present in Inositol-requiring
           protein 1 (IRE1), a serine/threonine protein kinase
           (STK) and a type I transmembrane protein that is
           localized in the endoplasmic reticulum (ER). IRE1, also
           called Endoplasmic reticulum (ER)-to-nucleus signaling
           protein (or ERN), is a kinase receptor that also
           contains an endoribonuclease domain in the cytoplasmic
           side. It plays roles in the signaling of the unfolded
           protein response (UPR), which is activated when protein
           misfolding is detected in the ER in order to decrease
           the synthesis of new proteins and increase the capacity
           of the ER to cope with the stress. IRE1 acts as an ER
           stress sensor and is the oldest and most conserved
           component of the UPR in eukaryotes. During ER stress,
           IRE1 dimerizes through its luminal domain and forms
           oligomers, allowing the kinase domain to undergo
           trans-autophosphorylation. This leads to a
           conformational change that stimulates its
           endoribonuclease activity and results in the cleavage of
           its mRNA substrate, HAC1 in yeast and Xbp1 in metazoans,
           promoting a splicing event that enables translation into
           a transcription factor which activates the UPR. Mammals
           contain two IRE1 proteins, IRE1alpha (or ERN1) and
           IRE1beta (or ERN2). IRE1alpha is expressed in all cells
           and tissues while IRE1beta is found only in intestinal
           epithelial cells.
          Length = 295

 Score = 32.3 bits (74), Expect = 0.29
 Identities = 30/103 (29%), Positives = 36/103 (34%), Gaps = 31/103 (30%)

Query: 138 DGHL---------RLWSFPKMKPLLHIEAHSKEIDDVDFSISGEQIVSIAKDGKAFVWNS 188
           DG L          LWS     PL+ +  HS    D         IV   +DG  +V N 
Sbjct: 10  DGGLHAVDRKTGKILWSLKAEDPLVEVPHHSTLSIDG-----PTFIVE-PRDGSLYVLNP 63

Query: 189 KNGSLSKELKWNTPDNIKYLFKRCRYGLVEDSPKRS---RLFT 228
            N  L K      P  I          LV+ SP RS    L+T
Sbjct: 64  GNEGLKK-----LPFTIPQ--------LVQSSPCRSSDGILYT 93


>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p.  Rrp15p is required for the formation of
           60S ribosomal subunits.
          Length = 132

 Score = 31.2 bits (71), Expect = 0.30
 Identities = 18/73 (24%), Positives = 32/73 (43%), Gaps = 6/73 (8%)

Query: 34  MNGVVKSKES-NNTADTIVRKRKEKDKENAKQKGGKKEKSNEK-GKNKKDKIED----PP 87
              ++ SK   +   D I+ + K+  K   K K  K EK  ++  + +K +  +     P
Sbjct: 6   FAKILASKLPASKRKDPILSRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKP 65

Query: 88  ILNSIDSKEKNLR 100
           +L     KE+ LR
Sbjct: 66  VLPEDLEKERRLR 78


>gnl|CDD|215639 PLN03226, PLN03226, serine hydroxymethyltransferase; Provisional.
          Length = 475

 Score = 31.9 bits (73), Expect = 0.38
 Identities = 15/29 (51%), Positives = 18/29 (62%), Gaps = 3/29 (10%)

Query: 124 RLSRNGDLMVTGGTDGHLRLWSFPKMKPL 152
           RL   G  +VTGGTD HL LW    ++PL
Sbjct: 329 RLMSKGYKLVTGGTDNHLVLWD---LRPL 354


>gnl|CDD|216152 pfam00852, Glyco_transf_10, Glycosyltransferase family 10
           (fucosyltransferase).  This family of
           Fucosyltransferases are the enzymes transferring fucose
           from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This
           family is know as glycosyltransferase family 10.
          Length = 353

 Score = 31.1 bits (71), Expect = 0.63
 Identities = 9/49 (18%), Positives = 20/49 (40%), Gaps = 1/49 (2%)

Query: 213 RYGLVEDSPKRSRLFTLANPLAQNKRGISYVQ-QWDVDSGRLRLAREMK 260
            YG +        +  L   L +  + +++V    +  S R++  RE+ 
Sbjct: 153 PYGYLVPLEGPEEVRVLPEILEKKSKLVAWVVSNCNEPSKRVKYYRELS 201


>gnl|CDD|237280 PRK13034, PRK13034, serine hydroxymethyltransferase; Reviewed.
          Length = 416

 Score = 30.7 bits (70), Expect = 0.87
 Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 4/49 (8%)

Query: 98  NLRFGFKTFDSVQTVFAGSESLQRVVRLSRNGDLMVTGGTDGHLRLWSF 146
            L+  FKT+   + V A +++L  V+   R  DL V+GGTD HL L   
Sbjct: 279 ALQPEFKTY--AKQVIANAQALAEVL-KERGYDL-VSGGTDNHLLLVDL 323


>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
          Length = 200

 Score = 30.3 bits (68), Expect = 0.88
 Identities = 13/36 (36%), Positives = 22/36 (61%), Gaps = 1/36 (2%)

Query: 53  KRKEKDKENAKQKGGKKEKSNE-KGKNKKDKIEDPP 87
           ++K+K+ E   Q G K ++  E KG+ +  + EDPP
Sbjct: 96  EKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPEDPP 131


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 30.6 bits (69), Expect = 1.2
 Identities = 18/59 (30%), Positives = 27/59 (45%)

Query: 39  KSKESNNTADTIVRKRKEKDKENAKQKGGKKEKSNEKGKNKKDKIEDPPILNSIDSKEK 97
            +KE  N +     K KE+ KE  K+K  K ++  +  K K++  E  P       KEK
Sbjct: 94  PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEK 152


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 22/79 (27%), Positives = 41/79 (51%), Gaps = 8/79 (10%)

Query: 25  YKITAEFTQMNGVVKSKESNNTADTIV----RKRKEKDKENAK--QKGGKKEKSNEKGKN 78
            ++T E  ++  +  SK++      IV    +KR+E+ KE  K    G KKE+  E+ K 
Sbjct: 392 LELTEE--EIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449

Query: 79  KKDKIEDPPILNSIDSKEK 97
           KK++ ++     + + KE+
Sbjct: 450 KKEEEKEEEEEEAEEEKEE 468


>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain.  This region covers
           both the Brf homology II and III regions. This region is
           involved in binding TATA binding protein.
          Length = 95

 Score = 28.4 bits (64), Expect = 1.3
 Identities = 12/49 (24%), Positives = 25/49 (51%)

Query: 52  RKRKEKDKENAKQKGGKKEKSNEKGKNKKDKIEDPPILNSIDSKEKNLR 100
            +++EK+ +    +G    K  +K K KK + E  P   + ++ +K L+
Sbjct: 30  EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQ 78


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 30.3 bits (69), Expect = 1.4
 Identities = 10/31 (32%), Positives = 19/31 (61%)

Query: 52  RKRKEKDKENAKQKGGKKEKSNEKGKNKKDK 82
           +K++E+ K   ++K  K++K  +K K K  K
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414


>gnl|CDD|240264 PTZ00094, PTZ00094, serine hydroxymethyltransferase; Provisional.
          Length = 452

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query: 125 LSRNGDLMVTGGTDGHLRLWSF 146
           L + G  +VTGGTD HL L   
Sbjct: 319 LEKRGYDLVTGGTDNHLVLVDL 340


>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
          Length = 586

 Score = 29.8 bits (67), Expect = 2.0
 Identities = 10/13 (76%), Positives = 11/13 (84%)

Query: 132 MVTGGTDGHLRLW 144
           +VTGGTD HL LW
Sbjct: 458 LVTGGTDNHLLLW 470


>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
          Length = 496

 Score = 29.5 bits (66), Expect = 2.1
 Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 39  KSKESNNTADTIVRKRKEKDKENAKQKGGKKEKSNEKGKNKKDKI----EDPPIL 89
           K   S   A T  R+ K+ D+E   +   + +++ E G++  D I    +DPP++
Sbjct: 74  KPTSSPPKAKTTRRRTKKTDQELDPEGAEEDQEAAEDGEDYDDGIDFPYDDPPLV 128


>gnl|CDD|199835 cd10277, PQQ_ADH_I, Ethanol dehydrogenase, a bacterial quinoprotein
           (PQQ-dependent type I alcohol dehydrogenase).  This
           bacterial family of homodimeric ethanol dehydrogenases
           utilize pyrroloquinoline quinone (PQQ) as a cofactor. It
           represents proteins whose expression may be induced by
           ethanol, and which are similar to quinoprotein methanol
           dehydrogenases, but have higher specificities for
           ethanol and other primary and secondary alcohols.
           Dehydrogenases with PQQ cofactors, such as ethanol,
           methanol, and membrane-bound glucose dehydrogenases,
           form an 8-bladed beta-propeller.
          Length = 529

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 15/62 (24%), Positives = 23/62 (37%), Gaps = 17/62 (27%)

Query: 125 LSRNGDLMVTGGTDGHLR---------LWSFPKMKPLLHIEAHSKEIDDVDFSISGEQIV 175
           L+  G L+ TG  DG+ R         LW F                  V + + G+Q V
Sbjct: 443 LTTAGGLVFTGTPDGYFRAFDAKTGKELWEFQT--------GSGIIGPPVTWEVDGKQYV 494

Query: 176 SI 177
           ++
Sbjct: 495 AV 496


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 28.6 bits (64), Expect = 2.8
 Identities = 13/43 (30%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 46  TADTIVRKRKEKDKENAKQKGGKKEKSNEKGKNKKDKIEDPPI 88
                 +K K+K  E+ K++  KK+K  +K K K+   E P +
Sbjct: 136 GLKGHEKKHKKKKHEDDKER--KKKKKEKKKKKKRHSPEHPGV 176


>gnl|CDD|222706 pfam14356, DUF4403, Domain of unknown function (DUF4403).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 455 and 518 amino
           acids in length. There is a single completely conserved
           residue W that may be functionally important.
          Length = 425

 Score = 29.1 bits (66), Expect = 2.9
 Identities = 18/76 (23%), Positives = 31/76 (40%), Gaps = 14/76 (18%)

Query: 95  KEKNLRFGFK-TFDSVQTVFAGSESLQRVVRLSRNGDLMVTGGTDGHLRLWSFPKMKPLL 153
           K   L  G K T  SV  V+   + L  V+       + V G  +G + L      +P+ 
Sbjct: 273 KTFPLSGGRKVTVKSVS-VYGSGDRL--VI------AVDVDGSLNGTVYLSG----RPVY 319

Query: 154 HIEAHSKEIDDVDFSI 169
                +  ++D+DF +
Sbjct: 320 DAATQTLRLEDLDFDV 335


>gnl|CDD|238625 cd01300, YtcJ_like, YtcJ_like metal dependent amidohydrolases. YtcJ
           is a Bacillus subtilis ORF of unknown function. The
           Arabidopsis homolog LAF3 has been identified as a factor
           required for photochrome A signalling.
          Length = 479

 Score = 28.8 bits (65), Expect = 3.7
 Identities = 11/15 (73%), Positives = 13/15 (86%), Gaps = 1/15 (6%)

Query: 265 ALAVRDDGRFVAVGT 279
           A+AVRD GR VAVG+
Sbjct: 1   AVAVRD-GRIVAVGS 14


>gnl|CDD|227371 COG5038, COG5038, Ca2+-dependent lipid-binding protein, contains C2
           domain [General function prediction only].
          Length = 1227

 Score = 29.0 bits (65), Expect = 3.9
 Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 5/92 (5%)

Query: 26  KITAEFTQMNGVVKSK-ESNNTADTIVRKRKEKDKENAKQKGGKKEKSNEKGKNKKDKIE 84
            +  +  + +   KS  +++NT  ++ R+  + D     Q GG + K     + +     
Sbjct: 78  TVPKQSIESSSPEKSDVDTSNTRPSVSRELHKDDYVGPDQDGGWQRKVELSSEQELSDFT 137

Query: 85  DPPILNSIDSKEKNLRFGFKTFDSVQTVFAGS 116
           D  IL +   K+    F    + SV  V  GS
Sbjct: 138 DMSILLNYILKK----FYGDWYQSVAIVLIGS 165


>gnl|CDD|223381 COG0304, FabB, 3-oxoacyl-(acyl-carrier-protein) synthase [Lipid
           metabolism / Secondary metabolites biosynthesis,
           transport, and catabolism].
          Length = 412

 Score = 28.4 bits (64), Expect = 5.4
 Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 11/56 (19%)

Query: 114 AGSESLQRVVRLSRNG--DLMVTGGTDGHLR---LWSFPKMKPLLHIEAHSKEIDD 164
            G+ ++   VRL R G  D+++ GG +  +    +  F  M+      A S   DD
Sbjct: 164 TGAHAIGDAVRLIRLGKADVVIAGGAEAAITPLGIAGFEAMR------ALSTRNDD 213


>gnl|CDD|224490 COG1574, COG1574, Predicted metal-dependent hydrolase with the
           TIM-barrel fold [General function prediction only].
          Length = 535

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 9/16 (56%), Positives = 13/16 (81%), Gaps = 1/16 (6%)

Query: 264 SALAVRDDGRFVAVGT 279
            A+A++D GR VAVG+
Sbjct: 25  EAVAIKD-GRIVAVGS 39


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 28.1 bits (63), Expect = 6.7
 Identities = 10/41 (24%), Positives = 16/41 (39%)

Query: 58 DKENAKQKGGKKEKSNEKGKNKKDKIEDPPILNSIDSKEKN 98
          + E  +QK   K+K  +K   K  K  D       +   +N
Sbjct: 7  ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISEN 47


>gnl|CDD|184161 PRK13580, PRK13580, serine hydroxymethyltransferase; Provisional.
          Length = 493

 Score = 28.1 bits (63), Expect = 7.0
 Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 8/32 (25%)

Query: 120 QRVV--------RLSRNGDLMVTGGTDGHLRL 143
           Q+VV           + G  +VTGGTD HL L
Sbjct: 342 QQVVDNARALAEGFLKRGARLVTGGTDNHLVL 373


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 27.8 bits (62), Expect = 7.1
 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 13/66 (19%)

Query: 40  SKESNNTADTIVRKRKEKDKENAKQKGGKKEKSNEKGKNKKDKIEDPPILNSIDSKEKNL 99
           S+E N  A T  +K K++DK N        +K +EK K KK + ++P         E   
Sbjct: 43  SEEENKVATTSTKKDKKEDKNNE------SKKKSEKKKKKKKEKKEP-------KSEGET 89

Query: 100 RFGFKT 105
           + GFKT
Sbjct: 90  KLGFKT 95


>gnl|CDD|220674 pfam10282, Lactonase, Lactonase, 7-bladed beta-propeller.  This
           entry contains bacterial 6-phosphogluconolactonases
           (6PGL)YbhE-type (EC:3.1.1.31) which hydrolyse
           6-phosphogluconolactone to 6-phosphogluconate. The entry
           also contains the fungal muconate lactonising enzyme
           carboxy-cis,cis-muconate cyclase (EC:5.5.1.5) and
           muconate cycloisomerase (EC:5.5.1.1), which convert
           cis,cis-muconates to muconolactones and vice versa as
           part of the microbial beta-ketoadipate pathway.
           Structures of proteins in this family have revealed a
           7-bladed beta-propeller fold.
          Length = 344

 Score = 27.6 bits (62), Expect = 7.8
 Identities = 18/47 (38%), Positives = 23/47 (48%), Gaps = 3/47 (6%)

Query: 245 QWDVDSGRLRL---AREMKESLSALAVRDDGRFVAVGTMFTGSVFVY 288
           + D D+G+L L         S   L+V  DGRF+ V     GSV VY
Sbjct: 67  RIDPDTGKLTLLNQVPTGGASPCHLSVDPDGRFLFVANYGGGSVSVY 113


>gnl|CDD|185390 PRK15493, PRK15493, 5-methylthioadenosine/S-adenosylhomocysteine
           deaminase; Provisional.
          Length = 435

 Score = 27.7 bits (61), Expect = 7.9
 Identities = 14/50 (28%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 147 PKMKPLLHIEAHSKEIDDVDFSISGEQIVSIAKDGKAFVWNSKNGSLSKE 196
           P  KP  H++   + +  + ++ SG+ I  +  +GK  VWN +  +L +E
Sbjct: 373 PSNKP--HLQPADEVLSHLVYAASGKDISDVIINGKRVVWNGECKTLDEE 420


>gnl|CDD|217451 pfam03249, TSA, Type specific antigen.  There are several antigenic
           variants in Rickettsia tsutsugamushi, and a
           type-specific antigen (TSA) of 56-kilodaltons located on
           the rickettsial surface is responsible for the
           variation. TSA proteins are probably integral membrane
           proteins.
          Length = 502

 Score = 27.9 bits (62), Expect = 8.1
 Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 2/69 (2%)

Query: 12  IYVVAGQENSNQLYKITAEFTQMNGVVKSKESNNTADTIVRKRKEKDKENAKQKGGKKEK 71
           + ++ G E   QLYK   +  +  G+ K+ E    A         +   + K+K G  EK
Sbjct: 334 VRLLNGNEQIIQLYKDLVKLQRHAGIKKAME--KLAAQQEEDAGNQGGGDCKKKQGASEK 391

Query: 72  SNEKGKNKK 80
           S E GK K+
Sbjct: 392 SKEGGKGKE 400


>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
           (DUF1764).  This is a family of eukaryotic proteins of
           unknown function. This family contains many hypothetical
           proteins.
          Length = 98

 Score = 26.3 bits (58), Expect = 8.9
 Identities = 14/67 (20%), Positives = 30/67 (44%), Gaps = 9/67 (13%)

Query: 38  VKSKESNNTA----DTIVRKRKEKDKENAKQKGGKKEKSNEKGKNKKDKIEDPPILNSID 93
            + K++  T     D I    K++ K+  +     + K+ +KG+ K  K ++ P     +
Sbjct: 1   KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFP-----E 55

Query: 94  SKEKNLR 100
             E++ R
Sbjct: 56  FPEESKR 62


>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
          unknown].
          Length = 523

 Score = 27.5 bits (61), Expect = 9.1
 Identities = 15/63 (23%), Positives = 25/63 (39%), Gaps = 5/63 (7%)

Query: 37 VVKSKESNNTADTIVRK---RKEKDKENAKQKGGKKEKSNEKG--KNKKDKIEDPPILNS 91
            K    N+  +TI R+    + K     K +    E   E G   N K+K +   + + 
Sbjct: 7  ATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSD 66

Query: 92 IDS 94
          +DS
Sbjct: 67 VDS 69


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.132    0.376 

Gapped
Lambda     K      H
   0.267   0.0710    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,779,740
Number of extensions: 1404779
Number of successful extensions: 2074
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1949
Number of HSP's successfully gapped: 102
Length of query: 297
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 201
Effective length of database: 6,679,618
Effective search space: 1342603218
Effective search space used: 1342603218
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.5 bits)