RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14347
         (316 letters)



>gnl|CDD|216686 pfam01762, Galactosyl_T, Galactosyltransferase.  This family
           includes the galactosyltransferases
           UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-
           galactosyltransferase and UDP-Gal:beta-GlcNAc beta
           1,3-galactosyltranferase. Specific
           galactosyltransferases transfer galactose to GlcNAc
           terminal chains in the synthesis of the lacto-series
           oligosaccharides types 1 and 2.
          Length = 196

 Score =  142 bits (360), Expect = 2e-41
 Identities = 56/217 (25%), Positives = 91/217 (41%), Gaps = 24/217 (11%)

Query: 59  ERRLAIRQSWGYEKRFSDVPIVTVFILGYDPDNEG-LQIEIAEESERYNDIVQAKFIDSY 117
            RR AIR++W  +       I ++F++G   D +G ++  + EE++ Y DIV   F D+Y
Sbjct: 1   ARRNAIRKTWMNKGNSEGGRIKSLFLVGLSADTDGKVKDLVMEEAKLYGDIVVVDFEDTY 60

Query: 118 FNNTIKTMMGFKWAANYCKHSKFYFFADDDFYVSTRNVLRFLRNPLQYPQYLELPIETIQ 177
            N T KT+ G  +A +    +K+    DDD Y     +L  L      P         ++
Sbjct: 61  ENLTFKTLTGLLYAVSKAPSAKYIGKIDDDVYFFPDKLLSLLDRGNINPSESSFYGYVMK 120

Query: 178 SKSNIMDYELPSDVKLFSGFVFVSSPHRHYTSKWYISLQEYPYHLWPPYVTAGSYVVSRE 237
               I                      R+  SKWY+   +YP   +PPY +   Y++SR+
Sbjct: 121 EGPVI----------------------RNKKSKWYVPPSDYPCSRYPPYASGPFYLLSRD 158

Query: 238 VLLDFYFASHFTKHFRFDDIYL-GILAKKTNTEPFHC 273
                  AS   +  + +D+Y+ GILA          
Sbjct: 159 AAELLLKASKHRRFLQIEDVYVTGILADDLGISRVDL 195


>gnl|CDD|215596 PLN03133, PLN03133, beta-1,3-galactosyltransferase; Provisional.
          Length = 636

 Score = 68.7 bits (168), Expect = 6e-13
 Identities = 77/281 (27%), Positives = 121/281 (43%), Gaps = 59/281 (20%)

Query: 51  VKSALRHFERRLAIRQSWGYEKRFSDVPIVTV---FILGYDPDNEGLQIEIAEESERYND 107
           V S   +F+RR+A+R++W    ++  V    V   F +G    N+ +  E+  E+  Y D
Sbjct: 391 VFSTANNFKRRMAVRRTW---MQYDAVRSGAVAVRFFVGLH-KNQMVNEELWNEARTYGD 446

Query: 108 IVQAKFIDSYFNNTIKT----MMGFKWAANYCKHSKFYFFADDDFYVSTRNVLRFLRNPL 163
           I    F+D Y   T KT    + G +  +     +K+    DDD +V    VL  L    
Sbjct: 447 IQLMPFVDYYSLITWKTLAICIFGTEVVS-----AKYVMKTDDDAFVRVDEVLASL---- 497

Query: 164 QYPQYLELPIETIQSKSNIMDYELPSDVKLFSGFVFVSSPHRHYTSKWYISLQEYPYHLW 223
                          ++N+      S   L+      S PHR+  SKWYIS +E+P   +
Sbjct: 498 --------------KRTNV------SHGLLYGLINSDSQPHRNPDSKWYISPEEWPEETY 537

Query: 224 PPYVTAGSYVVSREVLLDFY--FASHFTKHFRFDDIYLGIL---AKKTNTEPFHCGEFYF 278
           PP+     YVVSR++  + Y        K F+ +D+ +GI     KK   E         
Sbjct: 538 PPWAHGPGYVVSRDIAKEVYKRHKEGRLKMFKLEDVAMGIWIAEMKKEGLEV-------K 590

Query: 279 YKKDYSLHNYQ----YVIASHGYGNHDELLRVWNE-QRGIG 314
           Y+ D  ++N      YV+A H Y +  E+L +W + Q G  
Sbjct: 591 YENDGRIYNEGCKDGYVVA-H-YQSPREMLCLWQKLQEGKR 629


>gnl|CDD|224618 COG1704, LemA, Uncharacterized conserved protein [Function
           unknown].
          Length = 185

 Score = 31.5 bits (72), Expect = 0.39
 Identities = 13/56 (23%), Positives = 21/56 (37%), Gaps = 6/56 (10%)

Query: 91  NEGLQIEIAEESERYNDIVQAKFIDSYFNNTIKTMMGFKWAANYCKHSKFYFFADD 146
            EG +  IA     YN+ V+       +N  I++      A  +   +K  FF   
Sbjct: 127 LEGTENRIAVARRLYNEAVRD------YNVKIRSFPSNIVAKLFGFKAKPNFFQAK 176


>gnl|CDD|235050 PRK02546, PRK02546, NAD(P)H-quinone oxidoreductase subunit 4;
           Provisional.
          Length = 525

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 11/28 (39%), Positives = 13/28 (46%), Gaps = 5/28 (17%)

Query: 79  IVTVFILGYDPDNEGLQIEIAEESERYN 106
            V  F  GYD  N GLQ+      E Y+
Sbjct: 50  TVYAFYTGYDFSNPGLQL-----VESYS 72


>gnl|CDD|216679 pfam01751, Toprim, Toprim domain.  This is a conserved region from
           DNA primase. This corresponds to the Toprim domain
           common to DnaG primases, topoisomerases, OLD family
           nucleases and RecR proteins. Both DnaG motifs IV and V
           are present in the alignment, the DxD (V) motif may be
           involved in Mg2+ binding and mutations to the conserved
           glutamate (IV) completely abolish DnaG type primase
           activity. DNA primase EC:2.7.7.6 is a
           nucleotidyltransferase it synthesises the
           oligoribonucleotide primers required for DNA replication
           on the lagging strand of the replication fork; it can
           also prime the leading stand and has been implicated in
           cell division. This family also includes the atypical
           archaeal A subunit from type II DNA topoisomerases. Type
           II DNA topoisomerases catalyze the relaxation of DNA
           supercoiling by causing transient double strand breaks.
          Length = 86

 Score = 28.0 bits (63), Expect = 1.6
 Identities = 10/33 (30%), Positives = 15/33 (45%), Gaps = 2/33 (6%)

Query: 84  ILGYDPDNEGLQI--EIAEESERYNDIVQAKFI 114
           IL  DPD EG  I  ++ E  +    I + +  
Sbjct: 52  ILATDPDREGEAIALKLLELLKPLGPIGRVEIS 84


>gnl|CDD|182895 PRK11006, phoR, phosphate regulon sensor protein; Provisional.
          Length = 430

 Score = 29.6 bits (67), Expect = 2.1
 Identities = 13/26 (50%), Positives = 14/26 (53%)

Query: 50  IVKSALRHFERRLAIRQSWGYEKRFS 75
           IVK AL H + RL I    G   RFS
Sbjct: 393 IVKHALSHHDSRLEIESEVGKGTRFS 418


>gnl|CDD|150994 pfam10413, Rhodopsin_N, Amino terminal of the G-protein receptor
           rhodopsin.  Rhodopsin is the archetypal
           G-protein-coupled receptor. Such receptors participate
           in virtually all physiological processes, as signalling
           molecules. They utilise heterotrimeric guanosine
           triphosphate (GTP)-binding proteins to transduce
           extracellular signals to intracellular events. Rhodopsin
           is important because of the pivotal role it plays in
           visual signal transduction. Rhodopsin is a dimeric
           transmembrane protein and its intradiskal surface
           consists of this amino terminal domain and three loops
           connecting six of the seven transmembrane helices. The
           N-terminus is a compact domain of alpha-helical regions
           with breaks and bends at proline residues outside the
           membrane. The transmembrane part of rhodopsin is
           represented by 7tm_1 (pfam00001). The N-terminal domain
           is extracellular is and is necessary for successful
           dimerisation and molecular stability.
          Length = 36

 Score = 26.2 bits (58), Expect = 2.8
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 146 DDFYVSTRNVLRFLRNPLQYPQY 168
            +FYV   N    +R+P +YPQY
Sbjct: 6   PNFYVPMSNRTGVVRSPFEYPQY 28


>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
           (trigger factor) [Posttranslational modification,
           protein turnover, chaperones].
          Length = 441

 Score = 29.2 bits (66), Expect = 2.9
 Identities = 9/38 (23%), Positives = 15/38 (39%)

Query: 89  PDNEGLQIEIAEESERYNDIVQAKFIDSYFNNTIKTMM 126
              E ++ EI E + +Y      + I  Y+NN      
Sbjct: 368 VTEEEIKAEIEELARQYGGEQPEEVIKLYYNNQELLDA 405


>gnl|CDD|218536 pfam05282, AAR2, AAR2 protein.  This family consists of several
           eukaryotic AAR2-like proteins. The yeast protein AAR2 is
           involved in splicing pre-mRNA of the a1 cistron and
           other genes that are important for cell growth.
          Length = 339

 Score = 29.2 bits (66), Expect = 3.0
 Identities = 11/36 (30%), Positives = 15/36 (41%)

Query: 82  VFILGYDPDNEGLQIEIAEESERYNDIVQAKFIDSY 117
           V +L +D  +E L  E   E E Y +       D Y
Sbjct: 66  VLVLKWDSKDETLVPEEESEEEDYRERANLGEFDQY 101


>gnl|CDD|227475 COG5146, PanK, Pantothenate kinase, acetyl-CoA regulated [Coenzyme
           metabolism].
          Length = 342

 Score = 29.2 bits (65), Expect = 3.1
 Identities = 12/54 (22%), Positives = 22/54 (40%), Gaps = 1/54 (1%)

Query: 97  EIAEESERYNDIVQAKFIDSYFNNTIKTMMGFKWAANY-CKHSKFYFFADDDFY 149
           +IA    R  +     F  S+  N + TM+   +A     K +   +F + + Y
Sbjct: 267 QIAYLVAREFNTQNIYFGGSFHRNHLLTMVTLDYAILRWSKPTMNAYFLEHEGY 320


>gnl|CDD|112810 pfam04011, LemA, LemA family.  The members of this family are
           related to the LemA protein. LemA contains an amino
           terminal predicted transmembrane helix. It has been
           predicted that the small amino terminus is
           extracellular. The exact molecular function of this
           protein is uncertain.
          Length = 186

 Score = 28.1 bits (63), Expect = 4.8
 Identities = 11/40 (27%), Positives = 15/40 (37%), Gaps = 6/40 (15%)

Query: 92  EGLQIEIAEESERYNDIVQAKFIDSYFNNTIKTMMGFKWA 131
           EG +  IA     YN+ VQ       +N  I+       A
Sbjct: 128 EGTENRIAVARRDYNEAVQE------YNTKIRQFPSVIVA 161


>gnl|CDD|235846 PRK06654, fliL, flagellar basal body-associated protein FliL;
           Reviewed.
          Length = 181

 Score = 27.8 bits (62), Expect = 5.4
 Identities = 13/47 (27%), Positives = 19/47 (40%), Gaps = 8/47 (17%)

Query: 75  SDVPIVTVFI---LGYDPDNEGLQIEIAEESERYNDIVQAKFIDSYF 118
           +D P  T  +   LGY  +N+ +  E+     R  D      I  YF
Sbjct: 96  ADTPPKTFVVKLALGYAENNKNILNELGRRKVRLKD-----IIREYF 137


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.139    0.441 

Gapped
Lambda     K      H
   0.267   0.0668    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,628,177
Number of extensions: 1602479
Number of successful extensions: 1295
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1292
Number of HSP's successfully gapped: 23
Length of query: 316
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 219
Effective length of database: 6,635,264
Effective search space: 1453122816
Effective search space used: 1453122816
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 59 (26.6 bits)