RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13425
         (224 letters)



>gnl|CDD|216389 pfam01251, Ribosomal_S7e, Ribosomal protein S7e. 
          Length = 189

 Score =  271 bits (696), Expect = 1e-93
 Identities = 126/220 (57%), Positives = 157/220 (71%), Gaps = 34/220 (15%)

Query: 6   SKIIKKGGE-PDAFENSISQALLELEMNS-DLKAQLRELYITRAKEIDLGGKKSVIVYVP 63
           +KIIK  G  P   E S++QAL +LE NS DLKA+LR+L IT AKEI++GG         
Sbjct: 1   NKIIKPKGSKPSELELSVAQALFDLESNSPDLKAELRDLQITSAKEIEVGG--------- 51

Query: 64  MPQLKQFQKIQPRLVPKEIDLGGKKSVIVYVPMPQLKQFQKIQPRLVRELEKKFSGKHVV 123
                                 GKK+V+++VP+PQLK ++KIQ RLVRELEKKFSGKHVV
Sbjct: 52  ----------------------GKKAVVIFVPVPQLKAYRKIQQRLVRELEKKFSGKHVV 89

Query: 124 FIGDRKILPKPTRKTRAKTKQKRPRSRTLTSVYDCILEDIVFPAEIVGKRIKVKLDGKQL 183
           F+  R+ILPKP RK+R + KQKRPRSRTLT+V+D ILED+V+PAEIVGKRI+ +LDG +L
Sbjct: 90  FVAQRRILPKPKRKSRYQ-KQKRPRSRTLTAVHDAILEDLVYPAEIVGKRIRYRLDGSKL 148

Query: 184 IKVHLDKNQQTNIEHKTDTFSQIYKKLTGREVNFEFPEPY 223
           IKV LD   QTN+EHK D+FS +YKKLTG++V FEFPE  
Sbjct: 149 IKVFLDSKDQTNVEHKLDSFSAVYKKLTGKDVVFEFPETN 188


>gnl|CDD|185592 PTZ00389, PTZ00389, 40S ribosomal protein S7; Provisional.
          Length = 184

 Score =  191 bits (487), Expect = 5e-62
 Identities = 93/216 (43%), Positives = 127/216 (58%), Gaps = 35/216 (16%)

Query: 7   KIIKKGGEPDAFENSISQALLELEMNS-DLKAQLRELYITRAKEIDLGGKKSVIVYVPMP 65
               K  EP   E  +++AL ELE +S DLKA L+ L I+  KE+ +G  K         
Sbjct: 1   IKKLKKAEPSELEKQVAKALFELEASSKDLKADLKNLKISSVKEVTVGKDK--------- 51

Query: 66  QLKQFQKIQPRLVPKEIDLGGKKSVIVYVPMPQLKQFQKIQPRLVRELEKKFSGKHVVFI 125
                                KK+V+V+VP   L  ++KIQ +L+ ELEKK   KHVV +
Sbjct: 52  ---------------------KKAVVVFVPYRMLMIYRKIQRKLIPELEKKL-KKHVVIV 89

Query: 126 GDRKILPKPTRKTRAKTKQKRPRSRTLTSVYDCILEDIVFPAEIVGKRIKVKLDGKQLIK 185
             R IL KP +  +    + RPRSRTLT+V++ ILED+V+P+EIVGKR +V++DG +L+K
Sbjct: 90  AQRTILKKPVKNYK---LKTRPRSRTLTAVHEAILEDLVYPSEIVGKRTRVRVDGSKLLK 146

Query: 186 VHLDKNQQTNIEHKTDTFSQIYKKLTGREVNFEFPE 221
           V LD   + N+E K D FS +YKKLTGR+V FEFP 
Sbjct: 147 VFLDPKDRKNVEEKLDAFSAVYKKLTGRDVVFEFPW 182


>gnl|CDD|213315 cd05968, AACS_like, Uncharacterized acyl-CoA synthetase subfamily
           similar to Acetoacetyl-CoA synthetase.  This
           uncharacterized acyl-CoA synthetase family is highly
           homologous to acetoacetyl-CoA synthetase. However, the
           proteins in this family exist in only bacteria and
           archaea. AACS is a cytosolic ligase that specifically
           activates acetoacetate to its coenzyme A ester by a
           two-step reaction. Acetoacetate first reacts with ATP to
           form an acyl-adenylate intermediate, which then reacts
           with CoA to produce an acyl-CoA ester. This is the first
           step of the mevalonate pathway of isoprenoid
           biosynthesis via isopentenyl diphosphate. Isoprenoids
           are a large class of compounds found in all living
           organisms.
          Length = 474

 Score = 30.3 bits (69), Expect = 0.72
 Identities = 18/63 (28%), Positives = 25/63 (39%), Gaps = 10/63 (15%)

Query: 78  VPKEIDLGGKKSVIVYVPMPQLKQFQKIQPRL----VRELEKKFSGKHVVFIGDRKILPK 133
           VP  +   G+  V   V  P +     ++  L       L K F  + V F+ D   LPK
Sbjct: 391 VPDPVK--GEAIVCFVVLKPGVTPSAALEAELADHVGDALGKPFRPREVHFVSD---LPK 445

Query: 134 PTR 136
            TR
Sbjct: 446 -TR 447


>gnl|CDD|172682 PRK14194, PRK14194, bifunctional 5,10-methylene-tetrahydrofolate
           dehydrogenase/ 5,10-methylene-tetrahydrofolate
           cyclohydrolase; Provisional.
          Length = 301

 Score = 28.7 bits (64), Expect = 2.2
 Identities = 20/69 (28%), Positives = 28/69 (40%), Gaps = 8/69 (11%)

Query: 108 RLVRELEKKFSGKHVVFIGDRKILPKPTR----KTRAKTKQKRPRSRTLTSVYDCILEDI 163
           RL+ +     +GKH V IG   I+ KP      +          RS    ++  C   DI
Sbjct: 148 RLLEDTCGDLTGKHAVVIGRSNIVGKPMAALLLQAHCSVTVVHSRSTDAKAL--CRQADI 205

Query: 164 VFPAEIVGK 172
           V  A  VG+
Sbjct: 206 VVAA--VGR 212


>gnl|CDD|212511 cd09015, Ureohydrolase, Ureohydrolase superfamily includes
           arginase, formiminoglutamase, agmatinase and
           proclavaminate amidinohydrolase (PAH).  This family,
           also known as arginase-like amidino hydrolase family,
           includes Mn-dependent enzymes: arginase (Arg, EC
           3.5.3.1), formimidoylglutamase (HutG, EC 3.5.3.8 ),
           agmatinase (SpeB, EC 3.5.3.11), guanidinobutyrase (Gbh,
           EC=3.5.3.7), proclavaminate amidinohydrolase (PAH, EC
           3.5.3.22) and related proteins. These enzymes catalyze
           hydrolysis of amide bond. They are involved in control
           of cellular levels of arginine and ornithine (both
           involved in protein biosynthesis, and production of
           creatine, polyamines, proline and nitric acid), in
           histidine and arginine degradation, and in clavulanic
           acid biosynthesis.
          Length = 270

 Score = 28.5 bits (64), Expect = 2.3
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query: 114 EKKFSGKHVVFIGDRKILPKPTRKTRAKTKQ 144
           E + S KH+V IG R + P P     A+   
Sbjct: 138 ELQQSPKHIVCIGVRGLDPGPALFEYARKLG 168


>gnl|CDD|191413 pfam05970, PIF1, PIF1-like helicase.  This family includes
           homologues of the PIF1 helicase, which inhibits
           telomerase activity and is cell cycle regulated. This
           family includes a large number of largely
           uncharacterized plant proteins. This family includes a
           P-loop motif that is involved in nucleotide binding.
          Length = 364

 Score = 28.9 bits (65), Expect = 2.5
 Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 12/78 (15%)

Query: 72  KIQPRLVPKEIDLGGKKSVIVY--VPMPQLKQFQKIQPRLVR------ELEKKFSGKHVV 123
           KI+     K  +L    S+I++   PM     F+ +  R +R       ++K F GK VV
Sbjct: 89  KIKRG--SKLAELLENTSLIIWDEAPMVNRHCFEALD-RTLRDILKETNIDKPFGGKTVV 145

Query: 124 FIGD-RKILPKPTRKTRA 140
             GD R+ILP   + +RA
Sbjct: 146 LGGDFRQILPVIPKGSRA 163


>gnl|CDD|172659 PRK14171, PRK14171, bifunctional 5,10-methylene-tetrahydrofolate
           dehydrogenase/ 5,10-methylene-tetrahydrofolate
           cyclohydrolase; Provisional.
          Length = 288

 Score = 28.4 bits (63), Expect = 3.0
 Identities = 30/101 (29%), Positives = 48/101 (47%), Gaps = 15/101 (14%)

Query: 44  ITRAKEIDLGGKKS-VIVYVPMPQLKQFQKIQPRLVP-KEID------LGGKKSVIV--Y 93
           I++  E++L  + S +IV +P+P      KI   + P K+ID      +G   S I   +
Sbjct: 79  ISKINELNLDNEISGIIVQLPLPSSIDKNKILSAVSPSKDIDGFHPLNVGYLHSGISQGF 138

Query: 94  VPMPQLKQFQKIQPRLVRELEKKFSGKHVVFIGDRKILPKP 134
           +P   L     I     ++ E   +GK+VV IG   I+ KP
Sbjct: 139 IPCTALGCLAVI-----KKYEPNLTGKNVVIIGRSNIVGKP 174


>gnl|CDD|221413 pfam12080, GldM_C, GldM C-terminal domain.  This domain is found in
           bacteria at the C-terminus of the GldM protein. This
           domain is typically between 169 to 182 amino acids in
           length. This domain has two completely conserved
           residues (Y and N) that may be functionally important.
           GldM, is named for the member from Cytophaga johnsonae
           (Flavobacterium johnsoniae), which is required for a
           type of rapid gliding motility found in certain members
           of the Bacteriodetes.
          Length = 175

 Score = 27.7 bits (62), Expect = 3.8
 Identities = 17/60 (28%), Positives = 22/60 (36%), Gaps = 2/60 (3%)

Query: 129 KILPKPTRKTRAKTKQKRPRSRTLTS--VYDCILEDIVFPAEIVGKRIKVKLDGKQLIKV 186
           K LP PT   R +    +   R L      D   ED  F A       +VK+  +  I V
Sbjct: 83  KALPDPTGTVRGEKGGAQMPKRNLEVAAKIDASFEDFPFDANYEVTGFEVKVPRQPTIVV 142


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 27.7 bits (62), Expect = 4.5
 Identities = 19/61 (31%), Positives = 27/61 (44%)

Query: 31 MNSDLKAQLRELYITRAKEIDLGGKKSVIVYVPMPQLKQFQKIQPRLVPKEIDLGGKKSV 90
          M    K   ++ Y  +  E+    KK +IV V     KQ Q I+  L  K   L GK ++
Sbjct: 1  MAKPEKKAKKKAYFEKLYELLEKYKKILIVSVDNVGSKQMQDIRRSLRGKAELLMGKNTL 60

Query: 91 I 91
          I
Sbjct: 61 I 61


>gnl|CDD|184561 PRK14192, PRK14192, bifunctional 5,10-methylene-tetrahydrofolate
           dehydrogenase/ 5,10-methylene-tetrahydrofolate
           cyclohydrolase; Provisional.
          Length = 283

 Score = 27.9 bits (62), Expect = 4.6
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 108 RLVRELEKKFSGKHVVFIGDRKILPKP 134
           RL++    + +GKH V +G   IL KP
Sbjct: 148 RLLKAYNIELAGKHAVVVGRSAILGKP 174


>gnl|CDD|235701 PRK06116, PRK06116, glutathione reductase; Validated.
          Length = 450

 Score = 27.8 bits (63), Expect = 5.6
 Identities = 10/23 (43%), Positives = 13/23 (56%), Gaps = 2/23 (8%)

Query: 176 VKLDGKQLIKVHLDKNQQTNIEH 198
           VKL+ K  I V  D+ Q TN+  
Sbjct: 277 VKLNEKGYIIV--DEYQNTNVPG 297


>gnl|CDD|220272 pfam09508, Lact_bio_phlase, Lacto-N-biose phosphorylase.  The gene
           which codes for this protein in gut-bacteria is located
           in a novel putative operon for galactose metabolism. The
           protein appears to be a carbohydrate-processing
           phosphorolytic enzyme (EC:2.4.1.211), unlike either
           glycoside hydrolases or glycoside lyase. Intestinal
           colonisation by bifidobacteria is important for human
           health, especially in pediatrics, because colonisation
           seems to prevent infection by some pathogenic bacteria
           that cause diarrhoea or other illnesses. The operon
           seems to be involved in intestinal colonisation by
           bifidobacteria mediated by metabolism of mucin sugars.
           In addition, it may also resolve the question of the
           nature of the bifidus factor in human milk as the
           lacto-N-biose structure found in milk oligosaccharides.
          Length = 716

 Score = 27.8 bits (62), Expect = 5.6
 Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 62  VPMPQLKQFQK-IQPRLVPKEIDLGGKKSVIVYVPMPQLKQFQKIQPRLVRELEKKF--- 117
           V    L+QF+K    +L P++    G  +    VP  Q + + + Q R V +  K+    
Sbjct: 236 VSPYALEQFEKEYGYKLRPEDFIDQGYYNSTYRVPSKQFRDWMEFQQRFVVKFAKELVDI 295

Query: 118 ---SGKH-VVFIGDRKILPKP 134
              +GK  ++F+GD  I  +P
Sbjct: 296 VHEAGKEAMMFLGDHWIGTEP 316


>gnl|CDD|113491 pfam04723, GRDA, Glycine reductase complex selenoprotein A.  Found
           in clostridia, this protein contains one active site
           selenocysteine and catalyzes the reductive deamination
           of glycine, which is coupled to the esterification of
           orthophosphate resulting in the formation of ATP. A
           member of this family may also exist in Treponema
           denticola.
          Length = 150

 Score = 26.8 bits (59), Expect = 5.7
 Identities = 10/18 (55%), Positives = 12/18 (66%)

Query: 117 FSGKHVVFIGDRKILPKP 134
           F GK V+ IGDR  +P P
Sbjct: 3   FQGKKVIIIGDRDGIPGP 20


>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
           kinases [Cell motility and secretion / Signal
           transduction mechanisms].
          Length = 716

 Score = 27.7 bits (62), Expect = 5.8
 Identities = 18/45 (40%), Positives = 24/45 (53%), Gaps = 11/45 (24%)

Query: 94  VPMPQLKQFQKIQPRLVRELEKKFSGKHV--VFIG-----DRKIL 131
           VP  Q+  F +  PR+VR+L +K  GK V  V  G     D+ IL
Sbjct: 390 VPFEQV--FSR-FPRMVRDLARKL-GKQVELVIEGEDTELDKSIL 430


>gnl|CDD|221057 pfam11279, DUF3080, Protein of unknown function (DUF3080).  Some
           members in this family of proteins are annotated as
           lipoproteins however this cannot be confirmed. Currently
           this family has no known function.
          Length = 315

 Score = 27.3 bits (61), Expect = 7.9
 Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 18  FENSISQALLELEMNSDLKAQLRELYIT 45
           F  +++  L  L ++ +LKA+L E    
Sbjct: 84  FIRALNACLASLPLSEELKAELTEALQQ 111


>gnl|CDD|222410 pfam13839, PC-Esterase, GDSL/SGNH-like Acyl-Esterase family found
           in Pmr5 and Cas1p.  The PC-Esterase family is comprised
           of Cas1p, the Homo sapiens C7orf58, Arabidopsis thaliana
           PMR5 and a group of plant freezing
           resistance/coldacclimatization proteins typified by
           Arabidopsis thaliana ESKIMO1, animal FAM55D proteins,
           and animal FAM113 proteins. The PC-Esterase family has
           features that are both similar and different from the
           canonical GDSL/SGNH superfamily. The members of this
           family are predicted to have Acyl esterase activity and
           predicted to modify cell-surface biopolymers such as
           glycans and glycoproteins. The Cas1p protein has a
           Cas1_AcylT domain, in addition, with the opposing
           acyltransferase activity. The C7orf58 family has a
           ATP-Grasp domain fused to the PC-Esterase and is the
           first identified secreted tubulin-tyrosine ligase like
           enzyme in eukaryotes. The plant family with PMR5, ESK1,
           TBL3 etc have a N-terminal C rich potential sugar
           binding domain followed by the PC-Esterase domain.
          Length = 270

 Score = 26.6 bits (59), Expect = 9.8
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query: 110 VRELEKKFSGKHVVFIGD 127
            ++  ++  GK VVF+GD
Sbjct: 6   AQQFLERLRGKRVVFVGD 23


>gnl|CDD|183931 PRK13265, PRK13265, glycine/sarcosine/betaine reductase complex
           protein A; Reviewed.
          Length = 154

 Score = 26.2 bits (58), Expect = 9.8
 Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 116 KFSGKHVVFIGDRKILPKPTRKTRAKT 142
              GK V+ IGDR  +P P  +   KT
Sbjct: 3   LLEGKKVIIIGDRDGIPGPAIEECVKT 29


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.138    0.383 

Gapped
Lambda     K      H
   0.267   0.0682    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,487,848
Number of extensions: 1109039
Number of successful extensions: 1211
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1202
Number of HSP's successfully gapped: 39
Length of query: 224
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 131
Effective length of database: 6,812,680
Effective search space: 892461080
Effective search space used: 892461080
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (25.8 bits)