RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13952
         (445 letters)



>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score =  116 bits (294), Expect = 4e-31
 Identities = 48/87 (55%), Positives = 58/87 (66%), Gaps = 2/87 (2%)

Query: 31  SMMTEMLKGNLTNVLPMIVIGGWINWMFSGFVTTKVPFPLTLRFKPMLQRGIELLSLDPA 90
            MM +M+K N   ++P I+I GWIN  FSGFV  K+PFPLT RFK MLQRG++   LD  
Sbjct: 84  EMMMDMMKPNFAIMIPQILIFGWINSFFSGFVVAKLPFPLTGRFKGMLQRGVD--GLDVT 141

Query: 91  WVSSASWYFLNVFGLRSIYALVLGENN 117
            VSS SWYFL  FGLR +   +LG  N
Sbjct: 142 DVSSISWYFLCSFGLRQVIQKLLGFAN 168



 Score =  100 bits (250), Expect = 7e-25
 Identities = 53/175 (30%), Positives = 71/175 (40%), Gaps = 61/175 (34%)

Query: 218 RFKPMLQRGKYLTRSAFQMRRYYFNDEEVGYFLTQKRTAPASNPMTDPSMMTEMLKGNLT 277
           R + + + G  L+   F+ R+    +++                     MM +M+K N  
Sbjct: 55  RARELRKNGDKLSPKKFEKRQEELMEDQKE-------------------MMMDMMKPNFA 95

Query: 278 NVLPMIVIGGWINWMFSGFVTTKVPFPLTLRFKPMLQRGWINWMFSGFVTTKVPFPLTLR 337
            ++P I+I GWIN                               FSGFV  K+PFPLT R
Sbjct: 96  IMIPQILIFGWINSF-----------------------------FSGFVVAKLPFPLTGR 126

Query: 338 FKPMLQRDILGNATFSPTGIELLSLDPAWVSSASWYFLNVFGLRSIYALVLGENN 392
           FK MLQR + G             LD   VSS SWYFL  FGLR +   +LG  N
Sbjct: 127 FKGMLQRGVDG-------------LDVTDVSSISWYFLCSFGLRQVIQKLLGFAN 168



 Score = 78.8 bits (195), Expect = 2e-17
 Identities = 34/89 (38%), Positives = 47/89 (52%), Gaps = 6/89 (6%)

Query: 141 KAAFKAEWEALEIYQHQSALEGLAA---EMIEVDMTDPSMMTEMLKGNLTNVLPMIVIGG 197
           +   +     L     + + +       E++E       MM +M+K N   ++P I+I G
Sbjct: 49  EKEIQKRARELRKNGDKLSPKKFEKRQEELMEDQK---EMMMDMMKPNFAIMIPQILIFG 105

Query: 198 WINWMFSGFVTTKVPFPLTLRFKPMLQRG 226
           WIN  FSGFV  K+PFPLT RFK MLQRG
Sbjct: 106 WINSFFSGFVVAKLPFPLTGRFKGMLQRG 134


>gnl|CDD|222064 pfam13351, DUF4099, Protein of unknown function (DUF4099).  A
           family of uncharacterized proteins found by clustering
           human gut metagenomic sequences. The C-terminal repeat
           region of this family is DUF4098, pfam13345.
          Length = 84

 Score = 31.3 bits (72), Expect = 0.17
 Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 12/51 (23%)

Query: 148 WEALEIYQHQSALEGLAAEMIEVDMTDPSMMTEMLKGNLTNVLPM-IVIGG 197
           WE LE         G+  E +E     P  + ++L+G  T ++P+ I I G
Sbjct: 8   WETLEKL-------GITKEKLE----KPGDLDKLLQGEKTPLVPISINIEG 47


>gnl|CDD|222152 pfam13468, Glyoxalase_3, Glyoxalase-like domain.  This domain is
           related to the Glyoxalase domain pfam00903.
          Length = 174

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 15/100 (15%), Positives = 31/100 (31%), Gaps = 6/100 (6%)

Query: 82  IELLSLDPAWVSSASWYFLNVFGLRS---IYALVLGENNAADQSRQFQDQMSGAAVQMPV 138
           +ELL++DP   +     +  +  L     +    L  ++    + + +    G      V
Sbjct: 49  LELLAIDPEAPAPDGGRWFGLDRLAGGPGLSGWALRTDDIDAVAARLRAA--GLDFGGRV 106

Query: 139 DPKAAFKAEWEALEIYQHQSALEGLAAEMIEVDMTDPSMM 178
            P       W             GL   +I+   + P+  
Sbjct: 107 RP-DGGDLRWRLAVPADGDLPAGGLLPFLIQWGTSHPAAR 145


>gnl|CDD|163619 cd00844, MPP_Dbr1_N, Dbr1 RNA lariat debranching enzyme, N-terminal
           metallophosphatase domain.  Dbr1 is an RNA lariat
           debranching enzyme that hydrolyzes 2'-5' phosphodiester
           bonds at the branch points of excised intron lariats.
           This alignment model represents the N-terminal
           metallophosphatase domain of Dbr1.  This domain belongs
           to the metallophosphatase (MPP) superfamily.  MPPs are
           functionally diverse, but all share a conserved domain
           with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets. This domain is thought to allow for
           productive metal coordination.
          Length = 262

 Score = 29.1 bits (66), Expect = 3.7
 Identities = 15/36 (41%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 336 LRFKPMLQRDILGNATFSPTGIELL-SLDPAWVSSA 370
           LR KP  ++DI      SP   ELL  L P +  SA
Sbjct: 188 LRKKPFFRQDIESGTLGSPAAEELLKHLKPRYWFSA 223


>gnl|CDD|130852 TIGR01792, urease_alph, urease, alpha subunit.  This model
           describes the urease alpha subunit UreC (designated beta
           or B chain, UreB in Helicobacter species). Accessory
           proteins for incorporation of the nickel cofactor are
           usually found in addition to the urease alpha, beta, and
           gamma subunits. The trusted cutoff is set above the
           scores of many reported fragments and of a putative
           second urease alpha chain in Streptomyces coelicolor
           [Central intermediary metabolism, Nitrogen metabolism].
          Length = 567

 Score = 29.0 bits (65), Expect = 6.6
 Identities = 17/71 (23%), Positives = 28/71 (39%), Gaps = 2/71 (2%)

Query: 192 MIVIGGWINWMFSGFVTTKVPFPLTLRFKPML-QRGKYLTRSAFQMRRYYFNDEEVG-YF 249
           M++ GG I W   G     +P P  + ++PM    G+ L  ++         D+ +    
Sbjct: 446 MVLKGGLIAWAIMGDPNASIPTPQPVLYRPMFGAYGRALQSTSITFVSQAAYDKGIKERL 505

Query: 250 LTQKRTAPASN 260
             QK   P  N
Sbjct: 506 GLQKLLLPVHN 516



 Score = 28.6 bits (64), Expect = 7.0
 Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 47  MIVIGGWINWMFSGFVTTKVPFPLTLRFKPML-QRGIELLSLDPAWVSSASW 97
           M++ GG I W   G     +P P  + ++PM    G  L S    +VS A++
Sbjct: 446 MVLKGGLIAWAIMGDPNASIPTPQPVLYRPMFGAYGRALQSTSITFVSQAAY 497


>gnl|CDD|164753 MTH00207, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 572

 Score = 28.6 bits (64), Expect = 7.3
 Identities = 20/73 (27%), Positives = 28/73 (38%), Gaps = 8/73 (10%)

Query: 263 TDPSMMTEMLKGNLTNVLPMIVIGGWINWMFSGFVTTKVPFPLTLRFKP---MLQRGWIN 319
            D S+ T ML   LT  L  I  G  +NW         +  PL  +  P   +     + 
Sbjct: 415 EDKSLTTPML--LLT--LGAITSGSALNWWLPTPQEEPI-LPLQYKLLPLLVVTLGALLA 469

Query: 320 WMFSGFVTTKVPF 332
           W+F+    TK P 
Sbjct: 470 WLFTTPNITKTPL 482


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.136    0.423 

Gapped
Lambda     K      H
   0.267   0.0753    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 23,346,790
Number of extensions: 2261934
Number of successful extensions: 2042
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2039
Number of HSP's successfully gapped: 15
Length of query: 445
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 345
Effective length of database: 6,502,202
Effective search space: 2243259690
Effective search space used: 2243259690
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.2 bits)