RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3602
         (390 letters)



>gnl|CDD|216556 pfam01532, Glyco_hydro_47, Glycosyl hydrolase family 47.  Members
           of this family are alpha-mannosidases that catalyze the
           hydrolysis of the terminal 1,2-linked alpha-D-mannose
           residues in the oligo-mannose oligosaccharide
           Man(9)(GlcNAc)(2).
          Length = 445

 Score =  363 bits (935), Expect = e-123
 Identities = 141/421 (33%), Positives = 199/421 (47%), Gaps = 94/421 (22%)

Query: 37  MFYHAYRAYMDNAYPADELMPLSCTGRWRSQHTSRGDIDDTLGNFSLTLIDTLDTLAILG 96
            F HA+  Y   A+  DEL P+SC GR            D+ G +  TL+D+LDTL I+G
Sbjct: 1   AFLHAWDGYKKYAWGHDELRPISCRGR------------DSFGGWGATLVDSLDTLYIMG 48

Query: 97  DIEEFEHATKLISENVNFDSD-IVVSVFETNIRVLGGLLSGHVLAEYFKLNTDLMQWYTG 155
             +EFE A   + + ++FD D   VSVFET IR LGGLLS + L+               
Sbjct: 49  LTDEFEEAVDWVEKKLDFDKDSTDVSVFETTIRYLGGLLSAYDLS------------GDD 96

Query: 156 QLLELAKDIGFRLLPAFNTTTGIPHGRINLRHGLKSKGLEWSVDTCTACAGTMILEMAAL 215
            LLE A D+  RLLPAF+T TGIP+ R+NL  G  +        +  A AGT+ LE   L
Sbjct: 97  LLLEKAVDLADRLLPAFDTPTGIPYPRVNLGKGGVNP--VAGGASSLAEAGTLQLEFTRL 154

Query: 216 SRLSGLPIFEAKAHRAMDGLWAARNRDSD-LMGTVLNIDSGDWIRRDSGVGAGIDSYYEY 274
           S+L+G P +E  A R MD LW +++R    L+   ++  +G +   D  +GAG DSYYEY
Sbjct: 155 SQLTGDPKYEDAAERVMDALWKSQSRSLPGLVPIFIDPSTGQFTGGDITLGAGGDSYYEY 214

Query: 275 CLKAYVLLGQ--DKYLSRFDKHYSAVMKYISRGPL----LIDVHMHK--------PQMTS 320
            LK Y+LLG    +YL  +D+   A+ K++   P     L+ V            P+M  
Sbjct: 215 LLKQYILLGGKDPQYLDMYDEAMDAIKKHLLFRPSTPSDLLFVGELSSGGGGELSPKMDH 274

Query: 321 -RNFMDALLA-----------------------------FWPGL---------------- 334
              F   LLA                                GL                
Sbjct: 275 LVCFAGGLLALGAKLGLPDKGDLELAEELTDTCYKMYKSTPTGLGPEIFYFNPCPCWDED 334

Query: 335 ------QVHWGQHPLRPEFLESTYFLYKSTGDPHYLDVGKSVLRALQTHARVPCGYAAVR 388
                 ++    + LRPE +ES ++LY+ TGDP Y + G  + +A++ + R PCGYA ++
Sbjct: 335 KWDFYVKIADSHYLLRPETIESLFYLYRLTGDPKYREWGWEIFQAIEKYTRTPCGYAGIK 394

Query: 389 D 389
           D
Sbjct: 395 D 395


>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
           Provisional.
          Length = 522

 Score =  177 bits (450), Expect = 1e-50
 Identities = 114/431 (26%), Positives = 173/431 (40%), Gaps = 107/431 (24%)

Query: 31  REESRDMFYHAYRAYMDNAYPADELMPLSCTGR-WRSQHTSRGDIDDTLGNFSLTLIDTL 89
           RE  R+   HA+  Y + A+  DEL PL+     W                  LT+ID+L
Sbjct: 73  RESVREAMKHAWEGYKEYAWGHDELRPLTKRHHEW--------------FGLGLTIIDSL 118

Query: 90  DTLAILGDIEEFEHATKLISENVNFDSDI--VVSVFETNIRVLGGLLSGHVLAEYFKLNT 147
           DTL I+G  +E++     ++ N+    D    VSVFET IRVLGGLLS +       L  
Sbjct: 119 DTLKIMGLKKEYKEGRDWVANNLKQSKDTGLGVSVFETTIRVLGGLLSAY------DLTG 172

Query: 148 DLMQWYTGQLLELAKDIGFRLLPAFNTTTGIPHGRINLRHGLKSKGLEWSVDTCTAC-AG 206
           D M       LE A++I  RLLPAFN  TG P   INL  G +     W+         G
Sbjct: 173 DEM------YLEKAREIADRLLPAFNEDTGFPASEINLATG-RKSYPGWAGGCSILSEVG 225

Query: 207 TMILEMAALSRLSGLPIFEAKAHRAMDGLWAARNRDSDLMGTVLNIDSGDWIRRDSGVGA 266
           T+ LE   LS ++G P +     + MD L++ +   + L    LN D+G +      +GA
Sbjct: 226 TLQLEFNYLSEITGDPKYAEYVDKVMDALFSMKPAINGLYPIFLNPDAGRFCGNHISLGA 285

Query: 267 GIDSYYEYCLKAYVLL--GQDKYLSRFDKHYSAVMKYISR---------GPLLIDVHMHK 315
             DSYYEY LK ++     +++Y   F +    +++++ +           +      +K
Sbjct: 286 LGDSYYEYLLKQWLYTNGREERYRRLFVESAKGIIEHLYKRSPKGLTYIAEMDGGSLTNK 345

Query: 316 PQMTSRNFMDALLAFWPG------------------LQVHWG------------QHP--L 343
                   M+ L  F  G                    +  G              P  L
Sbjct: 346 --------MEHLACFAGGMFALGAAINITPDDEKSARYMEVGEEVTKTCYETYATSPTGL 397

Query: 344 RPEFL--------------ESTYFL-----------YKSTGDPHYLDVGKSVLRALQTHA 378
            PE                +S Y L           Y+ TGDP Y +    + +A++ H 
Sbjct: 398 GPEIFHFDPNSGDISPNVHDSHYILRPETVESIFILYRLTGDPKYREWAWKIFQAIEKHC 457

Query: 379 RVPCGYAAVRD 389
           +   GY+ +++
Sbjct: 458 KTENGYSGLKN 468


>gnl|CDD|184130 PRK13545, tagH, teichoic acids export protein ATP-binding subunit;
           Provisional.
          Length = 549

 Score = 32.6 bits (74), Expect = 0.41
 Identities = 13/32 (40%), Positives = 17/32 (53%), Gaps = 1/32 (3%)

Query: 9   ENYNHTSKSYTNGMSKEEKMTLREESRDMFYH 40
           ++Y+   K Y N MS EE+   REE    F H
Sbjct: 232 DHYDEFLKKY-NQMSVEERKDFREEQISQFQH 262


>gnl|CDD|221228 pfam11796, DUF3323, Protein of unknown function N-terminus
           (DUF3323).  Proteins in this entry are encoded within a
           conserved gene four-gene neighborhood found sporadically
           in a phylogenetically broad range of bacteria including:
           Nocardia farcinica, Symbiobacterium thermophilum, and
           Streptomyces avermitilis (Actinobacteria), Geobacillus
           kaustophilus (Firmicutes), Azoarcus sp. EbN1 and
           Ralstonia solanacearum (Beta-proteobacteria).
          Length = 215

 Score = 31.9 bits (73), Expect = 0.41
 Identities = 19/77 (24%), Positives = 28/77 (36%), Gaps = 11/77 (14%)

Query: 302 ISRGPLLIDVHMHKPQMTSRNFMDALLAFWPGLQVHWGQHPLRPEFLESTYFLYKSTGDP 361
           +  G  +  ++   P+  S      L A    L     +      F  +     + TGDP
Sbjct: 104 LRGGGTVRRLYGQDPEAASALLKQVLAALA-ALPQAGKEKERLAVF--AA----RVTGDP 156

Query: 362 HYLD----VGKSVLRAL 374
           H LD     G+ VL AL
Sbjct: 157 HALDAGTPAGRLVLAAL 173


>gnl|CDD|114308 pfam05578, Peptidase_S31, Pestivirus NS3 polyprotein peptidase S31.
            These serine peptidases are involved in processing of
           the flavivirus polyprotein.
          Length = 211

 Score = 31.9 bits (72), Expect = 0.43
 Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 200 TCTACAGT-MILEMAALSRLSGLPIFEAKAHRAMDGLWAARNRDS 243
           TC   +GT    ++  L   SGLPIFEA + R +  +   +N +S
Sbjct: 143 TCVTASGTPAFFDLKNLKGWSGLPIFEASSGRVVGRVKVGKNEES 187


>gnl|CDD|224688 COG1774, COG1774, Uncharacterized homolog of PSP1 [Function
           unknown].
          Length = 265

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 4/57 (7%)

Query: 98  IEEFEH---ATKLISENVNFDSDIVVSVFETNIRVLGGLLSGHVLAEYFKLNTDLMQ 151
           I +FE      KL+   +  D++  V +F    RV    L G  LA+ FK   +L Q
Sbjct: 89  IVKFEEHRLDMKLVDVEITLDNNKYVFLFTARGRVDFRELVG-DLAKIFKTRIELRQ 144


>gnl|CDD|236049 PRK07562, PRK07562, ribonucleotide-diphosphate reductase subunit
           alpha; Validated.
          Length = 1220

 Score = 29.9 bits (68), Expect = 3.2
 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 1/34 (2%)

Query: 75  DDTLGNF-SLTLIDTLDTLAILGDIEEFEHATKL 107
           DDT  N  SL L+   D      D+E +EHA +L
Sbjct: 483 DDTACNLASLNLLTFRDEDTKRFDVEAYEHACRL 516


>gnl|CDD|219654 pfam07944, DUF1680, Putative glycosyl hydrolase of unknown function
           (DUF1680).  The members of this family are sequences
           derived from hypothetical bacterial and eukaryotic
           proteins of unknown function. One members of this family
           is annotated as a possible arabinosidase, but no
           references were found to back this. These proteins are
           related to a large family of glycosyl hydrolases.
          Length = 511

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 14/33 (42%), Positives = 17/33 (51%), Gaps = 3/33 (9%)

Query: 339 GQHPLRPEF---LESTYFLYKSTGDPHYLDVGK 368
            Q  L PE     E+   LY+ TGD  YLD+ K
Sbjct: 171 MQKHLYPEHGGINEALVELYELTGDKRYLDLAK 203


>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein.  A novel
           genetic system characterized by six major proteins,
           included a ParB homolog and a ThiF homolog, is
           designated PRTRC, or ParB-Related,ThiF-Related Cassette.
           It is often found on plasmids. This protein family the
           member related to ParB, and is designated PRTRC system
           ParB family protein.
          Length = 554

 Score = 29.7 bits (67), Expect = 3.4
 Identities = 12/52 (23%), Positives = 24/52 (46%), Gaps = 3/52 (5%)

Query: 119 VVSVFETNIRVLGGLLSGHV--LAEYFKLNTDLMQWYTGQLLE-LAKDIGFR 167
             +     +  L  LL+     L  ++KLN + ++  T   ++ LA +IG +
Sbjct: 463 ASAAEGLEVHALKSLLNYLEVDLTRHWKLNKEFLELLTKSEIDALADEIGLK 514


>gnl|CDD|227154 COG4817, COG4817, DNA-binding ferritin-like protein (Dps family)
           [General function prediction only].
          Length = 111

 Score = 27.8 bits (62), Expect = 4.2
 Identities = 19/94 (20%), Positives = 31/94 (32%), Gaps = 11/94 (11%)

Query: 26  EKMTLREESRDMFYHAYRAYMDNAYPAD--ELMPLSCTGRWRSQHTSRGD---IDDTLGN 80
           +K+   EE R+      RA      P D            W+S  T   +   + + L  
Sbjct: 4   KKIIGLEEKREWKEFENRA---KKLPKDYYTAYKEIQKYLWKSGPTGWNEMKILGNILEL 60

Query: 81  FSLTLIDTLDTLAILG-DIEEFEHATKLISENVN 113
           F     D  +   +LG D+  F     L+ +   
Sbjct: 61  FEEAAADGKEVTDVLGDDVATF--CDALLGDYEK 92


>gnl|CDD|185707 cd07896, Adenylation_kDNA_ligase_like, Adenylation domain of kDNA
           ligases and similar proteins.  The mitochondrial DNA of
           parasitic protozoans is highly unusual. It is termed the
           kinetoplast DNA (kDNA) and consists of circular DNA
           molecules (maxicircles) and several thousand smaller
           circular molecules (minicircles). This group is composed
           of kDNA ligase, Chlorella virus DNA ligase, and similar
           proteins. kDNA ligase and Chlorella virus DNA ligase are
           the smallest known ATP-dependent ligases. They are
           involved in DNA replication or repair. ATP-dependent
           polynucleotide ligases catalyze phosphodiester bond
           formation using nicked nucleic acid substrates with the
           high energy nucleotide of ATP as a cofactor in a three
           step reaction mechanism. They have a highly modular
           architecture consisting of a unique arrangement of two
           or more discrete domains. The adenylation and the
           C-terminal oligonucleotide/oligosaccharide binding
           (OB)-fold domains comprise a catalytic core unit that is
           common to most members of the ATP-dependent DNA ligase
           family, including this group. The adenylation domain
           binds ATP and contains many of the active-site residues.
          Length = 174

 Score = 28.3 bits (64), Expect = 5.2
 Identities = 18/52 (34%), Positives = 20/52 (38%), Gaps = 10/52 (19%)

Query: 215 LSRLSGLPI-----FEAK-AHRAMDG-LWAARNRDSDLMGTVL--NIDSGDW 257
           LSR SG PI     F A      +DG LW  R +       V     D  DW
Sbjct: 37  LSR-SGKPIAAPAWFTAGLPPFPLDGELWIGRGQFEQTSSIVRSKKPDDEDW 87


>gnl|CDD|216462 pfam01373, Glyco_hydro_14, Glycosyl hydrolase family 14.  This
           family are beta amylases.
          Length = 399

 Score = 28.7 bits (64), Expect = 6.0
 Identities = 17/67 (25%), Positives = 24/67 (35%), Gaps = 10/67 (14%)

Query: 136 GHVLAEYFKLNTDLMQWYTGQLLELAKDIGFRLLPAFNTTTGIPHGRINLRHGLKSKGLE 195
           G   + Y K   D ++WY   L      IG      F+   G+P G        K  G+ 
Sbjct: 227 GGYNSTYGK---DFLEWYQSVLENHLGVIGSAAHKNFDPVFGVPIGA-------KIAGIH 276

Query: 196 WSVDTCT 202
           W  +  T
Sbjct: 277 WQYNNPT 283


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.137    0.424 

Gapped
Lambda     K      H
   0.267   0.0709    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,148,277
Number of extensions: 1961385
Number of successful extensions: 1624
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1613
Number of HSP's successfully gapped: 22
Length of query: 390
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 291
Effective length of database: 6,546,556
Effective search space: 1905047796
Effective search space used: 1905047796
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (26.9 bits)