RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8340
         (173 letters)



>gnl|CDD|227522 COG5195, COG5195, Uncharacterized conserved protein [Function
           unknown].
          Length = 118

 Score = 73.5 bits (180), Expect = 2e-17
 Identities = 39/125 (31%), Positives = 56/125 (44%), Gaps = 13/125 (10%)

Query: 44  EDINSQNQMQEKLAPKYIFKNPEYEQRKLSRRSTSSNLPTKQWKSLKQILSYEKNIPWPE 103
            + N+ +    KL  +  FK   Y+               +++K+L+Q++          
Sbjct: 2   NNNNNNDGGDMKLLKRPWFKKSTYKG------------KNRRFKTLRQLIPRLTIENESS 49

Query: 104 DTIHYSSIVAPPSFKPSLKYSDISRFECNYKDPQTKLYYYNIEEYKLIRKLPSDIVNGYL 163
               Y SI APPS KP +KY D++     Y  P T L Y+N E YKLI +LPS     YL
Sbjct: 50  KH-RYLSIEAPPSVKPRMKYCDVTGLPAPYTCPNTGLRYHNSEIYKLICELPSGRDQEYL 108

Query: 164 ALRGA 168
            LR  
Sbjct: 109 KLREF 113


>gnl|CDD|191983 pfam08265, YL1_C, YL1 nuclear protein C-terminal domain.  This
           domain is found in proteins of the YL1 family. These
           proteins have been shown to be DNA-binding and may be a
           transcription factor. This domain is found in proteins
           that are not YL1 proteins.
          Length = 30

 Score = 50.5 bits (122), Expect = 1e-09
 Identities = 15/30 (50%), Positives = 20/30 (66%)

Query: 122 KYSDISRFECNYKDPQTKLYYYNIEEYKLI 151
           KY DI+     YKDP+T L Y N+E +K+I
Sbjct: 1   KYCDITGLPAKYKDPKTGLPYSNVEAFKVI 30


>gnl|CDD|198061 smart00993, YL1_C, YL1 nuclear protein C-terminal domain.  This
           domain is found in proteins of the YL1 family. These
           proteins have been shown to be DNA-binding and may be a
           transcription factor. This domain is found in proteins
           that are not YL1 proteins.
          Length = 30

 Score = 48.3 bits (116), Expect = 9e-09
 Identities = 15/30 (50%), Positives = 19/30 (63%)

Query: 122 KYSDISRFECNYKDPQTKLYYYNIEEYKLI 151
           KY DI+     Y+DP+T L Y N E YK+I
Sbjct: 1   KYCDITGLPARYRDPKTGLRYANAEAYKVI 30


>gnl|CDD|182071 PRK09777, fecD, iron-dicitrate transporter subunit FecD; Reviewed.
          Length = 318

 Score = 30.8 bits (70), Expect = 0.27
 Identities = 16/38 (42%), Positives = 20/38 (52%), Gaps = 6/38 (15%)

Query: 142 YYNIEEYKLIRKLPSDIVNGYLALRGAL------NPLA 173
           YY + EY+L R L +  V   LA+ G L      NPLA
Sbjct: 44  YYVLMEYRLPRLLLALFVGAALAVSGVLVQGIVRNPLA 81


>gnl|CDD|215862 pfam00331, Glyco_hydro_10, Glycosyl hydrolase family 10. 
          Length = 308

 Score = 30.3 bits (69), Expect = 0.49
 Identities = 13/30 (43%), Positives = 15/30 (50%), Gaps = 10/30 (33%)

Query: 135 DPQTKLYY--YNIEE--------YKLIRKL 154
           DP  KLYY  YNIE         Y L++ L
Sbjct: 160 DPNAKLYYNDYNIEGPNAKREAIYNLVKDL 189


>gnl|CDD|214750 smart00633, Glyco_10, Glycosyl hydrolase family 10. 
          Length = 263

 Score = 29.9 bits (68), Expect = 0.58
 Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 10/30 (33%)

Query: 135 DPQTKLYY--YNIEE--------YKLIRKL 154
           DP  KL+Y  YN EE        Y+L++KL
Sbjct: 120 DPDAKLFYNDYNTEEPNAKRQAIYELVKKL 149


>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat.  Spectrin repeat-domains
          are found in several proteins involved in cytoskeletal
          structure. These include spectrin, alpha-actinin and
          dystrophin. The sequence repeat used in this family is
          taken from the structural repeat in reference. The
          spectrin domain- repeat forms a three helix bundle. The
          second helix is interrupted by proline in some
          sequences. The repeats are defined by a characteristic
          tryptophan (W) residue at position 17 in helix A and a
          leucine (L) at 2 residues from the carboxyl end of
          helix C. Although the domain occurs in ultiple repeats
          along sequences, the domains are actually stable on
          their own - ie they act, biophysically, like domains
          rather than repeats that along function when
          aggregated.
          Length = 105

 Score = 26.1 bits (58), Expect = 5.2
 Identities = 18/90 (20%), Positives = 42/90 (46%), Gaps = 11/90 (12%)

Query: 5  INAW---KEEALYTLKDPTNLLAIQKFFSKGEGIEITMSQSQEDINSQNQMQEKLAPKYI 61
          + +W   KE  L +     +L ++Q    K + +E  ++  Q+ + + N++ EKL  +  
Sbjct: 13 LESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELAEKLIAEGH 72

Query: 62 FKNPEYEQRKLSRRSTSSNLPTKQWKSLKQ 91
          + + E ++R          L  ++W+ L +
Sbjct: 73 YASEEIQERL-------EEL-NERWEQLLE 94


>gnl|CDD|222499 pfam14022, DUF4238, Protein of unknown function (DUF4238).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria, archaea and
           eukaryotes. Proteins in this family are typically
           between 274 and 374 amino acids in length.
          Length = 268

 Score = 27.0 bits (60), Expect = 5.9
 Identities = 9/49 (18%), Positives = 20/49 (40%)

Query: 18  DPTNLLAIQKFFSKGEGIEITMSQSQEDINSQNQMQEKLAPKYIFKNPE 66
            P  LL +           + +   + ++   N +Q K A +Y++ N +
Sbjct: 214 SPKLLLVLSPPRDDASREILILILDEAEVREINSLQAKNAERYVYSNSK 262


>gnl|CDD|173023 PRK14558, pyrH, uridylate kinase; Provisional.
          Length = 231

 Score = 26.6 bits (59), Expect = 6.0
 Identities = 11/30 (36%), Positives = 14/30 (46%), Gaps = 2/30 (6%)

Query: 105 TIHYSSIVAPPSFKPSLKYSDI-SRFECNY 133
            +  S IV  PS +P + Y DI   F   Y
Sbjct: 91  AVIVSQIVNLPSVEP-INYDDIELYFRAGY 119


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 26.3 bits (58), Expect = 7.7
 Identities = 14/74 (18%), Positives = 30/74 (40%), Gaps = 3/74 (4%)

Query: 2   VLRINAWKEEALYTLKD---PTNLLAIQKFFSKGEGIEITMSQSQEDINSQNQMQEKLAP 58
              +  W EE    L       +L ++++   K + +E  +   +  + S N++ E+L  
Sbjct: 115 ADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLE 174

Query: 59  KYIFKNPEYEQRKL 72
           +      E  + KL
Sbjct: 175 EGHPDADEEIEEKL 188


>gnl|CDD|234371 TIGR03839, termin_org_P1, adhesin P1.  Members of this protein
           family are the major adhesin of the Mycoplasma terminal
           organelle. The protein is called adhesin P1, cytadhesin
           P1, P140, attachment protein, and MgPa, with locus names
           MG191 in Mycoplasma genitalium and MPN141 in M.
           pneumoniae. A conserved C-terminal region is shared by
           additional paralogs in M. pneumoniae and M.
           gallisepticum, as well as by the member of this family
           [Cell envelope, Surface structures, Cellular processes,
           Pathogenesis].
          Length = 1425

 Score = 26.7 bits (58), Expect = 8.2
 Identities = 12/65 (18%), Positives = 24/65 (36%)

Query: 46  INSQNQMQEKLAPKYIFKNPEYEQRKLSRRSTSSNLPTKQWKSLKQILSYEKNIPWPEDT 105
             S   +Q    P Y   NP+ +    S    +   P +     K I  Y  +  +   +
Sbjct: 607 AWSSGALQSSGNPYYFHNNPDSQDMPNSNGVEAIIKPWEDKDDAKYIYPYRYSGSYAWQS 666

Query: 106 IHYSS 110
           +++S+
Sbjct: 667 VNWSN 671


>gnl|CDD|165152 PHA02789, PHA02789, uncharacterized protein; Provisional.
          Length = 173

 Score = 26.1 bits (57), Expect = 9.0
 Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 4/38 (10%)

Query: 120 SLKYSDISRF-ECNYKDPQT---KLYYYNIEEYKLIRK 153
           S  Y D+  F EC+Y D +    + Y+YN EEY  I K
Sbjct: 90  SSPYEDLEYFYECDYTDNKPTFDQFYFYNGEEYTEIDK 127


>gnl|CDD|226956 COG4590, COG4590, ABC-type uncharacterized transport system,
           permease component [General function prediction only].
          Length = 733

 Score = 26.4 bits (58), Expect = 9.0
 Identities = 14/60 (23%), Positives = 22/60 (36%), Gaps = 3/60 (5%)

Query: 67  YEQRKLSRRSTSSNLPTKQWKSLKQILSYEKNIPWPEDTIHYSSIVAPPSFKPSLKYSDI 126
            +Q K+      +  P   W +L Q + YE    +PE    + S  A   F+       I
Sbjct: 375 EDQGKIRLAQLENRNPEISWSALWQKVWYES---YPEPAYVWQSTAATDDFEAKFSLVPI 431


>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
          Length = 746

 Score = 26.4 bits (58), Expect = 10.0
 Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 8/41 (19%)

Query: 27 KFFS---KGEGIEITMSQSQEDINSQNQMQEKLAPKYIFKN 64
          K+F+   +GEG E+     Q D+N  +  ++K A K I  N
Sbjct: 24 KYFAQTRRGEGAEL-----QNDLNGTDSYRKKAAVKRIIAN 59


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.131    0.379 

Gapped
Lambda     K      H
   0.267   0.0829    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,843,077
Number of extensions: 778201
Number of successful extensions: 459
Number of sequences better than 10.0: 1
Number of HSP's gapped: 459
Number of HSP's successfully gapped: 29
Length of query: 173
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 83
Effective length of database: 6,945,742
Effective search space: 576496586
Effective search space used: 576496586
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 55 (24.8 bits)