RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12936
         (187 letters)



>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
           CCCH-type Zn-finger protein [General function prediction
           only].
          Length = 299

 Score =  107 bits (267), Expect = 2e-28
 Identities = 63/151 (41%), Positives = 88/151 (58%), Gaps = 9/151 (5%)

Query: 1   ICKYFIEAVENSKYGWFWSCPNG-PTCIYKHALPPGFVLKKDKKKEEKKDQISLEDLIER 59
           +CK+FIEA+E+ KYGW W+CPNG   C Y H LP G+VL +DK K+  + +ISLE+ IE 
Sbjct: 143 VCKFFIEAMESGKYGWGWTCPNGNMRCSYIHKLPDGYVLSRDKIKDSTQVEISLEEFIEL 202

Query: 60  ERAALAASSKFTKVTLDSFLAWKKRKLKEKSEAMTKAEEKKRSDFKA-GRQVGLSGRDMF 118
           ER +L    K T +T ++F  WK      +   + K +E+K S  K  GR  G  G ++F
Sbjct: 203 ERQSL--PEKLTPLTEENFKEWK----DGRRLRILKQKEEKESARKVKGRATGTKGVELF 256

Query: 119 SFNPDLAKDDMDEGDE-AFDSYTRDEEEDPE 148
               DL KDD++ G+E  +      E ED E
Sbjct: 257 ETRRDLFKDDVEAGEEWDYTERCYSESEDEE 287


>gnl|CDD|181730 PRK09256, PRK09256, hypothetical protein; Provisional.
          Length = 138

 Score = 32.0 bits (74), Expect = 0.063
 Identities = 19/62 (30%), Positives = 28/62 (45%), Gaps = 15/62 (24%)

Query: 50  QISLEDLIERERAALAA--SSKFTKVTLDSFLAWKKRKLKEKSEAMTKAEEKKRSDFKAG 107
           + +LE L+   R AL      + TK T  S    K+R+L+ K         K+R + KA 
Sbjct: 85  EDALERLVALIREALKPPKKRRATKPTRGS----KERRLESK---------KQRGEVKAL 131

Query: 108 RQ 109
           R 
Sbjct: 132 RG 133


>gnl|CDD|182885 PRK10991, fucI, L-fucose isomerase; Provisional.
          Length = 588

 Score = 30.7 bits (70), Expect = 0.44
 Identities = 18/61 (29%), Positives = 32/61 (52%), Gaps = 6/61 (9%)

Query: 79  LAWKKRKLKE----KSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLAKDDMDEGDE 134
           LAW K+  KE     +E   +  E+KR+ ++   ++ +  RD+   NP LA  ++   +E
Sbjct: 229 LAWAKKNCKEGEDENAEQYQRNAEQKRAVWEESVKMAMIIRDLMQGNPKLA--ELGFVEE 286

Query: 135 A 135
           A
Sbjct: 287 A 287


>gnl|CDD|214632 smart00356, ZnF_C3H1, zinc finger. 
          Length = 27

 Score = 27.2 bits (61), Expect = 0.59
 Identities = 10/30 (33%), Positives = 13/30 (43%), Gaps = 10/30 (33%)

Query: 1  ICKYFIEAVENSKYGWFWSCPNGPTCIYKH 30
          +CK+F       K G    CP G  C + H
Sbjct: 6  LCKFF-------KRG---YCPRGDRCKFAH 25


>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
          Length = 1832

 Score = 30.5 bits (69), Expect = 0.65
 Identities = 21/53 (39%), Positives = 30/53 (56%), Gaps = 1/53 (1%)

Query: 38  LKKDKKKEEKKDQISLEDLIERERAALAASSKFTKVTLDSFLAWKKRKLKEKS 90
           LKK  +K +KK  + L+ L ER +  L  S +  K  + SFL + KR +KE S
Sbjct: 922 LKKKIRKFKKKYFLVLKILKERTKLFLKVSKETKKWIIKSFL-FLKRIIKELS 973


>gnl|CDD|233835 TIGR02361, dak_ATP, dihydroxyacetone kinase, ATP-dependent.  This
           family consists of examples of the form of
           dihydroxyacetone kinase (also called glycerone kinase)
           that uses ATP (2.7.1.29) as the phosphate donor, rather
           than a phosphoprotein as in E. coli. This form is
           composed of a single chain with separable domains
           homologous to the K and L subunits of the E. coli
           enzyme, and is found in yeasts and other eukaryotes and
           in some bacteria, including Citrobacter freundii. The
           member from tomato has been shown to phosphorylate
           dihydroxyacetone, 3,4-dihydroxy-2-butanone, and some
           other aldoses and ketoses (PMID:11985845).
          Length = 574

 Score = 29.6 bits (67), Expect = 0.98
 Identities = 25/110 (22%), Positives = 38/110 (34%), Gaps = 19/110 (17%)

Query: 27  IYKHALPPGFVLKKDKKKEEKKDQISLEDLIERERAALAASSKFTKVT------------ 74
           I+  A          K+   KK++I+ E      + AL A  K+T               
Sbjct: 457 IFFSAAAQSL-----KQNASKKEEITPEAWAAALKTALDALYKYTGARPGDRTMIDALQP 511

Query: 75  -LDSFLAWKKRKLKEKSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPD 123
            +++  A K         A   AE  K+   KAGR   +S  +     PD
Sbjct: 512 FVEALEAGKSNLNAAVKAAEAGAEATKKMKAKAGRASYVSS-EALEGVPD 560


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 20/147 (13%), Positives = 48/147 (32%), Gaps = 6/147 (4%)

Query: 37  VLKKDKKKEEKKDQISLEDLIERERAALAASSKFTKVTLDSFLAWKKRKLKEKSEAMTKA 96
           VL       +  D  +  D+ +++    A ++                  K + +     
Sbjct: 55  VLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDK 114

Query: 97  EEKKRSDFKAGRQVGLSGRDMFSFNPDLAKDDMDEG------DEAFDSYTRDEEEDPESN 150
           ++                 D    + DL  DD+D+       DE  D    D+E++ +  
Sbjct: 115 DDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKE 174

Query: 151 YRELDLDMLLNEAAELDETNISQAKDR 177
            +EL+     ++    ++ + +  + R
Sbjct: 175 AKELEKLSDDDDFVWDEDDSEALRQAR 201


>gnl|CDD|201014 pfam00116, COX2, Cytochrome C oxidase subunit II, periplasmic
           domain. 
          Length = 120

 Score = 27.8 bits (62), Expect = 1.7
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 129 MDEGDEAFDSYTRDEEEDPESNYRELDLD 157
            D  +  FDSY   E++  E  +R L++D
Sbjct: 17  SDFKNIEFDSYMIPEDDLEEGQFRLLEVD 45


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.0 bits (64), Expect = 2.0
 Identities = 19/62 (30%), Positives = 29/62 (46%)

Query: 42   KKKEEKKDQISLEDLIERERAALAASSKFTKVTLDSFLAWKKRKLKEKSEAMTKAEEKKR 101
            KK EE K     +   E  + A  A  K  +    +  A K  + K+K++   KAEE K+
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523

Query: 102  SD 103
            +D
Sbjct: 1524 AD 1525


>gnl|CDD|216491 pfam01421, Reprolysin, Reprolysin (M12B) family zinc
          metalloprotease.  The members of this family are
          enzymes that cleave peptides. These proteases require
          zinc for catalysis. Members of this family are also
          known as adamalysins. Most members of this family are
          snake venom endopeptidases, but there are also some
          mammalian proteins such as human ADAM8 and fertilin.
          Fertilin and closely related proteins appear to not
          have some active site residues and may not be active
          enzymes.
          Length = 198

 Score = 28.0 bits (63), Expect = 2.3
 Identities = 9/37 (24%), Positives = 14/37 (37%), Gaps = 8/37 (21%)

Query: 54 EDLIERERAALAASSKFTKVTLDSFLAWKKRKLKEKS 90
           D I  +  A          TL  FL W++  L ++ 
Sbjct: 57 GDKITVQGDAN--------DTLHRFLEWRETDLLKRK 85


>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 199 to 238 amino acids in length. This domain is
           found associated with pfam06512, pfam00520. This domain
           has a conserved ADD sequence motif.
          Length = 222

 Score = 27.8 bits (62), Expect = 2.8
 Identities = 7/28 (25%), Positives = 11/28 (39%)

Query: 77  SFLAWKKRKLKEKSEAMTKAEEKKRSDF 104
           S    + R  K K E  +   E+  S+ 
Sbjct: 10  SAKERRNRNDKNKKEEHSIGSEEGDSEK 37


>gnl|CDD|220645 pfam10239, DUF2465, Protein of unknown function (DUF2465).  FAM98A
           and B proteins are found from worms to humans but their
           function is unknown. This entry is of a family of
           proteins that is rich in glycines.
          Length = 318

 Score = 27.9 bits (62), Expect = 2.9
 Identities = 29/100 (29%), Positives = 40/100 (40%), Gaps = 9/100 (9%)

Query: 29  KHALPPGFVLKKDKKKEEKKDQIS--LEDLIERERAALAASSKFTKVTLDSFLAWKKRKL 86
            H   P        ++ EK + I+  LE   E  R  L    K   VT+ SF  W  R  
Sbjct: 174 THLGKPLLKKPLTSEQWEKLELINHELEAEYELRRRML---LKRLDVTVQSFG-WSDR-A 228

Query: 87  KEKSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLAK 126
           K K + M    + KR    AG  V L+   + +   DLA+
Sbjct: 229 KTKEDQMAAVYQPKRHSLDAGSTVDLA--HLLAAREDLAR 266


>gnl|CDD|227220 COG4883, COG4883, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 500

 Score = 27.6 bits (61), Expect = 4.7
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 14/41 (34%)

Query: 5   FIEAVENS----------KYGWFWSCPNGPTCIYKHALPPG 35
           FIE +E S          +YG F+    GPTC+   AL PG
Sbjct: 113 FIEEIEKSEALIGKESIRRYGGFY----GPTCVVDFALVPG 149


>gnl|CDD|214438 MTH00154, COX2, cytochrome c oxidase subunit II; Provisional.
          Length = 227

 Score = 27.1 bits (61), Expect = 5.3
 Identities = 9/22 (40%), Positives = 12/22 (54%)

Query: 136 FDSYTRDEEEDPESNYRELDLD 157
           FDSY     E   + +R LD+D
Sbjct: 118 FDSYMIPTNELENNGFRLLDVD 139


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 27.5 bits (61), Expect = 5.5
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query: 17  FWSCPNGPTCIYKHALPPGFVLKKDKK 43
           FWS P+ PT       PP +VL  D++
Sbjct: 723 FWSVPSDPTNDANATQPPYYVLVGDQQ 749


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 26.5 bits (59), Expect = 6.5
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 110 VGLSGRDMFSFNPDLAKDDMDEGDEAFDSYTRDEEEDPE 148
           V +SG+ + +   D + DD ++ +E  D    DE+E  E
Sbjct: 99  VHISGQHLVASEEDESDDDEEDEEEEDDEEDDDEDESEE 137


>gnl|CDD|204356 pfam09959, DUF2193, Uncharacterized protein conserved in archaea
           (DUF2193).  This domain, found in various hypothetical
           archaeal proteins, has no known function.
          Length = 499

 Score = 27.0 bits (60), Expect = 6.8
 Identities = 21/64 (32%), Positives = 27/64 (42%), Gaps = 21/64 (32%)

Query: 5   FIEAVENS----------KYGWFWSCPNGPTCIYKHALPPGF-------VLKKDKKKEEK 47
           FI+A+  S          +YG F+    GPTC+   AL PG        +LKK     E 
Sbjct: 112 FIDAIGKSEALIGKESIRRYGGFY----GPTCVVDFALIPGSTSNVVNRILKKTDIPVEH 167

Query: 48  KDQI 51
           K  I
Sbjct: 168 KQAI 171


>gnl|CDD|239618 cd03556, L-fucose_isomerase, L-fucose isomerase (FucIase); FucIase
           converts L-fucose, an aldohexose, to its ketose form,
           which prepares it for aldol cleavage (similar to the
           isomerization of glucose during glycolysis). L-fucose
           (or 6-deoxy-L-galactose) is found in blood group
           determinants as well as in various oligo- and
           polysaccharides, and glycosides in mammals, bacteria and
           plants.
          Length = 584

 Score = 27.1 bits (60), Expect = 7.2
 Identities = 15/60 (25%), Positives = 25/60 (41%), Gaps = 4/60 (6%)

Query: 70  FTKVTLDSFLAWKKRKLKE----KSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLA 125
           +     +  LAW K   KE      +   K  E+K  D++   ++ +  RD+   N  LA
Sbjct: 216 YDPEEYEKALAWTKENCKEGEDENEKNDQKKREQKDKDWEFVVKMAIIIRDLMYGNEKLA 275


>gnl|CDD|181735 PRK09262, PRK09262, hypothetical protein; Provisional.
          Length = 225

 Score = 26.4 bits (59), Expect = 7.4
 Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 4/39 (10%)

Query: 88  EKSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLAK 126
           + +EA    EE KR    AG ++GL   D++     LAK
Sbjct: 184 DAAEAREANEESKRERLAAG-ELGL---DIYKMRERLAK 218


>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal.  The family represents
           the N-terminal region of RXT2-like proteins. In S.
           cerevisiae, RXT2 has been demonstrated to be involved in
           conjugation with cellular fusion (mating) and invasive
           growth. A high throughput localisation study has
           localised RXT2 to the nucleus.
          Length = 141

 Score = 26.2 bits (58), Expect = 7.7
 Identities = 15/65 (23%), Positives = 27/65 (41%)

Query: 85  KLKEKSEAMTKAEEKKRSDFKAGRQVGLSGRDMFSFNPDLAKDDMDEGDEAFDSYTRDEE 144
           KL +++        KK  ++    +  LS + +   NP    +D  + D+  +    DEE
Sbjct: 7   KLLKRARPSGPPLYKKVVEYNGYERDVLSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEE 66

Query: 145 EDPES 149
            D E 
Sbjct: 67  ADAED 71


>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus.  This
           family represents the C-terminus (approximately 500
           residues) of the eukaryotic coatomer alpha subunit.
           Coatomer (COPI) is a large cytosolic protein complex
           which forms a coat around vesicles budding from the
           Golgi apparatus. Such coatomer-coated vesicles have been
           proposed to play a role in many distinct steps of
           intracellular transport. Note that many family members
           also contain the pfam04053 domain.
          Length = 421

 Score = 26.4 bits (58), Expect = 9.8
 Identities = 10/53 (18%), Positives = 20/53 (37%)

Query: 127 DDMDEGDEAFDSYTRDEEEDPESNYRELDLDMLLNEAAELDETNISQAKDRVF 179
           D   EG++         E   E    +++ D++L    ++ +     A   VF
Sbjct: 52  DGYLEGEDGLLDDEEGPEGGDEEGGWDVEDDLVLPPELDVPKDQAGNADSDVF 104


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.131    0.376 

Gapped
Lambda     K      H
   0.267   0.0763    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,714,484
Number of extensions: 906905
Number of successful extensions: 1104
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1074
Number of HSP's successfully gapped: 102
Length of query: 187
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 96
Effective length of database: 6,901,388
Effective search space: 662533248
Effective search space used: 662533248
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (25.3 bits)