RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7466
         (338 letters)



>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 36.0 bits (83), Expect = 0.006
 Identities = 17/51 (33%), Positives = 22/51 (43%)

Query: 224 PYQCSQCSLCFDQGSLLKAHVLEKHEVFQCTQCEKYYRSQSTLDVHVSKKH 274
           PY C  C + F     LK H+        C  C K +R+  +   HV KKH
Sbjct: 73  PYVCPLCLMPFSSSVSLKQHIRYTEHSKVCPVCGKEFRNTDSTLDHVCKKH 123



 Score = 26.8 bits (59), Expect = 9.7
 Identities = 26/98 (26%), Positives = 37/98 (37%), Gaps = 9/98 (9%)

Query: 153 EKKNLSKHFFKCLMKFTQKKKNLSKHFFKCLMKFTQSGRSSFKCAICFEYFSLLESLETH 212
           E+K L +   K L+   Q     S + +K L        S + C +C   FS   SL   
Sbjct: 37  EQKRLIRAVVKTLIYNPQLLDE-SSYLYKLLT---SKAVSPYVCPLCLMPFSSSVSL--- 89

Query: 213 ITRRHIGICEEPYQCSQCSLCFDQGSLLKAHVLEKHEV 250
             ++HI   E    C  C   F        HV +KH +
Sbjct: 90  --KQHIRYTEHSKVCPVCGKEFRNTDSTLDHVCKKHNI 125


>gnl|CDD|216357 pfam01197, Ribosomal_L31, Ribosomal protein L31. 
          Length = 69

 Score = 30.6 bits (70), Expect = 0.16
 Identities = 15/40 (37%), Positives = 19/40 (47%), Gaps = 8/40 (20%)

Query: 250 VFQCTQCEKYYRSQST-------LDVHVSKKHPAGTGKVK 282
           VF C+ C   + + ST       +DV  S  HP  TGK K
Sbjct: 13  VFTCSSCGNVFVTGSTKEYPVIKIDVC-SACHPFYTGKQK 51


>gnl|CDD|238916 cd01941, YeiC_kinase_like, YeiC-like sugar kinase.  Found in
           eukaryotes and bacteria, YeiC-like kinase is part of the
           ribokinase/pfkB sugar kinase superfamily. Its
           oligomerization state is unknown at this time.
          Length = 288

 Score = 31.9 bits (73), Expect = 0.42
 Identities = 11/45 (24%), Positives = 15/45 (33%)

Query: 275 PAGTGKVKRDIKPNTNILTVTPNRRSARNDSTKATVKTETEVKVK 319
           P    K+K+       I  +TPNR      +       E E K  
Sbjct: 161 PTSAPKLKKLFYLLHAIDLLTPNRAELEALAGALIENNEDENKAA 205


>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
           This domain family is found in archaea and eukaryotes,
           and is approximately 30 amino acids in length. The
           mammalian members of this group occur multiple times
           along the protein, joined by flexible linkers, and are
           referred to as JAZ - dsRNA-binding ZF protein -
           zinc-fingers. The JAZ proteins are expressed in all
           tissues tested and localise in the nucleus, particularly
           the nucleolus. JAZ preferentially binds to
           double-stranded (ds) RNA or RNA/DNA hybrids rather than
           DNA. In addition to binding double-stranded RNA, these
           zinc-fingers are required for nucleolar localisation.
          Length = 27

 Score = 28.3 bits (64), Expect = 0.48
 Identities = 11/23 (47%), Positives = 18/23 (78%), Gaps = 1/23 (4%)

Query: 253 CTQCEKYYRSQSTLDVHV-SKKH 274
           C  C+KY++S++ L+ H+ SKKH
Sbjct: 4   CVACDKYFKSENALENHLKSKKH 26


>gnl|CDD|223332 COG0254, RpmE, Ribosomal protein L31 [Translation, ribosomal
           structure and biogenesis].
          Length = 75

 Score = 28.4 bits (64), Expect = 1.1
 Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 6/39 (15%)

Query: 250 VFQCTQCEKYYRSQSTL---DVHV---SKKHPAGTGKVK 282
           VF C+ C   + ++ST    ++++   SK HP  TGK K
Sbjct: 14  VFVCSSCGNEFTTRSTKGTDEINLDVCSKCHPFYTGKQK 52


>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
          Length = 141

 Score = 29.0 bits (65), Expect = 1.9
 Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 8/50 (16%)

Query: 76  GRPKGSLNKVKREVVESPSPKREIDAPIVKRKKKDEILLMKALDEEMFKM 125
           GR K S N +K+EV      +REI A  ++R+  +E +L  ++++E  K+
Sbjct: 37  GRLKRSFNAIKQEV------EREIGADEIRRQLHNEHIL--SMEQEARKI 78


>gnl|CDD|218967 pfam06273, eIF-4B, Plant specific eukaryotic initiation factor 4B. 
           This family consists of several plant specific
           eukaryotic initiation factor 4B proteins.
          Length = 496

 Score = 30.1 bits (67), Expect = 2.1
 Identities = 21/68 (30%), Positives = 35/68 (51%), Gaps = 5/68 (7%)

Query: 73  DGRGRPKGSLNKVKREVVESPSP---KREIDAPIVKRKKKDEILLMKALDEEMFKMDEEN 129
           +G  RP+  LN  KR+V  +P+P    R   +P    + ++E+L  K LD +  K+D E 
Sbjct: 231 NGEERPRLILNPRKRDVSATPTPPAEARSRPSPFGAARPREEVLAEKGLDWK--KLDSEI 288

Query: 130 NKEMERIS 137
             +  R +
Sbjct: 289 EAKTSRPT 296


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 29.5 bits (66), Expect = 2.8
 Identities = 19/117 (16%), Positives = 39/117 (33%), Gaps = 6/117 (5%)

Query: 22  RKEGEEKEEERVETQLFQRRTRQRGVTEPETITGKRKRSLDDSQEEVIVKKD---GRGRP 78
            K+ ++++ +        +   +      E    K K+  +    E   K++    + RP
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRP 175

Query: 79  KGSLNKVKREVVESP---SPKREIDAPIVKRKKKDEILLMKALDEEMFKMDEENNKE 132
           K    K      + P     +R+     VK K ++  +  +   EE    D E    
Sbjct: 176 KKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRETTTS 232


>gnl|CDD|223790 COG0718, COG0718, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 105

 Score = 28.0 bits (63), Expect = 3.0
 Identities = 22/89 (24%), Positives = 44/89 (49%), Gaps = 12/89 (13%)

Query: 58  KRSLDDSQEEVIVKKDGRGRPKGSLNKV----KREVVESPSPKREIDAPIVKRKKKD--E 111
           ++ +   QEE + +K+  G+  G L  V    K EV        EID  ++  + K+  E
Sbjct: 18  QKKMQKMQEE-LAQKEVTGKAGGGLVTVTINGKGEVK-----SVEIDPSLLDPEDKEMLE 71

Query: 112 ILLMKALDEEMFKMDEENNKEMERISQGL 140
            L++ A ++   K++E   ++M  ++ G+
Sbjct: 72  DLILAAFNDAKKKVEETRKEKMGALTGGM 100


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 25.7 bits (56), Expect = 3.4
 Identities = 9/24 (37%), Positives = 13/24 (54%)

Query: 251 FQCTQCEKYYRSQSTLDVHVSKKH 274
           F+C  C K + S+  L  H+ K H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 25.7 bits (56), Expect = 3.5
 Identities = 7/23 (30%), Positives = 12/23 (52%)

Query: 142 YQCSVCSQVFIEKKNLSKHFFKC 164
           ++C +C + F  K  L +H  K 
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 24.9 bits (54), Expect = 8.1
 Identities = 9/24 (37%), Positives = 16/24 (66%)

Query: 194 FKCAICFEYFSLLESLETHITRRH 217
           FKC +C + FS  ++L+ H+ + H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 24.5 bits (53), Expect = 10.0
 Identities = 7/24 (29%), Positives = 11/24 (45%)

Query: 225 YQCSQCSLCFDQGSLLKAHVLEKH 248
           ++C  C   F     LK H+ + H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24


>gnl|CDD|225936 COG3401, COG3401, Fibronectin type 3 domain-containing protein
           [General function prediction only].
          Length = 343

 Score = 29.1 bits (65), Expect = 3.5
 Identities = 31/154 (20%), Positives = 49/154 (31%), Gaps = 19/154 (12%)

Query: 18  LICVRKEGEEKEEER-VETQLFQRRTRQRGVTEPET-------ITGKRKRSLDDSQEEVI 69
            +  R + + K +        +Q      G+ E E         T   K S D  +  V 
Sbjct: 8   FVLYRTKKDLKLKRIGTIKNKYQTHYYDEGL-EGEESYPYQEGTTKVDKISYDSERILVK 66

Query: 70  VKKDGRGR-PKGSLNKVKR-EVVESPSPKREIDAPIVKRKKKDEILLMKALDEEMFKMDE 127
                R R    SL + K  +V  SP P   +   I++R+  D   L   L         
Sbjct: 67  TSFIERVRSVFASLERPKSVKVFWSPHPDVSVGKYIIQRQNGDGKFLRTGL--------V 118

Query: 128 ENNKEMERISQGLEYQCSVCSQVFIEKKNLSKHF 161
           +N   +E I   L +       +      + K F
Sbjct: 119 KNRLFVEFIDSDLGHNEKYMELIIAADFQMGKSF 152


>gnl|CDD|223623 COG0549, ArcC, Carbamate kinase [Amino acid transport and
           metabolism].
          Length = 312

 Score = 28.8 bits (65), Expect = 4.0
 Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 8/38 (21%)

Query: 69  IVKKDGRGRPKGSLNKVKREVVESPSPKREIDAPIVKR 106
           + K+D             R VV SP P R ++A  +K 
Sbjct: 150 VFKEDAGRG--------YRRVVPSPKPVRIVEAEAIKA 179


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 4/88 (4%)

Query: 22  RKEGEEKEEERVETQLFQRRTRQRGVTEPETITGKRKRSLDDSQEEVIVKKDGRGRPKGS 81
           R+E +E+EEE  E +   +         PET   +R+   +DS+++     +      GS
Sbjct: 178 RREKKEEEEEEEEDEALIKSLSF----GPETEEDRRRADDEDSEDDEEDNDNTPSPKSGS 233

Query: 82  LNKVKREVVESPSPKREIDAPIVKRKKK 109
            +  K   +   S  +  +AP   + KK
Sbjct: 234 SSPAKPTSILKKSAAKRSEAPSSSKAKK 261


>gnl|CDD|227431 COG5100, NPL4, Nuclear pore protein [Nuclear structure].
          Length = 571

 Score = 28.8 bits (64), Expect = 5.0
 Identities = 28/105 (26%), Positives = 43/105 (40%), Gaps = 11/105 (10%)

Query: 79  KGSLNKVKREVVESPSPKREIDAPIVKRKKKDEILLMKALDEEMFKMDEENNKEMERISQ 138
             S N+ KR+V   P   ++  +  +KR+K    ++   +D+ + K D    + M  + Q
Sbjct: 81  IASNNEKKRDVPGKP---KQDCSKGIKREKDSMPVIQDPIDDSLEKEDGLIRRSMTMLCQ 137

Query: 139 -GLEYQCSVCSQV-----FIEKKNLSKH--FFKCLMKFTQKKKNL 175
            G    CS CS +        K N  KH  F   L K    K  L
Sbjct: 138 HGSNGMCSYCSPLDPWDEKYYKDNKIKHLSFHSYLEKMNSNKNKL 182


>gnl|CDD|233665 TIGR01978, sufC, FeS assembly ATPase SufC.  SufC is part of the SUF
           system, shown in E. coli to consist of six proteins and
           believed to act in Fe-S cluster formation during
           oxidative stress. SufC forms a complex with SufB and
           SufD. SufC belongs to the ATP-binding cassette
           transporter family (pfam00005) but is no longer thought
           to be part of a transporter. The complex is reported as
           cytosolic (PMID:12554644) or associated with the
           membrane (PMID:11943156). The SUF system also includes a
           cysteine desulfurase (SufS, enhanced by SufE) and a
           probable iron-sulfur cluster assembly scaffold protein,
           SufA [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Other].
          Length = 243

 Score = 28.4 bits (64), Expect = 5.1
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 107 KKKDEILLMKALDEEMFKMDE 127
           KK++EIL M  L+ ++  +DE
Sbjct: 150 KKRNEILQMALLEPKLAILDE 170


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 28.8 bits (65), Expect = 5.4
 Identities = 16/83 (19%), Positives = 31/83 (37%), Gaps = 4/83 (4%)

Query: 23  KEGEEKEEERVETQLFQRRTRQRGVTEPETIT-GKRKRSLDDSQEEVIVKKDGRGRPKGS 81
            E EEK E+     L   + R   +T+ +     K  + L + Q++   ++D    P   
Sbjct: 459 PELEEKLEKEERELL---QKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPTLK 515

Query: 82  LNKVKREVVESPSPKREIDAPIV 104
           L  V   + ++       +   V
Sbjct: 516 LGDVPDPIEKTSLETEVSNEAKV 538


>gnl|CDD|239139 cd02674, Peptidase_C19R, A subfamily of peptidase C19. Peptidase
           C19 contains ubiquitinyl hydrolases. They are
           intracellular peptidases that remove ubiquitin molecules
           from polyubiquinated peptides by cleavage of isopeptide
           bonds. They hydrolyze bonds involving the carboxyl group
           of the C-terminal Gly residue of ubiquitin. The purpose
           of the de-ubiquitination is thought to be editing of the
           ubiquitin conjugates, which could rescue them from
           degradation, as well as recycling of the ubiquitin. The
           ubiquitin/proteasome system is responsible for most
           protein turnover in the mammalian cell, and with over 50
           members, family C19 is one of the largest families of
           peptidases in the human genome.
          Length = 230

 Score = 28.0 bits (63), Expect = 6.1
 Identities = 19/93 (20%), Positives = 32/93 (34%), Gaps = 23/93 (24%)

Query: 191 RSSFKCAIC------FEYFSLLESLETHITRRHIGICEEPYQCSQCSL--CFDQGSLLKA 242
           +S   C  C      FE F+ L SL          I        + +L  C       K 
Sbjct: 49  KSRLTCLTCGKTSTTFEPFTYL-SLP---------IPSGSGDAPKVTLEDCLRL--FTKE 96

Query: 243 HVLEKHEVFQCTQCEKYYRSQSTLDVHVSKKHP 275
             L+    ++C +C+K  ++   L +    + P
Sbjct: 97  ETLDGDNAWKCPKCKKKRKATKKLTIS---RLP 126


>gnl|CDD|117168 pfam08594, UPF0300, Uncharacterized protein family (UPF0300).  This
           family of proteins appear to be specific to S. pombe.
          Length = 215

 Score = 27.8 bits (62), Expect = 6.3
 Identities = 14/41 (34%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 109 KDEILLMKALDEEMFKMDEENNKEMERISQGLEYQCSVCSQ 149
             E+LL +  DE +F+ D+EN  E+E   + L    S+C  
Sbjct: 174 PKELLLFRKSDEFIFQ-DQENIDELESFRRVLLEDLSLCES 213


>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type.  This is a zinc-finger
           domain with the CxxCx(12)Hx(6)H motif, found in multiple
           copies in a wide range of proteins from plants to
           metazoans. Some member proteins, particularly those from
           plants, are annotated as being RNA-binding.
          Length = 25

 Score = 24.8 bits (55), Expect = 7.6
 Identities = 10/25 (40%), Positives = 13/25 (52%), Gaps = 1/25 (4%)

Query: 251 FQCTQCEKYYRSQSTLDVHV-SKKH 274
           F C  C   + S+S L  H+  KKH
Sbjct: 1   FYCELCNVTFTSESQLKSHLRGKKH 25


>gnl|CDD|184959 PRK14997, PRK14997, LysR family transcriptional regulator;
          Provisional.
          Length = 301

 Score = 28.0 bits (62), Expect = 7.8
 Identities = 12/20 (60%), Positives = 14/20 (70%)

Query: 30 EERVETQLFQRRTRQRGVTE 49
          EER+  +L QR TRQ  VTE
Sbjct: 41 EERLGVRLIQRTTRQFNVTE 60


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 24.7 bits (54), Expect = 8.0
 Identities = 8/24 (33%), Positives = 16/24 (66%), Gaps = 1/24 (4%)

Query: 251 FQCTQCEKYYRSQSTLDVHVSKKH 274
           ++C +C K ++S+S L  H+ + H
Sbjct: 1   YRCPECGKVFKSKSALREHM-RTH 23



 Score = 24.3 bits (53), Expect = 9.3
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query: 142 YQCSVCSQVFIEKKNLSKH 160
           Y+C  C +VF  K  L +H
Sbjct: 1   YRCPECGKVFKSKSALREH 19


>gnl|CDD|215642 PLN03238, PLN03238, probable histone acetyltransferase MYST;
           Provisional.
          Length = 290

 Score = 27.9 bits (62), Expect = 8.4
 Identities = 12/50 (24%), Positives = 23/50 (46%), Gaps = 3/50 (6%)

Query: 248 HEVFQCTQCEKYYRSQSTLDVHVSK---KHPAGTGKVKRDIKPNTNILTV 294
            +++ C  C KY R + +L  H++K   + P G G      +   ++  V
Sbjct: 46  TKLYICEYCLKYMRKKKSLLRHLAKCDIRQPPGGGIYGAVTEGPLSVFEV 95


>gnl|CDD|165133 PHA02768, PHA02768, hypothetical protein; Provisional.
          Length = 55

 Score = 25.4 bits (55), Expect = 9.4
 Identities = 8/24 (33%), Positives = 17/24 (70%)

Query: 140 LEYQCSVCSQVFIEKKNLSKHFFK 163
           L Y+C +C +++I++K++  H  K
Sbjct: 4   LGYECPICGEIYIKRKSMITHLRK 27


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.131    0.372 

Gapped
Lambda     K      H
   0.267   0.0709    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,251,579
Number of extensions: 1535562
Number of successful extensions: 1790
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1784
Number of HSP's successfully gapped: 77
Length of query: 338
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 240
Effective length of database: 6,590,910
Effective search space: 1581818400
Effective search space used: 1581818400
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.5 bits)