RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5736
(346 letters)
>gnl|CDD|235198 PRK04011, PRK04011, peptide chain release factor 1; Provisional.
Length = 411
Score = 36.0 bits (84), Expect = 0.024
Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 8/72 (11%)
Query: 218 LMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEIKTKLSELN 277
+ E L K + +C N CG K ++ + EK C C S ++ E + + EL+
Sbjct: 318 ISEDLRKDRVTYKCPN--CGYE--EEKTVKRREELPEKTCPKCGSELEIVEEEDIIEELS 373
Query: 278 EMFYRGIEQMNT 289
E+ EQ T
Sbjct: 374 EL----AEQSGT 381
>gnl|CDD|216155 pfam00856, SET, SET domain. SET domains are protein lysine
methyltransferase enzymes. SET domains appear to be
protein-protein interaction domains. It has been
demonstrated that SET domains mediate interactions with
a family of proteins that display similarity with
dual-specificity phosphatases (dsPTPases). A subset of
SET domains have been called PR domains. These domains
are divergent in sequence from other SET domains, but
also appear to mediate protein-protein interaction. The
SET domain consists of two regions known as SET-N and
SET-C. SET-C forms an unusual and conserved knot-like
structure of probably functional importance.
Additionally to SET-N and SET-C, an insert region
(SET-I) and flanking regions of high structural
variability form part of the overall structure.
Length = 113
Score = 32.5 bits (74), Expect = 0.098
Identities = 17/51 (33%), Positives = 24/51 (47%), Gaps = 4/51 (7%)
Query: 138 GVGIYPTVSLFNHDCYP-AVTRYFN---GKNIIVKALRPLKPKEVVAENYG 184
G+ NH C P R+ G I+V+ALR +KP E + +YG
Sbjct: 63 ATGLGNVARFINHSCEPNCEVRFVFVNGGDRIVVRALRDIKPGEELTIDYG 113
>gnl|CDD|214614 smart00317, SET, SET (Su(var)3-9, Enhancer-of-zeste, Trithorax)
domain. Putative methyl transferase, based on outlier
plant homologues.
Length = 124
Score = 29.6 bits (67), Expect = 1.1
Identities = 14/49 (28%), Positives = 22/49 (44%), Gaps = 4/49 (8%)
Query: 146 SLFNHDCYPAVTRYF----NGKNIIVKALRPLKPKEVVAENYGLVFSRK 190
NH C P F I++ ALR +KP E + +YG ++ +
Sbjct: 76 RFINHSCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
>gnl|CDD|146106 pfam03302, VSP, Giardia variant-specific surface protein.
Length = 397
Score = 30.7 bits (69), Expect = 1.2
Identities = 16/62 (25%), Positives = 21/62 (33%), Gaps = 10/62 (16%)
Query: 209 CRACVENWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTE 268
C C + L K C +NC K ++V CE CNS + LT
Sbjct: 1 CDECKPGYELSADKTKCTSSAPCKTENC-------KACSNDKREV---CEECNSNNYLTP 50
Query: 269 IK 270
Sbjct: 51 TS 52
>gnl|CDD|225633 COG3091, SprT, Zn-dependent metalloprotease, SprT family [General
function prediction only].
Length = 156
Score = 28.9 bits (65), Expect = 2.2
Identities = 14/55 (25%), Positives = 24/55 (43%), Gaps = 13/55 (23%)
Query: 176 KEVVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVENWPLMESLEKYPIRIR 230
K ++ + GL F R H + Q V Y + C+C+ ++ +RIR
Sbjct: 89 KLLMQQVLGLRFCRTHQFEVQSVRRTTYPYRCQCQ-------------QHYLRIR 130
>gnl|CDD|193497 cd03875, M28_Fxna_like, M28 Zn-peptidase Endoplasmic reticulum
metallopeptidase 1. Peptidase family M28; Endoplasmic
reticulum metallopeptidase 1 (ERMP1; Felix-ina, FXNA or
Fxna peptidase; KIAA1815) subfamily. ERMP1 is a
multi-pass membrane protein located in the endoplasmic
reticulum membrane. In humans, Fxna may play a crucial
role in processing proteins required for the
organization of somatic cells and oocytes into discrete
follicular structures, although which proteins are
hydrolyzed has not yet been determined. Another member
of this cluster is the 24-kDa vacuolar protein (VP24)
which is probably involved in the formation of
intravacuolar pigmented globules (cyanoplasts) in highly
anthocyanin-containing vacuoles; however, the biological
function of the C-terminal region which includes the
putative transmembrane metallopeptidase domain is
unknown.
Length = 307
Score = 29.5 bits (67), Expect = 2.7
Identities = 9/39 (23%), Positives = 16/39 (41%), Gaps = 4/39 (10%)
Query: 148 FNHDCYPAVTRYFNGKNIIVKALRPLKPKEVVAENYGLV 186
+ + YF G NI+V+ + K +E L+
Sbjct: 64 SDFLGGGMTSVYFEGTNIVVR----ISGKNNGSEGALLL 98
>gnl|CDD|227506 COG5179, TAF1, Transcription initiation factor TFIID, subunit TAF1
[Transcription].
Length = 968
Score = 28.9 bits (64), Expect = 4.6
Identities = 20/81 (24%), Positives = 37/81 (45%)
Query: 246 LEPSAKKVEKKCESCNSTSDLTEIKTKLSELNEMFYRGIEQMNTSCFREAVESLTKFSDQ 305
L+ AK +K+ E + L E+K K + N+ +Y +++ + S R + +L
Sbjct: 333 LKRQAKPKKKQQEVFSDDKPLNELKGKFNISNDKYYGILKKTHQSKVRNTLSTLGIQHSL 392
Query: 306 IHELVVPPYKLASLAHEALRN 326
+ PP+ + E LRN
Sbjct: 393 PALRLDPPFYKNNHTKEELRN 413
>gnl|CDD|173900 cd00674, LysRS_core_class_I, catalytic core domain of class I
lysyl tRNA synthetase. Class I lysyl tRNA synthetase
(LysRS) catalytic core domain. This class I enzyme is a
monomer which aminoacylates the 2'-OH of the nucleotide
at the 3' of the appropriate tRNA. The core domain is
based on the Rossman fold and is responsible for the
ATP-dependent formation of the enzyme bound
aminoacyl-adenylate. It contains the characteristic
class I HIGH and KMSKS motifs, which are involved in ATP
binding. The class I LysRS is found only in archaea and
some bacteria and has evolved separately from class II
LysRS, as the two do not share structural or sequence
similarity.
Length = 353
Score = 28.8 bits (65), Expect = 4.9
Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 11/66 (16%)
Query: 210 RACVENWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS-DLTE 268
R E W YP C CG+ TV+ + A V KCE + + D+
Sbjct: 159 RELQETW--------YPFMPYCEK--CGKDTTTVEAYDAKAGTVTYKCECGHEETVDIRT 208
Query: 269 IKTKLS 274
+ KL+
Sbjct: 209 GRGKLT 214
>gnl|CDD|216284 pfam01074, Glyco_hydro_38, Glycosyl hydrolases family 38 N-terminal
domain. Glycosyl hydrolases are key enzymes of
carbohydrate metabolism.
Length = 269
Score = 28.3 bits (64), Expect = 5.0
Identities = 12/57 (21%), Positives = 20/57 (35%), Gaps = 9/57 (15%)
Query: 10 IWDHITGSRLISWRPDRGSWARNSARCTALTLLNE---------HDDSLLLTGYDDG 57
IW GS +++ + A+ A LL++ + L G DG
Sbjct: 161 IWRGPDGSEILTHMLPFDYYPTYGAQFRADDLLDQAKKYADKTRTNHVLWPFGDGDG 217
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 28.8 bits (64), Expect = 6.3
Identities = 21/107 (19%), Positives = 30/107 (28%), Gaps = 17/107 (15%)
Query: 207 CKCRACVENWPLMESLEKYPIRIRC--------SNDNCGQIIATVKKLEPSAKKVEKKCE 258
C+ + E W M+S IR D G I K+
Sbjct: 2726 CELSSEFEKWRRMKSKYLCAIRYMLMSSEWILDHEDRSGFIHRLDVSFLLRTKR------ 2779
Query: 259 SCNSTSDLTEIKTKLSELNEMFYRGIEQMNTSCFREAVESLTKFSDQ 305
S L E K L+ G + ++ SC R + Q
Sbjct: 2780 ---FVSTLLEDKNYRQVLSSCSLYGNDVISHSCDRFDRDVYRALKHQ 2823
>gnl|CDD|237234 PRK12854, PRK12854, glucose-6-phosphate 1-dehydrogenase;
Provisional.
Length = 484
Score = 28.5 bits (64), Expect = 6.9
Identities = 9/17 (52%), Positives = 13/17 (76%)
Query: 163 KNIIVKALRPLKPKEVV 179
KN + +++RPL P EVV
Sbjct: 270 KNKVFRSMRPLDPAEVV 286
>gnl|CDD|235164 PRK03824, hypA, hydrogenase nickel incorporation protein;
Provisional.
Length = 135
Score = 27.0 bits (60), Expect = 8.6
Identities = 10/23 (43%), Positives = 10/23 (43%)
Query: 208 KCRACVENWPLMESLEKYPIRIR 230
KCR C W L E E IR
Sbjct: 72 KCRNCGNEWSLKEVKESLDEEIR 94
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 27.7 bits (62), Expect = 9.3
Identities = 26/100 (26%), Positives = 42/100 (42%), Gaps = 13/100 (13%)
Query: 238 QIIATVKKLEPSAKKVEKKCESCNSTSDLT----EIKTKLSELNEMFYRGI-------EQ 286
+++ +K+L + +K E +L E+K K E++E E+
Sbjct: 135 ELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEE 194
Query: 287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRN 326
M F EA + L K +D++HE V K HE RN
Sbjct: 195 MIK-LFEEA-DELRKEADELHEEFVELSKKIDELHEEFRN 232
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.133 0.407
Gapped
Lambda K H
0.267 0.0717 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,021,828
Number of extensions: 1582749
Number of successful extensions: 1403
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1399
Number of HSP's successfully gapped: 24
Length of query: 346
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 248
Effective length of database: 6,590,910
Effective search space: 1634545680
Effective search space used: 1634545680
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (26.5 bits)