RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15690
(236 letters)
>gnl|CDD|219755 pfam08231, SYF2, SYF2 splicing factor. Proteins in this family are
involved in cell cycle progression and pre-mRNA
splicing.
Length = 157
Score = 154 bits (391), Expect = 1e-47
Identities = 81/156 (51%), Positives = 109/156 (69%), Gaps = 9/156 (5%)
Query: 85 EEGEDYERVKMLNMTVAEAEKFNKKKK----NPDVGFSDY-QQATARQYNRLVKQMKPDM 139
E+G D+ER + L+ + E+EK++KKKK D GF+DY Q A R Y R +K +KPD+
Sbjct: 2 EQGIDFERKRALDWSAEESEKWDKKKKKKKNKDDFGFNDYEQDALYRAYKRRLKNLKPDL 61
Query: 140 SKYEEQKKKLGPAFYGDQK----TILQGLHSDRPEAVDKMVEDLEKQISKREKYSRRRTH 195
+YE+QK+ +G FY +IL + EA+D++V+DLEKQ KR K+SRRR
Sbjct: 62 EEYEKQKEAVGEKFYATFYRTADSILYTSNKPTEEAIDRLVQDLEKQQEKRLKFSRRRGF 121
Query: 196 NDDADIDYINERNMKFNKKLERFYGEHTAEIKQNLE 231
+D+ADIDYINERN KFNKKL+RFY ++TAEIKQNLE
Sbjct: 122 DDEADIDYINERNKKFNKKLDRFYDKYTAEIKQNLE 157
>gnl|CDD|222040 pfam13311, DUF4080, Protein of unknown function (DUF4080). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 187
Score = 30.2 bits (69), Expect = 0.58
Identities = 25/104 (24%), Positives = 40/104 (38%), Gaps = 24/104 (23%)
Query: 22 QEKHAERLKRLKEI----HLKRNESRQLNHQEVVEEDRRKQLPNNWEARKRKAEW----- 72
+EK E L+ L+E+ +L R ++ K P +A K E
Sbjct: 95 EEKLPEDLELLRELLKFDYLLREKA--------------KSRPEFLDAEDDKEEKEEIRE 140
Query: 73 -ILKDEEQRQKALEEGEDYERVKMLNMTVAEAEKFNKKKKNPDV 115
LK+EE+ + L E E + ++ MT E + N K
Sbjct: 141 FYLKEEEKLRYYLPEYEGTDAKQLRKMTHLEKFRINVWKYEKTA 184
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 29.0 bits (64), Expect = 3.0
Identities = 43/217 (19%), Positives = 80/217 (36%), Gaps = 8/217 (3%)
Query: 13 EAEQKNLTFQEKHAERLKRLKEIHLKRNESRQLNHQEVVEED-----RRKQLPNNWEARK 67
E K + K AE K+ E K+ + +E+ + + + + E K
Sbjct: 1615 AEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
Query: 68 RKAEWILKDEEQRQKALEEGEDYERVKMLNMTVAEAEKFNKKKKNPDVGFSDYQQATARQ 127
+KAE K EE +KA E + + + E KKK + + A +
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734
Query: 128 YNRLVKQMK---PDMSKYEEQKKKLGPAFYGDQKTILQGLHSDRPEAVDKMVEDLEKQIS 184
+ ++ K + K EE+KKK+ ++K + +++ E+ EK+
Sbjct: 1735 AKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRM 1794
Query: 185 KREKYSRRRTHNDDADIDYINERNMKFNKKLERFYGE 221
+ +K + N I+ E N+ N E
Sbjct: 1795 EVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSA 1831
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 28.7 bits (64), Expect = 3.1
Identities = 19/97 (19%), Positives = 36/97 (37%), Gaps = 9/97 (9%)
Query: 22 QEKHAERLKRLKEIHLKRNESRQLNHQEVVEEDRRKQLPNNWEARKRKAEWILKDEEQRQ 81
Q++ ER ++ + E N +E + R++Q + D+E
Sbjct: 210 QDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQ-------ELEENVERDSDDEDEH 262
Query: 82 KALEEGEDYERVKMLNMTVAEAEKFNKKKKNPDVGFS 118
E+ ED E +T E+ ++ +K D S
Sbjct: 263 G--EDSEDGETKPESYITSEYIERISEIRKMKDERLS 297
>gnl|CDD|133422 cd01337, MDH_glyoxysomal_mitochondrial, Glyoxysomal and
mitochondrial malate dehydrogenases. MDH is one of the
key enzymes in the citric acid cycle, facilitating both
the conversion of malate to oxaloacetate and
replenishing levels of oxalacetate by reductive
carboxylation of pyruvate. Members of this subfamily are
localized to the glycosome and mitochondria. MDHs are
part of the NAD(P)-binding Rossmann fold superfamily,
which includes a wide variety of protein families
including the NAD(P)-binding domains of alcohol
dehydrogenases, tyrosine-dependent oxidoreductases,
glyceraldehyde-3-phosphate dehydrogenases,
formate/glycerate dehydrogenases, siroheme synthases,
6-phosphogluconate dehydrogenases, aminoacid
dehydrogenases, repressor rex, and NAD-binding potassium
channel domains, among others.
Length = 310
Score = 28.2 bits (64), Expect = 3.7
Identities = 6/30 (20%), Positives = 14/30 (46%)
Query: 160 ILQGLHSDRPEAVDKMVEDLEKQISKREKY 189
L L+ + ++ + +L+K I K +
Sbjct: 280 GLGKLNDYEKKLLEAALPELKKNIEKGVDF 309
>gnl|CDD|224820 COG1908, FrhD, Coenzyme F420-reducing hydrogenase, delta subunit
[Energy production and conversion].
Length = 132
Score = 26.9 bits (60), Expect = 4.9
Identities = 15/47 (31%), Positives = 26/47 (55%), Gaps = 8/47 (17%)
Query: 67 KRKAEWILKDEEQRQKALEE-GEDYERVKMLNMTVAEAEKFNKKKKN 112
KR+ E + ++ L+E G + ERV++L ++ AE EKF +
Sbjct: 77 KRRMELL-------KELLKELGIEPERVRVLWISAAEGEKFAETINE 116
>gnl|CDD|226481 COG3972, COG3972, Superfamily I DNA and RNA helicases [General
function prediction only].
Length = 660
Score = 27.9 bits (62), Expect = 4.9
Identities = 28/128 (21%), Positives = 48/128 (37%), Gaps = 13/128 (10%)
Query: 13 EAEQKNLTFQEKHAERLKRLKEIHLKRNESRQLNHQEVVEEDRRKQLPNNWEARKRKAEW 72
EA +N +K E LK + I + + R L N +++
Sbjct: 108 EALSRNDIDFKKLEEELKESRVIIASLRVGKIN-------DTSRINLEANALLDTIESKI 160
Query: 73 ILKDEEQRQKALEEGEDYERVKML---NMTVAEAEKFNK-KKKNPD--VGFSDYQQATAR 126
D +Q + A + G +R++ L T A K + KNPD + F+ + + A
Sbjct: 161 ANFDTDQTKAAFQSGFGKQRIRGLAGSGKTELLAHKAAELHSKNPDSRIAFTFFTKILAS 220
Query: 127 QYNRLVKQ 134
LV +
Sbjct: 221 TMRTLVPE 228
>gnl|CDD|223193 COG0115, IlvE, Branched-chain amino acid
aminotransferase/4-amino-4-deoxychorismate lyase [Amino
acid transport and metabolism / Coenzyme metabolism].
Length = 284
Score = 27.7 bits (62), Expect = 5.8
Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 2/48 (4%)
Query: 12 MEAEQKNLTFQEKHAERLKR-LKEIHLKRNESRQLNHQEVVEEDRRKQ 58
+ A L ++H RLKR K + L R ES + +++ K
Sbjct: 35 LRAYNGKLFRLDEHLARLKRSAKRLGLPRPESE-EEIELLIQLLLAKN 81
>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
Provisional.
Length = 784
Score = 27.6 bits (61), Expect = 6.4
Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 9/95 (9%)
Query: 12 MEAEQKNLTFQEKHAERLKRLKEIHLKR---NESRQLNHQEVVEEDRRKQLPNNWEARKR 68
ME+E L +++ R+K+ E + NE + +E+ E D P+ EA KR
Sbjct: 200 MESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMD---DAPDENEALKR 256
Query: 69 KAEWILKDEEQRQKALEEGEDYERVKMLNMTVAEA 103
K + +E ++KA E +++KM++ AEA
Sbjct: 257 KIDAAKMPKEAKEKAEAE---LQKLKMMSPMSAEA 288
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 27.3 bits (61), Expect = 9.1
Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 9/95 (9%)
Query: 2 SITLTKYT-LDMEAEQKNLTFQEKHAERLKRLKEIHLKRNESRQLNHQ-EVVEEDRRKQL 59
SIT+T + L E ++ + E +AE K+ KE RN + L +Q E ++ +L
Sbjct: 489 SITITASSGLSEEEIERMVKEAEANAEEDKKRKEEIEARNNADSLAYQAEKTLKEAGDKL 548
Query: 60 PNNWEARKRKAEWILKDEEQRQKALEEGEDYERVK 94
P K K E + + K +GED E +K
Sbjct: 549 P---AEEKEKIEKAVAE----LKEALKGEDVEEIK 576
>gnl|CDD|177848 PLN02197, PLN02197, pectinesterase.
Length = 588
Score = 27.3 bits (60), Expect = 9.3
Identities = 16/76 (21%), Positives = 27/76 (35%), Gaps = 7/76 (9%)
Query: 65 ARKRKAEWILKDEEQR----QKALEEGEDYERVK---MLNMTVAEAEKFNKKKKNPDVGF 117
+ + + KD + +A+ D + + + + KKKN F
Sbjct: 271 GKIKATHVVAKDGSGQFKTISQAVMACPDKNPGRCIIHIKAGIYNEQVTIPKKKNNIFMF 330
Query: 118 SDYQQATARQYNRLVK 133
D + T YNR VK
Sbjct: 331 GDGARKTVISYNRSVK 346
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.128 0.351
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,211,832
Number of extensions: 1191281
Number of successful extensions: 1680
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1577
Number of HSP's successfully gapped: 201
Length of query: 236
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 142
Effective length of database: 6,768,326
Effective search space: 961102292
Effective search space used: 961102292
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.8 bits)