RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15691
(620 letters)
>gnl|CDD|218295 pfam04858, TH1, TH1 protein. TH1 is a highly conserved but
uncharacterized metazoan protein. No homologue has been
identified in Caenorhabditis elegans. TH1 binds
specifically to A-Raf kinase.
Length = 582
Score = 884 bits (2287), Expect = 0.0
Identities = 360/584 (61%), Positives = 427/584 (73%), Gaps = 11/584 (1%)
Query: 1 MNDDYDDEQTHRWDQDMVDV----DSDGGDDPMENPQEVLNECLEKFSTPDYIMEPGIFS 56
M +DY E T WD+ D D + ENP VL ECL +FST DYIMEP IF
Sbjct: 1 MEEDY--EVTSGWDRSDGRGTAQGDDDARAEVDENPTAVLQECLRRFSTKDYIMEPSIFD 58
Query: 57 QLKRYFQAGGNPEQVIELLSQNYKAVAQMANLVAEWLILGGVNVTEVQAMVENHLKDMIL 116
LKRYFQAGG PE VIELLS+NY+AVAQM NL+AEWLI+ GV+ T+VQAMVENHLK +I
Sbjct: 59 DLKRYFQAGGAPETVIELLSENYRAVAQMTNLLAEWLIVTGVSPTKVQAMVENHLKTLIR 118
Query: 117 KTFDPKKADTIFTEEGETPAWLTEMIEHPTWRSLIYRLAEEYPDCLMLNFTIKLISDAGF 176
K FDP KAD IFTEEGETPAWL MI H TWR+L+Y LAEE P+CLMLNFTIKLISDAG
Sbjct: 119 KRFDPNKADAIFTEEGETPAWLDGMIRHETWRALLYSLAEENPNCLMLNFTIKLISDAGM 178
Query: 177 QREITSISTAAQQIEVFSRVLKTSISNFLES-SDDWQSSVEECAKMVCHGQHTYVYSQVL 235
Q EITS+STAAQQ EVFSRVL T I+ L S DD + ++EE A MVCHG HTYV+ Q++
Sbjct: 179 QHEITSLSTAAQQFEVFSRVLATHIAGILVSGPDDVRRAIEELASMVCHGIHTYVFGQLM 238
Query: 236 LQVLSLESKGGSKVKRLAQEITKCAQQSGHDVTPIIMALNGAAFYPQANQALSSMISRNS 295
L VL E GG+ RL+QEI + A + T I +AL +A PQA AL+ M+SR +
Sbjct: 239 LAVLGQEEDGGTAAARLSQEIEEAAAKLHGAATVITLALATSAVDPQACYALADMLSRGA 298
Query: 296 LNPADITVLFRLYSSPDPPPIDLIRIPQFLELLTDSLFKPGVKLNPEHKPKYIYLLAYAS 355
LNPADITVL R+Y+S DPPP+DLIR P FL LL D LFK GVK+N HK KY LLAYA+
Sbjct: 299 LNPADITVLLRMYTSSDPPPVDLIRHPDFLYLLIDELFKWGVKINNRHKSKYFELLAYAA 358
Query: 356 SVYEVSTGKKTKQKSFNKDDLKTTIQAIEKVHSI-CNSSKSSNELIAEITTLFHCIRYPV 414
SV E K ++ NKD+LK+TI+AIEK H+I CN+ + +ELIAE++TL++CIR+PV
Sbjct: 359 SVVETP---PNKDRNINKDELKSTIKAIEKAHAIICNAKRGKSELIAELSTLYNCIRFPV 415
Query: 415 VSVGVIRWVECTVTEPSYFKLCTEHTPTHLALLDEVVTCHTFLHHKVLQLFIELFESKQD 474
V GVI WV VTEPSYF+L T P HLA+LDEV H L +VL+L + LFES D
Sbjct: 416 VGAGVIAWVRSAVTEPSYFRLVTNSNPVHLAVLDEVARKHVALQGQVLELLVALFESHGD 475
Query: 475 ELEILVQLEMRKMLLDRMVNLLCRGCVVPVVKYVKQCWQRGDTDISLIRYFVTEVLEAIT 534
ELEILVQLE++KMLLDRMV LL RG VVPV+ Y QCW+R DTDISLIRYFVTEVLE
Sbjct: 476 ELEILVQLELKKMLLDRMVELLARGHVVPVITYACQCWRRLDTDISLIRYFVTEVLEIAG 535
Query: 535 PPYTPEFVQLFLPIVENDEITGTMRGDGENDPVSEFIVQTLEHE 578
PPY+ +FVQLFLP++EN+ I GTMRG+G++DPV EFIV
Sbjct: 536 PPYSRDFVQLFLPLLENENIRGTMRGEGDSDPVDEFIVHCKRRY 579
>gnl|CDD|219491 pfam07632, DUF1593, Protein of unknown function (DUF1593). A
family of proteins in Rhodopirellula baltica that are
predicted to be secreted. Also, a member has been
identified in Caulobacter crescentus. These proteins mat
be related to pfam01156.
Length = 260
Score = 31.9 bits (73), Expect = 0.89
Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 18/92 (19%)
Query: 511 CWQRGDTDISLIRYFV---TEV---LEAITPPY-TPEFVQLFLPIVENDEITGTMRGDGE 563
W +G I + +V L P Y T E+ IV+ M GE
Sbjct: 39 TWHKGGVRPEDIDRIIDAYEKVYPNLNKHAPGYPTAEY---LRSIVKQGNPDYGMPFVGE 95
Query: 564 NDPV--SEFIVQTLEHEWHQLEDDKKIY-QCW 592
SE I++ L +D + ++ W
Sbjct: 96 GKDTEGSELIIEALLD-----DDPRPLWILVW 122
>gnl|CDD|224296 COG1378, COG1378, Predicted transcriptional regulators
[Transcription].
Length = 247
Score = 30.9 bits (70), Expect = 1.6
Identities = 18/124 (14%), Positives = 49/124 (39%), Gaps = 5/124 (4%)
Query: 460 KVLQLFIELFESKQDELEILVQLEMRKMLLDRMVNLLCRGCVVPVVKYVKQCWQRGDTDI 519
++++ E+ + E+ I++ E+ K L + ++ L RG V ++ + +
Sbjct: 120 EIIEKIKEVINEAEKEIIIVLPYEIFKELKEPLIRALKRGVRVLILVFPI---RDVPLAF 176
Query: 520 SLIRYFVTEVLEAITPPYTPEFV--QLFLPIVENDEITGTMRGDGENDPVSEFIVQTLEH 577
+ ++F+ +++ V + I+ T+ G + FI + +
Sbjct: 177 IVNKFFLDALIDIRFGDPLAGEVIISDRIRILIVPSDGETVEGVIYSFDEIAFITHNIFY 236
Query: 578 EWHQ 581
E +
Sbjct: 237 EVWK 240
>gnl|CDD|177383 PHA02548, 24, capsid vertex protein; Provisional.
Length = 412
Score = 31.2 bits (71), Expect = 1.7
Identities = 22/72 (30%), Positives = 34/72 (47%), Gaps = 10/72 (13%)
Query: 140 EMIEHPTWRSLIYRLAEEYPDCLMLNFTIKLISDAG----FQREITSISTAAQQI----- 190
E++E S L E + ++L I+L+SDA F+ I+ + QI
Sbjct: 105 EVLEAGPLASSETDLEEAIQEAVVLG-KIRLVSDAASTEKFEDGDEEIAETSFQIDKWQV 163
Query: 191 EVFSRVLKTSIS 202
V SR LKTS++
Sbjct: 164 PVRSRKLKTSLT 175
>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K. CENP-K is one
of seven new CENP-A-nucleosome distal (CAD) centromere
components (the others being CENP-L, CENP-O, CENP-P,
CENP-Q, CENP-R and CENP-S) that are identified as
assembling on the CENP-A nucleosome associated complex,
NAC. The CENP-A NAC is essential, as disruption of the
complex causes errors of chromosome alignment and
segregation that preclude cell survival despite
continued centromere-derived mitotic checkpoint
signalling. CENP-K is centromere-associated through its
interaction with one or more components of the CENP-A
NAC.
Length = 273
Score = 30.6 bits (69), Expect = 2.2
Identities = 14/46 (30%), Positives = 21/46 (45%), Gaps = 4/46 (8%)
Query: 314 PPIDLIRIPQFLELLTDSLFK----PGVKLNPEHKPKYIYLLAYAS 355
P + LI + + LE+L + L P VK+ P Y+ LL
Sbjct: 209 PSVQLITLHEILEILINRLMDTPHDPYVKVKDSFWPPYVELLLRYG 254
>gnl|CDD|176458 cd03468, PolY_like, DNA Polymerase Y-family. Y-family DNA
polymerases are a specialized subset of polymerases that
facilitate translesion synthesis (TLS), a process that
allows the bypass of a variety of DNA lesions. Unlike
replicative polymerases, TLS polymerases lack
proofreading activity and have low fidelity and low
processivity. They use damaged DNA as templates and
insert nucleotides opposite the lesions. The active
sites of TLS polymerases are large and flexible to allow
the accomodation of distorted bases. Expression of
Y-family polymerases is often induced by DNA damage and
is believed to be highly regulated. TLS is likely
induced by the monoubiquitination of the replication
clamp PCNA, which provides a scaffold for TLS
polymerases to bind in order to access the lesion.
Because of their high error rates, TLS polymerases are
potential targets for cancer treatment and prevention.
Length = 335
Score = 30.8 bits (70), Expect = 2.3
Identities = 15/65 (23%), Positives = 22/65 (33%), Gaps = 2/65 (3%)
Query: 481 QLEMRKMLLDRMVNLLCRGCVVPVVKYVKQCWQRGDTDISLIRYFVTEVLEAITPPYTPE 540
L +R + R+ L R V V R D + + E LE + P
Sbjct: 267 FLALRGLGARRLSLTLFR--EDGRVTRVLVGLARPSRDDLPLLRLLRERLERLALPRGIA 324
Query: 541 FVQLF 545
V+L
Sbjct: 325 PVRLL 329
>gnl|CDD|185231 PRK15331, PRK15331, chaperone protein SicA; Provisional.
Length = 165
Score = 29.6 bits (66), Expect = 2.5
Identities = 16/45 (35%), Positives = 21/45 (46%), Gaps = 1/45 (2%)
Query: 39 CLEKFSTPDYIMEPGIFSQLKRYFQAGGNPEQVIELLSQN-YKAV 82
C+ F PDY M QLK+ FQ + V L +N Y+ V
Sbjct: 64 CIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPV 108
>gnl|CDD|236007 PRK07379, PRK07379, coproporphyrinogen III oxidase; Provisional.
Length = 400
Score = 30.0 bits (68), Expect = 3.6
Identities = 15/41 (36%), Positives = 20/41 (48%), Gaps = 5/41 (12%)
Query: 573 QTLEHEWHQ-LEDDKKIYQCWPVHFSSVDFVIEPKGTYGRR 612
QTLE W LE + P H S D V+EP +G++
Sbjct: 179 QTLED-WQASLEAAIALN---PTHLSCYDLVLEPGTAFGKQ 215
>gnl|CDD|233925 TIGR02553, SipD_IpaD_SspD, type III effector protein
IpaD/SipD/SspD. These proteins are found within type
III secretion operons and have been shown to be secreted
by that system.
Length = 313
Score = 29.6 bits (66), Expect = 4.7
Identities = 15/82 (18%), Positives = 38/82 (46%), Gaps = 9/82 (10%)
Query: 69 EQVIELLSQNYKAVAQMANLVAEWLILGG------VNVTEVQAMVE---NHLKDMILKTF 119
E V+E + Y+A + + + + +W+ G ++V +++A+++ +HL + F
Sbjct: 130 ENVVEGYTDFYQAFSDILSKMQDWISPGKDGNNVKLDVGKLKALLQQLIDHLPNKKNVLF 189
Query: 120 DPKKADTIFTEEGETPAWLTEM 141
+ +E + W E+
Sbjct: 190 PAQDGGMQLPKEADARRWRKEL 211
>gnl|CDD|218236 pfam04735, Baculo_helicase, Baculovirus DNA helicase.
Length = 1173
Score = 29.9 bits (68), Expect = 5.0
Identities = 22/76 (28%), Positives = 27/76 (35%), Gaps = 3/76 (3%)
Query: 446 LLDEVVTCHTFLHHKVLQLFIELFESKQDELEILVQLEMRKMLLDRMVNLLCRGCVVPVV 505
L C K+ LF EL+ EL IL K + D + NL C C
Sbjct: 593 LFGNCANCKIEEQDKLNDLFRELWNYDHSELIILGLYLNDKKMSDLIHNLKCSECKESA- 651
Query: 506 KYVKQCWQRGDTDISL 521
K+C DI L
Sbjct: 652 --SKKCDCLKKIDIDL 665
>gnl|CDD|219096 pfam06586, TraK, TraK protein. This family consists of several
TraK proteins from Escherichia coli, Salmonella typhi
and Salmonella typhimurium. TraK is known to be
essential for pilus assembly but its exact role in this
process is unknown.
Length = 231
Score = 28.8 bits (65), Expect = 6.5
Identities = 27/157 (17%), Positives = 53/157 (33%), Gaps = 28/157 (17%)
Query: 166 FTIKLISDAG--FQREITSISTAAQQIEVFSRVLKTSISN-FLESSDDWQSSVEECAKMV 222
FT+ + ++ G F +T + + I + S E+S ++ ++ + + +
Sbjct: 87 FTLFVTTEKGRTFSLTLTPKAGPGETIRLTSEPALKREEAAVWETSTPYEKALVKLTRAM 146
Query: 223 CHG------QHTYVYSQVLLQVLSLESK-------GGSKVKRLAQEITKCAQQSGHDVTP 269
G V + L L K KR +T +V P
Sbjct: 147 LTGKLPDGYSVKPVTKRTLTLPKGLRLTPVAVYDGNELKGKRY--RLT--------NVGP 196
Query: 270 IIMALNGAAFYPQANQALSSMISRNSLNPADITVLFR 306
+ L + FY + +SR L P + T ++
Sbjct: 197 KPVELRESDFYSPG--TRAVALSRPVLAPGETTQVYV 231
>gnl|CDD|175984 cd04017, C2D_Ferlin, C2 domain fourth repeat in Ferlin. Ferlins
are involved in vesicle fusion events. Ferlins and
other proteins, such as Synaptotagmins, are implicated
in facilitating the fusion process when cell membranes
fuse together. There are six known human Ferlins:
Dysferlin (Fer1L1), Otoferlin (Fer1L2), Myoferlin
(Fer1L3), Fer1L4, Fer1L5, and Fer1L6. Defects in these
genes can lead to a wide range of diseases including
muscular dystrophy (dysferlin), deafness (otoferlin),
and infertility (fer-1, fertilization factor-1).
Structurally they have 6 tandem C2 domains, designated
as (C2A-C2F) and a single C-terminal transmembrane
domain, though there is a new study that disputes this
and claims that there are actually 7 tandem C2 domains
with another C2 domain inserted between C2D and C2E.
In a subset of them (Dysferlin, Myoferlin, and Fer1)
there is an additional conserved domain called DysF. C2
domains fold into an 8-standed beta-sandwich that can
adopt 2 structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. This cd contains the fourth C2
repeat, C2D, and has a type-II topology.
Length = 135
Score = 27.9 bits (63), Expect = 7.6
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 19/80 (23%)
Query: 283 ANQALSSMISRNSLNPA-DITVLFR---LYSSP-----DPPPI-------DLIRIPQFLE 326
NQ+ + + + +L+P D T++F LY SP +PP + D + +FL
Sbjct: 31 LNQSQETEVIKETLSPTWDQTLIFDEVELYGSPEEIAQNPPLVVVELFDQDSVGKDEFLG 90
Query: 327 LLTDSLFKPGVKLNPEHKPK 346
S+ KP VKL+ E
Sbjct: 91 ---RSVAKPLVKLDLEEDFP 107
>gnl|CDD|225749 COG3208, GrsT, Predicted thioesterase involved in non-ribosomal
peptide biosynthesis [Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 244
Score = 28.8 bits (65), Expect = 7.7
Identities = 20/58 (34%), Positives = 24/58 (41%), Gaps = 10/58 (17%)
Query: 499 GCVVPVVKYVKQCWQRGDTDI--SLIRYFVT--EVLEAITPPYTPEFVQLFLPIVEND 552
GC P KQ D D L+ T E+LE PE + LFLPI+ D
Sbjct: 109 GCRAPHYDRGKQIHHLDDADFLADLVDLGGTPPELLE------DPELMALFLPILRAD 160
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 29.1 bits (66), Expect = 8.2
Identities = 12/28 (42%), Positives = 20/28 (71%), Gaps = 1/28 (3%)
Query: 17 MVDVDSDGGDDPMENPQEVLNECLEKFS 44
++D+ G DP+EN + ++NE LEK+S
Sbjct: 244 LIDIAPIDGSDPVENARIIINE-LEKYS 270
>gnl|CDD|215170 PLN02301, PLN02301, pectinesterase/pectinesterase inhibitor.
Length = 548
Score = 29.1 bits (65), Expect = 8.2
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 10/56 (17%)
Query: 95 LGGVNVTEVQAMVENHLKDMILKT---------FDPKKADTIFTEEGETPAWLTEM 141
L G+N Q+M + LKD+I + P K D + G+ P+WLT
Sbjct: 168 LDGINGPSRQSM-KPGLKDLISRARTSLAILVSVSPAKEDLLMPLSGDFPSWLTSK 222
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.135 0.402
Gapped
Lambda K H
0.267 0.0609 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 31,587,499
Number of extensions: 3090353
Number of successful extensions: 2930
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2926
Number of HSP's successfully gapped: 27
Length of query: 620
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 517
Effective length of database: 6,369,140
Effective search space: 3292845380
Effective search space used: 3292845380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (27.9 bits)