RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3869
(203 letters)
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 35.0 bits (80), Expect = 0.025
Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 114 ELLDTESSDNE--GEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANE 171
+LL+TE NE +N++D +DDN+ D+ Q+ E ++ + P +
Sbjct: 3915 DLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQ 3974
Query: 172 ETLVEPKVESEP---PEQSTTDSQETDQDQDV 200
E +P E+E PE D +E D +D
Sbjct: 3975 ENNSQPPPENEDLDLPEDLKLDEKEGDVSKDS 4006
Score = 33.4 bits (76), Expect = 0.071
Identities = 20/102 (19%), Positives = 38/102 (37%), Gaps = 10/102 (9%)
Query: 112 TIELLDTESSDNEGEDNDNDND-EDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPAN 170
++L + D + D D+D ED D D ++ D+EK P ++E E N
Sbjct: 3986 DLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDE 4045
Query: 171 EE--------TLVEPKVESE-PPEQSTTDSQETDQDQDVPEQ 203
+ + K+ + E + + T+ E+
Sbjct: 4046 DIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEE 4087
Score = 28.4 bits (63), Expect = 3.7
Identities = 24/98 (24%), Positives = 39/98 (39%), Gaps = 16/98 (16%)
Query: 114 ELLDTESSDNEGEDNDNDN---DEDKDD-NEHDPDQSTQDSEKWSPYEEEPFPE------ 163
D+ S+NE D++ +N DE+ +D E + + + + W EE E
Sbjct: 3867 VTKDSVVSENENSDSEEENQDLDEEVNDIPEDLSN--SLNEKLWDEPNEEDLLETEQKSN 3924
Query: 164 RNTAPANEETLV----EPKVESEPPEQSTTDSQETDQD 197
+A NE LV + K + Q D +E D
Sbjct: 3925 EQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDD 3962
Score = 28.0 bits (62), Expect = 4.3
Identities = 22/92 (23%), Positives = 38/92 (41%), Gaps = 11/92 (11%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDN---EHDPDQSTQDS-------EKWSPYEEEPFPERNT 166
D E NE ED N +D D+ + E D + T+DS ++ E N
Sbjct: 3836 DLEELANE-EDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVND 3894
Query: 167 APANEETLVEPKVESEPPEQSTTDSQETDQDQ 198
P + + K+ EP E+ ++++ +Q
Sbjct: 3895 IPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQ 3926
>gnl|CDD|227889 COG5602, SIN3, Histone deacetylase complex, SIN3 component [Chromatin
structure and dynamics].
Length = 1163
Score = 33.4 bits (76), Expect = 0.086
Identities = 22/116 (18%), Positives = 44/116 (37%), Gaps = 8/116 (6%)
Query: 1 MVKNVLDGNMESNSYEDTLRELFGIHAYIAFTLDKIVSNAVRQLQHLVSDETCIECYETF 60
++ G + +E LR + Y +T+D++ + +QL + D I E F
Sbjct: 906 DSYRLIFGALTPCEFEKYLRPFYNNTCYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELF 965
Query: 61 LKESKKGATGGLCETAHQRVHLEQGYQKKSEVAMTDENCFKIIIFKKDCKLTIELL 116
S + +T + R+ +E + DE F+ K I+++
Sbjct: 966 EMNSAASSKRKQDQTKY-RLEVES-------LLNPDEILFRFCWINKFKSFGIQIM 1013
>gnl|CDD|181179 PRK07948, PRK07948, putative monovalent cation/H+ antiporter
subunit F; Reviewed.
Length = 86
Score = 30.0 bits (68), Expect = 0.22
Identities = 10/27 (37%), Positives = 16/27 (59%), Gaps = 2/27 (7%)
Query: 17 DTLRELF--GIHAYIAFTLDKIVSNAV 41
DTL + + + A+TLD V+NA+
Sbjct: 37 DTLVAVTMCALATWAAWTLDTTVTNAM 63
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 31.1 bits (70), Expect = 0.37
Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 5/88 (5%)
Query: 115 LLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETL 174
L E ++ G+D +++++ED DD DQ T++ ++ E E + +E T
Sbjct: 194 LRSMELAEEMGDDTESEDEEDGDD-----DQPTENEQEEQGEGEGEGQEGSAPQESEATD 248
Query: 175 VEPKVESEPPEQSTTDSQETDQDQDVPE 202
E + E QS D + D D
Sbjct: 249 RESESGEEEMVQSDQDDLPDESDDDSET 276
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 30.7 bits (69), Expect = 0.58
Identities = 12/81 (14%), Positives = 34/81 (41%), Gaps = 1/81 (1%)
Query: 118 TESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETLVEP 177
ES ++++ + ++++ E + ++ + + E+E E A E +E
Sbjct: 429 GESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEG 488
Query: 178 KVESE-PPEQSTTDSQETDQD 197
E + E+ D++ + +
Sbjct: 489 SSEGDGDGEEPEEDAERRNSE 509
Score = 27.6 bits (61), Expect = 5.4
Identities = 17/80 (21%), Positives = 34/80 (42%), Gaps = 3/80 (3%)
Query: 119 ESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETLVE-- 176
ES + EGED + + + + D+ + + + + + EE RN+ A + E
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQ 520
Query: 177 -PKVESEPPEQSTTDSQETD 195
P+ S PE + + +
Sbjct: 521 QPRGSSVQPESPQEEPLQPE 540
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 30.6 bits (69), Expect = 0.63
Identities = 19/84 (22%), Positives = 37/84 (44%), Gaps = 1/84 (1%)
Query: 117 DTESSDNEGED-NDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETLV 175
D S + E+ D+ +ED D+ + D DQ + + + EE + + E T
Sbjct: 211 DMLGSMDMAEETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDG 270
Query: 176 EPKVESEPPEQSTTDSQETDQDQD 199
E + +++ DS+ + D+D
Sbjct: 271 EGEEGEMDAAEASEDSESDESDED 294
Score = 27.1 bits (60), Expect = 7.8
Identities = 12/47 (25%), Positives = 26/47 (55%), Gaps = 5/47 (10%)
Query: 117 DTESSDNEGEDNDND-----NDEDKDDNEHDPDQSTQDSEKWSPYEE 158
+ E++D EGE+ + D D + D+++ D + +D+ +P+ E
Sbjct: 264 EAEATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDARPATPFTE 310
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 30.4 bits (68), Expect = 0.80
Identities = 23/95 (24%), Positives = 31/95 (32%), Gaps = 10/95 (10%)
Query: 117 DTESSDNEGEDNDNDNDEDK----------DDNEHDPDQSTQDSEKWSPYEEEPFPERNT 166
D+ S + EG + D D+D DNE D D S Q
Sbjct: 128 DSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGA 187
Query: 167 APANEETLVEPKVESEPPEQSTTDSQETDQDQDVP 201
A A P ++ PP+ S +Q Q Q
Sbjct: 188 ALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPS 222
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 29.7 bits (67), Expect = 0.84
Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 2/49 (4%)
Query: 104 IFKKDCKLTIELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEK 152
K + TI +D SD+EG+ +D+D+DED E D D S D
Sbjct: 232 QAKSSKRRTIAQIDGIDSDDEGDGSDDDDDED--AIESDLDDSDDDVSD 278
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 30.0 bits (68), Expect = 0.96
Identities = 9/59 (15%), Positives = 20/59 (33%)
Query: 145 QSTQDSEKWSPYEEEPFPERNTAPANEETLVEPKVESEPPEQSTTDSQETDQDQDVPEQ 203
+ + +E ++E A E + EP+V ++P + + E
Sbjct: 941 EVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEP 999
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 29.3 bits (66), Expect = 1.1
Identities = 15/88 (17%), Positives = 27/88 (30%), Gaps = 1/88 (1%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEE-EPFPERNTAPANEETLV 175
E + +D + E+ + E + S +K +E E E N E +
Sbjct: 46 ADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDE 105
Query: 176 EPKVESEPPEQSTTDSQETDQDQDVPEQ 203
K E E + + V +
Sbjct: 106 NEKETEEKTESNVEKEITNPSWKPVGTE 133
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 29.7 bits (66), Expect = 1.2
Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 4/75 (5%)
Query: 114 ELLDTESSDNEGEDND---NDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPAN 170
+ D ES+ + +D+D NDN+E D E D + + E+ SP EEE
Sbjct: 402 QCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDN-EGFEELSPEEEERQLREFRDMEK 460
Query: 171 EETLVEPKVESEPPE 185
E+ + E +P E
Sbjct: 461 EDREFPDEAELQPSE 475
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 29.8 bits (67), Expect = 1.2
Identities = 15/89 (16%), Positives = 32/89 (35%), Gaps = 5/89 (5%)
Query: 114 ELLDTESSDNEGEDNDNDNDEDKDDNEHDPD-----QSTQDSEKWSPYEEEPFPERNTAP 168
++ D E N + ++ N ++D D Q D++K N P
Sbjct: 132 DISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQP 191
Query: 169 ANEETLVEPKVESEPPEQSTTDSQETDQD 197
+ + + S+P T + + + +D
Sbjct: 192 NSPKPTQPNQSNSQPASDDTANQKSSSKD 220
>gnl|CDD|177141 MTH00069, ND3, NADH dehydrogenase subunit 3; Provisional.
Length = 114
Score = 28.0 bits (63), Expect = 1.6
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 145 QSTQDSEKWSPYE 157
+ DSEK SPYE
Sbjct: 25 LNNPDSEKLSPYE 37
>gnl|CDD|233037 TIGR00592, pol2, DNA polymerase (pol2). All proteins in this
superfamily for which functions are known are DNA
polymerases.This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1172
Score = 29.3 bits (65), Expect = 1.6
Identities = 22/100 (22%), Positives = 37/100 (37%), Gaps = 7/100 (7%)
Query: 106 KKDCKLTIELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERN 165
+ D +L + D E+ ++G+D ++ D E +T K YE EP
Sbjct: 193 EGDPELKLASFDIETYFHDGKDFFPGDENPAD--EEIMISTTPVIAKQWDYESEPEARVV 250
Query: 166 TAPANEETLVEPKVESEPPEQST-----TDSQETDQDQDV 200
T ++ VES E S + D D ++
Sbjct: 251 TWKKPDKPTTGSYVESVSEEISMIKRFWDVIDQEDTDVEI 290
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 28.6 bits (64), Expect = 1.7
Identities = 8/63 (12%), Positives = 25/63 (39%)
Query: 121 SDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETLVEPKVE 180
+ D++ + E++ +E + + + + ++ P+ T +T +
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
Query: 181 SEP 183
S+P
Sbjct: 149 SKP 151
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 28.8 bits (64), Expect = 1.8
Identities = 15/65 (23%), Positives = 25/65 (38%), Gaps = 1/65 (1%)
Query: 121 SDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWS-PYEEEPFPERNTAPANEETLVEPKV 179
+D D D D K+ N ++ + ST +S++ S +E P + P
Sbjct: 116 ADLLTPPPDVDEDALKEANVNETESSTDESDRSSHSHEVRSKSNFPMGPPSPWNPRFPPG 175
Query: 180 ESEPP 184
PP
Sbjct: 176 PPPPP 180
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 29.2 bits (65), Expect = 1.8
Identities = 17/103 (16%), Positives = 40/103 (38%), Gaps = 7/103 (6%)
Query: 66 KGATGGLCETAHQRVHLEQGYQKKSEVAMTDENCFKIIIFKKDCKLTIELLDTESSDNEG 125
KG G + H+ + + T + ++ + +L E D +G
Sbjct: 792 KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKG 851
Query: 126 EDN-------DNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPF 161
D D++ +E++++ E + ++ ++ E+ EEP
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPL 894
Score = 26.9 bits (59), Expect = 9.6
Identities = 10/46 (21%), Positives = 24/46 (52%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFP 162
D + D G + D++E++++ E + ++ ++ E+ EE P
Sbjct: 848 DEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEP 893
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 29.2 bits (65), Expect = 1.9
Identities = 20/81 (24%), Positives = 29/81 (35%), Gaps = 8/81 (9%)
Query: 125 GEDNDNDNDE---DKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETLVEPKVES 181
++ N D DN DP + + +P EE FP P ++E E V
Sbjct: 308 DDEFAVPNFNEGLDVPDNPQDPVPPPNEGKDGNPNEENLFP-----PGDDEVPDESNVPP 362
Query: 182 EPPEQSTTDSQETDQDQDVPE 202
PP + E D + P
Sbjct: 363 NPPNVPGGSNSEFSSDVENPP 383
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 28.8 bits (65), Expect = 2.1
Identities = 10/36 (27%), Positives = 15/36 (41%)
Query: 168 PANEETLVEPKVESEPPEQSTTDSQETDQDQDVPEQ 203
PA + E PP + D E +++QD E
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAED 294
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 28.8 bits (64), Expect = 2.1
Identities = 12/35 (34%), Positives = 24/35 (68%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSE 151
D + D++ +D+++++D+D DD+E D D+ DS
Sbjct: 54 DDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDST 88
Score = 26.9 bits (59), Expect = 8.8
Identities = 13/38 (34%), Positives = 23/38 (60%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWS 154
D + D+E ED+D+D+D++ D++E D D + D
Sbjct: 59 DDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSAD 96
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 28.5 bits (64), Expect = 2.1
Identities = 7/37 (18%), Positives = 22/37 (59%)
Query: 113 IELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQD 149
+E + E +++ +++++++ED+D+ E + D
Sbjct: 283 LEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 28.5 bits (64), Expect = 2.3
Identities = 6/44 (13%), Positives = 24/44 (54%)
Query: 103 IIFKKDCKLTIELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQS 146
++ + L L E + E + +++++++++D++E + +
Sbjct: 271 VVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDD 314
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 28.7 bits (64), Expect = 2.3
Identities = 18/93 (19%), Positives = 31/93 (33%), Gaps = 7/93 (7%)
Query: 117 DTESSDNEGEDND-NDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNT------APA 169
D ++ N + ND + +D + QS S EE+ NT
Sbjct: 129 DADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENTFKFYNFGND 188
Query: 170 NEETLVEPKVESEPPEQSTTDSQETDQDQDVPE 202
EE + +S+ + + + D PE
Sbjct: 189 GEEAAAKDGGKSKSSDPGPLNDSDGQGDDGDPE 221
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 28.1 bits (63), Expect = 3.2
Identities = 16/78 (20%), Positives = 27/78 (34%), Gaps = 5/78 (6%)
Query: 124 EGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETLVEPKVESEP 183
EGED + D++D + D + D+ + + +P + P + P V S P
Sbjct: 144 EGEDPNPGPPLDEEDEDADVATTNSDN-SFPGEDADPASASPSDPPSS----SPGVPSFP 198
Query: 184 PEQSTTDSQETDQDQDVP 201
S P
Sbjct: 199 SPPEDPSSPSDSSLPPAP 216
>gnl|CDD|221569 pfam12420, DUF3671, Protein of unknown function. This domain
family is found in eukaryotes, and is typically between
96 and 116 amino acids in length.
Length = 106
Score = 26.9 bits (60), Expect = 3.6
Identities = 9/18 (50%), Positives = 11/18 (61%)
Query: 96 DENCFKIIIFKKDCKLTI 113
D+ FK IIFKK + I
Sbjct: 33 DKKKFKKIIFKKYGLIFI 50
>gnl|CDD|215122 PLN02190, PLN02190, cellulose synthase-like protein.
Length = 756
Score = 28.3 bits (63), Expect = 3.6
Identities = 8/13 (61%), Positives = 11/13 (84%)
Query: 152 KWSPYEEEPFPER 164
KWSP E +P+P+R
Sbjct: 72 KWSPAEYKPYPDR 84
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 28.0 bits (63), Expect = 4.2
Identities = 9/36 (25%), Positives = 20/36 (55%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEK 152
+++ D E +++D D+D DD++ D + E+
Sbjct: 132 ESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLER 167
>gnl|CDD|193564 cd08017, M20_IAA_Hyd, M20 Peptidase Indole-3-acetic acid amino acid
hydrolase. Peptidase M20 family, Plant Aminoacyclase-1
indole-3-acetic-L-aspartic acid hydrolase (IAA-Asp
hydrolase; IAAspH; IAAH; IAA amidohydrolase; EC 3.5.1.-)
subfamily. IAAspH hydrolyzes indole-3-acetyl-N-aspartic
acid (IAA or auxin) to indole-3-acetic acid. Genes
encoding IAA-amidohydrolases were first cloned from
Arabidopsis; ILR1, IAR3, ILL1 and ILL2 encode active
IAA- amino acid hydrolases, and three additional
amidohydrolase-like genes (ILL3, ILL5, ILL6) have been
isolated. In higher plants, the growth regulator
indole-3-acetic acid (IAA or auxin) is found both free
and conjugated via amide bonding to a variety of amino
acids and peptides, and via an ester linkage to
carbohydrates. IAA-Asp conjugates are involved in
homeostatic control, protection, storing and subsequent
use of free IAA. IAA-Asp is also found in some plants as
a unique intermediate for entering into IAA
non-decarboxylative oxidative pathway. IAA
amidohydrolase cleaves the amide bond between the auxin
and the conjugated amino acid. Enterobacter agglomerans
IAAspH has very strong enzyme activity and substrate
specificity towards IAA-Asp, although its substrate
affinity is weaker compared to Arabidopsis enzymes of
the ILR1 gene family. Enhanced IAA-hydrolase activity
has been observed during clubroot disease in Chinese
cabbage.
Length = 377
Score = 27.7 bits (62), Expect = 4.4
Identities = 15/32 (46%), Positives = 20/32 (62%), Gaps = 3/32 (9%)
Query: 24 GIHAYIA-FTLDKIV--SNAVRQLQHLVSDET 52
G HA + T+D ++ S+AV LQ LVS ET
Sbjct: 179 GGHAAMPHHTVDPVLAASSAVVALQQLVSRET 210
>gnl|CDD|198367 cd07795, FGGY_ScGut1p_like, Saccharomyces cerevisiae Gut1p and
related proteins; belongs to the FGGY family of
carbohydrate kinases. This subgroup corresponds to a
small group of fungal glycerol kinases (GK), including
Saccharomyces cerevisiae Gut1p/YHL032Cp, which
phosphorylates glycerol to glycerol-3-phosphate in the
cytosol. Glycerol utilization has been considered as the
sole source of carbon and energy in S. cerevisiae, and
is mediated by glycerol kinase and glycerol 3-phosphate
dehydrogenase, which is encoded by the GUT2 gene.
Members in this family show high similarity to their
prokaryotic and eukaryotic homologs. GKs belong to the
FGGY family of carbohydrate kinases, the monomers of
which contain two large domains, which are separated by
a deep cleft that forms the active site. This model
includes both the N-terminal domain, which adopts a
ribonuclease H-like fold, and the structurally related
C-terminal domain.
Length = 496
Score = 27.8 bits (62), Expect = 4.5
Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 2/32 (6%)
Query: 9 NMESNSYEDTLRELFGIHAYIAFTLDKIVSNA 40
N+ + Y+ L + FGI I L +IVS++
Sbjct: 199 NLATLKYDQKLLDFFGIDKKI--ILPEIVSSS 228
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 27.0 bits (60), Expect = 4.6
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 121 SDNEGEDNDNDNDEDKDDNEHDPD 144
D +G D D+D+++D+DD E D +
Sbjct: 47 IDEDGGDIDDDDEDDEDDEEADAE 70
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 27.7 bits (61), Expect = 4.7
Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 2/67 (2%)
Query: 109 CKLTIELLDTESSDNEGEDNDNDNDEDKD--DNEHDPDQSTQDSEKWSPYEEEPFPERNT 166
CK +E +D E D GE +D+ + E K NE + +Q+ Q ++ P E
Sbjct: 141 CKEDLESMDKEVDDILGEGSDDSDSEKKKPPGNEEEQEQAPQPRKQQPPGPRREPDEGTP 200
Query: 167 APANEET 173
+ +
Sbjct: 201 SSSERSN 207
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 28.0 bits (62), Expect = 4.8
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 116 LDTESSDNEGEDNDNDNDEDKDDN 139
+ E +D+E E + D+ ED DDN
Sbjct: 778 SEDEMADDEAESENMDDYEDSDDN 801
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 27.7 bits (62), Expect = 4.9
Identities = 9/43 (20%), Positives = 24/43 (55%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEE 159
+ E D ++++ D+D+D ++ E D D S ++ ++ ++
Sbjct: 323 EEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDD 365
Score = 27.3 bits (61), Expect = 6.7
Identities = 13/71 (18%), Positives = 28/71 (39%)
Query: 116 LDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETLV 175
+ E + +G D++++ D+D D E + D D E+ E+ + E+
Sbjct: 320 EEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKK 379
Query: 176 EPKVESEPPEQ 186
+ K +
Sbjct: 380 KKKSAESTRSE 390
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 27.4 bits (61), Expect = 4.9
Identities = 11/57 (19%), Positives = 26/57 (45%)
Query: 110 KLTIELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNT 166
+ + +NE ++ D +ED + E + ++ +DSE+ + + E + N
Sbjct: 8 AKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENN 64
>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
Length = 520
Score = 27.4 bits (61), Expect = 5.3
Identities = 16/92 (17%), Positives = 32/92 (34%), Gaps = 15/92 (16%)
Query: 126 EDNDNDNDEDKDDNE----HDPDQST----QDSEKW---SPYEEEPFPERNTAPAN---- 170
D+D++ D++ DD+ D D+ +D ++W E +
Sbjct: 426 PDDDDEGDDEDDDDWEDLGFDLDEDDVYDLKDVDEWRLERRIATAERWELGQRLLDAYMT 485
Query: 171 EETLVEPKVESEPPEQSTTDSQETDQDQDVPE 202
E + E +VE+ + S + D
Sbjct: 486 EAEVTEAEVEARRVRLNEDLSPSWVKSFDFRS 517
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 27.5 bits (61), Expect = 6.8
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 122 DNEGEDNDNDNDEDKDDNE 140
+E ED D D DED+D++E
Sbjct: 892 ADEDEDQDEDEDEDEDEDE 910
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 27.3 bits (60), Expect = 6.8
Identities = 15/88 (17%), Positives = 35/88 (39%)
Query: 114 ELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEET 173
E+ D + + + + D++ED DD+E + ++ST + + +P ++ T
Sbjct: 98 EVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGT 157
Query: 174 LVEPKVESEPPEQSTTDSQETDQDQDVP 201
+ K S + +Q+
Sbjct: 158 DLNTKQSQTGLGASGSHAQQDPAVSQSG 185
>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 702
Score = 27.3 bits (60), Expect = 7.2
Identities = 25/83 (30%), Positives = 32/83 (38%), Gaps = 6/83 (7%)
Query: 114 ELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWS-PYEEEPFPERNTAPANE- 171
E+L +E G+ N + + E P + Q E S P EP PE P E
Sbjct: 365 EILVSEPVQQNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEP 424
Query: 172 ----ETLVEPKVESEPPEQSTTD 190
E EP+ E EP Q D
Sbjct: 425 EPEPEPEPEPEPEPEPEPQPNQD 447
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 26.8 bits (59), Expect = 7.5
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 4/38 (10%)
Query: 119 ESSDNEGEDNDNDNDE----DKDDNEHDPDQSTQDSEK 152
SD++ + ND DE D DDNE + D +T ++
Sbjct: 271 YESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDE 308
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 27.2 bits (60), Expect = 7.5
Identities = 19/80 (23%), Positives = 30/80 (37%)
Query: 121 SDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEEPFPERNTAPANEETLVEPKVE 180
SD+ ND + EDK E + E EEE E + L + K +
Sbjct: 288 SDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGK 347
Query: 181 SEPPEQSTTDSQETDQDQDV 200
++ +DS + D D+
Sbjct: 348 KNGLDKDDSDSGDDSDDSDI 367
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 27.3 bits (60), Expect = 7.7
Identities = 11/43 (25%), Positives = 27/43 (62%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEE 159
D DN D+D++++++ DD+E D ++ ++ E+ +++E
Sbjct: 144 DMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDE 186
>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus. This
family represents the C-terminus (approximately 500
residues) of the eukaryotic coatomer alpha subunit.
Coatomer (COPI) is a large cytosolic protein complex
which forms a coat around vesicles budding from the
Golgi apparatus. Such coatomer-coated vesicles have been
proposed to play a role in many distinct steps of
intracellular transport. Note that many family members
also contain the pfam04053 domain.
Length = 421
Score = 27.1 bits (60), Expect = 7.9
Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 8/84 (9%)
Query: 117 DTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEK-------WSPYEEEPFPERNTAPA 169
D+ GED D DED D ++ + W ++ P P
Sbjct: 34 AVMGGDDWGEDADLGLDEDGYLEGEDGLLDDEEGPEGGDEEGGWDVEDDLVLPPELDVPK 93
Query: 170 NEETLVEPKVESEPPEQSTTDSQE 193
++ + V P + SQ
Sbjct: 94 DQAGNADSDVFVAPN-PGMSVSQI 116
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 27.1 bits (60), Expect = 8.2
Identities = 10/38 (26%), Positives = 18/38 (47%)
Query: 114 ELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSE 151
E ++ S++E + D + DE +DD E + D
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKA 703
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 27.1 bits (60), Expect = 8.3
Identities = 12/61 (19%), Positives = 20/61 (32%), Gaps = 11/61 (18%)
Query: 143 PDQSTQDSEKWSPYEEEPFPERNTAPANEETLVEPKVESEPPEQSTTDSQETDQDQDVPE 202
++T+ E P P P E P E TD + ++ +PE
Sbjct: 260 LPRATRLPEPEPQPPPPPPPPEPPEPEEE-----------PDEPDQTDPDDGEETDQIPE 308
Query: 203 Q 203
+
Sbjct: 309 E 309
>gnl|CDD|222046 pfam13324, GCIP, Grap2 and cyclin-D-interacting. GCIP, or Grap2
and cyclin-D-interacting protein, is found in
eukaryotes, and in the human protein CCNDBP1, residues
149-190 constitute a helix-loop-helix domain, residues
190-240 an acidic region, and 240-261 a leucine zipper
domain. GCIP interacts with full-length Grap2 protein
and with the COOH-terminal unique and SH3 domains
(designated QC domain) of Grap2. It is potentially
involved in the regulation of cell differentiation and
proliferation through Grap2 and cyclin D-mediated
signalling pathways. In mice, it is involved in
G1/S-phase progression of hepatocytes, which in older
animals is associated with the development of liver
tumours. In vitro it acts as an inhibitory HLH protein,
for example, blocking transcription of the HNF-4
promoter. In its function as a cyclin D1-binding protein
it is able to reduce CDK4-mediated phosphorylation of
the retinoblastoma protein and to inhibit E2F-mediated
transcriptional activity. GCIP has also been shown to
have interact physically with Rad (Ras associated with
diabetes), Rad being important in regulating cellular
senescence.
Length = 271
Score = 26.6 bits (59), Expect = 8.8
Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 3/59 (5%)
Query: 101 KIIIFKKDCKLTIELLDTESSDNEGEDNDNDNDEDKDDNEHDPDQSTQDSEKWSPYEEE 159
++ + KD ++ S+++ ED +D D DD E D D WS E E
Sbjct: 130 QVAVSVKDVLREMKEAKEGCSNDDSEDPFSDGHHDDDDVEGDD--LNDDRY-WSEEEME 185
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.306 0.127 0.357
Gapped
Lambda K H
0.267 0.0563 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,719,132
Number of extensions: 853792
Number of successful extensions: 1431
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1088
Number of HSP's successfully gapped: 221
Length of query: 203
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 111
Effective length of database: 6,857,034
Effective search space: 761130774
Effective search space used: 761130774
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 57 (26.1 bits)