RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy545
(531 letters)
>gnl|CDD|238363 cd00707, Pancreat_lipase_like, Pancreatic lipase-like enzymes.
Lipases are esterases that can hydrolyze long-chain
acyl-triglycerides into di- and monoglycerides,
glycerol, and free fatty acids at a water/lipid
interface. A typical feature of lipases is "interfacial
activation," the process of becoming active at the
lipid/water interface, although several examples of
lipases have been identified that do not undergo
interfacial activation . The active site of a lipase
contains a catalytic triad consisting of Ser - His -
Asp/Glu, but unlike most serine proteases, the active
site is buried inside the structure. A "lid" or "flap"
covers the active site, making it inaccessible to
solvent and substrates. The lid opens during the process
of interfacial activation, allowing the lipid substrate
access to the active site.
Length = 275
Score = 378 bits (973), Expect = e-129
Identities = 141/299 (47%), Positives = 182/299 (60%), Gaps = 25/299 (8%)
Query: 75 IDTRFILYTRKNPTEGHMLKVQNERTIEKSNFDPKKKTKFIIHGFIDTPLSSWVKEMRRE 134
ID RF+LYTR+NP +L + +++ SNF+P + T+FIIHG+ + SW+ ++R+
Sbjct: 1 IDVRFLLYTRENPNCPQLLFADDPSSLKNSNFNPSRPTRFIIHGWTSSGEESWISDLRKA 60
Query: 135 LLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELAYFVNYLKDNYGLNPADVHMIGHS 194
L D+NVIVVDW G+ P Y QA NTR+VG ELA F+++L DN GL+ +VH+IGHS
Sbjct: 61 YLSRGDYNVIVVDWGRGANPNYPQAVNNTRVVGAELAKFLDFLVDNTGLSLENVHLIGHS 120
Query: 195 LGAHTAGYAGQAIEG-LGRITGLDPAEPYFQGMPPFARLDPTDADLVDVIHTDGSSIFLL 253
LGAH AG+AG+ + G LGRITGLDPA P F G P RLDP+DA VDVIHTDG
Sbjct: 121 LGAHVAGFAGKRLNGKLGRITGLDPAGPLFSGADPEDRLDPSDAQFVDVIHTDGG----- 175
Query: 254 GYGMSETCGHLDFYPNNGKEQPGCDLTETPLPLTLIKEGIEEAGRVLVACNHVRAIKLFT 313
G S+ GH DFYPN G++QPGC + VAC+H RA+ F
Sbjct: 176 LLGFSQPIGHADFYPNGGRDQPGC--------------PKDILSSDFVACSHQRAVHYFA 221
Query: 314 ESINSKCPYVAHRCPSYQHFLQGRCFSCGENGTGCALMGLEAQKTNHPPGSKYYITTGK 372
ESI S C +VA+ C SY FL G+CF CG GC MG A + K+Y+ T
Sbjct: 222 ESILSPCGFVAYPCSSYDEFLAGKCFPCGS---GCVRMGYHADRF--RREGKFYLKTNA 275
>gnl|CDD|215752 pfam00151, Lipase, Lipase.
Length = 329
Score = 331 bits (850), Expect = e-110
Identities = 144/341 (42%), Positives = 186/341 (54%), Gaps = 23/341 (6%)
Query: 44 CYDELGCLNVTRDWYHL-IYRPFNIFPLPRAVIDTRFILYTRKNPTEGHMLKVQNERTIE 102
CY LGC + W + RP P I+ RF+LYT +NP ++ TIE
Sbjct: 4 CYGLLGCFGDKKPWAGNTLARPAKSLPWQPKFIEVRFLLYTNENPNNCQLITTGLPSTIE 63
Query: 103 KSNFDPKKKTKFIIHGFIDTPLS-SWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATA 161
SNF+ KKT+ IIHGF D SW+ +M + L + NVIVVDW GS Y QAT
Sbjct: 64 NSNFNTSKKTRAIIHGFTDKGQEESWLSDMCKNLFQVEGVNVIVVDWGSGSTTFYRQATL 123
Query: 162 NTRLVGLELAYFVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEG---LGRITGLDP 218
N R+VG E+A + L++ ++P +VH+IGHSLGAH AG AG+ +G LGRITGLDP
Sbjct: 124 NVRVVGAEVAKLLVELEEELNVSPENVHLIGHSLGAHVAGEAGRRTKGKTKLGRITGLDP 183
Query: 219 AEPYFQGMPPFARLDPTDADLVDVIHTDGSSIFLLGYGMSETCGHLDFYPNNGKEQPGCD 278
A PYF+G P RLDP DAD VD IHTD I LG G S+ GH+DF+PN G EQPGC
Sbjct: 184 AGPYFKGTPELTRLDPGDADFVDAIHTDTRPIPGLGMGTSQRVGHVDFFPNGGSEQPGCQ 243
Query: 279 LTETPLPLTLIKEGIEEAGRVLVACNHVRAIKLFTESINSKCPYVAHRCPSYQHFLQGRC 338
G VAC H+R+++ F ES+ + + A+ C SY F +C
Sbjct: 244 -------------NNVLEGTQFVACAHMRSVRYFAESLLNPRNFPAYPCSSYDEFSNNKC 290
Query: 339 FSCGENGTGCALMGLEAQKTNHPPGS---KYYITTGKEVPF 376
F C + C MG+ A K + YY+ T + PF
Sbjct: 291 FGCPKG--RCPYMGIYADKFDGKTSKLEGDYYLNTNSKSPF 329
>gnl|CDD|132274 TIGR03230, lipo_lipase, lipoprotein lipase. Members of this
protein family are lipoprotein lipase (EC 3.1.1.34), a
eukaryotic triacylglycerol lipase active in plasma and
similar to pancreatic and hepatic triacylglycerol
lipases (EC 3.1.1.3). It is also called clearing factor.
It cleaves chylomicron and VLDL triacylglycerols; it
also has phospholipase A-1 activity.
Length = 442
Score = 210 bits (536), Expect = 3e-62
Identities = 136/392 (34%), Positives = 213/392 (54%), Gaps = 28/392 (7%)
Query: 75 IDTRFILYTRKNPTEG--HMLKVQNERTIEKSNFDPKKKTKFIIHGFIDTPL-SSWVKEM 131
I+++F L T + P + +++ Q + +I NF+ + KT +IHG+ T + SWV ++
Sbjct: 5 IESKFSLRTPEEPDDDTCYIVPGQPD-SIADCNFNHETKTFIVIHGWTVTGMFESWVPKL 63
Query: 132 RRELLKHS-DWNVIVVDWAGGSLPLYTQATANTRLVGLELAYFVNYLKDNYGLNPADVHM 190
L + NVIVVDW + Y + A T+LVG ++A FVN++++ + +VH+
Sbjct: 64 VAALYEREPSANVIVVDWLSRAQQHYPTSAAYTKLVGKDVAKFVNWMQEEFNYPWDNVHL 123
Query: 191 IGHSLGAHTAGYAGQAIE-GLGRITGLDPAEPYFQGMPPFARLDPTDADLVDVIHTD--- 246
+G+SLGAH AG AG + + RITGLDPA P F+ + L P DAD VDV+HT+
Sbjct: 124 LGYSLGAHVAGIAGSLTKHKVNRITGLDPAGPTFEYADAPSTLSPDDADFVDVLHTNTRG 183
Query: 247 --GSSIFLLGYGMSETCGHLDFYPNNGKEQPGCDLTETPLPLTLIKE-GIEEAGRVLVAC 303
SI G+ GH+D YPN G QPGCD+ ET L +I E G+ + LV C
Sbjct: 184 SPDRSI-----GIQRPVGHIDIYPNGGTFQPGCDIQET---LLVIAEKGLGNMDQ-LVKC 234
Query: 304 NHVRAIKLFTES-INSKCPYVAHRCPSYQHFLQGRCFSCGENGTGCALMGLEAQKTNHPP 362
+H R+I LF +S +N + P +A+RC S + F +G C SC +N C +G E K
Sbjct: 235 SHERSIHLFIDSLLNEENPSMAYRCSSKEAFNKGLCLSCRKN--RCNKLGYEINKVRTKR 292
Query: 363 GSKYYITTGKEVPFCRHHYRITLDLARPPRAETWVQGFMRVSLHSDNGVIRNLDLTPNGY 422
SK Y+ T + +P+ HY++ + + + M++SL+ +G N+ P
Sbjct: 293 SSKMYLKTREMMPYKVFHYQVKVHFF-GKTSLSHTDQPMKISLYGTHGEKENI---PFTL 348
Query: 423 EKLEHGTSRSFVVTHPEDVGTINKVEFYWEYD 454
++ + SF++T D+G + V+ WE D
Sbjct: 349 PEVSTNKTYSFLITTDVDIGELLMVKLKWEKD 380
>gnl|CDD|238382 cd00741, Lipase, Lipase. Lipases are esterases that can hydrolyze
long-chain acyl-triglycerides into di- and
monoglycerides, glycerol, and free fatty acids at a
water/lipid interface. A typical feature of lipases is
"interfacial activation", the process of becoming active
at the lipid/water interface, although several examples
of lipases have been identified that do not undergo
interfacial activation . The active site of a lipase
contains a catalytic triad consisting of Ser - His -
Asp/Glu, but unlike most serine proteases, the active
site is buried inside the structure. A "lid" or "flap"
covers the active site, making it inaccessible to
solvent and substrates. The lid opens during the process
of interfacial activation, allowing the lipid substrate
access to the active site.
Length = 153
Score = 100 bits (251), Expect = 5e-25
Identities = 42/156 (26%), Positives = 54/156 (34%), Gaps = 12/156 (7%)
Query: 161 ANTRLVGLELAYFVNYLKDNYGLN--PADVHMIGHSLGAHTAGYAG-----QAIEGLGRI 213
LA V L + +H+ GHSLG AG AG + + L R+
Sbjct: 1 KGFYKAARSLANLVLPLLKSALAQYPDYKIHVTGHSLGGALAGLAGLDLRGRGLGRLVRV 60
Query: 214 TGLDPAEPYFQGMPPFARLDPTDADLVDVIHTDGSSI-FLLGYGMSETCGHLDFYPNNGK 272
P RLDP+DA VD I D + L G G +FY N GK
Sbjct: 61 YTFGPPRVGNAAFAED-RLDPSDALFVDRIVNDNDIVPRLPPGGEGYPHGGAEFYINGGK 119
Query: 273 EQPGCDLTETPLPLTLIKEGIEEAGRVLVACNHVRA 308
QPGC + C+H+R
Sbjct: 120 SQPGCCKNVLEAVD---IDFGNIGLSGNGLCDHLRY 152
>gnl|CDD|221720 pfam12697, Abhydrolase_6, Alpha/beta hydrolase family. This family
contains alpha/beta hydrolase enzymes of diverse
specificity.
Length = 187
Score = 42.4 bits (100), Expect = 2e-04
Identities = 22/119 (18%), Positives = 43/119 (36%), Gaps = 12/119 (10%)
Query: 115 IIHGFIDTPLSSWVKEMRRELLKHSDWNVIVVDWAG-GSLPLYTQATANTRLVGLELAYF 173
++HG + SW + + L + + V+ D G G + + +LA
Sbjct: 3 LLHGAGGSA-ESW-RPLAEAL--AAGYRVLAPDLPGHGDSDGPPRTPYSLEDDAADLAAL 58
Query: 174 VNYLKDNYGLNPADVHMIGHSLGAHTA-GYAGQAIEGLGRITGLDPAEPYFQGMPPFAR 231
++ L V ++GHSLG A A + E + + + P + +
Sbjct: 59 LDAL------GLGPVVLVGHSLGGAVALAAAARRPERVAGLVLISPPLRDLEELLAADA 111
>gnl|CDD|238853 cd01755, PLAT_lipase, PLAT/ LH2 domain present in connection with a
lipase domain. This family contains two major subgroups,
the lipoprotein lipase (LPL) and the pancreatic
triglyceride lipase. LPL is a key enzyme in catabolism
of plasma lipoprotein triglycerides (TGs). The central
role of triglyceride lipases is in energy production. In
general, PLAT/LH2 domain's proposed function is to
mediate interaction with lipids or membrane bound
proteins.
Length = 120
Score = 37.3 bits (87), Expect = 0.003
Identities = 26/106 (24%), Positives = 45/106 (42%), Gaps = 10/106 (9%)
Query: 379 HHYRITLDLARPPRAETWVQGFMRVSLHSDNGVIRNLDLTPNGYEKLEHGTSRSFVVTHP 438
HY++ + L+ + V G VSL+ G L + +L+ + SF++
Sbjct: 1 WHYQVKVHLSG--KKNLEVDGTFTVSLYGTKGETEQLPIV---LGELKPNKTYSFLIDTE 55
Query: 439 EDVGTINKVEFYWEYDMDVLQPRSLCFLWCNDHLYVSSIKVTETKD 484
D+G + KV+F WE +V+ S L L I+V +
Sbjct: 56 VDIGDLLKVKFKWEN--NVINSNSGETL---PKLGARKIRVKSGET 96
>gnl|CDD|223669 COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta
hydrolase superfamily) [General function prediction
only].
Length = 282
Score = 36.9 bits (84), Expect = 0.018
Identities = 24/111 (21%), Positives = 37/111 (33%), Gaps = 11/111 (9%)
Query: 106 FDPKKKTKFIIHGFIDTPLSSWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRL 165
++HGF + S W + + + VI D G +
Sbjct: 17 AGGGGPPLVLLHGFPGSS-SVWRPVFKVLPALAARYRVIAPDLRGHGR------SDPAGY 69
Query: 166 VGLELAYFVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEGLGRITGL 216
A + L D GL V ++GHS+G A A+ R+ GL
Sbjct: 70 SLSAYADDLAALLDALGLEK--VVLVGHSMGGAVALAL--ALRHPDRVRGL 116
>gnl|CDD|221718 pfam12695, Abhydrolase_5, Alpha/beta hydrolase family. This family
contains a diverse range of alpha/beta hydrolase
enzymes.
Length = 145
Score = 34.7 bits (80), Expect = 0.036
Identities = 19/111 (17%), Positives = 34/111 (30%), Gaps = 21/111 (18%)
Query: 115 IIHGFIDTPLSSWVKEMRR--ELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELAY 172
++HG + L +NV+ VD+ G A ++
Sbjct: 4 LLHGA-----GGDPEAYAPLARALASRGYNVVAVDYPGHGA--SLGAPDAEAVL------ 50
Query: 173 FVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEGLGRITGLDPAEPYF 223
+ L+P + ++GHSLG A + L +P
Sbjct: 51 ------ADAPLDPERIVLVGHSLGGGVALLLAARDPRVKAAVVLAAGDPPD 95
>gnl|CDD|201306 pfam00561, Abhydrolase_1, alpha/beta hydrolase fold. This
catalytic domain is found in a very wide range of
enzymes.
Length = 226
Score = 33.3 bits (76), Expect = 0.22
Identities = 20/86 (23%), Positives = 36/86 (41%), Gaps = 9/86 (10%)
Query: 141 WNVIVVDWAG-GSLPLYTQATANTRLVGLE-LAYFVNYLKDNYGLNPADVHMIGHSLGAH 198
++VI D G G + + + LA + L D GL+ V+++GHS+G
Sbjct: 1 FDVIAFDLRGFG----RSSPPKDFADYRFDDLAEDLEALLDALGLDK--VNLVGHSMGGL 54
Query: 199 TAG-YAGQAIEGLGRITGLDPAEPYF 223
A YA + + + + + P
Sbjct: 55 IALAYAAKYPDRVKALVLVGTVHPAG 80
>gnl|CDD|237934 PRK15290, lfpB, fimbrial chaperone protein; Provisional.
Length = 243
Score = 33.4 bits (76), Expect = 0.23
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 217 DPAEPYFQGMPPFARLDPTDADLVDVIHTDGSSI 250
D ++ F +PP +RL+P+ ++ +IHT G S+
Sbjct: 81 DKSKAPFTVIPPVSRLEPSQEKVLRIIHTKGVSL 114
>gnl|CDD|216521 pfam01477, PLAT, PLAT/LH2 domain. This domain is found in a
variety of membrane or lipid associated proteins. It is
called the PLAT (Polycystin-1, Lipoxygenase,
Alpha-Toxin) domain or LH2 (Lipoxygenase homology)
domain. The known structure of pancreatic lipase shows
this domain binds to procolipase pfam01114, which
mediates membrane association. So it appears possible
that this domain mediates membrane attachment via other
protein binding partners. The structure of this domain
is known for many members of the family and is composed
of a beta sandwich.
Length = 115
Score = 31.6 bits (72), Expect = 0.34
Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 12/80 (15%)
Query: 401 MRVSLHSDNGVIRNLDLTPNGYEKLEHGTSRSFVVTHPEDVGTINKVEFYWEYDMDVLQP 460
+ +SL+ +G L + + G++ SF ED+G + ++ WE
Sbjct: 19 VYISLYGTHGKSGEYPLLEPELKPDDRGSTDSFTFDTDEDLGDLGAIKIRWEI------- 71
Query: 461 RSLCFLWCNDHLYVSSIKVT 480
+D ++ SI V
Sbjct: 72 -----NGHSDEWFLKSILVK 86
>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 1061
Score = 32.8 bits (75), Expect = 0.60
Identities = 20/81 (24%), Positives = 32/81 (39%), Gaps = 4/81 (4%)
Query: 126 SWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELAYFVNYLKDNYGLNP 185
S + L + +++ V +A L + A L + +A L ++G+ P
Sbjct: 554 SAFDALEALLHRLLGFSLPEVIFAPDYPFLESIDFAQPALFAVSVALAA--LWRSWGVIP 611
Query: 186 ADVHMIGHSLGAHTAGYAGQA 206
V IGHSLG A A
Sbjct: 612 GAV--IGHSLGELAAAVAAGV 630
>gnl|CDD|238061 cd00113, PLAT, PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin)
domain or LH2 (Lipoxygenase homology 2) domain. It
consists of an eight stranded beta-barrel. The domain
can be found in various domain architectures, in case of
lipoxygenases, alpha toxin, lipases and polycystin, but
also as a single domain or as repeats.The putative
function of this domain is to facilitate access to
sequestered membrane or micelle bound substrates.
Length = 116
Score = 30.8 bits (70), Expect = 0.64
Identities = 24/111 (21%), Positives = 38/111 (34%), Gaps = 19/111 (17%)
Query: 379 HHYRITLDLARPPRAETWVQGFMRVSLHSDNGVI-RNLDLTPNGYEKLEHGTSRSFVVTH 437
Y +T+ + + ++L+ +NG L G E G++ +F +
Sbjct: 1 CRYTVTIKTG--DKKGAGTDSNISLALYGENGNSSDIPILDGPG--SFERGSTDTFQIDL 56
Query: 438 PEDVGTINKVEFYWEYDMDVLQPRSLCFLWCNDHLYVSSIKV--TETKDIM 486
D+G I KV D L W Y SI V TK +
Sbjct: 57 KLDIGDITKVYLRR--DGSGLSDG-----W-----YCESITVQALGTKKVY 95
>gnl|CDD|238857 cd01759, PLAT_PL, PLAT/LH2 domain of pancreatic triglyceride
lipase. Lipases hydrolyze phospholipids and
triglycerides to generate fatty acids for energy
production or for storage and to release inositol
phosphates that act as second messengers. The central
role of triglyceride lipases is in energy production.
The proposed function of PLAT/LH2 domains is to mediate
interaction with lipids or membrane bound proteins.
Length = 113
Score = 30.0 bits (68), Expect = 0.88
Identities = 20/76 (26%), Positives = 33/76 (43%), Gaps = 15/76 (19%)
Query: 381 YRITLDLARPPRAETWVQGFMRVSLHSDNGVIRNLDLTPNGYE----KLEHGTSRSFVVT 436
Y++++ L + V G + VSL+ + G YE L+ G + S +
Sbjct: 3 YKVSVTL----SGKKKVTGTILVSLYGNKG-------NTRQYEIFKGTLKPGNTYSAFID 51
Query: 437 HPEDVGTINKVEFYWE 452
DVG + KV+F W
Sbjct: 52 VDVDVGPLTKVKFIWN 67
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.0 bits (72), Expect = 1.0
Identities = 11/27 (40%), Positives = 23/27 (85%), Gaps = 1/27 (3%)
Query: 3 KKDKEREKERDREEEKGEREKEMRKSS 29
++++EREKE+++E E+ ERE+E +++
Sbjct: 597 REEREREKEKEKERER-EREREAERAA 622
Score = 30.4 bits (68), Expect = 2.7
Identities = 12/23 (52%), Positives = 20/23 (86%), Gaps = 1/23 (4%)
Query: 2 EKKDKEREKERDREEEKGEREKE 24
E++++E+EKE++RE E+ ERE E
Sbjct: 598 EEREREKEKEKERERER-EREAE 619
Score = 28.9 bits (64), Expect = 7.6
Identities = 11/23 (47%), Positives = 20/23 (86%), Gaps = 1/23 (4%)
Query: 2 EKKDKEREKERDREEEKGEREKE 24
+K +ERE+E+++E+E+ ERE+E
Sbjct: 594 QKAREEREREKEKEKER-ERERE 615
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 31.1 bits (71), Expect = 1.1
Identities = 8/30 (26%), Positives = 16/30 (53%)
Query: 1 MEKKDKEREKERDREEEKGEREKEMRKSSD 30
+E+ ++E E++RD EE + +S
Sbjct: 119 LEQLEREEEEKRDEEERERLLRAAKSRSEQ 148
>gnl|CDD|129739 TIGR00654, PhzF_family, phenazine biosynthesis protein PhzF family.
Members of this family show a distant global similarity
to diaminopimelate epimerases, which can be taken as the
outgroup. One member of this family has been shown to
act as an enzyme in the biosynthesis of the antibiotic
phenazine in Pseudomonas aureofaciens. The function in
other species is unclear [Cellular processes, Toxin
production and resistance].
Length = 297
Score = 31.0 bits (70), Expect = 1.3
Identities = 24/99 (24%), Positives = 37/99 (37%), Gaps = 24/99 (24%)
Query: 351 MGLEAQKTNH-------PPGSK------YYITTGKEVPFCRH-----HYRITLDLARPPR 392
M A +TN+ P S+ T E+PF H Y L+ +
Sbjct: 35 MQAIANETNYSETTFLLKPSSEKYDYRLRIFTPRSELPFAGHPTIGSCY-ALLEFTKLTT 93
Query: 393 AETWVQ----GFMRVSLHSDNGVIRNLDLTPN-GYEKLE 426
A T VQ G + V+++ NG +R P +E +
Sbjct: 94 ATTLVQECKAGAVPVTINEKNGDLRISLEQPMPDFEPIS 132
>gnl|CDD|214838 smart00827, PKS_AT, Acyl transferase domain in polyketide synthase
(PKS) enzymes.
Length = 298
Score = 31.2 bits (72), Expect = 1.3
Identities = 17/84 (20%), Positives = 30/84 (35%), Gaps = 17/84 (20%)
Query: 127 WVKEMRRELLKHSDWNVIVVDWAGG-----SLPLYTQATANTRLVGLELAYFVNYLKDNY 181
+ E L W+++ V Q L +++A L+ +
Sbjct: 26 ALDECDAALQPLLGWSLLDVLLGEDGAASLLDTEVAQPA----LFAVQVA-LARLLRS-W 79
Query: 182 GLNPADVHMIGHSLG----AHTAG 201
G+ P ++GHS G A+ AG
Sbjct: 80 GVRPD--AVVGHSSGEIAAAYVAG 101
>gnl|CDD|214608 smart00308, LH2, Lipoxygenase homology 2 (beta barrel) domain.
Length = 105
Score = 29.1 bits (66), Expect = 1.8
Identities = 11/54 (20%), Positives = 19/54 (35%)
Query: 401 MRVSLHSDNGVIRNLDLTPNGYEKLEHGTSRSFVVTHPEDVGTINKVEFYWEYD 454
+ +SL G + L G++ F ED G + V+ E+
Sbjct: 21 VSLSLVGAEGDGKESKLDYLFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR 74
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length. There are two completely conserved L residues
that may be functionally important.
Length = 70
Score = 28.1 bits (63), Expect = 1.9
Identities = 13/23 (56%), Positives = 16/23 (69%), Gaps = 1/23 (4%)
Query: 2 EKKDKEREKERDREEEKGEREKE 24
E+K +E E+ R REE K EREK
Sbjct: 33 EEKREEEEEARKREERK-EREKN 54
Score = 28.1 bits (63), Expect = 2.2
Identities = 7/24 (29%), Positives = 14/24 (58%)
Query: 2 EKKDKEREKERDREEEKGEREKEM 25
EK+++E E + E ++ E+ K
Sbjct: 34 EKREEEEEARKREERKEREKNKSF 57
Score = 27.7 bits (62), Expect = 3.5
Identities = 8/25 (32%), Positives = 17/25 (68%)
Query: 3 KKDKEREKERDREEEKGEREKEMRK 27
KK+ + E+E+ EEE+ + +E ++
Sbjct: 26 KKELKAEEEKREEEEEARKREERKE 50
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 30.6 bits (70), Expect = 2.0
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 6 KEREKERDREEEKGEREKEMRK 27
+EREK R E E+ +E+E
Sbjct: 81 EEREKRRQEEYEERLQEREQMD 102
Score = 29.5 bits (67), Expect = 4.5
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 1 MEKKDKEREKERDREEEKGEREKEMRK 27
+E+K++ R ER+ E E+GER +E
Sbjct: 297 IEEKEERRAAEREEELEEGERLREEEA 323
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 30.6 bits (69), Expect = 2.4
Identities = 14/28 (50%), Positives = 21/28 (75%), Gaps = 1/28 (3%)
Query: 2 EKKDKEREKERDREEEKGEREKEMRKSS 29
++K+K+ E+ RDREEEK +RE+ KS
Sbjct: 148 KEKEKKVEEPRDREEEK-KRERVRAKSR 174
>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family. This model
represents a subfamily of RNA splicing factors
including the Pad-1 protein (N. crassa), CAPER (M.
musculus) and CC1.3 (H.sapiens). These proteins are
characterized by an N-terminal arginine-rich, low
complexity domain followed by three (or in the case of
4 H. sapiens paralogs, two) RNA recognition domains
(rrm: pfam00706). These splicing factors are closely
related to the U2AF splicing factor family (TIGR01642).
A homologous gene from Plasmodium falciparum was
identified in the course of the analysis of that genome
at TIGR and was included in the seed.
Length = 457
Score = 30.2 bits (68), Expect = 2.7
Identities = 8/29 (27%), Positives = 15/29 (51%), Gaps = 1/29 (3%)
Query: 3 KKDKEREKERDREEEKGE-REKEMRKSSD 30
+D+ER + R+ + RE+ R+S
Sbjct: 2 YRDRERGRLRNDTRRSDKGRERSRRRSRS 30
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 29.2 bits (66), Expect = 2.7
Identities = 14/26 (53%), Positives = 17/26 (65%), Gaps = 1/26 (3%)
Query: 2 EKKDKEREKERDREEEKGEREKEMRK 27
EKK E E +R EE K ERE+E +K
Sbjct: 102 EKKLDEEEAKRQHEEAK-EREREKKK 126
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 29.9 bits (68), Expect = 2.8
Identities = 13/38 (34%), Positives = 23/38 (60%), Gaps = 13/38 (34%)
Query: 2 EKKDKEREKERDRE------------EEKGEREKEMRK 27
EKK++++++ER+ + EEK ER+K+ RK
Sbjct: 286 EKKEEKKKEEREAKLAKLSPEEQRKLEEK-ERKKQARK 322
Score = 28.4 bits (64), Expect = 9.3
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 1 MEKKDKEREKERDREEEKGEREKEMRKSS 29
E++ +E +E+ E++K ERE ++ K S
Sbjct: 276 AEEERQEEAQEKKEEKKKEEREAKLAKLS 304
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 30.3 bits (68), Expect = 2.9
Identities = 9/45 (20%), Positives = 17/45 (37%)
Query: 1 MEKKDKEREKERDREEEKGEREKEMRKSSDVLYIFYIVENETRCY 45
+E+ + E D EE K + K ++ L + + E
Sbjct: 180 IEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEE 224
>gnl|CDD|223730 COG0657, Aes, Esterase/lipase [Lipid metabolism].
Length = 312
Score = 29.9 bits (67), Expect = 3.1
Identities = 22/99 (22%), Positives = 37/99 (37%), Gaps = 15/99 (15%)
Query: 114 FIIHG--FIDTPLSSWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELA 171
+HG ++ L + + R L + V+ VD Y A + LE A
Sbjct: 83 LYLHGGGWVLGSLRTHDALVAR-LAAAAGAVVVSVD--------YRLAPEHPFPAALEDA 133
Query: 172 Y----FVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQA 206
Y ++ G++P+ + + G S G H A A
Sbjct: 134 YAAYRWLRANAAELGIDPSRIAVAGDSAGGHLALALALA 172
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 28.5 bits (64), Expect = 3.7
Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 4/27 (14%)
Query: 1 MEKKDKEREKERDREEEK---GEREKE 24
+E ++ R++ER+ EE+ EREKE
Sbjct: 55 LEAAER-RKREREIAEERKLQKEREKE 80
>gnl|CDD|163133 TIGR03090, SASP_tlp, small, acid-soluble spore protein tlp. This
protein family is restricted to a subset of
endospore-forming bacteria such as Bacillus subtilis,
all of which are in the Firmicutes (low-GC
Gram-positive) lineage. Although previously designated
tlp (thioredoxin-like protein), the B. subtilis protein
was shown to be a minor small acid-soluble spore
protein SASP, unique to spores. The motif E[VIL]XDE
near the C-terminus probably represents at a
germination protease cleavage site [Cellular processes,
Sporulation and germination].
Length = 70
Score = 27.5 bits (61), Expect = 3.7
Identities = 9/24 (37%), Positives = 15/24 (62%)
Query: 1 MEKKDKEREKERDREEEKGEREKE 24
+E + E+E+ R EEK ER ++
Sbjct: 31 IEAHAELSEEEKQRIEEKNERREQ 54
>gnl|CDD|184995 PRK15035, PRK15035, cytochrome bd-II oxidase subunit 1;
Provisional.
Length = 514
Score = 30.0 bits (67), Expect = 3.8
Identities = 32/116 (27%), Positives = 50/116 (43%), Gaps = 22/116 (18%)
Query: 124 LSSWVKEMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLELAYFVNYLKDNYGL 183
L + V+ +R ++ +H W + + W SLPL + +E +F+ +G
Sbjct: 405 LIALVQTLRGKIDQHR-WVLKMALW---SLPL--------PWIAIEAGWFMT----EFGR 448
Query: 184 NPADVHMIGHSLGAHTAGYAGQAIEGLGRITGLDP----AEPYFQGMPPFARLDPT 235
P + I + AH+A GQ L I GL AE Y M +ARL P+
Sbjct: 449 QPWAIQDILPTYSAHSALTTGQLAFSLIMIVGLYTLFLIAEVYL--MQKYARLGPS 502
>gnl|CDD|233111 TIGR00745, apbA_panE, 2-dehydropantoate 2-reductase. This model
describes enzymes that perform as 2-dehydropantoate
2-reductase, one of four enzymes required for the de
novo biosynthesis of pantothenate (vitamin B5) from Asp
and 2-oxoisovalerate. Although few members of the seed
alignment are characterized experimentally, nearly all
from complete genomes are found in a genome-wide (but
not local) context of all three other
pantothenate-biosynthetic enzymes (TIGR00222, TIGR00018,
TIGR00223). The gene encoding this enzyme is designated
apbA in Salmonella typhimurium and panE in Escherichia
coli; this protein functions as a monomer and functions
in the alternative pyrimidine biosynthetic, or APB,
pathway, used to synthesize the pyrimidine moiety of
thiamine. Note, synthesis of the pyrimidine moiety of
thiamine occurs either via the first five steps in de
novo purine biosynthesis, which uses the pur gene
products, or through the APB pathway. Note that this
family includes both NADH and NADPH-dependent enzymes,
and enzymes with broad specificity, such as a
D-mandelate dehydrogease that is also a
2-dehydropantoate 2-reductase [Biosynthesis of
cofactors, prosthetic groups, and carriers, Pantothenate
and coenzyme A].
Length = 293
Score = 29.6 bits (67), Expect = 4.1
Identities = 15/60 (25%), Positives = 22/60 (36%), Gaps = 1/60 (1%)
Query: 279 LTETPLPLTLIKEGIEEAGRVLVACNHVRAIKLFTESINSKCPYVAHRCPS-YQHFLQGR 337
L E P L++ ++E RV A E + + A S Q L+GR
Sbjct: 196 LLENPEARELLRRLMDEVVRVARAEGVDLPDDEVEELVRAVIRMTAENTSSMLQDLLRGR 255
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 29.3 bits (66), Expect = 4.5
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 2 EKKDKEREKERDREEEKGEREKEMRK 27
+K K E+++ +E EK E E+R+
Sbjct: 102 KKLQKLLEEKQKQEREKEREEAELRQ 127
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 28.9 bits (65), Expect = 4.6
Identities = 12/45 (26%), Positives = 25/45 (55%), Gaps = 4/45 (8%)
Query: 1 MEKKDKEREKERDREEE--KGEREKEMRKSSDVLYI--FYIVENE 41
E+++ E E+E+ +EEE K E+ ++ + + ++VE E
Sbjct: 43 KEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEE 87
>gnl|CDD|179997 PRK05293, glgC, glucose-1-phosphate adenylyltransferase;
Provisional.
Length = 380
Score = 29.5 bits (67), Expect = 4.6
Identities = 11/21 (52%), Positives = 15/21 (71%), Gaps = 3/21 (14%)
Query: 440 DVGTINKVEFYWEYDMDVLQP 460
DVGTI E WE +M++L+P
Sbjct: 243 DVGTI---ESLWEANMELLRP 260
>gnl|CDD|233733 TIGR02117, chp_urease_rgn, conserved hypothetical protein. This
conserved hypothetical protein of unknown function is
found in several Proteobacteria. Its function is unknown
and its genome context is not well-conserved. It is
found amid urease genes in at least one species
[Hypothetical proteins, Conserved].
Length = 208
Score = 28.7 bits (64), Expect = 6.2
Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 4/47 (8%)
Query: 392 RAETWVQGFMRVSL----HSDNGVIRNLDLTPNGYEKLEHGTSRSFV 434
R+ T + V + ++ L ++ N Y +L S SFV
Sbjct: 99 RSLTADMSVLHVDGQPERYEAQSEVKRLLVSENQYNRLMDFISASFV 145
>gnl|CDD|223635 COG0561, Cof, Predicted hydrolases of the HAD superfamily [General
function prediction only].
Length = 264
Score = 28.9 bits (65), Expect = 6.2
Identities = 19/132 (14%), Positives = 32/132 (24%), Gaps = 25/132 (18%)
Query: 70 LPRAVIDTRFILYTRKNPTEGHMLKVQNERTIEKSNFDPKKKTKFIIHGFIDTPLSSWVK 129
+ + I T+K T + + K I L
Sbjct: 101 IALVLYTDDGIYLTKKRGTFAEARIGFANLSPVGREAAELEDNKIIALDKDHEILE---- 156
Query: 130 EMRRELLKHSDWNVIVVDWAGGSLPLYTQATANTRLVGLEL-------AYFVNYLKDNYG 182
E+ L K + V +G L++ Y + L G
Sbjct: 157 ELVEALRKRFPDLGLTVSSSGPIS--------------LDITPKGVSKGYALQRLAKLLG 202
Query: 183 LNPADVHMIGHS 194
+ +V G S
Sbjct: 203 IKLEEVIAFGDS 214
>gnl|CDD|213298 cd05932, LC_FACS_bac, Bacterial long-chain fatty acid CoA
synthetase (LC-FACS), including Marinobacter
hydrocarbonoclasticus isoprenoid Coenzyme A synthetase.
The members of this family are bacterial long-chain
fatty acid CoA synthetase. Marinobacter
hydrocarbonoclasticus isoprenoid Coenzyme A synthetase
in this family is involved in the synthesis of
isoprenoid wax ester storage compounds when grown on
phytol as the sole carbon source. LC-FACS catalyzes the
formation of fatty acyl-CoA in a two-step reaction: the
formation of a fatty acyl-AMP molecule as an
intermediate, and the formation of a fatty acyl-CoA.
Free fatty acids must be "activated" to their CoA
thioesters before participating in most catabolic and
anabolic reactions.
Length = 504
Score = 29.2 bits (66), Expect = 6.2
Identities = 16/57 (28%), Positives = 22/57 (38%), Gaps = 19/57 (33%)
Query: 254 GYGMSETCGHLDF-YPNN------GKEQPGCDLTETPLPLTLIKEGIEEAGRVLVAC 303
GYGM+E + YP + GK PG +L I E G +L+
Sbjct: 305 GYGMTENFAYSTLNYPGDRRIGTVGKPIPGVELK------------ISEDGEILIRS 349
>gnl|CDD|205067 pfam12771, SusD-like_2, Starch-binding associating with outer
membrane. SusD is a secreted starch-binding protein
with an N-terminal lipid tail that allows it to
associate with the outer membrane.
Length = 488
Score = 29.2 bits (65), Expect = 6.5
Identities = 18/61 (29%), Positives = 26/61 (42%), Gaps = 2/61 (3%)
Query: 200 AGYAGQAIEGLGRITGLDPAEPYFQGMPPFARLDPTDADLVDVIHTDGSSIFLLGYGMSE 259
G+ I G DPA PY MP + + D + + D TD +L +G +E
Sbjct: 281 LKGDGKYIGDFSGYEGGDPAAPYGYSMPKYEQGDISPIN--DRFRTDPIVEPMLLFGYAE 338
Query: 260 T 260
T
Sbjct: 339 T 339
>gnl|CDD|223477 COG0400, COG0400, Predicted esterase [General function prediction
only].
Length = 207
Score = 28.4 bits (64), Expect = 6.9
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 8/80 (10%)
Query: 169 ELAYFVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEG-LGRITGLDPAEPYFQGMP 227
+LA F+ L + YG++ + + +IG S GA+ A G + G F GM
Sbjct: 82 KLAEFLEELAEEYGIDSSRIILIGFSQGANIALSLGLTLPGLFAGAIL-------FSGML 134
Query: 228 PFARLDPTDADLVDVIHTDG 247
P D ++ + G
Sbjct: 135 PLEPELLPDLAGTPILLSHG 154
>gnl|CDD|181184 PRK07967, PRK07967, 3-oxoacyl-(acyl carrier protein) synthase I;
Reviewed.
Length = 406
Score = 28.9 bits (65), Expect = 7.7
Identities = 16/46 (34%), Positives = 18/46 (39%), Gaps = 11/46 (23%)
Query: 190 MIGHSLGA---HTAGYA-----GQAIEGLGRITGLDPAEPYFQGMP 227
+ GHSLGA A Y+ I I LDP GMP
Sbjct: 329 LTGHSLGAAGVQEAIYSLLMMEHGFIAPSANIEELDPQ---AAGMP 371
>gnl|CDD|238856 cd01758, PLAT_LPL, PLAT/ LH2 domain present in lipoprotein lipase
(LPL). LPL is a key enzyme in catabolism of plasma
lipoprotein triglycerides (TGs) and has therefeore has a
profound influence on triglyceride and high-density
lipoprotein (HDL) cholesterol levels in the blood. In
general, PLAT/LH2 domain's proposed function is to
mediate interaction with lipids or membrane bound
proteins.
Length = 137
Score = 27.7 bits (62), Expect = 7.8
Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 401 MRVSLHSDNGVIRNLDLTPNGYEKLEHGTSRSFVVTHPEDVGTINKVEFYWE 452
+SL+ G NL LT E + + SF++T +D+G + ++ WE
Sbjct: 22 FTISLYGTLGESENLPLTLP--EGITGNKTNSFLITTEKDIGDLLMLKLKWE 71
>gnl|CDD|227388 COG5055, RAD52, Recombination DNA repair protein (RAD52 pathway)
[DNA replication, recombination, and repair].
Length = 375
Score = 28.7 bits (64), Expect = 8.2
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 95 VQNERTIEKSNFDPKKKTKFIIH 117
VQNER I +S FDPK +++ I
Sbjct: 324 VQNERYIGESIFDPKYQSQSIRK 346
>gnl|CDD|234109 TIGR03105, gln_synth_III, glutamine synthetase, type III. This
family consists of the type III isozyme of glutamine
synthetase, originally described in Rhizobium meliloti,
where types I and II also occur.
Length = 435
Score = 28.9 bits (65), Expect = 8.4
Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 5/48 (10%)
Query: 188 VHMIGHSLGAHTAGYAGQAIEGLGRITGLDPAEPYFQGMPPFARLDPT 235
I H + AG+AG A GL G PA+P +P L
Sbjct: 31 AEAIDH-MATGGAGFAGFAAWGL----GQSPADPDLMAIPDLDSLTQL 73
>gnl|CDD|184875 PRK14875, PRK14875, acetoin dehydrogenase E2 subunit
dihydrolipoyllysine-residue acetyltransferase;
Provisional.
Length = 371
Score = 28.8 bits (65), Expect = 8.5
Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 4/48 (8%)
Query: 169 ELAYFVNYLKDNYGLNPADVHMIGHSLGAHTAGYAGQAIEGLGRITGL 216
ELA V D G+ A H++GHS+G A A R+ L
Sbjct: 182 ELAAAVLAFLDALGIERA--HLVGHSMGGAVA--LRLAARAPQRVASL 225
>gnl|CDD|182945 PRK11071, PRK11071, esterase YqiA; Provisional.
Length = 190
Score = 28.0 bits (63), Expect = 8.8
Identities = 24/91 (26%), Positives = 34/91 (37%), Gaps = 25/91 (27%)
Query: 116 IHGFIDTPLSSWVKEMRRELLKH-SDWNVIVVDWAGGSLPLYTQATA------------- 161
+HGF +P S+ ++ L +H D +IV LP Y A
Sbjct: 7 LHGFNSSPRSAKATLLKNWLAQHHPDIEMIV-----PQLPPYPADAAELLESLVLEHGGD 61
Query: 162 NTRLVGLEL-AYFVNYLKDNYGL-----NPA 186
LVG L Y+ +L + L NPA
Sbjct: 62 PLGLVGSSLGGYYATWLSQCFMLPAVVVNPA 92
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 28.2 bits (63), Expect = 8.9
Identities = 13/31 (41%), Positives = 21/31 (67%)
Query: 2 EKKDKEREKERDREEEKGEREKEMRKSSDVL 32
EK KER +E++REEE+ E+E + ++L
Sbjct: 152 EKIKKERAEEKEREEEEKAAEEEKAREEEIL 182
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 28.9 bits (64), Expect = 9.2
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 2 EKKDKEREKERDREEEKGERE 22
EKK+KE+E+++D+++E +
Sbjct: 206 EKKEKEKERDKDKKKEVEGFK 226
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 27.7 bits (62), Expect = 9.6
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 2 EKKDKEREKERDREEEKGEREKEMRK 27
E+K +E+E+E++ +E K E+EKE K
Sbjct: 15 EEKREEKEREKEEKERKEEKEKEWGK 40
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.138 0.430
Gapped
Lambda K H
0.267 0.0893 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,511,698
Number of extensions: 2699545
Number of successful extensions: 3546
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3477
Number of HSP's successfully gapped: 77
Length of query: 531
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 430
Effective length of database: 6,457,848
Effective search space: 2776874640
Effective search space used: 2776874640
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.0 bits)