RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12216
(646 letters)
>gnl|CDD|214744 smart00611, SEC63, Domain of unknown function in Sec63p, Brr2p and
other proteins.
Length = 312
Score = 176 bits (448), Expect = 4e-50
Identities = 70/268 (26%), Positives = 118/268 (44%), Gaps = 32/268 (11%)
Query: 67 GMWWYKSIRYTGDKVLL-ETINFYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIER 125
G+W R + TI + + P M K +L IL S EFD+
Sbjct: 1 GIWPTDLGRIASYYYISYTTIRTF--NELLKPKMTTKDLLRILSMSSEFDQ--------- 49
Query: 126 PSDEIEVPQLIRQIPNLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLEADRMYIV 185
E L+ ++ E P +KA L+ AHLS ++L S LE+D +Y++
Sbjct: 50 IPVRHEEDLLLEELAEKLPIRLENPSLDDPHVKANLLLQAHLSRLKLPSFALESDTVYVL 109
Query: 186 KKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFKNPLLQLP 245
+ LLQ MV + L+ A N + L MI+Q +W +PLLQLP
Sbjct: 110 QNAGRLLQAMVDIALERGWLSTA------------LNALNLSQMIIQALWPTDSPLLQLP 157
Query: 246 YVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDLHV 305
++ ++ LK +K+ + SL+ ++++EER ++ + D + + KVL +P +++ +
Sbjct: 158 HLPEEILKR--LEKKKVLSLEDLLELEDEERGELLG-LLDAEGERVYKVLSRLPKLNIEI 214
Query: 306 QPEVIDDEATTEYTAGAIITVTCTLMRK 333
E I G +T+T L
Sbjct: 215 SLEPITRT-----VLGVEVTLTVDLTWD 237
Score = 50.3 bits (121), Expect = 1e-06
Identities = 17/70 (24%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 373 LPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDL 432
LP++ ++ LK +K+ + SL+ ++++EER ++ + D + + KVL +P +++
Sbjct: 156 LPHLPEEILKR--LEKKKVLSLEDLLELEDEERGELLG-LLDAEGERVYKVLSRLPKLNI 212
Query: 433 HVQPEVIDDE 442
+ E I
Sbjct: 213 EISLEPITRT 222
>gnl|CDD|217270 pfam02889, Sec63, Sec63 Brl domain. This domain (also known as the
Brl domain) is required for assembly of functional
endoplasmic reticulum translocons.
Length = 309
Score = 141 bits (358), Expect = 1e-37
Identities = 61/255 (23%), Positives = 116/255 (45%), Gaps = 40/255 (15%)
Query: 63 PTAVGM---WWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFN 119
PT +G +Y +L ET+ + + P+ LK +L IL ++ EF+
Sbjct: 1 PTELGRIASHYY---------ILYETMETF--NTSLKPNTTLKDLLEILSSASEFE---- 45
Query: 120 SEIIERPSDEIEVPQLIRQIPN-LGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLE 178
EI R ++ E+ +L+ ++P + + + P K L+ A+LS ++L +L
Sbjct: 46 -EIPVREEEKKELKKLLERVPIPVKSPSIDDP-----HAKVNLLLQAYLSRLKLPDFSLV 99
Query: 179 ADRMYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFK 238
+D YI++ LL+ + I L+ + N ++L MI Q +W
Sbjct: 100 SDLNYILQNAGRLLRALF-----EIALSKG-------WLSPALNALELSKMIEQRLWPSD 147
Query: 239 NPLLQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNM 298
+PL QLP++ + LK +K+ I SL+ + EE ++ N +Q D+ + +
Sbjct: 148 SPLRQLPHIPPEVLKK--LEKKGISSLEDLMDLDPEELGELLG-NNPKQGKDIAEFVNRF 204
Query: 299 PYVDLHVQPEVIDDE 313
P +++ + + I E
Sbjct: 205 PRLEIEAEVQPITRE 219
Score = 49.9 bits (120), Expect = 2e-06
Identities = 19/89 (21%), Positives = 39/89 (43%), Gaps = 5/89 (5%)
Query: 373 LPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYVDL 432
LP++ + LK +K+ I SL+ + EE ++ N +Q D+ + + P +++
Sbjct: 153 LPHIPPEVLKK--LEKKGISSLEDLMDLDPEELGELLG-NNPKQGKDIAEFVNRFPRLEI 209
Query: 433 HVQPEVIDDEATTEYTAGAIITPEVIDDE 461
+ + I E +T + DE
Sbjct: 210 EAEVQPITRE--VSLRVEVTLTRDFEWDE 236
>gnl|CDD|227694 COG5407, SEC63, Preprotein translocase subunit Sec63 [Intracellular
trafficking and secretion].
Length = 610
Score = 98.5 bits (245), Expect = 2e-21
Identities = 70/329 (21%), Positives = 126/329 (38%), Gaps = 52/329 (15%)
Query: 2 KLTKAYQALTDEESRRNFEKYGNPDGPGAMSFGIALPSYIVEKENSVWVLGLYALVFMVA 61
+TKAY LTD++ R N+ YG PD P S GIALP IV E S++ +Y+L+ V
Sbjct: 148 TITKAYGLLTDKKRRENYLNYGTPDSPQHTSEGIALPKVIVRSERSMYAFVMYSLLLGVF 207
Query: 62 LPTAVGMWWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFNSE 121
LP V WW + YT V T+ +Y +I + L +L + +S E R
Sbjct: 208 LPYWVYRWWREIRDYTKVGVHFVTMEMFYE--RIDGSLTLDELLSLFASSKELTRMN--- 262
Query: 122 IIERPSDEIEVPQLIRQIPNLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLEADR 181
P + E+ L+ HL N ++E +
Sbjct: 263 ------------------PKGPKCTLEK------------LLGDHL-NRA---RSVEFNE 288
Query: 182 MYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFKNPL 241
I LL ++ I +A + + +++ P+ D +
Sbjct: 289 YRIKSNVEGLLGALLR-----IASNFAFPLKECGKENKGQYIVQAIPL------DHLFRI 337
Query: 242 LQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGNMPYV 301
LQLP ++ + R I+ ++ + + +V+K + + ++ V +P
Sbjct: 338 LQLPRSDVEYAQRV--SLRLIEGMKAIGSLIAKRYGNVLKNLVVLELMEIQAVADGIPRY 395
Query: 302 DLHVQPEVIDDEATTEYTAGAIITVTCTL 330
L P ++ ++ + +++ L
Sbjct: 396 FLLQAPFLVFEDLFITEKSKERVSLKGYL 424
>gnl|CDD|214946 smart00973, Sec63, Sec63 Brl domain. This domain was named after
the yeast Sec63 (or NPL1) (also known as the Brl domain)
protein in which it was found. This protein is required
for assembly of functional endoplasmic reticulum
translocons. Other yeast proteins containing this domain
include pre-mRNA splicing helicase BRR2, HFM1 protein
and putative helicases.
Length = 314
Score = 83.2 bits (206), Expect = 2e-17
Identities = 59/256 (23%), Positives = 104/256 (40%), Gaps = 39/256 (15%)
Query: 63 PTAVGM---WWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASLEFDRRFN 119
PT +G ++Y I Y ETI + + P LK +L IL + EF
Sbjct: 1 PTELGRIASYYY--ISY-------ETIETFNQ--SLKPTTTLKDILEILSRASEFK---- 45
Query: 120 SEIIERPSDEIEVPQLIRQIP-NLGEKNRERPLYHKYSIKARALIYAHLSNMQLTSETLE 178
EI R +++ E+ +L +++P + E + P K L+ AHLS + L L
Sbjct: 46 -EIPVRHNEKKELNELNKRVPIPVKEGIIDSP-----HAKVNLLLQAHLSRLPLPDFDLV 99
Query: 179 ADRMYIVKKCPYLLQEMVTCISQLILLAYAQRVPRLIHIETLENVMKLCPMIVQGMWDFK 238
+D YI++ P +L+ +V L A N + L M+VQ +W+
Sbjct: 100 SDLKYILQNAPRILRALVDIALSKGWLRTAL------------NALDLSQMVVQRLWEDS 147
Query: 239 N-PLLQLPYVTDDHLKHFICKKRYIKSLQQFAQMKNEERRSVVKFMNDEQYSDMLKVLGN 297
+ PL QLP+ + + + K +S + M E + + ++L
Sbjct: 148 DSPLKQLPHFLIEDVYDKLELKDGSRSFELLLDMNAAELGEFLNRLPPNGRLIY-ELLRR 206
Query: 298 MPYVDLHVQPEVIDDE 313
P +++ + I +
Sbjct: 207 FPKIEVEAEVLPITRD 222
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 40.2 bits (94), Expect = 0.004
Identities = 27/105 (25%), Positives = 41/105 (39%), Gaps = 25/105 (23%)
Query: 542 KKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDV 601
K K++H + G+++ + +E+T+ +D E +D SESE ESD D
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDD--------DEDECEAIED--SESESESDGEDG 682
Query: 602 ENDEVVDKKEKNEDSNNKSEE---------------SSDDDDDWE 631
E DE D E NE + D +D E
Sbjct: 683 EEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDEE 727
Score = 29.5 bits (66), Expect = 6.8
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 578 KKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQ 634
++ + E E+E DL + ++DE D+ E EDS ++SE +D ++ E+
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDE--DECEAIEDSESESESDGEDGEEDEQED 689
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 39.4 bits (92), Expect = 0.006
Identities = 24/112 (21%), Positives = 40/112 (35%), Gaps = 18/112 (16%)
Query: 536 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKK---------DTPEESK 586
K K K K K KK ++ E+ D +K E+ + +S+
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSE 202
Query: 587 DV---------SSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
D SS+ EE + + +E + E +E + SE S D +
Sbjct: 203 DYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDSEISESRSVSDSE 254
Score = 31.3 bits (71), Expect = 1.5
Identities = 27/137 (19%), Positives = 50/137 (36%), Gaps = 20/137 (14%)
Query: 517 ITPVVNKEKQQTHRPVWMKNKKGGKKK--KFTKNKHDKKSSG---------QKKITNTVE 565
IT K ++ + N K K + +K S K + N +E
Sbjct: 71 ITSKPTKAWAESPEFRQIINDKSHKANPSRSEAEVVEKLKSLNNVVSRLLNSKPVKNVLE 130
Query: 566 QTEAPADVA-----KVEKKDTPEESKDVSS----ESEEESDLSDVENDEVVDKKEKNEDS 616
D K +K +++K ES ++ D + E+++ +E ED
Sbjct: 131 SIMEGLDKILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAEDD 190
Query: 617 NNKSEESSDDDDDWEKY 633
++ EE D +D+ +Y
Sbjct: 191 SDDEEEEDSDSEDYSQY 207
Score = 29.8 bits (67), Expect = 5.3
Identities = 23/109 (21%), Positives = 42/109 (38%), Gaps = 26/109 (23%)
Query: 540 GKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLS 599
G + K K K KK+ KK ++ + +D E+ ++ +ESK ES E+
Sbjct: 141 GIETKAKKGKAKKKTKKSKK-----KEAKESSDKDDEEESESEDESKS--EESAEDDSDD 193
Query: 600 DVE-------------------NDEVVDKKEKNEDSNNKSEESSDDDDD 629
+ E ++E ++ + + SE SD+ D
Sbjct: 194 EEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSESESDESDS 242
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 39.0 bits (91), Expect = 0.007
Identities = 26/134 (19%), Positives = 49/134 (36%), Gaps = 3/134 (2%)
Query: 498 EPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQ 557
+ +K+ + +E V + + P KKK TK K +KK + +
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKK-TKEKKEKKEASE 237
Query: 558 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESE--EESDLSDVENDEVVDKKEKNED 615
+ E+ DV ++ P + E E + SD E + +KEK +
Sbjct: 238 STVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR 297
Query: 616 SNNKSEESSDDDDD 629
E+ +D++
Sbjct: 298 LKKMMEDEDEDEEM 311
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 36.9 bits (86), Expect = 0.022
Identities = 15/63 (23%), Positives = 36/63 (57%)
Query: 580 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTGLNK 639
D+ +E + + ES++E + SD E++E D+ K ++ E S +D+++ + + +
Sbjct: 111 DSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAE 170
Query: 640 RDK 642
++K
Sbjct: 171 KEK 173
Score = 29.2 bits (66), Expect = 5.8
Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 1/82 (1%)
Query: 552 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 611
KK G++++ + + E + E+K E S + ++E + +VE DE D +
Sbjct: 57 KKGYGEEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEG 116
Query: 612 KNED-SNNKSEESSDDDDDWEK 632
+ D ++K ESSD +D+ EK
Sbjct: 117 EWIDVESDKEIESSDSEDEEEK 138
>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
Length = 1713
Score = 37.5 bits (87), Expect = 0.029
Identities = 29/132 (21%), Positives = 49/132 (37%), Gaps = 23/132 (17%)
Query: 500 SKEGEGEIE-----EEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKS 554
SK G I+ ++++P G++ + K K K K +
Sbjct: 10 SKRGLKVIQIGDDSDDDEPIGSLFKL-----------------KRPKNSKKVKVGLESTG 52
Query: 555 SGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE 614
++K++ E +E D +K K V S S S + +EN E VD NE
Sbjct: 53 KREEKLSALDEDSEGMDDTLASFRKRLKGPKKGVGSVSARMSQGALLENQEEVDTV-LNE 111
Query: 615 DSNNKSEESSDD 626
S + + +D
Sbjct: 112 GSKRSLDGNLED 123
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 37.3 bits (86), Expect = 0.032
Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 556 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 615
G + ITNT+ + + ++A E + D S E ESD++ V D VV + E N D
Sbjct: 3823 GAEDITNTLNEDDDLEELANEEDTAN-QSDLDESEARELESDMNGVTKDSVVSENE-NSD 3880
Query: 616 SNNKSEESSDDDDD 629
S ++++ ++ +D
Sbjct: 3881 SEEENQDLDEEVND 3894
Score = 33.0 bits (75), Expect = 0.73
Identities = 17/71 (23%), Positives = 33/71 (46%), Gaps = 1/71 (1%)
Query: 565 EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESS 624
Q + E D PE+ K + + + S SD+E+ ++ E E+++ + +E
Sbjct: 3973 IQENNSQPPPENEDLDLPEDLK-LDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPM 4031
Query: 625 DDDDDWEKYQT 635
D+D E+ T
Sbjct: 4032 QDEDPLEENNT 4042
Score = 32.7 bits (74), Expect = 0.80
Identities = 24/134 (17%), Positives = 51/134 (38%), Gaps = 5/134 (3%)
Query: 502 EGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKIT 561
E EE + + + + + +W + + + K+ ++ + +
Sbjct: 3877 ENSDSEEENQDLDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLV 3936
Query: 562 NTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEK---NEDSN- 617
+ + +A D + EK+D E S DV + E + D+ + N + + E ED
Sbjct: 3937 SKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQE-NNSQPPPENEDLDLPEDLKL 3995
Query: 618 NKSEESSDDDDDWE 631
++ E D D E
Sbjct: 3996 DEKEGDVSKDSDLE 4009
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 36.8 bits (85), Expect = 0.034
Identities = 23/134 (17%), Positives = 55/134 (41%), Gaps = 2/134 (1%)
Query: 498 EPSKEGEGEIEE--EEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSS 555
E KE E ++ +EKP KE+ + RP K K+ KK + +++ ++K
Sbjct: 107 EKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166
Query: 556 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 615
+ + + ++ K ++ E+ + + E+ + N+E +++ +D
Sbjct: 167 ERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKD 226
Query: 616 SNNKSEESSDDDDD 629
+ +D+
Sbjct: 227 RETTTSPMEEDESR 240
Score = 29.9 bits (67), Expect = 4.8
Identities = 17/99 (17%), Positives = 34/99 (34%), Gaps = 4/99 (4%)
Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEES-KDVSSESE 593
K K+ K++K K + K+ +K ++ P + K ++K E ++ + E
Sbjct: 108 KEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRE 167
Query: 594 EESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEK 632
S + K ++ EE E
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKKE---PPEEEKQRQAAREA 203
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 36.5 bits (85), Expect = 0.036
Identities = 16/92 (17%), Positives = 32/92 (34%), Gaps = 14/92 (15%)
Query: 31 MSFGIALPSYIVEKENSV----WVLGLYALVFMVALPTAVGMWWYKSIRYTGDKVLLETI 86
ALP + S ++ ++L + + + + S R+ +VLL +
Sbjct: 16 GLLSPALPLLAEDLGLSASQAGLIVSAFSLGYALGSL--LAGYL--SDRFGRRRVLLLGL 71
Query: 87 ------NFYYAFFQITPHMALKRVLMILGASL 112
+ AF + + R L+ LG
Sbjct: 72 LLFALGSLLLAFASSLWLLLVGRFLLGLGGGA 103
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 33.8 bits (78), Expect = 0.054
Identities = 18/63 (28%), Positives = 28/63 (44%)
Query: 567 TEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDD 626
E P +VAK E E EE+ D +++D+ D + D ++ + DD
Sbjct: 34 EEVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDD 93
Query: 627 DDD 629
DDD
Sbjct: 94 DDD 96
>gnl|CDD|178151 PLN02536, PLN02536, diaminopimelate epimerase.
Length = 267
Score = 35.5 bits (82), Expect = 0.065
Identities = 22/62 (35%), Positives = 26/62 (41%)
Query: 321 GAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVLPYVTDDH 380
G VTC M P V FG+ +KV L K G E P T T + V+ H
Sbjct: 135 GKTWLVTCVSMGNPHCVTFGEKELKVDDLPLEKIGPKFEHHEMFPARTNTEFVQVVSRSH 194
Query: 381 LK 382
LK
Sbjct: 195 LK 196
Score = 29.7 bits (67), Expect = 3.7
Identities = 19/53 (35%), Positives = 21/53 (39%)
Query: 469 GAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVV 521
G VTC M P V FG+ +KV L K G E P T T V
Sbjct: 135 GKTWLVTCVSMGNPHCVTFGEKELKVDDLPLEKIGPKFEHHEMFPARTNTEFV 187
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 35.4 bits (82), Expect = 0.092
Identities = 31/131 (23%), Positives = 54/131 (41%), Gaps = 10/131 (7%)
Query: 501 KEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKI 560
+E EGE E +E+ N +K R + G K+ + K K + K+ ++K
Sbjct: 399 RELEGEEESDEEENE-----EPSKKNVGRRK--FGPENGEKEAESKKLKKENKNEFKEKK 451
Query: 561 TNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE---DSN 617
+ E+ + AKVEK + ++ EEE + D EN + + +
Sbjct: 452 ESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQD 511
Query: 618 NKSEESSDDDD 628
+K + SS D
Sbjct: 512 SKKKSSSKLDK 522
Score = 32.7 bits (75), Expect = 0.69
Identities = 23/132 (17%), Positives = 50/132 (37%), Gaps = 3/132 (2%)
Query: 501 KEGEGEIEEEEKPNGTITPVVN--KEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQK 558
K G E+E + N KEK+++ +++++ K +K + QK
Sbjct: 424 KFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQK 483
Query: 559 KITNTVEQTEAPADVAKVE-KKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 617
+ E P K ++ S S+ + + + V KK+K ++ +
Sbjct: 484 EEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANKISKAAVKVKKKKKKEKS 543
Query: 618 NKSEESSDDDDD 629
++ D++D
Sbjct: 544 IDLDDDLIDEED 555
Score = 30.8 bits (70), Expect = 2.5
Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 575 KVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
K+E K EE +D S+SEEE + D ++D+ + K +E DD+++
Sbjct: 317 KIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPW-MLRKKLGKLKEGEDDEEN 370
Score = 29.3 bits (66), Expect = 7.2
Identities = 31/161 (19%), Positives = 52/161 (32%), Gaps = 31/161 (19%)
Query: 489 DDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPV--WMKNKKGGKKKK-- 544
KV + + + EEEE+ P + K K K K
Sbjct: 465 AKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAA 524
Query: 545 FTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDV--- 601
+K K +KK +++ + D K D +E E E++ +L +
Sbjct: 525 NKISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDE------EDEDDEELPFLFKQ 578
Query: 602 --------ENDEVVDK--KEKNEDSNNKSEESSDDDDDWEK 632
D+VV + KEK E +++D E
Sbjct: 579 KDLIKEAFAGDDVVAEFEKEKKEV--------IEEEDPKEI 611
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 35.3 bits (81), Expect = 0.099
Identities = 17/65 (26%), Positives = 26/65 (40%)
Query: 565 EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESS 624
E +P AK E + + + ++EEE LS K +KN + S+
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGD 360
Query: 625 DDDDD 629
D DD
Sbjct: 361 DSDDS 365
Score = 31.1 bits (70), Expect = 2.0
Identities = 23/109 (21%), Positives = 44/109 (40%), Gaps = 9/109 (8%)
Query: 537 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 596
+ KKK K +KK K + D + + D +E ++ S+ +
Sbjct: 237 DEEKSKKKKKKLAKNKKKLDDDK----KGKRGGDDDADEYDSDDGDDEGREEDYISDSSA 292
Query: 597 DLSDVENDEVV---DKKEKNEDSNNKSEESSDDDDDWEKYQTGLNKRDK 642
+D E E + K E ++ E S+++ + E + GL+K+ K
Sbjct: 293 SGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEE--EGGLSKKGK 339
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 34.3 bits (79), Expect = 0.10
Identities = 17/99 (17%), Positives = 34/99 (34%), Gaps = 4/99 (4%)
Query: 533 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSES 592
+ + + KKS Q+ + E +K+D + K+
Sbjct: 34 LFFPSSPSDQAAADEQE-AKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESE 92
Query: 593 EEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWE 631
EE + + +DE + E+ +SN E + W+
Sbjct: 93 EENEEEDEESSDENEKETEEKTESN---VEKEITNPSWK 128
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 35.3 bits (82), Expect = 0.11
Identities = 21/71 (29%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 572 DVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWE 631
D +E++ E D + EE D D E+DE ++ K+ED + ++ DDDDD
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDED--DDDDDDDDDIA 159
Query: 632 KYQTGLNKRDK 642
+ L +R +
Sbjct: 160 TRERSLERRRR 170
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 35.0 bits (81), Expect = 0.13
Identities = 16/95 (16%), Positives = 36/95 (37%), Gaps = 1/95 (1%)
Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
+ K +KK D + N + + D ++ D ++ D + E+
Sbjct: 101 DSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDI-DDDDDDEDDDED 159
Query: 595 ESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
+ D + DE + ++ E ++ + D+DD
Sbjct: 160 DDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDS 194
Score = 34.2 bits (79), Expect = 0.21
Identities = 19/145 (13%), Positives = 45/145 (31%), Gaps = 8/145 (5%)
Query: 493 KVQFLEPSKEGEGEIEEEEKPNGTI---TPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNK 549
+++ SK+ E ++ + E + K K K
Sbjct: 37 EIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKL 96
Query: 550 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLS-----DVEND 604
D+ S +K + + K + D + ++ D D E+D
Sbjct: 97 KDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDD 156
Query: 605 EVVDKKEKNEDSNNKSEESSDDDDD 629
+ D + +D + + +E+ + +
Sbjct: 157 DEDDDDDDVDDEDEEKKEAKELEKL 181
Score = 32.3 bits (74), Expect = 0.88
Identities = 26/145 (17%), Positives = 55/145 (37%), Gaps = 5/145 (3%)
Query: 490 DTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNK 549
++ E +K+ ++ + K G IT K + + + K
Sbjct: 6 TKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIK-EALESKKKTPEQIDQVLIFLSGMVK 64
Query: 550 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 609
D + + I +T A A AK K ++ D S ++E+++ L ++ V
Sbjct: 65 -DTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKD 123
Query: 610 KEKNEDSNNKSEESSDD---DDDWE 631
+ +++ ++ DD DDD +
Sbjct: 124 IDVLNQADDDDDDDDDDDLDDDDID 148
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 35.0 bits (81), Expect = 0.15
Identities = 21/125 (16%), Positives = 38/125 (30%), Gaps = 3/125 (2%)
Query: 507 IEEEEKPNGTITPV--VNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTV 564
+ +E KP V V + ++ G K TK K KK
Sbjct: 1265 LSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKK-VKKRLEGSLAALKK 1323
Query: 565 EQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESS 624
++ K + K +++ S + D + D + ++
Sbjct: 1324 KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDED 1383
Query: 625 DDDDD 629
D+DDD
Sbjct: 1384 DEDDD 1388
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 35.0 bits (81), Expect = 0.15
Identities = 11/53 (20%), Positives = 30/53 (56%)
Query: 577 EKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
E ++ E+ D E +++ DL + E D + +E++E+ + +E +++++
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEE 373
Score = 33.4 bits (77), Expect = 0.41
Identities = 18/66 (27%), Positives = 33/66 (50%)
Query: 577 EKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTG 636
++D EE V E EE+ D E +E VD ++ ED ++ + DD+++ E+ +
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378
Query: 637 LNKRDK 642
K+
Sbjct: 379 KKKKSA 384
>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
terminus. The C terminus of the plasma membrane Nha1
antiporter plays an important role in the immediate cell
response to hypo-osmotic shock which prevents an
execessive loss of ions and water. This domain is found
with pfam00999.
Length = 430
Score = 34.0 bits (78), Expect = 0.21
Identities = 24/128 (18%), Positives = 38/128 (29%), Gaps = 19/128 (14%)
Query: 510 EEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEA 569
+ PVV++ + G K K ++ K +T A
Sbjct: 274 GKPERDRSNPVVSR----------ALSWVGLKGTKHD-FIKQIRNLDPKARREVDLETNA 322
Query: 570 PADVAKVEKKDTPE----ESKDVSSESEEESDLSD----VENDEVVDKKEKNEDSNNKSE 621
P V + + D SE + E DL+ E E +E + S
Sbjct: 323 PERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSLLGSEASERRKNREPSSSSPPSRP 382
Query: 622 ESSDDDDD 629
S D +D
Sbjct: 383 RSRRDSED 390
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 33.9 bits (77), Expect = 0.31
Identities = 18/88 (20%), Positives = 30/88 (34%), Gaps = 4/88 (4%)
Query: 547 KNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 606
K ++K E KV K+ ++ + + ES++SDVE E
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEF 167
Query: 607 VDKKEKNEDSNNKSEESSDDD----DDW 630
V E + E+ +D W
Sbjct: 168 VTSLENESEEELDLEKDDGEDISHTYGW 195
Score = 32.3 bits (73), Expect = 0.94
Identities = 24/117 (20%), Positives = 38/117 (32%), Gaps = 9/117 (7%)
Query: 510 EEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEA 569
E P N K K K+ TK + S ++ + V +A
Sbjct: 53 ESALAVDEEPDENGAV--------SKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDA 104
Query: 570 PADVAKVEKKD-TPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSD 625
D + T ++ SS+ EEE V V K +++ + E SD
Sbjct: 105 ALDKESKKTPRRTRRKAAAASSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSD 161
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 33.7 bits (77), Expect = 0.33
Identities = 21/110 (19%), Positives = 44/110 (40%), Gaps = 5/110 (4%)
Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
KK K K KK ++ N ++ + PE+ D S+++E
Sbjct: 34 PEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDP-SKNKE 92
Query: 595 ESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTGLNKRDKYL 644
E + E + K + ++ +N +++ +++ D+ L+K K L
Sbjct: 93 EIEKPKDEPKKPDKKPQADQPNNVHADQPNNNKVDFSD----LDKLKKEL 138
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 33.1 bits (76), Expect = 0.34
Identities = 26/140 (18%), Positives = 41/140 (29%), Gaps = 11/140 (7%)
Query: 490 DTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKK---FT 546
D K + KE + + E E+ + H T
Sbjct: 27 DFFKKLLKKKPKEIDEQTEAEDNLLDKAVDDILSN----HYLKEGNGTTSVAIGDGMEKT 82
Query: 547 KNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEV 606
+ K+S ITN P +A E +D SSES +
Sbjct: 83 IHGDVKESDPMSFITNPSLTVSVP--IASGESDLENLNDEDTSSESSYGF--KEESKKGS 138
Query: 607 VDKKEKNEDSNNKSEESSDD 626
+ + E ++ SE S+ D
Sbjct: 139 AETLKLEEKDSSSSEGSTVD 158
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 32.9 bits (75), Expect = 0.36
Identities = 12/71 (16%), Positives = 24/71 (33%)
Query: 558 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 617
KK T + + K +K+ S+ E + + + + K +N +
Sbjct: 52 KKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAE 111
Query: 618 NKSEESSDDDD 628
E+ D D
Sbjct: 112 VNHEDVIDLSD 122
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 32.5 bits (74), Expect = 0.37
Identities = 21/80 (26%), Positives = 44/80 (55%), Gaps = 4/80 (5%)
Query: 552 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 611
KK +KK N +E + + +K+D +E++DV E +E+ + + E++E +++ E
Sbjct: 11 KKPEEEKKDENLLEHVKITS----WDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE 66
Query: 612 KNEDSNNKSEESSDDDDDWE 631
ED E+ ++++D E
Sbjct: 67 DIEDEEEIVEDEEEEEEDEE 86
Score = 31.4 bits (71), Expect = 0.70
Identities = 21/101 (20%), Positives = 48/101 (47%), Gaps = 5/101 (4%)
Query: 539 GGKKKKFTKNKHDKKSSGQKKIT-----NTVEQTEAPADVAKVEKKDTPEESKDVSSESE 593
G +KK + K D+ KIT + +++ E D + + ++ EE ++ E E
Sbjct: 7 GAPEKKPEEEKKDENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPE 66
Query: 594 EESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQ 634
+ D ++ DE +++++ ++ + K E + +D + Q
Sbjct: 67 DIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQ 107
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 33.8 bits (77), Expect = 0.39
Identities = 34/195 (17%), Positives = 74/195 (37%), Gaps = 25/195 (12%)
Query: 435 QPEVIDDEATTEYTAGAIITPEVIDDEATTEYTAGAIITVTCTLMRKPMSVLFGDDTIKV 494
+ E +DE E E ++DE E + + +
Sbjct: 722 EAEGTEDEGEIETGEEG----EEVEDEGEGEAEGKHEVETEGDRKE-------TEHEGET 770
Query: 495 QFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKS 554
+ E EGEI+ E +G + + K + G+ + K++H+ +S
Sbjct: 771 EAEGKEDEDEGEIQAGE--DG---------EMKGDEGAEGKVEHEGETEAGEKDEHEGQS 819
Query: 555 SGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNE 614
Q T ++T ++ + + ++ K V + SD D E +E +++E+ E
Sbjct: 820 ETQADDTEVKDETGE-QELNAENQGEAKQDEKGV--DGGGGSDGGDSEEEEEEEEEEEEE 876
Query: 615 DSNNKSEESSDDDDD 629
+ + EE +++++
Sbjct: 877 EEEEEEEEEEEEENE 891
Score = 30.3 bits (68), Expect = 4.5
Identities = 32/145 (22%), Positives = 59/145 (40%), Gaps = 11/145 (7%)
Query: 478 LMRKPMS-VLFGDDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKN 536
L R+P++ V+ D K E GE EE E+P T N E+
Sbjct: 616 LSRRPVAKVMALGDLSKGDVAEAEHTGERTGEEGERP--TEAEGENGEESGGEA------ 667
Query: 537 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 596
++ G+ + +N+ + + ++K E E A A K +T E + E+E E
Sbjct: 668 EQEGETETKGENESEGEIPAERKGEQEGE-GEIEAKEADH-KGETEAEEVEHEGETEAEG 725
Query: 597 DLSDVENDEVVDKKEKNEDSNNKSE 621
+ E + + +E ++ ++E
Sbjct: 726 TEDEGEIETGEEGEEVEDEGEGEAE 750
>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
The proteins in this family are poorly characterized,
but an investigation has indicated that the immediate
early protein is required the down-regulation of MHC
class I expression in dendritic cells. Human herpesvirus
6 immediate early protein is also referred to as U90.
Length = 993
Score = 33.5 bits (76), Expect = 0.40
Identities = 30/106 (28%), Positives = 48/106 (45%), Gaps = 4/106 (3%)
Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
K KKK + N+ KK + ++K TV + K + T +SSESE+
Sbjct: 643 TFYKRDKKKDISTNRKVKKRTAKRK---TVGYKTDKSKKIKSDSLPTDTNVIVISSESED 699
Query: 595 ESDLSDVENDEVVDKKEKNEDSNNKSEESSD-DDDDWEKYQTGLNK 639
E D ++ + KK K+E + S ES D +D E + + +K
Sbjct: 700 EEDGFNIIKKSQLKKKIKSELKSESSSESDDCTSEDNELHLSDYDK 745
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 33.2 bits (76), Expect = 0.50
Identities = 18/130 (13%), Positives = 45/130 (34%), Gaps = 13/130 (10%)
Query: 518 TPVVNKEKQQTHRPVWMKNKKGGKK--KKFTKNKHDKKSSGQKKITNTVEQTEAPADVAK 575
+ K K Q ++ K+ F++ ++ ++ KK + E+ K
Sbjct: 13 SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEK 72
Query: 576 VEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQT 635
+KK + E++ S+ E +K++ + K + ++D +
Sbjct: 73 KKKK-----------KKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNK 121
Query: 636 GLNKRDKYLL 645
+K +
Sbjct: 122 IAELAEKSNV 131
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 32.5 bits (74), Expect = 0.57
Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 4/63 (6%)
Query: 571 ADVAKVEKKDTPEESKDVSSESEEESDLSDVENDE----VVDKKEKNEDSNNKSEESSDD 626
AD E +D P+E +D EEES SD +++ + + ++ S + + D
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDTPD 278
Query: 627 DDD 629
DD
Sbjct: 279 SDD 281
Score = 30.9 bits (70), Expect = 1.8
Identities = 16/85 (18%), Positives = 37/85 (43%), Gaps = 8/85 (9%)
Query: 551 DKKSSGQKKITNTVEQTEAPADVAK--VEKKDTPEESKDVSSESEEE------SDLSDVE 602
DK ++ V+ +A A V + + D EE D ++ E D +
Sbjct: 176 DKAGKDLDRLAACVDDQQAFARVVRDMLSSMDMAEELGDEPESADSEDNEDEDDPKEDED 235
Query: 603 NDEVVDKKEKNEDSNNKSEESSDDD 627
+D+ +++ + DS ++ ++S ++
Sbjct: 236 DDQGEEEESGSSDSLSEDSDASSEE 260
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 31.6 bits (72), Expect = 0.72
Identities = 16/64 (25%), Positives = 32/64 (50%), Gaps = 4/64 (6%)
Query: 574 AKVEKKDTPEESKDVSSESEEE----SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
AK +KK+ EE + V E EE+ + + DK +K++ ++KSE+ + + +
Sbjct: 60 AKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAE 119
Query: 630 WEKY 633
+
Sbjct: 120 DKLE 123
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 32.2 bits (74), Expect = 0.98
Identities = 19/72 (26%), Positives = 34/72 (47%)
Query: 541 KKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSD 600
KK TK +KK ++ + A K ++++ EE + E EEE + ++
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 601 VENDEVVDKKEK 612
E +E +KK+K
Sbjct: 464 EEKEEEEEKKKK 475
Score = 29.1 bits (66), Expect = 7.2
Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 4/80 (5%)
Query: 550 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEE--SKDVSSESEEESDLSDVENDEVV 607
KK++ KKI VE+ E + K EKK K+ E E+E + E +E
Sbjct: 404 GSKKAT--KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Query: 608 DKKEKNEDSNNKSEESSDDD 627
++EK E+ K ++++ D
Sbjct: 462 AEEEKEEEEEKKKKQATLFD 481
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 31.8 bits (72), Expect = 0.98
Identities = 19/133 (14%), Positives = 44/133 (33%)
Query: 490 DTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNK 549
K + E KE + E+ + KE Q + +++ K
Sbjct: 95 RKTKQKVEEEVKEQLQSLLEKIVVSKQEEDGPGKEPQLDEDKFLLAEDSDDRQETLEAGK 154
Query: 550 HDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 609
+++ + T + K +++ + + V +++ DV ++ ++
Sbjct: 155 VHEETEDSYHVEETASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEE 214
Query: 610 KEKNEDSNNKSEE 622
EDS EE
Sbjct: 215 DNPVEDSKAIKEE 227
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 32.1 bits (73), Expect = 0.99
Identities = 19/91 (20%), Positives = 35/91 (38%)
Query: 539 GGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDL 598
+K+ +N DK+ S + +V E ++ + + D S+ E E E ++
Sbjct: 21 ESSQKRIQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEI 80
Query: 599 SDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
+N V K + S + DDD
Sbjct: 81 EVEQNQSDVLKSSSIVKEESISTDMDGIDDD 111
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 31.6 bits (72), Expect = 1.0
Identities = 16/52 (30%), Positives = 27/52 (51%)
Query: 580 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWE 631
T EE S E L ++E ++V ++ EK+E+ + EE +D DD +
Sbjct: 142 FTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 32.2 bits (73), Expect = 1.1
Identities = 17/64 (26%), Positives = 30/64 (46%)
Query: 568 EAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 627
E+ + + E+++ EE ++ ESEEE + E +EV E+ SE D +
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497
Query: 628 DDWE 631
+ E
Sbjct: 498 EPEE 501
Score = 31.8 bits (72), Expect = 1.4
Identities = 15/95 (15%), Positives = 43/95 (45%)
Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
++ ++K+ + + S + E+P+ ++ +++ E ++ E EE
Sbjct: 397 DTEEEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEE 456
Query: 595 ESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
E + E + +++E+ +++N SEE + +
Sbjct: 457 EEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 31.7 bits (71), Expect = 1.1
Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 5/59 (8%)
Query: 574 AKVEKKDTPEESKDVSSESEEESDL----SDV-ENDEVVDKKEKNEDSNNKSEESSDDD 627
A+ + +P ESK + + S+ DL + EN + +E +ED ++ SEE DDD
Sbjct: 34 AQQDGGKSPPESKGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDD 92
>gnl|CDD|219928 pfam08602, Mgr1, Mgr1-like, i-AAA protease complex subunit. The S.
cerevisiae Mgr1 protein has been shown to be required
for mitochondrial viability in yeast lacking
mitochondrial DNA. It is a mitochondrial inner membrane
protein, which interacts with Yme1 and is a new subunit
of the i-AAA protease complex.
Length = 362
Score = 31.8 bits (72), Expect = 1.1
Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 4/68 (5%)
Query: 53 LYALVFMVALPTAVGMWWYKSIRYTGDKVLLETINFYYAFFQITPHMALKRVLMILGASL 112
L L +V L T +G++++K R G K+ I A F + M + +
Sbjct: 54 LPGLYTLVGLQTGIGLFFFKRFRTLGKKLRKRDI----ADFPSLNRFSTTHGDMHMAPVI 109
Query: 113 EFDRRFNS 120
+ RRF S
Sbjct: 110 PWQRRFVS 117
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 31.9 bits (72), Expect = 1.3
Identities = 22/107 (20%), Positives = 41/107 (38%), Gaps = 7/107 (6%)
Query: 533 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQ------TEAPADVAKVEKKDTPEESK 586
W+ K+K T N KK + +K+ E+ + D + +D ++
Sbjct: 15 WV-RGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDD 73
Query: 587 DVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKY 633
D + E+E D +D+ D+ +S D+ DD +Y
Sbjct: 74 DDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFADSDDESDDGSEY 120
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 31.9 bits (72), Expect = 1.3
Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 582 PEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKS--EESSDDDDDWEK 632
E S+ +S +E + + E + + ++S N S EE +D D+ E
Sbjct: 936 EEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDELES 988
Score = 30.8 bits (69), Expect = 2.8
Identities = 22/58 (37%), Positives = 32/58 (55%), Gaps = 11/58 (18%)
Query: 585 SKDVSSESEEESD----LSDVENDEVVDKKEKNEDSNNKSEESSDDD-------DDWE 631
S D S ESEEE SD E+DE + +E +E S + SE+ S++D +DW+
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWD 984
Score = 29.2 bits (65), Expect = 9.8
Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 580 DTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED-SNNKSEESSDDDDDWEKYQ 634
D +ES++ SE E SD E DE + E +ED S ++SE S D++D E +
Sbjct: 929 DESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWD 984
>gnl|CDD|237662 PRK14292, PRK14292, chaperone protein DnaJ; Provisional.
Length = 371
Score = 31.4 bits (71), Expect = 1.3
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 2 KLTKAYQALTDEESRRNFEKYGNPDGPG 29
++ +AY L+D E R +++++G G G
Sbjct: 46 QINEAYAVLSDAEKRAHYDRFGTAPGAG 73
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 31.6 bits (72), Expect = 1.4
Identities = 17/74 (22%), Positives = 26/74 (35%), Gaps = 14/74 (18%)
Query: 556 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNED 615
Q K + A + ++ D + S D E ESDL D ++D
Sbjct: 227 KQPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDV---------- 276
Query: 616 SNNKSEESSDDDDD 629
S+E +D D
Sbjct: 277 ----SDEDGEDLFD 286
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 31.8 bits (72), Expect = 1.4
Identities = 24/111 (21%), Positives = 42/111 (37%), Gaps = 4/111 (3%)
Query: 540 GKKKKFTKNKHDKKSS---GQKKITNTVEQTEAPADVAKVEKKDTPE-ESKDVSSESEEE 595
G + K+ + +S QKK + + Q ++ D E E + D + + E
Sbjct: 53 GVLRDEGKHANILYNSILCNQKKHASFLNQRKSLDDDDDDEFDFLYEDDEDDAGNATSGE 112
Query: 596 SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTGLNKRDKYLLT 646
S D E+ D+ E + N + + D DD QT + L+
Sbjct: 113 SSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLS 163
>gnl|CDD|220361 pfam09721, Exosortase_EpsH, Transmembrane exosortase
(Exosortase_EpsH). Members of this family are
designated exosortase, analogous to sortase in cell
wall sorting mediated by LPXTG domains in Gram-positive
bacteria. The phylogenetic distribution of the proteins
in this entry is nearly perfectly correlated with the
distribution of the proteins having the PEP-CTERM
anchor motif, IPR013424. Members of this entry are
integral membrane proteins with eight predicted
transmembrane helices in common. Some members of this
family have long trailing sequences past the region
described by this model. This model does not include
the region of the first predicted transmembrane region.
The best characterized member is EpsH of
Methylobacillus sp. 12S, where it is part of a locus
associated with biosynthesis of the exopolysaccharide
methanol-an.
Length = 254
Score = 31.1 bits (71), Expect = 1.5
Identities = 12/55 (21%), Positives = 23/55 (41%), Gaps = 2/55 (3%)
Query: 16 RRNFEKYGNPDGPGAMSFGIALPSYIVEKENSVWVLGLYALVFMVALPTAVGMWW 70
R K PG + +AL Y++ + + V +L L +L + L + +
Sbjct: 39 RDELAKLPPRPSPGLLLLALALLLYLLGRLSYVLLLALLSLPLL--LAGLLLLLL 91
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 31.6 bits (71), Expect = 1.5
Identities = 25/94 (26%), Positives = 35/94 (37%), Gaps = 9/94 (9%)
Query: 536 NKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEE 595
KK K NK S E E A V+ + E+ D S E
Sbjct: 663 TKKTADGKGKKSNKASFDSDD--------EMDENEIWSALVKSRPDVEDDSDDSELDFAE 714
Query: 596 SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
D SD +D+ K + +D + KSE S + D +
Sbjct: 715 DDFSDSTSDD-EPKLDAIDDEDAKSEGSQESDQE 747
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 30.9 bits (70), Expect = 1.7
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 2/60 (3%)
Query: 570 PADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
PA + P S +E EE D DV +++ D+ ED N +EE D++ +
Sbjct: 15 PAVLLTFGNGKGPFASAQDLTEDEEAED--DVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 31.1 bits (70), Expect = 1.8
Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 1/70 (1%)
Query: 562 NTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSE 621
N V + D A + D E ++ ES+ + D E E N + N + E
Sbjct: 155 NIVWDNDEDNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDSY-GEGNRELNEEEE 213
Query: 622 ESSDDDDDWE 631
E ++ DD E
Sbjct: 214 EEAEGSDDGE 223
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 31.1 bits (70), Expect = 1.9
Identities = 14/52 (26%), Positives = 21/52 (40%)
Query: 592 SEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQTGLNKRDKY 643
S + D V D E + ++ EE DDDD+ + T N D+
Sbjct: 258 SNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNTNHGDED 309
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 30.5 bits (69), Expect = 2.0
Identities = 16/78 (20%), Positives = 32/78 (41%), Gaps = 2/78 (2%)
Query: 552 KKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE 611
K +K+ T T + K +K+ EE K S+ ++ + + D + +
Sbjct: 48 PKQPKKKRPT-TPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDP-DNNEESGD 105
Query: 612 KNEDSNNKSEESSDDDDD 629
E ++ S D++DD
Sbjct: 106 SQESASANSLSDIDNEDD 123
>gnl|CDD|183953 PRK13290, ectC, L-ectoine synthase; Reviewed.
Length = 125
Score = 29.4 bits (67), Expect = 2.1
Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 5/33 (15%)
Query: 354 EGEGEIEEEEKPNGTITPVLP---YVTDDHLKH 383
EGEGE+E+ G + P+ P Y D H +H
Sbjct: 63 EGEGEVEDLA--TGEVHPIRPGTMYALDKHDRH 93
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 29.4 bits (66), Expect = 2.1
Identities = 21/65 (32%), Positives = 28/65 (43%), Gaps = 5/65 (7%)
Query: 500 SKEGEGEIEEEEKPNGTI-TPVVNKE---KQQTHRPVWMKNKKGGKKKKFTKNKHDKKSS 555
KEG E++EK V NK+ Q P WM ++ KK K + K KK
Sbjct: 60 KKEGTESEEDDEKMEVDNKAAVRNKKTLRDQHGQYPAWMNQRQ-AKKLKKKREKKKKKKK 118
Query: 556 GQKKI 560
G K +
Sbjct: 119 GAKNL 123
>gnl|CDD|132386 TIGR03343, biphenyl_bphD, 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate
hydrolase. Members of this family are
2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or
HOPD hydrolase, the BphD protein of biphenyl
degradation. BphD acts on the product of ring
meta-cleavage by BphC. Many species carrying bphC and
bphD are capable of degrading polychlorinated biphenyls
as well as biphenyl itself.
Length = 282
Score = 30.7 bits (69), Expect = 2.2
Identities = 21/83 (25%), Positives = 43/83 (51%), Gaps = 12/83 (14%)
Query: 230 IVQGMWDFKNPLLQLPYVTDDHLKHFI--CKKRYIKSLQQFAQMKNEERRSVVKFMNDEQ 287
++QG W + + P +HLK+F+ +K + + A++ + +++V + D++
Sbjct: 183 LLQGRW---ENIQRQP----EHLKNFLISSQKAPLSTWDVTARLGEIKAKTLVTWGRDDR 235
Query: 288 YSDM---LKVLGNMPYVDLHVQP 307
+ + LK+L NMP LHV
Sbjct: 236 FVPLDHGLKLLWNMPDAQLHVFS 258
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 31.2 bits (70), Expect = 2.3
Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 4/84 (4%)
Query: 547 KNKHDKKSSGQ--KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVEND 604
+ KH +K Q KI ++ E+ A D ++E D S V+SE +EE D + ++
Sbjct: 5 QGKHQRKGKKQLENKILHSYEEESAGFDSEELEDNDEQGYSFGVNSEDDEEIDSDEAFDE 64
Query: 605 EVVDKKEKNEDSNNKSEESSDDDD 628
E D+K + S N S+ + D
Sbjct: 65 E--DEKRFADWSFNASKSGKSNKD 86
>gnl|CDD|233222 TIGR00986, 3a0801s05tom22, mitochondrial import receptor subunit
Tom22. The mitochondrial protein translocase (MPT)
family, which brings nuclearly encoded preproteins into
mitochondria, is very complex with 19 currently
identified protein constituents.These proteins include
several chaperone proteins, four proteins of the outer
membrane translocase (Tom) import receptor, five
proteins of the Tom channel complex, five proteins of
the inner membrane translocase (Tim) and three "motor"
proteins. This family is specific for the Tom22 proteins
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 145
Score = 29.9 bits (67), Expect = 2.3
Identities = 13/52 (25%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 559 KITNTVEQTEA-PADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDK 609
++T ++ P VE ++ E+ DV SE +SD +E + D+
Sbjct: 3 QLTEVEDEHFQQPEQQTVVEPEEDDEDFTDVDSEDSVDSDFESLEEETFTDR 54
>gnl|CDD|235943 PRK07133, PRK07133, DNA polymerase III subunits gamma and tau;
Validated.
Length = 725
Score = 30.9 bits (70), Expect = 2.5
Identities = 24/138 (17%), Positives = 54/138 (39%), Gaps = 6/138 (4%)
Query: 496 FLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSS 555
E +E E EI+ ++ +I + KEK+ + ++ K K + D K+
Sbjct: 355 LSELEEEDENEIKFKKIEENSIDNLDIKEKKIENE-NDIEGKSDTKNLEEGFETKDNKNK 413
Query: 556 GQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVV-DKKEKNE 614
I T ++ +K T ++ E+ +E + + ND + + + +E
Sbjct: 414 NSSFINKTENILTNSPLKDELLEKTTEI----INIENPQEFEFGQIGNDIISTEIAQLDE 469
Query: 615 DSNNKSEESSDDDDDWEK 632
+ N D ++++
Sbjct: 470 NQNLIDTGEFDLENNFSN 487
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 30.5 bits (69), Expect = 2.5
Identities = 24/140 (17%), Positives = 49/140 (35%), Gaps = 36/140 (25%)
Query: 528 THRPVWM----KNKKGGKKKK-------------FTKNKHDKKSSG------QKKITNTV 564
HRP + K ++GG + TK K+ + G ++ + +
Sbjct: 4 AHRPTFDPARGKEEQGGNRLSGPSSIYSSRDLPAHTKLKYRQPGQGTEDELRKRDLRAEL 63
Query: 565 EQTEAPADVAKVEKK-----DTPEESKDVSSESEEESD--------LSDVENDEVVDKKE 611
E+ E K K D + E++ + + + + D +
Sbjct: 64 EEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSD 123
Query: 612 KNEDSNNKSEESSDDDDDWE 631
+ DS++ ++S DDD + E
Sbjct: 124 SSSDSDSSDDDSDDDDSEDE 143
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 30.6 bits (69), Expect = 2.6
Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 5/71 (7%)
Query: 535 KNKKGGKKKKFTKNKHDKKSSG----QKKITNTVEQTEAPADVAKVEKKD-TPEESKDVS 589
K K K + K +S + + ++T + E+ + K+ + S S
Sbjct: 10 KFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSKLYSDVDS 69
Query: 590 SESEEESDLSD 600
S SEEE D S+
Sbjct: 70 SSSEEEEDGSE 80
>gnl|CDD|237658 PRK14284, PRK14284, chaperone protein DnaJ; Provisional.
Length = 391
Score = 30.6 bits (69), Expect = 2.7
Identities = 14/35 (40%), Positives = 25/35 (71%), Gaps = 3/35 (8%)
Query: 2 KLTKAYQALTDEESRRNFEKYGNPDGP--GAMSFG 34
++++AY+ L+D + R ++++YG DGP GA FG
Sbjct: 46 EVSEAYEVLSDAQKRESYDRYGK-DGPFAGAGGFG 79
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 30.8 bits (69), Expect = 2.8
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 4/66 (6%)
Query: 568 EAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVV----DKKEKNEDSNNKSEES 623
E + A V K + ++K E EE+ +D E+ D KE + + EES
Sbjct: 373 EQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEES 432
Query: 624 SDDDDD 629
+ DD++
Sbjct: 433 AIDDNE 438
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 30.5 bits (69), Expect = 2.9
Identities = 13/57 (22%), Positives = 25/57 (43%), Gaps = 5/57 (8%)
Query: 571 ADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 627
A +K T + + S +S ++ S + + DK D+NN S + ++D
Sbjct: 25 AYADDPQKDSTAKTTSHDSKKSNDDET-SKDTSSKDTDK----ADNNNTSNQDNNDK 76
Score = 29.4 bits (66), Expect = 6.6
Identities = 5/47 (10%), Positives = 19/47 (40%)
Query: 583 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
+ + + +S S+ + + + ++N + + D++D
Sbjct: 30 PQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDK 76
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 30.5 bits (69), Expect = 2.9
Identities = 15/59 (25%), Positives = 34/59 (57%)
Query: 573 VAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWE 631
V K +K+ P+E ++ E ++E + + +E VD++E+ E+ K+++ + +WE
Sbjct: 27 VEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWE 85
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 30.7 bits (69), Expect = 2.9
Identities = 22/98 (22%), Positives = 42/98 (42%), Gaps = 13/98 (13%)
Query: 534 MKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKD---VSS 590
++++ G +++ + H K +K+ + V E A +E E + + S
Sbjct: 723 IEDESGEVFRRYRISPHGGKRKIKKRNDSVVSVDEEKA----IEGPSRVPERGNHDLLHS 778
Query: 591 ESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 628
E E D ++ EN + EDS++ + ES D D
Sbjct: 779 EDEMADDEAESENM------DDYEDSDDNAYESKDHAD 810
>gnl|CDD|113815 pfam05058, ActA, ActA Protein. The ActA family is found in
Listeria and is associated with motility. ActA protein
acts as a scaffold to assemble and activate host cell
actin cytoskeletal factors at the bacterial surface,
resulting in directional actin polymerisation and
propulsion of the bacterium through the cytoplasm of the
host cell.
Length = 601
Score = 30.6 bits (68), Expect = 3.0
Identities = 28/123 (22%), Positives = 56/123 (45%), Gaps = 12/123 (9%)
Query: 473 TVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPV 532
T T T+++K V K+ L +K E I E + P + K+ + +P+
Sbjct: 454 TTTKTVLKKITPV---KIAPKLAELPATKPQETAIGENK------APFIEKQAETNKQPI 504
Query: 533 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSES 592
M + +K+ ++K + K ++K+ VE++E D ++ EE K ++ +
Sbjct: 505 DMPSLPVIQKEATERDKEEMKPQTEEKM---VEESEPANDANGKKRSAGIEEGKLIAKSA 561
Query: 593 EEE 595
E+E
Sbjct: 562 EDE 564
>gnl|CDD|221403 pfam12064, DUF3544, Domain of unknown function (DUF3544). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 198 to 216 amino acids in length. This domain is
found associated with pfam00628, pfam01753, pfam00439,
pfam00855.
Length = 207
Score = 30.0 bits (67), Expect = 3.1
Identities = 24/79 (30%), Positives = 37/79 (46%), Gaps = 3/79 (3%)
Query: 546 TKNKHDKKSSGQKKITNTVEQTEAPADVAKV-EKKDTPEESKDVSSESEEESDLSDVEND 604
T K +S Q + T+E +A + ++ E T E D SE E+SD SD E
Sbjct: 131 TSPKRQIRSRFQLNLDKTIESCKAQLGINEISEDAYTGVEHSD--SEDSEKSDSSDSEYS 188
Query: 605 EVVDKKEKNEDSNNKSEES 623
++K KNE + +E+
Sbjct: 189 SDDEQKAKNEQEDASDKEA 207
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic
phosphoprotein 1 (DMP1) gene has been mapped to human
chromosome 4q21. DMP1 is a bone and teeth specific
protein initially identified from mineralised dentin.
DMP1 is primarily localised in the nuclear compartment
of undifferentiated osteoblasts. In the nucleus, DMP1
acts as a transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates
mineralised matrix formation extracellularly, at later
stages of osteoblast maturation. The DMP1 gene has been
found to be ectopically expressed in lung cancer
although the reason for this is unknown.
Length = 514
Score = 30.4 bits (68), Expect = 3.2
Identities = 20/70 (28%), Positives = 36/70 (51%), Gaps = 7/70 (10%)
Query: 577 EKKDTPEESKDV---SSESEEESDLSDVEND----EVVDKKEKNEDSNNKSEESSDDDDD 629
E+ + E+S++V SSES +E+DL EN E V + + ++ +N + S D +D
Sbjct: 317 EENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDS 376
Query: 630 WEKYQTGLNK 639
+ L+
Sbjct: 377 ESSEEDSLDT 386
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.9 bits (69), Expect = 3.2
Identities = 20/106 (18%), Positives = 46/106 (43%), Gaps = 6/106 (5%)
Query: 523 KEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTP 582
K+ ++ + + + +K + +KK + +A A K E+K
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE--AKKKADAAKKKAEEKKKA 1393
Query: 583 EESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 628
+E+K + E ++++D E + K+K +++ K+EE D+
Sbjct: 1394 DEAKKKAEEDKKKAD----ELKKAAAAKKKADEAKKKAEEKKKADE 1435
Score = 29.3 bits (65), Expect = 8.1
Identities = 32/143 (22%), Positives = 54/143 (37%), Gaps = 4/143 (2%)
Query: 490 DTIKVQFLEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNK 549
D K + E K E E E K + + K K KKK + K
Sbjct: 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKK 1391
Query: 550 H----DKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDE 605
KK+ KK + +++ A A KK E+ K ++ + E E +
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKK 1451
Query: 606 VVDKKEKNEDSNNKSEESSDDDD 628
++ +K E++ K+EE+ D+
Sbjct: 1452 KAEEAKKAEEAKKKAEEAKKADE 1474
Score = 29.3 bits (65), Expect = 9.0
Identities = 27/145 (18%), Positives = 60/145 (41%), Gaps = 6/145 (4%)
Query: 498 EPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQ 557
E K+ E ++E + + E++ + K KKK + K ++ +
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE-K 1688
Query: 558 KKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSN 617
K ++ E +++KK+ E+ K + EE + E + K++ E+
Sbjct: 1689 KAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE-----EAKKEAEEDK 1743
Query: 618 NKSEESSDDDDDWEKYQTGLNKRDK 642
K+EE+ D+++ +K + +K
Sbjct: 1744 KKAEEAKKDEEEKKKIAHLKKEEEK 1768
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 30.6 bits (69), Expect = 3.4
Identities = 28/158 (17%), Positives = 51/158 (32%), Gaps = 25/158 (15%)
Query: 497 LEPSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSG 556
L ++++ ++ +K+ +P K K SG
Sbjct: 1304 LASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSG 1363
Query: 557 QKKITNTVEQTEA-----PADVAKVEKKDTPEESKDV--------SSESEEESDLSDVEN 603
QK +T ++ EA V K+ ++S V +ES E S
Sbjct: 1364 QKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSE 1423
Query: 604 DEVVD------------KKEKNEDSNNKSEESSDDDDD 629
+ +D K+ S+++SE + D D D
Sbjct: 1424 KDEIDVSAKPRPQRANRKQTTYVLSDSESESADDSDFD 1461
>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
C-terminal ATPase domain [Signal transduction
mechanisms].
Length = 456
Score = 30.1 bits (68), Expect = 3.7
Identities = 28/134 (20%), Positives = 50/134 (37%), Gaps = 10/134 (7%)
Query: 88 FYYAFFQITPHMALKRVLMILGASLEFDRRFNSEIIERPSDEIEVPQLIRQIPNLGEKNR 147
+ L +L ++ + SE+ E S+ I ++ ++ L E+N
Sbjct: 191 VILLLSSFSSRSILLPILGLILLLSRIALGYLSELKEI-SETIN--EMEERLEYLIEENY 247
Query: 148 ERPLYHKYSIKARALIYAHLSNMQLTSETLEADRMYIVKKCPYLLQEMVTCISQLILLAY 207
+ RAL N TLE RM + P ++V +S+L+ +
Sbjct: 248 SLEQEQLE-AELRAL--QSQINPHFLYNTLETIRMLAEEDDPEEAAKVVKALSKLLRYSL 304
Query: 208 AQRVPRLIHIETLE 221
+ L +I TLE
Sbjct: 305 S----NLDNIVTLE 314
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 30.1 bits (68), Expect = 3.7
Identities = 15/93 (16%), Positives = 26/93 (27%), Gaps = 4/93 (4%)
Query: 537 KKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEES 596
K + F ++K KK+ E K ++ T ES S+ +
Sbjct: 106 KSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRP 165
Query: 597 DLSDVENDEVV----DKKEKNEDSNNKSEESSD 625
+ + KK S S +
Sbjct: 166 STPHLLKAKEGPSRRAKKAAKLSSTASSGDEKS 198
>gnl|CDD|216161 pfam00865, Osteopontin, Osteopontin.
Length = 293
Score = 30.1 bits (67), Expect = 3.8
Identities = 24/101 (23%), Positives = 47/101 (46%), Gaps = 4/101 (3%)
Query: 539 GGKKKKFTKNKHDKKSSGQKKITNTVEQTE-APADVAKVEKKDTPEESK--DVSSESEEE 595
G ++K NKH + K + +Q AP + E+ D ++ S+ES E
Sbjct: 6 GSSEEKQLYNKHPDAVATWLKPDPSQKQNLLAPQNSVSSEETDDLKQETLPSKSNESHEH 65
Query: 596 -SDLSDVENDEVVDKKEKNEDSNNKSEESSDDDDDWEKYQT 635
D D ++D+ VD ++ N + ++ ++ + D D E + +
Sbjct: 66 MDDDDDDDDDDHVDSQDSNSNESDDADHTDDSDHSDESHHS 106
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 28.8 bits (64), Expect = 3.9
Identities = 9/72 (12%), Positives = 27/72 (37%)
Query: 563 TVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSEE 622
A A+ ++ ++ + + E+ E + + +E D+ +E
Sbjct: 59 EKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEA 118
Query: 623 SSDDDDDWEKYQ 634
D+++ E+ +
Sbjct: 119 EEADEEEDEEAE 130
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 29.3 bits (65), Expect = 4.3
Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 10/112 (8%)
Query: 534 MKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQT--EAPADVAKVEKKDTPEESKDVSSE 591
M N K K K + + + ++K+ VE+ + P D+ + E +D PEE + E
Sbjct: 1 MGNYCRKNKAKEPKRRDIDELAEREKLKKEVEEIPEQKPEDIVE-ELEDQPEEPPEQEEE 59
Query: 592 SEEESDLSDVENDEVVDK--KEKNEDSNNKS-----EESSDDDDDWEKYQTG 636
+EE+ +++ +K EKN D +S ES D+ +K +TG
Sbjct: 60 NEEQKPKEEIDYPIQENKSFDEKNLDDLERSNSDIYSESHKYDNASDKLETG 111
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 30.0 bits (67), Expect = 4.5
Identities = 19/68 (27%), Positives = 32/68 (47%), Gaps = 8/68 (11%)
Query: 569 APADVAKVEKKDTPEESKDVSS-------ESEEESDLSDVENDEVVDKKEKNEDSNNKSE 621
APAD VE + E +D + +S+E+ DE D ++ ++ S + S
Sbjct: 423 APAD-QDVEAEAHDEFDQDPGAPAHADRADSDEDDMDEQESGDERADGEDDSDSSYSYST 481
Query: 622 ESSDDDDD 629
SS+D+ D
Sbjct: 482 TSSEDESD 489
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 30.0 bits (67), Expect = 4.7
Identities = 11/67 (16%), Positives = 25/67 (37%), Gaps = 3/67 (4%)
Query: 573 VAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKE---KNEDSNNKSEESSDDDDD 629
+ + + E KD +S ++ +K E +N + + EE D +D+
Sbjct: 336 LNNEKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395
Query: 630 WEKYQTG 636
+ +
Sbjct: 396 NDHSKRI 402
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 30.1 bits (68), Expect = 4.7
Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 2/60 (3%)
Query: 572 DVAKVEKKDTPEESKDVSSESEEESD--LSDVENDEVVDKKEKNEDSNNKSEESSDDDDD 629
D A + + +E +D SS+S E D + E+ N + S S +DDD
Sbjct: 486 DDADEDDDEDDDEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDD 545
Score = 29.3 bits (66), Expect = 8.6
Identities = 19/122 (15%), Positives = 43/122 (35%), Gaps = 25/122 (20%)
Query: 537 KKGGKKKKFTKNKHDKKSSGQ-KKITNTVEQ-TEAPADVAKVEKKDTPEESKDVSSESEE 594
++ G KK NK D +++ +++E+ + E D +E D +SE+
Sbjct: 393 ERQGDKKDLKSNKEDANEVDDLEEVVSSMEEFLNKVSSFEGAEFADDEDEDDDEPDDSED 452
Query: 595 ESDLSDVEN-----------------------DEVVDKKEKNEDSNNKSEESSDDDDDWE 631
+ D + + D+ + +D ++ S ++ E
Sbjct: 453 KDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDSDDADEDDDEDDDEDEDSSSDSTLEELE 512
Query: 632 KY 633
+Y
Sbjct: 513 EY 514
>gnl|CDD|221866 pfam12935, Sec16_N, Vesicle coat trafficking protein Sec16
N-terminus. Sec16 is a multi-domain vesicle coat
protein. The overall function of Sec16 is in mediating
the movement of protein-cargo between the organelles of
the secretory pathway. Over-expression of truncated
mutants of only the N-terminus are lethal, and this
portion does not appear to be essential for function so
may act as a stabilising region.
Length = 246
Score = 29.5 bits (66), Expect = 4.9
Identities = 21/151 (13%), Positives = 49/151 (32%), Gaps = 13/151 (8%)
Query: 489 DDTIKVQFLEPSKEGEGEIEE---EEKPNGT---ITPVVNKEKQQTHRPVWMKNKKGGKK 542
D+ + + +E S I+ E++ + + E + +P +K +
Sbjct: 47 DNGEQNEPVEESAPQTVAIDSVFVEDEDDEGSDFFNSLHEGEAVEEQQPPPHLTRKSTSQ 106
Query: 543 KKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVE 602
+ + S + + + + + S+SE S+
Sbjct: 107 VLDSLGLNPDSLSSPASAEPLDPTAQNEFSNVL---AASTDGNPESESQSEPSSEEELAA 163
Query: 603 NDEVVDKK-EKNEDSNNKSEESS---DDDDD 629
E+ D + E ++ +E D+DDD
Sbjct: 164 RAELSDDESESTPTEDDLAERWQAFLDNDDD 194
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 29.7 bits (67), Expect = 5.3
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 14/121 (11%)
Query: 533 WMKNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKK-DTPEESKDV--- 588
W+K K +KKK + K ++K E T KV K + EE +DV
Sbjct: 94 WLKKSKKRQKKKEAERKKALLLDEKEK-ERAAEYTSEDLAGLKVGHKVEEFEEGEDVILT 152
Query: 589 -----SSESEEESDLSDVENDEVVDKK--EKNEDSNNKSEESSDDDDDWEKYQTGLNKRD 641
E E+E D +EN E+V+K+ +KN + K + DDDD ++ L+K D
Sbjct: 153 LKDTGVLEDEDEGDE--LENVELVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYD 210
Query: 642 K 642
+
Sbjct: 211 E 211
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 29.9 bits (67), Expect = 5.5
Identities = 38/183 (20%), Positives = 60/183 (32%), Gaps = 19/183 (10%)
Query: 456 EVIDDEATTEYTAGAIITVTCTLMRKPMSVLFGDDTIKVQFLEPSKEGEGEIEEEEKPNG 515
EV +D+A++ A K + GD +K + K+ ++ E+ N
Sbjct: 368 EVFEDQASSVKNKQAQSEENLNESPKTPIAVTGDPHLKDPTIIAGKKLMNKLTSEKINNP 427
Query: 516 TITPVVNKEKQQTH--------------RPVWMKNKKGGKKKKFTKNKHDKKSSGQKKIT 561
V+K K PV + +K K K TK K+ K + K
Sbjct: 428 VKVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLKNFKPVPNKSKKQ 487
Query: 562 NTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNNKSE 621
++ KV K T ++ N E DK+ S KS+
Sbjct: 488 LANNNSQNI-KSKKVVKAKTNNKANLQDVGECSSPPN----NKEKNDKQTSTSSSVLKSD 542
Query: 622 ESS 624
SS
Sbjct: 543 RSS 545
>gnl|CDD|148137 pfam06339, Ectoine_synth, Ectoine synthase. This family consists
of several bacterial ectoine synthase proteins. The
ectABC genes encode the diaminobutyric acid
acetyltransferase (EctA), the diaminobutyric acid
aminotransferase (EctB), and the ectoine synthase
(EctC). Together these proteins constitute the ectoine
biosynthetic pathway.
Length = 127
Score = 28.3 bits (64), Expect = 5.8
Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 5/33 (15%)
Query: 354 EGEGEIEEEEKPNGTITPVLP---YVTDDHLKH 383
EGEGEI E+ G + P+ P Y D H +H
Sbjct: 63 EGEGEI--EDLATGEVHPIKPGTMYALDKHDRH 93
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 29.7 bits (67), Expect = 6.0
Identities = 17/53 (32%), Positives = 22/53 (41%), Gaps = 3/53 (5%)
Query: 576 VEKKDTPEESKDVSSESEEESDLSDVEND---EVVDKKEKNEDSNNKSEESSD 625
+E +D E D E EEE + D + E D +E ED E SD
Sbjct: 379 LEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSD 431
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 27.4 bits (61), Expect = 6.1
Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 4/61 (6%)
Query: 535 KNKKGGKKKKFTKNKHDKKSSGQKKITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEE 594
K KK KK K K KK + K+ T E+ E A+ + K+ E+ + E EE
Sbjct: 13 KGKKIDVKK---KKKKKKKKNKSKEEVVT-EKEEEEKSSAESDLKEGEEDEDNEKIEQEE 68
Query: 595 E 595
+
Sbjct: 69 D 69
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 29.4 bits (65), Expect = 6.5
Identities = 23/133 (17%), Positives = 51/133 (38%), Gaps = 11/133 (8%)
Query: 499 PSKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQK 558
E E ++K TI + K +++ + KK K + DKK S K
Sbjct: 416 SHLPASNESESDDKLETTIEKLDRKLRERQENRKERQLKK--TKDDSDVDLKDKKESINK 473
Query: 559 KITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEKNEDSNN 618
K +E+ +E ++ +E+ + D + + + K EK + +
Sbjct: 474 KNKKGKHA---------IERTAASKEELELIKADDEDDEQLDHFDMKSILKAEKFKKNRK 524
Query: 619 KSEESSDDDDDWE 631
+++S+ ++ +
Sbjct: 525 LKKKASNLEEGFV 537
>gnl|CDD|221883 pfam12985, DUF3869, Domain of unknown function (DUF3869). A family
based on the N-terminal domain of 3KOG, which shows weak
but consistent remote homology with adhesive families
such as immunoglobulins and cadherins, suggesting it
might form an attachment module.
Length = 108
Score = 27.8 bits (62), Expect = 6.9
Identities = 20/52 (38%), Positives = 24/52 (46%), Gaps = 6/52 (11%)
Query: 441 DEATTEYTAGAIITPEVIDDEATTEYTAGAIITVTCTLMRKPMSVLFGDDTI 492
+ A A A I P V D ATT G I+T T T+ SV G+ TI
Sbjct: 12 NVAVEPAPAKATIAPTVYD--ATT----GTIVTGTATVTISAGSVTTGNGTI 57
>gnl|CDD|221193 pfam11735, CAP59_mtransfer, Cryptococcal mannosyltransferase 1.
The capsule of pathogenic fungi is a complex
polysaccharide whose formation is determined by a number
of enzymes including, most importantly,
alpha-1,3-mannosyltransferase 1, EC:2.4.1.-.
Length = 244
Score = 28.8 bits (65), Expect = 7.5
Identities = 19/56 (33%), Positives = 24/56 (42%), Gaps = 7/56 (12%)
Query: 99 MALKRVLMILGASLEFDRRFNSEIIERPSDEIEVPQLIRQIPNLGE-KNRE-RPLY 152
AL L LG RR + DE +P +R+IP L E +N PLY
Sbjct: 54 RALDPELDALGV-----RRSITLSEITHLDEDSLPPRLRRIPYLAELRNLALEPLY 104
>gnl|CDD|184611 PRK14297, PRK14297, chaperone protein DnaJ; Provisional.
Length = 380
Score = 29.0 bits (65), Expect = 7.6
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 3 LTKAYQALTDEESRRNFEKYGNPDGPGAMSFG 34
+ +AYQ L+D + + ++++G D GA FG
Sbjct: 50 INEAYQVLSDPQKKAQYDQFGTADFNGAGGFG 81
>gnl|CDD|226907 COG4530, COG4530, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 129
Score = 27.9 bits (62), Expect = 8.4
Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 6/61 (9%)
Query: 574 AKVEKKDTPEESKDVSSESEEE------SDLSDVENDEVVDKKEKNEDSNNKSEESSDDD 627
V++ D E +V S + + DL D+ +DE VD + ++D+ + EE DDD
Sbjct: 56 EVVKEVDAENEEVEVVSLEDADDDPKGGDDLPDLGDDEDVDLDDDDDDTFLEDEEDDDDD 115
Query: 628 D 628
Sbjct: 116 V 116
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 28.8 bits (64), Expect = 8.8
Identities = 16/82 (19%), Positives = 26/82 (31%), Gaps = 4/82 (4%)
Query: 535 KNKKGGKKKKFTKNKHDKKSSGQK----KITNTVEQTEAPADVAKVEKKDTPEESKDVSS 590
+ KK KK T + K K K + + V KK T + K S
Sbjct: 14 EEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESV 73
Query: 591 ESEEESDLSDVENDEVVDKKEK 612
+ E + + + + E
Sbjct: 74 KKESVAKKTVKKEAVSAEVFEA 95
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 28.9 bits (64), Expect = 9.1
Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 4/74 (5%)
Query: 560 ITNTVEQTEAPADVAKVEKKDTPEESKDVSSESEEESDLSDVENDEVVDKKEK----NED 615
I N E ++ + EE + E EE+ D+ ++E E V+ E+ +ED
Sbjct: 77 IANDKSDNENDVELEGLNIIVKNEEERGTQKEEEEDEDVEEIEEVEEVEVVEEEYDDDED 136
Query: 616 SNNKSEESSDDDDD 629
S E+ SD + D
Sbjct: 137 SEKDDEKESDAEGD 150
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 28.9 bits (63), Expect = 9.1
Identities = 36/143 (25%), Positives = 55/143 (38%), Gaps = 26/143 (18%)
Query: 500 SKEGEGEIEEEEKPNGTITPVVNKEKQQTHRPVWMKNKKGGKKKKFTKNKHDKKSSGQKK 559
+K+ + + E E P + TPV PV + +K KNK DKK S +K
Sbjct: 259 NKKNKRKSESENVPAASETPV---------EPVVETTPPASENQK--KNKKDKKKSESEK 307
Query: 560 ITNTVEQTEAPAD---------------VAKVEKKDTPEESKDVSSESEEESDLSDVEND 604
+ Q EAP AK E KD P E+ E E+ + +V
Sbjct: 308 VVEEPVQAEAPKSKKPTADDNMDFLDFVTAKEEPKDEPAETPAAPVEEVVENVVENVVEK 367
Query: 605 EVVDKKEKNEDSNNKSEESSDDD 627
+N+ N K ++ S+ +
Sbjct: 368 STTPPATENKKKNKKDKKKSESE 390
>gnl|CDD|147374 pfam05160, DSS1_SEM1, DSS1/SEM1 family. This family contains the
breast cancer tumour suppressor BRCA2-interacting
protein DSS1 and its homologue SEM1, both of which are
short acidic proteins. DSS1 has been shown to be a
conserved component of the Rae1 mediated mRNA export
pathway in Schizosaccharomyces pombe.
Length = 63
Score = 26.5 bits (59), Expect = 9.2
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 593 EEESDLSDVENDEVVDKKEKNEDSNNKSEESSDDDD 628
EE+ + D ++ +++ + E+ NN EE DDDD
Sbjct: 12 EEDDEFEDFPVEDWPEEETEKENGNNLWEEDWDDDD 47
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.132 0.378
Gapped
Lambda K H
0.267 0.0705 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,118,197
Number of extensions: 3311735
Number of successful extensions: 3913
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3441
Number of HSP's successfully gapped: 264
Length of query: 646
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 543
Effective length of database: 6,369,140
Effective search space: 3458443020
Effective search space used: 3458443020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 62 (27.7 bits)