RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11552
(714 letters)
>gnl|CDD|214539 smart00147, RasGEF, Guanine nucleotide exchange factor for Ras-like
small GTPases.
Length = 242
Score = 205 bits (523), Expect = 3e-61
Identities = 76/198 (38%), Positives = 113/198 (57%), Gaps = 13/198 (6%)
Query: 467 RYKFRGPNRFQQITANLDVFLRRFNEIQYWVITEILLVTSLNKRVQILRKMIKLAAYCKE 526
+ ++ NL+ F+RRFNE+ WV TEIL T+ R ++L K I++A +C+E
Sbjct: 31 GSVWGKRSKKSPSPLNLEAFIRRFNEVSNWVATEILKQTTPKDRAELLSKFIQVAKHCRE 90
Query: 527 YRNINALFAVLMGLSNVAVSRLSLTWDKLPSKSKKTYTELEALIDPSKNHRAYRQAVSKL 586
N N+L A++ LS+ +SRL TW+KLPSK KK + ELE L+ P +N++ YR+A+S
Sbjct: 91 LNNFNSLMAIVSALSSSPISRLKKTWEKLPSKYKKLFEELEELLSPERNYKNYREALSSC 150
Query: 587 -QSPVIPFMPLLLKDLAFTHDGNKTVV-DGLVNFEKMHMFAQTLRTLRYCRTRHLDHDKT 644
P IPF+ +LLKDL F +GN + +GLVNFEK A+ LR +R +++
Sbjct: 151 NLPPCIPFLGVLLKDLTFIDEGNPDFLENGLVNFEKRRQIAEILREIRQLQSQPY----- 205
Query: 645 VVDGLVNFEKMHMFAQTL 662
N Q+L
Sbjct: 206 ------NLRPNRSDIQSL 217
>gnl|CDD|216025 pfam00617, RasGEF, RasGEF domain. Guanine nucleotide exchange
factor for Ras-like small GTPases.
Length = 185
Score = 200 bits (510), Expect = 3e-60
Identities = 65/151 (43%), Positives = 97/151 (64%), Gaps = 1/151 (0%)
Query: 474 NRFQQITANLDVFLRRFNEIQYWVITEILLVTSLNKRVQILRKMIKLAAYCKEYRNINAL 533
+ +++ N++ F+ RFN++ WV TEIL L KR ++++K IK+A +C+E N N+L
Sbjct: 35 KKDSKLSPNIEAFISRFNQLSRWVATEILSEEDLKKRAKVIKKFIKIAQHCRELNNFNSL 94
Query: 534 FAVLMGLSNVAVSRLSLTWDKLPSKSKKTYTELEALIDPSKNHRAYRQAVSKLQSPVIPF 593
A++ GL++ +SRL TW+ + K KK ELE L+ PSKN + YR+A+ K P IPF
Sbjct: 95 MAIISGLNSSPISRLKKTWELVSKKYKKLLEELEELMSPSKNFKNYREALKKANPPCIPF 154
Query: 594 MPLLLKDLAFTHDGNKTVVD-GLVNFEKMHM 623
+ L LKDL F +GN +D GL+NFEK
Sbjct: 155 LGLYLKDLTFIEEGNPDFLDNGLINFEKRRK 185
>gnl|CDD|238087 cd00155, RasGEF, Guanine nucleotide exchange factor for Ras-like
small GTPases. Small GTP-binding proteins of the Ras
superfamily function as molecular switches in
fundamental events such as signal transduction,
cytoskeleton dynamics and intracellular trafficking.
Guanine-nucleotide-exchange factors (GEFs) positively
regulate these GTP-binding proteins in response to a
variety of signals. GEFs catalyze the dissociation of
GDP from the inactive GTP-binding proteins. GTP can then
bind and induce structural changes that allow
interaction with effectors.
Length = 237
Score = 196 bits (501), Expect = 4e-58
Identities = 72/188 (38%), Positives = 114/188 (60%), Gaps = 5/188 (2%)
Query: 480 TANLDVFLRRFNEIQYWVITEILLVTSLNKRVQILRKMIKLAAYCKEYRNINALFAVLMG 539
+ NL+ F+ RFN + WV +EILL T+ KR ++L K I++A +C+E N N+L A++
Sbjct: 44 SPNLERFIERFNNLSNWVASEILLCTNPKKRARLLSKFIQVAKHCRELNNFNSLMAIVSA 103
Query: 540 LSNVAVSRLSLTWDKLPSKSKKTYTELEALIDPSKNHRAYRQAVSKL--QSPVIPFMPLL 597
LS+ +SRL TW+ L SK KK + ELE L+DPS+N + YR+ + + P +PF+ +
Sbjct: 104 LSSSPISRLKKTWEVLSSKLKKLFEELEELVDPSRNFKNYRKLLKSVGPNPPCVPFLGVY 163
Query: 598 LKDLAFTHDGNKTVVD-GLVNFEKMHMFAQTLRTLRYCRTRH--LDHDKTVVDGLVNFEK 654
LKDL F H+GN ++ LVNFEK A+ LR +R ++ L+ D+ ++ L +
Sbjct: 164 LKDLTFLHEGNPDFLEGNLVNFEKRRKIAEILREIRQLQSNSYELNRDEDILAFLWKLLE 223
Query: 655 MHMFAQTL 662
+ + L
Sbjct: 224 LILNEDEL 231
>gnl|CDD|239884 cd04437, DEP_Epac, DEP (Dishevelled, Egl-10, and Pleckstrin) domain
found in Epac-like proteins. Epac (exchange proteins
directly activated by cAMP) proteins are GEFs
(guanine-nucleotide-exchange factors) for the small
GTPases, Rap1 and Rap2. They are directly regulated by
cyclic AMP, a second messenger that plays a role in the
control of diverse cellular processes, such as cell
adhesion and insulin secretion. Epac-like proteins
share a common domain architecture, containing RasGEF,
DEP and CAP-effector (cAMP binding) domains. The DEP
domain is involved in membrane localization.
Length = 125
Score = 115 bits (291), Expect = 2e-30
Identities = 48/106 (45%), Positives = 63/106 (59%), Gaps = 5/106 (4%)
Query: 143 TIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILYN 202
R+C G+ELVDWL+ +P + SR GMWQ LLEEGV+ HV+ E F+DK Y
Sbjct: 25 RTYRQCCVGTELVDWLLQQSPCVQ-SRSQAVGMWQVLLEEGVLLHVDQELHFQDKYQFYR 83
Query: 203 FWQDKEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRK 248
F D+ + EAEE L+EA+ L++ PDA+LR ILRK
Sbjct: 84 FSDDECSP----APLEKREAEEELQEAVTLLSQLGPDALLRMILRK 125
Score = 81.6 bits (202), Expect = 2e-18
Identities = 33/71 (46%), Positives = 42/71 (59%), Gaps = 3/71 (4%)
Query: 44 AGWVLRTLLLNDESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMW 103
AG LR +L+D +RDRK R+C G+ELVDWL+ +P + SR GMW
Sbjct: 1 AGRALRNAILSDAPHLIRDRKYHL--RTYRQCCVGTELVDWLLQQSPCVQ-SRSQAVGMW 57
Query: 104 QALLEEGVIYH 114
Q LLEEGV+ H
Sbjct: 58 QVLLEEGVLLH 68
>gnl|CDD|237999 cd00038, CAP_ED, effector domain of the CAP family of transcription
factors; members include CAP (or cAMP receptor protein
(CRP)), which binds cAMP, FNR (fumarate and nitrate
reduction), which uses an iron-sulfur cluster to sense
oxygen) and CooA, a heme containing CO sensor. In all
cases binding of the effector leads to conformational
changes and the ability to activate transcription.
Cyclic nucleotide-binding domain similar to CAP are also
present in cAMP- and cGMP-dependent protein kinases
(cAPK and cGPK) and vertebrate cyclic nucleotide-gated
ion-channels. Cyclic nucleotide-monophosphate binding
domain; proteins that bind cyclic nucleotides (cAMP or
cGMP) share a structural domain of about 120 residues;
the best studied is the prokaryotic catabolite gene
activator, CAP, where such a domain is known to be
composed of three alpha-helices and a distinctive
eight-stranded, antiparallel beta-barrel structure;
three conserved glycine residues are thought to be
essential for maintenance of the structural integrity of
the beta-barrel; CooA is a homodimeric transcription
factor that belongs to CAP family; cAMP- and
cGMP-dependent protein kinases (cAPK and cGPK) contain
two tandem copies of the cyclic nucleotide-binding
domain; cAPK's are composed of two different subunits, a
catalytic chain and a regulatory chain, which contains
both copies of the domain; cGPK's are single chain
enzymes that include the two copies of the domain in
their N-terminal section; also found in vertebrate
cyclic nucleotide-gated ion-channels.
Length = 115
Score = 62.0 bits (151), Expect = 9e-12
Identities = 28/81 (34%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 310 PLHHLSNSVRRELAGVVMFEAHPRKGEILFHQGDEGKSWYIIIQGSVDVVIYGKG----C 365
L + ELA + E GE++ QGD S YI++ GSV+V +
Sbjct: 1 LFSGLDDEELEELADA-LEERRFPAGEVIIRQGDPADSLYIVLSGSVEVYKLDEDGREQI 59
Query: 366 VTSLYAGEDFGKLALVNNAPR 386
V L G+ FG+LAL+ N PR
Sbjct: 60 VGFLGPGDLFGELALLGNGPR 80
Score = 47.3 bits (113), Expect = 2e-06
Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 4/55 (7%)
Query: 391 SMVFHQGDEGKSWYIIIQGSVDVVIYGKG----CVTSLYAGEDFGKLALVNNAPS 441
++ QGD S YI++ GSV+V + V L G+ FG+LAL+ N P
Sbjct: 26 EVIIRQGDPADSLYIVLSGSVEVYKLDEDGREQIVGFLGPGDLFGELALLGNGPR 80
>gnl|CDD|239836 cd04371, DEP, DEP domain, named after Dishevelled, Egl-10, and
Pleckstrin, where this domain was first discovered. The
function of this domain is still not clear, but it is
believed to be important for the membrane association of
the signaling proteins in which it is present. New
studies show that the DEP domain of Sst2, a yeast RGS
protein is necessary and sufficient for receptor
interaction.
Length = 81
Score = 57.7 bits (140), Expect = 1e-10
Identities = 25/63 (39%), Positives = 37/63 (58%), Gaps = 4/63 (6%)
Query: 140 GGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHV-NGEQAFRDKC 198
+T C +GSELVDWL+ A++R+ + QALL+ G+I+HV + + FRD
Sbjct: 21 HLKTY-PNCFTGSELVDWLLDN--LEAITREEAVELGQALLKHGLIHHVSDDKHTFRDSY 77
Query: 199 ILY 201
LY
Sbjct: 78 ALY 80
Score = 51.2 bits (123), Expect = 3e-08
Identities = 25/68 (36%), Positives = 41/68 (60%), Gaps = 4/68 (5%)
Query: 47 VLRTLLLNDESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQAL 106
++R +L +D ++DRK +T C +GSELVDWL+ A++R+ + QAL
Sbjct: 2 LVRIMLDSDSGVPIKDRKY-HLKTY-PNCFTGSELVDWLLDN--LEAITREEAVELGQAL 57
Query: 107 LEEGVIYH 114
L+ G+I+H
Sbjct: 58 LKHGLIHH 65
>gnl|CDD|216020 pfam00610, DEP, Domain found in Dishevelled, Egl-10, and Pleckstrin
(DEP). The DEP domain is responsible for mediating
intracellular protein targeting and regulation of
protein stability in the cell. The DEP domain is present
in a number of signaling molecules, including Regulator
of G protein Signaling (RGS) proteins, and has been
implicated in membrane targeting. New findings in yeast,
however, demonstrate a major role for a DEP domain in
mediating the interaction of an RGS protein to the
C-terminal tail of a GPCR, thus placing RGS in close
proximity with its substrate G protein alpha subunit.
Length = 74
Score = 56.5 bits (137), Expect = 3e-10
Identities = 24/66 (36%), Positives = 34/66 (51%), Gaps = 3/66 (4%)
Query: 140 GGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGE-QAFRD-K 197
+T C +GSE VDWLM L + R+ + Q LL+ G+I+HV + + F D K
Sbjct: 10 HLKTY-PNCFTGSEAVDWLMDNFEGLVIDREEAVELGQLLLDHGLIHHVGDKHRGFLDSK 68
Query: 198 CILYNF 203
Y F
Sbjct: 69 YSFYRF 74
Score = 53.8 bits (130), Expect = 3e-09
Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 2/56 (3%)
Query: 59 TLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYH 114
L+DRK +T C +GSE VDWLM L + R+ + Q LL+ G+I+H
Sbjct: 3 KLKDRK-KHLKTY-PNCFTGSEAVDWLMDNFEGLVIDREEAVELGQLLLDHGLIHH 56
>gnl|CDD|197516 smart00100, cNMP, Cyclic nucleotide-monophosphate binding domain.
Catabolite gene activator protein (CAP) is a prokaryotic
homologue of eukaryotic cNMP-binding domains, present in
ion channels, and cNMP-dependent kinases.
Length = 120
Score = 56.6 bits (137), Expect = 7e-10
Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 313 HLSNSVRRELAGVVMFEAHPRKGEILFHQGDEGKSWYIIIQGSVDVVI---YGKGCVTSL 369
+L RELA + GE++ QGD G S+YII+ G V+V G+ +
Sbjct: 4 NLDAEELRELADA-LEPVRYPAGEVIIRQGDVGDSFYIIVSGEVEVYKVLEDGEEQIVGT 62
Query: 370 YAGED-FGKLALVNNAPR 386
D FG+LAL+ N+ R
Sbjct: 63 LGPGDFFGELALLTNSRR 80
Score = 44.7 bits (106), Expect = 1e-05
Identities = 19/55 (34%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 393 VFHQGDEGKSWYIIIQGSVDVVI---YGKGCVTSLYAGED-FGKLALVNNAPSYQ 443
+ QGD G S+YII+ G V+V G+ + D FG+LAL+ N+
Sbjct: 28 IIRQGDVGDSFYIIVSGEVEVYKVLEDGEEQIVGTLGPGDFFGELALLTNSRRAA 82
>gnl|CDD|214489 smart00049, DEP, Domain found in Dishevelled, Egl-10, and
Pleckstrin. Domain of unknown function present in
signalling proteins that contain PH, rasGEF, rhoGEF,
rhoGAP, RGS, PDZ domains. DEP domain in Drosophila
dishevelled is essential to rescue planar polarity
defects and induce JNK signalling (Cell 94, 109-118).
Length = 77
Score = 55.0 bits (133), Expect = 1e-09
Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 143 TIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNG--EQAFRDKCIL 200
C +GSELVDWLM + R+ + Q LL+EG+I+HVNG + F+D L
Sbjct: 15 KTYPNCFTGSELVDWLMDNLEII--DREEAVHLGQLLLDEGLIHHVNGPNKHTFKDSKAL 72
Query: 201 YNF 203
Y F
Sbjct: 73 YRF 75
Score = 45.7 bits (109), Expect = 2e-06
Identities = 21/60 (35%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Query: 55 DESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYH 114
E+G + +T C +GSELVDWLM + R+ + Q LL+EG+I+H
Sbjct: 1 PETGLKLRDRKYFLKTY-PNCFTGSELVDWLMDNLEII--DREEAVHLGQLLLDEGLIHH 57
>gnl|CDD|215664 pfam00027, cNMP_binding, Cyclic nucleotide-binding domain.
Length = 91
Score = 51.1 bits (123), Expect = 3e-08
Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 4/58 (6%)
Query: 333 RKGEILFHQGDEGKSWYIIIQGSVDVVIYGKG----CVTSLYAGEDFGKLALVNNAPR 386
+KGE++F +GD S YI++ G V V + + L G+ FG+LAL+ PR
Sbjct: 5 KKGEVIFREGDPADSLYIVLSGKVKVYKLDEDGREQILAFLGPGDFFGELALLGGEPR 62
Score = 42.7 bits (101), Expect = 3e-05
Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 4/55 (7%)
Query: 391 SMVFHQGDEGKSWYIIIQGSVDVVIYGKG----CVTSLYAGEDFGKLALVNNAPS 441
++F +GD S YI++ G V V + + L G+ FG+LAL+ P
Sbjct: 8 EVIFREGDPADSLYIVLSGKVKVYKLDEDGREQILAFLGPGDFFGELALLGGEPR 62
>gnl|CDD|239890 cd04443, DEP_GPR155, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in GPR155-like proteins. GRP155-like
proteins, also known as PGR22, contain an N-terminal
permease domain, a central transmembrane region and a
C-terminal DEP domain. They are orphan receptors of the
class B G protein-coupled receptors. Their function is
unknown.
Length = 83
Score = 46.6 bits (111), Expect = 1e-06
Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Query: 141 GRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCIL 200
G + G +LV WL+ + LA R + LL+ GV+ H+ E FRD+ +L
Sbjct: 23 GLRTYKGVFCGCDLVSWLIEVG--LAQDRGEAVLYGRRLLQGGVLQHITNEHHFRDENLL 80
Query: 201 YNF 203
Y F
Sbjct: 81 YRF 83
Score = 28.1 bits (63), Expect = 3.5
Identities = 14/47 (29%), Positives = 21/47 (44%), Gaps = 2/47 (4%)
Query: 68 GRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYH 114
G + G +LV WL+ + LA R + LL+ GV+ H
Sbjct: 23 GLRTYKGVFCGCDLVSWLIEVG--LAQDRGEAVLYGRRLLQGGVLQH 67
>gnl|CDD|239886 cd04439, DEP_1_P-Rex, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain 1 found in P-Rex-like proteins. The P-Rex family
is the guanine-nucleotide exchange factor (GEF) for the
small GTPase Rac that contains an N-terminal RhoGEF
domain, two DEP and PDZ domains. Rac-GEF activity is
stimulated by phosphatidylinositol (3,4,5)-trisphosphate
(PtdIns(3,4,5)P3), a lipid second messenger, and by the
G beta-gamma subunits of heterotrimeric G proteins. The
DEP domains are not involved in mediating these stimuli,
but may be of importance for basal and stimulated levels
Rac-GEF activity.
Length = 81
Score = 44.5 bits (105), Expect = 6e-06
Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 2/61 (3%)
Query: 143 TIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILYN 202
+ +C G+E V WL+ + + G QALLE G+I+HV+ + F+++ +LY
Sbjct: 23 STFPKCFLGNEFVSWLLEIGEISKPEEGVNLG--QALLENGIIHHVSDKHQFKNEQVLYR 80
Query: 203 F 203
F
Sbjct: 81 F 81
Score = 34.9 bits (80), Expect = 0.014
Identities = 17/63 (26%), Positives = 31/63 (49%), Gaps = 4/63 (6%)
Query: 52 LLNDESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGV 111
++ + ++DR+ + +C G+E V WL+ + + G QALLE G+
Sbjct: 7 MMCKQGSLIKDRRRK--LSTFPKCFLGNEFVSWLLEIGEISKPEEGVNLG--QALLENGI 62
Query: 112 IYH 114
I+H
Sbjct: 63 IHH 65
>gnl|CDD|239888 cd04441, DEP_2_DEP6, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain 2 found in DEP6-like proteins. DEP6 proteins
contain two DEP and a PDZ domain. Their function is
unknown.
Length = 85
Score = 43.6 bits (103), Expect = 2e-05
Identities = 23/58 (39%), Positives = 31/58 (53%), Gaps = 2/58 (3%)
Query: 146 RRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILYNF 203
R GSE +DWL L A SR+ + + LLE G+I HV+ + F D +LY F
Sbjct: 30 ERTFVGSEFIDWL--LQEGEAESRREAVQLCRRLLEHGIIQHVSNKHHFFDSNLLYQF 85
Score = 35.5 bits (82), Expect = 0.011
Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 4/63 (6%)
Query: 52 LLNDESGTLRDRKTSGGRTIARRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGV 111
L++ E+ L+ R+ G + R GSE +DWL L A SR+ + + LLE G+
Sbjct: 11 LMSTENSILQVREEEGVKY--ERTFVGSEFIDWL--LQEGEAESRREAVQLCRRLLEHGI 66
Query: 112 IYH 114
I H
Sbjct: 67 IQH 69
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 41.4 bits (98), Expect = 7e-04
Identities = 32/114 (28%), Positives = 55/114 (48%), Gaps = 19/114 (16%)
Query: 206 DKEGSSSQATAQDI-----AEAEEHLEEALLALARRAPDAILRY---------ILRKQLK 251
+ + SS++ A+ I EAE +EALL A + I + R +L+
Sbjct: 26 EAKISSAEEEAKQIIEEAKKEAEALKKEALLE----AKEEIHKLRAEAERELKERRNELQ 81
Query: 252 KLQNRYLPREYML-HLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
+ + R L +E L + +E KEE +E+E+E + L+ E+LE++ EE
Sbjct: 82 RQEKRLLQKEETLDRKDESLEKKEESLEEKEKELAARQQQLEEKEEELEELIEE 135
>gnl|CDD|239895 cd04448, DEP_PIKfyve, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in fungal RhoGEF (GDP/GTP exchange factor)
PIKfyve-like proteins. PIKfyve contains N-terminal Fyve
finger and DEP domains, a central chaperonin-like domain
and a C-terminal PIPK (phosphatidylinositol phosphate
kinase) domain. PIKfyve-like proteins are important
phosphatidylinositol (3)-monophosphate
(PtdIns(3)P)-5-kinases, producing PtdIns(3,5)P2, which
plays a major role in multivesicular body (MVB) sorting
and control of retrograde traffic from the vacuole back
to the endosome and/or Golgi. PIKfyve itself has been
shown to be play a role in regulating
early-endosome-to-trans-Golgi network (TGN) retrograde
trafficking.
Length = 81
Score = 37.8 bits (88), Expect = 0.001
Identities = 21/54 (38%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 148 CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILY 201
C G ELV+WL + A +R + QALL+ G I V+ + FRD+ LY
Sbjct: 28 CILGKELVNWL--IRQGKAATRVQAIAIGQALLDAGWIECVSDDDLFRDEYALY 79
Score = 30.9 bits (70), Expect = 0.34
Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 75 CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVI 112
C G ELV+WL + A +R + QALL+ G I
Sbjct: 28 CILGKELVNWL--IRQGKAATRVQAIAIGQALLDAGWI 63
>gnl|CDD|239889 cd04442, DEP_1_DEP6, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain 1 found in DEP6-like proteins. DEP6 proteins
contain two DEP and a PDZ domain. Their function is
unknown.
Length = 82
Score = 38.0 bits (88), Expect = 0.001
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
Query: 148 CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGE-QAFRDKCILYNF 203
C G EL+DWL+ A R+ + Q LL+ +I+HV E + F+D + Y F
Sbjct: 28 CFVGKELIDWLIE--HKEASDRETAIKIMQKLLDHSIIHHVCDEHKEFKDAKLFYRF 82
Score = 29.1 bits (65), Expect = 1.5
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 75 CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYH 114
C G EL+DWL+ A R+ + Q LL+ +I+H
Sbjct: 28 CFVGKELIDWLIE--HKEASDRETAIKIMQKLLDHSIIHH 65
>gnl|CDD|239891 cd04444, DEP_PLEK2, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in pleckstrin 2-like proteins. Pleckstrin
2 is found in a wide variety of cell types, which
suggest a more general role in signaling than pleckstrin
1. Pleckstrin-like proteins contain a central DEP
domain, flanked by 2 PH (pleckstrin homology) domains.
Length = 109
Score = 37.6 bits (87), Expect = 0.003
Identities = 26/82 (31%), Positives = 35/82 (42%), Gaps = 16/82 (19%)
Query: 146 RRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVN------------GEQA 193
++ GS LVDWL+S S A SR + L+EE + V EQ
Sbjct: 29 KKTFLGSALVDWLIS--NSFAASRLEAVTLASMLMEENFLRPVGVRSMGAIRSGDLAEQF 86
Query: 194 FRDKCILYNFWQD--KEGSSSQ 213
D LY F + K+GSS +
Sbjct: 87 LDDSTALYTFAESYKKKGSSKE 108
Score = 33.4 bits (76), Expect = 0.10
Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 73 RRCASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEG 110
++ GS LVDWL+S S A SR + L+EE
Sbjct: 29 KKTFLGSALVDWLIS--NSFAASRLEAVTLASMLMEEN 64
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 40.5 bits (95), Expect = 0.003
Identities = 25/95 (26%), Positives = 49/95 (51%)
Query: 212 SQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE 271
S ++ E EE EE L + A L +L+ L++L+ + E + E +
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI 677
Query: 272 DKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELL 306
+ EE+ +E+ EE Q+ + L++ E+LE++ ++L
Sbjct: 678 ENEEQLEEKLEELEQLEEELEQLREELEELLKKLG 712
Score = 36.3 bits (84), Expect = 0.069
Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 11/101 (10%)
Query: 216 AQDIAEAEEHLEEA------LLALARRAPDAILRYILRKQLKKLQNRYL-----PREYML 264
+++ E E LEE L L R + + L + L E +
Sbjct: 273 EEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLE 332
Query: 265 HLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
LE ++E E E +E EE+ ++ K L+ ++LE+ EEL
Sbjct: 333 KLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEEL 373
Score = 34.4 bits (79), Expect = 0.27
Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 13/101 (12%)
Query: 218 DIAEAEEHLEEALLAL-----ARRAPDAI--LRYILRKQLKKLQNRYLPREYMLHLEREI 270
++ E LEE L +L +A L L K + + L + + LE ++
Sbjct: 604 ELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKV 663
Query: 271 E------DKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
E +E + E EE+ + + L++ E+LEQ+ EEL
Sbjct: 664 EELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREEL 704
Score = 33.6 bits (77), Expect = 0.43
Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 3/90 (3%)
Query: 246 LRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEE-EERQIGKPLDRTSEDLEQIYEE 304
+ ++++L+ + E + +E +E EEE KE ++ EE Q + + +++E + E
Sbjct: 183 AKAKIEELEGQL--SELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEER 240
Query: 305 LLHLKPLHHLSNSVRRELAGVVMFEAHPRK 334
L L+ ++ L + E K
Sbjct: 241 LAELEEEKERLEELKARLLEIESLELEALK 270
Score = 30.1 bits (68), Expect = 5.0
Identities = 23/93 (24%), Positives = 42/93 (45%), Gaps = 9/93 (9%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE 276
+ EE LE+ + AL R + +++L++L+ R L E + +I ++E
Sbjct: 221 IQEEQEEEELEQEIEALEERLAE---LEEEKERLEELKARLLEIESLELEALKIREEELR 277
Query: 277 EKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
E E EE + E LE++ E+ L+
Sbjct: 278 ELERLLEELE------EKIERLEELEREIEELE 304
Score = 29.7 bits (67), Expect = 6.4
Identities = 18/97 (18%), Positives = 41/97 (42%)
Query: 216 AQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
+++ L E + L + + +++++ Q + + LE + + EE
Sbjct: 187 IEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEE 246
Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLH 312
E++ EE + ++ + E L+ EEL L+ L
Sbjct: 247 EKERLEELKARLLEIESLELEALKIREEELRELERLL 283
Score = 29.3 bits (66), Expect = 8.7
Identities = 24/101 (23%), Positives = 43/101 (42%), Gaps = 14/101 (13%)
Query: 215 TAQDIAEAEEHLEEALLALARR-----APDAILRYILRKQLKKLQNRY---------LPR 260
++ E E LE+ L L + L+ L ++L+KL+N L
Sbjct: 495 EEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQL 554
Query: 261 EYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQI 301
+ + R++ED+ +E KE EE R + + E E++
Sbjct: 555 QQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERL 595
Score = 29.3 bits (66), Expect = 9.2
Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 15/92 (16%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE 276
+ + + E LEE A +L ++LK+L+ R LE+E+E E
Sbjct: 336 EKLEKLESELEELAEEKNELAK------LLEERLKELEERLE------ELEKELEKALER 383
Query: 277 EK---EEEEEERQIGKPLDRTSEDLEQIYEEL 305
K E +E ++ L E++++ EEL
Sbjct: 384 LKQLEEAIQELKEELAELSAALEEIQEELEEL 415
Score = 29.3 bits (66), Expect = 9.9
Identities = 26/116 (22%), Positives = 46/116 (39%), Gaps = 12/116 (10%)
Query: 219 IAEAEEHLEEA---LLALARRAPDAILRYI---LRKQLKKLQNRYLPREYMLHLEREIED 272
+ E ++ L+E L L L L + ++L++ LE ++
Sbjct: 595 LKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQA 654
Query: 273 KEEE---EKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGV 325
EE + EE E E + + E+ EQ+ E+L L+ L +R EL +
Sbjct: 655 ALEELEEKVEELEAEIR---RELQRIENEEQLEEKLEELEQLEEELEQLREELEEL 707
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 39.9 bits (94), Expect = 0.005
Identities = 35/134 (26%), Positives = 65/134 (48%), Gaps = 25/134 (18%)
Query: 200 LYNFWQDKEGSSSQATAQDI-----AEAEEHLEEALLALARRAPDAILRY---------I 245
L +K+ S++ A+ I EAE +EALL A + + +
Sbjct: 18 LRKRIAEKKLGSAEELAKRIIEEAKKEAETLKKEALLE----AKEEVHKLRAELERELKE 73
Query: 246 LRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIG---KPLDRTSEDLEQIY 302
R +L++L+ R L RE L+R++E +++E+ E++E+++ K LD E+LE++
Sbjct: 74 RRNELQRLERRLLQRE--ETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEELI 131
Query: 303 EELLHLKPLHHLSN 316
E + L +S
Sbjct: 132 AE--QREELERISG 143
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 39.8 bits (94), Expect = 0.005
Identities = 35/107 (32%), Positives = 56/107 (52%), Gaps = 16/107 (14%)
Query: 220 AEAEEHLEEALLALARRAPDAILRY---------ILRKQLKKLQNRYLPREYMLHLEREI 270
EAE +EALL A + I + R +L+KL+ R L +E L + E+
Sbjct: 49 KEAEAIKKEALLE----AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLEL 104
Query: 271 EDKEEEEKEEEEEE-RQIGKPLDRTSEDLEQIYEELLHLKPLHHLSN 316
+K EEE E++E+E Q + L++ E+LE++ EE L+ L +S
Sbjct: 105 LEKREEELEKKEKELEQKQQELEKKEEELEELIEE--QLQELERISG 149
Score = 33.6 bits (78), Expect = 0.39
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%)
Query: 219 IAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEK 278
I EAEE + +L A++ +AI K+ L+ + + E+E+ ++ E +
Sbjct: 33 IKEAEE-EAKRILEEAKKEAEAI------KKEALLEAKEEIHKLRNEFEKELRERRNELQ 85
Query: 279 EEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
+ E+ Q + LDR E LE+ EEL +
Sbjct: 86 KLEKRLLQKEENLDRKLELLEKREEELEKKE 116
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 39.7 bits (93), Expect = 0.006
Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 16/103 (15%)
Query: 215 TAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREY------MLHLER 268
+++ +A E L E L LRK+L++L+ +Y EY L L R
Sbjct: 624 LEEELDKAFEELAETEKRLEE----------LRKELEELEKKYSEEEYEELREEYLELSR 673
Query: 269 EIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPL 311
E+ E +E E+ +I K L++ E+LE+ + L+ L
Sbjct: 674 ELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKL 716
Score = 39.3 bits (92), Expect = 0.008
Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 4/92 (4%)
Query: 219 IAEAEEHLEEALLALARRAPDAILRYILR-KQLKKLQNRYLPREYMLHLEREIEDKEEEE 277
+ E EE L E L L +++ R K+L+ N YL + E+E+E +E+E
Sbjct: 565 LDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLE---LKDAEKELEREEKEL 621
Query: 278 KEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
K+ EEE + + L T + LE++ +EL L+
Sbjct: 622 KKLEEELDKAFEELAETEKRLEELRKELEELE 653
Score = 32.0 bits (73), Expect = 1.5
Identities = 28/99 (28%), Positives = 44/99 (44%), Gaps = 5/99 (5%)
Query: 207 KEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHL 266
KE + +++ + + LE+ L L R Y K K+ R R L
Sbjct: 331 KELEEKEERLEELKKKLKELEKRLEELEERH----ELYEEAKAKKEELERLKKRLTGLTP 386
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
E+ ++ EE EK +EE E +I K R E L++ +EL
Sbjct: 387 EKLEKELEELEKAKEEIEEEISKITARIGE-LKKEIKEL 424
Score = 30.4 bits (69), Expect = 4.1
Identities = 24/109 (22%), Positives = 44/109 (40%), Gaps = 23/109 (21%)
Query: 219 IAEAEEHLEEALLALARRAPD---AILRYILRKQLKKLQNRYLPREYMLHLE-------- 267
I E ++ +EE L + + YI +L + YL +
Sbjct: 268 IEELKKEIEE-LEEKVKELKELKEKAEEYI---KLSEFYEEYLDELREIEKRLSRLEEEI 323
Query: 268 -------REIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
+E+E+KEE +E +++ +++ K L+ E E +YEE K
Sbjct: 324 NGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHE-LYEEAKAKK 371
Score = 29.6 bits (67), Expect = 6.3
Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 10/100 (10%)
Query: 219 IAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLP----REYMLHLEREIEDKE 274
I E E+ LEE L + + LR++L+KL+ +E + LE+E+E E
Sbjct: 195 IKEKEKELEEVLREINEISS---ELPELREELEKLEKEVKELEELKEEIEELEKELESLE 251
Query: 275 EEEKEEEEEERQIGKPLDRT---SEDLEQIYEELLHLKPL 311
+++ EE+ R++ + ++ E+LE+ +EL LK
Sbjct: 252 GSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEK 291
Score = 29.6 bits (67), Expect = 7.6
Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 4/84 (4%)
Query: 246 LRKQLKKLQNRYLPREYML-HLEREIEDKEEEEKEEEEEERQIG--KPLDRTSEDLEQIY 302
L K+L+ L+ E + LE IE+ ++E +E EE+ +++ K L + Y
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFY 302
Query: 303 EELLH-LKPLHHLSNSVRRELAGV 325
EE L L+ + + + E+ G+
Sbjct: 303 EEYLDELREIEKRLSRLEEEINGI 326
Score = 29.3 bits (66), Expect = 9.2
Identities = 24/105 (22%), Positives = 41/105 (39%), Gaps = 20/105 (19%)
Query: 239 DAILRYILRKQLKKLQNRYLP-REYMLHLEREIEDKEEEEKEEEEEERQIG--------- 288
+ ++R IL L +N Y E + ++R IE E+ K E E I
Sbjct: 148 EKVVRQIL--GLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEV 205
Query: 289 --------KPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGV 325
L E+LE++ +E+ L+ L + +EL +
Sbjct: 206 LREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESL 250
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 38.0 bits (89), Expect = 0.015
Identities = 24/89 (26%), Positives = 38/89 (42%), Gaps = 8/89 (8%)
Query: 216 AQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
++ AE E E L A + + + K+ K E+ LE++IE+KEE
Sbjct: 251 QEERAEEEAERERMLEKQAED--EELEQENAEKRRMKRL------EHRRELEQQIEEKEE 302
Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
E EEE + G+ L + + EE
Sbjct: 303 RRAAEREEELEEGERLREEEAERQARIEE 331
Score = 33.0 bits (76), Expect = 0.46
Identities = 22/70 (31%), Positives = 31/70 (44%), Gaps = 7/70 (10%)
Query: 221 EAEEHLEEALLALA--RRAPDAILRYILRKQLKKLQNRYLPREYMLHLERE-IEDKEEEE 277
E EE L+E R + K+ K+ + R E + E IE KEEE+
Sbjct: 90 EYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLR----EEIDEFNEERIERKEEEK 145
Query: 278 KEEEEEERQI 287
+ E EEE +I
Sbjct: 146 EREREEELKI 155
Score = 33.0 bits (76), Expect = 0.57
Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 10/109 (9%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAILRY-ILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
++ + E L+E L + + R + K+ + + + E E++ +
Sbjct: 192 EEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQ 251
Query: 276 EEKEEEEEERQIGKPLDRTSED--LEQIYEELLHLKPLHHLSNSVRREL 322
EE+ EEE ER+ + L++ +ED LEQ E +K L H RREL
Sbjct: 252 EERAEEEAERE--RMLEKQAEDEELEQENAEKRRMKRLEH-----RREL 293
Score = 31.4 bits (72), Expect = 1.4
Identities = 23/98 (23%), Positives = 42/98 (42%), Gaps = 13/98 (13%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLERE-----IE 271
EE E R R +L++Q+++ + R ERE IE
Sbjct: 52 LKALAEEEERERKRKEERRE-----GRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIE 106
Query: 272 DKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
+EE++ E +E+R+ K + E++++ EE + K
Sbjct: 107 RIQEEDEAEAQEKREKQK---KLREEIDEFNEERIERK 141
Score = 30.3 bits (69), Expect = 3.8
Identities = 23/96 (23%), Positives = 42/96 (43%), Gaps = 18/96 (18%)
Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIED 272
+ + E E ++E + +A L ++ K+ + R RE L+ +IE+
Sbjct: 30 KRIKAEEKEEERRIDEMMEEERLKA----LAEEEERERKRKEER---REGRAVLQEQIEE 82
Query: 273 KE----EEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
+E EE +E +E Q+ E +E+I EE
Sbjct: 83 REKRRQEEYEERLQEREQM-------DEIIERIQEE 111
Score = 29.5 bits (67), Expect = 6.7
Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 6/70 (8%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE 276
+++ E E L A + R A I K+ K + + R +E E E
Sbjct: 4 EELRELNEKLRAAKVNKERDAQ------IEEKKRIKAEEKEEERRIDEMMEEERLKALAE 57
Query: 277 EKEEEEEERQ 286
E+E E + ++
Sbjct: 58 EEERERKRKE 67
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 37.4 bits (87), Expect = 0.030
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 11/95 (11%)
Query: 219 IAEAEEHLEEALLALARRAPDAI--LRYILRKQLKKLQNRY---LPREYML---HLEREI 270
+ EAEEH+ AL ++ R P + L+ L QL+ L+ Y Y L +++ +
Sbjct: 199 LEEAEEHMI-ALRSIMERIPSLLAELQTELPGQLQDLKAGYRDMKEEGYHLEHVNIDSRL 257
Query: 271 EDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
E +E+ E E Q+ LD E+L I E++
Sbjct: 258 ERLKEQLVENSELLTQL--ELDEAEEELGLIQEKI 290
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 37.3 bits (86), Expect = 0.031
Identities = 15/25 (60%), Positives = 20/25 (80%)
Query: 267 EREIEDKEEEEKEEEEEERQIGKPL 291
E E E++EEEE+EEEEEE + +PL
Sbjct: 870 EEEEEEEEEEEEEEEEEEEENEEPL 894
Score = 33.4 bits (76), Expect = 0.53
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 267 EREIEDKEEEEKEEEEEERQ 286
E E E++EEEE+EEEEEE +
Sbjct: 866 EEEEEEEEEEEEEEEEEEEE 885
Score = 32.3 bits (73), Expect = 1.1
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 267 EREIEDKEEEEKEEEEEERQ 286
E E E++EEEE+EEEEEE +
Sbjct: 867 EEEEEEEEEEEEEEEEEEEE 886
Score = 32.3 bits (73), Expect = 1.1
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 267 EREIEDKEEEEKEEEEEERQ 286
E E E++EEEE+EEEEEE +
Sbjct: 865 EEEEEEEEEEEEEEEEEEEE 884
Score = 31.9 bits (72), Expect = 1.5
Identities = 17/41 (41%), Positives = 26/41 (63%), Gaps = 5/41 (12%)
Query: 269 EIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
+ E++EEEE+EEEEEE + + E+ E+ EE L L+
Sbjct: 862 DSEEEEEEEEEEEEEEEE-----EEEEEEEEEENEEPLSLE 897
Score = 31.5 bits (71), Expect = 1.9
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 267 EREIEDKEEEEKEEEEEERQ 286
E E E++EEEE+EEEEEE +
Sbjct: 868 EEEEEEEEEEEEEEEEEEEE 887
Score = 31.5 bits (71), Expect = 2.1
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 267 EREIEDKEEEEKEEEEEERQ 286
E E E++EEEE+EEEEEE +
Sbjct: 869 EEEEEEEEEEEEEEEEEEEE 888
Score = 31.1 bits (70), Expect = 2.3
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 267 EREIEDKEEEEKEEEEEERQ 286
E E E++EEEE+EEEEEE +
Sbjct: 864 EEEEEEEEEEEEEEEEEEEE 883
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 37.4 bits (87), Expect = 0.032
Identities = 20/93 (21%), Positives = 45/93 (48%), Gaps = 8/93 (8%)
Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIED 272
+ + E E+ LEE + + LR +++++ +L+ + LER I +
Sbjct: 261 SELEKRLEEIEQLLEELNKKIKDLGEEEQLR--VKEKIGELEAE------IASLERSIAE 312
Query: 273 KEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
KE E ++ EE ++ +D+ ++E++ E+
Sbjct: 313 KERELEDAEERLAKLEAEIDKLLAEIEELEREI 345
Score = 32.7 bits (75), Expect = 0.94
Identities = 24/104 (23%), Positives = 49/104 (47%), Gaps = 16/104 (15%)
Query: 218 DIAEAEEHLEEALLALA---------RRAPDAILRY-ILRKQLKKLQNRYLPREYMLH-- 265
++ E EE++E L + RR + RY L K+ ++ + L +E
Sbjct: 178 ELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALER 237
Query: 266 ----LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
+ER++ EEE ++ EE ++ K L+ + LE++ +++
Sbjct: 238 QKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKI 281
Score = 32.0 bits (73), Expect = 1.4
Identities = 27/94 (28%), Positives = 38/94 (40%), Gaps = 16/94 (17%)
Query: 218 DIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEE 277
E EE LEE AL L +L L+ E L RE+E K EE
Sbjct: 862 KKEELEEELEELEAALRD----------LESRLGDLKKERDELEAQL---RELERKIEEL 908
Query: 278 KEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPL 311
+ + E++R+ L LE + EEL ++
Sbjct: 909 EAQIEKKRKR---LSELKAKLEALEEELSEIEDP 939
Score = 29.7 bits (67), Expect = 8.2
Identities = 18/87 (20%), Positives = 34/87 (39%), Gaps = 1/87 (1%)
Query: 235 RRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRT 294
+A LR I +K + + + + L+ + D +E+ K E+E + +
Sbjct: 807 VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL 866
Query: 295 SEDLEQIYEELLHL-KPLHHLSNSVRR 320
E+LE++ L L L L
Sbjct: 867 EEELEELEAALRDLESRLGDLKKERDE 893
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 34.6 bits (79), Expect = 0.036
Identities = 16/37 (43%), Positives = 20/37 (54%)
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYE 303
+ E E+ E EE+E EEEE G R +ED E E
Sbjct: 60 DDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAE 96
>gnl|CDD|236969 PRK11753, PRK11753, DNA-binding transcriptional dual regulator Crp;
Provisional.
Length = 211
Score = 35.3 bits (82), Expect = 0.055
Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 391 SMVFHQGDEGKSWYIIIQGSVDVVIY---GKGCVTS-LYAGEDFGKLAL 435
S + H G++ ++ Y I++GSV V+I GK + S L G+ G+L L
Sbjct: 29 STLIHAGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGL 77
Score = 35.3 bits (82), Expect = 0.069
Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 4/47 (8%)
Query: 338 LFHQGDEGKSWYIIIQGSVDVVIY---GKGCVTS-LYAGEDFGKLAL 380
L H G++ ++ Y I++GSV V+I GK + S L G+ G+L L
Sbjct: 31 LIHAGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGL 77
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 36.2 bits (84), Expect = 0.062
Identities = 27/102 (26%), Positives = 47/102 (46%), Gaps = 11/102 (10%)
Query: 211 SSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH-LERE 269
+ +++ EAE+ +EE L LR++L++LQ L + + LE E
Sbjct: 254 ELEELQEELEEAEKEIEELKSELEE----------LREELEELQEELLELKEEIEELEGE 303
Query: 270 IEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPL 311
I E +E E E ++ + L+ E +E + EEL + L
Sbjct: 304 ISLLRERLEELENELEELEERLEELKEKIEALKEELEERETL 345
Score = 34.7 bits (80), Expect = 0.24
Identities = 28/122 (22%), Positives = 56/122 (45%), Gaps = 9/122 (7%)
Query: 205 QDKEGSSSQATAQD-IAEAEEHLEEALLALARRAPDAILRYI--LRKQLKKLQNRYLP-R 260
Q ++ + ++ + E+ LEE RR + + R + L+++L L+ +
Sbjct: 682 QLEKLEEELKSLKNELRSLEDLLEEL-----RRQLEELERQLEELKRELAALEEELEQLQ 736
Query: 261 EYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRR 320
+ LE E+E+ EEE +E +E ++ + L+ E L ++ EE+ L+
Sbjct: 737 SRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796
Query: 321 EL 322
EL
Sbjct: 797 EL 798
Score = 32.4 bits (74), Expect = 1.0
Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 11/103 (10%)
Query: 213 QATAQDIAEAE---EHLEEALLALARRAPDAILRYI-LRKQLKKLQNRYLPREYMLH--- 265
+ T +++ E E LE+ L L R+A A RY L+ +L++L+ L +
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKA-ERYQELKAELRELELALLLAKLKELRKE 240
Query: 266 ---LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
LE E+ EEE +E +EE + K ++ +LE++ EEL
Sbjct: 241 LEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283
Score = 31.6 bits (72), Expect = 1.9
Identities = 22/94 (23%), Positives = 41/94 (43%), Gaps = 11/94 (11%)
Query: 217 QDIAEAEEHLEEALLALARR-APDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
+ + E EE LE ALA+ L + ++L+ LE E+E+ E
Sbjct: 758 ERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEE----------LEEELEEAER 807
Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
E E + + +R +++E++ EE+ L+
Sbjct: 808 RLDALERELESLEQRRERLEQEIEELEEEIEELE 841
Score = 31.2 bits (71), Expect = 2.3
Identities = 31/122 (25%), Positives = 50/122 (40%), Gaps = 18/122 (14%)
Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRY-----ILRKQLKKLQNRYLPREYMLHLE 267
+ ++ EA+E LEE L AL + L +L +++N + L+
Sbjct: 350 EQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNE------LEELK 403
Query: 268 REIEDKEEE-EKEEEEEERQIGK------PLDRTSEDLEQIYEELLHLKPLHHLSNSVRR 320
REIE EE E+ E E + L+ +LE++ EEL L+ +
Sbjct: 404 REIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLK 463
Query: 321 EL 322
EL
Sbjct: 464 EL 465
Score = 30.8 bits (70), Expect = 3.2
Identities = 25/105 (23%), Positives = 50/105 (47%), Gaps = 10/105 (9%)
Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH------- 265
+ ++ + E L + + + L ++L++ + R E L
Sbjct: 767 LESLEEALAKLKEEIEELEEKRQALQEELE--ELEEELEEAERRLDALERELESLEQRRE 824
Query: 266 -LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
LE+EIE+ EEE +E EE+ ++ + L+ ++LE++ EEL L+
Sbjct: 825 RLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELE 869
Score = 30.5 bits (69), Expect = 4.7
Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 10/105 (9%)
Query: 219 IAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEK 278
+ E +E LEE D + L ++ ++L+ RE LE E+ + +EE +
Sbjct: 858 LEELKEELEELEAEKEELE-DELKE--LEEEKEELEEEL--RE----LESELAELKEEIE 908
Query: 279 EEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELA 323
+ E ++ L+R +L ++ EEL L + RE+
Sbjct: 909 KLRERLEELEAKLERLEVELPELEEELEEEYED-TLETELEREIE 952
Score = 29.7 bits (67), Expect = 6.9
Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 4/90 (4%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAIL-RYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
+++ E E L+E LL L + +LR++L++L+N E + E+++K E
Sbjct: 277 EELREELEELQEELLELKEEIEELEGEISLLRERLEELENE---LEELEERLEELKEKIE 333
Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
KEE EE + + L++ +LE+ EEL
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEEL 363
Score = 29.3 bits (66), Expect = 9.0
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 7/90 (7%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH-LEREIEDKEE 275
Q + E E LEE L RR L ++L+ L+ R E + LE EIE+ EE
Sbjct: 789 QALQEELEELEEELEEAERRLDA------LERELESLEQRRERLEQEIEELEEEIEELEE 842
Query: 276 EEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
+ E EEE ++ K L+ E+LE++ E
Sbjct: 843 KLDELEEELEELEKELEELKEELEELEAEK 872
>gnl|CDD|223736 COG0664, Crp, cAMP-binding proteins - catabolite gene activator and
regulatory subunit of cAMP-dependent protein kinases
[Signal transduction mechanisms].
Length = 214
Score = 35.2 bits (81), Expect = 0.065
Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 333 RKGEILFHQGDEGKSWYIIIQGSVDVVIY---GKGCVTSLYAGED-FGKLALVNNAPR 386
KGE+LF +G+E S YII+ G V + G+ + D FG+LAL+ PR
Sbjct: 29 PKGEVLFTEGEEADSLYIILSGIVKLYANTEDGREIILGFLGPGDFFGELALLGGDPR 86
Score = 28.6 bits (64), Expect = 9.3
Identities = 12/53 (22%), Positives = 20/53 (37%), Gaps = 3/53 (5%)
Query: 365 CVTSLYAGEDFGKLALVNNAPRLIIT---SMVFHQGDEGKSWYIIIQGSVDVV 414
L L L+ + ++F +G+E S YII+ G V +
Sbjct: 3 KENPLLNLLPSELLELLALKLEVRKLPKGEVLFTEGEEADSLYIILSGIVKLY 55
>gnl|CDD|221528 pfam12320, SbcD_C, Type 5 capsule protein repressor C-terminal
domain. This domain is found in bacteria and archaea.
This domain is about 90 amino acids in length. This
domain is found associated with pfam00149. SbcD works in
complex with SbdC (SbcDC) which is a transcription
regulator. It down-regulates transcription of arl and
mgr to inhibit type 5 capsule protein production. It
acts as part of the SOS pathway of bacteria.
Length = 97
Score = 33.4 bits (77), Expect = 0.066
Identities = 27/98 (27%), Positives = 44/98 (44%), Gaps = 14/98 (14%)
Query: 217 QDIAEAEEHLEEALLALA-RRAPDAILRYIL---------RKQLKKLQNRYLPREYMLHL 266
+ + + LEE L AL + DA L L R++L++ LP +L +
Sbjct: 4 RPLRRIKGSLEELLAALEELKDRDAYLEVELTDEEPLPDLRERLREA----LPNIELLRI 59
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
R+ E +E +EEEEEE + PL+ ++ E
Sbjct: 60 RRKRERRERALEEEEEEELEELSPLELFERFYKEETGE 97
>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972). This
family consists of several hypothetical bacterial
sequences. The function of this family is unknown.
Length = 106
Score = 33.4 bits (77), Expect = 0.069
Identities = 28/96 (29%), Positives = 52/96 (54%), Gaps = 17/96 (17%)
Query: 219 IAEAEEHLEEALLALARRAPDAILRYILRKQLKKL--QNRYLPREYMLHLEREIEDKEEE 276
++E E+ L + L L L++QL +L +N L E HL +E+ E+E
Sbjct: 10 LSELEQQLNQLLAELGE----------LKQQLAELLEENAELRIE-NEHLRERLEELEQE 58
Query: 277 EKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLH 312
+KE+E+ ++++G+ D +L ++Y+E H+ LH
Sbjct: 59 QKEKEKGKKKLGEGYD----NLARLYQEGFHICNLH 90
>gnl|CDD|224450 COG1533, SplB, DNA repair photolyase [DNA replication,
recombination, and repair].
Length = 297
Score = 35.4 bits (82), Expect = 0.082
Identities = 21/73 (28%), Positives = 30/73 (41%), Gaps = 2/73 (2%)
Query: 220 AEAEEHLEEALLALARRAPDAILRY--ILRKQLKKLQNRYLPREYMLHLEREIEDKEEEE 277
E E LE A A AR LR + ++ K+ R P Y L+LE + E + +
Sbjct: 199 EELERILEAAAEAGARVVVYGTLRLRLDILRRFKEYLKRIEPERYRLYLELDREKRRGKF 258
Query: 278 KEEEEEERQIGKP 290
E +R G
Sbjct: 259 YAPEYGKRVYGDG 271
>gnl|CDD|226251 COG3728, XtmA, Phage terminase, small subunit [DNA replication,
recombination, and repair].
Length = 179
Score = 34.8 bits (80), Expect = 0.085
Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 2/69 (2%)
Query: 241 ILRYILRKQLKKLQNRYLPREYMLHLEREI--EDKEEEEKEEEEEERQIGKPLDRTSEDL 298
I YI KQ + ++ +P +L + EI D E E + ++ + P ++
Sbjct: 47 IKAYIDEKQCETVEEDEVPAYELLKVLTEIALGDIETEAIVQPKDTLPVLTPPVVIAQLE 106
Query: 299 EQIYEELLH 307
E I EEL
Sbjct: 107 EAIEEELED 115
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 34.7 bits (80), Expect = 0.11
Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 2/102 (1%)
Query: 207 KEGSSSQATAQDIAEAEEHLEEA--LLALARRAPDAILRYILRKQLKKLQNRYLPREYML 264
KE + TA ++ E + EE LL + R + + + R +
Sbjct: 26 KELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVD 85
Query: 265 HLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELL 306
E+ EEE +++E E RQ+ + L E E+ +ELL
Sbjct: 86 EATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELL 127
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 35.3 bits (82), Expect = 0.12
Identities = 23/104 (22%), Positives = 52/104 (50%), Gaps = 8/104 (7%)
Query: 216 AQDIAEA---EEHLEEALLALARRAPDAILRY----ILRKQLKKLQNRYLPREYMLHLER 268
Q++ E E+ L++AL L + L+ + ++++K Q Y RE + +++
Sbjct: 178 LQEVLETVNIEKRLKKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKK 237
Query: 269 EIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLH 312
E+ +++++ E E+ + ++ + L E + I +EL L L
Sbjct: 238 ELGIEKDDKDELEKLKEKL-EELKLPEEVKKVIEKELTKLSLLE 280
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.4 bits (82), Expect = 0.13
Identities = 30/112 (26%), Positives = 46/112 (41%), Gaps = 10/112 (8%)
Query: 204 WQDKEGSSSQATAQDIA--EAEEHLEEAL--LALARRAPDAILRYI--LRKQLKKL---Q 254
+ E + + A E + LEE L R+ + + R I LRK L +L
Sbjct: 683 EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV 742
Query: 255 NRYLPREYMLHLER-EIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
+ R L E E+E + EE +E EE + + E+LE E+L
Sbjct: 743 EQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794
Score = 31.6 bits (72), Expect = 1.7
Identities = 22/96 (22%), Positives = 38/96 (39%), Gaps = 8/96 (8%)
Query: 213 QATAQDIAEAEEHLEEALLALA--RRAPDAILRYILRKQLKKLQNRYLPREYMLH-LERE 269
+ + I E E +E + + L +L+ L N E L L E
Sbjct: 841 EDLEEQIEELSEDIESLAAEIEELEELIEE-----LESELEALLNERASLEEALALLRSE 895
Query: 270 IEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
+E+ EE +E E + ++ + L+ E L Q+ L
Sbjct: 896 LEELSEELRELESKRSELRRELEELREKLAQLELRL 931
Score = 31.6 bits (72), Expect = 1.7
Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 226 LEEALLALARRAPDAILRYI-LRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEE 284
LE L +L R+A A RY L+ +L++L+ L L E+E+ +EE KE EEE
Sbjct: 198 LERQLKSLERQAEKAE-RYKELKAELRELELALLVLRL-EELREELEELQEELKEAEEEL 255
Query: 285 RQIGKPLDRTSEDLEQI 301
++ L E LE++
Sbjct: 256 EELTAELQELEEKLEEL 272
Score = 31.2 bits (71), Expect = 2.8
Identities = 17/43 (39%), Positives = 28/43 (65%)
Query: 263 MLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
+L REIE+ EE+ +E EE+ ++ K L ++LE++ EEL
Sbjct: 672 ILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEEL 714
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 33.4 bits (77), Expect = 0.13
Identities = 16/42 (38%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 268 REIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
E E ++EE++ +EE+E++ GK L + E+ E+ EEL K
Sbjct: 19 EEKEREKEEKERKEEKEKEWGKGLVQ-KEEREKRLEELEKAK 59
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 34.6 bits (80), Expect = 0.14
Identities = 28/116 (24%), Positives = 46/116 (39%), Gaps = 20/116 (17%)
Query: 199 ILYNFWQDKEGSSS-------QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLK 251
+L F KE TA++ A E + A +LR +++ +
Sbjct: 177 VLQEFLNSKEAVEEAILQTDQALTAKEKAIEAERAKAEAA----EAEQELLREKQKEEEQ 232
Query: 252 KLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLH 307
++ + R Y H+++ IE K E E+E+ E +R E Q EELL
Sbjct: 233 MMEAQ--ERSYQEHVKQLIE-KMEAEREKLLAE------QERMLEHKLQEQEELLK 279
>gnl|CDD|173171 PRK14708, PRK14708, flagellin; Provisional.
Length = 888
Score = 34.9 bits (80), Expect = 0.19
Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 6/80 (7%)
Query: 8 ISARIRTITFNLSVTNTRVVHYTTTLQSSPSVPMARAGW--VLRTLLLNDESG----TLR 61
IS + TI+ N T + V T+QSS + G +L+ L L +G TL
Sbjct: 412 ISNGVATISTNAGQTASIVTAGIVTIQSSTGADLNVTGPTDLLKNLGLTTSTGSGPLTLT 471
Query: 62 DRKTSGGRTIARRCASGSEL 81
++T+ G T+ A GS L
Sbjct: 472 KQRTTSGTTMGTLIADGSTL 491
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 31.8 bits (73), Expect = 0.25
Identities = 12/18 (66%), Positives = 16/18 (88%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E++ E++EEEEKEE EEE
Sbjct: 79 EKKEEEEEEEEKEESEEE 96
Score = 31.1 bits (71), Expect = 0.54
Identities = 10/18 (55%), Positives = 15/18 (83%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E + E++EEEE++EE EE
Sbjct: 78 EEKKEEEEEEEEKEESEE 95
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 31.4 bits (72), Expect = 0.29
Identities = 9/18 (50%), Positives = 16/18 (88%)
Query: 271 EDKEEEEKEEEEEERQIG 288
E++++EE+EEEEE+ +G
Sbjct: 67 EEEKKEEEEEEEEDDDMG 84
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 31.8 bits (72), Expect = 0.32
Identities = 11/14 (78%), Positives = 13/14 (92%)
Query: 271 EDKEEEEKEEEEEE 284
E K+EE+KEEEEEE
Sbjct: 89 EAKKEEKKEEEEEE 102
Score = 30.6 bits (69), Expect = 0.82
Identities = 10/18 (55%), Positives = 14/18 (77%)
Query: 271 EDKEEEEKEEEEEERQIG 288
KEE+++EEEEEE +G
Sbjct: 90 AKKEEKKEEEEEEEDDLG 107
>gnl|CDD|239896 cd04449, DEP_DEPDC5-like, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in DEPDC5-like proteins. DEPDC5, in human
also known as KIAA0645, is a DEP domain containing
protein of unknown function.
Length = 83
Score = 31.1 bits (71), Expect = 0.38
Identities = 18/49 (36%), Positives = 27/49 (55%), Gaps = 1/49 (2%)
Query: 148 CASGSELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRD 196
C GSE V WL++ + +R+ + Q L+ EG+I HV+G F D
Sbjct: 29 CFIGSEAVSWLINNFEDV-DTREEAVELGQELMNEGLIEHVSGRHPFLD 76
>gnl|CDD|183876 PRK13169, PRK13169, DNA replication intiation control protein YabA;
Reviewed.
Length = 110
Score = 31.4 bits (72), Expect = 0.44
Identities = 16/53 (30%), Positives = 30/53 (56%), Gaps = 8/53 (15%)
Query: 264 LHLE----REIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLH 312
L LE RE ++ E E+ +E++++ G+ D +L ++Y+E H+ LH
Sbjct: 41 LRLENDKLRERLEELEAEEPAKEKKKKEGEGKD----NLARLYQEGFHICNLH 89
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 31.3 bits (71), Expect = 0.50
Identities = 14/18 (77%), Positives = 16/18 (88%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E+E E K+EEEKEEEEEE
Sbjct: 80 EKEEEKKKEEEKEEEEEE 97
Score = 30.2 bits (68), Expect = 1.1
Identities = 11/18 (61%), Positives = 16/18 (88%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E + E+KEEE+K+EEE+E
Sbjct: 75 EEKAEEKEEEKKKEEEKE 92
Score = 29.0 bits (65), Expect = 2.5
Identities = 10/18 (55%), Positives = 15/18 (83%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E+ E +EE++KEEE+EE
Sbjct: 76 EKAEEKEEEKKKEEEKEE 93
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 31.0 bits (70), Expect = 0.55
Identities = 17/43 (39%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHL 308
LE E ++ +EE+EEEEEE G+ D E++ EE+ L
Sbjct: 55 LEAEESEENDEEEEEEEEEEDEGEIEY--VSDDEELEEEIEDL 95
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 30.8 bits (70), Expect = 0.62
Identities = 12/18 (66%), Positives = 15/18 (83%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E E E++EEEE+EE EEE
Sbjct: 78 EEEEEEEEEEEEEESEEE 95
Score = 30.8 bits (70), Expect = 0.68
Identities = 12/18 (66%), Positives = 15/18 (83%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E E E++EEEE+EEE EE
Sbjct: 77 EEEEEEEEEEEEEEESEE 94
Score = 29.2 bits (66), Expect = 2.4
Identities = 11/18 (61%), Positives = 14/18 (77%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E++EEEE+EEEEEE
Sbjct: 72 AAAAEEEEEEEEEEEEEE 89
Score = 29.2 bits (66), Expect = 2.5
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 267 EREIEDKEEEEKEEEEEERQIGKPL 291
E E E++EEEE+EEEE E + L
Sbjct: 76 EEEEEEEEEEEEEEEESEEEAMAGL 100
>gnl|CDD|219996 pfam08738, Gon7, Gon7 family. In S. cerevisiae Gon7 is a member of
the KEOPS protein complex. A protein complex proposed to
be involved in transcription and promoting telomere
uncapping and telomere elongation.
Length = 97
Score = 30.5 bits (69), Expect = 0.63
Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 4/45 (8%)
Query: 246 LRKQLKKLQ---NRYLPREYMLHLEREIEDKEEEEKEEEEEERQI 287
LR L LQ N +L E M +++ K EKE E++E +
Sbjct: 52 LRASLTTLQDDINEFLT-ERMEEDKKKAAAKGRGEKEIEDKEEEN 95
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 32.7 bits (75), Expect = 0.67
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 267 EREIEDKEEEEKEEEEEERQIGKPL 291
E+E E++EEEE+E EE E + G PL
Sbjct: 352 EQEDEEEEEEEEEPEEPEPEEGPPL 376
Score = 29.6 bits (67), Expect = 6.7
Identities = 13/18 (72%), Positives = 15/18 (83%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E E ED+EEEE+EEE EE
Sbjct: 350 EEEQEDEEEEEEEEEPEE 367
>gnl|CDD|212745 cd11811, SH3_CHK, Src Homology 3 domain of CSK homologous kinase.
CHK is also referred to as megakaryocyte-associated
tyrosine kinase (Matk). It inhibits Src kinases using a
noncatalytic mechanism by simply binding to them. As a
negative regulator of Src kinases, Chk may play
important roles in cell proliferation, survival, and
differentiation, and consequently, in cancer development
and progression. To inhibit Src kinases that are
anchored to the plasma membrane, CHK is translocated to
the membrane via binding to specific transmembrane
proteins, G-proteins, or adaptor proteins near the
membrane. CHK also plays a role in neural
differentiation in a manner independent of Src by
enhancing MAPK activation via Ras-mediated signaling. It
is a cytoplasmic (or nonreceptor) tyr kinase containing
the Src homology domains, SH3 and SH2, N-terminal to the
catalytic tyr kinase domain. SH3 domains are protein
interaction domains that bind to proline-rich ligands
with moderate affinity and selectivity, preferentially
to PxxP motifs. They play versatile and diverse roles in
the cell including the regulation of enzymes, changing
the subcellular localization of signaling pathway
components, and mediating the formation of multiprotein
complex assemblies.
Length = 59
Score = 29.8 bits (67), Expect = 0.67
Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 8/26 (30%)
Query: 332 PRKGEILFHQGD--------EGKSWY 349
P+ GE+ FH+GD E K WY
Sbjct: 13 PKPGELAFHKGDIVTIVETCERKGWY 38
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 32.7 bits (75), Expect = 0.67
Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 11/74 (14%)
Query: 247 RKQLKKLQNRYLPREYMLH------LEREIEDKE-----EEEKEEEEEERQIGKPLDRTS 295
++L L+N P +H ERE++ K E+EK EE E+ + + +
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348
Query: 296 EDLEQIYEELLHLK 309
LE+I ++L L+
Sbjct: 349 SKLEEIQKKLEDLE 362
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 32.0 bits (73), Expect = 0.70
Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 3/56 (5%)
Query: 245 ILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQ 300
+ R++ + L ++ E E+ EEE EEEEEE Q G + ED +
Sbjct: 64 LDREEAR---LELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDS 116
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 32.4 bits (74), Expect = 0.72
Identities = 26/93 (27%), Positives = 40/93 (43%), Gaps = 14/93 (15%)
Query: 200 LYNFWQDKEGSSSQATAQDIAEAEEHLEEALLAL------ARRAPDAILRYILRKQLKKL 253
Y + E Q + EE +A+ L ++R + + R ++LK+L
Sbjct: 107 NYEADKLDEE---QEERVEKEREEELAGDAMKKLENRTADSKREMEVLERL---EELKEL 160
Query: 254 QNRYLP--REYMLHLEREIEDKEEEEKEEEEEE 284
Q+R ML E KEEEE+EEE+E
Sbjct: 161 QSRRADVDVNSMLEALFRREKKEEEEEEEEDEA 193
Score = 30.9 bits (70), Expect = 2.0
Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 1/77 (1%)
Query: 260 REYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVR 319
R Y E +++ E++ EEE K L+ + D ++ E L L+ L L S R
Sbjct: 106 RNYEADKLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKEL-QSRR 164
Query: 320 RELAGVVMFEAHPRKGE 336
++ M EA R+ +
Sbjct: 165 ADVDVNSMLEALFRREK 181
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 32.5 bits (74), Expect = 0.74
Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 14/76 (18%)
Query: 246 LRKQLKKLQ--NRYLPREYMLHLEREIEDKEEEE--------KEEEEEERQI---GKPLD 292
L+++LK L+ N +L R HL+ E EE+E K+ E QI + L
Sbjct: 166 LQEKLKLLEEENEHL-RSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLSEELA 224
Query: 293 RTSEDLEQIYEELLHL 308
+ +EDLE+ EE+ HL
Sbjct: 225 KKTEDLERQQEEITHL 240
>gnl|CDD|226873 COG4467, COG4467, Regulator of replication initiation timing
[Replication, recombination, and repair].
Length = 114
Score = 30.9 bits (70), Expect = 0.75
Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 10/90 (11%)
Query: 226 LEEALLALARRAPDAILRYILRKQLKKL--QNRYLPREYMLHLEREIEDKEEEEKEEEEE 283
LEE L L L++ L L +N L E L RE + EK ++
Sbjct: 13 LEEQLGVLLAELGG------LKQHLGSLVEENTALRLE-NEKL-RERLGEPTLEKTAVKK 64
Query: 284 ERQIGKPLDRTSEDLEQIYEELLHLKPLHH 313
E+ K ++L ++Y+E H+ +H+
Sbjct: 65 EKPAVKKKGEGYDNLARLYQEGFHICNVHY 94
>gnl|CDD|239887 cd04440, DEP_2_P-Rex, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain 2 found in P-Rex-like proteins. The P-Rex family
is the guanine-nucleotide exchange factor (GEF) for the
small GTPase Rac that contains an N-terminal RhoGEF
domain, two DEP and PDZ domains. Rac-GEF activity is
stimulated by phosphatidylinositol (3,4,5)-trisphosphate
(PtdIns(3,4,5)P3), a lipid second messenger, and the G
beta-gamma subunits of heterotrimeric G proteins. The
DEP domains are not involved in mediating these stimuli,
but may be of importance for basal and stimulated levels
Rac-GEF activity.
Length = 93
Score = 30.3 bits (68), Expect = 0.79
Identities = 16/55 (29%), Positives = 29/55 (52%), Gaps = 2/55 (3%)
Query: 152 SELVDWLMSLAPSLAVSRQITTGMWQALLEEGVIYHVNGEQAFRDKCILYNFWQD 206
S+LVDWL++ + G+ L G ++HV + F+D+ +L+ F+ D
Sbjct: 41 SKLVDWLLAQGDCRTREEAVILGV--GLCNNGFMHHVLEKSEFKDEPLLFRFYAD 93
>gnl|CDD|239894 cd04447, DEP_BRCC3, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in BBRC3-like proteins. BBRC3, also known
as DEPDC1B, is a DEP containing protein of unknown
function.
Length = 92
Score = 30.2 bits (68), Expect = 0.81
Identities = 21/65 (32%), Positives = 30/65 (46%), Gaps = 8/65 (12%)
Query: 147 RCASGSELVDWLMSLAPSLA-----VSRQITTGMWQALLEEGVIYHVN---GEQAFRDKC 198
C + SE VDWL L S + V+RQ T + + L+ VI + G++ D
Sbjct: 28 NCFTASEAVDWLHELLRSNSNFGPEVTRQQTVQLLKKFLKNHVIEDIKGRWGKEDLEDNN 87
Query: 199 ILYNF 203
LY F
Sbjct: 88 HLYRF 92
>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated. This domain
appears in eukaryotes as well as bacteria and tends to
be found near the C-terminus of the metalloprotease M16C
(pfam05193).
Length = 248
Score = 31.8 bits (73), Expect = 0.90
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 271 EDKEEEEKEEEEEE-RQIGKPLDRTSEDLEQIYEELLHLK 309
E EE+ EEE E+ + L + ED E+I E L L+
Sbjct: 5 ETLEEKLNEEEREKLEKKKSSL--SEEDKEKIIERGLELE 42
>gnl|CDD|241266 cd01233, PH_KIFIA_KIFIB, KIFIA and KIFIB protein pleckstrin
homology (PH) domain. The kinesin-3 family motors KIFIA
(Caenorhabditis elegans homolog unc-104) and KIFIB
transport synaptic vesicle precursors that contain
synaptic vesicle proteins, such as synaptophysin,
synaptotagmin and the small GTPase RAB3A, but they do
not transport organelles that contain plasma membrane
proteins. They have a N-terminal motor domain, followed
by a coiled-coil domain, and a C-terminal PH domain.
KIF1A adopts a monomeric form in vitro, but acts as a
processive dimer in vivo. KIF1B has alternatively
spliced isoforms distinguished by the presence or
absence of insertion sequences in the conserved
amino-terminal region of the protein; this results in
their different motor activities. KIF1A and KIF1B bind
to RAB3 proteins through the adaptor protein
mitogen-activated protein kinase (MAPK) -activating
death domain (MADD; also calledDENN), which was first
identified as a RAB3 guanine nucleotide exchange factor
(GEF). PH domains have diverse functions, but in general
are involved in targeting proteins to the appropriate
cellular location or in the interaction with a binding
partner. They share little sequence conservation, but
all have a common fold, which is electrostatically
polarized. Less than 10% of PH domains bind
phosphoinositide phosphates (PIPs) with high affinity
and specificity. PH domains are distinguished from other
PIP-binding domains by their specific high-affinity
binding to PIPs with two vicinal phosphate groups:
PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
results in targeting some PH domain proteins to the
plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 111
Score = 30.2 bits (69), Expect = 1.0
Identities = 9/32 (28%), Positives = 18/32 (56%)
Query: 135 YSLYSGGRTIARRCASGSELVDWLMSLAPSLA 166
+++Y+ + + S E+ DWL ++ P LA
Sbjct: 80 FAVYTPTNSYLLQARSEKEMHDWLYAIDPLLA 111
Score = 27.6 bits (62), Expect = 8.7
Identities = 8/25 (32%), Positives = 13/25 (52%)
Query: 69 RTIARRCASGSELVDWLMSLAPSLA 93
+ + S E+ DWL ++ P LA
Sbjct: 87 NSYLLQARSEKEMHDWLYAIDPLLA 111
>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203).
This domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 120
Score = 30.4 bits (69), Expect = 1.1
Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 9/105 (8%)
Query: 247 RKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELL 306
+ L +L+ E + + E+E+ E E E G + E EQI
Sbjct: 9 NELLPELRRLL---EEVREAKAELEELEAALSEGERSLEARGL-KEELRELEEQIRA--- 61
Query: 307 HLKPLHHLSNSVRRELAGVVMFEAHPRKGEILF--HQGDEGKSWY 349
+ + L V+ G+V F A E+ G+ +++
Sbjct: 62 AIAEIEELGIEVKDLDIGLVDFPALRDGEEVYLCWKYGEPEIAFW 106
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 32.2 bits (73), Expect = 1.1
Identities = 15/76 (19%), Positives = 28/76 (36%), Gaps = 1/76 (1%)
Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRY-ILRKQLKKLQNRYLPREYMLHLEREIE 271
+ ++ + E+ E + + + +Y L LE E
Sbjct: 889 LESKEEKEKEEKKELEEESQKDNLLEEKENEIEERIAEEAIILLKYESEPEELLLEEADE 948
Query: 272 DKEEEEKEEEEEERQI 287
++EE+ +EEEEER
Sbjct: 949 KEKEEDNKEEEEERNK 964
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 32.3 bits (73), Expect = 1.3
Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 2/33 (6%)
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLE 299
+ + +D +EE+ EEEEEE + K D ED E
Sbjct: 159 DEDEDDDDEEDDEEEEEEEEEIKGFD--DEDEE 189
Score = 31.9 bits (72), Expect = 1.6
Identities = 11/53 (20%), Positives = 30/53 (56%), Gaps = 3/53 (5%)
Query: 250 LKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIY 302
+K+ + R+L E M + + D ++E+++E++++ + + E+ E+I
Sbjct: 132 IKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDD---EEDDEEEEEEEEEIK 181
>gnl|CDD|178459 PLN02868, PLN02868, acyl-CoA thioesterase family protein.
Length = 413
Score = 31.6 bits (72), Expect = 1.4
Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 1/62 (1%)
Query: 297 DLEQIYEELLHLKPLHHLSNSVRRELAGVVMFEAHPRKGEILFHQGDEGKSWYIIIQGSV 356
+ E + E L + L L +S +++A VV+ + KGE + +G+ G Y I +G
Sbjct: 2 NTESVVEFLGSVPLLQRLPSSSLKKIAEVVVPK-RYGKGEYVVREGEPGDGLYFIWKGEA 60
Query: 357 DV 358
+V
Sbjct: 61 EV 62
>gnl|CDD|204076 pfam08824, Serine_rich, Serine rich protein interaction domain.
This is a serine rich domain that is found in the
docking protein p130(cas) (Crk-associated substrate).
This domain folds into a four helix bundle which is
associated with protein-protein interactions.
Length = 159
Score = 30.8 bits (70), Expect = 1.5
Identities = 16/55 (29%), Positives = 26/55 (47%), Gaps = 10/55 (18%)
Query: 213 QATAQDIAEAEEHLEEAL----------LALARRAPDAILRYILRKQLKKLQNRY 257
+A DI A + +EE+L + A + D L LR+QL+ L++ Y
Sbjct: 39 EANINDIRRAVDRVEESLREFLEFARGAVGNATQLTDRNLHAKLRRQLQTLEDSY 93
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 30.9 bits (70), Expect = 1.5
Identities = 35/140 (25%), Positives = 57/140 (40%), Gaps = 13/140 (9%)
Query: 207 KEGSSSQATAQDIAEAEEHLEEALLALARR----APDAI--LRYILRKQ-LKKLQNRYLP 259
A A+D A + A + + DA Y L+K+ L+KL++
Sbjct: 30 FAAKPVGAEAEDAAAELAEKKAAAQSEIEKFCANIADAAADQLYALQKELLEKLKDLAEV 89
Query: 260 REYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVR 319
E + + +E K+ E K+ EEE I + L +IY ++ L N +
Sbjct: 90 NERL----KALEKKKAELKDLEEEREGILRSKQAEDGKLVKIYSKMKPDAAAAILEN-LP 144
Query: 320 RELAGVVMFEAHPRK-GEIL 338
E A ++ + PRK G IL
Sbjct: 145 DEEAAAILMKLKPRKLGLIL 164
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 31.3 bits (71), Expect = 1.5
Identities = 18/39 (46%), Positives = 27/39 (69%)
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
E E+E+ EE+E+EE EEE + +PL T E+L + E+L
Sbjct: 81 EEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTEASEQL 119
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 31.6 bits (72), Expect = 1.5
Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 267 EREIEDK----EEEEKEEEEEERQIGKPLDRTSEDLEQIY 302
E+ ++ K E E+KEEE E+ K +++ EDLE++Y
Sbjct: 86 EQGLQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVY 125
>gnl|CDD|218954 pfam06246, Isy1, Isy1-like splicing family. Isy1 protein is
important in the optimisation of splicing.
Length = 253
Score = 31.1 bits (71), Expect = 1.6
Identities = 14/46 (30%), Positives = 18/46 (39%)
Query: 260 REYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
+L E E E + E+ EE +E Q E E IY L
Sbjct: 174 DGVLLEYEAEREKELIEKLVEEWKEIQDEDLQAEEDEREEDIYIPL 219
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 31.4 bits (71), Expect = 1.6
Identities = 30/129 (23%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 220 AEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKE-EEEK 278
E E E+AL A+R + R L+ ++++ ++ LE I+D + ++E
Sbjct: 53 REREAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDEN 112
Query: 279 EEEEEE----RQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRR--ELAGVVMFEAHP 332
EEEE E R++ + + R E+ E++ E ++ + +++ RR +
Sbjct: 113 EEEEYEAWKLRELKR-IKRDREEREEMEREKAEIEKMRNMTEEERRAELRKNPKVSTNKA 171
Query: 333 RKGEILFHQ 341
+KG+ F Q
Sbjct: 172 KKGKYKFLQ 180
>gnl|CDD|235359 PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha;
Provisional.
Length = 310
Score = 31.2 bits (72), Expect = 1.7
Identities = 15/37 (40%), Positives = 21/37 (56%), Gaps = 2/37 (5%)
Query: 265 HLE--REIEDKEEEEKEEEEEERQIGKPLDRTSEDLE 299
L ++E+ EE E+E EEEE + L R E+LE
Sbjct: 219 QLSVFVDLEEAEEAEEEVEEEEPEFDPILLRPIEELE 255
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 30.8 bits (70), Expect = 1.8
Identities = 26/77 (33%), Positives = 32/77 (41%), Gaps = 12/77 (15%)
Query: 245 ILRKQLKKL---QNRYLPREYMLHLEREIEDKEE------EEKEEEEEERQIGKPLDRTS 295
I K+ KL Q R RE E + EE +E+EE EEER+ K +
Sbjct: 3 IGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERK 62
Query: 296 EDLEQI---YEELLHLK 309
E EQ EE LK
Sbjct: 63 EREEQARKEQEEYEKLK 79
Score = 28.9 bits (65), Expect = 6.2
Identities = 19/66 (28%), Positives = 29/66 (43%), Gaps = 5/66 (7%)
Query: 234 ARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDR 293
ARR ++ K + R R+ LE E E K+EEE+ +E EE+ +
Sbjct: 16 ARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQAR-----K 70
Query: 294 TSEDLE 299
E+ E
Sbjct: 71 EQEEYE 76
>gnl|CDD|223771 COG0699, COG0699, Predicted GTPases (dynamin-related) [General
function prediction only].
Length = 546
Score = 31.4 bits (71), Expect = 2.0
Identities = 19/80 (23%), Positives = 27/80 (33%)
Query: 212 SQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE 271
Q I E L + AI + K+ + Q + L R Y+ LE E+
Sbjct: 458 KTLEKQLIKSLLESLLILAQKIRDSVLKAIFELLKNKRKRLAQKQRLKRLYLEQLEDELL 517
Query: 272 DKEEEEKEEEEEERQIGKPL 291
EE E Q + L
Sbjct: 518 RTAEEILELRLLLEQFLEAL 537
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 31.3 bits (72), Expect = 2.0
Identities = 22/106 (20%), Positives = 39/106 (36%), Gaps = 20/106 (18%)
Query: 208 EGSSSQATA--QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH 265
Q EAE L+EA L+ L ++ +KLQ + +
Sbjct: 523 ASLEELERELEQKAEEAEALLKEA----------EKLKEELEEKKEKLQEEE--DKLLEE 570
Query: 266 LEREIEDKEEEEKEEEEE------ERQIGKPLDRTSEDLEQIYEEL 305
E+E + +E K+E +E + Q G + +L + + L
Sbjct: 571 AEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRL 616
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 31.4 bits (71), Expect = 2.0
Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 1/65 (1%)
Query: 260 REYMLHLEREIEDKEEEEKEEEEE-ERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSV 318
+ ++ LE E+ KE E K + E ER+ + +L++ E + LK L L
Sbjct: 67 KSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELEVRLKALEELEKKA 126
Query: 319 RRELA 323
E A
Sbjct: 127 ENEAA 131
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 29.3 bits (66), Expect = 2.1
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 269 EIEDKEEEEKEEEEEE 284
E ++ EEEEKEEE EE
Sbjct: 82 EADEAEEEEKEEEAEE 97
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 31.3 bits (71), Expect = 2.2
Identities = 13/45 (28%), Positives = 25/45 (55%)
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPL 311
E E + ++EE+ +EEEE++ K + ++ +E L KP+
Sbjct: 49 EEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWELLNKTKPI 93
Score = 29.4 bits (66), Expect = 8.3
Identities = 20/70 (28%), Positives = 44/70 (62%), Gaps = 16/70 (22%)
Query: 244 YILRKQLKKLQNRYLPREYM-----LHLEREIE----DKEEEEKEEEEEERQIGKPLDRT 294
Y+ +++K+L ++ E++ L +E+E+E D+EEEE++EE++E + ++T
Sbjct: 2 YLEERRIKELVKKH--SEFINYPIYLWVEKEVEKEVPDEEEEEEKEEKKEEE-----EKT 54
Query: 295 SEDLEQIYEE 304
++ E++ EE
Sbjct: 55 TDKEEEVDEE 64
>gnl|CDD|131219 TIGR02164, torA, trimethylamine-N-oxide reductase TorA. This very
narrowly defined family represents TorA, part of a
family of related molybdoenzymes that include biotin
sulfoxide reductases, dimethyl sulfoxide reductases, and
at least two different subfamilies of
trimethylamine-N-oxide reductases. A single enzyme from
the larger family may have more than one activity. TorA
typically is located in the periplasm, has a Tat
(twin-arginine translocation)-dependent signal sequence,
and is encoded in a torCAD operon.
Length = 822
Score = 31.4 bits (71), Expect = 2.3
Identities = 16/67 (23%), Positives = 32/67 (47%), Gaps = 12/67 (17%)
Query: 385 PRLIITSMVFHQGDEGKSWYIIIQGSVDVVIYGKGCVTSLYAGEDFGKLALVNNAPSYQL 444
P ++ ++ V+ QG SW +I++ S +V++ V +L G N +++
Sbjct: 189 PYVLGSTEVYAQG---TSWPLILENSDTIVLWANDPVKNLQVGW---------NCETHES 236
Query: 445 HMYLEQM 451
YL Q+
Sbjct: 237 FAYLAQL 243
>gnl|CDD|183384 PRK11924, PRK11924, RNA polymerase sigma factor; Provisional.
Length = 179
Score = 30.3 bits (69), Expect = 2.4
Identities = 24/121 (19%), Positives = 42/121 (34%), Gaps = 17/121 (14%)
Query: 218 DIAEAEEHLEEALLALARRAPD------------AILRYILRKQLKKLQNRYLPREYMLH 265
D A AE+ ++EA L R+A I R + L++ +
Sbjct: 36 DRALAEDAVQEAFLRAWRKADLFNGKGSARTWLLTIARNVCYDLLRRRRREKAVLSDD-- 93
Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGV 325
+ E E + +DR + L E+ L+ + LS RE+A +
Sbjct: 94 ALEPEFAETAETPEAALLAKDDLARIDRCLDALPVKQREVFLLRYVEGLS---YREIAEI 150
Query: 326 V 326
+
Sbjct: 151 L 151
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 29.9 bits (68), Expect = 2.4
Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 3/44 (6%)
Query: 247 RKQLKKLQN-RYLPREYMLHL--EREIEDKEEEEKEEEEEERQI 287
+K KKL R R+ + E+E E++EE E EE +EE QI
Sbjct: 77 KKDFKKLLKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQI 120
>gnl|CDD|183004 PRK11161, PRK11161, fumarate/nitrate reduction transcriptional
regulator; Provisional.
Length = 235
Score = 30.5 bits (69), Expect = 2.6
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 334 KGEILFHQGDEGKSWYIIIQGSV 356
KG+ LF GDE KS Y I G++
Sbjct: 44 KGQTLFKAGDELKSLYAIRSGTI 66
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 30.6 bits (69), Expect = 2.6
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
L++E E +E +E E+E+ + L E+ EQ+ E
Sbjct: 75 LKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENE 113
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 30.1 bits (68), Expect = 2.7
Identities = 26/99 (26%), Positives = 45/99 (45%), Gaps = 4/99 (4%)
Query: 210 SSSQATAQDIAEAEEHLEEALLALARRAPD-AILRYILRKQLKKLQNRY--LPREYMLHL 266
S + ++ E E LEE + L D +LR+ LK L+ R L L
Sbjct: 82 SELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKE-L 140
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
+E+ + ++ +EE EE R+ + L + L++ +EL
Sbjct: 141 AKELRELRQDLREEVEELREELERLQENLQRLQEAIQEL 179
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 30.4 bits (69), Expect = 2.7
Identities = 25/104 (24%), Positives = 37/104 (35%), Gaps = 19/104 (18%)
Query: 204 WQDKEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYM 263
D+ + +D E E EE + D ++KLQ
Sbjct: 53 ANDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDE-----ASSIIEKLQ--------- 98
Query: 264 LHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLH 307
+ +ERE E+ EEE + EE + G E E+ EE L
Sbjct: 99 MQIEREQEEWEEELERLIEEAKAEGY-----EEGYEKGREEGLE 137
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 30.5 bits (69), Expect = 2.7
Identities = 16/62 (25%), Positives = 30/62 (48%), Gaps = 2/62 (3%)
Query: 224 EHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEE 283
L+E + L R+A +A ++ KQ ++ + E ERE E +E E ++ +
Sbjct: 74 RSLKE--VKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAK 131
Query: 284 ER 285
E+
Sbjct: 132 EK 133
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 30.8 bits (70), Expect = 2.8
Identities = 33/194 (17%), Positives = 66/194 (34%), Gaps = 34/194 (17%)
Query: 221 EAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRY----LPREYMLHLEREIEDKEEE 276
E E L I L +L++ + + +R IE E+E
Sbjct: 432 ERLEEENSELKRELEELKREI--EKLESELERFRREVRDKVRKDREIRARDRRIERLEKE 489
Query: 277 EKEEEEEERQIGKPLDRTSEDLEQIYE--------ELLHLKPLHHLSNSVRRELAGVVMF 328
+E+++ + L+R +L ++ + + ++ L + E G+
Sbjct: 490 LEEKKKRVEE----LERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEYGI--- 542
Query: 329 EAHPRKGEILFHQGDEGKSWYII---IQGSVDVVIYGKGCVTSLYAGEDFGKLALVNNAP 385
++G+++ + G I +I G+ S A E+F K N P
Sbjct: 543 ----KEGDVILVEDPSGGGARTAEELIDKKPRAIIRGEE--MSHAAAEEFFK----NEIP 592
Query: 386 RLIITSMVFHQGDE 399
L + + DE
Sbjct: 593 VLPEGDVQIIRLDE 606
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 30.0 bits (68), Expect = 3.5
Identities = 15/67 (22%), Positives = 21/67 (31%), Gaps = 2/67 (2%)
Query: 538 MGLSNVAVSRLSLTWDKLPSKSKKTYTELEALIDPSKNHRAYRQAVSKLQSPVIPFMPLL 597
MG+ N A +L K P++S T L K A + P
Sbjct: 6 MGIKNFAFCKLKGPSMKAPTRSLPTLNAWLKLNLDEKIGADDAPATDEKSKIS--LSPRY 63
Query: 598 LKDLAFT 604
LA+
Sbjct: 64 YAFLAYK 70
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 30.4 bits (69), Expect = 3.7
Identities = 20/50 (40%), Positives = 28/50 (56%)
Query: 235 RRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEE 284
RR + I R+I R L++ + R P E E E++E E+EEEEEE
Sbjct: 275 RRRGEMIRRFIDRLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEE 324
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 29.5 bits (66), Expect = 3.8
Identities = 18/58 (31%), Positives = 29/58 (50%), Gaps = 3/58 (5%)
Query: 247 RKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
+ KKL+ R P + E + + + EE +E+EEE LD E+ E++ EE
Sbjct: 4 KGNAKKLKVRPPPTKDE---EEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEE 58
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 30.2 bits (68), Expect = 3.9
Identities = 17/84 (20%), Positives = 31/84 (36%), Gaps = 4/84 (4%)
Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQL-KKLQNRYLPREYMLHLERE-I 270
A + + +EE + AL D + + L +L K + ++ R+ I
Sbjct: 30 NAEKKSKNDKLARIEEEIEALKTLDLDQLAKKYLFSKLIKVITSKPTKAWAESPEFRQII 89
Query: 271 EDKEEEEK--EEEEEERQIGKPLD 292
DK + E E + K L+
Sbjct: 90 NDKSHKANPSRSEAEVVEKLKSLN 113
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 30.0 bits (68), Expect = 3.9
Identities = 21/98 (21%), Positives = 44/98 (44%), Gaps = 11/98 (11%)
Query: 214 ATAQDI-AEAEEHLEEA-LLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE 271
A Q++ E E E+ L R+ ++ R + + KK Q L E L ++I+
Sbjct: 83 AKLQELRKEYRELKEKRNEFNLGGRSIKSLER-EIERLEKKQQTSVLTPEEERELVQKIK 141
Query: 272 DKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLK 309
+ +E ++ ++ + +E L+++ E+ LK
Sbjct: 142 ELRKELEDAKKALEE--------NEKLKELKAEIDELK 171
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 30.3 bits (69), Expect = 4.0
Identities = 20/67 (29%), Positives = 32/67 (47%), Gaps = 4/67 (5%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE 276
+++ E EE LE+ L + A R L + ++ + E L E+E+ EEE
Sbjct: 341 EELKELEEELEKIKKLLKKLPKKA--RGQLPPEKREQLEKL--LETKEKLSEELEELEEE 396
Query: 277 EKEEEEE 283
KE +EE
Sbjct: 397 LKELKEE 403
>gnl|CDD|220222 pfam09403, FadA, Adhesion protein FadA. FadA (Fusobacterium
adhesin A) is an adhesin which forms two alpha helices.
Length = 126
Score = 28.7 bits (64), Expect = 4.3
Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 3/70 (4%)
Query: 206 DKEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLH 265
++E + +++ E +E A+ ++ R+ + + KKL RY EY+
Sbjct: 48 EEEKQEKETAEKEVQELKERQLGREELYAKLKEESKTRF-HKDEYKKLLKRY--DEYLNK 104
Query: 266 LEREIEDKEE 275
LE++I DKE+
Sbjct: 105 LEQKITDKEQ 114
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 29.5 bits (67), Expect = 4.3
Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 1/67 (1%)
Query: 244 YILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIY- 302
+ +RK L+ Q + + + LE E E+ E+ E E + I K + + E+ +
Sbjct: 110 FGMRKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHA 169
Query: 303 EELLHLK 309
+E+ LK
Sbjct: 170 DEIAFLK 176
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 29.4 bits (66), Expect = 4.4
Identities = 11/40 (27%), Positives = 24/40 (60%), Gaps = 5/40 (12%)
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDL--EQIYEE 304
E ++K+E+++EEEEE+ + ++ + E+I E+
Sbjct: 41 EDVKDEKQEDDEEEEEEDEEE---IEEPEDIEDEEEIVED 77
>gnl|CDD|205495 pfam13315, DUF4085, Protein of unknown function (DUF4085). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 101 and 269 amino
acids in length.
Length = 208
Score = 29.8 bits (67), Expect = 4.5
Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 3/47 (6%)
Query: 264 LHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKP 310
L E+ EEE +EEEE L E++E+I E+LL P
Sbjct: 17 LSFPESQEEWEEELREEEESGNDY---LQSLKEEIEEIKEDLLKFLP 60
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.1 bits (67), Expect = 5.0
Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 11/109 (10%)
Query: 207 KEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRK----------QLKKLQN- 255
K+ + A++ +AEE ++A AL + A +A L+K +LKK +
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727
Query: 256 RYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
+ E E + K EE K++EEE+++I + E+I +E
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776
>gnl|CDD|239885 cd04438, DEP_dishevelled, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in dishevelled-like proteins.
Dishevelled-like proteins play a key role in the
transduction of the Wnt signal from the cell surface to
the nucleus, which in turn is an important regulatory
pathway for cellular development and growth. They
contain an N-terminal DIX domain, a central PDZ domain,
and a C-terminal DEP domain.
Length = 84
Score = 27.7 bits (62), Expect = 5.1
Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 7/51 (13%)
Query: 151 GSELVDWLMSLAPSLAVS---RQITTGMWQALLEEGVIYHVNGEQAFRDKC 198
GS+LVDWL+S L R+ + + + G I H + F ++C
Sbjct: 32 GSDLVDWLLSHVEGLTDRREARKYASSLLKL----GYIRHTVNKITFSEQC 78
>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
Provisional.
Length = 784
Score = 29.9 bits (67), Expect = 5.1
Identities = 30/124 (24%), Positives = 56/124 (45%), Gaps = 13/124 (10%)
Query: 218 DIAEAEEHL------EEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE 271
D+ E E+L E LL + +R +R ++KQ++K Q Y E M +++E+
Sbjct: 188 DVNERLEYLMAMMESEIDLLQVEKR-----IRNRVKKQMEKSQREYYLNEQMKAIQKELG 242
Query: 272 DKEEEEKEEEEEERQI--GKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGVVMFE 329
+ ++ E E +R+I K E E ++L + P+ + VR + +V
Sbjct: 243 EMDDAPDENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMSAEATVVRGYIDWMVQVP 302
Query: 330 AHPR 333
+ R
Sbjct: 303 WNAR 306
>gnl|CDD|177972 PLN02338, PLN02338, 3-phosphoshikimate 1-carboxyvinyltransferase.
Length = 443
Score = 30.1 bits (68), Expect = 5.1
Identities = 16/53 (30%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 596 LLLKDLAFTHDGNKTVVDGLVNFEKMHMFAQTLRTLRYCRTRHLDHDKTVVDG 648
LLL L+ +G TVVD L++ + + L+TL ++++ VV+G
Sbjct: 29 LLLAALS---EG-TTVVDNLLDSDDIRYMLGALKTLGLNVEEDSENNRAVVEG 77
>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
transcription initiation/nucleotide excision repair
factor TFIIH, subunit TFB3 [Cell division and chromosome
partitioning / Transcription / DNA replication,
recombination, and repair].
Length = 314
Score = 29.6 bits (66), Expect = 5.1
Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 4/87 (4%)
Query: 223 EEHLEEALLALARRAPDAILRYILRKQLK----KLQNRYLPREYMLHLEREIEDKEEEEK 278
EE + D+IL + R + K +N R + + E + +EEE
Sbjct: 126 VSLTEEKVKKYEEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLERQIEEEERM 185
Query: 279 EEEEEERQIGKPLDRTSEDLEQIYEEL 305
+E +++IG L SED + +
Sbjct: 186 NKEWTKKEIGNRLGTASEDGNKTIKIG 212
>gnl|CDD|238663 cd01367, KISc_KIF2_like, Kinesin motor domain, KIF2-like group.
KIF2 is a protein expressed in neurons, which has been
associated with axonal transport and neuron development;
alternative splice forms have been implicated in
lysosomal translocation. This catalytic (head) domain
has ATPase activity and belongs to the larger group of
P-loop NTPases. Kinesins are microtubule-dependent
molecular motors that play important roles in
intracellular transport and in cell division. In this
subgroup the motor domain is found in the middle
(M-type) of the protein chain. M-type kinesins are (+)
end-directed motors, i.e. they transport cargo towards
the (+) end of the microtubule. Kinesin motor domains
hydrolyze ATP at a rate of about 80 per second, and move
along the microtubule at a speed of about 6400 Angstroms
per second (KIF2 may be slower). To achieve that,
kinesin head groups work in pairs. Upon replacing ADP
with ATP, a kinesin motor domain increases its affinity
for microtubule binding and locks in place. Also, the
neck linker binds to the motor domain, which repositions
the other head domain through the coiled-coil domain
close to a second tubulin dimer, about 80 Angstroms
along the microtubule. Meanwhile, ATP hydrolysis takes
place, and when the second head domain binds to the
microtubule, the first domain again replaces ADP with
ATP, triggering a conformational change that pulls the
first domain forward.
Length = 322
Score = 29.5 bits (67), Expect = 5.3
Identities = 21/69 (30%), Positives = 30/69 (43%), Gaps = 16/69 (23%)
Query: 571 DPSKNHRAYRQAVSKLQSPVIPFMPLLLKDLAFTHDGN-KTVVDGLVNFEKMHMFAQTLR 629
+K H +R SKL +L+D +F GN KTV+ ++ TL
Sbjct: 268 ASNKAHVPFRG--SKLT--------QVLRD-SF--IGNSKTVMIATIS-PSASSCEHTLN 313
Query: 630 TLRY-CRTR 637
TLRY R +
Sbjct: 314 TLRYADRVK 322
>gnl|CDD|222480 pfam13968, DUF4220, Domain of unknown function (DUF4220). This
family is found in plants and is often associated with
DUF294, pfam04578.
Length = 362
Score = 29.6 bits (67), Expect = 5.3
Identities = 21/116 (18%), Positives = 40/116 (34%), Gaps = 6/116 (5%)
Query: 228 EALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIE---DKEEEEKEEEEEE 284
E + AL + ++I +L + Y + ++ EE +
Sbjct: 104 ERIWALRSASMESIRSSMLGEPDPGPNYAKFMEGYKYLVAEGLQAKIVIGREEDVDLNSV 163
Query: 285 RQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGVVMFEAHPRKGEILFH 340
Q+ + L + E+ L L H L +RR +++ E RK + F
Sbjct: 164 EQVWEQLLLIILK-DGDEEKDLCLA--HALFKLLRRLFVDLILSEQDRRKSQEFFR 216
>gnl|CDD|215276 PLN02500, PLN02500, cytochrome P450 90B1.
Length = 490
Score = 29.8 bits (67), Expect = 5.5
Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 4/71 (5%)
Query: 266 LEREIED---KEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRREL 322
+ER++E+ K +EE E EE+ +G L ++ EQI + +L L H ++SV L
Sbjct: 242 IERKMEERIEKLKEEDESVEEDDLLGWVLKHSNLSTEQILDLILSLLFAGHETSSVAIAL 301
Query: 323 AGVVMFEAHPR 333
A + + P+
Sbjct: 302 A-IFFLQGCPK 311
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 29.8 bits (68), Expect = 5.6
Identities = 18/67 (26%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 246 LRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEEL 305
+++Q++K Q Y RE + +++E+ + ++++ E EE +I K L E E+ +EL
Sbjct: 218 VKEQMEKSQREYYLREQLKAIQKELGEDDDDKDEVEELREKIEK-LKLPKEAKEKAEKEL 276
Query: 306 LHLKPLH 312
L+ +
Sbjct: 277 KKLETMS 283
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 29.9 bits (68), Expect = 5.7
Identities = 34/173 (19%), Positives = 61/173 (35%), Gaps = 36/173 (20%)
Query: 210 SSSQATAQDIAEAEEHLEEALLALARR---APDAILRYILRKQLKKLQNRYLPREYMLHL 266
+ + E EE L L +LAR+ + +L Y L K++
Sbjct: 290 DELEFDPNRLEEVEERLFA-LKSLARKYGVTIEDLLEY-----LDKIKEEL--------- 334
Query: 267 EREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRRELAGVV 326
++++ EE + E+E +++ L +E L I ++ L V EL +
Sbjct: 335 -AQLDNSEESLEALEKEVKKLKAELLEAAEALSAIRKKA-----AKELEKEVTAELKALA 388
Query: 327 MFEAHPRKGEILFHQGDEGKSWYIIIQGSVDVVIYGKGCVTSLYAGEDFGKLA 379
M +A + + +E + G+ V S GE LA
Sbjct: 389 MEKA---RFTVELKPLEESPT----ADGADKVEFL-----ISTNPGEPLKPLA 429
>gnl|CDD|220212 pfam09376, NurA, NurA domain. This family includes NurA a nuclease
exhibiting both single-stranded endonuclease activity
and 5'-3' exonuclease activity on single-stranded and
double-stranded DNA from the hyperthermophilic archaeon
Sulfolobus acidocaldarius.
Length = 260
Score = 29.5 bits (66), Expect = 6.0
Identities = 24/139 (17%), Positives = 38/139 (27%), Gaps = 26/139 (18%)
Query: 256 RYLPREYMLHLEREIEDKEEEEKE---------------EEEEERQIGKPLDRTSEDLEQ 300
R RE M LE +E+ E + L LE
Sbjct: 71 RRRVRELMALLELAAALLLLKEEGPDLILLDGSLIPTLIRLPEYLERILELLYLLRLLEL 130
Query: 301 IYEELLHLKPLHHLSNSVRRELAGVVMFEAHPRK---------GEILFHQGDEGKSWYII 351
L + + + S EL ++ E G +L G ++
Sbjct: 131 A--LLEGIPLVGIVKTSRSSELLNLLRDELEEILPGISDRALLGLLLEKGGYREPIPLLL 188
Query: 352 IQGSVDVVIYGKGCVTSLY 370
I G + V +Y + Y
Sbjct: 189 ISGPLLVELYEDDEIYFFY 207
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 29.6 bits (66), Expect = 6.1
Identities = 23/144 (15%), Positives = 52/144 (36%), Gaps = 8/144 (5%)
Query: 205 QDKEGSSSQATAQDIAEA--EEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREY 262
+ K S+ Q+ ++ + E + + R + + + +
Sbjct: 83 EFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREERE------EVEETEGVTKSE 136
Query: 263 MLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHLSNSVRREL 322
+ R+ E+ ++EEKE E EE + K + E + +L H + + ++
Sbjct: 137 QKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMTHKLKHTENTFSRGGAEGAQV 196
Query: 323 AGVVMFEAHPRKGEILFHQGDEGK 346
FE +K + + +E K
Sbjct: 197 EAGKEFEKLKQKQQEAALELEELK 220
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 27.3 bits (61), Expect = 6.2
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 266 LEREIEDKEEEEKEEEEEERQ 286
L+ E E +EEEE+ + EER+
Sbjct: 29 LKAEEEKREEEEEARKREERK 49
Score = 26.9 bits (60), Expect = 6.7
Identities = 13/44 (29%), Positives = 25/44 (56%), Gaps = 4/44 (9%)
Query: 263 MLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELL 306
+ ++E++ +EE+ +EEEE K +R + + +EELL
Sbjct: 22 LKAKKKELKAEEEKREEEEEAR----KREERKEREKNKSFEELL 61
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 29.4 bits (67), Expect = 6.3
Identities = 11/18 (61%), Positives = 14/18 (77%)
Query: 267 EREIEDKEEEEKEEEEEE 284
E E E++EEEE+EE EE
Sbjct: 302 EEEEEEEEEEEEEEPSEE 319
Score = 29.1 bits (66), Expect = 9.3
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 267 EREIEDKEEEEKEEEEEERQIG 288
E E++EEEE+EEE E +
Sbjct: 301 AEEEEEEEEEEEEEEPSEEEAA 322
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 29.9 bits (68), Expect = 6.4
Identities = 20/98 (20%), Positives = 41/98 (41%), Gaps = 12/98 (12%)
Query: 216 AQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREY---MLHLEREIED 272
A ++ L E L +A R R+QL++ Q + +Y + L+ +
Sbjct: 976 AAEMLAKNSDLNEKLRQRLEQAEQE--RTRAREQLRQAQAQL--AQYNQVLASLKSSYDA 1031
Query: 273 KEEEEKEEEEEERQIGKPLD-----RTSEDLEQIYEEL 305
K + +E ++E + +G P D R ++++ L
Sbjct: 1032 KRQMLQELKQELQDLGVPADSGAEERARARRDELHARL 1069
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 29.4 bits (66), Expect = 6.5
Identities = 13/47 (27%), Positives = 25/47 (53%), Gaps = 5/47 (10%)
Query: 261 EYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLH 307
E L E+E E++ ++EEEE+Q+ K ++ + + +EL
Sbjct: 148 EEALEFEKEEEEQRRLLLQKEEEEQQMNKRKNK-----QALLDELET 189
>gnl|CDD|233703 TIGR02063, RNase_R, ribonuclease R. This family consists of an
exoribonuclease, ribonuclease R, also called VacB. It is
one of the eight exoribonucleases reported in E. coli
and is broadly distributed throughout the bacteria. In
E. coli, double mutants of this protein and
polynucleotide phosphorylase are not viable. Scoring
between trusted and noise cutoffs to the model are
shorter, divergent forms from the Chlamydiae, and
divergent forms from the Campylobacterales (including
Helicobacter pylori) and Leptospira interrogans
[Transcription, Degradation of RNA].
Length = 709
Score = 29.5 bits (67), Expect = 6.5
Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 5/57 (8%)
Query: 235 RRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEEE----EKEEEEEERQI 287
RR PD I+ +++K L +N + +LE ++E+ E E+ +E ER +
Sbjct: 559 RRYPDLIVHRLIKKALFGGEN-TTTEKEREYLEAKLEEIAEHSSKTERRADEAERDV 614
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 29.5 bits (67), Expect = 6.5
Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 250 LKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQI 301
L KL++ L LE+E EE+ E E ++ + +DR E++E++
Sbjct: 22 LNKLEDTKLKL-----LEKEKRLLLLEEQRLEAEGLRLKREVDRLREEIERL 68
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 29.7 bits (67), Expect = 6.7
Identities = 23/96 (23%), Positives = 36/96 (37%), Gaps = 16/96 (16%)
Query: 213 QATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIED 272
A Q++A A+ L +AL L D + LP ++
Sbjct: 413 AALEQEVAVAKRELAQALSRLGLWRGDL------------EELVALPVPSAEQVQA---- 456
Query: 273 KEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHL 308
+E +E + +RQ L EDLE + +L HL
Sbjct: 457 FLKEAEEIMQAKRQKRDRLLELEEDLETLELQLRHL 492
>gnl|CDD|147887 pfam05974, DUF892, Domain of unknown function (DUF892). This
family consists of several hypothetical bacterial
proteins of unknown function.
Length = 159
Score = 28.7 bits (65), Expect = 7.0
Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 7/45 (15%)
Query: 217 QDIAEAEEHLEEALLALARRAPDAILRYIL-------RKQLKKLQ 254
+DI AE+ +AL +A+ A D L+ L Q+++L+
Sbjct: 11 RDIYAAEKQALKALPKMAKAAEDPELKAALEQHLEETEGQIERLE 55
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 29.4 bits (66), Expect = 7.3
Identities = 14/34 (41%), Positives = 21/34 (61%)
Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLE 299
+ER E++EEEE+EEEEE + P++ E
Sbjct: 398 VERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFE 431
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 28.4 bits (64), Expect = 7.4
Identities = 28/99 (28%), Positives = 41/99 (41%), Gaps = 21/99 (21%)
Query: 216 AQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKEE 275
A D+AEAE EEA LA ++L++ + + E + ++E E E
Sbjct: 43 ADDLAEAERLKEEAQALLAEY----------EQELEEAREQA--SEIIEQAKKEAEQIAE 90
Query: 276 EEKEEEEEER---------QIGKPLDRTSEDLEQIYEEL 305
E K E EEE +I +R E+L EL
Sbjct: 91 EIKAEAEEELERIKEAAEAEIEAEKERALEELRAEVAEL 129
>gnl|CDD|237722 PRK14476, PRK14476, nitrogenase molybdenum-cofactor biosynthesis
protein NifN; Provisional.
Length = 455
Score = 29.4 bits (67), Expect = 8.1
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 5/34 (14%)
Query: 213 QATAQD----IAEAEEHLEEALLALARRA-PDAI 241
Q TA D I +E++EEA+L + ++A P I
Sbjct: 63 QTTAMDEVTTILGGDENVEEAILNICKKAKPKII 96
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 29.3 bits (66), Expect = 8.3
Identities = 11/39 (28%), Positives = 16/39 (41%)
Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEE 304
+R E +EEE ++ EEE Q + S E
Sbjct: 379 EQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 28.8 bits (65), Expect = 8.6
Identities = 20/90 (22%), Positives = 42/90 (46%), Gaps = 14/90 (15%)
Query: 211 SSQATAQDIAEAEEHLEEALLALARRAPDAILRY-ILRKQLKKLQNRYLPREYMLHLERE 269
S + + + ++ E+ EE + L + +A R + ++KL E+E
Sbjct: 159 SLEVSEEKASQREDSYEEKIRDLTEKLKEAETRAEFAERSVQKL-------------EKE 205
Query: 270 IEDKEEEEKEEEEEERQIGKPLDRTSEDLE 299
++ E+E E+E+ + I + LD+T +L
Sbjct: 206 VDRLEDELLAEKEKYKAISEELDQTLAELT 235
>gnl|CDD|224228 COG1309, AcrR, Transcriptional regulator [Transcription].
Length = 201
Score = 28.8 bits (63), Expect = 8.6
Identities = 19/86 (22%), Positives = 27/86 (31%)
Query: 200 LYNFWQDKEGSSSQATAQDIAEAEEHLEEALLALARRAPDAILRYILRKQLKKLQNRYLP 259
LY + KE + + E LEEALL A +A+L +L + L
Sbjct: 49 LYRHFPSKEDLLLALLERALEELRALLEEALLDGAPEELEALLALLLAYLEFARDDPRLL 108
Query: 260 REYMLHLEREIEDKEEEEKEEEEEER 285
+ E E E
Sbjct: 109 LLLLAAALAGELAPELEAILRELLAL 134
>gnl|CDD|219663 pfam07957, DUF3294, Protein of unknown function (DUF3294). This
family was annotated as mitochondrial Ribosomal protein
MRP8, based on the presumed similarity of the
S.cerevisiae protein to an E.coli mitochondrial
ribosomal protein; however, this similarity is spurious,
and the function is not known [Wood, V].
Length = 217
Score = 28.6 bits (64), Expect = 8.8
Identities = 7/43 (16%), Positives = 23/43 (53%)
Query: 245 ILRKQLKKLQNRYLPREYMLHLEREIEDKEEEEKEEEEEERQI 287
+L ++ Q ++ H+E E D++ E++ ++ ++++
Sbjct: 153 LLLADQEEFQEFLQGKKEAFHIESEERDEDIEKEIKKYSKKEL 195
>gnl|CDD|216831 pfam01988, VIT1, VIT family. This family includes the vacuolar
Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
transporter VIT1.
Length = 209
Score = 28.8 bits (65), Expect = 9.0
Identities = 14/49 (28%), Positives = 23/49 (46%)
Query: 266 LEREIEDKEEEEKEEEEEERQIGKPLDRTSEDLEQIYEELLHLKPLHHL 314
LERE + E + + E EE +I + + E Q+ EL+ L +
Sbjct: 67 LEREKRELENDPEGEREELAEIYRARGLSEELARQVARELMKKDALDAM 115
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 29.3 bits (66), Expect = 9.3
Identities = 19/78 (24%), Positives = 33/78 (42%), Gaps = 8/78 (10%)
Query: 217 QDIAEAEEHLEEAL--LALARRAPDAILRYILRKQLKKLQNRYLPREYMLHLEREIEDKE 274
+ E E+ +E L + L + D +LKK + Y P + +R I K
Sbjct: 156 TGVLEDEDEGDE-LENVELVEKEKDK-----KNLELKKKKPDYDPDDDDKFNKRSILSKY 209
Query: 275 EEEKEEEEEERQIGKPLD 292
+EE E ++++ LD
Sbjct: 210 DEEIEGKKKKSDNLFTLD 227
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.135 0.398
Gapped
Lambda K H
0.267 0.0696 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 37,146,043
Number of extensions: 3795955
Number of successful extensions: 7461
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6801
Number of HSP's successfully gapped: 334
Length of query: 714
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 610
Effective length of database: 6,324,786
Effective search space: 3858119460
Effective search space used: 3858119460
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.1 bits)