RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3983
(318 letters)
>gnl|CDD|241348 cd13194, FERM_C_ERM, ERM family FERM domain C-lobe/F3. The ERM
family includes ezrin, radixin, moesin and merlin. They
are composed of a N-terminal FERM (ERM) domain (also
called N-ERMAD (N-terminal ERM association domain)), a
coiled coil region (CRR), and a C-terminal domain CERMAD
(C-terminal ERM association domain) which has an
F-actin-binding site (ABD). Two actin-binding sites have
been identified in the middle and N-terminal domains.
Merlin is structurally similar to the ERM proteins, but
instead of an actin-binding domain (ABD), it contains a
C-terminal domain (CTD), just like the proteins from the
4.1 family. Activated ezrin, radixin and moesin are
thought to be involved in the linking of actin filaments
to CD43, CD44, ICAM1-3 cell adhesion molecules, various
membrane channels and receptors, such as the Na+/H+
exchanger-3 (NHE3), cystic fibrosis transmembrane
conductance regulator (CFTR), and the beta2-adrenergic
receptor. The ERM proteins exist in two states, a
dormant state in which the FERM domain binds to its own
C-terminal tail and thereby precludes binding of some
partner proteins, and an activated state, in which the
FERM domain binds to one of many membrane binding
proteins and the C-terminal tail binds to F-actin. The
FERM domain has a cloverleaf tripart structure (FERM_N,
FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe,
C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM domain
of ERM is part of the PH domain family. The FERM domain
is found in the cytoskeletal-associated proteins such as
ezrin, moesin, radixin, 4.1R, and merlin. These proteins
provide a link between the membrane and cytoskeleton and
are involved in signal transduction pathways. The FERM
domain is also found in protein tyrosine phosphatases
(PTPs) , the tyrosine kinases FAK and JAK, in addition
to other proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 97
Score = 145 bits (367), Expect = 9e-44
Identities = 56/67 (83%), Positives = 62/67 (92%)
Query: 143 PNCLLTPKIGFPWSEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHE 202
LTPKIGFPWSEIRNISFND+KF+IKPIDKKAPDFVF++PR+R+NKRIL LCMGNHE
Sbjct: 31 LENKLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYSPRLRINKRILDLCMGNHE 90
Query: 203 LYMRRRK 209
LYMRRRK
Sbjct: 91 LYMRRRK 97
Score = 81.5 bits (202), Expect = 1e-19
Identities = 32/40 (80%), Positives = 35/40 (87%)
Query: 51 MYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTNVMS 90
MYGVNYFEIKNKKGT+LWLGVDALGLNIYE E+ LT +
Sbjct: 1 MYGVNYFEIKNKKGTDLWLGVDALGLNIYELENKLTPKIG 40
>gnl|CDD|220216 pfam09380, FERM_C, FERM C-terminal PH-like domain.
Length = 90
Score = 86.2 bits (214), Expect = 3e-21
Identities = 28/65 (43%), Positives = 34/65 (52%), Gaps = 3/65 (4%)
Query: 151 IGFPWSEIRNISFNDRKFIIKPID---KKAPDFVFFAPRVRVNKRILALCMGNHELYMRR 207
FPWSEIR ISF +KF+IK D K F+ P R K + LC+ H + R
Sbjct: 26 NTFPWSEIRKISFKRKKFLIKLRDEDESKETTLTFYTPSPRACKYLWKLCVEQHAFFRLR 85
Query: 208 RKPDT 212
R PDT
Sbjct: 86 RPPDT 90
Score = 41.9 bits (99), Expect = 2e-05
Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 5/73 (6%)
Query: 61 NKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPLQFKFRAKFYPEDVVEEIIQDI 120
+KKGT+LWLGV A G+ +YE + + +++K + FK R KF + E+ ++
Sbjct: 1 DKKGTDLWLGVSAKGILVYEDNNKINTFPWSEIRK---ISFK-RKKFLIKLRDEDESKET 56
Query: 121 TLRLFYLQVSRSA 133
TL FY R+
Sbjct: 57 TLT-FYTPSPRAC 68
>gnl|CDD|241233 cd00836, FERM_C-lobe, FERM domain C-lobe. The FERM domain has a
cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N,
alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3).
The C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs), the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 91
Score = 62.0 bits (151), Expect = 2e-12
Identities = 21/58 (36%), Positives = 25/58 (43%)
Query: 148 TPKIGFPWSEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHELYM 205
P F W EIR ISF+ + F I+ K F R K I LC+G H Y
Sbjct: 34 NPLTLFRWPEIRKISFSKKTFTIEIRGDKEEKLAFQTSSPREAKYIWKLCVGYHRFYR 91
>gnl|CDD|215882 pfam00373, FERM_M, FERM central domain. This domain is the central
structural domain of the FERM domain.
Length = 113
Score = 60.4 bits (147), Expect = 1e-11
Identities = 17/47 (36%), Positives = 24/47 (51%)
Query: 11 QHKMSPSEWESSITTWWKEHKGMLREDAMMEYLKIAQDLEMYGVNYF 57
K E E I K +G+ +A ++YL+IAQ L YGV +F
Sbjct: 67 IKKEKSKELEKRILEAHKNLRGLSPAEAKLKYLQIAQSLPTYGVTFF 113
>gnl|CDD|214604 smart00295, B41, Band 4.1 homologues. Also known as
ezrin/radixin/moesin (ERM) protein domains. Present in
myosins, ezrin, radixin, moesin, protein tyrosine
phosphatases. Plasma membrane-binding domain. These
proteins play structural and regulatory roles in the
assembly and stabilization of specialized plasmamembrane
domains. Some PDZ domain containing proteins bind one or
more of this family. Now includes JAKs.
Length = 201
Score = 62.3 bits (152), Expect = 1e-11
Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 5 PLRVMDQHKMSPSEWESSITTWWKEHKGMLREDAMMEYLKIAQDLEMYGVNYF 57
P +++D EW I KE G+ E+A ++YL++A+ L YGV F
Sbjct: 151 PKQLLDS--RKLKEWRERIVELHKELIGLSPEEAKLKYLELARKLPTYGVELF 201
Score = 43.4 bits (103), Expect = 4e-05
Identities = 28/92 (30%), Positives = 37/92 (40%), Gaps = 19/92 (20%)
Query: 44 KIAQDLEMYGVNYF---EIKNKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPLQ 100
+ + L + YF + WL D ++ QDVK E PL
Sbjct: 28 TVCRKLGIRESEYFGLQFEDPDEDLRHWL-------------DPAKTLLDQDVKSE-PLT 73
Query: 101 FKFRAKFYPEDVVEEIIQDIT-LRLFYLQVSR 131
FR KFYP D ++ +D T L L YLQV
Sbjct: 74 LYFRVKFYPPD-PNQLKEDPTRLNLLYLQVRN 104
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 55.1 bits (133), Expect = 7e-09
Identities = 24/71 (33%), Positives = 44/71 (61%)
Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
E+AE++ QE ER++Q+E +M + ++L E +E LEE+LKQ + + LE + EL+
Sbjct: 1 EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60
Query: 304 LELQPIRNQSN 314
E + + ++
Sbjct: 61 EENRRLEEEAA 71
Score = 34.3 bits (79), Expect = 0.058
Identities = 23/104 (22%), Positives = 49/104 (47%)
Query: 204 YMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE 263
R + D+++ + + E + + +KL+ E + EKK E E ++LE E
Sbjct: 10 LEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEE 69
Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
A +++ + + ++ +L+ +E+ EA +LQ EL+
Sbjct: 70 AAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELR 113
Score = 28.2 bits (63), Expect = 5.6
Identities = 17/102 (16%), Positives = 41/102 (40%), Gaps = 10/102 (9%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
++ + EE A +++R++L+ E+ + +LE E K++ + + Q
Sbjct: 60 EEENRRLEEEAAASEEERERLEAEVDEAT----------AEVAKLEEEREKKEAETRQLQ 109
Query: 276 EMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
+ +R +E ++ + E A T P+ +
Sbjct: 110 QELREAQEAHERARQELLEAAAAPTAPPHVAAPVNGEQLEPD 151
>gnl|CDD|241343 cd13189, FERM_C_PTPN4_PTPN3_like, Protein tyrosine phosphatase
non-receptor proteins 3 and 4 (PTPN4 and PTPN3) FERM
domain C-lobe. PTPN4 (also called PTPMEG, protein
tyrosine phosphatase, megakaryocyte) is a cytoplasmic
protein-tyrosine phosphatase (PTP) thought to play a
role in cerebellar function. PTPMEG-knockout mice have
impaired memory formation and cerebellar long-term
depression. PTPN3/PTPH1 is a membrane-associated PTP
that is implicated in regulating tyrosine
phosphorylation of growth factor receptors, p97 VCP
(valosin-containing protein, or Cdc48 in Saccharomyces
cerevisiae), and HBV (Hepatitis B Virus) gene
expression; it is mutated in a subset of colon cancers.
PTPMEG and PTPN3/PTPH1 contains a N-terminal FERM
domain, a middle PDZ domain, and a C-terminal
phosphatase domain. PTP1/Tyrosine-protein phosphatase 1
from nematodes and a FERM_C repeat 1 from Tetraodon
nigroviridis also included in this cd. The FERM domain
has a cloverleaf tripart structure (FERM_N, FERM_M,
FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1,
F2, F3). The C-lobe within the FERM domain is part of
the PH domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a
link between the membrane and cytoskeleton and are
involved in signal transduction pathways. The FERM
domain is also found in protein tyrosine phosphatases
(PTPs) , the tyrosine kinases FAK and JAK, in addition
to other proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 123
Score = 50.0 bits (120), Expect = 7e-08
Identities = 14/50 (28%), Positives = 26/50 (52%)
Query: 32 GMLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEK 81
G +A +L A+ LE+YGV ++ EL +GV + G+ +++
Sbjct: 1 GQSPAEAEFNFLNTARTLELYGVELHSARDASNLELQVGVTSGGILVFQN 50
Score = 28.4 bits (64), Expect = 2.0
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 153 FPWSEIRNISFNDRKFII 170
FPWS+I ISF ++F I
Sbjct: 57 FPWSKIVKISFKRKQFFI 74
>gnl|CDD|241341 cd13187, FERM_C_PTPH13, Protein tyrosine phosphatase non-receptor
13 (PTPH13) FERM domain C-lobe. There are many
functions of PTPN13 (also called PTPL1, PTP-BAS, hPTP1E,
FAP1, or PTPL1). Mice lacking PTPN13 activity have
abnormal regulation of signal transducer and activator
of transcription signaling in their T cells, mild
impairment of motor nerve repair, and a significant
reduction in the growth of retinal glia cultures. It
also plays a role in adipocyte differentiation. PTPN13
contains a kinase non-catalytic C-lobe domain (KIND), a
FERM domain with two potential phosphatidylinositol
4,5-biphosphate [PtdIns(4,5)P2]-binding motifs, 5 PDZ
domains, and a carboxy-terminal catalytic domain. There
is an nteraction between the FERM domain of PTPL1 and
PtdIns(4,5)P2 which is thought to regulate the membrane
localization of PTPN13. PDZ are protein/protein
interaction domains so there is the potential for
numerous partners that can actively participate in the
regulation of its phosphatase activity or can permit
direct or indirect recruitment of tyrosine
phosphorylated PTPL1 substrates. The FERM domain has a
cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N,
alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3).
The C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs), the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 124
Score = 46.9 bits (112), Expect = 1e-06
Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 11/83 (13%)
Query: 32 GMLREDAMMEYLKIAQDLEMYGVNY---FEIKNKKGTELWLGVDALGLNIYEKEDNLTNV 88
G+ E+A +E+LK+ Q L YGV++ + K +WLG+ + G+ IYE +
Sbjct: 1 GLSEEEAELEFLKVCQKLPEYGVHFHRVYREKKSSTGGIWLGICSRGIIIYEVHNGARTP 60
Query: 89 MS----QDVKKENPLQFKFRAKF 107
+ ++ KK + F R KF
Sbjct: 61 VLRFPWRETKK---ISFD-RKKF 79
Score = 40.7 bits (96), Expect = 1e-04
Identities = 20/61 (32%), Positives = 33/61 (54%)
Query: 148 TPKIGFPWSEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHELYMRR 207
TP + FPW E + ISF+ +KFII+ F+ + ++ +L LC H+ +M+
Sbjct: 59 TPVLRFPWRETKKISFDRKKFIIENRGGSGIKHTFYTDSYKKSQYLLRLCSSQHKFHMKM 118
Query: 208 R 208
R
Sbjct: 119 R 119
>gnl|CDD|241353 cd13199, FERM_C2_MyoVII, Myosin VII (MyoVII, Myo7) FERM domain
C-lobe, repeat 2. MyoVII, a MyTH-FERM myosin, is an
actin-based motor protein essential for a variety of
biological processes in the actin cytoskeleton
function. Mutations in MyoVII leads to problems in
sensory perception: deafness and blindness in humans
(Usher Syndrome), retinal defects and deafness in mice
(shaker 1), and aberrant auditory and vestibular
function in zebrafish. Myosin VIIAs have plus (barbed)
end-directed motor activity on actin filaments and a
characteristic actin-activated ATPase activity. MyoVII
consists of a conserved spectrin-like, SH3 subdomain
N-terminal region, a motor/head region, a neck made of
4-5 IQ motifs, and a tail consisting of a coiled-coil
domain, followed by a tandem repeat of myosin tail
homology 4 (MyTH4) domains and partial FERM domains
that are separated by an SH3 subdomain and are thought
to mediate dimerization and binding to other proteins
or cargo. Members include: MyoVIIa, MyoVIIb, and MyoVII
members that do not have distinct myosin VIIA and
myosin VIIB genes. The FERM domain has a cloverleaf
tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-,
and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a
link between the membrane and cytoskeleton and are
involved in signal transduction pathways. The FERM
domain is also found in protein tyrosine phosphatases
(PTPs) , the tyrosine kinases FAK and JAK, in addition
to other proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 176
Score = 46.9 bits (112), Expect = 2e-06
Identities = 17/48 (35%), Positives = 26/48 (54%)
Query: 13 KMSPSEWESSITTWWKEHKGMLREDAMMEYLKIAQDLEMYGVNYFEIK 60
SP EW+ I + +H GM E+A + +LKI +G +FE+K
Sbjct: 41 LQSPDEWKKLIIAAYNKHAGMSSEEAKLAFLKIIYRWPTFGSAFFEVK 88
>gnl|CDD|241338 cd13184, FERM_C_4_1_family, Protein 4.1 family FERM domain
C-lobe. The protein 4.1 family includes four
well-defined members: erythroid protein 4.1 (4.1R), the
bestknown and characterized member, 4.1G (general),
4.1N (neuronal), and 4.1 B (brain). The less well
understood 4.1O/FRMD3 is not a true member of this
family and is not included in this hierarchy. Besides
three highly conserved domains, FERM, SAB (spectrin and
actin binding domain) and CTD (C-terminal domain), the
proteins from this family contain several unique
domains: U1, U2 and U3. FERM domains like other members
of the FERM domain superfamily have a cloverleaf
architecture with three distinct lobes (FERM_N, FERM_M,
FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1,
F2, F3). The C-lobe/F3 within the FERM domain is part
of the PH domain family. The brain is a particularly
rich source of protein 4.1 isoforms. The various 4.1R,
4.1G, 4.1N, and 4.1B mRNAs are all expressed in
distinct patterns within the brain. It is likely that
4.1 proteins play important functional roles in the
brain including motor coordination and spatial
learning, postmitotic differentiation, and synaptic
architecture and function. In addition they are found
in nonerythroid, nonneuronal cells where they may play
a general structural role in nuclear architecture
and/or may interact with splicing factors. The FERM C
domain is the third structural domain within the FERM
domain. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a
link between the membrane and cytoskeleton and are
involved in signal transduction pathways. The FERM
domain is also found in protein tyrosine phosphatases
(PTPs) , the tyrosine kinases FAK and JAK, in addition
to other proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 113
Score = 44.2 bits (105), Expect = 7e-06
Identities = 16/49 (32%), Positives = 29/49 (59%)
Query: 32 GMLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYE 80
G +A + +L+ A+ L +YGV+ K+ +G ++ LGV A G+ +Y
Sbjct: 1 GQTPAEAELHFLENAKKLALYGVDLHPAKDSEGVDIMLGVCAGGILVYR 49
Score = 32.2 bits (74), Expect = 0.090
Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 4/53 (7%)
Query: 153 FPWSEIRNISFNDRKFIIK----PIDKKAPDFVFFAPRVRVNKRILALCMGNH 201
F W +I IS+ F IK ++ F P R KR+ +C+ +H
Sbjct: 57 FAWPKILKISYKRNNFYIKLRPGEFEQSETTIGFKLPNHRAAKRLWKVCVEHH 109
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 45.0 bits (107), Expect = 1e-05
Identities = 14/97 (14%), Positives = 43/97 (44%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIR 279
+ ++E Q++ +LQ ++A ++ ++ ++ + L+ + K + +
Sbjct: 69 PELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI 128
Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLN 316
L+E+ ++L+ EL+ L+ E + ++
Sbjct: 129 ELDEENRELREELAELKQENEALEAENERLQENEQRR 165
Score = 43.8 bits (104), Expect = 3e-05
Identities = 26/89 (29%), Positives = 44/89 (49%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
+QQ A+ +EE Q+Q +LQ E ++ + +ERL++ + + + +E
Sbjct: 71 LQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIEL 130
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
E R L E+L +L+ E LEA LQ
Sbjct: 131 DEENRELREELAELKQENEALEAENERLQ 159
Score = 37.3 bits (87), Expect = 0.005
Identities = 21/86 (24%), Positives = 49/86 (56%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
++Q+ A+ +E+ QQ+ +L+ E++ E ++ Q+ + R+KQL + D++ E
Sbjct: 77 ELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRE 136
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQ 299
+E + L+++ + L+A E L+ +
Sbjct: 137 LREELAELKQENEALEAENERLQENE 162
Score = 35.0 bits (81), Expect = 0.025
Identities = 21/59 (35%), Positives = 35/59 (59%)
Query: 255 ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
ERL +L+ E+A+ ++L E QE + L+++ ++L+ LEA LQ EL I+ S
Sbjct: 66 ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLS 124
Score = 32.3 bits (74), Expect = 0.22
Identities = 18/83 (21%), Positives = 40/83 (48%), Gaps = 2/83 (2%)
Query: 231 QQRDKLQLEIA-AREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ 289
++ +LQ E+A +E+ + ++ E ++ + + E + + + L +KQL
Sbjct: 66 ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELAR-IKQLS 124
Query: 290 AAKEELEARQTELQLELQPIRNQ 312
A EL+ EL+ EL ++ +
Sbjct: 125 ANAIELDEENRELREELAELKQE 147
Score = 30.0 bits (68), Expect = 1.1
Identities = 16/74 (21%), Positives = 38/74 (51%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
++QQ + ++E + + + ++LQ E+A ++ E E ++L E+A+ Q+
Sbjct: 91 ELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQENEA 150
Query: 274 AQEMIRRLEEQLKQ 287
+ RL+E ++
Sbjct: 151 LEAENERLQENEQR 164
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 46.2 bits (110), Expect = 2e-05
Identities = 26/107 (24%), Positives = 52/107 (48%), Gaps = 11/107 (10%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAARE-----------KAEKKHQESVERLKQLEVEMAKR 267
+ +EE +++ ++L E+ E + E++ +E + L L E+++
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
+Q +E + LE QL++L+A EELE++ EL EL + +
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE 347
Score = 45.8 bits (109), Expect = 2e-05
Identities = 22/106 (20%), Positives = 43/106 (40%), Gaps = 4/106 (3%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIA----AREKAEKKHQESVERLKQLEVEMAK 266
++ E+ + + L EI E+ E + + + LE +A
Sbjct: 832 RIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALAL 891
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+L E E +R LE + +L+ EEL + +L+L L+ + +
Sbjct: 892 LRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVR 937
Score = 43.9 bits (104), Expect = 9e-05
Identities = 23/98 (23%), Positives = 50/98 (51%), Gaps = 4/98 (4%)
Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK----QLEVEMAKRDQDLMEAQE 276
+ REE Q++ + + E+ ++ +E +E L+ +LE E+ + ++L
Sbjct: 236 ELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALAN 295
Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
I RLE+Q + L+ LE + EL+ +L+ + ++ +
Sbjct: 296 EISRLEQQKQILRERLANLERQLEELEAQLEELESKLD 333
Score = 43.5 bits (103), Expect = 1e-04
Identities = 22/103 (21%), Positives = 56/103 (54%), Gaps = 4/103 (3%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIA----AREKAEKKHQESVERLKQLEVEMAKRDQ 269
++Q K RE ++Q ++L+ ++ ++ ++ E E+L++L+ E+ +
Sbjct: 299 RLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEA 358
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+L E + + LE +L++L+ E L ++ +L+L++ + N+
Sbjct: 359 ELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401
Score = 42.0 bits (99), Expect = 3e-04
Identities = 27/97 (27%), Positives = 56/97 (57%), Gaps = 3/97 (3%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
++++ Q R+E +Q L+ ++A + E + ++ ER+ QL E+ + + ++ E
Sbjct: 709 ELEEELEQLRKELEELSRQISALRKDLA---RLEAEVEQLEERIAQLSKELTELEAEIEE 765
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
+E + EE+L + +A EELEA+ +L+ EL+ +R
Sbjct: 766 LEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR 802
Score = 41.6 bits (98), Expect = 5e-04
Identities = 20/97 (20%), Positives = 46/97 (47%), Gaps = 4/97 (4%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAA----REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
A+ E +++ +L E+ E+ E++ +E+ E L + E E+ + + + + +
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795
Query: 276 EMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E ++ L E L +L+A L L+ L+ + +
Sbjct: 796 EELKALREALDELRAELTLLNEEAANLRERLESLERR 832
Score = 41.6 bits (98), Expect = 5e-04
Identities = 19/114 (16%), Positives = 54/114 (47%), Gaps = 4/114 (3%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREK----AEKKHQESVERLKQL 260
+ + + + ++ + E +++ ++L+LE++ E+ +K+ + +L
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300
Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
E + + L + + LE QL++L++ +EL EL+ +L+ ++ +
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELE 354
Score = 40.8 bits (96), Expect = 7e-04
Identities = 24/106 (22%), Positives = 53/106 (50%), Gaps = 11/106 (10%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAR----EKAEKKHQESVERLKQLEVEM--AKR 267
+++ + EE +++ ++L+ E+ + E+ E + +E RL++LE ++ +
Sbjct: 327 ELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRS 386
Query: 268 DQDLMEAQEM-----IRRLEEQLKQLQAAKEELEARQTELQLELQP 308
+E Q I RLE +L++L+ +E L+ EL +L+
Sbjct: 387 KVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEE 432
Score = 40.4 bits (95), Expect = 0.001
Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 1/87 (1%)
Query: 222 AREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
E + ++ K +A + E++ Q ERL LE ++ + + L E + +
Sbjct: 275 EVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
Query: 281 LEEQLKQLQAAKEELEARQTELQLELQ 307
L E+L +L+ EEL+ L+ EL+
Sbjct: 335 LAEELAELEEKLEELKEELESLEAELE 361
Score = 40.4 bits (95), Expect = 0.001
Identities = 26/100 (26%), Positives = 52/100 (52%), Gaps = 7/100 (7%)
Query: 215 VQQMKAQAR--EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
++ ++ QA E + + +L+L + E + E L++L+ E+ + +++L
Sbjct: 202 LKSLERQAEKAERYKELKAELRELELALLVLRLEELR-----EELEELQEELKEAEEELE 256
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E ++ LEE+L++L+ ELE ELQ EL + N+
Sbjct: 257 ELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANE 296
Score = 40.0 bits (94), Expect = 0.001
Identities = 18/89 (20%), Positives = 42/89 (47%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
+A+ R++L+ E++ ++ E++++L ++ ++ E +E+I
Sbjct: 809 RAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELI 868
Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQ 307
LE +L+ L + LE L+ EL+
Sbjct: 869 EELESELEALLNERASLEEALALLRSELE 897
Score = 40.0 bits (94), Expect = 0.001
Identities = 24/122 (19%), Positives = 61/122 (50%), Gaps = 4/122 (3%)
Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK- 258
E + + ++++ A+ R+E +++ ++L+ E+ + ++ + RL+
Sbjct: 681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEA 740
Query: 259 ---QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
QLE +A+ ++L E + I LEE+L++ + E EA EL+ +++ ++ +
Sbjct: 741 EVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKA 800
Query: 316 NK 317
+
Sbjct: 801 LR 802
Score = 40.0 bits (94), Expect = 0.002
Identities = 24/103 (23%), Positives = 51/103 (49%), Gaps = 1/103 (0%)
Query: 206 RRRKPD-TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
R+ + + + ++ + +Q +++ + E + +E ERL++ E E+
Sbjct: 718 RKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEEL 777
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
A+ + ++ E + I +L+E+LK L+ A +EL A T L E
Sbjct: 778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820
Score = 38.9 bits (91), Expect = 0.004
Identities = 25/92 (27%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
+ + +R++LQ EI K ++ + LK+L+ E+ + +++L E QE + R
Sbjct: 404 RLEARLERLEDRRERLQQEIEELLKKLEEAE-----LKELQAELEELEEELEELQEELER 458
Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
LEE L++L+ EE E + EL ++ +
Sbjct: 459 LEEALEELREELEEAEQALDAAERELAQLQAR 490
Score = 37.0 bits (86), Expect = 0.012
Identities = 20/95 (21%), Positives = 41/95 (43%), Gaps = 4/95 (4%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKA----EKKHQESVERLKQLEVEMAKRDQDL 271
++ A+A E + Q ++L+ E+ A +A + E L + ++ +
Sbjct: 774 EEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRI 833
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
+ + LEEQ+++L E L A EL+ +
Sbjct: 834 AATERRLEDLEEQIEELSEDIESLAAEIEELEELI 868
Score = 37.0 bits (86), Expect = 0.014
Identities = 22/92 (23%), Positives = 48/92 (52%), Gaps = 2/92 (2%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ--ESVERLKQLEVEMAKRDQDL 271
++ + + ++E ++ ++ +L+ E E + + E E L++LE + + ++L
Sbjct: 411 RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL 470
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
EA++ + E +L QLQA + LE Q L+
Sbjct: 471 EEAEQALDAAERELAQLQARLDSLERLQENLE 502
Score = 34.6 bits (80), Expect = 0.068
Identities = 20/88 (22%), Positives = 38/88 (43%), Gaps = 4/88 (4%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK----QLEVEMAKRDQDLMEAQ 275
E ++ ++L+ E+ A +E++ L+ +L E+ + + E +
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914
Query: 276 EMIRRLEEQLKQLQAAKEELEARQTELQ 303
+ L E+L QL+ E LE R LQ
Sbjct: 915 RELEELREKLAQLELRLEGLEVRIDNLQ 942
Score = 34.6 bits (80), Expect = 0.075
Identities = 24/93 (25%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL----KQLEVEMAKRDQDLMEA 274
++ EE Q++ L EI+ E+ ++ +E + L ++LE ++ + + L E
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335
Query: 275 QEMIRRLEEQLKQLQ----AAKEELEARQTELQ 303
E + LEE+L++L+ + + ELE + EL+
Sbjct: 336 AEELAELEEKLEELKEELESLEAELEELEAELE 368
Score = 34.3 bits (79), Expect = 0.087
Identities = 24/109 (22%), Positives = 45/109 (41%), Gaps = 12/109 (11%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ------ 269
++++++ E + + R K+ + + RL++LE + Q
Sbjct: 368 EELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELL 427
Query: 270 ------DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+L E Q + LEE+L++LQ E LE EL+ EL+
Sbjct: 428 KKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476
Score = 33.9 bits (78), Expect = 0.14
Identities = 25/113 (22%), Positives = 52/113 (46%), Gaps = 16/113 (14%)
Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAA----REKAEKKHQESVER 256
E + + + ++ + + +++ +++ ++L+ ++A E E + ER
Sbjct: 885 LEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQER 944
Query: 257 LK---QLEVEMA-----KRDQDLMEAQEMIRRLEEQLKQL----QAAKEELEA 297
L L +E A K + D EA+ ++RLE ++K+L AA EE E
Sbjct: 945 LSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLAAIEEYEE 997
Score = 28.1 bits (63), Expect = 8.7
Identities = 16/44 (36%), Positives = 27/44 (61%)
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
R +++ E +E I LEE++ +L+ A EL EL+ EL+ +R
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLR 718
>gnl|CDD|241345 cd13191, FERM_C_FRMD4A_FRMD4B, FERM domain-containing protein 4A
and 4B (FRMD4A and 4B) FERM domain C-lobe. FRMD4A is
part of the Par-3/FRMD4A/cytohesin-1 complex that
activates Arf6, a central player in actin cytoskeleton
dynamics and membrane trafficking, during junctional
remodeling and epithelial polarization. The
Par-3/Par-6/aPKC/Cdc42 complex regulates the conversion
of primordial adherens junctions (AJs) into belt-like
AJs and the formation of linear actin cables. When
primordial AJs are formed, Par-3 recruits scaffolding
protein FRMD4A which connects Par-3 and the Arf6
guanine-nucleotide exchange factor (GEF), cytohesin-1.
FRMD4B (also called GRP1-binding protein, GRSP1) is a
novel member of GRP1 signaling complexes that are
recruited to plasma membrane ruffles in response to
insulin receptor signaling. The GRSP1/FRMD4B protein
contains a FERM protein domain as well as two coiled
coil domains and may function as a scaffolding protein.
GRP1 and GRSP1 interact through the coiled coil domains
in the two proteins. The FERM domain has a cloverleaf
tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3
within the FERM domain is part of the PH domain family.
The FERM domain is found in the cytoskeletal-associated
proteins such as ezrin, moesin, radixin, 4.1R, and
merlin. These proteins provide a link between the
membrane and cytoskeleton and are involved in signal
transduction pathways. The FERM domain is also found in
protein tyrosine phosphatases (PTPs) , the tyrosine
kinases FAK and JAK, in addition to other proteins
involved in signaling. This domain is structurally
similar to the PH and PTB domains and consequently is
capable of binding to both peptides and phospholipids at
different sites.
Length = 115
Score = 43.1 bits (102), Expect = 2e-05
Identities = 32/173 (18%), Positives = 57/173 (32%), Gaps = 75/173 (43%)
Query: 50 EMYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPLQFKFRAKFYP 109
YGV+Y+E+K+K G WLG+ G+ Y+ +D K P K +
Sbjct: 1 PTYGVHYYEVKDKNGIPWWLGLSYKGIGQYDLQD-----------KVKP------RKLFQ 43
Query: 110 EDVVEEIIQDITLRLFYLQVSRSAGSRVRFPPGPNCLLTPKIGFPWSEIRNISFNDRKFI 169
W ++ N+ F DRKF
Sbjct: 44 ---------------------------------------------WKQLENLYFRDRKFS 58
Query: 170 IKPID-KKAPD------------FVFFAPRVRVNKRILALCMGNHELYMRRRK 209
I+ D ++ V++ + K I ++ + H+ Y+ R++
Sbjct: 59 IEVHDPRRISVSRRTFGQSGVAVHVWYGNTPSLIKSIWSMAISQHQFYLDRKQ 111
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 44.6 bits (106), Expect = 2e-05
Identities = 29/126 (23%), Positives = 57/126 (45%), Gaps = 9/126 (7%)
Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAARE-KAEKKHQ--------E 252
E +R + + +Q AQA + +++ ++ Q E KAE Q E
Sbjct: 30 EQAIRDMESELAKARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLARE 89
Query: 253 SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
++E + LE + +L +A+E + +L++QL L+ EL A++ L+ + Q
Sbjct: 90 ALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAKKEALKARKAAAKAQ 149
Query: 313 SNLNKV 318
+N+
Sbjct: 150 EKVNRS 155
>gnl|CDD|241339 cd13185, FERM_C_FRMD1_FRMD6, FERM domain containing 1 and 6
proteins FERM domain C-lobe. FRMD6 (also called willin
and hEx/human expanded) is localized throughout the
cytoplasm or along the plasma membrane The Drosophilla
protein Ex is a regulator of the Hippo/SWH (Sav/Wts/Hpo)
signaling pathway, a signaling pathway that plays a
pivotal role in organ size control and is tumor
suppression by restricting proliferation and promoting
apoptosis. Surprisingly, hEx is thought to function
independently of the Hippo pathway. Instead it is
hypothesized that hEx inhibits progression through the S
phase of the cell cycle by upregulating p21(Cip1) and
downregulating Cyclin A. It is also implicated in the
progression of Alzheimer disease. Not much is known
about FRMD1 to date. Both FRMD1 and FRMD6 contains a
single FERM domain which has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe is
a member of the PH superfamily. The FERM domain is found
in the cytoskeletal-associated proteins such as ezrin,
moesin, radixin, 4.1R, and merlin. These proteins
provide a link between the membrane and cytoskeleton and
are involved in signal transduction pathways. The FERM
domain is also found in protein tyrosine phosphatases
(PTPs) , the tyrosine kinases FAK and JAK, in addition
to other proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 109
Score = 42.7 bits (101), Expect = 2e-05
Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 4/67 (5%)
Query: 141 PGPNCLLTPKIGFPWSEIRNISFNDRKFIIKPIDKKAPD-FVFFAPRVRVNKRILALCMG 199
G LL FPWS I +SF+ +KF I+P + ++ +K +LALC
Sbjct: 37 DGEKQLLRT---FPWSNIGKLSFDRKKFEIRPEGLPSLRKLTYYTGSDEKSKYLLALCRE 93
Query: 200 NHELYMR 206
H+ M
Sbjct: 94 THQFSMA 100
>gnl|CDD|241349 cd13195, FERM_C_MYLIP_IDOL, The E3 ubiquitin ligase myosin
regulatory light chain-interacting protein (MYLIP; also
called inducible degrader of the LDL receptor, IDOL)
FERM domain C-lobe. MYLIP/IDOL is a regulator of the
LDL receptor (LDLR) pathway via the nuclear receptor
liver X receptor (LXR). In response to cellular
cholesterol loading, the activation of LXR leads to the
induction of MYLIP expression. MYLIP stimulates
ubiquitination of the LDLR on its cytoplasmic tail,
directing its degradation. The LXR-MYLIP-LDLR pathway
provides a complementary pathway to sterol regulatory
element-binding proteins for the feedback inhibition of
cholesterol uptake. MYLIP has an N-terminal FERM domain
and in some cases a C-terminal RING domain. The FERM
domain has a cloverleaf tripart structure (FERM_N,
FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe,
C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM
domain is part of the PH domain family. The FERM domain
is found in the cytoskeletal-associated proteins such
as ezrin, moesin, radixin, 4.1R, and merlin. These
proteins provide a link between the membrane and
cytoskeleton and are involved in signal transduction
pathways. The FERM domain is also found in protein
tyrosine phosphatases (PTPs) , the tyrosine kinases FAK
and JAK, in addition to other proteins involved in
signaling. This domain is structurally similar to the
PH and PTB domains and consequently is capable of
binding to both peptides and phospholipids at different
sites.
Length = 131
Score = 42.6 bits (101), Expect = 3e-05
Identities = 18/56 (32%), Positives = 27/56 (48%)
Query: 32 GMLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTN 87
GM A +LK +LE YGV F ++ +G +L +GV G+ I + L
Sbjct: 1 GMSPSTAEYLFLKEVSELENYGVETFHVRTIEGQKLVIGVGPDGIRICNDDGELIQ 56
>gnl|CDD|241356 cd13202, FERM_C_MyoX, Myosin X (MyoX, Myo10) FERM domain C-lobe.
MyoX, a MyTH-FERM myosin, is a molecular motor that has
crucial functions in the transport and/or tethering of
integrins in the actin-based extensions known as
filopodia, microtubule binding, and in netrin-mediated
axon guidance. It functions as a dimer. MyoX walks on
bundles of actin, rather than single filaments, unlike
the other unconventional myosins. MyoX is present in
organisms ranging from humans to choanoflagellates, but
not in Drosophila and Caenorhabditis elegans.MyoX
consists of a N-terminal motor/head region, a neck made
of 3 IQ motifs, and a tail consisting of a coiled-coil
domain, a PEST region, 3 PH domains, a myosin tail
homology 4 (MyTH4), and a FERM domain at its very
C-terminus. The MyoX FERM domain binds to the NPXY
motif of several beta-integrins, a key family of cell
surface receptors that are involved in cell adhesion
and migration. In addition the FERM domain binds to the
cytoplasmic domains of the netrin receptors DCC
(deleted in colorectal cancer) and neogenin. The FERM
domain also forms a supramodule with its MyTH4 domain
which binds to the negatively charged E-hook region in
the tails of alpha- and beta-tubulin forming a proposed
motorized link between actin filaments and
microtubules. The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a
link between the membrane and cytoskeleton and are
involved in signal transduction pathways. The FERM
domain is also found in protein tyrosine phosphatases
(PTPs) , the tyrosine kinases FAK and JAK, in addition
to other proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 111
Score = 42.3 bits (100), Expect = 3e-05
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 4/51 (7%)
Query: 33 MLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGT----ELWLGVDALGLNIY 79
M +EDAM +Y+ I ++ YG FE++ K ELWLGV A G+++Y
Sbjct: 1 MSQEDAMAKYMAIIREWPGYGSTLFEVECNKEGGFPQELWLGVSAKGVSLY 51
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 44.7 bits (106), Expect = 5e-05
Identities = 26/104 (25%), Positives = 55/104 (52%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+++++ Q E + ++ + +L E+ + + +E E L++LE E+ + + L E
Sbjct: 703 LLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEE 762
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
+E + LEE L +L+ EELE ++ LQ EL+ + + +
Sbjct: 763 LEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAE 806
Score = 44.7 bits (106), Expect = 5e-05
Identities = 26/107 (24%), Positives = 63/107 (58%), Gaps = 4/107 (3%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAR----EKAEKKHQESVERLKQLEVEMAKRDQD 270
+++++ QA + + ++ + + +LE+A ++ K+ +E E L +LE E+ + ++
Sbjct: 202 LEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEE 261
Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
L EA++ I L+ +L++L+ EEL+ EL+ E++ + + +L +
Sbjct: 262 LEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308
Score = 43.2 bits (102), Expect = 2e-04
Identities = 31/103 (30%), Positives = 57/103 (55%), Gaps = 4/103 (3%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREK----AEKKHQESVERLKQLEVEMAKRDQ 269
+ ++ E + +Q+R++L+ EI E+ E+K E E L++LE E+ + +
Sbjct: 804 EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKE 863
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+L E + LE++LK+L+ KEELE EL+ EL ++ +
Sbjct: 864 ELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEE 906
Score = 42.8 bits (101), Expect = 2e-04
Identities = 22/94 (23%), Positives = 44/94 (46%), Gaps = 4/94 (4%)
Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAARE----KAEKKHQESVERLKQLEVEMAKRDQDLM 272
+ REE Q++ +L+ EI E ++ +E L++LE + + + +
Sbjct: 274 SELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIE 333
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
+E + E L++L+ ELE + EL+ +L
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEELEEKL 367
Score = 42.4 bits (100), Expect = 2e-04
Identities = 29/100 (29%), Positives = 54/100 (54%), Gaps = 3/100 (3%)
Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
+++ A+ R E +++ + L+ E+ ++ ++ E LK+LE E+ + +L
Sbjct: 386 AELEAELAEIRNELEELKREIESLEER---LERLSERLEDLKEELKELEAELEELQTELE 442
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E E + LEEQL++L+ +ELE ELQ ELQ + +
Sbjct: 443 ELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKE 482
Score = 42.4 bits (100), Expect = 3e-04
Identities = 26/94 (27%), Positives = 52/94 (55%), Gaps = 3/94 (3%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
+ EE + +++ ++LQ E+AEK+ +E L++L E+ + ++L+E +E I
Sbjct: 241 LEELEEELSRLEEELEELQ---EELEEAEKEIEELKSELEELREELEELQEELLELKEEI 297
Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
LE ++ L+ EELE EL+ L+ ++ +
Sbjct: 298 EELEGEISLLRERLEELENELEELEERLEELKEK 331
Score = 42.0 bits (99), Expect = 3e-04
Identities = 23/101 (22%), Positives = 58/101 (57%), Gaps = 3/101 (2%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
+ K + ++++ + +EE +++ ++ + + A E+ + ++ ERL+Q E+ +
Sbjct: 776 KLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQ---EIEE 832
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
++++ E +E + LEE+L++L+ EEL+ EL+ E +
Sbjct: 833 LEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE 873
Score = 40.9 bits (96), Expect = 8e-04
Identities = 23/98 (23%), Positives = 52/98 (53%), Gaps = 4/98 (4%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVE----RLKQLEVEMAKRDQ 269
++ + ++Q ++L+ ++ ++ +E +E RL++LE E+ + ++
Sbjct: 692 SLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEE 751
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+L E QE + LEE+L+ L+ A +L+ EL+ + Q
Sbjct: 752 ELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQ 789
Score = 40.5 bits (95), Expect = 0.001
Identities = 28/110 (25%), Positives = 56/110 (50%), Gaps = 11/110 (10%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIA-----------AREKAEKKHQESVERLKQLEV 262
++Q+ + EE + ++ KL+ EI E+ E++ +E+ RL LE
Sbjct: 755 ELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALER 814
Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E+ +Q ++ I LEE++++L+ +ELE EL+ EL+ ++ +
Sbjct: 815 ELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEE 864
Score = 40.5 bits (95), Expect = 0.001
Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 7/108 (6%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+++ + RE + + ++L+ + ++ + +E +E + L E+ + +L E
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEE 358
Query: 274 AQEMIRRLEEQLKQLQAAKEE----LEARQTELQLELQPIRNQSNLNK 317
A+E LEE+L L EE L EL+ EL IRN+ K
Sbjct: 359 AKE---ELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELK 403
Score = 38.5 bits (90), Expect = 0.004
Identities = 25/99 (25%), Positives = 61/99 (61%), Gaps = 3/99 (3%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
++++ +A +A +++ + L+ RE+ E++ +E E +++LE ++ + +++L E
Sbjct: 797 ELEEELEEAERRLDALERELESLE---QRRERLEQEIEELEEEIEELEEKLDELEEELEE 853
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
++ + L+E+L++L+A KEELE EL+ E + + +
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEE 892
Score = 38.2 bits (89), Expect = 0.006
Identities = 28/90 (31%), Positives = 50/90 (55%), Gaps = 10/90 (11%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
D+++ + E Q + ++L E+ E+ E++ +E +RLK+LE ++L E
Sbjct: 422 DLKEELKELEAELEELQTELEELNEEL---EELEEQLEELRDRLKELE-------RELAE 471
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
QE ++RLE++L L+A + LEA Q Q
Sbjct: 472 LQEELQRLEKELSSLEARLDRLEAEQRASQ 501
Score = 38.2 bits (89), Expect = 0.006
Identities = 28/105 (26%), Positives = 59/105 (56%), Gaps = 11/105 (10%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK----QLEVEMAKRDQDL 271
+Q + EE +++ D+L+ E+ EK ++ +E +E L+ +LE E+ + +++
Sbjct: 827 EQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEK 886
Query: 272 MEAQEMIR-------RLEEQLKQLQAAKEELEARQTELQLELQPI 309
E +E +R L+E++++L+ EELEA+ L++EL +
Sbjct: 887 EELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPEL 931
Score = 37.8 bits (88), Expect = 0.007
Identities = 23/112 (20%), Positives = 54/112 (48%), Gaps = 1/112 (0%)
Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL 260
++ + + + + + + + + ++ + +L E E+ + E L +L
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEEL-EELFEALREELAEL 388
Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E E+A+ +L E + I LEE+L++L E+L+ EL+ EL+ ++ +
Sbjct: 389 EAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTE 440
Score = 37.8 bits (88), Expect = 0.007
Identities = 30/122 (24%), Positives = 58/122 (47%)
Query: 191 KRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKH 250
K L E R+ + ++++K + + +Q + +L+ E+ E++
Sbjct: 694 KNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL 753
Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
+E ERL++LE E+ ++ L + +E I LEE+ + LQ EELE E + L +
Sbjct: 754 EELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALE 813
Query: 311 NQ 312
+
Sbjct: 814 RE 815
Score = 37.4 bits (87), Expect = 0.011
Identities = 28/101 (27%), Positives = 55/101 (54%), Gaps = 4/101 (3%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVE----RLKQLEVEMAKRDQDL 271
++ A+ +EE +++R LQ E+ E+ ++ + ++ L+ LE + +Q++
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E +E I LEE+L +L+ EELE EL+ EL+ + +
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE 871
Score = 35.1 bits (81), Expect = 0.046
Identities = 20/87 (22%), Positives = 43/87 (49%), Gaps = 1/87 (1%)
Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL-KQLEVEMAKRDQDLMEAQEMIRRL 281
+E +++ ++L+ E E K+ +E E L ++L ++ + E +++ RL
Sbjct: 855 EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERL 914
Query: 282 EEQLKQLQAAKEELEARQTELQLELQP 308
EE +L+ + EL + EL+ E +
Sbjct: 915 EELEAKLERLEVELPELEEELEEEYED 941
Score = 35.1 bits (81), Expect = 0.056
Identities = 26/120 (21%), Positives = 59/120 (49%), Gaps = 9/120 (7%)
Query: 202 ELYMRRRKPDTIDVQQMKAQAR-----EEKNAKQQQRDKLQLEIAAREKAEKKHQESVER 256
+L +R+ ++ + + Q+R EE +++ ++LQ + E+ + +E++ +
Sbjct: 717 QLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAK 776
Query: 257 LKQLEVEMAKRDQDLMEAQEM----IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
LK+ E+ ++ Q L E E + E +L L+ E LE R+ L+ E++ + +
Sbjct: 777 LKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEE 836
Score = 34.7 bits (80), Expect = 0.073
Identities = 18/99 (18%), Positives = 51/99 (51%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+++++K + + ++ ++L+ ++ E + +E L++L E+ + ++ L E
Sbjct: 398 ELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
++ ++ LE +L +LQ + LE + L+ L + +
Sbjct: 458 LRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAE 496
Score = 34.3 bits (79), Expect = 0.10
Identities = 25/103 (24%), Positives = 56/103 (54%), Gaps = 2/103 (1%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
+++++K + E + K++ D+L+ +E+ E++ +E L +L+ E+ K + L
Sbjct: 855 EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERL 914
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELE--ARQTELQLELQPIRNQ 312
E + + RLE +L +L+ EE +TEL+ E++ + +
Sbjct: 915 EELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEE 957
Score = 33.9 bits (78), Expect = 0.11
Identities = 24/97 (24%), Positives = 51/97 (52%), Gaps = 4/97 (4%)
Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM----EAQEM 277
+ E +++ +L+ ++ E+ K + + L+ L E+ ++ ++L E +
Sbjct: 665 QKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRE 724
Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
+ LEE+L+QLQ+ EELE EL+ EL+ ++ +
Sbjct: 725 LAALEEELEQLQSRLEELEEELEELEEELEELQERLE 761
Score = 33.9 bits (78), Expect = 0.13
Identities = 19/97 (19%), Positives = 51/97 (52%), Gaps = 7/97 (7%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK------- 266
++++++ + E + ++ + +L+ E+ +++ E E +++LE E++
Sbjct: 254 ELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEE 313
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
+ +L E +E + L+E+++ L+ EE E EL+
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEERETLLEELE 350
Score = 32.8 bits (75), Expect = 0.28
Identities = 18/100 (18%), Positives = 47/100 (47%), Gaps = 6/100 (6%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKA------EKKHQESVERLKQLEVEMAKRDQDLM 272
+ RE + + ++L++E+ E+ + E +++LE E+ +
Sbjct: 907 IEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNL 966
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
A E +EE+ ++L++ +E+LE + +L ++ + +
Sbjct: 967 RAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKE 1006
Score = 32.4 bits (74), Expect = 0.37
Identities = 24/104 (23%), Positives = 48/104 (46%), Gaps = 5/104 (4%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAA-----REKAEKKHQESVERLKQ 259
+ R+ +++Q+ A+ E K +++ L E+ RE+ + E E +
Sbjct: 339 LEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNE 398
Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
LE + + + + RLE+ ++L+ + ELE QTEL+
Sbjct: 399 LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELE 442
Score = 29.7 bits (67), Expect = 2.5
Identities = 17/62 (27%), Positives = 37/62 (59%)
Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
LK+LE E+A+ + L + +E ++ L+ +L+ L+ EEL + EL+ +L+ ++ +
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727
Query: 316 NK 317
+
Sbjct: 728 LE 729
Score = 28.1 bits (63), Expect = 7.9
Identities = 20/76 (26%), Positives = 41/76 (53%)
Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
+ +++ +E E L +LE ++ K +++L + +R LE+ L++L+ EELE +
Sbjct: 658 NKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQ 717
Query: 299 QTELQLELQPIRNQSN 314
EL+ EL + +
Sbjct: 718 LEELKRELAALEEELE 733
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 43.4 bits (102), Expect = 9e-05
Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 2/89 (2%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
VQQ ++ +AK+ ++ + + E E+ + K ERLKQLE E K + +A
Sbjct: 61 VQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQA 120
Query: 275 QEMIRRLEEQLKQL--QAAKEELEARQTE 301
+E ++ + + KQ QA K E ++
Sbjct: 121 EEAEKQAQLEQKQQEEQARKAAAEQKKKA 149
Score = 33.8 bits (77), Expect = 0.10
Identities = 15/88 (17%), Positives = 37/88 (42%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
++ + ++ A+ + K A +Q+R K + + + ++K E E+ QLE +
Sbjct: 75 KKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQ 134
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
+ A++ + + K A +
Sbjct: 135 EEQARKAAAEQKKKAEAAKAKAAAEAAK 162
Score = 31.8 bits (72), Expect = 0.41
Identities = 20/83 (24%), Positives = 41/83 (49%), Gaps = 3/83 (3%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
Q K + + ++ KQ + + Q ++ +++ E+ + + E+ K+ E AK EA
Sbjct: 105 QLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKA---AAEAA 161
Query: 276 EMIRRLEEQLKQLQAAKEELEAR 298
++ E + K +AAK EA+
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAK 184
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 43.0 bits (102), Expect = 2e-04
Identities = 29/96 (30%), Positives = 54/96 (56%), Gaps = 6/96 (6%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEI---AAREKAEKKHQESVERLKQLEVEMAKRDQD 270
+ + ++ARE + A +QQ ++LQ I AAR A Q+++ RL++ E + QD
Sbjct: 569 SLSESVSEARERRMALRQQLEQLQARIQRLAARAPAWLAAQDALARLREQSGEEFEDSQD 628
Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
+ E +++L E+ ++L ++EL AR+ L E+
Sbjct: 629 VTEY---MQQLLERERELTVERDELAARKQALDEEI 661
Score = 39.9 bits (94), Expect = 0.002
Identities = 18/92 (19%), Positives = 46/92 (50%), Gaps = 1/92 (1%)
Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
+ RE++ +Q +L++ ++ E+ ++ Q + L + + K D E +++
Sbjct: 504 RLREQR-HLAEQLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEE 562
Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
LE +L+ L + E R+ L+ +L+ ++ +
Sbjct: 563 LEARLESLSESVSEARERRMALRQQLEQLQAR 594
Score = 35.7 bits (83), Expect = 0.030
Identities = 18/88 (20%), Positives = 40/88 (45%), Gaps = 4/88 (4%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
+QQ++ + E + +QQ+ +L + K+ ++++ +LE + + L
Sbjct: 515 LQQLRMRLSELEQRLRQQQRAERL----LAEFCKRLGKNLDDEDELEQLQEELEARLESL 570
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTEL 302
E + E+ L+ E+L+AR L
Sbjct: 571 SESVSEARERRMALRQQLEQLQARIQRL 598
Score = 34.2 bits (79), Expect = 0.090
Identities = 25/113 (22%), Positives = 50/113 (44%), Gaps = 16/113 (14%)
Query: 216 QQMKAQAREE-KNAKQQQRDKLQLEIA---AREKAEKKHQESVERLKQLEV-------EM 264
+Q + +ARE+ + A+ Q Q+ + + + + QE + L+ L V E
Sbjct: 998 EQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSGAEER 1057
Query: 265 AKRDQD-----LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
A+ +D L + +LE+QL +A + L + +L+ + +R Q
Sbjct: 1058 ARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMREQ 1110
Score = 32.2 bits (74), Expect = 0.41
Identities = 33/124 (26%), Positives = 59/124 (47%), Gaps = 23/124 (18%)
Query: 206 RRRKPD-TIDVQQMKAQAREEKNAKQQQ----RDKLQLEIAAREKA-EKKHQESVERLKQ 259
RR + +++++ +R + A+Q + +L E+ E E+ +Q + + L
Sbjct: 281 RRVHLEEALELRRELYTSRRQLAAEQYRLVEMARELA-ELNEAESDLEQDYQAASDHL-N 338
Query: 260 LEVEMAKRDQ--------DLMEAQEMIRRLEEQLKQLQAAKE---ELEARQTELQLELQP 308
L V+ A R Q DL E +E RLEEQ + ++ A E E EAR + E+
Sbjct: 339 L-VQTALRQQEKIERYQADLEELEE---RLEEQNEVVEEADEQQEENEARAEAAEEEVDE 394
Query: 309 IRNQ 312
+++Q
Sbjct: 395 LKSQ 398
Score = 29.5 bits (67), Expect = 3.3
Identities = 23/89 (25%), Positives = 42/89 (47%), Gaps = 2/89 (2%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
++Q QA++ + +QQ L E+ R +A ++++ E L + K Q L +A
Sbjct: 940 LKQDYQQAQQTQRDAKQQAFALT-EVVQR-RAHFSYEDAAEMLAKNSDLNEKLRQRLEQA 997
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
++ R EQL+Q QA + L+
Sbjct: 998 EQERTRAREQLRQAQAQLAQYNQVLASLK 1026
Score = 28.4 bits (64), Expect = 5.8
Identities = 24/116 (20%), Positives = 48/116 (41%), Gaps = 16/116 (13%)
Query: 213 IDVQQMKA-QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
+DVQQ +A Q ++ A ++ + L + AE +E + ++ E+ +Q L
Sbjct: 406 LDVQQTRAIQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEELLSLEQKL 465
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELE-------ARQTELQLE--------LQPIRNQ 312
AQ + E+ + ++ E+ AR+ +L LQ +R +
Sbjct: 466 SVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQLQQLRMR 521
Score = 28.4 bits (64), Expect = 6.8
Identities = 18/101 (17%), Positives = 41/101 (40%), Gaps = 15/101 (14%)
Query: 222 AREEKNAK-QQQRDKLQLEIAAREKAEKKHQESVERLKQL-------------EVEMAKR 267
ARE++ + + +R++L A +K Q + + E E+ +
Sbjct: 783 AREKRIEQLRAEREELAERYATLSFDVQKLQRLHQAFSRFIGSHLAVAFEADPEAELRQL 842
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
++ +E + + E Q +Q ++ E+ + + L L P
Sbjct: 843 NRRRVELERALADHESQEQQQRSQLEQAKEGLSALN-RLLP 882
Score = 28.0 bits (63), Expect = 9.1
Identities = 24/107 (22%), Positives = 48/107 (44%), Gaps = 17/107 (15%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIR 279
A KN+ ++ + +LE A +E+ + E+L+Q + ++A+ +Q L +
Sbjct: 976 AAEMLAKNSDLNEKLRQRLEQAEQERTRAR-----EQLRQAQAQLAQYNQVLASLKSSYD 1030
Query: 280 RLEEQLKQLQ------------AAKEELEARQTELQLELQPIRNQSN 314
+ L++L+ A+E AR+ EL L R++ N
Sbjct: 1031 AKRQMLQELKQELQDLGVPADSGAEERARARRDELHARLSANRSRRN 1077
>gnl|CDD|241346 cd13192, FERM_C_FRMD3_FRMD5, FERM domain-containing protein 3 and
5 FERM (FRMD3 and 5) domain C-lobe. FRMD3 (also called
Band 4.1-like protein 4O/4.1O though it is not a true
member of that family) is a novel putative tumor
suppressor gene that is implicated in the origin and
progression of lung cancer. In humans there are 5
isoforms that are produced by alternative splicing.
Less is known about FRMD5, though there are 2 isoforms
of the human protein are produced by alternative
splicing. Both FRMD3 and FRMD5 contain a N-terminal
FERM domain, followed by a FERM adjacent (FA) domain,
and 4.1 protein C-terminal domain (CTD). The FERM
domain has a cloverleaf tripart structure (FERM_N,
FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe,
C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM
domain is part of the PH domain family. The FERM domain
is found in the cytoskeletal-associated proteins such
as ezrin, moesin, radixin, 4.1R, and merlin. These
proteins provide a link between the membrane and
cytoskeleton and are involved in signal transduction
pathways. The FERM domain is also found in protein
tyrosine phosphatases (PTPs) , the tyrosine kinases FAK
and JAK, in addition to other proteins involved in
signaling. This domain is structurally similar to the
PH and PTB domains and consequently is capable of
binding to both peptides and phospholipids at different
sites.
Length = 105
Score = 39.7 bits (93), Expect = 2e-04
Identities = 15/42 (35%), Positives = 26/42 (61%)
Query: 38 AMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIY 79
A + +L+ A LE YGV+ +K+ GT+L+LG G+ ++
Sbjct: 1 AELNFLRKAHTLETYGVDPHPVKDHYGTQLYLGFTHTGVVVF 42
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 42.6 bits (101), Expect = 2e-04
Identities = 38/120 (31%), Positives = 68/120 (56%), Gaps = 13/120 (10%)
Query: 207 RRKPD-TID-VQQMKAQAREEKNAKQQQRDKL-----QLEIAAREKAEK-KHQESVERLK 258
R PD ID + + A+ R E ++K ++ D+L QLEI RE +K K + S ERL+
Sbjct: 379 RFLPDKAIDLIDEAAARIRMEIDSKPEELDELDRRIIQLEIE-REALKKEKDEASKERLE 437
Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
LE E+A+ +++ + +E + + ++ +Q KEE+E +++LEL+ + +L K
Sbjct: 438 DLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIE----QVRLELEQAEREGDLAKA 493
Score = 39.2 bits (92), Expect = 0.003
Identities = 34/119 (28%), Positives = 61/119 (51%), Gaps = 28/119 (23%)
Query: 209 KPDTID-----VQQMK----AQAREEKNAKQQQRDKLQLEIAARE----------KAEK- 248
KP+ +D + Q++ A +E+ A +++ + L+ E+A E KAEK
Sbjct: 403 KPEELDELDRRIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKA 462
Query: 249 ------KHQESVERLKQLEVEMAKRDQDLMEAQE-MIRRLEEQLKQLQAAKEELEARQT 300
+ +E +E+++ LE+E A+R+ DL +A E +L E K+LQAA+ +L
Sbjct: 463 AIQGIQQIKEEIEQVR-LELEQAEREGDLAKAAELQYGKLPELEKRLQAAEAKLGEETK 520
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 42.0 bits (99), Expect = 4e-04
Identities = 24/105 (22%), Positives = 50/105 (47%), Gaps = 10/105 (9%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+ K +E Q+QR L+ +I + EK ++ L + + +++L E
Sbjct: 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEK----------EIENLNGKKEELEEELEE 872
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
+ +R LE +L L+ ++ELEA+ EL+ +++ + Q +
Sbjct: 873 LEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRK 917
Score = 41.6 bits (98), Expect = 4e-04
Identities = 24/104 (23%), Positives = 48/104 (46%), Gaps = 11/104 (10%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEI-----------AAREKAEKKHQESVERLKQLEV 262
D ++ + + E N +++ D+LQ E+ AA E K E E + +
Sbjct: 389 DYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKAL 448
Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
E+ K++ L + + + E++L L+ + +E ++LQ EL
Sbjct: 449 EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQREL 492
Score = 40.1 bits (94), Expect = 0.001
Identities = 19/99 (19%), Positives = 47/99 (47%), Gaps = 4/99 (4%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREK----AEKKHQESVERLKQLEVEMAKRDQ 269
+Q + + Q+ +I EK E++ ++ ERL++LE +++ +Q
Sbjct: 692 SLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQ 751
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
++ + ++ LE ++++L+ +LE +L+ L
Sbjct: 752 EIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790
Score = 38.9 bits (91), Expect = 0.004
Identities = 16/109 (14%), Positives = 40/109 (36%), Gaps = 16/109 (14%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAARE----------------KAEKKHQESVERL 257
+V+ + ++ KL+ + E K E++ RL
Sbjct: 755 NVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARL 814
Query: 258 KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
+++E ++ + + ++ I+ L+EQ L+ + +E L +
Sbjct: 815 REIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKK 863
Score = 38.1 bits (89), Expect = 0.006
Identities = 27/104 (25%), Positives = 54/104 (51%), Gaps = 10/104 (9%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL--EVEMAKRDQD- 270
+ ++ + + EE +++ + L+ E+ E+ +K+ E+ + LK ++E KR+ +
Sbjct: 347 EERKRRDKLTEEYAELKEELEDLRAEL---EEVDKEFAETRDELKDYREKLEKLKREINE 403
Query: 271 ----LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
L QE ++RL E+L L AA +EA+ EL+ E +
Sbjct: 404 LKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKA 447
Score = 37.4 bits (87), Expect = 0.011
Identities = 21/96 (21%), Positives = 48/96 (50%), Gaps = 3/96 (3%)
Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE 276
++K EE+ +++ +L+ EI E+ E L+ E +AK + ++ +
Sbjct: 280 KIKDLGEEEQLRVKEKIGELEAEI---ASLERSIAEKERELEDAEERLAKLEAEIDKLLA 336
Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
I LE ++++ + +++L EL+ EL+ +R +
Sbjct: 337 EIEELEREIEEERKRRDKLTEEYAELKEELEDLRAE 372
Score = 37.0 bits (86), Expect = 0.013
Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 3/101 (2%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+++ + + E + + D+L A +K E + ++QLE E K + L E
Sbjct: 685 GLKRELSSLQSELRRIENRLDELS---QELSDASRKIGEIEKEIEQLEQEEEKLKERLEE 741
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
+E + LE++++ +++ +ELEAR EL+ +L + N
Sbjct: 742 LEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALN 782
Score = 36.6 bits (85), Expect = 0.019
Identities = 24/116 (20%), Positives = 54/116 (46%), Gaps = 2/116 (1%)
Query: 186 RVRVNKRILALCMGNHELYMRRRKPDTI--DVQQMKAQAREEKNAKQQQRDKLQLEIAAR 243
R + L E+ +R+ D + ++Q++ + + A K+ +
Sbjct: 384 RDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEK 443
Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
E + ++ +L+QL +++K +Q+L + +E R+E++L +LQ E EA+
Sbjct: 444 EDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQA 499
Score = 34.3 bits (79), Expect = 0.083
Identities = 18/93 (19%), Positives = 43/93 (46%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIR 279
A+ E + RD+L+ EK +++ E L +L+ E+ + ++L + I
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430
Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+E ++ +L+ KE+ + + +L+ +
Sbjct: 431 GIEAKINELEEEKEDKALEIKKQEWKLEQLAAD 463
Score = 32.3 bits (74), Expect = 0.39
Identities = 20/97 (20%), Positives = 42/97 (43%), Gaps = 1/97 (1%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIA-AREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
K +A EE ++ ++L L I R++ E+ +E + + + KR+ + E +
Sbjct: 172 KEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKE 231
Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
LE Q + ++ LE +L E+ + +
Sbjct: 232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLE 268
Score = 32.3 bits (74), Expect = 0.43
Identities = 27/123 (21%), Positives = 58/123 (47%), Gaps = 8/123 (6%)
Query: 194 LALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAA----REKAEKK 249
+A + R + D ++ A+ E + + ++L+ EI R+K ++
Sbjct: 303 IASLERSIAEKERELE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEE 358
Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPI 309
+ E E L+ L E+ + D++ E ++ ++ E+L++L+ EL+ LQ ELQ +
Sbjct: 359 YAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRL 418
Query: 310 RNQ 312
+
Sbjct: 419 SEE 421
Score = 32.0 bits (73), Expect = 0.47
Identities = 18/84 (21%), Positives = 39/84 (46%), Gaps = 5/84 (5%)
Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
+E + + +L+ I E+ K +E++ L+ + E Q + +LEE
Sbjct: 751 QEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL-----SHSRIPEIQAELSKLEE 805
Query: 284 QLKQLQAAKEELEARQTELQLELQ 307
++ +++A E+E + L LE +
Sbjct: 806 EVSRIEARLREIEQKLNRLTLEKE 829
Score = 30.0 bits (68), Expect = 1.8
Identities = 15/98 (15%), Positives = 41/98 (41%), Gaps = 5/98 (5%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREK-----AEKKHQESVERLKQLEVEMAKRDQDLMEA 274
A EE ++ +L+ + E+ +K E +++ ++ + + ++
Sbjct: 247 ASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASL 306
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+ I E +L+ + +LEA +L E++ + +
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELERE 344
Score = 30.0 bits (68), Expect = 1.9
Identities = 21/80 (26%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 225 EKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL-EE 283
EK A ++Q++ ++ ++A+ E++ ++ E + +LE + + +Q L E + I+ L EE
Sbjct: 231 EKEALERQKEAIERQLAS---LEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEE 287
Query: 284 QLKQLQAAKEELEARQTELQ 303
+ +++ ELEA L+
Sbjct: 288 EQLRVKEKIGELEAEIASLE 307
Score = 29.3 bits (66), Expect = 3.6
Identities = 20/103 (19%), Positives = 49/103 (47%), Gaps = 13/103 (12%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE------VEMAKR 267
+ +++AQ RE + ++ +++ + + + K + E L ++E E+ +
Sbjct: 890 ERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE 949
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAK-------EELEARQTELQ 303
+ L + Q ++R+EE+++ L+ EE+ R EL+
Sbjct: 950 ELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELK 992
Score = 28.9 bits (65), Expect = 4.9
Identities = 18/80 (22%), Positives = 39/80 (48%), Gaps = 8/80 (10%)
Query: 231 QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
+R K+ EIA + ++K ++++E L+++E + +I +QL++L+
Sbjct: 153 VERRKIIDEIAGVAEFDRKKEKALEELEEVEENIE-------RLDLIIDEKRQQLERLRR 205
Query: 291 AKEELEARQTELQLELQPIR 310
+E+ E R L E +
Sbjct: 206 EREKAE-RYQALLKEKREYE 224
Score = 28.5 bits (64), Expect = 6.6
Identities = 16/88 (18%), Positives = 42/88 (47%), Gaps = 3/88 (3%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIR 279
+ RE +++ LQ E+ E + + + + ++ +++ + +E +E
Sbjct: 677 QRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR-KIGEIEKEIEQLEQEE--E 733
Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQ 307
+L+E+L++L+ LE ++ EL+
Sbjct: 734 KLKERLEELEEDLSSLEQEIENVKSELK 761
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 40.8 bits (96), Expect = 4e-04
Identities = 23/120 (19%), Positives = 55/120 (45%), Gaps = 9/120 (7%)
Query: 204 YMRRRKPDTIDVQQMKAQ---AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL 260
R K +++ KA+ + A + + + L+ +++ E ++ +E ++R ++
Sbjct: 22 LEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEK 81
Query: 261 ------EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
E E+ + ++ A+E I LE++L +L E+LE +L+ L+ +
Sbjct: 82 LSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLA 141
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 41.7 bits (98), Expect = 4e-04
Identities = 24/100 (24%), Positives = 51/100 (51%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
A EE K ++ + R + E++ +E +E L+QLE E+ + ++L E + +
Sbjct: 652 LQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKL 711
Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
+E+ +++L++ K ELE + EL+ + + L +
Sbjct: 712 GEIEQLIEELESRKAELEELKKELEKLEKALELLEELREK 751
Score = 37.8 bits (88), Expect = 0.006
Identities = 20/107 (18%), Positives = 56/107 (52%)
Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
+ + K ++++ ++ +A + + + ++ ++L E+ + + ++ + +E L++
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305
Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
+ ++L E E ++ LEE+L++L+ E+LE+ EL E
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEK 352
Score = 37.8 bits (88), Expect = 0.006
Identities = 27/111 (24%), Positives = 57/111 (51%), Gaps = 6/111 (5%)
Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAKQQ----QRDKLQLEIAAREKAEKKHQESVER 256
EL R + ++++++ +K + E + ++ + +LE RE E + + E
Sbjct: 252 EELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEEL--EG 309
Query: 257 LKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
L+ L E+ + + L +E + +LEE+L++L++ EEL + EL L+
Sbjct: 310 LRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLE 360
Score = 36.3 bits (84), Expect = 0.021
Identities = 27/90 (30%), Positives = 51/90 (56%), Gaps = 1/90 (1%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQDLM 272
++ + EEKN + ++ E+ R E+ EK+ ++++ERLKQLE + + ++L
Sbjct: 340 KLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELA 399
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
E + ++E+L++L+ EELE EL
Sbjct: 400 ELSAALEEIQEELEELEKELEELERELEEL 429
Score = 35.9 bits (83), Expect = 0.028
Identities = 24/108 (22%), Positives = 50/108 (46%), Gaps = 11/108 (10%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE-----------M 264
++ R ++ +KL+ EK E+K ++ L++L E +
Sbjct: 304 EEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERL 363
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+ ++ L E ++ + + E+LKQL+ A +EL+ EL L+ I+ +
Sbjct: 364 KELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEE 411
Score = 35.5 bits (82), Expect = 0.033
Identities = 24/85 (28%), Positives = 45/85 (52%)
Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLE 282
EEK ++ + +L + +A K +E + L++L E+ ++ + L E + I LE
Sbjct: 245 EEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELE 304
Query: 283 EQLKQLQAAKEELEARQTELQLELQ 307
E+L+ L+A EELE +L+ +
Sbjct: 305 EELEGLRALLEELEELLEKLKSLEE 329
Score = 35.5 bits (82), Expect = 0.038
Identities = 26/102 (25%), Positives = 56/102 (54%), Gaps = 3/102 (2%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA---KRDQDLM 272
+++KA+ E ++ + + + E+ E+ ++ +E +ERL++LE E+ + + L
Sbjct: 252 EELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLR 311
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
E + L E+LK L+ E+LE + +L+ EL+ + + N
Sbjct: 312 ALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN 353
Score = 34.7 bits (80), Expect = 0.066
Identities = 26/95 (27%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVE-RLKQLEVEMAKRDQDLM 272
+++ + E+ + + ++L E K ++ + +E RL++LE E+ K + L
Sbjct: 326 SLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLK 385
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+ +E I+ L+E+L +L AA EE++ EL+ EL+
Sbjct: 386 QLEEAIQELKEELAELSAALEEIQEELEELEKELE 420
Score = 34.0 bits (78), Expect = 0.11
Identities = 23/97 (23%), Positives = 49/97 (50%), Gaps = 2/97 (2%)
Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLE 282
++ + +Q + E+ + + + E L++L + + + L E +E + +LE
Sbjct: 554 LQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE 613
Query: 283 EQLKQLQA--AKEELEARQTELQLELQPIRNQSNLNK 317
E L+ L+ A+ ELE + EL+ EL+ + Q+ L +
Sbjct: 614 ELLQSLELSEAENELEEAEEELESELEKLNLQAELEE 650
Score = 32.0 bits (73), Expect = 0.47
Identities = 23/83 (27%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA----QEMIRRLEEQ 284
++Q+ ++L+ EI A E+ + +E ERL++L+ + + + +EA +E +R LE
Sbjct: 223 EEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERL 282
Query: 285 LKQLQAAKEELEARQTELQLELQ 307
L++L+ E LE + E++ +
Sbjct: 283 LEELEEKIERLEELEREIEELEE 305
Score = 32.0 bits (73), Expect = 0.47
Identities = 21/91 (23%), Positives = 46/91 (50%), Gaps = 1/91 (1%)
Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
E Q + +L AA E+ E+K +E +++ + +Q + +E + +L
Sbjct: 635 LESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEE-LEQL 693
Query: 282 EEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
EE+L+QL+ EEL + E++ ++ + ++
Sbjct: 694 EEELEQLREELEELLKKLGEIEQLIEELESR 724
Score = 31.7 bits (72), Expect = 0.68
Identities = 20/84 (23%), Positives = 44/84 (52%)
Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
+E + + ++L+ E++ ++ + +E E K+L + + L + + L
Sbjct: 472 EKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEEL 531
Query: 282 EEQLKQLQAAKEELEARQTELQLE 305
EE+L++L+ EELE + +LQL+
Sbjct: 532 EEKLEKLENLLEELEELKEKLQLQ 555
Score = 31.3 bits (71), Expect = 0.87
Identities = 25/111 (22%), Positives = 57/111 (51%), Gaps = 4/111 (3%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
++++ + + + E + + + +KL L+ E + +E E++++LE E+ +
Sbjct: 613 EELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRR 672
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
Q + E +LEE+L++L+ +EELE + EL+ L+ + L +
Sbjct: 673 ELQRI----ENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIE 719
Score = 30.9 bits (70), Expect = 0.94
Identities = 20/92 (21%), Positives = 47/92 (51%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
R K +++++ + + +Q + +Q + +E E L++LE E+
Sbjct: 361 ERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELE 420
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
+ +++L E +E I++LEEQ+ QL++ + +
Sbjct: 421 ELERELEELEEEIKKLEEQINQLESKELMIAE 452
Score = 30.1 bits (68), Expect = 2.0
Identities = 14/90 (15%), Positives = 43/90 (47%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+++ ++ + E + K++ + + + + E L++LE + + ++ +
Sbjct: 233 EIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIER 292
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
+E+ R +EE ++L+ + LE + L+
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEELLE 322
Score = 29.7 bits (67), Expect = 2.1
Identities = 19/101 (18%), Positives = 43/101 (42%), Gaps = 1/101 (0%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
R + + + + + R + + Q +L E+AE++ + +E+L
Sbjct: 592 RERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEEL 651
Query: 267 RDQDLMEAQEMIRRLEEQLKQ-LQAAKEELEARQTELQLEL 306
L E +E + LE ++++ LQ + E + + +LE
Sbjct: 652 LQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQ 692
Score = 29.0 bits (65), Expect = 4.7
Identities = 19/87 (21%), Positives = 42/87 (48%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
E +++ + + E RE+ E+ +E E ++L + + E +E +
Sbjct: 476 LELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKL 535
Query: 279 RRLEEQLKQLQAAKEELEARQTELQLE 305
+LE L++L+ KE+L+ +Q + +L
Sbjct: 536 EKLENLLEELEELKEKLQLQQLKEELR 562
Score = 28.2 bits (63), Expect = 6.8
Identities = 25/122 (20%), Positives = 53/122 (43%), Gaps = 9/122 (7%)
Query: 188 RVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAA--REK 245
+ R+L + E R + + ++ + + ++ +R+ +LE
Sbjct: 253 ELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRA 312
Query: 246 AEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
++ +E +E+LK LE + L + +E + +LE +L++L K EL E E
Sbjct: 313 LLEELEELLEKLKSLE-------ERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKE 365
Query: 306 LQ 307
L+
Sbjct: 366 LE 367
Score = 28.2 bits (63), Expect = 6.9
Identities = 23/118 (19%), Positives = 49/118 (41%)
Query: 190 NKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKK 249
+ I L E +K + I +Q + + +E A +++ +L+ E E+ + +
Sbjct: 198 LEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKAR 257
Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
E + + ++L E + E+L++L+ EELE L+ L+
Sbjct: 258 LLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLE 315
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 40.8 bits (95), Expect = 7e-04
Identities = 23/100 (23%), Positives = 49/100 (49%), Gaps = 3/100 (3%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
D++ R E Q ++ + E RE A + Q++ + + + E+A Q+L +
Sbjct: 78 DIRPQLRALRTELGTAQGEKRAAETE---REAARSELQKARQEREAVRQELAAARQNLAK 134
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
AQ+ + RL +Q + LQ + L ++ +L+ + Q ++
Sbjct: 135 AQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQ 174
Score = 39.6 bits (92), Expect = 0.001
Identities = 25/112 (22%), Positives = 51/112 (45%), Gaps = 6/112 (5%)
Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE 261
+L +R + ++Q NA Q + ++L AA ++ + Q+ ++ Q
Sbjct: 190 DLKLRSAQ-----IEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKA 244
Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA-RQTELQLELQPIRNQ 312
++A R + + E + ++RLE +L+ +LEA Q ++L Q Q
Sbjct: 245 QQIAARAEQIRERERQLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAATQ 296
Score = 36.2 bits (83), Expect = 0.019
Identities = 24/139 (17%), Positives = 59/139 (42%), Gaps = 3/139 (2%)
Query: 174 DKKAPDFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQR 233
++ VF +R R L +G + R + + + +AR+E+ A +Q+
Sbjct: 66 NRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQEL 125
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
+ +A ++ + + + L+ + + + + + L+ KQLQA+
Sbjct: 126 AAARQNLAKAQQELARLTKQAQDLQT---RLKTLAEQRRQLEAQAQSLQASQKQLQASAT 182
Query: 294 ELEARQTELQLELQPIRNQ 312
+L+++ +L+L I +
Sbjct: 183 QLKSQVLDLKLRSAQIEQE 201
Score = 34.2 bits (78), Expect = 0.094
Identities = 16/103 (15%), Positives = 37/103 (35%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
+ R D+Q E++ + Q LQ + + + + V LK ++
Sbjct: 139 LARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQI 198
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+ Q+L + E+L + AA ++ + ++
Sbjct: 199 EQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQIS 241
Score = 30.8 bits (69), Expect = 1.1
Identities = 29/113 (25%), Positives = 39/113 (34%), Gaps = 18/113 (15%)
Query: 215 VQQMKAQAREEKNAKQQ-------QRDKLQLEIAA----REKAEKKHQESVERLKQLEVE 263
V+Q A AR+ QQ Q LQ + R + E + Q KQL+
Sbjct: 121 VRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQAS 180
Query: 264 MAK---RDQDLMEAQEMI----RRLEEQLKQLQAAKEELEARQTELQLELQPI 309
+ + DL I + L + QA EEL R Q Q I
Sbjct: 181 ATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAI 233
Score = 30.0 bits (67), Expect = 2.1
Identities = 14/96 (14%), Positives = 34/96 (35%), Gaps = 3/96 (3%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
+ Q + + Q + +LQ + + +++Q +A R AQ
Sbjct: 160 RRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRAN---AAQART 216
Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
L + Q + ++ R ++ + Q I ++
Sbjct: 217 EELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAE 252
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 40.4 bits (95), Expect = 0.001
Identities = 28/137 (20%), Positives = 56/137 (40%), Gaps = 8/137 (5%)
Query: 176 KAPDFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDK 235
+ PD + A V + I G L+ R+ ++ + + +
Sbjct: 111 EFPDDLLSAEDPAVPELIK----GQQSLFESRK----STLRAQLELILAQIKQLEAELAG 162
Query: 236 LQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
LQ ++ A + + E +E ++L+ + +L+E + + +L +L+A E L
Sbjct: 163 LQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELEVL 222
Query: 296 EARQTELQLELQPIRNQ 312
+ + ELQLE Q I
Sbjct: 223 KRQIDELQLERQQIEQT 239
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 40.6 bits (96), Expect = 0.001
Identities = 22/111 (19%), Positives = 52/111 (46%), Gaps = 13/111 (11%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIA-AREKAEKKHQESVERLKQLEVE- 263
R + + + + +A + K +++++KLQ E E+AEK+ Q++++ K+ E +
Sbjct: 530 RELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKK-EADE 588
Query: 264 ----------MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQL 304
E E +RL + ++ + K++ + +Q EL++
Sbjct: 589 IIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKV 639
Score = 32.5 bits (75), Expect = 0.35
Identities = 22/85 (25%), Positives = 42/85 (49%), Gaps = 18/85 (21%)
Query: 238 LEIAAR--------EKAEKKHQES-------VERLKQLEVEMAKRDQDLMEAQEMIRRLE 282
EIA R E+A+K E + L++LE E+ +Q EA+ +++ E
Sbjct: 491 FEIAKRLGLPENIIEEAKKLIGEDKEKLNELIASLEELEREL---EQKAEEAEALLKEAE 547
Query: 283 EQLKQLQAAKEELEARQTELQLELQ 307
+ ++L+ KE+L+ + +L E +
Sbjct: 548 KLKEELEEKKEKLQEEEDKLLEEAE 572
Score = 31.7 bits (73), Expect = 0.52
Identities = 20/98 (20%), Positives = 47/98 (47%), Gaps = 14/98 (14%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
D + ++ A E+ ++ ++ + E A ++AEK +E E+ ++L+ E K
Sbjct: 514 DKEKLNELIA--SLEELERELEQKAEEAE-ALLKEAEKLKEELEEKKEKLQEEEDK---- 566
Query: 271 LMEAQEMIRRLEEQLKQ-LQAAKEELEARQTELQLELQ 307
++ E++ +Q ++ AK+E + EL+ +
Sbjct: 567 ------LLEEAEKEAQQAIKEAKKEADEIIKELRQLQK 598
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 40.1 bits (94), Expect = 0.001
Identities = 25/117 (21%), Positives = 57/117 (48%), Gaps = 6/117 (5%)
Query: 203 LYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ--ESVERLKQL 260
+ +++R+ + +++ +A++E K + E+ RE+ +++ + E L++
Sbjct: 23 VLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQRE 82
Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQL----KQLQAAKEELEARQTELQLELQPIRNQS 313
E + ++++ L E + LE QL K L A + ELE + +L EL + +
Sbjct: 83 EERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLT 139
Score = 30.8 bits (70), Expect = 0.91
Identities = 21/100 (21%), Positives = 49/100 (49%), Gaps = 2/100 (2%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
V +K + R K A++ ++ + E A + +E + R + + + A+R+++ E
Sbjct: 22 VVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARRERE--EL 79
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
Q RL ++ +QL A E+L+ + +L+ + + +
Sbjct: 80 QREEERLVQKEEQLDARAEKLDNLENQLEEREKALSAREL 119
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 39.8 bits (94), Expect = 0.001
Identities = 25/106 (23%), Positives = 53/106 (50%), Gaps = 13/106 (12%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAARE----KAEKKHQESVERLKQLEVEMAKRDQDL 271
++ +A+EE + R++ + E+ R K EK+ + E L + + KR+++L
Sbjct: 56 KEALLEAKEEI---HKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEEL 112
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
+ ++ + + +++L++ +EELE E EL+ I S L
Sbjct: 113 EKKEKELEQKQQELEKK---EEELEELIEEQLQELERI---SGLTA 152
Score = 34.4 bits (80), Expect = 0.076
Identities = 24/97 (24%), Positives = 53/97 (54%), Gaps = 16/97 (16%)
Query: 207 RRKPDTIDVQQMKAQAREEK-NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
RK + ++ ++ + + +E++ KQQ+ +K + E+ E+ ++ + +ER+ L E A
Sbjct: 99 DRKLELLEKREEELEKKEKELEQKQQELEKKEEEL---EELIEEQLQELERISGLTAEEA 155
Query: 266 KRDQDLM---------EAQEMIRRLEEQLKQLQAAKE 293
K + L+ EA +I+ +EE+ K+ +A K+
Sbjct: 156 K--EILLEKVEEEARHEAAVLIKEIEEEAKE-EADKK 189
Score = 33.2 bits (77), Expect = 0.17
Identities = 24/114 (21%), Positives = 54/114 (47%), Gaps = 18/114 (15%)
Query: 204 YMRRRKPDTIDVQQMKAQARE-----EKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK 258
Y R+K +++ + +A+ +K A+ +++ L + A+E+ K E
Sbjct: 22 YFVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEAL---LEAKEEIHKLRNE------ 72
Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E E+ +R +L + ++ + + EE L + E LE R+ EL+ + + + +
Sbjct: 73 -FEKELRERRNELQKLEKRLLQKEENLDR---KLELLEKREEELEKKEKELEQK 122
Score = 30.1 bits (69), Expect = 1.8
Identities = 23/78 (29%), Positives = 44/78 (56%), Gaps = 13/78 (16%)
Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
A E+A++ +E+ + E E K++ L+EA+E I +L + ++ EL R+
Sbjct: 35 EAEEEAKRILEEA-----KKEAEAIKKEA-LLEAKEEIHKLRNEFEK------ELRERRN 82
Query: 301 ELQ-LELQPIRNQSNLNK 317
ELQ LE + ++ + NL++
Sbjct: 83 ELQKLEKRLLQKEENLDR 100
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 39.5 bits (93), Expect = 0.002
Identities = 23/81 (28%), Positives = 45/81 (55%), Gaps = 2/81 (2%)
Query: 235 KLQLEIAAREKAEKKHQESVERLKQL--EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAK 292
+L+ E+ E+ K+ +E +E++K+L ++ R Q E +E + +L E ++L
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEEL 390
Query: 293 EELEARQTELQLELQPIRNQS 313
EELE EL+ EL+ + ++
Sbjct: 391 EELEEELKELKEELESLYSEG 411
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in
the lipid bilayer. The MscS proteins are heptamers of
three transmembrane subunits with seven converging M3
domains, and this MscS_porin is towards the N-terminal
of the molecules. The high concentration of negative
charges at the extracellular entrance of the pore helps
select the cations for efflux.
Length = 239
Score = 38.8 bits (91), Expect = 0.002
Identities = 24/86 (27%), Positives = 43/86 (50%), Gaps = 2/86 (2%)
Query: 231 QQRDKLQLEIAAREKAEKKHQESVER--LKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
++ +L+ E+ A +K + + L QLE +A+ L E QE +++ QL +L
Sbjct: 52 KELRELRQELEALKKTDAPVFPELANLSLSQLEQRLAQTLSQLQELQEQLQQENSQLIEL 111
Query: 289 QAAKEELEARQTELQLELQPIRNQSN 314
Q E + + +E + LQ IRN+
Sbjct: 112 QTRPERAQQQLSEARRRLQEIRNRLQ 137
Score = 30.3 bits (69), Expect = 1.0
Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 9/93 (9%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAARE---KAEKKHQESVERLKQLEVEMAKRDQDLM 272
+++A + Q QR LQ E+AA + + + S ++L + +DL+
Sbjct: 134 NRLQALSPGGTPLAQAQRTLLQAELAALKAQIEELELELLSNNNRQEL----LRLQRDLL 189
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
+ + I RLE +L+ LQ A ++E +E
Sbjct: 190 KKR--IERLEAELQALQNAINRKRLAESEQAVE 220
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome
partitioning].
Length = 1480
Score = 39.5 bits (92), Expect = 0.002
Identities = 25/109 (22%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEI---AAREKAEKKHQESVERLK 258
E + + + + ARE++ A +Q++++LQ I R Q ++E+L
Sbjct: 556 EALHQELEALIESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQRAPVWLAAQNALEQLS 615
Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+ E QD+ E +++L E+ ++ ++EL AR+ L E++
Sbjct: 616 EQSGEEFTDSQDVT---EYMQQLLEREREATVERDELGARKNALDEEIE 661
Score = 29.9 bits (67), Expect = 2.1
Identities = 30/125 (24%), Positives = 49/125 (39%), Gaps = 16/125 (12%)
Query: 204 YMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEI-AAREKAEKKHQES-VERLKQLE 261
YMR + + Q RE ++QQ + + +RE AE E +E Q
Sbjct: 274 YMRHANERRVHLDQALEFRRELYTSRQQLAAEQYRHVDMSRELAELNGAEGDLEADYQAA 333
Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQLQA--------------AKEELEARQTELQLELQ 307
+ Q + QE I R + L++L +EE EAR +LE+
Sbjct: 334 SDHLNLVQTALRQQEKIERYQADLEELTIRLEEQNEVVEEANERQEENEARAEAAELEVD 393
Query: 308 PIRNQ 312
+++Q
Sbjct: 394 ELKSQ 398
Score = 29.1 bits (65), Expect = 3.9
Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 2/94 (2%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+ + +ARE+ Q Q + +A+ + + +E + L+Q ++ R E +
Sbjct: 997 EAERTRAREQLRQHQAQLSQYNQVLASLKSSYDTKKELLNELQQELQDIGVRADSGAEER 1056
Query: 276 EMIRR--LEEQLKQLQAAKEELEARQTELQLELQ 307
IRR L QL ++ + +LE + T + E+
Sbjct: 1057 ARIRRDELHAQLSTNRSRRNQLEKQLTFCEAEMD 1090
>gnl|CDD|241342 cd13188, FERM_C_PTPN14_PTPN21, Protein tyrosine phosphatase
non-receptor proteins 14 and 21 (PTPN14 and 21) FERM
domain. This CD contains PTP members: pez/PTPN14 and
PTPN21. A number of mutations in Pez have been shown to
be associated with breast and colorectal cancer. The
PTPN protein family belong to larger family of PTPs.
PTPs are known to be signaling molecules that regulate
a variety of cellular processes including cell growth,
differentiation, mitotic cycle, and oncogenic
transformation. The members are composed of a
N-terminal FERM domain and a C-terminal PTP catalytic
domain. The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. Like most other ERM members they have a
phosphoinositide-binding site in their FERM domain. The
FERM C domain is the third structural domain within the
FERM domain. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a
link between the membrane and cytoskeleton and are
involved in signal transduction pathways. The FERM
domain is also found in protein tyrosine phosphatases
(PTPs) , the tyrosine kinases FAK and JAK, in addition
to other proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 111
Score = 36.9 bits (86), Expect = 0.002
Identities = 13/53 (24%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 32 GMLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDN 84
G+ +A + Y++ + L+ YG + K+ +G ++++G G I+ K N
Sbjct: 1 GLSPAEAELLYIQEVEQLDGYGEESYPAKDNQGNDIFIGASFEG--IFVKHKN 51
Score = 32.7 bits (75), Expect = 0.069
Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)
Query: 153 FPWSEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHELY 204
F W++I NIS N F ++ I+K+ F K + +C+ H+ Y
Sbjct: 58 FKWNDIGNISHNKSFFSVELINKEET-IQFQMEDAETAKYVWRMCVARHKFY 108
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 38.9 bits (91), Expect = 0.002
Identities = 21/78 (26%), Positives = 46/78 (58%), Gaps = 4/78 (5%)
Query: 230 QQQRDKLQLEIAAREKA--EKKHQES--VERLKQLEVEMAKRDQDLMEAQEMIRRLEEQL 285
+ Q +Q +IAA+EK+ +++ Q + + +LK+ E +++ + L E Q + +L +Q+
Sbjct: 46 RDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQI 105
Query: 286 KQLQAAKEELEARQTELQ 303
+L A+ +LE +Q +
Sbjct: 106 DELNASIAKLEQQQAAQE 123
Score = 33.9 bits (78), Expect = 0.089
Identities = 24/89 (26%), Positives = 43/89 (48%), Gaps = 3/89 (3%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+++ AQ E + + QQ+ L + A ++K E+ E + L LE + K Q L E +
Sbjct: 180 EELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSLQKDQQQLSELR 239
Query: 276 EMIRRLEEQLKQLQA---AKEELEARQTE 301
RL + + + + A+ E EAR+
Sbjct: 240 ANESRLRDSIARAEREAKARAEREAREAA 268
Score = 29.3 bits (66), Expect = 2.8
Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 7/86 (8%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+QQ A + +QQQR L ++ +E+A ++ +L++ + + + ++ + E
Sbjct: 51 SIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAI---SQASRKLRETQNTLNQLNKQIDE 107
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQ 299
I +LE+Q QAA+E L A Q
Sbjct: 108 LNASIAKLEQQ----QAAQERLLAAQ 129
Score = 28.5 bits (64), Expect = 4.5
Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 2/67 (2%)
Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRR-LEEQLKQLQAAKEELEARQTEL-QLELQP 308
QE++ LKQ E+A + +L E Q + L EQ Q Q ++ R+ L LE
Sbjct: 169 QETIAELKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSL 228
Query: 309 IRNQSNL 315
++Q L
Sbjct: 229 QKDQQQL 235
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 39.1 bits (92), Expect = 0.003
Identities = 32/112 (28%), Positives = 46/112 (41%), Gaps = 4/112 (3%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
R+K +T ++Q AQA + Q + + L+ E + S L+QLE +A+
Sbjct: 77 RQKEETEQLKQQLAQAPAKLRQAQAELEALK----DDNDEETRETLSTLSLRQLESRLAQ 132
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
L AQ + QL LQ E +A LQ IRN KV
Sbjct: 133 TLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKGGKV 184
Score = 32.2 bits (74), Expect = 0.46
Identities = 23/125 (18%), Positives = 47/125 (37%), Gaps = 33/125 (26%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+ K E+K +Q L L + ++ ++ +QL+ ++A+ L +AQ
Sbjct: 49 NKQKLLEAEDKLVQQDLEQTLAL-LDKIDRQKE-------ETEQLKQQLAQAPAKLRQAQ 100
Query: 276 EMIRRLEEQ-------------LKQLQAAKEELEARQTELQLEL------------QPIR 310
+ L++ L+QL++ + + Q +L QP R
Sbjct: 101 AELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPER 160
Query: 311 NQSNL 315
Q+ L
Sbjct: 161 AQAAL 165
Score = 30.3 bits (69), Expect = 1.7
Identities = 26/95 (27%), Positives = 38/95 (40%), Gaps = 13/95 (13%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+K K + QR LQ E A ++S+E QL QDL++ Q
Sbjct: 177 NLLKGGKVGGKALRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQL--------QDLLQKQ 228
Query: 276 -----EMIRRLEEQLKQLQAAKEELEARQTELQLE 305
I+RLE QL+ LQ A +E ++
Sbjct: 229 RDYLTARIQRLEHQLQLLQEAINSKRLTLSEKTVQ 263
Score = 28.7 bits (65), Expect = 4.9
Identities = 27/107 (25%), Positives = 44/107 (41%), Gaps = 21/107 (19%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEK--KHQESVERLKQLEVEMAKRDQDLMEAQE 276
+AQA N+++ Q +I K K + L+ E Q L+ AQ
Sbjct: 160 RAQAALYANSQRLQ------QIRNLLKGGKVGGKALRPSQRVLLQAE-----QALLNAQN 208
Query: 277 MIRRLE----EQLKQL-QAAKEELEARQTELQLELQPIR---NQSNL 315
++R QL+ L Q ++ L AR L+ +LQ ++ N L
Sbjct: 209 DLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEAINSKRL 255
>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal. This domain
corresponds to the alpha helical C terminal domain of
phospholipase C beta.
Length = 181
Score = 37.5 bits (87), Expect = 0.003
Identities = 27/99 (27%), Positives = 53/99 (53%), Gaps = 2/99 (2%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
+ ++++ + R E+ + + ++K E E QE V+ +K+LE +R +
Sbjct: 64 EKKELKKKLDRKRLERIQEAKTKEKHAQEREKTEINRSHIQEVVQSIKRLEEAQKRRQEK 123
Query: 271 LMEAQ-EMIRRLEEQLKQLQA-AKEELEARQTELQLELQ 307
L E Q E++++++E+ +LQA A E EA+ L E+Q
Sbjct: 124 LEEKQAEILQQIQEEEPKLQAEATAEYEAKLKRLPQEVQ 162
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 38.3 bits (89), Expect = 0.004
Identities = 20/92 (21%), Positives = 43/92 (46%), Gaps = 2/92 (2%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
+R + ++++ +A EK AKQ ++ Q E ++ E K +++ E + E E K
Sbjct: 83 QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEK 142
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
+ ++ + Q + +A K+ EA+
Sbjct: 143 KAKEEAKKQAEEEAKAKA--AAEAKKKAAEAK 172
Score = 35.6 bits (82), Expect = 0.027
Identities = 23/88 (26%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 215 VQQMKAQAREEKNAKQQQRDKL-QLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
QQ +++K A ++++++ +LE A E +++ E R K+LE + A ++ +
Sbjct: 49 AQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQ-ARQKELE-QRAAAEKAAKQ 106
Query: 274 AQEMIRRLEEQLKQL--QAAKEELEARQ 299
A++ ++ EE+ KQ AK+ EA+
Sbjct: 107 AEQAAKQAEEKQKQAEEAKAKQAAEAKA 134
Score = 33.7 bits (77), Expect = 0.10
Identities = 21/96 (21%), Positives = 49/96 (51%), Gaps = 5/96 (5%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAARE---KAEKKHQESVERLKQLEV 262
++ + ++Q +A +++ A+Q ++ +L+ AA + +AE+ +++ E+ KQ E
Sbjct: 64 KKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEE 123
Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
AK+ + E E++ K+ + E EA+
Sbjct: 124 AKAKQAAEAKAKAEAEA--EKKAKEEAKKQAEEEAK 157
Score = 33.7 bits (77), Expect = 0.12
Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 6/82 (7%)
Query: 226 KNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQL 285
A QQ +++Q + K E+ ER K+LE + + ++ Q + LE++
Sbjct: 45 PGAVAQQANRIQQQKKPAAKKEQ------ERQKKLEQQAEEAEKQRAAEQARQKELEQRA 98
Query: 286 KQLQAAKEELEARQTELQLELQ 307
+AAK+ +A + + + Q
Sbjct: 99 AAEKAAKQAEQAAKQAEEKQKQ 120
Score = 27.9 bits (62), Expect = 8.5
Identities = 15/79 (18%), Positives = 32/79 (40%), Gaps = 7/79 (8%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
KA+ E AK + + + + A+++AE+ E + AK+ + +
Sbjct: 125 KAKQAAEAKAKAEAEAEKKAKEEAKKQAEE------EAKAK-AAAEAKKKAAEAKKKAEA 177
Query: 279 RRLEEQLKQLQAAKEELEA 297
+ + +A EE +A
Sbjct: 178 EAKAKAEAKAKAKAEEAKA 196
>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family. This family includes PspA
a protein that suppresses sigma54-dependent
transcription. The PspA protein, a negative regulator of
the Escherichia coli phage shock psp operon, is produced
when virulence factors are exported through secretins in
many Gram-negative pathogenic bacteria and its homologue
in plants, VIPP1, plays a critical role in thylakoid
biogenesis, essential for photosynthesis. Activation of
transcription by the enhancer-dependent bacterial
sigma(54) containing RNA polymerase occurs through ATP
hydrolysis-driven protein conformational changes enabled
by activator proteins that belong to the large AAA(+)
mechanochemical protein family. It has been shown that
PspA directly and specifically acts upon and binds to
the AAA(+) domain of the PspF transcription activator.
Length = 220
Score = 37.7 bits (88), Expect = 0.004
Identities = 24/108 (22%), Positives = 53/108 (49%), Gaps = 4/108 (3%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ----ESVERLKQLEVEMAKRDQD 270
Q + Q + E+ ++Q+ +LE AR K ++ E++ + LE + +
Sbjct: 47 AQVIARQKQLERKLEEQKEQAKKLENKARAALTKGNEELAREALAEIATLEKQAEALETQ 106
Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
L + + + +L +QL L+ ++L+A++T L+ L+ + Q +N
Sbjct: 107 LTQQRSAVEQLRKQLAALETKIQQLKAKKTALKARLKAAKAQEAVNTS 154
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 38.4 bits (90), Expect = 0.004
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 16/101 (15%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLE-IAAREKAEKKHQESVERL-KQLEVEMAKRDQDL 271
++++M A E+ A + DK + E I A+ +AE E V L K L+ E K +
Sbjct: 505 EIERMVKDA--EEYAAE---DKKRKERIEAKNEAE----EYVYSLEKSLKEEGDKLPE-- 553
Query: 272 MEAQEMIRRLEEQLKQLQAA--KEELEARQTELQLELQPIR 310
++ + E LK+ KEE+EA+ ELQ +QPI
Sbjct: 554 -ADKKKVEEAIEWLKEELEGEDKEEIEAKTEELQKVVQPIG 593
>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
mechanisms].
Length = 352
Score = 37.7 bits (88), Expect = 0.005
Identities = 25/110 (22%), Positives = 45/110 (40%), Gaps = 5/110 (4%)
Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE 261
L + + Q AQA + A+ Q D+ Q E+ R + ++ S E L +
Sbjct: 99 ALAAAEAQLRNLRAQLASAQALIAQ-AEAQDLDQAQNELERRAELAQRGVVSREELDRAR 157
Query: 262 VEMAKRDQDLMEAQEMIRR----LEEQLKQLQAAKEELEARQTELQLELQ 307
+ + L AQ ++ LE ++ QA EA + +L+L+
Sbjct: 158 AALQAAEAALAAAQAAQKQNLALLESEVSGAQAQVASAEAALDQAKLDLE 207
Score = 32.3 bits (74), Expect = 0.28
Identities = 19/113 (16%), Positives = 37/113 (32%), Gaps = 18/113 (15%)
Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKH----QESVERLKQL-------- 260
++Q +A + + R +L A +AE + Q +ER +L
Sbjct: 91 AALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQNELERRAELAQRGVVSR 150
Query: 261 -EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E++ A+ EA L + LE+ + Q ++
Sbjct: 151 EELDRARAALQAAEAA-----LAAAQAAQKQNLALLESEVSGAQAQVASAEAA 198
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 37.9 bits (88), Expect = 0.005
Identities = 18/84 (21%), Positives = 36/84 (42%), Gaps = 1/84 (1%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+Q + + E++ Q++ K Q E AA++ A K+ Q K AK + +
Sbjct: 97 AAEQERLKQLEKERLAAQEQKK-QAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKR 155
Query: 274 AQEMIRRLEEQLKQLQAAKEELEA 297
A ++ + K+ A+ +A
Sbjct: 156 AAAAAKKAAAEAKKKAEAEAAKKA 179
Score = 37.5 bits (87), Expect = 0.007
Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 15/96 (15%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
QQ K+ R ++QR K + + A E+ ++K ERLKQLE E + +A+
Sbjct: 69 QQQKSAKR-----AEEQRKKKEQQQA--EELQQKQAAEQERLKQLEKERLAAQEQKKQAE 121
Query: 276 EMIRRLEEQLKQ--------LQAAKEELEARQTELQ 303
E ++ + KQ AAK + EA
Sbjct: 122 EAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAA 157
>gnl|CDD|241340 cd13186, FERM_C_NBL4_NBL5, Novel band 4.1-like protein 4 and 5
FERM (NBL4 and 5) domain C-lobe. NBL4 (also called
Erythrocyte protein band 4.1-like 4; Epb4 1l4) plays a
role the beta-catenin/Tcf signaling pathway and is
thought to be involved in establishing the cell
polarity or proliferation. NBL4 may be also involved in
adhesion, in cell motility and/or in cell-to-cell
communication. No role for NBL5 has been proposed to
date. Both NBL4 and NBL5 contain a N-terminal FERM
domain which has a cloverleaf tripart structure
(FERM_N, FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe,
B-lobe, C-lobe/F1, F2, F3). A single FERM_C repeat 2
from Tetraodon nigroviridis also included in this cdThe
C-lobe is a member of the PH superfamily. The FERM
domain is found in the cytoskeletal-associated proteins
such as ezrin, moesin, radixin, 4.1R, and merlin. These
proteins provide a link between the membrane and
cytoskeleton and are involved in signal transduction
pathways. The FERM domain is also found in protein
tyrosine phosphatases (PTPs) , the tyrosine kinases FAK
and JAK, in addition to other proteins involved in
signaling. This domain is structurally similar to the
PH and PTB domains and consequently is capable of
binding to both peptides and phospholipids at different
sites.
Length = 111
Score = 35.7 bits (83), Expect = 0.006
Identities = 18/44 (40%), Positives = 28/44 (63%)
Query: 37 DAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYE 80
+A + YL A+ LEMYGV+ +K + G E +LG+ G+ +YE
Sbjct: 4 EAELNYLNKAKWLEMYGVDLHPVKGEDGNEYFLGLTPTGILVYE 47
Score = 26.5 bits (59), Expect = 9.7
Identities = 9/27 (33%), Positives = 15/27 (55%), Gaps = 1/27 (3%)
Query: 150 KIG-FPWSEIRNISFNDRKFIIKPIDK 175
KIG F W +I + F +K + ++K
Sbjct: 51 KIGLFFWPKITKLDFKGKKLKLVVVEK 77
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 37.0 bits (86), Expect = 0.010
Identities = 26/93 (27%), Positives = 46/93 (49%), Gaps = 10/93 (10%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM- 277
KAQ +EK K+ +R+ A EK KK + RLK++ E+A++++ +E
Sbjct: 280 KAQRNKEKRRKELERE------AKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQR 333
Query: 278 -IRRLEEQLKQLQAAKEELEARQTELQL--ELQ 307
R +++LK+ + K + E+ L EL
Sbjct: 334 KERGEKKKLKRRKLGKHKYPEPPLEVVLPDELS 366
>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP). This family consists of
several PV-1 (PLVAP) proteins which seem to be specific
to mammals. PV-1 is a novel protein component of the
endothelial fenestral and stomatal diaphragms. The
function of this family is unknown.
Length = 442
Score = 36.9 bits (85), Expect = 0.010
Identities = 26/84 (30%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
ARE + ++Q+ + Q AA+E K +E+ R +L+ E A++ Q +E + +R
Sbjct: 298 ARENADLQRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLALEEKAALRAE 357
Query: 282 EEQL-KQLQAAKEELEARQTELQL 304
+ L K+L+A K E E + EL +
Sbjct: 358 RDNLAKELEAKKREAEQLRMELAI 381
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 36.8 bits (86), Expect = 0.011
Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 7/90 (7%)
Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI----R 279
E K + R +L+ E+ R ++ + RL Q E + ++ + L + +E + +
Sbjct: 55 EAKEEVHKLRAELERELKERRNELQRLER---RLLQREETLDRKMESLDKKEENLEKKEK 111
Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQPI 309
L + K L +EELE E + EL+ I
Sbjct: 112 ELSNKEKNLDEKEEELEELIAEQREELERI 141
Score = 33.4 bits (77), Expect = 0.16
Identities = 20/112 (17%), Positives = 55/112 (49%), Gaps = 9/112 (8%)
Query: 190 NKRILALCMGNHELYMRRRKPDTIDVQQMKA--QAREEKNAKQQQRDKLQLEIAAREKAE 247
+ AL E++ R + + ++++ + Q E + Q+ + L ++ + +K E
Sbjct: 47 TLKKEALLEAKEEVHKLRAELER-ELKERRNELQRLERRL--LQREETLDRKMESLDKKE 103
Query: 248 KKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
+ ++ + L E + ++++ E +E+I E+L+++ +E EA++
Sbjct: 104 ENLEKKEKELSNKEKNLDEKEE---ELEELIAEQREELERISGLTQE-EAKE 151
Score = 33.0 bits (76), Expect = 0.18
Identities = 25/103 (24%), Positives = 55/103 (53%), Gaps = 17/103 (16%)
Query: 208 RKPDTIDVQQMKAQAREEK-NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
RK +++D ++ + +E++ + K++ D+ + E+ E+ + +E +ER+ L E AK
Sbjct: 94 RKMESLDKKEENLEKKEKELSNKEKNLDEKEEEL---EELIAEQREELERISGLTQEEAK 150
Query: 267 -------RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
++ EA ++I+ +EE+ AKEE + + E+
Sbjct: 151 EILLEEVEEEARHEAAKLIKEIEEE------AKEEADKKAKEI 187
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 36.9 bits (86), Expect = 0.011
Identities = 28/106 (26%), Positives = 55/106 (51%), Gaps = 10/106 (9%)
Query: 214 DVQQMKAQAREE-KNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
+ ++KA+ EE + A +++R++L ++ A + +E+ KQL +E +
Sbjct: 178 KLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEA-ALEKQLRLEFER------ 230
Query: 273 EAQEMIRRLEEQLK-QLQAAKEELEARQTELQLELQPIRNQSNLNK 317
E +E+ ++ EE+L+ +L+ E E + +L LQ I Q NK
Sbjct: 231 EKEELRKKYEEKLRQELERQAEAHEQKLKN-ELALQAIELQREFNK 275
Score = 30.8 bits (70), Expect = 1.1
Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQ-LQAAK 292
D+L ++A + E++ E + K+ E+ ++ L + LE+QL+ + K
Sbjct: 173 DQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREK 232
Query: 293 EELEAR-QTELQLELQ 307
EEL + + +L+ EL+
Sbjct: 233 EELRKKYEEKLRQELE 248
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 37.2 bits (87), Expect = 0.012
Identities = 18/70 (25%), Positives = 37/70 (52%)
Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ 289
+++ L+LE+ A E+ + ERL +L E+A + +L + ++ +E ++ +
Sbjct: 425 RRRIAALELELDALEREAALGADHDERLAELRAELAALEAELAALEARWQQEKELVEAIL 484
Query: 290 AAKEELEARQ 299
A + ELEA
Sbjct: 485 ALRAELEADA 494
Score = 28.0 bits (63), Expect = 7.6
Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 6/63 (9%)
Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEE---LEAR-QTELQL--ELQPI 309
R+ LE+E+ +++ + RL E +L A + E LEAR Q E +L + +
Sbjct: 427 RIAALELELDALEREAALGADHDERLAELRAELAALEAELAALEARWQQEKELVEAILAL 486
Query: 310 RNQ 312
R +
Sbjct: 487 RAE 489
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 36.8 bits (85), Expect = 0.014
Identities = 32/104 (30%), Positives = 54/104 (51%), Gaps = 15/104 (14%)
Query: 205 MRRRKPDTIDVQQM----KAQAREEKNAKQQ-----QRDKLQLEIAAR--EKAEKKHQES 253
+ ++ D +Q M K++A NA++Q +R K + E AAR +A +E
Sbjct: 468 VSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREEC 527
Query: 254 VERLKQ----LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
E LKQ LE+E+ K + DL +E R LE++ ++L+ +E
Sbjct: 528 AESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQE 571
Score = 35.3 bits (81), Expect = 0.045
Identities = 26/92 (28%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
+ M + +++K + Q +L+ E +R AEK+ E +R K+ E E A R A
Sbjct: 464 LNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEE-ETAARAAAQAAA 522
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
EE + L+ AK++LE +L+ +L
Sbjct: 523 SR-----EECAESLKQAKQDLEMEIKKLEHDL 549
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 35.6 bits (83), Expect = 0.015
Identities = 16/78 (20%), Positives = 39/78 (50%), Gaps = 9/78 (11%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
+++Q + EEK +++ +L+ ++ A EK E++ ++ E+ E+ K+
Sbjct: 121 GKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQ-- 178
Query: 271 LMEAQEMIRRLEEQLKQL 288
++L+ QL+Q+
Sbjct: 179 -------NQQLKSQLEQI 189
Score = 29.5 bits (67), Expect = 1.7
Identities = 14/57 (24%), Positives = 29/57 (50%), Gaps = 2/57 (3%)
Query: 258 KQLEVEMAKRD--QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
K L+ E K + Q++ + +E LE+++ +L+A E +E R+ E + +
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHAD 170
>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968). This
family of proteins has no known function.
Length = 192
Score = 35.5 bits (82), Expect = 0.019
Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 3/84 (3%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
++ KAQ + A + + ++LQ ++ + E Q+ R +Q E +
Sbjct: 106 RRAQLEAQKAQLERQIAASEARAERLQADLQLAQAQE---QQVAARQQQARQEAVALEAQ 162
Query: 271 LMEAQEMIRRLEEQLKQLQAAKEE 294
AQ +R+L+ Q++QLQA + E
Sbjct: 163 RQAAQAQLRKLQRQIRQLQAQQNE 186
Score = 27.8 bits (62), Expect = 6.1
Identities = 15/81 (18%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
+ QLE A + + E++ S R ++L+ ++ + ++ ++ L+A ++
Sbjct: 106 RRAQLE-AQKAQLERQIAASEARAERLQADLQLAQAQEQQVAARQQQARQEAVALEAQRQ 164
Query: 294 ELEARQTELQLELQPIRNQSN 314
+A+ +LQ +++ ++ Q N
Sbjct: 165 AAQAQLRKLQRQIRQLQAQQN 185
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 35.3 bits (82), Expect = 0.020
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 5/105 (4%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREK--AE--KKHQESVERLKQLEVEMAKRDQDL 271
+ +K + E ++ +KL EIA K E KK +E VE L++ + K Q L
Sbjct: 30 KSLKEEI-AEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSL 88
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLN 316
+ ++ LE++LK L+ E LE R +++ E + ++
Sbjct: 89 KNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAA 133
Score = 29.9 bits (68), Expect = 1.4
Identities = 18/94 (19%), Positives = 48/94 (51%), Gaps = 7/94 (7%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
++ Q + E +++ ++L+ ++ EK ++ + RLK+LE E+ ++L
Sbjct: 52 EIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKEL----KNLKW 107
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
E+ LE++ ++++ ++EL + ++Q
Sbjct: 108 ESEV---LEQRFEKVERERDELYDKFEAAIQDVQ 138
>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
Length = 850
Score = 36.1 bits (84), Expect = 0.021
Identities = 31/119 (26%), Positives = 54/119 (45%), Gaps = 20/119 (16%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRD-KLQLEI-------AAREKAEKKHQES---VERLK-- 258
+DVQ + + +E+ Q+ +L +EI AAR +AE QE+ +ER K
Sbjct: 632 NVDVQSV--EPVDERTRDSLQKSVQLAIEITTKSQEAAARHQAELLEQEARGRLERQKMH 689
Query: 259 -QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ--PIRNQSN 314
+ + E + L+E Q +E + A E EAR E + E++ +R ++
Sbjct: 690 DKAKAEEQRT--KLLELQAESAAVESSGQSRAEALAEAEARLIEAEAEVEQAELRAKAL 746
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 36.2 bits (84), Expect = 0.026
Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 1/94 (1%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
++ + E N + +L+ E+ EK K+ +E E +++LE E+ + +
Sbjct: 198 KEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKL 257
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
+E IR LEE++++L+ EELE + EL+ EL+
Sbjct: 258 EEKIRELEERIEELKKEIEELEEKVKELK-ELKE 290
Score = 32.3 bits (74), Expect = 0.36
Identities = 19/92 (20%), Positives = 54/92 (58%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
T +++++ + +E ++ +++ E+ + +K ++ V+ L++L+ E+ + +++L
Sbjct: 188 TENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKEL 247
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
+ R+LEE++++L+ EEL+ EL+
Sbjct: 248 ESLEGSKRKLEEKIRELEERIEELKKEIEELE 279
Score = 30.0 bits (68), Expect = 2.0
Identities = 20/98 (20%), Positives = 48/98 (48%), Gaps = 2/98 (2%)
Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE 276
+ + + EEK + ++ K E+ R + ++ E E K + E+ + + L
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRL--TGL 384
Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
+LE++L++L+ AKEE+E +++ + ++ +
Sbjct: 385 TPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIK 422
Score = 29.6 bits (67), Expect = 2.6
Identities = 26/102 (25%), Positives = 56/102 (54%), Gaps = 6/102 (5%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
++++ ++ EE N +++ +L+ + E+ +KK +E +RL++LE E + ++
Sbjct: 311 EIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELE-ERHELYEEAKA 369
Query: 274 AQEMIRRLEEQLKQLQAAK-----EELEARQTELQLELQPIR 310
+E + RL+++L L K EELE + E++ E+ I
Sbjct: 370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKIT 411
Score = 29.6 bits (67), Expect = 2.8
Identities = 19/73 (26%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 236 LQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
++ I EK K+ + E +K+ E E+ + +++ E + L E+L++L+ +EL
Sbjct: 174 IKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKEL 233
Query: 296 EARQTEL-QLELQ 307
E + E+ +LE +
Sbjct: 234 EELKEEIEELEKE 246
Score = 29.3 bits (66), Expect = 3.0
Identities = 21/98 (21%), Positives = 49/98 (50%), Gaps = 9/98 (9%)
Query: 222 AREEKNAKQQQRDKLQLEIAAREK----AEKKHQESVERLKQLEV-----EMAKRDQDLM 272
A +E ++++ KL+ E+ + EK+ +E + L++LE E + ++ +
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYL 669
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
E + L +L++L+ +EE++ +L+ EL+
Sbjct: 670 ELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707
Score = 29.3 bits (66), Expect = 3.2
Identities = 23/87 (26%), Positives = 44/87 (50%), Gaps = 1/87 (1%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
+ +E +++ +L+ EI E+ K+ +E ERL++L+ ++ + ++ L E +E
Sbjct: 302 YEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH 361
Query: 279 RRLEEQLKQLQAAKEELEARQTELQLE 305
EE K + E L+ R T L E
Sbjct: 362 ELYEE-AKAKKEELERLKKRLTGLTPE 387
Score = 28.5 bits (64), Expect = 5.7
Identities = 21/78 (26%), Positives = 44/78 (56%)
Query: 233 RDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAK 292
++L+ I E E+ +E + L+++ E+ + +L E +E + +LE+++K+L+ K
Sbjct: 178 IERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELK 237
Query: 293 EELEARQTELQLELQPIR 310
EE+E + EL+ R
Sbjct: 238 EEIEELEKELESLEGSKR 255
Score = 28.5 bits (64), Expect = 6.1
Identities = 17/68 (25%), Positives = 38/68 (55%)
Query: 237 QLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
+LE ++ +E++++E E +L E+A +L E ++ +++ L++L+ EE E
Sbjct: 648 ELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707
Query: 297 ARQTELQL 304
+ EL+
Sbjct: 708 KAKKELEK 715
Score = 28.1 bits (63), Expect = 8.8
Identities = 17/59 (28%), Positives = 35/59 (59%)
Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
EK +++++ E+L +L+ E+ ++L + +E+ ++L E K+L +EEL EL
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 35.5 bits (82), Expect = 0.032
Identities = 23/100 (23%), Positives = 37/100 (37%), Gaps = 16/100 (16%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAA-------REKAEKKHQESV---ERLKQLEVE 263
+ + A+ + +++QR +L ++AA A E RL
Sbjct: 95 QIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGA 154
Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
+ E I L+ LKQL A + E+ A Q EL
Sbjct: 155 LNP------ARAERIDALKATLKQLAAVRAEIAAEQAELT 188
Score = 35.5 bits (82), Expect = 0.034
Identities = 23/106 (21%), Positives = 43/106 (40%), Gaps = 4/106 (3%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKL---QLEIAA-REKAEKKHQESVERLKQLEVEMAK 266
D ++ A+ ++ D L ++AA R + + E L + + AK
Sbjct: 141 DAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAK 200
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
Q L E ++ + +L +L Q EEL A ++ L+ E+
Sbjct: 201 LAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAA 246
Score = 35.1 bits (81), Expect = 0.047
Identities = 15/97 (15%), Positives = 31/97 (31%), Gaps = 4/97 (4%)
Query: 214 DVQQMKAQAREEKNAKQ----QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ 269
V+ A + E Q+ KL + R+K + + ++ E+ +
Sbjct: 175 AVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANES 234
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
L + ++ AA E AR + +
Sbjct: 235 RLKNEIASAEAAAAKAREAAAAAEAAAARARAAEAKR 271
Score = 34.7 bits (80), Expect = 0.054
Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 10/85 (11%)
Query: 228 AKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQ 287
A +Q ++Q EIAA EK K +E ++ +LE ++ + + I LE QL +
Sbjct: 35 ADDKQLKQIQKEIAALEK---KIREQQDQRAKLEKQLKSLETE-------IASLEAQLIE 84
Query: 288 LQAAKEELEARQTELQLELQPIRNQ 312
++L + +L L + Q
Sbjct: 85 TADDLKKLRKQIADLNARLNALEVQ 109
Score = 30.5 bits (69), Expect = 1.1
Identities = 20/62 (32%), Positives = 36/62 (58%)
Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
++LKQ++ E+A ++ + E Q+ +LE+QLK L+ LEA+ E +L+ +R
Sbjct: 34 AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93
Query: 311 NQ 312
Q
Sbjct: 94 KQ 95
Score = 30.1 bits (68), Expect = 1.5
Identities = 19/88 (21%), Positives = 38/88 (43%), Gaps = 10/88 (11%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
+Q+ A ++ +Q QR KL+ ++ K + + LE ++ + DL +
Sbjct: 43 IQKEIAALEKKIREQQDQRAKLEKQL-------KSLETE---IASLEAQLIETADDLKKL 92
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTEL 302
++ I L +L L+ + E R E
Sbjct: 93 RKQIADLNARLNALEVQEREQRRRLAEQ 120
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 35.0 bits (81), Expect = 0.038
Identities = 22/99 (22%), Positives = 45/99 (45%), Gaps = 12/99 (12%)
Query: 200 NHELYMR-RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK 258
+ LY R + + + +E A+ + E ++K +K +E +RL+
Sbjct: 280 ENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEE-----ERELKKKFTEKIREKEKRLE 334
Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
+LE + + E +E+ +LEE K+L+ ++ LE
Sbjct: 335 ELEQNLIE------ERKELNSKLEEIQKKLEDLEKRLEK 367
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 35.2 bits (81), Expect = 0.042
Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 10/87 (11%)
Query: 242 AREKAEKKHQESVERLK-QLEVEM-----AKRDQDLMEAQEMIRRL---EEQLKQLQAAK 292
AREK E++ +E +R K +LE E A + ++ +EA + RRL E Q+K Q +
Sbjct: 8 AREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKADQQMQ 67
Query: 293 EELEA-RQTELQLELQPIRNQSNLNKV 318
E L+A R + + Q N +K+
Sbjct: 68 ESLQAGRGIVFSRIFEAVSFQGNGDKI 94
Score = 31.0 bits (70), Expect = 0.93
Identities = 19/55 (34%), Positives = 34/55 (61%), Gaps = 4/55 (7%)
Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQ--ESVERLKQLEVEMA--KRDQDLME 273
RE++ KQ+ + KL+ E A+E+A K+ + E+ +R ++L+ A K DQ + E
Sbjct: 14 REQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKADQQMQE 68
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 35.0 bits (81), Expect = 0.043
Identities = 26/106 (24%), Positives = 47/106 (44%), Gaps = 17/106 (16%)
Query: 214 DVQQMKAQAREEKNAKQQQR-----DKLQLEIAAREKA--------EKKHQESVERLKQL 260
D+ ++ EEKN+++ ++L+ ++ A EKK + E L Q
Sbjct: 126 DLNELVES--EEKNSEEIDHVLELYEELRRDVLANRHQYGEAAPELEKKLENIEEELSQF 183
Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
D +EA+E++ EE + L++ E + + ELQ EL
Sbjct: 184 VEL--TSSGDYIEAREVLEEAEEHMIALRSIMERIPSLLAELQTEL 227
>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein. This family
consists of proteins over 500 amino acids long in
Caenorhabditis elegans and several bacteria (Pseudomonas
aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
etc.). The function is unknown.
Length = 525
Score = 35.1 bits (81), Expect = 0.045
Identities = 23/111 (20%), Positives = 44/111 (39%), Gaps = 10/111 (9%)
Query: 210 PDTIDVQQMKAQAREEKNAK---QQQRDKLQLEIA----AREKAEKKHQESVERLKQLEV 262
PD + +++ Q RE + + + D L+ RE + ++S +R +
Sbjct: 68 PDPERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAKRNEPDLK 127
Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
E + I RL + ++ + ELE + +ELQ EL +
Sbjct: 128 EWFQAFDFNGSE---IERLLTEDREAERRIRELEKQLSELQNELNALLTGK 175
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 35.4 bits (82), Expect = 0.046
Identities = 26/97 (26%), Positives = 50/97 (51%), Gaps = 6/97 (6%)
Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
N E Y + + + A A E+ + + D+ A EKA+++ +++ ++Q
Sbjct: 441 NEEEYELELRLGRLKQRLDSATATPEELEQLEINDE------ALEKAQEEQEQAEANVEQ 494
Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
L+ E+ + + EA E ++R E +L QL+ A +ELE
Sbjct: 495 LQSELRQLRKRRDEALEALQRAERRLLQLRQALDELE 531
Score = 34.7 bits (80), Expect = 0.061
Identities = 22/114 (19%), Positives = 46/114 (40%), Gaps = 22/114 (19%)
Query: 222 AREEKNAK--------QQQRDKLQLEIAAREKAEKKHQESV----ERLKQLEVEMA---- 265
K A+ +QQ K +LE+ A + +++ + + +LKQL +
Sbjct: 648 RTALKQARLDLQRLQNEQQSLKDKLEL-AIAERKQQAETQLRQLDAQLKQLLEQQQAFLE 706
Query: 266 ----KRDQDLMEAQEMIRRLEEQLK-QLQAAKEELEARQTELQLELQPIRNQSN 314
+ E + +E +L QL +EA +T+ + L+ ++ Q +
Sbjct: 707 ALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYD 760
Score = 34.7 bits (80), Expect = 0.070
Identities = 28/107 (26%), Positives = 50/107 (46%), Gaps = 15/107 (14%)
Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK--------QLEVEM 264
+ QA E+ + ++ + A A K+ + ++RL+ +LE+ +
Sbjct: 617 QSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAI 676
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQL----QAAKEELEARQTELQLELQ 307
A+R Q +A+ +R+L+ QLKQL QA E L+ EL+ E
Sbjct: 677 AERKQ---QAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERL 720
Score = 34.7 bits (80), Expect = 0.080
Identities = 20/114 (17%), Positives = 48/114 (42%), Gaps = 16/114 (14%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ-------LEVEMAKRD 268
+++ Q R ++ ++ RD+L E++A + +E L+ ++E + D
Sbjct: 291 NRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQAD 350
Query: 269 QD--------LMEAQEMIRRLEEQLKQLQAAKEELE-ARQTELQLELQPIRNQS 313
D L E + + L + + +Q E L+ + +L+ +L+ +
Sbjct: 351 LDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERL 404
Score = 32.7 bits (75), Expect = 0.25
Identities = 25/122 (20%), Positives = 46/122 (37%), Gaps = 15/122 (12%)
Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAK-----QQQRDKLQLEIAAREKAEKKHQESVE 255
E + + +++ K + + Q R +L+ + E + + +
Sbjct: 393 LERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELRLG 452
Query: 256 RLKQL--------EVEMAKR--DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
RLKQ E D+ L +AQE + E ++QLQ+ +L R+ E
Sbjct: 453 RLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEA 512
Query: 306 LQ 307
LQ
Sbjct: 513 LQ 514
Score = 32.4 bits (74), Expect = 0.42
Identities = 22/108 (20%), Positives = 52/108 (48%), Gaps = 1/108 (0%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
+DV A E + QQ + LQ +A +++AE++ ++ L++ + A+
Sbjct: 591 QRLDVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTA 650
Query: 271 LMEAQEMIRRLEEQLKQLQAAKEE-LEARQTELQLELQPIRNQSNLNK 317
L +A+ ++RL+ + + L+ E + R+ + + +L+ + Q
Sbjct: 651 LKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLL 698
Score = 32.4 bits (74), Expect = 0.42
Identities = 19/95 (20%), Positives = 42/95 (44%), Gaps = 11/95 (11%)
Query: 216 QQMKAQAREEKNAKQQQR--------DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR 267
+ +K RE + + + ++L AA E A + + RLK+L+ + +
Sbjct: 706 EALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKA---RLKELKKQYDRE 762
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
L ++ L+ Q+++L+ E + R+ E+
Sbjct: 763 LASLDVDPNTVKELKRQIEELETTIERIAVRRPEV 797
Score = 29.7 bits (67), Expect = 2.3
Identities = 28/129 (21%), Positives = 55/129 (42%), Gaps = 18/129 (13%)
Query: 203 LYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEV 262
L ++ + D++Q++A + + + + + ++ + A + Q ERLKQ
Sbjct: 332 LEDQKGAFEDADIEQLQADLDQLPSIRSELEE-VEARLDALTGKHQDVQRKYERLKQKIK 390
Query: 263 EMAKRDQDLMEAQEMIRR----------------LEEQL-KQLQAAKEELEARQTELQLE 305
E +RD + + R LE QL +QL+A K E + EL+L
Sbjct: 391 EQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELR 450
Query: 306 LQPIRNQSN 314
L ++ + +
Sbjct: 451 LGRLKQRLD 459
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 33.9 bits (78), Expect = 0.047
Identities = 21/99 (21%), Positives = 45/99 (45%), Gaps = 9/99 (9%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
Q+ QAR E+ + + K +L + R ++L E ++ L E Q
Sbjct: 34 QRDIHQARVERME-EIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQ 92
Query: 276 EMIRRLEEQ--------LKQLQAAKEELEARQTELQLEL 306
+ +++ E+ L++L+ +++L+ Q EL+ +L
Sbjct: 93 DQLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKL 131
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 34.6 bits (80), Expect = 0.052
Identities = 17/94 (18%), Positives = 41/94 (43%), Gaps = 1/94 (1%)
Query: 226 KNAKQQQRDKLQLEIAAREKAEKKHQE-SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQ 284
K +++ L+ EIA+ + + L++ E+ + E ++ + L+++
Sbjct: 177 KPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQE 236
Query: 285 LKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
L++L A E L +++EL E+ +
Sbjct: 237 LQELTIAIEALTNKKSELLEEIAEAEKIREECRG 270
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 34.5 bits (80), Expect = 0.056
Identities = 24/103 (23%), Positives = 40/103 (38%), Gaps = 18/103 (17%)
Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE------------M 264
Q+ A R + A + LQ EI + + Q E E +
Sbjct: 128 QVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEA----QCEAEGTGGTGVAGKGPV 183
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
K ++ ++A + RLE +L AA +LEA++ L+ Q
Sbjct: 184 YKEKREKLDAAQ--ARLETLKARLDAAIAQLEAQKAALERNRQ 224
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 34.1 bits (78), Expect = 0.074
Identities = 24/115 (20%), Positives = 43/115 (37%), Gaps = 4/115 (3%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQES--VERLKQLEV 262
+ D + Q + + Q + E E E K +E ++ L++LE
Sbjct: 34 LDSELRDAEKERDTYKQYLSK--LESQNVEISNYEALDSELDELKKEEERLLDELEELEK 91
Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
E D +L+E QE +LE + Q + +L+ LQ + Q +
Sbjct: 92 EDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSL 146
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 34.2 bits (78), Expect = 0.078
Identities = 25/123 (20%), Positives = 56/123 (45%), Gaps = 10/123 (8%)
Query: 189 VNKRILALCMGNHELYMRRRKPDTIDVQQMKAQ--------AREEKNAKQQQRDKLQLEI 240
+K+++ ++E + R+ D D+++ ++Q +EE + KQ DK Q +
Sbjct: 178 SDKKVVEALREDNEKGVNFRR-DMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKA 236
Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
+ K ++ V + KQ E + + D +E + E Q ++++ A+ E++
Sbjct: 237 DFAQDNADKQRDEVRQ-KQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDE 295
Query: 301 ELQ 303
E
Sbjct: 296 EAL 298
Score = 28.4 bits (63), Expect = 6.4
Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 10/111 (9%)
Query: 208 RKPDTIDVQQMKAQAREEKNAKQQQRD---KLQLEIAAREKA---EKKHQESVERLKQLE 261
+K D Q KA + NA +Q+ + K Q + A K + V ++ E
Sbjct: 224 KKQIDADKAQQKAD-FAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKRE 282
Query: 262 VEMAKRD--QDLMEAQEMIRRLEEQLKQLQAAKEELEARQT-ELQLELQPI 309
+E A+ + ++ EA + LKQ A E+ + E Q + +P+
Sbjct: 283 IEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV 333
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 33.6 bits (77), Expect = 0.081
Identities = 17/65 (26%), Positives = 32/65 (49%), Gaps = 4/65 (6%)
Query: 248 KKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
K R ++LE +L E ++ + L+E+L+ L+ ELE+ +L+ +LQ
Sbjct: 62 LKAPVQSVRPQKLEEL----QGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQ 117
Query: 308 PIRNQ 312
+R
Sbjct: 118 LLREL 122
Score = 32.4 bits (74), Expect = 0.18
Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 6/85 (7%)
Query: 226 KNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQL 285
K Q R + E+ K ++L +L+ E+ ++ + E + + L+E L
Sbjct: 63 KAPVQSVRPQKLEELQGELSELK------QQLSELQEELEDLEERIAELESELEDLKEDL 116
Query: 286 KQLQAAKEELEARQTELQLELQPIR 310
+ L+ + LE R L+ ++ +
Sbjct: 117 QLLRELLKSLEERLESLEESIKELA 141
Score = 29.3 bits (66), Expect = 1.7
Identities = 18/76 (23%), Positives = 39/76 (51%), Gaps = 1/76 (1%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREK-AEKKHQESVERLKQLEVEMAKRDQDLM 272
D+++ RE + +++ + L+ I K + Q+ E +++L E+ + ++L
Sbjct: 111 DLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQ 170
Query: 273 EAQEMIRRLEEQLKQL 288
QE I+ L+ L+QL
Sbjct: 171 RLQEAIQELQSLLEQL 186
Score = 29.0 bits (65), Expect = 2.4
Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 15/122 (12%)
Query: 203 LYMRRRKPDTIDVQQMKAQA-REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE 261
L K V+ K + + E + +QQ +LQ E+ E E++ E L+ L+
Sbjct: 57 LLCLLLKAPVQSVRPQKLEELQGELSELKQQLSELQEEL---EDLEERIAELESELEDLK 113
Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQL-----------QAAKEELEARQTELQLELQPIR 310
++ + L +E + LEE +K+L + EEL LQ LQ ++
Sbjct: 114 EDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQ 173
Query: 311 NQ 312
Sbjct: 174 EA 175
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 34.3 bits (79), Expect = 0.090
Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 10/120 (8%)
Query: 199 GNHELYMRRRKPD---TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVE 255
G+ +RRRK + + +++A E K + ++ + L EIA ++ Q S+E
Sbjct: 189 GDLTNVLRRRKKSELPSSILSEIEALEAELK-EQSEKYEDLAQEIAHLRNELEEAQRSLE 247
Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
L++ + DL E +E +LE QLK+++AA++ A+ EL + P+ NL
Sbjct: 248 SLEK---KFRSEGGDLFEERE---QLERQLKEIEAARKANRAQLRELAADPLPLLLIPNL 301
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 34.3 bits (78), Expect = 0.090
Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 225 EKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE----VEMAKRDQDLMEAQEMIRR 280
+ KQ ++D +LE K EK Q + E+L L + +++++L++ Q + +
Sbjct: 275 KSRKKQMEKDNSELE----LKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEK 330
Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
L ++ + L K EL Q LQL+ +
Sbjct: 331 LNKERRLLNQEKTELLVEQGRLQLQADRHQEHI 363
Score = 32.3 bits (73), Expect = 0.36
Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 4/97 (4%)
Query: 209 KPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK----QLEVEM 264
K D+ + Q +EK KQ + D + +I K + QE ++ LK +L+ E
Sbjct: 814 KLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEK 873
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
+ +L Q+ +L E ++Q+ E++ + +
Sbjct: 874 LQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQ 910
Score = 29.2 bits (65), Expect = 3.9
Identities = 19/94 (20%), Positives = 50/94 (53%), Gaps = 2/94 (2%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQL--EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
Q+++ E K KQ + ++ +I ++E + +E V+ + + R +++
Sbjct: 200 QKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEH 259
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
I +L+ ++K L++ K+++E +EL+L+++
Sbjct: 260 NLSKIMKLDNEIKALKSRKKQMEKDNSELELKME 293
>gnl|CDD|241235 cd00900, PH-like, Pleckstrin homology-like domain. The PH-like
family includes the PH domain, both the Shc-like and
IRS-like PTB domains, the ran-binding domain, the EVH1
domain, a domain in neurobeachin and the third domain of
FERM. All of these domains have a PH fold, but lack
significant sequence similarity. They are generally
involved in targeting to protein to the appropriate
cellular location or interacting with a binding partner.
This domain family possesses multiple functions
including the ability to bind inositol phosphates and to
other proteins.
Length = 88
Score = 31.6 bits (72), Expect = 0.10
Identities = 11/54 (20%), Positives = 19/54 (35%), Gaps = 6/54 (11%)
Query: 148 TPKIGFPWSEIRNISFN-----DRKFIIKPIDKKAPDFVFFAPRVRVNKRILAL 196
K+ P S+I ++ + DR +++ D K F I A
Sbjct: 36 RLKLKIPISDIVKVNPSRASDSDRALVLRLKD-KRESLGFSFKSKEEAVLIKAA 88
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 33.9 bits (78), Expect = 0.11
Identities = 17/68 (25%), Positives = 37/68 (54%)
Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
+ K E++ +E ++ E+ ++ + + +E + RLEE+ +L+ EEL+ +L
Sbjct: 396 KVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKL 455
Query: 303 QLELQPIR 310
+ EL+ R
Sbjct: 456 ESELERFR 463
Score = 33.1 bits (76), Expect = 0.21
Identities = 16/68 (23%), Positives = 33/68 (48%), Gaps = 1/68 (1%)
Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
E REK + +E E + E + K ++ + +E L+ +L++L+ E+LE+
Sbjct: 400 EERPREKEGTEEEERRE-ITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESE 458
Query: 299 QTELQLEL 306
+ E+
Sbjct: 459 LERFRREV 466
Score = 31.6 bits (72), Expect = 0.62
Identities = 27/100 (27%), Positives = 49/100 (49%), Gaps = 13/100 (13%)
Query: 218 MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
K + E K+ ++ + EI EK KK +E+VERL++ E ++ ++L E +
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEE---ENSELKRELEELKRE 451
Query: 278 IRRLEEQLKQLQAAKE-------ELEARQTE---LQLELQ 307
I +LE +L++ + E+ AR L+ EL+
Sbjct: 452 IEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELE 491
Score = 28.1 bits (63), Expect = 7.1
Identities = 13/72 (18%), Positives = 34/72 (47%), Gaps = 5/72 (6%)
Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
A K +++ + + + E + + I++LEE +++L+ EL+
Sbjct: 392 EALSKVKEEERPREKEGTEEEERREITVYE-----KRIKKLEETVERLEEENSELKRELE 446
Query: 301 ELQLELQPIRNQ 312
EL+ E++ + ++
Sbjct: 447 ELKREIEKLESE 458
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 33.7 bits (77), Expect = 0.11
Identities = 29/101 (28%), Positives = 56/101 (55%), Gaps = 17/101 (16%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
+R P+ +D+++ + +E+ Q+++++L E+ E+ E +++E ERLK+LEVE +
Sbjct: 125 QRVPEYMDLKEDYEELKEKLEELQKEKEELLKEL---EELEAEYEEVQERLKRLEVENS- 180
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
RLEE LK+L +L+ R EL+ ++
Sbjct: 181 -------------RLEEMLKKLPGEVYDLKKRWDELEPGVE 208
Score = 29.5 bits (66), Expect = 2.0
Identities = 15/47 (31%), Positives = 28/47 (59%)
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+R + M+ +E L+E+L++LQ KEEL EL+ E + ++ +
Sbjct: 125 QRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQER 171
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 33.0 bits (76), Expect = 0.11
Identities = 20/76 (26%), Positives = 41/76 (53%), Gaps = 3/76 (3%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
Q +KA ++K AK+ +++K + AR+ A+ E + ++ + E A+RD++L +
Sbjct: 5 QLLKAGLVDKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQR 64
Query: 276 EMIRRLEE---QLKQL 288
+ + Q+KQL
Sbjct: 65 QAEAEQKAIVAQIKQL 80
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 34.0 bits (78), Expect = 0.12
Identities = 16/110 (14%), Positives = 36/110 (32%), Gaps = 9/110 (8%)
Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL 260
H L + R + A+ R + + +A + +E+ RL+ L
Sbjct: 861 HTLEVAVR--ELRHAATRAAEQRAR---AARAESDAREAAEDAAEARAEAEEASLRLRTL 915
Query: 261 EVEMAKRDQDLM----EAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
E + ++ E + + +L +L A E + + +
Sbjct: 916 EESVGAMVDEIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKR 965
Score = 32.9 bits (75), Expect = 0.29
Identities = 22/100 (22%), Positives = 41/100 (41%), Gaps = 5/100 (5%)
Query: 220 AQAREEKNAKQQQRDKLQLEI-AAREKAEKKHQESV----ERLKQLEVEMAKRDQDLMEA 274
AREE+ + + L+ E A R + E E L++ + +A
Sbjct: 293 ETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADA 352
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
++ IR E +L++ + +E R + + EL+ R Q
Sbjct: 353 RQAIREAESRLEEERRRLDEEAGRLDDAERELRAAREQLA 392
Score = 31.3 bits (71), Expect = 0.71
Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 6/81 (7%)
Query: 238 LEIAARE--KAEKKHQESVERLKQLEVEMAKRDQDLME----AQEMIRRLEEQLKQLQAA 291
LE+A RE A + E R + E + + +D E A+E RL + + A
Sbjct: 863 LEVAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAM 922
Query: 292 KEELEARQTELQLELQPIRNQ 312
+E+ AR E + L +
Sbjct: 923 VDEIRARLAETRAALASGGRE 943
Score = 30.2 bits (68), Expect = 1.6
Identities = 20/98 (20%), Positives = 42/98 (42%), Gaps = 10/98 (10%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
++++A R +N Q+ R R +A RL+ + + + +DL A+
Sbjct: 240 ERLEALERALRNFLQRYR--RYARTMLRRRAT--------RLRSAQTQYDQLSRDLGRAR 289
Query: 276 EMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
+ + E+ ++L A E LE L+ L+ ++
Sbjct: 290 DELETAREEERELDARTEALEREADALRTRLEALQGSP 327
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 33.6 bits (77), Expect = 0.13
Identities = 20/95 (21%), Positives = 45/95 (47%), Gaps = 1/95 (1%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
D + + + Q+ +LQ ++ +KA ++ E L+ ++ + + L E
Sbjct: 492 DANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSEDSSLLKSKLEEHLEQLHE 551
Query: 274 AQEMIRRLEEQLKQLQA-AKEELEARQTELQLELQ 307
A E +++ EQ+++L+ + L + EL+ LQ
Sbjct: 552 ANEELQKKREQIEELEPDQDQNLSRKIAELEAALQ 586
Score = 28.2 bits (63), Expect = 6.5
Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 13/86 (15%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ------ESVERLKQLEVEMAKRDQ 269
+Q+K EE+NA Q +QLE ++ + Q + E +L E K D+
Sbjct: 332 RQVKLL--EERNAMYMQNT-VQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADK 388
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEEL 295
E +RLEE+L+ LQ KE L
Sbjct: 389 LEFE----YKRLEEKLEALQKEKERL 410
Score = 27.8 bits (62), Expect = 9.8
Identities = 19/96 (19%), Positives = 46/96 (47%), Gaps = 12/96 (12%)
Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE 276
+ +A EE K++Q ++L+ + Q ++ +LE + K+D+D+ +E
Sbjct: 547 EQLHEANEELQKKREQIEELE---------PDQDQNLSRKIAELEAALQKKDEDMRAMEE 597
Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
++ E+ +++ L+ +Q E+Q ++ Q
Sbjct: 598 RYKKYVEKAREVIKT---LDPKQNPASPEIQLLKKQ 630
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 33.4 bits (77), Expect = 0.13
Identities = 32/135 (23%), Positives = 53/135 (39%), Gaps = 15/135 (11%)
Query: 188 RVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAE 247
R+N +I L L + R+ D+Q A R +A + +R +LQ A +
Sbjct: 57 RLNSQIAEL---ADLLSLERQG--NQDLQDSVANLRASLSAAEAERSRLQ---ALLAELA 108
Query: 248 KKHQESVERLKQLEVEMAKRDQDLMEAQ-------EMIRRLEEQLKQLQAAKEELEARQT 300
+ R +L E+ Q A + I L QL L+AA + E R
Sbjct: 109 GAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDR 168
Query: 301 ELQLELQPIRNQSNL 315
E Q ++ + + N+
Sbjct: 169 ESQAKIADLGRRLNV 183
Score = 30.3 bits (69), Expect = 1.2
Identities = 26/93 (27%), Positives = 36/93 (38%), Gaps = 7/93 (7%)
Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK--QLEVEMAKRDQDLMEAQEMIRR 280
E + K D+L +IA E S+ER L+ +A L A+ R
Sbjct: 45 SREISGKDSALDRLNSQIA-----ELADLLSLERQGNQDLQDSVANLRASLSAAEAERSR 99
Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
L+ L +L A E R EL EL + S
Sbjct: 100 LQALLAELAGAGAAAEGRAGELAQELDSEKQVS 132
>gnl|CDD|241347 cd13193, FERM_C_FARP1-like, FERM, RhoGEF and pleckstrin
domain-containing protein 1 and related proteins FERM
domain. Members here include FARP1 (also called
Chondrocyte-derived ezrin-like protein; PH
domain-containing family C member 2), FARP2 (also called
FIR/FERM domain including RhoGEF; FGD1-related
Cdc42-GEF/FRG), and FRMD7(FERM domain containing 7).
FARP1 and FARP2 are members of the Dbl family guanine
nucleotide exchange factors (GEFs) which are upstream
positive regulators of Rho GTPases. FARP1 has increased
expression in differentiated chondrocytes. FARP2 is
thought to regulate neurite remodeling by mediating the
signaling pathways from membrane proteins to Rac. It is
found in brain, lung, and testis, as well as embryonic
hippocampal and cortical neurons. These members are
composed of a N-terminal FERM domain, a proline-rich
(PR) domain, Dbl-homology (DH), and two C-terminal PH
domains. Other members in this family do not contain
the DH domains such as the Human FERM domain containing
protein 7 and Caenorhabditis elegans CFRM3, both of
which have unknown functions. They contain an N-terminal
FERM domain, a PH domain, followed by a FA (FERM
adjacent) domain. The FERM domain has a cloverleaf
tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3
within the FERM domain is part of the PH domain family.
The FERM domain is found in the cytoskeletal-associated
proteins such as ezrin, moesin, radixin, 4.1R, and
merlin. These proteins provide a link between the
membrane and cytoskeleton and are involved in signal
transduction pathways. The FERM domain is also found in
protein tyrosine phosphatases (PTPs) , the tyrosine
kinases FAK and JAK, in addition to other proteins
involved in signaling. This domain is structurally
similar to the PH and PTB domains and consequently is
capable of binding to both peptides and phospholipids at
different sites.
Length = 122
Score = 31.9 bits (73), Expect = 0.14
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 152 GFPWSEIRNISFNDRKFIIK 171
F W++IR +SF ++F+IK
Sbjct: 44 TFSWAKIRKLSFKRKRFLIK 63
Score = 30.4 bits (69), Expect = 0.46
Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 12/79 (15%)
Query: 45 IAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPLQFK-- 102
IA+ E+YG+ K+++G +L L V +G+ +++ T + + K L FK
Sbjct: 2 IARRCELYGIRLHPAKDREGVDLNLAVAHMGILVFQ---GFTKINTFSWAKIRKLSFKRK 58
Query: 103 -FRAKFYPE------DVVE 114
F K +PE D VE
Sbjct: 59 RFLIKLHPEAYGYYKDTVE 77
>gnl|CDD|235319 PRK04914, PRK04914, ATP-dependent helicase HepA; Validated.
Length = 956
Score = 33.7 bits (78), Expect = 0.14
Identities = 26/98 (26%), Positives = 45/98 (45%), Gaps = 20/98 (20%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL-EVEMAKRDQDLM 272
D+ ++ +A E A+ Q R +L A+++A++K + RL+ L V RD
Sbjct: 873 DIHKLLQKA--EAQAEAQAR---ELIEQAKQEADEKLSAELSRLEALKAVNPNIRD---- 923
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
+ I LE Q +++ A + + QL L IR
Sbjct: 924 ---DEIEALESQRQEVLEALD-------QAQLRLDAIR 951
>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
Length = 857
Score = 33.7 bits (77), Expect = 0.14
Identities = 29/133 (21%), Positives = 64/133 (48%), Gaps = 11/133 (8%)
Query: 193 ILALCMGNHELYMRRRKPD-TID-VQQMKAQAREEKNAKQQQRDKLQ---LEIAAREKAE 247
I+A +H R+ PD ID + + + R + ++K ++ D+L +++ ++A
Sbjct: 370 IVAAATLSHRYIADRQLPDKAIDLIDEAASSIRMQIDSKPEELDRLDRRIIQLKLEQQAL 429
Query: 248 KKHQE--SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
K + S +RL L E++ +++ E +E + + L Q K ELE + ++
Sbjct: 430 MKESDEASKKRLDMLNEELSDKERQYSELEEEWKAEKASLSGTQTIKAELE----QAKIA 485
Query: 306 LQPIRNQSNLNKV 318
++ R +L ++
Sbjct: 486 IEQARRVGDLARM 498
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 33.8 bits (77), Expect = 0.15
Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 8/103 (7%)
Query: 204 YMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE 263
Y RR+ + + +E+ + KQ Q+++++L +EK K +E V V+
Sbjct: 3 YRNRREANI----NNNDRMQEKDDEKQDQKNRMEL----KEKVLDKKEEVVTDNVDSPVK 54
Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
++L A E+ + +E+ KQL + E Q E+Q E+
Sbjct: 55 EQSSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEI 97
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 33.5 bits (77), Expect = 0.16
Identities = 20/86 (23%), Positives = 48/86 (55%), Gaps = 2/86 (2%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEI--AAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
+ ++ + RE K++++ +L+ E A E+A + + + E L+Q+E ++ + ++
Sbjct: 617 ALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREER 676
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEA 297
+ Q I +E +L++L+ +E EA
Sbjct: 677 DDLQAEIGAVENELEELEELRERREA 702
Score = 33.1 bits (76), Expect = 0.22
Identities = 23/98 (23%), Positives = 52/98 (53%), Gaps = 9/98 (9%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKL-----QLEIAA---REKAEKKHQESVERLKQLEVEMA 265
D++++ A+ RE K+++ ++L +LE A RE A + +E+ E +++ E+
Sbjct: 520 DLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVA-ELN 578
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
+ +L E E + R+ L + A++E+E + + +
Sbjct: 579 SKLAELKERIESLERIRTLLAAIADAEDEIERLREKRE 616
Score = 32.7 bits (75), Expect = 0.30
Identities = 22/94 (23%), Positives = 50/94 (53%), Gaps = 4/94 (4%)
Query: 219 KAQAREEKNA-KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
A EE+ +++ +L+ E+ E+A + ++ E +++LE E+ + + +A
Sbjct: 350 DADDLEERAEELREEAAELESEL---EEAREAVEDRREEIEELEEEIEELRERFGDAPVD 406
Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQPIRN 311
+ E+ L++L+ ++EL R+ EL+ L+ R
Sbjct: 407 LGNAEDFLEELREERDELREREAELEATLRTARE 440
Score = 31.9 bits (73), Expect = 0.53
Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 7/79 (8%)
Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
E++ ++ RD+ + H+E E L+ LE E+ + + E + L E
Sbjct: 227 EQREQARETRDEADEVLEE-------HEERREELETLEAEIEDLRETIAETEREREELAE 279
Query: 284 QLKQLQAAKEELEARQTEL 302
+++ L+ EELE + +L
Sbjct: 280 EVRDLRERLEELEEERDDL 298
Score = 30.8 bits (70), Expect = 1.3
Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 4/111 (3%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
R + + +V + + REE + + + L+ IA E+ ++ E V L++ E+
Sbjct: 233 RETRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELE 292
Query: 266 KRDQDLMEAQEM----IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+ DL+ + +E + ++L+ EEL R E ++ Q +
Sbjct: 293 EERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEE 343
Score = 30.4 bits (69), Expect = 1.3
Identities = 22/83 (26%), Positives = 44/83 (53%), Gaps = 5/83 (6%)
Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME----AQEMIRRLEEQ 284
+++ ++L+ E+ E+ ++ +E +ER + L VE R + L E +E+I E
Sbjct: 473 DRERVEELEAELEDLEEEVEEVEERLERAEDL-VEAEDRIERLEERREDLEELIAERRET 531
Query: 285 LKQLQAAKEELEARQTELQLELQ 307
+++ + EEL R EL+ E +
Sbjct: 532 IEEKRERAEELRERAAELEAEAE 554
Score = 30.4 bits (69), Expect = 1.5
Identities = 18/58 (31%), Positives = 31/58 (53%)
Query: 246 AEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
EK+ ++ ERL LE E+A+ D+++ +E + E + EE E R+ EL+
Sbjct: 197 EEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELE 254
Score = 30.0 bits (68), Expect = 1.8
Identities = 21/97 (21%), Positives = 44/97 (45%), Gaps = 11/97 (11%)
Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
+ RE +++ ++ + E+A + +E +E L+++ +A +A++ I R
Sbjct: 555 EKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIA----DAEDEIER 610
Query: 281 LEEQLKQLQA----AKEELEA---RQTELQLELQPIR 310
L E+ + L +E L R+ EL+ E R
Sbjct: 611 LREKREALAELNDERRERLAEKRERKRELEAEFDEAR 647
Score = 30.0 bits (68), Expect = 2.2
Identities = 27/105 (25%), Positives = 47/105 (44%), Gaps = 8/105 (7%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE---VEMAKR 267
D ++ ++ AQA E A+ + D LE E+AE+ +E+ E +LE + R
Sbjct: 328 DRLEECRVAAQAHNE-EAESLREDADDLE----ERAEELREEAAELESELEEAREAVEDR 382
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+++ E +E I L E+ E EL+ E +R +
Sbjct: 383 REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELRER 427
Score = 29.6 bits (67), Expect = 2.3
Identities = 18/81 (22%), Positives = 40/81 (49%), Gaps = 3/81 (3%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRD---QDLMEAQE 276
+ RE + + + L+ E+ E+ ++ ++ VE ++E +R+ + + E +E
Sbjct: 471 EEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRE 530
Query: 277 MIRRLEEQLKQLQAAKEELEA 297
I E+ ++L+ ELEA
Sbjct: 531 TIEEKRERAEELRERAAELEA 551
Score = 29.2 bits (66), Expect = 3.5
Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 7/94 (7%)
Query: 220 AQAREEKNAKQQQRDKLQ-LEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
A A +E +++R+ L L RE+ +K ER ++LE E + + EA+E
Sbjct: 602 ADAEDEIERLREKREALAELNDERRERLAEKR----ERKRELEAEF--DEARIEEAREDK 655
Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
R EE L+Q++ +EL + +LQ E+ + N+
Sbjct: 656 ERAEEYLEQVEEKLDELREERDDLQAEIGAVENE 689
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 33.1 bits (76), Expect = 0.16
Identities = 23/114 (20%), Positives = 47/114 (41%), Gaps = 11/114 (9%)
Query: 188 RVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAR-----EEKNAKQQQRDKLQLEIAA 242
R N+ + LC NH+ R T + K Q + + ++ ++
Sbjct: 256 RANREVAILC--NHQ----RTVSKTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDL 309
Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
+ K + K + E+L E K + + ++ I RLEE++++L+ + E
Sbjct: 310 KRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKE 363
>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80. Spc24 is a component of
the evolutionarily conserved kinetochore-associated
Ndc80 complex and is involved in chromosome segregation.
Length = 117
Score = 31.5 bits (72), Expect = 0.17
Identities = 11/45 (24%), Positives = 22/45 (48%)
Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
+ + + + + + LE +L LQ+ EEL+ R EL+ +
Sbjct: 1 IQELENEKFRLAKELNDLESELNNLQSELEELKERLEELEEQEVE 45
Score = 26.9 bits (60), Expect = 6.9
Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 3/52 (5%)
Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
++ L+ + +AK DL + + L+ +L++L+ EELE ++ E
Sbjct: 1 IQELENEKFRLAKELNDL---ESELNNLQSELEELKERLEELEEQEVEGDEV 49
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 33.2 bits (75), Expect = 0.19
Identities = 15/72 (20%), Positives = 43/72 (59%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
+ R + + ++ ++M+ RE ++ +R++L+ + R++ ++ +E V+RL++ +E
Sbjct: 463 LEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRLEK 522
Query: 265 AKRDQDLMEAQE 276
A+R+ ++ E
Sbjct: 523 ARRNSYFLKGME 534
Score = 28.9 bits (64), Expect = 4.8
Identities = 17/80 (21%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 221 QAREEKNAKQQQR------DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
+AR EK ++ +K ++E RE+ E+ +E +ER+++ +E + +++ +E
Sbjct: 437 RARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERERLER 496
Query: 275 QEMIRRLEEQLKQLQAAKEE 294
+ R ++L++ + + E
Sbjct: 497 DRLERDRLDRLERERVDRLE 516
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 32.6 bits (75), Expect = 0.23
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 10/96 (10%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ-LEVEMAK--RD 268
I+ ++ KA+A E ++ R+K + E E E+ +QE V++L + +E E K +
Sbjct: 205 AIEAERAKAEAAEA--EQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAE 262
Query: 269 QDLMEAQEMIRRLEEQLKQLQAA-KEELEARQTELQ 303
Q+ M ++ EQ + L+ K E E+ Q E+Q
Sbjct: 263 QERMLEHKLQ----EQEELLKEGFKTEAESLQKEIQ 294
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 31.4 bits (72), Expect = 0.24
Identities = 19/84 (22%), Positives = 43/84 (51%), Gaps = 10/84 (11%)
Query: 216 QQMKAQAREEKNAKQQQ----RDKLQLEI------AAREKAEKKHQESVERLKQLEVEMA 265
++ + + E+ ++++ ++ + A R +AEKK +E + K+ E E+
Sbjct: 25 RREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKEEEIK 84
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQ 289
+ +L E + I +LEE+L++ Q
Sbjct: 85 ELKAELEELKAEIEKLEEKLEEYQ 108
Score = 29.1 bits (66), Expect = 1.4
Identities = 22/95 (23%), Positives = 53/95 (55%), Gaps = 1/95 (1%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVER-LKQLEVEMAKRDQDLM 272
+VQ REE +++ + + E+ +E+ ++ ++ LK+ E + + ++
Sbjct: 11 EVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAE 70
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
E +++ + EE++K+L+A EEL+A +L+ +L+
Sbjct: 71 EEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLE 105
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 33.0 bits (76), Expect = 0.24
Identities = 14/78 (17%), Positives = 31/78 (39%)
Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
+ +N + ++ E ++ +S + + E+ + E +E + LE
Sbjct: 139 DPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEA 198
Query: 284 QLKQLQAAKEELEARQTE 301
QL+QLQ E + +
Sbjct: 199 QLEQLQEKAAETSQERKQ 216
Score = 31.1 bits (71), Expect = 0.90
Identities = 17/62 (27%), Positives = 37/62 (59%), Gaps = 3/62 (4%)
Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
Q+ V LKQ ++E+ R++ ++Q + +++L L+ ELE +Q EL+ +L+ ++
Sbjct: 148 QQEVLTLKQ-QLELQAREKA--QSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQ 204
Query: 311 NQ 312
+
Sbjct: 205 EK 206
Score = 30.3 bits (69), Expect = 1.6
Identities = 17/79 (21%), Positives = 38/79 (48%), Gaps = 5/79 (6%)
Query: 230 QQQRDKLQLEIAA-REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
+ LQ E+ +++ E + +E Q + + Q+L+ + + LEE+ ++L
Sbjct: 141 ENLLHALQQEVLTLKQQLELQARE----KAQSQALAEAQQQELVALEGLAAELEEKQQEL 196
Query: 289 QAAKEELEARQTELQLELQ 307
+A E+L+ + E E +
Sbjct: 197 EAQLEQLQEKAAETSQERK 215
Score = 28.8 bits (65), Expect = 4.4
Identities = 18/87 (20%), Positives = 38/87 (43%), Gaps = 2/87 (2%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK-RDQDLME 273
V +K Q + K Q + + + E E E+ ++LE ++ + +++
Sbjct: 151 VLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAET 210
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQT 300
+QE ++ +E Q A + EL +T
Sbjct: 211 SQERKQKRKEITDQ-AAKRLELSEEET 236
>gnl|CDD|227635 COG5325, COG5325, t-SNARE complex subunit, syntaxin [Intracellular
trafficking and secretion].
Length = 283
Score = 32.5 bits (74), Expect = 0.26
Identities = 18/89 (20%), Positives = 36/89 (40%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
+QQ AQ R+ + + + + E+ E + S +Q + +++L
Sbjct: 127 IQQKSAQFRKYQVLQAKFLRNKNNDQHPLEEEEDEESLSSLGSQQTLQQQGLSNEELEYQ 186
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
Q +I +E++K L EL +L
Sbjct: 187 QILITERDEEIKNLARGIYELNEIFRDLG 215
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 32.5 bits (74), Expect = 0.26
Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 3/105 (2%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
Q K Q +Q LQ EIA E E + E + L+ LE E+ + ++L +
Sbjct: 75 NQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQL 134
Query: 276 EMIRRLEEQLKQLQAAKEEL---EARQTELQLELQPIRNQSNLNK 317
+ + E+ ++ +E L ++ E LEL+ + + L++
Sbjct: 135 QNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHE 179
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 32.7 bits (75), Expect = 0.26
Identities = 18/94 (19%), Positives = 35/94 (37%), Gaps = 7/94 (7%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
+Q+ R+E+ + +LQ E+ +E ++L Q + +R L
Sbjct: 31 LQLLQEAVNSKRQEEAEPAAEEAELQAELIQQE------LAINDQLSQALNQQTERLNAL 84
Query: 272 MEAQEMIRRLEEQLKQLQ-AAKEELEARQTELQL 304
+ L QL Q +E++ + L L
Sbjct: 85 ASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLL 118
Score = 32.0 bits (73), Expect = 0.50
Identities = 18/100 (18%), Positives = 41/100 (41%), Gaps = 6/100 (6%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+++ A A +++ LQ RE+ + S+ + L ++ + +
Sbjct: 79 ERLNALASDDRQLANLLLQLLQSSRTIREQIAVL-RGSLLLSRILLQQLGPLPEAGQPQE 137
Query: 276 EMIRRLEEQLKQLQAAKEE---LEARQTELQLELQPIRNQ 312
+ + ++ LQA K LE + +L E++ I +Q
Sbjct: 138 QFE--VTQERDALQAEKAYINALEGQAEQLTAEVRDILDQ 175
Score = 27.7 bits (62), Expect = 8.9
Identities = 21/91 (23%), Positives = 36/91 (39%), Gaps = 9/91 (9%)
Query: 227 NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA-----KRDQDLMEAQEMIRRL 281
Q +R+ L E A E A + QE+V +Q E E A + + + + + +L
Sbjct: 11 ELLQSRRELLTAESAQLEAALQLLQEAVNSKRQEEAEPAAEEAELQAELIQQELAINDQL 70
Query: 282 EEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+ L Q Q + A +L + Q
Sbjct: 71 SQALNQ-QTERLNALASD---DRQLANLLLQ 97
>gnl|CDD|222629 pfam14257, DUF4349, Domain of unknown function (DUF4349). This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 282 and
353 amino acids in length. There is a single completely
conserved residue D that may be functionally important.
Length = 262
Score = 32.2 bits (74), Expect = 0.29
Identities = 15/48 (31%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
+ S ERL L +E A +DL+E + + R++ +++ L+A L+ R
Sbjct: 145 RASEERLLAL-LERAGTVEDLLEVERELSRVQAEIESLEAQLRYLDDR 191
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 30.7 bits (70), Expect = 0.29
Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 17/94 (18%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAARE--------KAEKKHQESVERLKQLEVEMA 265
++ A+ N + LE A +E + Q VE +Q E+A
Sbjct: 28 AIETQIEYAKAHGNQHRI----AGLEKALKEVQAHCTDAGLRAERQAKVEEKEQ---EVA 80
Query: 266 KRDQDLMEAQEM--IRRLEEQLKQLQAAKEELEA 297
+R+Q+L EAQ ++E++ ++L A+ EL+
Sbjct: 81 EREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114
>gnl|CDD|219412 pfam07439, DUF1515, Protein of unknown function (DUF1515). This
family consists of several hypothetical bacterial
proteins of around 130 residues in length. Members of
this family seem to be found exclusively in Rhizobium
species. The function of this family is unknown.
Length = 112
Score = 30.7 bits (69), Expect = 0.30
Identities = 16/62 (25%), Positives = 29/62 (46%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
QQ+ E KN ++ R A+R ++ E VER+ LE ++K + D+ +
Sbjct: 8 QQLGTLQAEVKNLREDIRRSEDRSDASRASMYRRMDELVERVTTLEGSVSKLNADITSMK 67
Query: 276 EM 277
+
Sbjct: 68 PV 69
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 31.8 bits (73), Expect = 0.33
Identities = 21/83 (25%), Positives = 45/83 (54%), Gaps = 2/83 (2%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
++ +EK + + + K EIA E+A++K++E +L +E ++ + ++ A+ I
Sbjct: 86 RSLKDDEKMEQLEAQLKEAKEIA--EEADRKYEEVARKLVVVEGDLERAEERAEAAESKI 143
Query: 279 RRLEEQLKQLQAAKEELEARQTE 301
LEE+LK + + LE + +
Sbjct: 144 VELEEELKVVGNNLKSLEVSEEK 166
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 32.0 bits (73), Expect = 0.34
Identities = 22/127 (17%), Positives = 49/127 (38%), Gaps = 21/127 (16%)
Query: 188 RVNKRILALCMGNHELYMRRRKPDT----IDVQQMKAQA---REEKNAKQQQRDKLQLEI 240
+V + + + +L K + + Q++A+ +EE K+++ ++L+ +
Sbjct: 42 KVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRAL 101
Query: 241 AAR-EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
A R + +R L + Q+ I+R +L L L ++
Sbjct: 102 AQRRSDLSSASYQLEKRRA----------SQLEKLQDEIKRTRSKLNALH---SLLAEKR 148
Query: 300 TELQLEL 306
+ L EL
Sbjct: 149 SFLCREL 155
Score = 30.4 bits (69), Expect = 1.1
Identities = 20/90 (22%), Positives = 45/90 (50%), Gaps = 1/90 (1%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA-QEM 277
K A K +++++L A + +++ ++ ER+++L+ +A+R DL A ++
Sbjct: 56 KLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQL 115
Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+R QL++LQ + ++ L L
Sbjct: 116 EKRRASQLEKLQDEIKRTRSKLNALHSLLA 145
>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390). This is a
family of long proteins currently only found in the rice
genome. They have no known function. However they may be
some kind of transposable element.
Length = 843
Score = 32.5 bits (73), Expect = 0.34
Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 2/91 (2%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
+++A + A+ ++R+ + E A R +AE+ Q++ R +Q E + +A+E
Sbjct: 224 QSEAEDPAAAEARRREADRREAADRLREAEEAAQDAA-RARQAEEAAREEAARARQAEEA 282
Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQP 308
R E + +AA AR +L P
Sbjct: 283 AREAEAAFRADEAAATSEAARDEAAGAQLAP 313
Score = 29.4 bits (65), Expect = 2.8
Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 11/93 (11%)
Query: 217 QMKAQAREEKNAKQQQRDKLQLE--IAAREKAEKKHQ------ESVERLKQLEVEMAKRD 268
+ +A A EE+ A ++RD E +A RE H+ E RL+ E + +R+
Sbjct: 530 EARAMALEER-ACAKERDLADREAAVAIREATLAAHEAACAEEEFALRLR--EDALTERE 586
Query: 269 QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
+ L EA+ ++L + L +AA+EE R E
Sbjct: 587 RALEEAEAAAQQLADSLFLREAAQEEQARRNLE 619
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 32.5 bits (74), Expect = 0.35
Identities = 23/101 (22%), Positives = 43/101 (42%), Gaps = 13/101 (12%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEK-----KHQESVERLKQLEVEMAKRD 268
D+ +M +A + K + R++ + A+ E K+ E++K ++ + +
Sbjct: 516 DIDRMVNEAEKYKAEDEANRER----VEAKNGLENYCYSMKNTLQDEKVKG-KLSDSDKA 570
Query: 269 QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPI 309
E + LE K A KEE E +Q E++ PI
Sbjct: 571 TIEKAIDEALEWLE---KNQLAEKEEFEHKQKEVESVCNPI 608
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 32.1 bits (73), Expect = 0.35
Identities = 25/109 (22%), Positives = 47/109 (43%), Gaps = 10/109 (9%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
+R + + A+ E+ +Q++ L +I +K K + E L+Q
Sbjct: 209 LREANDQIASLSEELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQH---- 264
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
L A++ R+L+ +L++LQ E E Q EL+ +RN+
Sbjct: 265 ------LAAAKDAQRQLQAELQELQDKYAECMEMLHEAQEELKNLRNKG 307
Score = 29.0 bits (65), Expect = 3.5
Identities = 33/125 (26%), Positives = 57/125 (45%), Gaps = 23/125 (18%)
Query: 211 DTIDVQQMKAQAREEKNAK-QQQRDKLQLEIAAREKAE--------KKHQESVERLKQLE 261
++ Q K + EE+N + + L+ E E+ E K+ +E+ +++ L
Sbjct: 161 FQLEALQEKLKLLEEENEHLRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLS 220
Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQLQ-------AAKEELEA-------RQTELQLELQ 307
E+AK+ +DL QE I L Q+ LQ EEL+ Q +LQ ELQ
Sbjct: 221 EELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQ 280
Query: 308 PIRNQ 312
++++
Sbjct: 281 ELQDK 285
>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 220 and
257 amino acids in length.
Length = 221
Score = 31.8 bits (73), Expect = 0.36
Identities = 13/50 (26%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQL--QAAKEELEARQTELQLELQPIRNQ 312
+ ++ +E E I LE+++ +L + KE+ R+ EL LE++ ++ +
Sbjct: 171 KEDLKERVERLEQIEELEKEIAKLKKKLKKEKQFNRKVELNLEIKKLKKE 220
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 32.5 bits (74), Expect = 0.36
Identities = 18/115 (15%), Positives = 44/115 (38%), Gaps = 9/115 (7%)
Query: 198 MGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL 257
+ + + ++++ + + K +Q+ ++L+ R K + +E+ E L
Sbjct: 517 IEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKE--RERNKKLELEKEAQEAL 574
Query: 258 KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
K L+ E+ ++L E + + + E+L + Q Q N
Sbjct: 575 KALKKEVESIIRELKEK-------KIHKAKEIKSIEDLVKLKETKQKIPQKPTNF 622
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 32.2 bits (74), Expect = 0.37
Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 7/66 (10%)
Query: 238 LEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
L EK + + E +K +E E L + ++ I+ LEE++ +L+ +ELE
Sbjct: 69 LNPLREEKKKVSVKSLEELIKDVEEE-------LEKIEKEIKELEEEISELENEIKELEQ 121
Query: 298 RQTELQ 303
L+
Sbjct: 122 EIERLE 127
Score = 28.4 bits (64), Expect = 5.4
Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 2/54 (3%)
Query: 246 AEKKHQESVER-LKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
K+ + VE LK+L E ++ E+IR ++E+L++++ +E L
Sbjct: 185 VLKELSDEVEEELKKLGFE-RLELEEEGTPSELIREIKEELEEIEKERESLLEE 237
>gnl|CDD|226203 COG3678, CpxP, P pilus assembly/Cpx signaling pathway, periplasmic
inhibitor/zinc-resistance associated protein
[Intracellular trafficking and secretion / Cell motility
and secretio / Signal transduction mechanisms /
Inorganic ion transport and metabolism].
Length = 160
Score = 31.2 bits (71), Expect = 0.37
Identities = 14/84 (16%), Positives = 36/84 (42%), Gaps = 3/84 (3%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
Q +AQ + ++ ++ + + + KA + ++ + + L K D + +
Sbjct: 72 AQRRAQREQLRSKRRALHELIAADQFDEAKARAQAEKMENQRQALRELRVKSDNQMYQV- 130
Query: 276 EMIRRLEEQLKQLQA-AKEELEAR 298
+ +L++L A +EE + R
Sbjct: 131 -LTPEQRAKLQELLAQRREERQQR 153
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 32.4 bits (74), Expect = 0.40
Identities = 16/72 (22%), Positives = 35/72 (48%), Gaps = 3/72 (4%)
Query: 249 KHQESVERLKQ---LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
+ V K+ LE E+ + ++ E +E + LE+++ +L+ L+ ++ L
Sbjct: 58 GGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDEN 117
Query: 306 LQPIRNQSNLNK 317
L+ + SNL+
Sbjct: 118 LEELSELSNLDI 129
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 31.8 bits (73), Expect = 0.42
Identities = 24/109 (22%), Positives = 43/109 (39%), Gaps = 16/109 (14%)
Query: 199 GNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK 258
GN+ Y+ ++ + Q +Q++ K Q I + A K +++ R+K
Sbjct: 233 GNYSSYLEQK-------AERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIK 285
Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+LE +EA+ R E+ K L R +L LE +
Sbjct: 286 RLEK---------LEARLAEERPVEEGKPLAFRFPPPGKRLGKLVLEFE 325
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 32.4 bits (73), Expect = 0.42
Identities = 21/106 (19%), Positives = 50/106 (47%), Gaps = 15/106 (14%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQES-----------VE 255
++K + + + + + + + AK+ + DK + E A + + ++K E
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
LK+ E E K+ ++L +A+E + E+ K+ + E + ++ E
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKK----EAEEDKKKAE 1747
Score = 31.3 bits (70), Expect = 0.85
Identities = 21/91 (23%), Positives = 50/91 (54%), Gaps = 1/91 (1%)
Query: 216 QQMKA-QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
++MKA +A++ + AK + + + E ++ + K +E+ E+ K E++ A+ + + A
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA 1664
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
+E + E++ K +A K E + ++ L+
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695
Score = 30.9 bits (69), Expect = 1.3
Identities = 26/122 (21%), Positives = 58/122 (47%), Gaps = 10/122 (8%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQES--VERLKQLEVE 263
+ + + I + +A E+K ++ + + E A E +K+ +E+ E LK+ E E
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713
Query: 264 MAKRDQDL--------MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
K+ ++L ++A+E + EE K+ + AK++ E ++ L+ + + +
Sbjct: 1714 EKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773
Query: 316 NK 317
K
Sbjct: 1774 RK 1775
Score = 30.5 bits (68), Expect = 1.4
Identities = 24/113 (21%), Positives = 56/113 (49%), Gaps = 3/113 (2%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQR-DKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
D + + KA+A E+K + +++ D + + ++KA++ +++ E K+ + E+
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKAD-ELK 1411
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT-ELQLELQPIRNQSNLNK 317
K +A E ++ EE+ K +A K+ EA++ E + + + + K
Sbjct: 1412 KAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKK 1464
Score = 30.1 bits (67), Expect = 1.8
Identities = 19/97 (19%), Positives = 44/97 (45%), Gaps = 1/97 (1%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ-ESVERLKQLEVEM 264
+ + ++ K +A K ++++ + + A E +K + + K+ E
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
K+ ++ +A E ++ EE K +A K+ EA++ E
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460
Score = 30.1 bits (67), Expect = 1.8
Identities = 22/95 (23%), Positives = 46/95 (48%), Gaps = 3/95 (3%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
++K D + + +A E+K A++ ++ + +A R+ E K E + EV
Sbjct: 1545 KKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK---AEEARIEEVMKLY 1601
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
++ M+A+E + E ++K + K E E ++ E
Sbjct: 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVE 1636
Score = 29.7 bits (66), Expect = 2.3
Identities = 20/86 (23%), Positives = 42/86 (48%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+ K +A E K A + ++ + + A K + + + + K+ E K D+ +A+
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493
Query: 276 EMIRRLEEQLKQLQAAKEELEARQTE 301
E ++ +E K +A K+ EA++ E
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAE 1519
Score = 29.7 bits (66), Expect = 2.7
Identities = 21/97 (21%), Positives = 48/97 (49%), Gaps = 2/97 (2%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEV--EM 264
++K D + + + +E K ++ K E+ A+KK E+ ++ ++ + E
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEA 1436
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
K+ ++ +A E ++ EE K +A K+ EA++ +
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473
Score = 29.3 bits (65), Expect = 3.2
Identities = 17/104 (16%), Positives = 53/104 (50%), Gaps = 1/104 (0%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
+++ + +++K + EEK ++ + + E+A+K+ +E ++ ++ + +
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDE 1753
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAA-KEELEARQTELQLELQ 307
++ + +E ++ EE K+ +A +EEL+ + ++E+
Sbjct: 1754 EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
Score = 29.3 bits (65), Expect = 3.3
Identities = 26/120 (21%), Positives = 50/120 (41%), Gaps = 4/120 (3%)
Query: 184 APRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR 243
A R N+ I + RR+ + + + ++ + K+ K E
Sbjct: 1245 AEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKA 1304
Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
++A+KK +E+ K+ + K ++ +A ++ EE K +AAK E EA E +
Sbjct: 1305 DEAKKKAEEA----KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360
Score = 29.0 bits (64), Expect = 4.0
Identities = 24/98 (24%), Positives = 49/98 (50%), Gaps = 5/98 (5%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
++ + I +++K +A EEK +Q + K E E+ +K +E+ + +
Sbjct: 1612 AKKAEEAKIKAEELK-KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEA--- 1667
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQA-AKEELEARQTE 301
K ++D +A+E + E++ K +A KE EA++ E
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
Score = 28.6 bits (63), Expect = 5.3
Identities = 23/81 (28%), Positives = 46/81 (56%), Gaps = 2/81 (2%)
Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
+AR+ + AK++ D + E A + + +K +E+ + VE+A++ +D +A+E R
Sbjct: 1113 EARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEE--AR 1170
Query: 281 LEEQLKQLQAAKEELEARQTE 301
E K+ +AA++ E R+ E
Sbjct: 1171 KAEDAKKAEAARKAEEVRKAE 1191
Score = 28.6 bits (63), Expect = 6.0
Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 1/86 (1%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+ K +A E K A++ ++ + + A K + + + + K+ E K+ + +A
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAA 1506
Query: 276 EMIRRLEEQLKQLQAAKEELEARQTE 301
E ++ +E K+ + AK+ EA++ E
Sbjct: 1507 EAKKKADE-AKKAEEAKKADEAKKAE 1531
Score = 28.6 bits (63), Expect = 6.1
Identities = 19/83 (22%), Positives = 47/83 (56%), Gaps = 2/83 (2%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
KA ++ A +++ D+ + + ++KA++ +++ E K E + K+ ++ +A+E
Sbjct: 1406 KADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAK--KKAEEAKKAEEAK 1463
Query: 279 RRLEEQLKQLQAAKEELEARQTE 301
++ EE K +A K+ EA++ +
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKAD 1486
Score = 28.6 bits (63), Expect = 6.3
Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 2/93 (2%)
Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
++K D + +A+ E + + + E AA EKAE ++ E K+ + K
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE-AAEEKAEAAEKKKEEAKKKAD-AAKK 1385
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
+ ++ +A E ++ EE K+ K+ A++
Sbjct: 1386 KAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418
>gnl|CDD|203244 pfam05400, FliT, Flagellar protein FliT. This family contains
several bacterial flagellar FliT proteins. The flagellar
proteins FlgN and FliT have been proposed to act as
substrate specific export chaperones, facilitating
incorporation of the enterobacterial hook-associated
axial proteins (HAPs) FlgK/FlgL and FliD into the
growing flagellum. In Salmonella typhimurium flgN and
fliT mutants, the export of target HAPs is reduced,
concomitant with loss of unincorporated flagellin into
the surrounding medium.
Length = 82
Score = 29.6 bits (67), Expect = 0.42
Identities = 16/71 (22%), Positives = 33/71 (46%), Gaps = 7/71 (9%)
Query: 247 EKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
E + ++ +ERL++ + + + E +E++RR+ ++ + L+ R EL L
Sbjct: 17 EDEREQLLERLREAPLSPPESE----EKRELLRRILANDAEI---RALLQPRLDELSQLL 69
Query: 307 QPIRNQSNLNK 317
R Q N
Sbjct: 70 GQARRQKKANN 80
>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
(DUF2317). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 541
Score = 31.8 bits (73), Expect = 0.45
Identities = 22/98 (22%), Positives = 40/98 (40%), Gaps = 17/98 (17%)
Query: 202 ELYMRRRKPDTID--VQQMKAQAREEKNAKQQQRDKLQ--LEIAAREKAEKKHQESVERL 257
E + + ID ++ KA+ + +++ K+ LE A E E K + +E L
Sbjct: 412 EKVLEEKANSEIDIDFEEAKAELEAQFKRLREEAAKIDPSLEGAL-EANEAKILKQLEFL 470
Query: 258 KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
++ + AQ R+ EE L+Q + L
Sbjct: 471 EK----RLLK------AQR--RKHEELLRQFDRLQTAL 496
Score = 31.8 bits (73), Expect = 0.56
Identities = 21/94 (22%), Positives = 41/94 (43%), Gaps = 7/94 (7%)
Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE---- 276
K +++ +++I E A+ + + +RL++ ++ + +EA E
Sbjct: 406 GLELLKEKVLEEKANSEIDIDFEE-AKAELEAQFKRLREEAAKIDPSLEGALEANEAKIL 464
Query: 277 -MIRRLEEQLKQLQAAKEELEARQ-TELQLELQP 308
+ LE++L + Q K E RQ LQ L P
Sbjct: 465 KQLEFLEKRLLKAQRRKHEELLRQFDRLQTALFP 498
>gnl|CDD|223914 COG0845, AcrA, Membrane-fusion protein [Cell envelope biogenesis,
outer membrane].
Length = 372
Score = 31.6 bits (71), Expect = 0.47
Identities = 14/82 (17%), Positives = 27/82 (32%), Gaps = 2/82 (2%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREK--AEKKHQESVERLKQLEVEMAKRD 268
+ +Q QA + Q +L RE A +K S L + + +
Sbjct: 99 PSAVLQAALDQAEAQLARAQALLAPAELGDLQREAKLAAEKAAVSQAELDAAQALLRAAE 158
Query: 269 QDLMEAQEMIRRLEEQLKQLQA 290
+ AQ + + L+ +
Sbjct: 159 ALVEAAQAALASAKLNLEYTRI 180
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 32.0 bits (73), Expect = 0.48
Identities = 22/89 (24%), Positives = 36/89 (40%), Gaps = 1/89 (1%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
D++Q A + +++ + L E A E E+ LKQ +
Sbjct: 754 DLRQRIAAMERDLARFEEEVEAL-AEAVAPEMLGTPADETARALKQRLKRARDTAAAAEK 812
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTEL 302
E I E+++ + AA +E EAR T L
Sbjct: 813 LAEEIEEAEKEVSEAAAALDEAEARLTAL 841
Score = 30.5 bits (69), Expect = 1.2
Identities = 15/88 (17%), Positives = 30/88 (34%), Gaps = 16/88 (18%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEK---------KHQESVERLKQLEVEMAKRDQD 270
+AR E A ++ + L + A + ++E L Q E
Sbjct: 245 EEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRN---- 300
Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEAR 298
A++ + E ++ + A L A+
Sbjct: 301 ---AEQDLPDREGEIANAREAAAALLAQ 325
Score = 30.5 bits (69), Expect = 1.3
Identities = 20/96 (20%), Positives = 42/96 (43%), Gaps = 6/96 (6%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQE---SVERLKQLEVEMAK---R 267
++ Q A + RD+L+ E K + E +++ LK+LE E+ + +
Sbjct: 113 ELGQYLFSAAAGVGSLGSVRDELEKEADELWKPRGRKPEINVALKELKELEAEIREVQLK 172
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
+ + + + EE+L L+ +LE + L+
Sbjct: 173 TRTWKDLVKALDEAEEELANLRKELRQLEKEKQRLE 208
Score = 29.3 bits (66), Expect = 3.4
Identities = 23/115 (20%), Positives = 47/115 (40%), Gaps = 14/115 (12%)
Query: 202 ELYMRRRKPDTIDVQQ---MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK 258
L + + D I + + +A A E + ++ + + ++ RE +E+ L
Sbjct: 264 RLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNAEQDLPDREGEIANAREAAAALL 323
Query: 259 Q-----LEVEMAKRDQDLMEA-----QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
+ E A A E+ +R E + L++A++ LE R+ EL+
Sbjct: 324 AQIGPDADEE-AVESLRPSLAAKETVTELEKRKEALDQALKSARDALEERERELK 377
Score = 29.0 bits (65), Expect = 4.5
Identities = 24/109 (22%), Positives = 41/109 (37%), Gaps = 9/109 (8%)
Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAE---------KKHQESVERLKQLEVE 263
D +A ARE+ A + R +L + R + E L +L
Sbjct: 608 EDWLAERATAREQVRAYFKARAELDALLDRRARLRAALRAALKAVAIVLPGEELAELLEL 667
Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+ ++ + L+E+L+ + A EE E R E Q L+ + Q
Sbjct: 668 ARQLLEEAEKQAARKASLDERLRDAERALEEAEERHDEAQSALEAWQEQ 716
Score = 28.6 bits (64), Expect = 5.2
Identities = 16/80 (20%), Positives = 31/80 (38%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
+ + K Q +L L E+ S E+++ E + Q + ++ +
Sbjct: 416 EQEVAVAKRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAEEIMQAKRQKRDRL 475
Query: 279 RRLEEQLKQLQAAKEELEAR 298
LEE L+ L+ L+A
Sbjct: 476 LELEEDLETLELQLRHLDAA 495
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 31.8 bits (73), Expect = 0.48
Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 6/81 (7%)
Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
+ QR L E A++ +QE+ + KQ E+ + E + L+ QL++L
Sbjct: 138 PELQRQLLDAFAGLEELAQEAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEEL 197
Query: 289 QAAK------EELEARQTELQ 303
+ EELE + L
Sbjct: 198 EELNLQPGEDEELEEERKRLS 218
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 31.9 bits (73), Expect = 0.49
Identities = 25/96 (26%), Positives = 50/96 (52%), Gaps = 15/96 (15%)
Query: 214 DVQQMKAQAREEK-NAKQQQRDKLQLEIAARE------KAE--------KKHQESVERLK 258
D ++ + R+E+ +++ +R + QLE +E +AE K Q+ R++
Sbjct: 694 DAARLAEELRQEQEHSQHLERLRKQLESQVKELQVRLDEAEAAALKGGKKMIQKLEARVR 753
Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEE 294
+LE E+ + E Q+ +R++E ++K+LQ EE
Sbjct: 754 ELEAELDGEQRRHAETQKNLRKMERRVKELQFQVEE 789
Score = 31.2 bits (71), Expect = 0.89
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 10/88 (11%)
Query: 232 QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAA 291
+ ++L+ ++ A + K Q+ + L E + + E ++ RRLE + +LQAA
Sbjct: 420 ELEELKDQVEALRRENKNLQDEIHDLTDQLGEGGRN---VHELEKARRRLEAEKDELQAA 476
Query: 292 KEELEA--RQTE-----LQLELQPIRNQ 312
EE EA E Q+EL IR++
Sbjct: 477 LEEAEAALELEESKVLRAQVELSQIRSE 504
Score = 31.2 bits (71), Expect = 0.94
Identities = 36/134 (26%), Positives = 60/134 (44%), Gaps = 36/134 (26%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL-------------- 260
V Q++ + +E + ++ ++L+ E AAR KAEK + L++L
Sbjct: 34 VAQLQKKIKELEARIRELEEELEAERAARAKAEKARADLSRELEELSERLEEAGGATAAQ 93
Query: 261 -------EVEMAKRDQDLMEA---------------QEMIRRLEEQLKQLQAAKEELEAR 298
E E+AK +DL EA Q+ I L EQ++QLQ K + E
Sbjct: 94 IELNKKREAELAKLRKDLEEANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKE 153
Query: 299 QTELQLELQPIRNQ 312
+++LQ E+ + Q
Sbjct: 154 KSQLQAEVDDLLAQ 167
Score = 30.0 bits (68), Expect = 2.0
Identities = 27/112 (24%), Positives = 57/112 (50%), Gaps = 19/112 (16%)
Query: 213 IDVQQMKA------QAREE-----KNAKQQQRDKLQ--LEIAAREKAE----KKHQESVE 255
+++ Q+++ +EE + Q+ + LQ LE A+ KAE KK E
Sbjct: 496 VELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEG-- 553
Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+ +LE+ + ++ EAQ+ +++ ++Q+K+LQ EE + + + + +L
Sbjct: 554 DINELEIALDHANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLA 605
Score = 28.9 bits (65), Expect = 4.3
Identities = 26/92 (28%), Positives = 45/92 (48%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
V++++ Q EE+ A++ R++L + E + +E L+Q E + + +L EA
Sbjct: 583 VKELQTQVEEEQRAREDAREQLAVAERRATALEAELEELRSALEQAERARKQAETELAEA 642
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
E + L Q L A K +LE LQ +L
Sbjct: 643 SERVNELTAQNSSLIAQKRKLEGELAALQSDL 674
Score = 28.1 bits (63), Expect = 8.9
Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 10/109 (9%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
+ I+ Q K +A+ EK Q Q + L +A ++ K + ++ KQLE +
Sbjct: 138 EQIE-QLQKQKAKAEKEKSQLQAEVDDL-LAQLDQIAKAKLNAEKKAKQLESQ------- 188
Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ-SNLNKV 318
L E Q + L+ QL L + K L++ ++L +L+ Q SNL+K+
Sbjct: 189 LSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKL 237
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 31.5 bits (72), Expect = 0.51
Identities = 22/115 (19%), Positives = 50/115 (43%), Gaps = 12/115 (10%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKL--------QLEIAAREKAEKKHQESVERLKQLEVE 263
D + + + + K + RD+ QL+ E + E L + + +
Sbjct: 157 KEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTE----LDRAKEK 212
Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
+ K Q++M + + LEE+L++L++ E+L +++EL E+ + +
Sbjct: 213 LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRG 267
>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
Length = 359
Score = 31.6 bits (73), Expect = 0.53
Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 21/100 (21%)
Query: 210 PDTI-DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRD 268
P+ I D ++ + ++E +L+ I + K+ QE +E K EM + +
Sbjct: 25 PEVISDQKRFRKLSKEYA--------ELE-PIVEAYREYKQAQEDLEEAK----EMLEEE 71
Query: 269 QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
D EM +E+LK+L+ EELE EL++ L P
Sbjct: 72 SD----PEMREMAKEELKELEERLEELEE---ELKILLLP 104
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 31.6 bits (72), Expect = 0.55
Identities = 22/96 (22%), Positives = 56/96 (58%), Gaps = 8/96 (8%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+V++M +A EKNA + + + ++++ + +AE ++ ++LK+L+ ++++ ++ +E
Sbjct: 506 EVERMVKEA--EKNAAEDKEKREKIDL--KNQAESLCYQAEKQLKELKDKISEEKKEKIE 561
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPI 309
+I++L + L+ E +++ ELQ L I
Sbjct: 562 --NLIKKLRQALQNDNY--ESIKSLLEELQKALMEI 593
>gnl|CDD|233897 TIGR02499, HrpE_YscL_not, type III secretion apparatus protein,
HrpE/YscL family. This model is related to pfam06188,
but is broader. pfam06188 describes HrpE-like proteins,
components of bacterial type III secretion systems
primarily in bacteria that infect plants. This model
includes also the homologous proteins of animal
pathogens, such as YscL of Yersinia pestis. This model
excludes the related protein FliH of the bacterial
flagellar apparatus (see pfam02108) [Protein fate,
Protein and peptide secretion and trafficking, Cellular
processes, Pathogenesis].
Length = 166
Score = 30.7 bits (70), Expect = 0.57
Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 4/71 (5%)
Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVER-LKQLEVEMAKRDQD-LMEAQEM 277
AQA+ A +Q+ + + + A E+AE Q E+ L+Q E A + + EA+++
Sbjct: 13 AQAQAILAAARQRAEAILAD--AEEEAEASRQLGYEQGLEQFWQEAAAQLAEWQQEAEQL 70
Query: 278 IRRLEEQLKQL 288
LEE+L +L
Sbjct: 71 EASLEERLAEL 81
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have
LPXTG C-terminal anchoring motifs and a substantial
number have the KxYKxGKxW putative sorting signal at the
N-terminus. The tetracycline resistance plasmid pCF10 in
Enterococcus faecalis promotes conjugal plasmid transfer
in response to sex pheromones, but PgrA/Sec10 encoded by
that plasmid, a member of this family, specifically
inhibits the ability of cells to receive homologous
plasmids. The phenomenon is called surface exclusion.
Length = 356
Score = 31.2 bits (71), Expect = 0.58
Identities = 21/94 (22%), Positives = 35/94 (37%), Gaps = 2/94 (2%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
Q A A+ N Q Q AA + A Q+ + + ++ A+ +L
Sbjct: 265 TAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQTAQ--NNLAT 322
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
AQ + E +L + + A L A + Q L
Sbjct: 323 AQAALANAEARLAKAKEALANLNADLAKKQAALD 356
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 31.6 bits (73), Expect = 0.60
Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 11/77 (14%)
Query: 239 EIAAREKAEKKHQESVERLKQL------EVEMAKRDQDLMEAQEMI---RRLEEQLKQLQ 289
E+ ++ ++ Q +E L+ E+ AKR + +A+ +I + L+E++K L+
Sbjct: 29 ELLELDEERRELQTELEELQAERNALSKEIGQAKRKGE--DAEALIAEVKELKEEIKALE 86
Query: 290 AAKEELEARQTELQLEL 306
A +ELEA EL L +
Sbjct: 87 AELDELEAELEELLLRI 103
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 31.5 bits (71), Expect = 0.62
Identities = 24/101 (23%), Positives = 45/101 (44%), Gaps = 6/101 (5%)
Query: 209 KPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL--KQLEVEMAK 266
K D+ Q +AQ+R + A ++ ++++ A KA Q+ + L + + K
Sbjct: 460 KAYQEDILQREAQSRGKTAAAERSQEQM----TAALKALLAFQQQIADLSGAKEKASDQK 515
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
E Q + + E + +QLQ K LE ++ L+ Q
Sbjct: 516 SLLWKAEEQYALLKEEAKQRQLQEQKALLEHKKETLEYTSQ 556
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 31.4 bits (72), Expect = 0.62
Identities = 18/70 (25%), Positives = 38/70 (54%), Gaps = 4/70 (5%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
KL+ RE+ + + + + + QL E+A+R++ QE +L+ ++K+ + +
Sbjct: 92 AKLE---EMRERDLELNVQLKDNIAQLNQEIAEREKAEEARQEAFEQLKNEIKEREETQI 148
Query: 294 ELEARQTELQ 303
ELE +Q+ L
Sbjct: 149 ELE-QQSSLL 157
>gnl|CDD|220388 pfam09766, FimP, Fms-interacting protein. This entry carries part
of the crucial 144 N-terminal residues of the FmiP
protein, which is essential for the binding of the
protein to the cytoplasmic domain of activated
Fms-molecules in M-CSF induced haematopoietic
differentiation of macrophages. The C-terminus contains
a putative nuclear localisation sequence and a leucine
zipper which suggest further, as yet unknown, nuclear
functions. The level of FMIP expression might form a
threshold that determines whether cells differentiate
into macrophages or into granulocytes.
Length = 352
Score = 31.2 bits (71), Expect = 0.68
Identities = 28/131 (21%), Positives = 51/131 (38%), Gaps = 24/131 (18%)
Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQ---LEIAAREKAEKKHQE---S 253
N ++R K + + EEK+ R +LQ E + EK + +
Sbjct: 7 NRLAHIRIEK--------GRDKTHEEKDKVDSLRLQLQNLLYEKSHLEKEINRCLDFKSK 58
Query: 254 VERLKQLEVEM----AKRDQDLMEA------QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
E ++ + +E A D E M+ RLE +L+Q + E+L+ ++E +
Sbjct: 59 DEEIELVPLEEFYAQAPEDISRPEKTPLDEHNLMLARLEWELEQRKELAEKLKELESEKE 118
Query: 304 LELQPIRNQSN 314
Q I +
Sbjct: 119 KLQQDIEKKKE 129
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 31.2 bits (71), Expect = 0.69
Identities = 16/70 (22%), Positives = 35/70 (50%), Gaps = 5/70 (7%)
Query: 231 QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME-----AQEMIRRLEEQL 285
+ +K++ E+ + +++ + E + +LE E+ K L A+ + +R+E++L
Sbjct: 329 EYAEKIKEELDQLDDSDESLEALEEEVDKLEEELDKAAVALSLIRRKAAERLAKRVEQEL 388
Query: 286 KQLQAAKEEL 295
K L K E
Sbjct: 389 KALAMEKAEF 398
Score = 30.9 bits (70), Expect = 1.0
Identities = 21/72 (29%), Positives = 33/72 (45%), Gaps = 3/72 (4%)
Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE--ARQTELQLELQ 307
E V+ ++L K Q L + Q+ + L ++L LQ EELE Q L+
Sbjct: 156 ANEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRLDFLQFQLEELEEADLQPGEDEALE 215
Query: 308 PIRNQ-SNLNKV 318
+ + SNL K+
Sbjct: 216 AEQQRLSNLEKL 227
Score = 27.8 bits (62), Expect = 9.2
Identities = 15/86 (17%), Positives = 35/86 (40%), Gaps = 9/86 (10%)
Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+ + E + Q +L+ + A E+ + ++ E L QL+ D+ L +
Sbjct: 297 EFDPERLNEIEERLAQIKRLKRKYGASVEEVLEYAEKIKEELDQLDD----SDESL---E 349
Query: 276 EMIRRLEEQLKQLQAAKEELEA-RQT 300
+ +++ ++L A L R+
Sbjct: 350 ALEEEVDKLEEELDKAAVALSLIRRK 375
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 31.2 bits (71), Expect = 0.71
Identities = 22/113 (19%), Positives = 43/113 (38%), Gaps = 11/113 (9%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR------EKAEKKHQESVERLKQLEVEMA 265
+QQ Q+RE+ + +L E E + + + + LKQL +
Sbjct: 79 KARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQNLKQLL-KPL 137
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
+ + Q + +R+ E ++ EE++ E+Q Q + NL
Sbjct: 138 REVLEKFREQ-LEQRIHESAEERSTLLEEIDRLLGEIQ---QLAQEAGNLTAA 186
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 30.0 bits (68), Expect = 0.77
Identities = 16/59 (27%), Positives = 35/59 (59%), Gaps = 1/59 (1%)
Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
R+KA+++ +E ER +++E R++ E ++ ++ +E LK L+ ++ E +TE
Sbjct: 35 RKKAQEEAKEK-EREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETE 92
>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
This model represents the MFP (membrane fusion protein)
component of the RND family of transporters. RND refers
to Resistance, Nodulation, and cell Division. It is, in
part, a subfamily of pfam00529 (Pfam release 7.5) but
hits substantial numbers of proteins missed by that
model. The related HlyD secretion protein, for which
pfam00529 is named, is outside the scope of this model.
Attributed functions imply outward transport. These
functions include nodulation, acriflavin resistance,
heavy metal efflux, and multidrug resistance proteins.
Most members of this family are found in Gram-negative
bacteria. The proposed function of MFP proteins is to
bring the inner and outer membranes together and enable
transport to the outside of the outer membrane. Note,
however, that a few members of this family are found in
Gram-positive bacteria, where there is no outer membrane
[Transport and binding proteins, Unknown substrate].
Length = 322
Score = 31.1 bits (71), Expect = 0.78
Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 7/85 (8%)
Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
AR + + Q ++AA E + Q S ER ++L KR+ +Q +
Sbjct: 55 ARLDDDDYQLALQAALAQLAAAEAQLELAQRSFERAERL----VKRN---AVSQADLDDA 107
Query: 282 EEQLKQLQAAKEELEARQTELQLEL 306
+ ++ QA E +A QL L
Sbjct: 108 KAAVEAAQADLEAAKASLASAQLNL 132
>gnl|CDD|220278 pfam09523, DUF2390, Protein of unknown function (DUF2390). Members
of this family are bacterial hypothetical proteins,
about 160 amino acids in length, found in various
proteobacteria, including members of the genera
Pseudomonas and Vibrio. The C-terminal region is poorly
conserved and is not included in the model.
Length = 109
Score = 29.5 bits (67), Expect = 0.78
Identities = 15/55 (27%), Positives = 25/55 (45%), Gaps = 3/55 (5%)
Query: 253 SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
S E L+ L+ A +++++ +R L QLKQ EL + EL+
Sbjct: 48 SAEDLQALQAAAAPWREEVVQP---LRALRRQLKQAADETPELALYEQLKAAELE 99
>gnl|CDD|216174 pfam00887, ACBP, Acyl CoA binding protein.
Length = 87
Score = 29.1 bits (66), Expect = 0.79
Identities = 10/23 (43%), Positives = 19/23 (82%)
Query: 27 WKEHKGMLREDAMMEYLKIAQDL 49
WK+ KGM +E+AM +Y+++ ++L
Sbjct: 60 WKKLKGMSKEEAMKKYVELVEEL 82
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 31.2 bits (72), Expect = 0.87
Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 14/82 (17%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKK-HQES-VERLKQLEVEMAKRD 268
IDV+ A R EK + KL+ EI E+ EKK E V + + VE +
Sbjct: 805 GLIDVEAELA--RLEK-----ELAKLEKEI---ERVEKKLSNEGFVAKAPEEVVEKEREK 854
Query: 269 QDLMEAQEMIRRLEEQLKQLQA 290
E +E + +L+E+L +L+A
Sbjct: 855 LA--EYEEKLAKLKERLARLKA 874
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 29.9 bits (68), Expect = 0.88
Identities = 23/97 (23%), Positives = 56/97 (57%), Gaps = 1/97 (1%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
+DVQ++ +++ K A++Q + + A +K EK+ Q+ ++L++ +++ +
Sbjct: 20 VVDVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSE-EARK 78
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
+ QE+ ++ +E ++ QAA++EL+ +Q EL +
Sbjct: 79 AKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYD 115
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 31.0 bits (70), Expect = 0.89
Identities = 19/71 (26%), Positives = 39/71 (54%), Gaps = 5/71 (7%)
Query: 218 MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
M + E+ ++++R +LE A+E+AE++ Q +R + E E A + D +A+
Sbjct: 241 MDEHSFEDFLLEEKRR---ELEKLAKEEAERERQAEEQR--RREEEKAAMEADRAQAKAE 295
Query: 278 IRRLEEQLKQL 288
+ + E+L+ L
Sbjct: 296 VEKRREKLQNL 306
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 30.8 bits (70), Expect = 0.90
Identities = 25/99 (25%), Positives = 49/99 (49%), Gaps = 12/99 (12%)
Query: 199 GNHELY---MRRRKPDTIDVQQMKA------QAREEKNAKQQQRDKLQLEIAAREKAEKK 249
G+ EL RRR V ++ A Q + E ++ +++ +KL E+A + A++
Sbjct: 13 GDPELIRESQRRRFASVELVDEVIALDKEWRQRQFELDSLRKEFNKLNKEVAKLKIAKED 72
Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
E + K+L+ E+ +++ E QE L+ +LK +
Sbjct: 73 ATELIAETKELKKEITEKEA---EVQEAKAALDAKLKTI 108
>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
DevB family. Members of this protein family are found
mostly in the Cyanobacteria, but also in the
Planctomycetes. DevB from Anabaena sp. strain PCC 7120
is partially characterized as a membrane fusion protein
of the DevBCA ABC exporter, probably a glycolipid
exporter, required for heterocyst formation. Most
Cyanobacteria have one member only, but Nostoc sp. PCC
7120 has seven members.
Length = 327
Score = 31.0 bits (70), Expect = 0.91
Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 6/75 (8%)
Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
E + + Q D+ + +A KK + + +R + AK +D+ Q + RL
Sbjct: 53 RTAELDVARTQLDEAKARLAQVRAGAKKGEIAAQRAARAA---AKLFKDVAAQQATLNRL 109
Query: 282 EEQLKQLQAAKEELE 296
E +L+ A+ E++
Sbjct: 110 E---AELETAQREVD 121
>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
C4-dicarboxylate transport system [Signal transduction
mechanisms].
Length = 603
Score = 31.1 bits (71), Expect = 0.92
Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 9/55 (16%)
Query: 203 LYMRRRKPDTIDVQQMKAQAREEKNAKQQQR--------DKLQLEIAAREKAEKK 249
L++RRR+ +AR E + ++R +LQ EIA RE+AE
Sbjct: 314 LWLRRRR-RARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAA 367
>gnl|CDD|225507 COG2959, HemX, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 391
Score = 30.9 bits (70), Expect = 0.94
Identities = 23/76 (30%), Positives = 35/76 (46%), Gaps = 10/76 (13%)
Query: 230 QQQRDKLQLEIAARE---KAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLK 286
QQQ E+ A + KA + QE+ + L QLE + + Q + RLE QL+
Sbjct: 55 QQQNVLQTQELQALQQQLKALQLAQENQKLLAQLE-------SLIAQQQAELDRLERQLE 107
Query: 287 QLQAAKEELEARQTEL 302
LQ EL+ + +
Sbjct: 108 TLQKQLSELQKKVATI 123
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 31.0 bits (70), Expect = 0.94
Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 9/91 (9%)
Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ 289
+RD++ + E+ + + + L+ + +R+ + +++ L + QL
Sbjct: 168 TFKRDRILQQQWREEQEKYDAADKARAIYALQTKADERN-----LETVLQSLSQADFQLA 222
Query: 290 AAKEE----LEARQTELQLELQPIRNQSNLN 316
E+ +EAR E + E++ + N+ NLN
Sbjct: 223 GVAEKELETVEARIKEARYEIEELENKLNLN 253
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 30.6 bits (69), Expect = 1.0
Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 9/86 (10%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQ-----LEIAAREKAEKKHQESVERLKQLEVEMAKR- 267
D +Q + +A +E +Q++ L+ A+E A+++ Q + ++ +V A+
Sbjct: 47 DAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEARE-DVATARDE 105
Query: 268 -DQDL-MEAQEMIRRLEEQLKQLQAA 291
+ L E QE + L++Q Q
Sbjct: 106 WLEQLEREKQEFFKALQQQTGQQMVK 131
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 29.5 bits (67), Expect = 1.0
Identities = 16/77 (20%), Positives = 40/77 (51%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
A ++E + +RD+ + EI + ++ +E + +++LE E+ +Q E++
Sbjct: 33 LASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQRYETTLELL 92
Query: 279 RRLEEQLKQLQAAKEEL 295
E++++L+A +L
Sbjct: 93 GEKSERVEELRADVVDL 109
Score = 26.8 bits (60), Expect = 9.3
Identities = 18/72 (25%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
++L + E ++ + RL+ E + L E E +R L++++++L+ E
Sbjct: 20 ERLSSTLRRLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELE 79
Query: 294 ELEAR-QTELQL 304
+LE R +T L+L
Sbjct: 80 DLEQRYETTLEL 91
>gnl|CDD|153332 cd07648, F-BAR_FCHO, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of FCH domain Only proteins.
F-BAR domains are dimerization modules that bind and
bend membranes and are found in proteins involved in
membrane dynamics and actin reorganization. Proteins in
this group have been named FCH domain Only (FCHO)
proteins. Vertebrates have two members, FCHO1 and FCHO2.
These proteins contain an F-BAR domain and a C-terminal
domain of unknown function named SAFF which is also
present in endophilin interacting protein 1. F-BAR
domains form banana-shaped dimers with a
positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 261
Score = 30.4 bits (69), Expect = 1.1
Identities = 14/51 (27%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 253 SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
S E+L +L +++ ++ Q+L +++ + EEQ K+ + KEE +Q
Sbjct: 68 STEKLSELHLQLVQKLQEL--IKDVQKYGEEQHKKHKKVKEEESGTAEAVQ 116
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 29.6 bits (67), Expect = 1.1
Identities = 24/89 (26%), Positives = 44/89 (49%), Gaps = 4/89 (4%)
Query: 228 AKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ--DLMEAQEMIRRLEEQL 285
K+Q +D+ LE K +K E +ER ++ +E ++ Q +L + IR EQ
Sbjct: 5 QKEQLKDRFFLERTQLLKRHEKELEQLERQQKRTIEKLEQRQTQELRRLPKRIRA--EQK 62
Query: 286 KQLQAAKEELEARQTELQLELQPIRNQSN 314
+L+ KE L+ + EL+ E++ +
Sbjct: 63 TRLKMFKESLKIEKKELKQEVEKLPRFQE 91
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 30.6 bits (70), Expect = 1.2
Identities = 25/94 (26%), Positives = 46/94 (48%), Gaps = 17/94 (18%)
Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA-------- 290
E+ + EK+ +E + QL +A++ E QE + +E+QL++++
Sbjct: 342 ELETVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEIEKEQEELSES 401
Query: 291 ----AKEELEARQT--ELQLELQPIR---NQSNL 315
K+ELEAR+ E + +L I+ +SNL
Sbjct: 402 LQGLRKDELEAREKLQEYRQKLHEIKRYLEKSNL 435
Score = 28.7 bits (65), Expect = 4.3
Identities = 24/105 (22%), Positives = 51/105 (48%), Gaps = 15/105 (14%)
Query: 214 DVQQMKAQAREEKNAKQQQRDK---LQLEIAARE----KAEKKHQESVERLKQLEVEMAK 266
++ ++K + + ++ +DK L+ + A+ A + ++ +L +LE E +
Sbjct: 123 ELNELKESEEKNRKEVEELKDKYRELRKTLLAKSFSYGPALDELEK---QLDELEEEFEQ 179
Query: 267 RDQ-----DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
+ D +EA+E++ +LEE+ L+ EE+ ELQ E
Sbjct: 180 FVELTESGDYLEAREVLLKLEEETDALEQKMEEIPPLLKELQNEF 224
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 30.4 bits (69), Expect = 1.2
Identities = 22/101 (21%), Positives = 44/101 (43%), Gaps = 14/101 (13%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
R ID++++ A E K + ++LQ + E KQ+
Sbjct: 20 ARGLSVDIDLEKLIALDDERKKL-LSEIEELQ-------------AKRNELSKQIGKAKG 65
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
++ + E ++ ++ L+E+L +L AA + LEA + L +
Sbjct: 66 QKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSI 106
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 29.6 bits (67), Expect = 1.2
Identities = 16/70 (22%), Positives = 34/70 (48%), Gaps = 3/70 (4%)
Query: 248 KKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
++ +E + LE + K + ++ Q I RL+ QL+ L+ L+A++ +L+ +L
Sbjct: 43 QQRDRDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKL- 101
Query: 308 PIRNQSNLNK 317
+ K
Sbjct: 102 --KTLEQKLK 109
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 30.5 bits (69), Expect = 1.3
Identities = 15/77 (19%), Positives = 41/77 (53%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
QQ+ A+ R+ + + L+ + AA EK + + + +L +L+V++ + +++ A+
Sbjct: 327 QQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKLPKLQVQLRELEREAEAAR 386
Query: 276 EMIRRLEEQLKQLQAAK 292
+ L ++ ++L +
Sbjct: 387 SLYETLLQRYQELSIQE 403
Score = 28.6 bits (64), Expect = 4.8
Identities = 19/99 (19%), Positives = 39/99 (39%), Gaps = 3/99 (3%)
Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
+ ++AQ E +QQ +L+ +A+ ++ L++ ++
Sbjct: 305 STELGAKHPQLVALEAQLAE---LRQQIAAELRQILASLPNELALLEQQEAALEKELAQL 361
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
R L + Q +R LE + + ++ E L R EL
Sbjct: 362 KGRLSKLPKLQVQLRELEREAEAARSLYETLLQRYQELS 400
Score = 28.6 bits (64), Expect = 5.2
Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 16/92 (17%)
Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLE 282
R++ +QQ L E+ A+ +L LE ++A+ Q + +R++
Sbjct: 291 RQQYAQVRQQIADLSTELGAKH----------PQLVALEAQLAELRQQI---AAELRQI- 336
Query: 283 EQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
L L LE ++ L+ EL ++ + +
Sbjct: 337 --LASLPNELALLEQQEAALEKELAQLKGRLS 366
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 30.3 bits (69), Expect = 1.3
Identities = 16/78 (20%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKR-----DQDLMEAQEMIRRLEEQLKQLQAAKE 293
++ ++ +K +E L+ E++K + +A+E+I ++E ++L+ +
Sbjct: 30 KLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELEA 89
Query: 294 ELEARQTELQLELQPIRN 311
L+ + EL L I N
Sbjct: 90 ALDELEAELDTLLLTIPN 107
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 30.5 bits (69), Expect = 1.3
Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 1/70 (1%)
Query: 245 KAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQL 304
K ++ QE V+R + + + A+ +AQ + +L+ Q QL + ++ Q L+
Sbjct: 347 KVDRLDQELVQRQQANDAQTAELRVKTEQAQASVHQLDSQFAQLDGKLADAQSAQQALEQ 406
Query: 305 ELQPI-RNQS 313
+ Q + RN+
Sbjct: 407 QYQDLSRNRD 416
>gnl|CDD|220710 pfam10351, Apt1, Golgi-body localisation protein domain. This is
the C-terminus of a family of proteins conserved from
plants to humans. The plant members are localised to the
Golgi proteins and appear to regulate membrane
trafficking, as they are required for rapid vesicle
accumulation at the tip of the pollen tube. The
C-terminus probably contains the Golgi localisation
signal and it is well-conserved.
Length = 451
Score = 30.3 bits (69), Expect = 1.4
Identities = 15/58 (25%), Positives = 27/58 (46%), Gaps = 5/58 (8%)
Query: 242 AREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
E EK +E +E+L + DL + L+E++++L + ELE R+
Sbjct: 68 YVEPKEKSRKERLEKLL-----FSSDFSDLEGLDSRVVALQERIRELAEIRRELEFRE 120
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 30.0 bits (68), Expect = 1.4
Identities = 19/69 (27%), Positives = 34/69 (49%), Gaps = 7/69 (10%)
Query: 212 TIDVQQMKAQAREE-KNAKQQQRDKLQL------EIAAREKAEKKHQESVERLKQLEVEM 264
T++ ++ EE +QQQR Q E E+AE++ +E ER K+ + E
Sbjct: 140 TLEQALLEVLEEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKER 199
Query: 265 AKRDQDLME 273
+R+++ E
Sbjct: 200 KQREKETAE 208
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 29.1 bits (66), Expect = 1.5
Identities = 13/62 (20%), Positives = 30/62 (48%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+V + A+ + + KQ+++ ++A EK E + K+LE A+ + ++
Sbjct: 12 EVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELK 71
Query: 274 AQ 275
A+
Sbjct: 72 AE 73
>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
Length = 174
Score = 29.5 bits (67), Expect = 1.5
Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQ-LQAAKEELEARQTE 301
RE + +E+ ERL+Q +A+ Q L +AQ+ R+ K +A + E+E + E
Sbjct: 52 REAILQALKEAEERLRQAAQALAEAQQKLAQAQQEAERIRADAKARAEAIRAEIEKQAIE 111
Score = 29.5 bits (67), Expect = 1.6
Identities = 25/113 (22%), Positives = 51/113 (45%), Gaps = 19/113 (16%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVER--------L 257
RR+ +++ + + R+ A + + KL A+++AE+ ++ R
Sbjct: 50 ERREAILQALKEAEERLRQAAQALAEAQQKLA---QAQQEAERIRADAKARAEAIRAEIE 106
Query: 258 KQLEVEMAK----RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
KQ +MA+ DL QE R+ QL++ +AA+ + +++L L
Sbjct: 107 KQAIEDMARLKQTAAADLSAEQE---RVIAQLRR-EAAELAIAKAESQLPGRL 155
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 29.2 bits (66), Expect = 1.5
Identities = 22/90 (24%), Positives = 47/90 (52%), Gaps = 4/90 (4%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
+ QA+ + + +R K + + A + E++ +E+ +E AK++ + A+E+
Sbjct: 37 ERQAKIADDLAEAERLKEEAQ-ALLAEYEQELEEA-REQASEIIEQAKKEAE-QIAEEIK 93
Query: 279 RRLEEQL-KQLQAAKEELEARQTELQLELQ 307
EE+L + +AA+ E+EA + EL+
Sbjct: 94 AEAEEELERIKEAAEAEIEAEKERALEELR 123
Score = 27.6 bits (62), Expect = 5.0
Identities = 19/91 (20%), Positives = 48/91 (52%), Gaps = 11/91 (12%)
Query: 227 NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLK 286
A +++ K+ ++A +AE+ +E+ L + E E+ + + +A E+I + +++ +
Sbjct: 33 KALDERQAKIADDLA---EAERLKEEAQALLAEYEQELEEARE---QASEIIEQAKKEAE 86
Query: 287 QL-----QAAKEELEARQTELQLELQPIRNQ 312
Q+ A+EELE + + E++ + +
Sbjct: 87 QIAEEIKAEAEEELERIKEAAEAEIEAEKER 117
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural
protein GP20 sequences of around 180 residues in length.
The function of this family is unknown.
Length = 156
Score = 29.2 bits (66), Expect = 1.6
Identities = 12/68 (17%), Positives = 35/68 (51%), Gaps = 4/68 (5%)
Query: 240 IAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
+ K +K ++E L+ ++ RD+ + + ++ ++ EE K+++ K++ + +
Sbjct: 16 QNPKSKLDKANEE----RDSLKKQLKNRDKQIEDLKKKVKDNEELQKKIEKLKQQNKTAK 71
Query: 300 TELQLELQ 307
E + +L
Sbjct: 72 EEYEAKLA 79
>gnl|CDD|133403 cd04776, HTH_GnyR, Helix-Turn-Helix DNA binding domain of the
regulatory protein GnyR. Putative helix-turn-helix
(HTH) regulatory protein, GnyR, and other related
proteins. GnyR belongs to the gnyRDBHAL cluster, which
is involved in acyclic isoprenoid degradation in
Pseudomonas aeruginosa. These proteins share the
N-terminal DNA binding domain with other transcription
regulators of the MerR superfamily that promote
transcription by reconfiguring the spacer between the
-35 and -10 promoter elements. A typical MerR regulator
is comprised of distinct domains that harbor the
regulatory (effector-binding) site and the active
(DNA-binding) site. Their conserved N-terminal domains
contain predicted winged HTH motifs that mediate DNA
binding, while the dissimilar C-terminal domains bind
specific coactivator molecules.
Length = 118
Score = 28.7 bits (65), Expect = 1.6
Identities = 6/28 (21%), Positives = 19/28 (67%)
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTEL 302
++M+ ++E++ +L+ + +++A EL
Sbjct: 79 EKMLEKIEKRRAELEQQRRDIDAALAEL 106
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 29.9 bits (68), Expect = 1.6
Identities = 18/94 (19%), Positives = 46/94 (48%), Gaps = 2/94 (2%)
Query: 216 QQMKAQAREEKNAKQQQ--RDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+++KA+ +EE+ + ++L+ E+ K+ +E E L+ ++ +R++ E
Sbjct: 30 KRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQE 89
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
E + EQ+ ++ +E + + + + E Q
Sbjct: 90 EYEERLQEREQMDEIIERIQEEDEAEAQEKREKQ 123
Score = 29.5 bits (67), Expect = 2.6
Identities = 18/106 (16%), Positives = 49/106 (46%)
Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE 261
+ M + + ++ + + R+E+ + + + Q+E + + E+ + ER + E
Sbjct: 44 DEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDE 103
Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+ +++D EAQE + ++ +++ EE R+ E + +
Sbjct: 104 IIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKERER 149
Score = 29.1 bits (66), Expect = 2.8
Identities = 19/97 (19%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+ ++ + + EE+ +++Q D++ I ++AE + + ++ + E++ ++ +
Sbjct: 82 EREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERK 141
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
+E R EE+LK L+ +E+ R+ E + E + +
Sbjct: 142 EEEKEREREEELKILEYQREK-AEREEEREAERRERK 177
Score = 28.3 bits (64), Expect = 5.0
Identities = 20/106 (18%), Positives = 48/106 (45%), Gaps = 8/106 (7%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAA--REKAEKKHQESVERLKQLEV------EMAK 266
+++++ + E K++++ KL+ EI E+ E+K +E ++ E A+
Sbjct: 105 IERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAE 164
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
R+++ + + +E+ A++E + E EL+ Q
Sbjct: 165 REEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQ 210
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set
of eukaryotic tropomyosins. Within the yeast Tmp1 and
Tmp2, biochemical and sequence analyses indicate that
Tpm2p spans four actin monomers along a filament,
whereas Tpmlp spans five. Despite its shorter length,
Tpm2p can compete with Tpm1p for binding to F-actin.
Over-expression of Tpm2p in vivo alters the axial
budding of haploids to a bipolar pattern, and this can
be partially suppressed by co-over-expression of Tpm1p.
This suggests distinct functions for the two
tropomyosins, and indicates that the ratio between them
is important for correct morphogenesis. The family also
contains higher eukaryote Tmp3 members.
Length = 143
Score = 29.1 bits (66), Expect = 1.6
Identities = 21/65 (32%), Positives = 37/65 (56%), Gaps = 7/65 (10%)
Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLME-------AQEMIRRLEEQLKQLQAAKEELE 296
E A+++ +E E+LK+LE E +++Q++ +E + +LEEQLK+ + EE E
Sbjct: 10 ENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEESE 69
Query: 297 ARQTE 301
T
Sbjct: 70 KLATN 74
>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein. This family
consists of several eukaryotic paraflagellar rod
component proteins. The eukaryotic flagellum represents
one of the most complex macromolecular structures found
in any organism and contains more than 250 proteins. In
addition to its locomotive role, the flagellum is
probably involved in nutrient uptake since receptors for
host low-density lipoproteins are localised on the
flagellar membrane as well as on the flagellar pocket
membrane.
Length = 289
Score = 30.0 bits (68), Expect = 1.6
Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 8/84 (9%)
Query: 226 KNAKQQQRDKLQLEI----AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE----M 277
AK +Q+ + ++ RE+ + + +R E K Q +E Q+
Sbjct: 8 IEAKHRQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQQACWRA 67
Query: 278 IRRLEEQLKQLQAAKEELEARQTE 301
I+ LE +L+ L + E R+ E
Sbjct: 68 IQELERELRDLAEERREEVERRIE 91
Score = 28.1 bits (63), Expect = 5.9
Identities = 24/101 (23%), Positives = 48/101 (47%), Gaps = 16/101 (15%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKA----EKKHQESVERLKQLE 261
R++ D+++ K + EE A +QR + + EK ++ Q +++LE
Sbjct: 13 RQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQQACWRAIQELE 72
Query: 262 VEMAKRDQDLMEAQ-EMIRRLEEQLKQLQAAKEELEARQTE 301
E+ +DL E + E + R +++A + E E R+T+
Sbjct: 73 REL----RDLAEERREEVER------RIEAVERE-EQRRTD 102
>gnl|CDD|193472 pfam12999, PRKCSH-like, Glucosidase II beta subunit-like. The
sequences found in this family are similar to a region
found in the beta-subunit of glucosidase II, which is
also known as protein kinase C substrate 80K-H (PRKCSH).
The enzyme catalyzes the sequential removal of two
alpha-1,3-linked glucose residues in the second step of
N-linked oligosaccharide processing. The beta subunit is
required for the solubility and stability of the
heterodimeric enzyme, and is involved in retaining the
enzyme within the endoplasmic reticulum.
Length = 176
Score = 29.4 bits (66), Expect = 1.6
Identities = 14/59 (23%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
E A + + E ++ + ++ L+ AQ+ L+++LK+L ++ L A+Q L
Sbjct: 121 EIARQFEEYLTEHNNSVKNGLKIKEGLLLAAQKKRDELKKRLKEL---EDSLPAKQQRL 176
>gnl|CDD|214744 smart00611, SEC63, Domain of unknown function in Sec63p, Brr2p and
other proteins.
Length = 312
Score = 29.9 bits (68), Expect = 1.6
Identities = 20/77 (25%), Positives = 36/77 (46%), Gaps = 11/77 (14%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
+++ QM QA ++ Q L EI ++ EKK S+E L +LE E
Sbjct: 135 ALNLSQMIIQALWPTDSPLLQLPHLPEEIL--KRLEKKKVLSLEDLLELEDE-------- 184
Query: 272 MEAQEMIRRLEEQLKQL 288
E E++ L+ + +++
Sbjct: 185 -ERGELLGLLDAEGERV 200
>gnl|CDD|201885 pfam01608, I_LWEQ, I/LWEQ domain. I/LWEQ domains bind to actin. It
has been shown that the I/LWEQ domains from mouse talin
and yeast Sla2p interact with F-actin. I/LWEQ domains
can be placed into four major groups based on sequence
similarity: (1) Metazoan talin; (2) Dictyostelium
TalA/TalB and SLA110; (3) metazoan Hip1p; and (4) yeast
Sla2p. The domain has four conserved blocks, the name of
the domain is derived from the initial conserved amino
acid of each of the four blocks.
Length = 194
Score = 29.5 bits (67), Expect = 1.7
Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 6/66 (9%)
Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL-EARQ 299
A ++ EK+ + L +L + K + ME Q I +LE +L+ A++ L E R+
Sbjct: 134 AVKQGLEKEEDQEDVDLSKLTLHEGKTQE--MEQQVEILKLE---NELEEARKRLGEIRK 188
Query: 300 TELQLE 305
+ +
Sbjct: 189 AHYRDD 194
>gnl|CDD|183211 PRK11578, PRK11578, macrolide transporter subunit MacA;
Provisional.
Length = 370
Score = 30.1 bits (68), Expect = 1.7
Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 11/71 (15%)
Query: 256 RLKQLEVEMA---KRDQ-----DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+LK L V + K+DQ D +A+ I+ +E L +L+A +++ EA EL+L
Sbjct: 71 QLKTLSVAIGDKVKKDQLLGVIDPEQAENQIKEVEATLMELRAQRQQAEA---ELKLARV 127
Query: 308 PIRNQSNLNKV 318
+ Q L K
Sbjct: 128 TLSRQQRLAKT 138
>gnl|CDD|226844 COG4420, COG4420, Predicted membrane protein [Function unknown].
Length = 191
Score = 29.3 bits (66), Expect = 1.8
Identities = 16/83 (19%), Positives = 41/83 (49%), Gaps = 4/83 (4%)
Query: 218 MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
+ +Q R+ + + R +L ++ KAE++ E+L +L +++ +L + +E+
Sbjct: 108 LMSQNRQAERDRL--RAELDYQVN--LKAEQEVAALHEKLDELRLDLGYVRDELDDLREL 163
Query: 278 IRRLEEQLKQLQAAKEELEARQT 300
+ +E +L +A + + Q
Sbjct: 164 LAEIEPELADEEALRRAVRGGQH 186
>gnl|CDD|133382 cd01107, HTH_BmrR, Helix-Turn-Helix DNA binding domain of the BmrR
transcription regulator. Helix-turn-helix (HTH)
multidrug-efflux transporter transcription regulator,
BmrR and YdfL of Bacillus subtilis, and related
proteins; N-terminal domain. Bmr is a membrane protein
which causes the efflux of a variety of toxic substances
and antibiotics. BmrR is comprised of two distinct
domains that harbor a regulatory (effector-binding) site
and an active (DNA-binding) site. The conserved
N-terminal domain contains a winged HTH motif that
mediates DNA binding, while the C-terminal domain binds
coactivating, toxic compounds. BmrR shares the
N-terminal DNA binding domain with other transcription
regulators of the MerR superfamily that promote
transcription by reconfiguring the spacer between the
-35 and -10 promoter elements.
Length = 108
Score = 28.3 bits (64), Expect = 1.8
Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 5/68 (7%)
Query: 237 QLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA--QEMIRRLEEQLKQLQAAKEE 294
QLE R K + +E +K++ + + D + +E + LE ++++LQ
Sbjct: 44 QLERLNRIKYLRDLGFPLEEIKEI---LDADNDDELRKLLREKLAELEAEIEELQRILRL 100
Query: 295 LEARQTEL 302
LE R ++
Sbjct: 101 LEDRLKQI 108
>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57. The
CLD or centrosome localisation domain of Cep57 is found
at the N-terminus, and lies approximately between
residues 58 and 239. This region lies within the first
alpha-helical coiled-coil segment of Cep57, and
localises to the centrosome internally to gamma-tubulin,
suggesting that it is either on both centrioles or on a
centromatrix component. This N-terminal region can also
multimerise with the N-terminus of other Cep57
molecules. The C-terminal part, Family Cep57_MT_bd,
pfam06657, is the microtubule-binding region of Cep57.
Length = 178
Score = 29.4 bits (66), Expect = 1.8
Identities = 29/104 (27%), Positives = 53/104 (50%), Gaps = 13/104 (12%)
Query: 204 YMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE 263
YMRR M A +E+ A +Q+ LQ E +E + Q +E+L+ LE E
Sbjct: 82 YMRR----------MVENAEKERTAVLEQQVSLQRE---KEHDQMDVQAQLEKLEVLEKE 128
Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+ + A+ I++LEE+L++ + ++ ++ + ELQ L+
Sbjct: 129 YLRLTRTQSLAETKIQQLEEKLQEEEHQRKLVQDKAAELQTGLE 172
>gnl|CDD|238248 cd00435, ACBP, Acyl CoA binding protein (ACBP) binds thiol esters
of long fatty acids and coenzyme A in a one-to-one
binding mode with high specificity and affinity.
Acyl-CoAs are important intermediates in fatty lipid
synthesis and fatty acid degradation and play a role in
regulation of intermediary metabolism and gene
regulation. The suggested role of ACBP is to act as a
intracellular acyl-CoA transporter and pool former.
ACBPs are present in a large group of eukaryotic
species and several tissue-specific isoforms have been
detected.
Length = 85
Score = 28.1 bits (63), Expect = 1.9
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 27 WKEHKGMLREDAMMEYLKIAQDL 49
W KGM +EDAM Y+ ++L
Sbjct: 58 WNSLKGMSKEDAMKAYIAKVEEL 80
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 218
Score = 29.6 bits (67), Expect = 1.9
Identities = 16/36 (44%), Positives = 23/36 (63%)
Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
DQ+ EA++ I +LEEQLK+L + L R+ EL
Sbjct: 2 EDQEFEEAKDYILKLEEQLKKLSKQAQRLVKRRREL 37
Score = 27.3 bits (61), Expect = 9.5
Identities = 19/72 (26%), Positives = 32/72 (44%), Gaps = 8/72 (11%)
Query: 236 LQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
L L+ ++ A KK Q +E+LK + +E+ LEE L+ A++
Sbjct: 117 LTLQSLKKDLASKKAQ--LEKLKAAPGIKPAK------VEELEEELEEAESALEEARKRY 168
Query: 296 EARQTELQLELQ 307
E L+ EL+
Sbjct: 169 EEISERLKEELK 180
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 29.7 bits (67), Expect = 1.9
Identities = 20/96 (20%), Positives = 45/96 (46%), Gaps = 8/96 (8%)
Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
A++ + Q Q E+ A+ K + Q Q ++E A+ + + E ++ ++
Sbjct: 50 AKKAFELSKMQEKTRQAELEAKIKEYEAQQA------QAKLERARVEAE--ERRKTLQEQ 101
Query: 282 EEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
+Q +Q ++EL ++ + +LE Q +N+ L
Sbjct: 102 TQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKM 137
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 29.6 bits (67), Expect = 1.9
Identities = 24/94 (25%), Positives = 37/94 (39%), Gaps = 28/94 (29%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEV----------EMAKRDQDL------------ 271
D + E+A ++A E+ + LK+L V EM K D+ +
Sbjct: 70 DDFKRELAFYKQALAAVLEARKLLKKLGVPFKRPDDYFAEMVKSDEHMQKVRKKLLAEAA 129
Query: 272 -MEAQEMIRRLEEQL---KQLQAAKEELEARQTE 301
+A E R+ E KQ+Q K L+ R E
Sbjct: 130 AKKASEEARKQRELKKFGKQVQVEK--LQERAKE 161
>gnl|CDD|177954 PLN02320, PLN02320, seryl-tRNA synthetase.
Length = 502
Score = 29.9 bits (67), Expect = 2.0
Identities = 23/67 (34%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE--QLKQ-LQAAKEELEARQTELQLELQ 307
Q+ VERL+ +A + + +E E +EE LK+ L +E+L ELQLE Q
Sbjct: 106 QKEVERLRAERNAVANKMKGKLEPSERQALVEEGKNLKEGLVTLEEDLVKLTDELQLEAQ 165
Query: 308 PIRNQSN 314
I N ++
Sbjct: 166 SIPNMTH 172
>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242).
Length = 401
Score = 29.9 bits (67), Expect = 2.1
Identities = 23/103 (22%), Positives = 40/103 (38%), Gaps = 22/103 (21%)
Query: 221 QAREEKNAKQQQRDKLQL-------------EIAAREKAEKKHQES-VERLK-------- 258
RE +Q++ D+L+L + + K + E LK
Sbjct: 89 NIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRYQKLEELSRHLKEENQCLQI 148
Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
QL+ + + ++ + E QE+ R L E L Q +E +A E
Sbjct: 149 QLDALVQECNEKIEENQELNRELAETLAYQQELNDEYQATFVE 191
>gnl|CDD|222255 pfam13600, DUF4140, N-terminal domain of unknown function
(DUF4140). This family is often found at the N-terminus
of its member proteins, with DUF4139, pfam13598, at the
C-terminus.
Length = 101
Score = 28.0 bits (63), Expect = 2.1
Identities = 11/48 (22%), Positives = 22/48 (45%)
Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
L V + + + LEE+L+ L+ LEA + L+ +++
Sbjct: 51 LSVSVRRDYLPDEATSPALAELEEELRALEDELAALEAEKEALEAQIK 98
Score = 27.3 bits (61), Expect = 4.4
Identities = 15/43 (34%), Positives = 22/43 (51%)
Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
+ L E +E +R LE++L L+A KE LEA+ L
Sbjct: 59 YLPDEATSPALAELEEELRALEDELAALEAEKEALEAQIKFLS 101
Score = 26.5 bits (59), Expect = 7.3
Identities = 7/35 (20%), Positives = 12/35 (34%)
Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
L +L+ LE L+ E + + Q
Sbjct: 65 TSPALAELEEELRALEDELAALEAEKEALEAQIKF 99
>gnl|CDD|218703 pfam05700, BCAS2, Breast carcinoma amplified sequence 2 (BCAS2).
This family consists of several eukaryotic sequences of
unknown function. The mammalian members of this family
are annotated as breast carcinoma amplified sequence 2
(BCAS2) proteins. BCAS2 is a putative spliceosome
associated protein.
Length = 221
Score = 29.4 bits (66), Expect = 2.1
Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 230 QQQRDKLQLEIAAREKAEKKHQESV-ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
+++ +L+ EI + K QE E+L+ LE +E + LE++++QL
Sbjct: 149 EKELAELKEEIEEVNRQRKYSQEEAGEKLRSLEQRWVDLVSKNLEIEVACAELEQEIEQL 208
Query: 289 QAAKEELE 296
+ K EL+
Sbjct: 209 KRKKAELK 216
>gnl|CDD|147845 pfam05914, RIB43A, RIB43A. This family consists of several
RIB43A-like eukaryotic proteins. Ciliary and flagellar
microtubules contain a specialised set of
protofilaments, termed ribbons, that are composed of
tubulin and several associated proteins. RIB43A was
first characterized in the unicellular biflagellate,
Chlamydomonas reinhardtii although highly related
sequences are present in several higher eukaryotes
including humans. The function of this protein is
unknown although the structure of RIB43A and its
association with the specialised protofilament ribbons
and with basal bodies is relevant to the proposed role
of ribbons in forming and stabilising doublet and
triplet microtubules and in organising their
three-dimensional structure. Human RIB43A homologues
could represent a structural requirement in centriole
replication in dividing cells.
Length = 379
Score = 29.6 bits (67), Expect = 2.2
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 3/80 (3%)
Query: 240 IAAREKAEKKHQESVERLKQLEVEMAKR-DQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
A R+ E++ QE ER ++ E K D+ + LE Q ++L+ + +
Sbjct: 283 AAIRKGQEQQLQEK-ERRREEEQLREKEWDRQAINQARAAVLLERQERRLRKELRK-QLD 340
Query: 299 QTELQLELQPIRNQSNLNKV 318
QL + Q LNK
Sbjct: 341 SANQQLAKEQKAQQEYLNKE 360
Score = 29.2 bits (66), Expect = 2.9
Identities = 20/94 (21%), Positives = 44/94 (46%), Gaps = 5/94 (5%)
Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
Q ++ + +QQ +++ Q + AA ++A+ H + L E+A +++ + + +
Sbjct: 157 QEQQREWLEQQVQERQQAK-AAEKQADTLHDQLRVELDHRAQELASLEEEC--RRAVEQA 213
Query: 281 LEEQLKQL--QAAKEELEARQTELQLELQPIRNQ 312
+ K L + + E +Q E + L I NQ
Sbjct: 214 TADFNKALAEEQRERERREKQEEQEDNLAEIYNQ 247
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 29.7 bits (67), Expect = 2.2
Identities = 18/87 (20%), Positives = 43/87 (49%), Gaps = 6/87 (6%)
Query: 233 RDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQ----LKQL 288
+ L+ + E+ EK + ++ K+ + ++ QDL E ++ + ++++ +L
Sbjct: 157 GEDLK-TVETAEEIEKSLKAMQQKWKKRKKDLPN-KQDLEEYKKRLEAIKKKDIKNPLEL 214
Query: 289 QAAKEELEARQTELQLELQPIRNQSNL 315
Q KEE + + E + + Q I++ N
Sbjct: 215 QKIKEEFDKLKKEGKADKQKIKSAKND 241
>gnl|CDD|215101 PLN00203, PLN00203, glutamyl-tRNA reductase.
Length = 519
Score = 29.7 bits (67), Expect = 2.2
Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 7/67 (10%)
Query: 254 VERLKQL-EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT--ELQLELQPIR 310
V+ LK++ R + MEAQ +IR E+ K +A ++ LE T +L+ + IR
Sbjct: 389 VDDLKEVVAANKEDRLRKAMEAQTIIR---EESKNFEAWRDSLETVPTIKKLRSYAERIR 445
Query: 311 NQSNLNK 317
+ L K
Sbjct: 446 -AAELEK 451
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 29.8 bits (68), Expect = 2.3
Identities = 26/104 (25%), Positives = 50/104 (48%), Gaps = 13/104 (12%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESV------ERLKQLEVEMAKR 267
++Q++ EEKN ++ ++ K + + ++L+ LE E ++
Sbjct: 127 ELQELLES--EEKNREEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLENLEEEFSQF 184
Query: 268 DQ-----DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
+ D +EA+E++ +LEE+L L+ EE+ ELQ EL
Sbjct: 185 VELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTEL 228
Score = 29.8 bits (68), Expect = 2.3
Identities = 20/94 (21%), Positives = 46/94 (48%), Gaps = 17/94 (18%)
Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA-------- 290
E+ + + EK+ + ++ ++ +A+++ E QE + + +QL++++
Sbjct: 346 ELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEM 405
Query: 291 ----AKEELEARQT--ELQLELQPIR---NQSNL 315
K+ELEAR+ + +L I+ +SNL
Sbjct: 406 LQGLRKDELEAREKLERYRNKLHEIKRYLEKSNL 439
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 29.3 bits (66), Expect = 2.3
Identities = 20/66 (30%), Positives = 36/66 (54%), Gaps = 5/66 (7%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
DKL E R + E++ + + + +K+LE R D E++ RLEE LK+LQ+ +
Sbjct: 111 DKLDEEQEERVEKEREEELAGDAMKKLE----NRTADSKREMEVLERLEE-LKELQSRRA 165
Query: 294 ELEARQ 299
+++
Sbjct: 166 DVDVNS 171
>gnl|CDD|218532 pfam05276, SH3BP5, SH3 domain-binding protein 5 (SH3BP5). This
family consists of several eukaryotic SH3 domain-binding
protein 5 or c-Jun N-terminal kinase (JNK)-interacting
proteins (SH3BP5 or Sab). Sab binds to and serves as a
substrate for JNK in vitro, and has been found to
interact with the Src homology 3 (SH3) domain of
Bruton's tyrosine kinase (Btk). Inspection of the
sequence of Sab reveals the presence of two putative
mitogen-activated protein kinase interaction motifs
(KIMs) similar to that found in the JNK docking domain
of the c-Jun transcription factor, and four potential
serine-proline JNK phosphorylation sites in the
C-terminal half of the molecule.
Length = 240
Score = 29.4 bits (66), Expect = 2.4
Identities = 14/65 (21%), Positives = 26/65 (40%)
Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
R +AEK+H L E+ + + ++ L A R E + E + EL
Sbjct: 137 RTRAEKEHASEEAELLVAELRLRQLEKILKRAINKSRPYFELKQVYTQQLEIQKLLVQEL 196
Query: 303 QLELQ 307
+ ++
Sbjct: 197 EAKVA 201
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 29.6 bits (67), Expect = 2.4
Identities = 19/94 (20%), Positives = 43/94 (45%), Gaps = 4/94 (4%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+ + +++ Q +KL I E+ + E ++L +L+ +++ Q L+
Sbjct: 296 EGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLIT 355
Query: 274 AQEMIRRLEEQLKQLQAA----KEELEARQTELQ 303
+ ++++ +++LQA EEL Q EL
Sbjct: 356 LVDKAKKVKAAIEELQAEFVDNAEELAKLQDELD 389
Score = 29.2 bits (66), Expect = 3.4
Identities = 25/109 (22%), Positives = 47/109 (43%), Gaps = 19/109 (17%)
Query: 207 RRK--PDTIDVQ---QM----KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL 257
RRK D +D+ +M K + RE QQ L ++I ++ K + +++E
Sbjct: 152 RRKLVEDLLDISVLSEMDKLNKDKIRE----LNQQIQTLDMKIDHIQQQIKTYNKNIEEQ 207
Query: 258 KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
+ K +++ Q L E+ K ++A EEL L +++
Sbjct: 208 R------KKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDI 250
Score = 27.7 bits (62), Expect = 8.7
Identities = 19/119 (15%), Positives = 45/119 (37%), Gaps = 20/119 (16%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR-------- 267
Q + EE + + ++L E+ + ++ +L ++ +
Sbjct: 219 QNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVI 278
Query: 268 ------------DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
Q + E + I +++++LK+LQ + E+L+ EL+ + QS
Sbjct: 279 KMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSK 337
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 28.7 bits (65), Expect = 2.5
Identities = 22/99 (22%), Positives = 52/99 (52%), Gaps = 7/99 (7%)
Query: 213 IDVQQMKAQAREEKNAKQQ---QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ 269
+DVQ++ ++ K A++Q + K Q E+ EK +K +E +++ E A+ +
Sbjct: 4 VDVQKILQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKK 63
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
+E+ ++++E ++ Q +++L+ RQ E ++
Sbjct: 64 ----EKELQKKVQEFQRKQQKLQQDLQKRQQEELQKILD 98
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 29.6 bits (66), Expect = 2.5
Identities = 28/99 (28%), Positives = 47/99 (47%), Gaps = 13/99 (13%)
Query: 215 VQQMKAQAREEKNAKQQ---QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR---- 267
+Q++ A+ + + Q+ QR+KL++E+ EK + Q S R + E+ A R
Sbjct: 373 IQKLSAELLKLEEWLQEERSQREKLEVELG-IEKDCNRVQLSENRRELSELRSALRVLQK 431
Query: 268 --DQDLMEAQEM---IRRLEEQLKQLQAAKEELEARQTE 301
+Q E QE+ IR LE +L + K +A E
Sbjct: 432 EKEQLQEEKQELLDYIRVLELRLDKEADEKWAEDAATCE 470
Score = 27.6 bits (61), Expect = 9.3
Identities = 28/162 (17%), Positives = 64/162 (39%), Gaps = 28/162 (17%)
Query: 179 DFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKL-- 236
D + P+ + + L C M++ DV M+++ + + A + +K
Sbjct: 130 DLLLVVPKATILQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEE 189
Query: 237 ------------QLEIAAREKAEKKHQESVERLKQLEVEMA-----KRDQDLM------- 272
+ A R + E+ +R+ +LE ++ K++ D +
Sbjct: 190 LEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDI 249
Query: 273 --EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
E + M L+++LK++ + + E + +LQLE + +
Sbjct: 250 EAELERMKGELKQRLKKMTIQRRDEETERIDLQLENEQLHED 291
>gnl|CDD|111811 pfam02963, EcoRI, Restriction endonuclease EcoRI.
Length = 257
Score = 29.1 bits (65), Expect = 2.6
Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 246 AEKKHQ-ESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
AE KHQ + +E ++ + ++DQDLM A I R + + ++
Sbjct: 94 AEAKHQGKDIENIRNGLLVGKRKDQDLMAAGNAIERSHKNISEI 137
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 29.5 bits (66), Expect = 2.6
Identities = 25/103 (24%), Positives = 42/103 (40%), Gaps = 5/103 (4%)
Query: 220 AQAREEKNAKQQQRD----KLQLEIAAREKAEKKHQESVE-RLKQLEVEMAKRDQDLMEA 274
A+A EE + + D KLQ E + K K+ ++ L +++ D +L
Sbjct: 131 AEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLL 190
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
+ + L EQL++ Q E E + L E + S K
Sbjct: 191 ESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIK 233
Score = 28.4 bits (63), Expect = 6.8
Identities = 24/109 (22%), Positives = 46/109 (42%), Gaps = 1/109 (0%)
Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
N+E + R + ++ ++ ++ +N + ++ +L + K Q E +
Sbjct: 101 NYERELDRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLK 160
Query: 260 LEVEMAKR-DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
E R DL E Q + + +LK L++ EEL + E Q EL
Sbjct: 161 EAKESISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELA 209
>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil. Members of
this family are predominantly found in the
actin-bundling protein Cortexillin I from Dictyostelium
discoideum. They adopt a structure consisting of an
18-heptad-repeat alpha-helical coiled-coil, and are a
prerequisite for the assembly of Cortexillin I.
Length = 107
Score = 28.0 bits (62), Expect = 2.6
Identities = 26/90 (28%), Positives = 47/90 (52%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+ +++A E N + L+ E A++E+ K+ + L LE + A+R+Q L E
Sbjct: 3 EKARLEASKNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQRLRE 62
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
Q I L + L+ + AK ELE+R ++ +
Sbjct: 63 LQAKIDELLKNLELEKLAKMELESRLSKTE 92
>gnl|CDD|145995 pfam03148, Tektin, Tektin family. Tektins are cytoskeletal
proteins. They have been demonstrated in such cellular
sites as centrioles, basal bodies, and along ciliary and
flagellar doublet microtubules. Tektins form unique
protofilaments, organised as longitudinal polymers of
tektin heterodimers with axial periodicity matching
tubulin. Tektin polypeptides consist of several
alpha-helical regions that are predicted to form coiled
coils. Indeed, tektins share considerable structural
similarities with intermediate filament proteins.
Possible functional roles for tektins are: stabilisation
of tubulin protofilaments; attachment of A and B-tubules
in ciliary/flagellar microtubule doublets and C-tubules
in centrioles; binding of axonemal components.
Length = 384
Score = 29.5 bits (67), Expect = 2.7
Identities = 23/109 (21%), Positives = 47/109 (43%), Gaps = 24/109 (22%)
Query: 223 REEKNAKQQ---QRDKLQLEIAAREKAEKKHQESV----ERLKQLEVEMAKRDQ------ 269
E + AK + Q K EIA +EK + ++++ LK + + R Q
Sbjct: 254 SETEEAKNKLEWQLKKTLEEIAQQEKNIEALEKAIRDKEGPLKVAQTRLENRTQRPNVEL 313
Query: 270 -----------DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
++ E ++ I L+E+L + +A+ + LE + L+ ++
Sbjct: 314 CRDQAQYRLIDEVKELEDTIAALKEKLAEAEASLKALEETRARLEEDIA 362
>gnl|CDD|191382 pfam05837, CENP-H, Centromere protein H (CENP-H). This family
consists of several eukaryotic centromere protein H
(CENP-H) sequences. Macromolecular
centromere-kinetochore complex plays a critical role in
sister chromatid separation, but its complete protein
composition as well as its precise dynamic function
during mitosis has not yet been clearly determined.
CENP-H contains a coiled-coil structure and a nuclear
localisation signal. CENP-H is specifically and
constitutively localised in kinetochores throughout the
cell cycle. CENP-H may play a role in kinetochore
organisation and function throughout the cell cycle.
This the C-terminus of the region, which is conserved
from fungi to humans.
Length = 106
Score = 28.0 bits (63), Expect = 2.8
Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 5/73 (6%)
Query: 236 LQLEIAAREKAEKKHQESVERL--KQLEVEMAKRDQDLMEAQEMIRR---LEEQLKQLQA 290
L L+ A+RE E+ ERL KQ VE+A +L + +E R L+EQL++L+A
Sbjct: 6 LNLQQASRETREELSDLEKERLQLKQKNVELALELLELTKKKESWREDMELKEQLEKLEA 65
Query: 291 AKEELEARQTELQ 303
++ +A+ ++
Sbjct: 66 DLKKSKAKWEVMK 78
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 29.6 bits (66), Expect = 2.8
Identities = 24/102 (23%), Positives = 44/102 (43%), Gaps = 7/102 (6%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAA--REKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+ + AKQ LQ AA + A ++H + V+ + ++ D +
Sbjct: 732 AMLSRKVHSI--AKQGMTGALQALGAAHFPQVAPEQHDDIVDIERIEHRRQLQKRIDAVN 789
Query: 274 AQEMIRRLEEQL-KQLQAAKEELEARQTELQLELQPIRNQSN 314
A+ +RRL E++ ++ AK+E A +E+ EL I
Sbjct: 790 AR--LRRLREEIIGRMSDAKKEDTAALSEVGAELDDIPEYLA 829
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 29.4 bits (66), Expect = 2.9
Identities = 26/127 (20%), Positives = 54/127 (42%), Gaps = 18/127 (14%)
Query: 194 LALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQE 252
L +G L + R+ D + + + EE +++ +L+ E R E+ + Q
Sbjct: 117 LCNELGEPPLSLLRKDADPLSLPNL-----EELEHFRERLGELREEKVRRLEEVDSLKQS 171
Query: 253 SVERLKQLEVEMAKRD--QDLME----------AQEMIRRLEEQLKQLQAAKEELEARQT 300
L A+ D QD++ +E I +L++ L+ L A K++ +
Sbjct: 172 IKSLCSLLGTPPARTDFEQDVLSYGEIPNDHCVTRETIDKLDKMLENLAAQKKQRADKID 231
Query: 301 ELQLELQ 307
+L+ ++Q
Sbjct: 232 DLREQIQ 238
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 29.2 bits (66), Expect = 2.9
Identities = 13/78 (16%), Positives = 39/78 (50%), Gaps = 2/78 (2%)
Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
+A + + ERL++ E + ++ Q ++++ + L ++++QL++ +++L+
Sbjct: 74 RLAKLISENEALKAENERLQKREQSIDQQIQQAVQSET--QELTKEIEQLKSERQQLQGL 131
Query: 299 QTELQLELQPIRNQSNLN 316
+LQ L + +
Sbjct: 132 IDQLQRRLAGVLTGPSGG 149
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 28.7 bits (64), Expect = 2.9
Identities = 24/94 (25%), Positives = 49/94 (52%), Gaps = 2/94 (2%)
Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQ 269
+ +++ +++ E + ++ L R E+ + K +E ERL++LE E+ + +
Sbjct: 112 EIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKEL-EELE 170
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
D + E + +E+ K+L+ KEELE EL+
Sbjct: 171 DEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204
>gnl|CDD|213395 cd12152, F1-ATPase_delta, mitochondrial ATP synthase delta subunit.
The F-ATPase is found in bacterial plasma membranes,
mitochondrial inner membranes and in chloroplast
thylakoid membranes. It has also been found in the
archaea Methanosarcina barkeri. It uses a proton
gradient to drive ATP synthesis and hydrolyzes ATP to
build the proton gradient. The extrinisic membrane
domain, F1, is composed of alpha, beta, gamma, delta,
and epsilon subunits with a stoichiometry of 3:3:1:1:1.
Alpha and beta subunit form the globular catalytic
moiety, a hexameric ring of alternating subunits. Gamma,
delta and epsilon subunits form a stalk, connecting F1
to F0, the integral membrane proton translocating
domain. In bacteria, which is lacking a eukaryotic
epsilon subunit homolog, this subunit is called the
epsilon subunit.
Length = 123
Score = 28.2 bits (64), Expect = 2.9
Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 3/36 (8%)
Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
D+ A+E + R EE+L Q AK+E E + E LE
Sbjct: 87 DVERAEEALERAEERLAQ---AKDEREKARAEAALE 119
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 28.7 bits (65), Expect = 2.9
Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 2/70 (2%)
Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERL-KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQ 287
Q +KL+ + +K ++ ++ + L Q+E R++ E E++ L++ K+
Sbjct: 60 PSQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETE-ERTELLEELKQLEKE 118
Query: 288 LQAAKEELEA 297
L+ K ELE
Sbjct: 119 LKKLKAELEK 128
Score = 27.6 bits (62), Expect = 7.5
Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 15/81 (18%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
++K + + K ++ + ++ A EK +K +E+ ER + LE + Q
Sbjct: 65 NKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLE----ELKQ------ 114
Query: 276 EMIRRLEEQLKQLQAAKEELE 296
LE++LK+L+A E+ E
Sbjct: 115 -----LEKELKKLKAELEKYE 130
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 29.2 bits (66), Expect = 2.9
Identities = 15/51 (29%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 230 QQQRDKLQLEIAAREKAEKKHQES--VERLKQLEVEMAKRDQDLMEAQEMI 278
Q++ +KL+ ++A E+ +K+ S +L +LE ++A ++ + EA+E+I
Sbjct: 248 QEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELI 298
>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein. This
is the P3 protein section of the Potyviridae
polyproteins. The function is not known except that the
protein is essential to viral survival.
Length = 445
Score = 29.1 bits (66), Expect = 3.0
Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 10/92 (10%)
Query: 215 VQQMKAQAREEKNAKQQ-QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
+ + + R+ K + + +K + EI K + + LE K + L
Sbjct: 265 LVSIINEHRKLKREAAEEKEEKEEKEIRKLYLYSKLEGKLPTEEEFLEYIEEKNPELLET 324
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
A+E+I EE+++ QA E LE
Sbjct: 325 AEELIE--EEEVEH-QA------KTADEKNLE 347
>gnl|CDD|234937 PRK01294, PRK01294, lipase chaperone; Provisional.
Length = 336
Score = 29.3 bits (66), Expect = 3.0
Identities = 17/102 (16%), Positives = 38/102 (37%), Gaps = 16/102 (15%)
Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEI----------AAREKAEKKHQESV- 254
P +D+Q ++ + + +++ A E+ S+
Sbjct: 145 EDDGPGKLDLQALQQLLDARLALRARFFSDWEIQAFFGEENQYQRYALERLRIAQDPSLS 204
Query: 255 -----ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAA 291
RL LE ++ + + ++ + + L +QL QLQA+
Sbjct: 205 DAQKAARLAALEAQLPEDLRAALQESQRQQALLQQLAQLQAS 246
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 29.2 bits (66), Expect = 3.1
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 5/65 (7%)
Query: 255 ERLKQLEVEM--AKRDQDLMEA---QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPI 309
E L +L E+ D+ L EA + L+E++ +L A + LE EL+ +
Sbjct: 6 EALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAAS 65
Query: 310 RNQSN 314
Sbjct: 66 GEGGG 70
>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal. This domain is the C-terminal
region of the CASP family of proteins. It is a Golgi
membrane protein which is thought to have a role in
vesicle transport.
Length = 245
Score = 28.9 bits (65), Expect = 3.1
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 257 LKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
L++L+ E++ + L E QE+ +LE L +Q
Sbjct: 1 LEELQKELSSLNAKLEEQQELNAKLENDLATVQDEIS 37
>gnl|CDD|220215 pfam09379, FERM_N, FERM N-terminal domain. This domain is the
N-terminal ubiquitin-like structural domain of the FERM
domain.
Length = 79
Score = 27.2 bits (61), Expect = 3.1
Identities = 8/19 (42%), Positives = 9/19 (47%)
Query: 91 QDVKKENPLQFKFRAKFYP 109
Q + P FR KFYP
Sbjct: 61 QGKDGDPPFTLFFRVKFYP 79
>gnl|CDD|235179 PRK03947, PRK03947, prefoldin subunit alpha; Reviewed.
Length = 140
Score = 28.4 bits (64), Expect = 3.1
Identities = 24/125 (19%), Positives = 54/125 (43%), Gaps = 25/125 (20%)
Query: 211 DTIDVQQMKAQAR---EEKNAKQQQRDKLQLEIAAREKAE------KKHQESVERL---- 257
+++++ AQ + + A QQQ ++LQ I + A+ K E E L
Sbjct: 4 SEQELEELAAQLQALQAQIEALQQQLEELQASINELDTAKETLEELKSKGEGKETLVPIG 63
Query: 258 --KQLEVEMAKRDQDLM----------EAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
++ ++ +D+ ++ + E I L+++ ++L+ A E+LE +L
Sbjct: 64 AGSFVKAKVKDKDKVIVSLGAGYSAEKDLDEAIEILDKRKEELEKALEKLEEALQKLASR 123
Query: 306 LQPIR 310
+ +
Sbjct: 124 IAQLA 128
>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
Synaptonemal complex protein 1 (SCP-1) is the major
component of the transverse filaments of the
synaptonemal complex. Synaptonemal complexes are
structures that are formed between homologous
chromosomes during meiotic prophase.
Length = 787
Score = 29.3 bits (65), Expect = 3.2
Identities = 26/103 (25%), Positives = 54/103 (52%), Gaps = 4/103 (3%)
Query: 209 KPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKH----QESVERLKQLEVEM 264
K D +Q ++ + ++E N K++Q L ++I +E K +ES ++ QLE +
Sbjct: 218 KEDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKT 277
Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+D++L E E L ++L+ ++ + + + Q L+ +LQ
Sbjct: 278 KLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQ 320
>gnl|CDD|221982 pfam13206, VSG_B, Trypanosomal VSG domain. This family represents
the B-type variant surface glycoproteins from
trypanosomal parasites. This family is related to
pfam00913.
Length = 253
Score = 29.0 bits (65), Expect = 3.2
Identities = 7/30 (23%), Positives = 19/30 (63%)
Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
+++L++ E+ +R++ EA+ + +LE
Sbjct: 223 LKKLEEAATELEEREKAAQEAKALAAQLEA 252
>gnl|CDD|204831 pfam12137, RapA_C, RNA polymerase recycling family C-terminal.
This domain is found in bacteria. This domain is about
360 amino acids in length. This domain is found
associated with pfam00271, pfam00176. The function of
this domain is not known, but structurally it forms an
alpha-beta fold in nature with a central beta-sheet
flanked by helices and loops, the beta-sheet being
mainly antiparallel and flanked by four alpha helices,
among which the two longer helices exhibit a coiled-coil
arrangement.
Length = 362
Score = 29.0 bits (66), Expect = 3.2
Identities = 26/98 (26%), Positives = 46/98 (46%), Gaps = 20/98 (20%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL-EVEMAKRDQDLM 272
++Q+ +A EK A++Q ++ A+++ +++ + RLK L V RD++
Sbjct: 279 QIEQLLQKA--EKLAEEQLE---EIIEQAKQQMQQQLSAELARLKALKAVNPNIRDEE-- 331
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
I LE+Q +EEL + QL L IR
Sbjct: 332 -----IEALEQQ-------REELLTALDKAQLRLDAIR 357
>gnl|CDD|222588 pfam14198, TnpV, Transposon-encoded protein TnpV. This family of
proteins is functionally uncharacterized. This family of
proteins is found in bacteria and viruses. Proteins in
this family are typically between 114 and 125 amino
acids in length.
Length = 111
Score = 27.9 bits (63), Expect = 3.2
Identities = 8/28 (28%), Positives = 17/28 (60%)
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQT 300
+A+E + R+ +Q+ + + EEL+A
Sbjct: 61 QAEERLERIVKQMAEKEGVTEELKAEDQ 88
>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain. Mitosin or
centromere-associated protein-F (Cenp-F) is found bound
across the centromere as one of the proteins of the
outer layer of the kinetochore. Most of the
kinetochore/centromere functions appear to depend upon
binding of the C-terminal par to f the molecule, whereas
the N-terminal part, here, may be a cytoplasmic player
in controlling the function of microtubules and dynein.
Length = 288
Score = 29.0 bits (64), Expect = 3.3
Identities = 21/89 (23%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
+Q+++ Q + K KQQ+ Q ++ + E A +K ++ VE K + + +Q LME
Sbjct: 20 IQELEGQLDKLKKEKQQR----QFQLDSLEAALQKQKQKVENEKTEGANLKRENQRLMEI 75
Query: 275 QEMIRRLEEQLK-QLQAAKEELEARQTEL 302
E + + ++++ +LQ + ++ ++ +L
Sbjct: 76 CEHLEKTKQKISHELQVKESQVNFQEGQL 104
>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
kinases [Cell motility and secretion / Signal
transduction mechanisms].
Length = 716
Score = 29.2 bits (66), Expect = 3.3
Identities = 27/107 (25%), Positives = 46/107 (42%), Gaps = 14/107 (13%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
DV +A+ EK+A + + A + A S + V++ K D+ +
Sbjct: 286 DVLAAQAEPLAEKSAAEAAKLSALEAAPAAKAAAAAAGAS----SSIRVDVEKLDRLMNL 341
Query: 274 AQEMI---RRLEEQLKQLQAAKEELE--ARQ-----TELQLELQPIR 310
E++ RL + +L +EEL+ RQ T+LQ E+ IR
Sbjct: 342 VGELVITRSRLSQLASELDEVREELDEALRQLSRLTTDLQDEVMKIR 388
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins in
this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 28.2 bits (63), Expect = 3.5
Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 3/80 (3%)
Query: 234 DKLQLEIA-AREKAEKKH--QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
D +L REK +K Q+ +E +LE E + + + + +E +R+LE + K L
Sbjct: 58 DNERLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLELKAKNLSD 117
Query: 291 AKEELEARQTELQLELQPIR 310
LE R+TEL+ E +
Sbjct: 118 QVSRLEERETELKKEYNSLH 137
>gnl|CDD|224455 COG1538, TolC, Outer membrane protein [Cell envelope biogenesis,
outer membrane / Intracellular trafficking and
secretion].
Length = 457
Score = 29.0 bits (65), Expect = 3.5
Identities = 16/80 (20%), Positives = 34/80 (42%), Gaps = 2/80 (2%)
Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ-DL 271
+Q AR+E + ++ A +A + QE++E L + + R D+
Sbjct: 352 AQYEQTVLTARQEVADALAALEAALEQLQALRQAVEAAQEALE-LARERYQAGVRTLLDV 410
Query: 272 MEAQEMIRRLEEQLKQLQAA 291
++AQ + + + L Q +
Sbjct: 411 LDAQRTLLQARQALLQARYD 430
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 29.2 bits (66), Expect = 3.5
Identities = 18/75 (24%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 226 KNAKQQQRDKLQLEIAAREKAE----KKHQESVER-LKQLEVEMAKRDQDLMEAQEMIRR 280
K K + + KL L + A K + +K ++ +E+ + LE E+ + + ++ ++ I+
Sbjct: 388 KKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKE 447
Query: 281 LEEQLKQLQAAKEEL 295
LE+QL ++ +E+
Sbjct: 448 LEKQLTNIEPTADEI 462
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 28.5 bits (64), Expect = 3.6
Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 255 ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
E++K+L+ M + ++ EAQE ++LK+LQ + E+ Q EL
Sbjct: 72 EKMKELQKMMKEFQKEFREAQE--SGDMKKLKKLQEKQMEMMDDQREL 117
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 28.9 bits (66), Expect = 3.7
Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 17/68 (25%)
Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ 289
+Q L+ E+AA KA+ E ER++ L E ++ LE++L+QL+
Sbjct: 697 NEQEALLK-ELAALLKAKPS--ELPERVEAL--------------LEELKELEKELEQLK 739
Query: 290 AAKEELEA 297
A A
Sbjct: 740 AKLAAAAA 747
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 29.2 bits (65), Expect = 3.8
Identities = 12/58 (20%), Positives = 22/58 (37%), Gaps = 5/58 (8%)
Query: 247 EKKHQESVERLKQLE-----VEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
++H +R + E +E K + L E QE + + E K+ E+
Sbjct: 459 PQEHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIAESEF 516
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 28.3 bits (64), Expect = 3.9
Identities = 22/82 (26%), Positives = 37/82 (45%), Gaps = 3/82 (3%)
Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
E K + R + + E+ R + Q +RL Q E + ++D+ L + +E + E+
Sbjct: 57 EAKEEIHKLRAEAERELKERRN---ELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEK 113
Query: 284 QLKQLQAAKEELEARQTELQLE 305
+L Q EE E EL E
Sbjct: 114 ELAARQQQLEEKEEELEELIEE 135
Score = 27.5 bits (62), Expect = 7.8
Identities = 23/88 (26%), Positives = 50/88 (56%), Gaps = 11/88 (12%)
Query: 208 RKPDTIDVQQMKAQAREEK-NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
RK ++++ ++ + +E++ A+QQQ ++ + E+ E+ ++ Q+ +ER+ L E AK
Sbjct: 96 RKDESLEKKEESLEEKEKELAARQQQLEEKEEEL---EELIEEQQQELERISGLTAEEAK 152
Query: 267 -------RDQDLMEAQEMIRRLEEQLKQ 287
+ EA +MI+ +EE+ K+
Sbjct: 153 EILLEEVEAELKHEAAKMIKEIEEEAKE 180
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 28.9 bits (65), Expect = 3.9
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 6/86 (6%)
Query: 218 MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL-KQLEVEMAKRDQDLMEAQE 276
MK +A+ E +++Q + L+ + + K K+ +S++RL K L+ E +R QD +
Sbjct: 128 MKGEAKVE-EEEEKQMEILEKALKSYLKIVKEENKSLQRLAKALQKESEERTQD---ETK 183
Query: 277 MIRRLEEQLKQLQAAKE-ELEARQTE 301
MI +++ L+ A E E E Q E
Sbjct: 184 MIEEYRDKIDALKNAIEVEKEGLQEE 209
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 28.6 bits (65), Expect = 4.0
Identities = 13/38 (34%), Positives = 25/38 (65%)
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+E LEEQ++QL+ +LEA +L+ EL+ ++++
Sbjct: 7 EERNSELEEQIRQLELKLRDLEAENEKLERELERLKSE 44
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 28.3 bits (64), Expect = 4.1
Identities = 12/48 (25%), Positives = 30/48 (62%)
Query: 255 ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
E+L LE E+ + ++L E +E + +L ++L +++ +++L + +L
Sbjct: 84 EKLHMLEEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKL 131
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 29.0 bits (65), Expect = 4.1
Identities = 23/93 (24%), Positives = 37/93 (39%), Gaps = 6/93 (6%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAARE-----KAEKKHQESVERLKQLEVEMAKRDQ 269
QQ QA + Q Q L L AR+ + ++ ++ +Q E+ R Q
Sbjct: 262 RQQALQQALAAEEKAQPQLAALSLAQPARQLRPHWERIQEQSAALAHTRQQIEEVNTRLQ 321
Query: 270 DLMEAQEMIRR-LEEQLKQLQAAKEELEARQTE 301
M + IR +Q +LQA ++ L E
Sbjct: 322 STMALRARIRHHAAKQSAELQAQQQSLNTWLAE 354
Score = 29.0 bits (65), Expect = 4.5
Identities = 15/77 (19%), Positives = 30/77 (38%), Gaps = 11/77 (14%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
Q+ AQ + Q++ + + + K+ + + +K + E +
Sbjct: 439 QKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKT-----------ICEQE 487
Query: 276 EMIRRLEEQLKQLQAAK 292
I+ LE Q QLQA +
Sbjct: 488 ARIKDLEAQRAQLQAGQ 504
>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
Provisional.
Length = 784
Score = 28.8 bits (64), Expect = 4.2
Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 2/80 (2%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
D LQ+E R + KK E +R L +M ++L E + E +++ AAK
Sbjct: 205 DLLQVEKRIRNRV-KKQMEKSQREYYLNEQMKAIQKELGEMDDAPDENEALKRKIDAAKM 263
Query: 294 ELEARQTELQLELQPIRNQS 313
EA++ + + ELQ ++ S
Sbjct: 264 PKEAKE-KAEAELQKLKMMS 282
>gnl|CDD|148363 pfam06717, DUF1202, Protein of unknown function (DUF1202). This
family consists of several hypothetical bacterial
proteins of around 335 residues in length. Members of
this family are found exclusively in Escherichia coli
and Salmonella species and are often referred to as YggM
proteins. The function of this family is unknown.
Length = 308
Score = 28.4 bits (63), Expect = 4.4
Identities = 14/72 (19%), Positives = 40/72 (55%), Gaps = 1/72 (1%)
Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
A R+++E+ ++ +++ + ++ +R Q +++QE+ R+L++ L+ + L + +
Sbjct: 228 ACRKQSEECYEVPIQQKRDFDINEQRR-QTFLKSQELSRKLQDDWITLEKGQYPLTMKVS 286
Query: 301 ELQLELQPIRNQ 312
EL + I +
Sbjct: 287 ELNSQQSTILMK 298
>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1. This model
describes peptide chain release factor 1 (PrfA, RF-1),
and excludes the related peptide chain release factor 2
(PrfB, RF-2). RF-1 helps recognize and terminate
translation at UAA and UAG stop codons. The
mitochondrial release factors are prfA-like, although
not included above the trusted cutoff for this model.
RF-1 does not have a translational frameshift [Protein
synthesis, Translation factors].
Length = 360
Score = 28.5 bits (64), Expect = 4.4
Identities = 18/75 (24%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 235 KLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEE 294
KL E + E+ ++E + + ++ ++ E +EM +E+L++L+ EE
Sbjct: 37 KLSKEYSQLEEIVDCYREYQQAQEDIKEAKEILEESDPEMREMA---KEELEELEEKIEE 93
Query: 295 LEARQTELQLELQPI 309
LE +L++ L P
Sbjct: 94 LE---EQLKVLLLPK 105
>gnl|CDD|222698 pfam14346, DUF4398, Domain of unknown function (DUF4398). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 127 and
269 amino acids in length.
Length = 105
Score = 27.3 bits (61), Expect = 4.5
Identities = 20/89 (22%), Positives = 38/89 (42%), Gaps = 11/89 (12%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIA------AREKAEKKHQESVERLKQLEVEMAKR 267
D + +A E A+Q ++L++A A+ ++ E RL E A+
Sbjct: 21 DAEAALERA-EAAGAEQYAPPYVELKLAREKLAQAKAALDEGKYEEARRL----AEQAEA 75
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
D L EA+ + + L++ + + E L
Sbjct: 76 DARLAEAKARAAKAQAALEEAKDSIERLR 104
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 28.4 bits (64), Expect = 4.5
Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 9/59 (15%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQE--------SVERLKQLEVEMAKRDQ 269
K + EE+ + ++ + E A +K EKK +E S E ++LE + K+
Sbjct: 263 KTREEEEEKILKAA-EEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320
Score = 28.0 bits (63), Expect = 6.8
Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 2/54 (3%)
Query: 242 AREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL--EEQLKQLQAAKE 293
RE+ E+K ++ E +Q E + K ++ E + + +L EEQ K + ++
Sbjct: 264 TREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERK 317
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 28.8 bits (64), Expect = 4.6
Identities = 21/105 (20%), Positives = 36/105 (34%), Gaps = 5/105 (4%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREK-AEKKHQESVERLKQLEVEMAKRDQDLME 273
Q+ +K + + + L L ER + LE E+ + L E
Sbjct: 178 YTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKEL----KHLRE 233
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
A + ++ L Q + A+EE +Q L+ I V
Sbjct: 234 ALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAV 278
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 28.1 bits (63), Expect = 4.6
Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 8/93 (8%)
Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
+ K EEK A++QQR + E RE+ +K ++ K+ E +R++
Sbjct: 1 KKIGAKKRAKLEEKQARRQQR---EAEEEEREERKKLEEKREGERKEEEELEEEREK--- 54
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
+E R E + +Q + +EE E ++ +E
Sbjct: 55 -KKEEEERKERE-EQARKEQEEYEKLKSSFVVE 85
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 27.9 bits (63), Expect = 4.6
Identities = 13/46 (28%), Positives = 24/46 (52%)
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRN 311
+ ++ E ++ LEE+L++L+ EEL+ R L E + R
Sbjct: 1 EEKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRK 46
>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
Length = 673
Score = 28.7 bits (64), Expect = 4.6
Identities = 55/286 (19%), Positives = 99/286 (34%), Gaps = 64/286 (22%)
Query: 53 GVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPL-QFKFRAKFYPED 111
GV I + +G L L + G +DN +S D + NPL +F++
Sbjct: 159 GVRASIINDVEGPRLILASNVSG------KDNQIK-ISVDAEYGNPLKRFEY-------- 203
Query: 112 VVEEIIQDITLRLFYLQVSRSAGSRVRFPPGPNCLLTPKIGFPWSEIRNISFNDRKFIIK 171
+ + R+ L+ +R+A +V P LTP+ E+ ++ I
Sbjct: 204 ------KTLEDRVRALEKARAAAQQVIAP------LTPEEQKVAPEL----SDEEGNAIP 247
Query: 172 PIDKKAPDFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQ 231
P D++ + + A ++ + A E Q A + + +
Sbjct: 248 PADQEVAEEIQDAAQIAQQQEATAALAALEEPISAGGATAAAAGQAAIDAAEAKAYLRPE 307
Query: 232 QR------------------DKLQLEIAAREK----------AEKKHQESVERLK----Q 259
R + +L+ AA K A +S K +
Sbjct: 308 DRIPGWTETASGTLLDSYPEPEEELDEAAIAKAPDVPGWSNTASGTLTDSYVTPKEAQAE 367
Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
+E ++A+ L A E EQ KQ+ AK E E R+ +++
Sbjct: 368 IEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKID 413
>gnl|CDD|224002 COG1076, DjlA, DnaJ-domain-containing proteins 1 [Posttranslational
modification, protein turnover, chaperones].
Length = 174
Score = 28.1 bits (63), Expect = 4.7
Identities = 11/51 (21%), Positives = 26/51 (50%), Gaps = 4/51 (7%)
Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQ----EMIRRLEEQLKQLQAAKEELE 296
+ K ++++ D A+ E I +L+E+L+++Q A E+++
Sbjct: 124 KADQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLKEKLQEIQEAYEDIK 174
>gnl|CDD|203922 pfam08377, MAP2_projctn, MAP2/Tau projection domain. This domain is
found in the MAP2/Tau family of proteins which includes
MAP2, MAP4, Tau, and their homologs. All isoforms contain
a conserved C-terminal domain containing tubulin-binding
repeats (pfam00418), and a N-terminal projection domain
of varying size. This domain has a net negative charge
and exerts a long-range repulsive force. This provides a
mechanism that can regulate microtubule spacing which
might facilitate efficient organelle transport.
Length = 1134
Score = 29.0 bits (64), Expect = 4.8
Identities = 23/97 (23%), Positives = 47/97 (48%), Gaps = 5/97 (5%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
+I +Q++ +EE+ K+ +R L+ + + EK + R+ E ++AK++
Sbjct: 1027 SILTEQLETIPKEERAEKEARRPSLE-----KHRKEKPFKTGRGRISTPERKVAKKEPST 1081
Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
+ E+ R+ K A K E++A +L L+P
Sbjct: 1082 VSRDEVRRKKAVYKKAELAKKSEVQAHSPSRKLILKP 1118
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 28.7 bits (65), Expect = 4.9
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 3/56 (5%)
Query: 219 KAQAREEKNAKQQQRD---KLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
E K +++QR K + E A + A+KK + + ++ K + E K D D
Sbjct: 405 NLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDP 460
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 28.6 bits (64), Expect = 4.9
Identities = 20/91 (21%), Positives = 44/91 (48%), Gaps = 8/91 (8%)
Query: 219 KAQAREEKNAK--QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE 276
+A + K A +++ + L E A++E ++R+ +L+ E+ + +L Q
Sbjct: 2 QAYSSHRKVAADGEEREESLLQESASKE------AYYLQRILELQAELKQLRAELSNVQA 55
Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
RL ++L+ E LE ++ L+ E++
Sbjct: 56 ENERLSSLSQELKEENEMLELQRGRLRDEIK 86
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 28.5 bits (64), Expect = 5.0
Identities = 21/104 (20%), Positives = 45/104 (43%), Gaps = 11/104 (10%)
Query: 213 IDVQQMKAQAREEKNA---KQQQRDKLQLEIA-AREKAEKKHQESVERLKQLEVEMAKRD 268
++QM + + +Q KL +++ A EK + + +L+QL
Sbjct: 216 SHLEQMHQLKKAISKSLPETKQYLKKLSQDLSKALEKISSREKHLNNQLEQLV------- 268
Query: 269 QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
Q+ EA+ + +++E+ Q EL E+ EL+ ++ +
Sbjct: 269 QEYREARRTLSQVQEKYNQASQGVSELTRELNEISEELEQVKQE 312
>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein. This family includes FlxA from
E. coli. The expression of FlxA is regulated by the FliA
sigma factor, a transcription factor specific for class
3 flagellar operons. However FlxA is not required for
flagellar function or formation.
Length = 106
Score = 27.0 bits (60), Expect = 5.0
Identities = 14/49 (28%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAK----EELEARQTELQLELQPIRNQ 312
D + Q+ I+ L +QLK+L +++ EE + +Q +Q ++Q ++ Q
Sbjct: 18 DSQIKSLQKQIQSLTKQLKELSSSEDLTAEEKQKQQQLIQQQIQMLQAQ 66
>gnl|CDD|184303 PRK13753, PRK13753, dihydropteroate synthase; Provisional.
Length = 279
Score = 28.5 bits (63), Expect = 5.1
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 109 PEDVVEEIIQDITLRLFYLQVSRSAGSRVRFPPGPNCLLTP 149
PED ++EI++ R+ L+ S A R+ PG L+P
Sbjct: 142 PEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSP 182
>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
Provisional.
Length = 593
Score = 28.8 bits (64), Expect = 5.1
Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 1/53 (1%)
Query: 251 QESVERLKQLEVEMAKRDQDLM-EAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
Q +E K++ ++++K Q + EAQE E +K+ AK +A + +L
Sbjct: 112 QAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKL 164
Score = 28.0 bits (62), Expect = 8.8
Identities = 17/95 (17%), Positives = 41/95 (43%), Gaps = 10/95 (10%)
Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
+ V Q ++++E Q + Q + ++A ++ S+++ + ++ L
Sbjct: 108 LAVWQAMIESQKEMG--IQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKLT 165
Query: 273 EAQEMIRRL--------EEQLKQLQAAKEELEARQ 299
+AQ ++ L + + QA KE EA++
Sbjct: 166 QAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKE 200
>gnl|CDD|225530 COG2983, COG2983, Uncharacterized conserved protein [Function
unknown].
Length = 153
Score = 27.8 bits (62), Expect = 5.2
Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 3/53 (5%)
Query: 106 KFYPEDVVEEIIQDITLRLFYLQVSRSAGSRVRFPPGPNCL-LTPKI--GFPW 155
K ED E + L + R RF P+C+ LTPK F W
Sbjct: 39 KLEDEDTDEIYYTRVACELLDPETCRCKDYENRFKTVPDCIKLTPKNLPEFEW 91
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 28.9 bits (65), Expect = 5.2
Identities = 25/101 (24%), Positives = 48/101 (47%), Gaps = 16/101 (15%)
Query: 223 REEKNAKQQQ----RDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
R + QQQ R+ Q E+ + + + + Q++ R +Q +R+ + +AQ+
Sbjct: 640 RRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQ-----RRRNDEKRQAQQEA 694
Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLN-KV 318
+ L + + +Q ++E +Q +QP R Q LN KV
Sbjct: 695 KALNVEEQSVQETEQEERVQQ------VQPRRKQRQLNQKV 729
>gnl|CDD|202204 pfam02321, OEP, Outer membrane efflux protein. The OEP family
(Outer membrane efflux protein) form trimeric channels
that allow export of a variety of substrates in Gram
negative bacteria. Each member of this family is
composed of two repeats. The trimeric channel is
composed of a 12 stranded all beta sheet barrel that
spans the outer membrane, and a long all helical barrel
that spans the periplasm.
Length = 186
Score = 27.9 bits (62), Expect = 5.2
Identities = 29/110 (26%), Positives = 43/110 (39%), Gaps = 18/110 (16%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
V+ KAQ + +Q R +L+LE+A +E +E KQ L A
Sbjct: 83 VKAAKAQLEAAEAELEQARRQLRLEVAQAYFNLLAAKEQLELAKQA----------LELA 132
Query: 275 QEMIRRLEEQLKQ--------LQAAKEELEARQTELQLELQPIRNQSNLN 316
+E + E + + LQA E LEAR L E ++ L
Sbjct: 133 EEALELAEARYEAGLISLLDVLQAEVELLEARLELLNAEADLELARAQLE 182
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 28.8 bits (64), Expect = 5.2
Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 4/88 (4%)
Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI----R 279
+E +++ K + EI EK K+ + E ++ E ++ K + L + +E + +
Sbjct: 322 KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKK 381
Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQ 307
E+L KEE + E + E +
Sbjct: 382 LESERLSSAAKLKEEELELKNEEEKEAK 409
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 28.9 bits (65), Expect = 5.2
Identities = 24/106 (22%), Positives = 39/106 (36%), Gaps = 4/106 (3%)
Query: 208 RKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR 267
D + ++ E N + KL+ E KK V + E+
Sbjct: 938 NNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNF 997
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
++L E + L+E KQL +EL ELQ + I ++S
Sbjct: 998 KKELAELSKQYGALQESTKQL----KELPVEVAELQSASKIISSES 1039
>gnl|CDD|234345 TIGR03755, conj_TIGR03755, integrating conjugative element protein,
PFL_4711 family. Members of this protein family are
found in genomic regions associated with conjugative
transfer and integrated TOL-like plasmids. The specific
function is unknown [Mobile and extrachromosomal element
functions, Plasmid functions].
Length = 418
Score = 28.4 bits (64), Expect = 5.4
Identities = 16/70 (22%), Positives = 34/70 (48%), Gaps = 11/70 (15%)
Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRL------EEQLKQLQAAKEELEARQTELQLELQ 307
V+RL E+ +A D +E ++RR+ E + + A++E++ +L E+
Sbjct: 313 VQRLAS-EIALA----DTLEKALLMRRMLLTGLQEPNVAANKPAQQEVDKAIDKLDREIN 367
Query: 308 PIRNQSNLNK 317
++ + L K
Sbjct: 368 NLKTELELRK 377
>gnl|CDD|224644 COG1730, GIM5, Predicted prefoldin, molecular chaperone implicated
in de novo protein folding [Posttranslational
modification, protein turnover, chaperones].
Length = 145
Score = 27.7 bits (62), Expect = 5.5
Identities = 16/48 (33%), Positives = 27/48 (56%)
Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRN 311
MA+ Q+L E ++ L+ Q++ LQA L A +ELQ ++ + N
Sbjct: 1 MAQTQQELEELAAQLQILQSQIESLQAQIAALNAAISELQTAIETLEN 48
Score = 27.3 bits (61), Expect = 6.6
Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 246 AEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
AEK E++E LK+ E+ K + L Q+ + L ++++QL+ ++L+ +Q Q
Sbjct: 88 AEKSADEAIEFLKKRIEELEKAIEKL---QQALAELAQRIEQLEQEAQQLQQKQAAAQ 142
>gnl|CDD|150931 pfam10335, DUF294_C, Putative nucleotidyltransferase substrate
binding domain. This domain is found associated with
presumed nucleotidyltransferase domains and seems to be
distantly related to other helical substrate binding
domains.
Length = 145
Score = 27.5 bits (62), Expect = 5.5
Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 11/48 (22%)
Query: 254 VERLKQLE------VEMAKRDQDLMEAQEMI--RRLEEQLKQLQAAKE 293
+ERL+ L E+A +DL+EA E + RL QL+ ++A +
Sbjct: 65 LERLRALAEAGVLSAELA---EDLLEAYEFLLRLRLRHQLEAIEAGRP 109
>gnl|CDD|184254 PRK13698, PRK13698, plasmid-partitioning protein; Provisional.
Length = 323
Score = 28.3 bits (63), Expect = 5.5
Identities = 18/65 (27%), Positives = 32/65 (49%), Gaps = 4/65 (6%)
Query: 239 EIAAR--EKAEKKHQESVERLKQLEVEMAKRDQD--LMEAQEMIRRLEEQLKQLQAAKEE 294
E++AR E +K + E LKQ + ++ + + E +E+I L LK A++
Sbjct: 215 ELSARSGEALQKAFTDKEELLKQQTSNLHEQKKAGVIFETEEVITLLTSVLKTSSASRTS 274
Query: 295 LEARQ 299
L +R
Sbjct: 275 LSSRH 279
>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein. This family of proteins
is found in bacteria. Proteins in this family are
typically between 397 and 1543 amino acids in length.
This family is the SidE protein in the Dot/Icm pathway of
Legionella pneumophila bacteria. There is little
literature describing the family.
Length = 1443
Score = 28.7 bits (64), Expect = 5.5
Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 217 QMKAQA--REEKNAKQ--QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
QMK + +EK +Q + KL+ + + AEK +E + +LK L + K +++
Sbjct: 1271 QMKPKTFEMQEKEIQQNFELLAKLEKTLDKSDTAEKL-REDIPKLKDLLIAKQKAYPEMV 1329
Query: 273 EAQEMIRRLEEQLKQL-QAAKEELEARQTELQLEL 306
+ Q L QL++L QA ++L +T EL
Sbjct: 1330 QLQYKSEALITQLRELCQAHHDDLAKTRTARLQEL 1364
>gnl|CDD|151277 pfam10828, DUF2570, Protein of unknown function (DUF2570). This is
a family of proteins with unknown function.
Length = 110
Score = 27.1 bits (60), Expect = 5.6
Identities = 12/59 (20%), Positives = 29/59 (49%)
Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
+++ L E + Q + + Q+ +RL + L+Q + A EE + E++ + + +
Sbjct: 24 SQKIDSLRAENKAQAQTIEQQQKANQRLTDALEQERQAVEEQQKIANEIERKAEENAEE 82
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 28.1 bits (63), Expect = 5.6
Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 4/77 (5%)
Query: 211 DTIDVQQMK-AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ 269
D I Q K ++ ++EK Q + + L +I E+ + K E + + Q + E+ K +
Sbjct: 31 DKIQNQDSKLSELQKEKKNIQNEIESLDNQI---EEIQSKIDELQKEIDQSKAEIKKLQK 87
Query: 270 DLMEAQEMIRRLEEQLK 286
++ E +E I +E LK
Sbjct: 88 EIAELKENIVERQELLK 104
Score = 27.8 bits (62), Expect = 8.1
Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 5/75 (6%)
Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERL-KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
+ +++ +K +K Q +E L Q+E +K D E Q+ I + + ++K+L
Sbjct: 30 SDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKID----ELQKEIDQSKAEIKKL 85
Query: 289 QAAKEELEARQTELQ 303
Q EL+ E Q
Sbjct: 86 QKEIAELKENIVERQ 100
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 28.1 bits (63), Expect = 5.6
Identities = 20/99 (20%), Positives = 46/99 (46%), Gaps = 9/99 (9%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL-EVEMAK-----RD 268
V+ + +A+EE ++ + + + + E++ +E E L++L E A+ +
Sbjct: 72 VEAAEEEAKEE--LLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYE 129
Query: 269 QDLMEAQEMIRRLEEQLKQ-LQAAKEELEARQTELQLEL 306
+ E E +R L EQL + ++ ++ E + E+
Sbjct: 130 KGREEGLEEVRELIEQLNKIIEKLIKKREEILESSEEEI 168
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 28.5 bits (64), Expect = 5.8
Identities = 17/88 (19%), Positives = 38/88 (43%), Gaps = 1/88 (1%)
Query: 209 KPDTIDVQQMKAQAREEKNA-KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR 267
D + ++++ + +E+ + + + L+ EIA +ES E E +
Sbjct: 26 DEDPLSLEELIERINQERGSFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDS 85
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
+++ +E + +E L+QL A E
Sbjct: 86 EENKDTVEEFPKARDEVLEQLNQALNES 113
>gnl|CDD|119241 pfam10721, DUF2514, Protein of unknown function (DUF2514). This
family is conserved in bacteria and some viruses. The
function is not known.
Length = 162
Score = 27.8 bits (62), Expect = 5.8
Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 10/82 (12%)
Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
+ A A E A+ R + Q AA+ +A K QE E A D +A
Sbjct: 36 DRDAADALAEVIAETAARAEEQRRQAAQNEAAKDAQE----------EAAAARADAADAD 85
Query: 276 EMIRRLEEQLKQLQAAKEELEA 297
RL ++ +L AAK +
Sbjct: 86 AAAGRLRQEADKLAAAKSCPTS 107
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 27.8 bits (62), Expect = 6.1
Identities = 17/92 (18%), Positives = 39/92 (42%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
V ++ ++ +EE Q K E ++ + Q + + + + K + D
Sbjct: 64 RVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMVEDLFDQVKDEVDEKN 123
Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
+I L++ +L+ ++EL + EL+ E
Sbjct: 124 GAALIEELQKHRDKLKKEQKELLKKLDELEKE 155
>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase. BRE1 is an E3 ubiquitin
ligase that has been shown to act as a transcriptional
activator through direct activator interactions.
Length = 96
Score = 26.9 bits (60), Expect = 6.1
Identities = 20/83 (24%), Positives = 33/83 (39%), Gaps = 15/83 (18%)
Query: 236 LQLEIAAREKAEKKHQESV-----------ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQ 284
LQ E+ + EKA K+ ++ +L E AK DQ A L+ +
Sbjct: 1 LQKELVSLEKAFKELSSLALKKVAELTILEQKKLRLTAEKAKADQKYFAAMRSKDALDAE 60
Query: 285 LKQLQAAKEELEARQTELQLELQ 307
K+L + +EL +L+
Sbjct: 61 NKKLNT----QLNKSSELIEQLK 79
>gnl|CDD|181417 PRK08416, PRK08416, 7-alpha-hydroxysteroid dehydrogenase;
Provisional.
Length = 260
Score = 28.2 bits (63), Expect = 6.2
Identities = 11/43 (25%), Positives = 24/43 (55%), Gaps = 7/43 (16%)
Query: 75 GLNI---YEKEDNLTNVMSQDVKKENPLQFKFRAKFYPEDVVE 114
G+NI Y N +++D++++ + +AK YP +++E
Sbjct: 32 GVNIAFTYNSNVEEANKIAEDLEQK----YGIKAKAYPLNILE 70
>gnl|CDD|221120 pfam11488, Lge1, Transcriptional regulatory protein LGE1. This
family of proteins is conserved from fungi to human. In
yeasts it is involved in the ubiquitination of histones
H2A and H2B. This ubiquitination step is a vital one in
the regulation of the transcriptional activity of RNA
polymerase II. In S. cerevisiae, Rad6 and Bre1 are
present in a complex, also containing Lge1, that is
required for H2B ubiquitination. Bre1 is the H2B
ubiquitin ligase that interacts with acidic activators,
such as Gal4, and recruits Rad6 and its binding partner
Lge1 to target promoters. In S. pombe the equivalent
protein to Lge1 appears to be Shf1.
Length = 75
Score = 26.1 bits (58), Expect = 6.2
Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL--QAAKEELEARQTELQLE 305
E+L+ E+ DQ L E + +LE +L L Q ++ L + T+ +L+
Sbjct: 17 QEKLRSRYKELESLDQKLEELKLQKLKLELELALLETQVERDALNVQLTQEKLD 70
>gnl|CDD|221198 pfam11740, KfrA_N, Plasmid replication region DNA-binding N-term.
The broad host-range plasmid RK2 is able to replicate in
and be inherited in a stable manner in diverse
Gram-negative bacterial species. It encodes a number of
co-ordinately regulated operons including a central
control korF1 operon that represses the kfrA operon. The
KfrA polypeptide is a site-specific DNA-binding protein
whose operator overlaps the kfrA promoter. The
N-terminus, containing an helix-turn-helix motif, is
essential for function. Downstream from this family is
an extended coiled-coil domain containing a heptad
repeat segment which is probably responsible for
formation of multimers, and may provide an example of a
bridge to host structures required for plasmid
partitioning.
Length = 120
Score = 27.3 bits (61), Expect = 6.2
Identities = 13/53 (24%), Positives = 18/53 (33%), Gaps = 2/53 (3%)
Query: 251 QESVERLKQLEVEMAKR--DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
E V L + E A+ A+ LE +L + A E LE
Sbjct: 68 AELVAALWEAAQEEAEERLAAARAAAEAERAELEAELAEAAAEAEALEEELEA 120
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 27.8 bits (62), Expect = 6.4
Identities = 19/70 (27%), Positives = 30/70 (42%), Gaps = 8/70 (11%)
Query: 248 KKHQ-ESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ-------AAKEELEARQ 299
K Q + +E K LE E+A+++ E I LE ++ +LQ KE +
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161
Query: 300 TELQLELQPI 309
L+ E I
Sbjct: 162 ARLKSEASKI 171
>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
Length = 871
Score = 28.3 bits (64), Expect = 6.4
Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 4/39 (10%)
Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
R++A +K +E + RL+QL +Q E + MI RL
Sbjct: 628 RQEALEKEEELLSRLQQLP----DGEQKAKETKRMISRL 662
>gnl|CDD|234082 TIGR02997, Sig70-cyanoRpoD, RNA polymerase sigma factor,
cyanobacterial RpoD-like family. This family includes a
number of closely related sigma-70 (TIGR02937) factors
in the cyanobacteria. All appear most closely related to
the essential sigma-70 factor RpoD, and some score above
trusted to the RpoD C-terminal domain model (TIGR02393).
Length = 298
Score = 28.0 bits (63), Expect = 6.5
Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ-TELQLE 305
E+E+A++ Q +M +E+ LEEQL + + +E A +E +L
Sbjct: 21 EIELARQVQQMMVLEELREELEEQLGREPSKEEWAAAAGLSEAELR 66
>gnl|CDD|190336 pfam02520, DUF148, Domain of unknown function DUF148. This domain
has no known function nor do any of the proteins that
possess it. In one member of this family the aligned
region is repeated twice.
Length = 114
Score = 26.8 bits (60), Expect = 6.8
Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 5/81 (6%)
Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERL----KQLEVEMAK 266
TI + +A EK Q ++ + + A +AEK + L +L +
Sbjct: 17 TIAEIDAQLEAWAEKYGVADQYEEFKKNVKALKAEAEKNVTAVISNLSSVDAKLSAILDN 76
Query: 267 RDQDLMEAQEMIRRLEEQLKQ 287
+D + +E I L++QL Q
Sbjct: 77 KDLTRRQKKEAIDALKKQLPQ 97
>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
(trigger factor) [Posttranslational modification,
protein turnover, chaperones].
Length = 441
Score = 28.0 bits (63), Expect = 7.0
Identities = 13/84 (15%), Positives = 34/84 (40%), Gaps = 1/84 (1%)
Query: 215 VQQMKAQAREE-KNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
++++K + R+ + ++ + + E E + E L + E++ +
Sbjct: 262 LEELKEKLRKNLERELKEATLEKRKEQLLDALVEANDFDLPESLVEAEIDNLLKQALQQL 321
Query: 274 AQEMIRRLEEQLKQLQAAKEELEA 297
Q+ I LE + + +EE +
Sbjct: 322 QQQGIDSLEASGESEEELREEFKE 345
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 27.8 bits (62), Expect = 7.1
Identities = 21/87 (24%), Positives = 38/87 (43%), Gaps = 8/87 (9%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL--EVEMAKRDQDLM 272
+Q QA +++ K ++ + A+ + S E K+ E E+ K
Sbjct: 138 CRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKL----- 192
Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQ 299
Q+ +R EE+ KQ + +EE E +Q
Sbjct: 193 -KQQQQKREEERRKQRKKQQEEEERKQ 218
>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1. MAT1
is an assembly/targeting factor for cyclin-dependent
kinase-activating kinase (CAK), which interacts with the
transcription factor TFIIH. The domain found to the
N-terminal side of this domain is a C3HC4 RING finger.
Length = 200
Score = 27.8 bits (62), Expect = 7.1
Identities = 22/101 (21%), Positives = 51/101 (50%), Gaps = 2/101 (1%)
Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
N + MR ++ T + ++++ EEK K+++R LQ E +E+ K ++ E + +
Sbjct: 77 NKDSIMRNKRRLTREQEELEQALEEEKEMKEEKRLHLQKE--EQEQKMAKEKDKQEIIDE 134
Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
LE + + + ++ ++LE Q+++L+ K +
Sbjct: 135 LETSNLPANVIIAQHKKQSKQLESQVEKLERKKRVTFSTGI 175
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 27.7 bits (62), Expect = 7.1
Identities = 21/61 (34%), Positives = 23/61 (37%), Gaps = 23/61 (37%)
Query: 255 ERLKQLEVEMAKRDQDLMEAQEMIRR------------------LEEQLKQLQAAKEELE 296
RLKQ E A RD EA+ +I E LKQ A EELE
Sbjct: 14 SRLKQ--AEGAPRDA---EAEALIAEALRRQPDAPYYLVQTILVQEAALKQANARIEELE 68
Query: 297 A 297
A
Sbjct: 69 A 69
>gnl|CDD|206361 pfam14193, DUF4315, Domain of unknown function (DUF4315). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
Length = 81
Score = 26.1 bits (58), Expect = 7.1
Identities = 18/68 (26%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
+K++ EI EK ++K E RLK+L E K + + +E +++R ++ ++L A +
Sbjct: 4 EKIEAEI---EKTKEKIAEQQARLKEL--EKQKTELENLEIVQIVRSMDMTPEELAAFLK 58
Query: 294 ELEARQTE 301
+++ Q
Sbjct: 59 AMKSGQAA 66
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 27.4 bits (61), Expect = 7.2
Identities = 9/47 (19%), Positives = 20/47 (42%)
Query: 252 ESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
++ ++L+E + + + E+LK L+ K EL+
Sbjct: 60 FCANIADAAADQLYALQKELLEKLKDLAEVNERLKALEKKKAELKDL 106
>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex. Med21 has been
known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
Drosophila. The heterodimer of the two subunits Med7 and
Med21 appears to act as a hinge between the middle and
the tail regions of Mediator.
Length = 132
Score = 26.8 bits (60), Expect = 7.4
Identities = 13/46 (28%), Positives = 28/46 (60%), Gaps = 3/46 (6%)
Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
+E + R+K+LE E+ + + + EA ++ E+ LK+++A E+
Sbjct: 88 EEQLRRIKELEEELREVEAEREEA---VKEKEKLLKKVEALLREIA 130
>gnl|CDD|183703 PRK12724, PRK12724, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 432
Score = 28.0 bits (62), Expect = 7.4
Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 4/61 (6%)
Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
A+REK E+K Q+ E LKQ +R + L + + RLEE+ ++A E+E+ +
Sbjct: 61 ASREKIERKLQDLKELLKQKSYTEPERKKTLQTLKPLSERLEEK----ESAIYEIESFEL 116
Query: 301 E 301
E
Sbjct: 117 E 117
>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203).
This domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 120
Score = 26.9 bits (60), Expect = 7.5
Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 3/51 (5%)
Query: 270 DLMEAQEM---IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
L EA E+ +RRL E++++ +A EELEA +E + L+ + L +
Sbjct: 4 TLEEANELLPELRRLLEEVREAKAELEELEAALSEGERSLEARGLKEELRE 54
>gnl|CDD|132697 TIGR03658, IsdH_HarA, haptoglobin-binding heme uptake protein HarA.
HarA is a heme-binding NEAT-domain (NEAr Transporter,
pfam05031) protein which has been shown to bind to the
haptoglobin-hemoglobin complex in order to extract heme
from it. HarA has also been reported to bind hemoglobin
directly. HarA (also known as IsdH) contains three NEAT
domains as well as a sortase A C-terminal signal for
localization to the cell wall. The heme bound at the
third of these NEAT domains has been shown to be
transferred to the IsdA protein also localized at the
cell wall, presumably through an additional specific
protein-protein interaction. Haptoglobin is a hemoglobin
carrier protein involved in scavenging hemoglobin in the
blood following red blood cell lysis and targetting it
to the liver.
Length = 895
Score = 28.3 bits (62), Expect = 7.5
Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 7/83 (8%)
Query: 211 DTIDVQQMKAQAREEKNAKQQ--QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRD 268
+T ++Q++ A + K ++Q + +KLQ ++ + KAE K ++L Q VE+A +
Sbjct: 473 ETYNLQKLLAPYHKAKTLERQVYELEKLQEKLPEKYKAEYK-----KKLDQTRVELADQV 527
Query: 269 QDLMEAQEMIRRLEEQLKQLQAA 291
+ + E + +QL LQ A
Sbjct: 528 KSAVTEFENVTPTNDQLTDLQEA 550
>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
ribosomal structure and biogenesis].
Length = 363
Score = 27.9 bits (63), Expect = 7.5
Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 21/109 (19%)
Query: 201 HELYMRRRKPDTI-DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
EL P+ I D + + ++E +L+ I + + KK QE +E K+
Sbjct: 17 EELEALLSDPEVISDPDEYRKLSKEYA--------ELE-PIVEKYREYKKAQEDLEDAKE 67
Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
+ E ++D ++ +EM EE++K+L+A EELE EL++ L P
Sbjct: 68 MLAE--EKDPEM---REMA---EEEIKELEAKIEELE---EELKILLLP 105
>gnl|CDD|233242 TIGR01035, hemA, glutamyl-tRNA reductase. This enzyme, together
with glutamate-1-semialdehyde-2,1-aminomutase
(TIGR00713), leads to the production of
delta-amino-levulinic acid from Glu-tRNA [Biosynthesis
of cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 417
Score = 28.1 bits (63), Expect = 7.6
Identities = 15/79 (18%), Positives = 32/79 (40%), Gaps = 5/79 (6%)
Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQL-----KQL 288
D LQ + ++ E E + + E K+ +E + I+ L K+L
Sbjct: 298 DDLQPVVEENLAERREEAEKAEEIVEEETAEFKQWLRSLEVEPTIKALRSLAEIVREKEL 357
Query: 289 QAAKEELEARQTELQLELQ 307
+ A ++L +++ L+
Sbjct: 358 EKALKKLPGLSKDVEEVLE 376
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.8 bits (62), Expect = 7.7
Identities = 19/103 (18%), Positives = 47/103 (45%), Gaps = 2/103 (1%)
Query: 190 NKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKK 249
K+I N ++ + ++ T + ++++ EK ++Q+R L L+ E+ K
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRR--LLLQKEEEEQQMNK 176
Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAK 292
+ L +LE + + + ++ +LE Q+++ + K
Sbjct: 177 RKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEK 219
>gnl|CDD|215487 PLN02900, PLN02900, alanyl-tRNA synthetase.
Length = 936
Score = 28.1 bits (63), Expect = 7.8
Identities = 16/75 (21%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Query: 231 QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
D L+ E+ + K E E ++ L+ + + +E+ R+ K+L+A
Sbjct: 744 NAADSLERELDSALKVEGSDLEK--KVASLKSRVDAAVIPAAKKEEIRARVSALQKELRA 801
Query: 291 AKEELEARQTELQLE 305
A++E A + +L +
Sbjct: 802 AQKEAAALRAKLAVA 816
>gnl|CDD|222742 pfam14415, DUF4424, Domain of unknown function (DUF4424). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 310 and 361 amino
acids in length.
Length = 248
Score = 27.8 bits (62), Expect = 7.9
Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 1/24 (4%)
Query: 89 MSQDVKKENPLQFKFRAK-FYPED 111
++KK NP F+F K F P+
Sbjct: 224 FDGNLKKINPTTFEFTKKNFTPKR 247
>gnl|CDD|218152 pfam04568, IATP, Mitochondrial ATPase inhibitor, IATP. ATP
synthase inhibitor prevents the enzyme from switching to
ATP hydrolysis during collapse of the electrochemical
gradient, for example during oxygen deprivation ATP
synthase inhibitor forms a one to one complex with the
F1 ATPase, possibly by binding at the alpha-beta
interface. It is thought to inhibit ATP synthesis by
preventing the release of ATP. The minimum inhibitory
region for bovine inhibitor is from residues 39 to 72.
The inhibitor has two oligomeric states, dimer (the
active state) and tetramer. At low pH, the inhibitor
forms a dimer via antiparallel coiled coil interactions
between the C terminal regions of two monomers. At high
pH, the inhibitor forms tetramers and higher oligomers
by coiled coil interactions involving the N terminus and
inhibitory region, thus preventing the inhibitory
activity.
Length = 90
Score = 26.2 bits (58), Expect = 7.9
Identities = 12/41 (29%), Positives = 26/41 (63%), Gaps = 4/41 (9%)
Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL 257
+ A+EE+ +Q+++++L A +EK E +H++ +E L
Sbjct: 49 GKREAAQEEEYFRQKEKEQLA---ALKEKLE-EHKKEIEEL 85
>gnl|CDD|218186 pfam04636, PA26, PA26 p53-induced protein (sestrin). PA26 is a
p53-inducible protein. Its function is unknown. It has
similarity to pfam04636 in its N-terminus.
Length = 450
Score = 27.8 bits (62), Expect = 8.0
Identities = 12/50 (24%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
K + E + ++ E++K+LQ +++ EA Q E+ + + +S L
Sbjct: 221 KVQRPETEVEALM----ERMKKLQEERDDDEASQEEMTTRFEKEKKESLL 266
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 28.0 bits (63), Expect = 8.2
Identities = 19/96 (19%), Positives = 42/96 (43%), Gaps = 12/96 (12%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAA-REKAEKKHQESVER-LKQLEVEMAKRDQDL 271
D +Q +AQ K +++++ A +++ KK ++R L+QL + + ++
Sbjct: 525 DARQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAEI 584
Query: 272 MEAQEMI----------RRLEEQLKQLQAAKEELEA 297
Q + + ++ L L A++ELE
Sbjct: 585 EALQAQVADADFFSQPHEQTQKVLADLADAEQELEV 620
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 28.1 bits (62), Expect = 8.3
Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 7/86 (8%)
Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKA----EKKHQESVERLKQLEVEMAKRDQDLMEA 274
+ +E + A++ +++L A +A K + + L+ LE A + L A
Sbjct: 136 EGGQQELEEAQRLHQEQLSSLTQAHLEALSSLRSKAEGLEKSLQSLETRRAGEAKALAAA 195
Query: 275 QEMIRRLEEQLKQLQAAKEELEARQT 300
Q L EQL + Q EELEA+ T
Sbjct: 196 QAEADTLREQLSKTQ---EELEAQVT 218
>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
Length = 1213
Score = 28.0 bits (62), Expect = 8.7
Identities = 18/93 (19%), Positives = 40/93 (43%), Gaps = 4/93 (4%)
Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
+ ++ Q + K+ + K + A EK +K +++ + +K++ + E
Sbjct: 38 WKMLEKQQKLTKDGLSASKGKYEGLSEAMEKQKKAYEDLKQEVKEVNRATQASKKAYQEY 97
Query: 275 QEMIRRLEEQLK----QLQAAKEELEARQTELQ 303
+ E +L+ Q A E+L +Q E+Q
Sbjct: 98 NAQYTQAENKLRSLSGQFGVASEQLMLQQKEIQ 130
>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
Provisional.
Length = 638
Score = 27.8 bits (62), Expect = 8.8
Identities = 17/68 (25%), Positives = 35/68 (51%), Gaps = 4/68 (5%)
Query: 231 QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
Q+ + Q + A +E Q ++ K+ E E+ + Q L ++ I RLE+++++L A
Sbjct: 522 VQKQENQTDEAPKENNANSAQARKDQ-KRREAELRTQTQPL---RKEIARLEKEMEKLNA 577
Query: 291 AKEELEAR 298
+ E +
Sbjct: 578 QLAQAEEK 585
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 27.4 bits (60), Expect = 8.8
Identities = 16/76 (21%), Positives = 37/76 (48%)
Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
R+ E++ QE ER +Q E + +R++ A E R EE+ ++ + + + + +
Sbjct: 33 RQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKR 92
Query: 303 QLELQPIRNQSNLNKV 318
+ E + + Q ++
Sbjct: 93 KAEEEEKQEQEEQERI 108
>gnl|CDD|184056 PRK13446, atpC, F0F1 ATP synthase subunit epsilon; Provisional.
Length = 136
Score = 26.8 bits (60), Expect = 8.9
Identities = 9/39 (23%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 252 ESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
E+ ER ++++VE A+ + A++ +++L + +A
Sbjct: 81 ETAERAEEIDVERARAALE--RAEQRLKKLTPEDDSARA 117
>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
Length = 697
Score = 27.8 bits (63), Expect = 9.0
Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 12/64 (18%)
Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERLKQ---LEVEMAKR---------DQDLMEAQE 276
++++ +L E A E+A + +ES + + LE+E+ +L++
Sbjct: 329 REKKAAELAAEEAEEEEAAEPEEESWDDVLPVDPLELEVGYGLIPLVDENQGGELLDRIR 388
Query: 277 MIRR 280
IR+
Sbjct: 389 SIRK 392
>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein.
Length = 304
Score = 27.5 bits (61), Expect = 9.2
Identities = 20/94 (21%), Positives = 29/94 (30%), Gaps = 9/94 (9%)
Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA-----QEM 277
R + Q Q AAR + S + ++E A+ + A
Sbjct: 67 RLQALVAQAAIQSEQEADAARAALREALAASRGAASKAQLEAARDNLRSALALLPISLGR 126
Query: 278 IRR----LEEQLKQLQAAKEELEARQTELQLELQ 307
I L L LQA A+ E L+L
Sbjct: 127 IGASQATLGALLNNLQAQGLAARAQLDEAYLQLA 160
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 26.4 bits (59), Expect = 9.2
Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 12/90 (13%)
Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE----------MAKRDQDLMEAQ--EM 277
QQQ L +I E K+ + +E L+ L+ + + K+D++ ++ + E
Sbjct: 11 QQQLQLLAQQIKQLETQLKEIELVLEELELLDEDTKVYKLIGDVLVKQDKEEVKEELEER 70
Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQ 307
LE+++K L+ E+LE EL+ EL
Sbjct: 71 KETLEKEIKTLEKQLEKLEKELEELKEELY 100
>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO. This family
contains the bacterial type III secretion protein YscO,
which is approximately 150 residues long. YscO has been
shown to be required for high-level expression and
secretion of the anti-host proteins V antigen and Yops
in Yersinia pestis.
Length = 152
Score = 27.0 bits (60), Expect = 9.3
Identities = 10/54 (18%), Positives = 28/54 (51%)
Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
+A+++ + +RL+Q ++ + + + E+ R+ + + + + EELE
Sbjct: 91 AEAKQRLEAERQRLRQARQQLQEARKAQEKFAELARQEQAEAQAQRQYLEELEQ 144
>gnl|CDD|184048 PRK13428, PRK13428, F0F1 ATP synthase subunit delta; Provisional.
Length = 445
Score = 27.8 bits (62), Expect = 9.4
Identities = 18/74 (24%), Positives = 31/74 (41%), Gaps = 7/74 (9%)
Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQE--SVERLKQLEVEMAKRDQDL 271
+ Q +A E+ A+ + + E A R + + Q ER+K R L
Sbjct: 53 EADQAHTKAVEDAKAEAARVVEEAREDAERIAEQLRAQADAEAERIK----VQGARQVQL 108
Query: 272 MEAQEMIRRLEEQL 285
+ AQ + R+L +L
Sbjct: 109 LRAQ-LTRQLRLEL 121
>gnl|CDD|213755 TIGR02977, phageshock_pspA, phage shock protein A. Members of this
family are the phage shock protein PspA, from the phage
shock operon. This is a narrower family than the set of
PspA and its homologs, sometimes several in a genome, as
described by pfam04012. PspA appears to maintain the
protonmotive force under stress conditions that include
overexpression of certain phage secretins, heat shock,
ethanol, and protein export defects [Cellular processes,
Adaptations to atypical conditions].
Length = 219
Score = 27.2 bits (61), Expect = 9.6
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 14/100 (14%)
Query: 208 RKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR 267
R+ ++ Q Q + E A + R+ L AR +K + E + LE E+A
Sbjct: 59 RRVSRLEAQVADWQDKAEL-ALSKGREDL-----ARAALIEKQKAQ-ELAEALERELA-- 109
Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
+E + +L+E + +LQA E ARQ L + Q
Sbjct: 110 -----AVEETLAKLQEDIAKLQAKLAEARARQKALAIRHQ 144
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 27.6 bits (62), Expect = 9.8
Identities = 15/62 (24%), Positives = 32/62 (51%)
Query: 238 LEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
+E E A + QE + LK++ + + L + + ++ L+E K+L+ K++L A
Sbjct: 698 IEAVTGEAALEYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAA 757
Query: 298 RQ 299
+
Sbjct: 758 AE 759
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.133 0.377
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,440,727
Number of extensions: 1650056
Number of successful extensions: 7147
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5573
Number of HSP's successfully gapped: 1423
Length of query: 318
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 221
Effective length of database: 6,635,264
Effective search space: 1466393344
Effective search space used: 1466393344
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.6 bits)