RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3725
(760 letters)
>gnl|CDD|241276 cd01243, PH_MRCK, MRCK (myotonic dystrophy-related Cdc42-binding
kinase) pleckstrin homology (PH) domain. MRCK is
thought to be coincidence detector of signaling by Cdc42
and phosphoinositides. It has been shown to promote
cytoskeletal reorganization, which affects many
biological processes. There are 2 members of this
family: MRCKalpha and MRCKbeta. MRCK consists of a
serine/threonine kinase domain, a cysteine rich (C1)
region, a PH domain and a p21 binding motif. The MRCK PH
domain is responsible for its targeting to cell to cell
junctions. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 136
Score = 248 bits (636), Expect = 7e-79
Identities = 100/136 (73%), Positives = 111/136 (81%), Gaps = 1/136 (0%)
Query: 270 PLGIDPTRGIGTAYEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVS 329
PLGIDPTRGIGTAYEGYVKVPK GGVKKGW RQF VVCDFKLFLYDIS D+N P+V S
Sbjct: 1 PLGIDPTRGIGTAYEGYVKVPKPGGVKKGWQRQFAVVCDFKLFLYDISEDKNPQPSVVAS 60
Query: 330 LVLDMRDEDFAVSGVRESDVIHATKKDIPCIFRITTSLMDPPG-TKNHTLMLADSDTEKT 388
VLDMRDE+F+VS V ESDVIHA KKDIPCIFR+T S + P K+ L+LADS+ EK
Sbjct: 61 QVLDMRDEEFSVSSVLESDVIHANKKDIPCIFRVTASQLHAPLLLKSSQLLLADSEAEKQ 120
Query: 389 KWVVALSELHRILKRN 404
KWV AL ELHR+LK+N
Sbjct: 121 KWVGALKELHRLLKKN 136
>gnl|CDD|216115 pfam00780, CNH, CNH domain. Domain found in NIK1-like kinase,
mouse citron and yeast ROM1, ROM2. Unpublished
observations.
Length = 266
Score = 170 bits (432), Expect = 6e-48
Identities = 65/242 (26%), Positives = 105/242 (43%), Gaps = 17/242 (7%)
Query: 457 KIYQMDYIPEEQLLVVLAGKQRYVRLVPVRALDGDEVE----WVKIPETKGCLSFTTGPL 512
+ Q+ + E LL+ L+ KQ + + P+ ALD E+ K+PETKGC F
Sbjct: 36 SVTQLAVLEEFNLLLSLSDKQ--LYVYPLSALDSKELSSKSAPQKLPETKGCSFFARAGY 93
Query: 513 THTRTQHCLALAVKRQNSSQIILYEITRTKTRHKRL-HEVILPTLAQCIHIFSEGRLCVG 571
R L +AVKR I LYE + +L E +P I +LCVG
Sbjct: 94 CGGR---FLVVAVKRT----IKLYEWYEPLKKFHKLFKEFYVPAEVFSIAFLKP-KLCVG 145
Query: 572 YQSGFSIYKFSQDNRPIPLIHQDNPLVSLLTY-SPVDALLAIELPRGEFLLVFHSLAAYV 630
GF + L+ + + + L + L EFLL + A YV
Sbjct: 146 CAKGFELVDLDTGA-TQSLLDPADQSLDFAARKENLKPLAVVRLSDDEFLLCYDEFAVYV 204
Query: 631 DSQGHKSREKEIMYPALPTGASYMDGQLLIFSETHVDVFNAESGDWLQTVNIRRALPLDT 690
+ QG +SR++E+ + P +Y+ LL F +++ + E+G+ +QT+ + L +
Sbjct: 205 NLQGRRSRDQELEWEGAPESVAYLYPYLLAFHPNFIEIRSVETGELVQTIPGPKIRFLHS 264
Query: 691 RG 692
Sbjct: 265 GS 266
>gnl|CDD|215738 pfam00130, C1_1, Phorbol esters/diacylglycerol binding domain (C1
domain). This domain is also known as the Protein
kinase C conserved region 1 (C1) domain.
Length = 53
Score = 73.3 bits (180), Expect = 2e-16
Identities = 24/51 (47%), Positives = 29/51 (56%)
Query: 211 HQFITRTFTSPTKCNHCTSLMVGLTRQGVVCDICGFACHLSCCDKVPPSCP 261
H F+ RTF SPT C+HC + GL +QG+ C CG H C VPP C
Sbjct: 1 HHFVHRTFKSPTFCDHCGEFLWGLGKQGLKCSWCGLNVHKRCHSLVPPECG 51
>gnl|CDD|214481 smart00036, CNH, Domain found in NIK1-like kinases, mouse citron
and yeast ROM1, ROM2.
Length = 302
Score = 80.5 bits (199), Expect = 2e-16
Identities = 50/250 (20%), Positives = 102/250 (40%), Gaps = 33/250 (13%)
Query: 458 IYQMDYIPEEQLLVVLAGKQRYVRLVPVRALDGDEVE------------WVKIPETKGCL 505
+ Q+ + E +L++++GK+ + P+ AL + KIP+ KGC
Sbjct: 45 VTQIWVLEENNVLLMISGKKPQLYSHPLSALVEKKEALGSARLVIRKNVLTKIPDVKGCH 104
Query: 506 SFTTGPLTHTRTQHCLALAVKRQNSSQIILYEITRTKTRHKRLHEVILPTLAQCIHIFSE 565
+ L L V Q+S ++L + + K L L + +F E
Sbjct: 105 LCAVV-----NGKRSLFLCVALQSS--VVLLQWYNPLKKFKLFKSKFLFPLISPVPVFVE 157
Query: 566 --------GRLCVG-YQSGFSIYKFSQDNRPIPLIHQDNPLVSLLTYSPVDALLAIELPR 616
+C+G + G + +F + + +D L L + + + +++PR
Sbjct: 158 LVSSSFERPGICIGSDKGGGDVVQFHESL----VSKEDLSLPFLSEETSLKPISVVQVPR 213
Query: 617 GEFLLVFHSLAAYVDSQGH-KSREKEIMYPALPTGASYMDGQLLIFSETHVDVFNAESGD 675
E LL + +V+ G +SR + + +P +Y LL F + +++ + ++G+
Sbjct: 214 DEVLLCYDEFGVFVNLYGKRRSRNPILHWEFMPESFAYHSPYLLAFHDNGIEIRSIKTGE 273
Query: 676 WLQTVNIRRA 685
LQ + R
Sbjct: 274 LLQELADRET 283
>gnl|CDD|237996 cd00029, C1, Protein kinase C conserved region 1 (C1) .
Cysteine-rich zinc binding domain. Some members of this
domain family bind phorbol esters and diacylglycerol,
some are reported to bind RasGTP. May occur in tandem
arrangement. Diacylglycerol (DAG) is a second messenger,
released by activation of Phospholipase D. Phorbol
Esters (PE) can act as analogues of DAG and mimic its
downstream effects in, for example, tumor promotion.
Protein Kinases C are activated by DAG/PE, this
activation is mediated by their N-terminal conserved
region (C1). DAG/PE binding may be phospholipid
dependent. C1 domains may also mediate DAG/PE signals in
chimaerins (a family of Rac GTPase activating proteins),
RasGRPs (exchange factors for Ras/Rap1), and Munc13
isoforms (scaffolding proteins involved in exocytosis).
Length = 50
Score = 72.5 bits (178), Expect = 4e-16
Identities = 23/50 (46%), Positives = 31/50 (62%)
Query: 211 HQFITRTFTSPTKCNHCTSLMVGLTRQGVVCDICGFACHLSCCDKVPPSC 260
H+F+ ++F PT C+ C + GL +QG+ C C CH C DKVPPSC
Sbjct: 1 HRFVRKSFFKPTFCDVCRKSIWGLFKQGLRCSWCKVKCHKKCADKVPPSC 50
>gnl|CDD|197519 smart00109, C1, Protein kinase C conserved region 1 (C1) domains
(Cysteine-rich domains). Some bind phorbol esters and
diacylglycerol. Some bind RasGTP. Zinc-binding domains.
Length = 50
Score = 65.6 bits (160), Expect = 1e-13
Identities = 22/50 (44%), Positives = 28/50 (56%)
Query: 211 HQFITRTFTSPTKCNHCTSLMVGLTRQGVVCDICGFACHLSCCDKVPPSC 260
H+ + RTFT PT C C + G +QG+ C C CH C DKVP +C
Sbjct: 1 HKHVFRTFTKPTFCCVCRKSIWGSFKQGLRCSECKVKCHKKCADKVPKAC 50
>gnl|CDD|241275 cd01242, PH_ROCK, Rho-associated coiled-coil containing protein
kinase pleckstrin homology (PH) domain. ROCK is a
serine/threonine kinase that binds GTP-Rho. It consists
of a kinase domain, a coiled coil region and a PH
domain. The ROCK PH domain is interrupted by a C1
domain. ROCK plays a role in cellular functions, such as
contraction, adhesion, migration, and proliferation and
in the regulation of apoptosis. There are two ROCK
isoforms, ROCK1 and ROCK2. In ROCK2 the Rho Binding
Domain (RBD) and the PH domain work together in membrane
localization with RBD receiving the RhoA signal and the
PH domain receiving the phospholipid signal. PH domains
have diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 126
Score = 54.7 bits (132), Expect = 5e-09
Identities = 31/82 (37%), Positives = 48/82 (58%), Gaps = 7/82 (8%)
Query: 284 EGYVKVPKTGGVKK-GWVRQFVVVCDFKLFLYDISPDR-NALPAVYVSLVLDMRDEDFAV 341
EG++ +P +K+ GW +QFVVV K+ Y+ D+ N+ P L++D+ + F V
Sbjct: 4 EGWLSIPNKQNIKRHGWKKQFVVVSSKKILFYNSEQDKQNSDP----FLIIDL-SKLFHV 58
Query: 342 SGVRESDVIHATKKDIPCIFRI 363
V + DVI A K+IP IF+I
Sbjct: 59 RSVTQGDVIRADAKEIPRIFQI 80
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 59.1 bits (143), Expect = 6e-09
Identities = 54/241 (22%), Positives = 98/241 (40%), Gaps = 18/241 (7%)
Query: 457 KIYQMDYIPEEQLLVVLAGKQRYVRLVPVRALDGDEVEWVK-IPETKGCLSFT-TGPLTH 514
I Q+ I E +L+++L+ K+ Y P+ +D E VK G +SF G
Sbjct: 903 NISQIIVIEEYKLMLLLSDKKLYS--CPLDVIDASTEENVKKSRIVNGHVSFFKQGFCNG 960
Query: 515 TR---TQHCLALAVKRQNSSQIILYEITRTKTRHKRL-----HEVILPTLAQCIHIFSEG 566
R +L+ + + ++ K L E+ +P+ +H F +
Sbjct: 961 KRLVCAVKSSSLSATLAVIEAPLALKKNKSGNLKKALTIELSTELYVPSEPLSVH-FLKN 1019
Query: 567 RLCVGYQSGFSIYKFSQDNRPIPLIHQDNPLVSLLTYSPVDALLAIELPRGEFLLVFHSL 626
+LC+G + GF I R L++ + +AI GEFLL +
Sbjct: 1020 KLCIGCKKGFEIVSLEN-LRTESLLNPADTSPLFFEKKENTKPIAIFRVSGEFLLCYSEF 1078
Query: 627 AAYVDSQGHKSREKEIM-YPALPTGASYMDGQLLIFSETHVDVFNAESGDWLQTV---NI 682
A +V+ QG + R I + P + +L F +++ + E+G+ ++ + NI
Sbjct: 1079 AFFVNDQGWRKRTSWIFHWEGEPQEFALSYPYILAFEPNFIEIRHIETGELIRCILGHNI 1138
Query: 683 R 683
R
Sbjct: 1139 R 1139
>gnl|CDD|215766 pfam00169, PH, PH domain. PH stands for pleckstrin homology.
Length = 101
Score = 50.2 bits (120), Expect = 1e-07
Identities = 28/117 (23%), Positives = 48/117 (41%), Gaps = 18/117 (15%)
Query: 283 YEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVSLVLDMRDEDFAVS 342
EG++ K G +K W +++ V+ D L Y S ++ P + L S
Sbjct: 3 KEGWLLK-KGSGGRKSWKKRYFVLFDGVLLYYKDSKKSSSRPKGSIPL-----------S 50
Query: 343 GVRESDVIHATKKDIPCIFRITTSLMDPPGTKNHTLMLADSDTEKTKWVVALSELHR 399
G + + V + F I T G + L+ A+S+ E+ +WV A+ R
Sbjct: 51 GCQVTKVPDSEDGKRKNCFEIRT------GDRETFLLQAESEEERKEWVKAIRSAIR 101
>gnl|CDD|214574 smart00233, PH, Pleckstrin homology domain. Domain commonly found
in eukaryotic signalling proteins. The domain family
possesses multiple functions including the abilities to
bind inositol phosphates, and various proteins. PH
domains have been found to possess inserted domains
(such as in PLC gamma, syntrophins) and to be inserted
within other domains. Mutations in Brutons tyrosine
kinase (Btk) within its PH domain cause X-linked
agammaglobulinaemia (XLA) in patients. Point mutations
cluster into the positively charged end of the molecule
around the predicted binding site for
phosphatidylinositol lipids.
Length = 102
Score = 48.7 bits (116), Expect = 4e-07
Identities = 30/117 (25%), Positives = 50/117 (42%), Gaps = 17/117 (14%)
Query: 283 YEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVSLVLDMRDEDFAVS 342
EG++ K+GG KK W +++ V+ + L Y D+ + +
Sbjct: 3 KEGWLYK-KSGGGKKSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCT------- 54
Query: 343 GVRESDVIHATKKDIPCIFRITTSLMDPPGTKNHTLMLADSDTEKTKWVVALSELHR 399
VRE+ ++KK P F I T + L+ A+S+ E+ KWV AL +
Sbjct: 55 -VREAPDPDSSKK--PHCFEIKT------SDRKTLLLQAESEEEREKWVEALRKAIA 102
>gnl|CDD|241231 cd00821, PH, Pleckstrin homology (PH) domain. PH domains have
diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 92
Score = 43.5 bits (102), Expect = 2e-05
Identities = 23/111 (20%), Positives = 42/111 (37%), Gaps = 19/111 (17%)
Query: 283 YEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVSLVLDMRDEDFAVS 342
EG++K + G K W +++ V+ D L Y D + P + L +
Sbjct: 1 KEGWLKK-RGGKGLKSWKKRWFVLFDDVLLYYKSKKDSSKKPKGLIPLSDGL-------- 51
Query: 343 GVRESDVIHATKKDIPCIFRITTSLMDPPGTKNHTLMLADSDTEKTKWVVA 393
+V + P F + T P + A+S+ E+ +W+ A
Sbjct: 52 -----EVELVSSSGKPNCFELVT-----PDRGRTYYLQAESEEEREEWLEA 92
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 41.2 bits (97), Expect = 0.002
Identities = 29/140 (20%), Positives = 59/140 (42%), Gaps = 3/140 (2%)
Query: 56 GGPGDKNWRDRRSQKLEKM--ELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICD 113
GG +K + ++L+++ EL L++ L + + EL L ++++ +
Sbjct: 654 GGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEE 713
Query: 114 FRLKMESVSMDLRRKDSQLKELQSRLESGDI-LERPASQISYLDHFLKESRHTGSVESED 172
++E + +L + +L++LQSRLE + LE ++ L L+E E
Sbjct: 714 LERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEA 773
Query: 173 GDIEDNRVPSIASSRSNLSE 192
+ + R L E
Sbjct: 774 LAKLKEEIEELEEKRQALQE 793
Score = 35.8 bits (83), Expect = 0.11
Identities = 16/80 (20%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 64 RDRRSQKLEKME--LLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESV 121
+ ++LE+ E L L+ L S Q + ++ +E+ + E+ ++++ + ++E +
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854
Query: 122 SMDLRRKDSQLKELQSRLES 141
+L +L+EL++ E
Sbjct: 855 EKELEELKEELEELEAEKEE 874
Score = 33.9 bits (78), Expect = 0.40
Identities = 21/107 (19%), Positives = 50/107 (46%), Gaps = 2/107 (1%)
Query: 73 KMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQL 132
K E+ L+ + + ++ EEL + L A ++E+ + E + ++ + ++
Sbjct: 778 KEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEI 837
Query: 133 KELQSRLESGDI-LERPASQISYLDHFLKESR-HTGSVESEDGDIED 177
+EL+ +L+ + LE ++ L L+E +E E ++E+
Sbjct: 838 EELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEE 884
Score = 33.5 bits (77), Expect = 0.52
Identities = 24/89 (26%), Positives = 38/89 (42%), Gaps = 4/89 (4%)
Query: 66 RRSQKLEKMELLNLQSSLNSEI----QAKTQISEELSKTRSELIAAQKEICDFRLKMESV 121
K E EL + L E+ + K ++ EEL + SEL ++EI R ++E +
Sbjct: 858 LEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEEL 917
Query: 122 SMDLRRKDSQLKELQSRLESGDILERPAS 150
L R + +L EL+ LE
Sbjct: 918 EAKLERLEVELPELEEELEEEYEDTLETE 946
Score = 31.6 bits (72), Expect = 1.8
Identities = 17/86 (19%), Positives = 41/86 (47%), Gaps = 5/86 (5%)
Query: 60 DKNWRDRRSQKLEKME-----LLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDF 114
+ ++R + E++E L L+ + + + + EEL + EL A++ +
Sbjct: 753 LEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDAL 812
Query: 115 RLKMESVSMDLRRKDSQLKELQSRLE 140
++ES+ R + +++EL+ +E
Sbjct: 813 ERELESLEQRRERLEQEIEELEEEIE 838
Score = 30.8 bits (70), Expect = 3.5
Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 3/90 (3%)
Query: 64 RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
L +L L+ L + +++ EEL + + EL A+KEI + + ++E +
Sbjct: 222 LRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELRE 281
Query: 124 DLRRKDSQLKELQSRLESGDILERPASQIS 153
+L +L EL+ +E LE S +
Sbjct: 282 ELEELQEELLELKEEIEE---LEGEISLLR 308
Score = 30.5 bits (69), Expect = 5.0
Identities = 19/92 (20%), Positives = 41/92 (44%), Gaps = 12/92 (13%)
Query: 61 KNWRDRRSQKLEKMELL------------NLQSSLNSEIQAKTQISEELSKTRSELIAAQ 108
++ RR + +++E L L+ L + ++ EEL + +E +
Sbjct: 817 ESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELE 876
Query: 109 KEICDFRLKMESVSMDLRRKDSQLKELQSRLE 140
E+ + + E + +LR +S+L EL+ +E
Sbjct: 877 DELKELEEEKEELEEELRELESELAELKEEIE 908
Score = 29.3 bits (66), Expect = 9.3
Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 4/86 (4%)
Query: 60 DKNWRDRRSQKLEKMELLNLQSSLNSEIQAK----TQISEELSKTRSELIAAQKEICDFR 115
++ + + E EL EI+ ++ EEL + + EL+ ++EI +
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301
Query: 116 LKMESVSMDLRRKDSQLKELQSRLES 141
++ + L +++L+EL+ RLE
Sbjct: 302 GEISLLRERLEELENELEELEERLEE 327
>gnl|CDD|241430 cd13276, PH_AtPH1, Arabidopsis thaliana Pleckstrin homolog (PH) 1
(AtPH1) PH domain. AtPH1 is expressed in all plant
tissue and is proposed to be the plant homolog of human
pleckstrin. Pleckstrin consists of two PH domains
separated by a linker region, while AtPH has a single PH
domain with a short N-terminal extension. AtPH1 binds
PtdIns3P specifically and is thought to be an adaptor
molecule since it has no obvious catalytic functions. PH
domains have diverse functions, but in general are
involved in targeting proteins to the appropriate
cellular location or in the interaction with a binding
partner. They share little sequence conservation, but
all have a common fold, which is electrostatically
polarized. Less than 10% of PH domains bind
phosphoinositide phosphates (PIPs) with high affinity
and specificity. PH domains are distinguished from other
PIP-binding domains by their specific high-affinity
binding to PIPs with two vicinal phosphate groups:
PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which
results in targeting some PH domain proteins to the
plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 117
Score = 38.1 bits (89), Expect = 0.002
Identities = 32/113 (28%), Positives = 48/113 (42%), Gaps = 24/113 (21%)
Query: 283 YEGYVKVPKTGGVKKGWVRQFVVVCDFKLFLY-DISPDRNALPAVYVSLVLDMRDEDFAV 341
G++ K GG K W R++ V+ KLF + D PD + +S L V
Sbjct: 9 KAGWLT--KQGGSIKTWRRRWFVLKQGKLFYFKDEDPDSEPRGVIDLSDCL-------TV 59
Query: 342 SGVRESDVIHATKKDIPCIFRITTSLMDPPGTKNHTLMLADSDTEKTKWVVAL 394
E AT K F ++T P T ++ADS+ EK +W+ A+
Sbjct: 60 KSAEE-----ATNK--EFAFEVST----PERTF---YLIADSEKEKEEWISAI 98
>gnl|CDD|117396 pfam08826, DMPK_coil, DMPK coiled coil domain like. This domain is
found in the myotonic dystrophy protein kinase (DMPK)
and adopts a coiled coil structure. It plays a role in
dimerisation.
Length = 61
Score = 36.4 bits (84), Expect = 0.003
Identities = 18/60 (30%), Positives = 37/60 (61%)
Query: 79 LQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSR 138
LQS+L +EI+AK + EEL K ++ I + ++ + K + ++R+ +++EL++R
Sbjct: 2 LQSALEAEIRAKQSLQEELEKVKAANINFESKLQEAEAKNRELEAEVRQLKKRMEELRAR 61
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 36.9 bits (86), Expect = 0.010
Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 2/74 (2%)
Query: 69 QKLEKM-ELLN-LQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLR 126
+ LE+ E+ N Q E+ + EEL R +L +KEI + + ES +L
Sbjct: 31 EDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELS 90
Query: 127 RKDSQLKELQSRLE 140
+ +E + LE
Sbjct: 91 EAEESWEEQKKMLE 104
>gnl|CDD|190531 pfam03126, Plus-3, Plus-3 domain. This domain is about 90 residues
in length and is often found associated with the
pfam02213 domain. The function of this domain is
uncertain. It is possible that this domain is involved
in DNA binding as it has three conserved positively
charged residues, hence this domain has been named the
plus-3 domain. It is found in yeast Rtf1 which may be a
transcription elongation factor.
Length = 105
Score = 35.3 bits (82), Expect = 0.020
Identities = 15/54 (27%), Positives = 24/54 (44%), Gaps = 5/54 (9%)
Query: 445 RYVQIARVGDTKKIYQMDYIPEEQLLVVLAGKQRYVRLVPVRAL-DGD--EVEW 495
R QI V + KK YQ+ ++ L V GK R+ + + + E E+
Sbjct: 47 RLAQITGVKEGKKPYQLGGKTTDKYLKVTHGKSE--RVFRISFVSNSPFTEEEF 98
Score = 29.2 bits (66), Expect = 2.4
Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 415 KIARVGDTKKIYQMDYIPKEQLLVVLAGKQ-RYVQIARVGD 454
+I V + KK YQ+ ++ L V GK R +I+ V +
Sbjct: 50 QITGVKEGKKPYQLGGKTTDKYLKVTHGKSERVFRISFVSN 90
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 37.0 bits (86), Expect = 0.033
Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 16/79 (20%)
Query: 63 WRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVS 122
W R+ +++ L K + + +T+ +L EI R K+E +
Sbjct: 359 WEAHRTPLIDEYRSL------------KEKNRNKEDETQRQL----DEIKKLRNKIEELE 402
Query: 123 MDLRRKDSQLKELQSRLES 141
+L+ K+ K+L E+
Sbjct: 403 SELQTKEQLYKQLLDEYEN 421
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 35.0 bits (81), Expect = 0.14
Identities = 19/101 (18%), Positives = 44/101 (43%), Gaps = 14/101 (13%)
Query: 72 EKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQ 131
K +L + + +L E++ Q+ +EL + KE K++ + ++ K +
Sbjct: 173 IKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKE------KLKKLLQEIMIKVKK 226
Query: 132 LKELQSRLES--GDILERPA------SQISYLDHFLKESRH 164
L+EL+ L+ I + ++I+ + L++ R
Sbjct: 227 LEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRG 267
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 34.6 bits (80), Expect = 0.15
Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 13/109 (11%)
Query: 74 MELLNLQSSLNSEIQAKTQISEE----LSKTRSEL-IAAQKEICDFRLKMESVSMDLRRK 128
+ LNL +S+ +++ K Q +E L + EL + E+ R ++ S+S+ + K
Sbjct: 167 NKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEK 226
Query: 129 DSQLKELQSRL-ESGDILERPASQISYL-------DHFLKESRHTGSVE 169
QL+ELQ L E +E ++ S L + +E R + E
Sbjct: 227 RKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREECRGWSAKE 275
>gnl|CDD|219977 pfam08700, Vps51, Vps51/Vps67. This family includes a presumed
domain found in a number of components of vesicular
transport. The VFT tethering complex (also known as GARP
complex, Golgi associated retrograde protein complex,
Vps53 tethering complex) is a conserved eukaryotic
docking complex which is involved recycling of proteins
from endosomes to the late Golgi. Vps51 (also known as
Vps67) is a subunit of VFT and interacts with the SNARE
Tlg1. Cog1_N is the N-terminus of the Cog1 subunit of
the eight-unit Conserved Oligomeric Golgi (COG) complex
that participates in retrograde vesicular transport and
is required to maintain normal Golgi structure and
function. The subunits are located in two lobes and Cog1
serves to bind the two lobes together probably via the
highly conserved N-terminal domain of approximately 85
residues.
Length = 87
Score = 32.3 bits (74), Expect = 0.15
Identities = 22/72 (30%), Positives = 33/72 (45%), Gaps = 11/72 (15%)
Query: 75 ELLNLQSSLNSEIQAKTQISEELSKT----RSELIAAQKEICDFRLKMESVSMDLRRKDS 130
EL ++ L +EI+ EEL + +LI A I KM+S L +K S
Sbjct: 23 ELRQIEKKLRAEIEEL---QEELRQLVYENYRDLIKAADTI----SKMKSEMESLSQKLS 75
Query: 131 QLKELQSRLESG 142
+LK+ +L S
Sbjct: 76 ELKQALEKLSSK 87
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 34.6 bits (80), Expect = 0.21
Identities = 20/85 (23%), Positives = 43/85 (50%), Gaps = 9/85 (10%)
Query: 65 DRRSQKLE------KMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEI-CDFRLK 117
R L+ + +L +L+ L E +AK ++ +LSK +E+ + + + L+
Sbjct: 256 SRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRSKFESEGALR 315
Query: 118 MESVSMDLRRKDSQ-LKELQSRLES 141
E + +L++K +Q + EL+ E+
Sbjct: 316 AEELE-ELKKKLNQKISELEEAAEA 339
>gnl|CDD|240133 cd04863, MtLigD_Pol_like, MtLigD_Pol_like: Polymerase (Pol) domain
of bacterial LigD proteins similar to Mycobacterium
tuberculosis (Mt)LigD. The LigD Pol domain belongs to
the archaeal/eukaryal primase (AEP) superfamily. In
prokaryotes, LigD along with Ku is required for
non-homologous end joining (NHEJ)-mediated repair of DNA
double-strand breaks (DSB). NHEJ-mediated DNA DSB repair
is error-prone. MtLigD is monomeric and contains an
N-terminal Pol domain, a central phosphoesterase module,
and a C-terminal ligase domain. It has been suggested
that LigD Pol contributes to NHEJ-mediated DNA DSB
repair in vivo, by filling in short 5'-overhangs with
ribonucleotides; the filled in termini would then be
sealed by the associated LigD ligase domain, resulting
in short stretches of RNA incorporated into the genomic
DNA. The MtLigD Pol domain is stimulated by manganese,
is error-prone, and prefers adding rNTPs to dNTPs in
vitro. The MtLigD Pol domain has been shown to prefer
DNA gapped substrates containing a 5'-phosphate group at
the gap.
Length = 231
Score = 33.1 bits (76), Expect = 0.39
Identities = 21/67 (31%), Positives = 29/67 (43%), Gaps = 7/67 (10%)
Query: 670 NAESG--DWLQTVNIRRALPLDTRGSLCFSLANDIPYVVYLANIHQREL-INLMTGGDMR 726
N SG DWL T +R G+L + L ND+ + + AN+ EL + T
Sbjct: 42 NCPSGAPDWLPTAEVRSE----GSGTLTYPLVNDLATLAWAANLAALELHVPQWTVDADG 97
Query: 727 QPRPKRR 733
P P R
Sbjct: 98 NPGPPDR 104
>gnl|CDD|201356 pfam00628, PHD, PHD-finger. PHD folds into an interleaved type of
Zn-finger chelating 2 Zn ions in a similar manner to
that of the RING and FYVE domains. Several PHD fingers
have been identified as binding modules of methylated
histone H3.
Length = 51
Score = 30.2 bits (68), Expect = 0.40
Identities = 7/28 (25%), Positives = 12/28 (42%)
Query: 239 VVCDICGFACHLSCCDKVPPSCPVPPDQ 266
++CD C HL+C +P +
Sbjct: 15 LLCDGCDRWFHLACLGPPLEPEEIPEGE 42
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 33.6 bits (77), Expect = 0.42
Identities = 16/86 (18%), Positives = 40/86 (46%), Gaps = 6/86 (6%)
Query: 68 SQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRS------ELIAAQKEICDFRLKMESV 121
LE++E L + L + Q+ + L + + L ++E+ + R +++ +
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKEL 598
Query: 122 SMDLRRKDSQLKELQSRLESGDILER 147
L+ + +L +L+ L+S ++ E
Sbjct: 599 KKKLKELEERLSQLEELLQSLELSEA 624
Score = 33.6 bits (77), Expect = 0.51
Identities = 12/72 (16%), Positives = 29/72 (40%)
Query: 69 QKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK 128
+ E EL L + ++ ++ E+L L ++++ ++E ++ +
Sbjct: 296 LEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNEL 355
Query: 129 DSQLKELQSRLE 140
L+E LE
Sbjct: 356 AKLLEERLKELE 367
>gnl|CDD|241425 cd13271, PH2_TAPP1_2, Tandem PH-domain-containing proteins 1 and 2
Pleckstrin homology (PH) domain, C-terminal repeat. The
binding of TAPP1 (also called PLEKHA1/pleckstrin
homology domain containing, family A (phosphoinositide
binding specific) member 1) and TAPP2 (also called
PLEKHA2) adaptors to PtdIns(3,4)P(2), but not PI(3,4,
5)P3, function as negative regulators of insulin and
PI3K signalling pathways (i.e. TAPP/utrophin/syntrophin
complex). TAPP1 and TAPP2 contain two sequential PH
domains in which the C-terminal PH domain specifically
binds PtdIns(3,4)P2 with high affinity. The N-terminal
PH domain does not interact with any phosphoinositide
tested. They also contain a C-terminal PDZ-binding motif
that interacts with several PDZ-binding proteins,
including PTPN13 (known previously as PTPL1 or FAP-1) as
well as the scaffolding proteins MUPP1 (multiple
PDZ-domain-containing protein 1), syntrophin and
utrophin. PH domains have diverse functions, but in
general are involved in targeting proteins to the
appropriate cellular location or in the interaction with
a binding partner. They share little sequence
conservation, but all have a common fold, which is
electrostatically polarized. Less than 10% of PH domains
bind phosphoinositide phosphates (PIPs) with high
affinity and specificity. PH domains are distinguished
from other PIP-binding domains by their specific
high-affinity binding to PIPs with two vicinal phosphate
groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3
which results in targeting some PH domain proteins to
the plasma membrane. A few display strong specificity in
lipid binding. Any specificity is usually determined by
loop regions or insertions in the N-terminus of the
domain, which are not conserved across all PH domains.
PH domains are found in cellular signaling proteins such
as serine/threonine kinase, tyrosine kinases, regulators
of G-proteins, endocytotic GTPases, adaptors, as well as
cytoskeletal associated molecules and in lipid
associated enzymes.
Length = 114
Score = 31.2 bits (71), Expect = 0.53
Identities = 24/106 (22%), Positives = 43/106 (40%), Gaps = 21/106 (19%)
Query: 291 KTGGVKKGWVRQFVVVCDFKLFLYDISPDRNALPAVYVSLVLDMRDEDFAVSGVRESDVI 350
K G V+K W R++ ++ D + Y D+ L + +++ V V E
Sbjct: 16 KQGAVRKNWKRRYFILDDNTISYYKSETDKEPLRTIP------LKE----VLKVHECLSG 65
Query: 351 HATKKDIPCIFRITTSLMDPPGTKNHTLML-ADSDTEKTKWVVALS 395
+D +F I T+ + T + ADS + W+ A+S
Sbjct: 66 DLLMRDN--LFEIITT--------SRTFYIQADSPEDMHSWIKAIS 101
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 33.5 bits (77), Expect = 0.57
Identities = 19/77 (24%), Positives = 43/77 (55%), Gaps = 6/77 (7%)
Query: 67 RSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQK---EICDFRLKMESVSM 123
+ ++ E E+L + ++SE+ ++ EEL K E+ ++ EI + ++ES+
Sbjct: 196 KEKEKELEEVLREINEISSELP---ELREELEKLEKEVKELEELKEEIEELEKELESLEG 252
Query: 124 DLRRKDSQLKELQSRLE 140
R+ + +++EL+ R+E
Sbjct: 253 SKRKLEEKIRELEERIE 269
Score = 31.6 bits (72), Expect = 2.0
Identities = 22/129 (17%), Positives = 49/129 (37%), Gaps = 14/129 (10%)
Query: 65 DRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMD 124
+ ++ K E+ L I+ I E + + EL +EI + ++ + +
Sbjct: 164 YKNLGEVIK-EIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREE 222
Query: 125 LRRKDSQLKELQSRLESGDILERPASQISYLDHFLKESRHTGSVESEDGDIEDNRVPSIA 184
L + + ++KEL+ E + LE+ +E + ++E+ I
Sbjct: 223 LEKLEKEVKELEELKEEIEELEK---------ELESLEGSKRKLEEKIRELEE----RIE 269
Query: 185 SSRSNLSEL 193
+ + EL
Sbjct: 270 ELKKEIEEL 278
Score = 30.8 bits (70), Expect = 3.8
Identities = 23/101 (22%), Positives = 43/101 (42%), Gaps = 24/101 (23%)
Query: 65 DRRSQKLEK-----MELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKME 119
+ R ++LE +EL + + L E + ++ EEL K EL +K + + R ++E
Sbjct: 591 EERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELE 650
Query: 120 -------------------SVSMDLRRKDSQLKELQSRLES 141
+S +L ++L+EL+ R E
Sbjct: 651 ELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREE 691
Score = 30.4 bits (69), Expect = 4.8
Identities = 25/100 (25%), Positives = 45/100 (45%), Gaps = 7/100 (7%)
Query: 96 ELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSRLESGDILERPASQISY- 154
EL EL +KE+ +++ +L + +L+EL+ LE LE+ S+ Y
Sbjct: 606 ELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEE---LEKKYSEEEYE 662
Query: 155 --LDHFLKESRHTGSVESEDGDIEDNRVPSIASSRSNLSE 192
+ +L+ SR + +E ++E R I + L E
Sbjct: 663 ELREEYLELSRELAGLRAELEELE-KRREEIKKTLEKLKE 701
>gnl|CDD|206350 pfam14182, YgaB, YgaB-like protein. The YgaB-like protein family
includes the B. subtilis YgaB protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are
approximately 90 amino acids in length.
Length = 79
Score = 30.4 bits (69), Expect = 0.58
Identities = 20/75 (26%), Positives = 36/75 (48%), Gaps = 17/75 (22%)
Query: 68 SQKLEKME-LLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLR 126
S++++ M+ LL LQS L E + EL+ +K+ ++ES+ ++
Sbjct: 7 SEQMKTMDKLLFLQSEL-----------ERCQEIEKELLELEKKA-----ELESIQQEIS 50
Query: 127 RKDSQLKELQSRLES 141
R +LKE+Q E
Sbjct: 51 RMRKELKEIQELFEK 65
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 33.5 bits (77), Expect = 0.58
Identities = 28/133 (21%), Positives = 55/133 (41%), Gaps = 12/133 (9%)
Query: 70 KLEKM--ELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRR 127
+L+++ L L+ L+S +I L + EL A ++I + ++E + + +
Sbjct: 675 ELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEK 734
Query: 128 KDSQLKELQSRLESGDILERPASQISYLDHFLKE----SRHTGSVESEDGDIED----NR 179
+L+EL+ L S + + + S L +E D+E +R
Sbjct: 735 LKERLEELEEDLSS--LEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSR 792
Query: 180 VPSIASSRSNLSE 192
+P I + S L E
Sbjct: 793 IPEIQAELSKLEE 805
Score = 33.1 bits (76), Expect = 0.65
Identities = 28/117 (23%), Positives = 51/117 (43%), Gaps = 10/117 (8%)
Query: 64 RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
R S + E +L S L ++ Q+ EEL+K +L ++E + K+ +
Sbjct: 244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDL--GEEEQLRVKEKIGELEA 301
Query: 124 DLRRKDSQLKELQSRLESGDILERPASQISYLDHFLKESRHTGSVESEDGDIEDNRV 180
++ + + E + LE D ER A + +D L E +E + +IE+ R
Sbjct: 302 EIASLERSIAEKERELE--DAEERLAKLEAEIDKLLAE------IEELEREIEEERK 350
Score = 33.1 bits (76), Expect = 0.75
Identities = 16/81 (19%), Positives = 36/81 (44%), Gaps = 4/81 (4%)
Query: 64 RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSK----TRSELIAAQKEICDFRLKME 119
R+ ++ + +L + L E++ EE+ K TR EL ++++ + ++
Sbjct: 343 REIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREIN 402
Query: 120 SVSMDLRRKDSQLKELQSRLE 140
+ +L R +L+ L L
Sbjct: 403 ELKRELDRLQEELQRLSEELA 423
Score = 31.6 bits (72), Expect = 1.8
Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 7/83 (8%)
Query: 69 QKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK 128
QKL ++ L + L EIQ + +L + + +KEI + K E + +L
Sbjct: 819 QKLNRLTLE--KEYLEKEIQELQEQRIDLKEQIKSI---EKEIENLNGKKEELEEELEEL 873
Query: 129 DSQLKELQSRLE--SGDILERPA 149
++ L++L+SRL + E A
Sbjct: 874 EAALRDLESRLGDLKKERDELEA 896
Score = 31.6 bits (72), Expect = 1.9
Identities = 11/62 (17%), Positives = 32/62 (51%)
Query: 79 LQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSR 138
++ LN K + +E+ + + + I +++I ++E+++ + +L+EL++
Sbjct: 817 IEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAA 876
Query: 139 LE 140
L
Sbjct: 877 LR 878
Score = 30.4 bits (69), Expect = 4.5
Identities = 15/90 (16%), Positives = 36/90 (40%), Gaps = 5/90 (5%)
Query: 66 RRSQKLE--KMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
R S++L + +++ +N + K + E+ K +L ++ + ++ +
Sbjct: 417 RLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKE 476
Query: 124 DLRRKDSQLKELQSRLESGDILERPASQIS 153
+ R + +L +LQ L E A
Sbjct: 477 EYDRVEKELSKLQREL---AEAEAQARASE 503
Score = 29.7 bits (67), Expect = 8.1
Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 66 RRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDL 125
RS ++ EL + + E AK + E+ K +E+ ++EI + R + + ++ +
Sbjct: 307 ERSIAEKERELEDAE-----ERLAKLE--AEIDKLLAEIEELEREIEEERKRRDKLTEEY 359
Query: 126 RRKDSQLKELQSRLESGD 143
+L++L++ LE D
Sbjct: 360 AELKEELEDLRAELEEVD 377
>gnl|CDD|182106 PRK09841, PRK09841, cryptic autophosphorylating protein tyrosine
kinase Etk; Provisional.
Length = 726
Score = 33.0 bits (75), Expect = 0.62
Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 23/117 (19%)
Query: 93 ISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK---------DSQLKELQSRLESGD 143
+ +L + RSEL A++++ +R + +SV ++L K D+QL EL R
Sbjct: 272 LQRQLPEVRSELDQAEEKLNVYRQQRDSVDLNLEAKAVLEQIVNVDNQLNELTFR----- 326
Query: 144 ILERPASQISYLDH----FLKESRHTGSVESEDGDIEDNRVPSIASSRSNLSELTID 196
E SQ+ DH L E R T +E E + + RV ++ S++ + L+ D
Sbjct: 327 --EAEISQLYKKDHPTYRALLEKRQT--LEQERKRL-NKRVSAMPSTQQEVLRLSRD 378
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 31.7 bits (72), Expect = 0.85
Identities = 19/107 (17%), Positives = 41/107 (38%), Gaps = 9/107 (8%)
Query: 66 RRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLK-------M 118
Q + +L LQ L+ Q +++ EEL + + E+ D + +
Sbjct: 64 APVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELL 123
Query: 119 ESVSMDLRRKDSQLKELQSRLESGDILERPASQISYLDHFLKESRHT 165
+S+ L + +KEL L ++ + ++ L L+ +
Sbjct: 124 KSLEERLESLEESIKELAKELR--ELRQDLREEVEELREELERLQEN 168
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 32.7 bits (74), Expect = 0.86
Identities = 35/145 (24%), Positives = 65/145 (44%), Gaps = 15/145 (10%)
Query: 64 RDRRSQKLEKM--ELLNLQSSLNSEIQAKTQISEELSK-------TRSELIAAQKEICDF 114
R + E++ +L LQ L S Q + EEL+ T +EL ++ E
Sbjct: 278 RIDLQLENEQLHEDLRTLQERLESSQQKAGLLGEELASLGSLRDHTIAELHQSRLESAQM 337
Query: 115 RLKMESVSMDLRRKDSQLKE-----LQSRLESGDILERPASQISYLDHFLKESRHTGSV- 168
L++ +++ L+ SQ + QS D +++ ++++ L+ +L+E R
Sbjct: 338 SLQLSQLNLALKEGQSQWAQERETLRQSAEADKDRIQKLSAELLKLEEWLQEERSQREKL 397
Query: 169 ESEDGDIEDNRVPSIASSRSNLSEL 193
E E G +D ++ +R LSEL
Sbjct: 398 EVELGIEKDCNRVQLSENRRELSEL 422
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 32.3 bits (73), Expect = 1.1
Identities = 28/149 (18%), Positives = 59/149 (39%), Gaps = 6/149 (4%)
Query: 65 DRRSQKLEKME-LLNLQS-SLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVS 122
D +LE+ L ++ SL + + K + E+ R ++++ + K+E++
Sbjct: 346 DALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQ 405
Query: 123 MDLRRKDSQLKELQSRLESGDILERPASQISYLDHFLKE-SRHTGSVESEDGDIEDNRVP 181
RRK+ +LKE + RL S + + L+ L E R ++ + E
Sbjct: 406 ETFRRKERRLKEEKERLRSLQTDTNTDTALEKLEKALAEKERIIERLKEQRDRDERYEQE 465
Query: 182 SIASSRSNLSEL--TIDQMNNMLRHDRSS 208
+ + +L + + L +R
Sbjct: 466 EFETYKKEFEDLKEEVQNLQLKL-SEREL 493
Score = 31.5 bits (71), Expect = 2.0
Identities = 22/83 (26%), Positives = 46/83 (55%), Gaps = 2/83 (2%)
Query: 61 KNWRDRRSQKL--EKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKM 118
K DR +Q+L +K ELL LQ+ L + + + + L K + +L A++E + ++
Sbjct: 286 KAKCDRAAQELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEV 345
Query: 119 ESVSMDLRRKDSQLKELQSRLES 141
+++ +L RK + L + + L++
Sbjct: 346 DALRYELERKHNTLTKKTASLQA 368
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 31.8 bits (72), Expect = 1.2
Identities = 13/82 (15%), Positives = 34/82 (41%)
Query: 59 GDKNWRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKM 118
G K + K + L+ L + K ++ +EL + +E Q+ + ++
Sbjct: 120 GLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVEN 179
Query: 119 ESVSMDLRRKDSQLKELQSRLE 140
+ L++ ++ +L+ R +
Sbjct: 180 SRLEEMLKKLPGEVYDLKKRWD 201
>gnl|CDD|218912 pfam06148, COG2, COG (conserved oligomeric Golgi) complex
component, COG2. The COG complex comprises eight
proteins COG1-8. The COG complex plays critical roles in
Golgi structure and function. The proposed function of
the complex is to mediate the initial physical contact
between transport vesicles and their membrane targets. A
comparable role in tethering vesicles has been suggested
for at least six additional large multisubunit
complexes, including the exocyst, a complex that
mediates trafficking to the plasma membrane. COG2
structure reveals a six-helix bundle with few conserved
surface features but a general resemblance to recently
determined crystal structures of four different exocyst
subunits. These bundles inCOG2 may act as platforms for
interaction with other trafficing proteins including
SNAREs (soluble N-ethylmaleimide factor attachment
protein receptors) and Rabs.
Length = 133
Score = 30.7 bits (70), Expect = 1.3
Identities = 24/111 (21%), Positives = 46/111 (41%), Gaps = 17/111 (15%)
Query: 66 RRSQKLEKM--ELLNLQSSLNSEI---------------QAKTQISEELSKTRSELIAAQ 108
R+ LE + +L L SE+ + E+L + R L+A +
Sbjct: 23 RKGVPLESLRDDLRAYLKLLKSELIELINEDYADFVSLSTNLVGLDEKLERIRVPLLALR 82
Query: 109 KEICDFRLKMESVSMDLRRKDSQLKELQSRLESGDILERPASQISYLDHFL 159
KE+ R ++E+ ++ K + K+L+ + ES +L + L+ L
Sbjct: 83 KEVESVRSELEARLEEVEAKLEEKKDLREKKESLKLLLDLDHVVEKLEDLL 133
>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
Length = 850
Score = 31.9 bits (73), Expect = 1.5
Identities = 17/77 (22%), Positives = 45/77 (58%), Gaps = 5/77 (6%)
Query: 65 DRRSQKLEKMELLNLQSSLNSEIQAKTQ-ISEELSKTRSELIAAQKEICDFRLKMESVSM 123
D+ + ++ +LL LQ+ ++ +++ Q +E L++ + LI A+ E+ L+ +++ +
Sbjct: 690 DKAKAEEQRTKLLELQAE-SAAVESSGQSRAEALAEAEARLIEAEAEVEQAELRAKALRI 748
Query: 124 DLRRKDSQLKELQSRLE 140
+++L++L+ R E
Sbjct: 749 ---EAEAELEKLRKRQE 762
>gnl|CDD|191851 pfam07798, DUF1640, Protein of unknown function (DUF1640). This
family consists of sequences derived from hypothetical
eukaryotic proteins. A region approximately 100 residues
in length is featured.
Length = 177
Score = 30.7 bits (70), Expect = 1.7
Identities = 16/78 (20%), Positives = 40/78 (51%), Gaps = 10/78 (12%)
Query: 69 QKLEKMELLNLQSS---LNSEI-QAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMD 124
Q LEK E NLQ+ L +++ + K ++ +E++K + + D L+ + +
Sbjct: 68 QSLEKSEFANLQAENEKLKNDLEKLKNRLRDEITKVTAGV------RLDLNLEKGRIREE 121
Query: 125 LRRKDSQLKELQSRLESG 142
++ ++KE +++++
Sbjct: 122 SAEQELKIKETDTKIDTE 139
>gnl|CDD|150852 pfam10241, KxDL, Uncharacterized conserved protein. This is a
family of short proteins which are conserved over a
region of 80 residues. There is a characteristic KxDL
motif towards the C-terminus. The function is unknown.
Length = 88
Score = 29.2 bits (66), Expect = 1.7
Identities = 16/62 (25%), Positives = 23/62 (37%), Gaps = 3/62 (4%)
Query: 80 QSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSRL 139
L+ I + QI L KT L+ + RLK D LKE++ L
Sbjct: 10 PEDLDRIIAKQAQILGRLEKTNRMLLELN-ALSQARLK--GARADFAEGTRLLKEMKKDL 66
Query: 140 ES 141
+
Sbjct: 67 DW 68
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 31.9 bits (72), Expect = 1.8
Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 6/80 (7%)
Query: 68 SQKLEKMELL--NLQSSLNSEIQAKTQISEEL----SKTRSELIAAQKEICDFRLKMESV 121
L+ L + S SE++ K + +E+ +S + +KEI + R K++ V
Sbjct: 697 ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKV 756
Query: 122 SMDLRRKDSQLKELQSRLES 141
+ D++R + ++E ++ L +
Sbjct: 757 NRDIQRLKNDIEEQETLLGT 776
Score = 31.9 bits (72), Expect = 1.8
Identities = 17/75 (22%), Positives = 35/75 (46%), Gaps = 4/75 (5%)
Query: 86 EIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQSRLESGDIL 145
E++ + L++ ++EL+ E +L+ + +R +DS ++ L +RLE D
Sbjct: 327 ELEKLNKERRLLNQEKTELLV---EQGRLQLQADRHQEHIRARDSLIQSLATRLEL-DGF 382
Query: 146 ERPASQISYLDHFLK 160
ER + +F
Sbjct: 383 ERGPFSERQIKNFHT 397
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 31.6 bits (72), Expect = 2.1
Identities = 14/73 (19%), Positives = 36/73 (49%), Gaps = 1/73 (1%)
Query: 69 QKLEKMELLNLQSSLNSEIQAKTQISEELSKT-RSELIAAQKEICDFRLKMESVSMDLRR 127
+ E E L + E++A+ + E + +L + ++ L++ S++ ++ R
Sbjct: 345 KLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIER 404
Query: 128 KDSQLKELQSRLE 140
+++L+ L+ R E
Sbjct: 405 LEARLERLEDRRE 417
Score = 30.8 bits (70), Expect = 3.2
Identities = 16/78 (20%), Positives = 41/78 (52%), Gaps = 5/78 (6%)
Query: 64 RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
+ ++L++ E L +E+Q E+L + R E+ ++EI + + ++ +++
Sbjct: 241 LEELQEELKEAEEE--LEELTAELQELE---EKLEELRLEVSELEEEIEELQKELYALAN 295
Query: 124 DLRRKDSQLKELQSRLES 141
++ R + Q + L+ RL +
Sbjct: 296 EISRLEQQKQILRERLAN 313
Score = 30.0 bits (68), Expect = 6.2
Identities = 15/78 (19%), Positives = 41/78 (52%), Gaps = 2/78 (2%)
Query: 65 DRRSQKLEKM--ELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVS 122
+ +++E++ ++ +L + + + ++ EL +E + ++ + R ++E +S
Sbjct: 841 EDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELS 900
Query: 123 MDLRRKDSQLKELQSRLE 140
+LR +S+ EL+ LE
Sbjct: 901 EELRELESKRSELRRELE 918
Score = 29.6 bits (67), Expect = 8.6
Identities = 22/73 (30%), Positives = 40/73 (54%), Gaps = 9/73 (12%)
Query: 71 LEKMELL--NLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK 128
+E++E L L+S L + + + + E L+ RSEL +E+ ++ES +LRR
Sbjct: 861 IEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL----RELESKRSELRR- 915
Query: 129 DSQLKELQSRLES 141
+L+EL+ +L
Sbjct: 916 --ELEELREKLAQ 926
Score = 29.3 bits (66), Expect = 9.9
Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 7/101 (6%)
Query: 67 RSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLR 126
+KLE+ EL LQ+ L + ++ EEL + L ++E+ E L
Sbjct: 426 LLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL-------EEAEQALD 478
Query: 127 RKDSQLKELQSRLESGDILERPASQISYLDHFLKESRHTGS 167
+ +L +LQ+RL+S + L+ S L +++ S
Sbjct: 479 AAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLS 519
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 30.5 bits (69), Expect = 2.6
Identities = 10/64 (15%), Positives = 30/64 (46%)
Query: 78 NLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKELQS 137
N S L+ + K I E+ +++ Q +I + + +++ ++++ ++ EL+
Sbjct: 35 NQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKE 94
Query: 138 RLES 141
+
Sbjct: 95 NIVE 98
>gnl|CDD|214584 smart00249, PHD, PHD zinc finger. The plant homeodomain (PHD)
finger is a C4HC3 zinc-finger-like motif found in
nuclear proteins thought to be involved in epigenetics
and chromatin-mediated transcriptional regulation. The
PHD finger binds two zinc ions using the so-called
'cross-brace' motif and is thus structurally related to
the RING finger and the FYVE finger. It is not yet known
if PHD fingers have a common molecular function. Several
reports suggest that it can function as a
protein-protein interacton domain and it was recently
demonstrated that the PHD finger of p300 can cooperate
with the adjacent BROMO domain in nucleosome binding in
vitro. Other reports suggesting that the PHD finger is a
ubiquitin ligase have been refuted as these domains were
RING fingers misidentified as PHD fingers.
Length = 47
Score = 27.6 bits (61), Expect = 2.7
Identities = 11/47 (23%), Positives = 15/47 (31%), Gaps = 14/47 (29%)
Query: 224 CNHC-----TSLMVGLTRQGVVCDICGFACHLSCCDKVPPSCPVPPD 265
C+ C ++ CD C H +C PP PD
Sbjct: 2 CSVCGKPDDGGELLQ-------CDGCDRWYHQTCLG--PPLLEEEPD 39
>gnl|CDD|206675 cd01888, eIF2_gamma, Gamma subunit of initiation factor 2 (eIF2
gamma). eIF2 is a heterotrimeric translation initiation
factor that consists of alpha, beta, and gamma subunits.
The GTP-bound gamma subunit also binds initiator
methionyl-tRNA and delivers it to the 40S ribosomal
subunit. Following hydrolysis of GTP to GDP, eIF2:GDP is
released from the ribosome. The gamma subunit has no
intrinsic GTPase activity, but is stimulated by the
GTPase activating protein (GAP) eIF5, and GDP/GTP
exchange is stimulated by the guanine nucleotide
exchange factor (GEF) eIF2B. eIF2B is a heteropentamer,
and the epsilon chain binds eIF2. Both eIF5 and
eIF2B-epsilon are known to bind strongly to eIF2-beta,
but have also been shown to bind directly to eIF2-gamma.
It is possible that eIF2-beta serves simply as a
high-affinity docking site for eIF5 and eIF2B-epsilon,
or that eIF2-beta serves a regulatory role. eIF2-gamma
is found only in eukaryotes and archaea. It is closely
related to SelB, the selenocysteine-specific elongation
factor from eubacteria. The translational factor
components of the ternary complex, IF2 in eubacteria and
eIF2 in eukaryotes are not the same protein (despite
their unfortunately similar names). Both factors are
GTPases; however, eubacterial IF-2 is a single
polypeptide, while eIF2 is heterotrimeric. eIF2-gamma is
a member of the same family as eubacterial IF2, but the
two proteins are only distantly related. This family
includes translation initiation, elongation, and release
factors.
Length = 197
Score = 30.3 bits (69), Expect = 2.8
Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 7/44 (15%)
Query: 617 GEFLLVFHSLAAYVDSQGHKSREKEIMYPALPTGASYMDGQLLI 660
GE LV H ++VD GH EI+ + +GA+ MDG LL+
Sbjct: 71 GETKLVRH--VSFVDCPGH-----EILMATMLSGAAVMDGALLL 107
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 30.8 bits (70), Expect = 2.9
Identities = 20/104 (19%), Positives = 37/104 (35%), Gaps = 6/104 (5%)
Query: 93 ISEELSKTRSELIAAQKEICDFRLK-MESVSMDLRRKDSQLKELQSRLESG--DILERPA 149
I E++ +L AA+ + F+ + + S++ E Q LE+ ++ E A
Sbjct: 166 IDEQIKTYEKKLEAAENRLKAFKQENGGILPDQEGDYYSEISEAQEELEAARLELNEAIA 225
Query: 150 SQISYLDHFLKESRHTGSVESEDGDIEDNRVPSIASSRSNLSEL 193
+ + E + S D R I + L L
Sbjct: 226 QRDALKRQLGGEEPVLLAGSSVANSELDGR---IEALEKQLDAL 266
>gnl|CDD|215437 PLN02816, PLN02816, mannosyltransferase.
Length = 546
Score = 30.8 bits (69), Expect = 2.9
Identities = 35/128 (27%), Positives = 52/128 (40%), Gaps = 22/128 (17%)
Query: 597 LVSLLTYSPVDALLAIELPRGEFL--LVFHSLAAYVDSQGHKSREKEIMYPALPTGASYM 654
LV L T++P ++ I + + L L+ LA Y GHK E + P LP
Sbjct: 299 LVMLFTFTPF-SIAGIIKSKNQKLSALILWVLAIY-SILGHK--EFRFVLPVLPIA---- 350
Query: 655 DGQLLIFSETHVDVFNAESGDWLQTVNIRRALPLDT--------RGSLCFSLANDIPYVV 706
LIFS +V ++ +P R S+ F LA +IP +
Sbjct: 351 ----LIFSGYAFAQMEVSGSSSSSSVTKKKQVPRQNHTKWSPKLRLSVYFLLATNIPMAL 406
Query: 707 YLANIHQR 714
Y++ HQR
Sbjct: 407 YMSLFHQR 414
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 30.4 bits (68), Expect = 4.0
Identities = 20/96 (20%), Positives = 36/96 (37%), Gaps = 9/96 (9%)
Query: 75 ELLNLQSSLNSEIQAKTQI-------SEELSKTRSELIAAQKEICDFRLKMESVSMDLRR 127
+L L++ L + K EL K R E A ++E+ R + +L R
Sbjct: 82 QLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELAR 141
Query: 128 KDSQLKELQSRLESGDILERPASQISYLDHFLKESR 163
Q ++LQ+RL + E+ + +
Sbjct: 142 LTKQAQDLQTRL--KTLAEQRRQLEAQAQSLQASQK 175
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 28.3 bits (64), Expect = 5.1
Identities = 16/81 (19%), Positives = 38/81 (46%), Gaps = 17/81 (20%)
Query: 60 DKNWRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKME 119
D+ R+ + +LE LQ+ N ELSK + +++ +++
Sbjct: 35 DEERRELQ-VELE-----ELQAERN-----------ELSKEIGKAKKKKEDAEALIAEVK 77
Query: 120 SVSMDLRRKDSQLKELQSRLE 140
+ +L+ +++L+EL++ L+
Sbjct: 78 ELKDELKALEAELRELEAELD 98
>gnl|CDD|218380 pfam05010, TACC, Transforming acidic coiled-coil-containing protein
(TACC). This family contains the proteins TACC 1, 2 and
3 the genes for which are found concentrated in the
centrosomes of eukaryotic and may play a conserved role
in organising centrosomal microtubules. The human TACC
proteins have been linked to cancer and TACC2 has been
identified as a possible tumour suppressor (AZU-1). The
functional homologue (Alp7) in Schizosaccharomyces pombe
has been shown to be required for organisation of
bipolar spindles.
Length = 207
Score = 29.3 bits (66), Expect = 5.6
Identities = 13/48 (27%), Positives = 28/48 (58%)
Query: 88 QAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQLKEL 135
A +I++ SK ++E A Q + ++K++S+ L +K+ + +EL
Sbjct: 147 IANEEIAQVRSKAKAETAALQASLRKEQMKVQSLEETLEQKNKENEEL 194
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 29.7 bits (67), Expect = 5.9
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 7/69 (10%)
Query: 72 EKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRKDSQ 131
E+ EL L S ++ Q+ EE KT ++L ++ +L+ +LR +S+
Sbjct: 183 EQAELTTLLSEQRAQQAKLAQLLEERKKTLAQL-NSELSADQKKLE------ELRANESR 235
Query: 132 LKELQSRLE 140
LK + E
Sbjct: 236 LKNEIASAE 244
>gnl|CDD|237653 PRK14276, PRK14276, chaperone protein DnaJ; Provisional.
Length = 380
Score = 29.7 bits (67), Expect = 6.3
Identities = 12/36 (33%), Positives = 16/36 (44%), Gaps = 9/36 (25%)
Query: 219 TSPTKCNHC---------TSLMVGLTRQGVVCDICG 245
TSP C C T +G+ R+ V CD+C
Sbjct: 161 TSPVTCGKCHGSGVITVDTQTPLGMMRRQVTCDVCH 196
>gnl|CDD|227419 COG5087, RTT109, Uncharacterized conserved protein [Function
unknown].
Length = 349
Score = 29.6 bits (66), Expect = 6.6
Identities = 17/67 (25%), Positives = 22/67 (32%), Gaps = 10/67 (14%)
Query: 90 KTQISEELSKTRSELIAAQKEICDFRLKMESVSM-DLRRKDSQLKELQSRLESGDILERP 148
I +S R I A+K ICDF L E L R + P
Sbjct: 284 DKYIFGSISDARRSAIEAKKNICDFELAREYFQGQGLVRPKKNEEANAV---------MP 334
Query: 149 ASQISYL 155
A+ + L
Sbjct: 335 ATNKNML 341
>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil. Members of
this family are predominantly found in the
actin-bundling protein Cortexillin I from Dictyostelium
discoideum. They adopt a structure consisting of an
18-heptad-repeat alpha-helical coiled-coil, and are a
prerequisite for the assembly of Cortexillin I.
Length = 107
Score = 28.0 bits (62), Expect = 6.9
Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 3/76 (3%)
Query: 65 DRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMD 124
++ + K EL N + L + ++++ EEL+K + +L + + +
Sbjct: 3 EKARLEASKNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASL---EAQKAEREQR 59
Query: 125 LRRKDSQLKELQSRLE 140
LR +++ EL LE
Sbjct: 60 LRELQAKIDELLKNLE 75
>gnl|CDD|224264 COG1345, FliD, Flagellar capping protein [Cell motility and
secretion].
Length = 483
Score = 29.7 bits (67), Expect = 6.9
Identities = 21/93 (22%), Positives = 42/93 (45%), Gaps = 14/93 (15%)
Query: 79 LQSSLNSEIQA----------KTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLRRK 128
L+++LN + + T I++ L + + L+A I + +S++ ++
Sbjct: 377 LKAALNEDPDSVREFFGGDGKDTGIAKTLGEGLNSLLAVTGGI--ITGRTDSLNKQIKSL 434
Query: 129 DSQLKELQSRLESGDILERPASQISYLDHFLKE 161
D +K L RLE ER +Q + LD + +
Sbjct: 435 DKDIKSLDKRLE--AAEERYKTQFNTLDDMMTQ 465
>gnl|CDD|235861 PRK06782, PRK06782, flagellar motor switch protein; Reviewed.
Length = 528
Score = 29.4 bits (66), Expect = 7.3
Identities = 18/86 (20%), Positives = 41/86 (47%), Gaps = 6/86 (6%)
Query: 68 SQKLEKMELLNLQSSLNSEI-QAKTQISEELSKTRSELIAAQKEICDFRLKMESVSMDLR 126
++L ++EL ++ +N + A T +SE + + ++ + + +M S++
Sbjct: 319 EKELTELELSAIKEIMNQMMGHAATAMSEMF-QEKIDISPPTVKFVTLKEEMLGESIE-- 375
Query: 127 RKDSQLKELQSRLESGDILERPASQI 152
+L ++ LE GD+L+ QI
Sbjct: 376 --VDELVQITFNLEVGDLLQTKMYQI 399
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 29.1 bits (66), Expect = 7.9
Identities = 18/39 (46%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 66 RRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSEL 104
R QKLEK E+ L+ L +E + ISEEL +T +EL
Sbjct: 197 RSVQKLEK-EVDRLEDELLAEKEKYKAISEELDQTLAEL 234
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 29.2 bits (66), Expect = 8.2
Identities = 13/77 (16%), Positives = 28/77 (36%), Gaps = 16/77 (20%)
Query: 64 RDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKEICDFRLKMESVSM 123
+ + K E L + S++ QI + + EL +++ R +++
Sbjct: 80 SENEALKAENERLQKREQSIDQ------QIQQAVQSETQELTKEIEQLKSERQQLQ---- 129
Query: 124 DLRRKDSQLKELQSRLE 140
+ +LQ RL
Sbjct: 130 ------GLIDQLQRRLA 140
>gnl|CDD|151039 pfam10473, Cenp-F_leu_zip, Leucine-rich repeats of kinetochore
protein Cenp-F/LEK1. Cenp-F, a centromeric kinetochore,
microtubule-binding protein consisting of two
1,600-amino acid-long coils, is essential for the full
functioning of the mitotic checkpoint pathway. There are
several leucine-rich repeats along the sequence of LEK1
that are considered to be zippers, though they do not
appear to be binding DNA directly in this instance.
Length = 140
Score = 28.2 bits (63), Expect = 8.9
Identities = 16/50 (32%), Positives = 29/50 (58%), Gaps = 3/50 (6%)
Query: 61 KNWRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIAAQKE 110
+ ++R S+ LEK+ + ++ L + Q K Q+ EE SKT E++ Q +
Sbjct: 90 QEKQERVSE-LEKL-NSSTENLLEEKEQEKIQMKEE-SKTAVEMLQTQLK 136
>gnl|CDD|216292 pfam01086, Clathrin_lg_ch, Clathrin light chain.
Length = 225
Score = 28.9 bits (65), Expect = 9.3
Identities = 21/77 (27%), Positives = 30/77 (38%), Gaps = 24/77 (31%)
Query: 61 KNWRDRRSQKLEKMELLNLQSSLNSEIQAKTQISEELSKTRSELIA-AQKEICDFRL--- 116
+ WR+RR ++E+ + E K + ELI AQKEI DF
Sbjct: 115 RKWRERRDLRIEERD-------------------EASEKKKEELIEKAQKEIDDFYENYN 155
Query: 117 -KMESVSMDLRRKDSQL 132
K E R++ Q
Sbjct: 156 EKKEKTKAQNRKEAEQF 172
>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein. This family
consists of the p50 and variable adherence-associated
antigen (Vaa) adhesins from Mycoplasma hominis. M.
hominis is a mycoplasma associated with human urogenital
diseases, pneumonia, and septic arthritis. An adhesin is
a cell surface molecule that mediates adhesion to other
cells or to the surrounding surface or substrate. The
Vaa antigen is a 50-kDa surface lipoprotein that has
four tandem repetitive DNA sequences encoding a periodic
peptide structure, and is highly immunogenic in the
human host. p50 is also a 50-kDa lipoprotein, having
three repeats A,B and C, that may be a tetramer of
191-kDa in its native environment.
Length = 353
Score = 28.9 bits (64), Expect = 9.6
Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 8/80 (10%)
Query: 65 DRRSQKLEKMELLNLQSSLNSEIQAKTQISEE---LSKTRSELIA-AQKEI---CDFRLK 117
D ++ ++ K EL + + S I+ +I+ E LSKT ++ IA +K+ DF+ +
Sbjct: 110 DDKNAQIAK-ELAERNAKIQSNIEELKKINNEAFELSKTVNKTIAEVEKKFKIPKDFKEQ 168
Query: 118 MESVSMDLRRKDSQLKELQS 137
+E+ + DL K Q+ E +
Sbjct: 169 LENFADDLLDKSRQIDEFTT 188
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.134 0.394
Gapped
Lambda K H
0.267 0.0722 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,214,205
Number of extensions: 3749831
Number of successful extensions: 3409
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3344
Number of HSP's successfully gapped: 121
Length of query: 760
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 656
Effective length of database: 6,324,786
Effective search space: 4149059616
Effective search space used: 4149059616
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.1 bits)