RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10293
(179 letters)
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 153 bits (389), Expect = 4e-48
Identities = 62/158 (39%), Positives = 87/158 (55%), Gaps = 6/158 (3%)
Query: 5 TLLIVFISVCTALLSEGLTWLLVYRT-EKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKK 63
++I+ I + LLSE L WLL+ R EKYQK + EI+K+ + ++ DK KK
Sbjct: 16 FIVILLIGILRGLLSELLQWLLIDRKMEKYQKREKEIQKRARELRKN-----GDKLSPKK 70
Query: 64 IEREEERLKNNNKDLTMVKMKSMFAIGFAFTALLSMFNTIFDGKIVAKLPFTPISWVQGL 123
E+ +E L + K++ M MK FAI + N+ F G +VAKLPF +G+
Sbjct: 71 FEKRQEELMEDQKEMMMDMMKPNFAIMIPQILIFGWINSFFSGFVVAKLPFPLTGRFKGM 130
Query: 124 SHRNLSGEDYTDCSFIFLYILCTMSIRQNIQKMLGFAP 161
R + G D TD S I Y LC+ +RQ IQK+LGFA
Sbjct: 131 LQRGVDGLDVTDVSSISWYFLCSFGLRQVIQKLLGFAN 168
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 35.8 bits (83), Expect = 0.005
Identities = 34/162 (20%), Positives = 69/162 (42%), Gaps = 12/162 (7%)
Query: 6 LLIVFISVCTALLSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIE 65
L+I+ +V T L L LL+ + EK ++L+ +++ ++ +EA ES D KK++
Sbjct: 48 LVILVAAVITGLYITILQKLLIDQ-EKMKELQKMMKEFQKEFREAQESG--DMKKLKKLQ 104
Query: 66 REEERLKNNNKDLTMVKMKSMFAIGFAFTAL--LSMFNTIFDGKIVAKLP--FTPISWVQ 121
++ + ++ ++L ++ K M I T + + G + P P +
Sbjct: 105 EKQMEMMDDQRELMKMQFKPMLYI-SVLTIPFFAWLRWFVGTGGYLVSEPNMALPTLFHI 163
Query: 122 GLSHRNLSGEDYTDCSFIFLYILCTMSIRQNIQKMLGFAPSR 163
+I Y LC+ + Q ++K+L A
Sbjct: 164 LYHTAVFGDFLG----WIGWYFLCSFVVSQILRKVLNIAVFT 201
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 34.4 bits (80), Expect = 0.024
Identities = 18/57 (31%), Positives = 32/57 (56%), Gaps = 8/57 (14%)
Query: 31 EKYQKLKTEIEKQMEKKKEAI---ESTLLDK--NVKKK---IEREEERLKNNNKDLT 79
E+ KL+ E EK++ +++ + E LL K N+ +K +E+ EE L+ K+L
Sbjct: 64 EEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELE 120
Score = 26.3 bits (59), Expect = 9.8
Identities = 13/82 (15%), Positives = 36/82 (43%), Gaps = 9/82 (10%)
Query: 4 DTLLIVFISVCTALLSEGLTWLLVYRT---------EKYQKLKTEIEKQMEKKKEAIEST 54
L+I+ I++ ++ + + + + E+ +++ E +K+ E K+
Sbjct: 2 MLLIIILIALVALVVGAVIGYFVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLE 61
Query: 55 LLDKNVKKKIEREEERLKNNNK 76
++ K + E E+E + N+
Sbjct: 62 AKEEIHKLRNEFEKELRERRNE 83
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 34.1 bits (79), Expect = 0.035
Identities = 11/36 (30%), Positives = 22/36 (61%)
Query: 38 TEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKN 73
+ + ++E+ +E IE L K K+K E+E ++L+
Sbjct: 246 DDDKDEVEELREKIEKLKLPKEAKEKAEKELKKLET 281
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 32.2 bits (73), Expect = 0.15
Identities = 12/59 (20%), Positives = 23/59 (38%), Gaps = 1/59 (1%)
Query: 18 LSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNK 76
+ E R +K ++LK IE+ + I+ L K E+ ++ L+
Sbjct: 159 IEEEAAGSREKR-KKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQL 216
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 31.7 bits (73), Expect = 0.16
Identities = 16/49 (32%), Positives = 27/49 (55%), Gaps = 7/49 (14%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLT 79
E+ +L+ +I Q+E K +E+ +K+ERE ERLK+ + L
Sbjct: 8 ERNSELEEQIR-QLELKLRDLEAE------NEKLERELERLKSELEKLK 49
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 31.6 bits (73), Expect = 0.22
Identities = 15/49 (30%), Positives = 25/49 (51%), Gaps = 10/49 (20%)
Query: 26 LVYRTEKYQK---------LKTEIEKQMEKKKEAIESTLLDKNVKKKIE 65
L+Y+TEK K K +IE +++ KEA++ + +K K E
Sbjct: 535 LIYQTEKTLKELGDKVPADEKEKIEAAIKELKEALKGEDKEA-IKAKTE 582
>gnl|CDD|129165 TIGR00055, uppS, undecaprenyl diphosphate synthase. This enzyme
builds undecaprenyl diphosphate, a molecule that in
bacteria is used a carrier in synthesizing cell wall
components. Alternate name: undecaprenyl pyrophosphate
synthetase. Activity has been demonstrated
experimentally for members of this family from
Micrococcus luteus, E. coli, Haemophilus influenzae, and
Streptococcus pneumoniae [Cell envelope, Biosynthesis
and degradation of surface polysaccharides and
lipopolysaccharides].
Length = 226
Score = 30.8 bits (70), Expect = 0.25
Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 19/75 (25%)
Query: 25 LLVYRTEKYQKLKTEIEKQMEKKKEAIES------------------TLLDKNVKKKIER 66
L + TE +++ K E++ ME ++ ++ +LL K +++KI++
Sbjct: 49 LYAFSTENWKRPKEEVDFLMELFEKKLDREVKELHRYNVRIRIIGDLSLLSKELQEKIKK 108
Query: 67 EEERLKNNNKDLTMV 81
EE K NN D T+
Sbjct: 109 AEEDTK-NNTDFTLN 122
>gnl|CDD|221833 pfam12889, DUF3829, Protein of unknown function (DUF3829). This is
a small family of proteins from several bacterial
species, whose function is not known. It may, however,
be related to the GvpL_GvpF family of proteins,
pfam06386.
Length = 277
Score = 30.9 bits (70), Expect = 0.33
Identities = 13/64 (20%), Positives = 28/64 (43%), Gaps = 5/64 (7%)
Query: 28 YRTEKYQKLK---TEIEKQMEKKKEAIE--STLLDKNVKKKIEREEERLKNNNKDLTMVK 82
Y+ + + K K ++ E+ K A E L+K +++ + E LK K +
Sbjct: 106 YKDDNFAKGKELHAKLLAAYEEFKAAYEAFLAALEKINDERLLEQLEELKKEGKKIAYYT 165
Query: 83 MKSM 86
++ +
Sbjct: 166 LELL 169
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 30.3 bits (69), Expect = 0.36
Identities = 13/49 (26%), Positives = 26/49 (53%), Gaps = 4/49 (8%)
Query: 30 TEKYQKLKTEIEKQMEKKKEAIES---TLLD-KNVKKKIEREEERLKNN 74
++ + L+ E+E+ E+ +E + L + +N +K+ ERE E K
Sbjct: 10 EDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKY 58
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 30.5 bits (69), Expect = 0.56
Identities = 12/50 (24%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 29 RTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDL 78
R ++ +K E+E+++ + +E ++S L + + ++E EE L++ + L
Sbjct: 594 RLKELKKKLKELEERLSQLEELLQSLELSEA-ENELEEAEEELESELEKL 642
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is
a small protein module with a triple-stranded
beta-sheet fold. This is a family of WW domain binding
proteins.
Length = 78
Score = 28.4 bits (64), Expect = 0.58
Identities = 10/38 (26%), Positives = 21/38 (55%)
Query: 39 EIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNK 76
++++++K K ES L K+K+++ E+ L K
Sbjct: 41 RLQRELDKLKAQEESGKLRPVEKQKLKKLEKDLDAVKK 78
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 29.7 bits (67), Expect = 0.72
Identities = 12/50 (24%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 34 QKLKTEIEKQMEKKKEAIESTLLD-KNVKKKIEREEERLKNNNKDLTMVK 82
QK K I+ ++E IE ++K+I++ + +K K++ +K
Sbjct: 44 QKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELK 93
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 29.5 bits (67), Expect = 0.79
Identities = 15/55 (27%), Positives = 25/55 (45%), Gaps = 8/55 (14%)
Query: 34 QKLKTEIE------KQMEKKKEAIESTLLDKNVKKKIEREEE--RLKNNNKDLTM 80
+KL+ E E ++E K EAIE ++ ++ +E LK N+ L
Sbjct: 130 KKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQLKS 184
Score = 26.4 bits (59), Expect = 7.8
Identities = 13/45 (28%), Positives = 24/45 (53%), Gaps = 5/45 (11%)
Query: 37 KTEIEKQMEKKKEAIESTLLDKN-VKKKIE----REEERLKNNNK 76
K+E+E++++K +E E ++ K+E REEE + K
Sbjct: 122 KSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEK 166
>gnl|CDD|218657 pfam05615, THOC7, Tho complex subunit 7. The Tho complex is
involved in transcription elongation and mRNA export
from the nucleus.
Length = 132
Score = 28.8 bits (65), Expect = 0.82
Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 6/44 (13%)
Query: 30 TEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKN 73
E Y++ K EIE +E K+ IE +KK++E + KN
Sbjct: 69 RENYEQEKEEIETTIENAKDEIE------ELKKELEEAKRIRKN 106
>gnl|CDD|219180 pfam06798, PrkA, PrkA serine protein kinase C-terminal domain.
This is a family of PrkA bacterial and archaeal serine
kinases approximately 630 residues long. This family
corresponds to the C-terminal domain.
Length = 254
Score = 29.5 bits (67), Expect = 0.85
Identities = 12/49 (24%), Positives = 25/49 (51%), Gaps = 6/49 (12%)
Query: 33 YQKLKTEIEKQM-EKKKEAI-----ESTLLDKNVKKKIEREEERLKNNN 75
Y++L+ IEK++ K+ + S + D+ +KKI+ +R+
Sbjct: 193 YERLREAIEKKLFSDVKDLLKLISLVSKVTDEEQQKKIDEVVDRMIKKY 241
>gnl|CDD|109608 pfam00558, Vpu, Vpu protein. The Vpu protein contains an
N-terminal transmembrane spanning region and a
C-terminal cytoplasmic region. The HIV-1 Vpu protein
stimulates virus production by enhancing the release of
viral particles from infected cells. The VPU protein
binds specifically to CD4.
Length = 81
Score = 28.1 bits (63), Expect = 0.94
Identities = 12/44 (27%), Positives = 25/44 (56%)
Query: 6 LLIVFISVCTALLSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKE 49
+I I++ AL+ + W +VYR + K + EI + +++ +E
Sbjct: 5 EIIGLIALIVALIINIVVWTIVYREYRKIKKQREILRLIKRIRE 48
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 29.3 bits (65), Expect = 1.2
Identities = 17/45 (37%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 29 RTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREE-ERLK 72
R +K + IEK+ +K A+E +L+K +++EREE ERL+
Sbjct: 429 RVDKDHAERARIEKENAHRK-ALEMKILEKKRIERLEREERERLE 472
>gnl|CDD|153307 cd07623, BAR_SNX1_2, The Bin/Amphiphysin/Rvs (BAR) domain of
Sorting Nexins 1 and 2. BAR domains are dimerization,
lipid binding and curvature sensing modules found in
many different proteins with diverse functions. Sorting
nexins (SNXs) are Phox homology (PX) domain containing
proteins that are involved in regulating membrane
traffic and protein sorting in the endosomal system.
SNXs differ from each other in their lipid-binding
specificity, subcellular localization and specific
function in the endocytic pathway. A subset of SNXs also
contain BAR domains. The PX-BAR structural unit
determines the specific membrane targeting of SNXs. This
subfamily consists of SNX1, SNX2, and similar proteins.
SNX1 and SNX2 are components of the retromer complex, a
membrane coat multimeric complex required for endosomal
retrieval of lysosomal hydrolase receptors to the Golgi.
The retromer consists of a cargo-recognition subcomplex
and a subcomplex formed by a dimer of sorting nexins
(SNX1 and/or SNX2), which ensures efficient cargo
sorting by facilitating proper membrane localization of
the cargo-recognition subcomplex. BAR domains form
dimers that bind to membranes, induce membrane bending
and curvature, and may also be involved in
protein-protein interactions.
Length = 224
Score = 28.8 bits (65), Expect = 1.3
Identities = 17/61 (27%), Positives = 36/61 (59%), Gaps = 5/61 (8%)
Query: 29 RTEKYQKLKTEI---EKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMKS 85
RT+K + + EI E ++++ ++ E + K +KK+IER E+ + KD+ + ++S
Sbjct: 147 RTDKLDQAQQEIKEWEAKVDRGQKEFEE--ISKTIKKEIERFEKNRVKDFKDIIIKYLES 204
Query: 86 M 86
+
Sbjct: 205 L 205
>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
recombination, and repair].
Length = 440
Score = 29.1 bits (66), Expect = 1.3
Identities = 16/77 (20%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 18 LSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIES--TLLDKNVKKKIEREEERLKNNN 75
L L L + ++ ++L + Q++++++ ++ LDK ++++++R+ ERL+
Sbjct: 322 LRRALENQLALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRKRERLEALV 381
Query: 76 KDLTMVKMKSMFAIGFA 92
+ L + + A G+A
Sbjct: 382 EQLESLSPLATLARGYA 398
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 29.3 bits (66), Expect = 1.4
Identities = 16/59 (27%), Positives = 32/59 (54%), Gaps = 7/59 (11%)
Query: 31 EKYQKLKTEIE------KQMEKKKEAIESTLLD-KNVKKKIEREEERLKNNNKDLTMVK 82
E+Y +L E+ +++EK++E I+ TL K ++ E+ ++ L+ K L V+
Sbjct: 666 EEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVE 724
Score = 28.9 bits (65), Expect = 1.6
Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 6/59 (10%)
Query: 29 RTEKYQKLKTEIEKQM---EKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMK 84
R + ++ IE+++ E+K+E +E K K++E+ E L+ ++ K K
Sbjct: 315 RLSRLEEEINGIEERIKELEEKEERLEEL---KKKLKELEKRLEELEERHELYEEAKAK 370
Score = 28.1 bits (63), Expect = 2.9
Identities = 9/41 (21%), Positives = 27/41 (65%), Gaps = 3/41 (7%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERL 71
+ ++ EI++++E+ ++ I+ T +N+++ I+ +E+ L
Sbjct: 165 KNLGEVIKEIKRRIERLEKFIKRT---ENIEELIKEKEKEL 202
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 29.2 bits (66), Expect = 1.4
Identities = 20/121 (16%), Positives = 42/121 (34%), Gaps = 35/121 (28%)
Query: 31 EKYQKLKTEIEKQMEKK--------KEAIESTLLD--------------KNVKKKIEREE 68
++++ K E+ K+ E+K EA E L + K +K+K+E E
Sbjct: 226 LEFEREKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEER 285
Query: 69 E-RLKNNNKDLTMVKMKSMFAIGFA------------FTALLSMFNTIFDGKIVAKLPFT 115
RL + + +K + + A+ ++ + + G + P
Sbjct: 286 NGRLAKLAELNSRLKGLEKALDSRSEAEDENHKVQQLWLAVEALKSALKSGSAGSPRPLV 345
Query: 116 P 116
Sbjct: 346 K 346
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 28.8 bits (65), Expect = 1.6
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 37 KTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKN 73
+ + + ++EK KE +E L + VKK IE+E +L
Sbjct: 242 EKDDKDELEKLKEKLEELKLPEEVKKVIEKELTKLSL 278
>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
[Transcription].
Length = 217
Score = 28.6 bits (64), Expect = 1.6
Identities = 16/54 (29%), Positives = 20/54 (37%)
Query: 18 LSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERL 71
LS T L E ++K EIE + E I L KN +E R
Sbjct: 137 LSRAQTELQKQLNELMDRIKEEIEDKDGDMTERIVLEYLLKNGAADVEETSRRT 190
>gnl|CDD|238265 cd00475, CIS_IPPS, Cis (Z)-Isoprenyl Diphosphate Synthases
(cis-IPPS); homodimers which catalyze the successive
1'-4 condensation of the isopentenyl diphosphate (IPP)
molecule to trans,trans-farnesyl diphosphate (FPP) or to
cis,trans-FPP to form long-chain polyprenyl
diphosphates. A few can also catalyze the condensation
of IPP to trans-geranyl diphosphate to form the
short-chain cis,trans- FPP. In prokaryotes, the
cis-IPPS, undecaprenyl diphosphate synthase (UPP
synthase) catalyzes the formation of the carrier lipid
UPP in bacterial cell wall peptidooglycan biosynthesis.
Similarly, in eukaryotes, the cis-IPPS, dehydrodolichyl
diphosphate (dedol-PP) synthase catalyzes the formation
of the polyisoprenoid glycosyl carrier lipid dolichyl
monophosphate. cis-IPPS are mechanistically and
structurally distinct from trans-IPPS, lacking the DDXXD
motifs, yet requiring Mg2+ for activity.
Length = 221
Score = 28.4 bits (64), Expect = 1.6
Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 20/76 (26%)
Query: 21 GLTWLLVY--RTEKYQKLKTEIEKQMEKKKEAIES------------------TLLDKNV 60
G+ + +Y TE +++ K E++ ME ++ + +LL +++
Sbjct: 44 GVKEVTLYAFSTENWKRPKEEVDFLMELFRDVLRRILKELEKLGVRIRIIGDLSLLPESL 103
Query: 61 KKKIEREEERLKNNNK 76
+K+I++ EE KNN
Sbjct: 104 QKEIKKAEEATKNNTG 119
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 28.9 bits (65), Expect = 1.7
Identities = 15/54 (27%), Positives = 32/54 (59%), Gaps = 5/54 (9%)
Query: 34 QKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNK---DLTMVKMK 84
++++EIE+++ +K+E E+T KN ++ IE + L+ K + + +K K
Sbjct: 499 SQIRSEIERRLAEKEEEFENT--RKNHQRAIESLQATLEAEAKGKAEASRLKKK 550
>gnl|CDD|180627 PRK06569, PRK06569, F0F1 ATP synthase subunit B'; Validated.
Length = 155
Score = 27.9 bits (62), Expect = 1.9
Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 3/67 (4%)
Query: 16 ALLSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNN 75
L+ + L Y E+ K TEI++ ++K +++ES L K KK +E++ + N N
Sbjct: 54 DTLTIEVEKLNKYYNEEIDKTNTEIDRLKKEKIDSLESEFLIK--KKNLEQDLKNSINQN 111
Query: 76 -KDLTMV 81
+D+ +
Sbjct: 112 IEDINLA 118
>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
(trigger factor) [Posttranslational modification,
protein turnover, chaperones].
Length = 441
Score = 28.4 bits (64), Expect = 2.0
Identities = 4/36 (11%), Positives = 16/36 (44%)
Query: 34 QKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEE 69
++LK ++ K +E++ + + + + +
Sbjct: 263 EELKEKLRKNLERELKEATLEKRKEQLLDALVEAND 298
>gnl|CDD|219003 pfam06381, DUF1073, Protein of unknown function (DUF1073). This
family consists of several hypothetical bacterial
proteins. The function of this family is unknown.
Length = 361
Score = 28.4 bits (64), Expect = 2.0
Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 7/69 (10%)
Query: 48 KEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMKSMFAIGFAFTALLSMFNTIFDGK 107
+ IE T D + +KIE EEERL L + ++ DG+
Sbjct: 30 RAGIELTSDDDDDIEKIEAEEERLNVKEALLDAIIWARLYGGAAIV------IKVKSDGQ 83
Query: 108 IVAKLPFTP 116
+++ P P
Sbjct: 84 DLSQ-PLNP 91
>gnl|CDD|223099 COG0020, UppS, Undecaprenyl pyrophosphate synthase [Lipid
metabolism].
Length = 245
Score = 28.0 bits (63), Expect = 2.3
Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 9/61 (14%)
Query: 34 QKLKTEIEKQMEK--KKEAI-ESTLLDKNVKKKIEREEERLKNNNKDLTMVKMKSMFAIG 90
+ L+ E++K + + I + + L + V++ IE+ EE+ KNN + A+
Sbjct: 90 KALREELKKLHKNGVRIRIIGDLSRLPEEVREAIEKAEEKTKNNT------GLTLNIAVN 143
Query: 91 F 91
+
Sbjct: 144 Y 144
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.1 bits (63), Expect = 2.4
Identities = 14/45 (31%), Positives = 23/45 (51%)
Query: 35 KLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLT 79
K + + EK + KE + ++ KK+IER EER++ T
Sbjct: 316 KFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQAT 360
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at
their amino terminus. This family represents the rest
of these proteins.
Length = 244
Score = 27.8 bits (62), Expect = 2.6
Identities = 10/43 (23%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLD-KNVKKKIEREEERLK 72
E+ ++ + E+E++ME+ +E + + + ++ EE+LK
Sbjct: 1 EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLK 43
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 28.1 bits (63), Expect = 2.8
Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 1/50 (2%)
Query: 29 RTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDL 78
EK ++L+ EIEK+ E++ E ++ LDK KK ++ + L K L
Sbjct: 107 LEEKIEQLEAEIEKK-EEELEKAKNKFLDKAWKKLAKKYDSNLSEALKGL 155
>gnl|CDD|216295 pfam01093, Clusterin, Clusterin.
Length = 434
Score = 28.1 bits (63), Expect = 2.8
Identities = 13/41 (31%), Positives = 26/41 (63%), Gaps = 3/41 (7%)
Query: 29 RTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEE 69
E+++ L + +EK ++K+EA++ V++K+E EEE
Sbjct: 35 TEEEHKNLMSTLEKTKKEKEEALKLAN---EVEEKLEEEEE 72
>gnl|CDD|232831 TIGR00115, tig, trigger factor. Trigger factor is a
ribosome-associated molecular chaperone and is the first
chaperone to interact with nascent polypeptide. Trigger
factor can bind at the same time as the signal
recognition particle (SRP), but is excluded by the SRP
receptor (FtsY). The central domain of trigger factor
has peptidyl-prolyl cis/trans isomerase activity. This
protein is found in a single copy in virtually every
bacterial genome [Protein fate, Protein folding and
stabilization].
Length = 408
Score = 27.9 bits (63), Expect = 2.9
Identities = 15/60 (25%), Positives = 26/60 (43%), Gaps = 13/60 (21%)
Query: 31 EKYQKLKTEIEKQME---------KKKEAIESTLLDKNV----KKKIEREEERLKNNNKD 77
E ++LK +I K +E K KE + L++ N + +E+E +RL
Sbjct: 250 ETLEELKADIRKNLEREKKERAKNKLKEQLLDKLVENNEFELPESLVEQEIDRLLEQALQ 309
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 28.0 bits (62), Expect = 3.0
Identities = 13/44 (29%), Positives = 24/44 (54%)
Query: 35 KLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDL 78
+LK E + + + EA E+ K +K K++ E +L+N +D
Sbjct: 115 RLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQ 158
>gnl|CDD|224520 COG1604, COG1604, CRISPR system related protein, RAMP superfamily
[Defense mechanisms].
Length = 257
Score = 27.8 bits (62), Expect = 3.2
Identities = 15/70 (21%), Positives = 27/70 (38%), Gaps = 2/70 (2%)
Query: 18 LSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKD 77
+SE L + K E+++ K + L K +K+ E+ +KN
Sbjct: 17 ISEYNPLLK-EQIINL-DKKNAFEEEIIKDFVNEIQSYLSKVLKRLGRSYEQYVKNTKIT 74
Query: 78 LTMVKMKSMF 87
L K+K+
Sbjct: 75 LNSFKLKTKS 84
>gnl|CDD|224076 COG1154, Dxs, Deoxyxylulose-5-phosphate synthase [Coenzyme
metabolism / Lipid metabolism].
Length = 627
Score = 27.9 bits (63), Expect = 3.3
Identities = 16/88 (18%), Positives = 29/88 (32%), Gaps = 25/88 (28%)
Query: 30 TEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMKSMF-A 88
YQ L+ +K + K V ++R +R + + K L + +F
Sbjct: 197 GPFYQSLREGGKKVLSK-------------VGPPLKRFAKRAEESIKGLLVPGT--LFEE 241
Query: 89 IGFAFT---------ALLSMFNTIFDGK 107
+GF + L+ D K
Sbjct: 242 LGFNYIGPIDGHNLEELIPTLKNAKDLK 269
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 27.3 bits (61), Expect = 3.4
Identities = 14/49 (28%), Positives = 27/49 (55%), Gaps = 1/49 (2%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLT 79
E+ QK K + +K +KK + + K+ K + + E+E ++ +DLT
Sbjct: 79 EEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA-EDKLEDLT 126
>gnl|CDD|227333 COG5000, NtrY, Signal transduction histidine kinase involved in
nitrogen fixation and metabolism regulation [Signal
transduction mechanisms].
Length = 712
Score = 27.8 bits (62), Expect = 3.4
Identities = 11/61 (18%), Positives = 22/61 (36%), Gaps = 5/61 (8%)
Query: 55 LLDKNVKKKIER-----EEERLKNNNKDLTMVKMKSMFAIGFAFTALLSMFNTIFDGKIV 109
+ + V K+ E + + ++ FA+ + TALL + I+
Sbjct: 242 YVVRPVDPKVAEHADLTEGAAAEYRELEAGRDGLQIAFALLYLSTALLVLLAAIWTAIAF 301
Query: 110 A 110
A
Sbjct: 302 A 302
>gnl|CDD|216227 pfam00982, Glyco_transf_20, Glycosyltransferase family 20. Members
of this family belong to glycosyl transferase family 20.
OtsA (Trehalose-6-phosphate synthase) is homologous to
regions in the subunits of yeast trehalose-6-phosphate
synthase/phosphate complex.
Length = 470
Score = 27.6 bits (62), Expect = 3.6
Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 11/68 (16%)
Query: 49 EAIESTLLDKNVKKKIEREEERLKNNNK------DLTMVK-MKSMFAIGFAFTALLSMFN 101
IES L +V++K++ +ER N K L +K + AF L
Sbjct: 246 GRIESGLKSPSVQEKVKELKERFGNKKKLILGVDRLDYIKGIPQKL---LAFERFLEE-Y 301
Query: 102 TIFDGKIV 109
+ GK+V
Sbjct: 302 PEWRGKVV 309
>gnl|CDD|234956 PRK01490, tig, trigger factor; Provisional.
Length = 435
Score = 27.8 bits (63), Expect = 3.6
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 13/53 (24%)
Query: 35 KLKTEIEKQME---------KKKEAIESTLLDKNV----KKKIEREEERLKNN 74
+LK +I K +E K KEA+ L++ + +E+E +RL
Sbjct: 264 ELKADIRKNLEREKKEAQRAKVKEAVLDALVENAEIDLPEALVEQEIDRLLRQ 316
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 27.3 bits (61), Expect = 3.9
Identities = 9/53 (16%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Query: 30 TEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVK 82
E+ Q LK + +E + +++ S L + ++++I+ ++ ++ + L ++
Sbjct: 85 REEVQLLKQDC-STLEIELKSLTSDLTTEELQEEIQELKKEVREIEEKLESLE 136
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 27.1 bits (60), Expect = 3.9
Identities = 15/48 (31%), Positives = 31/48 (64%), Gaps = 1/48 (2%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDL 78
E+ ++L+ E+E+ E +K+ +E L +K KK++E +E L+ ++L
Sbjct: 157 ERLEELEKELEEL-EDEKDLLEKLLEEKEKKKELEELKEELEELLEEL 203
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45;
Provisional.
Length = 193
Score = 27.4 bits (60), Expect = 4.0
Identities = 15/36 (41%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 34 QKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEE 69
+KLK E+E+ E+K E I L D+ ++ E+EEE
Sbjct: 25 EKLKKEVEEIPEQKPEDIVEELEDQP-EEPPEQEEE 59
>gnl|CDD|188648 cd09816, prostaglandin_endoperoxide_synthase, Animal prostaglandin
endoperoxide synthase and related bacterial proteins.
Animal prostaglandin endoperoxide synthases, including
prostaglandin H2 synthase and a set of similar bacterial
proteins which may function as cyclooxygenases.
Prostaglandin H2 synthase catalyzes the synthesis of
prostaglandin H2 from arachidonic acid. In two reaction
steps, arachidonic acid is converted to Prostaglandin
G2, a peroxide (cyclooxygenase activity) and
subsequently converted to the end product via the
enzyme's peroxidase activity. Prostaglandin H2 synthase
is the target of aspirin and other non-steroid
anti-inflammatory drugs such as ibuprofen, which block
the substrate's access to the active site and may
acetylate a conserved serine residue. In humans and
other mammals, prostaglandin H2 synthase (PGHS), also
called cyclooxygenase (COX) is present as at least two
isozymes, PGHS-1 (or COX-1) and PGHS-2 (or COX-2),
respectively. PGHS-1 is expressed constitutively in most
mammalian cells, while the expression of PGHS-2 is
induced via inflammation response in endothelial cells,
activated macrophages, and others. COX-3 is a splice
variant of COX-1.
Length = 490
Score = 27.6 bits (62), Expect = 4.4
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 3/34 (8%)
Query: 74 NNKDLTMVKMKSMFAIG---FAFTALLSMFNTIF 104
+LT + +FA+G F T L M NTI+
Sbjct: 186 LGDELTPEREAKLFAVGHERFNLTPGLFMLNTIW 219
>gnl|CDD|220897 pfam10883, DUF2681, Protein of unknown function (DUF2681). This
family of proteins is found in bacteria. Proteins in
this family are typically between 81 and 117 amino
acids in length.
Length = 87
Score = 26.2 bits (58), Expect = 4.4
Identities = 18/75 (24%), Positives = 36/75 (48%), Gaps = 10/75 (13%)
Query: 1 MWADTLLIVFISVCTALLSEGLTWLLVYRTEKYQ----KLKTEIEKQMEKKKEAIESTLL 56
M +++ ++V +L+ L ++ +K Q KL+ E E Q+ +K E+ +
Sbjct: 1 MMNLYIVLGVLAVFACILA-----YLYWKLKKAQRENRKLQAENE-QLATEKAVAETEVK 54
Query: 57 DKNVKKKIEREEERL 71
+ V++K E RL
Sbjct: 55 NAKVRQKNEENTRRL 69
>gnl|CDD|162595 TIGR01908, cas_CXXC_CXXC, CRISPR-associated protein Cas8a1/Cst1,
subtype I-B/TNEAP. CRISPR is a term for Clustered,
Regularly Interspaced Short Palidromic Repeats. A number
of protein families appear only in association with
these repeats and are designated Cas (CRISPR-Associated)
proteins. This (revised) model describes a conserved
region from an otherwise highly divergent protein found
in the Tneap subtype of CRISPR/Cas regions. This
Cys-rich region features two motifs of CXXC [Mobile and
extrachromosomal element functions, Other].
Length = 309
Score = 27.5 bits (61), Expect = 4.7
Identities = 15/56 (26%), Positives = 21/56 (37%)
Query: 15 TALLSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEER 70
LL LT + + + Q +K + E KEA+ L K RE E
Sbjct: 224 WNLLPYYLTEVNLLPKKALQFIKEVGDSISETIKEALWEIYLQGKKLAKRLRELEA 279
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 27.7 bits (61), Expect = 4.8
Identities = 12/63 (19%), Positives = 27/63 (42%), Gaps = 9/63 (14%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLDK--------NVKKKIEREEERLKNNNKDLTMVK 82
++ K I + + KE S+ DK + + +ER +E ++ ++K M+
Sbjct: 601 ASLEQNKNHINNE-LESKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLA 659
Query: 83 MKS 85
+
Sbjct: 660 GAT 662
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 27.4 bits (61), Expect = 5.2
Identities = 15/59 (25%), Positives = 31/59 (52%), Gaps = 13/59 (22%)
Query: 31 EKYQKLKTEIE-----------KQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDL 78
E+YQ+LK E+ K++ K+ E +E L +++++E +E L+ K++
Sbjct: 213 ERYQELKAELRELELALLLAKLKELRKELEELEEEL--SRLEEELEELQEELEEAEKEI 269
>gnl|CDD|226109 COG3581, COG3581, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 420
Score = 27.0 bits (60), Expect = 5.4
Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 7/82 (8%)
Query: 9 VFISVCTALLSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIESTL-------LDKNVK 61
I + A+L L + Y+ + Y+K+ K +EK + I TL L KN++
Sbjct: 127 FLIRLMLAILYGDLLMRVYYKVKPYEKVPGSANKLLEKWLDKIRKTLRNKSFRELKKNLR 186
Query: 62 KKIEREEERLKNNNKDLTMVKM 83
+ + + ++ V +
Sbjct: 187 EIVREFDNLEILTDEKKPKVGL 208
>gnl|CDD|177400 PHA02571, a-gt.4, hypothetical protein; Provisional.
Length = 109
Score = 26.2 bits (58), Expect = 5.4
Identities = 20/76 (26%), Positives = 38/76 (50%), Gaps = 12/76 (15%)
Query: 34 QKLKTEIEKQMEKKKEAIESTLLDKNVKKKIER----EEERLKNNNKD---LTMVKMKSM 86
+L+ E + EKK I L KN +++I+R EE L +NNK+ + K++ +
Sbjct: 21 SELQARNEAEAEKKAAKI----LKKN-RREIKRLKKHAEEALFDNNKEQYVYAIKKLRDI 75
Query: 87 FAIGFAFTALLSMFNT 102
+ + + +M+ T
Sbjct: 76 YKQPYTDELIETMWET 91
>gnl|CDD|215591 PLN03124, PLN03124, poly [ADP-ribose] polymerase; Provisional.
Length = 643
Score = 27.1 bits (60), Expect = 5.9
Identities = 35/111 (31%), Positives = 52/111 (46%), Gaps = 14/111 (12%)
Query: 32 KYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMKSMFAIGF 91
KY L+ + E + E KK+ + DKN + K++ + + D++M+K + M IG+
Sbjct: 262 KYTWLEMDYEDEEESKKDKPSVSSEDKNKQSKLDPRVAQFISLICDVSMMK-QQMMEIGY 320
Query: 92 AFTAL----LSMFNTIFDGKIVAKLPFTPISWVQGLSHR----NLSGEDYT 134
L LS +TI G V K I+ V S R LSGE YT
Sbjct: 321 NARKLPLGKLSK-STILKGYEVLK----RIAEVISRSDRETLEELSGEFYT 366
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 26.3 bits (59), Expect = 6.4
Identities = 13/47 (27%), Positives = 25/47 (53%)
Query: 38 TEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMK 84
T++ ++ + KE++E+ LLDK K E+ E++ + K K
Sbjct: 45 TDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAK 91
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 26.8 bits (60), Expect = 6.6
Identities = 10/47 (21%), Positives = 21/47 (44%), Gaps = 4/47 (8%)
Query: 29 RTEKYQKLKTEIEKQME----KKKEAIESTLLDKNVKKKIEREEERL 71
R K + + E+E+Q+E ++ E L + ++ E E +
Sbjct: 282 RRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEEAERQAR 328
Score = 26.8 bits (60), Expect = 6.8
Identities = 9/40 (22%), Positives = 19/40 (47%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEER 70
E Q+ + + + + +E + +K ++ EREEE
Sbjct: 273 ELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEEL 312
Score = 26.8 bits (60), Expect = 7.7
Identities = 10/52 (19%), Positives = 28/52 (53%), Gaps = 8/52 (15%)
Query: 29 RTEKYQKLKTEIEKQME----KKKEAIESTLLDK----NVKKKIEREEERLK 72
R E+ ++ + +++Q+E +++E E L ++ + ++I+ E+E
Sbjct: 65 RKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEA 116
>gnl|CDD|218372 pfam05000, RNA_pol_Rpb1_4, RNA polymerase Rpb1, domain 4. RNA
polymerases catalyze the DNA dependent polymerisation
of RNA. Prokaryotes contain a single RNA polymerase
compared to three in eukaryotes (not including
mitochondrial. and chloroplast polymerases). This
domain, domain 4, represents the funnel domain. The
funnel contain the binding site for some elongation
factors.
Length = 108
Score = 25.8 bits (57), Expect = 6.7
Identities = 13/52 (25%), Positives = 22/52 (42%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVK 82
E+Y KL+ +E+ EA+ + +L+K + L NN M
Sbjct: 5 ERYGKLEDIWGMTLEETFEALINNILNKARDPAGNIASKSLDPNNSIYMMAD 56
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 26.9 bits (60), Expect = 6.7
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 36 LKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEER 70
L+ E E++ K++ L + KKK E E+
Sbjct: 553 LQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLE 587
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 27.1 bits (61), Expect = 6.7
Identities = 11/37 (29%), Positives = 23/37 (62%), Gaps = 2/37 (5%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTL--LDKNVKKKIE 65
++ +KLK E+E++ EK +E + L +K ++ I+
Sbjct: 544 KEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIK 580
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 26.9 bits (60), Expect = 6.9
Identities = 15/53 (28%), Positives = 27/53 (50%), Gaps = 10/53 (18%)
Query: 26 LVYRTEKYQK---------LKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEE 69
L Y+ EK K K +IEK + + KEA++ +++ +K K E ++
Sbjct: 533 LAYQAEKTLKEAGDKLPAEEKEKIEKAVAELKEALKGEDVEE-IKAKTEELQQ 584
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 26.5 bits (59), Expect = 7.1
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 35 KLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNN 74
KLK E EKQ EK++++ E L KKIE+E E+ +
Sbjct: 153 KLK-EAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
>gnl|CDD|206332 pfam14163, SieB, Superinfection exclusion protein B. This family
includes superinfection exclusion proteins. These
proteins prevent the growth of superinfecting phage
which are insensitive to repression. It aborts lytic
development of superinfecting phage.
Length = 151
Score = 26.4 bits (59), Expect = 7.1
Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 1 MWADTLLIVFISVCTALLSEGLTWLLVYRTEKYQKLKTEIEKQMEK 46
+ + ++F+ L++ L+ +L EKYQK K E E+ +K
Sbjct: 31 KYRPWIGLIFLISVAYLITLLLSKILQEAKEKYQK-KREQERIEKK 75
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents
the seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is
recognized by the pfam model tRNA-synt_2b. The
seryl-tRNA synthetases of two archaeal species,
Methanococcus jannaschii and Methanobacterium
thermoautotrophicum, differ considerably and are
included in a different model [Protein synthesis, tRNA
aminoacylation].
Length = 418
Score = 26.6 bits (59), Expect = 7.7
Identities = 13/58 (22%), Positives = 26/58 (44%), Gaps = 5/58 (8%)
Query: 26 LVYRTEKYQKLKTEIEKQMEKKKEAIESTLLD-----KNVKKKIEREEERLKNNNKDL 78
L+ E+ Q + E+ KQ+ K K + + + K +K+++ LK +L
Sbjct: 42 LLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAEL 99
>gnl|CDD|212017 cd11627, HR1_Ste20-like, Protein kinase C-related kinase homology
region 1 (HR1) Rho-binding domain of
Schizosaccharomyces pombe Ste20-like proteins. This
group is composed of predominantly uncharacterized
fungal proteins, which contain two known domains: HR1
at the N-terminal region and REM (Ras exchanger motif)
at the C-terminal region. One member protein from
Schizosaccharomyces pombe is named Ste16 while its gene
is called ste20 (a target of rapamycin complex 2
subunit). It is a subunit in the protein kinase TOR
complexes in fission yeast. The REM domain is usually
found in nucleotide exchange factors for Ras-like small
GTPases. HR1 domains are anti-parallel coiled-coil
(ACC) domains that bind small GTPases from the Rho
family.
Length = 71
Score = 25.0 bits (55), Expect = 7.8
Identities = 9/44 (20%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 29 RTEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLK 72
++ ++L+ ++E + K K+ E+ L + K E++++R +
Sbjct: 3 SEQRLEELRGKLEIET-KIKDGAENLLQVLDSKNAKEKKDQRAR 45
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A
peptide from the central, syntaxin binding domain of
synaphin competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal
exocytosis by promoting interaction between the
complementary syntaxin and synaptobrevin transmembrane
regions that reside in opposing membranes prior to
fusion.
Length = 139
Score = 26.0 bits (57), Expect = 7.8
Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 9/42 (21%)
Query: 29 RTEKYQKLKTEIEKQMEKKKEAIESTLLDK-NVKKKIEREEE 69
R K++K ME+++E + + DK +KKK E EEE
Sbjct: 50 RKAKHRK--------MEEEREVMRQGIRDKYGIKKKEEDEEE 83
>gnl|CDD|221686 pfam12644, DUF3782, Protein of unknown function (DUF3782). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria and
archaea. Proteins in this family are typically between
91 and 186 amino acids in length.
Length = 61
Score = 24.8 bits (55), Expect = 7.9
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 5/41 (12%)
Query: 31 EKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERL 71
E+Y+KL+ EIEK EK +E LL K KK ++ + L
Sbjct: 1 EEYKKLQKEIEKLEEKLRE-----LLSKEQKKLLDEYIDAL 36
>gnl|CDD|220615 pfam10175, MPP6, M-phase phosphoprotein 6. This is a family of
M-phase phosphoprotein 6s which is necessary for
generation of the 3' end of the 5.8S rRNA precursor. It
preferentially binds to poly(C) and poly(U).
Length = 104
Score = 25.6 bits (56), Expect = 8.1
Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 8/48 (16%)
Query: 42 KQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMKSMFAI 89
K M++ K+ E KK + EE L + DL M+K F I
Sbjct: 9 KFMQRTKDKEE--------KKADDEEERALISEEIDLKMLKETENFII 48
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 27.0 bits (60), Expect = 8.2
Identities = 16/59 (27%), Positives = 27/59 (45%), Gaps = 13/59 (22%)
Query: 31 EKYQKLKTEIE-----------KQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDL 78
E+YQ L E + +E++KEAIE L ++++E+ E + K L
Sbjct: 211 ERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASL--EEELEKLTEEISELEKRL 267
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 26.7 bits (59), Expect = 8.4
Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 6/49 (12%)
Query: 26 LVYRTEKY------QKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREE 68
+V EKY + + E + +E ++++TL D+ VK K+ +
Sbjct: 520 MVNEAEKYKAEDEANRERVEAKNGLENYCYSMKNTLQDEKVKGKLSDSD 568
>gnl|CDD|201967 pfam01778, Ribosomal_L28e, Ribosomal L28e protein family.
Length = 114
Score = 25.6 bits (57), Expect = 8.5
Identities = 12/54 (22%), Positives = 23/54 (42%)
Query: 36 LKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMKSMFAI 89
L T+ K+ K + E L KN +K +++ ++ L DL + +
Sbjct: 57 LVTKKPKRAHKPAKLWEKVTLSKNYRKALKQIKKALAYYRPDLIKKAKRRASKL 110
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 26.0 bits (58), Expect = 8.7
Identities = 8/43 (18%), Positives = 25/43 (58%)
Query: 30 TEKYQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLK 72
E+ ++LK E++++ E+ + + K + K+ ++ ++L+
Sbjct: 90 EEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKLR 132
>gnl|CDD|235864 PRK06807, PRK06807, DNA polymerase III subunit epsilon; Validated.
Length = 313
Score = 26.3 bits (58), Expect = 9.0
Identities = 11/53 (20%), Positives = 26/53 (49%), Gaps = 6/53 (11%)
Query: 33 YQKLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTMVKMKS 85
YQK + + E K+++ + L + V + ++ E L N +D+ ++ +
Sbjct: 165 YQKCAS---IEEEAKRKSNKEVLDETAVYEAVK---EILVKNKRDIEWIRCMN 211
>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Validated.
Length = 339
Score = 26.6 bits (60), Expect = 9.0
Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 11/46 (23%)
Query: 35 KLKTEIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDLTM 80
+LK IE +E++KE +E+ RL D+T+
Sbjct: 63 ELKQAIEAALEERKEELEA-----------AALNARLAAETIDVTL 97
>gnl|CDD|223610 COG0536, Obg, Predicted GTPase [General function prediction only].
Length = 369
Score = 26.4 bits (59), Expect = 9.0
Identities = 15/60 (25%), Positives = 25/60 (41%), Gaps = 4/60 (6%)
Query: 11 ISVCTALLSEGLTWLLVYRTEKYQKLKTEIEKQMEKKKEAIES-TLLDKNVKKKIEREEE 69
IS T EGL LL E ++ K E E ++ D++ +I R+++
Sbjct: 313 ISALT---REGLDELLRALAELLEETKAEAEAAEAEELPVEVEVLYDDEHEDFEIVRDDD 369
>gnl|CDD|217401 pfam03169, OPT, OPT oligopeptide transporter protein. The OPT
family of oligopeptide transporters is distinct from the
ABC pfam00005 and PTR pfam00854 transporter families.
OPT transporters were first recognised in fungi (Candida
albicans and Schizosaccharomyces pombe), but this
alignment also includes orthologues from Arabidopsis
thaliana. OPT transporters are thought to have 12-14
transmembrane domains and contain the following motif:
SPYxEVRxxVxxxDDP.
Length = 619
Score = 26.5 bits (59), Expect = 9.3
Identities = 7/33 (21%), Positives = 9/33 (27%), Gaps = 4/33 (12%)
Query: 86 MFAIGFAFTALLSMFNTIFDGKIVAKLPFTPIS 118
+G L + N F K IS
Sbjct: 6 AVVLGILLAILGAAVNMYF----GLKTGSVSIS 34
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 26.6 bits (59), Expect = 9.3
Identities = 15/51 (29%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 31 EKYQKLKT---EIEKQMEKKKEAIESTLLDKNVKKKIEREEERLKNNNKDL 78
E +Q+L E E + EKK++ +E K +K+ E RL ++ L
Sbjct: 193 EDHQELLQEEYEKEVKAEKKRQELERVEEKK--LEKMAPEASRLDEMSEGL 241
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 26.4 bits (58), Expect = 9.6
Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 5/42 (11%)
Query: 40 IEKQMEKKKEAIESTLLDK-----NVKKKIEREEERLKNNNK 76
I +M+ KEA E TLL+ N++ K ER+ E +K K
Sbjct: 22 ISIKMKSAKEAAELTLLNAEQEAVNLRGKAERDAEHIKKTAK 63
>gnl|CDD|216392 pfam01255, Prenyltransf, Putative undecaprenyl diphosphate
synthase. Previously known as uncharacterized protein
family UPF0015, a single member of this family has been
identified as an undecaprenyl diphosphate synthase.
Length = 222
Score = 26.3 bits (59), Expect = 9.6
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 54 TLLDKNVKKKIEREEERLKNNNK 76
+LL ++++K IE EE KNN
Sbjct: 91 SLLPEDLRKAIEDAEEATKNNTG 113
>gnl|CDD|182109 PRK09848, PRK09848, glucuronide transporter; Provisional.
Length = 448
Score = 26.3 bits (58), Expect = 9.9
Identities = 18/81 (22%), Positives = 37/81 (45%), Gaps = 15/81 (18%)
Query: 108 IVAKLPFTPISWVQGLSHRNLSGED----YTDCSFIF------LYILCTMSIRQNIQKML 157
I A L F ++++ G S +N S E+ Y + + LY +C S R+N+ +++
Sbjct: 153 IAASLTFVCLAFLIGPSIKNSSPEEMVSVYHFWTIVLAIAGMVLYFICFKSTRENVVRIV 212
Query: 158 GFAPSRTAAKQG-GGMFGTQP 177
++ + K + +P
Sbjct: 213 ----AQPSLKISLQTLKRNRP 229
>gnl|CDD|218499 pfam05208, ALG3, ALG3 protein. The formation of N-glycosidic
linkages of glycoproteins involves the ordered assembly
of the common Glc3Man9GlcNAc2 core-oligosaccharide on
the lipid carrier dolichyl pyrophosphate. Whereas early
mannosylation steps occur on the cytoplasmic side of the
endoplasmic reticulum with GDP-Man as donor, the final
reactions from Man5GlcNAc2-PP-Dol to Man9GlcNAc2-PP-Dol
on the lumenal side use Dol-P-Man. ALG3 gene encodes the
Dol-P-Man:Man5GlcNAc2-PP-Dol mannosyltransferase.
Length = 368
Score = 26.2 bits (58), Expect = 10.0
Identities = 12/45 (26%), Positives = 20/45 (44%), Gaps = 3/45 (6%)
Query: 93 FTALLSMFNTIFDGKIVAKLPFTPISWVQGLSHRN--LSGE-DYT 134
L + + I+ K+P+T I W + L+GE DY+
Sbjct: 1 VPPALILAEAVLCFLIIQKVPYTEIDWKAYMEQVEGFLNGERDYS 45
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.135 0.393
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,063,156
Number of extensions: 830542
Number of successful extensions: 1939
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1899
Number of HSP's successfully gapped: 231
Length of query: 179
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 88
Effective length of database: 6,901,388
Effective search space: 607322144
Effective search space used: 607322144
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 56 (25.2 bits)