RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy18023
(151 letters)
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 37.8 bits (89), Expect = 0.001
Identities = 28/109 (25%), Positives = 57/109 (52%), Gaps = 10/109 (9%)
Query: 47 IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
+EE + E E L + K+E+ ++ E ++ ++E +++ E L+ KE N
Sbjct: 44 LEEAKKEAEAIKK-EALLEAKEEIHKLRNEFEKELRERRNELQK-----LEKRLLQKEEN 97
Query: 107 ----QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
L++ ++EL++ KEL+ +EL + +EL ++I+E Q +E
Sbjct: 98 LDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146
Score = 34.4 bits (80), Expect = 0.017
Identities = 25/101 (24%), Positives = 63/101 (62%), Gaps = 6/101 (5%)
Query: 48 EEQEMEKECKVDFN-KLDQIKKELDQIIKELDQIIKELD--LIIKESNQVIKELNLIIKE 104
E + ++KE ++ ++ +++ E ++ ++E +++L+ L+ KE N + ++L L+ K
Sbjct: 50 EAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEEN-LDRKLELLEKR 108
Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKEL 145
+ +L++ +KEL+Q +EL+ +EL ++I+E Q ++ +
Sbjct: 109 --EEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERI 147
Score = 28.2 bits (64), Expect = 1.6
Identities = 22/78 (28%), Positives = 48/78 (61%), Gaps = 4/78 (5%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKEL-NLIIKESNQNKLDQIKKELDQ 119
+L++ +KEL+Q +EL++ +EL+ +I+E Q ++ + L +E+ + L+++++E
Sbjct: 110 EELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEA-- 167
Query: 120 MVKELDLTIKELNQIIKE 137
E + IKE+ + KE
Sbjct: 168 -RHEAAVLIKEIEEEAKE 184
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 37.6 bits (88), Expect = 0.001
Identities = 25/99 (25%), Positives = 55/99 (55%), Gaps = 14/99 (14%)
Query: 53 EKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQ 112
E E VDF +L + KEL++ +KEL++ ++++ ++K + + + +L
Sbjct: 322 EIEVGVDFPELKEELKELEEELKELEEELEKIKKLLK-----------KLPKKARGQLPP 370
Query: 113 IKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
K+E + + E T ++L++ ++EL + +KEL + +E
Sbjct: 371 EKREQLEKLLE---TKEKLSEELEELEEELKELKEELES 406
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 36.3 bits (85), Expect = 0.003
Identities = 24/92 (26%), Positives = 50/92 (54%), Gaps = 2/92 (2%)
Query: 62 KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMV 121
+L+Q +E + ++KE +++ +EL+ ++ + +L ++ Q + + KKE D+++
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590
Query: 122 KELDLTIKELNQIIK--ELNQIIKELNQIVEE 151
KEL K +K EL + K LN+ E+
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEK 622
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 34.3 bits (79), Expect = 0.016
Identities = 23/93 (24%), Positives = 47/93 (50%), Gaps = 12/93 (12%)
Query: 57 KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKE 116
+D KL + E K+L I+EL + N++ K++ + ++K+++IKKE
Sbjct: 26 DIDLEKLIALDDER----KKLLSEIEELQ---AKRNELSKQIGKAKGQ-KKDKIEEIKKE 77
Query: 117 LDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
L ++ +EL EL+ +K L +++ +
Sbjct: 78 LKELKEEL----TELSAALKALEAELQDKLLSI 106
Score = 26.9 bits (60), Expect = 5.3
Identities = 20/94 (21%), Positives = 45/94 (47%), Gaps = 4/94 (4%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM 120
N D +K+ L +D +++L + E +++ E+ + + N+ QI K Q
Sbjct: 9 NNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELS-KQIGKAKGQK 67
Query: 121 VKELDLTIKELNQI---IKELNQIIKELNQIVEE 151
+++ KEL ++ + EL+ +K L +++
Sbjct: 68 KDKIEEIKKELKELKEELTELSAALKALEAELQD 101
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 34.2 bits (79), Expect = 0.018
Identities = 26/114 (22%), Positives = 56/114 (49%), Gaps = 7/114 (6%)
Query: 32 GSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKES 91
G+ +E I + +S+G+ E+E+E+ K + E D+ KE + D + ++
Sbjct: 483 GTGKEQSITITASSGLSEEEIERMVKEA-----EANAEEDKKRKEEIEARNNADSLAYQA 537
Query: 92 NQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKEL 145
+ +KE + + K+++ EL + +K D+ E+ +EL Q +++L
Sbjct: 538 EKTLKEAGDKLPAEEKEKIEKAVAELKEALKGEDVE--EIKAKTEELQQALQKL 589
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 33.6 bits (77), Expect = 0.025
Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 9/81 (11%)
Query: 62 KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMV 121
KL ++KEL+Q + L++++KE + + KE Q ++EL + K EL++
Sbjct: 519 KLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERN--------KKLELEKEA 570
Query: 122 KE-LDLTIKELNQIIKELNQI 141
+E L KE+ II+EL +
Sbjct: 571 QEALKALKKEVESIIRELKEK 591
>gnl|CDD|216597 pfam01601, Corona_S2, Coronavirus S2 glycoprotein. The coronavirus
spike glycoprotein forms the characteristic 'corona'
after which the group is named. The Spike glycoprotein
is translated as a large polypeptide that is
subsequently cleaved to S1 pfam01600 and S2.
Length = 609
Score = 33.8 bits (78), Expect = 0.027
Identities = 25/98 (25%), Positives = 40/98 (40%), Gaps = 23/98 (23%)
Query: 56 CKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKK 115
C V++ + L II + KELD K N + +L+L D+
Sbjct: 460 CDVNYVNIT--NTVLPPIIPDYIDFNKELDKWFKNLNSTVPDLDL----------DKFNA 507
Query: 116 ELDQMVKELDLT--IKELNQIIKELNQII---KELNQI 148
+ L+LT I L ++IK LN + + LN+
Sbjct: 508 TI------LNLTGEIDRLQEVIKNLNDSLIDLEWLNRY 539
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 33.7 bits (78), Expect = 0.028
Identities = 24/78 (30%), Positives = 46/78 (58%), Gaps = 6/78 (7%)
Query: 69 ELDQIIKELDQIIKELDLIIKESNQV-IKELNLIIKESNQNKLDQIKKELDQMVKELDLT 127
+L + + +L + +L+ + +E +V +K L +IK+ +L++I+KE+ ++ +E
Sbjct: 54 KLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVE-EELEKIEKEIKELEEE---- 108
Query: 128 IKELNQIIKELNQIIKEL 145
I EL IKEL Q I+ L
Sbjct: 109 ISELENEIKELEQEIERL 126
Score = 31.8 bits (73), Expect = 0.11
Identities = 19/80 (23%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 74 IKELDQIIKELDLI--IKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKEL 131
L ++ + LD + ++E + L+++ K++++ +++++ IKEL
Sbjct: 49 RSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVK---SLEELIKDVEEELEKIEKEIKEL 105
Query: 132 NQIIKELNQIIKELNQIVEE 151
+ I EL IKEL Q +E
Sbjct: 106 EEEISELENEIKELEQEIER 125
Score = 26.4 bits (59), Expect = 7.7
Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 6/84 (7%)
Query: 49 EQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQN 108
+ K + K K L+++IK++++ +++++ IKE + I EL IKE
Sbjct: 63 RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKE---- 118
Query: 109 KLDQIKKELDQMVKELDLTIKELN 132
L+Q + L+ DL + L
Sbjct: 119 -LEQEIERLEPW-GNFDLDLSLLL 140
Score = 26.0 bits (58), Expect = 9.5
Identities = 18/64 (28%), Positives = 35/64 (54%)
Query: 41 LESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNL 100
L S E++ KV L+++ K++++ ++++++ IKEL+ I E IKEL
Sbjct: 62 LRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQ 121
Query: 101 IIKE 104
I+
Sbjct: 122 EIER 125
>gnl|CDD|224380 COG1463, Ttg2C, ABC-type transport system involved in resistance to
organic solvents, periplasmic component [Secondary
metabolites biosynthesis, transport, and catabolism].
Length = 359
Score = 33.2 bits (76), Expect = 0.035
Identities = 24/121 (19%), Positives = 45/121 (37%), Gaps = 12/121 (9%)
Query: 30 TDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIK 89
T G E I L + + I E Q+ +L+ ++ DL++
Sbjct: 103 TTGLLGEKYIELTPGDAGGPKLRLG---------ATIPLERTQVPIDLEDLL--GDLLLL 151
Query: 90 ESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
LN I+ E+ L +L+ ++ L LN ++ +I LNQ++
Sbjct: 152 LGGLDPDRLNAILNEAAAA-LAGTGPQLNALLDNLAQFTDALNARDGDIGALIANLNQLL 210
Query: 150 E 150
+
Sbjct: 211 D 211
>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
Length = 653
Score = 33.2 bits (76), Expect = 0.038
Identities = 25/121 (20%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 32 GSTEESEINLESSNGIEEQE---MEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
G+ E I + ++ G+ E M +E + + + D+ +K+L ++ + D ++ + +
Sbjct: 487 GTGREQSIRITNTGGLSSNEIERMRQEAEK-YAEEDRRRKQLIELKNQADSLLYSYESTL 545
Query: 89 KESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
KE+ ++I E +K+ + K++Q++ L +++++EL Q ++E Q + +
Sbjct: 546 KENGELISE---ELKQRAEQKVEQLEAALTDP----NISLEELKQQLEEFQQALLAIGAE 598
Query: 149 V 149
V
Sbjct: 599 V 599
>gnl|CDD|215960 pfam00509, Hemagglutinin, Haemagglutinin. Hemagglutinin from
influenza virus causes membrane fusion of the viral
membrane with the host membrane. Fusion occurs after the
host cell internalises the virus by endocytosis. The
drop of pH causes release of a hydrophobic fusion
peptide and a large conformational change leading to
membrane fusion.
Length = 550
Score = 32.7 bits (75), Expect = 0.049
Identities = 11/45 (24%), Positives = 24/45 (53%), Gaps = 4/45 (8%)
Query: 103 KESNQNKLDQIKKEL----DQMVKELDLTIKELNQIIKELNQIIK 143
+S Q +DQI +L ++M + + KE +++ + +N + K
Sbjct: 366 LKSTQKAIDQITGKLNSLIEKMNTQFEAIEKEFSELERRINNLNK 410
Score = 26.5 bits (59), Expect = 5.9
Identities = 11/49 (22%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLD 111
+DQI +L+ +I +++ + + I KE +++ + +N + K+ + D
Sbjct: 373 IDQITGKLNSLI---EKMNTQFEAIEKEFSELERRINNLNKKVDDGITD 418
>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility
and secretion / Signal transduction mechanisms].
Length = 408
Score = 32.7 bits (74), Expect = 0.054
Identities = 20/87 (22%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 68 KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK---KELDQMVKEL 124
KE+ +I+E+ + ++ES + E + + E + L +I +E+ Q++ E+
Sbjct: 292 KEIGLLIEEIQNEAADAVEHMEESASEVSE-GVKLVEETGSSLGEIAAAIEEVSQLISEI 350
Query: 125 DLTIKELNQIIKELNQIIKELNQIVEE 151
+E +++E+N I+EL+ + +E
Sbjct: 351 AAATEEQTAVLEEINASIEELDDVTQE 377
Score = 32.3 bits (73), Expect = 0.089
Identities = 22/94 (23%), Positives = 53/94 (56%), Gaps = 4/94 (4%)
Query: 61 NKLDQIKKELDQI---IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKEL 117
+ DQ + L+++ I+EL + +KE+ KE+ + E + + +E + + Q +++
Sbjct: 159 ARADQQAESLEEVASAIEELSETVKEVAFNAKEAAALASEASQVAEEGGEE-VRQAVEQM 217
Query: 118 DQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
++ +EL +K+L++ +E+ +I +N I E+
Sbjct: 218 QEIAEELAEVVKKLSESSQEIEEITSVINSIAEQ 251
Score = 31.9 bits (72), Expect = 0.11
Identities = 19/92 (20%), Positives = 48/92 (52%), Gaps = 5/92 (5%)
Query: 60 FNKLDQIKKELDQIIKELDQI---IKELDLIIKESNQVIKELNLIIKESNQN--KLDQIK 114
+++ + K +++ L +I I+E+ +I E +E +++E N + +LD +
Sbjct: 316 ASEVSEGVKLVEETGSSLGEIAAAIEEVSQLISEIAAATEEQTAVLEEINASIEELDDVT 375
Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELN 146
+E V+EL +EL ++ ++L +++ +
Sbjct: 376 QENAAAVEELAAASEELKELAEKLLELVAKFK 407
Score = 31.1 bits (70), Expect = 0.22
Identities = 18/89 (20%), Positives = 42/89 (47%), Gaps = 2/89 (2%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVK 122
+++I+ E ++ +++ E+ +K + L I + + Q+ E+ +
Sbjct: 298 IEEIQNEAADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEE--VSQLISEIAAATE 355
Query: 123 ELDLTIKELNQIIKELNQIIKELNQIVEE 151
E ++E+N I+EL+ + +E VEE
Sbjct: 356 EQTAVLEEINASIEELDDVTQENAAAVEE 384
Score = 28.0 bits (62), Expect = 2.0
Identities = 20/112 (17%), Positives = 52/112 (46%), Gaps = 4/112 (3%)
Query: 42 ESSNGIEE--QEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN 99
+S+ I +E++ E +++ E+ + +K +++ L I + + +L
Sbjct: 289 DSAKEIGLLIEEIQNEAADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEE-VSQLI 347
Query: 100 LIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
I + + + +E++ ++ELD +E ++EL +EL ++ E+
Sbjct: 348 SEIAAATEE-QTAVLEEINASIEELDDVTQENAAAVEELAAASEELKELAEK 398
>gnl|CDD|217653 pfam03639, Glyco_hydro_81, Glycosyl hydrolase family 81. Family of
eukaryotic beta-1,3-glucanases. Within the Aspergillus
fumigatus protein two perfectly conserved Glu residues
(E550 or E554) have been proposed as putative
nucleophiles of the active site of the Engl1
endoglucanase, while the proton donor would be D475. The
endo-beta-1,3-glucanase activity is essential for
efficient spore release.
Length = 694
Score = 32.6 bits (74), Expect = 0.064
Identities = 13/31 (41%), Positives = 13/31 (41%), Gaps = 9/31 (29%)
Query: 16 YFALFRMFD---------GLTPVTDGSTEES 37
YF FR FD GL DG EES
Sbjct: 499 YFPQFRNFDWFNGHSWAGGLFESGDGKNEES 529
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 32.7 bits (75), Expect = 0.068
Identities = 22/104 (21%), Positives = 56/104 (53%), Gaps = 14/104 (13%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIK------ESNQVIKELNLIIK------ESNQN 108
++Q+++ +D+ +++ ++EL+ I+ E ++EL +++ E N
Sbjct: 282 ELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQ 341
Query: 109 KLDQIKKELDQMVKELDLT--IKELNQIIKELNQIIKELNQIVE 150
KL++ K+ ++ +T I+ +N II +N++I+E N+ ++
Sbjct: 342 KLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKID 385
Score = 30.0 bits (68), Expect = 0.55
Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 6/91 (6%)
Query: 61 NKLDQIKKELDQIIKELDQ-IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQ 119
K+D +KKE ++ K+L ++ EL I + K L I S + ++ Q++ E
Sbjct: 382 EKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAI-NSLEKEIKQLEAE--- 437
Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQIVE 150
+K L+ IKEL + + + E+N++++
Sbjct: 438 -IKALEKEIKELEKQLTNIEPTADEINKLLK 467
Score = 29.6 bits (67), Expect = 0.64
Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 9/91 (9%)
Query: 61 NKLDQIKKELDQI---IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKE- 116
+ ++ I +D I I+E ++ I L KE N+ K+L L + + +D +KE
Sbjct: 361 DLIESINDIIDAINELIREHNEKIDNLK---KEKNKAKKKLWLHLVAELKEDIDAYQKEK 417
Query: 117 --LDQMVKELDLTIKELNQIIKELNQIIKEL 145
L++ + L+ IK+L IK L + IKEL
Sbjct: 418 KGLEKAINSLEKEIKQLEAEIKALEKEIKEL 448
Score = 27.3 bits (61), Expect = 4.5
Identities = 25/122 (20%), Positives = 59/122 (48%), Gaps = 20/122 (16%)
Query: 50 QEMEKECKVDFNKLDQIKKELDQIIKE---------LDQIIKELDLIIKESNQVIKELNL 100
+E++ + L++ ++L++ +K+ + +I+ ++ II N++I+E N
Sbjct: 323 EELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNE 382
Query: 101 IIK--ESNQNKL---------DQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
I + +NK ++K+++D KE K +N + KE+ Q+ E+ +
Sbjct: 383 KIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALE 442
Query: 150 EE 151
+E
Sbjct: 443 KE 444
>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains
(chemotaxis sensory transducer). Thought to undergo
reversible methylation in response to attractants or
repellants during bacterial chemotaxis.
Length = 262
Score = 32.3 bits (74), Expect = 0.069
Identities = 22/87 (25%), Positives = 48/87 (55%), Gaps = 4/87 (4%)
Query: 68 KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK---KELDQMVKEL 124
KE++ +IKE+ + E ++ES+ ++E +++E+ L++I +E+ +V+E+
Sbjct: 147 KEIESLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGDA-LEEIVDSVEEIADLVQEI 205
Query: 125 DLTIKELNQIIKELNQIIKELNQIVEE 151
E +E+N I E+ Q+ +E
Sbjct: 206 AAATDEQAAGSEEVNAAIDEIAQVTQE 232
>gnl|CDD|227785 COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope
biogenesis, outer membrane].
Length = 760
Score = 32.5 bits (74), Expect = 0.071
Identities = 14/31 (45%), Positives = 14/31 (45%), Gaps = 9/31 (29%)
Query: 16 YFALFRMFD---------GLTPVTDGSTEES 37
YF FRMFD GL DG EES
Sbjct: 557 YFPQFRMFDWFNGHSWADGLFESGDGKDEES 587
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 32.3 bits (74), Expect = 0.073
Identities = 25/109 (22%), Positives = 64/109 (58%), Gaps = 8/109 (7%)
Query: 48 EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKEL-----NLII 102
E +E++KE ++L +++L+++ +E++++ +ELD + +E ++ +EL +
Sbjct: 372 ELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAG 431
Query: 103 KESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
E+ N+L++ K++ +K+ + +L Q+ +L++ +EL + EE
Sbjct: 432 IEAKINELEEEKEDKALEIKKQE---WKLEQLAADLSKYEQELYDLKEE 477
Score = 30.8 bits (70), Expect = 0.28
Identities = 20/104 (19%), Positives = 60/104 (57%), Gaps = 4/104 (3%)
Query: 48 EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQ 107
E++ +E++ + +L +++EL+++ +E+ ++ K ++E Q+++ELN IK+ +
Sbjct: 231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKR----LEEIEQLLEELNKKIKDLGE 286
Query: 108 NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+ ++K+++ ++ E+ + + + +EL + L ++ E
Sbjct: 287 EEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAE 330
Score = 30.8 bits (70), Expect = 0.30
Identities = 21/97 (21%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 55 ECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK 114
E + N+L K+ L++ I+EL + +L IK + I+ LN +E + +L++++
Sbjct: 816 EIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEE-ELEELE 874
Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
L + L KE +++ +L ++ +++ ++ +
Sbjct: 875 AALRDLESRLGDLKKERDELEAQLRELERKIEELEAQ 911
Score = 30.4 bits (69), Expect = 0.33
Identities = 24/105 (22%), Positives = 54/105 (51%), Gaps = 9/105 (8%)
Query: 47 IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
+EE + E ++ +++ +I+ EL +L++ + ++ ++E Q + L L
Sbjct: 777 LEEALNDLEARLSHSRIPEIQAELS----KLEEEVSRIEARLREIEQKLNRLTL-----E 827
Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+ L++ +EL + +L IK + + I+ LN +EL + +EE
Sbjct: 828 KEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEE 872
Score = 30.4 bits (69), Expect = 0.39
Identities = 22/104 (21%), Positives = 61/104 (58%), Gaps = 5/104 (4%)
Query: 48 EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQ 107
E ++ E+ ++D++ E++++ +E+++ K D + +E ++ +EL + +
Sbjct: 316 ELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDL-----R 370
Query: 108 NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+L+++ KE + EL ++L ++ +E+N++ +EL+++ EE
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEE 414
Score = 30.4 bits (69), Expect = 0.40
Identities = 21/104 (20%), Positives = 56/104 (53%), Gaps = 4/104 (3%)
Query: 48 EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKE---LNLIIKE 104
+ +++E ++L +I+ LD++ +EL +++ I KE Q+ +E L ++E
Sbjct: 682 RLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE 741
Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
+ L +++E++ + EL + ++ ++L+++ + LN +
Sbjct: 742 L-EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDL 784
Score = 29.3 bits (66), Expect = 0.81
Identities = 25/135 (18%), Positives = 69/135 (51%), Gaps = 4/135 (2%)
Query: 21 RMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDF-NKLDQIKKELDQIIKELDQ 79
R+ + ++D S + EI E +E+E KE + L +++E++ + EL +
Sbjct: 703 RLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKE 762
Query: 80 IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM---VKELDLTIKELNQIIK 136
+ ++ + ++ +++ + LN + + +++ +I+ EL ++ V ++ ++E+ Q +
Sbjct: 763 LEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLN 822
Query: 137 ELNQIIKELNQIVEE 151
L + L + ++E
Sbjct: 823 RLTLEKEYLEKEIQE 837
Score = 28.5 bits (64), Expect = 1.5
Identities = 22/126 (17%), Positives = 63/126 (50%), Gaps = 22/126 (17%)
Query: 46 GIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIK--------- 96
G+ E + +KE +L+++++ ++++ +D+ ++L+ + +E + +
Sbjct: 164 GVAEFDRKKEKA--LEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKR 221
Query: 97 --ELNLIIKESN---------QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKEL 145
E ++KE + +L +++EL+++ +E+ K L +I + L ++ K++
Sbjct: 222 EYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKI 281
Query: 146 NQIVEE 151
+ EE
Sbjct: 282 KDLGEE 287
Score = 26.6 bits (59), Expect = 7.3
Identities = 23/121 (19%), Positives = 59/121 (48%), Gaps = 4/121 (3%)
Query: 31 DGSTEESEINLESSNGIEEQEMEK--ECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
+ E E L +E ++ E + L + K +++ I+ L+ +EL+ +
Sbjct: 811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEEL 870
Query: 89 KESNQVIKEL--NLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELN 146
+E +++L L + +++L+ +EL++ ++EL+ I++ + + EL ++ L
Sbjct: 871 EELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALE 930
Query: 147 Q 147
+
Sbjct: 931 E 931
>gnl|CDD|240411 PTZ00419, PTZ00419, valyl-tRNA synthetase-like protein;
Provisional.
Length = 995
Score = 32.3 bits (74), Expect = 0.084
Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 8/101 (7%)
Query: 48 EEQEMEKECKVDF--NKLDQIKKELDQII---KELDQIIKELDLIIKESNQVIKELNLII 102
EE E+ K C D NK+ I LD+ I KEL ++ K+L + K +K+++ I
Sbjct: 898 EEAEVPKGCGFDVVDNKVI-IYLNLDEFIDLKKELAKLEKKLAKLQKSLESYLKKIS--I 954
Query: 103 KESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIK 143
+ ++K D+ + EL+ IK+L Q I+EL ++K
Sbjct: 955 PNYEDKVPEDVRKLNDEKIDELNEEIKQLEQAIEELKSLLK 995
Score = 30.7 bits (70), Expect = 0.31
Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 15/74 (20%)
Query: 88 IKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL---------TIKELN-QIIKE 137
+ E + KEL + K KL +++K L+ +K++ + +++LN + I E
Sbjct: 921 LDEFIDLKKELAKLEK-----KLAKLQKSLESYLKKISIPNYEDKVPEDVRKLNDEKIDE 975
Query: 138 LNQIIKELNQIVEE 151
LN+ IK+L Q +EE
Sbjct: 976 LNEEIKQLEQAIEE 989
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 31.8 bits (73), Expect = 0.11
Identities = 16/83 (19%), Positives = 47/83 (56%), Gaps = 1/83 (1%)
Query: 69 ELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTI 128
+L ++ +E ++++EL+ + E N++ KE+ +K ++ +++ E+ ++ ++L
Sbjct: 30 KLLELDEERRKLLRELEELQAERNELSKEIGRALKR-GEDDAEELIAEVKELKEKLKELE 88
Query: 129 KELNQIIKELNQIIKELNQIVEE 151
L+++ EL+ ++ + I E
Sbjct: 89 AALDELEAELDTLLLTIPNIPHE 111
Score = 28.3 bits (64), Expect = 1.6
Identities = 16/75 (21%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 77 LDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIK 136
+D+++ ELD +E ++++EL + E N+ +I + L + + + I E+ ++ +
Sbjct: 28 VDKLL-ELD---EERRKLLRELEELQAERNELS-KEIGRALKRGEDDAEELIAEVKELKE 82
Query: 137 ELNQIIKELNQIVEE 151
+L ++ L+++ E
Sbjct: 83 KLKELEAALDELEAE 97
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 32.0 bits (73), Expect = 0.12
Identities = 34/110 (30%), Positives = 64/110 (58%), Gaps = 6/110 (5%)
Query: 45 NGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKE 104
NGIEE+ E E K +L+++KK+L ++ K L + ++E + +E+ +EL + K
Sbjct: 324 NGIEERIKELEEKE--ERLEELKKKLKELEKRL-EELEERHELYEEAKAKKEELERLKKR 380
Query: 105 SNQNKLDQIKKELDQMVK---ELDLTIKELNQIIKELNQIIKELNQIVEE 151
++++KEL+++ K E++ I ++ I EL + IKEL + +EE
Sbjct: 381 LTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEE 430
Score = 30.4 bits (69), Expect = 0.36
Identities = 29/109 (26%), Positives = 64/109 (58%), Gaps = 9/109 (8%)
Query: 48 EEQEMEKECKVDFNKLDQIKKELDQIIKELDQI----IKELDLIIKESNQVIKELNLIIK 103
+ +E++K+ KLD++++EL +++KEL+++ ++EL+ +KE E +
Sbjct: 550 KLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKD 609
Query: 104 -----ESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
E + +L ++++ELD+ +EL T K L ++ KEL ++ K+ ++
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE 658
Score = 30.0 bits (68), Expect = 0.54
Identities = 31/119 (26%), Positives = 66/119 (55%), Gaps = 2/119 (1%)
Query: 33 STEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESN 92
S+E E+ E +E + +E K + +L++ + L+ ++L++ I+EL+ I+E
Sbjct: 213 SSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELK 272
Query: 93 QVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+ I+EL +KE +L + +E ++ + + + EL +I K L+++ +E+N I E
Sbjct: 273 KEIEELEEKVKEL--KELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEER 329
Score = 28.9 bits (65), Expect = 1.2
Identities = 24/92 (26%), Positives = 58/92 (63%), Gaps = 1/92 (1%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQ-NKLDQIKKELDQ 119
+ + +KEL+++++E+++I EL + +E ++ KE+ + + + +L++ + L+
Sbjct: 193 ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEG 252
Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
++L+ I+EL + I+EL + I+EL + V+E
Sbjct: 253 SKRKLEEKIRELEERIEELKKEIEELEEKVKE 284
Score = 27.0 bits (60), Expect = 5.8
Identities = 27/105 (25%), Positives = 59/105 (56%), Gaps = 4/105 (3%)
Query: 47 IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
+E ++ EKE + + +L ++++ELD+ +EL + K L+ + KE ++EL E
Sbjct: 605 LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKE----LEELEKKYSEEE 660
Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+L + EL + + L ++EL + +E+ + +++L + +EE
Sbjct: 661 YEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEE 705
Score = 26.6 bits (59), Expect = 5.9
Identities = 36/127 (28%), Positives = 70/127 (55%), Gaps = 11/127 (8%)
Query: 34 TEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQ 93
EE E E ++ + K K + KL+++KK+L ++ K+LD++ +EL ++KE +
Sbjct: 524 AEEYEKLKEKLIKLKGEI--KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEE 581
Query: 94 V----IKELNLIIKE-----SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKE 144
+ ++EL +KE + +L +KEL++ KEL +EL++ +EL + K
Sbjct: 582 LGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKR 641
Query: 145 LNQIVEE 151
L ++ +E
Sbjct: 642 LEELRKE 648
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 31.9 bits (73), Expect = 0.12
Identities = 21/95 (22%), Positives = 46/95 (48%), Gaps = 10/95 (10%)
Query: 59 DFNKLDQIKKELDQIIKELDQIIK-ELDLIIKESNQVIKELNLIIKESNQNKLDQIKKEL 117
DFN+L + + L+ L++++K + I++E N +E+ +K + + +K L
Sbjct: 312 DFNQLPDVTEILEN----LEEVVKADFSSIVQEGNSTFEEIPSKVKNQTSSVVPDVKAAL 367
Query: 118 DQMVKEL-----DLTIKELNQIIKELNQIIKELNQ 147
D + ++ DL ++ L+ + + LN N
Sbjct: 368 DSLGTDIKSVAEDLPLQVLSVLSQILNNTQSSSNP 402
Score = 27.7 bits (62), Expect = 3.2
Identities = 18/69 (26%), Positives = 36/69 (52%), Gaps = 5/69 (7%)
Query: 57 KVDFNKLDQIKK---ELDQIIKELDQIIKELDLIIKESNQVIKELN--LIIKESNQNKLD 111
D N+L IK+ E+DQ ++ L + + L+ E+ + +++L + + N L
Sbjct: 555 SYDLNELLDIKQYTPEIDQELESLKVNLSTITLLTPEARRNLEDLASSGLDDINFSNFLK 614
Query: 112 QIKKELDQM 120
QI+K L ++
Sbjct: 615 QIQKPLTEV 623
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 31.4 bits (72), Expect = 0.16
Identities = 24/93 (25%), Positives = 48/93 (51%), Gaps = 5/93 (5%)
Query: 61 NKLDQIKKELDQIIKELDQIIKE--LDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD 118
L + K+E+ ++ EL++ +KE +L E + +E L + LD+ ++ L+
Sbjct: 51 EALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETL---DRKMESLDKKEENLE 107
Query: 119 QMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+ KEL K L++ +EL ++I E + +E
Sbjct: 108 KKEKELSNKEKNLDEKEEELEELIAEQREELER 140
Score = 29.1 bits (66), Expect = 0.82
Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 14/110 (12%)
Query: 31 DGSTEESEINLESSNGIEE--QEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
+ T + E LE+ + + E+E+E K N+L ++++ L Q +E L
Sbjct: 44 EAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQ---------REETLDR 94
Query: 89 KESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
K + KE NL E + +L +K LD+ +EL+ I E + ++ +
Sbjct: 95 KMESLDKKEENL---EKKEKELSNKEKNLDEKEEELEELIAEQREELERI 141
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 31.2 bits (71), Expect = 0.16
Identities = 23/105 (21%), Positives = 59/105 (56%), Gaps = 2/105 (1%)
Query: 47 IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
++E+E +E + + + ++ IK+L++ ++ L+ E + ++EL I E
Sbjct: 397 VKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREI-EKL 455
Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+++L++ ++E+ V++ D I+ ++ I+ L + ++E + VEE
Sbjct: 456 ESELERFRREVRDKVRK-DREIRARDRRIERLEKELEEKKKRVEE 499
Score = 27.0 bits (60), Expect = 5.0
Identities = 16/96 (16%), Positives = 54/96 (56%), Gaps = 7/96 (7%)
Query: 62 KLDQIKKELDQIIKELDQIIKELDLIIKESNQV------IKELNLIIKESNQNKLDQIKK 115
+L+++K+E++++ EL++ +E+ +++ ++ I+ L ++E + +++++++
Sbjct: 444 ELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKK-RVEELER 502
Query: 116 ELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+L ++ K L + +K + ++ E + EE
Sbjct: 503 KLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEE 538
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 31.2 bits (71), Expect = 0.18
Identities = 9/47 (19%), Positives = 25/47 (53%)
Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
++ +L QI+KE+ + K++ + ++ K+L + E+ + +
Sbjct: 35 ADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQ 81
Score = 30.5 bits (69), Expect = 0.34
Identities = 12/70 (17%), Positives = 33/70 (47%), Gaps = 2/70 (2%)
Query: 57 KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKE--SNQNKLDQIK 114
D +L QI+KE+ + K++ + + + K+ + E+ + + + L +++
Sbjct: 34 AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93
Query: 115 KELDQMVKEL 124
K++ + L
Sbjct: 94 KQIADLNARL 103
Score = 28.1 bits (63), Expect = 1.8
Identities = 17/81 (20%), Positives = 36/81 (44%), Gaps = 15/81 (18%)
Query: 68 KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM---VKEL 124
K+L QI KE+ + K++ + ++ K+L ++ E+ + + E
Sbjct: 38 KQLKQIQKEIAALEKKIREQQDQRAKLEKQLK------------SLETEIASLEAQLIET 85
Query: 125 DLTIKELNQIIKELNQIIKEL 145
+K+L + I +LN + L
Sbjct: 86 ADDLKKLRKQIADLNARLNAL 106
Score = 26.2 bits (58), Expect = 8.5
Identities = 6/45 (13%), Positives = 22/45 (48%)
Query: 104 ESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
+ + K+ + + + ++ K+L E+ + +L + +L ++
Sbjct: 48 AALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKL 92
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 31.0 bits (71), Expect = 0.20
Identities = 19/85 (22%), Positives = 49/85 (57%), Gaps = 6/85 (7%)
Query: 66 IKKELDQIIKELDQII---KELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVK 122
++ ++ + +K L++++ K+ + I++ +++L L++K S + ++ E D++ K
Sbjct: 257 VEGDVAKALKILEELLDEGKDPNRFIEDLIYYLRDL-LLVKTSPELEILDESTENDELFK 315
Query: 123 EL--DLTIKELNQIIKELNQIIKEL 145
EL L I+ L ++I LN +++
Sbjct: 316 ELSEKLDIERLYRMIDILNDAQQQI 340
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 30.8 bits (70), Expect = 0.21
Identities = 33/124 (26%), Positives = 54/124 (43%), Gaps = 3/124 (2%)
Query: 30 TDGSTEESEINLESSNGIEEQEMEKECKVD--FNKLDQIKKELDQIIKELDQIIKELDLI 87
+G E E NLE EE + ++ KL + + L + I L Q+ EL+L
Sbjct: 145 LEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLC 204
Query: 88 IKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
+ ++ + K Q++ EL Q ++EL + I+ L EL + I E +
Sbjct: 205 DPLELEKARQELRSLSVKISEKRKQLE-ELQQELQELTIAIEALTNKKSELLEEIAEAEK 263
Query: 148 IVEE 151
I EE
Sbjct: 264 IREE 267
>gnl|CDD|182106 PRK09841, PRK09841, cryptic autophosphorylating protein tyrosine
kinase Etk; Provisional.
Length = 726
Score = 31.0 bits (70), Expect = 0.22
Identities = 30/115 (26%), Positives = 60/115 (52%), Gaps = 11/115 (9%)
Query: 44 SNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNL--- 100
+N +Q + ++ D L+ ++++L ++ ELDQ ++L++ ++ + V +LNL
Sbjct: 250 ANNYLQQNIARQAAQDSQSLEFLQRQLPEVRSELDQAEEKLNVYRQQRDSV--DLNLEAK 307
Query: 101 -----IIKESNQ-NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
I+ NQ N+L + E+ Q+ K+ T + L + + L Q K LN+ V
Sbjct: 308 AVLEQIVNVDNQLNELTFREAEISQLYKKDHPTYRALLEKRQTLEQERKRLNKRV 362
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 30.8 bits (70), Expect = 0.23
Identities = 24/132 (18%), Positives = 56/132 (42%), Gaps = 1/132 (0%)
Query: 21 RMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKE-LDQIIKELDQ 79
R+ + + + I+ +E + E K + E ++I L +
Sbjct: 876 RVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKK 935
Query: 80 IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELN 139
++ +DL S + +K L ++KL + +E + ++K+ + ++E N+ EL
Sbjct: 936 LLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELK 995
Query: 140 QIIKELNQIVEE 151
KEL ++ ++
Sbjct: 996 NFKKELAELSKQ 1007
Score = 30.0 bits (68), Expect = 0.47
Identities = 26/114 (22%), Positives = 53/114 (46%), Gaps = 7/114 (6%)
Query: 38 EINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKE 97
+ L + + E +E + NKL +++ +L + +E + ++K+ ++++E N+ E
Sbjct: 934 KKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSE 993
Query: 98 LNLIIKE----SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
L KE S Q Q + +KEL + + EL K ++ EL+
Sbjct: 994 LKNFKKELAELSKQYGALQESTKQ---LKELPVEVAELQSASKIISSESTELSI 1044
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 30.8 bits (70), Expect = 0.26
Identities = 21/98 (21%), Positives = 57/98 (58%), Gaps = 9/98 (9%)
Query: 61 NKLDQIKKELDQI---IKELDQIIKELDLIIKESNQVIKELN----LIIKESNQNKLDQI 113
+KL +++ L+++ I EL++I+ E + K+ ++ +++ +I ++ K ++
Sbjct: 306 DKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAK--KV 363
Query: 114 KKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
K ++++ E +EL ++ EL++I+K +++V+E
Sbjct: 364 KAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKE 401
Score = 28.1 bits (63), Expect = 2.1
Identities = 23/78 (29%), Positives = 35/78 (44%), Gaps = 1/78 (1%)
Query: 74 IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQ 133
I+EL+Q I+ LD+ I Q IK N I+E + K + EL K +
Sbjct: 176 IRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRK-KNGENIARKQNKYDELVEEAKTIKA 234
Query: 134 IIKELNQIIKELNQIVEE 151
I+EL + L +E+
Sbjct: 235 EIEELTDELLNLVMDIED 252
Score = 26.9 bits (60), Expect = 4.9
Identities = 23/109 (21%), Positives = 43/109 (39%), Gaps = 8/109 (7%)
Query: 29 VTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
+T + E+ + +E +FN+ + EL I Q + L
Sbjct: 301 ITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKA 360
Query: 89 KESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKE 137
K+ I+EL Q + +EL ++ ELD +K ++++KE
Sbjct: 361 KKVKAAIEEL--------QAEFVDNAEELAKLQDELDKIVKTKSELVKE 401
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 30.8 bits (70), Expect = 0.27
Identities = 22/112 (19%), Positives = 57/112 (50%), Gaps = 10/112 (8%)
Query: 50 QEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQ-- 107
+E +E + + +L++ +EL + I+ L + ++E + +++E Q++ EL +E +
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367
Query: 108 --------NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+ +++EL ++ EL EL ++ +E+ + + L ++ E
Sbjct: 368 SALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSER 419
Score = 30.5 bits (69), Expect = 0.34
Identities = 23/116 (19%), Positives = 63/116 (54%), Gaps = 1/116 (0%)
Query: 36 ESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVI 95
E E+ ++ +E + + +L++ EL++ ++EL++ ++EL ++E
Sbjct: 813 ERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEK 872
Query: 96 KELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+EL +KE + + +++++EL ++ EL +E+ ++ + L ++ +L ++ E
Sbjct: 873 EELEDELKE-LEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVE 927
Score = 30.1 bits (68), Expect = 0.48
Identities = 22/107 (20%), Positives = 57/107 (53%), Gaps = 3/107 (2%)
Query: 48 EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN---LIIKE 104
+E +E + +L+Q+ EL++ +EL++ + L ++E + ++E
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELA 393
Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+N+L+++K+E++ + + L+ + L + +EL ++ EL ++ E
Sbjct: 394 EIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTE 440
Score = 29.7 bits (67), Expect = 0.57
Identities = 30/118 (25%), Positives = 63/118 (53%), Gaps = 2/118 (1%)
Query: 36 ESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVI 95
E E+ E QE +E + + L++ +L + I+EL++ + L ++E + +
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802
Query: 96 KELNLII--KESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+E + E L+Q ++ L+Q ++EL+ I+EL + + EL + ++EL + +EE
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEE 860
Score = 28.1 bits (63), Expect = 2.1
Identities = 24/111 (21%), Positives = 60/111 (54%), Gaps = 6/111 (5%)
Query: 42 ESSNGIEEQEMEKE-CKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNL 100
E +EE E + E K + L++ ++L ++EL++ ++EL+ ++E + ++EL
Sbjct: 706 ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEEL-- 763
Query: 101 IIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
E L++ +L + ++EL+ + L + ++EL + ++E + ++
Sbjct: 764 ---EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDA 811
>gnl|CDD|189014 cd09607, M3B_PepF_2, Peptidase family M3B Oligopeptidase F (PepF).
Peptidase family M3B Oligopeptidase F (PepF;
Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
includes oligoendopeptidase F from Lactococcus lactis.
This enzyme hydrolyzes peptides containing between 7 and
17 amino acids with fairly broad specificity. The PepF
gene is duplicated in L. lactis on the plasmid that
bears it, while a shortened second copy is found in
Bacillus subtilis. Most bacterial PepFs are cytoplasmic
endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus
lactis has been identified in Bacillus subtilis as an
inhibitor of sporulation initiation when over expressed
from a multicopy plasmid.
Length = 581
Score = 30.6 bits (70), Expect = 0.29
Identities = 10/45 (22%), Positives = 23/45 (51%)
Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
Q L ++K+ +D++ LD + ++ L +I L +++E
Sbjct: 16 QEDLKKLKQLIDELKSLLDALLPSDEDAVETLEALITLLQELLEL 60
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 30.3 bits (69), Expect = 0.29
Identities = 14/70 (20%), Positives = 25/70 (35%), Gaps = 9/70 (12%)
Query: 84 LDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM----VKELDLTIKELNQIIKELN 139
L+L I E KE++ ++ E Q + + ++ + L I L I E
Sbjct: 101 LELKIFE-----KEIDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQ 155
Query: 140 QIIKELNQIV 149
+ Q
Sbjct: 156 AEVNAAYQEA 165
>gnl|CDD|236084 PRK07739, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 507
Score = 30.4 bits (69), Expect = 0.30
Identities = 13/42 (30%), Positives = 24/42 (57%), Gaps = 6/42 (14%)
Query: 108 NKLDQIKKELDQMVKELDLTIKELNQI---IKELNQIIKELN 146
L I+ +L E+D+T+KE+N + I +LN+ I ++
Sbjct: 160 QSLTDIQNDLKS---EIDVTVKEINSLASQISDLNKQIAKVE 198
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 30.4 bits (69), Expect = 0.34
Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 7/93 (7%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKE--LNLIIKESNQNKLDQIKKELD 118
N+L++ +K L EL+++ K+LD + + E L++K + ++ + +
Sbjct: 105 NQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVK 164
Query: 119 QMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
++ +E DL + Q I + + + +I E
Sbjct: 165 KIEEEADLEAERKAQNI-----LAQAMQRIASE 192
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 30.1 bits (68), Expect = 0.38
Identities = 29/140 (20%), Positives = 56/140 (40%), Gaps = 20/140 (14%)
Query: 20 FRMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQ 79
FR+ TE+ E ++ + EM+ E + KE + ++ L +
Sbjct: 226 FRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMDTE--------EDRTKEREAELEALQE 277
Query: 80 IIKELDLIIKESNQVIKELNLIIKESNQNKLDQ------------IKKELDQMVKELDLT 127
I EL+ I+E IK L IK+ N+ IKK +++ + +
Sbjct: 278 QIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENN 337
Query: 128 IKELNQIIKELNQIIKELNQ 147
+ +L ++ ++ + EL Q
Sbjct: 338 VAKLQALVVASSERLLELAQ 357
Score = 27.8 bits (62), Expect = 2.9
Identities = 28/118 (23%), Positives = 58/118 (49%), Gaps = 16/118 (13%)
Query: 47 IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN------- 99
E + +E + + IKK +++ + + + +L ++ S++ + EL
Sbjct: 305 EELTTVRQENEELEEEYK-IKKRTVELLPDAENNVAKLQALVVASSERLLELAQQWEAHR 363
Query: 100 --LI-----IKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVE 150
LI +KE N+NK D+ +++LD+ +K+L I+EL ++ Q+ K+L E
Sbjct: 364 TPLIDEYRSLKEKNRNKEDETQRQLDE-IKKLRNKIEELESELQTKEQLYKQLLDEYE 420
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.0 bits (68), Expect = 0.43
Identities = 20/93 (21%), Positives = 40/93 (43%), Gaps = 3/93 (3%)
Query: 62 KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQI--KKELDQ 119
+L Q ++ L + + Q ++L E + + Q +LD + E
Sbjct: 608 RLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQS 667
Query: 120 MVKELDLTIKE-LNQIIKELNQIIKELNQIVEE 151
+ +L+L I E Q +L Q+ +L Q++E+
Sbjct: 668 LKDKLELAIAERKQQAETQLRQLDAQLKQLLEQ 700
>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
family.
Length = 308
Score = 30.0 bits (68), Expect = 0.45
Identities = 19/82 (23%), Positives = 44/82 (53%), Gaps = 6/82 (7%)
Query: 68 KELDQIIKELDQIIKELDL--IIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELD 125
KE+ Q+++ LD + L L I++ ++++ L ++ + + +++ D ELD
Sbjct: 92 KEIRQLLESLDGPLLGLLLKVILEPLLELLELLLELLNDDDPLEVNDGGLIKDGFDPELD 151
Query: 126 LTIKELNQIIKELNQIIKELNQ 147
EL + ++EL + ++EL +
Sbjct: 152 ----ELREKLEELEEELEELLK 169
Score = 27.3 bits (61), Expect = 3.3
Identities = 12/62 (19%), Positives = 38/62 (61%)
Query: 70 LDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIK 129
L I++ L ++++ L ++ + + + +IK+ +LD+++++L+++ +EL+ +K
Sbjct: 110 LKVILEPLLELLELLLELLNDDDPLEVNDGGLIKDGFDPELDELREKLEELEEELEELLK 169
Query: 130 EL 131
+
Sbjct: 170 KE 171
>gnl|CDD|217011 pfam02388, FemAB, FemAB family. The femAB operon codes for two
nearly identical approximately 50-kDa proteins involved
in the formation of the Staphylococcal pentaglycine
interpeptide bridge in peptidoglycan. These proteins are
also considered as a factor influencing the level of
methicillin resistance.
Length = 408
Score = 29.9 bits (68), Expect = 0.45
Identities = 13/66 (19%), Positives = 36/66 (54%), Gaps = 4/66 (6%)
Query: 77 LDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIK 136
LD++ ++L+ + K+ ++ ++L +NKL +++++L + K +D E ++I
Sbjct: 244 LDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRID----EAKELIA 299
Query: 137 ELNQII 142
+ +
Sbjct: 300 KYGNEV 305
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 29.6 bits (67), Expect = 0.53
Identities = 17/89 (19%), Positives = 51/89 (57%), Gaps = 1/89 (1%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVK 122
+ KEL+ + ++ D + +E Q +K+L +++ + +LD+ K++L ++++
Sbjct: 160 YKLLMKELELLNSIKPKLRDRKDALEEELRQ-LKQLEDELEDCDPTELDRAKEKLKKLLQ 218
Query: 123 ELDLTIKELNQIIKELNQIIKELNQIVEE 151
E+ + +K+L ++ +EL ++ ++ + +
Sbjct: 219 EIMIKVKKLEELEEELQELESKIEDLTNK 247
>gnl|CDD|197699 smart00397, t_SNARE, Helical region found in SNAREs. All
alpha-helical motifs that form twisted and parallel
four-helix bundles in target soluble
N-ethylmaleimide-sensitive factor (NSF) attachment
protein (SNAP) receptor proteins. This motif found in
"Q-SNAREs".
Length = 66
Score = 27.9 bits (63), Expect = 0.53
Identities = 14/67 (20%), Positives = 33/67 (49%), Gaps = 1/67 (1%)
Query: 68 KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLT 127
++ +E D+ +++L+ I+E Q+ ++ ++E + LD+I+ +D L
Sbjct: 1 QQALAREEERDEELEQLEKSIQELKQIFLDMGTELEEQGEQ-LDRIEDNVDDADVNLKKA 59
Query: 128 IKELNQI 134
K L +
Sbjct: 60 NKRLKKA 66
Score = 24.9 bits (55), Expect = 8.1
Identities = 10/37 (27%), Positives = 21/37 (56%)
Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
++ +E D +++L + I+EL QI ++ +EE
Sbjct: 1 QQALAREEERDEELEQLEKSIQELKQIFLDMGTELEE 37
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 29.8 bits (68), Expect = 0.55
Identities = 19/125 (15%), Positives = 55/125 (44%), Gaps = 5/125 (4%)
Query: 23 FDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIK 82
DG + +E E E+E + D + + ++++++ + K
Sbjct: 170 IDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAK 229
Query: 83 ELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQII 142
+ + K + ++ + + K +++++L + +K L LT K++++++++L I
Sbjct: 230 QYKKLRKAQEKKVEG-----RLAQHKKYAKLREKLKEELKSLRLTSKQIDELVEQLRDIN 284
Query: 143 KELNQ 147
K +
Sbjct: 285 KRVRG 289
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 29.6 bits (67), Expect = 0.67
Identities = 32/128 (25%), Positives = 72/128 (56%), Gaps = 7/128 (5%)
Query: 29 VTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
+T GS + + LE IEE +E++ + K+ +++K L ++ KEL+++ +EL+ +
Sbjct: 661 ITGGSAKTNSSILERRREIEE--LEEKIEELEEKIAELEKALAELRKELEELEEELEQLR 718
Query: 89 KESNQVIKELN-----LIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIK 143
KE ++ ++++ L E+ +L++ +L + + EL+ I+EL + ++E + +
Sbjct: 719 KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELA 778
Query: 144 ELNQIVEE 151
E +EE
Sbjct: 779 EAEAEIEE 786
Score = 27.7 bits (62), Expect = 2.5
Identities = 23/96 (23%), Positives = 54/96 (56%), Gaps = 4/96 (4%)
Query: 59 DFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD 118
+L + +EL + +KE ++ ++EL ++E + ++EL L + E + ++++++KEL
Sbjct: 233 RLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEE-EIEELQKELY 291
Query: 119 QMVKE---LDLTIKELNQIIKELNQIIKELNQIVEE 151
+ E L+ + L + + L + ++EL +EE
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEE 327
Score = 27.7 bits (62), Expect = 2.8
Identities = 24/97 (24%), Positives = 49/97 (50%), Gaps = 5/97 (5%)
Query: 55 ECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK 114
+ L++ +EL + I+ L I+EL+ +I+E ++ L L + S + L ++
Sbjct: 835 ATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEAL-LNERASLEEALALLR 893
Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
EL+ EL ++EL EL + ++EL + + +
Sbjct: 894 SELE----ELSEELRELESKRSELRRELEELREKLAQ 926
Score = 27.7 bits (62), Expect = 3.2
Identities = 25/120 (20%), Positives = 56/120 (46%), Gaps = 2/120 (1%)
Query: 34 TEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQ 93
E I S E + +E + + ++ E + I+EL+ I++L +K +
Sbjct: 744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALRE 803
Query: 94 VIKELNLIIKESNQN--KLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+ EL + N+ L + + L++ + + +++L + I+EL++ I+ L +EE
Sbjct: 804 ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEE 863
>gnl|CDD|236364 PRK09034, PRK09034, aspartate kinase; Reviewed.
Length = 454
Score = 29.4 bits (67), Expect = 0.68
Identities = 13/63 (20%), Positives = 25/63 (39%), Gaps = 6/63 (9%)
Query: 81 IKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQ 140
K DL+I + V+ + ++ + I ++ KEL L L +I + L
Sbjct: 46 TKVTDLLILYAEAVLAGEDY------EDIFEAIIARYAEIAKELGLDADILEKIEEILEH 99
Query: 141 IIK 143
+
Sbjct: 100 LAN 102
>gnl|CDD|147975 pfam06103, DUF948, Bacterial protein of unknown function (DUF948).
This family consists of bacterial sequences several of
which are thought to be general stress proteins.
Length = 90
Score = 27.9 bits (63), Expect = 0.72
Identities = 20/74 (27%), Positives = 40/74 (54%), Gaps = 8/74 (10%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD---Q 119
L + K LD++ K L + K++D I E+ ++ + N+++++ N + LD Q
Sbjct: 21 LKSLSKTLDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLEDVNGK-----VETLDPVFQ 75
Query: 120 MVKELDLTIKELNQ 133
V +L ++ +LNQ
Sbjct: 76 AVADLGESVSDLNQ 89
Score = 26.4 bits (59), Expect = 3.1
Identities = 11/42 (26%), Positives = 24/42 (57%)
Query: 110 LDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
LD++ K L + K++D E ++ + N +++++N VE
Sbjct: 28 LDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLEDVNGKVET 69
Score = 25.6 bits (57), Expect = 5.1
Identities = 11/58 (18%), Positives = 32/58 (55%), Gaps = 1/58 (1%)
Query: 94 VIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+K L+ + E N+ L + K++D + E + + + N +++++N ++ L+ + +
Sbjct: 20 TLKSLSKTLDEVNKT-LAGLTKQVDGITTETEDLLAKTNVLLEDVNGKVETLDPVFQA 76
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 29.3 bits (66), Expect = 0.90
Identities = 21/117 (17%), Positives = 47/117 (40%), Gaps = 7/117 (5%)
Query: 34 TEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQ 93
+NLE QE+ + +L Q +++L +I+ L Q+ E + E +
Sbjct: 62 RSLQALNLE-----LIQELNELKARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFE 116
Query: 94 VIKELNLIIKESNQNKLDQIKKELDQMVKEL--DLTIKELNQIIKELNQIIKELNQI 148
+ + + N +L + +E+ + +E + + L +I + L +I
Sbjct: 117 ELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHESAEERSTLLEEIDRLLGEI 173
>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
[Function unknown].
Length = 827
Score = 29.3 bits (66), Expect = 0.93
Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 15/113 (13%)
Query: 54 KECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKE--SNQVIKELNLIIKES-----N 106
K ++D + L + L ++ KE D+I+K+L+ + E + + K K+S +
Sbjct: 234 KVGEIDSDVLCRDLGTLQELYKERDKILKKLEKALNELLNKLLKKSHLKTNKKSGKLTPS 293
Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQII--KELNQI------IKELNQIVEE 151
+ K +I E K IK+LN+ I K+++ I + +L+ +E
Sbjct: 294 RKKEFEILPEYVPDKKRPKHRIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIEN 346
>gnl|CDD|214860 smart00856, PMEI, Plant invertase/pectin methylesterase inhibitor.
This domain inhibits pectin methylesterases (PMEs) and
invertases through formation of a non-covalent 1:1
complex. It has been implicated in the regulation of
fruit development, carbohydrate metabolism and cell wall
extension. It may also be involved in inhibiting
microbial pathogen PMEs. It has been observed that it is
often expressed as a large inactive preprotein. It is
also found at the N-termini of PMEs predicted from DNA
sequences, suggesting that both PMEs and their
inhibitors are expressed as a single polyprotein and
subsequently processed. It has two disulphide bridges
and is mainly alpha-helical.
Length = 148
Score = 28.5 bits (64), Expect = 0.94
Identities = 8/57 (14%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 84 LDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQ 140
+ + + ++ + + ++ ++K++ +L K L ++ D + L + ++EL
Sbjct: 43 IKVALSQATKTLSFISKLLKKTKDPRL---KAALKDCLELYDDAVDSLEKALEELKS 96
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 28.8 bits (65), Expect = 1.1
Identities = 17/75 (22%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 73 IIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELN 132
++ E+ ++ K L +I E+ + E N +++ Q+ QI++ + +EL I++L
Sbjct: 64 LVAEVKELRKRLAKLISENEALKAE-NERLQKREQSIDQQIQQAVQSETQELTKEIEQLK 122
Query: 133 QIIKELNQIIKELNQ 147
++L +I +L +
Sbjct: 123 SERQQLQGLIDQLQR 137
>gnl|CDD|237076 PRK12362, PRK12362, germination protease; Provisional.
Length = 318
Score = 28.7 bits (65), Expect = 1.1
Identities = 23/66 (34%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 78 DQIIKELDLIIKESNQVIKELNLIIKESNQN-KLDQIKKELDQMVKELDLTIKELNQIIK 136
D I LD +I +S + E ++K NQ K IK+ L V L +T KE++++I+
Sbjct: 240 DTIDLVLDELINQSEEG-SEFYNMLKSLNQEEKQQLIKEVLSPYVGNLIVTPKEIDELIE 298
Query: 137 ELNQII 142
L++II
Sbjct: 299 NLSKII 304
>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
kinases [Cell motility and secretion / Signal
transduction mechanisms].
Length = 716
Score = 28.8 bits (65), Expect = 1.1
Identities = 10/33 (30%), Positives = 24/33 (72%)
Query: 106 NQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
+++L Q+ ELD++ +ELD +++L+++ +L
Sbjct: 348 TRSRLSQLASELDEVREELDEALRQLSRLTTDL 380
>gnl|CDD|236679 PRK10387, PRK10387, glutaredoxin 2; Provisional.
Length = 210
Score = 28.3 bits (64), Expect = 1.2
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKEL 98
D + +IKE++ ++ LD +I + N V EL
Sbjct: 133 FDALLAHTPGLIKEINADLRALDPLIVKPNAVNGEL 168
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 28.9 bits (65), Expect = 1.2
Identities = 24/125 (19%), Positives = 63/125 (50%), Gaps = 9/125 (7%)
Query: 24 DGLTPVT---DGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQI 80
+G+ VT G+ +E I ++ ++ + + E+E+ K + K++ ++I +
Sbjct: 476 NGILSVTAKDKGTGKEQSITIQGASTLPKDEVERMVKEAEKNAAEDKEKREKI-----DL 530
Query: 81 IKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL-TIKELNQIIKELN 139
+ + + ++ + +KEL I E + K++ + K+L Q ++ + +IK L + +++
Sbjct: 531 KNQAESLCYQAEKQLKELKDKISEEKKEKIENLIKKLRQALQNDNYESIKSLLEELQKAL 590
Query: 140 QIIKE 144
I +
Sbjct: 591 MEIGK 595
>gnl|CDD|203795 pfam07889, DUF1664, Protein of unknown function (DUF1664). The
members of this family are hypothetical plant proteins
of unknown function. The region featured in this family
is approximately 100 amino acids long.
Length = 125
Score = 27.8 bits (62), Expect = 1.3
Identities = 15/83 (18%), Positives = 34/83 (40%), Gaps = 4/83 (4%)
Query: 52 MEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLD 111
M +L+++ + + K L Q I LD + E ++ + + E + L
Sbjct: 40 MSDAVASVGKQLEKVSESIAAAKKHLSQRIDNLDDKLDEQKEISESTRDEVTEI-REDLS 98
Query: 112 QIKKELD---QMVKELDLTIKEL 131
I +++ Q V+ L+ + +
Sbjct: 99 NIGEDVKSVHQAVEGLEGKLDSI 121
>gnl|CDD|205914 pfam13740, ACT_6, ACT domain. ACT domains bind to amino acids and
regulate associated enzyme domains.
Length = 76
Score = 27.1 bits (61), Expect = 1.3
Identities = 13/59 (22%), Positives = 28/59 (47%), Gaps = 3/59 (5%)
Query: 73 IIKELDQIIKELDLIIKESNQVI--KELNLIIK-ESNQNKLDQIKKELDQMVKELDLTI 128
+ L +++ E I +S Q + EL L + + + +++K L + +EL L +
Sbjct: 15 LTASLTRVLAEHGCNILDSRQAVIGGELTLGMLVSGPWDAIARLEKALLFLAQELGLQV 73
>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
Length = 780
Score = 28.6 bits (64), Expect = 1.4
Identities = 24/120 (20%), Positives = 49/120 (40%), Gaps = 8/120 (6%)
Query: 25 GLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKEL 84
GLT +TDG + + GI E K +L++ E + EL+ +++L
Sbjct: 228 GLTTLTDGLNQLDSGLGTLAAGIGE------LKQGAEQLNEGIGEFSSGLSELNSGVQDL 281
Query: 85 DLIIKESNQVIKEL--NLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQII 142
+ + NQ I L L + +S ++ +++ L Q+ + L + + +
Sbjct: 282 AAGVPQLNQGISALAAGLSLPDSLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLSSL 341
>gnl|CDD|191096 pfam04799, Fzo_mitofusin, fzo-like conserved region. Family of
putative transmembrane GTPase. The fzo protein is a
mediator of mitochondrial fusion. This conserved region
is also found in the human mitofusin protein.
Length = 171
Score = 28.1 bits (63), Expect = 1.5
Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 4/57 (7%)
Query: 96 KELNLIIKESNQNKLDQIKKELDQMVKEL----DLTIKELNQIIKELNQIIKELNQI 148
++L LI+ ++ N Q+++EL L D+T K+L + I EL + I+ L I
Sbjct: 86 EKLQLIVSLTSSNCSHQVQQELASTFARLCQQVDVTSKDLEEEIAELTKEIQRLETI 142
>gnl|CDD|182747 PRK10807, PRK10807, paraquat-inducible protein B; Provisional.
Length = 547
Score = 28.5 bits (64), Expect = 1.6
Identities = 22/103 (21%), Positives = 51/103 (49%), Gaps = 21/103 (20%)
Query: 63 LDQIKK-ELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMV 121
LD+I L+ +I++ + E ++E + LN I S+Q+ + Q+ ++ + +
Sbjct: 426 LDKINNLPLNPMIEQATSTLSESQRTMRELQTTLDSLNKIT--SSQS-MQQLPADMQKTL 482
Query: 122 KELDLTIK-----------------ELNQIIKELNQIIKELNQ 147
+EL+ +++ L+Q+++EL ++K LN+
Sbjct: 483 RELNRSMQGFQPGSPAYNKMVADMQRLDQVLRELQPVLKTLNE 525
Score = 26.5 bits (59), Expect = 6.3
Identities = 12/40 (30%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 110 LDQIKK-ELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
LD+I L+ M+++ T+ E + ++EL + LN+I
Sbjct: 426 LDKINNLPLNPMIEQATSTLSESQRTMRELQTTLDSLNKI 465
>gnl|CDD|217372 pfam03105, SPX, SPX domain. We have named this region the SPX
domain after (SYG1, Pho81 and XPR1). This 180 residue
length domain is found at the amino terminus of a
variety of proteins. In the yeast protein SYG1, the
N-terminus directly binds to the G- protein beta subunit
and inhibits transduction of the mating pheromone
signal. This finding suggests that all the members of
this family are involved in G-protein associated signal
transduction. The N-termini of several proteins involved
in the regulation of phosphate transport, including the
putative phosphate level sensors PHO81 from
Saccharomyces cerevisiae and NUC-2 from Neurospora
crassa, are also members of this family. The SPX domain
of S. cerevisiae low-affinity phosphate transporters
Pho87 and Pho90 auto-regulates uptake and prevents
efflux. This SPX dependent inhibition is mediated by the
physical interaction with Spl2 NUC-2 contains several
ankyrin repeats pfam00023. Several members of this
family are annotated as XPR1 proteins: the xenotropic
and polytropic retrovirus receptor confers
susceptibility to infection with murine leukaemia
viruses (MLV). The similarity between SYG1, phosphate
regulators and XPR1 sequences has been previously noted,
as has the additional similarity to several predicted
proteins, of unknown function, from Drosophila
melanogaster, Arabidopsis thaliana, Caenorhabditis
elegans, Schizosaccharomyces pombe, and Saccharomyces
cerevisiae. In addition, given the similarities between
XPR1 and SYG1 and phosphate regulatory proteins, it has
been proposed that XPR1 might be involved in G-protein
associated signal transduction and may itself function
as a phosphate sensor.
Length = 176
Score = 27.8 bits (62), Expect = 1.6
Identities = 24/126 (19%), Positives = 55/126 (43%), Gaps = 13/126 (10%)
Query: 24 DGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQ-IKKELDQIIKELDQIIK 82
LT DG E S + E +E E + + K++ K++ ++++ L+++ K
Sbjct: 37 LSLTTNEDGEDESSNLLNEEELEEDESEFFEALDKELEKVNSFYKEKEAELLERLEELEK 96
Query: 83 ELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQII 142
+L+ + + ++ + ++ +++KK L EL + ++ L I
Sbjct: 97 QLEELEERRDETSSAKSSPSDKNLLKAFEELKKAL-----------LELYRELQLLKSYI 145
Query: 143 KELNQI 148
ELN+
Sbjct: 146 -ELNRT 150
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 28.5 bits (64), Expect = 1.7
Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 5/67 (7%)
Query: 71 DQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKE 130
D +++EL I +L IKE N +LI S + +LDQ+ K+L ++ E + +E
Sbjct: 135 DPVVQELVSIFNDLIDSIKEDNLKDDLESLI--ASAKEELDQLSKKLAELKAEEE---EE 189
Query: 131 LNQIIKE 137
L + +KE
Sbjct: 190 LERALKE 196
Score = 27.7 bits (62), Expect = 3.0
Identities = 19/106 (17%), Positives = 45/106 (42%), Gaps = 17/106 (16%)
Query: 53 EKECKVDFNKL-DQIKKELDQIIKELDQIIKELD----LIIKES---------NQVIKEL 98
E K D L K+ELDQ+ K+L ++ E + +KE +++ L
Sbjct: 154 EDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARL 213
Query: 99 NLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKE 144
+ + ++E +++ K+ + ++L Q ++ + ++
Sbjct: 214 ESKEAALEKQLRLEFEREKEELRKKYE---EKLRQELERQAEAHEQ 256
>gnl|CDD|234403 TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target.
Members of this protein family include secretion targets
for the two main variants of type VII secretion systems
(T7SS), one found in the Actinobacteria, one found in
the Firmicutes. This model was derived through iteration
from pfam06013. The best characterized member of this
family is ESAT-6 from Mycobacterium tuberculosis.
Members of this family usually are ~100 amino acids in
length but occasionally have a long C-terminal
extension.
Length = 90
Score = 27.2 bits (61), Expect = 1.7
Identities = 18/85 (21%), Positives = 32/85 (37%), Gaps = 10/85 (11%)
Query: 57 KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKE 116
KVD +L Q + +EL Q+++ L I L + + + ++
Sbjct: 3 KVDPEELRQAAARFGKTAQELQQLLQRLR-------SEIDALQGTWEGAAADAFQAEFQQ 55
Query: 117 LDQMVKELDLTIKELNQIIKELNQI 141
D + L I L +I + L Q
Sbjct: 56 WDPAAERL---INALQEISEALRQA 77
>gnl|CDD|227109 COG4768, COG4768, Uncharacterized protein containing a divergent
version of the methyl-accepting chemotaxis-like domain
[General function prediction only].
Length = 139
Score = 27.8 bits (62), Expect = 1.7
Identities = 21/78 (26%), Positives = 42/78 (53%), Gaps = 2/78 (2%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVK 122
L ++ K LD++ K L + ++D I E+ +++ + N + E Q K+ + D VK
Sbjct: 26 LKKVSKTLDEVAKTLKGLTSQVDGITHETEELLHKTN-TLAEDVQGKVATLDPVFDA-VK 83
Query: 123 ELDLTIKELNQIIKELNQ 140
+L ++ +LNQ ++ L
Sbjct: 84 DLGQSVSDLNQSVRHLAT 101
>gnl|CDD|233894 TIGR02492, flgK_ends, flagellar hook-associated protein FlgK. The
flagellar hook-associated protein FlgK of bacterial
flagella has conserved N- and C-terminal domains. The
central region is highly variable in length and
sequence, and often contains substantial runs of
low-complexity sequence. This model is built from an
alignment of FlgK sequences with the central region
excised. Note that several other proteins of the
flagellar apparatus also are homologous in the N- and
C-terminal regions to FlgK, but are excluded from this
model [Cellular processes, Chemotaxis and motility].
Length = 324
Score = 28.1 bits (63), Expect = 1.7
Identities = 9/41 (21%), Positives = 25/41 (60%), Gaps = 3/41 (7%)
Query: 108 NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
++L ++K ++ E+ + E+N ++K++ + KE+ Q+
Sbjct: 148 SELQDLRKNINA---EIKSAVTEINSLLKQIASLNKEIKQV 185
>gnl|CDD|203375 pfam06013, WXG100, Proteins of 100 residues with WXG. ESAT-6 is a
small protein appears to be of fundamental importance in
virulence and protective immunity in Mycobacterium
tuberculosis. Homologues have been detected in other
Gram-positive bacterial species. It may represent a
novel secretion system potentially driven by the
pfam01580 domains in the YukA-like proteins.
Length = 85
Score = 26.9 bits (60), Expect = 1.8
Identities = 16/82 (19%), Positives = 31/82 (37%), Gaps = 11/82 (13%)
Query: 57 KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIK----ESNQNKLDQ 112
KVD +L Q + +EL+ I++ L + L ++ Q +
Sbjct: 3 KVDPEELRQAAARFQKAAQELEDILQRLK-------AQVAALQGGWSGAAADAFQAAFAE 55
Query: 113 IKKELDQMVKELDLTIKELNQI 134
K +++V L+ + L Q
Sbjct: 56 WKPAAEKLVDALEEISEALKQA 77
>gnl|CDD|224493 COG1577, ERG12, Mevalonate kinase [Lipid metabolism].
Length = 307
Score = 28.0 bits (63), Expect = 1.8
Identities = 13/74 (17%), Positives = 37/74 (50%), Gaps = 1/74 (1%)
Query: 68 KELDQIIKELDQIIKEL-DLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL 126
KEL + +L + E+ D I+ ++++E ++ + +L ++ ++K L +
Sbjct: 185 KELVAGVAKLLEEEPEVIDPILDAIGELVQEAEAALQTGDFEELGELMNINQGLLKALGV 244
Query: 127 TIKELNQIIKELNQ 140
+ EL+++++
Sbjct: 245 STPELDELVEAARS 258
>gnl|CDD|219890 pfam08537, NBP1, Fungal Nap binding protein NBP1. NBP1 is a
nuclear protein which has been shown in Saccharomyces
cerevisiae to be essential for the G2/M transition of
the cell cycle.
Length = 313
Score = 28.3 bits (63), Expect = 1.8
Identities = 14/39 (35%), Positives = 28/39 (71%), Gaps = 1/39 (2%)
Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKEL 145
Q K+D+++++L ++ +EL T K+L Q +KE N +++ L
Sbjct: 172 QKKIDKLEEKLAELEQELQSTQKKL-QFVKEKNHLLESL 209
>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
chaperones [Posttranslational modification, protein
turnover, chaperones].
Length = 786
Score = 28.4 bits (64), Expect = 1.8
Identities = 17/59 (28%), Positives = 37/59 (62%)
Query: 68 KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL 126
+ELD++ +EL Q+ E + + +E ++ K+L I + + ++ +++KEL+ V E D+
Sbjct: 401 EELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDEDDI 459
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 28.1 bits (63), Expect = 1.8
Identities = 17/89 (19%), Positives = 41/89 (46%), Gaps = 6/89 (6%)
Query: 62 KLDQIKKELDQIIKELDQIIKEL--DLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQ 119
++I+K L + ++ + K+L ++E + ++ + N +L +IK+E D+
Sbjct: 165 TAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIK-KKDIKNPLELQKIKEEFDK 223
Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQI 148
+ KE + +I N + + Q+
Sbjct: 224 LKKEGK---ADKQKIKSAKNDLQNDKKQL 249
>gnl|CDD|182386 PRK10334, PRK10334, mechanosensitive channel MscS; Provisional.
Length = 286
Score = 28.0 bits (62), Expect = 1.9
Identities = 15/65 (23%), Positives = 36/65 (55%), Gaps = 9/65 (13%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNK---------LDQI 113
+DQ+K+ L II+ D+I+K+ ++ ++ + +N +++ + + L++I
Sbjct: 198 IDQVKQILTNIIQSEDRILKDREMTVRLNELGASSINFVVRVWSNSGDLQNVYWDVLERI 257
Query: 114 KKELD 118
K+E D
Sbjct: 258 KREFD 262
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 28.2 bits (63), Expect = 1.9
Identities = 21/110 (19%), Positives = 59/110 (53%)
Query: 32 GSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKES 91
E E+ E S EE E+ +E + +L ++++EL ++++ + + +EL+ +++
Sbjct: 479 YELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKL 538
Query: 92 NQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQI 141
+++EL + ++ +L + ++L+ ++EL ++EL + ++
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEEL 588
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 28.4 bits (63), Expect = 2.0
Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 7/90 (7%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLII-KESNQNKLDQIKKELDQ 119
++ +K+ ++L++ ++ E ++E N E N L++ K +
Sbjct: 413 ERIKNFEKQKKSSKEQLEK------FLVNEFKSDVQEYNKYCGLEKKINNLEKEIKNNQE 466
Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQIV 149
VK+L IKE+ + + + I+KE+NQ +
Sbjct: 467 EVKKLSNEIKEIEKFLVSIKPIVKEINQTL 496
>gnl|CDD|112953 pfam04163, Tht1, Tht1-like nuclear fusion protein.
Length = 554
Score = 28.2 bits (62), Expect = 2.0
Identities = 23/112 (20%), Positives = 43/112 (38%), Gaps = 1/112 (0%)
Query: 21 RMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELD-QIIKELDQ 79
F + E I LE + E Q ++E KV+ +++ + ++L+ +I
Sbjct: 206 GKFRNSDIELRFADENFRIILEIMDLSEFQNHQQEGKVNQDEILETIEQLESEIFDINSF 265
Query: 80 IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKEL 131
++ N +I++L N L I D + L+ IK L
Sbjct: 266 FANFIEESAGVHNHIIEDLADEFASENAIALSAIGDLKDDFLAFLEDRIKNL 317
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 28.1 bits (63), Expect = 2.0
Identities = 18/79 (22%), Positives = 45/79 (56%), Gaps = 8/79 (10%)
Query: 74 IKELDQIIKELDLIIKESNQVIKELNLIIKE-----SNQNKLDQIKKELDQMVKELDLTI 128
+KEL++ KE ++ K + ++I++ ++E + K+++ ++ QM + L+ +
Sbjct: 93 LKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEE--KQM-EILEKAL 149
Query: 129 KELNQIIKELNQIIKELNQ 147
K +I+KE N+ ++ L +
Sbjct: 150 KSYLKIVKEENKSLQRLAK 168
>gnl|CDD|234955 PRK01433, hscA, chaperone protein HscA; Provisional.
Length = 595
Score = 27.9 bits (62), Expect = 2.1
Identities = 23/108 (21%), Positives = 51/108 (47%), Gaps = 11/108 (10%)
Query: 39 INLESSNGIEEQEMEK--ECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIK 96
I ++ ++GI++ E++ E K+D + L + + E + +I + + I
Sbjct: 474 IEVKPNHGIDKTEIDIMLENAYKNAKIDYTTRLLQEAVIEAEALIFNI-------ERAIA 526
Query: 97 ELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKE 144
EL ++ ES + ++ + + + V D+ + +N IKE IK+
Sbjct: 527 ELTTLLSESEISIINSLLDNIKEAVHARDIIL--INNSIKEFKSKIKK 572
>gnl|CDD|235277 PRK04309, PRK04309, DNA-directed RNA polymerase subunit A'';
Validated.
Length = 383
Score = 27.9 bits (63), Expect = 2.1
Identities = 11/45 (24%), Positives = 28/45 (62%)
Query: 94 VIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
+ L +++++ ++K+EL + ++E LT +E+ +II+E+
Sbjct: 3 SEETLEEKLEDASLELPQKLKEELREKLEERKLTEEEVEEIIEEV 47
>gnl|CDD|223377 COG0300, DltE, Short-chain dehydrogenases of various substrate
specificities [General function prediction only].
Length = 265
Score = 27.6 bits (62), Expect = 2.2
Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 7/49 (14%)
Query: 99 NLIIKESNQNKLDQIKKELDQM------VKELDLT-IKELNQIIKELNQ 140
NLI+ ++KL+ + KEL+ V DL+ + L ++ EL +
Sbjct: 32 NLILVARREDKLEALAKELEDKTGVEVEVIPADLSDPEALERLEDELKE 80
>gnl|CDD|153165 cd04893, ACT_GcvR_1, ACT domains that comprise the Glycine Cleavage
System Transcriptional Repressor (GcvR) protein, and
other related domains. This CD includes the first of
the two ACT domains that comprise the Glycine Cleavage
System Transcriptional Repressor (GcvR) protein, and
other related domains. The glycine cleavage enzyme
system in Escherichia coli provides one-carbon units for
cellular methylation reactions. This enzyme system,
encoded by the gcvTHP operon and lpd gene, catalyzes the
cleavage of glycine into CO2 + NH3 and transfers a
one-carbon unit to tetrahydrofolate, producing
5,10-methylenetetrahydrofolate. The gcvTHP operon is
activated by the GcvA protein in response to glycine and
repressed by a GcvA/GcvR interaction in the absence of
glycine. It has been proposed that the co-activator
glycine acts through a mechanism of de-repression by
binding to GcvR and preventing GcvR from interacting
with GcvA to block GcvA's activator function. Evidence
also suggests that GcvR interacts directly with GcvA
rather than binding to DNA to cause repression. Members
of this CD belong to the superfamily of ACT regulatory
domains.
Length = 77
Score = 26.5 bits (59), Expect = 2.2
Identities = 15/59 (25%), Positives = 29/59 (49%), Gaps = 3/59 (5%)
Query: 73 IIKELDQIIKELDLIIKESNQVI--KELNLI-IKESNQNKLDQIKKELDQMVKELDLTI 128
I+ EL + + E I +S I E L + E + + + +++ L + + LDLT+
Sbjct: 14 ILNELTRAVSESGCNILDSRMAILGTEFALTMLVEGSWDAIAKLEAALPGLARRLDLTL 72
>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
The binding of the LYPxL motif of late HIV p6Gag and
EIAV p9Gag to this domain is necessary for viral
budding.This domain is generally central between an
N-terminal Bro1 domain, pfam03097 and a C-terminal
proline-rich domain. The retroviruses thus used this
domain to hijack the ESCRT system of the cell.
Length = 292
Score = 27.7 bits (62), Expect = 2.2
Identities = 15/92 (16%), Positives = 45/92 (48%), Gaps = 9/92 (9%)
Query: 69 ELDQIIKELDQIIKELDLIIKESNQVIKELNL---------IIKESNQNKLDQIKKELDQ 119
E+ + I L +++ +L+ + E ++++EL ++ N+ ++ ++
Sbjct: 138 EVSEQISRLRELLNKLNELKAEREKLLEELKEKAQDDDISKLLIALNKLGSSNEEQLFEE 197
Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+K+ D + + Q + + +++KE+ + EE
Sbjct: 198 ELKKFDPLQERVEQNLSKQEELLKEIQEANEE 229
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 27.7 bits (62), Expect = 2.3
Identities = 21/91 (23%), Positives = 46/91 (50%), Gaps = 5/91 (5%)
Query: 61 NKLDQIKKELDQIIKEL-DQIIKELDLIIKES---NQVIKELNLIIKESNQNKLDQIKKE 116
+ + E Q +K+L + K L+ I N +++L +E + L Q++++
Sbjct: 226 KAISKSLPETKQYLKKLSQDLSKALEKISSREKHLNNQLEQLVQEYRE-ARRTLSQVQEK 284
Query: 117 LDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
+Q + + +ELN+I +EL Q+ +E+ +
Sbjct: 285 YNQASQGVSELTRELNEISEELEQVKQEMEE 315
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 27.9 bits (62), Expect = 2.5
Identities = 21/73 (28%), Positives = 42/73 (57%), Gaps = 3/73 (4%)
Query: 62 KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNK--LDQIKKELDQ 119
L +++KE ++ + +D IIK+++ + K N +IK LN+ I SN NK ++ +
Sbjct: 850 NLKELEKEFNENNQIVDNIIKDIENMNKNIN-IIKTLNIAINRSNSNKQLVEHLLNNKID 908
Query: 120 MVKELDLTIKELN 132
+ +L+ +K +N
Sbjct: 909 LKNKLEQHMKIIN 921
>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
topoisomerase IV), A subunit [DNA replication,
recombination, and repair].
Length = 804
Score = 27.6 bits (62), Expect = 2.6
Identities = 19/73 (26%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 64 DQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKE 123
++I+KEL ++ KE+ + K L + + + KEL I K+ + +I +E + +++
Sbjct: 437 EKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERRTEIVEEEEDEIED 496
Query: 124 LDLTIKELNQIIK 136
DL I E + ++
Sbjct: 497 EDL-IAEEDVVVT 508
>gnl|CDD|212126 cd10815, GH38N_AMII_EcMngB_like, N-terminal catalytic domain of
Escherichia coli alpha-mannosidase MngB and its
bacterial homologs; glycoside hydrolase family 38
(GH38). The bacterial subfamily is represented by
Escherichia coli alpha-mannosidase MngB, which is
encoded by the mngB gene (previously called ybgG). MngB
exhibits alpha-mannosidase activity that converts
2-O-(6-phospho-alpha-mannosyl)-D-glycerate to
mannose-6-phosphate and glycerate in the pathway which
enables use of mannosyl-D-glycerate as a sole carbon
source. A divalent metal ion is required for its
activity.
Length = 270
Score = 27.5 bits (62), Expect = 2.7
Identities = 18/56 (32%), Positives = 31/56 (55%), Gaps = 7/56 (12%)
Query: 64 DQ--IKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKEL 117
DQ I+K L ++I+EL++I + + +I + K L E N++ L I+ EL
Sbjct: 216 DQMPIRKNLPEVIEELNEISPDYEYVISSYEEFFKAL-----EKNKDLLPTIEGEL 266
>gnl|CDD|185112 PRK15190, PRK15190, fimbrial protein BcfE; Provisional.
Length = 181
Score = 27.4 bits (60), Expect = 2.7
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 12 LRRHYFALFRMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQ 65
+ R + ALF M L P + S ++ NLE + I Q C +D + L Q
Sbjct: 1 MNRSHCALFAMGLMLCPQINASADDLAYNLEFTGTIVAQ----TCDIDISSLSQ 50
>gnl|CDD|220387 pfam09763, Sec3, Exocyst complex component Sec3. This entry is the
conserved middle and C-terminus of the Sec3 protein.
Sec3 binds to the C-terminal cytoplasmic domain of GLYT1
(glycine transporter protein 1). Sec3 is the exocyst
component that is closest to the plasma membrane docking
site and it serves as a spatial landmark in the plasma
membrane for incoming secretory vesicles. Sec3 is
recruited to the sites of polarised membrane growth
through its interaction with Rho1p, a small GTP-binding
protein.
Length = 691
Score = 27.7 bits (62), Expect = 2.8
Identities = 16/82 (19%), Positives = 39/82 (47%), Gaps = 11/82 (13%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK------ 114
+++++ K LD+ + E D++ L L E + + +++ I +QN Q++
Sbjct: 30 KRVNELMKYLDEALAECDELDPTLSLYSMELSSLSEDIEYI---ESQNNGLQVQSANQKL 86
Query: 115 --KELDQMVKELDLTIKELNQI 134
EL+ ++ + L +L +
Sbjct: 87 LWNELESLLNTVSLPESDLRAL 108
>gnl|CDD|215787 pfam00206, Lyase_1, Lyase.
Length = 312
Score = 27.7 bits (62), Expect = 2.8
Identities = 26/103 (25%), Positives = 43/103 (41%), Gaps = 19/103 (18%)
Query: 67 KKELDQIIKELDQIIKELDL----IIKES------------NQVIKELNLIIKESNQNKL 110
K+E IIK LD++ +E +K N+VI EL + N +K+
Sbjct: 47 KEEAAAIIKALDEVAEEGKADDAFPLKVIQEGSGTAVNMNLNEVIGELLGQLVHPN-DKV 105
Query: 111 DQIKKELDQMVKELDLTIKE--LNQIIKELNQIIKELNQIVEE 151
+ DQ+ L L +K ++ L Q+I L + +E
Sbjct: 106 HTGQSSNDQVPTALRLALKLALSEVLLPALGQLIDALKEKAKE 148
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 27.2 bits (61), Expect = 2.9
Identities = 14/65 (21%), Positives = 40/65 (61%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM 120
++L+Q K+L++ +EL++ + EL+ ++ + +E I ++ + +++ +KK+ Q+
Sbjct: 123 SELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQL 182
Query: 121 VKELD 125
+L+
Sbjct: 183 KSQLE 187
>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
Length = 663
Score = 27.5 bits (61), Expect = 2.9
Identities = 29/110 (26%), Positives = 58/110 (52%), Gaps = 7/110 (6%)
Query: 31 DGST-EESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIK 89
D ST ++ EI ++SS G+ ++E+EK K + KE D+ KEL E + +I
Sbjct: 524 DKSTGKKQEITIQSSGGLSDEEIEKMVKE-----AEEYKEQDEKKKELVDAKNEAETLIY 578
Query: 90 ESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL-TIKELNQIIKEL 138
+ + +L I ++++++L Q +L + D+ +IK+ + ++E
Sbjct: 579 SVEKQLSDLKDKISDADKDELKQKITKLRSTLSSEDVDSIKDKTKQLQEA 628
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 27.1 bits (60), Expect = 3.1
Identities = 11/75 (14%), Positives = 34/75 (45%), Gaps = 6/75 (8%)
Query: 77 LDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIK 136
+ +I + D + K +++ E + E KE D+ ++EL + + ++
Sbjct: 3 IKEIRNKTDELEKLKDEIKAEEEALESEEK------HLKEYDKEMEELLEEKMQHVEELR 56
Query: 137 ELNQIIKELNQIVEE 151
+++ I ++ +++
Sbjct: 57 QIHADINDMETEIKQ 71
Score = 26.4 bits (58), Expect = 6.4
Identities = 12/62 (19%), Positives = 30/62 (48%)
Query: 36 ESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVI 95
+ EI E E++ KE + +L + K + + ++++ I +++ IK+S +
Sbjct: 17 KDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQIHADINDMETEIKQSKSEL 76
Query: 96 KE 97
+
Sbjct: 77 ER 78
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 27.3 bits (61), Expect = 3.5
Identities = 19/145 (13%), Positives = 61/145 (42%), Gaps = 6/145 (4%)
Query: 9 CLRLRRHYFALFRMFDGLTPVTDGSTE----ESEINLESSNGIEEQEMEKECKVDFNKLD 64
+ L + D L V E+ + +E+ K + ++++
Sbjct: 53 EVAQISLSSLLSEVLDYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVE 112
Query: 65 QIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKEL 124
++ ++L++ + ELD+ +++L+ +++E + + L+ + K ++ L + ++L
Sbjct: 113 ELTRKLEERLSELDEELEDLEDLLEEL-EPLAYLDFDLSLLRGLKFLLVRLGLVR-REKL 170
Query: 125 DLTIKELNQIIKELNQIIKELNQIV 149
+ + + + + ++ IV
Sbjct: 171 EALVGVIEDEVALYGENVEASVVIV 195
>gnl|CDD|204831 pfam12137, RapA_C, RNA polymerase recycling family C-terminal.
This domain is found in bacteria. This domain is about
360 amino acids in length. This domain is found
associated with pfam00271, pfam00176. The function of
this domain is not known, but structurally it forms an
alpha-beta fold in nature with a central beta-sheet
flanked by helices and loops, the beta-sheet being
mainly antiparallel and flanked by four alpha helices,
among which the two longer helices exhibit a coiled-coil
arrangement.
Length = 362
Score = 27.1 bits (61), Expect = 3.7
Identities = 19/100 (19%), Positives = 50/100 (50%), Gaps = 11/100 (11%)
Query: 57 KVDFNKLDQ----IKKELD-QIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLD 111
+V F L++ + + +++K L I++L + K ++L II+++ Q
Sbjct: 251 QVSFESLNKQLSPVNRHTARKLVKALRPQIEQL--LQKAEKLAEEQLEEIIEQAKQQMQQ 308
Query: 112 QIKKELDQMV--KELDLTIK--ELNQIIKELNQIIKELNQ 147
Q+ EL ++ K ++ I+ E+ + ++ +++ L++
Sbjct: 309 QLSAELARLKALKAVNPNIRDEEIEALEQQREELLTALDK 348
>gnl|CDD|219289 pfam07083, DUF1351, Protein of unknown function (DUF1351). This
family consists of several bacterial and phage proteins
of around 230 residues in length. The function of this
family is unknown.
Length = 220
Score = 26.9 bits (60), Expect = 3.8
Identities = 20/96 (20%), Positives = 47/96 (48%), Gaps = 3/96 (3%)
Query: 57 KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKE 116
++F ++ +K +D+++ + + + K+ + ELN +IK N + + IKK+
Sbjct: 11 TIEFLDFEEFEKAIDEVVAKYKGYVVTEE-NYKDDKKERAELNKLIKALNDKRKE-IKKQ 68
Query: 117 LDQMVKELDLTIKELNQIIKE-LNQIIKELNQIVEE 151
+ E + +K+L IK ++ I + + E+
Sbjct: 69 YSKPADEFEAYMKKLIAPIKAVIDLIDSGIKEFEEK 104
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 27.1 bits (61), Expect = 3.8
Identities = 18/97 (18%), Positives = 49/97 (50%), Gaps = 16/97 (16%)
Query: 63 LDQIKKELD---QIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQI------ 113
L+++++ L ++K L++I + +I+ S + K +I+E ++ + D +
Sbjct: 329 LEKLEERLHILEGLLKALNKI-DFVIEVIRGSIDLKKAKKELIEELSEIQADYLLDMRLR 387
Query: 114 ------KKELDQMVKELDLTIKELNQIIKELNQIIKE 144
++L++ ++EL+ I EL +I+ ++ +
Sbjct: 388 RLTKEEIEKLEKEIEELEKEIAELEKILASEKKLWIK 424
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 27.0 bits (61), Expect = 3.9
Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 11/88 (12%)
Query: 70 LDQIIKELDQ----IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK--KELDQMVKE 123
+D+++ ELD+ + EL+ + E N + KE+ K ++ I KEL + +K
Sbjct: 27 VDELL-ELDEERRELQTELEELQAERNALSKEIG-QAKRKGEDAEALIAEVKELKEEIKA 84
Query: 124 LDLTIKELNQIIKELNQIIKELNQIVEE 151
L+ EL+++ EL +++ + + +
Sbjct: 85 LE---AELDELEAELEELLLRIPNLPHD 109
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 27.3 bits (61), Expect = 3.9
Identities = 20/109 (18%), Positives = 48/109 (44%), Gaps = 17/109 (15%)
Query: 33 STEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESN 92
S EE E +E + + + V+ + E + +I L++ +KE+ + +E
Sbjct: 484 SDEEIERMVEDAEANAALDKKFRELVEA------RNEAESLIYSLEKALKEIVKVSEEE- 536
Query: 93 QVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQI 141
KE + + +++ L+ +E+ I+EL ++ ++L +
Sbjct: 537 ----------KEKIEEAITDLEEALEGEKEEIKAKIEELQEVTQKLAEK 575
>gnl|CDD|147082 pfam04740, LXG, LXG domain of WXG superfamily. This domain is
present is the N-terminal region of a group of
polymorphic toxin proteins in bacteria. It is predicted
to use Type VII secretion pathway to mediate export of
bacterial toxins.
Length = 204
Score = 26.8 bits (60), Expect = 4.1
Identities = 29/110 (26%), Positives = 54/110 (49%), Gaps = 12/110 (10%)
Query: 43 SSNG-IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLI 101
SSNG I+E+ +E E K K ++ +EL ++ I+ + I+ L
Sbjct: 90 SSNGYIDEEFLEHELKNGLKKAEEKTEELKD---AINSILASISDIV--------HLPKF 138
Query: 102 IKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+ Q +LD+ KK+ + +++L +EL + EL +I+E+ Q V +
Sbjct: 139 STSAFQEQLDKAKKKRNDTLEKLYEFDQELTNELAELEALIQEIQQYVRQ 188
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 27.0 bits (60), Expect = 4.3
Identities = 23/91 (25%), Positives = 47/91 (51%), Gaps = 23/91 (25%)
Query: 61 NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM 120
+KL +++KE I E++ + +++ I Q+K+D+++KE+DQ
Sbjct: 38 SKLSELQKEKKNIQNEIESLDNQIEEI-------------------QSKIDELQKEIDQS 78
Query: 121 VKELDLTIKELNQIIKELNQIIKELNQIVEE 151
E IK+L + I EL + I E +++++
Sbjct: 79 KAE----IKKLQKEIAELKENIVERQELLKK 105
>gnl|CDD|217273 pfam02895, H-kinase_dim, Signal transducing histidine kinase,
homodimeric domain. This helical bundle domain is the
homodimer interface of the signal transducing histidine
kinase family.
Length = 63
Score = 25.2 bits (56), Expect = 4.3
Identities = 12/54 (22%), Positives = 29/54 (53%), Gaps = 10/54 (18%)
Query: 106 NQNKLDQI----------KKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
+ KLD++ + L+Q+++ L+ +EL + ++ L+++ +EL V
Sbjct: 7 DVEKLDRLMNLVGELVIARNRLEQLLERLEELDRELKEALERLSRLTRELQDAV 60
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 27.1 bits (61), Expect = 4.4
Identities = 25/110 (22%), Positives = 51/110 (46%), Gaps = 10/110 (9%)
Query: 40 NLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN 99
L EE+E+E + ++ ++ K +I E Q + +D K ++ + L
Sbjct: 116 LLPDEPIDEEREIEALVRSILSEFEEYAKLNKKIPPEELQSLNSIDDPGKLADTIAAHLP 175
Query: 100 LIIKE-----SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKE 144
L ++E + ++++K LD + KE+D L Q+ K + + +KE
Sbjct: 176 LKLEEKQEILETLDVKERLEKLLDLLEKEID-----LLQLEKRIRKKVKE 220
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 27.0 bits (60), Expect = 4.7
Identities = 19/102 (18%), Positives = 41/102 (40%), Gaps = 14/102 (13%)
Query: 60 FNKLDQIKKELDQIIKELDQI------IKELDLIIKESNQVIKELNLIIKESNQ--NKLD 111
DQ+ + L+Q + L+ + + L L + +S++ I+E +++ S L
Sbjct: 64 LAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLLSRILL 123
Query: 112 QIKKELDQMVK--ELDLTIKELNQIIKELNQIIKELNQIVEE 151
Q L + + E +E + + E I N + +
Sbjct: 124 QQLGPLPEAGQPQEQFEVTQERDALQAEKAYI----NALEGQ 161
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 26.7 bits (59), Expect = 4.8
Identities = 18/91 (19%), Positives = 46/91 (50%), Gaps = 6/91 (6%)
Query: 47 IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
+ +E E + + L + + L +++K L++ ++ L+ IKE + ++EL ++E
Sbjct: 96 EDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEV 155
Query: 107 QNK------LDQIKKELDQMVKELDLTIKEL 131
+ L + + L + ++EL +++L
Sbjct: 156 EELREELERLQENLQRLQEAIQELQSLLEQL 186
>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
transduction. This family is a four helix bundle that
operates as a ubiquitous sensory module in prokaryotic
signal-transduction. The 4HB_MCP is always found between
two predicted transmembrane helices indicating that it
detects only extracellular signals. In many cases the
domain is associated with a cytoplasmic HAMP domain
suggesting that most proteins carrying the bundle might
share the mechanism of transmembrane signalling which is
well-characterized in E coli chemoreceptors.
Length = 181
Score = 26.4 bits (59), Expect = 5.0
Identities = 15/94 (15%), Positives = 43/94 (45%), Gaps = 7/94 (7%)
Query: 61 NKLDQIKKELDQIIKELDQII------KELDLIIKESNQVIKELNLIIKESNQNKLDQIK 114
+++++ E+D+++K+ ++ I K + ++ K N ++ + K D+
Sbjct: 82 KDIEELRAEIDKLLKKYEKTILTEEEKKLFNEFKEQLKAYRKVRNKVLDLAKAGKNDEAY 141
Query: 115 KELDQMVKEL-DLTIKELNQIIKELNQIIKELNQ 147
++ D I+ L+++I ++ KE +
Sbjct: 142 ALYLTELEPARDAVIEALDELIDYNLKVAKEAYE 175
>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
type) [DNA replication, recombination, and repair].
Length = 1444
Score = 26.9 bits (60), Expect = 5.0
Identities = 29/143 (20%), Positives = 60/143 (41%), Gaps = 10/143 (6%)
Query: 17 FALFRMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKE 76
+ +F P+ E+ ++ E + + E + E + +I+ +II++
Sbjct: 45 WEFHFLFAETLPIEIYKELENALSEEFEQLVAKVEFDIEVRNVDITEQEIQTYFKKIIEQ 104
Query: 77 L---DQIIKEL----DLIIKESNQVI-KELNLIIKESNQNKLDQIKKELDQMVKELDLTI 128
L I K L L +K +N +I + LN + +NK +++K+L L
Sbjct: 105 LSVKSPIFKSLLNKLKLKVKGNNILIEQVLNNPEFDHFKNKSPELQKKLQSFGFPQLLIE 164
Query: 129 KELNQIIKELNQIIKELNQIVEE 151
E+N I +E ++ + + E
Sbjct: 165 FEVNDISEEQE--FEKFEEAINE 185
>gnl|CDD|236270 PRK08471, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 613
Score = 26.9 bits (60), Expect = 5.3
Identities = 11/44 (25%), Positives = 30/44 (68%), Gaps = 3/44 (6%)
Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
+ +LD ++K++++ EL +T+ E+N + K++ +I K++ ++
Sbjct: 150 DTRERLDTLQKKVNE---ELKVTVDEINSLGKQIAEINKQIKEV 190
>gnl|CDD|177094 CHL00195, ycf46, Ycf46; Provisional.
Length = 489
Score = 26.9 bits (60), Expect = 5.3
Identities = 18/84 (21%), Positives = 43/84 (51%), Gaps = 10/84 (11%)
Query: 73 IIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLD-----QIKKELDQMVKELDLT 127
++K+ ++ + ++ + K ++ L+ I+K + + I KEL ++ L+
Sbjct: 86 LLKDFNRFLNDISISRK-----LRNLSRILKTQPKTIIIIASELNIPKELKDLITVLEFP 140
Query: 128 IKELNQIIKELNQIIKELNQIVEE 151
+ ++I KEL ++IK LN ++
Sbjct: 141 LPTESEIKKELTRLIKSLNIKIDS 164
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 26.7 bits (60), Expect = 5.3
Identities = 19/141 (13%), Positives = 55/141 (39%), Gaps = 8/141 (5%)
Query: 17 FALFRMFDGLTPVTDGSTEESEINLE---SSNGIEEQEMEKECKVDFNKLDQIKKELDQI 73
F++F + + ++ + E+ + K + +
Sbjct: 54 IEDFKLFKEKLKQSFSHIADIKVTFSIEVENITFTEELLLDYWNEIIEKAKKNSPLFKSL 113
Query: 74 IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKK---ELDQMVKELDLTIKE 130
+K+ ++ LIIK +N++ + + + K+ + Q +K + ++ E+D + +E
Sbjct: 114 LKKQKVEVEGNKLIIKVNNEIER--DHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEE 171
Query: 131 LNQIIKELNQIIKELNQIVEE 151
L + + + ++L + E
Sbjct: 172 LEKFEAQKEEEDEKLAKEALE 192
>gnl|CDD|184829 PRK14807, PRK14807, histidinol-phosphate aminotransferase;
Provisional.
Length = 351
Score = 26.7 bits (59), Expect = 5.4
Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 93 QVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
++K LNL+ N N L Q+ + V + + +N I+ E ++IKEL++I
Sbjct: 230 NILKYLNLVKSPYNINSLSQV---IALKVLRTGVLKERVNYILNERERLIKELSKI 282
>gnl|CDD|133381 cd01106, HTH_TipAL-Mta, Helix-Turn-Helix DNA binding domain of the
transcription regulators TipAL, Mta, and SkgA.
Helix-turn-helix (HTH) TipAL, Mta, and SkgA
transcription regulators, and related proteins,
N-terminal domain. TipAL regulates resistance to and
activation by numerous cyclic thiopeptide antibiotics,
such as thiostrepton. Mta is a global transcriptional
regulator; the N-terminal DNA-binding domain of Mta
interacts directly with the promoters of mta, bmr, blt,
and ydfK, and induces transcription of these
multidrug-efflux transport genes. SkgA has been shown to
control stationary-phase expression of
catalase-peroxidase in Caulobacter crescentus. These
proteins are comprised of distinct domains that harbor
an N-terminal active (DNA-binding) site and a
regulatory (effector-binding) site. The conserved
N-terminal domain of these transcription regulators
contains winged HTH motifs that mediate DNA binding.
These proteins share the N-terminal DNA binding domain
with other transcription regulators of the MerR
superfamily that promote transcription by reconfiguring
the spacer between the -35 and -10 promoter elements.
Unique to this family, is a TipAL-like, lineage specific
Bacilli subgroup, which has five conserved cysteines in
the C-terminus of the protein.
Length = 103
Score = 25.9 bits (58), Expect = 5.6
Identities = 18/70 (25%), Positives = 37/70 (52%), Gaps = 9/70 (12%)
Query: 59 DFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD 118
D +L QI +KEL +KE+ ++K+ ++ + E + L++ K+ LD
Sbjct: 43 DLERLQQI-----LFLKELGFSLKEIKELLKDPSEDLLE----ALREQKELLEEKKERLD 93
Query: 119 QMVKELDLTI 128
+++K +D T+
Sbjct: 94 KLIKTIDRTL 103
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 26.6 bits (59), Expect = 5.9
Identities = 14/44 (31%), Positives = 26/44 (59%)
Query: 33 STEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKE 76
E S +LE+ +++E E + K + +L ++K++DQ IKE
Sbjct: 77 EKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKE 120
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 26.9 bits (59), Expect = 6.1
Identities = 23/90 (25%), Positives = 46/90 (51%), Gaps = 4/90 (4%)
Query: 62 KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMV 121
L++ K L I EL QI+++L+L + E+N + KE N+ +L + E++++
Sbjct: 484 TLEEDIKNLKHDINELTQILEKLELELSEANSKFELS----KEENERELVAQRIEIEKLE 539
Query: 122 KELDLTIKELNQIIKELNQIIKELNQIVEE 151
KEL+ I + Q+++ ++E
Sbjct: 540 KELNDLNLLSKTSILDAEQLVQSTEIKLDE 569
>gnl|CDD|181490 PRK08588, PRK08588, succinyl-diaminopimelate desuccinylase;
Reviewed.
Length = 377
Score = 26.4 bits (59), Expect = 6.1
Identities = 11/23 (47%), Positives = 15/23 (65%), Gaps = 2/23 (8%)
Query: 127 TIKELN--QIIKELNQIIKELNQ 147
TI E + Q+I L +II E+NQ
Sbjct: 252 TIPEYDNDQVISLLQEIINEVNQ 274
>gnl|CDD|235308 PRK04570, PRK04570, cell division protein ZipA; Provisional.
Length = 243
Score = 26.4 bits (58), Expect = 6.1
Identities = 13/40 (32%), Positives = 21/40 (52%)
Query: 24 DGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKL 63
DG E+SE+NLE+++ E+ K DF+K+
Sbjct: 59 DGDVDERGEGVEQSELNLEATDAAGGNELGKRPNQDFDKI 98
>gnl|CDD|215363 PLN02677, PLN02677, mevalonate kinase.
Length = 387
Score = 26.7 bits (59), Expect = 6.1
Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 8/51 (15%)
Query: 60 FNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKL 110
FN +D I +EL II Q E +L I E + +KEL E NQ L
Sbjct: 265 FNAVDSISEELATII----QSPAEDELSITEKEEKLKEL----MEMNQGLL 307
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 26.4 bits (58), Expect = 6.2
Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLI------IKESNQVIKELNLIIKESNQ--NKLDQIK 114
L+++ EL KE D + L + I + EL+ + KE + ++L++++
Sbjct: 31 LERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELE 90
Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
KE D + EL +E Q+ E Q ++E N
Sbjct: 91 KEDDDLDGELVELQEEKEQLENEELQYLREYNL 123
>gnl|CDD|218903 pfam06122, TraH, Conjugative relaxosome accessory transposon
protein. The TraH protein is thought to be a relaxosome
accessory component, also necessary for transfer but not
for H-pilus synthesis within the conjugative transposon.
Length = 360
Score = 26.5 bits (59), Expect = 6.3
Identities = 6/24 (25%), Positives = 13/24 (54%)
Query: 124 LDLTIKELNQIIKELNQIIKELNQ 147
L ++ +++ EL + + LNQ
Sbjct: 94 LQTLCPQIGKVMDELQKKAQALNQ 117
Score = 26.1 bits (58), Expect = 9.8
Identities = 6/24 (25%), Positives = 12/24 (50%)
Query: 117 LDQMVKELDLTIKELNQIIKELNQ 140
L + ++ + EL + + LNQ
Sbjct: 94 LQTLCPQIGKVMDELQKKAQALNQ 117
>gnl|CDD|223964 COG1033, COG1033, Predicted exporters of the RND superfamily
[General function prediction only].
Length = 727
Score = 26.5 bits (59), Expect = 6.5
Identities = 17/87 (19%), Positives = 40/87 (45%), Gaps = 5/87 (5%)
Query: 65 QIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKEL 124
+ EL++ +++ ++ + I+ QV II + ++ + I +EL + +K+
Sbjct: 453 RWMDELEEELEKGEEHVFSASSIVDLVKQVNG---GIIPDKSKIQ--IILEELPESIKKR 507
Query: 125 DLTIKELNQIIKELNQIIKELNQIVEE 151
++ +LN + L EL + E
Sbjct: 508 YISGDQLNLLGYSLGDTQGELEDVGRE 534
>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
Members of this family function as part of the Mediator
(Med) complex, which links DNA-bound transcriptional
regulators and the general transcription machinery,
particularly the RNA polymerase II enzyme. They play a
role in basal transcription by mediating activation or
repression according to the specific complement of
transcriptional regulators bound to the promoter.
Length = 176
Score = 26.2 bits (58), Expect = 6.9
Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 9/71 (12%)
Query: 79 QIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
++ ++L E +KEL + NQ ++ +++ E V+ LD IK I+KEL
Sbjct: 2 ELAEDLISADDELKSALKELEEHQE--NQQRILELRAE----VESLDEKIK---DILKEL 52
Query: 139 NQIIKELNQIV 149
+ KEL +
Sbjct: 53 KEAEKELRTLP 63
>gnl|CDD|187635 cd08930, SDR_c8, classical (c) SDR, subgroup 8. This subgroup has
a fairly well conserved active site tetrad and domain
size of the classical SDRs, but has an atypical
NAD-binding motif ([ST]G[GA]XGXXG). SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold (alpha/beta folding pattern with a central
beta-sheet), an NAD(P)(H)-binding region, and a
structurally diverse C-terminal region. Classical SDRs
are typically about 250 residues long, while extended
SDRs are approximately 350 residues. Sequence identity
between different SDR enzymes are typically in the
15-30% range, but the enzymes share the Rossmann fold
NAD-binding motif and characteristic NAD-binding and
catalytic sequence patterns. These enzymes catalyze a
wide range of activities including the metabolism of
steroids, cofactors, carbohydrates, lipids, aromatic
compounds, and amino acids, and act in redox sensing.
Classical SDRs have an TGXXX[AG]XG cofactor binding
motif and a YXXXK active site motif, with the Tyr
residue of the active site motif serving as a critical
catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase (15-PGDH)
numbering). In addition to the Tyr and Lys, there is
often an upstream Ser (Ser-138, 15-PGDH numbering)
and/or an Asn (Asn-107, 15-PGDH numbering) contributing
to the active site; while substrate binding is in the
C-terminal region, which determines specificity. The
standard reaction mechanism is a 4-pro-S hydride
transfer and proton relay involving the conserved Tyr
and Lys, a water molecule stabilized by Asn, and
nicotinamide. Extended SDRs have additional elements in
the C-terminal region, and typically have a TGXXGXXG
cofactor binding motif. Complex (multidomain) SDRs such
as ketoreductase domains of fatty acid synthase have a
GGXGXXG NAD(P)-binding motif and an altered active site
motif (YXXXN). Fungal type ketoacyl reductases have a
TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
have lost catalytic activity and/or have an unusual
NAD(P)-binding motif and missing or unusual active site
residues. Reactions catalyzed within the SDR family
include isomerization, decarboxylation, epimerization,
C=N bond reduction, dehydratase activity,
dehalogenation, Enoyl-CoA reduction, and
carbonyl-alcohol oxidoreduction.
Length = 250
Score = 26.1 bits (58), Expect = 6.9
Identities = 15/58 (25%), Positives = 31/58 (53%), Gaps = 7/58 (12%)
Query: 99 NLIIKESNQNKLDQIKKELDQ------MVKELDLTIKE-LNQIIKELNQIIKELNQIV 149
LI+ + N L+Q+K+EL + ELD+T KE + ++I+ + ++ ++
Sbjct: 28 RLILADINAPALEQLKEELTNLYKNRVIALELDITSKESIKELIESYLEKFGRIDILI 85
>gnl|CDD|236047 PRK07560, PRK07560, elongation factor EF-2; Reviewed.
Length = 731
Score = 26.4 bits (59), Expect = 7.3
Identities = 14/30 (46%), Positives = 23/30 (76%), Gaps = 4/30 (13%)
Query: 118 DQMVKELDLTIKELNQ----IIKELNQIIK 143
D+++KEL LT +E+ Q IIK++N++IK
Sbjct: 150 DRLIKELKLTPQEMQQRLLKIIKDVNKLIK 179
>gnl|CDD|118727 pfam10200, Ndufs5, NADH:ubiquinone oxidoreductase, NDUFS5-15kDa.
This is a family of short, approximately 105 amino acid
residue, proteins which form part of NADH:ubiquinone
oxidoreductase complex I. Complex I is the first
multisubunit inner membrane protein complex of the
mitochondrial electron transport chain and it transfers
two electrons from NADH to ubiquinone. The protein
carries four highly conserved cysteine residues but
these do not appear to be in a configuration which
would favour metal binding so the exact function of the
protein is uncertain.
Length = 96
Score = 25.5 bits (56), Expect = 7.3
Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 13/49 (26%)
Query: 41 LESSNGIEEQEMEKECKVDFN-------------KLDQIKKELDQIIKE 76
+E S+GI + +KECK++F +L I+K+ +++IKE
Sbjct: 41 IECSHGIGQTRAKKECKIEFEDFYECLHRQKTMQRLQAIRKQREKLIKE 89
>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein.
Length = 304
Score = 26.3 bits (58), Expect = 7.4
Identities = 16/105 (15%), Positives = 31/105 (29%), Gaps = 3/105 (2%)
Query: 48 EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIK-ESNQVIKELNLIIKESN 106
E +L+ + L + L + + + +
Sbjct: 91 REALAASRGAASKAQLEAARDNLRSALALLPISLGRIGASQATLGALLNNLQAQGLAARA 150
Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
Q LD+ +L Q EL I+ Q + ++ Q+ EE
Sbjct: 151 Q--LDEAYLQLAQTAAELLRLIRAPAQGERVQSEAQLRSVQVGEE 193
>gnl|CDD|225089 COG2178, COG2178, Predicted RNA-binding protein of the translin
family [Translation, ribosomal structure and
biogenesis].
Length = 204
Score = 26.1 bits (58), Expect = 7.6
Identities = 10/76 (13%), Positives = 39/76 (51%), Gaps = 4/76 (5%)
Query: 66 IKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELD 125
+++E++ I + L + K + +K S ++++ I ++ ++ +K+L + + +
Sbjct: 1 LREEINSIREVLQEKDKAREEALKLSREIVRLSGEAIFLLHRGDFEEAEKKLKKASEAV- 59
Query: 126 LTIKELNQIIKELNQI 141
++L +++ ++
Sbjct: 60 ---EKLKRLLAGFPEL 72
>gnl|CDD|221833 pfam12889, DUF3829, Protein of unknown function (DUF3829). This is
a small family of proteins from several bacterial
species, whose function is not known. It may, however,
be related to the GvpL_GvpF family of proteins,
pfam06386.
Length = 277
Score = 26.3 bits (58), Expect = 7.8
Identities = 13/82 (15%), Positives = 29/82 (35%), Gaps = 10/82 (12%)
Query: 59 DFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD 118
+F K ++ +L +E + L I E +L+++KKE
Sbjct: 110 NFAKGKELHAKLLAAYEEFKAAYEAFL----------AALEKINDERLLEQLEELKKEGK 159
Query: 119 QMVKELDLTIKELNQIIKELNQ 140
++ + E Q++ +
Sbjct: 160 KIAYYTLELLLEAEQLLNLFSS 181
>gnl|CDD|218912 pfam06148, COG2, COG (conserved oligomeric Golgi) complex
component, COG2. The COG complex comprises eight
proteins COG1-8. The COG complex plays critical roles in
Golgi structure and function. The proposed function of
the complex is to mediate the initial physical contact
between transport vesicles and their membrane targets. A
comparable role in tethering vesicles has been suggested
for at least six additional large multisubunit
complexes, including the exocyst, a complex that
mediates trafficking to the plasma membrane. COG2
structure reveals a six-helix bundle with few conserved
surface features but a general resemblance to recently
determined crystal structures of four different exocyst
subunits. These bundles inCOG2 may act as platforms for
interaction with other trafficing proteins including
SNAREs (soluble N-ethylmaleimide factor attachment
protein receptors) and Rabs.
Length = 133
Score = 25.7 bits (57), Expect = 8.0
Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 8/86 (9%)
Query: 62 KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKEL--NLIIKESNQNKLDQIKKELDQ 119
L+ ++ +L +K L + EL I E L NL+ KL++I+ L
Sbjct: 27 PLESLRDDLRAYLKLLKSELIEL---INEDYADFVSLSTNLV---GLDEKLERIRVPLLA 80
Query: 120 MVKELDLTIKELNQIIKELNQIIKEL 145
+ KE++ EL ++E+ ++E
Sbjct: 81 LRKEVESVRSELEARLEEVEAKLEEK 106
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 26.5 bits (59), Expect = 8.0
Identities = 19/99 (19%), Positives = 39/99 (39%), Gaps = 10/99 (10%)
Query: 61 NKLDQIKKELDQIIKELDQI-IKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQ 119
+L + ++KEL ++ IIK+ ++ I E + D+ +E
Sbjct: 1024 GELVITNAKKKDLVKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELG 1083
Query: 120 MVKELD---------LTIKELNQIIKELNQIIKELNQIV 149
D LT +++ ++ EL + KEL ++
Sbjct: 1084 AAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEKLK 1122
>gnl|CDD|151528 pfam11083, Streptin-Immun, Lantibiotic streptin immunity protein.
Streptococcal species produce a lantibiotic, streptin,
in a similar manner to the production of nisin and
subtilin by other lactic acid bacteria, in order to
compete against competing bacteria within the
environment. The immunity protein protects the bacterium
from destruction by its own lantibiotic. In general,
there is little homology between the immunity proteins
of different genera of bacteria.
Length = 99
Score = 25.4 bits (56), Expect = 8.3
Identities = 18/68 (26%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 48 EEQEMEKECKVDFNKLDQIKK-ELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
+ E K + DF+KL+ + L+Q+ +E+ ++ +L L + E ++++ L +K N
Sbjct: 31 ADVESRKLARYDFSKLNLTESITLEQVEEEILKLQNQLSLYLDEYEKLVRRLEKFVKILN 90
Query: 107 QNKLDQIK 114
+K IK
Sbjct: 91 TSKETGIK 98
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 26.2 bits (58), Expect = 8.5
Identities = 14/76 (18%), Positives = 42/76 (55%)
Query: 69 ELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTI 128
E+++ +++L+ IK+L + +K++ + ++ ++ E+ ++ + L+
Sbjct: 35 EVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLE 94
Query: 129 KELNQIIKELNQIIKE 144
KE+N++ + LN + +E
Sbjct: 95 KEINELEEWLNVLDEE 110
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 26.4 bits (59), Expect = 8.5
Identities = 15/79 (18%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 47 IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
I+E E +E ++ +N+ I + I K++ I+ + ++ K+ + + K+
Sbjct: 565 IDETERRREIQMAYNEEHGITPQ--TIKKKIRDILDGEYEEDEYKAKIEKKASKMSKKEL 622
Query: 107 QNKLDQIKKELDQMVKELD 125
+ + +++KE+ + K L+
Sbjct: 623 EKLIKKLEKEMKEAAKNLE 641
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 26.2 bits (57), Expect = 8.5
Identities = 20/117 (17%), Positives = 50/117 (42%)
Query: 30 TDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIK 89
TD + + E K +L + L + L+++ +E+D++
Sbjct: 176 TDSGKASASVAGIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEIDILKN 235
Query: 90 ESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELN 146
+ ++V +E + E ++ + K+ L+++ E D + ++ IK + +K+ N
Sbjct: 236 DGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRN 292
>gnl|CDD|218065 pfam04399, Glutaredoxin2_C, Glutaredoxin 2, C terminal domain.
Glutaredoxins are a multifunctional family of
glutathione-dependent disulphide oxidoreductases.
Unlike other glutaredoxins, glutaredoxin 2 (Grx2)
cannot reduce ribonucleotide reductase. Grx2 has
significantly higher catalytic activity in the
reduction of mixed disulphides with glutathione (GSH)
compared with other glutaredoxins. The active site
residues (Cys9-Pro10-Tyr11-Cys12, in Escherichia coli
Grx2), which are found at the interface between the N-
and C-terminal domains are identical to other
glutaredoxins, but there is no other similarity between
glutaredoxin 2 and other glutaredoxins. Grx2 is
structurally similar to glutathione-S-transferases
(GST), but there is no obvious sequence similarity. The
inter-domain contacts are mainly hydrophobic,
suggesting that the two domains are unlikely to be
stable on their own. Both domains are needed for
correct folding and activity of Grx2. It is thought
that the primary function of Grx2 is to catalyze
reversible glutathionylation of proteins with GSH in
cellular redox regulation including the response to
oxidative stress.
Length = 132
Score = 25.7 bits (57), Expect = 8.6
Identities = 11/37 (29%), Positives = 21/37 (56%)
Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN 99
+Q+ + Q I EL+ ++ELD ++ S V +L+
Sbjct: 50 FEQLLADTPQYIAELNADLEELDPLLLPSEAVNGQLS 86
>gnl|CDD|150302 pfam09590, Env-gp36, Lentivirus surface glycoprotein. This
protein is found in feline immunodeficiency retrovirus.
It represents the surface glycoprotein which is found
in the polyprotein C-terminal to the Env protein.
Length = 591
Score = 26.0 bits (57), Expect = 8.8
Identities = 16/51 (31%), Positives = 25/51 (49%)
Query: 23 FDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQI 73
FD T VT+ +N GI EQE ++ C + KL ++K E+ +
Sbjct: 19 FDIATQVTEEGPLNPGVNPFRVPGITEQEKQEYCVILQPKLQELKGEITEG 69
>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
(DUF2317). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 541
Score = 26.0 bits (58), Expect = 8.8
Identities = 10/57 (17%), Positives = 26/57 (45%), Gaps = 1/57 (1%)
Query: 95 IKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+ +L L I++ + L+ +K+++ + ++ I + EL K L + +
Sbjct: 392 LDKLGLSIEDVFLHGLELLKEKVLEEKANSEIDID-FEEAKAELEAQFKRLREEAAK 447
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 26.0 bits (58), Expect = 9.0
Identities = 11/44 (25%), Positives = 24/44 (54%)
Query: 108 NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
+K+D+ K+E +Q+ ++L +L Q EL + + ++ E
Sbjct: 73 DKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRE 116
>gnl|CDD|206685 cd01898, Obg, Obg GTPase. The Obg nucleotide binding protein
subfamily has been implicated in stress response,
chromosome partitioning, replication initiation,
mycelium development, and sporulation. Obg proteins are
among a large group of GTP binding proteins conserved
from bacteria to humans. The E. coli homolog, ObgE is
believed to function in ribosomal biogenesis. Members of
the subfamily contain two equally and highly conserved
domains, a C-terminal GTP binding domain and an
N-terminal glycine-rich domain.
Length = 170
Score = 25.8 bits (58), Expect = 9.1
Identities = 18/83 (21%), Positives = 35/83 (42%), Gaps = 18/83 (21%)
Query: 58 VDFNKLDQIKKELDQIIKELDQIIKEL------------DLIIKESNQVIKELNLIIKES 105
+D + D ++ + I EL+ L DL+ E ++L ++KE
Sbjct: 86 IDLSGEDDPVEDYETIRNELEAYNPGLAEKPRIVVLNKIDLLDAEERF--EKLKELLKEL 143
Query: 106 NQNKLDQI----KKELDQMVKEL 124
K+ I + LD+++K+L
Sbjct: 144 KGKKVFPISALTGEGLDELLKKL 166
>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein. A number of members of this
family are annotated as being transmembrane proteins
induced by tumour necrosis factor alpha, but no
literature was found to support this.
Length = 330
Score = 25.9 bits (57), Expect = 9.1
Identities = 17/81 (20%), Positives = 37/81 (45%), Gaps = 19/81 (23%)
Query: 66 IKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELD 125
++E ++ KE Q+ +E++++ K+ KL++++K +Q +
Sbjct: 2 CEEEWKELEKEFQQL--------QETHRLYKQ-----------KLEEVEKLQEQCTSAIA 42
Query: 126 LTIKELNQIIKELNQIIKELN 146
K L Q+I L ++ K L
Sbjct: 43 RQRKRLKQLIVSLKKLKKSLT 63
>gnl|CDD|237216 PRK12821, PRK12821, aspartyl/glutamyl-tRNA amidotransferase subunit
C-like protein; Provisional.
Length = 477
Score = 26.0 bits (57), Expect = 9.2
Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 4/64 (6%)
Query: 89 KESNQVIKELN-LIIKESNQNKLDQI-KKELDQMVK--ELDLTIKELNQIIKELNQIIKE 144
K + +L + + +K Q+ K EL ++ + DL EL ++ E I
Sbjct: 364 KGIKKSFSDLQSPLFPKHWTSKKQQLNKDELKKLARLVMFDLDDAELEKLQVEFKDITSS 423
Query: 145 LNQI 148
Q+
Sbjct: 424 FKQV 427
>gnl|CDD|233691 TIGR02027, rpoA, DNA-directed RNA polymerase, alpha subunit,
bacterial and chloroplast-type. This family consists of
the bacterial (and chloroplast) DNA-directed RNA
polymerase alpha subunit, encoded by the rpoA gene. The
RNA polymerase catalyzes the transcription of DNA into
RNA using the four ribonucleoside triphosphates as
substrates. The amino terminal domain is involved in
dimerizing and assembling the other RNA polymerase
subunits into a transcriptionally active enzyme. The
carboxy-terminal domain contains determinants for
interaction with DNA and with transcriptional activator
proteins [Transcription, DNA-dependent RNA polymerase].
Length = 297
Score = 26.1 bits (58), Expect = 9.3
Identities = 13/59 (22%), Positives = 26/59 (44%), Gaps = 2/59 (3%)
Query: 74 IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELN 132
I E +I+ E ++ I+ +E + + K L ++ELDL+++ N
Sbjct: 192 IAEAAKILIEHLEPFVNLDEEIEAFEEEQEEEEEE--LEDAKLLSIKIEELDLSVRSYN 248
>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK). This
family consists of several plant specific
phytosulfokine precursor proteins. Phytosulfokines, are
active as either a pentapeptide or a C-terminally
truncated tetrapeptide. These compounds were first
isolated because of their ability to stimulate cell
division in somatic embryo cultures of Asparagus
officinalis.
Length = 74
Score = 24.6 bits (54), Expect = 9.4
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 30 TDGSTEESEINLESSNGIEEQEMEKECK 57
TD E S+I S+ ++E E+ EC+
Sbjct: 20 TDTGHESSKIEAAESSKVDEDEVVSECE 47
>gnl|CDD|179143 PRK00855, PRK00855, argininosuccinate lyase; Provisional.
Length = 459
Score = 25.9 bits (58), Expect = 9.4
Identities = 6/19 (31%), Positives = 16/19 (84%)
Query: 67 KKELDQIIKELDQIIKELD 85
++E ++I+ LD+I++E++
Sbjct: 55 EEEAEKILAGLDEILEEIE 73
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 25.7 bits (57), Expect = 9.4
Identities = 16/93 (17%), Positives = 41/93 (44%), Gaps = 9/93 (9%)
Query: 64 DQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN----LIIKESNQNKLDQIKKELDQ 119
+I +L + +++ +E ++ E Q ++E II+++ + ++ +
Sbjct: 40 AKIADDLAEA----ERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAE 95
Query: 120 MVKELDLTIKELNQIIK-ELNQIIKELNQIVEE 151
+EL+ + I+ E + ++EL V E
Sbjct: 96 AEEELERIKEAAEAEIEAEKERALEELRAEVAE 128
>gnl|CDD|217899 pfam04108, APG17, Autophagy protein Apg17. Apg17 is required for
activating Apg1 protein kinases.
Length = 408
Score = 26.1 bits (58), Expect = 9.5
Identities = 12/46 (26%), Positives = 21/46 (45%)
Query: 93 QVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
++++ LI E Q+ ++ ELD L+ T+ L I L
Sbjct: 58 KIVRSTALIRTEWGQSVFVNLQNELDAADARLESTLDRLRSTIVPL 103
>gnl|CDD|225553 COG3008, PqiB, Paraquat-inducible protein B [General function
prediction only].
Length = 553
Score = 26.1 bits (58), Expect = 9.5
Identities = 8/37 (21%), Positives = 18/37 (48%)
Query: 113 IKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
K LD +V ++ +T+ L +K ++ +I+
Sbjct: 436 NKLPLDFVVNDVTVTLSGLQFTLKSAQTLLDGGIRIL 472
>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
Length = 880
Score = 26.2 bits (59), Expect = 9.9
Identities = 15/35 (42%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Query: 51 EMEKE-CKVDFNKLDQIKKELDQIIKELDQIIKEL 84
ME+ KVD L ++ EL Q + EL+Q I EL
Sbjct: 492 RMERNGIKVDREYLKELSAELAQRLAELEQEIYEL 526
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.136 0.353
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,851,367
Number of extensions: 782373
Number of successful extensions: 4817
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3675
Number of HSP's successfully gapped: 1248
Length of query: 151
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 63
Effective length of database: 7,034,450
Effective search space: 443170350
Effective search space used: 443170350
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 54 (24.7 bits)