RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4582
(133 letters)
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 37.5 bits (88), Expect = 9e-04
Identities = 25/87 (28%), Positives = 44/87 (50%)
Query: 28 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 87
E IK +++I + NE + + +E L++ E L Q + L R E ++K + E
Sbjct: 52 EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEE 111
Query: 88 LERSISFLKEKETELDEILAVLTEKTE 114
LE+ L++K+ EL++ L E E
Sbjct: 112 LEKKEKELEQKQQELEKKEEELEELIE 138
Score = 35.9 bits (84), Expect = 0.003
Identities = 22/81 (27%), Positives = 45/81 (55%), Gaps = 4/81 (4%)
Query: 34 LKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 93
L++ E ++R + ++Q E+ L Q E L++ L + E ++K++ ELE+
Sbjct: 69 LRNEFEKELRER--RNELQKLEKR--LLQKEENLDRKLELLEKREEELEKKEKELEQKQQ 124
Query: 94 FLKEKETELDEILAVLTEKTE 114
L++KE EL+E++ ++ E
Sbjct: 125 ELEKKEEELEELIEEQLQELE 145
>gnl|CDD|235360 PRK05183, hscA, chaperone protein HscA; Provisional.
Length = 616
Score = 34.8 bits (81), Expect = 0.007
Identities = 19/46 (41%), Positives = 26/46 (56%), Gaps = 6/46 (13%)
Query: 19 PSQGGTITDEHI----KMSLKSAVEDKIRRKMNEQKVQAQEEIEIL 60
PS G +TD+ I K S+ A ED R + EQKV+A+ +E L
Sbjct: 500 PSYG--LTDDEIARMLKDSMSHAEEDMQARALAEQKVEAERVLEAL 543
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 34.5 bits (80), Expect = 0.010
Identities = 19/63 (30%), Positives = 29/63 (46%)
Query: 52 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 111
Q+E+ LKQ + K + + E +E LE + L+EK+ EL+ L L E
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE 205
Query: 112 KTE 114
K
Sbjct: 206 KAA 208
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 32.4 bits (74), Expect = 0.050
Identities = 19/69 (27%), Positives = 39/69 (56%)
Query: 44 RKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELD 103
++ E+ + +E+++ L++ EL + +L E ++ EK ELE + L+E++ EL+
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE 890
Query: 104 EILAVLTEK 112
E L L +
Sbjct: 891 EELRELESE 899
Score = 29.3 bits (66), Expect = 0.54
Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 12/99 (12%)
Query: 28 EHIKMSLKSAVEDKIR-----RKMNEQKVQAQEEIEILKQTENELN-------QGKFKLA 75
+ ++ L+ E+ R ++ E+ +A++EIE LK EL + +L
Sbjct: 235 KELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELK 294
Query: 76 RMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTE 114
E ++ E + L + L+ + EL+E L L EK E
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIE 333
Score = 27.8 bits (62), Expect = 2.2
Identities = 15/70 (21%), Positives = 35/70 (50%)
Query: 45 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 104
++ +K + ++E++ L++ + EL + +L + +E +L + L+ K L+
Sbjct: 867 ELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEV 926
Query: 105 ILAVLTEKTE 114
L L E+ E
Sbjct: 927 ELPELEEELE 936
Score = 27.4 bits (61), Expect = 2.8
Identities = 18/72 (25%), Positives = 33/72 (45%), Gaps = 3/72 (4%)
Query: 41 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 100
+ ++ + +E +E L+ EL + +L + E+I+ K ELE + L+E E
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEE---RLEELKEKIEALKEELEERETLLEELEQ 351
Query: 101 ELDEILAVLTEK 112
L E+ E
Sbjct: 352 LLAELEEAKEEL 363
Score = 26.6 bits (59), Expect = 4.6
Identities = 22/72 (30%), Positives = 31/72 (43%)
Query: 43 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 102
++ E+ + QEE+ LK+ EL L E ++ E ELE + LKEK L
Sbjct: 276 LEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEAL 335
Query: 103 DEILAVLTEKTE 114
E L E
Sbjct: 336 KEELEERETLLE 347
Score = 26.6 bits (59), Expect = 5.0
Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 4/83 (4%)
Query: 33 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 92
L A+ +++ E + + E+E LK+ L + +L+ E + +E ELE
Sbjct: 376 ELFEALREELAELEAELA-EIRNELEELKREIESLEERLERLSERLEDLKEELKELEAE- 433
Query: 93 SFLKEKETELDEILAVLTEKTEV 115
L+E +TEL+E+ L E E
Sbjct: 434 --LEELQTELEELNEELEELEEQ 454
Score = 25.8 bits (57), Expect = 8.9
Identities = 27/89 (30%), Positives = 48/89 (53%), Gaps = 2/89 (2%)
Query: 28 EHIKMSLKSAVED--KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEK 85
E LK +E+ + R+ + E+ + +EE+E ++ + L + L + ER+++E
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830
Query: 86 AELERSISFLKEKETELDEILAVLTEKTE 114
ELE I L+EK EL+E L L ++ E
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKELE 859
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 32.4 bits (74), Expect = 0.054
Identities = 21/77 (27%), Positives = 40/77 (51%)
Query: 28 EHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 87
E + ++ + R+ N+Q+ +A+ E E L++ E L Q + +L E++D + +
Sbjct: 47 EEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQ 106
Query: 88 LERSISFLKEKETELDE 104
LE L +E EL+E
Sbjct: 107 LEEREKALSARELELEE 123
Score = 27.0 bits (60), Expect = 3.3
Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 2/90 (2%)
Query: 45 KMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDE 104
K++ + Q +E + L E EL + + +L R+ E R + LK + EL+E
Sbjct: 99 KLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKL-LLKLLDAELEE 157
Query: 105 ILAVLTEKTEVDVDEAVTTTA-PIYKQIVN 133
A +K E + D A I Q +
Sbjct: 158 EKAQRVKKIEEEADLEAERKAQNILAQAMQ 187
>gnl|CDD|233673 TIGR01991, HscA, Fe-S protein assembly chaperone HscA. The Heat
Shock Cognate proteins HscA and HscB act together as
chaperones. HscA resembles DnaK but belongs in a
separate clade. The apparent function is to aid assembly
of iron-sulfur cluster proteins. Homologs from Buchnera
and Wolbachia are clearly in the same clade but are
highly derived and score lower than some examples of
DnaK [Protein fate, Protein folding and stabilization].
Length = 599
Score = 31.9 bits (73), Expect = 0.075
Identities = 14/33 (42%), Positives = 20/33 (60%)
Query: 30 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQ 62
+K S K A ED R + EQKV+A+ +E L+
Sbjct: 497 LKDSFKHAEEDMYARALAEQKVEAERILEALQA 529
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 31.8 bits (72), Expect = 0.086
Identities = 22/120 (18%), Positives = 51/120 (42%), Gaps = 14/120 (11%)
Query: 15 GGSSPSQGGTITDEH------IKMSLKSAVEDKIRRKMNEQK----VQAQEEIEILKQTE 64
Q T +D+ + +A++++ + + + Q ++ + +++ E
Sbjct: 82 TVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLEDLVGMIQNAE 141
Query: 65 NE---LNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLT-EKTEVDVDEA 120
LNQ + + E+I EK L+ I+ L+ + +E D + + EK V++ E
Sbjct: 142 KNILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEE 201
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 31.4 bits (72), Expect = 0.12
Identities = 21/75 (28%), Positives = 36/75 (48%)
Query: 38 VEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKE 97
++++ + E + + +E L++ E L Q + L R E +DK++ LE+ L
Sbjct: 56 AKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSN 115
Query: 98 KETELDEILAVLTEK 112
KE LDE L E
Sbjct: 116 KEKNLDEKEEELEEL 130
Score = 28.7 bits (65), Expect = 0.92
Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 4/91 (4%)
Query: 28 EHIKMSLKSAVEDKIRRKMNEQK---VQAQEEIEI-LKQTENELNQGKFKLARMFERIDK 83
+ I K E + + E K + + E+E LK+ NEL + + +L + E +D+
Sbjct: 35 KRIIEEAKKEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDR 94
Query: 84 EKAELERSISFLKEKETELDEILAVLTEKTE 114
+ L++ L++KE EL L EK E
Sbjct: 95 KMESLDKKEENLEKKEKELSNKEKNLDEKEE 125
Score = 26.8 bits (60), Expect = 4.1
Identities = 21/75 (28%), Positives = 38/75 (50%), Gaps = 11/75 (14%)
Query: 39 EDKIRR---KMNEQKVQAQEEI-----EILKQTENELNQGKFKLARMFERIDKEKAELER 90
E K RR + E+++ +EE E L + E L + + +L+ + +D+++ ELE
Sbjct: 70 ELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELEE 129
Query: 91 SISFLKEKETELDEI 105
I E+ EL+ I
Sbjct: 130 LI---AEQREELERI 141
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 30.2 bits (69), Expect = 0.15
Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 1/73 (1%)
Query: 42 IRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETE 101
+RR E ++E+ L+ +E Q KL E + + K E+E L++ E
Sbjct: 26 LRRLEGE-LASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQR 84
Query: 102 LDEILAVLTEKTE 114
+ L +L EK+E
Sbjct: 85 YETTLELLGEKSE 97
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 30.9 bits (70), Expect = 0.17
Identities = 26/96 (27%), Positives = 46/96 (47%), Gaps = 7/96 (7%)
Query: 27 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARM----FERI- 81
D ++ + E K+ R+ + ++ E I LKQ E ++ Q K AR +RI
Sbjct: 328 DRELEEKAREVSELKMERQRKKPQIDELESIVRLKQAEADMFQLKADEARREAERLQRIA 387
Query: 82 --DKEKAELERSISFLKEKETELDEILAVLTEKTEV 115
EK+E E + +LK + +E +E L E+ ++
Sbjct: 388 LAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKL 423
>gnl|CDD|222968 PHA03065, PHA03065, Hypothetical protein; Provisional.
Length = 438
Score = 30.7 bits (70), Expect = 0.18
Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 16/92 (17%)
Query: 33 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 92
+K ++ +K R+ + +EEIE L+ L+ M+E I K +LE I
Sbjct: 79 PIKQSLREKRRKASKNTIKRKREEIEKLEDDIKNLDVDDE----MYEEI---KTDLELKI 131
Query: 93 S------FLKEK---ETELDEILAVLTEKTEV 115
FL + L+ LA L E E+
Sbjct: 132 DKLSFQLFLANSNNLKRLLESALARLGENVEI 163
>gnl|CDD|218167 pfam04599, Pox_G5, Poxvirus G5 protein. This protein has been
predicted to be related to the FEN-1 endonuclease.
Length = 426
Score = 30.8 bits (70), Expect = 0.18
Identities = 24/94 (25%), Positives = 40/94 (42%), Gaps = 16/94 (17%)
Query: 31 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 90
+ +K ++ +K + + + EEIE L+ L+ M+E I K +LE
Sbjct: 76 SIDIKESLREKRKNSLKNTSKRKTEEIENLEDAIKNLDVEDE----MYEEI---KTDLEL 128
Query: 91 SIS------FLKEK---ETELDEILAVLTEKTEV 115
I FL K + L+EIL+ L E +
Sbjct: 129 RIDKLSFHNFLANKSNLKITLEEILSSLKEDVTI 162
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 30.8 bits (70), Expect = 0.20
Identities = 20/87 (22%), Positives = 41/87 (47%)
Query: 39 EDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 98
DK + ++ +E++E LK+ NEL + +L +R+ +E A+L +I+ ++ K
Sbjct: 376 VDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAK 435
Query: 99 ETELDEILAVLTEKTEVDVDEAVTTTA 125
EL+E + + + A
Sbjct: 436 INELEEEKEDKALEIKKQEWKLEQLAA 462
Score = 28.5 bits (64), Expect = 1.1
Identities = 18/70 (25%), Positives = 39/70 (55%)
Query: 40 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 99
+K++R++NE K + E L++ EL +A + +I++ + E E +K++E
Sbjct: 395 EKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQE 454
Query: 100 TELDEILAVL 109
+L+++ A L
Sbjct: 455 WKLEQLAADL 464
Score = 27.7 bits (62), Expect = 2.0
Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 8/87 (9%)
Query: 41 KIRRKMNEQKVQA--------QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 92
K + K E++++ +EE+E L+ +L L + + ++ + ELER I
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKI 905
Query: 93 SFLKEKETELDEILAVLTEKTEVDVDE 119
L+ + + + L+ L K E +E
Sbjct: 906 EELEAQIEKKRKRLSELKAKLEALEEE 932
Score = 25.8 bits (57), Expect = 8.2
Identities = 14/72 (19%), Positives = 37/72 (51%)
Query: 41 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 100
+ R + E++ + ++ E L + E E+++ ++ + I++E+ ++ E +
Sbjct: 305 SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKE 364
Query: 101 ELDEILAVLTEK 112
EL+++ A L E
Sbjct: 365 ELEDLRAELEEV 376
Score = 25.8 bits (57), Expect = 8.2
Identities = 14/76 (18%), Positives = 38/76 (50%)
Query: 40 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 99
+I +++E + + + + E E+ Q + + ++ ER+++ + +L ++ +
Sbjct: 698 RRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVK 757
Query: 100 TELDEILAVLTEKTEV 115
+EL E+ A + E E
Sbjct: 758 SELKELEARIEELEED 773
Score = 25.8 bits (57), Expect = 8.6
Identities = 21/84 (25%), Positives = 45/84 (53%)
Query: 31 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 90
+++L+ +K +++ EQ++ +E+I+ +++ LN K +L E ++ +LE
Sbjct: 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLES 882
Query: 91 SISFLKEKETELDEILAVLTEKTE 114
+ LK++ EL+ L L K E
Sbjct: 883 RLGDLKKERDELEAQLRELERKIE 906
Score = 25.8 bits (57), Expect = 9.9
Identities = 19/78 (24%), Positives = 38/78 (48%), Gaps = 6/78 (7%)
Query: 33 SLKSAVEDKIRR--KMNEQKVQAQEEIEILK----QTENELNQGKFKLARMFERIDKEKA 86
SL+ ++ +K R E+ + + EI+ L + E E+ + + + ++ E + K
Sbjct: 305 SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKE 364
Query: 87 ELERSISFLKEKETELDE 104
ELE + L+E + E E
Sbjct: 365 ELEDLRAELEEVDKEFAE 382
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 30.8 bits (70), Expect = 0.21
Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 2/88 (2%)
Query: 33 SLKSAVEDKIRRKMNEQKVQAQ--EEIEILKQTENELNQGKFKLARMFERIDKEKAELER 90
SL S V D +R + EE+E L+ L + + +++ +LE
Sbjct: 61 SLLSEVLDYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEE 120
Query: 91 SISFLKEKETELDEILAVLTEKTEVDVD 118
+S L E+ +L+++L L +D D
Sbjct: 121 RLSELDEELEDLEDLLEELEPLAYLDFD 148
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 30.6 bits (69), Expect = 0.26
Identities = 19/74 (25%), Positives = 33/74 (44%)
Query: 31 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELER 90
+ + + ++ + E K + IE L E EL Q L ++ + +K K ELE+
Sbjct: 491 RYGIPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQ 550
Query: 91 SISFLKEKETELDE 104
+ LKE+E
Sbjct: 551 EMEELKERERNKKL 564
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 30.4 bits (69), Expect = 0.29
Identities = 22/73 (30%), Positives = 35/73 (47%)
Query: 43 RRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETEL 102
++ E+ + Q+E+ L + L Q K L ++++ ELE + L+ K EL
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335
Query: 103 DEILAVLTEKTEV 115
E LA L EK E
Sbjct: 336 AEELAELEEKLEE 348
Score = 26.6 bits (59), Expect = 5.1
Identities = 17/85 (20%), Positives = 36/85 (42%), Gaps = 6/85 (7%)
Query: 34 LKSAVEDKIRRKMN--EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERS 91
L+ +E E+ +A++ ++ ++ +L L R+ E ++
Sbjct: 452 LQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG----FSEG 507
Query: 92 ISFLKEKETELDEILAVLTEKTEVD 116
+ L + ++ L IL VL+E VD
Sbjct: 508 VKALLKNQSGLSGILGVLSELISVD 532
Score = 26.2 bits (58), Expect = 6.4
Identities = 21/66 (31%), Positives = 32/66 (48%)
Query: 47 NEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEIL 106
EQ + E+IE L EL + +L E + E+A LE +++ L+ + EL E L
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903
Query: 107 AVLTEK 112
L K
Sbjct: 904 RELESK 909
>gnl|CDD|215457 PLN02852, PLN02852, ferredoxin-NADP+ reductase.
Length = 491
Score = 30.0 bits (68), Expect = 0.36
Identities = 12/35 (34%), Positives = 20/35 (57%)
Query: 78 FERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
+E+ID + E R+ +EK T ++E+L EK
Sbjct: 457 WEKIDSAEKEAGRARGKPREKITSIEEMLKAANEK 491
>gnl|CDD|236583 PRK09601, PRK09601, GTP-binding protein YchF; Reviewed.
Length = 364
Score = 29.6 bits (68), Expect = 0.37
Identities = 16/59 (27%), Positives = 32/59 (54%), Gaps = 3/59 (5%)
Query: 40 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 98
+++ +K +A+ E+E+L++ L +G K AR E + E+ +L +S+ L K
Sbjct: 145 ERLEKKAKGGDKEAKAELELLEKLLEHLEEG--KPARTLE-LTDEEEKLLKSLQLLTAK 200
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 29.9 bits (68), Expect = 0.38
Identities = 19/107 (17%), Positives = 48/107 (44%), Gaps = 7/107 (6%)
Query: 20 SQGGTITDEHIKMSLKSAVE-----DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL 74
S GT T+ + + E ++ +++ E+ + ++ ++ L + Q +
Sbjct: 315 SPAGTKTEIEVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLP--KKARGQLPPEK 372
Query: 75 ARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAV 121
E++ + K +L + L+E+ EL E L L + ++ V++ +
Sbjct: 373 REQLEKLLETKEKLSEELEELEEELKELKEELESLYSEGKISVNKTI 419
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 30.0 bits (68), Expect = 0.39
Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 7/80 (8%)
Query: 31 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTE------NELNQGKFKLARMFERIDKE 84
M + S ++K+ K+N + + ++E+E LK T +L++ + L E +KE
Sbjct: 1093 SMPIWSLTKEKVE-KLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKE 1151
Query: 85 KAELERSISFLKEKETELDE 104
A+ +R S K K ++L +
Sbjct: 1152 IAKEQRLKSKTKGKASKLRK 1171
>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 879
Score = 29.5 bits (67), Expect = 0.49
Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 53 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
QE+ ++LK+ + L L ++ ER+ +E ELE+ + LK+K E+ +L+
Sbjct: 710 LQEQEDLLKEIASLLKVPPELLPKV-ERLLEELKELEKELERLKKKL-AAAELADLLSNA 767
Query: 113 TEVDV 117
++
Sbjct: 768 KAEEI 772
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 29.7 bits (67), Expect = 0.49
Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 1/64 (1%)
Query: 53 AQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
E+ + LK+ + L +L + ER +E L++ + LK K + +I + +
Sbjct: 703 LNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQ-KIDELKDKA 761
Query: 113 TEVD 116
++
Sbjct: 762 ETIN 765
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 29.5 bits (67), Expect = 0.53
Identities = 23/96 (23%), Positives = 44/96 (45%), Gaps = 13/96 (13%)
Query: 36 SAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQ--------GKFKLARMFERIDKEKAE 87
+ DK+ +++E++ + E+E L+ NEL++ G+ + + + K +
Sbjct: 25 ALDVDKLL-ELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEK 83
Query: 88 LERSISFLKEKETELDEILAVL----TEKTEVDVDE 119
L+ + L E E ELD +L + E V DE
Sbjct: 84 LKELEAALDELEAELDTLLLTIPNIPHESVPVGKDE 119
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 29.2 bits (66), Expect = 0.59
Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 1/76 (1%)
Query: 40 DKIRRKMNEQKVQAQEEIEILKQTENELNQ-GKFKLARMFERIDKEKAELERSISFLKEK 98
+ I+ K+ ++ +EEI L+Q +ELN +L + + + ++ L+E
Sbjct: 174 NSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEEL 233
Query: 99 ETELDEILAVLTEKTE 114
+ EL E+ + T
Sbjct: 234 QQELQELTIAIEALTN 249
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 29.0 bits (66), Expect = 0.84
Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 48 EQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISF-LKEKETELDEIL 106
E + +A+E +LK+ E + + K ++ E DK E E+ +KE + E DEI+
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590
Query: 107 AVLTE 111
L +
Sbjct: 591 KELRQ 595
Score = 27.1 bits (61), Expect = 3.4
Identities = 12/81 (14%), Positives = 34/81 (41%), Gaps = 8/81 (9%)
Query: 35 KSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISF 94
K ++++ + + E + +AQ+ I+ K+ +E+ + + + A ++
Sbjct: 557 KEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEI------IKELRQLQKGGYASVKAHE-- 608
Query: 95 LKEKETELDEILAVLTEKTEV 115
L E L++ +K +
Sbjct: 609 LIEARKRLNKANEKKEKKKKK 629
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 28.4 bits (64), Expect = 1.2
Identities = 21/79 (26%), Positives = 40/79 (50%), Gaps = 3/79 (3%)
Query: 40 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 99
DK+R + + +E+ ++ ++ EL + + E+I+KE ELE IS L+ +
Sbjct: 60 DKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEE---ELEKIEKEIKELEEEISELENEI 116
Query: 100 TELDEILAVLTEKTEVDVD 118
EL++ + L D+D
Sbjct: 117 KELEQEIERLEPWGNFDLD 135
>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
(DUF2317). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 541
Score = 28.3 bits (64), Expect = 1.2
Identities = 15/81 (18%), Positives = 35/81 (43%), Gaps = 6/81 (7%)
Query: 38 VEDKIRRKMNEQKVQAQEEIEILKQTENELNQ---GKFKLARMFERIDKEKAELERSISF 94
+E K+ +K+++ + ++ + L + + + + ++ KAELE
Sbjct: 384 IEPKVEKKLDKLGLSIED---VFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKR 440
Query: 95 LKEKETELDEILAVLTEKTEV 115
L+E+ ++D L E E
Sbjct: 441 LREEAAKIDPSLEGALEANEA 461
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 28.2 bits (64), Expect = 1.4
Identities = 17/56 (30%), Positives = 32/56 (57%), Gaps = 3/56 (5%)
Query: 40 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFL 95
+++ +K +A+EE+E+L++ + L +G K AR E D+E L +S+ L
Sbjct: 141 ERLEKKAKSGDKEAKEELELLEKIKEHLEEG--KPARTLELTDEEIKIL-KSLQLL 193
>gnl|CDD|224307 COG1389, COG1389, DNA topoisomerase VI, subunit B [DNA replication,
recombination, and repair].
Length = 538
Score = 28.1 bits (63), Expect = 1.5
Identities = 17/66 (25%), Positives = 28/66 (42%), Gaps = 1/66 (1%)
Query: 58 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEKTEVDV 117
EI + L + KL R +E E ++ + K E+ + LA + EK E D+
Sbjct: 442 EIENEIRLALMEVARKLKLYLSRKRREMEERKKRKTIEKYLP-EIAKKLAEILEKPEEDI 500
Query: 118 DEAVTT 123
E +
Sbjct: 501 REKLAK 506
>gnl|CDD|110537 pfam01540, Lipoprotein_7, Adhesin lipoprotein. This family
consists of the p50 and variable adherence-associated
antigen (Vaa) adhesins from Mycoplasma hominis. M.
hominis is a mycoplasma associated with human urogenital
diseases, pneumonia, and septic arthritis. An adhesin is
a cell surface molecule that mediates adhesion to other
cells or to the surrounding surface or substrate. The
Vaa antigen is a 50-kDa surface lipoprotein that has
four tandem repetitive DNA sequences encoding a periodic
peptide structure, and is highly immunogenic in the
human host. p50 is also a 50-kDa lipoprotein, having
three repeats A,B and C, that may be a tetramer of
191-kDa in its native environment.
Length = 353
Score = 28.1 bits (62), Expect = 1.6
Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 35 KSAVEDK---IRRKMNEQKVQAQEEIEILKQTENE-------LNQGKFKLARMFERIDKE 84
K A++DK I +++ E+ + Q IE LK+ NE +N+ ++ + F+
Sbjct: 106 KKAIDDKNAQIAKELAERNAKIQSNIEELKKINNEAFELSKTVNKTIAEVEKKFKIPKDF 165
Query: 85 KAELERSISFLKEKETELDEILAVLTEKTEVDVDE 119
K +LE L +K ++DE V + + + E
Sbjct: 166 KEQLENFADDLLDKSRQIDEFTTVTSTQEGFTLAE 200
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 27.7 bits (62), Expect = 1.7
Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 1/62 (1%)
Query: 54 QEEIEILKQTENELNQGKFK-LARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
+EE+ LKQ E+EL L R E++ K E+ + L+E E EL E+ + + +
Sbjct: 185 EEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDL 244
Query: 113 TE 114
T
Sbjct: 245 TN 246
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 28.1 bits (63), Expect = 1.7
Identities = 21/84 (25%), Positives = 38/84 (45%), Gaps = 1/84 (1%)
Query: 30 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFER-IDKEKAEL 88
++ + +A DK R++ E + +A+ I L++ E+ + + E I + L
Sbjct: 492 VEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEEKEKIEEAITDLEEAL 551
Query: 89 ERSISFLKEKETELDEILAVLTEK 112
E +K K EL E+ L EK
Sbjct: 552 EGEKEEIKAKIEELQEVTQKLAEK 575
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 27.6 bits (61), Expect = 2.1
Identities = 27/91 (29%), Positives = 39/91 (42%), Gaps = 15/91 (16%)
Query: 39 EDKIRRKMNEQK-----VQAQ----EEIEILKQTENELNQGKFKLARMFERIDKEKAELE 89
E+K R+K E+ QA E E LKQ + +L KL + ++E LE
Sbjct: 501 EEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRMLE 560
Query: 90 RSISFLK-----EKETE-LDEILAVLTEKTE 114
+ L+ EKETE L L + +K
Sbjct: 561 KEAQELRKYQESEKETEVLMSALQAMQDKNL 591
>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
Length = 535
Score = 27.5 bits (62), Expect = 2.3
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 7/67 (10%)
Query: 58 EILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEI---LAVLTEKTE 114
EI K+ L + KL + R KE+ +++ +F K + EI LA + EK E
Sbjct: 439 EIEKEIRLALQEVARKLKKYLSRKRKEEEAKKKAKTFEK----YIPEIARKLAEILEKEE 494
Query: 115 VDVDEAV 121
D++E +
Sbjct: 495 PDIEEVL 501
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 27.6 bits (62), Expect = 2.4
Identities = 24/87 (27%), Positives = 43/87 (49%), Gaps = 11/87 (12%)
Query: 27 DEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKA 86
+ +K + + A EDK R++ E K +A+E + E L + KL + +K
Sbjct: 507 ERMVKDAEEYAAEDKKRKERIEAKNEAEEYVY---SLEKSLKEEGDKLP------EADKK 557
Query: 87 ELERSISFLKE--KETELDEILAVLTE 111
++E +I +LKE + + +EI A E
Sbjct: 558 KVEEAIEWLKEELEGEDKEEIEAKTEE 584
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 27.5 bits (61), Expect = 2.6
Identities = 11/47 (23%), Positives = 25/47 (53%)
Query: 61 KQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILA 107
+ E L++ +L + E++ + E++R L++KE+ + E L
Sbjct: 442 RMDEGILDERLGRLQALSEKLRTQHEEIKRCREALQKKESIVMETLE 488
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 27.3 bits (61), Expect = 2.8
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 11/81 (13%)
Query: 37 AVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLK 96
A EDK R++ E + A + Q E L + KL +EK ++E++++ LK
Sbjct: 514 AEEDKKRKEEIEARNNAD---SLAYQAEKTLKEAGDKLP------AEEKEKIEKAVAELK 564
Query: 97 E--KETELDEILAVLTEKTEV 115
E K +++EI A E +
Sbjct: 565 EALKGEDVEEIKAKTEELQQA 585
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 27.3 bits (61), Expect = 2.8
Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 8/68 (11%)
Query: 47 NEQKVQAQEEIEILKQTENELN--------QGKFKLARMFERIDKEKAELERSISFLKEK 98
++++ + EIE L+ NEL+ Q K K+ + + + + K EL + LK
Sbjct: 36 DDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKAL 95
Query: 99 ETELDEIL 106
E EL + L
Sbjct: 96 EAELQDKL 103
>gnl|CDD|234053 TIGR02903, spore_lon_C, ATP-dependent protease, Lon family.
Members of this protein family resemble the widely
distributed ATP-dependent protease La, also called Lon
and LonA. It resembles even more closely LonB, which is
a LonA paralog found in genomes if and only if the
species is capable of endospore formation (as in
Bacillus subtilis, Clostridium tetani, and select other
members of the Firmicutes) and expressed specifically in
the forespore compartment. Members of this family are
restricted to a subset of spore-forming species, and are
very likely to participate in the program of endospore
formation. We propose the designation LonC [Protein
fate, Degradation of proteins, peptides, and
glycopeptides, Cellular processes, Sporulation and
germination].
Length = 615
Score = 27.5 bits (61), Expect = 2.8
Identities = 20/80 (25%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 25 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEI--EILKQTENELNQGKFKLARMFERID 82
I D + ++++ +E K+ +M E++ + EEI ++LK+ + N K E+++
Sbjct: 77 IADILARRTVENRIERKVETRMQERQNKYLEEIRLQVLKEEKGPENSSTLKKLERLEKLE 136
Query: 83 KEKAELERSI-SFLKEKETE 101
K+K L +S S L+ +
Sbjct: 137 KKK--LHKSAQSLLRPRAFS 154
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 27.0 bits (60), Expect = 3.2
Identities = 19/94 (20%), Positives = 38/94 (40%), Gaps = 2/94 (2%)
Query: 23 GTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID 82
G E + L E ++ E+++ ++E L++ EL + +L +
Sbjct: 464 GQELPEEHEKELLELYELELEEL--EEELSREKEEAELREEIEELEKELRELEEELIELL 521
Query: 83 KEKAELERSISFLKEKETELDEILAVLTEKTEVD 116
+ + L+ + EK L E L L EK ++
Sbjct: 522 ELEEALKEELEEKLEKLENLLEELEELKEKLQLQ 555
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 27.0 bits (60), Expect = 3.2
Identities = 14/62 (22%), Positives = 27/62 (43%)
Query: 41 KIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKET 100
+++ + + E L +E + KLA++ E K A+L +S ++K
Sbjct: 168 ATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLE 227
Query: 101 EL 102
EL
Sbjct: 228 EL 229
>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein. This
is the P3 protein section of the Potyviridae
polyproteins. The function is not known except that the
protein is essential to viral survival.
Length = 445
Score = 27.1 bits (61), Expect = 3.4
Identities = 14/83 (16%), Positives = 37/83 (44%), Gaps = 12/83 (14%)
Query: 33 SLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSI 92
+L S + + + K + + ++E + +++ ++ +++ + E +
Sbjct: 264 TLVSIINEHRKLKREAAEEKEEKEEKEIRKLY------------LYSKLEGKLPTEEEFL 311
Query: 93 SFLKEKETELDEILAVLTEKTEV 115
+++EK EL E L E+ EV
Sbjct: 312 EYIEEKNPELLETAEELIEEEEV 334
>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
Length = 238
Score = 27.0 bits (60), Expect = 3.4
Identities = 21/66 (31%), Positives = 32/66 (48%), Gaps = 9/66 (13%)
Query: 47 NEQKVQAQEEIEILKQTENELNQGKFKLARMFE----RIDKEKAELERSISFLKEKETEL 102
+ Q ++E+E LKQ ELN ++A F+ R +E+ +L L+ K L
Sbjct: 66 AARLAQLEQELESLKQELEELNSQYMRIAADFDNFRKRTSREQEDLR-----LQLKCNTL 120
Query: 103 DEILAV 108
EIL V
Sbjct: 121 SEILPV 126
>gnl|CDD|215960 pfam00509, Hemagglutinin, Haemagglutinin. Hemagglutinin from
influenza virus causes membrane fusion of the viral
membrane with the host membrane. Fusion occurs after the
host cell internalises the virus by endocytosis. The
drop of pH causes release of a hydrophobic fusion
peptide and a large conformational change leading to
membrane fusion.
Length = 550
Score = 26.9 bits (60), Expect = 4.2
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Query: 54 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEK 98
Q+ I+ Q +LN K+ FE I+KE +ELER I+ L +K
Sbjct: 370 QKAID---QITGKLNSLIEKMNTQFEAIEKEFSELERRINNLNKK 411
>gnl|CDD|129690 TIGR00602, rad24, checkpoint protein rad24. All proteins in this
family for which functions are known are involved in DNA
damage tolerance (likely cell cycle checkpoints).This
family is based on the phylogenomic analysis of JA Eisen
(1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 637
Score = 26.8 bits (59), Expect = 4.5
Identities = 19/84 (22%), Positives = 31/84 (36%), Gaps = 3/84 (3%)
Query: 18 SPSQGGTITDEHIKMSLKS-AVEDKIRRKMNEQKVQAQEEIEILKQTENEL-NQGKFKLA 75
S S+ G++ + +MS KS A K + K +EI+ L + L
Sbjct: 327 SSSKSGSLPIKK-RMSTKSDAHASKSKIKGKHSSNNENQEIQALGGKDVSLFLFRALGKI 385
Query: 76 RMFERIDKEKAELERSISFLKEKE 99
+R + + R S L E
Sbjct: 386 LYCKRATLNELDSPRLPSHLSELS 409
>gnl|CDD|223258 COG0180, TrpS, Tryptophanyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 314
Score = 26.4 bits (59), Expect = 5.0
Identities = 16/60 (26%), Positives = 25/60 (41%), Gaps = 2/60 (3%)
Query: 54 QEEIEILKQTEN-ELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTEK 112
+EI + EL G+ K + E I + ++ L+E LD+IL EK
Sbjct: 256 DSILEIEAEYRGGELGCGECK-KELAEAIQEFLKPIQERREELREDPAYLDDILRKGAEK 314
>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
Length = 663
Score = 26.7 bits (59), Expect = 5.1
Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 8/85 (9%)
Query: 20 SQGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEIL-KQTENELNQGKFKLARMF 78
S GG ++DE I+ +K A E K + + ++ V A+ E E L E +L+ K K++
Sbjct: 537 SSGG-LSDEEIEKMVKEAEEYKEQDEKKKELVDAKNEAETLIYSVEKQLSDLKDKIS--- 592
Query: 79 ERIDKEKAELERSISFLKEKETELD 103
D +K EL++ I+ L+ + D
Sbjct: 593 ---DADKDELKQKITKLRSTLSSED 614
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 26.5 bits (59), Expect = 5.1
Identities = 17/73 (23%), Positives = 34/73 (46%), Gaps = 11/73 (15%)
Query: 30 IKMSLKSAVEDKIRRK-----MNEQKVQAQEEIEILKQTENELNQGKFKL------ARMF 78
++ + VE+K+ + + EQ ++E+ I K ++EL + K KL +
Sbjct: 207 LQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEELKLPEEVK 266
Query: 79 ERIDKEKAELERS 91
+ I+KE +L
Sbjct: 267 KVIEKELTKLSLL 279
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 26.2 bits (59), Expect = 5.7
Identities = 21/69 (30%), Positives = 37/69 (53%), Gaps = 4/69 (5%)
Query: 45 KMNEQKVQAQEEIEILKQTENELNQ--GKFKLARMFERIDKEKAELERSISFLKEKETEL 102
+++E++ + Q E+E L+ N L++ G+ K E + AE++ +K E EL
Sbjct: 32 ELDEERRELQTELEELQAERNALSKEIGQAKRKG--EDAEALIAEVKELKEEIKALEAEL 89
Query: 103 DEILAVLTE 111
DE+ A L E
Sbjct: 90 DELEAELEE 98
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 26.1 bits (58), Expect = 5.7
Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 7/77 (9%)
Query: 31 KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENE---LNQGKFKL--ARMFERIDKEK 85
K+S + ++++E + EE LK+ E + + + + E +
Sbjct: 291 KLSGLKNSGEPSLKEIHEAR--LNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348
Query: 86 AELERSISFLKEKETEL 102
++LE L++ E L
Sbjct: 349 SKLEEIQKKLEDLEKRL 365
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 26.0 bits (57), Expect = 5.7
Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 15/83 (18%)
Query: 40 DKIRRKMNEQKVQAQEEIE-----------ILKQTENELNQGKFKLARMFERIDKEKAEL 88
DK ++ E+K+Q EE+ +KQ+++EL + K+ARM E L
Sbjct: 38 DKEMEELLEEKMQHVEELRQIHADINDMETEIKQSKSELERRMGKIARMHG----EYNPL 93
Query: 89 ERSISFLKEKETELDEILAVLTE 111
+ SI+ +++ E L+E+ + E
Sbjct: 94 KESINEMRKLELGLEELPQLHEE 116
>gnl|CDD|181856 PRK09436, thrA, bifunctional aspartokinase I/homoserine
dehydrogenase I; Provisional.
Length = 819
Score = 26.3 bits (59), Expect = 5.9
Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 8/78 (10%)
Query: 23 GTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERID 82
+T+ + M K+A D ++ + + E ++ L + + K ++D
Sbjct: 41 AKVTNHLVAMIEKAAKGDDAYPEILDAERIFHELLDGLAAALPGFDLAQLK-----AKVD 95
Query: 83 KEKAELE---RSISFLKE 97
+E A+L+ IS L E
Sbjct: 96 QEFAQLKDILHGISLLGE 113
>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
Length = 1112
Score = 26.5 bits (59), Expect = 6.0
Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 9/77 (11%)
Query: 34 LKSAVEDKIRRKMNE-----QKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL 88
LK DK R + E +K++A+ E+ + +EL K + I + A++
Sbjct: 5 LKKLFGDKSERDLKEIQPIVEKIKAEYP-ELEALSNDELRA---KTDEFKQYIKEAVADI 60
Query: 89 ERSISFLKEKETELDEI 105
+ I LK + E +I
Sbjct: 61 DAKIEELKAEAIESLDI 77
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 26.2 bits (58), Expect = 6.0
Identities = 20/61 (32%), Positives = 30/61 (49%)
Query: 52 QAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKETELDEILAVLTE 111
Q Q E++ L++ N+LN+ KL E + AE + + EKE E+ E A L
Sbjct: 44 QRQFELDSLRKEFNKLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAALDA 103
Query: 112 K 112
K
Sbjct: 104 K 104
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 26.1 bits (58), Expect = 6.4
Identities = 16/90 (17%), Positives = 39/90 (43%)
Query: 25 ITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKE 84
I D+ ++ D ++ E + E+ + L + +N+++ K L + ++ K
Sbjct: 304 IKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKV 363
Query: 85 KAELERSISFLKEKETELDEILAVLTEKTE 114
KA +E + + EL ++ L + +
Sbjct: 364 KAAIEELQAEFVDNAEELAKLQDELDKIVK 393
>gnl|CDD|185474 PTZ00141, PTZ00141, elongation factor 1- alpha; Provisional.
Length = 446
Score = 26.2 bits (58), Expect = 6.6
Identities = 15/40 (37%), Positives = 23/40 (57%), Gaps = 3/40 (7%)
Query: 54 QEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSIS 93
+ IE ++ E+ +G FK A + +DK KAE ER I+
Sbjct: 36 KRTIEKFEKEAAEMGKGSFKYAWV---LDKLKAERERGIT 72
>gnl|CDD|217337 pfam03050, DDE_Tnp_IS66, Transposase IS66 family. Transposase
proteins are necessary for efficient DNA transposition.
This family includes IS66 from Agrobacterium
tumefaciens.
Length = 277
Score = 26.0 bits (58), Expect = 6.7
Identities = 11/75 (14%), Positives = 24/75 (32%), Gaps = 3/75 (4%)
Query: 29 HIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAEL 88
I+ + ++ E + +E + + K KL + + L
Sbjct: 177 AIEREARGLPPEERLALRQEYSRPLLDALEAWLEAQLPGVLPKSKLGKAIRYLLNRWDAL 236
Query: 89 ERSISFLKEKETELD 103
R FL++ +D
Sbjct: 237 TR---FLEDGRVPID 248
>gnl|CDD|233723 TIGR02094, more_P_ylases, alpha-glucan phosphorylases. This family
consists of known phosphorylases, and homologs believed
to share the function of using inorganic phosphate to
cleave an alpha 1,4 linkage between the terminal glucose
residue and the rest of the polymer (maltodextrin,
glycogen, etc.). The name of the glucose storage polymer
substrate, and therefore the name of this enzyme,
depends on the chain lengths and branching patterns. A
number of the members of this family have been shown to
operate on small maltodextrins, as may be obtained by
utilization of exogenous sources. This family represents
a distinct clade from the related family modeled by
TIGR02093/pfam00343.
Length = 601
Score = 26.1 bits (58), Expect = 6.7
Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 5/59 (8%)
Query: 25 ITDEHI---KMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKL--ARMF 78
I DE + + LK+ + D IRR++ E+ ++ + IL T+ L+ + AR F
Sbjct: 340 IPDEELWEVHLKLKARLIDYIRRRLRERWLRRGADAAILMATDRFLDPDVLTIGFARRF 398
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 26.1 bits (58), Expect = 7.2
Identities = 25/90 (27%), Positives = 48/90 (53%), Gaps = 13/90 (14%)
Query: 30 IKMSLKSAVE--DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAE 87
I+M + S E D++ R++ +Q + E E LK+ ++E ++ + E ++KE AE
Sbjct: 396 IRMEIDSKPEELDELDRRI----IQLEIEREALKKEKDEASKERL------EDLEKELAE 445
Query: 88 LERSISFLKEK-ETELDEILAVLTEKTEVD 116
LE + L+E+ + E I + K E++
Sbjct: 446 LEEEYADLEEQWKAEKAAIQGIQQIKEEIE 475
>gnl|CDD|177362 PHA02415, PHA02415, DNA primase domain-containing protein.
Length = 930
Score = 26.1 bits (57), Expect = 7.2
Identities = 20/84 (23%), Positives = 34/84 (40%), Gaps = 10/84 (11%)
Query: 8 WTLDQEVGGSSPSQGGTITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENEL 67
W E G PS+GG D +M+ A E + + I++ +E+
Sbjct: 770 WRYLCEFAGMDPSEGGFPHDLLAEMNAHVA----------ETSADREPWVWIMETVLSEI 819
Query: 68 NQGKFKLARMFERIDKEKAELERS 91
+ G +K F+ +D E L R+
Sbjct: 820 DGGNYKHPYTFDTVDGEFCLLLRT 843
>gnl|CDD|233004 TIGR00513, accA, acetyl-CoA carboxylase, carboxyl transferase,
alpha subunit. The enzyme acetyl-CoA carboxylase
contains a biotin carboxyl carrier protein or domain, a
biotin carboxylase, and a carboxyl transferase. This
model represents the alpha chain of the carboxyl
transferase for cases in which the architecture of the
protein is as in E. coli, in which the
carboxyltransferase portion consists of two
non-identical subnits, alpha and beta [Fatty acid and
phospholipid metabolism, Biosynthesis].
Length = 316
Score = 25.9 bits (57), Expect = 7.4
Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 9/81 (11%)
Query: 40 DKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELERSISFLKEKE 99
+ +R + ++ V EEIE L++ EL + F ++R +L R ++
Sbjct: 20 ESLRARSRDEDVDLSEEIERLEKRSVELTKKIFSNLGAWQR-----LQLAR----HPDRP 70
Query: 100 TELDEILAVLTEKTEVDVDEA 120
LD I + + E+ D A
Sbjct: 71 YTLDYIELIFDDFFELAGDRA 91
>gnl|CDD|237255 PRK12900, secA, preprotein translocase subunit SecA; Reviewed.
Length = 1025
Score = 25.9 bits (57), Expect = 8.0
Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 30 IKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDKEKAELE 89
I+ SL S +D +R K E K + + +E ++Q + +L + E + EL+
Sbjct: 30 IQASLASLSDDALREKGMELKSRVRGALEPIEQKKKDLEKKLDNPDISLEEAESINEELD 89
Query: 90 R-SISFLKEKETELDEIL 106
+ + + L+EIL
Sbjct: 90 TLAKEYEEATAAALEEIL 107
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 25.6 bits (56), Expect = 9.0
Identities = 21/99 (21%), Positives = 48/99 (48%), Gaps = 5/99 (5%)
Query: 24 TITDEHIKMSLKSAVEDKIRRKMNEQKVQAQEEIEILKQTENELNQGKFKLARMFERIDK 83
T+ M LK E+ ++ K+ E + + +E ++ L++ E E + + +L +R++
Sbjct: 123 TLQRVPEYMDLKEDYEE-LKEKLEELQKEKEELLKELEELEAEYEEVQERL----KRLEV 177
Query: 84 EKAELERSISFLKEKETELDEILAVLTEKTEVDVDEAVT 122
E + LE + L + +L + L E+ +E ++
Sbjct: 178 ENSRLEEMLKKLPGEVYDLKKRWDELEPGVELPEEELIS 216
>gnl|CDD|165320 PHA03022, PHA03022, hypothetical protein; Provisional.
Length = 335
Score = 25.7 bits (56), Expect = 10.0
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 32 MSLKSAVEDKIRRKMNEQKVQAQEEI 57
+SLK+++ DKI K + + +EEI
Sbjct: 70 LSLKNSIADKIILKFKDNGIMTEEEI 95
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.127 0.325
Gapped
Lambda K H
0.267 0.0761 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 6,587,981
Number of extensions: 595750
Number of successful extensions: 1603
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1443
Number of HSP's successfully gapped: 490
Length of query: 133
Length of database: 10,937,602
Length adjustment: 86
Effective length of query: 47
Effective length of database: 7,123,158
Effective search space: 334788426
Effective search space used: 334788426
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 53 (24.1 bits)