RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7417
(372 letters)
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 45.5 bits (108), Expect = 4e-05
Identities = 45/187 (24%), Positives = 87/187 (46%), Gaps = 13/187 (6%)
Query: 17 QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
AK +EQL + E Q+ E E R KQA+ + R + EQ+ + K E
Sbjct: 619 AVAKQKQAEEQLVQANAELEEQKRAEAE--ARTALKQARLDLQRLQNEQQSLKDKLELAI 676
Query: 77 RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ-----TNKSKGPNFTIQRREPG 131
+++ E + R L+ + ++L E ++QA + ALK+ T + +
Sbjct: 677 AERKQQAETQLRQLDAQLKQLLE---QQQAFLEALKDDFRELRTERLAKWQVVEGELDNQ 733
Query: 132 LTLSTAQLERNKT--KEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLET 189
L +A +E +T K +++E KK + + L+V+ ++ +L+ + EL TI ++
Sbjct: 734 LAQLSAAIEAARTQAKARLKELKK-QYDRELASLDVDPNTVKELKRQIEELETTIERIAV 792
Query: 190 DKYDLEE 196
+ ++ E
Sbjct: 793 RRPEVRE 799
Score = 33.9 bits (78), Expect = 0.14
Identities = 41/208 (19%), Positives = 82/208 (39%), Gaps = 25/208 (12%)
Query: 12 EFVKRQEAKSSALDEQLKEYI--VEWRNQRAK----EEEDLKRLKEKQAKRKVMRAEEEQ 65
+ ++ EA +E+ E + QR E+L++L+ + +A+EEQ
Sbjct: 428 QLRQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEIND--EALEKAQEEQ 485
Query: 66 KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTI 125
+ A E+ + + K++ + E QR E + + + L+ Q + G
Sbjct: 486 EQA-EANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQAGSLLHF 544
Query: 126 QRREP-------GLTLSTAQLERN----KTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQ 174
R E G +S LER + E + + +++ ++ L+V +
Sbjct: 545 LRNEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDY-----A 599
Query: 175 AKATELWETIVKLETDKYDLEERQKRQD 202
A TEL E + + E +QK+ +
Sbjct: 600 ANETELRERLQQAEEALQSAVAKQKQAE 627
Score = 33.9 bits (78), Expect = 0.17
Identities = 27/124 (21%), Positives = 50/124 (40%), Gaps = 5/124 (4%)
Query: 34 EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEK 93
R++ + E L L K + V R E K +++ E + + R EEK
Sbjct: 356 SIRSELEEVEARLDALTGKH--QDVQRKYERLKQKIKEQLERDLEKNNERLAAIR--EEK 411
Query: 94 RQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
++ E+ QA+ + L++Q K F + E L L + + EE ++
Sbjct: 412 DRQKAAIEEDLQALESQLRQQLEAGK-LEFNEEEYELELRLGRLKQRLDSATATPEELEQ 470
Query: 154 IALN 157
+ +N
Sbjct: 471 LEIN 474
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 44.4 bits (106), Expect = 7e-05
Identities = 33/142 (23%), Positives = 60/142 (42%), Gaps = 23/142 (16%)
Query: 28 LKEYIVEWRNQRAK-----EEEDLKRLKEK-QAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
L E I+E AK ++E L L + + E++ + A+ +E E+ + E
Sbjct: 499 LPENIIE----EAKKLIGEDKEKLNELIASLEELER--ELEQKAEEAEALLKEAEKLKEE 552
Query: 82 VEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLER 141
+EEKK++ EE+ + LEEAEK+ Q + K++ ++ +Q+
Sbjct: 553 LEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYA---------- 602
Query: 142 NKTKEQIEEEKKIALNIRIKPL 163
K E + LN +
Sbjct: 603 -SVKAHELIEARKRLNKANEKK 623
Score = 39.0 bits (92), Expect = 0.003
Identities = 27/117 (23%), Positives = 49/117 (41%), Gaps = 13/117 (11%)
Query: 15 KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRK---VMRAEEE--QKMAQ 69
+ E K+ + LKE K +E+L+ KEK + + + AE+E Q + +
Sbjct: 530 RELEQKAEEAEALLKEA--------EKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581
Query: 70 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
KKE +E + + +K K L EA K+ +++ K K ++
Sbjct: 582 AKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELK 638
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 42.0 bits (99), Expect = 5e-04
Identities = 37/297 (12%), Positives = 95/297 (31%), Gaps = 26/297 (8%)
Query: 64 EQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNF 123
++ +++KE E+ + ++ DL++ + LEE E+ + + + K
Sbjct: 1108 NAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEE----KEIAKEQRLKSKTK 1163
Query: 124 TIQR--REPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELW 181
R+P L + +++ + + + + +
Sbjct: 1164 GKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGS 1223
Query: 182 ------ETIVKLETDKYDLEERQKRQDYDVSKIIINKKIMPSPGQGRPGFNPRYVQTPNL 235
E K + + +K S+ +G+P P+ V
Sbjct: 1224 DQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQ- 1282
Query: 236 FNNATALTGKYPPKIQVASKYERRVDTRSYDDKKKLFEGGYLSLTKESLEKQYAEKVDQF 295
PP E ++ KK + +K+ +EK
Sbjct: 1283 ---------YSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTAR 1333
Query: 296 GNRQKTKLPKWFGERPGKKKDAPESPEEEEVKRDDEEEENPIYLKISDDEEEKEEEN 352
+ KT++ + + + P + + E+++ + S+DE+++++E+
Sbjct: 1334 KKKSKTRVKQASASQSSRLLRRPRKKKSD----SSSEDDDDSEVDDSEDEDDEDDED 1386
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 39.5 bits (93), Expect = 0.002
Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 14/72 (19%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ-KMAQRKKEEEERRQREVEEKKQRDLEEKR 94
R +EEE + + E++ R EE Q K ++KKEE E + ++ ++QR LEEK
Sbjct: 262 DKTREEEEEKILKAAEEE------RQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEK- 314
Query: 95 QRLEEAEKKRQA 106
E+K+QA
Sbjct: 315 ------ERKKQA 320
Score = 33.8 bits (78), Expect = 0.14
Identities = 11/43 (25%), Positives = 32/43 (74%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERR 78
R + A+E+++ K+ +E++AK + EE++K+ +++++++ R+
Sbjct: 280 RQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 40.1 bits (95), Expect = 0.002
Identities = 28/104 (26%), Positives = 55/104 (52%), Gaps = 11/104 (10%)
Query: 18 EAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVM--RAEEEQKMAQRKKE-- 73
EAK +E+ K + E + KE E +K+ +AK ++ R E E+++ +R+ E
Sbjct: 30 EAKIKEAEEEAKRILEEAK----KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQ 85
Query: 74 --EEERRQREVE-EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
E+ Q+E ++K LE++ + LE+ EK+ + L+++
Sbjct: 86 KLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKK 129
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 39.8 bits (93), Expect = 0.002
Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 6/82 (7%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
QRA E+ K L+++ A AE+ K A++ ++ E +Q++ EE K + E + +
Sbjct: 83 QRAAEQARQKELEQRAA------AEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKA 136
Query: 98 EEAEKKRQAMMAALKEQTNKSK 119
E +K+ A + +
Sbjct: 137 EAEAEKKAKEEAKKQAEEEAKA 158
Score = 35.2 bits (81), Expect = 0.045
Identities = 20/81 (24%), Positives = 34/81 (41%)
Query: 37 NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
+ E E K+ KE+ K+ A+ + +KK E +++ E E K + + + K +
Sbjct: 132 AKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKA 191
Query: 97 LEEAEKKRQAMMAALKEQTNK 117
E K A A E K
Sbjct: 192 EEAKAKAEAAKAKAAAEAAAK 212
Score = 31.0 bits (70), Expect = 0.89
Identities = 20/83 (24%), Positives = 42/83 (50%)
Query: 37 NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
KE+E K+L+++ + + RA E+ + + ++ + + E+ + EEK+++
Sbjct: 61 PAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQ 120
Query: 97 LEEAEKKRQAMMAALKEQTNKSK 119
EEA+ K+ A A E + K
Sbjct: 121 AEEAKAKQAAEAKAKAEAEAEKK 143
Score = 30.2 bits (68), Expect = 1.6
Identities = 18/80 (22%), Positives = 34/80 (42%)
Query: 40 AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
K+ E+ + K +K +++ A+ K + E + + + EE K + K + E
Sbjct: 149 KKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAE 208
Query: 100 AEKKRQAMMAALKEQTNKSK 119
A K +A AA + K
Sbjct: 209 AAAKAEAEAAAAAAAEAERK 228
Score = 30.2 bits (68), Expect = 1.7
Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 9/90 (10%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ--RKKEEEERRQREVEEKKQRDLEEKRQ 95
++ K+ E+ K + +AK K AE E+K + +K+ EEE + + E K++ E K++
Sbjct: 116 EKQKQAEEAKAKQAAEAKAK-AEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKK 174
Query: 96 RL------EEAEKKRQAMMAALKEQTNKSK 119
EA+ K +A A K + K+K
Sbjct: 175 AEAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204
Score = 29.8 bits (67), Expect = 2.0
Identities = 17/65 (26%), Positives = 38/65 (58%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
+ +Q K+ + E+E++ ++ EE +QR E+ +Q++LE++ + A++ QA
Sbjct: 54 RIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQ 113
Query: 110 ALKEQ 114
A ++Q
Sbjct: 114 AEEKQ 118
Score = 29.8 bits (67), Expect = 2.4
Identities = 31/114 (27%), Positives = 55/114 (48%), Gaps = 6/114 (5%)
Query: 11 PEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQK---- 66
E K++ A+ + E + E ++A++ K+KQA+ + E K
Sbjct: 78 EEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAE 137
Query: 67 -MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSK 119
A++K +EE ++Q E EE K + E +++ EA+KK +A A E K+K
Sbjct: 138 AEAEKKAKEEAKKQAE-EEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKAKAK 190
Score = 28.6 bits (64), Expect = 6.2
Identities = 16/80 (20%), Positives = 45/80 (56%), Gaps = 7/80 (8%)
Query: 37 NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR-- 94
N+ ++++ + ++++ K + E++ + A++++ E+ RQ+E+E++ + K+
Sbjct: 53 NRIQQQKKPAAKKEQERQK----KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAE 108
Query: 95 -QRLEEAEKKRQAMMAALKE 113
+ EK++QA A K+
Sbjct: 109 QAAKQAEEKQKQAEEAKAKQ 128
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 39.5 bits (93), Expect = 0.002
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 7/84 (8%)
Query: 41 KEEEDLKRLKEKQAKRKVM--RAEEEQKMAQRKKEE--EERRQREVEE---KKQRDLEEK 93
KE E LK+ +AK +V RAE E+++ +R+ E ERR + EE +K L++K
Sbjct: 43 KEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKK 102
Query: 94 RQRLEEAEKKRQAMMAALKEQTNK 117
+ LE+ EK+ L E+ +
Sbjct: 103 EENLEKKEKELSNKEKNLDEKEEE 126
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 39.3 bits (92), Expect = 0.002
Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 21/120 (17%)
Query: 13 FVKRQEAKSSALDEQLKEYIVE-----WRNQRAKEE----EDLKRLKEKQAKRKVMRAEE 63
+K+ + A D +L+E E QR K + E + RLK+ + A+
Sbjct: 316 MLKKARSALDACDRELEEKAREVSELKMERQRKKPQIDELESIVRLKQAE-------ADM 368
Query: 64 EQKMAQRKKEEEERRQREVEEKKQRDLEE-----KRQRLEEAEKKRQAMMAALKEQTNKS 118
Q A + E ER QR K ++ EE + RL EAE++RQ + LK Q
Sbjct: 369 FQLKADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEESH 428
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 38.9 bits (91), Expect = 0.003
Identities = 39/169 (23%), Positives = 72/169 (42%), Gaps = 16/169 (9%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR- 94
R + AKE E + + +K+A+ K+ A E K + E ++RQ E+++ EE+R
Sbjct: 28 RQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLR-ERNQQRQEARREREELQREEERL 86
Query: 95 QRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
+ EE R + L+ Q + + LS +LE + ++Q++ E
Sbjct: 87 VQKEEQLDARAEKLDNLENQLEEREK------------ALSARELELEELEKQLDNELYR 134
Query: 155 ALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQDY 203
+ + + L A+ E VK ++ DLE +K Q+
Sbjct: 135 VAGLTPEQARKL--LLKLLDAELEEEKAQRVKKIEEEADLEAERKAQNI 181
Score = 35.1 bits (81), Expect = 0.052
Identities = 35/152 (23%), Positives = 61/152 (40%), Gaps = 13/152 (8%)
Query: 27 QLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERR-------Q 79
+L + + KE E+ +AK ++R +Q+ R++ EE +R +
Sbjct: 30 RLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLVQK 89
Query: 80 REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKG-----PNFT-IQRREPGLT 133
E + + L+ +LEE EK A L+E + T Q R+ L
Sbjct: 90 EEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLK 149
Query: 134 LSTAQLERNKTKEQIEEEKKIALNIRIKPLNV 165
L A+LE K + + E++ L K N+
Sbjct: 150 LLDAELEEEKAQRVKKIEEEADLEAERKAQNI 181
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 39.3 bits (92), Expect = 0.003
Identities = 41/191 (21%), Positives = 81/191 (42%), Gaps = 28/191 (14%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
+ ++ + A E+L+ + E ++ + E+L L+EK + K E+ +
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELK------EELESLEA 358
Query: 72 KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
+ EE + E E + +LEE+ + L + + +A+L + I+R E
Sbjct: 359 ELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE----------IERLEA- 407
Query: 132 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 191
L + R + +++IEE K +K +LQA+ EL E + +L+ +
Sbjct: 408 -RLERLEDRRERLQQEIEELLKKLEEAELK----------ELQAELEELEEELEELQEEL 456
Query: 192 YDLEERQKRQD 202
LEE +
Sbjct: 457 ERLEEALEELR 467
Score = 38.1 bits (89), Expect = 0.008
Identities = 43/201 (21%), Positives = 86/201 (42%), Gaps = 18/201 (8%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ-------- 65
++ + L E+LKE E A+ +E ++L+E + + + E E+
Sbjct: 234 LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYAL 293
Query: 66 KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTI 125
+ E++++ RE +R LEE +LEE E K + L E K +
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEK-------L 346
Query: 126 QRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIV 185
+ + L A+LE + + + E + L +++ L + + +L+ + L I
Sbjct: 347 EELKEELESLEAELEELEAELEELESRLEELEEQLETLRSK---VAQLELQIASLNNEIE 403
Query: 186 KLETDKYDLEERQKRQDYDVS 206
+LE LE+R++R ++
Sbjct: 404 RLEARLERLEDRRERLQQEIE 424
Score = 32.3 bits (74), Expect = 0.52
Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 6/95 (6%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
E+L E I + + EE ++ L+ + RA E+ +A + E EE +
Sbjct: 848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEEL------SE 901
Query: 86 KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKG 120
+ R+LE KR L ++ + +A L+ + +
Sbjct: 902 ELRELESKRSELRRELEELREKLAQLELRLEGLEV 936
Score = 31.2 bits (71), Expect = 1.0
Identities = 41/181 (22%), Positives = 82/181 (45%), Gaps = 8/181 (4%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
EQL++ + E Q + +DL RL+ + + + A+ +++ + + E EE +R E +
Sbjct: 715 EQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAE 774
Query: 86 KQRD-LEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK- 143
++ E + + LE ++ + + AL+E ++ + + L LER
Sbjct: 775 EELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIA 834
Query: 144 -TKEQIE--EEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKR 200
T+ ++E EE+ L+ I+ L E I +L+ EL + L ++ LEE
Sbjct: 835 ATERRLEDLEEQIEELSEDIESLAAE---IEELEELIEELESELEALLNERASLEEALAL 891
Query: 201 Q 201
Sbjct: 892 L 892
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 38.9 bits (91), Expect = 0.005
Identities = 41/214 (19%), Positives = 90/214 (42%), Gaps = 19/214 (8%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
E ++ + + E+LKE I + + + E L+ L++ A+ + + E E+K++
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371
Query: 72 KEEEERRQREVEE---------KKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPN 122
+E EE + EE + + +LEE ++ +E E++ + + L++ + K
Sbjct: 372 EELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELE 431
Query: 123 FTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWE 182
+ E L E + +EQ+EE + + + + +LQ + L +
Sbjct: 432 --AELEELQTELEELNEELEELEEQLEELRDRLKELERE--------LAELQEELQRLEK 481
Query: 183 TIVKLETDKYDLEERQKRQDYDVSKIIINKKIMP 216
+ LE LE Q+ + + + +P
Sbjct: 482 ELSSLEARLDRLEAEQRASQGVRAVLEALESGLP 515
Score = 38.9 bits (91), Expect = 0.005
Identities = 43/214 (20%), Positives = 96/214 (44%), Gaps = 36/214 (16%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
++ + + E+L+E + E + + + EE+L+ L+E AK K E E+K ++E
Sbjct: 735 LQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEE 794
Query: 74 EEERRQ-------------------REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
EE + + E+ ++++EE + +EE E+K + L+E
Sbjct: 795 LEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854
Query: 115 TNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQ 174
+ E L + E+ + +++++E ++ + L E + +L+
Sbjct: 855 EKE---------LEELKEELEELEAEKEELEDELKELEE-----EKEELEEE---LRELE 897
Query: 175 AKATELWETIVKLETDKYDLEERQKRQDYDVSKI 208
++ EL E I KL +LE + +R + ++ ++
Sbjct: 898 SELAELKEEIEKLRERLEELEAKLERLEVELPEL 931
Score = 35.5 bits (82), Expect = 0.056
Identities = 41/187 (21%), Positives = 87/187 (46%), Gaps = 6/187 (3%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
+ + + + + + +E + + K E + +Q + ++ EEE + + + E
Sbjct: 695 NELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELE 754
Query: 74 EEERRQREVEE---KKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREP 130
E + R E+EE + L + ++ +EE E+KRQA+ L+E + + + E
Sbjct: 755 ELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALER 814
Query: 131 GLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETD 190
L + ER + + + EE+ L ++ L E + +L+ + EL E + +LE +
Sbjct: 815 ELESLEQRRERLEQEIEELEEEIEELEEKLDELEEE---LEELEKELEELKEELEELEAE 871
Query: 191 KYDLEER 197
K +LE+
Sbjct: 872 KEELEDE 878
Score = 35.1 bits (81), Expect = 0.068
Identities = 37/176 (21%), Positives = 78/176 (44%), Gaps = 5/176 (2%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEE-EQKMAQRKK-EEEERRQREVEEKKQRDLEEK 93
Q K E + E + + + ++ + ++ EEE R E E+ Q +LEE
Sbjct: 206 ERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEA 265
Query: 94 RQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
+ +EE + + + + L+E + I+ E ++L +LE + + + EE+
Sbjct: 266 EKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERL 325
Query: 154 IALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQDYDVSKII 209
L +I+ L E + + + EL + + +LE K +LEE+ ++ ++
Sbjct: 326 EELKEKIEALKEE---LEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELF 378
Score = 33.9 bits (78), Expect = 0.15
Identities = 38/191 (19%), Positives = 87/191 (45%), Gaps = 15/191 (7%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
E +KR+ A EQL+ + E + + EE+L+ L+E+ + EEE + +
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEE-----LEEELESLEEA 773
Query: 72 KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
+ + E+EEK+Q L+E+ + LEE ++ + + AL+ + + +R
Sbjct: 774 LAKLKEEIEELEEKRQA-LQEELEELEEELEEAERRLDALERELESLE-----QRRERLE 827
Query: 132 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 191
+ + E + +E+++E ++ + + ++ +L+ E E +L+ +
Sbjct: 828 QEIEELEEEIEELEEKLDELEEELEELEKELEELKE----ELEELEAEKEELEDELKELE 883
Query: 192 YDLEERQKRQD 202
+ EE ++
Sbjct: 884 EEKEELEEELR 894
Score = 33.5 bits (77), Expect = 0.19
Identities = 35/190 (18%), Positives = 89/190 (46%), Gaps = 5/190 (2%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKEK--QAKRKVMRAEEEQKMAQRKKEEEERRQREVE 83
+L+E + E + + + E++++ LK + + + ++ +EE + + EE E + +
Sbjct: 249 SRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEG-EISLL 307
Query: 84 EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK 143
++ +LE + + LEE ++ + + ALKE+ + + +++ L + +LE
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367
Query: 144 TKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQDY 203
+EE +++ +R + L + +++ + EL I LE L ER +
Sbjct: 368 -SALLEELEELFEALREE-LAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKE 425
Query: 204 DVSKIIINKK 213
++ ++ +
Sbjct: 426 ELKELEAELE 435
Score = 33.1 bits (76), Expect = 0.25
Identities = 45/196 (22%), Positives = 89/196 (45%), Gaps = 15/196 (7%)
Query: 17 QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
Q+ + L+E+L E Q K EE+LK LK + + + E +++ + +++ EE
Sbjct: 665 QKRELKELEEELAEL----EAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEE 720
Query: 77 RR------QREVEEKKQR--DLEEKRQRLEEAEKKRQAMMAALKEQTNK--SKGPNFTIQ 126
+ + E+E+ + R +LEE+ + LEE ++ Q + L+E+ +
Sbjct: 721 LKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEE 780
Query: 127 RREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVK 186
E Q E + +E++EE ++ + + L +L+ + EL E I +
Sbjct: 781 IEELEEKRQALQEELEELEEELEEAERRLDALE-RELESLEQRRERLEQEIEELEEEIEE 839
Query: 187 LETDKYDLEERQKRQD 202
LE +LEE + +
Sbjct: 840 LEEKLDELEEELEELE 855
Score = 30.5 bits (69), Expect = 1.6
Identities = 24/101 (23%), Positives = 50/101 (49%), Gaps = 1/101 (0%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
+++ + L+ +L+ VE + EE+ + E + +R++ R EEE +
Sbjct: 907 IEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNL 966
Query: 74 EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
EVEE+ +L+ +R+ LEEA++K ++ L ++
Sbjct: 967 RAIEEYEEVEERY-EELKSQREDLEEAKEKLLEVIEELDKE 1006
Score = 29.3 bits (66), Expect = 3.7
Identities = 37/171 (21%), Positives = 75/171 (43%), Gaps = 17/171 (9%)
Query: 39 RAKEEEDLKRLKE----KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE-----KKQRD 89
AK EE K ++E + K + AE + + + E E E+E+ ++Q +
Sbjct: 151 NAKPEERRKLIEEAAGVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAE 210
Query: 90 LEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE 149
E+ Q L+ ++ + + K + + + ++ E L+ +LE + + +
Sbjct: 211 KAERYQELKAELRELELALLLAKLKELRKE-----LEELEEELSRLEEELEELQEELEEA 265
Query: 150 EEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKR 200
E++ L ++ L E + +LQ + EL E I +LE + L ER +
Sbjct: 266 EKEIEELKSELEELREE---LEELQEELLELKEEIEELEGEISLLRERLEE 313
Score = 28.1 bits (63), Expect = 8.5
Identities = 28/132 (21%), Positives = 65/132 (49%), Gaps = 5/132 (3%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
E+ + + + E+ +RL+++ + + E E+K+ + ++E EE + E K
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELK 862
Query: 86 KQRD-LEEKRQRLE----EAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLE 140
++ + LE +++ LE E E++++ + L+E ++ I++ L A+LE
Sbjct: 863 EELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLE 922
Query: 141 RNKTKEQIEEEK 152
R + + EE+
Sbjct: 923 RLEVELPELEEE 934
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 37.0 bits (86), Expect = 0.010
Identities = 22/65 (33%), Positives = 41/65 (63%), Gaps = 2/65 (3%)
Query: 42 EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
EEE+L L+++Q + + R E ++ EE ERR+RE +E++++ +E++QR +E
Sbjct: 150 EEEELAELRQQQRQFEQRRNAE--LAETQRLEEAERRRREEKERRKKQDKERKQREKETA 207
Query: 102 KKRQA 106
+K A
Sbjct: 208 EKIAA 212
Score = 30.0 bits (68), Expect = 1.8
Identities = 18/61 (29%), Positives = 37/61 (60%)
Query: 53 QAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
QA +V+ EE ++ Q++++ E+RR E+ E ++ + E+R+R E+ +K+Q +
Sbjct: 143 QALLEVLEEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQR 202
Query: 113 E 113
E
Sbjct: 203 E 203
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 37.0 bits (86), Expect = 0.011
Identities = 20/106 (18%), Positives = 49/106 (46%), Gaps = 5/106 (4%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
+ KE + + R + ++ +E++++ +E+ + K A+ ++ + E+RR ++ EK
Sbjct: 67 ARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQL-EK 125
Query: 86 KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
Q +++ R +L + L + K F ++R G
Sbjct: 126 LQDEIKRTRSKLNALHSLLAEKRSFLCRELAKL----FPLRRVIRG 167
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 37.0 bits (85), Expect = 0.015
Identities = 36/156 (23%), Positives = 78/156 (50%), Gaps = 4/156 (2%)
Query: 2 KTDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRA 61
K D A E K EAK A + + K + + + AK+ + + + + A + A
Sbjct: 1303 KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAA 1362
Query: 62 EEEQKMAQRKKEEEERRQREVEEKKQ--RDLEEKRQRLEEAEKKRQAM--MAALKEQTNK 117
EE+ + A++KKEE +++ ++K + + +E +++ EE +KK + AA K++ ++
Sbjct: 1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADE 1422
Query: 118 SKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
+K ++ + + + ++ K++ EE KK
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK 1458
Score = 37.0 bits (85), Expect = 0.019
Identities = 40/153 (26%), Positives = 75/153 (49%), Gaps = 1/153 (0%)
Query: 2 KTDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKE-KQAKRKVMR 60
K D A E K EAK A + + + + + K EE K+ +E K+A +
Sbjct: 1419 KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKK 1478
Query: 61 AEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKG 120
AEE +K + KK+ EE +++ E KK + ++K ++AE+ ++A A E+ K+
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADE 1538
Query: 121 PNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
++++ +L++ + K++ EE KK
Sbjct: 1539 AKKAEEKKKADELKKAEELKKAEEKKKAEEAKK 1571
Score = 36.3 bits (83), Expect = 0.030
Identities = 42/162 (25%), Positives = 82/162 (50%), Gaps = 12/162 (7%)
Query: 16 RQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEE 75
+ +A+ E+ K+ + + + + A+E++ + LK+ + + K+ AEE +K + KK+ E
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678
Query: 76 ERRQREVEEKK-----QRDLEEKRQ----RLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
E ++ E +EKK +++ EE ++ + +EAE+K++A E+ NK K +
Sbjct: 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKE 1738
Query: 127 RREPGLTLSTAQL---ERNKTKEQIEEEKKIALNIRIKPLNV 165
E A+ E+ K +EE+K A IR + V
Sbjct: 1739 AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAV 1780
Score = 36.3 bits (83), Expect = 0.034
Identities = 29/106 (27%), Positives = 54/106 (50%), Gaps = 6/106 (5%)
Query: 15 KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMR--AEEEQKMAQRKK 72
+EAK + D++ E + K E LK+ E+ K + ++ EE+K A+ K
Sbjct: 1663 AAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELK 1722
Query: 73 EEEERRQREVEEKKQRDLEEKRQ----RLEEAEKKRQAMMAALKEQ 114
+ EE + + EE K+ E+K++ + +E EKK+ A + +E+
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
Score = 34.3 bits (78), Expect = 0.12
Identities = 29/112 (25%), Positives = 59/112 (52%)
Query: 2 KTDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRA 61
K D A E K EAK A + + K + + K+ ++ K+ +E + + +A
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKA 1530
Query: 62 EEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
EE +K + KK EE+++ E+++ ++ E++++ EEA+K + AL++
Sbjct: 1531 EEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582
Score = 32.0 bits (72), Expect = 0.55
Identities = 58/325 (17%), Positives = 122/325 (37%), Gaps = 35/325 (10%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQ 95
+ + AK+ E ++ +E + ++ +AE+ +K +K EEER+ E + + E +
Sbjct: 1171 KAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVK 1230
Query: 96 RLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE--EEKK 153
+ EEA+K + A +E+ N+ + + + E + ++++ EEKK
Sbjct: 1231 KAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKK 1290
Query: 154 IALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQDYDVSKIIINKK 213
A + + K +A + E K E K + +K+ + K
Sbjct: 1291 KADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350
Query: 214 IMPSPGQGRPGFNPRYVQTPNLFNNATALTGKYPPKIQVASKYERRVDTRSYDDKKKLFE 273
+ A++ + + ++ KK +
Sbjct: 1351 -----------------------------EAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
Query: 274 GGYLSLTKESLEKQYAEKVDQFGNRQKTKLPKWFGERPGKKKDAPESPEEEEVKRDDEEE 333
+ K++ EK+ A++ + K K + KKK + EE K+ DE +
Sbjct: 1382 ----AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAK 1437
Query: 334 ENPIYLKISDDEEEKEEENFMSNNA 358
+ K +D+ ++K EE + A
Sbjct: 1438 KKAEEAKKADEAKKKAEEAKKAEEA 1462
Score = 31.6 bits (71), Expect = 0.73
Identities = 32/150 (21%), Positives = 67/150 (44%), Gaps = 5/150 (3%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRK---VMRAEEEQKMA 68
E K E A +E+ E I ++ R + + + RK + +AEE++K
Sbjct: 1234 EAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293
Query: 69 QRKKEEEERRQREVEEK--KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
+ KK EE+++ E ++K + + +E +++ EEA+KK A +E ++ +
Sbjct: 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
Query: 127 RREPGLTLSTAQLERNKTKEQIEEEKKIAL 156
+ + E + K++ ++K A
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKKADAA 1383
Score = 29.7 bits (66), Expect = 3.0
Identities = 28/140 (20%), Positives = 65/140 (46%), Gaps = 1/140 (0%)
Query: 15 KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ-KMAQRKKE 73
++ + + + E + R +E L ++K + +AEE + K + KK
Sbjct: 1569 AKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA 1628
Query: 74 EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLT 133
EEE+++ E +KK+ + ++K + L++AE++ + A ++ + K ++ E
Sbjct: 1629 EEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Query: 134 LSTAQLERNKTKEQIEEEKK 153
+ L++ + + EE K
Sbjct: 1689 KAAEALKKEAEEAKKAEELK 1708
Score = 29.7 bits (66), Expect = 3.5
Identities = 28/132 (21%), Positives = 61/132 (46%), Gaps = 8/132 (6%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE--EKKQRDLEEK 93
+ + A++ ED K+ + + +V +AEE +K +K E R+ E E+ ++ + K
Sbjct: 1165 KAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAK 1224
Query: 94 R----QRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE 149
+ ++ EEA+K + A +E+ N+ + + + E + ++++
Sbjct: 1225 KAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELK 1284
Query: 150 --EEKKIALNIR 159
EEKK A +
Sbjct: 1285 KAEEKKKADEAK 1296
Score = 29.3 bits (65), Expect = 4.7
Identities = 36/158 (22%), Positives = 73/158 (46%), Gaps = 7/158 (4%)
Query: 2 KTDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMR- 60
K D A E K +EAK A + + + + + K +E K+ +E + K +
Sbjct: 1445 KADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKK 1504
Query: 61 AEEEQKMAQRKKEEEERRQ----REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
A E +K A K+ EE ++ ++ EE K+ D +K + ++A++ ++A E+
Sbjct: 1505 AAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK 1564
Query: 117 KSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
K++ + + + L A+ + + +IEE K+
Sbjct: 1565 KAEEAKKAEEDKN--MALRKAEEAKKAEEARIEEVMKL 1600
Score = 29.0 bits (64), Expect = 6.0
Identities = 25/117 (21%), Positives = 52/117 (44%), Gaps = 3/117 (2%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
+ AK+ E K + ++A+ +AE+ +K + +K E+ R+ E + + E ++
Sbjct: 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKA 1160
Query: 98 EEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
E+A K +A A E K++ + R+ + + + EEE+K
Sbjct: 1161 EDARKAEEARKA---EDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKA 1214
Score = 29.0 bits (64), Expect = 6.3
Identities = 25/132 (18%), Positives = 58/132 (43%), Gaps = 3/132 (2%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERR---QREV 82
E+ K+ + K EE K+ ++ + + +AE+ +K + +K E+ +R R+
Sbjct: 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKA 1160
Query: 83 EEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERN 142
E+ ++ + K + ++AE R+A E+ K++ R+ + +
Sbjct: 1161 EDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKA 1220
Query: 143 KTKEQIEEEKKI 154
+ ++ E KK
Sbjct: 1221 EDAKKAEAVKKA 1232
Score = 28.2 bits (62), Expect = 9.7
Identities = 25/104 (24%), Positives = 53/104 (50%), Gaps = 4/104 (3%)
Query: 15 KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ--RKK 72
K E K +A ++ + + ++ K +E K+ +E + + + EE K A+ +KK
Sbjct: 1406 KADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465
Query: 73 EEEERRQREVEEK--KQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
EE ++ E ++K + + +E +++ EEA+KK A + +
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAK 1509
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 36.9 bits (86), Expect = 0.017
Identities = 26/119 (21%), Positives = 49/119 (41%), Gaps = 19/119 (15%)
Query: 19 AKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEK---------------QAKRKVMRAEE 63
AK+S L+E+L++ + + +R + E L++ + + AKR+ M E
Sbjct: 981 AKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQ-MLQEL 1039
Query: 64 EQKMAQ---RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSK 119
+Q++ E R R ++ L R R + EK+ A + T K +
Sbjct: 1040 KQELQDLGVPADSGAEERARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLR 1098
Score = 30.7 bits (70), Expect = 1.5
Identities = 39/185 (21%), Positives = 66/185 (35%), Gaps = 44/185 (23%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEE---------DLKRLKEKQAKRKVMRAE-- 62
V R EA A +E + R QR E+ +L++ +Q + + + AE
Sbjct: 489 VSRSEAWDVA-----RELLRRLREQRHLAEQLQQLRMRLSELEQRLRQQQRAERLLAEFC 543
Query: 63 -----------------EEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKR- 104
EE + E RE ++ LE+ + R++ +
Sbjct: 544 KRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQRLAARAP 603
Query: 105 -----QAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE-EEKKIALNI 158
Q +A L+EQ+ G F + + ER T E+ E +K AL+
Sbjct: 604 AWLAAQDALARLREQS----GEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDE 659
Query: 159 RIKPL 163
I+ L
Sbjct: 660 EIERL 664
Score = 30.3 bits (69), Expect = 2.2
Identities = 24/93 (25%), Positives = 37/93 (39%), Gaps = 10/93 (10%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK----KEEEERRQREVEEKKQRDLE 91
R Q A E+ L + + A+ ++ EQ + RQ+E E+ Q DLE
Sbjct: 299 RRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQEKIERYQADLE 358
Query: 92 EKRQRLEEAE------KKRQAMMAALKEQTNKS 118
E +RLEE ++Q A E +
Sbjct: 359 ELEERLEEQNEVVEEADEQQEENEARAEAAEEE 391
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 36.5 bits (85), Expect = 0.021
Identities = 26/107 (24%), Positives = 45/107 (42%), Gaps = 4/107 (3%)
Query: 53 QAKRKVMRAEEEQKMAQRKKEE-EERRQREVEEKKQRDLEEKR---QRLEEAEKKRQAMM 108
QAK ++ E+E+K A+ K E R+ R EK R+ K+ R + + A +
Sbjct: 433 QAKAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAAL 492
Query: 109 AALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIA 155
A +K + + P P + A E K + + + +K A
Sbjct: 493 ARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQA 539
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 36.2 bits (83), Expect = 0.022
Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 13/83 (15%)
Query: 21 SSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQR 80
++ + +E E K E+++ RLKE+ +++ R+ EE ERR
Sbjct: 348 YEDIEYETEENREEGTQAE-KYEQEIARLKERL------------RVSVRRLEEYERRLL 394
Query: 81 EVEEKKQRDLEEKRQRLEEAEKK 103
E++ Q+ L+E + RLE++E++
Sbjct: 395 GQEQQMQKLLQEYQARLEDSEER 417
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 36.6 bits (85), Expect = 0.023
Identities = 26/134 (19%), Positives = 61/134 (45%), Gaps = 11/134 (8%)
Query: 26 EQLKEYIVEWRNQRA---KEEEDLK-RLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
++L+E ++ + Q KE E+L + +E + + + + A ++ ++ER + E
Sbjct: 836 QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895
Query: 82 VE----EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTA 137
+ E+K +LE + ++ + + +A + AL+E+ P LS
Sbjct: 896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE-LSEIEDPKGEDEEIPEEELSLE 954
Query: 138 QLERNKTKEQIEEE 151
++ +++EEE
Sbjct: 955 DVQ--AELQRVEEE 966
Score = 33.9 bits (78), Expect = 0.18
Identities = 42/191 (21%), Positives = 81/191 (42%), Gaps = 11/191 (5%)
Query: 9 GDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMA 68
+ E V+ + + ++ ++ + R +R K E LKEK+ +E++ +
Sbjct: 177 EELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALE 236
Query: 69 QRKKEEEERRQ--REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
++K+ E + E EK ++ E +RLEE E+ + + +K+ + ++
Sbjct: 237 RQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKD---LGEEEQLRVK 293
Query: 127 RREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVK 186
+ L A LER+ +++ E E R+ L E I KL A+ EL I +
Sbjct: 294 EKIGELEAEIASLERSIAEKERELED---AEERLAKLEAE---IDKLLAEIEELEREIEE 347
Query: 187 LETDKYDLEER 197
+ L E
Sbjct: 348 ERKRRDKLTEE 358
Score = 32.7 bits (75), Expect = 0.40
Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 8/105 (7%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDL---KRLKEKQAKRKVMRAEEEQKM-AQ 69
+ +AK AL+E+L E EE+L E Q + +RA E M A
Sbjct: 919 LSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAI 978
Query: 70 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAE----KKRQAMMAA 110
++ EE +R E++EK+ + EE++ LE E KKR+ M A
Sbjct: 979 QEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEA 1023
Score = 32.0 bits (73), Expect = 0.68
Identities = 28/198 (14%), Positives = 80/198 (40%), Gaps = 9/198 (4%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
E ++++E K E+L+E + + + +LK L+ + + + + E+ + +
Sbjct: 726 EQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLE 785
Query: 72 KEEEERRQREVEEKKQ------RDLEEKRQRLEEAEKKRQAMMAALKE--QTNKSKGPNF 123
R E++ + +E + + +E+ + L++ Q + + +
Sbjct: 786 ARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDL 845
Query: 124 TIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWET 183
Q + + ++ + +E++EE + ++ + +++ +L+A+ EL
Sbjct: 846 KEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKK-ERDELEAQLRELERK 904
Query: 184 IVKLETDKYDLEERQKRQ 201
I +LE +R
Sbjct: 905 IEELEAQIEKKRKRLSEL 922
Score = 30.8 bits (70), Expect = 1.4
Identities = 21/97 (21%), Positives = 44/97 (45%), Gaps = 5/97 (5%)
Query: 15 KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLK----RLKEKQAKRKVMRAEEEQKMAQR 70
KR+ + ++L E + + A E + ++K + K + EQ A
Sbjct: 405 KRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADL 464
Query: 71 KKEEEERRQREVE-EKKQRDLEEKRQRLEEAEKKRQA 106
K E+E + E ++ +++L + ++ L EAE + +A
Sbjct: 465 SKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARA 501
Score = 30.0 bits (68), Expect = 2.4
Identities = 47/219 (21%), Positives = 85/219 (38%), Gaps = 34/219 (15%)
Query: 10 DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
+P ++R + L +L E R R + D + A RK+ E+E + +
Sbjct: 672 EPAELQRLRERLEGLKRELSSLQSELR--RIENRLDELSQELSDASRKIGEIEKEIEQLE 729
Query: 70 RKKEEEERRQREVEEK---KQRDLEEKRQRLEEAEKKRQAMMAAL--------------- 111
+++E+ + R E+EE ++++E + L+E E + + + L
Sbjct: 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLS 789
Query: 112 ----KEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK-------IALNIRI 160
E + + R E L +L R +++ E++ I L +I
Sbjct: 790 HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQI 849
Query: 161 KPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQK 199
K + E I L K EL E + +LE DLE R
Sbjct: 850 KSIEKE---IENLNGKKEELEEELEELEAALRDLESRLG 885
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 36.1 bits (84), Expect = 0.023
Identities = 29/150 (19%), Positives = 51/150 (34%), Gaps = 32/150 (21%)
Query: 26 EQLKEYIVEW-RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE- 83
E+ + R K + QA+ +AE + + E E V
Sbjct: 120 EEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQCEAEGTGGTGVAG 179
Query: 84 -----EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQ 138
++K+ L+ + RLE + + A +A L+ Q A
Sbjct: 180 KGPVYKEKREKLDAAQARLETLKARLDAAIAQLEAQ---------------------KAA 218
Query: 139 LERNKTKEQIEEEKKIA----LNIRIKPLN 164
LERN+ E++ +I L R++ L
Sbjct: 219 LERNRQAAVAEKQARIEANDGLLARLEALG 248
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 36.0 bits (83), Expect = 0.027
Identities = 18/46 (39%), Positives = 32/46 (69%)
Query: 69 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
+R +E+ ER QRE +++ + LE +R+ EEA K+R+A+ AA + +
Sbjct: 6 RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSR 51
Score = 33.3 bits (76), Expect = 0.23
Identities = 20/63 (31%), Positives = 34/63 (53%), Gaps = 5/63 (7%)
Query: 52 KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 111
++A+ K+ R + E+K + K E ER+ +E E KQR+ E QR ++ A+ A +
Sbjct: 6 RRAREKLEREQRERKQRAKLKLERERKAKE-EAAKQREAIEAAQR----SRRLDAIEAQI 60
Query: 112 KEQ 114
K
Sbjct: 61 KAD 63
Score = 32.2 bits (73), Expect = 0.49
Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 11/78 (14%)
Query: 34 EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRD--LE 91
E R R K E + + K++ AK ++ E E RK +EE +QRE E QR L+
Sbjct: 4 ELRRAREKLEREQRERKQR-AK---LKLERE-----RKAKEEAAKQREAIEAAQRSRRLD 54
Query: 92 EKRQRLEEAEKKRQAMMA 109
+++ ++ ++++ A
Sbjct: 55 AIEAQIKADQQMQESLQA 72
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 34.5 bits (80), Expect = 0.029
Identities = 22/110 (20%), Positives = 60/110 (54%), Gaps = 14/110 (12%)
Query: 10 DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEK-QAKRKVMRAEEEQKMA 68
D + V + A +QL++ + + + K+E++L++ ++K Q + + E +
Sbjct: 22 DVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKA-- 79
Query: 69 QRKKEEEERRQREVEEKKQRDLEEKRQRLE-EAEKKRQAMMAALKEQTNK 117
+Q+E++ +KQ++L++K+Q + E ++K+Q ++ + ++ +K
Sbjct: 80 ---------KQQELQ-QKQQELQQKQQAAQQELQQKQQELLQPIYDKIDK 119
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 34.1 bits (79), Expect = 0.037
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 132 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 191
L L A E + + EEEK+ L R PL + S +LQ +L I +L+ ++
Sbjct: 7 LLLLKAAEELEFEQRKKEEEKEKYLAERCPPLRLSL-SRAELQELCKKLHARIDRLDEER 65
Query: 192 YDLEERQKRQDYDV 205
YD+EE+ ++D ++
Sbjct: 66 YDIEEKVAKKDKEI 79
Score = 28.3 bits (64), Expect = 3.0
Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 11/85 (12%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
QR KEEE K L E+ ++ + E Q ++ R ++E+ R +
Sbjct: 20 QRKKEEEKEKYLAERCPPLRLSLSRAEL---QELCKKLHARIDRLDEE--------RYDI 68
Query: 98 EEAEKKRQAMMAALKEQTNKSKGPN 122
EE K+ + LK++ N +G
Sbjct: 69 EEKVAKKDKEIEDLKKKVNDLRGKF 93
Score = 27.9 bits (63), Expect = 4.5
Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 4/81 (4%)
Query: 39 RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLE 98
RA+ +E K+L + + R + E+K+A++ KE E+ +++ + + + K+ L+
Sbjct: 44 RAELQELCKKLHARIDRLDEERYDIEEKVAKKDKEIEDLKKKVNDLRGKF----KKPTLK 99
Query: 99 EAEKKRQAMMAALKEQTNKSK 119
+ K AM+ AL +K
Sbjct: 100 KVRKSADAMLKALLGSKHKVS 120
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 36.0 bits (83), Expect = 0.037
Identities = 17/41 (41%), Positives = 26/41 (63%)
Query: 40 AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQR 80
AKEE + +R E+Q +R+ +A E AQ K E E+RR++
Sbjct: 262 AKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREK 302
Score = 34.8 bits (80), Expect = 0.083
Identities = 14/63 (22%), Positives = 37/63 (58%), Gaps = 6/63 (9%)
Query: 41 KEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 100
++ +L++L +++A+R+ E+ ++ ++ E +R Q + E +E++R++L+
Sbjct: 253 EKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAE------VEKRREKLQNL 306
Query: 101 EKK 103
KK
Sbjct: 307 LKK 309
Score = 31.4 bits (71), Expect = 1.0
Identities = 17/58 (29%), Positives = 34/58 (58%), Gaps = 6/58 (10%)
Query: 61 AEEEQKMAQRKKEEEERRQREVEEKKQRDLE------EKRQRLEEAEKKRQAMMAALK 112
EE+++ ++ +EE R+R+ EE+++R+ E ++ Q E EK+R+ + LK
Sbjct: 251 LEEKRRELEKLAKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNLLK 308
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 35.4 bits (81), Expect = 0.043
Identities = 26/127 (20%), Positives = 54/127 (42%), Gaps = 11/127 (8%)
Query: 24 LDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE 83
+ E K I + + + K +E+ + K+ +A ++ +K A+ K+ E ++++ V
Sbjct: 275 VAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVA 334
Query: 84 EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK 143
E Q+ + A +L E S P + ++ +P LS L K
Sbjct: 335 EDLQKTKPQVE-----------AQPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDLK 383
Query: 144 TKEQIEE 150
T+ ++ E
Sbjct: 384 TEVRLRE 390
Score = 33.4 bits (76), Expect = 0.18
Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 2/84 (2%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
QR + + + K +E++++A+ +K E E+ Q E+++ + L+ K +
Sbjct: 246 QRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKA 305
Query: 98 EEA--EKKRQAMMAALKEQTNKSK 119
+ E K A KE + K
Sbjct: 306 FDLKQESKASEKEAEDKELEAQKK 329
Score = 31.5 bits (71), Expect = 0.69
Identities = 13/84 (15%), Positives = 42/84 (50%), Gaps = 2/84 (2%)
Query: 25 DEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE 84
+ + + + + ++E+ + +++ +K + A++ Q+ A ++ ++++ EV +
Sbjct: 193 GVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQ 252
Query: 85 KKQ--RDLEEKRQRLEEAEKKRQA 106
K+Q ++L + E K+ A
Sbjct: 253 KQQEAKNLPKPADTSSPKEDKQVA 276
Score = 29.6 bits (66), Expect = 2.7
Identities = 21/138 (15%), Positives = 61/138 (44%), Gaps = 6/138 (4%)
Query: 22 SALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
S D+++ E + E + D+ LKE++++ RA++ ++ +K+ + ++ Q++
Sbjct: 176 SISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQK 235
Query: 82 VE------EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLS 135
+ +K++ ++ +K+Q + K +Q +++ + E
Sbjct: 236 ADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDE 295
Query: 136 TAQLERNKTKEQIEEEKK 153
A ++ +++E K
Sbjct: 296 EALKAKDHKAFDLKQESK 313
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 35.0 bits (81), Expect = 0.044
Identities = 21/81 (25%), Positives = 40/81 (49%)
Query: 18 EAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEER 77
EA+ L E+ KE Q +E +K+L EK + E+++M + K +E+E
Sbjct: 217 EAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQEQEE 276
Query: 78 RQREVEEKKQRDLEEKRQRLE 98
+E + + L+++ Q L+
Sbjct: 277 LLKEGFKTEAESLQKEIQDLK 297
Score = 30.3 bits (69), Expect = 1.3
Identities = 13/60 (21%), Positives = 35/60 (58%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
KEK + + +AE + + +E+++ ++ +E +++ E +Q +E+ E +R+ ++A
Sbjct: 202 KEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLA 261
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 35.0 bits (81), Expect = 0.059
Identities = 14/48 (29%), Positives = 33/48 (68%), Gaps = 6/48 (12%)
Query: 41 KEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQR 88
++E+L+R KE++ A +EQK ++ K++EE++++E+E+ ++
Sbjct: 548 DDKEELQREKEEK------EALKEQKRLRKLKKQEEKKKKELEKLEKA 589
Score = 30.0 bits (68), Expect = 2.6
Identities = 9/39 (23%), Positives = 25/39 (64%)
Query: 62 EEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 100
EE Q+ + K+ +E+++ +K++ +++ ++LE+A
Sbjct: 551 EELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589
Score = 29.2 bits (66), Expect = 3.9
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 69 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQ 105
QR+KEE+E + + +K + EEK+++ E +K +
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 29.2 bits (66), Expect = 4.1
Identities = 13/46 (28%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 52 KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
K ++ ++ E+E+K A K+++ R+ ++ EEKK+++LE+ +
Sbjct: 546 KLDDKEELQREKEEKEAL-KEQKRLRKLKKQEEKKKKELEKLEKAK 590
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 35.0 bits (81), Expect = 0.064
Identities = 25/104 (24%), Positives = 50/104 (48%), Gaps = 10/104 (9%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQR---AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQR 70
+K+ EA+ + L QL+ R Q ++E E ++LKEK ++ E E++ +
Sbjct: 153 IKQLEAELAGLQAQLQAL----RQQLEVISEELEARRKLKEKGLVSRLELLELERE--RA 206
Query: 71 KKEEEERRQREVEEKKQRDLEEKRQRLEEAE-KKRQAMMAALKE 113
+ + E R E +R ++E + ++ E R+ ++ L E
Sbjct: 207 EAQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEVLEELTE 250
Score = 28.4 bits (64), Expect = 7.3
Identities = 37/161 (22%), Positives = 65/161 (40%), Gaps = 25/161 (15%)
Query: 53 QAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
++++ +RA+ E +AQ K+ E E Q L+ RQ+LE ++ + LK
Sbjct: 136 ESRKSTLRAQLELILAQIKQLEAEL------AGLQAQLQALRQQLEVISEELE-ARRKLK 188
Query: 113 EQTNKSKGPNFTIQRREPGLTLSTAQLE--RNKTKEQIEEEKKIALNIR-IKPLNVEGWS 169
E+ S+ ++R +LE K QI+E + L + I+ E
Sbjct: 189 EKGLVSRLELLELERERAEAQGELGRLEAELEVLKRQIDE---LQLERQQIEQTFREEVL 245
Query: 170 IHKLQAKATELWETIVKLETDKYDLEERQKRQDYDVSKIII 210
EL E +L +L ER + + ++II
Sbjct: 246 --------EELTEAQARLA----ELRERLNKARDRLQRLII 274
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 34.9 bits (81), Expect = 0.074
Identities = 24/105 (22%), Positives = 44/105 (41%), Gaps = 14/105 (13%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE---KKQRDLEEKRQRLEEAEKKRQA 106
+++QA+ ++ +K + + E +R ++ K QR+LE+ Q LE+ E +
Sbjct: 527 RQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAE--- 583
Query: 107 MMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEE 151
+ AL+ Q +F Q E + L EQ E
Sbjct: 584 -IEALQAQVA---DADFFSQPHE----QTQKVLADLADAEQELEV 620
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 34.5 bits (80), Expect = 0.079
Identities = 20/74 (27%), Positives = 42/74 (56%)
Query: 33 VEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEE 92
+E+ K + +K++ EK K++ +E++K A K++EE + E E+K++ EE
Sbjct: 399 IEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEE 458
Query: 93 KRQRLEEAEKKRQA 106
+ + EE E++ +
Sbjct: 459 EEEAEEEKEEEEEK 472
Score = 34.5 bits (80), Expect = 0.080
Identities = 19/81 (23%), Positives = 45/81 (55%), Gaps = 2/81 (2%)
Query: 42 EEEDLKRL--KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
EE+++ L +K K+ E+ +K + +K+E++++ ++K++ + EEK ++ EE
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 100 AEKKRQAMMAALKEQTNKSKG 120
E++ + +E+ K K
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKK 475
Score = 29.5 bits (67), Expect = 3.5
Identities = 26/92 (28%), Positives = 43/92 (46%), Gaps = 4/92 (4%)
Query: 276 YLSLTKESLE----KQYAEKVDQFGNRQKTKLPKWFGERPGKKKDAPESPEEEEVKRDDE 331
+L LT+E +E + A K + + K + + KK A + EEEE + ++
Sbjct: 391 FLELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEK 450
Query: 332 EEENPIYLKISDDEEEKEEENFMSNNATAEDF 363
+EE + +EE++EEE AT DF
Sbjct: 451 KEEEKEEEEEEAEEEKEEEEEKKKKQATLFDF 482
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 34.8 bits (80), Expect = 0.083
Identities = 31/167 (18%), Positives = 68/167 (40%), Gaps = 7/167 (4%)
Query: 57 KVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
E++ + ++ +E+E+ ++E+E++ + E +R + E EK+ Q + ALK++
Sbjct: 523 LEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVE 582
Query: 117 KSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEK-------KIALNIRIKPLNVEGWS 169
+ + S L + K +Q +K KI +RI+ +G
Sbjct: 583 SIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQKGKI 642
Query: 170 IHKLQAKATELWETIVKLETDKYDLEERQKRQDYDVSKIIINKKIMP 216
+ L + ++++ +LE+ K K+ K P
Sbjct: 643 VQILGGNKWNVTVGGMRMKVHGSELEKINKAPPPKKFKVPKTTKPEP 689
Score = 28.6 bits (64), Expect = 7.0
Identities = 30/171 (17%), Positives = 65/171 (38%), Gaps = 25/171 (14%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
K E K+ L++ LKE ++ K+E + + + K+ +R K
Sbjct: 522 ALEKELEQKNEHLEKLLKE------QEKLKKELEQEMEELKERERN-------------K 562
Query: 72 KEEEERRQREVEEKKQRDLEEK----RQRLEEAEKKRQAMMAALKEQTNKSKGPN--FTI 125
K E E+ +E + ++++E +++ K+ +++ +K + K K P
Sbjct: 563 KLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNF 622
Query: 126 QRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAK 176
Q + G + + QI K + + + V G + K+
Sbjct: 623 QADKIGDKVRIRYFGQKGKIVQILGGNKWNVTVGGMRMKVHGSELEKINKA 673
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 34.4 bits (79), Expect = 0.091
Identities = 17/101 (16%), Positives = 50/101 (49%)
Query: 17 QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
Q+ KS+ E+ ++ + + + ++++ ++ + KQ +++ + A+E++K A+ ++
Sbjct: 69 QQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAA 128
Query: 77 RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNK 117
+Q++ EE + + + E K+ A + K
Sbjct: 129 LKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKK 169
Score = 33.2 bits (76), Expect = 0.19
Identities = 23/82 (28%), Positives = 35/82 (42%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
AK K E +AKR A++ A++K E E ++ E KK+ + E +
Sbjct: 137 AAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAA 196
Query: 98 EEAEKKRQAMMAALKEQTNKSK 119
EA+KK +A K K
Sbjct: 197 AEAKKKAEAEAKKKAAAEAKKK 218
Score = 32.1 bits (73), Expect = 0.49
Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 3/103 (2%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQR- 70
E +K+ E + A EQ K+ E Q A +++ + K A +AE E K A
Sbjct: 101 ERLKQLEKERLAAQEQKKQAE-EAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAA 159
Query: 71 -KKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
KK E +++ E ++ E +++ E + A A K
Sbjct: 160 AKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK 202
Score = 29.0 bits (65), Expect = 3.9
Identities = 19/82 (23%), Positives = 32/82 (39%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
++A E K E K ++ + A K E +++ E E KK+ E K++
Sbjct: 161 KKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220
Query: 98 EEAEKKRQAMMAALKEQTNKSK 119
EA+ A K K+
Sbjct: 221 AEAKAAAAKAAAEAKAAAEKAA 242
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 33.8 bits (77), Expect = 0.12
Identities = 26/103 (25%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 15 KRQEAKSSALDEQLKEYIV--EWRNQRAKEEEDLKRLKEKQ---AKRKVMRAEEEQKMAQ 69
++ AK + KE V E + ++A E+E LK+L++++ +++ E E++
Sbjct: 70 QQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQL 129
Query: 70 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
+K++EE+ ++ E+K++ K + EA K + A A K
Sbjct: 130 EQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKK 172
Score = 31.1 bits (70), Expect = 0.89
Identities = 25/109 (22%), Positives = 56/109 (51%), Gaps = 4/109 (3%)
Query: 6 AGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ 65
GGG + A+ +Q + + + E++ K+ ++ + K +A E++
Sbjct: 42 GGGGGGSVIDAVMVDPGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQE 101
Query: 66 KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
++ Q +KE R + E++KQ + EK+ +LE+ +++ QA AA +++
Sbjct: 102 RLKQLEKE----RLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146
Score = 29.5 bits (66), Expect = 3.2
Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 4/107 (3%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEE-EDLKRLKEKQAKRKVMRAEEEQKM--- 67
E +Q A+ L + KE + Q+ EE E +L++KQ + + +A EQK
Sbjct: 91 ELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAE 150
Query: 68 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
A + K E + + + ++ EE + EEA+ K +A A K +
Sbjct: 151 AAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAE 197
Score = 28.4 bits (63), Expect = 7.1
Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 2/80 (2%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
Q+ K E + + AK K A E +K A+ + E + + E + E +
Sbjct: 145 QKKKAEAAKAKAAAEAAKLKA--AAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKA 202
Query: 98 EEAEKKRQAMMAALKEQTNK 117
+ K +A A E+ +
Sbjct: 203 AAEKAKAEAEAKAKAEKKAE 222
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 33.6 bits (77), Expect = 0.12
Identities = 18/82 (21%), Positives = 45/82 (54%)
Query: 34 EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEK 93
W++ +++E + + + K++ + + MAQ ++ +E+RQ + E ++ +
Sbjct: 44 RWQDAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEAREDVATAR 103
Query: 94 RQRLEEAEKKRQAMMAALKEQT 115
+ LE+ E+++Q AL++QT
Sbjct: 104 DEWLEQLEREKQEFFKALQQQT 125
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 34.1 bits (78), Expect = 0.12
Identities = 19/86 (22%), Positives = 36/86 (41%)
Query: 48 RLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAM 107
L + A+ K R E+ + RKKE ++R+ E + +E+ + A +R A+
Sbjct: 324 ELHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAV 383
Query: 108 MAALKEQTNKSKGPNFTIQRREPGLT 133
A K + S + + E +
Sbjct: 384 KAKKKGLIDASPNEDTPSENEESKGS 409
Score = 31.0 bits (70), Expect = 1.2
Identities = 12/84 (14%), Positives = 31/84 (36%), Gaps = 1/84 (1%)
Query: 40 AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK-KEEEERRQREVEEKKQRDLEEKRQRLE 98
K E ++ + Q +R R + +R R +R + K++ L + +
Sbjct: 339 EKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNED 398
Query: 99 EAEKKRQAMMAALKEQTNKSKGPN 122
+ ++ + + + + PN
Sbjct: 399 TPSENEESKGSPPQVEATTTAEPN 422
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 33.0 bits (76), Expect = 0.13
Identities = 18/59 (30%), Positives = 36/59 (61%), Gaps = 5/59 (8%)
Query: 70 RKKEEEERRQREVEEKKQRDLEE--KRQRLEEAEKKRQAMMAALKE-QTNKSKGPNFTI 125
RK E+ ++R++E+ +K+ R+L + + ++ EK+++ +M KE + K PNF I
Sbjct: 40 RKMEKYQKREKEI-QKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDMMK-PNFAI 96
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.9 bits (77), Expect = 0.14
Identities = 20/55 (36%), Positives = 37/55 (67%), Gaps = 6/55 (10%)
Query: 52 KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
K AK++ EE + A+R+ E++ R +RE E++K+++ E +R+R EAE+ +A
Sbjct: 576 KLAKKR----EEAVEKAKREAEQKAREEREREKEKEKERERERER--EAERAAKA 624
Score = 33.5 bits (76), Expect = 0.18
Identities = 16/47 (34%), Positives = 35/47 (74%), Gaps = 2/47 (4%)
Query: 66 KMAQRKKEEEERRQREVEEK--KQRDLEEKRQRLEEAEKKRQAMMAA 110
K+A++++E E+ +RE E+K ++R+ E+++++ E E++R+A AA
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAA 622
Score = 28.5 bits (63), Expect = 6.5
Identities = 11/40 (27%), Positives = 26/40 (65%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRD 89
K ++A K R E++ +R++E+E+ ++RE E +++ +
Sbjct: 580 KREEAVEKAKREAEQKAREEREREKEKEKEREREREREAE 619
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 34.1 bits (79), Expect = 0.14
Identities = 25/142 (17%), Positives = 49/142 (34%), Gaps = 3/142 (2%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
E K S L E+LKE +E ++ + EE+ R +E + + + E ++ A +
Sbjct: 8 ELAKELGVSSKELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAE 67
Query: 72 KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
EEE + + + E R A A + ++ P ++
Sbjct: 68 AEEEAKAEAAAAAPAE---EAAEAAAAAEAAARPAEDEAARPAEAAARRPKAKKAAKKKK 124
Query: 132 LTLSTAQLERNKTKEQIEEEKK 153
+ + K + K
Sbjct: 125 GPKPKKKKPKRKAARGGKRGKG 146
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 34.0 bits (78), Expect = 0.16
Identities = 33/155 (21%), Positives = 70/155 (45%), Gaps = 5/155 (3%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKEKQ-----AKRKVMRAEEEQKMAQRKKEEEERRQR 80
E+L+E + R + EE L++LK + + K+ + E E + +K E +
Sbjct: 301 EELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLE 360
Query: 81 EVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLE 140
E ++ + LEE + LE+A ++ + + A++E + + ++ + L +LE
Sbjct: 361 ERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELE 420
Query: 141 RNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQA 175
+ + + EE+ L +I L + I +L
Sbjct: 421 ELERELEELEEEIKKLEEQINQLESKELMIAELAG 455
Score = 33.6 bits (77), Expect = 0.20
Identities = 40/203 (19%), Positives = 85/203 (41%), Gaps = 7/203 (3%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEED-----LKRLKEKQAKRKVMRAEEEQKMA 68
+K E + S L+E L+ + +E E+ L++L + ++++A E+
Sbjct: 602 LKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEE 661
Query: 69 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 128
+ ++ E E R+ + + LEEK + LE+ E++ + + L+E K I+
Sbjct: 662 KVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEEL 721
Query: 129 EPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKAT-ELWETIVKL 187
E + + E+ E + + + + L A+ E E + KL
Sbjct: 722 ESRKAELEELKKELEKLEKALELLEELREK-LGKAGLRADILRNLLAQIEAEANEILSKL 780
Query: 188 ETDKYDLEERQKRQDYDVSKIII 210
++YDL R+D + +++
Sbjct: 781 SLNRYDLRRLTIRKDGNGGLVVV 803
Score = 30.9 bits (70), Expect = 1.3
Identities = 43/189 (22%), Positives = 88/189 (46%), Gaps = 5/189 (2%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
E +K +AK L+ QL E + + + EE+LK LK+ + ++ E+E++ +++
Sbjct: 178 EVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQE----EQEEEELEQE 233
Query: 72 KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
E E R E+EE+K+R LEE + RL E E + +E+ + + ++ +
Sbjct: 234 IEALEERLAELEEEKER-LEELKARLLEIESLELEALKIREEELRELERLLEELEEKIER 292
Query: 132 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 191
L ++E + + + L ++ L + KL+ K +L + +L +K
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEK 352
Query: 192 YDLEERQKR 200
+L + +
Sbjct: 353 NELAKLLEE 361
Score = 30.1 bits (68), Expect = 2.0
Identities = 34/168 (20%), Positives = 77/168 (45%), Gaps = 5/168 (2%)
Query: 34 EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEK 93
+ + ++ K ++ + + EE ++ Q ++ +EE RQ E ++ ++L E+
Sbjct: 518 IELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEE 577
Query: 94 RQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKK 153
+ L +++ + + LKE K K ++ R L LE ++ + ++EE ++
Sbjct: 578 LRLLRTRKEELEELRERLKELKKKLK----ELEERLSQLEELLQSLELSEAENELEEAEE 633
Query: 154 IALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQ 201
+ + LN++ LQA EL E + +LE + +R + +
Sbjct: 634 ELESEL-EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENE 680
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 33.5 bits (77), Expect = 0.18
Identities = 18/81 (22%), Positives = 42/81 (51%), Gaps = 1/81 (1%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKE-KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE 84
E+L+E I + Q + ++ + + KRK+ E + +E++++++ EE
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEE 339
Query: 85 KKQRDLEEKRQRLEEAEKKRQ 105
KK++ +E +R+E+ E +
Sbjct: 340 KKKKQIERLEERIEKLEVQAT 360
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 33.7 bits (77), Expect = 0.20
Identities = 25/97 (25%), Positives = 50/97 (51%), Gaps = 10/97 (10%)
Query: 15 KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLK--EKQAKRKVMRAEEEQKMAQRKK 72
+++ L+ + K + ++AKE+E LK+LK +K+AK K+ + KK
Sbjct: 9 EKKILTEEELERKKK------KEEKAKEKE-LKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
Query: 73 EEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
E++ R+R+VE++ D + + +K+ + MA
Sbjct: 62 SEKKSRKRDVEDENPEDFIDPDTPFGQ-KKRLSSQMA 97
Score = 31.0 bits (70), Expect = 1.1
Identities = 15/67 (22%), Positives = 36/67 (53%), Gaps = 6/67 (8%)
Query: 10 DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
E ++R++ K +E+ KE E + +A ++E +L+ +QA ++ +K ++
Sbjct: 14 TEEELERKKKK----EEKAKEK--ELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKKSR 67
Query: 70 RKKEEEE 76
++ E+E
Sbjct: 68 KRDVEDE 74
Score = 29.1 bits (65), Expect = 5.3
Identities = 10/52 (19%), Positives = 27/52 (51%)
Query: 66 KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNK 117
+ + E++ + E+E KK+++ + K + L++ + ++ A L+ Q
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQAS 53
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 33.1 bits (76), Expect = 0.20
Identities = 15/68 (22%), Positives = 27/68 (39%)
Query: 39 RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLE 98
R E K L++ K+ AEEE+ KEE ++ E++ ++ E + +
Sbjct: 5 REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64
Query: 99 EAEKKRQA 106
E
Sbjct: 65 SGEGGGGE 72
Score = 32.3 bits (74), Expect = 0.40
Identities = 17/109 (15%), Positives = 39/109 (35%), Gaps = 5/109 (4%)
Query: 49 LKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMM 108
LKE + + E + K E E + E + L+E+ +L+ + + ++
Sbjct: 1 LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDA-----LKEEIDKLDAEIDRLEELL 55
Query: 109 AALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALN 157
L+ + S + E + + + E++ AL+
Sbjct: 56 DELEAKPAASGEGGGGEEEEEEAKAEAAEFRAYLRGGDDALAEERKALS 104
Score = 28.9 bits (65), Expect = 5.4
Identities = 22/87 (25%), Positives = 35/87 (40%), Gaps = 4/87 (4%)
Query: 23 ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREV 82
L E L E E R E+E L +E++A+ ++ E ++ + + E E+
Sbjct: 3 ELREALAELAKELRK--LTEDEKLAEAEEEKAEYDALKEEIDK--LDAEIDRLEELLDEL 58
Query: 83 EEKKQRDLEEKRQRLEEAEKKRQAMMA 109
E K E EE E K +A
Sbjct: 59 EAKPAASGEGGGGEEEEEEAKAEAAEF 85
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 32.7 bits (75), Expect = 0.20
Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 2/53 (3%)
Query: 60 RAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
+ +E QKM + ++E Q + KK + L+EK+ + + +R+ M K
Sbjct: 73 KMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEM--MDDQRELMKMQFK 123
Score = 28.5 bits (64), Expect = 4.9
Identities = 15/43 (34%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRN-QRAKEEEDLKRLKEKQ 53
+ + QE K L + +KE+ E+R Q + + + LK+L+EKQ
Sbjct: 66 KLLIDQE-KMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQ 107
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 33.6 bits (76), Expect = 0.21
Identities = 14/56 (25%), Positives = 32/56 (57%)
Query: 49 LKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKR 104
+K++ K ++ EE+++M ++ +EE ++ +E++QR + +E KKR
Sbjct: 1017 IKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKEMAKKR 1072
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 32.8 bits (75), Expect = 0.23
Identities = 19/71 (26%), Positives = 41/71 (57%), Gaps = 1/71 (1%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
++++ + ++ + EE+ + AQ++ EE E E+EEK ++ EE+ Q LE+ + +
Sbjct: 6 EQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQ-EEEEAQLLEKKADELEEENR 64
Query: 110 ALKEQTNKSKG 120
L+E+ S+
Sbjct: 65 RLEEEAAASEE 75
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 33.4 bits (77), Expect = 0.24
Identities = 17/98 (17%), Positives = 48/98 (48%), Gaps = 6/98 (6%)
Query: 11 PEFV--KRQEAKSSALDEQ---LKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQ 65
FV + E AL ++ LK+ + ++A+ + + +++ + + AE E+
Sbjct: 132 GPFVPPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEE 191
Query: 66 KMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
K + + + E+ ++ + E Q +++++ ++A K+
Sbjct: 192 KQQELEAQLEQLQE-KAAETSQERKQKRKEITDQAAKR 228
Score = 30.7 bits (70), Expect = 1.8
Identities = 20/85 (23%), Positives = 45/85 (52%), Gaps = 6/85 (7%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
E R++A+S AL E ++ +V A E E+ + +E +A+ + ++ + + +RK
Sbjct: 159 ELQAREKAQSQALAEAQQQELVAL-EGLAAELEE--KQQELEAQLEQLQEKAAETSQERK 215
Query: 72 KEEEERRQREVEEKKQRDLEEKRQR 96
++R++ + K+ +L E+ R
Sbjct: 216 ---QKRKEITDQAAKRLELSEEETR 237
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 33.1 bits (77), Expect = 0.24
Identities = 18/82 (21%), Positives = 36/82 (43%), Gaps = 13/82 (15%)
Query: 42 EEEDLKRLKEKQAKRKVMRAEEEQKMAQR------------KKEEEERRQREVEEKKQRD 89
D+ L E +R+ ++ E E+ A+R K E+ E EV+E K+
Sbjct: 23 FPLDVDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEE- 81
Query: 90 LEEKRQRLEEAEKKRQAMMAAL 111
++ L+E E + + ++ +
Sbjct: 82 IKALEAELDELEAELEELLLRI 103
>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631). The
members of this family are sequences derived from a
group of hypothetical proteins expressed by certain
bacterial species. The region concerned is approximately
440 amino acid residues in length.
Length = 729
Score = 33.0 bits (76), Expect = 0.28
Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 3/42 (7%)
Query: 74 EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQT 115
E+ERR+ E+ E++ R+ EE R RLE A RQ + L+++
Sbjct: 459 EQERRRSELVEQRTREAEEGRARLELA---RQQVQQELEQRL 497
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 32.7 bits (75), Expect = 0.31
Identities = 16/82 (19%), Positives = 39/82 (47%), Gaps = 5/82 (6%)
Query: 41 KEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 100
K + + +A+ E ++K ++ +E+E+R E+ +++L E+R+ L
Sbjct: 296 KNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKR-----LEELEQNLIEERKELNSK 350
Query: 101 EKKRQAMMAALKEQTNKSKGPN 122
++ Q + L+++ K K
Sbjct: 351 LEEIQKKLEDLEKRLEKLKSNK 372
Score = 27.7 bits (62), Expect = 9.8
Identities = 16/69 (23%), Positives = 36/69 (52%), Gaps = 1/69 (1%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE-AEKKRQAMM 108
EK + K +++ + + EEER ++ +K R+ E++ + LE+ ++R+ +
Sbjct: 289 TEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNLIEERKELN 348
Query: 109 AALKEQTNK 117
+ L+E K
Sbjct: 349 SKLEEIQKK 357
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 33.0 bits (76), Expect = 0.31
Identities = 19/70 (27%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 53 QAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE---AEKKRQAMMA 109
+ R++++ E E++ +++K+E + + E EK+ +LEE+ LEE AEK +
Sbjct: 409 ELDRRIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQ 468
Query: 110 ALKEQTNKSK 119
+KE+ + +
Sbjct: 469 QIKEEIEQVR 478
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 32.6 bits (75), Expect = 0.32
Identities = 21/92 (22%), Positives = 40/92 (43%), Gaps = 7/92 (7%)
Query: 49 LKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMM 108
L++K + + A E++ + KE+E R+ + K + + + +RLE+ +
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEK-------LE 291
Query: 109 AALKEQTNKSKGPNFTIQRREPGLTLSTAQLE 140
A L E+ +G + PG L LE
Sbjct: 292 ARLAEERPVEEGKPLAFRFPPPGKRLGKLVLE 323
Score = 29.9 bits (68), Expect = 2.2
Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 12/77 (15%)
Query: 39 RAKEEEDLKRLKEKQAKRKVMRAEEE-----------QKMAQ-RKKEEEERRQREVEEKK 86
K E + + ++K + E+E K A+ R K E+ R EE+
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIKRLEKLEARLAEERP 299
Query: 87 QRDLEEKRQRLEEAEKK 103
+ + R K+
Sbjct: 300 VEEGKPLAFRFPPPGKR 316
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 31.8 bits (73), Expect = 0.38
Identities = 19/70 (27%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 52 KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR--QRLEEAEKKRQAMMA 109
K + + ++E EQ++ + ++E+EE +R E + + + EKR + + EK+ +A
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173
Query: 110 ALKEQTNKSK 119
LK+Q + K
Sbjct: 174 FLKKQNQQLK 183
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 32.5 bits (74), Expect = 0.38
Identities = 41/156 (26%), Positives = 67/156 (42%), Gaps = 26/156 (16%)
Query: 57 KVMRAEEEQKMAQRKKEEEERR-----QREVEEKKQR------DLEEKRQRLEEAEKKRQ 105
K++R + E KM + +E E Q+ +E+ +R L QR+ E +++ +
Sbjct: 460 KLIRLQHENKMLRLGQEGSENERITELQQLLEDANRRNNELETQLRLANQRILELQQQVE 519
Query: 106 AMMAALKEQTNKSKGPNFTIQRREPGL-TLSTAQLERNKTKEQIEEEKKIALNIRIKPLN 164
+ AL+EQ +KS+ + + E L L A E K +EQIEE +
Sbjct: 520 DLQKALQEQGSKSEDSSLLKSKLEEHLEQLHEANEELQKKREQIEELEPDQDQ------- 572
Query: 165 VEGWSIHKLQAKATELWETIVKLETDKYDLEERQKR 200
L K EL + K + D +EER K+
Sbjct: 573 -------NLSRKIAELEAALQKKDEDMRAMEERYKK 601
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 32.4 bits (74), Expect = 0.38
Identities = 21/85 (24%), Positives = 47/85 (55%), Gaps = 6/85 (7%)
Query: 8 GGDPEFVKRQEAKSSALDEQLKEYIV---EWRNQRAKEEE---DLKRLKEKQAKRKVMRA 61
GGDPE ++ + + A E + E I EWR ++ + + + +L ++ AK K+ +
Sbjct: 12 GGDPELIRESQRRRFASVELVDEVIALDKEWRQRQFELDSLRKEFNKLNKEVAKLKIAKE 71
Query: 62 EEEQKMAQRKKEEEERRQREVEEKK 86
+ + +A+ K+ ++E ++E E ++
Sbjct: 72 DATELIAETKELKKEITEKEAEVQE 96
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 32.7 bits (74), Expect = 0.39
Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 8/148 (5%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
E + RQ + L+E+++ Q+ + E + L+ K RAE E A
Sbjct: 66 ELISRQLQELRRLEEEVRSLRETSLQQKMRLEAQAEELEALAVAEKAGRAEAEGLRAALA 125
Query: 72 KEEEERRQREVEEKKQRDLEE-KRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ---- 126
EE R+ +EE Q++LEE +R E+ QA + AL +K++G ++Q
Sbjct: 126 GAEEVRKN--LEEGGQQELEEAQRLHQEQLSSLTQAHLEALSSLRSKAEGLEKSLQSLET 183
Query: 127 -RREPGLTLSTAQLERNKTKEQIEEEKK 153
R L+ AQ E + +EQ+ + ++
Sbjct: 184 RRAGEAKALAAAQAEADTLREQLSKTQE 211
Score = 31.9 bits (72), Expect = 0.54
Identities = 30/109 (27%), Positives = 53/109 (48%), Gaps = 1/109 (0%)
Query: 3 TDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAE 62
T+ A E ++QE AL E++ E R Q + E L + + AK V +
Sbjct: 555 TEEAASLRQELTQQQEVYGQALQEKVAEVETRLREQLSDTERRLNEARREHAKAVVSLRQ 614
Query: 63 EEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 111
+++ AQ K+ +E R R EE ++ + + +RL+E E+ + M+ L
Sbjct: 615 IQRQAAQEKERNQELR-RLQEEARKEEGQRLSRRLQELERDKNLMLQRL 662
>gnl|CDD|223024 PHA03252, PHA03252, DNA packaging tegument protein UL25;
Provisional.
Length = 589
Score = 32.4 bits (74), Expect = 0.39
Identities = 15/59 (25%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 43 EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
EEDL+RL++ + ++ R E+ + + +++++ Q + E+ R R+ EAE
Sbjct: 28 EEDLRRLRD-DSALRLRRYREDLLRDRLLRRRLGEELDDLQKRLQTECEDLRSRVSEAE 85
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 32.2 bits (74), Expect = 0.48
Identities = 30/113 (26%), Positives = 56/113 (49%), Gaps = 16/113 (14%)
Query: 43 EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEK 102
E D++ + K+K+ E E K+ Q + E+ + ++++ + E+ +Q+L +A
Sbjct: 38 EADVQAQLDALNKQKL--LEAEDKLVQ-QDLEQTLALLDKIDRQKEETEQLKQQLAQAPA 94
Query: 103 K-RQAM--MAALKEQTNKSKGPNFTIQRREPGLTLSTAQLER--NKTKEQIEE 150
K RQA + ALK+ ++ RE TLS QLE +T +Q++
Sbjct: 95 KLRQAQAELEALKDDNDEET--------RETLSTLSLRQLESRLAQTLDQLQN 139
>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein. This family
consists of several eukaryotic paraflagellar rod
component proteins. The eukaryotic flagellum represents
one of the most complex macromolecular structures found
in any organism and contains more than 250 proteins. In
addition to its locomotive role, the flagellum is
probably involved in nutrient uptake since receptors for
host low-density lipoproteins are localised on the
flagellar membrane as well as on the flagellar pocket
membrane.
Length = 289
Score = 31.9 bits (73), Expect = 0.48
Identities = 20/85 (23%), Positives = 47/85 (55%), Gaps = 2/85 (2%)
Query: 25 DEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEE 84
+ + I Q++ EEDL+R KEK+ + A++ ++ + +++E E+ Q+ VE+
Sbjct: 1 TKDISAVIEAKHRQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQ 60
Query: 85 KKQ--RDLEEKRQRLEEAEKKRQAM 107
++ R ++E + L + ++R+
Sbjct: 61 QQACWRAIQELERELRDLAEERREE 85
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 31.9 bits (72), Expect = 0.53
Identities = 29/84 (34%), Positives = 51/84 (60%), Gaps = 3/84 (3%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQ 95
+ ++ + +L+ LK+K+ +R+ + EEEQ+ RK+EE +R+ RE EEK++ E +R+
Sbjct: 206 KQKQQEAALELEELKKKREERRKVLEEEEQR---RKQEEADRKSREEEEKRRLKEEIERR 262
Query: 96 RLEEAEKKRQAMMAALKEQTNKSK 119
R E AEK+++ L E K
Sbjct: 263 RAEAAEKRQKVPEDGLSEDKKPFK 286
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 31.0 bits (71), Expect = 0.57
Identities = 18/60 (30%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 38 QRAKEEEDLKRLKEKQAKRKVMRAEEEQKM-AQRKKEEEERRQREVEEKKQRDLEEKRQR 96
++AK+ + KR + KQA++ ++E K A+ K E+ R RE+ ++Q + E+K
Sbjct: 15 KKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQKAIV 74
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 31.2 bits (71), Expect = 0.63
Identities = 22/65 (33%), Positives = 42/65 (64%)
Query: 42 EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
+ +L+EKQA+R+ AEEE++ ++K EE+ +R+ EE+ + + E+K++ E E
Sbjct: 4 GAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKE 63
Query: 102 KKRQA 106
++ QA
Sbjct: 64 REEQA 68
Score = 30.8 bits (70), Expect = 0.79
Identities = 22/57 (38%), Positives = 38/57 (66%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEE 92
R QR EEE+ + K+ + KR+ R EEE+ +R+K++EE ++E EE+ +++ EE
Sbjct: 18 RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEE 74
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 31.9 bits (73), Expect = 0.65
Identities = 19/98 (19%), Positives = 46/98 (46%), Gaps = 6/98 (6%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKM---AQRKKEEEERRQREVEEKKQRDLEE 92
+ +E + R +QA+++ E Q+ + + ++E+++ E +++R+ +E
Sbjct: 628 TRREGRENREENRRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRN-DE 686
Query: 93 KRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREP 130
KRQ +EA K + ++T + + RR+
Sbjct: 687 KRQAQQEA--KALNVEEQSVQETEQEERVQQVQPRRKQ 722
Score = 30.0 bits (68), Expect = 2.6
Identities = 21/68 (30%), Positives = 35/68 (51%), Gaps = 6/68 (8%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQ 95
R +R + D KR + Q + K + EE+ E+EER Q+ +KQR L +K
Sbjct: 676 RRERQRRRNDEKR--QAQQEAKALNVEEQSVQET---EQEERVQQVQPRRKQRQLNQK-V 729
Query: 96 RLEEAEKK 103
R+E++ +
Sbjct: 730 RIEQSVAE 737
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 31.5 bits (72), Expect = 0.66
Identities = 22/127 (17%), Positives = 48/127 (37%), Gaps = 8/127 (6%)
Query: 17 QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
+ + L + + R + E + RL+ A+ A E + + +E +
Sbjct: 72 ERQGNQDLQDSV----ANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDS 127
Query: 77 RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLST 136
+Q V + +E Q++ ++ A+ AAL + + I + G L+
Sbjct: 128 EKQ--VSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKIA--DLGRRLNV 183
Query: 137 AQLERNK 143
A +R +
Sbjct: 184 ALAQRVQ 190
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.9 bits (72), Expect = 0.69
Identities = 18/45 (40%), Positives = 25/45 (55%), Gaps = 8/45 (17%)
Query: 308 GERPGKKKDAPESPEEEEVKRDDEEEENPIYLKISDDEEEKEEEN 352
G D+ E EEEE + ++EEEE ++EEE+EEEN
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEE--------EEEEEEEEEN 890
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 31.6 bits (72), Expect = 0.71
Identities = 20/126 (15%), Positives = 42/126 (33%), Gaps = 19/126 (15%)
Query: 5 VAGGGDPEFVKR----------QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQA 54
V GG + + Q + S + +K + N + E ++ +
Sbjct: 826 VPSGGGSQDIIADHQGHQAIIEQRTQDSGIRNDVKHQV---DNMVTEYEGNIGDTQNS-- 880
Query: 55 KRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
+R EE Q + + + + + + + E+ Q + +M KE
Sbjct: 881 ----IRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQERMPGADSPEELMKRAKEY 936
Query: 115 TNKSKG 120
+K KG
Sbjct: 937 QDKHKG 942
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 30.2 bits (69), Expect = 0.74
Identities = 10/78 (12%), Positives = 44/78 (56%)
Query: 40 AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
+ ++ L++ + + + ++ + ++ A + E++++E+++K Q ++++ ++
Sbjct: 25 KEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQD 84
Query: 100 AEKKRQAMMAALKEQTNK 117
+K++Q + + ++ NK
Sbjct: 85 LQKRQQEELQKILDKINK 102
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 31.4 bits (72), Expect = 0.79
Identities = 24/112 (21%), Positives = 47/112 (41%), Gaps = 26/112 (23%)
Query: 10 DPEFVKR----------QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVM 59
+P+ V+ K LDE+ ++ + E L+E QA+R +
Sbjct: 10 NPDAVREKLKKRGGDALDVDKLLELDEERRKLLRE--------------LEELQAERNEL 55
Query: 60 RAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 111
E + + + + + EE EV+E K+ L+E L+E E + ++ +
Sbjct: 56 SKEIGRALKRGEDDAEELIA-EVKELKE-KLKELEAALDELEAELDTLLLTI 105
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 31.6 bits (72), Expect = 0.80
Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 7/74 (9%)
Query: 43 EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRD-----LEEKRQRL 97
E + + + + K R ++EQ++AQR + + + E+EE + LE ++QRL
Sbjct: 164 RELYQAWLKARQQLK-DRQQKEQELAQRL-DFLQFQLEELEEADLQPGEDEALEAEQQRL 221
Query: 98 EEAEKKRQAMMAAL 111
EK R+ AL
Sbjct: 222 SNLEKLRELSQNAL 235
>gnl|CDD|151935 pfam11498, Activator_LAG-3, Transcriptional activator LAG-3. The
C.elegans Notch pathway, involved in the control of
growth, differentiation and patterning in animal
development, relies on either of the receptors GLP-1 or
LIN-12. Both these receptors promote signalling by the
recruitment of LAG-3 to target promoters, where it then
acts as a transcriptional activator. LAG-3 works as a
ternary complex together with the DNA binding protein,
LAG-1.
Length = 476
Score = 31.1 bits (69), Expect = 0.91
Identities = 21/109 (19%), Positives = 57/109 (52%), Gaps = 5/109 (4%)
Query: 47 KRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
+ + + ++ MR ++Q+M ++ E++ ++Q + ++Q+ E ++Q++ ++Q
Sbjct: 320 QHIAQLAQQQNKMRLLQQQEMEMQRIEQQRQQQIMHQHQQQQQQEHQQQQM---LLQQQQ 376
Query: 107 MMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIA 155
M L++ + G F Q + L Q++ + +EQI+ +++ A
Sbjct: 377 QMHQLQQHHQMNGGGQFATQAHQHAAYLQ--QMQHMRLQEQIQHQQQQA 423
>gnl|CDD|129911 TIGR00831, a_cpa1, Na+/H+ antiporter, bacterial form. The
Monovalent Cation:Proton Antiporter-1 (CPA1) Family (TC
2.A.36) The CPA1 family is a large family of proteins
derived from Gram-positive and Gram-negative bacteria,
blue green bacteria, yeast, plants and animals.
Transporters from eukaryotes have been functionally
characterized, and all of these catalyze Na+:H+
exchange. Their primary physiological functions may be
in (1) cytoplasmic pH regulation, extruding the H+
generated during metabolism, and (2) salt tolerance (in
plants), due to Na+ uptake into vacuoles. This model is
specific for the bacterial members of this family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 525
Score = 31.4 bits (71), Expect = 0.92
Identities = 25/123 (20%), Positives = 45/123 (36%), Gaps = 25/123 (20%)
Query: 38 QRAKEEEDLKRLKEKQ---------AKRKVMRAEEEQKMA-----QRKKEEEER---RQR 80
+R E +R E+ AK +M+A E+ ++ + E + R +
Sbjct: 405 KRKFVSEHSERELEEIIARYIAARSAKFALMKAVEQLRIVEPVARELLPELDARIEELRA 464
Query: 81 EVEEKKQRDLEE----KRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLST 136
+ EEK + + E +R RL + KR A++ + L L
Sbjct: 465 DGEEKIRSGMGEKNLRRRARLYVLDAKRSAVVDLR----AGGLISQEVLLELMRELDLKE 520
Query: 137 AQL 139
A+L
Sbjct: 521 AEL 523
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 30.3 bits (69), Expect = 1.0
Identities = 20/81 (24%), Positives = 44/81 (54%)
Query: 39 RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLE 98
+ + E+ K L+E + + ++A+ E+ R++ EE E ++ +++L++ + LE
Sbjct: 68 KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELE 127
Query: 99 EAEKKRQAMMAALKEQTNKSK 119
+ EK + LKE+T +K
Sbjct: 128 KYEKNDPERIEKLKEETKVAK 148
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 30.1 bits (68), Expect = 1.1
Identities = 22/80 (27%), Positives = 49/80 (61%), Gaps = 2/80 (2%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMA--QRKKEEEERRQREVEEKKQRDLEEK 93
++ + K+ E+ K L+++ A++++ R E + ++A +RK E +R+ ++EKK+ + +E
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161
Query: 94 RQRLEEAEKKRQAMMAALKE 113
+ EA K +Q ++ A E
Sbjct: 162 ARLKSEASKIKQELVDAELE 181
Score = 27.8 bits (62), Expect = 6.4
Identities = 16/83 (19%), Positives = 34/83 (40%), Gaps = 6/83 (7%)
Query: 44 EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
L LK+ Q K+ R E+++A+++ +E + + +LE K L+
Sbjct: 96 NKLLSLKDDQTKKLEERKGLEKEIAEKEISRQEL------DSEIAELERKILELQRQAAL 149
Query: 104 RQAMMAALKEQTNKSKGPNFTIQ 126
+ A ++ + K I+
Sbjct: 150 LKEKKEAEDKEIARLKSEASKIK 172
>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B. A
small number of taxonomically diverse prokaryotic
species, including Methanosarcina barkeri, have what
appears to be a second ATP synthase, in addition to the
normal F1F0 ATPase in bacteria and A1A0 ATPase in
archaea. These enzymes use ion gradients to synthesize
ATP, CC and in principle may run in either direction.
This model represents the F0 subunit B of this apparent
second ATP synthase.
Length = 246
Score = 30.4 bits (69), Expect = 1.3
Identities = 20/93 (21%), Positives = 47/93 (50%)
Query: 23 ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREV 82
A+D + K+ E + K+ E + +E + K + + + E + + K+E + RQR +
Sbjct: 33 AMDAREKKIAGELADADTKKREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLL 92
Query: 83 EEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQT 115
+E ++ E + + E +++ A+ L+ +T
Sbjct: 93 DEAREEADEIREKWQEALRREQAALSDELRRRT 125
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 30.9 bits (69), Expect = 1.3
Identities = 22/85 (25%), Positives = 50/85 (58%), Gaps = 3/85 (3%)
Query: 20 KSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQ 79
K +A + L+ I+E + E E+ +RL+ ++ +R + R E++ +R++ E +R +
Sbjct: 442 KENAHRKALEMKILEKKRIERLEREERERLERERMER-IERERLERERLERERLERDRLE 500
Query: 80 REVEEKKQRDLEEK--RQRLEEAEK 102
R+ ++ +R+ ++ R RLE+A +
Sbjct: 501 RDRLDRLERERVDRLERDRLEKARR 525
Score = 30.1 bits (67), Expect = 2.5
Identities = 21/67 (31%), Positives = 41/67 (61%), Gaps = 3/67 (4%)
Query: 41 KEEEDLKRLKEKQAKRKV--MRAEEEQKMAQRKKEEEERRQREVEEKKQRD-LEEKRQRL 97
K+ + R++++ A RK M+ E++++ + ++EE ER +RE E+ +R+ LE +R
Sbjct: 432 KDHAERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLER 491
Query: 98 EEAEKKR 104
E E+ R
Sbjct: 492 ERLERDR 498
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 30.8 bits (70), Expect = 1.3
Identities = 37/148 (25%), Positives = 58/148 (39%), Gaps = 23/148 (15%)
Query: 9 GDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMA 68
GD V R+ KS L E E LKE+ K + +E
Sbjct: 189 GDLTNVLRRRKKSELPSSILSEI-----------EALEAELKEQSEKYED--LAQEIAHL 235
Query: 69 QRKKEEEERRQREVEEKKQR---DLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTI 125
+ + EE +R +E+K + DL E+R++LE K+ +A A + Q +
Sbjct: 236 RNELEEAQRSLESLEKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLRE------LA 289
Query: 126 QRREPGLTLSTAQLERNKTKEQIEEEKK 153
P L L L+ K + Q EE+ +
Sbjct: 290 ADPLP-LLLIPNLLDSTKAQLQKEEQSQ 316
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.8 bits (70), Expect = 1.3
Identities = 24/112 (21%), Positives = 45/112 (40%), Gaps = 7/112 (6%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
+ A+ +A +L + E R Q+AK + L+ K+ A+ + +++K+ + +
Sbjct: 173 LAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRAN 232
Query: 74 EEERRQR----EVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGP 121
E + E K R E E A + +A A +T K P
Sbjct: 233 ESRLKNEIASAEAAAAKAR---EAAAAAEAAAARARAAEAKRTGETYKPTAP 281
Score = 29.7 bits (67), Expect = 2.6
Identities = 37/187 (19%), Positives = 66/187 (35%), Gaps = 22/187 (11%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
+K E + ++L+ QL E + + R + + RL A E Q+ QR++
Sbjct: 68 LKSLETEIASLEAQLIETADDLKKLRKQIADLNARLN----------ALEVQEREQRRRL 117
Query: 74 EEE----RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR-- 127
E+ +R E QR A+ A E+ + K +
Sbjct: 118 AEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVR 177
Query: 128 ------REPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELW 181
+ TL + Q + Q+ EE+K L L+ + + +L+A + L
Sbjct: 178 AEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLK 237
Query: 182 ETIVKLE 188
I E
Sbjct: 238 NEIASAE 244
Score = 29.3 bits (66), Expect = 3.1
Identities = 9/67 (13%), Positives = 24/67 (35%), Gaps = 1/67 (1%)
Query: 87 QRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLER-NKTK 145
Q+++ +++ E + +R + LK + + L Q+ N
Sbjct: 44 QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARL 103
Query: 146 EQIEEEK 152
+E ++
Sbjct: 104 NALEVQE 110
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 30.6 bits (69), Expect = 1.3
Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 4/89 (4%)
Query: 22 SALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
L E++K+ I E + EED + E R+ ++ E+ + +R EEEE + E
Sbjct: 356 KELKERVKDAIPEDLYDKIATEEDATTIDEL---REFLK-EKGHPVVERWAEEEEEEEEE 411
Query: 82 VEEKKQRDLEEKRQRLEEAEKKRQAMMAA 110
EE+ + + E M AA
Sbjct: 412 EEEEAAEAEAPMEEPVPGFEVPEMPMPAA 440
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 30.6 bits (69), Expect = 1.4
Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 9/140 (6%)
Query: 42 EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
EE D R +EK R R+ E + R + R R E+ R+ R R
Sbjct: 3 EEPD--REREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDS 60
Query: 102 KKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIK 161
+ +++ +S+ +RR + R + + + +K +
Sbjct: 61 RSPRSLR---YSSVRRSRDR---PRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKKRSLW 114
Query: 162 PLNVEGW-SIHKLQAKATEL 180
+ G+ + QAKA+++
Sbjct: 115 DIKPPGYELVTADQAKASQV 134
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 29.9 bits (68), Expect = 1.4
Identities = 14/54 (25%), Positives = 23/54 (42%), Gaps = 4/54 (7%)
Query: 298 RQKTKLPKWFGERPGKKKDAPESPEEEEVKRDDEEEENPIYLKISDDEEEKEEE 351
R+K LP +P K+ E E ++ E L D+E ++EE+
Sbjct: 8 RKKGILP----PKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEED 57
>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
Provisional.
Length = 170
Score = 29.7 bits (67), Expect = 1.5
Identities = 16/32 (50%), Positives = 21/32 (65%), Gaps = 4/32 (12%)
Query: 69 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 100
RK EE+E + +EVEEK+ E +RLEEA
Sbjct: 135 LRKFEEDEAKAKEVEEKR----EATWKRLEEA 162
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 30.6 bits (69), Expect = 1.6
Identities = 31/199 (15%), Positives = 69/199 (34%), Gaps = 16/199 (8%)
Query: 10 DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
D E + + K + + + +++ K ++ + K++ AE E +
Sbjct: 222 DAEIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTRE-VKIILAETEAE-VA 279
Query: 70 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRRE 129
K E R + E ++ ++E++ + E+ + +A+ A +++E
Sbjct: 280 AWKAETRREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIE-------RQKE 332
Query: 130 PGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLET 189
L N + Q +EE K A NI +A ET
Sbjct: 333 TELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQA-------EAAVETARETEEAERA 385
Query: 190 DKYDLEERQKRQDYDVSKI 208
++ L + + + +I
Sbjct: 386 EQAALVAAAEAAEQEQVEI 404
Score = 28.6 bits (64), Expect = 6.2
Identities = 23/150 (15%), Positives = 48/150 (32%), Gaps = 11/150 (7%)
Query: 11 PEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQR 70
+ K EA+ + ++ E Q + + + ++ + E
Sbjct: 311 VQHAKALEAREMRVGLIERQKETELEPQERSYFINAAQRQAQEEAKAAANIAEAIGAQAE 370
Query: 71 KKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREP 130
E R E E +Q L E E+ A+ A + +++
Sbjct: 371 AAVETARETEEAERAEQAALVAA-AEAAEQEQVEIAVRAEAAKAEAEAQ--------AAE 421
Query: 131 GLTLSTAQLERNKTKEQIEEEKKIALNIRI 160
A+ R K K + E ++ +A I++
Sbjct: 422 IKAE--AEAIREKGKAEAEAKRALAEAIQV 449
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 29.8 bits (67), Expect = 1.6
Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 4/43 (9%)
Query: 76 ERRQREVEEKKQRDLEE----KRQRLEEAEKKRQAMMAALKEQ 114
E RE E+KK R EE K ++ +EAE + +KEQ
Sbjct: 89 EWHLREKEKKKSRTKEEKKALKEEKDKEAEPYMWCVWDGVKEQ 131
>gnl|CDD|153328 cd07644, I-BAR_IMD_BAIAP2L2, Inverse (I)-BAR, also known as the
IRSp53/MIM homology Domain (IMD), of Brain-specific
Angiogenesis Inhibitor 1-Associated Protein 2-Like 2.
The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs
(I-BAR) domain, is a dimerization and lipid-binding
module that bends membranes and induces membrane
protrusions. This group is composed of uncharacterized
proteins known as BAIAP2L2 (Brain-specific Angiogenesis
Inhibitor 1-Associated Protein 2-Like 2). They contain
an N-terminal IMD, an SH3 domain, and a WASP homology 2
(WH2) actin-binding motif at the C-terminus. The related
proteins, BAIAP2L1 and IRSp53, function as regulators of
membrane dynamics and the actin cytoskeleton. The IMD
domain binds and bundles actin filaments, binds
membranes and produces membrane protrusions, and
interacts with the small GTPase Rac.
Length = 215
Score = 29.9 bits (67), Expect = 1.6
Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 11/89 (12%)
Query: 20 KSSALDEQL-----KEYIVEWRNQRAKEEEDLKRL--KEKQAKRKVMRAEEEQKMAQRKK 72
K++ LD Q + Y +E+R++ A E+ + L E+Q R V +E ++
Sbjct: 101 KNTKLDMQFIEDSRRVYELEYRHRAANLEKCMSELWRMERQRDRNVREMKENVNRLRQSM 160
Query: 73 E----EEERRQREVEEKKQRDLEEKRQRL 97
+ E +R E+++ R L EK L
Sbjct: 161 QAFLKESQRAAELEEKRRYRFLAEKHYLL 189
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 30.7 bits (69), Expect = 1.6
Identities = 24/94 (25%), Positives = 42/94 (44%), Gaps = 7/94 (7%)
Query: 24 LDEQLKEYIVEWRNQRAKEEEDLKRLKEK----QAKRKVM---RAEEEQKMAQRKKEEEE 76
+ +L+ I + + DL +LK++ Q K M + +++Q M +K +
Sbjct: 428 NESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKS 487
Query: 77 RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAA 110
V +KQ E+KR++ EE R A AA
Sbjct: 488 EADSRVNAEKQLAEEKKRKKEEEETAARAAAQAA 521
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 30.3 bits (69), Expect = 1.7
Identities = 13/64 (20%), Positives = 31/64 (48%), Gaps = 4/64 (6%)
Query: 40 AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
+EE + L + K+ ++Q+ A++K E+EE + ++K + ++ + E
Sbjct: 397 EEEEGENGNLSPAERKK----LRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE 452
Query: 100 AEKK 103
+K
Sbjct: 453 TKKV 456
>gnl|CDD|218038 pfam04350, PilO, Pilus assembly protein, PilO. PilO proteins are
involved in the assembly of pilin. However, the precise
function of this family of proteins is not known.
Length = 144
Score = 29.4 bits (67), Expect = 1.7
Identities = 13/49 (26%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 69 QRKKEEEERRQREVEEKKQR--DLEEKRQRLEEAEKKRQAMMAALKEQT 115
+R + +E ++E EEK ++ +L +++LEE E++ A++ L +
Sbjct: 2 ERAQAKEATLKQEFEEKARQAANLPAYKKQLEELEERFGALLKQLPSKK 50
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 30.2 bits (68), Expect = 1.8
Identities = 25/100 (25%), Positives = 43/100 (43%), Gaps = 5/100 (5%)
Query: 3 TDVAGGGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAE 62
TD G +P ++R E ++ + + + + KE E L + +E KR +
Sbjct: 352 TDNTCGPEPPLLERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVAL--- 408
Query: 63 EEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEK 102
EE + EE ++ EEK+ LE+ + E EK
Sbjct: 409 EEGLKELEQDEENFLKELSKEEKEL--LEKLKMEASEVEK 446
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 30.3 bits (69), Expect = 1.8
Identities = 14/59 (23%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 56 RKVMRAEEEQKMAQRKKEEEE-RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
K + EE+ + ++E +E ++E+ E +++ LE+K ++ + K +A+ L E
Sbjct: 60 MKPLEEIEEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEELLE 118
Score = 29.9 bits (68), Expect = 2.3
Identities = 22/101 (21%), Positives = 47/101 (46%), Gaps = 12/101 (11%)
Query: 22 SALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQRE 81
SA+D+ + + E + EE + LKE +A++++ AE++ + KK ++ ++
Sbjct: 53 SAVDKLAMKPLEEI--EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKK--AIKKGKD 108
Query: 82 VEEKKQRDLEEKRQRLEEAEKKR--------QAMMAALKEQ 114
E + LE + + E + R +A++ L E
Sbjct: 109 EEALAEELLELEAEEPEPPLRPRLIVNDATPEALLELLAEN 149
>gnl|CDD|223945 COG1013, PorB, Pyruvate:ferredoxin oxidoreductase and related
2-oxoacid:ferredoxin oxidoreductases, beta subunit
[Energy production and conversion].
Length = 294
Score = 30.0 bits (68), Expect = 1.8
Identities = 13/68 (19%), Positives = 30/68 (44%), Gaps = 6/68 (8%)
Query: 32 IVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLE 91
+ + +A+EE K K + EE + + + + E+ +K Q+D++
Sbjct: 231 LYRYEPGKAEEE------KGKDIPIGIFYPVEEYLEYEERFKHLTKSNPELIQKLQKDVD 284
Query: 92 EKRQRLEE 99
E+ + L+
Sbjct: 285 ERWEELKR 292
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 30.4 bits (69), Expect = 1.8
Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 11/87 (12%)
Query: 15 KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKR-KVMRAEEEQKMAQRKKE 73
K+ + +E+L+ + + K EE L +L+E+ R + A E+++ +
Sbjct: 177 KKLAELKAEEEEELERAL------KEKREELLSKLEEELLARLESKEAALEKQLRLEFER 230
Query: 74 EEERRQREVEEKKQRDLEEKRQRLEEA 100
E+E E+ +K + L ++ +R EA
Sbjct: 231 EKE----ELRKKYEEKLRQELERQAEA 253
Score = 28.1 bits (63), Expect = 8.5
Identities = 21/110 (19%), Positives = 49/110 (44%), Gaps = 10/110 (9%)
Query: 12 EFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
K +E + L+ + + R + +E+E+L RK + Q++ ++
Sbjct: 201 LLSKLEEELLARLESKEAALEKQLRLEFEREKEEL---------RKKYEEKLRQELERQA 251
Query: 72 KEEEERRQREVEEKKQRDLEEKRQRLEEA-EKKRQAMMAALKEQTNKSKG 120
+ E++ + E+ + E + ++E E++R +A L E ++ KG
Sbjct: 252 EAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRLAKLAELNSRLKG 301
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 30.0 bits (68), Expect = 1.9
Identities = 22/99 (22%), Positives = 46/99 (46%), Gaps = 9/99 (9%)
Query: 8 GGDPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKM 67
+P+ VK + E ++ ++R K L ++E QAKR E +++
Sbjct: 8 RNNPDLVKESLKARGLSVDIDLEKLIALDDERKKL---LSEIEELQAKRN----ELSKQI 60
Query: 68 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
+ K +++ + E +K+ ++L+E+ L A K +A
Sbjct: 61 GKAKGQKK--DKIEEIKKELKELKEELTELSAALKALEA 97
>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
protein. Members of this protein family are homologs of
the HlyD membrane fusion protein of type I secretion
systems. Their occurrence in prokaryotic genomes is
associated with the occurrence of a novel class of
microcin (small bacteriocins) with a leader peptide
region related to nitrile hydratase. We designate the
class of bacteriocin as Nitrile Hydratase Leader
Microcin, or NHLM. This family, therefore, is designated
as NHLM bacteriocin system secretion protein. Some but
not all NHLM-class putative microcins belong to the TOMM
(thiazole/oxazole modified microcin) class as assessed
by the presence of the scaffolding protein and/or
cyclodehydratase in the same gene clusters [Transport
and binding proteins, Amino acids, peptides and amines,
Cellular processes, Biosynthesis of natural products].
Length = 421
Score = 30.2 bits (68), Expect = 1.9
Identities = 31/165 (18%), Positives = 61/165 (36%), Gaps = 26/165 (15%)
Query: 44 EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
E +RL+E K Q + EE ++E ++R ++ ++ LEE +
Sbjct: 93 ELRERLQESYQK---------LTQLQEQLEEVRNYTGRLKEGRERHFQKSKEALEETIGR 143
Query: 104 RQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPL 163
+ +AAL + K +G + R ++ + +EQ + +
Sbjct: 144 LREELAALSREVGKQRG---LLSRGLATFKRDRILQQQWR-EEQEKYDAADKAR------ 193
Query: 164 NVEGWSIHKLQAKATE--LWETIVKLETDKYDLEERQKRQDYDVS 206
+I+ LQ KA E L + L + L +++ V
Sbjct: 194 -----AIYALQTKADERNLETVLQSLSQADFQLAGVAEKELETVE 233
>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
ArcB; Provisional.
Length = 779
Score = 30.3 bits (69), Expect = 1.9
Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 20/73 (27%)
Query: 43 EEDLKRLKEKQAKRKVMRAEE--------------EQKMAQRKKEEEERRQ------REV 82
EE +RL AK + MR + Q++A+R+K EE R++ E+
Sbjct: 81 EESRQRLSRLVAKLEEMRERDLELNVQLKDNIAQLNQEIAEREKAEEARQEAFEQLKNEI 140
Query: 83 EEKKQRDLEEKRQ 95
+E+++ +E ++Q
Sbjct: 141 KEREETQIELEQQ 153
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 30.1 bits (68), Expect = 2.0
Identities = 22/92 (23%), Positives = 37/92 (40%), Gaps = 7/92 (7%)
Query: 11 PEFVKRQEAKSSALDEQLKEYIVEWRNQ---RAKEEED--LKRLKEKQAKRKVMRAEEEQ 65
E K ++ L + + + EWR++ AK E L + K EQ
Sbjct: 323 EEVEKSKQKFEPLLAQYGESF-QEWRSKLQALAKVESRSLLSVEERLPTIEKEDLELYEQ 381
Query: 66 KMAQRKKEEEERRQREVEEKKQRDLEEKRQRL 97
+ E + QR+ + ++Q D EE +QR
Sbjct: 382 RQQLWFFENRKLWQRKKKLREQAD-EEYQQRH 412
>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
Length = 694
Score = 30.3 bits (68), Expect = 2.0
Identities = 14/55 (25%), Positives = 23/55 (41%), Gaps = 14/55 (25%)
Query: 311 PGKKKDAPESPEEEEVKRDD--------------EEEENPIYLKISDDEEEKEEE 351
P + +D P+++ R D +++PIY +SD EE EE
Sbjct: 534 PEQMEDPYLEPDDDRFDRRDGAAAAATSHPREAPAPDDDPIYEGVSDSEEPVYEE 588
>gnl|CDD|180240 PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated.
Length = 156
Score = 29.4 bits (67), Expect = 2.0
Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 9/83 (10%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEE--- 92
+RAK+E L+ QAK + AE + A+ ++ ++R + +EE K E
Sbjct: 46 AAERAKKE-----LELAQAKYEAQLAEARAEAAEIIEQAKKRAAQIIEEAKAEAEAEAAR 100
Query: 93 -KRQRLEEAEKKRQAMMAALKEQ 114
K Q E E++R+ L++Q
Sbjct: 101 IKAQAQAEIEQERKRAREELRKQ 123
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 30.1 bits (69), Expect = 2.0
Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 13/65 (20%)
Query: 62 EEEQKM---AQRKKEEEERRQREVEEKKQRD---------LEEKRQRLEEAEKKR-QAMM 108
EE ++M A+ EE+++R+ VE + Q D L+E ++ EK++ +A +
Sbjct: 503 EEIERMVKDAEANAEEDKKRKELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAAI 562
Query: 109 AALKE 113
LKE
Sbjct: 563 KELKE 567
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 30.1 bits (68), Expect = 2.0
Identities = 21/81 (25%), Positives = 44/81 (54%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE 73
V+R E ++S L +L+E E ++ E + +++K K + +RA + + K+
Sbjct: 431 VERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKEL 490
Query: 74 EEERRQREVEEKKQRDLEEKR 94
EE++++ E E+K +L + R
Sbjct: 491 EEKKKRVEELERKLAELRKMR 511
Score = 29.3 bits (66), Expect = 3.6
Identities = 15/90 (16%), Positives = 42/90 (46%)
Query: 10 DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
+ + E + L+E ++ E + + EE + +++ +++ + R E K+ +
Sbjct: 413 ERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRK 472
Query: 70 RKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
++ R+ E EK+ + +++ + LE
Sbjct: 473 DREIRARDRRIERLEKELEEKKKRVEELER 502
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 29.6 bits (67), Expect = 2.0
Identities = 14/35 (40%), Positives = 16/35 (45%)
Query: 52 KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKK 86
K AK A EE + A KEEE + E KK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKK 194
>gnl|CDD|223407 COG0330, HflC, Membrane protease subunits, stomatin/prohibitin
homologs [Posttranslational modification, protein
turnover, chaperones].
Length = 291
Score = 29.7 bits (67), Expect = 2.5
Identities = 13/82 (15%), Positives = 23/82 (28%), Gaps = 4/82 (4%)
Query: 51 EKQAKRKVMRAEEEQ-KMAQR-KKEEEERRQREVEEKKQRDLEEK--RQRLEEAEKKRQA 106
+ A K M AE ++ + E + R E + + + + A + A
Sbjct: 175 VQAAMEKQMAAERDKRAEILEAEGEAQAAILRAEGEAEAAIILAEAEAEAEVIARAEADA 234
Query: 107 MMAALKEQTNKSKGPNFTIQRR 128
P QR
Sbjct: 235 AKIIAAALREAPAAPQALAQRY 256
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.9 bits (68), Expect = 2.8
Identities = 22/88 (25%), Positives = 37/88 (42%), Gaps = 8/88 (9%)
Query: 30 EYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE-----E 84
E + + R+ K E L +L+ K E+K K EE + E E
Sbjct: 43 ERLRKLRSLLTKLSEALDKLRSYLPKLNP---LREEKKKVSVKSLEELIKDVEEELEKIE 99
Query: 85 KKQRDLEEKRQRLEEAEKKRQAMMAALK 112
K+ ++LEE+ LE K+ + + L+
Sbjct: 100 KEIKELEEEISELENEIKELEQEIERLE 127
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 28.6 bits (65), Expect = 2.8
Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Query: 36 RNQRAKEEEDLKRLKE--KQAKRKVMRA--EEEQKMAQRKKEEEERRQREVEEKKQRDLE 91
+ + K E ++ L E K K + A ++++ A K E++ + + EKK+ E
Sbjct: 34 KKKGKKGELEITDLNEEYKDLKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKKAKKE 93
Query: 92 EKRQRL 97
E + RL
Sbjct: 94 EPKPRL 99
>gnl|CDD|234604 PRK00058, PRK00058, methionine sulfoxide reductase A; Provisional.
Length = 213
Score = 29.1 bits (66), Expect = 2.9
Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 2/47 (4%)
Query: 208 IIINKKIMPSPGQGRPGFNPRYVQTPNL-FNNATALTGKYPPKIQVA 253
+ +KK + + + PG + L N LTG P ++ A
Sbjct: 3 SLFDKKHLVTAEEALPG-RATPMPVATLHAVNGHPLTGPVPEGMEQA 48
>gnl|CDD|215101 PLN00203, PLN00203, glutamyl-tRNA reductase.
Length = 519
Score = 29.7 bits (67), Expect = 2.9
Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 18/63 (28%)
Query: 164 NVEGW--------SIHKLQAKATELWETIVKLETDKY------DLEERQKRQDYDVSKII 209
N E W +I KL++ A E I E +K DL ++Q++ D+S+ I
Sbjct: 420 NFEAWRDSLETVPTIKKLRSYA----ERIRAAELEKCLSKMGDDLTKKQRKAVEDLSRGI 475
Query: 210 INK 212
+NK
Sbjct: 476 VNK 478
>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein. Members of this
protein family belong to a conserved gene four-gene
neighborhood found sporadically in a phylogenetically
broad range of bacteria: Nocardia farcinica,
Symbiobacterium thermophilum, and Streptomyces
avermitilis (Actinobacteria), Geobacillus kaustophilus
(Firmicutes), Azoarcus sp. EbN1 and Ralstonia
solanacearum (Betaproteobacteria). Proteins in this
family average over 1400 amino acids in length
[Hypothetical proteins, Conserved].
Length = 1353
Score = 29.8 bits (67), Expect = 2.9
Identities = 24/98 (24%), Positives = 38/98 (38%), Gaps = 3/98 (3%)
Query: 16 RQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEE 75
E ++ AL +L+ Q A+E E + E A+ Q + + + E
Sbjct: 308 ALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAA---ADARQAIREAESRLE 364
Query: 76 ERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
E R+R EE + D E+ R + R A A L
Sbjct: 365 EERRRLDEEAGRLDDAERELRAAREQLARAAERAGLSP 402
Score = 29.0 bits (65), Expect = 5.7
Identities = 17/67 (25%), Positives = 28/67 (41%)
Query: 40 AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
A +R + +A+ A E+ A+ + EE R R +EE ++E R RL E
Sbjct: 873 AATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAMVDEIRARLAE 932
Query: 100 AEKKRQA 106
+
Sbjct: 933 TRAALAS 939
>gnl|CDD|235870 PRK06834, PRK06834, hypothetical protein; Provisional.
Length = 488
Score = 29.6 bits (67), Expect = 3.0
Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 9/92 (9%)
Query: 96 RLEEAEKKRQAMMAALKEQTNKSKG-PNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
++E K+ AMM+ L + +G P + RR P L L TA R + + + +
Sbjct: 355 GMDEPRKRIAAMMSGLDIHYDLGEGHP--LLGRRMPDLDLVTADGPR-RVFTLLHNARPV 411
Query: 155 ALNIRIKP--LNVEGWS--IHKLQAKATELWE 182
LN+ P ++ WS + + AK WE
Sbjct: 412 LLNLG-APGAFDIAPWSDRVRLVDAKYAGPWE 442
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 29.3 bits (66), Expect = 3.0
Identities = 15/83 (18%), Positives = 44/83 (53%)
Query: 37 NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
+ E E+ ++ ++Q +++ RA+ + ++A+++ ++E +QR E+ + EE R
Sbjct: 85 ERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKMQEESVLR 144
Query: 97 LEEAEKKRQAMMAALKEQTNKSK 119
E + + + ++ +T + +
Sbjct: 145 QEAMRRATEEEILEMRRETIEEE 167
>gnl|CDD|233879 TIGR02468, sucrsPsyn_pln, sucrose phosphate synthase/possible
sucrose phosphate phosphatase, plant. Members of this
family are sucrose-phosphate synthases of plants. This
enzyme is known to exist in multigene families in
several species of both monocots and dicots. The
N-terminal domain is the glucosyltransferase domain.
Members of this family also have a variable linker
region and a C-terminal domain that resembles sucrose
phosphate phosphatase (SPP) (EC 3.1.3.24) (see
TIGR01485), the next and final enzyme of sucrose
biosynthesis. The SPP-like domain likely serves a
binding and not a catalytic function, as the reported
SPP is always encoded by a distinct protein.
Length = 1050
Score = 29.7 bits (67), Expect = 3.2
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 55 KRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEE 92
K+K + EE Q++A+R+ E E R RE DL E
Sbjct: 90 KKKQLEWEEAQRLAKRRLERERGR-REATADMSEDLSE 126
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 29.2 bits (66), Expect = 3.2
Identities = 12/66 (18%), Positives = 36/66 (54%), Gaps = 3/66 (4%)
Query: 40 AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEK---RQR 96
+E+E+ + K K A + + + K+ +++K + E+ ++ + E ++ E++ + R
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELAEKLR 98
Query: 97 LEEAEK 102
L + ++
Sbjct: 99 LRKLQE 104
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 28.4 bits (64), Expect = 3.2
Identities = 13/78 (16%), Positives = 41/78 (52%), Gaps = 5/78 (6%)
Query: 45 DLKRLKEK--QAKRKVMRAEEEQKMAQRKKEEEE---RRQREVEEKKQRDLEEKRQRLEE 99
++RLK + +R++ + +++ ++K + E + ++E ++ + +++++ +
Sbjct: 72 TIERLKTQLEDLERELALLQAKERQLEKKLKTLEQKLKNEKEEVQRLKNIIQQRKTQYNH 131
Query: 100 AEKKRQAMMAALKEQTNK 117
KKR + LKE+ +
Sbjct: 132 ELKKRDREIEKLKERLLQ 149
>gnl|CDD|149180 pfam07960, CBP4, CBP4. The CBP4 in S. cerevisiae is essential for
the expression and activity of ubiquinol-cytochrome c
reductase. This family appears to be fungal specific.
Length = 128
Score = 28.1 bits (63), Expect = 3.3
Identities = 18/77 (23%), Positives = 39/77 (50%), Gaps = 9/77 (11%)
Query: 36 RNQRAKEEEDLKRLKEKQAKRK---------VMRAEEEQKMAQRKKEEEERRQREVEEKK 86
R+ R +E ++L + ++ +K E+++ + + +K + E Q+E E+
Sbjct: 52 RDLRQQEFDELVTILKETSKSDKPIWKTGPIESPWEKKRNVREFQKTKAEEAQKEELERI 111
Query: 87 QRDLEEKRQRLEEAEKK 103
+ +LEE R + EE K+
Sbjct: 112 REELEEARAQSEEMRKE 128
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 29.3 bits (66), Expect = 3.4
Identities = 22/63 (34%), Positives = 30/63 (47%), Gaps = 11/63 (17%)
Query: 67 MAQRKKEEEERRQ-REVE---------EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
+AQ++KEEEERR R E KQ+ EE++ EE + R A AL
Sbjct: 118 LAQKQKEEEERRVERRRELGLEDPEQLRLKQKAKEEQKAESEET-RHRAANATALAAIGG 176
Query: 117 KSK 119
+ K
Sbjct: 177 RKK 179
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 29.6 bits (67), Expect = 3.4
Identities = 21/107 (19%), Positives = 41/107 (38%), Gaps = 16/107 (14%)
Query: 10 DPEFVKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
D E ++ + L I ++ + L+ +K + + ++ E E+ A+
Sbjct: 35 DLEEGEKGLKELEKLKVAEVAQISLSSLL-SEVLDYLRSVKGLEGRLFILPEEVEKLEAE 93
Query: 70 RKKEEEERRQ------------REVEEK---KQRDLEEKRQRLEEAE 101
K EE + R++EE+ +LE+ LEE E
Sbjct: 94 LKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLEDLLEELE 140
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 29.6 bits (67), Expect = 3.4
Identities = 25/152 (16%), Positives = 62/152 (40%), Gaps = 16/152 (10%)
Query: 44 EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
E +++E + + K + + EQ A+ +K+EEE E + K D K+ +
Sbjct: 92 EIEAQIEELKKELKKLEEKIEQLEAEIEKKEEE---LEKAKNKFLDKAWKKLAKKYDSNL 148
Query: 104 RQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALN---IRI 160
+A+ ++ K K ++ + + +++ L + E+++ + K + +
Sbjct: 149 SEALKGLNYKKNFKEK----LLKELKSVILNASSLL----SLEELKAKIKTLFSSNKPEL 200
Query: 161 KPLNVEGWSIHKLQAKATELWETIVKLETDKY 192
L + I + + E+ E + +D
Sbjct: 201 ALLTLS--VIDFDEIEQAEILEKSIIGSSDVP 230
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 29.6 bits (67), Expect = 3.5
Identities = 15/74 (20%), Positives = 36/74 (48%), Gaps = 15/74 (20%)
Query: 55 KRKVMRAEEEQKMAQRKKEEEERRQRE---------------VEEKKQRDLEEKRQRLEE 99
K+++ + + K+ ++ +E+ E+ QRE +E+ + +LE+ +++LEE
Sbjct: 199 KKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEKDDKDELEKLKEKLEE 258
Query: 100 AEKKRQAMMAALKE 113
+ + KE
Sbjct: 259 LKLPEEVKKVIEKE 272
>gnl|CDD|227201 COG4864, COG4864, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 328
Score = 29.1 bits (65), Expect = 3.5
Identities = 20/61 (32%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 54 AKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK-RQAMMAALK 112
AK + +AE ++ +AQ K EE +E++ + +EE R ++ EAE + A+ AL+
Sbjct: 231 AKLQTEQAEADKNIAQAKAEERRAMAVALEQEMRARVEEMRAKVVEAEAEVPLALSEALR 290
Query: 113 E 113
E
Sbjct: 291 E 291
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.4 bits (66), Expect = 3.6
Identities = 18/69 (26%), Positives = 38/69 (55%), Gaps = 3/69 (4%)
Query: 36 RNQRAKE-EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR 94
R + KE + L+ +K + K E+ A+ K +EE+ +Q E E+K++ +E +
Sbjct: 367 RQELLKEYNKKLQDYTKKLGEVKDETDASEE--AEAKAKEEKLKQEENEKKQKEQADEDK 424
Query: 95 QRLEEAEKK 103
++ ++ E+K
Sbjct: 425 EKRQKDERK 433
>gnl|CDD|221431 pfam12127, YdfA_immunity, SigmaW regulon antibacterial. This
protein is found in bacteria. Proteins in this family
are about 330 amino acids in length. The operon from
which this protein is derived confers immunity for the
host species to a broad range of antibacterial
compounds, unlike the specific immunity proteins that
are linked to and co-regulated with their
antibiotic-synthesis proteins.
Length = 318
Score = 29.2 bits (66), Expect = 3.9
Identities = 18/61 (29%), Positives = 35/61 (57%), Gaps = 1/61 (1%)
Query: 54 AKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK-RQAMMAALK 112
A+ + +AE ++++AQ K EE E++ + ++E R ++ EAE + +AM A +
Sbjct: 228 ARLQTDQAEADKRIAQAKAEERRAMAVAREQEMKAKVQEMRAKVVEAEAEVPKAMAEAFR 287
Query: 113 E 113
E
Sbjct: 288 E 288
>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
Length = 850
Score = 29.2 bits (66), Expect = 3.9
Identities = 18/80 (22%), Positives = 41/80 (51%), Gaps = 10/80 (12%)
Query: 39 RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEE---ERRQREVEEKKQRDLEEKRQ 95
+ E ++++ + + ++ AE E + ++++E E E+ Q E+E K ++L +
Sbjct: 729 LIEAEAEVEQAELRAKALRIE-AEAELEKLRKRQELELEYEQAQNELEIAKAKELADI-- 785
Query: 96 RLEEAEKKRQAMMAALKEQT 115
EA K + ++ AL +T
Sbjct: 786 ---EATKFER-IVEALGRET 801
>gnl|CDD|221041 pfam11241, DUF3043, Protein of unknown function (DUF3043). Some
members in this family of proteins with unknown function
are annotated as membrane proteins. This cannot be
confirmed.
Length = 168
Score = 28.4 bits (64), Expect = 4.0
Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 7/44 (15%)
Query: 70 RKKEEEERRQR----EVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
R++ E RR+ + + K+ +R+R A R AMMA
Sbjct: 9 RREAEAARRRPLVPEDRKAAKKAARAARRERRARA---RAAMMA 49
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 28.7 bits (64), Expect = 4.0
Identities = 15/45 (33%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 61 AEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQ 105
E+E + RKK ++E R+R ++K++R E Q+ +E +K+Q
Sbjct: 143 FEKECRALSRKKSDDEHRKRSGKQKEKRR-VEDSQKHKEDRRKKQ 186
>gnl|CDD|116627 pfam08017, Fibrinogen_BP, Fibrinogen binding protein. Proteins in
this family bind to fibrinogen. Members of this family
includes the fibrinogen receptor, FbsA, which mediates
platelet aggregation.
Length = 393
Score = 29.1 bits (64), Expect = 4.2
Identities = 25/111 (22%), Positives = 52/111 (46%)
Query: 44 EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
E +R E +++ V+ + + + ERRQR+VE K Q ++ E+RQR E + +
Sbjct: 144 ERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQ 203
Query: 104 RQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 154
+ ++ N+S+G ++R+ LER + + + + +
Sbjct: 204 GNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNV 254
Score = 28.7 bits (63), Expect = 6.3
Identities = 21/86 (24%), Positives = 43/86 (50%)
Query: 44 EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
E +R E +++ V+ + + + ERRQR+ E + Q ++ E+RQR E + +
Sbjct: 48 ERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQ 107
Query: 104 RQAMMAALKEQTNKSKGPNFTIQRRE 129
+ ++ NKS+G ++R+
Sbjct: 108 GNVLERRQRDVENKSQGNVLERRQRD 133
Score = 28.3 bits (62), Expect = 7.2
Identities = 24/100 (24%), Positives = 47/100 (47%)
Query: 44 EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
E +R E +++ V+ + + + ERRQR+ E + Q ++ E+RQR E + +
Sbjct: 128 ERRQRDAENRSQGNVLERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQ 187
Query: 104 RQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK 143
+ ++ NKS+G ++R+ LER +
Sbjct: 188 GNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQ 227
Score = 28.3 bits (62), Expect = 8.0
Identities = 22/86 (25%), Positives = 44/86 (51%)
Query: 44 EDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 103
E +R E +++ V+ + + + ERRQR+VE K Q ++ E+RQR E + +
Sbjct: 64 ERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQ 123
Query: 104 RQAMMAALKEQTNKSKGPNFTIQRRE 129
+ ++ N+S+G ++R+
Sbjct: 124 GNVLERRQRDAENRSQGNVLERRQRD 149
Score = 27.9 bits (61), Expect = 9.3
Identities = 22/83 (26%), Positives = 42/83 (50%)
Query: 47 KRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
+R E +++ V+ + + + ERRQR+VE K Q ++ E+RQR E +
Sbjct: 83 QRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNV 142
Query: 107 MMAALKEQTNKSKGPNFTIQRRE 129
+ ++ NKS+G ++R+
Sbjct: 143 LERRQRDAENKSQGNVLERRQRD 165
>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family. This is a family
of sequences found in both bacteria and bacteriophages.
This region is approximately 130 residues long and in
some cases is found as part of the PVL
(Panton-Valentine leukocidin) group of genes, which
encode a member of the leukocidin group of bacterial
toxins that kill leukocytes by creation of pores in the
cell membrane. PVL appears to be a virulence factor
associated with a number of human diseases.
Length = 118
Score = 27.9 bits (62), Expect = 4.2
Identities = 14/49 (28%), Positives = 31/49 (63%)
Query: 48 RLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
L E MR +E +++ + + E+ER++RE+E K++++ E +R++
Sbjct: 38 ELSEALDAPYGMRLKEYREIKKSENIEQERKERELERKRRKEAELRRKK 86
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 29.1 bits (65), Expect = 4.2
Identities = 22/83 (26%), Positives = 48/83 (57%)
Query: 40 AKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEE 99
+EE++ +++KE++ K+K EE + +++ +E+R +E E++K++ +EE R R EE
Sbjct: 104 KEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163
Query: 100 AEKKRQAMMAALKEQTNKSKGPN 122
+++R + K+ K
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNK 186
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 29.1 bits (65), Expect = 4.3
Identities = 15/81 (18%), Positives = 34/81 (41%), Gaps = 6/81 (7%)
Query: 67 MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 126
MA R + E + ++K + ++++ R+E EK K++ + + ++
Sbjct: 1 MAYRNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLD------KKEEVVTDNVDSPVK 54
Query: 127 RREPGLTLSTAQLERNKTKEQ 147
+ L A + TKE+
Sbjct: 55 EQSSQENLKIADEVKKSTKEE 75
>gnl|CDD|224486 COG1570, XseA, Exonuclease VII, large subunit [DNA replication,
recombination, and repair].
Length = 440
Score = 29.1 bits (66), Expect = 4.4
Identities = 27/113 (23%), Positives = 57/113 (50%), Gaps = 6/113 (5%)
Query: 52 KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK-RQAMMA- 109
Q +R++ RA + + Q+K+ E +R +R L E++QRL+E + R+A+
Sbjct: 271 DQLQRRLHRALR-RLLDQKKQRLEHLARRLQFRSPERLLSEQQQRLDELAIRLRRALENQ 329
Query: 110 -ALKEQT--NKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIR 159
ALK+Q ++ N IQR++ L +L++ ++ + +++ +
Sbjct: 330 LALKKQRLERLTQRLNPQIQRQQQRLQQLERRLDKALRRQLKRKRERLEALVE 382
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 29.3 bits (66), Expect = 4.5
Identities = 15/57 (26%), Positives = 24/57 (42%), Gaps = 2/57 (3%)
Query: 60 RAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
EE + KE EE +E++ +LEE L+E + + L E +N
Sbjct: 72 LEEEILDLEAEIKEVEEN-LESLEKEI-NELEEWLNVLDEEKSFLDENLEELSELSN 126
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 29.0 bits (65), Expect = 4.5
Identities = 21/80 (26%), Positives = 40/80 (50%), Gaps = 4/80 (5%)
Query: 17 QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
E A D ++ E I + + A + E +RL+ +QA+ RAEE+ ++ + E+
Sbjct: 374 AEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAE----RAEEDARLRELYPLPED 429
Query: 77 RRQREVEEKKQRDLEEKRQR 96
+ E E ++ + EE+ R
Sbjct: 430 EFEDEDELEEAQPEEEEEAR 449
>gnl|CDD|168632 PRK06633, PRK06633, acetyl-CoA acetyltransferase; Provisional.
Length = 392
Score = 28.8 bits (64), Expect = 4.5
Identities = 19/74 (25%), Positives = 35/74 (47%), Gaps = 8/74 (10%)
Query: 237 NNATALTGKYPPKIQVASKYE--RRVDTRSYDDKKKLFEGGYLSLTKESLEKQY---AEK 291
N + + G Y I+ +K+ + VD YD +F G ++ +T E++ KQ+ ++
Sbjct: 118 NMSLGMHGSY---IRAGAKFGDIKMVDLMQYDGLTDVFSGVFMGITAENISKQFNISRQE 174
Query: 292 VDQFGNRQKTKLPK 305
D+F K K
Sbjct: 175 QDEFALSSHKKAAK 188
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 28.9 bits (65), Expect = 4.8
Identities = 20/101 (19%), Positives = 41/101 (40%), Gaps = 7/101 (6%)
Query: 23 ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREV 82
+L E I E +A+ ++ L + +E+ + +AQ E +E
Sbjct: 63 SLQALNLELIQELNELKARLQQQLLQSREQLQ-------LLIESLAQLSSEFQELANEIF 115
Query: 83 EEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNF 123
EE +R E +Q L++ K + ++ +EQ + +
Sbjct: 116 EELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHESA 156
>gnl|CDD|235362 PRK05192, PRK05192, tRNA uridine 5-carboxymethylaminomethyl
modification enzyme GidA; Validated.
Length = 618
Score = 28.9 bits (66), Expect = 4.9
Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 11/84 (13%)
Query: 128 REPGLTLSTAQLER-NKTKEQIEEEKKIALNIRIKP--LNVEGWSIHKLQAKATEL---- 180
E GL + + R + KE IEEE + + R+ P LN G K + +L
Sbjct: 449 YELGL-VDDERWARFEEKKEAIEEEIERLKSTRVTPDELNELGGDPLKREVSLLDLLRRP 507
Query: 181 ---WETIVKLETDKYDLEERQKRQ 201
+E + KL+ + DL+ Q
Sbjct: 508 EITYEDLAKLDPELADLDPEVAEQ 531
>gnl|CDD|184697 PRK14475, PRK14475, F0F1 ATP synthase subunit B; Provisional.
Length = 167
Score = 28.4 bits (63), Expect = 4.9
Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 6/55 (10%)
Query: 59 MRAEEEQKMAQRK--KEEEERRQREVEEKKQRDL----EEKRQRLEEAEKKRQAM 107
+R E + +A K +EE ER+ + + D E +++LEE K+R M
Sbjct: 56 LREEAQALLADVKAEREEAERQAAAMLAAAKADARRMEAEAKEKLEEQIKRRAEM 110
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 28.7 bits (65), Expect = 5.0
Identities = 22/82 (26%), Positives = 43/82 (52%), Gaps = 10/82 (12%)
Query: 40 AKEEEDLKRLKE-----KQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRD----- 89
EE + +E KQA+R++ +E+++ ++ + + + E+EE +
Sbjct: 150 GLEELAQEAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEE 209
Query: 90 LEEKRQRLEEAEKKRQAMMAAL 111
LEE+R+RL +EK +A+ AL
Sbjct: 210 LEEERKRLSNSEKLAEAIQNAL 231
>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435). This
family of proteins are functionally uncharacterized.
This protein is found in eukaryotes. Proteins in this
family are typically between 435 to 791 amino acids in
length. This family is related to pfam00589 suggesting
it may be an integrase enzyme.
Length = 418
Score = 28.9 bits (65), Expect = 5.1
Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 38 QRAKEEED--LKRLKEKQAKRKVMRAEEEQKMAQRK----KEEEERRQREVEEKKQRDLE 91
Q+A EED L+ L K+ K ++A+ K E E+R+REV ++QR
Sbjct: 280 QKASVEEDPELQELIRKRDHLKKEIIALYGQVAKAKGTPLYERLEKRRREVRNERQRLRR 339
Query: 92 EKRQRLEEAEKKRQA 106
E ++++ E + Q
Sbjct: 340 ELKKKIREEFDEEQP 354
>gnl|CDD|148049 pfam06210, DUF1003, Protein of unknown function (DUF1003). This
family consists of several hypothetical bacterial
proteins of unknown function.
Length = 108
Score = 27.5 bits (62), Expect = 5.2
Identities = 12/49 (24%), Positives = 25/49 (51%), Gaps = 2/49 (4%)
Query: 53 QAKRKVMRAEEEQKMAQRKKEEEERRQREVEE--KKQRDLEEKRQRLEE 99
QA R +RAE + ++ + + E E R+++ + +L + Q +E
Sbjct: 60 QADRDRLRAELDYQINLKAELEIEALHRKLDALREALGELLTRDQERQE 108
>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 27.4 bits (61), Expect = 5.3
Identities = 16/66 (24%), Positives = 38/66 (57%), Gaps = 11/66 (16%)
Query: 42 EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
++ +L+R+ EK+ + +V++ ++E++ A+R + E ER L ++ Q+LE+ E
Sbjct: 49 QKPELQRVLEKRKRDQVLKEQKEEEEAKRLQSELERE-----------LMKRAQKLEQLE 97
Query: 102 KKRQAM 107
++
Sbjct: 98 LEKAKE 103
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 29.2 bits (65), Expect = 5.4
Identities = 10/43 (23%), Positives = 22/43 (51%), Gaps = 4/43 (9%)
Query: 314 KKDAPESPE--EEEVKRDDEEEENPIYL--KISDDEEEKEEEN 352
K+D ++ E + + K D+EE + + + +I D +E +
Sbjct: 3938 KEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQP 3980
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 28.8 bits (64), Expect = 5.7
Identities = 19/103 (18%), Positives = 40/103 (38%), Gaps = 5/103 (4%)
Query: 15 KRQEAKSS-----ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ 69
+R++A+ S AL + L + + + +++ QAK KV A+ E A+
Sbjct: 270 RREKAQQSKDHANALKKALPTVGNRIKKEEQETLVRQFTVEQTQAKSKVESAKIETDRAR 329
Query: 70 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
+ +++ + L KR+ + + A L
Sbjct: 330 EMETLAHDNVKQIVGAQHGILSAKREGAVDKRRTISTARAGLD 372
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.5 bits (64), Expect = 5.7
Identities = 26/128 (20%), Positives = 57/128 (44%), Gaps = 8/128 (6%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEA---EKKRQA 106
++Q K + + E++++ ++++EE E RQR +EK + +K Q+ + + K++A
Sbjct: 98 AQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEA 157
Query: 107 MMAALKEQTNKSKGPNFTIQ-----RREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIK 161
+A + +K Q R + Q ++ + +E+ ++ KK K
Sbjct: 158 AESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEEEERK 217
Query: 162 PLNVEGWS 169
E W
Sbjct: 218 QKAEEAWQ 225
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 28.8 bits (64), Expect = 6.0
Identities = 16/79 (20%), Positives = 30/79 (37%), Gaps = 4/79 (5%)
Query: 29 KEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQR 88
KE + ++ K EE LKE+ +R++ + +K+ E Q +Q
Sbjct: 508 KEKASDQKSLLWKAEEQYALLKEEAKQRQLQEQKALL--EHKKETLEYTSQLAELLDQQA 565
Query: 89 DLEEKRQR--LEEAEKKRQ 105
D E + + E+
Sbjct: 566 DRFELSAQAAGSQKERGSD 584
>gnl|CDD|224796 COG1884, Sbm, Methylmalonyl-CoA mutase, N-terminal domain/subunit
[Lipid metabolism].
Length = 548
Score = 28.5 bits (64), Expect = 6.3
Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 1/57 (1%)
Query: 31 YIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQ 87
Y VEW EE ++E +A +++A E+ + E RRQ +EE ++
Sbjct: 397 YYVEWLTDEL-EEAAWAYIEEIEAMGGMLKAIEDGYPQREIAESAARRQARIEEGER 452
>gnl|CDD|214351 CHL00063, atpE, ATP synthase CF1 epsilon subunit.
Length = 134
Score = 27.5 bits (62), Expect = 6.8
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 84 EKKQRDLEEKRQRLEEAEKKRQAMMAALK 112
++ Q+ LE LE+AE K+Q + A L
Sbjct: 90 QEAQQTLEIAEANLEKAEGKKQKIEANLA 118
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 27.6 bits (62), Expect = 6.8
Identities = 15/73 (20%), Positives = 34/73 (46%)
Query: 39 RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLE 98
+A E + L+E + + + + +++ Q +E + + E+E K+ E E
Sbjct: 57 QALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELERIKEAAEAEIEAEKE 116
Query: 99 EAEKKRQAMMAAL 111
A ++ +A +A L
Sbjct: 117 RALEELRAEVAEL 129
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 28.4 bits (63), Expect = 6.9
Identities = 20/100 (20%), Positives = 41/100 (41%), Gaps = 5/100 (5%)
Query: 26 EQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEK 85
E KE I + ++ K E L+ +++ K + + + K+ EER + +E
Sbjct: 121 ESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWR--NLYKKNEERFKEYLENF 178
Query: 86 KQRDLEEKRQRLEEAEKKRQAMMA---ALKEQTNKSKGPN 122
K++ + K + E K + + +KE+ N
Sbjct: 179 KRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRHN 218
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 28.7 bits (65), Expect = 6.9
Identities = 21/102 (20%), Positives = 39/102 (38%), Gaps = 23/102 (22%)
Query: 17 QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKE--KQAKRKVMRAEEEQKMAQRKKEE 74
QE S AL+ A +EE + +L + ++A MR EQ ++ +
Sbjct: 478 QERLSEALE------------NGASDEE-IAKLMQELREALDDYMRQLAEQ---AQRNPQ 521
Query: 75 EERRQREVEEKK---QRDLEEKRQRLEEAEKKRQAMMAALKE 113
+ + + Q+DL+ R+EE A ++
Sbjct: 522 QGDQPPDQGNSMELTQQDLQRMMDRIEEL--MESGRRAEAQQ 561
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly
related to archaeal Holliday junction resolvase
[Nucleotide transport and metabolism].
Length = 175
Score = 27.8 bits (62), Expect = 6.9
Identities = 15/44 (34%), Positives = 19/44 (43%), Gaps = 2/44 (4%)
Query: 37 NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQ--RKKEEEERR 78
R EE K +E+ R EEE K+ + KK EE R
Sbjct: 34 KARELEETLQKAERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77
>gnl|CDD|215458 PLN02853, PLN02853, Probable phenylalanyl-tRNA synthetase alpha
chain.
Length = 492
Score = 28.5 bits (64), Expect = 7.0
Identities = 25/102 (24%), Positives = 43/102 (42%), Gaps = 22/102 (21%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMA 109
+ Q RKV E+E K +E + EV++K L+ KR++L
Sbjct: 118 GKPQVSRKVQHVEDEVKELLLAIQEGK----EVDDKDIDALK-KRRKL------------ 160
Query: 110 ALKEQTNK----SKGPNFTIQRREPGLTLSTAQLERNKTKEQ 147
+ +T K KGPN+ +R++ L+ L+ K+
Sbjct: 161 -ITLETWKGYSIKKGPNYAPERKKAATDLTREMLQSGDWKDL 201
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 28.5 bits (64), Expect = 7.1
Identities = 39/198 (19%), Positives = 79/198 (39%), Gaps = 15/198 (7%)
Query: 13 FVKRQEAKSSAL-DEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRK 71
F+K Q S ++ + Y+ K ++ +KR K+ + +V + + + + Q+
Sbjct: 796 FIKLQPLLSLLGSRKEYRSYL----ACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKF 851
Query: 72 KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 131
+ ++R KK+ + QR+E AE++ Q + +K + K N ++
Sbjct: 852 GRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKS-ISSLKLVNLELESEIIE 910
Query: 132 LTLSTAQLERNKTK---EQIEEEKKIALNIRIK---PLNVEGWSIH-KLQAKATELWETI 184
L S + + E I KK+ NI ++ + KL ++L ET
Sbjct: 911 LKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETS 970
Query: 185 VKLET--DKYDLEERQKR 200
+ E K + R+
Sbjct: 971 EEYEDLLKKSTILVREGN 988
>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
Length = 432
Score = 28.2 bits (63), Expect = 7.2
Identities = 17/67 (25%), Positives = 30/67 (44%), Gaps = 6/67 (8%)
Query: 50 KEKQAKRKVMRAEEEQKMAQRKKEEEE----RRQREVEEKKQRDLEEKRQRLEEAEKKRQ 105
+E++A +++ + E EE R +RE EE+ R E +RQR+ +
Sbjct: 100 QEEKAVNPFTPQPGQREERRPTLESEEEWRARLKREQEEQYLR--ERQRQRMARLQANAA 157
Query: 106 AMMAALK 112
A + L
Sbjct: 158 AYDSPLA 164
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 28.4 bits (64), Expect = 7.2
Identities = 20/93 (21%), Positives = 44/93 (47%), Gaps = 7/93 (7%)
Query: 23 ALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREV 82
L E+LKE + + EE+L+++K+ K +A + +R++ E+ +E
Sbjct: 331 ELKEELKEL----EEELKELEEELEKIKKLLKKLP-KKARGQLPPEKREQLEKLLETKEK 385
Query: 83 EEKKQRDLEEKRQRLEE--AEKKRQAMMAALKE 113
++ +LEE+ + L+E + ++ K
Sbjct: 386 LSEELEELEEELKELKEELESLYSEGKISVNKT 418
>gnl|CDD|236153 PRK08116, PRK08116, hypothetical protein; Validated.
Length = 268
Score = 28.1 bits (63), Expect = 7.3
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 60 RAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQR 96
E E + A+ ++EE +QR +E K L +++ R
Sbjct: 44 ECEREAEEAKEREEENREKQRRIERLKSNSLLDEKFR 80
>gnl|CDD|233161 TIGR00870, trp, transient-receptor-potential calcium channel
protein. The Transient Receptor Potential Ca2+ Channel
(TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also
been called the store-operated calcium channel (SOC)
family. The prototypical members include the Drosophila
retinal proteinsTRP and TRPL (Montell and Rubin, 1989;
Hardie and Minke, 1993). SOC members of the family
mediate the entry of extracellular Ca2+ into cells in
responseto depletion of intracellular Ca2+ stores
(Clapham, 1996) and agonist stimulated production of
inositol-1,4,5 trisphosphate (IP3). One member of the
TRP-CCfamily, mammalian Htrp3, has been shown to form a
tight complex with the IP3 receptor (TC #1.A.3.2.1).
This interaction is apparently required for IP3
tostimulate Ca2+ release via Htrp3. The vanilloid
receptor subtype 1 (VR1), which is the receptor for
capsaicin (the ?hot? ingredient in chili peppers) and
servesas a heat-activated ion channel in the pain
pathway (Caterina et al., 1997), is also a member of
this family. The stretch-inhibitable non-selective
cation channel(SIC) is identical to the vanilloid
receptor throughout all of its first 700 residues, but
it exhibits a different sequence in its last 100
residues. VR1 and SICtransport monovalent cations as
well as Ca2+. VR1 is about 10x more permeable to Ca2+
than to monovalent ions. Ca2+ overload probably causes
cell deathafter chronic exposure to capsaicin.
(McCleskey and Gold, 1999) [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 743
Score = 28.5 bits (64), Expect = 7.8
Identities = 22/95 (23%), Positives = 31/95 (32%), Gaps = 22/95 (23%)
Query: 33 VEWRNQRAKE----EEDLKRL----------KEKQAKRKVMRAEEEQKMAQRKKEEEERR 78
EW+ QRAK E + K K + + +K +R
Sbjct: 624 EEWKFQRAKLWMSYEREGGTCPPPFNIIPGPKSFVGLFKRIEKHDGKKR--------QRW 675
Query: 79 QREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
R VEE E K + L E Q +M L +
Sbjct: 676 CRRVEEVNWTTWERKAETLIEDGLHYQRVMKRLIK 710
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 28.4 bits (63), Expect = 7.8
Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 10/114 (8%)
Query: 42 EEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAE 101
E+DL+ LK++ A+ RA+EE + K E + E E +++ K + E
Sbjct: 3 PEKDLEDLKKEAAR----RAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEE 58
Query: 102 KKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIA 155
KR+A AA K+K Q+RE ++ + + K K + K A
Sbjct: 59 AKRRAAAAA------KAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAA 106
>gnl|CDD|233149 TIGR00845, caca, sodium/calcium exchanger 1. The Ca2+:Cation
Antiporter (CaCA) Family (TC 2.A.19)Proteins of the CaCA
family are found ubiquitously, having been identified in
animals, plants, yeast, archaea and widely divergent
bacteria.All of the characterized animal proteins
catalyze Ca2+:Na+ exchange although some also transport
K+. The NCX1 plasma membrane protein exchanges 3 Na+ for
1 Ca2+. The E. coli ChaA protein catalyzes Ca2+:H+
antiport but may also catalyze Na+:H+ antiport. All
remaining well-characterized members of the family
catalyze Ca2+:H+ exchange.This model is specific for the
eukaryotic sodium ion/calcium ion exchangers of the Caca
family [Transport and binding proteins, Other].
Length = 928
Score = 28.2 bits (63), Expect = 7.9
Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 4/60 (6%)
Query: 296 GNRQKTKLPKWFGERPGKKKDAPESPE--EEEVKRDDEEEENPIYLKISDDEEEKEEENF 353
G R +P E G + + E E++ +++E E I +KI DDEE ++ + F
Sbjct: 555 GARGTVIVPYRTVE--GTARGGGKDFEDTCGELEFENDETEKTIRVKIVDDEEYEKNDTF 612
>gnl|CDD|219004 pfam06386, GvpL_GvpF, Gas vesicle synthesis protein GvpL/GvpF.
This family consists of several bacterial and archaeal
gas vesicle synthesis protein (GvpL/GvpF) sequences. The
exact function of this family is unknown.
Length = 246
Score = 28.0 bits (63), Expect = 7.9
Identities = 14/73 (19%), Positives = 30/73 (41%), Gaps = 5/73 (6%)
Query: 33 VEWR-----NQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQ 87
VEW ++ A +E ++ E +A R + + E K + + + + E++Q
Sbjct: 111 VEWGVKVYWDEDAILKEVVEEDPELRALRAEIESAPEGKAYLLRIRLGQLIEEALRERRQ 170
Query: 88 RDLEEKRQRLEEA 100
E+ + L
Sbjct: 171 ELAEDILEALSPL 183
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 27.8 bits (63), Expect = 8.0
Identities = 12/39 (30%), Positives = 20/39 (51%), Gaps = 2/39 (5%)
Query: 71 KKEEEERRQREVEEKKQRDLE-EKRQRLEEAEKK-RQAM 107
KK EE R+ EE ++ LE + EA + ++A+
Sbjct: 85 KKGTEEERRERREENLEKALELLREGNRSEARECFQRAV 123
>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
Length = 673
Score = 28.3 bits (63), Expect = 8.0
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 32 IVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKM 67
IV+W K++E + LK+KQA +++ A E+ K+
Sbjct: 268 IVDWLASNFKKDEGIDLLKDKQALQRLTEAAEKAKI 303
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 28.1 bits (63), Expect = 8.1
Identities = 21/81 (25%), Positives = 40/81 (49%), Gaps = 18/81 (22%)
Query: 51 EKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQ------RLEEAEKKR 104
E+ +RK+ E EQK ++E E+ +++ EK DLEE + ++EE E+K+
Sbjct: 86 EQGLQRKLKELEREQK----EEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEEKQ 141
Query: 105 --------QAMMAALKEQTNK 117
++ + +KE+
Sbjct: 142 MEILEKALKSYLKIVKEENKS 162
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 28.5 bits (64), Expect = 8.1
Identities = 26/99 (26%), Positives = 38/99 (38%), Gaps = 2/99 (2%)
Query: 16 RQEAKSSALDEQLKEYIVE-WRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEE 74
R A AL L E I+E + + +E+E L+ L EK + EEE + ++ +E
Sbjct: 475 RSNAFDIALRLGLPEPIIEEAKTEFGEEKELLEELIEK-LEEVRKELEEELEEVEKLLDE 533
Query: 75 EERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 113
E K LE K Q + A K
Sbjct: 534 VELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKV 572
>gnl|CDD|226447 COG3937, COG3937, Uncharacterized conserved protein [Function
unknown].
Length = 108
Score = 26.7 bits (59), Expect = 8.1
Identities = 24/91 (26%), Positives = 45/91 (49%), Gaps = 3/91 (3%)
Query: 24 LDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVE 83
+ E L++ + A+ E +++L ++ K+ + AEE ++ + + Q E+E
Sbjct: 4 MGEGLRKLALIGAGLAAETAEKVQKLVDELVKKGELNAEEAKRFVDDLLRQAKEAQGELE 63
Query: 84 EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
EK R +EE L + E RQ+ M L E+
Sbjct: 64 EKIPRKIEEM---LSDLEVARQSEMDELTER 91
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 28.0 bits (63), Expect = 8.5
Identities = 25/96 (26%), Positives = 49/96 (51%), Gaps = 11/96 (11%)
Query: 17 QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEE 76
+EA+++A +++ ++ +E RN + L EK K E K+ +KE+ E
Sbjct: 508 KEAEANAEEDKKRKEEIEARNN----ADSLAYQAEKTLK------EAGDKLPAEEKEKIE 557
Query: 77 RRQREVEEK-KQRDLEEKRQRLEEAEKKRQAMMAAL 111
+ E++E K D+EE + + EE ++ Q + A+
Sbjct: 558 KAVAELKEALKGEDVEEIKAKTEELQQALQKLAEAM 593
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 27.7 bits (62), Expect = 8.5
Identities = 27/89 (30%), Positives = 49/89 (55%), Gaps = 11/89 (12%)
Query: 21 SSALDEQLK---EYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEER 77
S AL+E+L+ E ++E R E++ +R + RK +R + Q ++EEEE+
Sbjct: 75 SLALEERLRNLLEKLIEVSEHRVDSEKEDERYEITSDVRKQLRF-----LEQLEREEEEK 129
Query: 78 RQREVEEKKQRDLEEKRQRLEEAEKKRQA 106
R EE+++R L + R E++ K++A
Sbjct: 130 R---DEEERERLLRAAKSRSEQSRLKQKA 155
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 28.0 bits (61), Expect = 8.6
Identities = 24/86 (27%), Positives = 43/86 (50%), Gaps = 4/86 (4%)
Query: 35 WRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKR 94
+RN+R E K EK+ +R++ EQ A+R K+E+ R + +E ++R+ + +R
Sbjct: 132 FRNKRNLMEHIHKVKNEKKKERQL----AEQLAAKRLKDEQHRHKARKQELRKREKDRER 187
Query: 95 QRLEEAEKKRQAMMAALKEQTNKSKG 120
R E+A A A ++ G
Sbjct: 188 ARREDAAAAAAAKQKAAAKKAAAPSG 213
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 26.8 bits (60), Expect = 8.6
Identities = 28/105 (26%), Positives = 46/105 (43%), Gaps = 13/105 (12%)
Query: 10 DPEFVKRQEAK---SSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQK 66
+PE VK + K ++L E E R E L+E QA+R + E +
Sbjct: 10 NPEAVKEKLRKRGVDVLDVDELLELDEERR-------ELQVELEELQAERNELSKEIGKA 62
Query: 67 MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 111
+KKE+ E EV+E K +L+ L E E + ++ ++
Sbjct: 63 K--KKKEDAEALIAEVKELKD-ELKALEAELRELEAELDKLLLSI 104
>gnl|CDD|130083 TIGR01010, BexC_CtrB_KpsE, polysaccharide export inner-membrane
protein, BexC/CtrB/KpsE family. This family contains
gamma proteobacterial proteins involved in capsule
polysaccharide export [Transport and binding proteins,
Carbohydrates, organic alcohols, and acids].
Length = 362
Score = 28.1 bits (63), Expect = 8.7
Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 1/60 (1%)
Query: 61 AEEEQKMAQRKKEEEERRQREVEEKKQRD-LEEKRQRLEEAEKKRQAMMAALKEQTNKSK 119
AEE QK+ QR +E ER + E+ ++D + ++EAE++ A A L + K+K
Sbjct: 142 AEEAQKINQRLLKEGERLINRLNERARKDTIAFAENEVKEAEQRLNATKAELLKYQIKNK 201
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 26.8 bits (60), Expect = 9.0
Identities = 15/52 (28%), Positives = 34/52 (65%), Gaps = 5/52 (9%)
Query: 59 MRAEEEQKMAQRKKEEEERRQREVEEKKQR----DLEEKRQRLEEAEKKRQA 106
+RAE + K+ + K++E R++E++E + + +E+K+++L EA+ + Q
Sbjct: 64 LRAERQAKVEE-KEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114
>gnl|CDD|181442 PRK08476, PRK08476, F0F1 ATP synthase subunit B'; Validated.
Length = 141
Score = 27.0 bits (60), Expect = 9.1
Identities = 15/72 (20%), Positives = 33/72 (45%)
Query: 43 EEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEK 102
+ DL+++K + + E E + ++E + RQ+ + + K+ ++ + E E
Sbjct: 44 KNDLEKVKTNSSDVSEIEHEIETILKNAREEANKIRQKAIAKAKEEAEKKIEAKKAELES 103
Query: 103 KRQAMMAALKEQ 114
K +A L Q
Sbjct: 104 KYEAFAKQLANQ 115
>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
C4-dicarboxylate transport system [Signal transduction
mechanisms].
Length = 603
Score = 28.1 bits (63), Expect = 9.2
Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 2/57 (3%)
Query: 35 WRNQRAKEEEDLKRLKEKQAKRKVM--RAEEEQKMAQRKKEEEERRQREVEEKKQRD 89
R +RA+ + + +R+V A+ + A+ + E ER Q E ++ +D
Sbjct: 317 RRRRRARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRRAQD 373
>gnl|CDD|234937 PRK01294, PRK01294, lipase chaperone; Provisional.
Length = 336
Score = 27.7 bits (62), Expect = 9.3
Identities = 24/112 (21%), Positives = 51/112 (45%), Gaps = 17/112 (15%)
Query: 17 QEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKE--- 73
+ A+ +AL+ QL E + + +++ L++L + QA A ++ R +
Sbjct: 208 KAARLAALEAQLPEDLRAALQESQRQQALLQQLAQLQAS----GASPQELRLMRAQLVGP 263
Query: 74 ------EEERRQREVEEKKQRDLEEKRQRLEEA----EKKRQAMMAALKEQT 115
E+ +QR +++ D +R ++ A + RQA +A L++Q
Sbjct: 264 EAAQRLEQLDQQRAAWQQRYDDYLAQRAQILNAAGLSPQDRQAQIAQLRQQR 315
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 28.1 bits (63), Expect = 9.4
Identities = 41/187 (21%), Positives = 83/187 (44%), Gaps = 8/187 (4%)
Query: 15 KRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEE 74
+++ +L+E I E + + +++++ L+EK + K ++ + E+ + + E
Sbjct: 244 EKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE 303
Query: 75 EERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTL 134
E + EK+ LEE+ +EE K+ + L+E K K +++R L
Sbjct: 304 EYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKE----LEKRLEELEE 359
Query: 135 STAQLERNKTK-EQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYD 193
E K K E++E KK + + L E + +L+ E+ E I K+ +
Sbjct: 360 RHELYEEAKAKKEELERLKKRLTGLTPEKLEKE---LEELEKAKEEIEEEISKITARIGE 416
Query: 194 LEERQKR 200
L++ K
Sbjct: 417 LKKEIKE 423
>gnl|CDD|202204 pfam02321, OEP, Outer membrane efflux protein. The OEP family
(Outer membrane efflux protein) form trimeric channels
that allow export of a variety of substrates in Gram
negative bacteria. Each member of this family is
composed of two repeats. The trimeric channel is
composed of a 12 stranded all beta sheet barrel that
spans the outer membrane, and a long all helical barrel
that spans the periplasm.
Length = 186
Score = 27.5 bits (61), Expect = 9.6
Identities = 26/105 (24%), Positives = 50/105 (47%), Gaps = 3/105 (2%)
Query: 14 VKRQEAKSSALDEQLKEYIVEWRNQRAKEEEDLKRLKE--KQAKRKVMRAEEEQKMAQRK 71
VK +A+ A + +L++ + R + A+ +L KE + AK+ + AEE ++A+ +
Sbjct: 83 VKAAKAQLEAAEAELEQARRQLRLEVAQAYFNLLAAKEQLELAKQALELAEEALELAEAR 142
Query: 72 KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 116
E +V + + +L E R L AE + A L++
Sbjct: 143 YEAGLISLLDVLQA-EVELLEARLELLNAEADLELARAQLEQLLG 186
>gnl|CDD|233357 TIGR01322, scrB_fam, sucrose-6-phosphate hydrolase. [Energy
metabolism, Biosynthesis and degradation of
polysaccharides].
Length = 445
Score = 28.1 bits (63), Expect = 9.6
Identities = 30/117 (25%), Positives = 42/117 (35%), Gaps = 23/117 (19%)
Query: 57 KVMRAEEEQKM--AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 114
K +R EE + Q + E+ DLE K A L
Sbjct: 328 KALRTEEHINVFGDQEHTLPGLNGEFELIL----DLE-----------KDSAFELGLAL- 371
Query: 115 TNKSKGPNFTIQRREPGLTL---STAQLERNKT--KEQIEEEKKIALNIRIKPLNVE 166
TNK + TI E +TL S+ LE + KK++L+I I +VE
Sbjct: 372 TNKGEETLLTIDADEGKVTLDRRSSGNLEDYGGTRSCPLPNTKKVSLHIFIDKSSVE 428
>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
Length = 174
Score = 27.2 bits (61), Expect = 9.7
Identities = 21/76 (27%), Positives = 35/76 (46%), Gaps = 10/76 (13%)
Query: 39 RAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERRQREVEEKKQRDLEEKRQRL- 97
R + LK +E+ + AE +QK+AQ ++E E R D + + + +
Sbjct: 52 REAILQALKEAEERLRQAAQALAEAQQKLAQAQQEAERIRA---------DAKARAEAIR 102
Query: 98 EEAEKKRQAMMAALKE 113
E EK+ MA LK+
Sbjct: 103 AEIEKQAIEDMARLKQ 118
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 27.7 bits (62), Expect = 9.7
Identities = 20/76 (26%), Positives = 38/76 (50%), Gaps = 4/76 (5%)
Query: 34 EWRNQRAKEEEDLKRLKEKQAKRKVMRAEEEQKMAQRKKEEEERR----QREVEEKKQRD 89
E N+ + + K + K+ +RK + E +++ +KK + R +EV +K++
Sbjct: 267 EPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIAKEVAQKEKAR 326
Query: 90 LEEKRQRLEEAEKKRQ 105
+K QR E EKK+
Sbjct: 327 ARKKEQRKERGEKKKL 342
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 27.6 bits (62), Expect = 10.0
Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 11/62 (17%)
Query: 71 KKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREP 130
++E+E+++ EEKK K ++ + E ++ KE+ NF I+ P
Sbjct: 95 EEEKEKKKAMSKEEKK----AIKEEKEKLEEPYGYCLVDGHKEKVG-----NFRIEP--P 143
Query: 131 GL 132
GL
Sbjct: 144 GL 145
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.307 0.127 0.344
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,414,440
Number of extensions: 1971623
Number of successful extensions: 11900
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7284
Number of HSP's successfully gapped: 1639
Length of query: 372
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 274
Effective length of database: 6,590,910
Effective search space: 1805909340
Effective search space used: 1805909340
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 60 (26.7 bits)