RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3984
(183 letters)
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 93.7 bits (233), Expect = 6e-24
Identities = 85/185 (45%), Positives = 112/185 (60%), Gaps = 4/185 (2%)
Query: 1 MMERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKK 60
RLEE ER +LE E+ EV +++ E K+ ET++LQ+E+ A+ A ++
Sbjct: 62 ENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHER 121
Query: 61 QDEMNAALLMATSTPQH-HHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEE-RRTLA 118
+ A + P H E E DDN S DL+TD ++ D EE R T A
Sbjct: 122 ARQELLEAAAAPTAPPHVAAPVNGEQLEPDDN--GEEASADLETDPDMKDRSEEERVTYA 179
Query: 119 ERNERLQDQLKMLKEDLAGTRDETKETAMDKIHRENVKQGRDKYKTLREIRKGNTKRRVD 178
E+NERLQ QL+ LK +LA RDE+KETA D +H ENV+ GRDKYKTLR+IR GNTK+RVD
Sbjct: 180 EKNERLQTQLQALKSELAAARDESKETANDILHEENVRAGRDKYKTLRQIRSGNTKQRVD 239
Query: 179 QFENM 183
QFE+M
Sbjct: 240 QFESM 244
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 39.2 bits (92), Expect = 6e-04
Identities = 15/64 (23%), Positives = 35/64 (54%), Gaps = 4/64 (6%)
Query: 11 MEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKR----LQEEVENARNARKKQDEMNA 66
AAE A+LE+E++ ++E+ RI+ +++++ K +++E E + K ++
Sbjct: 811 DLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKLEE 870
Query: 67 ALLM 70
L +
Sbjct: 871 RLAV 874
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.0 bits (81), Expect = 0.017
Identities = 33/162 (20%), Positives = 65/162 (40%), Gaps = 25/162 (15%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARN--ARK 59
+E L+E E +L E+Q +E+++ ++ EV+ +EE + LQ+E+ N +R
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300
Query: 60 KQDEMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAE 119
+Q + E + E L + + LAE
Sbjct: 301 EQQKQ-------------ILRERLANLERQLEELEAQLEELE----------SKLDELAE 337
Query: 120 RNERLQDQLKMLKEDLAGTRDETKETAMDKIHRENVKQGRDK 161
L+++L+ LKE+L E +E + E+ + ++
Sbjct: 338 ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEE 379
Score = 33.5 bits (77), Expect = 0.051
Identities = 36/176 (20%), Positives = 68/176 (38%), Gaps = 9/176 (5%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQD 62
E+LEE + E +LE EI+ +E ++ + E+ + E + L+ ++E + K
Sbjct: 743 EQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR 802
Query: 63 EMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERNE 122
E L + NE N L R T+ + D E+ L+E E
Sbjct: 803 EALDEL-------RAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIE 855
Query: 123 RLQDQLKMLKEDLAGTRDETKE--TAMDKIHRENVKQGRDKYKTLREIRKGNTKRR 176
L +++ L+E + E + + + + E+R+ +KR
Sbjct: 856 SLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRS 911
Score = 28.9 bits (65), Expect = 1.7
Identities = 25/139 (17%), Positives = 58/139 (41%), Gaps = 7/139 (5%)
Query: 14 AERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENAR----NARKKQDEMNAALL 69
E +LEE+I+ +E++ ++ + +E + L+EE+E R ++ + L
Sbjct: 677 REIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLA 736
Query: 70 MATSTPQHHHVE---ENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERNERLQD 126
+ + ++ + + R + ++ + + E L + E+L++
Sbjct: 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKE 796
Query: 127 QLKMLKEDLAGTRDETKET 145
+LK L+E L R E
Sbjct: 797 ELKALREALDELRAELTLL 815
Score = 28.1 bits (63), Expect = 2.9
Identities = 30/143 (20%), Positives = 60/143 (41%), Gaps = 9/143 (6%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
+E LE AE +LE ++ +E+++ ++S+V + + L E+E AR ++
Sbjct: 353 LESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLE-ARLER 411
Query: 62 DEMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERN 121
E L +EE E E + + ++ + + EE L E
Sbjct: 412 LEDRRERLQQEIEELLKKLEEAELKE--------LQAELEELEEELEELQEELERLEEAL 463
Query: 122 ERLQDQLKMLKEDLAGTRDETKE 144
E L+++L+ ++ L E +
Sbjct: 464 EELREELEEAEQALDAAERELAQ 486
Score = 27.7 bits (62), Expect = 4.4
Identities = 33/169 (19%), Positives = 69/169 (40%), Gaps = 12/169 (7%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVEN-ARNARKK 60
+E LEE E A+ E EI+ + ++++++ E+ + E L+ E+ A
Sbjct: 763 IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANL 822
Query: 61 QDEMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAER 120
++ + + +T + E + +E +++ + I + E L
Sbjct: 823 RERLESLERRIAATERRLEDLEEQIEELSEDIES--------LAAEIEELEELIEELESE 874
Query: 121 NERLQDQLKMLKEDLAGTRDETKETAMDKIH--RENVKQGRDKYKTLRE 167
E L ++ L+E LA R E E +++ + R + + LRE
Sbjct: 875 LEALLNERASLEEALALLRSE-LEELSEELRELESKRSELRRELEELRE 922
Score = 27.3 bits (61), Expect = 6.2
Identities = 15/63 (23%), Positives = 31/63 (49%)
Query: 1 MMERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKK 60
+E LEE + + L EI+ +E ++ ++SE+ + E L+E + R+ ++
Sbjct: 839 RLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEE 898
Query: 61 QDE 63
E
Sbjct: 899 LSE 901
>gnl|CDD|224143 COG1222, RPT1, ATP-dependent 26S proteasome regulatory subunit
[Posttranslational modification, protein turnover,
chaperones].
Length = 406
Score = 33.0 bits (76), Expect = 0.077
Identities = 16/54 (29%), Positives = 26/54 (48%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENAR 55
E E +E + LE+E + E QR+++E E RL+EE+E +
Sbjct: 16 YEPQEYLNKLEDTKLKLLEKEKRLLLLEEQRLEAEGLRLKREVDRLREEIERLK 69
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 33.1 bits (76), Expect = 0.082
Identities = 33/144 (22%), Positives = 61/144 (42%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQD 62
E+L+ E +LEEE++ QE ++ ++ E+ S +E +L+EE+E R+
Sbjct: 733 EQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQ 792
Query: 63 EMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERNE 122
E L + E+ + + ++ I + E+ L E E
Sbjct: 793 EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELE 852
Query: 123 RLQDQLKMLKEDLAGTRDETKETA 146
L+ +L+ LKE+L E +E
Sbjct: 853 ELEKELEELKEELEELEAEKEELE 876
Score = 31.6 bits (72), Expect = 0.26
Identities = 35/143 (24%), Positives = 66/143 (46%), Gaps = 11/143 (7%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
+E LEE + E +L+EE++ ++E++ ++SE+ EE + LQEE+ + ++
Sbjct: 241 LEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEEL 300
Query: 62 DEMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERN 121
+ + E E EN+ L + + + + + +EER TL E
Sbjct: 301 EGEIS-----------LLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEEL 349
Query: 122 ERLQDQLKMLKEDLAGTRDETKE 144
E+L +L+ KE+L E
Sbjct: 350 EQLLAELEEAKEELEEKLSALLE 372
Score = 29.7 bits (67), Expect = 1.0
Identities = 33/143 (23%), Positives = 60/143 (41%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
+E LE E A LEEE++ Q ++ ++ E+ +EE + LQE +E +
Sbjct: 711 LEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESL 770
Query: 62 DEMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERN 121
+E A L + E E + L R + + + R L +
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830
Query: 122 ERLQDQLKMLKEDLAGTRDETKE 144
E L+++++ L+E L +E +E
Sbjct: 831 EELEEEIEELEEKLDELEEELEE 853
Score = 28.9 bits (65), Expect = 1.9
Identities = 17/62 (27%), Positives = 36/62 (58%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
+E LEE + L+EE++ + E++ +Q+E+ +EE + L+E++E R+ K+
Sbjct: 406 IESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKEL 465
Query: 62 DE 63
+
Sbjct: 466 ER 467
Score = 27.8 bits (62), Expect = 4.1
Identities = 30/148 (20%), Positives = 59/148 (39%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
E EE + +A +LEE +A +EE+ +++E+ E + L+ E+E+ ++
Sbjct: 357 EEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERL 416
Query: 62 DEMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERN 121
E L + E E + L + + D + + E L E
Sbjct: 417 SERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEEL 476
Query: 122 ERLQDQLKMLKEDLAGTRDETKETAMDK 149
+RL+ +L L+ L E + + +
Sbjct: 477 QRLEKELSSLEARLDRLEAEQRASQGVR 504
Score = 27.4 bits (61), Expect = 5.9
Identities = 7/53 (13%), Positives = 29/53 (54%)
Query: 15 ERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEMNAA 67
+ + E +++ +E ++R++ + +++ ++L+ + E A ++ + E+
Sbjct: 173 RKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELREL 225
Score = 27.0 bits (60), Expect = 8.5
Identities = 19/66 (28%), Positives = 33/66 (50%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQD 62
E LEE +LE E+ QEE+QR++ E++S + RL+ E ++ R +
Sbjct: 449 EELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLE 508
Query: 63 EMNAAL 68
+ + L
Sbjct: 509 ALESGL 514
>gnl|CDD|179508 PRK02944, PRK02944, OxaA-like protein precursor; Validated.
Length = 255
Score = 31.7 bits (72), Expect = 0.16
Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 13/51 (25%)
Query: 1 MMERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEET-KRLQEE 50
M+++ + +K M QA Q E+Q+++ + +SKD+ T ++LQ+E
Sbjct: 78 MIKQTKSTKAM------------QALQPEMQKLKEKYSSKDQATQQKLQQE 116
>gnl|CDD|236136 PRK07946, PRK07946, putative monovalent cation/H+ antiporter
subunit C; Reviewed.
Length = 163
Score = 30.8 bits (70), Expect = 0.24
Identities = 11/44 (25%), Positives = 15/44 (34%)
Query: 101 LDTDDNIVDPVEERRTLAERNERLQDQLKMLKEDLAGTRDETKE 144
L T D + D E+ R R + D T +T E
Sbjct: 101 LTTADEVEDDPEDTRVALRRRGLASAAPDHDRSDDPETGRQTAE 144
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 31.0 bits (71), Expect = 0.34
Identities = 14/59 (23%), Positives = 27/59 (45%), Gaps = 8/59 (13%)
Query: 14 AERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRL---QEEVENARNARKKQDEMNAALL 69
+R L IQ + ++Q +Q +NS KRL ++ V+ A++ + L+
Sbjct: 227 KQRDYLTARIQRLEHQLQLLQEAINS-----KRLTLSEKTVQEAQSQDEAARIQANPLV 280
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 30.0 bits (68), Expect = 0.45
Identities = 10/46 (21%), Positives = 20/46 (43%), Gaps = 11/46 (23%)
Query: 2 MERLEESKNM--------EAAERAKLE---EEIQAKQEEVQRIQSE 36
E+++E + M A+ + +++Q KQ E+ Q E
Sbjct: 71 QEKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRE 116
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 30.0 bits (68), Expect = 0.48
Identities = 14/71 (19%), Positives = 34/71 (47%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
+ L+E + A+L++E Q ++E+ +++E+ +E R+++ NA ++
Sbjct: 75 LAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEEN 134
Query: 62 DEMNAALLMAT 72
E+ L
Sbjct: 135 RELREELAELK 145
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.5 bits (68), Expect = 0.56
Identities = 20/62 (32%), Positives = 34/62 (54%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
+++ EE ++AAE AK EE + K EE ++ + + E K+ EE + A +KK+
Sbjct: 1652 LKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKE 1711
Query: 62 DE 63
E
Sbjct: 1712 AE 1713
Score = 27.4 bits (60), Expect = 5.4
Identities = 35/177 (19%), Positives = 67/177 (37%), Gaps = 7/177 (3%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQD 62
++ + + A+ AK + E + K +E ++ E K +E K+ + A A+KK +
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428
Query: 63 EMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERNE 122
E A + E + DE + + +E + AE
Sbjct: 1429 EKKKA-----DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE-A 1482
Query: 123 RLQDQLKMLKEDLAGTRDETKETAMDKIHRENVKQGRDKYKTLREIRKGNTKRRVDQ 179
+ D+ K E+ DE K+ A K + K+ + K E +K ++ D+
Sbjct: 1483 KKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKA-DEAKKAEEAKKADE 1538
Score = 27.4 bits (60), Expect = 6.2
Identities = 39/175 (22%), Positives = 76/175 (43%), Gaps = 7/175 (4%)
Query: 4 RLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDE 63
+L E + AE AK EE + K EE+++ + E ++ K+ EE + A +K ++E
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658
Query: 64 --MNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERN 121
+ AA + EE + E D+ + ++ + EE +
Sbjct: 1659 NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK----AEELKKKEAEE 1714
Query: 122 ERLQDQLKMLKEDLAGTRDETK-ETAMDKIHRENVKQGRDKYKTLREIRKGNTKR 175
++ ++LK +E+ +E K E DK E K+ ++ K + ++K K+
Sbjct: 1715 KKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKK 1769
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 29.9 bits (68), Expect = 0.69
Identities = 8/32 (25%), Positives = 14/32 (43%)
Query: 105 DNIVDPVEERRTLAERNERLQDQLKMLKEDLA 136
+ + + L E NE L+ +L L+ L
Sbjct: 62 SGVFESLASLFDLREENEELKKELLELESRLQ 93
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 30.0 bits (68), Expect = 0.70
Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 1/69 (1%)
Query: 5 LEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEM 64
L E AE LEE I+ ++ ++ +E L EE+E+ ++ E
Sbjct: 83 LPEEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERLSELDEELEDLE-DLLEELEP 141
Query: 65 NAALLMATS 73
A L S
Sbjct: 142 LAYLDFDLS 150
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 29.8 bits (68), Expect = 0.75
Identities = 14/64 (21%), Positives = 32/64 (50%), Gaps = 4/64 (6%)
Query: 1 MMERLEESKN---MEAAERAKLEEEIQAKQEEVQRIQSEVNS-KDEETKRLQEEVENARN 56
++ LEE + +A E L +E + +EE++ + ++ +D+ + ++E + A
Sbjct: 521 LIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIK 580
Query: 57 ARKK 60
KK
Sbjct: 581 EAKK 584
Score = 26.7 bits (60), Expect = 8.8
Identities = 11/74 (14%), Positives = 28/74 (37%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQD 62
E+ E+ + E + E+E Q +E ++ E+ + + ++ A + +
Sbjct: 555 EKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARK 614
Query: 63 EMNAALLMATSTPQ 76
+N A +
Sbjct: 615 RLNKANEKKEKKKK 628
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 29.1 bits (66), Expect = 0.94
Identities = 7/48 (14%), Positives = 23/48 (47%)
Query: 21 EEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEMNAAL 68
+ + + +++++ E+ + LQ ++E + R++ +E L
Sbjct: 62 QALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELL 109
Score = 26.8 bits (60), Expect = 6.4
Identities = 11/54 (20%), Positives = 32/54 (59%), Gaps = 1/54 (1%)
Query: 6 EESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEET-KRLQEEVENARNAR 58
+ E ER +L EE++ ++E++++++E+ ++ +R+++ E + A+
Sbjct: 95 LKKGREETEERTELLEELKQLEKELKKLKAELEKYEKNDPERIEKLKEETKVAK 148
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 29.3 bits (66), Expect = 1.1
Identities = 12/57 (21%), Positives = 24/57 (42%)
Query: 12 EAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEMNAAL 68
E + E E++A QE++ ++S + E K L +++ + N L
Sbjct: 261 EEDRTKEREAELEALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEEL 317
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 29.7 bits (67), Expect = 1.1
Identities = 12/71 (16%), Positives = 30/71 (42%), Gaps = 4/71 (5%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENA----RNA 57
L+E E L + + +EE++ +++ + + L++E + R
Sbjct: 842 RIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLREL 901
Query: 58 RKKQDEMNAAL 68
+K +E+ A +
Sbjct: 902 ERKIEELEAQI 912
Score = 29.7 bits (67), Expect = 1.2
Identities = 28/140 (20%), Positives = 60/140 (42%)
Query: 2 MERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
+E LE E R KL EE +EE++ +++E+ D+E ++E+++ R +K
Sbjct: 338 IEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKL 397
Query: 62 DEMNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERN 121
L Q +E + + + G ++ + ++ D E + +
Sbjct: 398 KREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKL 457
Query: 122 ERLQDQLKMLKEDLAGTRDE 141
E+L L +++L ++E
Sbjct: 458 EQLAADLSKYEQELYDLKEE 477
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 29.2 bits (66), Expect = 1.2
Identities = 13/41 (31%), Positives = 26/41 (63%), Gaps = 4/41 (9%)
Query: 21 EEIQAKQEEV-QRIQSEVNSKDEETKRLQEEVENARNARKK 60
E +Q +++ + Q+IQ V S ET+ L +E+E ++ R++
Sbjct: 90 ERLQKREQSIDQQIQQAVQS---ETQELTKEIEQLKSERQQ 127
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.1 bits (66), Expect = 1.2
Identities = 12/44 (27%), Positives = 25/44 (56%)
Query: 9 KNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVE 52
+ + LEE I+ +EE+++I+ E+ +EE L+ E++
Sbjct: 74 EEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIK 117
Score = 28.4 bits (64), Expect = 2.3
Identities = 30/178 (16%), Positives = 60/178 (33%), Gaps = 48/178 (26%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQ-------------- 48
+ LEE E K+E+EI+ +EE+ +++E+ ++E +RL+
Sbjct: 82 KSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLG 141
Query: 49 ----------------------EEVENARNARKKQDEMNAALLMATSTPQHHHVEENEHD 86
+VEN + + + V E
Sbjct: 142 FKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVV-----------VVLKELS 190
Query: 87 ENDDNMLNGHVSRDLDTDDNIVDPVEERRTLAERNERLQDQLKMLKEDLAGTRDETKE 144
+ + L L+ ++ P E R + E E ++ + + L E+L + E
Sbjct: 191 DEVEEELKKLGFERLELEE-EGTPSELIREIKEELEEIEKERESLLEELKELAKKYLE 247
Score = 26.8 bits (60), Expect = 7.2
Identities = 7/63 (11%), Positives = 22/63 (34%), Gaps = 10/63 (15%)
Query: 15 ERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQ----EEVENARNARKKQDEMNAALLM 70
+++EE++ ++E + + E+ ++ E +E + A
Sbjct: 216 LIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIEL------ERAEALSKF 269
Query: 71 ATS 73
+
Sbjct: 270 LKT 272
>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
Length = 874
Score = 28.9 bits (66), Expect = 1.5
Identities = 13/67 (19%), Positives = 34/67 (50%), Gaps = 18/67 (26%)
Query: 13 AAERAKLEEEIQAKQEEVQRIQSEVNSK-----------DEETKRLQEEVENARNARKKQ 61
AE A+LE+E+ ++E++R++ +++++ ++E ++L E +K
Sbjct: 810 EAELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAE-------YEEKL 862
Query: 62 DEMNAAL 68
++ L
Sbjct: 863 AKLKERL 869
>gnl|CDD|212127 cd10816, GH57N_BE_TK1436_like, N-terminal catalytic domain of Gh57
branching enzyme TK 1436 and similar proteins. The
subfamily is represented by a novel branching-enzyme
TK1436 of hyperthermophilic archaeon Thermococcus
kodakaraensis KOD1. Branching enzymes (BEs, EC 2.4.1.18)
play a key role in synthesis of alpha-glucans and they
generally are classified into glycoside hydrolase family
13 (GH13). However, TK1436 belongs to the GH57 family.
It functions as a monomer and possesses BE activity.
TK1436 is composed of a distorted N-terminal
(beta/alpha)7-barrel domain and a C-terminal five
alpha-helical domain, both of which participate in the
formation of the active-site cleft.
Length = 423
Score = 29.0 bits (65), Expect = 1.5
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 1/48 (2%)
Query: 1 MMERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQ 48
+ME+L + + E LE +I+A QE++ R S EE R
Sbjct: 62 LMEQLADPYILAEFE-EYLERKIEAAQEDLARYTQSGESHYEEAARYW 108
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 28.9 bits (66), Expect = 1.6
Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 2/52 (3%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENA 54
ER SK E + + E+ +A EV+ ++ E+ + + E L+ E+E
Sbjct: 50 ERNALSK--EIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELEEL 99
>gnl|CDD|185052 PRK15097, PRK15097, cytochrome d terminal oxidase subunit 1;
Provisional.
Length = 522
Score = 29.0 bits (65), Expect = 1.6
Identities = 13/34 (38%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 151 HRENVKQGRDKYKTLREIRKGNTKRRV-DQFENM 183
H E ++ G Y L ++R G+T + V DQF +M
Sbjct: 314 HEERIRNGMKAYSLLEQLRSGSTDQAVRDQFNSM 347
>gnl|CDD|221084 pfam11336, DUF3138, Protein of unknown function (DUF3138). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 514
Score = 28.3 bits (63), Expect = 2.3
Identities = 7/64 (10%), Positives = 26/64 (40%)
Query: 25 AKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEMNAALLMATSTPQHHHVEENE 84
+ +++ +Q+++ + ++ L+ + A ++ +A A + P
Sbjct: 22 SDASQIKALQAQLTALQQQVNELRAALAAKPAAAGGGAKIQSAAAAAAAAPSSDAAAALT 81
Query: 85 HDEN 88
+D+
Sbjct: 82 NDDV 85
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 28.4 bits (64), Expect = 2.4
Identities = 15/52 (28%), Positives = 31/52 (59%)
Query: 9 KNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKK 60
K + A+L+E+I A Q+E + ++ +NS ++E K+L+ E++ K+
Sbjct: 396 KKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKE 447
Score = 26.9 bits (60), Expect = 7.1
Identities = 11/57 (19%), Positives = 36/57 (63%), Gaps = 1/57 (1%)
Query: 6 EESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQD 62
EE+ +EA + +L++E++ +E+++++++E+ K+EE ++ + + + + +
Sbjct: 88 EENIEIEA-QIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKK 143
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 27.5 bits (62), Expect = 2.8
Identities = 16/60 (26%), Positives = 33/60 (55%), Gaps = 6/60 (10%)
Query: 1 MMERLE-ESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSK-----DEETKRLQEEVENA 54
+ E+L+ ++ + A R K E+E+Q K +E QR Q ++ EE +++ +++ A
Sbjct: 44 LKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQKILDKINKA 103
Score = 27.5 bits (62), Expect = 2.9
Identities = 17/68 (25%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 1 MMERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKK 60
++LE+ AE KLE+E+Q +E++Q+ + S+ K+ +E + + ++K
Sbjct: 19 AQKQLEKEFKKRQAELEKLEKELQKLKEKLQK-DAATLSEAAREKKEKELQKKVQEFQRK 77
Query: 61 QDEMNAAL 68
Q ++ L
Sbjct: 78 QQKLQQDL 85
>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 247
Score = 27.8 bits (62), Expect = 3.0
Identities = 16/78 (20%), Positives = 37/78 (47%), Gaps = 5/78 (6%)
Query: 1 MMERLEESKNMEAAERA---KLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNA 57
M+ + ++ E+ RA L +E+++ Q++V + +EV + + ++ V A
Sbjct: 34 MLAAVFQTSKGESVRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVRRSVLTDDAA 93
Query: 58 RKKQDEMNAALLMATSTP 75
+D + ++A S P
Sbjct: 94 --LEDRLEKLRMLAGSVP 109
>gnl|CDD|191114 pfam04859, DUF641, Plant protein of unknown function (DUF641).
Plant protein of unknown function.
Length = 132
Score = 27.3 bits (61), Expect = 3.3
Identities = 17/55 (30%), Positives = 29/55 (52%), Gaps = 7/55 (12%)
Query: 13 AAERAKLEEEIQAKQEE-------VQRIQSEVNSKDEETKRLQEEVENARNARKK 60
A + A L EIQ ++ V+++++EV +KD E L+E++E A K
Sbjct: 73 APQDAILAAEIQEQRSLLKTYEIMVKKLEAEVRAKDSEIDSLREKLEELLVANSK 127
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 28.0 bits (63), Expect = 3.6
Identities = 17/74 (22%), Positives = 31/74 (41%), Gaps = 9/74 (12%)
Query: 7 ESKNMEAAERAKLE-----EEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
E+ N+ AER KL E +A++EE ++ ++ ++ K + E +K
Sbjct: 402 ENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGE----TKKVD 457
Query: 62 DEMNAALLMATSTP 75
+ L T P
Sbjct: 458 PDPLGEKLARTEDP 471
>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein. This is a family
of proteins which regulate checkpoint kinases. In
Schizosaccharomyces pombe this protein is called Rad26
and in Saccharomyces cerevisiae it is called LCD1.
Length = 648
Score = 27.9 bits (62), Expect = 3.8
Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 4/48 (8%)
Query: 16 RAKLEEEIQAKQEEVQRIQSEVN----SKDEETKRLQEEVENARNARK 59
R KL+ Q KQEE + +S VN D+E ++L++E+++ + RK
Sbjct: 3 RDKLDMLQQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERK 50
Score = 27.1 bits (60), Expect = 7.6
Identities = 18/82 (21%), Positives = 37/82 (45%), Gaps = 1/82 (1%)
Query: 5 LEESKNMEAAERAKLEEEIQAKQ-EEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDE 63
L++ K E ++ E++ K +E+Q+++ E+ S ++E K L E + +
Sbjct: 9 LQQQKQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLVLEQRGLSANDLRTEL 68
Query: 64 MNAALLMATSTPQHHHVEENEH 85
+ L+ TS H E +
Sbjct: 69 SPPSNLLKTSDASHIADESQPN 90
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 27.7 bits (62), Expect = 3.8
Identities = 11/57 (19%), Positives = 30/57 (52%)
Query: 5 LEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQ 61
E K+ A+ + +I+ + E+ +Q+++ + ++ + + EE+E R ++K
Sbjct: 135 FESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKG 191
>gnl|CDD|205988 pfam13815, Dzip-like_N, Iguana/Dzip1-like DAZ-interacting protein
N-terminal. The DAZ gene-product - Deleted in
Azoospermia - and a closely related sequence are
required early in germ-cell development in order to
maintain germ-cell populations. This family is the
N-terminal region that is the only part of the protein
in some fungi and lower metazoa.
Length = 118
Score = 26.5 bits (59), Expect = 4.4
Identities = 14/51 (27%), Positives = 27/51 (52%)
Query: 1 MMERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEV 51
++E L S+ ++LEE +Q +E + + E++ + EE K L+ E
Sbjct: 67 IIEYLLHSQECLTQSNSELEERLQEALQEAEEREQELSKQSEELKSLKRES 117
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters
and repress their activity. by competing with the
binding of TATA binding protein. TMF1_TATA_bd is the
most conserved part of the TMFs. TMFs are
evolutionarily conserved golgins that bind Rab6, a
ubiquitous ras-like GTP-binding Golgi protein, and
contribute to Golgi organisation in animal and plant
cells. The Rab6-binding domain appears to be the same
region as this C-terminal family.
Length = 121
Score = 26.8 bits (60), Expect = 4.4
Identities = 15/51 (29%), Positives = 31/51 (60%)
Query: 18 KLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEMNAAL 68
+LE E+ + ++E+ R+++E + +E +L EE E R +K+ +E+ L
Sbjct: 28 RLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKEL 78
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 27.2 bits (61), Expect = 4.5
Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSE--VNSKDEETKRLQEEVENARNARK 59
R EE + + A + +EE Q K+EE ++ + E + E +R EE E + ARK
Sbjct: 264 TREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 27.5 bits (61), Expect = 4.5
Identities = 19/94 (20%), Positives = 33/94 (35%), Gaps = 9/94 (9%)
Query: 64 MNAALLMATSTPQHHHVEENEHDENDDNMLNGHVSRDLDTDDNIVDPVEER------RTL 117
M A+ L + P+H V + + + S+ D V ++E L
Sbjct: 396 MGASALSPSGKPRHSGVSVPASTSAMTHSFDDNTSKHADPCAMGVKRMDEGILDERLGRL 455
Query: 118 AERNERLQDQ---LKMLKEDLAGTRDETKETAMD 148
+E+L+ Q +K +E L ET
Sbjct: 456 QALSEKLRTQHEEIKRCREALQKKESIVMETLEK 489
>gnl|CDD|219970 pfam08689, Med5, Mediator complex subunit Med5. The mediator
complex is required for the expression of nearly all RNA
pol II dependent genes in Saccharomyces cerevisiae.
Deletion of the MED5 gene leads to increased
transcription of nuclear genes encoding components of
the oxidative phosphorylation machinery, and decreased
transcription of mitochondrial genes encoding components
of the same machinery. There is no orthologue from
pombe, and this subunit appears to be fungal specific.
Length = 977
Score = 27.5 bits (61), Expect = 4.6
Identities = 12/78 (15%), Positives = 24/78 (30%)
Query: 5 LEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEM 64
L ++ ++ K E + E Q ++V S +E + E + M
Sbjct: 883 LRDALKLDDDFDMKFGESDPLRAETAQVDNADVKSIIQEMSSKENEGLGGKRNSFGVLLM 942
Query: 65 NAALLMATSTPQHHHVEE 82
L + +E
Sbjct: 943 ELKKLGDAALQSGGISKE 960
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 27.4 bits (61), Expect = 4.7
Identities = 14/50 (28%), Positives = 25/50 (50%), Gaps = 2/50 (4%)
Query: 3 ERLEESKNMEA--AERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEE 50
R E ++A + A + EI A+Q E+ + SE ++ + +L EE
Sbjct: 158 ARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEE 207
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 26.0 bits (58), Expect = 4.9
Identities = 7/34 (20%), Positives = 18/34 (52%)
Query: 23 IQAKQEEVQRIQSEVNSKDEETKRLQEEVENARN 56
+E+ +Q+E+ E + L+ EV++ ++
Sbjct: 19 YYQLNQEIAALQAELAKLKAENEELEAEVKDLKS 52
>gnl|CDD|233841 TIGR02388, rpoC2_cyan, DNA-directed RNA polymerase, beta'' subunit.
The family consists of the product of the rpoC2 gene, a
subunit of DNA-directed RNA polymerase of cyanobacteria
and chloroplasts. RpoC2 corresponds largely to the
C-terminal region of the RpoC (the beta' subunit) of
other bacteria. Members of this family are designated
beta'' in chloroplasts/plastids, and beta' (confusingly)
in Cyanobacteria, where RpoC1 is called beta' in
chloroplasts/plastids and gamma in Cyanobacteria. We
prefer to name this family beta'', after its organellar
members, to emphasize that this RpoC1 and RpoC2 together
replace RpoC in other bacteria [Transcription,
DNA-dependent RNA polymerase].
Length = 1227
Score = 27.5 bits (61), Expect = 5.2
Identities = 18/62 (29%), Positives = 34/62 (54%), Gaps = 11/62 (17%)
Query: 20 EEEIQAKQE--------EVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEMNAALLMA 71
E+EI+A +E EV+R Q +++ + + L++EV N ++ D +N+ +MA
Sbjct: 69 EKEIRATEERYRRGEITEVERFQKVIDTWNGTNEELKDEV---VNNFRQTDPLNSVYMMA 125
Query: 72 TS 73
S
Sbjct: 126 FS 127
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
family. This model family represents the major capsid
protein component of the heads (capsids) of
bacteriophage HK97, phi-105, P27, and related phage.
This model represents one of several analogous families
lacking detectable sequence similarity. The gene
encoding this component is typically located in an
operon encoding the small and large terminase subunits,
the portal protein and the prohead or maturation
protease [Mobile and extrachromosomal element
functions, Prophage functions].
Length = 384
Score = 27.3 bits (61), Expect = 5.2
Identities = 11/65 (16%), Positives = 24/65 (36%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQD 62
E +K + + E + ++ E ++ E++ D E RL+E ++
Sbjct: 7 ALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASG 66
Query: 63 EMNAA 67
E
Sbjct: 67 EGGGG 71
>gnl|CDD|221682 pfam12639, Colicin-DNase, DNase/tRNase domain of colicin-like
bacteriocin. Colicin-like bacteriocins are complex
structures with an N-terminal beta-barrel translocation
domain (pfam09000), a long double-alpha-helical
receptor-binding domain (pfam11570) and this C-terminal
RNAse/DNase domain with endonuclease activity. Their
competitor bacteriocidal action is by a process that
involves binding to a surface receptor, entering the
cell, and, finally, killing it. The lethal action of
colicin E3 is a specific cleavage in the ribosomal
decoding A site. The crystal structure of colicin E3
reveals a Y-shaped molecule with the receptor binding
domain forming a 100 Angstrom long stalk and the two
globular heads of the translocation domain and this
catalytic domain comprising the two arms.
Length = 110
Score = 26.2 bits (58), Expect = 5.4
Identities = 12/59 (20%), Positives = 19/59 (32%), Gaps = 4/59 (6%)
Query: 27 QEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEMNAALLMATSTPQ----HHHVE 81
Q + +S D K + V K+ E N + P+ HHH +
Sbjct: 31 QIADKLRGRRFSSFDAFRKAFWKAVAKDPELSKQFTEKNLEQIKNGKAPKGYTWHHHQD 89
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 27.2 bits (61), Expect = 5.4
Identities = 22/87 (25%), Positives = 34/87 (39%), Gaps = 24/87 (27%)
Query: 3 ERLEESKNMEAAE------------RAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEE 50
L E + E A A L EI A Q E+ Q+EVN+ +E Q E
Sbjct: 112 TVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQE---AQCE 168
Query: 51 VE---------NARNARKKQDEMNAAL 68
E ++K+++++AA
Sbjct: 169 AEGTGGTGVAGKGPVYKEKREKLDAAQ 195
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 27.3 bits (61), Expect = 5.6
Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 1/48 (2%)
Query: 17 AKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARN-ARKKQDE 63
A+L EE+ Q E+ R ++E+ + + L+E +E R +K +
Sbjct: 424 AQLLEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQ 471
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 26.9 bits (60), Expect = 6.0
Identities = 17/65 (26%), Positives = 33/65 (50%), Gaps = 2/65 (3%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQD 62
E +E+ E E E + +Q ++ ++++++ +E RLQ + EN + R+ Q
Sbjct: 19 EAVEDEVEEETVEEESEAELLDEEQAKIAELEAKLDELEERYLRLQADFENYK--RRIQK 76
Query: 63 EMNAA 67
E AA
Sbjct: 77 ENEAA 81
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 27.1 bits (60), Expect = 6.5
Identities = 19/68 (27%), Positives = 30/68 (44%), Gaps = 3/68 (4%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSK---DEETKRLQEEVENARNARK 59
ERL + + AE A + ++ KQ E ++ +K + E KR + A K
Sbjct: 109 ERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK 168
Query: 60 KQDEMNAA 67
K+ E AA
Sbjct: 169 KKAEAEAA 176
>gnl|CDD|189014 cd09607, M3B_PepF_2, Peptidase family M3B Oligopeptidase F
(PepF). Peptidase family M3B Oligopeptidase F (PepF;
Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
includes oligoendopeptidase F from Lactococcus lactis.
This enzyme hydrolyzes peptides containing between 7
and 17 amino acids with fairly broad specificity. The
PepF gene is duplicated in L. lactis on the plasmid
that bears it, while a shortened second copy is found
in Bacillus subtilis. Most bacterial PepFs are
cytoplasmic endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus
lactis has been identified in Bacillus subtilis as an
inhibitor of sporulation initiation when over expressed
from a multicopy plasmid.
Length = 581
Score = 27.1 bits (61), Expect = 6.5
Identities = 10/67 (14%), Positives = 23/67 (34%), Gaps = 9/67 (13%)
Query: 3 ERLEESKNMEAAERAKLEEEIQAKQEEVQRI---------QSEVNSKDEETKRLQEEVEN 53
L+ + LE I QE ++ + ++ DEE + L ++
Sbjct: 31 SLLDALLPSDEDAVETLEALITLLQELLELLRTLGAFVSCLLSADTTDEEAQALLSQLNQ 90
Query: 54 ARNARKK 60
+ ++
Sbjct: 91 LSASLEQ 97
>gnl|CDD|176114 cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of
LysR-type transcriptional regulator CrgA and its related
homologs, contains the type 2 periplasmic binding
domain. This CD includes the substrate binding domain
of LysR-type transcriptional regulator (LTTR) CrgA and
its related homologs. The LTTRs are acting as both
auto-repressors and activators of target promoters,
controlling operons involved in a wide variety of
cellular processes such as amino acid biosynthesis, CO2
fixation, antibiotic resistance, degradation of aromatic
compounds, nodule formation of nitrogen-fixing bacteria,
and synthesis of virulence factors, to name a few. In
contrast to the tetrameric form of other LTTRs, CrgA
from Neisseria meningitides assembles into an octameric
ring, which can bind up to four 63-bp DNA
oligonucleotides. Phylogenetic cluster analysis further
showed that the CrgA-like regulators form a subclass of
the LTTRs that function as octamers. The CrgA is an
auto-repressor of its own gene and activates the
expression of the mdaB gene which coding for an
NADPH-quinone reductase and that its action is increased
by MBL (alpha-methylene-gamma-butyrolactone), an inducer
of NADPH-quinone oxidoreductase. The structural
topology of this substrate-binding domain is most
similar to that of the type 2 periplasmic binding
proteins (PBP2), which are responsible for the uptake of
a variety of substrates such as phosphate, sulfate,
polysaccharides, lysine/arginine/ornithine, and
histidine. The PBP2 bind their ligand in the cleft
between these domains in a manner resembling a Venus
flytrap. After binding their specific ligand with high
affinity, they can interact with a cognate membrane
transport complex comprised of two integral membrane
domains and two cytoplasmically located ATPase domains.
This interaction triggers the ligand translocation
across the cytoplasmic membrane energized by ATP
hydrolysis.
Length = 197
Score = 26.6 bits (60), Expect = 6.6
Identities = 7/15 (46%), Positives = 9/15 (60%)
Query: 100 DLDTDDNIVDPVEER 114
+L D +VD VEE
Sbjct: 33 ELVLSDRLVDLVEEG 47
>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
family. This model represents a relatively
well-conserved region near the C-terminus of the tape
measure protein of a lambda and related phage. This
protein, which controls phage tail length, is typically
about 1000 residues in length. Both low-complexity
sequence and insertion/deletion events appear common in
this family. Mutational studies suggest a ruler or
template role in the determination of phage tail length.
Similar behavior is attributed to proteins from
distantly related or unrelated families in other phage
[Mobile and extrachromosomal element functions, Prophage
functions].
Length = 332
Score = 26.7 bits (59), Expect = 7.0
Identities = 16/63 (25%), Positives = 30/63 (47%)
Query: 5 LEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEM 64
LEE++ KL E + + + +R D++ +RL ++ R RK+Q ++
Sbjct: 43 LEEAERKALEALKKLAEATASIRAQNKRQLDRFGLGDKQRERLDARLQIDRTFRKQQRDL 102
Query: 65 NAA 67
N A
Sbjct: 103 NKA 105
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 26.4 bits (59), Expect = 7.8
Identities = 17/59 (28%), Positives = 32/59 (54%), Gaps = 2/59 (3%)
Query: 12 EAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARN--ARKKQDEMNAAL 68
E E +LE+E +A +EE++ ++ E+ + RL E EN R R++++ A+
Sbjct: 2 EKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAI 60
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 26.6 bits (59), Expect = 8.0
Identities = 8/39 (20%), Positives = 24/39 (61%)
Query: 14 AERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVE 52
E+ ++ EI++ +++ IQS+++ +E + + E++
Sbjct: 45 KEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIK 83
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 26.1 bits (58), Expect = 8.1
Identities = 12/41 (29%), Positives = 23/41 (56%)
Query: 14 AERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENA 54
E KL+ EI+ +EEVQ ++ + ++ + E K L ++
Sbjct: 72 EELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLTTE 112
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 26.7 bits (60), Expect = 8.5
Identities = 14/48 (29%), Positives = 26/48 (54%)
Query: 6 EESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVEN 53
EE + + ER +LEE+I +EE+Q I ++ + E + E++
Sbjct: 167 EEFEALPEEEREELEEKIDELEEELQEILRQLRELEREAREKLRELDR 214
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 26.1 bits (58), Expect = 8.9
Identities = 12/54 (22%), Positives = 29/54 (53%), Gaps = 5/54 (9%)
Query: 6 EESKNMEAAERAKLEEEIQAKQEEVQR----IQSEVNSK-DEETKRLQEEVENA 54
+++ + R ++E+Q KQ+E+Q+ Q E+ K E + + ++++ A
Sbjct: 67 KQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKA 120
>gnl|CDD|181620 PRK09040, PRK09040, hypothetical protein; Provisional.
Length = 214
Score = 26.1 bits (58), Expect = 9.0
Identities = 8/19 (42%), Positives = 14/19 (73%)
Query: 17 AKLEEEIQAKQEEVQRIQS 35
+KLE E++ +Q E QR ++
Sbjct: 45 SKLEAEVKQRQAEAQRRKT 63
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 26.8 bits (59), Expect = 9.1
Identities = 23/85 (27%), Positives = 37/85 (43%), Gaps = 5/85 (5%)
Query: 5 LEESKNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQEEVENARNARKKQDEM 64
L E K E + AK E E + + EE +R + E + E R Q + E + K Q+ +
Sbjct: 250 LLEEKRRELEKLAKEEAERERQAEEQRRREEEKAA--MEADRAQAKAEVEKRREKLQNLL 307
Query: 65 NAALLMATSTPQHHHVEENEHDEND 89
A + S ++E +E D
Sbjct: 308 KKA---SRSADNVWYIEPSEFKAGD 329
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 26.6 bits (58), Expect = 9.2
Identities = 20/51 (39%), Positives = 28/51 (54%), Gaps = 8/51 (15%)
Query: 9 KNMEAAERAKLEEEIQAKQEEVQRIQSEVNSKDEETKRLQE-EVENARNAR 58
K EA E+AK E E Q +EE +R K++E +R +E E E R A+
Sbjct: 580 KREEAVEKAKREAE-QKAREERER------EKEKEKEREREREREAERAAK 623
>gnl|CDD|227900 COG5613, COG5613, Uncharacterized conserved protein [Function
unknown].
Length = 400
Score = 26.5 bits (58), Expect = 9.6
Identities = 14/46 (30%), Positives = 22/46 (47%), Gaps = 1/46 (2%)
Query: 9 KNMEAAERAKLEEEIQAKQEEVQRI-QSEVNSKDEETKRLQEEVEN 53
N EAA+ + A + +QRI + E ++ QEE+EN
Sbjct: 328 LNAEAAQLQADSQLAAADVQNLQRIIERLKEELKLELEKAQEEMEN 373
>gnl|CDD|225783 COG3243, PhaC, Poly(3-hydroxyalkanoate) synthetase [Lipid
metabolism].
Length = 445
Score = 26.6 bits (59), Expect = 9.7
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 107 IVDPVEERRTLAERNERLQDQLKMLKEDLA 136
I +P R TL + + L L+ L EDL
Sbjct: 32 ITNPEALRETLESQGKNLVAGLENLAEDLE 61
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.306 0.123 0.317
Gapped
Lambda K H
0.267 0.0686 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,212,877
Number of extensions: 870052
Number of successful extensions: 3084
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2480
Number of HSP's successfully gapped: 794
Length of query: 183
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 92
Effective length of database: 6,901,388
Effective search space: 634927696
Effective search space used: 634927696
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 56 (25.4 bits)