RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2152
(205 letters)
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins in
this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 171 bits (436), Expect = 6e-55
Identities = 106/158 (67%), Positives = 126/158 (79%)
Query: 23 VQSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNE 82
V LA SIYQEFE++I Y E+VVK+LMPLV+ VLE+LD +ENQE EVELELLREDNE
Sbjct: 1 VYDLANSIYQEFERLIDNYGEEVVKNLMPLVIRVLENLDSLASENQELEVELELLREDNE 60
Query: 83 QLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVY 142
+L TQYEREK+LRK A+QKL + ED LE +KKEL KIE LE VR LELK+KN SD V
Sbjct: 61 RLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLELKAKNLSDQVS 120
Query: 143 RLEEKEQELKKEYSILHERYTELLKTHMEYIERTKLMM 180
RLEE+E ELKKEY+ LHERYT+LLK ++EY+ER K +
Sbjct: 121 RLEERETELKKEYNSLHERYTKLLKNYVEYVERQKDEI 158
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 41.2 bits (97), Expect = 2e-04
Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 3/66 (4%)
Query: 66 ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
E +E L L +NE L + ER ++ ++ DQ QI+ A++ + +EL +IE L+S
Sbjct: 67 EVKELRKRLAKLISENEALKAENERLQKREQSIDQ---QIQQAVQSETQELTKEIEQLKS 123
Query: 126 IVRMLE 131
+ L+
Sbjct: 124 ERQQLQ 129
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 38.1 bits (89), Expect = 0.002
Identities = 20/82 (24%), Positives = 41/82 (50%), Gaps = 1/82 (1%)
Query: 48 DLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQ-LRKAADQKLFQIE 106
+ M +LD F E E E E L E+ + + + + ++ ++ + +E
Sbjct: 489 ERMVEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEEKEKIEEAITDLE 548
Query: 107 DALEDDKKELNSKIESLESIVR 128
+ALE +K+E+ +KIE L+ + +
Sbjct: 549 EALEGEKEEIKAKIEELQEVTQ 570
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 37.5 bits (88), Expect = 0.003
Identities = 28/110 (25%), Positives = 55/110 (50%), Gaps = 18/110 (16%)
Query: 66 ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
E + E LE +E+ +L ++E+E + R+ +L ++E L ++ L+ K+E LE
Sbjct: 52 EAIKKEALLEA-KEEIHKLRNEFEKELRERR---NELQKLEKRLLQKEENLDRKLELLE- 106
Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIER 175
K + LE+KE+EL+++ L ++ EL + E ++
Sbjct: 107 -------KREE------ELEKKEKELEQKQQELEKKEEELEELIEEQLQE 143
Score = 27.0 bits (61), Expect = 8.3
Identities = 20/112 (17%), Positives = 53/112 (47%), Gaps = 22/112 (19%)
Query: 71 EVELELLREDNEQLVTQYEREKQLRK-----AADQKLFQIEDALEDDKKELNSKIESLES 125
E +++ E+ ++++ + ++E + K A +++ ++ + E + +E ++++ LE
Sbjct: 30 EAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEK 89
Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTK 177
RL +KE+ L ++ +L +R EL K E ++ +
Sbjct: 90 -----------------RLLQKEENLDRKLELLEKREEELEKKEKELEQKQQ 124
>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
topoisomerase IV), A subunit [DNA replication,
recombination, and repair].
Length = 804
Score = 36.9 bits (86), Expect = 0.005
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 16/104 (15%)
Query: 82 EQLVTQYER-EKQLRKAADQKLFQI----EDALEDDKKELNSKIESLESIVRMLELKSKN 136
E+L+ ++ EKQ D +L ++ E+ +E + KEL +I LE I L S+
Sbjct: 407 EELMARFGLSEKQAEAILDLRLRRLTGLEEEKIEKELKELEKEIADLEKI-----LASEE 461
Query: 137 ASDHVYRLEEKE-QELKKEYSILHERYTELLKTHMEYIERTKLM 179
+ + +KE E+KK++ ER TE+++ + IE L+
Sbjct: 462 R---LLDIIKKELLEIKKKFG--DERRTEIVEEEEDEIEDEDLI 500
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 35.5 bits (82), Expect = 0.018
Identities = 29/111 (26%), Positives = 55/111 (49%)
Query: 57 LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
+E L+ E +E ELE E+ E+ + + + E + +A ++L LE++K+EL
Sbjct: 830 IEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889
Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
++ LES + L+ + + + + LE K + L+ E L E E +
Sbjct: 890 EEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYE 940
Score = 33.1 bits (76), Expect = 0.084
Identities = 30/122 (24%), Positives = 61/122 (50%), Gaps = 3/122 (2%)
Query: 47 KDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIE 106
+ L + + E L+ E +E + LE L E+ E L + K+ + ++K ++
Sbjct: 733 EQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQ 792
Query: 107 DALEDDKKEL---NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYT 163
+ LE+ ++EL ++++LE + LE + + + LEE+ +EL+++ L E
Sbjct: 793 EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELE 852
Query: 164 EL 165
EL
Sbjct: 853 EL 854
Score = 31.6 bits (72), Expect = 0.31
Identities = 28/98 (28%), Positives = 51/98 (52%), Gaps = 4/98 (4%)
Query: 68 QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIV 127
+E E ELE + L + E +Q R+ +Q++ + LE++ +EL K++ LE +
Sbjct: 796 EELEEELEEAERRLDALERELESLEQRRERLEQEI----EELEEEIEELEEKLDELEEEL 851
Query: 128 RMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
LE + + + + LE +++EL+ E L E EL
Sbjct: 852 EELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889
Score = 31.2 bits (71), Expect = 0.36
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 3/103 (2%)
Query: 66 ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALED---DKKELNSKIES 122
E E ELE L+ + E L + ER + + ++L ++E LE+ + +ELN ++E
Sbjct: 391 ELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEE 450
Query: 123 LESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
LE + L + K + L+E+ Q L+KE S L R L
Sbjct: 451 LEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRL 493
Score = 30.5 bits (69), Expect = 0.70
Identities = 28/123 (22%), Positives = 58/123 (47%), Gaps = 10/123 (8%)
Query: 68 QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNS--------- 118
+E ELE L E+ +L + E ++ + A++++ +++ LE+ ++EL
Sbjct: 235 KELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELK 294
Query: 119 -KIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTK 177
+IE LE + +L + + + + LEE+ +ELK++ L E E E +
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354
Query: 178 LMM 180
+
Sbjct: 355 ELE 357
Score = 30.1 bits (68), Expect = 0.80
Identities = 23/97 (23%), Positives = 45/97 (46%)
Query: 69 EHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
+ E EL+ L+ + L E ++ + +++L +++ L ++EL LE +
Sbjct: 685 KLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEE 744
Query: 129 MLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
LE + + RLEE E+EL+ L + E+
Sbjct: 745 ELEELEEELEELQERLEELEEELESLEEALAKLKEEI 781
Score = 29.7 bits (67), Expect = 1.2
Identities = 26/118 (22%), Positives = 54/118 (45%)
Query: 55 NVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKK 114
LE L+ + +E EL+ E+ E+ + + + E + + ++L + L+++ +
Sbjct: 239 KELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIE 298
Query: 115 ELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEY 172
EL +I L + LE + + + + L+EK + LK+E EL + E
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAEL 356
Score = 29.7 bits (67), Expect = 1.2
Identities = 17/72 (23%), Positives = 35/72 (48%)
Query: 106 EDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
A + + KEL ++ LE+ + LE + K+ + + LE+ +EL+++ L + EL
Sbjct: 662 SLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEEL 721
Query: 166 LKTHMEYIERTK 177
+ E +
Sbjct: 722 KRELAALEEELE 733
Score = 29.3 bits (66), Expect = 1.7
Identities = 32/109 (29%), Positives = 55/109 (50%)
Query: 57 LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
LESL+ A + +E ELE R+ ++ + + E E + + L + ++LE ++ L
Sbjct: 767 LESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERL 826
Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
+IE LE + LE K + + LE++ +ELK+E L EL
Sbjct: 827 EQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEEL 875
Score = 28.9 bits (65), Expect = 2.3
Identities = 27/109 (24%), Positives = 50/109 (45%)
Query: 57 LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
LE L+ + LE L E+ + + + ER+ + K L + + L+ +EL
Sbjct: 683 LEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEEL 742
Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
++E LE + L+ + + + + LEE +LK+E L E+ L
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQAL 791
Score = 27.8 bits (62), Expect = 5.6
Identities = 24/103 (23%), Positives = 52/103 (50%)
Query: 65 TENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLE 124
E +E E LE L + +L E ++ A ++L ++ +AL ++ EL +++ +
Sbjct: 337 EELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIR 396
Query: 125 SIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
+ + L+ + ++ + + RL E+ ++LK+E L EL
Sbjct: 397 NELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQT 439
Score = 27.0 bits (60), Expect = 9.5
Identities = 26/111 (23%), Positives = 57/111 (51%)
Query: 57 LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
LE L+ F +E ELE + + + +RE + + ++L + + L+++ KEL
Sbjct: 371 LEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKEL 430
Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
+++E L++ + L + + + + L ++ +EL++E + L E L K
Sbjct: 431 EAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEK 481
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 34.2 bits (79), Expect = 0.021
Identities = 20/76 (26%), Positives = 36/76 (47%), Gaps = 8/76 (10%)
Query: 66 ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
ENQE + EL L + E+L + R KQL A L+++ +EL ++ L+
Sbjct: 95 ENQELKQELSTLEAELERLQKELARIKQLSANAI--------ELDEENRELREELAELKQ 146
Query: 126 IVRMLELKSKNASDHV 141
LE +++ ++
Sbjct: 147 ENEALEAENERLQENE 162
Score = 31.5 bits (72), Expect = 0.20
Identities = 24/82 (29%), Positives = 38/82 (46%), Gaps = 8/82 (9%)
Query: 73 ELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLEL 132
EL L+E+ +L Q +Q + Q+L +E LE +KEL + + + EL
Sbjct: 74 ELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI---EL 130
Query: 133 KSKNASDHVYRLEEKEQELKKE 154
+N L E+ ELK+E
Sbjct: 131 DEENR-----ELREELAELKQE 147
Score = 26.5 bits (59), Expect = 8.6
Identities = 21/106 (19%), Positives = 49/106 (46%), Gaps = 8/106 (7%)
Query: 77 LREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLEL---K 133
RE +L + ++ ++L L+ + +EL ++ +LE+ + L+ +
Sbjct: 64 ARERLPELQQELAELQEELAELQEQL----AELQQENQELKQELSTLEAELERLQKELAR 119
Query: 134 SKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTKLM 179
K S + L+E+ +EL++E + L + E L+ E ++ +
Sbjct: 120 IKQLSANAIELDEENRELREELAELKQE-NEALEAENERLQENEQR 164
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 34.8 bits (81), Expect = 0.024
Identities = 23/66 (34%), Positives = 40/66 (60%), Gaps = 3/66 (4%)
Query: 105 IEDA---LEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHER 161
IE+A + +DK++LN I SLE + R LE K++ A + E+ ++EL+++ L E
Sbjct: 504 IEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEE 563
Query: 162 YTELLK 167
+LL+
Sbjct: 564 EDKLLE 569
Score = 30.2 bits (69), Expect = 0.84
Identities = 23/107 (21%), Positives = 47/107 (43%), Gaps = 6/107 (5%)
Query: 61 DLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKI 120
E E E +L E E+ E E +L + A+++ + A+++ KKE + I
Sbjct: 535 KAEEAEALLKEAE-KLKEELEEKKEKLQEEEDKLLEEAEKE---AQQAIKEAKKEADEII 590
Query: 121 ESLESIVRML--ELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
+ L + + +K+ + RL + ++ +K+ E+ EL
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEEL 637
Score = 29.0 bits (66), Expect = 1.8
Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 15/76 (19%)
Query: 64 FTENQEHEVELELLREDNEQLVTQYEREKQLRKA----------ADQKLFQIEDALEDDK 113
F E+ E E EL +L E ++ T E E+++ A +KL I L
Sbjct: 105 FIEDLEEEEELPILEEWVAKIRTLPELEQEIHNCIDEEGEVKDSASEKLRGIRRQL---- 160
Query: 114 KELNSKI-ESLESIVR 128
+ S+I E LESI+R
Sbjct: 161 RRKKSRIREKLESIIR 176
Score = 27.1 bits (61), Expect = 8.8
Identities = 14/67 (20%), Positives = 33/67 (49%), Gaps = 3/67 (4%)
Query: 109 LEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKT 168
LE+ ++EL K E E++++ E + + +L+E+E +L +E + + +K
Sbjct: 525 LEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEE---AEKEAQQAIKE 581
Query: 169 HMEYIER 175
+ +
Sbjct: 582 AKKEADE 588
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 34.5 bits (80), Expect = 0.027
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 15/111 (13%)
Query: 65 TENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLE 124
E + E LE +E+ +L + ERE + R+ +L ++E L ++ L+ K+ESL+
Sbjct: 45 AETLKKEALLEA-KEEVHKLRAELERELKERR---NELQRLERRLLQREETLDRKMESLD 100
Query: 125 SIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIER 175
LE K K L KE+ L + E EL+ E +ER
Sbjct: 101 KKEENLEKKEK-------ELSNKEKNLDE----KEEELEELIAEQREELER 140
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 34.5 bits (79), Expect = 0.029
Identities = 26/79 (32%), Positives = 44/79 (55%)
Query: 75 ELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKS 134
ELL E E+ E +Q K+ Q+L QIE L + L+ K E+L S + LE K
Sbjct: 72 ELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSSKEKTLESKE 131
Query: 135 KNASDHVYRLEEKEQELKK 153
++ +D ++E+E++++K
Sbjct: 132 QSLTDKSKHIDEREEQVEK 150
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 33.9 bits (78), Expect = 0.053
Identities = 25/111 (22%), Positives = 46/111 (41%), Gaps = 4/111 (3%)
Query: 58 ESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELN 117
E L+ E E + EL+ L+E+ ++L + + K+ ++E+ ED E+
Sbjct: 392 EKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIK 451
Query: 118 SKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKT 168
+ LE + L +Y L+E+ ++KE S L E
Sbjct: 452 KQEWKLEQLAADLS----KYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ 498
Score = 32.0 bits (73), Expect = 0.23
Identities = 24/97 (24%), Positives = 43/97 (44%)
Query: 69 EHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
E +E + EQ + + EK+ + Q+L + L++ K + +IE+L
Sbjct: 805 EEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE 864
Query: 129 MLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
LE + + + LE + +LKKE L + EL
Sbjct: 865 ELEEELEELEAALRDLESRLGDLKKERDELEAQLREL 901
Score = 31.6 bits (72), Expect = 0.31
Identities = 31/113 (27%), Positives = 50/113 (44%), Gaps = 4/113 (3%)
Query: 68 QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIV 127
E ELE L E+ +L + E +QL + ++K I+D E+++ + KI LE+ +
Sbjct: 247 ASLEEELEKLTEEISELEKRLEEIEQLLEELNKK---IKDLGEEEQLRVKEKIGELEAEI 303
Query: 128 RMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEY-IERTKLM 179
LE + EE+ +L+ E L EL + E R KL
Sbjct: 304 ASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356
Score = 29.3 bits (66), Expect = 1.7
Identities = 31/152 (20%), Positives = 67/152 (44%), Gaps = 19/152 (12%)
Query: 24 QSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQ 83
L+ S E + ++K +E+V + + L ++ + +E E L ++ ++
Sbjct: 786 ARLSHSRIPEIQAELSKLEEEVSR-----IEARLREIE---QKLNRLTLEKEYLEKEIQE 837
Query: 84 LVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLE-----LKSK--N 136
L + + + + + + + + L K+EL ++E LE+ +R LE LK +
Sbjct: 838 LQ-EQRIDLKEQIKSIE---KEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDE 893
Query: 137 ASDHVYRLEEKEQELKKEYSILHERYTELLKT 168
+ LE K +EL+ + +R +EL
Sbjct: 894 LEAQLRELERKIEELEAQIEKKRKRLSELKAK 925
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 33.3 bits (76), Expect = 0.072
Identities = 19/78 (24%), Positives = 38/78 (48%), Gaps = 4/78 (5%)
Query: 104 QIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEY----SILH 159
Q + + K+E+N IE L ++ + LE K+++ + E+ ++EL++E
Sbjct: 501 QAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERER 560
Query: 160 ERYTELLKTHMEYIERTK 177
+ EL K E ++ K
Sbjct: 561 NKKLELEKEAQEALKALK 578
Score = 32.1 bits (73), Expect = 0.18
Identities = 22/86 (25%), Positives = 39/86 (45%), Gaps = 3/86 (3%)
Query: 89 EREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKE 148
E+ K +++ + + L +KEL K E LE +++ E K + L+E+E
Sbjct: 500 EQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERE 559
Query: 149 QELKKEYSILHERYTELLKTHMEYIE 174
+ K E L + E LK + +E
Sbjct: 560 RNKKLE---LEKEAQEALKALKKEVE 582
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 33.1 bits (76), Expect = 0.089
Identities = 32/108 (29%), Positives = 54/108 (50%), Gaps = 4/108 (3%)
Query: 58 ESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELN 117
+ L+ Q H E E LRED + L ER ++LR+ A + ++E+A + ++
Sbjct: 328 DRLEECRVAAQAHNEEAESLREDADDL---EERAEELREEAAELESELEEA-REAVEDRR 383
Query: 118 SKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
+IE LE + L + +A + E+ +EL++E L ER EL
Sbjct: 384 EEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAEL 431
Score = 28.5 bits (64), Expect = 2.7
Identities = 26/101 (25%), Positives = 44/101 (43%), Gaps = 6/101 (5%)
Query: 55 NVLESLDLAFTENQEHEVELELLREDNEQL----VTQYEREKQLRKAADQKLFQIEDALE 110
++ E + +E E LRE +L + E + + A++ ++ L
Sbjct: 520 DLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEV-AELN 578
Query: 111 DDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQEL 151
EL +IESLE I +L + +A D + RL EK + L
Sbjct: 579 SKLAELKERIESLERIRTLLAAIA-DAEDEIERLREKREAL 618
Score = 27.7 bits (62), Expect = 5.3
Identities = 23/82 (28%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 69 EHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
E E ELE L E+ E++ + ER + L +A D+ + ++E+ ED ++ + + E++E
Sbjct: 479 ELEAELEDLEEEVEEVEERLERAEDLVEAEDR-IERLEERREDLEELIAERRETIEEKRE 537
Query: 129 MLELKSKNASDHVYRLEEKEQE 150
E + A++ EEK +
Sbjct: 538 RAEELRERAAELEAEAEEKREA 559
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 32.6 bits (74), Expect = 0.13
Identities = 29/129 (22%), Positives = 55/129 (42%), Gaps = 14/129 (10%)
Query: 28 GSIYQEFEKMITKYDEDVV---KDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQL 84
G + +E + + TK + V KD + S++ + V E +QL
Sbjct: 451 GQLKKENDMLQTKLNSMVSAKQKDKQSMQ-----SMEKRLKSEADSRVNAE------KQL 499
Query: 85 VTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRL 144
+ +R+K+ + A + Q + E+ + L + LE ++ LE K + L
Sbjct: 500 AEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKLKEEECRML 559
Query: 145 EEKEQELKK 153
E++ QEL+K
Sbjct: 560 EKEAQELRK 568
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 32.0 bits (73), Expect = 0.20
Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 5/117 (4%)
Query: 74 LELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELK 133
+E L E+N +L + E K+ + + +L + + D ++ + +I + + + LE +
Sbjct: 431 VERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRK-DREIRARDRRIERLEKE 489
Query: 134 SKNASDHVYRLEEKEQELKKEYSILHERYTELLKT----HMEYIERTKLMMGSGSSD 186
+ V LE K EL+K + +K +E IE + G D
Sbjct: 490 LEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEEEYGIKEGD 546
Score = 28.1 bits (63), Expect = 3.6
Identities = 32/149 (21%), Positives = 63/149 (42%), Gaps = 12/149 (8%)
Query: 29 SIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQY 88
+ ++ ++ D + +K L+ + E+L E + E E E E +T Y
Sbjct: 363 KVERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERRE--ITVY 420
Query: 89 EREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRML----------ELKSKNAS 138
E+ + + ++L + L+ + +EL +IE LES + + + +
Sbjct: 421 EKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARD 480
Query: 139 DHVYRLEEKEQELKKEYSILHERYTELLK 167
+ RLE++ +E KK L + EL K
Sbjct: 481 RRIERLEKELEEKKKRVEELERKLAELRK 509
>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
Length = 663
Score = 32.1 bits (73), Expect = 0.21
Identities = 32/112 (28%), Positives = 46/112 (41%), Gaps = 29/112 (25%)
Query: 23 VQSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNE 82
+QS G +E EKM+ + +E + E E + EL + + E
Sbjct: 535 IQSSGGLSDEEIEKMVKEAEE--------------------YKEQDEKKKELVDAKNEAE 574
Query: 83 QLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKS 134
L+ Y EKQL D+ I DA DK EL KI L S + ++ S
Sbjct: 575 TLI--YSVEKQLSDLKDK----ISDA---DKDELKQKITKLRSTLSSEDVDS 617
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 31.2 bits (71), Expect = 0.36
Identities = 36/163 (22%), Positives = 71/163 (43%), Gaps = 11/163 (6%)
Query: 11 THDDSHIVMSEKVQSLAGSIY------QEFEKMITKYDEDVVKDLMPLVVNVLESLDLAF 64
+ E+++ L +Y E+ E + +L + + L+
Sbjct: 271 ELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERL-ANLERQLEELEAQLEELE 329
Query: 65 TENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLE 124
++ E EL L E E+L + E + + + +L ++E LE+ +++L E+L
Sbjct: 330 SKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL----ETLR 385
Query: 125 SIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
S V LEL+ + ++ + RLE + + L+ L + ELLK
Sbjct: 386 SKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLK 428
Score = 30.0 bits (68), Expect = 1.0
Identities = 23/81 (28%), Positives = 40/81 (49%), Gaps = 7/81 (8%)
Query: 69 EHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
E E ELE L+E+ E+L E ++ + A+Q L DA E + +L ++++SLE R
Sbjct: 444 ELEEELEELQEELERLEEALEELREELEEAEQAL----DAAERELAQLQARLDSLE---R 496
Query: 129 MLELKSKNASDHVYRLEEKEQ 149
+ E + L+ +
Sbjct: 497 LQENLEGFSEGVKALLKNQSG 517
Score = 28.9 bits (65), Expect = 2.2
Identities = 20/69 (28%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Query: 89 EREKQLRKAADQKLFQIE--DALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEE 146
RE +L + E + L+++ KE ++E L + ++ LE K + V LEE
Sbjct: 222 LRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEE 281
Query: 147 KEQELKKEY 155
+ +EL+KE
Sbjct: 282 EIEELQKEL 290
Score = 28.5 bits (64), Expect = 2.9
Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 9/101 (8%)
Query: 55 NVLESLDLAFTENQEHEVELELLREDNEQLVTQYERE-KQLRKAADQKLFQIEDALEDDK 113
+E L+ E +E E E E+ E + Q + E K LR+A D L +
Sbjct: 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALD--------ELRAEL 812
Query: 114 KELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKE 154
LN + +L + LE + + LEE+ +EL ++
Sbjct: 813 TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSED 853
Score = 28.1 bits (63), Expect = 3.9
Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 13/112 (11%)
Query: 65 TENQEHEVELELLR-EDN-EQLVTQYER-EKQLRKAADQKLFQIEDALED--------DK 113
TE + L R ED +L Q + E+Q KA ++ +++ L +
Sbjct: 177 TERKLERTRENLDRLEDILNELERQLKSLERQAEKA--ERYKELKAELRELELALLVLRL 234
Query: 114 KELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
+EL ++E L+ ++ E + + + + LEEK +EL+ E S L E EL
Sbjct: 235 EELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEEL 286
Score = 28.1 bits (63), Expect = 4.0
Identities = 18/107 (16%), Positives = 40/107 (37%)
Query: 68 QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIV 127
+ E + E L Q E + ++ ++ ++E+ +E+ + EL + + S+
Sbjct: 827 ESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLE 886
Query: 128 RMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIE 174
L L + L E E + + L E +L + +
Sbjct: 887 EALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEG 933
Score = 27.3 bits (61), Expect = 7.1
Identities = 23/106 (21%), Positives = 48/106 (45%), Gaps = 3/106 (2%)
Query: 73 ELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALE---DDKKELNSKIESLESIVRM 129
+L L + EQL + + + + ++ ++E+ LE ++ E ++IE LE+ +
Sbjct: 734 DLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ 793
Query: 130 LELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIER 175
L+ + K + + L + L +E + L ER L + R
Sbjct: 794 LKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERR 839
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 31.0 bits (71), Expect = 0.37
Identities = 19/88 (21%), Positives = 41/88 (46%), Gaps = 23/88 (26%)
Query: 89 EREKQLRKAADQK------LFQIEDALED--------DKKELNSKIESLESIVRMLELKS 134
+K+ ++ + K ++ +E +L++ DKK++ IE L+ EL+
Sbjct: 518 AEDKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKKVEEAIEWLKE-----ELEG 572
Query: 135 KNASDHVYRLEEKEQELKKEYSILHERY 162
++ + +E K +EL+K + ER
Sbjct: 573 EDKEE----IEAKTEELQKVVQPIGERM 596
>gnl|CDD|233067 TIGR00641, acid_CoA_mut_N, methylmalonyl-CoA mutase N-terminal
domain. Methylmalonyl-CoA mutase (EC 5.4.99.2)
catalyzes a reversible isomerization between
L-methylmalonyl-CoA and succinyl-CoA. The enzyme uses an
adenosylcobalamin cofactor. It may be a homodimer, as in
mitochondrion, or a heterodimer with partially
homologous beta chain that does not bind the
adenosylcobalamin cofactor, as in Propionibacterium
freudenreichii. The most similar archaeal sequences are
separate chains, such as AF2215 abd AF2219 of
Archaeoglobus fulgidus, that correspond roughly to the
first 500 and last 130 residues, respectively of known
methylmalonyl-CoA mutases. This model describes the
N-terminal domain subfamily. In a neighbor-joining tree,
AF2215 branches with a bacterial isobutyryl-CoA mutase,
which is also the same length. Scoring between the noise
and trusted cutoffs are the non-catalytic, partially
homologous beta chains from certain heterodimeric
examples of 5.4.99.2.
Length = 524
Score = 30.9 bits (70), Expect = 0.47
Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 69 EHEVELELLREDNEQLVT-QYEREKQLRKAADQKLFQ 104
E E E+E+L+ DN + Q + K+LR DQ+ +
Sbjct: 437 EEEDEVEVLKVDNSSVREEQIAKLKKLRAERDQEKVE 473
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 30.4 bits (69), Expect = 0.53
Identities = 18/79 (22%), Positives = 39/79 (49%), Gaps = 1/79 (1%)
Query: 77 LREDNEQLVTQYEREK-QLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSK 135
LRE +L + + + A ++ DAL+++ +L+++I+ LE ++ LE K
Sbjct: 4 LREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPA 63
Query: 136 NASDHVYRLEEKEQELKKE 154
+ + EE+E+ +
Sbjct: 64 ASGEGGGGEEEEEEAKAEA 82
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 30.7 bits (70), Expect = 0.56
Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 3/60 (5%)
Query: 68 QEHEVELELLREDNEQLVTQYEREKQLRKAAD---QKLFQIEDALEDDKKELNSKIESLE 124
E EL L + L ++++EK+L +A +L DA DD L +++ LE
Sbjct: 453 AELRAELAALEAELAALEARWQQEKELVEAILALRAELEADADAPADDDAALRAQLAELE 512
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 30.2 bits (69), Expect = 0.67
Identities = 13/44 (29%), Positives = 22/44 (50%)
Query: 111 DDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKE 154
+ + L + LE +R LELK ++ +LE + + LK E
Sbjct: 1 ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSE 44
>gnl|CDD|234348 TIGR03761, ICE_PFL4669, integrating conjugative element protein,
PFL_4669 family. Members of this protein family, such
as PFL4669, are found in integrating conjugative
elements (ICE) of the PFGI-1 class as in Pseudomonas
fluorescens.
Length = 216
Score = 30.0 bits (68), Expect = 0.69
Identities = 9/33 (27%), Positives = 20/33 (60%)
Query: 98 ADQKLFQIEDALEDDKKELNSKIESLESIVRML 130
AD L +IE+ L ++E+ + ++ L+ ++ L
Sbjct: 58 ADWALLRIEEKLLSARQEMQALLQRLDDLLAQL 90
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 30.4 bits (69), Expect = 0.74
Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 1/70 (1%)
Query: 104 QIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEY-SILHERY 162
+ + ++K+ L IE LE + + LE + + + +E S+L +
Sbjct: 494 EAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKA 553
Query: 163 TELLKTHMEY 172
++ HM
Sbjct: 554 QIVVLAHMGL 563
>gnl|CDD|202685 pfam03564, DUF1759, Protein of unknown function (DUF1759). This is
a family of proteins of unknown function. Most of the
members are gag-polyproteins.
Length = 146
Score = 29.3 bits (66), Expect = 0.79
Identities = 14/58 (24%), Positives = 26/58 (44%)
Query: 82 EQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASD 139
E L +Y+ + + ++ KL ++ D +L + I+R LE +NA D
Sbjct: 61 EALKERYDNPRVIIRSLLNKLMKLPSTNNDSVSQLRRLYDEANEIIRQLEQLGENADD 118
>gnl|CDD|217457 pfam03255, ACCA, Acetyl co-enzyme A carboxylase carboxyltransferase
alpha subunit. Acetyl co-enzyme A carboxylase
carboxyltransferase is composed of an alpha and beta
subunit.
Length = 145
Score = 29.2 bits (66), Expect = 0.80
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 3/46 (6%)
Query: 109 LEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKE 154
E EL +KI+ L ++ + SD ++RLE+K +LK+E
Sbjct: 5 FEKPLAELEAKIDELR---KLARKNDVDVSDEIHRLEKKLDKLKRE 47
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 30.0 bits (68), Expect = 0.87
Identities = 29/143 (20%), Positives = 58/143 (40%), Gaps = 23/143 (16%)
Query: 66 ENQEHEVELELLRED-----------NEQLVTQYEREKQL-----RKAADQKLFQIEDAL 109
EN E E ++E L+++ ++ + E ++ KA + + + L
Sbjct: 89 ENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNL 148
Query: 110 EDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYT--ELLK 167
+ K LN K E +++ L+ NAS + LEE + ++K +S L
Sbjct: 149 SEALKGLNYKKNFKEKLLKELKSVILNAS-SLLSLEELKAKIKTLFSSNKPELALLTLSV 207
Query: 168 THMEYIERTKLM----MGSGSSD 186
+ IE+ +++ +GS
Sbjct: 208 IDFDEIEQAEILEKSIIGSSDVP 230
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 30.0 bits (67), Expect = 0.87
Identities = 19/128 (14%), Positives = 36/128 (28%), Gaps = 4/128 (3%)
Query: 68 QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDD----KKELNSKIESL 123
+E E + + Q ++ E K A + D ++ L++K E
Sbjct: 298 EEQETLVRQFTVEQTQAKSKVESAKIETDRAREMETLAHDNVKQIVGAQHGILSAKREGA 357
Query: 124 ESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTKLMMGSG 183
R + V L E +E L+ + + E +
Sbjct: 358 VDKRRTISTARAGLDALVKGLGGAAPESAEELLELNNAARLTVDEYPAAREALESAGQRN 417
Query: 184 SSDRMDSR 191
DR +
Sbjct: 418 VEDRTRAV 425
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 29.8 bits (67), Expect = 0.94
Identities = 19/88 (21%), Positives = 36/88 (40%)
Query: 67 NQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESI 126
Q+ E E L++ ++ + E++KQ +AA Q + + A E K + E+
Sbjct: 93 QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152
Query: 127 VRMLELKSKNASDHVYRLEEKEQELKKE 154
+ +K A+ + E E K
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKAA 180
>gnl|CDD|235580 PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase
subunit alpha; Validated.
Length = 319
Score = 29.7 bits (68), Expect = 0.95
Identities = 14/41 (34%), Positives = 25/41 (60%), Gaps = 3/41 (7%)
Query: 114 KELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKE 154
EL +KIE L ++ ++ + S+ + RLE+K +EL K+
Sbjct: 13 AELEAKIEELRAVAEDSDV---DLSEEIERLEKKLEELTKK 50
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.5 bits (66), Expect = 1.1
Identities = 24/100 (24%), Positives = 48/100 (48%), Gaps = 5/100 (5%)
Query: 73 ELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLEL 132
E L+ED E+L + E ++ ++ ++L ++E E E+ +++ LE LE
Sbjct: 129 EYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYE----EVQERLKRLEVENSRLEE 184
Query: 133 KSKNASDHVYRLEEKEQELKKEYSIL-HERYTELLKTHME 171
K VY L+++ EL+ + E ++L+K +
Sbjct: 185 MLKKLPGEVYDLKKRWDELEPGVELPEEELISDLVKETLN 224
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 29.7 bits (67), Expect = 1.1
Identities = 27/115 (23%), Positives = 51/115 (44%), Gaps = 11/115 (9%)
Query: 55 NVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAA---DQKLFQIE---DA 108
L++L+ Q+ E E+ +L + R KQ +A ++L Q+E +A
Sbjct: 420 EDLQALESQL--RQQLEAGKLEFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEA 477
Query: 109 LEDDKKELNS---KIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHE 160
LE ++E +E L+S +R L + A + + R E + +L++ L
Sbjct: 478 LEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELEL 532
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 28.8 bits (65), Expect = 1.2
Identities = 28/111 (25%), Positives = 51/111 (45%), Gaps = 15/111 (13%)
Query: 50 MPLVVNVLESLDLAFTENQEHEVELEL----LREDNEQLVTQYEREKQLRKAADQKLFQI 105
+ ++NV+ SL + E LE L + E+L ER K + +++L
Sbjct: 31 IVRIINVIYSLLQQRDRDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLEREL--- 87
Query: 106 EDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYS 156
L+ +++L K+++LE K KN + V RL+ Q+ K +Y+
Sbjct: 88 -ALLQAKERQLEKKLKTLEQ-------KLKNEKEEVQRLKNIIQQRKTQYN 130
>gnl|CDD|223944 COG1012, PutA, NAD-dependent aldehyde dehydrogenases [Energy
production and conversion].
Length = 472
Score = 29.6 bits (67), Expect = 1.2
Identities = 12/41 (29%), Positives = 20/41 (48%), Gaps = 9/41 (21%)
Query: 85 VTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
++ ER LR+ A D LE +EL + + +LE+
Sbjct: 55 LSAEERAAILRRIA--------DLLEARAEEL-AALITLET 86
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 29.6 bits (66), Expect = 1.3
Identities = 22/99 (22%), Positives = 39/99 (39%), Gaps = 8/99 (8%)
Query: 67 NQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESI 126
Q + L +Q R K L ++L +L+ +K+L + L+S
Sbjct: 129 RQNLAKAQQELARLTKQAQDLQTRLKTL-AEQRRQLEAQAQSLQASQKQLQASATQLKSQ 187
Query: 127 VRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
V L+L+S ++E++ Q L + R EL
Sbjct: 188 VLDLKLRSA-------QIEQEAQNLATRANAAQARTEEL 219
Score = 27.3 bits (60), Expect = 7.0
Identities = 11/90 (12%), Positives = 31/90 (34%)
Query: 85 VTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRL 144
+ Q + R A Q + ++ I+ ++ + + ++ +
Sbjct: 198 IEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRER 257
Query: 145 EEKEQELKKEYSILHERYTELLKTHMEYIE 174
E + Q L+ + L + +L + Y+
Sbjct: 258 ERQLQRLETAQARLEQEVAQLEAYYQAYVR 287
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 29.3 bits (66), Expect = 1.5
Identities = 23/104 (22%), Positives = 44/104 (42%), Gaps = 8/104 (7%)
Query: 58 ESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELN 117
E L F + + + LL+++ L + E R+ + K D K +
Sbjct: 845 EVLIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELK--------IDVKSISS 896
Query: 118 SKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHER 161
K+ +LE ++ELK +SD + LE K + + + +L+
Sbjct: 897 LKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNI 940
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 29.0 bits (65), Expect = 1.7
Identities = 32/128 (25%), Positives = 55/128 (42%), Gaps = 9/128 (7%)
Query: 47 KDLMPLVVNVLESLDLAFTENQEHEVELEL------LREDNEQLVTQYEREKQLRKAADQ 100
K L + V E L EN+E E E ++ L D E V + + L A+ +
Sbjct: 294 KALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENNVAKLQA---LVVASSE 350
Query: 101 KLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHE 160
+L ++ E + L + SL+ R E +++ D + +L K +EL+ E +
Sbjct: 351 RLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTKEQ 410
Query: 161 RYTELLKT 168
Y +LL
Sbjct: 411 LYKQLLDE 418
Score = 28.6 bits (64), Expect = 2.7
Identities = 23/79 (29%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 89 EREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKE 148
+ E+ K + +L +AL++ EL S IE + S ++ L K K ++ + + ++
Sbjct: 259 DTEEDRTKEREAEL----EALQEQIDELESSIEEVLSEIKALASKIKQVNEELTTVRQEN 314
Query: 149 QELKKEYSILHERYTELLK 167
+EL++EY I +R ELL
Sbjct: 315 EELEEEYKI-KKRTVELLP 332
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 28.7 bits (65), Expect = 1.8
Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 5/62 (8%)
Query: 94 LRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKK 153
+RKA + Q + LE + K+L + E LE V LE K + R EE+ Q +K
Sbjct: 112 MRKALQAE--QGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEK---REEEERQIEEK 166
Query: 154 EY 155
+
Sbjct: 167 RH 168
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 29.1 bits (66), Expect = 1.9
Identities = 15/53 (28%), Positives = 29/53 (54%), Gaps = 3/53 (5%)
Query: 82 EQLVTQYEREKQLRKAADQKLFQIED---ALEDDKKELNSKIESLESIVRMLE 131
E+ + ++L K +++L +IE LE++ EL ++I+ LE + LE
Sbjct: 75 EKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLE 127
Score = 28.7 bits (65), Expect = 2.4
Identities = 20/79 (25%), Positives = 33/79 (41%), Gaps = 9/79 (11%)
Query: 47 KDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYERE-KQLRKAADQKLFQI 105
K + +VV VL+ EVE EL + E+L + E +L + ++L +I
Sbjct: 176 KGYVYVVVVVLK--------ELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEI 227
Query: 106 EDALEDDKKELNSKIESLE 124
E E +EL +
Sbjct: 228 EKERESLLEELKELAKKYL 246
Score = 28.7 bits (65), Expect = 2.6
Identities = 18/101 (17%), Positives = 42/101 (41%), Gaps = 19/101 (18%)
Query: 84 LVTQYEREKQLRKAADQKLFQIEDALEDDKKE-----------LNSKIESLESIV----- 127
+ + +++ L + + IED E+ E L+ ++ L S +
Sbjct: 12 VTLKSYKDEVLEALHELGVVHIEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNP 71
Query: 128 ---RMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
++ K+ + + +EE+ ++++KE L E +EL
Sbjct: 72 LREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISEL 112
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 29.0 bits (65), Expect = 2.0
Identities = 20/78 (25%), Positives = 33/78 (42%)
Query: 63 AFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIES 122
A +E E + EL+ L+ +L +YE EK + Q L + +LE E +
Sbjct: 358 AVSEVGELKAELKALKARYNELQERYEDEKDRLRNEIQNLAEKLLSLEKSSHEDQERSGD 417
Query: 123 LESIVRMLELKSKNASDH 140
LE +R L + +
Sbjct: 418 LEKELRKLTHVAGESQGS 435
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.0 bits (65), Expect = 2.1
Identities = 25/100 (25%), Positives = 50/100 (50%)
Query: 66 ENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
+ ELE L E + L + E+ ++ + + +L ++ + + K L +++ LE
Sbjct: 309 GLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEE 368
Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
+ LE + + A + + +LEE QELK+E + L E+
Sbjct: 369 RLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEI 408
Score = 27.8 bits (62), Expect = 5.5
Identities = 26/89 (29%), Positives = 39/89 (43%)
Query: 65 TENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLE 124
+E ELE +E E+L + + L A + + LE +EL KIE LE
Sbjct: 235 EALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLE 294
Query: 125 SIVRMLELKSKNASDHVYRLEEKEQELKK 153
+ R +E + LEE E+ L+K
Sbjct: 295 ELEREIEELEEELEGLRALLEELEELLEK 323
>gnl|CDD|227506 COG5179, TAF1, Transcription initiation factor TFIID, subunit TAF1
[Transcription].
Length = 968
Score = 28.9 bits (64), Expect = 2.2
Identities = 30/167 (17%), Positives = 63/167 (37%), Gaps = 8/167 (4%)
Query: 12 HDDSHIVMSEKVQSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHE 71
+D + S V + +I +F + + D V ++S+DL +
Sbjct: 225 YDFDLVFNSNYVAPIKNNIIYDFSRAGGRMKRDTVTT------EFVDSIDLLIATDDWER 278
Query: 72 VELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLE 131
+ +D + + + K L I DA + K + + E ++ +
Sbjct: 279 YVIVDEADDMKTPRKSFYKLKSLTDVDWLYDEDIIDAKLKESKAPLNLND--EKLLLKRQ 336
Query: 132 LKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIERTKL 178
K K V+ ++ ELK +++I +++Y +LK + R L
Sbjct: 337 AKPKKKQQEVFSDDKPLNELKGKFNISNDKYYGILKKTHQSKVRNTL 383
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 28.6 bits (65), Expect = 2.4
Identities = 30/164 (18%), Positives = 78/164 (47%), Gaps = 16/164 (9%)
Query: 19 MSEKVQSLAGSIYQEFEKMITKYD-EDVVKDLMPLVVNVLESLDLAFTENQEHEVELELL 77
EK + + I Q ++ + + V+ + + L+ A +N+E + E++ +
Sbjct: 280 AEEKNEEIQERIDQLYDILEREVKARKYVEKNSD---TLPDFLEHAKEQNKELKEEIDRV 336
Query: 78 RE----DNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKK---ELNSKIESLESIVRML 130
++ + +L + + EKQL ++ +++ +I + + + + EL ++E + + +
Sbjct: 337 KQSYTLNESELESVRQLEKQL-ESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEI 395
Query: 131 ELKSKNASDHVYRLEEKEQELK---KEY-SILHERYTELLKTHM 170
E + + S+ + L + E E + + Y + LHE L K+++
Sbjct: 396 EKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRYLEKSNL 439
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth
in archaea or eukaryotes, but is found in mitochondria
or chloroplasts of some eukaryotic species.
Length = 179
Score = 28.0 bits (63), Expect = 2.4
Identities = 20/67 (29%), Positives = 35/67 (52%), Gaps = 7/67 (10%)
Query: 89 EREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKE 148
ER K+L K A + + + A+ + +++ N KI+ LE + + + D V + EE
Sbjct: 104 ERRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEK-------EKEISEDEVKKAEEDL 156
Query: 149 QELKKEY 155
Q+L EY
Sbjct: 157 QKLTDEY 163
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 28.6 bits (64), Expect = 2.4
Identities = 21/94 (22%), Positives = 41/94 (43%), Gaps = 4/94 (4%)
Query: 75 ELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKS 134
L + QLV + +LR+ +L QI +L ++ L + +LE + L+ +
Sbjct: 306 TELGAKHPQLVALEAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRL 365
Query: 135 KNASDHVYRLEEKEQELKKEYSILHERYTELLKT 168
+L+ + +EL++E Y LL+
Sbjct: 366 SKLP----KLQVQLRELEREAEAARSLYETLLQR 395
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 28.6 bits (64), Expect = 2.4
Identities = 32/131 (24%), Positives = 50/131 (38%), Gaps = 23/131 (17%)
Query: 81 NEQLVTQYEREKQLRKAADQKL------FQIEDALEDDKKELNSKIESLESIVR------ 128
E+ + ++ + A D++L KK +I+ LESIVR
Sbjct: 310 AEEKMRMLKKARSALDACDRELEEKAREVSELKMERQRKKP---QIDELESIVRLKQAEA 366
Query: 129 -MLELKSKNASDHVYRLEE----KEQELKKEY--SILHERYTELLKTHMEYIERTKLMM- 180
M +LK+ A RL+ K ++ ++EY L R +E + E KL
Sbjct: 367 DMFQLKADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEE 426
Query: 181 GSGSSDRMDSR 191
SD M R
Sbjct: 427 SHRDSDSMKMR 437
>gnl|CDD|116408 pfam07794, DUF1633, Protein of unknown function (DUF1633). This
family contains sequences derived from a group of
hypothetical proteins expressed by Arabidopsis thaliana.
These sequences are highly similar and the region
concerned is about 100 residues long.
Length = 698
Score = 28.7 bits (63), Expect = 2.8
Identities = 25/85 (29%), Positives = 41/85 (48%), Gaps = 8/85 (9%)
Query: 76 LLREDNEQLVTQYEREKQLRKAADQKLFQIE------DALEDDKKELNSKIESLESIVRM 129
+RE++ L +RE +R A+ + IE +D +K KI +E
Sbjct: 459 AIREEDPHLGADQDRE--VRFGAEGIVPGIERLKIELSTSKDLEKGYAEKIGFMEMEFGG 516
Query: 130 LELKSKNASDHVYRLEEKEQELKKE 154
LE + A + ++RLEEK+ EL K+
Sbjct: 517 LEADKQMARNQIHRLEEKKDELSKK 541
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 27.8 bits (62), Expect = 2.8
Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 4/88 (4%)
Query: 91 EKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQE 150
KQ ++L +E+ + + + EL E L+ + +L+ + + LEE +E
Sbjct: 84 LKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLE----ERLESLEESIKE 139
Query: 151 LKKEYSILHERYTELLKTHMEYIERTKL 178
L KE L + E ++ E +ER +
Sbjct: 140 LAKELRELRQDLREEVEELREELERLQE 167
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 28.5 bits (65), Expect = 2.9
Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 18/74 (24%)
Query: 90 REK-QLRKAADQKLFQIEDAL--------EDDKKELNSKIESLESIVRMLELKSKNASDH 140
+E + R AD ++Q E L D+K+++ + I+ L+ LK ++
Sbjct: 523 KELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAAIKELKEA-----LKGEDKEA- 576
Query: 141 VYRLEEKEQELKKE 154
++ K +EL +
Sbjct: 577 ---IKAKTEELTQA 587
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 28.5 bits (64), Expect = 2.9
Identities = 14/79 (17%), Positives = 32/79 (40%)
Query: 82 EQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASDHV 141
+Q+ + ++ + + ++E L+ + E+ S L L+ K +D
Sbjct: 41 KQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLN 100
Query: 142 YRLEEKEQELKKEYSILHE 160
RL E + +++ L E
Sbjct: 101 ARLNALEVQEREQRRRLAE 119
>gnl|CDD|184695 PRK14471, PRK14471, F0F1 ATP synthase subunit B; Provisional.
Length = 164
Score = 27.8 bits (62), Expect = 3.2
Identities = 34/118 (28%), Positives = 59/118 (50%), Gaps = 11/118 (9%)
Query: 47 KDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQ--YEREKQLRKAADQKLFQ 104
K ++ V +S+ A +E E++ L+ DNE+L+ + ER+ L++A + K
Sbjct: 31 KPILGAVKEREDSIKNALASAEEARKEMQNLQADNERLLKEARAERDAILKEAREIKEKM 90
Query: 105 IEDALEDDKKELNSKIES------LESIVRMLELKSKNASDHVYRLEEKEQELKKEYS 156
I DA E+ + E + IE E M E+K++ A+ +E E+ L+KE S
Sbjct: 91 IADAKEEAQVEGDKMIEQAKASIESEKNAAMAEIKNQVAN---LSVEIAEKVLRKELS 145
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 27.3 bits (61), Expect = 3.6
Identities = 28/110 (25%), Positives = 48/110 (43%), Gaps = 14/110 (12%)
Query: 65 TENQEHEVELELLREDNEQLV-------TQYEREKQLRKAADQKLFQIEDALEDDKKELN 117
+ ++ E +++ L+ED E+ +YERE ++L + L + KKE+
Sbjct: 17 EKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLNELKKEIA 76
Query: 118 SKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
ES EL S EE+++ L+ E S L +R EL +
Sbjct: 77 QLKAEAESA--QAELSEAEES-----WEEQKKMLEDELSELEKRIEELNE 119
>gnl|CDD|217135 pfam02601, Exonuc_VII_L, Exonuclease VII, large subunit. This
family consist of exonuclease VII, large subunit
EC:3.1.11.6 This enzyme catalyzes exonucleolytic
cleavage in either 5'->3' or 3'->5' direction to yield
5'-phosphomononucleotides. This exonuclease VII enzyme
is composed of one large subunit and 4 small ones.
Length = 295
Score = 28.0 bits (63), Expect = 3.8
Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 11/86 (12%)
Query: 75 ELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIE--------SLESI 126
L ++ ++L ER L+ + + L Q E+ L + + L S I+ LE +
Sbjct: 165 NRLEKEQDRLNLLRER---LKSLSRRLLEQHEERLAELRDRLISSIQNLLSRKQSRLERL 221
Query: 127 VRMLELKSKNASDHVYRLEEKEQELK 152
+ EL+ + ++ +LK
Sbjct: 222 ILNRELEKNSLLENKLATANLTAQLK 247
>gnl|CDD|183759 PRK12804, PRK12804, flagellin; Provisional.
Length = 301
Score = 28.0 bits (62), Expect = 3.8
Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 121 ESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELL 166
E + +R LE+ SKNA D + ++ E L + +SIL +R EL+
Sbjct: 50 EKMRGQIRGLEMASKNAQDGISLIQTAEGALTETHSIL-QRVRELV 94
>gnl|CDD|224202 COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and
metabolism].
Length = 533
Score = 28.0 bits (63), Expect = 3.8
Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 16/78 (20%)
Query: 40 KYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNE---QLVTQYER----EK 92
+ ED +++L L LE+L LA + L+ D E +LV + +R E+
Sbjct: 435 AFSEDGLEELDALFALTLENLRLAIS---------VLVTGDLELARRLVERKKRVRRLER 485
Query: 93 QLRKAADQKLFQIEDALE 110
+ K +L ++E
Sbjct: 486 RSSKRHLDRLRDGAASVE 503
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 27.9 bits (63), Expect = 3.8
Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 3/67 (4%)
Query: 60 LDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAAD---QKLFQIEDALEDDKKEL 116
LD E ++E++ LR E +YE E LR++A+ L + D + +L
Sbjct: 63 LDELTNERARLQLEIDNLRLAAEDFREKYEDELNLRQSAENDIVGLRKDLDEATLARVDL 122
Query: 117 NSKIESL 123
KIESL
Sbjct: 123 EMKIESL 129
>gnl|CDD|227318 COG4985, COG4985, ABC-type phosphate transport system, auxiliary
component [Inorganic ion transport and metabolism].
Length = 289
Score = 28.0 bits (62), Expect = 3.9
Identities = 19/89 (21%), Positives = 38/89 (42%), Gaps = 3/89 (3%)
Query: 80 DNEQLVTQYEREKQLRKA---ADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKN 136
N Q + E++L+K + Q++ +E + +NS++E L R L+L +
Sbjct: 152 SNGQELDGDPLERELQKRLLEVETLRDQVDKMVEQQVRVINSQLERLRLEKRRLQLNGQL 211
Query: 137 ASDHVYRLEEKEQELKKEYSILHERYTEL 165
+ ++ EL+K + L L
Sbjct: 212 DDEFQQHYVAEKSELQKRLAQLQTELDAL 240
>gnl|CDD|235871 PRK06835, PRK06835, DNA replication protein DnaC; Validated.
Length = 329
Score = 27.9 bits (63), Expect = 4.0
Identities = 21/91 (23%), Positives = 46/91 (50%), Gaps = 9/91 (9%)
Query: 92 KQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRM-----LELKSKNASDHVYRLEE 146
++ K +++ + E L++ K+E+ KI + I ++L S+ + + EE
Sbjct: 7 SEILKEYEKRRDKEELELKNRKEEVYKKIPEIAEIDDEIAKLGIKL-SRAILKNPDKKEE 65
Query: 147 KEQELKKEYSILHERYTELLKTH---MEYIE 174
+ELK++ + L + ELL ++ +Y+E
Sbjct: 66 TLKELKEKITDLRVKKAELLVSNGYPPDYLE 96
>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
mechanisms].
Length = 352
Score = 27.7 bits (62), Expect = 4.2
Identities = 9/51 (17%), Positives = 20/51 (39%)
Query: 78 REDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVR 128
RE+ ++ + + AA Q LE + +++ S E+ +
Sbjct: 150 REELDRARAALQAAEAALAAAQAAQKQNLALLESEVSGAQAQVASAEAALD 200
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 28.1 bits (63), Expect = 4.3
Identities = 28/104 (26%), Positives = 53/104 (50%), Gaps = 6/104 (5%)
Query: 73 ELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLEL 132
ELE L+E+ E+L + E + ++ ++K+ ++E+ +E EL +IE LE V+ L+
Sbjct: 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIE----ELKKEIEELEEKVKELKE 287
Query: 133 KSKNASDHVYRLEEKEQELKKEYSI--LHERYTELLKTHMEYIE 174
+ A +++ E E+ L + I R E + E I+
Sbjct: 288 LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIK 331
Score = 27.0 bits (60), Expect = 9.4
Identities = 29/107 (27%), Positives = 59/107 (55%), Gaps = 6/107 (5%)
Query: 68 QEHEVELELLREDNEQLVTQYEREKQLRKAADQ--KLFQIEDALEDDKKELNSKIESLES 125
+E E +E L+++ E+L + + K+L++ A++ KL + + D+ +E+ ++ LE
Sbjct: 262 RELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE 321
Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEY 172
+ +E + K + RLEE +++LK+ L +R EL + H Y
Sbjct: 322 EINGIEERIKELEEKEERLEELKKKLKE----LEKRLEELEERHELY 364
>gnl|CDD|216035 pfam00632, HECT, HECT-domain (ubiquitin-transferase). The name
HECT comes from Homologous to the E6-AP Carboxyl
Terminus.
Length = 298
Score = 27.7 bits (62), Expect = 5.1
Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 3/73 (4%)
Query: 41 YDEDVVKDLMPL--VVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAA 98
D ++ + L L + E L L FT + E E +EL+ + VT +E+ +
Sbjct: 83 VDPELYRSLKSLLELDGDEEELCLTFTVSGEGEGTIELIPNGSNIPVTNENKEEYVNLYV 142
Query: 99 DQKLF-QIEDALE 110
D +L I+ LE
Sbjct: 143 DYRLNESIKKQLE 155
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 27.5 bits (61), Expect = 5.3
Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 9/82 (10%)
Query: 82 EQLVTQY---EREKQLRKAADQKLFQ----IED--ALEDDKKELNSKIESLESIVRMLEL 132
E+L ++ E+E+ K KL I + AL+ + EL + E L + LE
Sbjct: 32 ERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEK 91
Query: 133 KSKNASDHVYRLEEKEQELKKE 154
+ + + L+E++++L+ E
Sbjct: 92 EDDDLDGELVELQEEKEQLENE 113
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 27.7 bits (62), Expect = 5.3
Identities = 24/143 (16%), Positives = 57/143 (39%), Gaps = 17/143 (11%)
Query: 26 LAGSIYQEFEKMITKYDEDVVKDLMPL------VVNVLESLDLAFTENQEHEVELELLRE 79
L+GS + E +I + ++ L + L +D + E ++ L E
Sbjct: 370 LSGSELTQLEVLIQQVKRELQDAKSQLLKELRELEEELAEVDKKISTIPSEE-QIAQLLE 428
Query: 80 DNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLESIVRMLELKSKNASD 139
+ + + R + + ++L +++A+E +K L+ K + + + +
Sbjct: 429 ELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTK---------QKINAFELE 479
Query: 140 HVYRLEEKEQELKKEY-SILHER 161
+ +K ++ KE+ L ER
Sbjct: 480 RAITIADKAKKTLKEFREKLLER 502
>gnl|CDD|143395 cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase
superfamily. The aldehyde dehydrogenase superfamily
(ALDH-SF) of NAD(P)+-dependent enzymes, in general,
oxidize a wide range of endogenous and exogenous
aliphatic and aromatic aldehydes to their corresponding
carboxylic acids and play an important role in
detoxification. Besides aldehyde detoxification, many
ALDH isozymes possess multiple additional catalytic and
non-catalytic functions such as participating in
metabolic pathways, or as binding proteins, or
osmoregulants, to mention a few. The enzyme has three
domains, a NAD(P)+ cofactor-binding domain, a catalytic
domain, and a bridging domain; and the active enzyme is
generally either homodimeric or homotetrameric. The
catalytic mechanism is proposed to involve cofactor
binding, resulting in a conformational change and
activation of an invariant catalytic cysteine
nucleophile. The cysteine and aldehyde substrate form an
oxyanion thiohemiacetal intermediate resulting in
hydride transfer to the cofactor and formation of a
thioacylenzyme intermediate. Hydrolysis of the
thioacylenzyme and release of the carboxylic acid
product occurs, and in most cases, the reduced cofactor
dissociates from the enzyme. The evolutionary
phylogenetic tree of ALDHs appears to have an initial
bifurcation between what has been characterized as the
classical aldehyde dehydrogenases, the ALDH family
(ALDH) and extended family members or aldehyde
dehydrogenase-like (ALDH-L) proteins. The ALDH proteins
are represented by enzymes which share a number of
highly conserved residues necessary for catalysis and
cofactor binding and they include such proteins as
retinal dehydrogenase, 10-formyltetrahydrofolate
dehydrogenase, non-phosphorylating glyceraldehyde
3-phosphate dehydrogenase,
delta(1)-pyrroline-5-carboxylate dehydrogenases,
alpha-ketoglutaric semialdehyde dehydrogenase,
alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
aldehyde dehydrogenase and succinate-semialdehyde
dehydrogenase. Included in this larger group are all
human, Arabidopsis, Tortula, fungal, protozoan, and
Drosophila ALDHs identified in families ALDH1 through
ALDH22 with the exception of families ALDH18, ALDH19,
and ALDH20 which are present in the ALDH-like group. The
ALDH-like group is represented by such proteins as
gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA
reductase, and coenzyme A acylating aldehyde
dehydrogenase. All of these proteins have a conserved
cysteine that aligns with the catalytic cysteine of the
ALDH group.
Length = 367
Score = 27.6 bits (62), Expect = 5.4
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 9/40 (22%)
Query: 86 TQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESLES 125
ER LRK A D LE+ ++EL + +E+LE+
Sbjct: 14 PPAERAAILRKIA--------DLLEERREEL-AALETLET 44
>gnl|CDD|219330 pfam07201, HrpJ, HrpJ-like domain. This family represents a
conserved region approximately 200 residues long within
a number of bacterial hypersensitivity response
secretion protein HrpJ and similar proteins. HrpJ forms
part of a type III secretion system through which, in
phytopathogenic bacterial species, virulence factors are
thought to be delivered to plant cells. This family also
includes the InvE invasion protein from Salmonella. This
protein is involved in host parasite interactions and
mutations in the InvE gene render Salmonella typhimurium
non-invasive. InvE S. typhimurium mutants fail to elicit
a rapid Ca2+ increase in cultured cells, an important
event in the infection procedure and internalisation of
S. typhimurium into epithelial cells. This family
includes bacterial SepL and SsaL proteins. SepL plays an
essential role in the infection process of
enterohemorrhagic Escherichia coli and is thought to be
responsible for the secretion of EspA, EspD, and EspB.
SsaL of Salmonella typhimurium is thought to be a
component of the type III secretion system.
Length = 161
Score = 26.9 bits (60), Expect = 5.5
Identities = 15/66 (22%), Positives = 27/66 (40%)
Query: 106 EDALEDDKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL 165
E+ L + ++ L S SL ++ LE + SD L Q+ + L E +
Sbjct: 41 EEKLRELRQLLKSGRSSLPQLLAYLESLFPDPSDQDLALRALLQQARPLKRALKELLEQA 100
Query: 166 LKTHME 171
L+ +
Sbjct: 101 LEQLEQ 106
>gnl|CDD|183173 PRK11519, PRK11519, tyrosine kinase; Provisional.
Length = 719
Score = 27.4 bits (61), Expect = 5.9
Identities = 20/60 (33%), Positives = 34/60 (56%), Gaps = 7/60 (11%)
Query: 98 ADQKLFQIEDALEDDKKELNSKIESL----ESIVRMLELKSKNASDHVY-RLEEKEQELK 152
A + L + ALED+K +LN ++ ++ + IVR+ + + VY +L K+QELK
Sbjct: 340 AYRTLLEKRKALEDEKAKLNGRVTAMPKTQQEIVRL--TRDVESGQQVYMQLLNKQQELK 397
>gnl|CDD|129911 TIGR00831, a_cpa1, Na+/H+ antiporter, bacterial form. The
Monovalent Cation:Proton Antiporter-1 (CPA1) Family (TC
2.A.36) The CPA1 family is a large family of proteins
derived from Gram-positive and Gram-negative bacteria,
blue green bacteria, yeast, plants and animals.
Transporters from eukaryotes have been functionally
characterized, and all of these catalyze Na+:H+
exchange. Their primary physiological functions may be
in (1) cytoplasmic pH regulation, extruding the H+
generated during metabolism, and (2) salt tolerance (in
plants), due to Na+ uptake into vacuoles. This model is
specific for the bacterial members of this family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 525
Score = 27.2 bits (60), Expect = 7.1
Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 9/85 (10%)
Query: 16 HIVMSEKVQSLAGSIYQEFEKMITKY-DEDVVKDLMPLVVNVLESL-DLAFTENQEHEVE 73
+SE + +E E++I +Y K + V L + +A E +
Sbjct: 406 RKFVSEHSE-------RELEEIIARYIAARSAKFALMKAVEQLRIVEPVARELLPELDAR 458
Query: 74 LELLREDNEQLVTQYEREKQLRKAA 98
+E LR D E+ + EK LR+ A
Sbjct: 459 IEELRADGEEKIRSGMGEKNLRRRA 483
>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain. Mitosin or
centromere-associated protein-F (Cenp-F) is found bound
across the centromere as one of the proteins of the
outer layer of the kinetochore. Most of the
kinetochore/centromere functions appear to depend upon
binding of the C-terminal par to f the molecule, whereas
the N-terminal part, here, may be a cytoplasmic player
in controlling the function of microtubules and dynein.
Length = 288
Score = 27.1 bits (59), Expect = 7.4
Identities = 24/99 (24%), Positives = 49/99 (49%), Gaps = 3/99 (3%)
Query: 60 LDLAFTENQEHEVELELLR---EDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
LD E Q+ + +L+ L + +Q V + E K +Q+L +I + LE K+++
Sbjct: 27 LDKLKKEKQQRQFQLDSLEAALQKQKQKVENEKTEGANLKRENQRLMEICEHLEKTKQKI 86
Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEY 155
+ +++ ES V E + + + +LE++ + K E
Sbjct: 87 SHELQVKESQVNFQEGQLNSCKKQIEKLEQELKRCKSEL 125
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 27.3 bits (61), Expect = 7.4
Identities = 29/111 (26%), Positives = 44/111 (39%), Gaps = 12/111 (10%)
Query: 57 LESLDLAFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKEL 116
LE LD A E +E E + +L + KQ K +++ A+E+ + E
Sbjct: 315 LEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEF 374
Query: 117 NSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLK 167
E L + L+ K S+ V KE K I+ T+LLK
Sbjct: 375 VDNAEELAKLQDELDKIVKTKSELV-----KE---KYHRGIV----TDLLK 413
>gnl|CDD|238352 cd00652, TBP_TLF, TATA box binding protein (TBP): Present in
archaea and eukaryotes, TBPs are transcription factors
that recognize promoters and initiate transcription. TBP
has been shown to be an essential component of three
different transcription initiation complexes: SL1, TFIID
and TFIIIB, directing transcription by RNA polymerases
I, II and III, respectively. TBP binds directly to the
TATA box promoter element, where it nucleates polymerase
assembly, thus defining the transcription start site.
TBP's binding in the minor groove induces a dramatic DNA
bending while its own structure barely changes. The
conserved core domain of TBP, which binds to the TATA
box, has a bipartite structure, with intramolecular
symmetry generating a saddle-shaped structure that sits
astride the DNA. New members of the TBP family, called
TBP-like proteins (TBLP, TLF, TLP) or TBP-related
factors (TRF1, TRF2,TRP), are similar to the core domain
of TBPs, with identical or chemically similar amino
acids at many equivalent positions, suggesting similar
structure. However, TLFs contain distinct, conserved
amino acids at several positions that distinguish them
from TBP.
Length = 174
Score = 26.7 bits (60), Expect = 7.5
Identities = 15/56 (26%), Positives = 23/56 (41%), Gaps = 7/56 (12%)
Query: 36 KMITKYDEDVVKDLMPLVVNVLESLDLAFTENQEHEVELELLREDNEQLVTQYERE 91
+++ K V K V N++ S DL F + LE L + + YE E
Sbjct: 78 RILQKLGFPVEKFPEFKVQNIVASCDLGF------PIRLEELALKHPE-NASYEPE 126
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 27.2 bits (61), Expect = 7.9
Identities = 27/104 (25%), Positives = 40/104 (38%), Gaps = 21/104 (20%)
Query: 71 EVELELLRED---NEQLVTQYEREKQLRKAADQKLFQIEDALEDDKKELNSKIESL--ES 125
E + +L+ D EQL Y++ +Q ++ A Q+ F AL E+ + E
Sbjct: 924 EPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAF----AL----TEVVQRRAHFSYED 975
Query: 126 IVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTH 169
ML S RLE+ EQE R E L+
Sbjct: 976 AAEMLAKNSDLNEKLRQRLEQAEQER--------TRAREQLRQA 1011
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 27.2 bits (61), Expect = 8.2
Identities = 14/59 (23%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 68 QEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIED---ALEDDKKELNSKIESL 123
+ E+ L++ E + + + L +A Q+L +E LE+ ++EL +++E L
Sbjct: 145 HALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQL 203
>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
Length = 977
Score = 27.2 bits (60), Expect = 8.7
Identities = 34/155 (21%), Positives = 65/155 (41%), Gaps = 13/155 (8%)
Query: 3 LDQETVYGTHDDSHIVMSEKVQSLAGSIYQEFEKMITKYDEDVVKDLMPLVVNVLESLDL 62
L E + + + EK +SL + +E E EDV K L PL +
Sbjct: 245 LKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSK-LSPLQYDCW----W 299
Query: 63 AFTENQEHEVELELLREDNEQLVTQYEREKQLRKAADQKLFQIEDALEDDK--KELNSKI 120
EN + L+ E+ ++ + LR D ++E +L++ K + K+
Sbjct: 300 EKVENLQD--LLDRATNQVEKAALVLDQNQDLRDKVD----KLEASLKEANVSKFSSYKV 353
Query: 121 ESLESIVRMLELKSKNASDHVYRLEEKEQELKKEY 155
E L+ +++LE + + + ++ + QE KE+
Sbjct: 354 ELLQQKLKLLEERLQASDHEIHSYIQLYQESIKEF 388
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 27.0 bits (60), Expect = 8.9
Identities = 32/141 (22%), Positives = 64/141 (45%), Gaps = 10/141 (7%)
Query: 56 VLESL-DLAFTENQEHEVELEL---LREDNEQLVTQYEREKQLRKAADQKLFQIEDALED 111
VLESL D A + + V +L ++ D + + E+ QL+KA + L + + L+
Sbjct: 182 VLESLIDAAEWKLEVERVLPQLKVTIKTDAKDWRSHLEQMHQLKKAISKSLPETKQYLKK 241
Query: 112 DKKELNSKIESLESIVRMLELKSKNASDHVYRLEEKEQELKKEYSILHERYTEL---LKT 168
++L+ +E + S + L + + +++++Y+ + +EL L
Sbjct: 242 LSQDLSKALEKISSREKHLNNQLEQLVQEYREARRTLSQVQEKYNQASQGVSELTRELNE 301
Query: 169 HMEYIERTKLMM---GSGSSD 186
E +E+ K M G+ SD
Sbjct: 302 ISEELEQVKQEMEERGASMSD 322
>gnl|CDD|227612 COG5293, COG5293, Predicted ATPase [General function prediction
only].
Length = 591
Score = 26.8 bits (59), Expect = 9.0
Identities = 26/104 (25%), Positives = 43/104 (41%), Gaps = 5/104 (4%)
Query: 76 LLREDNEQLVTQYEREKQLRKAADQK--LFQIEDALEDDK--KELNSKIESLESIVRMLE 131
L+R+ E +E + K A K F + ED K E KI+ LES ++ E
Sbjct: 154 LIRDSQENFSKVFELSHSVLKHAKWKNYPFLLHLLGEDGKCAAEYYDKIQELESKKKLAE 213
Query: 132 LKSKNASDHVYRLEEKEQELKKEYSILHERYTELLKTHMEYIER 175
L K + LEE E ++ ++++ L T + +
Sbjct: 214 LLRKTWIGSLDSLEEIETTELRKQDEVNKK-QATLNTFDFHAQD 256
>gnl|CDD|177669 PLN00035, PLN00035, histone H4; Provisional.
Length = 103
Score = 25.6 bits (56), Expect = 9.5
Identities = 8/26 (30%), Positives = 18/26 (69%)
Query: 23 VQSLAGSIYQEFEKMITKYDEDVVKD 48
V+ ++G IY+E ++ + E+V++D
Sbjct: 44 VKRISGLIYEETRGVLKIFLENVIRD 69
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.129 0.337
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,225,197
Number of extensions: 978732
Number of successful extensions: 2322
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2030
Number of HSP's successfully gapped: 576
Length of query: 205
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 113
Effective length of database: 6,857,034
Effective search space: 774844842
Effective search space used: 774844842
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.8 bits)