RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7536
(265 letters)
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 43.1 bits (101), Expect = 1e-04
Identities = 25/145 (17%), Positives = 62/145 (42%), Gaps = 4/145 (2%)
Query: 99 ESPSAAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQ 158
E E TE + ++ + + E E ETK + ESE E + +GE + + + +
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEI--PAERKGEQEGEGEIE 700
Query: 159 AESKKQVDAETKKQVDSQVETKKQFDS-QAETKKQLDSQVETKKQNDSQAETKKQNDSQA 217
A+ ++V+ + ET+ + + E + + + + + + +AE K + +++
Sbjct: 701 AKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEV-EDEGEGEAEGKHEVETEG 759
Query: 218 ETKKQNDSQAEMEKQFDSQAEAKKH 242
+ K+ + + + E +
Sbjct: 760 DRKETEHEGETEAEGKEDEDEGEIQ 784
Score = 33.8 bits (77), Expect = 0.083
Identities = 26/144 (18%), Positives = 58/144 (40%), Gaps = 3/144 (2%)
Query: 87 KKSEEILESKKIESPSAAI-ESETETNKQVDNQAETKKQLESEIETKKQLESEAETK-KQ 144
+ EE + E + E E + + + E + + E E K + E E E + K+
Sbjct: 644 RTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKE 703
Query: 145 VDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQND 204
D +GET+ + + + E + + + + E ++ + + E + + +VET+
Sbjct: 704 ADHKGETEAE-EVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRK 762
Query: 205 SQAETKKQNDSQAETKKQNDSQAE 228
+ E + + + QA
Sbjct: 763 ETEHEGETEAEGKEDEDEGEIQAG 786
Score = 33.0 bits (75), Expect = 0.17
Identities = 25/121 (20%), Positives = 61/121 (50%), Gaps = 10/121 (8%)
Query: 123 KQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESK-KQVDAETKKQVDSQVETKK 181
KQ+E + K+QL K V + G+ K ++AE ++ E ++ +++ E +
Sbjct: 606 KQIE--LWVKEQLSRRPVAK--VMALGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGE 661
Query: 182 QFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKK 241
+ +AE + + +++ E + + + AE K + + + E + + E + + +++AE +
Sbjct: 662 ESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK-----EADHKGETEAEEVE 716
Query: 242 H 242
H
Sbjct: 717 H 717
Score = 33.0 bits (75), Expect = 0.18
Identities = 30/144 (20%), Positives = 70/144 (48%), Gaps = 6/144 (4%)
Query: 99 ESPSAAIESETETNKQ-VDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDS 157
E+ + +E E ET + +++ E + E E E + + E EAE K +V+++G+ K+
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGE-EVEDEGEGEAEGKHEVETEGDRKETEHE 767
Query: 158 QAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQA 217
+ + E + +++ + + + + + + E + + + E + Q+++QA
Sbjct: 768 GETEAEGKEDED----EGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQA 823
Query: 218 ETKKQNDSQAEMEKQFDSQAEAKK 241
+ + D E E ++Q EAK+
Sbjct: 824 DDTEVKDETGEQELNAENQGEAKQ 847
Score = 31.9 bits (72), Expect = 0.45
Identities = 27/161 (16%), Positives = 70/161 (43%), Gaps = 3/161 (1%)
Query: 101 PSAAIESETETNKQVDNQAETKKQLE-SEIETKKQLESEAETKKQVDSQGETKKQVDSQA 159
P A + + + +K +AE + E E + E E + +++ E + + +
Sbjct: 620 PVAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGEN 679
Query: 160 ESKKQVDAETKKQVDSQVETK-KQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAE 218
ES+ ++ AE K + + + E + K+ D + ET+ + + ++ E + + + E
Sbjct: 680 ESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGE 739
Query: 219 TKKQND-SQAEMEKQFDSQAEAKKHLAQRYDDFMKELDYKE 258
+ +AE + + +++ + K+ + + + D E
Sbjct: 740 EVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780
Score = 29.6 bits (66), Expect = 2.1
Identities = 35/173 (20%), Positives = 83/173 (47%), Gaps = 23/173 (13%)
Query: 89 SEEILESKKIESPSAAIESETETN---KQVDNQAETKKQLESEIETK---KQLESEAETK 142
+EE+ + E+ E E ET ++V+++ E + + + E+ET+ K+ E E ET+
Sbjct: 712 AEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETE 771
Query: 143 KQV---DSQGETKKQVDSQ----AESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDS 195
+ + +GE + D + ++ +V+ E + + + E + Q ++QA+ + D
Sbjct: 772 AEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDE 831
Query: 196 QVETKKQNDSQAETKKQ----------NDSQAETKKQNDSQAEMEKQFDSQAE 238
E + ++Q E K+ + +E +++ + + E E++ + + E
Sbjct: 832 TGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEE 884
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 37.4 bits (87), Expect = 0.008
Identities = 34/198 (17%), Positives = 95/198 (47%), Gaps = 3/198 (1%)
Query: 58 LKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDN 117
LK+ + L++L+ E ++ + +++ ++++E+ +E K E E E +++
Sbjct: 234 LKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELR-EELEELQEELLE 292
Query: 118 QAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQV 177
E ++LE EI ++ E E + + + +++ + E+ K+ E + ++
Sbjct: 293 LKEEIEELEGEISLLRERLEELENELEELEERL--EELKEKIEALKEELEERETLLEELE 350
Query: 178 ETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQA 237
+ + + E ++ S + + + +A ++ + +AE + + E++++ +S
Sbjct: 351 QLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLE 410
Query: 238 EAKKHLAQRYDDFMKELD 255
E + L++R +D +EL
Sbjct: 411 ERLERLSERLEDLKEELK 428
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 35.7 bits (83), Expect = 0.019
Identities = 22/85 (25%), Positives = 38/85 (44%), Gaps = 9/85 (10%)
Query: 57 LLKDAAQVLDKLKHEEAKFNVDSDKIDSL-IKKSEEILESKKIESPSAAIESETETNKQV 115
LL ++ LDKL+ K N ++ + +K EE+++ + E IE E +
Sbjct: 51 LLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEK--IEKE------I 102
Query: 116 DNQAETKKQLESEIETKKQLESEAE 140
E +LE+EI+ +Q E
Sbjct: 103 KELEEEISELENEIKELEQEIERLE 127
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 34.7 bits (79), Expect = 0.046
Identities = 40/189 (21%), Positives = 82/189 (43%), Gaps = 3/189 (1%)
Query: 69 KHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESE 128
K EEAK + K KK E L+ K+ E A E + + AE K+ E +
Sbjct: 1614 KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
Query: 129 IETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAE 188
+ ++ + E +K+ + + + +AE K+ +AE KK+ + E KK +
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAE---ELKKAEEENKI 1730
Query: 189 TKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKKHLAQRYD 248
++ + E K+ +A+ ++ + K+ + + E + + +A ++ L + +
Sbjct: 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDE 1790
Query: 249 DFMKELDYK 257
E+D K
Sbjct: 1791 KRRMEVDKK 1799
Score = 33.2 bits (75), Expect = 0.15
Identities = 37/176 (21%), Positives = 77/176 (43%), Gaps = 1/176 (0%)
Query: 67 KLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLE 126
K K EEAK ++ K ++ E +K E+ E + +AE KK+ +
Sbjct: 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAD 1394
Query: 127 SEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDS-QVETKKQFDS 185
+ ++ + +A+ K+ + + + +AE KK+ D KK ++ + + K+
Sbjct: 1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454
Query: 186 QAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKK 241
+A+ ++ + E K+ D + ++ E KK+ + + + AEAKK
Sbjct: 1455 EAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKK 1510
Score = 33.2 bits (75), Expect = 0.18
Identities = 39/176 (22%), Positives = 74/176 (42%), Gaps = 5/176 (2%)
Query: 66 DKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQL 125
D+LK E K D K KK++E KK E A E++ + + KK+
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEA--KKKAEEAKKADEAKKKAEEAKKKADAAKKKA 1338
Query: 126 ESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDS 185
E + + ++EAE E K + + + + + A+ K+ + + +
Sbjct: 1339 EEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKK 1398
Query: 186 QAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKK 241
+AE K+ +++ +A+ K+ +AE KK+ D + ++ EAKK
Sbjct: 1399 KAEEDKKKADELKKAAAAKKKADEAKK---KAEEKKKADEAKKKAEEAKKADEAKK 1451
Score = 32.0 bits (72), Expect = 0.33
Identities = 38/184 (20%), Positives = 78/184 (42%), Gaps = 8/184 (4%)
Query: 66 DKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQL 125
+K K +EAK + K D KK+EE ++ + + + + + + K+ +A+ +
Sbjct: 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEA 1347
Query: 126 ESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDA-----ETKKQVDSQVETK 180
E+EA +K ++ + ++ +KK+ + E KK+ + +
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407
Query: 181 KQFDSQAETKKQLDS---QVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQA 237
+ A KK+ D + E KK+ D + ++ E KK+ + + E+
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
Query: 238 EAKK 241
EAKK
Sbjct: 1468 EAKK 1471
Score = 31.3 bits (70), Expect = 0.63
Identities = 35/187 (18%), Positives = 81/187 (43%), Gaps = 8/187 (4%)
Query: 66 DKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQL 125
+ K EEAK K D+ KK+EE KK + E + + ++ A KK+
Sbjct: 1369 AEKKKEEAK-----KKADAAKKKAEEK---KKADEAKKKAEEDKKKADELKKAAAAKKKA 1420
Query: 126 ESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDS 185
+ + ++ + E KK+ + + + E+KK +A+ K + + + K+
Sbjct: 1421 DEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAE 1480
Query: 186 QAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKKHLAQ 245
+A+ + + E K+ +A+ + +A+ K+ + + ++ ++ K A+
Sbjct: 1481 EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540
Query: 246 RYDDFMK 252
+ ++ K
Sbjct: 1541 KAEEKKK 1547
Score = 29.7 bits (66), Expect = 1.8
Identities = 40/182 (21%), Positives = 87/182 (47%), Gaps = 4/182 (2%)
Query: 60 DAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQA 119
+A + + K +EAK + K D L KK+EE+ ++++ + A ++E + N +
Sbjct: 1526 EAKKAEEAKKADEAKKAEEKKKADEL-KKAEELKKAEEKKKAEEAKKAEEDKNMALRKAE 1584
Query: 120 ETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVET 179
E KK E+ IE +L E + K +++ + ++ ++ K + E KK+V+ +
Sbjct: 1585 EAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAE---EEKKKVEQLKKK 1641
Query: 180 KKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEA 239
+ + +AE K+ + + + K +++ + + ++ K + D + E EA
Sbjct: 1642 EAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701
Query: 240 KK 241
KK
Sbjct: 1702 KK 1703
Score = 29.7 bits (66), Expect = 2.1
Identities = 44/187 (23%), Positives = 85/187 (45%), Gaps = 6/187 (3%)
Query: 58 LKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDN 117
K A+ K + K K D+ KK+EE K E+ A E+ + + +
Sbjct: 1307 AKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE--AKKAAEAAKAEAEAAADEAEAAEE 1364
Query: 118 QAETKKQLESEIETK-KQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKK--QVD 174
+AE ++ + E + K + +AE KK+ D + ++ +A+ K+ A KK +
Sbjct: 1365 KAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAK 1424
Query: 175 SQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFD 234
+ E KK+ D +A+ K + + + K+ +A+ ++ +AE K+ D + ++
Sbjct: 1425 KKAEEKKKAD-EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAK 1483
Query: 235 SQAEAKK 241
EAKK
Sbjct: 1484 KADEAKK 1490
Score = 29.3 bits (65), Expect = 2.4
Identities = 25/151 (16%), Positives = 62/151 (41%)
Query: 108 ETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDA 167
E + ++ E KK+ E + + + E KK+ D+ + ++ AE+ K
Sbjct: 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
Query: 168 ETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQA 227
+ ++ E + + + E K+ + K + +A+ K+ + + K +A
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413
Query: 228 EMEKQFDSQAEAKKHLAQRYDDFMKELDYKE 258
K+ +A+ K ++ D+ K+ + +
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAK 1444
Score = 29.3 bits (65), Expect = 2.8
Identities = 30/151 (19%), Positives = 69/151 (45%), Gaps = 3/151 (1%)
Query: 66 DKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQL 125
+K K EE K + +KI + + + + KK E A E E + + + +AE K+
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA 1704
Query: 126 ESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDS 185
E E KK+ E + +++ E K +A+ + + D + ++ E KK+
Sbjct: 1705 E---ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAH 1761
Query: 186 QAETKKQLDSQVETKKQNDSQAETKKQNDSQ 216
+ +++ ++ +K+ + E ++++ +
Sbjct: 1762 LKKEEEKKAEEIRKEKEAVIEEELDEEDEKR 1792
Score = 27.8 bits (61), Expect = 8.7
Identities = 36/160 (22%), Positives = 75/160 (46%), Gaps = 2/160 (1%)
Query: 59 KDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQ 118
K A+ L K + E+ K K KK+EE+ ++++ AA E++ + +
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678
Query: 119 AETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVE 178
E KK E E + + L+ EAE K+ + + + + +AE K+ + E K + + + +
Sbjct: 1679 -EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE-EAK 1736
Query: 179 TKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAE 218
+ + D + + + D + + K + + E KK + + E
Sbjct: 1737 KEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 34.2 bits (79), Expect = 0.054
Identities = 28/162 (17%), Positives = 66/162 (40%), Gaps = 23/162 (14%)
Query: 83 DSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETK 142
LIK + + ES+K + Q++ KQLE+E+ L+++ +
Sbjct: 125 PELIKGQQSLFESRK-----------STLRAQLELILAQIKQLEAELAG---LQAQLQAL 170
Query: 143 KQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQ 202
+Q + + + E+++++ +K + S++E + +AE + +L +
Sbjct: 171 RQ------QLEVISEELEARRKL---KEKGLVSRLELLELERERAEAQGELGRLEAELEV 221
Query: 203 NDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKKHLA 244
Q + + Q E + + E+ + AE ++ L
Sbjct: 222 LKRQIDELQLERQQIEQTFREEVLEELTEAQARLAELRERLN 263
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 33.8 bits (77), Expect = 0.086
Identities = 36/175 (20%), Positives = 66/175 (37%), Gaps = 18/175 (10%)
Query: 59 KDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKK-----------IESPSAAIES 107
+D A + KL++ N ++ ++EE+ E + E+ SA ++
Sbjct: 856 QDLAVLERKLQNPSFVQNAPPAVVEKDRARAEELREKRGKLEAHRAMLSGSEANSARRDT 915
Query: 108 ETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESK-KQVD 166
N+Q Q + + E +K V + E +Q + S ++V
Sbjct: 916 MEIQNEQKPTQDGPAAEAQPAQEN----TVVESAEKAVAAVSEAAQQAATAVASGIEKVA 971
Query: 167 AETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKK 221
+K V V KK ++A KK++ + KK +A KK + KK
Sbjct: 972 EAVRKTVRRSV--KKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAKK 1024
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 33.8 bits (77), Expect = 0.089
Identities = 21/160 (13%), Positives = 65/160 (40%), Gaps = 3/160 (1%)
Query: 83 DSLIKKSEEILESKKIESPSAAIESET---ETNKQVDNQAETKKQLESEIETKKQLESEA 139
+ + ++ + K+ ES A ++ E +K+ + + +++ + + + E
Sbjct: 191 EKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEV 250
Query: 140 ETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVET 199
K+Q S E K+ + + ++ +Q+E KK + + K ++
Sbjct: 251 RQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQ 310
Query: 200 KKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEA 239
+ + + K+ ++Q + + + + + Q ++Q +
Sbjct: 311 ESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTS 350
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 33.6 bits (78), Expect = 0.10
Identities = 30/183 (16%), Positives = 82/183 (44%), Gaps = 19/183 (10%)
Query: 76 NVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQL 135
+ KI ++++ ILE K E+ + E+ E +++ + + E E+ ++
Sbjct: 27 KIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKL---RNEFEKELRERRNE 83
Query: 136 ESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDS 195
+ E + + ++ +D + E ++ + E +K+ + +++ + + E ++L
Sbjct: 84 LQKLEKRLL-----QKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIE 138
Query: 196 QVETKKQNDS---QAETKKQNDSQAETKKQNDS-----QAEMEKQFDSQAEAKKHLA--- 244
+ + + S E K+ + E + ++++ + E E + ++ +AK+ LA
Sbjct: 139 EQLQELERISGLTAEEAKEILLEKVEEEARHEAAVLIKEIEEEAKEEADKKAKEILAQAI 198
Query: 245 QRY 247
QR
Sbjct: 199 QRC 201
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 33.5 bits (76), Expect = 0.11
Identities = 23/142 (16%), Positives = 53/142 (37%), Gaps = 4/142 (2%)
Query: 108 ETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDA 167
+T + + + + Q +S ++KQL++ A Q+ SQ K +Q E + Q A
Sbjct: 150 QTRLKTLAEQRRQLEAQAQSLQASQKQLQASAT---QLKSQVLDLKLRSAQIEQEAQNLA 206
Query: 168 ETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQA 227
++ E + + A+ Q Q + + +Q + + ++ +
Sbjct: 207 TRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQRLET 266
Query: 228 EMEKQFDSQAEAKKHLAQRYDD 249
+ A+ + + Q Y
Sbjct: 267 AQARLEQEVAQLEAY-YQAYVR 287
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 33.0 bits (76), Expect = 0.19
Identities = 25/134 (18%), Positives = 55/134 (41%), Gaps = 18/134 (13%)
Query: 46 KRYNYFPAASVLLKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAI 105
K +++ P V +D +L L+ E + +S+ + E+++ E
Sbjct: 125 KDWDFKPGPFVPPEDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQEL----- 179
Query: 106 ESETETNKQVDNQAETKKQLESEIE--TKKQLESEAETKKQVDSQGETKKQVDSQAESKK 163
E +++LE+++E +K E+ E K++ K++ QA +
Sbjct: 180 ---VALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKR-------KEITDQAAKRL 229
Query: 164 QVD-AETKKQVDSQ 176
++ ET+ +D Q
Sbjct: 230 ELSEEETRILIDQQ 243
Score = 32.6 bits (75), Expect = 0.25
Identities = 26/111 (23%), Positives = 50/111 (45%), Gaps = 7/111 (6%)
Query: 117 NQAETKKQLESEIET-KKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDS 175
+ L+ E+ T K+QLE +A K Q + E ++Q + A + E K+Q
Sbjct: 139 DPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQ-ELVALEGLAAELEEKQQ--- 194
Query: 176 QVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQ 226
++E + + + + + + + K+ D A K+ S+ ET+ D Q
Sbjct: 195 ELEAQLEQLQEKAAETSQERKQKRKEITDQAA--KRLELSEEETRILIDQQ 243
>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
Length = 639
Score = 32.5 bits (74), Expect = 0.24
Identities = 27/112 (24%), Positives = 48/112 (42%), Gaps = 11/112 (9%)
Query: 77 VDSDKIDSLIKKSEEILESKKIESPSAAIESETETN---------KQVDNQAETKKQLES 127
V SD I+ + L K A + S T+ + KQV +A +
Sbjct: 74 VYSDNSSKTIEPDSQDLLLDKRGEHKARVLSATDDDTHSQTDDIIKQVTQKAGQDDSDQQ 133
Query: 128 EIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQV--DSQV 177
E ++ Q S+AE+ + V +T ++VD + + D +T + V D++V
Sbjct: 134 EKNSQSQSASQAESLEHVQQSAQTSEKVDEKEPLLTKTDKQTDQTVMPDARV 185
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 31.9 bits (73), Expect = 0.36
Identities = 33/171 (19%), Positives = 84/171 (49%), Gaps = 10/171 (5%)
Query: 78 DSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQLES 137
S ++++ + E + +++E+ +E E E ++ +AE + E IE ++
Sbjct: 463 GSPHVETIEEDRERV---EELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERRE 519
Query: 138 EAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQV 197
+ E + + + ET ++ +AE + E +++++ E K++ ++AE + + +++
Sbjct: 520 DLE--ELIAERRETIEEKRERAEELR----ERAAELEAEAEEKREAAAEAEEEAE-EARE 572
Query: 198 ETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKKHLAQRYD 248
E + N AE K++ +S + + A+ E + + E ++ LA+ D
Sbjct: 573 EVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELND 623
Score = 29.6 bits (67), Expect = 2.2
Identities = 20/114 (17%), Positives = 48/114 (42%), Gaps = 10/114 (8%)
Query: 53 AASVLLKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETN 112
L ++ +L + ++A + + L + EE+ + + +A N
Sbjct: 287 RLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAA-----QAHN 341
Query: 113 KQVDNQAETKKQLESEIETKKQ----LESEAE-TKKQVDSQGETKKQVDSQAES 161
++ ++ E LE E ++ LESE E ++ V+ + E ++++ + E
Sbjct: 342 EEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEE 395
>gnl|CDD|204349 pfam09949, DUF2183, Uncharacterized conserved protein (DUF2183).
This domain, found in various hypothetical bacterial
proteins, has no known function.
Length = 99
Score = 29.8 bits (68), Expect = 0.42
Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 1/45 (2%)
Query: 3 RYNYFPAASVLLKDAGSKSPSGSGSSLSHLKTSGIKKLLAALMKR 47
N FP +LL+D G S + S K I+K+L R
Sbjct: 21 ERNGFPKGPLLLRDWGP-SLTSFFKSGHEHKRDAIRKILRDFPDR 64
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 31.3 bits (71), Expect = 0.50
Identities = 30/171 (17%), Positives = 73/171 (42%), Gaps = 13/171 (7%)
Query: 55 SVLLKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQ 114
SVLL A + ++ L + ++ + + ILE E A+++ E +
Sbjct: 257 SVLLGKALEAMELLASNGVALSEEA------LDLLQSILEKLSSELDEASLKKLQEAIDK 310
Query: 115 VDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVD 174
++ + L E+E KQ + + + GE+ ++ S+ ++ +V++ + V+
Sbjct: 311 SESSELDEDLLLEEVEKSKQ-----KFEPLLAQYGESFQEWRSKLQALAKVESRSLLSVE 365
Query: 175 SQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQND--SQAETKKQN 223
++ T ++ D + ++Q E +K + + ++Q D Q +
Sbjct: 366 ERLPTIEKEDLELYEQRQQLWFFENRKLWQRKKKLREQADEEYQQRHATRA 416
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 31.6 bits (72), Expect = 0.51
Identities = 27/182 (14%), Positives = 66/182 (36%), Gaps = 13/182 (7%)
Query: 58 LKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDN 117
L A + +L+ ++ ++ +++ E LE E S E E + +
Sbjct: 290 LYALANEISRLEQQKQILRERLANLERQLEELEAQLE----ELESKLDELAEELAELEEK 345
Query: 118 QAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQV 177
E L+ E+E+ + E E + + + + + K A+ + Q+ S
Sbjct: 346 LEE----LKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV--AQLELQIASL- 398
Query: 178 ETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQA 237
+ + ++L+ + E +Q + K + E + + + E ++ +
Sbjct: 399 --NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEEL 456
Query: 238 EA 239
E
Sbjct: 457 ER 458
Score = 30.4 bits (69), Expect = 1.1
Identities = 25/146 (17%), Positives = 61/146 (41%), Gaps = 3/146 (2%)
Query: 106 ESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGE--TKKQVDSQAESKK 163
E E ++++ ++LE ++E + SE E + + + Q E S+ E +K
Sbjct: 246 EELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIE-ELQKELYALANEISRLEQQK 304
Query: 164 QVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQN 223
Q+ E ++ Q+E + + E+K ++ + + + ++ +AE ++
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELE 364
Query: 224 DSQAEMEKQFDSQAEAKKHLAQRYDD 249
E+E + + E + L +
Sbjct: 365 AELEELESRLEELEEQLETLRSKVAQ 390
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 31.3 bits (71), Expect = 0.55
Identities = 31/172 (18%), Positives = 63/172 (36%), Gaps = 11/172 (6%)
Query: 75 FNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQ 134
FN++SD I E+ S+K + S + N ++ + + + +K
Sbjct: 127 FNLNSD-----ISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSK-----QDKADNQKA 176
Query: 135 LESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLD 194
S + Q + K + + +T Q S + + DS ++
Sbjct: 177 PSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQY 236
Query: 195 SQVETKKQNDSQAETKKQNDSQAETKK-QNDSQAEMEKQFDSQAEAKKHLAQ 245
S+ K Q D +++KK + TK Q +Q E++ + + + Q
Sbjct: 237 SEDAKKTQKDYASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFENDVNQ 288
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 31.5 bits (71), Expect = 0.60
Identities = 25/153 (16%), Positives = 56/153 (36%), Gaps = 1/153 (0%)
Query: 76 NVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQL 135
N D D + L +E SK + +E+ E ++ D + + Q E +E L
Sbjct: 3984 NEDLDLPEDLKLDEKEGDVSKDSDLEDMDMEAADENKEEADAEKDEPMQDEDPLEENNTL 4043
Query: 136 ESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDS 195
+ + + D + +K + E Q + E+ + + + + + E + +
Sbjct: 4044 DEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTED-GVKSDEELEQGEVPEDQAIDNH 4102
Query: 196 QVETKKQNDSQAETKKQNDSQAETKKQNDSQAE 228
K + AE ++N + + + E
Sbjct: 4103 PKMDAKSTFASAEADEENTDKGIVGENEELGEE 4135
Score = 30.4 bits (68), Expect = 1.3
Identities = 31/182 (17%), Positives = 73/182 (40%), Gaps = 17/182 (9%)
Query: 66 DKLKHEEAKFNVDSDKID------SLIKKSEEILESKKIESPSAAIESETETNKQVDNQA 119
++ +E++ N +SD + +L K + E ++ S I+ E + + Q +N
Sbjct: 3920 EQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQ 3979
Query: 120 ETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVET 179
+ + ++ +L+ + +G+ K D + + E K++ D++ +
Sbjct: 3980 PPPENEDLDLPEDLKLDEK---------EGDVSKDSD-LEDMDMEAADENKEEADAEKDE 4029
Query: 180 KKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEA 239
Q + E LD ++ +D + +K N+ E Q + + E S E
Sbjct: 4030 PMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEE-STEDGVKSDEEL 4088
Query: 240 KK 241
++
Sbjct: 4089 EQ 4090
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 30.8 bits (70), Expect = 0.69
Identities = 26/134 (19%), Positives = 58/134 (43%), Gaps = 11/134 (8%)
Query: 75 FNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQ 134
+ + K+DS + +L+ + +++ A E K + + K+ + ++ K+
Sbjct: 137 KSSELKKVDSQLPDPRALLKGEDLKTVETA----EEIEKSLKAMQQKWKKRKKDLPNKQD 192
Query: 135 LESEAETKKQVDSQGETK-KQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQL 193
LE E KK++++ + K + K++ D K+ + + K + KKQL
Sbjct: 193 LE---EYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQL 249
Query: 194 DSQ---VETKKQND 204
+ ++ QND
Sbjct: 250 KADLAELKKAPQND 263
>gnl|CDD|227668 COG5377, COG5377, Phage-related protein, predicted endonuclease
[DNA replication, recombination, and repair].
Length = 319
Score = 30.6 bits (69), Expect = 0.73
Identities = 16/77 (20%), Positives = 32/77 (41%), Gaps = 5/77 (6%)
Query: 107 SETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQ--GETKKQVDSQAESKKQ 164
++TE K VD + L K+ + + K+++++ E K+
Sbjct: 216 AKTEAVKTVDLPHFYAEALAVLHSIKRDISELEDFAKEIENRLKNELKEAEYGFTA---D 272
Query: 165 VDAETKKQVDSQVETKK 181
DA K ++V+TK+
Sbjct: 273 YDAVWKPVDSNRVDTKR 289
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 31.1 bits (70), Expect = 0.75
Identities = 20/151 (13%), Positives = 50/151 (33%), Gaps = 1/151 (0%)
Query: 103 AAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESK 162
A +++ E + K +T+ Q E + +++V KK + A
Sbjct: 677 AELQARLERLTHTQSDIAIAKAALDAAQTR-QKVLERQYQQEVTECAGLKKDLKRAAMLS 735
Query: 163 KQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQ 222
++V + K+ + ++ +Q D V+ ++ + K+ + A ++
Sbjct: 736 RKVHSIAKQGMTGALQALGAAHFPQVAPEQHDDIVDIERIEHRRQLQKRIDAVNARLRRL 795
Query: 223 NDSQAEMEKQFDSQAEAKKHLAQRYDDFMKE 253
+ + A D + E
Sbjct: 796 REEIIGRMSDAKKEDTAALSEVGAELDDIPE 826
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 31.0 bits (71), Expect = 0.75
Identities = 39/223 (17%), Positives = 86/223 (38%), Gaps = 34/223 (15%)
Query: 61 AAQVLDKLKHEEAKFNVDSDKIDSLIKKSEE---------------------ILESKKIE 99
A ++LD+L+ E A ++I L+K+ + L+ IE
Sbjct: 196 AREILDQLEEELAALEQIMEEIPELLKELQTELPDQLQELKAGYRELVEEGYHLDHLDIE 255
Query: 100 SPSAAIESETETNKQ------VDNQAETKKQLESEIET-KKQLESEAETKKQVDS-QGET 151
++ + + N +D E ++++ I+ LE E + +K V+
Sbjct: 256 KEIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQLYDILEREVKARKYVEKNSDTL 315
Query: 152 KKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKK 211
++ E K++ E + S + + +S + +KQL+S + + + ++
Sbjct: 316 PDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQE 375
Query: 212 QNDS--QAETKKQNDSQAEMEKQFDSQAEAKKHLAQRYDDFMK 252
S Q E ++ E+EK+ Q + + L D ++
Sbjct: 376 IAYSELQEELEEILKQLEEIEKE---QEKLSEMLQGLRKDELE 415
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 30.4 bits (69), Expect = 0.86
Identities = 24/132 (18%), Positives = 55/132 (41%), Gaps = 12/132 (9%)
Query: 132 KKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKK 191
+++L EAE Q + +K + E+K+ + E +Q +++ + E
Sbjct: 28 RQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEERLV 87
Query: 192 QLDSQVETKKQNDSQAETKKQNDSQAET-KKQNDSQAEMEKQFD---------SQAEAKK 241
Q + Q++ + + + Q + + + + E+EKQ D + +A+K
Sbjct: 88 QKEEQLDAR--AEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARK 145
Query: 242 HLAQRYDDFMKE 253
L + D ++E
Sbjct: 146 LLLKLLDAELEE 157
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 30.7 bits (69), Expect = 0.94
Identities = 19/107 (17%), Positives = 48/107 (44%), Gaps = 8/107 (7%)
Query: 151 TKKQVDSQAESKKQVDAETKKQVDSQVETKKQF---DSQAETKKQLDSQVETKKQNDSQA 207
Q + ++ D + + ++ +V KK+ D+ K+ SQ K ++ +
Sbjct: 11 INNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEVKK 70
Query: 208 ETKKQNDSQAE---TKKQNDSQAEME--KQFDSQAEAKKHLAQRYDD 249
TK+++ E TK+++ + + E ++ E K+ + ++ +D
Sbjct: 71 STKEESKQLLEVLKTKEEHQKEIQYEILQKTIPSFEPKESILKKLED 117
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 30.6 bits (70), Expect = 0.95
Identities = 34/147 (23%), Positives = 62/147 (42%), Gaps = 19/147 (12%)
Query: 71 EEAKFNV--DSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESE 128
EEAK + D +K++ LI LE + E A E+E + + E +++ E
Sbjct: 505 EEAKKLIGEDKEKLNELIAS----LEELERELEQKAEEAEALLKEAEKLKEELEEKKEKL 560
Query: 129 IETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQA- 187
E + +L EAE + Q + E+KK+ D K+ Q +
Sbjct: 561 QEEEDKLLEEAEKEAQ-----------QAIKEAKKEADEIIKELRQLQKGGYASVKAHEL 609
Query: 188 -ETKKQLDSQVETKKQNDSQAETKKQN 213
E +K+L+ E K++ + + K++
Sbjct: 610 IEARKRLNKANEKKEKKKKKQKEKQEE 636
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 30.5 bits (69), Expect = 0.95
Identities = 26/136 (19%), Positives = 64/136 (47%), Gaps = 5/136 (3%)
Query: 106 ESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQV 165
+ E ++ + E KQLE K++L ++ + K+ ++ + + E+ +
Sbjct: 87 QQAEELQQKQAAEQERLKQLE-----KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKA 141
Query: 166 DAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDS 225
A K + +++ + +A + + ++ E K+ ++A+ K + ++ A+ +
Sbjct: 142 AAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKK 201
Query: 226 QAEMEKQFDSQAEAKK 241
+AE E + + AEAKK
Sbjct: 202 KAEAEAKKKAAAEAKK 217
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 30.6 bits (70), Expect = 1.00
Identities = 20/101 (19%), Positives = 40/101 (39%), Gaps = 2/101 (1%)
Query: 63 QVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETK 122
+++ K+ +E + ++ + LIK+ E+ E + + E +
Sbjct: 125 KLIIKVNNEIERDHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEELEKFEAQKEEEDE 184
Query: 123 KQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKK 163
K + +E K+LE AE KKQ + + V + K
Sbjct: 185 KLAKEALEAMKKLE--AEKKKQSKNFDPKEGPVQIGKKIDK 223
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.0 bits (68), Expect = 1.1
Identities = 20/95 (21%), Positives = 42/95 (44%), Gaps = 2/95 (2%)
Query: 113 KQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQ 172
K ++ E K L+ +++ K++ E + + K + + E E KK+
Sbjct: 277 KSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKR--KLKSKFERDNEKLDAEVKEKKKE 334
Query: 173 VDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQA 207
+ + KKQ + E ++L+ Q K++N + A
Sbjct: 335 KKKEEKKKKQIERLEERIEKLEVQATDKEENKTVA 369
Score = 29.2 bits (66), Expect = 2.3
Identities = 14/89 (15%), Positives = 40/89 (44%), Gaps = 1/89 (1%)
Query: 106 ESETETNKQVDNQAETKKQLESEI-ETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQ 164
+S + +++ K+L+ I + + + + K + + E + + +K+
Sbjct: 277 KSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKK 336
Query: 165 VDAETKKQVDSQVETKKQFDSQAETKKQL 193
+ + KKQ++ E ++ + QA K++
Sbjct: 337 KEEKKKKQIERLEERIEKLEVQATDKEEN 365
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 30.2 bits (68), Expect = 1.2
Identities = 16/99 (16%), Positives = 42/99 (42%), Gaps = 3/99 (3%)
Query: 159 AESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAE 218
+ + DA+ + D Q + + ++ E + D E + ++ ++ + A
Sbjct: 223 GDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEE--DEAEATDGEGEEGEMDAA 280
Query: 219 TKKQNDSQAEMEKQFDSQAEAKKHLAQRYDDFMKELDYK 257
++ E ++ ++ E + A + + M+E+DYK
Sbjct: 281 EASEDSESDESDEDTETPGEDARP-ATPFTELMEEVDYK 318
>gnl|CDD|225661 COG3119, AslA, Arylsulfatase A and related enzymes [Inorganic ion
transport and metabolism].
Length = 475
Score = 29.8 bits (67), Expect = 1.5
Identities = 5/31 (16%), Positives = 10/31 (32%)
Query: 227 AEMEKQFDSQAEAKKHLAQRYDDFMKELDYK 257
+ + E L Y ++ LD +
Sbjct: 242 PAADGLPAEEEEDGARLMTVYAACVRYLDDQ 272
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 29.9 bits (67), Expect = 1.5
Identities = 15/99 (15%), Positives = 36/99 (36%), Gaps = 3/99 (3%)
Query: 161 SKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQA--ETKKQNDSQAE 218
+++ D + + + + + Q E + E +S+A + + +
Sbjct: 200 AEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMV 259
Query: 219 TKKQNDSQAEMEKQFDSQAEAKKHLAQRYDDFMKELDYK 257
Q+D E + ++ E + A+ + E DYK
Sbjct: 260 QSDQDDLPDESDDDSETPGEGARP-ARPFTSTGGEPDYK 297
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 29.7 bits (67), Expect = 2.0
Identities = 19/94 (20%), Positives = 33/94 (35%), Gaps = 8/94 (8%)
Query: 148 QGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQA 207
+ + K QVD+ + +T+ + + T K S + Q K + SQ
Sbjct: 856 RNDVKHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYS--------ELQNHHKTEALSQN 907
Query: 208 ETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKK 241
+ S E DS E+ K+ + K
Sbjct: 908 NKYNEEKSAQERMPGADSPEELMKRAKEYQDKHK 941
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 29.3 bits (66), Expect = 2.4
Identities = 32/169 (18%), Positives = 64/169 (37%), Gaps = 20/169 (11%)
Query: 98 IESPSAAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVD------SQGET 151
I +PS ++ E K + K + K E EA +KQ D ++
Sbjct: 25 IFAPSEEEDAAEEETKGFNADIPQPKDEGIIGDKMKAYEQEALEQKQKDKKRTLQDLADS 84
Query: 152 KKQVDSQAESKKQVDAETKKQVDSQVETKK---QFDSQAET----KKQLDSQVETKKQND 204
++ D +++ D+ ++ D ET S A ++L S E K ++
Sbjct: 85 FEKSDKESQDYSSSDSGFDEEPDEPAETAGSLRPIRSSAAAYRDINRELGSFYEYPKTDE 144
Query: 205 SQAETKK--QNDSQAETKKQ-----NDSQAEMEKQFDSQAEAKKHLAQR 246
+ ++ + +S+ T+ + A MEK ++ A+ +R
Sbjct: 145 EKELLREVEELESRLATEPSPAPELEEQLALMEKSYELAAKYMPRGQER 193
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 29.1 bits (65), Expect = 2.8
Identities = 35/144 (24%), Positives = 66/144 (45%), Gaps = 5/144 (3%)
Query: 87 KKSEEILESKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVD 146
K S +SKK S +A + T N +N+ T +E + K+L++E + +Q +
Sbjct: 372 KSSSGARDSKKDASGMSA--NGTVENCIPENKISTPSAIERLEQDIKKLQAELQQARQNE 429
Query: 147 SQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETK-KQLDSQVET--KKQN 203
S+ + + + E + D K+ + ++TK A+ K KQ +E K +
Sbjct: 430 SELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEA 489
Query: 204 DSQAETKKQNDSQAETKKQNDSQA 227
DS+ +KQ + + KK+ + A
Sbjct: 490 DSRVNAEKQLAEEKKRKKEEEETA 513
>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 548
Score = 29.0 bits (65), Expect = 2.9
Identities = 32/193 (16%), Positives = 72/193 (37%), Gaps = 10/193 (5%)
Query: 67 KLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIES-----ETETNKQVDNQAET 121
+ + E+A+ + + + +E+ +K +E+ + ETE Q +
Sbjct: 286 RREAEQAEILAEQAIQEEKAQAEQEVQHAKALEAREMRVGLIERQKETELEPQERSYFIN 345
Query: 122 KKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKK 181
Q +++ E K Q ++ ET ++ + +AE +Q + Q + +
Sbjct: 346 AAQRQAQEEAKAAANIAEAIGAQAEAAVETARETE-EAERAEQAALVAAAEAAEQEQVEI 404
Query: 182 QFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKK 241
++A + E K ++AE ++ K+ ++A + AE K
Sbjct: 405 AVRAEAAKAEAEAQAAEIK----AEAEAIREKGKAEAEAKRALAEAIQVLGDAAAAELFK 460
Query: 242 HLAQRYDDFMKEL 254
L Q + +E
Sbjct: 461 ALVQALPEVAEEA 473
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 29.1 bits (66), Expect = 3.1
Identities = 41/221 (18%), Positives = 83/221 (37%), Gaps = 30/221 (13%)
Query: 61 AAQVLDKLKHEEAKFNVDSDKIDSLIKKSEE---------------------ILESKKIE 99
A +VL KL+ E ++I L+K+ + + IE
Sbjct: 192 AREVLLKLEEETDALEQKMEEIPPLLKELQNEFPDQLEELKAGYREMTEEGYHFDHLDIE 251
Query: 100 SPSAAIESETETNK------QVDNQAETKKQLESEIET-KKQLESEAETKKQVDS-QGET 151
++ + + N +D E +++E I+T LE E + KK V+ +
Sbjct: 252 KELQDLKEQIDQNLALLEELDLDEAEEENEEIEERIDTLYDILEKEVKAKKFVEKNIDKL 311
Query: 152 KKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKK 211
++ E KQ+ E + S + + ++ E +KQL+ Q + KK
Sbjct: 312 TDFLEHAREQNKQLLLELDRLQQSYTLNEDELETVRELEKQLEELEAQYDQLVERIAEKK 371
Query: 212 QNDSQAETKKQNDSQAEMEKQFDSQAEAKKHLAQRYDDFMK 252
S+ + + + + ++E+ Q E + L D ++
Sbjct: 372 VAYSELQE-ELEEIEKQLEEIEKEQEELSESLQGLRKDELE 411
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 28.1 bits (63), Expect = 3.2
Identities = 29/120 (24%), Positives = 49/120 (40%), Gaps = 9/120 (7%)
Query: 101 PSAAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAE 160
PSA +E + QV KK +E Q E+E + + E +K++ Q
Sbjct: 12 PSADSAAENDGKLQVKPNRYEKKDIEIN-TDYLQEETELDKELFTP---EEQKKITFQKH 67
Query: 161 SKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETK 220
+K E K Q+ S E + ++ TKKQL S ++ + S + + +T
Sbjct: 68 KEKPEQEELKNQLFS--ENATENNTVKATKKQLFS---SEYEQTSSSSESTSEEETKKTS 122
>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase. The model describes
alanine--tRNA ligase. This enzyme catalyzes the reaction
(tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
pyrophosphate + AMP) [Protein synthesis, tRNA
aminoacylation].
Length = 851
Score = 28.9 bits (65), Expect = 3.3
Identities = 17/69 (24%), Positives = 34/69 (49%), Gaps = 4/69 (5%)
Query: 32 LKTSGIKKLLAALMKRY-NYFPAASVLLKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSE 90
LK + + KL+ L++ +Y+P L + V L+ EE +F ++ L +K
Sbjct: 319 LKEAFLYKLVPTLIEVLGDYYP---ELKEKQEFVEQILELEEERFAKTLERGLRLFEKLL 375
Query: 91 EILESKKIE 99
+ L K+++
Sbjct: 376 KKLGKKELD 384
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 28.4 bits (64), Expect = 3.9
Identities = 26/112 (23%), Positives = 51/112 (45%), Gaps = 19/112 (16%)
Query: 72 EAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVD----NQAETKKQLES 127
E F++D++ I + K + + + I +++ SE E + V N E KK+ E
Sbjct: 464 EVTFDIDANGILHVSAKDKGTGKEQSITITASSGLSEEEIERMVKEAEANAEEDKKRKE- 522
Query: 128 EIETKKQLESEA-ETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVE 178
EIE + +S A + +K + G+ ++ AE K++++ V
Sbjct: 523 EIEARNNADSLAYQAEKTLKEAGD-------------KLPAEEKEKIEKAVA 561
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 28.5 bits (64), Expect = 4.3
Identities = 37/219 (16%), Positives = 82/219 (37%), Gaps = 7/219 (3%)
Query: 36 GIKKLLAALMKRYNYFPAASVLLKDAAQVLDKLKHEEA-KFNVDSDKIDSLIKKSEEILE 94
+ + ++ L KR LL++ + + L EE + ++++ I E +
Sbjct: 255 KLTEEISELEKRLE---EIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIA 311
Query: 95 SKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQ 154
K+ E A E + ++D ++LE EIE +++ + + + +
Sbjct: 312 EKERELEDAE-ERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLR 370
Query: 155 VDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQND 214
+ + K+ + D + + +K E K++LD E ++ +
Sbjct: 371 AELEEVDKE-FAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAI 429
Query: 215 SQAETKKQNDSQAEMEKQFDSQAEAKKHLAQRYDDFMKE 253
+ E K N+ + E E + + + L Q D K
Sbjct: 430 AGIE-AKINELEEEKEDKALEIKKQEWKLEQLAADLSKY 467
Score = 28.5 bits (64), Expect = 4.4
Identities = 30/219 (13%), Positives = 94/219 (42%), Gaps = 22/219 (10%)
Query: 54 ASVLLKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNK 113
+++ + Q L++L+ E K +++ +L+K+ E + ++ A + +
Sbjct: 189 LDLIIDEKRQQLERLRREREK----AERYQALLKEKREYEGYELLKEKEALERQKEAIER 244
Query: 114 QVDNQAETKKQLESEIETK-KQLESEAETKKQV---------DSQGETKKQVDS----QA 159
Q+ + E ++L EI K+LE + +++ + Q K+++ A
Sbjct: 245 QLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIA 304
Query: 160 ESKKQVDAETKKQVDSQVETKK---QFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQ 216
++ + + ++ D++ K + D ++L+ ++E +++ + + +
Sbjct: 305 SLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKE 364
Query: 217 AETKKQNDSQAEMEKQFDSQAEAKKHLAQRYDDFMKELD 255
+ + + E++K+F + K ++ + +E++
Sbjct: 365 ELEDLRAELE-EVDKEFAETRDELKDYREKLEKLKREIN 402
Score = 28.1 bits (63), Expect = 6.8
Identities = 32/165 (19%), Positives = 71/165 (43%), Gaps = 6/165 (3%)
Query: 61 AAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAE 120
D K EEA ++++ S I + + E K+E + IE+ +Q N+
Sbjct: 768 EELEEDLHKLEEALNDLEARLSHSRIPEIQA--ELSKLEEEVSRIEARLREIEQKLNRLT 825
Query: 121 TKKQ-LESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVET 179
+K+ LE EI+ ++ + K+Q+ S + + ++ + E ++ E + +
Sbjct: 826 LEKEYLEKEIQELQEQR--IDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESR 883
Query: 180 KKQFDSQ-AETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQN 223
+ E + QL ++ ++Q E K++ S+ + K +
Sbjct: 884 LGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEA 928
>gnl|CDD|222859 PHA02535, P, terminase ATPase subunit; Provisional.
Length = 581
Score = 28.5 bits (64), Expect = 4.5
Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 4/56 (7%)
Query: 68 LKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIESETETNKQVDNQAETKK 123
L +E K D +ID LI++ E + ++ + +E + N V N+ + K
Sbjct: 64 LIEKENKTGGDYKEIDLLIRQHERLARVRR----YSGTGNEADLNPNVANRNKGPK 115
>gnl|CDD|147580 pfam05474, Semenogelin, Semenogelin. This family consists of
several mammalian semenogelin (I and II) proteins.
Freshly ejaculated human semen has the appearance of a
loose gel in which the predominant structural protein
components are the seminal vesicle secreted semenogelins
(Sg).
Length = 450
Score = 28.5 bits (63), Expect = 4.5
Identities = 24/131 (18%), Positives = 52/131 (39%)
Query: 106 ESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQV 165
+++ Q Q K +S +++L ++ S+G +Q + + K Q
Sbjct: 259 QTKNLNQDQEHGQKAHKISYQSSSTEERRLNHGENGVQKDVSKGSISRQTEEKIMGKSQK 318
Query: 166 DAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDS 225
T Q Q K + S + +++L+ + +++ S+ T K+ + + K QN
Sbjct: 319 QVTTPSQEQGQKANKISYQSSSTEERRLNHGEKGIQKDVSKGSTSKKTEEKIHDKSQNQV 378
Query: 226 QAEMEKQFDSQ 236
+ Q Q
Sbjct: 379 TIPSQDQRSGQ 389
>gnl|CDD|200432 TIGR04181, NHT_00031, aminotransferase, LLPSF_NHT_00031 family.
This clade of aminotransferases is a member of the
pfam01041 (DegT/DnrJ/EryC1/StrS) superfamily. The family
is named after the instance in Leptospira interrogans
serovar Lai, str. 56601, where it is the 31st gene in
the 91-gene lipopolysaccharide biosynthesis locus.
Members of this family are generally found within a
subcluster of seven or more genes including an
epimerase/dehydratase, four genes homologous to the
elements of the neuraminic (sialic) acid biosynthesis
cluster (NeuABCD) and a nucleotidyl transferase.
Together it is very likely that these enzymes direct the
biosynthesis of a nine-carbon sugar analogous to
CMP-neuraminic acid. These seven genes form the core of
the cassette, although they are often accompanied by
additional genes that may further modify the product
sugar.
Length = 359
Score = 28.3 bits (64), Expect = 4.9
Identities = 10/40 (25%), Positives = 18/40 (45%), Gaps = 5/40 (12%)
Query: 227 AEMEKQFDSQAEAKKHLAQRYDDFMKELDY----KETSGT 262
A++E Q + K+ LA+ Y +F + E +G
Sbjct: 253 AQLE-QLEEFLARKRELAEIYKEFFSGIPGVEFLPEPAGA 291
>gnl|CDD|219018 pfam06419, COG6, Conserved oligomeric complex COG6. COG6 is a
component of the conserved oligomeric golgi complex,
which is composed of eight different subunits and is
required for normal golgi morphology and localisation.
Length = 615
Score = 28.0 bits (63), Expect = 5.6
Identities = 24/98 (24%), Positives = 44/98 (44%), Gaps = 11/98 (11%)
Query: 115 VDNQAETKKQLESEIETKKQLESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVD 174
+ N E++++L +IE K+ + S GE K+ AE K++ + K +
Sbjct: 11 IQNTLESRRKLRLDIE-----------KELLKSNGEILKEFGPVAEQLKRIGEDLSKLNN 59
Query: 175 SQVETKKQFDSQAETKKQLDSQVETKKQNDSQAETKKQ 212
+ E K Q ++ + K L + T + Q E KK+
Sbjct: 60 TCEEMKSQLNNAKQDTKDLLEEASTLMEQKRQIELKKK 97
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 28.3 bits (63), Expect = 5.6
Identities = 22/79 (27%), Positives = 34/79 (43%), Gaps = 5/79 (6%)
Query: 136 ESEAETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDS 195
ESEAE K + + E KK+ + +A+ K E KK +Q E K + +Q +
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEK-----ELKKLKAAQKEAKAKLQAQQASDGTNVP 59
Query: 196 QVETKKQNDSQAETKKQND 214
+ KK E + D
Sbjct: 60 KKSEKKSRKRDVEDENPED 78
Score = 27.9 bits (62), Expect = 6.1
Identities = 33/106 (31%), Positives = 50/106 (47%), Gaps = 11/106 (10%)
Query: 113 KQVDNQAETKKQLESEIETKKQLESEA---ETKKQVDSQGETKKQVDSQAES------KK 163
+ +++AE K E E+E KK+ E +A E KK +Q E K ++ +Q S KK
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
Query: 164 QVDAETKKQV-DSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAE 208
K+ V D E D+ KK+L SQ+ K+ + S E
Sbjct: 62 SEKKSRKRDVEDENPEDFIDPDTPFGQKKRLSSQM-AKQYSPSAVE 106
>gnl|CDD|235638 PRK05896, PRK05896, DNA polymerase III subunits gamma and tau;
Validated.
Length = 605
Score = 27.9 bits (62), Expect = 5.8
Identities = 25/125 (20%), Positives = 51/125 (40%), Gaps = 8/125 (6%)
Query: 130 ETKKQLESEAETKKQVDSQGETKKQVDSQAES-KKQVDAETKKQVDSQVETKKQFDSQAE 188
K ++ + E K +SQ T K+++ ES K V +T + KK + E
Sbjct: 370 NKKIEISIKPEKKTINNSQIITTKKINIVEESNKNSVHFDTLYKTKIFYHKKKINQNNKE 429
Query: 189 TKKQLDSQVETKKQNDSQAETKKQNDSQAETKKQNDSQAEMEKQFDSQAEAKKHLAQRYD 248
+ + +E + S+ K + ++ + +K F+ E K++ Q+++
Sbjct: 430 QDIKKEELLEKEFVKKSEKIPKN-------DELLDNLELAKQKFFNKDIELSKNMLQKFN 482
Query: 249 DFMKE 253
F E
Sbjct: 483 KFKNE 487
>gnl|CDD|177506 PHA03003, PHA03003, palmytilated EEV membrane glycoprotein;
Provisional.
Length = 369
Score = 27.7 bits (62), Expect = 5.9
Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 4/37 (10%)
Query: 21 SPSGSGSSLSHLKTSGI----KKLLAALMKRYNYFPA 53
+ S +G S+S +KT G+ L L +R++ F A
Sbjct: 132 NASLTGGSISTIKTLGVYSTYPPLATDLRRRFDTFKA 168
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 28.0 bits (63), Expect = 6.0
Identities = 17/70 (24%), Positives = 34/70 (48%), Gaps = 1/70 (1%)
Query: 95 SKKIESPSAAIESETETNKQVDNQAETKK-QLESEIETKKQLESEAETKKQVDSQGETKK 153
+KK++ S E E ET+ +++ +ETK + E E T++ +E K+ D + K
Sbjct: 223 TKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDK 282
Query: 154 QVDSQAESKK 163
+ + +
Sbjct: 283 LEILKEKKDE 292
>gnl|CDD|233230 TIGR01000, bacteriocin_acc, bacteriocin secretion accessory
protein. This family represents an accessory protein
that works with the bacteriocin maturation and ABC
transport secretion protein described by TIGR01193
[Transport and binding proteins, Other].
Length = 457
Score = 27.4 bits (61), Expect = 8.1
Identities = 25/151 (16%), Positives = 58/151 (38%), Gaps = 9/151 (5%)
Query: 97 KIESPSAAIESETETNKQVDNQAETK----KQLESEIETKKQLESEAETKKQVDS---QG 149
K ++ + A E Q ++ + K + L++ I ++ + + ++ Q
Sbjct: 162 KSQTQNEAAEKTKAQLDQQISKTDQKLQDYQALKNAISNGTKVANFNPYQSLYENYQAQL 221
Query: 150 ETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQNDSQAET 209
++ D + + K + A ++Q+D ++ + Q + + QN A+
Sbjct: 222 KSASDKDQKNQVKSTILATIQQQIDQLQKSIASYQVQKAGLTKSTASNYASSQNSKLAQL 281
Query: 210 KKQ--NDSQAETKKQNDSQAEMEKQFDSQAE 238
K+Q + E N E+E + S E
Sbjct: 282 KEQQLAKVKQEITDLNQKLLELESKIKSLKE 312
>gnl|CDD|197556 smart00178, SAR, Sar1p-like members of the Ras-family of small
GTPases. Yeast SAR1 is an essential gene required for
transport of secretory proteins from the endoplasmic
reticulum to the Golgi apparatus.
Length = 184
Score = 27.2 bits (60), Expect = 8.1
Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 5/62 (8%)
Query: 49 NYFPA--ASVLLKDAAQVLDKLKHEEAKFNVDSDKIDSLIKKSEEILESKKIESPSAAIE 106
+YFP V L DA DK + E+K +D+ D + ++ KI++P AA E
Sbjct: 80 DYFPEVNGIVYLVDAY---DKERFAESKRELDALLSDEELATVPFLILGNKIDAPYAASE 136
Query: 107 SE 108
E
Sbjct: 137 DE 138
>gnl|CDD|240411 PTZ00419, PTZ00419, valyl-tRNA synthetase-like protein;
Provisional.
Length = 995
Score = 27.7 bits (62), Expect = 8.5
Identities = 22/113 (19%), Positives = 44/113 (38%), Gaps = 1/113 (0%)
Query: 90 EEILESKKIESPSAAIESETETNKQVDNQAETKKQLESEIETKKQLESEAETKKQVDSQG 149
E++ES + + A ++ +AE K ++ K + + +D +
Sbjct: 871 IELIESAENLISTLAKIGSVSVIPPIEEEAEVPKGCGFDVVDNK-VIIYLNLDEFIDLKK 929
Query: 150 ETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVETKKQ 202
E K A+ +K +++ KK E K D + +++D E KQ
Sbjct: 930 ELAKLEKKLAKLQKSLESYLKKISIPNYEDKVPEDVRKLNDEKIDELNEEIKQ 982
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 27.4 bits (61), Expect = 8.7
Identities = 18/101 (17%), Positives = 39/101 (38%), Gaps = 6/101 (5%)
Query: 140 ETKKQVDSQGETKKQVDSQAESKKQVDAETKKQVDSQVETKKQFDSQAETKKQLDSQVET 199
+++D+ T KQ+ + AE + Q + + E +K+ +Q+ +
Sbjct: 158 ARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNS 217
Query: 200 KKQNDSQ-AETKKQNDSQAETKKQNDSQAEMEKQFDSQAEA 239
+ D + E + N+S+ + + A E EA
Sbjct: 218 ELSADQKKLEELRANESRLK-----NEIASAEAAAAKAREA 253
>gnl|CDD|234923 PRK01222, PRK01222, N-(5'-phosphoribosyl)anthranilate isomerase;
Provisional.
Length = 210
Score = 27.1 bits (61), Expect = 9.7
Identities = 9/28 (32%), Positives = 12/28 (42%)
Query: 4 YNYFPAASVLLKDAGSKSPSGSGSSLSH 31
Y+ A LL DA P G+G +
Sbjct: 119 AAYYGDADGLLLDAYVGLPGGTGKTFDW 146
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.298 0.115 0.284
Gapped
Lambda K H
0.267 0.0724 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,738,750
Number of extensions: 1037228
Number of successful extensions: 2839
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2012
Number of HSP's successfully gapped: 672
Length of query: 265
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 170
Effective length of database: 6,723,972
Effective search space: 1143075240
Effective search space used: 1143075240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.6 bits)
S2: 58 (26.1 bits)