RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17541
(717 letters)
>gnl|CDD|216240 pfam01008, IF-2B, Initiation factor 2 subunit family. This family
includes initiation factor 2B alpha, beta and delta
subunits from eukaryotes, initiation factor 2B subunits
1 and 2 from archaebacteria and some proteins of unknown
function from prokaryotes. Initiation factor 2 binds to
Met-tRNA, GTP and the small ribosomal subunit. Members
of this family have also been characterized as
5-methylthioribose- 1-phosphate isomerases, an enzyme of
the methionine salvage pathway. The crystal structure of
Ypr118w, a non-essential, low-copy number gene product
from Saccharomyces cerevisiae, reveals a dimeric protein
with two domains and a putative active site cleft.
Length = 281
Score = 252 bits (646), Expect = 1e-78
Identities = 92/289 (31%), Positives = 150/289 (51%), Gaps = 8/289 (2%)
Query: 414 VVRGSNARCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAV 473
VRGS A +A L + ++ D TP +E+L A+ +L RP +VS+ NA+
Sbjct: 1 KVRGSPAIAIAALEGLLLVIQDARTP----TVAELKEQLRSAIEFLISARPTAVSLGNAI 56
Query: 474 KHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVDMAGNAICMFFHNKLANDDVILTYG 533
+++L + +A+ L E +I E A I + + D ILT+
Sbjct: 57 DRLLRIISELHSSSDVEEAKESLIEAADEFIDEIE-EARRKIGAIAAELIKDGDTILTHC 115
Query: 534 CSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQVDCSYVLLSAVSYIMRE 593
S V +L AH++G +FRVI+ + P +G+ + LV+ + + + SAV Y+M+E
Sbjct: 116 NSGTVLGVLRAAHKEGKRFRVIVTESRPRLQGRLTAKELVQAGIPVTLITDSAVGYVMQE 175
Query: 594 VSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNEL 653
V KVI+GA +L+NG + ++ GT Q++L+A+A NVP ET+KF R D +F E
Sbjct: 176 VDKVIVGADRILANGGIANKIGTYQLALLAKAHNVPFYVVAETYKFDPRFPLDEDIFIEE 235
Query: 654 GDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIVPC 702
DP E++ ++L P +D TP L+T +ITE+ ++P
Sbjct: 236 RDPEEVLYRTGVRIAPENLKVRNP---AFDYTPPELITLIITEVGVLPP 281
>gnl|CDD|224105 COG1184, GCD2, Translation initiation factor 2B subunit, eIF-2B
alpha/beta/delta family [Translation, ribosomal
structure and biogenesis].
Length = 301
Score = 221 bits (564), Expect = 2e-66
Identities = 88/312 (28%), Positives = 155/312 (49%), Gaps = 16/312 (5%)
Query: 399 VHPAIYRLGVQYATGVVRGSNARCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPAMSY 458
+ P + + + +RG++ +A A++ + D P+ +E E
Sbjct: 2 IMPEVDETAEKLKSMEIRGASWIAIAAAEALEILASDSQAPTVEELIDAIRE----LSET 57
Query: 459 LNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVDMAGNAICMF 518
L K RP +VS+ N ++ + R L + +I V+ A I
Sbjct: 58 LVKARPTAVSLGNLIRFVLR----DSSGGDKENRRQSLIKAAQEFIDR-VEKAKERIAEI 112
Query: 519 FHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQVD 578
++ + DVILT+ S V ++L TA ++G +F+VI+ + P EG+ M + L + +
Sbjct: 113 GAERIHDGDVILTHSFSKTVLEVLKTAADRGKRFKVIVTESRPRGEGRIMAKELRQSGIP 172
Query: 579 CSYVLLSAVSYIMREVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHK 638
+ ++ SAV M V KV++GA A+L+NGA++++ GT+ ++L AR VP E++K
Sbjct: 173 VTVIVDSAVGAFMSRVDKVLVGADAILANGALVNKIGTSPLALAARELRVPFYVVAESYK 232
Query: 639 FCERVQTDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELA 698
F + D LV EL DP E+ + + L +L + +D+TP + A+ITEL
Sbjct: 233 FVPKTLLDTLVEIELRDPLEVARE-------EPLGNLKVRNPAFDVTPPEYIDAIITELG 285
Query: 699 IVPCTSVPVVLR 710
I+P +S+ +LR
Sbjct: 286 IIPPSSIYRILR 297
>gnl|CDD|236282 PRK08535, PRK08535, translation initiation factor IF-2B subunit
delta; Provisional.
Length = 310
Score = 136 bits (345), Expect = 1e-35
Identities = 89/318 (27%), Positives = 153/318 (48%), Gaps = 23/318 (7%)
Query: 398 EVHPAIYRLGVQYATGVVRGSN--ARCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPA 455
EV P + + T +RG+ AR A A+K S +E F+ + A
Sbjct: 2 EVMPEVLETAEKIKTMEIRGAGRIARAAA--EALKDQAEKSDAESPEE----FKAEMRAA 55
Query: 456 MSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEVIA---TYIHEQVDMAG 512
+ L RP +VS+ NAV++ + + +T R + VI +I E + A
Sbjct: 56 ANILISTRPTAVSLPNAVRYVMRYYSG------ETVEEAR-ESVIERAEEFI-ESSENAV 107
Query: 513 NAICMFFHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRL 572
I ++ + DVI+T+ SS ++ TAHE+G VI + P +G + L
Sbjct: 108 EKIGEIGAKRIRDGDVIMTHCNSSAALSVIKTAHEQGKDIEVIATETRPRNQGHITAKEL 167
Query: 573 VKHQVDCSYVLLSAVSYIMREVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLA 632
++ + + ++ SAV Y M++V KV++GA A+ +NGAV+++ GT+Q++L A VP +
Sbjct: 168 AEYGIPVTLIVDSAVRYFMKDVDKVVVGADAITANGAVINKIGTSQIALAAHEARVPFMV 227
Query: 633 ACETHKFCERVQTDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTA 692
A ET+KF + LV E DP E++ + + P +D+TP + A
Sbjct: 228 AAETYKFSPKTLLGELVEIEERDPTEVL-PEEILAKLPGVKVRNP---AFDVTPPEYIDA 283
Query: 693 VITELAIVPCTSVPVVLR 710
+ITE+ +P +++
Sbjct: 284 IITEIGAIPPEMAYTIIK 301
>gnl|CDD|188057 TIGR00511, ribulose_e2b2, ribose-1,5-bisphosphate isomerase, e2b2
family. The delineation of this family was based
originally, in part, on a discussion and
neighbor-joining phylogenetic study by Kyrpides and
Woese of archaeal and other proteins homologous to the
alpha, beta, and delta subunits of eukaryotic initiation
factor 2B (eIF-2B), a five-subunit molecule that
catalyzes GTP recycling for eIF-2. Recently, Sato, et
al. assigned the function ribulose-1,5 bisphosphate
isomerase [Energy metabolism, Other].
Length = 301
Score = 126 bits (319), Expect = 3e-32
Identities = 73/276 (26%), Positives = 136/276 (49%), Gaps = 9/276 (3%)
Query: 435 DYTTPSEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARL 494
+ S+ F + A + L RP +VS+ NA+++ +++ R
Sbjct: 30 EQAEKSDAASPEEFRAEMRAAANILISTRPTAVSLPNALRYVLKYMSGA----DVETLRQ 85
Query: 495 RLKEVIATYIHEQVDMAGNAICMFFHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRV 554
+ E +I+ + + A I ++ + DVI+T+ S ++ TA E+G V
Sbjct: 86 SVIERADEFIN-RSEKAQERIGEIGAKRIRDGDVIMTHCNSEAALSVIKTAFEQGKDIEV 144
Query: 555 IIVDGSPWYEGKEMLRRLVKHQVDCSYVLLSAVSYIMREVSKVIIGAHALLSNGAVMSRA 614
I+ + P +G + L + + + ++ SA Y M+EV V++GA A+ +NGAV+++
Sbjct: 145 IVTETRPRNQGHITAKELRDYGIPVTLIVDSAARYYMKEVDHVVVGADAITANGAVINKI 204
Query: 615 GTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNELISDKSAAKNWKSLAH 674
GT+Q++L AR VP + A ET+KF + T LV E DP E++ ++ + L +
Sbjct: 205 GTSQLALAAREARVPFMVAAETYKFHPKTITGELVEIEERDPTEVLDEEDLKQ----LGN 260
Query: 675 LTPLSLTYDITPSHLVTAVITELAIVPCTSVPVVLR 710
+ + +D+TP V A+ITE +P +++
Sbjct: 261 VKVRNPAFDVTPPEYVDAIITERGQIPPEMAYTIIK 296
>gnl|CDD|233008 TIGR00524, eIF-2B_rel, eIF-2B alpha/beta/delta-related
uncharacterized proteins. This model, eIF-2B_rel,
describes half of a superfamily, where the other half
consists of eukaryotic translation initiation factor 2B
(eIF-2B) subunits alpha, beta, and delta. It is unclear
whether the eIF-2B_rel set is monophyletic, or whether
they are all more closely related to each other than to
any eIF-2B subunit because the eIF-2B clade is highly
derived. Members of this branch of the family are all
uncharacterized with respect to function and are found
in the Archaea, Bacteria, and Eukarya, although a number
are described as putative translation intiation factor
components. Proteins found by eIF-2B_rel include at
least three clades, including a set of uncharacterized
eukaryotic proteins, a set found in some but not all
Archaea, and a set universal so far among the Archaea
and closely related to several uncharacterized bacterial
proteins [Unknown function, General].
Length = 303
Score = 99.8 bits (249), Expect = 4e-23
Identities = 76/306 (24%), Positives = 131/306 (42%), Gaps = 36/306 (11%)
Query: 414 VVRGSNARCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAV 473
VVRG+ A VA + T + +E F+E L A +L RP +V++ A+
Sbjct: 15 VVRGAPAIGVAAAYGLALAARKIETDNVEE----FKEDLEKAADFLLSTRPTAVNLFWAL 70
Query: 474 KHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVDMAGN-AICMFFHNKLANDDVILTY 532
+ L N + +A+ L I E D+ N I + + D +LT+
Sbjct: 71 ERV---LNSAENGESVEEAKESLLREAIEIIEE--DLETNRKIGENGAKLIKDGDTVLTH 125
Query: 533 GCSSLVEK---------ILLTAHEKGTKFRVIIVDGSPWYEGKEM-LRRLVKHQVDCSYV 582
C++ ++ +A E G + RVI + P +G + L++ +D + +
Sbjct: 126 -CNAGALATSDYGTALGVIRSAWEDGKRIRVIACETRPRNQGSRLTAWELMQDGIDVTLI 184
Query: 583 LLSAVSYIMR--EVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFC 640
S +Y M+ E+ VI+GA + NG V ++ GT Q++++A+ F +P A F
Sbjct: 185 TDSMAAYFMQKGEIDAVIVGADRIARNGDVANKIGTYQLAVLAKEFRIPFFVAAPLSTFD 244
Query: 641 ERVQTDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSL-----TYDITPSHLVTAVIT 695
+ + E DP E+ + PL + +DITP L+ A+IT
Sbjct: 245 TKTSCGEDIVIEERDPEEVA--------QVGGVRIAPLGVKVYNPAFDITPHDLIDAIIT 296
Query: 696 ELAIVP 701
E I+
Sbjct: 297 EKGIIT 302
>gnl|CDD|169387 PRK08335, PRK08335, translation initiation factor IF-2B subunit
alpha; Validated.
Length = 275
Score = 79.8 bits (197), Expect = 2e-16
Identities = 48/176 (27%), Positives = 84/176 (47%), Gaps = 15/176 (8%)
Query: 525 NDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQVDCSYVLL 584
+ DVI+T+ SS V +IL TA KG +F+VI+ + +P YEG + L ++ +
Sbjct: 109 DGDVIITHSFSSAVLEILKTAKRKGKRFKVILTESAPDYEGLALANELEFLGIEFEVITD 168
Query: 585 SAVSYIMREVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQ 644
+ + +E + ++GA + +G V+++AGT ++L VP A ET KF ++
Sbjct: 169 AQLGLFAKEATLALVGADNVTRDGYVVNKAGTYLLALACHDNGVPFYVAAETFKFHPELK 228
Query: 645 TDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIV 700
++ + E +N +D+TP V +ITEL I+
Sbjct: 229 SEEVELVE----RPYARQGHRVRNV-----------LFDVTPWKYVRGIITELGIL 269
>gnl|CDD|233003 TIGR00512, salvage_mtnA, S-methyl-5-thioribose-1-phosphate
isomerase. The delineation of this family was based in
part on a discussion and neighbor-joining phylogenetic
study, by Kyrpides and Woese, of archaeal and other
proteins homologous to the alpha, beta, and delta
subunits of eukaryotic initiation factor 2B (eIF-2B), a
five-subunit molecule that catalyzes GTP recycling for
eIF-2. This clade is now recognized to include the
methionine salvage pathway enzyme MtnA [Amino acid
biosynthesis, Aspartate family].
Length = 331
Score = 73.0 bits (180), Expect = 7e-14
Identities = 68/287 (23%), Positives = 116/287 (40%), Gaps = 46/287 (16%)
Query: 441 EKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEVI 500
E + F+ L + YL RP +V++ A+ ++ L +A L
Sbjct: 62 EADEREEFKALLEEKLQYLVSSRPTAVNLSWALDRMRAALEAAKTVADIKEALLAE---- 117
Query: 501 ATYIHEQVDMAGN-AI----CMFFHNKLANDDVILT-----------YGCSSLVEKILLT 544
A I E+ D+ N AI +A +LT YG + V + +
Sbjct: 118 AERILEE-DLEDNRAIGENGAALIKKGVAAPLRVLTHCNTGSLATAGYGTALGV---IRS 173
Query: 545 AHEKGTKFRVIIVDGSPWYEGKEMLR-RLVKHQVDCSYVLLSAVSYIM--REVSKVIIGA 601
AHEKG V + P +G + LV+ + + + S +++M EV VI+GA
Sbjct: 174 AHEKGRLEHVYADETRPRLQGARLTAWELVQEGIPATLITDSMAAHLMKHGEVDAVIVGA 233
Query: 602 HALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNEL-- 659
+ +NG ++ GT Q++++A+ VP A T + A + E P E+
Sbjct: 234 DRIAANGDTANKIGTYQLAVLAKHHGVPFYVAAPTSTIDLETKDGAEIPIEERPPEEVTH 293
Query: 660 -----ISDK-SAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIV 700
I+ N P +D+TP+ L+T +ITE ++
Sbjct: 294 VGGVRIAPPGIDVWN--------P---AFDVTPAELITGIITEKGVI 329
>gnl|CDD|223260 COG0182, COG0182, Predicted translation initiation factor 2B
subunit, eIF-2B alpha/beta/delta family [Translation,
ribosomal structure and biogenesis].
Length = 346
Score = 68.8 bits (169), Expect = 2e-12
Identities = 67/308 (21%), Positives = 126/308 (40%), Gaps = 41/308 (13%)
Query: 414 VVRGSNA-RCVALLSAIKQMVCDYTTPSEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNA 472
VVRG+ A A +E+ E+ A L RP +V++ A
Sbjct: 45 VVRGAPAIGVAAAYGLALAARESKNDSKGEEFIEALEK----AAETLKSTRPTAVNLFWA 100
Query: 473 VKHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVDMAGN-AICMFFHNKLANDDVILT 531
+ + + + + A I E+ D+ N AI L + D +LT
Sbjct: 101 LDRMLNAAKEAI----EVKEPKESILQEAEEIAEE-DLEANRAIGENGAELLPDGDTVLT 155
Query: 532 YGCS--SLVEKILLTA-------HEKGTKFRVIIVDGSPWYEGKEMLRRL-----VKHQV 577
+ C+ +L TA HE+G RV + P+ +G RL V+ +
Sbjct: 156 H-CNAGALATVGYGTALGVIRSAHEEGKDIRVFADETRPYLQGA----RLTAWELVQDGI 210
Query: 578 DCSYVLLSAVSYIMR--EVSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACE 635
+ + +A ++M+ + VI+GA + +NG V ++ GT Q++++A+ +P A
Sbjct: 211 PVTLITDNAAGHLMQQGMIDAVIVGADRIAANGDVANKIGTYQLAVLAKHHGIPFYVAAP 270
Query: 636 THKFCERVQTDALVFNELGDPNELI---SDKSAAKNWKSLAHLTPLSLTYDITPSHLVTA 692
+++ + E DP E++ + A + ++ + +D+TP L+T
Sbjct: 271 LSTIDFELKSGEDIPIEERDPEEVLEVGGVRIAPEGVEA------YNPAFDVTPPELITG 324
Query: 693 VITELAIV 700
+ITE +
Sbjct: 325 IITEKGVF 332
>gnl|CDD|168237 PRK05772, PRK05772, translation initiation factor IF-2B subunit
alpha; Provisional.
Length = 363
Score = 59.4 bits (144), Expect = 3e-09
Identities = 62/267 (23%), Positives = 119/267 (44%), Gaps = 26/267 (9%)
Query: 452 LGPAMSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQAR-----LRLKEVIATYIHE 506
L A + L+ RP +V+++ A L + N + A+ + L +V A I E
Sbjct: 91 LTRAKTILDSARPTAVNLVWATSRM---LNKAKNTVESGNAKSVNELIELLKVEAKKIFE 147
Query: 507 QVDMAGNAICMFFHNKLANDDVILTY----------GCSSLVEKILLTAHEKGTKFRVII 556
+ A + ++ KL + D +LT G + + + L A G VI
Sbjct: 148 EEYDAEIQMGLYGLEKLNDGDTVLTQCNAGGLATGTGLGTALAPVKL-AKALGMSVSVIA 206
Query: 557 VDGSPWYEGKEM-LRRLVKHQVDCSYVLLSAVSYIM--REVSKVIIGAHALLSNGAVMSR 613
+ PW +G + + L++ + + + +AV +M V+ V++GA +L +G V ++
Sbjct: 207 PETRPWLQGSRLTVYELMEEGIKVTLITDTAVGLVMYKDMVNNVMVGADRILRDGHVFNK 266
Query: 614 AGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNELISDKSAAKNWKSLA 673
GT + +++A +P A T F + + + E DPNE+ + + + +
Sbjct: 267 IGTFKEAVIAHELGIPFYALAPTSTFDLKSDVNDVKIEER-DPNEVRTIRGVPITPEDVN 325
Query: 674 HLTPLSLTYDITPSHLVTAVITELAIV 700
P+ +D+TP +T +ITE I+
Sbjct: 326 VYNPV---FDVTPPKYITGIITEKGII 349
>gnl|CDD|235578 PRK05720, mtnA, methylthioribose-1-phosphate isomerase; Reviewed.
Length = 344
Score = 57.1 bits (139), Expect = 1e-08
Identities = 41/171 (23%), Positives = 72/171 (42%), Gaps = 30/171 (17%)
Query: 545 AHEKGTKFRVIIVDGSPWYEGKEMLRRL-----VKHQVDCSYVLLSAVSYIMRE--VSKV 597
A EKG V + P +G RL + +D + + + +++M+ + V
Sbjct: 174 AKEKGIDIHVYADETRPRLQGA----RLTAWELYQAGIDVTVITDNMAAHLMQTGKIDAV 229
Query: 598 IIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPN 657
I+GA + +NG V ++ GT Q+++ A+ VP A + + + E DP
Sbjct: 230 IVGADRIAANGDVANKIGTYQLAIAAKYHGVPFYVAAPSSTIDLTLADGKEIPIEERDPE 289
Query: 658 ELIS--------DKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIV 700
E+ + N P +D+TP+ L+T +ITE IV
Sbjct: 290 EVTEVGGVRIAPEGVKVYN--------PA---FDVTPAELITGIITEKGIV 329
>gnl|CDD|180547 PRK06371, PRK06371, translation initiation factor IF-2B subunit
alpha; Provisional.
Length = 329
Score = 54.1 bits (130), Expect = 9e-08
Identities = 43/166 (25%), Positives = 74/166 (44%), Gaps = 22/166 (13%)
Query: 545 AHEKGTKFRVIIVDGSPWYEGKEMLR-RLVKHQVDCSYVLLSAVSYIMR--EVSKVIIGA 601
AH G V + + P +G + L + +D + + +A Y MR E+ VI+GA
Sbjct: 164 AHRNGKNIFVFVDETRPRLQGARLTAWELAQEGIDHAIIADNAAGYFMRKKEIDLVIVGA 223
Query: 602 HALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNELIS 661
+ SNG ++ GT + +++A+ +P A F +++ + E D NE++
Sbjct: 224 DRIASNGDFANKIGTYEKAVLAKVNGIPFYVAAPGSTFDFSIKSGDEIPIEERDENEVLE 283
Query: 662 --------DKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAI 699
+S A+N P +D+TP+ VT ITE I
Sbjct: 284 INGCRIGPQESHARN--------P---AFDVTPNEYVTGFITEYGI 318
>gnl|CDD|180362 PRK06036, PRK06036, translation initiation factor IF-2B subunit
alpha; Provisional.
Length = 339
Score = 52.4 bits (126), Expect = 3e-07
Identities = 57/206 (27%), Positives = 95/206 (46%), Gaps = 28/206 (13%)
Query: 522 KLAND-DVILTY---GCSSLVE-----KILLTAHEKGTKFRVIIVDGSPWYEGKEMLR-R 571
KL D D +LT+ G + V+ ++ +A E+G + +VI + P +G +
Sbjct: 143 KLLEDGDTVLTHCNAGRLACVDWGTALGVIRSAVEQGKEIKVIACETRPLNQGSRLTTWE 202
Query: 572 LVKHQVDCSYVLLSAVSYIMRE--VSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVP 629
L++ + + + S +MR+ V KVI+GA + + AV ++ GT S++A+ +P
Sbjct: 203 LMQDNIPVTLITDSMAGIVMRQGMVDKVIVGADRITRD-AVFNKIGTYTHSVLAKEHEIP 261
Query: 630 --VLAACETHKFCERVQTDALVFNELGDPNELI---SDKSAAKNWKSLAHLTPLSLTYDI 684
V A T F E + + E DP+EL + A K+ P +D
Sbjct: 262 FYVAAPLSTFDF-EGWEGSVKI--EERDPDELRYCGKTQIAPKDVPVY---NP---AFDA 312
Query: 685 TPSHLVTAVITELAIV-PCTSVPVVL 709
TP VTA+ITE + P + VL
Sbjct: 313 TPMENVTAIITEKGVFYPPFLLDEVL 338
>gnl|CDD|235788 PRK06372, PRK06372, translation initiation factor IF-2B subunit
delta; Provisional.
Length = 253
Score = 46.1 bits (109), Expect = 3e-05
Identities = 43/184 (23%), Positives = 74/184 (40%), Gaps = 17/184 (9%)
Query: 525 NDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQVDCSYVLL 584
ND VI T S +++ +++ EK V I++ P EG +M + LVK +D +L
Sbjct: 86 NDSVIGTISSSQVLKA-FISSSEKIK--SVYILESRPMLEGIDMAKLLVKSGID-VVLLT 141
Query: 585 SAVSYIMRE-VSKVIIGAHALLSNGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERV 643
A V VI+G+ ++L +G ++ + GT ++L AR P + + K
Sbjct: 142 DASMCEAVLNVDAVIVGSDSVLYDGGLIHKNGTFPLALCARYLKKPFYSLTISMKIERNF 201
Query: 644 QTDALVFNELGDPNELISDKSAAKNWKSLAHLTPLSLTYDITPSHLVTAVITELAIVPCT 703
+ +E D + +D TP L+ I E V +
Sbjct: 202 LYSTYPNFKNHPCSEWNIDIPCINRY------------FDKTPPDLIDYYINENGFVKPS 249
Query: 704 SVPV 707
V +
Sbjct: 250 DVNI 253
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 47.4 bits (112), Expect = 3e-05
Identities = 37/161 (22%), Positives = 67/161 (41%), Gaps = 6/161 (3%)
Query: 196 DLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSK----AKSKSEPSKE 251
K EAKKAA+ A + + A + E K+ E K AK K+E K+
Sbjct: 1333 AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Query: 252 ANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQA 311
A+ K A+ +D K + + K + +E K + K + ++
Sbjct: 1393 ADEAKKKAE--EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAK 1450
Query: 312 KAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSK 352
K A +K + + +K+ K +++K +E++ K + K K
Sbjct: 1451 KKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
Score = 44.7 bits (105), Expect = 2e-04
Identities = 47/177 (26%), Positives = 83/177 (46%), Gaps = 18/177 (10%)
Query: 186 EKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSK 245
EK + E + KA+ + KKA +L AK K D +A + E K+ + +AK K
Sbjct: 1389 EKKKADEAKK---KAEEDKKKADELKKAAAAKKKAD-----EAKKKAEEKK-KADEAKKK 1439
Query: 246 SEPSKEANVQNKPAQ---GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQ---LKA 299
+E +K+A+ K A+ +++ K K + DE K KA+ KA
Sbjct: 1440 AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1499
Query: 300 ER-RAKQEQQRQAKAAALLEKTKTSNE--KSMTKSKTEDSKPASEKSSKTEVLKSKD 353
+ + E +++A A E+ K ++E K+ K +++K A EK E+ K+++
Sbjct: 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556
Score = 44.4 bits (104), Expect = 3e-04
Identities = 47/186 (25%), Positives = 75/186 (40%), Gaps = 7/186 (3%)
Query: 174 ASLSQVTTAIMSEKNNSAELSR--DLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQ 231
A ++ AI +E+ A+ + + K EAKKA + +AK K + D +
Sbjct: 1263 AHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKK 1322
Query: 232 P--ESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGD 289
E+K+ + K E K A A+ + D + A + KE K + D
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK-KKEEAKKKAD 1381
Query: 290 EVKSKAQ--LKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
K KA+ KA+ K+ ++ + KA L + + K K E+ K A E K E
Sbjct: 1382 AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAE 1441
Query: 348 VLKSKD 353
K D
Sbjct: 1442 EAKKAD 1447
Score = 40.1 bits (93), Expect = 0.004
Identities = 38/174 (21%), Positives = 71/174 (40%), Gaps = 15/174 (8%)
Query: 187 KNNSAELSRDLVKAQREAKKAAKL----------AAKQKAKVKGDVTAPPDAVTQPESKQ 236
AE + D +A E +AA+ AAK+KA+ K + + K
Sbjct: 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407
Query: 237 SEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQ 296
E KA + + + EA + + + + + K K + +E K KA+
Sbjct: 1408 DELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
Query: 297 LKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLK 350
E + E +++A+ A ++ K E++ K K +++K A+E K + K
Sbjct: 1468 ---EAKKADEAKKKAEEAKKADEAKKKAEEA--KKKADEAKKAAEAKKKADEAK 1516
Score = 40.1 bits (93), Expect = 0.005
Identities = 42/178 (23%), Positives = 77/178 (43%), Gaps = 14/178 (7%)
Query: 184 MSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAK 243
+K A+ + + K EAKKA + K + K D +A + E + + +AK
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKAD-----EAKKKAEEAKKKADEAK 1503
Query: 244 SKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQL-KAERR 302
+E K+A+ K + + A ++ K K + DE+K +L KAE +
Sbjct: 1504 KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEK 1563
Query: 303 AKQEQQRQA--------KAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSK 352
K E+ ++A + A +K + + + + K E+ K +E++ K E K K
Sbjct: 1564 KKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIK 1621
Score = 34.7 bits (79), Expect = 0.24
Identities = 39/173 (22%), Positives = 74/173 (42%), Gaps = 7/173 (4%)
Query: 185 SEKNNSAELSR--DLVKAQREAKKAAKLAAK-QKAKVKGDVTAPPDAVTQPESKQSEPSK 241
+ K + E + + K EAKK A+ A K +AK K + + + + + +
Sbjct: 1415 AAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADE 1474
Query: 242 AKSKSEPSKEANVQNKPAQGSQ---DTPKPGGPPATLSNVDNKENVKSEGDEV-KSKAQL 297
AK K+E +K+A+ K A+ ++ D K ++ K + DE K++
Sbjct: 1475 AKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAK 1534
Query: 298 KAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLK 350
KA+ K E++++A E+ K + EK + + + + K E K
Sbjct: 1535 KADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587
Score = 32.4 bits (73), Expect = 1.2
Identities = 33/167 (19%), Positives = 55/167 (32%), Gaps = 30/167 (17%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKS--KSEPSKEANVQN 256
K EAKKA + ++ + + A Q K E KA K+E K+A+
Sbjct: 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAK 1296
Query: 257 KPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLK--AERRAK---------Q 305
K + + K + +E K + K AE K +
Sbjct: 1297 KAEE-----------------KKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAE 1339
Query: 306 EQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSK 352
E ++ A+AA + ++ + K E K + K K
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Score = 32.0 bits (72), Expect = 1.4
Identities = 41/196 (20%), Positives = 75/196 (38%), Gaps = 12/196 (6%)
Query: 170 QTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKL--AAKQKAKVKGDVTAPPD 227
+ A + +V EK AE ++ KA+ KA +L A ++K KV+ +
Sbjct: 1588 KAEEARIEEVMKLYEEEKKMKAEEAK---KAEEAKIKAEELKKAEEEKKKVEQLKKKEAE 1644
Query: 228 AVTQPES--KQSEPSKAKSKSEPSKEANVQNKPAQGSQ-DTPKPGGPPATLSNVDNKENV 284
+ E K E +K K+ E K + K + + + + A + +
Sbjct: 1645 EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA 1704
Query: 285 ----KSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPAS 340
K E +E K +LK + + +AK A +K K K + K + +
Sbjct: 1705 EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKK 1764
Query: 341 EKSSKTEVLKSKDPNV 356
E+ K E ++ + V
Sbjct: 1765 EEEKKAEEIRKEKEAV 1780
Score = 31.3 bits (70), Expect = 2.7
Identities = 35/193 (18%), Positives = 76/193 (39%), Gaps = 8/193 (4%)
Query: 185 SEKNNSAELSRDLVKAQR--EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSK- 241
+E AE +R + ++ E +KA + A+ + +A ++K++E K
Sbjct: 1172 AEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKK 1231
Query: 242 ---AKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLK 298
AK +E +K+A + + + A E + + K++ + K
Sbjct: 1232 AEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKK 1291
Query: 299 AERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPS 358
A+ K E+++ KA +K + + + K K E++K ++ + K K
Sbjct: 1292 ADEAKKAEEKK--KADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349
Query: 359 TKKYSGVDGVKAT 371
+ + D +A
Sbjct: 1350 AEAEAAADEAEAA 1362
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 46.1 bits (109), Expect = 4e-05
Identities = 37/185 (20%), Positives = 80/185 (43%), Gaps = 15/185 (8%)
Query: 154 FISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVK-AQREAKKAAKLAA 212
I + ++ + S V T S +V A+ + RD+ +RE+++ AK
Sbjct: 157 VIPLKKNILSGNVSDVDTDSISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAK--R 214
Query: 213 KQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPP 272
Q+ K + D Q ++ ++ + K + E V+ K Q +++ PKP
Sbjct: 215 AQQLKEELDKKQIDADKAQQKADFAQDNADKQRDE------VRQK-QQEAKNLPKPADTS 267
Query: 273 ATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSK 332
+ + EN K E + KAQ++ ++ ++ + + A L++ ++EK + K
Sbjct: 268 SPKEDKQVAENQKREIE----KAQIEIKKNDEEALKAKDHKAFDLKQESKASEK-EAEDK 322
Query: 333 TEDSK 337
+++
Sbjct: 323 ELEAQ 327
Score = 30.4 bits (68), Expect = 4.0
Identities = 11/79 (13%), Positives = 23/79 (29%), Gaps = 4/79 (5%)
Query: 278 VDNKENVKSEG----DEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKT 333
D KE E ++K + K K +Q+ ++ +K
Sbjct: 201 TDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNL 260
Query: 334 EDSKPASEKSSKTEVLKSK 352
S +V +++
Sbjct: 261 PKPADTSSPKEDKQVAENQ 279
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 44.5 bits (105), Expect = 1e-04
Identities = 36/143 (25%), Positives = 54/143 (37%), Gaps = 13/143 (9%)
Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGS 262
AKKAAK A + K A A SK+ A S + +K A K A+ +
Sbjct: 198 NAKKAAKTPAAKSGAKKASAKAKSAAKKV--SKKKAAKTAVSAKKAAKTA---AKAAKKA 252
Query: 263 QDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKT 322
+ T K A + VK + KA KA + A + + +AKA K
Sbjct: 253 KKTAKKALKKA-------AKAVKKAAKKAA-KAAAKAAKGAAKATKGKAKAKKKAGKKAA 304
Query: 323 SNEKSMTKSKTEDSKPASEKSSK 345
+ K+ +K +K+ K
Sbjct: 305 AGSKAKATAKAPKRGAKGKKAKK 327
Score = 35.2 bits (81), Expect = 0.10
Identities = 30/136 (22%), Positives = 52/136 (38%), Gaps = 13/136 (9%)
Query: 198 VKAQREAKKAAKLAAKQKAKVKGDVTA-PPDAVTQPESKQSEPSKAKSKSEPSKEANVQN 256
++A AK AAK+ +K K TA + +K ++ +K +K K A
Sbjct: 209 KSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVK 268
Query: 257 KPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAAL 316
K A+ + A K+ + K+K + + A + + AKA
Sbjct: 269 KAAKKAAKAAAKAAKGAA----------KATKGKAKAKKKAGKKAAAGSKAKATAKAPKR 318
Query: 317 LEKTKTSNEKSMTKSK 332
K K + K +TK +
Sbjct: 319 GAKGKKA--KKVTKKR 332
Score = 31.8 bits (72), Expect = 1.2
Identities = 22/118 (18%), Positives = 42/118 (35%), Gaps = 3/118 (2%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
KA+ AKK +K A + A + K ++ + K+ K A K
Sbjct: 218 KAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKA 277
Query: 259 A-QGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
A + ++ K A K+ + G + K+ A+ ++ ++ K A
Sbjct: 278 AAKAAKGAAKATKGKAKAKKKAGKK--AAAGSKAKATAKAPKRGAKGKKAKKVTKKRA 333
Score = 31.4 bits (71), Expect = 1.5
Identities = 21/86 (24%), Positives = 36/86 (41%)
Query: 165 SPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTA 224
+ ++V A+ + A ++K L + ++ AKKAAK AAK
Sbjct: 232 AKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKG 291
Query: 225 PPDAVTQPESKQSEPSKAKSKSEPSK 250
A + K + SKAK+ ++ K
Sbjct: 292 KAKAKKKAGKKAAAGSKAKATAKAPK 317
Score = 30.6 bits (69), Expect = 3.0
Identities = 30/119 (25%), Positives = 43/119 (36%), Gaps = 7/119 (5%)
Query: 259 AQGSQDTPKPG-GPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALL 317
A G+ + K P A KS +V K K AK+ + AKAA
Sbjct: 193 AVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKA 252
Query: 318 EKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVP--STKKYSGVDGVKATPGT 374
+KT K K+ K A++K++K +K K G KA G+
Sbjct: 253 KKTAKKALKKAAKA----VKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGS 307
Score = 29.5 bits (66), Expect = 5.9
Identities = 34/140 (24%), Positives = 57/140 (40%), Gaps = 4/140 (2%)
Query: 142 SATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQ 201
+A + K P S A++ + S+ + + T S K + ++ KA+
Sbjct: 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAV--SAKKAAKTAAKAAKKAK 253
Query: 202 REAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQG 261
+ AKKA K AAK K + +K ++ KAK+K + K+A +K A+
Sbjct: 254 KTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKATKG-KAKAKKKAGKKAAAGSK-AKA 311
Query: 262 SQDTPKPGGPPATLSNVDNK 281
+ PK G V K
Sbjct: 312 TAKAPKRGAKGKKAKKVTKK 331
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 40.7 bits (96), Expect = 0.002
Identities = 19/130 (14%), Positives = 42/130 (32%), Gaps = 2/130 (1%)
Query: 234 SKQSEPSKAKSKSEPSKEANVQNKPAQG--SQDTPKPGGPPATLSNVDNKENVKSEGDEV 291
+ A+ +++ + ++G +++ K + + + V
Sbjct: 4 ASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMV 63
Query: 292 KSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKS 351
K K++ + AKAAA K + + SK + K A +K +K
Sbjct: 64 KDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKD 123
Query: 352 KDPNVPSTKK 361
D +
Sbjct: 124 IDVLNQADDD 133
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 40.2 bits (94), Expect = 0.003
Identities = 32/146 (21%), Positives = 63/146 (43%), Gaps = 20/146 (13%)
Query: 200 AQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPA 259
A ++A K A + AK A +A E+K++ + K+ +E K+A +
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEA----EAKRAAAAAKKAAAEAKKKAEAEAA-K 177
Query: 260 QGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEK 319
+ + + K A E K KA+ +A+++A E +++A A A
Sbjct: 178 KAAAEAKKKAEAEAA----------AKAAAEAKKKAEAEAKKKAAAEAKKKAAAEA---- 223
Query: 320 TKTSNEKSMTKSKTEDSKPASEKSSK 345
K + K+ ++K K A+ K+++
Sbjct: 224 -KAAAAKAAAEAKAAAEKAAAAKAAE 248
Score = 35.6 bits (82), Expect = 0.089
Identities = 33/145 (22%), Positives = 60/145 (41%), Gaps = 7/145 (4%)
Query: 186 EKNNSAELSRDLVKAQREAKKAA-KLAAKQKAKVKGDVTAPPDAVTQPES----KQSEPS 240
+K + E ++ Q++A++AA K AA KAK + + A + + K +
Sbjct: 116 QKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEA 175
Query: 241 KAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAE 300
K+ +E K+A + ++ K + + K+ +E +KA AE
Sbjct: 176 AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKA--AAE 233
Query: 301 RRAKQEQQRQAKAAALLEKTKTSNE 325
+A E+ AKAA K + E
Sbjct: 234 AKAAAEKAAAAKAAEKAAAAKAAAE 258
Score = 32.5 bits (74), Expect = 0.82
Identities = 28/122 (22%), Positives = 43/122 (35%), Gaps = 28/122 (22%)
Query: 199 KAQREAKK----AAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANV 254
KA EAKK A A +AK K + A A + + K +K K+ +E K+A
Sbjct: 162 KAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAA 221
Query: 255 QNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAA 314
+ K A E K+ A+ A +A ++ AA
Sbjct: 222 EAKAAA------------------------AKAAAEAKAAAEKAAAAKAAEKAAAAKAAA 257
Query: 315 AL 316
+
Sbjct: 258 EV 259
Score = 30.9 bits (70), Expect = 2.7
Identities = 18/65 (27%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 281 KENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPAS 340
++ +E + +K QL+ ER A QEQ++QA+ AA K + +K +
Sbjct: 93 QQKQAAEQERLK---QLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKA 149
Query: 341 EKSSK 345
E +K
Sbjct: 150 EAEAK 154
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 39.5 bits (92), Expect = 0.007
Identities = 46/205 (22%), Positives = 73/205 (35%), Gaps = 5/205 (2%)
Query: 171 TTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVT 230
T AS S+ T E ++VK + A K A K K + D
Sbjct: 1211 TKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRL 1270
Query: 231 QPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDE 290
+ S P+++ E K + A+ KP + +S+ D+ ++ +
Sbjct: 1271 AAYNLDSAPAQSAKMEETVKAVPARRAAAR-----KKPLASVSVISDSDDDDDDFAVEVS 1325
Query: 291 VKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLK 350
+ + + K R+ ++ AK A +K + +S K TE KPA E
Sbjct: 1326 LAERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKLLTEMLKPAEAIGISPEKKV 1385
Query: 351 SKDPNVPSTKKYSGVDGVKATPGTT 375
K P KK V G AT T
Sbjct: 1386 RKMRASPFNKKSGSVLGRAATNKET 1410
Score = 36.0 bits (83), Expect = 0.088
Identities = 30/163 (18%), Positives = 48/163 (29%), Gaps = 13/163 (7%)
Query: 201 QREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQ 260
+ +AK Q+A +G+ A Q K + K SE A
Sbjct: 1170 KEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAM 1229
Query: 261 GSQDTP---KPGGPPATLSNVDNKENVKSEGDEVKS-KAQLKAERRAKQEQQRQAKAAAL 316
+++ KP G K E DE+ K +L A Q +
Sbjct: 1230 ETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILDLKDRLAAYNLDSAPAQSAKMEETV 1289
Query: 317 LE---------KTKTSNEKSMTKSKTEDSKPASEKSSKTEVLK 350
K ++ ++ S +D A E S + K
Sbjct: 1290 KAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKK 1332
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 38.4 bits (89), Expect = 0.011
Identities = 29/132 (21%), Positives = 41/132 (31%), Gaps = 5/132 (3%)
Query: 142 SATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQ 201
SA GP P + + + V A+ S V A S + +A
Sbjct: 313 SAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVA--DRDGESTPAVEETSEAD 370
Query: 202 REAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQG 261
E ++ LA + A + D A A +P + SE P K A P
Sbjct: 371 IEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAA---PIPDPA 427
Query: 262 SQDTPKPGGPPA 273
D GP
Sbjct: 428 KPDELAVAGPGD 439
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 38.8 bits (91), Expect = 0.012
Identities = 33/149 (22%), Positives = 62/149 (41%), Gaps = 7/149 (4%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
KA REA+ K A + AK K V A V ++ ++P K+ + P A + +
Sbjct: 466 KAAREAR-HKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAARE 524
Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLE 318
A+ +Q + A + K V + + A+ KA++ A+Q +A+ +
Sbjct: 525 ARKAQARARQAEKQAAAAADPKKAAVAA------AIARAKAKKAAQQAANAEAEEEVDPK 578
Query: 319 KTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
K + + K+K + AS + +
Sbjct: 579 KAAVAAAIARAKAKKAAQQAASAEPEEQV 607
Score = 34.5 bits (80), Expect = 0.20
Identities = 29/164 (17%), Positives = 54/164 (32%), Gaps = 36/164 (21%)
Query: 186 EKNNSAELSR---DLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKA 242
++ AE ++ + +A+ E +KAA+ A +KA +A
Sbjct: 443 QEKKKAEEAKARFEARQARLEREKAAREARHKKAAEA---------------------RA 481
Query: 243 KSKSEPSKEAN--VQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAE 300
+ A V+ K A +Q G DN + + +AE
Sbjct: 482 AKDKDAVAAALARVKAKKAAATQPIVIKAGARP-----DNSAVIAAREARKAQARARQAE 536
Query: 301 RRAKQ-EQQRQAKAAALLEKTK----TSNEKSMTKSKTEDSKPA 339
++A ++A AA + + K + + D K A
Sbjct: 537 KQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKA 580
Score = 33.8 bits (78), Expect = 0.33
Identities = 27/141 (19%), Positives = 44/141 (31%), Gaps = 13/141 (9%)
Query: 182 AIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSK 241
A N AE D KA A A A K + A P+ +
Sbjct: 562 AAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAI 621
Query: 242 AKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAER 301
A++K +K+A Q P+ A ++ + K+ Q
Sbjct: 622 ARAK---AKKAEQQANAEPEEPVDPRKAAVAAAIARAKAR----------KAAQQQANAE 668
Query: 302 RAKQEQQRQAKAAALLEKTKT 322
+ E ++A AA + + K
Sbjct: 669 PEEAEDPKKAAVAAAIARAKA 689
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 36.7 bits (85), Expect = 0.031
Identities = 36/146 (24%), Positives = 58/146 (39%), Gaps = 27/146 (18%)
Query: 186 EKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSK 245
EK +AE +R QR A + A A+Q AK + + KQ+E +KAK
Sbjct: 81 EKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEE-----------KQKQAEEAKAKQA 129
Query: 246 SEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKA-----QLKAE 300
+E +A + + + + +E E K KA + +AE
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQA-----------EEEAKAKAAAEAKKKAAEAKKKAEAE 178
Query: 301 RRAKQEQQRQAKAAALLEKTKTSNEK 326
+AK E + +AKA K + + K
Sbjct: 179 AKAKAEAKAKAKAEEAKAKAEAAKAK 204
Score = 36.0 bits (83), Expect = 0.069
Identities = 28/120 (23%), Positives = 50/120 (41%), Gaps = 24/120 (20%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPES---KQSEPSKAKSKSEPSKEANVQ 255
K EAK A+ A++KAK + A +A + + K++ +K K+++E +A +
Sbjct: 127 KQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAK 186
Query: 256 NKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
K E K++ + K+KA +A +A+ E A A A
Sbjct: 187 AKAKA---------------------EEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAAEA 225
Score = 29.4 bits (66), Expect = 6.2
Identities = 23/145 (15%), Positives = 56/145 (38%), Gaps = 14/145 (9%)
Query: 205 KKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQD 264
A+ A + + + K + + E + E K ++ + ++ Q A+ +
Sbjct: 46 GAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAE--KA 103
Query: 265 TPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQE----QQRQAKAAALLEKT 320
+ K+ + + ++KA+ AE +AK E ++ + +A E+
Sbjct: 104 AKQA--------EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEE 155
Query: 321 KTSNEKSMTKSKTEDSKPASEKSSK 345
+ + K K ++K +E +K
Sbjct: 156 AKAKAAAEAKKKAAEAKKKAEAEAK 180
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 36.4 bits (84), Expect = 0.051
Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 9/159 (5%)
Query: 206 KAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDT 265
+A K K +K T P + ES + E + + E K+ KP +
Sbjct: 75 EAVKRVEKGGSKGPAAKTKPAKE-PKNESGKEEEKEKEQVKEEKKKKKE--KPKE-EPKD 130
Query: 266 PKPG-----GPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKT 320
KP P + ++ V+ D + K + + +++ ++ + K ++
Sbjct: 131 RKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEP 190
Query: 321 KTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPST 359
++ + KP ++ + D T
Sbjct: 191 PEEEKQRQAAREAVKGKPEEPDVNEEREKEEDDGKDRET 229
Score = 29.9 bits (67), Expect = 5.2
Identities = 18/122 (14%), Positives = 38/122 (31%), Gaps = 19/122 (15%)
Query: 240 SKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKA 299
K SK +K +++ G + ++ + + + + +
Sbjct: 81 EKGGSKGPAAKTK--------PAKEPKNESGKE---EEKEKEQVKEEKKKKKEKPKEEPK 129
Query: 300 ERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPST 359
+R+ K+E + ++ EK K E EK + KS+ P
Sbjct: 130 DRKPKEEAKE--------KRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKK 181
Query: 360 KK 361
K
Sbjct: 182 KP 183
Score = 29.9 bits (67), Expect = 5.7
Identities = 32/175 (18%), Positives = 64/175 (36%), Gaps = 3/175 (1%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
+ E +K K+K K K P D + E+K+ P K K K + K +++
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEE-PKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDRE 161
Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSEGD-EVKSKAQLKAERRAKQEQQRQAKAAALL 317
+ ++ + P K + E + +A +A + +E +
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE 221
Query: 318 EKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSGVDGVKATP 372
+ K E + + + ++S+ +SE S ++ K PS + K T
Sbjct: 222 DDGKDR-ETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTE 275
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 34.6 bits (80), Expect = 0.054
Identities = 15/68 (22%), Positives = 32/68 (47%), Gaps = 1/68 (1%)
Query: 289 DEVKSKAQLK-AERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
+E K K + K A++RAK+++++Q K K E+ +E+S E+ + +
Sbjct: 73 EEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDK 132
Query: 348 VLKSKDPN 355
+ +
Sbjct: 133 QEEPVEIM 140
Score = 28.1 bits (63), Expect = 9.4
Identities = 14/65 (21%), Positives = 26/65 (40%), Gaps = 1/65 (1%)
Query: 294 KAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPAS-EKSSKTEVLKSK 352
K + E+ AK+ +RQ K +K K + + S S ++ + E K +
Sbjct: 75 KKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQE 134
Query: 353 DPNVP 357
+P
Sbjct: 135 EPVEI 139
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 36.1 bits (83), Expect = 0.062
Identities = 34/154 (22%), Positives = 59/154 (38%), Gaps = 6/154 (3%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
KA AK A AKQK + +VT A + ++ + +KA + ++ +E +
Sbjct: 96 KAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTE 155
Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSEG-----DEVKSKAQLKAERRAKQEQQRQAKA 313
+ D K A + K + + + KA+ +AK +AKA
Sbjct: 156 EEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKA 215
Query: 314 AALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
AA L K K S + + +K + +K
Sbjct: 216 AA-LAKQKASQGNGDSGDEDAKAKAIAAAKAKAA 248
Score = 35.3 bits (81), Expect = 0.097
Identities = 27/162 (16%), Positives = 65/162 (40%), Gaps = 26/162 (16%)
Query: 187 KNNSAELSRDLVKAQREAKKAAKLA-AKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSK 245
K +A +++ + + AK A+++ +++ + K D +T E+K+ + AK+K
Sbjct: 11 KKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAK 70
Query: 246 SEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQ 305
+ A + + +G+++ +E ++ K+KA+ A +AK
Sbjct: 71 A-----AALAKQKREGTEEV--------------------TEEEKAKAKAKAAAAAKAKA 105
Query: 306 EQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
+ K E T+ + K+ A+ + +
Sbjct: 106 AALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKR 147
Score = 31.5 bits (71), Expect = 1.8
Identities = 35/143 (24%), Positives = 50/143 (34%), Gaps = 24/143 (16%)
Query: 192 ELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKE 251
E D KA+ +A AAK A AK K E K +KA + ++
Sbjct: 157 EEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAA 216
Query: 252 ANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQA 311
A + K +QG+ D+ GDE +KA+ A +AK +A
Sbjct: 217 ALAKQKASQGNGDS----------------------GDE-DAKAKAIAAAKAKAAAAARA 253
Query: 312 KAAALLEKTKTSNEKSMTKSKTE 334
K E K K S +
Sbjct: 254 KTKG-AEGKKEEEPKQEEPSVNQ 275
Score = 29.6 bits (66), Expect = 7.4
Identities = 25/90 (27%), Positives = 36/90 (40%), Gaps = 5/90 (5%)
Query: 187 KNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAK--- 243
K +AE + E K AK A AK K A A E +KAK
Sbjct: 182 KQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAKAKAIA 241
Query: 244 -SKSEPSKEANVQNKPAQGS-QDTPKPGGP 271
+K++ + A + K A+G ++ PK P
Sbjct: 242 AAKAKAAAAARAKTKGAEGKKEEEPKQEEP 271
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 36.1 bits (84), Expect = 0.063
Identities = 22/86 (25%), Positives = 36/86 (41%), Gaps = 7/86 (8%)
Query: 271 PPATLSNVDNKENVKSEGDEVKS--KAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSM 328
P + EN E K K Q KAE++A++E+ +A A E +
Sbjct: 391 PLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPD 450
Query: 329 TKSKTEDSKPASEKSSKTEVLKSKDP 354
++K D P EK + +++DP
Sbjct: 451 GETKKVDPDPLGEK-----LARTEDP 471
Score = 32.2 bits (74), Expect = 1.0
Identities = 18/75 (24%), Positives = 28/75 (37%), Gaps = 5/75 (6%)
Query: 183 IMSEKNNSAELSRDLVKAQREAKKAAKLAA-----KQKAKVKGDVTAPPDAVTQPESKQS 237
+ + AE + K ++ KKA K A K+KA+ PD T+
Sbjct: 401 GENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDP 460
Query: 238 EPSKAKSKSEPSKEA 252
K +P +EA
Sbjct: 461 LGEKLARTEDPLEEA 475
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 36.2 bits (84), Expect = 0.071
Identities = 30/194 (15%), Positives = 57/194 (29%), Gaps = 16/194 (8%)
Query: 173 PASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAK-VKGDVTAPPDAVTQ 231
P + S + S + S + ++ + KL K K + D Q
Sbjct: 1172 PKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQ 1231
Query: 232 PESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEV 291
+ K + + + ++ S D K G P V
Sbjct: 1232 KTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVS-----------A 1280
Query: 292 KSKAQLKAERRAKQEQQRQAKA----AALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
+ +R E +K ++K + ++ K K + K A +K SKT
Sbjct: 1281 VQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTR 1340
Query: 348 VLKSKDPNVPSTKK 361
V ++ +
Sbjct: 1341 VKQASASQSSRLLR 1354
>gnl|CDD|147051 pfam04698, MOBP_C-Myrip, Myelin-associated oligodendrocytic basic
protein (MOBP). MOBP is abundantly expressed in central
nervous system myelin, and shares several
characteristics with myelin basic protein (MBP), in
terms of regional distribution and function. This family
is the middle and C-terminal regions of MOBP which has
been shown to be essential for normal arrangement of the
radial component in central nervous system myelin. Most
member-proteins carry a FVHE-PHD type zinc-finger at
their N-terminus.
Length = 710
Score = 35.9 bits (82), Expect = 0.083
Identities = 37/155 (23%), Positives = 70/155 (45%), Gaps = 9/155 (5%)
Query: 205 KKAAKLAAKQKAKVKG--DVTAPPDAVTQPESKQSEPSKAKSKSE--PSKEANVQNKPAQ 260
+KA K A +K + G DV P A S S+P +A ++ + + P +
Sbjct: 328 RKAPKNPAAEKMPLHGELDVNFNPQAAGGETSDSSDPEEALHTADRRARRWRRARLGPEE 387
Query: 261 GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKT 320
+++ P PG L +++ SE D + ++AQ E++ + + L E
Sbjct: 388 PNKELPSPGAHLRALDTAQVSDDL-SETD-ISNEAQDPQSLTDSTEEKLRNR---LYELA 442
Query: 321 KTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPN 355
+EK + + ++S+P +E ++ E L S+D N
Sbjct: 443 MKMSEKETSSGEDQESEPKAEPENQKESLSSEDNN 477
>gnl|CDD|225711 COG3170, FimV, Tfp pilus assembly protein FimV [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 755
Score = 35.7 bits (82), Expect = 0.098
Identities = 22/140 (15%), Positives = 43/140 (30%), Gaps = 33/140 (23%)
Query: 195 RDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANV 254
+L +A+ A A + + AP + T P S P+ + +
Sbjct: 305 AELQAQLNKAQHELAQKAAPLAAAQAALDAPAETATAP-------------SAPAPQVSA 351
Query: 255 QNKPAQG-----SQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQR 309
++ PAQ + P G A + + + A + + R
Sbjct: 352 ESSPAQPGSYLLAAPGDAPLGELAQA--------------QSARERLAEESVPAAEPRSR 397
Query: 310 QAKAAALLEKTKTSNEKSMT 329
A A +E+ E ++
Sbjct: 398 LA-PVAAVEQPFAEVESPLS 416
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 35.3 bits (81), Expect = 0.10
Identities = 25/119 (21%), Positives = 40/119 (33%), Gaps = 4/119 (3%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
K + EA KA A K K + + + E + K+A + K
Sbjct: 146 KKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAE---EAKAKAEAAAAKKKAEAEAKA 202
Query: 259 AQGSQDTPKPGGPPATLSNVDN-KENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAAL 316
A A +E +E + +KA+ A + +R+A AAAL
Sbjct: 203 AAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAAL 261
Score = 34.5 bits (79), Expect = 0.17
Identities = 37/154 (24%), Positives = 60/154 (38%), Gaps = 7/154 (4%)
Query: 198 VKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNK 257
+++Q+ + K + K+K + + P A Q KQ E + K++ E K+A K
Sbjct: 67 IQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQ-EQQKQAEEAEK 125
Query: 258 PAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQA------ 311
AQ Q + A E K++ +K + AE + K E+ +A
Sbjct: 126 QAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKA 185
Query: 312 KAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSK 345
KA A K K E K + A K+ K
Sbjct: 186 KAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEK 219
Score = 31.8 bits (72), Expect = 1.2
Identities = 32/173 (18%), Positives = 73/173 (42%), Gaps = 8/173 (4%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSK---EANVQ 255
+ Q++A++A K A ++ + + A A Q K++E +KAK+ +E +K A +
Sbjct: 115 EQQKQAEEAEKQAQLEQKQQEE--QARKAAAEQ--KKKAEAAKAKAAAEAAKLKAAAEAK 170
Query: 256 NKPAQGSQDTPKPGGPPATLSNVDNKE-NVKSEGDEVKSKAQLKAERRAKQEQQRQAKAA 314
K + ++ + + E K+ ++ K++A+ KA+ K E + KAA
Sbjct: 171 KKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAA 230
Query: 315 ALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSGVDG 367
A +K + + + ++ + ++ ++ + K G
Sbjct: 231 AEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFGGLSSGKNAPKTGGGAKN 283
>gnl|CDD|197239 cd09141, PLDc_vPLD1_2_yPLD_like_2, Catalytic domain, repeat 2, of
vertebrate phospholipases, PLD1 and PLD2, yeast PLDs,
and similar proteins. Catalytic domain, repeat 2, of
vertebrate phospholipases D (PLD1 and PLD2), yeast
phospholipase D (PLD SPO14/PLD1), and other similar
eukaryotic proteins. These PLD enzymes play a pivotal
role in transmembrane signaling and cellular regulation.
They hydrolyze the terminal phosphodiester bond of
phospholipids resulting in the formation of phosphatidic
acid and alcohols. Phosphatidic acid is an essential
compound involved in signal transduction. PLDs also
catalyze the transphosphatidylation of phospholipids to
acceptor alcohols, by which various phospholipids can be
synthesized. The vertebrate PLD1 and PLD2 are membrane
associated phosphatidylinositol 4,5-bisphosphate
(PIP2)-dependent enzymes that selectively hydrolyze
phosphatidylcholine (PC). Protein cofactors and calcium
may be required for their activation. Yeast SPO14/PLD1
is a calcium-independent PLD, which needs PIP2 for its
activity. Instead of the regulatory calcium-dependent
phospholipid-binding C2 domain in plants, most mammalian
and yeast PLDs have adjacent Phox (PX) and the
Pleckstrin homology (PH) domains at the N-terminus,
which have been shown to mediate membrane targeting of
the protein and are closely linked to
polyphosphoinositide signaling. The PX and PH domains
are also present in zeta-type PLD from Arabidopsis,
which is more closely related to vertebrate PLDs than to
other plant PLD types. In addition, this subfamily also
includes some related proteins which have either PX-like
or PH domains in their N-termini. Like other members of
the PLD superfamily, the monomer of mammalian and yeast
PLDs consists of two catalytic domains, each containing
one copy of the conserved HKD motif (H-x-K-x(4)-D, where
x represents any amino acid residue). Two HKD motifs
from the two domains form a single active site. These
PLDs utilize a common two-step ping-pong catalytic
mechanism involving an enzyme-substrate intermediate to
cleave phosphodiester bonds. The two histidine residues
from the two HKD motifs play key roles in the catalysis.
Upon substrate binding, a histidine residue from one HKD
motif could function as the nucleophile, attacking the
phosphodiester bond to create a covalent
phosphohistidine intermediate, while the other histidine
residue from the second HKD motif could serve as a
general acid, stabilizing the leaving group.
Length = 183
Score = 34.5 bits (80), Expect = 0.11
Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 11/55 (20%)
Query: 503 YIHEQVDMAGNAICMFFHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIV 557
YI Q FF + +D + +LV++I+ AH++G KFRV IV
Sbjct: 25 YIENQ----------FFISSTGGEDPVKNRIGEALVDRII-RAHKEGEKFRVYIV 68
>gnl|CDD|238319 cd00570, GST_N_family, Glutathione S-transferase (GST) family,
N-terminal domain; a large, diverse group of cytosolic
dimeric proteins involved in cellular detoxification by
catalyzing the conjugation of glutathione (GSH) with a
wide range of endogenous and xenobiotic alkylating
agents, including carcinogens, therapeutic drugs,
environmental toxins and products of oxidative stress.
In addition, GSTs also show GSH peroxidase activity and
are involved in the synthesis of prostaglandins and
leukotrienes. This family, also referred to as soluble
GSTs, is the largest family of GSH transferases and is
only distantly related to the mitochondrial GSTs (GSTK
subfamily, a member of the DsbA family). Soluble GSTs
bear no structural similarity to microsomal GSTs (MAPEG
family) and display additional activities unique to
their group, such as catalyzing thiolysis, reduction
and isomerization of certain compounds. The GST fold
contains an N-terminal TRX-fold domain and a C-terminal
alpha helical domain, with an active site located in a
cleft between the two domains. Based on sequence
similarity, different classes of GSTs have been
identified, which display varying tissue distribution,
substrate specificities and additional specific
activities. In humans, GSTs display polymorphisms which
may influence individual susceptibility to diseases such
as cancer, arthritis, allergy and sclerosis. Some GST
family members with non-GST functions include
glutaredoxin 2, the CLIC subfamily of anion channels,
prion protein Ure2p, crystallins, metaxin 2 and
stringent starvation protein A.
Length = 71
Score = 31.4 bits (72), Expect = 0.17
Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 1/47 (2%)
Query: 531 TYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLRRLVKHQV 577
+ S ++ L EKG + ++ VD E +E L +V
Sbjct: 5 YFPGSPRSLRVRLALEEKGLPYELVPVDLGEG-EQEEFLALNPLGKV 50
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 34.8 bits (80), Expect = 0.17
Identities = 16/96 (16%), Positives = 39/96 (40%), Gaps = 4/96 (4%)
Query: 265 TPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSN 324
T KP A+ + ++ E++ DE++ + K ++ + ++ K+S
Sbjct: 15 TKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKP----PKKPKEASRPGTPRNPKKSSK 70
Query: 325 EKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTK 360
+ + +E+ KS+++ PST+
Sbjct: 71 PTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTE 106
Score = 33.6 bits (77), Expect = 0.37
Identities = 25/123 (20%), Positives = 45/123 (36%), Gaps = 14/123 (11%)
Query: 207 AAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSK----------EANVQN 256
AK K K + SK +E S A S+ +P+K
Sbjct: 42 EQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSK 101
Query: 257 KPAQGSQDTPKPGGPPATLSNVDNKEN----VKSEGDEVKSKAQLKAERRAKQEQQRQAK 312
P+ S++ +P P ++++ + + S S A +AE+ + QQ +A+
Sbjct: 102 APSTESEEEEEPEETPDPIASIGGWWSLWGSITSTATSTASAAVKQAEQAVNEIQQEEAQ 161
Query: 313 AAA 315
A
Sbjct: 162 LWA 164
>gnl|CDD|107248 cd01391, Periplasmic_Binding_Protein_Type_1, Type 1 periplasmic
binding fold superfamily. Type 1 periplasmic binding
fold superfamily. This model and hierarchy represent the
ligand binding domains of the LacI family of
transcriptional regulators, periplasmic binding proteins
of the ABC-type transport systems, the family C
G-protein couples receptors (GPCRs), membrane bound
guanylyl cyclases including the family of natriuretic
peptide receptors (NPRs), and the N-terminal
leucine/isoleucine/valine- binding protein (LIVBP)-like
domains of the ionotropic glutamate receptors (iGluRs).
In LacI-like transcriptional regulator and the bacterial
periplasmic binding proteins the ligands are
monosaccharides including lactose, ribose, fructose,
xylose, arabinose, galactose/glucose, and other sugars,
with a few exceptions. Periplasmic sugar binding
proteins are one of the components of ABC transporters
and are involved in the active transport of
water-soluble ligands. The LacI family of proteins
consists of transcriptional regulators related to the
lac repressor. In this case, the sugar binding domain
binds a sugar which changes the DNA binding activity of
the repressor domain. The periplasmic binding proteins
are the primary receptors for chemotaxis and transport
of many sugar based solutes. The core structures of
periplasmic binding proteins are classified into two
types, and they differ in number and order of beta
strands: type 1 has six beta strands, while type 2 has
five beta strands per sub-domain. These two structural
folds are thought to be distantly related via a common
ancestor. Notably, while the N-terminal LIVBP-like
domain of iGluRs belongs to the type 1
periplasmic-binding fold protein superfamily, the
glutamate-binding domain of the iGluR is structurally
similar to the type 2 periplasmic-binding fold.
Length = 269
Score = 33.7 bits (77), Expect = 0.24
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 3/58 (5%)
Query: 541 ILLTAHEKGTKFRVIIVDG-SPWYEGKEMLRRLVKHQVDCSYVLLSAVSYIMREVSKV 597
I L A E G VI+ D S E LR L++ VD ++ S V ++
Sbjct: 22 IELAAEEIGRGLEVILADSQSDPERALEALRDLIQQGVD--GIIGPPSSSSALAVVEL 77
>gnl|CDD|222808 PHA00666, PHA00666, putative protease.
Length = 233
Score = 33.5 bits (76), Expect = 0.32
Identities = 27/118 (22%), Positives = 48/118 (40%), Gaps = 8/118 (6%)
Query: 201 QREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQ 260
++ ++ AA + A GD AP TQ E + +P+ K E K+A +
Sbjct: 15 EQPGDGGSQPAASEPAAGAGDNPAPQGDPTQEEGDKPQPAAGADKPEGDKKA--DGDKPE 72
Query: 261 GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLE 318
+ KP G P + E +EG E+ + A E A++ +A +++
Sbjct: 73 EKKPGEKPEGAP------EKYEFQAAEGVELDTGALGAFEPVARELNLTNEQAQKVVD 124
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 33.7 bits (77), Expect = 0.33
Identities = 30/171 (17%), Positives = 52/171 (30%), Gaps = 12/171 (7%)
Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEA--NVQNKPAQ 260
K+A + + G P P K + K S P V+ K A+
Sbjct: 123 YGKQAGPITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVKVKSAK 182
Query: 261 GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKS---------KAQLKAERRAKQEQQRQA 311
+QDT K T K KS K + K E++ E +
Sbjct: 183 KTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKE 242
Query: 312 KAAALLEKT-KTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKK 361
++ K ++S + ++ + E E S++ K+
Sbjct: 243 ESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKE 293
Score = 30.2 bits (68), Expect = 4.3
Identities = 37/217 (17%), Positives = 69/217 (31%), Gaps = 14/217 (6%)
Query: 155 ISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQ 214
+ T+P+ + T L V A S + + + K K
Sbjct: 122 EYGKQAGPITNPNVKRRTGVGLPPVAPA-ASPALKPTANGK--RPSSKPPKSIMSPEVKV 178
Query: 215 KAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQN-----KPAQGSQDTPKPG 269
K+ K T+ T+ ++ A K P K++N+ + K + +
Sbjct: 179 KSAKKTQDTSKE-TTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASE 237
Query: 270 GPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQ---QRQAKAAALLEKTKTSNE- 325
S ++ + DE L + + + +R EK K +
Sbjct: 238 STVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKR 297
Query: 326 -KSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKK 361
K M + + ED + S E +S++P P K
Sbjct: 298 LKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPK 334
>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
Provisional.
Length = 672
Score = 33.7 bits (78), Expect = 0.35
Identities = 20/132 (15%), Positives = 40/132 (30%), Gaps = 12/132 (9%)
Query: 196 DLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQ 255
+L+ A +EA+ AA + +P+ + ++ +
Sbjct: 50 ELIAAIKEARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAAR 109
Query: 256 NKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
A ++ P A E E A+ A R+A + ++ A A
Sbjct: 110 AAAAAAAEAASAPEAAQA------------RERRERGEAARRGAARKAGEGGEQPATEAR 157
Query: 316 LLEKTKTSNEKS 327
+T E+
Sbjct: 158 ADAAERTEEEER 169
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 33.5 bits (77), Expect = 0.39
Identities = 21/98 (21%), Positives = 31/98 (31%)
Query: 177 SQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQ 236
+ A +EK A A A+ AA A A P A P
Sbjct: 368 AAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVA 427
Query: 237 SEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPAT 274
+ + A + + + A V PA +Q P+ P
Sbjct: 428 APAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVR 465
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 33.2 bits (76), Expect = 0.44
Identities = 23/120 (19%), Positives = 43/120 (35%), Gaps = 6/120 (5%)
Query: 192 ELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKS--KSEPS 249
+ R+L K ++ K+ K + K T +P+ + + K +S S
Sbjct: 92 QQLRELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATS 151
Query: 250 KEANVQNKPAQGSQDTPKPGGPPATLSNVDNK----ENVKSEGDEVKSKAQLKAERRAKQ 305
E++ + P TP S K + S GDE K++ ++ K+
Sbjct: 152 PESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKSPKSKAAPKKAGKK 211
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 33.2 bits (76), Expect = 0.53
Identities = 38/175 (21%), Positives = 59/175 (33%), Gaps = 30/175 (17%)
Query: 226 PDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQ-----------DTPKPGGPPAT 274
P P K+ + SK EP Q PA SQ + T
Sbjct: 233 PSMKISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRRSLQRVLTDERKSSSRRT 292
Query: 275 ------LSNVDNKENVKSEGDE-------VKSKAQLKAERRAKQEQ-QRQAKAAALLEKT 320
+N E +K E E ++ + L +R ++ Q + EK
Sbjct: 293 PSLLRSRTNSSLIEFLKRESSENLLPSLSSRTSSDLLKNKRLQKRQVDLSDSSRQHEEKL 352
Query: 321 KTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSGVDG---VKATP 372
K + K + + + A +KS++ L SKD + K+ S V ATP
Sbjct: 353 KKKQMLNEQKKELKRAISALKKSNRE--LSSKDIVETAEKRSSQFGQGVQVTATP 405
>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive.
Pullulan is an unusual, industrially important
polysaccharide in which short alpha-1,4 chains
(maltotriose) are connected in alpha-1,6 linkages.
Enzymes that cleave alpha-1,6 linkages in pullulan and
release maltotriose are called pullulanases although
pullulan itself may not be the natural substrate. In
contrast, a glycogen debranching enzyme such GlgX,
homologous to this family, can release glucose at
alpha,1-6 linkages from glycogen first subjected to limit
degradation by phosphorylase. Characterized members of
this family include a surface-located pullulanase from
Streptococcus pneumoniae (PMID:11083842) and an
extracellular bifunctional amylase/pullulanase with
C-terminal pullulanase activity (PMID:8798645).
Length = 1111
Score = 32.9 bits (75), Expect = 0.64
Identities = 17/100 (17%), Positives = 29/100 (29%), Gaps = 6/100 (6%)
Query: 227 DAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKS 286
D +T + + P + KP D KP D K+N
Sbjct: 995 DPLTAAVVRVGGIEAPEKTPPPPEHEPQAPKPPTQDPDGSKPKDKV------DPKDNKDP 1048
Query: 287 EGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEK 326
+ + +++++ Q A L T T N
Sbjct: 1049 LTPPGSDDENGETPKGNEEKKEEQPDKGANLPNTGTKNSN 1088
>gnl|CDD|227499 COG5171, YRB1, Ran GTPase-activating protein (Ran-binding protein)
[Intracellular trafficking and secretion].
Length = 211
Score = 32.3 bits (73), Expect = 0.66
Identities = 28/132 (21%), Positives = 52/132 (39%), Gaps = 5/132 (3%)
Query: 200 AQREAKKAAKLAAKQKAKVKG-DVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
+R+ K+A + + K + DV + DA ++ E K + P E V
Sbjct: 3 VERKKKQAKIEKEENEQKERSLDVVSKGDAFGDGKAGGEE---KKVQQSPFLENAVPEGD 59
Query: 259 AQGSQDTPKPGGPPAT-LSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALL 317
++P P L V K N + E K++A+L +E + + ++
Sbjct: 60 EGKGPESPNIHFEPVVELQRVHLKTNEEDETVLFKARAKLFRFDEEAKEWKERGTGDMII 119
Query: 318 EKTKTSNEKSMT 329
K K +N+ +T
Sbjct: 120 LKHKKTNKARIT 131
>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
Length = 423
Score = 32.7 bits (75), Expect = 0.72
Identities = 13/104 (12%), Positives = 26/104 (25%), Gaps = 7/104 (6%)
Query: 212 AKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGP 271
+KQ PP + + A + + P +
Sbjct: 187 SKQVEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPL-----PQAQPKKAATEEEL 241
Query: 272 PATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
A + +K + + + K+ +Q QA A
Sbjct: 242 IADAK--KAATGEPAANAAKAAKPEPMPDDQQKEAEQLQAAIAQ 283
Score = 30.1 bits (68), Expect = 4.4
Identities = 15/104 (14%), Positives = 32/104 (30%), Gaps = 5/104 (4%)
Query: 198 VKAQREAKKAAKLAAKQKAKVKGDVTA----PPDAVTQPESKQSEPSKAKSKSEPSKEAN 253
V+ A+++A+ TA P A P+++ + + + +K+A
Sbjct: 190 VEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIADAKKAA 249
Query: 254 VQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQL 297
A ++ KP P + +L
Sbjct: 250 TGEPAANAAK-AAKPEPMPDDQQKEAEQLQAAIAQAIGGVAGKL 292
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 32.9 bits (74), Expect = 0.74
Identities = 39/173 (22%), Positives = 69/173 (39%), Gaps = 12/173 (6%)
Query: 198 VKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEAN---- 253
VK + ++AA AA+ K +VK A A E+K++ + + K +
Sbjct: 154 VKKETTTQQAAP-AAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWA 212
Query: 254 VQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQ------ 307
+ K QD ++ V K +K + KA++K E A ++Q
Sbjct: 213 LSVKYGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVVK 272
Query: 308 -QRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPST 359
A +K T+ +++ K+ TE +KPA S+ T K+ +T
Sbjct: 273 ENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNT 325
>gnl|CDD|169386 PRK08334, PRK08334, translation initiation factor IF-2B subunit
beta; Validated.
Length = 356
Score = 32.2 bits (73), Expect = 0.88
Identities = 57/274 (20%), Positives = 119/274 (43%), Gaps = 26/274 (9%)
Query: 440 SEKEYSRGFEERLGPAMSYLNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEV 499
++ E+ GF + A L RP +V++ A+ K + + D D RL ++E
Sbjct: 81 TKDEFMDGFYK----AYETLKNTRPTAVNLFWALNRIKKLVEEHLEDPLDEIKRLIVEE- 135
Query: 500 IATYIHEQVDMAGNAICMFFHNKLANDDVILTY-GCSSL-------VEKILLTAHEKGTK 551
A I ++ D+ N + ++ + +LT+ SL V +L H+ GT
Sbjct: 136 -AQKIADE-DVEANLRMGHYGAEVLPEGNVLTHCNAGSLATVHLGTVGAVLRVMHKDGT- 192
Query: 552 FRVIIVDGS-PWYEGKEMLRRLVKHQVDCSYVLLS--AVSYIMRE--VSKVIIGAHALLS 606
+++ VD + P +G L H L+S ++M++ V +I+GA +++
Sbjct: 193 LKLLWVDETRPVLQGAR-LSAWEYHYDGIPLKLISDNMAGFVMQQGKVDAIIVGADRIVA 251
Query: 607 NGAVMSRAGTAQVSLVARAFNVPVLAACETHKFCERVQTDALVFNELGDPNELISDKSAA 666
NG ++ GT ++++A+ +P +++ + E P E+++
Sbjct: 252 NGDFANKIGTYTLAVLAKEHGIPFFTVAPLSTIDMSLKSGKEIPIEERSPEEVLT----C 307
Query: 667 KNWKSLAHLTPLSLTYDITPSHLVTAVITELAIV 700
+ + + +D+TP +T +IT+ +V
Sbjct: 308 GGCRIAPDVDVYNPAFDVTPHKYLTGIITDRGVV 341
>gnl|CDD|234336 TIGR03734, PRTRC_parB, PRTRC system ParB family protein. A novel
genetic system characterized by six major proteins,
included a ParB homolog and a ThiF homolog, is
designated PRTRC, or ParB-Related,ThiF-Related Cassette.
It is often found on plasmids. This protein family the
member related to ParB, and is designated PRTRC system
ParB family protein.
Length = 554
Score = 32.4 bits (74), Expect = 1.0
Identities = 29/129 (22%), Positives = 48/129 (37%), Gaps = 19/129 (14%)
Query: 130 KPLSGNKEIPLYSATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVT------TAI 183
+P IPL +A G Q +C + ++V P SL +VT T
Sbjct: 257 RPGDNFTVIPL-AADGTLGVGEEQATAC--RSCEDFGAAVSALPGSLGKVTKGLCFDTVC 313
Query: 184 MSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAK 243
+ K V A+++A++AA AA++ A P + E+ S +K
Sbjct: 314 NTTK----------VAARKKAERAAAAAAQKPAAPAAGPGTPAKEKSPAETATSGAAKPA 363
Query: 244 SKSEPSKEA 252
+K
Sbjct: 364 AKKAVPSSQ 372
Score = 31.2 bits (71), Expect = 2.2
Identities = 20/64 (31%), Positives = 30/64 (46%), Gaps = 5/64 (7%)
Query: 292 KSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKS 351
K A+ KAER A Q+ A AA + EKS ++ T A++ ++K V S
Sbjct: 317 KVAARKKAERAAAAAAQKPAAPAA--GPGTPAKEKSPAETAT---SGAAKPAAKKAVPSS 371
Query: 352 KDPN 355
+ N
Sbjct: 372 QPSN 375
>gnl|CDD|106976 PHA00667, PHA00667, hypothetical protein.
Length = 158
Score = 30.8 bits (68), Expect = 1.4
Identities = 6/91 (6%), Positives = 20/91 (21%), Gaps = 5/91 (5%)
Query: 179 VTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSE 238
T A ++ A ++ +++ + + A + S+
Sbjct: 27 TTAAQVAAVAAEAPAAQQEASTEKKPRATSAKTTAATAPTPPTAEVGAADAPE-----SK 81
Query: 239 PSKAKSKSEPSKEANVQNKPAQGSQDTPKPG 269
+ + A +
Sbjct: 82 AENSAPVQAGDAGQDAAAGDAAPTYQDTADA 112
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 32.0 bits (72), Expect = 1.4
Identities = 19/79 (24%), Positives = 32/79 (40%), Gaps = 11/79 (13%)
Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGS 262
EA K+A+ ++A TA P+ T+P + + + P ++ +
Sbjct: 60 EASKSAEANTTKEA----PATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVE--- 112
Query: 263 QDTPKPGGPPATLSNVDNK 281
K PAT+SN DN
Sbjct: 113 ----KAPAEPATVSNPDNA 127
>gnl|CDD|215467 PLN02866, PLN02866, phospholipase D.
Length = 1068
Score = 32.0 bits (73), Expect = 1.5
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 518 FFHNKLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIV 557
FF + L+ DD I +L +IL AH++ FRVIIV
Sbjct: 753 FFISGLSGDDTIQNRVLEALYRRIL-RAHKEKKCFRVIIV 791
>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB. Isd proteins are
iron-regulated surface proteins found in Bacillus,
Staphylococcus and Listeria species and are responsible
for heme scavenging from hemoproteins. The IsdB protein
is only observed in Staphylococcus and consists of an
N-terminal hydrophobic signal sequence, a pair of tandem
NEAT (NEAr Transporter, pfam05031) domains which confers
the ability to bind heme and a C-terminal sortase
processing signal which targets the protein to the cell
wall. IsdB is believed to make a direct contact with
methemoglobin facilitating transfer of heme to IsdB. The
heme is then transferred to other cell wall-bound NEAT
domain proteins such as IsdA and IsdC.
Length = 644
Score = 31.8 bits (71), Expect = 1.5
Identities = 30/120 (25%), Positives = 55/120 (45%), Gaps = 5/120 (4%)
Query: 135 NKEIPLYSATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELS 194
NK+ P +S ++ + + A + S TTP + T + S+E +
Sbjct: 502 NKQSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETT 561
Query: 195 RDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPE-SKQSEPSKAKSKSEPSKEAN 253
+D+V+ + +A A QKA +K D TQ + +K ++ +KAKS + +E+N
Sbjct: 562 KDVVQTSAGSSEAKDSAPLQKANIKNT----NDGHTQSQNNKNTQENKAKSLPQTGEESN 617
>gnl|CDD|222110 pfam13409, GST_N_2, Glutathione S-transferase, N-terminal domain.
This family is closely related to pfam02798.
Length = 68
Score = 28.7 bits (65), Expect = 1.6
Identities = 9/36 (25%), Positives = 15/36 (41%)
Query: 535 SSLVEKILLTAHEKGTKFRVIIVDGSPWYEGKEMLR 570
S ++ L KG + + V PW + E+L
Sbjct: 2 SPFARRVRLALELKGLPYEIEEVPLDPWDKPPELLA 37
>gnl|CDD|181561 PRK08818, PRK08818, prephenate dehydrogenase; Provisional.
Length = 370
Score = 31.4 bits (71), Expect = 1.7
Identities = 24/92 (26%), Positives = 46/92 (50%), Gaps = 9/92 (9%)
Query: 141 YSATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKA 200
+ P S + A+ LI+++P ++ T A + + ++ + +L D V +
Sbjct: 35 HDPADPGSLDPATLLQRADVLIFSAP--IRHTAALIEEYVA--LAGGRAAGQLWLD-VTS 89
Query: 201 QREAKKAAKLAAKQKAKVKG--DVTAPPDAVT 230
++A AA LA+ +A+V G +TAPP + T
Sbjct: 90 IKQAPVAAMLAS--QAEVVGLHPMTAPPKSPT 119
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 31.6 bits (72), Expect = 1.8
Identities = 23/130 (17%), Positives = 47/130 (36%), Gaps = 6/130 (4%)
Query: 232 PESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEV 291
E E ++ SK + + P G ++ N ++ E +E+
Sbjct: 404 EEESDEEENEEPSKKNVGRR---KFGPENGEKEAESKK-LKKENKNEFKEKKESDEEEEL 459
Query: 292 KSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKS 351
+ + + K E+ A + +R KA E+ + ++ KT S S K ++ S
Sbjct: 460 EDEEEAKVEKVANKLLKRSEKAQKEEEEEE--LDEENPWLKTTSSVGKSAKKQDSKKKSS 517
Query: 352 KDPNVPSTKK 361
+ + K
Sbjct: 518 SKLDKAANKI 527
>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
dihydrolipoamide succinyltransferase (E2 component).
This model describes the TCA cycle 2-oxoglutarate system
E2 component, dihydrolipoamide succinyltransferase. It
is closely related to the pyruvate dehydrogenase E2
component, dihydrolipoamide acetyltransferase. The seed
for this model includes mitochondrial and Gram-negative
bacterial forms. Mycobacterial candidates are highly
derived, differ in having and extra copy of the
lipoyl-binding domain at the N-terminus. They score
below the trusted cutoff, but above the noise cutoff and
above all examples of dihydrolipoamide acetyltransferase
[Energy metabolism, TCA cycle].
Length = 403
Score = 31.2 bits (71), Expect = 1.8
Identities = 19/110 (17%), Positives = 33/110 (30%), Gaps = 7/110 (6%)
Query: 228 AVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGP----PATLSNVDNKEN 283
A+ + + + AKS E + A + P A +D
Sbjct: 72 AILEEGNDATAAPPAKSGEEKEETPAASAAAAPTAAANRPSLSPAARRLAKEHGIDLSAV 131
Query: 284 VKSEGDEVKSKAQLKAERRAKQEQQRQAKAAAL---LEKTKTSNEKSMTK 330
+ +K + + A Q+ A AAA T+ MT+
Sbjct: 132 PGTGVTGRVTKEDIIKKTEAPASAQQPAPAAAAKAPANFTRPEERVKMTR 181
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 31.1 bits (69), Expect = 1.9
Identities = 38/148 (25%), Positives = 55/148 (37%), Gaps = 7/148 (4%)
Query: 200 AQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNK-- 257
A KK+AK AA KA A P A ++ + AK+ + P+K A K
Sbjct: 209 AAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAA 268
Query: 258 --PAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
PA+ + K PPA + K + + A KA + AKAAA
Sbjct: 269 APPAKAAAPPAKAAAPPAKAAAPPAK---AAAAPAKAAAAPAKAAAAPAKAAAPPAKAAA 325
Query: 316 LLEKTKTSNEKSMTKSKTEDSKPASEKS 343
K T K+ + P +K+
Sbjct: 326 PPAKAATPPAKAAAPPAKAAAAPVGKKA 353
Score = 30.7 bits (68), Expect = 3.1
Identities = 40/154 (25%), Positives = 56/154 (36%), Gaps = 12/154 (7%)
Query: 200 AQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSK-----AKSKSEPSKEANV 254
A + K AAK AA K AP A P + P+K AK+ + P+K A
Sbjct: 197 AAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAP 256
Query: 255 QNK----PAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQ 310
K PA+ + K PPA + K + + A KA +
Sbjct: 257 PAKAAAPPAKAAAPPAKAAAPPAKAAAPPAK---AAAPPAKAAAAPAKAAAAPAKAAAAP 313
Query: 311 AKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSS 344
AKAAA K K+ T + PA ++
Sbjct: 314 AKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAA 347
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 31.2 bits (70), Expect = 2.1
Identities = 16/61 (26%), Positives = 26/61 (42%)
Query: 305 QEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSG 364
+ + R A AAA+LE+ + ++ + E KP K ++ E L S SG
Sbjct: 99 KSRVRSASAAAILEEDDSKDDMEFKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKSG 158
Query: 365 V 365
Sbjct: 159 S 159
>gnl|CDD|233208 TIGR00956, 3a01205, Pleiotropic Drug Resistance (PDR) Family
protein. [Transport and binding proteins, Other].
Length = 1394
Score = 31.2 bits (71), Expect = 2.3
Identities = 13/60 (21%), Positives = 28/60 (46%)
Query: 179 VTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSE 238
+++E N A+ +++ +R + K AK A + A K D+ A + + +S+
Sbjct: 684 FVYILLTEFNKGAKQKGEILVFRRGSLKRAKKAGETSASNKNDIEAGEVLGSTDLTDESD 743
>gnl|CDD|188123 TIGR01265, tyr_nico_aTase, tyrosine/nicotianamine family
aminotransferase. This subfamily of pyridoxal
phosphate-dependent enzymes includes known examples of
both tyrosine aminotransferase from animals and
nicotianamine aminotransferase from barley.
Length = 403
Score = 30.8 bits (70), Expect = 2.6
Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 12/49 (24%)
Query: 497 KEVIATYIHEQVDMAGNAICMFFHNKLANDDVILTYGCSSLVEKILLTA 545
+E +A Y+ + KL DDV+LT GCS +E I + A
Sbjct: 79 REAVAEYLSSDLP-----------GKLTADDVVLTSGCSQAIE-ICIEA 115
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 30.7 bits (69), Expect = 2.8
Identities = 21/111 (18%), Positives = 44/111 (39%), Gaps = 8/111 (7%)
Query: 239 PSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVD-----NKENVKSEGDEVKS 293
+ + +S +E + + S+D +P P D +E ++ ++
Sbjct: 6 ELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITIQEREREAAKEKALEE 65
Query: 294 KAQLKAERRAKQEQQ---RQAKAAALLEKTKTSNEKSMTKSKTEDSKPASE 341
+A+ KAE R ++ + + K L+K T E ++ T+D E
Sbjct: 66 EAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDDENEEEE 116
>gnl|CDD|237081 PRK12372, PRK12372, ribonuclease III; Reviewed.
Length = 413
Score = 30.6 bits (69), Expect = 2.9
Identities = 27/156 (17%), Positives = 51/156 (32%), Gaps = 9/156 (5%)
Query: 218 VKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSN 277
VKG V D + +++ +A++ + PA+ P P
Sbjct: 251 VKG-VQEALDLRSPERKERAAAREARAAAAAPAATAAAAAPAEE------PAVAPMAAIR 303
Query: 278 VDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSK 337
+ E +G+ A A++ A + R A EK + ++ K + +
Sbjct: 304 AAHVETAADKGERAAKPA--AADKAADKPADRPDAAEKAAEKPAEAAPRAADKPAGQAAD 361
Query: 338 PASEKSSKTEVLKSKDPNVPSTKKYSGVDGVKATPG 373
PAS + K P+ + + PG
Sbjct: 362 PASSSADKPGASADAAARTPARARDAAAPDADTPPG 397
Score = 29.1 bits (65), Expect = 8.5
Identities = 22/113 (19%), Positives = 34/113 (30%), Gaps = 8/113 (7%)
Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGS 262
A AA A A + P A + ++ K + ++P+ +KPA
Sbjct: 275 RAAAAAPAATAAAAAPAEEPAVAPMAAIRAAHVETAADKGERAAKPAAADKAADKPADRP 334
Query: 263 QDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAA 315
K PA E D+ +A A A + AA
Sbjct: 335 DAAEKAAEKPA--------EAAPRAADKPAGQAADPASSSADKPGASADAAAR 379
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 29.9 bits (68), Expect = 3.3
Identities = 19/108 (17%), Positives = 42/108 (38%), Gaps = 9/108 (8%)
Query: 194 SRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEAN 253
++ V+AQR ++A K A + + P + ++ K + + P +
Sbjct: 106 AKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDI- 164
Query: 254 VQNKPAQGSQDTPKPGGPP--ATLSNVDNKENVK---SEGDEVKSKAQ 296
++ G K G AT+ + K++V+ G + +A+
Sbjct: 165 --SELTVGQAVKVKAGKSAMDATVLEI-TKDDVRVQLDSGLSMIVRAE 209
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 30.4 bits (69), Expect = 3.5
Identities = 20/102 (19%), Positives = 37/102 (36%), Gaps = 23/102 (22%)
Query: 279 DNKENVKSEGDEV--------KSKAQLKAERRAKQEQQRQAKAAALLEKTKTSN------ 324
E +SE + + K+KAQ E+R K+ ++ + L +K
Sbjct: 257 SAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLKEIA 316
Query: 325 ------EKSMTKSKTEDSKPASEKSSKTEVL---KSKDPNVP 357
EK+ + K + + +K K L K +P +
Sbjct: 317 KEVAQKEKARARKKEQRKERGEKKKLKRRKLGKHKYPEPPLE 358
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 29.6 bits (67), Expect = 3.9
Identities = 29/124 (23%), Positives = 45/124 (36%), Gaps = 12/124 (9%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQN-K 257
+ +RE +KA K +++AK K D +A + KA +
Sbjct: 12 RRKREQRKARKKQKRKEAKKKEDAQK-SEAEEVKNEENKSKKKAAPIENAEGNIVFSKVE 70
Query: 258 PAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALL 317
A G Q L K+ K + D K + R+ K E+ + KAA +
Sbjct: 71 FADGEQAKKD-------LKLKKKKK--KKKTD-YKQLLKKLEARKKKLEELDEDKAAEIE 120
Query: 318 EKTK 321
EK K
Sbjct: 121 EKEK 124
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 30.3 bits (68), Expect = 4.0
Identities = 21/107 (19%), Positives = 39/107 (36%), Gaps = 19/107 (17%)
Query: 166 PSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAA---------KLAAKQKA 216
P+ + + E + V A EA + A K+A +
Sbjct: 924 PTQDGPAAEAQPAQENTV-------VESAEKAVAAVSEAAQQAATAVASGIEKVAEAVRK 976
Query: 217 KVKGDV---TAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQ 260
V+ V A A+ + +K++ KA +K +K+A + K A+
Sbjct: 977 TVRRSVKKAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAAKKKVAK 1023
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
This family consists of several brain acid soluble
protein 1 (BASP1) or neuronal axonal membrane protein
NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 29.8 bits (66), Expect = 4.0
Identities = 37/158 (23%), Positives = 65/158 (41%), Gaps = 15/158 (9%)
Query: 199 KAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKS----------KSEP 248
K EA+ AA+ ++AK + D + E K+ E A + K+E
Sbjct: 37 KENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAAKEEAPKAEPEKTEG 96
Query: 249 SKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQE-- 306
+ EA + A + P PA S+ E S A K E ++K+E
Sbjct: 97 AAEAKAEPPKASDPEQEPAAAPGPAAGGEAPKASEASSQPAE--SAAPAKEEEKSKEEGE 154
Query: 307 -QQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKS 343
++ +A AAA E + S +K + ++ P+S+++
Sbjct: 155 AKKTEAPAAAAQETKSDAAPASDSKPSSSEAAPSSKET 192
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 30.4 bits (68), Expect = 4.3
Identities = 23/117 (19%), Positives = 43/117 (36%), Gaps = 6/117 (5%)
Query: 159 EQLIYTSPSSVQT-TPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAK 217
E L+ S V T P+ ++ I + ++++ E+ L +A A A+++A
Sbjct: 158 EPLLEVSTDKVDTEIPSPVAGTLLEIRAPEDDTVEVGTVLAIIG-DANAAPAEPAEEEAP 216
Query: 218 VKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPAT 274
+ + P P+ P A P+ PA + + GP T
Sbjct: 217 APSEAGSEP----APDPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVT 269
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 30.3 bits (68), Expect = 4.3
Identities = 29/133 (21%), Positives = 44/133 (33%), Gaps = 12/133 (9%)
Query: 196 DLVKAQREAKKAAKLAAKQKA--KVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEAN 253
DL + K+A+ K+++ P Q P+ + + A+
Sbjct: 367 DLSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPAS 426
Query: 254 VQNKPAQ-------GSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQE 306
PA GS D P P PP E + D+ KA L A R +
Sbjct: 427 APPPPATPLPSAEPGSDDGPAP--PPERQPPAPATEPAPDDPDDATRKA-LDALRERRPP 483
Query: 307 QQRQAKAAALLEK 319
+ A A LL +
Sbjct: 484 EPPGADLAELLGR 496
>gnl|CDD|129310 TIGR00206, fliF, flagellar basal-body M-ring protein/flagellar
hook-basal body protein (fliF). Component of the M
(cytoplasmic associated) ring, one of four rings
(L,P,S,M) which make up the flagellar hook-basal body
which is a major portion of the flagellar organelle.
Although the basic structure of the flagella appears to
be similar for all bacteria, additional rings and
structures surrounding the basal body have been observed
for some bacteria (eg Vibrio cholerae and Treponema
pallidum) [Cellular processes, Chemotaxis and motility].
Length = 555
Score = 30.2 bits (68), Expect = 4.5
Identities = 33/172 (19%), Positives = 53/172 (30%), Gaps = 39/172 (22%)
Query: 183 IMSEKNNSAELSRDLVKAQREAK---------KAAKLAAK---QKAKVKGDVTAPPD-AV 229
I+++ N DL + Q + K + + + V V A D +
Sbjct: 219 ILNDSNLDGIDRIDLAEKQLKYKSKVEAMLQGEIDSILSPIVGVDNFVVARVNAKLDFSK 278
Query: 230 TQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEGD 289
++ P++ +S + + Q + QG PGG P SN
Sbjct: 279 ETTTEEEYAPNELESSDSTIRSSQTQEEEYQG--QGYSPGGVPGQESNT---------PP 327
Query: 290 EVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASE 341
V+ LK +E Q EK NE K E + E
Sbjct: 328 PVQD---LKDNTGNYEESQY--------EKNVQLNE----KKNYEVKRTIRE 364
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 29.7 bits (67), Expect = 4.6
Identities = 35/165 (21%), Positives = 66/165 (40%), Gaps = 23/165 (13%)
Query: 195 RDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANV 254
++L + +R KK K K P A P++ P K S +
Sbjct: 76 KELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKSPKAAAPRPK--KKSERISWAPTL 133
Query: 255 QNKPAQGSQDTPKPGGPPATLSNVDNKE--NVKSEGDEVKSKAQLKAERRAKQEQQR--- 309
+ P + S + S V NKE + + + E++ K R+ K++++
Sbjct: 134 LDSPRRKSSRS----------STVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKEL 183
Query: 310 -QAKAAALLEKTKTSNEKSMTKSKT-EDSKPASEKSSKTEVLKSK 352
Q + A ++T+ N KS+ + + E+ K K +K + LK +
Sbjct: 184 TQEERLAEAKETERINLKSLERYEEQEEEK----KKAKIQALKKR 224
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 30.2 bits (68), Expect = 4.6
Identities = 14/35 (40%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 281 KENVKSEGDEVKSKAQLKA--ERRAKQEQQRQAKA 313
+E ++ E E K +A+LK ER+AK+E +Q +A
Sbjct: 9 REKLEREQRERKQRAKLKLERERKAKEEAAKQREA 43
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 30.1 bits (68), Expect = 4.8
Identities = 12/51 (23%), Positives = 23/51 (45%)
Query: 298 KAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTEV 348
+A+R A+ E K A L+K S+ + K S+ +++ E+
Sbjct: 588 QADRAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAEREI 638
>gnl|CDD|234399 TIGR03923, T7SS_EccE, type VII secretion protein EccE. This model
represents the transmembrane protein EccB of the
actinobacterial flavor of type VII secretion systems.
Species such as Mycobacterium tuberculosis have several
instances of this system per genome, designated EccE1,
EccE2, etc. This model represents a conserved core
region, and many members have 200 or more additional
C-terminal residues [Protein fate, Protein and peptide
secretion and trafficking].
Length = 341
Score = 29.7 bits (67), Expect = 4.9
Identities = 9/48 (18%), Positives = 20/48 (41%)
Query: 669 WKSLAHLTPLSLTYDITPSHLVTAVITELAIVPCTSVPVVLRVKPTET 716
W+ L Y + P +++ + +L +P V +R++P
Sbjct: 249 WRGLRGDNGYLAAYRVDPDPILSDTLAQLWALPADETTVTVRLRPGAG 296
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 30.0 bits (67), Expect = 5.2
Identities = 21/119 (17%), Positives = 38/119 (31%)
Query: 227 DAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKS 286
+ PE + KSK KE + K + G L+ D+ + S
Sbjct: 182 ETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAAS 241
Query: 287 EGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPASEKSSK 345
+ ++ A + + K A E K+ K + K ++ K +K
Sbjct: 242 VAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHH 300
>gnl|CDD|190741 pfam03763, Remorin_C, Remorin, C-terminal region. Remorins are
plant-specific plasma membrane-associated proteins. In
tobacco remorin co-purifies with lipid rafts. Most
remorins have a variable, proline-rich C-half and a more
conserved N-half that is predicted to form coiled coils.
Consistent with this, circular dichroism studies have
demonstrated that much of the protein is alpha-helical.
Remorins exist in plasma membrane preparations as
oligomeric structures and form filaments in vitro. The
proteins can bind polyanions including the extracellular
matrix component oligogalacturonic acid (OGA). In vitro,
remorin in plasma membrane preparations is
phosphorylated (principally on threonine residues) in
the presence of OGA and thus co-purifies with a protein
kinases(s). The biological functions of remorins are
unknown but roles as components of the
membrane/cytoskeleton are possible.
Length = 112
Score = 28.5 bits (64), Expect = 5.4
Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 3/71 (4%)
Query: 280 NKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPA 339
+E K + E K KA+ +AE + K E++ + K A EK K N+ + K E+ +
Sbjct: 26 QREEAKIQAWENKKKAKAEAELK-KIERKLEKKKAEAAEKLK--NKLAAAHKKAEEKRAT 82
Query: 340 SEKSSKTEVLK 350
+E E K
Sbjct: 83 AEAKRGEEEAK 93
>gnl|CDD|215074 PLN00145, PLN00145, tyrosine/nicotianamine aminotransferase;
Provisional.
Length = 430
Score = 29.7 bits (67), Expect = 5.6
Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 28/127 (22%)
Query: 522 KLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVD--GSPWYEGKEMLRRL-VKH--- 575
+L+ DD+ LT GC+ +E I+ + G I++ G P YE + + L V+H
Sbjct: 114 ELSTDDIYLTAGCAQAIEIIMSVLAQPGAN---ILLPRPGYPLYEARAVFSGLEVRHFDL 170
Query: 576 ------QVDCSYVLLSAVSYIMRE--VSKVIIGAHALLSN--GAVMSRAGTAQVSLVARA 625
+VD L V + E V+ VII +N G+V S A+++ AR
Sbjct: 171 LPERGWEVD-----LEGVEALADENTVAMVIINP----NNPCGSVYSYEHLAKIAETARK 221
Query: 626 FNVPVLA 632
+ V+A
Sbjct: 222 LGILVIA 228
>gnl|CDD|119332 cd06562, GH20_HexA_HexB-like, Beta-N-acetylhexosaminidases catalyze
the removal of beta-1,4-linked N-acetyl-D-hexosamine
residues from the non-reducing ends of
N-acetyl-beta-D-hexosaminides including
N-acetylglucosides and N-acetylgalactosides. The hexA
and hexB genes encode the alpha- and beta-subunits of
the two major beta-N-acetylhexosaminidase isoenzymes,
N-acetyl-beta-D-hexosaminidase A (HexA) and
beta-N-acetylhexosaminidase B (HexB). Both the alpha
and the beta catalytic subunits have a TIM-barrel fold
and belong to the glycosyl hydrolase family 20 (GH20).
The HexA enzyme is a heterodimer containing one alpha
and one beta subunit while the HexB enzyme is a
homodimer containing two beta-subunits. Hexosaminidase
mutations cause an inability to properly hydrolyze
certain sphingolipids which accumulate in lysosomes
within the brain, resulting in the lipid storage
disorders Tay-Sachs and Sandhoff. Mutations in the
alpha subunit cause in a deficiency in the HexA enzyme
and result in Tay-Sachs, mutations in the beta-subunit
cause in a deficiency in both HexA and HexB enzymes and
result in Sandhoff disease. In both disorders GM(2)
gangliosides accumulate in lysosomes. The GH20
hexosaminidases are thought to act via a catalytic
mechanism in which the catalytic nucleophile is not
provided by solvent or the enzyme, but by the substrate
itself.
Length = 348
Score = 29.5 bits (67), Expect = 5.8
Identities = 10/42 (23%), Positives = 19/42 (45%), Gaps = 6/42 (14%)
Query: 522 KLANDDVILTYGCSSLVEKILLTAHEKGTKFRVIIVDGSPWY 563
L D ++ +G S ++ +L G ++VI+ WY
Sbjct: 218 LLPKDTIVQVWGGSDELKNVL----AAG--YKVILSSYDFWY 253
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 29.3 bits (66), Expect = 6.1
Identities = 19/91 (20%), Positives = 37/91 (40%), Gaps = 12/91 (13%)
Query: 269 GGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSM 328
P+ + D +E KS+ E ++K E ++E AA +K + +
Sbjct: 37 PSSPSDQAAADEQEAKKSDDQETAEIEEVKEE---EKE------AANSEDKEDKGDAEKE 87
Query: 329 TKSKTEDSKPASEKSSK---TEVLKSKDPNV 356
+ E+++ E+SS E + + NV
Sbjct: 88 DEESEEENEEEDEESSDENEKETEEKTESNV 118
>gnl|CDD|234681 PRK00182, tatB, sec-independent translocase; Provisional.
Length = 160
Score = 28.6 bits (64), Expect = 6.2
Identities = 10/39 (25%), Positives = 15/39 (38%)
Query: 235 KQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPA 273
+++ + S ANV +PA GSQ P
Sbjct: 108 GEAQRQQQASAGRADNNANVVTRPADGSQPRSSQNDPKG 146
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 29.9 bits (67), Expect = 6.2
Identities = 13/63 (20%), Positives = 27/63 (42%), Gaps = 1/63 (1%)
Query: 157 CAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKA 216
C + T Q L + A+ + + A L++ +AQ E K +L + +A
Sbjct: 209 CTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKR-EAQEEQLKKQQLLKQLRA 267
Query: 217 KVK 219
+++
Sbjct: 268 RIE 270
>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
Length = 400
Score = 29.8 bits (67), Expect = 6.3
Identities = 23/131 (17%), Positives = 38/131 (29%), Gaps = 5/131 (3%)
Query: 176 LSQVTTAIMSEKNN-SAELSRDLVKAQREAKKAAKLAAKQKAKVKGDVTAPPDAVTQPES 234
L+ +T + N S L+ D+ + V PP +P+S
Sbjct: 194 LTSLTEEAGKARYNASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPV--PPSEAARPKS 251
Query: 235 KQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGP-PATLSNVDNKENVKSEGDEVKS 293
+E + A + +A +N A +Q PA E K
Sbjct: 252 ADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAAAAAKPALEDKP 311
Query: 294 KAQLKAERRAK 304
+ L R
Sbjct: 312 RP-LGIARPGG 321
>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway components
[Intracellular trafficking and secretion].
Length = 94
Score = 27.7 bits (62), Expect = 6.3
Identities = 18/68 (26%), Positives = 30/68 (44%), Gaps = 1/68 (1%)
Query: 192 ELSRDLVKAQREAKKAAK-LAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSK 250
E RDL KA RE KKAA + + ++K + E + ++ + SE +
Sbjct: 27 EAGRDLGKAIREFKKAASDVKNELDEELKLEELDDKKKELTAELQATKEELDQLASELKE 86
Query: 251 EANVQNKP 258
+ + KP
Sbjct: 87 DLKKKAKP 94
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 29.2 bits (66), Expect = 6.7
Identities = 14/47 (29%), Positives = 16/47 (34%)
Query: 222 VTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKP 268
V P V P+ K E K K K + EA KP K
Sbjct: 97 VEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 28.9 bits (65), Expect = 6.7
Identities = 22/114 (19%), Positives = 41/114 (35%), Gaps = 8/114 (7%)
Query: 203 EAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSK----SEPSKEANVQNKP 258
EAKK K A++ KVK + + + + + K K K + E + +
Sbjct: 59 EAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEA 118
Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSE--GDEVKSKAQLKAERRAKQEQQRQ 310
+D K TLS + + K D +S+ K +++ +
Sbjct: 119 EDKLEDLTKS--YSETLSTLSELKPRKYALHKDIYQSRLDRKRRAEVAKKEAEE 170
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 28.6 bits (65), Expect = 7.2
Identities = 17/45 (37%), Positives = 22/45 (48%), Gaps = 2/45 (4%)
Query: 294 KAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKP 338
K L+A K+E + KA EK K EK K+K E+ KP
Sbjct: 55 KESLEAALLDKKELKAWEKAEKKAEKAKAKAEK--KKAKKEEPKP 97
>gnl|CDD|181437 PRK08470, PRK08470, adenylosuccinate lyase; Provisional.
Length = 442
Score = 29.3 bits (66), Expect = 7.2
Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 6/44 (13%)
Query: 89 PINNKG--LLLHAALNDYQSACQVSEQEVKQFFDSKGIYTYCSK 130
IN KG L L A LND +SE+E++ FD Y+Y +K
Sbjct: 392 AINEKGESLFLQALLNDEDLRKSLSEEEIRACFD----YSYYTK 431
>gnl|CDD|221445 pfam12157, DUF3591, Protein of unknown function (DUF3591). This
domain is found in eukaryotes and is typically between
445 to 462 amino acids in length. Most members are
annotated as being transcription initiation factor TFIID
subunit 1, and this region is the conserved central
portion of these proteins.
Length = 457
Score = 29.6 bits (67), Expect = 7.3
Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 9/51 (17%)
Query: 459 LNKCRPHSVSMLNAVKHFKSHLTQLPNDITDTQARLRLKEVIATYIHEQVD 509
L K +S+ + KHF + Q R RLKE Y E D
Sbjct: 251 LRKSGRRRISVDDITKHFPDQ--------NEMQNRQRLKE-FMKYQREGGD 292
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 28.6 bits (64), Expect = 7.3
Identities = 11/67 (16%), Positives = 29/67 (43%), Gaps = 2/67 (2%)
Query: 279 DNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKP 338
++++ K E + K + + +++++++ K A L K K++ T K
Sbjct: 94 ESEKEQKEVSKETEEKE--AIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151
Query: 339 ASEKSSK 345
++ K
Sbjct: 152 TKKEEGK 158
>gnl|CDD|232831 TIGR00115, tig, trigger factor. Trigger factor is a
ribosome-associated molecular chaperone and is the first
chaperone to interact with nascent polypeptide. Trigger
factor can bind at the same time as the signal
recognition particle (SRP), but is excluded by the SRP
receptor (FtsY). The central domain of trigger factor
has peptidyl-prolyl cis/trans isomerase activity. This
protein is found in a single copy in virtually every
bacterial genome [Protein fate, Protein folding and
stabilization].
Length = 408
Score = 29.4 bits (67), Expect = 7.8
Identities = 14/51 (27%), Positives = 24/51 (47%), Gaps = 9/51 (17%)
Query: 280 NKENVKSEGDEVKSKAQLKA------ERRAKQEQQRQAKAA---ALLEKTK 321
+ E KS G+E ++ +LKA ER K+ + + K L+E +
Sbjct: 238 DDEFAKSLGEEFETLEELKADIRKNLEREKKERAKNKLKEQLLDKLVENNE 288
>gnl|CDD|145910 pfam03012, PP_M1, Phosphoprotein. This family includes the M1
phosphoprotein non-structural RNA polymerase alpha
subunit, which is thought to be a component of the
active polymerase, and may be involved in template
binding.
Length = 298
Score = 28.9 bits (65), Expect = 8.0
Identities = 18/66 (27%), Positives = 36/66 (54%), Gaps = 3/66 (4%)
Query: 225 PPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPA---TLSNVDNK 281
PP T+ +S Q++P K+K S+ S + + + D+ + GPPA + +N ++
Sbjct: 136 PPGKSTEDKSTQTDPEKSKPASQSSAPVKKEEQSSSEKVDSQEASGPPALEWSATNDEDD 195
Query: 282 ENVKSE 287
+V++E
Sbjct: 196 ASVEAE 201
>gnl|CDD|235026 PRK02292, PRK02292, V-type ATP synthase subunit E; Provisional.
Length = 188
Score = 28.8 bits (65), Expect = 8.1
Identities = 14/71 (19%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 285 KSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEK----SMTKSKTEDSKPAS 340
+++ +E+ + +AER +Q ++++ +A L K + N + +++ ED +
Sbjct: 37 EADAEEILEDREAEAEREIEQLREQELSSAKLEAKRERLNARKEVLEDVRNQVEDEIASL 96
Query: 341 EKSSKTEVLKS 351
+ + E+ KS
Sbjct: 97 DGDKREELTKS 107
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 29.4 bits (65), Expect = 8.6
Identities = 38/238 (15%), Positives = 71/238 (29%), Gaps = 11/238 (4%)
Query: 142 SATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQ 201
S T ++ A TSP++ +T+ + T I+ +A S
Sbjct: 490 SRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTAT-SPPTGTTS 548
Query: 202 REAKKAAKLA--AKQKAKVKGDVTAPPD-AVTQPESKQSEPSKAKSKSEPSKEANVQNKP 258
+ ++ + VT+ P + + Q + + +P ++ + P
Sbjct: 549 VPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTP 608
Query: 259 AQGSQDTPKPGGPPATLSNVDNKENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLE 318
S T P T ++ EN+ E V S + +
Sbjct: 609 RSNSTSTT----PLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPG--TTSQVSGPG 662
Query: 319 KTKTSNEKSMTKSKTEDSKPASEKSSKTEVLKSKDPNVPSTKKYSGVDGVKATPGTTL 376
+ TS P + S K+ P V ++ K T G+TL
Sbjct: 663 NSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTV-TSTGGKANSTTKETSGSTL 719
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 29.4 bits (66), Expect = 8.6
Identities = 28/135 (20%), Positives = 48/135 (35%), Gaps = 4/135 (2%)
Query: 142 SATGPSSCKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQ 201
+ AP+ + A +P + PA+ + A + A+ +
Sbjct: 402 VTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADS 461
Query: 202 REAKKAAKLAAKQKAKVKGDVTAPPDAVTQPESKQSEPSKAKSKSEPSKEANVQNKPAQG 261
R ++ A+ A + APPDA +P + + PS A + P A A
Sbjct: 462 RCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARA----PAAAS 517
Query: 262 SQDTPKPGGPPATLS 276
+D P PPA +
Sbjct: 518 REDAPAAAAPPAPEA 532
>gnl|CDD|239173 cd02772, MopB_NDH-1_NuoG2, MopB_NDH-1_NuoG2: The second domain of
the NuoG subunit of the NADH-quinone oxidoreductase/NADH
dehydrogenase-1 (NDH-1), found in beta- and
gammaproteobacteria. The NDH-1 is the first
energy-transducting complex in the respiratory chain and
functions as a redox pump that uses the redox energy to
translocate H+ ions across the membrane, resulting in a
significant contribution to energy production. In
Escherichia coli NDH-1, the largest subunit is encoded
by the nuoG gene, and is part of the 14 distinct
subunits constituting the functional enzyme. The NuoG
subunit is made of two domains: the first contains three
binding sites for FeS clusters (the fer2 domain), the
second domain (this CD), is of unknown function or, as
postulated, has lost an ancestral formate dehydrogenase
activity that became redundant during the evolution of
the complex I enzyme. Although only vestigial sequence
evidence remains of a molybdopterin binding site, this
protein domain belongs to the molybdopterin_binding
(MopB) superfamily of proteins.
Length = 414
Score = 29.2 bits (66), Expect = 9.0
Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 5/71 (7%)
Query: 149 CKAPQFISCAEQLIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKK-A 207
+ + I +PS++ +L+QV A+ EK + V+A EA+K A
Sbjct: 190 ADDDFLFPLSGKAI-VAPSALA---NALAQVAKALAEEKGLAVPDEDAKVEASEEARKIA 245
Query: 208 AKLAAKQKAKV 218
A L + ++A V
Sbjct: 246 ASLVSAERAAV 256
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 29.3 bits (65), Expect = 9.0
Identities = 31/143 (21%), Positives = 51/143 (35%), Gaps = 20/143 (13%)
Query: 233 ESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTP--------KPGGPPATLSNVDNKENV 284
+SK+S+ K K +KE V KP + P KP P D +E
Sbjct: 536 DSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEE-- 593
Query: 285 KSEGDEVKSKAQLKAERRAKQEQQRQAKAAALLEKTKTSNEKSMTKSKTEDSKPAS-EKS 343
K K A+R + + + + + + K K +P+S E+
Sbjct: 594 -----PKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERP 648
Query: 344 SKTEVLKS----KDPNVPSTKKY 362
+++KS K P P K+
Sbjct: 649 EGPKIIKSPKPPKSPKPPFDPKF 671
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 29.1 bits (65), Expect = 9.0
Identities = 26/101 (25%), Positives = 41/101 (40%), Gaps = 17/101 (16%)
Query: 264 DTPKPGGPPATLSNV------DNKENVKSEGDEVKSKAQLK------AERRAKQEQQRQA 311
D KPG PA L N+E S +EV SK +K ERR +QE++RQ
Sbjct: 354 DDYKPGTDPALLDPSLARKLRSNREVAVSRLEEVISKYAMKQDDTEEEERRKRQERERQG 413
Query: 312 KAA-----ALLEKTKTSNEKSMTKSKTEDSKPASEKSSKTE 347
++ + T + ++ E+ E+ + E
Sbjct: 414 TSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEE 454
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 28.8 bits (64), Expect = 9.0
Identities = 29/116 (25%), Positives = 47/116 (40%), Gaps = 9/116 (7%)
Query: 174 ASLSQVTTAIMSEKNNSAELSRDLVKAQREA-KKAAKLAAKQKAKVKGDVTAPPDAVTQP 232
+SL + IMS +N + ++R L + E K +L+ + K T +
Sbjct: 98 SSLRNIPVVIMSSENVPSRITRCLEEGAEEFFLKPVQLSDLNRLKPHMMKTKSKN----- 152
Query: 233 ESKQSEPSKAKSKSEPSKEANVQNKPAQGSQDTPKPGGPPATLSNVDNKENVKSEG 288
KQ K + E ++ Q +P+Q Q P+P P +N NK EG
Sbjct: 153 -QKQENQEKQEKLEESEIQSEKQEQPSQQPQSQPQPQQQPQQPNN--NKRKAMEEG 205
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 29.1 bits (65), Expect = 9.5
Identities = 17/84 (20%), Positives = 30/84 (35%), Gaps = 3/84 (3%)
Query: 281 KENVKSEGDEVKSKAQLKAERRAKQEQQRQAKAA---ALLEKTKTSNEKSMTKSKTEDSK 337
E K E + + + K E +AK+++ ++ KAA A + K + K
Sbjct: 6 SEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEKK 65
Query: 338 PASEKSSKTEVLKSKDPNVPSTKK 361
DP+ P +K
Sbjct: 66 SRKRDVEDENPEDFIDPDTPFGQK 89
>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain. This region of the APC
family of proteins is known as the basic domain. It
contains a high proportion of positively charged amino
acids and interacts with microtubules.
Length = 359
Score = 28.9 bits (64), Expect = 9.5
Identities = 23/120 (19%), Positives = 44/120 (36%), Gaps = 8/120 (6%)
Query: 161 LIYTSPSSVQTTPASLSQVTTAIMSEKNNSAELSRDLVKAQREAKKAAKLAAKQKAKVKG 220
+IY + ++ + S+ K ++ + Q+ ++ + A K
Sbjct: 8 VIYIPGPANRSQSTTPSKKGP---PLKTQPSDPPKSPSPGQQRSRSLHRPA---KPSELA 61
Query: 221 DVTAPPDAVTQPESKQSEPSKAKSK-SEPSKEANVQNKPAQGSQDTPKP-GGPPATLSNV 278
+++ PP + T P PS + S+ S PS+ S GP +LS V
Sbjct: 62 ELSPPPRSATPPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQV 121
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.124 0.346
Gapped
Lambda K H
0.267 0.0699 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 33,314,120
Number of extensions: 3078797
Number of successful extensions: 4070
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3474
Number of HSP's successfully gapped: 310
Length of query: 717
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 613
Effective length of database: 6,324,786
Effective search space: 3877093818
Effective search space used: 3877093818
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 63 (28.1 bits)