RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy114
(609 letters)
>gnl|CDD|218624 pfam05529, Bap31, B-cell receptor-associated protein 31-like.
Bap31 is a polytopic integral protein of the endoplasmic
reticulum membrane and a substrate of caspase-8. Bap31
is cleaved within its cytosolic domain, generating
pro-apoptotic p20 Bap31.
Length = 149
Score = 150 bits (381), Expect = 9e-43
Identities = 57/143 (39%), Positives = 79/143 (55%), Gaps = 5/143 (3%)
Query: 1 MSLQWTLIATVLYFEMAFMLLLILPILSTQRLHKILKSKFVQGVKTQAGWYFGCILVILS 60
MSL WTL+ +LY EMA LLL+LP+ S R KI KS + Q F +L+ L
Sbjct: 1 MSLYWTLVFGLLYAEMAVFLLLVLPLPSPIR-RKIFKSVSKLQLSQQFQTGFKILLIFLL 59
Query: 61 LFFLDAIREMRKYAS---PEVKEEAHGHLDAEMQNNMKLFRAQRNFYISGFSLFLWLVIR 117
+ F+D++R + KY+ G +E Q +LF AQRN Y+SGF+LFL LVIR
Sbjct: 60 VLFIDSVRRVYKYSGELQGSGNASLGGGERSEYQTLARLFYAQRNLYLSGFTLFLTLVIR 119
Query: 118 QIIQLIAQQANLLAQNEASMNQA 140
+++ L+ + L EA QA
Sbjct: 120 RVVTLVEELVKLEEL-EALKKQA 141
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 56.4 bits (136), Expect = 4e-09
Identities = 40/175 (22%), Positives = 71/175 (40%), Gaps = 24/175 (13%)
Query: 1 MSLQWTLIATVLYFEMAFMLLLILPILSTQR---LHKILKSKFVQGVKTQAGWYFGCILV 57
M + +TL+ ++L EM +L+LP+ R + SK +G K +
Sbjct: 1 MGIYYTLVFSLLVVEMVMFFILVLPLPKRLRRSLMKLYSTSKVYRGFKH----ILKITFI 56
Query: 58 ILSLFFLDAIREM------RKYASPEVKEEAHGHLDAEMQNNMKLFRAQRNFYISGFSLF 111
+ L F+D+ + + S + A + F AQRN Y+SG +LF
Sbjct: 57 FILLLFIDSWKRVYRVSKEANLYSASINNYAVTRIAVLA----SRFYAQRNMYLSGSALF 112
Query: 112 LWLVI-RQ---IIQLIAQQANLLAQ---NEASMNQARQAAVAAQALLDGPRGKWF 159
L +V+ R + +++ + A + EA + AQ LL+G +
Sbjct: 113 LSIVVMRVMSIVEEMLEENAKKGGKIDKMEADSTDLKARLRKAQILLEGLQKNQE 167
Score = 32.5 bits (74), Expect = 0.45
Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 13/80 (16%)
Query: 529 VQKVVTEQGDK--KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE 586
V++++ E K K DK + +D LK +L K + L+ + N L K
Sbjct: 124 VEEMLEENAKKGGKIDKMEADSTD----LKARLRKAQILLEGLQKNQEELF-------KL 172
Query: 587 YDRLLKEHEKVQKVVTEQGD 606
D+ + E+VQK +++ +
Sbjct: 173 LDKYNELREQVQKESSKKKE 192
>gnl|CDD|216504 pfam01437, PSI, Plexin repeat. A cysteine rich repeat found in
several different extracellular receptors. The function
of the repeat is unknown. Three copies of the repeat are
found Plexin. Two copies of the repeat are found in
mahogany protein. A related C. elegans protein contains
four copies of the repeat. The Met receptor contains a
single copy of the repeat. The Pfam alignment shows 6
conserved cysteine residues that may form three
conserved disulphide bridges, whereas shows 8 conserved
cysteines. The pattern of conservation suggests that
cysteines 5 and 7 (that are not absolutely conserved)
form a disulphide bridge (Personal observation. A
Bateman).
Length = 48
Score = 48.7 bits (116), Expect = 7e-08
Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 3/50 (6%)
Query: 278 DCNTYSSCTQCVSS-DFPCDWCVDGHRCTHDTAENCRNDILITGVSRVGP 326
+C+ Y+SC+ C+++ D C WC RCT C + S P
Sbjct: 1 NCSQYTSCSSCLAARDPYCGWCSSEGRCTR--GTECPRSEGPSWSSSKCP 48
Score = 41.0 bits (96), Expect = 4e-05
Identities = 13/41 (31%), Positives = 17/41 (41%), Gaps = 1/41 (2%)
Query: 445 NCGLCLALPEKYGCGWCESSKRCEIFEQCDKGLEGDSDEIK 485
+C CLA + CGWC S RC +C + K
Sbjct: 7 SCSSCLAARD-PYCGWCSSEGRCTRGTECPRSEGPSWSSSK 46
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 52.6 bits (127), Expect = 7e-08
Identities = 38/125 (30%), Positives = 62/125 (49%), Gaps = 9/125 (7%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDK 543
IK LKE++ E+KK N + + +E RL++ +K ++ V E K D
Sbjct: 29 IKSLKEEI----AEMKK---NEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKD- 80
Query: 544 YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTE 603
Y+K D +K LK +L + E+ELK K S L+ + + V +E D L + E + V +
Sbjct: 81 YEK-DKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQ 139
Query: 604 QGDKK 608
+ K
Sbjct: 140 KTGLK 144
Score = 38.4 bits (90), Expect = 0.004
Identities = 29/114 (25%), Positives = 56/114 (49%), Gaps = 4/114 (3%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
D +K LK +L + E+ELK K S L+ + + V +E D L + E + V ++
Sbjct: 84 DKQSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKTGL 143
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
K+ ++ +++ L E+L K E +L + + +V ++ D +L+E
Sbjct: 144 KNLLLEQ----KLEALNEELEKKEAQLNEVLAAANLDPAALQAVSEKLDDVLEE 193
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 53.1 bits (128), Expect = 3e-07
Identities = 33/133 (24%), Positives = 72/133 (54%), Gaps = 8/133 (6%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVG----KEYDRLLKEHEKVQ 530
K L+ + ++++ LK+KL++ E++L + + A L + + +G +E + LKE E
Sbjct: 542 KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFY 601
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
E D + + ++ E+K+L+E+L K EEL + + L+ + + + K+Y
Sbjct: 602 NEYLELKDAEKELERE--EKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE- 658
Query: 591 LKEHEKVQKVVTE 603
+E+E++++ E
Sbjct: 659 -EEYEELREEYLE 670
Score = 46.6 bits (111), Expect = 3e-05
Identities = 33/125 (26%), Positives = 71/125 (56%), Gaps = 4/125 (3%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
K LE + E+K+L+E+L K EEL + + L+ + + + K+Y +E+E++++
Sbjct: 612 KELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE--EEYEELREEYL 669
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
E ++ + + +E+++ +E++ KT E+LK+E K + + + K +R+ +
Sbjct: 670 EL--SRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELR 727
Query: 595 EKVQK 599
EKV+K
Sbjct: 728 EKVKK 732
Score = 43.9 bits (104), Expect = 2e-04
Identities = 30/142 (21%), Positives = 64/142 (45%), Gaps = 13/142 (9%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKE------KLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E+ E + +E + +K L+ +L E+EL++E+ L+ + D +E
Sbjct: 578 ELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAE 637
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
K E+++K + E K ++ + +E L +L+ EL+ L+ + +
Sbjct: 638 TEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELE-------ELEKRRE 690
Query: 582 SVGKEYDRLLKEHEKVQKVVTE 603
+ K ++L +E E+ +K E
Sbjct: 691 EIKKTLEKLKEELEEREKAKKE 712
Score = 43.9 bits (104), Expect = 3e-04
Identities = 35/139 (25%), Positives = 66/139 (47%), Gaps = 13/139 (9%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E K E E + LE E++ E+L K EEL+++ LK +A +EY +
Sbjct: 242 ELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKA----EEYIK 297
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
L + +E+ + E +K+ + +EI ++E++ + EE K+E+ LK +
Sbjct: 298 LSEFYEEYLDELRE-IEKRLSRL----EEEINGIEERIKELEE--KEERLEE--LKKKLK 348
Query: 582 SVGKEYDRLLKEHEKVQKV 600
+ K + L + HE ++
Sbjct: 349 ELEKRLEELEERHELYEEA 367
Score = 41.6 bits (98), Expect = 0.001
Identities = 27/131 (20%), Positives = 60/131 (45%), Gaps = 8/131 (6%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E KR E+ G+E E++ +E+L + +++LK+ + L+ + + Y+
Sbjct: 311 EIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH----ELYEE 366
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
+ E+++++ +K +K E++ L++ + EEE+ K LK +
Sbjct: 367 AKAKKEELERLKKRLTGLTPEKLEK----ELEELEKAKEEIEEEISKITARIGELKKEIK 422
Query: 582 SVGKEYDRLLK 592
+ K + L K
Sbjct: 423 ELKKAIEELKK 433
Score = 41.2 bits (97), Expect = 0.001
Identities = 30/124 (24%), Positives = 56/124 (45%), Gaps = 14/124 (11%)
Query: 476 GLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE 535
G E + K L E+ + + ++KE + + +E +++LK+ ++ K
Sbjct: 442 GRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIK---- 497
Query: 536 QGDKKDDKYQKGDSDEIKRLKEKLSKTE-EELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
K ++++K L+EKL K EEL+K+ LK + + E L KE
Sbjct: 498 ---------LKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKEL 548
Query: 595 EKVQ 598
EK++
Sbjct: 549 EKLE 552
Score = 40.8 bits (96), Expect = 0.002
Identities = 27/124 (21%), Positives = 62/124 (50%), Gaps = 6/124 (4%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
K L +EI ++E++ + EE K+E+ LK + + K + L + HE ++
Sbjct: 314 KRLSRLEEEINGIEERIKELEE--KEERLEE--LKKKLKELEKRLEELEERHELYEEAKA 369
Query: 535 EQGDKKDDKYQKGDS--DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLK 592
++ + + K + +++++ E+L K +EE+++E + A + KE + ++
Sbjct: 370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIE 429
Query: 593 EHEK 596
E +K
Sbjct: 430 ELKK 433
Score = 39.7 bits (93), Expect = 0.004
Identities = 31/138 (22%), Positives = 76/138 (55%), Gaps = 7/138 (5%)
Query: 463 SSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRL 522
SS+ E+ E+ +K LE + E++ LKE++ + E+EL+ + + L+ + + + + L
Sbjct: 213 SSELPELREELEK-LEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEEL 271
Query: 523 LKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADS 582
KE E++++ V K+ K K ++E +L E + +EL++ + + L+ + +
Sbjct: 272 KKEIEELEEKV------KELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEING 325
Query: 583 VGKEYDRLLKEHEKVQKV 600
+ + L ++ E+++++
Sbjct: 326 IEERIKELEEKEERLEEL 343
Score = 39.3 bits (92), Expect = 0.006
Identities = 23/128 (17%), Positives = 60/128 (46%), Gaps = 15/128 (11%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+ IK +++L + E+ + + L+ + + + KE L + E++ E+ +K+
Sbjct: 193 ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEI-----EELEKEL 247
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV------GKEYDRLLKEHE 595
+ + ++L+EK+ + EE +++ K L+ + + +EY +L + +E
Sbjct: 248 ESLEG----SKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE 303
Query: 596 KVQKVVTE 603
+ + E
Sbjct: 304 EYLDELRE 311
Score = 39.3 bits (92), Expect = 0.007
Identities = 35/149 (23%), Positives = 65/149 (43%), Gaps = 27/149 (18%)
Query: 482 DEIKRLKEKLS--------KTEEELKKEKTNSAALKSQADSVGKEYDRL------LKE-H 526
+++K L+EKL K EE +K K LK + S+ KE ++L L E
Sbjct: 503 EQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELE 562
Query: 527 EKVQKVVTEQGDKKDDKYQKGDSD------EIKRLKE------KLSKTEEELKKEKTNSA 574
+K+ ++ E + + + G +K L+ +L E+EL++E+
Sbjct: 563 KKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELK 622
Query: 575 ALKNQADSVGKEYDRLLKEHEKVQKVVTE 603
L+ + D +E K E+++K + E
Sbjct: 623 KLEEELDKAFEELAETEKRLEELRKELEE 651
Score = 38.1 bits (89), Expect = 0.013
Identities = 35/128 (27%), Positives = 67/128 (52%), Gaps = 9/128 (7%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ--KVVTEQGDKK 540
E+KR++++L + EE+ +K + + + V K+ L+K E + K + E+ K
Sbjct: 460 ELKRIEKELKEIEEKERKLRKE----LRELEKVLKKESELIKLKELAEQLKELEEKLKKY 515
Query: 541 DDKYQKGDSDEIKRLKEKLSKTE---EELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
+ + + ++E ++LKEKL K + + LKKE LK + + K+ D L +E ++
Sbjct: 516 NLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAEL 575
Query: 598 QKVVTEQG 605
K + E G
Sbjct: 576 LKELEELG 583
>gnl|CDD|214655 smart00423, PSI, domain found in Plexins, Semaphorins and
Integrins.
Length = 47
Score = 46.4 bits (110), Expect = 5e-07
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 278 DCNTYSSCTQCVSSDFP-CDWCVDGHRCTHDTAENCRNDI 316
C+ Y+SC++C+ + P C WC RCT E C +
Sbjct: 1 RCSKYTSCSECLLARDPYCAWCSSQGRCTS--GERCDSRR 38
Score = 37.9 bits (88), Expect = 4e-04
Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 2/40 (5%)
Query: 439 CRDLANNCGLCLALPEKYGCGWCESSKRCEIFEQCDKGLE 478
C +C CL + Y C WC S RC E+CD +
Sbjct: 2 CS-KYTSCSECLLARDPY-CAWCSSQGRCTSGERCDSRRQ 39
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 49.7 bits (119), Expect = 4e-06
Identities = 33/158 (20%), Positives = 80/158 (50%), Gaps = 10/158 (6%)
Query: 462 ESSKRCEIFEQCDKGLEGDSD----EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK 517
E + E ++ L+ + + EI L+E+L + E EL++ + LK + +++ +
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKE 337
Query: 518 EYDR---LLKEHEKVQKVVTEQGDKKDDK---YQKGDSDEIKRLKEKLSKTEEELKKEKT 571
E + LL+E E++ + E ++ ++K + + + L+E+L++ E EL + +
Sbjct: 338 ELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRN 397
Query: 572 NSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
LK + +S+ + +RL + E +++ + E + +
Sbjct: 398 ELEELKREIESLEERLERLSERLEDLKEELKELEAELE 435
Score = 48.9 bits (117), Expect = 6e-06
Identities = 32/133 (24%), Positives = 72/133 (54%), Gaps = 5/133 (3%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-QGDKK 540
++K L+++L + EEEL + + L+ + + KE + L E E++++ + E Q +
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELL 291
Query: 541 DDKYQKGD-SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR---LLKEHEK 596
+ K + + EI L+E+L + E EL++ + LK + +++ +E + LL+E E+
Sbjct: 292 ELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQ 351
Query: 597 VQKVVTEQGDKKD 609
+ + E ++ +
Sbjct: 352 LLAELEEAKEELE 364
Score = 48.9 bits (117), Expect = 7e-06
Identities = 24/121 (19%), Positives = 64/121 (52%), Gaps = 1/121 (0%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDK 543
+ + KE+ + E +L++ + N L+ + + K+ ++L ++ EK ++ + + ++ +
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELE 226
Query: 544 YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTE 603
++K L+++L + EEEL + + L+ + + KE + L E E++++ + E
Sbjct: 227 LALL-LAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEE 285
Query: 604 Q 604
Sbjct: 286 L 286
Score = 47.8 bits (114), Expect = 2e-05
Identities = 28/144 (19%), Positives = 73/144 (50%), Gaps = 14/144 (9%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV- 529
E+ ++ L +E++ L+E+L + E+E+++ K+ L+ + + + +E L +E E++
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301
Query: 530 -----QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
+ E+ + + ++ ++ ++ LKEK+ +EEL++ +T L+
Sbjct: 302 GEISLLRERLEELENELEELEE----RLEELKEKIEALKEELEERETLLEELEQLL---- 353
Query: 585 KEYDRLLKEHEKVQKVVTEQGDKK 608
E + +E E+ + E+ ++
Sbjct: 354 AELEEAKEELEEKLSALLEELEEL 377
Score = 46.2 bits (110), Expect = 5e-05
Identities = 29/120 (24%), Positives = 67/120 (55%), Gaps = 5/120 (4%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
DE+K L+E+ + EEEL++ ++ A LK + + + + + L + E+++ + E ++
Sbjct: 876 EDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEEL 935
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKT-NSAALKNQADSVGKEYDRLLKEHEKVQK 599
+++Y+ E++R E+L EEE++ N A+ + + V + Y+ L + E +++
Sbjct: 936 EEEYEDTLETELEREIERL---EEEIEALGPVNLRAI-EEYEEVEERYEELKSQREDLEE 991
Score = 45.5 bits (108), Expect = 9e-05
Identities = 28/124 (22%), Positives = 63/124 (50%), Gaps = 9/124 (7%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+ ++ L+E+L EE L K K L+ + ++ +E + L +E E+ ++ + + +
Sbjct: 758 ERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELE 817
Query: 542 DKYQKGDS--DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
Q+ + EI+ L+E++ + EE+L L+ + + + KE + L +E E+++
Sbjct: 818 SLEQRRERLEQEIEELEEEIEELEEKL-------DELEEELEELEKELEELKEELEELEA 870
Query: 600 VVTE 603
E
Sbjct: 871 EKEE 874
Score = 43.5 bits (103), Expect = 4e-04
Identities = 28/133 (21%), Positives = 66/133 (49%), Gaps = 9/133 (6%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRL---LKEHEKVQKVV 533
LE + + L+E+L++ E EL + + LK + +S+ + +RL L++ ++ K +
Sbjct: 371 LEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKEL 430
Query: 534 TEQGDKKDDKYQKGDS------DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
+ ++ + ++ + ++++ L+++L + E EL + + L+ + S+
Sbjct: 431 EAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARL 490
Query: 588 DRLLKEHEKVQKV 600
DRL E Q V
Sbjct: 491 DRLEAEQRASQGV 503
Score = 42.8 bits (101), Expect = 6e-04
Identities = 29/131 (22%), Positives = 75/131 (57%), Gaps = 2/131 (1%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
E+ + LE +++ LK +L+ EEEL++ ++ L+ + + + +E + L + E+++
Sbjct: 705 EELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELE 764
Query: 531 KVVTEQGDK--KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
+ + + K + + ++ + L+E+L + EEEL++ + AL+ + +S+ + +
Sbjct: 765 EELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRE 824
Query: 589 RLLKEHEKVQK 599
RL +E E++++
Sbjct: 825 RLEQEIEELEE 835
Score = 41.6 bits (98), Expect = 0.001
Identities = 32/142 (22%), Positives = 75/142 (52%), Gaps = 9/142 (6%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAAL-------KSQADSVGKEYDRLLKEHEKV 529
L+ +E+ L+ +L K EEELK K +L + Q + + ++ + L +E +
Sbjct: 669 LKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAAL 728
Query: 530 QKVVTE-QGDKKD-DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
++ + + Q ++ ++ + +E++ L+E+L + EEEL+ + A LK + + + ++
Sbjct: 729 EEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKR 788
Query: 588 DRLLKEHEKVQKVVTEQGDKKD 609
L +E E++++ + E + D
Sbjct: 789 QALQEELEELEEELEEAERRLD 810
Score = 39.7 bits (93), Expect = 0.005
Identities = 25/110 (22%), Positives = 56/110 (50%), Gaps = 11/110 (10%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSA-------ALKSQADSVGKEYDRL---LKEH 526
E EI+ L+E++ + EE+L + + LK + + + E + L LKE
Sbjct: 823 RERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKEL 882
Query: 527 EKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
E+ ++ + E+ + + + + +EI++L+E+L + E +L++ + L
Sbjct: 883 EEEKEELEEELRELESELAE-LKEEIEKLRERLEELEAKLERLEVELPEL 931
Score = 37.4 bits (87), Expect = 0.027
Identities = 34/141 (24%), Positives = 78/141 (55%), Gaps = 11/141 (7%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
E+ E+ D LE + +E+++ E+L + EEL+ EK + + +E + L+E E
Sbjct: 839 ELEEKLD-ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELE 897
Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE--KTNSAALKNQADSVGK 585
+ E+ +K ++ ++ ++ +++RL+ +L + EEEL++E T L+ + + + +
Sbjct: 898 SELAELKEEIEKLRERLEELEA-KLERLEVELPELEEELEEEYEDTLETELEREIERLEE 956
Query: 586 EYDRL-------LKEHEKVQK 599
E + L ++E+E+V++
Sbjct: 957 EIEALGPVNLRAIEEYEEVEE 977
Score = 35.5 bits (82), Expect = 0.10
Identities = 24/117 (20%), Positives = 56/117 (47%), Gaps = 13/117 (11%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKS-QADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
+E+++ EKL + E+ ++ + A L+ + + + L KE E++++ +
Sbjct: 196 EELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEEL------- 248
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
+ + +E++ E+ K EELK E L+ + + + +E L +E E++
Sbjct: 249 --SRLEEELEELQEELEEAEKEIEELKSELEE---LREELEELQEELLELKEEIEEL 300
Score = 30.5 bits (69), Expect = 3.4
Identities = 27/137 (19%), Positives = 52/137 (37%), Gaps = 30/137 (21%)
Query: 458 CGWCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKT-NSAALKSQADSVG 516
E E ++ + E++R E+L EEE++ N A++ + + V
Sbjct: 921 LERLEVELPELEEELEEEYEDTLETELEREIERL---EEEIEALGPVNLRAIE-EYEEVE 976
Query: 517 KEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
+ Y+ L + E +++ ++L E + EEL KEK
Sbjct: 977 ERYEELKSQREDLEE-------------------AKEKLLEVI----EELDKEKRE--RF 1011
Query: 577 KNQADSVGKEYDRLLKE 593
K D + + + + KE
Sbjct: 1012 KETFDKINENFSEIFKE 1028
Score = 30.1 bits (68), Expect = 4.0
Identities = 10/58 (17%), Positives = 33/58 (56%)
Query: 552 IKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
+ + KE+ + E +L++ + N L++ + + K+ ++L ++ EK ++ + + ++
Sbjct: 167 VSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRE 224
Score = 28.9 bits (65), Expect = 9.5
Identities = 16/63 (25%), Positives = 32/63 (50%)
Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
E+ ++ LE ++++ L+++L + E EL + + L+ + S+ DRL E
Sbjct: 441 LEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRAS 500
Query: 530 QKV 532
Q V
Sbjct: 501 QGV 503
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 46.7 bits (110), Expect = 4e-05
Identities = 41/153 (26%), Positives = 75/153 (49%), Gaps = 12/153 (7%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK----EKTNSAALKSQADSVGK 517
E K+ E ++ ++ + + E + E+ K EE KK EK + ALK +A+ K
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Query: 518 EYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALK 577
+ KE E+ +K + KK ++ K ++E K+ E+ K EE KK++ K
Sbjct: 1704 AEELKKKEAEEKKKA---EELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE----K 1756
Query: 578 NQADSVGKEYDRLLKEHEKVQKVVTEQG-DKKD 609
+ + KE ++ +E K ++ V E+ D++D
Sbjct: 1757 KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEED 1789
Score = 46.7 bits (110), Expect = 4e-05
Identities = 34/139 (24%), Positives = 58/139 (41%), Gaps = 14/139 (10%)
Query: 481 SDEIKRLKEKLSKTEEELKK---EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
+DE K+ E+ K + KK E +A + +E + + E+
Sbjct: 1317 ADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376
Query: 538 DKKDDKYQKG-----DSDEIKRLKEKLSKTEEELKK---EKTNSAALKNQADSVGKEYDR 589
KK D +K +DE K+ E+ K +ELKK K + K +A+ K+ D
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK-KKADE 1435
Query: 590 LLKEHEKVQKVVTEQGDKK 608
K+ E+ +K ++ KK
Sbjct: 1436 AKKKAEEAKK--ADEAKKK 1452
Score = 45.5 bits (107), Expect = 8e-05
Identities = 34/131 (25%), Positives = 63/131 (48%), Gaps = 12/131 (9%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
+ E + + + + E E +EK +A K + K+ D K+ E+ +K ++ KK
Sbjct: 1344 AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK--KKADAAKKKAEEKKK--ADEAKKK 1399
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKK---EKTNSAALKNQADSVGKEYDRLLKEHEKV 597
++ +K +DE+K+ K +E KK EK + K +A+ K+ D K+ E+
Sbjct: 1400 AEEDKK-KADELKK-AAAAKKKADEAKKKAEEKKKADEAKKKAEE-AKKADEAKKKAEEA 1456
Query: 598 QKVVTEQGDKK 608
+K E+ KK
Sbjct: 1457 KK--AEEAKKK 1465
Score = 44.0 bits (103), Expect = 3e-04
Identities = 37/136 (27%), Positives = 63/136 (46%), Gaps = 13/136 (9%)
Query: 481 SDEIKRLKEKLSKTEEELKK--EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
+DE K+ E+ K EE KK E + K +A+ K+ D K+ E+ +K E
Sbjct: 1446 ADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE-AKKADEAKKKAEEAKKKADEAKK 1504
Query: 539 KKDDKYQKGDSDEIKRLKEK-----LSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
+ K + +DE K+ +E K EE K ++ A K +AD + K + LK+
Sbjct: 1505 AAEAKKK---ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE--LKK 1559
Query: 594 HEKVQKVVTEQGDKKD 609
E+ +K + ++D
Sbjct: 1560 AEEKKKAEEAKKAEED 1575
Score = 42.8 bits (100), Expect = 6e-04
Identities = 29/128 (22%), Positives = 57/128 (44%), Gaps = 6/128 (4%)
Query: 481 SDEIKRLKEKLSKTEEELKK---EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
+DE K+ +EK K +E KK E + K +A+ K+ D K+ E+ +K
Sbjct: 1292 ADEAKKAEEK--KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAK 1349
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL-KNQADSVGKEYDRLLKEHEK 596
+ + + ++ E K + K E + K + A K +AD K+ + K+ ++
Sbjct: 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE 1409
Query: 597 VQKVVTEQ 604
++K +
Sbjct: 1410 LKKAAAAK 1417
Score = 42.4 bits (99), Expect = 7e-04
Identities = 34/133 (25%), Positives = 63/133 (47%), Gaps = 9/133 (6%)
Query: 478 EGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
E +DE + +EK E++ ++ K + A K +A+ K+ D K+ E+ +K E
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEK-KKADEAKKKAEEDKKKADELK 1411
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKK--EKTNSAALKNQADSVGKEYDRLLKEHE 595
K + +DE K+ E+ K +E KK E + K +A+ K+ + K+ E
Sbjct: 1412 KAAAAKKK---ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE-AKKAEEAKKKAE 1467
Query: 596 KVQKVVTEQGDKK 608
+ +K ++ KK
Sbjct: 1468 EAKK--ADEAKKK 1478
Score = 42.4 bits (99), Expect = 7e-04
Identities = 39/154 (25%), Positives = 71/154 (46%), Gaps = 10/154 (6%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKE---KTNSAALKSQADSVGKE 518
E ++ E+ + ++ + ++E K+ +E K EE K E K K +A+ K
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649
Query: 519 YDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK---EKTNSAA 575
+ LK+ E+ K+ + KK ++ K ++E K+ +E K E LKK E +
Sbjct: 1650 EE--LKKAEEENKIKAAEEAKKAEE-DKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEE 1706
Query: 576 LKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
LK + K+ + L K E+ K+ E+ K+
Sbjct: 1707 LKKKEAEEKKKAEELKKA-EEENKIKAEEAKKEA 1739
Score = 41.3 bits (96), Expect = 0.002
Identities = 32/127 (25%), Positives = 63/127 (49%), Gaps = 11/127 (8%)
Query: 482 DEIKRLKEKLSKTEEELKKE----KTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
+++K+ + + K EELKK K +A +A+ K+ + K E +K E
Sbjct: 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK-AAEAL 1694
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE----KTNSAALKNQADSVGKEYDRLLKE 593
K+ ++ +K ++E+K+ + + K EELKK K + K +A+ K+ + K+
Sbjct: 1695 KKEAEEAKK--AEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKD 1752
Query: 594 HEKVQKV 600
E+ +K+
Sbjct: 1753 EEEKKKI 1759
Score = 40.9 bits (95), Expect = 0.002
Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 5/142 (3%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
E+ K E E K+ ++L K EE K E+ A +A+ R +E +K +
Sbjct: 1531 EEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE 1590
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE---KTNSAALKNQADSVGKEY 587
+ E+ K ++ +K ++E K+ +E K EE K E K K +A+ K
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650
Query: 588 DRLLKEHEKVQKVVTEQGDKKD 609
+ LK+ E+ K+ + KK
Sbjct: 1651 E--LKKAEEENKIKAAEEAKKA 1670
Score = 39.0 bits (90), Expect = 0.009
Identities = 37/138 (26%), Positives = 61/138 (44%), Gaps = 15/138 (10%)
Query: 480 DSDEIKRLKEKLSKTEEELKK---EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
++ E K+ + K K + KK EK + K +A+ K+ D L K +K ++
Sbjct: 1367 EAAEKKKEEAK--KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKK--ADE 1422
Query: 537 GDKKDDKYQKGD-----SDEIKRLKEKLSKTEEELKKEK-TNSAALKNQADSVGKEYDRL 590
KK ++ +K D ++E K+ E K EE K E+ A +AD K+ +
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA 1482
Query: 591 LKEHEKVQKVVTEQGDKK 608
K E +K E+ KK
Sbjct: 1483 KKADEAKKK--AEEAKKK 1498
Score = 38.6 bits (89), Expect = 0.011
Identities = 32/121 (26%), Positives = 57/121 (47%), Gaps = 12/121 (9%)
Query: 481 SDEIKRLKEKLSKTEEELKK--EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
+DE K+ E+ K +E KK E + K +A+ K+ + K+ E+ +K ++
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE-AKKAEEAKKKAEEAKK--ADEAK 1476
Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKK---EKTNSAALKNQADSVGKEYDRLLKEHE 595
KK ++ +K +DE K+ E+ K +E KK K + K ++ K+ D K E
Sbjct: 1477 KKAEEAKK--ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEA--KKADEAKKAEE 1532
Query: 596 K 596
Sbjct: 1533 A 1533
Score = 37.0 bits (85), Expect = 0.032
Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 15/138 (10%)
Query: 481 SDEIKRLKEK-----LSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE 535
+DE K+ +E K EE K ++ A K +AD + K + LK+ E+ +K
Sbjct: 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE--LKKAEEKKKAEEA 1569
Query: 536 QGDKKDDKYQKGDSDEIKRLKEK----LSKTEEELKKEKTNSAALKNQADSVGKEYDRLL 591
+ ++D ++E K+ +E + K EE KK K A +A +E L
Sbjct: 1570 KKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEE----L 1625
Query: 592 KEHEKVQKVVTEQGDKKD 609
K+ E+ +K V + K+
Sbjct: 1626 KKAEEEKKKVEQLKKKEA 1643
Score = 36.3 bits (83), Expect = 0.057
Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 21/166 (12%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEE-----ELKK---EKTNSAALKSQAD 513
E+ K+ E ++ D+ + ++E K+ +E K EE E KK E + K +A+
Sbjct: 1435 EAKKKAEEAKKADE-AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493
Query: 514 SVGKEYDRLLKEHE---KVQKVVTEQGDKKDDKYQKGD----SDEIKRLKEK-----LSK 561
K+ D K E K + + KK D+ +K + +DE K+ +EK L K
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553
Query: 562 TEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDK 607
EE K E+ A +A+ R +E +K ++ E+ K
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599
Score = 35.9 bits (82), Expect = 0.068
Identities = 30/128 (23%), Positives = 53/128 (41%), Gaps = 4/128 (3%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+E K+ E+ K EEE E+ A ++ +E K ++ + KK
Sbjct: 1233 EEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKA 1292
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKE--KTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
D+ +K ++E K+ E K EE K + K + K +AD+ K+ + K E +
Sbjct: 1293 DEAKK--AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKA 1350
Query: 600 VVTEQGDK 607
D+
Sbjct: 1351 EAEAAADE 1358
Score = 35.9 bits (82), Expect = 0.089
Identities = 28/115 (24%), Positives = 47/115 (40%), Gaps = 21/115 (18%)
Query: 493 KTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGD---- 548
K EE K ++ A K +AD K+ E+ +K ++ KK ++ +K D
Sbjct: 1273 KAEEARKADELKKAEEKKKADE--------AKKAEEKKKA--DEAKKKAEEAKKADEAKK 1322
Query: 549 -SDEIKRLKEKLSKTEE------ELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
++E K+ + K E E K + +AA + +A E KE K
Sbjct: 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK 1377
Score = 35.5 bits (81), Expect = 0.10
Identities = 32/142 (22%), Positives = 63/142 (44%), Gaps = 7/142 (4%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E K+ E ++ ++ + ++E K+ E+ K EE KK++ + K+ +
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
+ KE E V + E+ D++D+K + +IK + + + E K+ K D
Sbjct: 1773 IRKEKEAV---IEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMED 1829
Query: 582 SVGKEY----DRLLKEHEKVQK 599
S KE + L+E + +K
Sbjct: 1830 SAIKEVADSKNMQLEEADAFEK 1851
Score = 35.1 bits (80), Expect = 0.15
Identities = 34/146 (23%), Positives = 70/146 (47%), Gaps = 16/146 (10%)
Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTE-EELKKEKTNSAALKSQADSVGKEYDRLL 523
++ E ++ + E +E+ +L E+ K + EE KK + K +A+ + K +
Sbjct: 1581 RKAEEAKKAE---EARIEEVMKLYEEEKKMKAEEAKKAEEA----KIKAEELKKA-EEEK 1632
Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
K+ E+++K E+ K ++ + + ++IK + +K EE +K + K +
Sbjct: 1633 KKVEQLKKKEAEEKKKAEELKKAEEENKIK--AAEEAKKAEE---DKKKAEEAKKAEEDE 1687
Query: 584 GKEYDRLLKEHEKVQKVVTEQGDKKD 609
K + L KE E+ +K E+ KK+
Sbjct: 1688 KKAAEALKKEAEEAKKA--EELKKKE 1711
Score = 32.8 bits (74), Expect = 0.71
Identities = 30/142 (21%), Positives = 58/142 (40%), Gaps = 6/142 (4%)
Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH-EK 528
E+ K E E + E + K EE K + A + + + ++++ H +
Sbjct: 1208 AEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFAR 1267
Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELK-KEKTNSAALKNQADSVGKEY 587
Q + + +K D+ +K E K+ ++ K EE+ K E A +AD K+
Sbjct: 1268 RQAAIKAEEARKADELKKA---EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKA 1324
Query: 588 DRLLKEHEKVQKVVTEQGDKKD 609
+ K+ + +K E+ K
Sbjct: 1325 EEAKKKADAAKK-KAEEAKKAA 1345
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 45.8 bits (109), Expect = 6e-05
Identities = 26/135 (19%), Positives = 66/135 (48%), Gaps = 12/135 (8%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK-------EYDRLLKEHEKVQKVV 533
+E R+KEK+ + E E+ + + A + + + + E D+LL E E++++ +
Sbjct: 286 EEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREI 345
Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
E+ ++D ++E LKE+L EL++ A +++ ++ ++L +E
Sbjct: 346 EEERKRRDKL-----TEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400
Query: 594 HEKVQKVVTEQGDKK 608
++++ + ++
Sbjct: 401 INELKRELDRLQEEL 415
Score = 44.7 bits (106), Expect = 1e-04
Identities = 23/132 (17%), Positives = 62/132 (46%), Gaps = 9/132 (6%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
++E LKE+L EL++ A + + ++ ++L +E ++++ + ++
Sbjct: 356 TEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEEL 415
Query: 541 DDKYQKGD--SDEIKRLKEKLSKTEE-------ELKKEKTNSAALKNQADSVGKEYDRLL 591
++ + I ++ K+++ EE E+KK++ L +E L
Sbjct: 416 QRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLK 475
Query: 592 KEHEKVQKVVTE 603
+E+++V+K +++
Sbjct: 476 EEYDRVEKELSK 487
Score = 44.7 bits (106), Expect = 2e-04
Identities = 25/126 (19%), Positives = 62/126 (49%), Gaps = 12/126 (9%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
++ + E++ +E+L+K E E+ K L ++ + + +E + K +K+
Sbjct: 304 ASLERSIAEKERELEDAEERLAKLEAEIDK-------LLAEIEELEREIEEERKRRDKLT 356
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
+ E ++ +D E++ + ++ ++T +ELK + LK + + + +E DRL
Sbjct: 357 EEYAELKEELEDLRA-----ELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL 411
Query: 591 LKEHEK 596
+E ++
Sbjct: 412 QEELQR 417
Score = 42.0 bits (99), Expect = 0.001
Identities = 28/140 (20%), Positives = 63/140 (45%), Gaps = 21/140 (15%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALK-----SQADSVGKEYDRLLKE 525
++ +E E+K L+ ++ + EE+L K + L+ S+ + E +L +E
Sbjct: 747 SSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEE 806
Query: 526 HEKVQKVV--TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
+++ + EQ ++ RL + E+E+++ + LK Q S+
Sbjct: 807 VSRIEARLREIEQ--------------KLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSI 852
Query: 584 GKEYDRLLKEHEKVQKVVTE 603
KE + L + E++++ + E
Sbjct: 853 EKEIENLNGKKEELEEELEE 872
Score = 40.4 bits (95), Expect = 0.003
Identities = 28/134 (20%), Positives = 64/134 (47%), Gaps = 25/134 (18%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
+ + ++ +L+ EEEL+K L + + K + + + E++ K + + G
Sbjct: 238 QKEAIERQLASLEEELEK-------LTEEISELEKRLEEIEQLLEELNKKIKDLG----- 285
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKNQADSVGK------EYDRLLKEHE 595
+E R+KEK+ + E E+ E++ + + D+ + E D+LL E E
Sbjct: 286 ------EEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIE 339
Query: 596 KVQKVVTEQGDKKD 609
++++ + E+ ++D
Sbjct: 340 ELEREIEEERKRRD 353
Score = 38.9 bits (91), Expect = 0.008
Identities = 23/138 (16%), Positives = 61/138 (44%), Gaps = 7/138 (5%)
Query: 463 SSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTN-SAALKSQADSVGKEYDR 521
+S E+ E+ + + ++ +++ L + +++K +K + + E
Sbjct: 247 ASLEEEL-EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIAS 305
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
L E+ + + +++ K ++ EI +L ++ + E E+++E+ L +
Sbjct: 306 L----ERSIAEKERELEDAEERLAKLEA-EIDKLLAEIEELEREIEEERKRRDKLTEEYA 360
Query: 582 SVGKEYDRLLKEHEKVQK 599
+ +E + L E E+V K
Sbjct: 361 ELKEELEDLRAELEEVDK 378
Score = 38.1 bits (89), Expect = 0.015
Identities = 33/136 (24%), Positives = 62/136 (45%), Gaps = 2/136 (1%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
E ++ L EI +L ++ + E E+++E+ L + + +E + L E E+V
Sbjct: 318 EDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVD 377
Query: 531 KVVTEQGDKKDDKYQKGD--SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
K E D+ D +K + EI LK +L + +EEL++ A L + + +
Sbjct: 378 KEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKIN 437
Query: 589 RLLKEHEKVQKVVTEQ 604
L +E E + +Q
Sbjct: 438 ELEEEKEDKALEIKKQ 453
Score = 37.7 bits (88), Expect = 0.021
Identities = 24/86 (27%), Positives = 41/86 (47%), Gaps = 5/86 (5%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
EI+ L K + EEEL++ + L+S+ + KE D L + ++++ + E + +
Sbjct: 854 KEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIE 913
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELK 567
K + LK KL EEEL
Sbjct: 914 KK-----RKRLSELKAKLEALEEELS 934
Score = 35.4 bits (82), Expect = 0.097
Identities = 23/118 (19%), Positives = 49/118 (41%), Gaps = 1/118 (0%)
Query: 489 EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGD 548
K K EEL++ + N L D ++ +RL +E EK ++ +K +
Sbjct: 170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALL-KEKREYEGYEL 228
Query: 549 SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGD 606
E + L+ + E +L + L + + K + + + E++ K + + G+
Sbjct: 229 LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGE 286
Score = 34.3 bits (79), Expect = 0.25
Identities = 23/130 (17%), Positives = 44/130 (33%), Gaps = 9/130 (6%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
E ++ ++++L LSK E+EL K ++ + + +E + +
Sbjct: 444 EDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ---AR 500
Query: 531 KVVTEQGDKKDDKYQKGDSDE-IKRLKEKLSKTEEELKK--EKTNSAALKN---QADSVG 584
+ + S + + +L E E L N + D+V
Sbjct: 501 ASEERVRGGRAVEEVLKASIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNVVVEDDAVA 560
Query: 585 KEYDRLLKEH 594
KE LLK
Sbjct: 561 KEAIELLKRR 570
Score = 32.3 bits (74), Expect = 0.79
Identities = 26/131 (19%), Positives = 63/131 (48%), Gaps = 10/131 (7%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
+ L E L+ +L + E ++++ L++Q + K L + E ++
Sbjct: 878 RDLESRLGDLKKERDELEAQLRELERKIEE-------LEAQIEKKRKRLSELKAKLEALE 930
Query: 531 KVVTEQGD-KKDDKYQKGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKNQADSVGKEYD 588
+ ++E D K +D+ + ++ ++ +L + EEE++ E N A++ + + V K D
Sbjct: 931 EELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPVNMLAIQ-EYEEVLKRLD 989
Query: 589 RLLKEHEKVQK 599
L ++ K+++
Sbjct: 990 ELKEKRAKLEE 1000
Score = 32.3 bits (74), Expect = 0.85
Identities = 18/103 (17%), Positives = 48/103 (46%), Gaps = 3/103 (2%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
+G S ++L + E LK+E ++ + + ++ E + L + + K+
Sbjct: 663 RGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR--KIGE 720
Query: 535 EQGDKKD-DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
+ + + ++ ++ + ++ L+E LS E+E++ K+ L
Sbjct: 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKEL 763
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 45.5 bits (108), Expect = 7e-05
Identities = 24/122 (19%), Positives = 57/122 (46%), Gaps = 7/122 (5%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+ + L+ +L K + + E+ AAL+ + V + + +E ++++ +
Sbjct: 630 EAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENE-------EQ 682
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVV 601
+ + + ++++ E+L + EEL K+ L + +S E + L KE EK++K +
Sbjct: 683 LEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELESRKAELEELKKELEKLEKAL 742
Query: 602 TE 603
Sbjct: 743 EL 744
Score = 44.8 bits (106), Expect = 1e-04
Identities = 29/135 (21%), Positives = 69/135 (51%), Gaps = 4/135 (2%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
+ S+ E E + LE + E+K+L+E + EEE +++ AL+ + + +E +R
Sbjct: 193 QLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIE--ALEERLAELEEEKER 250
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
L + ++ ++ + + + K ++ + E++RL E+L + E L++ + L+ + +
Sbjct: 251 LEELKARLLEIESLELEA--LKIREEELRELERLLEELEEKIERLEELEREIEELEEELE 308
Query: 582 SVGKEYDRLLKEHEK 596
+ + L + EK
Sbjct: 309 GLRALLEELEELLEK 323
Score = 40.9 bits (96), Expect = 0.002
Identities = 29/122 (23%), Positives = 64/122 (52%), Gaps = 8/122 (6%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+E++ L EKL EE L+K + L+S+ +E E K+ + ++ +++
Sbjct: 315 EELEELLEKLKSLEERLEKLEEKLEKLESE----LEELAEEKNELAKLLEERLKELEERL 370
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVV 601
++ +K E+++ E+L + EE +++ K A L + + +E + L KE E++++ +
Sbjct: 371 EELEK----ELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELEREL 426
Query: 602 TE 603
E
Sbjct: 427 EE 428
Score = 39.0 bits (91), Expect = 0.008
Identities = 32/142 (22%), Positives = 64/142 (45%), Gaps = 24/142 (16%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTN--------SAALKSQADSVGKEYDRLLKE--- 525
LE + + ++ +L + +E L++ K ALK + + + +E +RLL+E
Sbjct: 230 LEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEEL-RELERLLEELEE 288
Query: 526 ---------HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK---EKTNS 573
E + +G + + + +++K L+E+L K EE+L+K E
Sbjct: 289 KIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEEL 348
Query: 574 AALKNQADSVGKEYDRLLKEHE 595
A KN+ + +E + L+E
Sbjct: 349 AEEKNELAKLLEERLKELEERL 370
Score = 37.0 bits (86), Expect = 0.031
Identities = 23/134 (17%), Positives = 63/134 (47%), Gaps = 13/134 (9%)
Query: 468 EIF--EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
E+F E+ +K E + IK K K+ + E +L + + + L +E
Sbjct: 162 ELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIE----------DLLEALEEE 211
Query: 526 HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
++++K+ E ++++++ + + + ++ +L + +E L++ K +++ K
Sbjct: 212 LKELKKL-EEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALK 270
Query: 586 EYDRLLKEHEKVQK 599
+ L+E E++ +
Sbjct: 271 IREEELRELERLLE 284
Score = 36.7 bits (85), Expect = 0.045
Identities = 29/130 (22%), Positives = 62/130 (47%), Gaps = 9/130 (6%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
LE E++ L+E+LS+ +EE + + L+ + + +E LL+ E +++ + E+
Sbjct: 476 LELYELELEELEEELSREKEE-AELREEIEELEKELRELEEELIELLELEEALKEELEEK 534
Query: 537 GDK-------KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
+K ++ +K ++K +L +ELK+ L+ + + + +E
Sbjct: 535 LEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEEL-EELRE 593
Query: 590 LLKEHEKVQK 599
LKE +K K
Sbjct: 594 RLKELKKKLK 603
Score = 34.4 bits (79), Expect = 0.20
Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 4/74 (5%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
+ + E E+ + LE +E+++L+E+L EEL K+ L + +S E +
Sbjct: 675 QRIENEEQLEEKLEELEQLEEELEQLREEL----EELLKKLGEIEQLIEELESRKAELEE 730
Query: 522 LLKEHEKVQKVVTE 535
L KE EK++K +
Sbjct: 731 LKKELEKLEKALEL 744
Score = 33.2 bits (76), Expect = 0.42
Identities = 20/108 (18%), Positives = 47/108 (43%), Gaps = 5/108 (4%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAAL--KSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
+ LKE+L + E+L+ L K Q + +E +L +Q++ + +
Sbjct: 523 LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDR---LQELKELLEELR 579
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
+ +K + +E++ ++L K +EL++ + L + E +
Sbjct: 580 LLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENE 627
Score = 31.3 bits (71), Expect = 1.7
Identities = 26/128 (20%), Positives = 54/128 (42%), Gaps = 13/128 (10%)
Query: 485 KRLKEKLSKTEEELKKEKTN--SAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
++LKE+L + E+ L++ K L +E LKE +K K + E+ + ++
Sbjct: 555 QQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEE 614
Query: 543 KYQKGDSD----EIKRLKEKLSKTEEELKKEKTNSAA-------LKNQADSVGKEYDRLL 591
Q + E++ +E+L E+L + L+ + + + E R L
Sbjct: 615 LLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRREL 674
Query: 592 KEHEKVQK 599
+ E ++
Sbjct: 675 QRIENEEQ 682
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 45.4 bits (107), Expect = 7e-05
Identities = 32/144 (22%), Positives = 68/144 (47%), Gaps = 1/144 (0%)
Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK 524
R ++ + ++ + D+ ++LKE+L K + + K + + + AD E + +
Sbjct: 196 FRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQ 255
Query: 525 EHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSA-ALKNQADSV 583
E + + K K+D + + EI++ + ++ K +EE K K + A LK ++ +
Sbjct: 256 EAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKAS 315
Query: 584 GKEYDRLLKEHEKVQKVVTEQGDK 607
KE + E +K ++ V E K
Sbjct: 316 EKEAEDKELEAQKKREPVAEDLQK 339
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 44.2 bits (105), Expect = 1e-04
Identities = 25/112 (22%), Positives = 50/112 (44%), Gaps = 6/112 (5%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
D E+ R KEKL K +E+ + L+ + + + + L + ++ + E +K
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEA-EK 260
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLL 591
K ++ + EI++LKE+L + T K +++ YDR +
Sbjct: 261 KLEQCRGFTFKEIEKLKEQLKLLQ-----SLTGWKITKLSGNTLSMTYDREI 307
Score = 42.7 bits (101), Expect = 3e-04
Identities = 24/100 (24%), Positives = 42/100 (42%), Gaps = 12/100 (12%)
Query: 484 IKRLKEKLSKTEEELKKEKTN----SAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
++ LKE L + E LK++ L S + D L +E +++++ E D
Sbjct: 142 LEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDC 201
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
D E+ R KEKL K +E+ + L+ +
Sbjct: 202 --------DPTELDRAKEKLKKLLQEIMIKVKKLEELEEE 233
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 42.7 bits (101), Expect = 4e-04
Identities = 13/86 (15%), Positives = 42/86 (48%), Gaps = 1/86 (1%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
D E+++ +++L ++ +++ L+ + + + L + ++ + + E +K
Sbjct: 205 DPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEA-EK 263
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEE 565
++ + + EI +LK K+S ++
Sbjct: 264 IREECRGWSAKEISKLKAKVSLLQKL 289
Score = 41.6 bits (98), Expect = 8e-04
Identities = 15/104 (14%), Positives = 45/104 (43%), Gaps = 12/104 (11%)
Query: 484 IKRLKEKLSKTEEELKKEKT----NSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
++ LKE L + E +K+++ + + S + K+ L +E ++++ E
Sbjct: 145 LEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLC 204
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
D E+++ +++L ++ +++ L+ + +
Sbjct: 205 --------DPLELEKARQELRSLSVKISEKRKQLEELQQELQEL 240
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 42.6 bits (100), Expect = 6e-04
Identities = 28/133 (21%), Positives = 58/133 (43%), Gaps = 6/133 (4%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQK----VVTE 535
D +++K +++L K E+ELKKEK L+ + + R +E E+ Q E
Sbjct: 313 DEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEI--KREAEEEEEEQLEKLQEKLE 370
Query: 536 QGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHE 595
Q +++ +K +S+ + + + E +E+ + L ++ KE
Sbjct: 371 QLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDLLKEEKKEEL 430
Query: 596 KVQKVVTEQGDKK 608
K+ + + E + K
Sbjct: 431 KIVEELEESLETK 443
Score = 33.0 bits (75), Expect = 0.55
Identities = 16/139 (11%), Positives = 52/139 (37%), Gaps = 10/139 (7%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
++ + ++ K ++ ++E A + + S + +R + E+ K ++ K
Sbjct: 268 LAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKKL 327
Query: 541 DDKYQKGDSDEIKRLKEKLS----------KTEEELKKEKTNSAALKNQADSVGKEYDRL 590
+ + +K + + KE + E+ K ++ + E +RL
Sbjct: 328 EKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERL 387
Query: 591 LKEHEKVQKVVTEQGDKKD 609
+ ++ + + +++
Sbjct: 388 SSAAKLKEEELELKNEEEK 406
Score = 30.7 bits (69), Expect = 2.6
Identities = 26/138 (18%), Positives = 53/138 (38%), Gaps = 15/138 (10%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKE---YDRLLKEHEKVQKVVTEQGD 538
+E +EK K +E LKK + L + + +L ++ +K + +
Sbjct: 161 EEAAGSREKRKK-KERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEK 219
Query: 539 KKD-------DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLL 591
+ Y K + + I L+E L +EE++ K + + + + +
Sbjct: 220 LELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSK----QELEKEEEILAQVLKEN 275
Query: 592 KEHEKVQKVVTEQGDKKD 609
KE EK +K+ E+
Sbjct: 276 KEEEKEKKLQEEELKLLA 293
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 41.4 bits (97), Expect = 0.001
Identities = 34/130 (26%), Positives = 61/130 (46%), Gaps = 6/130 (4%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
E+ D L E K+ LSK E + + N AL S+ D + KE +RLL E E+++
Sbjct: 32 ERLDSELRDAEKERDTYKQYLSKLESQNVEIS-NYEALDSELDELKKEEERLLDELEELE 90
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
K + D D + + ++ + E+L E N+ L++ S+ +Y+
Sbjct: 91 K----EDDDLDGELVELQEEKEQLENEELQY-LREYNLFDRNNLQLEDNLQSLELQYEYS 145
Query: 591 LKEHEKVQKV 600
L + +K++K
Sbjct: 146 LNQLDKLRKT 155
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 41.7 bits (99), Expect = 0.001
Identities = 36/147 (24%), Positives = 65/147 (44%), Gaps = 31/147 (21%)
Query: 462 ESSKRC----EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK 517
E +KR I E+ K + D +++ L L + E EL+++ + AL +A+ + +
Sbjct: 492 EIAKRLGLPENIIEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKE 551
Query: 518 EY-----------DRLLKE-HEKVQKVVTE------QGDKKDDKYQKGD---------SD 550
E D+LL+E ++ Q+ + E + K+ + QKG +
Sbjct: 552 ELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIE 611
Query: 551 EIKRLKEKLSKTEEELKKEKTNSAALK 577
KRL + K E++ KK+K LK
Sbjct: 612 ARKRLNKANEKKEKKKKKQKEKQEELK 638
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 40.8 bits (96), Expect = 0.001
Identities = 29/141 (20%), Positives = 68/141 (48%), Gaps = 7/141 (4%)
Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEELKK--EKTNSAALKSQADSVGKEYDRLLKEHE 527
E +K LE E++ L+ ++S+ E E+++ E+ A K A +E L E +
Sbjct: 40 LEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQ 99
Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
+ E+ + +D+ + +EI++L++++ +E L++ + N A + + + E
Sbjct: 100 IAK----ERINSLEDELAELM-EEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEI 154
Query: 588 DRLLKEHEKVQKVVTEQGDKK 608
+E ++ + E+ D +
Sbjct: 155 REEGQELSSKREELKEKLDPE 175
Score = 34.3 bits (79), Expect = 0.14
Identities = 27/131 (20%), Positives = 56/131 (42%), Gaps = 28/131 (21%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
E RL+ ++ + + LKK K++ +++ K + L E E ++ V
Sbjct: 17 LEKDRLEPRIKEIRKALKK-------AKAELEALNKALEALEIELEDLENQV-------- 61
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSA---------ALKNQADSVGKEYDRLLK 592
EI+ ++E++ + EE+L K K + +S+ E L++
Sbjct: 62 ----SQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELME 117
Query: 593 EHEKVQKVVTE 603
E EK++K + +
Sbjct: 118 EIEKLEKEIED 128
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 39.7 bits (93), Expect = 0.001
Identities = 23/97 (23%), Positives = 38/97 (39%), Gaps = 13/97 (13%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
E+ EK+ K EE +K K K + D + K ++ DK +
Sbjct: 65 KELAEEIEKVKKEYEEKQKWKWKKKKSKKKKD-------------KDKDKKDDKKDDKSE 111
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
K +K D+++ L + S+T L + K AL
Sbjct: 112 KKDEKEAEDKLEDLTKSYSETLSTLSELKPRKYALHK 148
Score = 35.1 bits (81), Expect = 0.050
Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 12/92 (13%)
Query: 493 KTEEELKKEKTNSAALKSQADSVGKEY---DRLLKEHEKVQKVVTEQGDKKDDKYQKGDS 549
+ E KK+K L + + V KEY + + +K +K + DK +K D
Sbjct: 56 EYTEAKKKKK----ELAEEIEKVKKEYEEKQKWKWKKKKSKK--KKDKDKDKKDDKKDDK 109
Query: 550 DEIKRLKE---KLSKTEEELKKEKTNSAALKN 578
E K KE KL + + + + LK
Sbjct: 110 SEKKDEKEAEDKLEDLTKSYSETLSTLSELKP 141
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 41.2 bits (97), Expect = 0.002
Identities = 25/130 (19%), Positives = 62/130 (47%), Gaps = 6/130 (4%)
Query: 471 EQCDKGLEGD----SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH 526
E+ + LE + L+++L + EEEL++ + L Q ++ K+ RL E
Sbjct: 683 EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV 742
Query: 527 EKVQKVVTEQGD--KKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
E++++ + + + + + + ++ +E+L++ E E+++ + LK + ++
Sbjct: 743 EQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR 802
Query: 585 KEYDRLLKEH 594
+ D L E
Sbjct: 803 EALDELRAEL 812
Score = 40.4 bits (95), Expect = 0.003
Identities = 27/132 (20%), Positives = 63/132 (47%), Gaps = 9/132 (6%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+E++ L+E+L + EEEL++ L+ + + + E L +E E++QK + ++
Sbjct: 239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIS 298
Query: 542 DKYQ---------KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLK 592
Q +++ L+ +L + E +L + A L+ + + + +E + L
Sbjct: 299 RLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEA 358
Query: 593 EHEKVQKVVTEQ 604
E E+++ + E
Sbjct: 359 ELEELEAELEEL 370
Score = 39.7 bits (93), Expect = 0.005
Identities = 20/114 (17%), Positives = 53/114 (46%), Gaps = 5/114 (4%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
L EI+ L+ ++ + +EELK + L+++ + +E L + E +++ +
Sbjct: 777 LAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAAT 836
Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
+ +D ++I+ L E + E+++ + L+++ +++ E L
Sbjct: 837 ERRLEDL-----EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASL 885
Score = 38.9 bits (91), Expect = 0.008
Identities = 23/134 (17%), Positives = 55/134 (41%), Gaps = 8/134 (5%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
+I + LE +E++ E+L +EL +E A L+ + + + +E + L E E
Sbjct: 305 QILRERLANLERQLEELEAQLEELESKLDELAEEL---AELEEKLEELKEELESLEAELE 361
Query: 528 KVQKVVTEQGDKKDDK-----YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADS 582
+++ + E + ++ + +++ L+ E L+ + +
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421
Query: 583 VGKEYDRLLKEHEK 596
+E + L+E E
Sbjct: 422 EIEELLKKLEEAEL 435
Score = 37.3 bits (87), Expect = 0.023
Identities = 25/131 (19%), Positives = 65/131 (49%), Gaps = 6/131 (4%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKE----KTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
L + +E++ E+L K EEL ++ + + A L+++ + + + +L KE +++
Sbjct: 701 AELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELE 760
Query: 531 KVVTEQGDKKDDKYQ--KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
+ E ++ ++ + EI+ L+ ++ + +EELK + L+ + + +E
Sbjct: 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820
Query: 589 RLLKEHEKVQK 599
L + E +++
Sbjct: 821 NLRERLESLER 831
Score = 35.8 bits (83), Expect = 0.077
Identities = 27/122 (22%), Positives = 62/122 (50%), Gaps = 3/122 (2%)
Query: 486 RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQ 545
L E+L++ EE+L++ K +L+++ + + E + L E++++ + K
Sbjct: 334 ELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLEL 393
Query: 546 KGDS--DEIKRLKEKLSKTEEELKKEKTN-SAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
+ S +EI+RL+ +L + E+ ++ + LK ++ KE L+E E+ + +
Sbjct: 394 QIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQ 453
Query: 603 EQ 604
E+
Sbjct: 454 EE 455
Score = 35.4 bits (82), Expect = 0.11
Identities = 22/123 (17%), Positives = 60/123 (48%), Gaps = 5/123 (4%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKV--VTEQGDKKD 541
I + KE+ +TE +L++ + N L+ + + ++ L ++ EK ++ + + + +
Sbjct: 167 ISKYKERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELE 226
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVV 601
++ L+E+L + +EELK+ + L + + ++ + L E ++++ +
Sbjct: 227 LALLVLR---LEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEI 283
Query: 602 TEQ 604
E
Sbjct: 284 EEL 286
Score = 35.0 bits (81), Expect = 0.14
Identities = 27/133 (20%), Positives = 61/133 (45%), Gaps = 8/133 (6%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
+ LE +E++ +L + EEL+KE A S+ + + L E+ + +
Sbjct: 263 QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELE 322
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
Q ++ + K + L E+L++ EE+L++ K +L+ + + + E + L
Sbjct: 323 AQLEELESK--------LDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRL 374
Query: 595 EKVQKVVTEQGDK 607
E++++ + K
Sbjct: 375 EELEEQLETLRSK 387
Score = 33.1 bits (76), Expect = 0.46
Identities = 15/120 (12%), Positives = 53/120 (44%), Gaps = 12/120 (10%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEE-------ELKKEKTNSAALKSQADSVGKEYDRLL 523
E ++ + ++ L+E++ + E E+++ + L+S+ +++ E
Sbjct: 827 ESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERA--- 883
Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
E+ ++ + ++ ++ + + L+ +L + E+L + + L+ + D++
Sbjct: 884 -SLEEALALLRSELEELSEE-LRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNL 941
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 38.9 bits (91), Expect = 0.002
Identities = 22/94 (23%), Positives = 45/94 (47%), Gaps = 4/94 (4%)
Query: 483 EIKRLKEKLSKTEEELK-KEKTNSAALKSQADSVGKEYDRLLKE-HEKVQKVVTEQGDKK 540
EIK LK +L + K +K + + Y+ +L E ++++K + E
Sbjct: 47 EIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDS 106
Query: 541 DDKYQ--KGDSDEIKRLKEKLSKTEEELKKEKTN 572
+ + K D++K+ +++L K +EL+KE+
Sbjct: 107 EGLLEELKKHRDKLKKEQKELRKKLDELEKEEKK 140
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 40.8 bits (95), Expect = 0.002
Identities = 43/156 (27%), Positives = 72/156 (46%), Gaps = 28/156 (17%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNS-----------AALKS 510
E SK + LE E+++++EK K E+ELK+ + N +S
Sbjct: 501 EVSKLASNQLKQRSDLERAHIELEKIREKHEKLEKELKRLRANPESADRGSAVDAGTSRS 560
Query: 511 QADSVG--KEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
+ADS G E DRLL EK ++ ++DD E RL ++L K + L K
Sbjct: 561 RADSAGARNEVDRLLDRLEKAEQ-------ERDDT-----EMEAGRLAKELEKAQRHLTK 608
Query: 569 EKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
++ + A + + + E LL+E E+++K E+
Sbjct: 609 QQEKTEATRIEFERKSAE---LLEEAERLEKSEAEE 641
Score = 36.9 bits (85), Expect = 0.034
Identities = 33/132 (25%), Positives = 60/132 (45%), Gaps = 7/132 (5%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQ--ADSVGKEYDRLLKEHEK 528
E+ + L+ +EIK+L E L K E+ +S L A+ + LL++ EK
Sbjct: 145 ERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSERLGRIREAEDQQSHLEVLLEQKEK 204
Query: 529 VQKVVTEQGDKK-----DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
++ E+ +K DD + I K+ + E+ L+K + L+++ D+
Sbjct: 205 EHMMLREEIHQKLQMERDDAKTEASQKLIDEKDTKIKEFEKMLEKAENEIYRLQSRCDTS 264
Query: 584 GKEYDRLLKEHE 595
+ +RL KE E
Sbjct: 265 EADRNRLDKEVE 276
Score = 35.8 bits (82), Expect = 0.087
Identities = 25/101 (24%), Positives = 47/101 (46%), Gaps = 11/101 (10%)
Query: 508 LKSQADSVGKEYDRLLKEHE----KVQKVVTE------QGDKKDDKYQKGDSDEIKRLKE 557
L++Q ++ +E + L KE E K+ + K++ +K ++ I KE
Sbjct: 1 LQAQLENAQRENELLKKELERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKE 60
Query: 558 KLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
+L T+EE + + AL+ + S E +RL ++ EK
Sbjct: 61 QLRVTQEENQHAQLTVQALQEELKSQR-ELNRLQQKLEKSG 100
Score = 35.8 bits (82), Expect = 0.087
Identities = 20/94 (21%), Positives = 41/94 (43%), Gaps = 9/94 (9%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-----QG 537
E RL ++L K + L K++ + A + + + E LL+E E+++K E Q
Sbjct: 591 EAGRLAKELEKAQRHLTKQQEKTEATRIEFERKSAE---LLEEAERLEKSEAEEETLRQS 647
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT 571
+ + + +KL ++L+ E+
Sbjct: 648 TQIGHAQAAAHNHIEHHV-QKLESDLKQLRAERE 680
Score = 32.3 bits (73), Expect = 0.81
Identities = 35/158 (22%), Positives = 66/158 (41%), Gaps = 16/158 (10%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAAL-------KSQADS 514
+ E FE K E +E++ L+ KLS+ E +L+ K + L +S +
Sbjct: 459 DERYEQEEFETYKKEFEDLKEEVQNLQLKLSERELQLELLKEEVSKLASNQLKQRSDLER 518
Query: 515 VGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSD-----EIKRLKEKLSKTEEELKKE 569
E +++ ++HEK++K + + + D R + + E+ +
Sbjct: 519 AHIELEKIREKHEKLEK----ELKRLRANPESADRGSAVDAGTSRSRADSAGARNEVDRL 574
Query: 570 KTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDK 607
+ + D E RL KE EK Q+ +T+Q +K
Sbjct: 575 LDRLEKAEQERDDTEMEAGRLAKELEKAQRHLTKQQEK 612
Score = 31.9 bits (72), Expect = 1.0
Identities = 25/107 (23%), Positives = 55/107 (51%), Gaps = 6/107 (5%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELK--KEKTNSAALKSQADSVGKEY 519
E + +E+ ++ L +I+ L+E + E LK KE+ S + D+ ++
Sbjct: 379 EIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERRLKEEKERLRSLQTDTNTDTALEKL 438
Query: 520 DRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEEL 566
++ L E E++ + + EQ D+ D++Y+ +E + K++ +EE+
Sbjct: 439 EKALAEKERIIERLKEQRDR-DERYE---QEEFETYKKEFEDLKEEV 481
Score = 30.4 bits (68), Expect = 3.2
Identities = 22/128 (17%), Positives = 59/128 (46%), Gaps = 5/128 (3%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
+ DK +E + + +K K + +EL ++KT L+++ +++ + + + +K++
Sbjct: 269 NRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQTELETLANQDSDMRQHLDKLK 328
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
+ +T +K E+ L+ +L + L K+ + A + + + E + +
Sbjct: 329 EDLTRAEQEKAIL-----QTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDM 383
Query: 591 LKEHEKVQ 598
+EK +
Sbjct: 384 RDRYEKTE 391
Score = 29.6 bits (66), Expect = 5.9
Identities = 26/130 (20%), Positives = 56/130 (43%), Gaps = 5/130 (3%)
Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
+K LE +EI RL+ + +E + + A +S ++ + DR +E + + +
Sbjct: 244 EKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTEL 303
Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
+ + DSD +++ L K +E+L + + A L+ + D++ E +R
Sbjct: 304 LGLQTELETL-ANQDSD----MRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNT 358
Query: 594 HEKVQKVVTE 603
K +
Sbjct: 359 LTKKTASLQA 368
Score = 29.2 bits (65), Expect = 7.5
Identities = 21/139 (15%), Positives = 48/139 (34%), Gaps = 1/139 (0%)
Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEEL-KKEKTNSAALKSQADSVGKEYDRLLKEHEK 528
E+ L + IK K E L K E K Q +E +
Sbjct: 19 LERKQSKLGSSMNSIKTFWSPELKRERALRKDEAARITVWKEQLRVTQEENQHAQLTVQA 78
Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
+Q+ + Q + + + S ++L K+E + + + + + +
Sbjct: 79 LQEELKSQRELNRLQQKLEKSGNETGELQQLEKSEGQFSRIQAERERQAMELFLLRDTLE 138
Query: 589 RLLKEHEKVQKVVTEQGDK 607
+L E++Q+ + + ++
Sbjct: 139 KLQLRMERLQQTLDARDEE 157
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 40.3 bits (95), Expect = 0.002
Identities = 26/141 (18%), Positives = 63/141 (44%), Gaps = 32/141 (22%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
E+ ++ E + I + + L+ E+ ++ E+ + A +++ + LL+E +
Sbjct: 176 EVLQEFLNSKEAVEEAILQTDQALTAKEKAIEAERAKAEAAEAEQE--------LLREKQ 227
Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
K ++ + E ++ YQ + +K+L EK+ E+L E+
Sbjct: 228 KEEEQMMEA---QERSYQ----EHVKQLIEKMEAEREKLLAEQ----------------- 263
Query: 588 DRLLKEHEKVQKVVTEQGDKK 608
+R+L+ + Q+ + ++G K
Sbjct: 264 ERMLEHKLQEQEELLKEGFKT 284
Score = 29.9 bits (68), Expect = 3.7
Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 22/76 (28%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG-DKK 540
+ +K+L EK+ E+L E+ +R+L+ + Q+ + ++G +
Sbjct: 243 EHVKQLIEKMEAEREKLLAEQ-----------------ERMLEHKLQEQEELLKEGFKTE 285
Query: 541 DDKYQKGDSDEIKRLK 556
+ QK EI+ LK
Sbjct: 286 AESLQK----EIQDLK 297
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 40.5 bits (96), Expect = 0.003
Identities = 21/93 (22%), Positives = 50/93 (53%), Gaps = 5/93 (5%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+E+++L+++L + EE L ++ + + + KE ++ +E EK ++ + E +++
Sbjct: 82 NELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQL 141
Query: 542 DKYQK--G-DSDEIKRLKEKLSKTEEELKKEKT 571
+ ++ G ++E K + L K EEE + E
Sbjct: 142 QELERISGLTAEEAK--EILLEKVEEEARHEAA 172
Score = 39.0 bits (92), Expect = 0.008
Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 26/111 (23%)
Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEK--VQKVVTEQGDKKDDKYQ 545
KE L + +EE+ K L+++ + +E L++ EK +QK E D+K
Sbjct: 56 KEALLEAKEEIHK-------LRNEFEKELRERRNELQKLEKRLLQK--EENLDRK----- 101
Query: 546 KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
E L K EEEL+K++ + + + +E + L++E +
Sbjct: 102 ----------LELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ 142
Score = 38.2 bits (90), Expect = 0.011
Identities = 28/129 (21%), Positives = 63/129 (48%), Gaps = 24/129 (18%)
Query: 485 KRLKEKLSKTEEE----LKKEKTNSAALKSQADSVGK-EYDRLLKEHEKVQKVVTEQGDK 539
K + K+ + EEE L++ K + A+K +A K E +L E EK +
Sbjct: 27 KIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEK---------EL 77
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE----HE 595
++ + +E+++L+++L + EE L ++ + + + KE ++ +E E
Sbjct: 78 RERR------NELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEE 131
Query: 596 KVQKVVTEQ 604
++++++ EQ
Sbjct: 132 ELEELIEEQ 140
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 38.8 bits (91), Expect = 0.004
Identities = 24/143 (16%), Positives = 56/143 (39%), Gaps = 9/143 (6%)
Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
++ L+ D K L++ + E EL K + A ++ + ++ + EK+++
Sbjct: 16 NELLDKAEDPEKMLEQAIRDMESELAKARQALAQAIARQKQLERKLEEAQARAEKLEEKA 75
Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
++ + +E + L++ E EL++ + LK Q ++ ++ L +
Sbjct: 76 ELALQAGNEDLAREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAK 135
Query: 594 HE---------KVQKVVTEQGDK 607
E K Q+ V
Sbjct: 136 KEALKARKAAAKAQEKVNRSLGG 158
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 39.8 bits (93), Expect = 0.004
Identities = 33/144 (22%), Positives = 61/144 (42%), Gaps = 18/144 (12%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
I EQ +EI L EKLS E+EL+++ + ++LLKE E
Sbjct: 497 FIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHL--------------EKLLKEQE 542
Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
K++K + ++ ++ ++ + + K +E L ++E++ K K
Sbjct: 543 KLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSI 602
Query: 588 DRLLK----EHEKVQKVVTEQGDK 607
+ L+K + + QK Q DK
Sbjct: 603 EDLVKLKETKQKIPQKPTNFQADK 626
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 39.3 bits (92), Expect = 0.005
Identities = 22/99 (22%), Positives = 48/99 (48%), Gaps = 6/99 (6%)
Query: 506 AALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEE 565
AA S A K+ ++ KE ++K + EQ D++ K +K ++K L+ +++ E +
Sbjct: 28 AAAFSAAAD-DKQLKQIQKEIAALEKKIREQQDQRA-KLEK----QLKSLETEIASLEAQ 81
Query: 566 LKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
L + + L+ Q + + L + + ++ + EQ
Sbjct: 82 LIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQ 120
Score = 35.5 bits (82), Expect = 0.088
Identities = 19/140 (13%), Positives = 48/140 (34%), Gaps = 18/140 (12%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
++K L+ +++ E +L + + L+ Q + + L + + ++ + EQ
Sbjct: 66 KQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAA-- 123
Query: 542 DKYQKG------------DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
+ G D+ RL + ALK +
Sbjct: 124 -LQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERI---DALKATLKQLAAVRAE 179
Query: 590 LLKEHEKVQKVVTEQGDKKD 609
+ E ++ +++EQ ++
Sbjct: 180 IAAEQAELTTLLSEQRAQQA 199
Score = 30.8 bits (70), Expect = 2.3
Identities = 21/99 (21%), Positives = 40/99 (40%), Gaps = 1/99 (1%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
D +K ++L+ E+ E+ L S+ + + +LL+E +K D
Sbjct: 164 DALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLA-QLNSELSAD 222
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
K + RLK +++ E K + +AA + A
Sbjct: 223 QKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAA 261
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 36.9 bits (86), Expect = 0.005
Identities = 26/105 (24%), Positives = 50/105 (47%), Gaps = 16/105 (15%)
Query: 465 KRCEIFEQCDKG-LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLL 523
KR I Q + G+ I L++ L E++ T++ +R
Sbjct: 25 KRQAIETQIEYAKAHGNQHRIAGLEKALK----EVQAHCTDAGLRA----------ERQA 70
Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
K EK Q+V + + K+ + KGD+D+I++ + KL++ + EL++
Sbjct: 71 KVEEKEQEVAEREQELKEAQ-AKGDADKIEKKQRKLAEAQAELQE 114
>gnl|CDD|227599 COG5274, CYB5, Cytochrome b involved in lipid metabolism [Energy
production and conversion / Lipid metabolism].
Length = 164
Score = 37.5 bits (87), Expect = 0.007
Identities = 25/105 (23%), Positives = 39/105 (37%), Gaps = 12/105 (11%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYD--RLLKEHEKVQKVVTEQGDKK 540
+I + T EE+ K S + GK YD + L EH + ++ + K
Sbjct: 42 DILNDESPKPITAEEVAKHN-KSEDCWIVIN--GKVYDVSQFLDEHPGGEDIIKDTAGKD 98
Query: 541 D-DKYQKGD-SDEIKRLKEK-----LSKTEEELKKEKTNSAALKN 578
+ S +I L + + + EEEL KT LK
Sbjct: 99 ATKAFNFLHHSHQIGNLLKDVYVDQVHRPEEELPLFKTAWYPLKV 143
>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
Length = 621
Score = 38.6 bits (90), Expect = 0.010
Identities = 30/108 (27%), Positives = 53/108 (49%), Gaps = 6/108 (5%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE-----HEKVQKVVT 534
DE++R+ ++ K E KEK LK+QA+S+ + ++ LKE E+ ++ +
Sbjct: 503 PKDEVERMVKEAEKNAAE-DKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIE 561
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADS 582
K Q + + IK L E+L K E+ KE +S + + A +
Sbjct: 562 NLIKKLRQALQNDNYESIKSLLEELQKALMEIGKEVYSSTSTTDPASN 609
Score = 30.5 bits (69), Expect = 2.9
Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 10/65 (15%)
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE---------HEKVQ 598
DE++R+ ++ K E KEK LKNQA+S+ + ++ LKE EK++
Sbjct: 503 PKDEVERMVKEAEKNAAE-DKEKREKIDLKNQAESLCYQAEKQLKELKDKISEEKKEKIE 561
Query: 599 KVVTE 603
++ +
Sbjct: 562 NLIKK 566
Score = 30.1 bits (68), Expect = 4.3
Identities = 31/124 (25%), Positives = 58/124 (46%), Gaps = 11/124 (8%)
Query: 491 LSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDS- 549
LS T ++ K S ++ + E +R++KE EK E+ +K D K Q +S
Sbjct: 479 LSVTAKDKGTGKEQSITIQGASTLPKDEVERMVKEAEKNAAEDKEKREKIDLKNQ-AESL 537
Query: 550 -----DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
++K LK+K+S+ ++E K + L+ + Y+ + E++QK + E
Sbjct: 538 CYQAEKQLKELKDKISEEKKE--KIENLIKKLRQALQN--DNYESIKSLLEELQKALMEI 593
Query: 605 GDKK 608
G +
Sbjct: 594 GKEV 597
>gnl|CDD|191748 pfam07464, ApoLp-III, Apolipophorin-III precursor (apoLp-III).
This family consists of several insect
apolipoprotein-III sequences. Exchangeable
apolipoproteins constitute a functionally important
family of proteins that play critical roles in lipid
transport and lipoprotein metabolism. Apolipophorin III
(apoLp-III) is a prototypical exchangeable
apolipoprotein found in many insect species that
functions in transport of diacylglycerol (DAG) from the
fat body lipid storage depot to flight muscles in the
adult life stage.
Length = 155
Score = 36.8 bits (85), Expect = 0.010
Identities = 18/87 (20%), Positives = 41/87 (47%), Gaps = 5/87 (5%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQ-ADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
+++L++ + +T EEL+K + D + ++E +K+ K V + ++
Sbjct: 65 LEQLRQNIEETAEELRKAHPEVEQQAQELRDKLQAAVQSTVQESQKLAKEVASNVEGTNE 124
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKE 569
K +IK+ + K EE++K+
Sbjct: 125 KLA----PKIKQAYDDFVKNAEEVQKK 147
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 38.6 bits (90), Expect = 0.010
Identities = 32/138 (23%), Positives = 60/138 (43%), Gaps = 18/138 (13%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE----KVQKVVTEQ 536
EIK L K+ + EEL + + L+ + + + LL + E K+Q +V
Sbjct: 290 LSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVE-LLPDAENNVAKLQALVVAS 348
Query: 537 GDKKD------DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV------- 583
++ + ++ DE + LKEK E+E +++ L+N+ + +
Sbjct: 349 SERLLELAQQWEAHRTPLIDEYRSLKEKNRNKEDETQRQLDEIKKLRNKIEELESELQTK 408
Query: 584 GKEYDRLLKEHEKVQKVV 601
+ Y +LL E+E K V
Sbjct: 409 EQLYKQLLDEYENAPKSV 426
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 38.4 bits (89), Expect = 0.012
Identities = 32/148 (21%), Positives = 70/148 (47%), Gaps = 15/148 (10%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
L+ + + +E E+ + T+ A LK+Q D++ ++ +K +K++ + +
Sbjct: 245 LKLEDNYEPSEQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKW 304
Query: 537 GDKKDD--KYQ------KGDSDE----IKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
K D KY+ K S E +++LK ++ EEE+K ++N L Q G
Sbjct: 305 RALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQG 364
Query: 585 ---KEYDRLLKEHEKVQKVVTEQGDKKD 609
++++ + +E EK+ + + + + D
Sbjct: 365 ISTEQFELMNQEREKLTRELDKINIQSD 392
Score = 33.4 bits (76), Expect = 0.43
Identities = 25/129 (19%), Positives = 54/129 (41%), Gaps = 9/129 (6%)
Query: 470 FEQCDKGLEGDSDE----IKRLKEKLSKTEEELKKEKTNSAALKSQADSVG---KEYDRL 522
+E ++ S E +++LK ++ EEE+K ++N L Q G ++++ +
Sbjct: 314 YENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELM 373
Query: 523 LKEHEKVQKVVTEQGDKKDD--KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
+E EK+ + + + + D K K E + + + L KT + N ++Q
Sbjct: 374 NQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQI 433
Query: 581 DSVGKEYDR 589
+
Sbjct: 434 GHNVNDSSL 442
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 37.4 bits (87), Expect = 0.014
Identities = 31/147 (21%), Positives = 70/147 (47%), Gaps = 12/147 (8%)
Query: 464 SKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLL 523
+R EI E+ + E + +L+E + E +K + +S S+ +E +RL
Sbjct: 62 EERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSI-KSLEREIERLE 120
Query: 524 -----------KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
+E E VQK+ + + +D K +++++K LK ++ + +++ ++
Sbjct: 121 KKQQTSVLTPEEERELVQKIKELRKELEDAKKALEENEKLKELKAEIDELKKKAREIHEK 180
Query: 573 SAALKNQADSVGKEYDRLLKEHEKVQK 599
L N+A +E +L +E ++++K
Sbjct: 181 IQELANEAQEYHEEMIKLFEEADELRK 207
Score = 37.0 bits (86), Expect = 0.020
Identities = 24/132 (18%), Positives = 59/132 (44%), Gaps = 17/132 (12%)
Query: 483 EIKRLKEKLSKTEEELK-KEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+IK L+++L ++ L+ EK LK++ D + K+ + HEK+Q++ E + +
Sbjct: 139 KIKELRKELEDAKKALEENEKL--KELKAEIDELKKKAREI---HEKIQELANEAQEYHE 193
Query: 542 D---KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE---YDRLLKEHE 595
+ +++ D L+++ + EE + L + ++ E ++ +K
Sbjct: 194 EMIKLFEEAD-----ELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALR 248
Query: 596 KVQKVVTEQGDK 607
+K + +
Sbjct: 249 AKEKAAKRREKR 260
Score = 32.3 bits (74), Expect = 0.56
Identities = 27/123 (21%), Positives = 52/123 (42%), Gaps = 22/123 (17%)
Query: 462 ESSKRCEIFEQCD---KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKE 518
E+ K E+ + D K ++I+ L + + EE+ K L +AD + KE
Sbjct: 156 ENEKLKELKAEIDELKKKAREIHEKIQELANEAQEYHEEMIK-------LFEEADELRKE 208
Query: 519 YDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
D L +E ++ K + E +E + L+ +L + E+++K + A K
Sbjct: 209 ADELHEEFVELSKKIDEL------------HEEFRNLQNELRELEKKIKALRAKEKAAKR 256
Query: 579 QAD 581
+
Sbjct: 257 REK 259
Score = 31.2 bits (71), Expect = 1.4
Identities = 28/142 (19%), Positives = 61/142 (42%), Gaps = 19/142 (13%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAA-----------LKSQADSVGKEYDRLLKE 525
LE ++K E+L + +EL+KE + A L+ +A + +E D + +E
Sbjct: 11 LELKRKQLKEEIEELKEKRDELRKEASELAEKRDELNAKVRELREKAQELREERDEINEE 70
Query: 526 HEKVQKVVTEQGDKKDDKYQK--------GDSDEIKRLKEKLSKTEEELKKEKTNSAALK 577
+++++ E K + ++ + + R + L + E L+K++ S
Sbjct: 71 VQELKEKRDEINAKLQELRKEYRELKEKRNEFNLGGRSIKSLEREIERLEKKQQTSVLTP 130
Query: 578 NQADSVGKEYDRLLKEHEKVQK 599
+ + ++ L KE E +K
Sbjct: 131 EEERELVQKIKELRKELEDAKK 152
>gnl|CDD|215618 PLN03184, PLN03184, chloroplast Hsp70; Provisional.
Length = 673
Score = 37.9 bits (88), Expect = 0.015
Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 13/106 (12%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
DE++R+ ++ K +E KEK ++ K+QADSV + ++ LKE KV + +K
Sbjct: 540 PKDEVERMVQEAEKFAKE-DKEKRDAVDTKNQADSVVYQTEKQLKELG--DKVPADVKEK 596
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
+ K +K LK+ ++ +K K AAL + +G+
Sbjct: 597 VEAK--------LKELKDAIAS--GSTQKMKDAMAALNQEVMQIGQ 632
Score = 34.8 bits (80), Expect = 0.13
Identities = 23/62 (37%), Positives = 37/62 (59%), Gaps = 2/62 (3%)
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE-HEKVQKVVTEQGD 606
DE++R+ ++ K +E KEK ++ KNQADSV + ++ LKE +KV V E+ +
Sbjct: 540 PKDEVERMVQEAEKFAKE-DKEKRDAVDTKNQADSVVYQTEKQLKELGDKVPADVKEKVE 598
Query: 607 KK 608
K
Sbjct: 599 AK 600
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 35.7 bits (83), Expect = 0.015
Identities = 27/118 (22%), Positives = 46/118 (38%), Gaps = 27/118 (22%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
E+KRL EK EE+++K + + A+ ++Y+R L +H
Sbjct: 10 AELKRLLEKKEDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKH--------------- 54
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
E+L ++L + K A LK +A+S E +E + QK
Sbjct: 55 -----------AEDIEELQALRKQLNELKKEIAQLKAEAESAQAELSE-AEESWEEQK 100
>gnl|CDD|191382 pfam05837, CENP-H, Centromere protein H (CENP-H). This family
consists of several eukaryotic centromere protein H
(CENP-H) sequences. Macromolecular
centromere-kinetochore complex plays a critical role in
sister chromatid separation, but its complete protein
composition as well as its precise dynamic function
during mitosis has not yet been clearly determined.
CENP-H contains a coiled-coil structure and a nuclear
localisation signal. CENP-H is specifically and
constitutively localised in kinetochores throughout the
cell cycle. CENP-H may play a role in kinetochore
organisation and function throughout the cell cycle.
This the C-terminus of the region, which is conserved
from fungi to humans.
Length = 106
Score = 35.4 bits (82), Expect = 0.016
Identities = 24/90 (26%), Positives = 34/90 (37%), Gaps = 20/90 (22%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
S EI L++ +T EEL + LK K E +++ K+
Sbjct: 2 SLEILNLQQASRETREELSDLEKERLQLK-------------QKNVELALELLELTKKKE 48
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
E LKE+L K E +LKK K
Sbjct: 49 SW-------REDMELKEQLEKLEADLKKSK 71
>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 36.3 bits (84), Expect = 0.020
Identities = 21/121 (17%), Positives = 29/121 (23%), Gaps = 14/121 (11%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
D L K E K+ + E L +E E + +D
Sbjct: 66 DAAADQLYALQKELLEKLKDLAEVNERLKALEKKKAELKDLEEEREGIL----RSKQAED 121
Query: 542 DK----YQKGDSDEIKRLKEKLSKTEEELK-----KEKTNSAALKNQADSVGKEYDRLLK 592
K Y K D + E L EE K + L L+
Sbjct: 122 GKLVKIYSKMKPDAAAAILENL-PDEEAAAILMKLKPRKLGLILAKMDPEKAATLTELIA 180
Query: 593 E 593
Sbjct: 181 S 181
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 35.7 bits (83), Expect = 0.033
Identities = 25/83 (30%), Positives = 49/83 (59%), Gaps = 7/83 (8%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADS--VGKEY----DRLLKEHEKVQKVVT 534
S + +LK +L K ++EL++ K A L++Q + G+E LL+E ++++K +
Sbjct: 61 SQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELK 120
Query: 535 EQGDKKDDKYQKGDSDEIKRLKE 557
+ + +KY+K D + I++LKE
Sbjct: 121 KL-KAELEKYEKNDPERIEKLKE 142
Score = 32.2 bits (74), Expect = 0.49
Identities = 15/71 (21%), Positives = 34/71 (47%), Gaps = 7/71 (9%)
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT------NSAALKNQADSVGKEYD 588
+ +K + +K E++ LK+++++ + +++K K L + + KE
Sbjct: 62 QALNKLKTRLEKLKK-ELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELK 120
Query: 589 RLLKEHEKVQK 599
+L E EK +K
Sbjct: 121 KLKAELEKYEK 131
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 36.9 bits (86), Expect = 0.035
Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 11/89 (12%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
DE+ R +L E LKKEK A K + + + KE L E E+ + EQ
Sbjct: 408 DELDRRIIQLEIEREALKKEK--DEASKERLEDLEKE----LAELEEEYADLEEQWKA-- 459
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
+K I+++KE++ + EL++ +
Sbjct: 460 ---EKAAIQGIQQIKEEIEQVRLELEQAE 485
>gnl|CDD|215591 PLN03124, PLN03124, poly [ADP-ribose] polymerase; Provisional.
Length = 643
Score = 36.7 bits (85), Expect = 0.036
Identities = 38/138 (27%), Positives = 56/138 (40%), Gaps = 27/138 (19%)
Query: 472 QCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYD---RLLKEHEK 528
Q D E S + + E T EL++ AA + + G++ D RL E
Sbjct: 61 QDDGDDEPVSPKRIAIDEVKGMTVRELRE-----AASERGLATTGRKKDLLERLCAALES 115
Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
KV + G +D+K + GD EE K+EK +A K +A D
Sbjct: 116 DVKVGSANGTGEDEKEKGGD--------------EEREKEEKIVTATKKGRA-----VLD 156
Query: 589 RLLKEHEKVQKVVTEQGD 606
+ L +H K V E+GD
Sbjct: 157 QWLPDHIKSNYHVLEEGD 174
>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
ribosomal structure and biogenesis].
Length = 363
Score = 36.4 bits (85), Expect = 0.037
Identities = 26/96 (27%), Positives = 45/96 (46%), Gaps = 22/96 (22%)
Query: 489 EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-------QGDKKD 541
E L + EEL+ ++ + EY +L KE+ +++ +V + Q D +D
Sbjct: 10 ESLLERYEELEALLSDPEVISDP-----DEYRKLSKEYAELEPIVEKYREYKKAQEDLED 64
Query: 542 DK---YQKGDSD-------EIKRLKEKLSKTEEELK 567
K ++ D + EIK L+ K+ + EEELK
Sbjct: 65 AKEMLAEEKDPEMREMAEEEIKELEAKIEELEEELK 100
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 35.6 bits (83), Expect = 0.038
Identities = 31/127 (24%), Positives = 58/127 (45%), Gaps = 34/127 (26%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAA-----LKSQADSVGKEYDRLLKEHEK--VQKVVTEQG 537
K++ E+ K E LKKE A L+++A+ KE L+ EK +QK E
Sbjct: 37 KQIIEEAKKEAEALKKEALLEAKEEIHKLRAEAERELKERRNELQRQEKRLLQK--EETL 94
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
D+KD+ +++ +E L + E+EL + +L ++ E++
Sbjct: 95 DRKDES--------LEKKEESLEEKEKELAARQ-----------------QQLEEKEEEL 129
Query: 598 QKVVTEQ 604
++++ EQ
Sbjct: 130 EELIEEQ 136
>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein. YkyA is a
family of proteins containing a lipoprotein signal and a
hydrolase domain. It is similar to cell wall binding
proteins and might also be recognisable by a host immune
defence system. It is thus likely to belong to pathways
important for pathogenicity.
Length = 205
Score = 35.8 bits (83), Expect = 0.039
Identities = 28/144 (19%), Positives = 57/144 (39%), Gaps = 28/144 (19%)
Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK 524
KR ++ ++ + +E +E K K+ + K E++ K+K K+ +++K
Sbjct: 80 KREKLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKK-------------AKQLVKVMK 126
Query: 525 EHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
E K DK Y+K + KE LK + L + +V
Sbjct: 127 ERYKSY-------DKLYKAYKKA----LNLEKELYEY----LKDKDLTLKELDEKIKAVN 171
Query: 585 KEYDRLLKEHEKVQKVVTEQGDKK 608
+ Y+++ K +K + + +K
Sbjct: 172 QSYEKIQKAKKKFNEYTKKYNKEK 195
Score = 30.4 bits (69), Expect = 1.9
Identities = 27/129 (20%), Positives = 54/129 (41%), Gaps = 6/129 (4%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
+ + E+L K E+ K + L+ + YD++++ + + + + D+
Sbjct: 14 GKKPEESIYEQLEKAAEQEKPFEEEQKKLEELEKKEQELYDQIIQLGKDDNEAIKKLSDQ 73
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKK-----EKTNSAALKNQADSVGKEYDRLLKEH 594
K + +K+ KE + K+EEE K EK LK +A + K K +
Sbjct: 74 ALANVDKRE-KLLKKEKESIEKSEEEFKSAKKYIEKIEDKKLKKKAKQLVKVMKERYKSY 132
Query: 595 EKVQKVVTE 603
+K+ K +
Sbjct: 133 DKLYKAYKK 141
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 36.4 bits (85), Expect = 0.047
Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 21/89 (23%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
E K+L++K K E++ +KE+ AA K +A + K K + K
Sbjct: 407 SPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA-----------EAAAKKAKGPDGETKK 455
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
D L EKL++TE+ L++
Sbjct: 456 VDP----------DPLGEKLARTEDPLEE 474
Score = 31.1 bits (71), Expect = 2.0
Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 11/62 (17%)
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDK 607
E K+L++K K E++ +KE+ AA K +A + K K + K
Sbjct: 407 SPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKA-----------EAAAKKAKGPDGETKK 455
Query: 608 KD 609
D
Sbjct: 456 VD 457
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 36.4 bits (84), Expect = 0.047
Identities = 32/139 (23%), Positives = 66/139 (47%), Gaps = 13/139 (9%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
S + + E+ D+ E + I R+K LS+ + + T L+S+ + + ++ +
Sbjct: 146 AESLKLQN-EKEDQLKEA-KESISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEE 203
Query: 522 LLKE----HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALK 577
KE +K+Q + +EQ D+ S +IK L+E+L + E++ + K+ L
Sbjct: 204 CQKELAEAEKKLQSLTSEQASSADN------SVKIKHLEEELKRYEQDAEVVKSMKEQLL 257
Query: 578 NQADSVGKEYDRLLKEHEK 596
+ + +E L +E+ K
Sbjct: 258 QIPE-LERELAALREENRK 275
Score = 32.6 bits (74), Expect = 0.79
Identities = 34/138 (24%), Positives = 59/138 (42%), Gaps = 21/138 (15%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV---TEQGDK 539
EIK LKE+L E L K + AL+ + +++ +E DRL +E ++ + QGD
Sbjct: 483 EIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLEHLCLQGDY 542
Query: 540 KDD-------------KYQKGDSDEIKRLKEKLSKTEEELKK---EKTNSAALKNQADS- 582
+ ++ I+ L+ + K +E L+ K+ L+ S
Sbjct: 543 NASRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQALEEGKSQPGDLEKAVGSH 602
Query: 583 -VGKEYDRLLKEHEKVQK 599
KE +L K+ E +K
Sbjct: 603 ISSKEIAQLKKQVESAEK 620
Score = 31.1 bits (70), Expect = 2.0
Identities = 25/100 (25%), Positives = 49/100 (49%), Gaps = 8/100 (8%)
Query: 483 EIKRLKEKLSKTEEELKK----EKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
E+ R K KL + E EL + K L+ +A ++ + Y+R L + +++ V + +
Sbjct: 62 ELIRSKSKLIQLENELMQKELEHKRAQIELERKASTLAENYERELDRNLELE-VRLKALE 120
Query: 539 KKDDKYQKGDS---DEIKRLKEKLSKTEEELKKEKTNSAA 575
+ + K + + +E K LK+KL +L+ EK +
Sbjct: 121 ELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLK 160
Score = 30.3 bits (68), Expect = 3.9
Identities = 22/109 (20%), Positives = 48/109 (44%), Gaps = 10/109 (9%)
Query: 460 WCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEY 519
E+ K+ + +S +IK L+E+L + E++ + K+ K Q + +
Sbjct: 208 LAEAEKKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSM----KEQLLQIPELE 263
Query: 520 DRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
L E+ +K+ + + D + K + D L+ +L + E+ +K
Sbjct: 264 RELAALREENRKLRSMKEDNELLKEELED------LQSRLERFEKMREK 306
>gnl|CDD|200533 cd11272, Sema_plexin_A2, The Sema domain, a protein interacting
module, of Plexin A2. Plexin A2 serves as a receptor
for class 6 semaphorins. Interactions between Plexin A2,
A4 and semaphorins 6A and 6B control the
lamina-restricted projection of hippocampal mossy
fibers. Sema6B also repels the growth of mossy fibers in
a Plexin A4 dependent manner. Plexin A2 does not
suppress Sema6B function. In addition, studies have
shown that Plexin A2 may be related to anxiety and other
psychiatric disorders. The Sema domain is located at the
N-terminus and contains four disulfide bonds formed by
eight conserved cysteine residues. It serves as a
ligand-recognition and -binding module.
Length = 515
Score = 36.1 bits (83), Expect = 0.053
Identities = 11/28 (39%), Positives = 17/28 (60%), Gaps = 1/28 (3%)
Query: 279 CNTYSSCTQCVSSDFP-CDWCVDGHRCT 305
C Y++C +C+SS P C WC + C+
Sbjct: 474 CEQYTTCGECLSSGDPHCGWCALHNMCS 501
>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
division and chromosome partitioning].
Length = 570
Score = 36.2 bits (84), Expect = 0.056
Identities = 28/126 (22%), Positives = 54/126 (42%), Gaps = 16/126 (12%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
LE + + ++E L + L+K K N+ LK + + V + Y L E E
Sbjct: 297 LEREVEAKNVVEENLPILPDYLEKAKENNEHLKEEIERVKESY--RLAETELG------- 347
Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
+ E+K L+ L + E ++ ++ + L++ + + K + E EK
Sbjct: 348 -------SVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEEIEKALTDIEDEQEK 400
Query: 597 VQKVVT 602
VQ+ +T
Sbjct: 401 VQEHLT 406
Score = 34.3 bits (79), Expect = 0.21
Identities = 29/123 (23%), Positives = 51/123 (41%), Gaps = 15/123 (12%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQ--ADSVGKEYDRLLKEHE-KVQKVVTEQGDK 539
++RLKE+L + E L + + + A + + + YD L E E + + VV E
Sbjct: 256 RLERLKEQLVENSELLTQLELDEAEEELGLIQEKIESLYDLL--EREVEAKNVVEENLPI 313
Query: 540 KDDKYQKGDS------DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
D +K +EI+R+KE E EL + + + + D +L+
Sbjct: 314 LPDYLEKAKENNEHLKEEIERVKESYRLAETELGSVR----KFEKELKELESVLDEILEN 369
Query: 594 HEK 596
E
Sbjct: 370 IEA 372
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 35.9 bits (83), Expect = 0.056
Identities = 21/116 (18%), Positives = 54/116 (46%), Gaps = 6/116 (5%)
Query: 483 EIKRLKEKLSKTEEELKKE-----KTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
E+ +L+E+L ++E K K AL+ + + + + L E +K +++ ++
Sbjct: 65 ELSQLEEQLINQKKEQKNLFNEQIKQFELALQDEIAKL-EALELLNLEKDKELELLEKEL 123
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
D+ + QK + + +++K + E + + N L+ + ++++ L E
Sbjct: 124 DELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHE 179
>gnl|CDD|224234 COG1315, COG1315, Uncharacterized conserved protein [Function
unknown].
Length = 543
Score = 35.9 bits (83), Expect = 0.057
Identities = 33/140 (23%), Positives = 51/140 (36%), Gaps = 31/140 (22%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQAD-------SVGKEYDRLLKEHEKVQKVVT 534
+ +K L E++S EE LKK AL KE + K+
Sbjct: 410 ERLKELTEEISLHEERLKKLTKLLVALVKVKIESKKNILPPDKESLLTAVNNTKITL--- 466
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT-NSAALKNQADS-----VGKEYD 588
+ I+++K +L +EEL+ S ++KNQ S G Y
Sbjct: 467 --------------RNSIEKIKAELEGLQEELEVVGIEGSVSVKNQVYSGVKIYFGDAYQ 512
Query: 589 RLLKEHEKVQKVVTEQGDKK 608
+KEH V + G+ K
Sbjct: 513 ETIKEHGNV-EFYLRDGEIK 531
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 35.8 bits (83), Expect = 0.062
Identities = 30/155 (19%), Positives = 56/155 (36%), Gaps = 27/155 (17%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEY--------------------D 520
+DE+ L + L K T +A +KS+ + K D
Sbjct: 240 TDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPD 299
Query: 521 RLLKEHEKVQKVVTEQGDKKDDKYQKGDS--DEIKRLKEKLSKTEEELKKEKTNSAALKN 578
R+ K +K+ K + +K D + + DE +KL + + ++ K + L +
Sbjct: 300 RITKIKDKL-KELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVD 358
Query: 579 QADSVGKEYDRLLKEH----EKVQKVVTEQGDKKD 609
+A V + L E E++ K+ E
Sbjct: 359 KAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVK 393
Score = 31.1 bits (71), Expect = 2.0
Identities = 27/112 (24%), Positives = 40/112 (35%), Gaps = 32/112 (28%)
Query: 472 QCDKGLEGDSDEIKRLKEK------LSKTEEELK----KEKTNSAALKSQADSVGKEYDR 521
+ LE I L+E SK ELK K + L +A V +
Sbjct: 310 ELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEE 369
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRL---KEKLSKTEEELKKEK 570
L Q + D+ +E+ +L +K+ KT+ EL KEK
Sbjct: 370 L-------------QAEFVDNA------EELAKLQDELDKIVKTKSELVKEK 402
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 36.1 bits (84), Expect = 0.063
Identities = 27/120 (22%), Positives = 53/120 (44%), Gaps = 6/120 (5%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
KR+ E+ K E LKKE A K + + E +R LKE + + + ++++
Sbjct: 35 KRIIEEAKKEAETLKKEALLEA--KEEVHKLRAELERELKERRNELQRLERRLLQREETL 92
Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
+ K+ E L K E+EL ++ N + + + + +E E++ + E+
Sbjct: 93 DRKMESLDKKE-ENLEKKEKELSNKEKNLDEKEEELE---ELIAEQREELERISGLTQEE 148
Score = 30.7 bits (70), Expect = 2.9
Identities = 29/126 (23%), Positives = 58/126 (46%), Gaps = 21/126 (16%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
++ +E + EE KKE A+++ KE LL+ E+V K+ E +
Sbjct: 26 KLGSAEELAKRIIEEAKKE----------AETLKKE--ALLEAKEEVHKLRAEL-----E 68
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE----HEKVQ 598
+ K +E++RL+ +L + EE L ++ + + + KE K E+++
Sbjct: 69 RELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEELE 128
Query: 599 KVVTEQ 604
+++ EQ
Sbjct: 129 ELIAEQ 134
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 35.8 bits (84), Expect = 0.063
Identities = 22/93 (23%), Positives = 46/93 (49%), Gaps = 16/93 (17%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
+G D DE+ L +EE ++ +T L+++ +++ KE + ++ E + ++
Sbjct: 21 RGFPLDVDELLEL-------DEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIA 73
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELK 567
E + K+ EIK L+ +L + E EL+
Sbjct: 74 EVKELKE---------EIKALEAELDELEAELE 97
Score = 28.9 bits (66), Expect = 9.3
Identities = 10/55 (18%), Positives = 25/55 (45%), Gaps = 4/55 (7%)
Query: 550 DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
+E++ + LSK + K++ ++ AL + + +E +K E + +
Sbjct: 45 EELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEE----IKALEAELDELEAE 95
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 35.8 bits (83), Expect = 0.066
Identities = 19/93 (20%), Positives = 42/93 (45%), Gaps = 2/93 (2%)
Query: 518 EYDR-LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
E +R L K K++K+ D S E L+E++ E E+K+ + N +L
Sbjct: 35 EVERKLRKLESKIKKLGIPLKDTGGKP-DVPPSKEFLDLEEEILDLEAEIKEVEENLESL 93
Query: 577 KNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
+ + + + + + L +E + + + E + +
Sbjct: 94 EKEINELEEWLNVLDEEKSFLDENLEELSELSN 126
Score = 32.4 bits (74), Expect = 0.84
Identities = 15/78 (19%), Positives = 36/78 (46%), Gaps = 2/78 (2%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
S E L+E++ E E+K+ + N +L+ + + + + + L +E + + + E +
Sbjct: 65 PSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSEL 124
Query: 540 K--DDKYQKGDSDEIKRL 555
D ++ E +L
Sbjct: 125 SNLDIDFKYLRGAEGLKL 142
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 35.3 bits (81), Expect = 0.076
Identities = 32/121 (26%), Positives = 53/121 (43%), Gaps = 31/121 (25%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
++K E+L + EEL+KEK LLKE E+++
Sbjct: 132 DLKEDYEELKEKLEELQKEKE-----------------ELLKELEELEA----------- 163
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
+Y++ + +KRL+ + S+ EE LKK LK + D + + L E E + +V
Sbjct: 164 EYEEVQ-ERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVE--LPEEELISDLVK 220
Query: 603 E 603
E
Sbjct: 221 E 221
Score = 29.5 bits (66), Expect = 4.7
Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 2/65 (3%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
E+ + E + +KRL+ + S+ EE LKK LK + D + + L E E +
Sbjct: 159 EELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPGVE--LPEEELIS 216
Query: 531 KVVTE 535
+V E
Sbjct: 217 DLVKE 221
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 35.8 bits (82), Expect = 0.077
Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 18/131 (13%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSA----ALKSQADSVGKEYDRLL-----KEHEKVQKV 532
D+IK L +K+ + L++ K S +K+Q + + D+ + +E EK +
Sbjct: 1118 DDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPEEIEKKIEN 1177
Query: 533 VTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLK 592
+ + DKK + Y DEIK+L ++++ E K+KT+ +K S GK +L
Sbjct: 1178 IVTKIDKKKNIY-----DEIKKLLNEIAEIE----KDKTSLEEVKGINLSYGKNLGKLFL 1228
Query: 593 EHEKVQKVVTE 603
E +K +E
Sbjct: 1229 EKIDEEKKKSE 1239
Score = 35.0 bits (80), Expect = 0.14
Identities = 38/155 (24%), Positives = 73/155 (47%), Gaps = 27/155 (17%)
Query: 468 EIFEQCDK--GLEGDSDEIKRLKEKLSKTEEELKKE------KTNSAALKSQADSVGKEY 519
E+ E DK G + ++D+ + EK + E+ KK+ K ++ A+K++ K+
Sbjct: 1487 ELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDS 1546
Query: 520 DRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
+ ++KE + K + +K + K + EIK KEK ++ K +K+N AA+ Q
Sbjct: 1547 EIIIKEIKDAHKKFILEAEKSEQKIK-----EIK--KEKFRIEDDAAKNDKSNKAAIDIQ 1599
Query: 580 AD------------SVGKEYDRLLKEHEKVQKVVT 602
+ K+ + LKE E ++K ++
Sbjct: 1600 LSLENFENKFLKISDIKKKINDCLKETESIEKKIS 1634
Score = 33.1 bits (75), Expect = 0.49
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 10/117 (8%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
DEIK+L ++++ E K+KT+ +K S GK +L E +K +E K
Sbjct: 1190 DEIKKLLNEIAEIE----KDKTSLEEVKGINLSYGKNLGKLFLEKIDEEKKKSEHMIKAM 1245
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALK--NQADSVGKEYDRLLKEHEK 596
+ Y + D DEI KEK + E E+ E A ++ N + K++ + K+H++
Sbjct: 1246 EAYIE-DLDEI---KEKSPEIENEMGIEMDIKAEMETFNISHDDDKDHHIISKKHDE 1298
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 34.7 bits (80), Expect = 0.091
Identities = 17/85 (20%), Positives = 44/85 (51%), Gaps = 9/85 (10%)
Query: 499 KKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEK 558
++ S K + + + E + L + E++Q + E K+ D+ + EIK+L+++
Sbjct: 35 NQDSKLSELQKEKKN-IQNEIESLDNQIEEIQSKIDEL-QKEIDQSKA----EIKKLQKE 88
Query: 559 LSKTEEELKKEKTNSAALKNQADSV 583
+++ +E + + + LK +A ++
Sbjct: 89 IAELKENIVERQE---LLKKRARAM 110
Score = 30.5 bits (69), Expect = 2.1
Identities = 17/86 (19%), Positives = 37/86 (43%), Gaps = 3/86 (3%)
Query: 498 LKKEKTNSAALKSQADSVGKEYDRL---LKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKR 554
L+++K + +L+ + ++ + + L E E + Q +K+ + E
Sbjct: 150 LEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKEASA 209
Query: 555 LKEKLSKTEEELKKEKTNSAALKNQA 580
L EK + E++ E + A K +A
Sbjct: 210 LGEKAALEEQKALAEAAAAEAAKQEA 235
>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
protein turnover, chaperones].
Length = 579
Score = 35.4 bits (82), Expect = 0.095
Identities = 28/103 (27%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 475 KGLEGDSD-EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
K G SD EI+R+ E KK + A +++A+S+ ++ LKE KV +
Sbjct: 478 KASSGLSDEEIERMVEDAEANAALDKKFRELVEA-RNEAESLIYSLEKALKEIVKVSEEE 536
Query: 534 TEQ-GDKKDD--KYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
E+ + D + +G+ +EIK E+L + ++L ++K
Sbjct: 537 KEKIEEAITDLEEALEGEKEEIKAKIEELQEVTQKLAEKKYQQ 579
>gnl|CDD|218701 pfam05698, Trigger_C, Bacterial trigger factor protein (TF)
C-terminus. In the E. coli cytosol, a fraction of the
newly synthesised proteins requires the activity of
molecular chaperones for folding to the native state.
The major chaperones implicated in this folding process
are the ribosome-associated Trigger Factor (TF), and the
DnaK and GroEL chaperones with their respective
co-chaperones. Trigger Factor is an ATP-independent
chaperone and displays chaperone and
peptidyl-prolyl-cis-trans-isomerase (PPIase) activities
in vitro. It is composed of at least three domains, an
N-terminal domain which mediates association with the
large ribosomal subunit, a central substrate binding and
PPIase domain with homology to FKBP proteins, and a
C-terminal domain of unknown function. The positioning
of TF at the peptide exit channel, together with its
ability to interact with nascent chains as short as 57
residues renders TF a prime candidate for being the
first chaperone that binds to the nascent polypeptide
chains. This family represents the C-terminal region of
the protein.
Length = 162
Score = 33.8 bits (78), Expect = 0.097
Identities = 39/156 (25%), Positives = 65/156 (41%), Gaps = 38/156 (24%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSA-------ALKSQAD------SVGKEYDRLLKE----- 525
++ LKEKL K EE KKE T L A+ V +E DRLL++
Sbjct: 2 LEELKEKLRKQLEEEKKEATAEELKEAILDKLVENAEIELPESLVEEEIDRLLRQFLQQL 61
Query: 526 ---HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLK-----------EKLSKTEEELKKEKT 571
+++ ++ G +++ ++ + KR+K EK+ T+EE+K+E
Sbjct: 62 QGQGLDLEEYLSLSGSSEEELREEFKEEAEKRVKLGLILEEIAKEEKIEVTDEEIKEEIE 121
Query: 572 NSA-ALKNQADSVGKEYD-----RLLKEHEKVQKVV 601
A + + V + Y LKE +K V
Sbjct: 122 ELASQYGMEPEEVKEFYKKNEQLSALKEDLLEEKAV 157
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 35.3 bits (82), Expect = 0.11
Identities = 30/110 (27%), Positives = 51/110 (46%), Gaps = 13/110 (11%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
+ K +I GL DEI+R+ + + E KK K A K++A+ Y
Sbjct: 486 GTGKEQKITITASSGL--SDDEIERMVKDAEEYAAEDKKRKERIEA-KNEAEEY--VYS- 539
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT 571
++K + E+GDK + +K + I+ LKE+L ++E + KT
Sbjct: 540 -------LEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGEDKEEIEAKT 582
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 34.1 bits (78), Expect = 0.13
Identities = 26/118 (22%), Positives = 52/118 (44%), Gaps = 10/118 (8%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEK-VQKVVTEQGDKK 540
EI+ ++L K ++E+K E+ + + KE + LL+E + V+++ D
Sbjct: 4 KEIRNKTDELEKLKDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQIHADIN 63
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY---DRLLKEHE 595
D + EIK+ K +L + ++ + LK + + K + L + HE
Sbjct: 64 DME------TEIKQSKSELERRMGKIARMHGEYNPLKESINEMRKLELGLEELPQLHE 115
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 33.4 bits (77), Expect = 0.14
Identities = 31/100 (31%), Positives = 53/100 (53%), Gaps = 10/100 (10%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSA--ALKSQADSVGKEYD--RLLKEHEKVQKVVTEQG 537
EI+ L++ L KT++ +KE+ ++KS+ ++ + +LKEH+K +K + ++G
Sbjct: 63 KEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEG 122
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALK 577
K Y K S+ K L K +ELKK K AL+
Sbjct: 123 KKP--YYLK-KSE---IKKLVLKKKFDELKKSKQLDKALE 156
>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20. This
family consists of several phage minor structural
protein GP20 sequences of around 180 residues in length.
The function of this family is unknown.
Length = 156
Score = 33.1 bits (76), Expect = 0.16
Identities = 27/109 (24%), Positives = 42/109 (38%), Gaps = 30/109 (27%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
EQ DK + +I+ K KL K EE +LK Q LK +K
Sbjct: 2 EQIDKIMAEYGKDIQNPKSKLDKANEERD-------SLKKQ-----------LKNRDK-- 41
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
+D K + D++E L++K+ K +++ K K A
Sbjct: 42 -------QIEDLKKKVKDNEE---LQKKIEKLKQQNKTAKEEYEAKLAD 80
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 33.9 bits (78), Expect = 0.19
Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 8/111 (7%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E K + Q L+ ++++ L ++L++ +E+K+ + + L+ + + + KE D+
Sbjct: 2 EVKKAARLLNQSLSKLDTPEEKLETLIKRLAELLKEMKRLEKDLKKLQKRGEQLQKEKDK 61
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKE---KLSKTEEELKKE 569
L EH K K + + K LKE S EEE +KE
Sbjct: 62 LQSEHSK-----AILARSKLESLCRELQKHNKTLKEENKARSDEEEEKRKE 107
Score = 32.7 bits (75), Expect = 0.49
Identities = 31/139 (22%), Positives = 59/139 (42%), Gaps = 14/139 (10%)
Query: 483 EIKRLKE---KLSKTEEELKKEKTN--SAALK---SQADSVGKEYDRLLKEHEKVQ---K 531
K LKE S EEE +KE T + LK Q + +L +E+E+++ K
Sbjct: 86 HNKTLKEENKARSDEEEEKRKELTEKFQSTLKDIQDQMEEHSNPNHKLCEENEELREKLK 145
Query: 532 VVTEQGDKKD---DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
+ EQ + ++ +K K E + KL + +E+ ++E+ K +
Sbjct: 146 ELIEQYELREQHFEKLLKTKDLEQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVS 205
Query: 589 RLLKEHEKVQKVVTEQGDK 607
L + ++++ + DK
Sbjct: 206 TLKETEKELRAQLNLYSDK 224
Score = 30.0 bits (68), Expect = 3.6
Identities = 28/129 (21%), Positives = 56/129 (43%), Gaps = 15/129 (11%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
E + KL + +E+ ++E+ K + L + ++++ + DK +
Sbjct: 168 EQQLNDAKLEQAQEQAEQEQEKHKREKEILLKEAAQVSTLKETEKELRAQLNLYSDKFKE 227
Query: 543 KYQK-GDSDEI----KRLKEKLSKTEEELKKEKTNSAALKNQADSVGK-------EYDRL 590
S+E+ K+ EK+SK ++L+KE + A K + + K E +
Sbjct: 228 FQDTLNKSNEVFLTFKQEMEKMSKKIKKLEKE---NLAWKRKWEKSNKALLEMAEERQKR 284
Query: 591 LKEHEKVQK 599
KE EK++K
Sbjct: 285 DKEIEKLKK 293
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 34.5 bits (79), Expect = 0.20
Identities = 36/161 (22%), Positives = 70/161 (43%), Gaps = 22/161 (13%)
Query: 449 CLALPEKYGCGWCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK-EKTNSAA 507
C++ E+ E +C LE ++++ ++KL + + EL +++NS A
Sbjct: 589 CISTLEELETLKSEKENLDGELSKCKDDLEESKNKLQETEKKLEELKSELDASQESNSLA 648
Query: 508 LKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELK 567
++Q + + Y L+ K + E K L+EK+S E+EL+
Sbjct: 649 -ETQLKCMVESY-ESLELRAKDLEA------------------EHKSLQEKISSLEDELE 688
Query: 568 KEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 608
KE+ N L + + +E ++ E +QK+ E+ K
Sbjct: 689 KERQNHQELIAKCREL-EEKIERAEQEENMQKLDEEEQKIK 728
Score = 31.1 bits (70), Expect = 2.0
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 8/97 (8%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
E +E + + E K L+EK+S E+EL+KE+ N L ++ + +E ++ E
Sbjct: 657 ESYESLELRAKDLEAEHKSLQEKISSLEDELEKERQNHQELIAKCREL-EEKIERAEQEE 715
Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEE 564
+QK+ E+ K +K EI EKL++ +E
Sbjct: 716 NMQKLDEEEQKIKQEK-------EIAAAAEKLAECQE 745
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 34.2 bits (79), Expect = 0.20
Identities = 28/146 (19%), Positives = 57/146 (39%), Gaps = 10/146 (6%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQAD----SVGKEYDRLL 523
I E+ + + ++ L EKL + +EL++E L + + + LL
Sbjct: 490 PIIEEAKTEFGEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLL 549
Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEK---LSKTE-EELKKEKTNSAALKNQ 579
+ ++ + ++ + DE+K LK+K L E K+K N + +
Sbjct: 550 ELKAQIVVLAHMGLPVPAEEAKVEAVDEVKFLKKKKGILDAGAFESTLKDKKNKVLPEAK 609
Query: 580 ADS--VGKEYDRLLKEHEKVQKVVTE 603
+G E + + E V K++
Sbjct: 610 KRKLKLGDEVEVITGEPGAVVKIIAG 635
>gnl|CDD|191973 pfam08232, Striatin, Striatin family. Striatin is an intracellular
protein which has a caveolin-binding motif, a
coiled-coil structure, a calmodulin-binding site, and a
WD (pfam00400) repeat domain. It acts as a scaffold
protein and is involved in signalling pathways.
Length = 133
Score = 32.7 bits (75), Expect = 0.21
Identities = 26/94 (27%), Positives = 43/94 (45%), Gaps = 11/94 (11%)
Query: 477 LEGDSDEIKRLKEKLSKT----EEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKV 532
LEG+ ++ LKE L + E LK+E+ LK DS K +E
Sbjct: 37 LEGERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQKPSKEEEEEEL----- 91
Query: 533 VTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEEL 566
E+ D ++ +K D E+K+ + L K+ +E+
Sbjct: 92 --EEDDDEESNSEKVDESELKKSRLYLRKSLQEI 123
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 34.2 bits (78), Expect = 0.22
Identities = 32/128 (25%), Positives = 64/128 (50%), Gaps = 10/128 (7%)
Query: 486 RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-----QGDKK 540
+ E++ E EL + + LK + + KE ++LKE E ++ V + + +K
Sbjct: 26 QRSERVLVKETELMLAQEEANKLKKELEVAEKEKLQVLKELESTKRTVEDLKLKLEKAEK 85
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSA-----ALKNQADSVGKEYDRLLKEHE 595
+++ K DS+ K E+L + +EL+ E+ +A ++K + + +EYD L++E +
Sbjct: 86 EEQQAKQDSELAKLRAEELEQGIQELEVERYITATAELDSVKEELRKIRQEYDALVEERD 145
Query: 596 KVQKVVTE 603
K E
Sbjct: 146 AALKRAEE 153
Score = 32.7 bits (74), Expect = 0.56
Identities = 32/151 (21%), Positives = 68/151 (45%), Gaps = 17/151 (11%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E+ ++ E+ + L + +K E+ E ++E+ +A K Q + Y++
Sbjct: 153 EAICASKVNEKKVEELTKEIIAMKESLERAHAAHLEAEEERIGAALEKDQDR---ETYEK 209
Query: 522 LLKEHEKVQKVVTEQGD-KKDDKYQKGDSDEIKRLKEKLSKTE-------------EELK 567
LKE EK + + + D +KD + S E++ L++++S L+
Sbjct: 210 ELKEAEKELERLKQDLDPEKDLEKLAEASAELESLQKEISIMASVASVLKELEEAKANLE 269
Query: 568 KEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
K +L+N +S+ +E + KE E+++
Sbjct: 270 KAAEEEKSLRNLVESLKQELEEEKKELEELR 300
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 34.4 bits (79), Expect = 0.22
Identities = 23/127 (18%), Positives = 55/127 (43%), Gaps = 8/127 (6%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
K ++ + +++ +K K + KEK Q + K+ L K +++++K +
Sbjct: 1091 KKIDENKNKLIEIKNKSHEHVVNADKEKNK------QTEHYNKKKKSLEKIYKQMEKTLK 1144
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTE--EELKKEKTNSAALKNQADSVGKEYDRLLK 592
E + + + +EI+ E++ E++ E S + + +S K+ D++ K
Sbjct: 1145 ELENMNLEDITLNEVNEIEIEYERILIDHIVEQINNEAKKSKTIMEEIESYKKDIDQVKK 1204
Query: 593 EHEKVQK 599
K +
Sbjct: 1205 NMSKERN 1211
Score = 29.8 bits (67), Expect = 5.1
Identities = 14/72 (19%), Positives = 34/72 (47%), Gaps = 3/72 (4%)
Query: 519 YDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
YD+ +E ++++ TE K + + + DE+K +K ++ L++ + ++N
Sbjct: 1223 YDKATASYENIEELTTEAKGLKGEANRSTNVDELKEIKLQV---FSYLQQVIKENNKMEN 1279
Query: 579 QADSVGKEYDRL 590
+ Y+ L
Sbjct: 1280 ALHEIKNMYEFL 1291
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 33.8 bits (78), Expect = 0.23
Identities = 21/103 (20%), Positives = 43/103 (41%), Gaps = 12/103 (11%)
Query: 484 IKRLKEKLSKT------EEELKKEK-TNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
++ LKE E+L K + +LK ++ E +R LK+ + E+
Sbjct: 272 LQELKETTENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEK 331
Query: 537 G-DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
++ + + E K L KL + +++L+ + LK+
Sbjct: 332 RLEELEQNLIE----ERKELNSKLEEIQKKLEDLEKRLEKLKS 370
Score = 28.8 bits (65), Expect = 9.5
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKS 510
E K L KL + +++L+ + LKS
Sbjct: 343 ERKELNSKLEEIQKKLEDLEKRLEKLKS 370
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 33.9 bits (78), Expect = 0.23
Identities = 25/108 (23%), Positives = 55/108 (50%), Gaps = 8/108 (7%)
Query: 472 QCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQK 531
+ ++ LEG ++E +L L K E KKE+ N ++++ + +E ++ + E++++
Sbjct: 42 KVEEALEGATNEDGKLAADLLKLEVARKKERLN--QIRARISQLKEEIEQKRERIEELKR 99
Query: 532 VV-TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
+ + D YQ KR +L K ++E+K+ ++ AL +
Sbjct: 100 ALAQRRSDLSSASYQL-----EKRRASQLEKLQDEIKRTRSKLNALHS 142
>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
[Function unknown].
Length = 827
Score = 33.9 bits (78), Expect = 0.26
Identities = 27/140 (19%), Positives = 52/140 (37%), Gaps = 11/140 (7%)
Query: 454 EKYGCGWCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQA- 512
+K G +S C + + +K+L++ L++ +L K+ KS
Sbjct: 231 DKLKVGEIDSDVLCRDLGTLQELYKERDKILKKLEKALNELLNKLLKKSHLKTNKKSGKL 290
Query: 513 -DSVGKEYDRLLKE-------HEKVQKVVTEQGDKKDD--KYQKGDSDEIKRLKEKLSKT 562
S KE++ L + +++K+ KK D Y ++ E K+
Sbjct: 291 TPSRKKEFEILPEYVPDKKRPKHRIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIENARKS 350
Query: 563 EEELKKEKTNSAALKNQADS 582
E K+ K+QA +
Sbjct: 351 LYENTPTKSGFITFKSQASA 370
Score = 30.1 bits (68), Expect = 3.8
Identities = 22/117 (18%), Positives = 44/117 (37%), Gaps = 10/117 (8%)
Query: 497 ELKKEKTNSAALKSQADSVGK---EYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIK 553
L E + LK D + + D L ++ +Q++ K+ DK K +
Sbjct: 215 GLPSELRSDEELKELFDKLKVGEIDSDVLCRDLGTLQELY-----KERDKILKKLEKALN 269
Query: 554 RLKEKLSK--TEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 608
L KL K + KK + + K + + + + + +++K+ KK
Sbjct: 270 ELLNKLLKKSHLKTNKKSGKLTPSRKKEFEILPEYVPDKKRPKHRIKKLNKGIFGKK 326
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 32.8 bits (75), Expect = 0.26
Identities = 21/108 (19%), Positives = 53/108 (49%), Gaps = 8/108 (7%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEE-------LKKEKTNSAALKSQADSVGKEYDRLL 523
E+ +K + ++R+ + LS+ +EE + K + K + Y+ ++
Sbjct: 49 EELEKERIMNDRLLERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMV 108
Query: 524 KE-HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
++ ++V+ V E+ + + D++K+ +++L K +EL+KE+
Sbjct: 109 EDLFDQVKDEVDEKNGAALIEELQKHRDKLKKEQKELLKKLDELEKEE 156
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 32.6 bits (75), Expect = 0.27
Identities = 9/42 (21%), Positives = 22/42 (52%)
Query: 553 KRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
L+ K+++ ++E + ++ + L+ +A E D L +H
Sbjct: 70 AELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQH 111
Score = 31.8 bits (73), Expect = 0.51
Identities = 9/42 (21%), Positives = 23/42 (54%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH 526
L+ K+++ ++E + ++ + L+++A E D L +H
Sbjct: 70 AELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQH 111
>gnl|CDD|227674 COG5384, Mpp10, U3 small nucleolar ribonucleoprotein component
[Translation, ribosomal structure and biogenesis].
Length = 569
Score = 33.9 bits (77), Expect = 0.28
Identities = 24/106 (22%), Positives = 41/106 (38%), Gaps = 12/106 (11%)
Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKG 547
K K +EEL +E+ +SA K + D E D E G+ D
Sbjct: 227 DVKDPKEDEELDEEEHDSAMDKVKLDLFADEEDEPNAE---------GVGEASDKNLSSF 277
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
+ +I + E++ + E+EL K + + + + LL E
Sbjct: 278 EKQQI-EMDEQIEELEKELVAPK--EWKYAGEVSAKKRPKNSLLAE 320
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 33.2 bits (76), Expect = 0.31
Identities = 29/130 (22%), Positives = 57/130 (43%), Gaps = 9/130 (6%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEE---LKKE----KTNSAALKSQADSVGKEYD 520
E + K LE + L+EKL + EEE L+K+ + + L+ +A + +E +
Sbjct: 19 EDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEAAASEEERE 78
Query: 521 RLLKEHEKVQKVVTEQGDKKDDKYQKGD--SDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
RL E ++ V + ++++ K + E++ +E + +EL + A +
Sbjct: 79 RLEAEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHERARQELLEAAAAPTAPPH 138
Query: 579 QADSVGKEYD 588
A V E
Sbjct: 139 VAAPVNGEQL 148
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 33.6 bits (77), Expect = 0.31
Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 472 QCDKGLEGD--SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
+ + L GD + +K +KL ++L + K + A + +A++ KE +E+EK
Sbjct: 357 KSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASE-EAEAKAKEEKLKQEENEKK 415
Query: 530 QKVVTEQGDKKDDKYQK 546
QK EQ D+ +K QK
Sbjct: 416 QK---EQADEDKEKRQK 429
Score = 29.4 bits (66), Expect = 5.7
Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 12/87 (13%)
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
L K +E+V+ GDK+ + +K +KL ++L + K + A + +A+
Sbjct: 349 LTKLYEEVKSNTDLSGDKR--------QELLKEYNKKLQDYTKKLGEVKDETDASE-EAE 399
Query: 582 SVGKEYDRLLKEHEKVQKVVTEQGDKK 608
+ KE +E+EK QK EQ D+
Sbjct: 400 AKAKEEKLKQEENEKKQK---EQADED 423
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 32.3 bits (74), Expect = 0.32
Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 13/85 (15%)
Query: 483 EIKRLKEKLSKTEEEL----KKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
I+RLK +L E EL KE+ LK+ + E + + + +Q+ T+
Sbjct: 72 TIERLKTQLEDLERELALLQAKERQLEKKLKTLEQKLKNEKEEVQRLKNIIQQRKTQYNH 131
Query: 539 --KKDDKYQKGDSDEIKRLKEKLSK 561
KK D+ EI++LKE+L +
Sbjct: 132 ELKKRDR-------EIEKLKERLLQ 149
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 33.7 bits (78), Expect = 0.32
Identities = 30/91 (32%), Positives = 48/91 (52%), Gaps = 17/91 (18%)
Query: 485 KRLKEKLSKTEEE---LKKEKTNSAALKSQADSVGKEYDRLLKEHEKV-QKVVTEQGDKK 540
+RL++ L E+E L+ EK +K Q + +EY L+E K QK E G+
Sbjct: 192 ERLEKLLDLLEKEIDLLQLEKRIRKKVKEQMEKSQREY--YLREQLKAIQK---ELGEDD 246
Query: 541 DDKYQKGDSDEIKRLKEKLSKTE--EELKKE 569
DDK DE++ L+EK+ K + +E K++
Sbjct: 247 DDK------DEVEELREKIEKLKLPKEAKEK 271
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 33.5 bits (78), Expect = 0.33
Identities = 26/112 (23%), Positives = 51/112 (45%), Gaps = 6/112 (5%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH-EKV----QKVVTEQ 536
+EI+R+ + EE KK K ++QADS+ + ++ LKE +KV ++ +
Sbjct: 503 EEIERMVKDAEANAEEDKKRK-ELVEARNQADSLIYQTEKTLKELGDKVPADEKEKIEAA 561
Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
+ + + D + IK E+L++ ++L + A A + D
Sbjct: 562 IKELKEALKGEDKEAIKAKTEELTQASQKLGEAMYQQAQAAQGAAGAAAKDD 613
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 33.2 bits (76), Expect = 0.33
Identities = 34/144 (23%), Positives = 68/144 (47%), Gaps = 26/144 (18%)
Query: 466 RCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
R ++FE D +E + D KR+ + +K EE+ +L+ EY+ L+E
Sbjct: 59 RVQLFE--DPTVEKEVDIRKRVLKIYNKREEDF-------PSLR--------EYNDYLEE 101
Query: 526 HEKV-----QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
E + + E KK + YQK + D I++ KEK ++ +EEL++ + + +
Sbjct: 102 VEDIVYNLTNNIDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEE----ALEFEKEE 157
Query: 581 DSVGKEYDRLLKEHEKVQKVVTEQ 604
+ + + +E +++ K +Q
Sbjct: 158 EEQRRLLLQKEEEEQQMNKRKNKQ 181
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 33.5 bits (77), Expect = 0.34
Identities = 29/103 (28%), Positives = 47/103 (45%), Gaps = 12/103 (11%)
Query: 496 EELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRL 555
EEL++EK ALK Q RL K ++ +K E +K +K + ++ KR
Sbjct: 551 EELQREKEEKEALKEQ--------KRLRKLKKQEEKKKKEL--EKLEKAKIPPAEFFKRQ 600
Query: 556 KEKLSKTEEEL--KKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
++K S +E + K + + KEYD+ K HE+
Sbjct: 601 EDKYSAFDETGLPTHDADGEEISKKERKKLSKEYDKQAKLHEE 643
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 33.4 bits (77), Expect = 0.35
Identities = 26/108 (24%), Positives = 45/108 (41%), Gaps = 13/108 (12%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
+EI+R+ ++ EE KK K ++ ADS+ + ++ LK E GDK
Sbjct: 499 SEEEIERMVKEAEANAEEDKKRKEE-IEARNNADSLAYQAEKTLK----------EAGDK 547
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
+ ++ + LKE L + E K KT L+ + +
Sbjct: 548 LPAEEKEKIEKAVAELKEALKGEDVEEIKAKT--EELQQALQKLAEAM 593
Score = 31.5 bits (72), Expect = 1.4
Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 10/65 (15%)
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH---------EKVQ 598
+EI+R+ ++ EE KK K +N ADS+ + ++ LKE EK++
Sbjct: 499 SEEEIERMVKEAEANAEEDKKRKEE-IEARNNADSLAYQAEKTLKEAGDKLPAEEKEKIE 557
Query: 599 KVVTE 603
K V E
Sbjct: 558 KAVAE 562
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 32.3 bits (74), Expect = 0.37
Identities = 26/108 (24%), Positives = 50/108 (46%), Gaps = 11/108 (10%)
Query: 496 EELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRL 555
EEL K L+ + + ++ L E + + +T ++ Q+ EI+ L
Sbjct: 72 EELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLT------TEELQE----EIQEL 121
Query: 556 KEKLSKTEEELKK-EKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
K+++ + EE+L+ E+ + + V KEY L KE +K +K+
Sbjct: 122 KKEVREIEEKLESLEEGWKPVTPEEMEKVKKEYKDLHKEWKKRKKLCN 169
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 33.5 bits (77), Expect = 0.38
Identities = 18/126 (14%), Positives = 60/126 (47%), Gaps = 11/126 (8%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
+ L+++ + + L++ + + A +A+S+ ++ D L + E+++ E+ +
Sbjct: 313 EARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELR----EEAAEL 368
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR---LLKEHEKV 597
+ + ++ + ++ +E++ + EEE+ + + +G D L +E +++
Sbjct: 369 ESELEE-AREAVEDRREEIEELEEEI---EELRERFGDAPVDLGNAEDFLEELREERDEL 424
Query: 598 QKVVTE 603
++ E
Sbjct: 425 REREAE 430
Score = 30.4 bits (69), Expect = 3.4
Identities = 21/108 (19%), Positives = 49/108 (45%), Gaps = 12/108 (11%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
EI+ L+E +++TE E ++ L+ + + + +E D LL E D +
Sbjct: 259 EIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAE--------AGLDDADAE 310
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
+ L+++ + + L++ + + A +A+S+ ++ D L
Sbjct: 311 AV----EARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDL 354
Score = 30.4 bits (69), Expect = 3.6
Identities = 22/112 (19%), Positives = 54/112 (48%), Gaps = 8/112 (7%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
D I+RL+E+ EE + + + + +A+ + + L E E+ ++ E ++ +
Sbjct: 509 DRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAE 568
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEE---LKKEKTNSAALKNQADSVGKEYDRL 590
+ +E+ L KL++ +E L++ +T AA+ + D + + ++
Sbjct: 569 EA-----REEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKR 615
Score = 30.4 bits (69), Expect = 3.6
Identities = 26/141 (18%), Positives = 56/141 (39%), Gaps = 33/141 (23%)
Query: 473 CDKGLEG--DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRL--LKEHEK 528
C + +EG + I+ +E++ + E EL L+ + + V + +R L E E
Sbjct: 457 CGQPVEGSPHVETIEEDRERVEELEAEL-------EDLEEEVEEVEERLERAEDLVEAE- 508
Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
D I+RL+E+ EE + + + + +A+ + +
Sbjct: 509 ---------------------DRIERLEERREDLEELIAERRETIEEKRERAEELRERAA 547
Query: 589 RLLKEHEKVQKVVTEQGDKKD 609
L E E+ ++ E ++ +
Sbjct: 548 ELEAEAEEKREAAAEAEEEAE 568
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 33.5 bits (76), Expect = 0.41
Identities = 28/140 (20%), Positives = 59/140 (42%), Gaps = 8/140 (5%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
K L+ + ++ R KL + E+L T ++ + +++ K + +T
Sbjct: 504 KSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELT 563
Query: 535 EQGDKKDDKYQKGD-----SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
+K Q D S EI + +++L+K +EL + N + N+ +S ++
Sbjct: 564 SLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQ--- 620
Query: 590 LLKEHEKVQKVVTEQGDKKD 609
L +K+ V Q ++ D
Sbjct: 621 LSSYEDKLFDVCGSQDEESD 640
Score = 31.6 bits (71), Expect = 1.4
Identities = 22/121 (18%), Positives = 50/121 (41%), Gaps = 10/121 (8%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE------ 535
DE K L + +E L+K++ + + + DR+L+ ++++K E
Sbjct: 433 DEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEK 492
Query: 536 ----QGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLL 591
+ KK+ K + + ++ R KL + E+L T ++ + +++
Sbjct: 493 NSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIR 552
Query: 592 K 592
K
Sbjct: 553 K 553
Score = 29.2 bits (65), Expect = 8.7
Identities = 26/156 (16%), Positives = 55/156 (35%), Gaps = 30/156 (19%)
Query: 481 SDEIKRLKEKLSKTEEELK----KEKTNSAAL---------KSQADSVGKEYDRLLKEHE 527
+ +I+RLK + + E L +E++ L + + V ++ + + +
Sbjct: 757 NRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQ 816
Query: 528 KV--QKVVTEQGDKKDDKYQKGD------------SDEIKRLKEKLSKTEEELKKEKTNS 573
+ V + +K +K + D + + + L ELK EK
Sbjct: 817 GSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQI 876
Query: 574 AALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
+ ++ L E VQ ++ E D K+
Sbjct: 877 GTNLQRRQQFEEQLVELSTE---VQSLIREIKDAKE 909
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 32.9 bits (74), Expect = 0.42
Identities = 31/119 (26%), Positives = 57/119 (47%), Gaps = 10/119 (8%)
Query: 458 CGWCESSKRCEIFEQCDKGLEGDSDEIKR----LKEKLSKTEEELKKEKTNSAALKSQAD 513
C + K+ E+ Q K E D + L+++ KTE+E K+KT ++
Sbjct: 119 CFADDQDKKIEL-AQAKKEAENARDRANKSGIELEQEEQKTEQE--KQKTEKEGIELANS 175
Query: 514 SVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
+ E ++ E EK QK TEQ +K ++ E+++ K+K +++L KE+ +
Sbjct: 176 QIKAEQEKQKTEQEK-QK--TEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKEQKD 231
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 33.0 bits (76), Expect = 0.45
Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 14/98 (14%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE-------KVQKVVTEQ 536
+K ++E E+LKK + A + + +E +LL+E E ++ K +
Sbjct: 4 LKLIRENPDAVREKLKK-RGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRA 62
Query: 537 GDKKDDKYQ------KGDSDEIKRLKEKLSKTEEELKK 568
+ +D + K +++K L+ L + E EL
Sbjct: 63 LKRGEDDAEELIAEVKELKEKLKELEAALDELEAELDT 100
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 31.9 bits (73), Expect = 0.46
Identities = 19/101 (18%), Positives = 52/101 (51%), Gaps = 8/101 (7%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE-HEKVQKVV---TEQGDKK 540
K + + L + + ++ + + LK +A ++ EY++ L+E E+ +++ ++ ++
Sbjct: 29 KPILKALDERQAKIADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQI 88
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEK-TNSAALKNQA 580
++ + +E++R+KE E E++ EK L+ +
Sbjct: 89 AEEIKAEAEEELERIKE---AAEAEIEAEKERALEELRAEV 126
>gnl|CDD|227602 COG5277, COG5277, Actin and related proteins [Cytoskeleton].
Length = 444
Score = 33.2 bits (76), Expect = 0.46
Identities = 19/101 (18%), Positives = 43/101 (42%), Gaps = 8/101 (7%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKV-VTEQGD 538
S+ + +KE++ +T++E SA + A+ +E + E ++++
Sbjct: 221 SSEIVNEIKEEVCETDDE-------SAYVSLDAEEEFEEEEEKPAEKSTESTFQLSKETS 273
Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
+ + D +EI+ E+ K E L K + + L+
Sbjct: 274 IAKESKELPDGEEIEFGNEERFKAPEILFKPELPISGLEEA 314
>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal. This domain
corresponds to the alpha helical C terminal domain of
phospholipase C beta.
Length = 181
Score = 32.1 bits (73), Expect = 0.48
Identities = 23/105 (21%), Positives = 48/105 (45%), Gaps = 14/105 (13%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKK----EKTNSAALKSQADSVGKEYDRLLKE------H 526
E + ++K+LKE K ++ELKK ++ + +E R E
Sbjct: 47 REKQAAQLKKLKEISEKEKKELKKKLDRKRLERIQEAKTKEKHAQE--REKTEINRSHIQ 104
Query: 527 EKVQ--KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE 569
E VQ K + E ++ +K ++ ++ +++++E+ K + E E
Sbjct: 105 EVVQSIKRLEEAQKRRQEKLEEKQAEILQQIQEEEPKLQAEATAE 149
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 33.2 bits (76), Expect = 0.50
Identities = 13/100 (13%), Positives = 31/100 (31%), Gaps = 4/100 (4%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
I+RL E +++ + L A + +E +++ D
Sbjct: 855 KLIERLIEAINQYRAKKLDTAEKLEELYILAKKEEEFKQFAEEEGLNEEELAF----YDD 910
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
G E + E+ L+++ + K + +
Sbjct: 911 LALNGGKLPENGTELVEKLAKEKSLREKNKDDWKAKEEVE 950
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 31.0 bits (71), Expect = 0.50
Identities = 25/99 (25%), Positives = 50/99 (50%), Gaps = 21/99 (21%)
Query: 486 RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQK------------VV 533
+L+++L +++K+ +T + + V +E + LL E KV K V
Sbjct: 9 QLQQQLQLLAQQIKQLETQ----LKEIELVLEELE-LLDEDTKVYKLIGDVLVKQDKEEV 63
Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
E+ +++ + +K EIK L+++L K E+EL++ K
Sbjct: 64 KEELEERKETLEK----EIKTLEKQLEKLEKELEELKEE 98
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set
of eukaryotic tropomyosins. Within the yeast Tmp1 and
Tmp2, biochemical and sequence analyses indicate that
Tpm2p spans four actin monomers along a filament,
whereas Tpmlp spans five. Despite its shorter length,
Tpm2p can compete with Tpm1p for binding to F-actin.
Over-expression of Tpm2p in vivo alters the axial
budding of haploids to a bipolar pattern, and this can
be partially suppressed by co-over-expression of Tpm1p.
This suggests distinct functions for the two
tropomyosins, and indicates that the ratio between them
is important for correct morphogenesis. The family also
contains higher eukaryote Tmp3 members.
Length = 143
Score = 31.4 bits (72), Expect = 0.50
Identities = 36/144 (25%), Positives = 69/144 (47%), Gaps = 25/144 (17%)
Query: 466 RCEIFEQCDKGLEGDS----DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
R E E+ K LE ++ EI L++K + EEE++K + K + + E ++
Sbjct: 15 RAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLE----ESEK 70
Query: 522 LLKEHEKVQKVVT---EQGDKKDDKYQKGDSDEIKRLKEKLSKTEE-ELKKEKT--NSAA 575
L E + + + E+ ++ + KRLKE K E + K E++ A
Sbjct: 71 LATNAEALTRRIQLLEEELEESE-----------KRLKETTEKLREADKKAEESERKVKA 119
Query: 576 LKNQADSVGKEYDRLLKEHEKVQK 599
L+N+ D ++Y+ L K++++ +K
Sbjct: 120 LENERDEWEEKYEELEKKYKEAKK 143
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 32.5 bits (74), Expect = 0.52
Identities = 27/105 (25%), Positives = 50/105 (47%), Gaps = 12/105 (11%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
C K L +D+I L E+L+K E+L++++ L SQ + K+ E+E++Q
Sbjct: 203 NDCVKQLREANDQIASLSEELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQ 262
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEK-------LSKTEEELKK 568
+ + D + E++ L++K L + +EELK
Sbjct: 263 QHLAAAKDAQRQL-----QAELQELQDKYAECMEMLHEAQEELKN 302
Score = 31.3 bits (71), Expect = 1.2
Identities = 34/148 (22%), Positives = 68/148 (45%), Gaps = 15/148 (10%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
ESS+ + Q +++ L+EKL EEE + L+S+A + E
Sbjct: 141 ESSESTPLRPQESSSSSHGCFQLEALQEKLKLLEEE-------NEHLRSEASHLKTE--- 190
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
+ EK Q++V + + + +D+I L E+L+K E+L++++ L +Q
Sbjct: 191 TVTYEEKEQQLVNDCVKQLREA-----NDQIASLSEELAKKTEDLERQQEEITHLLSQIV 245
Query: 582 SVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
+ K+ E+E++Q+ + D +
Sbjct: 246 DLQKKCKSYALENEELQQHLAAAKDAQR 273
>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 220 and
257 amino acids in length.
Length = 221
Score = 32.2 bits (74), Expect = 0.53
Identities = 12/42 (28%), Positives = 27/42 (64%)
Query: 532 VVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
T +K+D K + ++I+ L+++++K +++LKKEK +
Sbjct: 164 SNTGVYEKEDLKERVERLEQIEELEKEIAKLKKKLKKEKQFN 205
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 33.1 bits (76), Expect = 0.54
Identities = 27/104 (25%), Positives = 46/104 (44%), Gaps = 12/104 (11%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
E+ K L+ ++I++L+ ++ K EEEL+K K L + K+YD L E
Sbjct: 95 AQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNK--FLDKAWKKLAKKYDSNLSEAL 152
Query: 528 KVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLS--KTEEELKKE 569
K G +++ E+K + S + EELK +
Sbjct: 153 K--------GLNYKKNFKEKLLKELKSVILNASSLLSLEELKAK 188
Score = 30.0 bits (68), Expect = 4.1
Identities = 28/137 (20%), Positives = 53/137 (38%), Gaps = 31/137 (22%)
Query: 480 DSDEIKRLKEKLSKTEEELKK---EKTNSAALKSQADSVGKEYD----------RLLKEH 526
D +E+K L E L + E+ + EK + + +S+ + L++EH
Sbjct: 321 DVEELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREH 380
Query: 527 -EKVQKVVTEQGDKKD--------------DKYQKGDSD---EIKRLKEKLSKTEEELKK 568
EK+ + E+ K D YQK I L++++ + E E+K
Sbjct: 381 NEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKA 440
Query: 569 EKTNSAALKNQADSVGK 585
+ L+ Q ++
Sbjct: 441 LEKEIKELEKQLTNIEP 457
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 32.8 bits (74), Expect = 0.54
Identities = 24/87 (27%), Positives = 43/87 (49%), Gaps = 11/87 (12%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
EI RLKE+L + L++ + + Q + +EY L++ E+ + Q ++KD
Sbjct: 371 EIARLKERLRVSVRRLEEYERRLLGQEQQMQKLLQEYQARLEDSEERLR---RQQEEKD- 426
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKE 569
++K + +L EEELKK+
Sbjct: 427 -------SQMKSIISRLMAVEEELKKD 446
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 33.0 bits (76), Expect = 0.55
Identities = 17/61 (27%), Positives = 35/61 (57%), Gaps = 4/61 (6%)
Query: 476 GLEGDSDEIKRLKEKLSKTEEELKK--EKTNSAALKSQADS--VGKEYDRLLKEHEKVQK 531
GL + E+ RL+++L K E+E+ + +K ++ ++A V KE ++L + K+ K
Sbjct: 808 GLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAK 867
Query: 532 V 532
+
Sbjct: 868 L 868
Score = 32.6 bits (75), Expect = 0.81
Identities = 15/54 (27%), Positives = 31/54 (57%), Gaps = 4/54 (7%)
Query: 551 EIKRLKEKLSKTEEELKK--EKTNSAALKNQADS--VGKEYDRLLKEHEKVQKV 600
E+ RL+++L K E+E+ + +K ++ +A V KE ++L + K+ K+
Sbjct: 815 ELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEKEKLAEYQVKLAKL 868
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 32.8 bits (75), Expect = 0.59
Identities = 28/140 (20%), Positives = 48/140 (34%), Gaps = 26/140 (18%)
Query: 489 EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGD 548
K K +EL++ + + S E L E + + D+ +G
Sbjct: 207 VKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDD----GEEESDDESAWEGF 262
Query: 549 SDE--------------------IKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
E KR KE L + +E K+ K A L + + KE
Sbjct: 263 ESEYEPINKPVRPKRKTKAQRNKEKRRKE-LEREAKEEKQLKKKLAQL-ARLKEIAKEVA 320
Query: 589 RLLKEHEKVQKVVTEQGDKK 608
+ K + ++ E+G+KK
Sbjct: 321 QKEKARARKKEQRKERGEKK 340
>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
Length = 653
Score = 32.8 bits (75), Expect = 0.61
Identities = 21/62 (33%), Positives = 39/62 (62%), Gaps = 2/62 (3%)
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH-EKVQKVVTEQGD 606
S+EI+R++++ K EE ++ K LKNQADS+ Y+ LKE+ E + + + ++ +
Sbjct: 503 SSNEIERMRQEAEKYAEEDRRRK-QLIELKNQADSLLYSYESTLKENGELISEELKQRAE 561
Query: 607 KK 608
+K
Sbjct: 562 QK 563
Score = 31.6 bits (72), Expect = 1.2
Identities = 26/90 (28%), Positives = 53/90 (58%), Gaps = 3/90 (3%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH-EKVQKVVTEQGD 538
S+EI+R++++ K EE ++ K LK+QADS+ Y+ LKE+ E + + + ++ +
Sbjct: 503 SSNEIERMRQEAEKYAEEDRRRK-QLIELKNQADSLLYSYESTLKENGELISEELKQRAE 561
Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
+K ++ + +D L E+L + EE ++
Sbjct: 562 QKVEQLEAALTDPNISL-EELKQQLEEFQQ 590
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 32.7 bits (74), Expect = 0.64
Identities = 25/125 (20%), Positives = 54/125 (43%), Gaps = 15/125 (12%)
Query: 474 DKGLEGDSDEIKRLK----EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
D LE D + + +K K + +EE +++ + + + +K E V
Sbjct: 272 DAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDE-----EEPPEAAMDKVKLDEPV 326
Query: 530 QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
+ V + K+ ++K + +LK+++ + E+E K S LK + + + +
Sbjct: 327 LEGVDLESPKELSSFEKRQA----KLKQQIEQLEKENLAPK--SWTLKGEVTAKKRPQNS 380
Query: 590 LLKEH 594
LL+E
Sbjct: 381 LLEED 385
>gnl|CDD|184358 PRK13874, PRK13874, conjugal transfer protein TrbJ; Provisional.
Length = 230
Score = 31.8 bits (73), Expect = 0.67
Identities = 13/34 (38%), Positives = 17/34 (50%), Gaps = 4/34 (11%)
Query: 117 RQIIQLIAQQANLLAQNEA-SMNQARQAAVAAQA 149
+Q+ L A A AQ A ++ ARQAA Q
Sbjct: 190 QQLADLQALMA---AQGRAQALEAARQAAAEEQG 220
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 30.8 bits (70), Expect = 0.67
Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 14/106 (13%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNS---AALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
D IK KE+ + + +KE +L A+ ++ +R + E K+QK ++GD
Sbjct: 25 DSIKAAKEEKKQAKLSERKENRKPKYIGSLLEAAER--RKREREIAEERKLQKEREKEGD 82
Query: 539 KKDDK-------YQKGDSDEIKRLKEKLSKTEE-ELKKEKTNSAAL 576
+ DK Y+K +E ++L+E+ + EE E + + T L
Sbjct: 83 EFADKEKFVTSAYKK-QLEENRKLEEEEKEREELEEENDVTKGKDL 127
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 32.6 bits (75), Expect = 0.69
Identities = 25/111 (22%), Positives = 54/111 (48%), Gaps = 20/111 (18%)
Query: 506 AALKSQADSVGKEYDRLLKEHEKVQKVVT-------EQGDKKDDKYQKGDSDEIKRLKEK 558
A +++Q D++ K+ K E K+V DK D QK ++++ LK++
Sbjct: 39 ADVQAQLDALNKQ-----KLLEAEDKLVQQDLEQTLALLDKIDR--QKEETEQ---LKQQ 88
Query: 559 LSKTEEELKKEKTNSAALKNQADSVGKE-YDRL-LKEHE-KVQKVVTEQGD 606
L++ +L++ + ALK+ D +E L L++ E ++ + + + +
Sbjct: 89 LAQAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQN 139
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 31.9 bits (73), Expect = 0.72
Identities = 18/112 (16%), Positives = 50/112 (44%), Gaps = 2/112 (1%)
Query: 498 LKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKE 557
L + +++ + + + + +E L +E ++Q+ + E + + Q+ + E + E
Sbjct: 54 LSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAE--LE 111
Query: 558 KLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 609
+L K +K+ N+ L + + +E L +E+E ++ + +
Sbjct: 112 RLQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQ 163
>gnl|CDD|214016 cd12923, iSH2_PI3K_IA_R, Inter-Src homology 2 (iSH2) helical domain
of Class IA Phosphoinositide 3-kinase Regulatory
subunits. PI3Ks catalyze the transfer of the
gamma-phosphoryl group from ATP to the 3-hydroxyl of the
inositol ring of D-myo-phosphatidylinositol (PtdIns) or
its derivatives. They play an important role in a
variety of fundamental cellular processes, including
cell motility, the Ras pathway, vesicle trafficking and
secretion, immune cell activation, and apoptosis. They
are classified according to their substrate specificity,
regulation, and domain structure. Class IA PI3Ks are
heterodimers of a p110 catalytic (C) subunit and a
p85-related regulatory (R) subunit. The R subunit
down-regulates PI3K basal activity, stabilizes the C
subunit, and plays a role in the activation downstream
of tyrosine kinases. All R subunits contain two SH2
domains that flank an intervening helical domain (iSH2),
which binds to the N-terminal adaptor-binding domain
(ABD) of the catalytic subunit. In vertebrates, there
are three genes (PIK3R1, PIK3R2, and PIK3R3) that encode
for different Class IA PI3K R subunits.
Length = 152
Score = 31.0 bits (71), Expect = 0.77
Identities = 28/143 (19%), Positives = 63/143 (44%), Gaps = 18/143 (12%)
Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
+K S E L EK +K +E++ LK QA +E ++ +E + Q
Sbjct: 14 NKEYLDKSREYDELYEKYNKLSQEIQ--------LKRQALEAFEEAVKMFEEQLRTQ--- 62
Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLS-------KTEEELKKEKTNSAALKNQADSVGKE 586
+ + ++ + + LK +L + EE+L+K+ + L+ + +S+ E
Sbjct: 63 EKFQKEAQPHEKQRLMENNELLKSRLKELEESKEQLEEDLRKQVAYNRELEREMNSLKPE 122
Query: 587 YDRLLKEHEKVQKVVTEQGDKKD 609
+L K+ ++ + + +G ++
Sbjct: 123 LMQLRKQKDQYLRWLKRKGVSQE 145
>gnl|CDD|202143 pfam02183, HALZ, Homeobox associated leucine zipper.
Length = 45
Score = 28.6 bits (65), Expect = 0.77
Identities = 16/56 (28%), Positives = 23/56 (41%), Gaps = 19/56 (33%)
Query: 506 AALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSK 561
LK D++ E D L KE+EK++ E+ LKEKL +
Sbjct: 8 EVLKRCYDALKAENDSLQKENEKLRA-------------------EVLSLKEKLQE 44
Score = 27.1 bits (61), Expect = 3.5
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 574 AALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDK 607
LK D++ E D L KE+EK++ V +K
Sbjct: 8 EVLKRCYDALKAENDSLQKENEKLRAEVLSLKEK 41
>gnl|CDD|149223 pfam08019, DUF1705, Domain of unknown function (DUF1705). Some
members of this family are putative bacterial membrane
proteins. This domain is found immediately N terminal to
the sulfatase domain in many sulfatases.
Length = 156
Score = 31.3 bits (72), Expect = 0.78
Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 17/83 (20%)
Query: 3 LQWTLIATVLYFEMAFMLLLILPILSTQRLHKILKSKFVQGVKTQAGWYFGCILVILSLF 62
L LI +L LL +LP L R+ KI +++ + + +LVIL +
Sbjct: 64 LSPKLILYLL-------LLGVLPALLLLRV-KIRYRPWLRELLQRLLSILASLLVILLIA 115
Query: 63 FLD-----AI----REMRKYASP 76
FL + +E+RK +P
Sbjct: 116 FLFYQDYASFFRNNKELRKLINP 138
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 32.4 bits (74), Expect = 0.79
Identities = 31/131 (23%), Positives = 57/131 (43%), Gaps = 19/131 (14%)
Query: 478 EGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
E S E+K++ +L LK E LK+ +E ++ LK ++ K +
Sbjct: 135 EQKSSELKKVDSQLPDPRALLKGED-----LKTV--ETAEEIEKSLKAMQQKWKKRKKDL 187
Query: 538 DKKDD--KYQK----------GDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
K D +Y+K + E++++KE+ K ++E K +K + KN + K
Sbjct: 188 PNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKK 247
Query: 586 EYDRLLKEHEK 596
+ L E +K
Sbjct: 248 QLKADLAELKK 258
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 32.3 bits (73), Expect = 0.85
Identities = 27/117 (23%), Positives = 57/117 (48%), Gaps = 14/117 (11%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE- 535
E D D I++L +L K EE L++E++ L+ + +G E D + + ++ ++E
Sbjct: 366 AEADKDRIQKLSAELLKLEEWLQEERSQREKLEVE---LGIEKDCNRVQLSENRRELSEL 422
Query: 536 -------QGDKKDDKYQKGD-SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVG 584
Q +K+ + +K + D I+ L+ +L K +E K ++A ++ +
Sbjct: 423 RSALRVLQKEKEQLQEEKQELLDYIRVLELRLDKEADE--KWAEDAATCEDAKSYLH 477
>gnl|CDD|237785 PRK14671, uvrC, excinuclease ABC subunit C; Provisional.
Length = 621
Score = 32.4 bits (74), Expect = 0.88
Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 11/75 (14%)
Query: 468 EIFEQCDKGLEGDSDEIKR-LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH 526
E+ E+ + L+G + + R L E++ + ELK E+ +A LK Q +S LK +
Sbjct: 200 EMIEEIVRLLKGKTSALIRSLTEEMQRAAAELKFEE--AAELKDQIES--------LKRY 249
Query: 527 EKVQKVVTEQGDKKD 541
+ QKVV++ G +D
Sbjct: 250 AERQKVVSQDGADRD 264
Score = 31.2 bits (71), Expect = 1.7
Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 10/61 (16%)
Query: 549 SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 608
S I+ L E++ + ELK E+ +A LK+Q +S LK + + QKVV++ G +
Sbjct: 214 SALIRSLTEEMQRAAAELKFEE--AAELKDQIES--------LKRYAERQKVVSQDGADR 263
Query: 609 D 609
D
Sbjct: 264 D 264
>gnl|CDD|153341 cd07657, F-BAR_Fes_Fer, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma) and
Fer (Fes related) tyrosine kinases. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Fes (feline sarcoma), also called
Fps (Fujinami poultry sarcoma), and Fer (Fes related)
are cytoplasmic (or nonreceptor) tyrosine kinases that
play roles in haematopoiesis, inflammation and immunity,
growth factor signaling, cytoskeletal regulation, cell
migration and adhesion, and the regulation of cell-cell
interactions. Although Fes and Fer show redundancy in
their biological functions, they show differences in
their expression patterns. Fer is ubiquitously expressed
while Fes is expressed predominantly in myeloid and
endothelial cells. Fes and Fer contain an N-terminal
F-BAR domain, an SH2 domain, and a C-terminal catalytic
kinase domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules. The
F-BAR domain of Fes is critical in its role in
microtubule nucleation and bundling.
Length = 237
Score = 31.6 bits (72), Expect = 0.88
Identities = 24/84 (28%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKG 547
KE + T++ K K ++ AL+S + L+K+ K +K E+ + D++Y+K
Sbjct: 68 KEIMDSTDQLSKLIKQHAEALES---GTLDKLTLLIKDKRKAKKAYQEERQQIDEQYKK- 123
Query: 548 DSDEIKRLKEKLSKTEEELKKEKT 571
+DE+++LK + K E+ K K+
Sbjct: 124 LTDEVEKLKSEYQKLLEDYKAAKS 147
>gnl|CDD|148363 pfam06717, DUF1202, Protein of unknown function (DUF1202). This
family consists of several hypothetical bacterial
proteins of around 335 residues in length. Members of
this family are found exclusively in Escherichia coli
and Salmonella species and are often referred to as YggM
proteins. The function of this family is unknown.
Length = 308
Score = 31.8 bits (72), Expect = 0.89
Identities = 22/130 (16%), Positives = 51/130 (39%), Gaps = 11/130 (8%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSV-GKEYDRLLKEHEKVQKVVTEQGDKK 540
+ R K + E+++K + +AD+ GK D E K + ++ +
Sbjct: 145 SALNRQKNSIPALEKDVKALDKQIVVAQKEADAYWGKGADGKQMTREDAFKKIHKE---R 201
Query: 541 DDKYQKGDSD--EIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
DD ++ DS+ +K KE +K+ + + ++ D + E +
Sbjct: 202 DDFNKQNDSEAFAVKYDKEVYQPAIAACRKQS-----EECYEVPIQQKRDFDINEQRRQT 256
Query: 599 KVVTEQGDKK 608
+ +++ +K
Sbjct: 257 FLKSQELSRK 266
Score = 30.3 bits (68), Expect = 2.6
Identities = 16/80 (20%), Positives = 30/80 (37%), Gaps = 12/80 (15%)
Query: 539 KKDDKYQKGDS-----------DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV-GKE 586
K ++KY +S + R K + E+++K + +AD+ GK
Sbjct: 123 KTNNKYLIVNSEDFNYRFSQIKSALNRQKNSIPALEKDVKALDKQIVVAQKEADAYWGKG 182
Query: 587 YDRLLKEHEKVQKVVTEQGD 606
D E K + ++ D
Sbjct: 183 ADGKQMTREDAFKKIHKERD 202
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 32.1 bits (74), Expect = 0.90
Identities = 15/87 (17%), Positives = 38/87 (43%), Gaps = 8/87 (9%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
EI ++ L EE++++ L + +E ++L + +++K +
Sbjct: 105 HEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLAN----- 159
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKK 568
++ G + + L+++L EEE +
Sbjct: 160 -RFSFGPA--LDELEKQLENLEEEFSQ 183
Score = 31.7 bits (73), Expect = 1.1
Identities = 31/131 (23%), Positives = 61/131 (46%), Gaps = 22/131 (16%)
Query: 483 EIKRLKEKLSKTEE-----ELKKEKTNSAALKSQADSVGKEYDRLLKEHE-KVQKVVTEQ 536
EI+ LKE++ + +L + + + ++ + D + YD L E E K +K V +
Sbjct: 257 EIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQL---YDIL--EREVKARKYVEKN 311
Query: 537 GDKKDDKYQKGD------SDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
D D + +EI R+K+ + E EL+ + L+ Q +S+ K+YD +
Sbjct: 312 SDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVR----QLEKQLESLEKQYDEI 367
Query: 591 LKEHEKVQKVV 601
+ + Q++
Sbjct: 368 TERIAE-QEIA 377
>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1. This model
describes peptide chain release factor 1 (PrfA, RF-1),
and excludes the related peptide chain release factor 2
(PrfB, RF-2). RF-1 helps recognize and terminate
translation at UAA and UAG stop codons. The
mitochondrial release factors are prfA-like, although
not included above the trusted cutoff for this model.
RF-1 does not have a translational frameshift [Protein
synthesis, Translation factors].
Length = 360
Score = 32.0 bits (73), Expect = 0.92
Identities = 22/95 (23%), Positives = 50/95 (52%), Gaps = 11/95 (11%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLL------KEHEKVQKVVTE 535
++++ L E+ + E L + S K + + KEY +L +E+++ Q+ + E
Sbjct: 7 EKLESLLERYEELEALLSDPEVISDQDKLRK--LSKEYSQLEEIVDCYREYQQAQEDIKE 64
Query: 536 QG---DKKDDKYQKGDSDEIKRLKEKLSKTEEELK 567
++ D + ++ +E++ L+EK+ + EE+LK
Sbjct: 65 AKEILEESDPEMREMAKEELEELEEKIEELEEQLK 99
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 32.3 bits (73), Expect = 0.94
Identities = 22/128 (17%), Positives = 50/128 (39%), Gaps = 16/128 (12%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK--EHEKVQKVVTEQGDKK 540
E + L + ALK+ L + K++ + +
Sbjct: 2605 ETEIPDALEKLVSGPLFVHEKALNALKA-------CGSLFLWVLARYLLAKLMLSISNME 2657
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKV 600
+DEI L L K+ +E+++E++ S ++++ D++ EY+ +K E ++
Sbjct: 2658 Q-------TDEIAVLLHNLKKSRKEIEEEESESMEIEDRIDALKSEYNASVKRLESIRVE 2710
Query: 601 VTEQGDKK 608
+ +K
Sbjct: 2711 IAMFDEKA 2718
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 30.7 bits (70), Expect = 0.98
Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 17/103 (16%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVG-KEYDRLLKEHEKVQKVVTEQGDKK 540
D K +K K +T E+K LK +G K++ +LLK +KV+K++ DKK
Sbjct: 50 DAWKAIK-KHKQTTPEIK---ECCKDLKV----LGKKDFKKLLKWRKKVRKLLGL--DKK 99
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEE------ELKKEKTNSAALK 577
+ + ++ + E++ L E+ E +LK+EK K
Sbjct: 100 EKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERK 142
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 30.3 bits (69), Expect = 1.0
Identities = 32/121 (26%), Positives = 52/121 (42%), Gaps = 22/121 (18%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
E +R +E L + EEEL+K KE L + K K + E K+
Sbjct: 22 EFERREELLKQREEELEK----------------KEE-ELQESLIKFDKFLKENEAKRRR 64
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
+K + +E K KEK EEE+K+ K LK + + + ++ + E ++ VV
Sbjct: 65 AEKKAE-EEKKLRKEK----EEEIKELKAELEELKAEIEKLEEKLEEYQPYEEFLESVVE 119
Query: 603 E 603
Sbjct: 120 R 120
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 31.8 bits (72), Expect = 1.0
Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 3/128 (2%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK-K 540
E +RLK+ + + +++K A K ++ ++ K + +K K
Sbjct: 98 AEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAA 157
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKV 600
+ + + E K+ E+ +K EE K K +AA K +A++ K K + K
Sbjct: 158 AEAAKLKAAAEAKKKAEEAAKAAEEAKA-KAEAAAAKKKAEAEAKAAAEKAKAEAEA-KA 215
Query: 601 VTEQGDKK 608
E+ +
Sbjct: 216 KAEKKAEA 223
Score = 30.3 bits (68), Expect = 3.0
Identities = 24/98 (24%), Positives = 41/98 (41%), Gaps = 8/98 (8%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK----EYDRLLKEHEKVQKVVTEQGD 538
E K+ E+ +K EE K K +AA K +A++ K + + K +K +
Sbjct: 168 EAKKKAEEAAKAAEEAKA-KAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAE 226
Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
+K +K + + K K + E K +AAL
Sbjct: 227 EKAAAEKKKAAAKAKADKAAAAAKAAE---RKAAAAAL 261
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 32.2 bits (73), Expect = 1.0
Identities = 21/190 (11%), Positives = 45/190 (23%), Gaps = 22/190 (11%)
Query: 433 HVNIYKCRDLANNCGLCLALPEKYGCGWCESSKRCEIFEQCDKGLEGDSDEIKRLKEKLS 492
+ + + Y ++ Q + + LKE++
Sbjct: 514 NPARQDIDNPGPLTRRMQRGEQTYAQLETSEED---VYHQLTSERK----QRASLKEQMQ 566
Query: 493 KTEEELKKEKTNSAALKSQADSVGKEYDRLLK------EHEK----VQKVVTEQGDKKDD 542
+ ++ K ++ RL E E Q + + + D
Sbjct: 567 EIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQD 626
Query: 543 KYQKGDSD-----EIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
E+ L + L +E+ AL + R L +
Sbjct: 627 LQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQ 686
Query: 598 QKVVTEQGDK 607
+ K
Sbjct: 687 SEKEQLTYWK 696
>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
Length = 159
Score = 30.8 bits (70), Expect = 1.0
Identities = 19/64 (29%), Positives = 36/64 (56%), Gaps = 5/64 (7%)
Query: 527 EKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL----KNQADS 582
+K++ V+ + + D+K +K D D+ K+ +E K E ELK K + K++A++
Sbjct: 28 DKIKAVIDSRQSEIDNKIEKADEDQ-KKARELKLKNERELKNAKEEGKKIVEEYKSKAEN 86
Query: 583 VGKE 586
V +E
Sbjct: 87 VYEE 90
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 31.9 bits (72), Expect = 1.0
Identities = 25/147 (17%), Positives = 47/147 (31%), Gaps = 18/147 (12%)
Query: 468 EIFEQCDKGLEGDSD-EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH 526
E+ + K +D K +E + + E + +AA +SQ +
Sbjct: 442 ELNKALAKAKILQADKAAKAYQEDILQ--REAQSRGKTAAAERSQE-----------QMT 488
Query: 527 EKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEK----LSKTEEELKKEKTNSAALKNQADS 582
++ ++ Q D K + + K L K + +EE K+ + +
Sbjct: 489 AALKALLAFQQQIADLSGAKEKASDQKSLLWKAEEQYALLKEEAKQRQLQEQKALLEHKK 548
Query: 583 VGKEYDRLLKEHEKVQKVVTEQGDKKD 609
EY L E Q E +
Sbjct: 549 ETLEYTSQLAELLDQQADRFELSAQAA 575
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 31.7 bits (72), Expect = 1.0
Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 11/124 (8%)
Query: 463 SSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKSQADSVGKEYDR 521
S+KR E EQ K + ++E L++K + +E LK+ EK AA + + K+ +
Sbjct: 73 SAKRAE--EQRKKKEQQQAEE---LQQKQAAEQERLKQLEKERLAAQEQK-----KQAEE 122
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
K+ QK E K + E KR K E KK+ AA K A+
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE 182
Query: 582 SVGK 585
+ K
Sbjct: 183 AKKK 186
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 31.3 bits (71), Expect = 1.1
Identities = 18/99 (18%), Positives = 47/99 (47%), Gaps = 8/99 (8%)
Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
E ++ + E++ LKE L E LK + +L+ L KE ++
Sbjct: 95 LEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEE-------SIKELAKELREL 147
Query: 530 QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
++ + E+ ++ ++ ++ + ++RL+E + + + L++
Sbjct: 148 RQDLREEVEELREELER-LQENLQRLQEAIQELQSLLEQ 185
>gnl|CDD|197711 smart00414, H2A, Histone 2A.
Length = 106
Score = 30.0 bits (68), Expect = 1.1
Identities = 19/54 (35%), Positives = 25/54 (46%), Gaps = 8/54 (14%)
Query: 166 YLTPNLLVTRSYSTRVARSFSAQNGISDLKPRTLDLAIENLPELPGQLLCAFTI 219
YLT +L +A + + N + PR L LAI N EL +LL TI
Sbjct: 42 YLTAEVL-------ELAGNAARDNKKRRITPRHLQLAIRNDEEL-NKLLKGVTI 87
>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
protein, PEP-CTERM locus subfamily. Members of this
protein family belong to the family of polysaccharide
chain length determinant proteins (pfam02706). All are
found in species that encode the PEP-CTERM/exosortase
system predicted to act in protein sorting in a number
of Gram-negative bacteria, and are found near the epsH
homolog that is the putative exosortase gene [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 498
Score = 31.9 bits (73), Expect = 1.2
Identities = 19/127 (14%), Positives = 48/127 (37%), Gaps = 11/127 (8%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
++ K ++++ EE+ ++E + + + Y +L E + + + + +
Sbjct: 276 DVIATKREIAQLEEQKEEEGSAKNGGPERGEIANPVYQQLQIELAEAEAEIASL-EARVA 334
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHE--KVQKV 600
+ I+RL+ L + + + L + Y++LL E +V K
Sbjct: 335 ELTA----RIERLESLL----RTIPEVEAELTQLNRDYEVNKSNYEQLLTRRESAEVSKQ 386
Query: 601 VTEQGDK 607
+ Q
Sbjct: 387 MEVQDKA 393
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 32.0 bits (73), Expect = 1.2
Identities = 28/124 (22%), Positives = 57/124 (45%), Gaps = 9/124 (7%)
Query: 478 EGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQG 537
E + EI ++++ K EE +++ + ++ LK E + L +E EK++ +
Sbjct: 411 EEERREITVYEKRIKKLEETVERLEEENSELKR-------ELEELKREIEKLESELERFR 463
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
+ DK +K EI+ ++ + E+EL+++K L+ + + K L
Sbjct: 464 REVRDKVRK--DREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTP 521
Query: 598 QKVV 601
KVV
Sbjct: 522 VKVV 525
Score = 30.1 bits (68), Expect = 4.4
Identities = 25/130 (19%), Positives = 53/130 (40%), Gaps = 31/130 (23%)
Query: 476 GLEGDSDEIK-------RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEK 528
G+ D + IK L E LSK +EE + + G E +
Sbjct: 372 GIWKDVERIKALVIRGYPLAEALSKVKEEERPR-----------EKEGTE--EEERREIT 418
Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
V + ++ ++ ++ ++ E LK +L EELK+E L+++ + +E
Sbjct: 419 VYEKRIKKLEETVERLEE----ENSELKREL----EELKREIEK---LESELERFRREVR 467
Query: 589 RLLKEHEKVQ 598
+++ +++
Sbjct: 468 DKVRKDREIR 477
Score = 29.3 bits (66), Expect = 7.9
Identities = 20/102 (19%), Positives = 50/102 (49%), Gaps = 15/102 (14%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
++ ++ +E +E LK +L EELK+E L+S+ + +E +++ ++
Sbjct: 425 KKLEETVERLEEENSELKREL----EELKREIEK---LESELERFRREVRDKVRKDREI- 476
Query: 531 KVVTEQGDKKDDKYQKGDSDE---IKRLKEKLSKTEEELKKE 569
D++ ++ +K ++ ++ L+ KL++ + K E
Sbjct: 477 ----RARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLE 514
>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
Length = 359
Score = 31.6 bits (73), Expect = 1.2
Identities = 26/101 (25%), Positives = 49/101 (48%), Gaps = 23/101 (22%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE-------Q 536
+ +L E L + EEL+ ++ + Q K + +L KE+ +++ +V Q
Sbjct: 5 LDKL-EALEERYEELEALLSDPEVISDQ-----KRFRKLSKEYAELEPIVEAYREYKQAQ 58
Query: 537 GDKKDDK---YQKGDSD-------EIKRLKEKLSKTEEELK 567
D ++ K ++ D + E+K L+E+L + EEELK
Sbjct: 59 EDLEEAKEMLEEESDPEMREMAKEELKELEERLEELEEELK 99
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 31.7 bits (73), Expect = 1.2
Identities = 26/140 (18%), Positives = 58/140 (41%), Gaps = 21/140 (15%)
Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
++ E++ L + + EEE + + + +A + V +++ L K+++K++K
Sbjct: 179 EEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQ 238
Query: 534 TEQGD---KKDDKYQKGDSDEIKRLKEKL------SKTEEEL--------KKEKTNSAAL 576
++ + + KY K ++LKE+L SK +EL K+ + L
Sbjct: 239 EKKVEGRLAQHKKYAK----LREKLKEELKSLRLTSKQIDELVEQLRDINKRVRGQEREL 294
Query: 577 KNQADSVGKEYDRLLKEHEK 596
+ + LK +
Sbjct: 295 LRLVERLKMPRKDFLKLFQG 314
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 32.0 bits (72), Expect = 1.2
Identities = 22/104 (21%), Positives = 50/104 (48%), Gaps = 3/104 (2%)
Query: 476 GLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE 535
G+ G +E + EEL + N ALK + + +E D L + +KV + E
Sbjct: 187 GIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEIDILKNDGDKVAE---E 243
Query: 536 QGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ 579
+ + D+++++ + D+ + L++ ++ +EE K +++
Sbjct: 244 EEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHD 287
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 31.5 bits (72), Expect = 1.3
Identities = 25/92 (27%), Positives = 45/92 (48%), Gaps = 14/92 (15%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+E+K L+E+L + EEEL+K K L K ++ EQ +K
Sbjct: 334 EELKELEEELKELEEELEKIKKLLKKLPK-------------KARGQLPPEKREQLEKLL 380
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
+ +K S+E++ L+E+L + +EEL+ +
Sbjct: 381 ETKEKL-SEELEELEEELKELKEELESLYSEG 411
Score = 31.5 bits (72), Expect = 1.4
Identities = 23/96 (23%), Positives = 47/96 (48%), Gaps = 3/96 (3%)
Query: 492 SKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDE 551
+KTE E+ + LK + + +E L +E EK++K++ + K + ++
Sbjct: 319 TKTEIEVG---VDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQ 375
Query: 552 IKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
+++L E K EEL++ + LK + +S+ E
Sbjct: 376 LEKLLETKEKLSEELEELEEELKELKEELESLYSEG 411
>gnl|CDD|215878 pfam00362, Integrin_beta, Integrin, beta chain. Integrins have
been found in animals and their homologues have also
been found in cyanobacteria, probably due to horizontal
gene transfer. The sequences repeats have been trimmed
due to an overlap with EGF.
Length = 424
Score = 31.4 bits (72), Expect = 1.3
Identities = 11/36 (30%), Positives = 15/36 (41%), Gaps = 10/36 (27%)
Query: 284 SCTQCVSSDFPCDWCVD--------GHRCTHDTAEN 311
+C +C+ S C WC RC DT E+
Sbjct: 1 TCGECIRSGPSCAWCTQENFTGEFNSPRC--DTLES 34
>gnl|CDD|239398 cd03124, alpha_CA_prokaryotic_like, Carbonic anhydrase alpha,
prokaryotic-like subfamily. Carbonic anhydrases (CAs)
are zinc-containing enzymes that catalyze the reversible
hydration of carbon dioxide in a two-step mechanism: a
nucleophilic attack of a zinc-bound hydroxide ion on
carbon dioxide, followed by the regeneration of the
active site by ionization of the zinc-bound water
molecule and removal of a proton from the active site.
They are ubiquitous enzymes involved in fundamental
processes like photosynthesis, respiration, pH
homeostasis and ion transport. Most alpha CAs are
monomeric enzymes. The zinc ion is complexed by three
histidines. This sub-family includes bacterial carbonic
anhydrase alpha, as well as plant enzymes such as
tobacco nectarin III and yam dioscorin and, carbonic
anhydrases from molluscs, such as nacrein, which are
part of the organic matrix layer in shells. Other
members of this family may be involved in maintaining pH
balance, in facilitating transport of carbon dioxide or
carbonic acid, or in sensing carbon dioxide levels in
the environment. Dioscorin is the major storage protein
of yam tubers and may play a role as an antioxidant.
Tobacco Nectarin may play a role in the maintenace of pH
and oxidative balance in nectar. Mollusc nacrein may
participate in calcium carbonate crystal formation of
the nacreous layer. This subfamily also includes three
alpha carbonic anhydrases from Chlamydomonas reinhardtii
(CAH 1-3). CAHs1-2 are localized in the periplasmic
space. CAH1 faciliates the movement of carbon dioxide
across the plasma membrane when the medium is alkaline.
CAH3 is localized to the thylakoid lumen and provides
CO2 to Rubisco.
Length = 216
Score = 31.1 bits (71), Expect = 1.3
Identities = 25/105 (23%), Positives = 35/105 (33%), Gaps = 24/105 (22%)
Query: 53 GCILVILSLF-------FLDAI-REMRKYASPEVKEEAHGHLDAEMQNNMKLFRAQRNFY 104
G + V+ LF FL I M K EV A + L R++Y
Sbjct: 108 GQLAVVAVLFEEGKENPFLKKILDNMPKKEGTEVNLPAILDPNE-------LLPESRSYY 160
Query: 105 ISGFSL--------FLWLVIRQIIQLIAQQANLLAQNEASMNQAR 141
SL W+V++Q I + +Q N AR
Sbjct: 161 RYEGSLTTPPCSEGVRWIVLKQPITISKEQLAKFRAAVYPNN-AR 204
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 31.8 bits (72), Expect = 1.3
Identities = 20/141 (14%), Positives = 47/141 (33%), Gaps = 13/141 (9%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
L D R + ++ E +L E + K + + + ++ ++ V K
Sbjct: 242 NELSSLEDMKNRYESEIKTAESDLSMELEKNNYYK----ELEERHMKI--INDPVYKNRN 295
Query: 535 EQGDKKDDKYQKGDSDEI-KRLKEKLSKTEEELKK----EKTNSAALKNQA--DSVGKEY 587
D K + +I + +++K +KK +K + +K ++ D + +
Sbjct: 296 YINDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQI 355
Query: 588 DRLLKEHEKVQKVVTEQGDKK 608
L + K
Sbjct: 356 LELEGYEMDYNSYLKSIESLK 376
Score = 31.8 bits (72), Expect = 1.5
Identities = 22/99 (22%), Positives = 41/99 (41%), Gaps = 8/99 (8%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
E K L EKL + KK+ A + S + + R+ + ++K D K +
Sbjct: 640 ENKILIEKLRGKIDNYKKQ---IAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKAN 696
Query: 543 KYQKGDSDEIKR-----LKEKLSKTEEELKKEKTNSAAL 576
+ + + EI R L ++++ E L+ K A+
Sbjct: 697 RARLESTIEILRTRINELSDRINDINETLESMKKIKKAI 735
Score = 31.4 bits (71), Expect = 1.5
Identities = 29/157 (18%), Positives = 63/157 (40%), Gaps = 18/157 (11%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKE-----------KTNSAALKSQADSVGKEY 519
E+ K +E S I + + + +KKE + ++L + ++ +
Sbjct: 380 EEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENL 439
Query: 520 DRL------LKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
D L L T G++K + +++ RL+EK+ + E E+K
Sbjct: 440 DELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKI 499
Query: 574 AALKNQADSV-GKEYDRLLKEHEKVQKVVTEQGDKKD 609
LK + + + +E ++ + E+ K++ + D K
Sbjct: 500 VDLKKRKEYLESEEINKSINEYNKIESARADLEDIKI 536
Score = 30.3 bits (68), Expect = 3.6
Identities = 26/134 (19%), Positives = 45/134 (33%), Gaps = 9/134 (6%)
Query: 482 DEIKRLKEKLSK---TEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
D I L+ ++S EE+LK +K Q K + LKE E++ D
Sbjct: 173 DVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMD 232
Query: 539 KKDD-----KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK-EYDRLLK 592
++ D R + ++ E +L E + K + K D + K
Sbjct: 233 DYNNLKSALNELSSLEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYK 292
Query: 593 EHEKVQKVVTEQGD 606
+ + D
Sbjct: 293 NRNYINDYFKYKND 306
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 31.3 bits (71), Expect = 1.3
Identities = 26/101 (25%), Positives = 47/101 (46%), Gaps = 16/101 (15%)
Query: 491 LSKTEE-----ELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQ 545
LSK +E EL+ + A ++QA +R E E+ +K + EQ ++ + Q
Sbjct: 56 LSKMQEKTRQAELEAKIKEYEAQQAQAKL-----ERARVEAEERRKTLQEQTQQEQQRAQ 110
Query: 546 KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE 586
D KR +++L ++ + N LK Q +SV ++
Sbjct: 111 YQDELARKRYQKEL------EQQRRQNEELLKMQEESVLRQ 145
>gnl|CDD|226703 COG4252, COG4252, Predicted transmembrane sensor domain [Signal
transduction mechanisms].
Length = 400
Score = 31.6 bits (72), Expect = 1.4
Identities = 18/71 (25%), Positives = 26/71 (36%), Gaps = 3/71 (4%)
Query: 140 ARQAAVAAQALLDGPRGKWFGSHVWAYLTPNLLVTRSYSTRVARSFSAQNGISDLKPRTL 199
RQA + L+ GK + TP S + ++A + A GIS P
Sbjct: 148 PRQAQIGFSDLILDSDGKVRRLLL--AATPGPEPKYSLALKLALQYLASLGISPKYPDPE 205
Query: 200 DLAIENLPELP 210
L + LP
Sbjct: 206 RLRL-GKTTLP 215
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 30.4 bits (69), Expect = 1.5
Identities = 22/102 (21%), Positives = 40/102 (39%), Gaps = 14/102 (13%)
Query: 62 FFLDAIREMRKYASPEVKEE------AHGHLDAEMQNNMKL---FRAQRNFYI-SGFSLF 111
F L + ++R+ + K E L+ ++Q +K + Q + +G S
Sbjct: 2 FRLQKLLDLREKEEEQAKLELAKAQAEFERLETQLQQLIKYREEYEQQALEKVGAGTSAL 61
Query: 112 LWL----VIRQIIQLIAQQANLLAQNEASMNQARQAAVAAQA 149
IRQ+ Q I QQ LA + + R+ + A+
Sbjct: 62 ELSNYQRFIRQLDQRIQQQQQELALLQQEVEAKRERLLEARR 103
>gnl|CDD|221959 pfam13177, DNA_pol3_delta2, DNA polymerase III, delta subunit. DNA
polymerase III, delta subunit (EC 2.7.7.7) is required
for, along with delta' subunit, the assembly of the
processivity factor beta(2) onto primed DNA in the DNA
polymerase III holoenzyme-catalyzed reaction. The delta
subunit is also known as HolA.
Length = 161
Score = 30.2 bits (69), Expect = 1.5
Identities = 22/77 (28%), Positives = 31/77 (40%), Gaps = 14/77 (18%)
Query: 448 LCLALPEKYGCGWCESSKRCEIFEQCD-KGLEGDS-----DEIKRLKEKLSKTEEELKK- 500
C + CG C+S KR E D +E + D+I+ L E+ SK E KK
Sbjct: 44 FCENPQDDEPCGECKSCKRIENGNHPDVIIIEPEGKSIKIDQIRELIEEFSKKPFEGKKK 103
Query: 501 -------EKTNSAALKS 510
EK +A +
Sbjct: 104 VYIIEDAEKMTESAANA 120
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 31.3 bits (72), Expect = 1.5
Identities = 34/156 (21%), Positives = 68/156 (43%), Gaps = 40/156 (25%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQA---------DSVGKEYDRLLKEHEKVQK---- 531
+ LK+ + + EELKKE AA +S+ + ++ + +E E+ K
Sbjct: 87 RELKKDMEELVEELKKEI--PAAFESEEYEARKEEIEEEFQEKREEAFEELEEEAKEKGF 144
Query: 532 --VVTEQG-------DKK---DDKYQKGDSDEIKRLKEKLSKTEEEL--------KKEKT 571
V T G D + +++++ +E + L+EK+ + EEEL + E+
Sbjct: 145 ALVRTPGGFVFAPLKDGEPLTEEEFEALPEEEREELEEKIDELEEELQEILRQLRELERE 204
Query: 572 NSAALK--NQ---ADSVGKEYDRLLKEHEKVQKVVT 602
L+ ++ +VG D L +++ + KV+
Sbjct: 205 AREKLRELDREVALFAVGPLLDELKEKYADLPKVLA 240
>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain. Mitosin or
centromere-associated protein-F (Cenp-F) is found bound
across the centromere as one of the proteins of the
outer layer of the kinetochore. Most of the
kinetochore/centromere functions appear to depend upon
binding of the C-terminal par to f the molecule, whereas
the N-terminal part, here, may be a cytoplasmic player
in controlling the function of microtubules and dynein.
Length = 288
Score = 31.3 bits (70), Expect = 1.5
Identities = 31/114 (27%), Positives = 58/114 (50%), Gaps = 9/114 (7%)
Query: 477 LEGDSDEIKRLKEK----LSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK--EH-EKV 529
LEG D++K+ K++ L E L+K+K K++ ++ +E RL++ EH EK
Sbjct: 23 LEGQLDKLKKEKQQRQFQLDSLEAALQKQKQKVENEKTEGANLKRENQRLMEICEHLEKT 82
Query: 530 QKVVTEQGDKKDDK--YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQAD 581
++ ++ + K+ + +Q+G + K+ EKL + + K E S AD
Sbjct: 83 KQKISHELQVKESQVNFQEGQLNSCKKQIEKLEQELKRCKSELERSQQAAQSAD 136
>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1. MAT1
is an assembly/targeting factor for cyclin-dependent
kinase-activating kinase (CAK), which interacts with the
transcription factor TFIIH. The domain found to the
N-terminal side of this domain is a C3HC4 RING finger.
Length = 200
Score = 30.9 bits (70), Expect = 1.5
Identities = 35/144 (24%), Positives = 70/144 (48%), Gaps = 26/144 (18%)
Query: 466 RCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
R +IFE D +E + D KR+ + +K EE+ S+ +EY+ L+E
Sbjct: 7 RVQIFE--DPTIEKEVDIRKRILKIFNKREEDF--------------PSL-REYNDYLEE 49
Query: 526 HEKV-----QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
E + + E+ ++K D+Y+K + D I R K +L++ +EEL++ +K +
Sbjct: 50 VETIVFNLTNGIDVEETEEKVDQYEKENKDSIMRNKRRLTREQEELEQALEEEKEMKEEK 109
Query: 581 DSVGKEYDRLLKEHEKVQKVVTEQ 604
+ + + ++ +K+ K +Q
Sbjct: 110 ----RLHLQKEEQEQKMAKEKDKQ 129
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 30.2 bits (69), Expect = 1.6
Identities = 27/110 (24%), Positives = 60/110 (54%), Gaps = 11/110 (10%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEK-TNSAALKSQADSVGKEYD 520
ES +Q +K + E+++L+++L K +E+L+K+ T S A + + + KE
Sbjct: 12 ESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKE---KELQ 68
Query: 521 RLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
+ ++E ++ Q+ K QK +E++++ +K++K +E+ K+K
Sbjct: 69 KKVQEFQRKQQ-------KLQQDLQKRQQEELQKILDKINKAIKEVAKKK 111
Score = 29.1 bits (66), Expect = 3.5
Identities = 20/81 (24%), Positives = 46/81 (56%), Gaps = 9/81 (11%)
Query: 520 DRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK-TNSAALKN 578
++L+E + +Q +K+ K Q E+++L+++L K +E+L+K+ T S A +
Sbjct: 7 QKILQESPAGKAAQ-KQLEKEFKKRQA----ELEKLEKELQKLKEKLQKDAATLSEAARE 61
Query: 579 QADSVGKEYDRLLKEHEKVQK 599
+ + KE + ++E ++ Q+
Sbjct: 62 KKE---KELQKKVQEFQRKQQ 79
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 31.2 bits (71), Expect = 1.6
Identities = 18/71 (25%), Positives = 33/71 (46%)
Query: 539 KKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
KK + SD ++LK K + E+L E K + + K+ +RL + EK++
Sbjct: 297 KKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLE 356
Query: 599 KVVTEQGDKKD 609
T++ + K
Sbjct: 357 VQATDKEENKT 367
Score = 31.2 bits (71), Expect = 1.6
Identities = 25/104 (24%), Positives = 54/104 (51%), Gaps = 10/104 (9%)
Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
+K +E ++IK LK +L + ++ + + S + ++ ++L E ++ +K
Sbjct: 276 EKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEK 335
Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELK-KEKTNSAAL 576
KK++K +K +I+RL+E++ K E + KE+ + AL
Sbjct: 336 -----KKEEKKKK----QIERLEERIEKLEVQATDKEENKTVAL 370
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 31.1 bits (70), Expect = 1.6
Identities = 28/156 (17%), Positives = 60/156 (38%), Gaps = 14/156 (8%)
Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLK 524
R E+ E + + +L K +E + + AA + +E RL K
Sbjct: 86 LRTEL-GTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTK 144
Query: 525 EHEKVQ---KVVTEQGDKKDDKYQ---------KGDSDEIKRLKEKLSKTEEELKKEKTN 572
+ + +Q K + EQ + + + Q + + ++K L ++++E N
Sbjct: 145 QAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQN 204
Query: 573 SAALKNQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 608
A N A + +E R ++ + + +Q D +
Sbjct: 205 LATRANAAQARTEELARRAAAAQQTAQAI-QQRDAQ 239
Score = 31.1 bits (70), Expect = 2.0
Identities = 20/113 (17%), Positives = 49/113 (43%), Gaps = 5/113 (4%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
K L+ + ++K L ++++E N A + A + +E R ++ + +
Sbjct: 175 KQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQ 234
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEY 587
++ + K Q +I E++ + E +L++ +T A L+ + + Y
Sbjct: 235 QRDAQISQKAQ-----QIAARAEQIRERERQLQRLETAQARLEQEVAQLEAYY 282
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 29.9 bits (68), Expect = 1.6
Identities = 17/89 (19%), Positives = 44/89 (49%), Gaps = 4/89 (4%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
E + LKE++ + EE+L++ K L+ + + + +K +++ ++ + K +
Sbjct: 40 ENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKEAEAKAE 99
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKT 571
+ ++E K+L + +ELK++
Sbjct: 100 RIVNDANEEAKKLATEY----DELKRQSR 124
Score = 27.6 bits (62), Expect = 8.5
Identities = 15/66 (22%), Positives = 25/66 (37%), Gaps = 26/66 (39%)
Query: 513 DSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
D V K+Y+ L KE + LKE++ + EE+L++ K
Sbjct: 28 DQVIKDYEALYKE--------------------------NEELKEEIERLEEKLEEYKEL 61
Query: 573 SAALKN 578
L+
Sbjct: 62 EETLQK 67
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 30.8 bits (70), Expect = 1.7
Identities = 22/129 (17%), Positives = 53/129 (41%), Gaps = 15/129 (11%)
Query: 484 IKRLKEKLSKTEEELKKEKT-NSAALKSQADSVGKEYDRLLKE------------HEKVQ 530
I + +EK ++E++K KT + ++++ + K+ ++L +E H
Sbjct: 69 ILKREEKEVLSKEQVKLLKTQDLNYVRTKRQAEAKKIEKLKEELHLTDSGSAGPKHIVFV 128
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
EQ +Y + + R + + +L+K K ++ ++ +
Sbjct: 129 DDEEEQKSFDPAEYFDTTPELLDRRENRP--RISQLEKTSLVDEKQKKKSAKKKRKLYKE 186
Query: 591 LKEHEKVQK 599
LKE ++ +K
Sbjct: 187 LKERKEREK 195
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 31.2 bits (71), Expect = 1.7
Identities = 22/124 (17%), Positives = 51/124 (41%), Gaps = 6/124 (4%)
Query: 482 DEIKRLKEKL-----SKTEEELKKEKTNSAALK-SQADSVGKEYDRLLKEHEKVQKVVTE 535
EI+ L+ +L S EE + K N K + + + LK+ K + +
Sbjct: 392 AEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKK 451
Query: 536 QGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHE 595
+ D++++ + ++ K + L ++E+ K+E+ +N + K+ +
Sbjct: 452 ESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDEENPWLKTTSSVGKSAKKQD 511
Query: 596 KVQK 599
+K
Sbjct: 512 SKKK 515
>gnl|CDD|187794 cd09663, Csm4_III-A, CRISPR/Cas system-associated RAMP superfamily
protein Csm4. CRISPR (Clustered Regularly Interspaced
Short Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
RAMP superfamily protein.
Length = 301
Score = 30.9 bits (70), Expect = 1.7
Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 11/79 (13%)
Query: 474 DKGLEGDSDEIKR--------LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
LE ++K+ L+ LS +E K+ K + K + + G + K
Sbjct: 77 LNLLEKKKKKLKKISYIPIDELESWLSGEADEKKQLKNIT---KIKTEGNGDNKGKPHKV 133
Query: 526 HEKVQKVVTEQGDKKDDKY 544
+ KV ++ + K + Y
Sbjct: 134 LIEEPKVSIDRIEIKSNLY 152
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 29.2 bits (66), Expect = 1.7
Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 7/88 (7%)
Query: 483 EIKRLK-EKLSKTEEELKKEKT---NSAALKSQADSVGKEYDRLLKEHEKV-QKVVTEQG 537
E+KRLK K + EE+L+K K A S+ D ++D ++ ++ K+ +
Sbjct: 14 ELKRLKNLKREEIEEKLEKIKKVAGLRGADLSEEDLADGDFDP--EKWDEEMAKLFGDDY 71
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEE 565
+++D+ +K D+ + + + + +EE
Sbjct: 72 YEEEDEEEKPTWDDDEDIGDLVPDEDEE 99
>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN. All proteins in this
family for which functions are known are ATP binding
proteins involved in the initiation of recombination and
recombinational repair [DNA metabolism, DNA replication,
recombination, and repair].
Length = 563
Score = 31.2 bits (71), Expect = 1.9
Identities = 26/112 (23%), Positives = 54/112 (48%), Gaps = 10/112 (8%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
E++ ++L E L + + A +K G + +L+ EK++ E+ D+ DD
Sbjct: 288 ELQNYLDELEFDPERLNEIEERLAQIKRLKRKYGASVEEVLEYAEKIK----EELDQLDD 343
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQ-ADSVGKEYDRLLKE 593
+ ++ L+E++ K EEEL K + ++ + A+ + K ++ LK
Sbjct: 344 SDESLEA-----LEEEVDKLEEELDKAAVALSLIRRKAAERLAKRVEQELKA 390
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 31.2 bits (71), Expect = 1.9
Identities = 17/79 (21%), Positives = 31/79 (39%), Gaps = 8/79 (10%)
Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEEL--------KKEKTNSAA 575
E ++ ++ ++G KK +K K D D+ +L +E KK+K
Sbjct: 8 NEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKK 67
Query: 576 LKNQADSVGKEYDRLLKEH 594
KN ++ YD +
Sbjct: 68 KKNLGEAYDLAYDLPVVWS 86
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 29.5 bits (67), Expect = 1.9
Identities = 23/114 (20%), Positives = 47/114 (41%), Gaps = 21/114 (18%)
Query: 501 EKTNSAALKSQAD--------SVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDS--- 549
++A LK+ D S+G+ + +E ++ QG + D +
Sbjct: 11 AAADAAKLKTDKDKLSYAIGASLGRYLKQQGEEQLDKDALL--QGLQ--DALAGKELKLT 66
Query: 550 -DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVT 602
+EI++ + K + ++ K A +N+A G+ + L E+ K + V T
Sbjct: 67 DEEIQKALQAFQKKLQAKQQAKAEKKAEENKAA--GEAF---LAENAKKEGVKT 115
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.0 bits (70), Expect = 2.0
Identities = 30/130 (23%), Positives = 57/130 (43%), Gaps = 3/130 (2%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKS--QADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
+ K+L+++ + E++ E+ L+ A+ K+ ++ K+ E+ QK E K
Sbjct: 68 ERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAK 127
Query: 540 -KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
+ K +++ K+ KE+ K EE K K + A K A++ K + E
Sbjct: 128 QAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKAEAKA 187
Query: 599 KVVTEQGDKK 608
K E+ K
Sbjct: 188 KAKAEEAKAK 197
Score = 29.4 bits (66), Expect = 6.0
Identities = 25/101 (24%), Positives = 39/101 (38%), Gaps = 15/101 (14%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
K+ E +K E E +K+ K QA E + K + + KK +
Sbjct: 127 KQAAEAKAKAEAEAEKKAK--EEAKKQA-----EEEAKAKAAAEAK--------KKAAEA 171
Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
+K E K E +K + E K K +A K A++ K
Sbjct: 172 KKKAEAEAKAKAEAKAKAKAEEAKAKAEAAKAKAAAEAAAK 212
>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
factor dissociates the posttermination complex, composed
of the ribosome, deacylated tRNA, and mRNA, after
termination of translation. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis. RRF is believed to bind the ribosome at the
A-site in a manner that mimics tRNA, but the specific
mechanisms remain unclear. RRF is essential for
bacterial growth. It is not necessary for cell growth
in archaea or eukaryotes, but is found in mitochondria
or chloroplasts of some eukaryotic species.
Length = 179
Score = 29.9 bits (68), Expect = 2.2
Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
KE K K + E+ ++ +D+IK+L+++ +E+E+KK + L+ D
Sbjct: 107 KELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAE---EDLQKLTDEY 163
Query: 584 GKEYDRLLKEHEK 596
K+ D LLK EK
Sbjct: 164 IKKIDELLKSKEK 176
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 30.9 bits (70), Expect = 2.2
Identities = 21/105 (20%), Positives = 39/105 (37%), Gaps = 19/105 (18%)
Query: 493 KTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY-QKGDSDE 551
K E+ ++++ + K + D+ G + K D + S +
Sbjct: 191 KEEKAEERKQESKKGAKRKKDASGDD----------KSKKAKTDRDVSTSTAASQKKSSD 240
Query: 552 IKRLKEKLSKTEEELK---KEKTNSAALK-----NQADSVGKEYD 588
++ E SK LK K+ ++A L+ N D+ G E D
Sbjct: 241 LESKLEAQSKELWSLKDDLKKHVSTAELREMLEANGQDTSGSELD 285
>gnl|CDD|237631 PRK14162, PRK14162, heat shock protein GrpE; Provisional.
Length = 194
Score = 30.2 bits (68), Expect = 2.3
Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 21/144 (14%)
Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVG------KEYDRLLK-EH 526
+E EI LK K E++ L+SQA+ KE +L+K E
Sbjct: 38 QNPVEDLEKEIADLKAKNKDLEDKY---------LRSQAEIQNMQNRYAKERAQLIKYES 88
Query: 527 EKVQKVVTEQGDKKDDKYQ-KGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGK 585
+ + K V D + K D + K+LK+ + T + L K + + +AD G+
Sbjct: 89 QSLAKDVLPAMDNLERALAVKADDEAAKQLKKGVQMTLDHLVKALKDHGVTEIKAD--GE 146
Query: 586 EYDRLLKEHEKVQKVVTEQGDKKD 609
++D L H+ VQ V E D+KD
Sbjct: 147 KFDPTL--HQAVQTVAAENDDQKD 168
Score = 29.8 bits (67), Expect = 3.1
Identities = 19/91 (20%), Positives = 44/91 (48%), Gaps = 4/91 (4%)
Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTE---QGDKKDDKY 544
KE+ ++ ++E T+ A K ++ +E E ++K + + + +DKY
Sbjct: 3 KEEFPSEKDLPQEETTDEAPKKEAKEAPKEEDQEKQNPVEDLEKEIADLKAKNKDLEDKY 62
Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAA 575
+ + EI+ ++ + +K +L K ++ S A
Sbjct: 63 LRSQA-EIQNMQNRYAKERAQLIKYESQSLA 92
>gnl|CDD|214660 smart00434, TOP4c, DNA Topoisomerase IV. Bacterial DNA
topoisomerase IV, GyrA, ParC.
Length = 444
Score = 30.6 bits (70), Expect = 2.3
Identities = 19/94 (20%), Positives = 35/94 (37%), Gaps = 29/94 (30%)
Query: 481 SDEIKRLKEKLSKTEEELKKEK-TNSAALKSQADSV---------GKEYDRLLKEHEKVQ 530
DEI L +E K++ + QAD++ E ++L KE ++++
Sbjct: 368 IDEIIVLIRSSKDLAKEAKEKLMERFELSEIQADAILDMRLRRLTKLEVEKLEKELKELE 427
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEE 564
K EI+ L++ L+ E
Sbjct: 428 K-------------------EIEDLEKILASEEW 442
Score = 29.1 bits (66), Expect = 8.8
Identities = 14/61 (22%), Positives = 26/61 (42%), Gaps = 10/61 (16%)
Query: 549 SDEIKRLKEKLSKTEEELKKEK-TNSAALKNQADSV---------GKEYDRLLKEHEKVQ 598
DEI L +E K++ + QAD++ E ++L KE ++++
Sbjct: 368 IDEIIVLIRSSKDLAKEAKEKLMERFELSEIQADAILDMRLRRLTKLEVEKLEKELKELE 427
Query: 599 K 599
K
Sbjct: 428 K 428
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 29.7 bits (67), Expect = 2.3
Identities = 17/72 (23%), Positives = 36/72 (50%), Gaps = 4/72 (5%)
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
+ + D QK +++ +K E L K EEL+KE L+ + ++ ++Y L+
Sbjct: 93 KTTNPSDQALQK-ENERLKNQNESLQKRNEELEKEL---EKLRQRLSTIEEDYQTLIDIM 148
Query: 595 EKVQKVVTEQGD 606
++ +K+ + D
Sbjct: 149 DRARKLAVVEDD 160
Score = 29.7 bits (67), Expect = 2.6
Identities = 16/65 (24%), Positives = 35/65 (53%), Gaps = 3/65 (4%)
Query: 474 DKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV 533
D+ L+ +++ +K E L K EEL+KE L+ + ++ ++Y L+ ++ +K+
Sbjct: 99 DQALQKENERLKNQNESLQKRNEELEKEL---EKLRQRLSTIEEDYQTLIDIMDRARKLA 155
Query: 534 TEQGD 538
+ D
Sbjct: 156 VVEDD 160
>gnl|CDD|234547 TIGR04330, cas_Cpf1, CRISPR-associated protein Cpf1, subtype
PREFRAN. This family is the long protein of a novel
CRISPR subtype, PREFRAN, which is most common in
Prevotella and Francisella, although widely distributed.
The PREFRAN type has Cas1, Cas2, and Cas4, but lacks the
helicase Cas3 and endonuclease Cas3-HD.
Length = 1287
Score = 31.0 bits (70), Expect = 2.3
Identities = 24/116 (20%), Positives = 43/116 (37%), Gaps = 9/116 (7%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQAD------SVGKEYDRLLKEHEKVQKVVTEQ 536
E ++ K K K EE++KK S KE+D + E ++ E
Sbjct: 386 EPRKGKSKKEKYEEKVKK-VVKKQDYLSIQFINDLLEKYIKEFDSKDRLFENIRDYFAEI 444
Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLK 592
+++ +Q D+ I +K E L+KE+ + + + D L
Sbjct: 445 SLMRNEDFQTNDN--IANIKTAYQTFERILQKERKDLKQDEEVVKLIKSLLDALKN 498
>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904). This
family consists of several bacterial and archaeal
hypothetical proteins of unknown function.
Length = 72
Score = 28.0 bits (63), Expect = 2.4
Identities = 12/63 (19%), Positives = 30/63 (47%), Gaps = 4/63 (6%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHE 527
E ++ LE +I++ E ++ + E+++ K + L + + + +E +L +E
Sbjct: 1 MSLELLEE-LES---KIQQAVETIALLQMEIEELKEENEQLSEENEELEEENKKLKEERN 56
Query: 528 KVQ 530
Q
Sbjct: 57 AWQ 59
Score = 27.3 bits (61), Expect = 5.9
Identities = 15/64 (23%), Positives = 30/64 (46%), Gaps = 8/64 (12%)
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
E ++ + K Q+ + I L+ ++ EELK+E L + + + +E +L +E
Sbjct: 4 ELLEELESKIQQA-VETIALLQMEI----EELKEENE---QLSEENEELEEENKKLKEER 55
Query: 595 EKVQ 598
Q
Sbjct: 56 NAWQ 59
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 30.7 bits (70), Expect = 2.4
Identities = 23/114 (20%), Positives = 44/114 (38%), Gaps = 15/114 (13%)
Query: 469 IFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEK 528
IF+ + RL L TEEE++ S+ + + E ++
Sbjct: 377 IFKHNP-------ELAARLAAFLELTEEEIEF------LTGSKKATKKIKKIVEKAEKKR 423
Query: 529 VQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE--KTNSAALKNQA 580
++ ++ K ++ + +E K KE+ + EEE +E + K QA
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQA 477
>gnl|CDD|225121 COG2211, MelB, Na+/melibiose symporter and related transporters
[Carbohydrate transport and metabolism].
Length = 467
Score = 30.7 bits (70), Expect = 2.5
Identities = 32/167 (19%), Positives = 57/167 (34%), Gaps = 15/167 (8%)
Query: 17 AFMLLLILPILSTQRLHKILKSKFVQGVKTQAGWYFGCILVILSLFFLDAIREMRKYASP 76
+L+ +L + K+ QG G I VIL LF ++E
Sbjct: 161 GGLLVAVLFPPLVKLFGGGDKALGYQGT----ALVLGVIGVILLLFCFFNVKER--VVET 214
Query: 77 EVKEEAHGHLDAEMQNNMKLFRAQRNFYI-SGFSLFLWL--VIRQIIQLIAQQANL-LAQ 132
+ + +++++ L R + +L L++ IR I + L +
Sbjct: 215 QPTKTGVKL---KLKDSFLLIFKNRPLLLLLLMNLLLFIAFNIRGSIMVYYVTYVLGDPE 271
Query: 133 NEASM-NQARQAAVAAQALLDGPRGKWFGSHVWAYLTPNLLVTRSYS 178
A + A A + +L K FG +L LL+ Y
Sbjct: 272 LFAYLLLLASGAGLLIGLILWPRLVKKFGKK-KLFLIGLLLLAVGYL 317
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 30.8 bits (70), Expect = 2.6
Identities = 21/131 (16%), Positives = 53/131 (40%), Gaps = 7/131 (5%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV--QKV 532
+ L+ D +++ L+ +L EL ++ A + + + L+ E +
Sbjct: 249 EKLQEDFEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEAR 308
Query: 533 VTEQGDKKDDKYQ-KGDSDEIKRLKEKLSKTE----EELKKEKTNSAALKNQADSVGKEY 587
+ + D E++ L+++ E E+L+ + +++++ + V
Sbjct: 309 DELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSELEEVEARL 368
Query: 588 DRLLKEHEKVQ 598
D L +H+ VQ
Sbjct: 369 DALTGKHQDVQ 379
Score = 28.9 bits (65), Expect = 9.9
Identities = 20/110 (18%), Positives = 47/110 (42%), Gaps = 3/110 (2%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDK 543
+RL E+ + +E + + L+ Q E ++ L + + +D+
Sbjct: 276 EERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQ 335
Query: 544 YQKGDSDEIKRLK---EKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
+ +I++L+ ++L EL++ + AL + V ++Y+RL
Sbjct: 336 KGAFEDADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERL 385
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 30.7 bits (70), Expect = 2.7
Identities = 38/159 (23%), Positives = 69/159 (43%), Gaps = 28/159 (17%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQAD-------------- 513
E+ + ++ LE EIK L+E++S+ E E+K+ + L+ +
Sbjct: 86 ELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYV 145
Query: 514 -----SVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY-----QKGDSDEIKRLKEKLSKTE 563
+V ++ LK E + V K Y K SDE++ +KL
Sbjct: 146 SVFVGTVPEDKLEELKL-ESDVENVEYISTDKGYVYVVVVVLKELSDEVEEELKKLGFER 204
Query: 564 EELKKEKTNSAAL---KNQADSVGKEYDRLLKEHEKVQK 599
EL++E T S + K + + + KE + LL+E +++ K
Sbjct: 205 LELEEEGTPSELIREIKEELEEIEKERESLLEELKELAK 243
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 30.3 bits (68), Expect = 2.7
Identities = 32/125 (25%), Positives = 54/125 (43%), Gaps = 13/125 (10%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E++K + E+ + E E ++ E+ K E ELKK T A D+ E +
Sbjct: 56 EAAKEKALEEEAKRKAEERKRETLKIVEEEVKKELELKKRNTLLEANIDDVDTDD-ENEE 114
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSK----TEEE-LKKEKTNSAAL 576
E K++ E K D+ + +E++R K ++ K TEEE + + N
Sbjct: 115 EEYEAWKLR----ELKRIKRDREE---REEMEREKAEIEKMRNMTEEERRAELRKNPKVS 167
Query: 577 KNQAD 581
N+A
Sbjct: 168 TNKAK 172
>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element.
Length = 370
Score = 30.4 bits (68), Expect = 2.8
Identities = 24/87 (27%), Positives = 38/87 (43%), Gaps = 3/87 (3%)
Query: 487 LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQK 546
K L EE KK+ +S L+ ++ GKE LKE E+ E + +K K
Sbjct: 41 EKSYLIMMEEGFKKDNYSS--LREDIETKGKEVQNFLKELEECITKQVEAHIENTEKCLK 98
Query: 547 GDSDEIKRLKEKLSKTEEELKKEKTNS 573
+ K+ +E L + +KKE +
Sbjct: 99 ELMELKKKARE-LKEECRSIKKEHMET 124
>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
[Transcription].
Length = 217
Score = 30.1 bits (68), Expect = 2.8
Identities = 23/111 (20%), Positives = 58/111 (52%), Gaps = 5/111 (4%)
Query: 495 EEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKR 554
E++L+ E++ + L + + + RLL+ + +++++ Q + QK ++ + R
Sbjct: 100 EDDLESERSEVSKLFKSPEGISELISRLLEINREIEELSRAQTE-----LQKQLNELMDR 154
Query: 555 LKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQG 605
+KE++ + ++ + LKN A V + R + + E+V +++ E+G
Sbjct: 155 IKEEIEDKDGDMTERIVLEYLLKNGAADVEETSRRTVLKIEEVLEILAEKG 205
>gnl|CDD|237255 PRK12900, secA, preprotein translocase subunit SecA; Reviewed.
Length = 1025
Score = 30.5 bits (69), Expect = 2.9
Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 4/67 (5%)
Query: 474 DKGLEGDSDEIK-RLKEKLSKTEE---ELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
D L E+K R++ L E+ +L+K+ N +A+S+ +E D L KE+E+
Sbjct: 39 DDALREKGMELKSRVRGALEPIEQKKKDLEKKLDNPDISLEEAESINEELDTLAKEYEEA 98
Query: 530 QKVVTEQ 536
E+
Sbjct: 99 TAAALEE 105
Score = 29.7 bits (67), Expect = 5.8
Identities = 31/125 (24%), Positives = 53/125 (42%), Gaps = 34/125 (27%)
Query: 489 EKL--SKTEEELKK-----EKTNS--AALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
EK+ SK E+++KK ++ N A+L S +D +E LK +V+ + K
Sbjct: 6 EKIFGSKHEKDIKKIQPIIDRINEIQASLASLSDDALREKGMELKS--RVRGALEPIEQK 63
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
K D +K D+ +I +A+S+ +E D L KE+E+
Sbjct: 64 KKDLEKKLDNPDI-----------------------SLEEAESINEELDTLAKEYEEATA 100
Query: 600 VVTEQ 604
E+
Sbjct: 101 AALEE 105
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 30.5 bits (69), Expect = 3.0
Identities = 25/96 (26%), Positives = 38/96 (39%), Gaps = 16/96 (16%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
KR +E +K + E +EK A K + +S +L K + E D DD
Sbjct: 44 KRQEEAEAKRKREELREKIAKAREKRERNS------KLGG-----IKTLGEDDDDDDD-- 90
Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
+K+ K++ K E E KK K +A
Sbjct: 91 ---TKAWLKKSKKRQKKKEAERKKALLLDEKEKERA 123
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 30.4 bits (69), Expect = 3.1
Identities = 26/113 (23%), Positives = 46/113 (40%), Gaps = 16/113 (14%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
E+K+L+ KL K E+EL++ + + L+ + + + + E+ K V D
Sbjct: 294 ELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLI----EEGLKSVR-----LAD 344
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHE 595
Y ++EIK +K E ++ LK DR L E +
Sbjct: 345 FYG---NEEIKIELDKSKTPSENAQRYFKKYKKLKGAK----VNLDRQLSELK 390
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 29.2 bits (66), Expect = 3.3
Identities = 18/90 (20%), Positives = 38/90 (42%), Gaps = 16/90 (17%)
Query: 482 DEIKRLKEKLSKTEE--ELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
+E K KE+ + EE +++E+ +E ++ LKE ++ K++ E+ D
Sbjct: 40 EEAKE-KEREERIEERKRIREERK-------------QELEKQLKERKEALKLLEEENDD 85
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKE 569
++D + D E + +E
Sbjct: 86 EEDAETEDTEDVEDDEWEGFPEPTVTDYEE 115
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 29.8 bits (67), Expect = 3.3
Identities = 30/121 (24%), Positives = 53/121 (43%), Gaps = 15/121 (12%)
Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKG 547
K K K E+E E A + +DS E + ++E + T +D
Sbjct: 66 KTKKKKKEKEEVPEL----AAEELSDSEENEENDKKVDYELPKVQNTAAEVNHEDVI--- 118
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKN-----QADSVGKEYDRLLKEHEKVQKVVT 602
D ++K E + K +LKK + +A+ QA S+ ++Y L+KE +K++ +V
Sbjct: 119 DLSDLKLATENIIK---DLKKINSRVSAVSTVLEDVQAASISRQYTSLIKEIDKLKDLVN 175
Query: 603 E 603
Sbjct: 176 L 176
>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal. This domain is the C-terminal
region of the CASP family of proteins. It is a Golgi
membrane protein which is thought to have a role in
vesicle transport.
Length = 245
Score = 30.1 bits (68), Expect = 3.4
Identities = 11/45 (24%), Positives = 29/45 (64%), Gaps = 3/45 (6%)
Query: 554 RLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
R +++ ++ EEEL+K+ ++L+ + +S+ + +L +EK++
Sbjct: 90 RFRKRNTELEEELRKQNQTISSLRRELESLKADNIKL---YEKIR 131
Score = 29.3 bits (66), Expect = 4.7
Identities = 11/45 (24%), Positives = 29/45 (64%), Gaps = 3/45 (6%)
Query: 486 RLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
R +++ ++ EEEL+K+ ++L+ + +S+ + +L +EK++
Sbjct: 90 RFRKRNTELEEELRKQNQTISSLRRELESLKADNIKL---YEKIR 131
>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase. BRE1 is an E3 ubiquitin
ligase that has been shown to act as a transcriptional
activator through direct activator interactions.
Length = 96
Score = 28.4 bits (64), Expect = 3.4
Identities = 29/106 (27%), Positives = 43/106 (40%), Gaps = 26/106 (24%)
Query: 497 ELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY-----QKGDSD- 550
E ++ +S ALK A+ E +K ++ E+ K D KY K D
Sbjct: 9 EKAFKELSSLALKKVAELTILE--------QKKLRLTAEK-AKADQKYFAAMRSKDALDA 59
Query: 551 EIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
E K+L +L+K+ E L Q + E R +KE EK
Sbjct: 60 ENKKLNTQLNKSSE-----------LIEQLKDLENELRRRIKELEK 94
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 29.7 bits (67), Expect = 3.5
Identities = 21/144 (14%), Positives = 57/144 (39%), Gaps = 27/144 (18%)
Query: 479 GDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQA-----------DSVGKEYDRLLKEHE 527
D + ++ L +K E EL + AL + + + + L + E
Sbjct: 30 DDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEELNQRWE 89
Query: 528 KVQKVVTEQGDKKDDKYQK-GDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE 586
+++++ E+ + ++ + L++ L + E L ++ +GK+
Sbjct: 90 ELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAAL------------ASEDLGKD 137
Query: 587 YD---RLLKEHEKVQKVVTEQGDK 607
+ LLK+H+++++ + +
Sbjct: 138 LESVEELLKKHKELEEELEAHEPR 161
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 30.3 bits (69), Expect = 3.5
Identities = 22/101 (21%), Positives = 48/101 (47%), Gaps = 19/101 (18%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSV---------GKEYDRLL----KEHEKV 529
E+++L++KL K E+EL++ + N+ + + + G + L E +
Sbjct: 304 ELEKLEKKLEKLEKELEEAE-NAENYRLYGELLTANLYAIKKGMKEVELPNYYTGEEITI 362
Query: 530 ----QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEEL 566
K +E K KY+K ++ ++E++ +T+EE+
Sbjct: 363 PLDPAKSPSENAQKYYKKYKKLKR-AVEAVEEQIEETKEEI 402
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 30.2 bits (68), Expect = 3.5
Identities = 36/171 (21%), Positives = 64/171 (37%), Gaps = 40/171 (23%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKT--NSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
LE +++ L KLS EE K +K L+ + +++ KE +RLL E + +++
Sbjct: 365 LETYKRQVQELHAKLS--EESKKADKLEFEYKRLEEKLEALQKEKERLLAERDSLRETNE 422
Query: 535 E---------QGDKKDDKYQKGD---SDEI--KRLKEKLSKTEEELK--------KEKTN 572
E Q + D + E+ +EKL + + E K E
Sbjct: 423 ELRCGQAQQDQLTQADAGTSPSGDNLAAELLPSEYREKLIRLQHENKMLRLGQEGSENER 482
Query: 573 SAALKNQADSVGKEYDRLLKEH--------------EKVQKVVTEQGDKKD 609
L+ + + + L + E +QK + EQG K +
Sbjct: 483 ITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSE 533
>gnl|CDD|215019 smart01090, Copper-fist, Copper fist is an N-terminal domain
involved in copper-dependent DNA binding. The domain is
named for its resemblance to a fist. It can be found in
some fungal transcription factors. These proteins
activate the transcription of the metallothionein gene
in response to copper. Metallothionein maintains copper
levels in yeast. The copper fist domain is similar in
structure to metallothionein itself, and on copper
binding undergoes a large conformational change, which
allows DNA binding.
Length = 38
Score = 26.9 bits (60), Expect = 3.6
Identities = 12/34 (35%), Positives = 16/34 (47%), Gaps = 10/34 (29%)
Query: 295 CDWCVDGHR---CTHDTAENCRNDILITGVSRVG 325
C+ C+ GHR CTH ND + V + G
Sbjct: 9 CEKCIRGHRSSTCTH-------NDRPLVKVRKRG 35
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.0 bits (68), Expect = 3.8
Identities = 16/119 (13%), Positives = 33/119 (27%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
+ K +E +K + D + D + + D DD
Sbjct: 88 AKAPAKKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDI 147
Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTE 603
D DE + ++E +++K K D + + + +K
Sbjct: 148 DDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKL 206
>gnl|CDD|221120 pfam11488, Lge1, Transcriptional regulatory protein LGE1. This
family of proteins is conserved from fungi to human. In
yeasts it is involved in the ubiquitination of histones
H2A and H2B. This ubiquitination step is a vital one in
the regulation of the transcriptional activity of RNA
polymerase II. In S. cerevisiae, Rad6 and Bre1 are
present in a complex, also containing Lge1, that is
required for H2B ubiquitination. Bre1 is the H2B
ubiquitin ligase that interacts with acidic activators,
such as Gal4, and recruits Rad6 and its binding partner
Lge1 to target promoters. In S. pombe the equivalent
protein to Lge1 appears to be Shf1.
Length = 75
Score = 27.7 bits (62), Expect = 3.8
Identities = 10/39 (25%), Positives = 14/39 (35%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK 500
E K LE +++ LK + K E EL
Sbjct: 12 SDETTQEKLRSRYKELESLDQKLEELKLQKLKLELELAL 50
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 29.4 bits (66), Expect = 3.9
Identities = 17/138 (12%), Positives = 53/138 (38%), Gaps = 15/138 (10%)
Query: 467 CEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALK-----SQADSVGKEYDR 521
E+FEQ + + + +K + L + E + ++S + + + ++ +
Sbjct: 20 SELFEQYRQLAD-RTRLLKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQKLAK 78
Query: 522 LLKEHEKVQK---------VVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
L +E ++ K ++ ++ + + I L+ +++ E E++ +
Sbjct: 79 LREELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAELRSEITSLETEIRDLREE 138
Query: 573 SAALKNQADSVGKEYDRL 590
+ +++ E L
Sbjct: 139 LQEKEKDNETLQDELISL 156
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 29.8 bits (67), Expect = 3.9
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 21/88 (23%)
Query: 491 LSKTEEELKKEKTNSAALKSQADSV------GKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
+S EEEL+K ++ LK + + GK RL K +EK + + +Q
Sbjct: 7 VSAWEEELRKAQS-LEELKQKYEEAQKQIADGKTLKRLYKVYEKREFELKKQ-------- 57
Query: 545 QKGDSDEIKRLKEKLSKTEEELKKEKTN 572
+ ++LK +LSK +++ KKEK +
Sbjct: 58 ------QFEQLKAELSKKKKKFKKEKVD 79
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 29.5 bits (67), Expect = 3.9
Identities = 16/73 (21%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQ-----ADSVGKEYDRL 522
E E+ K L+ + + +L+++L K + E K + + L+ Q ++ +YD+
Sbjct: 91 EELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKLRQQGGLLHVPALLYDYDKT 150
Query: 523 LKEHEKVQKVVTE 535
+++ E+ +K V +
Sbjct: 151 VEKVEEKRKSVEK 163
>gnl|CDD|197563 smart00187, INB, Integrin beta subunits (N-terminal portion of
extracellular region). Portion of beta integrins that
lies N-terminal to their EGF-like repeats. Integrins are
cell adhesion molecules that mediate cell-extracellular
matrix and cell-cell interactions. They contain both
alpha and beta subunits. Beta integrins are proposed to
have a von Willebrand factor type-A "insert" or "I"
-like domain (although this remains to be confirmed).
Length = 423
Score = 30.0 bits (68), Expect = 3.9
Identities = 13/35 (37%), Positives = 15/35 (42%), Gaps = 9/35 (25%)
Query: 284 SCTQCVSSDFPCDWCVD-------GHRCTHDTAEN 311
SC +C+ S C WC D RC DT N
Sbjct: 1 SCEECIQSGPNCAWCTDENFTSGGSARC--DTRAN 33
>gnl|CDD|147381 pfam05167, DUF711, Uncharacterized ACR (DUF711). The proteins in
this family are functionally uncharacterized. The
proteins are around 450 amino acids long. It is likely
that this family represents a group of
glycerol-3-phosphate dehydrogenases.
Length = 390
Score = 29.9 bits (68), Expect = 4.2
Identities = 19/101 (18%), Positives = 33/101 (32%), Gaps = 17/101 (16%)
Query: 478 EGDSDEIKRLKEKLSKTEEELKKEK---------------TNSAALKSQADSVGKEYDRL 522
E D+I E L + EE++ E + A S+ K D+
Sbjct: 20 EKIYDKITSKAENLVEVAEEIEAEYGIPIVTKRISVTPISLVAEASDDDYVSLAKALDKA 79
Query: 523 LKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTE 563
K+ + + +GD I + + L+ TE
Sbjct: 80 AKDVG--VDFIGGFSALVHKGFTRGDRALIDSIPDALASTE 118
>gnl|CDD|226684 COG4231, COG4231, Indolepyruvate ferredoxin oxidoreductase, alpha
and beta subunits [Energy production and conversion].
Length = 640
Score = 30.0 bits (68), Expect = 4.3
Identities = 13/47 (27%), Positives = 20/47 (42%), Gaps = 2/47 (4%)
Query: 119 IIQLIAQQANLLAQ--NEASMNQARQAAVAAQALLDGPRGKWFGSHV 163
+I+ +A+ +L E S+N+ AA A G R HV
Sbjct: 45 LIETLAKAKKILGDVYFEWSLNEKVALETAAGASYAGVRALVTMKHV 91
>gnl|CDD|235219 PRK04098, PRK04098, sec-independent translocase; Provisional.
Length = 158
Score = 28.9 bits (65), Expect = 4.4
Identities = 24/105 (22%), Positives = 42/105 (40%), Gaps = 8/105 (7%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKD 541
+E + K++ E LKK K L + E + + +K + E D
Sbjct: 61 EEALKYKKEFESAVESLKK-KLKFEELDDLKITAENEIKSIQDLLQDYKKSLEE--DTIP 117
Query: 542 DKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKE 586
+ + S+E K +E S ++E K+ K + KN+ KE
Sbjct: 118 NHLNEEVSNETKLTQE--SSSDESPKEVKLAT---KNKTKKHDKE 157
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 29.1 bits (66), Expect = 4.4
Identities = 13/81 (16%), Positives = 37/81 (45%), Gaps = 7/81 (8%)
Query: 519 YDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
++L E + + +K K E+++ +++L K E++L+K+ + +
Sbjct: 23 VQKVLSESPAGKAAQKQL-----EKEFKKLQAELQKKEKELQKEEQKLQKQA--ATLSEE 75
Query: 579 QADSVGKEYDRLLKEHEKVQK 599
+ +E + +E ++ Q+
Sbjct: 76 ARKAKQQELQQKQQELQQKQQ 96
Score = 28.0 bits (63), Expect = 9.4
Identities = 22/109 (20%), Positives = 53/109 (48%), Gaps = 9/109 (8%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
ES +Q +K + E+++ +++L K E++L+K+ AA S+
Sbjct: 29 ESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQ----AATLSEE-----ARKA 79
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
+E ++ Q+ + ++ + Q+ + ++ + +K+ K +E+ KEK
Sbjct: 80 KQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYDKIDKAIKEVAKEK 128
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 28.3 bits (64), Expect = 4.6
Identities = 23/116 (19%), Positives = 49/116 (42%), Gaps = 24/116 (20%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDK 543
IK ++E +E+L+K + + D LL+ E+ +++ E
Sbjct: 4 IKLIRENPEAVKEKLRKRGVDVLDV-----------DELLELDEERRELQVE-------- 44
Query: 544 YQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
+E++ + +LSK + KK+K ++ AL + + E L E +++
Sbjct: 45 -----LEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKDELKALEAELRELEA 95
>gnl|CDD|214826 smart00806, AIP3, Actin interacting protein 3. Aip3p/Bud6p is a
regulator of cell and cytoskeletal polarity in
Saccharomyces cerevisiae that was previously identified
as an actin-interacting protein. Actin-interacting
protein 3 (Aip3p) localizes at the cell cortex where
cytoskeleton assembly must be achieved to execute
polarized cell growth, and deletion of AIP3 causes gross
defects in cell and cytoskeletal polarity. Aip3p
localization is mediated by the secretory pathway,
mutations in early- or late-acting components of the
secretory apparatus lead to Aip3p mislocalization.
Length = 426
Score = 29.6 bits (67), Expect = 4.6
Identities = 22/144 (15%), Positives = 59/144 (40%), Gaps = 12/144 (8%)
Query: 468 EIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTN-SAALKSQADSVGKEYDRLLKEH 526
++ + LE + I+RLKE+ + + + + S L S + ++ +
Sbjct: 92 NELDEVKQALESQREAIQRLKERQQNSAANIARPAASPSPVLASSSSAISLANNPDKLNK 151
Query: 527 EKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA------ 580
E+ ++ + Q + + + +KE + E++ K K++S + +
Sbjct: 152 EQRAELKSLQRELAVLRQTH--NSFFTEIKESIKDILEKIDKFKSSSLSASGSSNRAYVE 209
Query: 581 ---DSVGKEYDRLLKEHEKVQKVV 601
+ ++ D LL + + +Q ++
Sbjct: 210 SSKKKLSEDSDSLLTKVDDLQDII 233
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 28.9 bits (65), Expect = 4.6
Identities = 19/119 (15%), Positives = 56/119 (47%), Gaps = 7/119 (5%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
ES + ++ + + E+++++++L + + + + +++ +D K
Sbjct: 38 ESPQAKKVSADLESEFKKRQKELQKMQKEL----KAKEAKLQDDGKMEALSD-RAKAEAE 92
Query: 522 LLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
+ KE V +Q + + D ++ +++E ++L EK+ + E + ++ S L + A
Sbjct: 93 IKKEK-LVNAFNKKQQEYEKDLNRR-EAEEEQKLLEKIQRAIESVAEKGGYSLVLDSNA 149
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 29.4 bits (66), Expect = 4.6
Identities = 20/95 (21%), Positives = 45/95 (47%), Gaps = 1/95 (1%)
Query: 475 KGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKS-QADSVGKEYDRLLKEHEKVQKVV 533
K ++ EI + E +E LK +K L + + + + ++ E+++++
Sbjct: 104 KKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELE 163
Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
E + +D+K E K K++L + +EEL++
Sbjct: 164 KELEELEDEKDLLEKLLEEKEKKKELEELKEELEE 198
Score = 29.1 bits (65), Expect = 6.1
Identities = 18/87 (20%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 481 SDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
+ +K KE L ++ + + +E ++ L+E E + ++ + ++K
Sbjct: 124 DELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEK 183
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELK 567
+ K E++ LKE+L + EEL+
Sbjct: 184 EKK------KELEELKEELEELLEELE 204
>gnl|CDD|202101 pfam02050, FliJ, Flagellar FliJ protein.
Length = 122
Score = 28.4 bits (64), Expect = 4.7
Identities = 12/34 (35%), Positives = 16/34 (47%)
Query: 116 IRQIIQLIAQQANLLAQNEASMNQARQAAVAAQA 149
I + + IAQQ L Q E + QAR+ A
Sbjct: 54 ISALDEAIAQQQQELEQAEKQVEQAREQWQEANQ 87
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 29.3 bits (66), Expect = 4.7
Identities = 28/141 (19%), Positives = 46/141 (32%), Gaps = 36/141 (25%)
Query: 474 DKGLEGDSDEIKR-LKEKLSKTEEELKKEKTNSAAL----KSQADSVGKEYDRLLKEHEK 528
G + + KR L+ +L + E K +K N A+ KS E D E ++
Sbjct: 45 QPGQGTEDELRKRDLRAELEEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDE 104
Query: 529 VQKVVTEQG-------DKKDDKYQKGDSDE-------------------IK--RLKEKLS 560
+++ E+ D D+ IK R +EK
Sbjct: 105 IKRKRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKER 164
Query: 561 KTEE---ELKKEKTNSAALKN 578
+ EE E +K + N
Sbjct: 165 EEEEKAAEEEKAREEEILTGN 185
>gnl|CDD|214017 cd12924, iSH2_PIK3R1, Inter-Src homology 2 (iSH2) helical domain of
Class IA Phosphoinositide 3-kinase Regulatory subunit 1,
PIK3R1, also called p85alpha. PI3Ks catalyze the
transfer of the gamma-phosphoryl group from ATP to the
3-hydroxyl of the inositol ring of
D-myo-phosphatidylinositol (PtdIns) or its derivatives.
They play an important role in a variety of fundamental
cellular processes, including cell motility, the Ras
pathway, vesicle trafficking and secretion, immune cell
activation and apoptosis. They are classified according
to their substrate specificity, regulation, and domain
structure. Class IA PI3Ks are heterodimers of a p110
catalytic (C) subunit and a p85-related regulatory (R)
subunit. The R subunit down-regulates PI3K basal
activity, stabilizes the C subunit, and plays a role in
the activation downstream of tyrosine kinases. All R
subunits contain two SH2 domains that flank an
intervening helical domain (iSH2), which binds to the
N-terminal adaptor-binding domain (ABD) of the catalytic
subunit. In addition, p85alpha, also called PIK3R1,
contains N-terminal SH3 and GAP domains. p85alpha carry
functions independent of its PI3K regulatory role. It
can independently stimulate signaling pathways involved
in cytoskeletal rearrangements. Insulin-sensitive
tissues express splice variants of the PIK3R1 gene,
p50alpha and p55alpha, which may play important roles in
insulin signaling during lipid and glucose metabolism.
Mice deficient with PIK3R1 die perinatally, indicating
its importance in development.
Length = 161
Score = 28.9 bits (64), Expect = 4.7
Identities = 30/143 (20%), Positives = 70/143 (48%), Gaps = 24/143 (16%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
+ S E RL E+ ++T +E++ ++T A+++ +++ K ++ + E+ K E+
Sbjct: 17 FQEKSREYDRLYEEYTRTSQEIQMKRT---AIEAFNETI-KIFEEQCQTQERYSKEYIEK 72
Query: 537 GDKKDDKYQKGDSDEIKR-------LKEKLS-------KTEEELKKEKTNSAALKNQADS 582
++ G+ EI+R LK ++S + EE+LKK+ + + +S
Sbjct: 73 FKRE------GNEKEIQRIMHNYEKLKSRISEIVDSRRRLEEDLKKQAAEYREIDKRMNS 126
Query: 583 VGKEYDRLLKEHEKVQKVVTEQG 605
+ + +L K ++ +T++G
Sbjct: 127 IKPDLIQLRKTRDQYLMWLTQKG 149
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 28.4 bits (64), Expect = 4.9
Identities = 20/97 (20%), Positives = 41/97 (42%), Gaps = 15/97 (15%)
Query: 492 SKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDE 551
EE+ +K A+ + K+ + E EK+ K+ E+ + E
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEY----------E 50
Query: 552 IKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD 588
+++L+++L + E EL + + LK +A + E
Sbjct: 51 LEKLEKELEELEAELARRE-----LKAEAKKMLSEKG 82
>gnl|CDD|193253 pfam12777, MT, Microtubule-binding stalk of dynein motor. the 380
kDa motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules,
of which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This family is the region between D4 and D5 and is the
two predicted alpha-helical coiled coil segments that
form the stalk supporting the ATP-sensitive microtubule
binding component.
Length = 344
Score = 29.6 bits (66), Expect = 5.0
Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 8/90 (8%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
L+ + ++ LK KL+ E ELK++ ++ L VG E +++ KE K + +Q
Sbjct: 10 LQSTASQVDDLKAKLAIQEVELKQKNEDADKL---IQVVGVETEKVSKE-----KAIADQ 61
Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEEL 566
+ K K +++ K +E L+K E L
Sbjct: 62 EEVKVAVINKEVTEKQKDCEEDLAKAEPAL 91
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 28.7 bits (65), Expect = 5.2
Identities = 26/141 (18%), Positives = 55/141 (39%), Gaps = 32/141 (22%)
Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQK-VVTEQGDKKDDKYQK 546
KE+ + E+E + + L+ + + + RLL E E +K E+ + K +K
Sbjct: 3 KEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRKRTEREREEAKKYAIEK 62
Query: 547 -------------------GDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA----DSV 583
+++++K L E + ++L K L+ +
Sbjct: 63 FAKDLLPVLDNLERALAAAKENEDVKSLVEGVEMILKQLLK------VLEKHGVEKIEPE 116
Query: 584 GKEYDRLLKEHEKVQKVVTEQ 604
G+ +D HE V +V +++
Sbjct: 117 GEPFDP--NLHEAVSQVPSDE 135
>gnl|CDD|214019 cd12926, iSH2_PIK3R2, Inter-Src homology 2 (iSH2) helical domain of
Class IA Phosphoinositide 3-kinase Regulatory subunit 2,
PIK3R2, also called p85beta. PI3Ks catalyze the
transfer of the gamma-phosphoryl group from ATP to the
3-hydroxyl of the inositol ring of
D-myo-phosphatidylinositol (PtdIns) or its derivatives.
They play an important role in a variety of fundamental
cellular processes, including cell motility, the Ras
pathway, vesicle trafficking and secretion, immune cell
activation, and apoptosis. They are classified according
to their substrate specificity, regulation, and domain
structure. Class IA PI3Ks are heterodimers of a p110
catalytic (C) subunit and a p85-related regulatory (R)
subunit. The R subunit down-regulates PI3K basal
activity, stabilizes the C subunit, and plays a role in
the activation downstream of tyrosine kinases. All R
subunits contain two SH2 domains that flank an
intervening helical domain (iSH2), which binds to the
N-terminal adaptor-binding domain (ABD) of the catalytic
subunit. p85beta, also called PIK3R2, contains
N-terminal SH3 and GAP domains. It is expressed
ubiquitously but at lower levels than p85alpha. Its
expression is increased in breast and colon cancer,
correlates with tumor progression, and enhanced
invasion. During viral infection, the viral
nonstructural (NS1) protein binds p85beta specifically,
which leads to PI3K activation and the promotion of
viral replication. Mice deficient with PIK3R2 develop
normally and exhibit moderate metabolic and
immunological defects.
Length = 161
Score = 28.9 bits (64), Expect = 5.3
Identities = 20/92 (21%), Positives = 42/92 (45%), Gaps = 9/92 (9%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKK- 540
+E ++ E+ +T+E+ KE + + + KE R+L E+++ + E + +
Sbjct: 49 NETIKIFEEQGQTQEKCSKEYLE----RFRREGNEKEMQRILLNSERLKSRIAEIHESRT 104
Query: 541 ----DDKYQKGDSDEIKRLKEKLSKTEEELKK 568
D + Q D+ EI + L +L+K
Sbjct: 105 KLEQDLRAQASDNREIDKRMNSLKPDLMQLRK 136
>gnl|CDD|178593 PLN03021, PLN03021, Low-temperature-induced protein; Provisional.
Length = 619
Score = 29.6 bits (65), Expect = 5.7
Identities = 13/43 (30%), Positives = 26/43 (60%)
Query: 526 HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
HE+ ++ + ++D+ ++KG S +K++KEK K + L K
Sbjct: 11 HEQAEEPIRIHHPEEDEHHEKGASKVLKKVKEKAKKIKNSLTK 53
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 29.6 bits (66), Expect = 5.8
Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 2/63 (3%)
Query: 464 SKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKE-KTNSAALKSQADSVGKEYDRL 522
+R E Q K ++ + ++RL+E++ + KKE + + ++ D + EY
Sbjct: 772 IERIEHRRQLQKRIDAVNARLRRLREEIIGRMSDAKKEDTAALSEVGAELDDIP-EYLAR 830
Query: 523 LKE 525
L+
Sbjct: 831 LQT 833
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 29.6 bits (66), Expect = 5.8
Identities = 24/101 (23%), Positives = 49/101 (48%), Gaps = 6/101 (5%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
E +RL E + E EL++ + + A+ +Q ++ +E +++ +T+Q +
Sbjct: 515 ERQRLAEVAQQLERELQESQESLASAGAQLEAARAGQQESTEEAASLRQELTQQQEVYGQ 574
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSV 583
Q+ ++ RL+E+LS TE L + A + A +V
Sbjct: 575 ALQEKVAEVETRLREQLSDTERRL------NEARREHAKAV 609
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 29.0 bits (65), Expect = 5.8
Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 19/112 (16%)
Query: 488 KEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKG 547
K EE+ KE N + + GKE D +E EK + + E ++ ++ +
Sbjct: 7 DAKHENMEEDCCKENEN------KEEDKGKEEDLEFEEIEKEEII--EDSEESNEVKIEE 58
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYD----RLLKEHE 595
DE +LKE+ K E EL+ ALK++ EYD R KE E
Sbjct: 59 LKDENNKLKEENKKLENELE-------ALKDRLLRTVAEYDNYRKRTAKEKE 103
>gnl|CDD|145573 pfam02507, PSI_PsaF, Photosystem I reaction centre subunit III.
Photosystem I (PSI) is an integral membrane protein
complex that uses light energy to mediate electron
transfer from plastocyanin to ferredoxin. Subunit III
(or PSI-F) is one of at least 14 different subunits that
compose the PSI complex.
Length = 159
Score = 28.6 bits (64), Expect = 6.0
Identities = 17/40 (42%), Positives = 23/40 (57%), Gaps = 3/40 (7%)
Query: 485 KRLKEKLSKTEEELKKEKTNSA---ALKSQADSVGKEYDR 521
KRLK + K E LKK NSA AL++Q + K ++R
Sbjct: 21 KRLKNSVKKLESRLKKYDPNSAPALALQAQIEKTKKRFER 60
Score = 28.2 bits (63), Expect = 7.8
Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 3/40 (7%)
Query: 553 KRLKEKLSKTEEELKKEKTNSA---ALKNQADSVGKEYDR 589
KRLK + K E LKK NSA AL+ Q + K ++R
Sbjct: 21 KRLKNSVKKLESRLKKYDPNSAPALALQAQIEKTKKRFER 60
>gnl|CDD|176511 cd08568, GDPD_TmGDE_like, Glycerophosphodiester phosphodiesterase
domain of Thermotoga maritime and similar proteins.
This subfamily corresponds to the glycerophosphodiester
phosphodiesterase domain (GDPD) present in Thermotoga
maritime glycerophosphodiester phosphodiesterase (TmGDE,
EC 3.1.4.46) and its uncharacterized homologs. Members
in this family show high sequence similarity to
Escherichia coli GP-GDE, which catalyzes the degradation
of glycerophosphodiesters to produce
sn-glycerol-3-phosphate (G3P) and the corresponding
alcohols. TmGDE exists as a monomer that might be the
biologically relevant form.
Length = 226
Score = 29.2 bits (66), Expect = 6.0
Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 10/68 (14%)
Query: 57 VILSLFFLDAIREMRKYASPEVK------EEAHGHLDAEMQNNMKLFRAQ---RNFYISG 107
VI S F DA+RE+RK P+ K EE G E+ +KL+ G
Sbjct: 121 VIFSSFNHDALRELRKL-DPDAKVGLLIGEEEEGFSIPELHEKLKLYSLHVPIDAIGYIG 179
Query: 108 FSLFLWLV 115
F F+ L+
Sbjct: 180 FEKFVELL 187
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 29.6 bits (67), Expect = 6.1
Identities = 14/81 (17%), Positives = 35/81 (43%), Gaps = 3/81 (3%)
Query: 507 ALKSQADSVGKEYDRLLKEHEKVQKV---VTEQGDKKDDKYQKGDSDEIKRLKEKLSKTE 563
L+ + + E + L E+E++QK + +Q + + + EI++LK + + +
Sbjct: 70 ELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQLQ 129
Query: 564 EELKKEKTNSAALKNQADSVG 584
+ + + A + G
Sbjct: 130 GLIDQLQRRLAGVLTGPSGGG 150
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 28.0 bits (62), Expect = 6.1
Identities = 14/72 (19%), Positives = 29/72 (40%), Gaps = 3/72 (4%)
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
D K D + + ++K KE + + E + +N A E + +E E
Sbjct: 2 DTKVDAAAELSAKDLKEKKEVVEEKENGKNAPANGNENEENGAQEGDDEME---EEEEVD 58
Query: 598 QKVVTEQGDKKD 609
+ E+G+ ++
Sbjct: 59 EDDEEEEGEGEE 70
>gnl|CDD|227468 COG5139, COG5139, Uncharacterized conserved protein [Function
unknown].
Length = 397
Score = 29.3 bits (65), Expect = 6.2
Identities = 19/81 (23%), Positives = 32/81 (39%), Gaps = 7/81 (8%)
Query: 480 DSDEIKRLK-EKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGD 538
S KR + L T E + + ++Q+ +G DR LK +
Sbjct: 91 MSLLRKRKNDQSLQPTREPMDSRDSGQDFTEAQSGELGDTGDRQLK-----APAASRARR 145
Query: 539 KKDDKYQKGDSDEIKRLKEKL 559
K+D Q D + RLK+++
Sbjct: 146 KEDLLEQTVD-EISLRLKKRM 165
>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
and chromosome partitioning].
Length = 212
Score = 28.9 bits (65), Expect = 6.2
Identities = 24/102 (23%), Positives = 41/102 (40%), Gaps = 26/102 (25%)
Query: 513 DSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTN 572
D V +Y++LL E+E ++ DEI LKE+L E ++
Sbjct: 33 DDVIDDYEQLLDENEDLE-------------------DEIDELKEELK--EAADAEDSQA 71
Query: 573 SAALKNQADSV----GKEYDRLLKE-HEKVQKVVTEQGDKKD 609
+ +A+ + E D +LK + Q+V + K D
Sbjct: 72 IQQAETEAEELKQAAEAEADDILKRASAQAQRVFGKAQYKAD 113
>gnl|CDD|217479 pfam03297, Ribosomal_S25, S25 ribosomal protein.
Length = 105
Score = 27.7 bits (62), Expect = 6.4
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 9/65 (13%)
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKV 597
+K S K K+K SK + K+K N+A L ++A YD+LLKE
Sbjct: 6 QQKKAAKAAAASAGGKAKKKKWSKGKV---KDKLNNAVLFDKA-----TYDKLLKEVPT- 56
Query: 598 QKVVT 602
K++T
Sbjct: 57 YKLIT 61
>gnl|CDD|234096 TIGR03031, cas_csx12, CRISPR-associated protein Cas9/Csx12, subtype
II-B/NMENI. Members of this family of CRISPR-associated
(cas) protein are found, so far, in CRISPR/cas loci in
Wolinella succinogenes DSM 1740, Legionella pneumophila
str. Paris, and Francisella tularensis, where the last
probably is an example of a degenerate CRISPR locus,
having neither repeats nor a functional Cas1. The
characteristic repeat length is 37 base pairs and period
is about 72. One region of this large protein shows
sequence similarity to pfam01844, HNH endonuclease
[Mobile and extrachromosomal element functions, Other].
Length = 802
Score = 29.6 bits (66), Expect = 6.5
Identities = 22/96 (22%), Positives = 33/96 (34%), Gaps = 4/96 (4%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKV--VTEQG 537
D+ EI LKE L + ++ L+ S +Y E + V E+
Sbjct: 111 DATEINLLKELLPFKLRSISEDNFIDWFLQKIQSSA--DYLANYSEEFRKILVSKTEEKK 168
Query: 538 DKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
D K+ K D IK + +E L N
Sbjct: 169 DDKELKNAVKDKYNIKEFLTGATINDEILGTFDKNE 204
>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil. Members of
this family are predominantly found in the
actin-bundling protein Cortexillin I from Dictyostelium
discoideum. They adopt a structure consisting of an
18-heptad-repeat alpha-helical coiled-coil, and are a
prerequisite for the assembly of Cortexillin I.
Length = 107
Score = 27.6 bits (61), Expect = 6.5
Identities = 22/85 (25%), Positives = 36/85 (42%), Gaps = 7/85 (8%)
Query: 487 LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQK 546
L KL+ E L+ EK + L Q D + L + + ++ + E K D+ +
Sbjct: 14 LANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQRLRELQAKIDELLKN 73
Query: 547 GDSDEIKRLKEKLSKTEEELKKEKT 571
+ EKL+K E E + KT
Sbjct: 74 LE-------LEKLAKMELESRLSKT 91
>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
(trigger factor) [Posttranslational modification,
protein turnover, chaperones].
Length = 441
Score = 29.2 bits (66), Expect = 6.8
Identities = 33/122 (27%), Positives = 49/122 (40%), Gaps = 33/122 (27%)
Query: 479 GDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADS--------------VGKEYDRLLK 524
G+ D ++ LKEKL K E KE T K Q V E D LLK
Sbjct: 257 GEEDTLEELKEKLRKNLERELKEATL-EKRKEQLLDALVEANDFDLPESLVEAEIDNLLK 315
Query: 525 E------HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLK-----------EKLSKTEEELK 567
+ + + + G+ +++ ++ + KR+K EKL TEEE+K
Sbjct: 316 QALQQLQQQGID-SLEASGESEEELREEFKEEAEKRVKLGLLLEEIAKEEKLEVTEEEIK 374
Query: 568 KE 569
E
Sbjct: 375 AE 376
>gnl|CDD|219445 pfam07517, SecA_DEAD, SecA DEAD-like domain. SecA protein binds to
the plasma membrane where it interacts with proOmpA to
support translocation of proOmpA through the membrane.
SecA protein achieves this translocation, in association
with SecY protein, in an ATP dependent manner. This
domain represents the N-terminal ATP-dependent helicase
domain, which is related to the pfam00270.
Length = 381
Score = 29.0 bits (66), Expect = 6.8
Identities = 12/45 (26%), Positives = 25/45 (55%), Gaps = 5/45 (11%)
Query: 550 DEIKRLKEKLSK-TEEELKKEKTNSAALKNQADSVGKEYDRLLKE 593
D+I L+E+ ++EEL+ + + L+ + + G+ D +L E
Sbjct: 21 DQINALEEEYKALSDEELRAK---TQELRKRLAN-GETLDDILPE 61
>gnl|CDD|214369 CHL00118, atpG, ATP synthase CF0 B' subunit; Validated.
Length = 156
Score = 28.4 bits (64), Expect = 6.9
Identities = 23/115 (20%), Positives = 63/115 (54%), Gaps = 11/115 (9%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR-LLKEHEKVQKVVTEQGDKKDDK 543
K L + L + +E ++K T ++ + ++A+ + K+Y++ L K ++ Q +T + +
Sbjct: 45 KPLLKVLDERKEYIRKNLTKASEILAKANELTKQYEQELSKARKEAQLEIT----QSQKE 100
Query: 544 YQKGDSDEIKRLKEK----LSKTEEELKKEK-TNSAALKNQADSVGKE-YDRLLK 592
++ +E+K+ ++ L++ ++L+ +K +L+ Q D++ + ++LL
Sbjct: 101 AKEIVENELKQAQKYIDSLLNEATKQLEAQKEKALKSLEEQVDTLSDQIEEKLLI 155
>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
transferase subunit alpha; Provisional.
Length = 762
Score = 29.4 bits (66), Expect = 7.1
Identities = 31/136 (22%), Positives = 55/136 (40%), Gaps = 23/136 (16%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E S+ + E+ K + ++ K+ KE + + E + K E + S A S + D
Sbjct: 542 EFSRAKALSEKKSKAEKLKAEINKKFKEVMDRPEIKEKMEALKAEVASSGASSGDELDDD 601
Query: 522 LLKEHEKVQK----------------VVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEE 565
L ++ EK++K V+ KD Q + L+EK+ EE
Sbjct: 602 LKEKVEKMKKEIELELAGVLKSMGLEVIGVTKKNKDTAEQTPPPN----LQEKIESLNEE 657
Query: 566 LKKE---KTNSAALKN 578
+ K+ S+ LK+
Sbjct: 658 INKKIERVIRSSDLKS 673
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 29.1 bits (66), Expect = 7.2
Identities = 22/125 (17%), Positives = 55/125 (44%), Gaps = 15/125 (12%)
Query: 480 DSDEIKRLKEKL-----SKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVT 534
+S+E++ L EKL +K + +EK A + + + + D ++ E E+++ +
Sbjct: 2 NSEELRELNEKLRAAKVNKERDAQIEEKKRIKAEEKEEE---RRIDEMM-EEERLKALAE 57
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
E+ ++ K ++ + + L + EE +K + + Q E ++E
Sbjct: 58 EEERERKRKEER------REGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEE 111
Query: 595 EKVQK 599
++ +
Sbjct: 112 DEAEA 116
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 28.0 bits (63), Expect = 7.3
Identities = 20/74 (27%), Positives = 36/74 (48%), Gaps = 12/74 (16%)
Query: 506 AALKSQADSVGKEYDRLLKEHEKVQK-----------VVTEQGDKKDDKYQKGDSDEIKR 554
AL+ + D+ ++YD+ +E ++ K VT G K+ + + E +R
Sbjct: 17 EALQEEVDTYMEKYDKREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERR 76
Query: 555 -LKEKLSKTEEELK 567
LKEK K ++EL+
Sbjct: 77 KLKEKKKKKKKELE 90
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 29.5 bits (67), Expect = 7.5
Identities = 24/135 (17%), Positives = 57/135 (42%)
Query: 470 FEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKV 529
+E + L +SD ++L+++L + E+E + + ++Q + L ++
Sbjct: 973 YEDAAEMLAKNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAK 1032
Query: 530 QKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDR 589
++++ E + D DS +R + + + L ++ L+ Q E D
Sbjct: 1033 RQMLQELKQELQDLGVPADSGAEERARARRDELHARLSANRSRRNQLEKQLTFCEAEMDN 1092
Query: 590 LLKEHEKVQKVVTEQ 604
L K+ K+++ E
Sbjct: 1093 LTKKLRKLERDYHEM 1107
>gnl|CDD|240345 PTZ00287, PTZ00287, 6-phosphofructokinase; Provisional.
Length = 1419
Score = 29.2 bits (65), Expect = 7.6
Identities = 27/91 (29%), Positives = 42/91 (46%), Gaps = 6/91 (6%)
Query: 506 AALKSQADSVGKEYDRLLKEHEKVQKVVT--EQGDKKDDKYQKGDSDEIKRLKEKLSKTE 563
A K AD + K LL+++E + +Q K K DS+ I L +KL+ +
Sbjct: 10 EAKKPSADGLVKTMSVLLRDNECAFNIEENEDQNSKDQLPCDKKDSEMIDCLMQKLTSKK 69
Query: 564 EELKKEKTNSAALKNQADSVGKEYDRLLKEH 594
++KE NS L + + K+ LKEH
Sbjct: 70 FLVEKETKNSFYLVDNENMKIKK----LKEH 96
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 29.1 bits (65), Expect = 8.0
Identities = 30/110 (27%), Positives = 52/110 (47%), Gaps = 17/110 (15%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQAD-------SVGKEYDRLLKEHEKVQKVVT 534
I+RL++ + K + EL++ + N + L++Q S+ + +L KE++ +Q +
Sbjct: 406 SAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLN 465
Query: 535 EQGDKKDDKYQKGDSDEIKRLK---------EKLSKTEEELKKEKTNSAA 575
K Q S E KRLK EK E++ KKE+ +AA
Sbjct: 466 SMVSAKQKDKQSMQSME-KRLKSEADSRVNAEKQLAEEKKRKKEEEETAA 514
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 28.8 bits (65), Expect = 8.1
Identities = 22/127 (17%), Positives = 42/127 (33%), Gaps = 28/127 (22%)
Query: 487 LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEH------------EKVQKVVT 534
L +K+ + + L K K S + + + LL+ + ++++
Sbjct: 154 LLKKVERLVKYLPKSKEESLVRSALELDIPDDILNLLRLRYACDLLCSYLPPDLYKELLK 213
Query: 535 EQG---DKKDDKYQKGDSDE-----------IKRLKEKLSKTEEELKKEKTNSAALKN-- 578
K DKY K + R ++K EE KK+ S +K
Sbjct: 214 SLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALK 273
Query: 579 QADSVGK 585
+ + G
Sbjct: 274 KVVAKGM 280
>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Very large protein
containing McrA/HNH-nuclease related domain and a
RuvC-like nuclease domain; signature gene for type II.
Length = 799
Score = 29.3 bits (66), Expect = 8.1
Identities = 23/105 (21%), Positives = 43/105 (40%), Gaps = 14/105 (13%)
Query: 482 DEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVV-------- 533
+ + + K + +K E N ++ + K + L+K++ K+V
Sbjct: 469 SDELKFLPIIKKAQ--VKDEILNPVVKRALLQAR-KVVNELVKKYGPPDKIVIEMARENG 525
Query: 534 TEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKN 578
T +G K K QK + D I KE S E++LK+ + +
Sbjct: 526 TNKGTKNRKKRQKKNEDNI---KEAASALEQKLKELPLDIKSKNI 567
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome partitioning].
Length = 1480
Score = 29.1 bits (65), Expect = 8.3
Identities = 22/117 (18%), Positives = 53/117 (45%), Gaps = 4/117 (3%)
Query: 487 LKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE----HEKVQKVVTEQGDKKDD 542
L EKL + E+ + E+T + Q + +Y+++L ++ ++++ E + D
Sbjct: 985 LNEKLRQRLEQAEAERTRAREQLRQHQAQLSQYNQVLASLKSSYDTKKELLNELQQELQD 1044
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQK 599
+ DS +R + + + +L ++ L+ Q E D L ++ K+++
Sbjct: 1045 IGVRADSGAEERARIRRDELHAQLSTNRSRRNQLEKQLTFCEAEMDNLTRKLRKLER 1101
>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus. This family contains the N
terminal region of several LUC7 protein homologues and
only contains eukaryotic proteins. LUC7 has been shown
to be a U1 snRNA associated protein with a role in
splice site recognition. The family also contains human
and mouse LUC7 like (LUC7L) proteins and human cisplatin
resistance-associated overexpressed protein (CROP).
Length = 252
Score = 28.5 bits (64), Expect = 8.3
Identities = 29/106 (27%), Positives = 44/106 (41%), Gaps = 28/106 (26%)
Query: 467 CEIF-EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKE 525
E F CD+ +I + K++L T+EE K +S A + + +E +LL E
Sbjct: 88 LERFVHDCDR-------KIDKAKQRLELTQEEQTKIAADSKAE--ELAELDEEIGKLLAE 138
Query: 526 HEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKT 571
E + + +G DE KL K EELK +K
Sbjct: 139 AEALGE--------------EGKVDE----AMKLMKEVEELKAKKK 166
>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
Provisional.
Length = 593
Score = 29.2 bits (65), Expect = 8.3
Identities = 22/125 (17%), Positives = 40/125 (32%), Gaps = 7/125 (5%)
Query: 491 LSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSD 550
LS+ E L + + K V KE+ L E ++ + K D D+
Sbjct: 101 LSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAA 160
Query: 551 EIK------RLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQKVVTEQ 604
E K +L+ L + + + +A + D+ K +
Sbjct: 161 EKKLTQAQNKLQS-LDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAK 219
Query: 605 GDKKD 609
+K D
Sbjct: 220 AEKAD 224
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 28.9 bits (65), Expect = 8.3
Identities = 27/115 (23%), Positives = 47/115 (40%), Gaps = 12/115 (10%)
Query: 490 KLSKTEEELKKEKTNSAALKSQADSVGKEYDRL--LKEHEKVQKVVTEQGDKKDDKYQKG 547
KL + +L KE LK++ SV + ++L L + K E+ K ++ K
Sbjct: 5 KLLRNNPDLVKES-----LKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQ 59
Query: 548 DSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKE-----HEKV 597
+ K+K+ + ++ELK+ K L ++ E L HE V
Sbjct: 60 IGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSIPNIPHESV 114
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 29.1 bits (66), Expect = 8.5
Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 6/97 (6%)
Query: 480 DSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDK 539
E++ L +++ + E+ELK+ K + + KE K+ K +K +G K
Sbjct: 367 YDTELEVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKE-----KKKNKKKKYKVPRG-K 420
Query: 540 KDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAAL 576
++ +++ LK + E LK ++ + AL
Sbjct: 421 IYKVLKEATRQDLELLKNVWGEILESLKAQRKSLRAL 457
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that assembles
onto the CENP-A-nucleosome distal (CAD) centromere. The
centromere, which is the basic element of chromosome
inheritance, is epigenetically determined in mammals.
CENP-A, the centromere-specific histone H3 variant,
assembles an array of nucleosomes and it is this that
seems to be the prime candidate for specifying
centromere identity. CENP-A nucleosomes directly recruit
a proximal CENP-A nucleosome associated complex (NAC)
comprised of CENP-M, CENP-N and CENP-T, CENP-U(50),
CENP-C and CENP-H. Assembly of the CENP-A NAC at
centromeres is dependent on CENP-M, CENP-N and CENP-T.
Additionally, there are seven other subunits which make
up the CENP-A-nucleosome distal (CAD) centromere,
CENP-K, CENP-L, CENP-O, CENP-P, CENP-Q, CENP-R and
CENP-S, also assembling on the CENP-A NAC. Fta7 is the
equivalent component of the fission yeast Sim4 complex.
Length = 159
Score = 28.1 bits (63), Expect = 8.6
Identities = 24/114 (21%), Positives = 49/114 (42%), Gaps = 12/114 (10%)
Query: 477 LEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQ 536
LE D + ++ L++ E E ++E N + +S +L+E+E +Q + E
Sbjct: 57 LEKDLEYLQELEKNAKALEREREEESKNLHPVLRLLES------EVLEENELLQDSLLEL 110
Query: 537 GDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRL 590
++ E+ L ++L+K E L + N L +++ + Y L
Sbjct: 111 SERNFS---PNLDPELLDLLKQLNKHLESL---QGNLEQLAGLVEALERLYAAL 158
>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A.
Length = 427
Score = 29.0 bits (66), Expect = 8.6
Identities = 26/120 (21%), Positives = 47/120 (39%), Gaps = 37/120 (30%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRL---KEKLSKTEEELKKEKTNSAALKSQADSV--- 515
+ +R I E K L D + + L K ++EL +E + + QAD +
Sbjct: 331 KLEERLHILEGLLKALN-KIDFVIEVIRGSIDLKKAKKELIEELS-----EIQADYLLDM 384
Query: 516 ------GKEYDRLLKEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKE 569
+E ++L KE E+++K EI L++ L+ ++ KE
Sbjct: 385 RLRRLTKEEIEKLEKEIEELEK-------------------EIAELEKILASEKKLWIKE 425
>gnl|CDD|145972 pfam03112, DUF244, Uncharacterized protein family (ORF7) DUF.
Several members of this family are Borrelia burgdorferi
plasmid proteins of uncharacterized function.
Length = 158
Score = 28.2 bits (63), Expect = 8.6
Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 1/50 (2%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKSQADSVGKEY 519
EQ LE EI LK +L+K + E KK EK LK + D + +Y
Sbjct: 66 EQSALNLECSLFEIDNLKTELNKIQNENKKEEKPIKDLLKIKIDKIMIKY 115
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 28.9 bits (65), Expect = 8.8
Identities = 9/87 (10%), Positives = 37/87 (42%), Gaps = 8/87 (9%)
Query: 485 KRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDDKY 544
++ +++ E + + + ++ + ++V +Y L H+ ++ ++K
Sbjct: 861 HQVDNMVTEYEGNIGDTQNS---IRGEENTVKGQYSELQNHHKTEAL--SQNNKYNEEKS 915
Query: 545 QKG---DSDEIKRLKEKLSKTEEELKK 568
+ +D + L ++ + +++ K
Sbjct: 916 AQERMPGADSPEELMKRAKEYQDKHKG 942
>gnl|CDD|241543 cd13392, PH_AKAP13, A-kinase anchoring protein 13 Pleckstrin
homology (PH) domain. The Rho-specific GEF activity of
AKAP13 (also called Brx-1, AKAP-Lbc, and proto-Lbc)
mediates signaling downstream of G-protein coupled
receptors and Toll-like receptor 2. It plays a role in
cell growth, cell development and actin fiber formation.
Protein kinase A (PKA) binds and phosphorylates AKAP13,
regulating its Rho-GEF activity. Alternative splicing of
this gene in humans has at least 3 transcript variants
encoding different isoforms (i.e. proto-/onco-Lymphoid
blast crisis, Lbc and breast cancer nuclear
receptor-binding auxiliary protein, Brx) containing a
dbl oncogene homology (DH) domain and PH domain which
are required for full transforming activity. The DH
domain is associated with guanine nucleotide exchange
activation while the PH domain has multiple functions
including determine protein sub-cellular localisation
via phosphoinositide interactions, while others bind
protein partners. Other ligands include protein kinase C
which is bound by the PH domain of AKAP13, serving to
activate protein kinase D and mobilize a cardiac
hypertrophy signaling pathway. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 157
Score = 28.0 bits (62), Expect = 8.9
Identities = 24/82 (29%), Positives = 35/82 (42%), Gaps = 8/82 (9%)
Query: 535 EQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK---TNSAALKNQADSVGKEYDRLL 591
E+ + + Y + DS I R+K E+LK+ K +LKN A + LL
Sbjct: 19 EKKKRLKEIYGRTDSKSIMRMKSGQMFAREDLKRRKLVRDGPLSLKNTAGRLKDVQAVLL 78
Query: 592 KE-----HEKVQKVVTEQGDKK 608
+ EK QK V D+K
Sbjct: 79 SDILVFLQEKDQKYVFASLDQK 100
>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
Length = 900
Score = 29.0 bits (66), Expect = 8.9
Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 22/94 (23%)
Query: 471 EQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
EQ K +E +E K K+++ K +EL + + L S+A+ VG
Sbjct: 757 EQLPKTVERFFEEWKEQKKEIEKLRKELAELLAS--ELLSKAEEVGG------------V 802
Query: 531 KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEE 564
KVV E+ D D DE++++ +L+K +
Sbjct: 803 KVVVEEVD--------ADEDELRKIANELAKEGK 828
>gnl|CDD|206755 cd08774, 14-3-3, 14-3-3 domain. 14-3-3 domain is an essential part
of 14-3-3 proteins, a ubiquitous class of regulatory,
phosphoserine/threonine-binding proteins found in all
eukaryotic cells, including yeast, protozoa and
mammalian cells. 14-3-3 proteins play important roles in
many biological processes that are regulated by
phosphorylation, including cell cycle regulation, cell
proliferation, protein trafficking, metabolic regulation
and apoptosis. More than 300 binding partners of the
14-3-3 domain have been identified in all subcellular
compartments and include transcription factors,
signaling molecules, tumor suppressors, biosynthetic
enzymes, cytoskeletal proteins and apoptosis factors.
14-3-3 binding can alter the conformation, localization,
stability, phosphorylation state, activity as well as
molecular interactions of a target protein. They
function only as dimers, some preferring strictly
homodimeric interaction, while others form heterodimers.
Binding of the 14-3-3 domain to its target occurs in a
phosphospecific manner where it binds to one of two
consensus sequences of their target proteins; RSXpSXP
(mode-1) and RXXXpSXP (mode-2). In some instances,
14-3-3 domain containing proteins are involved in
regulation and signaling of a number of cellular
processes in phosphorylation-independent manner. Many
organisms express multiple isoforms: there are seven
mammalian 14-3-3 family members (beta, gamma, eta,
theta, epsilon, sigma, zeta), each encoded by a distinct
gene, while plants contain up to 13 isoforms. The
flexible C-terminal segment of 14-3-3 isoforms shows the
highest sequence variability and may significantly
contribute to individual isoform uniqueness by playing
an important regulatory role by occupying the ligand
binding groove and blocking the binding of inappropriate
ligands in a distinct manner. Elevated amounts of 14-3-3
proteins are found in the cerebrospinal fluid of
patients with Creutzfeldt-Jakob disease. In protozoa,
like Plasmodium or Cryptosporidium parvum 14-3-3
proteins play an important role in key steps of parasite
development.
Length = 225
Score = 28.3 bits (64), Expect = 9.0
Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 21/105 (20%)
Query: 466 RCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTN--SAALKSQADSVGKEYDRLL 523
++ EQ D++ + +++++ EL KE+ N S A K+ S + R+L
Sbjct: 6 LAKLAEQA-----ERYDDMVKYMKQVAELNGELTKEERNLLSVAYKNVVGSRRASW-RIL 59
Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKK 568
E+ + KG+ +++K +KE K E+ELK
Sbjct: 60 SSIEQKES-------------SKGNEEKLKLIKEYKEKIEKELKD 91
>gnl|CDD|153337 cd07653, F-BAR_CIP4-like, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Cdc42-Interacting Protein
4 and similar proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. This subfamily is composed of
Cdc42-Interacting Protein 4 (CIP4), Formin Binding
Protein 17 (FBP17), FormiN Binding Protein 1-Like
(FNBP1L), and similar proteins. CIP4 and FNBP1L are
Cdc42 effectors that bind Wiskott-Aldrich syndrome
protein (WASP) and function in endocytosis. CIP4 and
FBP17 bind to the Fas ligand and may be implicated in
the inflammatory response. CIP4 may also play a role in
phagocytosis. Members of this subfamily typically
contain an N-terminal F-BAR domain and a C-terminal SH3
domain. In addition, some members such as FNBP1L contain
a central Cdc42-binding HR1 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 251
Score = 28.4 bits (64), Expect = 9.2
Identities = 26/100 (26%), Positives = 43/100 (43%), Gaps = 26/100 (26%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAALKSQA---DSVGKEYDRLLKEHEKVQKVVTEQGDKK 540
I L+++ K E K L+S + K Y++ KE EK +
Sbjct: 100 ISELRQERKKHLSEGSKL---QQKLESSIKQLEKSKKAYEKAFKEAEKAK---------- 146
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNSAALKNQA 580
KY+K D+D ++ T+ +++K K +A LK QA
Sbjct: 147 -QKYEKADAD--------MNLTKADVEKAK-ANANLKTQA 176
>gnl|CDD|185746 cd08915, V_Alix_like, Protein-interacting V-domain of mammalian
Alix and related domains. This superfamily contains the
V-shaped (V) domain of mammalian Alix (apoptosis-linked
gene-2 interacting protein X), His-Domain type N23
protein tyrosine phosphatase (HD-PTP, also known as
PTPN23), Bro1 and Rim20 (also known as PalA) from
Saccharomyces cerevisiae, and related domains. Alix,
HD-PTP, Bro1, and Rim20 all interact with the ESCRT
(Endosomal Sorting Complexes Required for Transport)
system. Alix, also known as apoptosis-linked gene-2
interacting protein 1 (AIP1), participates in membrane
remodeling processes during the budding of enveloped
viruses, vesicle budding inside late endosomal
multivesicular bodies (MVBs), and the abscission
reactions of mammalian cell division. It also functions
in apoptosis. HD-PTP functions in cell migration and
endosomal trafficking, Bro1 in endosomal trafficking,
and Rim20 in the response to the external pH via the
Rim101 pathway. The Alix V-domain contains a binding
site, partially conserved in this superfamily, for the
retroviral late assembly (L) domain YPXnL motif. The
Alix V-domain is also a dimerization domain. Members of
this superfamily have an N-terminal Bro1-like domain,
which binds components of the ESCRT-III complex. The
Bro1-like domains of Alix and HD-PTP can also bind human
immunodeficiency virus type 1 (HIV-1) nucleocapsid. Many
members, including Alix, HD-PTP, and Bro1, also have a
proline-rich region (PRR), which binds multiple partners
in Alix, including Tsg101 (tumor susceptibility gene
101, a component of ESCRT-1) and the apoptotic protein
ALG-2. The C-terminal portion (V-domain and PRR) of Bro1
interacts with Doa4, a ubiquitin thiolesterase needed to
remove ubiquitin from MVB cargoes; it interacts with a
YPxL motif in Doa4s catalytic domain to stimulate its
deubiquitination activity. Rim20 may bind the ESCRT-III
subunit Snf7, bringing the protease Rim13 (a
YPxL-containing transcription factor) into proximity
with Rim101, and promoting the proteolytic activation of
Rim101. HD-PTP is encoded by the PTPN23 gene, a tumor
suppressor gene candidate often absent in human kidney,
breast, lung, and cervical tumors. HD-PTP has a
C-terminal catalytically inactive tyrosine phosphatase
domain.
Length = 342
Score = 28.8 bits (65), Expect = 9.4
Identities = 22/133 (16%), Positives = 51/133 (38%), Gaps = 21/133 (15%)
Query: 467 CEIFEQCDKGLEGDSDEIKRLKEK-------LSKTEEELKKEKTNSAALKS---QADSVG 516
E+ ++C++ LE ++ E +L+ K ++E K+ L+ QA +
Sbjct: 90 EELLQECEELLEEEAAEDDQLRAKFGTLRWRRPSSDEAAKELYEKVTKLRGYLEQASNSD 149
Query: 517 KEYDRLLKEHEKVQKVVTEQGDKK---------DDKYQKGDSDEIKRLKEKLSKTEEELK 567
E + + + +V G K S+ + L+ L++ E K
Sbjct: 150 NEVLQCYESIDP--NLVLLCGGYKELKAFIPSPYPALDPEVSEVVSSLRPLLNEVSELEK 207
Query: 568 KEKTNSAALKNQA 580
+ + + L+ ++
Sbjct: 208 ERERFISELEIKS 220
>gnl|CDD|224237 COG1318, COG1318, Predicted transcriptional regulators
[Transcription].
Length = 182
Score = 28.3 bits (63), Expect = 9.6
Identities = 13/88 (14%), Positives = 29/88 (32%), Gaps = 9/88 (10%)
Query: 483 EIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGK--EYDRLLKEHEKVQKVVTEQGDKK 540
++ R + K E K ++ EH + K +
Sbjct: 90 QLVRETYEKLKEGGLDAVEVEIEKLEKEGLKIRWAVEVLKKIKGEHFPMDK-------EL 142
Query: 541 DDKYQKGDSDEIKRLKEKLSKTEEELKK 568
++ KG+ + + L+E + K E ++
Sbjct: 143 LEEKLKGEVIKGEPLEEVIDKLEYPVES 170
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 28.7 bits (64), Expect = 9.6
Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 3/87 (3%)
Query: 484 IKRLKEKLSKTEEELKKEKTNSAA-LKSQADSVGKEYDRLLKEHEKVQKVVTEQGDKKDD 542
++RL++ + + E+ + KE L+ + + +GKE + L K +++ + E ++ ++
Sbjct: 363 LERLEKDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKR-VALEEGLKEL-EQDEE 420
Query: 543 KYQKGDSDEIKRLKEKLSKTEEELKKE 569
+ K S E K L EKL E++K
Sbjct: 421 NFLKELSKEEKELLEKLKMEASEVEKL 447
>gnl|CDD|148313 pfam06631, DUF1154, Protein of unknown function (DUF1154). This
family represents a small conserved region of unknown
function within eukaryotic phospholipase C (EC:3.1.4.3).
All members also contain pfam00387 and pfam00388.
Length = 47
Score = 25.8 bits (57), Expect = 9.7
Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 4/37 (10%)
Query: 562 TEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEKVQ 598
T E L+++K LK Q KE D L K+H K +
Sbjct: 10 TLESLRQDKAYQKLLKKQQ----KELDSLKKKHAKER 42
Score = 25.8 bits (57), Expect = 9.9
Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 4/37 (10%)
Query: 494 TEEELKKEKTNSAALKSQADSVGKEYDRLLKEHEKVQ 530
T E L+++K LK Q KE D L K+H K +
Sbjct: 10 TLESLRQDKAYQKLLKKQQ----KELDSLKKKHAKER 42
>gnl|CDD|225887 COG3351, FlaD, Putative archaeal flagellar protein D/E [Cell
motility and secretion].
Length = 214
Score = 28.3 bits (63), Expect = 9.7
Identities = 22/107 (20%), Positives = 45/107 (42%), Gaps = 11/107 (10%)
Query: 465 KRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKK-EKTNSAALKSQADSVGKEYDRLL 523
K+ + E+ +K + + ++ LKEK+ + + KK + L + + + L
Sbjct: 1 KKPYLEEKIEKAEKVTAFALEELKEKIEELPIQAKKSDDELVEELPER---YEQTKENSL 57
Query: 524 KEHEKVQKVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEK 570
E + + +K S+++K + S +EEE KK K
Sbjct: 58 IEKVDSIEEEISEKEKV-------MSEKLKEPAQMSSTSEEEEKKAK 97
>gnl|CDD|227680 COG5391, COG5391, Phox homology (PX) domain protein [Intracellular
trafficking and secretion / General function prediction
only].
Length = 524
Score = 29.0 bits (65), Expect = 9.8
Identities = 26/147 (17%), Positives = 43/147 (29%), Gaps = 19/147 (12%)
Query: 462 ESSKRCEIFEQCDKGLEGDSDEIKRLKEKLSKTEEELKKEKTNSAALKSQADSVGKEYDR 521
E + E K L + + +E L T+ L+K + R
Sbjct: 355 EGVVQAETLRSSLKELLTQLQDEIKSRESLILTDSNLEKLTD----QNLEDVEELSRSLR 410
Query: 522 LLKEHEKVQ--------KVVTEQGDKKDDKYQKGDSDEIKRLKEKLSKTEEELKKEKTNS 573
V +D +K S I+ L++ K EE+L
Sbjct: 411 KNSSQRAVVSQQPEGLTSFSKLSYKLRDFVQEKSRSKSIESLQQDKEKLEEQL------- 463
Query: 574 AALKNQADSVGKEYDRLLKEHEKVQKV 600
A + A + +E LK V+
Sbjct: 464 AIAEKDAQEINEELKNELKFFFSVRNS 490
>gnl|CDD|216689 pfam01765, RRF, Ribosome recycling factor. The ribosome recycling
factor (RRF / ribosome release factor) dissociates the
ribosome from the mRNA after termination of translation,
and is essential bacterial growth. Thus ribosomes are
"recycled" and ready for another round of protein
synthesis.
Length = 165
Score = 27.8 bits (63), Expect = 9.8
Identities = 14/47 (29%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 550 DEIKRLKEKLSKTEEELKKEKTNSAALKNQADSVGKEYDRLLKEHEK 596
D++K+L++ +E+E+K+ + ++ D K+ D LLK+ EK
Sbjct: 119 DKLKKLEKDKEISEDEVKRAEKE---IQKLTDKYIKKIDELLKKKEK 162
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.134 0.397
Gapped
Lambda K H
0.267 0.0715 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 29,909,620
Number of extensions: 2893741
Number of successful extensions: 6081
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5350
Number of HSP's successfully gapped: 588
Length of query: 609
Length of database: 10,937,602
Length adjustment: 103
Effective length of query: 506
Effective length of database: 6,369,140
Effective search space: 3222784840
Effective search space used: 3222784840
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.7 bits)