RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2772
(746 letters)
>gnl|CDD|99996 cd04300, GT1_Glycogen_Phosphorylase, This is a family of
oligosaccharide phosphorylases. It includes yeast and
mammalian glycogen phosphorylases, plant starch/glucan
phosphorylase, as well as the maltodextrin
phosphorylases of bacteria. The members of this family
catalyze the breakdown of oligosaccharides into
glucose-1-phosphate units. They are important allosteric
enzymes in carbohydrate metabolism. The allosteric
control mechanisms of yeast and mammalian members of
this family are different from that of bacterial
members. The members of this family belong to the GT-B
structural superfamily of glycoslytransferases, which
have characteristic N- and C-terminal domains each
containing a typical Rossmann fold. The two domains have
high structural homology despite minimal sequence
homology. The large cleft that separates the two
domains includes the catalytic center and permits a high
degree of flexibility.
Length = 797
Score = 108 bits (272), Expect = 3e-24
Identities = 37/65 (56%), Positives = 45/65 (69%)
Query: 253 LKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVISLPFKD 312
L N GDYI+AV ++N AENIS+VLYPND+ GKELRLKQ+YF +A+LQ I FK
Sbjct: 231 LDAFNRGDYIRAVEEKNRAENISKVLYPNDSTEEGKELRLKQQYFFVSASLQDIIRRFKK 290
Query: 313 QGRAL 317
L
Sbjct: 291 THGPL 295
>gnl|CDD|215868 pfam00343, Phosphorylase, Carbohydrate phosphorylase. The members
of this family catalyze the formation of glucose
1-phosphate from one of the following polyglucoses;
glycogen, starch, glucan or maltodextrin.
Length = 712
Score = 101 bits (254), Expect = 3e-22
Identities = 43/69 (62%), Positives = 51/69 (73%)
Query: 249 EKIPLKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVISL 308
E+ L NDGDY+ AV D+N AENISRVLYPNDN F GKELRLKQ+YF+ +ATLQ I
Sbjct: 141 EEFNLADFNDGDYLAAVEDKNRAENISRVLYPNDNTFEGKELRLKQQYFLVSATLQDIIR 200
Query: 309 PFKDQGRAL 317
FK ++L
Sbjct: 201 RFKKSHKSL 209
>gnl|CDD|233722 TIGR02093, P_ylase, glycogen/starch/alpha-glucan phosphorylases.
This family consists of phosphorylases. Members use
phosphate to break alpha 1,4 linkages between pairs of
glucose residues at the end of long glucose polymers,
releasing alpha-D-glucose 1-phosphate. The nomenclature
convention is to preface the name according to the
natural substrate, as in glycogen phosphorylase, starch
phosphorylase, maltodextrin phosphorylase, etc. Name
differences among these substrates reflect differences
in patterns of branching with alpha 1,6 linkages.
Members include allosterically regulated and unregulated
forms. A related family, TIGR02094, contains examples
known to act well on particularly small alpha 1,4
glucans, as may be found after import from exogenous
sources [Energy metabolism, Biosynthesis and degradation
of polysaccharides].
Length = 794
Score = 96.6 bits (241), Expect = 1e-20
Identities = 36/69 (52%), Positives = 46/69 (66%)
Query: 249 EKIPLKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVISL 308
E+ L N GDY +AV ++N AENISRVLYPND+ + GKELRLKQ+YF +A+LQ I
Sbjct: 224 EEFDLDAFNAGDYYEAVEEKNRAENISRVLYPNDSTYEGKELRLKQQYFFVSASLQDIIR 283
Query: 309 PFKDQGRAL 317
+ L
Sbjct: 284 RHLETHPDL 292
>gnl|CDD|184948 PRK14986, PRK14986, glycogen phosphorylase; Provisional.
Length = 815
Score = 78.3 bits (193), Expect = 6e-15
Identities = 32/53 (60%), Positives = 43/53 (81%), Gaps = 1/53 (1%)
Query: 254 KFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVI 306
KF N GDY AV D+N +EN+SRVLYP+D+ + G+ELRL+QEYF+ +AT+Q I
Sbjct: 246 KF-NQGDYFAAVEDKNHSENVSRVLYPDDSTYSGRELRLRQEYFLVSATVQDI 297
>gnl|CDD|223136 COG0058, GlgP, Glucan phosphorylase [Carbohydrate transport and
metabolism].
Length = 750
Score = 62.3 bits (152), Expect = 7e-10
Identities = 27/54 (50%), Positives = 35/54 (64%), Gaps = 11/54 (20%)
Query: 253 LKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCAATLQVI 306
L F N G+ ++N A NI+RVLYP D+ KELRLKQEYF+ +A +Q I
Sbjct: 210 LDF-NVGE------NKNDARNITRVLYPGDS----KELRLKQEYFLGSAGVQDI 252
>gnl|CDD|237881 PRK14985, PRK14985, maltodextrin phosphorylase; Provisional.
Length = 798
Score = 62.0 bits (151), Expect = 7e-10
Identities = 29/59 (49%), Positives = 39/59 (66%), Gaps = 9/59 (15%)
Query: 242 FNLTERNEKIPLKFVNDGDYIQAVLDRNLAENISRVLYPNDNNFGGKELRLKQEYFMCA 300
F+LT KF NDGD+++A AE +++VLYPNDN+ GK+LRL Q+YF CA
Sbjct: 231 FDLT--------KF-NDGDFLRAEQQGIDAEKLTKVLYPNDNHTAGKKLRLMQQYFQCA 280
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 46.2 bits (110), Expect = 7e-05
Identities = 25/115 (21%), Positives = 51/115 (44%), Gaps = 6/115 (5%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
AEVE+ + L +EL+ + + + ++LEE A E+E+ L +++ + E+
Sbjct: 740 AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK 799
Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
+L L AEL + +L+ + + L ++ + L +QI
Sbjct: 800 ALREALD------ELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIE 848
Score = 42.4 bits (100), Expect = 0.001
Identities = 28/115 (24%), Positives = 54/115 (46%), Gaps = 1/115 (0%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
+ + R + LER+L + + L KL+E A LE +L L+++L+ E+
Sbjct: 302 QQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELE 361
Query: 529 NGTASLSNGPS-VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
A L S + LE +L +++ V L+ Q L+ ++++L + L +
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRR 416
Score = 40.8 bits (96), Expect = 0.003
Identities = 26/122 (21%), Positives = 54/122 (44%), Gaps = 8/122 (6%)
Query: 470 EVERYRRQQRLLERELSRVRSI-------LAHNSKKLEETVAANARLESELVVLRQKLQW 522
E+E + + LE +L +R + K+L +RLE + +LR++L
Sbjct: 254 ELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLAN 313
Query: 523 SRREVCNGTASLSNGPSVA-ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQ 581
R++ A L S L EL ++ + +L+ + + L A++++L + L +
Sbjct: 314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESR 373
Query: 582 IR 583
+
Sbjct: 374 LE 375
Score = 40.4 bits (95), Expect = 0.004
Identities = 27/109 (24%), Positives = 54/109 (49%), Gaps = 8/109 (7%)
Query: 476 RQQRL--LERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTAS 533
R++ + LE ++ + +A K L E LE EL LR++L+ R++
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKD 734
Query: 534 LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
L A LEAE+ +++ + L ++ EL A++++L E+ +++
Sbjct: 735 L------ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEEL 777
Score = 40.0 bits (94), Expect = 0.006
Identities = 31/138 (22%), Positives = 60/138 (43%), Gaps = 9/138 (6%)
Query: 444 GQAKEMLENALGSLRHKMHGVHASPAEVERYR----RQQRLLERELSRVRSILAHNSKKL 499
LE LR ++ + E+E + L EL+ + L ++L
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Query: 500 EETVAANARLESELVVLRQKLQWSRREV---CNGTASLSNGPSVAALEAELRRVQALVGD 556
E A LE+EL L +L+ ++ + A L +A+L E+ R++A +
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLEL--QIASLNNEIERLEARLER 411
Query: 557 LQRQRQELSAQVKQLTEK 574
L+ +R+ L ++++L +K
Sbjct: 412 LEDRRERLQQEIEELLKK 429
Score = 38.9 bits (91), Expect = 0.012
Identities = 32/164 (19%), Positives = 70/164 (42%), Gaps = 27/164 (16%)
Query: 424 SRDQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMH----GVHASPAEVERYRRQQR 479
D++A +L + ++ + E LE L L ++ + ++E R +
Sbjct: 331 KLDELAEELAELEEKLEELKEE-LESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
Query: 480 LLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPS 539
LE +++ + + + +LE RL+ E+ L +KL+
Sbjct: 390 QLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE------------------ 431
Query: 540 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
EAEL+ +QA + +L+ + +EL ++++L E L +++
Sbjct: 432 ----EAELKELQAELEELEEELEELQEELERLEEALEELREELE 471
Score = 38.1 bits (89), Expect = 0.022
Identities = 28/147 (19%), Positives = 56/147 (38%), Gaps = 10/147 (6%)
Query: 440 CSVQGQAKEMLENALGSLRHKMH----GVHASPAEVERYRRQQRLLERELSRVRSILAHN 495
+A + L L L + + + + R+ LE ++ + +
Sbjct: 798 LKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESL 857
Query: 496 SKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVG 555
+ ++EE LESEL L + + + L L ELR +++
Sbjct: 858 AAEIEELEELIEELESELEALLNERASLEEALALLRSELEE------LSEELRELESKRS 911
Query: 556 DLQRQRQELSAQVKQLTEKSNSLSQQI 582
+L+R+ +EL ++ QL + L +I
Sbjct: 912 ELRRELEELREKLAQLELRLEGLEVRI 938
Score = 36.6 bits (85), Expect = 0.052
Identities = 20/110 (18%), Positives = 41/110 (37%), Gaps = 13/110 (11%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
++E + LE + + S L + A A L SEL L ++L+ +
Sbjct: 852 EDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKR- 910
Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSL 578
+ L EL ++ + L+ + + L ++ L E+ +
Sbjct: 911 ------------SELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEE 948
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 45.8 bits (109), Expect = 9e-05
Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 14/148 (9%)
Query: 447 KEMLENALGSLRHKMHGVHASPAE--VERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
E LE L L ++ + A + + + + LE E+SR+ + L +KL
Sbjct: 767 IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTL 826
Query: 505 ANARLESE---LVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALV------- 554
LE E L R L+ + + +L NG LE EL ++A +
Sbjct: 827 EKEYLEKEIQELQEQRIDLKEQIKSIEKEIENL-NG-KKEELEEELEELEAALRDLESRL 884
Query: 555 GDLQRQRQELSAQVKQLTEKSNSLSQQI 582
GDL+++R EL AQ+++L K L QI
Sbjct: 885 GDLKKERDELEAQLRELERKIEELEAQI 912
Score = 43.1 bits (102), Expect = 6e-04
Identities = 29/123 (23%), Positives = 48/123 (39%), Gaps = 6/123 (4%)
Query: 466 ASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRR 525
+E LE+E+ V+S L ++EE +LE L L +L SR
Sbjct: 734 KLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRI 793
Query: 526 EVCNGTAS------LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 579
S + +E +L R+ L+++ QEL Q L E+ S+
Sbjct: 794 PEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIE 853
Query: 580 QQI 582
++I
Sbjct: 854 KEI 856
Score = 39.7 bits (93), Expect = 0.007
Identities = 30/170 (17%), Positives = 71/170 (41%), Gaps = 17/170 (10%)
Query: 425 RDQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPA-----------EVER 473
++ +L + + + + K++ E ++ K+ + A A E+E
Sbjct: 260 ISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELED 319
Query: 474 YRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTAS 533
+ LE E+ ++ + + +++EE +L E L+++L+ R E+
Sbjct: 320 AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKE 379
Query: 534 LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
A EL+ + + L+R+ EL ++ +L E+ LS+++
Sbjct: 380 F------AETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELA 423
Score = 33.1 bits (76), Expect = 0.63
Identities = 22/113 (19%), Positives = 52/113 (46%), Gaps = 13/113 (11%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
E++ + Q+ L+ ++ + + + + K EE LE+ L L +L ++E
Sbjct: 834 EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDE 893
Query: 530 GTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
LEA+LR ++ + +L+ Q ++ ++ +L K +L +++
Sbjct: 894 -------------LEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEEL 933
Score = 32.7 bits (75), Expect = 0.85
Identities = 26/107 (24%), Positives = 45/107 (42%), Gaps = 1/107 (0%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
E+ +R+ L+ EL R+ LA + + A LE E +++ ++
Sbjct: 400 EINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ 459
Query: 530 GTASLSN-GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
A LS + L+ E RV+ + LQR+ E AQ + E+
Sbjct: 460 LAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERV 506
Score = 32.0 bits (73), Expect = 1.5
Identities = 23/117 (19%), Positives = 49/117 (41%), Gaps = 13/117 (11%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESE-LVVLR-QKLQWSRRE 526
+ E+ + +E + R+ I+ ++LE RL E R Q L +RE
Sbjct: 170 RKKEKALEELEEVEENIERLDLIIDEKRQQLE-------RLRREREKAERYQALLKEKRE 222
Query: 527 VCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
G L ALE + ++ + L+ + ++L+ ++ +L ++ + Q +
Sbjct: 223 Y-EGYELLKE---KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLE 275
Score = 31.6 bits (72), Expect = 2.1
Identities = 26/112 (23%), Positives = 54/112 (48%), Gaps = 8/112 (7%)
Query: 468 PAEVERYRRQQRLLERELSRVRSILAH-------NSKKLEETVAANARLESELVVLRQKL 520
PAE++R R + L+RELS ++S L S++L + +E E+ L Q+
Sbjct: 673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEE 732
Query: 521 QWSRREVCNGTASLSN-GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQL 571
+ + + LS+ + +++EL+ ++A + +L+ +L + L
Sbjct: 733 EKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDL 784
Score = 31.2 bits (71), Expect = 2.5
Identities = 27/140 (19%), Positives = 60/140 (42%), Gaps = 6/140 (4%)
Query: 447 KEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAAN 506
K L + LR + + E+ R+ +E+E+ ++ ++LEE
Sbjct: 687 KRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDL 746
Query: 507 ARLESELVVLRQKLQWSRREVCNGTASLSN-GPSVAALEAELRR-----VQALVGDLQRQ 560
+ LE E+ ++ +L+ + L ++ LEA L +QA + L+ +
Sbjct: 747 SSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEE 806
Query: 561 RQELSAQVKQLTEKSNSLSQ 580
+ A+++++ +K N L+
Sbjct: 807 VSRIEARLREIEQKLNRLTL 826
Score = 30.8 bits (70), Expect = 3.6
Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 9/78 (11%)
Query: 2 ELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALV-------GDLQRQRQEL 54
EL R L+ + + +L NG LE EL ++A + GDL+++R EL
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENL-NG-KKEELEEELEELEAALRDLESRLGDLKKERDEL 894
Query: 55 SAQVKQLTEKSNSLSQQI 72
AQ+++L K L QI
Sbjct: 895 EAQLRELERKIEELEAQI 912
Score = 30.4 bits (69), Expect = 4.2
Identities = 27/145 (18%), Positives = 60/145 (41%), Gaps = 10/145 (6%)
Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRL----L 481
+++ +++ R + +Q + + L L L + G+ A E+E + + L
Sbjct: 395 EKLKREINELKRELDRLQEELQR-LSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQ 453
Query: 482 ERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVA 541
E +L ++ + L+ ++L + R+E EL L+++L + E + A
Sbjct: 454 EWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQREL--AEAEAQARASEERVRGGRA 511
Query: 542 A---LEAELRRVQALVGDLQRQRQE 563
L+A ++ V V L +
Sbjct: 512 VEEVLKASIQGVHGTVAQLGSVGER 536
Score = 30.0 bits (68), Expect = 5.2
Identities = 35/144 (24%), Positives = 66/144 (45%), Gaps = 6/144 (4%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
++E+ +R+ L+REL R++ L S++L + AA A +E+++ L ++ + E+
Sbjct: 393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKK 452
Query: 530 GTASL-SNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTG 588
L ++ E EL ++ + R +ELS ++L E G
Sbjct: 453 QEWKLEQLAADLSKYEQELYDLKE---EYDRVEKELSKLQRELAEAEAQARASEERVRGG 509
Query: 589 VAVAGKSVNRPGPTGVAGKVSKSG 612
AV + V + GV G V++ G
Sbjct: 510 RAV--EEVLKASIQGVHGTVAQLG 531
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 43.1 bits (102), Expect = 1e-04
Identities = 22/107 (20%), Positives = 50/107 (46%), Gaps = 6/107 (5%)
Query: 480 LLERELSR---VRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN 536
+L R LS R L ++L E A L+ +L L+Q+ Q ++E+ + +
Sbjct: 53 VLSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQEL---STLEAE 109
Query: 537 GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
+ A ++++ A +L + +EL ++ +L +++ +L +
Sbjct: 110 LERLQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENE 156
Score = 39.6 bits (93), Expect = 0.003
Identities = 24/102 (23%), Positives = 51/102 (50%), Gaps = 3/102 (2%)
Query: 475 RRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASL 534
R + L++EL+ ++ LA ++L E N L+ EL L +L+ ++E+
Sbjct: 65 RERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLS 124
Query: 535 SNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN 576
+N L+ E R ++ + +L+++ + L A+ ++L E
Sbjct: 125 AN---AIELDEENRELREELAELKQENEALEAENERLQENEQ 163
Score = 30.7 bits (70), Expect = 1.8
Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 4/54 (7%)
Query: 24 LSNGPS----VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
LS+ PS + L+ EL +Q + +LQ Q EL + ++L ++ ++L ++
Sbjct: 58 LSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELE 111
Score = 30.0 bits (68), Expect = 3.4
Identities = 12/73 (16%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 1 SELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQ 60
+L L+Q+ Q ++E+ + + + A ++++ A +L + +EL ++ +
Sbjct: 87 EQLAELQQENQELKQEL---STLEAELERLQKELARIKQLSANAIELDEENRELREELAE 143
Query: 61 LTEKSNSLSQQIR 73
L +++ +L +
Sbjct: 144 LKQENEALEAENE 156
Score = 28.8 bits (65), Expect = 7.7
Identities = 8/35 (22%), Positives = 20/35 (57%)
Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN 66
L+ E R ++ + +L+++ + L A+ ++L E
Sbjct: 129 ELDEENRELREELAELKQENEALEAENERLQENEQ 163
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 44.3 bits (105), Expect = 2e-04
Identities = 30/114 (26%), Positives = 61/114 (53%), Gaps = 6/114 (5%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
A+++ R++ LE ELSR+ L ++LEE L+SEL LR++L+ + E+
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELL 291
Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
+ LE E+ ++ + +L+ + +EL ++++L EK +L +++
Sbjct: 292 ELKEEI------EELEGEISLLRERLEELENELEELEERLEELKEKIEALKEEL 339
Score = 43.2 bits (102), Expect = 6e-04
Identities = 28/114 (24%), Positives = 53/114 (46%), Gaps = 13/114 (11%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
+E RRQ LER+L ++ LA ++LE+ + LE EL L ++L+ +
Sbjct: 703 LLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQER--- 759
Query: 530 GTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
+ LE EL ++ + L+ + +EL + + L E+ L +++
Sbjct: 760 ----------LEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803
Score = 40.5 bits (95), Expect = 0.003
Identities = 28/123 (22%), Positives = 61/123 (49%), Gaps = 6/123 (4%)
Query: 461 MHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKL 520
+ G + +R+ + LE EL+ + + L ++L+ LE L LR++L
Sbjct: 652 ITGGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQL 711
Query: 521 QWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQ 580
+ R++ +AALE EL ++Q+ + +L+ + +EL ++++L E+ L +
Sbjct: 712 EELERQL------EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEE 765
Query: 581 QIR 583
++
Sbjct: 766 ELE 768
Score = 40.5 bits (95), Expect = 0.004
Identities = 25/116 (21%), Positives = 50/116 (43%), Gaps = 6/116 (5%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
E+E+ + + LE EL + L ++LEE LE L L+++++ +
Sbjct: 730 EELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQ 789
Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 584
L LE EL + + L+R+ + L + ++L ++ L ++I
Sbjct: 790 ALQEEL------EELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEE 839
Score = 40.1 bits (94), Expect = 0.004
Identities = 29/168 (17%), Positives = 65/168 (38%), Gaps = 12/168 (7%)
Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL 485
+++ +L + I ++ + +E L L L+ ++ + E+E R EL
Sbjct: 256 EELQEELEEAEKEIEELKSELEE-LREELEELQEELLELKEEIEELEGEISLLRERLEEL 314
Query: 486 SRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN----------GTASLS 535
L ++L+E + A E L ++L+
Sbjct: 315 ENELEELEERLEELKEKIEALKEELEERETLLEELE-QLLAELEEAKEELEEKLSALLEE 373
Query: 536 NGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
AL EL ++A + +++ + +EL +++ L E+ LS+++
Sbjct: 374 LEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421
Score = 37.4 bits (87), Expect = 0.032
Identities = 32/130 (24%), Positives = 58/130 (44%), Gaps = 4/130 (3%)
Query: 443 QGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEET 502
E LE +LR ++ + A E+ R + L+RE+ + L S++LE+
Sbjct: 367 LSALLEELEELFEALREELAELEA---ELAEIRNELEELKREIESLEERLERLSERLEDL 423
Query: 503 VAANARLESELVVLRQKLQWSRREVCNGTASLSNGPS-VAALEAELRRVQALVGDLQRQR 561
LE+EL L+ +L+ E+ L + LE EL +Q + L+++
Sbjct: 424 KEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKEL 483
Query: 562 QELSAQVKQL 571
L A++ +L
Sbjct: 484 SSLEARLDRL 493
Score = 36.6 bits (85), Expect = 0.054
Identities = 30/139 (21%), Positives = 55/139 (39%), Gaps = 30/139 (21%)
Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
Q +E LE + L ++ E + LE EL + L ++LEE A
Sbjct: 821 QRRERLEQEIEELEEEI----------EELEEKLDELEEELEELEKELEELKEELEELEA 870
Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
LE EL L ++ LE ELR +++ + +L+ + ++L
Sbjct: 871 EKEELEDELKELEEEK--------------------EELEEELRELESELAELKEEIEKL 910
Query: 565 SAQVKQLTEKSNSLSQQIR 583
++++L K L ++
Sbjct: 911 RERLEELEAKLERLEVELP 929
Score = 33.5 bits (77), Expect = 0.54
Identities = 35/153 (22%), Positives = 65/153 (42%), Gaps = 35/153 (22%)
Query: 447 KEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLER---ELSRVRSILAHNSKKLE--E 501
++++E A G ++K ER +R LER L R+ +L K+LE E
Sbjct: 158 RKLIEEAAGVSKYK-----------ERKEEAERKLERTEENLERLEDLLEELEKQLEKLE 206
Query: 502 TVAANA----RLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDL 557
A A L++EL L L ++ + + LE EL R++ + +L
Sbjct: 207 RQAEKAERYQELKAELRELELALLLAKLKELRK--------ELEELEEELSRLEEELEEL 258
Query: 558 QRQRQELSAQV-------KQLTEKSNSLSQQIR 583
Q + +E ++ ++L E+ L +++
Sbjct: 259 QEELEEAEKEIEELKSELEELREELEELQEELL 291
Score = 30.1 bits (68), Expect = 6.0
Identities = 22/105 (20%), Positives = 42/105 (40%), Gaps = 3/105 (2%)
Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL 485
++ +L R ++ + + LE L + ++E R + + LEREL
Sbjct: 410 EERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELEREL 469
Query: 486 SRVRSILAHNSKKLEETVAANARLESELV---VLRQKLQWSRREV 527
+ ++ L K+L A RLE+E +R L+ +
Sbjct: 470 AELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGL 514
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 43.5 bits (103), Expect = 4e-04
Identities = 34/133 (25%), Positives = 66/133 (49%), Gaps = 7/133 (5%)
Query: 451 ENALGSLRHKMHGVHASPAE-VERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARL 509
E AL +LR K +E +E+ ++Q+ E+E S++++ + +L++ A A+L
Sbjct: 119 EEALATLRKKHQDAINELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQ--IAKAKL 176
Query: 510 ESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVK 569
+E ++L+ E+ L + L ++ R+Q+ DL RQ +E AQV
Sbjct: 177 NAEKK--AKQLESQLSELQVKLDELQR--QLNDLTSQKSRLQSENSDLTRQLEEAEAQVS 232
Query: 570 QLTEKSNSLSQQI 582
L++ + L Q+
Sbjct: 233 NLSKLKSQLESQL 245
Score = 39.3 bits (92), Expect = 0.008
Identities = 32/137 (23%), Positives = 64/137 (46%), Gaps = 9/137 (6%)
Query: 446 AKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAA 505
K L+ + L ++ + + E+ +Q LE +LS ++ L ++L + +
Sbjct: 153 EKSQLQAEVDDLLAQLDQIAKAKLNAEKKAKQ---LESQLSELQVKLDELQRQLNDLTSQ 209
Query: 506 NARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELS 565
+RL+SE L ++L+ E S + LE++L + + + R+R L
Sbjct: 210 KSRLQSENSDLTRQLE--EAEAQVSNLSKLK----SQLESQLEEAKRSLEEESRERANLQ 263
Query: 566 AQVKQLTEKSNSLSQQI 582
AQ++QL +SL +Q+
Sbjct: 264 AQLRQLEHDLDSLREQL 280
Score = 38.1 bits (89), Expect = 0.021
Identities = 32/145 (22%), Positives = 64/145 (44%), Gaps = 18/145 (12%)
Query: 450 LENALGSLRHKMHGVHASPAEVERYRRQ--QRLLER--ELSRVRSILAHNSKKLEETVAA 505
LE A +L + V + E+ + R + +RL E+ E R + L+ T+ A
Sbjct: 477 LEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEA 536
Query: 506 NARLESELVVLRQKL--QWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQ----- 558
A+ ++E L++KL + E+ A+ +N A + +++ Q V +LQ
Sbjct: 537 EAKGKAEASRLKKKLEGDINELEIALDHANKAN----AEAQKNVKKYQQQVKELQTQVEE 592
Query: 559 --RQRQELSAQVKQLTEKSNSLSQQ 581
R R++ Q+ + E+ + +
Sbjct: 593 EQRAREDAREQL-AVAERRATALEA 616
Score = 35.8 bits (83), Expect = 0.11
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 29 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
+ L ++ R+Q+ DL RQ +E AQV L++ + L Q+
Sbjct: 202 QLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQL 245
Score = 33.1 bits (76), Expect = 0.68
Identities = 22/92 (23%), Positives = 44/92 (47%), Gaps = 11/92 (11%)
Query: 484 ELSRVRSILAHNSKKLEETVAA-NARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAA 542
EL R + L + + E ++ +++LE E L +LQ +E+ +
Sbjct: 1 ELERQKRELENQLYRKESELSQLSSKLEDEQA-LVAQLQKKIKELEA---------RIRE 50
Query: 543 LEAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
LE EL +A ++ R +LS ++++L+E+
Sbjct: 51 LEEELEAERAARAKAEKARADLSRELEELSER 82
Score = 33.1 bits (76), Expect = 0.73
Identities = 13/41 (31%), Positives = 28/41 (68%)
Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
LE++L +Q + +LQRQ +L++Q +L +++ L++Q+
Sbjct: 184 QLESQLSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQL 224
Score = 31.6 bits (72), Expect = 2.2
Identities = 32/150 (21%), Positives = 59/150 (39%), Gaps = 21/150 (14%)
Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAE-VERYRRQQRLLERE 484
D+I ++ + + + A+ N L ++ + VE RR+ + L+ E
Sbjct: 383 DKILAEWKRKVDELQAELDTAQREARNLSTEL-FRLKNELEELKDQVEALRRENKNLQDE 441
Query: 485 LSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALE 544
+ + L + + E A RLE+E L+ L+ + AALE
Sbjct: 442 IHDLTDQLGEGGRNVHELEKARRRLEAEKDELQAALEEAE----------------AALE 485
Query: 545 AELRRVQALVGDLQRQRQELSAQVKQLTEK 574
E +V +L + R E+ ++L EK
Sbjct: 486 LEESKVLRAQVELSQIRSEIE---RRLAEK 512
Score = 30.8 bits (70), Expect = 3.6
Identities = 30/139 (21%), Positives = 58/139 (41%), Gaps = 8/139 (5%)
Query: 444 GQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETV 503
+ + L+ L L H + + +E + LER+LS+ + + K E
Sbjct: 256 SRERANLQAQLRQLEHDLDSLREQ---LEEESEAKAELERQLSKANAEIQQWRSKFESEG 312
Query: 504 AANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQE 563
A A EL L++KL E+ + + +LE R+Q+ + DLQ + +
Sbjct: 313 ALRAE---ELEELKKKLNQKISELEEAAEAANA--KCDSLEKTKSRLQSELEDLQIELER 367
Query: 564 LSAQVKQLTEKSNSLSQQI 582
+A +L +K + + +
Sbjct: 368 ANAAASELEKKQKNFDKIL 386
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 41.3 bits (97), Expect = 5e-04
Identities = 22/127 (17%), Positives = 46/127 (36%), Gaps = 6/127 (4%)
Query: 456 SLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVV 515
+ + E+ ++Q L+ EL + +A +LE+ L L
Sbjct: 66 VQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKS 125
Query: 516 LRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
L ++L+ + L + V+ L +L+R ++ L + + E
Sbjct: 126 LEERLESLEESIKELAKELREL-----RQDLREEVEELREELERLQENLQRLQEAIQE-L 179
Query: 576 NSLSQQI 582
SL +Q+
Sbjct: 180 QSLLEQL 186
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 39.9 bits (94), Expect = 0.005
Identities = 30/119 (25%), Positives = 45/119 (37%), Gaps = 24/119 (20%)
Query: 465 HASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSR 524
+A+PA +E RR+ LE EL L + + A L +EL L +L
Sbjct: 415 NATPAALEDLRRRIAALELEL----DALEREAALGADHDERLAELRAELAALEAEL---- 466
Query: 525 REVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
AALEA ++ + LV + R EL A + +L Q+
Sbjct: 467 ----------------AALEARWQQEKELVEAILALRAELEADADAPADDDAALRAQLA 509
>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family. This family includes PspA
a protein that suppresses sigma54-dependent
transcription. The PspA protein, a negative regulator of
the Escherichia coli phage shock psp operon, is produced
when virulence factors are exported through secretins in
many Gram-negative pathogenic bacteria and its homologue
in plants, VIPP1, plays a critical role in thylakoid
biogenesis, essential for photosynthesis. Activation of
transcription by the enhancer-dependent bacterial
sigma(54) containing RNA polymerase occurs through ATP
hydrolysis-driven protein conformational changes enabled
by activator proteins that belong to the large AAA(+)
mechanochemical protein family. It has been shown that
PspA directly and specifically acts upon and binds to
the AAA(+) domain of the PspF transcription activator.
Length = 220
Score = 38.1 bits (89), Expect = 0.009
Identities = 27/108 (25%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 478 QRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNG 537
+++LE+ + ++S L + L + +A +LE +L QK Q + E A+L+ G
Sbjct: 25 EKMLEQAIRDMQSELGKARQALAQVIARQKQLERKLE--EQKEQAKKLEN-KARAALTKG 81
Query: 538 PSVAALEA--ELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
A EA E+ ++ L+ Q + + V+QL ++ +L +I+
Sbjct: 82 NEELAREALAEIATLEKQAEALETQLTQQRSAVEQLRKQLAALETKIQ 129
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 36.2 bits (84), Expect = 0.063
Identities = 25/117 (21%), Positives = 50/117 (42%), Gaps = 18/117 (15%)
Query: 467 SPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQW--SR 524
S +E+ + + ++REL +S L ++LEE EL + +K+ S
Sbjct: 371 SGSELTQLEVLIQQVKRELQDAKSQLLKELRELEE----------ELAEVDKKISTIPSE 420
Query: 525 REVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQ 581
++ L + EL R +A + +L RQ + L ++ L + + ++Q
Sbjct: 421 EQIAQLLEEL------GEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQ 471
Score = 32.0 bits (73), Expect = 1.6
Identities = 19/107 (17%), Positives = 40/107 (37%), Gaps = 16/107 (14%)
Query: 477 QQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN 536
++R S ++E A + L Q++ R E+ SL +
Sbjct: 196 RRRKKSELPS-------SILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSLES 248
Query: 537 GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
++ ++ GDL +R++L Q+K++ + Q+R
Sbjct: 249 L---------EKKFRSEGGDLFEEREQLERQLKEIEAARKANRAQLR 286
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 36.2 bits (84), Expect = 0.068
Identities = 27/109 (24%), Positives = 50/109 (45%), Gaps = 10/109 (9%)
Query: 467 SPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSR-R 525
+ E+ER ++ + LE EL + LA K+LEE L EL L +K
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEE-------LRKELEELEKKYSEEEYE 662
Query: 526 EVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
E+ LS +A L AEL ++ ++++ ++L ++++ +
Sbjct: 663 ELREEYLELSR--ELAGLRAELEELEKRREEIKKTLEKLKEELEEREKA 709
Score = 34.7 bits (80), Expect = 0.21
Identities = 28/114 (24%), Positives = 56/114 (49%), Gaps = 8/114 (7%)
Query: 469 AEVERY-RRQQRLLERELSRVRSI---LAHNSKKLEETVAANARLESELVVLRQKLQWSR 524
EV + +R+ LE+ + R +I + K+LEE + + SEL LR++L+
Sbjct: 168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLE 227
Query: 525 REV--CNGTASLSNGPS--VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
+EV + +LE R+++ + +L+ + +EL ++++L EK
Sbjct: 228 KEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281
Score = 31.6 bits (72), Expect = 1.7
Identities = 30/165 (18%), Positives = 68/165 (41%), Gaps = 31/165 (18%)
Query: 430 SKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVR 489
+L + + + ++ +A+E ++ L E Y + R +E+ LSR+
Sbjct: 276 EELEEKVKELKELKEKAEEYIK--LSEFY-------------EEYLDELREIEKRLSRLE 320
Query: 490 SILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRR 549
+ ++++E RLE EL ++L+ R E L A + EL R
Sbjct: 321 EEINGIEERIKELEEKEERLE-ELKKKLKELE-KRLEELEERHELYE--EAKAKKEELER 376
Query: 550 VQALVGD------------LQRQRQELSAQVKQLTEKSNSLSQQI 582
++ + L++ ++E+ ++ ++T + L ++I
Sbjct: 377 LKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEI 421
Score = 30.8 bits (70), Expect = 3.0
Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 9/80 (11%)
Query: 448 EMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANA 507
E L L L K S E E R + L REL+ +R+ L K+ EE
Sbjct: 643 EELRKELEELEKKY-----SEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLE 697
Query: 508 RLESELVVLRQKLQWSRREV 527
+L+ EL + ++E+
Sbjct: 698 KLKEELEEREKA----KKEL 713
Score = 30.8 bits (70), Expect = 3.6
Identities = 26/135 (19%), Positives = 52/135 (38%), Gaps = 12/135 (8%)
Query: 450 LENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARL 509
+ + L LR ++ + E+E + + LE+EL + +K+ E L
Sbjct: 212 ISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEEL 271
Query: 510 ESELVVLRQK------LQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQE 563
+ E+ L +K L+ E + ELR ++ + L+ +
Sbjct: 272 KKEIEELEEKVKELKELKEKAEEYIKLSEFYEE------YLDELREIEKRLSRLEEEING 325
Query: 564 LSAQVKQLTEKSNSL 578
+ ++K+L EK L
Sbjct: 326 IEERIKELEEKEERL 340
Score = 29.6 bits (67), Expect = 6.7
Identities = 24/134 (17%), Positives = 48/134 (35%), Gaps = 26/134 (19%)
Query: 448 EMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL-SRVRSILAHNSKKLEETVAAN 506
E LE L SL + E+E + + EL +V+ + K+L+E
Sbjct: 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKEL-----KELKEKAEEY 295
Query: 507 ARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSA 566
+L +L+ +E L R++ + ++ + +EL
Sbjct: 296 IKLSEFYEEYLDELR--------------------EIEKRLSRLEEEINGIEERIKELEE 335
Query: 567 QVKQLTEKSNSLSQ 580
+ ++L E L +
Sbjct: 336 KEERLEELKKKLKE 349
>gnl|CDD|218852 pfam06008, Laminin_I, Laminin Domain I. coiled-coil structure. It
has been suggested that the domains I and II from
laminin A, B1 and B2 may come together to form a triple
helical coiled-coil structure.
Length = 263
Score = 35.4 bits (82), Expect = 0.082
Identities = 37/153 (24%), Positives = 56/153 (36%), Gaps = 22/153 (14%)
Query: 427 QIASKLHSRGRNICSVQGQA-KEMLENA---LGSLRHKMHGVHASPAEVERYRRQQRLLE 482
+I KL G N ++ G A ML A LG +R + AE E + + LL
Sbjct: 104 EILEKLAGLGENAFALPGSALSRMLAEAQRMLGEIRERDFQKQLQNAEAE-LKAAEELLN 162
Query: 483 RELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAA 542
R + ++ N E E++L LR+ L E T
Sbjct: 163 RVQTWFQTPQEENEGLFEALRDDLNEYEAKLSDLRELLD----EAQAKTR---------- 208
Query: 543 LEAE--LRRVQALVGDLQRQRQELSAQVKQLTE 573
+AE Q + + Q ++QE+S Q E
Sbjct: 209 -DAERLNLANQKRLREFQEKKQEVSELKNQAEE 240
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 35.5 bits (82), Expect = 0.092
Identities = 32/146 (21%), Positives = 56/146 (38%), Gaps = 15/146 (10%)
Query: 444 GQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRL--LERELSRVRSILAHNSKKL-- 499
+A + L+ L LR + + + A+VE +R Q L R L+ + +L
Sbjct: 194 RRASDSLDERLEELRAR---LQEAEAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQLQS 250
Query: 500 --EETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDL 557
A ARL S L +L + + + P++ L + +V+ + DL
Sbjct: 251 ARARLAQAEARLASLLQLLPLGREAAA------LREVLESPTIQDLRQQYAQVRQQIADL 304
Query: 558 QRQRQELSAQVKQLTEKSNSLSQQIR 583
+ Q+ L + L QQI
Sbjct: 305 STELGAKHPQLVALEAQLAELRQQIA 330
>gnl|CDD|147975 pfam06103, DUF948, Bacterial protein of unknown function (DUF948).
This family consists of bacterial sequences several of
which are thought to be general stress proteins.
Length = 90
Score = 32.5 bits (75), Expect = 0.11
Identities = 15/58 (25%), Positives = 26/58 (44%), Gaps = 7/58 (12%)
Query: 547 LRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI-----RPGPTGVAVA--GKSVN 597
L V + L +Q ++ + + L K+N L + + P AVA G+SV+
Sbjct: 28 LDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLEDVNGKVETLDPVFQAVADLGESVS 85
Score = 32.1 bits (74), Expect = 0.16
Identities = 8/36 (22%), Positives = 17/36 (47%)
Query: 37 LRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
L V + L +Q ++ + + L K+N L + +
Sbjct: 28 LDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLEDV 63
>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF. Sequences
in this family of proteins are members of the chain
length determinant family (pfam02706) which includes the
wzc protein from E.coli. This family of proteins are
homologous to the EpsF protein of the methanolan
biosynthesis operon of Methylobacillus species strain
12S. The distribution of this protein appears to be
restricted to a subset of exopolysaccharide operons
containing a syntenic grouping of genes including a
variant of the EpsH exosortase protein. Exosortase has
been proposed to be involved in the targetting and
processing of proteins containing the PEP-CTERM domain
to the exopolysaccharide layer.
Length = 444
Score = 35.2 bits (81), Expect = 0.11
Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 8/90 (8%)
Query: 481 LERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSV 540
L+ +++R S LA S++L R ++E+ L+ +L E+ T+S+ +
Sbjct: 259 LKTDIARAESKLAELSQRLGPNHPQYKRAQAEINSLKSQL---NAEIKKVTSSVGTNSRI 315
Query: 541 A-ALEAELR----RVQALVGDLQRQRQELS 565
EAELR +A V +L RQR E+S
Sbjct: 316 LKQREAELREALENQKAKVLELNRQRDEMS 345
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 34.7 bits (80), Expect = 0.13
Identities = 7/44 (15%), Positives = 20/44 (45%)
Query: 29 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
++ E+ + + ++Q + EL ++ Q + L ++I
Sbjct: 46 EKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEI 89
Score = 34.7 bits (80), Expect = 0.13
Identities = 7/44 (15%), Positives = 20/44 (45%)
Query: 539 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
++ E+ + + ++Q + EL ++ Q + L ++I
Sbjct: 46 EKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEI 89
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 35.0 bits (81), Expect = 0.14
Identities = 24/111 (21%), Positives = 45/111 (40%), Gaps = 20/111 (18%)
Query: 481 LERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPS- 539
LE L ++ +K L + S LR+KLQ + E+ ASL
Sbjct: 152 LEENLEGMKRDEELLNKDLNL-------INSIKPKLRKKLQALKEEI----ASLRQLADE 200
Query: 540 --------VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
+ ELR + + + ++Q +EL ++++LT +L+ +
Sbjct: 201 LNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKK 251
>gnl|CDD|205229 pfam13048, DUF3908, Protein of unknown function (DUF3908). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and viruses.
Proteins in this family are approximately 140 amino
acids in length. There is a single completely conserved
residue Y that may be functionally important.
Length = 125
Score = 33.1 bits (76), Expect = 0.16
Identities = 28/107 (26%), Positives = 41/107 (38%), Gaps = 13/107 (12%)
Query: 197 VLSAYIRYV-SARLADLETPKRMEHLQKLEELKRHLVELEKQVSITFNLTERNEKIPLKF 255
++V + D E K E L+K++EL L E +V NL N+K+ L
Sbjct: 1 NYEEVDQWVEGSNARDEEDRKLRELLKKVKEL---LDEKNIKVIYPKNLFNDNKKLELII 57
Query: 256 V-NDGDYI--------QAVLDRNLAENISRVLYPNDNNFGGKELRLK 293
D I ++ E+I + NN G K L LK
Sbjct: 58 FLEDNKLIVIIENEEETITIEILKLEDIKKSELETKNNNGNKNLTLK 104
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 33.4 bits (77), Expect = 0.16
Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 6/91 (6%)
Query: 497 KKLEETVAANARLESELVVLRQKLQWSRREVCNGTASL---SNGPSVAALEAELRRVQAL 553
+ AA +LE E L+ +LQ +E+ L + S A +A+ + +Q
Sbjct: 28 SESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQK 87
Query: 554 VGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 584
+LQ+++Q ++L +K L Q I
Sbjct: 88 QQELQQKQQAAQ---QELQQKQQELLQPIYD 115
Score = 30.7 bits (70), Expect = 1.6
Identities = 14/51 (27%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 24 LSNGPSVAALEAELR-RVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
LS P+ A + +L + L +LQ++ +EL + ++L +++ +LS++ R
Sbjct: 27 LSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEAR 77
Score = 30.7 bits (70), Expect = 1.6
Identities = 14/51 (27%), Positives = 31/51 (60%), Gaps = 1/51 (1%)
Query: 534 LSNGPSVAALEAELR-RVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
LS P+ A + +L + L +LQ++ +EL + ++L +++ +LS++ R
Sbjct: 27 LSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEAR 77
Score = 28.4 bits (64), Expect = 9.3
Identities = 12/46 (26%), Positives = 23/46 (50%), Gaps = 3/46 (6%)
Query: 29 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 74
S A +A+ + +Q +LQ+++Q ++L +K L Q I
Sbjct: 73 SEEARKAKQQELQQKQQELQQKQQAAQ---QELQQKQQELLQPIYD 115
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 34.6 bits (79), Expect = 0.21
Identities = 25/137 (18%), Positives = 60/137 (43%), Gaps = 13/137 (9%)
Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
+ L A G R A+ +E+++ R+++ + +EL+ R LA ++L
Sbjct: 85 ALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTK 144
Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
L++ L L ++ + + +L+A +++QA L+ Q +L
Sbjct: 145 QAQDLQTRLKTLAEQRRQLEAQA-------------QSLQASQKQLQASATQLKSQVLDL 191
Query: 565 SAQVKQLTEKSNSLSQQ 581
+ Q+ +++ +L+ +
Sbjct: 192 KLRSAQIEQEAQNLATR 208
>gnl|CDD|115928 pfam07303, Occludin_ELL, Occludin homology domain. This domain
represents a conserved region approximately 100
residues long within eukaryotic occludin proteins and
the RNA polymerase II elongation factor ELL. Occludin
is an integral membrane protein that localises to tight
junctions, while ELL is an elongation factor that can
increase the catalytic rate of RNA polymerase II
transcription by suppressing transient pausing by
polymerase at multiple sites along the DNA. This shared
domain is thought to mediate protein interactions.
Length = 101
Score = 31.9 bits (73), Expect = 0.24
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 35 AELRRVQALVGDLQRQRQELSAQVKQLTEKS 65
E + +QA V + ++ Q+L A++ L E S
Sbjct: 22 QEYKELQAEVDAINKRLQQLDAELDSLPEGS 52
Score = 31.9 bits (73), Expect = 0.24
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 545 AELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
E + +QA V + ++ Q+L A++ L E S
Sbjct: 22 QEYKELQAEVDAINKRLQQLDAELDSLPEGS 52
>gnl|CDD|191973 pfam08232, Striatin, Striatin family. Striatin is an
intracellular protein which has a caveolin-binding
motif, a coiled-coil structure, a calmodulin-binding
site, and a WD (pfam00400) repeat domain. It acts as a
scaffold protein and is involved in signalling
pathways.
Length = 133
Score = 32.7 bits (75), Expect = 0.24
Identities = 14/59 (23%), Positives = 23/59 (38%)
Query: 30 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTGVAGKSAQGST 88
+A LE E R ++ L DL+R+ + L +KQ K L + +
Sbjct: 34 IAQLEGERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQKPSKEEEEEELE 92
Score = 31.6 bits (72), Expect = 0.52
Identities = 14/47 (29%), Positives = 21/47 (44%)
Query: 540 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGP 586
+A LE E R ++ L DL+R+ + L +KQ K L
Sbjct: 34 IAQLEGERRGLENLKEDLKRRIKMLEYALKQERAKLKKLKYGTDSPQ 80
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 34.4 bits (79), Expect = 0.25
Identities = 24/117 (20%), Positives = 44/117 (37%), Gaps = 12/117 (10%)
Query: 470 EVERYRRQQRLLE---RELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRRE 526
E+E R LE +L + L +KLE+ + L E L + L+ E
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLE----E 361
Query: 527 VCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
+ LE EL + + L+ QEL ++ +L+ + +++
Sbjct: 362 RLK-----ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELE 413
Score = 34.4 bits (79), Expect = 0.27
Identities = 27/143 (18%), Positives = 53/143 (37%), Gaps = 16/143 (11%)
Query: 447 KEMLENALGSLRHKMHGVHASPAEVE-RYRRQQRLL--------ERELSRVRSILAHNSK 497
+ L LR ++ + E+E R + + LL E EL L +
Sbjct: 581 LRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELE 640
Query: 498 KLEETVAANARLESELVVLRQKLQWSRREV-------CNGTASLSNGPSVAALEAELRRV 550
KL L++ L L +K++ E+ N + LE EL ++
Sbjct: 641 KLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQL 700
Query: 551 QALVGDLQRQRQELSAQVKQLTE 573
+ + +L ++ E+ +++L
Sbjct: 701 REELEELLKKLGEIEQLIEELES 723
Score = 31.3 bits (71), Expect = 2.3
Identities = 29/113 (25%), Positives = 44/113 (38%), Gaps = 8/113 (7%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
E+ R R + EL +R L KKL+E S+L L Q L+ S E
Sbjct: 573 ELLEELRLLRTRKEELEELRERLKELKKKLKEL----EERLSQLEELLQSLELSEAENEL 628
Query: 530 GTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
A + + +L L LQ +EL +V++L + Q+I
Sbjct: 629 EEAEE----ELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI 677
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 34.3 bits (78), Expect = 0.30
Identities = 57/281 (20%), Positives = 89/281 (31%), Gaps = 23/281 (8%)
Query: 61 LTEKSNSLSQQIRPGPTGVAGK-SAQGSTSPSGRSTAAETGTERTGQTADGEIQLFES-- 117
L K++ I T G S +G RS +A E D E + +
Sbjct: 70 LENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKANPNEA 129
Query: 118 -------ADQARTHWSSGPGSTGVAGKSAQGSTSPSGRSTAAETGTERTGQTADGEIQLF 170
+Q SS G + + KS S S G S +++ +G+ A +
Sbjct: 130 GKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGK-ASASVAGI 188
Query: 171 ESADQARTHWSNPGLMCILSSYVYYCVLSAYIRYVSARLADLETPKRMEHLQKLEELKRH 230
AD+ L + Y L A + +++ K EE
Sbjct: 189 VGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEIDILKNDGDKVAEEEEFEL 248
Query: 231 LVELEKQVSITFNLTERNEKI-------PLKFVNDGDYIQ-AVLDRNLAENISRVLYPND 282
E E+ + E EKI KF + +I+ V DRN+A S
Sbjct: 249 DEEHEE---AEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRNIAGGFSEFF-SKL 304
Query: 283 NNFGGKELRLKQEYFMCAATLQVISLPFKDQGRALLGYSHS 323
N F E K+E + + K++ LG
Sbjct: 305 NPFKKDEKIEKKEISKITFSGIGDKITDKEKIAKDLGIGAK 345
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 32.2 bits (74), Expect = 0.35
Identities = 25/107 (23%), Positives = 45/107 (42%), Gaps = 11/107 (10%)
Query: 479 RLLE--RELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN 536
RLLE RE+ V+ L ++ E + E EL ++LQ S +
Sbjct: 1 RLLEKRREMEEVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDK------- 53
Query: 537 GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
+ EA+ RR + + ++ R+E ++K+L + L +I
Sbjct: 54 --FLKENEAKRRRAEKKAEEEKKLRKEKEEEIKELKAELEELKAEIE 98
Score = 31.8 bits (73), Expect = 0.48
Identities = 14/103 (13%), Positives = 37/103 (35%), Gaps = 27/103 (26%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
E +++ L+ L + L N K A + E E + + +
Sbjct: 32 QREEELEKKEEELQESLIKFDKFLKENEAKRRR---AEKKAEEEK---KLRKE------- 78
Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQL 571
E E++ ++A + +L+ + ++L ++++
Sbjct: 79 --------------KEEEIKELKAELEELKAEIEKLEEKLEEY 107
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 33.4 bits (77), Expect = 0.45
Identities = 21/76 (27%), Positives = 32/76 (42%), Gaps = 6/76 (7%)
Query: 507 ARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSA 566
RL S++ L L R+ G L + SVA L A L +A LQ EL+
Sbjct: 56 DRLNSQIAELADLLSLERQ----GNQDLQD--SVANLRASLSAAEAERSRLQALLAELAG 109
Query: 567 QVKQLTEKSNSLSQQI 582
++ L+Q++
Sbjct: 110 AGAAAEGRAGELAQEL 125
Score = 31.1 bits (71), Expect = 2.0
Identities = 13/44 (29%), Positives = 20/44 (45%)
Query: 29 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
SVA L A L +A LQ EL+ ++ L+Q++
Sbjct: 82 SVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQEL 125
>gnl|CDD|221084 pfam11336, DUF3138, Protein of unknown function (DUF3138). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 514
Score = 33.4 bits (76), Expect = 0.46
Identities = 36/141 (25%), Positives = 60/141 (42%), Gaps = 28/141 (19%)
Query: 14 RREVCNGTASLSNGPSVAALEA-ELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
++ +C A G + AA +A +++ +QA + LQ+Q EL A +
Sbjct: 3 KKLICALIAGALPGAAAAASDASQIKALQAQLTALQQQVNELRAALAA------------ 50
Query: 73 RPGPTGVAGK----SAQGSTSPSGRSTAAETG---TERTGQTADGEIQLFESADQARTHW 125
+P G K +A + +PS + AA T T+ Q A+ +++ D A T
Sbjct: 51 KPAAAGGGAKIQSAAAAAAAAPSSDAAAALTNDDVTQMREQIANQSLKVDSLTDAATT-- 108
Query: 126 SSGPGSTGVAGKSAQGSTSPS 146
GP +AG S G P+
Sbjct: 109 --GP----IAGLSVTGYLDPT 123
>gnl|CDD|113664 pfam04899, MbeD_MobD, MbeD/MobD like. The MbeD and MobD proteins
are plasmid encoded, and are involved in the plasmids
mobilisation and transfer in the presence of
conjugative plasmids.
Length = 70
Score = 30.2 bits (68), Expect = 0.50
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 41 QALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
Q + G QR+ LS QV L+++ SLS+Q+R
Sbjct: 34 QRMFGLTQRENAALSEQVTGLSQQVQSLSEQVR 66
Score = 30.2 bits (68), Expect = 0.50
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 551 QALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
Q + G QR+ LS QV L+++ SLS+Q+R
Sbjct: 34 QRMFGLTQRENAALSEQVTGLSQQVQSLSEQVR 66
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 33.5 bits (77), Expect = 0.50
Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 34/139 (24%)
Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
+ + LE LG L+ ++ A+P E+E Q + + L + + + E+ A
Sbjct: 442 EEEYELELRLGRLKQRLDSATATPEELE----QLEINDEALEKAQE-------EQEQAEA 490
Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
+L+SEL LR++ + A AE R L + RQ L
Sbjct: 491 NVEQLQSELRQLRKR---------------RDEALEALQRAERR--------LLQLRQAL 527
Query: 565 SAQVKQLTEKSNSLSQQIR 583
QL+ ++ SL +R
Sbjct: 528 DELELQLSPQAGSLLHFLR 546
Score = 32.4 bits (74), Expect = 1.1
Identities = 19/104 (18%), Positives = 43/104 (41%), Gaps = 9/104 (8%)
Query: 489 RSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN-----GPSVAAL 543
R L + L+ VA + E +LV +L+ +R +L
Sbjct: 606 RERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQ 665
Query: 544 EAELRRVQALVGD----LQRQRQELSAQVKQLTEKSNSLSQQIR 583
++ +++ + + + Q ++L AQ+KQL E+ + + ++
Sbjct: 666 QSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALK 709
Score = 30.4 bits (69), Expect = 5.0
Identities = 26/161 (16%), Positives = 54/161 (33%), Gaps = 24/161 (14%)
Query: 430 SKLHSRGRNICSVQGQAKEMLENAL----GSLRHKMHGVHASPAEVERYRRQQRLLEREL 485
K + R I + + K +E L LR ++ + ++ LE L
Sbjct: 398 EKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGK------LEFNEEEYELELRL 451
Query: 486 SRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEA 545
R + +A E + + ++ E + +V L++
Sbjct: 452 GR------------LKQRLDSATATPEEL-EQLEINDEALEKAQEEQEQAEA-NVEQLQS 497
Query: 546 ELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGP 586
ELR+++ + Q ++ QL + + L Q+ P
Sbjct: 498 ELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQA 538
Score = 29.3 bits (66), Expect = 9.0
Identities = 16/47 (34%), Positives = 22/47 (46%), Gaps = 1/47 (2%)
Query: 28 PSVAALEAELR-RVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
P AA E ELR R+Q LQ + +QL + + L +Q R
Sbjct: 596 PDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKR 642
>gnl|CDD|221757 pfam12761, End3, Actin cytoskeleton-regulatory complex protein
END3. Endocytosis is accomplished through the
sequential recruitment at endocytic sites of proteins
that drive cargo sorting, membrane invagination and
vesicle release. End3p is part of the coat module
protein complex Pan1, along with Pan1p, Sla1p, and
Sla2p. The proteins in this complex are regulated by
phosphorylation events. End3p also regulates the
cortical actin cytoskeleton. The subunits of the Pan1
complex are homologous to mammalian intersectin.
Length = 193
Score = 32.3 bits (74), Expect = 0.59
Identities = 23/104 (22%), Positives = 44/104 (42%), Gaps = 11/104 (10%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVV--LRQKLQWSRREV 527
E R RR+ LE +LS+ + E +A+ S LV L Q L++ ++
Sbjct: 97 EEVRLRRELAELESKLSKAQQ---------ESESRNDAKPNSALVKRELEQLLKYKENQL 147
Query: 528 CNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQL 571
SNG ++ ++ ++ ++ V L+ ++ QL
Sbjct: 148 RELGGRSSNGVNLGSVREDIDSIEEQVTGLEEHLASRKEELAQL 191
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.8 bits (75), Expect = 0.63
Identities = 33/163 (20%), Positives = 54/163 (33%), Gaps = 29/163 (17%)
Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
+ + L+ L L A AE+ + + L E ++ LA + LEE
Sbjct: 161 ERIDALKATLKQLA-------AVRAEIAAEQAELTTLLSEQRAQQAKLA---QLLEERKK 210
Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
A+L SEL ++KL+ R + L N +A+ EA + + E
Sbjct: 211 TLAQLNSELSADQKKLEELRAN----ESRLKN--EIASAEAAAAKARE-----AAAAAEA 259
Query: 565 SAQVKQLTEKSNSLSQQIRPGPTGVAVAGKSVNRPGPTGVAGK 607
+A + E + P TG G
Sbjct: 260 AAARARAAEAKRTGETYKPTAP--------EKMLISSTGGFGA 294
Score = 31.2 bits (71), Expect = 2.1
Identities = 11/40 (27%), Positives = 21/40 (52%)
Query: 34 EAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
+ E+ ++ + + Q QR +L Q+K L + SL Q+
Sbjct: 44 QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLI 83
Score = 31.2 bits (71), Expect = 2.1
Identities = 11/40 (27%), Positives = 21/40 (52%)
Query: 544 EAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
+ E+ ++ + + Q QR +L Q+K L + SL Q+
Sbjct: 44 QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLI 83
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 31.0 bits (71), Expect = 0.64
Identities = 12/53 (22%), Positives = 24/53 (45%), Gaps = 10/53 (18%)
Query: 30 VAALEAELRRVQALVGDLQRQRQELSAQVKQ----------LTEKSNSLSQQI 72
+ L+ E R +Q + +LQ +R ELS ++ + L + L ++
Sbjct: 31 LLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKDEL 83
Score = 31.0 bits (71), Expect = 0.64
Identities = 12/53 (22%), Positives = 24/53 (45%), Gaps = 10/53 (18%)
Query: 540 VAALEAELRRVQALVGDLQRQRQELSAQVKQ----------LTEKSNSLSQQI 582
+ L+ E R +Q + +LQ +R ELS ++ + L + L ++
Sbjct: 31 LLELDEERRELQVELEELQAERNELSKEIGKAKKKKEDAEALIAEVKELKDEL 83
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 32.4 bits (74), Expect = 0.64
Identities = 27/159 (16%), Positives = 61/159 (38%), Gaps = 23/159 (14%)
Query: 426 DQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL 485
+ + ++L + + ++ ++++E H E++ R + L +R
Sbjct: 43 EALEAELAAHEERVEALNELGEQLIEEG-----------HPDAEEIQE--RLEELNQRW- 88
Query: 486 SRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEA 545
+R + ++LEE + L+ QW + AS G + ++E
Sbjct: 89 EELRELAEERRQRLEEAL----DLQQFFRDADDLEQWLEEKEAAL-ASEDLGKDLESVEE 143
Query: 546 ELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 584
L++ + L +L+ L K L E + L ++ P
Sbjct: 144 LLKKHKELEEELEAHEPRL----KSLNELAEELLEEGHP 178
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 32.3 bits (74), Expect = 0.66
Identities = 27/127 (21%), Positives = 48/127 (37%), Gaps = 34/127 (26%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
E++ + + LE EL+ + + K++E+ RLE L
Sbjct: 97 EIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEA----------- 145
Query: 530 GTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN----SLSQQIRPG 585
LE E+ ++ + QELS++ ++L EK + S ++IR
Sbjct: 146 ------------RLEEEVAE-------IREEGQELSSKREELKEKLDPELLSEYERIRKN 186
Query: 586 PTGVAVA 592
GV V
Sbjct: 187 KKGVGVV 193
Score = 32.0 bits (73), Expect = 0.89
Identities = 22/115 (19%), Positives = 46/115 (40%), Gaps = 20/115 (17%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQ-WSRREVC 528
E+E Q LE E+ +R + +KL + + E EL L ++Q R
Sbjct: 53 ELEDLENQVSQLESEIQEIRERIKRAEEKL-----SAVKDERELRALNIEIQIAKERIN- 106
Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
+LE EL + + L+++ ++L ++++L + ++
Sbjct: 107 -------------SLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLE 148
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 33.0 bits (75), Expect = 0.72
Identities = 21/103 (20%), Positives = 39/103 (37%)
Query: 478 QRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNG 537
+ L+ E + S + KKLE A RL+ L ++ ++ R + + L+
Sbjct: 373 LQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDKELTET 432
Query: 538 PSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQ 580
L L + LV +Q ++ Q+ +L E
Sbjct: 433 SVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKD 475
Score = 30.3 bits (68), Expect = 4.6
Identities = 23/115 (20%), Positives = 47/115 (40%), Gaps = 6/115 (5%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
E++ + Q ER LS V+ K+E RL E +L KL+ +
Sbjct: 482 TEIKLLKEQLVSNERLLSFVKEATNALRLKIETLERERDRLRQEKSLLEMKLEHLCLQGD 541
Query: 529 NGTAS-----LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSL 578
+ +S P+ A + + ++AL + ++ ++ L A +++ + L
Sbjct: 542 YNASRTKVLHMSLNPASEAEQIAKQTIEALQAECEKLKERLQA-LEEGKSQPGDL 595
>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin.
Length = 237
Score = 32.2 bits (74), Expect = 0.77
Identities = 25/68 (36%), Positives = 32/68 (47%), Gaps = 2/68 (2%)
Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
+A+E LE A L + AEV R+ +LLE +L R LA +KLEE
Sbjct: 12 KAEERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLATALEKLEEAEK 71
Query: 505 ANARLESE 512
A ESE
Sbjct: 72 AAD--ESE 77
>gnl|CDD|191851 pfam07798, DUF1640, Protein of unknown function (DUF1640). This
family consists of sequences derived from hypothetical
eukaryotic proteins. A region approximately 100 residues
in length is featured.
Length = 177
Score = 31.8 bits (73), Expect = 0.79
Identities = 14/53 (26%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 470 EVERYRRQQRL-LERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQ 521
E+ + RL L E R+R A K++ET + ++++E+ L+ +L+
Sbjct: 99 EITKVTAGVRLDLNLEKGRIREESAEQELKIKET---DTKIDTEIAGLKTQLE 148
Score = 28.7 bits (65), Expect = 6.9
Identities = 22/124 (17%), Positives = 50/124 (40%), Gaps = 22/124 (17%)
Query: 460 KMHGVHASPAEVERYRRQQRLLERELSRVRS-ILAHNSKKLEETVAANARLESELVVLRQ 518
+ + + E+ QQ + + ++++S + + + A N +L+++L L+
Sbjct: 38 NVSKDMVTKEQQEKTEYQQ---KVDFAKLKSELQSLEKSEFANLQAENEKLKNDLEKLKN 94
Query: 519 KLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSL 578
+L R E+ TA L+ L + R R+E + Q ++ E +
Sbjct: 95 RL---RDEITKVTAG-------VRLDLNLEK--------GRIREESAEQELKIKETDTKI 136
Query: 579 SQQI 582
+I
Sbjct: 137 DTEI 140
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 31.1 bits (71), Expect = 0.85
Identities = 23/114 (20%), Positives = 46/114 (40%), Gaps = 17/114 (14%)
Query: 472 ERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGT 531
E + + L+ +L I +K E + +A EL LR++L
Sbjct: 20 EDAEEKIQKLQEDLEEQAEIANEAQQKYERELVKHAEDIEELQALRKQLN---------- 69
Query: 532 ASLSNGPSVAALEAELRRVQALVGDLQR----QRQELSAQVKQLTEKSNSLSQQ 581
+A L+AE QA + + + Q++ L ++ +L ++ L++Q
Sbjct: 70 ---ELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDELSELEKRIEELNEQ 120
>gnl|CDD|117396 pfam08826, DMPK_coil, DMPK coiled coil domain like. This domain is
found in the myotonic dystrophy protein kinase (DMPK)
and adopts a coiled coil structure. It plays a role in
dimerisation.
Length = 61
Score = 29.4 bits (66), Expect = 0.88
Identities = 15/51 (29%), Positives = 30/51 (58%)
Query: 476 RQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRRE 526
R ++ L+ EL +V++ + KL+E A N LE+E+ L+++++ R
Sbjct: 11 RAKQSLQEELEKVKAANINFESKLQEAEAKNRELEAEVRQLKKRMEELRAR 61
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 32.6 bits (75), Expect = 1.1
Identities = 21/107 (19%), Positives = 44/107 (41%), Gaps = 27/107 (25%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVC 528
+E+E+ RQQ+ ER +LA K+L + + LE L +L+
Sbjct: 523 SELEQRLRQQQRAER-------LLAEFCKRLGKNLDDEDELEQLQEELEARLE------- 568
Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
+L + + L++Q ++L A++++L ++
Sbjct: 569 -------------SLSESVSEARERRMALRQQLEQLQARIQRLAARA 602
Score = 30.7 bits (70), Expect = 3.9
Identities = 38/177 (21%), Positives = 65/177 (36%), Gaps = 37/177 (20%)
Query: 426 DQIASKLHSRGRNICSVQG---QAKEMLEN------ALGSLRHKMHGVHASPAE------ 470
+ I S L S ++ QA++ + AL + + H S +
Sbjct: 924 EPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRA--HFSYEDAAEMLA 981
Query: 471 -----VERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRR 525
E+ R++ E+E +R R L +L + A L+S RQ LQ ++
Sbjct: 982 KNSDLNEKLRQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQ 1041
Query: 526 EVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
E L + A AE R + +R EL A++ + N L +Q+
Sbjct: 1042 E-------LQDLGVPADSGAEER--------ARARRDELHARLSANRSRRNQLEKQL 1083
Score = 29.5 bits (67), Expect = 8.4
Identities = 12/36 (33%), Positives = 17/36 (47%)
Query: 466 ASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEE 501
+V Y +Q EREL+ R LA + L+E
Sbjct: 624 EDSQDVTEYMQQLLERERELTVERDELAARKQALDE 659
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 29.5 bits (67), Expect = 1.2
Identities = 12/42 (28%), Positives = 23/42 (54%)
Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
L E+ +QA + L+ + +EL A+VK L + + ++ R
Sbjct: 21 QLNQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERAR 62
Score = 29.5 bits (67), Expect = 1.2
Identities = 12/42 (28%), Positives = 23/42 (54%)
Query: 542 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
L E+ +QA + L+ + +EL A+VK L + + ++ R
Sbjct: 21 QLNQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERAR 62
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 31.6 bits (72), Expect = 1.3
Identities = 27/111 (24%), Positives = 48/111 (43%), Gaps = 8/111 (7%)
Query: 472 ERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVL---RQKLQWSRREVC 528
E R+Q+ LE + ++ + K+LEE LE +L Q L+ E+
Sbjct: 1 EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60
Query: 529 NGTASLSNGPSVAA-----LEAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
L + + LEAE+ A V L+ +R++ A+ +QL ++
Sbjct: 61 EENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQE 111
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 31.3 bits (71), Expect = 1.3
Identities = 17/89 (19%), Positives = 38/89 (42%), Gaps = 9/89 (10%)
Query: 495 NSKKLEETVAANARLE---SELVVLRQKLQWSRREVCNGTASLS------NGPSVAALEA 545
K E +L L V LQ + E + +S + ++A +E
Sbjct: 15 REKAFSELFEQYRQLADRTRLLKVENDALQAEKYEQQSSHSSSPSADGPGSDAAIAEMEQ 74
Query: 546 ELRRVQALVGDLQRQRQELSAQVKQLTEK 574
+L +++ + +L ++R EL+ ++ L ++
Sbjct: 75 KLAKLREELTELHKKRGELAQRLLLLNDE 103
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 32.3 bits (73), Expect = 1.3
Identities = 23/118 (19%), Positives = 48/118 (40%), Gaps = 14/118 (11%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCN 529
E++ + + + +E LS++ KL+ + A + ++ +L+ +V
Sbjct: 246 ELDPLKNRLKEIEHNLSKIM--------KLDNEIKALKSRKKQMEKDNSELELKMEKVFQ 297
Query: 530 GTASLSN------GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQ 581
GT N +V E EL Q + L ++R+ L+ + +L + L Q
Sbjct: 298 GTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQ 355
>gnl|CDD|236989 PRK11809, putA, trifunctional transcriptional regulator/proline
dehydrogenase/pyrroline-5-carboxylate dehydrogenase;
Reviewed.
Length = 1318
Score = 32.3 bits (74), Expect = 1.4
Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 5/62 (8%)
Query: 22 ASLSNGPSVAALEAELRRV-----QALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGP 76
+L+ + ++A+LR AL + EL+A Q E + + + ++ PGP
Sbjct: 1117 VTLARQDAEYPVDAQLRAALLAPLTALREWAAEREPELAALCDQYAELAQAGTTRLLPGP 1176
Query: 77 TG 78
TG
Sbjct: 1177 TG 1178
Score = 32.3 bits (74), Expect = 1.4
Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 5/62 (8%)
Query: 532 ASLSNGPSVAALEAELRRV-----QALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGP 586
+L+ + ++A+LR AL + EL+A Q E + + + ++ PGP
Sbjct: 1117 VTLARQDAEYPVDAQLRAALLAPLTALREWAAEREPELAALCDQYAELAQAGTTRLLPGP 1176
Query: 587 TG 588
TG
Sbjct: 1177 TG 1178
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 31.0 bits (71), Expect = 1.4
Identities = 32/125 (25%), Positives = 55/125 (44%), Gaps = 26/125 (20%)
Query: 479 RLLERELSRVRSILAHNSKKLEETVAANARL-------ESELVVLRQKLQWSRREVCNGT 531
+ L+ E++ ++ HN K + E N RL E E+ LR+KL+ ++
Sbjct: 30 KSLKEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDK---- 85
Query: 532 ASLSNGPS-VAALEAELRRVQ-------ALVGDLQRQRQELSAQ-------VKQLTEKSN 576
SL N + + LE EL+ ++ ++R+R EL + V+Q T N
Sbjct: 86 QSLKNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAAIQDVQQKTGLKN 145
Query: 577 SLSQQ 581
L +Q
Sbjct: 146 LLLEQ 150
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 31.9 bits (73), Expect = 1.4
Identities = 30/124 (24%), Positives = 52/124 (41%), Gaps = 18/124 (14%)
Query: 466 ASPAEVERYRRQQRLLERELSRVRSILA---HNSKKLEETVAANARLESELVVLRQKLQW 522
A P ++ QQ L E S +R+ L K+LE +A L+++L LRQ+L+
Sbjct: 123 AVPELIKG---QQSLFESRKSTLRAQLELILAQIKQLEAELA---GLQAQLQALRQQLEV 176
Query: 523 SRREVCNGTASLSN---GPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 579
E+ L + LE E R + Q + L A+++ L + + L
Sbjct: 177 ISEEL-EARRKLKEKGLVSRLELLELERERA-----EAQGELGRLEAELEVLKRQIDELQ 230
Query: 580 QQIR 583
+ +
Sbjct: 231 LERQ 234
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 32.2 bits (74), Expect = 1.4
Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 17/125 (13%)
Query: 467 SPAEVERYRRQQRLLERELSRVRSILAHNS-------KKLEETVAANARLESELVVLRQK 519
P++ + +Q LL + R L N+ K+ + A RLE +L +L++
Sbjct: 190 RPSQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEA 249
Query: 520 LQWSRREVCNGTASLSNGPSVAALEAELRRVQA---LVGDLQRQRQELSAQVKQLTEKSN 576
+ R + T + AA R+QA + +L+ Q LS ++ + TEK N
Sbjct: 250 INSKRLTLSEKTVQEAQSQDEAA------RIQANPLVAQELEINLQ-LSQRLLKATEKLN 302
Query: 577 SLSQQ 581
+L+QQ
Sbjct: 303 TLTQQ 307
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 31.8 bits (73), Expect = 1.5
Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 6/82 (7%)
Query: 500 EETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQR 559
E+ L+ E++ L+Q+L+ RE A A + EL ++ L +L+
Sbjct: 138 EDPENLLHALQQEVLTLKQQLELQAREKAQSQALA------EAQQQELVALEGLAAELEE 191
Query: 560 QRQELSAQVKQLTEKSNSLSQQ 581
++QEL AQ++QL EK+ SQ+
Sbjct: 192 KQQELEAQLEQLQEKAAETSQE 213
Score = 29.5 bits (67), Expect = 9.1
Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 6/71 (8%)
Query: 1 SELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQ 60
E++ L+Q+L+ RE A A + EL ++ L +L+ ++QEL AQ++Q
Sbjct: 149 QEVLTLKQQLELQAREKAQSQALA------EAQQQELVALEGLAAELEEKQQELEAQLEQ 202
Query: 61 LTEKSNSLSQQ 71
L EK+ SQ+
Sbjct: 203 LQEKAAETSQE 213
>gnl|CDD|165553 PHA03296, PHA03296, envelope glycoprotein H; Provisional.
Length = 814
Score = 31.9 bits (72), Expect = 1.6
Identities = 17/65 (26%), Positives = 27/65 (41%), Gaps = 9/65 (13%)
Query: 664 DLDMNSTLDVAIESPCISYPSSPQ-HMTSSPLPSSPTSTSIP-----LYVNTDSGTDVKL 717
D+ +N +VA S CI+ P+SP M ++P + + L D V +
Sbjct: 203 DIKINIGKNVA--SLCIASPASPPIEMIATPRSAKLKESRFKQWPRMLKA-EDQELSVLI 259
Query: 718 NGKTG 722
N G
Sbjct: 260 NVAAG 264
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 31.7 bits (72), Expect = 1.6
Identities = 39/156 (25%), Positives = 64/156 (41%), Gaps = 47/156 (30%)
Query: 448 EMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANA 507
E LE L L+H+ + + AE + L+ E+ +L +S K
Sbjct: 270 EELEKELAELQHRNDELTSLAAES-------QALKDEID----VLRESSDKA-------K 311
Query: 508 RLESELVVLRQKLQWS---RREV-----CNGTASLSNGPSVAALEAELRRVQALVGDLQ- 558
+LE+++ ++KL+ RR+V N + LE EL++ A G L+
Sbjct: 312 KLEAQVETYKKKLEDLNDLRRQVKLLEERNAMYMQNT----VQLEEELKKANAARGQLET 367
Query: 559 --RQRQELSAQV--------------KQLTEKSNSL 578
RQ QEL A++ K+L EK +L
Sbjct: 368 YKRQVQELHAKLSEESKKADKLEFEYKRLEEKLEAL 403
>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal. This domain is the C-terminal
region of the CASP family of proteins. It is a Golgi
membrane protein which is thought to have a role in
vesicle transport.
Length = 245
Score = 31.2 bits (71), Expect = 1.7
Identities = 17/72 (23%), Positives = 28/72 (38%), Gaps = 2/72 (2%)
Query: 33 LEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTGVAGKSAQGSTSPSG 92
LE ELR+ + L+R+ + L A +L EK L Q G + + + S S
Sbjct: 98 LEEELRKQNQTISSLRRELESLKADNIKLYEKIRYL--QSYQGNSPPSDSAVSNSPGRSS 155
Query: 93 RSTAAETGTERT 104
+ +
Sbjct: 156 LQPGPGSIDVES 167
Score = 30.1 bits (68), Expect = 3.7
Identities = 16/59 (27%), Positives = 23/59 (38%)
Query: 543 LEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTGVAVAGKSVNRPGP 601
LE ELR+ + L+R+ + L A +L EK L P + S R
Sbjct: 98 LEEELRKQNQTISSLRRELESLKADNIKLYEKIRYLQSYQGNSPPSDSAVSNSPGRSSL 156
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 31.0 bits (71), Expect = 1.9
Identities = 12/42 (28%), Positives = 21/42 (50%)
Query: 31 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
L AE+R++ + +L+ +L V ++ SL QQI
Sbjct: 53 QELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQI 94
Score = 31.0 bits (71), Expect = 1.9
Identities = 12/42 (28%), Positives = 21/42 (50%)
Query: 541 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
L AE+R++ + +L+ +L V ++ SL QQI
Sbjct: 53 QELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIASLQQQI 94
>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats.
Length = 101
Score = 29.2 bits (66), Expect = 2.0
Identities = 13/55 (23%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 22 ASLSNGPSVAALEAELRRVQALVGDLQRQR---QELSAQVKQLTEKSNSLSQQIR 73
AS G + ++EA L++ +A +L+ + L+ +QL E+ + +++I
Sbjct: 22 ASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIE 76
Score = 29.2 bits (66), Expect = 2.0
Identities = 13/55 (23%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 532 ASLSNGPSVAALEAELRRVQALVGDLQRQR---QELSAQVKQLTEKSNSLSQQIR 583
AS G + ++EA L++ +A +L+ + L+ +QL E+ + +++I
Sbjct: 22 ASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIE 76
>gnl|CDD|182745 PRK10803, PRK10803, tol-pal system protein YbgF; Provisional.
Length = 263
Score = 30.9 bits (70), Expect = 2.1
Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 2/78 (2%)
Query: 33 LEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTGVAGKSAQGSTSPSG 92
L+ +L Q+ + L+ Q QE Q+ Q+ E+ + QI +G G +AQ ++
Sbjct: 59 LQQQLSDNQSDIDSLRGQIQENQYQLNQVVERQKQIYLQIDSLSSG--GAAAQSTSGDQS 116
Query: 93 RSTAAETGTERTGQTADG 110
+ A+ T G G
Sbjct: 117 GAAASATPAADAGTANAG 134
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 31.3 bits (71), Expect = 2.2
Identities = 30/157 (19%), Positives = 58/157 (36%), Gaps = 16/157 (10%)
Query: 442 VQGQAKEMLENALGSLRHKMHGVHASP----AEVERYRRQQRLLERELSRVRSILAHNSK 497
+ A EML + A E+ + E+E+S + L
Sbjct: 777 AEAVAPEMLGTPADETARALKQRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEA 836
Query: 498 KLEETV-AANARLESELVVLRQKLQWS---RREVCNGTASLSN---GPSVAALEAE---- 546
+L + AA EL+ ++ R+ + +L G S+ AL AE
Sbjct: 837 RLTALLRAARCTTIEELLAAVERSDTYRELRKRIAALERTLVRAGGGLSLEALVAEAAAL 896
Query: 547 -LRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
+ A + +L R +EL ++ +L ++ + Q++
Sbjct: 897 DPDELPARLEELARDIEELEEELNELAQEVGAAKQEL 933
Score = 30.5 bits (69), Expect = 4.5
Identities = 26/137 (18%), Positives = 47/137 (34%), Gaps = 14/137 (10%)
Query: 457 LRHKMHGVHASPAEVERYRRQ------QRLLEREL-SRVRSILAHNSKKLEETVAANARL 509
+R + + + E + L E +L ++S+ + E +L
Sbjct: 518 IRRGYALLEPTASAYEEAVESADQLADRLLREAQLVGELQSLRQ----QEEAARRRLEQL 573
Query: 510 ESELVVLRQKLQWSRREVCNGTASLSNGPSVAALE---AELRRVQALVGDLQRQRQELSA 566
E EL VL L R A+ + A +E AE + V + R EL A
Sbjct: 574 EKELEVLELALAALREAWQAQWAAAGLPLTPAEMEDWLAERATAREQVRAYFKARAELDA 633
Query: 567 QVKQLTEKSNSLSQQIR 583
+ + +L ++
Sbjct: 634 LLDRRARLRAALRAALK 650
Score = 30.5 bits (69), Expect = 4.8
Identities = 34/146 (23%), Positives = 57/146 (39%), Gaps = 18/146 (12%)
Query: 445 QAKEMLENALGSLRH--------KMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNS 496
A + E L +L ++ R++ LER L R A
Sbjct: 829 AALDEAEARLTALLRAARCTTIEELLAAVERSDTYRELRKRIAALERTLVR-----AGGG 883
Query: 497 KKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSN-GPSVAALEAELRRV--QAL 553
LE VA A L+ L +L+ R++ L+ V A + EL R+ +
Sbjct: 884 LSLEALVAEAAALD--PDELPARLEELARDIEELEEELNELAQEVGAAKQELARMDGGST 941
Query: 554 VGDLQRQRQELSAQVKQLTEKSNSLS 579
+L+ +R+ L AQ++ L E+ L+
Sbjct: 942 AAELEAERESLLAQLRDLAERYLELA 967
>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
(DUF2317). Members of this family of hypothetical
bacterial proteins have no known function.
Length = 541
Score = 31.0 bits (71), Expect = 2.3
Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 1/78 (1%)
Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
+++ + L L+ K+ A+ + + LE + R+R A LE +
Sbjct: 398 SIEDVFLHGLELLKEKVLEEKANSEIDIDFEEAKAELEAQFKRLREEAAKIDPSLEGALE 457
Query: 505 AN-ARLESELVVLRQKLQ 521
AN A++ +L L ++L
Sbjct: 458 ANEAKILKQLEFLEKRLL 475
>gnl|CDD|217914 pfam04129, Vps52, Vps52 / Sac2 family. Vps52 complexes with
Vps53 and Vps54 to form a multi- subunit complex
involved in regulating membrane trafficking events.
Length = 511
Score = 31.2 bits (71), Expect = 2.4
Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 7/67 (10%)
Query: 7 RQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN 66
+ + S ASL N +AA ++ L R++ ++ Q +S +K L EKSN
Sbjct: 7 QDVIDES-----ENLASLHN--QIAACDSVLERMEDMLTSFQSDLSSISQDIKFLQEKSN 59
Query: 67 SLSQQIR 73
+ ++
Sbjct: 60 EMQLRLE 66
Score = 31.2 bits (71), Expect = 2.4
Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 7/67 (10%)
Query: 517 RQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSN 576
+ + S ASL N +AA ++ L R++ ++ Q +S +K L EKSN
Sbjct: 7 QDVIDES-----ENLASLHN--QIAACDSVLERMEDMLTSFQSDLSSISQDIKFLQEKSN 59
Query: 577 SLSQQIR 583
+ ++
Sbjct: 60 EMQLRLE 66
>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat. Spectrin repeat-domains
are found in several proteins involved in cytoskeletal
structure. These include spectrin, alpha-actinin and
dystrophin. The sequence repeat used in this family is
taken from the structural repeat in reference. The
spectrin domain- repeat forms a three helix bundle. The
second helix is interrupted by proline in some
sequences. The repeats are defined by a characteristic
tryptophan (W) residue at position 17 in helix A and a
leucine (L) at 2 residues from the carboxyl end of
helix C. Although the domain occurs in ultiple repeats
along sequences, the domains are actually stable on
their own - ie they act, biophysically, like domains
rather than repeats that along function when
aggregated.
Length = 105
Score = 29.2 bits (66), Expect = 2.5
Identities = 11/50 (22%), Positives = 29/50 (58%), Gaps = 3/50 (6%)
Query: 27 GPSVAALEAELRRVQALVGDL---QRQRQELSAQVKQLTEKSNSLSQQIR 73
G + +++A L++ +AL +L Q + + L+ ++L + + S++I+
Sbjct: 30 GKDLESVQALLKKHKALEAELAAHQDRVEALNELAEKLIAEGHYASEEIQ 79
Score = 29.2 bits (66), Expect = 2.5
Identities = 11/50 (22%), Positives = 29/50 (58%), Gaps = 3/50 (6%)
Query: 537 GPSVAALEAELRRVQALVGDL---QRQRQELSAQVKQLTEKSNSLSQQIR 583
G + +++A L++ +AL +L Q + + L+ ++L + + S++I+
Sbjct: 30 GKDLESVQALLKKHKALEAELAAHQDRVEALNELAEKLIAEGHYASEEIQ 79
>gnl|CDD|240354 PTZ00310, PTZ00310, AMP deaminase; Provisional.
Length = 1453
Score = 31.3 bits (71), Expect = 2.5
Identities = 36/149 (24%), Positives = 59/149 (39%), Gaps = 12/149 (8%)
Query: 439 ICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKK 498
+ S QA E++ + + + VH A+V+ RRQQR + R+ ++
Sbjct: 1316 LLSTHDQAMEVMLREMETTNER--IVH-LRAQVDSLRRQQR---SLVERLTEEGIR-RQE 1368
Query: 499 LEETVAANARLESELV-VLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDL 557
+ E VA + V LR++LQ V G L + EA + V L
Sbjct: 1369 VSEQVAGENNSIALRVNRLRERLQSEGLTVGGG---LYGRSNEEESEASSTQQGETVKRL 1425
Query: 558 QRQRQELSAQVKQLTEKSNSLSQQIRPGP 586
R + ++ +T S S+ RP P
Sbjct: 1426 LRWKPMPPDVMRAVTVASVPGSRG-RPLP 1453
>gnl|CDD|216560 pfam01544, CorA, CorA-like Mg2+ transporter protein. The CorA
transport system is the primary Mg2+ influx system of
Salmonella typhimurium and Escherichia coli. CorA is
virtually ubiquitous in the Bacteria and Archaea. There
are also eukaryotic relatives of this protein. The
family includes the MRS2 protein from yeast that is
thought to be an RNA splicing protein. However its
membership of this family suggests that its effect on
splicing is due to altered magnesium levels in the cell.
Length = 291
Score = 30.7 bits (70), Expect = 2.7
Identities = 18/87 (20%), Positives = 36/87 (41%), Gaps = 8/87 (9%)
Query: 188 ILSSYVYYCVLSAYIRYVSARLADLE----TPKRMEHLQKLEELKRHLVELEKQVSITFN 243
+L S V + + L +LE E L++L L+R LV L + ++
Sbjct: 115 LLDSIVDR--YFEILEKLEEELDELEDELEDSTTNELLRELLRLRRSLVRLRRSLAPLRE 172
Query: 244 LTERNEKIPLKFVNDGD--YIQAVLDR 268
+ R +++ + Y++ +LD
Sbjct: 173 VLNRLLSDDGPLIDEEEKEYLRDLLDE 199
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 30.7 bits (70), Expect = 2.8
Identities = 21/111 (18%), Positives = 33/111 (29%), Gaps = 26/111 (23%)
Query: 478 QRLLERELSRVRSILAHNSKKLEETVAAN-----ARLESELVVLRQ----------KLQW 522
LLE + + A VAA A L +E+ L+
Sbjct: 112 TVLLEIQQEEQAAAQA--------QVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQ 163
Query: 523 SRREVCNGTAS---LSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQ 570
+ GT GP +L QA + L+ + AQ++
Sbjct: 164 EAQCEAEGTGGTGVAGKGPVYKEKREKLDAAQARLETLKARLDAAIAQLEA 214
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 30.8 bits (71), Expect = 2.9
Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 10/51 (19%)
Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQ----------LTEKSNSLSQQI 72
L+ E R +Q + +LQ +R LS ++ Q L + L ++I
Sbjct: 32 ELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEI 82
Score = 30.8 bits (71), Expect = 2.9
Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 10/51 (19%)
Query: 542 ALEAELRRVQALVGDLQRQRQELSAQVKQ----------LTEKSNSLSQQI 582
L+ E R +Q + +LQ +R LS ++ Q L + L ++I
Sbjct: 32 ELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEI 82
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 30.4 bits (69), Expect = 3.0
Identities = 23/112 (20%), Positives = 47/112 (41%), Gaps = 25/112 (22%)
Query: 480 LLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPS 539
LL +EL + SI + + LE EL L+Q + P+
Sbjct: 162 LLMKELELLNSIKPKLRDRKDA-------LEEELRQLKQL-----EDELE-----DCDPT 204
Query: 540 -VAALEAELRRVQA-------LVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
+ + +L+++ + +L+ + QEL ++++ LT K + L+ +I
Sbjct: 205 ELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIA 256
>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
The small mechanosensitive channel, MscS, is a part of
the turgor-driven solute efflux system that protects
bacteria from lysis in the event of osmotic shock. The
MscS protein alone is sufficient to form a functional
mechanosensitive channel gated directly by tension in
the lipid bilayer. The MscS proteins are heptamers of
three transmembrane subunits with seven converging M3
domains, and this MscS_porin is towards the N-terminal
of the molecules. The high concentration of negative
charges at the extracellular entrance of the pore helps
select the cations for efflux.
Length = 239
Score = 30.3 bits (69), Expect = 3.1
Identities = 12/44 (27%), Positives = 25/44 (56%)
Query: 29 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 72
S++ LE L + + + +LQ Q Q+ ++Q+ +L + QQ+
Sbjct: 79 SLSQLEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQQQL 122
Score = 30.3 bits (69), Expect = 3.1
Identities = 12/44 (27%), Positives = 25/44 (56%)
Query: 539 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQI 582
S++ LE L + + + +LQ Q Q+ ++Q+ +L + QQ+
Sbjct: 79 SLSQLEQRLAQTLSQLQELQEQLQQENSQLIELQTRPERAQQQL 122
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 30.0 bits (68), Expect = 3.7
Identities = 27/108 (25%), Positives = 52/108 (48%), Gaps = 7/108 (6%)
Query: 478 QRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNG 537
+++LE+ + + S LA + L + +A +LE +L + + + E +L G
Sbjct: 26 EKMLEQAIRDMESELAKARQALAQAIARQKQLERKLE--EAQARAEKLEE-KAELALQAG 82
Query: 538 PSVAALEAELRRVQALVGD---LQRQRQELSAQVKQLTEKSNSLSQQI 582
A EA L Q+L L+ + Q+ QV++L ++ +L Q+I
Sbjct: 83 NEDLAREA-LEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKI 129
>gnl|CDD|234592 PRK00045, hemA, glutamyl-tRNA reductase; Reviewed.
Length = 423
Score = 30.5 bits (70), Expect = 3.7
Identities = 20/96 (20%), Positives = 34/96 (35%), Gaps = 20/96 (20%)
Query: 488 VRSILAHNSKK-LEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAAL--- 543
++ I+ N + E A A +E E+ + EV P++ AL
Sbjct: 303 LQEIVEENLAQRQEAAEKAEAIVEEEVAEFME--WLRSLEV---------VPTIRALREQ 351
Query: 544 -----EAELRRVQALVGDLQRQRQELSAQVKQLTEK 574
E EL R +G + + + L + L K
Sbjct: 352 AEEIREEELERALKKLGPGEDEEEVLEKLARSLVNK 387
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 30.2 bits (69), Expect = 3.9
Identities = 24/95 (25%), Positives = 39/95 (41%), Gaps = 28/95 (29%)
Query: 470 EVERYRRQQRLLERELSRVRSILAHNSKKLEETVA-ANARLESELVVLRQKLQWSRREVC 528
E+ RRQ + LE EL ++ A LE +A R E EL + +
Sbjct: 217 EITELRRQIQSLEIELQSLKKQKA----SLERQLAELEERYELELADYQDTI-------- 264
Query: 529 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQE 563
+ LE EL++++A ++ RQ +E
Sbjct: 265 ------------SELEEELQQLKA---EMARQLRE 284
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 30.4 bits (68), Expect = 4.0
Identities = 33/132 (25%), Positives = 47/132 (35%), Gaps = 13/132 (9%)
Query: 448 EMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANA 507
+ AL SLR K G+ S +E R + L K +E + A
Sbjct: 158 QAHLEALSSLRSKAEGLEKSLQSLETRRAGEAKALAAAQAEADTLREQLSKTQEELEAQV 217
Query: 508 RLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQ 567
L +L R+ V G ++ S A E E + V LQ R L A
Sbjct: 218 TLVEQL----------RKYV--GEQVVAESRS-QAWELERGELLQTVKHLQEDRAALQAT 264
Query: 568 VKQLTEKSNSLS 579
V+ L + SL+
Sbjct: 265 VELLQVRVQSLT 276
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 30.4 bits (69), Expect = 4.2
Identities = 17/73 (23%), Positives = 26/73 (35%), Gaps = 5/73 (6%)
Query: 509 LESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQV 568
L +E L LQ +E N P+ E L+ +LS +
Sbjct: 20 LTAESAQLEAALQ-LLQEAVNSKRQEEAEPAAEEAEL----QAELIQQELAINDQLSQAL 74
Query: 569 KQLTEKSNSLSQQ 581
Q TE+ N+L+
Sbjct: 75 NQQTERLNALASD 87
Score = 30.0 bits (68), Expect = 5.8
Identities = 16/70 (22%), Positives = 24/70 (34%), Gaps = 5/70 (7%)
Query: 2 ELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQL 61
E L LQ +E N P+ E L+ +LS + Q
Sbjct: 23 ESAQLEAALQ-LLQEAVNSKRQEEAEPAAEEAEL----QAELIQQELAINDQLSQALNQQ 77
Query: 62 TEKSNSLSQQ 71
TE+ N+L+
Sbjct: 78 TERLNALASD 87
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 30.4 bits (69), Expect = 4.4
Identities = 18/94 (19%), Positives = 31/94 (32%), Gaps = 3/94 (3%)
Query: 447 KEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLEREL---SRVRSILAHNSKKLEETV 503
E LE+ L R ++ E+ R+ LE+EL + L +L +
Sbjct: 452 IEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMR 511
Query: 504 AANARLESELVVLRQKLQWSRREVCNGTASLSNG 537
+ V + +KL E + G
Sbjct: 512 KLELSGKGTPVKVVEKLTLEAIEEAEEEYGIKEG 545
Score = 29.3 bits (66), Expect = 8.7
Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 5/103 (4%)
Query: 471 VERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNG 530
VER + L+REL ++ + +LE + + V ++++ R +
Sbjct: 431 VERLEEENSELKRELEELKREIEKLESELER---FRREVRDK-VRKDREIRARDRRIERL 486
Query: 531 TASLSNGPS-VAALEAELRRVQALVGDLQRQRQELSAQVKQLT 572
L V LE +L ++ + + V++LT
Sbjct: 487 EKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLT 529
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 30.1 bits (68), Expect = 4.4
Identities = 23/133 (17%), Positives = 51/133 (38%), Gaps = 17/133 (12%)
Query: 464 VHASPAEVERYRRQQRLLER-----ELSRVRSILAHNSKKLEETVAANARLESELVVLRQ 518
++ E R R++QRL ++ + +S + + RL+S R
Sbjct: 178 WNSQGLEDYRKRKRQRLDQKAAEYLRQAAQAGTTGSSSADDLADLLSKFRLDSWDKGSRF 237
Query: 519 KLQWSRREVCNGTASLSNGPSVA---------ALEAELRRVQALVGDLQRQRQELSAQVK 569
+V + + P EAEL +Q + +L+ +E+ +++K
Sbjct: 238 T---RTEKVEQAATAAAEVPPAEMDTEEDRTKEREAELEALQEQIDELESSIEEVLSEIK 294
Query: 570 QLTEKSNSLSQQI 582
L K +++++
Sbjct: 295 ALASKIKQVNEEL 307
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 30.3 bits (69), Expect = 4.7
Identities = 11/52 (21%), Positives = 24/52 (46%), Gaps = 11/52 (21%)
Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQ-----------LTEKSNSLSQQI 72
L+ E R++ + +LQ +R ELS ++ + L + L +++
Sbjct: 33 ELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKL 84
Score = 30.3 bits (69), Expect = 4.7
Identities = 11/52 (21%), Positives = 24/52 (46%), Gaps = 11/52 (21%)
Query: 542 ALEAELRRVQALVGDLQRQRQELSAQVKQ-----------LTEKSNSLSQQI 582
L+ E R++ + +LQ +R ELS ++ + L + L +++
Sbjct: 33 ELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKL 84
>gnl|CDD|220180 pfam09325, Vps5, Vps5 C terminal like. Vps5 is a sorting nexin
that functions in membrane trafficking. This is the C
terminal dimerisation domain.
Length = 236
Score = 29.5 bits (67), Expect = 5.2
Identities = 9/40 (22%), Positives = 23/40 (57%)
Query: 30 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 69
+ +LE++L+++ + L QR+EL++ + + L+
Sbjct: 33 IDSLESQLKKLYKALELLVNQRKELASATGEFAKSLAMLA 72
Score = 29.5 bits (67), Expect = 5.2
Identities = 9/40 (22%), Positives = 23/40 (57%)
Query: 540 VAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 579
+ +LE++L+++ + L QR+EL++ + + L+
Sbjct: 33 IDSLESQLKKLYKALELLVNQRKELASATGEFAKSLAMLA 72
>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
unknown].
Length = 833
Score = 30.0 bits (67), Expect = 5.8
Identities = 22/164 (13%), Positives = 44/164 (26%)
Query: 19 NGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPGPTG 78
G +A L A +GD+ + ++ + +K+ ++ P
Sbjct: 210 AGITQHQAAEVLAQLVAAGVATAEQLGDVAQAAARFASASGESIDKTIKAFSKLTDDPVN 269
Query: 79 VAGKSAQGSTSPSGRSTAAETGTERTGQTADGEIQLFESADQARTHWSSGPGSTGVAGKS 138
A + + +R G TA E+A + + G +
Sbjct: 270 GAKALNRQFHYLTAAQLEQIAALQRAGDTAAAAAAAAEAAAAMDDRTARVKENMGTLETA 329
Query: 139 AQGSTSPSGRSTAAETGTERTGQTADGEIQLFESADQARTHWSN 182
+ + A G R + A + AR
Sbjct: 330 WDALADAAKKMWDAVLGIGREDKQAALLAAKLAAEKLARVTAQG 373
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 29.9 bits (67), Expect = 6.2
Identities = 31/144 (21%), Positives = 60/144 (41%), Gaps = 12/144 (8%)
Query: 423 ISRDQIASKLHSRGRNICSVQGQAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLE 482
R++ K+ + K LE L L+ ++ A + R +++ +
Sbjct: 63 QVREEQEQKIQDAATEKTKEWEKVKAELEKKLLELQKELARAAAENDALSRSLQEKSKMI 122
Query: 483 RELSRVRSILAHNSKKLEETVAA----NARLESELVVLRQKLQW--SRREVCNGTAS--- 533
ELS +S + L+ + + N+ L+ E+ VL ++L+ R + +A
Sbjct: 123 MELSEEKSRAESEIEDLKARLESAEKENSSLKYEVHVLSKELEIRNEERNMSLQSADAAS 182
Query: 534 ---LSNGPSVAALEAELRRVQALV 554
L +A LEAE +R++ LV
Sbjct: 183 KQHLEGVKKIAKLEAECQRLRGLV 206
>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding. This family
is found in mammals where it is localised at cell-cell
adherens junctions, and in Sch. pombe and other fungi
where it anchors spindle-pole bodies to spindle
microtubules. It is a coiled-coil structure, and in
pombe, it is required for anchoring the minus end of
spindle microtubules to the centrosome equivalent, the
spindle-pole body. The name ADIP derives from the family
being composed of Afadin- and alpha -Actinin-Binding
Proteins Localised at Cell-Cell Adherens Junctions.
Length = 149
Score = 28.8 bits (65), Expect = 6.5
Identities = 20/85 (23%), Positives = 36/85 (42%), Gaps = 1/85 (1%)
Query: 445 QAKEMLENALGSLRHKMHGVHASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVA 504
+ E E+ +LR + +ER + Q LEREL+ +++ KKL+
Sbjct: 47 RDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTLEQ 106
Query: 505 ANARLESELVVLRQKLQWSRREVCN 529
+ E+ L+ +Q R+ N
Sbjct: 107 KLKNEKEEVQRLKNIIQ-QRKTQYN 130
Score = 28.4 bits (64), Expect = 6.9
Identities = 19/99 (19%), Positives = 43/99 (43%), Gaps = 9/99 (9%)
Query: 472 ERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGT 531
+R + LE L ++ + + +E LE EL +L+ K + +++
Sbjct: 46 DRDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKL---- 101
Query: 532 ASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQ 570
+L L+ E VQ L +Q+++ + + ++K+
Sbjct: 102 KTLEQ-----KLKNEKEEVQRLKNIIQQRKTQYNHELKK 135
>gnl|CDD|184543 PRK14157, PRK14157, heat shock protein GrpE; Provisional.
Length = 227
Score = 29.2 bits (65), Expect = 6.5
Identities = 14/82 (17%), Positives = 27/82 (32%), Gaps = 2/82 (2%)
Query: 74 PGPTGVAGKSAQGSTSPSGRSTAAETGTERTGQTADGEIQLFESADQARTHWSSGPGSTG 133
P + + S ++A + + AD + A + TG
Sbjct: 14 PDADDLEAQGQAAQASSGADASAESGEQQDSAAQADANAGADAAPAAAEGEAKAAAEKTG 73
Query: 134 VAGKSAQGSTSPSGRST--AAE 153
A + + +P G++ AAE
Sbjct: 74 EAQSDSDDTLTPLGQAKKEAAE 95
>gnl|CDD|221919 pfam13094, CENP-Q, CENP-A-nucleosome distal (CAD) centromere
subunit. CENP-Q is one of the components that
assembles onto the CENP-A-nucleosome distal (CAD)
centromere. The centromere, which is the basic element
of chromosome inheritance, is epigenetically determined
in mammals. CENP-A, the centromere-specific histone H3
variant, assembles an array of nucleosomes and it is
this that seems to be the prime candidate for
specifying centromere identity. CENP-A nucleosomes
directly recruit a proximal CENP-A nucleosome
associated complex (NAC) comprised of CENP-M, CENP-N
and CENP-T, CENP-U(50), CENP-C and CENP-H. Assembly of
the CENP-A NAC at centromeres is dependent on CENP-M,
CENP-N and CENP-T. Additionally, there are seven other
subunits which make up the CENP-A-nucleosome distal
(CAD) centromere, CENP-K, CENP-L, CENP-O, CENP-P,
CENP-Q, CENP-R and CENP-S, also assembling on the
CENP-A NAC. Fta7 is the equivalent component of the
fission yeast Sim4 complex.
Length = 159
Score = 28.9 bits (65), Expect = 6.7
Identities = 13/46 (28%), Positives = 20/46 (43%)
Query: 29 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 74
+ LE E+RR +A + QEL K L + S+ + P
Sbjct: 42 QLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLHP 87
Score = 28.9 bits (65), Expect = 6.7
Identities = 13/46 (28%), Positives = 20/46 (43%)
Query: 539 SVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRP 584
+ LE E+RR +A + QEL K L + S+ + P
Sbjct: 42 QLELLEEEIRREEAELEKDLEYLQELEKNAKALEREREEESKNLHP 87
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 27.3 bits (61), Expect = 6.7
Identities = 11/52 (21%), Positives = 24/52 (46%)
Query: 469 AEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKL 520
E E+ +++ + ++R+ K++ E +LE EL L ++L
Sbjct: 19 EEGEKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLEERL 70
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This
family includes outer membrane proteins such as OmpH
among others. Skp (OmpH) has been characterized as a
molecular chaperone that interacts with unfolded
proteins as they emerge in the periplasm from the Sec
translocation machinery.
Length = 140
Score = 28.3 bits (64), Expect = 6.8
Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 3/43 (6%)
Query: 31 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 73
LE E ++ QA +L++ +EL ++L + + +LS+ R
Sbjct: 21 KQLEKEFKKRQA---ELEKLEKELQKLKEKLQKDAATLSEAAR 60
Score = 28.3 bits (64), Expect = 6.8
Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 3/43 (6%)
Query: 541 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR 583
LE E ++ QA +L++ +EL ++L + + +LS+ R
Sbjct: 21 KQLEKEFKKRQA---ELEKLEKELQKLKEKLQKDAATLSEAAR 60
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters
and repress their activity. by competing with the
binding of TATA binding protein. TMF1_TATA_bd is the
most conserved part of the TMFs. TMFs are
evolutionarily conserved golgins that bind Rab6, a
ubiquitous ras-like GTP-binding Golgi protein, and
contribute to Golgi organisation in animal and plant
cells. The Rab6-binding domain appears to be the same
region as this C-terminal family.
Length = 121
Score = 28.3 bits (64), Expect = 7.1
Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 13/68 (19%)
Query: 19 NGTASLSNGPSVAALE---AELRR-------VQALVGDLQRQRQELSAQVKQLT---EKS 65
++ GP+V +E + LRR ++ + L+ +R E ++ +LT E+
Sbjct: 5 VSVSTSGAGPNVQLVERLSSTLRRLEGELASLKDELARLEAERDEARQEIVKLTEENEEL 64
Query: 66 NSLSQQIR 73
L ++I
Sbjct: 65 RELKKEIE 72
Score = 28.3 bits (64), Expect = 7.1
Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 13/68 (19%)
Query: 529 NGTASLSNGPSVAALE---AELRR-------VQALVGDLQRQRQELSAQVKQLT---EKS 575
++ GP+V +E + LRR ++ + L+ +R E ++ +LT E+
Sbjct: 5 VSVSTSGAGPNVQLVERLSSTLRRLEGELASLKDELARLEAERDEARQEIVKLTEENEEL 64
Query: 576 NSLSQQIR 583
L ++I
Sbjct: 65 RELKKEIE 72
>gnl|CDD|223914 COG0845, AcrA, Membrane-fusion protein [Cell envelope biogenesis,
outer membrane].
Length = 372
Score = 29.3 bits (65), Expect = 7.7
Identities = 19/132 (14%), Positives = 41/132 (31%), Gaps = 14/132 (10%)
Query: 466 ASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRR 525
+PAE+ +R+ +L + + ++ L L A ++ L + L+++R
Sbjct: 121 LAPAELGDLQREAKLAAEKAAVSQAELDAAQALLRAAEALVEAAQAALASAKLNLEYTRI 180
Query: 526 EVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIRPG 585
A + + VG L Q L+ + + +
Sbjct: 181 --------------TAPISGVIGARLVRVGQLVSAGQALATIADLDPAAALWVLAAVLER 226
Query: 586 PTGVAVAGKSVN 597
G+ V
Sbjct: 227 DLLAVKVGQKVT 238
>gnl|CDD|222849 PHA02109, PHA02109, hypothetical protein.
Length = 233
Score = 28.9 bits (64), Expect = 8.2
Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 3/66 (4%)
Query: 465 HASPAEVERYRRQQRLLERELSRVRSILAHNSKKLEETVAANARLES---ELVVLRQKLQ 521
A P VE + +R+ + E S L + KL++ +LE+ E ++ K+
Sbjct: 158 PAKPKAVEIHASTERIDQVERSHTGENLEGLTDKLKQISELTIKLEALSDEACQVKHKIL 217
Query: 522 WSRREV 527
R EV
Sbjct: 218 NLRAEV 223
>gnl|CDD|182972 PRK11113, PRK11113, D-alanyl-D-alanine
carboxypeptidase/endopeptidase; Provisional.
Length = 477
Score = 29.2 bits (66), Expect = 8.3
Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 2/31 (6%)
Query: 244 LTERNEKIPLKF-VNDGD-YIQAVLDRNLAE 272
L +R++ +PL F V DG Y A+L L +
Sbjct: 235 LPQRSKPLPLAFAVQDGASYAGAILKDELKQ 265
>gnl|CDD|218237 pfam04738, Lant_dehyd_C, Lantibiotic dehydratase, C terminus.
Lantibiotics are ribosomally synthesised antimicrobial
agents derived from ribosomally synthesised peptides.
They are produced by bacteria of the Firmicutes phylum,
and include mutacin, subtilin, and nisin. Lantibiotic
peptides contain thioether bridges termed lanthionines
that are thought to be generated by dehydration of
serine and threonine residues followed by addition of
cysteine residues. This family constitutes the
C-terminus of the enzyme proposed to catalyze the
dehydration step.
Length = 500
Score = 29.3 bits (66), Expect = 8.6
Identities = 25/115 (21%), Positives = 47/115 (40%), Gaps = 19/115 (16%)
Query: 201 YIRYVSARLADLETPKRMEHLQKLEELKRHL-----VELEKQVSITFNLTERNEKI---- 251
+ Y+ +L L+ P+ E L L E+++ + + + + + L E+ ++
Sbjct: 48 PLDYLIEKLEALDVPEANELLAALREIQKLIAEYAELPIGEGRKLLQELEEKMSQLTPSR 107
Query: 252 --PLK--FVNDGDYIQAVLDRNLAENISRVLYP----NDNNFGGKELRLKQEYFM 298
PL D + + L N+AE I+R + N G LR E F+
Sbjct: 108 RGPLYVDLRLDAEEVT--LGENVAEEIARAAELLLRLSPNPAGRPYLRDYHERFL 160
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 29.4 bits (65), Expect = 8.7
Identities = 23/113 (20%), Positives = 37/113 (32%), Gaps = 6/113 (5%)
Query: 61 LTEKSNSLSQQIRPGPTGVAGKSAQGSTSPSGRSTAAE-TGTERTGQTADGEIQLFESAD 119
T + S + PTG + TSP+ R+T+A T T +
Sbjct: 459 STGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQK 518
Query: 120 QARTHWSSGP-----GSTGVAGKSAQGSTSPSGRSTAAETGTERTGQTADGEI 167
+ T ++ P G T A G+TS ++ T T +
Sbjct: 519 TSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVV 571
>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 247
Score = 28.9 bits (65), Expect = 9.3
Identities = 9/38 (23%), Positives = 18/38 (47%)
Query: 32 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 69
A + + LQ++ L+A+V+ L K +S+
Sbjct: 47 VRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVR 84
Score = 28.9 bits (65), Expect = 9.3
Identities = 9/38 (23%), Positives = 18/38 (47%)
Query: 542 ALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLS 579
A + + LQ++ L+A+V+ L K +S+
Sbjct: 47 VRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVR 84
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.1 bits (65), Expect = 9.4
Identities = 20/93 (21%), Positives = 35/93 (37%), Gaps = 6/93 (6%)
Query: 483 RELSRVRSILAHNSKKLEETVAANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAA 542
E ++ +KLEE L EL L + + + + S
Sbjct: 128 PEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSR------ 181
Query: 543 LEAELRRVQALVGDLQRQRQELSAQVKQLTEKS 575
LE L+++ V DL+++ EL V+ E+
Sbjct: 182 LEEMLKKLPGEVYDLKKRWDELEPGVELPEEEL 214
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 29.2 bits (66), Expect = 9.4
Identities = 27/139 (19%), Positives = 57/139 (41%), Gaps = 23/139 (16%)
Query: 447 KEMLENALGSLRHKMHGVHASPAEVERYRRQQRL--LERELSRVRSILAHNSKKLEETVA 504
+ +L + GSL + A E E +RL LE EL+ + + ++ E+
Sbjct: 179 ERVLSDQRGSLD----QLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARE 234
Query: 505 ANARLESELVVLRQKLQWSRREVCNGTASLSNGPSVAALEAELRRVQALVGDLQRQRQEL 564
+ L ++ + + LEAE+ ++ + + +R+R+EL
Sbjct: 235 TRDEADEVLEEHEERRE-----------------ELETLEAEIEDLRETIAETEREREEL 277
Query: 565 SAQVKQLTEKSNSLSQQIR 583
+ +V+ L E+ L ++
Sbjct: 278 AEEVRDLRERLEELEEERD 296
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 29.0 bits (65), Expect = 10.0
Identities = 17/97 (17%), Positives = 41/97 (42%), Gaps = 2/97 (2%)
Query: 541 AALEAELRRVQALVGDLQRQRQELSAQVKQLTEKSNSLSQQIR-PGPTGVAVAGKSVNRP 599
A ++ + ++ + L +QR + ++++L + + +L++Q++ G V G+ V +
Sbjct: 79 AQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATGEPVPQM 138
Query: 600 GPTGVAGKVSKSGLDPTGVAVDTRIPNPCSLSLGLYP 636
P G + T V+ + YP
Sbjct: 139 -PASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAFYP 174
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.128 0.366
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,887,066
Number of extensions: 3617179
Number of successful extensions: 4704
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4535
Number of HSP's successfully gapped: 329
Length of query: 746
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 642
Effective length of database: 6,324,786
Effective search space: 4060512612
Effective search space used: 4060512612
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 63 (27.9 bits)