RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16513
(156 letters)
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 121 bits (306), Expect = 9e-33
Identities = 79/109 (72%), Positives = 94/109 (86%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ARI ELEEE+E ER +RAKAEK RADL+RELEEL ERLEEAGGAT+AQIELNKKREAE+
Sbjct: 45 EARIRELEEELEAERAARAKAEKARADLSRELEELSERLEEAGGATAAQIELNKKREAEL 104
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEA 110
+KLR+DLEEAN+QHE LA LRKKH DA++E+ EQI+QL K K K ++
Sbjct: 105 AKLRKDLEEANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKE 153
Score = 37.7 bits (88), Expect = 0.001
Identities = 36/152 (23%), Positives = 67/152 (44%), Gaps = 17/152 (11%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
+ +I ELEE E EK ++ L ELE+L LE A A S + K +
Sbjct: 326 LNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKNFDKI 385
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDA------VSEMGEQIDQLNK-----------L 103
+++ +R ++E + ++ R + + E+ +Q++ L + L
Sbjct: 386 LAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVEALRRENKNLQDEIHDL 445
Query: 104 KTKLDEANRTINDLDVARKKLIIENSDLVRQL 135
+L E R +++L+ AR++L E +L L
Sbjct: 446 TDQLGEGGRNVHELEKARRRLEAEKDELQAAL 477
Score = 31.6 bits (72), Expect = 0.14
Identities = 37/154 (24%), Positives = 75/154 (48%), Gaps = 3/154 (1%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
+QA + +L +++ +++ AEK+ L +L EL +L+E + + ++E
Sbjct: 157 LQAEVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSE 216
Query: 61 MSKLRRDLEEANIQHESTL---ANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
S L R LEEA Q + + L + +A + E+ + L+ +L + ++ L
Sbjct: 217 NSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSL 276
Query: 118 DVARKKLIIENSDLVRQLSKVTADARMWKIKYET 151
++ ++L RQLSK A+ + W+ K+E+
Sbjct: 277 REQLEEESEAKAELERQLSKANAEIQQWRSKFES 310
Score = 31.2 bits (71), Expect = 0.22
Identities = 37/137 (27%), Positives = 57/137 (41%), Gaps = 10/137 (7%)
Query: 7 ELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 66
E + +E E + RA + Q L +L+ L E+LEE A + K AE+ + R
Sbjct: 247 EAKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRS 306
Query: 67 DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLII 126
E L L+KK N +SE+ E + AN + L+ + +L
Sbjct: 307 KFESEGALRAEELEELKKKLNQKISELEEAA----------EAANAKCDSLEKTKSRLQS 356
Query: 127 ENSDLVRQLSKVTADAR 143
E DL +L + A A
Sbjct: 357 ELEDLQIELERANAAAS 373
Score = 26.9 bits (60), Expect = 5.2
Identities = 32/143 (22%), Positives = 61/143 (42%), Gaps = 4/143 (2%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
Q I L+ +E E + +A+A + + L ++ EL L+ A A + + KK + +
Sbjct: 523 HQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALDHANKANAEAQKNVKKYQQQ 582
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
+ +L+ +EE E L A + E L +L++ L++A R +
Sbjct: 583 VKELQTQVEEEQRAREDAREQLAVAERRATALEAE----LEELRSALEQAERARKQAETE 638
Query: 121 RKKLIIENSDLVRQLSKVTADAR 143
+ ++L Q S + A R
Sbjct: 639 LAEASERVNELTAQNSSLIAQKR 661
Score = 26.2 bits (58), Expect = 8.8
Identities = 25/84 (29%), Positives = 46/84 (54%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++AR+ ELE E++ E++ A+ +K + R ++EL ++EE +L K +A+
Sbjct: 748 LEARVRELEAELDGEQRRHAETQKNLRKMERRVKELQFQVEEDKKNLERLQDLVDKLQAK 807
Query: 61 MSKLRRDLEEANIQHESTLANLRK 84
+ +R LEEA + L+ RK
Sbjct: 808 IKTYKRQLEEAEEVAQINLSKYRK 831
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 37.8 bits (88), Expect = 7e-04
Identities = 25/116 (21%), Positives = 49/116 (42%)
Query: 9 EEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDL 68
+++E + ++ ++Q ++L ELE+L ER+ E E + + L L
Sbjct: 71 PQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERL 130
Query: 69 EEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL 124
E + LR+ D E+ E ++L +L+ L I +L ++L
Sbjct: 131 ESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 38.1 bits (89), Expect = 0.001
Identities = 26/117 (22%), Positives = 51/117 (43%), Gaps = 4/117 (3%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ RI E +E + + + LA E+EEL E +EE +++E A
Sbjct: 829 LERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEEL----ESELEALLNERAS 884
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
+ + L + L L K ++ E+ E ++L +L+ +L+ I++L
Sbjct: 885 LEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNL 941
Score = 37.3 bits (87), Expect = 0.002
Identities = 33/144 (22%), Positives = 68/144 (47%), Gaps = 3/144 (2%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
I LE++ ++ R+ A E+Q +L +LEEL +L+E + E ++ + E+
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
Query: 62 SKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
L +LEE + ES L L ++ S++ + Q+ L +++ + L+
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413
Query: 119 VARKKLIIENSDLVRQLSKVTADA 142
R++L E +L+++L +
Sbjct: 414 DRRERLQQEIEELLKKLEEAELKE 437
Score = 36.6 bits (85), Expect = 0.003
Identities = 28/126 (22%), Positives = 59/126 (46%), Gaps = 3/126 (2%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+A+I +L+EE++ R++ + + L E L ERLE +A + E ++
Sbjct: 788 EAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847
Query: 62 SKLRRDLEEANIQHES---TLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
+L D+E + E + L + ++E + L L+++L+E + + +L+
Sbjct: 848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELE 907
Query: 119 VARKKL 124
R +L
Sbjct: 908 SKRSEL 913
Score = 36.6 bits (85), Expect = 0.003
Identities = 41/148 (27%), Positives = 74/148 (50%), Gaps = 4/148 (2%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ I ELEE++E + A+ EK A+L +ELEEL E LE+ ++E ++ + +
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL----RKELEELSRQISAL 731
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
K LE Q E +A L K+ + +E+ E ++L + + +L EA I +L+
Sbjct: 732 RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791
Query: 122 KKLIIENSDLVRQLSKVTADARMWKIKY 149
++L E L L ++ A+ + +
Sbjct: 792 EQLKEELKALREALDELRAELTLLNEEA 819
Score = 36.2 bits (84), Expect = 0.004
Identities = 29/118 (24%), Positives = 55/118 (46%), Gaps = 1/118 (0%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
+++R+ ELEE++E R A+ E Q A L E+E L RLE + ++ +
Sbjct: 370 LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKK 429
Query: 61 MSKLRRDLEEANI-QHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
+ + +A + + E L L+++ + E ++L + + LD A R + L
Sbjct: 430 LEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQL 487
Score = 35.8 bits (83), Expect = 0.005
Identities = 27/127 (21%), Positives = 54/127 (42%), Gaps = 3/127 (2%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
+ +I L +++ + E++ A L++EL EL +EE E + EAE
Sbjct: 724 LSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE 783
Query: 61 MSKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
+ +L +E+ + L LR + E ++L L+ ++ R + DL
Sbjct: 784 IEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDL 843
Query: 118 DVARKKL 124
+ ++L
Sbjct: 844 EEQIEEL 850
Score = 33.5 bits (77), Expect = 0.032
Identities = 26/126 (20%), Positives = 55/126 (43%), Gaps = 4/126 (3%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ ELE E+E + +AE++ A+ E+EEL ++E+ A E + AE+
Sbjct: 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAEL 812
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
+ L + L +L ++ + + +Q+ +L ++ I +L+
Sbjct: 813 TLLNEEAANL----RERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELI 868
Query: 122 KKLIIE 127
++L E
Sbjct: 869 EELESE 874
Score = 33.5 bits (77), Expect = 0.034
Identities = 35/136 (25%), Positives = 60/136 (44%), Gaps = 11/136 (8%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+A + L EE R+ E++ A R LE+L E++EE S IE AE+
Sbjct: 809 RAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEEL----SEDIESL---AAEI 861
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
+L +EE + E+ L N R +A + +L +L +L E ++L
Sbjct: 862 EELEELIEELESELEA-LLNERASLEEA---LALLRSELEELSEELRELESKRSELRREL 917
Query: 122 KKLIIENSDLVRQLSK 137
++L + + L +L
Sbjct: 918 EELREKLAQLELRLEG 933
Score = 32.3 bits (74), Expect = 0.084
Identities = 25/139 (17%), Positives = 53/139 (38%), Gaps = 4/139 (2%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGA---TSAQIELNKKR 57
++ ++ EL+EE+E + E + +L LEEL E+LE QI
Sbjct: 342 LEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401
Query: 58 EAEMSKLRRDLEEA-NIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTIND 116
+ LE+ + L+K + E+ ++++L + +L E + +
Sbjct: 402 IERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEE 461
Query: 117 LDVARKKLIIENSDLVRQL 135
++ + E +
Sbjct: 462 ALEELREELEEAEQALDAA 480
Score = 31.2 bits (71), Expect = 0.22
Identities = 30/120 (25%), Positives = 49/120 (40%), Gaps = 11/120 (9%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ I ELE E+E RA E+ A L ELEEL E L E S ++ +
Sbjct: 864 LEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREK 923
Query: 61 MSKLRRDLEEANIQHESTLANLR-----------KKHNDAVSEMGEQIDQLNKLKTKLDE 109
+++L LE ++ ++ L N + E +L +L+ K+ E
Sbjct: 924 LAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983
Score = 26.6 bits (59), Expect = 6.9
Identities = 34/132 (25%), Positives = 65/132 (49%), Gaps = 7/132 (5%)
Query: 13 EVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEAN 72
E+ERQ + E+Q A+ A +EL L E A ++ E+ +L+ +L+EA
Sbjct: 197 ELERQLK-SLERQ-AEKAERYKELKAELRELELALLVL--RLEELREELEELQEELKEAE 252
Query: 73 IQHESTLANLRK---KHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENS 129
+ E A L++ K + E+ E +++ +L+ +L I+ L+ ++ L +
Sbjct: 253 EELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLA 312
Query: 130 DLVRQLSKVTAD 141
+L RQL ++ A
Sbjct: 313 NLERQLEELEAQ 324
Score = 26.2 bits (58), Expect = 8.8
Identities = 32/161 (19%), Positives = 69/161 (42%), Gaps = 20/161 (12%)
Query: 3 ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEE-----------------AGG 45
R+ EL EE+E ++ +AE++ +L EL+EL E+LEE
Sbjct: 232 LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291
Query: 46 ATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDA---VSEMGEQIDQLNK 102
A + +I ++++ + + +LE + E+ L L K ++ ++E+ E++++L +
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351
Query: 103 LKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTADAR 143
L+ + + + E + + L A
Sbjct: 352 ELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLE 392
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 37.7 bits (88), Expect = 0.001
Identities = 26/129 (20%), Positives = 68/129 (52%), Gaps = 4/129 (3%)
Query: 3 ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMS 62
AR+ E+E+++ + EK+ +L + +L E+++ +IE ++ E+
Sbjct: 812 ARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIE----KEIENLNGKKEELE 867
Query: 63 KLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARK 122
+ +LE A ES L +L+K+ ++ +++ E ++ +L+ ++++ + +++L +
Sbjct: 868 EELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927
Query: 123 KLIIENSDL 131
L E S++
Sbjct: 928 ALEEELSEI 936
Score = 35.0 bits (81), Expect = 0.011
Identities = 22/118 (18%), Positives = 60/118 (50%), Gaps = 4/118 (3%)
Query: 8 LEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRD 67
LE+E++ ++ R ++Q + +E+E L + EE ++E + ++ D
Sbjct: 831 LEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE----LEEELEELEAALRDLESRLGD 886
Query: 68 LEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLI 125
L++ + E+ L L +K + +++ ++ +L++LK KL+ +++++ + +
Sbjct: 887 LKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDE 944
Score = 33.9 bits (78), Expect = 0.024
Identities = 37/141 (26%), Positives = 62/141 (43%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
Q R+ E E+E E S ++ ++ + EE +LE A+IE ++ E
Sbjct: 289 QLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEE 348
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
K R L E + + L +LR + + E E D+L + KL++ R IN+L
Sbjct: 349 RKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKREL 408
Query: 122 KKLIIENSDLVRQLSKVTADA 142
+L E L +L+ + A
Sbjct: 409 DRLQEELQRLSEELADLNAAI 429
Score = 32.0 bits (73), Expect = 0.12
Identities = 37/165 (22%), Positives = 68/165 (41%), Gaps = 20/165 (12%)
Query: 3 ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIE---------L 53
YEL +E E + + E+Q A L ELE+L E + E +IE +
Sbjct: 223 YEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISEL-EKRLEEIEQLLEELNKKI 281
Query: 54 NKKREAEMSKLRRDLEEANIQHESTLANLRKKHN---DAVSEMGEQIDQLNKLKTKLDEA 110
E E +++ + E + S ++ +K DA + + +++KL +++E
Sbjct: 282 KDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEEL 341
Query: 111 NRTINDLDVARKKLIIENS-------DLVRQLSKVTADARMWKIK 148
R I + R KL E + DL +L +V + + +
Sbjct: 342 EREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE 386
Score = 31.2 bits (71), Expect = 0.23
Identities = 21/116 (18%), Positives = 56/116 (48%), Gaps = 4/116 (3%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ R+ L+ E+ + + E + +L++EL + ++ E +IE ++ E +
Sbjct: 679 LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEI----EKEIEQLEQEEEK 734
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTIND 116
+ + +LEE E + N++ + + + + E + L+KL+ L++ ++
Sbjct: 735 LKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790
Score = 30.0 bits (68), Expect = 0.50
Identities = 30/144 (20%), Positives = 61/144 (42%), Gaps = 4/144 (2%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ + +L E+E + A+ + D +LE+L + E E ++ E+
Sbjct: 363 KEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEEL 422
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
+ L + E+ + L ++ D E+ +Q +L +L L + + + DL
Sbjct: 423 ADLNAAIAGI----EAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEY 478
Query: 122 KKLIIENSDLVRQLSKVTADARMW 145
++ E S L R+L++ A AR
Sbjct: 479 DRVEKELSKLQRELAEAEAQARAS 502
Score = 26.6 bits (59), Expect = 8.3
Identities = 30/165 (18%), Positives = 66/165 (40%), Gaps = 15/165 (9%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEEL------------GERLEEAGGATSA 49
+ + LE+E+E + + E + +L +L +L R+ E S
Sbjct: 743 EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSK 802
Query: 50 QIELNKKREAEMSKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNKLKTK 106
E + EA + ++ + L ++ E + L+++ D ++ ++ L K
Sbjct: 803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGK 862
Query: 107 LDEANRTINDLDVARKKLIIENSDLVRQLSKVTADARMWKIKYET 151
+E + +L+ A + L DL ++ ++ A R + K E
Sbjct: 863 KEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEE 907
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 37.0 bits (86), Expect = 0.003
Identities = 34/134 (25%), Positives = 69/134 (51%), Gaps = 4/134 (2%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ R+ LE E+E Q R + E++ +L E+EEL E+L+E ++E +K E+
Sbjct: 806 ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL----EEELEELEKELEEL 861
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
+ +LE + E L L ++ + E+ E +L +LK ++++ + +L+
Sbjct: 862 KEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKL 921
Query: 122 KKLIIENSDLVRQL 135
++L +E +L +L
Sbjct: 922 ERLEVELPELEEEL 935
Score = 35.5 bits (82), Expect = 0.009
Identities = 32/146 (21%), Positives = 67/146 (45%), Gaps = 4/146 (2%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++A++ +LEEE++ + E +L R+LEEL +LEE ++ ++ +
Sbjct: 679 LEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKR----ELAALEEELEQ 734
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
+ +LEE + E L L+++ + E+ + L KLK +++E L
Sbjct: 735 LQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEE 794
Query: 121 RKKLIIENSDLVRQLSKVTADARMWK 146
++L E + R+L + + +
Sbjct: 795 LEELEEELEEAERRLDALERELESLE 820
Score = 35.1 bits (81), Expect = 0.010
Identities = 32/146 (21%), Positives = 75/146 (51%), Gaps = 4/146 (2%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
+Q R+ ELEEE+E ++ AK +++ +L + + L E LEE ++E ++R
Sbjct: 756 LQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEE----LEEELEEAERRLDA 811
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
+ + LE+ + E + L ++ + ++ E ++L +L+ +L+E + +L+
Sbjct: 812 LERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE 871
Query: 121 RKKLIIENSDLVRQLSKVTADARMWK 146
+++L E +L + ++ + R +
Sbjct: 872 KEELEDELKELEEEKEELEEELRELE 897
Score = 35.1 bits (81), Expect = 0.012
Identities = 25/140 (17%), Positives = 63/140 (45%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ +I L+EE+E + E+ A+L EEL E+L E ++ AE+
Sbjct: 329 KEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAEL 388
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
++ + + + +L ++ + + ++L +L+ +L+E + +L+
Sbjct: 389 EAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEEL 448
Query: 122 KKLIIENSDLVRQLSKVTAD 141
++L + +L +L ++ +
Sbjct: 449 EELEEQLEELRDRLKELERE 468
Score = 33.5 bits (77), Expect = 0.032
Identities = 30/141 (21%), Positives = 63/141 (44%), Gaps = 7/141 (4%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ R+ ELE E+E + + +++ L ELEE LEE + ++ + E
Sbjct: 307 LRERLEELENELEELEERLEELKEKIEALKEELEERETLLEEL----EQLLAELEEAKEE 362
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
+ + L E E LR++ + +E+ E ++L +LK +++ + L
Sbjct: 363 LEEKLSALLEE---LEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSER 419
Query: 121 RKKLIIENSDLVRQLSKVTAD 141
+ L E +L +L ++ +
Sbjct: 420 LEDLKEELKELEAELEELQTE 440
Score = 33.1 bits (76), Expect = 0.053
Identities = 37/167 (22%), Positives = 76/167 (45%), Gaps = 14/167 (8%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGG----------ATSAQ 50
++ + EL+EE+ ++ + E + + L LEEL LEE A +
Sbjct: 279 LREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEE 338
Query: 51 IELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLN----KLKTK 106
+E + E+ +L +LEEA + E L+ L ++ + + E++ +L +++ +
Sbjct: 339 LEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNE 398
Query: 107 LDEANRTINDLDVARKKLIIENSDLVRQLSKVTADARMWKIKYETNK 153
L+E R I L+ ++L DL +L ++ A+ + + E
Sbjct: 399 LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELN 445
Score = 33.1 bits (76), Expect = 0.054
Identities = 33/150 (22%), Positives = 70/150 (46%), Gaps = 3/150 (2%)
Query: 7 ELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 66
ELE+ + +++ + E++ + L ELEEL E L E A++ + E+ +
Sbjct: 348 ELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE 407
Query: 67 DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL-- 124
LEE + L +L+++ + +E+ E +L +L +L+E + +L K+L
Sbjct: 408 SLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELER 467
Query: 125 -IIENSDLVRQLSKVTADARMWKIKYETNK 153
+ E + +++L K + + E +
Sbjct: 468 ELAELQEELQRLEKELSSLEARLDRLEAEQ 497
Score = 32.8 bits (75), Expect = 0.067
Identities = 32/152 (21%), Positives = 69/152 (45%), Gaps = 10/152 (6%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ + +L+EE+E + R +++ +L ELEE RL+ + + ++ E E+
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830
Query: 62 SKLRRDLEEANIQ----------HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEAN 111
+L ++EE + E L L+++ + +E E D+L +L+ + +E
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE 890
Query: 112 RTINDLDVARKKLIIENSDLVRQLSKVTADAR 143
+ +L+ +L E L +L ++ A
Sbjct: 891 EELRELESELAELKEEIEKLRERLEELEAKLE 922
Score = 31.2 bits (71), Expect = 0.20
Identities = 36/155 (23%), Positives = 75/155 (48%), Gaps = 3/155 (1%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ + LE+ +E R+ + E+Q +L REL L E LE+ E ++ E E+
Sbjct: 694 KNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL 753
Query: 62 SKLRRDLEEANIQHEST---LANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
+L+ LEE + ES LA L+++ + + ++L +L+ +L+EA R ++ L+
Sbjct: 754 EELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALE 813
Query: 119 VARKKLIIENSDLVRQLSKVTADARMWKIKYETNK 153
+ L L +++ ++ + + K + +
Sbjct: 814 RELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848
Score = 30.1 bits (68), Expect = 0.53
Identities = 30/122 (24%), Positives = 55/122 (45%), Gaps = 3/122 (2%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ R LE+E+E + + E++ +L ELEEL + LEE ++ E E+
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDEL 879
Query: 62 SKLRRDLEEANIQ---HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
+L + EE + ES LA L+++ + E +L +L+ +L E + +
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEY 939
Query: 119 VA 120
Sbjct: 940 ED 941
Score = 29.7 bits (67), Expect = 0.67
Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 1/106 (0%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ L EE+ A+ + +L RE+E L ERLE E K+ EAE+
Sbjct: 375 EELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAEL 434
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKL 107
+L+ +LEE N + E L + D + E+ ++ +L + +L
Sbjct: 435 EELQTELEELNEELEELEEQLEELR-DRLKELERELAELQEELQRL 479
Score = 29.7 bits (67), Expect = 0.74
Identities = 30/129 (23%), Positives = 62/129 (48%), Gaps = 3/129 (2%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEEL---GERLEEAGGATSAQIELNKKRE 58
Q+R+ ELEEE+E + + +++ +L ELE L +L+E + + ++
Sbjct: 736 QSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEEL 795
Query: 59 AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
E+ + + E E L +L ++ E+ E +++ +L+ KLDE + +L+
Sbjct: 796 EELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELE 855
Query: 119 VARKKLIIE 127
++L E
Sbjct: 856 KELEELKEE 864
Score = 29.7 bits (67), Expect = 0.76
Identities = 33/141 (23%), Positives = 64/141 (45%)
Query: 3 ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMS 62
ELEE++ + + + + ELE + +IE ++R +S
Sbjct: 358 EAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLS 417
Query: 63 KLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARK 122
+ DL+E + E+ L L+ + + E+ E +QL +L+ +L E R + +L +
Sbjct: 418 ERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQ 477
Query: 123 KLIIENSDLVRQLSKVTADAR 143
+L E S L +L ++ A+ R
Sbjct: 478 RLEKELSSLEARLDRLEAEQR 498
Score = 28.9 bits (65), Expect = 1.2
Identities = 29/124 (23%), Positives = 56/124 (45%), Gaps = 4/124 (3%)
Query: 14 VERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANI 73
++S +++ +L EL EL +LE+ + + E + +LRR LEE
Sbjct: 657 RNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELER 716
Query: 74 QHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENSDLVR 133
Q E L+++ E+ + +L +L+ +L+E + +L ++L E L
Sbjct: 717 QLEE----LKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772
Query: 134 QLSK 137
L+K
Sbjct: 773 ALAK 776
Score = 28.1 bits (63), Expect = 2.0
Identities = 29/136 (21%), Positives = 60/136 (44%), Gaps = 12/136 (8%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ + ELE E E + E+++ +L EL EL L E E ++ EA+
Sbjct: 861 LKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAK 920
Query: 61 MSKLRRDLEEANIQ--------HESTLANLRKKHNDAVSEMG----EQIDQLNKLKTKLD 108
+ +L +L E + E+ L ++ + + +G I++ +++ + +
Sbjct: 921 LERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYE 980
Query: 109 EANRTINDLDVARKKL 124
E DL+ A++KL
Sbjct: 981 ELKSQREDLEEAKEKL 996
Score = 27.4 bits (61), Expect = 4.3
Identities = 27/137 (19%), Positives = 61/137 (44%), Gaps = 7/137 (5%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
+Q + E E+E+E + + ++ +L EL EL E +EE +I L ++R E
Sbjct: 258 LQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEE----LEGEISLLRERLEE 313
Query: 61 MSKLRRDLEEANIQHESTLANLRKK---HNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
+ +LEE + + + L+++ + E+ + + +L + K +L+E + +
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEE 373
Query: 118 DVARKKLIIENSDLVRQ 134
+ + E +
Sbjct: 374 LEELFEALREELAELEA 390
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 32.2 bits (74), Expect = 0.10
Identities = 19/104 (18%), Positives = 36/104 (34%), Gaps = 15/104 (14%)
Query: 7 ELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 66
L++EV +Q Q ARE + E A L + E + +L
Sbjct: 146 ALQQEVLTLKQ-------QLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEA 198
Query: 67 DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEA 110
LE+ + T ++K + + ++++ L E
Sbjct: 199 QLEQLQEKAAETSQERKQKRKEITDQAAKRLE--------LSEE 234
>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
Members of this protein family are homologs of ClpB, an
ATPase associated with chaperone-related functions.
These ClpB homologs, designated ClpV1, are a key
component of the bacterial pathogenicity-associated type
VI secretion system [Protein fate, Protein and peptide
secretion and trafficking, Cellular processes,
Pathogenesis].
Length = 852
Score = 31.8 bits (73), Expect = 0.12
Identities = 19/82 (23%), Positives = 28/82 (34%), Gaps = 8/82 (9%)
Query: 1 MQARIYELEEEVEVERQSRAKAEK---QRADLARELEELGERLEEAGGATSAQIELNKKR 57
R+ EL E+ A E Q +L + L LE A +
Sbjct: 448 HDERLAELRAELAALEAELAALEARWQQEKELVEAILALRAELEADADAPADDDAA---L 504
Query: 58 EAEMSKLRRDLEEANIQHESTL 79
A++++L L A Q E L
Sbjct: 505 RAQLAELEAALASA--QGEEPL 524
>gnl|CDD|236508 PRK09426, PRK09426, methylmalonyl-CoA mutase; Reviewed.
Length = 714
Score = 31.8 bits (73), Expect = 0.13
Identities = 33/109 (30%), Positives = 49/109 (44%), Gaps = 23/109 (21%)
Query: 14 VERQSRAKAEKQRADLARELEELG------------ERLEEAGGATSAQIELNKKREAEM 61
VE + AEK A E+E LG R+EEA T A+I+ K+ +
Sbjct: 399 VESLTHELAEKAWA-HIEEVEALGGMAKAIEAGIPKLRIEEAAARTQARIDSGKQVIVGV 457
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEA 110
+K R D E+ ++ + N AV EQI +L +L+ + DEA
Sbjct: 458 NKYRLDKEDP--------IDVLEVDNTAVRA--EQIARLERLRAERDEA 496
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 31.3 bits (72), Expect = 0.16
Identities = 35/149 (23%), Positives = 73/149 (48%), Gaps = 14/149 (9%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ + E +EE+ + R + EK+ + EL++L +RL + ++EL +KRE E+
Sbjct: 56 KEALLEAKEEIH---KLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEEL 112
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
K ++LE+ + E L + + + E+ E+I L +EA + L+
Sbjct: 113 EKKEKELEQKQQELEKKEEELEELIEEQLQEL-ERISGLTA-----EEAKEIL--LEKVE 164
Query: 122 KKLIIENSDLVRQL---SKVTADARMWKI 147
++ E + L++++ +K AD + +I
Sbjct: 165 EEARHEAAVLIKEIEEEAKEEADKKAKEI 193
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 31.3 bits (71), Expect = 0.17
Identities = 13/49 (26%), Positives = 22/49 (44%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSA 49
MQ +++++ E R+ KQR D R +E+LG+ A
Sbjct: 74 MQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKA 122
>gnl|CDD|182871 PRK10966, PRK10966, exonuclease subunit SbcD; Provisional.
Length = 407
Score = 31.4 bits (72), Expect = 0.17
Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 17/94 (18%)
Query: 1 MQARIYELEEEVEVE----RQSRAKAEKQRADLARE-LEELGERLEEAGGATSAQIELNK 55
+Q RI L E + VE R+SR + E+ A RE L EL +EE A EL++
Sbjct: 326 IQRRIQALTESLPVEVLLVRRSREQRERSLASEQRETLSEL--SVEEVFERRLALEELDE 383
Query: 56 KREAEMSKLRRDLEEANIQHESTLANLRKKHNDA 89
++ +++L L L ++H A
Sbjct: 384 PQQQRLTQL----------FTQVLHELAEEHEAA 407
>gnl|CDD|236038 PRK07521, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 483
Score = 31.4 bits (72), Expect = 0.18
Identities = 10/38 (26%), Positives = 20/38 (52%), Gaps = 3/38 (7%)
Query: 77 STLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTI 114
+ + R DA +E+ + +D LN L + ++AN +
Sbjct: 143 DAVQSAR---ADADAEIADSVDTLNDLLAQFEDANNAV 177
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 31.3 bits (71), Expect = 0.21
Identities = 26/135 (19%), Positives = 48/135 (35%)
Query: 10 EEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLE 69
E + +AEK+ ++ A L+E RL A E S R+L
Sbjct: 808 AAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAVERSDTYRELR 867
Query: 70 EANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENS 129
+ E TL + ++L +L+E R I +L+ +L E
Sbjct: 868 KRIAALERTLVRAGGGLSLEALVAEAAALDPDELPARLEELARDIEELEEELNELAQEVG 927
Query: 130 DLVRQLSKVTADARM 144
++L+++ +
Sbjct: 928 AAKQELARMDGGSTA 942
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 31.1 bits (71), Expect = 0.22
Identities = 20/87 (22%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 42 EAGGATSAQIELN----KKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQI 97
AG T ++ ++ K+ E+ + ++LEE + + L L KK + E+
Sbjct: 316 PAGTKTEIEVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPP--EKR 373
Query: 98 DQLNKLKTKLDEANRTINDLDVARKKL 124
+QL KL ++ + + +L+ K+L
Sbjct: 374 EQLEKLLETKEKLSEELEELEEELKEL 400
>gnl|CDD|191367 pfam05761, 5_nucleotid, 5' nucleotidase family. This family of
eukaryotic proteins includes 5' nucleotidase enzymes,
such as purine 5'-nucleotidase EC:3.1.3.5.
Length = 448
Score = 30.7 bits (70), Expect = 0.27
Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 16/68 (23%)
Query: 5 IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 64
+ ELE E+E+ + + E+ + L+ L ERL AE+S L
Sbjct: 323 VPELEREIEIWNTEQYRFEELQ-----RLDILLERLY----DHLDVH-------AELSTL 366
Query: 65 RRDLEEAN 72
R DL E +
Sbjct: 367 RPDLSELD 374
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 30.8 bits (70), Expect = 0.27
Identities = 26/117 (22%), Positives = 55/117 (47%), Gaps = 2/117 (1%)
Query: 10 EEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKRE--AEMSKLRRD 67
E +E +R+ + E + DL E+EE+ ERLE A A+ + + E ++ +L +
Sbjct: 468 ETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAE 527
Query: 68 LEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL 124
E + LR++ + +E E+ + + + + +EA + +L+ +L
Sbjct: 528 RRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAEL 584
Score = 30.0 bits (68), Expect = 0.49
Identities = 31/143 (21%), Positives = 65/143 (45%), Gaps = 3/143 (2%)
Query: 4 RIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEE---AGGATSAQIELNKKREAE 60
+L E ++ E +R +L EEL +RLEE A A + + E ++ +
Sbjct: 294 ERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADD 353
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
+ + +L E + ES L R+ D E+ E +++ +L+ + +A + + +
Sbjct: 354 LEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDF 413
Query: 121 RKKLIIENSDLVRQLSKVTADAR 143
++L E +L + +++ A R
Sbjct: 414 LEELREERDELREREAELEATLR 436
Score = 30.0 bits (68), Expect = 0.53
Identities = 37/132 (28%), Positives = 62/132 (46%), Gaps = 17/132 (12%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAG----------GATSAQI 51
+ + LE E+E R++ A+ E++R +LA E+ +L ERLEE G A
Sbjct: 250 REELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADA 309
Query: 52 ELNKKR----EAEMSKLRRDLEEANIQ---HESTLANLRKKHNDAVSEMGEQIDQLNKLK 104
E + R E +LR LEE + H +LR+ +D E ++ +L+
Sbjct: 310 EAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELE 369
Query: 105 TKLDEANRTIND 116
++L+EA + D
Sbjct: 370 SELEEAREAVED 381
Score = 26.5 bits (59), Expect = 8.6
Identities = 29/118 (24%), Positives = 53/118 (44%), Gaps = 8/118 (6%)
Query: 7 ELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 66
E E+ +E + R E+ A+ +EE ER EE E + EAE + R
Sbjct: 506 EAEDRIERLEERREDLEELIAERRETIEEKRERAEELR-------ERAAELEAEAEEKRE 558
Query: 67 DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL 124
EA + E + + N ++E+ E+I+ L +++T L ++++ R+K
Sbjct: 559 AAAEAEEEAEEAREEV-AELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKR 615
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 30.6 bits (69), Expect = 0.32
Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 1 MQARIYELEEEVEVERQ--SRAKAEKQR-ADLARELEELGERLEEAGGATSAQIELNKKR 57
RI EL+ E++ R S +AE +R + L++EL+E E LE G +I+ K R
Sbjct: 32 YLQRILELQAELKQLRAELSNVQAENERLSSLSQELKEENEMLELQRGRLRDEIKEYKFR 91
Query: 58 EAEMSKLRRDLEEANIQHESTLANLR 83
EA + + +LEE NI + ++ LR
Sbjct: 92 EARLLQDYSELEEENISLQKQVSVLR 117
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 30.4 bits (69), Expect = 0.35
Identities = 21/116 (18%), Positives = 45/116 (38%)
Query: 26 RADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKK 85
R L + + E L+ G + +E + E KL ++EE + + K
Sbjct: 4 RKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKA 63
Query: 86 HNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTAD 141
++ E +L +LK +L E + + L+ + ++ ++ + V D
Sbjct: 64 KGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSIPNIPHESVPVGKD 119
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.7 bits (67), Expect = 0.60
Identities = 32/120 (26%), Positives = 55/120 (45%)
Query: 5 IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 64
+ LEE +E + K E + +LA E EL + LEE ++E +K + +
Sbjct: 324 LKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALER 383
Query: 65 RRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL 124
+ LEEA + + LA L + E+ E +L +L+ +L+E I L+ +L
Sbjct: 384 LKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQL 443
>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
production and conversion].
Length = 660
Score = 29.6 bits (67), Expect = 0.60
Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 5/72 (6%)
Query: 3 ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEA----GGATSAQIELNKKRE 58
+YE+ E + ++ E+ A++ ELE L LE A E+ + E
Sbjct: 215 FELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLAVREI-LEIE 273
Query: 59 AEMSKLRRDLEE 70
+ + L
Sbjct: 274 KALGDVLSKLAR 285
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 29.9 bits (68), Expect = 0.61
Identities = 25/112 (22%), Positives = 45/112 (40%), Gaps = 26/112 (23%)
Query: 49 AQIELNKKR---EAEMSKLRRDLEEA-----NIQ-HESTLANLRKKHNDAVSEMGEQIDQ 99
AQ++ K+ EAE +++DLE+ I + L+++ A +++ + +
Sbjct: 43 AQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAE 102
Query: 100 LNKLKTKLDEA-----------------NRTINDLDVARKKLIIENSDLVRQ 134
L LK DE +T++ L A+ L NS LV
Sbjct: 103 LEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSL 154
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 29.7 bits (67), Expect = 0.61
Identities = 27/140 (19%), Positives = 49/140 (35%), Gaps = 20/140 (14%)
Query: 3 ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGG----------------- 45
AR+ EL+++ + E S +L R++EEL +E
Sbjct: 750 ARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWL 809
Query: 46 ATSAQIELNKKREAEMSKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNK 102
+ E ++ +L EE + RKK + +Q+DQL++
Sbjct: 810 HRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDE 869
Query: 103 LKTKLDEANRTINDLDVARK 122
L L + R + +L
Sbjct: 870 LLRGLRDEMRQLAELKEPAN 889
Score = 27.0 bits (60), Expect = 5.9
Identities = 23/123 (18%), Positives = 52/123 (42%), Gaps = 8/123 (6%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ R+ + EE ++ + +AE+Q ELEE EA A K+ +
Sbjct: 605 LRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTAL-------KQARLD 657
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQL-NKLKTKLDEANRTINDLDV 119
+ +L+ + + + E +A +++ + ++ Q+ QL + + L+ +L
Sbjct: 658 LQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRT 717
Query: 120 ARK 122
R
Sbjct: 718 ERL 720
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 29.5 bits (66), Expect = 0.65
Identities = 19/70 (27%), Positives = 37/70 (52%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ EL+E++E ++ + + K+ +L E EE+ ERL+ S E+ KK E
Sbjct: 133 LKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGE 192
Query: 61 MSKLRRDLEE 70
+ L++ +E
Sbjct: 193 VYDLKKRWDE 202
Score = 26.0 bits (57), Expect = 9.7
Identities = 20/73 (27%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 28 DLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHN 87
DL + EEL E+LEE ++ ++ EAE +++ L+ ++ S L + KK
Sbjct: 132 DLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVE-NSRLEEMLKKLP 190
Query: 88 DAVSEMGEQIDQL 100
V ++ ++ D+L
Sbjct: 191 GEVYDLKKRWDEL 203
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 29.6 bits (67), Expect = 0.69
Identities = 19/89 (21%), Positives = 38/89 (42%)
Query: 29 LARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHND 88
+ R+ E L + L + + + E++ LR+ +E N+ L R++
Sbjct: 159 MKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRS 218
Query: 89 AVSEMGEQIDQLNKLKTKLDEANRTINDL 117
++ E+ QL +L+ +L E I L
Sbjct: 219 LSVKISEKRKQLEELQQELQELTIAIEAL 247
>gnl|CDD|219412 pfam07439, DUF1515, Protein of unknown function (DUF1515). This
family consists of several hypothetical bacterial
proteins of around 130 residues in length. Members of
this family seem to be found exclusively in Rhizobium
species. The function of this family is unknown.
Length = 112
Score = 28.4 bits (63), Expect = 0.74
Identities = 15/49 (30%), Positives = 28/49 (57%), Gaps = 3/49 (6%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSA 49
+QA + L E++ R+S +++ RA + R ++EL ER+ G+ S
Sbjct: 13 LQAEVKNLREDI---RRSEDRSDASRASMYRRMDELVERVTTLEGSVSK 58
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 29.6 bits (66), Expect = 0.81
Identities = 32/108 (29%), Positives = 52/108 (48%), Gaps = 21/108 (19%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++A+ ELE E+ RA+AE RA LA EE+ + LEE G
Sbjct: 96 LEAQAEELEALAVAEKAGRAEAEGLRAALAGA-EEVRKNLEEGG---------------- 138
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLD 108
+++LEEA H+ L++L + H +A+S + + + L K L+
Sbjct: 139 ----QQELEEAQRLHQEQLSSLTQAHLEALSSLRSKAEGLEKSLQSLE 182
>gnl|CDD|224202 COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and
metabolism].
Length = 533
Score = 29.1 bits (66), Expect = 0.84
Identities = 18/93 (19%), Positives = 33/93 (35%), Gaps = 10/93 (10%)
Query: 55 KKREAEMSKLRRDLEEANIQHESTLANLRKKHN-------DAVSEMGEQIDQLNKLKTKL 107
+R AE+ +LE E L KK D + E+ + L
Sbjct: 399 SRRWAEIIDAAINLEHIGDIIERLLELADKKIANGRAFSEDGLEELDALFALTLEN---L 455
Query: 108 DEANRTINDLDVARKKLIIENSDLVRQLSKVTA 140
A + D+ + ++E VR+L + ++
Sbjct: 456 RLAISVLVTGDLELARRLVERKKRVRRLERRSS 488
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 28.8 bits (65), Expect = 1.1
Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 3/117 (2%)
Query: 1 MQARIYELEEEVEVERQS-RAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREA 59
+ A + E ++ ++ ++ RA L L ++ + E G + L ++ E
Sbjct: 117 LSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEV 176
Query: 60 EMSKL--RRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTI 114
+L RR L+E + L L ++ +A E+G +L LK ++DE
Sbjct: 177 ISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELEVLKRQIDELQLER 233
Score = 26.9 bits (60), Expect = 5.1
Identities = 26/123 (21%), Positives = 61/123 (49%), Gaps = 12/123 (9%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAG-GATSAQIELNKKR-- 57
+ A+I +LE E+ Q++ +A +Q+ ++ E E +L+E G + +EL ++R
Sbjct: 149 ILAQIKQLEAELA-GLQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAE 207
Query: 58 --------EAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDE 109
EAE+ L+R ++E ++ + R++ + ++E ++ +L + K +
Sbjct: 208 AQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEVLEELTEAQARLAELRERLNKARD 267
Query: 110 ANR 112
+
Sbjct: 268 RLQ 270
>gnl|CDD|188998 cd06460, M32_Taq, Peptidase family M32 includes thermostable
carboxypeptidases TaqCP and PfuCP. Peptidase family M32
is a subclass of metallocarboxypeptidases which are
distributed mainly in bacteria and archaea, and contain
a HEXXH motif that coordinates a divalent cation such as
Zn2+ or Co2+, so far only observed in the active site of
neutral metallopeptidases but not in carboxypeptidases.
M32 includes the thermostable carboxypeptidases (E.C.
3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus
furiosus (PfuCP), which have broad specificities toward
a wide range of C-terminal substrates that include
basic, aromatic, neutral and polar amino acids. These
enzymes have a similar fold to the M3 peptidases such as
neurolysin and the M2 angiotensin converting enzyme
(ACE). Novel peptidases from protozoa Trypanosoma cruzi,
a causative agent of Chagas' disease, and Leishmania
major, a parasite that causes leishmaniasis, are the
first eukaryotic M32 enzymes identified so far, thus
making these enzymes an attractive potential target for
drug development against these organisms.
Length = 484
Score = 29.0 bits (66), Expect = 1.1
Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 8/70 (11%)
Query: 77 STLANLRKKHNDAVS-EMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIEN---SDLV 132
+TL+ L H S E+GE ++ L + + LDE + R+ ++LV
Sbjct: 36 ATLSGLA--HELLTSPEIGELLEALEEEELALDEVEAAN--VRELRRDYERATKIPAELV 91
Query: 133 RQLSKVTADA 142
+L+++T++A
Sbjct: 92 EELARLTSEA 101
>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex. Med21 has been
known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
Drosophila. The heterodimer of the two subunits Med7 and
Med21 appears to act as a hinge between the middle and
the tail regions of Mediator.
Length = 132
Score = 27.6 bits (62), Expect = 1.8
Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 7/42 (16%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEA 43
RI ELEEE+ E +R + +E E+L +++E
Sbjct: 91 LRRIKELEEELRE-------VEAEREEAVKEKEKLLKKVEAL 125
>gnl|CDD|109884 pfam00846, Hanta_nucleocap, Hantavirus nucleocapsid protein.
Length = 428
Score = 28.4 bits (63), Expect = 1.9
Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 14/76 (18%)
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNK------------LKTKLD 108
MS L +L+E HE L R+K DA + D +NK L+ KL
Sbjct: 1 MSNLE-ELQEEITAHEQQLVIARQKLKDAEKAVEVDPDDVNKSTLQSRRAAVSALEAKLA 59
Query: 109 EANRTINDLDVARKKL 124
E R + D +A +KL
Sbjct: 60 ELKRQLADR-IATQKL 74
>gnl|CDD|137505 PRK09751, PRK09751, putative ATP-dependent helicase Lhr;
Provisional.
Length = 1490
Score = 28.4 bits (63), Expect = 2.0
Identities = 17/54 (31%), Positives = 29/54 (53%), Gaps = 6/54 (11%)
Query: 5 IYELEEEVE-VERQSRAKAEKQRADLAREL-----EELGERLEEAGGATSAQIE 52
I ++EEE++ + RAK E+ DL REL E+L +R + ++ +E
Sbjct: 871 IRQVEEELQRLAPGRRAKGEEGLFDLLRELGPMTVEDLAQRHTGSSEEVASYLE 924
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 28.1 bits (63), Expect = 2.2
Identities = 25/138 (18%), Positives = 57/138 (41%), Gaps = 9/138 (6%)
Query: 9 EEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDL 68
EE + + + ++ + +++L E +E S ++ + E+ KL +L
Sbjct: 400 EERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESEL 459
Query: 69 EEANIQHESTLANLRKKHNDAVSEMGEQIDQLNK----LKTKLDEANRTINDLDVARKKL 124
E + + R+ + +I++L K K +++E R + +L RK
Sbjct: 460 ERFRREVRDKVRKDRE-----IRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLE 514
Query: 125 IIENSDLVRQLSKVTADA 142
+ V+ + K+T +A
Sbjct: 515 LSGKGTPVKVVEKLTLEA 532
>gnl|CDD|129660 TIGR00569, ccl1, cyclin ccl1. All proteins in this family for
which functions are known are cyclins that are
components of TFIIH, a complex that is involved in
nucleotide excision repair and transcription initiation.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, StanfordUniversity) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 305
Score = 27.9 bits (62), Expect = 2.2
Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 8/82 (9%)
Query: 58 EAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
E R+ I L L KK+ + SE ++ LK KLDE + +L
Sbjct: 230 EQLSVPGNREELPQLIDIMRELRILVKKYEEPRSE------KVAALKKKLDECHSIEENL 283
Query: 118 DVA--RKKLIIENSDLVRQLSK 137
RK+ E+ + + +K
Sbjct: 284 SSVKIRKRKGYEDDTYISKKAK 305
>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1104
Score = 28.1 bits (62), Expect = 2.6
Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 12/102 (11%)
Query: 2 QARIYELEEEVEVERQSRAKAE--KQRADLARELEELGERLEEAGGATSAQIELNKKREA 59
AR+ L EE+ + R S AK E +++ EL+++ E L L +
Sbjct: 789 NARLRRLREEI-IGRMSDAKKEDTAALSEVGAELDDIPEYLARLQ-------TLTEDALP 840
Query: 60 EMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLN 101
E L R E N + + L + + + E+I+ +N
Sbjct: 841 E--FLARFQELLNRSSDDGVTQLLSHLDHERALIEERIEAIN 880
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 27.6 bits (62), Expect = 3.0
Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 12/93 (12%)
Query: 38 ERLEEAGGATSAQIE-LNKKRE--AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMG 94
+ + G + +IE + K E A K R++ EA + E + +L K + E G
Sbjct: 493 ITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKS----LKEEG 548
Query: 95 EQIDQLNKLKTKLDEANRTINDLDVARKKLIIE 127
+++ + K K++EA I L + E
Sbjct: 549 DKLPE--ADKKKVEEA---IEWLKEELEGEDKE 576
>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407). This
family of proteins is found in bacteria. Proteins in
this family are typically between 366 and 597 amino
acids in length. There is a single completely conserved
residue R that may be functionally important.
Length = 297
Score = 27.6 bits (62), Expect = 3.0
Identities = 9/52 (17%), Positives = 19/52 (36%)
Query: 71 ANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARK 122
IQ E A + ++ ++ L+ ++DEA +N +
Sbjct: 115 LEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQ 166
>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
exopolysaccharide biosynthesis [Cell envelope
biogenesis, outer membrane].
Length = 458
Score = 27.4 bits (61), Expect = 3.2
Identities = 32/137 (23%), Positives = 59/137 (43%), Gaps = 15/137 (10%)
Query: 12 VEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQI-ELNKKREAEMSKLRRDLE- 69
++ RQ A+ +Q ADL+ EL +L A AQ+ EL ++ AE+ ++ L
Sbjct: 287 IQDLRQQYAQVRQQIADLSTELGAKHPQLV----ALEAQLAELRQQIAAELRQILASLPN 342
Query: 70 --EANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIE 127
Q E+ L + +S++ + QL +L+ + + A L ++L I
Sbjct: 343 ELALLEQQEAALEKELAQLKGRLSKLPKLQVQLRELEREAEAARSLYETLLQRYQELSI- 401
Query: 128 NSDLVRQLSKVTADARM 144
Q + +AR+
Sbjct: 402 ------QEASPIGNARV 412
>gnl|CDD|111809 pfam02961, BAF, Barrier to autointegration factor. The BAF
protein has a SAM-domain-like bundle of orthogonally
packed alpha-hairpins - one classic and one pseudo
helix-hairpin-helix motif. The protein is involved in
the prevention of retroviral DNA integration.
Length = 89
Score = 26.2 bits (58), Expect = 3.4
Identities = 19/63 (30%), Positives = 25/63 (39%), Gaps = 4/63 (6%)
Query: 23 EKQRADLARELEELGERLEEAGGATS----AQIELNKKREAEMSKLRRDLEEANIQHEST 78
+K LA E LG RLE+ G + Q L KK E + +D AN +
Sbjct: 18 DKPVTALAGIGEVLGGRLEDKGFDKAYVVLGQFLLLKKDEELFKEWLKDTCGANAKQARD 77
Query: 79 LAN 81
N
Sbjct: 78 CYN 80
>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
C4-dicarboxylate transport system [Signal transduction
mechanisms].
Length = 603
Score = 27.3 bits (61), Expect = 3.9
Identities = 16/59 (27%), Positives = 29/59 (49%)
Query: 14 VERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEAN 72
R A+ ++ RA+L R +EE L A A+I ++ EA + + + +L +A
Sbjct: 321 RARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRRAQDELVQAG 379
>gnl|CDD|198091 smart01023, BAF, Barrier to autointegration factor.
Barrier-to-autointegration factor (BAF) is an essential
protein that is highly conserved in metazoan evolution,
and which may act as a DNA-bridging protein. BAF binds
directly to double-stranded DNA, to transcription
activators, and to inner nuclear membrane proteins,
including lamin A filament proteins that anchor
nuclear-pore complexes in place, and nuclear LEM-domain
proteins that bind to laminins filaments and chromatin.
New findings suggest that BAF has structural roles in
nuclear assembly and chromatin organization, represses
gene expression and might interlink chromatin
structure, nuclear architecture and gene regulation in
metazoans. BAF can be exploited by retroviruses to act
as a host component of pre-integration complexes, which
promote the integration of the retroviral DNA into the
host chromosome by preventing autointegration of
retroviral DNA. BAF might contribute to the assembly or
activity of retroviral pre-integration complexes
through direct binding to the retroviral proteins p55
Gag and matrix, as well as to DNA.
Length = 87
Score = 26.2 bits (58), Expect = 3.9
Identities = 20/63 (31%), Positives = 24/63 (38%), Gaps = 4/63 (6%)
Query: 23 EKQRADLARELEELGERLEEAGGATS----AQIELNKKREAEMSKLRRDLEEANIQHEST 78
EK LA E LG RLE G + Q L KK E + +D AN +
Sbjct: 16 EKPVTALAGIGEVLGGRLETKGFDKAYVVLGQFLLLKKDEELFKEWLKDTCGANAKQARD 75
Query: 79 LAN 81
N
Sbjct: 76 CYN 78
>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
family. This model represents a relatively
well-conserved region near the C-terminus of the tape
measure protein of a lambda and related phage. This
protein, which controls phage tail length, is typically
about 1000 residues in length. Both low-complexity
sequence and insertion/deletion events appear common in
this family. Mutational studies suggest a ruler or
template role in the determination of phage tail length.
Similar behavior is attributed to proteins from
distantly related or unrelated families in other phage
[Mobile and extrachromosomal element functions, Prophage
functions].
Length = 332
Score = 27.1 bits (60), Expect = 3.9
Identities = 12/57 (21%), Positives = 24/57 (42%), Gaps = 2/57 (3%)
Query: 46 ATSAQIELNKKRE--AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQL 100
T+ + L + + + KL + LEEA + L L + ++ Q+D+
Sbjct: 19 NTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASIRAQNKRQLDRF 75
>gnl|CDD|168351 PRK06023, PRK06023, enoyl-CoA hydratase; Provisional.
Length = 251
Score = 27.1 bits (60), Expect = 3.9
Identities = 18/73 (24%), Positives = 38/73 (52%), Gaps = 11/73 (15%)
Query: 5 IYELEEEVEVERQSRAKAEK------QRADLAREL-----EELGERLEEAGGATSAQIEL 53
I+++ +E VE ++ AE+ Q +AR+L E++ R++E +A+++
Sbjct: 179 IWKIVDEEAVEAETLKAAEELAAKPPQALQIARDLMRGPREDILARIDEEAKHFAARLKS 238
Query: 54 NKKREAEMSKLRR 66
+ R A + +RR
Sbjct: 239 AEARAAFEAFMRR 251
>gnl|CDD|233004 TIGR00513, accA, acetyl-CoA carboxylase, carboxyl transferase,
alpha subunit. The enzyme acetyl-CoA carboxylase
contains a biotin carboxyl carrier protein or domain, a
biotin carboxylase, and a carboxyl transferase. This
model represents the alpha chain of the carboxyl
transferase for cases in which the architecture of the
protein is as in E. coli, in which the
carboxyltransferase portion consists of two
non-identical subnits, alpha and beta [Fatty acid and
phospholipid metabolism, Biosynthesis].
Length = 316
Score = 27.1 bits (60), Expect = 4.0
Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 3/38 (7%)
Query: 5 IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEE 42
I ELE ++E RA++ + DL+ E+E L +R E
Sbjct: 12 IAELEAKIE---SLRARSRDEDVDLSEEIERLEKRSVE 46
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 27.3 bits (61), Expect = 4.0
Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 10/87 (11%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
+Q ++ ELE E + E R K K+ +E+ GE LEE + ++ ++ E +
Sbjct: 89 LQRKLKELEREQKEEEV-REKHNKKI------IEKFGEDLEEVYKFMKGEAKVEEEEEKQ 141
Query: 61 MSKLRRDLEEAN--IQHES-TLANLRK 84
M L + L+ ++ E+ +L L K
Sbjct: 142 MEILEKALKSYLKIVKEENKSLQRLAK 168
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 27.2 bits (61), Expect = 4.2
Identities = 23/72 (31%), Positives = 42/72 (58%), Gaps = 4/72 (5%)
Query: 4 RIYELE-EEVEVERQSRAKAEKQRADLARELEELGER---LEEAGGATSAQIELNKKREA 59
RI +LE E ++++ ++++ DL +EL EL E LEE A A I+ ++ +
Sbjct: 413 RIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKE 472
Query: 60 EMSKLRRDLEEA 71
E+ ++R +LE+A
Sbjct: 473 EIEQVRLELEQA 484
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 26.8 bits (60), Expect = 4.3
Identities = 26/140 (18%), Positives = 58/140 (41%), Gaps = 29/140 (20%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ R+ +L++E+E +Q A+ + Q L + EE ER E L + ++
Sbjct: 67 LKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTE----------LLEELKQ-- 114
Query: 61 MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDV- 119
LE+ + ++ L K + + ++ E+ K + ANR +++
Sbjct: 115 -------LEKELKKLKAELEKYEKNDPERIEKLKEET------KVAKEAANRWTDNIFSL 161
Query: 120 ---ARKKLIIENSDLVRQLS 136
KK ++ + L ++
Sbjct: 162 KSYLSKKFGLDEAQLRKEFG 181
>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
PFL_4705 family. Members of this protein family are
found occasionally on plasmids such as the Pseudomonas
putida toluene catabolic TOL plasmid pWWO_p085. Usually,
however, they are found on the bacterial main chromosome
in regions flanked by markers of conjugative transfer
and/or transposition [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 472
Score = 27.3 bits (61), Expect = 4.4
Identities = 19/95 (20%), Positives = 44/95 (46%), Gaps = 18/95 (18%)
Query: 27 ADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKH 86
L E++EL +RL + I N+ +AE +L++ + + Q + + + +
Sbjct: 62 RTLVAEVKELRKRLAKL-------ISENEALKAENERLQKREQSIDQQIQQAVQSETQ-- 112
Query: 87 NDAVSEMGEQIDQLN----KLKTKLDEANRTINDL 117
E+ ++I+QL +L+ +D+ R + +
Sbjct: 113 -----ELTKEIEQLKSERQQLQGLIDQLQRRLAGV 142
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 26.9 bits (60), Expect = 4.7
Identities = 19/70 (27%), Positives = 37/70 (52%)
Query: 2 QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
+ R EL +E++ E+Q A+A Q L +++ L +L A A + +++ +A++
Sbjct: 115 EGRAGELAQELDSEKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKI 174
Query: 62 SKLRRDLEEA 71
+ L R L A
Sbjct: 175 ADLGRRLNVA 184
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 26.5 bits (59), Expect = 4.8
Identities = 18/112 (16%), Positives = 49/112 (43%), Gaps = 15/112 (13%)
Query: 4 RIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSK 63
R+ EL++E+ ++ A+ ++Q A+L +E +EL + L ++ + E+++
Sbjct: 67 RLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAE-------LERLQKELAR 119
Query: 64 LRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTIN 115
+++ A L +++ + E+ E + L+ + +
Sbjct: 120 IKQLSANAI--------ELDEENRELREELAELKQENEALEAENERLQENEQ 163
>gnl|CDD|224176 COG1256, FlgK, Flagellar hook-associated protein [Cell motility and
secretion].
Length = 552
Score = 27.0 bits (60), Expect = 4.9
Identities = 21/63 (33%), Positives = 29/63 (46%), Gaps = 7/63 (11%)
Query: 77 STLANLRKKHNDA-------VSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENS 129
L +LRK N V+ + +QI LNK K+ A NDL R +L+ E S
Sbjct: 152 EQLTDLRKDINAEIAATVDEVNSLLKQIADLNKQIRKVKAAGNDPNDLLDQRDQLVDELS 211
Query: 130 DLV 132
L+
Sbjct: 212 QLI 214
>gnl|CDD|218602 pfam05478, Prominin, Prominin. The prominins are an emerging
family of proteins that among the multispan membrane
proteins display a novel topology. Mouse prominin and
human prominin (mouse)-like 1 (PROML1) are predicted to
contain five membrane spanning domains, with an
N-terminal domain exposed to the extracellular space
followed by four, alternating small cytoplasmic and
large extracellular, loops and a cytoplasmic C-terminal
domain. The exact function of prominin is unknown
although in humans defects in PROM1, the gene coding for
prominin, cause retinal degeneration.
Length = 807
Score = 26.9 bits (60), Expect = 5.0
Identities = 17/89 (19%), Positives = 29/89 (32%), Gaps = 26/89 (29%)
Query: 27 ADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKH 86
A EL+ L L +++ K EA +LR
Sbjct: 626 NSFAEELDALANNLPNGK------LKVALKNEAL--------------------DLRSIQ 659
Query: 87 NDAVSEMGEQIDQLNKLKTKLDEANRTIN 115
+ V M +++LN LDE++ +
Sbjct: 660 QNFVPPMESLLEKLNSNVKTLDESSNDLQ 688
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 27.1 bits (60), Expect = 5.2
Identities = 27/150 (18%), Positives = 54/150 (36%), Gaps = 12/150 (8%)
Query: 17 QSRAKAEKQRADLAREL-----EELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEA 71
+S A QR + + GE EE L K+ E + L + L+E
Sbjct: 482 ESYAFEIAQRYGIPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQ 541
Query: 72 NIQHESTLANLRKKHNDAVSEMGEQIDQ-----LNKLKTKLDEANRTINDLDVARKKLII 126
+ L ++ + ++++ L LK +++ R + + + + K I
Sbjct: 542 EKLKKE-LEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIK 600
Query: 127 ENSDLVRQLSKVTADARMWKIKYETNKPKD 156
DLV +L + ++ +K D
Sbjct: 601 SIEDLV-KLKETKQKIPQKPTNFQADKIGD 629
>gnl|CDD|218618 pfam05508, Ran-binding, RanGTP-binding protein. The small Ras-like
GTPase Ran plays an essential role in the transport of
macromolecules in and out of the nucleus and has been
implicated in spindle and nuclear envelope formation
during mitosis in higher eukaryotes. The S. cerevisiae
ORF YGL164c encoding a novel RanGTP-binding protein,
termed Yrb30p was identified. The protein competes with
yeast RanBP1 (Yrb1p) for binding to the GTP-bound form
of yeast Ran (Gsp1p) and is, like Yrb1p, able to form
trimeric complexes with RanGTP and some of the
karyopherins.
Length = 302
Score = 26.5 bits (59), Expect = 6.1
Identities = 11/43 (25%), Positives = 23/43 (53%)
Query: 28 DLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEE 70
L RE++E ERL+EA + ++ + + K+ R +++
Sbjct: 88 SLRREIDEFDERLDEAASSEELSSRTSENEKESIKKIERYMKD 130
>gnl|CDD|239651 cd03679, MM_CoA_mutase_alpha_like, Coenzyme
B12-dependent-methylmalonyl coenzyme A (CoA) mutase
(MCM) family, Alpha subunit-like subfamily; contains
proteins similar to the alpha subunit of
Propionbacterium shermanni MCM, as well as human and E.
coli MCM. Members of this subfamily contain an
N-terminal MCM domain and a C-terminal coenzyme B12
binding domain. MCM catalyzes the isomerization of
methylmalonyl-CoA to succinyl-CoA. The reaction proceeds
via radical intermediates beginning with a
substrate-induced homolytic cleavage of the Co-C bond of
coenzyme B12 to produce cob(II)alamin and the
deoxyadenosyl radical. MCM plays an important role in
the conversion of propionyl-CoA to succinyl-CoA during
the degradation of propionate for the Krebs cycle. In
higher animals, MCM is involved in the breakdown of
odd-chain fatty acids, several amino acids, and
cholesterol. Methylobacterium extorquens MCM
participates in the glyoxylate regeneration pathway. In
M. extorquens, MCM forms a complex with MeaB; MeaB may
protect MCM from irreversible inactivation. In some
bacteria, MCM is involved in the reverse metabolic
reaction, the rearrangement of succinyl-CoA to
methylmalonyl-CoA. Examples include P. shermanni MCM
during propionic acid fermentation, E.coli MCM in a
pathway for the conversion of succinate to propionate
and Streptomyces MCM in polyketide biosynthesis.
Sinorhizobium meliloti strain SU47 MCM plays a role in
the polyhydroxyalkanoate degradation pathway. P.
shermanni and Streptomyces cinnamonensis MCMs are
alpha/beta heterodimers. It has been shown for P.
shermanni MCM that only the alpha subunit binds coenzyme
B12 and substrates. Human MCM is a homodimer with two
active sites. Mouse and E.coli MCMs are also homodimers.
In humans, impaired activity of MCM results in
methylmalonic aciduria, a disorder of propionic acid
metabolism.
Length = 536
Score = 26.6 bits (59), Expect = 6.2
Identities = 32/109 (29%), Positives = 48/109 (44%), Gaps = 23/109 (21%)
Query: 14 VERQSRAKAEKQRADLARELEELG------------ERLEEAGGATSAQIELNKKREAEM 61
+E + AEK A + E+EELG R+EEA A+I+ ++ +
Sbjct: 393 MESLTDDLAEKAWALIQ-EIEELGGMAKAIESGIPKLRIEEAAARRQARIDSGREVIVGV 451
Query: 62 SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEA 110
+K R D EE + + K N AV EQI +L KL+ + D
Sbjct: 452 NKYRLDHEE---PLD-----VLKIDNTAVRA--EQIARLKKLRAERDPE 490
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 26.6 bits (59), Expect = 6.2
Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 4/80 (5%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
+Q I LE+++ ++ RAK EKQ L E+ LE T+ ++ +K+ A+
Sbjct: 43 IQKEIAALEKKIREQQDQRAKLEKQLKSLETEI----ASLEAQLIETADDLKKLRKQIAD 98
Query: 61 MSKLRRDLEEANIQHESTLA 80
++ LE + LA
Sbjct: 99 LNARLNALEVQEREQRRRLA 118
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 26.8 bits (59), Expect = 6.4
Identities = 31/127 (24%), Positives = 56/127 (44%), Gaps = 10/127 (7%)
Query: 5 IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKRE-AEMSK 63
+ +E+ ++ E SR AEKQ A+ E+ ++ EE A +A + E AE K
Sbjct: 478 MQSMEKRLKSEADSRVNAEKQLAE-----EKKRKKEEEETAARAAAQAAASREECAESLK 532
Query: 64 LRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKK 123
+ E I+ L + K + + ++ +L K + E ++ L + K
Sbjct: 533 QAKQDLEMEIKK---LEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDK 589
Query: 124 -LIIENS 129
L++ENS
Sbjct: 590 NLMLENS 596
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 26.6 bits (59), Expect = 6.5
Identities = 10/42 (23%), Positives = 19/42 (45%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEE 42
+Y++ E + +K K+ +L R LE+ LE+
Sbjct: 206 FGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESHLEK 247
>gnl|CDD|214396 CHL00206, ycf2, Ycf2; Provisional.
Length = 2281
Score = 26.8 bits (59), Expect = 6.6
Identities = 15/52 (28%), Positives = 23/52 (44%), Gaps = 7/52 (13%)
Query: 87 NDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARK-KLIIENSDLVRQLSK 137
N + +G QI L KLK L ++D D ++K K +I + L
Sbjct: 920 NQLLESIGLQIVHLKKLKPFL------LDDHDTSQKSKFLINGGTISPFLFN 965
>gnl|CDD|235558 PRK05683, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 676
Score = 26.6 bits (59), Expect = 6.6
Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 7/50 (14%)
Query: 88 DAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSK 137
D V+ + I NK + + T NDL AR + VRQL++
Sbjct: 166 DQVNNLTTSIASYNKQIAQASASGATPNDLLDAR-------DEAVRQLNE 208
>gnl|CDD|218489 pfam05192, MutS_III, MutS domain III. This domain is found in
proteins of the MutS family (DNA mismatch repair
proteins) and is found associated with pfam00488,
pfam05188, pfam01624 and pfam05190. The MutS family of
proteins is named after the Salmonella typhimurium MutS
protein involved in mismatch repair; other members of
the family included the eukaryotic MSH 1,2,3, 4,5 and 6
proteins. These have various roles in DNA repair and
recombination. Human MSH has been implicated in
non-polyposis colorectal carcinoma (HNPCC) and is a
mismatch binding protein. The aligned region corresponds
with domain III, which is central to the structure of
Thermus aquaticus MutS as characterized in.
Length = 290
Score = 26.2 bits (58), Expect = 7.0
Identities = 22/120 (18%), Positives = 49/120 (40%), Gaps = 8/120 (6%)
Query: 24 KQRADLARELEELGERLEEAGGATSAQIELNKKRE---AEMSKLRRDLEEANIQHESTLA 80
+L +L EL ER E G S ++ N+ E+ + D + ST
Sbjct: 166 ALLDELREKLAELLERERERTGIKSLKVGYNRVFGYYVIEVKASKADKVPGDYIRRSTTK 225
Query: 81 NLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTA 140
N + + E+ ++ + + L + +L ++++ E ++L+++L+ A
Sbjct: 226 NAVRFTTPELKELERKLLEAE--ERALALEKEILEEL---LERVLSEYAELLQELADAIA 280
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 26.4 bits (59), Expect = 7.0
Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 5/64 (7%)
Query: 8 LEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRD 67
LE++ E RQ A EKQ+ +LA+E E + G A +++ + K R + KL
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWI-----RRGKAAASKAKKAKSRIKRLEKLEAR 293
Query: 68 LEEA 71
L E
Sbjct: 294 LAEE 297
>gnl|CDD|206167 pfam13997, YqjK, YqjK-like protein. The YqjK-like protein family
includes the E. coli YqjK protein, which is
functionally uncharacterized. This family of proteins
is found in bacteria. Proteins in this family are
approximately 100 amino acids in length. There is a
single completely conserved residue R that may be
functionally important.
Length = 73
Score = 24.9 bits (55), Expect = 7.1
Identities = 7/22 (31%), Positives = 10/22 (45%)
Query: 20 AKAEKQRADLARELEELGERLE 41
+ ++QR DLA E E
Sbjct: 3 QRIQQQRLDLAAHAEPWLEATA 24
>gnl|CDD|235580 PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase
subunit alpha; Validated.
Length = 319
Score = 26.3 bits (59), Expect = 7.6
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 3/39 (7%)
Query: 5 IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEA 43
I ELE ++E + RA AE DL+ E+E L ++LEE
Sbjct: 12 IAELEAKIE---ELRAVAEDSDVDLSEEIERLEKKLEEL 47
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 26.7 bits (59), Expect = 7.7
Identities = 18/95 (18%), Positives = 36/95 (37%), Gaps = 9/95 (9%)
Query: 15 ERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQ 74
+ +K ++ E E ++ + G + K + +K RD+ +
Sbjct: 176 QEAVLPLVKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKK--AKTDRDVSTST-- 231
Query: 75 HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDE 109
A +KK +D S++ Q +L LK L +
Sbjct: 232 -----AASQKKSSDLESKLEAQSKELWSLKDDLKK 261
>gnl|CDD|227184 COG4847, COG4847, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 103
Score = 25.6 bits (56), Expect = 8.1
Identities = 16/34 (47%), Positives = 20/34 (58%)
Query: 3 ARIYELEEEVEVERQSRAKAEKQRADLARELEEL 36
A I E EE E RQ R + EK DL +++EEL
Sbjct: 70 AVIAESEEAKESIRQVRQEVEKLAGDLTKKIEEL 103
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 26.2 bits (58), Expect = 8.4
Identities = 17/78 (21%), Positives = 39/78 (50%), Gaps = 13/78 (16%)
Query: 1 MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
++ARI +L+EE+E +R+ + ++ A +L SA +L K+R ++
Sbjct: 76 IRARISQLKEEIEQKRERIEELKRALAQRRSDLS-------------SASYQLEKRRASQ 122
Query: 61 MSKLRRDLEEANIQHEST 78
+ KL+ +++ + +
Sbjct: 123 LEKLQDEIKRTRSKLNAL 140
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 26.0 bits (58), Expect = 8.8
Identities = 20/89 (22%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 50 QIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDE 109
+I KK E KL ++ + N + L ++ V E+ +++ K K L
Sbjct: 35 EIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEE----VEELRKKLKDYEKDKQSLKN 90
Query: 110 ANRTINDLDVARKKLIIENSDLVRQLSKV 138
+ +L+ K L E+ L ++ KV
Sbjct: 91 LKARLKELEKELKNLKWESEVLEQRFEKV 119
>gnl|CDD|187736 cd02255, Peptidase_C12, Cysteine peptidase C12 contains ubiquitin
carboxyl-terminal hydrolase (UCH) families L1, L3, L5
and BAP1. The ubiquitin C-terminal hydrolase (UCH;
ubiquitinyl hydrolase; ubiquitin thiolesterase) family
of deubiquitinating enzymes (DUBs) consists of four
members to date: UCH-L1, UCH-L3, UCH-L5 (UCH37) and
BRCA1-associated protein-1 (BAP1), all containing a
conserved catalytic domain with cysteine peptidase
activity. UCH-L1 hydrolyzes carboxyl terminal esters
and amides of ubiquitin (Ub). Dysfunction of this
hydrolase activity can lead to an accumulation of
alpha-synuclein, which is linked to Parkinson's disease
(PD) and neurofibrillary tangles, linked to Alzheimer's
disease (AD). UCH-L1, in its dimeric form, has
additional enzymatic activity as a ubiquitin ligase.
UCH-L3 hydrolyzes isopeptide bonds at the C-terminal
glycine of either Ub or Nedd8, a ubiquitin-like protein.
UCH-L3 can also interact with Lys48-linked Ub dimers to
protect it from degradation while inhibiting its
hydrolase activity at the same time. UCH-L1 and UCH-L3
are the most closely related of the UCH members. UCH-L5
(UCH37) is involved in the deubiquitinating activity in
the 19S proteasome regulatory complex. It is also
associated with the human Ino80 chromatin-remodeling
complex (hINO80) in the nucleus. BAP1 binds to the
wild-type BRCA1 RING finger domain, localized in the
nucleus. It consists of the N-terminal UCH domain and
two predicted nuclear localization signals (NLSs), only
one of which is functional. The full-length human BRCA1
is a ubiquitin ligase. However, BAP1 does not appear to
function in the deubiquitination of autoubiquitinated
BRCA1. There is growing evidence that UCH enzymes and
human malignancies are closely correlated. Studies show
that UCH enzymes play a crucial role in some signaling
pathways and in cell-cycle regulation.
Length = 222
Score = 25.9 bits (56), Expect = 9.4
Identities = 7/57 (12%), Positives = 18/57 (31%), Gaps = 2/57 (3%)
Query: 100 LNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTADARMWKIKYETNKPKD 156
+ + L + + + +EN D +R + +A + + K
Sbjct: 107 HFESGSTLKKFLEESVSMSPEERARYLENYDAIRVTHETSAHEGQT--EAPSIDEKV 161
>gnl|CDD|219526 pfam07701, HNOBA, Heme NO binding associated. The HNOBA domain is
found associated with the HNOB domain and pfam00211 in
soluble cyclases and signalling proteins. The HNOB
domain is predicted to function as a heme-dependent
sensor for gaseous ligands, and transduce diverse
downstream signals, in both bacteria and animals.
Length = 209
Score = 25.7 bits (57), Expect = 9.9
Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 10/46 (21%)
Query: 28 DLARELEELGERLEEAGGATSAQIELNKKREAEMSKLR---RDLEE 70
D R+L LGE+ AQ++L K+ E KL R+LEE
Sbjct: 150 DATRDLILLGEQ-------ARAQLKLKKRLEKLKDKLEEAHRELEE 188
>gnl|CDD|177432 PHA02607, wac, fibritin; Provisional.
Length = 454
Score = 26.1 bits (58), Expect = 9.9
Identities = 12/48 (25%), Positives = 25/48 (52%), Gaps = 3/48 (6%)
Query: 95 EQIDQ-LNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTAD 141
EQ+D+ K K K++E +N + +I + ++ Q+++ AD
Sbjct: 55 EQLDENTKKTKDKINEVVDDVNTIQENLD--VIGDISVIDQINQNVAD 100
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.307 0.124 0.315
Gapped
Lambda K H
0.267 0.0709 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,826,972
Number of extensions: 743898
Number of successful extensions: 3103
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2480
Number of HSP's successfully gapped: 788
Length of query: 156
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 67
Effective length of database: 6,990,096
Effective search space: 468336432
Effective search space used: 468336432
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 55 (25.0 bits)